I've been using Claude as a research partner in the strategy-build loop for most of 2026. It's now one of the most leveraged pieces of my workflow. Worth writing up what it's actually good for, where it falls apart, and the checks I run before I trust any of its output.
What Claude is good for.
Red-teaming risk rules. Hand it your portfolio-level correlation matrix, your vol contributions per strategy, and your sizing logic, and ask it to argue against your current weights. The output is sharper than most discretionary risk managers. Not because Claude is smarter, but because it has the patience to walk every pair of strategies and ask 'are these actually independent or are they pulling the same trade through a different feature?' That's the question retail almost never asks itself.
Surfacing factor exposures you missed. If you have 28 strategies and you think they're uncorrelated, Claude will pull out the hidden vol exposure, the hidden carry exposure, the hidden trend-following exposure dressed up as something else. It does this through specification, not data, which means it gets it directionally right and quantitatively wrong.
Drafting feature ideas. Give it the instrument, the regime you're trying to capture, and the existing features in the model, and it'll suggest variants you'd take a week to write down on your own. Some of the suggestions are garbage. The ratio of useful to garbage is better than scrolling QuantStart for three hours.
Walking through correlation logic when you're tired. Trading-research nights at 11pm are when reasoning breaks. Claude doesn't get tired.
Where it breaks.
Over-confident pattern-matching on stats. Ask it about the historical Sharpe of a CTA strategy in 2022 and it will quote a number with three decimal places. That number is hallucinated unless the model has access to the data.
Made-up instrument-specific detail. The broker financing cost it quotes for gold CFDs is generic. Your broker, your tier, your account is specific. Don't conflate them.
Plausible-sounding strategy ideas with no edge. Most candidate strategies Claude suggests get cut in the first robustness pass. The signal-to-noise is still high enough to use it.
The three checks before you trust the output.
Stat-source verification. Every number Claude quotes needs a source. If you can't trace it, treat it as zero information.
Instrument-specificity check. Generic FX/gold/index claims need checking against your specific broker setup.
Falsifiability check. Any strategy idea needs a falsifiable hook. If you can't write the hook from Claude's description, the idea isn't a strategy. It's a vibe.
Used this way, Claude is a research partner. Used without these checks, it's a confident-sounding error generator. The difference is you and your "operator discipline".