Skip to content
Cumulant

Working paper

Robustness in Backtest Inference

How much of a historical investment edge survives multiple testing, costs, and out-of-sample evaluation.

Status

Working paper

Version

Working paper

Date

2026-05

Authors

Aryan Patel

Abstract

A method-focused study of how inference choices, model selection, and repeated testing inflate the apparent performance of historical trading strategies, and what survives once those effects are controlled for. The work develops a structure-preserving randomization test that holds the trade structure and price path fixed and re-randomizes only entry placement, separating profitability from genuine entry-timing skill. A companion working paper presents the method and results in full.

Research question

How much of a historical investment edge survives multiple testing, costs, and out-of-sample evaluation?

Methods

  • Structure-preserving randomization
  • Multiple-testing control
  • Out-of-sample evaluation
  • Sensitivity to declared measures
  • Cross-asset robustness

Data

Gold futures and a cross-asset panel of public daily market data; provenance recorded.

Results

Working paper. Across the examined panel, profitability is common but no rule shows robust, measure-invariant entry-timing skill after multiplicity control. The work is not published.

Limitations

Findings are primarily single-asset; the cross-asset panel is exploratory, and conclusions depend on the declared re-randomization measures.

Code availability

Internal. Release forthcoming.

Data availability

Public market data.

AI disclosure

AI agents assisted with literature retrieval, code generation, analysis support, critique, and drafting. Deterministic systems produced and verified the estimates. A human researcher approved the question, design, interpretation, and release.

Reproduction status

Internal validation.