strategies,
from passive to active
Strategy Results
The main overview of your backtest. Shows how much money you made (or lost), your portfolio composition, capital evolution, and key performance metrics for your selected time window.
The overall profitability and behavior of your strategy on the time window you selected. This is your first-look summary before diving into robustness, benchmarking, or efficiency.
A positive P&L is necessary but not sufficient. Check the Sharpe ratio — are you being compensated for the risk you're taking? Look at the max drawdown — could you stomach that loss in practice? If the capital evolution line is highly volatile, even a profitable strategy may be uncomfortable to hold. Use the representativeness gauge to check whether your window is typical or an outlier before drawing conclusions.
Robustness
A strategy that worked once may have been lucky. This section tests it across 50 different time windows and produces a single reliability score.
Whether your strategy's performance is consistent and repeatable, or just the result of favorable market timing. The robustness score (0–100) is computed from 4 pillars using a weighted geometric mean — meaning if any single pillar scores zero, the overall score drops to zero. All pillars are evaluated across all 50 rolling windows.
A score above 50 means the strategy works across most market conditions. Below 30 means it likely depends on specific timing.
Benchmarking
Every backtest is compared head-to-head against a benchmark (default: MSCI World via URTH ETF). Not just once — across all 50 rolling windows.
Whether your strategy genuinely outperforms the market, or just rides the same wave. The benchmark edge score (0–100) is computed from 3 pillars: Win Rate (15%), Information Ratio (55%), and Sharpe Advantage (30%). A strategy with positive returns but negative alpha is essentially an expensive index fund.
A score above 50 means your strategy delivers comparable or better results than the benchmark in most scenarios. Below 30, you'd likely be better off buying the index.
Efficiency
Modern Portfolio Theory meets rolling-window robustness. Measures both how well your weights are allocated and how effectively your assets diversify risk.
Whether your portfolio allocation is efficient — meaning you're getting the maximum possible return for your level of risk. The efficiency score (0–100) is computed from 2 pillars: Allocation Efficiency (70%) and Diversification (30%). Requires at least 2 assets; for single-asset backtests, a comparison view is shown instead.
For single-asset backtests, the efficiency score is not available. Instead, the frontier chart shows your asset plotted against individual alternatives for comparison.
Trades
The full transaction history: every buy, every sell, every fee. See exactly what the strategy did and when.
The granular execution of your strategy. Useful for understanding trade frequency, average position size, and verifying the strategy behaves as expected.
Check trade frequency — too many trades means high fees. Look for clustering (many trades at once) which may indicate the strategy reacts too aggressively. The CSV export lets you reconcile with your own analysis tools.
Metrics Glossary
Every metric computed in your backtest, with formula and interpretation ranges. All values are medians across the 50 rolling windows.
Total Return
The overall percentage gain or loss on your investment, including all cash flows.
IRR Internal Rate of Return / XIRR
Annualized return that accounts for the timing and size of each cash flow. The true measure of performance for strategies with multiple investments (DCA, Value Averaging, etc.).
CAGR Compound Annual Growth Rate
The smoothed annualized return assuming profits are reinvested. Used for single-flow strategies (Buy & Hold). For multi-flow strategies, IRR/XIRR is used instead.
Sharpe Ratio
Return per unit of total risk (volatility). Measures how well the strategy compensates you for the uncertainty you endure. Computed using weekly returns for stability.
Sortino Ratio
Like Sharpe, but only penalizes downside volatility. Ignores upside moves, which investors actually want. Computed using weekly returns.
Calmar Ratio
Annualized return divided by worst drawdown. Measures return relative to the worst pain you'd experience.
Volatility Annualized
Standard deviation of weekly returns, annualized. Measures how much the portfolio value fluctuates. Falls back to daily returns when fewer than 14 days of data.
Max Drawdown
The largest peak-to-trough decline in portfolio value. Answers: “What's the worst it can get?”
Max Drawdown Duration
The longest period (in days) from a peak to full recovery. Answers: “How long could I be underwater?”
Value at Risk 5% VaR
The 5th percentile of returns across all rolling windows. In 95% of scenarios, your return will be better than this number.
Win Rate
Percentage of rolling windows where the strategy was profitable. A high win rate means the strategy works across most market conditions.
Excess Return
The median difference in annualized returns between your strategy and the benchmark across all rolling windows. Positive = outperformance.
Information Ratio
Measures the consistency of outperformance. Higher means the excess return is reliable, not driven by a few lucky windows. Contributes 55% of the Benchmark Edge Score.
Beta
Sensitivity to benchmark movements. A beta of 1.2 means the strategy moves 20% more than the market.
Limitations
No backtest is a crystal ball. Here's what our simulations do not account for.