Use it as the first step of the testing loop: Obstacle Course (minutes) → a short backtest → a full backtest. If the course says FAIL, fix the prompt before you spend a backtest on it.
Running it
- In the app, open Backtesting and switch to the Obstacle Course tab.
- The strategy prompt and notes from your current agent are prefilled — edit them freely, or paste any prompt. Only the prompt is tested.
- Pick a decision model. The cost updates with the model — that’s the only thing that changes the price.
- Click Run Obstacle Course. Results are ready in ~3–10 minutes.
Reading the report
Verdict — the gate
| Verdict | Meaning |
|---|---|
| PASS | Clears every critical situation, consistently. |
| WEAK | Too many sub-optimal decisions, or inconsistent behavior in a critical situation. |
| FAIL | Took a forbidden action in a critical situation — e.g. failed to protect a large winner at a trend top. |
Cell matrix
A heatmap of every station × position-context (for exampletop / winner). Each cell shows pass/miss/fail counts, a consistency bar, and the full mix of actions your prompt chose — “WAIT 2 of 6 at the top” is exactly the kind of intel you’re looking for. Click any cell to drill in.
Decision drill
Click any graded decision to read the full reasoning, the raw model response, and the exact frozen market state the model was shown — no re-running required.Cost
Each run is charged a flat, model-aware credit fee, shown before you launch:- A cheaper model is a cheaper run.
- The test battery itself is fixed, so nothing other than the model changes the price.
- The fee is charged at launch and refunded automatically if the engine can’t start the run.
- You can have one run in progress at a time.
From chat / MCP
You can also run the course from Agent Chat or the MCP connection with theshekel_run_obstacle_course tool, and read results with shekel_obstacle_course_report — same verdict, cell matrix, and per-decision drill.