The DRL Alpha Bot process is live on port 5050, with real Binance WebSocket data feeding BTC and ETH 5-minute candles.
2
Resolve Circuit Breakers
Check the Risk tab for any active halts. If the daily loss limit was triggered during paper trading, you can safely reset it so the agent keeps learning.
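A daily-loss circuit breaker of this kind can be sketched as follows. This is a minimal illustration, not the bot's actual implementation: the class name `DailyLossBreaker`, its fields, and the 5% limit are all hypothetical, and the reset simply re-bases the day at the current paper equity.

```python
from dataclasses import dataclass


@dataclass
class DailyLossBreaker:
    """Hypothetical daily-loss circuit breaker (illustrative only)."""
    start_equity: float         # equity at the start of the trading day
    max_daily_loss_pct: float   # assumed limit, e.g. 0.05 for 5%
    halted: bool = False

    def update(self, equity: float) -> bool:
        """Record current equity; latch the halt once the limit is reached."""
        loss_pct = (self.start_equity - equity) / self.start_equity
        if loss_pct >= self.max_daily_loss_pct:
            self.halted = True
        return self.halted

    def reset(self, equity: float) -> None:
        """Clear the halt and re-base the day at the current equity."""
        self.start_equity = equity
        self.halted = False


breaker = DailyLossBreaker(start_equity=10_000.0, max_daily_loss_pct=0.05)
breaker.update(9_800.0)    # 2% loss: trading continues
breaker.update(9_400.0)    # 6% loss: breaker trips
tripped = breaker.halted
breaker.reset(9_400.0)     # paper-trading reset
```

The halt is latched on purpose: once tripped, it stays active even if equity recovers, until someone resets it explicitly.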
3
Train the PPO Agent
The DRL model has no trained weights yet. Run the CSCV (combinatorially symmetric cross-validation) training pipeline: it uses 90 days of real Binance 5-minute candles, runs 50 hyperparameter trials, and applies the Bailey et al. PBO overfitting filter. Training takes about 30 minutes.
4
Validate Model Quality (PBO < 10%)
After training, the PBO (Probability of Backtest Overfitting) score must be below 10%. Only then will the bot switch from heuristic fallback to the trained PPO agent. Check the Agent tab for results.
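To make the PBO gate concrete, here is a rough sketch of how a PBO estimate can be computed with CSCV in the spirit of Bailey et al. It is not the bot's training pipeline: the function names, the choice of Sharpe ratio as the performance metric, and the default of 8 blocks are assumptions for illustration. The idea is to split the return history into blocks, try every half/half in-sample vs. out-of-sample split, and count how often the in-sample winner lands at or below the out-of-sample median.

```python
from itertools import combinations
from math import log
from statistics import mean, stdev

PBO_THRESHOLD = 0.10  # the 10% gate described above


def sharpe(returns):
    """Unannualized Sharpe ratio; a zero-variance series scores 0."""
    sd = stdev(returns)
    return mean(returns) / sd if sd > 0 else 0.0


def pbo_cscv(trials, n_blocks=8):
    """Estimate PBO from N per-trial return series of equal length T."""
    size = len(trials[0]) // n_blocks   # any trailing remainder is dropped
    blocks = [range(b * size, (b + 1) * size) for b in range(n_blocks)]
    logits = []
    for is_blocks in combinations(range(n_blocks), n_blocks // 2):
        is_rows = [t for b in is_blocks for t in blocks[b]]
        oos_rows = [t for b in range(n_blocks) if b not in is_blocks
                    for t in blocks[b]]
        is_perf = [sharpe([r[t] for t in is_rows]) for r in trials]
        oos_perf = [sharpe([r[t] for t in oos_rows]) for r in trials]
        best = max(range(len(trials)), key=is_perf.__getitem__)
        # Out-of-sample rank (1..N) of the in-sample winner
        rank = sum(p <= oos_perf[best] for p in oos_perf)
        omega = rank / (len(trials) + 1)        # relative rank in (0, 1)
        logits.append(log(omega / (1 - omega)))
    # PBO: share of splits where the winner is at or below the OOS median
    return sum(value <= 0 for value in logits) / len(logits)


def select_policy(pbo):
    """Use the trained agent only when the overfitting estimate is low."""
    return "ppo" if pbo < PBO_THRESHOLD else "heuristic"
```

With 8 blocks there are 70 splits, so the estimate moves in steps of roughly 1.4 percentage points; more blocks give a finer estimate at higher compute cost.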
5
Connect to Real Polymarket Markets
The bot is currently using synthetic markets derived from Binance momentum. This is normal when no "BTC Up/Down 5-min" contracts are live on Polymarket. The Markets tab shows when real contracts appear. Strategy signals are identical in both modes.
6
Analyze Performance & Iterate
Monitor the Overview tab for equity curve, win rate, and Sharpe ratio. The 12-dim feature vector (Volume, RSI, DX, UltOsc, OBV, HT Phase, Price Divergence, Time Left, Spread, 5m Return, 1m Momentum, Vol Spike) drives the PPO policy. Retrain when win rate drops below 48% or drawdown exceeds 15%.
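The retrain rule above can be expressed as a small check. This is a sketch under assumptions: the function names are hypothetical, "drawdown" is read as maximum peak-to-trough drawdown over the monitored window, and win rate is taken as the fraction of closed trades with positive PnL. The dashboard may define either metric differently.

```python
def max_drawdown(equity):
    """Largest peak-to-trough decline, as a fraction of the running peak."""
    peak = equity[0]
    worst = 0.0
    for value in equity:
        peak = max(peak, value)
        worst = max(worst, (peak - value) / peak)
    return worst


def win_rate(trade_pnls):
    """Fraction of closed trades with positive PnL."""
    return sum(pnl > 0 for pnl in trade_pnls) / len(trade_pnls)


def should_retrain(trade_pnls, equity, min_win_rate=0.48, max_dd=0.15):
    """True when win rate falls below 48% or drawdown exceeds 15%."""
    return win_rate(trade_pnls) < min_win_rate or max_drawdown(equity) > max_dd
```

For example, an equity curve of 100, 120, 96, 110 has a 20% drawdown (120 down to 96), which trips the rule even when most trades are winners.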
How the 5-Min Prediction Model Works
📡
Binance Latency Edge
Real-time WebSocket stream: price data arrives 2-3 seconds before Polymarket updates.