Datasets & Courses

The same data and models that power our API. Train your own models, backtest strategies, or learn to build a prediction market bot from scratch.

Course

Polymarket Bot Course

6 Jupyter notebooks: scraping ESPN, Elo ratings, WP models, backtesting, live bot, deployment. Build a complete prediction system from scratch.

6 Jupyter notebooks ESPN scraping to live deployment Working end-to-end code
Module 1: Scraping ESPN
Module 2: Elo Ratings
Module 3: WP Models
Module 4: Backtesting
Module 5: Live Bot
Module 6: Deployment
$49
one-time, instant download
Buy Course

Exclusive Market Data

Not available anywhere else Preview samples →

Tick-level orderbook data from Polymarket and Kalshi. These platforms do not publish historical market data — we recorded it ourselves.

Exclusive

Kalshi MLB 1-Minute Candles (Full Season)

2,203 games, 737K+ candle bars from the 2025 MLB season. OHLCV + bid/ask at 1-minute resolution with game outcomes. Kalshi does not publish historical data.

2,203 games, Apr-Oct 2025 67 MB
$149 one-time
Premium

Polymarket Orderbook Archive

Tick-level bid/ask/spread/volume snapshots from Polymarket sports markets. Score-synced with live game state. 30+ days of continuous recording. Polymarket has no historical orderbook API.

Millions of ticks, Dec 2025 - Jan 2026 ~2 GB (Parquet)
$499 one-time
Research

Prediction Market Microstructure Pack

Research-grade analysis of Kalshi market dynamics: spread compression events, quote freezes, recovery curves, leader-lag clusters, reversion patterns. Includes charts and methodology.

20M+ row feed + analytics + 27 charts 93 MB
$149 one-time

Training Datasets

Best Seller

ESPN WP Training Data (7 Sports)

25.6M rows of in-game state snapshots across NBA, NCAAMB, NCAAWB, CFB, NFL, NHL, MLB. Parquet format. 2020-2026.

60,702 games, 33 parquet files 107 MB
$99 one-time

NCAAB Analytics Mega-Pack

2015-2024: leak-free features (140 cols), raw box scores, player stats, lineups, recruiting, KenPom-style ratings, SRS, shooting splits.

31K+ games, 688 MB unzipped 160 MB
$149 one-time

NHL Odds + MoneyPuck Bundle

Historical odds data (2020-2025) + MoneyPuck advanced analytics. Player-level and goalie stats with data dictionary.

5 seasons of odds + advanced stats 215 MB
$99 one-time

MLB Complete Analytics

Retrosheet play-by-play (2020-2024), game info, player stats, and historical odds. Ready for pitcher modeling and game simulation.

5 seasons, EVA/EVN + CSV 41 MB
$79 one-time

Tennis ATP/WTA Match Data + Odds

11 years of ATP and WTA match results (2015-2025) with player stats, surface, tournament tier. Plus betting odds (2020-2025). Ready for Elo and serve-return modeling.

22 CSV files + 6 odds XLSX 13 MB
$49 one-time
Best Value

Complete Mega Bundle

Everything: WP training data, NCAAB mega-pack, NHL odds, MLB analytics, Kalshi candles, tennis data, microstructure pack, AND the bot course.

All 11 products Save $574+ vs buying separately
$1,173 separately
$599
Get Everything

What you can build with this data

1.

Win probability models — 25M+ labeled game-state snapshots with ESPN WP as baseline. Train LR, XGBoost, or neural nets.

2.

Prediction market bots — the course walks you through building a complete Polymarket bot with edge detection and live execution.

3.

Market microstructure research — tick-level orderbook data for studying price discovery, spread dynamics, and liquidity patterns in prediction markets.

4.

Custom backtests — test entry criteria against real game outcomes and real market prices. Score, period, Elo, bid/ask — all included.

5.

Elo rating systems — cleaned game results for 258+ teams across 10 sports. Build your own Elo, Glicko, or TrueSkill.

6.

Academic papers — prediction market efficiency, price reaction to scoring events, sports betting market analysis. Cite-ready with provenance metadata.

Datasets delivered as ZIP files containing Apache Parquet and CSV. Compatible with pandas, polars, DuckDB, and Spark.

Instant download after purchase. Payments processed securely via Stripe.

Questions? admin@zenhodl.net

Want real-time signals instead? See API plans →