What I Built
For WBC 2026 (World Baseball Classic), I created three scouting tools using MLB Statcast data:
- Streamlit Scouting Dashboards — 30 apps covering all 20 national teams (17 batter + 13 pitcher dashboards)
- Kaggle Dataset — Statcast data for 20 countries, 540,000+ pitches
- Kaggle EDA Notebook — Country-by-country analysis
1. Scouting Dashboards
GitHub: https://github.com/yasumorishima/wbc-scouting
The dashboards visualize Statcast data for MLB-affiliated WBC 2026 roster players.
- Batter dashboards: 17 countries, 105 players
- Pitcher dashboards: 13 countries, 86 players
Each national team has its own independent Streamlit app (30 apps total).
Batter Dashboard Features
Spray charts, exit velocity, launch angle, and count-based performance.
Strike zone split into a 3x3 grid with performance heatmaps for each zone.
Pitcher Dashboard Features
Pitch location distribution by pitch type, L/R splits, and count-based tendencies.
Pitch movement chart showing horizontal and vertical break by pitch type.
Dashboard URLs
| Country | Batters | Pitchers |
|---|---|---|
| USA | wbc-usa-batters | wbc-usa-pitchers |
| Japan | wbc-japan-batters | wbc-japan-pitchers |
| Dominican Republic | wbc-dr-batters | wbc-dr-pitchers |
| Mexico | wbc-mex-batters | wbc-mex-pitchers |
| Puerto Rico | wbc-pr-batters | wbc-pr-pitchers |
| Korea | wbc-kor-batters | wbc-kor-pitchers |
| Netherlands | wbc-ned-batters | wbc-ned-pitchers |
| Canada | wbc-can-batters | wbc-can-pitchers |
| Italy | wbc-ita-batters | wbc-ita-pitchers |
| Israel | wbc-isr-batters | wbc-isr-pitchers |
| Great Britain | wbc-gb-batters | wbc-gb-pitchers |
| Panama | wbc-pan-batters | wbc-pan-pitchers |
| Colombia | wbc-col-batters | wbc-col-pitchers |
| Cuba | wbc-cuba-batters | — |
| Chinese Taipei | wbc-twn-batters | — |
| Nicaragua | wbc-nic-batters | — |
| Australia | wbc-aus-batters | — |
Note: Streamlit apps go to sleep after inactivity. If you see "Zzzz" or "Your app is in the oven," just wait a moment or reload the page.
2. Kaggle Dataset
https://www.kaggle.com/datasets/yasunorim/wbc-2026-scouting
Pitch-by-pitch Statcast data for MLB-affiliated WBC 2026 roster players. Data sourced from Baseball Savant via pybaseball.
Files
| File | Contents |
|---|---|
statcast_batters.csv (36MB) |
324,099 pitches faced, 18 countries |
statcast_pitchers.csv (29MB) |
217,139 pitches thrown, 14 countries |
batter_summary.csv |
Per-player batting summary: 105 players, 19 countries |
pitcher_summary.csv |
Per-player pitching summary: 86 players, 14 countries |
rosters.csv |
Full WBC 2026 roster: 309 players, 20 countries |
stadiums.csv |
MLB stadium coordinates for spray chart rendering |
3. Kaggle EDA Notebook
https://www.kaggle.com/code/yasunorim/wbc-2026-scouting-eda-statcast-analysis
An exploratory analysis of the dataset, including country-by-country fastball velocity comparisons and batting profiles.
Data Notes
Roster data is based on official WBC 2026 announcements (Baseball America, February 2026). Some players may be missing or have since been added/removed from rosters. All data reflects MLB regular season performance only — it doesn't predict WBC performance directly.
Links
- Dashboard (GitHub): https://github.com/yasumorishima/wbc-scouting
- Dataset (Kaggle): https://www.kaggle.com/datasets/yasunorim/wbc-2026-scouting
- EDA Notebook (Kaggle): https://www.kaggle.com/code/yasunorim/wbc-2026-scouting-eda-statcast-analysis






Top comments (0)