A Python program that uses Scrapy to scrape baseball statistics from https://baseballsavant.mlb.com to use to calculate players that should start improving or getting worse.
- Install Docker at https://www.docker.com/get-started
- Run
source update.sh
in terminal to set up Splash and install dependencies - Navigate into Scrapy-Ians-Player-Predictions directory and run
scrapy crawl PlayerSpider -o data.json
to run spider - Check Scrapy-Ians-Player-Predictions directory for data.json, which will hold the output of PlayerSpider.
- An up to date output of this spider is kept in this AWS S3 bucket.
- The data scraped using this spider is used for the Ians-Player-Predictions repository.