This project was intended to be set up to continuously track PGA Tour stats week to week.
The data pipeline was built with a particular order in mind. The goal was to first provide some tournament information, then gather the strokes gained data for that week and enter into database.
- First manually populate the tournamentstable with the information about where a particular event is played.
- Next, the scrape.pyscript needs to be run to collect the data from the internet- There are few constants that need to be updated to successfully scrape webpage
 
- Then, load.pyneeds to be run to transform and enter rest of data into database.
