Could we add a requirements.txt file? Right now it is unclear what is needed to run the script. The same applies to the data prep repo.