Quant Data Explorer is an open-source project for collecting, cleaning, and documenting financial datasets. It is designed for beginners who want to learn quantitative finance and basic data analysis.
- Collect free financial datasets from public sources.
- Clean and standardize data for analysis.
- Create beginner-friendly Jupyter notebooks for visualization and simple calculations.
- Contribute your own datasets, cleaning scripts, or notebooks.
- Collect and organize initial stock market datasets.
- Add Jupyter notebook templates for basic analysis.
- Improve dataset documentation.
- Enable community contributions for new datasets and notebooks.
- Fork the repository and clone it locally.
- Follow the instructions in
CONTRIBUTING.md. - Start by adding datasets, cleaning scripts, or beginner notebooks.
Build a community-driven hub of open financial data to help beginners learn and explore quantitative finance.
This project is open to contributors of all levels. Below is a list of 25 tasks you can help with.
- Check if a task is already taken by looking at the Issues.
- Claim a task by commenting on the Issue or assigning it to yourself.
- Change the issue label to "In Progress" when you start.
- Submit a Pull Request linked to the Issue when done.
- Tasks inactive for 10 days may be reopened.
- Add new financial datasets
- Clean existing datasets
- Standardize data formats
- Add metadata for datasets
- Create automated scripts to download new datasets
- Maintain a changelog for dataset updates
- Write data validation scripts
- Build dataset quality checks
- Implement basic error handling for missing values
- Benchmark data loading performance
- Write helper functions for loading datasets
- Add CSV/JSON export functions
- Create a simple search tool for datasets
- Improve folder structure for data organization
- Add example Jupyter notebooks for data analysis
- Create summary statistics scripts
- Build visualizations of key datasets
- Add data visualization templates
- Document datasets in README
- Write README examples showing data usage
- Write beginner-friendly tutorials for dataset exploration
- Document workflow for contributing new data
- Add references to data sources
- Add contribution guidelines for new datasets
- Add unit tests for helper functions