A document retrieval system with multiple ranking algorithms and evaluation capabilities.
playground.ipynb- Main pipeline and experimentation notebookrankers/- Various ranking algorithms (BM25, TF-IDF, embedding-based)indexes/- Indexing implementations for different approacheschroma_db/- Vector database storage (not in remote repository, created in playground.ipynb)evaluator.py- Evaluation metrics and testingdocument_loader.py- Document processing utilitiesresults_*.json- Evaluation results from different models
Run the main pipeline in playground.ipynb to experiment with different retrieval approaches and see the full evaluation.