Document-Retrieval

Document rerieval from corpus using BM25 algorithm.

Corpus is present in project folder "/Dataset_Algorithms"
First, We create the inverted index for the given corpus and save it in index.out file
tokenization, stop words removal and stemming are done for inverted index.
Used PorterStemmer for stemming
This index.out will have all the root words in it and serve as input for BM25 algorithm.

This project is created as a Spring project and should be run on application server.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
RishiReddy_Bokka_Document_Retrieval		RishiReddy_Bokka_Document_Retrieval
.gitattributes		.gitattributes
.gitignore		.gitignore
README.m.txt		README.m.txt
README.md		README.md

Provide feedback