Skip to content

PriyaSKulkarni/TF-IDF-based-Search-Engine

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 
 
 

Repository files navigation

TF-IDF-based-Search-Engine

CSE 5334: Data Mining project - Toy Search Engine

Developed a Python-based toy "search engine" for processing a corpus, generating TF-IDF vectors for documents, and determining the highest cosine similarity score for a given query. Employed Jupyter notebook for clear instructions and adhered to Python 3.5.1+ requirements. Installed and utilized NLTK modules for advanced natural language processing, following guidelines from the NLTK website and referencing their informative book.

Skills: Data Mining, Python, Jupyter Notebook, NLTK- natural language processing toolkit

About

CSE 5334: Data Mining

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages