CSE 5334: Data Mining project - Toy Search Engine
Developed a toy search engine in Python that processes a text corpus, builds a TF-IDF vector for each document, and determines the highest cosine similarity score between a given query and the documents. Used a Jupyter notebook to present the instructions clearly and met the project's Python 3.5.1+ requirement. Installed and used NLTK modules for natural language processing, following the guidelines on the NLTK website and in the NLTK book.
Skills: Data Mining, Python, Jupyter Notebook, NLTK (Natural Language Toolkit)
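
The TF-IDF and cosine similarity pipeline described above can be illustrated with a minimal sketch. This is not the project's actual code: the function names (`tokenize`, `build_index`, `best_match`), the sample corpus, and the weighting scheme (log-scaled term frequency, log10 inverse document frequency, unit-length vectors) are all assumptions made for illustration, and a plain regular expression stands in for the NLTK tokenizer so the example runs with the standard library alone.

```python
import math
import re
from collections import Counter


def tokenize(text):
    # Assumption: lowercase alphabetic tokens; the project used NLTK instead.
    return re.findall(r"[a-z]+", text.lower())


def weight(tokens, idf):
    # Assumed scheme: (1 + log10(tf)) * idf; terms unseen in the corpus are dropped.
    tf = Counter(tokens)
    return {t: (1 + math.log10(c)) * idf[t] for t, c in tf.items() if t in idf}


def normalize(vec):
    # Scale to unit length so a dot product equals cosine similarity.
    norm = math.sqrt(sum(w * w for w in vec.values()))
    return {t: w / norm for t, w in vec.items()} if norm else {}


def build_index(corpus):
    # corpus: hypothetical dict mapping a document name to its text.
    tokens = {name: tokenize(text) for name, text in corpus.items()}
    df = Counter()
    for toks in tokens.values():
        df.update(set(toks))  # count each term once per document
    idf = {t: math.log10(len(tokens) / n) for t, n in df.items()}
    vectors = {name: normalize(weight(toks, idf)) for name, toks in tokens.items()}
    return idf, vectors


def best_match(query, idf, vectors):
    # Return (document name, cosine score) for the highest-scoring document.
    q = normalize(weight(tokenize(query), idf))
    scores = {
        name: sum(w * vec.get(t, 0.0) for t, w in q.items())
        for name, vec in vectors.items()
    }
    return max(scores.items(), key=lambda item: item[1])


# Hypothetical three-document corpus, for illustration only.
corpus = {
    "a.txt": "the cat sat on the mat",
    "b.txt": "the dog chased the ball",
    "c.txt": "stock markets fell sharply today",
}
idf, vectors = build_index(corpus)
name, score = best_match("cat on a mat", idf, vectors)
print(name, round(score, 3))
```

Because every vector is normalized to unit length, the dot product in `best_match` is the cosine similarity, so scores fall between 0 and 1 and documents sharing no terms with the query score 0.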