Skip to content

tanghaotommy/search-engine

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

46 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

search-engine

This is a group porject with Jiawei Gu on writing a search engine from scratch (without popular libs like Lucene).

Demo website: Search Engine for ics.uci.edu

The html source files are provided after we did the crawling on ics.uci.edu.

The programming language is python and the databse is mongodb.

We implemented forward indexing, inverted indexing, tf-idf scoring, cosine similartity, pagerank algorithm and RankSVM to learn the weights of cosine similarity and pagerank. Also, we developed a simple web interface for searching based on the flask framework of python.

About

Search Engine for ics.uci.edu

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors