Recommendation System

This folder contains the Jupyter notebooks for a recommendation system. The project has no frontend because of time constraints, so the user ID and video ID must be entered manually in the notebooks.

Datasets

Five datasets are used in this project.

interactions_100k_with_relationships.csv
- Interaction ID, User ID, PostID, Interaction Type, Interaction Timestamp, Watch Duration
posts_100k_with_relationships.csv
- Post ID, Tags/Keywords, Category (Reel or Image), UploaderID, Upload Time, Timestamp, Views, Likes, Dislikes, Comments
users_100k_with_relationships.csv
- UserID, Username, Full Name, Gender, Country, Language Preference, Interests
Tiktok data.csv (obtained from Kaggle)
UserData.csv (obtained from Kaggle)

Note: The first three datasets are custom generated.

Notebooks

1. AllParamsRec.ipynb

This notebook implements a recommendation system that blends posts based on:

20% liked by friends: Recommendations based on friends' interactions.
30% new: Exploration of potentially interesting unseen content.
50% past interactions: Leverages the user's past preferences.
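
The 20/30/50 blend can be sketched roughly as follows. This is a minimal illustration and not the notebook's actual code: the function name `blend_recommendations`, its parameters, and the assumption that each source already arrives as a ranked list of post IDs are all hypothetical.

```python
import random

def blend_recommendations(friend_liked, unseen, past_based, n=10, seed=None):
    """Blend three ranked candidate lists into n posts using a 20/30/50 split.

    All three inputs are assumed to be lists of post IDs, best first.
    """
    rng = random.Random(seed)
    # Slot counts per source; past interactions absorb any rounding remainder.
    n_friends = round(n * 0.20)
    n_new = round(n * 0.30)
    n_past = n - n_friends - n_new

    picked, seen = [], set()
    sources = ((friend_liked, n_friends), (unseen, n_new), (past_based, n_past))
    for candidates, quota in sources:
        taken = 0
        for post_id in candidates:
            if taken == quota:
                break
            # Skip posts already chosen from an earlier source.
            if post_id not in seen:
                seen.add(post_id)
                picked.append(post_id)
                taken += 1
    # Shuffle so the three sources are interleaved in the final feed.
    rng.shuffle(picked)
    return picked
```

If a source has fewer candidates than its quota, this sketch simply returns a shorter feed; a real implementation might backfill from the other sources instead.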

2. Top_Users.ipynb

This notebook identifies the 10 users most similar to a target user using cosine similarity. It compares user attributes such as gender, country, and interests. It can be modified to use other attributes for each user, such as bio, age, or language.
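
As a rough illustration of the idea, the sketch below one-hot encodes a few categorical attributes and ranks users by cosine similarity. The dictionary layout, the feature choice (gender, country, interests), and the function names are assumptions made for this example, not the notebook's implementation.

```python
import math

def tokens(user):
    """Turn a user's categorical attributes into a set of feature tokens."""
    toks = {f"gender={user['gender']}", f"country={user['country']}"}
    toks.update(f"interest={i}" for i in user["interests"])
    return toks

def cosine(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0

def top_similar_users(users, target_id, k=10):
    """Return the k (user_id, score) pairs most similar to target_id."""
    user_tokens = {uid: tokens(u) for uid, u in users.items()}
    # Shared vocabulary so every user maps to a vector of the same length.
    vocabulary = sorted(set().union(*user_tokens.values()))
    vectors = {
        uid: [1.0 if t in toks else 0.0 for t in vocabulary]
        for uid, toks in user_tokens.items()
    }
    target = vectors[target_id]
    scores = [(uid, cosine(target, vec))
              for uid, vec in vectors.items() if uid != target_id]
    scores.sort(key=lambda pair: pair[1], reverse=True)
    return scores[:k]
```

Calling `top_similar_users(users, target_id, k=10)` with `users` mapping each user ID to a dict holding `gender`, `country`, and `interests` returns the ranked neighbours with their similarity scores.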

3. VideoRecommendation.ipynb

This notebook recommends video content based on transcript similarity. It employs the following techniques:

CountVectorizer: Creates a numerical representation of the video transcripts, focusing on word frequencies.
Stemming: Reduces words to their base forms (e.g., "running" and "runs" become "run").
Cosine Similarity: Measures the similarity between video transcripts based on their word representations.
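
The three techniques above can be combined along these lines. This is a sketch under stated assumptions: it uses NLTK's `PorterStemmer` (the notebook may use a different stemmer) and scikit-learn's `CountVectorizer` and `cosine_similarity`; the `recommend` function and the transcript dictionary are hypothetical.

```python
import re

from nltk.stem.porter import PorterStemmer
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.metrics.pairwise import cosine_similarity

stemmer = PorterStemmer()

def stem_text(text):
    """Lowercase the text, split it into alphanumeric words, and stem each word."""
    words = re.findall(r"[a-z0-9]+", text.lower())
    return " ".join(stemmer.stem(word) for word in words)

def recommend(transcripts, video_id, top_n=5):
    """Rank other videos by transcript similarity to video_id.

    transcripts is assumed to map video IDs to transcript strings.
    """
    ids = list(transcripts)
    stemmed = [stem_text(transcripts[i]) for i in ids]
    # Word-count matrix: one row per video, one column per stemmed word.
    counts = CountVectorizer().fit_transform(stemmed)
    # Pairwise cosine similarity between all transcript rows.
    similarity = cosine_similarity(counts)
    row = similarity[ids.index(video_id)]
    ranked = sorted(zip(ids, row), key=lambda pair: pair[1], reverse=True)
    return [(vid, float(score)) for vid, score in ranked if vid != video_id][:top_n]
```

Stemming before vectorizing means "running" and "runs" land in the same column, so transcripts that use different forms of a word still count as overlapping.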

Running the Notebooks

Clone this repository to your local machine.
Install the required Python libraries using pip install <library_name>.
Open the notebooks in a Jupyter Notebook environment.
Download and load the datasets in your environment.
Update the code to point to your specific data paths.
Run the code cells to execute the recommendation algorithms.
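
For the data-path step, one option is to keep a single data directory and resolve every dataset from it before running any cells. The helper below is a hypothetical sketch (the name `resolve_datasets` and the single-directory layout are assumptions); the file names are the ones listed in this repository.

```python
from pathlib import Path

# Dataset file names as they appear in the repository.
DATASET_FILES = [
    "interactions_100k_with_relationships.csv",
    "posts_100k_with_relationships.csv",
    "users_100k_with_relationships.csv",
    "Tiktok data.csv",
    "UserData.csv",
]

def resolve_datasets(data_dir):
    """Map each dataset name to its path under data_dir, failing early if any is missing."""
    data_dir = Path(data_dir)
    paths = {name: data_dir / name for name in DATASET_FILES}
    missing = [name for name, path in paths.items() if not path.is_file()]
    if missing:
        raise FileNotFoundError(f"Missing datasets in {data_dir}: {missing}")
    return paths
```

Each resolved path can then be passed to whatever loader the notebook uses, for example `pandas.read_csv(paths["UserData.csv"])`.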

Disclaimer

This code is for educational purposes only and may require modifications for specific use cases. The datasets used are provided in the repository.
