GitHub - tzhenyu/SpotCheck: A proposed solution for e-commerce fraud detection for FutureHack! 2025. Made by team TARUMT NOT TARC

Fake Review Detection System for Shopee.

demo.mp4

SpotCheck is built to solve the problem of misleading reviews in online shopping. By combining a browser extension, LLM-powered backend, and a hybrid vector-relational database (Supabase), SpotCheck delivers real-time verdicts such as:

✅ Real
❌ Fake
⚠️ Irrelevant

How does it work?

User visits a Shopee product page.
Extension scrapes all reviews.
Backend analyzes each review using LLMs and past data.
Verdicts and explanations are shown inline.
User shops with more confidence and clarity.

🎯 Types of Fake Reviews Detected

1. Paid or Sponsored Reviews

Overly positive language with little detail
Keywords like “cheap”, “must buy”, “premium” spammed across unrelated products
Appears across multiple users with only slight variations

2. Copy-Paste Spam

Same comment posted by different users
Same user posts same/similar comment across multiple products
Often used in review farms or bots

3. Template-Based Reviews

Generic structure like: “Item arrived fast. Quality is good. Will buy again.”
Detected using semantic similarity (MiniLM + vector search)

4. Bot-Like Activity

Reviewer posts dozens of reviews in a short time
Repeated review structure regardless of product
Same review length, tone, and timing pattern

5. Irrelevant Comments

Comments that don’t match product context (e.g., review for a phone case saying “delicious and fresh”)
Detected via LLM vibe-checking and embedding mismatch

🧭 Scope

🛒 Shopee as e-commerce platform
📌 Focused on shirts category for data collection
🐧 Linux-based local installation
🗝️ Optional Google GenAI API key deployment for easier deployment
☁️ Supabase Cloud for easy access to database

🔧 Technical Execution

Solution Architecture

Tech Stack

Component	Description
Browser Extension	Scrapes and displays verdicts on Shopee
HTML/CSS/JS	Chromium-based browser extension development
FastAPI	Backend server managing LLM logic
MiniLM (Local)	Embedding generation and semantic analysis
Supabase	Relational + vector database backend
pgvector	Embedding similarity search
Python	Data pipeline, inference, and backend logic

Browser Extension

Scrapes data from Shopee product pages:
- Username
- Comment content
- Timestamp
- Star rating
- Product name
- Product URL
Sends the data to the backend server for the analysis.
Displays final verdicts and explanations directly on the page.

🔍 Backend Analysis

PostgreSQL (Relational DB)

Stores metadata: username, timestamp, product info, etc.
Hosts external review datasets (e.g., Kaggle 10k reviews, cleaned for vector embedding)
Enables behavior-based analysis using SQL queries

pgvector (Vector DB)

Stores sentence embeddings of review comments
Enables semantic similarity search (RAG-style logic)
Helps detect reused or templated comments across users/products

Statistical Analysis Pipeline

Semantic Analysis

Embeds each review comment using all-MiniLM-L6-v2.
Performs vector similarity search via pgvector.
Returns the top 5 most similar comments and similarity scores.
Purpose: Detect bot/paid reviews that are semantically similar to known fake templates.

Behavior Analysis

Runs multiple SQL queries to extract behavioral patterns:

Repetition: Has this user posted the same comment before?
Duplication: How many different users posted this same comment?
Spam: Is the comment reused across multiple products?
Activity: How frequently does this user post reviews?
Length: How long is the comment (character count)?

LLM Decision Pipeline

Stage	Description
LLM 1	Classifies review as Genuine / Suspicious / Irrelevant (based on "vibe")
LLM 2	Further evaluates Suspicious reviews using semantic + behavioral evidence

Suspicious reviews flagged by LLM 1 are sent through both semantic + behavioral analysis. Verdicts are generated based on aggregated insights.

If Gemini API key is not provided, the system defaults to the local Ollama model.

Continuous Learning

Live feedback loop: New reviews are stored for retraining purposes.
Fine-tuning: External datasets (e.g. Kaggle) used to improve classification accuracy.
Preprocessing: Comments are cleaned (emoji stripping, whitespace removal) to improve vector quality and matching performance.

📥 Deployment

Cloud Deploy (Minimal Setup)

Requirements

Python 3.12.3 – for running the backend server
Chromium-based browser – required for the extension (e.g., Chrome, Brave)
Google Gemini API Key – Get your API key here

1. Backend Setup

Clone the repository and install dependencies:

git clone https://github.com/your-repo-name/SpotCheck.git
cd SpotCheck
pip install -r requirements.txt

Run the backend server:

python ./backend/backend.py

2. Install Browser Extension

Open your Chromium browser
Navigate to: chrome://extensions/
Enable Developer Mode (top-right toggle)
Click "Load unpacked"
Select the src/ folder from the cloned repo
Click the extension in the browser
Put Google Gemini API key in the browser popup

Local Deployment (Recommended)

Installation steps included in INSTALL.md

🏎️ Performance

Run	`analyze_comments` (Local LLM)	`analyze_comments` (Gemini LLM)
1	4.08	14.57
2	4.84	24.44
3	4.55	25.17
4	11.09	16.54
5	11.15	11.18
Avg	7.14s	18.38s

Local LLM is ~61.14% faster than Gemini LLM on average.

❓ Why not using agentic tools?

We've tried using LangChain Agent to allow local deployed LLM to decide which analysis to perform to determine if the review is real or fake. It took around a minute to process due to its chain of thoughts. We didn't try using Gemini LLM on LangChain Agent as we are concerned with the network latency. But hey, at least we tried :')

✅ Real results

Backend logs

INFO:backend:Received 6 comments for analysis
INFO:backend:No Gemini API key provided for analysis
INFO:backend:Extracted usernames: ['nifailahmad', 's*****a', 's*****d', 's*****d', 'd*****n', 'm*****i']
INFO:backend:Batch analyzing 6 comments
using ollama
LLM response raw text: Here are the classifications:

1. Genuine: Specific details about product quality and seller's service.
2. Genuine: Detailed review with specific measurements, material description, and satisfaction with purchase.
3. Suspicious: Repetitive phrases ("jden mmg terbaik", "xpremium") and overly positive language.
4. Suspicious: Same as Review 3, repetitive phrases and overly positive language.
5. Genuine: Specific details about delivery speed, packaging, and product quality.
6. Genuine: Detailed review with specific comments on fabric thickness, stitching, and satisfaction with purchase.
INFO:backend:analyze_comments_batch_ollama completed in 4.72 seconds for 6 comments

INFO:backend:Processing result 0: explanation='Genuine: Specific details about product quality an...', starts_with_suspicious=False
INFO:backend:Processing result 1: explanation='Genuine: Detailed review with specific measurement...', starts_with_suspicious=False
INFO:backend:Processing result 2: explanation='Suspicious: Repetitive phrases ("jden mmg terbaik"...', starts_with_suspicious=True

Batches: 100%|████████████████████████████████████████████████████████████████████████████████| 1/1 [00:00<00:00, 74.32it/s]
INFO:backend:Semantic analysis for comment 2: 2 scores
INFO:backend:Calling collect_behavioral_signals for comment 2 with username='s*****d'
INFO:backend:collect_behavioral_signals called with username='s*****d', comment_length=197, table='product_reviews'
INFO:backend:Added evidence: User reused comment 3 times
INFO:backend:Behavioral analysis for comment 2 returned 1 evidence items: ['User reused the same comment.']

INFO:backend:Processing result 3: explanation='Suspicious: Same as Review 3, repetitive phrases a...', starts_with_suspicious=True

INFO:backend:Found suspicious comment 3: username='s*****d', comment='jden mmg terbaik..dh bli brand...'
Batches: 100%|███████████████████████████████████████████████████████████████████████████████| 1/1 [00:00<00:00, 225.78it/s]
INFO:backend:Semantic analysis for comment 3: 2 scores
INFO:backend:Calling collect_behavioral_signals for comment 3 with username='s*****d'
INFO:backend:collect_behavioral_signals called with username='s*****d', comment_length=197, table='product_reviews'
INFO:backend:Added evidence: User reused comment 3 times
INFO:backend:Behavioral analysis for comment 3 returned 1 evidence items: ['User reused the same comment.']

INFO:backend:suspicious_comments input: [{"comment": "jden mmg terbaik..dh bli brand mig xsedap..xpremium..jden jugak premium kain cantik sedap pakai dah basuh bnyak kali tetap ok..beli selai2 dekt live dapat murah giler\ud83d\ude02..total dekat 20 helai dh beli", "username": "s*****d", "analysis": [1.0, 1.0], "behavioral": ["User reused the same comment."]}, {"comment": "jden mmg terbaik..dh bli brand mig xsedap..xpremium..jden jugak premium kain cantik sedap pakai dah basuh bnyak kali tetap ok..beli selai2 dekt live dapat murah giler\ud83d\ude02..total dekat 20 helai dh beli", "username": "s*****d", "analysis": [1.0, 1.0], "behavioral": ["User reused the same comment."]}]
INFO:backend:determine_review_genuinty called with 2 suspicious comments
INFO:backend:Processing 2 semantic scores and 2 behavioral results

INFO:backend:determine_review_genuinty result: [{"comment": "jden mmg terbaik..dh bli brand mig xsedap..xpremium..jden jugak premium kain cantik sedap pakai dah basuh bnyak kali tetap ok..beli selai2 dekt live dapat murah giler\ud83d\ude02..total dekat 20 helai dh beli", "verdict": "FAKE", "explanation": "This review is fake because the user reused the same comment multiple times."}, {"comment": "jden mmg terbaik..dh bli brand mig xsedap..xpremium..jden jugak premium kain cantik sedap pakai dah basuh bnyak kali tetap ok..beli selai2 dekt live dapat murah giler\ud83d\ude02..total dekat 20 helai dh beli", "verdict": "FAKE", "explanation": "This review is fake because the user reused the same comment multiple times."}]
INFO:backend:analyze_comments completed in 10.92 seconds for 6 comments

⚠️ Limitations

High false positive rate: Some genuine reviews are mislabeled due to overlap in structure or reused wording.
Context hallucination: LLM may infer fake/real reasons not present in the original text.
Limited language understanding: Malay slang or mixed languages reduce accuracy.
Duplicate detection is surface-level: It doesn’t detect rewritten or paraphrased fakes.

🔧 Potential Improvements

Fine-tune LLM on real-world Shopee reviews (SEA language mixed).
Include user behavior (time of review, purchase patterns) for stronger signals.
Using more metadata to feed LLM for better accuracy.
Integrate GPT-4o or fine-tuned local models for faster, more reliable classification.

🙌 Credits

Brought to you by team TARUMT NOT TARC

TAN ZHEN YU
JONATHAN HO YOON CHOON
LYE WEI LUN
TAN GUO ZHI

Name		Name	Last commit message	Last commit date
Latest commit History 130 Commits
.github		.github
backend		backend
image		image
src		src
.env		.env
.gitignore		.gitignore
INSTALL.md		INSTALL.md
LICENSE		LICENSE
README.md		README.md
install.sh		install.sh
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

How does it work?

🎯 Types of Fake Reviews Detected

1. Paid or Sponsored Reviews

2. Copy-Paste Spam

3. Template-Based Reviews

4. Bot-Like Activity

5. Irrelevant Comments

🧭 Scope

🔧 Technical Execution

Solution Architecture

Tech Stack

Browser Extension

🔍 Backend Analysis

PostgreSQL (Relational DB)

pgvector (Vector DB)

Statistical Analysis Pipeline

Semantic Analysis

Behavior Analysis

LLM Decision Pipeline

Continuous Learning

📥 Deployment

Cloud Deploy (Minimal Setup)

Requirements

1. Backend Setup

2. Install Browser Extension

Local Deployment (Recommended)

🏎️ Performance

❓ Why not using agentic tools?

✅ Real results

Backend logs

⚠️ Limitations

🔧 Potential Improvements

🙌 Credits

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Uh oh!

Contributors

Uh oh!

Languages