First Place in the EVENTA 2025 Track 1 (held at ACM MM 2025)

ENRIC: EveNt-AwaRe Captioning with Image Retrieval via UnCertainty-Guided Re-ranking and Semantic Ensemble Reasoning

Team: cerebro (Members: Nam-Quan Nguyen, Minh-Hoang Le, Vinh-Toan Vong)

Final Leaderboard

Team	Rank	Overall	AP	R@1	R@10	CLIP	CIDEr
cerebro (Ours)	1	0.5501	0.991	0.989	0.995	0.826	0.210
SodaBread	2	0.5457	0.982	0.977	0.988	0.870	0.204
Re: Zero Slavery	3	0.4515	0.955	0.945	0.973	0.732	0.156
ITxTK9	4	0.4200	0.966	0.955	0.983	0.828	0.133
noname_	5	0.2824	0.708	0.663	0.801	0.783	0.081

Descriptions

This repository contains the solution developed by the cerebro team from the University of Science - VNUHCM to address the challenge posed in EVENTA - Track 1: Event-Enriched Image Retrieval and Captioning.


Overview of the Retrieval and Re-ranking Module


Overview of the Captioning Module

To reproduce our results, please follow the instructions provided below.

Instructions

🔹 Phase 1: Retrieval & Reranking

Create Embeddings

python step_1_create_embeddings.py --input_folder data/database --output_folder embeddings/database
python step_1_create_embeddings.py --input_folder data/track1_private/query --output_folder embeddings/query

Initial Retrieval

python step_1_retrieval.py --database_folder embeddings/database --query_folder embeddings/query

Reranking

python step_1_rerank.py

🔹 Phase 2: Captioning & Semantic Reasoning

Create the crawling database

Crawl articles and images from URLs in the origin database:
```
python step_2_0_crawling.py
```

Create Embeddings for crawled images:

python step_1_create_embeddings.py --input_folder imgs --output_folder embeddings/maching_new_database_internvlg

For each article, create a json file mapping the origin image and the crawled image
```
python step_2_0_matching_image.py
```
Create database_new.json
```
python step_2_0_new_database.py
```
Create output corresponding to final_json_result/context_extraction_image_article.json
```
python step_2_0_create_result.py
```

Generate Query Captions

python step_2_create_caption_query.py

Retrieve First Article Summary

python step_2_first_article_summary.py

Caption Enhancement via Strategies

Using Question Answering:

python step_2_caption_process.py --qa --strategy questions_answers

Using Named Entity Extraction:

python step_2_caption_process.py --name_entity --strategy name_entity

Using Chain-of-Thought Reasoning:

python step_2_caption_process.py --strategy cot_5_things_fact_more_event

Note:

After generating captions using cot_5_things_fact_more_event, ensure your submission captions are clean:

🔹 Remove any unwanted newlines like \n\n or stray \n.
🔹 Convert captions into proper single-line strings.

You can use the provided post_processing.py script to automatically clean the final CSV before generating the submission file.

Contact

Nam-Quan Nguyen (nnquan23@apcs.fitus.edu.vn)
Minh-Hoang Le (lmhoang22@apcs.fitus.edu.vn)
Vinh-Toan Vong (vvtoan22@apcs.fitus.edu.vn)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

First Place in the EVENTA 2025 Track 1 (held at ACM MM 2025)

ENRIC: EveNt-AwaRe Captioning with Image Retrieval via UnCertainty-Guided Re-ranking and Semantic Ensemble Reasoning

Final Leaderboard

Descriptions

Instructions

🔹 Phase 1: Retrieval & Reranking

🔹 Phase 2: Captioning & Semantic Reasoning

Note:

Contact

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
assemble_caption_prompt_template		assemble_caption_prompt_template
assemble_result		assemble_result
data		data
.gitignore		.gitignore
README.md		README.md
conversation.py		conversation.py
internvl.py		internvl.py
llama3.py		llama3.py
llamassemblers.py		llamassemblers.py
logit_scale.pt		logit_scale.pt
post_processing.py		post_processing.py
requirements.txt		requirements.txt
step_1_create_embeddings.py		step_1_create_embeddings.py
step_1_rerank.py		step_1_rerank.py
step_1_retrieval.py		step_1_retrieval.py
step_2_0_crawling.py		step_2_0_crawling.py
step_2_0_create_result.py		step_2_0_create_result.py
step_2_0_matching_image.py		step_2_0_matching_image.py
step_2_0_new_database.py		step_2_0_new_database.py
step_2_caption_process.py		step_2_caption_process.py
step_2_create_caption_query.py		step_2_create_caption_query.py
step_2_first_article_summary.py		step_2_first_article_summary.py
step_2_merge_all_elements.py		step_2_merge_all_elements.py

Folders and files

Latest commit

History

Repository files navigation

First Place in the EVENTA 2025 Track 1 (held at ACM MM 2025)

ENRIC: EveNt-AwaRe Captioning with Image Retrieval via UnCertainty-Guided Re-ranking and Semantic Ensemble Reasoning

Final Leaderboard

Descriptions

Instructions

🔹 Phase 1: Retrieval & Reranking

🔹 Phase 2: Captioning & Semantic Reasoning

Note:

Contact

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages