Skip to content

NamQuanProject/Enric

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

First Place in the EVENTA 2025 Track 1 (held at ACM MM 2025)

ENRIC: EveNt-AwaRe Captioning with Image Retrieval via UnCertainty-Guided Re-ranking and Semantic Ensemble Reasoning

[Challenge] [Paper]

Team: cerebro (Members: Nam-Quan Nguyen, Minh-Hoang Le, Vinh-Toan Vong)

Final Leaderboard

Team Rank Overall AP R@1 R@10 CLIP CIDEr
cerebro (Ours) 1 0.5501 0.991 0.989 0.995 0.826 0.210
SodaBread 2 0.5457 0.982 0.977 0.988 0.870 0.204
Re: Zero Slavery 3 0.4515 0.955 0.945 0.973 0.732 0.156
ITxTK9 4 0.4200 0.966 0.955 0.983 0.828 0.133
noname_ 5 0.2824 0.708 0.663 0.801 0.783 0.081

Descriptions

This repository contains the solution developed by the cerebro team from the University of Science - VNUHCM to address the challenge posed in EVENTA - Track 1: Event-Enriched Image Retrieval and Captioning.

image
Overview of the Retrieval and Re-ranking Module
image
Overview of the Captioning Module

To reproduce our results, please follow the instructions provided below.

Instructions

🔹 Phase 1: Retrieval & Reranking

  1. Create Embeddings
python step_1_create_embeddings.py --input_folder data/database --output_folder embeddings/database
python step_1_create_embeddings.py --input_folder data/track1_private/query --output_folder embeddings/query
  1. Initial Retrieval
python step_1_retrieval.py --database_folder embeddings/database --query_folder embeddings/query
  1. Reranking
python step_1_rerank.py

🔹 Phase 2: Captioning & Semantic Reasoning

  1. Create the crawling database
  • Crawl articles and images from URLs in the origin database:

    python step_2_0_crawling.py
  • Create Embeddings for crawled images:

    python step_1_create_embeddings.py --input_folder imgs --output_folder embeddings/maching_new_database_internvlg
  • For each article, create a json file mapping the origin image and the crawled image

    python step_2_0_matching_image.py
  • Create database_new.json

    python step_2_0_new_database.py
  • Create output corresponding to final_json_result/context_extraction_image_article.json

    python step_2_0_create_result.py
  1. Generate Query Captions
python step_2_create_caption_query.py
  1. Retrieve First Article Summary
python step_2_first_article_summary.py
  1. Caption Enhancement via Strategies
  • Using Question Answering:

    python step_2_caption_process.py --qa --strategy questions_answers
  • Using Named Entity Extraction:

    python step_2_caption_process.py --name_entity --strategy name_entity
  • Using Chain-of-Thought Reasoning:

    python step_2_caption_process.py --strategy cot_5_things_fact_more_event

Note:

After generating captions using cot_5_things_fact_more_event, ensure your submission captions are clean:

  • 🔹 Remove any unwanted newlines like \n\n or stray \n.
  • 🔹 Convert captions into proper single-line strings.

You can use the provided post_processing.py script to automatically clean the final CSV before generating the submission file.

Contact

About

ENRIC: EveNt-AwaRe Captioning with Image Retrieval via UnCertainty-Guided Re-ranking and Semantic Ensemble Reasoning

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors