Hello MM-Eureka Team,
First of all, great work! I wanted to start reproducing the results.
But first I have a question: There are 54931 lines in the JSON file (is in line with what you have written in 2.2 in the paper).
But there are only 47363 training images (in train folders) and 68 eval images (eval folders).
Even if I add the 4844 lines of code in JSON where the blank_image.jpg occurs (for text-only inputs) there are still ca. 2-3k images missing.
Am I doing any mistake or are some images missing? Are there cases where you use the same image for different questions?
If there is something missing could you please provide the data? Or explain if I am misunderstanding?
Thank you and best regards :)
Hello MM-Eureka Team,
First of all, great work! I wanted to start reproducing the results.
But first I have a question: There are 54931 lines in the JSON file (is in line with what you have written in 2.2 in the paper).
But there are only 47363 training images (in train folders) and 68 eval images (eval folders).
Even if I add the 4844 lines of code in JSON where the blank_image.jpg occurs (for text-only inputs) there are still ca. 2-3k images missing.
Am I doing any mistake or are some images missing? Are there cases where you use the same image for different questions?
If there is something missing could you please provide the data? Or explain if I am misunderstanding?
Thank you and best regards :)