Until 2022, Google was the primary tool for answering questions online. With the rise of Large Language Models (LLMs), there has been a significant shift in how people search for information. This shift highlights the importance of question answering for LLMs and, in particular, of improving their results and characterizing their knowledge.
This work aims to measure LLMs' level of knowledge on Multiple-Choice Questions (MCQs) by applying various elimination strategies and investigating whether those strategies can enhance the models' abilities. Our findings suggest that elimination strategies do not improve LLMs' performance on MCQs, which might be due to the models' partial knowledge.
- Run the following commands:

  ```
  pip install datasets
  pip install accelerate -U
  pip install --upgrade transformers
  ```

- Change the `TOKEN` and `PATH` constants (as described in `elimination.py`).
- Change the `RUN_XXX` and `ANALYZE` constants to control your running options.
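For orientation, here is a hypothetical sketch of how these constants might look inside `elimination.py`. All values are placeholders, and `RUN_MCQ` is an invented example standing in for a `RUN_XXX` flag; consult the file itself for the actual names and documentation.

```python
# Hypothetical excerpt of elimination.py's configuration constants.
# All values are placeholders; RUN_MCQ is an invented example of a
# RUN_XXX flag. Check elimination.py itself for the real options.
TOKEN = "hf_your_access_token"   # Hugging Face access token
PATH = "/path/to/project/root"   # base directory for data and outputs

RUN_MCQ = True    # example RUN_XXX flag: toggle a specific experiment
ANALYZE = False   # toggle the analysis/plotting stage
```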
Results can be found in the 'Plots', 'Figures' and 'Data' directories.
This project was done as part of an advanced NLP course (67664) at the Hebrew University of Jerusalem.
