DVAGen is a fully open-source, unified framework designed for training, evaluation, and visualization of dynamic vocabulary-augmented language models.
- [2025/07/18] DVAGen v0.1.0 released!
Download the repository and install DVAGen:
git clone https://github.com/AntNLP/DVAGen.git
cd DVAGen
pip install -e .

Note that the faiss-gpu package is required if you intend to retrieve supporting documents using the GPU with FAISSRetriever.
faiss-gpu contains both GPU and CPU indices and may be incompatible with the CPU-only version (faiss-cpu).
For further information, please refer to the FAISS documentation.
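
Because faiss-gpu and faiss-cpu both install a module named faiss, it can be unclear which build is active in an environment. The following is a minimal sketch, not part of DVAGen, for checking this before you configure GPU retrieval. The helper name detect_faiss_build is hypothetical, and the check assumes that GPU builds expose faiss.StandardGpuResources while CPU-only builds do not.

```python
import importlib.util


def detect_faiss_build() -> str:
    """Return "missing", "cpu", or "gpu" for the FAISS package in this environment."""
    # find_spec avoids importing anything when FAISS is not installed at all.
    if importlib.util.find_spec("faiss") is None:
        return "missing"

    import faiss  # faiss-cpu and faiss-gpu both provide this module name

    # Assumption: GPU builds expose StandardGpuResources; CPU-only builds do not.
    return "gpu" if hasattr(faiss, "StandardGpuResources") else "cpu"


if __name__ == "__main__":
    print(f"FAISS build: {detect_faiss_build()}")
```

If the result is "cpu" and you need GPU retrieval, the FAISS documentation describes how to switch packages.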
Use the following command to launch a CLI or WebUI tool for chatting:
dvagen chat --config_path examples/chat.yaml

To evaluate a model on various tasks, use the following command:
dvagen eval --config_path examples/eval.yaml

By default, we use deepspeed to launch the training script. To train a model, use the following command:
dvagen train [deepspeed_args] --config_path examples/train.yaml

An example training command:
dvagen train --num_gpus 1 \
--num_nodes 1 \
--master_addr "localhost" \
--master_port 9901 \
--config_path examples/train.yaml

Details of the configuration files are available in the examples/README.md file.
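
When launching several training runs from a script, it can help to assemble the training command programmatically. The sketch below uses only the flags that appear in the example training command; build_train_command is a hypothetical helper, not part of DVAGen, and its default values simply mirror that example.

```python
import shlex


def build_train_command(config_path, num_gpus=1, num_nodes=1,
                        master_addr="localhost", master_port=9901):
    """Assemble the argv list for `dvagen train` (hypothetical helper)."""
    return [
        "dvagen", "train",
        "--num_gpus", str(num_gpus),
        "--num_nodes", str(num_nodes),
        "--master_addr", master_addr,
        "--master_port", str(master_port),
        "--config_path", config_path,
    ]


cmd = build_train_command("examples/train.yaml")
# Print the command as a single shell-safe line for inspection.
print(shlex.join(cmd))
```

To actually start training, the list can be passed to subprocess.run(cmd, check=True). Keeping the arguments as a list avoids shell quoting problems.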
