From b81db264d2453877105331f9b4db294ed2cc7789 Mon Sep 17 00:00:00 2001 From: Shi Yu Date: Fri, 13 Jun 2025 18:37:58 -0400 Subject: [PATCH] Remove `--input_base` and `--output_base` since they are positional arguments --- README.md | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/README.md b/README.md index 2f37e8f..17fadf2 100644 --- a/README.md +++ b/README.md @@ -99,8 +99,8 @@ The script `annotate_data/annotate.sh` does large-scale data annotation using a Example usage: ```python python select_training_data.py \ - --input_base "datasets/Corpus-200B" \ - --output_base "datasets/selected/Baseline-30B" \ + "datasets/Corpus-200B" \ + "datasets/selected/Baseline-30B" \ --num_tokens 30000000000 \ --do_sample \ --num_proc 16