rui/ds1000 by rays1024 · Pull Request #44 · Yale-LILY/NLP4Code

rays1024 · 2023-04-14T20:48:55Z

No description provided.

.DS_Store

niansong1996 · 2023-04-24T16:01:51Z

@rays1024 Any updates on this? btw, the title for your PR is kind of confusing. If you are working on two different things (length calculator and ds-1000 dataset), you should have two separate branches and two separate PRs accordingly.

rays1024 · 2023-04-24T16:06:05Z

Sorry, I have been pretty busy with projects and assignments, but I will work on the issue and try to have an update by Friday. I'll also make a new PR for length_calculator. Thanks for the reminder!

niansong1996 · 2023-04-25T15:59:06Z

I see, thanks for the update. Let me know if anything changes.

rays1024 · 2023-06-28T07:02:01Z

Updating this pull request's name to "DS1000 dataset" to avoid confusion. Updates on length_calculator will be made on another branch/PR

niansong1996

Haven't tested the actual functionality but the executors and datasets look good in general.

I made some comments; can you fix the issues raised?

Thanks!

niansong1996 · 2023-07-03T02:22:47Z

DS_1000/ds1000.jsonl

please put this file in the data directory

niansong1996 · 2023-07-03T02:23:07Z

DS_1000/README.md

We don't want DS-1000's license and readme files in our repo

rays1024

I'll delete the license, but the readme file contains the instructions for evaluation and for creating the jsonl file.

niansong1996 · 2023-07-03T02:23:42Z

DS_1000/ds1000.py

Is this part of the testing harness for DS-1000?

rays1024

Yes, this script and the zip file are both needed for evaluation. The DS-1000 evaluation requires the original data so that the problems can be loaded into a DS1000Problem class. This class is then used for evaluation.

niansong1996 · 2023-07-03T02:24:27Z

DS_1000/ds1000_data.zip

Zip files should not be checked in (i.e., committed to the online repo) unless there is a strong reason for doing so.

rays1024

I can work on removing the need for the original problem data, but the data in the zip file is indeed needed for evaluation.

niansong1996 · 2023-07-03T02:25:26Z

DS_1000/preprocess.py

Why is this appending lines (i.e., using open mode a)?

rays1024

Each iteration of the for loop appends one dictionary as a new line to the jsonl file, which is why I used 'a' here. If another approach is more common, I can definitely change that!
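
For illustration, here is a minimal sketch of the two ways to write a jsonl file from a loop. The function names, the `problems` list, and the file names are hypothetical and are not taken from preprocess.py. Opening the file once in 'w' mode produces the same one-object-per-line output, but rerunning the script replaces the file instead of appending duplicate lines to it.

```python
import json
import os
import tempfile


def write_jsonl_append(problems, path):
    """Reopen the file in 'a' mode for every record (the approach described above)."""
    for problem in problems:
        with open(path, "a", encoding="utf-8") as f:
            f.write(json.dumps(problem) + "\n")


def write_jsonl_once(problems, path):
    """Open the file once in 'w' mode; a rerun overwrites instead of duplicating."""
    with open(path, "w", encoding="utf-8") as f:
        for problem in problems:
            f.write(json.dumps(problem) + "\n")


def count_lines(path):
    """Count the lines (records) currently stored in a jsonl file."""
    with open(path, encoding="utf-8") as f:
        return sum(1 for _ in f)


# Hypothetical records standing in for DS-1000 problems.
problems = [{"id": 0, "prompt": "a"}, {"id": 1, "prompt": "b"}]

with tempfile.TemporaryDirectory() as tmp:
    append_path = os.path.join(tmp, "append.jsonl")
    write_path = os.path.join(tmp, "write.jsonl")
    for _ in range(2):  # simulate running the preprocessing script twice
        write_jsonl_append(problems, append_path)
        write_jsonl_once(problems, write_path)
    append_lines = count_lines(append_path)  # second run added the records again
    write_lines = count_lines(write_path)    # second run replaced the file
```

With two records and two runs, the 'a' variant ends up with four lines while the 'w' variant stays at two, which is the usual reason reviewers ask about append mode in preprocessing scripts.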

niansong1996 · 2023-07-03T02:26:40Z

length_calculator.py

We should probably put this file in analysis or utils or preprocessing

rays1024 · 2023-07-25T00:52:29Z

Using jsonargparse==4.15.0 would resolve the previous problems. Still working on fixing the directory problem when evaluating 208 or more problems.

rays1024 · 2023-07-25T00:54:55Z

An error with a torch tensor occurred when evaluating incoder-1b, the same as in this discussion. I added incoder-1b to the if statement at line 159 of seq2seq_model.py to fix this issue.

rays1024 · 2023-07-28T00:36:03Z

Fixed the result saving issue by adding a safe execution wrapper in the DS1000Executor class. The wrapper uses execute in safe_execution_util as a template.
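
The idea of a safe execution wrapper can be sketched as follows. This is not the repository's `execute` from safe_execution_util, whose implementation is not shown in this thread; it is a stand-in built on the standard library's `subprocess` module, and the name `safe_execute` and the shape of the result dictionary are assumptions made for illustration. The point is that a crash or a hang in one generated program is turned into a result record instead of an exception, so results for the remaining problems can still be saved.

```python
import subprocess
import sys


def safe_execute(code: str, timeout: float = 5.0) -> dict:
    """Run `code` in a separate Python process and always return a result dict.

    A non-zero exit or a timeout in the generated program is reported in
    the dict instead of propagating to the caller.
    """
    try:
        proc = subprocess.run(
            [sys.executable, "-c", code],
            capture_output=True,
            text=True,
            timeout=timeout,
        )
    except subprocess.TimeoutExpired:
        # The child process is killed by subprocess.run before this is raised.
        return {"status": "timeout", "stdout": "", "stderr": ""}
    status = "passed" if proc.returncode == 0 else "failed"
    return {"status": status, "stdout": proc.stdout, "stderr": proc.stderr}


ok = safe_execute("print(1 + 1)")
bad = safe_execute("raise ValueError('boom')")
slow = safe_execute("import time; time.sleep(10)", timeout=0.5)
```

Running the untrusted code in a child process also keeps its side effects, such as a changed working directory or a modified global state, away from the evaluation loop itself.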

added length_calculator

c2a2116

niansong1996 reviewed Apr 20, 2023

.DS_Store (outdated, resolved)

yilunzhao and others added 11 commits April 25, 2023 16:48

re-upload the implementation for llama, alpaca, santacoder

529edce

Merge branch 'main' into yilunzhao/llm_implementation

7caf0ea

try to pass CI check

9dbc8cd

Merge branch 'main' into yilunzhao/llm_implementation

06c7858

modify process_output function in exectutor.py to handle the case that llama-based model uses empty string as tokenizer_eos_token

ce36f13

fix error related to llama eos_token

60fd89b

add starcoder, gpt-neox-20b; test gpt-j-6b

83546d9

archieve code for NeurIPS exps; add gpt-4, pythia, replit, dolly, starchat, etc

86d3bd5

added dataset and executor for DS1000

d10a59a

Merge branch 'main' into rui/ds-1000

a9df68b

update branch to latest main branch

removed all .DS_Store

40c02b5
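
One of the commits above handles llama-based models that use an empty string as tokenizer_eos_token. The body of process_output is not shown in this thread, so the following is only a sketch of why an empty EOS token needs a special case, assuming the output is truncated by splitting on the EOS token. The function name `truncate_at_eos` and the `</s>` token are hypothetical.

```python
def truncate_at_eos(text: str, eos_token: str) -> str:
    """Return the part of `text` before the first EOS token.

    An empty `eos_token` is treated as "no EOS token": str.split("") raises
    ValueError("empty separator"), so the text is returned unchanged instead.
    """
    if not eos_token:
        return text
    return text.split(eos_token)[0]


with_eos = truncate_at_eos("print(1)</s>ignored", "</s>")
empty_eos = truncate_at_eos("print(1)", "")
```

Without the guard, the second call would raise instead of returning the generated code.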

rays1024 changed the title ~~[wip] added length_calculator~~ [wip] DS1000 dataset Jun 28, 2023

Rui Shen added 3 commits June 28, 2023 23:21

added ds1000 prompt file

af3ffa2

fixed missing file and added ds1000 prompt file

2bfee22

zipped ds1000_data and removed irrelevant files

9e8b571

niansong1996 reviewed Jul 3, 2023

rays1024 added 5 commits July 4, 2023 23:35

class variable ds_data now initialize in init

fb089c0

Merge remote-tracking branch 'origin/yilunzhao/llm_implementation' into rui/ds-1000

cfb10c7

Merging with Yilun's branch to run evaluation on DS1000

fixed ds1000 gpt-neo-125M evaluation bugs

adb4c79

some code clean up

39a4fd4

included reference code in prompt

0ecdb5f

rays1024 changed the title ~~[wip] DS1000 dataset~~ DS1000 dataset Jul 9, 2023

rays1024 changed the title ~~DS1000 dataset~~ rui/ds1000 Jul 9, 2023

updated ds1000.yaml

5666a22

rays1024 added 4 commits July 15, 2023 22:14

fixed evaluation error at 158/159th problem

27e0e70

fixed evaluation bug

4ba669d

fixed torch tensor bug

dc4fc0c

undo changes caused by wrong jsonargparse version

62250c5

rays1024 added 3 commits July 27, 2023 03:29

fixed ds1000 saving issue

f7e930b

Revert "fixed ds1000 saving issue"

1ed0607

This reverts commit f7e930b.

fixed result saving issue

90cba4f

rays1024 added 6 commits July 31, 2023 00:32

evaluating raw output without cutting off at keywords

4229001

No custom instruction and using only DS1000 prompt

b787508

reverted raw output and uses keyword cutoff in execution

73b9676

no cutoff at execution to check raw output

d41eb3b

updated gitignore

5fad2f3

execution now preprocess model output

b3aa6ba
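
The last few commits go back and forth between executing the raw model output and cutting it off at keywords first. A minimal sketch of such a cutoff step is given below. The function name `cut_at_keywords` and the stop keywords are assumptions chosen for illustration; the list actually used in this branch is not shown in the thread.

```python
from typing import Sequence


def cut_at_keywords(output: str, stop_keywords: Sequence[str]) -> str:
    """Return `output` truncated at the earliest occurrence of any stop keyword."""
    cut = len(output)
    for keyword in stop_keywords:
        idx = output.find(keyword)
        if idx != -1:
            cut = min(cut, idx)
    return output[:cut]


# Assumed stop keywords, not taken from the PR.
STOP_KEYWORDS = ["</code>", "# SOLUTION END"]

raw = "result = df.sum()\n</code>\nEND SOLUTION\nprint('extra text')"
cleaned = cut_at_keywords(raw, STOP_KEYWORDS)
untouched = cut_at_keywords("x = 1", STOP_KEYWORDS)
```

Applying the cutoff before execution keeps text that the model generates after the solution from causing syntax errors in an otherwise correct answer.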

niansong1996 mentioned this pull request Sep 20, 2023

Adding more general code generation datasest #33

Open

3 participants
