Skip to content

Commit cd84ba0

Browse files
committed
Add new papers
1 parent 022b7a0 commit cd84ba0

11 files changed

Lines changed: 986 additions & 925 deletions

README.md

Lines changed: 788 additions & 897 deletions
Large diffs are not rendered by default.

data/ack_repos.yaml

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -10,6 +10,7 @@
1010
- https://github.com/eltociear/awesome-AI-driven-development
1111
- https://github.com/YuyaoGe/Awesome-Vibe-Coding
1212
- https://github.com/codefuse-ai/Awesome-Code-LLM
13+
- https://github.com/juyongjiang/CodeLLMSurvey
1314
- https://github.com/xjywhu/Awesome-Multimodal-LLM-for-Code
1415
- https://github.com/github/awesome-copilot
1516
- https://github.com/steel-dev/awesome-web-agents

data/papers_3d_object_design.yaml

Lines changed: 8 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -158,6 +158,14 @@
158158
github: "https://github.com/LiuHengyu321/IR3D-Bench"
159159
website: "https://ir3d-bench.github.io/"
160160

161+
- title: "SceneGenAgent: Precise Industrial Scene Generation with Coding Agent"
162+
authors: "Xiao Xia, Dan Zhang, Zibo Liao, Zhenyu Hou, Tianrui Sun, Jing Li, Ling Fu, Yuxiao Dong"
163+
venue: "ACL 2025"
164+
links:
165+
paper: "https://aclanthology.org/2025.acl-long.873/"
166+
github: ""
167+
website: ""
168+
161169
- title: "SceneMotifCoder: Example-driven Visual Program Learning for Generating 3D Object Arrangements"
162170
authors: "Hou In Ivan Tam, Hou In Derek Pun, Austin T. Wang, Angel X. Chang, Manolis Savva"
163171
venue: "3DV 2025"

data/papers_code_generation.yaml

Lines changed: 25 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -1,3 +1,12 @@
1+
- title: "Does AI-Assisted Coding Deliver? A Difference-in-Differences Study of Cursor's Impact on Software Projects"
2+
authors: "Hao He, Courtney Miller, Shyam Agarwal, Christian Kästner, Bogdan Vasilescu"
3+
venue: "arXiv 2025/11"
4+
links:
5+
paper: "https://arxiv.org/abs/2511.04427"
6+
github: ""
7+
website: ""
8+
tags: "Empirical Study"
9+
110
- title: "Lost in Code Generation: Reimagining the Role of Software Models in AI-driven Software Engineering"
211
authors: "Jürgen Cito, Dominik Bork"
312
venue: "arXiv 2025/11"
@@ -138,13 +147,13 @@
138147
github: "https://github.com/FSoft-AI4Code/HyperAgent"
139148
website: ""
140149

141-
- title: "Verbal Process Supervision Elicits Better Coding Agents"
142-
authors: "Hao-Yuan Chen, Cheng-Pong Huang, Jui-Ming Yao"
143-
venue: "arXiv 2025"
144-
links:
145-
paper: "https://arxiv.org/abs/2503.18494"
146-
github: ""
147-
website: ""
150+
# - title: "Verbal Process Supervision Elicits Better Coding Agents"
151+
# authors: "Hao-Yuan Chen, Cheng-Pong Huang, Jui-Ming Yao"
152+
# venue: "arXiv 2025"
153+
# links:
154+
# paper: "https://arxiv.org/abs/2503.18494"
155+
# github: ""
156+
# website: ""
148157

149158
- title: "KernelBench: Can LLMs Write Efficient GPU Kernels?"
150159
authors: "Anne Ouyang, Simon Guo, Simran Arora, Alex L. Zhang, William Hu, Christopher Ré, Azalia Mirhoseini"
@@ -154,6 +163,14 @@
154163
github: "https://github.com/ScalingIntelligence/KernelBench"
155164
website: "https://scalingintelligence.stanford.edu/blogs/kernelbench/"
156165

166+
- title: "EpiCoder: Encompassing Diversity and Complexity in Code Generation"
167+
authors: "Yaoxiang Wang, Haoling Li, Xin Zhang, Jie Wu, Xiao Liu, Wenxiang Hu, Zhongxin Guo, Yangyu Huang, Ying Xin, Yujiu Yang, Jinsong Su, Qi Chen, Scarlett Li"
168+
venue: "ICML 2025"
169+
links:
170+
paper: "https://arxiv.org/abs/2501.04694"
171+
github: ""
172+
website: ""
173+
157174
- title: "On the Impacts of Contexts on Repository-Level Code Generation"
158175
authors: "Nam Le Hai, Dung Manh Nguyen, Nghi D. Q. Bui"
159176
venue: "NAACL 2025 Findings"
@@ -258,7 +275,7 @@
258275
github: ""
259276
website: ""
260277

261-
- title: "CodeTree: Agentguided Tree Search for Code Generation with Large Language Models"
278+
- title: "CodeTree: Agent-guided Tree Search for Code Generation with Large Language Models"
262279
authors: "Jierui Li, Hung Le, Yingbo Zhou, Caiming Xiong, Silvio Savarese, Doyen Sahoo"
263280
venue: "arXiv 2024"
264281
links:

data/papers_issue_localization.yaml

Lines changed: 8 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -47,6 +47,14 @@
4747
github: "https://github.com/SalesforceAIResearch/SweRank"
4848
website: ""
4949

50+
- title: "RANGER: Repository-Level Agent for Graph-Enhanced Retrieval"
51+
authors: "Pratik Shah, Rajat Ghosh, Aryan Singhal, Debojyoti Dutta"
52+
venue: "arXiv 2025/09"
53+
links:
54+
paper: "https://arxiv.org/abs/2509.25257"
55+
github: ""
56+
website: ""
57+
5058
- title: "Benchmarking and Enhancing LLM Agents in Localizing Linux Kernel Bugs"
5159
authors: "Zhenhao Zhou, Zhuochen Huang, Yike He, Chong Wang, Jiajun Wang, Yijian Wu, Xin Peng, Yiling Lou"
5260
venue: "arXiv 2025/05"

data/papers_issue_resolution.yaml

Lines changed: 135 additions & 15 deletions
Original file line numberDiff line numberDiff line change
@@ -6,6 +6,14 @@
66
github: "https://github.com/EuniAI/Prometheus"
77
website: "https://euni.ai/"
88

9+
- title: "Unified Software Engineering Agent as AI Software Engineer"
10+
authors: "Leonhard Applis, Yuntong Zhang, Shanchao Liang, Nan Jiang, Lin Tan, Abhik Roychoudhury"
11+
venue: "ICSE 2026"
12+
links:
13+
paper: "https://arxiv.org/abs/2506.14683"
14+
github: ""
15+
website: ""
16+
917
- title: "Beyond Final Code: A Process-Oriented Error Analysis of Software Development Agents in Real-World GitHub Scenarios"
1018
authors: "Zhi Chen, Wei Ma, Lingxiao Jiang"
1119
venue: "ICSE 2026"
@@ -24,6 +32,14 @@
2432
website: ""
2533
tags: "Empirical Study"
2634

35+
- title: "Confucius Code Agent: Scalable Agent Scaffolding for Real-World Codebases"
36+
authors: "Zhaodong Wang, Zhenting Qi, Sherman Wong, Nathan Hu, Samuel Lin, Jun Ge, Erwin Gao, Wenlin Chen, Yilun Du, Minlan Yu, Ying Zhang"
37+
venue: "arXiv 2025/12"
38+
links:
39+
paper: "https://arxiv.org/abs/2512.10398"
40+
github: ""
41+
website: ""
42+
2743
- title: "Is Vibe Coding Safe? Benchmarking Vulnerability of Agent-Generated Code in Real-World Tasks"
2844
authors: "Songwen Zhao, Danqing Wang, Kexun Zhang, Jiaxuan Luo, Zhuo Li, Lei Li"
2945
venue: "arXiv 2025/12"
@@ -48,6 +64,30 @@
4864
github: "https://github.com/OpenHands/software-agent-sdk"
4965
website: ""
5066

67+
- title: "CodeClash: Benchmarking Goal-Oriented Software Engineering"
68+
authors: "John Yang, Kilian Lieret, Joyce Yang, Carlos E. Jimenez, Ofir Press, Ludwig Schmidt, Diyi Yang"
69+
venue: "arXiv 2025/11"
70+
links:
71+
paper: "https://arxiv.org/abs/2511.00839"
72+
github: ""
73+
website: "https://codeclash.ai/"
74+
75+
- title: "InfCode: Adversarial Iterative Refinement of Tests and Patches for Reliable Software Issue Resolution"
76+
authors: "KeFan Li, Mengfei Wang, Hengzhi Zhang, Zhichao Li, Yuan Yuan, Mu Li, Xiang Gao, Hailong Sun, Chunming Hu, Weifeng Lv"
77+
venue: "arXiv 2025/11"
78+
links:
79+
paper: "https://arxiv.org/abs/2511.16004"
80+
github: ""
81+
website: ""
82+
83+
- title: "Agent READMEs: An Empirical Study of Context Files for Agentic Coding"
84+
authors: "Worawalan Chatlatanagulchai, Hao Li, Yutaro Kashiwa, Brittany Reid, Kundjanasith Thonglek, Pattara Leelaprute, Arnon Rungsawang, Bundit Manaskasemsak, Bram Adams, Ahmed E. Hassan, Hajimu Iida"
85+
venue: "arXiv 2025/11"
86+
links:
87+
paper: "https://arxiv.org/abs/2511.12884"
88+
github: ""
89+
website: ""
90+
5191
- title: "Understanding Code Agent Behaviour: An Empirical Study of Success and Failure Trajectories"
5292
authors: "Oorja Majgaonkar, Zhiwei Fei, Xiang Li, Federica Sarro, He Ye"
5393
venue: "arXiv 2025/11"
@@ -114,6 +154,14 @@
114154
github: ""
115155
website: ""
116156

157+
- title: "Process-Level Trajectory Evaluation for Environment Configuration in Software Engineering Agents"
158+
authors: "Jiayi Kuang, Yinghui Li, Xin Zhang, Yangning Li, Di Yin, Xing Sun, Ying Shen, Philip S. Yu"
159+
venue: "arXiv 2025/10"
160+
links:
161+
paper: "https://arxiv.org/abs/2510.25694"
162+
github: ""
163+
website: ""
164+
117165
- title: "BugPilot: Complex Bug Generation for Efficient Learning of SWE Skills"
118166
authors: "Atharv Sonwane, Isadora White, Hyunji Lee, Matheus Pereira, Lucas Caccia, Minseon Kim, Zhengyan Shi, Chinmay Singh, Alessandro Sordoni, Marc-Alexandre Côté, Xingdi Yuan"
119167
venue: "arXiv 2025/10"
@@ -188,6 +236,22 @@
188236
github: "https://github.com/QuantaAlpha/GitTaskBench"
189237
website: "https://gittaskbench.github.io/"
190238

239+
- title: "RepoForge: Training a SOTA Fast-thinking SWE Agent with an End-to-End Data Curation Pipeline Synergizing SFT and RL at Scale"
240+
authors: "Zhilong Chen, Chengzong Zhao, Boyuan Chen, Dayi Lin, Yihao Chen, Arthur Leung, Gopi Krishnan Rajbahadur, Gustavo A. Oliva, Haoxiang Zhang, Aaditya Bhatia, Chong Chun Yong, Ahmed E. Hassan"
241+
venue: "arXiv 2025/08"
242+
links:
243+
paper: "https://arxiv.org/abs/2508.01550"
244+
github: ""
245+
website: ""
246+
247+
- title: "Understanding Software Engineering Agents Through the Lens of Traceability: An Empirical Study"
248+
authors: "Ira Ceka, Saurabh Pujar, Shyam Ramji, Luca Buratti, Gail Kaiser, Baishakhi Ray"
249+
venue: "arXiv 2025/06"
250+
links:
251+
paper: "https://arxiv.org/abs/2506.08311"
252+
github: ""
253+
website: ""
254+
191255
- title: "SWE-Bench Pro: Can AI Agents Solve Long-Horizon Software Engineering Tasks?"
192256
authors: "Xiang Deng, Jeff Da, Edwin Pan, Yannis Yiming He, Charles Ide, Kanak Garg, Niklas Lauffer, Andrew Park, Nitin Pasari, Chetan Rane, Karmini Sampath, Maya Krishnan, Srivatsa Kundurthy, Sean Hendryx, Zifan Wang, Chen Bo Calvin Zhang, Noah Jacobson, Bing Liu, Brad Kenstler"
193257
venue: "arXiv 2025"
@@ -268,6 +332,14 @@
268332
github: "https://github.com/yingweima2022/SWE-Reasoner"
269333
website: ""
270334

335+
- title: "AutoCodeSherpa: Symbolic Explanations in AI Coding Agents"
336+
authors: "Sungmin Kang, Haifeng Ruan, Abhik Roychoudhury"
337+
venue: "arXiv 2025/07"
338+
links:
339+
paper: "https://arxiv.org/abs/2507.22414"
340+
github: ""
341+
website: ""
342+
271343
- title: "Satori-SWE: Evolutionary Test-Time Scaling for Sample-Efficient Software Engineering"
272344
authors: "Guangtao Zeng, Maohao Shen, Delin Chen, Zhenting Qi, Subhro Das, Dan Gutfreund, David Cox, Gregory Wornell, Wei Lu, Zhang-Wei Hong, Chuang Gan"
273345
venue: "arXiv 2025/05"
@@ -276,6 +348,14 @@
276348
github: "https://github.com/satori-reasoning/Satori-SWE"
277349
website: "https://satori-reasoning.github.io/"
278350

351+
- title: "CrashFixer: A crash resolution agent for the Linux kernel"
352+
authors: "Alex Mathai, Chenxi Huang, Suwei Ma, Jihwan Kim, Hailie Mitchell, Aleksandr Nogikh, Petros Maniatis, Franjo Ivančić, Junfeng Yang, Baishakhi Ray"
353+
venue: "arXiv 2025/04"
354+
links:
355+
paper: "https://arxiv.org/abs/2504.20412"
356+
github: ""
357+
website: ""
358+
279359
- title: "DARS: Dynamic Action Re-Sampling to Enhance Coding Agent Performance by Adaptive Tree Traversal"
280360
authors: "Vaibhav Aggarwal, Ojasv Kamal, Abhinav Japesh, Zhijing Jin, Bernhard Schölkopf"
281361
venue: "arXiv 2025/03"
@@ -500,6 +580,14 @@
500580
github: ""
501581
website: ""
502582

583+
# - title: "Large Language Model Critics for Execution-Free Evaluation of Code Changes"
584+
# authors: "Aashish Yadavally, Hoan Nguyen, Laurent Callot, Gauthier Guinet"
585+
# venue: "arXiv 2025/01"
586+
# links:
587+
# paper: "https://arxiv.org/abs/2501.16655"
588+
# github: ""
589+
# website: ""
590+
503591
- title: "Can Agents Fix Agent Issues?"
504592
authors: "Alfin Wijaya Rahardja, Junwei Liu, Weitong Chen, Zhenpeng Chen, Yiling Lou"
505593
venue: "NeurIPS 2025"
@@ -580,6 +668,14 @@
580668
github: "https://github.com/Hambaobao/SWE-Flow"
581669
website: ""
582670

671+
- title: "Nemotron-CORTEXA: Enhancing LLM Agents for Software Engineering Tasks via Improved Localization and Solution Diversity"
672+
authors: "Atefeh Sohrabizadeh, Jialin Song, Mingjie Liu, Rajarshi Roy, Chankyu Lee, Jonathan Raiman, Bryan Catanzaro"
673+
venue: "ICML 2025"
674+
links:
675+
paper: "https://openreview.net/forum?id=k6p8UKRdH7"
676+
github: ""
677+
website: ""
678+
583679
- title: "Guided Search Strategies in Non-Serializable Environments with Applications to Software Engineering Agents"
584680
authors: "Karina Zainullina, Alexander Golubev, Maria Trofimova, Sergei Polezhaev, Ibragim Badertdinov, Daria Litvintseva, Simon Karasik, Filipp Fisin, Sergei Skvortsov, Maksim Nekrashevich, Anton Shevtsov, Boris Yangel"
585681
venue: "ICML 2025"
@@ -652,6 +748,14 @@
652748
github: ""
653749
website: ""
654750

751+
- title: "Understanding Software Engineering Agents: A Study of Thought-Action-Result Trajectories"
752+
authors: "Islem Bouzenia, Michael Pradel"
753+
venue: "ASE 2025"
754+
links:
755+
paper: "https://arxiv.org/abs/2506.18824"
756+
github: ""
757+
website: ""
758+
655759
- title: "SPICE: An Automated SWE-Bench Labeling Pipeline for Issue Clarity, Test Coverage, and Effort Estimation"
656760
authors: "Gustavo A. Oliva, Gopi Krishnan Rajbahadur, Aaditya Bhatia, Haoxiang Zhang, Yihao Chen, Zhilong Chen, Arthur Leung, Dayi Lin, Boyuan Chen, Ahmed E. Hassan"
657761
venue: "ASE 2025"
@@ -692,6 +796,14 @@
692796
github: "https://github.com/InternLM/SWE-Fixer"
693797
website: ""
694798

799+
- title: "SynFix: Dependency-Aware Program Repair via RelationGraph Analysis"
800+
authors: "Xunzhu Tang, Jiechao Gao, Jin Xu, Tiezhu Sun, Yewei Song, Saad Ezzini, Wendkûuni C. Ouédraogo, Jacques Klein, Tegawendé F. Bissyandé"
801+
venue: "ACL 2025 Findings"
802+
links:
803+
paper: "https://aclanthology.org/2025.findings-acl.252/"
804+
github: ""
805+
website: ""
806+
695807
- title: "UniDebugger: Hierarchical Multi-Agent Framework for Unified Software Debugging"
696808
authors: "Cheryl Lee, Chunqiu Steven Xia, Longji Yang, Jen-tse Huang, Zhouruixing Zhu, Lingming Zhang, Michael R. Lyu"
697809
venue: "EMNLP 2025"
@@ -732,6 +844,14 @@
732844
github: ""
733845
website: ""
734846

847+
- title: "Agentic Program Repair from Test Failures at Scale: A Neuro-symbolic approach with static analysis and test execution feedback"
848+
authors: "Chandra Maddila, Adam Tait, Claire Chang, Daniel Cheng, Nauman Ahmad, Vijayaraghavan Murali, Marshall Roch, Arnaud Avondet, Aaron Meltzer, Victor Montalvao, Michael Hopko, Chris Waterson, Parth Thakkar, Renuka Fernandez, Kristian Kristensen, Sivan Barzily, Sherry Chen, Rui Abreu, Nachiappan Nagappan, Payam Shodjai, Killian Murphy, James Everingham, Aparna Ramani, Peter C. Rigby"
849+
venue: "TSE 2025"
850+
links:
851+
paper: "https://arxiv.org/abs/2507.18755"
852+
github: ""
853+
website: ""
854+
735855
- title: "AutoCodeRover: Autonomous Program Improvement"
736856
authors: "Yuntong Zhang, Haifeng Ruan, Zhiyu Fan, Abhik Roychoudhury"
737857
venue: "ISSTA 2024"
@@ -772,18 +892,18 @@
772892
github: "https://github.com/SWE-bench/SWE-bench"
773893
website: "https://www.swebench.com/"
774894

775-
- title: "CodeR: Issue Resolving with Multi-Agent and Task Graphs"
776-
authors: "Dong Chen, Shaoxin Lin, Muhan Zeng, Daoguang Zan, Jian-Gang Wang, Anton Cheshkov, Jun Sun, Hao Yu, Guoliang Dong, Artem Aliev, Jie Wang, Xiao Cheng, Guangtai Liang, Yuchi Ma, Pan Bian, Tao Xie, Qianxiang Wang"
777-
venue: "arXiv 2024"
778-
links:
779-
paper: "https://arxiv.org/abs/2406.01304"
780-
github: "https://github.com/NL2Code/CodeR"
781-
website: ""
782-
783-
- title: "MarsCode Agent: AI-native Automated Bug Fixing"
784-
authors: "Yizhou Liu, Pengfei Gao, Xinchen Wang, Jie Liu, Yexuan Shi, Zhao Zhang, Chao Peng"
785-
venue: "arXiv 2025"
786-
links:
787-
paper: "https://arxiv.org/abs/2409.00899"
788-
github: ""
789-
website: "https://se-research.bytedance.com/"
895+
# - title: "CodeR: Issue Resolving with Multi-Agent and Task Graphs"
896+
# authors: "Dong Chen, Shaoxin Lin, Muhan Zeng, Daoguang Zan, Jian-Gang Wang, Anton Cheshkov, Jun Sun, Hao Yu, Guoliang Dong, Artem Aliev, Jie Wang, Xiao Cheng, Guangtai Liang, Yuchi Ma, Pan Bian, Tao Xie, Qianxiang Wang"
897+
# venue: "arXiv 2024"
898+
# links:
899+
# paper: "https://arxiv.org/abs/2406.01304"
900+
# github: "https://github.com/NL2Code/CodeR"
901+
# website: ""
902+
903+
# - title: "MarsCode Agent: AI-native Automated Bug Fixing"
904+
# authors: "Yizhou Liu, Pengfei Gao, Xinchen Wang, Jie Liu, Yexuan Shi, Zhao Zhang, Chao Peng"
905+
# venue: "arXiv 2025"
906+
# links:
907+
# paper: "https://arxiv.org/abs/2409.00899"
908+
# github: ""
909+
# website: "https://se-research.bytedance.com/"

data/papers_machine_learning_engineering.yaml

Lines changed: 8 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -6,6 +6,14 @@
66
github: ""
77
website: ""
88

9+
- title: "ArchPilot: A Proxy-Guided Multi-Agent Approach for Machine Learning Engineering"
10+
authors: "Zhuowen Yuan, Tao Liu, Yang Yang, Yang Wang, Feng Qi, Kaushik Rangadurai, Bo Li, Shuang Yang"
11+
venue: "arXiv 2025/11"
12+
links:
13+
paper: "https://arxiv.org/abs/2511.03985"
14+
github: ""
15+
website: ""
16+
917
- title: "DeepCode: Open Agentic Coding"
1018
authors: "Data Intelligence Lab@HKU"
1119
venue: "2025"

data/papers_sql_engineering.yaml

Lines changed: 8 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1,3 +1,11 @@
1+
# - title: "BAPPA: Benchmarking Agents, Plans, and Pipelines for Automated Text-to-SQL Generation"
2+
# authors: "Fahim Ahmed, Md Mubtasim Ahasan, Jahir Sadik Monon, Muntasir Wahed, M Ashraful Amin, A K M Mahbubur Rahman, Amin Ahsan Ali"
3+
# venue: "arXiv 2025/11"
4+
# links:
5+
# paper: "https://arxiv.org/abs/2511.04153"
6+
# github: ""
7+
# website: ""
8+
19
- title: "SWE-SQL: Illuminating LLM Pathways to Solve User SQL Issues in Real-World Applications"
210
authors: "Jinyang Li, Xiaolong Li, Ge Qu, Per Jacobsson, Bowen Qin, Binyuan Hui, Shuzheng Si, Nan Huo, Xiaohan Xu, Yue Zhang, Ziwei Tang, Yuanshuai Li, Florensia Widjaja, Xintong Zhu, Feige Zhou, Yongfeng Huang, Yannis Papakonstantinou, Fatma Ozcan, Chenhao Ma, Reynold Cheng"
311
venue: "arXiv 2025"

docs/static/badges/papers.json

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1 +1 @@
1-
{"schemaVersion": 1, "label": "Papers", "message": "481", "color": "brightgreen"}
1+
{"schemaVersion": 1, "label": "Papers", "message": "497", "color": "brightgreen"}

docs/static/badges/papers.svg

Lines changed: 3 additions & 3 deletions
Loading

0 commit comments

Comments
 (0)