Chenglei Si

The Ideation-Execution Gap: Execution Outcomes of LLM-Generated versus Human Research Ideas

Jun 25, 2025

Contextual Experience Replay for Self-Improvement of Language Agents

Jun 07, 2025

Can LLMs Generate Novel Research Ideas? A Large-Scale Human Study with 100+ NLP Researchers

Sep 06, 2024

Configurable Foundation Models: Building LLMs from a Modular Perspective

Sep 04, 2024

Towards Bidirectional Human-AI Alignment: A Systematic Review for Clarifications, Framework, and Future Directions

Jun 17, 2024

The Prompt Report: A Systematic Survey of Prompting Techniques

Jun 06, 2024

Best Practices and Lessons Learned on Synthetic Data for Language Models

Apr 11, 2024

Design2Code: How Far Are We From Automating Front-End Engineering?

Mar 05, 2024

Large Language Models Help Humans Verify Truthfulness -- Except When They Are Convincingly Wrong

Oct 19, 2023

Mixture of Prompt Experts for Generalizable and Interpretable Question Answering

May 24, 2023