Yangyi Chen

SyncMind: Measuring Agent Out-of-Sync Recovery in Collaborative Software Engineering

Feb 10, 2025
Via arXiv

OpenOmni: Large Language Models Pivot Zero-shot Omnimodal Alignment across Language with Real-time Self-Aware Emotional Speech Synthesis

Jan 08, 2025
Via arXiv

Scaling Laws for Predicting Downstream Performance in LLMs

Oct 11, 2024
Via arXiv

A Single Transformer for Scalable Vision-Language Modeling

Jul 08, 2024
Via arXiv

SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales

May 31, 2024
Via arXiv

Executable Code Actions Elicit Better LLM Agents

Feb 01, 2024
Via arXiv

ViStruct: Visual Structural Knowledge Extraction via Curriculum Guided Code-Vision Representation

Nov 22, 2023
Via arXiv

Prudent Silence or Foolish Babble? Examining Large Language Models' Responses to the Unknown

Nov 16, 2023
Via arXiv

R-Tuning: Teaching Large Language Models to Refuse Unknown Questions

Nov 16, 2023
Via arXiv

DRESS: Instructing Large Vision-Language Models to Align and Interact with Humans via Natural Language Feedback

Nov 16, 2023
Via arXiv