Picture for Yue Wang

Yue Wang

Zhongguancun Academy

Sentient Agent as a Judge: Evaluating Higher-Order Social Cognition in Large Language Models

Add code
May 01, 2025
Figure 1 for Sentient Agent as a Judge: Evaluating Higher-Order Social Cognition in Large Language Models
Figure 2 for Sentient Agent as a Judge: Evaluating Higher-Order Social Cognition in Large Language Models
Figure 3 for Sentient Agent as a Judge: Evaluating Higher-Order Social Cognition in Large Language Models
Figure 4 for Sentient Agent as a Judge: Evaluating Higher-Order Social Cognition in Large Language Models
Viaarxiv icon

MINT: Multi-Vector Search Index Tuning

Add code
Apr 28, 2025
Viaarxiv icon

GenCLS++: Pushing the Boundaries of Generative Classification in LLMs Through Comprehensive SFT and RL Studies Across Diverse Datasets

Add code
Apr 28, 2025
Viaarxiv icon

RoboVerse: Towards a Unified Platform, Dataset and Benchmark for Scalable and Generalizable Robot Learning

Add code
Apr 26, 2025
Figure 1 for RoboVerse: Towards a Unified Platform, Dataset and Benchmark for Scalable and Generalizable Robot Learning
Figure 2 for RoboVerse: Towards a Unified Platform, Dataset and Benchmark for Scalable and Generalizable Robot Learning
Figure 3 for RoboVerse: Towards a Unified Platform, Dataset and Benchmark for Scalable and Generalizable Robot Learning
Figure 4 for RoboVerse: Towards a Unified Platform, Dataset and Benchmark for Scalable and Generalizable Robot Learning
Viaarxiv icon

Symbolic Representation for Any-to-Any Generative Tasks

Add code
Apr 24, 2025
Figure 1 for Symbolic Representation for Any-to-Any Generative Tasks
Figure 2 for Symbolic Representation for Any-to-Any Generative Tasks
Figure 3 for Symbolic Representation for Any-to-Any Generative Tasks
Figure 4 for Symbolic Representation for Any-to-Any Generative Tasks
Viaarxiv icon

DeepMath-103K: A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning

Add code
Apr 15, 2025
Figure 1 for DeepMath-103K: A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning
Figure 2 for DeepMath-103K: A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning
Figure 3 for DeepMath-103K: A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning
Figure 4 for DeepMath-103K: A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning
Viaarxiv icon

Redefining Machine Translation on Social Network Services with Large Language Models

Add code
Apr 10, 2025
Viaarxiv icon

WaveHiTS: Wavelet-Enhanced Hierarchical Time Series Modeling for Wind Direction Nowcasting in Eastern Inner Mongolia

Add code
Apr 09, 2025
Figure 1 for WaveHiTS: Wavelet-Enhanced Hierarchical Time Series Modeling for Wind Direction Nowcasting in Eastern Inner Mongolia
Figure 2 for WaveHiTS: Wavelet-Enhanced Hierarchical Time Series Modeling for Wind Direction Nowcasting in Eastern Inner Mongolia
Figure 3 for WaveHiTS: Wavelet-Enhanced Hierarchical Time Series Modeling for Wind Direction Nowcasting in Eastern Inner Mongolia
Figure 4 for WaveHiTS: Wavelet-Enhanced Hierarchical Time Series Modeling for Wind Direction Nowcasting in Eastern Inner Mongolia
Viaarxiv icon

Domain-Conditioned Scene Graphs for State-Grounded Task Planning

Add code
Apr 09, 2025
Viaarxiv icon

Grounding 3D Object Affordance with Language Instructions, Visual Observations and Interactions

Add code
Apr 07, 2025
Figure 1 for Grounding 3D Object Affordance with Language Instructions, Visual Observations and Interactions
Figure 2 for Grounding 3D Object Affordance with Language Instructions, Visual Observations and Interactions
Figure 3 for Grounding 3D Object Affordance with Language Instructions, Visual Observations and Interactions
Figure 4 for Grounding 3D Object Affordance with Language Instructions, Visual Observations and Interactions
Viaarxiv icon