Picture for Chenxi Wang

Chenxi Wang

HeartBench: Probing Core Dimensions of Anthropomorphic Intelligence in LLMs

Add code
Dec 26, 2025
Viaarxiv icon

When Personalization Tricks Detectors: The Feature-Inversion Trap in Machine-Generated Text Detection

Add code
Oct 14, 2025
Viaarxiv icon

Scaling Agents via Continual Pre-training

Add code
Sep 16, 2025
Viaarxiv icon

Mutual-Supervised Learning for Sequential-to-Parallel Code Translation

Add code
Jun 11, 2025
Figure 1 for Mutual-Supervised Learning for Sequential-to-Parallel Code Translation
Figure 2 for Mutual-Supervised Learning for Sequential-to-Parallel Code Translation
Figure 3 for Mutual-Supervised Learning for Sequential-to-Parallel Code Translation
Figure 4 for Mutual-Supervised Learning for Sequential-to-Parallel Code Translation
Viaarxiv icon

AutoGPS: Automated Geometry Problem Solving via Multimodal Formalization and Deductive Reasoning

Add code
May 29, 2025
Viaarxiv icon

SocialMaze: A Benchmark for Evaluating Social Reasoning in Large Language Models

Add code
May 29, 2025
Figure 1 for SocialMaze: A Benchmark for Evaluating Social Reasoning in Large Language Models
Figure 2 for SocialMaze: A Benchmark for Evaluating Social Reasoning in Large Language Models
Figure 3 for SocialMaze: A Benchmark for Evaluating Social Reasoning in Large Language Models
Figure 4 for SocialMaze: A Benchmark for Evaluating Social Reasoning in Large Language Models
Viaarxiv icon

ManipLVM-R1: Reinforcement Learning for Reasoning in Embodied Manipulation with Large Vision-Language Models

Add code
May 22, 2025
Viaarxiv icon

Evaluate Bias without Manual Test Sets: A Concept Representation Perspective for LLMs

Add code
May 21, 2025
Figure 1 for Evaluate Bias without Manual Test Sets: A Concept Representation Perspective for LLMs
Figure 2 for Evaluate Bias without Manual Test Sets: A Concept Representation Perspective for LLMs
Figure 3 for Evaluate Bias without Manual Test Sets: A Concept Representation Perspective for LLMs
Figure 4 for Evaluate Bias without Manual Test Sets: A Concept Representation Perspective for LLMs
Viaarxiv icon

Audio Jailbreak: An Open Comprehensive Benchmark for Jailbreaking Large Audio-Language Models

Add code
May 21, 2025
Figure 1 for Audio Jailbreak: An Open Comprehensive Benchmark for Jailbreaking Large Audio-Language Models
Figure 2 for Audio Jailbreak: An Open Comprehensive Benchmark for Jailbreaking Large Audio-Language Models
Figure 3 for Audio Jailbreak: An Open Comprehensive Benchmark for Jailbreaking Large Audio-Language Models
Figure 4 for Audio Jailbreak: An Open Comprehensive Benchmark for Jailbreaking Large Audio-Language Models
Viaarxiv icon

Enhancing Satellite Object Localization with Dilated Convolutions and Attention-aided Spatial Pooling

Add code
May 08, 2025
Figure 1 for Enhancing Satellite Object Localization with Dilated Convolutions and Attention-aided Spatial Pooling
Figure 2 for Enhancing Satellite Object Localization with Dilated Convolutions and Attention-aided Spatial Pooling
Figure 3 for Enhancing Satellite Object Localization with Dilated Convolutions and Attention-aided Spatial Pooling
Figure 4 for Enhancing Satellite Object Localization with Dilated Convolutions and Attention-aided Spatial Pooling
Viaarxiv icon