Picture for Zirui Song

Zirui Song

SocialMaze: A Benchmark for Evaluating Social Reasoning in Large Language Models

Add code
May 29, 2025
Viaarxiv icon

Divide-Fuse-Conquer: Eliciting "Aha Moments" in Multi-Scenario Games

Add code
May 22, 2025
Viaarxiv icon

ManipLVM-R1: Reinforcement Learning for Reasoning in Embodied Manipulation with Large Vision-Language Models

Add code
May 22, 2025
Viaarxiv icon

Evaluate Bias without Manual Test Sets: A Concept Representation Perspective for LLMs

Add code
May 21, 2025
Viaarxiv icon

Audio Jailbreak: An Open Comprehensive Benchmark for Jailbreaking Large Audio-Language Models

Add code
May 21, 2025
Viaarxiv icon

Evaluating and Mitigating Bias in AI-Based Medical Text Generation

Add code
Apr 24, 2025
Viaarxiv icon

Motion Anything: Any to Motion Generation

Add code
Mar 10, 2025
Viaarxiv icon

Word Form Matters: LLMs' Semantic Reconstruction under Typoglycemia

Add code
Mar 03, 2025
Viaarxiv icon

PedDet: Adaptive Spectral Optimization for Multimodal Pedestrian Detection

Add code
Feb 21, 2025
Viaarxiv icon

Geolocation with Real Human Gameplay Data: A Large-Scale Dataset and Human-Like Reasoning Framework

Add code
Feb 19, 2025
Viaarxiv icon