Xingjun Ma

ViSRA: A Video-based Spatial Reasoning Agent for Multi-modal Large Language Models

May 11, 2026
Via arXiv

From Order to Distribution: A Spectral Characterization of Forgetting in Continual Learning

Apr 15, 2026
Via arXiv

HazardArena: Evaluating Semantic Safety in Vision-Language-Action Models

Apr 14, 2026
Via arXiv

Beyond Surface Judgments: Human-Grounded Risk Evaluation of LLM-Generated Disinformation

Apr 08, 2026
Via arXiv

Steering the Verifiability of Multimodal AI Hallucinations

Apr 08, 2026
Via arXiv

AgentHazard: A Benchmark for Evaluating Harmful Behavior in Computer-Use Agents

Apr 03, 2026
Via arXiv

PixelSmile: Toward Fine-Grained Facial Expression Editing

Mar 26, 2026
Via arXiv

Attention in Space: Functional Roles of VLM Heads for Spatial Reasoning

Mar 21, 2026
Via arXiv

OOD-MMSafe: Advancing MLLM Safety from Harmful Intent to Hidden Consequences

Mar 10, 2026
Via arXiv

OmniLottie: Generating Vector Animations via Parameterized Lottie Tokens

Mar 02, 2026
Via arXiv