Xuanjing Huang

Code2Logic: Game-Code-Driven Data Synthesis for Enhancing VLMs General Reasoning

May 20, 2025
Via arXiv

WorldPM: Scaling Human Preference Modeling

May 15, 2025
Via arXiv

A Multi-Dimensional Constraint Framework for Evaluating and Improving Instruction Following in Large Language Models

May 12, 2025
Via arXiv

EcoLANG: Efficient and Effective Agent Communication Language Induction for Social Simulation

May 11, 2025
Via arXiv

Effective Length Extrapolation via Dimension-Wise Positional Embeddings Manipulation

Apr 26, 2025
Via arXiv

Toward Generalizable Evaluation in the LLM Era: A Survey Beyond Benchmarks

Apr 26, 2025
Via arXiv

Improving RL Exploration for LLM Reasoning through Retrospective Replay

Apr 19, 2025
Via arXiv

Revisiting LLM Evaluation through Mechanism Interpretability: a New Metric and Model Utility Law

Apr 10, 2025
Via arXiv

FamilyTool: A Multi-hop Personalized Tool Use Benchmark

Apr 09, 2025
Via arXiv

Mitigating Object Hallucinations in MLLMs via Multi-Frequency Perturbations

Mar 19, 2025
Via arXiv