Anxiang Zeng

SEA-Vision: A Multilingual Benchmark for Comprehensive Document and Scene Text Understanding in Southeast Asia

Mar 16, 2026

CCCaption: Dual-Reward Reinforcement Learning for Complete and Correct Image Captioning

Feb 25, 2026

ManCAR: Manifold-Constrained Latent Reasoning with Adaptive Test-Time Computation for Sequential Recommendation

Feb 23, 2026

Towards On-Policy SFT: Distribution Discriminant Theory and its Applications in LLM Training

Feb 12, 2026

Rethinking Generative Recommender Tokenizer: Recsys-Native Encoding and Semantic Quantization Beyond LLMs

Feb 02, 2026

Orchestrating Tokens and Sequences: Dynamic Hybrid Policy Optimization for RLVR

Jan 09, 2026

Reveal Hidden Pitfalls and Navigate Next Generation of Vector Similarity Search from Task-Centric Views

Dec 15, 2025

Each Prompt Matters: Scaling Reinforcement Learning Without Wasting Rollouts on Hundred-Billion-Scale MoE

Dec 08, 2025

Towards Reliable Evaluation of Large Language Models for Multilingual and Multimodal E-Commerce Applications

Oct 23, 2025

Compass-Thinker-7B Technical Report

Aug 12, 2025