Picture for Qiang Liu

Qiang Liu

Linda

Policy Gradient Primal-Dual Method for Safe Reinforcement Learning from Human Feedback

Add code
Apr 21, 2026
Viaarxiv icon

MDPBench: A Benchmark for Multilingual Document Parsing in Real-World Scenarios

Add code
Mar 30, 2026
Viaarxiv icon

RealChart2Code: Advancing Chart-to-Code Generation with Real Data and Multi-Task Evaluation

Add code
Mar 26, 2026
Viaarxiv icon

Gumbel Distillation for Parallel Text Generation

Add code
Mar 23, 2026
Viaarxiv icon

MultiBind: A Benchmark for Attribute Misbinding in Multi-Subject Generation

Add code
Mar 23, 2026
Viaarxiv icon

AIGQ: An End-to-End Hybrid Generative Architecture for E-commerce Query Recommendation

Add code
Mar 20, 2026
Viaarxiv icon

SCAN: Sparse Circuit Anchor Interpretable Neuron for Lifelong Knowledge Editing

Add code
Mar 16, 2026
Viaarxiv icon

$ abla$-Reasoner: LLM Reasoning via Test-Time Gradient Descent in Latent Space

Add code
Mar 05, 2026
Viaarxiv icon

Multimodal Adaptive Retrieval Augmented Generation through Internal Representation Learning

Add code
Feb 28, 2026
Viaarxiv icon

CLFEC: A New Task for Unified Linguistic and Factual Error Correction in paragraph-level Chinese Professional Writing

Add code
Feb 27, 2026
Viaarxiv icon