Dacheng Tao

and Other Contributors

Improving large language models with concept-aware fine-tuning

Jun 09, 2025
Via arXiv

GeometryZero: Improving Geometry Solving for LLM with Group Contrastive Policy Optimization

Jun 08, 2025
Via arXiv

SRD: Reinforcement-Learned Semantic Perturbation for Backdoor Defense in VLMs

Jun 05, 2025
Via arXiv

Mastering Massive Multi-Task Reinforcement Learning via Mixture-of-Expert Decision Transformer

May 30, 2025
Via arXiv

Revisiting Overthinking in Long Chain-of-Thought from the Perspective of Self-Doubt

May 29, 2025
Via arXiv

Pangu Embedded: An Efficient Dual-system LLM Reasoner with Metacognition

May 29, 2025
Via arXiv

Resolving Knowledge Conflicts in Domain-specific Data Selection: A Case Study on Medical Instruction-tuning

May 28, 2025
Via arXiv

On Geometry-Enhanced Parameter-Efficient Fine-Tuning for 3D Scene Segmentation

May 28, 2025
Via arXiv

GoMatching++: Parameter- and Data-Efficient Arbitrary-Shaped Video Text Spotting and Benchmarking

May 28, 2025
Via arXiv

Pangu Pro MoE: Mixture of Grouped Experts for Efficient Sparsity

May 28, 2025
Via arXiv