Picture for Kai Han

Kai Han

and Other Contributors

OmniEval: A Benchmark for Evaluating Omni-modal Models with Visual, Auditory, and Textual Inputs

Add code
Jun 26, 2025
Viaarxiv icon

EAQuant: Enhancing Post-Training Quantization for MoE Models via Expert-Aware Optimization

Add code
Jun 16, 2025
Viaarxiv icon

Generalized Category Discovery under the Long-Tailed Distribution

Add code
Jun 14, 2025
Viaarxiv icon

Pangu Embedded: An Efficient Dual-system LLM Reasoner with Metacognition

Add code
May 29, 2025
Viaarxiv icon

Pangu Pro MoE: Mixture of Grouped Experts for Efficient Sparsity

Add code
May 28, 2025
Viaarxiv icon

Semantic Correspondence: Unified Benchmarking and a Strong Baseline

Add code
May 26, 2025
Viaarxiv icon

Panoptic Captioning: Seeking An Equivalency Bridge for Image and Text

Add code
May 22, 2025
Viaarxiv icon

Circle-RoPE: Cone-like Decoupled Rotary Positional Embedding for Large Vision-Language Models

Add code
May 22, 2025
Viaarxiv icon

A Microgravity Simulation Experimental Platform For Small Space Robots In Orbit

Add code
Apr 26, 2025
Viaarxiv icon

A Physics-guided Multimodal Transformer Path to Weather and Climate Sciences

Add code
Apr 19, 2025
Viaarxiv icon