Picture for Lei Wang

Lei Wang

Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences

Prototype-Guided Curriculum Learning for Zero-Shot Learning

Add code
Aug 11, 2025
Viaarxiv icon

HSA-Net: Hierarchical and Structure-Aware Framework for Efficient and Scalable Molecular Language Modeling

Add code
Aug 10, 2025
Viaarxiv icon

HunyuanWorld 1.0: Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels

Add code
Jul 29, 2025
Figure 1 for HunyuanWorld 1.0: Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels
Figure 2 for HunyuanWorld 1.0: Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels
Figure 3 for HunyuanWorld 1.0: Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels
Figure 4 for HunyuanWorld 1.0: Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels
Viaarxiv icon

BOASF: A Unified Framework for Speeding up Automatic Machine Learning via Adaptive Successive Filtering

Add code
Jul 28, 2025
Viaarxiv icon

Representation Entanglement for Generation:Training Diffusion Transformers Is Much Easier Than You Think

Add code
Jul 02, 2025
Figure 1 for Representation Entanglement for Generation:Training Diffusion Transformers Is Much Easier Than You Think
Figure 2 for Representation Entanglement for Generation:Training Diffusion Transformers Is Much Easier Than You Think
Figure 3 for Representation Entanglement for Generation:Training Diffusion Transformers Is Much Easier Than You Think
Figure 4 for Representation Entanglement for Generation:Training Diffusion Transformers Is Much Easier Than You Think
Viaarxiv icon

ChartReasoner: Code-Driven Modality Bridging for Long-Chain Reasoning in Chart Question Answering

Add code
Jun 11, 2025
Viaarxiv icon

SeerAttention-R: Sparse Attention Adaptation for Long Reasoning

Add code
Jun 10, 2025
Figure 1 for SeerAttention-R: Sparse Attention Adaptation for Long Reasoning
Figure 2 for SeerAttention-R: Sparse Attention Adaptation for Long Reasoning
Figure 3 for SeerAttention-R: Sparse Attention Adaptation for Long Reasoning
Figure 4 for SeerAttention-R: Sparse Attention Adaptation for Long Reasoning
Viaarxiv icon

Truth in the Few: High-Value Data Selection for Efficient Multi-Modal Reasoning

Add code
Jun 05, 2025
Viaarxiv icon

TableEval: A Real-World Benchmark for Complex, Multilingual, and Multi-Structured Table Question Answering

Add code
Jun 04, 2025
Viaarxiv icon

ChartMind: A Comprehensive Benchmark for Complex Real-world Multimodal Chart Question Answering

Add code
May 29, 2025
Viaarxiv icon