Hanchi Sun

MegaTrain: Full Precision Training of 100B+ Parameter Large Language Models on a Single GPU

Apr 06, 2026
Via arXiv

Expert Threshold Routing for Autoregressive Language Modeling with Dynamic Computation Allocation and Load Balancing

Mar 12, 2026
Via arXiv

Thinking Before Looking: Improving Multimodal LLM Reasoning via Mitigating Visual Hallucination

Nov 15, 2024
Via arXiv

Compression-Realized Deep Structural Network for Video Quality Enhancement

May 10, 2024
Via arXiv

Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models

Feb 28, 2024
Via arXiv