Xiaonan Nie

Context Unrolling in Omni Models

Apr 23, 2026
Via arXiv

Seedance 2.0: Advancing Video Generation for World Complexity

Apr 15, 2026
Via arXiv

UniGRPO: Unified Policy Optimization for Reasoning-Driven Visual Generation

Mar 24, 2026
Via arXiv

Seedance 1.5 pro: A Native Audio-Visual Joint Generation Foundation Model

Dec 23, 2025
Via arXiv

Seedance 1.0: Exploring the Boundaries of Video Generation Models

Jun 10, 2025
Via arXiv

Emerging Properties in Unified Multimodal Pretraining

May 20, 2025
Via arXiv

ByteScale: Efficient Scaling of LLM Training with a 2048K Context Length on More Than 12,000 GPUs

Feb 28, 2025
Via arXiv

DataSculpt: Crafting Data Landscapes for LLM Post-Training through Multi-objective Partitioning

Sep 02, 2024
Via arXiv

BaichuanSEED: Sharing the Potential of ExtensivE Data Collection and Deduplication by Introducing a Competitive Large Language Model Baseline

Aug 27, 2024
Via arXiv

Efficiently Training 7B LLM with 1 Million Sequence Length on 8 GPUs

Jul 16, 2024
Via arXiv