Bin Cui

SAS-Bench: A Fine-Grained Benchmark for Evaluating Short Answer Scoring with Large Language Models

May 15, 2025
Via arXiv

Galvatron: An Automatic Distributed System for Efficient Foundation Model Training

Apr 30, 2025
Via arXiv

FUSION: Fully Integration of Vision-Language Representations for Deep Cross-Modal Understanding

Apr 14, 2025
Via arXiv

Training-free Diffusion Acceleration with Bottleneck Sampling

Mar 27, 2025
Via arXiv

TinyR1-32B-Preview: Boosting Accuracy with Branch-Merge Distillation

Mar 06, 2025
Via arXiv

ByteScale: Efficient Scaling of LLM Training with a 2048K Context Length on More Than 12,000 GPUs

Feb 28, 2025
Via arXiv

Training-free and Adaptive Sparse Attention for Efficient Long Video Generation

Feb 28, 2025
Via arXiv

MathClean: A Benchmark for Synthetic Mathematical Data Cleaning

Feb 26, 2025
Via arXiv

Diffusion-Sharpening: Fine-tuning Diffusion Models with Denoising Trajectory Sharpening

Feb 17, 2025
Via arXiv

HermesFlow: Seamlessly Closing the Gap in Multimodal Understanding and Generation

Feb 17, 2025
Via arXiv