Tianyi Zhou

Don't Think Longer, Think Wisely: Optimizing Thinking Dynamics for Large Reasoning Models

May 27, 2025
Via arXiv

BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset

May 14, 2025
Via arXiv

Federated Adapter on Foundation Models: An Out-Of-Distribution Approach

May 02, 2025
Via arXiv

VideoHallu: Evaluating and Mitigating Multi-modal Hallucinations for Synthetic Videos

May 02, 2025
Via arXiv

Skill Discovery for Software Scripting Automation via Offline Simulations with LLMs

Apr 29, 2025
Via arXiv

WALL-E 2.0: World Alignment by NeuroSymbolic Learning improves World Model-based LLM Agents

Apr 22, 2025
Via arXiv

Exploring Expert Failures Improves LLM Agent Tuning

Apr 18, 2025
Via arXiv

GraphicBench: A Planning Benchmark for Graphic Design with Language Agents

Apr 15, 2025
Via arXiv

How Instruction and Reasoning Data shape Post-Training: Data Quality through the Lens of Layer-wise Gradients

Apr 14, 2025
Via arXiv

ColorBench: Can VLMs See and Understand the Colorful World? A Comprehensive Benchmark for Color Perception, Reasoning, and Robustness

Apr 10, 2025
Via arXiv