Tao Zhang

Grasp Any Region: Towards Precise, Contextual Pixel Understanding for Multimodal LLMs

Oct 22, 2025
Via arXiv

Every Step Evolves: Scaling Reinforcement Learning for Trillion-Scale Thinking Model

Oct 21, 2025
Via arXiv

Native Hybrid Attention for Efficient Sequence Modeling

Oct 08, 2025
Via arXiv

HTMformer: Hybrid Time and Multivariate Transformer for Time Series Forecasting

Oct 08, 2025
Via arXiv

AutoCodeBench: Large Language Models are Automatic Code Benchmark Generators

Aug 12, 2025
Via arXiv

A Novel Modeling Framework and Data Product for Extended VIIRS-like Artificial Nighttime Light Image Reconstruction (1986-2024)

Aug 01, 2025
Via arXiv

$S^3$LAM: Surfel Splatting SLAM for Geometrically Accurate Tracking and Mapping

Jul 28, 2025
Via arXiv

Traceable Evidence Enhanced Visual Grounded Reasoning: Evaluation and Methodology

Jul 10, 2025
Via arXiv

MCAM: Multimodal Causal Analysis Model for Ego-Vehicle-Level Driving Video Understanding

Jul 08, 2025
Via arXiv

Jump-Start Reinforcement Learning with Self-Evolving Priors for Extreme Monopedal Locomotion

Jul 01, 2025
Via arXiv