Picture for Bin Wang

Bin Wang

and Other Contributors

IFEval-Audio: Benchmarking Instruction-Following Capability in Audio-based Large Language Models

Add code
May 22, 2025
Viaarxiv icon

Towards Spoken Mathematical Reasoning: Benchmarking Speech-based Models over Multi-faceted Math Problems

Add code
May 21, 2025
Viaarxiv icon

Enhance Mobile Agents Thinking Process Via Iterative Preference Learning

Add code
May 18, 2025
Figure 1 for Enhance Mobile Agents Thinking Process Via Iterative Preference Learning
Figure 2 for Enhance Mobile Agents Thinking Process Via Iterative Preference Learning
Figure 3 for Enhance Mobile Agents Thinking Process Via Iterative Preference Learning
Figure 4 for Enhance Mobile Agents Thinking Process Via Iterative Preference Learning
Viaarxiv icon

Accelerating Diffusion-based Super-Resolution with Dynamic Time-Spatial Sampling

Add code
May 17, 2025
Figure 1 for Accelerating Diffusion-based Super-Resolution with Dynamic Time-Spatial Sampling
Figure 2 for Accelerating Diffusion-based Super-Resolution with Dynamic Time-Spatial Sampling
Figure 3 for Accelerating Diffusion-based Super-Resolution with Dynamic Time-Spatial Sampling
Figure 4 for Accelerating Diffusion-based Super-Resolution with Dynamic Time-Spatial Sampling
Viaarxiv icon

Mobile-Bench-v2: A More Realistic and Comprehensive Benchmark for VLM-based Mobile Agents

Add code
May 17, 2025
Figure 1 for Mobile-Bench-v2: A More Realistic and Comprehensive Benchmark for VLM-based Mobile Agents
Figure 2 for Mobile-Bench-v2: A More Realistic and Comprehensive Benchmark for VLM-based Mobile Agents
Figure 3 for Mobile-Bench-v2: A More Realistic and Comprehensive Benchmark for VLM-based Mobile Agents
Figure 4 for Mobile-Bench-v2: A More Realistic and Comprehensive Benchmark for VLM-based Mobile Agents
Viaarxiv icon

FG-CLIP: Fine-Grained Visual and Textual Alignment

Add code
May 08, 2025
Viaarxiv icon

Point2Primitive: CAD Reconstruction from Point Cloud by Direct Primitive Prediction

Add code
May 04, 2025
Figure 1 for Point2Primitive: CAD Reconstruction from Point Cloud by Direct Primitive Prediction
Figure 2 for Point2Primitive: CAD Reconstruction from Point Cloud by Direct Primitive Prediction
Figure 3 for Point2Primitive: CAD Reconstruction from Point Cloud by Direct Primitive Prediction
Figure 4 for Point2Primitive: CAD Reconstruction from Point Cloud by Direct Primitive Prediction
Viaarxiv icon

100 Days After DeepSeek-R1: A Survey on Replication Studies and More Directions for Reasoning Language Models

Add code
May 01, 2025
Viaarxiv icon

Shifts in Doctors' Eye Movements Between Real and AI-Generated Medical Images

Add code
Apr 21, 2025
Viaarxiv icon

Pangu Ultra: Pushing the Limits of Dense Large Language Models on Ascend NPUs

Add code
Apr 10, 2025
Figure 1 for Pangu Ultra: Pushing the Limits of Dense Large Language Models on Ascend NPUs
Figure 2 for Pangu Ultra: Pushing the Limits of Dense Large Language Models on Ascend NPUs
Figure 3 for Pangu Ultra: Pushing the Limits of Dense Large Language Models on Ascend NPUs
Figure 4 for Pangu Ultra: Pushing the Limits of Dense Large Language Models on Ascend NPUs
Viaarxiv icon