Picture for Jing Bi

Jing Bi

ZeroSep: Separate Anything in Audio with Zero Training

Add code
May 29, 2025
Viaarxiv icon

MMPerspective: Do MLLMs Understand Perspective? A Comprehensive Benchmark for Perspective Perception, Reasoning, and Robustness

Add code
May 26, 2025
Viaarxiv icon

$I^2G$: Generating Instructional Illustrations via Text-Conditioned Diffusion

Add code
May 22, 2025
Viaarxiv icon

Attention to Detail: Fine-Scale Feature Preservation-Oriented Geometric Pre-training for AI-Driven Surrogate Modeling

Add code
Apr 27, 2025
Viaarxiv icon

Caption Anything in Video: Fine-grained Object-centric Captioning via Spatiotemporal Multimodal Prompting

Add code
Apr 09, 2025
Viaarxiv icon

Why Reasoning Matters? A Survey of Advancements in Multimodal Reasoning (v1)

Add code
Apr 04, 2025
Viaarxiv icon

VERIFY: A Benchmark of Visual Explanation and Reasoning for Investigating Multimodal Reasoning Fidelity

Add code
Mar 14, 2025
Viaarxiv icon

Generative AI for Cel-Animation: A Survey

Add code
Jan 08, 2025
Viaarxiv icon

Unveiling Visual Perception in Language Models: An Attention Head Analysis Approach

Add code
Dec 24, 2024
Viaarxiv icon

Enhancing the Reasoning Capabilities of Small Language Models via Solution Guidance Fine-Tuning

Add code
Dec 13, 2024
Figure 1 for Enhancing the Reasoning Capabilities of Small Language Models via Solution Guidance Fine-Tuning
Figure 2 for Enhancing the Reasoning Capabilities of Small Language Models via Solution Guidance Fine-Tuning
Figure 3 for Enhancing the Reasoning Capabilities of Small Language Models via Solution Guidance Fine-Tuning
Figure 4 for Enhancing the Reasoning Capabilities of Small Language Models via Solution Guidance Fine-Tuning
Viaarxiv icon