Chung-Ching Lin

ProImage-Bench: Rubric-Based Evaluation for Professional Image Generation

Dec 13, 2025

SHANKS: Simultaneous Hearing and Thinking for Spoken Language Models

Oct 08, 2025

ViCrit: A Verifiable Reinforcement Learning Proxy Task for Visual Perception in VLMs

Jun 11, 2025

Audio-Aware Large Language Models as Judges for Speaking Styles

Jun 06, 2025

Point-RFT: Improving Multimodal Reasoning with Visually Grounded Reinforcement Finetuning

May 26, 2025

SoTA with Less: MCTS-Guided Sample Selection for Data-Efficient Visual Reasoning Self-Improvement

Apr 10, 2025

Measurement of LLM's Philosophies of Human Nature

Apr 03, 2025

Zero-Shot Audio-Visual Editing via Cross-Modal Delta Denoising

Mar 26, 2025

GenXD: Generating Any 3D and 4D Scenes

Nov 05, 2024

SlowFast-VGen: Slow-Fast Learning for Action-Driven Long Video Generation

Oct 30, 2024