Jaeyoung Do

Seoul National University, Korea

VIRST: Video-Instructed Reasoning Assistant for SpatioTemporal Segmentation

Mar 28, 2026

MEDIC-AD: Towards Medical Vision-Language Model's Clinical Intelligence

Mar 28, 2026

3rd Place of MeViS-Audio Track of the 5th PVUW: VIRST-Audio

Mar 24, 2026

RFEval: Benchmarking Reasoning Faithfulness under Counterfactual Reasoning Intervention in Large Reasoning Models

Feb 19, 2026

MATA: Multi-Agent Framework for Reliable and Flexible Table Question Answering

Feb 10, 2026

VALUEFLOW: Toward Pluralistic and Steerable Value-based Alignment in Large Language Models

Feb 03, 2026

MatKV: Trading Compute for Flash Storage in LLM Inference

Dec 20, 2025

Don't Let It Fade: Preserving Edits in Diffusion Language Models via Token Timestep Allocation

Oct 30, 2025

SECOND: Mitigating Perceptual Hallucination in Vision-Language Models via Selective and Contrastive Decoding

Jun 10, 2025

MathReader: Text-to-Speech for Mathematical Documents

Jan 13, 2025