Picture for Youngjoon Jang

Youngjoon Jang

On the Nature of Attention Sink that Shapes Decoding Strategy in MLLMs

Add code
Mar 15, 2026
Viaarxiv icon

FastAV: Efficient Token Pruning for Audio-Visual Large Language Model Inference

Add code
Jan 19, 2026
Viaarxiv icon

LP-CFM: Perceptual Invariance-Aware Conditional Flow Matching for Speech Modeling

Add code
Dec 23, 2025
Figure 1 for LP-CFM: Perceptual Invariance-Aware Conditional Flow Matching for Speech Modeling
Figure 2 for LP-CFM: Perceptual Invariance-Aware Conditional Flow Matching for Speech Modeling
Figure 3 for LP-CFM: Perceptual Invariance-Aware Conditional Flow Matching for Speech Modeling
Figure 4 for LP-CFM: Perceptual Invariance-Aware Conditional Flow Matching for Speech Modeling
Viaarxiv icon

Segment, Embed, and Align: A Universal Recipe for Aligning Subtitles to Signing

Add code
Dec 08, 2025
Viaarxiv icon

Lost in Translation, Found in Embeddings: Sign Language Translation and Alignment

Add code
Dec 08, 2025
Figure 1 for Lost in Translation, Found in Embeddings: Sign Language Translation and Alignment
Figure 2 for Lost in Translation, Found in Embeddings: Sign Language Translation and Alignment
Figure 3 for Lost in Translation, Found in Embeddings: Sign Language Translation and Alignment
Figure 4 for Lost in Translation, Found in Embeddings: Sign Language Translation and Alignment
Viaarxiv icon

From Ambiguity to Accuracy: The Transformative Effect of Coreference Resolution on Retrieval-Augmented Generation systems

Add code
Jul 10, 2025
Viaarxiv icon

Fork-Merge Decoding: Enhancing Multimodal Understanding in Audio-Visual Large Language Models

Add code
May 27, 2025
Viaarxiv icon

AVCD: Mitigating Hallucinations in Audio-Visual Large Language Models through Contrastive Decoding

Add code
May 27, 2025
Viaarxiv icon

Test-Time Augmentation for Pose-invariant Face Recognition

Add code
May 14, 2025
Viaarxiv icon

Deep Understanding of Sign Language for Sign to Subtitle Alignment

Add code
Mar 05, 2025
Viaarxiv icon