Picture for Yiling Huang

Yiling Huang

Not Search, But Scan: Benchmarking MLLMs on Scan-Oriented Academic Paper Reasoning

Add code
Mar 27, 2026
Viaarxiv icon

THEMIS: Towards Holistic Evaluation of MLLMs for Scientific Paper Fraud Forensics

Add code
Mar 26, 2026
Viaarxiv icon

FinMMDocR: Benchmarking Financial Multimodal Reasoning with Scenario Awareness, Document Understanding, and Multi-Step Computation

Add code
Dec 31, 2025
Viaarxiv icon

FinMMR: Make Financial Numerical Reasoning More Multimodal, Comprehensive, and Challenging

Add code
Aug 06, 2025
Viaarxiv icon

PNeSM: Arbitrary 3D Scene Stylization via Prompt-Based Neural Style Mapping

Add code
Mar 13, 2024
Viaarxiv icon

DiarizationLM: Speaker Diarization Post-Processing with Large Language Models

Add code
Jan 16, 2024
Figure 1 for DiarizationLM: Speaker Diarization Post-Processing with Large Language Models
Figure 2 for DiarizationLM: Speaker Diarization Post-Processing with Large Language Models
Figure 3 for DiarizationLM: Speaker Diarization Post-Processing with Large Language Models
Figure 4 for DiarizationLM: Speaker Diarization Post-Processing with Large Language Models
Viaarxiv icon

ArtBank: Artistic Style Transfer with Pre-trained Diffusion Model and Implicit Style Prompt Bank

Add code
Dec 11, 2023
Figure 1 for ArtBank: Artistic Style Transfer with Pre-trained Diffusion Model and Implicit Style Prompt Bank
Figure 2 for ArtBank: Artistic Style Transfer with Pre-trained Diffusion Model and Implicit Style Prompt Bank
Figure 3 for ArtBank: Artistic Style Transfer with Pre-trained Diffusion Model and Implicit Style Prompt Bank
Figure 4 for ArtBank: Artistic Style Transfer with Pre-trained Diffusion Model and Implicit Style Prompt Bank
Viaarxiv icon

Towards Word-Level End-to-End Neural Speaker Diarization with Auxiliary Network

Add code
Sep 15, 2023
Figure 1 for Towards Word-Level End-to-End Neural Speaker Diarization with Auxiliary Network
Figure 2 for Towards Word-Level End-to-End Neural Speaker Diarization with Auxiliary Network
Figure 3 for Towards Word-Level End-to-End Neural Speaker Diarization with Auxiliary Network
Figure 4 for Towards Word-Level End-to-End Neural Speaker Diarization with Auxiliary Network
Viaarxiv icon

USM-SCD: Multilingual Speaker Change Detection Based on Large Pretrained Foundation Models

Add code
Sep 14, 2023
Viaarxiv icon

Selective inference using randomized group lasso estimators for general models

Add code
Jun 24, 2023
Viaarxiv icon