Picture for Sifan Li

Sifan Li

AudioRouter: Data Efficient Audio Understanding via RL based Dual Reasoning

Add code
Feb 11, 2026
Viaarxiv icon

OptiSQL: Executable SQL Generation from Optical Tokens

Add code
Jan 21, 2026
Viaarxiv icon

Detecting and Mitigating Insertion Hallucination in Video-to-Audio Generation

Add code
Oct 09, 2025
Viaarxiv icon

Replace in Translation: Boost Concept Alignment in Counterfactual Text-to-Image

Add code
May 20, 2025
Figure 1 for Replace in Translation: Boost Concept Alignment in Counterfactual Text-to-Image
Figure 2 for Replace in Translation: Boost Concept Alignment in Counterfactual Text-to-Image
Figure 3 for Replace in Translation: Boost Concept Alignment in Counterfactual Text-to-Image
Figure 4 for Replace in Translation: Boost Concept Alignment in Counterfactual Text-to-Image
Viaarxiv icon