Picture for Zihao Lin

Zihao Lin

UniTemp: Unlocking Video Generation in Any Temporal Order via Bidirectional Distillation

Add code
Jun 17, 2026
Viaarxiv icon

A Survey on LLM-based Conversational User Simulation

Add code
Apr 27, 2026
Viaarxiv icon

Think, Act, Build: An Agentic Framework with Vision Language Models for Zero-Shot 3D Visual Grounding

Add code
Apr 02, 2026
Viaarxiv icon

MiLDEdit: Reasoning-Based Multi-Layer Design Document Editing

Add code
Jan 08, 2026
Viaarxiv icon

SuperFlow: Training Flow Matching Models with RL on the Fly

Add code
Dec 17, 2025
Viaarxiv icon

Simple Lines, Big Ideas: Towards Interpretable Assessment of Human Creativity from Drawings

Add code
Nov 17, 2025
Viaarxiv icon

R2I-Bench: Benchmarking Reasoning-Driven Text-to-Image Generation

Add code
May 29, 2025
Figure 1 for R2I-Bench: Benchmarking Reasoning-Driven Text-to-Image Generation
Figure 2 for R2I-Bench: Benchmarking Reasoning-Driven Text-to-Image Generation
Figure 3 for R2I-Bench: Benchmarking Reasoning-Driven Text-to-Image Generation
Figure 4 for R2I-Bench: Benchmarking Reasoning-Driven Text-to-Image Generation
Viaarxiv icon

Localizing Knowledge in Diffusion Transformers

Add code
May 24, 2025
Viaarxiv icon

A Survey on Mechanistic Interpretability for Multi-Modal Foundation Models

Add code
Feb 22, 2025
Figure 1 for A Survey on Mechanistic Interpretability for Multi-Modal Foundation Models
Figure 2 for A Survey on Mechanistic Interpretability for Multi-Modal Foundation Models
Figure 3 for A Survey on Mechanistic Interpretability for Multi-Modal Foundation Models
Figure 4 for A Survey on Mechanistic Interpretability for Multi-Modal Foundation Models
Viaarxiv icon

Knowing When to Stop: Dynamic Context Cutoff for Large Language Models

Add code
Feb 03, 2025
Figure 1 for Knowing When to Stop: Dynamic Context Cutoff for Large Language Models
Figure 2 for Knowing When to Stop: Dynamic Context Cutoff for Large Language Models
Figure 3 for Knowing When to Stop: Dynamic Context Cutoff for Large Language Models
Figure 4 for Knowing When to Stop: Dynamic Context Cutoff for Large Language Models
Viaarxiv icon