Picture for Tieyuan Chen

Tieyuan Chen

LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model

Add code
Apr 22, 2026
Viaarxiv icon

VidLaDA: Bidirectional Diffusion Large Language Models for Efficient Video Understanding

Add code
Jan 25, 2026
Viaarxiv icon

Enhancing Video Large Language Models with Structured Multi-Video Collaborative Reasoning (early version)

Add code
Sep 16, 2025
Viaarxiv icon

Grove MoE: Towards Efficient and Superior MoE LLMs with Adjugate Experts

Add code
Aug 11, 2025
Viaarxiv icon

Looking Beyond Visible Cues: Implicit Video Question Answering via Dual-Clue Reasoning

Add code
Jun 09, 2025
Figure 1 for Looking Beyond Visible Cues: Implicit Video Question Answering via Dual-Clue Reasoning
Figure 2 for Looking Beyond Visible Cues: Implicit Video Question Answering via Dual-Clue Reasoning
Figure 3 for Looking Beyond Visible Cues: Implicit Video Question Answering via Dual-Clue Reasoning
Figure 4 for Looking Beyond Visible Cues: Implicit Video Question Answering via Dual-Clue Reasoning
Viaarxiv icon

Unleashing Diffusion Transformers for Visual Correspondence by Modulating Massive Activations

Add code
May 24, 2025
Figure 1 for Unleashing Diffusion Transformers for Visual Correspondence by Modulating Massive Activations
Figure 2 for Unleashing Diffusion Transformers for Visual Correspondence by Modulating Massive Activations
Figure 3 for Unleashing Diffusion Transformers for Visual Correspondence by Modulating Massive Activations
Figure 4 for Unleashing Diffusion Transformers for Visual Correspondence by Modulating Massive Activations
Viaarxiv icon

Contrastive Representation Distillation via Multi-Scale Feature Decoupling

Add code
Feb 09, 2025
Figure 1 for Contrastive Representation Distillation via Multi-Scale Feature Decoupling
Figure 2 for Contrastive Representation Distillation via Multi-Scale Feature Decoupling
Figure 3 for Contrastive Representation Distillation via Multi-Scale Feature Decoupling
Figure 4 for Contrastive Representation Distillation via Multi-Scale Feature Decoupling
Viaarxiv icon

MECD+: Unlocking Event-Level Causal Graph Discovery for Video Reasoning

Add code
Jan 16, 2025
Figure 1 for MECD+: Unlocking Event-Level Causal Graph Discovery for Video Reasoning
Figure 2 for MECD+: Unlocking Event-Level Causal Graph Discovery for Video Reasoning
Figure 3 for MECD+: Unlocking Event-Level Causal Graph Discovery for Video Reasoning
Figure 4 for MECD+: Unlocking Event-Level Causal Graph Discovery for Video Reasoning
Viaarxiv icon

CSTA: Spatial-Temporal Causal Adaptive Learning for Exemplar-Free Video Class-Incremental Learning

Add code
Jan 13, 2025
Figure 1 for CSTA: Spatial-Temporal Causal Adaptive Learning for Exemplar-Free Video Class-Incremental Learning
Figure 2 for CSTA: Spatial-Temporal Causal Adaptive Learning for Exemplar-Free Video Class-Incremental Learning
Figure 3 for CSTA: Spatial-Temporal Causal Adaptive Learning for Exemplar-Free Video Class-Incremental Learning
Figure 4 for CSTA: Spatial-Temporal Causal Adaptive Learning for Exemplar-Free Video Class-Incremental Learning
Viaarxiv icon

MECD: Unlocking Multi-Event Causal Discovery in Video Reasoning

Add code
Sep 26, 2024
Figure 1 for MECD: Unlocking Multi-Event Causal Discovery in Video Reasoning
Figure 2 for MECD: Unlocking Multi-Event Causal Discovery in Video Reasoning
Figure 3 for MECD: Unlocking Multi-Event Causal Discovery in Video Reasoning
Figure 4 for MECD: Unlocking Multi-Event Causal Discovery in Video Reasoning
Viaarxiv icon