Picture for Tao Xiang

Tao Xiang

Tuna-2: Pixel Embeddings Beat Vision Encoders for Multimodal Understanding and Generation

Add code
Apr 27, 2026
Viaarxiv icon

Rays as Pixels: Learning A Joint Distribution of Videos and Camera Trajectories

Add code
Apr 10, 2026
Viaarxiv icon

Intelligent Forensics in Next-Generation Mobile Networks: Evidence, Methods, and Applications

Add code
Mar 31, 2026
Viaarxiv icon

TransText: Alpha-as-RGB Representation for Transparent Text Animation

Add code
Mar 19, 2026
Viaarxiv icon

VecGlypher: Unified Vector Glyph Generation with Language Models

Add code
Feb 25, 2026
Viaarxiv icon

HiStream: Efficient High-Resolution Video Generation via Redundancy-Eliminated Streaming

Add code
Dec 24, 2025
Figure 1 for HiStream: Efficient High-Resolution Video Generation via Redundancy-Eliminated Streaming
Figure 2 for HiStream: Efficient High-Resolution Video Generation via Redundancy-Eliminated Streaming
Figure 3 for HiStream: Efficient High-Resolution Video Generation via Redundancy-Eliminated Streaming
Figure 4 for HiStream: Efficient High-Resolution Video Generation via Redundancy-Eliminated Streaming
Viaarxiv icon

OneStory: Coherent Multi-Shot Video Generation with Adaptive Memory

Add code
Dec 08, 2025
Figure 1 for OneStory: Coherent Multi-Shot Video Generation with Adaptive Memory
Figure 2 for OneStory: Coherent Multi-Shot Video Generation with Adaptive Memory
Figure 3 for OneStory: Coherent Multi-Shot Video Generation with Adaptive Memory
Figure 4 for OneStory: Coherent Multi-Shot Video Generation with Adaptive Memory
Viaarxiv icon

Mixture of States: Routing Token-Level Dynamics for Multimodal Generation

Add code
Nov 15, 2025
Viaarxiv icon

Stackelberg Game-Driven Defense for ISAC Against Channel Attacks in Low-Altitude Networks

Add code
Nov 09, 2025
Viaarxiv icon

Solving the Hubbard model with Neural Quantum States

Add code
Jul 03, 2025
Viaarxiv icon