Felix Juefei-Xu

UniT: Unified Multimodal Chain-of-Thought Test-time Scaling

Feb 12, 2026
Via arXiv

HairWeaver: Few-Shot Photorealistic Hair Motion Synthesis with Sim-to-Real Guided Video Diffusion

Feb 11, 2026
Via arXiv

Non-Markov Multi-Round Conversational Image Generation with History-Conditioned MLLMs

Jan 28, 2026
Via arXiv

PhyGDPO: Physics-Aware Groupwise Direct Preference Optimization for Physically Consistent Text-to-Video Generation

Dec 31, 2025
Via arXiv

Exploring MLLM-Diffusion Information Transfer with MetaCanvas

Dec 12, 2025
Via arXiv

Beyond Pixels: Semantic-aware Typographic Attack for Geo-Privacy Protection

Nov 16, 2025
Via arXiv

Pushing the Limits of Safety: A Technical Report on the ATLAS Challenge 2025

Jun 14, 2025
Via arXiv

Token-Shuffle: Towards High-Resolution Image Generation with Autoregressive Models

Apr 24, 2025
Via arXiv

A Comprehensive Survey in LLM(-Agent) Full Stack Safety: Data, Training and Deployment

Apr 22, 2025
Via arXiv

Transfer between Modalities with MetaQueries

Apr 08, 2025
Via arXiv