Picture for Linjiang Huang

Linjiang Huang

FLUX-Reason-6M & PRISM-Bench: A Million-Scale Text-to-Image Reasoning Dataset and Comprehensive Benchmark

Add code
Sep 11, 2025
Viaarxiv icon

AeroDuo: Aerial Duo for UAV-based Vision and Language Navigation

Add code
Aug 21, 2025
Viaarxiv icon

GoT-R1: Unleashing Reasoning Capability of MLLM for Visual Generation with Reinforcement Learning

Add code
May 22, 2025
Viaarxiv icon

SOLVE: Synergy of Language-Vision and End-to-End Networks for Autonomous Driving

Add code
May 22, 2025
Viaarxiv icon

GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing

Add code
Mar 13, 2025
Viaarxiv icon

FlexDrive: Toward Trajectory Flexibility in Driving Scene Reconstruction and Rendering

Add code
Feb 28, 2025
Viaarxiv icon

GaussianPainter: Painting Point Cloud into 3D Gaussians with Normal Guidance

Add code
Dec 23, 2024
Figure 1 for GaussianPainter: Painting Point Cloud into 3D Gaussians with Normal Guidance
Figure 2 for GaussianPainter: Painting Point Cloud into 3D Gaussians with Normal Guidance
Figure 3 for GaussianPainter: Painting Point Cloud into 3D Gaussians with Normal Guidance
Figure 4 for GaussianPainter: Painting Point Cloud into 3D Gaussians with Normal Guidance
Viaarxiv icon

FreeEdit: Mask-free Reference-based Image Editing with Multi-modal Instruction

Add code
Sep 26, 2024
Viaarxiv icon

FouriScale: A Frequency Perspective on Training-Free High-Resolution Image Synthesis

Add code
Mar 19, 2024
Viaarxiv icon

Improving Weakly Supervised Temporal Action Localization by Bridging Train-Test Gap in Pseudo Labels

Add code
Apr 17, 2023
Viaarxiv icon