Rui Tian

HiDream-I1: A High-Efficient Image Generative Foundation Model with Sparse Diffusion Transformer

May 28, 2025
Via arXiv

UniGen: Enhanced Training & Test-Time Strategies for Unified Multimodal Understanding and Generation

May 20, 2025
Via arXiv

Multi-Modality Driven LoRA for Adverse Condition Depth Estimation

Dec 28, 2024
Via arXiv

REDUCIO! Generating 1024$\times$1024 Video within 16 Seconds using Extremely Compressed Motion Latents

Nov 20, 2024
Via arXiv

DeepStack: Deeply Stacking Visual Tokens is Surprisingly Simple and Effective for LMMs

Jun 06, 2024
Via arXiv

S3-SLAM: Sparse Tri-plane Encoding for Neural Implicit SLAM

Apr 28, 2024
Via arXiv

A Practical Large-Scale Roadside Multi-View Multi-Sensor Spatial Synchronization Framework for Intelligent Transportation Systems

Nov 04, 2023
Via arXiv

UniQuadric: A SLAM Backend for Unknown Rigid Object 3D Tracking and Light-Weight Modeling

Oct 02, 2023
Via arXiv

ResFormer: Scaling ViTs with Multi-Resolution Training

Dec 01, 2022
Via arXiv

Rethinking Skip Connections in Encoder-decoder Networks for Monocular Depth Estimation

Aug 29, 2022
Via arXiv