Picture for Tiejun Huang

Tiejun Huang

Video-XL: Extra-Long Vision Language Model for Hour-Scale Video Understanding

Add code
Sep 24, 2024
Figure 1 for Video-XL: Extra-Long Vision Language Model for Hour-Scale Video Understanding
Figure 2 for Video-XL: Extra-Long Vision Language Model for Hour-Scale Video Understanding
Figure 3 for Video-XL: Extra-Long Vision Language Model for Hour-Scale Video Understanding
Figure 4 for Video-XL: Extra-Long Vision Language Model for Hour-Scale Video Understanding
Viaarxiv icon

OmniGen: Unified Image Generation

Add code
Sep 17, 2024
Figure 1 for OmniGen: Unified Image Generation
Figure 2 for OmniGen: Unified Image Generation
Figure 3 for OmniGen: Unified Image Generation
Figure 4 for OmniGen: Unified Image Generation
Viaarxiv icon

ShapeMamba-EM: Fine-Tuning Foundation Model with Local Shape Descriptors and Mamba Blocks for 3D EM Image Segmentation

Add code
Aug 26, 2024
Figure 1 for ShapeMamba-EM: Fine-Tuning Foundation Model with Local Shape Descriptors and Mamba Blocks for 3D EM Image Segmentation
Figure 2 for ShapeMamba-EM: Fine-Tuning Foundation Model with Local Shape Descriptors and Mamba Blocks for 3D EM Image Segmentation
Figure 3 for ShapeMamba-EM: Fine-Tuning Foundation Model with Local Shape Descriptors and Mamba Blocks for 3D EM Image Segmentation
Figure 4 for ShapeMamba-EM: Fine-Tuning Foundation Model with Local Shape Descriptors and Mamba Blocks for 3D EM Image Segmentation
Viaarxiv icon

PRTGS: Precomputed Radiance Transfer of Gaussian Splats for Real-Time High-Quality Relighting

Add code
Aug 07, 2024
Viaarxiv icon

Multimodal Large Language Models for Bioimage Analysis

Add code
Jul 29, 2024
Viaarxiv icon

SpikeGS: 3D Gaussian Splatting from Spike Streams with High-Speed Camera Motion

Add code
Jul 14, 2024
Viaarxiv icon

SpikeGS: Reconstruct 3D scene via fast-moving bio-inspired sensors

Add code
Jul 04, 2024
Figure 1 for SpikeGS: Reconstruct 3D scene via fast-moving bio-inspired sensors
Figure 2 for SpikeGS: Reconstruct 3D scene via fast-moving bio-inspired sensors
Figure 3 for SpikeGS: Reconstruct 3D scene via fast-moving bio-inspired sensors
Figure 4 for SpikeGS: Reconstruct 3D scene via fast-moving bio-inspired sensors
Viaarxiv icon

52B to 1T: Lessons Learned via Tele-FLM Series

Add code
Jul 03, 2024
Figure 1 for 52B to 1T: Lessons Learned via Tele-FLM Series
Figure 2 for 52B to 1T: Lessons Learned via Tele-FLM Series
Figure 3 for 52B to 1T: Lessons Learned via Tele-FLM Series
Figure 4 for 52B to 1T: Lessons Learned via Tele-FLM Series
Viaarxiv icon

MLVU: A Comprehensive Benchmark for Multi-Task Long Video Understanding

Add code
Jun 06, 2024
Figure 1 for MLVU: A Comprehensive Benchmark for Multi-Task Long Video Understanding
Figure 2 for MLVU: A Comprehensive Benchmark for Multi-Task Long Video Understanding
Figure 3 for MLVU: A Comprehensive Benchmark for Multi-Task Long Video Understanding
Figure 4 for MLVU: A Comprehensive Benchmark for Multi-Task Long Video Understanding
Viaarxiv icon

SpikeMM: Flexi-Magnification of High-Speed Micro-Motions

Add code
Jun 01, 2024
Viaarxiv icon