Liwei Chen

$\text{VG}^2$GT: Voxel-Gaussian Splatting Visual Geometry Grounded Transformer

Jun 01, 2026

A neural operator framework for data-driven discovery of stability and receptivity in physical systems

Apr 21, 2026

Revisiting Robust RAG: Do We Still Need Complex Robust Training in the Era of Powerful LLMs?

Feb 17, 2025

Following the Autoregressive Nature of LLM Embeddings via Compression and Alignment

Feb 17, 2025

RectifID: Personalizing Rectified Flow with Anchored Classifier Guidance

May 23, 2024

Harder Tasks Need More Experts: Dynamic Routing in MoE Models

Mar 12, 2024

Probing Multimodal Large Language Models for Global and Local Semantic Representation

Feb 27, 2024

Video-LaVIT: Unified Video-Language Pre-training with Decoupled Visual-Motional Tokenization

Feb 06, 2024

A Step Closer to Comprehensive Answers: Constrained Multi-Stage Question Decomposition with Large Language Models

Nov 13, 2023

Unified Language-Vision Pretraining in LLM with Dynamic Discrete Visual Tokenization

Sep 29, 2023