Picture for Zhijian Liu

Zhijian Liu

PFAvatar: Pose-Fusion 3D Personalized Avatar Reconstruction from Real-World Outfit-of-the-Day Photos

Add code
Nov 18, 2025
Viaarxiv icon

ParoQuant: Pairwise Rotation Quantization for Efficient Reasoning LLM Inference

Add code
Nov 13, 2025
Viaarxiv icon

3D Aware Region Prompted Vision Language Model

Add code
Sep 16, 2025
Figure 1 for 3D Aware Region Prompted Vision Language Model
Figure 2 for 3D Aware Region Prompted Vision Language Model
Figure 3 for 3D Aware Region Prompted Vision Language Model
Figure 4 for 3D Aware Region Prompted Vision Language Model
Viaarxiv icon

ArcMemo: Abstract Reasoning Composition with Lifelong LLM Memory

Add code
Sep 04, 2025
Figure 1 for ArcMemo: Abstract Reasoning Composition with Lifelong LLM Memory
Figure 2 for ArcMemo: Abstract Reasoning Composition with Lifelong LLM Memory
Figure 3 for ArcMemo: Abstract Reasoning Composition with Lifelong LLM Memory
Figure 4 for ArcMemo: Abstract Reasoning Composition with Lifelong LLM Memory
Viaarxiv icon

Scaling RL to Long Videos

Add code
Jul 10, 2025
Viaarxiv icon

Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding

Add code
May 28, 2025
Viaarxiv icon

Token-Efficient Long Video Understanding for Multimodal LLMs

Add code
Mar 06, 2025
Figure 1 for Token-Efficient Long Video Understanding for Multimodal LLMs
Figure 2 for Token-Efficient Long Video Understanding for Multimodal LLMs
Figure 3 for Token-Efficient Long Video Understanding for Multimodal LLMs
Figure 4 for Token-Efficient Long Video Understanding for Multimodal LLMs
Viaarxiv icon

LServe: Efficient Long-sequence LLM Serving with Unified Sparse Attention

Add code
Feb 20, 2025
Viaarxiv icon

NVILA: Efficient Frontier Visual Language Models

Add code
Dec 05, 2024
Figure 1 for NVILA: Efficient Frontier Visual Language Models
Figure 2 for NVILA: Efficient Frontier Visual Language Models
Figure 3 for NVILA: Efficient Frontier Visual Language Models
Figure 4 for NVILA: Efficient Frontier Visual Language Models
Viaarxiv icon

VILA-M3: Enhancing Vision-Language Models with Medical Expert Knowledge

Add code
Nov 19, 2024
Figure 1 for VILA-M3: Enhancing Vision-Language Models with Medical Expert Knowledge
Figure 2 for VILA-M3: Enhancing Vision-Language Models with Medical Expert Knowledge
Figure 3 for VILA-M3: Enhancing Vision-Language Models with Medical Expert Knowledge
Figure 4 for VILA-M3: Enhancing Vision-Language Models with Medical Expert Knowledge
Viaarxiv icon