Zhuang Liu

Transformers without Normalization

Mar 13, 2025
Via arXiv

Idiosyncrasies in Large Language Models

Feb 17, 2025
Via arXiv

MetaMorph: Multimodal Understanding and Generation via Instruction Tuning

Dec 18, 2024
Via arXiv

Understanding Bias in Large-Scale Visual Datasets

Dec 02, 2024
Via arXiv

LongVU: Spatiotemporal Adaptive Compression for Long Video-Language Understanding

Oct 22, 2024
Via arXiv

Amphista: Accelerate LLM Inference with Bi-directional Multiple Drafting Heads in a Non-autoregressive Style

Jun 19, 2024
Via arXiv

Explainable Few-shot Knowledge Tracing

May 23, 2024
Via arXiv

Wasserstein Dependent Graph Attention Network for Collaborative Filtering with Uncertainty

Apr 09, 2024
Via arXiv

A Decade's Battle on Dataset Bias: Are We There Yet?

Mar 13, 2024
Via arXiv

Massive Activations in Large Language Models

Feb 27, 2024
Via arXiv