Picture for Tao Liang

Tao Liang

Grounding Language with Vision: A Conditional Mutual Information Calibrated Decoding Strategy for Reducing Hallucinations in LVLMs

Add code
May 26, 2025
Viaarxiv icon

Code2Logic: Game-Code-Driven Data Synthesis for Enhancing VLMs General Reasoning

Add code
May 20, 2025
Viaarxiv icon

Route Sparse Autoencoder to Interpret Large Language Models

Add code
Mar 11, 2025
Viaarxiv icon

SPEED: Scalable, Precise, and Efficient Concept Erasure for Diffusion Models

Add code
Mar 10, 2025
Viaarxiv icon

PFDial: A Structured Dialogue Instruction Fine-tuning Method Based on UML Flowcharts

Add code
Mar 09, 2025
Viaarxiv icon

Predicting Large Language Model Capabilities on Closed-Book QA Tasks Using Only Information Available Prior to Training

Add code
Feb 06, 2025
Viaarxiv icon

Neuron-Level Sequential Editing for Large Language Models

Add code
Oct 05, 2024
Figure 1 for Neuron-Level Sequential Editing for Large Language Models
Figure 2 for Neuron-Level Sequential Editing for Large Language Models
Figure 3 for Neuron-Level Sequential Editing for Large Language Models
Figure 4 for Neuron-Level Sequential Editing for Large Language Models
Viaarxiv icon

STAR: Scale-wise Text-to-image generation via Auto-Regressive representations

Add code
Jun 16, 2024
Figure 1 for STAR: Scale-wise Text-to-image generation via Auto-Regressive representations
Figure 2 for STAR: Scale-wise Text-to-image generation via Auto-Regressive representations
Figure 3 for STAR: Scale-wise Text-to-image generation via Auto-Regressive representations
Figure 4 for STAR: Scale-wise Text-to-image generation via Auto-Regressive representations
Viaarxiv icon

Multi-scale Cooperative Multimodal Transformers for Multimodal Sentiment Analysis in Videos

Add code
Jun 17, 2022
Figure 1 for Multi-scale Cooperative Multimodal Transformers for Multimodal Sentiment Analysis in Videos
Figure 2 for Multi-scale Cooperative Multimodal Transformers for Multimodal Sentiment Analysis in Videos
Figure 3 for Multi-scale Cooperative Multimodal Transformers for Multimodal Sentiment Analysis in Videos
Figure 4 for Multi-scale Cooperative Multimodal Transformers for Multimodal Sentiment Analysis in Videos
Viaarxiv icon

LI-Net: Large-Pose Identity-Preserving Face Reenactment Network

Add code
Apr 07, 2021
Figure 1 for LI-Net: Large-Pose Identity-Preserving Face Reenactment Network
Figure 2 for LI-Net: Large-Pose Identity-Preserving Face Reenactment Network
Figure 3 for LI-Net: Large-Pose Identity-Preserving Face Reenactment Network
Figure 4 for LI-Net: Large-Pose Identity-Preserving Face Reenactment Network
Viaarxiv icon