Deqing Fu

Resa: Transparent Reasoning Models via SAEs

Jun 11, 2025
Via arXiv

Textual Steering Vectors Can Improve Visual Understanding in Multimodal Large Language Models

May 20, 2025
Via arXiv

VisualLens: Personalization through Visual History

Nov 25, 2024
Via arXiv

TLDR: Token-Level Detective Reward Model for Large Vision Language Models

Oct 07, 2024
Via arXiv

Pre-trained Large Language Models Use Fourier Features to Compute Addition

Jun 05, 2024
Via arXiv

IsoBench: Benchmarking Multimodal Foundation Models on Isomorphic Representations

Apr 02, 2024
Via arXiv

Simplicity Bias of Transformers to Learn Low Sensitivity Functions

Mar 11, 2024
Via arXiv

DeLLMa: A Framework for Decision Making Under Uncertainty with Large Language Models

Feb 04, 2024
Via arXiv

DreamSync: Aligning Text-to-Image Generation with Image Understanding Feedback

Nov 29, 2023
Via arXiv

Transformers Learn Higher-Order Optimization Methods for In-Context Learning: A Study with Linear Models

Oct 26, 2023
Via arXiv