Picture for Hengyi Wang

Hengyi Wang

FreeText: Training-Free Text Rendering in Diffusion Transformers via Attention Localization and Spectral Glyph Injection

Add code
Jan 02, 2026
Viaarxiv icon

AuthSig: Safeguarding Scanned Signatures Against Unauthorized Reuse in Paperless Workflows

Add code
Nov 12, 2025
Viaarxiv icon

LaVA-Man: Learning Visual Action Representations for Robot Manipulation

Add code
Aug 26, 2025
Viaarxiv icon

Token-Level Uncertainty Estimation for Large Language Model Reasoning

Add code
May 16, 2025
Viaarxiv icon

Variational Language Concepts for Interpreting Foundation Language Models

Add code
Oct 04, 2024
Figure 1 for Variational Language Concepts for Interpreting Foundation Language Models
Figure 2 for Variational Language Concepts for Interpreting Foundation Language Models
Figure 3 for Variational Language Concepts for Interpreting Foundation Language Models
Figure 4 for Variational Language Concepts for Interpreting Foundation Language Models
Viaarxiv icon

3D Reconstruction with Spatial Memory

Add code
Aug 28, 2024
Viaarxiv icon

Probabilistic Conceptual Explainers: Trustworthy Conceptual Explanations for Vision Foundation Models

Add code
Jun 18, 2024
Viaarxiv icon

Multimodal Needle in a Haystack: Benchmarking Long-Context Capability of Multimodal Large Language Models

Add code
Jun 17, 2024
Figure 1 for Multimodal Needle in a Haystack: Benchmarking Long-Context Capability of Multimodal Large Language Models
Figure 2 for Multimodal Needle in a Haystack: Benchmarking Long-Context Capability of Multimodal Large Language Models
Figure 3 for Multimodal Needle in a Haystack: Benchmarking Long-Context Capability of Multimodal Large Language Models
Figure 4 for Multimodal Needle in a Haystack: Benchmarking Long-Context Capability of Multimodal Large Language Models
Viaarxiv icon

Continual Learning of Large Language Models: A Comprehensive Survey

Add code
Apr 25, 2024
Figure 1 for Continual Learning of Large Language Models: A Comprehensive Survey
Figure 2 for Continual Learning of Large Language Models: A Comprehensive Survey
Figure 3 for Continual Learning of Large Language Models: A Comprehensive Survey
Figure 4 for Continual Learning of Large Language Models: A Comprehensive Survey
Viaarxiv icon

MorpheuS: Neural Dynamic 360° Surface Reconstruction from Monocular RGB-D Video

Add code
Dec 01, 2023
Viaarxiv icon