Picture for Jia-Hong Huang

Jia-Hong Huang

A-MAR: Agent-based Multimodal Art Retrieval for Fine-Grained Artwork Understanding

Add code
Apr 21, 2026
Viaarxiv icon

A Novel Automatic Framework for Speaker Drift Detection in Synthesized Speech

Add code
Apr 07, 2026
Viaarxiv icon

Are a Thousand Words Better Than a Single Picture? Beyond Images -- A Framework for Multi-Modal Knowledge Graph Dataset Enrichment

Add code
Mar 17, 2026
Viaarxiv icon

DeepEyeNet: Generating Medical Report for Retinal Images

Add code
Sep 16, 2025
Viaarxiv icon

MaCP: Minimal yet Mighty Adaptation via Hierarchical Cosine Projection

Add code
May 29, 2025
Figure 1 for MaCP: Minimal yet Mighty Adaptation via Hierarchical Cosine Projection
Figure 2 for MaCP: Minimal yet Mighty Adaptation via Hierarchical Cosine Projection
Figure 3 for MaCP: Minimal yet Mighty Adaptation via Hierarchical Cosine Projection
Figure 4 for MaCP: Minimal yet Mighty Adaptation via Hierarchical Cosine Projection
Viaarxiv icon

SSH: Sparse Spectrum Adaptation via Discrete Hartley Transformation

Add code
Feb 08, 2025
Figure 1 for SSH: Sparse Spectrum Adaptation via Discrete Hartley Transformation
Figure 2 for SSH: Sparse Spectrum Adaptation via Discrete Hartley Transformation
Figure 3 for SSH: Sparse Spectrum Adaptation via Discrete Hartley Transformation
Figure 4 for SSH: Sparse Spectrum Adaptation via Discrete Hartley Transformation
Viaarxiv icon

Gradient Weight-normalized Low-rank Projection for Efficient LLM Training

Add code
Dec 27, 2024
Viaarxiv icon

Image2Text2Image: A Novel Framework for Label-Free Evaluation of Image-to-Text Generation with Text-to-Image Diffusion Models

Add code
Nov 08, 2024
Figure 1 for Image2Text2Image: A Novel Framework for Label-Free Evaluation of Image-to-Text Generation with Text-to-Image Diffusion Models
Figure 2 for Image2Text2Image: A Novel Framework for Label-Free Evaluation of Image-to-Text Generation with Text-to-Image Diffusion Models
Figure 3 for Image2Text2Image: A Novel Framework for Label-Free Evaluation of Image-to-Text Generation with Text-to-Image Diffusion Models
Figure 4 for Image2Text2Image: A Novel Framework for Label-Free Evaluation of Image-to-Text Generation with Text-to-Image Diffusion Models
Viaarxiv icon

Parameter-Efficient Fine-Tuning via Selective Discrete Cosine Transform

Add code
Oct 09, 2024
Viaarxiv icon

Personalized Video Summarization using Text-Based Queries and Conditional Modeling

Add code
Aug 27, 2024
Viaarxiv icon