Picture for Kyuhong Shim

Kyuhong Shim

Towards Comprehensive Scene Understanding: Integrating First and Third-Person Views for LVLMs

Add code
May 28, 2025
Viaarxiv icon

Voicing Personas: Rewriting Persona Descriptions into Style Prompts for Controllable Text-to-Speech

Add code
May 21, 2025
Viaarxiv icon

Visually Guided Decoding: Gradient-Free Hard Prompt Inversion with Language Models

Add code
May 13, 2025
Viaarxiv icon

Chain-of-Rank: Enhancing Large Language Models for Domain-Specific RAG in Edge Device

Add code
Feb 21, 2025
Viaarxiv icon

Learning Primitive Relations for Compositional Zero-Shot Learning

Add code
Jan 24, 2025
Viaarxiv icon

Unlocking Transfer Learning for Open-World Few-Shot Recognition

Add code
Nov 15, 2024
Viaarxiv icon

Preserving Pre-trained Representation Space: On Effectiveness of Prefix-tuning for Large Multi-modal Models

Add code
Oct 29, 2024
Viaarxiv icon

Semantic Token Reweighting for Interpretable and Controllable Text Embeddings in CLIP

Add code
Oct 11, 2024
Figure 1 for Semantic Token Reweighting for Interpretable and Controllable Text Embeddings in CLIP
Figure 2 for Semantic Token Reweighting for Interpretable and Controllable Text Embeddings in CLIP
Figure 3 for Semantic Token Reweighting for Interpretable and Controllable Text Embeddings in CLIP
Figure 4 for Semantic Token Reweighting for Interpretable and Controllable Text Embeddings in CLIP
Viaarxiv icon

InfiniPot: Infinite Context Processing on Memory-Constrained LLMs

Add code
Oct 02, 2024
Viaarxiv icon

Crayon: Customized On-Device LLM via Instant Adapter Blending and Edge-Server Hybrid Inference

Add code
Jun 11, 2024
Viaarxiv icon