Picture for Jy-yong Sohn

Jy-yong Sohn

Transformers in the Dark: Navigating Unknown Search Spaces via Bandit Feedback

Add code
Mar 25, 2026
Viaarxiv icon

Fine-Tuning Without Forgetting In-Context Learning: A Theoretical Analysis of Linear Attention Models

Add code
Feb 26, 2026
Viaarxiv icon

Soft Task-Aware Routing of Experts for Equivariant Representation Learning

Add code
Oct 31, 2025
Viaarxiv icon

On the Similarities of Embeddings in Contrastive Learning

Add code
Jun 11, 2025
Figure 1 for On the Similarities of Embeddings in Contrastive Learning
Figure 2 for On the Similarities of Embeddings in Contrastive Learning
Figure 3 for On the Similarities of Embeddings in Contrastive Learning
Figure 4 for On the Similarities of Embeddings in Contrastive Learning
Viaarxiv icon

Understanding the behavior of representation forgetting in continual learning

Add code
May 28, 2025
Viaarxiv icon

FinDER: Financial Dataset for Question Answering and Evaluating Retrieval-Augmented Generation

Add code
Apr 22, 2025
Viaarxiv icon

A Theoretical Framework for Preventing Class Collapse in Supervised Contrastive Learning

Add code
Mar 11, 2025
Figure 1 for A Theoretical Framework for Preventing Class Collapse in Supervised Contrastive Learning
Figure 2 for A Theoretical Framework for Preventing Class Collapse in Supervised Contrastive Learning
Figure 3 for A Theoretical Framework for Preventing Class Collapse in Supervised Contrastive Learning
Figure 4 for A Theoretical Framework for Preventing Class Collapse in Supervised Contrastive Learning
Viaarxiv icon

Linq-Embed-Mistral Technical Report

Add code
Dec 04, 2024
Figure 1 for Linq-Embed-Mistral Technical Report
Figure 2 for Linq-Embed-Mistral Technical Report
Figure 3 for Linq-Embed-Mistral Technical Report
Figure 4 for Linq-Embed-Mistral Technical Report
Viaarxiv icon

Buffer-based Gradient Projection for Continual Federated Learning

Add code
Sep 03, 2024
Viaarxiv icon

Memorization Capacity for Additive Fine-Tuning with Small ReLU Networks

Add code
Aug 01, 2024
Figure 1 for Memorization Capacity for Additive Fine-Tuning with Small ReLU Networks
Figure 2 for Memorization Capacity for Additive Fine-Tuning with Small ReLU Networks
Figure 3 for Memorization Capacity for Additive Fine-Tuning with Small ReLU Networks
Figure 4 for Memorization Capacity for Additive Fine-Tuning with Small ReLU Networks
Viaarxiv icon