Picture for Nojun Kwak

Nojun Kwak

Understanding Differential Transformer Unchains Pretrained Self-Attentions

Add code
May 22, 2025
Viaarxiv icon

S3D: Sketch-Driven 3D Model Generation

Add code
May 07, 2025
Viaarxiv icon

A Revisit to the Decoder for Camouflaged Object Detection

Add code
Mar 18, 2025
Viaarxiv icon

Unlocking the Potential of Unlabeled Data in Semi-Supervised Domain Generalization

Add code
Mar 18, 2025
Viaarxiv icon

DivCon-NeRF: Generating Augmented Rays with Diversity and Consistency for Few-shot View Synthesis

Add code
Mar 17, 2025
Viaarxiv icon

ROODI: Reconstructing Occluded Objects with Denoising Inpainters

Add code
Mar 13, 2025
Viaarxiv icon

Bi-ICE: An Inner Interpretable Framework for Image Classification via Bi-directional Interactions between Concept and Input Embeddings

Add code
Nov 26, 2024
Figure 1 for Bi-ICE: An Inner Interpretable Framework for Image Classification via Bi-directional Interactions between Concept and Input Embeddings
Viaarxiv icon

Decompose the model: Mechanistic interpretability in image models with Generalized Integrated Gradients (GIG)

Add code
Sep 03, 2024
Figure 1 for Decompose the model: Mechanistic interpretability in image models with Generalized Integrated Gradients (GIG)
Figure 2 for Decompose the model: Mechanistic interpretability in image models with Generalized Integrated Gradients (GIG)
Figure 3 for Decompose the model: Mechanistic interpretability in image models with Generalized Integrated Gradients (GIG)
Figure 4 for Decompose the model: Mechanistic interpretability in image models with Generalized Integrated Gradients (GIG)
Viaarxiv icon

MERLIN: Multimodal Embedding Refinement via LLM-based Iterative Navigation for Text-Video Retrieval-Rerank Pipeline

Add code
Jul 17, 2024
Figure 1 for MERLIN: Multimodal Embedding Refinement via LLM-based Iterative Navigation for Text-Video Retrieval-Rerank Pipeline
Figure 2 for MERLIN: Multimodal Embedding Refinement via LLM-based Iterative Navigation for Text-Video Retrieval-Rerank Pipeline
Figure 3 for MERLIN: Multimodal Embedding Refinement via LLM-based Iterative Navigation for Text-Video Retrieval-Rerank Pipeline
Figure 4 for MERLIN: Multimodal Embedding Refinement via LLM-based Iterative Navigation for Text-Video Retrieval-Rerank Pipeline
Viaarxiv icon

Practical Dataset Distillation Based on Deep Support Vectors

Add code
May 01, 2024
Viaarxiv icon