Picture for Song Park

Song Park

Seeing What You Say: Expressive Image Generation from Speech

Add code
Nov 05, 2025
Viaarxiv icon

DNNs May Determine Major Properties of Their Outputs Early, with Timing Possibly Driven by Bias

Add code
Feb 12, 2025
Viaarxiv icon

MaskRIS: Semantic Distortion-aware Data Augmentation for Referring Image Segmentation

Add code
Nov 28, 2024
Viaarxiv icon

Probabilistic Language-Image Pre-Training

Add code
Oct 24, 2024
Figure 1 for Probabilistic Language-Image Pre-Training
Figure 2 for Probabilistic Language-Image Pre-Training
Figure 3 for Probabilistic Language-Image Pre-Training
Figure 4 for Probabilistic Language-Image Pre-Training
Viaarxiv icon

Rotary Position Embedding for Vision Transformer

Add code
Mar 20, 2024
Viaarxiv icon

Forging Tokens for Improved Storage-efficient Training

Add code
Dec 15, 2023
Viaarxiv icon

SeiT: Storage-Efficient Vision Training with Tokens Using 1% of Pixel Storage

Add code
Mar 20, 2023
Figure 1 for SeiT: Storage-Efficient Vision Training with Tokens Using 1% of Pixel Storage
Figure 2 for SeiT: Storage-Efficient Vision Training with Tokens Using 1% of Pixel Storage
Figure 3 for SeiT: Storage-Efficient Vision Training with Tokens Using 1% of Pixel Storage
Figure 4 for SeiT: Storage-Efficient Vision Training with Tokens Using 1% of Pixel Storage
Viaarxiv icon

Similarity of Neural Architectures Based on Input Gradient Transferability

Add code
Oct 20, 2022
Figure 1 for Similarity of Neural Architectures Based on Input Gradient Transferability
Figure 2 for Similarity of Neural Architectures Based on Input Gradient Transferability
Figure 3 for Similarity of Neural Architectures Based on Input Gradient Transferability
Figure 4 for Similarity of Neural Architectures Based on Input Gradient Transferability
Viaarxiv icon

ECCV Caption: Correcting False Negatives by Collecting Machine-and-Human-verified Image-Caption Associations for MS-COCO

Add code
Apr 14, 2022
Figure 1 for ECCV Caption: Correcting False Negatives by Collecting Machine-and-Human-verified Image-Caption Associations for MS-COCO
Figure 2 for ECCV Caption: Correcting False Negatives by Collecting Machine-and-Human-verified Image-Caption Associations for MS-COCO
Figure 3 for ECCV Caption: Correcting False Negatives by Collecting Machine-and-Human-verified Image-Caption Associations for MS-COCO
Figure 4 for ECCV Caption: Correcting False Negatives by Collecting Machine-and-Human-verified Image-Caption Associations for MS-COCO
Viaarxiv icon

Few-shot Font Generation with Weakly Supervised Localized Representations

Add code
Dec 22, 2021
Figure 1 for Few-shot Font Generation with Weakly Supervised Localized Representations
Figure 2 for Few-shot Font Generation with Weakly Supervised Localized Representations
Figure 3 for Few-shot Font Generation with Weakly Supervised Localized Representations
Figure 4 for Few-shot Font Generation with Weakly Supervised Localized Representations
Viaarxiv icon