Picture for Kimin Lee

Kimin Lee

By My Eyes: Grounding Multimodal Large Language Models with Sensor Data via Visual Prompting

Add code
Jul 15, 2024
Viaarxiv icon

Beyond Thumbs Up/Down: Untangling Challenges of Fine-Grained Feedback for Text-to-Image Generation

Add code
Jun 24, 2024
Viaarxiv icon

Aligning Large Language Models with Self-generated Preference Data

Add code
Jun 06, 2024
Viaarxiv icon

Benchmarking Mobile Device Control Agents across Diverse Configurations

Add code
Apr 25, 2024
Figure 1 for Benchmarking Mobile Device Control Agents across Diverse Configurations
Figure 2 for Benchmarking Mobile Device Control Agents across Diverse Configurations
Figure 3 for Benchmarking Mobile Device Control Agents across Diverse Configurations
Figure 4 for Benchmarking Mobile Device Control Agents across Diverse Configurations
Viaarxiv icon

Identity Decoupling for Multi-Subject Personalization of Text-to-Image Models

Add code
Apr 05, 2024
Viaarxiv icon

Confidence-aware Reward Optimization for Fine-tuning Text-to-Image Models

Add code
Apr 02, 2024
Viaarxiv icon

Promptable Behaviors: Personalizing Multi-Objective Rewards from Human Preferences

Add code
Dec 14, 2023
Figure 1 for Promptable Behaviors: Personalizing Multi-Objective Rewards from Human Preferences
Figure 2 for Promptable Behaviors: Personalizing Multi-Objective Rewards from Human Preferences
Figure 3 for Promptable Behaviors: Personalizing Multi-Objective Rewards from Human Preferences
Figure 4 for Promptable Behaviors: Personalizing Multi-Objective Rewards from Human Preferences
Viaarxiv icon

InstructBooth: Instruction-following Personalized Text-to-Image Generation

Add code
Dec 04, 2023
Figure 1 for InstructBooth: Instruction-following Personalized Text-to-Image Generation
Figure 2 for InstructBooth: Instruction-following Personalized Text-to-Image Generation
Figure 3 for InstructBooth: Instruction-following Personalized Text-to-Image Generation
Figure 4 for InstructBooth: Instruction-following Personalized Text-to-Image Generation
Viaarxiv icon

Guide Your Agent with Adaptive Multimodal Rewards

Add code
Sep 19, 2023
Figure 1 for Guide Your Agent with Adaptive Multimodal Rewards
Figure 2 for Guide Your Agent with Adaptive Multimodal Rewards
Figure 3 for Guide Your Agent with Adaptive Multimodal Rewards
Figure 4 for Guide Your Agent with Adaptive Multimodal Rewards
Viaarxiv icon

StyleDrop: Text-to-Image Generation in Any Style

Add code
Jun 01, 2023
Figure 1 for StyleDrop: Text-to-Image Generation in Any Style
Figure 2 for StyleDrop: Text-to-Image Generation in Any Style
Figure 3 for StyleDrop: Text-to-Image Generation in Any Style
Figure 4 for StyleDrop: Text-to-Image Generation in Any Style
Viaarxiv icon