Picture for Dongyeop Kang

Dongyeop Kang

UC Berkeley

Toward Evaluative Thinking: Meta Policy Optimization with Evolving Reward Models

Add code
Apr 28, 2025
Viaarxiv icon

LawFlow : Collecting and Simulating Lawyers' Thought Processes

Add code
Apr 26, 2025
Viaarxiv icon

Stealing Creator's Workflow: A Creator-Inspired Agentic Framework with Iterative Feedback Loop for Improved Scientific Short-form Generation

Add code
Apr 26, 2025
Viaarxiv icon

Learning Explainable Dense Reward Shapes via Bayesian Optimization

Add code
Apr 22, 2025
Viaarxiv icon

Align to Structure: Aligning Large Language Models with Structural Information

Add code
Apr 04, 2025
Viaarxiv icon

A Framework for Robust Cognitive Evaluation of LLMs

Add code
Apr 03, 2025
Viaarxiv icon

Learning a High-quality Robotic Wiping Policy Using Systematic Reward Analysis and Visual-Language Model Based Curriculum

Add code
Feb 18, 2025
Viaarxiv icon

RoSTE: An Efficient Quantization-Aware Supervised Fine-Tuning Approach for Large Language Models

Add code
Feb 13, 2025
Viaarxiv icon

ScholaWrite: A Dataset of End-to-End Scholarly Writing Process

Add code
Feb 05, 2025
Figure 1 for ScholaWrite: A Dataset of End-to-End Scholarly Writing Process
Figure 2 for ScholaWrite: A Dataset of End-to-End Scholarly Writing Process
Figure 3 for ScholaWrite: A Dataset of End-to-End Scholarly Writing Process
Figure 4 for ScholaWrite: A Dataset of End-to-End Scholarly Writing Process
Viaarxiv icon

Anchors Aweigh! Sail for Optimal Unified Multi-Modal Representations

Add code
Oct 02, 2024
Figure 1 for Anchors Aweigh! Sail for Optimal Unified Multi-Modal Representations
Figure 2 for Anchors Aweigh! Sail for Optimal Unified Multi-Modal Representations
Figure 3 for Anchors Aweigh! Sail for Optimal Unified Multi-Modal Representations
Figure 4 for Anchors Aweigh! Sail for Optimal Unified Multi-Modal Representations
Viaarxiv icon