Picture for Dongyeop Kang

Dongyeop Kang

UC Berkeley

Confidence Calibration and Rationalization for LLMs via Multi-Agent Deliberation

Add code
Apr 14, 2024
Figure 1 for Confidence Calibration and Rationalization for LLMs via Multi-Agent Deliberation
Figure 2 for Confidence Calibration and Rationalization for LLMs via Multi-Agent Deliberation
Figure 3 for Confidence Calibration and Rationalization for LLMs via Multi-Agent Deliberation
Figure 4 for Confidence Calibration and Rationalization for LLMs via Multi-Agent Deliberation
Viaarxiv icon

Reinforcement Learning with Dynamic Multi-Reward Weighting for Multi-Style Controllable Generation

Add code
Feb 21, 2024
Viaarxiv icon

Talk Through It: End User Directed Manipulation Learning

Add code
Feb 19, 2024
Figure 1 for Talk Through It: End User Directed Manipulation Learning
Figure 2 for Talk Through It: End User Directed Manipulation Learning
Figure 3 for Talk Through It: End User Directed Manipulation Learning
Figure 4 for Talk Through It: End User Directed Manipulation Learning
Viaarxiv icon

Shallow Synthesis of Knowledge in GPT-Generated Texts: A Case Study in Automatic Related Work Composition

Add code
Feb 19, 2024
Viaarxiv icon

Chain-of-Instructions: Compositional Instruction Tuning on Large Language Models

Add code
Feb 18, 2024
Figure 1 for Chain-of-Instructions: Compositional Instruction Tuning on Large Language Models
Figure 2 for Chain-of-Instructions: Compositional Instruction Tuning on Large Language Models
Figure 3 for Chain-of-Instructions: Compositional Instruction Tuning on Large Language Models
Figure 4 for Chain-of-Instructions: Compositional Instruction Tuning on Large Language Models
Viaarxiv icon

Tuning Large Multimodal Models for Videos using Reinforcement Learning from AI Feedback

Add code
Feb 16, 2024
Figure 1 for Tuning Large Multimodal Models for Videos using Reinforcement Learning from AI Feedback
Figure 2 for Tuning Large Multimodal Models for Videos using Reinforcement Learning from AI Feedback
Figure 3 for Tuning Large Multimodal Models for Videos using Reinforcement Learning from AI Feedback
Figure 4 for Tuning Large Multimodal Models for Videos using Reinforcement Learning from AI Feedback
Viaarxiv icon

II-MMR: Identifying and Improving Multi-modal Multi-hop Reasoning in Visual Question Answering

Add code
Feb 16, 2024
Figure 1 for II-MMR: Identifying and Improving Multi-modal Multi-hop Reasoning in Visual Question Answering
Figure 2 for II-MMR: Identifying and Improving Multi-modal Multi-hop Reasoning in Visual Question Answering
Figure 3 for II-MMR: Identifying and Improving Multi-modal Multi-hop Reasoning in Visual Question Answering
Figure 4 for II-MMR: Identifying and Improving Multi-modal Multi-hop Reasoning in Visual Question Answering
Viaarxiv icon

Threads of Subtlety: Detecting Machine-Generated Texts Through Discourse Motifs

Add code
Feb 16, 2024
Viaarxiv icon

Under the Surface: Tracking the Artifactuality of LLM-Generated Data

Add code
Jan 30, 2024
Figure 1 for Under the Surface: Tracking the Artifactuality of LLM-Generated Data
Figure 2 for Under the Surface: Tracking the Artifactuality of LLM-Generated Data
Figure 3 for Under the Surface: Tracking the Artifactuality of LLM-Generated Data
Figure 4 for Under the Surface: Tracking the Artifactuality of LLM-Generated Data
Viaarxiv icon

SelectLLM: Can LLMs Select Important Instructions to Annotate?

Add code
Jan 29, 2024
Figure 1 for SelectLLM: Can LLMs Select Important Instructions to Annotate?
Figure 2 for SelectLLM: Can LLMs Select Important Instructions to Annotate?
Figure 3 for SelectLLM: Can LLMs Select Important Instructions to Annotate?
Figure 4 for SelectLLM: Can LLMs Select Important Instructions to Annotate?
Viaarxiv icon