Ruihua Song

YuLan: An Open-source Large Language Model

Jun 28, 2024

Multi-task Manipulation Policy Modeling with Visuomotor Latent Diffusion

Mar 12, 2024

SpeechComposer: Unifying Multiple Speech Tasks with Prompt Composition

Jan 31, 2024

What Makes for Good Visual Instructions? Synthesizing Complex Visual Reasoning Instructions for Visual Instruction Tuning

Nov 02, 2023

Parrot: Enhancing Multi-Turn Chat Models by Learning to Ask Questions

Oct 11, 2023

ViCo: Engaging Video Comment Generation with Human Preference Rewards

Aug 22, 2023

Pave the Way to Grasp Anything: Transferring Foundation Models for Universal Pick-Place Robots

Jun 25, 2023

RecAgent: A Novel Simulation Paradigm for Recommender Systems

Jun 05, 2023

AlphaBlock: Embodied Finetuning for Vision-Language Reasoning in Robot Manipulation

May 30, 2023

ComedicSpeech: Text To Speech For Stand-up Comedies in Low-Resource Scenarios

May 20, 2023