Picture for Kai Yu

Kai Yu

Sherman

AlignSum: Data Pyramid Hierarchical Fine-tuning for Aligning with Human Summarization Preference

Add code
Oct 01, 2024
Viaarxiv icon

TRANSAGENT: An LLM-Based Multi-Agent System for Code Translation

Add code
Sep 30, 2024
Viaarxiv icon

SciDFM: A Large Language Model with Mixture-of-Experts for Science

Add code
Sep 27, 2024
Figure 1 for SciDFM: A Large Language Model with Mixture-of-Experts for Science
Figure 2 for SciDFM: A Large Language Model with Mixture-of-Experts for Science
Figure 3 for SciDFM: A Large Language Model with Mixture-of-Experts for Science
Figure 4 for SciDFM: A Large Language Model with Mixture-of-Experts for Science
Viaarxiv icon

vec2wav 2.0: Advancing Voice Conversion via Discrete Token Vocoders

Add code
Sep 03, 2024
Figure 1 for vec2wav 2.0: Advancing Voice Conversion via Discrete Token Vocoders
Figure 2 for vec2wav 2.0: Advancing Voice Conversion via Discrete Token Vocoders
Figure 3 for vec2wav 2.0: Advancing Voice Conversion via Discrete Token Vocoders
Figure 4 for vec2wav 2.0: Advancing Voice Conversion via Discrete Token Vocoders
Viaarxiv icon

UrFound: Towards Universal Retinal Foundation Models via Knowledge-Guided Masked Modeling

Add code
Aug 10, 2024
Viaarxiv icon

DiveSound: LLM-Assisted Automatic Taxonomy Construction for Diverse Audio Generation

Add code
Jul 18, 2024
Viaarxiv icon

Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?

Add code
Jul 15, 2024
Figure 1 for Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?
Figure 2 for Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?
Figure 3 for Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?
Figure 4 for Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?
Viaarxiv icon

Semi-supervised Learning for Code-Switching ASR with Large Language Model Filter

Add code
Jul 05, 2024
Viaarxiv icon

On the Effectiveness of Acoustic BPE in Decoder-Only TTS

Add code
Jul 04, 2024
Viaarxiv icon

IBSEN: Director-Actor Agent Collaboration for Controllable and Interactive Drama Script Generation

Add code
Jul 01, 2024
Figure 1 for IBSEN: Director-Actor Agent Collaboration for Controllable and Interactive Drama Script Generation
Figure 2 for IBSEN: Director-Actor Agent Collaboration for Controllable and Interactive Drama Script Generation
Figure 3 for IBSEN: Director-Actor Agent Collaboration for Controllable and Interactive Drama Script Generation
Figure 4 for IBSEN: Director-Actor Agent Collaboration for Controllable and Interactive Drama Script Generation
Viaarxiv icon