Picture for Qian Yang

Qian Yang

MSceneSpeech: A Multi-Scene Speech Dataset For Expressive Speech Synthesis

Add code
Jul 19, 2024
Viaarxiv icon

Qwen2-Audio Technical Report

Add code
Jul 15, 2024
Viaarxiv icon

Decompose and Compare Consistency: Measuring VLMs' Answer Reliability via Task-Decomposition Consistency Comparison

Add code
Jul 10, 2024
Viaarxiv icon

CodeHalu: Code Hallucinations in LLMs Driven by Execution-based Verification

Add code
Apr 30, 2024
Figure 1 for CodeHalu: Code Hallucinations in LLMs Driven by Execution-based Verification
Figure 2 for CodeHalu: Code Hallucinations in LLMs Driven by Execution-based Verification
Figure 3 for CodeHalu: Code Hallucinations in LLMs Driven by Execution-based Verification
Figure 4 for CodeHalu: Code Hallucinations in LLMs Driven by Execution-based Verification
Viaarxiv icon

A Piece of Theatre: Investigating How Teachers Design LLM Chatbots to Assist Adolescent Cyberbullying Education

Add code
Feb 27, 2024
Figure 1 for A Piece of Theatre: Investigating How Teachers Design LLM Chatbots to Assist Adolescent Cyberbullying Education
Figure 2 for A Piece of Theatre: Investigating How Teachers Design LLM Chatbots to Assist Adolescent Cyberbullying Education
Viaarxiv icon

AIR-Bench: Benchmarking Large Audio-Language Models via Generative Comprehension

Add code
Feb 12, 2024
Viaarxiv icon

Leveraging Generative AI for Clinical Evidence Summarization Needs to Achieve Trustworthiness

Add code
Nov 19, 2023
Figure 1 for Leveraging Generative AI for Clinical Evidence Summarization Needs to Achieve Trustworthiness
Viaarxiv icon

Qwen-Audio: Advancing Universal Audio Understanding via Unified Large-Scale Audio-Language Models

Add code
Nov 14, 2023
Figure 1 for Qwen-Audio: Advancing Universal Audio Understanding via Unified Large-Scale Audio-Language Models
Figure 2 for Qwen-Audio: Advancing Universal Audio Understanding via Unified Large-Scale Audio-Language Models
Figure 3 for Qwen-Audio: Advancing Universal Audio Understanding via Unified Large-Scale Audio-Language Models
Figure 4 for Qwen-Audio: Advancing Universal Audio Understanding via Unified Large-Scale Audio-Language Models
Viaarxiv icon

The Participatory Turn in AI Design: Theoretical Foundations and the Current State of Practice

Add code
Oct 02, 2023
Viaarxiv icon

Mega-TTS: Zero-Shot Text-to-Speech at Scale with Intrinsic Inductive Bias

Add code
Jun 06, 2023
Figure 1 for Mega-TTS: Zero-Shot Text-to-Speech at Scale with Intrinsic Inductive Bias
Figure 2 for Mega-TTS: Zero-Shot Text-to-Speech at Scale with Intrinsic Inductive Bias
Figure 3 for Mega-TTS: Zero-Shot Text-to-Speech at Scale with Intrinsic Inductive Bias
Figure 4 for Mega-TTS: Zero-Shot Text-to-Speech at Scale with Intrinsic Inductive Bias
Viaarxiv icon