Kun Zhou

Hierarchical Control of Emotion Rendering in Speech Synthesis

Dec 17, 2024

RETQA: A Large-Scale Open-Domain Tabular Question Answering Dataset for Real Estate Sector

Dec 13, 2024

MaterialPicker: Multi-Modal Material Generation with Diffusion Transformers

Dec 04, 2024

Enhancing Visual Reasoning with Autonomous Imagination in Multimodal Large Language Models

Nov 27, 2024

ARM: Appearance Reconstruction Model for Relightable 3D Generation

Nov 16, 2024

Self-Calibrated Listwise Reranking with Large Language Models

Nov 07, 2024

Exploring the Design Space of Visual Context Representation in Video MLLMs

Oct 17, 2024

GS^3: Efficient Relighting with Triple Gaussian Splatting

Oct 15, 2024

Extracting and Transferring Abilities For Building Multi-lingual Ability-enhanced Large Language Models

Oct 10, 2024

Enabling Synergistic Full-Body Control in Prompt-Based Co-Speech Motion Generation

Oct 01, 2024