Bowen Zhang

Hunyuan3D 2.1: From Images to High-Fidelity 3D Assets with Production-Ready PBR Material

Jun 18, 2025

Abstract, Align, Predict: Zero-Shot Stance Detection via Cognitive Inductive Reasoning

Jun 16, 2025

Automated evaluation of children's speech fluency for low-resource languages

May 26, 2025

MiniMax-Speech: Intrinsic Zero-Shot Text-to-Speech with a Learnable Speaker Encoder

May 12, 2025

CSE-SFP: Enabling Unsupervised Sentence Representation Learning via a Single Forward Pass

May 01, 2025

C-MTCSD: A Chinese Multi-Turn Conversational Stance Detection Dataset

Apr 14, 2025

PBR3DGen: A VLM-guided Mesh Generation with High-quality PBR Texture

Mar 14, 2025

DiT-Air: Revisiting the Efficiency of Diffusion Model Architecture Design in Text to Image Generation

Mar 13, 2025

OR-LLM-Agent: Automating Modeling and Solving of Operations Research Optimization Problem with Reasoning Large Language Model

Mar 13, 2025

CLIP-UP: A Simple and Efficient Mixture-of-Experts CLIP Training Recipe with Sparse Upcycling

Feb 03, 2025