Picture for Bowen Zhang

Bowen Zhang

Gaussian Variation Field Diffusion for High-fidelity Video-to-4D Synthesis

Add code
Jul 31, 2025
Viaarxiv icon

RareSpot: Spotting Small and Rare Wildlife in Aerial Imagery with Multi-Scale Consistency and Context-Aware Augmentation

Add code
Jun 23, 2025
Viaarxiv icon

Hunyuan3D 2.1: From Images to High-Fidelity 3D Assets with Production-Ready PBR Material

Add code
Jun 18, 2025
Viaarxiv icon

Abstract, Align, Predict: Zero-Shot Stance Detection via Cognitive Inductive Reasoning

Add code
Jun 16, 2025
Viaarxiv icon

Automated evaluation of children's speech fluency for low-resource languages

Add code
May 26, 2025
Viaarxiv icon

MiniMax-Speech: Intrinsic Zero-Shot Text-to-Speech with a Learnable Speaker Encoder

Add code
May 12, 2025
Viaarxiv icon

CSE-SFP: Enabling Unsupervised Sentence Representation Learning via a Single Forward Pass

Add code
May 01, 2025
Viaarxiv icon

C-MTCSD: A Chinese Multi-Turn Conversational Stance Detection Dataset

Add code
Apr 14, 2025
Viaarxiv icon

PBR3DGen: A VLM-guided Mesh Generation with High-quality PBR Texture

Add code
Mar 14, 2025
Viaarxiv icon

OR-LLM-Agent: Automating Modeling and Solving of Operations Research Optimization Problem with Reasoning Large Language Model

Add code
Mar 13, 2025
Viaarxiv icon