Wenhao Zhang

Trinity-RFT: A General-Purpose and Unified Framework for Reinforcement Fine-Tuning of Large Language Models

May 23, 2025
Via arXiv

CoT-Vid: Dynamic Chain-of-Thought Routing with Self Verification for Training-Free Video Reasoning

May 17, 2025
Via arXiv

3D Human Pose Estimation via Spatial Graph Order Attention and Temporal Body Aware Transformer

May 02, 2025
Via arXiv

Revisiting CAD Model Generation by Learning Raster Sketch

Mar 02, 2025
Via arXiv

Leveraging Dual Process Theory in Language Agent Framework for Real-time Simultaneous Human-AI Collaboration

Feb 17, 2025
Via arXiv

Recommender systems and reinforcement learning for building control and occupant interaction: A text-mining driven review of scientific literature

Nov 13, 2024
Via arXiv

Mutual Theory of Mind in Human-AI Collaboration: An Empirical Study with LLM-driven AI Agents in a Real-time Shared Workspace Task

Sep 13, 2024
Via arXiv

The Synergy between Data and Multi-Modal Large Language Models: A Survey from Co-Development Perspective

Jul 11, 2024
Via arXiv

SCSA: Exploring the Synergistic Effects Between Spatial and Channel Attention

Jul 06, 2024
Via arXiv

Partitioned Hankel-based Diffusion Models for Few-shot Low-dose CT Reconstruction

May 27, 2024
Via arXiv