Picture for Yuxuan Wang

Yuxuan Wang

Sherman

MedGEN-Bench: Contextually entangled benchmark for open-ended multimodal medical generation

Add code
Nov 18, 2025
Viaarxiv icon

Virtual Width Networks

Add code
Nov 17, 2025
Figure 1 for Virtual Width Networks
Figure 2 for Virtual Width Networks
Figure 3 for Virtual Width Networks
Figure 4 for Virtual Width Networks
Viaarxiv icon

A digital SRAM-based compute-in-memory macro for weight-stationary dynamic matrix multiplication in Transformer attention score computation

Add code
Nov 15, 2025
Figure 1 for A digital SRAM-based compute-in-memory macro for weight-stationary dynamic matrix multiplication in Transformer attention score computation
Figure 2 for A digital SRAM-based compute-in-memory macro for weight-stationary dynamic matrix multiplication in Transformer attention score computation
Figure 3 for A digital SRAM-based compute-in-memory macro for weight-stationary dynamic matrix multiplication in Transformer attention score computation
Figure 4 for A digital SRAM-based compute-in-memory macro for weight-stationary dynamic matrix multiplication in Transformer attention score computation
Viaarxiv icon

ParaS2S: Benchmarking and Aligning Spoken Language Models for Paralinguistic-aware Speech-to-Speech Interaction

Add code
Nov 11, 2025
Viaarxiv icon

NeuSpring: Neural Spring Fields for Reconstruction and Simulation of Deformable Objects from Videos

Add code
Nov 11, 2025
Viaarxiv icon

LooGLE v2: Are LLMs Ready for Real World Long Dependency Challenges?

Add code
Oct 26, 2025
Viaarxiv icon

Omni-Captioner: Data Pipeline, Models, and Benchmark for Omni Detailed Perception

Add code
Oct 14, 2025
Viaarxiv icon

AudioMarathon: A Comprehensive Benchmark for Long-Context Audio Understanding and Efficiency in Audio LLMs

Add code
Oct 08, 2025
Figure 1 for AudioMarathon: A Comprehensive Benchmark for Long-Context Audio Understanding and Efficiency in Audio LLMs
Figure 2 for AudioMarathon: A Comprehensive Benchmark for Long-Context Audio Understanding and Efficiency in Audio LLMs
Figure 3 for AudioMarathon: A Comprehensive Benchmark for Long-Context Audio Understanding and Efficiency in Audio LLMs
Figure 4 for AudioMarathon: A Comprehensive Benchmark for Long-Context Audio Understanding and Efficiency in Audio LLMs
Viaarxiv icon

UltraHiT: A Hierarchical Transformer Architecture for Generalizable Internal Carotid Artery Robotic Ultrasonography

Add code
Sep 17, 2025
Viaarxiv icon

Hunyuan3D Studio: End-to-End AI Pipeline for Game-Ready 3D Asset Generation

Add code
Sep 16, 2025
Figure 1 for Hunyuan3D Studio: End-to-End AI Pipeline for Game-Ready 3D Asset Generation
Figure 2 for Hunyuan3D Studio: End-to-End AI Pipeline for Game-Ready 3D Asset Generation
Figure 3 for Hunyuan3D Studio: End-to-End AI Pipeline for Game-Ready 3D Asset Generation
Figure 4 for Hunyuan3D Studio: End-to-End AI Pipeline for Game-Ready 3D Asset Generation
Viaarxiv icon