Wen Wu

Sherman

Joint Shape-Position Optimization Enhanced 2D DOA Estimation in Movable Antenna Systems

Apr 05, 2026
Via arXiv

Can Heterogeneous Language Models Be Fused?

Apr 02, 2026
Via arXiv

Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale

Mar 26, 2026
Via arXiv

STEP: Scientific Time-Series Encoder Pretraining via Cross-Domain Distillation

Mar 19, 2026
Via arXiv

CAST-TTS: A Simple Cross-Attention Framework for Unified Timbre Control in TTS

Mar 17, 2026
Via arXiv

One Brain, Omni Modalities: Towards Unified Non-Invasive Brain Decoding with Large Language Models

Feb 25, 2026
Via arXiv

HIPPO: Accelerating Video Large Language Models Inference via Holistic-aware Parallel Speculative Decoding

Jan 13, 2026
Via arXiv

MENTOR: A Metacognition-Driven Self-Evolution Framework for Uncovering and Mitigating Implicit Risks in LLMs on Domain Tasks

Nov 10, 2025
Via arXiv

SPEAR: A Unified SSL Framework for Learning Speech and Audio Representations

Oct 29, 2025
Via arXiv

Towards Cross-Task Suicide Risk Detection via Speech LLM

Sep 26, 2025
Via arXiv