Picture for Wei Li

Wei Li

Victor

LiveCC: Learning Video LLM with Streaming Speech Transcription at Scale

Add code
Apr 22, 2025
Viaarxiv icon

DSPO: Direct Semantic Preference Optimization for Real-World Image Super-Resolution

Add code
Apr 21, 2025
Viaarxiv icon

Efficient Spiking Point Mamba for Point Cloud Analysis

Add code
Apr 19, 2025
Viaarxiv icon

Spiking Neural Network for Intra-cortical Brain Signal Decoding

Add code
Apr 12, 2025
Viaarxiv icon

Pangu Ultra: Pushing the Limits of Dense Large Language Models on Ascend NPUs

Add code
Apr 10, 2025
Viaarxiv icon

On the Suitability of Reinforcement Fine-Tuning to Visual Tasks

Add code
Apr 08, 2025
Viaarxiv icon

BRIDGES: Bridging Graph Modality and Large Language Models within EDA Tasks

Add code
Apr 07, 2025
Viaarxiv icon

JavisDiT: Joint Audio-Video Diffusion Transformer with Hierarchical Spatio-Temporal Prior Synchronization

Add code
Mar 30, 2025
Viaarxiv icon

Harmonizing Visual Representations for Unified Multimodal Understanding and Generation

Add code
Mar 27, 2025
Viaarxiv icon

OpenHuEval: Evaluating Large Language Model on Hungarian Specifics

Add code
Mar 27, 2025
Viaarxiv icon