Picture for Bo Ren

Bo Ren

Lightweight Prompt Biasing for Contextualized End-to-End ASR Systems

Add code
Jun 06, 2025
Viaarxiv icon

Towards Efficient Speech-Text Jointly Decoding within One Speech Language Model

Add code
Jun 04, 2025
Viaarxiv icon

TransparentGS: Fast Inverse Rendering of Transparent Objects with Gaussians

Add code
May 01, 2025
Viaarxiv icon

Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs

Add code
Mar 03, 2025
Viaarxiv icon

AlignFormer: Modality Matching Can Achieve Better Zero-shot Instruction-Following Speech-LLM

Add code
Dec 02, 2024
Viaarxiv icon

Masked Angle-Aware Autoencoder for Remote Sensing Images

Add code
Aug 04, 2024
Viaarxiv icon

Lighting Every Darkness with 3DGS: Fast Training and Real-Time Rendering for HDR View Synthesis

Add code
Jun 10, 2024
Viaarxiv icon

On decoder-only architecture for speech-to-text and large language model integration

Add code
Jul 14, 2023
Viaarxiv icon

Visual Information Extraction in the Wild: Practical Dataset and End-to-end Solution

Add code
May 12, 2023
Viaarxiv icon

Multi-Space Neural Radiance Fields

Add code
May 07, 2023
Viaarxiv icon