Picture for Shaolei Zhang

Shaolei Zhang

LLaMA-Omni2: LLM-based Real-time Spoken Chatbot with Autoregressive Streaming Speech Synthesis

Add code
May 05, 2025
Viaarxiv icon

Prompt Guiding Multi-Scale Adaptive Sparse Representation-driven Network for Low-Dose CT MAR

Add code
Apr 28, 2025
Viaarxiv icon

Prompt-Guided Dual-Path UNet with Mamba for Medical Image Segmentation

Add code
Mar 25, 2025
Viaarxiv icon

LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token

Add code
Jan 07, 2025
Viaarxiv icon

Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation

Add code
Jan 01, 2025
Figure 1 for Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation
Figure 2 for Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation
Figure 3 for Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation
Figure 4 for Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation
Viaarxiv icon

Auto-RAG: Autonomous Retrieval-Augmented Generation for Large Language Models

Add code
Nov 29, 2024
Viaarxiv icon

BayLing 2: A Multilingual Large Language Model with Efficient Language Alignment

Add code
Nov 25, 2024
Viaarxiv icon

LLaMA-Omni: Seamless Speech Interaction with Large Language Models

Add code
Sep 10, 2024
Figure 1 for LLaMA-Omni: Seamless Speech Interaction with Large Language Models
Figure 2 for LLaMA-Omni: Seamless Speech Interaction with Large Language Models
Figure 3 for LLaMA-Omni: Seamless Speech Interaction with Large Language Models
Figure 4 for LLaMA-Omni: Seamless Speech Interaction with Large Language Models
Viaarxiv icon

Agent-SiMT: Agent-assisted Simultaneous Machine Translation with Large Language Models

Add code
Jun 12, 2024
Viaarxiv icon

A Non-autoregressive Generation Framework for End-to-End Simultaneous Speech-to-Any Translation

Add code
Jun 11, 2024
Viaarxiv icon