Picture for Jiaming Wang

Jiaming Wang

OrthCaps: An Orthogonal CapsNet with Sparse Attention Routing and Pruning

Add code
Mar 20, 2024
Figure 1 for OrthCaps: An Orthogonal CapsNet with Sparse Attention Routing and Pruning
Figure 2 for OrthCaps: An Orthogonal CapsNet with Sparse Attention Routing and Pruning
Figure 3 for OrthCaps: An Orthogonal CapsNet with Sparse Attention Routing and Pruning
Figure 4 for OrthCaps: An Orthogonal CapsNet with Sparse Attention Routing and Pruning
Viaarxiv icon

An Embarrassingly Simple Approach for LLM with Strong ASR Capacity

Add code
Feb 13, 2024
Viaarxiv icon

Probable Object Location (POLo) Score Estimation for Efficient Object Goal Navigation

Add code
Nov 14, 2023
Viaarxiv icon

LauraGPT: Listen, Attend, Understand, and Regenerate Audio with GPT

Add code
Oct 11, 2023
Figure 1 for LauraGPT: Listen, Attend, Understand, and Regenerate Audio with GPT
Figure 2 for LauraGPT: Listen, Attend, Understand, and Regenerate Audio with GPT
Figure 3 for LauraGPT: Listen, Attend, Understand, and Regenerate Audio with GPT
Figure 4 for LauraGPT: Listen, Attend, Understand, and Regenerate Audio with GPT
Viaarxiv icon

Deep Reinforcement Learning Based Framework for Mobile Energy Disseminator Dispatching to Charge On-the-Road Electric Vehicles

Add code
Aug 29, 2023
Figure 1 for Deep Reinforcement Learning Based Framework for Mobile Energy Disseminator Dispatching to Charge On-the-Road Electric Vehicles
Figure 2 for Deep Reinforcement Learning Based Framework for Mobile Energy Disseminator Dispatching to Charge On-the-Road Electric Vehicles
Figure 3 for Deep Reinforcement Learning Based Framework for Mobile Energy Disseminator Dispatching to Charge On-the-Road Electric Vehicles
Figure 4 for Deep Reinforcement Learning Based Framework for Mobile Energy Disseminator Dispatching to Charge On-the-Road Electric Vehicles
Viaarxiv icon

kTrans: Knowledge-Aware Transformer for Binary Code Embedding

Add code
Aug 24, 2023
Figure 1 for kTrans: Knowledge-Aware Transformer for Binary Code Embedding
Figure 2 for kTrans: Knowledge-Aware Transformer for Binary Code Embedding
Figure 3 for kTrans: Knowledge-Aware Transformer for Binary Code Embedding
Figure 4 for kTrans: Knowledge-Aware Transformer for Binary Code Embedding
Viaarxiv icon

FunASR: A Fundamental End-to-End Speech Recognition Toolkit

Add code
May 18, 2023
Figure 1 for FunASR: A Fundamental End-to-End Speech Recognition Toolkit
Figure 2 for FunASR: A Fundamental End-to-End Speech Recognition Toolkit
Figure 3 for FunASR: A Fundamental End-to-End Speech Recognition Toolkit
Figure 4 for FunASR: A Fundamental End-to-End Speech Recognition Toolkit
Viaarxiv icon

TOLD: A Novel Two-Stage Overlap-Aware Framework for Speaker Diarization

Add code
Mar 08, 2023
Figure 1 for TOLD: A Novel Two-Stage Overlap-Aware Framework for Speaker Diarization
Figure 2 for TOLD: A Novel Two-Stage Overlap-Aware Framework for Speaker Diarization
Figure 3 for TOLD: A Novel Two-Stage Overlap-Aware Framework for Speaker Diarization
Figure 4 for TOLD: A Novel Two-Stage Overlap-Aware Framework for Speaker Diarization
Viaarxiv icon

MMSpeech: Multi-modal Multi-task Encoder-Decoder Pre-training for Speech Recognition

Add code
Nov 29, 2022
Figure 1 for MMSpeech: Multi-modal Multi-task Encoder-Decoder Pre-training for Speech Recognition
Figure 2 for MMSpeech: Multi-modal Multi-task Encoder-Decoder Pre-training for Speech Recognition
Figure 3 for MMSpeech: Multi-modal Multi-task Encoder-Decoder Pre-training for Speech Recognition
Figure 4 for MMSpeech: Multi-modal Multi-task Encoder-Decoder Pre-training for Speech Recognition
Viaarxiv icon

TANet: A new Paradigm for Global Face Super-resolution via Transformer-CNN Aggregation Network

Add code
Sep 16, 2021
Figure 1 for TANet: A new Paradigm for Global Face Super-resolution via Transformer-CNN Aggregation Network
Figure 2 for TANet: A new Paradigm for Global Face Super-resolution via Transformer-CNN Aggregation Network
Figure 3 for TANet: A new Paradigm for Global Face Super-resolution via Transformer-CNN Aggregation Network
Figure 4 for TANet: A new Paradigm for Global Face Super-resolution via Transformer-CNN Aggregation Network
Viaarxiv icon