Di Jiang

May

Multimodal Peer Review Simulation with Actionable To-Do Recommendations for Community-Aware Manuscript Revisions

Nov 14, 2025

CantoASR: Prosody-Aware ASR-LALM Collaboration for Low-Resource Cantonese

Nov 06, 2025

Contextualized Token Discrimination for Speech Search Query Correction

Sep 04, 2025

EraRAG: Efficient and Incremental Retrieval Augmented Generation for Growing Corpora

Jun 26, 2025

Technical Report: A Practical Guide to Kaldi ASR Optimization

Jun 08, 2025

QualBench: Benchmarking Chinese LLMs with Localized Professional Qualifications for Vertical Domain Evaluation

May 08, 2025

Dialogue Language Model with Large-Scale Persona Data Engineering

Dec 12, 2024

Dial-In LLM: Human-Aligned Dialogue Intent Clustering with LLM-in-the-loop

Dec 12, 2024

ASR-EC Benchmark: Evaluating Large Language Models on Chinese ASR Error Correction

Dec 04, 2024

Acoustic Model Optimization over Multiple Data Sources: Merging and Valuation

Oct 21, 2024