Picture for Hongjie Chen

Hongjie Chen

Dolby Laboratories

Variable-Length Audio Fingerprinting

Add code
Mar 25, 2026
Viaarxiv icon

Seeking Universal Shot Language Understanding Solutions

Add code
Mar 19, 2026
Viaarxiv icon

InfinityStory: Unlimited Video Generation with World Consistency and Character-Aware Shot Transitions

Add code
Mar 04, 2026
Viaarxiv icon

Human-Aligned MLLM Judges for Fine-Grained Image Editing Evaluation: A Benchmark, Framework, and Analysis

Add code
Feb 13, 2026
Viaarxiv icon

Segment Length Matters: A Study of Segment Lengths on Audio Fingerprinting Performance

Add code
Jan 25, 2026
Viaarxiv icon

A Unified Spoken Language Model with Injected Emotional-Attribution Thinking for Human-like Interaction

Add code
Jan 08, 2026
Viaarxiv icon

Measuring Time-Series Dataset Similarity using Wasserstein Distance

Add code
Jul 29, 2025
Figure 1 for Measuring Time-Series Dataset Similarity using Wasserstein Distance
Figure 2 for Measuring Time-Series Dataset Similarity using Wasserstein Distance
Figure 3 for Measuring Time-Series Dataset Similarity using Wasserstein Distance
Figure 4 for Measuring Time-Series Dataset Similarity using Wasserstein Distance
Viaarxiv icon

TELEVAL: A Dynamic Benchmark Designed for Spoken Language Models in Chinese Interactive Scenarios

Add code
Jul 24, 2025
Viaarxiv icon

BoSS: Beyond-Semantic Speech

Add code
Jul 23, 2025
Viaarxiv icon

A Survey on Long-Video Storytelling Generation: Architectures, Consistency, and Cinematic Quality

Add code
Jul 09, 2025
Figure 1 for A Survey on Long-Video Storytelling Generation: Architectures, Consistency, and Cinematic Quality
Figure 2 for A Survey on Long-Video Storytelling Generation: Architectures, Consistency, and Cinematic Quality
Viaarxiv icon