Talking


English to Central Kurdish Speech Translation: Corpus Creation, Evaluation, and Orthographic Standardization

Add code
Apr 01, 2026
Viaarxiv icon

Making Sense of AI Agents Hype: Adoption, Architectures, and Takeaways from Practitioners

Add code
Mar 31, 2026
Viaarxiv icon

LongCat-Next: Lexicalizing Modalities as Discrete Tokens

Add code
Mar 29, 2026
Viaarxiv icon

Leveraging Avatar Fingerprinting: A Multi-Generator Photorealistic Talking-Head Public Database and Benchmark

Add code
Mar 27, 2026
Viaarxiv icon

Good Scores, Bad Data: A Metric for Multimodal Coherence

Add code
Mar 26, 2026
Viaarxiv icon

Real Talk, Virtual Faces: A Formal Concept Analysis of Personality and Sentiment in Influencer Audiences

Add code
Mar 25, 2026
Viaarxiv icon

Toward Integrated Sensing, Communications, and Edge Intelligence Networks

Add code
Mar 24, 2026
Viaarxiv icon

Multimodal Training to Unimodal Deployment: Leveraging Unstructured Data During Training to Optimize Structured Data Only Deployment

Add code
Mar 23, 2026
Viaarxiv icon

Timing In stand-up Comedy: Text, Audio, Laughter, Kinesics (TIC-TALK): Pipeline and Database for the Multimodal Study of Comedic Timing

Add code
Mar 23, 2026
Viaarxiv icon

Current LLMs still cannot 'talk much' about grammar modules: Evidence from syntax

Add code
Mar 23, 2026
Viaarxiv icon