Alert button

"Time": models, code, and papers
Alert button

R2-Talker: Realistic Real-Time Talking Head Synthesis with Hash Grid Landmarks Encoding and Progressive Multilayer Conditioning

Dec 09, 2023
Zhiling Ye, LiangGuo Zhang, Dingheng Zeng, Quan Lu, Ning Jiang

Viaarxiv icon

Context-aware Decoding Reduces Hallucination in Query-focused Summarization

Add code
Bookmark button
Alert button
Dec 31, 2023
Zhichao Xu

Viaarxiv icon

Distributed Multi-Object Tracking Under Limited Field of View Heterogeneous Sensors with Density Clustering

Dec 31, 2023
Fei Chen, Hoa Van Nguyen, Alex S. Leong, Sabita Panicker, Robin Baker, Damith C. Ranasinghe

Viaarxiv icon

Brain Tumor Segmentation Based on Deep Learning, Attention Mechanisms, and Energy-Based Uncertainty Prediction

Add code
Bookmark button
Alert button
Dec 31, 2023
Zachary Schwehr, Sriman Achanta

Viaarxiv icon

A Multi-Task, Multi-Modal Approach for Predicting Categorical and Dimensional Emotions

Dec 31, 2023
Alex-Răzvan Ispas, Théo Deschamps-Berger, Laurence Devillers

Viaarxiv icon

TrailBlazer: Trajectory Control for Diffusion-Based Video Generation

Add code
Bookmark button
Alert button
Dec 31, 2023
Wan-Duo Kurt Ma, J. P. Lewis, W. Bastiaan Kleijn

Viaarxiv icon

TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video Understanding

Add code
Bookmark button
Alert button
Dec 04, 2023
Shuhuai Ren, Linli Yao, Shicheng Li, Xu Sun, Lu Hou

Viaarxiv icon

LLM4VG: Large Language Models Evaluation for Video Grounding

Dec 28, 2023
Wei Feng, Xin Wang, Hong Chen, Zeyang Zhang, Zihan Song, Yuwei Zhou, Wenwu Zhu

Viaarxiv icon

Stateful FastConformer with Cache-based Inference for Streaming Automatic Speech Recognition

Dec 27, 2023
Vahid Noroozi, Somshubra Majumdar, Ankur Kumar, Jagadeesh Balam, Boris Ginsburg

Viaarxiv icon

Integrating Edges into U-Net Models with Explainable Activation Maps for Brain Tumor Segmentation using MR Images

Jan 02, 2024
Subin Sahayam, Umarani Jayaraman

Viaarxiv icon