Alert button

"Text": models, code, and papers
Alert button

Careless Whisper: Speech-to-Text Hallucination Harms

Feb 12, 2024
Allison Koenecke, Anna Seo Gyeong Choi, Katelyn Mei, Hilke Schellmann, Mona Sloane

Viaarxiv icon

KnowPhish: Large Language Models Meet Multimodal Knowledge Graphs for Enhancing Reference-Based Phishing Detection

Mar 04, 2024
Yuexin Li, Chengyu Huang, Shumin Deng, Mei Lin Lock, Tri Cao, Nay Oo, Bryan Hooi, Hoon Wei Lim

Figure 1 for KnowPhish: Large Language Models Meet Multimodal Knowledge Graphs for Enhancing Reference-Based Phishing Detection
Figure 2 for KnowPhish: Large Language Models Meet Multimodal Knowledge Graphs for Enhancing Reference-Based Phishing Detection
Figure 3 for KnowPhish: Large Language Models Meet Multimodal Knowledge Graphs for Enhancing Reference-Based Phishing Detection
Figure 4 for KnowPhish: Large Language Models Meet Multimodal Knowledge Graphs for Enhancing Reference-Based Phishing Detection
Viaarxiv icon

Transcription and translation of videos using fine-tuned XLSR Wav2Vec2 on custom dataset and mBART

Mar 01, 2024
Aniket Tathe, Anand Kamble, Suyash Kumbharkar, Atharva Bhandare, Anirban C. Mitra

Figure 1 for Transcription and translation of videos using fine-tuned XLSR Wav2Vec2 on custom dataset and mBART
Figure 2 for Transcription and translation of videos using fine-tuned XLSR Wav2Vec2 on custom dataset and mBART
Figure 3 for Transcription and translation of videos using fine-tuned XLSR Wav2Vec2 on custom dataset and mBART
Viaarxiv icon

DivAvatar: Diverse 3D Avatar Generation with a Single Prompt

Feb 27, 2024
Weijing Tao, Biwen Lei, Kunhao Liu, Shijian Lu, Miaomiao Cui, Xuansong Xie, Chunyan Miao

Viaarxiv icon

Multimodal Learned Sparse Retrieval with Probabilistic Expansion Control

Feb 27, 2024
Thong Nguyen, Mariya Hendriksen, Andrew Yates, Maarten de Rijke

Viaarxiv icon

Artwork Explanation in Large-scale Vision Language Models

Feb 29, 2024
Kazuki Hayashi, Yusuke Sakai, Hidetaka Kamigaito, Katsuhiko Hayashi, Taro Watanabe

Figure 1 for Artwork Explanation in Large-scale Vision Language Models
Figure 2 for Artwork Explanation in Large-scale Vision Language Models
Figure 3 for Artwork Explanation in Large-scale Vision Language Models
Figure 4 for Artwork Explanation in Large-scale Vision Language Models
Viaarxiv icon

Partial Federated Learning

Mar 03, 2024
Tiantian Feng, Anil Ramakrishna, Jimit Majmudar, Charith Peris, Jixuan Wang, Clement Chung, Richard Zemel, Morteza Ziyadi, Rahul Gupta

Figure 1 for Partial Federated Learning
Figure 2 for Partial Federated Learning
Figure 3 for Partial Federated Learning
Figure 4 for Partial Federated Learning
Viaarxiv icon

Introduction to Algogens

Mar 03, 2024
Amir Shachar

Viaarxiv icon

MobileSpeech: A Fast and High-Fidelity Framework for Mobile Zero-Shot Text-to-Speech

Feb 14, 2024
Shengpeng Ji, Ziyue Jiang, Hanting Wang, Jialong Zuo, Zhou Zhao

Viaarxiv icon

Emotion Classification in Short English Texts using Deep Learning Techniques

Feb 25, 2024
Siddhanth Bhat

Viaarxiv icon