Alert button

"Information": models, code, and papers
Alert button

CALM: Contrastive Cross-modal Speaking Style Modeling for Expressive Text-to-Speech Synthesis

Aug 30, 2023
Yi Meng, Xiang Li, Zhiyong Wu, Tingtian Li, Zixun Sun, Xinyu Xiao, Chi Sun, Hui Zhan, Helen Meng

Figure 1 for CALM: Contrastive Cross-modal Speaking Style Modeling for Expressive Text-to-Speech Synthesis
Figure 2 for CALM: Contrastive Cross-modal Speaking Style Modeling for Expressive Text-to-Speech Synthesis
Figure 3 for CALM: Contrastive Cross-modal Speaking Style Modeling for Expressive Text-to-Speech Synthesis
Figure 4 for CALM: Contrastive Cross-modal Speaking Style Modeling for Expressive Text-to-Speech Synthesis
Viaarxiv icon

A Multimodal Learning Framework for Comprehensive 3D Mineral Prospectivity Modeling with Jointly Learned Structure-Fluid Relationships

Sep 06, 2023
Yang Zheng, Hao Deng, Ruisheng Wang, Jingjie Wu

Viaarxiv icon

Anomalous Sound Detection Using Self-Attention-Based Frequency Pattern Analysis of Machine Sounds

Aug 27, 2023
Hejing Zhang, Jian Guan, Qiaoxi Zhu, Feiyang Xiao, Youde Liu

Figure 1 for Anomalous Sound Detection Using Self-Attention-Based Frequency Pattern Analysis of Machine Sounds
Figure 2 for Anomalous Sound Detection Using Self-Attention-Based Frequency Pattern Analysis of Machine Sounds
Figure 3 for Anomalous Sound Detection Using Self-Attention-Based Frequency Pattern Analysis of Machine Sounds
Figure 4 for Anomalous Sound Detection Using Self-Attention-Based Frequency Pattern Analysis of Machine Sounds
Viaarxiv icon

Distributional Data Augmentation Methods for Low Resource Language

Sep 09, 2023
Mosleh Mahamud, Zed Lee, Isak Samsten

Viaarxiv icon

DeRi-IGP: Manipulating Rigid Objects Using Deformable Objects via Iterative Grasp-Pull

Sep 09, 2023
Zixing Wang, Ahmed H. Qureshi

Figure 1 for DeRi-IGP: Manipulating Rigid Objects Using Deformable Objects via Iterative Grasp-Pull
Figure 2 for DeRi-IGP: Manipulating Rigid Objects Using Deformable Objects via Iterative Grasp-Pull
Figure 3 for DeRi-IGP: Manipulating Rigid Objects Using Deformable Objects via Iterative Grasp-Pull
Figure 4 for DeRi-IGP: Manipulating Rigid Objects Using Deformable Objects via Iterative Grasp-Pull
Viaarxiv icon

Exploring Music Genre Classification: Algorithm Analysis and Deployment Architecture

Sep 09, 2023
Ayan Biswas, Supriya Dhabal, Palaniandavar Venkateswaran

Figure 1 for Exploring Music Genre Classification: Algorithm Analysis and Deployment Architecture
Figure 2 for Exploring Music Genre Classification: Algorithm Analysis and Deployment Architecture
Figure 3 for Exploring Music Genre Classification: Algorithm Analysis and Deployment Architecture
Figure 4 for Exploring Music Genre Classification: Algorithm Analysis and Deployment Architecture
Viaarxiv icon

osmAG: Hierarchical Semantic Topometric Area Graph Maps in the OSM Format for Mobile Robotics

Sep 09, 2023
Delin Feng, Chengqian Li, Yongqi Zhang, Chen Yu, Soeren Schwertfeger

Figure 1 for osmAG: Hierarchical Semantic Topometric Area Graph Maps in the OSM Format for Mobile Robotics
Figure 2 for osmAG: Hierarchical Semantic Topometric Area Graph Maps in the OSM Format for Mobile Robotics
Figure 3 for osmAG: Hierarchical Semantic Topometric Area Graph Maps in the OSM Format for Mobile Robotics
Figure 4 for osmAG: Hierarchical Semantic Topometric Area Graph Maps in the OSM Format for Mobile Robotics
Viaarxiv icon

Enhancing Hyperedge Prediction with Context-Aware Self-Supervised Learning

Sep 11, 2023
Yunyong Ko, Hanghang Tong, Sang-Wook Kim

Figure 1 for Enhancing Hyperedge Prediction with Context-Aware Self-Supervised Learning
Figure 2 for Enhancing Hyperedge Prediction with Context-Aware Self-Supervised Learning
Figure 3 for Enhancing Hyperedge Prediction with Context-Aware Self-Supervised Learning
Figure 4 for Enhancing Hyperedge Prediction with Context-Aware Self-Supervised Learning
Viaarxiv icon

SayNav: Grounding Large Language Models for Dynamic Planning to Navigation in New Environments

Sep 11, 2023
Abhinav Rajvanshi, Karan Sikka, Xiao Lin, Bhoram Lee, Han-Pang Chiu, Alvaro Velasquez

Figure 1 for SayNav: Grounding Large Language Models for Dynamic Planning to Navigation in New Environments
Figure 2 for SayNav: Grounding Large Language Models for Dynamic Planning to Navigation in New Environments
Figure 3 for SayNav: Grounding Large Language Models for Dynamic Planning to Navigation in New Environments
Figure 4 for SayNav: Grounding Large Language Models for Dynamic Planning to Navigation in New Environments
Viaarxiv icon

Towards Content-based Pixel Retrieval in Revisited Oxford and Paris

Sep 11, 2023
Guoyuan An, Woo Jae Kim, Saelyne Yang, Rong Li, Yuchi Huo, Sung-Eui Yoon

Figure 1 for Towards Content-based Pixel Retrieval in Revisited Oxford and Paris
Figure 2 for Towards Content-based Pixel Retrieval in Revisited Oxford and Paris
Figure 3 for Towards Content-based Pixel Retrieval in Revisited Oxford and Paris
Figure 4 for Towards Content-based Pixel Retrieval in Revisited Oxford and Paris
Viaarxiv icon