Alert button
Picture for Huayi Wang

Huayi Wang

Alert button

Rethinking Similarity Search: Embracing Smarter Mechanisms over Smarter Data

Aug 02, 2023
Renzhi Wu, Jingfan Meng, Jie Jeff Xu, Huayi Wang, Kexin Rong

Figure 1 for Rethinking Similarity Search: Embracing Smarter Mechanisms over Smarter Data
Figure 2 for Rethinking Similarity Search: Embracing Smarter Mechanisms over Smarter Data
Figure 3 for Rethinking Similarity Search: Embracing Smarter Mechanisms over Smarter Data
Figure 4 for Rethinking Similarity Search: Embracing Smarter Mechanisms over Smarter Data

In this vision paper, we propose a shift in perspective for improving the effectiveness of similarity search. Rather than focusing solely on enhancing the data quality, particularly machine learning-generated embeddings, we advocate for a more comprehensive approach that also enhances the underpinning search mechanisms. We highlight three novel avenues that call for a redefinition of the similarity search problem: exploiting implicit data structures and distributions, engaging users in an iterative feedback loop, and moving beyond a single query vector. These novel pathways have gained relevance in emerging applications such as large-scale language models, video clip retrieval, and data labeling. We discuss the corresponding research challenges posed by these new problem areas and share insights from our preliminary discoveries.

Viaarxiv icon

Image-free multi-character recognition

Dec 20, 2021
Huayi Wang, Chunli Zhu, Liheng Bian

Figure 1 for Image-free multi-character recognition
Figure 2 for Image-free multi-character recognition
Figure 3 for Image-free multi-character recognition
Figure 4 for Image-free multi-character recognition

The recently developed image-free sensing technique maintains the advantages of both the light hardware and software, which has been applied in simple target classification and motion tracking. In practical applications, however, there usually exist multiple targets in the field of view, where existing trials fail to produce multi-semantic information. In this letter, we report a novel image-free sensing technique to tackle the multi-target recognition challenge for the first time. Different from the convolutional layer stack of image-free single-pixel networks, the reported CRNN network utilities the bidirectional LSTM architecture to predict the distribution of multiple characters simultaneously. The framework enables to capture the long-range dependencies, providing a high recognition accuracy of multiple characters. We demonstrated the technique's effectiveness in license plate detection, which achieved 87.60% recognition accuracy at a 5% sampling rate with a higher than 100 FPS refresh rate.

* 17pages, 4figures 
Viaarxiv icon