Alert button

"Information": models, code, and papers
Alert button

Random forests for detecting weak signals and extracting physical information: a case study of magnetic navigation

Add code
Bookmark button
Alert button
Feb 21, 2024
Mohammadamin Moradi, Zheng-Meng Zhai, Aaron Nielsen, Ying-Cheng Lai

Viaarxiv icon

CN-RMA: Combined Network with Ray Marching Aggregation for 3D Indoors Object Detection from Multi-view Images

Add code
Bookmark button
Alert button
Mar 07, 2024
Guanlin Shen, Jingwei Huang, Zhihua Hu, Bin Wang

Figure 1 for CN-RMA: Combined Network with Ray Marching Aggregation for 3D Indoors Object Detection from Multi-view Images
Figure 2 for CN-RMA: Combined Network with Ray Marching Aggregation for 3D Indoors Object Detection from Multi-view Images
Figure 3 for CN-RMA: Combined Network with Ray Marching Aggregation for 3D Indoors Object Detection from Multi-view Images
Figure 4 for CN-RMA: Combined Network with Ray Marching Aggregation for 3D Indoors Object Detection from Multi-view Images
Viaarxiv icon

That's My Point: Compact Object-centric LiDAR Pose Estimation for Large-scale Outdoor Localisation

Mar 07, 2024
Georgi Pramatarov, Matthew Gadd, Paul Newman, Daniele De Martini

Figure 1 for That's My Point: Compact Object-centric LiDAR Pose Estimation for Large-scale Outdoor Localisation
Figure 2 for That's My Point: Compact Object-centric LiDAR Pose Estimation for Large-scale Outdoor Localisation
Figure 3 for That's My Point: Compact Object-centric LiDAR Pose Estimation for Large-scale Outdoor Localisation
Figure 4 for That's My Point: Compact Object-centric LiDAR Pose Estimation for Large-scale Outdoor Localisation
Viaarxiv icon

TextMonkey: An OCR-Free Large Multimodal Model for Understanding Document

Add code
Bookmark button
Alert button
Mar 07, 2024
Yuliang Liu, Biao Yang, Qiang Liu, Zhang Li, Zhiyin Ma, Shuo Zhang, Xiang Bai

Figure 1 for TextMonkey: An OCR-Free Large Multimodal Model for Understanding Document
Figure 2 for TextMonkey: An OCR-Free Large Multimodal Model for Understanding Document
Figure 3 for TextMonkey: An OCR-Free Large Multimodal Model for Understanding Document
Figure 4 for TextMonkey: An OCR-Free Large Multimodal Model for Understanding Document
Viaarxiv icon

Robust Wake Word Spotting With Frame-Level Cross-Modal Attention Based Audio-Visual Conformer

Mar 04, 2024
Haoxu Wang, Ming Cheng, Qiang Fu, Ming Li

Figure 1 for Robust Wake Word Spotting With Frame-Level Cross-Modal Attention Based Audio-Visual Conformer
Figure 2 for Robust Wake Word Spotting With Frame-Level Cross-Modal Attention Based Audio-Visual Conformer
Figure 3 for Robust Wake Word Spotting With Frame-Level Cross-Modal Attention Based Audio-Visual Conformer
Figure 4 for Robust Wake Word Spotting With Frame-Level Cross-Modal Attention Based Audio-Visual Conformer
Viaarxiv icon

SGD with Partial Hessian for Deep Neural Networks Optimization

Add code
Bookmark button
Alert button
Mar 05, 2024
Ying Sun, Hongwei Yong, Lei Zhang

Figure 1 for SGD with Partial Hessian for Deep Neural Networks Optimization
Figure 2 for SGD with Partial Hessian for Deep Neural Networks Optimization
Figure 3 for SGD with Partial Hessian for Deep Neural Networks Optimization
Figure 4 for SGD with Partial Hessian for Deep Neural Networks Optimization
Viaarxiv icon

On the Road to Portability: Compressing End-to-End Motion Planner for Autonomous Driving

Mar 02, 2024
Kaituo Feng, Changsheng Li, Dongchun Ren, Ye Yuan, Guoren Wang

Figure 1 for On the Road to Portability: Compressing End-to-End Motion Planner for Autonomous Driving
Figure 2 for On the Road to Portability: Compressing End-to-End Motion Planner for Autonomous Driving
Figure 3 for On the Road to Portability: Compressing End-to-End Motion Planner for Autonomous Driving
Figure 4 for On the Road to Portability: Compressing End-to-End Motion Planner for Autonomous Driving
Viaarxiv icon

Personalised Drug Identifier for Cancer Treatment with Transformers using Auxiliary Information

Feb 16, 2024
Aishwarya Jayagopal, Hansheng Xue, Ziyang He, Robert J. Walsh, Krishna Kumar Hariprasannan, David Shao Peng Tan, Tuan Zea Tan, Jason J. Pitt, Anand D. Jeyasekharan, Vaibhav Rajan

Viaarxiv icon

Debiasing Large Visual Language Models

Add code
Bookmark button
Alert button
Mar 08, 2024
Yi-Fan Zhang, Weichen Yu, Qingsong Wen, Xue Wang, Zhang Zhang, Liang Wang, Rong Jin, Tieniu Tan

Figure 1 for Debiasing Large Visual Language Models
Figure 2 for Debiasing Large Visual Language Models
Figure 3 for Debiasing Large Visual Language Models
Figure 4 for Debiasing Large Visual Language Models
Viaarxiv icon

Towards a Psychology of Machines: Large Language Models Predict Human Memory

Mar 08, 2024
Markus Huff, Elanur Ulakçı

Figure 1 for Towards a Psychology of Machines: Large Language Models Predict Human Memory
Figure 2 for Towards a Psychology of Machines: Large Language Models Predict Human Memory
Figure 3 for Towards a Psychology of Machines: Large Language Models Predict Human Memory
Viaarxiv icon