Alert button

"Information": models, code, and papers
Alert button

TextMonkey: An OCR-Free Large Multimodal Model for Understanding Document

Mar 07, 2024
Yuliang Liu, Biao Yang, Qiang Liu, Zhang Li, Zhiyin Ma, Shuo Zhang, Xiang Bai

Figure 1 for TextMonkey: An OCR-Free Large Multimodal Model for Understanding Document
Figure 2 for TextMonkey: An OCR-Free Large Multimodal Model for Understanding Document
Figure 3 for TextMonkey: An OCR-Free Large Multimodal Model for Understanding Document
Figure 4 for TextMonkey: An OCR-Free Large Multimodal Model for Understanding Document
Viaarxiv icon

That's My Point: Compact Object-centric LiDAR Pose Estimation for Large-scale Outdoor Localisation

Mar 07, 2024
Georgi Pramatarov, Matthew Gadd, Paul Newman, Daniele De Martini

Viaarxiv icon

CN-RMA: Combined Network with Ray Marching Aggregation for 3D Indoors Object Detection from Multi-view Images

Mar 07, 2024
Guanlin Shen, Jingwei Huang, Zhihua Hu, Bin Wang

Figure 1 for CN-RMA: Combined Network with Ray Marching Aggregation for 3D Indoors Object Detection from Multi-view Images
Figure 2 for CN-RMA: Combined Network with Ray Marching Aggregation for 3D Indoors Object Detection from Multi-view Images
Figure 3 for CN-RMA: Combined Network with Ray Marching Aggregation for 3D Indoors Object Detection from Multi-view Images
Figure 4 for CN-RMA: Combined Network with Ray Marching Aggregation for 3D Indoors Object Detection from Multi-view Images
Viaarxiv icon

Wisdom of the Silicon Crowd: LLM Ensemble Prediction Capabilities Rival Human Crowd Accuracy

Mar 06, 2024
Philipp Schoenegger, Indre Tuminauskaite, Peter S. Park, Philip E. Tetlock

Viaarxiv icon

Attention Guidance Mechanism for Handwritten Mathematical Expression Recognition

Mar 05, 2024
Yutian Liu, Wenjun Ke, Jianguo Wei

Viaarxiv icon

FinReport: Explainable Stock Earnings Forecasting via News Factor Analyzing Model

Mar 05, 2024
Xiangyu Li, Xinjie Shen, Yawen Zeng, Xiaofen Xing, Jin Xu

Figure 1 for FinReport: Explainable Stock Earnings Forecasting via News Factor Analyzing Model
Figure 2 for FinReport: Explainable Stock Earnings Forecasting via News Factor Analyzing Model
Figure 3 for FinReport: Explainable Stock Earnings Forecasting via News Factor Analyzing Model
Figure 4 for FinReport: Explainable Stock Earnings Forecasting via News Factor Analyzing Model
Viaarxiv icon

Text classification of column headers with a controlled vocabulary: leveraging LLMs for metadata enrichment

Mar 05, 2024
Margherita Martorana, Tobias Kuhn, Lise Stork, Jacco van Ossenbruggen

Viaarxiv icon

Multi-Scale Subgraph Contrastive Learning

Mar 05, 2024
Yanbei Liu, Yu Zhao, Xiao Wang, Lei Geng, Zhitao Xiao

Figure 1 for Multi-Scale Subgraph Contrastive Learning
Figure 2 for Multi-Scale Subgraph Contrastive Learning
Figure 3 for Multi-Scale Subgraph Contrastive Learning
Figure 4 for Multi-Scale Subgraph Contrastive Learning
Viaarxiv icon

Found in the Middle: How Language Models Use Long Contexts Better via Plug-and-Play Positional Encoding

Mar 05, 2024
Zhenyu Zhang, Runjin Chen, Shiwei Liu, Zhewei Yao, Olatunji Ruwase, Beidi Chen, Xiaoxia Wu, Zhangyang Wang

Figure 1 for Found in the Middle: How Language Models Use Long Contexts Better via Plug-and-Play Positional Encoding
Figure 2 for Found in the Middle: How Language Models Use Long Contexts Better via Plug-and-Play Positional Encoding
Figure 3 for Found in the Middle: How Language Models Use Long Contexts Better via Plug-and-Play Positional Encoding
Figure 4 for Found in the Middle: How Language Models Use Long Contexts Better via Plug-and-Play Positional Encoding
Viaarxiv icon

Query Augmentation by Decoding Semantics from Brain Signals

Mar 03, 2024
Ziyi Ye, Jingtao Zhan, Qingyao Ai, Yiqun Liu, Maarten de Rijke, Christina Lioma, Tuukka Ruotsalo

Viaarxiv icon