"Information": models, code, and papers

Poly Kernel Inception Network for Remote Sensing Detection

Mar 20, 2024
Xinhao Cai, Qiuxia Lai, Yuwei Wang, Wenguan Wang, Zeren Sun, Yazhou Yao

Bootstrapping Reinforcement Learning with Imitation for Vision-Based Agile Flight

Mar 18, 2024
Jiaxu Xing, Angel Romero, Leonard Bauersfeld, Davide Scaramuzza

FLex: Joint Pose and Dynamic Radiance Fields Optimization for Stereo Endoscopic Videos

Mar 18, 2024
Florian Philipp Stilz, Mert Asim Karaoglu, Felix Tristram, Nassir Navab, Benjamin Busam, Alexander Ladikos

CRS-Diff: Controllable Generative Remote Sensing Foundation Model

Mar 18, 2024
Datao Tang, Xiangyong Cao, Xingsong Hou, Zhongyuan Jiang, Deyu Meng

EMIE-MAP: Large-Scale Road Surface Reconstruction Based on Explicit Mesh and Implicit Encoding

Mar 18, 2024
Wenhua Wu, Qi Wang, Guangming Wang, Junping Wang, Tiankun Zhao, Yang Liu, Dongchao Gao, Zhe Liu, Hesheng Wang

HAC: Hash-grid Assisted Context for 3D Gaussian Splatting Compression

Mar 21, 2024
Yihang Chen, Qianyi Wu, Jianfei Cai, Mehrtash Harandi, Weiyao Lin

Ranking Distillation for Open-Ended Video Question Answering with Insufficient Labels

Mar 21, 2024
Tianming Liang, Chaolei Tan, Beihao Xia, Wei-Shi Zheng, Jian-Fang Hu

Editing Knowledge Representation of Language Model via Rephrased Prefix Prompts

Mar 21, 2024
Yuchen Cai, Ding Cao, Rongxi Guo, Yaqin Wen, Guiquan Liu, Enhong Chen

EventDance: Unsupervised Source-free Cross-modal Adaptation for Event-based Object Recognition

Mar 21, 2024
Xu Zheng, Lin Wang

MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems?

Mar 21, 2024
Renrui Zhang, Dongzhi Jiang, Yichi Zhang, Haokun Lin, Ziyu Guo, Pengshuo Qiu, Aojun Zhou, Pan Lu, Kai-Wei Chang, Peng Gao, Hongsheng Li
