"Image": models, code, and papers

Embodied Multi-Modal Agent trained by an LLM from a Parallel TextWorld

Nov 28, 2023
Yijun Yang, Tianyi Zhou, Kanxue Li, Dapeng Tao, Lusong Li, Li Shen, Xiaodong He, Jing Jiang, Yuhui Shi

Spiking Neural Networks with Dynamic Time Steps for Vision Transformers

Nov 28, 2023
Gourav Datta, Zeyu Liu, Anni Li, Peter A. Beerel

An Innovative Tool for Uploading/Scraping Large Image Datasets on Social Networks

Nov 01, 2023
Nicolò Fabio Arceri, Oliver Giudice, Sebastiano Battiato

A Principled Hierarchical Deep Learning Approach to Joint Image Compression and Classification

Oct 30, 2023
Siyu Qi, Achintha Wijesinghe, Lahiru D. Chamain, Zhi Ding

Towards Machine Learning-based Quantitative Hyperspectral Image Guidance for Brain Tumor Resection

Nov 17, 2023
David Black, Declan Byrne, Anna Walke, Sidong Liu, Antonio Di Ieva, Sadahiro Kaneko, Walter Stummer, Septimiu Salcudean, Eric Suero Molina

Concept Sliders: LoRA Adaptors for Precise Control in Diffusion Models

Nov 20, 2023
Rohit Gandikota, Joanna Materzynska, Tingrui Zhou, Antonio Torralba, David Bau

Nighttime Thermal Infrared Image Colorization with Feedback-based Object Appearance Learning

Oct 24, 2023
Fu-Ya Luo, Shu-Lin Liu, Yi-Jun Cao, Kai-Fu Yang, Chang-Yong Xie, Yong Liu, Yong-Jie Li

Generating Human-Centric Visual Cues for Human-Object Interaction Detection via Large Vision-Language Models

Nov 26, 2023
Yu-Wei Zhan, Fan Liu, Xin Luo, Liqiang Nie, Xin-Shun Xu, Mohan Kankanhalli

OCT2Confocal: 3D CycleGAN based Translation of Retinal OCT Images to Confocal Microscopy

Nov 26, 2023
Xin Tian, Nantheera Anantrasirichai, Lindsay Nicholson, Alin Achim

Image-Based Soil Organic Carbon Remote Sensing from Satellite Images with Fourier Neural Operator and Structural Similarity

Nov 21, 2023
Ken C. L. Wong, Levente Klein, Ademir Ferreira da Silva, Hongzhi Wang, Jitendra Singh, Tanveer Syeda-Mahmood
