Alert button

"Image": models, code, and papers
Alert button

Uncertainty-aware Sampling for Long-tailed Semi-supervised Learning

Jan 09, 2024
Kuo Yang, Duo Li, Menghan Hu, Guangtao Zhai, Xiaokang Yang, Xiao-Ping Zhang

Viaarxiv icon

SonicVisionLM: Playing Sound with Vision Language Models

Jan 09, 2024
Zhifeng Xie, Shengye Yu, Mengtian Li, Qile He, Chaofeng Chen, Yu-Gang Jiang

Viaarxiv icon

RS-DGC: Exploring Neighborhood Statistics for Dynamic Gradient Compression on Remote Sensing Image Interpretation

Dec 29, 2023
Weiying Xie, Zixuan Wang, Jitao Ma, Daixun Li, Yunsong Li

Viaarxiv icon

FRED: Towards a Full Rotation-Equivariance in Aerial Image Object Detection

Dec 22, 2023
Chanho Lee, Jinsu Son, Hyounguk Shon, Yunho Jeon, Junmo Kim

Viaarxiv icon

Rich Human Feedback for Text-to-Image Generation

Dec 15, 2023
Youwei Liang, Junfeng He, Gang Li, Peizhao Li, Arseniy Klimovskiy, Nicholas Carolan, Jiao Sun, Jordi Pont-Tuset, Sarah Young, Feng Yang, Junjie Ke, Krishnamurthy Dj Dvijotham, Katie Collins, Yiwen Luo, Yang Li, Kai J Kohlhoff, Deepak Ramachandran, Vidhya Navalpakkam

Viaarxiv icon

EPNet: An Efficient Pyramid Network for Enhanced Single-Image Super-Resolution with Reduced Computational Requirements

Dec 20, 2023
Xin Xu, Jinman Park, Paul Fieguth

Viaarxiv icon

Semantic-aware Data Augmentation for Text-to-image Synthesis

Dec 13, 2023
Zhaorui Tan, Xi Yang, Kaizhu Huang

Viaarxiv icon

A Dual Domain Multi-exposure Image Fusion Network based on the Spatial-Frequency Integration

Dec 17, 2023
Guang Yang, Jie Li, Xinbo Gao

Viaarxiv icon

Repaint123: Fast and High-quality One Image to 3D Generation with Progressive Controllable 2D Repainting

Dec 21, 2023
Junwu Zhang, Zhenyu Tang, Yatian Pang, Xinhua Cheng, Peng Jin, Yida Wei, Munan Ning, Li Yuan

Viaarxiv icon

An attempt to generate new bridge types from latent space of PixelCNN

Jan 11, 2024
Hongjun Zhang

Viaarxiv icon