Alert button

"Image": models, code, and papers
Alert button

mPLUG-DocOwl 1.5: Unified Structure Learning for OCR-free Document Understanding

Add code
Bookmark button
Alert button
Mar 19, 2024
Anwen Hu, Haiyang Xu, Jiabo Ye, Ming Yan, Liang Zhang, Bo Zhang, Chen Li, Ji Zhang, Qin Jin, Fei Huang, Jingren Zhou

Figure 1 for mPLUG-DocOwl 1.5: Unified Structure Learning for OCR-free Document Understanding
Figure 2 for mPLUG-DocOwl 1.5: Unified Structure Learning for OCR-free Document Understanding
Figure 3 for mPLUG-DocOwl 1.5: Unified Structure Learning for OCR-free Document Understanding
Figure 4 for mPLUG-DocOwl 1.5: Unified Structure Learning for OCR-free Document Understanding
Viaarxiv icon

Ambient Diffusion Posterior Sampling: Solving Inverse Problems with Diffusion Models trained on Corrupted Data

Add code
Bookmark button
Alert button
Mar 13, 2024
Asad Aali, Giannis Daras, Brett Levac, Sidharth Kumar, Alexandros G. Dimakis, Jonathan I. Tamir

Figure 1 for Ambient Diffusion Posterior Sampling: Solving Inverse Problems with Diffusion Models trained on Corrupted Data
Figure 2 for Ambient Diffusion Posterior Sampling: Solving Inverse Problems with Diffusion Models trained on Corrupted Data
Figure 3 for Ambient Diffusion Posterior Sampling: Solving Inverse Problems with Diffusion Models trained on Corrupted Data
Figure 4 for Ambient Diffusion Posterior Sampling: Solving Inverse Problems with Diffusion Models trained on Corrupted Data
Viaarxiv icon

AADNet: Attention aware Demoiréing Network

Mar 13, 2024
M Rakesh Reddy, Shubham Mandloi, Aman Kumar

Figure 1 for AADNet: Attention aware Demoiréing Network
Figure 2 for AADNet: Attention aware Demoiréing Network
Figure 3 for AADNet: Attention aware Demoiréing Network
Viaarxiv icon

3DFIRES: Few Image 3D REconstruction for Scenes with Hidden Surface

Add code
Bookmark button
Alert button
Mar 13, 2024
Linyi Jin, Nilesh Kulkarni, David Fouhey

Figure 1 for 3DFIRES: Few Image 3D REconstruction for Scenes with Hidden Surface
Figure 2 for 3DFIRES: Few Image 3D REconstruction for Scenes with Hidden Surface
Figure 3 for 3DFIRES: Few Image 3D REconstruction for Scenes with Hidden Surface
Figure 4 for 3DFIRES: Few Image 3D REconstruction for Scenes with Hidden Surface
Viaarxiv icon

Simulating Battery-Powered TinyML Systems Optimised using Reinforcement Learning in Image-Based Anomaly Detection

Mar 08, 2024
Jared M. Ping, Ken J. Nixon

Figure 1 for Simulating Battery-Powered TinyML Systems Optimised using Reinforcement Learning in Image-Based Anomaly Detection
Figure 2 for Simulating Battery-Powered TinyML Systems Optimised using Reinforcement Learning in Image-Based Anomaly Detection
Figure 3 for Simulating Battery-Powered TinyML Systems Optimised using Reinforcement Learning in Image-Based Anomaly Detection
Figure 4 for Simulating Battery-Powered TinyML Systems Optimised using Reinforcement Learning in Image-Based Anomaly Detection
Viaarxiv icon

Region-Adaptive Transform with Segmentation Prior for Image Compression

Add code
Bookmark button
Alert button
Mar 01, 2024
Yuxi Liu, Wenhan Yang, Huihui Bai, Yunchao Wei, Yao Zhao

Figure 1 for Region-Adaptive Transform with Segmentation Prior for Image Compression
Figure 2 for Region-Adaptive Transform with Segmentation Prior for Image Compression
Figure 3 for Region-Adaptive Transform with Segmentation Prior for Image Compression
Figure 4 for Region-Adaptive Transform with Segmentation Prior for Image Compression
Viaarxiv icon

MGIC: A Multi-Label Gradient Inversion Attack based on Canny Edge Detection on Federated Learning

Mar 13, 2024
Can Liu, Jin Wang

Figure 1 for MGIC: A Multi-Label Gradient Inversion Attack based on Canny Edge Detection on Federated Learning
Figure 2 for MGIC: A Multi-Label Gradient Inversion Attack based on Canny Edge Detection on Federated Learning
Figure 3 for MGIC: A Multi-Label Gradient Inversion Attack based on Canny Edge Detection on Federated Learning
Figure 4 for MGIC: A Multi-Label Gradient Inversion Attack based on Canny Edge Detection on Federated Learning
Viaarxiv icon

Rule-driven News Captioning

Mar 14, 2024
Ning Xu, Tingting Zhang, Hongshuo Tian, An-An Liu

Figure 1 for Rule-driven News Captioning
Figure 2 for Rule-driven News Captioning
Figure 3 for Rule-driven News Captioning
Figure 4 for Rule-driven News Captioning
Viaarxiv icon

Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering

Add code
Bookmark button
Alert button
Mar 14, 2024
Zeyu Liu, Weicong Liang, Zhanhao Liang, Chong Luo, Ji Li, Gao Huang, Yuhui Yuan

Figure 1 for Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering
Figure 2 for Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering
Figure 3 for Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering
Figure 4 for Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering
Viaarxiv icon

Motion and temporal B0 shift corrections for quantitative susceptibility mapping (QSM) and R2* mapping using dual-echo spiral navigators and conjugate-phase reconstruction

Mar 18, 2024
Yuguang Meng, Jason W. Allen, Vahid Khalilzad Sharghi, Deqiang Qiu

Figure 1 for Motion and temporal B0 shift corrections for quantitative susceptibility mapping (QSM) and R2* mapping using dual-echo spiral navigators and conjugate-phase reconstruction
Figure 2 for Motion and temporal B0 shift corrections for quantitative susceptibility mapping (QSM) and R2* mapping using dual-echo spiral navigators and conjugate-phase reconstruction
Figure 3 for Motion and temporal B0 shift corrections for quantitative susceptibility mapping (QSM) and R2* mapping using dual-echo spiral navigators and conjugate-phase reconstruction
Figure 4 for Motion and temporal B0 shift corrections for quantitative susceptibility mapping (QSM) and R2* mapping using dual-echo spiral navigators and conjugate-phase reconstruction
Viaarxiv icon