Chenglong Liu

Refining Sentence Embedding Model through Ranking Sentences Generation with Large Language Models

Feb 19, 2025

PerPO: Perceptual Preference Optimization via Discriminative Rewarding

Feb 05, 2025

Corner2Net: Detecting Objects as Cascade Corners

Nov 24, 2024

General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Sep 03, 2024

ReCon1M: A Large-scale Benchmark Dataset for Relation Comprehension in Remote Sensing Imagery

Jun 10, 2024

VIP: Versatile Image Outpainting Empowered by Multimodal Large Language Model

Jun 03, 2024

Focus Anywhere for Fine-grained Multi-page Document Understanding

May 23, 2024

OneChart: Purify the Chart Structural Extraction via One Auxiliary Token

Apr 15, 2024

TAFormer: A Unified Target-Aware Transformer for Video and Motion Joint Prediction in Aerial Scenes

Mar 27, 2024

Remote Sensing Image Change Detection with Graph Interaction

Jul 05, 2023