Yuliang Liu

R-CoT: Reverse Chain-of-Thought Problem Generation for Geometric Reasoning in Large Multimodal Models

Oct 23, 2024

PDF-WuKong: A Large Multimodal Model for Efficient Long PDF Reading with End-to-End Sparse Sampling

Oct 08, 2024

LongRecipe: Recipe for Efficient Long Context Generalization in Large Language Models

Sep 04, 2024

Mini-Monkey: Multi-Scale Adaptive Cropping for Multimodal Large Language Models

Aug 09, 2024

Mini-Monkey: Alleviate the Sawtooth Effect by Multi-Scale Adaptive Cropping

Aug 04, 2024

Multi-Prompting Decoder Helps Better Language Understanding

Jun 10, 2024

MoE Jetpack: From Dense Checkpoints to Adaptive Mixture of Experts for Vision Tasks

Jun 07, 2024

Puzzle Pieces Picker: Deciphering Ancient Chinese Characters with Radical Reconstruction

Jun 05, 2024

Deciphering Oracle Bone Language with Diffusion Models

Jun 02, 2024

Dataset and Benchmark for Urdu Natural Scenes Text Detection, Recognition and Visual Question Answering

May 21, 2024