Jiang Liu

Boosting Private Domain Understanding of Efficient MLLMs: A Tuning-free, Adaptive, Universal Prompt Optimization Framework

Dec 27, 2024

SoftVQ-VAE: Efficient 1-Dimensional Continuous Tokenizer

Dec 14, 2024

M$^{3}$D: A Multimodal, Multilingual and Multitask Dataset for Grounded Document-level Information Extraction

Dec 05, 2024

Memory-Augmented Multimodal LLMs for Surgical VQA via Self-Contained Inquiry

Nov 17, 2024

Accelerating Non-Maximum Suppression: A Graph Theory Perspective

Sep 30, 2024

Align$^2$LLaVA: Cascaded Human and Large Language Model Preference Alignment for Multi-modal Instruction Curation

Sep 27, 2024

TeamLoRA: Boosting Low-Rank Adaptation with Expert Collaboration and Competition

Aug 19, 2024

MM-UNet: A Mixed MLP Architecture for Improved Ophthalmic Image Segmentation

Aug 16, 2024

Is Dataset Quality Still a Concern in Diagnosis Using Large Foundation Model?

May 21, 2024

Edge Intelligence Optimization for Large Language Model Inference with Batching and Quantization

May 12, 2024