Picture for Juncheng Li

Juncheng Li

Topological GCN for Improving Detection of Hip Landmarks from B-Mode Ultrasound Images

Add code
Aug 24, 2024
Viaarxiv icon

TeamLoRA: Boosting Low-Rank Adaptation with Expert Collaboration and Competition

Add code
Aug 19, 2024
Figure 1 for TeamLoRA: Boosting Low-Rank Adaptation with Expert Collaboration and Competition
Figure 2 for TeamLoRA: Boosting Low-Rank Adaptation with Expert Collaboration and Competition
Figure 3 for TeamLoRA: Boosting Low-Rank Adaptation with Expert Collaboration and Competition
Figure 4 for TeamLoRA: Boosting Low-Rank Adaptation with Expert Collaboration and Competition
Viaarxiv icon

Auto-Encoding Morph-Tokens for Multimodal LLM

Add code
May 03, 2024
Viaarxiv icon

Sim-Grasp: Learning 6-DOF Grasp Policies for Cluttered Environments Using a Synthetic Benchmark

Add code
May 01, 2024
Figure 1 for Sim-Grasp: Learning 6-DOF Grasp Policies for Cluttered Environments Using a Synthetic Benchmark
Figure 2 for Sim-Grasp: Learning 6-DOF Grasp Policies for Cluttered Environments Using a Synthetic Benchmark
Figure 3 for Sim-Grasp: Learning 6-DOF Grasp Policies for Cluttered Environments Using a Synthetic Benchmark
Figure 4 for Sim-Grasp: Learning 6-DOF Grasp Policies for Cluttered Environments Using a Synthetic Benchmark
Viaarxiv icon

WorldGPT: Empowering LLM as Multimodal World Model

Add code
Apr 28, 2024
Figure 1 for WorldGPT: Empowering LLM as Multimodal World Model
Figure 2 for WorldGPT: Empowering LLM as Multimodal World Model
Figure 3 for WorldGPT: Empowering LLM as Multimodal World Model
Figure 4 for WorldGPT: Empowering LLM as Multimodal World Model
Viaarxiv icon

LASER: Tuning-Free LLM-Driven Attention Control for Efficient Text-conditioned Image-to-Animation

Add code
Apr 23, 2024
Figure 1 for LASER: Tuning-Free LLM-Driven Attention Control for Efficient Text-conditioned Image-to-Animation
Figure 2 for LASER: Tuning-Free LLM-Driven Attention Control for Efficient Text-conditioned Image-to-Animation
Figure 3 for LASER: Tuning-Free LLM-Driven Attention Control for Efficient Text-conditioned Image-to-Animation
Figure 4 for LASER: Tuning-Free LLM-Driven Attention Control for Efficient Text-conditioned Image-to-Animation
Viaarxiv icon

Fact :Teaching MLLMs with Faithful, Concise and Transferable Rationales

Add code
Apr 17, 2024
Figure 1 for Fact :Teaching MLLMs with Faithful, Concise and Transferable Rationales
Figure 2 for Fact :Teaching MLLMs with Faithful, Concise and Transferable Rationales
Figure 3 for Fact :Teaching MLLMs with Faithful, Concise and Transferable Rationales
Figure 4 for Fact :Teaching MLLMs with Faithful, Concise and Transferable Rationales
Viaarxiv icon

HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models

Add code
Mar 20, 2024
Figure 1 for HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models
Figure 2 for HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models
Figure 3 for HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models
Figure 4 for HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models
Viaarxiv icon

Momentor: Advancing Video Large Language Model with Fine-Grained Temporal Reasoning

Add code
Feb 18, 2024
Figure 1 for Momentor: Advancing Video Large Language Model with Fine-Grained Temporal Reasoning
Figure 2 for Momentor: Advancing Video Large Language Model with Fine-Grained Temporal Reasoning
Figure 3 for Momentor: Advancing Video Large Language Model with Fine-Grained Temporal Reasoning
Figure 4 for Momentor: Advancing Video Large Language Model with Fine-Grained Temporal Reasoning
Viaarxiv icon

De-fine: Decomposing and Refining Visual Programs with Auto-Feedback

Add code
Nov 25, 2023
Viaarxiv icon