Jun Liu

Siemens

Grasping by Spiraling: Reproducing Elephant Movements with Rigid-Soft Robot Synergy

Apr 02, 2025
Via arXiv

FortisAVQA and MAVEN: a Benchmark Dataset and Debiasing Framework for Robust Multimodal Reasoning

Apr 02, 2025
Via arXiv

POPEN: Preference-Based Optimization and Ensemble for LVLM-Based Reasoning Segmentation

Apr 01, 2025
Via arXiv

Resource-Efficient Federated Fine-Tuning Large Language Models for Heterogeneous Data

Mar 27, 2025
Via arXiv

GKG-LLM: A Unified Framework for Generalized Knowledge Graph Construction

Mar 17, 2025
Via arXiv

$φ$-Decoding: Adaptive Foresight Sampling for Balanced Inference-Time Exploration and Exploitation

Mar 17, 2025
Via arXiv

Niagara: Normal-Integrated Geometric Affine Field for Scene Reconstruction from a Single View

Mar 16, 2025
Via arXiv

VA-AR: Learning Velocity-Aware Action Representations with Mixture of Window Attention

Mar 14, 2025
Via arXiv

Making Every Step Effective: Jailbreaking Large Vision-Language Models Through Hierarchical KV Equalization

Mar 14, 2025
Via arXiv

STEAD: Spatio-Temporal Efficient Anomaly Detection for Time and Compute Sensitive Applications

Mar 11, 2025
Via arXiv