Picture for Di Wu

Di Wu

La Trobe University, Melbourne, Australia

Rethinking Federated Graph Foundation Models: A Graph-Language Alignment-based Approach

Add code
Jan 29, 2026
Viaarxiv icon

Quantization-Aware Distillation for NVFP4 Inference Accuracy Recovery

Add code
Jan 27, 2026
Viaarxiv icon

Mugi: Value Level Parallelism For Efficient LLMs

Add code
Jan 15, 2026
Viaarxiv icon

LLMTrack: Semantic Multi-Object Tracking with Multi-modal Large Language Models

Add code
Jan 10, 2026
Viaarxiv icon

STAR-S: Improving Safety Alignment through Self-Taught Reasoning on Safety Rules

Add code
Jan 07, 2026
Viaarxiv icon

Noise-Aware and Dynamically Adaptive Federated Defense Framework for SAR Image Target Recognition

Add code
Dec 31, 2025
Viaarxiv icon

RoboMIND 2.0: A Multimodal, Bimanual Mobile Manipulation Dataset for Generalizable Embodied Intelligence

Add code
Dec 31, 2025
Viaarxiv icon

Seedance 1.5 pro: A Native Audio-Visual Joint Generation Foundation Model

Add code
Dec 23, 2025
Viaarxiv icon

StereoMV2D: A Sparse Temporal Stereo-Enhanced Framework for Robust Multi-View 3D Object Detection

Add code
Dec 19, 2025
Figure 1 for StereoMV2D: A Sparse Temporal Stereo-Enhanced Framework for Robust Multi-View 3D Object Detection
Figure 2 for StereoMV2D: A Sparse Temporal Stereo-Enhanced Framework for Robust Multi-View 3D Object Detection
Figure 3 for StereoMV2D: A Sparse Temporal Stereo-Enhanced Framework for Robust Multi-View 3D Object Detection
Figure 4 for StereoMV2D: A Sparse Temporal Stereo-Enhanced Framework for Robust Multi-View 3D Object Detection
Viaarxiv icon

Structure-Aware Decoding Mechanisms for Complex Entity Extraction with Large-Scale Language Models

Add code
Dec 16, 2025
Viaarxiv icon