Picture for Anton van den Hengel

Anton van den Hengel

the University of Adelaide

Towards Higher Effective Rank in Parameter-efficient Fine-tuning using Khatri--Rao Product

Add code
Aug 01, 2025
Viaarxiv icon

Let Your Video Listen to Your Music!

Add code
Jun 23, 2025
Viaarxiv icon

Transformers Pretrained on Procedural Data Contain Modular Structures for Algorithmic Reasoning

Add code
May 28, 2025
Viaarxiv icon

Continual Learning on CLIP via Incremental Prompt Tuning with Intrinsic Textual Anchors

Add code
May 27, 2025
Viaarxiv icon

Seeing the Trees for the Forest: Rethinking Weakly-Supervised Medical Visual Grounding

Add code
May 21, 2025
Viaarxiv icon

FlowDubber: Movie Dubbing with LLM-based Semantic-aware Learning and Flow Matching based Voice Enhancing

Add code
May 02, 2025
Figure 1 for FlowDubber: Movie Dubbing with LLM-based Semantic-aware Learning and Flow Matching based Voice Enhancing
Figure 2 for FlowDubber: Movie Dubbing with LLM-based Semantic-aware Learning and Flow Matching based Voice Enhancing
Figure 3 for FlowDubber: Movie Dubbing with LLM-based Semantic-aware Learning and Flow Matching based Voice Enhancing
Figure 4 for FlowDubber: Movie Dubbing with LLM-based Semantic-aware Learning and Flow Matching based Voice Enhancing
Viaarxiv icon

ProgRoCC: A Progressive Approach to Rough Crowd Counting

Add code
Apr 18, 2025
Viaarxiv icon

Negate or Embrace: On How Misalignment Shapes Multimodal Representation Learning

Add code
Apr 16, 2025
Viaarxiv icon

The Devil is in the Distributions: Explicit Modeling of Scene Content is Key in Zero-Shot Video Captioning

Add code
Mar 31, 2025
Figure 1 for The Devil is in the Distributions: Explicit Modeling of Scene Content is Key in Zero-Shot Video Captioning
Figure 2 for The Devil is in the Distributions: Explicit Modeling of Scene Content is Key in Zero-Shot Video Captioning
Figure 3 for The Devil is in the Distributions: Explicit Modeling of Scene Content is Key in Zero-Shot Video Captioning
Figure 4 for The Devil is in the Distributions: Explicit Modeling of Scene Content is Key in Zero-Shot Video Captioning
Viaarxiv icon

MATHGLANCE: Multimodal Large Language Models Do Not Know Where to Look in Mathematical Diagrams

Add code
Mar 26, 2025
Viaarxiv icon