Picture for Jin Ma

Jin Ma

Towards Automated Community Notes Generation with Large Vision Language Models for Combating Contextual Deception

Add code
Mar 23, 2026
Viaarxiv icon

A Large-Scale Remote Sensing Dataset and VLM-based Algorithm for Fine-Grained Road Hierarchy Classification

Add code
Mar 22, 2026
Viaarxiv icon

Omni IIE Bench: Benchmarking the Practical Capabilities of Image Editing Models

Add code
Mar 16, 2026
Viaarxiv icon

Beyond Closed-Pool Video Retrieval: A Benchmark and Agent Framework for Real-World Video Search and Moment Localization

Add code
Feb 10, 2026
Viaarxiv icon

VDE Bench: Evaluating The Capability of Image Editing Models to Modify Visual Documents

Add code
Jan 27, 2026
Viaarxiv icon

Rank4Gen: RAG-Preference-Aligned Document Set Selection and Ranking

Add code
Jan 16, 2026
Viaarxiv icon

Coupling Experts and Routers in Mixture-of-Experts via an Auxiliary Loss

Add code
Dec 29, 2025
Viaarxiv icon

Virtual Width Networks

Add code
Nov 17, 2025
Figure 1 for Virtual Width Networks
Figure 2 for Virtual Width Networks
Figure 3 for Virtual Width Networks
Figure 4 for Virtual Width Networks
Viaarxiv icon

DisPatch: Disarming Adversarial Patches in Object Detection with Diffusion Models

Add code
Sep 04, 2025
Figure 1 for DisPatch: Disarming Adversarial Patches in Object Detection with Diffusion Models
Figure 2 for DisPatch: Disarming Adversarial Patches in Object Detection with Diffusion Models
Figure 3 for DisPatch: Disarming Adversarial Patches in Object Detection with Diffusion Models
Figure 4 for DisPatch: Disarming Adversarial Patches in Object Detection with Diffusion Models
Viaarxiv icon

ARC-Hunyuan-Video-7B: Structured Video Comprehension of Real-World Shorts

Add code
Jul 28, 2025
Viaarxiv icon