Picture for Tong Zhang

Tong Zhang

Nanjing University of Science and Technology, Nanjing, China

Multi-Modal Hypergraph Enhanced LLM Learning for Recommendation

Add code
Apr 13, 2025
Figure 1 for Multi-Modal Hypergraph Enhanced LLM Learning for Recommendation
Figure 2 for Multi-Modal Hypergraph Enhanced LLM Learning for Recommendation
Figure 3 for Multi-Modal Hypergraph Enhanced LLM Learning for Recommendation
Figure 4 for Multi-Modal Hypergraph Enhanced LLM Learning for Recommendation
Viaarxiv icon

Refining CLIP's Spatial Awareness: A Visual-Centric Perspective

Add code
Apr 03, 2025
Figure 1 for Refining CLIP's Spatial Awareness: A Visual-Centric Perspective
Figure 2 for Refining CLIP's Spatial Awareness: A Visual-Centric Perspective
Figure 3 for Refining CLIP's Spatial Awareness: A Visual-Centric Perspective
Figure 4 for Refining CLIP's Spatial Awareness: A Visual-Centric Perspective
Viaarxiv icon

VGRP-Bench: Visual Grid Reasoning Puzzle Benchmark for Large Vision-Language Models

Add code
Apr 02, 2025
Figure 1 for VGRP-Bench: Visual Grid Reasoning Puzzle Benchmark for Large Vision-Language Models
Figure 2 for VGRP-Bench: Visual Grid Reasoning Puzzle Benchmark for Large Vision-Language Models
Figure 3 for VGRP-Bench: Visual Grid Reasoning Puzzle Benchmark for Large Vision-Language Models
Figure 4 for VGRP-Bench: Visual Grid Reasoning Puzzle Benchmark for Large Vision-Language Models
Viaarxiv icon

ASGO: Adaptive Structured Gradient Optimization

Add code
Mar 26, 2025
Figure 1 for ASGO: Adaptive Structured Gradient Optimization
Figure 2 for ASGO: Adaptive Structured Gradient Optimization
Figure 3 for ASGO: Adaptive Structured Gradient Optimization
Figure 4 for ASGO: Adaptive Structured Gradient Optimization
Viaarxiv icon

FDS: Frequency-Aware Denoising Score for Text-Guided Latent Diffusion Image Editing

Add code
Mar 24, 2025
Figure 1 for FDS: Frequency-Aware Denoising Score for Text-Guided Latent Diffusion Image Editing
Figure 2 for FDS: Frequency-Aware Denoising Score for Text-Guided Latent Diffusion Image Editing
Figure 3 for FDS: Frequency-Aware Denoising Score for Text-Guided Latent Diffusion Image Editing
Figure 4 for FDS: Frequency-Aware Denoising Score for Text-Guided Latent Diffusion Image Editing
Viaarxiv icon

Generating Multimodal Driving Scenes via Next-Scene Prediction

Add code
Mar 19, 2025
Viaarxiv icon

RAG-RL: Advancing Retrieval-Augmented Generation via RL and Curriculum Learning

Add code
Mar 17, 2025
Viaarxiv icon

Monte Carlo Diffusion for Generalizable Learning-Based RANSAC

Add code
Mar 12, 2025
Figure 1 for Monte Carlo Diffusion for Generalizable Learning-Based RANSAC
Figure 2 for Monte Carlo Diffusion for Generalizable Learning-Based RANSAC
Figure 3 for Monte Carlo Diffusion for Generalizable Learning-Based RANSAC
Figure 4 for Monte Carlo Diffusion for Generalizable Learning-Based RANSAC
Viaarxiv icon

ROCM: RLHF on consistency models

Add code
Mar 08, 2025
Viaarxiv icon

SEOE: A Scalable and Reliable Semantic Evaluation Framework for Open Domain Event Detection

Add code
Mar 05, 2025
Viaarxiv icon