Picture for Zheng Huang

Zheng Huang

Active-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO

Add code
May 27, 2025
Viaarxiv icon

Omni-R1: Reinforcement Learning for Omnimodal Reasoning via Two-System Collaboration

Add code
May 26, 2025
Viaarxiv icon

"I know myself better, but not really greatly": Using LLMs to Detect and Explain LLM-Generated Texts

Add code
Feb 18, 2025
Viaarxiv icon

Interpreting Object-level Foundation Models via Visual Precision Search

Add code
Nov 25, 2024
Figure 1 for Interpreting Object-level Foundation Models via Visual Precision Search
Figure 2 for Interpreting Object-level Foundation Models via Visual Precision Search
Figure 3 for Interpreting Object-level Foundation Models via Visual Precision Search
Figure 4 for Interpreting Object-level Foundation Models via Visual Precision Search
Viaarxiv icon

Detecting Machine-Generated Texts: Not Just "AI vs Humans" and Explainability is Complicated

Add code
Jun 26, 2024
Figure 1 for Detecting Machine-Generated Texts: Not Just "AI vs Humans" and Explainability is Complicated
Figure 2 for Detecting Machine-Generated Texts: Not Just "AI vs Humans" and Explainability is Complicated
Figure 3 for Detecting Machine-Generated Texts: Not Just "AI vs Humans" and Explainability is Complicated
Figure 4 for Detecting Machine-Generated Texts: Not Just "AI vs Humans" and Explainability is Complicated
Viaarxiv icon

GeoBench: Benchmarking and Analyzing Monocular Geometry Estimation Models

Add code
Jun 18, 2024
Figure 1 for GeoBench: Benchmarking and Analyzing Monocular Geometry Estimation Models
Figure 2 for GeoBench: Benchmarking and Analyzing Monocular Geometry Estimation Models
Figure 3 for GeoBench: Benchmarking and Analyzing Monocular Geometry Estimation Models
Figure 4 for GeoBench: Benchmarking and Analyzing Monocular Geometry Estimation Models
Viaarxiv icon

Enhancing Size Generalization in Graph Neural Networks through Disentangled Representation Learning

Add code
Jun 07, 2024
Viaarxiv icon

Support or Refute: Analyzing the Stance of Evidence to Detect Out-of-Context Mis- and Disinformation

Add code
Nov 16, 2023
Viaarxiv icon

Empowering Next POI Recommendation with Multi-Relational Modeling

Add code
Apr 24, 2022
Figure 1 for Empowering Next POI Recommendation with Multi-Relational Modeling
Figure 2 for Empowering Next POI Recommendation with Multi-Relational Modeling
Figure 3 for Empowering Next POI Recommendation with Multi-Relational Modeling
Figure 4 for Empowering Next POI Recommendation with Multi-Relational Modeling
Viaarxiv icon

Scale Invariant Domain Generalization Image Recapture Detection

Add code
Oct 07, 2021
Figure 1 for Scale Invariant Domain Generalization Image Recapture Detection
Figure 2 for Scale Invariant Domain Generalization Image Recapture Detection
Figure 3 for Scale Invariant Domain Generalization Image Recapture Detection
Figure 4 for Scale Invariant Domain Generalization Image Recapture Detection
Viaarxiv icon