Tiantian Zhang

Sample By Step, Optimize By Chunk: Chunk-Level GRPO For Text-to-Image Generation

Oct 24, 2025
Via arXiv

Distribution Preference Optimization: A Fine-grained Perspective for LLM Unlearning

Oct 06, 2025
Via arXiv

Reinforcement Fine-Tuning Powers Reasoning Capability of Multimodal Large Language Models

May 24, 2025
Via arXiv

Evaluation Study on SAM 2 for Class-agnostic Instance-level Segmentation

Sep 04, 2024
Via arXiv

Incorporating Clinical Guidelines through Adapting Multi-modal Large Language Model for Prostate Cancer PI-RADS Scoring

May 14, 2024
Via arXiv

Adaptive Intra-Class Variation Contrastive Learning for Unsupervised Person Re-Identification

Apr 06, 2024
Via arXiv

Replay-enhanced Continual Reinforcement Learning

Nov 20, 2023
Via arXiv

Implementation and Evaluation of Physical Layer Key Generation on SDR based LoRa Platform

Aug 30, 2023
Via arXiv

A Surrogate-Assisted Controller for Expensive Evolutionary Reinforcement Learning

Jan 01, 2022
Via arXiv

Catastrophic Interference in Reinforcement Learning: A Solution Based on Context Division and Knowledge Distillation

Sep 01, 2021
Via arXiv