Picture for Tianxiang Hao

Tianxiang Hao

GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Add code
Jul 02, 2025
Viaarxiv icon

Urban1960SatSeg: Unsupervised Semantic Segmentation of Mid-20$^{th}$ century Urban Landscapes with Satellite Imageries

Add code
Jun 12, 2025
Viaarxiv icon

DiscoVLA: Discrepancy Reduction in Vision, Language, and Alignment for Parameter-Efficient Video-Text Retrieval

Add code
Jun 10, 2025
Viaarxiv icon

SpecOffload: Unlocking Latent GPU Capacity for LLM Inference on Resource-Constrained Devices

Add code
May 15, 2025
Viaarxiv icon

TempMe: Video Temporal Token Merging for Efficient Text-Video Retrieval

Add code
Sep 02, 2024
Viaarxiv icon

Learn from the Learnt: Source-Free Active Domain Adaptation via Contrastive Sampling and Visual Persistence

Add code
Jul 26, 2024
Viaarxiv icon

Quantized Prompt for Efficient Generalization of Vision-Language Models

Add code
Jul 15, 2024
Viaarxiv icon

PYRA: Parallel Yielding Re-Activation for Training-Inference Efficient Task Adaptation

Add code
Mar 14, 2024
Figure 1 for PYRA: Parallel Yielding Re-Activation for Training-Inference Efficient Task Adaptation
Figure 2 for PYRA: Parallel Yielding Re-Activation for Training-Inference Efficient Task Adaptation
Figure 3 for PYRA: Parallel Yielding Re-Activation for Training-Inference Efficient Task Adaptation
Figure 4 for PYRA: Parallel Yielding Re-Activation for Training-Inference Efficient Task Adaptation
Viaarxiv icon

Re-parameterized Low-rank Prompt: Generalize a Vision-Language Model within 0.5K Parameters

Add code
Dec 17, 2023
Viaarxiv icon

Consolidator: Mergeable Adapter with Grouped Connections for Visual Adaptation

Add code
Apr 30, 2023
Viaarxiv icon