Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Gilhan Park

Mitigating Semantic Collapse in Partially Relevant Video Retrieval

Oct 31, 2025

WonJun Moon, MinSeok Jung, Gilhan Park, Tae-Young Kim, Cheol-Ho Cho, Woojin Jun, Jae-Pil Heo

Figure 1 for Mitigating Semantic Collapse in Partially Relevant Video Retrieval

Figure 2 for Mitigating Semantic Collapse in Partially Relevant Video Retrieval

Figure 3 for Mitigating Semantic Collapse in Partially Relevant Video Retrieval

Figure 4 for Mitigating Semantic Collapse in Partially Relevant Video Retrieval

Abstract:Partially Relevant Video Retrieval (PRVR) seeks videos where only part of the content matches a text query. Existing methods treat every annotated text-video pair as a positive and all others as negatives, ignoring the rich semantic variation both within a single video and across different videos. Consequently, embeddings of both queries and their corresponding video-clip segments for distinct events within the same video collapse together, while embeddings of semantically similar queries and segments from different videos are driven apart. This limits retrieval performance when videos contain multiple, diverse events. This paper addresses the aforementioned problems, termed as semantic collapse, in both the text and video embedding spaces. We first introduce Text Correlation Preservation Learning, which preserves the semantic relationships encoded by the foundation model across text queries. To address collapse in video embeddings, we propose Cross-Branch Video Alignment (CBVA), a contrastive alignment method that disentangles hierarchical video representations across temporal scales. Subsequently, we introduce order-preserving token merging and adaptive CBVA to enhance alignment by producing video segments that are internally coherent yet mutually distinctive. Extensive experiments on PRVR benchmarks demonstrate that our framework effectively prevents semantic collapse and substantially improves retrieval accuracy.

* Accpeted to NeurIPS 2025. Code is available at https://github.com/admins97/MSC_PRVR

Via

Access Paper or Ask Questions

Mitigating Background Shift in Class-Incremental Semantic Segmentation

Jul 16, 2024

Gilhan Park, WonJun Moon, SuBeen Lee, Tae-Young Kim, Jae-Pil Heo

Figure 1 for Mitigating Background Shift in Class-Incremental Semantic Segmentation

Figure 2 for Mitigating Background Shift in Class-Incremental Semantic Segmentation

Figure 3 for Mitigating Background Shift in Class-Incremental Semantic Segmentation

Figure 4 for Mitigating Background Shift in Class-Incremental Semantic Segmentation

Abstract:Class-Incremental Semantic Segmentation(CISS) aims to learn new classes without forgetting the old ones, using only the labels of the new classes. To achieve this, two popular strategies are employed: 1) pseudo-labeling and knowledge distillation to preserve prior knowledge; and 2) background weight transfer, which leverages the broad coverage of background in learning new classes by transferring background weight to the new class classifier. However, the first strategy heavily relies on the old model in detecting old classes while undetected pixels are regarded as the background, thereby leading to the background shift towards the old classes(i.e., misclassification of old class as background). Additionally, in the case of the second approach, initializing the new class classifier with background knowledge triggers a similar background shift issue, but towards the new classes. To address these issues, we propose a background-class separation framework for CISS. To begin with, selective pseudo-labeling and adaptive feature distillation are to distill only trustworthy past knowledge. On the other hand, we encourage the separation between the background and new classes with a novel orthogonal objective along with label-guided output distillation. Our state-of-the-art results validate the effectiveness of these proposed methods.

* Accepted to ECCV 2024. Code is available at http://github.com/RoadoneP/ECCV2024_MBS

Via

Access Paper or Ask Questions