Picture for Houqiang Li

Houqiang Li

From Yes-Men to Truth-Tellers: Addressing Sycophancy in Large Language Models with Pinpoint Tuning

Add code
Sep 03, 2024
Viaarxiv icon

AdaptVision: Dynamic Input Scaling in MLLMs for Versatile Scene Understanding

Add code
Aug 30, 2024
Viaarxiv icon

LaneTCA: Enhancing Video Lane Detection with Temporal Context Aggregation

Add code
Aug 25, 2024
Figure 1 for LaneTCA: Enhancing Video Lane Detection with Temporal Context Aggregation
Figure 2 for LaneTCA: Enhancing Video Lane Detection with Temporal Context Aggregation
Figure 3 for LaneTCA: Enhancing Video Lane Detection with Temporal Context Aggregation
Figure 4 for LaneTCA: Enhancing Video Lane Detection with Temporal Context Aggregation
Viaarxiv icon

Scaling up Multimodal Pre-training for Sign Language Understanding

Add code
Aug 16, 2024
Figure 1 for Scaling up Multimodal Pre-training for Sign Language Understanding
Figure 2 for Scaling up Multimodal Pre-training for Sign Language Understanding
Figure 3 for Scaling up Multimodal Pre-training for Sign Language Understanding
Figure 4 for Scaling up Multimodal Pre-training for Sign Language Understanding
Viaarxiv icon

SwinShadow: Shifted Window for Ambiguous Adjacent Shadow Detection

Add code
Aug 07, 2024
Figure 1 for SwinShadow: Shifted Window for Ambiguous Adjacent Shadow Detection
Figure 2 for SwinShadow: Shifted Window for Ambiguous Adjacent Shadow Detection
Figure 3 for SwinShadow: Shifted Window for Ambiguous Adjacent Shadow Detection
Figure 4 for SwinShadow: Shifted Window for Ambiguous Adjacent Shadow Detection
Viaarxiv icon

SEDS: Semantically Enhanced Dual-Stream Encoder for Sign Language Retrieval

Add code
Jul 23, 2024
Figure 1 for SEDS: Semantically Enhanced Dual-Stream Encoder for Sign Language Retrieval
Figure 2 for SEDS: Semantically Enhanced Dual-Stream Encoder for Sign Language Retrieval
Figure 3 for SEDS: Semantically Enhanced Dual-Stream Encoder for Sign Language Retrieval
Figure 4 for SEDS: Semantically Enhanced Dual-Stream Encoder for Sign Language Retrieval
Viaarxiv icon

Forest2Seq: Revitalizing Order Prior for Sequential Indoor Scene Synthesis

Add code
Jul 07, 2024
Figure 1 for Forest2Seq: Revitalizing Order Prior for Sequential Indoor Scene Synthesis
Figure 2 for Forest2Seq: Revitalizing Order Prior for Sequential Indoor Scene Synthesis
Figure 3 for Forest2Seq: Revitalizing Order Prior for Sequential Indoor Scene Synthesis
Figure 4 for Forest2Seq: Revitalizing Order Prior for Sequential Indoor Scene Synthesis
Viaarxiv icon

RoFIR: Robust Fisheye Image Rectification Framework Impervious to Optical Center Deviation

Add code
Jun 27, 2024
Figure 1 for RoFIR: Robust Fisheye Image Rectification Framework Impervious to Optical Center Deviation
Figure 2 for RoFIR: Robust Fisheye Image Rectification Framework Impervious to Optical Center Deviation
Figure 3 for RoFIR: Robust Fisheye Image Rectification Framework Impervious to Optical Center Deviation
Figure 4 for RoFIR: Robust Fisheye Image Rectification Framework Impervious to Optical Center Deviation
Viaarxiv icon

Text-Animator: Controllable Visual Text Video Generation

Add code
Jun 25, 2024
Figure 1 for Text-Animator: Controllable Visual Text Video Generation
Figure 2 for Text-Animator: Controllable Visual Text Video Generation
Figure 3 for Text-Animator: Controllable Visual Text Video Generation
Figure 4 for Text-Animator: Controllable Visual Text Video Generation
Viaarxiv icon

Prediction and Reference Quality Adaptation for Learned Video Compression

Add code
Jun 20, 2024
Viaarxiv icon