Picture for Yifeng Gao

Yifeng Gao

A Safety Report on GPT-5.2, Gemini 3 Pro, Qwen3-VL, Grok 4.1 Fast, Nano Banana Pro, and Seedream 4.5

Add code
Jan 16, 2026
Viaarxiv icon

SyncThink: A Training-Free Strategy to Align Inference Termination with Reasoning Saturation

Add code
Jan 07, 2026
Viaarxiv icon

Polynomial Closed Form Model for Ultra-Wideband Transmission Systems

Add code
Aug 29, 2025
Viaarxiv icon

TRACE: Grounding Time Series in Context for Multimodal Embedding and Retrieval

Add code
Jun 10, 2025
Viaarxiv icon

ThinkLess: A Training-Free Inference-Efficient Method for Reducing Reasoning Redundancy

Add code
May 21, 2025
Figure 1 for ThinkLess: A Training-Free Inference-Efficient Method for Reducing Reasoning Redundancy
Figure 2 for ThinkLess: A Training-Free Inference-Efficient Method for Reducing Reasoning Redundancy
Figure 3 for ThinkLess: A Training-Free Inference-Efficient Method for Reducing Reasoning Redundancy
Figure 4 for ThinkLess: A Training-Free Inference-Efficient Method for Reducing Reasoning Redundancy
Viaarxiv icon

SafeVid: Toward Safety Aligned Video Large Multimodal Models

Add code
May 17, 2025
Viaarxiv icon

MTBench: A Multimodal Time Series Benchmark for Temporal Reasoning and Question Answering

Add code
Mar 21, 2025
Figure 1 for MTBench: A Multimodal Time Series Benchmark for Temporal Reasoning and Question Answering
Figure 2 for MTBench: A Multimodal Time Series Benchmark for Temporal Reasoning and Question Answering
Figure 3 for MTBench: A Multimodal Time Series Benchmark for Temporal Reasoning and Question Answering
Figure 4 for MTBench: A Multimodal Time Series Benchmark for Temporal Reasoning and Question Answering
Viaarxiv icon

Token Pruning in Multimodal Large Language Models: Are We Solving the Right Problem?

Add code
Feb 17, 2025
Viaarxiv icon

Stop Looking for Important Tokens in Multimodal Language Models: Duplication Matters More

Add code
Feb 17, 2025
Figure 1 for Stop Looking for Important Tokens in Multimodal Language Models: Duplication Matters More
Figure 2 for Stop Looking for Important Tokens in Multimodal Language Models: Duplication Matters More
Figure 3 for Stop Looking for Important Tokens in Multimodal Language Models: Duplication Matters More
Figure 4 for Stop Looking for Important Tokens in Multimodal Language Models: Duplication Matters More
Viaarxiv icon

SPA-VL: A Comprehensive Safety Preference Alignment Dataset for Vision Language Model

Add code
Jun 17, 2024
Figure 1 for SPA-VL: A Comprehensive Safety Preference Alignment Dataset for Vision Language Model
Figure 2 for SPA-VL: A Comprehensive Safety Preference Alignment Dataset for Vision Language Model
Figure 3 for SPA-VL: A Comprehensive Safety Preference Alignment Dataset for Vision Language Model
Figure 4 for SPA-VL: A Comprehensive Safety Preference Alignment Dataset for Vision Language Model
Viaarxiv icon