Picture for Ji Qi

Ji Qi

MM-THEBench: Do Reasoning MLLMs Think Reasonably?

Add code
Jan 30, 2026
Viaarxiv icon

MemoryRewardBench: Benchmarking Reward Models for Long-Term Memory Management in Large Language Models

Add code
Jan 24, 2026
Viaarxiv icon

$\texttt{MemoryRewardBench}$: Benchmarking Reward Models for Long-Term Memory Management in Large Language Models

Add code
Jan 17, 2026
Viaarxiv icon

LANCET: Neural Intervention via Structural Entropy for Mitigating Faithfulness Hallucinations in LLMs

Add code
Jan 04, 2026
Viaarxiv icon

GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Add code
Jul 02, 2025
Figure 1 for GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning
Figure 2 for GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning
Figure 3 for GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning
Figure 4 for GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning
Viaarxiv icon

SFNet: Fusion of Spatial and Frequency-Domain Features for Remote Sensing Image Forgery Detection

Add code
Jun 25, 2025
Figure 1 for SFNet: Fusion of Spatial and Frequency-Domain Features for Remote Sensing Image Forgery Detection
Figure 2 for SFNet: Fusion of Spatial and Frequency-Domain Features for Remote Sensing Image Forgery Detection
Figure 3 for SFNet: Fusion of Spatial and Frequency-Domain Features for Remote Sensing Image Forgery Detection
Figure 4 for SFNet: Fusion of Spatial and Frequency-Domain Features for Remote Sensing Image Forgery Detection
Viaarxiv icon

TextVidBench: A Benchmark for Long Video Scene Text Understanding

Add code
Jun 05, 2025
Viaarxiv icon

Hard Negative Contrastive Learning for Fine-Grained Geometric Understanding in Large Multimodal Models

Add code
May 26, 2025
Viaarxiv icon

Lightweight Transformer via Unrolling of Mixed Graph Algorithms for Traffic Forecast

Add code
May 19, 2025
Figure 1 for Lightweight Transformer via Unrolling of Mixed Graph Algorithms for Traffic Forecast
Figure 2 for Lightweight Transformer via Unrolling of Mixed Graph Algorithms for Traffic Forecast
Figure 3 for Lightweight Transformer via Unrolling of Mixed Graph Algorithms for Traffic Forecast
Figure 4 for Lightweight Transformer via Unrolling of Mixed Graph Algorithms for Traffic Forecast
Viaarxiv icon

An LMM for Efficient Video Understanding via Reinforced Compression of Video Cubes

Add code
Apr 21, 2025
Figure 1 for An LMM for Efficient Video Understanding via Reinforced Compression of Video Cubes
Figure 2 for An LMM for Efficient Video Understanding via Reinforced Compression of Video Cubes
Figure 3 for An LMM for Efficient Video Understanding via Reinforced Compression of Video Cubes
Figure 4 for An LMM for Efficient Video Understanding via Reinforced Compression of Video Cubes
Viaarxiv icon