Yizhen Zhang

See Less, See Right: Bi-directional Perceptual Shaping For Multimodal Reasoning

Dec 26, 2025

VideoZoomer: Reinforcement-Learned Temporal Focusing for Long Video Reasoning

Dec 26, 2025

OpenAI GPT-5 System Card

Dec 19, 2025

MeSH: Memory-as-State-Highways for Recursive Transformers

Oct 09, 2025

PeRL: Permutation-Enhanced Reinforcement Learning for Interleaved Vision-Language Reasoning

Jun 17, 2025

UQABench: Evaluating User Embedding for Prompting LLMs in Personalized Question Answering

Feb 26, 2025

Unlocking Scaling Law in Industrial Recommendation Systems with a Three-step Paradigm based Large User Model

Feb 12, 2025

A Dual-Stream Neural Network Explains the Functional Segregation of Dorsal and Ventral Visual Pathways in Human Brains

Oct 20, 2023

Human Eyes Inspired Recurrent Neural Networks are More Robust Against Adversarial Noises

Jun 15, 2022

Explainable Semantic Space by Grounding Language to Vision with Cross-Modal Contrastive Learning

Nov 13, 2021