Picture for Muhammad Kamran Janjua

Muhammad Kamran Janjua

Don't Show Pixels, Show Cues: Unlocking Visual Tool Reasoning in Language Models via Perception Programs

Add code
Apr 14, 2026
Viaarxiv icon

Panoptic Pairwise Distortion Graph

Add code
Apr 13, 2026
Viaarxiv icon

Learning Truncated Causal History Model for Video Restoration

Add code
Oct 15, 2024
Figure 1 for Learning Truncated Causal History Model for Video Restoration
Figure 2 for Learning Truncated Causal History Model for Video Restoration
Figure 3 for Learning Truncated Causal History Model for Video Restoration
Figure 4 for Learning Truncated Causal History Model for Video Restoration
Viaarxiv icon

CascadedGaze: Efficiency in Global Context Extraction for Image Restoration

Add code
Jan 26, 2024
Figure 1 for CascadedGaze: Efficiency in Global Context Extraction for Image Restoration
Figure 2 for CascadedGaze: Efficiency in Global Context Extraction for Image Restoration
Figure 3 for CascadedGaze: Efficiency in Global Context Extraction for Image Restoration
Figure 4 for CascadedGaze: Efficiency in Global Context Extraction for Image Restoration
Viaarxiv icon

GVFs in the Real World: Making Predictions Online for Water Treatment

Add code
Dec 04, 2023
Viaarxiv icon

Movement-induced Priors for Deep Stereo

Add code
Oct 18, 2020
Figure 1 for Movement-induced Priors for Deep Stereo
Figure 2 for Movement-induced Priors for Deep Stereo
Figure 3 for Movement-induced Priors for Deep Stereo
Figure 4 for Movement-induced Priors for Deep Stereo
Viaarxiv icon

Deep Latent Space Learning for Cross-modal Mapping of Audio and Visual Signals

Add code
Sep 18, 2019
Figure 1 for Deep Latent Space Learning for Cross-modal Mapping of Audio and Visual Signals
Figure 2 for Deep Latent Space Learning for Cross-modal Mapping of Audio and Visual Signals
Figure 3 for Deep Latent Space Learning for Cross-modal Mapping of Audio and Visual Signals
Figure 4 for Deep Latent Space Learning for Cross-modal Mapping of Audio and Visual Signals
Viaarxiv icon

Do Cross Modal Systems Leverage Semantic Relationships?

Add code
Sep 03, 2019
Figure 1 for Do Cross Modal Systems Leverage Semantic Relationships?
Figure 2 for Do Cross Modal Systems Leverage Semantic Relationships?
Figure 3 for Do Cross Modal Systems Leverage Semantic Relationships?
Figure 4 for Do Cross Modal Systems Leverage Semantic Relationships?
Viaarxiv icon

Learning Inward Scaled Hypersphere Embedding: Exploring Projections in Higher Dimensions

Add code
Oct 16, 2018
Figure 1 for Learning Inward Scaled Hypersphere Embedding: Exploring Projections in Higher Dimensions
Figure 2 for Learning Inward Scaled Hypersphere Embedding: Exploring Projections in Higher Dimensions
Figure 3 for Learning Inward Scaled Hypersphere Embedding: Exploring Projections in Higher Dimensions
Figure 4 for Learning Inward Scaled Hypersphere Embedding: Exploring Projections in Higher Dimensions
Viaarxiv icon

Image and Encoded Text Fusion for Multi-Modal Classification

Add code
Oct 03, 2018
Figure 1 for Image and Encoded Text Fusion for Multi-Modal Classification
Figure 2 for Image and Encoded Text Fusion for Multi-Modal Classification
Figure 3 for Image and Encoded Text Fusion for Multi-Modal Classification
Figure 4 for Image and Encoded Text Fusion for Multi-Modal Classification
Viaarxiv icon