Picture for Fahad Shahbaz Khan

Fahad Shahbaz Khan

Hierarchical Self-Supervised Adversarial Training for Robust Vision Models in Histopathology

Add code
Mar 13, 2025
Figure 1 for Hierarchical Self-Supervised Adversarial Training for Robust Vision Models in Histopathology
Figure 2 for Hierarchical Self-Supervised Adversarial Training for Robust Vision Models in Histopathology
Figure 3 for Hierarchical Self-Supervised Adversarial Training for Robust Vision Models in Histopathology
Figure 4 for Hierarchical Self-Supervised Adversarial Training for Robust Vision Models in Histopathology
Viaarxiv icon

LLM Post-Training: A Deep Dive into Reasoning Large Language Models

Add code
Feb 28, 2025
Viaarxiv icon

C-Drag: Chain-of-Thought Driven Motion Controller for Video Generation

Add code
Feb 27, 2025
Figure 1 for C-Drag: Chain-of-Thought Driven Motion Controller for Video Generation
Figure 2 for C-Drag: Chain-of-Thought Driven Motion Controller for Video Generation
Figure 3 for C-Drag: Chain-of-Thought Driven Motion Controller for Video Generation
Figure 4 for C-Drag: Chain-of-Thought Driven Motion Controller for Video Generation
Viaarxiv icon

AirCast: Improving Air Pollution Forecasting Through Multi-Variable Data Alignment

Add code
Feb 25, 2025
Figure 1 for AirCast: Improving Air Pollution Forecasting Through Multi-Variable Data Alignment
Figure 2 for AirCast: Improving Air Pollution Forecasting Through Multi-Variable Data Alignment
Figure 3 for AirCast: Improving Air Pollution Forecasting Through Multi-Variable Data Alignment
Figure 4 for AirCast: Improving Air Pollution Forecasting Through Multi-Variable Data Alignment
Viaarxiv icon

Time Travel: A Comprehensive Benchmark to Evaluate LMMs on Historical and Cultural Artifacts

Add code
Feb 20, 2025
Viaarxiv icon

InterLCM: Low-Quality Images as Intermediate States of Latent Consistency Models for Effective Blind Face Restoration

Add code
Feb 04, 2025
Figure 1 for InterLCM: Low-Quality Images as Intermediate States of Latent Consistency Models for Effective Blind Face Restoration
Figure 2 for InterLCM: Low-Quality Images as Intermediate States of Latent Consistency Models for Effective Blind Face Restoration
Figure 3 for InterLCM: Low-Quality Images as Intermediate States of Latent Consistency Models for Effective Blind Face Restoration
Figure 4 for InterLCM: Low-Quality Images as Intermediate States of Latent Consistency Models for Effective Blind Face Restoration
Viaarxiv icon

LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs

Add code
Jan 10, 2025
Figure 1 for LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs
Figure 2 for LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs
Figure 3 for LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs
Figure 4 for LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs
Viaarxiv icon

Mask Factory: Towards High-quality Synthetic Data Generation for Dichotomous Image Segmentation

Add code
Dec 26, 2024
Viaarxiv icon

Discriminative Image Generation with Diffusion Models for Zero-Shot Learning

Add code
Dec 23, 2024
Figure 1 for Discriminative Image Generation with Diffusion Models for Zero-Shot Learning
Figure 2 for Discriminative Image Generation with Diffusion Models for Zero-Shot Learning
Figure 3 for Discriminative Image Generation with Diffusion Models for Zero-Shot Learning
Figure 4 for Discriminative Image Generation with Diffusion Models for Zero-Shot Learning
Viaarxiv icon

EarthDial: Turning Multi-sensory Earth Observations to Interactive Dialogues

Add code
Dec 19, 2024
Figure 1 for EarthDial: Turning Multi-sensory Earth Observations to Interactive Dialogues
Figure 2 for EarthDial: Turning Multi-sensory Earth Observations to Interactive Dialogues
Figure 3 for EarthDial: Turning Multi-sensory Earth Observations to Interactive Dialogues
Figure 4 for EarthDial: Turning Multi-sensory Earth Observations to Interactive Dialogues
Viaarxiv icon