Rajarshi Roy

Max Planck Institute for Software Systems, Germany

About Time: Model-free Reinforcement Learning with Timed Reward Machines

Dec 19, 2025
Via arXiv

A Comprehensive Dataset for Human vs. AI Generated Text Detection

Oct 26, 2025
Via arXiv

DETONATE: A Benchmark for Text-to-Image Alignment and Kernelized Direct Preference Optimization

Jun 17, 2025
Via arXiv

Learning Probabilistic Temporal Logic Specifications for Stochastic Systems

May 17, 2025
Via arXiv

What is Formal Verification without Specifications? A Survey on mining LTL Specifications

Jan 27, 2025
Via arXiv

DPO Kernels: A Semantically-Aware, Kernel-Enhanced, and Divergence-Rich Paradigm for Direct Preference Optimization

Jan 08, 2025
Via arXiv

Nemotron-4 340B Technical Report

Jun 17, 2024
Via arXiv

CircuitVAE: Efficient and Scalable Latent Circuit Optimization

Jun 13, 2024
Via arXiv

NV-Embed: Improved Techniques for Training LLMs as Generalist Embedding Models

May 27, 2024
Via arXiv

A High-Fidelity Simulation Framework for Grasping Stability Analysis in Human Casualty Manipulation

Apr 04, 2024
Via arXiv