Picture for Vinija Jain

Vinija Jain

Reasoning or Rhetoric? An Empirical Analysis of Moral Reasoning Explanations in Large Language Models

Add code
Mar 23, 2026
Viaarxiv icon

The Reasoning Trap -- Logical Reasoning as a Mechanistic Pathway to Situational Awareness

Add code
Mar 10, 2026
Viaarxiv icon

When Shallow Wins: Silent Failures and the Depth-Accuracy Paradox in Latent Reasoning

Add code
Mar 03, 2026
Viaarxiv icon

I Can't Believe It's Not Robust: Catastrophic Collapse of Safety Classifiers under Embedding Drift

Add code
Mar 01, 2026
Viaarxiv icon

Neural FOXP2 -- Language Specific Neuron Steering for Targeted Language Improvement in LLMs

Add code
Feb 01, 2026
Viaarxiv icon

Stochastic CHAOS: Why Deterministic Inference Kills, and Distributional Variability Is the Heartbeat of Artifical Cognition

Add code
Jan 12, 2026
Viaarxiv icon

AlignMerge - Alignment-Preserving Large Language Model Merging via Fisher-Guided Geometric Constraints

Add code
Dec 18, 2025
Figure 1 for AlignMerge - Alignment-Preserving Large Language Model Merging via Fisher-Guided Geometric Constraints
Figure 2 for AlignMerge - Alignment-Preserving Large Language Model Merging via Fisher-Guided Geometric Constraints
Figure 3 for AlignMerge - Alignment-Preserving Large Language Model Merging via Fisher-Guided Geometric Constraints
Figure 4 for AlignMerge - Alignment-Preserving Large Language Model Merging via Fisher-Guided Geometric Constraints
Viaarxiv icon

A Comprehensive Dataset for Human vs. AI Generated Text Detection

Add code
Oct 26, 2025
Figure 1 for A Comprehensive Dataset for Human vs. AI Generated Text Detection
Figure 2 for A Comprehensive Dataset for Human vs. AI Generated Text Detection
Figure 3 for A Comprehensive Dataset for Human vs. AI Generated Text Detection
Figure 4 for A Comprehensive Dataset for Human vs. AI Generated Text Detection
Viaarxiv icon

Investigating Hallucination in Conversations for Low Resource Languages

Add code
Jul 30, 2025
Viaarxiv icon

AdversariaL attacK sAfety aLIgnment(ALKALI): Safeguarding LLMs through GRACE: Geometric Representation-Aware Contrastive Enhancement- Introducing Adversarial Vulnerability Quality Index (AVQI)

Add code
Jun 11, 2025
Figure 1 for AdversariaL attacK sAfety aLIgnment(ALKALI): Safeguarding LLMs through GRACE: Geometric Representation-Aware Contrastive Enhancement- Introducing Adversarial Vulnerability Quality Index (AVQI)
Figure 2 for AdversariaL attacK sAfety aLIgnment(ALKALI): Safeguarding LLMs through GRACE: Geometric Representation-Aware Contrastive Enhancement- Introducing Adversarial Vulnerability Quality Index (AVQI)
Figure 3 for AdversariaL attacK sAfety aLIgnment(ALKALI): Safeguarding LLMs through GRACE: Geometric Representation-Aware Contrastive Enhancement- Introducing Adversarial Vulnerability Quality Index (AVQI)
Figure 4 for AdversariaL attacK sAfety aLIgnment(ALKALI): Safeguarding LLMs through GRACE: Geometric Representation-Aware Contrastive Enhancement- Introducing Adversarial Vulnerability Quality Index (AVQI)
Viaarxiv icon