Miguel Ballesteros

Tree-based Dialogue Reinforced Policy Optimization for Red-Teaming Attacks

Oct 02, 2025
Via arXiv

MetaSynth: Meta-Prompting-Driven Agentic Scaffolds for Diverse Synthetic Data Generation

Apr 17, 2025
Via arXiv

Unraveling and Mitigating Safety Alignment Degradation of Vision-Language Models

Oct 11, 2024
Via arXiv

Detecting Training Data of Large Language Models via Expectation Maximization

Oct 10, 2024
Via arXiv

Active Evaluation Acquisition for Efficient LLM Benchmarking

Oct 08, 2024
Via arXiv

General Purpose Verification for Chain of Thought Prompting

Apr 30, 2024
Via arXiv

NewsQs: Multi-Source Question Generation for the Inquiring Mind

Feb 28, 2024
Via arXiv

Characterizing and Measuring Linguistic Dataset Drift

May 26, 2023
Via arXiv

Taxonomy Expansion for Named Entity Recognition

May 22, 2023
Via arXiv

A Weak Supervision Approach for Few-Shot Aspect Based Sentiment

May 19, 2023
Via arXiv