Alberto Mario Ceballos Arroyo

Do Natural Language Descriptions of Model Activations Convey Privileged Information?

Sep 16, 2025
Via arXiv

Open (Clinical) LLMs are Sensitive to Instruction Phrasings

Jul 12, 2024
Via arXiv