Raymond Douglas

The Artificial Self: Characterising the landscape of AI identity

Mar 11, 2026
Via arXiv

Latent Introspection: Models Can Detect Prior Concept Injections

Feb 26, 2026
Via arXiv

Who's in Charge? Disempowerment Patterns in Real-World LLM Usage

Jan 27, 2026
Via arXiv

Evaluating Language Model Character Traits

Oct 05, 2024
Via arXiv

Limitations of Agents Simulated by Predictive Models

Feb 08, 2024
Via arXiv

Mitigating the Problem of Strong Priors in LMs with Context Extrapolation

Jan 31, 2024
Via arXiv