Topic


What Really Matters in Many-Shot Attacks? An Empirical Study of Long-Context Vulnerabilities in LLMs

May 26, 2025
Via arXiv

Delving into Multilingual Ethical Bias: The MSQAD with Statistical Hypothesis Tests for Large Language Models

May 25, 2025
Via arXiv

Do Large Language Models (Really) Need Statistical Foundations?

May 25, 2025
Via arXiv

LLLMs: A Data-Driven Survey of Evolving Research on Limitations of Large Language Models

May 25, 2025
Via arXiv

Hypercube-RAG: Hypercube-Based Retrieval-Augmented Generation for In-domain Scientific Question-Answering

May 25, 2025
Via arXiv

Multilingual Question Answering in Low-Resource Settings: A Dzongkha-English Benchmark for Foundation Models

May 24, 2025
Via arXiv

B-score: Detecting biases in large language models using response history

May 24, 2025
Via arXiv

Writing Like the Best: Exemplar-Based Expository Text Generation

May 24, 2025
Via arXiv

Towards an automatic method for generating topical vocabulary test forms for specific reading passages

May 24, 2025
Via arXiv

From Generation to Detection: A Multimodal Multi-Task Dataset for Benchmarking Health Misinformation

May 24, 2025
Via arXiv