Opus 100


Regional Bias in Large Language Models

Add code
Jan 22, 2026
Viaarxiv icon

Does Inference Scaling Improve Reasoning Faithfulness? A Multi-Model Analysis of Self-Consistency Tradeoffs

Add code
Jan 10, 2026
Viaarxiv icon

Can Large Language Models Solve Engineering Equations? A Systematic Comparison of Direct Prediction and Solver-Assisted Approaches

Add code
Jan 05, 2026
Viaarxiv icon

LegalRikai: Open Benchmark -- Benchmark for Complex Japanese Corporate Legal Tasks

Add code
Dec 15, 2025
Viaarxiv icon

AA-Omniscience: Evaluating Cross-Domain Knowledge Reliability in Large Language Models

Add code
Nov 17, 2025
Figure 1 for AA-Omniscience: Evaluating Cross-Domain Knowledge Reliability in Large Language Models
Figure 2 for AA-Omniscience: Evaluating Cross-Domain Knowledge Reliability in Large Language Models
Figure 3 for AA-Omniscience: Evaluating Cross-Domain Knowledge Reliability in Large Language Models
Figure 4 for AA-Omniscience: Evaluating Cross-Domain Knowledge Reliability in Large Language Models
Viaarxiv icon

ORGEval: Graph-Theoretic Evaluation of LLMs in Optimization Modeling

Add code
Oct 31, 2025
Viaarxiv icon

Reactor Mk.1 performances: MMLU, HumanEval and BBH test results

Add code
Jun 15, 2024
Figure 1 for Reactor Mk.1 performances: MMLU, HumanEval and BBH test results
Viaarxiv icon

LACE: A light-weight, causal model for enhancing coded speech through adaptive convolutions

Add code
Jul 13, 2023
Figure 1 for LACE: A light-weight, causal model for enhancing coded speech through adaptive convolutions
Figure 2 for LACE: A light-weight, causal model for enhancing coded speech through adaptive convolutions
Figure 3 for LACE: A light-weight, causal model for enhancing coded speech through adaptive convolutions
Figure 4 for LACE: A light-weight, causal model for enhancing coded speech through adaptive convolutions
Viaarxiv icon

Who is Speaking Actually? Robust and Versatile Speaker Traceability for Voice Conversion

Add code
May 09, 2023
Figure 1 for Who is Speaking Actually? Robust and Versatile Speaker Traceability for Voice Conversion
Figure 2 for Who is Speaking Actually? Robust and Versatile Speaker Traceability for Voice Conversion
Figure 3 for Who is Speaking Actually? Robust and Versatile Speaker Traceability for Voice Conversion
Figure 4 for Who is Speaking Actually? Robust and Versatile Speaker Traceability for Voice Conversion
Viaarxiv icon

Demystify Optimization Challenges in Multilingual Transformers

Add code
Apr 20, 2021
Figure 1 for Demystify Optimization Challenges in Multilingual Transformers
Figure 2 for Demystify Optimization Challenges in Multilingual Transformers
Figure 3 for Demystify Optimization Challenges in Multilingual Transformers
Figure 4 for Demystify Optimization Challenges in Multilingual Transformers
Viaarxiv icon