Mit Benchmark


HypER: Hyperbolic Echo State Networks for Capturing Stretch-and-Fold Dynamics in Chaotic Flows

Add code
Aug 25, 2025
Viaarxiv icon

MT$^{3}$: Scaling MLLM-based Text Image Machine Translation via Multi-Task Reinforcement Learning

Add code
May 26, 2025
Viaarxiv icon

GeistBERT: Breathing Life into German NLP

Add code
Jun 13, 2025
Viaarxiv icon

WixQA: A Multi-Dataset Benchmark for Enterprise Retrieval-Augmented Generation

Add code
May 13, 2025
Viaarxiv icon

Ensuring Reproducibility in Generative AI Systems for General Use Cases: A Framework for Regression Testing and Open Datasets

Add code
May 02, 2025
Viaarxiv icon

T-VEC: A Telecom-Specific Vectorization Model with Enhanced Semantic Understanding via Deep Triplet Loss Fine-Tuning

Add code
Apr 23, 2025
Viaarxiv icon

Improving Multilingual Capabilities with Cultural and Local Knowledge in Large Language Models While Enhancing Native Performance

Add code
Apr 13, 2025
Figure 1 for Improving Multilingual Capabilities with Cultural and Local Knowledge in Large Language Models While Enhancing Native Performance
Figure 2 for Improving Multilingual Capabilities with Cultural and Local Knowledge in Large Language Models While Enhancing Native Performance
Figure 3 for Improving Multilingual Capabilities with Cultural and Local Knowledge in Large Language Models While Enhancing Native Performance
Figure 4 for Improving Multilingual Capabilities with Cultural and Local Knowledge in Large Language Models While Enhancing Native Performance
Viaarxiv icon

LEMUR Neural Network Dataset: Towards Seamless AutoML

Add code
Apr 14, 2025
Viaarxiv icon

PyGDA: A Python Library for Graph Domain Adaptation

Add code
Mar 13, 2025
Figure 1 for PyGDA: A Python Library for Graph Domain Adaptation
Viaarxiv icon

VLDBench: Vision Language Models Disinformation Detection Benchmark

Add code
Feb 17, 2025
Figure 1 for VLDBench: Vision Language Models Disinformation Detection Benchmark
Figure 2 for VLDBench: Vision Language Models Disinformation Detection Benchmark
Figure 3 for VLDBench: Vision Language Models Disinformation Detection Benchmark
Figure 4 for VLDBench: Vision Language Models Disinformation Detection Benchmark
Viaarxiv icon