Sotaro Takeshita

CommonLID: Re-evaluating State-of-the-Art Language Identification Performance on Web Data

Jan 25, 2026

GerAV: Towards New Heights in German Authorship Verification using Fine-Tuned LLMs on a New Benchmark

Jan 20, 2026

SciLaD: A Large-Scale, Transparent, Reproducible Dataset for Natural Scientific Language Processing

Dec 12, 2025

Randomly Removing 50% of Dimensions in Text Embeddings has Minimal Impact on Retrieval and Classification Tasks

Aug 25, 2025

DeepSeek vs. o3-mini: How Well can Reasoning LLMs Evaluate MT and Summarization?

Apr 10, 2025

ROUGE-K: Do Your Summaries Have Keywords?

Mar 08, 2024

ACLSum: A New Dataset for Aspect-based Summarization of Scientific Publications

Mar 08, 2024

VADIS -- a VAriable Detection, Interlinking and Summarization system

Dec 20, 2023

Towards Automated Survey Variable Search and Summarization in Social Science Publications

Sep 14, 2022

X-SCITLDR: Cross-Lingual Extreme Summarization of Scholarly Documents

May 30, 2022