Picture for N J Karthika

N J Karthika

Samasāmayik: A Parallel Dataset for Hindi-Sanskrit Machine Translation

Add code
Mar 25, 2026
Viaarxiv icon

MorphTok: Morphologically Grounded Tokenization for Indian Languages

Add code
Apr 14, 2025
Figure 1 for MorphTok: Morphologically Grounded Tokenization for Indian Languages
Figure 2 for MorphTok: Morphologically Grounded Tokenization for Indian Languages
Figure 3 for MorphTok: Morphologically Grounded Tokenization for Indian Languages
Figure 4 for MorphTok: Morphologically Grounded Tokenization for Indian Languages
Viaarxiv icon

Semantically Cohesive Word Grouping in Indian Languages

Add code
Jan 07, 2025
Figure 1 for Semantically Cohesive Word Grouping in Indian Languages
Figure 2 for Semantically Cohesive Word Grouping in Indian Languages
Figure 3 for Semantically Cohesive Word Grouping in Indian Languages
Figure 4 for Semantically Cohesive Word Grouping in Indian Languages
Viaarxiv icon