Sergey Troshin

HSE University, Russia

CodeBPE: Investigating Subtokenization Options for Large Language Model Pretraining on Source Code

Aug 01, 2023
Nadezhda Chirkova, Sergey Troshin

SantaCoder: don't reach for the stars!

Jan 09, 2023
Loubna Ben Allal, Raymond Li, Denis Kocetkov, Chenghao Mou, Christopher Akiki, Carlos Munoz Ferrandis, Niklas Muennighoff, Mayank Mishra, Alex Gu, Manan Dey, Logesh Kumar Umapathi, Carolyn Jane Anderson, Yangtian Zi, Joel Lamy Poirier, Hailey Schoelkopf, Sergey Troshin, Dmitry Abulkhanov, Manuel Romero, Michael Lappert, Francesco De Toni, Bernardo García del Río, Qian Liu, Shamik Bose, Urvashi Bhattacharyya, Terry Yue Zhuo, Ian Yu, Paulo Villegas, Marco Zocca, Sourab Mangrulkar, David Lansky, Huu Nguyen, Danish Contractor, Luis Villa, Jia Li, Dzmitry Bahdanau, Yacine Jernite, Sean Hughes, Daniel Fried, Arjun Guha, Harm de Vries, Leandro von Werra

Probing Pretrained Models of Source Code

Feb 16, 2022
Sergey Troshin, Nadezhda Chirkova

Machine Learning Methods for Spectral Efficiency Prediction in Massive MIMO Systems

Dec 29, 2021
Evgeny Bobrov, Sergey Troshin, Nadezhda Chirkova, Ekaterina Lobacheva, Sviatoslav Panchenko, Dmitry Vetrov, Dmitry Kropotov

A Simple Approach for Handling Out-of-Vocabulary Identifiers in Deep Learning for Source Code

Oct 23, 2020
Nadezhda Chirkova, Sergey Troshin

Empirical Study of Transformers for Source Code

Oct 15, 2020
Nadezhda Chirkova, Sergey Troshin
