Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Jinwoong Kim

GroupSHAP-Guided Integration of Financial News Keywords and Technical Indicators for Stock Price Prediction

Oct 27, 2025

Minjoo Kim, Jinwoong Kim, Sangjin Park

Abstract:Recent advances in finance-specific language models such as FinBERT have enabled the quantification of public sentiment into index-based measures, yet compressing diverse linguistic signals into single metrics overlooks contextual nuances and limits interpretability. To address this limitation, explainable AI techniques, particularly SHAP (SHapley Additive Explanations), have been employed to identify influential features. However, SHAP's computational cost grows exponentially with input features, making it impractical for large-scale text-based financial data. This study introduces a GRU-based forecasting framework enhanced with GroupSHAP, which quantifies contributions of semantically related keyword groups rather than individual tokens, substantially reducing computational burden while preserving interpretability. We employed FinBERT to embed news articles from 2015 to 2024, clustered them into coherent semantic groups, and applied GroupSHAP to measure each group's contribution to stock price movements. The resulting group-level SHAP variables across multiple topics were used as input features for the prediction model. Empirical results from one-day-ahead forecasting of the S&P 500 index throughout 2024 demonstrate that our approach achieves a 32.2% reduction in MAE and a 40.5% reduction in RMSE compared with benchmark models without the GroupSHAP mechanism. This research presents the first application of GroupSHAP in news-driven financial forecasting, showing that grouped sentiment representations simultaneously enhance interpretability and predictive performance.

* ICAIF 2025 Workshop
* 6 pages

Via

Access Paper or Ask Questions

DaG LLM ver 1.0: Pioneering Instruction-Tuned Language Modeling for Korean NLP

Nov 23, 2023

Dongjun Jang, Sangah Lee, Sungjoo Byun, Jinwoong Kim, Jean Seo, Minseok Kim, Soyeon Kim, Chaeyoung Oh, Jaeyoon Kim, Hyemi Jo(+1 more)

Abstract:This paper presents the DaG LLM (David and Goliath Large Language Model), a language model specialized for Korean and fine-tuned through Instruction Tuning across 41 tasks within 13 distinct categories.

Via

Access Paper or Ask Questions

CHOPT : Automated Hyperparameter Optimization Framework for Cloud-Based Machine Learning Platforms

Oct 16, 2018

Jinwoong Kim, Minkyu Kim, Heungseok Park, Ernar Kusdavletov, Dongjun Lee, Adrian Kim, Ji-Hoon Kim, Jung-Woo Ha, Nako Sung

Figure 1 for CHOPT : Automated Hyperparameter Optimization Framework for Cloud-Based Machine Learning Platforms

Figure 2 for CHOPT : Automated Hyperparameter Optimization Framework for Cloud-Based Machine Learning Platforms

Figure 3 for CHOPT : Automated Hyperparameter Optimization Framework for Cloud-Based Machine Learning Platforms

Figure 4 for CHOPT : Automated Hyperparameter Optimization Framework for Cloud-Based Machine Learning Platforms

Abstract:Many hyperparameter optimization (HyperOpt) methods assume restricted computing resources and mainly focus on enhancing performance. Here we propose a novel cloud-based HyperOpt (CHOPT) framework which can efficiently utilize shared computing resources while supporting various HyperOpt algorithms. We incorporate convenient web-based user interfaces, visualization, and analysis tools, enabling users to easily control optimization procedures and build up valuable insights with an iterative analysis procedure. Furthermore, our framework can be incorporated with any cloud platform, thus complementarily increasing the efficiency of conventional deep learning frameworks. We demonstrate applications of CHOPT with tasks such as image recognition and question-answering, showing that our framework can find hyperparameter configurations competitive with previous work. We also show CHOPT is capable of providing interesting observations through its analysing tools

* 10 pages, 9 figures. arXiv admin note: text overlap with arXiv:1807.01774 by other authors

Via

Access Paper or Ask Questions

NSML: Meet the MLaaS platform with a real-world case study

Oct 08, 2018

Hanjoo Kim, Minkyu Kim, Dongjoo Seo, Jinwoong Kim, Heungseok Park, Soeun Park, Hyunwoo Jo, KyungHyun Kim, Youngil Yang, Youngkwan Kim(+2 more)

Figure 1 for NSML: Meet the MLaaS platform with a real-world case study

Figure 2 for NSML: Meet the MLaaS platform with a real-world case study

Figure 3 for NSML: Meet the MLaaS platform with a real-world case study

Figure 4 for NSML: Meet the MLaaS platform with a real-world case study

Abstract:The boom of deep learning induced many industries and academies to introduce machine learning based approaches into their concern, competitively. However, existing machine learning frameworks are limited to sufficiently fulfill the collaboration and management for both data and models. We proposed NSML, a machine learning as a service (MLaaS) platform, to meet these demands. NSML helps machine learning work be easily launched on a NSML cluster and provides a collaborative environment which can afford development at enterprise scale. Finally, NSML users can deploy their own commercial services with NSML cluster. In addition, NSML furnishes convenient visualization tools which assist the users in analyzing their work. To verify the usefulness and accessibility of NSML, we performed some experiments with common examples. Furthermore, we examined the collaborative advantages of NSML through three competitions with real-world use cases.

Via

Access Paper or Ask Questions