An unsupervised approach for sentiment analysis via financial texts
DOI:
10.46223/HCMCOUJS.tech.en.15.2.3684.2025
Keywords:
autoencoder; deep clustering; natural language processing; transformer; unsupervised sentiment analysis
Abstract
The rapidly increasing volume of textual data has made manual labeling extremely costly and time-consuming. To address this limitation, researchers have increasingly turned to unsupervised learning techniques that enable models to classify text without relying on labeled data. Among these, deep clustering has garnered significant interest. However, most existing deep clustering methods are designed primarily for computer vision tasks. In this paper, we propose modifications to two of the most powerful deep clustering methods, DEKM and DeepCluster, integrating transformer models from the Natural Language Processing (NLP) domain so that these methods can handle textual data. With the proposed methods, our best results are an accuracy of 57.71% on the test set of the Financial Phrase Bank (FPB) dataset and 65.58% on the test set of the Twitter Financial News (TFN) dataset. Although these results are still lower than those of traditional supervised deep learning methods, we demonstrate that the performance of the proposed methods can be further improved when they are trained with more data. This highlights the promising potential of deep clustering methods for natural language processing tasks, especially tasks where the data is unlabeled or insufficiently labeled.
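Reporting accuracy for a clustering method requires some rule for matching cluster ids to sentiment labels, and the abstract does not say which rule the authors used. The sketch below shows one common convention as an illustration only: take the best accuracy over all one-to-one mappings from clusters to classes. The function name `clustering_accuracy`, the choice of three classes, and the toy data are all assumptions made for this example, not details from the paper.

```python
from itertools import permutations


def clustering_accuracy(true_labels, cluster_ids, n_classes=3):
    """Best accuracy over all one-to-one mappings of cluster ids to class labels.

    Hypothetical helper for illustration; not the paper's evaluation code.
    Labels and cluster ids are assumed to be integers in range(n_classes).
    """
    if len(true_labels) != len(cluster_ids):
        raise ValueError("inputs must have the same length")
    if not true_labels:
        raise ValueError("inputs must be non-empty")

    # counts[c][k] = number of samples in cluster c whose true label is k
    counts = [[0] * n_classes for _ in range(n_classes)]
    for label, cluster in zip(true_labels, cluster_ids):
        counts[cluster][label] += 1

    # Brute force over permutations is acceptable for a handful of classes.
    best_matches = max(
        sum(counts[c][perm[c]] for c in range(n_classes))
        for perm in permutations(range(n_classes))
    )
    return best_matches / len(true_labels)


# Toy data (made up): 0 = negative, 1 = neutral, 2 = positive
true_labels = [0, 0, 1, 1, 2, 2]
cluster_ids = [2, 2, 0, 0, 1, 0]
accuracy = clustering_accuracy(true_labels, cluster_ids)
```

Enumerating permutations keeps the example dependency-free, but its cost grows factorially with the number of clusters. For larger cluster counts, an assignment solver such as `scipy.optimize.linear_sum_assignment` is the usual replacement.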
Received:
23-08-2024
Accepted:
17-10-2024
Published:
13-01-2025
How to Cite
Pham, C. C., Nguyen, B. V., & Nguyen, H. Q. (2025). An unsupervised approach for sentiment analysis via financial texts. HO CHI MINH CITY OPEN UNIVERSITY JOURNAL OF SCIENCE - ENGINEERING AND TECHNOLOGY, 15(2), 46–54. https://doi.org/10.46223/HCMCOUJS.tech.en.15.2.3684.2025
License
Copyright (c) 2025 Cong Chi Pham; Bay Van Nguyen; Huy Quoc Nguyen

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.