Journal of Enterprise and Business Intelligence


Analyzing the Interplay Between Social Media Sentiment and Traditional Public Opinion in Politics



Journal of Enterprise and Business Intelligence

Received On : 06 January 2025

Revised On : 12 March 2025

Accepted On : 16 May 2025

Published On : 05 October 2025

Volume 05, Issue 04

Pages : 187-197


Abstract


In the present day, due to the growing number of Web 2.0 tools, users are producing vast quantities of data in a massive and constantly changing manner. Opinion mining or Sentiment analysis (SA) is an informative technique to help obtain useful information from users’ data automatically. Over the years, a number of SA challenges have been solved using deep learning techniques which has led to the achievement of state-of-art performances. Thus, it is necessary to solicit help to help the researchers to learn the present progress and outstanding issues to be solved quickly. This paper discusses the multidimensional nature of public opinion in South Korea through analyzing the entire Korean tweets from January 1, 2023 to December 31, 2023. Using Twitter (X) API, we collected over 5 million tweets with emphasis on phrases related to South Korean president and significant events in the country. Through the application of set filters, the authors were able to arrive at a dataset of approximately 4 million tweets. These developed tweets were then taken through rigorous preprocessing in order to make them ready for SA. In the current work, the BERT model was chosen to act as the major focus of the study. This model was particularly trained on a given tagged dataset of Korean text for the purpose of classifying the text’s sentiment. The findings of the present study reveal complex patterns of the processes that define the nature of online sentiment, thus revealing that several forces work in concert to form public opinion. Despite certain concordances between the results of the online SA and the offline public opinion polls on presidential job performance approval, the relationship is rather weak.


Keywords


Sentiment Analysis, Sentence-Level Sentiment Analysis, Supervised Deep Learning Methods, Bidirectional Encoder Representations from Transformers Model, Word Opinion Mining.


  1. W. X. Zhao, S. Li, Y. He, E. Y. Chang, J.-R. Wen, and X. Li, “Connecting Social Media to E-Commerce: Cold-Start Product Recommendation Using Microblogging Information,” IEEE Transactions on Knowledge and Data Engineering, vol. 28, no. 5, pp. 1147–1159, May 2016, doi: 10.1109/tkde.2015.2508816.
  2. M. R. Auer, “The Policy Sciences of Social Media,” Policy Studies Journal, vol. 39, no. 4, pp. 709–736, Nov. 2011, doi: 10.1111/j.1541-0072.2011.00428.x.
  3. R. R. Pai and S. Alathur, “Assessing mobile health applications with twitter analytics,” International Journal of Medical Informatics, vol. 113, pp. 72–84, May 2018, doi: 10.1016/j.ijmedinf.2018.02.016.
  4. M. D. Devika, C. Sunitha, and A. Ganesh, “Sentiment Analysis: A Comparative Study on Different Approaches,” Procedia Computer Science, vol. 87, pp. 44–49, 2016, doi: 10.1016/j.procs.2016.05.124.
  5. K. Schouten and F. Frasincar, “Survey on Aspect-Level Sentiment Analysis,” IEEE Transactions on Knowledge and Data Engineering, vol. 28, no. 3, pp. 813–830, Mar. 2016, doi: 10.1109/tkde.2015.2485209.
  6. M. Rhanoui, M. Mikram, S. Yousfi, and S. Barzali, “A CNN-BiLSTM Model for Document-Level Sentiment Analysis,” Machine Learning and Knowledge Extraction, vol. 1, no. 3, pp. 832–847, Jul. 2019, doi: 10.3390/make1030048.
  7. M. Araújo, A. Pereira, and F. Benevenuto, “A comparative study of machine translation for multilingual sentence-level sentiment analysis,” Information Sciences, vol. 512, pp. 1078–1102, Feb. 2020, doi: 10.1016/j.ins.2019.10.031.
  8. Md. S. Islam et al., “‘Challenges and future in deep learning for sentiment analysis: a comprehensive review and a proposed novel hybrid approach,’” Artificial Intelligence Review, vol. 57, no. 3, Mar. 2024, doi: 10.1007/s10462-023-10651-9.
  9. J. Zhang, F. Liu, W. Xu, and H. Yu, “Feature Fusion Text Classification Model Combining CNN and BiGRU with Multi-Attention Mechanism,” Future Internet, vol. 11, no. 11, p. 237, Nov. 2019, doi: 10.3390/fi11110237.
  10. R. Moradi, R. Berangi, and B. Minaei, “A survey of regularization strategies for deep models,” Artificial Intelligence Review, vol. 53, no. 6, pp. 3947–3986, Dec. 2019, doi: 10.1007/s10462-019-09784-7.
  11. A. Bello, S.-C. Ng, and M.-F. Leung, “A BERT Framework to Sentiment Analysis of Tweets,” Sensors, vol. 23, no. 1, p. 506, Jan. 2023, doi: 10.3390/s23010506.
  12. L. Yue, W. Chen, X. Li, W. Zuo, and M. Yin, “A survey of sentiment analysis in social media,” Knowledge and Information Systems, vol. 60, no. 2, pp. 617–663, Jul. 2018, doi: 10.1007/s10115-018-1236-4.
  13. K. Ravi and V. Ravi, “A survey on opinion mining and sentiment analysis: Tasks, approaches and applications,” Knowledge-Based Systems, vol. 89, pp. 14–46, Nov. 2015, doi: 10.1016/j.knosys.2015.06.015.
  14. M. Hagenau, M. Liebmann, and D. Neumann, “Automated news reading: Stock price prediction based on financial news using context-capturing features,” Decision Support Systems, vol. 55, no. 3, pp. 685–697, Jun. 2013, doi: 10.1016/j.dss.2013.02.006.
  15. K. Lan, D.-T. Wang, S. Fong, L.-S. Liu, K. K. L. Wong, and N. Dey, “A survey of data mining and deep learning in bioinformatics,” Journal of Medical Systems, vol. 42, no. 8, Jun. 2018, doi: 10.1007/s10916-018-1003-9.
  16. A. Hernández-Blanco, B. Herrera-Flores, D. Tomás, and B. Navarro-Colorado, “A Systematic Review of Deep Learning Approaches to Educational Data Mining,” Complexity, vol. 2019, no. 1, Jan. 2019, doi: 10.1155/2019/1306039.
  17. J. Li, A. Sun, J. Han, and C. Li, “A Survey on Deep Learning for Named Entity Recognition,” IEEE Transactions on Knowledge and Data Engineering, vol. 34, no. 1, pp. 50–70, Jan. 2022, doi: 10.1109/tkde.2020.2981314.
  18. Z. Lu, W. Kim, and W. J. Wilbur, “Evaluating Relevance Ranking Strategies for MEDLINE Retrieval,” Journal of the American Medical Informatics Association, vol. 16, no. 1, pp. 32–36, Jan. 2009, doi: 10.1197/jamia.m2935.
  19. S. Maqsood, A. Shahid, F. Nazar, M. Asif, M. Ahmad, and M. Mazzara, “C-POS: A Context-Aware Adaptive Part-of-Speech Language Learning Framework,” IEEE Access, vol. 8, pp. 30720–30733, 2020, doi: 10.1109/access.2020.2972591.
  20. K. Adnan and R. Akbar, “An analytical study of information extraction from unstructured and multidimensional big data,” Journal of Big Data, vol. 6, no. 1, Oct. 2019, doi: 10.1186/s40537-019-0254-8.
  21. S. Dargan, M. Kumar, M. R. Ayyagari, and G. Kumar, “A Survey of Deep Learning and Its Applications: A New Paradigm to Machine Learning,” Archives of Computational Methods in Engineering, vol. 27, no. 4, pp. 1071–1092, Jun. 2019, doi: 10.1007/s11831-019-09344-w.
  22. M. Khalil, A. S. McGough, Z. Pourmirza, M. Pazhoohesh, and S. Walker, “Machine Learning, Deep Learning and Statistical Analysis for forecasting building energy consumption — A systematic review,” Engineering Applications of Artificial Intelligence, vol. 115, p. 105287, Oct. 2022, doi: 10.1016/j.engappai.2022.105287.
  23. G. Litjens et al., “A survey on deep learning in medical image analysis,” Medical Image Analysis, vol. 42, pp. 60–88, Dec. 2017, doi: 10.1016/j.media.2017.07.005.
  24. A. Sivaram, L. Das, and V. Venkatasubramanian, “Hidden representations in deep neural networks: Part 1. Classification problems,” Computers & Chemical Engineering, vol. 134, p. 106669, Mar. 2020, doi: 10.1016/j.compchemeng.2019.106669.
  25. P. P. Brahma, D. Wu, and Y. She, “Why Deep Learning Works: A Manifold Disentanglement Perspective,” IEEE Transactions on Neural Networks and Learning Systems, vol. 27, no. 10, pp. 1997–2008, Oct. 2016, doi: 10.1109/tnnls.2015.2496947.
  26. M. E. Mavroforakis and S. Theodoridis, “A geometric approach to Support Vector Machine (SVM) classification,” IEEE Transactions on Neural Networks, vol. 17, no. 3, pp. 671–682, May 2006, doi: 10.1109/tnn.2006.873281.
  27. N. Friedman, M. Linial, I. Nachman, and D. Pe’er, “Using Bayesian Networks to Analyze Expression Data,” Journal of Computational Biology, vol. 7, no. 3–4, pp. 601–620, Aug. 2000, doi: 10.1089/106652700750050961.
  28. J. R. Quinlan, “Induction of Decision Trees,” Machine Learning, vol. 1, no. 1, pp. 81–106, Mar. 1986, doi: 10.1023/a:1022643204877.
  29. B. Fitzgerald and D. Howcroft, “Towards Dissolution of the is Research Debate: From Polarization to Polarity,” Journal of Information Technology, vol. 13, no. 4, pp. 313–326, Dec. 1998, doi: 10.1177/026839629801300409.
  30. M. Taboada, J. Brooke, M. Tofiloski, K. Voll, and M. Stede, “Lexicon-Based Methods for Sentiment Analysis,” Computational Linguistics, vol. 37, no. 2, pp. 267–307, Jun. 2011, doi: 10.1162/coli_a_00049.
  31. J. A. Balazs and J. D. Velásquez, “Opinion Mining and Information Fusion: A survey,” Information Fusion, vol. 27, pp. 95–110, Jan. 2016, doi: 10.1016/j.inffus.2015.06.002.
  32. M. Ebrahimi, A. H. Yazdavar, and A. Sheth, “Challenges of Sentiment Analysis for Dynamic Events,” IEEE Intelligent Systems, vol. 32, no. 5, pp. 70–75, Sep. 2017, doi: 10.1109/mis.2017.3711649.
  33. E. Hajic̈ova, “A Natural history of negation,” Lingua, vol. 82, no. 2–3, pp. 253–261, Nov. 1990, doi: 10.1016/0024-3841(90)90064-r.
  34. D. Antonakaki, P. Fragopoulou, and S. Ioannidis, “A survey of Twitter research: Data model, graph structure, sentiment analysis and attacks,” Expert Systems with Applications, vol. 164, p. 114006, Feb. 2021, doi: 10.1016/j.eswa.2020.114006.
  35. C. Gerlitz and B. Rieder, “Mining One Percent of Twitter: Collections, Baselines, Sampling,” M/C Journal, vol. 16, no. 2, Mar. 2013, doi: 10.5204/mcj.620.
  36. D. Khurana, A. Koli, K. Khatter, and S. Singh, “Natural language processing: state of the art, current trends and challenges,” Multimedia Tools and Applications, vol. 82, no. 3, pp. 3713–3744, Jul. 2022, doi: 10.1007/s11042-022-13428-4.
  37. T. Winograd, “Understanding natural language,” Cognitive Psychology, vol. 3, no. 1, pp. 1–191, Jan. 1972, doi: 10.1016/0010-0285(72)90002-3.
  38. M. A. K. Halliday, “Towards a language-based theory of learning,” Linguistics and Education, vol. 5, no. 2, pp. 93–116, Jan. 1993, doi: 10.1016/0898-5898(93)90026-7.
  39. Z. Huang and W. Zhao, “Combination of ELMo Representation and CNN Approaches to Enhance Service Discovery,” IEEE Access, vol. 8, pp. 130782–130796, 2020, doi: 10.1109/access.2020.3009393.
  40. M. Wankhade, A. C. S. Rao, and C. Kulkarni, “A survey on sentiment analysis methods, applications, and challenges,” Artificial Intelligence Review, vol. 55, no. 7, pp. 5731–5780, Feb. 2022, doi: 10.1007/s10462-022-10144-1.
  41. B. N. D. Santos, R. M. Marcacini, and S. O. Rezende, “Multi-Domain Aspect Extraction Using Bidirectional Encoder Representations From Transformers,” IEEE Access, vol. 9, pp. 91604–91613, 2021, doi: 10.1109/access.2021.3089099.
  42. R. Adams, S. Jeanrenaud, J. Bessant, D. Denyer, and P. Overy, “Sustainability‐oriented Innovation: A Systematic Review,” International Journal of Management Reviews, vol. 18, no. 2, pp. 180–205, May 2015, doi: 10.1111/ijmr.12068.
  43. D. Boyd, S. Golder, and G. Lotan, “Tweet, Tweet, Retweet: Conversational Aspects of Retweeting on Twitter,” 2010 43rd Hawaii International Conference on System Sciences, pp. 1–10, Jan. 2010, doi: 10.1109/hicss.2010.412.
  44. C. Trapnell et al., “Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks,” Nature Protocols, vol. 7, no. 3, pp. 562–578, Mar. 2012, doi: 10.1038/nprot.2012.016.
  45. J. Lee and K. E. Brown, “‘Make Korea with America Great Again’: An Articulation and Assemblage of South Korean Extreme Right Practices,” Communication, Culture and Critique, vol. 11, no. 1, pp. 53–66, Mar. 2018, doi: 10.1093/ccc/tcx004.
  46. Z. Jianqiang, G. Xiaolin, and Z. Xuejun, “Deep Convolution Neural Networks for Twitter Sentiment Analysis,” IEEE Access, vol. 6, pp. 23253–23260, 2018, doi: 10.1109/access.2017.2776930.
  47. T. Vaiyapuri et al., “Sustainable Artificial Intelligence-Based Twitter Sentiment Analysis on COVID-19 Pandemic,” Sustainability, vol. 15, no. 8, p. 6404, Apr. 2023, doi: 10.3390/su15086404.
  48. Y. M. González, “Reforming to Avoid Reform: Strategic Policy Substitution and the Reform Gap in Policing,” Perspectives on Politics, vol. 21, no. 1, pp. 59–77, Mar. 2022, doi: 10.1017/s1537592722000135.
  49. D. J. Hemel and E. A. Posner, “Presidential Obstruction of Justice,” SSRN Electronic Journal, 2017, doi: 10.2139/ssrn.3004876.
  50. S. R. Cotten and S. S. Gupta, “Characteristics of online and offline health information seekers and factors that discriminate between them,” Social Science & Medicine, vol. 59, no. 9, pp. 1795–1806, Nov. 2004, doi: 10.1016/j.socscimed.2004.02.020.
  51. S. Valenzuela, “Unpacking the Use of Social Media for Protest Behavior,” American Behavioral Scientist, vol. 57, no. 7, pp. 920–942, Mar. 2013, doi: 10.1177/0002764213479375.
  52. I. Dylko, I. Dolgov, W. Hoffman, N. Eckhart, M. Molina, and O. Aaziz, “The dark side of technology: An experimental investigation of the influence of customizability technology on online political selective exposure,” Computers in Human Behavior, vol. 73, pp. 181–190, Aug. 2017, doi: 10.1016/j.chb.2017.03.031.

CRediT Author Statement


The author reviewed the results and approved the final version of the manuscript.


Acknowledgements


Author(s) thanks to Korea Institute for Advancement of Technology (KIAT) for research lab and equipment support.


Funding


This research was supported by the Korea Institute for Advancement of Technology (KIAT) funded by the Ministry of Trade, Industry and Energy (MOTIE), Republic of Korea.


Ethics declarations


Conflict of interest

The authors have no conflicts of interest to declare that are relevant to the content of this article.


Availability of data and materials


No data available for above study.


Author information


Contributions

All authors have equal contribution in the paper and all authors have read and agreed to the published version of the manuscript.


Corresponding author


Rights and permissions


Open Access This article is licensed under a Creative Commons Attribution NoDerivs is a more restrictive license. It allows you to redistribute the material commercially or non-commercially but the user cannot make any changes whatsoever to the original, i.e. no derivatives of the original work. To view a copy of this license, visit https://creativecommons.org/licenses/by-nc-nd/4.0/


Cite this article


Seo jun, “Analyzing the Interplay Between Social Media Sentiment and Traditional Public Opinion in Politics”, Journal of Enterprise and Business Intelligence, vol.5, no.4, pp. 187-197, October 2025. doi: 10.53759/5181/JEBI202505019.


Copyright


© 2025 Seo jun. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.