Naver News: South Korean online news platform operated by Naver Corporation, launched in 1998 (news service 2000-present; open news search covers 1990-present)
Big KINDS: Integrated database of most major newspapers published in South Korea, 1990-present
Text Mining of Naver News
Search term: 메타버스 (Metaverse)
Web scraping using R (a Naver API is also available)
Sentiment analysis and word network
10,677 news articles after removing duplicates (Jan. to Apr. 2023)
9,525 news articles after removing articles shorter than 500 characters (n = 1,152 removed)
Article length: min = 500 characters, max = 13,634 characters
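The two filtering steps above (dropping duplicates, then dropping articles under 500 characters) can be sketched in base R. This is only an illustration on a made-up toy data frame: the column names `title` and `body` and the rule "duplicate = same title and body" are assumptions, not details given in the slides.

```r
# Hypothetical toy corpus; column names `title` and `body` are assumptions.
news <- data.frame(
  title = c("A", "A", "B", "C"),
  body  = c(strrep("a", 600), strrep("a", 600), strrep("b", 120), strrep("c", 500)),
  stringsAsFactors = FALSE
)

# Step 1: drop exact duplicates (assumed here to mean same title and same body).
deduped <- news[!duplicated(news[, c("title", "body")]), ]

# Step 2: keep only articles with at least 500 characters.
min_chars <- 500
kept <- deduped[nchar(deduped$body, type = "chars") >= min_chars, ]

# Number of short articles removed, and the length range of what remains.
n_removed_short <- nrow(deduped) - nrow(kept)
length_range <- range(nchar(kept$body, type = "chars"))
```

On the real corpus, the same bookkeeping would correspond to 10,677 articles after step 1, 9,525 after step 2, and 1,152 removed as too short.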
Pre-Process and Tokenize Texts (News Articles)
R version 4.3.0
Among words with at least 30 occurrences within each news article
Correlation of at least .12 with “Cybersecurity”
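For reference, when widyr's pairwise_cor() is called without a value column, it treats each word as simply present or absent in an article, so the correlation it reports is the phi coefficient. Assuming that default usage (the slides do not say whether a value column was supplied), the correlation between two words X and Y is:

```latex
\phi = \frac{n_{11}\, n_{00} - n_{10}\, n_{01}}
            {\sqrt{n_{1\cdot}\; n_{0\cdot}\; n_{\cdot 1}\; n_{\cdot 0}}}
```

Here n_{11} is the number of articles containing both words, n_{00} the number containing neither, n_{10} and n_{01} the numbers containing only X or only Y, and the dotted terms are the row and column totals (for example, n_{1\cdot} = n_{11} + n_{10}).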
library(tidytext): unnest_tokens() to transform text data into a “tidy” format
library(KoNLP): SimplePos09() to tokenize each into morphemes (the smallest meaningful unit)
library(widyr): pairwise_cor() to find correlations of words associated with “Cybersecurity”
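The tokenize, filter, and correlate steps can be sketched end to end with tidytext and widyr. This is a minimal illustration under stated assumptions, not the analysis itself: the toy corpus is made up and in English, the columns `id` and `text` are hypothetical, the frequency threshold is lowered from 30 to 2 so the toy data survives it, and the threshold is applied to corpus-wide counts. Morpheme tokenization with KoNLP's SimplePos09() is left out because it needs a Java setup and Korean input; unnest_tokens() with its default word tokenizer stands in for it.

```r
library(dplyr)
library(tidytext)
library(widyr)

# Hypothetical toy corpus (English stand-in for the Korean news articles).
news <- tibble(
  id = 1:6,
  text = c(
    "metaverse cybersecurity privacy",
    "metaverse cybersecurity privacy platform",
    "metaverse platform game",
    "metaverse game education",
    "cybersecurity privacy hacking",
    "metaverse education platform"
  )
)

# Tidy format: one row per (article, word). Tokens are lowercased by default.
tokens <- news %>%
  unnest_tokens(output = word, input = text)

min_count <- 2     # the slides use 30; lowered for the toy data
min_cor   <- 0.12  # threshold from the slides

word_cors <- tokens %>%
  add_count(word, name = "n_total") %>%   # corpus-wide count per word
  filter(n_total >= min_count) %>%        # drop rare words
  distinct(id, word) %>%                  # presence/absence per article
  pairwise_cor(item = word, feature = id) # binary (phi) correlation

# Words whose correlation with "cybersecurity" meets the threshold.
cyber_cors <- word_cors %>%
  filter(item1 == "cybersecurity", correlation >= min_cor) %>%
  arrange(desc(correlation))
```

The resulting `cyber_cors` table (columns `item1`, `item2`, `correlation`) is the kind of input a word network plot would take, with one edge per retained word pair.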