Korbicz, Józef (1951- ) - red. ; Uciński, Dariusz - red.
Despite the rapid growth of other types of social media, Internet discussion forums remain a highly popular communication channel and a useful source of text data for analyzing user interests and sentiments. Being suited to richer, deeper, and longer discussions than microblogging services, they particularly well reflect topics of long-term, persisting involvement and areas of specialized knowledge or experience. Discovering and characterizing such topics and areas by text mining algorithms is therefore an interesting and useful research direction. ; This work presents a case study in which selected classification algorithms are applied to posts from a Polish discussion forum devoted to psychoactive substances received from home-grown plants, such as hashish or marijuana. The utility of two different vector text representations is examined: the simple bag of words representation and the more refined embedded global vectors one. ; While the former is found to work well for the multinomial naive Bayes algorithm, the latter turns out more useful for other classification algorithms: logistic regression, SVMs, and random forests. The obtained results suggest that post-classification can be applied for measuring publication intensity of particular topics and, in the case of forums related to psychoactive substances, for monitoring the risk of drug-related crime.
Zielona Góra: Uniwersytet Zielonogórski
AMCS, volume 28, number 4 (2018) ; click here to follow the link
Biblioteka Uniwersytetu Zielonogórskiego
Jul 14, 2025
Jul 9, 2025
34
https://zbc.uz.zgora.pl/repozytorium/publication/100899
| Edition name | Date |
|---|---|
| A case study in text mining of discussion forum posts: Classification with bag of words and global vectors | Jul 14, 2025 |
Cichosz, Paweł Korbicz, Józef (1951- ) - red. Uciński, Dariusz - red.
Sajdak, Anna Magda-Adamowicz, Marzenna - red. Pasterniak-Kobyłecka, Ewa - red.
Ganán, David Caballé, Santi Conesa, Jordi Conesa, Fatos Korbicz, Józef (1951- ) - red. Uciński, Dariusz - red.
Zhang, Weimin Zhou, Luyao Shao, Min Wang, Cui Wang, Yu Woźniak, Marcin - ed. Kumar, Yogesh - ed. Ijaz, Muhammad Fazal - ed.
Cichosz, Paweł Pawełczak, Łukasz Abaev, Pavel - ed. Razumchik, Rostislav - ed. Kołodziej, Joanna - ed.