Abstract
Smoking is one of the leading causes of preventable death, being responsible for about six million deaths annually worldwide. Most smokers want to quit, but many find quitting difficult. The Internet enables people interested in quitting smoking to connect with others via online communities; however, the characteristics of these discussions are not well understood. This work aims to explore the textual cues of an online community interested in quitting smoking: www.reddit.com/r/stopsmoking – “a place for redditors to motivate each other to quit smoking”. A total of approximately 5,000 posts were randomly selected from the community. Four subgroups of posts based on the cessation days of abstainers were defined: S0: within the first week, S1: within the first month (excluding cohort S0), S2: from second month to one year, and S3: beyond one year. Psycho-linguistic features and content topics were extracted from the posts and analysed. Machine learning techniques were used to discriminate the online conversations in the first week S0 from the other subgroups. Topics and psycho-linguistic features were found to be highly valid predictors of the subgroups, possibly providing an important step in understanding social media and its use in studies of smoking and other addictions in online settings.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
The list of all topics can be accessed via http://bit.ly/21Z0o4r.
References
Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent Dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003)
Chen, A.T., Zhu, S.-H., Conway, M.: What online communities can tell us about electronic cigarettes and hookah use: a study using text mining and visualization techniques. J. Med. Internet Res. 17(9), e220 (2015)
Friedman, J., Hastie, T., Tibshirani, R.: Regularization paths for generalized linear models via coordinate descent. J. Stat. Softw. 33(1), 1 (2010)
Kumar, M., Dredze, M., Coppersmith, G., De Choudhury, M.: Detecting changes in suicide content manifested in social media following celebrity suicides. In: Proceedings of the ACM Conference on Hypertext & Social Media, pp. 85–94 (2015)
Nguyen, T., Phung, D., Adams, B., Venkatesh, S.: A sentiment-aware approach to community formation in social media. In: Proceedings of the International AAAI Conference on Weblogs and Social Media, pp. 527–530 (2012)
Nguyen, T., Phung, D., Venkatesh, S.: Analysis of psycholinguistic processes and topics in online autism communities. In: Proceedings of the IEEE International Conference on Multimedia and Expo, pp. 1–6 (2013)
Pennebaker, J.W., Francis, M.E., Booth, R.J.: Linguistic Inquiry and Word Count (LIWC) [Computer software]. LIWC Inc. (2007)
Tamersoy, A., De Choudhury, M., Chau, D.H.: Characterizing smoking and drinking abstinence from social media. In: Proceedings of the ACM Conference on Hypertext & Social Media, pp. 139–148 (2015)
Acknowledgment
This work is partially supported by the Telstra-Deakin Centre of Excellence in Big Data and Machine Learning.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing AG
About this paper
Cite this paper
Nguyen, T., Borland, R., Yearwood, J., Yong, HH., Venkatesh, S., Phung, D. (2016). Discriminative Cues for Different Stages of Smoking Cessation in Online Community. In: Cellary, W., Mokbel, M., Wang, J., Wang, H., Zhou, R., Zhang, Y. (eds) Web Information Systems Engineering – WISE 2016. WISE 2016. Lecture Notes in Computer Science(), vol 10042. Springer, Cham. https://doi.org/10.1007/978-3-319-48743-4_12
Download citation
DOI: https://doi.org/10.1007/978-3-319-48743-4_12
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-48742-7
Online ISBN: 978-3-319-48743-4
eBook Packages: Computer ScienceComputer Science (R0)