Babatunde Kazeem Oladejo
Finding Records in Social Media: A Natural Language Processing Fundamentals Exploration
Authors
Oladejo, Babatunde Kazeem; Hadžidedić, Sunčica; Ganić, Emir
Contributors
Jasminka Hasic Telalovic (Editor)
Muhamed Kantardzic (Editor)
Abstract
Social media postings are now routinely used as proof of activities, events, or transactions in news media, academic institutions, governments, judicial courts, commerce, and various other organizations. The need to preserve social media content as records has drawn the interest of academic researchers, industry professionals, and policy makers. Despite the importance of this research area, selection of records from a pool of social media content remains an area of low research activity. This paper explores the use of Natural Language Processing (NLP) methods to classify and select records from a pool of tweets (Twitter social media content). We experiment with various characteristics of the data and NLP parameters with the goal of determining optimal parameters for training a supervised machine learning classifier. This paper can serve as an aid for understanding the fundamental elements of automating the selection of social media records.
Citation
Oladejo, B. K., Hadžidedić, S., & Ganić, E. (2021). Finding Records in Social Media: A Natural Language Processing Fundamentals Exploration. In J. Hasic Telalovic & M. Kantardzic (Eds.), Mediterranean Forum - Data Science Conference (pp. 151-164). Springer, Cham. https://doi.org/10.1007/978-3-030-72805-2_11
Field | Value |
---|---|
Online Publication Date | Apr 2, 2021 |
Publication Date | 2021 |
Deposit Date | Sep 7, 2021 |
Publicly Available Date | Sep 7, 2021 |
Pages | 151-164 |
Series Title | Communications in Computer and Information Science |
Series Number | 1343 |
Book Title | Mediterranean Forum - Data Science Conference |
ISBN | 9783030728045 |
DOI | https://doi.org/10.1007/978-3-030-72805-2_11 |
Public URL | https://durham-repository.worktribe.com/output/1653376 |
Files
Accepted Book Chapter (PDF, 632 Kb)
Copyright Statement
The final authenticated version is available online at https://doi.org/10.1007/978-3-030-72805-2_11