Statistical Analysis of Classification Algorithms for Predicting Socioeconomic Status of Twitter Users

Public Deposited

Resource Type

Creator

Abstract

This study compares a series of well-known statistical machine learning techniques for classifying users of the online social network (OSN) Twitter by socioeconomic status (upper, middle, or lower). Five classification algorithms are employed: Logistic Regression, Support Vector Machine (SVM), Naïve Bayes (NB), k-Nearest Neighbors, and Decision Tree. They are applied to a high-dimensional data set extracted from the users’ platform-based and profile-based behavior on Twitter. The algorithms are investigated theoretically and evaluated experimentally in terms of four performance measures: accuracy, precision, recall, and AUC. Ensemble methods are employed to improve the performance of these algorithms. MANOVA is used to examine whether the algorithms' performance measures differ significantly, and ANOVA is used to analyze the differences among the classifiers for each performance measure. The analyses indicate a significant difference among these algorithms; both SVM and NB achieve good performance on our high-dimensional OSN data.
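
The per-measure ANOVA step described in the abstract can be illustrated with a minimal sketch. It assumes each classifier has been scored on the same set of cross-validation folds and computes a one-way ANOVA F statistic for a single measure (accuracy). The function name `one_way_anova_f` and every number in `fold_accuracy` are hypothetical placeholders chosen for illustration; they are not results or code from the thesis, and the thesis may have used a different ANOVA design.

```python
from statistics import mean


def one_way_anova_f(groups):
    """Return (F, df_between, df_within) for a one-way ANOVA.

    `groups` is a list of lists; each inner list holds the scores of one
    classifier (for example, its accuracy on each cross-validation fold).
    """
    k = len(groups)
    n_total = sum(len(g) for g in groups)
    grand_mean = sum(sum(g) for g in groups) / n_total

    # Variation of the group means around the grand mean.
    ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups)
    # Variation of individual scores around their own group mean.
    ss_within = sum((x - mean(g)) ** 2 for g in groups for x in g)

    df_between = k - 1
    df_within = n_total - k
    f_stat = (ss_between / df_between) / (ss_within / df_within)
    return f_stat, df_between, df_within


# HYPOTHETICAL per-fold accuracies, invented purely for illustration.
fold_accuracy = {
    "Logistic Regression": [0.71, 0.69, 0.72, 0.70, 0.68],
    "SVM": [0.78, 0.80, 0.77, 0.79, 0.81],
    "Naive Bayes": [0.76, 0.79, 0.78, 0.77, 0.80],
    "k-Nearest Neighbors": [0.64, 0.66, 0.63, 0.65, 0.67],
    "Decision Tree": [0.67, 0.65, 0.69, 0.66, 0.68],
}

f_stat, df_b, df_w = one_way_anova_f(list(fold_accuracy.values()))
print(f"F({df_b}, {df_w}) = {f_stat:.2f}")
```

A large F value relative to the F distribution with the reported degrees of freedom suggests that at least one classifier's mean accuracy differs from the others. The same computation would be repeated for precision, recall, and AUC, while MANOVA tests all four measures jointly.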

Subject

Language

Publisher

Thesis Degree Level

Thesis Degree Name

Thesis Degree Discipline

Identifier

Rights Notes

Copyright © 2017 the author(s). Theses may be used for non-commercial research, educational, or related academic purposes only. Such uses include personal study, research, scholarship, and teaching. Theses may only be shared by linking to Carleton University Institutional Repository and no part may be used without proper attribution to the author. No part may be used for commercial purposes directly or indirectly via a for-profit platform; no adaptation or derivative works are permitted without consent from the copyright owner.

Date Created

Relations

In Collection:

Title	Date Uploaded	Visibility
zhou-statisticalanalysisofclassificationalgorithms.pdf	2023-05-05	Public