TY - JOUR
T1 - Two-layered Blogger identification model integrating profile and instance-based methods
AU - Mohtasseb, H.
AU - Ahmed, A.
PY - 2012/4/20
Y1 - 2012/4/20
N2 - This paper introduces a two-layered framework that improves the result of authorship identification within larger sample numbers of bloggers as compared with earlier work. Previous studies are mainly divided into two categories: profile-based and instance-based methods. Each of these approaches has its advantages and limitations. The two-layered framework presented here integrates the two previous approaches and presents a new solution to a key problem in authorship identification, namely the drop in accuracy experienced as the number of authors increases. The paper begins by illustrating the regular instance-based core model and the investigated features. It then introduces a new psycholinguistic profile representation of authors, presents similarity grouping extraction over profiles, and applies blogger identification utilizing the two-layered approach. The results confirm the improvement introduced by the proposed two-layered approach against our regular classifier, as well as a selected baseline, for an extended number of users.
AB - This paper introduces a two-layered framework that improves the result of authorship identification within larger sample numbers of bloggers as compared with earlier work. Previous studies are mainly divided into two categories: profile-based and instance-based methods. Each of these approaches has its advantages and limitations. The two-layered framework presented here integrates the two previous approaches and presents a new solution to a key problem in authorship identification, namely the drop in accuracy experienced as the number of authors increases. The paper begins by illustrating the regular instance-based core model and the investigated features. It then introduces a new psycholinguistic profile representation of authors, presents similarity grouping extraction over profiles, and applies blogger identification utilizing the two-layered approach. The results confirm the improvement introduced by the proposed two-layered approach against our regular classifier, as well as a selected baseline, for an extended number of users.
UR - http://www.scopus.com/inward/record.url?eid=2-s2.0-85027950863&partnerID=MN8TOARS
U2 - 10.1007/s10115-011-0398-0
DO - 10.1007/s10115-011-0398-0
M3 - Article (journal)
SN - 0219-1377
VL - 31
SP - 1
EP - 21
JO - Knowledge and Information Systems
JF - Knowledge and Information Systems
IS - 1
ER -