Twitter user geolocation method based on single-point toponym matching and local toponym filtering

The availability of accurate toponyms in user tweets is crucial for geolocating Twitter users.However, existing methods for locating Twitter users often suffer from limited quantity and reliability of acquired toponyms, thus impacting the accuracy of user geolocation.To address this issue, a twitter...

Full description

Saved in:
Bibliographic Details
Main Authors: Jin XUE, Fuxiang YUAN, Yimin LIU, Meng ZHANG, Yaqiong QIAO, Xiangyang LUO
Format: Article
Language:English
Published: POSTS&TELECOM PRESS Co., LTD 2023-08-01
Series:网络与信息安全学报
Subjects:
Online Access:http://www.cjnis.com.cn/thesisDetails#10.11959/j.issn.2096-109x.2023053
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1841529663513427968
author Jin XUE
Fuxiang YUAN
Yimin LIU
Meng ZHANG
Yaqiong QIAO
Xiangyang LUO
author_facet Jin XUE
Fuxiang YUAN
Yimin LIU
Meng ZHANG
Yaqiong QIAO
Xiangyang LUO
author_sort Jin XUE
collection DOAJ
description The availability of accurate toponyms in user tweets is crucial for geolocating Twitter users.However, existing methods for locating Twitter users often suffer from limited quantity and reliability of acquired toponyms, thus impacting the accuracy of user geolocation.To address this issue, a twitter user geolocation method based on single-point toponym matching and local toponym filtering was proposed.A toponym type discriminating algorithm based on the aggregation degree of locations of the toponym was designed.In the proposed algorithm, a single-point toponym database was generated to provide more reliable toponyms extracted from tweets.Then, according to a proposed local place name filtering algorithm based on the aggregation degree of user location, the aggregation degree of user location centered on the longitude and latitude of toponyms and the average longitude and latitude of users were calculated.This process helped in extracting local toponyms with a high aggregation degree, which enhances the reliability of toponyms used in geolocation.Finally, a user-toponym heterogeneous graph was constructed based on user social relationships and user mentions of toponyms, and users were located by graph representation learning and neural networks.A large number of user geolocation experiments were conducted based on two commonly used public datasets in this field, namely GEOTEXT and TW-US.Comparisons with nine existing typical methods for Twitter user geolocation, including HGNN, ReLP, and GCN, demonstrate that our proposed method achieves significantly higher geolocation accuracy.On the GEOTEXT dataset, the average error is reduced by 7.3~342.8 km, the median error is reduced by 2.4~354.4 km, and the accuracy of large area-level geolocation is improved by 1.3%~26.3%.On the TW-US dataset, the average error is reduced by 8.6~246.6 km, the median error is reduced by 5.7~149.7 km, and the accuracy of large area-level geolocation is improved by 1.5%~20.5%.
format Article
id doaj-art-f242fdb922104bfd8bae0873b3c73815
institution Kabale University
issn 2096-109X
language English
publishDate 2023-08-01
publisher POSTS&TELECOM PRESS Co., LTD
record_format Article
series 网络与信息安全学报
spelling doaj-art-f242fdb922104bfd8bae0873b3c738152025-01-15T03:16:43ZengPOSTS&TELECOM PRESS Co., LTD网络与信息安全学报2096-109X2023-08-019536359579210Twitter user geolocation method based on single-point toponym matching and local toponym filteringJin XUEFuxiang YUANYimin LIUMeng ZHANGYaqiong QIAOXiangyang LUOThe availability of accurate toponyms in user tweets is crucial for geolocating Twitter users.However, existing methods for locating Twitter users often suffer from limited quantity and reliability of acquired toponyms, thus impacting the accuracy of user geolocation.To address this issue, a twitter user geolocation method based on single-point toponym matching and local toponym filtering was proposed.A toponym type discriminating algorithm based on the aggregation degree of locations of the toponym was designed.In the proposed algorithm, a single-point toponym database was generated to provide more reliable toponyms extracted from tweets.Then, according to a proposed local place name filtering algorithm based on the aggregation degree of user location, the aggregation degree of user location centered on the longitude and latitude of toponyms and the average longitude and latitude of users were calculated.This process helped in extracting local toponyms with a high aggregation degree, which enhances the reliability of toponyms used in geolocation.Finally, a user-toponym heterogeneous graph was constructed based on user social relationships and user mentions of toponyms, and users were located by graph representation learning and neural networks.A large number of user geolocation experiments were conducted based on two commonly used public datasets in this field, namely GEOTEXT and TW-US.Comparisons with nine existing typical methods for Twitter user geolocation, including HGNN, ReLP, and GCN, demonstrate that our proposed method achieves significantly higher geolocation accuracy.On the GEOTEXT dataset, the average error is reduced by 7.3~342.8 km, the median error is reduced by 2.4~354.4 km, and the accuracy of large area-level geolocation is improved by 1.3%~26.3%.On the TW-US dataset, the average error is reduced by 8.6~246.6 km, the median error is reduced by 5.7~149.7 km, and the accuracy of large area-level geolocation is improved by 1.5%~20.5%.http://www.cjnis.com.cn/thesisDetails#10.11959/j.issn.2096-109x.2023053user geolocationuser-generated texttoponymsocial media
spellingShingle Jin XUE
Fuxiang YUAN
Yimin LIU
Meng ZHANG
Yaqiong QIAO
Xiangyang LUO
Twitter user geolocation method based on single-point toponym matching and local toponym filtering
网络与信息安全学报
user geolocation
user-generated text
toponym
social media
title Twitter user geolocation method based on single-point toponym matching and local toponym filtering
title_full Twitter user geolocation method based on single-point toponym matching and local toponym filtering
title_fullStr Twitter user geolocation method based on single-point toponym matching and local toponym filtering
title_full_unstemmed Twitter user geolocation method based on single-point toponym matching and local toponym filtering
title_short Twitter user geolocation method based on single-point toponym matching and local toponym filtering
title_sort twitter user geolocation method based on single point toponym matching and local toponym filtering
topic user geolocation
user-generated text
toponym
social media
url http://www.cjnis.com.cn/thesisDetails#10.11959/j.issn.2096-109x.2023053
work_keys_str_mv AT jinxue twitterusergeolocationmethodbasedonsinglepointtoponymmatchingandlocaltoponymfiltering
AT fuxiangyuan twitterusergeolocationmethodbasedonsinglepointtoponymmatchingandlocaltoponymfiltering
AT yiminliu twitterusergeolocationmethodbasedonsinglepointtoponymmatchingandlocaltoponymfiltering
AT mengzhang twitterusergeolocationmethodbasedonsinglepointtoponymmatchingandlocaltoponymfiltering
AT yaqiongqiao twitterusergeolocationmethodbasedonsinglepointtoponymmatchingandlocaltoponymfiltering
AT xiangyangluo twitterusergeolocationmethodbasedonsinglepointtoponymmatchingandlocaltoponymfiltering