Multi-factor person entity relation extraction model based on distant supervision

Aiming at the problem that the basic assumption of distant supervision was too strong and easy to produce noise data,a model of the person entity relation extraction which could automatically filter the training data generated by distant supervision was proposed.For training data generation,the data...

Full description

Saved in:
Bibliographic Details
Main Authors: Yangchen HUANG, Yan JIA, Liang GAN, Jing XU, Jiuming HUANG, Zhonghe HE
Format: Article
Language:zho
Published: Editorial Department of Journal on Communications 2018-07-01
Series:Tongxin xuebao
Subjects:
Online Access:http://www.joconline.com.cn/zh/article/doi/10.11959/j.issn.1000-436x.2018118/
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Aiming at the problem that the basic assumption of distant supervision was too strong and easy to produce noise data,a model of the person entity relation extraction which could automatically filter the training data generated by distant supervision was proposed.For training data generation,the data produced by distant supervision would be filtered by multiple instance learning and the method of TF-IDF-based relation keyword detecting,which tried to make the training data has the manual annotation quality.Furthermore,the model combined lexical and syntactic features to extract the effective relation feature vector from two angles of words and semantics for classifier.The experiment results on large scale real-world datasets show that the proposed model outperforms other relation extraction methods which based on distant supervision.
ISSN:1000-436X