Big Data Cleaning Based on Improved CLOF and Random Forest for Distribution Networks
In order to improve the data quality, the big data cleaning method for distribution networks is studied in this paper. First, the Local Outlier Factor (LOF) algorithm based on DBSCAN clustering is used to detect outliers. However, due to the difficulty in determining the LOF threshold, a method of d...
Saved in:
Main Authors: | Jie Liu, Yijia Cao, Yong Li, Yixiu Guo, Wei Deng |
---|---|
Format: | Article |
Language: | English |
Published: |
China electric power research institute
2024-01-01
|
Series: | CSEE Journal of Power and Energy Systems |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/9299499/ |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
Enhancing imputation accuracy for catch-all missing data mechanisms with DFBETAS and leverage
by: Fares Qeadan, et al.
Published: (2025-12-01) -
Application of random forest in big data completion
by: Zheng WANG, et al.
Published: (2016-12-01) -
Quantum Circuit for Imputation of Missing Data
by: Claudio Sanavio, et al.
Published: (2024-01-01) -
The Performance of Multiple Imputation in Social Surveys with Missing Data from Planned Missingness and Item Nonresponse
by: Julian B. Axenfeld, et al.
Published: (2024-08-01) -
Machine Learning Based Method for Insurance Fraud Detection on Class Imbalance Datasets With Missing Values
by: Ahmed A. Khalil, et al.
Published: (2024-01-01)