Detect Anomalies in Photovoltaic Systems Using Isolation Forest (Preliminary Results)

ENEA-IRIS Open Archive è l’archivio della produzione scientifica dell'ENEA, realizzato con l'obiettivo di raccogliere, catalogare e rendere facilmente accessibili in rete i risultati della ricerca. Gli autori dell’ENEA provvedono a depositare le proprie pubblicazioni (articoli su rivista, presentazioni a congressi, report, ecc.). In particolare, quelle finanziate dalla Commissione Europea nell’ambito del programma H2020 (che prevede il deposito obbligatorio in un Repository), una volta caricate, vengono automaticamente importate dal portale europeo OpenAIRE. È possibile inserire, o importare direttamente dalle banche dati previste, le informazioni descrittive del documento e anche allegare, ove consentito dalla normativa sul diritto d'autore, il testo completo della pubblicazione.

ENEA-IRIS Open Archive utilizza la piattaforma IRIS (Institutional Research Information System) sviluppata da CINECA.

Solar energy from Photovoltaic (PV) systems is one of the greatest growing renewable sources of energy. PV systems, although generally quite robust, are subject to failures that can adversely affect energy conversion or even pose safety concerns. Being able to promptly and automatically detect fails/anomalies is essential to improve PV systems reliability while maintaining the expected efficiency. In this paper are investigated five unsupervised Machine Learning (ML) methods for PV plants’ faults detection. The tested methods include Isolation Forest (IF), one-Class Support Vector Machines, Local Outlier Factors, Deep Learning Autoencoders, Gaussian Mixtures Models (GMM). Dealing with anomalies in PV means to handle a highly unbalanced dataset, as anomalies are relatively rare events, hopefully. Moreover, it is usually quite challenging to have precise labels for each fault event. For this reason in this paper are analysed only unsupervised ML models, that does not require labelled data. However, being able to quantify models’ performance precisely is a quite challenging task as it requires expert support or labels existence, even if they can be imprecise. To accurately represents the model’s performance for such a highly unbalanced dataset are reported metrics more suitable for such a task as balanced accuracy, recall, F1, Matthews Correlation Coefficient and Cohen Kappa. For the reason outlined above regarding the available labels, the results summarised in this paper can be considered as preliminary and require a more suitable dataset with precise labels. These preliminary results show that GMM could be highly effective to operate in the PV anomaly detection field; however, some models as IF and Autoencoders, that have proven to be very effective in different but demanding fields, deserve further investigation.

Detect Anomalies in Photovoltaic Systems Using Isolation Forest (Preliminary Results)

Ferlito S.;De Vito S.;Di Francia G.

2021-01-01

Abstract

Solar energy from Photovoltaic (PV) systems is one of the greatest growing renewable sources of energy. PV systems, although generally quite robust, are subject to failures that can adversely affect energy conversion or even pose safety concerns. Being able to promptly and automatically detect fails/anomalies is essential to improve PV systems reliability while maintaining the expected efficiency. In this paper are investigated five unsupervised Machine Learning (ML) methods for PV plants’ faults detection. The tested methods include Isolation Forest (IF), one-Class Support Vector Machines, Local Outlier Factors, Deep Learning Autoencoders, Gaussian Mixtures Models (GMM). Dealing with anomalies in PV means to handle a highly unbalanced dataset, as anomalies are relatively rare events, hopefully. Moreover, it is usually quite challenging to have precise labels for each fault event. For this reason in this paper are analysed only unsupervised ML models, that does not require labelled data. However, being able to quantify models’ performance precisely is a quite challenging task as it requires expert support or labels existence, even if they can be imprecise. To accurately represents the model’s performance for such a highly unbalanced dataset are reported metrics more suitable for such a task as balanced accuracy, recall, F1, Matthews Correlation Coefficient and Cohen Kappa. For the reason outlined above regarding the available labels, the results summarised in this paper can be considered as preliminary and require a more suitable dataset with precise labels. These preliminary results show that GMM could be highly effective to operate in the PV anomaly detection field; however, some models as IF and Autoencoders, that have proven to be very effective in different but demanding fields, deserve further investigation.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2021
			
	Codice ISBN
	
				978-3-030-69550-7
978-3-030-69551-4
			
	Parole chiave
	
				Anomaly detection
Autoencoders
Ensemble
Gaussian mixture models
Isolation forest
Photovoltaic systems
			
	Appare nelle tipologie:
	
				4.1 Contributo in Atti di convegno

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.12079/65957

Citazioni

ND

2

social impact