In this document we provide an overview on the development and the integration in ENEAGRID of some tools able to download data from Web (Web Crawling), manage and display a large amount of data (Big Data), and extract from data relevant hidden information (Data Mining). We collected all these instruments inside the so called Web Crawling Project. Further, the corresponding environment, called Virtual Laboratory, is able to offer the possibility to use all these integrated tools remotely and by a simple graphical interface. A detailed description of the developed web application will be illustrated. Finally, some experimental results on the behaviour of the Web Crawling tool will be reported. © Copyright 2017 by SCITEPRESS - Science and Technology Publications, Lda. All rights reserved.

A collaborative environment forweb crawling and web data analysis in ENEAGRID

Migliori, S.;Guarnieri, G.;Funel, A.;De Rosa, M.;Bracco, G.;Ambrosino, F.;Ponti, G.
2017

Abstract

In this document we provide an overview on the development and the integration in ENEAGRID of some tools able to download data from Web (Web Crawling), manage and display a large amount of data (Big Data), and extract from data relevant hidden information (Data Mining). We collected all these instruments inside the so called Web Crawling Project. Further, the corresponding environment, called Virtual Laboratory, is able to offer the possibility to use all these integrated tools remotely and by a simple graphical interface. A detailed description of the developed web application will be illustrated. Finally, some experimental results on the behaviour of the Web Crawling tool will be reported. © Copyright 2017 by SCITEPRESS - Science and Technology Publications, Lda. All rights reserved.
Data mining;Web crawling;Text data;Big data;Networking
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: http://hdl.handle.net/20.500.12079/5064
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
social impact