Please use this identifier to cite or link to this item: http://hdl.handle.net/2445/52824
Full metadata record
DC FieldValueLanguage
dc.contributor.authorTapiador, D.-
dc.contributor.authorO'Mullane, William-
dc.contributor.authorBrown, A. G. A.-
dc.contributor.authorLuri Carrascoso, Xavier-
dc.contributor.authorHuedo, E.-
dc.contributor.authorOsuna, P.-
dc.date.accessioned2014-03-24T08:27:59Z-
dc.date.available2014-03-24T08:27:59Z-
dc.date.issued2014-05-13-
dc.identifier.issn0010-4655-
dc.identifier.urihttp://hdl.handle.net/2445/52824-
dc.description.abstractThe European Space Agency's Gaia mission will create the largest and most precise three dimensional chart of our galaxy (the Milky Way), by providing unprecedented position, parallax, proper motion, and radial velocity measurements for about one billion stars. The resulting catalogue will be made available to the scientific community and will be analyzed in many different ways, including the production of a variety of statistics. The latter will often entail the generation of multidimensional histograms and hypercubes as part of the precomputed statistics for each data release, or for scientific analysis involving either the final data products or the raw data coming from the satellite instruments. In this paper we present and analyze a generic framework that allows the hypercube generation to be easily done within a MapReduce infrastructure, providing all the advantages of the new Big Data analysis paradigmbut without dealing with any specific interface to the lower level distributed system implementation (Hadoop). Furthermore, we show how executing the framework for different data storage model configurations (i.e. row or column oriented) and compression techniques can considerably improve the response time of this type of workload for the currently available simulated data of the mission. In addition, we put forward the advantages and shortcomings of the deployment of the framework on a public cloud provider, benchmark against other popular solutions available (that are not always the best for such ad-hoc applications), and describe some user experiences with the framework, which was employed for a number of dedicated astronomical data analysis techniques workshops.-
dc.format.extent11 p.-
dc.format.mimetypeapplication/pdf-
dc.language.isoeng-
dc.publisherElsevier B.V.-
dc.relation.isformatofVersió postprint del document publicat a: http://dx.doi.org/10.1016/j.cpc.2014.02.010-
dc.relation.ispartofComputer Physics Communications, 2014, vol. 185, num. 5, p. 1429-1438-
dc.relation.urihttp://dx.doi.org/10.1016/j.cpc.2014.02.010-
dc.rights(c) Elsevier B.V., 2014-
dc.sourceArticles publicats en revistes (Física Quàntica i Astrofísica)-
dc.subject.classificationDiagrames-
dc.subject.classificationMineria de dades-
dc.subject.classificationEstadística-
dc.subject.classificationComputació distribuïda-
dc.subject.otherDiagrams-
dc.subject.otherData mining-
dc.subject.otherStatistics-
dc.subject.otherComputational grids (Computer systems)-
dc.subject.otherVia Làctia-
dc.titleA Framework for Building Hypercubes Using MapReduce-
dc.typeinfo:eu-repo/semantics/article-
dc.typeinfo:eu-repo/semantics/acceptedVersion-
dc.identifier.idgrec633595-
dc.date.updated2014-03-24T08:27:59Z-
dc.rights.accessRightsinfo:eu-repo/semantics/openAccess-
Appears in Collections:Articles publicats en revistes (Física Quàntica i Astrofísica)

Files in This Item:
File Description SizeFormat 
633595.pdf1.17 MBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.