Processat i visualització d'entorns big data

dc.contributor.advisorBiosca Trias, Enric
dc.contributor.authorBlesa Sierra, César
dc.date.accessioned2014-11-05T10:07:38Z
dc.date.available2014-11-05T10:07:38Z
dc.date.issued2014-06-20
dc.descriptionTreballs Finals de Grau d'Enginyeria Informàtica, Facultat de Matemàtiques, Universitat de Barcelona, Any: 2014, Director: Enric Biosca Triasca
dc.description.abstractThis project proposes the union of two concepts with a growing trend in the technology sector, such as Big Data and Business Intelligence, resulting in an interactive display to the user a set of public data. Making worth of Big Data and Business Intelligence tools for large volumes of data such as Cloudera based on the Hadoop framework, implementation of MapReduce distributed programming paradigm, developed by Google, plays an important role for its scalability and ease to parallelize one software. In addition we also make use of Pentaho BI Suite is a set of free programs to generate business intelligence (BI), including integrated reporting tools. For this reason they are used in this project. Starting from a large set of public data on Wikipedia about the searches performed daily, structured a traditional Business Intelligence architecture to treat, process and store the files mentioned above. This is the first part of the process. The second part of the process consists of extracting information that have previously downloaded and stored through the dataset files in order to display the information in a way that the user can draw their own conclusions, ie, making reports. Finally, the creation of dashboards that goes a step beyond the typical implementation of a display Business Intelligence, icomo are reporting. To achieve this purpose we will use most of the tools that allow programs Pentaho BI Suite with all that important information.ca
dc.format.extent70 p.
dc.format.mimetypeapplication/pdf
dc.identifier.urihttps://hdl.handle.net/2445/59410
dc.language.isospaca
dc.rightsmemòria: cc-by-sa (c) César Blesa Sierra, 2014
dc.rightscodi: GPL (c) César Blesa Sierra, 2014
dc.rights.accessRightsinfo:eu-repo/semantics/openAccessca
dc.rights.urihttp://creativecommons.org/licenses/by-sa/3.0/es
dc.rights.urihttp://www.gnu.org/licenses/gpl-3.0.ca.html
dc.sourceTreballs Finals de Grau (TFG) - Enginyeria Informàtica
dc.subject.classificationMineria de dadescat
dc.subject.classificationSistemes d'informació per a la gestiócat
dc.subject.classificationProgramaricat
dc.subject.classificationTreballs de fi de graucat
dc.subject.classificationGestió de la informacióca
dc.subject.classificationBases de dadesca
dc.subject.otherData miningeng
dc.subject.otherManagement information systemseng
dc.subject.otherComputer softwareeng
dc.subject.otherBachelor's theseseng
dc.subject.otherInformation resources managementeng
dc.subject.otherDatabaseseng
dc.titleProcessat i visualització d'entorns big dataca
dc.typeinfo:eu-repo/semantics/bachelorThesisca

Fitxers

Paquet original

Mostrant 1 - 2 de 2
Carregant...
Miniatura
Nom:
codi_font.zip
Mida:
52.25 MB
Format:
ZIP file
Descripció:
Codi font
Carregant...
Miniatura
Nom:
memoria.pdf
Mida:
4.08 MB
Format:
Adobe Portable Document Format
Descripció:
Memòria