N.World
We are modernising information systems to bring the
information for citizens
NASTAT Project
The challenge
Nastat is the Statistical Institute of Navarre, the public body responsible for the collection, processing and dissemination of statistical data in the Autonomous Community of Navarre.

The aim has been to to cover the needs linked to the processing and dissemination of different statistical operations which, until now, were carried out manually by the agency's technicians.

The solution
Bosonit has implemented innovative technologies based on big data, cloud storage and more dynamic and attractive visualisations for the user.
01. Centralise information
02. Automate ETL processes
03. Bringing the data closer

The need arose to group all of Nastat's information through the design and implementation of a Big Data architecture in a cloud environment.

ETL processes were automated to reduce the manual work of data collection and processing, thus optimising resources.

The statistical dissemination processes were improved through the creation of visualisations using Business Intelligence tools to facilitate the interpretation of the data by the end user. Also, the back-end of the new website was developed to integrate all the elements.

01. Centralise information

The need arose to group all of Nastat's information through the design and implementation of a Big Data architecture in a cloud environment.

02. Automate ETL processes

ETL processes were automated to reduce the manual work of data collection and processing, thus optimising resources.

03. Bringing the data closer

The statistical dissemination processes were improved through the creation of visualisations using Business Intelligence tools to facilitate the interpretation of the data by the end user. Also, the back-end of the new website was developed to integrate all the elements.

Technologies used
  • Python for the integration of data with the HDFS system, both from Nastat files and from other files extracted by web scraping techniques.
  • Spark for the development of ETL processes through the PySpark API.
  • Hive for data aggregation, consultation and analysis.
  • Jenkins for the automation of the entire data lifecycle, from its collection to its final form.
  • Power BI to facilitate the dissemination of data to end-users through interactive visualisations.
  • Liferay for the creation of the web portal where all the information will be published.

?

The project has made it possible to create an application necessary to facilitate the work of Nastat members, thus promoting the digital transformation of public institutions and bringing data closer to citizens.

?