This project was part of the required coursework for the "Middleware Technologies for Distributed Systems" course at Polimi. It was an invaluable opportunity to learn Scala and Spark.
WhatThis projects analyzes a large Covid-19 dataset. I computed complex statistics on a large dataset with Spark
HowThis project has been implemented in Scala with Spark as a data analysis engine.