Covid-19 Data Analysis with Spark
github
Why

This project was part of the required coursework for the "Middleware Technologies for Distributed Systems" course at Polimi. It was an invaluable opportunity to learn Scala and Spark.

What

This projects analyzes a large Covid-19 dataset. I computed complex statistics on a large dataset with Spark

How

This project has been implemented in Scala with Spark as a data analysis engine.