As I am currently spending a semester abroad, I am particularly interested in where Erasmus students come from and which countries they go to. To explore this, I use the dataset available at data.europa.eu, specifically the file titled:
“Erasmus+ Raw Mobility Data – Key Action 2”
This dataset provides information on all Erasmus+ mobilities since 2014. My project focuses on aggregating how many students have gone from each sending country to each receiving country over the years.
The dataset contains information on individual Erasmus+ mobilities, including the following relevant fields:
Only include long-term study mobility of pupils
Aggregate the total number of participants for each combination of Sending Country
and Receiving Country
, since all data since 2014 should be included.
Since the data is provided as an Excel XLSX file, which cannot be accessed directly from the internet via a URL in R, it would be great if you could include the following steps:
In order to answer the questions, visualize the following
create a heatmap, which highlights the intensity of mobility flows between countries. Each cell should represent the number of participants from one sending country to one receiving country. Apply a clear color scale to highlight intensity differences. You can choose whatever colors you want. The visualization could look like this (not bases on real data ./heatmap_example_datadraft_SvenjaPoerstel.PNG)
To further explore groupings and flow dynamics, create a Sankey flow diagram.
Generate a barplot to show the top 10 most popular receiving countries, the bars should be sorted in descending order by the number of incoming participants.