Last week I had the pleasure to attend the first "London DataDive" organised by DataKind UK; a non-profit organisation which aims to bring data scientists and social organisations together. The noble idea behind DataKind is that charities and public bodies often sit on a pile of data which might be valuable to our society, but that those organisations often lack the tools and resources needed to put these data to use.
The "DataDive" which I attended took place in the Mozilla Space in Central London and had 80 volunteer data scientists show up. On the first night, four charities were given the opportunity to present their problems and data sets, followed by a small socialising party. On the next day, a solid 12 hours of non-stop software hacking and data analysis ensued. Both The Guardian and The Economist posted a summary of the charities and the hack day projects involved.
I decided to join the Oxfam camp, which had brought along a nice data set on global food prices. These prices have seen sharp and sudden increases in recent years. Nobody is exactly sure why the price of food has become so high and volatile, but the impact on the well-being and security of people across poor and rural areas is significant.
With a small team I decided to focus on the question: "Does rainfall affect food prices in Kenya?"
Although the amount of data exploration that can happen in a single day is limited, we found evidence for three surprising conclusions:
- rainfall in Kenya has decreased by up to 50% since 1980, probably due to deforestation;
- the price of Maize in Kenya has risen by up to 400% since 2000;
- there was anomalous rainfall in Kenya in 1998, 2009 and 2010.
Interestingly, the anomalous rainfall in 2009 and 2010 coincided with spikes in the price of Maize, which is a crop known to be sensitive to changing rain patterns. However, many factors are involved, and hence a more comprehensive study of rainfall and food prices across the world would be needed before a causal relationship can be established. Such a relationship, if any, is probably small compared to the global market trends.
The results of our data exploration are summarised in the slides above. The plots were made with Python and PostgreSQL. I learned loads and hope to meet the awesome data analysts from Oxfam again in the future!