View on GitHub

OpenDataToronto

Analytics with Toronto and Ontario open data

Welcome to OpenDataToronto! This repo repo houses my analysis of city and provincial open datasets, many of which were for the Open Toronto open data book club meetup. All analysis was done in Jupyter notebooks, and a list of package requirements can be found in the README.md under each project’s folder.

27/11/2018 - Toronto Bikeshare Trips July - December 2016

Analysis ipynb

The Toronto Q3 and Q4 bikeshare ridedata captures (anonymized) trips taken with the Toronto bikeshare system between July 1st and December 31st, 2016. The data includes the trip start and end times, duration, and origin and destination stations.

The visualization is deployed on Amazon AWS using Dash. Click on the link below to see more.



OTF


Plot of the number of rides per hour for the average weekday and weekend, for the Toronto bikeshare system in Q3-Q4 2016. Rides are subdivided into bikeshare members and casual riders (those without a bikeshare app account). Weekdays see an increase in the number of member riders during rush hour, and weekends a general increase in leisure rides.

20/4/2017 - Toronto City Sidewalk Inventory

Analysis ipynb
Meetup Page

The Toronto Sidewalk Inventory is a geospatial dataset that gives the availability of sidewalks along Toronto’s transportation corridors. I plot a map of these sidewalks, and investigate the correlation between lack of sidewalks along the roads of a neighbourhood with its population density and median household income. I find that the fraction of underdeveloped or missing sidewalk tends to depend on population density and on zoning, with densely populated areas and residential and commercial zones having very little missing sidewalk. Missing sidewalk, however, is not correlated with neighbourhood income, so there doesn’t appear to be any favouritism from city hall toward richer neighbourhoods.

OTF


Plot of sidewalk availability for the roadways of Toronto, with the old Toronto municipality boundaries overplotted for reference. Colours indicate roadway type and sidewalk availability. Red, yellow and green roads indicating no sidewalks, sidewalk available on one side of the road only, and available on both sides, respectively, while light and dark blue roads are actually walkways, pathways or recreational trails.


OTF


Sidewalk inventory for roadways of varying types. Institutional roads are those bordering universities, schools and hospitals; open roads are roads within or bordering parks, recreation areas, golf courses, marinas and cemeteries; and other roads are utility and transportation roads. The fraction of sidewalk in a particular state is indicated by the length of its corresponding bar section. The meanings of colours are given by the legend.


23/2/2017 - MyDem0cracy

Analysis ipynb
Meetup Page

The MyDem0cracy Canadian electoral reform survey (note the replacement of “o” by “0”) was produced by a group of concerned citizens as a complementary survey to the controversial MyDemocracy.ca survey by the Government of Canada. The survey solicits freeform comments from participants, which are then posted to let subsequent participants vote (“agree”, “disagree” or “neutral”) on the comments. I investigate the consensus opinion arising from these comments, and attempt to determine clusters of voters with similar opinions. In the aggregate, I find that participants are all highly in favour of greater political education and civic engagement. They disagree, however, with how electoral reform should proceed, with one large group of users advocating for proportional representation, another in favour of the current system, and a third without a strong opinion either way.

OTF


Plot of vote standard deviation vs. vote arithmetic mean of MyDem0cracy Canadian electoral reform survey comments. Marker sizes represent the total number of votes a comment received, while colour represents the response fraction, the percentage of the people who visitied MyDemo0cracy.ca since the question was posted that voted on the comment. The dashed grey line represents the largest standard deviation value for a given mean. Click here or on the picture to go to the interactive version, where you can hover the mouse over a marker to read the text of its comment and the number of agree, disagree and neutral votes for it.


24/11/2016 - Ontario Trillium Fund

Analysis ipynb
Meetup Page

The Ontario Trillium Foundation (OTF) is an agency of the Ontario government that annually allocates more than $136 million dollars in social/community program funding province-wide. In accordance with the Ontario government’s Open Data Directive, OTF provides data on successful grant applications over the last two decades on their open datapage. I perform an exploratory analysis on this data, examining how aggregate, per-capita and per-project funding is divided into different project areas and geographic regions, and how this changes with time. I find that:

OTF


Per-capita OTF Spending in Ontario's census geographic regions. Brighter and warmer colours represent higher funding per capita. Populations are taken from 2011 census, and annual funding is averaged from its FY 2010 to 2016 values. Click here or on the picture to go to the interactive version, where you can hover the mouse over a census area to see its name, population, number of OTF grants given per year, funding per year, median funding per project, and funding per capita (all OTF values are also averaged from FY 2010 to 2016).