ICC 2017 presentation

ICC 2017

Using space-time cube for visualization of active transportation patterns derived from public webcams

Anna Petrasova, J. Aaron Hipp, Helena Mitasova

North Carolina State University

Public webcams

Rich source of spatio-temporal information

weather, traffic, changes in environment, phenology, ...
active transportation behavior in urban areas

Public webcams is a great source of spatio-temporal information, not just for tourists to check weather or traffic, but they have been used also for research in different areas. For example, to observe changes in environment, or in ecology when trees start to bloom. Public webcams in urban areas especially is also a valuable source of information about people and their transportation behavior, which is the main topic of this talk. Naturally, the widespread availability of the webcams all over the world together with the huge amount of data coming from each webcam every half an hour for example, has made this datasource highly suitable for research in computer vision and machine learning. BUT, to actually do any work with this data, there has to be a centralised archive, of webcam images to facilitate the research.

AMOS

The Archive of Many Outdoor Scenes

collection of long-term timelapse imagery from publicly accessible outdoor webcams around the world
1,128,087,180 images taken from 29945 webcams
a project of the Media and Machines Lab Washington University in St. Louis
online browsing of images and download available
metadata and tags to improve discoverability of webcams

From image to information

How to get from image to information useful for analysis?

Artificial intelligence

machine learning
neural networks

https://xkcd.com/1838

Artficial artficial intelligence

Amazon Mechanical Turk
crowdsourcing marketplace platform

fake chess-playing machine (late 18th century)

mTurk HITs (Human Intelligence Tasks)

HITs processing workflow

Georeferencing

Using coordinate system of the webcam image:

distances in the image represent varying distances in reality
we can't integrate other geospatial datasets (streets, POIs) or information from other webcams

Solution is to compute projective transformation by matching 4+ stable features in the webcam image to the same features in the orthophoto.

Georeferencing: example

Caveats: some webcams change orientation, many objects such as benches, traffic marking are unsuitable as GCPs, stable objects such as statues can move too

Distortions

Small errors in the mTurk outlines result in large spatial errors further from the webcam

STC visualization

Space-time density of pedestrians represented as a 3D volume, computed using multivariate Kernel Density Estimation (KDE) with different spatial and temporal bandwidths

Pedestrian density visualization

webcam 9706 (July), Ehingen, Germany

Pedestrian density visualization

webcam 10823 (July), Überlingen, Germany

Effects of plaza reconstruction

webcam 3760 in 2012 (Jul - Sep), Victoria Square, Adelaide, Australia

Effects of plaza reconstruction

webcam 3760 in 2014 (Jul - Sep), Victoria Square, Adelaide, Australia

Change in pedestrian density (2014 minus 2012)

Positive values ~ increase in density in 2014
Negative values ~ decrease in density in 2014

High density of pedestrians and vehicles (webcam 5599)

if (P > percentile^P₉₉ AND V > percentile^V₉₉, V + P)

Challenges: webcam geometry and view

areas hidden behind trees or other objects
assumes pedestrians and vehicles on a horizontal plane, otherwise we get large spatial errors

Challenges: mTurk reliability

Traffic lights, statues mistakenly marked as pedestrians, machine learning approaches would avoid this type of error

Software

Python libraries
Jupyter Notebook for data exploration
Georeferencing: scikit-image, GRASS GIS
KDE: SciPy, Statsmodels
Rendering: ParaView, GRASS GIS

github.com/petrasovaa/amos-visualization

Conclusion & Future work

new method for harvesting and visualization of spatio-temporal information about active transportation
new way for cities to detect and analyze changes in active transportation behavior in an unintrusive way
georeferenced data give us the ability to incorporate other geospatial data and methods (e.g., solar radiation modeling)
possible thanks to the synergy between crowdsourcing technologies (AMOS, mTurk, open source software)
machine learning techniques trained by mTurk data will enable us to analyze much larger data volume in real-time, possibly leading to the discovery of more patterns

The presented methodology shows an effective way to harvest and visualize active transportation data and could be used actively by cities to detect and analyze changes in active transportation behavior in a cheap and nonintrusive way. With the georeferncing step we can incorporate other geospatial data and methods, for example we were thinking of using solar radiation modeling to see how the shades influence people's behavior. This research was possible thanks to the synergy between the different crowdsourcing technologies and platforms including AMOS, mechanical Turk, and open source software. Finally, the current and future research is focused on using the mechanical turk data as a training set for machine learning techniques, which would ultimately allow us to process the entire database of images, and would reveal many more spatio-temporal patterns in the data.

Appendix

References:

Hipp, J. A., Adlakha, D., Gernes, R., Kargol, A., Pless, R., Drive, O. B., Louis, S. (2013). Do You See What I See: Crowdsource Annotation of Captured Scenes, 24–25. http://doi.org/10.1145/2526667.2526671
Hipp, J. A., Manteiga, A., Burgess, A., Stylianou, A., Pless, R. (2016). Webcams, Crowdsourcing, and Enhanced Crosswalks: Developing a Novel Method to Analyze Active Transportation. Front. Public Health, 4(97). http://doi.org/10.3389/fpubh.2016.00097
Jacobs, N., Roman, N., Pless, R. (2007). Consistent temporal variations in many outdoor scenes. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. http://doi.org/10.1109/CVPR.2007.383258

Reliability results for annotation of pedestrians in 720 webcam scenes.

Hipp, J. A., Adlakha, D., Gernes, R., Kargol, A., Pless, R., Drive, O. B., Louis, S. (2013). Do You See What I See: Crowdsource Annotation of Captured Scenes, 24–25. http://doi.org/10.1145/2526667.2526671