 MRC Epidemiology Unit
MRC Epidemiology Unit 
All are invited to the Cambridge Digital Humanities Seminar:
Using images at scale to understand environments and behaviours
Wednesday November 21, 2018
GR06/07, Faculty of English
Convenors:
Dr James Woodcock – Centre for Diet and Activity Research (CEDAR), University of Cambridge
Dr Rahul Goel – Centre for Diet and Activity Research (CEDAR), University of Cambridge
Dr Carola-Bibiane Schönlieb – Reader in Applied and Computational Analysis, Department of Applied Mathematics and Theoretical Physics, University of Cambridge
Dr Anne Alexander – Cambridge Digital Humanities
Book four FREE place online here: https://www.eventbrite.co.uk/e/using-images-at-scale-to-understand-environments-and-behaviours-tickets-50737316680
Image big data are increasingly being used to understand the built and natural environment and to observe behaviours within it. Data sources include satellite and airborne imagery, 360 street views, and fixed video or time lapse traffic and CCTV cameras. While some of these sources are newer than others what has been changing are the quality of the images, the geographical coverage, and the potential for assessing changes over time. At the same time improvements in machine learning have made it possible to turn images into quantitative data at scale.
In this workshop we will explore the challenges that researchers face when using images at scale to understand environments and behaviours, building on work at Cambridge to estimate cycling levels, using satellite data to estimate motor vehicle volume, and planned data collection in Kenya using 360 cameras. We will be using the following themes to help structure our discussion.
Data capture and access
What do researchers need to understand about how these images are created in order to interpret them accurately at scale? How do we deal with the challenge of working with ‘image data’ which no longer needs to be rendered into something that humans would recognise as an image for us to work with it? With the increasing drive towards the outsourcing of data capture to various third parties and subcontractors what questions should researchers be asking about these processes in order to build effective models?
Data analysis and triangulation
What theoretical and practical problems arise from efforts to combine heterogeneous large-scale image datasets to model human behaviours? How do we account for different kinds of temporality and spatiality in large-scale image datasets when making statistical inferences and developing models of behaviour – for example if we combine image data composed of a sequence of snapshots from fixed locations such as traffic cameras with data filmed by a drone? What methods should we use to render gaps and ambiguity in the data more visible to end-users of our results?
Identity and human rights
What methods should researchers use to protect the rights of humans who may be identifiable in these datasets? Can we reliably anonymise large-scale image datasets? Whose consent should we seek to capture, process, analyse or publish such datasets and when should we seek it?
Data management, re-use and preservation
What infrastructure do we need to manage large-scale image datasets in the present? Should we be attempting to preserve them for the future?
A sandwich lunch will be available for participants so please help us cater accurately by cancelling your ticket or letting us know if you can no longer attend. Please contact Michelle Maciejewska (mm405@cam.ac.uk) with any special dietary requirements by 14 November.
FREE booking online here: https://www.eventbrite.co.uk/e/using-images-at-scale-to-understand-environments-and-behaviours-tickets-50737316680