Abstract
Cameras are now ubiquitous in our lives. A given activity is often captured by multiple people from different viewpoints resulting in a sizable collection of photograph footage. We present a method that effectively organizes this spatiotemporal content. Given an unorganized collection of photographs taken by a number of photographers, capturing some dynamic event at a number of time steps, we would like to organize the collection into a space–time table. The organization is an embedding of the photographs into clusters that preserve the viewpoint and time order. Our method relies on a self-organizing map (SOM), which is a neural network that embeds the training data (the set of images) into a discrete domain. We introduce BiSOM, which is a variation of SOM that considers two features (space and time) rather than a single one, to layout the given photograph collection into a table. We demonstrate our method on several challenging datasets, using different space and time descriptors.
Original language | English |
---|---|
Pages (from-to) | 417-430 |
Number of pages | 14 |
Journal | Visual Computer |
Volume | 34 |
Issue number | 3 |
DOIs | |
State | Published - 1 Mar 2018 |
Keywords
- Image organization
- Self-organizing maps
- Spatial ordering
- Temporal ordering