Find below a selection of datasets maintained by us. All data is only for research purposes, unless stated differently. Please make sure to reference the authors properly when using the data.
|
Over 15K images of 20 people recorded with a Kinect while turning their heads around freely. For each frame, depth and rgb images are provided, together with ground in the form of the 3D location of the head and its rotation angles. Information and download pageRelated publications:
|
The corpus contains high quality dynamic (25 fps) 3D scans of faces recorded while pronouncing a set of English sentences. Affective states were induced by showing emotional video clips to the speakers. The data has been annotated by tracking all frames using a generic face template, segmenting the speech signal into single phonemes, and evaluating the emotions conveyed by the recorded sequences by means of an online survey.
Information and request page
|
Related publications:
|
Walking pedestrians in busy scenarios from a bird eye view. Manually annotated. Data used for training in our ICCV09 paper "You'll Never Walk Alone: Modeling Social Behavior for Multi-target Tracking"
A dataset for testing object class detection algorithms. It contains 255 test images and features five diverse shape-based classes (apple logos, bottles, giraffes, mugs, and swans).
Related publications:
The Extended ETHZ shape classes is a larger database of shape categories, created by merging ETHZ shape classes with Konrad Schindler's 4x50 closed shapes. This is (almost) a superset of each of the two older databases. Please refer to the README for details on the differences and how to use the new larger dataset.
Related publications:
Range images of faces with ground truth used in our CVPR'08 paper "Real-Time Face Pose Estimation from Single Range Images".
The sequence contains 1175 stereo camera pairs acquired with setup mounted on top of a moving vehicle. The stereo setup has a fixed baseline, and the cameras are calibrated internally and with respect to each other.
Three pedestrian crossing sequences used in our ICCV'07 paper. Each sequence comes with ground-truth bounding box annotations for the objects to be tracked, as well as a camera calibration. The annotation files for the pedestrian crossing sequences contain bounding box annotations for every fourth frame.
The set was recorded in Zurich, using a pair of cameras mounted on a mobile platform. It contains 12'298 annotated pedestrians in roughly 2'000 frames.
The goal of the ZuBuD Image Database is to share image data sets with researcheres around the world. To facilitate this, we have created this site, which contains over 1005 images about Zurich city building. The detail information about the database can be found on our Technical Report:TR-260.
We will be adding new data to this site as time permits. Furthermore, we will now accept datasets from other researchers, to add to our archive. If you would like to contribute for this, please contact Hao Shao. The full sized images themselves are stored in PNG (Portable Network Graphics) format.
|
ZuBuD
tar-gzipped (486MB) | Created: April 2003 |
|
ZuBuD Query Images
tar-gzipped (3,1MB) Ground truth mapping (txt) | Created: April 2003 |
Created: April 2003
The data contains dynamic sagittal 2D images acquired during free breathing. More specifically