Existing publicly available image databases, like the COIL [Murase95], have been very influential. One corner-stone for the COGVIS-project is therefore the construction of a common object database, which serves as the basis for both psychophysical and computational studies concerning object recognition and categorization. In this section, we present the ETH-80 image set, a first subset of the COGVIS database, targeted specifically to the task of object categorization. The ETH-80 database contains 80 objects from 8 carefully chosen categories, high-resolution color images, and segmentation masks for every image. MotivationIt is important to emphasize that the notion and the abstraction level of object classes is far from being uniquely and clearly defined. Notably, the question of how humans organize knowledge at different levels has received much attention in Cognitive Psychology [Brown58]. Taking an example from Brown's work, a dog can not only be thought of as a dog, but also as a boxer, a quadruped, or in general an animate being [Brown58]. Yet, dog is the term that comes to mind most easily, which is by no means accidental. Experiments show that there is a basic level in human categorization at which most knowledge is organized [Rosch76]. According to Rosch et al. [Rosch76,Lakoff87], this basic level is also
These points are the motivation for us to address multi-level object categorization rather than the less clearly defined problem of object classification. Basic level categorization is easiest for humans. At the next lower levels, subordinate categories and the exemplar level used in object identification can be found. The next higher level, superordinate categories, requires a higher degree of abstraction and world knowledge. It is thus useful to start the generic object recognition task in the framework of basic-level categories, which seem to be a good starting point for visual classification. The current version of the database is restricted to basic level categories. In a first step, we explicitly do not want to model functional categories (e.g. ``things you can sit on'') and ad-hoc categories (e.g. ``things you can find in an office environment'') [Barsalou83]. Even though those categories are important, they exist only on a higher level of abstraction and require a high degree of world knowledge and experience living in the real world. The Database
For every image, we provide a high-quality segmentation mask, so that shape and contour based methods can be easily applied. An example segmentation mask and the extracted contour can be seen in the figure below. (Click on any of the images to see them in their full resolution).
The intended test mode is leave-one-object-out crossvalidation. This means we train with 79 objects and test with the one unknown object. Recognition is considered successful if the correct category label is assigned. The results are averaged over all 80 possible test objects. We use the database for a best case analysis: categorization of unknown objects under the same viewing conditions, with a near-perfect figure-ground segmentation, and known scale. In a practical application, such perfect information is seldomly available. But if an algorithm does not work under these ideal conditions, it is likely to fail in practice. We have used this database to compare different methods for object categorization. In particular, we want to address the question of what the role of color, texture, and shape is for this task. For this reason, we have analyzed the performance of several state-of-the-art appearance- and contour-based recognition methods on the database categories. A detailed description of the experiments can be found here. Structure:
Publications:
References:Links:Contact:
|
|



