The development of image recognition systems is a complex and highly specialized process, which requires expert knowledge in areas of image recognition and machine learning. Although image recognition systems have a large potential for future applications in everyday life, their application, as of today, is limited by the lack of access to appropriate development tools. The purpose of this project is the development of a software framework for users without expert knowledge in the areas of computer vision and machine learning. The development of such a software framework requires the adaptation of the standard development process in computer vision to the needs of non-expert users. In detail, the framework developed and presented in this work, called FOREST (Flexible Object REcognition SysTem)
highly automates the development process
simplifies the development of non-automatable components, and
provides intuitive user interfaces which require no training or previous knowledge.
In contrast to existing development tools, FOREST does not aim to provide a tool for a specialized development process, but instead lets users adapt its generic recognition functionality to the intended recognition task. FOREST requires only an image data source and the annotations for the image data to learn a classifier for the intended task, e.g., the recognition of left open windows.
FOREST implements its flexible recognition functionality by providing a large set of different image region detection and feature description algorithms. A Boosting classifier is then used to select discriminative image features for the intended recognition task.
The efficient annotation of images is an important aspect for the successful deployment of such a framework. Therefore, efficient annotation techniques were investigated and a semi-automatic annotation process was proposed. An image data set is clustered according to similarity and presented to the user, who may efficiently annotate clusters of images in one go. The clustering can be interactively recalculated using an adapted similarity metric, based on partial annotations provided by the user.
J. Moehrmann & G. Heidemann. FOREST - A Flexible Object Recognition System. In Proceedings of the International Conference on Pattern Recognition Applications and Methods (ICPRAM-2015), 2 : 119-127, ISBN: 978-989-758-077-2, 2015. SCITEPRESS. | DOI | BibTeX
J. Moehrmann. Effiziente Erstellung aufgabenspezifischer Bilderkennungssysteme. PhD thesis, Universität Osnabrück, Institute of Cognitive Science, 2014. | URL | BibTeX
J. Moehrmann & G. Heidemann. Semi-Interactive Image Annotation with Visual Feedback. In ECCV Workshop on Human-Machine Communication for Visual Recognition and Search 2014. | PDF | BibTeX
J. Moehrmann & G. Heidemann. Efficient development of user-defined image recognition systems. In Computer Vision-ACCV 2012 Workshops, 7728 : 242-253, ISBN: 978-3-642-37409-8, 2013. | DOI | URL | BibTeX
J. Moehrmann & G. Heidemann. Semi-automatic Image Annotation. In Computer Analysis of Images and Patterns, 8048 : 266-273, ISBN: 978-3-642-40245-6, 2013. | DOI | BibTeX
J. Moehrmann & G. Heidemann. Efficient Annotation of Image Data Sets for Computer Vision Applications. In Proceedings of the 1st International Workshop on Visual Interfaces for Ground Truth Collection in Computer Vision Applications, pages: 2:1-2:6, 2012. ACM. | DOI | BibTeX
J. Moehrmann, S. Bernstein, T. Schlegel, G. Werner & G. Heidemann. Improving the Usability of Hierarchical Representations for Interactively Labeling Large Image Data Sets. In Human-Computer Interaction. Design and Development Approaches, 6761 : 618-627, ISBN: 978-3-642-21601-5, 2011. Springer. | DOI | BibTeX
J. Moehrmann, A. Burkovski, E. Baranovskiy, G.A. Heinze, A. Rapoport & G. Heidemann. A Discussion on Visual Interactive Data Exploration Using Self-Organizing Maps. In Proc. 8th Int. Workshop on Advances in Self-Organizing Maps (WSOM 2011), 6731 : 178-187, ISBN: 978-3-642-21565-0, 2011. Springer. | DOI | BibTeX
J. Moehrmann & G. Heidemann. Automatic Trajectory Clustering for Generating Ground Truth Data Sets. In Image Processing: Machine Vision Applications III, 7538 (1) : 753808-1-753808-9, 2010. SPIE. | PDF | BibTeX
J. Moehrmann, G. Heidemann, O. Siemoneit, C. Hubig, U.P. Kaeppeler & P. Levi. Context Generation with Image Based Sensors: An Interdisciplinary Enquiry on Technical and Social Issues and their Implications for System Design. In Proc. World Academy of Science, Engineering and Technology, 61 : 311-317, 2010. WASET. | PDF | BibTeX
J. Moehrmann, X. Wang & G. Heidemann. Motion Based Situation Recognition in Group Meetings. In Image Processing: Machine Vision Applications III, 7538 (1) : 75380N-1-75380N-9, 2010. SPIE. | PDF | BibTeX