Large-scale multimedia indexing and search
Development of scalable methods for performing content-based image search in massive image collections. Research involves a) the extension of state-of-the-art feature extraction, aggregation and indexing schemes for speeding up near-duplicate search, while maintaining competitive levels of retrieval accuracy, and b) the application and extension of dimensionality reduction and approximate computation techniques to the problem of concept detection, in both massive scale and incremental settings.
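As a rough illustration of how an approximate index can speed up near-duplicate search, the sketch below hashes feature vectors with random hyperplanes so that similar vectors tend to land in the same bucket. Random-hyperplane hashing is a swapped-in, generic technique chosen for brevity, not necessarily the indexing scheme used in this research; the names `HashIndex`, `signature` and `make_hyperplanes`, the 4-dimensional vectors and the 8-bit signature length are all assumptions made for the example.

```python
import random
from collections import defaultdict


def make_hyperplanes(dim, n_bits, seed=0):
    """Draw n_bits random hyperplanes (normal vectors) in a dim-dimensional space."""
    rng = random.Random(seed)
    return [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(n_bits)]


def signature(vector, hyperplanes):
    """One bit per hyperplane: which side of the hyperplane the vector falls on."""
    return tuple(
        1 if sum(h * v for h, v in zip(plane, vector)) >= 0 else 0
        for plane in hyperplanes
    )


class HashIndex:
    """Hypothetical bucket index: images with equal signatures share a bucket."""

    def __init__(self, dim, n_bits=8, seed=0):
        self.hyperplanes = make_hyperplanes(dim, n_bits, seed)
        self.buckets = defaultdict(list)

    def add(self, image_id, vector):
        self.buckets[signature(vector, self.hyperplanes)].append(image_id)

    def query(self, vector):
        """Return candidate near-duplicates without scanning the whole collection."""
        return list(self.buckets.get(signature(vector, self.hyperplanes), []))


index = HashIndex(dim=4)
original = [0.9, 0.1, 0.3, 0.5]           # made-up image descriptor
index.add("img1", original)

# A uniformly rescaled descriptor points in the same direction,
# so it falls on the same side of every hyperplane.
brighter = [2.0 * x for x in original]
matches = index.query(brighter)
```

A lookup touches a single bucket instead of every stored vector, which is where the speed-up comes from; the price is that some true near-duplicates may hash to a different bucket, so practical systems typically trade accuracy against speed through the number of bits and tables.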
Multimedia analysis using knowledge and context
We study semantic multimedia analysis not merely as a machine learning problem, but from the perspective of how explicit knowledge and contextual information can best be used to boost analysis efficiency. We investigate different types of context, ranging from expert-provided knowledge encoded in a formal ontology and crowdsourced knowledge derived from the collective intelligence of social networks, all the way to knowledge that extends across media in multi-modal problems.
Development of knowledge structures, languages and tools for multimedia analysis through the extension of Semantic Web languages with multimedia descriptions, rules and relations. Development of reasoning techniques for multimedia applications in order to improve multimedia analysis, remove uncertainty and integrate different analysis results, rules and relations based on spatiotemporal constraints. Development of reasoning-based ontology and content alignment techniques.
eHealth and Medical Imaging
Semantic Web technologies for medical applications, knowledge structures for biomedical data, intelligent processing and reasoning for diagnosis assistance and risk assessment, semantic processing of multimedia medical databases, feature extraction from medical images, medical image processing and analysis, medical tools and interfaces.
Patent Image Search
We address the problem of segmenting patents into their constituent images using low-level visual features and machine learning techniques. We propose novel visual features to support content-based search of patent images and complex technical drawings. In addition, we investigate concept extraction from patent figures by combining features from different modalities with early and late fusion techniques.
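The difference between early and late fusion can be sketched in a few lines. In early fusion the features of all modalities are concatenated and passed to a single model; in late fusion each modality gets its own model and only the output scores are combined. The example below is a minimal sketch under stated assumptions: the two modalities (visual and textual), the logistic scoring function, the weights and the mixing parameter `alpha` are all hypothetical and are not taken from the work described here.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def linear_score(weights, bias, features):
    """Score of a simple logistic model, in (0, 1)."""
    return sigmoid(sum(w * f for w, f in zip(weights, features)) + bias)


def early_fusion(visual, textual, weights, bias):
    """Concatenate the modality features, then apply ONE model to the fused vector."""
    fused = list(visual) + list(textual)
    if len(weights) != len(fused):
        raise ValueError("weights must cover the concatenated feature vector")
    return linear_score(weights, bias, fused)


def late_fusion(visual, textual, visual_model, textual_model, alpha=0.5):
    """Score each modality with its OWN model, then mix the two scores."""
    visual_score = linear_score(*visual_model, visual)
    textual_score = linear_score(*textual_model, textual)
    return alpha * visual_score + (1.0 - alpha) * textual_score


# Made-up features for one patent figure and made-up model parameters.
visual_features = [1.0, 2.0]
textual_features = [3.0]

early = early_fusion(visual_features, textual_features,
                     weights=[0.0, 0.0, 0.0], bias=0.0)

visual_model = ([0.0, 0.0], 0.0)   # (weights, bias) for the visual modality
textual_model = ([1.0], 0.0)       # (weights, bias) for the textual modality
late = late_fusion(visual_features, textual_features,
                   visual_model, textual_model, alpha=0.5)
```

Early fusion lets a single model exploit correlations between modalities, while late fusion keeps the per-modality models independent, which makes it easier to add or drop a modality.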
SALIC: Social Active Learning for Image Classification
We investigate ways to fully automate the process of learning image classification models, so that computer vision systems can scale across multiple domains and concepts. In particular, we study how active learning can be fully automated in the context of social networks by replacing the human oracle with user-tagged images obtained from social media. New samples obtained from social media are used to expand the training set of image classifiers in both volume and variability. To address the noisy nature of user-contributed tags, we seek to jointly maximize the informativeness of the selected samples and our confidence about their actual content. In this way, we benefit from the huge volume of multimedia content shared through social media when training computer vision systems.
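The idea of jointly weighing informativeness against tag reliability can be illustrated with a small selection routine. This is only a sketch under stated assumptions, not the SALIC formulation itself: informativeness is approximated by how close the current classifier's output is to 0.5, the tag confidence is taken as a given number in [0, 1], and the two are combined by a simple product. The names `Candidate`, `joint_score` and `select_samples` are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Candidate:
    image_id: str
    classifier_prob: float  # current model's P(concept | image), in [0, 1]
    tag_confidence: float   # assumed P(user tag reflects actual content), in [0, 1]


def informativeness(p):
    """Uncertainty-style score: 1.0 when p == 0.5, 0.0 when p is 0 or 1."""
    return 1.0 - abs(2.0 * p - 1.0)


def joint_score(candidate):
    """Assumed combination rule: the product rewards samples that are BOTH
    informative for the classifier and likely to be correctly tagged."""
    return informativeness(candidate.classifier_prob) * candidate.tag_confidence


def select_samples(pool, k):
    """Pick the k highest-scoring candidates to add to the training set."""
    return sorted(pool, key=joint_score, reverse=True)[:k]


pool = [
    Candidate("a", 0.50, 0.20),  # very informative, but the tag is unreliable
    Candidate("b", 0.60, 0.90),  # informative and reliably tagged
    Candidate("c", 0.95, 0.95),  # reliable tag, but the model is already sure
    Candidate("d", 0.40, 0.80),  # informative and fairly reliable
]
selected = select_samples(pool, k=2)
```

With these made-up numbers the routine picks "b" and "d": sample "a" is the most uncertain for the classifier but its tag is probably wrong, and sample "c" is well tagged but teaches the classifier little.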
We work on methods for detecting and localizing forgeries in digital media items. Research focuses on tampering localization algorithms for digital images collected from Web and social media environments. In parallel to improving the state-of-the-art in terms of detection rates, we also work on maximizing the interpretability and unambiguousness of the outputs, which usually take the form of localization heat maps.