Ing. Nikolaos Efthymiadis

All publications

Authors: Vasileios Psomas, Ph.D., Kakogeorgiou, I., Ing. Nikolaos Efthymiadis, doc. Georgios Tolias, Ph.D., prof. Mgr. Ondřej Chum, Ph.D., Avrithis, Y., Karantzalos, K.
Publication: IGARSS 2024 - 2024 IEEE International Geoscience and Remote Sensing Symposium Proceedings. Piscataway: IEEE, 2024. p. 8526-8534. ISSN 2153-6996. ISBN 979-8-3503-6033-2.
Year: 2024

DOI: 10.1109/IGARSS53475.2024.10642874
Link: https://doi.org/10.1109/IGARSS53475.2024.10642874
Department: Visual Recognition Group
Annotation:
This work introduces composed image retrieval to remote sensing. It allows to query a large image archive by image examples alternated by a textual description, enriching the descriptive power over unimodal queries, either visual or textual. Various attributes can be modified by the textual part, such as shape, color, or context. A novel method fusing image-to-image and text-to-image similarity is introduced. We demonstrate that a vision-language model possesses sufficient descriptive power and no further learning step or training data are necessary. We present a new evaluation benchmark focused on color, context, density, existence, quantity, and shape modifications. Our work not only sets the state-of-the-art for this task, but also serves as a foundational step in addressing a gap in the field of remote sensing image retrieval. Code at: https://github.com/billpsomas/rscir.

Authors: Ing. Nikolaos Efthymiadis, doc. Georgios Tolias, Ph.D., prof. Mgr. Ondřej Chum, Ph.D.,
Publication: 2022 26th International Conference on Pattern Recognition (ICPR). Piscataway: IEEE, 2022. p. 3595-3602. ISSN 2831-7475. ISBN 978-1-6654-9062-7.
Year: 2022

DOI: 10.1109/ICPR56361.2022.9956233
Link: https://doi.org/10.1109/ICPR56361.2022.9956233
Department: Visual Recognition Group
Annotation:
This work addresses scaling up the sketch classification task into a large number of categories. Collecting sketches for training is a slow and tedious process that has so far precluded any attempts to large-scale sketch recognition. We overcome the lack of training sketch data by exploiting labeled collections of natural images that are easier to obtain. To bridge the domain gap we present a novel augmentation technique that is tailored to the task of learning sketch recognition from a training set of natural images. Randomization is introduced in the parameters of edge detection and edge selection. Natural images are translated to a pseudo-novel domain called "randomized Binary Thin Edges" (rBTE), which is used as a training domain instead of natural images. The ability to scale up is demonstrated by training CNN-based sketch recognition of more than 2.5 times larger number of categories than used previously. For this purpose, a dataset of natural images from 874 categories is constructed by combining a number of popular computer vision datasets. The categories are selected to be suitable for sketch recognition. To estimate the performance, a subset of 393 categories with sketches is also collected.