2022 |
Boutsi, A-M; Bakalos, N; Ioannidis, C POSE ESTIMATION THROUGH MASK-R CNN AND VSLAM IN LARGE-SCALE OUTDOORS AUGMENTED REALITY Journal Article In: ISPRS Ann. Photogramm. Remote Sens. Spat. Inf. Sci., vol. V-4-2022, no. 4, pp. 197–204, 2022, ISSN: 2194-9050. Abstract | Links | BibTeX | Tags: 3D rendering, Augmented Reality, CNN, deep learning, image recognition, pose estimation @article{boutsi2022pose, Abstract. Deep Learning (DL) ingrained into Mobile Augmented Reality (MAR) enables a new information-delivery paradigm. In the context of 6 DoF pose estimation, powerful DL networks could provide a direct solution for AR systems. However, their concurrent operation requires a significant number of computations per frame and yields to both misclassifications and localization errors. In this paper, a hybrid and lightweight solution on 3D tracking of arbitrary geometry for outdoor MAR scenarios is presented. The camera pose information obtained by ARCore SDK and vSLAM algorithm is combined with the semantic and geometric output of a CNN-object detector to validate and improve tracking performance in large-scale and uncontrolled outdoor environments. The methodology involves three main steps: i) training of the Mask-R CNN model to extract the class, bounding box and mask predictions, ii) real-time detection, segmentation and localization of the region of interest (ROI) in camera frames, and iii) computation of 2D-3D correspondences to enhance pose estimation of a 3D overlay. The dataset holds 30 images of the rock of St. Modestos – Modi in Meteora, Greece in which the ROI is an area with characteristic geological features. The comparative evaluation between the prototype system and the original one, as well as with R-CNN and FAST-R CNN detectors demonstrates higher precision accuracy and stable visualization at half a kilometre distance, while tracking time has decreased at 42% during far-field AR session. |
2020 |
Bakalos, Nikolaos; Rallis, Ioannis; Doulamis, Nikolaos; Doulamis, Anastasios; Voulodimos, Athanasios; Protopapadakis, Eftychios Adaptive Convolutionally Enchanced Bi-Directional Lstm Networks For Choreographic Modeling Proceedings Article In: 2020 IEEE Int. Conf. Image Process., pp. 1826–1830, IEEE, 2020, ISBN: 978-1-7281-6395-6. Abstract | Links | BibTeX | Tags: CNN, Convolutional LSTM, Folkloric dances, Intangible Cultural Heritage, LSTM, Posture identification @inproceedings{Bakalos2020, In this paper, we present a deep learning scheme for classification of choreographic primitives from RGB images. The proposed framework combines the representational power of feature maps, extracted by Convolutional Neural Networks, with the long-term dependency modeling capabilities of Long Short-Term Memory recurrent neural networks. In addition, it uses AutoRegressive and Moving Average (ARMA) filter into the convolutionally enriched LSTM filter to face dance dynamic characteristics. Finally, an adaptive weight updating strategy is introduced for improving classification modeling performance The framework is used for the recognition of dance primitives (basic dance postures) and is experimentally validated with real-world sequences of traditional Greek folk dances. |
2018 |
Agrafiotis, P; Skarlatos, D; Forbes, T; Poullis, C; Skamantzari, M; Georgopoulos, A Underwater photogrammetry in very shallow waters: Main challenges and caustics effect removal Journal Article In: Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci. - ISPRS Arch., vol. 42, no. 2, pp. 15–22, 2018, ISSN: 16821750. Abstract | Links | BibTeX | Tags: Caustics, CNN, SfM MVS, Underwater 3D reconstruction @article{Agrafiotis2018, In this paper, main challenges of underwater photogrammetry in shallow waters are described and analysed. The very short camera to object distance in such cases, as well as buoyancy issues, wave effects and turbidity of the waters are challenges to be resolved. Additionally, the major challenge of all, caustics, is addressed by a new approach for caustics removal (Forbes et al., 2018) which is applied in order to investigate its performance in terms of SfM-MVS and 3D reconstruction results. In the proposed approach the complex problem of removing caustics effects is addressed by classifying and then removing them from the images. We propose and test a novel solution based on two small and easily trainable Convolutional Neural Networks (CNNs). Real ground truth for caustics is not easily available. We show how a small set of synthetic data can be used to train the network and later transfer the le arning to real data with robustness to intra-class variation. The proposed solution results in caustic-free images which can be further used for other tasks as may be needed. |
2022 |
POSE ESTIMATION THROUGH MASK-R CNN AND VSLAM IN LARGE-SCALE OUTDOORS AUGMENTED REALITY Journal Article In: ISPRS Ann. Photogramm. Remote Sens. Spat. Inf. Sci., vol. V-4-2022, no. 4, pp. 197–204, 2022, ISSN: 2194-9050. |
2020 |
Adaptive Convolutionally Enchanced Bi-Directional Lstm Networks For Choreographic Modeling Proceedings Article In: 2020 IEEE Int. Conf. Image Process., pp. 1826–1830, IEEE, 2020, ISBN: 978-1-7281-6395-6. |
2018 |
Underwater photogrammetry in very shallow waters: Main challenges and caustics effect removal Journal Article In: Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci. - ISPRS Arch., vol. 42, no. 2, pp. 15–22, 2018, ISSN: 16821750. |