We present a real-time system for vehicle detection and classification at road intersections based on image processing techniques. The system estimates traffic flow at a specific point: it recognizes the trajectories of individual vehicles at an intersection and infers whether they are leaving or entering the city. It is designed to be integrated into a high-fidelity digital twin that aids in estimating traffic-related environmental pollutants. Whereas Computational Fluid Dynamics (CFD) simulations typically rely on coarse estimators such as averages or aggregate measurements, our system supplies more accurate, per-vehicle inputs for pollution estimation. The implications of this study are significant for urban planning and traffic management: it supports immediate decisions and informs long-term infrastructure planning by providing a deep understanding of intersection dynamics. Our research offers a comprehensive perspective on traffic analysis, introducing data-driven traffic management strategies for efficient urban mobility. The code developed for this purpose is available at https://github.com/capo-urjc/TrackingSORT
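The association step at the heart of SORT-style tracking can be sketched as follows; this is an illustrative simplification (greedy IoU matching in pure Python), not the repository's code, which combines a Kalman filter with Hungarian assignment:

```python
def iou(a, b):
    """Intersection-over-Union of two boxes given as (x1, y1, x2, y2)."""
    x1, y1 = max(a[0], b[0]), max(a[1], b[1])
    x2, y2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, x2 - x1) * max(0.0, y2 - y1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    return inter / (area_a + area_b - inter)

def match_detections(tracks, detections, iou_threshold=0.3):
    """Greedily associate current detections with existing tracks by IoU."""
    pairs = sorted(
        ((iou(t, d), ti, di) for ti, t in enumerate(tracks)
         for di, d in enumerate(detections)),
        reverse=True)
    matched_t, matched_d, matches = set(), set(), []
    for score, ti, di in pairs:
        if score < iou_threshold:
            break  # remaining pairs overlap too little to be the same object
        if ti not in matched_t and di not in matched_d:
            matches.append((ti, di))
            matched_t.add(ti); matched_d.add(di)
    return matches

tracks = [(0, 0, 10, 10), (50, 50, 60, 60)]
detections = [(49, 51, 59, 61), (1, 1, 11, 11)]
print(match_detections(tracks, detections))  # [(1, 0), (0, 1)]
```

Unmatched detections would spawn new tracks and unmatched tracks age out, which is how entry/exit trajectories accumulate over frames.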
Helical flutes are the most important elements of axial cutting tools. The technological process for manufacturing helical flutes is multifactorial, complex and difficult to compute quickly online. Flutes are produced on a multi-axis grinding machine with a conical or cylindrical grinding wheel. The task is further complicated by the fact that in more than one section the shape of the helical flute does not match that of the grinding wheel. The development of the technology and its preparation therefore requires considerable time. This work proposes a digital twin for the production of helical flutes, taking as input the results of 40 variants of the technological process and comparing them with the results of processing images of helical surfaces in real time. The work establishes, for the first time, an analytical connection between the shape of the grinding wheel, the geometry of the front helical surface of the cutter, and the trajectory and location of the wheel, structured into a system of sensor signals with a controlled monitoring effect per unit of time. This made it possible to define real-time control indicators for assessing accuracy in the processing area and, based on these indicators, to form a control action that ensures the production of conforming parts. This work lays the foundation for a new direction in the industrial production of cutting tools. Using real-time data in the current time window to evaluate future spatio-temporal information about the process state makes it possible to continuously adjust the movement of the grinding wheel within a given error and deviation, ensured by a continuous analytical solution that takes into account the basic geometric parameters of the cutter and the grinding wheel.
Recent algorithmic developments, specifically in deep learning, have propelled computer vision forward for practical applications. However, the high computational complexity and the resulting power consumption are often overlooked. This is a problem not only when systems must be installed in the wild, where often only a limited electricity supply is available, but also in view of their overall energy consumption. To address both aspects, we explore the intersection of green artificial intelligence and real-time computer vision, focusing on the use of single-board computers. To this end, we take into account the limitations of single-board computers, including limited processing power and storage capacity, and demonstrate how algorithm and data optimization can maintain high-quality results at a drastically reduced computational effort. Energy efficiency is thereby increased, aligning with the goals of Green AI and making such systems less dependent on a permanent electrical power supply.
Real-time image processing is a key area of focus, but computationally intensive. Neural networks effectively address classification tasks, but they are not always a viable option, particularly in environments where high power consumption or computational requirements are limiting factors. Hardware devices such as Field-Programmable Gate Arrays (FPGAs) offer significant parallelization capabilities that can be fully exploited when the implemented circuit is composed solely of logic gates. In addition, FPGAs are also interesting alternatives to traditional GPU-based implementations in terms of power consumption and reconfiguration capabilities. They can be used as a demonstration platform to validate a hardware design that can be later manufactured, creating the final Application-Specific Integrated Circuit (ASIC). This paper introduces a practical demonstration platform based on an FPGA that highlights the great capabilities of logic neural networks, a type of neural network constructed exclusively with logic gates.
By harnessing FPGA parallelization and logic gates, we have achieved a balance between computational power and real-time performance. This approach ensures that image classification occurs at speeds on the order of nanoseconds. This ultra-fast processing is well-suited for real-time image analysis applications across various domains. Industries that rely on quality control, such as manufacturing, will benefit from rapid and precise assessments. In the field of medical image processing, where quick diagnoses are crucial, this technology promises transformative advancements. The demonstration platform developed serves as a proof of concept for logic neural networks, offering a solution to the challenge of real-time image processing and representing the first step towards the implementation of future architectures of logic networks in hardware.
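As a software illustration of the underlying idea (the actual design is a hardware circuit, not shown here), the toy network below — with invented gate choices and wiring — shows how inference in a logic neural network reduces to evaluating layers of two-input Boolean gates, with no multiplications at all:

```python
# Every "neuron" is a two-input Boolean gate; a layer is a list of
# (gate_name, input_index_a, input_index_b) triples over the previous layer.
GATES = {
    "AND":  lambda a, b: a & b,
    "OR":   lambda a, b: a | b,
    "XOR":  lambda a, b: a ^ b,
    "NAND": lambda a, b: 1 - (a & b),
}

def forward(layers, bits):
    """Evaluate the gate network on a list of 0/1 inputs."""
    for layer in layers:
        bits = [GATES[g](bits[i], bits[j]) for g, i, j in layer]
    return bits

# A toy 2-layer network computing a 1-bit full-adder sum from 3 inputs.
net = [
    [("XOR", 0, 1), ("AND", 0, 1), ("OR", 2, 2)],  # partial sum, carry, pass-through c_in
    [("XOR", 0, 2)],                               # sum = (a ^ b) ^ c_in
]
print(forward(net, [1, 0, 1]))  # [0]  (1+0+1 -> sum bit 0, carry 1)
```

In hardware each triple becomes one LUT-sized gate, which is why a whole classification pass can settle in nanoseconds.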
For many practical applications, we face the problem that computer vision systems must be installed in the wild, with limited or no permanent power supply. Computationally and energy-efficient solutions are therefore needed. In this work, we show that the judicious use of single-board computers (SBCs) can help achieve these goals, in line with the aims of Green AI. In particular, we show that computer vision algorithms adapted to SBCs yield competitive results compared to high-performance computing devices. To this end, in addition to quantitative performance evaluations, we also measured and compared the power consumption of the algorithmic and technical setups used for various practical problems. These examples demonstrate the practical sustainability of SBCs: they maintain real-time performance while reducing power consumption and environmental impact.
The Multiply-Accumulate (MAC) operation is widely used in real-time image processing tasks, ranging from Convolutional Neural Networks to digital filtering, and significantly impacts overall system performance. In this work, the Self-Adapting Reconfigurable Multiply-Accumulate (SR-MAC) unit is proposed as a new instrument for finding the optimal trade-off between operation throughput, power consumption and physical resource utilization in real-time image processing applications. The proposed system relies on the dynamic reconfiguration of hardware resources on the basis of the current computational requirements. This is achieved by monitoring overflow and over-representation occurrences at each accumulation cycle, and by retaining only the relevant portion of the accumulation result. A custom architecture of the proposed algorithm has been designed in Verilog, implemented on an AMD Xilinx Artix-7 FPGA, and compared with the AMD Xilinx fixed-point macro (floating-point fused multiply-accumulate). The SR-MAC achieves reductions of 83% (82%), 79% (93%) and 87.2% (94%) in the number of LUTs, the number of FFs, and the dynamic power dissipation PdynN, respectively. The SR-MAC has also been used to replace arithmetic units in typical real-time image processing applications; in these cases, it reduced FFs and PdynN by up to 6% and 14%, respectively, while increasing fMax by up to 14%. These results highlight the significant performance enhancement achieved for both single operators and entire systems, making the SR-MAC an excellent design choice for real-time image processing applications.
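The reconfiguration idea can be sketched behaviorally as follows; this is a simplified software model, not the SR-MAC Verilog architecture — an accumulator that checks the signed range after every accumulation cycle and widens itself on overflow:

```python
# Illustrative sketch (not the SR-MAC netlist): an accumulator that starts
# narrow and widens itself whenever an accumulation cycle overflows,
# mimicking the idea of reconfiguring hardware resources on demand.
class SelfWideningMAC:
    def __init__(self, width=8):
        self.width = width       # current two's-complement accumulator width
        self.acc = 0

    def mac(self, a, b):
        self.acc += a * b
        # overflow check for signed range [-2^(w-1), 2^(w-1) - 1]
        while not (-(1 << (self.width - 1)) <= self.acc < (1 << (self.width - 1))):
            self.width += 8      # widen by one byte, as a hardware block might
        return self.acc

m = SelfWideningMAC(width=8)
for a, b in [(100, 1), (100, 1), (100, 1)]:
    m.mac(a, b)
print(m.acc, m.width)  # 300 16  (8 bits overflowed at 200, widened to 16)
```

In hardware, the payoff is the converse direction: keeping the datapath narrow whenever the running sum does not need the extra bits, which is where the LUT/FF and power savings come from.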
Super resolution (SR) is a technique for increasing the spatial resolution of an image from a low-resolution (LR) to a high-resolution (HR) size. SR technology is in considerable demand for recovering HR images in a wide variety of applications, such as medicine, engineering, computer vision, pattern recognition and video production. In contrast to interpolation-based algorithms that often introduce distortions or irregular borders, this study proposes an implementation that preserves the edges and fine details of the original image by computing the wavelet decomposition. Different Discrete Wavelet Transform (DWT) families, such as Daubechies, Symlet, and Coiflet, were evaluated. The proposed system was implemented on a Raspberry Pi 4 Model B, an embedded device, to get around the mobility limitations of a PC, making it possible to create an inexpensive and energy-efficient SR system with reduced complexity for real-time applications. To investigate the visual performance, SR images were analysed subjectively via human perception, confirming good perceptual quality for images of different nature from three datasets: Full-HD (DIV2K), medical (Raabin WBC), and remote sensing (Sentinel-1). The experimental results of the designed implementation demonstrate good performance on commonly used objective criteria: execution time, SSIM, and PSNR (0.742 s, 0.9164, and 38.72 dB, respectively) for images with a super-resolution size of 1356 x 2040 pixels.
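The wavelet machinery can be illustrated with the Haar family, the simplest relative of the evaluated Daubechies/Symlet/Coiflet families. The sketch below implements one analysis/synthesis level in NumPy and shows a common DWT upscaling trick — treating the (suitably scaled) LR image as the approximation band with zeroed detail bands — as a simplified stand-in for the proposed method:

```python
import numpy as np

def haar2d(x):
    """One level of the orthonormal 2-D Haar analysis transform."""
    a, b = x[0::2, 0::2], x[0::2, 1::2]
    c, d = x[1::2, 0::2], x[1::2, 1::2]
    LL = (a + b + c + d) / 2
    LH = (a - b + c - d) / 2
    HL = (a + b - c - d) / 2
    HH = (a - b - c + d) / 2
    return LL, LH, HL, HH

def ihaar2d(LL, LH, HL, HH):
    """Inverse (synthesis) transform; perfect reconstruction of haar2d."""
    h, w = LL.shape
    y = np.empty((2 * h, 2 * w))
    y[0::2, 0::2] = (LL + LH + HL + HH) / 2
    y[0::2, 1::2] = (LL - LH + HL - HH) / 2
    y[1::2, 0::2] = (LL + LH - HL - HH) / 2
    y[1::2, 1::2] = (LL - LH - HL + HH) / 2
    return y

# DWT-based 2x upscaling sketch: the LR image becomes the approximation
# band (scaled to preserve intensity), detail bands are set to zero.
lr = np.array([[10., 20.], [30., 40.]])
sr = ihaar2d(2 * lr, *(np.zeros_like(lr),) * 3)
print(sr.shape)  # (4, 4)
```

With zero detail bands this degenerates to pixel replication; the interesting part of wavelet SR is precisely estimating those detail subbands instead of zeroing them, which is what lets edges stay sharp.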
The intersection of deep learning and programmable logic controllers (PLCs) can lead to innovative applications in automation. One exciting application area is gesture-based control of Automated Guided Vehicles (AGVs). AGVs are used in various industries for material handling, logistics, warehouse automation, etc. Traditionally, these vehicles are controlled using predefined routes or remote controls, but with gesture-based control, operators can communicate more naturally and efficiently. The incorporation of YOLO-Pose in YOLO versions 7 and 8 has elevated the YOLO algorithm to a leading tool for creating gesture recognition models. The YOLO algorithm employs convolutional neural networks (CNNs) to detect objects in real time. These latest YOLO models offer significantly improved accuracy and speed with reduced training times. This paper presents comparative results for 2D gesture recognition transfer-learning models created using the YOLO v5, v7, and v8 models, along with the steps taken to implement the model in a PLC-controlled AGV.
This paper introduces an integrated approach to challenges in traffic monitoring, control and simulation by leveraging Visible Light Communication (VLC) technology. The proposed method optimizes traffic light signals and vehicle and pedestrian trajectories at urban intersections, incorporating Vehicle-to-Vehicle (V2V), Vehicle-to-Infrastructure (V2I), Infrastructure-to-Vehicle (I2V), and Pedestrian-to-Infrastructure (P2I) VLC communication. Experimental results demonstrate the feasibility of implementing these VLC modes in adaptive traffic control systems. Through modulated light, information is exchanged between connected vehicles (CVs) and infrastructure elements such as streetlamps and traffic lights. Cooperative CVs share position and speed data via V2V communication within control zones, enabling adaptation to various traffic movements during signal phases. Optimal traffic light control policies are determined using Reinforcement Learning and the Simulation of Urban MObility (SUMO) agent-based simulator. Unlike conventional methods focused solely on maximizing traffic capacity, this approach integrates traffic efficiency and safety considerations, including pedestrian concerns at intersections. Simulation scenarios adapted from real-world environments, such as Lisbon, feature interconnected intersections with mutual traffic flow impact. A deep reinforcement learning algorithm dynamically manages traffic flows during peak hours via V2V and V/P2I communications while prioritizing pedestrian and vehicle waiting times; VLC mechanisms facilitate the queue/request/response interactions. A comparative analysis highlights the proposed approach's benefits in throughput, delay reduction, and minimizing vehicle stops, revealing improved patterns for signal and trajectory optimization. Evaluation on separate training and test sets ensures model reliability and effectiveness.
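The reinforcement learning loop can be illustrated in miniature; the tabular Q-learning agent and toy intersection below are invented for illustration (the paper uses deep RL with the SUMO simulator and VLC-supplied state):

```python
import random

# State: discretised queue lengths on the NS and EW axes.
# Action 0/1: give green to NS or EW.  Reward penalises total waiting.
random.seed(0)
Q = {}
ALPHA, GAMMA, EPS = 0.5, 0.9, 0.1

def choose(state):
    if random.random() < EPS:                       # epsilon-greedy exploration
        return random.choice((0, 1))
    return max((0, 1), key=lambda a: Q.get((state, a), 0.0))

def update(state, action, reward, next_state):
    best_next = max(Q.get((next_state, a), 0.0) for a in (0, 1))
    old = Q.get((state, action), 0.0)
    Q[(state, action)] = old + ALPHA * (reward + GAMMA * best_next - old)

def step(queues, action):
    ns, ew = queues
    if action == 0:
        ns, ew = max(0, ns - 2), ew + 1             # NS discharges, EW grows
    else:
        ns, ew = ns + 1, max(0, ew - 2)
    return (ns, ew), -(ns + ew)                     # reward = negative waiting

state = (3, 3)
for _ in range(500):
    a = choose(state)
    nxt, r = step(state, a)
    update(state, a, r, nxt)
    state = nxt
print(state)
```

The deep variant replaces the Q table with a network over a much richer state (per-vehicle positions and speeds from V2V/V2I messages, pedestrian requests), but the update rule has the same shape.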
Satellite imagery-based ship detection is indispensable for maritime surveillance and for monitoring naval activities. Machine learning is an effective approach that makes the process automatic and more accurate than many alternatives. Optical and synthetic aperture radar (SAR) satellite images are often employed for detecting and locating various marine activities using different methods. However, models trained on one set of images often yield large uncertainties when tested on other sets of images due to complex scene characteristics. This study proposes a novel, lightweight, computationally efficient deep learning-based general ship detection model, the Multi-Attentive General Ship Detector (MAGSD), for detecting ships in both optical and SAR satellite images. The model is trained on the SAR Ship Dataset (SDD), which contains ship instances from Gaofen-3 and Sentinel-1 SAR satellite images, and the MASATI dataset, which contains ship instances from Microsoft Bing Maps. The proposed model relies on an attention-guided convolutional neural network for extracting feature maps for detection, bridging the gap between SAR and optical image characteristics by focusing on different levels of convolutional features in the network. The model is built on a novel feature extractor with fourteen convolutional layers, six max-pool layers and six attention layers, connecting several convolutional points to focus on local features in different depth maps; this extractor serves as the backbone of the model. A comparative analysis showed the robustness of the proposed model over the state-of-the-art baseline YOLOv5s, with improvements of 8.2% in precision and 9.63% in recall. These results indicate that the proposed model can serve as an efficient tool for ship detection in either type of satellite image, contributing to enhanced coastal surveillance and global naval security.
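A generic attention gate of the kind such attention-guided networks use can be sketched in a few lines; the projection weights and tensor shapes below are illustrative, not the actual MAGSD layers:

```python
import numpy as np

# A 1x1-conv-style projection produces a sigmoid mask that re-weights
# every spatial location of a (channels, height, width) feature tensor,
# letting the network emphasise ship-like local features.
def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def spatial_attention(feat, w, b):
    """feat: (C, H, W); w: (C,) projection weights; b: scalar bias."""
    score = np.tensordot(w, feat, axes=(0, 0)) + b   # (H, W) attention logits
    mask = sigmoid(score)                            # values in (0, 1)
    return feat * mask[None, :, :]                   # broadcast over channels

rng = np.random.default_rng(0)
feat = rng.normal(size=(4, 8, 8))
out = spatial_attention(feat, w=np.ones(4), b=0.0)
print(out.shape)  # (4, 8, 8)
```

Stacking such gates at several depths of the backbone is what lets one detector attend to both speckle-dominated SAR textures and optical appearance cues.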
With the increase in hybrid meetings, innovative strategies are needed to enable both local and remote participants to interact and communicate as they would in in-person meetings. For a standing-table scenario, we present an end-to-end real-time 3D video processing workflow and demonstrator setup. The system offers, on a single display, two different perspectives of a remote participant to two local participants. It consists of a 3D capture setup and processing workflow at the remote site and a novel dual-view display at the local site. The standing-table scenario requires highly realistic, high-resolution rendering quality on the receiving side. Hence, low-resolution depth cameras and neural network-based depth enhancement are used in conjunction with 8K cameras to achieve high-quality, perspectively corrected views, allowing direct eye contact for individual users. Distinct real-time view rendering is achieved without holes, providing high-quality novel views. At the local site, a novel dual-view display presents two different perspectives of the remote participant at two different positions in space. The display is based on a lenticular lens specifically designed for our use case. The prototype setup is optimized for low-latency processing and transmission to satisfy the constraints of our video communication scenario.
Optical image processing, which capitalizes on the distinctive characteristics of light, facilitates the manipulation of visual data in real time and at high speed. This technology is instrumental in tasks such as edge enhancement, pattern recognition, and feature extraction, all of which are crucial in fields like medical imaging, surveillance, and industrial automation. In this study, we demonstrate a photonic integrated circuit (PIC) made of lithium niobate on insulator that performs matrix-vector multiplications for image classification. Surpassing an electrical bandwidth of 15 GHz, our experiment showcases the PIC's ability to perform live edge detection and video streaming. Remarkably, its energy efficiency surpasses the limit imposed by electronic systems, consuming < 10 fJ/bit per operation.
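The operation the photonic core accelerates — a matrix-vector multiplication — subsumes edge detection once the filter is written as a matrix; the 1-D Laplacian example below (stencil and signal invented for illustration) shows the principle:

```python
import numpy as np

# Each output sample is one row of a sparse matrix applied to the input:
# the row carries a [-1, 2, -1] second-difference stencil, so the product
# M @ signal is exactly a convolution, i.e. an edge-detection filter.
n = 6
M = np.zeros((n, n))
for i in range(1, n - 1):
    M[i, i - 1], M[i, i], M[i, i + 1] = -1.0, 2.0, -1.0

signal = np.array([0., 0., 0., 1., 1., 1.])   # a step edge
edges = M @ signal
print(edges)  # the +/- pair marks the edge between samples 2 and 3
```

On the PIC the same product is evaluated in the analog optical domain, one pass of light through the mesh per output vector, which is where the fJ-per-bit efficiency comes from.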
Correct segmentation of ultrasound (US) images and videos is essential, since it directly influences subsequent diagnostic applications for breast cancer. We therefore propose a lightweight U-Net with a 128 x 128 image input and 1,941,105 trainable parameters. Our architecture works with a multi-GPU strategy. Parallelizing image/video processing on GPU hardware optimizes the runtime of the procedures, reducing execution time through multithreaded processing with OpenMP and CUDA. The designed architectures were implemented in a parallel programming model and executed on NVIDIA GeForce RTX 3090 hardware (10,496 CUDA cores) in a multi-GPU configuration. The proposed parallel implementation is tested on a workstation with a CUDA-enabled GPU and compared with the non-parallel variant.
This study presents an ablation study of the designed segmentation approach on the video US database (VBUS) of breast cancer lesions (113 malignant and 75 benign), in which the images/videos are segmented in real time.
The designed system was first applied to the BUSI database, since it contains ground-truth (GT) references, achieving a segmentation accuracy of 97.43% and a mean Intersection over Union (IoU) of 95.31%. For the VBUS (video) database containing breast lesions, the segmentation process generates a video in MPEG format in which all lesions are marked. The videos from the VBUS database were segmented to evaluate real-time segmentation, and the inference time of the segmentation was computed.
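The reported accuracy and IoU figures can be reproduced from binary masks as follows; `pred` and `gt` below are synthetic stand-ins for a predicted lesion mask and its ground-truth reference:

```python
import numpy as np

def seg_metrics(pred, gt):
    """Pixel accuracy and Intersection over Union for binary masks."""
    pred, gt = pred.astype(bool), gt.astype(bool)
    inter = np.logical_and(pred, gt).sum()
    union = np.logical_or(pred, gt).sum()
    iou = inter / union if union else 1.0
    acc = (pred == gt).mean()
    return acc, iou

gt   = np.zeros((8, 8), int); gt[2:6, 2:6] = 1     # 16-pixel "lesion"
pred = np.zeros((8, 8), int); pred[2:6, 3:7] = 1   # prediction shifted by one column
acc, iou = seg_metrics(pred, gt)
print(round(acc, 4), round(iou, 4))  # 0.875 0.6
```

Note how IoU punishes the one-column shift far harder than pixel accuracy does, which is why both numbers are reported together.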
Childhood leukaemia demands meticulous blood cell analysis for diagnosis, focusing on morphological irregularities such as asymmetry and abnormal cell counts. Traditional manual diagnosis from microscopic blood smear images suffers from reduced reliability, time intensiveness, and observer variability. Computer-aided diagnostic (CAD) systems address these challenges. Integrating real-time image pre-processing and segmentation ensures swift operation, reducing CAD system processing time; this enhances overall effectiveness, enabling timely medical intervention and better patient outcomes. This study aims to reduce the algorithmic complexity of the pre-processing steps, including bilateral filtering and Contrast-Limited Adaptive Histogram Equalization (CLAHE), and of the segmentation stage, which involves morphological operations and the watershed algorithm. This work proposes a parallel implementation using OpenMP and CUDA and evaluates its performance using accuracy and Intersection over Union (IoU) metrics, along with computing time and algorithmic complexity. It highlights the benefits of parallel processing in enhancing efficiency and accuracy in blood cell analysis.
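The contrast-limited idea behind CLAHE can be sketched in its simplest, global form (the real algorithm operates on tiles with interpolation between tile histograms): clip the histogram at a limit, redistribute the excess, then equalize:

```python
import numpy as np

def clhe(img, clip_limit=4):
    """Global contrast-limited histogram equalization (CLAHE without tiles)."""
    hist, _ = np.histogram(img, bins=256, range=(0, 256))
    excess = np.maximum(hist - clip_limit, 0).sum()
    hist = np.minimum(hist, clip_limit) + excess // 256  # clip + redistribute
    cdf = np.cumsum(hist)
    lut = np.round(255 * cdf / cdf[-1]).astype(np.uint8)  # equalization LUT
    return lut[img]

# A mid-grey ramp: plain equalization would stretch it to the full range.
img = np.tile(np.arange(64, 192, 2, dtype=np.uint8), (8, 1))
out = clhe(img, clip_limit=4)
print(img.min(), img.max(), "->", out.min(), out.max())  # 64 190 -> 34 223
```

With a loose limit the same call reduces to ordinary histogram equalization (full 4-255 stretch here); the clip is what keeps noise in near-flat smear regions from being amplified.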
To address these issues, this paper introduces a real-time on-board satellite cloud cover detection system based on a lightweight neural network. By discarding excessively cloudy images, the proposed approach can improve the efficiency and accuracy of satellite image-based systems. At the same time, it minimizes the data to be transmitted to the ground, mitigating bandwidth problems and reducing transmission power. The proposed CNN has a compact architecture, requiring fewer than 9 thousand parameters, while maintaining a detection accuracy of 89% when evaluated on the Landsat 8 dataset. An optimized hardware accelerator is designed to meet on-board nanosatellite constraints. Post-implementation simulations on a Xilinx Artix-7 FPGA demonstrate state-of-the-art results, with about 12 thousand mapped LUTs and 7 thousand FFs and a power consumption of 116 mW.
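The parameter budget of such a compact CNN is easy to audit; the layer shapes below are invented for illustration (not the paper's actual network) but land in the same sub-9k regime:

```python
# Back-of-the-envelope parameter count for a compact cloud-detection CNN.
def conv_params(c_in, c_out, k):
    return c_out * (c_in * k * k + 1)   # weights + one bias per filter

layers = [
    conv_params(3, 8, 3),     # 3 -> 8 channels, 3x3 kernels
    conv_params(8, 16, 3),    # 8 -> 16 channels, 3x3 kernels
    conv_params(16, 16, 3),   # 16 -> 16 channels, 3x3 kernels
    16 * 2 + 2,               # global-pool + 2-way dense head (cloudy / clear)
]
total = sum(layers)
print(total, total < 9000)  # 3746 True
```

Keeping the channel counts this low is also what makes the FPGA accelerator fit in roughly 12k LUTs at 116 mW.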
The present work aims to improve on the existing solutions for inverting the discrete Radon transform (DRT) by using less data, reducing computational cost, and ensuring well-conditioned and stable algorithms for the inversion.
An analytical framework and a heuristic for finding possible inverse algorithms are proposed. The study suggests an approach for finding a fast algorithm with a complexity of O(N² log₂ N) by analyzing operation trees for consecutive input sizes.
The study also discusses the impact of noise on the proposed solutions, showing that the proposed algorithms lead to a better approximation than one iteration of Press’ inversion for added random error up to 40% of the signal’s magnitude. However, restricting the number of quadrants used in the algorithm leads to increased error.
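For a concrete picture of the projection data such an inversion consumes, the snippet below computes row, column and diagonal sums of a tiny image — a minimal stand-in for the four DRT quadrants — and checks the mass-consistency property that any stable inversion can exploit:

```python
import numpy as np

img = np.array([[1, 2], [3, 4]])
rows = img.sum(axis=1)                            # [3, 7]
cols = img.sum(axis=0)                            # [4, 6]
diag = [np.trace(img, k) for k in range(-1, 2)]   # [3, 5, 2]
anti = [np.trace(img[:, ::-1], k) for k in range(-1, 2)]
# Every projection direction sums to the same total image mass -- a basic
# consistency constraint linking the quadrants of the transform.
print(rows.sum(), cols.sum(), sum(diag), sum(anti))  # 10 10 10 10
```

Restricting the inversion to fewer quadrants discards some of these constraints, which is consistent with the increased error reported above.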
Stroke is a devastating and life-threatening medical condition that demands immediate intervention. Timely diagnosis and treatment are paramount in reducing mortality and mitigating the long-term disabilities associated with stroke. This research addresses these critical needs by proposing a real-time stroke detection system based on Deep Learning (DL) with the incorporation of Federated Learning (FL), which offers improved accuracy and privacy preservation. The purpose of this research is to develop an efficient and accurate model capable of distinguishing between stroke and non-stroke cases in real time, assisting healthcare professionals in making rapid and informed decisions. Stroke detection has traditionally relied on manual interpretation of medical images, which is time-consuming and prone to human error. DL techniques have shown significant promise in automating this process, but the need for large and diverse datasets, as well as privacy concerns, remain challenges. To achieve this goal, our methodology involves training the DL model on extensive datasets containing both stroke and non-stroke medical images, enabling the model to learn the complex patterns and features associated with stroke and thereby improving its diagnostic accuracy. Furthermore, we employ Federated Learning, a decentralized training approach, to enhance privacy while maintaining model performance: the model learns from data distributed across multiple healthcare institutions without sharing sensitive patient information. The proposed approach has been executed on NVIDIA platforms, taking advantage of their advanced GPU capabilities to enable real-time processing and analysis. This optimized model has the potential to transform stroke diagnosis and patient care, ultimately saving lives and improving the quality of healthcare services in neurology.
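The aggregation step of such a federated setup can be sketched with the canonical FedAvg rule (one possible instantiation; the site names and weight values below are illustrative): each institution trains locally, and only model weights — never patient images — are averaged:

```python
import numpy as np

def fed_avg(client_weights, client_sizes):
    """Average flattened model weights, weighted by local dataset size."""
    sizes = np.asarray(client_sizes, dtype=float)
    stacked = np.stack(client_weights)               # (n_clients, n_params)
    return (stacked * (sizes / sizes.sum())[:, None]).sum(axis=0)

w_a = np.array([0.0, 2.0])   # hospital A's weights after a local round
w_b = np.array([4.0, 6.0])   # hospital B's weights after a local round
print(fed_avg([w_a, w_b], client_sizes=[100, 300]))  # [3. 5.]
```

The server repeats this round after every local training phase; weighting by sample count keeps a small site from dominating the global stroke model.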
Background: The evolution of AI applications in dental imaging, covering caries detection, anatomical structure segmentation, and pathology identification, highlights the importance of high-quality datasets for effective detection models. This paper focuses on optimizing dataset quality for real-time AI-based dental bitewing radiograph detection.
Methods: We systematically analyze preprocessing methods suitable for dental bitewing radiographs, covering image enhancement, noise reduction, and contrast adjustment. These techniques are strategically chosen to address common challenges in dental radiograph images, including variations in lighting, contrast disparities, and noise fluctuations. We employ optimized algorithms to meet real-time constraints, ensuring efficient model training and inference.
Results: Our study assesses the impact of each preprocessing step on dataset quality and its influence on AI model performance. Practical recommendations are provided to empower researchers and practitioners in creating datasets optimized for dental bitewing radiograph detection tasks, aiming to improve AI model accuracy while adhering to real-time requirements. In addition, a comparative analysis is conducted, evaluating datasets enhanced with conventional methods on a ResNet18 model for the segmentation of bitewing dental images.
Conclusion: This paper serves as a valuable guide for the dental imaging community, offering insights into preprocessing steps that elevate dataset quality for AI-driven dental bitewing radiograph detection. By emphasizing the relevance of real-time performance and providing a comparison with conventional enhancements on the ResNet18 model, we contribute to advancing early diagnosis and enhancing oral healthcare outcomes.
The article proposes an approach to developing computationally simple and fast algorithms for data preprocessing and the selection of stable features. The following algorithms are used: 1. A modified method of multicriteria processing in local windows, based on minimizing an objective function, which both reduces the noise component in locally stationary areas and preserves and strengthens transition boundaries. 2. A cluster-scope reduction method, which changes the number of color histograms by absorbing nearby areas while preserving objects. 3. A non-local color-balance adjustment method, which selects areas on a dark/light background when the color balance is shifted. 4. An edge detector based on the analysis of local areas across different data layers.
Effectiveness was tested on a set of images captured by a flip-chip machine, images from a microcircuit analyzer, and data from a production line. The analyzed frames had low resolution and poor lighting; all images were captured in the RGB color space.
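For flavor, a generic edge detector of the gradient-magnitude family can be sketched as below; this is a plain Sobel stand-in, not the authors' local-area multi-layer method, and the threshold value is an assumption.

```python
import numpy as np

def sobel_edges(img, thresh=80):
    """Generic gradient-magnitude edge detector (Sobel kernels);
    an illustrative stand-in, not the paper's local-area method."""
    kx = np.array([[-1, 0, 1], [-2, 0, 2], [-1, 0, 1]], dtype=np.float64)
    ky = kx.T
    h, w = img.shape
    pad = np.pad(img.astype(np.float64), 1, mode="edge")
    gx = np.zeros((h, w)); gy = np.zeros((h, w))
    for dy in range(3):           # accumulate the 3x3 convolution
        for dx in range(3):
            win = pad[dy:dy + h, dx:dx + w]
            gx += kx[dy, dx] * win
            gy += ky[dy, dx] * win
    mag = np.hypot(gx, gy)        # gradient magnitude
    return (mag > thresh).astype(np.uint8)

# Synthetic test frame: a bright square on a dark background
img = np.zeros((32, 32), dtype=np.uint8)
img[8:24, 8:24] = 200

edges = sobel_edges(img)
print(int(edges.sum()))  # nonzero pixels trace the square's boundary
```

On low-resolution, poorly lit frames like those described, such a detector would typically be preceded by the denoising and color-balance steps listed above.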
Augmented reality is a visualization technology that displays information by adding virtual images to the real world. Effective implementation of augmented reality requires recognition of the current scene, and identifying objects in real-time video on computationally limited hardware requires significant effort. One way to solve this problem is to create a hybrid system that, based on machine learning and computer vision technology, processes and analyzes visual data to identify and classify real-world objects. The proposed architecture is built on the Vuforia augmented reality system, which provides good performance by balancing prediction accuracy and efficiency. First, the Vuforia neural network architecture allows convenient interaction with AR in Unity and provides initial conditions for detecting 3D objects. The augmented reality construction algorithm is based on the ARCore framework and the OpenGL ES interface for embedded systems. The system integrates recognition data with an AR platform to display corresponding 3D models, allowing users to interact with them through the functionality of the AR application. The method also involves the development of an enhanced user interface for AR, making the augmented environment easier to navigate and control. Experimental research has shown that the proposed method significantly improves the accuracy of object recognition and the ease of working with 3D models in AR.
Approximately 15% of the world's population faces some form of disability, with 2-4% experiencing significant challenges in using their hands and legs to meet their daily needs. This global estimate for disabilities is rising, primarily due to an aging population and the increasing prevalence of chronic diseases. Nonetheless, individuals with disabilities can still contribute as self-reliant members of society. In this paper, we present a system designed to empower people with disabilities by enabling them to independently perform daily tasks by precisely controlling their home devices using only their eye movements. The system comprises an infrared (IR) camera and a Raspberry Pi, which processes live video captured by the IR camera and performs eye-tracking tasks using the OpenCV library for Python. A microcontroller (Arduino) is linked to the home devices, enabling them to be controlled based on commands received from the Raspberry Pi.
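The Pi-to-Arduino control loop described above can be sketched as a mapping from a detected pupil position to a device command; the 3x3 gaze-region layout and the command bytes here are illustrative assumptions, and the actual OpenCV pupil detection and serial write are stubbed out in comments.

```python
def gaze_to_command(cx, cy, width, height):
    """Map a detected pupil centre (cx, cy) in the camera frame to a
    short device command; the 3x3 region layout and command codes are
    illustrative assumptions, not the paper's exact mapping."""
    col = min(2, cx * 3 // width)    # left / centre / right third
    row = min(2, cy * 3 // height)   # top / middle / bottom third
    commands = {
        (0, 0): b"L1",  # e.g. lamp on
        (0, 2): b"L0",  # lamp off
        (2, 0): b"F1",  # fan on
        (2, 2): b"F0",  # fan off
        (1, 1): b"--",  # neutral gaze: no action
    }
    return commands.get((row, col), b"--")

# In the real system the Raspberry Pi would detect the pupil with
# OpenCV and write the command to the Arduino over a serial port, e.g.:
#   ser = serial.Serial("/dev/ttyACM0", 9600); ser.write(cmd)
print(gaze_to_command(10, 10, 640, 480))    # top-left gaze -> b'L1'
print(gaze_to_command(320, 240, 640, 480))  # centred gaze  -> b'--'
```

Keeping the gaze-to-command mapping separate from the eye tracker makes it easy to retune regions or commands per user without touching the vision code.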