Project state
closed
Project start
April 2024
Funding duration
15 months
Universities involved
ZHAW
The projectteam developed a hardware and software platform to enable real-time smart 3D vision for assistive robots.
A novel hardware system was developed that integrates a pair of event-based vision sensors (DVS) and a pair of conventional RGB cameras to an FPGA. Simple sensor fusion is performed on the FPGA, streaming both event- and image-data for further processing. This system enables validation of algorithms combining fast event-based processing (e.g., real-time object tracking) with conventional image processing (e.g., image recognition and 3D vision). Conventional images benefit from larger datasets and pre-trained artificial neural networks, improving detection, recognition, and segmentation quality. However, detection speed remains a limitation for use in robot motion control loops.
Several algorithms were validated to enable real-time smart vision with 3D assessment for safe and efficient robot operation around humans:
a) Event-frames accumulated via the FPGA pipeline were made similar to grayscale images, allowing direct application of standard ANNs (YOLO, MediaPipe) trained on conventional images. Good results were obtained with high detection rates controlled through event-frame accumulation time.
b) Datasets were collected using calibrated and synchronized DVS-image sensor pairs, using conventional image detections as labels for DVS-data. These datasets enabled training of various neural networks (including spiking neural networks) with different event-based representations (e.g., time surfaces).
c) Different depth-perception algorithms were tested, including DVS-based stereo, RGB-based stereo, and time-of-flight sensors. Each method has distinct limitations (accuracy, range, surface properties, texture dependence). Safe robot operation requires complementary depth-perception streams.
Demonstration and Testing Multiple demonstrations showed system performance in closed motion control loops of robot arms, including obstacle avoidance and target acquisition. The technology was tested with elderly residents at a care facility (Haus Tabea) and at public events.
The YuRo vision systems were demonstrated in multiple events:
- June-August 2025, EXPO2025 in Osaka: the camera was the key element in a interactive demonstrator, in which visitors of the Swiss Pavilion suing the World Expo could take over control of the robot arm with their hand. The hand was detected and its motion estimated in real-time using the developed YuRo camera. The detections were used in the control loop of the robot arm.
- 3 September 2024 Booth at Digital Health Lab Day 2024, Winterthur: the vision system was combined with a light-weight robot arm from UniTree. In a demo visitors could control the end-effector of the arm with their hand and build a tower with such “remote control”
- 1 November 2024 Booth at Swiss Robotics Day 2024, Bern: same demo
- 26 Mai 2025 Booth at Life Science Zurich Innovation Conference 2025, Zurich: same demo
- Multiple other events, such as Open-I, BSc events at ZHAW, AI+X summit 2024 and 2025
- As the follow-up of the project, two new demos were prepared for the Swiss Robotics Day 2025 in Lausanne: combining hand-detection based control of the robot arm with obstacle avoidance, and high-speed visual control, demonstrated with reactive robot arm that we developed and built
Next Steps Applications for follow-up projects have been submitted. A spin-off draft has been signed with ZHAW to commercialize the developed technology. The spin-off has secured space at Büro Zürich Innovation Park in Dübendorf for one year.
Prof. Dr. Yulia Sandamirskaya, ZHAW School of Life Sciences and Facility Management
Call type: 1st Founder Call