Efficient Encoding and Decoding of Voxelized Models for Machine Learning-Based Applications

Point clouds have become a popular training data for many practical applications of machine learning in the fields of environmental modeling and precision agriculture. In order to reduce high space requirements and the effect of noise in the data, point clouds are often transformed to a structured r...

Full description

Saved in:
Bibliographic Details
Main Authors: Damjan Strnad, Stefan Kohek, Borut Zalik, Libor Vasa, Andrej Nerat
Format: Article
Language:English
Published: IEEE 2025-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10829598/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1841550795199217664
author Damjan Strnad
Stefan Kohek
Borut Zalik
Libor Vasa
Andrej Nerat
author_facet Damjan Strnad
Stefan Kohek
Borut Zalik
Libor Vasa
Andrej Nerat
author_sort Damjan Strnad
collection DOAJ
description Point clouds have become a popular training data for many practical applications of machine learning in the fields of environmental modeling and precision agriculture. In order to reduce high space requirements and the effect of noise in the data, point clouds are often transformed to a structured representation such as a voxel grid. Storing, transmitting and consuming voxelized geometry, however, remains a challenging problem for machine learning pipelines running on devices with limited amount of on-chip memory with low access latency. A viable solution is to store the data in a compact encoded format, and perform on-the-fly decoding when it is needed for processing. Such on-demand expansion must be fast in order to avoid introducing substantial additional delay to the pipeline. This can be achieved by parallel decoding, which is particularly suitable for massively parallel architecture of GPUs on which the majority of machine learning is currently executed. In this paper, we present such method for efficient and parallelizable encoding/decoding of voxelized geometry. The method employs multi-level context-aware prediction of voxel occupancy based on the extracted binary feature prediction table, and encodes the residual grid with a pointerless sparse voxel octree (PSVO). We particularly focused on encoding the datasets of voxelized trees, obtained from both synthetic tree models and LiDAR point clouds of real trees. The method achieved 15.6% and 12.8% reduction of storage size with respect to plain PSVO on synthetic and real dataset, respectively. We also tested the method on a general set of diverse voxelized objects, where an average 11% improvement of storage space was achieved.
format Article
id doaj-art-be0f1af5c9a04696b52f9e26d2c6962a
institution Kabale University
issn 2169-3536
language English
publishDate 2025-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj-art-be0f1af5c9a04696b52f9e26d2c6962a2025-01-10T00:01:45ZengIEEEIEEE Access2169-35362025-01-01135551556110.1109/ACCESS.2025.352620210829598Efficient Encoding and Decoding of Voxelized Models for Machine Learning-Based ApplicationsDamjan Strnad0https://orcid.org/0000-0003-4468-0290Stefan Kohek1https://orcid.org/0000-0002-6210-0889Borut Zalik2https://orcid.org/0000-0003-4372-5020Libor Vasa3https://orcid.org/0000-0002-0213-3769Andrej Nerat4https://orcid.org/0000-0003-1559-9776Faculty of Electrical Engineering and Computer Science, University of Maribor, Maribor, SloveniaFaculty of Electrical Engineering and Computer Science, University of Maribor, Maribor, SloveniaFaculty of Electrical Engineering and Computer Science, University of Maribor, Maribor, SloveniaFaculty of Applied Sciences, University of West Bohemia, Pilsen, Czech RepublicFaculty of Electrical Engineering and Computer Science, University of Maribor, Maribor, SloveniaPoint clouds have become a popular training data for many practical applications of machine learning in the fields of environmental modeling and precision agriculture. In order to reduce high space requirements and the effect of noise in the data, point clouds are often transformed to a structured representation such as a voxel grid. Storing, transmitting and consuming voxelized geometry, however, remains a challenging problem for machine learning pipelines running on devices with limited amount of on-chip memory with low access latency. A viable solution is to store the data in a compact encoded format, and perform on-the-fly decoding when it is needed for processing. Such on-demand expansion must be fast in order to avoid introducing substantial additional delay to the pipeline. This can be achieved by parallel decoding, which is particularly suitable for massively parallel architecture of GPUs on which the majority of machine learning is currently executed. In this paper, we present such method for efficient and parallelizable encoding/decoding of voxelized geometry. The method employs multi-level context-aware prediction of voxel occupancy based on the extracted binary feature prediction table, and encodes the residual grid with a pointerless sparse voxel octree (PSVO). We particularly focused on encoding the datasets of voxelized trees, obtained from both synthetic tree models and LiDAR point clouds of real trees. The method achieved 15.6% and 12.8% reduction of storage size with respect to plain PSVO on synthetic and real dataset, respectively. We also tested the method on a general set of diverse voxelized objects, where an average 11% improvement of storage space was achieved.https://ieeexplore.ieee.org/document/10829598/Voxel gridfeature predictiontree modelsprediction-based encodingkey voxelsresiduals
spellingShingle Damjan Strnad
Stefan Kohek
Borut Zalik
Libor Vasa
Andrej Nerat
Efficient Encoding and Decoding of Voxelized Models for Machine Learning-Based Applications
IEEE Access
Voxel grid
feature prediction
tree models
prediction-based encoding
key voxels
residuals
title Efficient Encoding and Decoding of Voxelized Models for Machine Learning-Based Applications
title_full Efficient Encoding and Decoding of Voxelized Models for Machine Learning-Based Applications
title_fullStr Efficient Encoding and Decoding of Voxelized Models for Machine Learning-Based Applications
title_full_unstemmed Efficient Encoding and Decoding of Voxelized Models for Machine Learning-Based Applications
title_short Efficient Encoding and Decoding of Voxelized Models for Machine Learning-Based Applications
title_sort efficient encoding and decoding of voxelized models for machine learning based applications
topic Voxel grid
feature prediction
tree models
prediction-based encoding
key voxels
residuals
url https://ieeexplore.ieee.org/document/10829598/
work_keys_str_mv AT damjanstrnad efficientencodinganddecodingofvoxelizedmodelsformachinelearningbasedapplications
AT stefankohek efficientencodinganddecodingofvoxelizedmodelsformachinelearningbasedapplications
AT borutzalik efficientencodinganddecodingofvoxelizedmodelsformachinelearningbasedapplications
AT liborvasa efficientencodinganddecodingofvoxelizedmodelsformachinelearningbasedapplications
AT andrejnerat efficientencodinganddecodingofvoxelizedmodelsformachinelearningbasedapplications