Analysis of archive formats and program solutions for compression of text files

The subject of study in this article and research is widely used archive formats for file compression, features of their implementation, text compression rates, and the time required for existing programs for different platforms. The goal of the work is to simplify the process of choosing an archive...

Full description

Saved in:
Bibliographic Details
Main Authors: Artem Perepelitsyn, Alona Chepelevych, Andrii Litvinov
Format: Article
Language:English
Published: National Aerospace University «Kharkiv Aviation Institute» 2024-11-01
Series:Авіаційно-космічна техніка та технологія
Subjects:
Online Access:http://nti.khai.edu/ojs/index.php/aktt/article/view/2691
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1841550147456073728
author Artem Perepelitsyn
Alona Chepelevych
Andrii Litvinov
author_facet Artem Perepelitsyn
Alona Chepelevych
Andrii Litvinov
author_sort Artem Perepelitsyn
collection DOAJ
description The subject of study in this article and research is widely used archive formats for file compression, features of their implementation, text compression rates, and the time required for existing programs for different platforms. The goal of the work is to simplify the process of choosing an archive format and program solutions for working with it for compressing text files, with taking into account time requirements, compression ratio, and opensource. The task is to perform an analysis of existing technologies and tools involved in the data archiving process, to analyze archive formats that are widely used, to perform an analysis of important features of archive formats, to perform an experimental study of compression parameters for a specific set of formats and software tools, to propose steps for integrating archives into the project of system with ensuring the compatibility. According to the tasks, the following results were obtained. The application of data compression in information storage systems is discussed. The available formats for archiving in Ubuntu are considered. The detailed analysis of widely used archive formats is performed. The features of the zip and rar formats for working with large files are analyzed. An experimental study of compression parameters for large-sized reference text files using ten combinations based on seven formats and four software tools is performed. Compression parameters of the text with use of the same archive formats using different software tools are investigated. Recommendations for integrating archives into the project of system with ensuring the compatibility are proposed. The use of zpaq for the compression of text information is proposed. Conclusions. The scientific novelty of the obtained results is in the fact that the analysis and experimental study of existing archive formats allows simplifying the process of making a decision on using the required archive format based on the requirements for archiving time, compression ratio, and the possibility of using software implementation for a specific platform. The obtained research results allow to propose the use of the open source archive format zpaq for compressing text or a set of project versions and documentation to achieve a compression ratio that is twice better than for rar format, and two percent better than for 7z and txz formats.
format Article
id doaj-art-e79acf4eda154085bb47ffac3f1d8a70
institution Kabale University
issn 1727-7337
2663-2217
language English
publishDate 2024-11-01
publisher National Aerospace University «Kharkiv Aviation Institute»
record_format Article
series Авіаційно-космічна техніка та технологія
spelling doaj-art-e79acf4eda154085bb47ffac3f1d8a702025-01-10T07:31:31ZengNational Aerospace University «Kharkiv Aviation Institute»Авіаційно-космічна техніка та технологія1727-73372663-22172024-11-010610311110.32620/aktt.2024.6.102395Analysis of archive formats and program solutions for compression of text filesArtem Perepelitsyn0Alona Chepelevych1Andrii Litvinov2National Aerospace University «Kharkiv Aviation Institute», KharkivNational Aerospace University «Kharkiv Aviation Institute», KharkivNational Aerospace University «Kharkiv Aviation Institute», KharkivThe subject of study in this article and research is widely used archive formats for file compression, features of their implementation, text compression rates, and the time required for existing programs for different platforms. The goal of the work is to simplify the process of choosing an archive format and program solutions for working with it for compressing text files, with taking into account time requirements, compression ratio, and opensource. The task is to perform an analysis of existing technologies and tools involved in the data archiving process, to analyze archive formats that are widely used, to perform an analysis of important features of archive formats, to perform an experimental study of compression parameters for a specific set of formats and software tools, to propose steps for integrating archives into the project of system with ensuring the compatibility. According to the tasks, the following results were obtained. The application of data compression in information storage systems is discussed. The available formats for archiving in Ubuntu are considered. The detailed analysis of widely used archive formats is performed. The features of the zip and rar formats for working with large files are analyzed. An experimental study of compression parameters for large-sized reference text files using ten combinations based on seven formats and four software tools is performed. Compression parameters of the text with use of the same archive formats using different software tools are investigated. Recommendations for integrating archives into the project of system with ensuring the compatibility are proposed. The use of zpaq for the compression of text information is proposed. Conclusions. The scientific novelty of the obtained results is in the fact that the analysis and experimental study of existing archive formats allows simplifying the process of making a decision on using the required archive format based on the requirements for archiving time, compression ratio, and the possibility of using software implementation for a specific platform. The obtained research results allow to propose the use of the open source archive format zpaq for compressing text or a set of project versions and documentation to achieve a compression ratio that is twice better than for rar format, and two percent better than for 7z and txz formats.http://nti.khai.edu/ojs/index.php/aktt/article/view/2691формати архівівrarzip7ztar.gztar.xzzpaqархівування данихстиснення тексту
spellingShingle Artem Perepelitsyn
Alona Chepelevych
Andrii Litvinov
Analysis of archive formats and program solutions for compression of text files
Авіаційно-космічна техніка та технологія
формати архівів
rar
zip
7z
tar.gz
tar.xz
zpaq
архівування даних
стиснення тексту
title Analysis of archive formats and program solutions for compression of text files
title_full Analysis of archive formats and program solutions for compression of text files
title_fullStr Analysis of archive formats and program solutions for compression of text files
title_full_unstemmed Analysis of archive formats and program solutions for compression of text files
title_short Analysis of archive formats and program solutions for compression of text files
title_sort analysis of archive formats and program solutions for compression of text files
topic формати архівів
rar
zip
7z
tar.gz
tar.xz
zpaq
архівування даних
стиснення тексту
url http://nti.khai.edu/ojs/index.php/aktt/article/view/2691
work_keys_str_mv AT artemperepelitsyn analysisofarchiveformatsandprogramsolutionsforcompressionoftextfiles
AT alonachepelevych analysisofarchiveformatsandprogramsolutionsforcompressionoftextfiles
AT andriilitvinov analysisofarchiveformatsandprogramsolutionsforcompressionoftextfiles