Analysis of archive formats and program solutions for compression of text files
The subject of study in this article and research is widely used archive formats for file compression, features of their implementation, text compression rates, and the time required for existing programs for different platforms. The goal of the work is to simplify the process of choosing an archive...
Saved in:
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
National Aerospace University «Kharkiv Aviation Institute»
2024-11-01
|
Series: | Авіаційно-космічна техніка та технологія |
Subjects: | |
Online Access: | http://nti.khai.edu/ojs/index.php/aktt/article/view/2691 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | The subject of study in this article and research is widely used archive formats for file compression, features of their implementation, text compression rates, and the time required for existing programs for different platforms. The goal of the work is to simplify the process of choosing an archive format and program solutions for working with it for compressing text files, with taking into account time requirements, compression ratio, and opensource. The task is to perform an analysis of existing technologies and tools involved in the data archiving process, to analyze archive formats that are widely used, to perform an analysis of important features of archive formats, to perform an experimental study of compression parameters for a specific set of formats and software tools, to propose steps for integrating archives into the project of system with ensuring the compatibility. According to the tasks, the following results were obtained. The application of data compression in information storage systems is discussed. The available formats for archiving in Ubuntu are considered. The detailed analysis of widely used archive formats is performed. The features of the zip and rar formats for working with large files are analyzed. An experimental study of compression parameters for large-sized reference text files using ten combinations based on seven formats and four software tools is performed. Compression parameters of the text with use of the same archive formats using different software tools are investigated. Recommendations for integrating archives into the project of system with ensuring the compatibility are proposed. The use of zpaq for the compression of text information is proposed. Conclusions. The scientific novelty of the obtained results is in the fact that the analysis and experimental study of existing archive formats allows simplifying the process of making a decision on using the required archive format based on the requirements for archiving time, compression ratio, and the possibility of using software implementation for a specific platform. The obtained research results allow to propose the use of the open source archive format zpaq for compressing text or a set of project versions and documentation to achieve a compression ratio that is twice better than for rar format, and two percent better than for 7z and txz formats. |
---|---|
ISSN: | 1727-7337 2663-2217 |