Performance Comparison of Java Compression Algorithms

Original post · 2016/12/13 21:35

Preface
In game development, a player typically receives an initialization packet when entering the game. This packet is relatively large, usually around 30-40 KB, so it is worth compressing before it is sent. I looked into this a while ago and collected some common compression formats, shown in the following figure:

Whether a format is splittable indicates whether you can seek to an arbitrary position in the data stream and continue reading from there. This property is particularly useful in Hadoop's MapReduce.
Below is a brief introduction to each of these compression formats, followed by a benchmark comparing their performance.

DEFLATE
DEFLATE is a lossless data compression algorithm that combines the LZ77 algorithm with Huffman coding. A reference implementation of DEFLATE compression and decompression is available in the free, general-purpose compression library zlib (official site: http://www.zlib.net/).
The JDK supports zlib compression out of the box: the compression class Deflater and the decompression class Inflater delegate to native methods

 private native int deflateBytes(long addr, byte[] b, int off, int len, int flush);
 private native int inflateBytes(long addr, byte[] b, int off, int len) throws DataFormatException;

You can directly use the compression class Deflater and decompression class Inflater provided by jdk. The code is as follows:

 public static byte[] compress(byte[] input) {
     ByteArrayOutputStream bos = new ByteArrayOutputStream();
     Deflater compressor = new Deflater(1);
     try {
         compressor.setInput(input);
         compressor.finish();
         final byte[] buf = new byte[2048];
         while (!compressor.finished()) {
             int count = compressor.deflate(buf);
             bos.write(buf, 0, count);
         }
     } finally {
         compressor.end();
     }
     return bos.toByteArray();
 }

 public static byte[] uncompress(byte[] input) throws DataFormatException {
     ByteArrayOutputStream bos = new ByteArrayOutputStream();
     Inflater decompressor = new Inflater();
     try {
         decompressor.setInput(input);
         final byte[] buf = new byte[2048];
         while (!decompressor.finished()) {
             int count = decompressor.inflate(buf);
             bos.write(buf, 0, count);
         }
     } finally {
         decompressor.end();
     }
     return bos.toByteArray();
 }

You can specify the algorithm's compression level, which lets you trade compression time against output size. Valid levels are 0 (no compression) and 1 (fastest) through 9 (best compression). Here speed takes precedence, so level 1 is used.
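To illustrate the trade-off, the sketch below (a hypothetical `DeflateLevelDemo` class, JDK-only) compresses the same repetitive buffer at level 1 (`Deflater.BEST_SPEED`) and level 9 (`Deflater.BEST_COMPRESSION`); on compressible data, level 9 typically yields a smaller output at a higher CPU cost.

```java
import java.io.ByteArrayOutputStream;
import java.util.zip.Deflater;

public class DeflateLevelDemo {

    // Same compression pattern as the utility above, but with a configurable level.
    static byte[] compress(byte[] input, int level) {
        Deflater compressor = new Deflater(level);
        ByteArrayOutputStream bos = new ByteArrayOutputStream();
        try {
            compressor.setInput(input);
            compressor.finish();
            byte[] buf = new byte[2048];
            while (!compressor.finished()) {
                int count = compressor.deflate(buf);
                bos.write(buf, 0, count);
            }
        } finally {
            compressor.end();
        }
        return bos.toByteArray();
    }

    public static void main(String[] args) {
        // Repetitive sample data stands in for a player packet.
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < 2000; i++) {
            sb.append("player:").append(i % 7).append(";");
        }
        byte[] data = sb.toString().getBytes();

        byte[] fast = compress(data, Deflater.BEST_SPEED);       // level 1
        byte[] best = compress(data, Deflater.BEST_COMPRESSION); // level 9
        System.out.println("input: " + data.length + " bytes, level 1: "
                + fast.length + " bytes, level 9: " + best.length + " bytes");
    }
}
```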

gzip
gzip also uses the deflate algorithm internally, but adds a header and trailer around the deflate output. The JDK likewise supports gzip through the GZIPOutputStream and GZIPInputStream classes. Note that GZIPOutputStream extends DeflaterOutputStream and GZIPInputStream extends InflaterInputStream; the writeHeader and writeTrailer methods can be found in the source code:

 private void writeHeader() throws IOException {
     // ...
 }

 private void writeTrailer(byte[] buf, int offset) throws IOException {
     // ...
 }

The specific code is as follows:

 public static byte[] compress(byte[] srcBytes) {
     ByteArrayOutputStream out = new ByteArrayOutputStream();
     try {
         GZIPOutputStream gzip = new GZIPOutputStream(out);
         gzip.write(srcBytes);
         gzip.close();
     } catch (IOException e) {
         e.printStackTrace();
     }
     return out.toByteArray();
 }

 public static byte[] uncompress(byte[] bytes) {
     ByteArrayOutputStream out = new ByteArrayOutputStream();
     ByteArrayInputStream in = new ByteArrayInputStream(bytes);
     try {
         GZIPInputStream ungzip = new GZIPInputStream(in);
         byte[] buffer = new byte[2048];
         int n;
         while ((n = ungzip.read(buffer)) >= 0) {
             out.write(buffer, 0, n);
         }
     } catch (IOException e) {
         e.printStackTrace();
     }
     return out.toByteArray();
 }
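Since gzip only wraps deflate output, the added header is easy to observe: per RFC 1952, every gzip stream begins with the magic bytes 0x1f 0x8b, followed by the compression method byte 8 (deflate). A quick JDK-only check (the class name `GzipHeaderCheck` is made up for illustration):

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.util.zip.GZIPOutputStream;

public class GzipHeaderCheck {
    public static void main(String[] args) throws IOException {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        GZIPOutputStream gzip = new GZIPOutputStream(out);
        gzip.write("hello gzip".getBytes());
        gzip.close();
        byte[] bytes = out.toByteArray();

        // RFC 1952: ID1 = 0x1f, ID2 = 0x8b, CM = 8 (deflate).
        // Mask with 0xff so negative bytes print correctly.
        System.out.printf("%02x %02x %02x%n",
                bytes[0] & 0xff, bytes[1] & 0xff, bytes[2] & 0xff); // prints 1f 8b 08
    }
}
```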

bzip2
bzip2 is a data compression algorithm and program developed by Julian Seward and released under a free/open-source software license. Seward first released bzip2 0.15 in July 1996; over the following years the tool's stability improved and it grew increasingly popular, and version 1.0 was released in late 2000. See the Wikipedia article on bzip2 for more.
bzip2 achieves a higher compression ratio than gzip, but compresses more slowly.
bzip2 is not implemented in the JDK, but an implementation is available in Apache Commons Compress. Maven dependency:

 <dependency>
     <groupId>org.apache.commons</groupId>
     <artifactId>commons-compress</artifactId>
     <version>1.12</version>
 </dependency>

The specific code is as follows:

 public static byte[] compress(byte[] srcBytes) throws IOException {
     ByteArrayOutputStream out = new ByteArrayOutputStream();
     BZip2CompressorOutputStream bcos = new BZip2CompressorOutputStream(out);
     bcos.write(srcBytes);
     bcos.close();
     return out.toByteArray();
 }

 public static byte[] uncompress(byte[] bytes) {
     ByteArrayOutputStream out = new ByteArrayOutputStream();
     ByteArrayInputStream in = new ByteArrayInputStream(bytes);
     try {
         BZip2CompressorInputStream bzin = new BZip2CompressorInputStream(in);
         byte[] buffer = new byte[2048];
         int n;
         while ((n = bzin.read(buffer)) >= 0) {
             out.write(buffer, 0, n);
         }
     } catch (IOException e) {
         e.printStackTrace();
     }
     return out.toByteArray();
 }

The lzo, lz4, and snappy algorithms described below prioritize compression speed at the cost of a lower compression ratio.

lzo
LZO (Lempel-Ziv-Oberhumer) is a lossless data compression algorithm focused on decompression speed. See the Wikipedia article on LZO for more.
A third-party library is needed. Maven dependency:

 <dependency>
     <groupId>org.anarres.lzo</groupId>
     <artifactId>lzo-core</artifactId>
     <version>1.0.5</version>
 </dependency>

Specific implementation code:

 public static byte[] compress(byte[] srcBytes) throws IOException {
     LzoCompressor compressor = LzoLibrary.getInstance().newCompressor(
             LzoAlgorithm.LZO1X, null);
     ByteArrayOutputStream os = new ByteArrayOutputStream();
     LzoOutputStream cs = new LzoOutputStream(os, compressor);
     cs.write(srcBytes);
     cs.close();
     return os.toByteArray();
 }

 public static byte[] uncompress(byte[] bytes) throws IOException {
     LzoDecompressor decompressor = LzoLibrary.getInstance()
             .newDecompressor(LzoAlgorithm.LZO1X, null);
     ByteArrayOutputStream baos = new ByteArrayOutputStream();
     ByteArrayInputStream is = new ByteArrayInputStream(bytes);
     LzoInputStream us = new LzoInputStream(is, decompressor);
     int count;
     byte[] buffer = new byte[2048];
     while ((count = us.read(buffer)) != -1) {
         baos.write(buffer, 0, count);
     }
     us.close();
     return baos.toByteArray();
 }

lz4
LZ4 is a lossless data compression algorithm focused on compression and decompression speed. See the Wikipedia article on LZ4 for more.
Maven dependency for the third-party library:

 <dependency>
     <groupId>net.jpountz.lz4</groupId>
     <artifactId>lz4</artifactId>
     <version>1.2.0</version>
 </dependency>

Specific code implementation:

 public static byte[] compress(byte[] srcBytes) throws IOException {
     LZ4Factory factory = LZ4Factory.fastestInstance();
     ByteArrayOutputStream byteOutput = new ByteArrayOutputStream();
     LZ4Compressor compressor = factory.fastCompressor();
     LZ4BlockOutputStream compressedOutput = new LZ4BlockOutputStream(
             byteOutput, 2048, compressor);
     compressedOutput.write(srcBytes);
     compressedOutput.close();
     return byteOutput.toByteArray();
 }

 public static byte[] uncompress(byte[] bytes) throws IOException {
     LZ4Factory factory = LZ4Factory.fastestInstance();
     ByteArrayOutputStream baos = new ByteArrayOutputStream();
     LZ4FastDecompressor decompressor = factory.fastDecompressor();
     LZ4BlockInputStream lzis = new LZ4BlockInputStream(
             new ByteArrayInputStream(bytes), decompressor);
     int count;
     byte[] buffer = new byte[2048];
     while ((count = lzis.read(buffer)) != -1) {
         baos.write(buffer, 0, count);
     }
     lzis.close();
     return baos.toByteArray();
 }

snappy
Snappy (formerly known as Zippy) is a fast compression/decompression library written in C++ at Google, based on ideas from LZ77, and open-sourced in 2011. Its goal is not maximum compression ratio or compatibility with other compression libraries, but very high speed with a reasonable compression ratio. See the Wikipedia article on Snappy for more.
Maven dependency for the third-party library:

 <dependency>
     <groupId>org.xerial.snappy</groupId>
     <artifactId>snappy-java</artifactId>
     <version>1.1.2.6</version>
 </dependency>

Specific code implementation:

 public static byte[] compress(byte[] srcBytes) throws IOException {
     return Snappy.compress(srcBytes);
 }

 public static byte[] uncompress(byte[] bytes) throws IOException {
     return Snappy.uncompress(bytes);
 }

Benchmark
The following benchmark compresses and decompresses 35 KB of player data. Since 35 KB is fairly small, all the results below apply only to data in this size range and do not indicate which compression algorithm is better or worse in general.
Test environment:
jdk: 1.7.0_79
cpu: i5-4570 @ 3.20GHz, 4 cores
memory: 4 GB

Each algorithm performs 2000 compression and decompression runs on the 35 KB of data. The test code is as follows:

 public static void main(String[] args) throws Exception {
     FileInputStream fis = new FileInputStream(new File("player.dat"));
     FileChannel channel = fis.getChannel();
     ByteBuffer bb = ByteBuffer.allocate((int) channel.size());
     channel.read(bb);
     byte[] beforeBytes = bb.array();
     fis.close();

     int times = 2000;
     System.out.println("Size before compression: " + beforeBytes.length + " bytes");

     long startTime1 = System.currentTimeMillis();
     byte[] afterBytes = null;
     for (int i = 0; i < times; i++) {
         afterBytes = GZIPUtil.compress(beforeBytes);
     }
     long endTime1 = System.currentTimeMillis();
     System.out.println("Compressed size: " + afterBytes.length + " bytes");
     System.out.println("Compression runs: " + times + ", time: " + (endTime1 - startTime1) + " ms");

     byte[] resultBytes = null;
     long startTime2 = System.currentTimeMillis();
     for (int i = 0; i < times; i++) {
         resultBytes = GZIPUtil.uncompress(afterBytes);
     }
     long endTime2 = System.currentTimeMillis();
     System.out.println("Uncompressed size: " + resultBytes.length + " bytes");
     System.out.println("Decompression runs: " + times + ", time: " + (endTime2 - startTime2) + " ms");
 }

GZIPUtil in the code is swapped out for each algorithm's utility class; the test results are shown in the following figure:

The figure records, for each algorithm, the size before compression, the size after compression, the compression time, the decompression time, and the peak CPU usage.

Summary
From the results, deflate, gzip, and bzip2 favor compression ratio at the cost of longer compression and decompression times, while lzo, lz4, and snappy favor speed with a slightly lower compression ratio and lower CPU peaks. Since our compression-ratio requirements are tolerant and we care more about compression/decompression time and CPU usage, we ultimately chose Snappy: it has the lowest compression and decompression times as well as the lowest CPU peak, and its compression ratio is not far behind the others.

Personal blog: codingo.xyz
