-
Notifications
You must be signed in to change notification settings - Fork 2.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
NEON implementation for Adler32 #251
Closed
Closed
Changes from 1 commit
Commits
Show all changes
2 commits
Select commit
Hold shift + click to select a range
File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Prev
Previous commit
Using ARMv8 CRC32 specific instruction
CRC32 affects performance for both image decompression (PNG) as also in general browsing while accessing websites that serve content using compression (i.e. Content-Encoding: gzip). This first patch implements an optimized CRC32 function using the dedicated instruction available in ARMv8. It should be between 6x (A53: 116ms X 22ms for a 4Kx4Kx4 buffer) to 10x faster (A72: 91ms x 9ms) than the C implementation currently used by zlib. Details: https://bugs.chromium.org/p/chromium/issues/detail?id=709716 Change-Id: I069408ebc06c49a3c2be4ba3253319e025ee09d7
- Loading branch information
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,67 @@ | ||
/* Copyright (C) 1995-2011, 2016 Mark Adler | ||
* Copyright (C) 2017 ARM Holdings Inc. | ||
* Authors: Adenilson Cavalcanti <[email protected]> | ||
* Yang Zhang <[email protected]> | ||
* This software is provided 'as-is', without any express or implied | ||
* warranty. In no event will the authors be held liable for any damages | ||
* arising from the use of this software. | ||
* Permission is granted to anyone to use this software for any purpose, | ||
* including commercial applications, and to alter it and redistribute it | ||
* freely, subject to the following restrictions: | ||
* 1. The origin of this software must not be misrepresented; you must not | ||
* claim that you wrote the original software. If you use this software | ||
* in a product, an acknowledgment in the product documentation would be | ||
* appreciated but is not required. | ||
* 2. Altered source versions must be plainly marked as such, and must not be | ||
* misrepresented as being the original software. | ||
* 3. This notice may not be removed or altered from any source distribution. | ||
*/ | ||
#include <arm_acle.h> | ||
// Depending on the compiler flavor, size_t may be defined in | ||
// one or the other header. See: | ||
// http://stackoverflow.com/questions/26410466/gcc-linaro-compiler-throws-error-unknown-type-name-size-t | ||
#include <stdint.h> | ||
#include <stddef.h> | ||
|
||
uint32_t armv8_crc32_little(uint32_t crc, | ||
const unsigned char *buf, | ||
size_t len) { | ||
uint32_t c; | ||
const uint32_t *buf4; | ||
|
||
c = crc; | ||
c = ~c; | ||
while (len && ((ptrdiff_t)buf & 3)) { | ||
c = __crc32b(c, *buf++); | ||
len--; | ||
} | ||
|
||
buf4 = (const uint32_t *)(const void *)buf; | ||
|
||
while (len >= 32) { | ||
c = __crc32w(c, *buf4++); | ||
c = __crc32w(c, *buf4++); | ||
c = __crc32w(c, *buf4++); | ||
c = __crc32w(c, *buf4++); | ||
c = __crc32w(c, *buf4++); | ||
c = __crc32w(c, *buf4++); | ||
c = __crc32w(c, *buf4++); | ||
c = __crc32w(c, *buf4++); | ||
len -= 32; | ||
} | ||
|
||
while (len >= 4) { | ||
c = __crc32w(c, *buf4++); | ||
len -= 4; | ||
} | ||
|
||
buf = (const unsigned char *)buf4; | ||
if (len) { | ||
do { | ||
c = __crc32b(c, *buf++); | ||
} while (--len); | ||
} | ||
|
||
c = ~c; | ||
return c; | ||
} |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
In a second thought, this should handle the case of buf == Z_NULL.