Highly-compressible data with long run lengths #12

Open
ivan-tkatchev opened this issue Mar 31, 2015 · 1 comment

Comments

@ivan-tkatchev

Examples include VT100 escape codes captured from a terminal, or images in PPM format. Such data is highly compressible, with compression ratios of 20x or more. In some codecs, this kind of data causes strange behavior and triggers edge cases.

Here is an example file: https://www.dropbox.com/s/5gzk7ro4ze7v3xw/testimage.ppm?dl=0

(Just a screencap from a terminal emulator on my machine, nothing that could have licensing issues.)

@ivan-tkatchev
Author

P.S. This file triggers strange behavior in gzip: the gzip output is 3.5 times larger than the bzip2 output for the same file. This has practical implications: after converting the PPM to PNG, the file is almost twice as big as the same image converted to GIF, which really should never happen. (PNG uses deflate under the hood.)
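
For anyone who wants to reproduce this kind of comparison without downloading the file, here is a minimal sketch using Python's standard `zlib` (deflate, the algorithm behind gzip and PNG) and `bz2` modules. The helper names `make_long_run_image`, `compressed_sizes`, and `report` are made up for illustration, and the synthetic buffer is only an assumed stand-in for a terminal screenshot, not the linked `testimage.ppm`, so the sizes it prints will not match the figures reported above.

```python
import bz2
import zlib


def make_long_run_image(width=640, height=200):
    """Build a synthetic raw RGB buffer made of long runs.

    Hypothetical stand-in for a terminal screenshot: each row is a long
    run of black pixels followed by a short run of white pixels.
    """
    row = b"\x00\x00\x00" * (width - 40) + b"\xff\xff\xff" * 40
    return row * height


def compressed_sizes(data):
    """Return the compressed size in bytes for deflate (zlib) and bzip2."""
    return {
        "deflate": len(zlib.compress(data, 9)),
        "bzip2": len(bz2.compress(data, 9)),
    }


def report(data):
    """Print size and compression ratio per codec, and return the sizes."""
    sizes = compressed_sizes(data)
    for name, size in sizes.items():
        print(f"{name}: {size} bytes, ratio {len(data) / size:.1f}x")
    return sizes


if __name__ == "__main__":
    report(make_long_run_image())
```

To run the same comparison on a real file, read it as bytes (for example `open(path, "rb").read()`, with `path` pointing at a local copy) and pass the result to `report`.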
