Skip to main content

Questions tagged [compression]

Filter by
Sorted by
Tagged with
5 votes
5 answers
771 views

I often see the claim that various data warehouse/analytical database systems derive significant performance benefits from compressing their data stores. On the face of it, though, this seems to be ...
Mason Wheeler's user avatar
0 votes
5 answers
293 views

We have a huge amount of queries hitting our API that request a minor or major extract of some huge files lying around on our mounted hard drives. The data needs to be extracted from the files and ...
glades's user avatar
  • 493
1 vote
1 answer
2k views

I have a jar file, for example foo.jar. My code contains a lot of libraries (almost 75 jar dependencies). I am not using anything like maven or gradle, I'm just using pure java with pure jar files as ...
Day Trip's user avatar
0 votes
1 answer
388 views

I am writing a c# program where I need to print a lot of small barcodes in a 100x100 grid on a piece of paper. I then scan/photograph the paper and read the barcodes again. Each barcode only need to ...
DrDress's user avatar
  • 127
7 votes
2 answers
645 views

What is difference between Average length of codes and Average length of codewords in Huffman Algorithm? is both the same meaning? I get stuck in some facts: I see a fact that marked as False: for a ...
Emma Nic.'s user avatar
  • 183
7 votes
2 answers
887 views

Software libraries targetting resource constrained environments like embedded systems use conditional compilation to allow consumers to shave space by removing unused features from the final binaries ...
TZubiri's user avatar
  • 443
2 votes
2 answers
571 views

Today I went across a weird case for which I have no explanation, so here I am. I have two files with identical content, but one is encoded in UTF-8 and the other one is in IBM EBCDIC. Both of them ...
rodripf's user avatar
  • 137
0 votes
2 answers
139 views

I have thinking about this idea for over 5 years and i don't have the complete technical knowledge to fully grasp the idea I'm having. The premise of the idea is to have an extremely high base number ...
Necro's user avatar
  • 105
30 votes
5 answers
8k views

This question is about how many bits are required to store a range. Or put another way, for a given number of bits, what is the maximum range that can be stored and how? Imagine we want to store a ...
rghome's user avatar
  • 688
3 votes
2 answers
135 views

I'm a high school student interested in topics of computer programming. Recently I became interested in file compression, and in my head I tried to combine this with a completely different part of ...
Goel Nimi's user avatar
2 votes
3 answers
2k views

I am looking for text compression algorithms (natural language compression, rather than compression of arbitrary binary data). I have seen for example An Efficient Compression Code for Text ...
Lance Pollard's user avatar
-2 votes
1 answer
197 views

Is it possible to compress true random permutation using low order polynomial interpolation? If yes, how it can be achieved?
user9340043's user avatar
0 votes
1 answer
492 views

I was wondering if what I have in mind already exists in any known compression programs/algorithms or not. We know that Seed gives us constant sequence of random numbers. so if we be able to find seed ...
M.kazem Akhgary's user avatar
4 votes
1 answer
371 views

This question was inspired by MessagePack, but I'm looking for a general answer about the advantages of in-app vs. external compression. For network I/O, doesn't the transport protocol (at least ...
Kevin Krumwiede's user avatar
2 votes
1 answer
105 views

I have a stream of binary data. Assume no prior knowledge about the expected pattern in input data. The symbols can represent binary data or other symbols, hence hierarchical. The output should ...
Quark's user avatar
  • 37
4 votes
5 answers
640 views

A while ago I asked a question about custom text data formats, instead of using existing tools such as XML, JSON, YAML, etc. Now, in favor of converting our custom format to a relational database and ...
Chris Cirefice's user avatar
3 votes
1 answer
521 views

Is there a possible test to check if a PDF file contains text or it is created by scanning paper sheets ? text : plain text that, for example, I can copy & paste while I am reading the PDF. Not ...
Massimo's user avatar
  • 131
5 votes
2 answers
585 views

I have a collection of strings which have a lot of common substrings, and I'm trying to find a good way to define tokens to compress them. For instance, if my strings are: s1 = "String" s2 = "Bool" ...
ErikR's user avatar
  • 296
9 votes
2 answers
1k views

I have a folder containing about 9,000 JPEG photos (about 30Gb), which I want to archive with some sort of compression. I understand that compressing JPEGs is not normally very effective, but these ...
Stephen's user avatar
  • 201
1 vote
1 answer
1k views

I need to compress an id for marketing campaigns. The current campaign id is 32-bit integer but obviously this is too long for a customer to type by hand. I would like to compress this to minimum ...
user594883's user avatar
3 votes
2 answers
277 views

I am looking for algorithm or idea for the following problem. Suppose we have a data type, say 64-bit integer. Now we have a relatively small set of such items, say few hundred at most. The simplest ...
haael's user avatar
  • 133
1 vote
0 answers
2k views

I am writing a python program which parses zip (currently only zlib, using DEFLATE compression) files and verifies the correctness of their headers and data. One of the things I'm trying to achieve is ...
S B's user avatar
  • 11
2 votes
2 answers
2k views

When using Golomb/Rice code in image compression, it is inevitable for us to meet large values. Golomb coding uses a tunable parameter M to divide an input value N into two parts : q, the result of a ...
dongbao wu's user avatar
2 votes
2 answers
2k views

I've recently come across an application by Yahoo called SmushIt. Apparently it does lossless compression on images. Sometimes the image size is reduced by as much as 90%. This of course has major ...
Alternatex's user avatar
  • 1,031
1 vote
2 answers
2k views

I had a question regarding compression and calculation of checksum/hash of data. I would like to know if checksum has to be calculated before or after the compression of data before transmission. ...
redDragon's user avatar
  • 105