Hadoop feat. Lzo - save disk space and speed up your programs
The whole point of Hadoop is to process very large datasets. This implies that you will be using a lot of disk space, all those big files replicated a couple of times add up. Let's look at how we can compress text files, saving disk space without losing performance.