Tuesday, January 14, 2014

Append files in Hadoop?

Can I really append to a file in HDFS, or do I always need to replace the file? http://hadoop4mapreduce.blogspot.com/2012/08/two-methods-to-append-content-to-file.html
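One route that may be worth trying is the WebHDFS REST interface, which documents an APPEND operation as a two-step POST: the first request goes to the NameNode without data and returns a 307 redirect, and the second sends the bytes to the DataNode URL in the Location header. Below is a minimal sketch, not a tested recipe. It assumes WebHDFS is enabled and that the cluster's Hadoop version and settings (for example dfs.support.append) allow appends at all. The host, port, path, and user values are hypothetical placeholders, and the helper names append_url and append_bytes are my own.

```python
import http.client
from urllib.parse import quote, urlencode, urlsplit


def append_url(host, port, path, user=None):
    """Build the WebHDFS URL for op=APPEND on the given HDFS path."""
    query = {"op": "APPEND"}
    if user:
        # Only meaningful on clusters without security enabled (assumption).
        query["user.name"] = user
    return f"http://{host}:{port}/webhdfs/v1{quote(path)}?{urlencode(query)}"


def append_bytes(host, port, path, data, user=None):
    """Append data to an existing HDFS file; returns the final HTTP status."""
    first = urlsplit(append_url(host, port, path, user))

    # Step 1: POST to the NameNode with no body; http.client does not
    # follow redirects, so the 307 response can be inspected directly.
    conn = http.client.HTTPConnection(first.hostname, first.port)
    conn.request("POST", f"{first.path}?{first.query}")
    resp = conn.getresponse()
    resp.read()
    location = resp.getheader("Location")
    conn.close()
    if resp.status != 307 or not location:
        raise RuntimeError(f"expected a 307 redirect, got {resp.status}")

    # Step 2: POST the actual bytes to the DataNode URL from Location.
    target = urlsplit(location)
    conn = http.client.HTTPConnection(target.hostname, target.port)
    conn.request(
        "POST",
        f"{target.path}?{target.query}",
        body=data,
        headers={"Content-Type": "application/octet-stream"},
    )
    resp = conn.getresponse()
    resp.read()
    conn.close()
    return resp.status
```

With a reachable cluster, a call would look like append_bytes("namenode.example.com", 50070, "/logs/app.log", b"new line\n", user="hdfs"), where 50070 is assumed to be the NameNode HTTP port. If appends are not enabled, the fallback is still to rewrite the file.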

Monday, January 6, 2014

Compression in Hadoop

I am currently researching various options to do compression in Hadoop and found the following articles useful:
My project needs compression with minimal performance impact and decent compression gains. It looks like my choice is splittable LZO.
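To make the speed versus size trade-off concrete, here is a rough sketch that times a few codecs on the same input and reports the compression ratio. It uses only the codecs in the Python standard library (zlib, bz2, lzma) as stand-ins; LZO is not among them, so this shows the measuring approach rather than LZO itself. The sample data is made up, and real numbers depend heavily on the actual files.

```python
import bz2
import lzma
import time
import zlib

# Stand-in codecs from the standard library; LZO would need a third-party binding.
CODECS = {
    "zlib": (zlib.compress, zlib.decompress),
    "bz2": (bz2.compress, bz2.decompress),
    "lzma": (lzma.compress, lzma.decompress),
}


def benchmark(data):
    """Return {codec: (compression_ratio, compress_seconds)} for the given bytes."""
    results = {}
    for name, (compress, decompress) in CODECS.items():
        start = time.perf_counter()
        packed = compress(data)
        elapsed = time.perf_counter() - start
        # Sanity check: the round trip must give back the original bytes.
        if decompress(packed) != data:
            raise ValueError(f"{name} round trip failed")
        results[name] = (len(data) / len(packed), elapsed)
    return results


# Hypothetical log-like sample; replace with a slice of real project data.
sample = b"2014-01-06 INFO job=42 status=ok\n" * 20000

for name, (ratio, seconds) in benchmark(sample).items():
    print(f"{name}: ratio {ratio:.1f}x, compress time {seconds * 1000:.1f} ms")
```

Running the same comparison on a representative slice of the real data, with the candidate Hadoop codecs, is what should drive the final choice.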

Sunday, December 1, 2013

Cascading for Workflows in Hadoop

Cascading lets you build workflows on Hadoop, and its Lingual extension adds SQL-based pipelining on top.

Wednesday, November 13, 2013

MapReduce in Eclipse

A good article on how to set up Eclipse for MapReduce development: http://www.thecloudavenue.com/2012/10/debugging-hadoop-mapreduce-program-in.html

Thursday, June 20, 2013

HDInsight Getting Started Link

I have been reading up on Microsoft HDInsight, a new Hadoop distribution on the market. Its cloud service is very easy to use and has a unique feature: you can keep the data on low-cost storage and bring the Hadoop cluster back online later when needed. Here is a getting-started guide: http://www.windowsazure.com/en-us/manage/services/hdinsight/get-started-hdinsight/#header-7