Monthly Archive: July 2013

Remotely debug Hadoop

Source: http://www.gluster.org/2013/07/deep-dive-into-hadoop-with-bigtop-and-eclipse-remote-debuggers/

Deep dive into Hadoop with Bigtop and Eclipse Remote Debuggers

Thanks to a little hack session with Bradley Childs over at Red Hat this week, I learned a new trick: remote debugging of JVM (Hadoop + MR2) apps in...

mapred Vs. mapreduce

Resources:
http://stackoverflow.com/questions/7598422/is-it-better-to-use-the-mapred-or-the-mapreduce-package-to-create-a-hadoop-job
http://stackoverflow.com/questions/10986633/hadoop-configuration-mapred-vs-mapreduce
http://www.slideshare.net/sh1mmer/upgrading-to-the-new-map-reduce-api

Sample test data generators

Resources:
http://www.webresourcesdepot.com/test-sample-data-generators/
http://databene.org/databene-benerator/similar-products.html

Tools:
GenerateData: a free, open-source script written in JavaScript, PHP and MySQL that lets you quickly generate large volumes of custom data in a variety of formats for use in testing software, populating databases...

Debug Hadoop source code using IntelliJ IDEA

Source: http://www.techbite.in/2013/05/debug-hadoop-source-code-using.html

If you want to dive into the Hadoop source code, get a feel for the implementation details that Hadoop's architectural overview abstracts away, and get your hands dirty by...