Typical Linux setup - http://www.michael-noll.com/wiki/Running_Hadoop_On_Ubuntu_Linux_(Multi-Node_Cluster) http://www.thegeekstuff.com/2012/02/hadoop-standalone-installation/ http://hadoop.apache.org/common/docs/stable/mapred_tutorial.html#MapReduce+-+User+Interfaces SQL interface - https://www.ibm.com/developerworks/opensource/library/os-mapreducesql/ The components and their functions in the Hadoop ecosystem - http://nosql.mypopescu.com/post/17564108280/the-components-and-their-functions-in-the-hadoop?ef125c00 How to due with small files - http://www.cloudera.com/blog/2009/02/02/the-small-files-problem/ HDFS Vs KFS - http://www.nabble.com/HDFS-Vs-KFS-tt19089077.html RealTime - http://www.nabble.com/realtime-hadoop-tt18078957.html http://www.nabble.com/Realtime-Map-Reduce-%3D-Supercomputing-for-the-Masses--tt17581830.html#a17581830 Difference between joining and reducing - http://markmail.org/message/2hvv2ax6h7k5svxo Multiple artifact sets and privacy - http://markmail.org/message/kklzfkfbh7jegpmp Scaling Hadoop to 4000 nodes at Yahoo! - http://developer.yahoo.net/blogs/hadoop/2008/09/scaling_hadoop_to_4000_nodes_a.html Basic - http://storageconference.org/2010/Papers/MSST/Shvachko.pdf http://mycloudresearch.wordpress.com/2012/03/14/simple-hadoop-overview/ http://www.javacodegeeks.com/2012/05/mapreduce-questions-and-answers-part-1.html http://www.javacodegeeks.com/2012/05/mapreduce-questions-and-answers-part-2.html Using R - http://www.infoq.com/presentations/Distributed-Data-Analysis-with-Hadoop-and-R http://www.slideshare.net/ChicagoHUG/getting-started-with-r-hadoop-chug-20120815 Links to tutorials for a lot of difference frameworks - http://blog.matthewrathbone.com/post/39783477991/a-quick-guide-to-hadoop-map-reduce-frameworks http://hortonworks.com/blog/windows-explorer-experience-hdfs/ http://hadoopecosystemtable.github.io/