–Get Started with Hadoop: from Evaluation to a Production Server
Posted by Brett Sheppard on June 7, 2011
Hadoop is is growing up. Apache Software Foundation (ASF) Hadoop and its related projects and sub-projects are maturing as an integrated, loosely coupled stack to store, process and analyze huge volumes of varied semi-structured, unstructured and raw data. This piece provides tips, cautions and best practices for an organization that would like to evaluate Hadoop and deploy an initial cluster. It focuses on the Hadoop Distributed File System (HDFS) and MapReduce. If you are looking for details on Hive, Pig or related projects and tools, you will be disappointed in this specific article, but I do provide links for where you can find more information. You can also refer to the live or archived presentations at the Yahoo Developer Network Hadoop Summit 2011 on June 29, 2011 in Santa Clara, Calif., and Hadoop World 2011, sponsored by Cloudera, in New York City on November 8-9, 2011.
This article is available on the O’Reilly Media sites at http://oreil.ly/lEPwQL
Hadoopのはじめ方 | 大規模計算ドットコム said
[...] 米ガートナー社の元シニアアナリストのBrett Sheppard氏のブログにHadoopの入門編が紹介されている。最初は、ApacheのWebサイトにあるQuick Startからはじめ、ネットワーク認証プロトコルにはKerberosのサイト、実践的なトレーニングも含めた書籍としてはオライリー社のHadoop: The Definitive Guideなどが紹介されている。かなり網羅されているので、これからHadoopを考えているなら参考になるだろう。 図表1:Kerberos: The Network Authentication Protocol [...]