Thoughts on analytics, data management, visualization and collaboration

Archive for the ‘Hadoop’ Category

–Hadoop Examples

Posted by Brett Sheppard on July 31, 2011

Organizations in multiple industries and the public sector are using Hadoop as one part of their integrated data architectures to obtain the highest value from their data. Hadoop is an increasingly popular option to process, store and analyze huge volumes of semi-structured, unstructured or raw data, often from disparate data sources.

The following are 10 public-domain examples of Hadoop production clusters, with URL links if you would like additional information: Read the rest of this entry »

Posted in Big Data, Hadoop | 5 Comments »

–Outliers and Coexistence are the New Normal for Big Data

Posted by Brett Sheppard on March 24, 2011

Many enterprise architectures have evolved into coexistence environments to manage and benefit from advanced analytics, with help from technology and cloud service providers that are ramping up integration capabilities. Letting data speak for itself through analysis of entire data sets is eclipsing modeling from sub-sets. In the past, all too often what was once disregarded as an “outlier” in a data model turned out to be the telltale signs of a micro-trend that became a major event.The combination of (1) coexistence approaches to manage big data volumes, complexity and speed together with (2) analysis of complete data sets is driving operational efficiency, revenue growth and enablement of new business models. Outliers and coexistence are the new normal for big data.

This article is available on the O’Reilly Media sites at http://oreil.ly/fT8T1E

Posted in Big Data, Clouds, Hadoop | 1 Comment »

–Big Data 2011 Preview

Posted by Brett Sheppard on January 31, 2011

During the 2011 National Football League (NFL) playoff TV broadcasts — amid commercials with Anheuser-Busch Clydesdales and auto racing driver Danica Patrick — an ad appeared with an IBM researcher talking about data analytics. In the IBM TV ad, Dr. David Ferrucci discusses how an IBM Watson supercomputer competes in a Jeopardy! game by integrating analytics, natural language capabilities and rapid search of disparate data.

While at first glance NFL TV broadcasts may seem an unusual forum for a discussion of data analytics, Big Data offers important tools for enterprises of all sizes to improve operational efficiencies, grow revenues, and empower new business models.

Read the rest of this entry »

Posted in Big Data, Clouds, Hadoop | 4 Comments »

–Next LAMP Stack

Posted by Brett Sheppard on July 27, 2010


The Next LAMP Stack: Hadoop Platform for Big Data Analytics

Editor’s note: a shorter version of this article appeared on GigaOM.

Many Fortune 500 and mid-size enterprises are intrigued by Hadoop for Big Data analytics and are funding Hadoop test/dev projects, but would like to see Hadoop evolve into a more fully integrated analytics platform, similar to what the LAMP (Linux, Apache HTTP Server, MySQL and PHP) stack has enabled for web applications. For example, head of technology strategy and innovation at credit card giant Visa, Joe Cunningham, told the audience at last year’s Hadoop World that he would like to see Visa’s use of Hadoop evolve from an alpha/beta environment into mainstream use for transaction analysis, but has concerns about integration and operations management.

Read the rest of this entry »

Posted in Big Data, Hadoop | Tagged: , , , , | 10 Comments »

–Karmasphere Releases

Posted by Brett Sheppard on July 25, 2010

Making Hadoop Accessible for Enterprise Developers and Analysts

Karmasphere offers front-end client software that enables developers and analysts who are not necessarily Hadoop specialists to develop, debug and deploy Hadoop jobs to virtually any private, public or hybrid Hadoop cluster. You can download the NetBeans or Eclipse versions of the no-cost Karmasphere Studio Community Edition, or apply for a limited beta of the just-announced Karmasphere Studio Professional Edition and Analyst Edition, at Karmasphere’s software download page. Editor’s note: This is a reprint of an article that first appeared on Big Data News.

Read the rest of this entry »

Posted in Big Data, Hadoop | Tagged: | Leave a Comment »

–Accel Event Recap

Posted by Brett Sheppard on July 22, 2010

Hadoop, Memcached and Solid-state Storage

Hosted at Stanford University, Accel Partners brought together executives from four of their portfolio companies to discuss evolution of a “New Data Stack” incorporating Hadoop, memcached and sold-state storage. Read the rest of this entry »

Posted in Hadoop | Tagged: , , , , , , , , | Leave a Comment »