Using Hadoop just got easier, thanks to Teradata’s introduction of SQL-H, a new query interface to analyze data from Hadoop. Most Hadoop access methods require preproc
essing and staging of data from the Hadoop Distributed File System (HDFS) using technologies such as MapReduce. These approaches require new skills and technologies, introducing more time and costs for users, which offset the benefits of Hadoop, which according to our big data benchmark research include increasing the speed of analysis. Teradata has announced support for SQL-H not only for its own Aster Database 5.0, which it expects to release in the third quarter, but also supporting the commercial version of Hadoop through Hortonworks.
Use of a familiar query interface, by contrast, reduces staffing and training issues required for learning more Hadoop-specific interfaces, which our research found to be the top two obstacles to big data analytics. Teradata Aster accomplishes this through utilizing use of HCatalog to get access to metadata that can be queried against using Aster SQL, ODBC, JDBC and ultimately any analytics or business intelligence tool, since the data then looks like a database table structure. The need to extract and store data from Hadoop into other database systems and thereby lose the computing power of Hadoop has been the Achilles heel of this big data technology. Analysts who want interactive and iterative discovery of their data now do not have to depend on the Hadoop Hive query language interface and can use more familiar tools like MicroStrategy and Tableau for analytics. Teradata Aster incorporates derivative analytics in its technology to be applied to data in Hadoop, including the customer and transaction data that, according to our research, top the list of types used. Its capabilities include analytics around paths, text, statistics, segmentation and broader customer interaction.
Teradata Aster has an advantage over EMC Greenplum, IBM and Oracle, which do not provide this level of direct integration with Hadoop today. Their approach requires data duplication and does not leverage the extended power of Hadoop and use of HCatalog for metadata knowledge about the data itself. I expect that if other vendors want to exploit the power of Hadoop they will need to expand their support of it over the coming year.
The introduction of SQL-H in Teradata Aster helps analysts streamline their analytics while reducing the custom coding and development required from IT staffers. Utilizing the Aster platform provides other computational processing advantages in its scale-out approach using a range of server technologies. According to our research, one-third of organizations plan to use Hadoop. For Teradata Aster, support for Hadoop builds on its existing big data support. Organizations looking to further exploit Hadoop to analyze large volumes of data quickly should find Teradata Aster SQL-H a welcome advancement for their data and analytic options.
Regards,
Mark Smith – CEO & Chief Research Officer

Business Exchange
Google+
Klout
Kred
LinkedIn
Plaxo
Twitter
Facebook Fan Page
Ventana Research Website
10 comments
Comments feed for this article
June 19, 2012 at 1:38 pm
David Menninger (@dmenningeremc)
Mark,
I thought I should correct your comment about EMC Greenplum requiring duplication of data in order to access it with SQL. Greenplum supports direct access to Hadoop data via external tables without requiring any duplication of data. This functionality as been available for over a year beginning with Version 4.1 of the Greenplum Database.
Dave
EMC Greenplum
July 6, 2012 at 2:35 pm
Ventana Research
Thanks for the clarification and correction on use of HDFS external tables with EMC Greenplum. Also, this is a great resource for others looking at your integration – http://www.greenplum.com/sites/default/files/EMC_Greenplum_Hadoop_DB_TB_0.pdf . Also, maybe you can help get some information flowing from EMC as we get little to no communication, updates or information on EMC and/or Greenplum.
Mark
June 22, 2012 at 7:04 pm
Hortonworks Leads a Fast and Growing Herd of Hadoop «
[...] needs. Competitor Teradata Aster has also announced support of HCatalog and has created SQL-H, as I recently analyzed, which can help in query and retrieval into [...]
June 22, 2012 at 7:05 pm
Hortonworks Leads a Fast and Growing Herd of Hadoop «
[...] needs. Competitor Teradata Aster has also announced support of HCatalog and has created SQL-H, as I recently analyzed, which can help in query and retrieval into [...]
July 3, 2012 at 10:07 am
Donal Daly
Thanks a lot for your excellent summary of the benefits of SQL-H. However, I would like to add that, while you mention customer and transaction data in particular, I see even more benefits in analyzing multi-structured data. After all, companies like Amazon, eBay, Facebook and Twitter – to name but a few – use HDFS to store huge amounts of data, mainly texts, pictures and other interactions. Enabling access to HDFS using standard SQL will increase the number of users that can take advantage of such multi-structured data substantially, as was pointed out by one of my colleagues: http://blogs.teradata.com/emea/Democratizing-big-data/ I am quite sure that, with a growing amount of users, new possibilities for using data stored in HDFS will soon open up and I would be looking forward to continuing the discussion on this topic.
October 24, 2012 at 7:35 pm
Teradata’s Big Data and Analytics Strategy Unveiled «
[...] part of Teradata’s big data and analytics strategy is its integration of Aster that my colleague assessed, which the company acquired about a year ago, into the Teradata portfolio. Aster offers some big [...]
October 24, 2012 at 7:37 pm
Teradata’s Big Data and Analytics Strategy Unveiled «
[...] part of Teradata’s big data and analytics strategy is its integration of Aster that my colleague assessed, which the company acquired about a year ago, into the Teradata portfolio. Aster offers some big [...]
October 29, 2012 at 6:12 pm
The Big Data in Teradata «
[...] of Aster Data, which provided Teradata an anchor point for accessing data across the enterprise, as I recently assessed. Aster Data provides a unified approach to accessing Hadoop data from analytics or business [...]
October 29, 2012 at 6:12 pm
The Big Data in Teradata «
[...] Aster Data, which provided Teradata an anchor point for accessing data across the enterprise, as I recently assessed. Aster Data provides a unified approach to accessing Hadoop data from analytics or business [...]
November 1, 2012 at 8:37 am
The Big Data in Teradata | Strategic HRStrategic HR
[...] Aster Data, which provided Teradata an anchor point for accessing data across the enterprise, as I recently assessed. Aster Data provides a unified approach to accessing Hadoop data from analytics or business [...]