[#HADOOP-5815] Sqoop: A database import tool for Hadoop - ASF JIRA


Topic: Technology 9:58 pm EDT, May 19, 2009

Overview:

Sqoop is a tool designed to help users import existing relational databases into their Hadoop clusters. Sqoop uses JDBC to connect to a database, examine the schema for tables, and auto-generate the necessary classes to import data into HDFS. It then instantiates a MapReduce job to read the table from the database via the DBInputFormat (JDBC-based InputFormat). The table is read into a set of files loaded into HDFS. Both SequenceFile and text-based targets are supported.
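The flow described above (examine the schema, generate a record class, read the table into a set of text files) can be sketched in a few lines. This is only an illustration under loud assumptions, not Sqoop's implementation: Python's built-in sqlite3 stands in for a JDBC connection, a local directory stands in for HDFS, a simple loop stands in for the MapReduce job, and the names `generate_record_class`, `import_table`, `demo`, `TYPE_MAP`, and the `part-NNNNN` file naming are all hypothetical.

```python
import sqlite3
import tempfile
from dataclasses import make_dataclass
from pathlib import Path

# Assumed mapping from SQLite declared types to Python field types.
TYPE_MAP = {"INTEGER": int, "REAL": float, "TEXT": str}


def generate_record_class(conn, table):
    """Examine the table's schema and build a record class from its columns."""
    # PRAGMA table_info yields (cid, name, type, notnull, dflt_value, pk) per column.
    columns = conn.execute(f"PRAGMA table_info({table})").fetchall()
    fields = [(name, TYPE_MAP.get(ctype.upper(), str)) for _, name, ctype, *_ in columns]
    return make_dataclass(table.capitalize() + "Record", fields)


def import_table(conn, table, out_dir, rows_per_file=2):
    """Read the table and write it out as a set of comma-delimited text files."""
    record_cls = generate_record_class(conn, table)
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    written = []
    cursor = conn.execute(f"SELECT * FROM {table}")
    part = 0
    while True:
        rows = cursor.fetchmany(rows_per_file)
        if not rows:
            break
        # One output file per chunk of rows (stand-in for one file per map task).
        path = out_dir / f"part-{part:05d}"
        with path.open("w") as f:
            for row in rows:
                record = record_cls(*row)
                f.write(",".join(str(v) for v in vars(record).values()) + "\n")
        written.append(path)
        part += 1
    return record_cls, written


def demo():
    """Build a tiny in-memory table and import it into a temporary directory."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE employees (id INTEGER, name TEXT, salary REAL)")
    conn.executemany(
        "INSERT INTO employees VALUES (?, ?, ?)",
        [(1, "ada", 100.0), (2, "bob", 90.5), (3, "cy", 80.0)],
    )
    return import_table(conn, "employees", tempfile.mkdtemp())
```

Calling `demo()` returns the generated record class and the list of files written. With three rows and two rows per file, the sketch produces two text files.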

Longer term, Sqoop will support automatic connectivity to Hive, with the ability to load data files directly into the Hive warehouse directory, and also to inject the appropriate table definition into the metastore.
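As a rough idea of what deriving a Hive table definition from an imported schema might involve, the sketch below builds a HiveQL CREATE TABLE statement from a list of column names and source types. The type mapping `HIVE_TYPES` and the function `hive_create_table` are hypothetical and chosen for illustration. The source does not say how Sqoop will generate or inject the definition.

```python
# Hypothetical mapping from source column types to Hive column types.
HIVE_TYPES = {
    "INTEGER": "INT",
    "BIGINT": "BIGINT",
    "REAL": "DOUBLE",
    "TEXT": "STRING",
    "VARCHAR": "STRING",
}


def hive_create_table(table, columns, delimiter=","):
    """Build a Hive CREATE TABLE statement from (name, source_type) pairs."""
    # Unknown source types fall back to STRING (an assumption of this sketch).
    cols = ", ".join(
        f"{name} {HIVE_TYPES.get(ctype.upper(), 'STRING')}" for name, ctype in columns
    )
    return (
        f"CREATE TABLE {table} ({cols}) "
        f"ROW FORMAT DELIMITED FIELDS TERMINATED BY '{delimiter}' "
        "STORED AS TEXTFILE"
    )


ddl = hive_create_table(
    "employees", [("id", "INTEGER"), ("name", "TEXT"), ("salary", "REAL")]
)
```

The delimiter in the statement has to match the one used when the text files were written, so that Hive can parse the imported rows.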
