Sqoop Installation
Published: 2019-02-28


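I. Overview
Sqoop is an Apache tool for transferring data between Hadoop and relational database servers.
Import: moves data from MySQL, Oracle, and other relational databases into Hadoop storage systems such as HDFS, Hive, and HBase.
Export: moves data from the Hadoop file system out to a relational database such as MySQL.
At its core Sqoop is a command-line tool; compared with HDFS or Hive, there is little theory to learn.
In short, moving data from a non-Hadoop system into Hadoop is an import, and moving data from Hadoop to a non-Hadoop system is an export.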
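As a rough illustration of the two directions, the sketch below only prints a `sqoop import` and a `sqoop export` command line; it does not run them. The host `dbhost`, database `testdb`, tables `emp` and `emp_copy`, user `root`, and the HDFS path are hypothetical placeholders. The flags (`--connect`, `--username`, `-P`, `--table`, `--target-dir`, `--export-dir`, `-m`) are standard Sqoop 1.4.x options.

```shell
#!/bin/sh
# Print (not run) a Sqoop import command: database table -> HDFS directory.
# $1 = JDBC URL, $2 = database user, $3 = source table, $4 = HDFS target dir
sqoop_import_cmd() {
    printf 'sqoop import --connect %s --username %s -P --table %s --target-dir %s -m 1\n' \
        "$1" "$2" "$3" "$4"
}

# Print (not run) a Sqoop export command: HDFS directory -> database table.
# $1 = JDBC URL, $2 = database user, $3 = destination table, $4 = HDFS source dir
sqoop_export_cmd() {
    printf 'sqoop export --connect %s --username %s -P --table %s --export-dir %s -m 1\n' \
        "$1" "$2" "$3" "$4"
}

# All values below are placeholders chosen for illustration.
sqoop_import_cmd jdbc:mysql://dbhost:3306/testdb root emp /user/root/emp
sqoop_export_cmd jdbc:mysql://dbhost:3306/testdb root emp_copy /user/root/emp
```

`-P` makes Sqoop prompt for the password instead of taking it on the command line, and `-m 1` runs a single map task, which is a convenient setting for a first test.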
II. How It Works
Sqoop translates each import or export command into a MapReduce program and runs it.
The generated MapReduce job mainly customizes the InputFormat and OutputFormat.
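To make the customized input side a little more concrete: for an import, Sqoop looks up the minimum and maximum of a split column (the primary key by default, or the column named with `--split-by`) and gives each map task one slice of that range. The sketch below is a simplified integer-only illustration of that idea, not Sqoop's actual code; the column name `id` and the sample bounds are assumptions.

```shell
#!/bin/sh
# split_ranges MIN MAX N
# Print N WHERE-clause fragments that together cover MIN..MAX.
# The function body runs in a subshell, so its variables stay local.
split_ranges() (
    min=$1; max=$2; n=$3
    step=$(( (max - min) / n ))
    i=0
    lo=$min
    while [ "$i" -lt "$n" ]; do
        if [ "$i" -eq $(( n - 1 )) ]; then
            # The last slice is closed so that MAX itself is included.
            echo "id >= $lo AND id <= $max"
        else
            hi=$(( lo + step ))
            echo "id >= $lo AND id < $hi"
            lo=$hi
        fi
        i=$(( i + 1 ))
    done
)

# Four map tasks over ids 0..1000 (sample values).
split_ranges 0 1000 4
```

Each printed fragment stands for the condition one map task would add to its `SELECT`, which is why a column with evenly distributed values makes a good split column.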
III. Installing Sqoop
1. Download Sqoop
[root@centos opt]# wget http://archive.apache.org/dist/sqoop/1.4.6/sqoop-1.4.6.bin__hadoop-2.0.4-alpha.tar.gz
2. Install Sqoop
[root@centos opt]# tar -zxvf sqoop-1.4.6.bin__hadoop-2.0.4-alpha.tar.gz
[root@centos opt]# mv sqoop-1.4.6.bin__hadoop-2.0.4-alpha sqoop-1.4.6
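If the download and unpack steps need to be repeated for another release, the file name and URL can be derived from two values. This is a small sketch that assumes other releases follow the same naming pattern as the 1.4.6 archive used above; the function names are made up for illustration.

```shell
#!/bin/sh
SQOOP_VERSION=1.4.6
HADOOP_TAG=2.0.4-alpha   # Hadoop build named in the binary tarball

# sqoop_archive VERSION HADOOP_TAG: print the tarball file name.
sqoop_archive() {
    echo "sqoop-$1.bin__hadoop-$2.tar.gz"
}

# sqoop_url VERSION HADOOP_TAG: print the archive.apache.org download URL.
sqoop_url() {
    echo "http://archive.apache.org/dist/sqoop/$1/$(sqoop_archive "$1" "$2")"
}

sqoop_url "$SQOOP_VERSION" "$HADOOP_TAG"
```

The printed URL can then be passed to `wget`, and the printed archive name to `tar -zxvf`.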
3. Edit sqoop-env.sh
[root@centos conf]# vi sqoop-env.sh
[root@centos conf]# cat sqoop-env.sh
# Licensed to the Apache Software Foundation (ASF) under one or more
# contributor license agreements.  See the NOTICE file distributed with
# this work for additional information regarding copyright ownership.
# The ASF licenses this file to You under the Apache License, Version 2.0
# (the "License"); you may not use this file except in compliance with
# the License.  You may obtain a copy of the License at
#
#     http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.

# included in all the hadoop scripts with source command
# should not be executable directly
# also should not be passed any arguments, since we need original $*

# Set Hadoop-specific environment variables here.

#Set path to where bin/hadoop is available
export HADOOP_COMMON_HOME=/opt/hadoop-2.7.4

#Set path to where hadoop-*-core.jar is available
export HADOOP_MAPRED_HOME=/opt/hadoop-2.7.4

#set the path to where bin/hbase is available
export HBASE_HOME=/opt/hbase-1.2.6

#Set the path to where bin/hive is available
#export HIVE_HOME=

#Set the path for where zookeper config dir is
export ZOOCFGDIR=/opt/zookeeper-3.4.6/conf
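A mistyped keyword in sqoop-env.sh (for example `xport` instead of `export`) is not a shell syntax error, so it only shows up as a "command not found" message when Sqoop sources the file. The sketch below is a simple heuristic check, assuming the file contains nothing but blank lines, comments, and `export` statements as in the listing above; `check_env_file` is a hypothetical helper name.

```shell
#!/bin/sh
# check_env_file FILE
# Print every line that is not blank, not a comment, and does not begin
# with "export ". Exit status is 1 if any such line was found, else 0.
check_env_file() {
    awk '
        /^[ \t]*$/        { next }   # blank line
        /^[ \t]*#/        { next }   # comment
        /^[ \t]*export[ \t]/ { next }   # well-formed export statement
        { printf "line %d: %s\n", NR, $0; bad = 1 }
        END { exit (bad ? 1 : 0) }
    ' "$1"
}
```

A typical call would be `check_env_file /opt/sqoop-1.4.6/conf/sqoop-env.sh`; no output and a zero exit status mean every active line is an `export`.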
4. Configure environment variables in /etc/profile
export SQOOP_HOME=/opt/sqoop-1.4.6
export PATH=$PATH:$SQOOP_HOME/bin
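Instead of editing the profile by hand, the two lines can be appended by a small script that does nothing if `SQOOP_HOME` is already set there. This is a minimal sketch; `add_sqoop_to_profile` is a hypothetical helper, and it takes the profile path as an argument so it can be tried on a scratch file first.

```shell
#!/bin/sh
# add_sqoop_to_profile PROFILE_FILE SQOOP_HOME_DIR
# Append the SQOOP_HOME and PATH exports once; skip if already present.
# The function body runs in a subshell, so its variables stay local.
add_sqoop_to_profile() (
    profile=$1
    sqoop_home=$2
    if grep -q '^export SQOOP_HOME=' "$profile" 2>/dev/null; then
        echo "SQOOP_HOME already set in $profile"
    else
        {
            echo "export SQOOP_HOME=$sqoop_home"
            # Single quotes keep $PATH and $SQOOP_HOME unexpanded in the file.
            echo 'export PATH=$PATH:$SQOOP_HOME/bin'
        } >> "$profile"
    fi
)
```

Run as root, `add_sqoop_to_profile /etc/profile /opt/sqoop-1.4.6` followed by `. /etc/profile` makes the `sqoop` command available in the current shell.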
IV. Upload the MySQL Driver
Place the MySQL JDBC driver jar in Sqoop's lib directory:
[root@centos lib]# ll mysql-connector-java-5.1.30-bin.jar
-rw-r--r-- 1 root root 954041 Mar 11  2014 mysql-connector-java-5.1.30-bin.jar
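Before running an import against MySQL, it can help to confirm that the driver jar really is where Sqoop will look for it. The sketch below assumes the driver follows the `mysql-connector-java-*.jar` naming shown above and that the lib directory is `$SQOOP_HOME/lib`; `has_mysql_driver` is a hypothetical helper name.

```shell
#!/bin/sh
# has_mysql_driver DIR
# Succeed (exit status 0) if DIR contains a MySQL Connector/J jar.
has_mysql_driver() {
    for jar in "$1"/mysql-connector-java-*.jar; do
        # If nothing matches, the pattern stays literal and -f is false.
        if [ -f "$jar" ]; then
            return 0
        fi
    done
    return 1
}
```

For example, `has_mysql_driver /opt/sqoop-1.4.6/lib || echo "MySQL driver jar is missing"` prints a message only when no driver is found.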
V. Quick Test
The first line of each output below, "xport: command not found", comes from line 26 of the sqoop-env.sh used for this run, where export was mistyped as xport on the HADOOP_MAPRED_HOME line; correcting the typo removes the message. The HCatalog, Accumulo, and ZooKeeper warnings are harmless if those components are not used.
[root@centos lib]# sqoop help
/opt/sqoop-1.4.6/conf/sqoop-env.sh: line 26: xport: command not found
Warning: /opt/sqoop-1.4.6/../hcatalog does not exist! HCatalog jobs will fail.
Please set $HCAT_HOME to the root of your HCatalog installation.
Warning: /opt/sqoop-1.4.6/../accumulo does not exist! Accumulo imports will fail.
Please set $ACCUMULO_HOME to the root of your Accumulo installation.
Warning: /opt/sqoop-1.4.6/../zookeeper does not exist! Accumulo imports will fail.
Please set $ZOOKEEPER_HOME to the root of your Zookeeper installation.
18/01/01 11:52:08 INFO sqoop.Sqoop: Running Sqoop version: 1.4.6
usage: sqoop COMMAND [ARGS]

Available commands:
  codegen            Generate code to interact with database records
  create-hive-table  Import a table definition into Hive
  eval               Evaluate a SQL statement and display the results
  export             Export an HDFS directory to a database table
  help               List available commands
  import             Import a table from a database to HDFS
  import-all-tables  Import tables from a database to HDFS
  import-mainframe   Import datasets from a mainframe server to HDFS
  job                Work with saved jobs
  list-databases     List available databases on a server
  list-tables        List available tables in a database
  merge              Merge results of incremental imports
  metastore          Run a standalone Sqoop metastore
  version            Display version information

See 'sqoop help COMMAND' for information on a specific command.
[root@centos lib]# sqoop version
/opt/sqoop-1.4.6/conf/sqoop-env.sh: line 26: xport: command not found
Warning: /opt/sqoop-1.4.6/../hcatalog does not exist! HCatalog jobs will fail.
Please set $HCAT_HOME to the root of your HCatalog installation.
Warning: /opt/sqoop-1.4.6/../accumulo does not exist! Accumulo imports will fail.
Please set $ACCUMULO_HOME to the root of your Accumulo installation.
Warning: /opt/sqoop-1.4.6/../zookeeper does not exist! Accumulo imports will fail.
Please set $ZOOKEEPER_HOME to the root of your Zookeeper installation.
18/01/01 11:52:54 INFO sqoop.Sqoop: Running Sqoop version: 1.4.6
Sqoop 1.4.6
git commit id c0c5a81723759fa575844a0a1eae8f510fa32c25
Compiled by root on Mon Apr 27 14:38:36 CST 2015
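For scripted checks, the version number can be pulled out of the `sqoop version` output shown above. This is a small sketch that assumes the output contains a line of the form `Sqoop <version>`, as in this 1.4.6 run; `sqoop_version_from` is a hypothetical helper name.

```shell
#!/bin/sh
# sqoop_version_from
# Read `sqoop version` output on stdin and print only the version number
# taken from the first line that starts with "Sqoop ".
sqoop_version_from() {
    sed -n 's/^Sqoop \([0-9][0-9.]*\).*$/\1/p' | head -n 1
}
```

A call such as `sqoop version 2>&1 | sqoop_version_from` would then print just the version, which is easy to compare in an install script.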