Friday, September 12, 2014

Apache Sqoop - Part 2: Sqoop Installation

Prerequisites:

1) At a minimum, core Hadoop must already be installed on your system.

2) Verify the HDP repositories are available.
     yum list sqoop

3) The output should list at least one Sqoop package similar to the following:
     sqoop_2_3_4_0_3485-1.4.6.2.3.4.0-3485.el6.noarch
     sqoop_2_2_0_0_2041-metastore-1.4.5.2.2.0.0-2041.el6.noarch
     sqoop_2_3_4_47_1-metastore-1.4.6.2.3.4.47-1.el6.noarch
     sqoop_2_2_0_0_2041-1.4.5.2.2.0.0-2041.el6.noarch
     sqoop-1.4.5.2.2.0.0-2041.el6.noarch
     sqoop_2_3_4_0_3485-metastore-1.4.6.2.3.4.0-3485.el6.noarch
     sqoop_2_3_4_47_1-1.4.6.2.3.4.47-1.el6.noarch
     sqoop-metastore-1.4.5.2.2.0.0-2041.el6.noarch
 
4) Installing Sqoop
    On RHEL/CentOS:
     yum install sqoop
    Install the Sqoop client on all the machines.

5) Verify the installation
     rpm -qa | grep sqoop
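
If you want the verification step to be scriptable, here is a minimal sketch. The function name verify_sqoop is hypothetical (it is not part of Sqoop or HDP); it simply reads package names on standard input, prints the ones that start with "sqoop", and fails if there are none.

```shell
# verify_sqoop: hypothetical helper, not shipped with Sqoop.
# Reads package names on stdin (for example from `rpm -qa`),
# prints the Sqoop ones, and returns non-zero if none are found.
verify_sqoop() {
    if pkgs=$(grep '^sqoop'); then
        printf 'Installed Sqoop packages:\n%s\n' "$pkgs"
    else
        echo "No Sqoop package found" >&2
        return 1
    fi
}
```

Usage: rpm -qa | verify_sqoop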

6) Place the Teradata/HDP drivers in /usr/hdp/2.3.4.0-3485/sqoop/lib
Ex:
     -rwxr-xr-x 1 root root  944586 Apr 28 18:08 terajdbc4.jar
     -rwxr-xr-x 1 root root 1691339 Apr 28 18:09 teradata-connector-1.3.2-hadoop210.jar
     -rwxr-xr-x 1 root root    2405 Apr 28 18:09 tdgssconfig.jar
     -rw-r--r-- 1 root root   14146 Apr 28 18:09 opencsv-2.3.jar
     -rw-r--r-- 1 root root  585020 Aug 10 22:18 sqljdbc4.jar
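
A quick way to confirm that the driver jars actually landed in the lib directory is a small check like the one below. This is only a sketch: missing_jars is a hypothetical helper name, and the list of jars you pass to it is whatever your own setup needs.

```shell
# missing_jars: hypothetical helper, not part of Sqoop.
# Usage: missing_jars DIR JAR...
# Prints the name of every listed jar that is not a regular file in DIR.
missing_jars() {
    dir=$1
    shift
    for jar in "$@"; do
        [ -f "$dir/$jar" ] || echo "$jar"
    done
}
```

Usage: missing_jars /usr/hdp/2.3.4.0-3485/sqoop/lib terajdbc4.jar tdgssconfig.jar
Empty output means every jar named on the command line is present.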

7) Have fun :)

Error:
15/06/11 15:47:33 ERROR orm.CompilationManager: Sqoop requires a JDK that can compile Java code.
15/06/11 15:47:33 ERROR orm.CompilationManager: Please install a JDK and set $JAVA_HOME to use it.
15/06/11 15:47:33 ERROR tool.ImportTool: Encountered IOException running import job: java.io.IOException: Could not start Java compiler.
Solution: The JDK is missing. Install a JDK, set $JAVA_HOME to point to it, and try again.

Validation:
[abc@xyz ~]$ rpm -qa | grep jdk
java-1.7.0-openjdk-1.7.0.71-2.5.3.1.el6.x86_64
java-1.7.0-openjdk-devel-1.7.0.71-2.5.3.1.el6.x86_64
[abc@xyz ~]$
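
Since the error message asks for both a JDK and $JAVA_HOME, it can help to check the two together before running an import. The sketch below assumes the usual JDK layout, where the compiler lives at $JAVA_HOME/bin/javac; check_jdk is a hypothetical helper name, not a Sqoop command.

```shell
# check_jdk: hypothetical helper, not part of Sqoop.
# Succeeds only if JAVA_HOME is set and contains an executable bin/javac
# (assumes the standard JDK directory layout).
check_jdk() {
    if [ -z "${JAVA_HOME:-}" ]; then
        echo "JAVA_HOME is not set" >&2
        return 1
    fi
    if [ ! -x "$JAVA_HOME/bin/javac" ]; then
        echo "No javac found under $JAVA_HOME/bin" >&2
        return 1
    fi
    echo "JDK found: $JAVA_HOME/bin/javac"
}
```

Usage: check_jdk && sqoop version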

