Running an external jar from Aws Hadoop
Hadoop AWS
- First select the services and click on EMR from Analytics.
- Then click on the add cluster.
- Fill the Details of Cluster.
- Cluster name as Ananthapur-jntu
- Here we are checking the Logging
- Browse the s3 folder with the amar2017/feb
- Launch mode should be Step Extension
- After that select step type as custom jar and click on configure.
- The below image is showing the details.
- After clicking on the configure button we will see the popup like shown below
- Name as Custom JAR
- Jar location should be s3://amar2017/inputJar/wcount.jar
- Fill the Arguments with org.myorg.wordcount, s3://amar2017/deleteme.txt, s3://amar2017/output3
- Select the Action on failure as Terminate cluster
- Then click on add button.
- How to fill the details as shown below.
- Software configuration
- Select vendor as Amazon
- select Release as emr-5.3.1
- Hardware Configuration
- select instance type as m1.medium
- And number of instanses as 3
- Security and access
- Permissions checking as default
- After that click on create cluster button
- Details as shown in the below image
- You will see the Cluster Ananthapur is Starting as shown below.
- The below image is showing that in cluster list Ananthapur is starting.
- After complishing the process we can see like below image.
- To see the result of AWS Hadoop go to the services.
- Select S3 under storage.
- After clicking on S3
- select amar2017
- Anad then select output3 folder
- You will see the list of files as shown below
- Open the part-r-00000 file
- You will see the page as shown below
- Click on the Download button
- And Open the downloaded file you will see the result as shown below.