Manage Files on HDFS with Ambari Files View

In the previous tutorial, we learned to manage files on the Hadoop Distributed File System (HDFS) using the command line. Now we’ll use Ambari Files View to perform many of the same file management operations on HDFS that we learned with the CLI, but through a web-based interface.

Prerequisites

  • Downloaded and deployed the Hortonworks Data Platform (HDP) Sandbox
  • Learning the Ropes of the HDP Sandbox

Outline

  • Download the Motorists Related Datasets
  • Create a Directory in HDFS, Upload a File and List Contents
  • Find Out Space Utilization in an HDFS Directory
  • Download Files from HDFS to Local Machine
  • Explore Two Advanced Features
  • Summary
  • Further Reading

Download the Motorists Related Datasets

We’ll download the geolocation.csv and trucks.csv data onto our local filesystems from the sandbox. The instructions are targeted at Mac and Linux users.

1. Open a terminal on your local machine and SSH into the sandbox:

ssh root@sandbox-hdp.hortonworks.com -p 2222

Note: If you are on VMware or Docker, make sure that you map the sandbox IP to the correct hostname in your hosts file. See Map your Sandbox IP.
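
For reference, a typical hosts entry might look like the following. This is a sketch only: 127.0.0.1 assumes a port-forwarded VirtualBox or Docker setup, so replace it with whatever IP your hypervisor actually assigned.

#Example /etc/hosts entry (the IP shown is an assumption; use your sandbox's IP)
127.0.0.1 sandbox-hdp.hortonworks.com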

2. Open another terminal, change your current directory to Downloads, then run the following commands to download the geolocation.csv and trucks.csv files. We’ll use them as we learn file management operations.

#Change your current directory to Downloads

cd Downloads

#Download geolocation.csv

wget https://github.com/hortonworks/data-tutorials/raw/master/tutorials/hdp/manage-files-on-hdfs-via-cli-ambari-files-view/assets/motorists-datasets/geolocation.csv

#Download trucks.csv

wget https://github.com/hortonworks/data-tutorials/raw/master/tutorials/hdp/manage-files-on-hdfs-via-cli-ambari-files-view/assets/motorists-datasets/trucks.csv

#Create directory for motorists-datasets

mkdir motorists-datasets

#Move the geolocation and trucks csv files into the directory

mv geolocation.csv trucks.csv motorists-datasets/
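
To confirm that both csv files landed in the new directory, list its contents:

#Verify the downloads (optional)
ls -lh motorists-datasets/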

Create a Directory in HDFS, Upload a File and List Contents

Create Directory Tree in User

1. Log in to the Ambari interface at sandbox-hdp.hortonworks.com:8080. Use the login credentials in Table 1.

Table 1: Ambari Login credentials

Username | Password
admin    | **setup process

Setup Ambari Admin Password Manually

2. Now that we have admin rights, we can manage files on HDFS using Files View. Hover over the Ambari Selector Icon ambari_selector_icon, and enter the Files View web interface.

files_view

The Files View interface will appear with the following default folders.

files_view_web_interface

3. We’ll create three folders using the Files View web interface: hadoop, geolocation and trucks. The last two will live in the hadoop folder, which resides in user.

Navigate into the user folder. Click the new folder button new_folder_button; when the add new folder window appears, name the folder hadoop. Press enter or +Add.

folder_name

4. Navigate into the hadoop folder. Create the two folders geolocation and trucks following the process from the previous instruction.

hadoop_internal_folders
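
For reference, the same directory tree could be created from the sandbox shell with the HDFS CLI covered in the previous tutorial. This is a minimal sketch and assumes your user has write permission under /user:

#CLI equivalent: -p creates the parent hadoop folder if it does not exist yet
hdfs dfs -mkdir -p /user/hadoop/geolocation /user/hadoop/trucks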

Upload Local Machine Files to HDFS

We’ll upload two files from our local machine, geolocation.csv and trucks.csv, to the appropriate HDFS directories.

1. Navigate through the path /user/hadoop/geolocation, or if you are already in hadoop, enter the geolocation folder. Click the upload button upload-button to transfer geolocation.csv into HDFS.

An Upload file window appears:

upload_file_window

2. Click the cloud with an arrow. A window with files from your local machine appears. Find geolocation.csv in the Downloads/motorists-datasets folder, select it and then press the Open button.

geolocation_csv

3. In Files View, go to the hadoop folder and enter the trucks folder. Repeat the upload process to upload trucks.csv.

trucks_csv
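
For comparison, the same uploads could be done without Files View. This is a rough sketch and assumes the csv files are first copied onto the sandbox (for example with scp) before being put into HDFS:

#Copy the files from the local machine to the sandbox
scp -P 2222 motorists-datasets/*.csv root@sandbox-hdp.hortonworks.com:~

#Then, from the sandbox shell, put each file into its HDFS directory
hdfs dfs -put geolocation.csv /user/hadoop/geolocation/
hdfs dfs -put trucks.csv /user/hadoop/trucks/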

View and Examine Directory Contents

Every time we open a directory, Files View automatically lists the contents. Earlier we started in the user directory.

1. Let’s navigate back to the user directory to examine the details provided for its contents. Reference the image below as you read the Directory Contents Overview.

Directory Contents Overview

  • Name is the name of the file or folder
  • Size contains the size of the contents in bytes
  • Last Modified includes the date/time the content was created or last modified
  • Owner is who owns that content
  • Group is which group can make changes to the files/folders
  • Permissions establishes who can read, write and execute the data

files_view_web_interface
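
The same columns appear on the command line. For instance, listing the user directory from the sandbox shell shows permissions, owner, group, size, modification date and name for each entry:

#List directory contents with the same details Files View displays
hdfs dfs -ls /user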

Find Out Space Utilization in an HDFS Directory

On the command line, when directories and files are listed with hadoop fs -du /user/hadoop/, the size of each directory and file is shown. In Files View, we have to navigate to a file to see its size; we are not able to see the size of a directory, even when it contains files.
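
For example, from the sandbox shell (the -h flag is optional and prints human-readable sizes):

#Show the space used by each item under /user/hadoop
hadoop fs -du -h /user/hadoop/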

Let’s view the size of the geolocation.csv file. Navigate through /user/hadoop/geolocation. How much space has the file used? Files View shows 514.3 KB for geolocation.csv.

geolocation_csv

Download File From HDFS to Local Machine

Files View enables users to download files and folders to their local machine with ease.

Let’s download the geolocation.csv file to our computer. Click the file’s row; the row turns blue and several file operations appear. Select the Download button. By default, the file downloads to the Downloads folder on our local machine.

download_file_hdfs_local_machine
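
Note that the CLI equivalent, -get, copies a file out of HDFS onto the filesystem of whatever machine runs the command, which in our case is the sandbox rather than our laptop:

#CLI sketch: copy geolocation.csv from HDFS to the sandbox's local filesystem
hdfs dfs -get /user/hadoop/geolocation/geolocation.csv .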

Explore Two Advanced Features

Concatenate Files

File concatenation merges two files together. If we concatenate trucks.csv with geolocation.csv, the data from geolocation.csv will be appended to the end of trucks.csv. A typical use case for this feature is when a user has similar large datasets that they want to merge together. The manual process of combining large datasets is inconvenient, so file concatenation was created to perform the operation automatically.

1. Before we merge the csv files, we have to place them in the same folder. Click the geolocation.csv row; it will highlight in blue. Then press copy, and when the Copy to window appears, select the trucks folder and press Copy to copy the csv file into it.

copy_to_trucks

2. We’ll merge the two files by selecting both of them and performing the concatenate operation. Go to the trucks folder. Select geolocation.csv, hold shift and click trucks.csv. Click the concatenate button. The merged file will be downloaded to the Downloads folder on your local machine.

concatenate_csv_files

3. By default, Files View saves the merged file as a txt file. We can open the file and save it as a csv file. Then open the csv file and you will see that all the content from geolocation is appended to the trucks file.
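
A similar result is available on the CLI with -getmerge, which concatenates every file under an HDFS path into a single file on the local filesystem. A minimal sketch, run from the sandbox shell after the copy in step 1:

#Merge all files in /user/hadoop/trucks into one local file
hdfs dfs -getmerge /user/hadoop/trucks merged.csv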

Copy Files or Directories Recursively

Copying files or directories recursively means that all of a directory’s files and subdirectories, down to the bottom of the directory tree, are copied. For example, we’ll copy the hadoop directory and all of its contents to a new location within our hadoop cluster. In production, the copy operation is used to copy large datasets within a hadoop cluster or between two or more clusters.

1. Go to the user directory. Click the row of the hadoop directory. Select the Copy button copy_button.

2. The Copy to window will appear. Select the tmp folder; the row will turn blue. If you select the folder icon, the contents of tmp come into view. Make sure the row is highlighted blue to perform the copy. Click the blue Copy button to copy the hadoop folder recursively to this new location.

copy_hadoop_to_tmp

3. A new copy of the hadoop folder and all of its contents can be found in the tmp folder. Navigate to tmp for verification. Check that all of the hadoop folder’s contents copied successfully.

hadoop_copied_to_tmp
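
For reference, the recursive copy can also be done from the shell. The -cp command copies directories recursively within a cluster; between clusters, distcp is the usual tool. The distcp hostnames below are placeholders, not values from this tutorial:

#Recursively copy the hadoop directory to /tmp within the cluster
hdfs dfs -cp /user/hadoop /tmp

#Sketch of an inter-cluster copy (replace nn1/nn2 with real NameNode hosts)
hadoop distcp hdfs://nn1:8020/user/hadoop hdfs://nn2:8020/tmp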

Summary

Congratulations! We just learned to use Files View to manage our geolocation.csv and trucks.csv dataset files in HDFS. We learned to create, upload and list the contents of our directories. We acquired the skills to download files from HDFS to our local file system and explored a couple of advanced features of HDFS file management.

Further Reading

  • HDFS Overview

Resource: https://cloudera.com/tutorials/manage-files-on-hdfs-via-cli-ambari-files-view/
