Tag: hadoop

Running Pig

Pig contains multiple modes that can be specified to configure how Pig scripts and Pig statements will be executed. Execution Modes Pig has two execution modes: local and MapReduce.┬áRunning …

Working with Snakebite in Python

Snakebite is a Python package, created by Spotify, that provides a Python client library, allowing HDFS to be accessed programmatically from Python applications. The client library uses protobuf messages …

HDFS Command Reference

The commands demonstrated in this section are the basic file operations needed to begin using HDFS. Below is a full listing of file manipulation commands possible with hdfs dfs. …

Interacting with HDFS

Interacting with Hadoop Distributed File System (HDFS) is primarily performed from the command line using the script named hdfs. The hdfs script has the following usage: $ hdfs COMMAND …