BROADSoftware/hdfs_modules

hdfs_modules

This Ansible role hosts a set of modules for manipulating files and directories on HDFS (the Hadoop Distributed File System).

  • hdfs_file: Equivalent of the Ansible files/file module, but for HDFS. Doc at this location

  • hdfs_info: Equivalent of the Ansible files/info module, but for HDFS. Doc at this location

  • hdfs_cmd: Equivalent of the Ansible commands/command module, but for HDFS. Doc at this location

Requirements

These modules require the python-requests package to be installed on the remote node.

To access a Kerberos-protected cluster, python-requests-kerberos is also required.
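The requirements above could be installed by a small play run before the role is used. This is only a sketch: it assumes the target distribution ships packages under exactly the names quoted above (python-requests and python-requests-kerberos) and that the generic Ansible package module is available; adjust the names for your platform.

```yaml
- hosts: edge_node1
  become: yes
  tasks:
  # Assumption: the distribution's package name matches the one quoted in Requirements.
  - package: name=python-requests state=present

  # Only needed when the cluster is protected by Kerberos.
  - package: name=python-requests-kerberos state=present
```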

Example Playbook

- hosts: edge_node1
  roles:
  - hdfs_modules
  tasks:
  # Create a directory if it does not exist.
  # If it already exists, adjust owner, group and mode where they differ.
  - hdfs_file: hdfs_path=/user/joe/some_directory owner=joe group=users mode=0755 state=directory

  # Copy a file from the file system of the targeted host to HDFS.
  # hdfs_creates names the HDFS path produced by the command.
  - hdfs_cmd: cmd="sudo -u joe hdfs dfs -put /etc/services /user/joe/services" hdfs_creates=/user/joe/services uses_shell=True

  # Then adjust owner, group and mode on the file
  - hdfs_file: hdfs_path=/user/joe/services owner=joe group=users mode=0644

  # It is better, however, to use hdfs_put for this copy
  - hdfs_put: src=/etc/services hdfs_dest=/user/joe/services owner=joe group=users mode=0644

  # hdfs_put can also copy a folder recursively
  - hdfs_put: src=/etc/hadoop/conf/ hdfs_dest=/user/joe/some_directory hdfs_user=joe
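For readers who prefer YAML dictionary syntax over the inline key=value form, the first task of the playbook might be written as follows. This is a sketch that reuses only the parameters shown above; the task name is illustrative, and quoting the mode assumes that hdfs_file accepts it as a string, which has not been checked against the module.

```yaml
- hosts: edge_node1
  roles:
  - hdfs_modules
  tasks:
  # Same directory task as in the example playbook, in dictionary syntax.
  - name: Ensure the directory exists on HDFS
    hdfs_file:
      hdfs_path: /user/joe/some_directory
      owner: joe
      group: users
      # Quoted so that YAML does not interpret the mode as a number
      # (assumption: the module accepts a string here).
      mode: "0755"
      state: directory
```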

License

GNU GPL

Click on the Link to see the full text.
