hdfs_modules

hdfs_modules

This ansible role host a set of modules aimed to manipulate file and directory on HDFS (The Hadoop Distributed File System).

  • hdfs_file: Equivalent of the ansible files/file module, but on HDFS. Doc at this location

  • hdfs_info: Equivalent of the ansible files/info module, but on HDFS. Doc at this location

  • hdfs_cmd: Equivalent of the ansible commands/command module, but on HDFS. Doc at this location

Requirements

These modules need the python-requests package to be present on the remote node.

To be able to access kerberos protected cluster, python-requests-kerberos is also required

Example Playbook

- hosts: edge_node1
  roles:
  - hdfs_modules
  tasks:
  # Create a directory if it does not exist.
  # If already existing, adjust owner, group and mode if different.
  - hdfs_file: hdfs_path=/user/joe/some_directory owner=joe group=users mode=0755 state=directory

  # How to copy a file from the file system of the targeted host to HDFS
  - hdfs_cmd: cmd="sudo -u joe hdfs dfs -put /etc/services /user/joe/services" hdfs_creates=/user/joe/passwd1 uses_shell=True

  # And ajust owner, group and mode on the file
  - hdfs_file: hdfs_path=/user/joe/passwd1 owner=joe group=users mode=0644

  # But, better to use hdfs_put for this copy
  - hdfs_put: src=/etc/services hdfs_dest=/usr/joe/services  owner=joe group=users mode=0644

  # Can also copy a folder recursively
  - hdfs_put: src=/etc/hadoop/conf/ hdfs_dest=/usr/joe/some_directory hdfs_user=joe

License

GNU GPL

Click on the Link to see the full text.

About

HDFS aware Files-like modules (info, file and command)

Install
ansible-galaxy install BROADSoftware/hdfs_modules
GitHub repository
License
gpl-3.0
Downloads
810