NGP IRIS, or just Iris, is a tool for interacting with a Hitachi Content Platform (HCP) using S3 in the boto3
package. NGP Iris is designed with two use cases in mind:
- A simple, clear, real-time interaction with NGPr file management
- Improving process flow for performing off-site data analysis by using automated transfer scripts
Both of these cases can be achieved as either a Python package or as a Command Line Interface (CLI).
- Python 3.11.51
- NGPr credentials (see "NGPr credentials")
Iris can be installed via PyPi by running the following:
pip install NGPIris
If you wish, you can also install Iris with the following steps:
- Clone this repository
- Open a terminal in your local copy of the repository
- Run
pip install .
. This will install Iris along with the required Python packages in your Python environment
In order to use Iris, a JSON file containing your credentials for the NGPr. The template of the JSON file can be found in credentials/credentials_template.json. Depending on your needs, you can either enter only the credentials for the HCP, only for the HCI or both. Do note that you can't leave some parts of either the HCP or HCI credentials empty:
{
"hcp" : {
"endpoint" : "some_endpoint",
"aws_access_key_id" : "",
"aws_secret_access_key" : ""
},
"hci" : {
"username" : "some_user",
"password" : "some_password",
"address" : "some_address",
"auth_port" : "some_auth_port",
"api_port" : "some_api_port"
}
}
This will prompt Iris to complain about incomplete credentials (since the entries aws_access_key_id
and aws_secret_access_key
are empty). Of course, the same error would occur if the reverse between the HCP and HCI fields would be true.
NOTE: the endpoint
field should not contain https://
or any port number.
A thorough package documentation can be found in the technical documentation page.
Iris can be used as a Python package or by using the command line. The following sections cover some examples of how Iris might be used as a package and how to use its various commands. However, we highly recommend checking out the tutorial containing more example use cases.
In order to connect to the HCP, we first need to create an HCPHandler
object and mount it to some bucket:
from NGPIris.hcp import HCPHandler
hcp_h = HCPHandler("credentials.json")
hcp_h.mount_bucket("myBucket")
If you are unsure which buckets you are allowed to see, you can use hcp_h.list_buckets()
in order to list all available buckets to you.
When you have successfully mounted a bucket, you can then do different operations onto the bucket. Object names on the bucket can be listed by typing print(hcp_h.list_objects(True))
.
# Upload a single file to HCP
hcp_h.upload_file("myFile")
# Upload folder contents to HCP
hcp_h.upload_folder("myFiles/")
# Download a single object from HCP
hcp_h.download_file("myFile")
In order to connect to the HCI, we first need to create an HCIHandler
object and request an authorization token:
from NGPIris.hci import HCIHandler
hci_h = HCIHandler("credentials.json")
hci_h.request_token()
Note that the token is stored inside of the HCIHandler
object called hci_h
. We can now request a list of indexes that are available by typing print(hci_h.list_index_names())
. We can also look up information about a certain index with print(hci_h.look_up_index("myIndex"))
. It is recommended to combine the use of the pretty print module pprint
and the json
module for this output, as it is mostly unreadable otherwise:
from NGPIris.hci import HCIHandler
from pprint import pprint
import json
hci_h = HCIHandler("credentials.json")
hci_h.request_token()
pprint(
json.dumps(
hci_h.look_up_index("myIndex"),
indent = 4
)
)
The utils
module can be contains two functions: one for converting a string to base64
encoding and one for MD5
encoding.
NGP Iris comes with two commands: iris
and iris_generate_credentials_file
. The latter command is used solely to generate the .json
credentials file. Running iris_generate_credentials_file --help
we get the following:
Usage: iris_generate_credentials_file [OPTIONS]
Generate blank credentials file for the HCI and HCP.
WARNING: This file will store sensisitve information (such as passwords) in
plaintext.
Options:
--path TEXT Path for where to put the new credentials file
--name TEXT Custom name for the credentials file
--help Show this message and exit.
The iris
command is used for communicating with the HCP and HCI. This includes upload and download to and from the HCP/HCI. Running iris --help
yields the following:
Usage: iris [OPTIONS] CREDENTIALS COMMAND [ARGS]...
NGP Intelligence and Repository Interface Software, IRIS.
CREDENTIALS refers to the path to the JSON credentials file.
Options:
--version Show the version and exit.
--help Show this message and exit.
Commands:
delete-folder Delete a folder from an HCP bucket/namespace.
delete-object Delete an object from an HCP bucket/namespace.
download Download files from an HCP bucket/namespace.
list-buckets List the available buckets/namespaces on the HCP.
list-objects List the objects in a certain bucket/namespace on the...
simple-search Make simple search using substrings in a...
test-connection Test the connection to a bucket/namespace.
upload Upload files to an HCP bucket/namespace.
Usage: iris CREDENTIALS delete-folder [OPTIONS] FOLDER BUCKET
Delete a folder from an HCP bucket/namespace.
FOLDER is the name of the folder to be deleted.
BUCKET is the name of the bucket where the folder to be deleted exist.
Options:
--help Show this message and exit.
Usage: iris CREDENTIALS delete-object [OPTIONS] OBJECT BUCKET
Delete an object from an HCP bucket/namespace.
OBJECT is the name of the object to be deleted.
BUCKET is the name of the bucket where the object to be deleted exist.
Options:
--help Show this message and exit.
Usage: iris CREDENTIALS download [OPTIONS] OBJECT BUCKET LOCAL_PATH
Download files from an HCP bucket/namespace.
OBJECT is the name of the object to be downloaded.
BUCKET is the name of the upload destination bucket.
LOCAL_PATH is the path to where the downloaded objects are to be stored
locally.
Options:
--help Show this message and exit.
Usage: iris CREDENTIALS list-buckets [OPTIONS]
List the available buckets/namespaces on the HCP.
Options:
--help Show this message and exit.
Usage: iris CREDENTIALS list-objects [OPTIONS] BUCKET
List the objects in a certain bucket/namespace on the HCP.
BUCKET is the name of the bucket in which to list its objects.
Options:
-no, --name-only BOOLEAN Output only the name of the objects instead of all
the associated metadata
--help Show this message and exit.
Usage: iris CREDENTIALS simple-search [OPTIONS] BUCKET SEARCH_STRING
Make simple search using substrings in a bucket/namespace on the HCP.
BUCKET is the name of the bucket in which to make the search.
SEARCH_STRING is any string that is to be used for the search.
Options:
-cs, --case_sensitive BOOLEAN Use case sensitivity?
--help Show this message and exit.
Usage: iris CREDENTIALS test-connection [OPTIONS] BUCKET
Test the connection to a bucket/namespace.
BUCKET is the name of the bucket for which a connection test should be made.
Options:
--help Show this message and exit.
Usage: iris CREDENTIALS upload [OPTIONS] FILE_OR_FOLDER BUCKET
Upload files to an HCP bucket/namespace.
FILE-OR-FOLDER is the path to the file or folder of files to be uploaded.
BUCKET is the name of the upload destination bucket.
Options:
--help Show this message and exit.
Assuming that the repository has been cloned, run the following tests:
pytype
pytest
With force flag -f
:
cd docs/
sphinx-apidoc ../NGPIris/ -o . -F -f
Without force flag
cd docs/
sphinx-apidoc ../NGPIris/ -o . -F
cd docs/
make html
Footnotes
-
Most versions of Python 3 should work ↩