Download from Azure Storage
The download process will take some time. Ensure you have adequate storage to receive the dataset and a stable internet connection during the download. You may choose to copy the files to your preferred cloud storage provider instead of downloading them to your local machine.
Once the dataset is ready for download you will receive an email instructing you to view the details of your dataset. Once you log in you will see instructions for downloading your files from the Azure Blob Storage Container. You will need to use a tool to download the files. There are several tools available. Here are a few we recommend.
AzCopy
AzCopy is a command-line utility designed for copying data to and from Azure Blob, File, and Table storage.
RClone
RClone is a command-line program to manage files on cloud storage. It supports a wide range of cloud storage providers, including Azure Blob Storage.
Azure Storage Explorer
Azure Storage Explorer is a free tool from Microsoft that allows you to easily manage your Azure Storage resources.
Downloading the Files
Once you have selected and set up your preferred tool, you will need to set up a SAS URL with the Read and List privileges at a minimum and container details to download your files. Follow the instructions specific to the tool you are using to authenticate with the SAS URL and download the files from the Azure Blob Storage Container.
- AzCopy
- RClone
- Azure Storage Explorer
AzCopy Example
Detailed instructions on AzCopy, its installation, and use are available from the Official Microsoft Documentation Below is a brief summary of steps you'll need to accomplish your download:
-
Visit the AzCopy documentation page and download the compressed
azcopyexecutable for your respective operating system. These executables are available as compressed zip files for Windows/Mac users and Tarball files for Linux users. -
Extract the contents of the compressed executable in a known location on your local machine's drive.
- Windows and Mac users can use built-in zip file applications or use 3rd-party applications like 7zip for this
- For linux users, please see documentation for your specific distro for instructions on decompressing the provided tarball
- Possible locations for this are: in your home directory; in your machine's C:/ or binary directory
- For ease of use, consider adding the fully-qualified path to the AzCopy directory to your system PATH. If you choose not to do this you will need to execute any AzCopy commands from the directory where AzCopy is installed.
-
Open a terminal or powershell instance and execute the following (or similar):
azcopy copy "https://youraccount.blob.core.windows.net/yourcontainer?...[SAS Token]..." /
"C:\local\path" /
--recursive=true -
Where the https URL is the SAS URI you obtained from the portal and "C:\local\path" is the local path on your target machine.
-
Please allow time for the download to complete. Depending on the amount of data you selected, this will take several hours.
RClone Example
-
Rclone is available for Download as a single executable. Download the executable specific to operating system and follow the [installation instructions] to install rclone.
-
Next you'll need to configure your connection to the Fairhub Portal Azure Blob Storage. Open your terminal or command line program of choice and run:
rclone configthen follow the download instructions . For authentication you will be using the Shared Access Signature URL that you just obtained above; be sure to leave the
accountandkeyconfiguration options blank and fill in thesas_url. Remember the name you give to the Azure Blob Storage configuration, as you will need it in subsequent steps. -
Once Azure Blob Storage configuration is complete, you can either use it for download to your local user system or you can configure an additional connection to any other cloud storage provider. For example, to configure a connection to AWS S3 or a cloud storage provider that is compatible with AWS S3 Backend, follow configuration instructions .
-
Once your configuration is complete, you can download your requested materials by issuing the following command:
rclone copy example-azure-config-name:blob-container-name path/to/local/directoryOr, for transfer to other cloud storage providers:
rclone copy example-azure-config-name:blob-container-name example-destination-provider-config-name:path/to/directory -
Please allow time for the download to complete. Depending on the amount of data you selected, this could take several hours.
Azure Storage Explorer
Once you have installed Azure Storage Explorer you can set up the connection to your storage container.
-
Expand the tree menu under the Explore frame and right click on "Storage Accounts"
Click "Connect to Azure Storage ..." to select your storage resource.

-
Select Blob container or directory

-
Choose the Shared access signature URL (SAS) connection option.

-
Paste in the Blob container or directory SAS URL input a SAS URL that has at a minimum the Read and List privileges. This will start with
https. By default it will use the container name for the Display name. Optionally, provide a memorable Display name.
-
Review the summary and click "Connect" if the details are correct.

-
You can now browse the container. The top level will contain a file created when you verified the connection with fairhub.io and a folder name with your request identifier.

-
Inside the request folder you see the dataset folder and a CSV which contains md5 hashes for all files in your request.

-
Within the dataset folder you will find all the data types requested along with high level AI-READI metadata. You can either choose to download all data or select one or more folders to download. Then click the Download button to start the process.
