Checks prior to download
Is the dataset available for download?
Most datasets are available using our Download client, except for specific legacy datasets listed here. For these ones, please email EGA Helpdesk to request an Aspera download account.
If the dataset you have been granted access to contains encrypted data (filename ending in .gpg), please reach out to Helpdesk for detailed information on how to download it.
PyEGA3: general information
The PyEGA3 download client is available at its GitHub repository. In its README you will find a step by step guide (See the video below) on how to use it to download files you have been granted access to.
**ADD VIDEO TUTORIAL video tutorial **
PyEGA3 implements the GA4GH-compliant HTSGET protocol supporting requests over genomic ranges. This enables the download of specific regions of interest rather than the entire file. Although this is not possible for all the files held by EGA: if the dataset does not contain index files (e.g. filename ending in *.crai), then genomic range requests cannot be performed.
How do I download the datasets to which I have been granted access?
After setting up your EGA download account, you can proceed to download using the EGA download client – PyEGA3. The pyEGA3 download client is a python-based tool for viewing and downloading files from authorized EGA datasets. This download client is continuously being developed for more user-friendly download experiences.
Can I download the dataset’s metadata via the download client?
No, the download client cannot be used to download metadata. Registered EGA users can download metadata of an authorised dataset by logging into the EGA webpage https://ega-archive.org/ and navigating to the dataset of choice. Approximately two thirds down the page you will find the option to download the metadata as a zip file.