SnpEff: Genomic variant annotations and functional effect prediction toolbox

SnpEff is a genetic variant annotation and functional effect prediction toolbox. It annotates genetic variants and predicts their effects on genes and proteins (such as amino acid changes).

For more information on the data, see the User Manual.

Note

Microsoft provides Azure Open Datasets on an “as is” basis. Microsoft makes no warranties, express or implied, guarantees or conditions with respect to your use of the datasets. To the extent permitted under your local law, Microsoft disclaims all liability for any damages or losses, including direct, consequential, special, indirect, incidental or punitive, resulting from your use of the datasets.

This dataset is provided under the original terms under which Microsoft received the source data. The dataset may include data sourced from Microsoft.

Data source

This dataset is a mirror of http://downloads.sourceforge.net/project/snpeff/databases/

Data volumes and update frequency

This dataset contains approximately 2 TB of data and is updated monthly.

Storage location

This dataset is stored in the West US 2 and West Central US Azure regions. Allocating compute resources in West US 2 or West Central US is recommended for affinity.

Data access

West US 2: 'https://datasetsnpeff.blob.core.windows.net/dataset'

West Central US: 'https://datasetsnpeff-secondary.blob.core.windows.net/dataset'

SAS Token: sv=2019-10-10&st=2020-09-01T00%3A00%3A00Z&se=2050-09-01T00%3A00%3A00Z&si=prod&sr=c&sig=isafOa9tGnYBAvsXFUMDGMTbsG2z%2FShaihzp7JE5dHw%3D
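
As a minimal sketch of how the container URLs and the SAS token fit together, the Python snippet below appends the token as a query string to a container or blob URL. The helper names `blob_url` and `list_url` and the blob path in the usage note are hypothetical, chosen only for illustration. The `restype=container&comp=list` query is the Blob service List Blobs REST operation; whether this token's stored access policy permits listing is an assumption that has not been verified here.

```python
from urllib.parse import quote

# Values copied from the data access section above.
PRIMARY_CONTAINER = "https://datasetsnpeff.blob.core.windows.net/dataset"
SECONDARY_CONTAINER = "https://datasetsnpeff-secondary.blob.core.windows.net/dataset"
SAS_TOKEN = (
    "sv=2019-10-10&st=2020-09-01T00%3A00%3A00Z&se=2050-09-01T00%3A00%3A00Z"
    "&si=prod&sr=c&sig=isafOa9tGnYBAvsXFUMDGMTbsG2z%2FShaihzp7JE5dHw%3D"
)


def blob_url(blob_path: str, container: str = PRIMARY_CONTAINER) -> str:
    """Return a SAS-signed URL for a single blob (hypothetical helper)."""
    # Percent-encode the path but keep "/" so virtual folders are preserved.
    encoded_path = quote(blob_path.lstrip("/"))
    return f"{container}/{encoded_path}?{SAS_TOKEN}"


def list_url(container: str = PRIMARY_CONTAINER, prefix: str = "") -> str:
    """Return a SAS-signed URL for a List Blobs request (hypothetical helper)."""
    query = "restype=container&comp=list"
    if prefix:
        # Optional filter: only blobs whose names start with this prefix.
        query += f"&prefix={quote(prefix, safe='')}"
    # The SAS token is already URL-encoded, so it is appended unchanged.
    return f"{container}?{query}&{SAS_TOKEN}"
```

For example, `blob_url("some/folder/file.zip")` yields a URL that can be passed to any HTTP client, and `list_url(SECONDARY_CONTAINER)` targets the West Central US copy. The path `some/folder/file.zip` is a placeholder, not a real blob name in this dataset.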

Use terms

Data is available without restrictions. For more information and citation details, see Accessing and using data in ClinVar.

Contact

For any questions or feedback about this dataset, contact Pablo Cingolani.

Next steps

View the rest of the datasets in the Open Datasets catalog.