1000 Genomes - 1000 Genomes Project

http://www.1000genomes.org/

Program Description: Launched in 2003, the 1000 Genomes Project is the first project to sequence the genomes of a large number of people, to provide a comprehensive resource on human genetic variation. The goal of the 1000 Genomes Project is to find most genetic variants that have frequencies of at least 1% in the populations studied. 1000 Genomes uses the Amazon Web Service (AWS) cloud to share genetic data with researchers everywhere.

Year Started: 2008.

Organization Description: It is a collaboration of numerous public and private organizations the U.S, Canada, EU, UK, China, and the Caribbean

Data Description: Index data is available for download from the NIH/NCBI and ENA; the actual sequence data is hosted by AWS and consists of over 200TB of data. Users are expected to analyze the data using AWS rather than thru downloading.


Project Type: Data Repository and Analysis

Project Domains: Biological Sciences


Budget: ~40 Million US


Program Data

Location Lat/Lon Coordinates Location Type Data Type Data Generation Single Data Instance Size (TB) Estimated Daily Data Size (GB) Estimated Annual Data Size (PB) Average Sustained Throughput (Gbps) Maximum Sustained Throughput (Gbps) Online Repository Size (PB) Total Repository Size (PB) Delay Tolerance (minutes) Jitter Sensitive? Uses the Cloud?
Amazon Tokyo Zone, Japan 35.69,139.69 Data Repository and Analysis Data On demand 200.00 200.00 200.00 - Yes
Amazon Singapore Zone, Singapore 1.35,103.82 Data Repository and Analysis Data On demand 200.00 200.00 200.00 - Yes
Amazon Sydney Zone, Australia -33.87,151.21 Data Repository and Analysis Data On demand 200.00 200.00 200.00 - Yes
Amazon Ireland Zone, Ireland 53.33,-6.25 Data Repository and Analysis Data On demand 200.00 200.00 200.00 - Yes
Amazon Sao Paulo Zone, Brazil -23.55,-46.63 Data Repository and Analysis Data On demand 200.00 200.00 200.00 - Yes
Amazon Ashburn, VA Zone, United States 39.04,-77.49 Data Repository and Analysis Data On demand 200.00 200.00 200.00 - Yes
Amazon California Zone, United States 36.78,-119.42 Data Repository and Analysis Data On demand 200.00 200.00 200.00 - Yes
Amazon Oregon Zone, United States 43.8,-120.55 Data Repository and Analysis Data On demand 200.00 200.00 200.00 - Yes