The Genome Center in Geneva is sequencing a large number of COVID-19 viral genomes. Processing, analyzing, and interpreting this data requires a large amount of computation.
What it does
The system preprocesses raw data into separate samples, then runs reference-based alignment in a highly parallel manner, using containers to scale. Custom applications will be built for sample separation, data conversion, and alignment. Data will be stored as objects in a Ceph cluster.
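To make the sample separation step concrete, here is a minimal C++ sketch. It rests on an assumption the project does not confirm: that each read starts with a fixed-length barcode identifying its sample. The `Read` struct, the `separate_samples` function, and the `"unassigned"` bucket are hypothetical names chosen for illustration, not part of the actual pipeline.

```cpp
#include <cstddef>
#include <map>
#include <string>
#include <vector>

// One sequencing read: an identifier and its base calls.
struct Read {
    std::string id;
    std::string bases;
};

// Hypothetical helper: groups reads by sample, assuming each read begins
// with a fixed-length barcode that identifies its sample. Reads that are
// too short, or whose barcode is not in `barcode_to_sample`, are kept
// unmodified under the key "unassigned".
std::map<std::string, std::vector<Read>> separate_samples(
    const std::vector<Read>& reads,
    const std::map<std::string, std::string>& barcode_to_sample,
    std::size_t barcode_length) {
    std::map<std::string, std::vector<Read>> by_sample;
    for (const Read& read : reads) {
        std::string sample = "unassigned";
        bool matched = false;
        if (read.bases.size() >= barcode_length) {
            auto it = barcode_to_sample.find(read.bases.substr(0, barcode_length));
            if (it != barcode_to_sample.end()) {
                sample = it->second;
                matched = true;
            }
        }
        Read out = read;
        // Strip the barcode only from reads that were assigned to a sample.
        if (matched) {
            out.bases = read.bases.substr(barcode_length);
        }
        by_sample[sample].push_back(out);
    }
    return by_sample;
}
```

Because each resulting group depends only on its own reads, every sample could then be handed to a separate alignment container, which is what would let the alignment step run in parallel.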
We currently have a team in place. If you want to join, you will need strong C++ skills and, preferably, some familiarity with bioinformatics and sequencing.
How I built it
Challenges I ran into
Accomplishments that I'm proud of
What I learned
What's next for Scalable Viral Alignments