DATA SCIENTIST
December 23, 2017 - January 24, 2018
Computer Science and Software Engineering
GRAIL is a life sciences company whose mission is to detect cancer early, when it can be cured. GRAIL is using the power of high-intensity sequencing, population-scale clinical trials, and state-of-the-art Computer Science and Data Science to enhance the scientific understanding of cancer biology and develop blood tests for early-stage cancer detection. We are seeking passionate and talented individuals to join us in realizing our mission, which has the potential to dramatically reduce the global burden of cancer.
POSITION SUMMARY
Our data science team is responsible for cleaning, preparing, and analyzing ever-increasing data sets to identify patterns that enable the early detection of cancer. We deeply understand our data and use those insights to build better methods, pipelines, and assays. As a data scientist, you will build models based on some of the largest, richest biological datasets in the world. Your rigorous analysis will guide our assay and bioinformatic pipeline development. Working closely with scientists, clinicians, and engineers, you will develop new ways to pull signals out of ultra-deep sequencing data and identify cancer at its earliest stages.
TASKS AND RESPONSIBILITIES
- Work with large, complex data sets. Solve difficult, non-routine analysis problems, applying advanced analytical methods as needed. Conduct end-to-end analyses that include design, data gathering, processing, analysis, iteration with stakeholders, and dissemination of results.
- Build and prototype analysis pipelines iteratively to provide insights at scale. Develop comprehensive understanding of relevant biology, assays, data structures, and available features.
- Interact cross-functionally with a wide variety of people and teams, including research, software, clinical, and product development.
MINIMUM QUALIFICATIONS
- 2+ years of relevant work experience in data analysis or a related field (e.g., as a statistician / data scientist / computational biologist / bioinformatician).
PREFERRED BACKGROUND
- PhD degree in a quantitative discipline (e.g., statistics, computational biology, computer science, mathematics, physics, electrical engineering).
- 4+ years of relevant work experience in data analysis or a related field (e.g., as a statistician / data scientist / computational biologist), including deep expertise in stochastic modeling, high-dimensional classification, and/or unsupervised learning methods.
- Experience with next-generation sequencing data analysis (DNA, RNA, or epigenetic analysis).
- Deep experience with a statistical programming language (e.g., R).
- Demonstrated expertise in one programming language (Python, Go, C++, etc.), proficiency in a Linux environment, experience with database languages (e.g., SQL), and experience with version control practices and tools (Git, Perforce, etc.).
- Demonstrated experience with and track record of implementing reproducible research practices.
- Applied experience with machine learning on large datasets.
- Demonstrated effective written and verbal communication skills.
- Demonstrated leadership and self-direction. Demonstrated willingness to both teach others and learn new techniques.