Data Engineer, Translational Genomics
South San Francisco, CA 
Posted 14 days ago
Job Description
The Position

If you are a big data engineer and want to work on something that can truly change the world, this job is for you. Biology is approaching an inflection point where we can directly leverage data to understand the cellular basis of human diseases and, from this, generate therapeutics to treat them. Our Translational Genomics initiative is spearheading this effort, bringing together data from human genetics, functional genomics, molecular biology, disease model engineering, and tissue and cellular profiling.

We need a Data Engineering Lead to help us create a next-generation data engine that scalably and rigorously ingests and transforms the data generated by this initiative so that it is ready for machine-driven analysis. The Data Engineering Lead will act as an architect and engineering manager tasked with overseeing the construction and operation of this data engine.

This data engine will be used to help assemble an exabyte-scale, connected, and computable data universe composed of high-value internally and externally generated data and results, on top of which we can build our data science efforts. Your efforts will therefore directly enable the computational discovery of disease targets and, from these, potentially life-saving therapies.

A person hired in this position will

  • Work on a team that will architect and deliver a next-generation data engine that enables scalable, flexible, and rigorous data transformations using modern data management practices.

  • Help build and deliver data infrastructure that will enable machines to crawl and compute on and across all our data.

  • Work with a cross-functional team of scientists and engineers to design and deliver these solutions.

  • Collaborate across the informatics organization through presentations and joint projects.

Successful candidates will meet many of the following requirements

Must-have requirements

  • A BS in a computational discipline with 8 years of work experience, or a Master's with 5 years of experience.

  • 7+ years of experience architecting and developing scalable pipelines, frameworks, and platforms to power data science efforts in distributed cloud environments, 5 of which are on AWS.

  • Multiple years of experience working on software teams to deliver solutions.

  • Exceptional communication skills.

Nice to haves:

  • Practical understanding of the data management practices required to power rigorous data science and enable advanced analytics like AI & ML.

  • Hands-on experience working with the following technologies, frameworks, and languages: Java, Scala, Python, Spark, Airflow, RabbitMQ, Spring.

  • Experience working on projects focused on omics data.

What to expect from us
  • A highly collaborative and dynamic research environment where we aim to advance the rate of scientific discovery using purposefully built solutions.

  • Access to large multimodal omics datasets focused on disease biology, as well as samples and compute resources.

  • Access to state-of-the-art technologies and pioneering research.

  • Participation in seminar series featuring academic and industry scientists.

  • Campus-like lifestyle with a healthy work-life balance.

  • Mentored opportunities to further develop professional skills.

Who we are

A member of the Roche Group, Genentech has been at the forefront of the biotechnology industry for more than 40 years, using human genetic information to develop novel medicines for serious and life-threatening diseases. We are a research-driven biotechnology company whose medical innovations for cancer and other serious illnesses make a difference for patients across the globe. Please take this opportunity to learn about Genentech, where we believe that our employees are our most important asset and are dedicated to remaining a great place to work.

Genentech is an equal opportunity employer and prohibits unlawful discrimination based on race, color, religion, gender, sexual orientation, gender identity/expression, national origin/ancestry, age, disability, marital status, and veteran status. For more information about equal employment opportunities, visit our Genentech Careers page. The expected salary range for this position, based on the primary location of California, is $130,100 - $241,500. Actual pay will be determined based on experience, qualifications, geographic location, and other job-related factors permitted by law. A discretionary annual bonus may be available based on individual and Company performance. This position also qualifies for the benefits detailed at the link provided below.

Roche is an Equal Opportunity Employer & prohibits unlawful discrimination based on race, color, religion, gender, sexual orientation, gender identity/expression, national origin/ancestry, age, disability, marital & veteran status.

 

Job Summary
Company
Start Date: As soon as possible
Employment Term and Type: Regular, Full Time
Required Education: Bachelor's Degree
Required Experience: 8+ years