Principal Software Engineer, Data Platform

  • Ancestry
  • Oct 01, 2021

Job Description

About Ancestry:When you join Ancestry, you join a human-centered company where every person’s story is important. We believe that by discovering the struggles and triumphs of our past, we can foster deeper bonds and more meaningful connections among families and communities. Our talented team of scientists, engineers, genealogists, historians, and storytellers is dedicated to empowering customers around the world from all backgrounds on their journeys of personal discovery.
With more than 27+ billion digitized global historical records, 100 million family trees, and 18 million people in our growing AncestryDNA database, Ancestry helps customers discover their family story and gain a new level of understanding about their lives. Passionate about dedicating your work to enriching people’s lives? You belong at Ancestry.
We are looking for an accomplished Principal Data Engineer to join our team in Lehi, UT. This is an opportunity to work with unique, large datasets. You will take a lead role in furthering the big data footprint at Ancestry and work with Business Intelligence, Data Infrastructure, and Data Services teams in developing and maturing our data systems and architecture.

What You Will Do...

Be a technical lead on the data platform team, responsible for scaling the platform to meet Ancestry's data growth.Deploy real-time automated data streams from numerous sources into the data platform.Develop data auditing strategies to ensure data accuracy and integrity.Deploy IAC (Infrastructure as Code) to lay down the infrastructure that the data pipelines use.

Who You Are...

Proficient in Java/Scala, or Python with 5+ years of experience in an enterprise environmentBS degree in Computer Science or related field required, MS preferredExpert in Big Data ecosystems, including KafkaExpertise in building and deploying streaming Spark solutions in AWSProficiency in database technologies MySQL (Aurora), Redshift or equivalentExpert with Terraform, CloudFormation, or other infrastructure as code toolMastery of one of the following data formats Parquet, AVRO, ORCExperience with Test Driven Code Development, SCM tools such as GIT, SVN, Jenkins build and deployment automationExperience implementing open source technologiesRESTful web service developmentExperience with HBase or comparable NoSQLStrong grasp of algorithms and data structuresGood familiarity with in Linux/Unix, scripting and administrationExperience with AWS Cloud automated deployments
Additional Information:Ancestry is an Equal Opportunity Employer that makes employment decisions without regard to race, color, religious creed, national origin, ancestry, sex, pregnancy, sexual orientation, gender, gender identity, gender expression, age, mental or physical disability, medical condition, military or veteran status, citizenship, marital status, genetic information, or any other characteristic protected by applicable law. In addition, Ancestry will provide reasonable accommodations for qualified individuals with disabilities.
All job offers are contingent on a background check screen that complies with applicable law.  For San Francisco office candidates, pursuant to the San Francisco Fair Chance Ordinance, Ancestry will consider for employment qualified applicants with arrest and conviction records.  
Ancestry is not accepting unsolicited assistance from search firms for this employment opportunity. All resumes submitted by search firms to any employee at Ancestry via-email, the Internet or in any form and/or method without a valid written search agreement in place for this position will be deemed the sole property of Ancestry. No fee will be paid in the event the candidate is hired by Ancestry as a result of the referral or through other means.