Siri Data Engineer

Siri Data Engineer

Santa Clara Valley, CA 2016-10-29 - –

Apple Apple

Siri is seeking skilled Data Engineer to join the data science team. This person would build the vision for data infrastructure and BI tools, work with engineers and data scientists to optimize metrics, and establish best practices for data driven product devcopement.

Why is this important to Siri?

The quality of our product is dependent upon data availability and reliability. What you build helps to improve the experience on billions of devices used by millions of people.

Key Qualifications

Responsibilities

Work closely with data scientists, analyst and engineers to design and maintain scalable data models and pipelines

Develop and maintain cross-platform ETL processes

Lead development of architecture and standards for a business metric warehouse

Work with data scientists to develop a scalable data visualization platform

Help data scientists optimize productionized Spark jobs along with Pig and Hive queries.

Implement systems for tracking data quality and consistency

Architect, build and launch new data models that provide intuitive analytics to your customers

Design, build and launch extremely efficient & reliable data pipelines to move data (both large and small amounts) to our ridiculously large Data Warehouse

Design and develop new systems and tools to enable folks to consume and understand data faster

Minimum qualifications:
2+ years of experience developing and maintaining large scale ETL data infrastructures using open source technologies.

2+ years of Spark and Hadoop applications development using MapReduce framework experience is necessary.

2+ years of SQL (MySQL, Oracle, pgSQL, Hive, etc) experience is required. Apache Pig experience is plus.

2+ years of Java and/or Scala development experience is necessary.

Strong experience building software applications using TDD (test driven development).

Preferred qualifications:
BS/MS in Computer Science or a related field (ideal) or 5+ years experience in developing big data applications.

Love to use and develop open source technologies like Spark Hadoop, Hive, Presto, Flume and Kafka.

Strong scripting ability in Scala / Python / Bash

Experience with Java / Scala is preferred

Working with data at the petabyte scale

Bonus Points

Share your Github and show us your magic.

Open source contribution.

Description

The quality of our product is dependent upon data availability and reliability. What you build helps to improve the experience on billions of devices used by millions of people.

The Siri team is looking for an exceptional data engineer to work on internal tools and services that help us to drive the execution for the entire Siri team. The tools you will work on touch every critical team at Apple.

Please include your Github/Bitbucket repository while applying.

Education

Master's or Ph.D. degree in Computer Science, Mathematics, Statistics or equivalent.

To apply for this job please visit tinyurl.com.