Los Gatos, California
Remote, United States
Netflix is the world's leading streaming entertainment service with 204 million paid memberships in over 190 countries enjoying TV series, documentaries, and feature films across a wide variety of genres and languages. Members can watch as much as they want, anytime, anywhere, on any internet-connected screen. Members can play, pause and resume watching, all without commercials or commitments.
We are a data-driven company that strives to make every aspect of our workflow data-aware. Real-Time Data Infrastructure is responsible for building scalable data platforms to enable users from all backgrounds to have easy and timely access to these rich datasets in a streaming fashion. We work on core streaming engines (Flink and Mantis), messaging engines (Kafka), schema infrastructure, and platforms to build robust pipelines connecting various sources to sinks.
About This Role:
This role focuses on building the next-generation Data Movement and Processing Platform to support abstraction on top of complex stream processing concepts such as filters, projections, windowing, sessionization, and deduplication, to name a few. We envision supporting Streaming SQL and architect a high-leverage collaborative platform where users can plug in custom patterns ranging from a simple udf to a complex source/sink connector.
If you are passionate about building data platforms as a service, if designing large-scale distributed systems challenges you, if encapsulating stream processing core concepts as a product excites you, - you would be an excellent fit for this position. You would work with exceptionally talented engineers in a high-impact team building a product that would be used for building critical data movement and processing pipelines to solve a variety of problems ranging from data engineering to machine learning to real-time personalization and many more.
Design and develop the next-generation stream processing platform to support building customized data pipelines.
Design and develop abstraction layers to make real-time stream processing available to broader sets of users
Build a large-scale distributed, scalable, self-service platform that can seamlessly scale to handle trillions of events per day.
Innovate and collaborate with partners to help them build business-critical applications and insights on the real-time data infrastructure.
Work on, and contribute to open-source software and have industry impact.
You have years of experience building high-performance distributed systems solving large-scale engineering challenges.
You have expertise in Java, Scala, or other JVM-based languages
You have worked on or have a deep understanding of messaging/queuing systems or stream processing systems.
You have familiarity with architectures of data stores such as Cassandra and CockroachDB.
You are a good communicator, self-driven and work with and through partner teams.
You are passionate about products and wants to increase the impact and usability of stream processing platforms.
Your experience building a self-service platform with appropriate developer tooling and abstractions is a plus.
Share this listing: