Do what you love. Love what you do.
At Workday, we help the world’s largest organizations adapt to what’s next by bringing finance, HR, and planning into a single enterprise cloud. We work hard, and we’re serious about what we do. But we like to have fun, too. We put people first, celebrate diversity, drive innovation, and do good in the communities where we live and work.
Role & Responsibilities
You will provide architectural leadership for Workday’s Observability Platform and Services. You will be the domain expert across all the pillars of Observability.
You will help ensure Workday MTTx metrics are best in class by leveraging the Observability platform and services.
You will ensure a seamless Observability experience across data domains (logs, metrics, traces, alerts) and different infrastructure types (Virtualized, Kubernetes, Bare metal, etc.)
You will drive standardization of the Observability tooling, develop and publish standards, leverage open source specifications(OpenTelemetry etc)
You will closely collaborate with the Infrastructure and Technology Operations teams.
You will evangelize Observability to all Workday engineering teams. Establish best practices, frameworks, and help onboard engineering teams.
About the Team
You have a BS/MS in Computer Science or a related technical field
You have a strong software engineering background and a proven track record of delivering high quality products at massive scale
You are able to write high quality code (Java/Scala/Go/Python) and be able to design systems that meet the business needs.
You have hands-on expertise in Observability platforms like Prometheus, Cortex, Grafana, Elasticsearch, M3DB, Jaeger/Zipkin etc. Expertise in building systems using these technologies will be a plus.
You have extensive experience with data collection agents, loggers, and visualizing and correlating data sets from multiple domains.
You have experience with real-time alerting, in-stream enrichment and correlation of events, Rules and ML based alerting.
You have deployed Observability across the OSI stack.
You have outstanding presentation skills to both technical and executive audiences.
You have strong communication skills both written and verbal.
The Data Platform and Observability team is based in Pleasanton,CA; Boston,MA and Dublin, Ireland. We enable real time insights across Workday’s platforms, infrastructure and applications. Our focus is on the development of a large scale distributed data platform to support mission critical Workday applications.
The team provides software for collection, ingestion, storage & visualization of critical data assets. We handle 100s of terabytes of data in the form of billions of messages produced daily by Workday applications and underlying services. If you enjoy writing efficient software or tuning and scaling large distributed systems you will enjoy working with us.
Do you want to work on leveraging Workday’s vast computing resources with its rich and extensive datasets? To work with world class engineers and facilitate the development of the Observability data platform? If so, we should chat.
Workday is an Equal Opportunity Employer including individuals with disabilities and protected veterans.