Sr. Site Reliability Engineer, Consumer


March 9, 2021

San Francisco, CA 94103, US

Company Description
Twitter is what’s happening and what people are talking about right now. For us, life's not about a job, it's about purpose. We feel real change starts with conversation. Here, your voice matters. Come as you are and together we'll do what's right (not what's easy) to serve the public conversation.
Job Description

Who We Are

SREs (Site Reliability Engineers) work on improving the availability, scalability, performance, and reliability of Twitter’s production services.
Our core infrastructure receives hundreds of millions of tweets per day and serves tens of billions of API requests. We also serve over 2+ billion search queries per day, render hundreds of millions of ad impressions, and process hundreds of terabytes of log and interaction data daily.
What You’ll Do
You will be dedicated to improving the reliability of our end-to-end platform. Your work will integrate directly with Twitter's products. You will dive deep into gnarly operational issues; from the software, systems, automation, and process perspectives. You will understand the challenges around integrating disparate infrastructures into a new facility, processes, and procedures. You will work with open-source technologies and the wider SRE community and actively participate in the vision to move away from high operational cost tasks such as break/fix, cluster migrations, new service buildouts, abuse, etc. You will contribute to services that can shrink and expand based on demand, self-heal, automatically rollout, etc.

  • You will perform deep dives into both systemic and latent reliability issues; partner with software and systems engineers across the organization to produce and roll out fixes.

  • You will troubleshoot issues across the entire stack: hardware, software, application, and network,

  • You will drive standardization efforts across multiple disciplines and services in conjunction with embedded SREs throughout the organization.

  • You will mentor SREs on standard methodology for everything from monitoring to troubleshooting complex code issues.

  • You will identify and drive opportunities to improve automation for the company; scope and create automation for deployment, management, and visibility of our services.

  • You will participate in code reviews for projects primarily written in Java and Scala, built on open-source libraries such as Finagle, and running on both physical and virtualized platforms.

  • You will represent the SRE organization in design reviews and operational readiness exercises for new and existing services.


Who You Are

  • Solid understanding of systems and application design, including the operational trade-offs of various designs.

  • Practical knowledge of various aspects of service design like messaging protocols & behavior, caching strategies and software design practices.

  • Practical, solid knowledge of shell scripting and at least one higher-level language (Python or Scala preferred).

  • Demonstrable knowledge & experience of Linux servers, specifically RHEL/CentOS, TCP/IP, HTTP, and experience supporting multi-tier web application architectures.

  • Experience in handling services in a large scale environment.

  • Practical experience in Java or Scala.

  • Work well with and be able to influence a myriad of personalities at all levels.

  • Ability to prioritize tasks and work independently.

  • Be adaptable and able to focus on the simplest, most efficient & reliable solutions.

  • Track record of successful practical problem solving, excellent written and social communication, and documentation skills.

  • B.S. in computer science or similar field or equivalent experience.

Desired Qualifications

  • Ability to lead technical teams through design and implementation across an organization.

  • Experience with existing open-source projects such as Scribe, ZooKeeper, and Apache Mesos.

Additional Information

A few other things we value:

  • Challenge - We solve some of the industry’s hardest problems. Come to be challenged, learn, and thrive as an engineer.
  • Diversity - Diversity makes us a better organization and team. We value diverse backgrounds, ideas, and experiences.
  • Work, Life, Balance - We work hard, but we believe with hard work should come balance.

We will ensure that individuals with disabilities are provided a reasonable accommodation to participate in the job application or interview process, to perform essential job functions, and to receive other benefits and privileges of employment. Please contact us to request an accommodation.