Site Reliability Engineer


April 29, 2021

We are looking for a Site Reliability Engineer to join our team of specialists. This role responsibilities includes operating, monitoring and maintaining the production systems used by our customers and provide Tier 1 maintenance and support.
This position requires working in shifts to fulfill 24X7X365 support to mission critical systems.
What you'll be doing:

  • Monitor equipment, applications and processes through various tools applications and consoles

  • Identify problems and decide on the best course of action to resolve

  • Administer, configure, and execute ad hoc jobs

  • Work with Tier 2 and Tier 3 support as required

  • Coordinate routine maintenance as per published procedures

  • Work with the team, schedule hardware maintenance to avoid downtime and maintain service levels

  • Create and update accurate operations problem reports on production incidents and problems

  • Develop documentation for Operations processes

  • Work rotating shifts, including weekends and holidaysÍž and overtime as the need arises

What we need to see:

  • Proximity to Sterling, VA.

  • Driving license required. Occasional travel required.

  • Post-secondary diploma or degree in the information technology field, or equivalent work experience

  • 2+ years of related worked experience

  • Demonstrated experience with an Enterprise Data center system

  • Physical labor to Rack/Unrack network equipment in data center, including pulling cables over the racks in a data center environment

  • Previous experience performing operational activities including batch processing, system backups, maintenance, monitor and provide Level 1 network and server support, monitor and respond to data center environmental alarms, monitor various application systems

  • Experience in analyzing and resolving problems as well as customer conflicts in a customer-focused manner

  • Experience handling special requests for network configuration changes, system reboots, performing server and network switch reboots, file restores, web updates and terminal messaging

  • Knowledge of both Linux and Windows operating system, network stack and associated tools

  • Excellent written and oral communication skills as well as excellent customer relations skills

  • Must be proficient with computers and typing

NVIDIA is widely considered to be one of the technology world's most desirable employers. We have some of the most forward-thinking and talented people in the world working for us. If you're creative and autonomous, we want to hear from you.
NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High-Performance Computing and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables amazing creativity and discovery, and powers what were once science fiction inventions from artificial intelligence to autonomous cars. NVIDIA is looking for great people like you to help us accelerate the next wave of artificial intelligence.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression , sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.