Associate Service Reliability Engineer

Location US-MA-Boston
Posting date 6 days ago(3/13/2018 10:54 AM)
Job ID
Software Engineering, Systems Engineering

Company description

At Red Hat, we connect an innovative community of customers, partners, and contributors to deliver an open source stack of trusted, high-performing solutions. We offer cloud, Linux, middleware, storage, and virtualization technologies, together with award-winning global customer support, consulting, and implementation services. Red Hat is a rapidly growing company supporting more than 90% of Fortune 500 companies.

Job summary

The Red Hat OpenShift Online Service Reliability Engineering (SRE) team is looking for an Associate Service Reliability Engineer to join our newly formed team in Boston, MA. In this role, you will play a key role within the team responsible for keeping the OpenShift environment available and secure. The Boston team will interact with other SRE teams and product engineering resources around the world to deliver large, containerized cluster environments. You’ll be responsible for provisioning, upgrades, problem detection and automated recovery scenarios, incident management, and understanding complicated, interconnected data points to resolve faults when issues arise. We'll need you to be able to work in a complicated and fast-paced environment, while quickly learning new skills and creating ways to consistently meet service level agreements and keep a globally-distributed, cloud-based, containerized service running for our customers.

Primary job responsibilities

  • Interact with automated monitoring and healing infrastructure to ensure healthy environments
  • Develop automation to auto-correct or completely prevent issues in our online solution
  • Participate in product release cycles, deploying code to integration, staging and production environments, integrating with continuous integration (CI) and continuous delivery (CD) tooling, monitoring and change management
  • Perform software updates, peer code reviews, testing, and CVE analysis; respond to security threats
  • Identify single points of failure and other high-risk architecture issues; propose and implement more resilient resolutions
  • Resolve customer issues in cooperation with Red Hat's global customer support team
  • Create and maintain standard operating procedures (SOPs) for performing maintenance tasks, applying configuration changes and remediating problems in our environment
  • Participate in a regular shift and on-call rotation

Required skills

  • 1-2 years of experience managing Linux servers running Red Hat Enterprise Linux (RHEL), CentOS, or Fedora hosted at a cloud provider like Amazon Web Services (AWS), Google Compute Engine (GCE), or Microsoft Azure
  • 1+ year(s) of experience with enterprise systems monitoring; knowledge of Zabbix or Nagios is preferred
  • 1+ year(s) of experience with enterprise configuration management like Ansible, Puppet, or Chef
  • 1+ year(s) of experience with object-oriented programming with at least one language; Golang, Java, or Python preferred
  • 1+ year(s) of experience delivering a hosted service
  • Demonstrated ability to quickly and accurately troubleshoot system issues
  • Solid understanding of standard TCP/IP networking and common protocols like DNS and HTTP
  • Excellent communications skills and experience working directly with and presenting to customers
  • Willingness to work during weekends
  • 1+ year(s) of experience with Kubernetes and with Docker-based containers is a plus

Red Hat is proud to be an equal opportunity workplace and an affirmative action employer. We review applications for employment without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, ancestry, citizenship, age, uniformed services, genetic information, physical or mental disability, medical condition, marital status, or any other basis prohibited by law.

Red Hat does not seek or accept unsolicited resumes or CVs from recruitment agencies. We are not responsible for, and will not pay, any fees, commissions, or any other payment related to unsolicited resumes or CVs except as required in a written contract between Red Hat and the recruitment agency or party requesting payment of a fee.


Interested in this job?

Sorry the Share function is not working properly at this moment. Please refresh the page and try again later.
Share on your newsfeed