• Principal Site Reliability Engineer

    Location US-WA-Remote
    Posting date 2 months ago(9/26/2018 3:23 AM)
    Job ID
    65050
    Category
    Software Engineering, Systems Engineering
  • Company description

    At Red Hat, we connect an innovative community of customers, partners, and contributors to deliver an open source stack of trusted, high-performing solutions. We offer cloud, Linux, middleware, storage, and virtualization technologies, together with award-winning global customer support, consulting, and implementation services. Red Hat is a rapidly growing company supporting more than 90% of Fortune 500 companies.

    Job summary

    The Red Hat OpenShift Service Delivery team is looking for a Principal Site Reliability Engineer to join us Washington. In this role, you will join a team that hosts, develops, and supports the OpenShift project on the Azure cloud. This is a new partnership between Microsoft and Red Hat, and you can be part of defining how it moves forward. You’ll be a key member of our OpenShift team and focus on standardization of containers, which will include work around Red Hat Container Catalog, Kubernetes, and OpenShift environments. We'll need you to be able to work in a fast-paced and sometimes chaotic environment, with the goal of making our end users and customers more effective with every release. Candidates in San Francisco, CA and on the West Coast will also be considered.

    Primary job responsibilities

    • Contribute meaningfully to the code and mitigate CVEs
    • Work closely with engineers on OpenShift to become a contributor to both the upstream and downstream OpenShift projects to deliver functionality
    • Interact with multiple teams within Red Hat as well as with the open source community
    • Partner with Support to troubleshoot deep technical issues
    • Look for opportunities to improve how we deliver a hosted offering to our customers

    Required skills

    • Experience writing code for B2B organizations
    • Knowledge of Azure, Kubernetes, or Openshift; solid skills in at least one of these areas and capable of learning the rest quickly
    • Ability to communicate with end users through IRC, forums, and e-mail
    • Experience implementing and improving continuous integration (CIú and deployment (CD) pipelines
    • Knowledge of how to ensure a hosted service is secure and stable, including establishing processes around that

    The following is considered a plus:

    • Experience with Prometheus
    • Linux experience
    • Experience in a Site Reliability Engineering (SRE) position for a cloud based company


    Red Hat is proud to be an equal opportunity workplace and an affirmative action employer. We review applications for employment without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, ancestry, citizenship, age, veteran status, genetic information, physical or mental disability, medical condition, marital status, or any other basis prohibited by law.


    Red Hat does not seek or accept unsolicited resumes or CVs from recruitment agencies. We are not responsible for, and will not pay, any fees, commissions, or any other payment related to unsolicited resumes or CVs except as required in a written contract between Red Hat and the recruitment agency or party requesting payment of a fee.

     

    Interested in this job?

    Sorry the Share function is not working properly at this moment. Please refresh the page and try again later.
    Share on your newsfeed