• OpenShift Service Reliability Engineering (SRE) Internship

    Location US-NC-Raleigh
    Posting date 2 months ago(10/3/2019 12:11 PM)
    Job ID
    Software Engineering
  • Company description

    At Red Hat, we connect an innovative community of customers, partners, and contributors to deliver an open source stack of trusted, high-performing solutions. We offer cloud, Linux, middleware, storage, and virtualization technologies, together with award-winning global customer support, consulting, and implementation services. Red Hat is a rapidly growing company supporting more than 90% of Fortune 500 companies.

    Job summary

    The Red Hat OpenShift Service Reliability Engineering (SRE) team is looking for an Intern to join us in Raleigh, NC. In this role, you will work as part of the first team to host and manage the code for Red Hat OpenShift, which is Enterprise Kubernetes, in the public cloud. You’ll play a key role within the team, as you’ll be responsible for keeping the Red Hat OpenShift platform environment available and secure. Along with the rest of your team, you will interact with other site reliability engineers and product engineering associates around the world to deliver large, containerized cluster environments. You'll be responsible for provisioning, upgrades, problem detection and automated recovery scenarios, incident management, and understanding complicated, interconnected data points to resolve faults when issues arise. You’ll need to be able to work in a complicated and fast-paced environment while quickly learning new skills and creating ways to consistently meet service-level agreements (SLAs) and keep a globally-distributed, cloud-based, containerized service (Enterprise Kubernetes) running for our customers.

    Primary job responsibilities

    • Actively work to automatically detect potential issues in a large virtualized environment
    • Write automation scripts to autocorrect or completely prevent issues in our online offering
    • Track and review changes in a highly dynamic environment
    • Identify single points of failure and other high-risk architecture issues and propose more resilient solutions
    • Perform and oversee releases to ensure that proper life cycle and policies are followed
    • Perform software updates, testing, and Common Vulnerabilities and Exposures (CVE) analysis
    • Respond to security threats

    Required skills

    • Ability to work full-time hours during summer 2020 in the location listed
    • Experience running Linux servers (any distribution); Red Hat Enterprise Linux (RHEL), CentOS, or Fedora are a plus
    • Basic knowledge of configuration management systems like Puppet or Chef; Red Hat Ansible Automation is a plus
    • Demonstrated ability to quickly and accurately troubleshoot issues
    • Solid foundation in a programming language like Golang, Python, or C#; Golang is a plus
    • Basic knowledge of monitoring systems like Prometheus, Nagios, or Zabbix is a plus
    • Basic knowledge of cloud technologies, e.g., Infrastructure-as-a-Service (IaaS), Platform-as-a-Service (PaaS), Red Hat OpenStack Platform, Amazon Web Services (AWS), Google Cloud Platform (GCP), or Microsoft Azure is a plus
    • Previous code contributions to open source projects or code samples on GitHub are a plus

    Red Hat is proud to be an equal opportunity workplace and an affirmative action employer. We review applications for employment without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, ancestry, citizenship, age, uniformed services, genetic information, physical or mental disability, medical condition, marital status, or any other basis prohibited by law.

    Red Hat does not seek or accept unsolicited resumes or CVs from recruitment agencies. We are not responsible for, and will not pay, any fees, commissions, or any other payment related to unsolicited resumes or CVs except as required in a written contract between Red Hat and the recruitment agency or party requesting payment of a fee.


    Interested in this job?

    Sorry the Share function is not working properly at this moment. Please refresh the page and try again later.
    Share on your newsfeed