Location: REMOTE
Description: Our client is currently seeking a Site Reliability Engineer (SRE) - Remote
Job Description:
Responsibilities:
Necessary experience:
• Well versed in the entire software development lifecycle, DevOps, and SRE practices
• Expertise and operational experience at scale - designing and operating highly available, scalable and fault-tolerant systems using container platforms
• Experience with operational monitoring tools (AppDynamics, New Relic, Instana, Catchpoint) with a mindset geared toward predictive analysis
• Experience with Splunk or ELK Stack, Grafana, Datadog, or Sysdig
• Working knowledge of automation tools such as Ansible, Terraform, or Chef
• Experience with Pivotal Cloud Foundry (PCF), OpenShift (OCP), Amazon Web Services (AWS), and Google Cloud Platform (GCP)
Minimum Qualifications:
• You have 5+ years of SRE experience in a highly customer-focused environment.
• You have 3+ years of experience successfully managing a team of engineers on large-scale projects that included technical deep-dives and production troubleshooting in the areas of distributed systems, programming, configuration management, networking, storage, and operating systems.
• You possess strong leadership skills and the ability to motivate teams.
• You bring a strong perspective and a collaborative approach that drives change and motivates engineers to develop simple solutions to complex operational or reliability challenges.
• You have experience formulating a team's technical strategy and roadmap, and you've collaborated and partnered effectively with several other teams.
• You are capable of leading a discussion with upper management, and are able to tailor the level of technical detail to suit your audience.
B.S. in Computer Science or equivalent experience
The Judge Group
Atlanta Georgia
United States
Information Technology