Lead System Reliability Engineer - Eden Prairie, MN or Boston, MA or Telecommute

UnitedHealth Group

Atlanta Georgia

United States

Information Technology

Optum is a company that's on the rise. We're expanding in multiple directions, across borders and, most of all, in the way we think. Here, innovation isn't about another gadget, it's about transforming the health care industry. Ready to make a difference? Make yourself at home with us and start doing your life's best work.(sm)

UnitedHealth Care Technology Benefit Operations creates opportunities to deliver technology products supporting UHC Operations in simplifying the member and employee experience, delivering provider services and enabling cost stewardship and efficiency. Using data analytics, broad capabilities, artificial intelligence and the latest technologies made by experts, our people are continually innovating with a purpose to solve real-world challenges - ultimately driving better health care benefit utilization and reduced costs.

You'll enjoy the flexibility to telecommute* from anywhere within the U.S. as you take on some tough challenges.

Primary Responsibilities:

  • Serve as a subject matter expert to internal stakeholders on all aspects of the System Reliability Engineering (SRE) discipline, driving increased reliability into various systems requiring support, across the BenOps Organization.
  • Guide Reliability via Enterprise Transformation, influencing teams as they work towards maintaining 99.99% reliability targets, and beyond
  • Partner with the SRE/DevOps team to identify anti-patterns and optimization strategies, create fallback options, and help develop self-healing capabilities across the organization in a sustainable manner
  • Drive results through increasing reliability of applications and failover capabilities
  • Lead teams outside their immediate organization in implementing SRE principles
  • Complete tasks with minimal guidance and oversight, and review work of others.
  • Actively mentor team members through code reviews, brown bags, tech talks, and design reviews
  • Lead change and innovation: pursue opportunities to adopt new technologies, drive their adoption to enhance business outcomes, and manage high-quality execution across organizational lines.
  • Grow and maintain knowledge of cutting-edge IT industry and marketplace technologies and trends, and leverage it to support highly available distributed systems and the transformation of legacy systems

You'll be rewarded and recognized for your performance in an environment that will challenge you and give you clear direction on what it takes to succeed in your role as well as provide development for other roles you may be interested in.

Required Qualifications:

  • 4+ years of professional IT experience (system administration, software development, or a combination), with steadily increasing responsibilities.
  • 2+ years of experience designing and building highly distributed, reliable systems.
  • 2+ years of experience as a Site Reliability Engineer
  • Hands-on development experience with one or more modern scripting/programming languages or frameworks (Python, Java, C#, Ruby, Go, React, Groovy, Scala)
  • Experience with Agile and CI/CD methodologies and enabling automation within development teams

Preferred Qualifications:

  • Experience leading technical teams and/or managing IT project timelines and deliverables
  • Mixed skill set of system administration/operations and development with full stack experience
  • Experience with application performance management tools such as New Relic and Dynatrace, log management tools such as Splunk, and message queues such as Kafka
  • Experience designing, architecting, and building cloud-based, high-throughput platforms and applications using Docker, the Java/JEE stack, .NET, language tools and frameworks (Spring, Spring Boot, Hibernate, Web Services, Spring Cloud), and web servers such as Apache Tomcat
  • Experience with modern telemetry technologies such as Grafana, Prometheus, Jaeger, and Elasticsearch
  • Good understanding of networking concepts (e.g. routing, firewalls, proxies) and technologies (e.g. HTTP, TLS, Nginx)

UnitedHealth Group requires all new hires and employees to report their COVID-19 vaccination status.

Technology Careers with Optum. Information and technology have amazing power to transform the health care industry and improve people's lives. This is where it's happening. This is where you'll help solve the problems that have never been solved. We're freeing information so it can be used safely and securely wherever it's needed. We're creating the very best ideas that can most easily be put into action to help our clients improve the quality of care and lower costs for millions. This is where the best and the brightest work together to make positive change a reality. This is the place to do your life's best work.(sm)

Colorado Residents Only: The salary/hourly range for Colorado residents is $94,500 to $171,700. Pay is based on several factors including but not limited to education, work experience, certifications, etc. As of the date of this posting, in addition to your salary, UHG offers the following benefits for this position, subject to applicable eligibility requirements: health, dental, and vision plans; wellness program; flexible spending accounts; paid parking or public transportation costs; 401(k) retirement plan; employee stock purchase plan; life insurance, short-term disability insurance, and long-term disability insurance; business travel accident insurance; Employee Assistance Program; PTO; and employee-paid critical illness and accident insurance.

*All Telecommuters will be required to adhere to UnitedHealth Group's Telecommuter Policy.

Diversity creates a healthier atmosphere: UnitedHealth Group is an Equal Employment Opportunity/Affirmative Action employer and all qualified applicants will receive consideration for employment without regard to race, color, religion, sex, age, national origin, protected veteran status, disability status, sexual orientation, gender identity or expression, marital status, genetic information, or any other characteristic protected by law.

UnitedHealth Group is a drug-free workplace. Candidates are required to pass a drug test before beginning employment.

Job Keywords: Lead System Reliability Engineer, Telecommute
