Skip to main content

Site Reliability Engineer DevOps Engineer

Orlando, FL
Permanent
Metasys Technologies

Site Reliability Engineer (SRE) / DevOps Engineer
Location: Orlando, FL (onsite)
Duration: 12+ months

Client is looking for a highly skilled SRE / DevOps Engineer to design, implement, and maintain scalable, reliable, and secure cloud infrastructure. The ideal candidate will have strong expertise in cloud platforms, automation, CI/CD, observability, and container orchestration, with a focus on improving system reliability, performance, and operational efficiency.

Key Responsibilities

  • Design, build, and maintain highly available and scalable cloud infrastructure on AWS and Azure
  • Implement and manage Infrastructure as Code (IaC) using Terraform and CloudFormation
  • Automate configuration management using tools like Ansible or Chef
  • Deploy and manage containerized applications using Docker and Kubernetes
  • Build and optimize CI/CD pipelines using Jenkins or Harness
  • Monitor system performance, availability, and reliability using Splunk, AppDynamics, and other observability tools
  • Collaborate with development teams to improve deployment processes and system resilience
  • Implement incident management and ITIL processes using ServiceNow
  • Troubleshoot production issues and perform root cause analysis (RCA)
  • Ensure security, compliance, and best practices across infrastructure and applications
  • Manage and optimize CDN and edge delivery solutions (Akamai)

Required Skills & Qualifications

  • Strong experience in AWS (certification preferred) and Microsoft Azure
  • Hands-on expertise in Terraform and/or CloudFormation
  • Experience with configuration management tools (Ansible/Chef)
  • Solid understanding of containerization (Docker) and orchestration (Kubernetes)
  • Proficiency in Python and/or JavaScript scripting
  • Experience with CI/CD tools (Jenkins, Harness)
  • Knowledge of monitoring and observability tools (Splunk, AppDynamics)
  • Familiarity with ITIL processes and ServiceNow
  • Strong understanding of networking, security, and system design principles
  • Experience working in Agile/DevOps environments

Preferred / Nice-to-Have Skills

  • Experience with multi-cloud or hybrid cloud environments
  • Knowledge of SRE principles (SLI/SLO/SLA, error budgets)
  • Exposure to chaos engineering or reliability testing
  • Experience with log aggregation and distributed tracing
  • Familiarity with CDN technologies (Akamai) and edge computing

Key Competencies

  • Strong problem-solving and troubleshooting skills
  • Ability to work in high-pressure production environments
  • Excellent communication and collaboration skills
  • Proactive mindset toward automation and reliability improvements
  • Work on modern cloud-native technologies
  • Opportunity to build high-scale, resilient systems
  • Collaborative and innovation-driven environment

Metasys Technologies is an equal opportunity employer. All applicants will be considered for employment without attention to race, color, religion, sex, sexual orientation, gender identify, national origin, veteran or disability status.

Job Type: Permanent

Job ID: 253742842