Skip to main content

Sr DevOps Engineer -Hybrid

Tyler, TX
Permanent
Description:
Our Cloud Infrastructure Platform Engineering team plays a pivotal role in designing and deploying robust infrastructure solutions to support SAP products and services.
RESPONSIBILITIES:
  • Write, modify, run terraform to deploy and maintain infrastructure across multiple cloud service providers. Be able to debug errors when deploying terraform.
  • Run ansible playbooks to perform configuration and maintenance tasks. Be able to develop new ansible roles and troubleshooting as errors occur.
  • Use GitLab with multiple repositories, create and improve CI utilizing runners to maintain infrastructure.
  • Configure, build, and deploy containerized services using Docker and/or Kubernetes.
  • Develop standard operating procedures, knowledge base articles, technical bulletins, and other documents in support of the infrastructure.
  • Operate in a security-first mindset, performing all other responsibilities with security in mind.
  • Implement monitoring, config management, and logging capabilities to manage a multiple tenant cloud infrastructure across multiple cloud service providers.
  • Use generative AI elements to increase efficiency and speed, improve accuracy and consistency, enhance security, and better manage resources where practical and within security boundary guidelines.
KNOWLEDGE AND SKILLS:
  • Knowledge of AWS foundational technologies (EC2, S3, IAM, Route53, VPC).
  • Knowledge of Azure foundational technologies (Virtual Networks, Application Gateway, Storage Account, Virtual Machines, Load Balancer, Resource Groups).
  • Knowledge of GCP foundational technologies (Projects, Compute Engine, GKE, Cloud Storage, VPC).
  • Proficient in using Terraform to build modules for deploying infrastructure in AWS, Azure, and/or GCP.
  • Proficient in CI/CD technologies including pipeline design and implementation using tools such as GitLab runners and ConcourseCI.
  • Working knowledge of HashiCorp Vault for secret management, including secure storage and retrieval of credentials, API keys, and certificates.
  • Experience with Ansible for automation, configuration management, and playbook development
  • Proficient in a scripting or programming language for automation (bash, python).
  • Experience with RBAC design and Single Sign-On (SSO) for identity management.
  • Investigation skills to review detailed Infrastructure and configurations.
  • Experience with logging and log processing in a cloud environment.
  • Working knowledge of networking: ip routing, subnetting, network troubleshooting.
  • Strong Linux OS skills (deployment, configuration, and troubleshooting).
  • Knowledge of ITIL-compliant ticket management using ServiceNow, including incident resolution, change requests and sprint planning
PREFERRED QUALIFICATIONS:
  • Expertise in designing, analyzing and troubleshooting large-scale distributed systems.
  • Ability to debug and optimize code and automate routine tasks.
  • Systematic problem-solving approach coupled with strong communication skills and a sense of ownership and drive.
  • Hands-on experience developing infrastructure as code (IaC).
  • 5-8+ years of experience with the following technologies: Terraform, Ansible, CI/CD, Secrets Management, Identity Management (SSO) .
  • Experience with Linux operating system internals and administration (e.g., filesystems, inodes, system calls, hardening) and networking (e.g., TCP / IP, routing, DNS, network topologies, SDN).

Job Type: Permanent

Job ID: 253408097