Site Reliability Engineer
Posted
Job Title: Site Reliability Engineer
Duration- Fulltime Permanent
Location: Pittsburgh, PA/Strongsville, OH (Onsite From Day 1)
Job Description:
Skill: Site Reliability Engineer (Full Stack Developer)
Must skills:
Duration- Fulltime Permanent
Location: Pittsburgh, PA/Strongsville, OH (Onsite From Day 1)
Job Description:
Skill: Site Reliability Engineer (Full Stack Developer)
Must skills:
- Part of a full stack agile team supporting Pega, Java, ETL, Data Engineers, VB (.NET), and Mainframe development.
- Actively participate in all Agile ceremonies, including daily stand-ups, sprint planning, and retrospectives.
- Experience with Medallion Architecture a plus.
- Experience in Containerization (Kubernetes / Docker) a plus.
- Experience with Marketing applications (Pega, Salesforce, Adobe, Zafin, Naehas) is a plus.
- Experience with Visual Basic (.NET) a plus.
- Experience with SaaS based solutions a plus.
- Financial Services industry knowledge.
- Support and maintain mission-critical applications developed in COBOL, DB2, Pega, VB .Net, and Java, including diagnosing and resolving application and database performance issues.
- Monitor and maintain the health, performance, and reliability of large-scale Hadoop clusters and big data environments, ensuring optimal resource utilization and uptime.
- Develop, automate, and optimize data pipelines using SQL, Python, and PySpark for efficient data ingestion, transformation, and processing.
- Troubleshoot and resolve complex issues related to Informatica ETL processes, ensuring data quality, consistency, and timely delivery.
- Implement and enforce best practices for site reliability, including automated monitoring, alerting, and incident response for both big data platforms and legacy systems.
- Collaborate with development, QA, and infrastructure teams to support application deployments, upgrades, and integration across diverse technologies.
- Document operational procedures, incident reports, and system configurations to support knowledge sharing and business continuity.
- Continuously evaluate and recommend improvements for system scalability, security, and reliability in both big data and legacy application environments.
- Ensure data security, governance, and compliance standards are met within all data engineering processes.
- Familiarity with Dynatrace and Log Scale.
- Familiarity with tools like Jira, Confluence, Alation.
