Skip to main content

SAS Data Engineer- Linux bilingual in Korean

Tyler, TX
Permanent

Posted

[SAS Datalake Engineer]

1. Node Replacement and Upgrades:

- Assess Isilon and NetApp cluster capacity and performance metrics using OneFS and ONTAP to determine the need for node replacements or upgrades.

- Manage the integration of new Isilon and NetApp nodes, ensuring system compatibility and minimal downtime during scalability operations.

- Oversee the upgrade and replacement of HPC, Kubernetes, Greenplum, Impala cluster servers, and GPU server hardware to enhance computational and graphics processing capabilities.

2. Troubleshooting and Problem Resolution:

- Utilize Impala Insight IQ and NetApp tools for in-depth analysis to diagnose and resolve issues related to throughput, latency, and capacity.

- Leverage One FS and ONTAP troubleshooting tools and logs to identify and rectify system errors and hardware malfunctions.

- Address performance bottlenecks and system failures in HPC, Kubernetes, Greenplum, GPU servers, Impala, and NetApp environments to ensure operational efficiency.

3. System Monitoring and Maintenance:

- Implement continuous monitoring with Isilon One FS event logs, NetApp ONTAP tools, SNMP alerts, and Grafana to detect anomalies early.

- Schedule regular maintenance using Isilon's and NetApp's software suites to ensure optimal performance and longevity of the storage systems.

- Monitor and maintain HPC, Kubernetes, Greenplum, GPU, Impala, and NetApp server environments, applying necessary updates and performing system health checks regularly.

4. Patches and Updates:

- Apply Impala and NetApp-specific patches and firmware updates to nodes and system software, ensuring all components are secure and up-to-date.

- Test new software updates in a sandbox environment to evaluate their impact on performance and stability before deployment across the cluster.

- Manage and coordinate software updates and patch installations across HPC, Kubernetes, Greenplum, GPU servers, Impala, and NetApp systems to maintain software integrity and security.

5. Technical Support and Consultation:

- Provide specialized support and consultation for optimizing storage solutions with Isilon and NetApp, including data migration, system expansion, and configuration tuning.

- Offer technical support for optimizing HPC, Kubernetes, Greenplum, and GPU server configurations, improving computational resources and data processing workflows.

- Assist with the design, configuration, and optimization of NetApp storage architectures to ensure efficient data management.

6. End-of-Life (EOL) Hardware Management:

- Manage the phased decommissioning of aging hardware in Isilon and NetApp systems, ensuring compliance with environmental standards and security protocols.

- Coordinate the replacement of EOL hardware for HPC, Kubernetes, Greenplum, GPU servers, Isilon, and NetApp to prevent performance degradation.

7. Integration and System Compatibility:

- Ensure that Impala, NetApp, HPC, Kubernetes, Greenplum, and GPU storage solutions are fully integrated and compatible with the overall IT infrastructure.

- Develop and implement strategies for the effective integration of storage and processing technologies to maximize performance and resource utilization.

8. Documentation and Knowledge Sharing:

- Maintain detailed documentation of system configurations, upgrades, and troubleshooting activities to streamline processes and support team onboarding.

- Conduct training sessions or create guides for internal teams to share knowledge about system operations and best practices.

9. Disaster Recovery and Backup:

- Plan and test disaster recovery procedures for Impala, NetApp, and HPC/Kubernetes/Greenplum/GPU servers to ensure data availability and business continuity.

- Ensure backup and recovery processes are in place and regularly tested to minimize data loss risks.

10. Capacity Planning:

- Develop capacity planning strategies to proactively scale storage and compute resources based on performance trends and business needs.

11. Automation:

- Develop and maintain automation scripts for routine tasks such as monitoring, data migration, or maintenance to improve system efficiency.

- Utilize automation tools to streamline repetitive tasks, enhancing system performance and reducing manual errors.

12. Security and Compliance:

- Ensure compliance with organizational security standards and relevant regulations during all maintenance, upgrades, and troubleshooting activities.

- Implement or coordinate with security teams on applying best practices and hardening measures for Isilon, NetApp, HPC, Kubernetes, Greenplum, and GPU servers.

13. Data Center Work:

- Participate in data center activities, including racking and initial installation of servers and storage equipment.

- Ensure proper cabling, power connections, and network configurations during initial setup to facilitate smooth integration into the IT environment.

Requirements

  • 46 years of hands-on Linux support experience in enterprise environments
  • Strong Linux administration and troubleshooting skills (RHEL/CentOS/Ubuntu)
  • Experience with system monitoring, log analysis, and production support
  • Basic scripting knowledge (Bash/Python preferred)
  • Understanding of networking fundamentals (TCP/IP, DNS, SSH)
  • Familiarity with ETL/data pipeline support and batch job monitoring
  • Experience with cloud environments (AWS/Azure/GCP) is a plus
  • Knowledge of Docker/Kubernetes is preferred
  • Strong problem-solving and communication skills

Benefits

Comprehensive health insurance, 401K, PTO, sick days, end year bonus

Job Type: Permanent

Job ID: 254802431