Splunk Platform Engineer
Trident Consulting is seeking aSplunk Platform Engineer for one of our clients in" Holmdel, NJ & Bethlehem, PA. A global leader in business and technology services.
- Position: Splunk Platform Engineer
- Location: Holmdel, NJ & Bethlehem, PA
- Mode: Contract W2
- Hybrid 3 Days onsite / 2 Days remote
- Duration: 6+ Months (Possible extension)
- Pay rate: 85/hr -90/hr. on W2
Job Overview
We are seeking aSenior Observability Engineerwith strong expertise inSplunk administration and monitoring platformsto join the Enterprise Observability Engineering team.
The ideal candidate will be responsible forconfiguring, administering, and maintaining enterprise observability toolsincludingSplunk (primary), AppDynamics, OpenTelemetry, and Zenossto ensure reliability, visibility, and optimal performance of enterprise IT systems.
This role involves working closely withDevOps, infrastructure, and application teamsto implement monitoring strategies and improve system observability.
Key Responsibilities
Observability Platform Administration
- Administer and configureSplunk, AppDynamics, OpenTelemetry (OTEL), and Zenossplatforms.
- Implement monitoring solutions aligned with enterprise observability standards.
- Perform upgrades, patching, and security hardening of observability tools.
Monitoring & System Reliability
- Monitor the performance and health of observability platforms.
- Ensurehigh availability and data integrityof monitoring systems.
- Troubleshoot and resolve monitoring platform issues.
Dashboard & Alert Management
- Design and maintainmonitoring dashboards, alerts, and reports.
- Collaborate with stakeholders to define monitoring requirements.
- Implement alerting mechanisms for proactive issue detection.
Data Management & Optimization
- Manage data ingestion and onboarding into monitoring systems.
- Optimize platform performance through configuration tuning.
- Manage resource utilization and storage within observability platforms.
Incident Response & Troubleshooting
- Supportincident investigation and root cause analysis.
- Leverage observability data (metrics, logs, events, traces) to resolve issues.
- Collaborate with IT and DevOps teams during incident response.
Documentation & Best Practices
- Maintain documentation for monitoring configurations and procedures.
- Establish observability standards and best practices across the organization.
User Support & Training
- Provide technical support to internal teams using monitoring platforms.
- Conduct training sessions to improve observability tool adoption.
Required Qualifications
- Bachelor's degree in computer science, Information Technology, or related field.
- 57+ years of experiencein Observability, Monitoring, or Site Reliability Engineering.
- Strong hands-on experience with:
- Splunk (Administration, configuration, and implementation)
- AppDynamics
- Zenoss
Technical Skills
- Strong understanding ofMELT framework:
- Metrics
- Events
- Logs
- Traces
- Experience withOpenTelemetryincluding:
- Instrumentation patterns
- Context propagation
- Collectors
- Sampling
- Experience withKubernetes observability
- Strong knowledge ofIT infrastructure, applications, and networking
- Experience withscripting or automation (Python, Bash)
- Experience withcloud platforms such as AWS or Azure
Preferred Qualifications
- Experience with monitoring tools such as:
- Prometheus
- Grafana
- Knowledge ofDevOps practices and CI/CD pipelines
- Experience withInfrastructure as Code (Terraform or Ansible)
- Familiarity withGit-based workflows
