Gathering your results ...
2 days
Not Specified
Not Specified
Not Specified
<p>Title: Senior Site Reliability Engineer</p> <p>Location: Charlotte, NC</p> <p>Alternative Location: Phoenix, AZ, Irving, TX</p> <p>Duration: 12 months</p> <p>Work Engagement: W2</p> <p>Work Schedule: 3 days in office/2 days remote</p> <p>Benefits on offer for this contract position: Health Insurance, Life insurance, 401K and Voluntary Benefits</p> <p>Summary:</p> <p>We are seeking an experienced Platform Reliability / SRE Engineer to ensure the reliability, performance, and smooth operation of our enterprise Harness Continuous Delivery (CD) platform. This role is hands-on, automation-focused, and central to supporting our development teams across multiple environments.</p> <p>Responsibilities:</p> <p>Platform Reliability & Operations</p> <ul> <li>Ensure end-to-end reliability, availability, and performance of the Harness CD platform across non-prod, prod, and BCP environments </li><li>Monitor and report on SLIs, SLOs, error budgets, deployment success rates, and platform health </li><li>Lead incident response and troubleshooting for deployment failures, outages, or performance issues </li><li>Identify and resolve scaling, performance, and capacity challenges across delegates, pipelines, Kubernetes clusters, and cloud integrations </li></ul> <p>Automation & Engineering Excellence</p> <ul> <li>Build automation for provisioning, configuration, scaling, upgrades, and ongoing maintenance of Harness components </li><li>Develop Infrastructure as Code (IaC) using Terraform, Ansible, Helm, or similar tools </li><li>Automate operational tasks including delegate lifecycle management, cluster onboarding, secret rotation, and pipeline validation </li><li>Reduce manual work by creating repeatable, self-service automation workflows </li></ul> <p>DevOps & CI/CD Integration</p> <ul> <li>Maintain and improve integrations between Harness and tools such as GitHub, Jenkins, Azure DevOps, Kubernetes/OpenShift, and cloud platforms </li><li>Enhance developer experience by supporting efficient, reliable deployment pipelines </li><li>Partner with DevOps teams on deployment strategies (blue/green, canary, rolling updates) </li><li>Work with Security teams to embed DevSecOps practices, including policy enforcement and governance pipelines </li></ul> <p>Observability & Monitoring</p> <ul> <li>Build and maintain monitoring, logging, dashboards, and alerting for all Harness components </li><li>Use tools such as Splunk, Prometheus, Grafana, or AppDynamics to create actionable alerts </li><li>Detect and escalate issues such as pipeline delays, delegate saturation, API errors, and Kubernetes resource constraints </li><li>Support proactive monitoring to reduce detection and resolution time </li></ul> <p>Modernization & Continuous Improvement</p> <ul> <li>Assist with Harness upgrades, patches, and lifecycle maintenance </li><li>Support modernization initiatives such as containerization, cloud-native deployments, and multi-cloud expansion </li><li>Assist with resiliency activities including BCP testing and backup verification </li><li>Evaluate new Harness features and modules for enterprise adoption </li></ul> <p>Technical Leadership</p> <ul> <li>Serve as a technical SME for the Harness platform </li><li>Create documentation, architecture details, and operational runbooks </li><li>Partner with senior engineers to enhance automation standards and platform best practices </li></ul> <p>Qualifications:</p> <ul> <li>Applicants must be authorized to work for ANY employer in the U.S. This position is not eligible for visa sponsorship. </li><li>Demonstrated experience in DevOps, SRE, Platform Engineering, or Cloud Engineering </li><li>Demonstrated hands-on experience with Harness CD </li><li>Strong experience with Kubernetes/OpenShift, Linux, and cloud deployment best practices </li><li>Solid understanding of CI/CD workflows and release automation </li><li>Experience applying SRE concepts (SLIs, SLOs, error budgets, reliability improvements) </li><li>Strong scripting and automation skills using Python, Bash, PowerShell, and Ansible </li><li>Experience with Infrastructure as Code (Terraform, Ansible, Helm, or similar) </li><li>Experience with monitoring and logging tools such as Prometheus, Grafana, Splunk, ELK, or AppDynamics </li><li>Strong troubleshooting skills across containers, OS, networking, platforms, and cloud environments </li><li>Data center migration experience (preferred) </li><li>Experience supporting enterprise-scale CD platforms (preferred) </li><li>Experience in hybrid cloud or cloud-native environments (Azure, GCP) (preferred) </li><li>Familiarity with DevSecOps, governance models, and policy automation (preferred) </li><li>Experience supporting complex upgrades, migrations, or modernization projects (preferred) </li></ul>
POST A JOB
It's completely FREE to post your jobs on ZiNG! There's no catch, no credit card needed, and no limits to number of job posts.
The first step is to SIGN UP so that you can manage all your job postings under your profile.
If you already have an account, you can LOGIN to post a job or manage your other postings.
Thank you for helping us get Americans back to work!
It's completely FREE to post your jobs on ZiNG! There's no catch, no credit card needed, and no limits to number of job posts.
The first step is to SIGN UP so that you can manage all your job postings under your profile.
If you already have an account, you can LOGIN to post a job or manage your other postings.
Thank you for helping us get Americans back to work!