Gathering your results ...
5 days
Not Specified
Not Specified
Not Specified
<p>America's Test Kitchen (ATK) is seeking a Senior Site Reliability Engineer (SRE) to focus on the stability, scalability, and performance of our core Cloud Infrastructure and Database Systems. This high-impact role is focused on applying software engineering principles to operations, reducing toil, and ensuring the reliability of our high-traffic website, app, and digital subscription platforms.</p> <p>The successful candidate will be a proficient software developer and an expert in cloud architecture who thrives on designing and implementing automated infrastructure solutions, optimizing complex database performance, and collaborating closely with development teams to build resilient services.</p> <p>This is a newly created role and will report to the VP, Engineering and will be a key contributor to ATK's DevOps and infrastructure strategy.</p> <p>Core Technical Responsibilities</p> <p>Reliability Engineering & Cloud Infrastructure:</p> <ul> <li> <p>Infrastructure-as-Code (IaC): Design, implement, and maintain our cloud infrastructure using AWS CDK. Focus on high availability, disaster recovery, and cost efficiency.</p> </li><li> <p>Automated Operations: Develop robust automation using code to manage infrastructure, deploy applications, handle monitoring, and execute system recovery, driving down manual effort.</p> </li><li> <p>Observability: Implement and manage comprehensive monitoring, logging, and alerting systems to provide deep visibility into system health, performance, and key Service Level Objectives (SLOs).</p> </li><li> <p>Incident Response: Lead incident response, root cause analysis (RCA), and post-mortem processes to identify and resolve systems-level issues and prevent recurrence.</p> </li></ul> <p>Database Performance and Development:</p> <ul> <li> <p>Database Management: Own the operational health and performance tuning of critical relational and NoSQL database systems in the cloud.</p> </li><li> <p>Software Development: Act as a contributing developer, writing clean, well-tested code in core ATK services.</p> </li><li> <p>Security and Compliance: Implement and enforce security best practices across infrastructure and data layers, including network segmentation, access control (IAM), and encryption.</p> </li></ul> <p>Skills and Experience Required</p> <p>Technical Expertise & Development Proficiency:</p> <ul> <li> <p>SRE/DevOps Experience: 5+ years of progressive experience in an SRE, or highly technical systems engineering role.</p> </li><li> <p>Cloud Architecture: Expert-level, hands-on experience designing and managing production environments in AWS (e.g., EC2, Lambda, ECS/EKS, VPC, RDS).</p> </li><li> <p>Database Mastery: Deep understanding of database internals, performance tuning, and operational management for data stores.</p> </li><li> <p>Coding Skills: Proven proficiency in at least one modern programming language used for systems automation, tooling, and backend service development.</p> </li><li> <p>Containerization: Strong experience with container orchestration technologies.</p> </li><li> <p>IaC Tools: Hands-on expertise with Infrastructure-as-Code tools.</p> </li></ul> <p>Execution and Communication:</p> <ul> <li> <p>Problem Solver: Exceptional ability to diagnose and solve complex production issues across multiple domains (network, application, database, and infrastructure).</p> </li><li> <p>Collaboration: Strong track record of successfully partnering with software development teams to improve service reliability and delivery pipelines.</p> </li><li> <p>Technical Communication: Ability to clearly and concisely communicate technical concepts, status, and post-mortems to both engineering and leadership teams.</p> </li></ul> <p>Qualifications</p> <ul> <li> <p>Bachelor's degree in Computer Science, Engineering, or equivalent professional experience</p> </li><li> <p>5+ years of experience in software development and/or site reliability engineering</p> </li><li> <p>Extensive AWS experience</p> </li><li> <p>Experience with high-traffic, customer-facing websites and apps</p> </li><li> <p>Mastery of</p> </li></ul>
POST A JOB
It's completely FREE to post your jobs on ZiNG! There's no catch, no credit card needed, and no limits to number of job posts.
The first step is to SIGN UP so that you can manage all your job postings under your profile.
If you already have an account, you can LOGIN to post a job or manage your other postings.
Thank you for helping us get Americans back to work!
It's completely FREE to post your jobs on ZiNG! There's no catch, no credit card needed, and no limits to number of job posts.
The first step is to SIGN UP so that you can manage all your job postings under your profile.
If you already have an account, you can LOGIN to post a job or manage your other postings.
Thank you for helping us get Americans back to work!