Engineering Manager SRE/DevOps
Published on May 08, 2023
Founded in 2015 by former NSA cyber operators, Huntress was built on a simple premise: to force hackers to earn every inch of their access. Today’s cyber-attacks aren’t limited to large organizations that have the security tools to ward off threats. Hackers don’t discriminate and will exploit any vulnerability in a business of any size. Huntress enables IT providers and resellers to stop hidden threats that sneak past preventive security tools. Through a combination of expert human threat hunters, a comprehensive platform, and a desire to make the world a safer place, we’re working to deliver cybersecurity to the 99%: the small to midsize businesses that make up the backbone of our economy. Join the hunt and help us stop hackers in their tracks!
- Lead the design, build, maintenance, and operations of our AWS Cloud infrastructure to enable reliable and rapid deployment of services with effective monitoring and resilient operations
- Build an observability plan and use it to maintain and improve reliability
- Lead the build of automated tools for cloud operations such as automated remediation of vulnerabilities, auto-scaling, etc.
- Lead the build-out of a scalable, efficient, and robust Continuous Integration and Continuous Deployment (CI/CD) pipeline
- Lead the build-out of an automated test framework that validates end-to-end system functionality
- Design and execute a plan to transition observability tools and responsibilities for appropriate systems and applications to Product Teams
- Support Product Teams in instrumenting observability and alerting into product systems and applications
- Participate in an on-call rotation
- Coach and mentor a diverse team of engineers with emphasis on collaboration, teamwork, and creativity
- Lead the team to build and set standards for team excellence, driving performance improvements at both team and individual levels
- Lead with transparency, candidly challenge assumptions, and exhibit integrity above all else
- Hands-on experience performing as a Site Reliability Engineer (SRE)
- Experience defining, negotiating, measuring, and satisfying Service Level Objectives (SLOs)
- Hands-on expertise working with AWS Cloud environments and managed services (Redis, RDS, S3, etc.)
- Hands-on expertise developing infrastructure-as-code (IaC)
- Experience working on Linux-based infrastructure
- Proficiency in scripting (shell scripts; Python preferred)
- Experience supporting Ruby on Rails applications (or similar) in production environments
- Configuration and infrastructure management experience with cloud-based databases
- Experience with monitoring tools such as Datadog
- Effective communication and interpersonal skills, with the ability to work with and coordinate between multiple teams
- Experience developing and maintaining CI/CD pipelines, processes, and tools
- Minimum of a BS in Computer Science or an Engineering field, or equivalent experience
- 5+ years of experience developing complex software products
- 3+ years of experience hiring, managing, coaching, and mentoring engineers at various stages of their careers
- Experience with different software development methodologies such as Agile, Scrum, and Kanban
- Excellent technical, diagnostic, and troubleshooting skills
- 100% remote work environment since our founding in 2015
- Generous paid time off policy including vacation, sick time, and paid holidays
- 12 weeks paid parental leave
- Highly competitive and comprehensive medical, dental, and vision benefits plans
- 401(k) with 5% contribution regardless of employee contribution
- Life and disability insurance plans
- Stock options for all full-time employees
- One-time $500 stipend to build/upgrade home office
- Annual allowance for education and professional development assistance
- $75 USD/month digital reimbursement
- Access to both Udemy and BetterUp platforms for coaching, personal, and professional growth