About the role
AddEvent is seeking a skilled Senior DevOps Engineer to join our engineering team and drive the automation, scalability, and reliability of our infrastructure. In this role, you will partner with developers, QA, and product teams to design, implement, and maintain systems that support fast, secure, and reliable software delivery. As our first dedicated DevOps engineer, you will assume responsibility for our cloud infrastructure, monitoring and observability, CI/CD pipeline, security, and IAC.
This is a full-time, fully remote position for someone within PT to ET time zones and comes with an excellent compensation package, including a competitive salary, equity, and top-tier health, dental, and vision coverage.
What you'll be responsible for
- Manage and optimize our AWS infrastructure with a focus on performance, scalability, security, and cost efficiency.
- Introduce and enforce best practices for observability, including centralized logging, metrics, and alerting.
- Monitor, troubleshoot, and optimize system performance, availability, and reliability.
- Implement Infrastructure as Code for consistent and repeatable environment provisioning.
- Design, implement, and maintain CI/CD pipelines for seamless software delivery.
- Ensure security and compliance across all systems, including network security, identity, and access control.
- Collaborate closely with software engineering teams to streamline deployment processes and improve developer productivity.
- Perform capacity planning and disaster recovery design/testing.
- Understand cloud spend and recommend areas for cost reduction and performance improvement; ensure efficient use of cloud resources by leveraging cost management tools and best practices.
- Drive adoption of DevOps culture and practices across the engineering team.
Requirements
- 5+ years of experience in DevOps, Site Reliability Engineering (SRE), or a related role.
- Deep expertise in AWS, including EC2, ECR, VPCs, SGs, RDS, CloudWatch, CloudFront, CloudTrail, S3, GuardDuty, Inspector, etc.
- Strong proficiency with CI/CD tools (Bitbucket Pipelines, GitLab CI, CircleCI, etc.).
- Proficiency in infrastructure-as-code frameworks, particularly Terraform
- Solid understanding of networking, security principles, and system administration (Linux/Unix).