Cloud Operations Engineer (Remote – US/EST time zone preferred)
We are seeking a highly motivated Cloud Operations Engineer to join our fully remote team. This role is crucial to maintaining and optimizing our cloud infrastructure, primarily within Amazon Web Services (AWS), with a strong emphasis on using infrastructure as code. The ideal candidate will be a proactive problem-solver with a passion for building robust, scalable, and secure cloud environments. This is a great opportunity for someone who may be relatively new to Cloud Operations or interested in transitioning from a development role.
While the role is fully remote within the US or Canada, most work is conducted during US Eastern or US Central time zone business hours to facilitate team collaboration. We offer flexibility in scheduling to accommodate individual needs.
Responsibilities:
- Design, implement, and manage cloud infrastructure using Infrastructure as Code (IaC) principles, primarily with AWS.
- Develop and maintain cloud resources using AWS Cloud Development Kit (CDK) and TypeScript.
- Monitor cloud services and applications to ensure optimal performance, availability, and security using Datadog.
- Implement and manage data backup and disaster recovery strategies, ensuring data integrity and rapid restoration capabilities.
- Ensure compliance with relevant industry standards and internal security policies, especially in environments with higher compliance requirements (e.g., SOC 2, FedRAMP).
- Automate operational tasks and workflows to improve efficiency and reduce manual effort.
- Participate in on-call rotations for incident response and troubleshooting critical issues.
- Collaborate with development and other teams to streamline deployment processes and improve overall system reliability.
- Document infrastructure, processes, and procedures clearly and concisely.
- Stay up-to-date with the latest AWS services, best practices, and industry trends.