ROLE RESPONSIBILITIES
We are looking for a CloudOps Engineer with expertise in AWS cloud operations, cloud security, and infrastructure automation. This role is critical in ensuring our cloud platforms are secure, scalable, cost-efficient, and highly available. You will take ownership of cloud optimization, security posture management, and automation using Infrastructure as Code and scripting.
Cloud Operations & Reliability
- Operate and maintain cloud infrastructure on AWS, ensuring high availability, performance, and scalability
- Monitor, troubleshoot, and resolve complex cloud infrastructure issues across multiple environments
- Implement best practices for cloud operations, resilience, and disaster recovery
- Collaborate with application teams to improve operational excellence for serverless workloads
Cloud Security
- Design, implement, and maintain secure cloud architectures following security best practices
- Identify, analyze, and remediate security issues and vulnerability findings from security tools and audits
- Work closely with security teams to improve cloud security posture and compliance
- Implement IAM policies, network security controls, encryption, and logging/monitoring
Infrastructure as Code & Automation
- Design and manage infrastructure using Terraform and IaC best practices
- Build and maintain automation scripts and tools using Python (or similar scripting languages)
- Automate cloud operations, security controls, monitoring, and remediation workflows
Cloud Optimization
- Drive cloud cost optimization initiatives (FinOps mindset)
- Optimize performance, scalability, and resource utilization across AWS services
- Provide recommendations and implement improvements to reduce waste and improve efficiency
REQUIRED SKILLS & EXPERIENCE
- 2+ years of experience in Cloud Operations
- Strong hands-on experience with AWS services, including:
- EC2, VPC, IAM, RDS, S3, CloudWatch, CloudTrail, AppSync, API Gateway
- Serverless services such as AWS Lambda
- Solid understanding of networking concepts (VPC design, routing, security groups, NACLs, VPNs)
- Proven experience in cloud security, vulnerability management, and incident remediation
- Hands-on experience with Terraform for Infrastructure as Code
- Strong automation skills using Python (or equivalent scripting language)
- Experience implementing monitoring, logging, and alerting solutions
- Ability to troubleshoot complex cloud and security issues under pressure
Nice to Have
- Experience with cloud security tools (e.g., GuardDuty, Wiz, etc)
- Knowledge of DevOps / CI/CD pipelines
- Familiarity with container platforms (EKS, ECS, Docker)
- AWS certifications (e.g., Solutions Architect, Security Specialty)
Soft Skills
- Strong problem-solving and analytical skills
- Ability to work independently and take ownership of cloud environments
- Clear English communication skills and ability to collaborate across teams
- Security-first mindset with a passion for automation and continuous improvement