Responsibilities:
- Manage VM/Cloud Infrastructure: Ensure that web servers and cloud services (AWS, GCP) are stable and perform optimally. Manage both on-prem and cloud-based infrastructure following DevSecOps best practices, including network design and segmentation
- Develop and Maintain Scripts and Tools: Write and maintain scripts (bash, python) to automate routine tasks and improve system efficiency.
- Build and Contribute to Our Monitoring System: Set up and manage monitoring systems using Prometheus and Grafana to track system performance and send alerts.
- Prepare CI/CD Pipelines: Implement and maintain automated deployment pipelines using GitLab CI, ArgoCD, and FluxCD.
- Design and Optimize Infrastructure: Design and optimize the infrastructure to ensure system stability, minimize downtime, and enhance overall performance.
- Web Server and Platform Management: Manage web server configurations and security, including Nginx, Kubernetes ingress, load balancers, DNS, WAF, and firewall rules, ensuring high availability and secure operations.
- Collaborate with Development Teams: Work closely with development and production teams to streamline deployment processes and resolve system and security-related issues.
Qualifications:
- Experience: At least 5 years in DevOps, System Engineering, or SRE roles with strong hands-on experience managing web servers and Linux systems.
- Strong Linux/Unix Knowledge: Deep expertise in Linux system administration (Ubuntu, CentOS, RedHat), including performance tuning, troubleshooting, and security hardening.
- Strong Networking Fundamentals: Solid understanding of networking concepts including TCP/IP, DNS, HTTP/HTTPS, TLS, NAT, load balancing, VPNs, and firewall rules, with the ability to troubleshoot complex network issues across on-prem and cloud environments.
- Cloud Experience: Strong experience designing, deploying, and operating cloud infrastructure on AWS and GCP, including VPC design, subnets, routing tables, security groups, NACLs, IAM, and cloud load balancers.
- CI/CD Tools Experience: Proven hands-on experience with CI/CD tools such as Jenkins, GitLab CI, GitHub Actions, and ArgoCD, FluxCD including pipeline design, automation, and security integration.
- Kubernetes & Containerization: Strong experience with Docker and Kubernetes (GKE, EKS), including cluster networking, ingress/egress, service meshes (nice to have), and workload security.
- Monitoring & Observability: Setup and operation of monitoring, alerting, and observability systems, such as Prometheus, Grafana, and APM tools, to improve visibility into system and application health and support proactive operations.
- Scripting Skills: Proficient in writing automation scripts (bash, python) for system administration tasks.
- Plus: Experience with DBA or database management (MySQL, PostgreSQL, MongoDB) is a bonus
Job Benefits:
We believe that motivation & personality of the employees are the only shortcut to the promotion of the corporate and contributions to the society. We will try our best to create a corporate environment where all employees can realize their dreams and goals.
Featured benefits include:
- Have opportunity to work with global merchants and join the dynamic, young and friendly project team; stable career path;
- Attractive salary based on skills and experience; 13th month salary & seniority bonus; Employee's marriage, maternity bonus; Birthday voucher gift;
- Annual salary review;
- Premium Healthcare, annual health check;
- Regular technical seminar & external/ internal training courses;
- Providing free coffee, tea & snack;
- Internal engagement events: Teambuilding; Town-hall, birthday gift voucher, mid-autumn, new year and kick-off parties, yearly company trip;
- FireGroup Sports Clubs: Running, Football, Badminton, etc;
- Laptop/ PC/ Monitor are provided