Senior DevOps Engineer
FIREGROUP AT A GLANCE
Founded in 2016 in Vietnam, FireGroup Technology is committed to building world-class SaaS products that make a global impact on e-commerce. From the outset, we envisioned a world where running an online business is easy, effective, and sustainable, and we have been bringing that vision to life for nearly a decade.
Our portfolio includes TrueProfit, Transcy, OneMobile, OneLoyalty, Zopi, Promer, Ali Reviews. Trusted by more than 450,000 merchants in over 175 countries and recognized by global leaders such as Shopify, Google, Meta, TikTok, and Amazon, FireGroup is proud to elevate Vietnamese innovation onto the world stage.
Why build your career with FireGroup
- Make a Global Impact: Contribute to products trusted by merchants worldwide and help solve real challenges that transform e-commerce.
- Build with an AI-first Mindset: Use AI as your daily co-pilot to make smarter decisions and accelerate problem-solving at scale.
- Unleash Your Potential: Fast-track your growth with career mobility, diverse learning resources, and an agile environment that empowers bold thinkers.
In a culture that connects us as a whole
What truly defines us is not only our technology but also our people, guided by our seven core values of Courage, Creativity, Growth, Teamwork, Ownership, Trust, and Empathy. At FireGroup, work is more than a job; it is a legacy in the making. We move with speed to solve meaningful challenges, and every Firer is encouraged to dream boldly, lead with courage, and create impact that endures.
Discover how our values come to life in every team and product we build on our culture page.
Responsibilities
- Manage VM/Cloud Infrastructure: Ensure that web servers and cloud services (AWS, GCP) are stable and perform optimally. Manage both on-prem and cloud-based infrastructure following DevSecOps best practices, including network design and segmentation
- Develop and Maintain Scripts and Tools: Write and maintain scripts (bash, python) to automate routine tasks and improve system efficiency.
- Build and Contribute to Our Monitoring System: Set up and manage monitoring systems using Prometheus and Grafana to track system performance and send alerts.
- Prepare CI/CD Pipelines: Implement and maintain automated deployment pipelines using GitLab CI, ArgoCD, and FluxCD.
- Design and Optimize Infrastructure: Design and optimize the infrastructure to ensure system stability, minimize downtime, and enhance overall performance.
- Web Server and Platform Management: Manage web server configurations and security, including Nginx, Kubernetes ingress, load balancers, DNS, WAF, and firewall rules, ensuring high availability and secure operations.
- Collaborate with Development Teams: Work closely with development and production teams to streamline deployment processes and resolve system and security-related issues.
Qualifications
- Experience: At least 5 years in DevOps, System Engineering, or SRE roles with strong hands-on experience managing web servers and Linux systems.
- Strong Linux/Unix Knowledge: Deep expertise in Linux system administration (Ubuntu, CentOS, RedHat), including performance tuning, troubleshooting, and security hardening.
- Strong Networking Fundamentals: Solid understanding of networking concepts including TCP/IP, DNS, HTTP/HTTPS, TLS, NAT, load balancing, VPNs, and firewall rules, with the ability to troubleshoot complex network issues across on-prem and cloud environments.
- Cloud Experience: Strong experience designing, deploying, and operating cloud infrastructure on AWS and GCP, including VPC design, subnets, routing tables, security groups, NACLs, IAM, and cloud load balancers.
- CI/CD Tools Experience: Proven hands-on experience with CI/CD tools such as Jenkins, GitLab CI, GitHub Actions, and ArgoCD, FluxCD including pipeline design, automation, and security integration.
- Kubernetes & Containerization: Strong experience with Docker and Kubernetes (GKE, EKS), including cluster networking, ingress/egress, service meshes (nice to have), and workload security.
- Monitoring & Observability: Setup and operation of monitoring, alerting, and observability systems, such as Prometheus, Grafana, and APM tools, to improve visibility into system and application health and support proactive operations.
- Scripting Skills: Proficient in writing automation scripts (bash, python) for system administration tasks.
- Plus: Experience with DBA or database management (MySQL, PostgreSQL, MongoDB) is a bonus
Job Benefits
We believe that motivation & personality of the employees are the only shortcut to the promotion of the corporate and contributions to the society. We will try our best to create a corporate environment where all employees can realize their dreams and goals.
Featured benefits include:
- Have opportunity to work with global merchants and join the dynamic, young and friendly project team; stable career path;
- Attractive salary based on skills and experience; 13th month salary & seniority bonus; Employee’s marriage, maternity bonus; Birthday voucher gift;
- Annual salary review;
- Premium Healthcare, annual health check;
- Regular technical seminar & external/ internal training courses;
- Providing free coffee, tea & snack;
- Internal engagement events: Teambuilding; Town-hall, birthday gift voucher, mid-autumn, new year and kick-off parties, yearly company trip;
- FireGroup Sports Clubs: Running, Football, Badminton, etc;
- Laptop/ PC/ Monitor are provided
Contacts
Should you need more information about this job, reach out to us at: