Job Description
LeapXpert, the enterprise-grade responsible business communication platform, provides organizations peace of mind by creating an accessible digital record of all business interactions carried out over mobile messaging applications.
LeapXpert's Federated Messaging Orchestration Platform (FMOP™) is an interoperable, mobile-first solution that provides unparalleled visibility into data from instant messaging applications, as well as governance and control. This enables enterprises to embrace a customer-centered approach while maintaining professional conduct and ensuring compliance.
We are proud to have won some of the most prestigious awards in the regulatory and fintech industries. To take this further, we need to find our best teammates!
We're looking for a passionate Site Reliability Engineer to join our rapidly growing, award-winning team, which is responsible for ensuring the stability and reliability of our platform across environments. You will take charge of continuously analyzing the existing infrastructure from a reliability perspective, with a focus on removing performance bottlenecks and optimizing the infrastructure, the toolkit, and the workflows involved in running it.
Building a product is a highly collaborative effort, and as such, a strong team player with a commitment to perfection is desired.
Responsibilities:
- Monitor and analyze the performance, stability, and security of our software systems and infrastructure
- Identify and troubleshoot issues proactively, employing effective root cause analysis and problem-solving techniques
- Develop and implement automation tools and scripts to streamline deployment, configuration, and monitoring processes
- Collaborate with software engineers to optimize system performance, scalability, and reliability
- Design, implement, and maintain the observability stack to ensure system health and availability
- Participate in the on-call rotation to provide prompt response and resolution to production incidents
- Continuously research, evaluate, and implement best practices, tools, and technologies to enhance system reliability and efficiency.
Job Requirements
Must-have Qualifications:
- At least 3 years of proven experience as a Site Reliability Engineer or in similar roles, managing complex software systems in a production environment
- Strong knowledge of cloud infrastructure platforms (AWS, Azure, or GCP) and containerization technologies (Docker, Kubernetes)
- Proficiency in scripting and automation using languages like Python, Bash, or JavaScript
- Experience with monitoring and logging tools such as Prometheus and the LGTM stack (Loki, Grafana, Tempo, Mimir)
- Solid understanding of networking principles, system administration, and security best practices
- Ability to collaborate effectively with cross-functional teams and communicate complex technical concepts clearly
- Strong problem-solving skills and the ability to work well under pressure
- Experience with system performance testing.
- Good English communication skills.
Preferred Qualifications:
- Certification in cloud technologies (AWS Certified, Azure Certified, etc.)
- Knowledge of database systems (SQL, NoSQL) and messaging systems (Kafka)
- Familiarity with testing frameworks (Grafana k6, Apache JMeter)
- Experience with CI/CD pipelines and IaC tools (GitHub Actions, Terraform)
- Familiarity with Agile/Scrum methodologies and working in an Agile development environment.
Benefits
If you are looking for:
- An awesome job with an attractive remuneration package
- A hybrid working model with flexible working time and place, plus an office in Binh Thanh District with a comfortable working environment
- Friendly colleagues who support each other to win as a team
- A flat, product-focused organization and an Agile team where you can contribute your value and ideas to the product and company
- Opportunities to learn and be trained in new technologies and methodologies
- New and innovative challenges in delivering a commercial-grade, world-class product
- Career growth in multiple directions, based on your preferences and abilities
Benefits:
- 13th-month salary, paid pro rata every month to allow more flexible financial planning
- 18 days of annual leave and 1 day of birthday leave
- Full salary on probation
- Full social insurance in accordance with Vietnamese labor law
- Premium Health Care Insurance
- Annual health check and vaccination
- Lunch and parking allowances
- Annual Performance Review
- Attractive career path
- Flexible working time and place with a hybrid working model
- Advanced technical solutions, agile culture, and the opportunity to work with the latest technologies
- ESOP based on contribution