Careem is building the Everything App to simplify everyday life across the Middle East by integrating transportation, food and grocery delivery, payment management, and more into a single platform. Since its inception in 2012, Careem has empowered over 2.5 million Captains to earn income and served more than 70 million customers. Operating in over 70 cities across 10 countries, from Morocco to Pakistan, the company is dedicated to innovation and excellence. The team focuses on advancing persistent storage technologies and enhancing cloud-native solutions to scale the Careem platform while improving service reliability and performance.
Key Responsibilities
Design, build, and maintain Kafka clusters and their ecosystems to ensure resilience, reliability, and scalability for services used by millions daily. Lead the development and operation of storage clusters and associated infrastructure on AWS, influencing the product lifecycle from design through deployment. Develop support tools and technical processes that simplify workflows and empower engineers across multiple services. Identify opportunities to automate and scale systems while maintaining strict security and reliability standards. Participate actively in on-call rotations, contributing to incident response improvements and operational excellence.
Required Qualifications
A minimum of 5 years’ experience in site reliability engineering or a related field is essential. Proven expertise in developing, operating, and troubleshooting storage clusters or other highly available systems at scale is required. Hands-on experience running Kafka on Kubernetes is necessary. Proficiency in one or more programming languages such as Go, Python, Java, Groovy, Scala, or Ruby is expected. A strong background in cloud infrastructure, preferably AWS, is important. Experience automating infrastructure using Infrastructure as Code principles is required. Solid Unix/Linux skills, including scripting and knowledge of the network stack, are essential. Familiarity with incident response and incident management practices is advantageous.
Preferred Qualifications and Benefits
Experience with automation tools related to monitoring, continuous integration/continuous deployment (CI/CD), and security enhancements is a plus. A passion for building scalable, secure, and reliable systems that support a large user base is highly valued. Commitment to fostering a collaborative environment that encourages innovation and continuous improvement is important. Careem is an equal opportunity employer that values diversity and inclusion, ensuring a workplace free from discrimination based on any protected status under applicable laws. Demographic data collection is voluntary and used solely for internal equal employment opportunity monitoring and diversity initiatives. Joining Careem means contributing to a transformative platform impacting millions while growing your career in a dynamic, purpose-driven organization.
Monitoring, Unix, Kubernetes, Incident Management, Kafka, Groovy, Scala, Security Enhancements, Ruby, Storage Clusters, AWS, Infrastructure Automation, Go, Site Reliability Engineering, Java, Linux, Incident Response, Scripting, CI/CD, Python, Network Stack, Infrastructure as Code
Industry:
Total Positions: 1 Post
Job Shift: First Shift (Day)
Job Type:
Job Location:
Gender: No Preference
Age: 18 - 65 Years
Experience: 3 Years - 5 Years
Apply Before: Oct 17, 2025
Monthly based
Dera Ghazi Khan Division,Punjab,Pakistan