Our Journey So Far
At Snapp, we’re redefining how cities move. Our ride-hailing and mobility platform connects millions of riders and drivers every day, delivering safe, reliable, and efficient transport solutions. Powered by real-time data and robust infrastructure, we make urban travel faster, simpler, and more sustainable. We operate with the mindset of a global tech leader and the agility of a startup, building services that scale across markets while staying responsive to local needs.
Your Impact
In this role, you will strengthen the SRE Platform team’s mission by advancing the foundational platforms that automate manual workflows and elevate system reliability. Your work will ensure our staging environments remain stable and production-like, empowering QA and development teams to test, validate, and deploy their applications with confidence. You will also contribute to operational excellence through active participation in the weekly on-call rotation, supporting consistent and dependable infrastructure performance.
What You’ll Drive Forward
- Automate and optimize operational processes
- Enhance and maintain the observability stack
- Oversee management of test and staging environments
- Develop and support critical production components
- Handle and resolve production incidents
- Participate in the on-call rotation
What Powers Your Drive
- Strong teamwork and collaboration skills
- Solid understanding of SRE concepts, including SLIs, SLOs, SLAs, and error budgets
- Proficiency in Python or another scripting language
- Strong grasp of software engineering principles
- Hands-on experience with observability and monitoring tools such as Prometheus and Grafana
- Familiarity with logging stacks (e.g., ELK, Loki) and tracing systems (e.g., Jaeger, Tempo)
- Understanding of RDBMS and Redis
- Experience working with Kubernetes and related tooling (e.g., Helm)
Ready to Get on Board?
Help us shape the future of ride-hailing and urban mobility. Submit your CV and let’s build smarter cities together.