At Okala, we are looking to hire a skilled and proactive Site Reliability Engineer (SRE) who will play a critical role in maintaining system reliability, improving automation, and supporting scalable production environments.
With a hands-on mindset, you will help drive operational excellence and ensure high availability of our services.
your story belongs here.
What You'll Be Doing:
- Deploy updates, patches, and fixes while providing advanced technical support
- Design and build automation tools to reduce errors and enhance system stability and customer experience
- Perform root cause analysis on production incidents and resolve complex technical issues
- Develop scripts to automate repetitive operational tasks
- Design and maintain procedures for system troubleshooting and preventive maintenance
- Rapidly build effective working relationships with teams and drive issues toward resolution through clear communication
- Configure and maintain monitoring and alerting systems to ensure service reliability
- Participate in on-call rotations to support 24/7 operational environments
What You Bring:
- Bachelor’s degree in Computer Science, Engineering, or a related field (or equivalent practical experience)
- At least 1 year of hands-on experience in Site Reliability Engineering, DevOps, or a closely related role
- Strong expertise in Linux system administration and troubleshooting in production environments
- Excellent scripting and automation skills using Bash, Python, or similar languages
- Solid understanding of databases, storage systems, and SQL
- Deep knowledge of containerization and orchestration, especially Kubernetes
- Strong experience designing and maintaining CI/CD pipelines (GitLab CI/CD preferred)
- Experience implementing automated deployment, testing, and rollback strategies
- Hands-on experience with monitoring, logging, and alerting (observability) tools
- Familiarity with configuration management and infrastructure automation tools such as Ansible
- Strong understanding of networking concepts (DNS, routing, firewalls, load balancing)
- Experience with distributed systems and event streaming platforms (e.g., Kafka)
- Strong problem-solving skills including incident response, root cause analysis, and reliability improvement
- Proactive, self-driven, and solution-oriented mindset
Why Okala?
Because at Okala, you’re not just maintaining systems — you’re helping power one of Iran’s leading online retail platforms.
With services across 200+ cities and millions of daily users, we move fast, learn continuously, and are always looking for smarter, more scalable solutions.
What You’ll Enjoy:
- A dynamic and friendly work environment
- Weekly gatherings: Mafia Night, Cinema Night & more
- Access to practical and high-quality training programs
- Free breakfast, commuting & lunch subsidies
- Supplementary health insurance, in-house doctor, and parking
- Birthday gifts, seasonal bonuses, and exclusive Okala discount codes
Ready to build, automate, and scale with us?
Let’s write the next chapter of success together.