Jobs at Hub71 startups

Senior Site Reliability Engineer (AWS/EKS, GitOps)

Purpl

Software Engineering
Beirut, Lebanon
Posted on Jul 2, 2025

Purpl is a Lebanese fintech startup that provides secure and user-friendly financial solutions, including instant money transfers and QR code payments. The company is dedicated to driving economic development across Lebanon and the region by promoting the adoption of financial technology. Purpl offers exceptional customer care, zero-fee ATM cash outs, and is constantly developing new features to enhance the user experience.

Position Overview

As a Senior Site Reliability Engineer, you will lead the work of keeping our infrastructure on AWS and EKS (Elastic Kubernetes Service) reliable, scalable, and secure. You will also implement GitOps practices to streamline our deployment pipelines and improve operational efficiency. This role requires a deep understanding of cloud-native technologies and Kubernetes orchestration, and a strong commitment to delivering robust solutions.

Key Responsibilities

  • Design, build, and maintain scalable and secure infrastructure on the cloud.
  • Implement and optimize GitOps workflows for continuous integration and delivery (CI/CD).
  • Manage AWS services including VPC, EKS, WAF, EC2, ALB, RDS (PostgreSQL), CloudWatch, SES, ElastiCache, Transfer Family (SFTP), and SNS.
  • Configure and manage Kubernetes clusters, including the Amazon EBS CSI driver, aws-load-balancer-controller, and Rancher (Kubernetes dashboard).
  • Set up and maintain monitoring and logging solutions using Prometheus, Promtail, Loki, Grafana, and Sentry.
  • Automate infrastructure deployment and configuration management using Terraform or OpenTofu.
  • Collaborate with the company's business units to understand their requirements, then design, implement, and ensure the smooth deployment and operation of systems that meet them.
  • Monitor, observe, and improve system performance, reliability, and availability.
  • Automate tasks using scripting languages and configuration management tools.
  • Conduct root cause analysis for production errors and implement solutions to prevent recurrence.
  • Participate in on-call rotation and respond to incidents promptly.
  • Stay up-to-date with industry trends and best practices in SRE, cloud infrastructure, and DevOps methodologies.

Required Skills

  • Proven experience as a Site Reliability Engineer or similar role, with a focus on AWS and Kubernetes.
  • Hands-on experience with GitOps principles and tools (e.g., Git, Flux, Argo CD, GitLab CI/CD).
  • Proficiency in scripting and automation using Python, Bourne(-Again) shell, or similar languages.
  • Experience with Linux iptables, security appliances, routing software, and VPN technologies, such as Cisco ASA, OPNsense, and WireGuard.
  • Strong knowledge of containerization and orchestration technologies (Docker, Kubernetes).
  • Experience with infrastructure-as-code tools (e.g., Terraform, OpenTofu, CloudFormation).
  • Experience with monitoring tools such as Prometheus, Grafana, and Sentry.
  • Solid understanding of networking, security, and monitoring concepts in cloud environments.
  • Excellent troubleshooting skills and ability to analyze complex systems.
  • Strong communication skills and ability to collaborate effectively with cross-functional teams.

Nice-to-Haves

  • Familiarity with Google Cloud Platform (GCP).
  • Relevant certifications (e.g., AWS Certified DevOps Engineer, Kubernetes certifications).

Qualifications

  • Bachelor's degree in Computer Science, Network Engineering, Information Technology, or a related field (or equivalent experience).
  • 5+ years of professional experience as a Site Reliability Engineer.
  • Strong problem-solving skills and ability to troubleshoot complex issues.
  • Excellent communication skills and ability to work effectively in a collaborative team environment.

Why Join Us

  • Opportunity to work with cutting-edge technologies in a collaborative and innovative environment.
  • Competitive compensation package with comprehensive benefits.
  • Career growth and professional development opportunities.
  • Flexible work environment and a culture that values work-life balance.
  • Impactful role in shaping the future of our infrastructure and operations.

Equal Opportunity Employer

We are an equal-opportunity employer and value diversity at our company. We do not discriminate based on race, religion, color, national origin, gender, sexual orientation, age, marital status, or disability status.