Site Reliability Engineer (SRE) Dicetek LLC

Posted on 24 Feb

Experience

5 - 10 Years

Job Location

Dubai - United Arab Emirates (UAE)

Education

Bachelors in Computer Application (Computers), Bachelor of Technology/Engineering (Computers), Bachelor of Science (Computers), Master of Technology/Engineering (Computers)

Nationality

Any Nationality

Gender

Any

Vacancy

1 Vacancy

Job Description

Roles & Responsibilities

Reliability & Operations
- Ensure high availability, reliability, and performance of production systems.
- Define, monitor, and manage SLIs, SLOs, and SLAs.
- Lead incident response, root cause analysis (RCA), and post-incident reviews.
- Implement proactive monitoring and alerting to prevent outages.
Automation & Engineering
- Automate repetitive operational tasks using scripting and infrastructure-as-code.
- Improve system reliability through engineering solutions rather than manual intervention.
- Reduce toil by building tools, automation, and self-healing systems.
Cloud & Infrastructure
- Design and manage scalable infrastructure on cloud platforms (AWS / Azure / GCP).
- Manage containerized workloads using Docker and Kubernetes.
- Implement and maintain CI/CD pipelines for safe and frequent deployments.
Monitoring & Observability
- Build and maintain observability solutions using tools such as:
  - Prometheus, Grafana
  - ELK / OpenSearch
  - Datadog, New Relic
- Track system performance, capacity planning, and error budgets.
Security & Compliance
- Ensure reliability practices align with security standards.
- Participate in on-call rotations and ensure secure system operations.
- Collaborate with security teams to implement secure infrastructure practices.

Desired Candidate Profile

Bachelor’s degree in Computer Science, Engineering, or related field.
Strong experience in Linux/Unix system administration.
Proficiency in at least one scripting or programming language: Python, Go, Bash, or Java.
Experience with cloud platforms (AWS / Azure / GCP).
Hands-on experience with Kubernetes and container orchestration.
Knowledge of networking fundamentals (TCP/IP, DNS, load balancing).
Experience with monitoring, alerting, and incident management.

Preferred / Nice-to-Have Skills

Experience implementing SRE best practices based on Google's SRE principles.
Knowledge of Terraform, Ansible, or CloudFormation.
Experience with service mesh (Istio, Linkerd).
Understanding of chaos engineering tools (Gremlin, Chaos Mesh).
Experience in fintech, banking, or high-availability systems.

Employment Type

Full Time

Keywords

Automation
Infrastructure As Code
DevOps Engineer
Service Reliability Engineer
Technical Operations Engineer
Security Practices
Performance Tuning
Site Operations Engineer

Dicetek LLC

Dicetek is a global IT Solutions and Services Company established in 2006 with its corporate headquarters in Singapore. We continue to expand our global network while providing value-added, cost-effective consulting services to our clients. Dicetek has operational offices in India, UAE, Singapore & USA. As a world-class company with a regional focus, we primarily concentrate on providing Information Technology Solutions and Professional Consulting Services across different verticals like Banking & Financial Services, Telecom, Government, Oil & Gas, Logistics, Supply Chain, Real Estate & Manufacturing. We have a solid reputation in the technology industry for providing excellent services to our clients. Our values are represented by our integrity, thought leadership, and commitment to maintaining a high level of excellence in the constantly evolving world of Information Technology.

Rizwana Ashfaq Ashfaq - Manager, Talent Acquisition

Office No. 307 - 3rd Floor, New Century Tower, Port Saeed Road, Opp. Deira City Centre, Dubai, United Arab Emirates (UAE)

https://www.dicetek.net