Platform Site Reliability Engineer

Dicetek LLC

Experience

5 - 10 Years

Education

Bachelor's in Computer Applications (Computers), Bachelor of Technology/Engineering (Computers), Master's in Computer Applications (Computers), Master of Technology/Engineering (Computers)

Nationality

Any Nationality

Gender

Any

Vacancy

1 Vacancy

Job Description

Roles & Responsibilities

  • Resiliency Engineering (SRE): Implement "Chaos Engineering" and load testing to ensure web/mobile backends can handle banking-scale traffic. Maintain high availability through automated recovery scripts.

  • Automated Regression: Build CI/CD-integrated test suites using Python that validate both the application logic and the infrastructure state (IaC validation).

  • Observability & SLIs: Define and monitor Service Level Indicators (SLIs) and Service Level Objectives (SLOs). Set up advanced alerting in Azure Monitor or AWS CloudWatch to catch performance degradation before users notice it.

  • Security & Compliance Testing: Automate security scans and compliance checks to ensure all AI data handling meets strict banking data residency and privacy requirements.

Desired Candidate Profile

  • Technical & Professional Requirements:

    • Automation Stack: High proficiency in Python (for AI testing) and test automation frameworks (pytest, Selenium, or Robot Framework).

    • Cloud Infrastructure: Strong hands-on experience with Azure or AWS, specifically regarding networking, scaling, and serverless reliability.

    • AI/ML Understanding: Familiarity with prompt engineering and with how to evaluate AI model outputs (RAG evaluation, ROUGE/BLEU scores, or custom LLM benchmarks).

    • Monitoring Tools: Experience with Grafana, Prometheus, or native cloud monitoring tools to build real-time reliability dashboards.

    • FinOps Awareness: Ability to identify "expensive" failing tests or inefficient cloud resource usage during the testing phase.

  • Recommended Skillset & Tools:

    • Languages: Python (Mandatory), Bash scripting.

    • Tools: GitHub Actions (CI/CD), Terraform (reading and validating configurations), k6 or JMeter (performance testing).

    • AI Frameworks: DeepEval, Ragas, or LangSmith (for automated AI evaluation).

Employment Type

    Full Time

Keywords

  • Automation
  • Infrastructure As Code
  • Cloud Operations Engineer
  • Alerting
  • Technical Operations Engineer
  • Performance Tuning
  • DevOps Engineer SRE Focus
  • Incident Response

Dicetek LLC

Dicetek is a global IT Solutions and Services Company established in 2006 with its corporate headquarters in Singapore. We continue to expand our global network while providing value-added, cost-effective consulting services to our clients. Dicetek has operational offices in India, the UAE, Singapore, and the USA. As a world-class company with a regional focus, we primarily concentrate on providing Information Technology Solutions and Professional Consulting Services across verticals such as Banking & Financial Services, Telecom, Government, Oil & Gas, Logistics, Supply Chain, Real Estate, and Manufacturing. We have a solid reputation in the technology industry for providing excellent services to our clients. Our values are represented by our integrity, thought leadership, and commitment to maintaining a high level of excellence in the constantly evolving world of Information Technology.

Rizwana Ashfaq Ashfaq - Manager - Talent Acquisition

Office No. 307, 3rd Floor, New Century Tower, Port Saeed Road, Opp. Deira City Centre, Dubai, United Arab Emirates (UAE)

https://www.dicetek.net