Data Engineer (PySpark)
GSSTech Group
Posted on 20 Feb
Experience
5 - 7 Years
Job Location
Education
Bachelor of Science (Computers)
Nationality
Any Nationality
Gender
Not Mentioned
Vacancy
1 Vacancy
Job Description
Key Responsibilities
- Design, develop, and maintain scalable ETL/ELT pipelines using PySpark on Cloudera Data Platform (CDP)
- Ensure data integrity, reliability, and performance optimisation
- Develop ingestion frameworks to collect data from relational databases, APIs, streaming sources, and file systems
- Load structured and unstructured data into Data Lake/Data Warehouse environments
- Process, cleanse, and transform large-scale datasets using PySpark
- Build reusable data processing components
- Tune Spark jobs and Cloudera components for optimal performance
- Optimise memory, partitioning, and execution plans
- Reduce ETL runtime and improve cluster efficiency
- Implement data validation checks and monitoring mechanisms
- Ensure end-to-end data quality and governance standards
- Automate workflows using tools such as Apache Oozie, Apache Airflow, or similar orchestration frameworks
- Maintain CI/CD integration for data pipelines
- Monitor pipeline health and troubleshoot failures
- Provide production support and continuous improvements
Required Skills & Qualifications
- 5+ years of experience in Data Engineering
- Strong hands-on experience in PySpark
- Experience working on the Cloudera Data Platform (CDP)
- Strong knowledge of the Hadoop ecosystem (HDFS, Hive, Impala, YARN)
- Proficiency in SQL and data modelling concepts
- Experience with workflow orchestration tools (Airflow, Oozie, etc.)
- Good understanding of data warehousing concepts
- Experience with performance tuning and optimisation
Good to Have
- Experience with cloud platforms (AWS, Azure, GCP)
- Knowledge of streaming tools (Kafka, Spark Streaming)
- Exposure to DevOps practices and CI/CD pipelines
- Banking/Financial Services domain experience
Desired Candidate Profile
Keywords
- Data Engineer (PySpark)