Pyspark Developer

Valuelabs

Employer Active

Posted 5 hrs ago

Experience

5 - 7 Years

Education

Bachelor of Science(Computers)

Nationality

Any Nationality

Gender

Not Mentioned

Vacancy

1 Vacancy

Job Description

Roles & Responsibilities

As a Data Engineer, you will design and maintain scalable ETL pipelines, ensuring high data quality, performance, and reliability across our data ecosystem.

Key Responsibilities

  • Build and optimize ETL pipelines using PySpark on CDP
  • Ingest data from RDBMS, APIs, and file systems into data lakes/warehouses
  • Cleanse and transform large datasets to support analytics
  • Tune PySpark code and Cloudera components for performance
  • Implement data quality checks and validation routines
  • Automate workflows using Apache Oozie, Airflow, or similar tools
  • Monitor pipeline health and troubleshoot issues
  • Collaborate with cross-functional teams to meet data requirements
  • Document processes, code, and configurations

Desired Candidate Profile

Required Qualifications

  • Bachelors/Masters in Computer Science, Data Engineering, or related field
  • Minimum 5 years of experience in Data Engineering roles

Technical Skills

  • PySpark (RDDs, DataFrames, optimization)
  • Cloudera Data Platform (Cloudera Manager, Hive, Impala, HDFS, HBase)
  • Data Warehousing & SQL (Hive, Impala)
  • Big Data Tools (Hadoop, Kafka)
  • Workflow Orchestration (Oozie, Airflow)
  • Linux scripting and automation

Soft Skills

  • Strong analytical and problem-solving abilities
  • Excellent communication skills
  • Team player with the ability to work independently
  • Detail-oriented with a focus on data accuracy

Department / Functional Area

Keywords

  • Pyspark Developer

Disclaimer: Naukrigulf.com is only a platform to bring jobseekers & employers together. Applicants are advised to research the bonafides of the prospective employer independently. We do NOT endorse any requests for money payments and strictly advice against sharing personal or bank related information. We also recommend you visit Security Advice for more information. If you suspect any fraud or malpractice, email us at abuse@naukrigulf.com

Valuelabs

ValueLabs is a global technology company committed to delivering innovative solutions and exceptional client experiences. We are expanding our data engineering team in Hyderabad and are looking for a skilled Data Engineer with strong expertise and the Cloudera Data Platform (CDP).

https://www.naukri.com/job-listings-270426910410

Similar Jobs

Data Engineer

Data Engineer

Azure Data Engineer

Analyst "Data Analyst"

Allelife Consulting LLC

  • 3 - 7 Years
  • Dubai - United Arab Emirates (UAE)
View All