Data Engineer
Institute Of Foundation Models
Posted 30+ days ago
Experience
1 - 7 Years
Job Location
Education
Bachelor of Science (Computers)
Nationality
Any Nationality
Gender
Not Mentioned
Vacancy
1 Vacancy
Job Description
Roles & Responsibilities
As a Data Engineer specializing in Natural Language Processing (NLP) and large-scale data processing, you will quickly and effectively gather, curate, and prepare high-quality datasets to support cutting-edge NLP research. Your role will be instrumental in enabling researchers by delivering essential data through efficient and scalable engineering practices, including web crawling, LLM-generated content refinement, and robust data pipelines, primarily leveraging Python and related technologies.
Key Responsibilities
- Rapidly collect, curate, and preprocess datasets based on detailed specifications provided by NLP researchers, delivering data within tight timelines (typically within 1-2 days).
- Develop and maintain efficient web crawling solutions, APIs, and automated workflows to continuously improve data collection processes.
- Refine and evaluate outputs from Large Language Models (LLMs) to generate structured datasets suitable for model training and benchmarking.
- Implement scalable data pipelines, ensuring efficient data processing, storage, retrieval, and distribution to research teams.
- Collaborate closely with researchers and engineers to ensure collected data meets specified quality and relevance criteria.
- Document data collection methodologies, dataset characteristics, and pipeline architecture clearly and effectively.
- Engage with peer teams and participate in technical reviews to uphold best practices and data quality standards.
- Represent MBZUAI at industry and research forums, showcasing technical capabilities in large-scale data processing and AI data infrastructure.
- Perform all other duties as reasonably directed by the line manager commensurate with these functional objectives.
Desired Candidate Profile
Bachelor's degree in Computer Science, Data Science, Engineering, or a related technical field required.
Master's degree or equivalent experience in Computer Science, Data Engineering, or a related technical field preferred.
Company Industry
- Education
- Training
- Teaching
- Academics
Department / Functional Area
- IT Software
Keywords
- Data Engineer
Institute Of Foundation Models
We are a dedicated research lab for building, understanding, using, and risk-managing foundation models. Our mandate is to advance research, nurture the next generation of AI builders, and drive transformative contributions to a knowledge-driven economy. As part of our team, you'll have the opportunity to work on the core of cutting-edge foundation model training, alongside world-class researchers, data scientists, and engineers, tackling the most fundamental and impactful challenges in AI development. You will participate in the development of groundbreaking AI solutions that have the potential to reshape entire industries. Strategic and innovative problem-solving skills will be instrumental in establishing MBZUAI as a global hub for high-performance computing in deep learning, driving impactful discoveries that inspire the next generation of AI pioneers.
https://jobs.lever.co/ifm-us/b6c7bc76-7a13-4a2c-84ce-d01c61d08d80