VLM Engineer

Technology Innovation Institute

Employer Active

Posted 14 hrs ago

Experience

1 - 3 Years

Education

Bachelor of Technology/Engineering(Computers)

Nationality

Any Nationality

Gender

Not Mentioned

Vacancy

1 Vacancy

Job Description

Roles & Responsibilities

As part of TII s Artificial Intelligence Research Center, the Extreme-Scale Language Model team is developing and implementing innovative deep learning technologies with a broad range of applications, from Natural Language Processing to Perception and Vision. Our team has developed the Falcon models and is now planning to continue our journey into cutting-edge applied research in the topic of large language models.

Key Responsibilities:

  • Vision Model Ablation Studies: Conduct comprehensive ablation studies on vision models to assess the impact of various components and configurations. Collaborate with researchers to analyze and report on the effectiveness of different model architectures and settings.
  • Data Ablation Research: Partner with team members to perform data ablation studies, identifying optimal data types and structures for training vision-language models. Also, analyze the impact of different data inputs on model performance, particularly focusing on vision-language alignment.
  • Model Evaluation: Develop and implement robust evaluation protocols for vision language models. Assess model performance across diverse benchmarks and real-world scenarios.
  • Model Training and Optimization: Engage in model training, with an emphasis on integrating LLMs with vision models like CLIP.

Desired Candidate Profile

Skills Required:

  • Expertise in machine learning, particularly in vision-language models and LLMs.
  • Strong understanding of model architectures like CLIP and their application in vision language tasks.
  • Proficiency in distributed training techniques and multi-GPU optimization.
  • Experience with deep learning frameworks (e.g., PyTorch).
  • Strong analytical skills for conducting ablation studies and evaluating model performance.
  • Familiarity with dataset curation and processing for vision and language tasks.

Qualifications:

  • PhD degree in deep learning.
  • Proven track record of research and development in vision-language models.
  • Publication record in top-tier conferences is highly desirable

Department / Functional Area

Keywords

Disclaimer: Naukrigulf.com is only a platform to bring jobseekers & employers together. Applicants are advised to research the bonafides of the prospective employer independently. We do NOT endorse any requests for money payments and strictly advice against sharing personal or bank related information. We also recommend you visit Security Advice for more information. If you suspect any fraud or malpractice, email us at abuse@naukrigulf.com