JavaScript is disabled!
Please enable JavaScript in your web browser!

COMPUTER VISION ENGINEER LLM and AI Integration DUNCAN AND ROSS MANAGEMENT CONSULTANCIES

Posted 18 min ago

Experience

3 - 7 Years

Education

Bachelor of Science(Computers), Master of Technology/Engineering(Computers)

Nationality

Any Nationality

Gender

Any

Vacancy

1 Vacancy

Job Description

Roles & Responsibilities

  • Develop and implement computer vision models for image classification, object detection, segmentation, facial recognition, and visual understanding.
  • Integrate vision models with LLMs (e.g., GPT, LLaVA, CLIP, or multimodal models) to build systems that interpret and describe visual content.
  • Design AI pipelines that combine text, images, and video data for multimodal learning and reasoning.
  • Utilize deep learning frameworks (TensorFlow, PyTorch, OpenCV) to prototype and deploy models.
  • Collaborate with data scientists and AI researchers to fine-tune vision-language models for specific tasks such as visual QA, captioning, or scene analysis.
  • Implement data preprocessing, augmentation, and annotation pipelines for large-scale image datasets.
  • Conduct performance benchmarking, optimization, and deployment of models in production environments.
  • Research and experiment with emerging techniques in Generative AI, multimodal transformers, and neural architecture optimization.
  • Develop APIs and tools for internal teams to utilize vision + LLM capabilities.
  • Ensure compliance with ethical AI practices, including bias mitigation and data privacy.

Desired Candidate Profile

QUALIFICATIONS:

  • Bachelors or Masters degree in Computer Science, AI, Computer Vision, or related field (PhD preferred).
  • 3-7 years of experience in computer vision, deep learning, or multimodal AI.
  • Strong proficiency in Python and frameworks such as PyTorch, TensorFlow, Keras, and OpenCV.
  • Experience integrating LLMs (GPT, Claude, Gemini, or open-source models) with vision systems.
  • Solid understanding of transformer architectures, CNNs, diffusion models, and attention mechanisms.
  • Familiarity with multimodal datasets (COCO, Visual Genome, etc.) and evaluation metrics for vision tasks.
  • Experience with cloud-based AI tools (Azure AI, AWS Sagemaker, Google Vertex AI, etc.).
  • Ability to write clean, scalable, and production-grade code.
  • Strong analytical, problem-solving, and communication skills.

PREFERRED QUALIFICATIONS:

  • Experience with multimodal LLM frameworks such as CLIP, BLIP, LLaVA, or Kosmos-2.
  • Background in natural language processing and prompt engineering.
  • Hands-on experience with edge deployment (NVIDIA Jetson, OpenVINO, ONNX).
  • Knowledge of reinforcement learning, generative models, or 3D vision.
  • Publications or open-source contributions in AI research are a plus.

Employment Type

    Full Time

Company Industry

Department / Functional Area

Keywords

  • COMPUTER VISION ENGINEER LLM And AI Integration
  • AI Integration
  • LLMs
  • AI Engineer

DUNCAN AND ROSS MANAGEMENT CONSULTANCIES

Duncan & Ross Engineering & Technology offers integrated and customer oriented services in the different industries such as Energy, Utilities, Oil & Gas, Construction, Healthcare & Life Sciences, Banking & Finance, Technology, Media & Telecommunications, Aerospace & Defense, Transport & Railway. We support our clients with Engineering Services onsite and offsite and Technology Services (Cloud & Security, Applications, Data & Analytics). Duncan & Ross can draw on its worldwide network of human capital, engineering consulting and quality oriented solutions. Our customers will profit from these synergies that play an important role in the processing of our projects.

Read More

Fabien Claeys - Operations Director

Ubora Business Tower Level 705 Post Code: 5002353 Dubai Knowledge Village, Dubai, United Arab Emirates (UAE)