Applied LLM Engineer
Tanishkaa Advisory
Multiple Vacancies
Employer Active
Posted 21 hrs ago
Nationality
Any Nationality
Gender
Not Mentioned
Vacancy
3 Vacancies
Job Description
Roles & Responsibilities
Run LoRA/QLoRA fine-tuning with the Hugging Face stack (transformers, datasets, peft, accelerate, bitsandbytes).
Quantize and export models (AWQ, GPTQ, GGUF), and stand up inference (vLLM, llama.cpp, TensorRT-LLM).
Build RAG: chunking, embeddings (bge/e5), vector stores (pgvector/Qdrant).
Desired Candidate Profile
Required Candidate Profile
PyTorch + HF ecosystem; hands-on LoRA/QLoRA runs and inference serving.
Quantization know-how, including export formats (AWQ/GPTQ/GGUF).
Retrieval & reranking experience; solid Python engineering practices.
Company Industry
- Consulting
- Management Consulting
- Advisory Services
Department / Functional Area
- IT Software
Keywords
- Applied LLM Engineer