Information Technology (IT) | Prishtina | 1 week ago | Full Time
AI Engineer (m/f/d)
Location: Prishtina
About Us:
Borek Solutions is a global company headquartered in Braunschweig, Germany, with offices in Vadodara (India) and Prishtina (Kosovo). We specialize in IT, Business Process Management, and Sales, helping partners overcome talent shortages and accelerate growth. With a heritage dating back to the 17th century, we are committed to innovation, progress, and high-quality delivery.
Your Role:
As an AI Engineer, you will design, build, and deploy production-grade AI systems that leverage state-of-the-art machine learning techniques and large language models (LLMs) to solve real-world problems at scale. Working cross-functionally with product, data, and infrastructure teams, you will ship intelligent features, optimize model performance, and translate cutting-edge research into tangible business value.
Your Responsibilities:
Design, develop, and maintain ML pipelines and LLM-powered applications from prototype through production deployment.
Fine-tune, prompt-engineer, and evaluate large language models (e.g., GPT, Claude, Llama, Mistral) for domain-specific use cases.
Build retrieval-augmented generation (RAG) systems, agentic workflows, and tool-use architectures around foundation models.
Develop and maintain robust evaluation frameworks, including automated benchmarks, human-in-the-loop evaluation, and A/B testing.
Optimize model inference for latency, throughput, and cost, including quantization, distillation, and efficient serving strategies.
Collaborate with data engineers to build and curate high-quality training and evaluation datasets.
Stay current with the rapidly evolving AI/ML landscape and translate research breakthroughs into production-ready features.
Contribute to technical design documents, architecture reviews, and engineering best practices across the AI team.
Your Profile:
3+ years of experience in Machine Learning Engineering, AI Engineering, or a closely related role.
Strong proficiency in Python and ML frameworks such as PyTorch, TensorFlow, or JAX.
Hands-on experience working with large language models, such as fine-tuning, prompt engineering, RLHF, or building LLM-based applications.