Senior AI Engineer
We are looking for Senior AI Engineer, expert in Python focused on AI/ML, NLP, or LLM -based applications.
Job description
Project overview
For our client we are enriching our existing development team which will be responsible for further development of one of our most successful products. The product is an integrated delivery platform that leverages the depth and breadth of real-world engagements’ experience. It also enables cross-team collaboration, real-time transparency and insightful decision-making.
Responsibilities:
· Architect end -to-end LLM -based pipelines using RAG, embeddings, and orchestration layers.· Design multi -agent systems using LangChain, LangGraph, CrewAI, or AutoGen, emphasizing modularity and scalability.
· Evaluate model options (e.g., OpenAI, Azure OpenAI) based on tradeoffs in performance, cost, and capabilities.
· Define embedding strategies, context windows, and prompt structures for complex reasoning tasks.
· Build chunking and ingestion pipelines for PDFs and unstructured documents using Python SDKs (LangChain, pyPDF, RecursiveCharacterTextSplitter).
· Integrate and deploy vector stores (e.g., Azure AI Search, FAISS, postgres, chromadb ) with semantic search and reranking techniques.
· Implement CI/CD pipelines, test coverage, and API integrations (e.g., via Postman, GitHub Actions).
· Build and maintain well -structured, reusable Python classes for LLM tools, pipelines, agents, and evaluation modules.
· Debug and optimize complex, distributed Python systems involving multiple services and third -party APIs.
· Develop and maintain Jupyter Notebooks for prototyping, analysis, and executive storytelling, balancing clarity and depth.
· Create and deploy Docker containers for reproducible development, testing, and production environments.
· Work with CI/CD pipelines (GitHub Actions or similar) to ensure high -quality, testable code.
· Use vibe coding tools (Cursor, GitHub Copilot) to accelerate development while ensuring security, compliance, and reusability.
· Develop and orchestrate autonomous agents with well -defined roles, tools, and memory -sharing strategies.
· Implement observer and fallback agents to enhance system resilience and reduce hallucinations.
· Use telemetry and observability tools (e.g., Data dog, Lang Fuse) to monitor, debug, and optimize performance.
· Define and track evaluation metrics for GenAI and RAG systems (faithfulness, precision, recall, F1, semantic similarity).
· Use sklearn, NumPy, and Ragas evaluation loops to validate model performance and content grounding.
· Implement output schema validation, prompt constraints, and quality assurance processes for enterprise -readiness.
· Build interactive prototypes using Streamlit, LangFlow, or Jupyter Notebooks to demonstrate capabilities and insights.
· Translate complex models into clear, explainable insights for executive stakeholders.
· Lead design sessions and mentor developers on AI best practices and tooling.
Prerequisites and skills
Skills Required:
· 8+ years of experience in software engineering, with at least 3+ years focused on AI/ML, NLP, or LLM -based applications.
· Expert -level proficiency in Python, with a strong focus on building and debugging modular, class -based code for reusable, production -grade systems.
· Proven experience designing and deploying GenAI solutions using prompt engineering, embedding -based retrieval, and multi -agent orchestration patterns.
· Deep hands -on experience with modern AI/LLM frameworks including LangChain, LangGraph, CrewAI, AutoGen, and Hugging Face.
· Demonstrated ability to ingest, chunk, and semantically index large -scale unstructured data (e.g., PDFs, HTML, JSON) using LangChain, embedding models, and custom pipelines.
· Proficient in deploying and querying vector databases such as Azure AI Search, FAISS , or Postgres with pgvector for semantic search applications.
· Strong working knowledge of Jupyter Notebooks, Docker, GitHub, and RESTful API integration to accelerate development and prototyping workflows.
· Familiarity with DevOps and CI/CD pipelines (e.g., GitHub Actions) to enable rapid iteration, testing, and deployment of AI pipelines.
· Skilled in evaluating LLM and RAG pipeline performance using metrics like semantic similarity, precision, recall, and faithfulness with tools such as sklearn, NumPy, and RAGAS.
· Hands -on experience implementing observability and telemetry using platforms like LangFuse , Datadog, or custom logging/tracing solutions.
· Comfortable working with AI -powered developer tools such as GitHub Copilot, Cursor, and other vibe coding assistants to enhance velocity and maintainability.
Preferred Qualifications
· Experience with multimodal LLMs, vision -language models, or tool -augmented inference.
· Understanding of reasoning models, ReAct -style prompting, and planner -executor agents.
· Experience with agile ceremonies, sprint planning, and cross -functional delivery teams.
· Demonstrated ability to evaluate and compare multiple RAG pipelines using structured and human -in-the-loop evaluation methods.
· Strong communication skills — able to interface with engineering and executive audiences alike.
Further information
Seniority: Senior
Location: Fully remote, with possible occasional in person team sessions / workshops / gatherings (i.e. 1x quarter) likely to take place in Prague
US Hours overlap needed (2-6 pm CET)?: Minimum 2-6pm CET, preferred 2
Language: EN
Start: ASAP
About the company
Technology and consulting software company.
Apply for a position

You will be guided through the selection process by Jessica. If you have any questions? Call +420 605 006 814.
Personal data
I hereby give my consent to the processing of my personal data contained in the job application, professional CV, personal questionnaire, personal data obtained from the contact form on the company’s website and personal data obtained during the interview at GrainTech Bohemia s. r. o. in accordance with Act No. 18/2018 Coll. on the Protection of Personal Data (hereinafter referred to as “Act No. 18/2018 Coll.”) by GrainTech Bohemia s. r. o. for the purpose of employment mediation. Consent may be revoked in writing at any time, otherwise the consent expires 3 years from the date of its granting and the data will be anonymised and further used exclusively for statistical purposes. I also acknowledge that the rights of the data subject are regulated in Section 59 et seq. of Act No. 18/2018 Coll. I declare that I fully understand the conditions of processing my personal data and I give my consent knowingly, voluntarily and without reservation.