Role overview
For a global pharmaceutical company digitalizing the distribution of medical content for healthcare professionals — we are looking for an AI Engineer. The client's team is building an AI-powered query processing system that delivers accurate, safety-checked responses to doctors and medical staff in real time.
Responsibilities
- Designing and optimizing a multi-phase query processing pipeline with deterministic fast-paths, vector search, and parallel processing
- Integrating and managing Azure OpenAI models for query classification, HyDE rewriting, and safety evaluation
- Managing OpenSearch indices for FAQ, SmPC, and PromoMats content with cosine similarity scoring
- Developing and maintaining a safety guardrails framework covering adverse events, off-label use, PII detection, and prompt injection
- Optimizing LLM prompts, temperature settings, and latency to meet P95 response time targets
- Implementing fallback strategies and circuit-breaker patterns for service resilience
- Designing global resource pooling (boto3 sessions, thread pools, HTTP clients) to minimize per-request overhead
Basic qualifications
- Hands-on experience with Azure OpenAI and LLM orchestration
- Strong Python skills including async and threaded processing
- Practical knowledge of vector search and OpenSearch
- Experience with prompt engineering, evaluation metrics, and performance tuning
- Nice to have: safety guardrails design, embedding model A/B testing, circuit-breaker patterns, FastAPI
Benefits
- 150 000 Kč / měsíc nebo si řekni o víc
- Freelancer
- Hybrid
- Remote
- Plný úvazek
- Praha
Tags & focus areas
Used for matching and alerts on DevFound Fulltime Remote Ai Ai Engineer Generative Ai