Company Overview 10Pearls is an award-winning end-to-end digital innovation company that helps businesses imagine and build the future. We are proud to announce that 10Pearls was named as winner of the Best Tech Work Culture Timmy Award in Washington DC by Tech in Motion, recognized on the Inc. 5000 Fastest-Growing Companies List, and was ranked the #1 Most Diverse Midsize Company in Greater Washington. We partner with businesses to help them transform, scale, and accelerate by adopting digital and exponential technologies. Our work has ranged from creating highly usable, secure digital experiences, mobile and software products, to helping businesses modernize through cloud adoption and development and the digitalization of their business processes. Our clientele is highly diverse, including Global 1000 enterprises, mid-market businesses, and high-growth start-ups. But those are just the facts. What makes us unique is that we have true heart and soul. We have a strong focus on a double bottom line and actively support and engage with the communities where we live and work to make the world a better place. In a nutshell, we believe in doing well, while doing good, and know how to balance the two.
Role 10Pearls is seeking a Senior AI Engineer with strong expertise in LLMOps and Generative AI systems. The ideal candidate will have hands-on experience building and scaling LLM-powered applications, working with RAG pipelines, prompt engineering, and Azure AI ecosystem. You will play a key role in designing, deploying, and optimizing intelligent AI workflows in production environments.
Responsibilities • Design and develop LLM-powered applications using modern frameworks like LangChain and LangGraph • Build and optimize RAG (Retrieval-Augmented Generation) pipelines using Azure AI Search • Integrate and manage Azure OpenAI services for scalable AI solutions • Implement prompt engineering and prompt versioning strategies for consistent model performance • Develop and maintain LLMOps pipelines for deployment, monitoring, and iteration of AI models • Work with Redis or similar caching systems to optimize performance and reduce latency • Collaborate with cross-functional teams (Product, Engineering, Data) to deliver AI-driven features • Monitor model performance and continuously improve accuracy, cost, and latency • Ensure best practices in scalability, security, and reliability of AI systems
Requirements • Bachelor’s or Master’s degree in Computer Science, AI, or related field • 3–6 years of experience in AI/ML or backend development • Hands-on experience with LLMs and Generative AI applications • Strong proficiency in Python • Experience working with Azure OpenAI, Azure AI Search, or similar platforms • Solid understanding of RAG architectures and vector-based retrieval systems • Experience with LangChain, LangGraph, or similar orchestration frameworks • Exposure to MLOps / LLMOps practices (deployment, monitoring, versioning) • Strong problem-solving and analytical skills
Nice to Have • Experience with multi-agent AI systems • Exposure to vector databases and embeddings optimization • Experience optimizing LLM cost-performance tradeoffs • Familiarity with cloud-based deployment and monitoring tools