Supply chains run the world. We make them run themselves. At Sensos, we turn real-world data into a trusted foundation that enables companies to transform their supply chains into autonomous execution services, acting before problems arise, at scale. We're a fast-moving global team at the intersection of data, agentic AI, autonomous execution, and global supply chains.
As a Data Scientist focused on AI Agents, you will design autonomous agents that monitor real-time supply chain visibility data and execute automated actions directly within Sensos's core platform. You'll own this work end-to-end, from shaping requirements with Product to deploying and evaluating production systems.
Responsibilities:
Design and deploy agent systems. Architect and ship multi-agent workflows and decision-making logic that monitor end-to-end supply chain logistics and trigger automated, real-time responses.
Own evaluation and observability. Build the eval frameworks, monitoring, and safe-rollout practices that let us trust autonomous agents in production - measuring accuracy, drift, and real-world impact against supply chain outcomes.
Translate product needs into systems. Partner directly with Product Management to turn business logic, user needs, and logistics metrics into scalable analytical and agentic solutions.
Build with engineering. Work alongside backend engineers to integrate real-time IoT telemetry and streaming pipelines into agent-driven actions.
Requirements:
BSc or MSc in Computer Science, Data Science, or related field (research experience is an advantage).
3+ years in Data Science, Machine Learning, or a closely related field.
GenAI in production. Proven track record building production GenAI applications: multi-step agents, RAG pipelines, tool-augmented LLMs, and structured outputs.
Strong evaluation and observability practices for LLM systems - this is a must. You've designed eval sets, run regression tests on prompts and agents, monitored production LLM behavior, and made safe-rollout decisions based on real metrics.
Agent orchestration. Hands-on experience designing and orchestrating multi-step or multi-agent systems - managing state, tool use, retries, fallbacks, and handoffs between agents.
Voice agents. Experience building conversational voice agents that handle real-time phone or voice interactions.
Strong ML fundamentals. Solid grounding in classical ML and deep learning - model selection, training, evaluation metrics, handling imbalanced data, feature engineering, and the trade-offs between model families. Comfortable reasoning about when an LLM is the right tool and when it isn't.
Hands-on with the modern agent stack. Proficient in Python. Familiarity with major LLM providers like OpenAI, Azure OpenAI, Google Vertex AI, Anthropic, etc.
Startup mindset. Comfortable working independently in a fast-paced environment, owning ambiguous problems, and finding creative paths forward.
Nice to have
Hands-on experience deploying models on Microsoft Azure (e.g., Azure Foundry, Azure AI Search).
Background in supply chain, logistics, IoT, or other real-time/streaming domains.