SUMMARY:
- Data Science Architect with over 13+ years of experience in MLOPS, Analytics, and Data Science.
- Proven track record of leading successful teams and implementing innovative solutions to drive business growth.
- Result-oriented performer, with extensive team management and customer-facing experience.
- Extensive experience in mapping ad-hoc business requirements to a deliverable analytics solution.
- Hands-on strong> experience with Cloud platforms GCP, AWS, Azure, and DevOps tools like Docker, Kubernetes, and Jenkins.
- Strong communication and leadership skills with a focus on collaboration and teamwork.
Read More
Autonomous Triage Agent for ML jobs running on internal AI platform
- Engineered a read-only AI Agent to automate the triage of complex ML job failures, assisting platform users to quickly diagnose errors.
- Built a custom Python MCP Server to provide the agent with a secure, real-time interface to Kubernetes logs and cluster metadata.
- Architected a RAG pipeline using Amazon Aurora (pgvector) with HNSW indexing, enabling high-speed semantic retrieval of runbooks.
- Implemented Agentic Memory within Aurora to store and correlate incident briefs, identifying recurring failure patterns automatically.
- Reduced median time-to-first-plausible-diagnosis by 15–25% and standardized the triage process across distributed engineering teams.
Read More