Work Experiences

Professional ML engineering experience delivering production systems with measurable business impact across e-commerce and IoT domains.

Senior ML Engineer (Contract) at Mid-sized European E-commerce Marketplace (Client - NDA)

Aug 2022 - Nov 2025Remote | Europe

RAG-Powered Search & Discovery

Built a production-grade RAG system that improved search-to-purchase conversion by 4% and delivered sub-500ms p99 latency at 3M queries/month

  • Led design of BM25 + dense vectors with re-ranking to define latency/error SLOs and cost targets upfront
  • Built evaluation framework (offline metrics, CI/CD regression tests, load envelopes) including scalable LLM-driven approach to create golden dataset for ranking evaluation
  • Architected for scale (3M queries/month) and delivered p99 latency under 500 ms at EUR 0.001 per query, improving search-to-purchase by 4%
PythonLLMsVector SearchRAG ArchitectureAWS

Customer Lifetime Value (CLV) Prediction

Built a production CLV prediction system that improved marketing ROI by 12% through value-based customer targeting and segmentation

  • Built segmented CLV models (early-stage vs established). Improved RMSE by 15-20%, Gini 0.47 to 0.62 for budget-constrained CRM, enabling 12% higher marketing ROI
  • Diagnosed feedback loop where top-decile targeting reinforced bias; led cross-functional exploration budget experiment (10% random allocation), retrained on less-biased data for better high-potential discovery
  • Shipped weekly batch pipeline scoring 300K+ profiles with segment monitoring, costing €75/month
PythonXGBoostAWS SageMakerAWS GlueAWS Step Functions

Real-Time Purchase Intent Scoring

Deployed a session-level purchase propensity model achieving 5% conversion uplift with 40% reduction in p99 latency through Redis optimization

  • Deployed sub-second scoring for 60K+ daily predictions (Feast + ElastiCache) with Spark Structured Streaming features (50K+ events/hour) and class-weighted LightGBM (20:1 imbalance)
  • Optimized feature fetch to achieve 40% p99 latency reduction
  • Resolved production calibration breakdown eroding marketing trust; added segment-wise reliability analysis + post-hoc calibration, restoring confidence.
PythonLightGBMSpark Structured StreamingFeast Feature StoreRedis/ElastiCache

LLM Generated Review Summaries

Implemented automated review summarisation using LLMs to surface key insights and sentiment for shoppers

  • Fine-tuned Mistral-7B on 100K+ reviews; cut LLM cost 60% vs GPT-4 baseline while maintaining quality
  • Partnered with product and legal to define governance standards (model cards, versioning, bias-aware retrieval, PII redaction)
  • Productionized batch RAG with automated quality checks (hallucination/toxicity/relevance)
PythonLLMsAWS LambdaNLP
eSMART Technologies

ML Engineer at eSMART Technologies

Jun 2018 - Dec 2020Remote | Renens, Switzerland

NEC Labs America

Machine Learning Research Intern at NEC Labs America

Feb 2017 – Jul 2017Onsite | Princeton, New Jersey, USA

Education

Other

Signalchip Innovations

Signal Processing Engineer at Signalchip Innovations

Feb 2012 – May 2015Bangalore, India

Worked on multi-user detection algorithms for WCDMA/LTE systems, contributing to 1 US patent in wireless communications signal processing

  • Joined as 5th employee of semiconductor startup, contributing to core signal processing algorithms
  • Developed and validated WCDMA uplink receiver algorithms (Path Searcher, RAKE, Multiuser Detection), achieved 3GPP NodeB receiver conformance (BER <0.001) across multi-path/interference channel conditions
  • Patent: Developed symbol level interference cancellation method (US Patent 9602240), Patent: Developed optimized channel estimation system reducing computational complexity (US Patent 20160365991)
WCDMALTEOctaveDigital Signal ProcessingWireless Communications
Indian Institute of Science

Junior Research Fellow at Indian Institute of Science

Jun 2011 – Jan 2012Bangalore, India

Research fellowship focused on developing efficient image reconstruction algorithms for Diffuse Optical Tomography, resulting in a published paper in Medical Physics journal

  • Developed a data-resolution matrix method (from the sensitivity/Jacobian model + regularization) to identify independent measurements. Used diagonal vs. off-diagonal structure to select informative measurements and reduce data-collection time.
  • Achieved ~20% more independent measurements than singular-value selection while preserving reconstruction quality
  • Published first-author paper in Medical Physics journal.
MATLABDiffuse Optical TomographyImage ReconstructionInverse ProblemsOptimization Algorithms

Work With Me

I bring hands-on experience delivering production MLOps and GenAI systems at moderate scale—with minimal infrastructure footprint and cost-effective architectures. I'm excited to collaborate on building next-generation Agentic AI systems. Whether you need expertise in MLOps, GenAI, or Agentic AI—let's connect.

Contact Me