Applied Scientist, Amazon Alexa NLU

  • Worked on adopting a multi‑lingual encoder architecture for meeting latency constraints and minimising cost of NLU models.
  • Performed A/B testing on the new architecture to expose it to small percentage of traffic to analyse the performance before launch.
  • Captured metrics like latency, friction to come up with patches to fix the failures iteratively, also calibrated model confidence to reduce false accepts and false rejects