π― Currently, I am in Menlo Park, CA as a Research Scientist Intern at Meta.
π― Summer 2024 I was in Seattle, WA as a Research Intern at Microsoft Research.
π I work on Speech synthesis with probabilistic generative models
π¬ Ask me about: Python, Deep Learning, Machine Learning and Generative modelling
π« Reach me or read my blog at: https://shivammehta25.github.io/
π¬ Open for collaborations and interesting projects!
My recent works:
β‘ MAGI: Multimodal Audio and Gesture, Integrated: https://shivammehta25.github.io/MAGI/
β‘ π΅ Matcha-TTS: https://shivammehta25.github.io/Matcha-TTS/
β‘ Unified speech and gesture synthesis using flow matching: https://shivammehta25.github.io/Match-TTSG/
β‘ Diff-TTSG: https://shivammehta25.github.io/Diff-TTSG/
β‘ OverFlow: https://shivammehta25.github.io/OverFlow
β‘ Neural HMM TTS: https://shivammehta25.github.io/Neural-HMM