Phoneme-to-Audio Alignment with Recurrent Neural Networks for Speaking and Singing Voice Permalink
Y. Teytaut and A. Roebel. Phoneme-to-audio alignment with recurrent neural networks for speaking and singing voice. In Proceedings of Interspeech 2021, pages 61–65. International Speech Communication Association; ISCA, 2021