Survey of Text-to-Speech synthesis (TTS) from WaveNet in 2016 to VALL-E in Jan, 2023.

  • Read WaveNet paper
  • Explored opensource music datasets
  • Decided to implement music generation conditioned on mood/theme based on WaveNet architecture
  • choose dataset