On the lookout for the correct text-to-speech mannequin? The 1.6 billion parameter mannequin Dia could be…