and do you see a need for this to begin with? to me it seems like cosyvoice gives much better results