Talking face generation aims to synthesize, for each character, a sequence of face images and a voice that match the character's identity, ...