New to using Kaldi, just need a model to extract good voice embeddings in a Python script from .wav files #4944

Open
@PhilipAmadasun

Does anyone have an example Python script that uses one of the x-vector extraction models developed here to extract embeddings? I've gone through some of the repo and have not found one.

I've tried other pre-trained embedding models, such as the pyannote embeddings, but the extracted vectors were not very accurate representations of speakers when scrutinized with cosine similarity (a lot of false positives and false negatives).
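To make the comparison concrete, here is a minimal sketch of a cosine-similarity check between two speaker embeddings. It assumes the embeddings are already available as plain lists of floats; the function names and the 0.5 threshold are hypothetical placeholders for illustration and are not taken from Kaldi, pyannote, or SpeechBrain.

```python
import math
from typing import Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Return the cosine of the angle between two embedding vectors."""
    if len(a) != len(b):
        raise ValueError("embeddings must have the same dimension")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("cannot score a zero vector")
    return dot / (norm_a * norm_b)


def same_speaker(a: Sequence[float], b: Sequence[float],
                 threshold: float = 0.5) -> bool:
    """Decide 'same speaker' when the score reaches the threshold.

    The default threshold is an assumed placeholder; in practice it
    would need tuning on labeled same/different-speaker pairs.
    """
    return cosine_similarity(a, b) >= threshold
```

A threshold set too low tends to produce false positives and one set too high tends to produce false negatives, so the error rates depend on the threshold as well as on the embedding model.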

I'm still testing an embedding model from SpeechBrain, but I would love to try the one developed in Kaldi, as it was recommended to me.

I would be very grateful for any help in this matter.
