Skip to content

如何提升说话人识别的准确率 #411

@chenfuckthesky

Description

@chenfuckthesky

在使用wespeaker的过程中,发现很多时候无法把说话人分离开,比如附件里的这个录音,是一男一女两个人在对话,音色的差别听上去还挺大的,但是最后测试的结果是下面这样的。所以我的问题是,有没有什么参数,比如相似度之类的,可以提升准确率。我仔细看了Speaker这个类,但是没有收获:

('unk', 0.1, 1.9, 0)
('unk', 2.0, 4.1, 0)
('unk', 4.7, 5.7, 0)
('unk', 29.8, 30.3, 0)
('unk', 32.5, 33.2, 0)
('unk', 33.5, 36.1, 0)
('unk', 36.5, 38.9, 0)
('unk', 40.3, 44.8, 0)
('unk', 45.1, 46.2, 0)
('unk', 46.9, 50.6, 0)
('unk', 50.7, 56.4, 0)
('unk', 58.3, 64.4, 0)
('unk', 67.2, 67.7, 0)
('unk', 67.9, 73.1, 0)
('unk', 73.9, 74.9, 0)
('unk', 76.9, 78.4, 0)
('unk', 78.9, 81.4, 0)
('unk', 83.6, 85.4, 0)
('unk', 88.5, 91.6, 0)
('unk', 91.7, 92.3, 0)
('unk', 92.6, 94.1, 0)
('unk', 94.3, 94.7, 0)
('unk', 94.8, 95.4, 0)

1ac0e486-68a2-11ee-a110-591e6b00846c-all.zip

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions