Skip to content

MHA_MQA_GQA代码 #11

@TS-mark

Description

@TS-mark

总结的很好,有一个小问题

        if attention_mask != None:
            attention_scores += attention_mask * -1e-9

这里这个值应该是-1e9

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions