Skip to content

using MMseqs2 to get the totally different proteins from one dataset compared to another dataset #973

@liyue9129

Description

@liyue9129

Hi !

I want to use MMseqs2 to obtain protein sequences from the query dataset that are completely dissimilar to the protein sequences in the target dataset (e.g., with a similarity threshold of 0.3). What should I do? Can I achieve this goal using the following code:

`
mmseqs search queryDB targetDB resultDB tmp

mmseqs filterresult queryDB targetDB resultDB resultDB0.3 --max-seq-id 0.3

mmseqs createtsv queryDB targetDB resultDB0.3 resultDB0.3.tsv
`

The results I obtained using the above code only show the protein sequences from the query dataset that are below the threshold for certain proteins in the target dataset( resultDB0.3.txt ), which is confusing to me. Did I make a mistake?

Thank you!
Best wishes!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions