MMseqs2 hangs at prefilter stage with large database (1B sequences, 4.1TB) using MPI + GPU build #994

hughuang12 · 2025-05-09T02:18:21Z

Hello,

I am compiling MMseqs2 from the latest source package downloaded from the project repository, with both MPI and GPU support enabled. After running tests on small datasets, I have confirmed that the installation is functioning correctly.

However, when working with large-scale data (specifically, a .faa file containing approximately 1 billion protein sequences, which results in a 4.1 TB database after mmseqs createdb), both mmseqs search and mmseqs linclust hang at the following stage:

Create directory search_tmp_multi
search createdb/MASH_oceans_proteins createdb/MASH_oceans_proteins results_multi/search_MASH_oceans search_tmp_multi --threads 96

MMseqs Version: GITDIR-NOTFOUND-MPI
Substitution matrix aa:blosum62.out,nucl:nucleotide.out
Add backtrace false
Alignment mode 2
Alignment mode 0
Allow wrapped scoring false
E-value threshold 0.001
Seq. id. threshold 0
Min alignment length 0
Seq. id. mode 0
Alternative alignments 0
Coverage threshold 0
Coverage mode 0
Max sequence length 65535
Compositional bias 1
Compositional bias scale 1
Max reject 2147483647
Max accept 2147483647
Include identical seq. id. false
Preload mode 0
Pseudo count a substitution:1.100,context:1.400
Pseudo count b substitution:4.100,context:5.800
Score bias 0
Realign hits false
Realign score bias -0.2
Realign max seqs 2147483647
Correlation score weight 0
Gap open cost aa:11,nucl:5
Gap extension cost aa:1,nucl:2
Zdrop 40
Threads 96
Compressed 0
Verbosity 3
Seed substitution matrix aa:VTML80.out,nucl:nucleotide.out
Sensitivity 5.7
k-mer length 0
Target search mode 0
k-score seq:2147483647,prof:2147483647
Alphabet size aa:21,nucl:5
Max results per query 300
Split database 0
Split mode 2
Split memory limit 0
Diagonal scoring true
Exact k-mer matching 0
Mask residues 1
Mask residues probability 0.9
Mask lower case residues 0
Mask lower letter repeating N times 0
Minimum diagonal score 15
Selected taxa
Spaced k-mers 1
Spaced k-mer pattern
Local temporary path
Use GPU 0
Use GPU server 0
Wait for GPU server 600
Prefilter mode 0
Rescore mode 0
Remove hits by seq. id. and coverage false
Sort results 0
Mask profile 1
Profile E-value threshold 0.1
Global sequence weighting false
Allow deletions false
Filter MSA 1
Use filter only at N seqs 0
Maximum seq. id. threshold 0.9
Minimum seq. id. 0.0
Minimum score per column -20
Minimum coverage 0
Select N most diverse seqs 1000
Pseudo count mode 0
Profile output mode 0
Min codons in orf 30
Max codons in length 32734
Max orf gaps 2147483647
Contig start mode 2
Contig end mode 2
Orf start mode 1
Forward frames 1,2,3
Reverse frames 1,2,3
Translation table 1
Translate orf 0
Use all table starts false
Offset of numeric ids 0
Create lookup 0
Overlap between sequences 0
Sequence split mode 1
Header split mode 0
Chain overlapping alignments 0
Merge query 1
Search type 0
Search iterations 1
Start sensitivity 4
Search steps 1
Exhaustive search mode false
Filter results during exhaustive search 0
Strand selection 1
LCA search mode false
Disk space limit 0
MPI runner mpirun -pernode -np 4
Force restart with latest tmp false
Remove temporary files false
Translation mode 0

MPI Init
Rank: 0 Size: 4
prefilter createdb/MASH_oceans_proteins createdb/MASH_oceans_proteins search_tmp_multi/14618197232166913224/pref_0 --sub-mat 'aa:blosum62.out,nucl:nucleotide.out' --seed-sub-mat 'aa:VTML80.out,nucl:nucleotide.out' -k 0 --target-search-mode 0 --k-score seq:2147483647,prof:2147483647 --alph-size aa:21,nucl:5 --max-seq-len 65535 --max-seqs 300 --split 0 --split-mode 2 --split-memory-limit 0 -c 0 --cov-mode 0 --comp-bias-corr 1 --comp-bias-corr-scale 1 --diag-score 1 --exact-kmer-matching 0 --mask 1 --mask-prob 0.9 --mask-lower-case 0 --mask-n-repeat 0 --min-ungapped-score 15 --add-self-matches 0 --spaced-kmer-mode 1 --db-load-mode 0 --pca substitution:1.100,context:1.400 --pcb substitution:4.100,context:5.800 --threads 96 --compressed 0 -v 3 -s 5.7

I am running this on four nodes with 96 CPU cores and approximately 700 GB of RAM each.

Could you please advise me on how to resolve this issue?

The text was updated successfully, but these errors were encountered:

martin-steinegger · 2025-05-16T17:59:49Z

We no longer actively develop MPI, while still testing it. Also we did not implement MPI support for GPU usage.
It is likely more effective to manually split the database.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

MMseqs2 hangs at prefilter stage with large database (1B sequences, 4.1TB) using MPI + GPU build #994

MMseqs2 hangs at prefilter stage with large database (1B sequences, 4.1TB) using MPI + GPU build #994

hughuang12 commented May 9, 2025

martin-steinegger commented May 16, 2025

Uh oh!

MMseqs2 hangs at prefilter stage with large database (1B sequences, 4.1TB) using MPI + GPU build #994

MMseqs2 hangs at prefilter stage with large database (1B sequences, 4.1TB) using MPI + GPU build #994

Comments

hughuang12 commented May 9, 2025

martin-steinegger commented May 16, 2025

Uh oh!