[BioBrew Users] Unable to get mpiblast running
Bastian Friedrich
bastian at bastian-friedrich.de
Thu Apr 6 17:27:34 EDT 2006
Hi Glen,
thank you for your quick response.
On Thursday 06 April 2006 15:53, Glen Otero wrote:
>
> I usually don't see these types of errors. Here are a few questions:
>
> How did you format the database for mpiblast?
/usr/local/bin/mpiformatdb --nfrags=28 -i Hs.seq.uniq
was the latest call, but I had used
mpiformatdb -N 28 -i Hs.seq.uniq
earlier.
> Is the mpiblast database on a shared filesystem, like NFS (I don't
> think symlinks will work)?
Currently, I have created a /export/data/blastdb/ on the frontend; this
was rsynced to /state/partition1/blastdb on the compute nodes; on the
frontend, I had a directory /state/partition1 (on the root
partition...) containing a symlink to /export/data/blastdb.
I have just used a bind mount on the frontend (no more symlinking), but
this was not successful, either.
The first tests were done via NFS, which did not work either.
> How did you launch the job, SGE?
In the future, we surely want to use mpiblast in an SGE environment;
currently, it was started from the command line.
> Can you try a smaller job using just the 6 compute nodes (and
> formatting the db into 6 pieces)?
Wow, I get a new one now:
===============================
bastian at frontend:/state/partition1/blastdb> mpiformatdb -N 6 -i
Hs.seq.uniq
[...]
[... semi-manual distributing data to /state/partiton1/blastdb of all
nodes ...]
bastian at frontend:/state/partition1/blastdb> cd ~/tmp03
bastian at frontend:~/tmp03> /opt/mpich/gnu/sbin/cleanipcs
bastian at frontend:~/tmp03> cluster-fork /opt/mpich/gnu/sbin/cleanipcs
[...]
bastian at frontend:~/tmp03> mpirun -np 6 /usr/local/bin/mpiblast -p blastn
-d Hs.seq.uniq -i IL2RA -o blast_results
54p3_2934: p4_error: : 0
3 0.078125 Bailing out with signal 11
[3] MPI Abort by user Aborting program !
[3] Aborting program!
2p1_28697: p4_error: interrupt SIGx: 13
p5_17962: p4_error: : 0
0.0742188 Bailing out with signal 11
[5] MPI Abort by user Aborting program !
[5] Aborting program!
p4_21219: p4_error: : 0
rm_l_4_21279: (0.367188) net_send: could not write to fd=5, errno = 104
0.078125 Bailing out with signal 11
[4] MPI Abort by user Aborting program !
[4] Aborting program!
p2_13443: p4_error: : 0
0.078125 Bailing out with signal 11
[2] MPI Abort by user Aborting program !
[2] Aborting program!
rm_l_3_2994: (0.644531) net_send: could not write to fd=5, errno = 104
p1_28697: (7.242188) net_send: could not write to fd=5, errno = 32
rm_l_2_13503: (6.929688) net_send: could not write to fd=5, errno = 104
p2_13443: (6.929688) net_send: could not write to fd=5, errno = 32
p5_17962: (6.093750) net_send: could not write to fd=5, errno = 32
===============================
Signal 11 seems to be a segfault? Something's going awfully wrong
here...
> Can you try a smaller blast job using p53, p53db from ftp://
> ftp.bioinformatics.org/pub/biobrew/ and blastp?
This works! :)) The first time I see mpiblast actually working :)
Unfortunately, we are looking forward to blasting against the 17 GB
genebank... Any more ideas?
Thx again,
Bastian
--
Bastian Friedrich bastian at bastian-friedrich.de
Adress & Fon available on my HP http://www.bastian-friedrich.de/
\~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~\
\ Computers make very fast, very accurate mistakes.
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 189 bytes
Desc: not available
Url : http://bioinformatics.org/pipermail/biobrew-users/attachments/20060406/6f4156f9/attachment.bin
More information about the BioBrew-Users
mailing list