[Pkg-openmpi-maintainers] Bug#790540: /usr/lib/libmpi.so.1: Segfault in MPI_Init for simplest MPI program
Tom Epperly
tepperly at gmail.com
Tue Jun 30 01:32:27 UTC 2015
Package: libopenmpi1.6
Version: 1.6.5-10
Severity: normal
File: /usr/lib/libmpi.so.1
Dear Maintainer,
* What led up to the situation?
I was trying to install QBox http://eslab.ucdavis.edu/software/qbox/index.htm.
I compiled it, but it segfaults when I run it with mpirun -np 1 ./qb
qb needs a variety of packages that require MPI, so openmpi was installed
using apt-get by installing the dependencies for QBox.
Oddly enough, QBox doesn't segfault when I run it without mpirun.
I made a simple program to reproduce the behavior.
# cat mpitest.c
#include <mpi.h>
int
main(int argc, char **argv)
{
MPI_Init(&argc, &argv);
MPI_Finalize();
}
* What exactly did you do (or not do) that was effective (or
ineffective)?
Here are the steps needed to reproduce the behavior.
faerun:/tmp# mpicc -o mpitest mpitest.c
faerun:/tmp# mpirun -np 1 /tmp/mpitest
[faerun:20716] *** Process received signal ***
[faerun:20716] Signal: Segmentation fault (11)
[faerun:20716] Signal code: (128)
[faerun:20716] Failing at address: (nil)
[faerun:20716] [ 0] /lib/x86_64-linux-gnu/libpthread.so.0(+0xf8d0) [0x7f7b39bef8d0]
[faerun:20716] [ 1] /usr/lib/x86_64-linux-gnu/libhwloc.so.5(+0x244c5) [0x7f7b39e244c5]
[faerun:20716] [ 2] /usr/lib/x86_64-linux-gnu/libhwloc.so.5(+0xc1c7) [0x7f7b39e0c1c7]
[faerun:20716] [ 3] /usr/lib/x86_64-linux-gnu/libhwloc.so.5(hwloc_topology_load+0x1ab) [0x7f7b39e0d15a]
[faerun:20716] [ 4] /usr/lib/libmpi.so.1(opal_hwloc_unpack+0xc7) [0x7f7b3a34e107]
[faerun:20716] [ 5] /usr/lib/libmpi.so.1(opal_dss_unpack_buffer+0x6e) [0x7f7b3a3497ce]
[faerun:20716] [ 6] /usr/lib/libmpi.so.1(opal_dss_unpack+0xae) [0x7f7b3a34a6de]
[faerun:20716] [ 7] /usr/lib/libmpi.so.1(orte_util_nidmap_init+0x2b1) [0x7f7b3a315591]
[faerun:20716] [ 8] /usr/lib/openmpi/lib/openmpi/mca_ess_env.so(+0x1b29) [0x7f7b380b9b29]
[faerun:20716] [ 9] /usr/lib/libmpi.so.1(orte_init+0x193) [0x7f7b3a2fdc63]
[faerun:20716] [10] /usr/lib/libmpi.so.1(ompi_mpi_init+0x166) [0x7f7b3a2b81d6]
[faerun:20716] [11] /usr/lib/libmpi.so.1(MPI_Init+0x16a) [0x7f7b3a2d095a]
[faerun:20716] [12] /tmp/mpitest() [0x400708]
[faerun:20716] [13] /lib/x86_64-linux-gnu/libc.so.6(__libc_start_main+0xf5) [0x7f7b39851b45]
[faerun:20716] [14] /tmp/mpitest() [0x400619]
[faerun:20716] *** End of error message ***
--------------------------------------------------------------------------
mpirun noticed that process rank 0 with PID 20716 on node faerun exited on signal 11 (Segmentation fault).
--------------------------------------------------------------------------
faerun:/tmp# /tmp/mpitest
faerun:/tmp#
* What was the outcome of this action?
Note the segfault listed above.
* What outcome did you expect instead?
I expected mpitest to run without segfaulting. Note, running mpitest
without mpirun doesn't segfault. Using mpirun should allow me to run
using multiple threads.
faerun:/tmp# dpkg --list '*openmpi*'
Desired=Unknown/Install/Remove/Purge/Hold
| Status=Not/Inst/Conf-files/Unpacked/halF-conf/Half-inst/trig-aWait/Trig-pend
|/ Err?=(none)/Reinst-required (Status,Err: uppercase=bad)
||/ Name Version Architecture Description
+++-=====================-===============-===============-===============================================
ii libblacs-openmpi1 1.1-33 amd64 Basic Linear Algebra Comm. Subprograms - Shared
ii libopenmpi-dev 1.6.5-10 amd64 high performance message passing library -- hea
un libopenmpi1 <none> <none> (no description available)
rc libopenmpi1.3 1.4.5-1 amd64 high performance message passing library -- sha
ii libopenmpi1.6 1.6.5-10 amd64 high performance message passing library -- sha
un libopenmpi2 <none> <none> (no description available)
ii libscalapack-openmpi1 1.8.0-12.1 amd64 Scalable Linear Algebra Package - Shared libs.
un octave-openmpi-ext <none> <none> (no description available)
ii openmpi-bin 1.6.5-10 amd64 high performance message passing library -- bin
ii openmpi-checkpoint 1.6.5-10 amd64 high performance message passing library -- che
ii openmpi-common 1.6.5-10 all high performance message passing library -- com
un openmpi-dev <none> <none> (no description available)
un openmpi-libs0 <none> <none> (no description available)
un yorick-mpy-openmpi <none> <none> (no description available)
-- System Information:
Debian Release: stretch/sid
APT prefers unstable
APT policy: (500, 'unstable'), (1, 'experimental')
Architecture: amd64 (x86_64)
Kernel: Linux 3.10-2-amd64 (SMP w/4 CPU cores)
Locale: LANG=en_US.UTF-8, LC_CTYPE=en_US.UTF-8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/bash
Init: systemd (via /run/systemd/system)
Versions of packages libopenmpi1.6 depends on:
ii libc6 2.19-18
ii libcr0 0.8.5-2.2
ii libgcc1 1:5.1.1-9
ii libgfortran3 5.1.1-9
ii libhwloc5 1.11.0-1
ii libibverbs1 1.1.8-1.1
ii libltdl7 2.4.2-1.11
ii libquadmath0 5.1.1-9
ii libstdc++6 5.1.1-9
libopenmpi1.6 recommends no packages.
libopenmpi1.6 suggests no packages.
-- no debconf information
More information about the Pkg-openmpi-maintainers
mailing list