
charm - Re: [charm] disable hwloc?

charm AT lists.siebelschool.illinois.edu

Subject: Charm++ parallel programming system

  • From: Jozsef Bakosi <jbakosi AT lanl.gov>
  • To: Evan Ramos <evan AT hpccharm.com>
  • Cc: charm <charm AT lists.cs.illinois.edu>
  • Subject: Re: [charm] disable hwloc?
  • Date: Tue, 1 May 2018 23:31:16 -0600

Hi Evan,

With the patch below applied, linking succeeds, but I now get a segfault at
runtime (running in serial):

(gdb) run
Starting program: /home/quinoa/quinoa/build/Main/meshconv Main/meshconv
[New LWP 18798]

Thread 1 "meshconv" received signal SIGSEGV, Segmentation fault.
0x00007ffff79b4c10 in opal_hwloc191_hwloc_components_init ()
(gdb) where
#0 0x00007ffff79b4c10 in opal_hwloc191_hwloc_components_init ()
#1 0x00007ffff79a0077 in opal_hwloc191_hwloc_topology_init ()
#2 0x00007ffff7983174 in opal_hwloc_base_get_topology ()
#3 0x00007ffff774b686 in ompi_mpi_init ()
#4 0x00007ffff7760660 in PMPI_Init_thread ()
#5 0x00007ffff76beeee in LrtsInit (argc=0x7fffffffebdc, argv=0x7fffffffebd0,
   numNodes=0x7ffff7feb398 <_Cmi_numnodes>, myNodeID=0x7ffff7feb2e0 <_Cmi_mynode>)
   at machine.c:1440
#6 0x00007ffff76bd130 in ConverseInit (argc=2, argv=0x7fffffffeca8,
   fn=0x7ffff75c4b0b <_initCharm(int, char**)>, usched=0, initret=0)
   at machine-common-core.c:1286
#7 0x00007ffff75c2a11 in main (argc=2, argv=0x7fffffffeca8) at main.C:9

Thanks for looking into this,
Jozsef

On 05.01.2018 18:07, Evan Ramos wrote:
> It is not possible to disable hwloc, since we rely on it to query
> hardware topology and set affinities. We also cannot rely on whatever
> version may be linked into OpenMPI due to potential mismatches with
> our code. However, it looks like this issue may have a simple fix.
> Could you test this change:
>
>
> diff --git a/contrib/hwloc/include/hwloc/rename.h b/contrib/hwloc/include/hwloc/rename.h
> index 9a0c5fae5..39660f4d3 100644
> --- a/contrib/hwloc/include/hwloc/rename.h
> +++ b/contrib/hwloc/include/hwloc/rename.h
> @@ -489,6 +489,8 @@ extern "C" {
> #define hwloc_component_type_t HWLOC_NAME(component_type_t)
> #define hwloc_component HWLOC_NAME(component)
>
> +#define hwloc_linux_component HWLOC_NAME(linux_component)
> +
> #define hwloc_plugin_check_namespace HWLOC_NAME(plugin_check_namespace)
>
> #define hwloc_insert_object_by_cpuset HWLOC_NAME(insert_object_by_cpuset)
>
>
> If this resolves the issue, I will fix it in our tree and report it
> upstream so that this commit can be partially reverted:
> https://github.com/open-mpi/hwloc/commit/93abf09fee121c55b99f578d62e3ea21decdfbed
>
> Regards,
> --
> Evan A. Ramos
> Software Engineer
> Charmworks, Inc.
>
>
> On Tue, May 1, 2018 at 10:56 AM, Jozsef Bakosi <jbakosi AT lanl.gov> wrote:
> > Hi folks,
> >
> > Is it possible to disable hwloc in Charm++? I'm getting:
> >
> > /opt/openmpi/lib/libopen-pal.a(topology-linux.o):(.data.rel.ro.local+0x40): multiple definition of `hwloc_linux_component'
> > <charm-install-dir>/charm/bin/../lib/libhwloc_embedded.a(topology-linux.o):(.data.rel.ro.local+0x0): first defined here
> >
> > Thanks,
> > Jozsef


