charm AT lists.siebelschool.illinois.edu
Subject: Charm++ parallel programming system
List archive
- From: Marcin Mielniczuk <marmistrz.dev AT zoho.eu>
- To: Sam White <white67 AT illinois.edu>
- Cc: "charm AT lists.cs.illinois.edu" <charm AT lists.cs.illinois.edu>
- Subject: Re: [charm] How to verify that AMPI load balancing works?
- Date: Mon, 3 Jun 2019 18:50:06 +0200
Yes, with AMPI_INFO_LB_SYNC the migration is attempted, but the program crashes.

What exactly do you mean by a synchronization point? Do you mean that there are no asynchronous/immediate MPI calls pending, or that I should manually call MPI_Barrier to make sure that all local variables on all nodes are synchronized?

I suppose that the crash may be related to ASLR. charmrun seems to make use of setarch to disable ASLR, and I know there's +isomalloc_sync, but I can't make use of the latter because I don't have a shared filesystem. I'm a little confused about why both exist; setarch should be enough.

Btw, for some reason I receive every message in this thread twice.

Regards,

On 03.06.2019 18:14, Sam White wrote:
Yes, AMPI_Register_just_migrated will work for that.
Which version of AMPI are you using, v6.9.0 or the current
master branch? Can you try using the AMPI_INFO_LB_SYNC info
rather than ASYNC for the call to AMPI_Migrate() and re-running?
The ASYNC mode is not tested and is more experimental. If you
still don't see migration happening as expected I'd be
interested in seeing the output with +LBDebug.
Thank you,
Sam

On Mon, Jun 3, 2019 at 10:54 AM Marcin Mielniczuk <marmistrz.dev AT zoho.eu> wrote:
Hi Sam,

I understand that AMPI_Register_just_migrated is the proper way to do 4? If so, then I confirm that the migration only happens once, on the first call to AMPI_Migrate.

I do call AMPI_Migrate multiple times. This can be seen by looking at the "trying to migrate" lines in the stdout and at the following lines in the source code: https://github.com/marmistrz/heat_solver/blob/master/main.cpp#L286-L291

Regards,
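
For reference, the call pattern discussed in this thread (periodic AMPI_Migrate with AMPI_INFO_LB_SYNC, plus a callback registered through AMPI_Register_just_migrated to confirm that a rank actually moved) might be sketched roughly as below. This is not the code from heat_solver. The helper names is_lb_step, setup_migration_logging and maybe_migrate are hypothetical, and so is the choice of a fixed iteration period. The sketch also assumes that AMPI's mpi.h defines the AMPI preprocessor symbol, that the callback type is a plain void function, and that AMPI_Migrate is collective, so every rank has to reach the call in the same iteration. The AMPI calls are guarded so that the file still compiles against plain MPI or with no MPI at all.

```cpp
#include <cassert>
#include <cstdio>

// Include mpi.h only if one is available, so the sketch compiles without MPI too.
#if defined(__has_include)
#  if __has_include(<mpi.h>)
#    include <mpi.h>
#  endif
#endif

// Hypothetical helper: true on every `period`-th iteration, skipping iteration 0.
inline bool is_lb_step(int iter, int period) {
    return period > 0 && iter > 0 && iter % period == 0;
}

#ifdef AMPI  // assumption: defined by AMPI's mpi.h
// Callback invoked on a rank after it has been migrated; it only logs the event.
static void on_just_migrated() {
    int rank = -1;
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    std::printf("rank %d: just migrated\n", rank);
}
#endif

// Hypothetical one-time setup, to be called once after MPI_Init.
inline void setup_migration_logging() {
#ifdef AMPI
    AMPI_Register_just_migrated(on_just_migrated);
#endif
}

// Hypothetical wrapper: every rank calls this once per iteration with the same
// arguments, so all ranks reach AMPI_Migrate together (assumed to be collective).
// Returns true if this iteration was a load-balancing step.
inline bool maybe_migrate(int iter, int period) {
    assert(iter >= 0);
    if (!is_lb_step(iter, period)) return false;
#ifdef AMPI
    AMPI_Migrate(AMPI_INFO_LB_SYNC);  // synchronous mode, as suggested in the thread
#endif
    return true;
}
```

In a solver loop one would call setup_migration_logging() once and then maybe_migrate(iter, period) at the end of each iteration. If the "just migrated" line appears only after the first load-balancing step, that matches the behaviour reported above.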