charm AT lists.siebelschool.illinois.edu
Subject: Charm++ parallel programming system
List archive
- From: Jozsef Bakosi <jbakosi AT gmail.com>
- To: Ronak Buch <rabuch2 AT illinois.edu>
- Cc: "charm AT cs.uiuc.edu" <charm AT cs.uiuc.edu>, rspavel AT lanl.gov
- Subject: Re: [charm] Parallel read
- Date: Wed, 22 Feb 2017 14:48:37 -0700
Hi Ronak,
We would like to give the existing parallel read of ckio a try.
The way I see it at this point, our options are as follows:
1. From a chare group, do a parallel read in which not all group elements participate (how many, e.g., a percentage, controlled by the user),
2. Do the same as 1, but from a node group, using a similar user-adjustable parameter specifying the number of nodes that should participate in the read.
3. Use ckio's read.
Of these three options, I assume option 3 would also be able to overlap communication and computation, so ultimately we would probably like that best.. Also as I see, the above is also in the order of increasing complexity, but potentially, also in the order of larger return over investment.
I think we would like to give ckio's reader a try just to learn more about option 3 and to be able to better scope out what we can get out of it for our purposes using a small test application.
Could you please merge master into that branch so we can give it a try and let us know when we can try it?
Thanks,
Jozsef
On Sat, Feb 11, 2017 at 6:13 PM, Ronak Buch <rabuch2 AT illinois.edu> wrote:
RonakHi Jozsef,There was some work on adding read capabilities to CkIO a while back, it's on the gerrit/rohan/ckioread branch of charm. The reading facility added there should work, but nobody has been using it in a production applications, so it's out of date (and it hasn't been rebased on a long time). Now that there's some interest in using parallel file reading, I'll update the branch and do some performance tests and get back to you.Otherwise, a node group level file I/O scheme should work well in the meantime. Your sketch on how things should be distributed makes sense to me.Thanks,On Fri, Feb 10, 2017 at 9:37 PM, Jozsef Bakosi <jbakosi AT gmail.com> wrote:Hi folks,
Currently, I have file/PE-style read of a mesh that is in a single file and about a few tens of GB containing a few hundred million cells and their node coordinates. This is okay but does not scale very well beyond a few thousand cores.
I have read the paper on ckio from 2011 and looked at the source and it is definitely something I would like to explore for writing checkpoints in the future, but I wonder what my options are for input. ckio does not seem to have a capability at this point for reading.
Another option that seems appealing is to use Charm++'s node groups and/or the "Physical Node API" for simply replacing my file/PE read with a node/PE read and then after reading my input file in that fashion in a second step I would distribute the data to group and/or chare array elements.
What do you suggest? At this point I am interested in expl oring both quick/dirty as well as good/longer-term solutions.
Thanks,Jozsef
- [charm] Parallel read, Jozsef Bakosi, 02/10/2017
- <Possible follow-up(s)>
- Re: [charm] Parallel read, Ronak Buch, 02/11/2017
- Re: [charm] Parallel read, Jozsef Bakosi, 02/22/2017
- Message not available
- Re: [charm] Parallel read, Ronak Buch, 02/24/2017
Archive powered by MHonArc 2.6.19.