
BreakMeshByBlock robustness #12033

Closed
1 of 2 tasks
jiangwen84 opened this issue Aug 23, 2018 · 114 comments
Labels
P: normal A defect affecting operation with a low possibility of significant impact.

Comments

@jiangwen84
Contributor

jiangwen84 commented Aug 23, 2018

Rationale

There are two main issues with existing BreakMeshByBlock

  • the interface sideset is not assigned correctly.
  • the connectivity of the elements with new nodes is not correct -> lindsayad comment: I don't believe we can ever get this correct, since in a strict sense the former neighbors are no longer neighbors after node replacement

Description

Resolve these issues in BreakMeshByBlock.

Impact

Improvements and bug fixes.

@lindsayad lindsayad changed the title Connectivity and assigning interface boundary issues in BreakMeshByBlock BreakMeshByBlock robustness Oct 22, 2019
@lindsayad
Member

It seems that BreakMeshByBlock(Generator) suffers from a fair number of robustness issues. Based on @arovinelli's comment here it does not work with recover (at least when using InterfaceKernels?) nor with DistributedMesh. Additionally, according to the test suite it doesn't work in parallel in debug mode either. It would be nice if we could make this object a bit more robust such that it can work in all the traditional MOOSE scenarios.

@lindsayad lindsayad reopened this Oct 22, 2019
lindsayad added a commit to lindsayad/moose that referenced this issue Oct 22, 2019
When run in parallel in debug mode these split mesh tests
fail assertions in libMesh, specifically some node_touched
assertions. BreakMeshByBlockGenerator needs to be made more
robust in the future

Refs idaholab#14124 idaholab#12033
@arovinelli
Contributor

@lindsayad making it work with distributed mesh should be doable, even if a bit cumbersome. For the restart, I believe the major problem is that the mesh connectivity is not saved there, and it might be difficult to reconstruct it when the mesh is already split (if we have large displacements between the two blocks it is going to be impossible). Any idea on how to solve this?


@permcody
Member

permcody commented Nov 7, 2019

Closing this specific issue based on the merged PRs listed.

@permcody permcody closed this as completed Nov 7, 2019
@lindsayad
Member

Err, I don't think this should be closed; #14216 specifically references this issue because it is still an issue.

@lindsayad lindsayad reopened this Nov 7, 2019
@arovinelli
Contributor

@lindsayad and @permcody I need to run some big jobs on a cluster, so I need a fix, at the least to enable the distributed mesh option. I guess the problem here is syncing the different MPI processes so they add nodes and update elements simultaneously. Can you point me to an example doing something similar that I could follow, to simplify my work? Of course, if I can fix some of the issues, this could be merged.
Thanks again

@permcody
Member

@arovinelli - You can look around at the other generators and see which ones have been updated. It can be pretty complex.

Here's a better idea. Instead of trying to get it working with DistributedMesh, I recommend that you just use the "split mesh" capability in MOOSE. The short explanation is that you run a job with a few extra CLI options that will just run the mesh steps (reading it in, generating it, applying transformations, etc) and will write it back out as separate files ready to run for a larger distributed run. The nice part is when you do this, the memory used by your larger run and the startup time will be drastically reduced. Take a look here:

https://www.mooseframework.org/syntax/Mesh/splitting.html
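For example, the workflow looks roughly like this (a command sketch, not verified here; the binary name is whatever your application is called, and the flags are the ones described on the linked splitting page):

```shell
# Run only the mesh setup steps (read, generate, transform) and write
# pre-split files ready for a 16-processor run
./myapp-opt -i input.i --split-mesh 16 --split-file mesh_split

# Later, run the real job against the pre-split mesh
mpiexec -n 16 ./myapp-opt -i input.i --use-split --split-file mesh_split
```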

@arovinelli
Contributor

@permcody thanks for the reply, this is good to know.
One question: would the file written from the mesh split contain the face-face neighbor information, or will it be reconstructed later? If it is reconstructed later, then I need a way to reassign the correct neighbors.

@permcody
Member

face-face neighbor information? I assume you mean the new internal boundaries you are trying to add?

The splitter system is designed to run all mesh setup tasks including running RelationshipManagers. So if you have Generators that add boundaries that information will be saved. If you have objects that require a "wider stencil", those ghost elements should be written. Ideally everything you need should be preserved.

@arovinelli
Contributor

@permcody by face-face neighbor information I mean the elem->face->neighbor_elem information.
BreakMeshByBlockGenerator adds nodes so that neighboring elements on two different blocks no longer share the same nodes. So the mesh is in fact broken, but the elem->face->neighbor_elem information is still there. If the elem->face->neighbor_elem info is not saved by the mesh split, I need to reconstruct it, because otherwise the interface will not work anymore.

arovinelli pushed a commit to arovinelli/moose that referenced this issue Nov 19, 2019
@arovinelli
Contributor

@permcody and @lindsayad
so I tried to use the split mesh tool; however, when trying to use it, things break.
This is the output:

*** Error in `../../../../tensor_mechanics-opt': malloc(): memory corruption (fast): 0x0000000000f35f10 ***
*** Error in `../../../../tensor_mechanics-opt': malloc(): memory corruption (fast): 0x000000000228aff0 ***
*** Error in `../../../../tensor_mechanics-opt': malloc(): memory corruption (fast): 0x00000000024d4270 ***

@permcody
Member

permcody commented Nov 19, 2019 via email

@arovinelli
Contributor

@permcody it will not, as:

BreakMeshByBlockGenerator::BreakMeshByBlockGenerator(const InputParameters & parameters)
  : BreakMeshByBlockGeneratorBase(parameters), _input(getMesh("input"))
{
  if (typeid(_input).name() == typeid(DistributedMesh).name())
    mooseError("BreakMeshByBlockGenerator only works with ReplicatedMesh.");
}

@permcody
Member

permcody commented Nov 19, 2019 via email

@arovinelli
Contributor

@permcody thanks for the reply.
I believe it would be much easier going the other way around: start from a disconnected mesh and join coincident faces (e.g. the ones having nodes sharing the same locations). I'll work on that.

@permcody
Member

I believe it would be much easier going the other way around: start from a disconnected mesh and join coincident faces (e.g. the ones having nodes sharing the same locations).

It shouldn't be more difficult one way or the other. When working with a DistributedMesh in the MeshGenerator system, you'll have access to the full mesh on each processor (well in almost all cases). The way you deal with DistributedMesh is that you inspect the "processor_id" on each element to know whether the current processor owns it or not.

@arovinelli
Contributor

Ok, I didn't know that. I have a couple of questions:

  1. When adding a new node, which processor should be in charge of adding it? The processor owning the element to which the node will belong?
  2. Also, once a node has been added, how will I broadcast this information to all the processors?


@permcody
Member

During the generation phase you want the mesh to be consistent on all processors. All processors will have to add the same Node with the same IDs. If anything gets out of sync, you'll run into problems. I should say that it is possible to generate a DistributedMesh where each processor truly works with only their portion of the mesh, but that's very advanced and we aren't doing that in more than maybe one place in MOOSE.

If you need to perform parallel communication you may do so. However, hopefully since you are doing the same work on all processors you won't need to worry about this step.

@arovinelli
Contributor

If all processors have access to the whole mesh and all processors must do the same thing, why should I be worried about whether a node or element is remote or local? Are there any caveats for remote nodes and elements?


@permcody
Member

If all processors have access to the whole mesh and all processors must do the same thing, why should I be worried about whether a node or element is remote or local? Are there any caveats for remote nodes and elements?

I'll admit that I'm not sure we've fleshed out every single possible case when creating a tree of generators. We are still polishing up this system and trying to create documentation and guidance for everyone to use. My concern is that you might generate a sequence of generators A -> B -> C. It might turn out that you need to call the "prepare_for_use()" method on the Mesh between the stages of the generator because you need to call certain "exploration methods" like retrieving neighboring elements of elements, etc. I'm not an expert at what you can and can't do but many methods aren't available for use until you've done an intermediate prepare. However, with DistributedMesh once you do that, I believe (again need to test and verify) that libMesh will go ahead and throw away remote elements.

Now you get into one of the later stages and you no longer have all of the elements available on all processors. Now if you plan to add new elements/nodes everything gets significantly more complicated. You no longer need to do the same work on all ranks, but you likely can't just do the work on one rank either. An example would be where you were going to add new elements on a free surface right on a processor boundary. Multiple processors would need to work together to add a new element due to ghosting.

For now, we've kind of glossed over some of these really nasty (literal) edge cases and just considered the more normal case of having the full mesh available on all ranks during the generation phase. That'll satisfy nearly every case and, if you are willing to work with Split Mesh, perhaps 100% of cases. Yeah, we are figuring this all out as we go, too. It's all pretty new and you are once again working on the bleeding edge.

For now, I would recommend that you assume you have all information on all ranks and just do the same work everywhere. In the future we will continue to think about how to make this easier for our developers and give them the tools they need to deal with truly Distributed cases.

@lindsayad
Member

However, with DistributedMesh once you do that, I believe (again need to test and verify) that libMesh will go ahead and throw away remote elements.

By default it will, but there is a method to stop this from happening: MeshBase::allow_remote_element_removal

@permcody
Member

By default it will, but there is a method to stop this from happening: MeshBase::allow_remote_element_removal

True, but is this something we want to always turn on during MeshGeneration? Perhaps, but maybe not. These are design decisions that just haven't been made.

arovinelli pushed a commit to arovinelli/moose that referenced this issue Dec 13, 2019
arovinelli pushed a commit to arovinelli/moose that referenced this issue Dec 13, 2019
arovinelli pushed a commit to arovinelli/moose that referenced this issue Dec 13, 2019
still not Distributed
arovinelli pushed a commit to arovinelli/moose that referenced this issue Dec 18, 2019
arovinelli pushed a commit to arovinelli/moose that referenced this issue Dec 18, 2019
arovinelli pushed a commit to arovinelli/moose that referenced this issue Dec 18, 2019
still not Distributed
arovinelli pushed a commit to arovinelli/moose that referenced this issue Jan 6, 2020
arovinelli pushed a commit to arovinelli/moose that referenced this issue Jan 6, 2020
still not Distributed
@lindsayad
Member

The documentation says "Only [manually set id] in parallel if you are manually keeping ids consistent".

@roystgnr where is this documentation?

lindsayad pushed a commit to arovinelli/moose that referenced this issue Mar 18, 2020
lindsayad pushed a commit to arovinelli/moose that referenced this issue Mar 18, 2020
still not Distributed
lindsayad added a commit to lindsayad/libmesh that referenced this issue Mar 19, 2020
Compare with centroids instead of global node ids. This is extremely
useful when comparing side elements that are coincident but may
not actually share the same nodes. See idaholab/moose#12033 for
application and libMesh#2362 for more inspiration. This probably won't
work but let's give it a shot
lindsayad pushed a commit to lindsayad/moose that referenced this issue Mar 20, 2020
lindsayad pushed a commit to lindsayad/moose that referenced this issue Mar 20, 2020
still not Distributed
arovinelli pushed a commit to arovinelli/moose that referenced this issue Jun 1, 2020
@lindsayad
Member

lindsayad commented Jun 23, 2020

The documentation says "Only [manually set id] in parallel if you are manually keeping ids consistent".

@roystgnr where is the documentation guiding when to manually set ids?

@roystgnr
Contributor

It's in the doxygen comments for MeshBase::add_point and MeshBase::add_elem

@roystgnr
Contributor

But that's when; I'm not sure where we've got proper documentation as to how. I remember writing it out at least once, but that might have been on libmesh-users or libmesh-devel.

@lindsayad
Member

lindsayad commented Jun 24, 2020 via email

@aeslaughter aeslaughter added the P: normal A defect affecting operation with a low possibility of significantly affects. label Apr 12, 2021
@lindsayad
Member

@jiangwen84 can you comment or put an 'x' on whether the first bullet in your original post has been fixed? I want to close this issue in favor of a couple of more specific, newer issues, but first I want to know what we've already addressed.

@jiangwen84
Contributor Author

Yes, I fixed that.

@lindsayad
Member

Great, thanks!

@lindsayad
Member

Closing in favor of the more specific issue #21154. I think an issue has gotten too long when GitHub forces us to expand comments.
