[Vulkan] Track image layout internally #5597

PENGUINLIONG · 2022-08-02T05:02:43Z

For a same reason as discussed in #5540 (comment) this PR propose a mechanism to track image layouts internally:

When an image is created or imported, it's initial state is always VK_IMAGE_LAYOUT_UNDEFINED; VulkanDevice tracks a global state for each image;
When the user inserts image barrier by VulkanCommandList::image_transition, the command list tracks the initial layout of the image when it's first referred by any command, and a pending image layout that will be realized after the execution of the command list.
When the user submits a command list, the initial image layout records are compared with the current records of VulkanDevice; errors are raised if those records mismatch, because a previously submitted command list has invalidated the layouts assumed by the current list.
If the layout records perfectly match, the command list is submitted and tracked layouts in VulkanDevice are updated with pending layouts in VulkanCommandList.

netlify · 2022-08-02T05:02:50Z

✅ Deploy Preview for docsite-preview canceled.

Name	Link
🔨 Latest commit	`e199ec5`
🔍 Latest deploy log	https://app.netlify.com/sites/docsite-preview/deploys/62f341e944d71700086aea55

bobcao3 · 2022-08-03T07:03:42Z

Why do you need to track the layout?

bobcao3 · 2022-08-03T07:06:07Z

For the user, you do not need to know about the layout, you can always assume they are in "undefined"

A layout_transition(undefined, anything) is always valid. It might be sub-optimal because the underlying access flags and stages can not be specified as detailed as possible with a undefined initial layout, but no one bothers going into that kinds of detail.

Moreover, on desktop GPUs, layouts are not a thing. They are purely here for tiled GPUs such as mobile chips. A layout transition on desktop is simply a memory barrier, so it doesn't matter which layout you specify.

---- edit here:

For mobile it does matter... But this is not the correct way to track layout due to the problem we will have with ordering that I mentioned in the next post. You only know all the layouts when the user submit the command buffer, at which point you can't really modify the commands you have recorded. Thus it is better to just establish a consistent convention in terms of API, where when the user provide the image to the system, it will need to assume Taichi takes full control of that image until execution completes. The user might need to specify a final layout as well

If the tracking is really just for validation purposes (where an incorrect initial layout results in an error), then we should let Vulkan validation layer to do its job. It doesn't really simplify the API in any way.

bobcao3 · 2022-08-03T07:09:56Z

Furthermore, this tracking is flawed. We have async compute queues in Vulkan, and we can and do submit multiple command buffers instead of just one big one for the entire frame. If other command buffers have their layout transition run first, it will simply break this and be invalid. The order we record the command list is not guaranteed to be the same order when it's executed on the GPU.

PENGUINLIONG · 2022-08-03T07:51:20Z

For the user, you do not need to know about the layout, you can always assume they are in "undefined"

A layout_transition(undefined, anything) is always valid. It might be sub-optimal because the underlying access flags and stages can not be specified as detailed as possible with a undefined initial layout, but no one bothers going into that kinds of detail.

Moreover, on desktop GPUs, layouts are not a thing. They are purely here for tiled GPUs such as mobile chips. A layout transition on desktop is simply a memory barrier, so it doesn't matter which layout you specify.

I understand your concerns but your reasoning is only valid when the images are always write-only, so there is no need to preserve existing data during layout transition. The image layout actually cannot be assumed undefined, as described in the Vulkan specification.

VK_IMAGE_LAYOUT_UNDEFINED specifies that the layout is unknown. [...] This layout can be used in place of the current image layout in a layout transition, but doing so will cause the contents of the image’s memory to be undefined.

Logically, regardless of any specific implementation, the layout transition involves an inplace R/W of the existing texel data to relocate the texels in memory to serve certain needs of hardware design. So oldLayout guides the driver and the device to collect data from memory correctly. It might not be a problem on the devices we have tested on but we will have to pay for the divergence from the specification sooner or later. We won't be able to preserve memory data if oldLayout is always undefined. We certainly can give up tracking layouts, but that's only possible if images are always in VK_IMAGE_LAYOUT_GENERAL, and there is a price for it in terms of performance.

Furthermore, this tracking is flawed. We have async compute queues in Vulkan, and we can and do submit multiple command buffers instead of just one big one for the entire frame. If other command buffers have their layout transition run first, it will simply break this and be invalid. The order we record the command list is not guaranteed to be the same order when it's executed on the GPU.

I do have considered this in the design of the tracking mechanism; and that's why I proposed double tracking on both Device and CommandList levels. The device records are only updated after a successful submit, and command lists can only be successfully submitted if the initial layouts assumed by them perfectly matches the device records. If we are going to support multi-queues, which is not yet implemented AFAIK, I have integrated event primitives to synchronize different queues for on-device executions, if needed. If "using VK_IMAGE_LAYOUT_UNDEFINED as source layout all the time" is already not conformant to the specification, I don't think it's different from tracking a wrong layout.

PENGUINLIONG · 2022-08-05T06:09:42Z

So I have adapted the impl to track image layouts on a runtime level. @bobcao3 , any comments?

taichi/runtime/gfx/runtime.cpp

taichi/rhi/vulkan/vulkan_device.h

taichi/rhi/vulkan/vulkan_device.cpp

bobcao3

LGTM

PENGUINLIONG · 2022-08-08T02:22:21Z

/rebase

PENGUINLIONG · 2022-08-10T02:10:22Z

/rebase

PENGUINLIONG · 2022-08-10T05:27:28Z

/rebase

for more information, see https://pre-commit.ci

PENGUINLIONG requested review from bobcao3, ailzhang and k-ye August 2, 2022 05:54

PENGUINLIONG force-pushed the vk-image-layout branch 2 times, most recently from aa8f824 to 5b8995c Compare August 5, 2022 06:07

bobcao3 reviewed Aug 6, 2022

View reviewed changes

taichi/runtime/gfx/runtime.cpp Outdated Show resolved Hide resolved

bobcao3 reviewed Aug 7, 2022

View reviewed changes

taichi/runtime/gfx/runtime.cpp Outdated Show resolved Hide resolved

bobcao3 reviewed Aug 7, 2022

View reviewed changes

taichi/rhi/vulkan/vulkan_device.h Outdated Show resolved Hide resolved

bobcao3 reviewed Aug 7, 2022

View reviewed changes

taichi/rhi/vulkan/vulkan_device.cpp Outdated Show resolved Hide resolved

bobcao3 self-requested a review August 7, 2022 20:30

bobcao3 approved these changes Aug 7, 2022

View reviewed changes

taichi-gardener force-pushed the vk-image-layout branch from 98a4ac5 to 6aaed60 Compare August 8, 2022 02:22

taichi-gardener force-pushed the vk-image-layout branch from 7345a24 to 34481ad Compare August 10, 2022 02:11

PENGUINLIONG and others added 8 commits August 10, 2022 05:28

Track image layout internally

8d809f3

Track layout for imported images

0bca963

Make get_image_layout public

1caaad2

[pre-commit.ci] auto fixes from pre-commit.com hooks

f57d82b

for more information, see https://pre-commit.ci

Track image layout in runtime level instead of device level

fd4abb3

[pre-commit.ci] auto fixes from pre-commit.com hooks

c37cdb7

for more information, see https://pre-commit.ci

Removed unrelated results

9c97130

Minor fixes

00e15ce

pre-commit-ci bot and others added 12 commits August 10, 2022 05:28

[pre-commit.ci] auto fixes from pre-commit.com hooks

ce5e03a

for more information, see https://pre-commit.ci

Minor fixes

b2b3144

Minor fixes

8a523be

[pre-commit.ci] auto fixes from pre-commit.com hooks

a3225c8

for more information, see https://pre-commit.ci

Disable validation

3ced5b6

Update runtime.cpp

7682b79

Resolved review comments

3c334c4

Moved image tracking registration to create_image

a964e86

[pre-commit.ci] auto fixes from pre-commit.com hooks

3fcebeb

for more information, see https://pre-commit.ci

Fixed texture layout transition

f7e51fa

Support creating texture in OpenGL backend

d55a56c

[pre-commit.ci] auto fixes from pre-commit.com hooks

e199ec5

for more information, see https://pre-commit.ci

taichi-gardener force-pushed the vk-image-layout branch from 34481ad to e199ec5 Compare August 10, 2022 05:28

PENGUINLIONG merged commit 3895ff7 into taichi-dev:master Aug 10, 2022

PENGUINLIONG deleted the vk-image-layout branch August 10, 2022 06:25

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Vulkan] Track image layout internally #5597

[Vulkan] Track image layout internally #5597

PENGUINLIONG commented Aug 2, 2022

netlify bot commented Aug 2, 2022 •

edited

Loading

bobcao3 commented Aug 3, 2022

bobcao3 commented Aug 3, 2022 •

edited

Loading

bobcao3 commented Aug 3, 2022

PENGUINLIONG commented Aug 3, 2022

PENGUINLIONG commented Aug 5, 2022

bobcao3 left a comment

PENGUINLIONG commented Aug 8, 2022

PENGUINLIONG commented Aug 10, 2022

PENGUINLIONG commented Aug 10, 2022

[Vulkan] Track image layout internally #5597

[Vulkan] Track image layout internally #5597

Conversation

PENGUINLIONG commented Aug 2, 2022

netlify bot commented Aug 2, 2022 • edited Loading

✅ Deploy Preview for docsite-preview canceled.

bobcao3 commented Aug 3, 2022

bobcao3 commented Aug 3, 2022 • edited Loading

bobcao3 commented Aug 3, 2022

PENGUINLIONG commented Aug 3, 2022

PENGUINLIONG commented Aug 5, 2022

bobcao3 left a comment

Choose a reason for hiding this comment

PENGUINLIONG commented Aug 8, 2022

PENGUINLIONG commented Aug 10, 2022

PENGUINLIONG commented Aug 10, 2022

netlify bot commented Aug 2, 2022 •

edited

Loading

bobcao3 commented Aug 3, 2022 •

edited

Loading