[UR] [V2] Add wait before enqueue in command buffer #17709

Merged
merged 8 commits into from
Apr 2, 2025

@Xewar313 Xewar313 commented Mar 28, 2025

According to @MichalMrozek, zeCommandListImmediateAppendCommandListsExp has the same requirements as zeCommandQueueExecuteCommandLists. Because of this, the command list must not be referenced by the device when it is enqueued. This PR fixes the issue by adding an event to synchronize append and execution.

@Xewar313 Xewar313 requested review from a team as code owners March 28, 2025 12:50
@Xewar313 Xewar313 requested a review from reble March 28, 2025 12:50
if (phEvent == nullptr) {
  phEvent = &internalEvent;
}
UR_CALL(hCommandBuffer->awaitExecution(commandListLocked));

This will block on the host. The currentExecution event, if not null, should simply be added to the wait list when enqueuing the command list.
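
The suggestion above can be illustrated with a minimal, self-contained sketch. It is not the code merged in this PR: `Event`, `CommandBuffer`, `buildWaitList`, and `recordExecution` are hypothetical stand-ins invented for illustration, and no Level Zero or Unified Runtime calls are made. It only shows the idea of passing the previous execution's event as a device-side dependency instead of waiting for it on the host.

```cpp
#include <vector>

// Hypothetical stand-in for a device event handle (not a Level Zero type).
struct Event {
  int id;
};

// Hypothetical stand-in for the command buffer discussed in this thread.
struct CommandBuffer {
  // Event signalled when the previous execution finishes; null if the
  // buffer has not been enqueued yet.
  Event *currentExecution = nullptr;
};

// Build the wait list for the next enqueue: the caller's events plus, if
// present, the event of the previous execution. Nothing blocks on the host;
// the dependency is expressed purely through the wait list.
std::vector<Event *> buildWaitList(const CommandBuffer &buffer,
                                   const std::vector<Event *> &userWaitList) {
  std::vector<Event *> waitList = userWaitList;
  if (buffer.currentExecution != nullptr) {
    waitList.push_back(buffer.currentExecution);
  }
  return waitList;
}

// Remember the event that the new execution will signal, so that the next
// enqueue of the same command buffer can depend on it.
void recordExecution(CommandBuffer &buffer, Event *signalEvent) {
  buffer.currentExecution = signalEvent;
}
```

In a real adapter, the resulting list would presumably be handed to whichever call enqueues the command list, so that the device orders the two executions itself while the host thread returns immediately.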

EwanC added a commit to reble/llvm that referenced this pull request Mar 28, 2025
Taken from intel#17709

Co-authored-by:  Mikołaj Komar <[email protected]>
EwanC added a commit to reble/llvm that referenced this pull request Mar 31, 2025
Taken from intel#17709

Co-authored-by:  Mikołaj Komar <[email protected]>

Xewar313 commented Apr 1, 2025

The failure on CI is not related to code changes and has been reported in #17066. @intel/llvm-gatekeepers please merge.

EwanC added a commit to reble/llvm that referenced this pull request Apr 1, 2025
Taken from intel#17709

Co-authored-by:  Mikołaj Komar <[email protected]>

Xewar313 commented Apr 2, 2025

@intel/llvm-gatekeepers bump

@sommerlukas

> @intel/llvm-gatekeepers bump

There are two CI failures on this PR. Please review them, and either fix them if they are related to the changes in this PR or explain why they are unrelated, before we can merge this PR.


Xewar313 commented Apr 2, 2025

@sommerlukas Changes in this PR only affect the unified-runtime L0 v2 adapter and are completely unrelated to the CI failures. Jenkins failed due to an internal error, and e2e failed on an unstable test: previously it was the one described in #17066, and now a different test unexpectedly passes.


Xewar313 commented Apr 2, 2025

Moreover, it passed CI on the previous commit, and the changes in the last commit are purely cosmetic, with no influence on program logic.

@sommerlukas sommerlukas merged commit 3967086 into intel:sycl Apr 2, 2025
41 of 44 checks passed
@sommerlukas

Merged this now. @Xewar313 Can you please file a GitHub issue for the unexpectedly passing test, so we can track this?


Xewar313 commented Apr 4, 2025

I have filed issue #17842.

martygrant pushed a commit that referenced this pull request Apr 4, 2025
This PR disables the flaky fill test for command buffer that was enabled in
#17709.
The issue is connected to a bug in the driver that is patched in a newer
version, but the CI machines still have the old one, which causes the test to
sometimes fail (for example
https://github.com/intel/llvm/actions/runs/14250564960/job/39942652796?pr=17836).
7 participants