Gpu code #58

Open
wants to merge 205 commits into
base: master
Conversation

abouzied-nasar

No description provided.

… b to stop an error related to prototype overloading in dummy.c in src/cuda
… task.c and h queue.h and removed unused vars from runner_doiact_functions_hydro_gpu.h
… tasks. Code now hangs so there must be some issue with task activation. problem for tomorrow!
@abouzied-nasar
Author

This is the current state of the GPU code as implemented in the latest SWIFT version. It's not working properly yet: I think there is an issue with the GPU task activation (the code hangs before the first time step), and I still need to implement a mechanism for creating deps for unpack tasks which do not belong to a cell.

…ave been source of hanging bug. Not the case :(
… deps for unpack tasks. CPU version works perfectly but GPU code now hangs for some reason
…gpu and re-worked runner_main_clean.cu but code still hangs
…gpu and re-worked runner_main_clean.cu but code still hangs
…. Removed them all and code still hangs. Will copy back previous files in next commit
…ing to with not unskipping tasks properly. IFDEFs are totally correct and code does not hang when GPU code commented out
…ng. Code is doing something but I don't think it's actually progressing through time steps
… where the number of leaf tasks will exceed target_n_tasks. Code is not working as we need to figure out when to lock/unlock and also correctly monitor how we are locking/unlocking on a top task level (not leaf level)
… code at the end for use in debugging/timing if required
…e from those which we actually use. We now basically have 6 functions for self and pair dens, grad and forc loops
…y be one function with if statement or some sort of variable over-loading to allow one function to work on 3 different data types. Something for another day!
…much clean of unnecessary allocations. There are some counters that need to be removed eventually as they are just for debugging and timing but left in for now
…w move onto sorting out recursion through sub_pair tasks.
…s about negative wait though when using recursion through pair tasks
…only have been done once for the top level task. Moved decrementing from the GPU launch function into runner_main so that s->waiting only decremented once per top level task
…npacking but I still need to sort out the counters for all eventualities
…rse->pack->offload->re-pack_if_necessary->offload_again_if_necessary
…(probably). Need to figure out how to get the locking/unlocking business to work properly though as I am now unpacking to a few unlocked cells
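
The commit messages above mention a negative wait count when recursing through pair tasks, fixed by moving the decrement out of the GPU launch function so that `s->waiting` is decremented once per top-level task. A minimal sketch of that idea follows. It is not SWIFT's code: `toy_task`, `toy_sched`, `toy_pack_leaves` and `toy_run_top_level` are hypothetical names invented for illustration, and the "packing" is reduced to setting a flag.

```c
#include <stddef.h>

/* Hypothetical stand-ins for illustration only; not SWIFT's actual structs. */
struct toy_task {
  struct toy_task *children[2]; /* sub-tasks reached by recursion, NULL if none */
  int packed;                   /* set to 1 once this leaf has been packed */
};

struct toy_sched {
  int waiting; /* number of top-level tasks still outstanding */
};

/* Recurse down to the leaves and "pack" each one.
 * Returns the number of leaves packed.
 * Deliberately does NOT touch the scheduler's waiting counter: doing so
 * here would decrement once per leaf and could drive the counter negative. */
static int toy_pack_leaves(struct toy_task *t) {
  if (t == NULL) return 0;
  if (t->children[0] == NULL && t->children[1] == NULL) {
    t->packed = 1;
    return 1;
  }
  return toy_pack_leaves(t->children[0]) + toy_pack_leaves(t->children[1]);
}

/* Called once per top-level task by the (assumed) main loop.
 * The single decrement lives here, outside the recursion. */
static int toy_run_top_level(struct toy_sched *s, struct toy_task *top) {
  int n_leaves = toy_pack_leaves(top);
  s->waiting--;
  return n_leaves;
}
```

With one top-level task that recurses into three leaves, `toy_run_top_level` packs all three and leaves `waiting` at 0 when it started at 1, whereas decrementing inside the recursion would have left it at -2.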

2 participants