Remove old reduction "tuning" from OpenMP Target variants #484

rhornung67 · 2024-10-08T19:20:25Z

Summary

This PR removes the old reductions from OpenMP Target variants of kernels with reductions.
It also removes declarations of tuning methods in kernel class headers when those methods are unimplemented.
Lastly, it contains some minor changes for code consistency, cleanup, etc.

rchen20 · 2024-10-08T20:06:21Z

src/algorithm/REDUCE_SUM-OMPTarget.cpp

+      RAJA::forall<RAJA::omp_target_parallel_for_exec<threads_per_team>>(
+        RAJA::RangeSegment(ibegin, iend),
+        RAJA::expt::Reduce<RAJA::operators::plus>(&tsum),
+        [=] (Index_type i, Real_type& sum) {


Real_type& sum would need to be RAJA::expt::ValOp<Real_type, RAJA::operators::plus> & sum if we are pulling in the latest changes from RAJA/develop.

We are not on RAJA develop yet. That will come later. One chunk of changes at a time.
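To make the signature change discussed above concrete, here is a minimal sketch in plain C++. It does not use RAJA: `ToySumOp`, `toy_reduce_plain`, `toy_reduce_wrapped`, `sum_plain`, and `sum_wrapped` are hypothetical names invented for illustration, and `ToySumOp` is only a stand-in for the idea of a value-plus-operator wrapper such as the `ValOp` type mentioned in the comment. The point is that the loop body stays the same while the type of the reducer argument in the lambda changes.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical stand-in for a "value + operator" reduction wrapper.
// This is NOT RAJA's ValOp; it only mimics the idea for illustration.
template <typename T>
struct ToySumOp {
  T val{};
  ToySumOp& operator+=(const T& rhs) { val += rhs; return *this; }
};

// Old-style driver: the body receives a plain reference to the value type.
template <typename Body>
double toy_reduce_plain(std::size_t n, Body body) {
  double sum = 0.0;
  for (std::size_t i = 0; i < n; ++i) { body(i, sum); }
  return sum;
}

// New-style driver: the body receives a reference to the wrapper type.
template <typename Body>
double toy_reduce_wrapped(std::size_t n, Body body) {
  ToySumOp<double> sum;
  for (std::size_t i = 0; i < n; ++i) { body(i, sum); }
  return sum.val;
}

double sum_plain(const std::vector<double>& x) {
  assert(!x.empty());
  // Lambda takes `double& sum`, like `Real_type& sum` in the diff.
  return toy_reduce_plain(x.size(),
      [&](std::size_t i, double& sum) { sum += x[i]; });
}

double sum_wrapped(const std::vector<double>& x) {
  assert(!x.empty());
  // Only the parameter type changes; the body `sum += x[i]` is untouched.
  return toy_reduce_wrapped(x.size(),
      [&](std::size_t i, ToySumOp<double>& sum) { sum += x[i]; });
}
```

Both helpers return the same sum for the same input. In this sketch, moving to the wrapper type only requires updating the lambda parameter list, which may be why the change can be staged separately from this PR.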

rchen20 · 2024-10-08T20:10:03Z

src/basic/REDUCE3_INT-OMPTarget.cpp

+      m_vmin = RAJA_MIN(m_vmin, static_cast<Int_type>(tvmin));
+      m_vmax = RAJA_MAX(m_vmax, static_cast<Int_type>(tvmax));


I'm sure there's a good reason, but why are we doing min and max operations outside of the forall here?

Technically, it shouldn't matter when the kernel runs multiple times, but we want to make sure we are not just seeing the result from the last run through the kernel. This change was not made in this PR. It's been like that since the beginning.

Ok, this looks fine because we're using the macros on integer types. If this were inside a lambda, we'd need to use .min() on the ValOp type in the near future.

Yup. Those changes will be coming.
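For readers following this thread, the pattern can be sketched in plain C++ without RAJA. `ToyMinMaxKernel` and `run_rep` are hypothetical names, and the initial values used here are an assumption, not the ones the suite uses. Each repetition reduces into local temporaries, then folds them into the class members, so the members reflect every repetition rather than only the last one.

```cpp
#include <algorithm>
#include <cassert>
#include <limits>
#include <vector>

// Hypothetical kernel-like class; member names mirror the diff (m_vmin, m_vmax).
struct ToyMinMaxKernel {
  int m_vmin = std::numeric_limits<int>::max();  // assumed initial value
  int m_vmax = std::numeric_limits<int>::min();  // assumed initial value

  // One repetition of the kernel over `data`.
  void run_rep(const std::vector<int>& data) {
    assert(!data.empty());

    // Per-repetition reduction results (tvmin / tvmax in the diff).
    int tvmin = std::numeric_limits<int>::max();
    int tvmax = std::numeric_limits<int>::min();
    for (int v : data) {
      tvmin = std::min(tvmin, v);
      tvmax = std::max(tvmax, v);
    }

    // Fold into the members outside the loop, so the stored result
    // combines all repetitions instead of overwriting with the last one.
    m_vmin = std::min(m_vmin, tvmin);
    m_vmax = std::max(m_vmax, tvmax);
  }
};
```

If the members were simply assigned from `tvmin` and `tvmax`, a second repetition over different data would discard the first repetition's result. With identical data on every repetition the two approaches agree, which matches the remark that it technically shouldn't matter.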

rhornung67 added 9 commits October 3, 2024 15:07

Only use new reductions in OMP Target variants

85d0c5a

Fix erroneous deletions.

45e7d4b

Merge branch 'bugfix/burmark1/multireduce_ompt' into task/rhornung67/…

9a9bef8

…rm-ompt-reduction-tunings

Squash unused arg warnings

2b02fe0

Change variant string name to be consistent with enum name

1d164e4

Remove extraneous lambda and make string message consistent with othe…

70f8c16

…r kernels.

Make checksum scale factor a class member.

537db6f

I don't know how this code compiled.

Remove unused member function declarations

c87862c

Merge branch 'develop' into task/rhornung67/rm-ompt-reduction-tunings

b6bbf64

rhornung67 requested review from MrBurmark, artv3 and rchen20 October 8, 2024 19:20

artv3 approved these changes Oct 8, 2024

rchen20 reviewed Oct 8, 2024

rchen20 approved these changes Oct 8, 2024

MrBurmark approved these changes Oct 10, 2024

rhornung67 merged commit 9af20b3 into develop Oct 10, 2024
24 checks passed

rhornung67 deleted the task/rhornung67/rm-ompt-reduction-tunings branch October 10, 2024 17:52
