Allow to temporarily set the current registry even if it is not associated with a worker thread #1166
Conversation
While the change to temporarily stash a reference to a "foreign" registry in a TLS variable appears to pass the test suite, I am admittedly not confident about the implications of having a current registry that is not associated with a worker thread in the first place. I am also unsure about the performance implications of having the TLS access in

@awused Could you give this a try to see whether it would work for your use case at least?
But then again, this should be fine: for example, the main thread is always in this relation w.r.t. the global pool, right?
Here are the tests I added:

```rust
#[test]
#[cfg_attr(any(target_os = "emscripten", target_family = "wasm"), ignore)]
fn scope_par_iter_which_pool() {
    let pool = ThreadPoolBuilder::new()
        .num_threads(1)
        .thread_name(|_| "worker".to_owned())
        .build()
        .unwrap();

    // Determine which pool is currently installed here
    // by checking the thread name seen by spawned work items.
    pool.scope(|_scope| {
        let (name_send, name_recv) = channel();
        let v = [0; 1];
        v.par_iter().for_each(|_| {
            let name = thread::current().name().map(ToOwned::to_owned);
            name_send.send(name).unwrap();
        });
        let name = name_recv.recv().unwrap();
        assert_eq!(name.as_deref(), Some("worker"));
    });
}

#[test]
#[cfg_attr(any(target_os = "emscripten", target_family = "wasm"), ignore)]
fn in_place_scope_par_iter_which_pool() {
    let pool = ThreadPoolBuilder::new()
        .num_threads(1)
        .thread_name(|_| "worker".to_owned())
        .build()
        .unwrap();

    // Determine which pool is currently installed here
    // by checking the thread name seen by spawned work items.
    pool.in_place_scope(|_scope| {
        let (name_send, name_recv) = channel();
        let v = [0; 1];
        v.par_iter().for_each(|_| {
            let name = thread::current().name().map(ToOwned::to_owned);
            name_send.send(name).unwrap();
        });
        let name = name_recv.recv().unwrap();
        assert_eq!(name.as_deref(), Some("worker"));
    });
}
```
As you can infer from the assertion failure, the second does not pass because it assumes that all work would end up on the worker threads. For the parallel iterators (and indeed for any join-based interface) this will not be the case: some of the work can be executed directly on the main thread (which is the test thread in this case). The test as written does not check which worker pool ends up being used. Please have a look at the tests I added here which use the (global)

However, extending the tests to use more work and check for either the main thread or the worker threads still fails. So while the original test did not check this, the work does not seem to end up on the right pool after all. Will investigate...
36e5d84
to
301b603
Compare
I was missing more direct usages of

```rust
#[test]
#[cfg_attr(any(target_os = "emscripten", target_family = "wasm"), ignore)]
fn in_place_scope_par_iter_which_pool() {
    let pool = ThreadPoolBuilder::new()
        .num_threads(1)
        .thread_name(|_| "worker".to_owned())
        .build()
        .unwrap();

    // Determine which pool is currently installed here
    // by checking the thread name seen by spawned work items.
    pool.in_place_scope(|_scope| {
        let (name_send, name_recv) = std::sync::mpsc::channel();
        let v = [0; 128];
        v.par_iter().for_each(|_| {
            let name = std::thread::current().name().map(ToOwned::to_owned);
            name_send.send(name).unwrap();
        });
        drop(name_send);
        for name in name_recv {
            let name = name.unwrap();
            assert!(name.contains("in_place_scope_par_iter_which_pool") || name == "worker");
        }
    });
}
```

which does end up submitting work into the pool. But I think I would still prefer to have a more targeted test case using
That is the entire bug report in #1165. The documentation is unclear on this point and makes it sound like it will be the case, which is why I suggested updating the documentation for clarity.
Huh, I guess it can do that; I have no idea what is different between my code and this test.
Except for asserting that multiple names are one of two choices, the only real difference in the workload is that you used a vector of length one whereas I used one of length 128, to ensure that some of the work would end up on the worker threads (having
I tend to disagree. I think the bug here is that when work is dispatched onto worker threads via global interfaces like
I was running it with enough items (I even tried adding a sleep to make sure all available threads were finding work). These asserts all pass when run against the current rayon release; with this PR, the second run should assert "outer" instead.

```rust
ThreadPoolBuilder::new()
    .thread_name(|u| format!("global"))
    .build_global()
    .unwrap();

let stuff = vec![0; 50000];
stuff.par_iter().for_each(|_| {
    assert_eq!(std::thread::current().name(), Some("global"));
    std::thread::sleep(Duration::from_millis(1));
});

let outer_pool = ThreadPoolBuilder::new()
    .thread_name(|u| "outer".to_owned())
    .build()
    .unwrap();
outer_pool.in_place_scope(|_| {
    stuff.par_iter().for_each(|_| {
        assert_eq!(std::thread::current().name(), Some("global"));
        std::thread::sleep(Duration::from_millis(1));
    })
});
```

I still can't figure out what exactly is different with the test being run compared to this code. Based on that test, my code should fail.
Exactly, and with the currently pushed version, the code above does indeed fail with
I think I lost you on which test code we are talking about exactly. At least for the code posted in #1166 (comment), the problem was the vector length of one, which meant there was no splitting at all and the single invocation happened directly on the main/test thread. Using more work meant
In the end, I don't think it matters much really; it's a tangential issue that shouldn't make a material difference in program execution, since the calling thread is still blocked until the parallel iterator ends anyway. This PR does seem to address the issue.
Reproducing the issue in #1165.
Will look into whether this can be changed without introducing deadlocks...