Update `beta` distribution parameters #643

MustaphaU · 2024-10-01T03:46:15Z

Issue #, if available:

Description of changes:

Update the beta distribution parameters in the `simulate_experiment` function to avoid bias towards lower success probability.

The current specification of the beta distribution:

theta = np.random.beta(conversions + 1, exposures + 1)

treats every exposure as a failure: it overstates the failure count and therefore underestimates the success probability of each variation. The effect is pronounced for variations with high baseline conversion rates and less severe for variations with very low conversion rates.
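To make the size of the bias concrete, here is a small sketch comparing the posterior mean under both parameterizations. The helper name `posterior_mean` and the example counts are hypothetical, chosen only for illustration; the closed form used is the standard mean of a Beta(α, β) distribution, α / (α + β).

```python
def posterior_mean(alpha, beta):
    """Mean of a Beta(alpha, beta) distribution: alpha / (alpha + beta)."""
    return alpha / (alpha + beta)

# Hypothetical variation with a high conversion rate: 90 conversions in 100 exposures.
conversions, exposures = 90, 100
high_old = posterior_mean(conversions + 1, exposures + 1)                # 91/192, about 0.474
high_new = posterior_mean(conversions + 1, exposures - conversions + 1)  # 91/102, about 0.892

# Hypothetical variation with a low conversion rate: 2 conversions in 100 exposures.
conversions, exposures = 2, 100
low_old = posterior_mean(conversions + 1, exposures + 1)                 # 3/104, about 0.0288
low_new = posterior_mean(conversions + 1, exposures - conversions + 1)   # 3/102, about 0.0294

print(f"high rate: old={high_old:.3f}, corrected={high_new:.3f}")
print(f"low rate:  old={low_old:.4f}, corrected={low_new:.4f}")
```

With these counts the old parameterization roughly halves the estimate for the high-converting variation, while the low-converting one barely moves.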

Traditionally, the Thompson Sampling algorithm for the Bernoulli bandit is:

$$\begin{align*}
1: & \text{for } t = 1, 2, \ldots \text{ do:} \\
2: & \quad \quad \text{Sample model:} \\
3: & \quad \quad \text{for } k = 1 \text{ to } K \text{ do:} \\
4: & \quad \quad \quad \text{Sample } \theta_k \sim \text{beta}(\alpha_k, \beta_k) \\
5: & \quad \quad \text{end for} \\
6: & \quad \quad \text{Select and apply action:} \\
7: & \quad \quad x_t \leftarrow \arg\max_k \theta_k \\
8: & \quad \quad \text{Apply } x_t \text{ and observe } r_t \\
9: & \quad \quad \text{Update distribution:} \\
10: & \quad \quad (\alpha_{x_t}, \beta_{x_t}) \leftarrow (\alpha_{x_t} + r_t, \beta_{x_t} + 1 - r_t) \\
11: & \text{end for}
\end{align*}$$

Here α and β are the parameters of each arm's beta distribution, i.e. the success and failure counts, or equivalently the numbers of conversions and non-conversions, respectively.

non-conversions (or beta)  = exposures - conversions
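The loop above can be sketched in a few lines of Python. This is a minimal illustration, not the notebook's code: the function name `thompson_sampling`, the `true_rates` argument, and the uniform Beta(1, 1) starting parameters are assumptions, and the standard library's `random.betavariate` stands in for `np.random.beta` to keep the example dependency-free.

```python
import random

def thompson_sampling(true_rates, n_rounds, seed=0):
    """Bernoulli-bandit Thompson Sampling with a Beta(1, 1) prior per arm (assumed)."""
    rng = random.Random(seed)
    k = len(true_rates)
    alpha = [1] * k  # 1 + conversions observed for each arm
    beta = [1] * k   # 1 + non-conversions observed for each arm
    for _ in range(n_rounds):
        # Sample model: one draw of theta per arm from its current beta distribution.
        theta = [rng.betavariate(alpha[i], beta[i]) for i in range(k)]
        # Select and apply action: play the arm with the largest sampled theta.
        arm = max(range(k), key=lambda i: theta[i])
        reward = 1 if rng.random() < true_rates[arm] else 0
        # Update distribution: a success increments alpha, a failure increments beta.
        alpha[arm] += reward
        beta[arm] += 1 - reward
    return alpha, beta

alpha, beta = thompson_sampling([0.2, 0.8], n_rounds=2000)
for i, (a, b) in enumerate(zip(alpha, beta)):
    conversions = a - 1
    exposures = a + b - 2
    print(f"arm {i}: conversions={conversions}, exposures={exposures}")
```

Note that each round adds exactly one count to either `alpha` or `beta` of the chosen arm, so the second parameter always tracks `exposures - conversions`, never `exposures` itself.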

Description of testing performed to validate your changes (required if pull request includes CloudFormation or source code changes):

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.


james-jory

Thank you for catching this oversight!

…ndex` method

Update the beta distribution parameters in the `_select_variation_index` method to avoid bias towards lower success probability. The current specification of the beta distribution:

```
theta = np.random.beta(conversions + 1, exposures + 1)
```

treats every exposure as a failure: it overstates the failure count and therefore underestimates the success probability of each variation. The effect is pronounced for variations with high baseline conversion rates and less severe for variations with very low conversion rates.

aws-samples#643

MustaphaU · 2024-10-04T21:10:10Z

> Thank you for catching this oversight!

Thank you, @james-jory!
#647

…ndex` method (#647)

Update the beta distribution parameters in the `_select_variation_index` method to avoid bias towards lower success probability. The current specification of the beta distribution:

```
theta = np.random.beta(conversions + 1, exposures + 1)
```

treats every exposure as a failure: it overstates the failure count and therefore underestimates the success probability of each variation. The effect is pronounced for variations with high baseline conversion rates and less severe for variations with very low conversion rates.

#643

Co-authored-by: James Jory

MustaphaU added 2 commits September 30, 2024 23:44

Merge branch 'master' into patch-2

bd45f87

james-jory self-requested a review October 4, 2024 18:09

james-jory approved these changes Oct 4, 2024

Merge branch 'master' into patch-2

61e5430

james-jory merged commit 01f9d3d into aws-samples:master Oct 4, 2024
2 checks passed

MustaphaU mentioned this pull request Oct 4, 2024

Update the beta distribution parameters #647

Merged
