New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

sum_to_zero_vector case study #229

Open

mitzimorris wants to merge 4 commits into master from case-study/sum-to-zero

Member

mitzimorris commented Mar 3, 2025

Transferring the contents of github repo: https://github.com/mitzimorris/sum_to_zero_vector to this repo.

This case study introduces the sum_to_zero_vector. It demonstrates a simple workflow for evaluating performance of different ways to impose a sum-to-zero constraint on a parameter vector.

The HTML file is self-contained. To re-render the HTML, this requires the stan-dev/quarto-config repo for the Stan website styling.

mitzimorris added 3 commits

August 5, 2024 14:54


          Merge branch 'master' of https://github.com/stan-dev/example-models

85cdd5d


          update README, LICENSE

049d7ba


          case study, from mitzimorris github repo

039bec4

mitzimorris requested review from WardBrian and spinkney

March 3, 2025 16:03

Member Author

mitzimorris commented Mar 3, 2025

hi @spinkney and @WardBrian - not sure if we need reviews to add case studies, but I would appreciate any feedback you might have, if you have time.

WardBrian approved these changes

View reviewed changes

Member

WardBrian left a comment

I only have one real comment, otherwise this looks great!

Will you also open a PR to add the rendered version to the website?

jupyter/sum-to-zero/sum_to_zero_evaluation.qmd Outdated Show resolved Hide resolved

jupyter/sum-to-zero/sum_to_zero_evaluation.qmd Outdated Show resolved Hide resolved


          changes per review

e367d4c

spinkney requested changes

View reviewed changes

Collaborator

spinkney left a comment

I mostly added comments that hopefully helps the reader to see the differences and the results more clearly. I'm happy that you put this together and I'm happy to chat about any of the comments if you'd like.

jupyter/sum-to-zero/sum_to_zero_evaluation.qmd

Comment on lines +28 to +29

Collaborator

spinkney Mar 6, 2025

I found it hard to understand what all the code drop downs are. Can you add a sentence saying what this is. For example, Code to load libraries and setup environment.

jupyter/sum-to-zero/sum_to_zero_evaluation.qmd

+              >A sum to zero vector is exactly what the name suggests. A vector where the sum of the elements equals 0.
+              If you put a normal prior on the zero-sum vector the resulting variance will be less than the intended normal variance.
+              To get the same variance as the intended normal prior do

Collaborator

spinkney Mar 6, 2025

Please add the discourse username for quotes

jupyter/sum-to-zero/sum_to_zero_evaluation.qmd

+              and the hard and soft sum-to-zero implementations.
+              We fit each model to the same dataset, using the same random seed, and then
+              compare the summary statistics for the constrained parameter values.
+              Since the models are equivalent, we expect that all three implementations

Collaborator

spinkney Mar 6, 2025

The models are nearly equivalent. The hard sum-to-zero, the soft sum-to-zero, and the built-in all have different implied prior distributions that are being placed on the vector.

jupyter/sum-to-zero/sum_to_zero_evaluation.qmd


		* The specified test sensitivity and specificity

		In order to fit this model, we need to put a sum-to-zero constraint on the categorical variables.

Collaborator

spinkney Mar 6, 2025

Not true, we could drop a category or do some other type of contrast coding. I suggest rewriting to "We find the sum-to-zero constraint on teh categorical variables to be preferable to dropping a category for reference or other contrast coding strategy because it let's us model all the categories as offset from the mean.

jupyter/sum-to-zero/sum_to_zero_evaluation.qmd



		##### Instantiate the data generating model.

Collaborator

spinkney Mar 6, 2025

Add a sentence here about what this is

jupyter/sum-to-zero/sum_to_zero_evaluation.qmd

		#### Model 1: `sum_to_zero_vector`

		This model is in file [binomial_4_preds_ozs.stan](https://github.com/stan-dev/example-models/tree/master/jupyter/sum-to-zero/stan/binomial_4_preds_ozs.stan).

Collaborator

spinkney Mar 6, 2025

I suggest showing at least the parameters block for the sum_to_zero_vector. For the below python code, I suggest putting all the following code in one block.

jupyter/sum-to-zero/sum_to_zero_evaluation.qmd

		```

		#### Model 2: Hard sum-to-zero constraint

Collaborator

spinkney Mar 6, 2025

Highlight what has changed from the previous Stan code. I suggest putting all the python into one block. This goes for the following section as well.

jupyter/sum-to-zero/sum_to_zero_evaluation.qmd

		```

		#### Runtime performance

Collaborator

spinkney Mar 6, 2025

Why even have this section if there isn't a table to show?

Collaborator

spinkney Mar 6, 2025

I suggest merging this with the below and changing the title to something like: Model Checking, Comparison, and Efficiency

jupyter/sum-to-zero/sum_to_zero_evaluation.qmd

		```

		Eth

Collaborator

spinkney Mar 6, 2025

What is this? What does all the code do? Don't assume your readers know python that well!

jupyter/sum-to-zero/sum_to_zero_evaluation.qmd

+              display_side_by_side(small_html, large_html)
+              ```
+              All models have R-hat values of 1.00 for all group-level parameters and high effective sample sizes.

Collaborator

spinkney Mar 6, 2025

I highly suggest to put all the results into a table so the reader can see the differences across all the models without having to read the code output. I don't think you even need to show this code. Just a table and talk about the differences.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet