Randomness conciseness #10

sammymuench · 2024-02-09T03:18:05Z

Pull request for updated code. Changes in annual ipynb, make_xy_data_splits, and intro ipynb. data dir is now included with a couple large files removed.

kheuton

I commented on things that should change. In brief: I don't love the environment variable system we've got now, but unless we get rid of it completely we should not silently override it. Most changes requested are small

kheuton · 2024-02-20T16:54:02Z

experiment-runner/fit_and_predict.py

@@ -206,6 +206,7 @@ def calc_score_dict_uncertainty(model, x_df, y_df, split_name,
            'INTPTLAT', 'INTPTLON']

    tr, va, te = make_xy_data_splits.load_xy_splits(
+        data_dir = '../cook-county/cleaning-cook-county/data_dir',


This shouldn't be necessary. load_xy_splits uses the data directory set by the users environment variable. I don't love the environment variable either, but by hardcoding this path here it silently breaks how the DATA_DIR environment variable works

kheuton · 2024-02-20T19:03:30Z

cook-county/cleaning-cook-county/extract_dataset.py

@@ -7,7 +7,7 @@
    parser = argparse.ArgumentParser()
    parser.add_argument('--steps_to_run', default='1,2,3,4,5', type=str)
    args = parser.parse_args()
-
+    os.environ['DATA_DIR'] = 'data_dir'


You can add this as the default on the next line, but we shouldn't overwrite the user's environment here

kheuton · 2024-02-20T19:36:34Z

cook-county/cleaning-cook-county/cook-county-annual.ipynb

-Dont override environment variable, but can proivde ./data_dir as a default if needed

Why was the rounding change necessary?

…ch will generate yearly semi and quarterly dfs in one command. This script also has option to add code to easily generate monthly, biweekly, and weekly dfs. The data has been rearranged to do this correct with new "data_dir" in the right spot. Also, this commit added compare_timesteps.py and compare_timesteps_fit_and_predict.py, scripts which can compare the predictive power of 2 timesteps using one as a baseline.

https://github.com/tufts-ml/opioid-overdose-models into randomness-conciseness

sammymuench added 2 commits February 7, 2024 17:07

added conciseness and randomness updates

1b559e7

deleted large files

2be3fb3

kheuton self-requested a review February 12, 2024 21:14

kheuton requested changes Feb 20, 2024

View reviewed changes

sammymuench added 4 commits April 15, 2024 17:28

Merge branches 'randomness-conciseness' and 'randomness-conciseness' of

d15f094

https://github.com/tufts-ml/opioid-overdose-models into randomness-conciseness

Updated lag fairness

7f27f0d

fair lookback period and small errors

ed93fb4

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Randomness conciseness #10

Randomness conciseness #10

sammymuench commented Feb 9, 2024

kheuton left a comment

kheuton Feb 20, 2024

kheuton Feb 20, 2024

kheuton Feb 20, 2024

Randomness conciseness #10

Are you sure you want to change the base?

Randomness conciseness #10

Conversation

sammymuench commented Feb 9, 2024

kheuton left a comment

Choose a reason for hiding this comment

kheuton Feb 20, 2024

Choose a reason for hiding this comment

kheuton Feb 20, 2024

Choose a reason for hiding this comment

kheuton Feb 20, 2024

Choose a reason for hiding this comment