GitHub - rbroc/mental-health-llm-bias

- 1_create_questionnaire_specs.py creates JSON files (saved under specs.json) containing the information on the target questionnaires that is needed to convert numerical scores to prompts;
- 2_simulate_scores.py simulates questionnaire responses, with an equal number of simulated individuals in each severity bin defined for the target questionnaire. It generates all possible combinations of per-question scores, then downsamples them to 1,000 total examples, equally distributed across severity bins. Outputs are saved under scores;
- 3_scores_to_narratives.py maps these outputs to text, creating a narrative version of the questionnaire, saved in outputs;
- 4_paraphrase_narratives.py paraphrases the narrative version of the questionnaire, which is needed for one of the conditions;
- 5_add_demographic_premise_and_instructions.py adds the demographic premise and the instructions (both experimental factors) to each example, yielding the final evaluation dataset.
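The simulation step described for 2_simulate_scores.py can be sketched roughly as follows. This is a minimal illustration and not the repository's actual code: the function name `simulate_balanced_scores`, the 4-item questionnaire, the 0-3 item scale, and the severity cut-offs are all hypothetical, chosen only to keep the example small.

```python
import itertools
import random
from collections import defaultdict


def simulate_balanced_scores(n_items, max_item_score, bins, n_total, seed=0):
    """Enumerate all item-score combinations and downsample equally per bin.

    `bins` maps a severity label to an inclusive (low, high) range of
    total scores. All names and defaults here are illustrative assumptions.
    """
    rng = random.Random(seed)

    # Group every possible combination of per-question scores by severity bin.
    by_bin = defaultdict(list)
    for combo in itertools.product(range(max_item_score + 1), repeat=n_items):
        total = sum(combo)
        for label, (low, high) in bins.items():
            if low <= total <= high:
                by_bin[label].append(combo)
                break

    # Draw the same number of examples, without replacement, from each bin.
    per_bin = n_total // len(bins)
    sample = {}
    for label in bins:
        candidates = by_bin[label]
        if len(candidates) < per_bin:
            raise ValueError(f"bin {label!r} has only {len(candidates)} combinations")
        sample[label] = rng.sample(candidates, per_bin)
    return sample


# Hypothetical questionnaire: 4 items scored 0-3, with made-up severity bins.
example_bins = {
    "minimal": (0, 3),
    "mild": (4, 6),
    "moderate": (7, 9),
    "severe": (10, 12),
}
balanced = simulate_balanced_scores(
    n_items=4, max_item_score=3, bins=example_bins, n_total=40
)
```

Sampling within each bin, instead of from the full set of combinations, is what keeps the severity bins balanced: extreme totals can be produced by far fewer combinations than mid-range totals, so a uniform draw over all combinations would under-represent them.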

evaluation/data/model_completions
legacy
nbs/explore_generations
outputs
.gitignore
1_create_questionnaire_specs.py
2_simulate_scores.py
3_scores_to_narratives.py
3_scores_to_narratives_no_pronouns.py
4_paraphrase_narratives.py
5_pronouns_replacement.py
5_select_instances_to_review.ipynb
6_combine_paraphrases.py
7_add_demographic_premise_and_instructions.py
8_output_format_analysis.ipynb
8_prompting.py
README.md
nohup.out
