FAIR: Finding Accessible Inequalities Research in Public Health (the FAIR Database)

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.

This is the source code repository for the FAIR project.

System Overview

Requirements

pip install nltk numpy scikit-learn scikit-image matplotlib torchtext
# requirements from pytorch-transformers/wiki
pip install transformers pymediawiki

Workflow

Get the pre-defined Wikipedia categories, which we call the candidate categories (or candidate list). These are the categories used to summarize or label a given abstract or paper. We also manually reviewed the list and removed categories that are not relevant.
For finding similar and related topics:
- get a ClinicalBERT embedding for each category in the candidate list
```
/sources/Obtain_and_save_embeddingspre_for_predefined_categories.ipynb
```
- given a category, retrieve the most similar categories by computing the cosine similarity between the category embeddings
```
similarity_given_anytopic.ipynb
```
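The similarity lookup described above can be sketched as follows. This is a minimal illustration and not the notebook's actual code: the function name `most_similar`, the toy category names, and the 3-dimensional vectors are all assumptions standing in for the saved ClinicalBERT embeddings.

```python
import numpy as np

def most_similar(query, names, embeddings, top_k=3):
    """Return the top_k categories closest to `query` by cosine similarity."""
    # Normalize each row so that a dot product equals the cosine similarity.
    norms = np.linalg.norm(embeddings, axis=1, keepdims=True)
    unit = embeddings / np.clip(norms, 1e-12, None)
    scores = unit @ unit[names.index(query)]
    # Rank every category except the query itself, highest score first.
    order = [i for i in np.argsort(-scores) if names[i] != query]
    return [(names[i], float(scores[i])) for i in order[:top_k]]

# Toy vectors (hypothetical); real embeddings would have many more dimensions.
names = ["Health equity", "Social inequality", "Cardiology"]
embeddings = np.array([
    [1.0, 0.9, 0.0],
    [0.9, 1.0, 0.1],
    [0.0, 0.1, 1.0],
])
print(most_similar("Health equity", names, embeddings, top_k=2))
```

Normalizing the rows once means each query costs a single matrix-vector product, which is likely sufficient for a candidate list of modest size.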
For labelling a paper:
- get the unigrams, bigrams and trigrams in the abstract (step 2).
- save the n-grams that also appear in the candidate list (step 2).
- get all nouns in the abstract (step 3).
- retrieve the related categories of each noun, and save the related categories that also appear in the candidate list (step 3).
- combine the saved n-grams with the saved related categories (step 4).
```
Label_arbitrary_paper.ipynb
```
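The labelling steps above can be sketched in plain Python. This is a hedged illustration rather than the notebook's implementation: `ngrams`, `label_abstract`, and the sample data are hypothetical, and the `related_by_noun` dictionary is an assumed, precomputed stand-in for the noun extraction and related-category lookup that the notebook performs.

```python
import re

def ngrams(text, max_n=3):
    """Return the set of lowercase 1- to max_n-grams found in `text`."""
    tokens = re.findall(r"[a-z0-9]+", text.lower())
    return {
        " ".join(tokens[i:i + n])
        for n in range(1, max_n + 1)
        for i in range(len(tokens) - n + 1)
    }

def label_abstract(abstract, candidates, related_by_noun):
    """Combine n-gram matches with related-category matches."""
    lookup = {c.lower(): c for c in candidates}
    grams = ngrams(abstract)
    # n-grams of the abstract that are themselves candidate categories.
    from_ngrams = {lookup[g] for g in grams if g in lookup}
    # Related categories of nouns that occur in the abstract, kept only
    # when they are also in the candidate list.
    from_related = {
        lookup[r.lower()]
        for noun, related in related_by_noun.items() if noun.lower() in grams
        for r in related if r.lower() in lookup
    }
    return sorted(from_ngrams | from_related)

# Hypothetical inputs for illustration only.
candidates = ["Health equity", "Poverty", "Social class"]
related_by_noun = {"income": ["Poverty", "Economics"]}
abstract = "We study health equity and income in urban areas."
print(label_abstract(abstract, candidates, related_by_noun))
```

Taking the union of the two sets removes duplicates when a category is found by both routes.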
The Plus_classifiers folder contains two PROGRESS-Plus classifier models.
