🔥 fhiry - FHIR to pandas dataframe for data analytics, AI and ML

Virtual flattened view of FHIR Bundle / ndjson / FHIR server / BigQuery!

🔥 FHIRy is a python package to facilitate health data analytics and machine learning by converting a folder of FHIR bundles/ndjson from bulk data export into a pandas data frame for analysis. You can import the dataframe into ML packages such as Tensorflow and PyTorch. FHIRy also supports FHIR server search and FHIR tables on BigQuery.

Test this with the synthea sample or the downloaded ndjson from the SMART Bulk data server. Use the 'Discussions' tab above for feature requests.

✨ Checkout this template for Multimodal machine learning in healthcare!

UPDATE 1

Recently added support for LLM based natural language queries of FHIR bundles/ndjson using llama-index. Please install the llm extras as follows. Please be cognizant of the privacy issues with publically hosted LLMs. Any feedback will be highly appreciated. See usage!

pip install fhiry[llm]

See usage.

UPDATE 2

Added support for converting a FHIR Bundle to its textual representation for LLMs. You can also convert individual FHIR resources including Patient, Condition, Observation, Procedure, Medication, AllergyIntolerance and DocumentReference.

from fhiry import FlattenFhir
bundle = json.load(jsonfile)
flatten_fhir = FlattenFhir(bundle)
print(flatten_fhir.flattened)

Installation

Stable

pip install fhiry

Latest dev version

pip install git+https://github.com/dermatologist/fhiry.git

Usage

1. Import FHIR bundles (JSON) from folder to pandas dataframe

import fhiry.parallel as fp
df = fp.process('/path/to/fhir/resources')
print(df.info())

Example source data set: Synthea

Jupyter notebook example: notebooks/synthea.ipynb

2. Import NDJSON from folder to pandas dataframe

import fhiry.parallel as fp
df = fp.ndjson('/path/to/fhir/ndjson/files')
print(df.info())

Example source data set: SMART Bulk Data Server Export

Jupyter notebook example: notebooks/ndjson.ipynb

3. Import FHIR Search results to pandas dataframe

Fetch and import resources from FHIR Search API results to pandas dataframe.

Documentation: fhir-search.md

Example: Import all conditions with a certain code from FHIR Server

Fetch and import all condition resources with Snomed (Codesystem http://snomed.info/sct) Code 39065001 in the FHIR element Condition.code (resource type specific FHIR search parameter code) to a pandas dataframe:

from fhiry.fhirsearch import Fhirsearch

fs = Fhirsearch(fhir_base_url = "http://fhir-server:8080/fhir")

my_fhir_search_parameters = {
    "code": "http://snomed.info/sct|39065001",
}

df = fs.search(resource_type = "Condition", search_parameters = my_fhir_search_parameters)

print(df.info())

4. Import Google BigQuery FHIR dataset

from fhiry.bqsearch import BQsearch
bqs = BQsearch()

df = bqs.search("SELECT * FROM `bigquery-public-data.fhir_synthea.patient` LIMIT 20") # can be a path to .sql file

Filters

Pass a config json to any of the constructors:

config_json can be a path to a json file.

df = fp.process('/path/to/fhir/resources', config_json='{ "REMOVE": ["resource.text.div"], "RENAME": { "resource.id": "id" }  }')

fs = Fhirsearch(fhir_base_url = "http://fhir-server:8080/fhir", config_json = '{ "REMOVE": ["resource.text.div"], "RENAME": { "resource.id": "id" }  }')

bqs = BQsearch('{ "REMOVE": ["resource.text.div"], "RENAME": { "resource.id": "id" }  }')

Columns

see df.columns

patientId
fullUrl
resource.resourceType
resource.id
resource.name
resource.telecom
resource.gender
...
...
...

Documentation

Give us a star ⭐️

If you find this project useful, give us a star. It helps others discover the project.

Name		Name	Last commit message	Last commit date
Latest commit History 327 Commits
.devcontainer		.devcontainer
.github		.github
.vscode		.vscode
docs		docs
examples		examples
notebooks		notebooks
notes		notes
src		src
tests		tests
.coveragerc		.coveragerc
.gitignore		.gitignore
.readthedocs.yml		.readthedocs.yml
AUTHORS.md		AUTHORS.md
CHANGELOG.md		CHANGELOG.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE.txt		LICENSE.txt
README.md		README.md
dev-requirements.in		dev-requirements.in
dev-requirements.txt		dev-requirements.txt
fhir-search.md		fhir-search.md
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
setup.cfg		setup.cfg
setup.py		setup.py
t_install.py		t_install.py
test.sh		test.sh
tox.ini		tox.ini
update.sh		update.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🔥 fhiry - FHIR to pandas dataframe for data analytics, AI and ML

UPDATE 1

UPDATE 2

Installation

Stable

Latest dev version

Usage

1. Import FHIR bundles (JSON) from folder to pandas dataframe

2. Import NDJSON from folder to pandas dataframe

3. Import FHIR Search results to pandas dataframe

Example: Import all conditions with a certain code from FHIR Server

4. Import Google BigQuery FHIR dataset

Filters

Columns

Documentation

Give us a star ⭐️

Contributors

About

Releases 9

Packages

Contributors 6

Languages

License

dermatologist/fhiry

Folders and files

Latest commit

History

Repository files navigation

🔥 fhiry - FHIR to pandas dataframe for data analytics, AI and ML

UPDATE 1

UPDATE 2

Installation

Stable

Latest dev version

Usage

1. Import FHIR bundles (JSON) from folder to pandas dataframe

2. Import NDJSON from folder to pandas dataframe

3. Import FHIR Search results to pandas dataframe

Example: Import all conditions with a certain code from FHIR Server

4. Import Google BigQuery FHIR dataset

Filters

Columns

Documentation

Give us a star ⭐️

Contributors

About

Topics

Resources

License

Stars

Watchers

Forks

Releases 9

Packages 0

Contributors 6

Languages

Packages