Metric | synthea | ETL-Synthea
---|---|---
Mentions | 8 | 1
Stars | 2,002 | 92
Growth | 1.4% | -
Activity | 8.2 | 6.3
Last commit | 5 days ago | 12 days ago
Language | Java | R
License | Apache License 2.0 | -
Stars - the number of stars that a project has on GitHub. Growth - month-over-month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
synthea
- Survey on Synthea Use to Shape the Future of Open Source Medical Records
- Synthea: Open-Source Synthetic Patient Generation
- Simulated Hospital
As someone working in this arena, I offer an alternative perspective for your consideration: healthcare was an early adopter of information technology and as a result many of its most core technologies come from a nearly unrecognizable time in computing. These systems are “outdated” as a result of success.
The current prevalence of these venerable technologies may be in part due to regulation, but more often has to do with their success.
HL7v2 is just token-delimited ASCII, not unlike the similarly primitive but ubiquitous CSV. The fields within it are defined by standards documents, and once you use it a little, you can read enough to get the gist of most messages. As you might guess, modules in your language of choice are used to parse and compose HL7v2, so its detail isn’t that important.
Something I’d like to point out about Google Hospital is that under the hood it uses MITRE’s Synthea to generate synthetic patient data.
https://www.healthcareittoday.com/2017/09/13/open-source-too...
https://synthetichealth.github.io/synthea/
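To make the "token-delimited ASCII" point in the comment above concrete, here is a minimal Python sketch that splits an HL7v2 message by hand. The sample message and the function name `parse_hl7v2` are invented for illustration. The sketch ignores escape sequences, repetitions, and subcomponents, so it is not a substitute for a real HL7 library.

```python
# Hypothetical sample message; every value here is made up for illustration.
SAMPLE = "\r".join([
    "MSH|^~\\&|SENDAPP|SENDFAC|RECVAPP|RECVFAC|20240101120000||ADT^A01|MSG0001|P|2.3",
    "PID|1||12345^^^HOSP^MR||DOE^JANE||19800101|F",
    "PV1|1|I|WARD1^101^A",
])


def parse_hl7v2(message):
    """Split an HL7v2 message into segments and fields.

    Returns (segments, component_sep), where segments maps a segment ID
    such as "PID" to a list of field lists, one per occurrence.
    Simplification: escapes, repetitions and subcomponents are not handled.
    """
    if not message.startswith("MSH") or len(message) < 8:
        raise ValueError("an HL7v2 message must start with an MSH segment")

    # The delimiters are declared by the message itself, right after "MSH".
    field_sep = message[3]       # usually "|"
    component_sep = message[4]   # usually "^"

    segments = {}
    # Segments are terminated by carriage returns; tolerate newlines too.
    for raw in message.replace("\n", "\r").split("\r"):
        if not raw:
            continue
        fields = raw.split(field_sep)
        if fields[0] == "MSH":
            # MSH-1 is the field separator itself, so re-insert it to keep
            # list indexes aligned with HL7 field numbers (fields[9] == MSH-9).
            fields.insert(1, field_sep)
        segments.setdefault(fields[0], []).append(fields)
    return segments, component_sep


segments, comp = parse_hl7v2(SAMPLE)
msh = segments["MSH"][0]
pid = segments["PID"][0]

message_type = msh[9].split(comp)      # MSH-9: message type and trigger event
family, given = pid[5].split(comp)[:2]  # PID-5: patient name components
print(message_type, family, given)
```

Because the list indexes line up with HL7 field numbers, `pid[5]` is PID-5 (patient name) and `msh[9]` is MSH-9 (message type), which mirrors how the standards documents refer to fields.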
- Looking for Mock Hospital Dataset. Financial, Human Resource, Departments, In/Out Patients Data.
- Will pay for realistic large dataset of HL7 messages
Have you tried Synthea? https://github.com/synthetichealth/synthea
- Healthcare datasets with multiple continuous variables
- I'm being threatened to be sued by my college for copyright infringement
ETL-Synthea
- Synthea: Open-Source Synthetic Patient Generation
I recommend the OMOP schema as a go-to standard for EHR data. There's an ETL pipeline for converting Synthea output into OMOP.
https://github.com/OHDSI/ETL-Synthea
What are some alternatives?
simhospital
airbyte - The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
fhir - Official source for the HL7 FHIR Specification
Benthos - Fancy stream processing made operationally mundane
FHIR-Converter - Conversion utility to translate legacy data formats into FHIR
log-synth - Generates more or less realistic log data for testing simple aggregation queries.
clojure-hl7-messaging-2-parser - HL7 v2.x Messaging Parser
Mage - 🧙 The modern replacement for Airflow. Mage is an open-source data pipeline tool for transforming and integrating data. https://github.com/mage-ai/mage-ai
data-analysis
doris - Apache Doris is an easy-to-use, high performance and unified analytics database.
JSL - The JSL is an open-source discrete event simulation library written in Java