Thank you. Your reply enabled us to create an aggregate DD (=data dictionary).
Our funded project aims to do exactly this kind of harmonization. Your data sharing platform is very modern and we are big fans!
If you are interested, we would love to collaborate, even if you only have time to review our results on your variables/studies and give us your opinion on them. Our project is described here: https://github.com/lhncbc/CDE/tree/master/hiv/#aims
Our set of recommendations for data sharing statements is here: https://github.com/lhncbc/CDE/tree/master/CONSIDER and a short list of CDEs is here: https://github.com/lhncbc/CDE/tree/master/IGNITE
I am trying to export the data dictionary for each NSRR study.
I used the nsrr R package to get a list of the datasets and their files:
library(nsrr)
df <- nsrr_datasets()  # list of available NSRR datasets
a <- nsrr_dataset_files("shhs", path = "datasets")  # files under datasets/ for SHHS
I also see the .json files on GitHub, e.g., https://github.com/nsrr/heartbeat-data-dictionary
The repo describes an option to export all variables into a single CSV file (using spout).
I would rather avoid Ruby and the spout tool.
I don't need the actual study data for 14 studies, only the data dictionaries (which contain no sensitive data).
How can I obtain CSV dictionaries for all NSRR studies?
(either as 15 files or as one HUGE file with a study column)
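To make the "one HUGE file" option concrete, here is a minimal sketch of what I have in mind. It assumes the per-study dictionaries have already been exported as CSV files into a hypothetical local folder `dictionaries/`, one file per study (e.g. `dictionaries/shhs.csv`). The folder name, file naming, and the function name `combine_dictionaries` are my assumptions, not anything provided by NSRR or the nsrr package.

```r
# Sketch: stack per-study data-dictionary CSVs into one table with a `study` column.
# Assumes one CSV per study in `dir`, named <study>.csv (hypothetical layout).
combine_dictionaries <- function(dir = "dictionaries") {
  files <- list.files(dir, pattern = "\\.csv$", full.names = TRUE)
  tables <- lapply(files, function(f) {
    # Read everything as text so column types cannot clash between studies.
    dd <- read.csv(f, stringsAsFactors = FALSE, colClasses = "character")
    study <- tools::file_path_sans_ext(basename(f))
    dd$study <- rep(study, nrow(dd))
    dd
  })
  # Dictionaries may not share identical columns; fill missing ones with NA.
  all_cols <- unique(unlist(lapply(tables, names)))
  tables <- lapply(tables, function(dd) {
    for (col in setdiff(all_cols, names(dd))) {
      dd[[col]] <- rep(NA_character_, nrow(dd))
    }
    dd[all_cols]
  })
  do.call(rbind, tables)
}
```

Calling `write.csv(combine_dictionaries("dictionaries"), "all_dictionaries.csv", row.names = FALSE)` would then give the single file with a study column.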
Also, has there been any effort to "tag" corresponding variables across the studies?
E.g., sex at birth is a very common variable (=data element).
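As a crude starting point for such tagging, the sketch below lists variable names that occur in more than one study after simple normalization (lower-casing and dropping punctuation). It assumes a combined dictionary data frame with a `study` column and a variable-name column called `id`; both column names and the function name `find_shared_variables` are hypothetical. Exact name matching only surfaces candidates: it would miss the same concept stored as `sex` in one study and `gender` in another, so manual review is still needed.

```r
# Sketch: find variable names that appear in more than one study.
# `dd` is a combined dictionary with columns `study` and `name_col` (assumed).
find_shared_variables <- function(dd, name_col = "id") {
  # Normalize names: lower-case and strip anything that is not a letter or digit.
  key <- tolower(gsub("[^A-Za-z0-9]", "", dd[[name_col]]))
  # For each normalized name, collect the distinct studies that use it.
  studies_by_key <- lapply(split(dd$study, key), function(s) sort(unique(s)))
  shared <- studies_by_key[lengths(studies_by_key) > 1]
  data.frame(
    variable = names(shared),
    n_studies = lengths(shared),
    studies = vapply(shared, paste, character(1), collapse = ", "),
    row.names = NULL,
    stringsAsFactors = FALSE
  )
}
```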