I am trying to export for each NSRR study the data dictionary.
I used the nsrr R package to get a nice list.
df = nsrr_datasets()
a = nsrr_dataset_files("shhs", path = "datasets")
I also see the .json files on github. e.g., https://github.com/nsrr/heartbeat-data-dictionary
The repo shows an option to export all variables into a single CSV file. (using spout).
I would rather avoid ruby and the spout tool.
I don't need the actual study data for 14 studies - only the data dictionaries. (which have no sensitive data)
How can obtain CSV dictionaries for all NSRR studies.
(either as 15 files or as one HUGE file with column study)
Also, has there been any effort to "tag" corresponding variables across the studies?
E.g., sex at birth being a very common variable (=data element)
The data dictionary CSV files are publicly available for each dataset. You'll find them in the Files area under the datasets folder, e.g.
The data dictionary files you see (domains, forms, variables) represent what you get from running the Spout export.
Tags as you describe do not exist, though we do have some data harmonization enhancements in mind for the future.
Thanks for your interest in the site!
Thank you. Your reply enabled us to create an aggregate DD (=data dictionary).
The goal of our funded project tries to do exactly such harmonization. Your data sharing platform is very modern and we are big fans!
If you would be interested, we would love to collaborate perhaps. Even if you would just take time to review our results on your variables/studies and give us our opinion about it. Our project is described here: https://github.com/lhncbc/CDE/tree/master/hiv/#aims
Our data sharing statement set of recomendations are here https://github.com/lhncbc/CDE/tree/master/CONSIDER and short list of CDEs here https://github.com/lhncbc/CDE/tree/master/IGNITE
I'm glad that helped. Thank you for the kind words. I will share your links with other members of the team and bring it up at our next group meeting.
We are familiar with CDEs and have toyed around a bit with linking NSRR data back to established CDEs. What you describe in your aims for HIV/AIDS sounds very similar to what we have contemplated for studies/trials of sleep.