A handful of EDFs appear to have values outside the range of the digital min/max specified in their headers, which I believe is not permitted under EDF specs [Q4 in http://www.edfplus.info/specs/edffaq.html ] and could cause an issue for some EDF readers.
Affected files: (study_ID) (this is for the EEG and ECG channels)
chat_300420, chat_300298, chat_300539, chat_300640
mros_aa2618, mros_aa2649, mros_aa3601, mros_aa3624, mros_aa3780, mros_aa5006
shhs_202833, shhs_202947, shhs_203716, shhs_204581
A minor comment/request: would it be possible to harmonize channel names (and units) within a study, across all EDFs?
For example, in the SHHS study, the first EEG channel is always "EEG". The second one is either "EEG(sec)", "EEG2", or "EEG(SEC)". As channels are not necessarily in the same order across EDFs, it is useful to extract channels by their labels. To facilitate automated processing across 1000s of EDFs, ideally labels would be similar (within a study, at least). A similar principle applies to the units -- e.g. CHAT C3 & C4 channels are sometimes uV, sometimes mV for different EDFs.
Beyond these minor issues, I wonder whether it may be desirable to post harmonized EDFs that also have some basic level of artifact correction or flagging of clearly aberrant epochs, etc? i.e. to perform centrally some of the core steps that most subsequent users of the data would otherwise presumably be performing themselves. On the other hand, I can see the value in retaining exact "archival" versions of datasets, warts and all, for other reasons.
All 2651 files in shhs/files/annotations-staging/shhs2/ appear to be identical. I imagine there was a problem uploading those files to the NSRR perhaps?
(All are 6.36kb with exactly 1086 epochs, i.e. that clearly do not correspond to the matched SHHS2 EDFs.)
Can this be resolved soon or is it possible to access these files in any other way? Many thanks!
PS. the 'nsrr' download tool is very nice.