| Item No | Recommendation | Page No |
---|---|---|---|
Objectives | |||
 Background/rationale | 1 | Explain the scientific background and rationale for the study being reported in one or two sentences |  |
 Prespecified hypotheses: | 2 | State prespecified hypotheses in on or two sentences |  |
Study design: data sources selection & variables selection & data integration | |||
 Data sources | 3a | Describe the time coverage |  |
3b | Describe the geographic coverage | Â | |
3c | Describe the sample size | Â | |
3d | Describe the demographic distribution | Â | |
3e | Describe the cohort criteria | Â | |
3f | Describe the sources of biases (e.g., sample bias) | Â | |
3 g | Describe the data collection approach |  | |
 Dependent variables | 4a | State the variable definition and variable type (e.g., primary outcome variable, secondary outcome variable) |  |
4b | State the data source of dependent variable | Â | |
4c | State the data type (e.g., numerical, categorical, date-time) of dependent variable | Â | |
4d | State descriptive statistics (e.g., min, max. Median, value range, percentile) of dependent variable | Â | |
4e | State the NIMHDa domains and levels of dependent variable | Â | |
 Independent variables | 5a | State the variable definition and variable type (e.g., primary predictor, secondary predictor) |  |
5b | State the data source of dependent variable | Â | |
5c | State the data type (e.g., numerical, categorical, date-time) of dependent variable | Â | |
5b | State descriptive statistics (e.g., min, max. Median, value range, percentile) of independent variable | Â | |
5e | State the NIMHD domains and levels of independent variable | Â | |
 Controlled variables | 6a | State the variables type (e.g., numerical, categorical) of controlled variable |  |
6b | State the data source of controlled variable | Â | |
6c | State descriptive statistics (e.g., min, max. Median, value range, percentile) of controlled variable | Â | |
6d | State the NIMHD domains and levels of controlled variable | Â | |
 Missing data | 7a | For each data source, describe whether required or expected variable that is not present |  |
7b | For each variable, describe method of how to handle missing data | Â | |
7c | For each variable, describe the missing rate | Â | |
Data integration | |||
 Data processing | 8a | Data extraction: for each variable, describe how to process the raw data source to extract the variable |  |
8b | Data cleaning: for each variable, describe the method used to detect and correct (or remove) the incorrect records, missing values or outliers | Â | |
 Integration strategy | 9 | Describe the integration strategy for each variable:1) Integrate with variables from same level, 2) Integrate with variables from different levels, and 3) Creation of additional computed elements |  |
 Integration algorithm | 10 | For each variable, describe the algorithm used to integrate it with variables from other data sources |  |
 Variable validation | 11 | For each variable, describe data validation rule for the selected variable. Rule should identify both the variable and the validation algorithms |  |
 Integrated variable | 12 | Describe the variable after integration and basic descriptive statistics (e.g., min, max. Median, value range, percentile) |  |