Skip to main content

Table 2 ATTEST reporting guideline checklist

From: An ontology-based documentation of data discovery and integration process in cancer outcomes research

 

Item No

Recommendation

Page

No

Objectives

 Background/rationale

1

Explain the scientific background and rationale for the study being reported in one or two sentences

 

 Prespecified hypotheses:

2

State prespecified hypotheses in on or two sentences

 

Study design: data sources selection & variables selection & data integration

 Data sources

3a

Describe the time coverage

 

3b

Describe the geographic coverage

 

3c

Describe the sample size

 

3d

Describe the demographic distribution

 

3e

Describe the cohort criteria

 

3f

Describe the sources of biases (e.g., sample bias)

 

3 g

Describe the data collection approach

 

 Dependent variables

4a

State the variable definition and variable type (e.g., primary outcome variable, secondary outcome variable)

 

4b

State the data source of dependent variable

 

4c

State the data type (e.g., numerical, categorical, date-time) of dependent variable

 

4d

State descriptive statistics (e.g., min, max. Median, value range, percentile) of dependent variable

 

4e

State the NIMHDa domains and levels of dependent variable

 

 Independent variables

5a

State the variable definition and variable type (e.g., primary predictor, secondary predictor)

 

5b

State the data source of dependent variable

 

5c

State the data type (e.g., numerical, categorical, date-time) of dependent variable

 

5b

State descriptive statistics (e.g., min, max. Median, value range, percentile) of independent variable

 

5e

State the NIMHD domains and levels of independent variable

 

 Controlled variables

6a

State the variables type (e.g., numerical, categorical) of controlled variable

 

6b

State the data source of controlled variable

 

6c

State descriptive statistics (e.g., min, max. Median, value range, percentile) of controlled variable

 

6d

State the NIMHD domains and levels of controlled variable

 

 Missing data

7a

For each data source, describe whether required or expected variable that is not present

 

7b

For each variable, describe method of how to handle missing data

 

7c

For each variable, describe the missing rate

 

Data integration

 Data processing

8a

Data extraction: for each variable, describe how to process the raw data source to extract the variable

 

8b

Data cleaning: for each variable, describe the method used to detect and correct (or remove) the incorrect records, missing values or outliers

 

 Integration strategy

9

Describe the integration strategy for each variable:1) Integrate with variables from same level, 2) Integrate with variables from different levels, and 3) Creation of additional computed elements

 

 Integration algorithm

10

For each variable, describe the algorithm used to integrate it with variables from other data sources

 

 Variable validation

11

For each variable, describe data validation rule for the selected variable. Rule should identify both the variable and the validation algorithms

 

 Integrated variable

12

Describe the variable after integration and basic descriptive statistics (e.g., min, max. Median, value range, percentile)

 
  1. Please document the items for each data source and variable separately
  2. aNational Institute on Minority Health and Health Disparities (NIMHD)