Skip to main content

Table 1 Description of the dataset

From: MODILM: towards better complex diseases classification using a novel multi-omics data integration learning model

Dataset

Categories

Number of features

Number of features for training

miRNA

mRNA

meth

miRNA

mRNA

meth

ROSMAP

NC:169, AD:182

309

55889

23788

200

200

200

LGG-2

Low:210, Hight:311

2158

20531

485578

557

2000

2000

BRCA

Normal-like:115, Basal-like:131, HER2-enriched:46, Luminal A:436, Luminal B:147

2239

20531

485578

503

1000

1000

SKCM

Keratin:98, Immune:163, MITF-low:59

2221

20531

485578

235

2000

2000

LGG-4

I:146, II:138, III:324, IV:120

2158

20531

485578

557

2000

2000

LUSC

Basal:10, Classical:16, Secretory:18,

Primitive:8

2214

20531

485578

296

2000

2000

  1. The second column is the type of samples contained in the dataset. Also, miRNA in the table refers to miRNA expression data. mRNA refers to mRNA expression data. meth refers to DNA methylation data