Figure S1: Batch correction of datasets. (A-B) Batch correction results of GSE76019 and GSE76021 datasets. (C-D) Internal batch correction results of GSE19750 dataset.

A

· GSE19750-1 · GSE19750-2

B

· GSE19750-1 · GSE19750-2

100

80

50

40.

Dim.2

Dim.2

0

0

-50

-40

-50

0

50

100

-50

0

50

100

Dim.1

Dim. 1

C

· GSE76019 · GSE76021

75

D

· GSE76019 GSE76021

50

50

25

Dim.2

25

Dim.2

0

0

-25

-25

-50

-50

-50

0

50

Dim. 1

100

-40

0

Dim. 1

40

80

CTNNB1
0.1280.184
MUC160.0770.237
TP530.0260.289
TTN0.0510.158
CNTNAP50.0000.158
HMCN10.0260.132
PKHD10.0510.105
APOB0.0260.105
ASXL30.0260.105
KMT2B0.0260.105
MEN10.0260.105
NF10.0260.105
PRKAR1A0.0510.079
SVEP10.0510.079
TUT70.0260.105
CMYA50.0260.079
CSMD10.0260.079
DAXX0.0260.079
DST0.0000.105
FAT40.0510.053

B

**

Cohort

*

12q13.2-Amp

Low

5p13.1-Amp

High

5q35.3-Amp

5p13.2-Amp

frequency

5p14.1-Amp

22q12.1-Del

0.75

0.2

22q11.21-Del

0.50

0.1

17p13.1-Del

**

0.25

13q14.2-Del

0.0

11p15.5-Del

3q13.31-Del

17q21.31-Del


11q24.1-Del

11q14.1-Del

0.7690.763
0.7690.763
0.7690.763
0.7950.711
0.7440.763
0.7690.658
0.7950.605
0.7950.605
0.7690.605
0.7440.632
0.5130.632
0.4620.500
0.3850.474
0.2560.605
0.2820.526
0.2560.368
0.2310.395
0.1030.526
0.2310.368
0.2560.342

Cohort

Low

High

5q31.2-Amp

frequency

1.00

1p36.23-Del

*

0.00

12q14.1-Amp

12q14.3-Amp 12q15-Amp 5p15.33-Amp

Figure S2: Differences in mutation frequency of genes and chromosomal fragment variation frequency between high and low CDC20 expression subgroups. (A) Mutation frequency of top 20 frequently mutated genes between high and low CDC20 expression groups. (B) The frequency of top 10 chromosomal amplifications and top 10 chromosomal deletions between CDC20 high-expression and low-expression groups. * P < 0.05, ** P< 0.01, *** P < 0.001, **** P < 0.0001.