Chapter 3 Combined cohort

We made a discovery cohort by combining 41, 12 plasma collected from first cohort at a follow-up visit, and 17 controls sample from the second cohort. Thess plasma samples were all run in the same assay group. Using this cohort, we explored the complement components that are best in distinguishing FSHD from controls.

Summary

The code chunks here reproduced Fig. 4a-c and Suppl. Fig. 3a-d.

First of all, we tidy the dataset.

3.1 Explore components best distinguish FSHD from controls

3.1.1 \(t\)-test

The \(t\)-test result is similar to that of cohort2 – C4b and C3 are significantly elevated (p-value < 0.05). Factor D and Bb are mildly elevated.

\(t\)-test barplot (Suppl. Fig. 3a)

Combind cohort per-compoment t-tests: FSHD vs. controls. * negative -10Log10(p-value) indicates negative t-statistics.

Figure 3.1: Combind cohort per-compoment t-tests: FSHD vs. controls. * negative -10Log10(p-value) indicates negative t-statistics.

3.1.2 Correlation tests

We performed Pearson Correlatin test among 15 components on FSHDs in the combined cohort and observed seven highly correlated compoments (Pearson > 0.45) and FSHD-associated (based on t-tests):

  • C3, FactorH and Factor D (alternative pathway focused). Dropped correlated Factor I and B because they are not relatively associated with FSHD levels
  • C4/C4b/C5/sC5b-9 (classical pathway focused)

correlation code reference: https://paulvanderlaken.com/2018/09/10/simpler-correlation-analysis-in-r-using-tidyverse-principles/

Suppl. Fig. 3b

The correlation matrix among the complement components.

Figure 3.2: The correlation matrix among the complement components.

Network focused on the seven selected components: Suppl. Fig. 3c.

Correlation network among the seven highly correlated complement components. The scaled color represents the Pearson correlation.

Figure 3.3: Correlation network among the seven highly correlated complement components. The scaled color represents the Pearson correlation.

3.2 Panel of seven - Classification and composite z-score

The correlation and t-tests together revealed seven complement components (Factor H, Factor D, C3, C4, C4b, C5, sC5b-9) that are showing stronger association with FHSD than other components. Here we used these seven complement levels to (1) classify the FSHD samples by hierarchical cluster analysis and (2) construct per-sample composite z-scores.

The code chunks below made Suppl. Fig. 3d-g (boxplots, heatmap, waterfall, composite z-score boxplot).

3.2.2 Heatmap/clustering

Using hierarchical cluster and heatmap visualization, we observed a subset of 12 FSHD samples have elevated levels in the classical pathway and terminal components (C4, C4b, C5, and sC5b-9), and 7 in alternative pathway (C3, Factor H/D). Together these 19 FSHDs are labelled as “elevated” FSHD. Based on the panel of 7 factors, we made heatmap showing the clusters and PCA depicting the simple sample distance among all the samples.

Note Control C20 (ID=“3170”) shows the characteristics of elevated FSHDs, and cluster with the FHSDs.

Rearrange samples by clusters Rearrange the samples based on the FSHD elevation in classical/terminal and alternative pathways.

Heatmap of panel 7 (Suppl Fig 3e. Arrange the samples by clusters of elevated FSHDs in different pathways.

Figure 3.5: Heatmap of panel 7 (Suppl Fig 3e. Arrange the samples by clusters of elevated FSHDs in different pathways.

3.2.3 Composite z-score

The per-sample composite z-score is the sum of z-scores of selected complement components: \(S_i = \sum_{i=1}^{m} Z_{i,j}\), where \(j\) denotes the complement, m the number of selected complements, and \(Z_{i,j}\) the normalized value (z-score). It gives a 1D projection of how each sample deviates up or down from each other. Using this score, we can observe difference between FSHD and controls.

Composite z-score usign seven selected complement components.

Figure 3.6: Composite z-score usign seven selected complement components.

t-test between FSHD and controls

Composite zscore by sample groups (Suppl. Fig. 3f).

Figure 3.7: Composite zscore by sample groups (Suppl. Fig. 3f).

Visualization by PCA ??

3.3 Panel of four - Classification and composite z-scores

We selected four best componets out of the panel of seven that are most associated with FSHD (C3, C4b, Facter D and sC5b-9). We performed similar analysis as above and made boxplots of panel of four, heatmap/cluster, waterfall and boxplots of composite z-score (4).

The code chunks below made Fig. 4a-d.

3.3.3 Composite z-score

Composite z-score usign seven selected complement components.

Figure 3.10: Composite z-score usign seven selected complement components.

t-test between FSHD and controls

Boxplot of composite z-score by sample groups

Figure 3.11: Boxplot of composite z-score by sample groups

3.4 Clinical severity score and D4Z4 repeats

We determined whether Clinical Severity Score (CSS) or D4Z4 repeat number are correlated with complement levels by using Pearson correlation test. Code chunks below yield the results.