GvHD Use Case Results
2.2.2 GvHD Description
4.2.2 GvHD Discussion
To evaluate algorithms for selecting target cell populations for FlowDx, we need a quantitative way to measure how close we are to human gating techniques. Additionally we want to evaluate how close one expert is to a group of experts. Therefore, the Match Ratio was born. See pdf for more details about the Match Ratio. The first step is to have a group of experts gate a dataset and create a consensus gate.
GvHD dataset provided by Ryan Brinkman of Terry Fox Laboratory/British Columbia Cancer Agency. See link for the gating instructions.
Goal was to adjust pre-made gates in the workspace to select for the CD3/CD4/CD8b+ for all 12 timepoints in the GvHD run 1.
Looking at sample GvHD1.055, there is a wide variability in cell count of the target population ranging from 0 cells to 627 cells.

Figure 1. Overlay of CD3+/CD4+/CD8b+ T-cells gated by several analysts. There is a large
difference between the number of cells selected by contributing lab, and the external analysts.
There is a large variability (134 ± 171) between the number of cells selected by Brinkman's lab, which contributed the data, and the rest of the gaters. The discrepancy highlights the need to have a more descriptive and specific procedure for gating, but such a procedure was not provided at the time of this exercise.
The match ratio was calculated on the individual cells in the CD3/CD4/CD8b+ population for each expert and compared to the consensus. A Match Ratio of 1.0 matches the consensus perfectly.
Everyone was very close in their Match Ratio, even when the number of cells in each expert's final population had a high variability. This is due to the experts agreeing that the majority of the cells were not in the final population.
Table 1. Statistics on CD3/CD4/CD8b+ population in Sample GvHD1.055
| GvHD1.055 |
Match Ratio | Count | Freq of total | MFI CD4 | MFI CD8b |
| Aaron_1 | 0.99515 | 131 | 0.0095 | 41.27 | 57.14 |
| Aaron_2 | 0.99989 | 0 | 0.0000 | - - | - - |
| Aaron_3 | 0.99575 | 307 | 0.0223 | 18.24 | 48.59 |
| Brinkman* | 0.97865 | 627 | 0.0434 | 16.5 | 26.91 |
| Claudio | 0.99986 | 127 | 0.0092 | 13.05 | 55.27 |
| Helene | 0.99898 | 193 | 0.0140 | 14.75 | 63.52 |
| John | 0.99991 | 5 | 0.0004 | 11.17 | 47.73 |
| Junichi | 0.99984 | 143 | 0.0104 | 12.62 | 59.48 |
| Maciej | 0.99990 | 5 | 0.0004 | 12.68 | 41.77 |
| M | 0.99993 | 1 | 0.0001 | 14.99 | 70.41 |
| M2 | 0.99993 | 1 | 0.0001 | 14.99 | 70.41 |
| Nick | 0.99974 | 122 | 0.0089 | 11.93 | 68.61 |
| Qianjun | 0.99723 | 296 | 0.0215 | 16.42 | 45.63 |
| Sach | 0.99982 | 51 | 0.0037 | 22.08 | 69.98 |
| Sach2 | 0.99989 | 4 | 0.0003 | 55.95 | 139.13 |
| Starting.wsp | 0.99723 | 296 | 0.0215 | 16.42 | 45.63 |
| mean | 0.99761 | 142.5 | 0.0104 | 19.54 | 60.68 |
| st dev | 0.00531 | 165.5 | 0.0120 | 12.43 | 25.08 |
| CV | 0.00532 | 1.16 | 1.16 | 0.64 | 0.41 |
We learned that:
1) We need a more-specific SOP for analysis. (Ryan Brinkman's lab and M used the same gate definitions for all samples, but most others adjusted the gates for each sample.) Which is the better scientific method -- one gate definition for all samples, or adjust the gates for each sample?
2) If there is high variability within the consensus, it becomes easier for the match ratio to be very close to the maximum of 1.0.
3) We need feedback as to whether the inter-gater variability of the manual gating is within acceptable limits from the contributors.
Additional results show similar situations for the other samples in this run.