Cell Biochemistry Martinsried


Cytomics, a Practical Approach


G.Valet


1. Enrichment of Predictive Parameters by Repetitive Data Sieving

Data sieving (L1) represents an inductive approach for the exhaustive information extraction from large multi-parametric data spaces in view of predictive or diagnostic goals. Hypothesis driven data collection (fig.1a) is followed by data sieving (fig.1b) and interpretation (fig.1c) of the resulting predictive data patterns for individualized classification of unkown patients
The data pattern may serve in repetitive rounds as starting point for new multi-parametric experiments followed by data sieving and interpretation of the resulting new data pattern.
Large multi-parametric data spaces can be investigated in this way in a comparatively short time. Besides the predictive data pattern the vast majority of excluded non informative parameters are equally of interest because they are positively excluded from further hypothesis or concept development. This limits substantially the amount of thinking possibilities which may otherwise very quickly become an innovation limiting problem.


2. Practical Determination of Predictive Data Patterns

The ultimately desired high statistical significance of results for clinical applications is initially in conflict with the search for individually predictive parameter patterns through the collection of large amounts of multi-parametric information from flow cytometry of heterogeneous cellular suspensions, bead arrays or DNA and protein expression arrays. A two phase strategy (L2) is therefore appropriate (fig.2). The initial pilot phase study (fig.2 phase 1) is performed at an acceptable minimum of statistical stringency such as a significance level of P<0.05 or P<0.10. The majority of uninformative parameters can be eliminated at this stage by data sieving.


In the second phase (fig.2 phase 2), the remaining discriminatory parameters for disease course prediction are analysed in statistically large patient groups (L3). This provides exact numbers for the reliability of individualized disease course predictions and eliminates pseudo-informative parameters which have slipped for random statistical reasons into the group of informative parameters during the first phase. Informative parameters may likewise have been lost for random statistical reasons into the group of non-informative parameters during the initial phase. They may be recoverable during the later deductive hypothesis and concept forming phase from the molecular context of the final predictive parameter pattern.

L1. G.Valet Predictive Medicine by Cytomics: Challenges and Potential. JBRHA 16:164-167, (2002)
L2. G.Valet, A.Tárnok: Cytomics in predictive medicine. Cytometry (2003), in press
L3. G.Valet, R.Repp, H.Link, A.Ehninger, M.Gramatzki and SHG-AML study group: Pretherapeutic identification of high risk acute myeloid leukemia (AML) patients from immunophenotype, cytogenetic and clinical parameters. Cytometry (2003), in press
Ln. further readings


© 2024 G.Valet
1965-2006: Max-Planck-Institut für Biochemie, Am Klopferspitz 18a, D-82152 Martinsried, Germany
Last Update: Mar.10,2003