The prediction of three dimensional (3D) protein structures from their amino acid sequence is a typical example for the complexity problems encountered already at the biomolecular level, still far away from the structural and functional complexity of entire cells. Exact predictions of protein structure from amino acid sequences are still difficult after more than 30 years of intensive research and despite the explosive development of hard- and software capacities in the meantime.
The predictive medicine by cytomics concept (L1-L3) was developed as a consequence of these considerations. It generated recent thoughts about the challenges of a human cytome project which have raised interest ( for details).
Essential progress in single cell molecular analysis is achieved by the continuous development of new image and flow cytometric instrumentation like confocal or laser scanning microscopy with multicolor or spectral imaging, multiphoton fluorescence excitation, fluorescence resonance energy transfer (FRET), fast fluorescence imaging in flow, optical cell stretching in flow, miniaturised flow cytometry within laboratories on a chip or laser microdissection of tissues. Biomolecular analysis techniques like bead arrays, single cell polymerase chain reaction (PCR), tyramide signal amplification or biomolecule labeling by quantum dots, magnetic nanobeads and aptamers open new horizons of sensitivity and molecular specificity at the single cell level.
The dimensionality of cell data can be substantial, in multilevel molecular cell profiling for example with repeated multicolor fluorescence staining protocols on numerous different cell populations. Viable cells may be initially stained for cell functions like intracellular pH, transmembrane potentials or Ca2+ levels, followed by fixation to remove the functional stains and restaining for specific extra- or intracellular constitutents like antigens, lipids or carbohydrate structures. After destaining, specific nucleic acids may be stained. Microscopic image capture and analysis systems using their relocation capacities will increasingly permit such staining sequences. Traditional visual and quantitative evaluations of gated two or three dimensional cytometric histograms collect only a very limited amount of the available information and one is never certain whether the really relevant information has been collected. Experience has also shown that it is not easy to provide quality controlled consensus strategies for multiparameter data evaluation.
There is also little pre-existing interpretation knowledge on very complex cytometric multiparameter data spaces. Essential informations may therefore be lost simply due to the lack of awareness.
It is for example empirically advisable in this effort to use self adapting and contiguous gates for the automated evaluation of flow cytometrically well known cell population entities like lympho-, mono- or granulocytes as defined by forward (FSC) or sideward (SSC) light scatter or by typical antigenic properties like the expression of CD45 antigen. The subsequent fluorescence gating can be equally automated by standard quadrant evaluation at fixed threshold levels in the gated two parameter histograms. It is of no primary importance at this stage that cell population boundaries are respected since relevant information will be picked up anywhere by the subsequent data sieving analysis for the most discriminatory data patterns, provided the information of more than 95% or all cells has been accessed during the information collection phase.
Data sieving as an alternative non-statistical knowledge extraction strategy does not require mathematical assumptions on value distributions, missing values do not have to be reconstituted, the analysis is suitable for parallel computation and inherently fast because only data thresholding is required for classification.
Reference groups of the same type, when established in different laboratories will be indistinguishable by classification against each other, provided they are composed of representative reference individuals or samples, and measured with longterm precision and specific reagents. Reference groups can be defined by consensus. In this way, the standardised and laboratory independent classification of relational data is possible and relational databases from different laboratories can be merged into larger standard databases. In case the classification reveals differences between correctly composed reference groups from various laboratories, this is an indication for methodological or ethnic differences.
A relational system for the objective molecular description of diseases and elementary cellular states like differentiation, maturation, division or malignancyat at the cellular level can then be established in this way. Different cell types will be in a standardised relation to each other in some kind of periodic system of cells.
The first level addresses the behaviour of cells in their life cycle including cell cycle control, biomolecule synthesis, import and export of molecules, energy and oxido-reductive balance, organelle functions to name only some important phenomena. It seems also important to address the very significant dispersion of many cell population parameters ranging between 1:10 until 1:10.000 as evidenced by the coefficients of variations of their value distribution curves. The multiparametric molecular heterogeneity of cells in their combinatorial multiplicity may be of high importance for the reliable adaptation of cells to new conditions and for the susceptibility or resistance to diseases or therapy.
At the second level, single cell preparations either as collected or after mechanic or enzymatic preparation are investigated by flow or image cytometry to determine the molecular status of normal or diseased cells as descriptors for health and disease. This discrimination does not necessarily depend on the representative original in-situ assembly of cells within tissues, provided conclusions can be derived from the molecular status of specific cells or cell combinations.
The third level concerns cells in assembled tissues. The molecular interrelation and proximity of cells within an intact cellular architecture can be studied under the most complex conditions and at the ultimate organisational level of cell systems in health and disease.
- The necessary knowledge is obtained from multilevel single cell molecular analysis close to in-vivo conditions in combination with exhaustive bioinformatic knowledge extraction (cytomics).
- Results are stored in a standardised relational knowledge system or framework for scientific hypothesis development as well as for directly medicine related purposes like predictive medicine by cytomics that is the prediction on the therapy dependent future disease course in individual patients, furthermore the possibilities of personalized therapy and the search for new pharmaceutical targets.
- The establishment of such a system, using the various single cell oriented molecular technologies in conjunction with specific biomolecule labelling in a specially focused human cytome project, represents a combined challenge to science, medicine and technological innovation.
|© 2004 G.Valet|