CytoSpec - an APPLICATION FOR HYPERSPECTRAL IMAGING


 

File Pulldown Menu

Load
Save
Save Matlab
Import ASCII
Import Binary
Export
Delete
Clear
Plot
Customize
Batch Multiple Files
Exit

Imaging

Chemical Imaging
Frequency Maps
3D-Deconvolution
HCA Imaging
PCA Imaging
Synthon Imaging (ANN)
ANN Imaging

Preprocessing

Calculation of Derivative Spectra
Normalization (Vector, Offset)
Cut
Interpolate
Smooth
ABS <--> TR Conversion
Subtraction
Dispersion Correction
Quality test
Baseline Correction
Water Vapor Compensation
Noise Correction
Batch Preprocessing

Multivariate Statistics

Hierarchical Cluster Analysis
Principal Component Analysis
k-Means Clustering
Fuzzy C-Means Clustering

Tools

Display Options
Display Spectra
Set Display limits
Grid On/Off
Set Colors
Capture
Export Maps
Map Statistics
Display Large Maps
Define ROI
Display Colorbar
Swap Data Blocks
Rotate
Flip

File Information

Show History
Show Instrument Parameters
Show measurement Parameters
Show Additional Parameters
Edit Parameters

 

PULL DOWN MENU "MULTIVARIATE STATISTICS"

 

Multivariate Statistics Menu
 

MULTIVARIATE STATISTICS - HCA


 
    This function is used to prepare spectral data for hierarchical cluster analysis (HCA) imaging. The HCA imaging routine itself is extensively described in the chapter HCA imaging ('Image Manipulation' pull down menu).
 
 HCA imaging
 
    The window for data preparation is the same as for other multivariate imaging approaches, such as PCA, KMC or FCM imaging. To start HCA imaging you can either prepare the data from spectral multi file (described here), or load results of a HCA carried out before (see chapter HCA imaging ).
     
    Reformatting data sets for HCA imaging: The following data manipulations are performed to prepare data sets for HCA:
     
    • regions from the selected spectra are selected (data block of your choice),
    • spectra are scanned for negative quality test result, or unselected regions of interest, and eliminated.
    •  
    In the HCA window, select the number of spectral regions (min:1; max:4) and choose the data source block by selecting the appropriate radio button (e.g. of the derivative data block). Then, indicate the wavenumber limits of the spectral regions. These regions may overlap, that is they may appear two-, three-, or even fourfold in the reformatted spectra. Press 'goto HCA' or 'cancel' by pressing the appropriate button.
     
    Important: Spectra with a negative Quality Test result, or unselected Regions of Interest are automatically excluded from HCA imaging.

MULTIVARIATE STATISTICS - PCA


 
    This function is used to prepare spectral data for principal component analysis (PCA) imaging. The PCA imaging routine itself is extensively described in the chapter PCA imaging ('Image Manipulation' pull down menu).
 
PCA Imaging
 
    The window for data preparation is the same as for other multivariate imaging approaches, such as HCA, KMC or FCM imaging. To start PCA imaging you can either prepare the data from spectral multi file (described here), or load results of a PCA carried out before (see chapter PCA imaging ).
     
    Reformatting data sets for PCA imaging: The following data manipulations are performed to prepare data sets for PCA:
     
    • regions from the selected spectra are selected (data block of your choice),
    • spectra are scanned for negative quality test result, or unselected regions of interest, and eliminated.
    •  
    In the PCA window, select the number of spectral regions (min:1; max:4) and choose the data source block by selecting the appropriate radio button (e.g. of the derivative data block). Then, indicate the wavenumber limits of the spectral regions. These regions may overlap, that is they may appear two-, three-, or even fourfold in the reformatted spectra. Press 'goto PCA' or 'cancel' by pressing the appropriate button.
     
    Important: Spectra with a negative Quality Test result, or unselected Regions of Interest are automatically excluded from PCA imaging.

k-MEANS CLUSTER (KMC) IMAGING


 
    This function is used to prepare spectral data for k-means cluster (KMC) imaging.
 
KMC Imaging
 
    The window for data preparation is the same as for other multivariate imaging approaches, such as HCA, PCA, or FCM imaging. Before KMC imaging can be started you have to prepare the data from a spectral multi file.
     
    Reformatting data sets for KMC imaging: The following data manipulations are performed to prepare data sets for KMC:
     
    • regions from the selected spectra are selected (data block of your choice),
    • spectra are scanned for negative quality test result, or unselected regions of interest, and eliminated.
    •  
    In the KMC window, select the number of spectral regions (min:1; max:4) and choose the data source block by selecting the appropriate radio button (e.g. of the derivative data block). Then, indicate the wavenumber limits of the spectral regions. These regions may overlap, that is they may appear two-, three-, or even fourfold in the reformatted spectra. Press 'goto KMC' or 'cancel' by pressing the appropriate button.
     
    Important: Spectra with a negative Quality Test result, or unselected Regions of Interest are automatically excluded from KMC imaging.
     
    Once data preparation has been finished, the following KMC imaging dialog box will appear:
KMC Imaging
    How to start KMC imaging? Once data preparation has been finished, the KMC imaging dialog box appears. Indicate how many classes are assumed to be present in the data set ('number of cluster') and the number of training cycles (default: 100). Press the 'image' button if you wish to start the application or hit 'cancel' to abort the function.
     
    Button 'spectra': This option becomes available when the k-means clustering calculations are finished. When this button is pressed, a new window comes up that offers additional options such as the calculation of cluster mean spectra or sorting individual map spectra by its cluster membership.
     

     
    calculate and/or store average spectra
     

    save all spectra: all spectra used for KMC imaging are stored as double column ASCII data. Use the standard windows dialog box to set the path and create the ASCII data files. File names: corenamex_i: x-th spectrum of the i-th cluster.
     
    disp averages: average spectra of the selected data block are calculated and plotted in the same color like in the cluster map.
     
    save averages: average and standard deviation spectra of the selected data block are calculated. To store these spectra (double column ASCII) use the standard windows dialog box to set the path and chose a file name. Average spectra and standard deviation spectra of the i-th cluster are stored per default in separate files (corename_i - for average spectra and corename_std_i for the standard deviation spectra).
     
    Principles of k-means clustering: The algorithm of k-means clustering has been suggested by J.B. MacQueen in 1967 :
    J.B. MacQueen. In L.M. LeCam and J. Neymann (eds) Proceedings of Fifth Berkeley Symposium on Mathematical Statistics and Probability. 1967 281-297.
     
    Related web links (Wikipedia):
     
    In the CytoSpec implementation, MacQueens k-means cluster algorithm is used. k-means clustering is a non hierarchical clustering method, which obtains a "hard" (crisp) class membership for each spectrum, that is the class membership of an individual spectrum can be take only the values of zero or one. It uses an iterative algorithm to update randomly selected initial cluster centers, and to obtain the class membership for each spectrum, assuming well-defined boundaries between the clusters. MacQueens iterative algorithm of KMC can be described as follows: Spectra are illustrated as points in a p-dimensional space (p is the number of features of the spectra. In this space a number of k points is initially chosen, where each point represents a cluster to be made. Then, distance values between the points and all objects (spectra) are calculated. Objects are assigned to a cluster on the basis of a minimal distance value. Next, centroids of the clusters are calculated and distance values between the centroids and each of the objects are re-calculated. Then, if the closest centroid is not associated with the cluster to which the object currently belongs, the object will switch its cluster membership to the cluster with the closest centroid. The centroid's positions are re-calculated every time a component has changed the cluster membership. This continues until none of the objects has been re-assigned.
     
    Reference to the literature:
     

FUZZY C-MEANS CLUSTER (FCM) IMAGING


 
    This function is used to prepare spectral data for fuzzy C-means cluster (FCM) imaging.
 
FCM Imaging
 
    The window for data preparation is the same as for other multivariate imaging approaches, such as HCA, PCA, or KMC imaging. Before FCM imaging can be started you have to prepare the data from a spectral multi file.
     
    Reformatting data sets for FCM imaging: The following data manipulations are performed to prepare data sets for FCM:
     
    • regions from the selected spectra are selected (data block of your choice),
    • spectra are scanned for negative quality test result, or unselected regions of interest, and eliminated.
    •  
    In the FCM window, select the number of spectral regions (min:1; max:4) and choose the data source block by selecting the appropriate radio button (e.g. of the derivative data block). Then, indicate the wavenumber limits of the spectral regions. These regions may overlap, that is they may appear two-, three-, or even fourfold in the reformatted spectra. Press 'goto FCM' or 'cancel' by pressing the appropriate button.
     
    Important: Spectra with a negative Quality Test result, or unselected Regions of Interest are automatically excluded from FCM imaging.
FCM Imaging
 
    How to start FCM imaging? Once data preparation has been finished, the FCM imaging dialog box appears. Indicate how many classes are assumed to be present in the data set ('how many clusters to search for') and an exit criterion ('minimum amount of improvements', default: 0.0005). Press the 'C-means' button if you wish to start the application or hit 'cancel' to abort the function. When the calculations are finished select a cluster that should be used to reassemble the image ('display which cluster'). Note that images can be reassembled from one cluster, only!
     
    Button 'spectra': This option becomes available when the fuzzy C-means clustering calculations are finished. When this button is pressed, a new window comes up that offers additional options such as the calculation of cluster mean spectra or sorting individual map spectra by its cluster membership. By pressing any of the buttons (except button 'done') a new FCM imaging approach is initiated by using the settings of the FCM imaging dialog box.
     
calculate and/or store average spectra
 
    save all spectra: all spectra used for FCM imaging are stored as double column ASCII data. Use the standard windows dialog box to set the path and create the ASCII data files. File names: corenamex_i: x-th spectrum of the i-th cluster.
     
    disp averages: average spectra of the selected data block are calculated and plotted.
     
    save averages: average and standard deviation spectra of the selected data block are calculated. To store these spectra (double column ASCII) use the standard windows dialog box to set the path and chose a file name. Average spectra and standard deviation spectra of the i-th cluster are stored per default in separate files (corename_i - for average spectra and corename_std_i for the standard deviation spectra).
     
    Principles of fuzzy C-means clustering: Principles of FCM clustering are given in the following publication:
    J.C. Bezdek. Pattern Recognition with Fuzzy Objective Function Algorithms, 1981 New York. Plenum Press.
     
    Related web links:
     
    FCM clustering is a non hierarchical clustering method. This clustering technique partitions objects into groups (cluster) whose members show a certain degree of similarity. Unlike k-means clustering, the output of FCM clustering is a membership function, which defines the degree of membership of a given spectrum to the clusters. The values of the membership function can vary between one (highest degree of cluster membership) and zero (no class membership), where the sum of the C cluster membership values for one object equals one.
     
    Thus, this method departs from the classical two-valued (0 or 1) logic, and uses "soft" linguistic system variables and a continuous range of true values in the interval [0,1]. FCM imaging uses a fuzzy iterative algorithm to calculate the class membership grade for each spectrum. The iterations in FCM clustering are based on minimizing an objective function, which represents the distance from any given data point (spectrum) to the actual cluster center weighted by that data points membership grade.
     
    The advantage of the fuzzy C-means clustering over k-means clustering is that both outliers and data, which display properties of more than one class can be characterized by assigning nonzero class membership values to several clusters. In the similarity maps assembled by FCM clustering the membership values are encoded by the colormap, that is by the color intensities in the case of single color colormaps.
     
    Reference to the literature:
     

[ GENERAL | FILE | PREPROCESSING | MULTIVARIATE STATISTICS | IMAGE MANIPULATIONS | TOOLS | FILE INFO | HELP | GLOSSARY ]

Copyright (c) 2000-2008 CytoSpec. All rights reserved.