data analysis

Sunday, 4. September 2005

SOM + genes

Interpreting patterns of gene expression with self-organizing maps: methods and application to hematopoietic differentiation.
-> [pdf]
som yeast
SOMs take a fundamentally different approach. They attempt
to provide an ‘‘executive summary’’ of a massive data set
by extracting the n most prominent patterns (where n is the
number of nodes in the geometry) and arranging them so that
similar patterns occur as neighbors in the SOM. As with all
exploratory data analysis tools, the use of SOMs involves
inspection of the data to extract insights.
SOMs are widely used in data mining because they have
many desirable mathematical properties, including scaling well
to large data sets. In our own hands, we have indeed found
them valuable in analyses involving hundreds of experiments.


A SOM calculated on the twoday-survey data, with a 4x4 grid, and pre-ordered columns:
som data01
Such a chart wont scale well with a high number of variables.

a visualized SOM of the iris data

som-iris
-> Plotting Eight Direction Arranged Maps or Self-Organizing Maps
-> SOM package in R
-> EDAM in R

Same plot, but SOM with a finer grid:
som-iris-2
(Obviously there are not enough data sets to do that)

EDAM-Plot of Iris:
edam-iris
Raabe, N. (2003). Vergleich von Kohonen Self-Organizing-Maps mit einem nichtsimultanen Klassifikations- und Visualisierungsverfahren. Diploma Thesis, Department of Statistics, University of Dortmund.
-> Raabe + Diplomarbeit

Diplomarbeit vergleicht SOM + EDAM hinsichtlich Visualisierungs- (=Topologie-Erhaltungs-)-Güte und Klassifizierungsgüte. Conclusio: SOM erhält räumliche Distanzen (=Toplogie) besser, EDAM klassifiziert besser (Unterschied sei aber bei hochdimensionalen Daten möglicherweise weniger stark).

Wednesday, 17. August 2005

Using the evolutionary algorithm on the survey data

Computation of 200 steps of the evolutionary algorithm for our survey data (500 cases x 26 variables) took more than 5 minutes in R.
bertin-survey
bertin-survey1

Search

 

currently reading



William N. Venables, Brian D. Ripley
Modern Applied Statistics with S

Recent Updates

John
Amoxicillin And Clavulanate 250mg With No Prescription...
Smithe526 (guest) - 13. May, 21:03
Hi, I am doing a project...
Hi, I am doing a project for my school using this doc2mat...
Sangeetha (guest) - 2. Mar, 10:35
mountain vizualization...
By the way, here they explain how the mountain visualization...
Tatiana (guest) - 10. Mar, 02:12
hi, I wonder how did...
hi, I wonder how did you make scrin shorts of the mountin...
Tatiana (guest) - 10. Mar, 02:10
SOM + genes
Interpreting patterns of gene expression with self-organizing...
michi - 4. Sep, 23:03

data analysis
diary
linkdump
literature
software
Profil
Logout
Subscribe Weblog