Arbeitspapier

Detecting high-order interactions of single nucleotide polymorphisms using genetic programming

Motivation: Not individual single nucleotide polymorphisms (SNPs), but high-order interactions of SNPs are assumed to be responsible for complex diseases such as cancer. Therefore, one of the major goals of genetic association studies concerned with such genotype data is the identification of these high-order interactions. This search is additionally impeded by the fact that these interactions often are only explanatory for a relatively small subgroup of patients. Most of the feature selection methods proposed in the literature, unfortunately, fail at this task, since they can either only identify individual variables or interactions of a low order, or try to find rules that are explanatory for a high percentage of the observations. In this paper, we present a procedure based on genetic programming and multi-valued logic that enables the identification of high-order interactions of categorical variables such as SNPs. This method called GPAS (Genetic Programming for Association Studies) cannot only be used for feature selection, but can also be employed for discrimination. Results: In an application to the genotype data from the GENICA study, an association study concerned with sporadic breast cancer, GPAS is able to identify high-order interactions of SNPs leading to a considerably increased breast cancer risk for different subsets of patients that are not found by other feature selection methods. As an application to a subset of the HapMap data shows, GPAS is not restricted to association studies comprising several ten SNPs, but can also be employed to analyze whole-genome data.

Sprache
Englisch

Erschienen in
Series: Technical Report ; No. 2007,24

Ereignis
Geistige Schöpfung
(wer)
Nunkesser, Robin
Bernholt, Thorsten
Schwender, Holger
Ickstadt, Katja
Wegener, Ing
Ereignis
Veröffentlichung
(wer)
Universität Dortmund, Sonderforschungsbereich 475 - Komplexitätsreduktion in Multivariaten Datenstrukturen
(wo)
Dortmund
(wann)
2007

Handle
Letzte Aktualisierung
20.09.2024, 08:24 MESZ

Datenpartner

Dieses Objekt wird bereitgestellt von:
ZBW - Deutsche Zentralbibliothek für Wirtschaftswissenschaften - Leibniz-Informationszentrum Wirtschaft. Bei Fragen zum Objekt wenden Sie sich bitte an den Datenpartner.

Objekttyp

  • Arbeitspapier

Beteiligte

  • Nunkesser, Robin
  • Bernholt, Thorsten
  • Schwender, Holger
  • Ickstadt, Katja
  • Wegener, Ing
  • Universität Dortmund, Sonderforschungsbereich 475 - Komplexitätsreduktion in Multivariaten Datenstrukturen

Entstanden

  • 2007

Ähnliche Objekte (12)