“Multi-marker
tagging single nucleotide polymorphism selection using estimation of
distribution algorithms”
by
R. Santana,
A. Mendiburu,
N. Zaitlen, E. Eskin, and
J. A. Lozano.
*Artificial Intelligence in Medicine*, vol. 50, 2010, pp. 193-201.

Objectives This paper presents an optimization algorithm for the automatic selection of a minimal subset of tagging single nucleotide polymorphisms (SNPs). Methods and materials The determination of the set of minimal tagging SNPs is approached as an optimization problem in which each tagged SNP can be covered by a single tagging SNP or by a pair of tagging SNPs. The problem is solved using an estimation of distribution algorithm (EDA) which takes advantage of the underlying topological structure defined by the SNP correlations to model the problem interactions. The EDA stochastically searches the constrained space of feasible solutions. It is evaluated across HapMap reference panel data sets. Results The EDA was compared with a SAT solver, able to find the single-marker minimal tagging sets, and with the Tagger program. The percentage of reduction ranged from 10 to 43 percentage in the number of tagging SNPs of the minimal multi-marker tagging set found by the EDA with respect to the other algorithms. Conclusions The introduced algorithm is effective for the identification of minimal multi-marker SNP sets, which considerably reduce the dimension of the tagging SNP set in comparison with single-marker sets. Other variants of the SNP problem can be treated following the same approach.

**BibTeX entry:**

@article{Santana_et_al:2010d, author = {R. Santana and A. Mendiburu and N. Zaitlen and E. Eskin and J. A. Lozano}, title = {Multi-marker tagging single nucleotide polymorphism selection using estimation of distribution algorithms}, journal = {Artificial Intelligence in Medicine}, volume = {50}, pages = {193-201}, year = {2010}, url = {http://www.sciencedirect.com/science/article/pii/S0933365710000758} }

(This webpage was created with bibtex2web.)

Back to Roberto Santana publications.