K-mer

From Wikipedia, the free encyclopedia

k-mers (or x-mers where x can be virtually any consonant of choice) usually refer to specific n-tuples or n-grams of nucleic acid or amino acid sequences that can be used to identify certain regions within biomolecules like DNA (e.g. for gene prediction) or proteins. Either k-mer strings as such can be used for finding regions of interest, or k-mer statistics giving discrete probability distributions of a number of possible k-mer combinations (or rather permutations with repetitions) are used. Specific short k-mers are called oligomers or "oligos" for short.

[edit] See also

[edit] Examples

  • Dimer = AGAGAGAGAGAGAG
  • Trimer = AAGAAGAAGAAG