Class ExtendedIUPACProtein
source code
Alphabet --+
|
SingleLetterAlphabet --+
|
ProteinAlphabet --+
|
ExtendedIUPACProtein
- Known Subclasses:
-
Extended uppercase IUPAC protein single letter alphabet including X
etc.
In addition to the standard 20 single letter protein codes, this
includes:
B = "Asx"; Aspartic acid (R) or Asparagine (N) X =
"Xxx"; Unknown or 'other' amino acid Z = "Glx";
Glutamic acid (E) or Glutamine (Q) J = "Xle"; Leucine (L) or
Isoleucine (I), used in mass-spec (NMR) U = "Sec";
Selenocysteine O = "Pyl"; Pyrrolysine
This alphabet is not intended to be used with X for Selenocysteine (an
ad-hoc standard prior to the IUPAC adoption of U instead).