; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MELO3C022431 (gene) of Melon (DHL92) v4 genome

Gene IDMELO3C022431
OrganismCucumis melo DHL92 (Melon (DHL92) v4)
DescriptionAgglutinin domain-containing protein
Genome locationchr11:31183034..31184002
RNA-Seq ExpressionMELO3C022431
SyntenyMELO3C022431
Gene Ontology termsNA
InterPro domainsIPR008998 - Agglutinin domain
IPR036242 - Agglutinin domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039924.1 uncharacterized protein E6C27_scaffold122G002040 [Cucumis melo var. makuwa]1.8e-10969.11Show/hide
Query:  MGKAGTDILGGAVKGAGKVVETVGNAAEKAPVVGGIGTVVEGTGKAIENVGKATENLGEKVFENKEKIKPKKDLKDTTLDQINE--DYYGDDFHFDKGDS
        +GKAGTDILGGAVKGAGK+VETVG+  EKAPVVGGIGTVVEGTGKAIENVG+ATE+ GE+VF+ +E+   + +  D  + Q     D   +D+  D  DS
Subjt:  MGKAGTDILGGAVKGAGKVVETVGNAAEKAPVVGGIGTVVEGTGKAIENVGKATENLGEKVFENKEKIKPKKDLKDTTLDQINE--DYYGDDFHFDKGDS

Query:  KESEKAPEDLLKMLNAELARQRGEDQDEADDIDEAEKELMKSDINDSNYEEEEEDEESAKVIPKNFSLKCVRNNKYLRYISESENTDGLLRYSVKNIVGP
                              G+D ++ DDIDEAEK+LMKSDI+DSNYEEEEE+EE  KVIPKN SLK +RN KYLRYISESEN DGLLRYS KNIVGP
Subjt:  KESEKAPEDLLKMLNAELARQRGEDQDEADDIDEAEKELMKSDINDSNYEEEEEDEESAKVIPKNFSLKCVRNNKYLRYISESENTDGLLRYSVKNIVGP

Query:  YSKFAIRLSKSKPGFFHIRCCYNNKFWVRLSENSDYIAAIANEEEDDTSKWSCTLFEPIFVPEKAELCYIRHVQLNTFLCIAEGAPFPYNDCLVAIVEDI
        YSKF++  SK+KPGFFHIRCCYNNKFWVRLSE+S+YIAAIANEEEDDTSKWSCTLFEPIFVPEK  L YIRHVQLNTFLC+AEG P PYNDCLVA VEDI
Subjt:  YSKFAIRLSKSKPGFFHIRCCYNNKFWVRLSENSDYIAAIANEEEDDTSKWSCTLFEPIFVPEKAELCYIRHVQLNTFLCIAEGAPFPYNDCLVAIVEDI

Query:  STIDENLALSALME
        + IDENL LSA+ +
Subjt:  STIDENLALSALME

KAE8646726.1 hypothetical protein Csa_004904 [Cucumis sativus]1.8e-12277.56Show/hide
Query:  MGKAGTDILGGAVKGAGKVVETVGNAAEKAPVVGGIGTVVEGTGKAIENVGKATENLGEKVFENKEKIKPKKDLKDTTLDQINEDYYGDDFHFDKGDSKE
        +GKAGTDILGGAVKGAGK VETVGNAAEKAPVVGGIGTVVEGTGKAIENV                                             GDSKE
Subjt:  MGKAGTDILGGAVKGAGKVVETVGNAAEKAPVVGGIGTVVEGTGKAIENVGKATENLGEKVFENKEKIKPKKDLKDTTLDQINEDYYGDDFHFDKGDSKE

Query:  SEKAPEDLLKMLNAELARQRGEDQDEADDIDEAEKELMKSDINDSNYEEEEEDEESAKVIPKNFSLKCVRNNKYLRYISESENTDGLLRYSVKNIVGPYS
        SEKAP+D+LKMLN E+AR+RGE +D+AD+IDEAEKELMKSDIND+NYEE EEDEES KVIPKNFSLKCVRNNKYLRYISESENTDGLLRYS KNIVGPYS
Subjt:  SEKAPEDLLKMLNAELARQRGEDQDEADDIDEAEKELMKSDINDSNYEEEEEDEESAKVIPKNFSLKCVRNNKYLRYISESENTDGLLRYSVKNIVGPYS

Query:  KFAIRLSKSKPGFFHIRCCYNNKFWVRLSENSDYIAAIANEEEDDTSKWSCTLFEPIFVPEKAELCYIRHVQLNTFLCIAEGAPFPYNDCLVAIVEDIST
        KFAIR SK+KPGFFHIRCCYNNKFWVRLSENSDYIAAIANEEEDDTSKWS TLFEPIFV EK  LCYIRHVQLN FLCIAEGAPFPYNDCLVA VEDIST
Subjt:  KFAIRLSKSKPGFFHIRCCYNNKFWVRLSENSDYIAAIANEEEDDTSKWSCTLFEPIFVPEKAELCYIRHVQLNTFLCIAEGAPFPYNDCLVAIVEDIST

Query:  IDENLALSALME
        IDENLALSA+M+
Subjt:  IDENLALSALME

KAE8646727.1 hypothetical protein Csa_005365 [Cucumis sativus]5.1e-9669.26Show/hide
Query:  APVVGGIGTVVEGTGKAIENVGKATENLGEKVFENKEKIKPKKDLKDTTLDQINEDYYGDDFHFDKGDSKESEKAPEDLLKMLNAELARQRGEDQDEADD
        APVVG +GTVVEGTGKAIENVG+ATE+ GE+VFE KE+ KP++  K       +E+Y  DD    K    E ++  ED  + ++  +A   G+D ++ DD
Subjt:  APVVGGIGTVVEGTGKAIENVGKATENLGEKVFENKEKIKPKKDLKDTTLDQINEDYYGDDFHFDKGDSKESEKAPEDLLKMLNAELARQRGEDQDEADD

Query:  IDEAEKELMKSDINDSNYEEEEEDEESAKVIPKNFSLKCVRNNKYLRYISESENTDGLLRYSVKNIVGPYSKFAIRLSKSKPGFFHIRCCYNNKFWVRLS
        IDEAEK+LMKSDI+DSNYEEEEE+EE  KVIPKN SLK +RN KYLRYISESEN DGLLR+S KNIVGPYSKF++  SK+KPGFFHIRCCYNNKFWVRLS
Subjt:  IDEAEKELMKSDINDSNYEEEEEDEESAKVIPKNFSLKCVRNNKYLRYISESENTDGLLRYSVKNIVGPYSKFAIRLSKSKPGFFHIRCCYNNKFWVRLS

Query:  ENSDYIAAIANEEEDDTSKWSCTLFEPIFVPEKAELCYIRHVQLNTFLCIAEGAPFPYNDCLVAIVEDISTIDENLALSALME
        E+S+YIAA+ANEEEDDTSKWSCTLFEPIFVPEK  L YIRHVQLNTFLC+AEG P PYNDCLVA VEDI+TIDENL L A+ +
Subjt:  ENSDYIAAIANEEEDDTSKWSCTLFEPIFVPEKAELCYIRHVQLNTFLCIAEGAPFPYNDCLVAIVEDISTIDENLALSALME

XP_004140683.2 uncharacterized protein LOC101212952 [Cucumis sativus]2.3e-10970.51Show/hide
Query:  MGKAGTDILGGAVKGAGKVVETVGNAAEKAPVVGGIGTVVEGTGKAIENVGKATENLGEKVFENKEKIKPKKDLKDTTLDQINEDYYGDDFHFDKGDSKE
        +GKAGTDILGGAVKGAGK+VETVG+  EKAPVVG +GTVVEGTGKAIENVG+ATE+ GE+VFE KE+ KP++  K       +E+Y  DD    K    E
Subjt:  MGKAGTDILGGAVKGAGKVVETVGNAAEKAPVVGGIGTVVEGTGKAIENVGKATENLGEKVFENKEKIKPKKDLKDTTLDQINEDYYGDDFHFDKGDSKE

Query:  SEKAPEDLLKMLNAELARQRGEDQDEADDIDEAEKELMKSDINDSNYEEEEEDEESAKVIPKNFSLKCVRNNKYLRYISESENTDGLLRYSVKNIVGPYS
         ++  ED  + ++  +A   G+D ++ DDIDEAEK+LMKSDI+DSNYEEEEE+EE  KVIPKN SLK +RN KYLRYISESEN DGLLR+S KNIVGPYS
Subjt:  SEKAPEDLLKMLNAELARQRGEDQDEADDIDEAEKELMKSDINDSNYEEEEEDEESAKVIPKNFSLKCVRNNKYLRYISESENTDGLLRYSVKNIVGPYS

Query:  KFAIRLSKSKPGFFHIRCCYNNKFWVRLSENSDYIAAIANEEEDDTSKWSCTLFEPIFVPEKAELCYIRHVQLNTFLCIAEGAPFPYNDCLVAIVEDIST
        KF++  SK+KPGFFHIRCCYNNKFWVRLSE+S+YIAA+ANEEEDDTSKWSCTLFEPIFVPEK  L YIRHVQLNTFLC+AEG P PYNDCLVA VEDI+T
Subjt:  KFAIRLSKSKPGFFHIRCCYNNKFWVRLSENSDYIAAIANEEEDDTSKWSCTLFEPIFVPEKAELCYIRHVQLNTFLCIAEGAPFPYNDCLVAIVEDIST

Query:  IDENLALSALME
        IDENL L A+ +
Subjt:  IDENLALSALME

XP_008460195.1 PREDICTED: uncharacterized protein LOC103499080 [Cucumis melo]5.2e-10968.79Show/hide
Query:  MGKAGTDILGGAVKGAGKVVETVGNAAEKAPVVGGIGTVVEGTGKAIENVGKATENLGEKVFENKEKIKPKKDLKDTTLDQINE--DYYGDDFHFDKGDS
        +GKAGTDILGGAVKGAGK+VETVG+  EKAPVVGGIGTVVEGTGKAIENVG+ATE+ GE+VF+ +E+   + +  D  + Q     D   +D+  D  DS
Subjt:  MGKAGTDILGGAVKGAGKVVETVGNAAEKAPVVGGIGTVVEGTGKAIENVGKATENLGEKVFENKEKIKPKKDLKDTTLDQINE--DYYGDDFHFDKGDS

Query:  KESEKAPEDLLKMLNAELARQRGEDQDEADDIDEAEKELMKSDINDSNYEEEEEDEESAKVIPKNFSLKCVRNNKYLRYISESENTDGLLRYSVKNIVGP
                              G+D ++ DDIDEAEK+LMKSDI+DSNYEEEEE+EE  KVIPKN SLK +RN KYLRYISESEN DGLLRYS KNIVGP
Subjt:  KESEKAPEDLLKMLNAELARQRGEDQDEADDIDEAEKELMKSDINDSNYEEEEEDEESAKVIPKNFSLKCVRNNKYLRYISESENTDGLLRYSVKNIVGP

Query:  YSKFAIRLSKSKPGFFHIRCCYNNKFWVRLSENSDYIAAIANEEEDDTSKWSCTLFEPIFVPEKAELCYIRHVQLNTFLCIAEGAPFPYNDCLVAIVEDI
        YSKF++  SK+KPGFFHIRCCYNNKFWVRLSE+S+YIAAIANEEEDDTSKWSCTLFEPIFVPEK    YIRHVQLNTFLC+AEG P PYNDCLVA VEDI
Subjt:  YSKFAIRLSKSKPGFFHIRCCYNNKFWVRLSENSDYIAAIANEEEDDTSKWSCTLFEPIFVPEKAELCYIRHVQLNTFLCIAEGAPFPYNDCLVAIVEDI

Query:  STIDENLALSALME
        + IDENL LSA+ +
Subjt:  STIDENLALSALME

TrEMBL top hitse value%identityAlignment
A0A0A0K983 Uncharacterized protein1.1e-10970.51Show/hide
Query:  MGKAGTDILGGAVKGAGKVVETVGNAAEKAPVVGGIGTVVEGTGKAIENVGKATENLGEKVFENKEKIKPKKDLKDTTLDQINEDYYGDDFHFDKGDSKE
        +GKAGTDILGGAVKGAGK+VETVG+  EKAPVVG +GTVVEGTGKAIENVG+ATE+ GE+VFE KE+ KP++  K       +E+Y  DD    K    E
Subjt:  MGKAGTDILGGAVKGAGKVVETVGNAAEKAPVVGGIGTVVEGTGKAIENVGKATENLGEKVFENKEKIKPKKDLKDTTLDQINEDYYGDDFHFDKGDSKE

Query:  SEKAPEDLLKMLNAELARQRGEDQDEADDIDEAEKELMKSDINDSNYEEEEEDEESAKVIPKNFSLKCVRNNKYLRYISESENTDGLLRYSVKNIVGPYS
         ++  ED  + ++  +A   G+D ++ DDIDEAEK+LMKSDI+DSNYEEEEE+EE  KVIPKN SLK +RN KYLRYISESEN DGLLR+S KNIVGPYS
Subjt:  SEKAPEDLLKMLNAELARQRGEDQDEADDIDEAEKELMKSDINDSNYEEEEEDEESAKVIPKNFSLKCVRNNKYLRYISESENTDGLLRYSVKNIVGPYS

Query:  KFAIRLSKSKPGFFHIRCCYNNKFWVRLSENSDYIAAIANEEEDDTSKWSCTLFEPIFVPEKAELCYIRHVQLNTFLCIAEGAPFPYNDCLVAIVEDIST
        KF++  SK+KPGFFHIRCCYNNKFWVRLSE+S+YIAA+ANEEEDDTSKWSCTLFEPIFVPEK  L YIRHVQLNTFLC+AEG P PYNDCLVA VEDI+T
Subjt:  KFAIRLSKSKPGFFHIRCCYNNKFWVRLSENSDYIAAIANEEEDDTSKWSCTLFEPIFVPEKAELCYIRHVQLNTFLCIAEGAPFPYNDCLVAIVEDIST

Query:  IDENLALSALME
        IDENL L A+ +
Subjt:  IDENLALSALME

A0A0A0KD65 Uncharacterized protein6.3e-15391.03Show/hide
Query:  MGKAGTDILGGAVKGAGKVVETVGNAAEKAPVVGGIGTVVEGTGKAIENVGKATENLGEKVFENKEKIKPKKDLKDTTLDQINEDYYGDDFHFDKGDSKE
        +GKAGTDILGGAVKGAGK VETVGNAAEKAPVVGGIGTVVEGTGKAIENVGKATENLGEKVFENKEK KPKKDLKDT LDQINEDYYGDDFHFD+GDSKE
Subjt:  MGKAGTDILGGAVKGAGKVVETVGNAAEKAPVVGGIGTVVEGTGKAIENVGKATENLGEKVFENKEKIKPKKDLKDTTLDQINEDYYGDDFHFDKGDSKE

Query:  SEKAPEDLLKMLNAELARQRGEDQDEADDIDEAEKELMKSDINDSNYEEEEEDEESAKVIPKNFSLKCVRNNKYLRYISESENTDGLLRYSVKNIVGPYS
        SEKAP+D+LKMLN E+AR+RGE +D+AD+IDEAEKELMKSDIND+NYEE EEDEES KVIPKNFSLKCVRNNKYLRYISESENTDGLLRYS KNIVGPYS
Subjt:  SEKAPEDLLKMLNAELARQRGEDQDEADDIDEAEKELMKSDINDSNYEEEEEDEESAKVIPKNFSLKCVRNNKYLRYISESENTDGLLRYSVKNIVGPYS

Query:  KFAIRLSKSKPGFFHIRCCYNNKFWVRLSENSDYIAAIANEEEDDTSKWSCTLFEPIFVPEKAELCYIRHVQLNTFLCIAEGAPFPYNDCLVAIVEDIST
        KFAIR SK+KPGFFHIRCCYNNKFWVRLSENSDYIAAIANEEEDDTSKWS TLFEPIFV EK  LCYIRHVQLN FLCIAEGAPFPYNDCLVA VEDIST
Subjt:  KFAIRLSKSKPGFFHIRCCYNNKFWVRLSENSDYIAAIANEEEDDTSKWSCTLFEPIFVPEKAELCYIRHVQLNTFLCIAEGAPFPYNDCLVAIVEDIST

Query:  IDENLALSALME
        IDENLALSA+M+
Subjt:  IDENLALSALME

A0A1S3CBI1 uncharacterized protein LOC1034990802.5e-10968.79Show/hide
Query:  MGKAGTDILGGAVKGAGKVVETVGNAAEKAPVVGGIGTVVEGTGKAIENVGKATENLGEKVFENKEKIKPKKDLKDTTLDQINE--DYYGDDFHFDKGDS
        +GKAGTDILGGAVKGAGK+VETVG+  EKAPVVGGIGTVVEGTGKAIENVG+ATE+ GE+VF+ +E+   + +  D  + Q     D   +D+  D  DS
Subjt:  MGKAGTDILGGAVKGAGKVVETVGNAAEKAPVVGGIGTVVEGTGKAIENVGKATENLGEKVFENKEKIKPKKDLKDTTLDQINE--DYYGDDFHFDKGDS

Query:  KESEKAPEDLLKMLNAELARQRGEDQDEADDIDEAEKELMKSDINDSNYEEEEEDEESAKVIPKNFSLKCVRNNKYLRYISESENTDGLLRYSVKNIVGP
                              G+D ++ DDIDEAEK+LMKSDI+DSNYEEEEE+EE  KVIPKN SLK +RN KYLRYISESEN DGLLRYS KNIVGP
Subjt:  KESEKAPEDLLKMLNAELARQRGEDQDEADDIDEAEKELMKSDINDSNYEEEEEDEESAKVIPKNFSLKCVRNNKYLRYISESENTDGLLRYSVKNIVGP

Query:  YSKFAIRLSKSKPGFFHIRCCYNNKFWVRLSENSDYIAAIANEEEDDTSKWSCTLFEPIFVPEKAELCYIRHVQLNTFLCIAEGAPFPYNDCLVAIVEDI
        YSKF++  SK+KPGFFHIRCCYNNKFWVRLSE+S+YIAAIANEEEDDTSKWSCTLFEPIFVPEK    YIRHVQLNTFLC+AEG P PYNDCLVA VEDI
Subjt:  YSKFAIRLSKSKPGFFHIRCCYNNKFWVRLSENSDYIAAIANEEEDDTSKWSCTLFEPIFVPEKAELCYIRHVQLNTFLCIAEGAPFPYNDCLVAIVEDI

Query:  STIDENLALSALME
        + IDENL LSA+ +
Subjt:  STIDENLALSALME

A0A5A7T8Z0 Uncharacterized protein8.7e-11069.11Show/hide
Query:  MGKAGTDILGGAVKGAGKVVETVGNAAEKAPVVGGIGTVVEGTGKAIENVGKATENLGEKVFENKEKIKPKKDLKDTTLDQINE--DYYGDDFHFDKGDS
        +GKAGTDILGGAVKGAGK+VETVG+  EKAPVVGGIGTVVEGTGKAIENVG+ATE+ GE+VF+ +E+   + +  D  + Q     D   +D+  D  DS
Subjt:  MGKAGTDILGGAVKGAGKVVETVGNAAEKAPVVGGIGTVVEGTGKAIENVGKATENLGEKVFENKEKIKPKKDLKDTTLDQINE--DYYGDDFHFDKGDS

Query:  KESEKAPEDLLKMLNAELARQRGEDQDEADDIDEAEKELMKSDINDSNYEEEEEDEESAKVIPKNFSLKCVRNNKYLRYISESENTDGLLRYSVKNIVGP
                              G+D ++ DDIDEAEK+LMKSDI+DSNYEEEEE+EE  KVIPKN SLK +RN KYLRYISESEN DGLLRYS KNIVGP
Subjt:  KESEKAPEDLLKMLNAELARQRGEDQDEADDIDEAEKELMKSDINDSNYEEEEEDEESAKVIPKNFSLKCVRNNKYLRYISESENTDGLLRYSVKNIVGP

Query:  YSKFAIRLSKSKPGFFHIRCCYNNKFWVRLSENSDYIAAIANEEEDDTSKWSCTLFEPIFVPEKAELCYIRHVQLNTFLCIAEGAPFPYNDCLVAIVEDI
        YSKF++  SK+KPGFFHIRCCYNNKFWVRLSE+S+YIAAIANEEEDDTSKWSCTLFEPIFVPEK  L YIRHVQLNTFLC+AEG P PYNDCLVA VEDI
Subjt:  YSKFAIRLSKSKPGFFHIRCCYNNKFWVRLSENSDYIAAIANEEEDDTSKWSCTLFEPIFVPEKAELCYIRHVQLNTFLCIAEGAPFPYNDCLVAIVEDI

Query:  STIDENLALSALME
        + IDENL LSA+ +
Subjt:  STIDENLALSALME

A0A6J1GPP7 uncharacterized protein LOC1114563419.0e-9160.32Show/hide
Query:  MGKAGTDILGGAVKGAGKVVETVGNAAEKAPVVGGIGTVVEGTGKAIENVGKATENLGEKVFENKEKIKPKKDLKDTTLDQINEDYYGDDFHFDKGDSKE
        +GKAGTD LGG +KGAGK+VETVG+ AEKAP+VGG+GTVVE TGKAIEN+G+ TE+ GE+VF+  E   PK+       DQ+ EDY  DD          
Subjt:  MGKAGTDILGGAVKGAGKVVETVGNAAEKAPVVGGIGTVVEGTGKAIENVGKATENLGEKVFENKEKIKPKKDLKDTTLDQINEDYYGDDFHFDKGDSKE

Query:  SEKAPEDLLKMLNAELARQRGEDQDEADDIDEAEKELMKSD---INDSNYEEEEEDEESAKVIPKNFSLKCVRNNKYLRYISESENTDGLLRYSVKNIVG
                                   +DIDEAEK+LM  +   + D + + +++DE  AK IPKNFSLK  RNNKYLRYISESE+TDGLLR+S KNIVG
Subjt:  SEKAPEDLLKMLNAELARQRGEDQDEADDIDEAEKELMKSD---INDSNYEEEEEDEESAKVIPKNFSLKCVRNNKYLRYISESENTDGLLRYSVKNIVG

Query:  PYSKFAIRLSKSKPGFFHIRCCYNNKFWVRLSENSDYIAAIANEEEDDTSKWSCTLFEPIFVPEKAELCYIRHVQLNTFLCIAEGAPFPYNDCLVAIVED
        PYSKFAIR S+++PG  HIRCCYNNKFWVRLSE+S+YIAAIANEEE+D SKWSCTLFEPIF+P+K +  YIRHVQLNTFLC+AE  P PYNDCL A VED
Subjt:  PYSKFAIRLSKSKPGFFHIRCCYNNKFWVRLSENSDYIAAIANEEEDDTSKWSCTLFEPIFVPEKAELCYIRHVQLNTFLCIAEGAPFPYNDCLVAIVED

Query:  ISTIDENLALSALME
        ISTID+NL L   M+
Subjt:  ISTIDENLALSALME

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAAAAGCTGGGACTGACATTTTGGGAGGAGCTGTGAAAGGAGCAGGAAAGGTTGTTGAAACAGTGGGGAATGCGGCTGAGAAGGCGCCTGTCGTTGGTGGC
ATCGGAACTGTCGTGGAGGGAACCGGAAAGGCTATCGAAAATGTTGGTAAGGCGACTGAGAATTTGGGTGAAAAAGTATTTGAAAACAAAGAAAAGATCAAGCCG
AAAAAAGATCTTAAAGATACTACATTGGACCAAATTAATGAAGATTATTATGGTGATGACTTCCATTTTGACAAAGGCGATTCAAAAGAAAGCGAAAAAGCCCCC
GAAGATTTATTGAAGATGCTTAATGCTGAATTGGCACGGCAGCGTGGTGAAGATCAAGACGAAGCCGATGACATAGATGAAGCAGAGAAGGAGTTGATGAAGAGT
GATATAAACGATTCAAACTATGAAGAAGAGGAAGAAGATGAAGAATCAGCAAAGGTAATCCCGAAGAACTTCTCCCTCAAATGCGTCCGCAACAACAAATACCTT
CGGTATATAAGCGAAAGTGAAAACACGGATGGACTTCTTCGATACTCCGTCAAGAACATCGTTGGCCCGTATTCAAAATTTGCCATCCGCTTATCGAAAAGTAAG
CCAGGTTTCTTCCACATAAGATGTTGTTACAACAACAAATTCTGGGTTCGTTTATCCGAAAACTCCGACTACATTGCAGCCATTGCCAATGAAGAAGAAGATGAT
ACATCAAAGTGGTCGTGCACTTTGTTTGAACCGATTTTTGTACCGGAGAAAGCCGAACTCTGTTACATTCGTCATGTTCAACTTAACACCTTCCTTTGCATAGCT
GAAGGAGCTCCTTTTCCTTACAATGATTGTTTAGTTGCAATAGTAGAAGACATATCAACTATTGATGAGAATCTTGCCCTCTCAGCCCTCATGGAATGTAACAAC
ACCTTAAAAGAAAAATTGTTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGAAAAGCTGGGACTGACATTTTGGGAGGAGCTGTGAAAGGAGCAGGAAAGGTTGTTGAAACAGTGGGGAATGCGGCTGAGAAGGCGCCTGTCGTTGGTGGC
ATCGGAACTGTCGTGGAGGGAACCGGAAAGGCTATCGAAAATGTTGGTAAGGCGACTGAGAATTTGGGTGAAAAAGTATTTGAAAACAAAGAAAAGATCAAGCCG
AAAAAAGATCTTAAAGATACTACATTGGACCAAATTAATGAAGATTATTATGGTGATGACTTCCATTTTGACAAAGGCGATTCAAAAGAAAGCGAAAAAGCCCCC
GAAGATTTATTGAAGATGCTTAATGCTGAATTGGCACGGCAGCGTGGTGAAGATCAAGACGAAGCCGATGACATAGATGAAGCAGAGAAGGAGTTGATGAAGAGT
GATATAAACGATTCAAACTATGAAGAAGAGGAAGAAGATGAAGAATCAGCAAAGGTAATCCCGAAGAACTTCTCCCTCAAATGCGTCCGCAACAACAAATACCTT
CGGTATATAAGCGAAAGTGAAAACACGGATGGACTTCTTCGATACTCCGTCAAGAACATCGTTGGCCCGTATTCAAAATTTGCCATCCGCTTATCGAAAAGTAAG
CCAGGTTTCTTCCACATAAGATGTTGTTACAACAACAAATTCTGGGTTCGTTTATCCGAAAACTCCGACTACATTGCAGCCATTGCCAATGAAGAAGAAGATGAT
ACATCAAAGTGGTCGTGCACTTTGTTTGAACCGATTTTTGTACCGGAGAAAGCCGAACTCTGTTACATTCGTCATGTTCAACTTAACACCTTCCTTTGCATAGCT
GAAGGAGCTCCTTTTCCTTACAATGATTGTTTAGTTGCAATAGTAGAAGACATATCAACTATTGATGAGAATCTTGCCCTCTCAGCCCTCATGGAATGTAACAAC
ACCTTAAAAGAAAAATTGTTTTAG
Protein sequenceShow/hide protein sequence
MGKAGTDILGGAVKGAGKVVETVGNAAEKAPVVGGIGTVVEGTGKAIENVGKATENLGEKVFENKEKIKPKKDLKDTTLDQINEDYYGDDFHFDKGDSKESEKAP
EDLLKMLNAELARQRGEDQDEADDIDEAEKELMKSDINDSNYEEEEEDEESAKVIPKNFSLKCVRNNKYLRYISESENTDGLLRYSVKNIVGPYSKFAIRLSKSK
PGFFHIRCCYNNKFWVRLSENSDYIAAIANEEEDDTSKWSCTLFEPIFVPEKAELCYIRHVQLNTFLCIAEGAPFPYNDCLVAIVEDISTIDENLALSALMECNN
TLKEKLF