CuGenDBv2

Pay0017572 (gene) of Melon (Payzawat) v1 genome

Gene ID: Pay0017572
Organism: Cucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
Description: Agglutinin domain-containing protein
Genome location: chr11:32100300..32101286
RNA-Seq Expression: Pay0017572
Synteny: Pay0017572
Gene Ontology terms: NA
InterPro domains: IPR008998 - Agglutinin domain
                  IPR036242 - Agglutinin domain superfamily


Homology
GenBank top hits | e value | %identity | Alignment
KAA0039924.1 uncharacterized protein E6C27_scaffold122G002040 [Cucumis melo var. makuwa] | e value: 2.1e-113 | %identity: 69.69
Query:  MNLFKGMGKAGTDILGGAVKGAGKVVETVGNAAEKAPVVGGIGTVVEGTGKAIENVGKATENLGEKVFENKEKIKPKKDLKDTTLDQINE--DYYGDDFH
        MNLFKG+GKAGTDILGGAVKGAGK+VETVG+  EKAPVVGGIGTVVEGTGKAIENVG+ATE+ GE+VF+ +E+   + +  D  + Q     D   +D+ 
Subjt:  MNLFKGMGKAGTDILGGAVKGAGKVVETVGNAAEKAPVVGGIGTVVEGTGKAIENVGKATENLGEKVFENKEKIKPKKDLKDTTLDQINE--DYYGDDFH

Query:  FDKGDSKESEKAPEDLLKMLNAELARQRGEDQDEADDIDEAEKELMKSDINDSNYEEEEEDEESAKVIPKNFSLKCVRNNKYLRYISESENTDGLLRYSV
         D  DS                      G+D ++ DDIDEAEK+LMKSDI+DSNYEEEEE+EE  KVIPKN SLK +RN KYLRYISESEN DGLLRYS 
Subjt:  FDKGDSKESEKAPEDLLKMLNAELARQRGEDQDEADDIDEAEKELMKSDINDSNYEEEEEDEESAKVIPKNFSLKCVRNNKYLRYISESENTDGLLRYSV

Query:  KNIVGPYSKFAIRLSKSKPGFFHIRCCYNNKFWVRLSENSDYIAAIANEEEDDTSKWSCTLFEPIFVPEKAELCYIRHVQLNTFLCIAEGAPFPYNDCLV
        KNIVGPYSKF++  SK+KPGFFHIRCCYNNKFWVRLSE+S+YIAAIANEEEDDTSKWSCTLFEPIFVPEK  L YIRHVQLNTFLC+AEG P PYNDCLV
Subjt:  KNIVGPYSKFAIRLSKSKPGFFHIRCCYNNKFWVRLSENSDYIAAIANEEEDDTSKWSCTLFEPIFVPEKAELCYIRHVQLNTFLCIAEGAPFPYNDCLV

Query:  AIVEDISTIDENLALSALME
        A VEDI+ IDENL LSA+ +
Subjt:  AIVEDISTIDENLALSALME

KAE8646726.1 hypothetical protein Csa_004904 [Cucumis sativus] | e value: 2.8e-126 | %identity: 77.99
Query:  MNLFKGMGKAGTDILGGAVKGAGKVVETVGNAAEKAPVVGGIGTVVEGTGKAIENVGKATENLGEKVFENKEKIKPKKDLKDTTLDQINEDYYGDDFHFD
        MNLFKG+GKAGTDILGGAVKGAGK VETVGNAAEKAPVVGGIGTVVEGTGKAIENV                                            
Subjt:  MNLFKGMGKAGTDILGGAVKGAGKVVETVGNAAEKAPVVGGIGTVVEGTGKAIENVGKATENLGEKVFENKEKIKPKKDLKDTTLDQINEDYYGDDFHFD

Query:  KGDSKESEKAPEDLLKMLNAELARQRGEDQDEADDIDEAEKELMKSDINDSNYEEEEEDEESAKVIPKNFSLKCVRNNKYLRYISESENTDGLLRYSVKN
         GDSKESEKAP+D+LKMLN E+AR+RGE +D+AD+IDEAEKELMKSDIND+NYEE EEDEES KVIPKNFSLKCVRNNKYLRYISESENTDGLLRYS KN
Subjt:  KGDSKESEKAPEDLLKMLNAELARQRGEDQDEADDIDEAEKELMKSDINDSNYEEEEEDEESAKVIPKNFSLKCVRNNKYLRYISESENTDGLLRYSVKN

Query:  IVGPYSKFAIRLSKSKPGFFHIRCCYNNKFWVRLSENSDYIAAIANEEEDDTSKWSCTLFEPIFVPEKAELCYIRHVQLNTFLCIAEGAPFPYNDCLVAI
        IVGPYSKFAIR SK+KPGFFHIRCCYNNKFWVRLSENSDYIAAIANEEEDDTSKWS TLFEPIFV EK  LCYIRHVQLN FLCIAEGAPFPYNDCLVA 
Subjt:  IVGPYSKFAIRLSKSKPGFFHIRCCYNNKFWVRLSENSDYIAAIANEEEDDTSKWSCTLFEPIFVPEKAELCYIRHVQLNTFLCIAEGAPFPYNDCLVAI

Query:  VEDISTIDENLALSALME
        VEDISTIDENLALSA+M+
Subjt:  VEDISTIDENLALSALME

KAE8646727.1 hypothetical protein Csa_005365 [Cucumis sativus] | e value: 5.2e-96 | %identity: 69.26
Query:  APVVGGIGTVVEGTGKAIENVGKATENLGEKVFENKEKIKPKKDLKDTTLDQINEDYYGDDFHFDKGDSKESEKAPEDLLKMLNAELARQRGEDQDEADD
        APVVG +GTVVEGTGKAIENVG+ATE+ GE+VFE KE+ KP++  K       +E+Y  DD    K    E ++  ED  + ++  +A   G+D ++ DD
Subjt:  APVVGGIGTVVEGTGKAIENVGKATENLGEKVFENKEKIKPKKDLKDTTLDQINEDYYGDDFHFDKGDSKESEKAPEDLLKMLNAELARQRGEDQDEADD

Query:  IDEAEKELMKSDINDSNYEEEEEDEESAKVIPKNFSLKCVRNNKYLRYISESENTDGLLRYSVKNIVGPYSKFAIRLSKSKPGFFHIRCCYNNKFWVRLS
        IDEAEK+LMKSDI+DSNYEEEEE+EE  KVIPKN SLK +RN KYLRYISESEN DGLLR+S KNIVGPYSKF++  SK+KPGFFHIRCCYNNKFWVRLS
Subjt:  IDEAEKELMKSDINDSNYEEEEEDEESAKVIPKNFSLKCVRNNKYLRYISESENTDGLLRYSVKNIVGPYSKFAIRLSKSKPGFFHIRCCYNNKFWVRLS

Query:  ENSDYIAAIANEEEDDTSKWSCTLFEPIFVPEKAELCYIRHVQLNTFLCIAEGAPFPYNDCLVAIVEDISTIDENLALSALME
        E+S+YIAA+ANEEEDDTSKWSCTLFEPIFVPEK  L YIRHVQLNTFLC+AEG P PYNDCLVA VEDI+TIDENL L A+ +
Subjt:  ENSDYIAAIANEEEDDTSKWSCTLFEPIFVPEKAELCYIRHVQLNTFLCIAEGAPFPYNDCLVAIVEDISTIDENLALSALME

XP_004140683.2 uncharacterized protein LOC101212952 [Cucumis sativus] | e value: 2.7e-113 | %identity: 71.07
Query:  MNLFKGMGKAGTDILGGAVKGAGKVVETVGNAAEKAPVVGGIGTVVEGTGKAIENVGKATENLGEKVFENKEKIKPKKDLKDTTLDQINEDYYGDDFHFD
        MNLFKG+GKAGTDILGGAVKGAGK+VETVG+  EKAPVVG +GTVVEGTGKAIENVG+ATE+ GE+VFE KE+ KP++  K       +E+Y  DD    
Subjt:  MNLFKGMGKAGTDILGGAVKGAGKVVETVGNAAEKAPVVGGIGTVVEGTGKAIENVGKATENLGEKVFENKEKIKPKKDLKDTTLDQINEDYYGDDFHFD

Query:  KGDSKESEKAPEDLLKMLNAELARQRGEDQDEADDIDEAEKELMKSDINDSNYEEEEEDEESAKVIPKNFSLKCVRNNKYLRYISESENTDGLLRYSVKN
        K    E ++  ED  + ++  +A   G+D ++ DDIDEAEK+LMKSDI+DSNYEEEEE+EE  KVIPKN SLK +RN KYLRYISESEN DGLLR+S KN
Subjt:  KGDSKESEKAPEDLLKMLNAELARQRGEDQDEADDIDEAEKELMKSDINDSNYEEEEEDEESAKVIPKNFSLKCVRNNKYLRYISESENTDGLLRYSVKN

Query:  IVGPYSKFAIRLSKSKPGFFHIRCCYNNKFWVRLSENSDYIAAIANEEEDDTSKWSCTLFEPIFVPEKAELCYIRHVQLNTFLCIAEGAPFPYNDCLVAI
        IVGPYSKF++  SK+KPGFFHIRCCYNNKFWVRLSE+S+YIAA+ANEEEDDTSKWSCTLFEPIFVPEK  L YIRHVQLNTFLC+AEG P PYNDCLVA 
Subjt:  IVGPYSKFAIRLSKSKPGFFHIRCCYNNKFWVRLSENSDYIAAIANEEEDDTSKWSCTLFEPIFVPEKAELCYIRHVQLNTFLCIAEGAPFPYNDCLVAI

Query:  VEDISTIDENLALSALME
        VEDI+TIDENL L A+ +
Subjt:  VEDISTIDENLALSALME

XP_008460195.1 PREDICTED: uncharacterized protein LOC103499080 [Cucumis melo] | e value: 6.1e-113 | %identity: 69.38
Query:  MNLFKGMGKAGTDILGGAVKGAGKVVETVGNAAEKAPVVGGIGTVVEGTGKAIENVGKATENLGEKVFENKEKIKPKKDLKDTTLDQINE--DYYGDDFH
        MNLFKG+GKAGTDILGGAVKGAGK+VETVG+  EKAPVVGGIGTVVEGTGKAIENVG+ATE+ GE+VF+ +E+   + +  D  + Q     D   +D+ 
Subjt:  MNLFKGMGKAGTDILGGAVKGAGKVVETVGNAAEKAPVVGGIGTVVEGTGKAIENVGKATENLGEKVFENKEKIKPKKDLKDTTLDQINE--DYYGDDFH

Query:  FDKGDSKESEKAPEDLLKMLNAELARQRGEDQDEADDIDEAEKELMKSDINDSNYEEEEEDEESAKVIPKNFSLKCVRNNKYLRYISESENTDGLLRYSV
         D  DS                      G+D ++ DDIDEAEK+LMKSDI+DSNYEEEEE+EE  KVIPKN SLK +RN KYLRYISESEN DGLLRYS 
Subjt:  FDKGDSKESEKAPEDLLKMLNAELARQRGEDQDEADDIDEAEKELMKSDINDSNYEEEEEDEESAKVIPKNFSLKCVRNNKYLRYISESENTDGLLRYSV

Query:  KNIVGPYSKFAIRLSKSKPGFFHIRCCYNNKFWVRLSENSDYIAAIANEEEDDTSKWSCTLFEPIFVPEKAELCYIRHVQLNTFLCIAEGAPFPYNDCLV
        KNIVGPYSKF++  SK+KPGFFHIRCCYNNKFWVRLSE+S+YIAAIANEEEDDTSKWSCTLFEPIFVPEK    YIRHVQLNTFLC+AEG P PYNDCLV
Subjt:  KNIVGPYSKFAIRLSKSKPGFFHIRCCYNNKFWVRLSENSDYIAAIANEEEDDTSKWSCTLFEPIFVPEKAELCYIRHVQLNTFLCIAEGAPFPYNDCLV

Query:  AIVEDISTIDENLALSALME
        A VEDI+ IDENL LSA+ +
Subjt:  AIVEDISTIDENLALSALME

TrEMBL top hits | e value | %identity | Alignment
A0A0A0K983 Uncharacterized protein | e value: 1.3e-113 | %identity: 71.07
Query:  MNLFKGMGKAGTDILGGAVKGAGKVVETVGNAAEKAPVVGGIGTVVEGTGKAIENVGKATENLGEKVFENKEKIKPKKDLKDTTLDQINEDYYGDDFHFD
        MNLFKG+GKAGTDILGGAVKGAGK+VETVG+  EKAPVVG +GTVVEGTGKAIENVG+ATE+ GE+VFE KE+ KP++  K       +E+Y  DD    
Subjt:  MNLFKGMGKAGTDILGGAVKGAGKVVETVGNAAEKAPVVGGIGTVVEGTGKAIENVGKATENLGEKVFENKEKIKPKKDLKDTTLDQINEDYYGDDFHFD

Query:  KGDSKESEKAPEDLLKMLNAELARQRGEDQDEADDIDEAEKELMKSDINDSNYEEEEEDEESAKVIPKNFSLKCVRNNKYLRYISESENTDGLLRYSVKN
        K    E ++  ED  + ++  +A   G+D ++ DDIDEAEK+LMKSDI+DSNYEEEEE+EE  KVIPKN SLK +RN KYLRYISESEN DGLLR+S KN
Subjt:  KGDSKESEKAPEDLLKMLNAELARQRGEDQDEADDIDEAEKELMKSDINDSNYEEEEEDEESAKVIPKNFSLKCVRNNKYLRYISESENTDGLLRYSVKN

Query:  IVGPYSKFAIRLSKSKPGFFHIRCCYNNKFWVRLSENSDYIAAIANEEEDDTSKWSCTLFEPIFVPEKAELCYIRHVQLNTFLCIAEGAPFPYNDCLVAI
        IVGPYSKF++  SK+KPGFFHIRCCYNNKFWVRLSE+S+YIAA+ANEEEDDTSKWSCTLFEPIFVPEK  L YIRHVQLNTFLC+AEG P PYNDCLVA 
Subjt:  IVGPYSKFAIRLSKSKPGFFHIRCCYNNKFWVRLSENSDYIAAIANEEEDDTSKWSCTLFEPIFVPEKAELCYIRHVQLNTFLCIAEGAPFPYNDCLVAI

Query:  VEDISTIDENLALSALME
        VEDI+TIDENL L A+ +
Subjt:  VEDISTIDENLALSALME

A0A0A0KD65 Uncharacterized protein | e value: 9.6e-157 | %identity: 91.19
Query:  MNLFKGMGKAGTDILGGAVKGAGKVVETVGNAAEKAPVVGGIGTVVEGTGKAIENVGKATENLGEKVFENKEKIKPKKDLKDTTLDQINEDYYGDDFHFD
        MNLFKG+GKAGTDILGGAVKGAGK VETVGNAAEKAPVVGGIGTVVEGTGKAIENVGKATENLGEKVFENKEK KPKKDLKDT LDQINEDYYGDDFHFD
Subjt:  MNLFKGMGKAGTDILGGAVKGAGKVVETVGNAAEKAPVVGGIGTVVEGTGKAIENVGKATENLGEKVFENKEKIKPKKDLKDTTLDQINEDYYGDDFHFD

Query:  KGDSKESEKAPEDLLKMLNAELARQRGEDQDEADDIDEAEKELMKSDINDSNYEEEEEDEESAKVIPKNFSLKCVRNNKYLRYISESENTDGLLRYSVKN
        +GDSKESEKAP+D+LKMLN E+AR+RGE +D+AD+IDEAEKELMKSDIND+NYEE EEDEES KVIPKNFSLKCVRNNKYLRYISESENTDGLLRYS KN
Subjt:  KGDSKESEKAPEDLLKMLNAELARQRGEDQDEADDIDEAEKELMKSDINDSNYEEEEEDEESAKVIPKNFSLKCVRNNKYLRYISESENTDGLLRYSVKN

Query:  IVGPYSKFAIRLSKSKPGFFHIRCCYNNKFWVRLSENSDYIAAIANEEEDDTSKWSCTLFEPIFVPEKAELCYIRHVQLNTFLCIAEGAPFPYNDCLVAI
        IVGPYSKFAIR SK+KPGFFHIRCCYNNKFWVRLSENSDYIAAIANEEEDDTSKWS TLFEPIFV EK  LCYIRHVQLN FLCIAEGAPFPYNDCLVA 
Subjt:  IVGPYSKFAIRLSKSKPGFFHIRCCYNNKFWVRLSENSDYIAAIANEEEDDTSKWSCTLFEPIFVPEKAELCYIRHVQLNTFLCIAEGAPFPYNDCLVAI

Query:  VEDISTIDENLALSALME
        VEDISTIDENLALSA+M+
Subjt:  VEDISTIDENLALSALME

A0A1S3CBI1 uncharacterized protein LOC103499080 | e value: 2.9e-113 | %identity: 69.38
Query:  MNLFKGMGKAGTDILGGAVKGAGKVVETVGNAAEKAPVVGGIGTVVEGTGKAIENVGKATENLGEKVFENKEKIKPKKDLKDTTLDQINE--DYYGDDFH
        MNLFKG+GKAGTDILGGAVKGAGK+VETVG+  EKAPVVGGIGTVVEGTGKAIENVG+ATE+ GE+VF+ +E+   + +  D  + Q     D   +D+ 
Subjt:  MNLFKGMGKAGTDILGGAVKGAGKVVETVGNAAEKAPVVGGIGTVVEGTGKAIENVGKATENLGEKVFENKEKIKPKKDLKDTTLDQINE--DYYGDDFH

Query:  FDKGDSKESEKAPEDLLKMLNAELARQRGEDQDEADDIDEAEKELMKSDINDSNYEEEEEDEESAKVIPKNFSLKCVRNNKYLRYISESENTDGLLRYSV
         D  DS                      G+D ++ DDIDEAEK+LMKSDI+DSNYEEEEE+EE  KVIPKN SLK +RN KYLRYISESEN DGLLRYS 
Subjt:  FDKGDSKESEKAPEDLLKMLNAELARQRGEDQDEADDIDEAEKELMKSDINDSNYEEEEEDEESAKVIPKNFSLKCVRNNKYLRYISESENTDGLLRYSV

Query:  KNIVGPYSKFAIRLSKSKPGFFHIRCCYNNKFWVRLSENSDYIAAIANEEEDDTSKWSCTLFEPIFVPEKAELCYIRHVQLNTFLCIAEGAPFPYNDCLV
        KNIVGPYSKF++  SK+KPGFFHIRCCYNNKFWVRLSE+S+YIAAIANEEEDDTSKWSCTLFEPIFVPEK    YIRHVQLNTFLC+AEG P PYNDCLV
Subjt:  KNIVGPYSKFAIRLSKSKPGFFHIRCCYNNKFWVRLSENSDYIAAIANEEEDDTSKWSCTLFEPIFVPEKAELCYIRHVQLNTFLCIAEGAPFPYNDCLV

Query:  AIVEDISTIDENLALSALME
        A VEDI+ IDENL LSA+ +
Subjt:  AIVEDISTIDENLALSALME

A0A5A7T8Z0 Uncharacterized protein | e value: 1.0e-113 | %identity: 69.69
Query:  MNLFKGMGKAGTDILGGAVKGAGKVVETVGNAAEKAPVVGGIGTVVEGTGKAIENVGKATENLGEKVFENKEKIKPKKDLKDTTLDQINE--DYYGDDFH
        MNLFKG+GKAGTDILGGAVKGAGK+VETVG+  EKAPVVGGIGTVVEGTGKAIENVG+ATE+ GE+VF+ +E+   + +  D  + Q     D   +D+ 
Subjt:  MNLFKGMGKAGTDILGGAVKGAGKVVETVGNAAEKAPVVGGIGTVVEGTGKAIENVGKATENLGEKVFENKEKIKPKKDLKDTTLDQINE--DYYGDDFH

Query:  FDKGDSKESEKAPEDLLKMLNAELARQRGEDQDEADDIDEAEKELMKSDINDSNYEEEEEDEESAKVIPKNFSLKCVRNNKYLRYISESENTDGLLRYSV
         D  DS                      G+D ++ DDIDEAEK+LMKSDI+DSNYEEEEE+EE  KVIPKN SLK +RN KYLRYISESEN DGLLRYS 
Subjt:  FDKGDSKESEKAPEDLLKMLNAELARQRGEDQDEADDIDEAEKELMKSDINDSNYEEEEEDEESAKVIPKNFSLKCVRNNKYLRYISESENTDGLLRYSV

Query:  KNIVGPYSKFAIRLSKSKPGFFHIRCCYNNKFWVRLSENSDYIAAIANEEEDDTSKWSCTLFEPIFVPEKAELCYIRHVQLNTFLCIAEGAPFPYNDCLV
        KNIVGPYSKF++  SK+KPGFFHIRCCYNNKFWVRLSE+S+YIAAIANEEEDDTSKWSCTLFEPIFVPEK  L YIRHVQLNTFLC+AEG P PYNDCLV
Subjt:  KNIVGPYSKFAIRLSKSKPGFFHIRCCYNNKFWVRLSENSDYIAAIANEEEDDTSKWSCTLFEPIFVPEKAELCYIRHVQLNTFLCIAEGAPFPYNDCLV

Query:  AIVEDISTIDENLALSALME
        A VEDI+ IDENL LSA+ +
Subjt:  AIVEDISTIDENLALSALME

A0A6J1GPP7 uncharacterized protein LOC111456341 | e value: 1.5e-93 | %identity: 60.44
Query:  MNLFKGMGKAGTDILGGAVKGAGKVVETVGNAAEKAPVVGGIGTVVEGTGKAIENVGKATENLGEKVFENKEKIKPKKDLKDTTLDQINEDYYGDDFHFD
        MNL +G+GKAGTD LGG +KGAGK+VETVG+ AEKAP+VGG+GTVVE TGKAIEN+G+ TE+ GE+VF+  E   PK+       DQ+ EDY  DD    
Subjt:  MNLFKGMGKAGTDILGGAVKGAGKVVETVGNAAEKAPVVGGIGTVVEGTGKAIENVGKATENLGEKVFENKEKIKPKKDLKDTTLDQINEDYYGDDFHFD

Query:  KGDSKESEKAPEDLLKMLNAELARQRGEDQDEADDIDEAEKELMKSD---INDSNYEEEEEDEESAKVIPKNFSLKCVRNNKYLRYISESENTDGLLRYS
                                         +DIDEAEK+LM  +   + D + + +++DE  AK IPKNFSLK  RNNKYLRYISESE+TDGLLR+S
Subjt:  KGDSKESEKAPEDLLKMLNAELARQRGEDQDEADDIDEAEKELMKSD---INDSNYEEEEEDEESAKVIPKNFSLKCVRNNKYLRYISESENTDGLLRYS

Query:  VKNIVGPYSKFAIRLSKSKPGFFHIRCCYNNKFWVRLSENSDYIAAIANEEEDDTSKWSCTLFEPIFVPEKAELCYIRHVQLNTFLCIAEGAPFPYNDCL
         KNIVGPYSKFAIR S+++PG  HIRCCYNNKFWVRLSE+S+YIAAIANEEE+D SKWSCTLFEPIF+P+K +  YIRHVQLNTFLC+AE  P PYNDCL
Subjt:  VKNIVGPYSKFAIRLSKSKPGFFHIRCCYNNKFWVRLSENSDYIAAIANEEEDDTSKWSCTLFEPIFVPEKAELCYIRHVQLNTFLCIAEGAPFPYNDCL

Query:  VAIVEDISTIDENLALSALME
         A VEDISTID+NL L   M+
Subjt:  VAIVEDISTIDENLALSALME

SwissProt top hits | e value | %identity | Alignment
No hits found
Arabidopsis top hits | e value | %identity | Alignment
No hits found
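
The alignment blocks above follow the usual BLAST pairwise layout: the middle line repeats a residue where query and subject are identical, shows `+` for a conservative substitution, and is left blank for a mismatch or gap. As a minimal sketch in Python (the function name `midline_stats` is hypothetical and not part of CuGenDB), the counts for a single block could be tallied as below. One block alone will not reproduce the %identity column, which is presumably computed over the full alignment.

```python
def midline_stats(midline: str) -> tuple[int, int]:
    """Return (identities, positives) for one BLAST midline string.

    Letters mark identical residues; '+' marks conservative substitutions.
    Positives include identities, as in standard BLAST reporting.
    """
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives


# Final block of the first GenBank hit (KAA0039924.1), copied from above.
query = "AIVEDISTIDENLALSALME"
midline = "A VEDI+ IDENL LSA+ +"

identities, positives = midline_stats(midline)
print(f"{identities}/{len(query)} identical, {positives}/{len(query)} positive")
# prints: 13/20 identical, 16/20 positive
```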

Sequences
CDS sequence
ATGAATCTTTTCAAAGGAATGGGAAAAGCTGGGACTGACATTTTGGGAGGAGCTGTGAAAGGAGCAGGAAAGGTTGTTGAAACAGTGGGGAATGCGGCTGAGAAGGCGCC
TGTCGTTGGTGGCATCGGAACTGTCGTGGAGGGAACCGGAAAGGCTATCGAAAATGTTGGTAAGGCGACTGAGAATTTGGGTGAAAAAGTATTTGAAAACAAAGAAAAGA
TCAAGCCGAAAAAAGATCTTAAAGATACTACATTGGACCAAATTAATGAAGATTATTATGGTGATGACTTCCATTTTGACAAAGGCGATTCAAAAGAAAGCGAAAAAGCC
CCCGAAGATTTATTGAAGATGCTTAATGCTGAATTGGCACGGCAGCGTGGTGAAGATCAAGACGAAGCCGATGACATAGATGAAGCAGAGAAGGAGTTGATGAAGAGTGA
TATAAACGATTCAAACTATGAAGAAGAGGAAGAAGATGAAGAATCAGCAAAGGTAATCCCGAAGAACTTCTCCCTCAAATGCGTCCGCAACAACAAATACCTTCGGTATA
TAAGCGAAAGTGAAAACACGGATGGACTTCTTCGATACTCCGTCAAGAACATCGTTGGCCCGTATTCAAAATTTGCCATCCGCTTATCGAAAAGTAAGCCAGGTTTCTTC
CACATAAGATGTTGTTACAACAACAAATTCTGGGTTCGTTTATCCGAAAACTCCGACTACATTGCAGCCATTGCCAATGAAGAAGAAGATGATACATCAAAGTGGTCGTG
CACTTTGTTTGAACCGATTTTTGTACCGGAGAAAGCCGAACTCTGTTACATTCGTCATGTTCAACTTAACACCTTCCTTTGCATAGCTGAAGGAGCTCCTTTTCCTTACA
ATGATTGTTTAGTTGCAATAGTAGAAGACATATCAACTATTGATGAGAATCTTGCCCTCTCAGCCCTCATGGAATGTAACAACACCTTAAAAGAAAAATTGTTTTAG
mRNA sequence
ATGAATCTTTTCAAAGGAATGGGAAAAGCTGGGACTGACATTTTGGGAGGAGCTGTGAAAGGAGCAGGAAAGGTTGTTGAAACAGTGGGGAATGCGGCTGAGAAGGCGCC
TGTCGTTGGTGGCATCGGAACTGTCGTGGAGGGAACCGGAAAGGCTATCGAAAATGTTGGTAAGGCGACTGAGAATTTGGGTGAAAAAGTATTTGAAAACAAAGAAAAGA
TCAAGCCGAAAAAAGATCTTAAAGATACTACATTGGACCAAATTAATGAAGATTATTATGGTGATGACTTCCATTTTGACAAAGGCGATTCAAAAGAAAGCGAAAAAGCC
CCCGAAGATTTATTGAAGATGCTTAATGCTGAATTGGCACGGCAGCGTGGTGAAGATCAAGACGAAGCCGATGACATAGATGAAGCAGAGAAGGAGTTGATGAAGAGTGA
TATAAACGATTCAAACTATGAAGAAGAGGAAGAAGATGAAGAATCAGCAAAGGTAATCCCGAAGAACTTCTCCCTCAAATGCGTCCGCAACAACAAATACCTTCGGTATA
TAAGCGAAAGTGAAAACACGGATGGACTTCTTCGATACTCCGTCAAGAACATCGTTGGCCCGTATTCAAAATTTGCCATCCGCTTATCGAAAAGTAAGCCAGGTTTCTTC
CACATAAGATGTTGTTACAACAACAAATTCTGGGTTCGTTTATCCGAAAACTCCGACTACATTGCAGCCATTGCCAATGAAGAAGAAGATGATACATCAAAGTGGTCGTG
CACTTTGTTTGAACCGATTTTTGTACCGGAGAAAGCCGAACTCTGTTACATTCGTCATGTTCAACTTAACACCTTCCTTTGCATAGCTGAAGGAGCTCCTTTTCCTTACA
ATGATTGTTTAGTTGCAATAGTAGAAGACATATCAACTATTGATGAGAATCTTGCCCTCTCAGCCCTCATGGAATGTAACAACACCTTAAAAGAAAAATTGTTTTAG
Protein sequence
MNLFKGMGKAGTDILGGAVKGAGKVVETVGNAAEKAPVVGGIGTVVEGTGKAIENVGKATENLGEKVFENKEKIKPKKDLKDTTLDQINEDYYGDDFHFDKGDSKESEKA
PEDLLKMLNAELARQRGEDQDEADDIDEAEKELMKSDINDSNYEEEEEDEESAKVIPKNFSLKCVRNNKYLRYISESENTDGLLRYSVKNIVGPYSKFAIRLSKSKPGFF
HIRCCYNNKFWVRLSENSDYIAAIANEEEDDTSKWSCTLFEPIFVPEKAELCYIRHVQLNTFLCIAEGAPFPYNDCLVAIVEDISTIDENLALSALMECNNTLKEKLF
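
The protein sequence above should correspond to the CDS read codon by codon up to the terminal stop. As a rough sketch for checking this, assuming the standard nuclear genetic code (the helper `translate` is hypothetical, not a CuGenDB tool), a short Python translation routine might look like this. It is shown on the first and last few codons of the CDS rather than the full sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 36 nt and last 18 nt of the CDS sequence listed above.
print(translate("ATGAATCTTTTCAAAGGAATGGGAAAAGCTGGGACT"))  # prints: MNLFKGMGKAGT
print(translate("AAAGAAAAATTGTTTTAG"))                    # prints: KEKLF
```

Pasting the full CDS (line breaks included) into `translate` should give a string that can be compared directly with the protein sequence listed above.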