; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0014729 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0014729
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
Descriptionprotamine P1 family protein
Genome locationchr08:28617856..28619157
RNA-Seq ExpressionPay0014729
SyntenyPay0014729
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0047280.1 uncharacterized protein E6C27_scaffold908G00730 [Cucumis melo var. makuwa]8.7e-13798.86Show/hide
Query:  MKQLSKSISSPSRTDLFPPPLMSFLRADAGNRSKSSRSRSSPIFVRKKNVAIETEEPSSPKVTCMGQVRTNKRSSNKTPAVRCRWIRSVLSFNRRHCRTF
        MKQLSKSISSPSRTDLFPPPLMSFLRADAGNRSKSSRSRSSPIF+RKKNVAIETEEPSSPKVTCMGQVRTNKRSSNKTPAVRCRWIRSVLSFNRRHCRTF
Subjt:  MKQLSKSISSPSRTDLFPPPLMSFLRADAGNRSKSSRSRSSPIFVRKKNVAIETEEPSSPKVTCMGQVRTNKRSSNKTPAVRCRWIRSVLSFNRRHCRTF

Query:  WNRSAMLFRGKREIRRISESRVGNEAEDSEKDEEDDDGRDRDAVYASSSVPSPPKNALILTRCRSTPNRSSFNGNRYRSSSITSDGSTVEEEEKTERGFG
        WNRSAMLFRGKREIRRISESRVGNEAEDSEKDEEDDDGRDRDAVYASSSVPSPPKNALILTRCRSTPNRSSFN NRYRSSSITSDGSTVEEEEKTERGFG
Subjt:  WNRSAMLFRGKREIRRISESRVGNEAEDSEKDEEDDDGRDRDAVYASSSVPSPPKNALILTRCRSTPNRSSFNGNRYRSSSITSDGSTVEEEEKTERGFG

Query:  NNTASKIESRNSERLLKKLESSKGDGDCKSVNGNRNLILTRCKSEPARIAEKLYGELNLQEEER
        NNTASKIE RNSERLLKKLESSKGDGDCKSVNGNRNLILTRCKSEPARIAEKLYGELNLQEEER
Subjt:  NNTASKIESRNSERLLKKLESSKGDGDCKSVNGNRNLILTRCKSEPARIAEKLYGELNLQEEER

XP_008449922.1 PREDICTED: uncharacterized protein LOC103491651 [Cucumis melo]1.8e-14598.92Show/hide
Query:  MKQLSKSISSPSRTDLFPPPLMSFLRADAGNRSKSSRSRSSPIFVRKKNVAIETEEPSSPKVTCMGQVRTNKRSSNKTPAVRCRWIRSVLSFNRRHCRTF
        MKQLSKSISSPSRTDLFPPPLMSFLRADAGNRSKSSRSRSSPIF+RKKNVAIETEEPSSPKVTCMGQVRTNKRSSNKTPAVRCRWIRSVLSFNRRHCRTF
Subjt:  MKQLSKSISSPSRTDLFPPPLMSFLRADAGNRSKSSRSRSSPIFVRKKNVAIETEEPSSPKVTCMGQVRTNKRSSNKTPAVRCRWIRSVLSFNRRHCRTF

Query:  WNRSAMLFRGKREIRRISESRVGNEAEDSEKDEEDDDGRDRDAVYASSSVPSPPKNALILTRCRSTPNRSSFNGNRYRSSSITSDGSTVEEEEKTERGFG
        WNRSAMLFRGKREIRRISESRVGNEAEDSEKDEEDDDGRDRDAVYASSSVPSPPKNALILTRCRSTPNRSSFN NRYRSSSITSDGSTVEEEEKTERGFG
Subjt:  WNRSAMLFRGKREIRRISESRVGNEAEDSEKDEEDDDGRDRDAVYASSSVPSPPKNALILTRCRSTPNRSSFNGNRYRSSSITSDGSTVEEEEKTERGFG

Query:  NNTASKIESRNSERLLKKLESSKGDGDCKSVNGNRNLILTRCKSEPARIAEKLYGELNLQEEERWVMNKKNSYILNNL
        NNTASKIE RNSERLLKKLESSKGDGDCKSVNGNRNLILTRCKSEPARIAEKLYGELNLQEEERWVMNKKNSYILNNL
Subjt:  NNTASKIESRNSERLLKKLESSKGDGDCKSVNGNRNLILTRCKSEPARIAEKLYGELNLQEEERWVMNKKNSYILNNL

XP_011658465.2 uncharacterized protein LOC105436016 [Cucumis sativus]2.6e-12589.32Show/hide
Query:  MKQLSKSISSPSRTDLFPPPLMSFLRADAGNRSKSSRSRSSPIFVRKKNVAIETEEPSSPKVTCMGQVRTNKRSSNKTPAVRCRWIRSVLSFNRRHCRTF
        MKQLSKSISSPSRTDLFPPPLMSFLRADAGNRSKSSRSRSSPIFV KKNVAIET+EPSSPKVTCMGQVRTNK SSNKTPAVRCRWIRSVLSFNRRHCRTF
Subjt:  MKQLSKSISSPSRTDLFPPPLMSFLRADAGNRSKSSRSRSSPIFVRKKNVAIETEEPSSPKVTCMGQVRTNKRSSNKTPAVRCRWIRSVLSFNRRHCRTF

Query:  WNRSAMLFRGKREIRRISESRVGNEAEDSEKDEEDDDGRDRDAVYASSSVPSPPKNALILTRCRSTPNRSSFNGNRYRSSSITSDGST-VEEEEKTERGF
        WNRSAML RGKREIRRISESRVGNEAEDSEKDEE+DDGRD DAVY+S SVPSPPKNALIL+RCRS PNRSSFNG RYRSSSITSDG+  VEEEEKTE GF
Subjt:  WNRSAMLFRGKREIRRISESRVGNEAEDSEKDEEDDDGRDRDAVYASSSVPSPPKNALILTRCRSTPNRSSFNGNRYRSSSITSDGST-VEEEEKTERGF

Query:  GNNTASKIESRNSERLLKKLESSKGDGDCKSVNGNR--NLILTRCKSEPARIAEKLYGEL-NLQEEERWVMNKKNSYILNN
         NN ASKIE   SERLLKK+ESSKGDGD KSVNGNR  NLILTR KSEP RIAEKLYGEL NLQEE+RWVMNKK  YILNN
Subjt:  GNNTASKIESRNSERLLKKLESSKGDGDCKSVNGNR--NLILTRCKSEPARIAEKLYGEL-NLQEEERWVMNKKNSYILNN

XP_023543942.1 uncharacterized protein LOC111803666 [Cucurbita pepo subsp. pepo]9.8e-8871.96Show/hide
Query:  MKQLSKSISSPSRTDLFPPPLMSFLRADAGNRSKSSRSRSSPIFVRKKNVAIETEEPSSPKVTCMGQVRTNKRSSNKTPAVRCRWIRSVLSFNRRHCRTF
        MKQ  K ISSPSR DLFPPPLMSFLRADAGNRSKS RSRSSPIF+RKKNVAIET+EPSSPKVTCMGQVRTNKRSS + PA RCRWIRSVLSFNRRHCRTF
Subjt:  MKQLSKSISSPSRTDLFPPPLMSFLRADAGNRSKSSRSRSSPIFVRKKNVAIETEEPSSPKVTCMGQVRTNKRSSNKTPAVRCRWIRSVLSFNRRHCRTF

Query:  WNRSAMLFRGKREIRR---ISESRVGNEAEDSEKDEEDDDGRDRDAVYASSSVPSPPKNALILTRCRSTPNRSSFNGNRYRSSSITSDGSTVE--EEEKT
        WNRS M F+GKREIRR   I+ESRV +EAEDSE++EE  +G  RD V+ASS+ PSPPKNALILTRCRS P+RSSF GNRY  SSI SD +  E  E EKT
Subjt:  WNRSAMLFRGKREIRR---ISESRVGNEAEDSEKDEEDDDGRDRDAVYASSSVPSPPKNALILTRCRSTPNRSSFNGNRYRSSSITSDGSTVE--EEEKT

Query:  ERGFGNNTASKIESRNSERLLKKLESSKGDGDCKSVN-----------GNRNLILTRCKSEPARIAEKLYG
        E   GN  ASK ES+NSER+ +KLE+S G+ D  SVN            NR+LILTRCKSEP RI E+LYG
Subjt:  ERGFGNNTASKIESRNSERLLKKLESSKGDGDCKSVN-----------GNRNLILTRCKSEPARIAEKLYG

XP_038882779.1 uncharacterized protein LOC120073931 [Benincasa hispida]1.5e-9676.34Show/hide
Query:  MKQLSKSISSPSRTDLFPPPLMSFLRADAGNRSKSSRSRSSPIFVRKKN-VAIETEEPSSPKVTCMGQVRTNKRSSNKTPAVRCRWIRSVLSFNRRHCRT
        MK+LSKSISSPSRTDLFPPPLMSFLRADAGNRSKS RSRSSPIFV KKN VAIET+EPSSPKVTCMGQVR    SSNKTPA RCRWIRSVLSFNRR+CRT
Subjt:  MKQLSKSISSPSRTDLFPPPLMSFLRADAGNRSKSSRSRSSPIFVRKKN-VAIETEEPSSPKVTCMGQVRTNKRSSNKTPAVRCRWIRSVLSFNRRHCRT

Query:  FWNRSAMLFRGKREIRR---ISESRVGNEAEDSEKDEEDDDGRDRDAVYASSSVPSPPKNALILTRCRSTPNRSSFNGNRYRSSSITSDGSTVEEEEKTE
        FWN SAM FR K EIRR   I ESRVGNEAEDSEKDEE+D G  RDAV+ SSSVPSPPKNALILTRCRS PNR+SF GNRYRS  ITSDGS  EEEEK E
Subjt:  FWNRSAMLFRGKREIRR---ISESRVGNEAEDSEKDEEDDDGRDRDAVYASSSVPSPPKNALILTRCRSTPNRSSFNGNRYRSSSITSDGSTVEEEEKTE

Query:  RGFGNNTASKIESRNSERLLKKLESSKGDGDCKSVNG-----------NRNLILTRCKSEPARIAEKLYGELNLQEEER
           GN+ AS+IE R  E L  K+E++KGDGDC+ V+            NR LILTRCKSEPARIAEK+YGELNL+EEER
Subjt:  RGFGNNTASKIESRNSERLLKKLESSKGDGDCKSVNG-----------NRNLILTRCKSEPARIAEKLYGELNLQEEER

TrEMBL top hitse value%identityAlignment
A0A0A0KK43 Uncharacterized protein1.3e-12589.32Show/hide
Query:  MKQLSKSISSPSRTDLFPPPLMSFLRADAGNRSKSSRSRSSPIFVRKKNVAIETEEPSSPKVTCMGQVRTNKRSSNKTPAVRCRWIRSVLSFNRRHCRTF
        MKQLSKSISSPSRTDLFPPPLMSFLRADAGNRSKSSRSRSSPIFV KKNVAIET+EPSSPKVTCMGQVRTNK SSNKTPAVRCRWIRSVLSFNRRHCRTF
Subjt:  MKQLSKSISSPSRTDLFPPPLMSFLRADAGNRSKSSRSRSSPIFVRKKNVAIETEEPSSPKVTCMGQVRTNKRSSNKTPAVRCRWIRSVLSFNRRHCRTF

Query:  WNRSAMLFRGKREIRRISESRVGNEAEDSEKDEEDDDGRDRDAVYASSSVPSPPKNALILTRCRSTPNRSSFNGNRYRSSSITSDGST-VEEEEKTERGF
        WNRSAML RGKREIRRISESRVGNEAEDSEKDEE+DDGRD DAVY+S SVPSPPKNALIL+RCRS PNRSSFNG RYRSSSITSDG+  VEEEEKTE GF
Subjt:  WNRSAMLFRGKREIRRISESRVGNEAEDSEKDEEDDDGRDRDAVYASSSVPSPPKNALILTRCRSTPNRSSFNGNRYRSSSITSDGST-VEEEEKTERGF

Query:  GNNTASKIESRNSERLLKKLESSKGDGDCKSVNGNR--NLILTRCKSEPARIAEKLYGEL-NLQEEERWVMNKKNSYILNN
         NN ASKIE   SERLLKK+ESSKGDGD KSVNGNR  NLILTR KSEP RIAEKLYGEL NLQEE+RWVMNKK  YILNN
Subjt:  GNNTASKIESRNSERLLKKLESSKGDGDCKSVNGNR--NLILTRCKSEPARIAEKLYGEL-NLQEEERWVMNKKNSYILNN

A0A1S3BN59 uncharacterized protein LOC1034916518.5e-14698.92Show/hide
Query:  MKQLSKSISSPSRTDLFPPPLMSFLRADAGNRSKSSRSRSSPIFVRKKNVAIETEEPSSPKVTCMGQVRTNKRSSNKTPAVRCRWIRSVLSFNRRHCRTF
        MKQLSKSISSPSRTDLFPPPLMSFLRADAGNRSKSSRSRSSPIF+RKKNVAIETEEPSSPKVTCMGQVRTNKRSSNKTPAVRCRWIRSVLSFNRRHCRTF
Subjt:  MKQLSKSISSPSRTDLFPPPLMSFLRADAGNRSKSSRSRSSPIFVRKKNVAIETEEPSSPKVTCMGQVRTNKRSSNKTPAVRCRWIRSVLSFNRRHCRTF

Query:  WNRSAMLFRGKREIRRISESRVGNEAEDSEKDEEDDDGRDRDAVYASSSVPSPPKNALILTRCRSTPNRSSFNGNRYRSSSITSDGSTVEEEEKTERGFG
        WNRSAMLFRGKREIRRISESRVGNEAEDSEKDEEDDDGRDRDAVYASSSVPSPPKNALILTRCRSTPNRSSFN NRYRSSSITSDGSTVEEEEKTERGFG
Subjt:  WNRSAMLFRGKREIRRISESRVGNEAEDSEKDEEDDDGRDRDAVYASSSVPSPPKNALILTRCRSTPNRSSFNGNRYRSSSITSDGSTVEEEEKTERGFG

Query:  NNTASKIESRNSERLLKKLESSKGDGDCKSVNGNRNLILTRCKSEPARIAEKLYGELNLQEEERWVMNKKNSYILNNL
        NNTASKIE RNSERLLKKLESSKGDGDCKSVNGNRNLILTRCKSEPARIAEKLYGELNLQEEERWVMNKKNSYILNNL
Subjt:  NNTASKIESRNSERLLKKLESSKGDGDCKSVNGNRNLILTRCKSEPARIAEKLYGELNLQEEERWVMNKKNSYILNNL

A0A5A7TVU1 Uncharacterized protein4.2e-13798.86Show/hide
Query:  MKQLSKSISSPSRTDLFPPPLMSFLRADAGNRSKSSRSRSSPIFVRKKNVAIETEEPSSPKVTCMGQVRTNKRSSNKTPAVRCRWIRSVLSFNRRHCRTF
        MKQLSKSISSPSRTDLFPPPLMSFLRADAGNRSKSSRSRSSPIF+RKKNVAIETEEPSSPKVTCMGQVRTNKRSSNKTPAVRCRWIRSVLSFNRRHCRTF
Subjt:  MKQLSKSISSPSRTDLFPPPLMSFLRADAGNRSKSSRSRSSPIFVRKKNVAIETEEPSSPKVTCMGQVRTNKRSSNKTPAVRCRWIRSVLSFNRRHCRTF

Query:  WNRSAMLFRGKREIRRISESRVGNEAEDSEKDEEDDDGRDRDAVYASSSVPSPPKNALILTRCRSTPNRSSFNGNRYRSSSITSDGSTVEEEEKTERGFG
        WNRSAMLFRGKREIRRISESRVGNEAEDSEKDEEDDDGRDRDAVYASSSVPSPPKNALILTRCRSTPNRSSFN NRYRSSSITSDGSTVEEEEKTERGFG
Subjt:  WNRSAMLFRGKREIRRISESRVGNEAEDSEKDEEDDDGRDRDAVYASSSVPSPPKNALILTRCRSTPNRSSFNGNRYRSSSITSDGSTVEEEEKTERGFG

Query:  NNTASKIESRNSERLLKKLESSKGDGDCKSVNGNRNLILTRCKSEPARIAEKLYGELNLQEEER
        NNTASKIE RNSERLLKKLESSKGDGDCKSVNGNRNLILTRCKSEPARIAEKLYGELNLQEEER
Subjt:  NNTASKIESRNSERLLKKLESSKGDGDCKSVNGNRNLILTRCKSEPARIAEKLYGELNLQEEER

A0A6J1ECV0 uncharacterized protein LOC1114330212.2e-8570.85Show/hide
Query:  MKQLSKSISSPSRTDLFPPPLMSFLRADAGNRSKSSRSRSSPIFVRKKNVAIETEEPSSPKVTCMGQVRTNKRSSNKTPAVRCRWIRSVLSFNRRHCRTF
        MKQ  K ISSPSR DLFPPPLMSFLRADAGNRSKS RSRSSPIF+RKKNVAIET+EPSSPKVTCMGQVRTNKRSS + PA RCRWIRSVLSFNRR CRTF
Subjt:  MKQLSKSISSPSRTDLFPPPLMSFLRADAGNRSKSSRSRSSPIFVRKKNVAIETEEPSSPKVTCMGQVRTNKRSSNKTPAVRCRWIRSVLSFNRRHCRTF

Query:  WNRSAMLFRGKREIRR---ISESRVGNEAEDSEKDEEDDDGRDRDAVYASSSVPSPPKNALILTRCRSTPNRSSFNGNRYRSSSITSDGSTVE--EEEKT
        WNRS M F+G REIRR   I+ESRV +EAEDSE++EE  +G  RD V+ASS+ PSPPKNALILTRCRS P+RSSF  NRY  SSI SD +  E  E EKT
Subjt:  WNRSAMLFRGKREIRR---ISESRVGNEAEDSEKDEEDDDGRDRDAVYASSSVPSPPKNALILTRCRSTPNRSSFNGNRYRSSSITSDGSTVE--EEEKT

Query:  ERGFGNNTASKIESRNSERLLKKLESSKGDGDCKSVN-----------GNRNLILTRCKSEPARIAEKLYG
        E    N  ASKIES+NSER+ +KLE+S G+ D  SVN            NR+LILTRCKSEP RI E+LYG
Subjt:  ERGFGNNTASKIESRNSERLLKKLESSKGDGDCKSVN-----------GNRNLILTRCKSEPARIAEKLYG

A0A6J1IHY6 uncharacterized protein LOC1114776261.1e-8470.63Show/hide
Query:  MKQLSKSISSPSRTDLFPPPLMSFLRADAGNRSKSSRSRSSPIFVRKKNVAIETEEPSSPKVTCMGQVRTNKRSSNKTPAVRCRWIRSVLSFNRRHCRTF
        MKQ  K ISSPSR DLFPPPLMSFLRADAGNRSKS RSRSSPIF+RKKNV IET+EPSSPKVTCMGQVRTNKRSS + PA RCRWIRSVLSFNRRHCRTF
Subjt:  MKQLSKSISSPSRTDLFPPPLMSFLRADAGNRSKSSRSRSSPIFVRKKNVAIETEEPSSPKVTCMGQVRTNKRSSNKTPAVRCRWIRSVLSFNRRHCRTF

Query:  WNRSAMLFRGKREIRR---ISESRVGNEAEDSEKDEEDDDGRDRDAVYASSSVPSPPKNALILTRCRSTPNRSSFNGNRYRSSSITSDGSTVEEEEKTER
        WNRS M F+GK EIRR   I+ESRV +EAEDSE++EE  +G  RDAV+ASS+ PSPPKNALILTRCRS P+RSSF GN  RS            EE+ + 
Subjt:  WNRSAMLFRGKREIRR---ISESRVGNEAEDSEKDEEDDDGRDRDAVYASSSVPSPPKNALILTRCRSTPNRSSFNGNRYRSSSITSDGSTVEEEEKTER

Query:  GFGNNTASKIESRNSERLLKKLESSKGDGDCKSVN-----------GNRNLILTRCKSEPARIAEKLYG
        G GN  ASKIESRNSER+ KKLE+S G+ D  SVN            NR+LILTRCKSEP RI EKLYG
Subjt:  GFGNNTASKIESRNSERLLKKLESSKGDGDCKSVN-----------GNRNLILTRCKSEPARIAEKLYG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G37100.1 protamine P1 family protein8.4e-2133.45Show/hide
Query:  SKSISSPSRTDLFPPPLMSFLRADAGNRSKSSRSRSSPIFVRKKN--VAIETEEPSSPKVTCMGQVRTNKRSSNKTPAV--------------RCRWIRS
        S+ +SSP RT+  PP LM FLR  + +RS+ SRSR  PIF R+KN   A ET+EP+SPKVTCMGQVR N+    K                  RC W+++
Subjt:  SKSISSPSRTDLFPPPLMSFLRADAGNRSKSSRSRSSPIFVRKKN--VAIETEEPSSPKVTCMGQVRTNKRSSNKTPAV--------------RCRWIRS

Query:  VL---SFNR-----------RHCRTFWNRSAMLFRGKREIRRISESRVGNEAEDSEKDEE--DDDGRDRDAVYASSSVPSPPKNALILTRCRSTPNRSSF
             SF             R  ++F + S      KR     SE   G    + E+ EE   ++ ++ +A    S   +PP+NA +LTRCRS P RS  
Subjt:  VL---SFNR-----------RHCRTFWNRSAMLFRGKREIRRISESRVGNEAEDSEKDEE--DDDGRDRDAVYASSSVPSPPKNALILTRCRSTPNRSSF

Query:  NGNRYRSSSITSDGSTVEEEEKTERGFGNNTASKIESRNSERLL-KKLESSKGDGDCKSVNGN--RNLILTRCKSEPARIAEKL
        + N        ++ +  +    +E    +       +  +ERL   + ES+  +   +SV G+  + LILTRC SEPAR+  ++
Subjt:  NGNRYRSSSITSDGSTVEEEEKTERGFGNNTASKIESRNSERLL-KKLESSKGDGDCKSVNGN--RNLILTRCKSEPARIAEKL

AT5G03110.1 FUNCTIONS IN: molecular_function unknown7.1e-2030.93Show/hide
Query:  KSISSPSRTDLFPPPLMSFLR--ADAGNRSKS-----SRSRSSPIFVRK-KNVAIETEEPSSPKVTCMGQVRTNK-------RSSNKTPAVRCRWIRSVL
        + +SSP R + +PPP M FLR  ++ G+ S+S      RSR+SP+FVR+ K+ A   +EPSSPKVTCMGQVR N+        S +     RC W+R+  
Subjt:  KSISSPSRTDLFPPPLMSFLR--ADAGNRSKS-----SRSRSSPIFVRK-KNVAIETEEPSSPKVTCMGQVRTNK-------RSSNKTPAVRCRWIRSVL

Query:  SFN----RRHCRTFWNR--------SAMLFRGKREIRRISESRVGNEAEDSEKDEEDDDGRDRDAVYASSSVPSPPKNALILTRCRSTPNRSSFNGNRYR
         +N    +    TFW +        +    + K   R   +        + +++ E ++  +   ++ S +  +PP NAL+LTR RS P RSS    R+ 
Subjt:  SFN----RRHCRTFWNR--------SAMLFRGKREIRRISESRVGNEAEDSEKDEEDDDGRDRDAVYASSSVPSPPKNALILTRCRSTPNRSSFNGNRYR

Query:  SSSITSDGSTVEEEEKTERGFGNNTASKIESRNSERLLKKLES-------SKGDGDCKSVNGNRNLILTRCKSEPARIAEKLYGELNLQEE
                     EE  +R   +    + E  +SE  ++K+          + + +   +   R  +LTR KSEPARI EK+   L  +EE
Subjt:  SSSITSDGSTVEEEEKTERGFGNNTASKIESRNSERLLKKLES-------SKGDGDCKSVNGNRNLILTRCKSEPARIAEKLYGELNLQEE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGCAATTATCCAAATCGATTTCCAGTCCCAGTCGAACCGACCTGTTTCCGCCGCCATTGATGAGCTTCCTTAGAGCCGATGCCGGAAATCGTAGTAAAAGCAGCCG
GTCTCGCTCCAGTCCGATCTTCGTCAGGAAGAAGAACGTCGCCATTGAAACTGAAGAACCGTCCTCTCCGAAGGTTACTTGTATGGGACAAGTTCGCACCAATAAACGCT
CTTCTAATAAAACTCCCGCCGTTCGTTGCCGGTGGATTAGAAGCGTCCTCTCTTTCAATCGACGCCACTGTCGAACGTTCTGGAACAGGTCGGCGATGCTCTTCCGTGGA
AAACGTGAAATTAGACGGATCTCTGAATCTCGCGTCGGAAACGAAGCGGAAGATTCGGAGAAAGATGAAGAGGACGACGACGGAAGAGATAGAGATGCGGTTTATGCGTC
GTCTTCGGTGCCATCGCCGCCGAAAAACGCTCTCATTCTGACGAGATGTAGATCTACGCCAAATCGTTCGTCGTTTAACGGTAATCGGTACAGGAGTTCGTCGATTACAA
GCGACGGAAGTACTGTCGAAGAAGAAGAGAAAACAGAGCGCGGTTTTGGAAACAACACGGCTTCCAAAATAGAGTCACGAAACTCGGAACGATTATTGAAGAAACTGGAA
AGTTCAAAGGGAGATGGAGATTGTAAGTCTGTAAATGGCAATCGGAACCTGATTCTGACGAGATGTAAATCGGAACCTGCGAGAATTGCGGAGAAACTTTACGGAGAATT
GAATCTTCAGGAAGAAGAAAGGTGGGTTATGAATAAGAAAAATTCTTACATACTGAATAACTTGTGA
mRNA sequenceShow/hide mRNA sequence
AAAAATTTCAAAAAAATATAATATCAAAAGAAGAAAAATTTGAAAGGCGAAAAAGAAGCAGATGGCAAATTTCCAAAATAGAAAGAATACAAACTCAAAGGGGATAAACA
GATAGCTCCGTGACATGAAAACTAAAGGGACCAATCACCAGCACAGGTTCTTAGTGGACGAACAAACAAAGACCAAACGAATCCAAACGGAAACGATTCAATTCATGCTT
TGATATCATCATTTTTTTTTCTCACAGATATTCTTCGCAACAATAATGAAGCAATTATCCAAATCGATTTCCAGTCCCAGTCGAACCGACCTGTTTCCGCCGCCATTGAT
GAGCTTCCTTAGAGCCGATGCCGGAAATCGTAGTAAAAGCAGCCGGTCTCGCTCCAGTCCGATCTTCGTCAGGAAGAAGAACGTCGCCATTGAAACTGAAGAACCGTCCT
CTCCGAAGGTTACTTGTATGGGACAAGTTCGCACCAATAAACGCTCTTCTAATAAAACTCCCGCCGTTCGTTGCCGGTGGATTAGAAGCGTCCTCTCTTTCAATCGACGC
CACTGTCGAACGTTCTGGAACAGGTCGGCGATGCTCTTCCGTGGAAAACGTGAAATTAGACGGATCTCTGAATCTCGCGTCGGAAACGAAGCGGAAGATTCGGAGAAAGA
TGAAGAGGACGACGACGGAAGAGATAGAGATGCGGTTTATGCGTCGTCTTCGGTGCCATCGCCGCCGAAAAACGCTCTCATTCTGACGAGATGTAGATCTACGCCAAATC
GTTCGTCGTTTAACGGTAATCGGTACAGGAGTTCGTCGATTACAAGCGACGGAAGTACTGTCGAAGAAGAAGAGAAAACAGAGCGCGGTTTTGGAAACAACACGGCTTCC
AAAATAGAGTCACGAAACTCGGAACGATTATTGAAGAAACTGGAAAGTTCAAAGGGAGATGGAGATTGTAAGTCTGTAAATGGCAATCGGAACCTGATTCTGACGAGATG
TAAATCGGAACCTGCGAGAATTGCGGAGAAACTTTACGGAGAATTGAATCTTCAGGAAGAAGAAAGGTGGGTTATGAATAAGAAAAATTCTTACATACTGAATAACTTGT
GATTACACGATTGAATATCTTCGGGTTTATAGATTTGCTCCATGAATTATGAGAAAACGCAAGGTTTGGCTATGAATTTTGTTTTTGTTAGACTGAATTTGGCGGGTGCA
GAAATGGAGATTTTGGATGAGGCGGCTAAATGGAAGAACAGTGAAAGTCAGTTGAGCTTTAATTATTAGTGCATTAGATTGTAAATTAGTCG
Protein sequenceShow/hide protein sequence
MKQLSKSISSPSRTDLFPPPLMSFLRADAGNRSKSSRSRSSPIFVRKKNVAIETEEPSSPKVTCMGQVRTNKRSSNKTPAVRCRWIRSVLSFNRRHCRTFWNRSAMLFRG
KREIRRISESRVGNEAEDSEKDEEDDDGRDRDAVYASSSVPSPPKNALILTRCRSTPNRSSFNGNRYRSSSITSDGSTVEEEEKTERGFGNNTASKIESRNSERLLKKLE
SSKGDGDCKSVNGNRNLILTRCKSEPARIAEKLYGELNLQEEERWVMNKKNSYILNNL