CuGenDBv2

Lag0042131 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0042131
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: RRP15-like protein
Genome location: chr13:36902708..36904863
RNA-Seq Expression: Lag0042131
Synteny: Lag0042131
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
KAA0047668.1 RRP15-like protein [Cucumis melo var. makuwa] (e value: 9.2e-105; identity: 73.93%)
Query:  MKGGHGALEMAKTVMEVADLTWSAIECCHHHMPSDDTAKRTPTEEEELEALRSENRRLRNLLKQNLDLLQKLSESHCLLKDCPPDVFPLFALPFSVM--N
        MKGGHGALE+AKTV+EVAD+ WSAIECCHHH PS D+A+R  TEEEEL+ALRSENRRLR LL+QNLDLLQ +SESHCLLKDCPPDVFP F   F +    
Subjt:  MKGGHGALEMAKTVMEVADLTWSAIECCHHHMPSDDTAKRTPTEEEELEALRSENRRLRNLLKQNLDLLQKLSESHCLLKDCPPDVFPLFALPFSVM--N

Query:  LSGPSLDYDQKSF--CTRLVATVDSEKFLNEIKSLNEASNDGISYEFPFREA------TADILVNVSREAPSWWIWITEDMVPSKVEEWSGIDDESYVIV
         SG        +F    RLVATVDSEKFLNEIKSLNEAS DGISYEFPFREA      TADILVNVSREAPSWW+W+TEDMVP+KVEEWSGIDDE+YVIV
Subjt:  LSGPSLDYDQKSF--CTRLVATVDSEKFLNEIKSLNEASNDGISYEFPFREA------TADILVNVSREAPSWWIWITEDMVPSKVEEWSGIDDESYVIV

Query:  SEEHVVDAVAHFMAR------------------SIAKALNGMGSKMDKMFEIWHAGLLFYSLATWGLALAGLYNGRAILK
        SEEHVVDAVAHFMAR                  +IAKAL+GMGSK++KMFEIWHAGLLFYSLATWGLALAGLY GRAILK
Subjt:  SEEHVVDAVAHFMAR------------------SIAKALNGMGSKMDKMFEIWHAGLLFYSLATWGLALAGLYNGRAILK

KAG6602090.1 hypothetical protein SDJN03_07323, partial [Cucurbita argyrosperma subsp. sororia] (e value: 8.6e-103; identity: 73.19%)
Query:  MKGGHGALEMAKTVMEVADLTWSAIECCHHHMPSDDTAKRTPTEEEELEALRSENRRLRNLLKQNLDLLQKLSESHCLLKDCPPDVFPLFALPFSVMNLS
        MKGGHGALE+AKTVMEVAD+ W+AIECCHHH PS+D A+R  TEEE+LEALRSENRRLRNLL+QNLDLLQ+LSESHCLLKDCPPD++             
Subjt:  MKGGHGALEMAKTVMEVADLTWSAIECCHHHMPSDDTAKRTPTEEEELEALRSENRRLRNLLKQNLDLLQKLSESHCLLKDCPPDVFPLFALPFSVMNLS

Query:  GPSLDYDQKSFCTRLVATVDSEKFLNEIKSLNEASNDGISYEFPFREA------TADILVNVSREAPSWWIWITEDMVPSKVEEWSGIDDESYVIVSEEH
                     RLVATVDSEKFLNEIKSLNEAS DGI+YEFPFREA      TA+ILVNVSR+APSWWIW+TEDMVPSKVEEWSGIDDESYVIVSEEH
Subjt:  GPSLDYDQKSFCTRLVATVDSEKFLNEIKSLNEASNDGISYEFPFREA------TADILVNVSREAPSWWIWITEDMVPSKVEEWSGIDDESYVIVSEEH

Query:  VVDAVAHFMAR------------------SIAKALNGMGSKMDKMFEIWHAGLLFYSLATWGLALAGLYNGRAILK
        VVDAVAHFMAR                  +IAKAL GMGSKM+KMFEIWHAGLLFYSLATWGLALAGLY GRAILK
Subjt:  VVDAVAHFMAR------------------SIAKALNGMGSKMDKMFEIWHAGLLFYSLATWGLALAGLYNGRAILK

KAG7032794.1 hypothetical protein SDJN02_06844 [Cucurbita argyrosperma subsp. argyrosperma] (e value: 2.5e-102; identity: 72.83%)
Query:  MKGGHGALEMAKTVMEVADLTWSAIECCHHHMPSDDTAKRTPTEEEELEALRSENRRLRNLLKQNLDLLQKLSESHCLLKDCPPDVFPLFALPFSVMNLS
        MKGGHGALE+AKTVMEVAD+ W+AIECCHHH PS+D  +R  TEEE+LEALRSENRRLRNLL+QNLDLLQ+LSESHCLLKDCPPD++             
Subjt:  MKGGHGALEMAKTVMEVADLTWSAIECCHHHMPSDDTAKRTPTEEEELEALRSENRRLRNLLKQNLDLLQKLSESHCLLKDCPPDVFPLFALPFSVMNLS

Query:  GPSLDYDQKSFCTRLVATVDSEKFLNEIKSLNEASNDGISYEFPFREA------TADILVNVSREAPSWWIWITEDMVPSKVEEWSGIDDESYVIVSEEH
                     RLVATVDSEKFLNEIKSLNEAS DGI+YEFPFREA      TA+ILVNVSR+APSWWIW+TEDMVPSKVEEWSGIDDESYVIVSEEH
Subjt:  GPSLDYDQKSFCTRLVATVDSEKFLNEIKSLNEASNDGISYEFPFREA------TADILVNVSREAPSWWIWITEDMVPSKVEEWSGIDDESYVIVSEEH

Query:  VVDAVAHFMAR------------------SIAKALNGMGSKMDKMFEIWHAGLLFYSLATWGLALAGLYNGRAILK
        VVDAVAHFMAR                  +IAKAL GMGSKM+KMFEIWHAGLLFYSLATWGLALAGLY GRAILK
Subjt:  VVDAVAHFMAR------------------SIAKALNGMGSKMDKMFEIWHAGLLFYSLATWGLALAGLYNGRAILK

XP_022958280.1 uncharacterized protein LOC111459551 [Cucurbita moschata] (e value: 2.7e-104; identity: 72.18%)
Query:  DRDGRFVQMKGGHGALEMAKTVMEVADLTWSAIECCHHHMPSDDTAKRTPTEEEELEALRSENRRLRNLLKQNLDLLQKLSESHCLLKDCPPDVFPLFAL
        D + RF+QMKGGHGALE+AKTVMEVAD+ W+AIECCHHH PS+D A+   TEEE+LEALRSENRRLRNLL+QNLDLLQ+LSESHCLLKDCPPD++     
Subjt:  DRDGRFVQMKGGHGALEMAKTVMEVADLTWSAIECCHHHMPSDDTAKRTPTEEEELEALRSENRRLRNLLKQNLDLLQKLSESHCLLKDCPPDVFPLFAL

Query:  PFSVMNLSGPSLDYDQKSFCTRLVATVDSEKFLNEIKSLNEASNDGISYEFPFREA------TADILVNVSREAPSWWIWITEDMVPSKVEEWSGIDDES
                             RLVATVDSEKFLNEIKSLNEAS DGI+YEFPFREA      TA+ILVNVSR+APSWWIW+TEDMVPSKVEEWSGIDDES
Subjt:  PFSVMNLSGPSLDYDQKSFCTRLVATVDSEKFLNEIKSLNEASNDGISYEFPFREA------TADILVNVSREAPSWWIWITEDMVPSKVEEWSGIDDES

Query:  YVIVSEEHVVDAVAHFMAR------------------SIAKALNGMGSKMDKMFEIWHAGLLFYSLATWGLALAGLYNGRAILK
        YVIVSEEHVVDAVAHFMAR                  +IAKAL GMGSKM+KMFEIWHAGLLFYSLATWGLALAGLY GRAILK
Subjt:  YVIVSEEHVVDAVAHFMAR------------------SIAKALNGMGSKMDKMFEIWHAGLLFYSLATWGLALAGLYNGRAILK

XP_038884152.1 uncharacterized protein LOC120075067 [Benincasa hispida] (e value: 7.0e-105; identity: 74.28%)
Query:  MKGGHGALEMAKTVMEVADLTWSAIECCHHHMPSDDTAKRTPTEEEELEALRSENRRLRNLLKQNLDLLQKLSESHCLLKDCPPDVFPLFALPFSVMNLS
        MKGGHGALEMAKTV+EVAD+ WSAIECCHHH PSDD  +RTPTEEEEL+ALRS+NRRLRNLL+QNLDLLQKLSESHCLLKDCPPD++             
Subjt:  MKGGHGALEMAKTVMEVADLTWSAIECCHHHMPSDDTAKRTPTEEEELEALRSENRRLRNLLKQNLDLLQKLSESHCLLKDCPPDVFPLFALPFSVMNLS

Query:  GPSLDYDQKSFCTRLVATVDSEKFLNEIKSLNEASNDGISYEFPFREA------TADILVNVSREAPSWWIWITEDMVPSKVEEWSGIDDESYVIVSEEH
                     RLVATVDSEKFLNEIKSL EAS DGISYEFPFREA      TADILVNVSREAPSWWIW+TEDMVPSKVEEWSGIDDESYVIVSEEH
Subjt:  GPSLDYDQKSFCTRLVATVDSEKFLNEIKSLNEASNDGISYEFPFREA------TADILVNVSREAPSWWIWITEDMVPSKVEEWSGIDDESYVIVSEEH

Query:  VVDAVAHFMAR------------------SIAKALNGMGSKMDKMFEIWHAGLLFYSLATWGLALAGLYNGRAILK
        VVDAVAHFMAR                  +IAKAL+GMGSK++KMFEIWHAG+LFYSLATWGLALAGLY GRAILK
Subjt:  VVDAVAHFMAR------------------SIAKALNGMGSKMDKMFEIWHAGLLFYSLATWGLALAGLYNGRAILK

TrEMBL top hits (e value, % identity, alignment)
A0A1S3B4J2 uncharacterized protein LOC103485914 (e value: 1.0e-101; identity: 72.1%)
Query:  MKGGHGALEMAKTVMEVADLTWSAIECCHHHMPSDDTAKRTPTEEEELEALRSENRRLRNLLKQNLDLLQKLSESHCLLKDCPPDVFPLFALPFSVMNLS
        MKGGHGALE+AKTV+EVAD+ WSAIECCHHH PS D+A+R  TEEEEL+ALRSENRRLR LL+QNLDLLQ +SESHCLLKDCPPD++             
Subjt:  MKGGHGALEMAKTVMEVADLTWSAIECCHHHMPSDDTAKRTPTEEEELEALRSENRRLRNLLKQNLDLLQKLSESHCLLKDCPPDVFPLFALPFSVMNLS

Query:  GPSLDYDQKSFCTRLVATVDSEKFLNEIKSLNEASNDGISYEFPFREA------TADILVNVSREAPSWWIWITEDMVPSKVEEWSGIDDESYVIVSEEH
                     RLVATVDSEKFLNEIKSLNEAS DGISYEFPFREA      TADILVNVSREAPSWW+W+TEDMVP+KVEEWSGIDDE+YVIVSEEH
Subjt:  GPSLDYDQKSFCTRLVATVDSEKFLNEIKSLNEASNDGISYEFPFREA------TADILVNVSREAPSWWIWITEDMVPSKVEEWSGIDDESYVIVSEEH

Query:  VVDAVAHFMAR------------------SIAKALNGMGSKMDKMFEIWHAGLLFYSLATWGLALAGLYNGRAILK
        VVDAVAHFMAR                  +IAKAL+GMGSK++KMFEIWHAGLLFYSLATWGLALAGLY GRAILK
Subjt:  VVDAVAHFMAR------------------SIAKALNGMGSKMDKMFEIWHAGLLFYSLATWGLALAGLYNGRAILK

A0A5A7TXM2 RRP15-like protein (e value: 4.5e-105; identity: 73.93%)
Query:  MKGGHGALEMAKTVMEVADLTWSAIECCHHHMPSDDTAKRTPTEEEELEALRSENRRLRNLLKQNLDLLQKLSESHCLLKDCPPDVFPLFALPFSVM--N
        MKGGHGALE+AKTV+EVAD+ WSAIECCHHH PS D+A+R  TEEEEL+ALRSENRRLR LL+QNLDLLQ +SESHCLLKDCPPDVFP F   F +    
Subjt:  MKGGHGALEMAKTVMEVADLTWSAIECCHHHMPSDDTAKRTPTEEEELEALRSENRRLRNLLKQNLDLLQKLSESHCLLKDCPPDVFPLFALPFSVM--N

Query:  LSGPSLDYDQKSF--CTRLVATVDSEKFLNEIKSLNEASNDGISYEFPFREA------TADILVNVSREAPSWWIWITEDMVPSKVEEWSGIDDESYVIV
         SG        +F    RLVATVDSEKFLNEIKSLNEAS DGISYEFPFREA      TADILVNVSREAPSWW+W+TEDMVP+KVEEWSGIDDE+YVIV
Subjt:  LSGPSLDYDQKSF--CTRLVATVDSEKFLNEIKSLNEASNDGISYEFPFREA------TADILVNVSREAPSWWIWITEDMVPSKVEEWSGIDDESYVIV

Query:  SEEHVVDAVAHFMAR------------------SIAKALNGMGSKMDKMFEIWHAGLLFYSLATWGLALAGLYNGRAILK
        SEEHVVDAVAHFMAR                  +IAKAL+GMGSK++KMFEIWHAGLLFYSLATWGLALAGLY GRAILK
Subjt:  SEEHVVDAVAHFMAR------------------SIAKALNGMGSKMDKMFEIWHAGLLFYSLATWGLALAGLYNGRAILK

A0A6J1H323 uncharacterized protein LOC111459551 (e value: 1.3e-104; identity: 72.18%)
Query:  DRDGRFVQMKGGHGALEMAKTVMEVADLTWSAIECCHHHMPSDDTAKRTPTEEEELEALRSENRRLRNLLKQNLDLLQKLSESHCLLKDCPPDVFPLFAL
        D + RF+QMKGGHGALE+AKTVMEVAD+ W+AIECCHHH PS+D A+   TEEE+LEALRSENRRLRNLL+QNLDLLQ+LSESHCLLKDCPPD++     
Subjt:  DRDGRFVQMKGGHGALEMAKTVMEVADLTWSAIECCHHHMPSDDTAKRTPTEEEELEALRSENRRLRNLLKQNLDLLQKLSESHCLLKDCPPDVFPLFAL

Query:  PFSVMNLSGPSLDYDQKSFCTRLVATVDSEKFLNEIKSLNEASNDGISYEFPFREA------TADILVNVSREAPSWWIWITEDMVPSKVEEWSGIDDES
                             RLVATVDSEKFLNEIKSLNEAS DGI+YEFPFREA      TA+ILVNVSR+APSWWIW+TEDMVPSKVEEWSGIDDES
Subjt:  PFSVMNLSGPSLDYDQKSFCTRLVATVDSEKFLNEIKSLNEASNDGISYEFPFREA------TADILVNVSREAPSWWIWITEDMVPSKVEEWSGIDDES

Query:  YVIVSEEHVVDAVAHFMAR------------------SIAKALNGMGSKMDKMFEIWHAGLLFYSLATWGLALAGLYNGRAILK
        YVIVSEEHVVDAVAHFMAR                  +IAKAL GMGSKM+KMFEIWHAGLLFYSLATWGLALAGLY GRAILK
Subjt:  YVIVSEEHVVDAVAHFMAR------------------SIAKALNGMGSKMDKMFEIWHAGLLFYSLATWGLALAGLYNGRAILK

A0A6J1JTR4 uncharacterized protein LOC111487766 (e value: 1.1e-100; identity: 72.46%)
Query:  MKGGHGALEMAKTVMEVADLTWSAIECCHHHMPSDDTAKRTPTEEEELEALRSENRRLRNLLKQNLDLLQKLSESHCLLKDCPPDVFPLFALPFSVMNLS
        MKGGHGALE+AKTVMEVAD+ W+AIE CHHH PS+D A+R  TEEE LEALRSENRRLRNLL+QNLDLLQ+LSESHCLLKDCPPD++             
Subjt:  MKGGHGALEMAKTVMEVADLTWSAIECCHHHMPSDDTAKRTPTEEEELEALRSENRRLRNLLKQNLDLLQKLSESHCLLKDCPPDVFPLFALPFSVMNLS

Query:  GPSLDYDQKSFCTRLVATVDSEKFLNEIKSLNEASNDGISYEFPFREA------TADILVNVSREAPSWWIWITEDMVPSKVEEWSGIDDESYVIVSEEH
                     RLVATVDSEKFLNEIKSLNEAS DGI+YEFPFRE       TA+ILVNVSR+APSWWIW+TEDMVPSKVEEWSGIDDESYVIVSEEH
Subjt:  GPSLDYDQKSFCTRLVATVDSEKFLNEIKSLNEASNDGISYEFPFREA------TADILVNVSREAPSWWIWITEDMVPSKVEEWSGIDDESYVIVSEEH

Query:  VVDAVAHFMAR------------------SIAKALNGMGSKMDKMFEIWHAGLLFYSLATWGLALAGLYNGRAILK
        VVDAVAHFMAR                  +IAKAL GMGSKM+KMFEIWHAGLLFYSLATWGLALAGLY GRAILK
Subjt:  VVDAVAHFMAR------------------SIAKALNGMGSKMDKMFEIWHAGLLFYSLATWGLALAGLYNGRAILK

A0A6J1JXE7 uncharacterized protein LOC111489722 (e value: 3.7e-99; identity: 71.01%)
Query:  MKGGHGALEMAKTVMEVADLTWSAIECCHHHMPSDDTAKRTPTEEEELEALRSENRRLRNLLKQNLDLLQKLSESHCLLKDCPPDVFPLFALPFSVMNLS
        MKGGHGALE+AKT MEVAD+ W+AIEC +HH P DD  KRTPTEEE L+ALRSENRRLR LL+QNL+LLQKLSESHCLL DCPPD++             
Subjt:  MKGGHGALEMAKTVMEVADLTWSAIECCHHHMPSDDTAKRTPTEEEELEALRSENRRLRNLLKQNLDLLQKLSESHCLLKDCPPDVFPLFALPFSVMNLS

Query:  GPSLDYDQKSFCTRLVATVDSEKFLNEIKSLNEASNDGISYEFPFREA------TADILVNVSREAPSWWIWITEDMVPSKVEEWSGIDDESYVIVSEEH
                     RLVATVDSEKFLNEIKSLNEAS DGI+YEFPFREA      TADILVNVSREAPSWW+W+TEDMVPSKVEEWSGIDDESYVIVSEEH
Subjt:  GPSLDYDQKSFCTRLVATVDSEKFLNEIKSLNEASNDGISYEFPFREA------TADILVNVSREAPSWWIWITEDMVPSKVEEWSGIDDESYVIVSEEH

Query:  VVDAVAHFMAR------------------SIAKALNGMGSKMDKMFEIWHAGLLFYSLATWGLALAGLYNGRAILK
        VVDAVAHFMAR                  +IAKAL+ MG  M+KMFEIWHAGLLFYSLATWGLALAGLY GRAILK
Subjt:  VVDAVAHFMAR------------------SIAKALNGMGSKMDKMFEIWHAGLLFYSLATWGLALAGLYNGRAILK

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G44770.1 unknown protein (e value: 5.6e-52; identity: 41.67%)
Query:  HGALEMAKTVMEVADLTWSAIECCHHHMPSDD-----TAKRTPTEEEELEALRSENRRLRNLLKQNLDLLQKLSESHCLLKDCPPDVFPLFALPFSVMNL
        H A+E+ KTV+EVAD+ W+A+E  HHH    D     T   +   + ELEALR ENRRLR LL+ NL L + L+ES     DCP D++            
Subjt:  HGALEMAKTVMEVADLTWSAIECCHHHMPSDD-----TAKRTPTEEEELEALRSENRRLRNLLKQNLDLLQKLSESHCLLKDCPPDVFPLFALPFSVMNL

Query:  SGPSLDYDQKSFCTRLVATVDSEKFLNEIKSLNEASNDGISYEFPFREATAD------ILVNVSREAPSWWIWITEDMVPSKVEEWSGIDDESYVIVSEE
                      RLV  V S  FL  +++L +A ++G   +FPF+E T D      +L+ +  + PSWW+ +T+DMVPS VEE S ID+E Y++V+EE
Subjt:  SGPSLDYDQKSFCTRLVATVDSEKFLNEIKSLNEASNDGISYEFPFREATAD------ILVNVSREAPSWWIWITEDMVPSKVEEWSGIDDESYVIVSEE

Query:  HVVDAVAHFMARSI---AKALN--------------GMGSKMDKMFEIWHAGLLFYSLATWGLALAGLYNGRAILK
        HV+DAVAHF+A+ I    KA N                 SK+ K+ +IWHAG +FY+L+TWGLA  GLY  R +LK
Subjt:  HVVDAVAHFMARSI---AKALN--------------GMGSKMDKMFEIWHAGLLFYSLATWGLALAGLYNGRAILK

AT1G44770.2 unknown protein (e value: 5.6e-52; identity: 41.82%)
Query:  HGALEMAKTVMEVADLTWSAIECCHHHMPSDD-----TAKRTPTEEEELEALRSENRRLRNLLKQNLDLLQKLSESHCLLKDCPPDVFPLFALPFSVMNL
        H A+E+ KTV+EVAD+ W+A+E  HHH    D     T   +   + ELEALR ENRRLR LL+ NL L + L+ES     DCP D++            
Subjt:  HGALEMAKTVMEVADLTWSAIECCHHHMPSDD-----TAKRTPTEEEELEALRSENRRLRNLLKQNLDLLQKLSESHCLLKDCPPDVFPLFALPFSVMNL

Query:  SGPSLDYDQKSFCTRLVATVDSEKFLNEIKSLNEASNDGISYEFPFREATAD-----ILVNVSREAPSWWIWITEDMVPSKVEEWSGIDDESYVIVSEEH
                      RLV  V S  FL  +++L +A ++G   +FPF+E T D     +L+ +  + PSWW+ +T+DMVPS VEE S ID+E Y++V+EEH
Subjt:  SGPSLDYDQKSFCTRLVATVDSEKFLNEIKSLNEASNDGISYEFPFREATAD-----ILVNVSREAPSWWIWITEDMVPSKVEEWSGIDDESYVIVSEEH

Query:  VVDAVAHFMARSI---AKALN--------------GMGSKMDKMFEIWHAGLLFYSLATWGLALAGLYNGRAILK
        V+DAVAHF+A+ I    KA N                 SK+ K+ +IWHAG +FY+L+TWGLA  GLY  R +LK
Subjt:  VVDAVAHFMARSI---AKALN--------------GMGSKMDKMFEIWHAGLLFYSLATWGLALAGLYNGRAILK

AT4G24590.1 unknown protein (e value: 4.0e-05; identity: 28.29%)
Query:  DSEKFLNEIKSLNEASNDGISYEFPFREATADILVNVSR---EAPSWWIWITEDMVPSKVEEWSGIDDESYVIVSEEHVVDAVAHFMAR-----------
        +++KF + IKS  E ++        FRE      V+V +   E  S W  ++ED +    EE  G  ++ YV+V EE + D +A FMA            
Subjt:  DSEKFLNEIKSLNEASNDGISYEFPFREATADILVNVSR---EAPSWWIWITEDMVPSKVEEWSGIDDESYVIVSEEHVVDAVAHFMAR-----------

Query:  ---SIAKALNGMGS---KMDKMFEIWHAGLLFYSLATWGLALAGLYNGRAIL
            + KAL+ M S   +  K+ + W    + Y++A+W     G+Y    IL
Subjt:  ---SIAKALNGMGS---KMDKMFEIWHAGLLFYSLATWGLALAGLYNGRAIL


Sequences
CDS sequence
ATGAAAAGGCCCCTAAATCCCAATTGCCCCAAAACGACGCCATTATTGAAGCGGGGCGATCGCGACGGAAGGTTTGTCCAGATGAAAGGCGGCCATGGCGCTCTGGAGAT
GGCCAAGACGGTCATGGAGGTCGCAGACCTGACCTGGTCTGCGATTGAATGTTGCCATCACCACATGCCCAGCGATGATACTGCCAAGCGCACGCCTACAGAGGAAGAAG
AGCTTGAGGCTCTGCGATCGGAGAATCGGAGATTGAGGAATTTGCTCAAGCAAAACCTCGACCTCCTTCAGAAGCTATCCGAATCGCACTGCTTGTTGAAGGATTGCCCT
CCTGATGTATTTCCTCTATTTGCCCTCCCGTTTTCAGTTATGAATCTTTCCGGCCCTAGTTTAGATTATGATCAGAAGTCTTTTTGTACGCGTCTCGTCGCTACAGTGGA
TTCTGAAAAGTTCTTAAATGAAATTAAATCCCTCAATGAAGCATCAAACGATGGAATTAGCTATGAATTTCCCTTCAGGGAAGCTACAGCTGATATTCTTGTGAATGTTA
GCCGTGAAGCACCCAGTTGGTGGATATGGATTACTGAAGATATGGTTCCGAGCAAAGTTGAGGAATGGAGTGGGATCGATGATGAAAGTTATGTGATTGTATCTGAAGAA
CATGTGGTGGATGCAGTTGCCCACTTTATGGCTAGATCAATTGCAAAAGCGCTGAATGGTATGGGCAGCAAGATGGACAAGATGTTTGAAATTTGGCATGCTGGGCTGCT
GTTTTATTCCTTAGCCACTTGGGGACTTGCACTGGCAGGCTTGTACAACGGTCGTGCTATATTGAAACGGCTGCCACTGGTATTCACCATACAAGCAAAGCGGTGA
mRNA sequence
ATGAAAAGGCCCCTAAATCCCAATTGCCCCAAAACGACGCCATTATTGAAGCGGGGCGATCGCGACGGAAGGTTTGTCCAGATGAAAGGCGGCCATGGCGCTCTGGAGAT
GGCCAAGACGGTCATGGAGGTCGCAGACCTGACCTGGTCTGCGATTGAATGTTGCCATCACCACATGCCCAGCGATGATACTGCCAAGCGCACGCCTACAGAGGAAGAAG
AGCTTGAGGCTCTGCGATCGGAGAATCGGAGATTGAGGAATTTGCTCAAGCAAAACCTCGACCTCCTTCAGAAGCTATCCGAATCGCACTGCTTGTTGAAGGATTGCCCT
CCTGATGTATTTCCTCTATTTGCCCTCCCGTTTTCAGTTATGAATCTTTCCGGCCCTAGTTTAGATTATGATCAGAAGTCTTTTTGTACGCGTCTCGTCGCTACAGTGGA
TTCTGAAAAGTTCTTAAATGAAATTAAATCCCTCAATGAAGCATCAAACGATGGAATTAGCTATGAATTTCCCTTCAGGGAAGCTACAGCTGATATTCTTGTGAATGTTA
GCCGTGAAGCACCCAGTTGGTGGATATGGATTACTGAAGATATGGTTCCGAGCAAAGTTGAGGAATGGAGTGGGATCGATGATGAAAGTTATGTGATTGTATCTGAAGAA
CATGTGGTGGATGCAGTTGCCCACTTTATGGCTAGATCAATTGCAAAAGCGCTGAATGGTATGGGCAGCAAGATGGACAAGATGTTTGAAATTTGGCATGCTGGGCTGCT
GTTTTATTCCTTAGCCACTTGGGGACTTGCACTGGCAGGCTTGTACAACGGTCGTGCTATATTGAAACGGCTGCCACTGGTATTCACCATACAAGCAAAGCGGTGA
Protein sequence
MKRPLNPNCPKTTPLLKRGDRDGRFVQMKGGHGALEMAKTVMEVADLTWSAIECCHHHMPSDDTAKRTPTEEEELEALRSENRRLRNLLKQNLDLLQKLSESHCLLKDCP
PDVFPLFALPFSVMNLSGPSLDYDQKSFCTRLVATVDSEKFLNEIKSLNEASNDGISYEFPFREATADILVNVSREAPSWWIWITEDMVPSKVEEWSGIDDESYVIVSEE
HVVDAVAHFMARSIAKALNGMGSKMDKMFEIWHAGLLFYSLATWGLALAGLYNGRAILKRLPLVFTIQAKR