; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg000036 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg000036
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
Descriptionroot hair specific 4
Genome locationscaffold1:212121..212873
RNA-Seq ExpressionSpg000036
SyntenySpg000036
Gene Ontology termsGO:0048564 - photosystem I assembly (biological process)
GO:0080183 - response to photooxidative stress (biological process)
GO:0009535 - chloroplast thylakoid membrane (cellular component)
InterPro domainsIPR040340 - Chloroplast enhancing stress tolerance protein


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7034827.1 hypothetical protein SDJN02_04559, partial [Cucurbita argyrosperma subsp. argyrosperma]9.1e-8569.43Show/hide
Query:  SDTLRPLSGGTTSRSYEFVGDVVIELSTQLKLESYSVPNSAYSSPRLGS-------------GLSRSKSCGEGRGRAPPHGLVENRVAIGESEEKARRGR
        +D L PLS GTT R YE V DVVIE+STQ KLESYS PNSAYSSP LGS             GL+RSKSCGEGRG+A PHGL+ENRV I E   K +   
Subjt:  SDTLRPLSGGTTSRSYEFVGDVVIELSTQLKLESYSVPNSAYSSPRLGS-------------GLSRSKSCGEGRGRAPPHGLVENRVAIGESEEKARRGR

Query:  -KAWRFRCGALCLLLP---GFGLKIGKGKGERKEEA----EAGGCISISISR-RVSLEKFECGSWASSGMVVHEDGE--AGMGSLYFDLPMELIRNSVGA
         KA RFRCGALCLLLP   G G K+GKGK ERKEE     E GGCISISIS  RVSLEKFECGSWASSGMV HEDGE  +GMGSLYFDLPMELIRNSVGA
Subjt:  -KAWRFRCGALCLLLP---GFGLKIGKGKGERKEEA----EAGGCISISISR-RVSLEKFECGSWASSGMVVHEDGE--AGMGSLYFDLPMELIRNSVGA

Query:  QTQTQPVKAAFVFDRDGNGEGVYRLPVWTKGKLAEESSSPSPSPCIITPRLRKARREFNALLEAH
        +TQ+ P + AFVF+RD  G GV+ LPVWTK KLAEES   + SPC+ITPRLR+AR EFNALLEAH
Subjt:  QTQTQPVKAAFVFDRDGNGEGVYRLPVWTKGKLAEESSSPSPSPCIITPRLRKARREFNALLEAH

XP_022925963.1 uncharacterized protein LOC111433224 [Cucurbita moschata]9.1e-8569.43Show/hide
Query:  SDTLRPLSGGTTSRSYEFVGDVVIELSTQLKLESYSVPNSAYSSPRLGS-------------GLSRSKSCGEGRGRAPPHGLVENRVAIGESEEKARRGR
        +D L PLS GTT R YE V DVVIE+STQ KLESYS PNSAYSSP LGS             GL+RSKSCGEGRG+A PHGL+ENRV I E   K +   
Subjt:  SDTLRPLSGGTTSRSYEFVGDVVIELSTQLKLESYSVPNSAYSSPRLGS-------------GLSRSKSCGEGRGRAPPHGLVENRVAIGESEEKARRGR

Query:  -KAWRFRCGALCLLLP---GFGLKIGKGKGERKEEA----EAGGCISISISR-RVSLEKFECGSWASSGMVVHEDGE--AGMGSLYFDLPMELIRNSVGA
         KA RFRCGALCLLLP   G G K+GKGK ERKEE     E GGCISISIS  RVSLEKFECGSWASSGMV HEDGE  +GMGSLYFDLPMELIRNSVGA
Subjt:  -KAWRFRCGALCLLLP---GFGLKIGKGKGERKEEA----EAGGCISISISR-RVSLEKFECGSWASSGMVVHEDGE--AGMGSLYFDLPMELIRNSVGA

Query:  QTQTQPVKAAFVFDRDGNGEGVYRLPVWTKGKLAEESSSPSPSPCIITPRLRKARREFNALLEAH
        +TQ+ P + AFVF+RD  G GV+ LPVWTK KLAEES   + SPC+ITPRLR+AR EFNALLEAH
Subjt:  QTQTQPVKAAFVFDRDGNGEGVYRLPVWTKGKLAEESSSPSPSPCIITPRLRKARREFNALLEAH

XP_022978627.1 uncharacterized protein LOC111478548 [Cucurbita maxima]1.4e-8569.81Show/hide
Query:  SDTLRPLSGGTTSRSYEFVGDVVIELSTQLKLESYSVPNSAYSSPRLGS-------------GLSRSKSCGEGRGRAPPHGLVENRVAIGESEEKARRGR
        +D L PLS GTT RSYE V DVVI++STQ KLESYS PNSAYSSP LGS             GL+RSKSCGEGRG+A PHGL++NRV I E   K +   
Subjt:  SDTLRPLSGGTTSRSYEFVGDVVIELSTQLKLESYSVPNSAYSSPRLGS-------------GLSRSKSCGEGRGRAPPHGLVENRVAIGESEEKARRGR

Query:  -KAWRFRCGALCLLLP---GFGLKIGKGKGERKEEA----EAGGCISISISR-RVSLEKFECGSWASSGMVVHEDGEA--GMGSLYFDLPMELIRNSVGA
         KA RFRCGALCLLLP   G G K+GKGK ERKEE     E GGCISISIS  RVSLEKFECGSWASSGMV HEDGE+  GMGSLYFDLPMELIRNSVGA
Subjt:  -KAWRFRCGALCLLLP---GFGLKIGKGKGERKEEA----EAGGCISISISR-RVSLEKFECGSWASSGMVVHEDGEA--GMGSLYFDLPMELIRNSVGA

Query:  QTQTQPVKAAFVFDRDGNGEGVYRLPVWTKGKLAEESSSPSPSPCIITPRLRKARREFNALLEAH
        +TQ+ P +AAFVFDRD  G GV+ LPVWTK KLAEES   + SPC+ITPRLR+AR EFNALLEAH
Subjt:  QTQTQPVKAAFVFDRDGNGEGVYRLPVWTKGKLAEESSSPSPSPCIITPRLRKARREFNALLEAH

XP_023543164.1 uncharacterized protein LOC111803119 [Cucurbita pepo subsp. pepo]1.1e-8570.19Show/hide
Query:  SDTLRPLSGGTTSRSYEFVGDVVIELSTQLKLESYSVPNSAYSSPRLGS-------------GLSRSKSCGEGRGRAPPHGLVENRVAIGESEEKARRGR
        +D L PLS GTT R YE V DVVIE+STQ KLESYS PNSAYSSP LGS             GL+RSKSCGEGRGRA PHGL+ENRV I E   K +   
Subjt:  SDTLRPLSGGTTSRSYEFVGDVVIELSTQLKLESYSVPNSAYSSPRLGS-------------GLSRSKSCGEGRGRAPPHGLVENRVAIGESEEKARRGR

Query:  -KAWRFRCGALCLLLP---GFGLKIGKGKGERKEEA----EAGGCISISISR-RVSLEKFECGSWASSGMVVHEDGE--AGMGSLYFDLPMELIRNSVGA
         KA RFRCGALCLLLP   G G K+GKGK ERKEE     E GGCISISIS  RVSLEKFECGSWASSGMV HEDGE  +GMGSLYFDLPMELIRNSVGA
Subjt:  -KAWRFRCGALCLLLP---GFGLKIGKGKGERKEEA----EAGGCISISISR-RVSLEKFECGSWASSGMVVHEDGE--AGMGSLYFDLPMELIRNSVGA

Query:  QTQTQPVKAAFVFDRDGNGEGVYRLPVWTKGKLAEESSSPSPSPCIITPRLRKARREFNALLEAH
        +TQ+ P + AFVFDRD  G GV+ LPVWTK KLAEES   + SPC+ITPRLR+AR EFNALLEAH
Subjt:  QTQTQPVKAAFVFDRDGNGEGVYRLPVWTKGKLAEESSSPSPSPCIITPRLRKARREFNALLEAH

XP_038877520.1 uncharacterized protein LOC120069777 [Benincasa hispida]1.4e-8570.93Show/hide
Query:  SDTLRPLSGGTTSRSYEFVGDVVIELSTQLKLESYSVPNSAYSSPRL---------GSGLSRSKSCGEGRGRAPPHGLVENRVAIGESEEKARRGRKAWR
        +DT+ PLS GTTSRSYEFV DVVIE+STQLKL SYSVPNSAYSSPRL         G GL+RSKSCGEGRG+A PH L+EN+V + E   K +   KA R
Subjt:  SDTLRPLSGGTTSRSYEFVGDVVIELSTQLKLESYSVPNSAYSSPRL---------GSGLSRSKSCGEGRGRAPPHGLVENRVAIGESEEKARRGRKAWR

Query:  FRCGALCLLLP---GFGLKIG----KGKGERKEEAEAGGCISISISRRVSLEKFECGSWASSGMVVHEDGEAGMGSLYFDLPMELIRNSVGAQTQTQPVK
        FRCGALCLLLP   G G K+G    KGK ERKEEAE G CISISISRRVSLEKFECGSWASSGMVVHEDGE  +GS YFDLPMELIRNSVG QTQ+ PV 
Subjt:  FRCGALCLLLP---GFGLKIG----KGKGERKEEAEAGGCISISISRRVSLEKFECGSWASSGMVVHEDGEAGMGSLYFDLPMELIRNSVGAQTQTQPVK

Query:  AAFVFDRDGNGEGVYRLPVWTKGKLAEESSSPSPSPCIITPRLRKARREFNALLEAHT
        AAFVFD        + LP+WTK  LAEES   + SPCIITPRLRKAR EFNALLEAHT
Subjt:  AAFVFDRDGNGEGVYRLPVWTKGKLAEESSSPSPSPCIITPRLRKARREFNALLEAHT

TrEMBL top hitse value%identityAlignment
A0A0A0KME4 Uncharacterized protein1.1e-7568.25Show/hide
Query:  PLSGGTTSRSYEFVGDVVIELSTQLKLESYSVPNSAYSSPRLG------SGLSRSKSCGEGRGRAPPHGLVENRVAIGESEEKARRGR-KAWRFR-CGAL
        PLS GT+SR YEFV DVVIE+S QL     S PNSAYSSPRL        GL+RS+SCGEGRG+A PHGL+EN+V + E  +K +    K  RFR CGAL
Subjt:  PLSGGTTSRSYEFVGDVVIELSTQLKLESYSVPNSAYSSPRLG------SGLSRSKSCGEGRGRAPPHGLVENRVAIGESEEKARRGR-KAWRFR-CGAL

Query:  CLLLPGFGLKIG----KGKGERKEEAEAGGCISISISRRVSLEKFECGSWASSGMVVHEDGEAGMGSLYFDLPMELIRNSVGAQTQTQPVKAAFVFDRDG
        CLLLP  G K+G    KGK E++EEAE G CISISISRRVSLEKFECGSWASSGMVVHEDGE+  GSLYFDLPMELIRNSV AQTQ+ PV AAFVF    
Subjt:  CLLLPGFGLKIG----KGKGERKEEAEAGGCISISISRRVSLEKFECGSWASSGMVVHEDGEAGMGSLYFDLPMELIRNSVGAQTQTQPVKAAFVFDRDG

Query:  NGEGVYRLPVWTKGKLAEESSSPSPSPCIITPRLRKARREFNALLEAHTHTL
        NG G     VW K KLAEES   + SPCIITPRLRKAR+EFNALLEAHTH L
Subjt:  NGEGVYRLPVWTKGKLAEESSSPSPSPCIITPRLRKARREFNALLEAHTHTL

A0A1S3AZD3 uncharacterized protein LOC1034842326.2e-7164.03Show/hide
Query:  SDTLRPLSGGTTSRSYEFVGDVVIELSTQLKLESYSVPNSAYSSPRL------GSGLSRSKSCGEGRGRAPPHGLVENRVAIGESEEKARRGR-KAWRFR
        SD + PLS   +SR YEFV DVV+ +S QL     S PNS YSSPRL        GL+RSKSCG+GRG+A PHGL+EN++   E  +K +    K  RF+
Subjt:  SDTLRPLSGGTTSRSYEFVGDVVIELSTQLKLESYSVPNSAYSSPRL------GSGLSRSKSCGEGRGRAPPHGLVENRVAIGESEEKARRGR-KAWRFR

Query:  CGALCLLLPGFGLKIG----KGKGERKEEAEAGGCISISISRRVSLEKFECGSWASSGMVVHEDGEAGMGSLYFDLPMELIRNSVGAQTQTQPVKAAFVF
        CGALCLLLP  G K+G    KGK E+KEEAE G CISISISRRVSL+KFECGSWASSGMVVHE+GE+  GSLYFDLPMELIRNSV AQ+Q+ PV AAFVF
Subjt:  CGALCLLLPGFGLKIG----KGKGERKEEAEAGGCISISISRRVSLEKFECGSWASSGMVVHEDGEAGMGSLYFDLPMELIRNSVGAQTQTQPVKAAFVF

Query:  DRDGNGEGVYRLPVWTKGKLAEESSSPSPSPCIITPRLRKARREFNALLEAHT
        D     EG     VW K KLA+ES   + SPCIITPRLRKAR+EFNALLEAHT
Subjt:  DRDGNGEGVYRLPVWTKGKLAEESSSPSPSPCIITPRLRKARREFNALLEAHT

A0A5A7UFW0 Ycf3-interacting protein 11.1e-7064.43Show/hide
Query:  SDTLRPLSGGTTSRSYEFVGDVVIELSTQLKLESYSVPNSAYSSPRL------GSGLSRSKSCGEGRGRAPPHGLVENRVAIGESEEKARRGR-KAWRFR
        SD + PLS   +SR YEFV DVV+ +S QL     S PNS YSSPRL        GL+RSKSCG+GRG+A PHGL+EN++   E  +K +    K  RF+
Subjt:  SDTLRPLSGGTTSRSYEFVGDVVIELSTQLKLESYSVPNSAYSSPRL------GSGLSRSKSCGEGRGRAPPHGLVENRVAIGESEEKARRGR-KAWRFR

Query:  CGALCLLLPGFGLKIG----KGKGERKEEAEAGGCISISISRRVSLEKFECGSWASSGMVVHEDGEAGMGSLYFDLPMELIRNSVGAQTQTQPVKAAFVF
        CGALCLLLP  G K+G    KGK E+KEEAE G CISISISRRVSLEKFECGSWASSGMVVHE+GE+  GSLYFDLPMELIRNSV AQ+Q+ PV AAFVF
Subjt:  CGALCLLLPGFGLKIG----KGKGERKEEAEAGGCISISISRRVSLEKFECGSWASSGMVVHEDGEAGMGSLYFDLPMELIRNSVGAQTQTQPVKAAFVF

Query:  DRDGNGEGVYRLPVWTKGKLAEESSSPSPSPCIITPRLRKARREFNALLEAHT
        D    G+GV   P     KLAEES   + SPCIITPRLRKAR+EFNALLEAHT
Subjt:  DRDGNGEGVYRLPVWTKGKLAEESSSPSPSPCIITPRLRKARREFNALLEAHT

A0A6J1EGQ7 uncharacterized protein LOC1114332244.4e-8569.43Show/hide
Query:  SDTLRPLSGGTTSRSYEFVGDVVIELSTQLKLESYSVPNSAYSSPRLGS-------------GLSRSKSCGEGRGRAPPHGLVENRVAIGESEEKARRGR
        +D L PLS GTT R YE V DVVIE+STQ KLESYS PNSAYSSP LGS             GL+RSKSCGEGRG+A PHGL+ENRV I E   K +   
Subjt:  SDTLRPLSGGTTSRSYEFVGDVVIELSTQLKLESYSVPNSAYSSPRLGS-------------GLSRSKSCGEGRGRAPPHGLVENRVAIGESEEKARRGR

Query:  -KAWRFRCGALCLLLP---GFGLKIGKGKGERKEEA----EAGGCISISISR-RVSLEKFECGSWASSGMVVHEDGE--AGMGSLYFDLPMELIRNSVGA
         KA RFRCGALCLLLP   G G K+GKGK ERKEE     E GGCISISIS  RVSLEKFECGSWASSGMV HEDGE  +GMGSLYFDLPMELIRNSVGA
Subjt:  -KAWRFRCGALCLLLP---GFGLKIGKGKGERKEEA----EAGGCISISISR-RVSLEKFECGSWASSGMVVHEDGE--AGMGSLYFDLPMELIRNSVGA

Query:  QTQTQPVKAAFVFDRDGNGEGVYRLPVWTKGKLAEESSSPSPSPCIITPRLRKARREFNALLEAH
        +TQ+ P + AFVF+RD  G GV+ LPVWTK KLAEES   + SPC+ITPRLR+AR EFNALLEAH
Subjt:  QTQTQPVKAAFVFDRDGNGEGVYRLPVWTKGKLAEESSSPSPSPCIITPRLRKARREFNALLEAH

A0A6J1ILL2 uncharacterized protein LOC1114785486.8e-8669.81Show/hide
Query:  SDTLRPLSGGTTSRSYEFVGDVVIELSTQLKLESYSVPNSAYSSPRLGS-------------GLSRSKSCGEGRGRAPPHGLVENRVAIGESEEKARRGR
        +D L PLS GTT RSYE V DVVI++STQ KLESYS PNSAYSSP LGS             GL+RSKSCGEGRG+A PHGL++NRV I E   K +   
Subjt:  SDTLRPLSGGTTSRSYEFVGDVVIELSTQLKLESYSVPNSAYSSPRLGS-------------GLSRSKSCGEGRGRAPPHGLVENRVAIGESEEKARRGR

Query:  -KAWRFRCGALCLLLP---GFGLKIGKGKGERKEEA----EAGGCISISISR-RVSLEKFECGSWASSGMVVHEDGEA--GMGSLYFDLPMELIRNSVGA
         KA RFRCGALCLLLP   G G K+GKGK ERKEE     E GGCISISIS  RVSLEKFECGSWASSGMV HEDGE+  GMGSLYFDLPMELIRNSVGA
Subjt:  -KAWRFRCGALCLLLP---GFGLKIGKGKGERKEEA----EAGGCISISISR-RVSLEKFECGSWASSGMVVHEDGEA--GMGSLYFDLPMELIRNSVGA

Query:  QTQTQPVKAAFVFDRDGNGEGVYRLPVWTKGKLAEESSSPSPSPCIITPRLRKARREFNALLEAH
        +TQ+ P +AAFVFDRD  G GV+ LPVWTK KLAEES   + SPC+ITPRLR+AR EFNALLEAH
Subjt:  QTQTQPVKAAFVFDRDGNGEGVYRLPVWTKGKLAEESSSPSPSPCIITPRLRKARREFNALLEAH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G30850.1 root hair specific 42.1e-1834.44Show/hide
Query:  FRCGALCLLLPGFGL-KIGKGKGERKEEAE-----AGGCISISISRRVSLEKFECGSWASSGMVVHEDGEAGMGSLYFDLPMELI----RNSVGAQTQTQ
        F+C A CL LPGFG  K+ +   +R+   E     A      ++S R SLEKFECGSWAS+  ++ ++G      L+FD P+E+     R   G +   +
Subjt:  FRCGALCLLLPGFGL-KIGKGKGERKEEAE-----AGGCISISISRRVSLEKFECGSWASSGMVVHEDGEAGMGSLYFDLPMELI----RNSVGAQTQTQ

Query:  PVKAAFVFDRD---------------------GNGEGVYRLPVWTKGKLAEESSSPSPSPCIITPRLRKARREFNALLEA
        PV + F+FDR+                            R+   T    A  S   SP  C ITPRLRKAR +FN  L A
Subjt:  PVKAAFVFDRD---------------------GNGEGVYRLPVWTKGKLAEESSSPSPSPCIITPRLRKARREFNALLEA

AT2G34910.1 BEST Arabidopsis thaliana protein match is: root hair specific 4 (TAIR:AT1G30850.1)5.1e-1735.84Show/hide
Query:  FRCGALCLLLPGFGLK-IGKGKGE---RKEEAEAGGCISISISRRVSLEKFECGSWASSGMVVHEDGEAGMGSLYFDLPMELIRNSVGAQTQTQPVKAAF
        F+C A CL LPGFG + +   K E   +K+  +A    + ++S   SLEKFECGSWAS+  +  E+G      LY DLP+E+I+   G     +PV + F
Subjt:  FRCGALCLLLPGFGLK-IGKGKGE---RKEEAEAGGCISISISRRVSLEKFECGSWASSGMVVHEDGEAGMGSLYFDLPMELIRNSVGAQTQTQPVKAAF

Query:  VFDRDGNG---EGVYRLPVWTKGK----LAEES-------------SSPSPSPCIITPRLRKARREFNALLEA
         FD++        V +      G+    LAE S             S P+     ITPRL KAR +FN  L A
Subjt:  VFDRDGNG---EGVYRLPVWTKGK----LAEES-------------SSPSPSPCIITPRLRKARREFNALLEA

AT4G20190.1 unknown protein6.4e-2036.61Show/hide
Query:  FRCGALCLLLPGFGLKIGKGK---GERKEEAEAGGCISIS----------------ISRRVSLEKFECGSWASSGMVVHEDGEAGMGSLYFDLPMELIRN
        F+C ALCL LPGF     KGK     RK ++      +++                +S R SLE+FECGSW SS M+   D  A +G  +FDLP ELI+ 
Subjt:  FRCGALCLLLPGFGLKIGKGK---GERKEEAEAGGCISIS----------------ISRRVSLEKFECGSWASSGMVVHEDGEAGMGSLYFDLPMELIRN

Query:  SVGAQTQTQPVKAAFVFDRDGNGEGVYRLPVWTKGKLAEES-----------SSPSPSPC----IITPRLRKARREFNALLEA
          G   Q  PV AAFVFD++ N +   +  + T G  +  S           SSP   P      ITPRL +A  +F++ LEA
Subjt:  SVGAQTQTQPVKAAFVFDRDGNGEGVYRLPVWTKGKLAEES-----------SSPSPSPC----IITPRLRKARREFNALLEA

AT5G44660.1 unknown protein2.1e-1832.59Show/hide
Query:  ESYSVPNSAYSSPRLGSGL----------------SRSKSCGE-GRGRAPPHGLVENRVAIGESEEKARRGRKAW--RFRCGALCLLLPGFGLKIGKGKG
        E  S+PNS   SP+  SGL                 RSKSCG   +  +     + N   I     K+         RF+C ALCL LPGF     KGK 
Subjt:  ESYSVPNSAYSSPRLGSGL----------------SRSKSCGE-GRGRAPPHGLVENRVAIGESEEKARRGRKAW--RFRCGALCLLLPGFGLKIGKGKG

Query:  ERKEEAE--------------AGGCISIS--------------ISRRVSLEKFECGSWASSGMVVHEDGEAGMGSLYFDLPMELIRNSVGAQTQTQPVKA
         R  + +              +   I++S              IS R S+EKF+CGS+ S        GE G G+ +FDLP ELI++  G     +PV A
Subjt:  ERKEEAE--------------AGGCISIS--------------ISRRVSLEKFECGSWASSGMVVHEDGEAGMGSLYFDLPMELIRNSVGAQTQTQPVKA

Query:  AFVFDR---DGNGEGVYRLPVWTKGKLAEES----------SSP--SPSPCIITPRLRKARREFNALLEA
        AFVFD+   +   +GV ++   +K + A ES          SSP   P+   I+PRL +A + FNA LEA
Subjt:  AFVFDR---DGNGEGVYRLPVWTKGKLAEES----------SSP--SPSPCIITPRLRKARREFNALLEA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCACTTCCACCCTCTGATACATTACGACCTCTATCTGGTGGCACCACTAGCAGAAGCTACGAATTTGTTGGGGATGTGGTTATTGAGTTGTCGACGCAATTGAAGTT
GGAAAGCTACAGTGTCCCAAACTCGGCCTATTCATCCCCTCGGTTGGGCAGTGGACTGAGTCGGAGTAAATCCTGTGGTGAAGGAAGAGGGAGGGCACCACCGCATGGTC
TTGTTGAGAATAGAGTGGCCATAGGGGAAAGCGAAGAGAAAGCGAGGCGCGGGAGGAAAGCTTGGCGTTTCAGATGTGGGGCACTCTGCTTGTTGCTGCCAGGATTTGGT
CTTAAGATTGGAAAAGGGAAGGGGGAGAGAAAGGAAGAGGCGGAGGCAGGAGGGTGTATATCCATATCCATATCGAGGAGAGTTTCTTTAGAAAAATTCGAATGCGGTTC
ATGGGCTTCATCGGGCATGGTGGTTCATGAGGACGGGGAGGCAGGGATGGGGAGCCTTTATTTTGATCTGCCAATGGAATTGATAAGGAACAGTGTGGGTGCTCAGACAC
AAACACAACCAGTAAAGGCCGCTTTTGTATTCGATAGAGATGGGAACGGGGAGGGAGTTTATCGTCTTCCTGTTTGGACCAAAGGAAAATTGGCGGAGGAATCAAGCTCC
CCATCCCCATCTCCATGCATCATTACCCCACGCTTGCGCAAAGCCAGACGAGAGTTCAATGCACTTTTGGAAGCCCACACTCACACTCTATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCACTTCCACCCTCTGATACATTACGACCTCTATCTGGTGGCACCACTAGCAGAAGCTACGAATTTGTTGGGGATGTGGTTATTGAGTTGTCGACGCAATTGAAGTT
GGAAAGCTACAGTGTCCCAAACTCGGCCTATTCATCCCCTCGGTTGGGCAGTGGACTGAGTCGGAGTAAATCCTGTGGTGAAGGAAGAGGGAGGGCACCACCGCATGGTC
TTGTTGAGAATAGAGTGGCCATAGGGGAAAGCGAAGAGAAAGCGAGGCGCGGGAGGAAAGCTTGGCGTTTCAGATGTGGGGCACTCTGCTTGTTGCTGCCAGGATTTGGT
CTTAAGATTGGAAAAGGGAAGGGGGAGAGAAAGGAAGAGGCGGAGGCAGGAGGGTGTATATCCATATCCATATCGAGGAGAGTTTCTTTAGAAAAATTCGAATGCGGTTC
ATGGGCTTCATCGGGCATGGTGGTTCATGAGGACGGGGAGGCAGGGATGGGGAGCCTTTATTTTGATCTGCCAATGGAATTGATAAGGAACAGTGTGGGTGCTCAGACAC
AAACACAACCAGTAAAGGCCGCTTTTGTATTCGATAGAGATGGGAACGGGGAGGGAGTTTATCGTCTTCCTGTTTGGACCAAAGGAAAATTGGCGGAGGAATCAAGCTCC
CCATCCCCATCTCCATGCATCATTACCCCACGCTTGCGCAAAGCCAGACGAGAGTTCAATGCACTTTTGGAAGCCCACACTCACACTCTATGA
Protein sequenceShow/hide protein sequence
MALPPSDTLRPLSGGTTSRSYEFVGDVVIELSTQLKLESYSVPNSAYSSPRLGSGLSRSKSCGEGRGRAPPHGLVENRVAIGESEEKARRGRKAWRFRCGALCLLLPGFG
LKIGKGKGERKEEAEAGGCISISISRRVSLEKFECGSWASSGMVVHEDGEAGMGSLYFDLPMELIRNSVGAQTQTQPVKAAFVFDRDGNGEGVYRLPVWTKGKLAEESSS
PSPSPCIITPRLRKARREFNALLEAHTHTL