; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0014170 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0014170
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptionroot hair specific 4
Genome locationchr1:55882344..55883108
RNA-Seq ExpressionLag0014170
SyntenyLag0014170
Gene Ontology termsGO:0048564 - photosystem I assembly (biological process)
GO:0080183 - response to photooxidative stress (biological process)
GO:0009535 - chloroplast thylakoid membrane (cellular component)
InterPro domainsIPR040340 - Chloroplast enhancing stress tolerance protein


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7034827.1 hypothetical protein SDJN02_04559, partial [Cucurbita argyrosperma subsp. argyrosperma]5.1e-8368.68Show/hide
Query:  SDTLRPLSGGTTSRSYEFVGDVVIELSTQLKLGSYSVPNSAYSPPRLGS-------------GLSRSKSCGEGRGRAPPHGLVENRVAIGESEEKARRGR
        +D L PLS GTT R YE V DVVIE+STQ KL SYS PNSAYS P LGS             GL+RSKSCGEGRG+A PHGL+ENRV I E   K +   
Subjt:  SDTLRPLSGGTTSRSYEFVGDVVIELSTQLKLGSYSVPNSAYSPPRLGS-------------GLSRSKSCGEGRGRAPPHGLVENRVAIGESEEKARRGR

Query:  -KAWRFRCGALCLLLL---GFGLKIGKGKGERKEEAE--AEAGGCISISISR-RVSLEKFECGSWASSGMVVHEDGE--AGMGSLYFDLPMELIRNSVGA
         KA RFRCGALCLLL    G G K+GKGK ERKEE E   E GGCISISIS  RVSLEKFECGSWASSGMV HEDGE  +GMGSLYFDLPMELIRNSVGA
Subjt:  -KAWRFRCGALCLLLL---GFGLKIGKGKGERKEEAE--AEAGGCISISISR-RVSLEKFECGSWASSGMVVHEDGE--AGMGSLYFDLPMELIRNSVGA

Query:  QTQTQTQPVKAAFVFDRD--GVYRLPVWTKGKLAEESGSPSPSPCIITPRLRKARREFNALLEAH
        +TQ+   P + AFVF+RD  GV+ LPVWTK KLAEESG  + SPC+ITPRLR+AR EFNALLEAH
Subjt:  QTQTQTQPVKAAFVFDRD--GVYRLPVWTKGKLAEESGSPSPSPCIITPRLRKARREFNALLEAH

XP_022925963.1 uncharacterized protein LOC111433224 [Cucurbita moschata]5.1e-8368.68Show/hide
Query:  SDTLRPLSGGTTSRSYEFVGDVVIELSTQLKLGSYSVPNSAYSPPRLGS-------------GLSRSKSCGEGRGRAPPHGLVENRVAIGESEEKARRGR
        +D L PLS GTT R YE V DVVIE+STQ KL SYS PNSAYS P LGS             GL+RSKSCGEGRG+A PHGL+ENRV I E   K +   
Subjt:  SDTLRPLSGGTTSRSYEFVGDVVIELSTQLKLGSYSVPNSAYSPPRLGS-------------GLSRSKSCGEGRGRAPPHGLVENRVAIGESEEKARRGR

Query:  -KAWRFRCGALCLLLL---GFGLKIGKGKGERKEEAE--AEAGGCISISISR-RVSLEKFECGSWASSGMVVHEDGE--AGMGSLYFDLPMELIRNSVGA
         KA RFRCGALCLLL    G G K+GKGK ERKEE E   E GGCISISIS  RVSLEKFECGSWASSGMV HEDGE  +GMGSLYFDLPMELIRNSVGA
Subjt:  -KAWRFRCGALCLLLL---GFGLKIGKGKGERKEEAE--AEAGGCISISISR-RVSLEKFECGSWASSGMVVHEDGE--AGMGSLYFDLPMELIRNSVGA

Query:  QTQTQTQPVKAAFVFDRD--GVYRLPVWTKGKLAEESGSPSPSPCIITPRLRKARREFNALLEAH
        +TQ+   P + AFVF+RD  GV+ LPVWTK KLAEESG  + SPC+ITPRLR+AR EFNALLEAH
Subjt:  QTQTQTQPVKAAFVFDRD--GVYRLPVWTKGKLAEESGSPSPSPCIITPRLRKARREFNALLEAH

XP_022978627.1 uncharacterized protein LOC111478548 [Cucurbita maxima]7.8e-8469.06Show/hide
Query:  SDTLRPLSGGTTSRSYEFVGDVVIELSTQLKLGSYSVPNSAYSPPRLGS-------------GLSRSKSCGEGRGRAPPHGLVENRVAIGESEEKARRGR
        +D L PLS GTT RSYE V DVVI++STQ KL SYS PNSAYS P LGS             GL+RSKSCGEGRG+A PHGL++NRV I E   K +   
Subjt:  SDTLRPLSGGTTSRSYEFVGDVVIELSTQLKLGSYSVPNSAYSPPRLGS-------------GLSRSKSCGEGRGRAPPHGLVENRVAIGESEEKARRGR

Query:  -KAWRFRCGALCLLLL---GFGLKIGKGKGERKEEAE--AEAGGCISISISR-RVSLEKFECGSWASSGMVVHEDGEA--GMGSLYFDLPMELIRNSVGA
         KA RFRCGALCLLL    G G K+GKGK ERKEE E   E GGCISISIS  RVSLEKFECGSWASSGMV HEDGE+  GMGSLYFDLPMELIRNSVGA
Subjt:  -KAWRFRCGALCLLLL---GFGLKIGKGKGERKEEAE--AEAGGCISISISR-RVSLEKFECGSWASSGMVVHEDGEA--GMGSLYFDLPMELIRNSVGA

Query:  QTQTQTQPVKAAFVFDRD--GVYRLPVWTKGKLAEESGSPSPSPCIITPRLRKARREFNALLEAH
        +TQ+   P +AAFVFDRD  GV+ LPVWTK KLAEESG  + SPC+ITPRLR+AR EFNALLEAH
Subjt:  QTQTQTQPVKAAFVFDRD--GVYRLPVWTKGKLAEESGSPSPSPCIITPRLRKARREFNALLEAH

XP_023543164.1 uncharacterized protein LOC111803119 [Cucurbita pepo subsp. pepo]6.0e-8469.43Show/hide
Query:  SDTLRPLSGGTTSRSYEFVGDVVIELSTQLKLGSYSVPNSAYSPPRLGS-------------GLSRSKSCGEGRGRAPPHGLVENRVAIGESEEKARRGR
        +D L PLS GTT R YE V DVVIE+STQ KL SYS PNSAYS P LGS             GL+RSKSCGEGRGRA PHGL+ENRV I E   K +   
Subjt:  SDTLRPLSGGTTSRSYEFVGDVVIELSTQLKLGSYSVPNSAYSPPRLGS-------------GLSRSKSCGEGRGRAPPHGLVENRVAIGESEEKARRGR

Query:  -KAWRFRCGALCLLLL---GFGLKIGKGKGERKEEAE--AEAGGCISISISR-RVSLEKFECGSWASSGMVVHEDGE--AGMGSLYFDLPMELIRNSVGA
         KA RFRCGALCLLL    G G K+GKGK ERKEE E   E GGCISISIS  RVSLEKFECGSWASSGMV HEDGE  +GMGSLYFDLPMELIRNSVGA
Subjt:  -KAWRFRCGALCLLLL---GFGLKIGKGKGERKEEAE--AEAGGCISISISR-RVSLEKFECGSWASSGMVVHEDGE--AGMGSLYFDLPMELIRNSVGA

Query:  QTQTQTQPVKAAFVFDRD--GVYRLPVWTKGKLAEESGSPSPSPCIITPRLRKARREFNALLEAH
        +TQ+   P + AFVFDRD  GV+ LPVWTK KLAEESG  + SPC+ITPRLR+AR EFNALLEAH
Subjt:  QTQTQTQPVKAAFVFDRD--GVYRLPVWTKGKLAEESGSPSPSPCIITPRLRKARREFNALLEAH

XP_038877520.1 uncharacterized protein LOC120069777 [Benincasa hispida]4.1e-8571.09Show/hide
Query:  SDTLRPLSGGTTSRSYEFVGDVVIELSTQLKLGSYSVPNSAYSPPRL---------GSGLSRSKSCGEGRGRAPPHGLVENRVAIGESEEKARRGRKAWR
        +DT+ PLS GTTSRSYEFV DVVIE+STQLKLGSYSVPNSAYS PRL         G GL+RSKSCGEGRG+A PH L+EN+V + E   K +   KA R
Subjt:  SDTLRPLSGGTTSRSYEFVGDVVIELSTQLKLGSYSVPNSAYSPPRL---------GSGLSRSKSCGEGRGRAPPHGLVENRVAIGESEEKARRGRKAWR

Query:  FRCGALCLLLL---GFGLKIGKGKGERKEE--AEAEAGGCISISISRRVSLEKFECGSWASSGMVVHEDGEAGMGSLYFDLPMELIRNSVGAQTQTQTQP
        FRCGALCLLL    G G K+GKGK + KEE   EAE G CISISISRRVSLEKFECGSWASSGMVVHEDGE  +GS YFDLPMELIRNSVG QTQ+   P
Subjt:  FRCGALCLLLL---GFGLKIGKGKGERKEE--AEAEAGGCISISISRRVSLEKFECGSWASSGMVVHEDGEAGMGSLYFDLPMELIRNSVGAQTQTQTQP

Query:  VKAAFVFDRDGVYRLPVWTKGKLAEESGSPSPSPCIITPRLRKARREFNALLEAHT
        V AAFVFD    + LP+WTK  LAEESG  + SPCIITPRLRKAR EFNALLEAHT
Subjt:  VKAAFVFDRDGVYRLPVWTKGKLAEESGSPSPSPCIITPRLRKARREFNALLEAHT

TrEMBL top hitse value%identityAlignment
A0A0A0KME4 Uncharacterized protein2.7e-7467.2Show/hide
Query:  PLSGGTTSRSYEFVGDVVIELSTQLKLGSYSVPNSAYSPPRLG------SGLSRSKSCGEGRGRAPPHGLVENRVAIGESEEKARRGR-KAWRFR-CGAL
        PLS GT+SR YEFV DVVIE+S QL     S PNSAYS PRL        GL+RS+SCGEGRG+A PHGL+EN+V + E  +K +    K  RFR CGAL
Subjt:  PLSGGTTSRSYEFVGDVVIELSTQLKLGSYSVPNSAYSPPRLG------SGLSRSKSCGEGRGRAPPHGLVENRVAIGESEEKARRGR-KAWRFR-CGAL

Query:  CLLLLGFGLKIGKG--KGERKEEAEAEAGGCISISISRRVSLEKFECGSWASSGMVVHEDGEAGMGSLYFDLPMELIRNSVGAQTQTQTQPVKAAFVFDR
        CLLL   G K+GKG  KG+ ++  EAE G CISISISRRVSLEKFECGSWASSGMVVHEDGE+  GSLYFDLPMELIRNSV AQTQ+   PV AAFVF+ 
Subjt:  CLLLLGFGLKIGKG--KGERKEEAEAEAGGCISISISRRVSLEKFECGSWASSGMVVHEDGEAGMGSLYFDLPMELIRNSVGAQTQTQTQPVKAAFVFDR

Query:  DGVYRLPVWTKGKLAEESGSPSPSPCIITPRLRKARREFNALLEAHTHTL
         G     VW K KLAEESG  + SPCIITPRLRKAR+EFNALLEAHTH L
Subjt:  DGVYRLPVWTKGKLAEESGSPSPSPCIITPRLRKARREFNALLEAHTHTL

A0A1S3AZD3 uncharacterized protein LOC1034842321.5e-6962.7Show/hide
Query:  SDTLRPLSGGTTSRSYEFVGDVVIELSTQLKLGSYSVPNSAYSPPRL------GSGLSRSKSCGEGRGRAPPHGLVENRVAIGESEEKARRGR-KAWRFR
        SD + PLS   +SR YEFV DVV+ +S QL     S PNS YS PRL        GL+RSKSCG+GRG+A PHGL+EN++   E  +K +    K  RF+
Subjt:  SDTLRPLSGGTTSRSYEFVGDVVIELSTQLKLGSYSVPNSAYSPPRL------GSGLSRSKSCGEGRGRAPPHGLVENRVAIGESEEKARRGR-KAWRFR

Query:  CGALCLLLLGFGLKIGKG--KGERKEEAEAEAGGCISISISRRVSLEKFECGSWASSGMVVHEDGEAGMGSLYFDLPMELIRNSVGAQTQTQTQPVKAAF
        CGALCLLL   G K+GKG  KG+ +++ EAE G CISISISRRVSL+KFECGSWASSGMVVHE+GE+  GSLYFDLPMELIRNSV AQ+Q+   PV AAF
Subjt:  CGALCLLLLGFGLKIGKG--KGERKEEAEAEAGGCISISISRRVSLEKFECGSWASSGMVVHEDGEAGMGSLYFDLPMELIRNSVGAQTQTQTQPVKAAF

Query:  VFD-RDGVYRLPVWTKGKLAEESGSPSPSPCIITPRLRKARREFNALLEAHT
        VFD ++G     VW K KLA+ESG  + SPCIITPRLRKAR+EFNALLEAHT
Subjt:  VFD-RDGVYRLPVWTKGKLAEESGSPSPSPCIITPRLRKARREFNALLEAHT

A0A5A7UFW0 Ycf3-interacting protein 11.5e-6963.35Show/hide
Query:  SDTLRPLSGGTTSRSYEFVGDVVIELSTQLKLGSYSVPNSAYSPPRL------GSGLSRSKSCGEGRGRAPPHGLVENRVAIGESEEKARRGR-KAWRFR
        SD + PLS   +SR YEFV DVV+ +S QL     S PNS YS PRL        GL+RSKSCG+GRG+A PHGL+EN++   E  +K +    K  RF+
Subjt:  SDTLRPLSGGTTSRSYEFVGDVVIELSTQLKLGSYSVPNSAYSPPRL------GSGLSRSKSCGEGRGRAPPHGLVENRVAIGESEEKARRGR-KAWRFR

Query:  CGALCLLLLGFGLKIGKG--KGERKEEAEAEAGGCISISISRRVSLEKFECGSWASSGMVVHEDGEAGMGSLYFDLPMELIRNSVGAQTQTQTQPVKAAF
        CGALCLLL   G K+GKG  KG+ +++ EAE G CISISISRRVSLEKFECGSWASSGMVVHE+GE+  GSLYFDLPMELIRNSV AQ+Q+   PV AAF
Subjt:  CGALCLLLLGFGLKIGKG--KGERKEEAEAEAGGCISISISRRVSLEKFECGSWASSGMVVHEDGEAGMGSLYFDLPMELIRNSVGAQTQTQTQPVKAAF

Query:  VFDRDGVYRLPVWTKGKLAEESGSPSPSPCIITPRLRKARREFNALLEAHT
        VFD  GV   P     KLAEESG  + SPCIITPRLRKAR+EFNALLEAHT
Subjt:  VFDRDGVYRLPVWTKGKLAEESGSPSPSPCIITPRLRKARREFNALLEAHT

A0A6J1EGQ7 uncharacterized protein LOC1114332242.5e-8368.68Show/hide
Query:  SDTLRPLSGGTTSRSYEFVGDVVIELSTQLKLGSYSVPNSAYSPPRLGS-------------GLSRSKSCGEGRGRAPPHGLVENRVAIGESEEKARRGR
        +D L PLS GTT R YE V DVVIE+STQ KL SYS PNSAYS P LGS             GL+RSKSCGEGRG+A PHGL+ENRV I E   K +   
Subjt:  SDTLRPLSGGTTSRSYEFVGDVVIELSTQLKLGSYSVPNSAYSPPRLGS-------------GLSRSKSCGEGRGRAPPHGLVENRVAIGESEEKARRGR

Query:  -KAWRFRCGALCLLLL---GFGLKIGKGKGERKEEAE--AEAGGCISISISR-RVSLEKFECGSWASSGMVVHEDGE--AGMGSLYFDLPMELIRNSVGA
         KA RFRCGALCLLL    G G K+GKGK ERKEE E   E GGCISISIS  RVSLEKFECGSWASSGMV HEDGE  +GMGSLYFDLPMELIRNSVGA
Subjt:  -KAWRFRCGALCLLLL---GFGLKIGKGKGERKEEAE--AEAGGCISISISR-RVSLEKFECGSWASSGMVVHEDGE--AGMGSLYFDLPMELIRNSVGA

Query:  QTQTQTQPVKAAFVFDRD--GVYRLPVWTKGKLAEESGSPSPSPCIITPRLRKARREFNALLEAH
        +TQ+   P + AFVF+RD  GV+ LPVWTK KLAEESG  + SPC+ITPRLR+AR EFNALLEAH
Subjt:  QTQTQTQPVKAAFVFDRD--GVYRLPVWTKGKLAEESGSPSPSPCIITPRLRKARREFNALLEAH

A0A6J1ILL2 uncharacterized protein LOC1114785483.8e-8469.06Show/hide
Query:  SDTLRPLSGGTTSRSYEFVGDVVIELSTQLKLGSYSVPNSAYSPPRLGS-------------GLSRSKSCGEGRGRAPPHGLVENRVAIGESEEKARRGR
        +D L PLS GTT RSYE V DVVI++STQ KL SYS PNSAYS P LGS             GL+RSKSCGEGRG+A PHGL++NRV I E   K +   
Subjt:  SDTLRPLSGGTTSRSYEFVGDVVIELSTQLKLGSYSVPNSAYSPPRLGS-------------GLSRSKSCGEGRGRAPPHGLVENRVAIGESEEKARRGR

Query:  -KAWRFRCGALCLLLL---GFGLKIGKGKGERKEEAE--AEAGGCISISISR-RVSLEKFECGSWASSGMVVHEDGEA--GMGSLYFDLPMELIRNSVGA
         KA RFRCGALCLLL    G G K+GKGK ERKEE E   E GGCISISIS  RVSLEKFECGSWASSGMV HEDGE+  GMGSLYFDLPMELIRNSVGA
Subjt:  -KAWRFRCGALCLLLL---GFGLKIGKGKGERKEEAE--AEAGGCISISISR-RVSLEKFECGSWASSGMVVHEDGEA--GMGSLYFDLPMELIRNSVGA

Query:  QTQTQTQPVKAAFVFDRD--GVYRLPVWTKGKLAEESGSPSPSPCIITPRLRKARREFNALLEAH
        +TQ+   P +AAFVFDRD  GV+ LPVWTK KLAEESG  + SPC+ITPRLR+AR EFNALLEAH
Subjt:  QTQTQTQPVKAAFVFDRD--GVYRLPVWTKGKLAEESGSPSPSPCIITPRLRKARREFNALLEAH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G30850.1 root hair specific 41.5e-1635Show/hide
Query:  FRCGALCLLLLGFGL-KIGKGKGERKEEAEAE---AGGCISISISRRVSLEKFECGSWASSGMVVHEDGEAGMGSLYFDLPMELIR-NSVGAQTQTQTQ-
        F+C A CL L GFG  K+ +   +R+   E +   A      ++S R SLEKFECGSWAS+  ++ ++G      L+FD P+E+ + NS G       Q 
Subjt:  FRCGALCLLLLGFGL-KIGKGKGERKEEAEAE---AGGCISISISRRVSLEKFECGSWASSGMVVHEDGEAGMGSLYFDLPMELIR-NSVGAQTQTQTQ-

Query:  PVKAAFVFDRD-------------------------GVYRLPVWTKGKLAEESGSPSPSPCIITPRLRKARREFNALLEA
        PV + F+FDR+                            R+   T    A  S   SP  C ITPRLRKAR +FN  L A
Subjt:  PVKAAFVFDRD-------------------------GVYRLPVWTKGKLAEESGSPSPSPCIITPRLRKARREFNALLEA

AT2G34910.1 BEST Arabidopsis thaliana protein match is: root hair specific 4 (TAIR:AT1G30850.1)4.1e-1435.96Show/hide
Query:  FRCGALCLLLLGFGLK-IGKGKGE---RKEEAEAEAGGCISISISRRVSLEKFECGSWASSGMVVHEDGEAGMGSLYFDLPMELIRNSVGAQTQTQTQPV
        F+C A CL L GFG + +   K E   +K+  +A +    ++S+S   SLEKFECGSWAS+  +  E+G      LY DLP+E+I+   G   Q   +PV
Subjt:  FRCGALCLLLLGFGLK-IGKGKGE---RKEEAEAEAGGCISISISRRVSLEKFECGSWASSGMVVHEDGEAGMGSLYFDLPMELIRNSVGAQTQTQTQPV

Query:  KAAFVFDRD-------GVYRLPVWTKGK----LAE--------------ESGSPSPSPCIITPRLRKARREFNALLEA
         + F FD++        V +      G+    LAE              +S   SP  C ITPRL KAR +FN  L A
Subjt:  KAAFVFDRD-------GVYRLPVWTKGK----LAE--------------ESGSPSPSPCIITPRLRKARREFNALLEA

AT4G20190.1 unknown protein1.7e-1536.02Show/hide
Query:  FRCGALCLLLLGFGLKIGKGK---GERKEEAEAEAGGCISIS--------------ISRRVSLEKFECGSWASSGMVVHEDGEAGMGSLYFDLPMELIRN
        F+C ALCL L GF     KGK     RK ++       ++ S              +S R SLE+FECGSW SS M+   D  A +G  +FDLP ELI+ 
Subjt:  FRCGALCLLLLGFGLKIGKGK---GERKEEAEAEAGGCISIS--------------ISRRVSLEKFECGSWASSGMVVHEDGEAGMGSLYFDLPMELIRN

Query:  SVGAQTQTQTQPVKAAFVFDRD--------GVYRLPVWTKGKLAEES--------GSPSPSPC----IITPRLRKARREFNALLEA
          G     Q  PV AAFVFD++        GV +    +K + + ES         SP   P      ITPRL +A  +F++ LEA
Subjt:  SVGAQTQTQTQPVKAAFVFDRD--------GVYRLPVWTKGKLAEES--------GSPSPSPC----IITPRLRKARREFNALLEA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCGGCAGCCATGGCGCTTCCACCCTCTGATACATTACGACCTCTATCTGGTGGCACCACTAGCAGAAGCTACGAATTTGTTGGGGATGTGGTTATTGAGTTGTCGAC
GCAATTGAAGTTGGGAAGCTACAGTGTCCCAAACTCGGCCTATTCACCCCCTCGGTTGGGCAGTGGACTGAGTCGGAGTAAATCCTGTGGTGAAGGAAGAGGGAGGGCAC
CACCGCATGGTCTTGTTGAGAATAGAGTGGCCATAGGGGAAAGCGAAGAGAAAGCGAGGCGCGGGAGGAAAGCTTGGCGTTTCAGATGTGGGGCACTCTGCTTGTTGCTG
CTAGGATTTGGTCTTAAGATTGGAAAAGGGAAGGGGGAGAGAAAGGAAGAGGCGGAGGCAGAGGCAGGAGGGTGTATATCCATATCCATATCGAGGAGAGTTTCTTTAGA
AAAATTCGAATGCGGTTCATGGGCTTCATCGGGCATGGTGGTTCATGAGGACGGGGAGGCAGGGATGGGGAGCCTTTATTTTGATCTGCCAATGGAATTGATAAGGAACA
GTGTGGGTGCTCAAACACAAACACAAACACAACCAGTAAAGGCCGCTTTTGTATTCGATAGAGATGGAGTTTATCGTCTTCCTGTTTGGACCAAAGGAAAATTGGCGGAG
GAATCAGGCTCCCCATCCCCATCTCCATGCATCATTACCCCACGCTTGCGCAAAGCCAGACGAGAGTTCAATGCACTTTTGGAAGCCCACACTCACACTCTATGA
mRNA sequenceShow/hide mRNA sequence
ATGCCGGCAGCCATGGCGCTTCCACCCTCTGATACATTACGACCTCTATCTGGTGGCACCACTAGCAGAAGCTACGAATTTGTTGGGGATGTGGTTATTGAGTTGTCGAC
GCAATTGAAGTTGGGAAGCTACAGTGTCCCAAACTCGGCCTATTCACCCCCTCGGTTGGGCAGTGGACTGAGTCGGAGTAAATCCTGTGGTGAAGGAAGAGGGAGGGCAC
CACCGCATGGTCTTGTTGAGAATAGAGTGGCCATAGGGGAAAGCGAAGAGAAAGCGAGGCGCGGGAGGAAAGCTTGGCGTTTCAGATGTGGGGCACTCTGCTTGTTGCTG
CTAGGATTTGGTCTTAAGATTGGAAAAGGGAAGGGGGAGAGAAAGGAAGAGGCGGAGGCAGAGGCAGGAGGGTGTATATCCATATCCATATCGAGGAGAGTTTCTTTAGA
AAAATTCGAATGCGGTTCATGGGCTTCATCGGGCATGGTGGTTCATGAGGACGGGGAGGCAGGGATGGGGAGCCTTTATTTTGATCTGCCAATGGAATTGATAAGGAACA
GTGTGGGTGCTCAAACACAAACACAAACACAACCAGTAAAGGCCGCTTTTGTATTCGATAGAGATGGAGTTTATCGTCTTCCTGTTTGGACCAAAGGAAAATTGGCGGAG
GAATCAGGCTCCCCATCCCCATCTCCATGCATCATTACCCCACGCTTGCGCAAAGCCAGACGAGAGTTCAATGCACTTTTGGAAGCCCACACTCACACTCTATGA
Protein sequenceShow/hide protein sequence
MPAAMALPPSDTLRPLSGGTTSRSYEFVGDVVIELSTQLKLGSYSVPNSAYSPPRLGSGLSRSKSCGEGRGRAPPHGLVENRVAIGESEEKARRGRKAWRFRCGALCLLL
LGFGLKIGKGKGERKEEAEAEAGGCISISISRRVSLEKFECGSWASSGMVVHEDGEAGMGSLYFDLPMELIRNSVGAQTQTQTQPVKAAFVFDRDGVYRLPVWTKGKLAE
ESGSPSPSPCIITPRLRKARREFNALLEAHTHTL