; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh13G002980 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh13G002980
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionPhotosystem II protein D1
Genome locationCmo_Chr13:3229860..3231853
RNA-Seq ExpressionCmoCh13G002980
SyntenyCmoCh13G002980
Gene Ontology termsGO:0006397 - mRNA processing (biological process)
GO:0009772 - photosynthetic electron transport in photosystem II (biological process)
GO:0009536 - plastid (cellular component)
GO:0034357 - photosynthetic membrane (cellular component)
GO:0005488 - binding (molecular function)
GO:0045156 - electron transporter, transferring electrons within the cyclic electron transport pathway of photosynthesis activity (molecular function)
InterPro domainsIPR000484 - Photosynthetic reaction centre, L/M
IPR024937 - Domain X
IPR036854 - Photosystem II protein D1/D2 superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAA9893312.1 unnamed protein product [Spirodela intermedia]2.9e-3256.17Show/hide
Query:  RPWIVVAYSAPVVAATAGFLIYPIGQGSFSDSMPDGISGTFNFMI---AEHNILMHPFHMLGLASVL---------------------AW---------L
        RPWI VAYSAPV AATA FLIYPIGQGSFSD MP GISGTFNFMI   AEHNILMHPFHMLG+A V                      AW         L
Subjt:  RPWIVVAYSAPVVAATAGFLIYPIGQGSFSDSMPDGISGTFNFMI---AEHNILMHPFHMLGLASVL---------------------AW---------L

Query:  PIQRYAWFLGNFYL-----------------ILNRANLGMEVMHERNAHNFPLDLAALEVPS
         I   A+ L  F                   I+NRANLGMEVMHERNAHNFPLDLAA+E PS
Subjt:  PIQRYAWFLGNFYL-----------------ILNRANLGMEVMHERNAHNFPLDLAALEVPS

KAF3449803.1 hypothetical protein FNV43_RR05881 [Rhamnella rubrinervis]5.3e-3432.8Show/hide
Query:  KIETLVPIKTPCLDHWLKALFCNVLGHPIRKPTWIDSSFLLLID---CISGNLSHYYRGSSNKKNF------------------------IYFHFLGFEL
        K++TL PI +P +    K  FC+ LGHPI K TW DSS   +ID    I  NLSHYY GSS KK+                         ++    G EL
Subjt:  KIETLVPIKTPCLDHWLKALFCNVLGHPIRKPTWIDSSFLLLID---CISGNLSHYYRGSSNKKNF------------------------IYFHFLGFEL

Query:  ----------------------------------------------------WLVTQQY--------------CTPSFKKG-KFTISGRIIYG-------
                                                            W     Y              C P    G +  +SG ++YG       
Subjt:  ----------------------------------------------------WLVTQQY--------------CTPSFKKG-KFTISGRIIYG-------

Query:  -----ARRASYMDGGRVGIELPYG-RPWIVVAYSAPVVAATAGFLIYPIGQGSFSDSMPDGISGTFNFMI---AEHNILMHPFHMLGLASVL--------
             A  A YM G    +    G RPWI VAYSAPV AATA FLIYPIGQGSFSD MP GISGTFNFMI   AEHNILMHPFHMLG+A V         
Subjt:  -----ARRASYMDGGRVGIELPYG-RPWIVVAYSAPVVAATAGFLIYPIGQGSFSDSMPDGISGTFNFMI---AEHNILMHPFHMLGLASVL--------

Query:  --------------------------------------------------------------AW---------LPIQRYAWFLGNFYL------------
                                                                      AW         L I   A+ L  F              
Subjt:  --------------------------------------------------------------AW---------LPIQRYAWFLGNFYL------------

Query:  -----ILNRANLGMEVMHERNAHNFPLDLAALEVPS
             I+NRANLGMEVMHERNAHNFPLDLAA++VPS
Subjt:  -----ILNRANLGMEVMHERNAHNFPLDLAALEVPS

KAG5618063.1 hypothetical protein H5410_017887, partial [Solanum commersonii]1.2e-3377.57Show/hide
Query:  PVVAATAGFLIYPIGQGSFSDSMPDGISGTFNFMI---AEHNILMHPFHMLGLASVLAWLPIQRYAWFLG----NFYLILNRANLGMEVMHERNAHNFPL
        PV AATA FLIYPIGQGSFSD MP GISGTFNFMI   AEHNILMHPFHMLG+A VLA  PIQ YAW LG     +  I+NRANLGMEVMHERNAHNFPL
Subjt:  PVVAATAGFLIYPIGQGSFSDSMPDGISGTFNFMI---AEHNILMHPFHMLGLASVLAWLPIQRYAWFLG----NFYLILNRANLGMEVMHERNAHNFPL

Query:  DLAALEV
        DLAA+E+
Subjt:  DLAALEV

KJB15227.1 hypothetical protein B456_002G165800 [Gossypium raimondii]7.6e-3335.25Show/hide
Query:  IETLVPIKTPCLDHWLKALFCNVLGHPIRKPTWIDSSFLLLID---CISGNLSHYYRGSSNK-------------KNFIYFHFLGFEL--------WLVT
        ++T +PI T  +    K  FCN L HPI KPTW+DS     ID    IS NLS Y++   ++             +N +Y  + G  +        ++ T
Subjt:  IETLVPIKTPCLDHWLKALFCNVLGHPIRKPTWIDSSFLLLID---CISGNLSHYYRGSSNK-------------KNFIYFHFLGFEL--------WLVT

Query:  QQYCT-----------PSFKKGKFTISGRII----------YGARRASYMD-----GG---------RVGIELPYG-----------RPWIVVAYSAPVV
          + T           P ++     ISG II          Y  R A+ +D     GG          +G+    G           RPWI VAYS  +V
Subjt:  QQYCT-----------PSFKKGKFTISGRII----------YGARRASYMD-----GG---------RVGIELPYG-----------RPWIVVAYSAPVV

Query:  AATAGFLIYPIGQGSFSDSMPDGISGTFNFMI---AEHNILMHPFHMLGLASV----------------LAWLPIQRYA--------------------W
         AT  FLIYPI +GSFSD MP GISGTF FMI   AEHNILMHPFHMLG+ASV                L W  I +YA                    W
Subjt:  AATAGFLIYPIGQGSFSDSMPDGISGTFNFMI---AEHNILMHPFHMLGLASV----------------LAWLPIQRYA--------------------W

Query:  F----------------------------LGNFYLILNRANLGMEVMHERNAHNFPLDLAALEVPS
        F                            +  +  I+N ANLGMEVMHERNAHNFPLDLAA+E PS
Subjt:  F----------------------------LGNFYLILNRANLGMEVMHERNAHNFPLDLAALEVPS

OMP12461.1 Photosynthetic reaction centre, L/M [Corchorus olitorius]1.2e-3864.74Show/hide
Query:  GKFTISGRII-YGARRASYMDGGRVGIELPYG-RPWIVVAYSAPVVAATAGFLIYPIGQGSFSDSMPDGISGTFNFMI---AEHNILMHPFHMLGLASVL
        G   ISG II   A  A YM G    +    G RPWI VAYSAPV AATA FLIYPIGQGSFSD MP GISGTFNFMI   AEHNILMHPFHMLG+A V 
Subjt:  GKFTISGRII-YGARRASYMDGGRVGIELPYG-RPWIVVAYSAPVVAATAGFLIYPIGQGSFSDSMPDGISGTFNFMI---AEHNILMHPFHMLGLASVL

Query:  AWLPIQRYAWFL--------GNFYLILNRANLGMEVMHERNAHNFPLDLAALEVPS
               YAWFL          +  I+NRANLGMEVMHERNAHNFPLDLAA+E PS
Subjt:  AWLPIQRYAWFL--------GNFYLILNRANLGMEVMHERNAHNFPLDLAALEVPS

TrEMBL top hitse value%identityAlignment
A0A0D2MBZ4 Uncharacterized protein3.7e-3335.25Show/hide
Query:  IETLVPIKTPCLDHWLKALFCNVLGHPIRKPTWIDSSFLLLID---CISGNLSHYYRGSSNK-------------KNFIYFHFLGFEL--------WLVT
        ++T +PI T  +    K  FCN L HPI KPTW+DS     ID    IS NLS Y++   ++             +N +Y  + G  +        ++ T
Subjt:  IETLVPIKTPCLDHWLKALFCNVLGHPIRKPTWIDSSFLLLID---CISGNLSHYYRGSSNK-------------KNFIYFHFLGFEL--------WLVT

Query:  QQYCT-----------PSFKKGKFTISGRII----------YGARRASYMD-----GG---------RVGIELPYG-----------RPWIVVAYSAPVV
          + T           P ++     ISG II          Y  R A+ +D     GG          +G+    G           RPWI VAYS  +V
Subjt:  QQYCT-----------PSFKKGKFTISGRII----------YGARRASYMD-----GG---------RVGIELPYG-----------RPWIVVAYSAPVV

Query:  AATAGFLIYPIGQGSFSDSMPDGISGTFNFMI---AEHNILMHPFHMLGLASV----------------LAWLPIQRYA--------------------W
         AT  FLIYPI +GSFSD MP GISGTF FMI   AEHNILMHPFHMLG+ASV                L W  I +YA                    W
Subjt:  AATAGFLIYPIGQGSFSDSMPDGISGTFNFMI---AEHNILMHPFHMLGLASV----------------LAWLPIQRYA--------------------W

Query:  F----------------------------LGNFYLILNRANLGMEVMHERNAHNFPLDLAALEVPS
        F                            +  +  I+N ANLGMEVMHERNAHNFPLDLAA+E PS
Subjt:  F----------------------------LGNFYLILNRANLGMEVMHERNAHNFPLDLAALEVPS

A0A1R3KZE5 Photosynthetic reaction centre, L/M5.9e-3964.74Show/hide
Query:  GKFTISGRII-YGARRASYMDGGRVGIELPYG-RPWIVVAYSAPVVAATAGFLIYPIGQGSFSDSMPDGISGTFNFMI---AEHNILMHPFHMLGLASVL
        G   ISG II   A  A YM G    +    G RPWI VAYSAPV AATA FLIYPIGQGSFSD MP GISGTFNFMI   AEHNILMHPFHMLG+A V 
Subjt:  GKFTISGRII-YGARRASYMDGGRVGIELPYG-RPWIVVAYSAPVVAATAGFLIYPIGQGSFSDSMPDGISGTFNFMI---AEHNILMHPFHMLGLASVL

Query:  AWLPIQRYAWFL--------GNFYLILNRANLGMEVMHERNAHNFPLDLAALEVPS
               YAWFL          +  I+NRANLGMEVMHERNAHNFPLDLAA+E PS
Subjt:  AWLPIQRYAWFL--------GNFYLILNRANLGMEVMHERNAHNFPLDLAALEVPS

A0A5D2FID8 Photosystem II protein D12.4e-3234.97Show/hide
Query:  KALFCNVLGHPIRKPTWIDSSFLLLID---CISGNLSHYYRGSSNK-------------KNFIYFHFLGFEL--WLVT--------------------QQ
        KA FCN LGHPI KPTW D     +ID    IS NLSHY+    ++             +N IY  + G  +   L+T                    ++
Subjt:  KALFCNVLGHPIRKPTWIDSSFLLLID---CISGNLSHYYRGSSNK-------------KNFIYFHFLGFEL--WLVT--------------------QQ

Query:  YCTPSFKKGKFTISGRII----------YGARRASYMD-----GG---------RVGIELPYG-----------RPWIVVAYSAPVVAATAGFLIYPIGQ
          + S   G   ISG II          Y    A+ +D     GG          +G+    G           RPWI VAYSAPV AATA FLIYPIGQ
Subjt:  YCTPSFKKGKFTISGRII----------YGARRASYMD-----GG---------RVGIELPYG-----------RPWIVVAYSAPVVAATAGFLIYPIGQ

Query:  GSFSDSMPDGISGTFNFMI---AEHNILMHPFHMLGLASVL-----------------------------------------------------------
        GSFSD MP GIS TFNFMI   AEHNILMHPFHMLG+A V                                                            
Subjt:  GSFSDSMPDGISGTFNFMI---AEHNILMHPFHMLGLASVL-----------------------------------------------------------

Query:  -----------AW---------LPIQRYAWFLGNFYL-----------------ILNRANLGMEVMHERNAHNFPLDLAALEVPSI
                   AW         L I   A+ L  F                   I+NRANLGMEVMHERNAHNFPLDLAA+E PSI
Subjt:  -----------AW---------LPIQRYAWFLGNFYL-----------------ILNRANLGMEVMHERNAHNFPLDLAALEVPSI

A0A5J5UQF9 Photosystem II protein D1 (Fragment)1.2e-3134.25Show/hide
Query:  TLVPIKTPCLDHWLKALFCNVLGHPIRKPTWIDSSFLLLID---CISGNLSHYYRGSSNK-------------KNFIYFHFLGFEL--------------
        T +PI T  +    KA FCN LGHPI KPTW D     +ID    IS NLSHY+    ++             +N IY  + G  +              
Subjt:  TLVPIKTPCLDHWLKALFCNVLGHPIRKPTWIDSSFLLLID---CISGNLSHYYRGSSNK-------------KNFIYFHFLGFEL--------------

Query:  WLVT--------QQYCTPSFKKGKFTISGRII----------YGARRASYMD-----GG---------RVGIELPYG-----------RPWIVVAYSAPV
        ++V+        ++  + S   G   ISG II          Y    A+ +D     GG          +G+    G           RPWI VAYSAPV
Subjt:  WLVT--------QQYCTPSFKKGKFTISGRII----------YGARRASYMD-----GG---------RVGIELPYG-----------RPWIVVAYSAPV

Query:  VAATAGFLIYPIGQGSFSDSMPDGISGTFNFMI---AEHNILMHPFHMLGLASVL---------------------------------------------
         AATA FLIYPIGQGSFSD MP GIS TFNFMI   AEHNILMHPFHMLG+A V                                              
Subjt:  VAATAGFLIYPIGQGSFSDSMPDGISGTFNFMI---AEHNILMHPFHMLGLASVL---------------------------------------------

Query:  -------------------------AW---------LPIQRYAWFLGNFYL-----------------ILNRANLGMEVMHERNAHNFPLDLAALEVPSI
                                 AW         L I   A+ L  F                   I+NRANLGMEVM ERNAHNFPLDLAA+E PSI
Subjt:  -------------------------AW---------LPIQRYAWFLGNFYL-----------------ILNRANLGMEVMHERNAHNFPLDLAALEVPSI

A0A6V7PYN2 Uncharacterized protein9.0e-3254.55Show/hide
Query:  RPWIVVAYSAPVVAATAGFLIYPIGQGSFSDSMPDGISGTFNFMI---AEHNILMHPFHMLGLASVLA------------------------------W-
        RPWI VAYSAPV AATA FLIYPIGQGSFSD MP GISGTFNFMI   AEHNILMHPFHMLG+A V                                W 
Subjt:  RPWIVVAYSAPVVAATAGFLIYPIGQGSFSDSMPDGISGTFNFMI---AEHNILMHPFHMLGLASVLA------------------------------W-

Query:  --LPIQRYAWFLGNFYL-----------------ILNRANLGMEVMHERNAHNFPLDLAALEVPS
          L I   A+ L  F                   I+NRANLGMEVMHERNAHNFPLDLAA+E PS
Subjt:  --LPIQRYAWFLGNFYL-----------------ILNRANLGMEVMHERNAHNFPLDLAALEVPS

SwissProt top hitse value%identityAlignment
A1E9Y8 Photosystem II protein D19.6e-3144.34Show/hide
Query:  RPWIVVAYSAPVVAATAGFLIYPIGQGSFSDSMPDGISGTFNFMI---AEHNILMHPFHMLGLASVL---------------------------------
        RPWI VAYSAPV AATA FLIYPIGQGSFSD MP GISGTFNFMI   AEHNILMHPFHMLG+A V                                  
Subjt:  RPWIVVAYSAPVVAATAGFLIYPIGQGSFSDSMPDGISGTFNFMI---AEHNILMHPFHMLGLASVL---------------------------------

Query:  -------------------------------------AW---------LPIQRYAWFLGNFYL-----------------ILNRANLGMEVMHERNAHNF
                                             AW         L I   A+ L  F                   I+NRANLGMEVMHERNAHNF
Subjt:  -------------------------------------AW---------LPIQRYAWFLGNFYL-----------------ILNRANLGMEVMHERNAHNF

Query:  PLDLAALEVPSI
        PLDLAALEVPSI
Subjt:  PLDLAALEVPSI

A8Y9F0 Photosystem II protein D11.6e-3043.87Show/hide
Query:  RPWIVVAYSAPVVAATAGFLIYPIGQGSFSDSMPDGISGTFNFMI---AEHNILMHPFHMLGLASVL---------------------------------
        RPWI VAYSAPV AATA FLIYPIGQGSFSD MP GISGTFNFMI   AEHNILMHPFHMLG+A V                                  
Subjt:  RPWIVVAYSAPVVAATAGFLIYPIGQGSFSDSMPDGISGTFNFMI---AEHNILMHPFHMLGLASVL---------------------------------

Query:  -------------------------------------AW---------LPIQRYAWFLGNFYL-----------------ILNRANLGMEVMHERNAHNF
                                             AW         L I   A+ L  F                   I+NRANLGMEVMHERNAHNF
Subjt:  -------------------------------------AW---------LPIQRYAWFLGNFYL-----------------ILNRANLGMEVMHERNAHNF

Query:  PLDLAALEVPSI
        PLDLAALEVPS+
Subjt:  PLDLAALEVPSI

P0C432 Photosystem II protein D11.6e-3043.87Show/hide
Query:  RPWIVVAYSAPVVAATAGFLIYPIGQGSFSDSMPDGISGTFNFMI---AEHNILMHPFHMLGLASVL---------------------------------
        RPWI VAYSAPV AATA FLIYPIGQGSFSD MP GISGTFNFMI   AEHNILMHPFHMLG+A V                                  
Subjt:  RPWIVVAYSAPVVAATAGFLIYPIGQGSFSDSMPDGISGTFNFMI---AEHNILMHPFHMLGLASVL---------------------------------

Query:  -------------------------------------AW---------LPIQRYAWFLGNFYL-----------------ILNRANLGMEVMHERNAHNF
                                             AW         L I   A+ L  F                   I+NRANLGMEVMHERNAHNF
Subjt:  -------------------------------------AW---------LPIQRYAWFLGNFYL-----------------ILNRANLGMEVMHERNAHNF

Query:  PLDLAALEVPSI
        PLDLAALEVPS+
Subjt:  PLDLAALEVPSI

P0C433 Photosystem II protein D11.6e-3043.87Show/hide
Query:  RPWIVVAYSAPVVAATAGFLIYPIGQGSFSDSMPDGISGTFNFMI---AEHNILMHPFHMLGLASVL---------------------------------
        RPWI VAYSAPV AATA FLIYPIGQGSFSD MP GISGTFNFMI   AEHNILMHPFHMLG+A V                                  
Subjt:  RPWIVVAYSAPVVAATAGFLIYPIGQGSFSDSMPDGISGTFNFMI---AEHNILMHPFHMLGLASVL---------------------------------

Query:  -------------------------------------AW---------LPIQRYAWFLGNFYL-----------------ILNRANLGMEVMHERNAHNF
                                             AW         L I   A+ L  F                   I+NRANLGMEVMHERNAHNF
Subjt:  -------------------------------------AW---------LPIQRYAWFLGNFYL-----------------ILNRANLGMEVMHERNAHNF

Query:  PLDLAALEVPSI
        PLDLAALEVPS+
Subjt:  PLDLAALEVPSI

Q6ENJ7 Photosystem II protein D11.6e-3043.87Show/hide
Query:  RPWIVVAYSAPVVAATAGFLIYPIGQGSFSDSMPDGISGTFNFMI---AEHNILMHPFHMLGLASVL---------------------------------
        RPWI VAYSAPV AATA FLIYPIGQGSFSD MP GISGTFNFMI   AEHNILMHPFHMLG+A V                                  
Subjt:  RPWIVVAYSAPVVAATAGFLIYPIGQGSFSDSMPDGISGTFNFMI---AEHNILMHPFHMLGLASVL---------------------------------

Query:  -------------------------------------AW---------LPIQRYAWFLGNFYL-----------------ILNRANLGMEVMHERNAHNF
                                             AW         L I   A+ L  F                   I+NRANLGMEVMHERNAHNF
Subjt:  -------------------------------------AW---------LPIQRYAWFLGNFYL-----------------ILNRANLGMEVMHERNAHNF

Query:  PLDLAALEVPSI
        PLDLAALEVPS+
Subjt:  PLDLAALEVPSI

Arabidopsis top hitse value%identityAlignment
ATCG00020.1 photosystem II reaction center protein A1.3e-3043.13Show/hide
Query:  RPWIVVAYSAPVVAATAGFLIYPIGQGSFSDSMPDGISGTFNFMI---AEHNILMHPFHMLGLASVL---------------------------------
        RPWI VAYSAPV AATA FLIYPIGQGSFSD MP GISGTFNFMI   AEHNILMHPFHMLG+A V                                  
Subjt:  RPWIVVAYSAPVVAATAGFLIYPIGQGSFSDSMPDGISGTFNFMI---AEHNILMHPFHMLGLASVL---------------------------------

Query:  -------------------------------------AW---------LPIQRYAWFLGNFYL-----------------ILNRANLGMEVMHERNAHNF
                                             AW         L I   A+ L  F                   I+NRANLGMEVMHERNAHNF
Subjt:  -------------------------------------AW---------LPIQRYAWFLGNFYL-----------------ILNRANLGMEVMHERNAHNF

Query:  PLDLAALEVPS
        PLDLAA+E PS
Subjt:  PLDLAALEVPS

ATCG00040.1 maturase K2.8e-0944.44Show/hide
Query:  KIETLVPIKTPCLDHWLKALFCNVLGHPIRKPTWIDSS---FLLLIDCISGNLSHYYRGSSNKKNFIYFHFL
        K+++ +PI +  +    K  FCNVLGHPI K TW DSS    L     I  N+SHYY GSS KKN     ++
Subjt:  KIETLVPIKTPCLDHWLKALFCNVLGHPIRKPTWIDSS---FLLLIDCISGNLSHYYRGSSNKKNFIYFHFL

ATCG00270.1 photosystem II reaction center protein D2.0e-0737.31Show/hide
Query:  RPWIVVAYSAPVVAATAGFLIYPIGQGSFSDSMPDGISGTFNFMI---AEHNILMHPFHMLGLASVL
        RP+  +A+S P+    + FLIYP+GQ  +  +   G++  F F++     HN  ++PFHM+G+A VL
Subjt:  RPWIVVAYSAPVVAATAGFLIYPIGQGSFSDSMPDGISGTFNFMI---AEHNILMHPFHMLGLASVL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTATGCTACAAAATCGAGACACTAGTTCCTATTAAGACTCCTTGCTTGGATCATTGGCTAAAAGCGCTATTTTGTAATGTCTTAGGGCATCCCATTAGGAAGCCGAC
CTGGATCGATTCGTCTTTTCTATTATTGATTGATTGTATATCCGGAAATCTTTCTCATTATTACAGAGGATCTTCAAACAAAAAGAATTTTATATACTTTCACTTTCTTG
GCTTCGAACTTTGGCTCGTAACACAACAGTACTGTACGCCTTCTTTTAAAAAGGGGAAATTCACAATTAGTGGAAGAATTATATACGGAGCAAGAAGAGCATCTTACATG
GATGGAGGGAGAGTGGGAATTGAGCTTCCGTATGGGCGTCCTTGGATTGTTGTTGCATATTCCGCTCCTGTTGTAGCTGCTACTGCTGGTTTCTTGATCTACCCAATTGG
TCAAGGAAGCTTTTCTGACAGTATGCCTGACGGAATCTCTGGTACTTTCAACTTCATGATTGCTGAGCACAACATCCTTATGCACCCATTCCACATGTTAGGTCTAGCTA
GTGTATTGGCTTGGCTCCCTATTCAGCGCTATGCATGGTTCCTTGGTAACTTCTATCTAATTCTTAACCGTGCTAATCTTGGTATGGAAGTTATGCATGAACGTAATGCT
CACAACTTCCCTCTAGACCTAGCTGCTCTTGAAGTTCCATCTATCTAG
mRNA sequenceShow/hide mRNA sequence
ATGCTATGCTACAAAATCGAGACACTAGTTCCTATTAAGACTCCTTGCTTGGATCATTGGCTAAAAGCGCTATTTTGTAATGTCTTAGGGCATCCCATTAGGAAGCCGAC
CTGGATCGATTCGTCTTTTCTATTATTGATTGATTGTATATCCGGAAATCTTTCTCATTATTACAGAGGATCTTCAAACAAAAAGAATTTTATATACTTTCACTTTCTTG
GCTTCGAACTTTGGCTCGTAACACAACAGTACTGTACGCCTTCTTTTAAAAAGGGGAAATTCACAATTAGTGGAAGAATTATATACGGAGCAAGAAGAGCATCTTACATG
GATGGAGGGAGAGTGGGAATTGAGCTTCCGTATGGGCGTCCTTGGATTGTTGTTGCATATTCCGCTCCTGTTGTAGCTGCTACTGCTGGTTTCTTGATCTACCCAATTGG
TCAAGGAAGCTTTTCTGACAGTATGCCTGACGGAATCTCTGGTACTTTCAACTTCATGATTGCTGAGCACAACATCCTTATGCACCCATTCCACATGTTAGGTCTAGCTA
GTGTATTGGCTTGGCTCCCTATTCAGCGCTATGCATGGTTCCTTGGTAACTTCTATCTAATTCTTAACCGTGCTAATCTTGGTATGGAAGTTATGCATGAACGTAATGCT
CACAACTTCCCTCTAGACCTAGCTGCTCTTGAAGTTCCATCTATCTAG
Protein sequenceShow/hide protein sequence
MLCYKIETLVPIKTPCLDHWLKALFCNVLGHPIRKPTWIDSSFLLLIDCISGNLSHYYRGSSNKKNFIYFHFLGFELWLVTQQYCTPSFKKGKFTISGRIIYGARRASYM
DGGRVGIELPYGRPWIVVAYSAPVVAATAGFLIYPIGQGSFSDSMPDGISGTFNFMIAEHNILMHPFHMLGLASVLAWLPIQRYAWFLGNFYLILNRANLGMEVMHERNA
HNFPLDLAALEVPSI