CuGenDBv2

CmoCh13G003110 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh13G003110
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: Photosystem II CP43 reaction center protein
Genome location: Cmo_Chr13:3346646..3365375
RNA-Seq Expression: CmoCh13G003110
Synteny: CmoCh13G003110
Gene Ontology terms:
GO:0009767 - photosynthetic electron transport chain (biological process)
GO:0009521 - photosystem (cellular component)
GO:0016168 - chlorophyll binding (molecular function)
InterPro domains:
IPR000932 - Photosystem antenna protein-like
IPR036001 - Photosystem antenna protein-like superfamily
IPR044900 - Photosystem II CP43 reaction centre protein superfamily


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAD7478429.1 hypothetical protein E3N88_01565 [Mikania micrantha] (e value: 9.3e-16; % identity: 33.59)
Query:  KEPLGF---GGNARLIHLDGKLLEAGSCSPCQINRILLWAEAMNLFEVAHFIPKKPIYEQRSIFLPRTPSY------YSRGRLDPKLLK-----------
        +E  GF    GNARLI+L GKLL A          I+ WA AMNLFEVAHF+P+KP+YEQ  I LP   +            L P+ L+           
Subjt:  KEPLGF---GGNARLIHLDGKLLEAGSCSPCQINRILLWAEAMNLFEVAHFIPKKPIYEQRSIFLPRTPSY------YSRGRLDPKLLK-----------

Query:  ---------------------LNQPFMFVR-------------------------------------------------------------PLVANIGSR
                             L +PF + R                                                              L AN+GS 
Subjt:  ---------------------LNQPFMFVR-------------------------------------------------------------PLVANIGSR

Query:  TYSTTGLGKYLMRSPTIEKRHFEGRDRKKETMCFWDLRAPWLEPIRGPNRLDLKIREGKGIQ
            TGLGKYLMRSPT E   F G     ETM FWDLRAPWLEP+R PN LDL  R  K IQ
Subjt:  TYSTTGLGKYLMRSPTIEKRHFEGRDRKKETMCFWDLRAPWLEPIRGPNRLDLKIREGKGIQ

KAG8363205.1 hypothetical protein BUALT_BualtPtG0001800 [Buddleja alternifolia] (e value: 1.4e-16; % identity: 33.97)
Query:  KEPLGF---GGNARLIHLDGKLLEAGSCSPCQINRILLWAEAMNLFEVAHFIPKKPIYEQRSIFLPRTPSY------YSRGRLDPKLLK-----------
        +E  GF    GNARLI+L GKLL A          I+ WA AMNLFEVAHF+P+KP+YEQ  I LP   +            L P+ L+           
Subjt:  KEPLGF---GGNARLIHLDGKLLEAGSCSPCQINRILLWAEAMNLFEVAHFIPKKPIYEQRSIFLPRTPSY------YSRGRLDPKLLK-----------

Query:  ---------------------LNQPFMFVR-------------------------------------------------------------PLVANIGSR
                             L +PF + R                                                              L AN+GS 
Subjt:  ---------------------LNQPFMFVR-------------------------------------------------------------PLVANIGSR

Query:  TYSTTGLGKYLMRSPTIEKRHFEGRDRKKETMCFWDLRAPWLEPIRGPNRLDLKIREGKGIQ
            TGLGKYLMRSPT E   F G     ETM FWDLRAPWLEP+RGPN LDL  R  K IQ
Subjt:  TYSTTGLGKYLMRSPTIEKRHFEGRDRKKETMCFWDLRAPWLEPIRGPNRLDLKIREGKGIQ

OTF84537.1 putative photosynthetic reaction centre, L/M, Photosystem antenna protein-like protein [Helianthus annuus] (e value: 5.6e-21; % identity: 40)
Query:  KEPLGF---GGNARLIHLDGKLLEAGSCSPCQINRILLWAEAMNLFEVAHFIPKKPIYEQRSIFLPR----------------TPSYYSRG---------
        +E  GF    GNARLI+L GKLL A          I+ WA AMNLFEVAHF+P+KP+YEQ  I LP                 T  Y+  G         
Subjt:  KEPLGF---GGNARLIHLDGKLLEAGSCSPCQINRILLWAEAMNLFEVAHFIPKKPIYEQRSIFLPR----------------TPSYYSRG---------

Query:  ---------RLDPKLLKLNQPF--------------MFVRP-------------------LVANIGSRTYSTTGLGKYLMRSPTIEKRHFEGRDRKKETM
                  L P+ L+ + PF              +F  P                   L AN+GS     TGLGKYLMRSPT E   F G     ETM
Subjt:  ---------RLDPKLLKLNQPF--------------MFVRP-------------------LVANIGSRTYSTTGLGKYLMRSPTIEKRHFEGRDRKKETM

Query:  CFWDLRAPWLEPIRGPNRLDLKIREGKGIQ
         FWDLRAPWLEP+RGPN LDL  R  K IQ
Subjt:  CFWDLRAPWLEPIRGPNRLDLKIREGKGIQ

OTF84569.1 putative photosystem II CP43 reaction center protein [Helianthus annuus] (e value: 2.3e-14; % identity: 31.67)
Query:  KEPLGF---GGNARLIHLDGKLLEAGSCSPCQINRILLWAEAMNLFEVAHFIPKKPIYEQRSIFLPRTPSY------YSRGRLDPKLLK-----------
        +E  GF    GNARLI+L GKLL A          I+ WA AMNLFEVAHF+P+KP+YEQ  I LP   +            L P+ L+           
Subjt:  KEPLGF---GGNARLIHLDGKLLEAGSCSPCQINRILLWAEAMNLFEVAHFIPKKPIYEQRSIFLPRTPSY------YSRGRLDPKLLK-----------

Query:  ----------------------------------------LNQPFMFVR---------------------------------------------------
                                                L +PF + R                                                   
Subjt:  ----------------------------------------LNQPFMFVR---------------------------------------------------

Query:  ----------PLVANIGSRTYSTTGLGKYLMRSPTIEKRHFEGRDRKKETMCFWDLRAPWLEPIRGPNRLDLKIREGKGIQ
                   L AN+GS     TGLGKYLMRSPT E   F G     ETM FWDLRAPWLEP+RGPN LDL  R  K IQ
Subjt:  ----------PLVANIGSRTYSTTGLGKYLMRSPTIEKRHFEGRDRKKETMCFWDLRAPWLEPIRGPNRLDLKIREGKGIQ

VAH69885.1 unnamed protein product [Triticum turgidum subsp. durum] (e value: 1.3e-14; % identity: 32.48)
Query:  KEPLGF---GGNARLIHLDGKLLEAGSCSPCQINRILLWAEAMNLFEVAHFIPKKPIYEQRSIFLPRTPSYYSRG----RLDPKLLKLNQPFM-------
        KE  GF    GNARLI+L GKLL A          I+ WA AMNLFEVAHF+P+KP+YEQ  I LP   +    G     L P+ L+ + PF        
Subjt:  KEPLGF---GGNARLIHLDGKLLEAGSCSPCQINRILLWAEAMNLFEVAHFIPKKPIYEQRSIFLPRTPSYYSRG----RLDPKLLKLNQPFM-------

Query:  ------------------FVRPLVANIGSRTYST------------------------------------------------------------------
                          F+  L A      Y T                                                                  
Subjt:  ------------------FVRPLVANIGSRTYST------------------------------------------------------------------

Query:  ----------------TGLGKYLMRSPTIEKRHFEGRDRKKETMCFWDLRAPWLEPIRGPNRLDLKIREGKGIQ
                        TGLGKYLMRSPT E   F G     ETM FWDLRAPWLEP+RGPN LDL  R  K IQ
Subjt:  ----------------TGLGKYLMRSPTIEKRHFEGRDRKKETMCFWDLRAPWLEPIRGPNRLDLKIREGKGIQ

TrEMBL top hits (columns: e value, % identity, alignment)
A0A0E0NEA4 Photosystem II D2 protein (e value: 3.8e-15; % identity: 36.45)
Query:  KEPLGF---GGNARLIHLDGKLLEAGSCSPCQINRILLWAEAMNLFEVAHFIPKKPIYEQRSIFLPR----------------TPSYYSRG---------
        +E  GF    GNARLI+L GKLL A          I+ WA AMNLFEVAHF+P+KP+YEQ  I LP                 T  Y+  G         
Subjt:  KEPLGF---GGNARLIHLDGKLLEAGSCSPCQINRILLWAEAMNLFEVAHFIPKKPIYEQRSIFLPR----------------TPSYYSRG---------

Query:  -----------RLDPKLLKLNQPFM-FV---RPLVANIGSRTYSTTGLGKYLMRSPTIEKRHFEGRDRK-----------KETMCFWDLRAPWLEPIRGP
                    L P+ L+ + PF  +V   R  +  I        G+G +L+    ++  +F G                ETM FWDLRAPWLEP+RGP
Subjt:  -----------RLDPKLLKLNQPFM-FV---RPLVANIGSRTYSTTGLGKYLMRSPTIEKRHFEGRDRK-----------KETMCFWDLRAPWLEPIRGP

Query:  NRLDLKIREGKGIQ
        N LDL  R  K IQ
Subjt:  NRLDLKIREGKGIQ

A0A1Y3BWP9 Photosystem II D2 protein (e value: 2.7e-21; % identity: 40)
Query:  KEPLGF---GGNARLIHLDGKLLEAGSCSPCQINRILLWAEAMNLFEVAHFIPKKPIYEQRSIFLPR----------------TPSYYSRG---------
        +E  GF    GNARLI+L GKLL A          I+ WA AMNLFEVAHF+P+KP+YEQ  I LP                 T  Y+  G         
Subjt:  KEPLGF---GGNARLIHLDGKLLEAGSCSPCQINRILLWAEAMNLFEVAHFIPKKPIYEQRSIFLPR----------------TPSYYSRG---------

Query:  ---------RLDPKLLKLNQPF--------------MFVRP-------------------LVANIGSRTYSTTGLGKYLMRSPTIEKRHFEGRDRKKETM
                  L P+ L+ + PF              +F  P                   L AN+GS     TGLGKYLMRSPT E   F G     ETM
Subjt:  ---------RLDPKLLKLNQPF--------------MFVRP-------------------LVANIGSRTYSTTGLGKYLMRSPTIEKRHFEGRDRKKETM

Query:  CFWDLRAPWLEPIRGPNRLDLKIREGKGIQ
         FWDLRAPWLEP+RGPN LDL  R  K IQ
Subjt:  CFWDLRAPWLEPIRGPNRLDLKIREGKGIQ

A0A446P6B7 Uncharacterized protein (e value: 6.5e-15; % identity: 32.48)
Query:  KEPLGF---GGNARLIHLDGKLLEAGSCSPCQINRILLWAEAMNLFEVAHFIPKKPIYEQRSIFLPRTPSYYSRG----RLDPKLLKLNQPFM-------
        KE  GF    GNARLI+L GKLL A          I+ WA AMNLFEVAHF+P+KP+YEQ  I LP   +    G     L P+ L+ + PF        
Subjt:  KEPLGF---GGNARLIHLDGKLLEAGSCSPCQINRILLWAEAMNLFEVAHFIPKKPIYEQRSIFLPRTPSYYSRG----RLDPKLLKLNQPFM-------

Query:  ------------------FVRPLVANIGSRTYST------------------------------------------------------------------
                          F+  L A      Y T                                                                  
Subjt:  ------------------FVRPLVANIGSRTYST------------------------------------------------------------------

Query:  ----------------TGLGKYLMRSPTIEKRHFEGRDRKKETMCFWDLRAPWLEPIRGPNRLDLKIREGKGIQ
                        TGLGKYLMRSPT E   F G     ETM FWDLRAPWLEP+RGPN LDL  R  K IQ
Subjt:  ----------------TGLGKYLMRSPTIEKRHFEGRDRKKETMCFWDLRAPWLEPIRGPNRLDLKIREGKGIQ

A0A5N6Q1F4 OBG-type G domain-containing protein (e value: 4.5e-16; % identity: 33.59)
Query:  KEPLGF---GGNARLIHLDGKLLEAGSCSPCQINRILLWAEAMNLFEVAHFIPKKPIYEQRSIFLPRTPSY------YSRGRLDPKLLK-----------
        +E  GF    GNARLI+L GKLL A          I+ WA AMNLFEVAHF+P+KP+YEQ  I LP   +            L P+ L+           
Subjt:  KEPLGF---GGNARLIHLDGKLLEAGSCSPCQINRILLWAEAMNLFEVAHFIPKKPIYEQRSIFLPRTPSY------YSRGRLDPKLLK-----------

Query:  ---------------------LNQPFMFVR-------------------------------------------------------------PLVANIGSR
                             L +PF + R                                                              L AN+GS 
Subjt:  ---------------------LNQPFMFVR-------------------------------------------------------------PLVANIGSR

Query:  TYSTTGLGKYLMRSPTIEKRHFEGRDRKKETMCFWDLRAPWLEPIRGPNRLDLKIREGKGIQ
            TGLGKYLMRSPT E   F G     ETM FWDLRAPWLEP+R PN LDL  R  K IQ
Subjt:  TYSTTGLGKYLMRSPTIEKRHFEGRDRKKETMCFWDLRAPWLEPIRGPNRLDLKIREGKGIQ

A0A803M6L1 Uncharacterized protein (e value: 4.6e-21; % identity: 47.46)
Query:  KEPLGF---GGNARLIHLDGKLLEAGSCSPCQINRILLWAEAMNLFEVAHFIPKKPIYEQRSIF-----------LPRTPSYYSRGRLDPKLLKLNQPFM
        +E  GF    GNARLI+L GKLL A          I+ WA AMNLFEVA F+P+KPI    S+F               PS +  G   P+  +  Q F 
Subjt:  KEPLGF---GGNARLIHLDGKLLEAGSCSPCQINRILLWAEAMNLFEVAHFIPKKPIYEQRSIF-----------LPRTPSYYSRGRLDPKLLKLNQPFM

Query:  FV---RPLVANIGSRTYSTTGLGKYLMRSPTIEKRHFEGRDRKKETMCFWDLRAPWLEPIRGPNRLDLKIREGKGIQ
        F+   + L AN+GS     TGLGKYLMRSPT E   F G     E M FWDLRAPWLEP+RGPN LDL  R  K IQ
Subjt:  FV---RPLVANIGSRTYSTTGLGKYLMRSPTIEKRHFEGRDRKKETMCFWDLRAPWLEPIRGPNRLDLKIREGKGIQ

SwissProt top hits (columns: e value, % identity, alignment)
A4QJB1 Photosystem II CP43 reaction center protein (e value: 2.9e-12; % identity: 59.26)
Query:  QPFMFV---RPLVANIGSRTYSTTGLGKYLMRSPTIEKRHFEGRDRKKETMCFWDLRAPWLEPIRGPNRLDLKIREGKGIQ
        Q F F+   + L AN+GS     TGLGKYLMRSPT E   F G     ETM FWDLRAPWLEP+RGPN LDL  R  K IQ
Subjt:  QPFMFV---RPLVANIGSRTYSTTGLGKYLMRSPTIEKRHFEGRDRKKETMCFWDLRAPWLEPIRGPNRLDLKIREGKGIQ

A4QJB1 Photosystem II CP43 reaction center protein (e value: 2.1e-10; % identity: 56.06)
Query:  KEPLGF---GGNARLIHLDGKLLEAGSCSPCQINRILLWAEAMNLFEVAHFIPKKPIYEQRSIFLP
        +E  GF    GNARLI+L GKLL A          I+ WA AMNLFEVAHF+P+KP+YEQ  I LP
Subjt:  KEPLGF---GGNARLIHLDGKLLEAGSCSPCQINRILLWAEAMNLFEVAHFIPKKPIYEQRSIFLP

A9LYI1 Photosystem II CP43 reaction center protein (e value: 2.9e-12; % identity: 59.26)
Query:  QPFMFV---RPLVANIGSRTYSTTGLGKYLMRSPTIEKRHFEGRDRKKETMCFWDLRAPWLEPIRGPNRLDLKIREGKGIQ
        Q F F+   + L AN+GS     TGLGKYLMRSPT E   F G     ETM FWDLRAPWLEP+RGPN LDL  R  K IQ
Subjt:  QPFMFV---RPLVANIGSRTYSTTGLGKYLMRSPTIEKRHFEGRDRKKETMCFWDLRAPWLEPIRGPNRLDLKIREGKGIQ

A9LYI1 Photosystem II CP43 reaction center protein (e value: 2.1e-10; % identity: 56.06)
Query:  KEPLGF---GGNARLIHLDGKLLEAGSCSPCQINRILLWAEAMNLFEVAHFIPKKPIYEQRSIFLP
        +E  GF    GNARLI+L GKLL A          I+ WA AMNLFEVAHF+P+KP+YEQ  I LP
Subjt:  KEPLGF---GGNARLIHLDGKLLEAGSCSPCQINRILLWAEAMNLFEVAHFIPKKPIYEQRSIFLP

Q2QD93 Photosystem II CP43 reaction center protein (e value: 2.2e-12; % identity: 60.49)
Query:  QPFMFV---RPLVANIGSRTYSTTGLGKYLMRSPTIEKRHFEGRDRKKETMCFWDLRAPWLEPIRGPNRLDLKIREGKGIQ
        Q F F+   + L ANIGS     TGLGKYLMRSPT E   F G     ETM FWDLRAPWLEP+RGPN LDL  R  K IQ
Subjt:  QPFMFV---RPLVANIGSRTYSTTGLGKYLMRSPTIEKRHFEGRDRKKETMCFWDLRAPWLEPIRGPNRLDLKIREGKGIQ

Q2QD93 Photosystem II CP43 reaction center protein (e value: 2.1e-10; % identity: 56.06)
Query:  KEPLGF---GGNARLIHLDGKLLEAGSCSPCQINRILLWAEAMNLFEVAHFIPKKPIYEQRSIFLP
        +E  GF    GNARLI+L GKLL A          I+ WA AMNLFEVAHF+P+KP+YEQ  I LP
Subjt:  KEPLGF---GGNARLIHLDGKLLEAGSCSPCQINRILLWAEAMNLFEVAHFIPKKPIYEQRSIFLP

Q3V538 Photosystem II CP43 reaction center protein (e value: 2.9e-12; % identity: 59.26)
Query:  QPFMFV---RPLVANIGSRTYSTTGLGKYLMRSPTIEKRHFEGRDRKKETMCFWDLRAPWLEPIRGPNRLDLKIREGKGIQ
        Q F F+   + L AN+GS     TGLGKYLMRSPT E   F G     ETM FWDLRAPWLEP+RGPN LDL  R  K IQ
Subjt:  QPFMFV---RPLVANIGSRTYSTTGLGKYLMRSPTIEKRHFEGRDRKKETMCFWDLRAPWLEPIRGPNRLDLKIREGKGIQ

Q3V538 Photosystem II CP43 reaction center protein (e value: 2.1e-10; % identity: 56.06)
Query:  KEPLGF---GGNARLIHLDGKLLEAGSCSPCQINRILLWAEAMNLFEVAHFIPKKPIYEQRSIFLP
        +E  GF    GNARLI+L GKLL A          I+ WA AMNLFEVAHF+P+KP+YEQ  I LP
Subjt:  KEPLGF---GGNARLIHLDGKLLEAGSCSPCQINRILLWAEAMNLFEVAHFIPKKPIYEQRSIFLP

Q85FM3 Photosystem II CP43 reaction center protein (e value: 2.9e-12; % identity: 62.5)
Query:  QPFMFV---RPLVANIGSRTYSTTGLGKYLMRSPTIEKRHFEGRDRKKETMCFWDLRAPWLEPIRGPNRLDL
        Q F F+   + L ANIGS     TGLGKYLMRSPT E   F G     ETM FWDLRAPWLEP+RGPN LDL
Subjt:  QPFMFV---RPLVANIGSRTYSTTGLGKYLMRSPTIEKRHFEGRDRKKETMCFWDLRAPWLEPIRGPNRLDL

Q85FM3 Photosystem II CP43 reaction center protein (e value: 1.8e-09; % identity: 54.55)
Query:  KEPLGF---GGNARLIHLDGKLLEAGSCSPCQINRILLWAEAMNLFEVAHFIPKKPIYEQRSIFLP
        +E  GF    GNARLI+L GKLL A          I+ WA AMNLFEVAHF+ +KP+YEQ  I LP
Subjt:  KEPLGF---GGNARLIHLDGKLLEAGSCSPCQINRILLWAEAMNLFEVAHFIPKKPIYEQRSIFLP

Arabidopsis top hits (columns: e value, % identity, alignment)
ATCG00280.1 photosystem II reaction center protein C (e value: 2.1e-13; % identity: 59.26)
Query:  QPFMFV---RPLVANIGSRTYSTTGLGKYLMRSPTIEKRHFEGRDRKKETMCFWDLRAPWLEPIRGPNRLDLKIREGKGIQ
        Q F F+   + L AN+GS     TGLGKYLMRSPT E   F G     ETM FWDLRAPWLEP+RGPN LDL  R  K IQ
Subjt:  QPFMFV---RPLVANIGSRTYSTTGLGKYLMRSPTIEKRHFEGRDRKKETMCFWDLRAPWLEPIRGPNRLDLKIREGKGIQ

ATCG00280.1 photosystem II reaction center protein C (e value: 1.5e-11; % identity: 56.06)
Query:  KEPLGF---GGNARLIHLDGKLLEAGSCSPCQINRILLWAEAMNLFEVAHFIPKKPIYEQRSIFLP
        +E  GF    GNARLI+L GKLL A          I+ WA AMNLFEVAHF+P+KP+YEQ  I LP
Subjt:  KEPLGF---GGNARLIHLDGKLLEAGSCSPCQINRILLWAEAMNLFEVAHFIPKKPIYEQRSIFLP


Sequences
CDS sequence
CTGGTCGTAACCAAGAAAGAACCCCTTGGTTTCGGTGGGAATGCCCGACTTATCCATTTAGACGGTAAACTACTGGAGGCGGGCTCATGTAGCCCATGCCAGATTAATCG
TATTCTGTTATGGGCCGAAGCAATGAACCTATTCGAAGTGGCTCATTTTATACCTAAGAAGCCCATCTATGAACAAAGATCCATTTTCCTTCCCCGCACGCCTAGCTACT
ACTCTAGGGGCCGACTGGACCCGAAGCTTCTCAAGCTCAACCAACCGTTTATGTTTGTAAGACCTCTTGTAGCTAACATTGGATCAAGGACCTACTCTACTACTGGTTTA
GGTAAATATCTAATGCGTTCTCCGACCATAGAGAAAAGGCATTTTGAGGGGAGAGATAGAAAGAAAGAGACTATGTGCTTTTGGGATCTACGTGCTCCTTGGTTAGAACC
AATAAGGGGTCCTAATCGTTTGGACTTGAAAATAAGGGAAGGGAAGGGAATACAGGGGCAGGGGAAGGAGTTCCATCCGGGAGAGCACTCAGGTAATATCGGTGAAGGAG
AAAGATTAGTTCAAGAGTTCAATCTATCTGGTTCAGTAGTTTACCGGGTGAAATCAGGTTCAGTTCCATCTGGGAAGGGAATACTGTCATTCCGGACAGGAGAGGAAAAG
AAGAATATAGTCGTCCAGTTTACCCTCTTCAATCAGGAGAGTACGTCAGGGAAAGAAGGGACAAGGGAAGGAGTGAAAGCAAGAAGGTTCATGAGTTTACAGTTCAAGAA
TACATAG
mRNA sequence
CTGGTCGTAACCAAGAAAGAACCCCTTGGTTTCGGTGGGAATGCCCGACTTATCCATTTAGACGGTAAACTACTGGAGGCGGGCTCATGTAGCCCATGCCAGATTAATCG
TATTCTGTTATGGGCCGAAGCAATGAACCTATTCGAAGTGGCTCATTTTATACCTAAGAAGCCCATCTATGAACAAAGATCCATTTTCCTTCCCCGCACGCCTAGCTACT
ACTCTAGGGGCCGACTGGACCCGAAGCTTCTCAAGCTCAACCAACCGTTTATGTTTGTAAGACCTCTTGTAGCTAACATTGGATCAAGGACCTACTCTACTACTGGTTTA
GGTAAATATCTAATGCGTTCTCCGACCATAGAGAAAAGGCATTTTGAGGGGAGAGATAGAAAGAAAGAGACTATGTGCTTTTGGGATCTACGTGCTCCTTGGTTAGAACC
AATAAGGGGTCCTAATCGTTTGGACTTGAAAATAAGGGAAGGGAAGGGAATACAGGGGCAGGGGAAGGAGTTCCATCCGGGAGAGCACTCAGGTAATATCGGTGAAGGAG
AAAGATTAGTTCAAGAGTTCAATCTATCTGGTTCAGTAGTTTACCGGGTGAAATCAGGTTCAGTTCCATCTGGGAAGGGAATACTGTCATTCCGGACAGGAGAGGAAAAG
AAGAATATAGTCGTCCAGTTTACCCTCTTCAATCAGGAGAGTACGTCAGGGAAAGAAGGGACAAGGGAAGGAGTGAAAGCAAGAAGGTTCATGAGTTTACAGTTCAAGAA
TACATAG
Protein sequence
LVVTKKEPLGFGGNARLIHLDGKLLEAGSCSPCQINRILLWAEAMNLFEVAHFIPKKPIYEQRSIFLPRTPSYYSRGRLDPKLLKLNQPFMFVRPLVANIGSRTYSTTGL
GKYLMRSPTIEKRHFEGRDRKKETMCFWDLRAPWLEPIRGPNRLDLKIREGKGIQGQGKEFHPGEHSGNIGEGERLVQEFNLSGSVVYRVKSGSVPSGKGILSFRTGEEK
KNIVVQFTLFNQESTSGKEGTREGVKARRFMSLQFKNT
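
The protein sequence above appears to be the frame-1 translation of the CDS under the standard genetic code. The following is a minimal sketch of that relationship, not part of the database record: the function name `translate` and the compact codon-table construction are illustrative assumptions, and only the first 39 nucleotides and the last 9 nucleotides of the CDS are used as input.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 1; '*' marks a stop codon.

    Hypothetical helper for illustration. Trailing bases that do not
    form a complete codon are ignored.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First 39 nucleotides of the CDS listed above.
cds_start = "CTGGTCGTAACCAAGAAAGAACCCCTTGGTTTCGGTGGG"
print(translate(cds_start))  # LVVTKKEPLGFGG

# Last 9 nucleotides of the CDS listed above, ending in the stop codon TAG.
cds_end = "AATACATAG"
print(translate(cds_end))  # NT*
```

The first output matches the start of the listed protein sequence (LVVTKKEPLGFGG...) and the second matches its end (...NT) followed by the stop codon, which the protein listing omits.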