CuGenDBv2

CmoCh15G010400 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh15G010400
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: Photosystem II CP43 reaction center protein
Genome location: Cmo_Chr15:6589440..6591632
RNA-Seq Expression: CmoCh15G010400
Synteny: CmoCh15G010400
Gene Ontology terms:
GO:0009772 - photosynthetic electron transport in photosystem II (biological process)
GO:0009521 - photosystem (cellular component)
GO:0009536 - plastid (cellular component)
GO:0016168 - chlorophyll binding (molecular function)
GO:0045156 - electron transporter, transferring electrons within the cyclic electron transport pathway of photosynthesis activity (molecular function)
InterPro domains:
IPR000932 - Photosystem antenna protein-like
IPR036001 - Photosystem antenna protein-like superfamily
IPR036854 - Photosystem II protein D1/D2 superfamily
IPR044900 - Photosystem II CP43 reaction centre protein superfamily
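
The genome location field above follows a "chromosome:start..end" layout. As a minimal sketch, the string can be split into its parts and the span length computed. The helper name parse_location is hypothetical, and the 1-based inclusive coordinate convention is an assumption that this page does not state.

```python
import re

def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

chrom, start, end = parse_location("Cmo_Chr15:6589440..6591632")

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 positions.
span_length = end - start + 1
print(chrom, start, end, span_length)
```

Under that assumption the gene spans 2193 positions on Cmo_Chr15.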


Homology
GenBank top hits (e value, % identity, alignment):
KAF4348575.1 hypothetical protein F8388_000254 [Cannabis sativa] (e value: 2.1e-162; identity: 62.55%)
Query:  MSALGVVGLALNLRTYDFISQEIRAGEDLEFETFYTKNLLLDEGIRAWMAAQDQPHENTPY----IPLGAGCDQETTSFAWWAGNARLINLSDKLLGAHV
        MSALGVVGLALNLR YDF+SQEIRA ED EFETFYTKN+LL+EGIRAWMAAQDQPHEN  +    +P  AG DQETT FAWWAGNARLINLS KLLGAHV
Subjt:  MSALGVVGLALNLRTYDFISQEIRAGEDLEFETFYTKNLLLDEGIRAWMAAQDQPHENTPY----IPLGAGCDQETTSFAWWAGNARLINLSDKLLGAHV

Query:  AHAELIVFWAGAMNLFEVVHFVPEKPMYEQGLIFLPPLATLGWGVGP-----------------------------------------------------
        AHA LIVFWAGAMNLFEV HFVPEKPMYEQGLI LP LATLGWGVGP                                                     
Subjt:  AHAELIVFWAGAMNLFEVVHFVPEKPMYEQGLIFLPPLATLGWGVGP-----------------------------------------------------

Query:  --------------GIGAFLLVFKALYFG---------------------------------------------------------RSICILGGIWHILT
                      GIGAFLLVFKALYFG                                                          SICILGGIWHILT
Subjt:  --------------GIGAFLLVFKALYFG---------------------------------------------------------RSICILGGIWHILT

Query:  KSFAWARHALVCSGEAYLSYSLGALSVFGFIACCFVWFNNTAYPSDFYRPTRPEASQAQAFTFLVRDQCLVANVGSAQGPTGLGKYLMLRP--EKSFLGE
        K FAWAR ALV SGEAYLSYSLGALSVFGFIACCFVWFNNTAYPS+FY PT PEASQAQAFTFLVRDQ L ANVGSAQGPTGLGKYLM  P  E  F GE
Subjt:  KSFAWARHALVCSGEAYLSYSLGALSVFGFIACCFVWFNNTAYPSDFYRPTRPEASQAQAFTFLVRDQCLVANVGSAQGPTGLGKYLMLRP--EKSFLGE

Query:  RLCVFGI-CVLLEPLRDPNGLDLSRLKKDIQPWQERHSMEYMTHAPFGSLDSVGGVTIEINAINYVSPRSWLATSHFVLGFFLFVGHLWHVGKGSCSSPG
         +  + +    LEPLR PNGLDLSRLKKDIQPWQER S EYMTHAP GSL+SVGGV  EINA+NYVSPRSWLATSHFVLGFFLFVGHLWH G+   ++ G
Subjt:  RLCVFGI-CVLLEPLRDPNGLDLSRLKKDIQPWQERHSMEYMTHAPFGSLDSVGGVTIEINAINYVSPRSWLATSHFVLGFFLFVGHLWHVGKGSCSSPG

Query:  FEKGIDRDFE
        FEKGIDRD E
Subjt:  FEKGIDRDFE

KAF9660693.1 hypothetical protein SADUNF_SadunfPtG0002900 [Salix dunnii] (e value: 9.6e-163; identity: 62.55%)
Query:  MSALGVVGLALNLRTYDFISQEIRAGEDLEFETFYTKNLLLDEGIRAWMAAQDQPHENTPY----IPLGAGCDQETTSFAWWAGNARLINLSDKLLGAHV
        MSALGVVGLALNLR YDF+SQEIRA ED EFETFYTKN+LL+EGIRAWMAAQDQPHEN  +    +P  AG DQETT FAWWAGNARLINLS KLLGAHV
Subjt:  MSALGVVGLALNLRTYDFISQEIRAGEDLEFETFYTKNLLLDEGIRAWMAAQDQPHENTPY----IPLGAGCDQETTSFAWWAGNARLINLSDKLLGAHV

Query:  AHAELIVFWAGAMNLFEVVHFVPEKPMYEQGLIFLPPLATLGWGVGP-----------------------------------------------------
        AHA LIVFWAGAMNLFEV HFVPEKPMYEQGLI LP LATLGWGVGP                                                     
Subjt:  AHAELIVFWAGAMNLFEVVHFVPEKPMYEQGLIFLPPLATLGWGVGP-----------------------------------------------------

Query:  --------------GIGAFLLVFKALYFG---------------------------------------------------------RSICILGGIWHILT
                      GIGAFLLVFKALYFG                                                          SICILGGIWHILT
Subjt:  --------------GIGAFLLVFKALYFG---------------------------------------------------------RSICILGGIWHILT

Query:  KSFAWARHALVCSGEAYLSYSLGALSVFGFIACCFVWFNNTAYPSDFYRPTRPEASQAQAFTFLVRDQCLVANVGSAQGPTGLGKYLMLRP--EKSFLGE
        K FAWAR ALV SGEAYLSYSLGAL+VFGFIACCFVWFNNTAYPS+FY PT PEASQAQAFTFLVRDQ L ANVGSAQGPTGLGKYLM  P  E  F GE
Subjt:  KSFAWARHALVCSGEAYLSYSLGALSVFGFIACCFVWFNNTAYPSDFYRPTRPEASQAQAFTFLVRDQCLVANVGSAQGPTGLGKYLMLRP--EKSFLGE

Query:  RLCVFGI-CVLLEPLRDPNGLDLSRLKKDIQPWQERHSMEYMTHAPFGSLDSVGGVTIEINAINYVSPRSWLATSHFVLGFFLFVGHLWHVGKGSCSSPG
         +  + +    LEPLR PNGLDLSRLKKDIQPWQER S EYMTHAP GSL+SVGGV  EINA+NYVSPRSWLATSHFVLGFFLFVGHLWH G+   ++ G
Subjt:  RLCVFGI-CVLLEPLRDPNGLDLSRLKKDIQPWQERHSMEYMTHAPFGSLDSVGGVTIEINAINYVSPRSWLATSHFVLGFFLFVGHLWHVGKGSCSSPG

Query:  FEKGIDRDFE
        FEKGIDRDFE
Subjt:  FEKGIDRDFE

XP_022975445.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111474790 [Cucurbita maxima] (e value: 4.8e-162; identity: 61.85%)
Query:  MSALGVVGLALNLRTYDFISQEIRAGEDLEFETFYTKNLLLDEGIRAWMAAQDQPHENTPY----IPLG---------AGCDQETTSFAWWAGNARLINL
        MSALGVVGLALNLR YDF+SQEIRA ED EFETFYTKN+LL+EGIRAWMAAQDQPHEN  +    +P G         AG DQETT FAWWAGNARLINL
Subjt:  MSALGVVGLALNLRTYDFISQEIRAGEDLEFETFYTKNLLLDEGIRAWMAAQDQPHENTPY----IPLG---------AGCDQETTSFAWWAGNARLINL

Query:  SDKLLGAHVAHAELIVFWAGAMNLFEVVHFVPEKPMYEQGLIFLPPLATLGWGVGP--------------------------------------------
        S KLLGAHVAHA LIVFWAGAMNLFEV HFVPEKPMYEQGLI LP LATLGWGVGP                                            
Subjt:  SDKLLGAHVAHAELIVFWAGAMNLFEVVHFVPEKPMYEQGLIFLPPLATLGWGVGP--------------------------------------------

Query:  -----------------------GIGAFLLVFKALYFG---------------------------------------------------------RSICI
                               GIGAFLLVFKALYFG                                                          SICI
Subjt:  -----------------------GIGAFLLVFKALYFG---------------------------------------------------------RSICI

Query:  LGGIWHILTKSFAWARHALVCSGEAYLSYSLGALSVFGFIACCFVWFNNTAYPSDFYRPTRPEASQAQAFTFLVRDQCLVANVGSAQGPTGLGKYLMLRP
        LGGIWHILTK FAWAR ALV SGEAYLSYSLGALSVFGFIACCFVWFNNTAYPS+FY PT PEASQAQAFTFLVRDQ L ANVGSAQGPTGLGKYLM  P
Subjt:  LGGIWHILTKSFAWARHALVCSGEAYLSYSLGALSVFGFIACCFVWFNNTAYPSDFYRPTRPEASQAQAFTFLVRDQCLVANVGSAQGPTGLGKYLMLRP

Query:  --EKSFLGERLCVFGI-CVLLEPLRDPNGLDLSRLKKDIQPWQERHSMEYMTHAPFGSLDSVGGVTIEINAINYVSPRSWLATSHFVLGFFLFVGHLWHV
          E  F GE +  + +    LEPLR PNGLDLSRLKKDIQPWQER S EYMTHAP GSL+SVGGV  EINA+NYVSPRSWLATSHFVLGFFLFVGHLWH 
Subjt:  --EKSFLGERLCVFGI-CVLLEPLRDPNGLDLSRLKKDIQPWQERHSMEYMTHAPFGSLDSVGGVTIEINAINYVSPRSWLATSHFVLGFFLFVGHLWHV

Query:  GKGSCSSPGFEKGIDRDFE
        G+   ++ GFEKGIDRDFE
Subjt:  GKGSCSSPGFEKGIDRDFE

XP_029152626.1 LOW QUALITY PROTEIN: uncharacterized protein LOC114927083 [Arachis hypogaea] (e value: 1.4e-161; identity: 61.66%)
Query:  MSALGVVGLALNLRTYDFISQEIRAGEDLEFETFYTKNLLLDEGIRAWMAAQDQPHENTPY----IPLG---------AGCDQETTSFAWWAGNARLINL
        MSALGVVGLALNLR YDF+SQEIRA ED EFETFYTKN+LL+EGIRAWMAAQDQPHEN  +    +P G          G DQETT FAWWAGNARLINL
Subjt:  MSALGVVGLALNLRTYDFISQEIRAGEDLEFETFYTKNLLLDEGIRAWMAAQDQPHENTPY----IPLG---------AGCDQETTSFAWWAGNARLINL

Query:  SDKLLGAHVAHAELIVFWAGAMNLFEVVHFVPEKPMYEQGLIFLPPLATLGWGVGP--------------------------------------------
        S KLLGAHVAHA LIVFWAGAMNLFEV HFVPEKPMYEQGLI LP LATLGWGVGP                                            
Subjt:  SDKLLGAHVAHAELIVFWAGAMNLFEVVHFVPEKPMYEQGLIFLPPLATLGWGVGP--------------------------------------------

Query:  -----------------------GIGAFLLVFKALYFG---------------------------------------------------------RSICI
                               GIGAFLLVFKALYFG                                                          SICI
Subjt:  -----------------------GIGAFLLVFKALYFG---------------------------------------------------------RSICI

Query:  LGGIWHILTKSFAWARHALVCSGEAYLSYSLGALSVFGFIACCFVWFNNTAYPSDFYRPTRPEASQAQAFTFLVRDQCLVANVGSAQGPTGLGKYLMLRP
        LGGIWHILTK FAWAR ALV SGEAYLSYSLGALSVFGFIACCFVWFNNTAYPS+FY PT PEASQAQAFTFLVRDQ L ANVGSAQGPTGLGKYLM  P
Subjt:  LGGIWHILTKSFAWARHALVCSGEAYLSYSLGALSVFGFIACCFVWFNNTAYPSDFYRPTRPEASQAQAFTFLVRDQCLVANVGSAQGPTGLGKYLMLRP

Query:  --EKSFLGERLCVFGI-CVLLEPLRDPNGLDLSRLKKDIQPWQERHSMEYMTHAPFGSLDSVGGVTIEINAINYVSPRSWLATSHFVLGFFLFVGHLWHV
          E  F GE +  + +    LEPLR PNGLDLSRLKKDIQPWQER S EYMTHAP GSL+SVGGV  EINA+NYVSPRSWLATSHFVLGFFLFVGHLWH 
Subjt:  --EKSFLGERLCVFGI-CVLLEPLRDPNGLDLSRLKKDIQPWQERHSMEYMTHAPFGSLDSVGGVTIEINAINYVSPRSWLATSHFVLGFFLFVGHLWHV

Query:  GKGSCSSPGFEKGIDRDFE
        G+   ++ GFEKGIDRDFE
Subjt:  GKGSCSSPGFEKGIDRDFE

XP_040954099.1 LOW QUALITY PROTEIN: photosystem II CP43 reaction center protein-like [Gossypium hirsutum] (e value: 1.4e-161; identity: 61.66%)
Query:  MSALGVVGLALNLRTYDFISQEIRAGEDLEFETFYTKNLLLDEGIRAWMAAQDQPHENTPY----IPLG---------AGCDQETTSFAWWAGNARLINL
        MSALGVVGLALNLR YDF+SQEIRA ED EFETFYTKN+LL+EGIRAWMAAQDQPHEN  +    +P G         AG DQETT FAWWAGNARLINL
Subjt:  MSALGVVGLALNLRTYDFISQEIRAGEDLEFETFYTKNLLLDEGIRAWMAAQDQPHENTPY----IPLG---------AGCDQETTSFAWWAGNARLINL

Query:  SDKLLGAHVAHAELIVFWAGAMNLFEVVHFVPEKPMYEQGLIFLPPLATLGWGVGP--------------------------------------------
        S KLLGAHVAHA LIVFWAGAMNLFEV HFVPEKPMYEQGLI LP LATLGWGVGP                                            
Subjt:  SDKLLGAHVAHAELIVFWAGAMNLFEVVHFVPEKPMYEQGLIFLPPLATLGWGVGP--------------------------------------------

Query:  -----------------------GIGAFLLVFKALYFG---------------------------------------------------------RSICI
                               GIGAFLLVFKALYFG                                                          SICI
Subjt:  -----------------------GIGAFLLVFKALYFG---------------------------------------------------------RSICI

Query:  LGGIWHILTKSFAWARHALVCSGEAYLSYSLGALSVFGFIACCFVWFNNTAYPSDFYRPTRPEASQAQAFTFLVRDQCLVANVGSAQGPTGLGKYLMLRP
         GGIWHILTK FAWAR ALV SGEAYLSYSLGALSVFGFIACCFVWFNNTAYPS+FY PT PEASQAQAFTFLVRDQ L ANVGSAQGPTGLGKYLM  P
Subjt:  LGGIWHILTKSFAWARHALVCSGEAYLSYSLGALSVFGFIACCFVWFNNTAYPSDFYRPTRPEASQAQAFTFLVRDQCLVANVGSAQGPTGLGKYLMLRP

Query:  --EKSFLGERLCVFGI-CVLLEPLRDPNGLDLSRLKKDIQPWQERHSMEYMTHAPFGSLDSVGGVTIEINAINYVSPRSWLATSHFVLGFFLFVGHLWHV
          E  F GE +  + +    LEPLR PNGLDLSRLKKDIQPWQER S EYMTHAP GSL+SVGGV  EINA+NYVSPRSWLATSHFVLGFFLFVGHLWH 
Subjt:  --EKSFLGERLCVFGI-CVLLEPLRDPNGLDLSRLKKDIQPWQERHSMEYMTHAPFGSLDSVGGVTIEINAINYVSPRSWLATSHFVLGFFLFVGHLWHV

Query:  GKGSCSSPGFEKGIDRDFE
        G+   ++ GFEKGIDRDFE
Subjt:  GKGSCSSPGFEKGIDRDFE

TrEMBL top hits (e value, % identity, alignment):
A0A6A5MPC5 Photosystem II D2 protein (e value: 8.8e-162; identity: 62.16%)
Query:  MSALGVVGLALNLRTYDFISQEIRAGEDLEFETFYTKNLLLDEGIRAWMAAQDQPHENTPY----IPLGAGCDQETTSFAWWAGNARLINLSDKLLGAHV
        MSALGVVGLALNLR YDF+SQEIRA ED EFETFYTKN+LL+EGIRAWMAAQDQPHEN  +    +P   G DQETT FAWWAGNARLINLS KLLGAHV
Subjt:  MSALGVVGLALNLRTYDFISQEIRAGEDLEFETFYTKNLLLDEGIRAWMAAQDQPHENTPY----IPLGAGCDQETTSFAWWAGNARLINLSDKLLGAHV

Query:  AHAELIVFWAGAMNLFEVVHFVPEKPMYEQGLIFLPPLATLGWGVGP-----------------------------------------------------
        AHA LIVFWAGAMNLFEV HFVPEKPMYEQGLI LP LATLGWGVGP                                                     
Subjt:  AHAELIVFWAGAMNLFEVVHFVPEKPMYEQGLIFLPPLATLGWGVGP-----------------------------------------------------

Query:  --------------GIGAFLLVFKALYFG---------------------------------------------------------RSICILGGIWHILT
                      GIGAFLLVFKALYFG                                                          SICI GGIWHILT
Subjt:  --------------GIGAFLLVFKALYFG---------------------------------------------------------RSICILGGIWHILT

Query:  KSFAWARHALVCSGEAYLSYSLGALSVFGFIACCFVWFNNTAYPSDFYRPTRPEASQAQAFTFLVRDQCLVANVGSAQGPTGLGKYLMLRP--EKSFLGE
        K FAWAR ALV SGEAYLSYSL ALSVFGFIACCFVWFNNTAYPS+FY PT PEASQAQAFTFLVRDQ L ANVGSAQGPTGLGKYLM  P  E  F GE
Subjt:  KSFAWARHALVCSGEAYLSYSLGALSVFGFIACCFVWFNNTAYPSDFYRPTRPEASQAQAFTFLVRDQCLVANVGSAQGPTGLGKYLMLRP--EKSFLGE

Query:  RLCVFGI-CVLLEPLRDPNGLDLSRLKKDIQPWQERHSMEYMTHAPFGSLDSVGGVTIEINAINYVSPRSWLATSHFVLGFFLFVGHLWHVGKGSCSSPG
         +  + +    LEPLR PNGLDLSRLKKDIQPWQER S EYMTHAP GSL+SVGGV  EINA+NYVSPRSWLATSHFVLGFFLFVGHLWH G+   ++ G
Subjt:  RLCVFGI-CVLLEPLRDPNGLDLSRLKKDIQPWQERHSMEYMTHAPFGSLDSVGGVTIEINAINYVSPRSWLATSHFVLGFFLFVGHLWHVGKGSCSSPG

Query:  FEKGIDRDFE
        FEKGIDRDFE
Subjt:  FEKGIDRDFE

A0A6J1IGQ7 Photosystem II D2 protein (e value: 2.3e-162; identity: 61.85%)
Query:  MSALGVVGLALNLRTYDFISQEIRAGEDLEFETFYTKNLLLDEGIRAWMAAQDQPHENTPY----IPLG---------AGCDQETTSFAWWAGNARLINL
        MSALGVVGLALNLR YDF+SQEIRA ED EFETFYTKN+LL+EGIRAWMAAQDQPHEN  +    +P G         AG DQETT FAWWAGNARLINL
Subjt:  MSALGVVGLALNLRTYDFISQEIRAGEDLEFETFYTKNLLLDEGIRAWMAAQDQPHENTPY----IPLG---------AGCDQETTSFAWWAGNARLINL

Query:  SDKLLGAHVAHAELIVFWAGAMNLFEVVHFVPEKPMYEQGLIFLPPLATLGWGVGP--------------------------------------------
        S KLLGAHVAHA LIVFWAGAMNLFEV HFVPEKPMYEQGLI LP LATLGWGVGP                                            
Subjt:  SDKLLGAHVAHAELIVFWAGAMNLFEVVHFVPEKPMYEQGLIFLPPLATLGWGVGP--------------------------------------------

Query:  -----------------------GIGAFLLVFKALYFG---------------------------------------------------------RSICI
                               GIGAFLLVFKALYFG                                                          SICI
Subjt:  -----------------------GIGAFLLVFKALYFG---------------------------------------------------------RSICI

Query:  LGGIWHILTKSFAWARHALVCSGEAYLSYSLGALSVFGFIACCFVWFNNTAYPSDFYRPTRPEASQAQAFTFLVRDQCLVANVGSAQGPTGLGKYLMLRP
        LGGIWHILTK FAWAR ALV SGEAYLSYSLGALSVFGFIACCFVWFNNTAYPS+FY PT PEASQAQAFTFLVRDQ L ANVGSAQGPTGLGKYLM  P
Subjt:  LGGIWHILTKSFAWARHALVCSGEAYLSYSLGALSVFGFIACCFVWFNNTAYPSDFYRPTRPEASQAQAFTFLVRDQCLVANVGSAQGPTGLGKYLMLRP

Query:  --EKSFLGERLCVFGI-CVLLEPLRDPNGLDLSRLKKDIQPWQERHSMEYMTHAPFGSLDSVGGVTIEINAINYVSPRSWLATSHFVLGFFLFVGHLWHV
          E  F GE +  + +    LEPLR PNGLDLSRLKKDIQPWQER S EYMTHAP GSL+SVGGV  EINA+NYVSPRSWLATSHFVLGFFLFVGHLWH 
Subjt:  --EKSFLGERLCVFGI-CVLLEPLRDPNGLDLSRLKKDIQPWQERHSMEYMTHAPFGSLDSVGGVTIEINAINYVSPRSWLATSHFVLGFFLFVGHLWHV

Query:  GKGSCSSPGFEKGIDRDFE
        G+   ++ GFEKGIDRDFE
Subjt:  GKGSCSSPGFEKGIDRDFE

A0A6P5GV07 Photosystem II D2 protein (e value: 2.4e-159; identity: 60.69%)
Query:  MSALGVVGLALNLRTYDFISQEIRAGEDLEFETFYTKNLLLDEGIRAWMAAQDQPHENTPY----IPLG---------AGCDQETTSFAWWAGNARLINL
        MSA+GVVGLALNLR YDF+SQEIRA ED EFETFYTKN+LL+EGIRAWMAAQDQPHEN  +    +P G         AG DQETT FAWWAGNARLINL
Subjt:  MSALGVVGLALNLRTYDFISQEIRAGEDLEFETFYTKNLLLDEGIRAWMAAQDQPHENTPY----IPLG---------AGCDQETTSFAWWAGNARLINL

Query:  SDKLLGAHVAHAELIVFWAGAMNLFEVVHFVPEKPMYEQGLIFLPPLATLGWGVGP--------------------------------------------
        S KLLGAHVAHA LIVFWAGAMNLFEV HFVPE+PMYEQGLI LP LATLGWGVGP                                            
Subjt:  SDKLLGAHVAHAELIVFWAGAMNLFEVVHFVPEKPMYEQGLIFLPPLATLGWGVGP--------------------------------------------

Query:  -----------------------GIGAFLLVFKALYFG---------------------------------------------------------RSICI
                               GIGAFLLV KALYFG                                                          SICI
Subjt:  -----------------------GIGAFLLVFKALYFG---------------------------------------------------------RSICI

Query:  LGGIWHILTKSFAWARHALVCSGEAYLSYSLGALSVFGFIACCFVWFNNTAYPSDFYRPTRPEASQAQAFTFLVRDQCLVANVGSAQGPTGLGKYLMLRP
        LGGIWHILTK FAWAR A V SGEAYLSYSLGALSVFGFIACCFVWFNNTAYPS+FY PT PEASQAQAFTFLVRDQ L ANVGSAQGPTGLGKYLM  P
Subjt:  LGGIWHILTKSFAWARHALVCSGEAYLSYSLGALSVFGFIACCFVWFNNTAYPSDFYRPTRPEASQAQAFTFLVRDQCLVANVGSAQGPTGLGKYLMLRP

Query:  --EKSFLGERLCVFGI-CVLLEPLRDPNGLDLSRLKKDIQPWQERHSMEYMTHAPFGSLDSVGGVTIEINAINYVSPRSWLATSHFVLGFFLFVGHLWHV
          E  F GE +  + +    LEPLR PNGLDLSRLKKDIQPWQER S EYMTHAP GSL+SVGGV  EINA+NYVSPRSWLATSHFVLGFF FVGHLWH 
Subjt:  --EKSFLGERLCVFGI-CVLLEPLRDPNGLDLSRLKKDIQPWQERHSMEYMTHAPFGSLDSVGGVTIEINAINYVSPRSWLATSHFVLGFFLFVGHLWHV

Query:  GKGSCSSPGFEKGIDRDFE
        G+   ++ GFEKGIDRD E
Subjt:  GKGSCSSPGFEKGIDRDFE

A0A6P5MJ25 Photosystem II D2 protein (e value: 2.4e-159; identity: 60.89%)
Query:  MSALGVVGLALNLRTYDFISQEIRAGEDLEFETFYTKNLLLDEGIRAWMAAQDQPHENTPY----IPLG---------AGCDQETTSFAWWAGNARLINL
        MSALGVVGLALNLR +DF+SQEIRA ED EFETFYTKN+LL+EGIRAWMAAQDQPHEN  +    +P G          G DQETT FAWWAGNARLINL
Subjt:  MSALGVVGLALNLRTYDFISQEIRAGEDLEFETFYTKNLLLDEGIRAWMAAQDQPHENTPY----IPLG---------AGCDQETTSFAWWAGNARLINL

Query:  SDKLLGAHVAHAELIVFWAGAMNLFEVVHFVPEKPMYEQGLIFLPPLATLGWGVGP--------------------------------------------
        S KLLGAHVAHA LIVFWAGAMNLF+V HFVPEKPMYEQGLI LP LATLGWGVGP                                            
Subjt:  SDKLLGAHVAHAELIVFWAGAMNLFEVVHFVPEKPMYEQGLIFLPPLATLGWGVGP--------------------------------------------

Query:  -----------------------GIGAFLLVFKALYFG---------------------------------------------------------RSICI
                               GIGAFLLVFKALYFG                                                          SICI
Subjt:  -----------------------GIGAFLLVFKALYFG---------------------------------------------------------RSICI

Query:  LGGIWHILTKSFAWARHALVCSGEAYLSYSLGALSVFGFIACCFVWFNNTAYPSDFYRPTRPEASQAQAFTFLVRDQCLVANVGSAQGPTGLGKYLMLRP
        LGGIWHILTK FAWAR ALV SG AYLSYSLGALSVFGFIACCFVWFNNTAYPS+FY PT PEASQAQAFTFLVRDQ L ANVGSAQGPTGLGKYLM  P
Subjt:  LGGIWHILTKSFAWARHALVCSGEAYLSYSLGALSVFGFIACCFVWFNNTAYPSDFYRPTRPEASQAQAFTFLVRDQCLVANVGSAQGPTGLGKYLMLRP

Query:  --EKSFLGERLCVFGI-CVLLEPLRDPNGLDLSRLKKDIQPWQERHSMEYMTHAPFGSLDSVGGVTIEINAINYVSPRSWLATSHFVLGFFLFVGHLWHV
          E  F G  +  + +    LEPLR PNGLDLSRLKKDIQPWQER S EYMTHAP GSL+SVGGV  EINA+NYVSPRSWLATSHFVLGFFLFVGHLWH 
Subjt:  --EKSFLGERLCVFGI-CVLLEPLRDPNGLDLSRLKKDIQPWQERHSMEYMTHAPFGSLDSVGGVTIEINAINYVSPRSWLATSHFVLGFFLFVGHLWHV

Query:  GKGSCSSPGFEKGIDRDFE
        G+   ++ GFEKGIDRDFE
Subjt:  GKGSCSSPGFEKGIDRDFE

A0A7J6DR41 Photosystem II D2 protein (e value: 1.0e-162; identity: 62.55%)
Query:  MSALGVVGLALNLRTYDFISQEIRAGEDLEFETFYTKNLLLDEGIRAWMAAQDQPHENTPY----IPLGAGCDQETTSFAWWAGNARLINLSDKLLGAHV
        MSALGVVGLALNLR YDF+SQEIRA ED EFETFYTKN+LL+EGIRAWMAAQDQPHEN  +    +P  AG DQETT FAWWAGNARLINLS KLLGAHV
Subjt:  MSALGVVGLALNLRTYDFISQEIRAGEDLEFETFYTKNLLLDEGIRAWMAAQDQPHENTPY----IPLGAGCDQETTSFAWWAGNARLINLSDKLLGAHV

Query:  AHAELIVFWAGAMNLFEVVHFVPEKPMYEQGLIFLPPLATLGWGVGP-----------------------------------------------------
        AHA LIVFWAGAMNLFEV HFVPEKPMYEQGLI LP LATLGWGVGP                                                     
Subjt:  AHAELIVFWAGAMNLFEVVHFVPEKPMYEQGLIFLPPLATLGWGVGP-----------------------------------------------------

Query:  --------------GIGAFLLVFKALYFG---------------------------------------------------------RSICILGGIWHILT
                      GIGAFLLVFKALYFG                                                          SICILGGIWHILT
Subjt:  --------------GIGAFLLVFKALYFG---------------------------------------------------------RSICILGGIWHILT

Query:  KSFAWARHALVCSGEAYLSYSLGALSVFGFIACCFVWFNNTAYPSDFYRPTRPEASQAQAFTFLVRDQCLVANVGSAQGPTGLGKYLMLRP--EKSFLGE
        K FAWAR ALV SGEAYLSYSLGALSVFGFIACCFVWFNNTAYPS+FY PT PEASQAQAFTFLVRDQ L ANVGSAQGPTGLGKYLM  P  E  F GE
Subjt:  KSFAWARHALVCSGEAYLSYSLGALSVFGFIACCFVWFNNTAYPSDFYRPTRPEASQAQAFTFLVRDQCLVANVGSAQGPTGLGKYLMLRP--EKSFLGE

Query:  RLCVFGI-CVLLEPLRDPNGLDLSRLKKDIQPWQERHSMEYMTHAPFGSLDSVGGVTIEINAINYVSPRSWLATSHFVLGFFLFVGHLWHVGKGSCSSPG
         +  + +    LEPLR PNGLDLSRLKKDIQPWQER S EYMTHAP GSL+SVGGV  EINA+NYVSPRSWLATSHFVLGFFLFVGHLWH G+   ++ G
Subjt:  RLCVFGI-CVLLEPLRDPNGLDLSRLKKDIQPWQERHSMEYMTHAPFGSLDSVGGVTIEINAINYVSPRSWLATSHFVLGFFLFVGHLWHVGKGSCSSPG

Query:  FEKGIDRDFE
        FEKGIDRD E
Subjt:  FEKGIDRDFE

SwissProt top hits (e value, % identity, alignment):
A4GGA1 Photosystem II CP43 reaction center protein (e value: 6.8e-135; identity: 60.45%)
Query:  GCDQETTSFAWWAGNARLINLSDKLLGAHVAHAELIVFWAGAMNLFEVVHFVPEKPMYEQGLIFLPPLATLGWGVGP-----------------------
        G DQETT FAWWAGNARLINLS KLLGAHVAHA LIVFWAGAMNLFEV HFVPEKPMYEQGLI LP LATLGWGVGP                       
Subjt:  GCDQETTSFAWWAGNARLINLSDKLLGAHVAHAELIVFWAGAMNLFEVVHFVPEKPMYEQGLIFLPPLATLGWGVGP-----------------------

Query:  --------------------------------------------GIGAFLLVFKALYFG-----------------------------------------
                                                    GIGAFLLVFKALYFG                                         
Subjt:  --------------------------------------------GIGAFLLVFKALYFG-----------------------------------------

Query:  ----------------RSICILGGIWHILTKSFAWARHALVCSGEAYLSYSLGALSVFGFIACCFVWFNNTAYPSDFYRPTRPEASQAQAFTFLVRDQCL
                         SICILGGIWHILTK FAWAR ALV SGEAYLSYSLGALSVFGFIACCFVWFNNTAYPS+FY PT PEASQAQAFTFLVRDQ L
Subjt:  ----------------RSICILGGIWHILTKSFAWARHALVCSGEAYLSYSLGALSVFGFIACCFVWFNNTAYPSDFYRPTRPEASQAQAFTFLVRDQCL

Query:  VANVGSAQGPTGLGKYLMLRP--EKSFLGERLCVFGI-CVLLEPLRDPNGLDLSRLKKDIQPWQERHSMEYMTHAPFGSLDSVGGVTIEINAINYVSPRS
         ANVGSAQGPTGLGKYLM  P  E  F GE +  + +    LEPLR PNGLDLSRLKKDIQPWQER S EYMTHAP GSL+SVGGV  EINA+NYVSPRS
Subjt:  VANVGSAQGPTGLGKYLMLRP--EKSFLGERLCVFGI-CVLLEPLRDPNGLDLSRLKKDIQPWQERHSMEYMTHAPFGSLDSVGGVTIEINAINYVSPRS

Query:  WLATSHFVLGFFLFVGHLWHVGKGSCSSPGFEKGIDRDFE
        WLATSHFVLGFFLFVGHLWH G+   ++ GFEKGIDRDFE
Subjt:  WLATSHFVLGFFLFVGHLWHVGKGSCSSPGFEKGIDRDFE

B1NWE6 Photosystem II CP43 reaction center protein (e value: 3.0e-135; identity: 60.54%)
Query:  AGCDQETTSFAWWAGNARLINLSDKLLGAHVAHAELIVFWAGAMNLFEVVHFVPEKPMYEQGLIFLPPLATLGWGVGP----------------------
        AG DQETT FAWWAGNARLINLS KLLGAHVAHA LIVFWAGAMNLFEV HFVPEKPMYEQGLI LP LATLGWGVGP                      
Subjt:  AGCDQETTSFAWWAGNARLINLSDKLLGAHVAHAELIVFWAGAMNLFEVVHFVPEKPMYEQGLIFLPPLATLGWGVGP----------------------

Query:  ---------------------------------------------GIGAFLLVFKALYFG----------------------------------------
                                                     GIGAFLLVFKALYFG                                        
Subjt:  ---------------------------------------------GIGAFLLVFKALYFG----------------------------------------

Query:  -----------------RSICILGGIWHILTKSFAWARHALVCSGEAYLSYSLGALSVFGFIACCFVWFNNTAYPSDFYRPTRPEASQAQAFTFLVRDQC
                          SICILGGIWHILTK FAWAR ALV SGEAYLSYSLGALSVFGFIACCFVWFNNTAYPS+FY PT PEASQAQAFTFLVRDQ 
Subjt:  -----------------RSICILGGIWHILTKSFAWARHALVCSGEAYLSYSLGALSVFGFIACCFVWFNNTAYPSDFYRPTRPEASQAQAFTFLVRDQC

Query:  LVANVGSAQGPTGLGKYLMLRP--EKSFLGERLCVFGI-CVLLEPLRDPNGLDLSRLKKDIQPWQERHSMEYMTHAPFGSLDSVGGVTIEINAINYVSPR
        L ANVGSAQGPTGLGKYLM  P  E  F GE +  + +    LEPLR PNGLDLSRLKKDIQPWQER S EYMTHAP GSL+SVGGV  EINA+NYVSPR
Subjt:  LVANVGSAQGPTGLGKYLMLRP--EKSFLGERLCVFGI-CVLLEPLRDPNGLDLSRLKKDIQPWQERHSMEYMTHAPFGSLDSVGGVTIEINAINYVSPR

Query:  SWLATSHFVLGFFLFVGHLWHVGKGSCSSPGFEKGIDRDFE
        SWLATSHFVLGFFLFVGHLWH G+   ++ GFEKGIDRDFE
Subjt:  SWLATSHFVLGFFLFVGHLWHVGKGSCSSPGFEKGIDRDFE

Q09X21 Photosystem II CP43 reaction center protein (e value: 5.2e-135; identity: 60.09%)
Query:  AGCDQETTSFAWWAGNARLINLSDKLLGAHVAHAELIVFWAGAMNLFEVVHFVPEKPMYEQGLIFLPPLATLGWGVGP----------------------
        AG DQETT FAWWAGNARLINLS KLLGAHVAHA LIVFWAGAMNLFEV HFVPEKPMYEQGLI LP LATLGWGVGP                      
Subjt:  AGCDQETTSFAWWAGNARLINLSDKLLGAHVAHAELIVFWAGAMNLFEVVHFVPEKPMYEQGLIFLPPLATLGWGVGP----------------------

Query:  ---------------------------------------------GIGAFLLVFKALYFG----------------------------------------
                                                     GIGAFLLVFKALYFG                                        
Subjt:  ---------------------------------------------GIGAFLLVFKALYFG----------------------------------------

Query:  -----------------RSICILGGIWHILTKSFAWARHALVCSGEAYLSYSLGALSVFGFIACCFVWFNNTAYPSDFYRPTRPEASQAQAFTFLVRDQC
                          SICILGGIWHILTK FAWAR ALV SGEAYLSYSLGALS+FGF+ACCFVWFNNTAYPS+FY PT PEASQAQAFTFLVRDQ 
Subjt:  -----------------RSICILGGIWHILTKSFAWARHALVCSGEAYLSYSLGALSVFGFIACCFVWFNNTAYPSDFYRPTRPEASQAQAFTFLVRDQC

Query:  LVANVGSAQGPTGLGKYLMLRP--EKSFLGERLCVFGI-CVLLEPLRDPNGLDLSRLKKDIQPWQERHSMEYMTHAPFGSLDSVGGVTIEINAINYVSPR
        L ANVGSAQGPTGLGKYLM  P  E  F GE +  + +    LEPLR PNGLDLSRLKKDIQPWQER S EYMTHAP GSL+SVGGV  EINA+NYVSPR
Subjt:  LVANVGSAQGPTGLGKYLMLRP--EKSFLGERLCVFGI-CVLLEPLRDPNGLDLSRLKKDIQPWQERHSMEYMTHAPFGSLDSVGGVTIEINAINYVSPR

Query:  SWLATSHFVLGFFLFVGHLWHVGKGSCSSPGFEKGIDRDFE
        SWLATSHFVLGFFLFVGHLWH G+   ++ GFEKGIDRDFE
Subjt:  SWLATSHFVLGFFLFVGHLWHVGKGSCSSPGFEKGIDRDFE

Q14FG1 Photosystem II CP43 reaction center protein (e value: 6.8e-135; identity: 60.32%)
Query:  AGCDQETTSFAWWAGNARLINLSDKLLGAHVAHAELIVFWAGAMNLFEVVHFVPEKPMYEQGLIFLPPLATLGWGVGP----------------------
        AG DQETT FAWWAGNARLINLS KLLGAHVAHA LIVFWAGAMNLFEV HFVPEKPMYEQGLI LP LATLGWGVGP                      
Subjt:  AGCDQETTSFAWWAGNARLINLSDKLLGAHVAHAELIVFWAGAMNLFEVVHFVPEKPMYEQGLIFLPPLATLGWGVGP----------------------

Query:  ---------------------------------------------GIGAFLLVFKALYFG----------------------------------------
                                                     GIGAFLLVFKALYFG                                        
Subjt:  ---------------------------------------------GIGAFLLVFKALYFG----------------------------------------

Query:  -----------------RSICILGGIWHILTKSFAWARHALVCSGEAYLSYSLGALSVFGFIACCFVWFNNTAYPSDFYRPTRPEASQAQAFTFLVRDQC
                          SICILGGIWHILTK FAWAR ALV SGEAYLSYSLGAL+VFGFIACCFVWFNNTAYPS+FY PT PEASQAQAFTFLVRDQ 
Subjt:  -----------------RSICILGGIWHILTKSFAWARHALVCSGEAYLSYSLGALSVFGFIACCFVWFNNTAYPSDFYRPTRPEASQAQAFTFLVRDQC

Query:  LVANVGSAQGPTGLGKYLMLRP--EKSFLGERLCVFGI-CVLLEPLRDPNGLDLSRLKKDIQPWQERHSMEYMTHAPFGSLDSVGGVTIEINAINYVSPR
        L ANVGSAQGPTGLGKYLM  P  E  F GE +  + +    LEPLR PNGLDLSRLKKDIQPWQER S EYMTHAP GSL+SVGGV  EINA+NYVSPR
Subjt:  LVANVGSAQGPTGLGKYLMLRP--EKSFLGERLCVFGI-CVLLEPLRDPNGLDLSRLKKDIQPWQERHSMEYMTHAPFGSLDSVGGVTIEINAINYVSPR

Query:  SWLATSHFVLGFFLFVGHLWHVGKGSCSSPGFEKGIDRDFE
        SWLATSHFVLGFFLFVGHLWH G+   ++ GFEKGIDRDFE
Subjt:  SWLATSHFVLGFFLFVGHLWHVGKGSCSSPGFEKGIDRDFE

Q2QD93 Photosystem II CP43 reaction center protein (e value: 4.0e-135; identity: 60.32%)
Query:  AGCDQETTSFAWWAGNARLINLSDKLLGAHVAHAELIVFWAGAMNLFEVVHFVPEKPMYEQGLIFLPPLATLGWGVGP----------------------
        AG DQETT FAWWAGNARLINLS KLLGAHVAHA LIVFWAGAMNLFEV HFVPEKPMYEQGLI LP LATLGWGVGP                      
Subjt:  AGCDQETTSFAWWAGNARLINLSDKLLGAHVAHAELIVFWAGAMNLFEVVHFVPEKPMYEQGLIFLPPLATLGWGVGP----------------------

Query:  ---------------------------------------------GIGAFLLVFKALYFG----------------------------------------
                                                     GIGAFLLVFKALYFG                                        
Subjt:  ---------------------------------------------GIGAFLLVFKALYFG----------------------------------------

Query:  -----------------RSICILGGIWHILTKSFAWARHALVCSGEAYLSYSLGALSVFGFIACCFVWFNNTAYPSDFYRPTRPEASQAQAFTFLVRDQC
                          SICILGGIWHILTK FAWAR ALV SGEAYLSYSLGALSVFGFIACCFVWFNNTAYPS+FY PT PEASQAQAFTFLVRDQ 
Subjt:  -----------------RSICILGGIWHILTKSFAWARHALVCSGEAYLSYSLGALSVFGFIACCFVWFNNTAYPSDFYRPTRPEASQAQAFTFLVRDQC

Query:  LVANVGSAQGPTGLGKYLMLRP--EKSFLGERLCVFGI-CVLLEPLRDPNGLDLSRLKKDIQPWQERHSMEYMTHAPFGSLDSVGGVTIEINAINYVSPR
        L AN+GSAQGPTGLGKYLM  P  E  F GE +  + +    LEPLR PNGLDLSRLKKDIQPWQER S EYMTHAP GSL+SVGGV  EINA+NYVSPR
Subjt:  LVANVGSAQGPTGLGKYLMLRP--EKSFLGERLCVFGI-CVLLEPLRDPNGLDLSRLKKDIQPWQERHSMEYMTHAPFGSLDSVGGVTIEINAINYVSPR

Query:  SWLATSHFVLGFFLFVGHLWHVGKGSCSSPGFEKGIDRDFE
        SWLATSHFVLGFFLFVGHLWH G+   ++ GFEKGIDRDFE
Subjt:  SWLATSHFVLGFFLFVGHLWHVGKGSCSSPGFEKGIDRDFE

Arabidopsis top hits (e value, % identity, alignment):
ATCG00270.1 photosystem II reaction center protein D (e value: 2.2e-24; identity: 89.66%)
Query:  MSALGVVGLALNLRTYDFISQEIRAGEDLEFETFYTKNLLLDEGIRAWMAAQDQPHEN
        MSALGVVGLALNLR YDF+SQEIRA ED EFETFYTKN+LL+EGIRAWMAAQDQPHEN
Subjt:  MSALGVVGLALNLRTYDFISQEIRAGEDLEFETFYTKNLLLDEGIRAWMAAQDQPHEN

ATCG00280.1 photosystem II reaction center protein C (e value: 7.7e-134; identity: 59.41%)
Query:  AGCDQETTSFAWWAGNARLINLSDKLLGAHVAHAELIVFWAGAMNLFEVVHFVPEKPMYEQGLIFLPPLATLGWGVGP----------------------
        AG DQETT FAWWAGNARLINLS KLLGAHVAHA LIVFWAGAMNLFEV HFVPEKPMYEQGLI LP LATLGWGVGP                      
Subjt:  AGCDQETTSFAWWAGNARLINLSDKLLGAHVAHAELIVFWAGAMNLFEVVHFVPEKPMYEQGLIFLPPLATLGWGVGP----------------------

Query:  ---------------------------------------------GIGAFLLVFKALYFG----------------------------------------
                                                     G+GAFLLVFKALYFG                                        
Subjt:  ---------------------------------------------GIGAFLLVFKALYFG----------------------------------------

Query:  -----------------RSICILGGIWHILTKSFAWARHALVCSGEAYLSYSLGALSVFGFIACCFVWFNNTAYPSDFYRPTRPEASQAQAFTFLVRDQC
                          SICI GGIWHILTK FAWAR ALV SGEAYLSYSL ALSV GFIACCFVWFNNTAYPS+FY PT PEASQAQAFTFLVRDQ 
Subjt:  -----------------RSICILGGIWHILTKSFAWARHALVCSGEAYLSYSLGALSVFGFIACCFVWFNNTAYPSDFYRPTRPEASQAQAFTFLVRDQC

Query:  LVANVGSAQGPTGLGKYLMLRP--EKSFLGERLCVFGI-CVLLEPLRDPNGLDLSRLKKDIQPWQERHSMEYMTHAPFGSLDSVGGVTIEINAINYVSPR
        L ANVGSAQGPTGLGKYLM  P  E  F GE +  + +    LEPLR PNGLDLSRLKKDIQPWQER S EYMTHAP GSL+SVGGV  EINA+NYVSPR
Subjt:  LVANVGSAQGPTGLGKYLMLRP--EKSFLGERLCVFGI-CVLLEPLRDPNGLDLSRLKKDIQPWQERHSMEYMTHAPFGSLDSVGGVTIEINAINYVSPR

Query:  SWLATSHFVLGFFLFVGHLWHVGKGSCSSPGFEKGIDRDFE
        SWL+TSHFVLGFFLFVGHLWH G+   ++ GFEKGIDRDFE
Subjt:  SWLATSHFVLGFFLFVGHLWHVGKGSCSSPGFEKGIDRDFE
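
The % identity values in the hit lists can be related to the alignment rows themselves. The sketch below takes the query row and the middle (match) row of the short ATCG00270.1 alignment above and counts identical columns. It assumes the usual BLAST-style convention that a letter in the middle row marks an identical residue while "+" and blanks do not. The function name percent_identity is hypothetical, and this is an illustration rather than the database's own calculation.

```python
# Query row and middle (match) row copied from the ATCG00270.1 alignment.
query = "MSALGVVGLALNLRTYDFISQEIRAGEDLEFETFYTKNLLLDEGIRAWMAAQDQPHEN"
midline = "MSALGVVGLALNLR YDF+SQEIRA ED EFETFYTKN+LL+EGIRAWMAAQDQPHEN"

def percent_identity(query_row, midline_row):
    """Percentage of alignment columns whose middle-row symbol is a letter.

    Assumption: letters mark identical residues; '+' (similar residue)
    and blanks (mismatch) do not count as identities.
    """
    if len(query_row) != len(midline_row):
        raise ValueError("alignment rows must have equal length")
    identical = sum(1 for symbol in midline_row if symbol.isalpha())
    return 100.0 * identical / len(query_row)

identity = percent_identity(query, midline)
print(f"{identity:.2f}")
```

For this alignment, 52 of the 58 columns are identical, which rounds to the 89.66 listed for ATCG00270.1.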


Sequences
CDS sequence
ATGAGTGCTCTTGGAGTAGTTGGTCTAGCCCTGAACCTACGTACCTATGACTTCATTTCTCAGGAAATTCGTGCAGGGGAAGATCTTGAATTTGAGACTTTCTATACCAA
AAATCTTCTCTTAGACGAAGGTATTCGTGCTTGGATGGCGGCTCAAGATCAGCCTCATGAAAACACACCTTATATTCCCTTAGGAGCTGGTTGTGACCAAGAAACCACCA
GTTTCGCTTGGTGGGCCGGGAATGCCCGACTTATCAATTTATCCGATAAACTACTAGGGGCTCATGTAGCCCATGCCGAATTAATCGTATTCTGGGCCGGAGCAATGAAC
CTATTCGAAGTGGTTCATTTCGTACCTGAGAAGCCCATGTATGAACAAGGATTGATTTTCCTTCCCCCCCTAGCTACTCTAGGTTGGGGCGTAGGTCCTGGTATAGGTGC
TTTTCTTCTAGTATTCAAAGCTCTTTATTTTGGGCGTTCCATTTGTATACTTGGTGGAATTTGGCATATCTTAACTAAATCGTTTGCATGGGCTCGCCACGCACTTGTAT
GTTCTGGAGAAGCTTACTTGTCTTATAGTTTAGGTGCTTTATCTGTTTTTGGTTTTATTGCTTGTTGTTTTGTCTGGTTCAATAATACTGCTTATCCGAGTGACTTTTAC
AGACCGACTAGACCCGAAGCTTCTCAAGCTCAAGCTTTTACATTTCTAGTTAGAGACCAATGTCTTGTAGCTAACGTTGGATCCGCTCAAGGACCTACTGGTTTAGGTAA
ATATCTAATGCTCCGACCAGAGAAGTCATTTTTGGGGGAGAGACTATGCGTTTTTGGGATCTGCGTTCTCCTTGAACCACTAAGGGATCCTAATGGTTTGGACTTAAGTA
GGCTGAAAAAAGATATACAACCTTGGCAAGAACGACATTCCATGGAATATATGACCCATGCTCCTTTCGGTTCTTTAGATTCCGTGGGTGGTGTAACTATTGAAATTAAT
GCAATCAATTATGTCTCTCCTAGAAGTTGGCTAGCTACCTCTCATTTTGTTCTAGGATTCTTCCTATTTGTAGGTCATTTATGGCATGTAGGAAAAGGCTCGTGCAGCAG
CCCAGGATTTGAAAAGGGAATTGATCGTGATTTTGAA
mRNA sequence
TAACTGGTTTATGGATGAGTGCTCTTGGAGTAGTTGGTCTAGCCCTGAACCTACGTACCTATGACTTCATTTCTCAGGAAATTCGTGCAGGGGAAGATCTTGAATTTGAG
ACTTTCTATACCAAAAATCTTCTCTTAGACGAAGGTATTCGTGCTTGGATGGCGGCTCAAGATCAGCCTCATGAAAACACACCTTATATTCCCTTAGGAGCTGGTTGTGA
CCAAGAAACCACCAGTTTCGCTTGGTGGGCCGGGAATGCCCGACTTATCAATTTATCCGATAAACTACTAGGGGCTCATGTAGCCCATGCCGAATTAATCGTATTCTGGG
CCGGAGCAATGAACCTATTCGAAGTGGTTCATTTCGTACCTGAGAAGCCCATGTATGAACAAGGATTGATTTTCCTTCCCCCCCTAGCTACTCTAGGTTGGGGCGTAGGT
CCTGGTATAGGTGCTTTTCTTCTAGTATTCAAAGCTCTTTATTTTGGGCGTTCCATTTGTATACTTGGTGGAATTTGGCATATCTTAACTAAATCGTTTGCATGGGCTCG
CCACGCACTTGTATGTTCTGGAGAAGCTTACTTGTCTTATAGTTTAGGTGCTTTATCTGTTTTTGGTTTTATTGCTTGTTGTTTTGTCTGGTTCAATAATACTGCTTATC
CGAGTGACTTTTACAGACCGACTAGACCCGAAGCTTCTCAAGCTCAAGCTTTTACATTTCTAGTTAGAGACCAATGTCTTGTAGCTAACGTTGGATCCGCTCAAGGACCT
ACTGGTTTAGGTAAATATCTAATGCTCCGACCAGAGAAGTCATTTTTGGGGGAGAGACTATGCGTTTTTGGGATCTGCGTTCTCCTTGAACCACTAAGGGATCCTAATGG
TTTGGACTTAAGTAGGCTGAAAAAAGATATACAACCTTGGCAAGAACGACATTCCATGGAATATATGACCCATGCTCCTTTCGGTTCTTTAGATTCCGTGGGTGGTGTAA
CTATTGAAATTAATGCAATCAATTATGTCTCTCCTAGAAGTTGGCTAGCTACCTCTCATTTTGTTCTAGGATTCTTCCTATTTGTAGGTCATTTATGGCATGTAGGAAAA
GGCTCGTGCAGCAGCCCAGGATTTGAAAAGGGAATTGATCGTGATTTTGAA
Protein sequence
MSALGVVGLALNLRTYDFISQEIRAGEDLEFETFYTKNLLLDEGIRAWMAAQDQPHENTPYIPLGAGCDQETTSFAWWAGNARLINLSDKLLGAHVAHAELIVFWAGAMN
LFEVVHFVPEKPMYEQGLIFLPPLATLGWGVGPGIGAFLLVFKALYFGRSICILGGIWHILTKSFAWARHALVCSGEAYLSYSLGALSVFGFIACCFVWFNNTAYPSDFY
RPTRPEASQAQAFTFLVRDQCLVANVGSAQGPTGLGKYLMLRPEKSFLGERLCVFGICVLLEPLRDPNGLDLSRLKKDIQPWQERHSMEYMTHAPFGSLDSVGGVTIEIN
AINYVSPRSWLATSHFVLGFFLFVGHLWHVGKGSCSSPGFEKGIDRDFE
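
The CDS and protein sequences above are related by codon translation. As a minimal sketch, the first eight codons of the CDS can be translated and compared with the start of the protein sequence. The sketch assumes the standard genetic code; the names CODON_TABLE and translate are hypothetical helpers, not part of CuGenDB.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence codon by codon ('*' marks a stop)."""
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

# First 24 nucleotides of the CDS sequence listed above.
cds_start = "ATGAGTGCTCTTGGAGTAGTTGGT"
protein_start = translate(cds_start)
print(protein_start)
```

The result, MSALGVVG, matches the first eight residues of the protein sequence. Passing the full CDS string to the same function would extend the check to the whole record.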