CuGenDBv2

CmoCh10G008560 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh10G008560
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: Protein of unknown function (DUF707)
Genome location: Cmo_Chr10:4050502..4055729
RNA-Seq Expression: CmoCh10G008560
Synteny: CmoCh10G008560
Gene Ontology terms: NA
InterPro domains: IPR007877 - Protein of unknown function DUF707
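
The genome location above follows a `seqid:start..end` pattern. As a minimal sketch, the following splits it into its parts and derives the span of the locus. The function names are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

def parse_location(loc):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))

def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end = parse_location("Cmo_Chr10:4050502..4055729")
print(seqid, start, end, span_length(start, end))
# prints: Cmo_Chr10 4050502 4055729 5228
```

Under that assumption the locus spans 5228 bp on Cmo_Chr10, which is longer than the mRNA listed further down.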


Homology
GenBank top hits (e value, % identity, alignment)
KAG7023759.1 hypothetical protein SDJN02_14785 [Cucurbita argyrosperma subsp. argyrosperma] (e value: 3.0e-153; % identity: 65.42)
Query:  MKFSGCLPLLAEQKSRHSCFCTLLPVTTLLCLVLFVGSAYIAPDYRQELHVSLLASSYLWCTILWNVGSSYDRFDRKEGIKISKWGIDGFVRSKFSKCEN
        MKFSGCLPLLAEQKSRHSCFCTLLPVTTLLCLVLFVGSAYIAPDYRQ                                 KISKWGIDGFVRSKFSKCEN
Subjt:  MKFSGCLPLLAEQKSRHSCFCTLLPVTTLLCLVLFVGSAYIAPDYRQELHVSLLASSYLWCTILWNVGSSYDRFDRKEGIKISKWGIDGFVRSKFSKCEN

Query:  QCRPNGSDPLPKDIVVTASNLEMRPLWGASKRSYQKPVNVSSNLFAVAVGIKQKDLVNTMVTKFLASDFAVMLFHYDGIVDKWKEFEWSNRVVHVLITCV
        QCRPNGSDPLPKDIVVTASNLEMRPLWGASKRSYQKPVNVSSNLFAVAVGIKQKDLVNTMVTKFLASDFAVMLFHYDGIVDKWKEFEWSNRVVHV     
Subjt:  QCRPNGSDPLPKDIVVTASNLEMRPLWGASKRSYQKPVNVSSNLFAVAVGIKQKDLVNTMVTKFLASDFAVMLFHYDGIVDKWKEFEWSNRVVHVLITCV

Query:  LFILFMK-------VAEFHSVSL------VLDLVMSRYVHIIESEGLEISQPALDPLKSEVHHQITARGRRSKVHRSSPRSLISYELICCINFQSLIIDI
            F K       VA ++ + L      V +   ++YVHIIESEGLEISQPALDPLKSEVHHQITARGRRSKVHR                        
Subjt:  LFILFMK-------VAEFHSVSL------VLDLVMSRYVHIIESEGLEISQPALDPLKSEVHHQITARGRRSKVHRSSPRSLISYELICCINFQSLIIDI

Query:  LIHCTYAFISRLLLCGSIFGLSGELTNQVMVEKVVISTMDRNDGPGFFPSVMALCLVYDPGGCYPFLGLSYIIQLNESQKNDLIHAWGLDMQLGYCAQAR
            TY   +    C  I   +   T  + +   V S                            +  + Y+IQ      NDLIHAWGLDMQLGYCAQ  
Subjt:  LIHCTYAFISRLLLCGSIFGLSGELTNQVMVEKVVISTMDRNDGPGFFPSVMALCLVYDPGGCYPFLGLSYIIQLNESQKNDLIHAWGLDMQLGYCAQAR

Query:  GDRTKNVGVVDSEYVIHHGRPTLGGPEENETSSKSPVKDHRADVRRQSYIELAVFRKRWLKAAEQDECWQDPYPETMYSNTTL
        GDRTKNVGVVDSEYVIHHGRPTLGGPEENETSSKSPVKDHRADVRRQSYIELAVFRKRW KAAEQDECWQDPYPETM SNTTL
Subjt:  GDRTKNVGVVDSEYVIHHGRPTLGGPEENETSSKSPVKDHRADVRRQSYIELAVFRKRWLKAAEQDECWQDPYPETMYSNTTL

XP_022960783.1 uncharacterized protein LOC111461481 [Cucurbita moschata] (e value: 4.1e-155; % identity: 65.84)
Query:  MKFSGCLPLLAEQKSRHSCFCTLLPVTTLLCLVLFVGSAYIAPDYRQELHVSLLASSYLWCTILWNVGSSYDRFDRKEGIKISKWGIDGFVRSKFSKCEN
        MKFSGCLPLLAEQKSRHSCFCTLLPVTTLLCLVLFVGSAYIAPDYRQ                                 KISKWGIDGFVRSKFSKCEN
Subjt:  MKFSGCLPLLAEQKSRHSCFCTLLPVTTLLCLVLFVGSAYIAPDYRQELHVSLLASSYLWCTILWNVGSSYDRFDRKEGIKISKWGIDGFVRSKFSKCEN

Query:  QCRPNGSDPLPKDIVVTASNLEMRPLWGASKRSYQKPVNVSSNLFAVAVGIKQKDLVNTMVTKFLASDFAVMLFHYDGIVDKWKEFEWSNRVVHVLITCV
        QCRPNGSDPLPKDIVVTASNLEMRPLWGASKRSYQKPVNVSSNLFAVAVGIKQKDLVNTMVTKFLASDFAVMLFHYDGIVDKWKEFEWSNRVVHV     
Subjt:  QCRPNGSDPLPKDIVVTASNLEMRPLWGASKRSYQKPVNVSSNLFAVAVGIKQKDLVNTMVTKFLASDFAVMLFHYDGIVDKWKEFEWSNRVVHVLITCV

Query:  LFILFMK-------VAEFHSVSL------VLDLVMSRYVHIIESEGLEISQPALDPLKSEVHHQITARGRRSKVHRSSPRSLISYELICCINFQSLIIDI
            F K       VA ++ + L      V +   ++YVHIIESEGLEISQPALDPLKSEVHHQITARGRRSKVHR                        
Subjt:  LFILFMK-------VAEFHSVSL------VLDLVMSRYVHIIESEGLEISQPALDPLKSEVHHQITARGRRSKVHRSSPRSLISYELICCINFQSLIIDI

Query:  LIHCTYAFISRLLLCGSIFGLSGELTNQVMVEKVVISTMDRNDGPGFFPSVMALCLVYDPGGCYPFLGLSYIIQLNESQKNDLIHAWGLDMQLGYCAQAR
            TY   +    C  I   +   T  + +   V S                            +  + Y+IQ      NDLIHAWGLDMQLGYCAQ  
Subjt:  LIHCTYAFISRLLLCGSIFGLSGELTNQVMVEKVVISTMDRNDGPGFFPSVMALCLVYDPGGCYPFLGLSYIIQLNESQKNDLIHAWGLDMQLGYCAQAR

Query:  GDRTKNVGVVDSEYVIHHGRPTLGGPEENETSSKSPVKDHRADVRRQSYIELAVFRKRWLKAAEQDECWQDPYPETMYSNTTL
        GDRTKNVGVVDSEYVIHHGRPTLGGPEENETSSKSPVKDHRADVRRQSYIELAVFRKRWLKAAEQDECWQDPYPETMYSNTTL
Subjt:  GDRTKNVGVVDSEYVIHHGRPTLGGPEENETSSKSPVKDHRADVRRQSYIELAVFRKRWLKAAEQDECWQDPYPETMYSNTTL

XP_022988269.1 uncharacterized protein LOC111485571 [Cucurbita maxima] (e value: 2.6e-149; % identity: 63.77)
Query:  MKFSGCLPLLAEQKSRHSCFCTLLPVTTLLCLVLFVGSAYIAPDYRQELHVSLLASSYLWCTILWNVGSSYDRFDRKEGIKISKWGIDGFVRSKFSKCEN
        MKFSGCLPLLAEQKSRHSCFCTLLPVTTLLCLVLFVGSAYIAPDYR+                                 KIS WGIDGFVRSKF+KCEN
Subjt:  MKFSGCLPLLAEQKSRHSCFCTLLPVTTLLCLVLFVGSAYIAPDYRQELHVSLLASSYLWCTILWNVGSSYDRFDRKEGIKISKWGIDGFVRSKFSKCEN

Query:  QCRPNGSDPLPKDIVVTASNLEMRPLWGASKRSYQKPVNVSSNLFAVAVGIKQKDLVNTMVTKFLASDFAVMLFHYDGIVDKWKEFEWSNRVVHVLITCV
        QCRPNGSDPLPKDIVVT SNLEMRPLWGASKRSYQKPVNVSSNLFAVAVGIKQKDLVNTMV KFLASDFAVMLFHYDGIVDKWKEFEWSNRVVHV     
Subjt:  QCRPNGSDPLPKDIVVTASNLEMRPLWGASKRSYQKPVNVSSNLFAVAVGIKQKDLVNTMVTKFLASDFAVMLFHYDGIVDKWKEFEWSNRVVHVLITCV

Query:  LFILFMK-------VAEFHSVSL------VLDLVMSRYVHIIESEGLEISQPALDPLKSEVHHQITARGRRSKVHRSSPRSLISYELICCINFQSLIIDI
            F K       VA ++ + L      V +    +YVHIIESEGLEISQPALDPLKSEVHHQITARGRRSKVHR                        
Subjt:  LFILFMK-------VAEFHSVSL------VLDLVMSRYVHIIESEGLEISQPALDPLKSEVHHQITARGRRSKVHRSSPRSLISYELICCINFQSLIIDI

Query:  LIHCTYAFISRLLLCGSIFGLSGELTNQVMVEKVVISTMDRNDGPGFFPSVMALCLVYDPGGCYPFLGLSYIIQLNESQKNDLIHAWGLDMQLGYCAQAR
            TY   +    C  I   +   T  + +   V S                            +  + Y+IQ      NDLIHAWGLDMQLGYCAQ  
Subjt:  LIHCTYAFISRLLLCGSIFGLSGELTNQVMVEKVVISTMDRNDGPGFFPSVMALCLVYDPGGCYPFLGLSYIIQLNESQKNDLIHAWGLDMQLGYCAQAR

Query:  GDRTKNVGVVDSEYVIHHGRPTLGGPEENETSSKSPVKDHRADVRRQSYIELAVFRKRWLKAAEQDECWQDPYPETMYSNTTL
        GDRTKNVGVVDSEYVIHHGRPTLGGPEENETSSKSPVKDHRADVRRQSYIELAVFRKRW +AAEQDECW DPYPETM  NTTL
Subjt:  GDRTKNVGVVDSEYVIHHGRPTLGGPEENETSSKSPVKDHRADVRRQSYIELAVFRKRWLKAAEQDECWQDPYPETMYSNTTL

XP_023516014.1 uncharacterized protein LOC111780008 [Cucurbita pepo subsp. pepo] (e value: 9.5e-152; % identity: 64.8)
Query:  MKFSGCLPLLAEQKSRHSCFCTLLPVTTLLCLVLFVGSAYIAPDYRQELHVSLLASSYLWCTILWNVGSSYDRFDRKEGIKISKWGIDGFVRSKFSKCEN
        MKFSGCLPLLAEQKSRHSCFCTLLPVTTLLCLVLFVGSAYIAPDYRQ                                 KISKWGIDGFVRSKFSKCEN
Subjt:  MKFSGCLPLLAEQKSRHSCFCTLLPVTTLLCLVLFVGSAYIAPDYRQELHVSLLASSYLWCTILWNVGSSYDRFDRKEGIKISKWGIDGFVRSKFSKCEN

Query:  QCRPNGSDPLPKDIVVTASNLEMRPLWGASKRSYQKPVNVSSNLFAVAVGIKQKDLVNTMVTKFLASDFAVMLFHYDGIVDKWKEFEWSNRVVHVLITCV
        QCRPNGSDPLPKDIVVTASNLEMRPLWGASK SYQKPVNVSSNLFAVAVGIKQKDLVNTMV KFLASDFAVMLFHYDGIVDKWKEFEWSNRVVHV     
Subjt:  QCRPNGSDPLPKDIVVTASNLEMRPLWGASKRSYQKPVNVSSNLFAVAVGIKQKDLVNTMVTKFLASDFAVMLFHYDGIVDKWKEFEWSNRVVHVLITCV

Query:  LFILFMK-------VAEFHSVSL------VLDLVMSRYVHIIESEGLEISQPALDPLKSEVHHQITARGRRSKVHRSSPRSLISYELICCINFQSLIIDI
            F K       VA ++ + L      V +   ++YVHIIESEGLEISQPALDPLKSEVHHQITARGRRSKVHR                        
Subjt:  LFILFMK-------VAEFHSVSL------VLDLVMSRYVHIIESEGLEISQPALDPLKSEVHHQITARGRRSKVHRSSPRSLISYELICCINFQSLIIDI

Query:  LIHCTYAFISRLLLCGSIFGLSGELTNQVMVEKVVISTMDRNDGPGFFPSVMALCLVYDPGGCYPFLGLSYIIQLNESQKNDLIHAWGLDMQLGYCAQAR
            TY   +    C  I   +   T  + +   V S                            +  + Y+IQ      NDLIHAWGLDMQLGYCAQ  
Subjt:  LIHCTYAFISRLLLCGSIFGLSGELTNQVMVEKVVISTMDRNDGPGFFPSVMALCLVYDPGGCYPFLGLSYIIQLNESQKNDLIHAWGLDMQLGYCAQAR

Query:  GDRTKNVGVVDSEYVIHHGRPTLGGPEENETSSKSPVKDHRADVRRQSYIELAVFRKRWLKAAEQDECWQDPYPETMYSNTTL
        GDRTKNVGVVDSEYVIHHGRPTLGGPEENETSSKSPVKDHRADVRRQSYIELAVFRKRW KAAEQDECW+DPYPETM SNTTL
Subjt:  GDRTKNVGVVDSEYVIHHGRPTLGGPEENETSSKSPVKDHRADVRRQSYIELAVFRKRWLKAAEQDECWQDPYPETMYSNTTL

XP_038880303.1 uncharacterized protein LOC120071938 isoform X1 [Benincasa hispida] (e value: 4.3e-136; % identity: 58.09)
Query:  MKFSGCLPLLAEQKSRHSCFCTLLPVTTLLCLVLFVGSAYIAPDYRQELHVSLLASSYLWCTILWNVGSSYDRFDRKEGIKISKWGIDGFVRSKFSKCEN
        MKFSGCLPLL+EQKSR+SC C+LLP  +L+CLVLFVGS Y+APDYR+                                 KIS+WGIDG V SKF+KCEN
Subjt:  MKFSGCLPLLAEQKSRHSCFCTLLPVTTLLCLVLFVGSAYIAPDYRQELHVSLLASSYLWCTILWNVGSSYDRFDRKEGIKISKWGIDGFVRSKFSKCEN

Query:  QCRPNGSDPLPKDIVVTASNLEMRPLWGASKRSYQKPVNVSSNLFAVAVGIKQKDLVNTMVTKFLASDFAVMLFHYDGIVDKWKEFEWSNRVVHVLITCV
        QCRPNGS+PLPKDIVVTASNLEMRPLWGASKRSYQ PVN S NLFA+AVGIKQKDLVN MVTKFL+SDFAVMLFHYDGIVD+W+EF+WSNRVVHV     
Subjt:  QCRPNGSDPLPKDIVVTASNLEMRPLWGASKRSYQKPVNVSSNLFAVAVGIKQKDLVNTMVTKFLASDFAVMLFHYDGIVDKWKEFEWSNRVVHVLITCV

Query:  LFILFMK-------VAEFHSVSL------VLDLVMSRYVHIIESEGLEISQPALDPLKSEVHHQITARGRRSKVHRSSPRSLISYELICCINFQSLIIDI
            F K       VAE++ + L      V +    +YVHII+SEGLEISQPALDP KSEVHHQITARGRRS VHR + R                    
Subjt:  LFILFMK-------VAEFHSVSL------VLDLVMSRYVHIIESEGLEISQPALDPLKSEVHHQITARGRRSKVHRSSPRSLISYELICCINFQSLIIDI

Query:  LIHCTYAFISRLLLCGSIFGLSGELTNQVMVEKVVISTMDRNDGPGFFPSVMALCLVYDPGGCYPFLGLSYIIQLNESQKNDLIHAWGLDMQLGYCAQAR
                        S  G   +               D +  P     +  +  V+       +  + Y+IQ      NDLIHAWGLDMQLGYCAQ  
Subjt:  LIHCTYAFISRLLLCGSIFGLSGELTNQVMVEKVVISTMDRNDGPGFFPSVMALCLVYDPGGCYPFLGLSYIIQLNESQKNDLIHAWGLDMQLGYCAQAR

Query:  GDRTKNVGVVDSEYVIHHGRPTLGGPEENETSSKSPVKDHRADVRRQSYIELAVFRKRWLKAAEQDECWQDPYPETMYSNTT
        GDRTKNVGVVDSEYVIH+GRPTLGGPEENETSSKS VKDHRADVRRQSYIEL VFRKRW KAAEQDECW DPYPET+  NT+
Subjt:  GDRTKNVGVVDSEYVIHHGRPTLGGPEENETSSKSPVKDHRADVRRQSYIELAVFRKRWLKAAEQDECWQDPYPETMYSNTT

TrEMBL top hits (e value, % identity, alignment)
A0A0A0M0M3 Uncharacterized protein (e value: 1.8e-132; % identity: 58.09)
Query:  MKFSGCLPLLAEQKSRHSCFCTLLPVTTLLCLVLFVGSAYIAPDYRQELHVSLLASSYLWCTILWNVGSSYDRFDRKEGIKISKWGIDGFVRSKFSKCEN
        MKFSGCLPLLAEQKSR+SC C+ LP  +LLCL LFVGS Y+APDYR+                                 KIS+WGIDG V SKF+KCE 
Subjt:  MKFSGCLPLLAEQKSRHSCFCTLLPVTTLLCLVLFVGSAYIAPDYRQELHVSLLASSYLWCTILWNVGSSYDRFDRKEGIKISKWGIDGFVRSKFSKCEN

Query:  QCRPNGSDPLPKDIVVTASNLEMRPLWGASKRSYQKPVNVSSNLFAVAVGIKQKDLVNTMVTKFLASDFAVMLFHYDGIVDKWKEFEWSNRVVHVLITCV
        QCRPNGS+PLPKDIVVTASNLEMRPLWGASKRSYQ PVN SSN+FA+AVGIKQKDLVN MVTKFL+SDFAVMLFHYDGIVD+WK F WSNRV+HV     
Subjt:  QCRPNGSDPLPKDIVVTASNLEMRPLWGASKRSYQKPVNVSSNLFAVAVGIKQKDLVNTMVTKFLASDFAVMLFHYDGIVDKWKEFEWSNRVVHVLITCV

Query:  LFILFMK-------VAEFHSVSL------VLDLVMSRYVHIIESEGLEISQPALDPLKSEVHHQITARGRRSKVHRSSPRSLISYELICCINFQSLIIDI
            F K       V E++ V L      V +     YV II+SEGLEISQPALDP KSEVHHQITARGRRS VHR + R     +  C +N        
Subjt:  LFILFMK-------VAEFHSVSL------VLDLVMSRYVHIIESEGLEISQPALDPLKSEVHHQITARGRRSKVHRSSPRSLISYELICCINFQSLIIDI

Query:  LIHCTYAFISRLLLCGSIFGLSGELTNQVMVEKVVISTMDRNDGPGFFPSVMALCLVYDPGGCYPFLGLSYIIQLNESQKNDLIHAWGLDMQLGYCAQAR
                                            ST     G   +  +MA         C     + Y+IQ      NDLIHAWGLDMQLGYCAQ  
Subjt:  LIHCTYAFISRLLLCGSIFGLSGELTNQVMVEKVVISTMDRNDGPGFFPSVMALCLVYDPGGCYPFLGLSYIIQLNESQKNDLIHAWGLDMQLGYCAQAR

Query:  GDRTKNVGVVDSEYVIHHGRPTLGGPEENETSSKSPVKDHRADVRRQSYIELAVFRKRWLKAAEQDECWQDPYPETMYSNTT
        GDRTKNVGVVDSEYVIH+GRPTLGGPEENETSSKS VKDHRADVRRQSYIEL VFRKRW KAAEQDECWQDPYPET+   T+
Subjt:  GDRTKNVGVVDSEYVIHHGRPTLGGPEENETSSKSPVKDHRADVRRQSYIELAVFRKRWLKAAEQDECWQDPYPETMYSNTT

A0A1S3B872 uncharacterized protein LOC103487236 isoform X1 (e value: 7.4e-134; % identity: 58.3)
Query:  MKFSGCLPLLAEQKSRHSCFCTLLPVTTLLCLVLFVGSAYIAPDYRQELHVSLLASSYLWCTILWNVGSSYDRFDRKEGIKISKWGIDGFVRSKFSKCEN
        MK SGCLPLLAEQKSRHSC C+LLP  +LLCL LFVGS Y+AP+YR+                                 KIS+WGIDG V SKF+KCE 
Subjt:  MKFSGCLPLLAEQKSRHSCFCTLLPVTTLLCLVLFVGSAYIAPDYRQELHVSLLASSYLWCTILWNVGSSYDRFDRKEGIKISKWGIDGFVRSKFSKCEN

Query:  QCRPNGSDPLPKDIVVTASNLEMRPLWGASKRSYQKPVNVSSNLFAVAVGIKQKDLVNTMVTKFLASDFAVMLFHYDGIVDKWKEFEWSNRVVHVLITCV
        QCRPNGS+PLPKDIVVTASNLEMRPLWGASKRSYQ PVN SSN+FA+AVGIKQKDLVN MVTKFL+SDFAVMLFHYDGIVD+WK+F+WSNRV+HV     
Subjt:  QCRPNGSDPLPKDIVVTASNLEMRPLWGASKRSYQKPVNVSSNLFAVAVGIKQKDLVNTMVTKFLASDFAVMLFHYDGIVDKWKEFEWSNRVVHVLITCV

Query:  LFILFMK-------VAEFHSVSL------VLDLVMSRYVHIIESEGLEISQPALDPLKSEVHHQITARGRRSKVHRSSPRSLISYELICCINFQSLIIDI
            F K       V E+  V L      V +     YVHIIESEGLEISQPALDP KSEVHHQITARGRRS VHR + R     +  C +N        
Subjt:  LFILFMK-------VAEFHSVSL------VLDLVMSRYVHIIESEGLEISQPALDPLKSEVHHQITARGRRSKVHRSSPRSLISYELICCINFQSLIIDI

Query:  LIHCTYAFISRLLLCGSIFGLSGELTNQVMVEKVVISTMDRNDGPGFFPSVMALCLVYDPGGCYPFLGLSYIIQLNESQKNDLIHAWGLDMQLGYCAQAR
                                            ST     G   +  +MA         C     + Y+IQ      NDLIHAWGLDMQLGYCAQ  
Subjt:  LIHCTYAFISRLLLCGSIFGLSGELTNQVMVEKVVISTMDRNDGPGFFPSVMALCLVYDPGGCYPFLGLSYIIQLNESQKNDLIHAWGLDMQLGYCAQAR

Query:  GDRTKNVGVVDSEYVIHHGRPTLGGPEENETSSKSPVKDHRADVRRQSYIELAVFRKRWLKAAEQDECWQDPYPETMYSNTT
        GDRTKNVGVVDSEYVIH+GRPTLGGPEENETSS S VKDHRADVRRQSYIEL VFRKRW KAAEQDECWQDPYPET+   T+
Subjt:  GDRTKNVGVVDSEYVIHHGRPTLGGPEENETSSKSPVKDHRADVRRQSYIELAVFRKRWLKAAEQDECWQDPYPETMYSNTT

A0A6J1H9Z6 uncharacterized protein LOC111461481 (e value: 2.0e-155; % identity: 65.84)
Query:  MKFSGCLPLLAEQKSRHSCFCTLLPVTTLLCLVLFVGSAYIAPDYRQELHVSLLASSYLWCTILWNVGSSYDRFDRKEGIKISKWGIDGFVRSKFSKCEN
        MKFSGCLPLLAEQKSRHSCFCTLLPVTTLLCLVLFVGSAYIAPDYRQ                                 KISKWGIDGFVRSKFSKCEN
Subjt:  MKFSGCLPLLAEQKSRHSCFCTLLPVTTLLCLVLFVGSAYIAPDYRQELHVSLLASSYLWCTILWNVGSSYDRFDRKEGIKISKWGIDGFVRSKFSKCEN

Query:  QCRPNGSDPLPKDIVVTASNLEMRPLWGASKRSYQKPVNVSSNLFAVAVGIKQKDLVNTMVTKFLASDFAVMLFHYDGIVDKWKEFEWSNRVVHVLITCV
        QCRPNGSDPLPKDIVVTASNLEMRPLWGASKRSYQKPVNVSSNLFAVAVGIKQKDLVNTMVTKFLASDFAVMLFHYDGIVDKWKEFEWSNRVVHV     
Subjt:  QCRPNGSDPLPKDIVVTASNLEMRPLWGASKRSYQKPVNVSSNLFAVAVGIKQKDLVNTMVTKFLASDFAVMLFHYDGIVDKWKEFEWSNRVVHVLITCV

Query:  LFILFMK-------VAEFHSVSL------VLDLVMSRYVHIIESEGLEISQPALDPLKSEVHHQITARGRRSKVHRSSPRSLISYELICCINFQSLIIDI
            F K       VA ++ + L      V +   ++YVHIIESEGLEISQPALDPLKSEVHHQITARGRRSKVHR                        
Subjt:  LFILFMK-------VAEFHSVSL------VLDLVMSRYVHIIESEGLEISQPALDPLKSEVHHQITARGRRSKVHRSSPRSLISYELICCINFQSLIIDI

Query:  LIHCTYAFISRLLLCGSIFGLSGELTNQVMVEKVVISTMDRNDGPGFFPSVMALCLVYDPGGCYPFLGLSYIIQLNESQKNDLIHAWGLDMQLGYCAQAR
            TY   +    C  I   +   T  + +   V S                            +  + Y+IQ      NDLIHAWGLDMQLGYCAQ  
Subjt:  LIHCTYAFISRLLLCGSIFGLSGELTNQVMVEKVVISTMDRNDGPGFFPSVMALCLVYDPGGCYPFLGLSYIIQLNESQKNDLIHAWGLDMQLGYCAQAR

Query:  GDRTKNVGVVDSEYVIHHGRPTLGGPEENETSSKSPVKDHRADVRRQSYIELAVFRKRWLKAAEQDECWQDPYPETMYSNTTL
        GDRTKNVGVVDSEYVIHHGRPTLGGPEENETSSKSPVKDHRADVRRQSYIELAVFRKRWLKAAEQDECWQDPYPETMYSNTTL
Subjt:  GDRTKNVGVVDSEYVIHHGRPTLGGPEENETSSKSPVKDHRADVRRQSYIELAVFRKRWLKAAEQDECWQDPYPETMYSNTTL

A0A6J1JL34 uncharacterized protein LOC111485571 (e value: 1.3e-149; % identity: 63.77)
Query:  MKFSGCLPLLAEQKSRHSCFCTLLPVTTLLCLVLFVGSAYIAPDYRQELHVSLLASSYLWCTILWNVGSSYDRFDRKEGIKISKWGIDGFVRSKFSKCEN
        MKFSGCLPLLAEQKSRHSCFCTLLPVTTLLCLVLFVGSAYIAPDYR+                                 KIS WGIDGFVRSKF+KCEN
Subjt:  MKFSGCLPLLAEQKSRHSCFCTLLPVTTLLCLVLFVGSAYIAPDYRQELHVSLLASSYLWCTILWNVGSSYDRFDRKEGIKISKWGIDGFVRSKFSKCEN

Query:  QCRPNGSDPLPKDIVVTASNLEMRPLWGASKRSYQKPVNVSSNLFAVAVGIKQKDLVNTMVTKFLASDFAVMLFHYDGIVDKWKEFEWSNRVVHVLITCV
        QCRPNGSDPLPKDIVVT SNLEMRPLWGASKRSYQKPVNVSSNLFAVAVGIKQKDLVNTMV KFLASDFAVMLFHYDGIVDKWKEFEWSNRVVHV     
Subjt:  QCRPNGSDPLPKDIVVTASNLEMRPLWGASKRSYQKPVNVSSNLFAVAVGIKQKDLVNTMVTKFLASDFAVMLFHYDGIVDKWKEFEWSNRVVHVLITCV

Query:  LFILFMK-------VAEFHSVSL------VLDLVMSRYVHIIESEGLEISQPALDPLKSEVHHQITARGRRSKVHRSSPRSLISYELICCINFQSLIIDI
            F K       VA ++ + L      V +    +YVHIIESEGLEISQPALDPLKSEVHHQITARGRRSKVHR                        
Subjt:  LFILFMK-------VAEFHSVSL------VLDLVMSRYVHIIESEGLEISQPALDPLKSEVHHQITARGRRSKVHRSSPRSLISYELICCINFQSLIIDI

Query:  LIHCTYAFISRLLLCGSIFGLSGELTNQVMVEKVVISTMDRNDGPGFFPSVMALCLVYDPGGCYPFLGLSYIIQLNESQKNDLIHAWGLDMQLGYCAQAR
            TY   +    C  I   +   T  + +   V S                            +  + Y+IQ      NDLIHAWGLDMQLGYCAQ  
Subjt:  LIHCTYAFISRLLLCGSIFGLSGELTNQVMVEKVVISTMDRNDGPGFFPSVMALCLVYDPGGCYPFLGLSYIIQLNESQKNDLIHAWGLDMQLGYCAQAR

Query:  GDRTKNVGVVDSEYVIHHGRPTLGGPEENETSSKSPVKDHRADVRRQSYIELAVFRKRWLKAAEQDECWQDPYPETMYSNTTL
        GDRTKNVGVVDSEYVIHHGRPTLGGPEENETSSKSPVKDHRADVRRQSYIELAVFRKRW +AAEQDECW DPYPETM  NTTL
Subjt:  GDRTKNVGVVDSEYVIHHGRPTLGGPEENETSSKSPVKDHRADVRRQSYIELAVFRKRWLKAAEQDECWQDPYPETMYSNTTL

A0A6J1KNU9 uncharacterized protein LOC111495920 isoform X2 (e value: 9.0e-132; % identity: 57.17)
Query:  MKFSGCLPLLAEQKSRHSCFCTLLPVTTLLCLVLFVGSAYIAPDYRQELHVSLLASSYLWCTILWNVGSSYDRFDRKEGIKISKWGIDGFVRSKFSKCEN
        MKFSGCLPLLAEQKSR+S  C + P+ +LLCLVLFVGSAY+APDYR+                                 +I +WGIDG V SKF+KCEN
Subjt:  MKFSGCLPLLAEQKSRHSCFCTLLPVTTLLCLVLFVGSAYIAPDYRQELHVSLLASSYLWCTILWNVGSSYDRFDRKEGIKISKWGIDGFVRSKFSKCEN

Query:  QCRPNGSDPLPKDIVVTASNLEMRPLWGASKRSYQKPVNVSSNLFAVAVGIKQKDLVNTMVTKFLASDFAVMLFHYDGIVDKWKEFEWSNRVVHVLITCV
        QCRPNGS+PLPKDIVVTASNLEMRPLWGASK SYQ PVN SSNLFA AVGIKQKDLVN MVTKFL+SDFAVMLFHYDGIVD+WK+F+WSNRV+HV     
Subjt:  QCRPNGSDPLPKDIVVTASNLEMRPLWGASKRSYQKPVNVSSNLFAVAVGIKQKDLVNTMVTKFLASDFAVMLFHYDGIVDKWKEFEWSNRVVHVLITCV

Query:  LFILFMK-------VAEFHSVSL------VLDLVMSRYVHIIESEGLEISQPALDPLKSEVHHQITARGRRSKVHRSSPRSLISYELICCINFQSLIIDI
            F K       VAE++ + L      V      +YVHIIESEGLEISQPALDP +SEVHHQITARGRRS VHR + +     +  C +N        
Subjt:  LFILFMK-------VAEFHSVSL------VLDLVMSRYVHIIESEGLEISQPALDPLKSEVHHQITARGRRSKVHRSSPRSLISYELICCINFQSLIIDI

Query:  LIHCTYAFISRLLLCGSIFGLSGELTNQVMVEKVVISTMDRNDGPGFFPSVMALCLVYDPGGCYPFLGLSYIIQLNESQKNDLIHAWGLDMQLGYCAQAR
                 S+   C          T  + +   V S                            +  + Y+IQ      NDLIH WGLDMQLGYCAQ  
Subjt:  LIHCTYAFISRLLLCGSIFGLSGELTNQVMVEKVVISTMDRNDGPGFFPSVMALCLVYDPGGCYPFLGLSYIIQLNESQKNDLIHAWGLDMQLGYCAQAR

Query:  GDRTKNVGVVDSEYVIHHGRPTLGGPEENETSSKSPVKDHRADVRRQSYIELAVFRKRWLKAAEQDECWQDPYPETMYSNT
        GDRTKNVGVVD+EY+IH+GRPTLGGPEENETSSKS VKDHRADVRRQSYIEL VFRKRW KAA+QDECWQDPYPET+  NT
Subjt:  GDRTKNVGVVDSEYVIHHGRPTLGGPEENETSSKSPVKDHRADVRRQSYIELAVFRKRWLKAAEQDECWQDPYPETMYSNT

SwissProt top hits (e value, % identity, alignment)
A1EA19 Photosystem I assembly protein Ycf4 (e value: 6.4e-10; % identity: 70.45)
Query:  LLASSYLWCTILWNVGSSYDRFDRKEGIK-ISKWGIDGFVRSKF
        L  SSYLWCTILWNVGS YDRFDRKEGI  I +WG  G  R  F
Subjt:  LLASSYLWCTILWNVGSSYDRFDRKEGIK-ISKWGIDGFVRSKF

A8Y9I0 Photosystem I assembly protein Ycf4 (e value: 6.4e-10; % identity: 70.45)
Query:  LLASSYLWCTILWNVGSSYDRFDRKEGIK-ISKWGIDGFVRSKF
        L  SSYLWCTILWNVGS YDRFDRKEGI  I +WG  G  R  F
Subjt:  LLASSYLWCTILWNVGSSYDRFDRKEGIK-ISKWGIDGFVRSKF

P20454 Photosystem I assembly protein Ycf4 (e value: 6.4e-10; % identity: 70.45)
Query:  LLASSYLWCTILWNVGSSYDRFDRKEGIK-ISKWGIDGFVRSKF
        L  SSYLWCTILWNVGS YDRFDRKEGI  I +WG  G  R  F
Subjt:  LLASSYLWCTILWNVGSSYDRFDRKEGIK-ISKWGIDGFVRSKF

P62719 Photosystem I assembly protein Ycf4 (e value: 6.4e-10; % identity: 70.45)
Query:  LLASSYLWCTILWNVGSSYDRFDRKEGIK-ISKWGIDGFVRSKF
        L  SSYLWCTILWNVGS YDRFDRKEGI  I +WG  G  R  F
Subjt:  LLASSYLWCTILWNVGSSYDRFDRKEGIK-ISKWGIDGFVRSKF

Q6L602 Photosystem I assembly protein Ycf4 (e value: 6.4e-10; % identity: 70.45)
Query:  LLASSYLWCTILWNVGSSYDRFDRKEGIK-ISKWGIDGFVRSKF
        L  SSYLWCTILWNVGS YDRFDRKEGI  I +WG  G  R  F
Subjt:  LLASSYLWCTILWNVGSSYDRFDRKEGIK-ISKWGIDGFVRSKF
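
The % identity reported for the SwissProt hits can be reproduced from the alignment itself. The sketch below assumes the usual BLAST pairwise convention: a letter in the middle line marks an identical residue, `+` marks a conservative substitution, a space marks a mismatch or gap, and identity is the number of identical columns divided by the alignment length (gap columns included). The function name is hypothetical. The midline is used rather than the Subjt line because, on this page, the Subjt lines repeat the Query text.

```python
def percent_identity(query_line, midline):
    """Percent identity from a BLAST-style query line and its midline.

    ASSUMPTION: letters in the midline are identities; '+' and spaces are not.
    The alignment length is taken from the query line (gaps included), so a
    midline whose trailing spaces were lost still gives the right answer.
    """
    aln_len = len(query_line)
    identities = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identities / aln_len

# Alignment columns copied from the Ycf4 hits (without the 8-character label).
query = "LLASSYLWCTILWNVGSSYDRFDRKEGIK-ISKWGIDGFVRSKF"
midline = "L  SSYLWCTILWNVGS YDRFDRKEGI  I +WG  G  R  F"

print(round(percent_identity(query, midline), 2))
# prints: 70.45
```

Here 31 of the 44 alignment columns are identical, which matches the 70.45 listed for each Ycf4 hit.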

Arabidopsis top hits (e value, % identity, alignment)
AT1G11170.1 Protein of unknown function (DUF707) (e value: 7.2e-49; % identity: 34.41)
Query:  LPKDIVVTASNLEMRPLWGASKRSYQKPVNVSSNLFAVAVGIKQKDLVNTMVTKFLASDFAVMLFHYDGIVDKWKEFEWSNRVVHVLITCVLFILFMKVA
        LP+ I+ + S+LE++PLW       +     + NL A+ VG+KQK  V+ +V KFL ++F ++LFHYDG +DKW + EWS++ +H++        F K  
Subjt:  LPKDIVVTASNLEMRPLWGASKRSYQKPVNVSSNLFAVAVGIKQKDLVNTMVTKFLASDFAVMLFHYDGIVDKWKEFEWSNRVVHVLITCVLFILFMKVA

Query:  EFHSVSLVLDLVM-------------SRYVHIIESEGLEISQPALDPLKSEVHHQITARGRRSKVHRSSPRSLISYELICCINFQSLIIDILIHCTYAFI
            V  + D +               RY+ I++S GLEISQPALD   +E+HH+IT R +  K HR   R  I+     C N  S              
Subjt:  EFHSVSLVLDLVM-------------SRYVHIIESEGLEISQPALDPLKSEVHHQITARGRRSKVHRSSPRSLISYELICCINFQSLIIDILIHCTYAFI

Query:  SRLLLCGSIFGLSGELTNQVMVEKVVISTMDRNDGPGFFPSVMALCLVYDPGGCYPFLGLSYIIQLNESQKNDLIHAWGLDMQLGYCAQARGDRTKNVGV
                                         D P         C  +  G    F   +++   N  Q NDL+H WG+DM+LGYCAQ  GDRTKNVG+
Subjt:  SRLLLCGSIFGLSGELTNQVMVEKVVISTMDRNDGPGFFPSVMALCLVYDPGGCYPFLGLSYIIQLNESQKNDLIHAWGLDMQLGYCAQARGDRTKNVGV

Query:  VDSEYVIHHGRPTLGG--PEENETSSKSPVK-------DHRADVRRQSYIELAVFRKRWLKAAEQDECWQDP
        VDSEY++H G  TLG   PE+ +T+     +       D R ++RRQS  EL  F++RW KA E+D  W DP
Subjt:  VDSEYVIHHGRPTLGG--PEENETSSKSPVK-------DHRADVRRQSYIELAVFRKRWLKAAEQDECWQDP

AT1G61240.1 Protein of unknown function (DUF707) (e value: 9.8e-46; % identity: 32.43)
Query:  LPKDIVVTASNLEMRPLWGASKRSYQKPVNVSSNLFAVAVGIKQKDLVNTMVTKFLASDFAVMLFHYDGIVDKWKEFEWSNRVVHVLITCVLFILFMKVA
        LP  I+   S+LE++PLW +S    +     + NL A+ VG+KQKD V+ +V KFL ++F V+LFHYDG +D+W + EWS++ +H++        F K  
Subjt:  LPKDIVVTASNLEMRPLWGASKRSYQKPVNVSSNLFAVAVGIKQKDLVNTMVTKFLASDFAVMLFHYDGIVDKWKEFEWSNRVVHVLITCVLFILFMKVA

Query:  EFHSVSLVLDLVM-------------SRYVHIIESEGLEISQPALDPLKSEVHHQITARGRRSKVHRSSPRSLISYELICCINFQSLIIDILIHCTYAFI
            +  + D V               +Y+ I+++ GLEISQPAL P  +EVHH+IT R R    HR                                 
Subjt:  EFHSVSLVLDLVM-------------SRYVHIIESEGLEISQPALDPLKSEVHHQITARGRRSKVHRSSPRSLISYELICCINFQSLIIDILIHCTYAFI

Query:  SRLLLCGSIFGLSGELTNQVMVEKVVISTMDRNDGPGFFPSVMALCLVYDPGGCYPFLGLSYIIQLNESQKNDLIHAWGLDMQLGYCAQARGDRTKNVGV
                          +V   +  +   + ++GP         C  +  G    F   ++    N  Q NDL+H WG+DM+LGYCAQ  GDR+K VG+
Subjt:  SRLLLCGSIFGLSGELTNQVMVEKVVISTMDRNDGPGFFPSVMALCLVYDPGGCYPFLGLSYIIQLNESQKNDLIHAWGLDMQLGYCAQARGDRTKNVGV

Query:  VDSEYVIHHGRPTLGGP--EENETSSKSPVK--------DHRADVRRQSYIELAVFRKRWLKAAEQDECW
        VDSEY+ H G  TLGG    + + S++S V         D R ++RRQS  EL  F++RW +A  +D+ W
Subjt:  VDSEYVIHHGRPTLGGP--EENETSSKSPVK--------DHRADVRRQSYIELAVFRKRWLKAAEQDECW

AT4G12840.1 Protein of unknown function (DUF707) (e value: 3.8e-66; % identity: 37)
Query:  RQELHVSLLASSYLWCTILWNVGSSYDRFDRKEGIKISKWGIDGFVR---SKFSKCENQCRPNGSDPLPKDIVVTASNLEMRPLWGASKRSYQKPVNVSS
        R+ L +  L S   +  +++ +G+++   D KEGI     G    +R   +K   C+ Q RP GS+ LP+ IV + S+LEMRPLWGA +    KP     
Subjt:  RQELHVSLLASSYLWCTILWNVGSSYDRFDRKEGIKISKWGIDGFVR---SKFSKCENQCRPNGSDPLPKDIVVTASNLEMRPLWGASKRSYQKPVNVSS

Query:  NLFAVAVGIKQKDLVNTMVTKFLASDFAVMLFHYDGIVDKWKEFEWSNRVVHVLITCVLFILFMK------VAEFHSVSLVLDLVMS-------RYVHII
        +L A+AVGI+QK+ VN +V KF +S+F VMLFHYDG VD+WKEFEWS+  +H+ +       F K      +   +S   + D  +        RYV II
Subjt:  NLFAVAVGIKQKDLVNTMVTKFLASDFAVMLFHYDGIVDKWKEFEWSNRVVHVLITCVLFILFMK------VAEFHSVSLVLDLVMS-------RYVHII

Query:  ESEGLEISQPALDPLKSEVHHQITARGRRSKVHRSSPRSLISYELICCINFQSLIIDILIHCTYAFISRLLLCGSIFGLSGELTNQVMVEKVVISTMDRN
        + E LEISQPALDP  SEVHHQ+T+R ++S+VHR                            TY  I R                            + +
Subjt:  ESEGLEISQPALDPLKSEVHHQITARGRRSKVHRSSPRSLISYELICCINFQSLIIDILIHCTYAFISRLLLCGSIFGLSGELTNQVMVEKVVISTMDRN

Query:  DGPGFFPSVMALCLVYDPGGCYPFLGLSYIIQLNESQKNDLIHAWGLDMQLGYCAQARGDRTKNVGVVDSEYVIHHGRPTL-GGPEENETSS--------
         GP     V  +  V+       +    ++IQ      NDL H WG+D QLGYCAQ  GDRTKN+G+VDSEY++H G PTL GG  EN+T S        
Subjt:  DGPGFFPSVMALCLVYDPGGCYPFLGLSYIIQLNESQKNDLIHAWGLDMQLGYCAQARGDRTKNVGVVDSEYVIHHGRPTL-GGPEENETSS--------

Query:  ------KSPVKDHRADVRRQSYIELAVFRKRWLKAAEQDECWQDPY
               S V   R +VR+Q+Y+EL  F+ RW  A + DECW D +
Subjt:  ------KSPVKDHRADVRRQSYIELAVFRKRWLKAAEQDECWQDPY

AT4G12840.2 Protein of unknown function (DUF707) (e value: 3.8e-66; % identity: 37)
Query:  RQELHVSLLASSYLWCTILWNVGSSYDRFDRKEGIKISKWGIDGFVR---SKFSKCENQCRPNGSDPLPKDIVVTASNLEMRPLWGASKRSYQKPVNVSS
        R+ L +  L S   +  +++ +G+++   D KEGI     G    +R   +K   C+ Q RP GS+ LP+ IV + S+LEMRPLWGA +    KP     
Subjt:  RQELHVSLLASSYLWCTILWNVGSSYDRFDRKEGIKISKWGIDGFVR---SKFSKCENQCRPNGSDPLPKDIVVTASNLEMRPLWGASKRSYQKPVNVSS

Query:  NLFAVAVGIKQKDLVNTMVTKFLASDFAVMLFHYDGIVDKWKEFEWSNRVVHVLITCVLFILFMK------VAEFHSVSLVLDLVMS-------RYVHII
        +L A+AVGI+QK+ VN +V KF +S+F VMLFHYDG VD+WKEFEWS+  +H+ +       F K      +   +S   + D  +        RYV II
Subjt:  NLFAVAVGIKQKDLVNTMVTKFLASDFAVMLFHYDGIVDKWKEFEWSNRVVHVLITCVLFILFMK------VAEFHSVSLVLDLVMS-------RYVHII

Query:  ESEGLEISQPALDPLKSEVHHQITARGRRSKVHRSSPRSLISYELICCINFQSLIIDILIHCTYAFISRLLLCGSIFGLSGELTNQVMVEKVVISTMDRN
        + E LEISQPALDP  SEVHHQ+T+R ++S+VHR                            TY  I R                            + +
Subjt:  ESEGLEISQPALDPLKSEVHHQITARGRRSKVHRSSPRSLISYELICCINFQSLIIDILIHCTYAFISRLLLCGSIFGLSGELTNQVMVEKVVISTMDRN

Query:  DGPGFFPSVMALCLVYDPGGCYPFLGLSYIIQLNESQKNDLIHAWGLDMQLGYCAQARGDRTKNVGVVDSEYVIHHGRPTL-GGPEENETSS--------
         GP     V  +  V+       +    ++IQ      NDL H WG+D QLGYCAQ  GDRTKN+G+VDSEY++H G PTL GG  EN+T S        
Subjt:  DGPGFFPSVMALCLVYDPGGCYPFLGLSYIIQLNESQKNDLIHAWGLDMQLGYCAQARGDRTKNVGVVDSEYVIHHGRPTL-GGPEENETSS--------

Query:  ------KSPVKDHRADVRRQSYIELAVFRKRWLKAAEQDECWQDPY
               S V   R +VR+Q+Y+EL  F+ RW  A + DECW D +
Subjt:  ------KSPVKDHRADVRRQSYIELAVFRKRWLKAAEQDECWQDPY

AT4G18530.1 Protein of unknown function (DUF707) (e value: 2.9e-66; % identity: 36.25)
Query:  SCFCTLLPVTTLLCLVLFVGSAYIAPDYRQELHVSLLASSYLWCTILWNV-GSSYDRFDRKEGIKISKWGIDGFVRSKFSKCENQCRPNGSDPLPKDIVV
        SC C++L  T L+C   F+ +AY+A D++++L             + W +    ++  D+ +    +            S C+N  +P G++ LP+ I+ 
Subjt:  SCFCTLLPVTTLLCLVLFVGSAYIAPDYRQELHVSLLASSYLWCTILWNV-GSSYDRFDRKEGIKISKWGIDGFVRSKFSKCENQCRPNGSDPLPKDIVV

Query:  TASNLEMRPLWGASKRSYQKPVNVSSNLFAVAVGIKQKDLVNTMVTKFLASDFAVMLFHYDGIVDKWKEFEWSNRVVHVLITCVLFILFMK-------VA
          SNLE + LW       ++P N S +L A+AVGIKQK+LVN ++ KF   DFAVMLFHYDG+VD WK++ W+N  +HV +       F K       VA
Subjt:  TASNLEMRPLWGASKRSYQKPVNVSSNLFAVAVGIKQKDLVNTMVTKFLASDFAVMLFHYDGIVDKWKEFEWSNRVVHVLITCVLFILFMK-------VA

Query:  EFHSVSL------VLDLVMSRYVHIIESEGLEISQPALDPLKSEVHHQITARGRRSKVHRSSPRSLISYELICCINFQSLIIDILIHCTYAFISRLLLCG
        E+  + L      V      RY+ I++ EGLEISQPALD  KSEVHH ITAR ++SKVHR   +                                    
Subjt:  EFHSVSL------VLDLVMSRYVHIIESEGLEISQPALDPLKSEVHHQITARGRRSKVHRSSPRSLISYELICCINFQSLIIDILIHCTYAFISRLLLCG

Query:  SIFGLSGELTNQVMVEKVVISTMDRNDGPGFFPSVMALCLVYDPGGCYPFLGLSYIIQLNESQKNDLIHAWGLDMQLGYCAQARGDRTKNVGVVDSEYVI
          +  SG                D +  P     V  +  V+       +    Y+IQ      NDLIHAWGLD QLGYCAQ  GDR KNVGVVD+EY+I
Subjt:  SIFGLSGELTNQVMVEKVVISTMDRNDGPGFFPSVMALCLVYDPGGCYPFLGLSYIIQLNESQKNDLIHAWGLDMQLGYCAQARGDRTKNVGVVDSEYVI

Query:  HHGRPTLGGPE------ENETSSKSPVK------DHRADVRRQSYIELAVFRKRWLKAAEQDECWQDPY
        H+G PTLG  E       NET SKS         D+R +VR +S++E+  F++RW KA   D CW DPY
Subjt:  HHGRPTLGGPE------ENETSSKSPVK------DHRADVRRQSYIELAVFRKRWLKAAEQDECWQDPY


Sequences
CDS sequence
ATGAAGTTCTCCGGTTGTTTGCCCTTACTTGCAGAGCAAAAAAGTAGGCATTCTTGTTTTTGTACCCTTCTTCCAGTCACTACTTTGCTTTGTCTTGTATTGTTTGTGGG
GAGTGCATATATAGCACCAGATTATAGACAGGAATTGCACGTCTCCTTATTAGCTAGCTCCTATTTGTGGTGCACAATTTTGTGGAATGTAGGTAGCAGTTACGATCGGT
TCGATAGAAAAGAAGGAATAAAAATCTCTAAATGGGGAATAGATGGTTTTGTCAGATCAAAGTTCAGTAAATGTGAGAATCAATGTAGGCCAAATGGAAGCGACCCGCTA
CCTAAAGACATTGTTGTCACTGCATCTAACTTGGAAATGCGACCGCTATGGGGTGCGTCGAAGCGTTCTTATCAGAAACCCGTTAACGTATCGAGTAATTTATTCGCCGT
TGCCGTCGGGATTAAACAAAAAGATCTTGTGAATACAATGGTAACAAAGTTTCTAGCTAGCGACTTTGCTGTGATGCTTTTCCATTATGATGGTATCGTGGACAAATGGA
AGGAATTTGAGTGGAGTAATCGTGTAGTACATGTTCTTATTACATGTGTTTTGTTCATTCTATTTATGAAAGTTGCTGAGTTTCACTCTGTTTCATTGGTTCTTGATCTG
GTGATGTCTAGGTATGTACATATTATTGAAAGTGAAGGGCTAGAGATATCACAACCAGCTCTTGATCCACTCAAATCAGAGGTGCACCATCAAATTACTGCACGTGGGAG
GCGATCGAAAGTGCACAGGTCTTCTCCCCGTTCCCTGATTTCGTATGAACTGATTTGTTGTATTAACTTTCAATCATTAATCATTGACATTTTGATACATTGTACATATG
CTTTCATCAGTCGTTTACTATTGTGTGGTTCTATCTTTGGTCTGTCAGGAGAACTTACAAACCAAGTAATGGTGGAAAAGGTTGTGATATCAACAATGGATCGAAATGAT
GGCCCCGGTTTTTTCCCGAGCGTCATGGCGTTGTGTTTGGTATATGATCCAGGTGGGTGCTACCCTTTTCTTGGCCTATCATATATTATTCAACTAAACGAAAGTCAAAA
AAATGATTTGATCCATGCTTGGGGCTTAGATATGCAACTGGGATATTGTGCACAGGCAAGAGGTGATCGAACAAAGAACGTCGGTGTCGTTGACTCCGAGTATGTAATCC
ATCATGGACGACCGACGCTCGGTGGTCCAGAAGAAAATGAGACATCTTCAAAGTCTCCTGTAAAGGATCATAGGGCTGATGTAAGAAGGCAGTCCTATATTGAACTAGCT
GTATTTAGAAAAAGATGGCTGAAGGCTGCTGAACAAGATGAGTGCTGGCAAGATCCATACCCAGAGACAATGTATAGCAACACCACTTTATAA
mRNA sequence
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGCTCCCCAATAACTTTCCTTCTCTTCCACGTCTCCTTTTAAATTCATTTCCCGACGAGGGCAAC
ATTGTGGATGTTGTGGCCGAGTGTTTTCCGGTGAACGTTGTTGGGTATTGAAATTTGTGCAGTTTTCGCCTCTTTTCTGTCTCTTTTCATACCAATCTATCTCGAATTTC
ATGGAAACATGAAGTTCTCCGGTTGTTTGCCCTTACTTGCAGAGCAAAAAAGTAGGCATTCTTGTTTTTGTACCCTTCTTCCAGTCACTACTTTGCTTTGTCTTGTATTG
TTTGTGGGGAGTGCATATATAGCACCAGATTATAGACAGGAATTGCACGTCTCCTTATTAGCTAGCTCCTATTTGTGGTGCACAATTTTGTGGAATGTAGGTAGCAGTTA
CGATCGGTTCGATAGAAAAGAAGGAATAAAAATCTCTAAATGGGGAATAGATGGTTTTGTCAGATCAAAGTTCAGTAAATGTGAGAATCAATGTAGGCCAAATGGAAGCG
ACCCGCTACCTAAAGACATTGTTGTCACTGCATCTAACTTGGAAATGCGACCGCTATGGGGTGCGTCGAAGCGTTCTTATCAGAAACCCGTTAACGTATCGAGTAATTTA
TTCGCCGTTGCCGTCGGGATTAAACAAAAAGATCTTGTGAATACAATGGTAACAAAGTTTCTAGCTAGCGACTTTGCTGTGATGCTTTTCCATTATGATGGTATCGTGGA
CAAATGGAAGGAATTTGAGTGGAGTAATCGTGTAGTACATGTTCTTATTACATGTGTTTTGTTCATTCTATTTATGAAAGTTGCTGAGTTTCACTCTGTTTCATTGGTTC
TTGATCTGGTGATGTCTAGGTATGTACATATTATTGAAAGTGAAGGGCTAGAGATATCACAACCAGCTCTTGATCCACTCAAATCAGAGGTGCACCATCAAATTACTGCA
CGTGGGAGGCGATCGAAAGTGCACAGGTCTTCTCCCCGTTCCCTGATTTCGTATGAACTGATTTGTTGTATTAACTTTCAATCATTAATCATTGACATTTTGATACATTG
TACATATGCTTTCATCAGTCGTTTACTATTGTGTGGTTCTATCTTTGGTCTGTCAGGAGAACTTACAAACCAAGTAATGGTGGAAAAGGTTGTGATATCAACAATGGATC
GAAATGATGGCCCCGGTTTTTTCCCGAGCGTCATGGCGTTGTGTTTGGTATATGATCCAGGTGGGTGCTACCCTTTTCTTGGCCTATCATATATTATTCAACTAAACGAA
AGTCAAAAAAATGATTTGATCCATGCTTGGGGCTTAGATATGCAACTGGGATATTGTGCACAGGCAAGAGGTGATCGAACAAAGAACGTCGGTGTCGTTGACTCCGAGTA
TGTAATCCATCATGGACGACCGACGCTCGGTGGTCCAGAAGAAAATGAGACATCTTCAAAGTCTCCTGTAAAGGATCATAGGGCTGATGTAAGAAGGCAGTCCTATATTG
AACTAGCTGTATTTAGAAAAAGATGGCTGAAGGCTGCTGAACAAGATGAGTGCTGGCAAGATCCATACCCAGAGACAATGTATAGCAACACCACTTTATAA
Protein sequence
MKFSGCLPLLAEQKSRHSCFCTLLPVTTLLCLVLFVGSAYIAPDYRQELHVSLLASSYLWCTILWNVGSSYDRFDRKEGIKISKWGIDGFVRSKFSKCENQCRPNGSDPL
PKDIVVTASNLEMRPLWGASKRSYQKPVNVSSNLFAVAVGIKQKDLVNTMVTKFLASDFAVMLFHYDGIVDKWKEFEWSNRVVHVLITCVLFILFMKVAEFHSVSLVLDL
VMSRYVHIIESEGLEISQPALDPLKSEVHHQITARGRRSKVHRSSPRSLISYELICCINFQSLIIDILIHCTYAFISRLLLCGSIFGLSGELTNQVMVEKVVISTMDRND
GPGFFPSVMALCLVYDPGGCYPFLGLSYIIQLNESQKNDLIHAWGLDMQLGYCAQARGDRTKNVGVVDSEYVIHHGRPTLGGPEENETSSKSPVKDHRADVRRQSYIELA
VFRKRWLKAAEQDECWQDPYPETMYSNTTL
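
As a quick consistency check between the CDS and protein sequences above, the first and last codons of the CDS can be translated and compared with the ends of the protein. This is a minimal sketch that assumes the standard nuclear genetic code; the `translate` helper and the `X` placeholder for unrecognised codons are illustrative choices, not part of the database record.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence codon by codon ('*' marks a stop)."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3))

# First 39 and last 12 nucleotides of the CDS listed above.
cds_start = "ATGAAGTTCTCCGGTTGTTTGCCCTTACTTGCAGAGCAA"
cds_end = "ACCACTTTATAA"

print(translate(cds_start))  # prints: MKFSGCLPLLAEQ
print(translate(cds_end))    # prints: TTL*
```

Both fragments agree with the protein sequence, which begins MKFSGCLPLLAEQ and ends TTL, followed by the stop codon TAA.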