; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0021704 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0021704
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionHornerin-like
Genome locationchr05:10262512..10267898
RNA-Seq ExpressionIVF0021704
SyntenyIVF0021704
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004138941.1 uncharacterized protein LOC101209678 [Cucumis sativus]1.95e-26987.68Show/hide
Query:  MDLNKTVAHYSQNGDLTKDDNFGDTTLSLNCFGFGVRKSSGCEVALNDLNFNFSYAPDDGCRLVLGLGPTPSANCDDYYNVGYNKTKTQVASLPEEISPS
        MDLNKTVAHYSQN DLTKDDNFGDTTLSLNCFGFG RKSSGCEVALNDLNFNFSY+PDDGCRLVLGLGPTPSANCDDYYNVGYNKTK QVASLPEEISPS
Subjt:  MDLNKTVAHYSQNGDLTKDDNFGDTTLSLNCFGFGVRKSSGCEVALNDLNFNFSYAPDDGCRLVLGLGPTPSANCDDYYNVGYNKTKTQVASLPEEISPS

Query:  DSILQLGLSGGTNEVSSVVECSVSAETDVSATYLINQWTAEANQLSIPLVDEGSTSAKKSGGYMPSLLFAPRMGTSNLLIQQEVLEIDSRNQLSQELSPT
        DS+LQLGLSGGTNEVSSVVECSVSAETDVS TYLI+QW AEANQLSIPLVDEGSTSAKKSGGYMPSLLFAPRMGTSN+LIQQE+LE DSRNQLSQ LSPT
Subjt:  DSILQLGLSGGTNEVSSVVECSVSAETDVSATYLINQWTAEANQLSIPLVDEGSTSAKKSGGYMPSLLFAPRMGTSNLLIQQEVLEIDSRNQLSQELSPT

Query:  VEYSLGTVIDQTTKSVCSDHQANNPKRCKYFGCEKGARGASGLCIGHGGGHRCQKPGCNKGAESRTAYCKAHGGGRRCQHLGCTKSAEGKTEFCIAHGGG
        VEYSLGTVIDQTTKSVCSDHQANNPKRCKYFGCEKGARGASGLCIGHGGGHRCQKPGC KGAESRTAYCKAHGGGRRCQHLGCTKSAEGKTEFCIAHGGG
Subjt:  VEYSLGTVIDQTTKSVCSDHQANNPKRCKYFGCEKGARGASGLCIGHGGGHRCQKPGCNKGAESRTAYCKAHGGGRRCQHLGCTKSAEGKTEFCIAHGGG

Query:  RRCGYSGGCAKAARGKSGLCIRHGGGKRCKMEGCTRSAEGHAGLCISHGGGRRCQYERCTKGAQGSTMYCKAHGGGNDVYLPDAQKVLKGVLLFARDMVG
        RRCGYSGGCAKAARGKSGLCIRHGGGKRCKM+GCTRSAEGHAGLCISHGGGRRCQYE CTKGAQGSTMYCKAHGGG         K  +G     +   G
Subjt:  RRCGYSGGCAKAARGKSGLCIRHGGGKRCKMEGCTRSAEGHAGLCISHGGGRRCQYERCTKGAQGSTMYCKAHGGGNDVYLPDAQKVLKGVLLFARDMVG

Query:  GSVASLMGVGFAQKVYMGAQTF
        G      G G   K   G   F
Subjt:  GSVASLMGVGFAQKVYMGAQTF

XP_008457144.1 PREDICTED: uncharacterized protein LOC103496890 [Cucumis melo]8.98e-28091.23Show/hide
Query:  MDLNKTVAHYSQNGDLTKDDNFGDTTLSLNCFGFGVRKSSGCEVALNDLNFNFSYAPDDGCRLVLGLGPTPSANCDDYYNVGYNKTKTQVASLPEEISPS
        MDLNKTVAHYSQNGDLTKDDNFGDTTLSLNCFGFGVRKSSGCEVALNDLNFNFSYAPDDGCRLVLGLGPTPSANCDDYYNVGYNKTKTQVASLPEEISPS
Subjt:  MDLNKTVAHYSQNGDLTKDDNFGDTTLSLNCFGFGVRKSSGCEVALNDLNFNFSYAPDDGCRLVLGLGPTPSANCDDYYNVGYNKTKTQVASLPEEISPS

Query:  DSILQLGLSGGTNEVSSVVECSVSAETDVSATYLINQWTAEANQLSIPLVDEGSTSAKKSGGYMPSLLFAPRMGTSNLLIQQEVLEIDSRNQLSQELSPT
        DSILQLGLSGGTNEVSSVVECSVSAETDVSATYLINQWTAEANQLSIPLVDEGSTSAKKSGGYMPSLLFAPRMGTSNLLIQQEVLEIDSRNQLSQELSPT
Subjt:  DSILQLGLSGGTNEVSSVVECSVSAETDVSATYLINQWTAEANQLSIPLVDEGSTSAKKSGGYMPSLLFAPRMGTSNLLIQQEVLEIDSRNQLSQELSPT

Query:  VEYSLGTVIDQTTKSVCSDHQANNPKRCKYFGCEKGARGASGLCIGHGGGHRCQKPGCNKGAESRTAYCKAHGGGRRCQHLGCTKSAEGKTEFCIAHGGG
        VEYSLGTVIDQTTKSVCSDHQANNPKRCKYFGCEKGARGASGLCIGHGGGHRCQKPGCNKGAESRTAYCKAHGGGRRCQHLGCTKSAEGKTEFCIAHGGG
Subjt:  VEYSLGTVIDQTTKSVCSDHQANNPKRCKYFGCEKGARGASGLCIGHGGGHRCQKPGCNKGAESRTAYCKAHGGGRRCQHLGCTKSAEGKTEFCIAHGGG

Query:  RRCGYSGGCAKAARGKSGLCIRHGGGKRCKMEGCTRSAEGHAGLCISHGGGRRCQYERCTKGAQGSTMYCKAHGGGNDVYLPDAQKVLKGVLLFARDMVG
        RRCGYSGGCAKAARGKSGLCIRHGGGKRCKMEGCTRSAEGHAGLCISHGGGRRCQYERCTKGAQGSTMYCKAHGGG         K  +G     +   G
Subjt:  RRCGYSGGCAKAARGKSGLCIRHGGGKRCKMEGCTRSAEGHAGLCISHGGGRRCQYERCTKGAQGSTMYCKAHGGGNDVYLPDAQKVLKGVLLFARDMVG

Query:  GSVASLMGVGFAQKVYMGAQTF
        G      G G   K   G   F
Subjt:  GSVASLMGVGFAQKVYMGAQTF

XP_022928785.1 uncharacterized protein LOC111435606 [Cucurbita moschata]6.63e-25884.36Show/hide
Query:  MDLNKTVAHYSQNGDLTKDDNFGDTTLSLNCFGFGVRKSSGCEVALNDLNFNFSYAPDDGCRLVLGLGPTPSANCDDYYNVGYNKTKTQVASLPEEISPS
        MDLNKTVAHYSQNGDLTKDDNFGDTTLSLNCFGFG RKSSGCEVALNDLNFNFSYAPDDGCRLVLGLGPTPSANCDDYYNVGY K K  V SLPEEISPS
Subjt:  MDLNKTVAHYSQNGDLTKDDNFGDTTLSLNCFGFGVRKSSGCEVALNDLNFNFSYAPDDGCRLVLGLGPTPSANCDDYYNVGYNKTKTQVASLPEEISPS

Query:  DSILQLGLSGGTNEVSSVVECSVSAETDVSATYLINQWTAEANQLSIPLVDEGSTSAKKSGGYMPSLLFAPRMGTSNLLIQQEVLEIDSRNQLSQELSPT
        DSILQLGLSGGTNE SSVVECS+S +TDVSATYL+NQW AEANQLSIPLVDEGSTSAKKSGGYMPSLLFAPRMGTSN+LIQQ  L+ D+ NQLSQELSPT
Subjt:  DSILQLGLSGGTNEVSSVVECSVSAETDVSATYLINQWTAEANQLSIPLVDEGSTSAKKSGGYMPSLLFAPRMGTSNLLIQQEVLEIDSRNQLSQELSPT

Query:  VEYSLGTVIDQTTKSVCSDHQANNPKRCKYFGCEKGARGASGLCIGHGGGHRCQKPGCNKGAESRTAYCKAHGGGRRCQHLGCTKSAEGKTEFCIAHGGG
        +EYSLG+VIDQT KSVCSDHQ +NPK+CKYFGCEKGARGASGLCIGHGGGHRCQKPGCNKGAESRTAYCKAHGGGRRCQHLGCTKSAEGKT+FCIAHGGG
Subjt:  VEYSLGTVIDQTTKSVCSDHQANNPKRCKYFGCEKGARGASGLCIGHGGGHRCQKPGCNKGAESRTAYCKAHGGGRRCQHLGCTKSAEGKTEFCIAHGGG

Query:  RRCGYSGGCAKAARGKSGLCIRHGGGKRCKMEGCTRSAEGHAGLCISHGGGRRCQYERCTKGAQGSTMYCKAHGGGNDVYLPDAQKVLKGVLLFARDMVG
        RRCGYSGGCAKAARGKSGLCIRHGGGKRCKME CTRSAEG AGLCISHGGGRRCQYE CTKGAQGSTMYCKAHGGG         K  +G     +   G
Subjt:  RRCGYSGGCAKAARGKSGLCIRHGGGKRCKMEGCTRSAEGHAGLCISHGGGRRCQYERCTKGAQGSTMYCKAHGGGNDVYLPDAQKVLKGVLLFARDMVG

Query:  GSVASLMGVGFAQKVYMGAQTF
        G      G G   K   G   F
Subjt:  GSVASLMGVGFAQKVYMGAQTF

XP_023550234.1 uncharacterized protein LOC111808470 [Cucurbita pepo subsp. pepo]3.30e-25884.6Show/hide
Query:  MDLNKTVAHYSQNGDLTKDDNFGDTTLSLNCFGFGVRKSSGCEVALNDLNFNFSYAPDDGCRLVLGLGPTPSANCDDYYNVGYNKTKTQVASLPEEISPS
        MDLNKTVAHYSQNGDLTKDDNFGDTTLSLNCFGFG RKSSGCEVALNDLNFNFSYAPDDGCRLVLGLGPTPSANCDDYYNVGY K K  V SLPEEISPS
Subjt:  MDLNKTVAHYSQNGDLTKDDNFGDTTLSLNCFGFGVRKSSGCEVALNDLNFNFSYAPDDGCRLVLGLGPTPSANCDDYYNVGYNKTKTQVASLPEEISPS

Query:  DSILQLGLSGGTNEVSSVVECSVSAETDVSATYLINQWTAEANQLSIPLVDEGSTSAKKSGGYMPSLLFAPRMGTSNLLIQQEVLEIDSRNQLSQELSPT
        DSILQLGLSGGTNE SSVVECS+S +TDVSATYLINQW AEANQLSIPLVDEGSTSAKKSGGYMPSLLFAPRMGTSN+LIQQ  L+ D+ NQLSQELSPT
Subjt:  DSILQLGLSGGTNEVSSVVECSVSAETDVSATYLINQWTAEANQLSIPLVDEGSTSAKKSGGYMPSLLFAPRMGTSNLLIQQEVLEIDSRNQLSQELSPT

Query:  VEYSLGTVIDQTTKSVCSDHQANNPKRCKYFGCEKGARGASGLCIGHGGGHRCQKPGCNKGAESRTAYCKAHGGGRRCQHLGCTKSAEGKTEFCIAHGGG
        +EYSLG+VIDQT KSVCSDHQ +NPK+CKYFGCEKGARGASGLCIGHGGGHRCQKPGCNKGAESRTAYCKAHGGGRRCQHLGCTKSAEGKT+FCIAHGGG
Subjt:  VEYSLGTVIDQTTKSVCSDHQANNPKRCKYFGCEKGARGASGLCIGHGGGHRCQKPGCNKGAESRTAYCKAHGGGRRCQHLGCTKSAEGKTEFCIAHGGG

Query:  RRCGYSGGCAKAARGKSGLCIRHGGGKRCKMEGCTRSAEGHAGLCISHGGGRRCQYERCTKGAQGSTMYCKAHGGGNDVYLPDAQKVLKGVLLFARDMVG
        RRCGYSGGCAKAARGKSGLCIRHGGGKRCKME CTRSAEG AGLCISHGGGRRCQYE CTKGAQGSTMYCKAHGGG         K  +G     +   G
Subjt:  RRCGYSGGCAKAARGKSGLCIRHGGGKRCKMEGCTRSAEGHAGLCISHGGGRRCQYERCTKGAQGSTMYCKAHGGGNDVYLPDAQKVLKGVLLFARDMVG

Query:  GSVASLMGVGFAQKVYMGAQTF
        G      G G   K   G   F
Subjt:  GSVASLMGVGFAQKVYMGAQTF

XP_038896076.1 uncharacterized protein LOC120084247 [Benincasa hispida]7.44e-26286.02Show/hide
Query:  MDLNKTVAHYSQNGDLTKDDNFGDTTLSLNCFGFGVRKSSGCEVALNDLNFNFSYAPDDGCRLVLGLGPTPSANCDDYYNVGYNKTKTQVASLPEEISPS
        MDLNKTVAHYSQNGDLTKDDNFGDTTLSLNCFGFG RKSSGC VALNDLNF+F YAPDDGCRLVLGLGPTPSANCDDYYNVGYNK K QV SLP EISP+
Subjt:  MDLNKTVAHYSQNGDLTKDDNFGDTTLSLNCFGFGVRKSSGCEVALNDLNFNFSYAPDDGCRLVLGLGPTPSANCDDYYNVGYNKTKTQVASLPEEISPS

Query:  DSILQLGLSGGTNEVSSVVECSVSAETDVSATYLINQWTAEANQLSIPLVDEGSTSAKKSGGYMPSLLFAPRMGTSNLLIQQEVLEIDSRNQLSQELSPT
        DSILQLGLSGGTNE SSV+ECSVSAETDVSA YLINQW AEANQLSIPLVDEGSTSAKKSGGYMPSLLFAPRMGTS +LIQQE+LE DS NQLSQELSPT
Subjt:  DSILQLGLSGGTNEVSSVVECSVSAETDVSATYLINQWTAEANQLSIPLVDEGSTSAKKSGGYMPSLLFAPRMGTSNLLIQQEVLEIDSRNQLSQELSPT

Query:  VEYSLGTVIDQTTKSVCSDHQANNPKRCKYFGCEKGARGASGLCIGHGGGHRCQKPGCNKGAESRTAYCKAHGGGRRCQHLGCTKSAEGKTEFCIAHGGG
        VEYSLGTVIDQTTKSV SDHQANN KRCKYFGCEKGARGASGLCIGHGGGHRCQKPGCNKGAESRTAYCKAHGGGRRCQHLGCTKSAEGKTEFCIAHGGG
Subjt:  VEYSLGTVIDQTTKSVCSDHQANNPKRCKYFGCEKGARGASGLCIGHGGGHRCQKPGCNKGAESRTAYCKAHGGGRRCQHLGCTKSAEGKTEFCIAHGGG

Query:  RRCGYSGGCAKAARGKSGLCIRHGGGKRCKMEGCTRSAEGHAGLCISHGGGRRCQYERCTKGAQGSTMYCKAHGGGNDVYLPDAQKVLKGVLLFARDMVG
        RRCGYSGGCAKAARGKSGLCIRHGGGKRCKMEGCTRSAEG AGLCISHGGGRRCQYE CTKGAQGSTMYCKAHGGG         K  +G     +   G
Subjt:  RRCGYSGGCAKAARGKSGLCIRHGGGKRCKMEGCTRSAEGHAGLCISHGGGRRCQYERCTKGAQGSTMYCKAHGGGNDVYLPDAQKVLKGVLLFARDMVG

Query:  GSVASLMGVGFAQKVYMGAQTF
        G      G G   K   G   F
Subjt:  GSVASLMGVGFAQKVYMGAQTF

TrEMBL top hitse value%identityAlignment
A0A0A0LI30 Uncharacterized protein2.7e-21487.68Show/hide
Query:  MDLNKTVAHYSQNGDLTKDDNFGDTTLSLNCFGFGVRKSSGCEVALNDLNFNFSYAPDDGCRLVLGLGPTPSANCDDYYNVGYNKTKTQVASLPEEISPS
        MDLNKTVAHYSQN DLTKDDNFGDTTLSLNCFGFG RKSSGCEVALNDLNFNFSY+PDDGCRLVLGLGPTPSANCDDYYNVGYNKTK QVASLPEEISPS
Subjt:  MDLNKTVAHYSQNGDLTKDDNFGDTTLSLNCFGFGVRKSSGCEVALNDLNFNFSYAPDDGCRLVLGLGPTPSANCDDYYNVGYNKTKTQVASLPEEISPS

Query:  DSILQLGLSGGTNEVSSVVECSVSAETDVSATYLINQWTAEANQLSIPLVDEGSTSAKKSGGYMPSLLFAPRMGTSNLLIQQEVLEIDSRNQLSQELSPT
        DS+LQLGLSGGTNEVSSVVECSVSAETDVS TYLI+QW AEANQLSIPLVDEGSTSAKKSGGYMPSLLFAPRMGTSN+LIQQE+LE DSRNQLSQ LSPT
Subjt:  DSILQLGLSGGTNEVSSVVECSVSAETDVSATYLINQWTAEANQLSIPLVDEGSTSAKKSGGYMPSLLFAPRMGTSNLLIQQEVLEIDSRNQLSQELSPT

Query:  VEYSLGTVIDQTTKSVCSDHQANNPKRCKYFGCEKGARGASGLCIGHGGGHRCQKPGCNKGAESRTAYCKAHGGGRRCQHLGCTKSAEGKTEFCIAHGGG
        VEYSLGTVIDQTTKSVCSDHQANNPKRCKYFGCEKGARGASGLCIGHGGGHRCQKPGC KGAESRTAYCKAHGGGRRCQHLGCTKSAEGKTEFCIAHGGG
Subjt:  VEYSLGTVIDQTTKSVCSDHQANNPKRCKYFGCEKGARGASGLCIGHGGGHRCQKPGCNKGAESRTAYCKAHGGGRRCQHLGCTKSAEGKTEFCIAHGGG

Query:  RRCGYSGGCAKAARGKSGLCIRHGGGKRCKMEGCTRSAEGHAGLCISHGGGRRCQYERCTKGAQGSTMYCKAHGGGNDVYLPDAQKVLKGVLLFARDMVG
        RRCGYSGGCAKAARGKSGLCIRHGGGKRCKM+GCTRSAEGHAGLCISHGGGRRCQYE CTKGAQGSTMYCKAHGGG         K  +G     +   G
Subjt:  RRCGYSGGCAKAARGKSGLCIRHGGGKRCKMEGCTRSAEGHAGLCISHGGGRRCQYERCTKGAQGSTMYCKAHGGGNDVYLPDAQKVLKGVLLFARDMVG

Query:  GSVASLMGVGFAQKVYMGAQTF
        G      G G   K   G   F
Subjt:  GSVASLMGVGFAQKVYMGAQTF

A0A1S3C643 uncharacterized protein LOC1034968903.5e-22291.23Show/hide
Query:  MDLNKTVAHYSQNGDLTKDDNFGDTTLSLNCFGFGVRKSSGCEVALNDLNFNFSYAPDDGCRLVLGLGPTPSANCDDYYNVGYNKTKTQVASLPEEISPS
        MDLNKTVAHYSQNGDLTKDDNFGDTTLSLNCFGFGVRKSSGCEVALNDLNFNFSYAPDDGCRLVLGLGPTPSANCDDYYNVGYNKTKTQVASLPEEISPS
Subjt:  MDLNKTVAHYSQNGDLTKDDNFGDTTLSLNCFGFGVRKSSGCEVALNDLNFNFSYAPDDGCRLVLGLGPTPSANCDDYYNVGYNKTKTQVASLPEEISPS

Query:  DSILQLGLSGGTNEVSSVVECSVSAETDVSATYLINQWTAEANQLSIPLVDEGSTSAKKSGGYMPSLLFAPRMGTSNLLIQQEVLEIDSRNQLSQELSPT
        DSILQLGLSGGTNEVSSVVECSVSAETDVSATYLINQWTAEANQLSIPLVDEGSTSAKKSGGYMPSLLFAPRMGTSNLLIQQEVLEIDSRNQLSQELSPT
Subjt:  DSILQLGLSGGTNEVSSVVECSVSAETDVSATYLINQWTAEANQLSIPLVDEGSTSAKKSGGYMPSLLFAPRMGTSNLLIQQEVLEIDSRNQLSQELSPT

Query:  VEYSLGTVIDQTTKSVCSDHQANNPKRCKYFGCEKGARGASGLCIGHGGGHRCQKPGCNKGAESRTAYCKAHGGGRRCQHLGCTKSAEGKTEFCIAHGGG
        VEYSLGTVIDQTTKSVCSDHQANNPKRCKYFGCEKGARGASGLCIGHGGGHRCQKPGCNKGAESRTAYCKAHGGGRRCQHLGCTKSAEGKTEFCIAHGGG
Subjt:  VEYSLGTVIDQTTKSVCSDHQANNPKRCKYFGCEKGARGASGLCIGHGGGHRCQKPGCNKGAESRTAYCKAHGGGRRCQHLGCTKSAEGKTEFCIAHGGG

Query:  RRCGYSGGCAKAARGKSGLCIRHGGGKRCKMEGCTRSAEGHAGLCISHGGGRRCQYERCTKGAQGSTMYCKAHGGGNDVYLPDAQKVLKGVLLFARDMVG
        RRCGYSGGCAKAARGKSGLCIRHGGGKRCKMEGCTRSAEGHAGLCISHGGGRRCQYERCTKGAQGSTMYCKAHGGG         K  +G     +   G
Subjt:  RRCGYSGGCAKAARGKSGLCIRHGGGKRCKMEGCTRSAEGHAGLCISHGGGRRCQYERCTKGAQGSTMYCKAHGGGNDVYLPDAQKVLKGVLLFARDMVG

Query:  GSVASLMGVGFAQKVYMGAQTF
        G      G G   K   G   F
Subjt:  GSVASLMGVGFAQKVYMGAQTF

A0A5A7TC07 Hornerin-like3.5e-22291.23Show/hide
Query:  MDLNKTVAHYSQNGDLTKDDNFGDTTLSLNCFGFGVRKSSGCEVALNDLNFNFSYAPDDGCRLVLGLGPTPSANCDDYYNVGYNKTKTQVASLPEEISPS
        MDLNKTVAHYSQNGDLTKDDNFGDTTLSLNCFGFGVRKSSGCEVALNDLNFNFSYAPDDGCRLVLGLGPTPSANCDDYYNVGYNKTKTQVASLPEEISPS
Subjt:  MDLNKTVAHYSQNGDLTKDDNFGDTTLSLNCFGFGVRKSSGCEVALNDLNFNFSYAPDDGCRLVLGLGPTPSANCDDYYNVGYNKTKTQVASLPEEISPS

Query:  DSILQLGLSGGTNEVSSVVECSVSAETDVSATYLINQWTAEANQLSIPLVDEGSTSAKKSGGYMPSLLFAPRMGTSNLLIQQEVLEIDSRNQLSQELSPT
        DSILQLGLSGGTNEVSSVVECSVSAETDVSATYLINQWTAEANQLSIPLVDEGSTSAKKSGGYMPSLLFAPRMGTSNLLIQQEVLEIDSRNQLSQELSPT
Subjt:  DSILQLGLSGGTNEVSSVVECSVSAETDVSATYLINQWTAEANQLSIPLVDEGSTSAKKSGGYMPSLLFAPRMGTSNLLIQQEVLEIDSRNQLSQELSPT

Query:  VEYSLGTVIDQTTKSVCSDHQANNPKRCKYFGCEKGARGASGLCIGHGGGHRCQKPGCNKGAESRTAYCKAHGGGRRCQHLGCTKSAEGKTEFCIAHGGG
        VEYSLGTVIDQTTKSVCSDHQANNPKRCKYFGCEKGARGASGLCIGHGGGHRCQKPGCNKGAESRTAYCKAHGGGRRCQHLGCTKSAEGKTEFCIAHGGG
Subjt:  VEYSLGTVIDQTTKSVCSDHQANNPKRCKYFGCEKGARGASGLCIGHGGGHRCQKPGCNKGAESRTAYCKAHGGGRRCQHLGCTKSAEGKTEFCIAHGGG

Query:  RRCGYSGGCAKAARGKSGLCIRHGGGKRCKMEGCTRSAEGHAGLCISHGGGRRCQYERCTKGAQGSTMYCKAHGGGNDVYLPDAQKVLKGVLLFARDMVG
        RRCGYSGGCAKAARGKSGLCIRHGGGKRCKMEGCTRSAEGHAGLCISHGGGRRCQYERCTKGAQGSTMYCKAHGGG         K  +G     +   G
Subjt:  RRCGYSGGCAKAARGKSGLCIRHGGGKRCKMEGCTRSAEGHAGLCISHGGGRRCQYERCTKGAQGSTMYCKAHGGGNDVYLPDAQKVLKGVLLFARDMVG

Query:  GSVASLMGVGFAQKVYMGAQTF
        G      G G   K   G   F
Subjt:  GSVASLMGVGFAQKVYMGAQTF

A0A6J1ESJ3 uncharacterized protein LOC1114356061.7e-20584.36Show/hide
Query:  MDLNKTVAHYSQNGDLTKDDNFGDTTLSLNCFGFGVRKSSGCEVALNDLNFNFSYAPDDGCRLVLGLGPTPSANCDDYYNVGYNKTKTQVASLPEEISPS
        MDLNKTVAHYSQNGDLTKDDNFGDTTLSLNCFGFG RKSSGCEVALNDLNFNFSYAPDDGCRLVLGLGPTPSANCDDYYNVGY K K  V SLPEEISPS
Subjt:  MDLNKTVAHYSQNGDLTKDDNFGDTTLSLNCFGFGVRKSSGCEVALNDLNFNFSYAPDDGCRLVLGLGPTPSANCDDYYNVGYNKTKTQVASLPEEISPS

Query:  DSILQLGLSGGTNEVSSVVECSVSAETDVSATYLINQWTAEANQLSIPLVDEGSTSAKKSGGYMPSLLFAPRMGTSNLLIQQEVLEIDSRNQLSQELSPT
        DSILQLGLSGGTNE SSVVECS+S +TDVSATYL+NQW AEANQLSIPLVDEGSTSAKKSGGYMPSLLFAPRMGTSN+LIQQ  L+ D+ NQLSQELSPT
Subjt:  DSILQLGLSGGTNEVSSVVECSVSAETDVSATYLINQWTAEANQLSIPLVDEGSTSAKKSGGYMPSLLFAPRMGTSNLLIQQEVLEIDSRNQLSQELSPT

Query:  VEYSLGTVIDQTTKSVCSDHQANNPKRCKYFGCEKGARGASGLCIGHGGGHRCQKPGCNKGAESRTAYCKAHGGGRRCQHLGCTKSAEGKTEFCIAHGGG
        +EYSLG+VIDQT KSVCSDHQ +NPK+CKYFGCEKGARGASGLCIGHGGGHRCQKPGCNKGAESRTAYCKAHGGGRRCQHLGCTKSAEGKT+FCIAHGGG
Subjt:  VEYSLGTVIDQTTKSVCSDHQANNPKRCKYFGCEKGARGASGLCIGHGGGHRCQKPGCNKGAESRTAYCKAHGGGRRCQHLGCTKSAEGKTEFCIAHGGG

Query:  RRCGYSGGCAKAARGKSGLCIRHGGGKRCKMEGCTRSAEGHAGLCISHGGGRRCQYERCTKGAQGSTMYCKAHGGGNDVYLPDAQKVLKGVLLFARDMVG
        RRCGYSGGCAKAARGKSGLCIRHGGGKRCKME CTRSAEG AGLCISHGGGRRCQYE CTKGAQGSTMYCKAHGGG         K  +G     +   G
Subjt:  RRCGYSGGCAKAARGKSGLCIRHGGGKRCKMEGCTRSAEGHAGLCISHGGGRRCQYERCTKGAQGSTMYCKAHGGGNDVYLPDAQKVLKGVLLFARDMVG

Query:  GSVASLMGVGFAQKVYMGAQTF
        G      G G   K   G   F
Subjt:  GSVASLMGVGFAQKVYMGAQTF

A0A6J1I0Z8 uncharacterized protein LOC1114688954.3e-20483.89Show/hide
Query:  MDLNKTVAHYSQNGDLTKDDNFGDTTLSLNCFGFGVRKSSGCEVALNDLNFNFSYAPDDGCRLVLGLGPTPSANCDDYYNVGYNKTKTQVASLPEEISPS
        MDLNKTVAHYSQNGDLTKDDNFGDTTLSLNCFGFG RKSSGCEVALNDLNFNFSYAPDDGCRLVLGLGPTPSANCDDYYNVGY K K  V SLPEEISPS
Subjt:  MDLNKTVAHYSQNGDLTKDDNFGDTTLSLNCFGFGVRKSSGCEVALNDLNFNFSYAPDDGCRLVLGLGPTPSANCDDYYNVGYNKTKTQVASLPEEISPS

Query:  DSILQLGLSGGTNEVSSVVECSVSAETDVSATYLINQWTAEANQLSIPLVDEGSTSAKKSGGYMPSLLFAPRMGTSNLLIQQEVLEIDSRNQLSQELSPT
        DSILQLGLSGGTNE +SVVECS+S +TDVSATYLINQW AEANQLSIPLVDEGSTSAKKSGGYMPSLLFAPRMGTSN+LIQQ  L+ D+ NQLSQELSPT
Subjt:  DSILQLGLSGGTNEVSSVVECSVSAETDVSATYLINQWTAEANQLSIPLVDEGSTSAKKSGGYMPSLLFAPRMGTSNLLIQQEVLEIDSRNQLSQELSPT

Query:  VEYSLGTVIDQTTKSVCSDHQANNPKRCKYFGCEKGARGASGLCIGHGGGHRCQKPGCNKGAESRTAYCKAHGGGRRCQHLGCTKSAEGKTEFCIAHGGG
        +E+SLG+VIDQT KSVCSDHQ +NPK+CKYFGC KGARGASGLCIGHGGGHRCQKPGCNKGAESRTAYCKAHGGGRRCQHLGCTKSAEGKT+FCIAHGGG
Subjt:  VEYSLGTVIDQTTKSVCSDHQANNPKRCKYFGCEKGARGASGLCIGHGGGHRCQKPGCNKGAESRTAYCKAHGGGRRCQHLGCTKSAEGKTEFCIAHGGG

Query:  RRCGYSGGCAKAARGKSGLCIRHGGGKRCKMEGCTRSAEGHAGLCISHGGGRRCQYERCTKGAQGSTMYCKAHGGGNDVYLPDAQKVLKGVLLFARDMVG
        RRCGYSGGCAKAARGKSGLCIRHGGGKRCKME CTRSAEG AGLCISHGGGRRCQYE CTKGAQGSTMYCKAHGGG         K  +G     +   G
Subjt:  RRCGYSGGCAKAARGKSGLCIRHGGGKRCKMEGCTRSAEGHAGLCISHGGGRRCQYERCTKGAQGSTMYCKAHGGGNDVYLPDAQKVLKGVLLFARDMVG

Query:  GSVASLMGVGFAQKVYMGAQTF
        G      G G   K   G   F
Subjt:  GSVASLMGVGFAQKVYMGAQTF

SwissProt top hitse value%identityAlignment
Q9SZ67 Probable WRKY transcription factor 193.5e-5450.43Show/hide
Query:  HQANNPKRCKYFGCEKGARGASGLCIGHGGGHRCQKPGCNKGAESRTAYCKAHGGGRRCQHLGCTKSAEGKTEFCIAHGGGRRCGYSG------------
        + +++ K C+  GC+KGAR ASG CI HGGG RCQKP C KGAE +T YCKAHGGGRRC++LGCTK AEG T+FCIAHGGGRRC +              
Subjt:  HQANNPKRCKYFGCEKGARGASGLCIGHGGGHRCQKPGCNKGAESRTAYCKAHGGGRRCQHLGCTKSAEGKTEFCIAHGGGRRCGYSG------------

Query:  ------------GCAKA-------------------------ARGKSGLCIRHGGGKRCKMEGCTRSAEGHAGLCISHGGGRRCQYERCTKGAQGSTMYC
                    GC K+                         ARG+SGLC+ HGGGKRC+ E CT+SAEG +GLCISHGGGRRCQ   CTKGA+GS M+C
Subjt:  ------------GCAKA-------------------------ARGKSGLCIRHGGGKRCKMEGCTRSAEGHAGLCISHGGGRRCQYERCTKGAQGSTMYC

Query:  KA---------HGGGN--DVYLPDAQKVLKGV
        KA          GGGN   V   DA   LK V
Subjt:  KA---------HGGGN--DVYLPDAQKVLKGV

Arabidopsis top hitse value%identityAlignment
AT1G64140.1 BEST Arabidopsis thaliana protein match is: loricrin-related (TAIR:AT5G64550.1)3.0e-6946.75Show/hide
Query:  LQLGLSGGTNEVSSVVECSVSAETDVSATYLINQWTAEANQLSIPLVDEGSTSAKKSGGYMPSLLFAPRMGTSNLL---IQQEVLEIDSRNQLSQELSPT
        L+L LSGG +  S +      A    S   ++      AN       +E +T   + G  +P+L  +    TS+ L    +  ++      +LS   + T
Subjt:  LQLGLSGGTNEVSSVVECSVSAETDVSATYLINQWTAEANQLSIPLVDEGSTSAKKSGGYMPSLLFAPRMGTSNLL---IQQEVLEIDSRNQLSQELSPT

Query:  VEYSLGTVIDQTTKSVCSD-HQANNPKRCKYFGCEKGARGASGLCIGHGGGHRCQKPGCNKGAESRTAYCKAHGGGRRCQHLGCTKSAEGKTEFCIAHGG
           S GT     ++ +      +++ K C+  GC KGARGASG CI HGGG RCQK GC+KGAE RT YCKAHGGGRRC+ LGCTKSAEG+T+FCIAHGG
Subjt:  VEYSLGTVIDQTTKSVCSD-HQANNPKRCKYFGCEKGARGASGLCIGHGGGHRCQKPGCNKGAESRTAYCKAHGGGRRCQHLGCTKSAEGKTEFCIAHGG

Query:  GRRCGYSGGCAKAARGKSGLCIRHGGGKRCKMEGCTRSAEGHAGLCISHGGGRRCQYERCTKGAQGSTMYCKAHGGGNDVYLPDAQKVLKGVLLFARDMV
        GRRC +   C +AARG+SGLCIRHGGGKRC+ E CT+SAEG +GLCISHGGGRRCQ   CTKGAQGSTM+CKAHGGG         K  +G   F +   
Subjt:  GRRCGYSGGCAKAARGKSGLCIRHGGGKRCKMEGCTRSAEGHAGLCISHGGGRRCQYERCTKGAQGSTMYCKAHGGGNDVYLPDAQKVLKGVLLFARDMV

Query:  GGSVASLMGVGFAQKVYMGAQTF
        GG   +  G     K   G   F
Subjt:  GGSVASLMGVGFAQKVYMGAQTF

AT4G12020.1 protein kinase family protein2.5e-5550.43Show/hide
Query:  HQANNPKRCKYFGCEKGARGASGLCIGHGGGHRCQKPGCNKGAESRTAYCKAHGGGRRCQHLGCTKSAEGKTEFCIAHGGGRRCGYSG------------
        + +++ K C+  GC+KGAR ASG CI HGGG RCQKP C KGAE +T YCKAHGGGRRC++LGCTK AEG T+FCIAHGGGRRC +              
Subjt:  HQANNPKRCKYFGCEKGARGASGLCIGHGGGHRCQKPGCNKGAESRTAYCKAHGGGRRCQHLGCTKSAEGKTEFCIAHGGGRRCGYSG------------

Query:  ------------GCAKA-------------------------ARGKSGLCIRHGGGKRCKMEGCTRSAEGHAGLCISHGGGRRCQYERCTKGAQGSTMYC
                    GC K+                         ARG+SGLC+ HGGGKRC+ E CT+SAEG +GLCISHGGGRRCQ   CTKGA+GS M+C
Subjt:  ------------GCAKA-------------------------ARGKSGLCIRHGGGKRCKMEGCTRSAEGHAGLCISHGGGRRCQYERCTKGAQGSTMYC

Query:  KA---------HGGGN--DVYLPDAQKVLKGV
        KA          GGGN   V   DA   LK V
Subjt:  KA---------HGGGN--DVYLPDAQKVLKGV

AT5G09670.1 loricrin-related1.3e-9649.88Show/hide
Query:  GDTTLSLNCFGFGVRKSSGCEVALNDLNFNFSYAPDDGCRLVLGLGPTPSANCDDYYNVGYNKTKTQVASLPEEISPSDSILQLGLSGGTNEVSSVVECS
        GDT LSL C G       G E   N    + S   DDGCRLVLGLGPT ++ C   ++VG N +    AS     + +DS+LQLG              +
Subjt:  GDTTLSLNCFGFGVRKSSGCEVALNDLNFNFSYAPDDGCRLVLGLGPTPSANCDDYYNVGYNKTKTQVASLPEEISPSDSILQLGLSGGTNEVSSVVECS

Query:  VSAETDVSATYLINQWTAEANQLSIPLVDEGSTSAKKSGGYMPSLLFAPRMGTSNLLIQQEVLEIDSRNQLSQELSPTVEYSLGTVIDQTTKSVCSDHQA
        VS +T          ++   N   IP+VDEGS+SAK+SGGYMPSLL  P +   + + Q +       +Q+SQE SP  E+ +               + 
Subjt:  VSAETDVSATYLINQWTAEANQLSIPLVDEGSTSAKKSGGYMPSLLFAPRMGTSNLLIQQEVLEIDSRNQLSQELSPTVEYSLGTVIDQTTKSVCSDHQA

Query:  NNPKRCKYFGCEKGARGASGLCIGHGGGHRCQKPGCNKGAESRTAYCKAHGGGRRCQHLGCTKSAEGKTEFCIAHGGGRRCGYSGGCAKAARGKSGLCIR
        +NP++CK+ GC KGARGASGLCI HGGG RCQKPGCNKGAES+T +CK HGGG+RC+HLGCTKSAEGKT+FCI+HGGGRRC +  GC KAARG+SGLCI+
Subjt:  NNPKRCKYFGCEKGARGASGLCIGHGGGHRCQKPGCNKGAESRTAYCKAHGGGRRCQHLGCTKSAEGKTEFCIAHGGGRRCGYSGGCAKAARGKSGLCIR

Query:  HGGGKRCKMEGCTRSAEGHAGLCISHGGGRRCQY-ERCTKGAQGSTMYCKAHGGGNDVYLPDAQKVLKGVLLFARDMVGGSVASLMGVGFAQKVYMGAQT
        HGGGKRC +E CTRSAEG AGLCISHGGG+RCQY   C KGAQGST YCKAHGGG         K  +G     +   GG      G G   K   G   
Subjt:  HGGGKRCKMEGCTRSAEGHAGLCISHGGGRRCQY-ERCTKGAQGSTMYCKAHGGGNDVYLPDAQKVLKGVLLFARDMVGGSVASLMGVGFAQKVYMGAQT

Query:  F
        F
Subjt:  F

AT5G09670.2 loricrin-related1.3e-9649.88Show/hide
Query:  GDTTLSLNCFGFGVRKSSGCEVALNDLNFNFSYAPDDGCRLVLGLGPTPSANCDDYYNVGYNKTKTQVASLPEEISPSDSILQLGLSGGTNEVSSVVECS
        GDT LSL C G       G E   N    + S   DDGCRLVLGLGPT ++ C   ++VG N +    AS     + +DS+LQLG              +
Subjt:  GDTTLSLNCFGFGVRKSSGCEVALNDLNFNFSYAPDDGCRLVLGLGPTPSANCDDYYNVGYNKTKTQVASLPEEISPSDSILQLGLSGGTNEVSSVVECS

Query:  VSAETDVSATYLINQWTAEANQLSIPLVDEGSTSAKKSGGYMPSLLFAPRMGTSNLLIQQEVLEIDSRNQLSQELSPTVEYSLGTVIDQTTKSVCSDHQA
        VS +T          ++   N   IP+VDEGS+SAK+SGGYMPSLL  P +   + + Q +       +Q+SQE SP  E+ +               + 
Subjt:  VSAETDVSATYLINQWTAEANQLSIPLVDEGSTSAKKSGGYMPSLLFAPRMGTSNLLIQQEVLEIDSRNQLSQELSPTVEYSLGTVIDQTTKSVCSDHQA

Query:  NNPKRCKYFGCEKGARGASGLCIGHGGGHRCQKPGCNKGAESRTAYCKAHGGGRRCQHLGCTKSAEGKTEFCIAHGGGRRCGYSGGCAKAARGKSGLCIR
        +NP++CK+ GC KGARGASGLCI HGGG RCQKPGCNKGAES+T +CK HGGG+RC+HLGCTKSAEGKT+FCI+HGGGRRC +  GC KAARG+SGLCI+
Subjt:  NNPKRCKYFGCEKGARGASGLCIGHGGGHRCQKPGCNKGAESRTAYCKAHGGGRRCQHLGCTKSAEGKTEFCIAHGGGRRCGYSGGCAKAARGKSGLCIR

Query:  HGGGKRCKMEGCTRSAEGHAGLCISHGGGRRCQY-ERCTKGAQGSTMYCKAHGGGNDVYLPDAQKVLKGVLLFARDMVGGSVASLMGVGFAQKVYMGAQT
        HGGGKRC +E CTRSAEG AGLCISHGGG+RCQY   C KGAQGST YCKAHGGG         K  +G     +   GG      G G   K   G   
Subjt:  HGGGKRCKMEGCTRSAEGHAGLCISHGGGRRCQY-ERCTKGAQGSTMYCKAHGGGNDVYLPDAQKVLKGVLLFARDMVGGSVASLMGVGFAQKVYMGAQT

Query:  F
        F
Subjt:  F

AT5G64550.1 loricrin-related1.2e-11352.21Show/hide
Query:  MDLNKTVAHYSQNGDLTKDDNFGDTTLSLNCFGFGVRKSSGCEVALNDLNFNFSYAPDDGCRLVLGLGPTPSANCDDYYNV----GYNKTKTQVASLPEE
        MDLN++V H+S+   + K DNFGDT LSL C G    +  G     + L  + S  PD GCRLVLGLGPTP +    YYNV      NK      S+ E 
Subjt:  MDLNKTVAHYSQNGDLTKDDNFGDTTLSLNCFGFGVRKSSGCEVALNDLNFNFSYAPDDGCRLVLGLGPTPSANCDDYYNV----GYNKTKTQVASLPEE

Query:  ISPSDSILQLGLSGGTNEVSSVVECSVSAETDVSATYLINQWTAEANQLSIPLVDEGSTSAKKSGGYMPSLLFAPRMGTSNLLIQQEVLEIDSR---NQL
         S  +SILQLG    T +  S +E S+    D +              +S   VDEGSTSA++SGGYMPSLLFAPR  T N+     + E  +    +  
Subjt:  ISPSDSILQLGLSGGTNEVSSVVECSVSAETDVSATYLINQWTAEANQLSIPLVDEGSTSAKKSGGYMPSLLFAPRMGTSNLLIQQEVLEIDSR---NQL

Query:  SQELSPTVEYSLGTVIDQTTKSVCSDHQANNPKRCKYFGCEKGARGASGLCIGHGGGHRCQKPGCNKGAESRTAYCKAHGGGRRCQHLGCTKSAEGKTEF
        + +LS   E+S+    D++  +  S  + +NPK+CK+ GC KGARGASGLCIGHGGG RCQK GCNKGAES+T +CKAHGGG+RCQHLGCTKSAEGKT+ 
Subjt:  SQELSPTVEYSLGTVIDQTTKSVCSDHQANNPKRCKYFGCEKGARGASGLCIGHGGGHRCQKPGCNKGAESRTAYCKAHGGGRRCQHLGCTKSAEGKTEF

Query:  CIAHGGGRRCGYSGGCAKAARGKSGLCIRHGGGKRCKMEGCTRSAEGHAGLCISHGGGRRCQYERCTKGAQGSTMYCKAHGGGNDVYLPDAQKVLKGVLL
        CI+HGGGRRCG+  GCAKAARGKSGLCI+HGGGKRC++E CTRSAEG AGLCISHGGGRRCQ   CTKGAQGST YCKAHGGG         K  +G   
Subjt:  CIAHGGGRRCGYSGGCAKAARGKSGLCIRHGGGKRCKMEGCTRSAEGHAGLCISHGGGRRCQYERCTKGAQGSTMYCKAHGGGNDVYLPDAQKVLKGVLL

Query:  FARDMVGGSVASLMGVGFAQKVYMGAQTF
          +   GG      G G   K   G  +F
Subjt:  FARDMVGGSVASLMGVGFAQKVYMGAQTF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTTGAACAAGACTGTAGCACATTATTCTCAAAATGGCGATCTCACAAAAGATGACAACTTTGGTGACACTACTTTGAGCTTGAATTGTTTTGGCTTTGGAGTAAG
AAAATCTTCTGGATGTGAGGTTGCTTTAAATGACCTCAACTTCAACTTCAGTTACGCCCCAGATGATGGATGTAGACTGGTACTTGGACTTGGTCCAACTCCAAGTGCTA
ACTGTGATGATTATTACAATGTTGGATATAATAAGACTAAGACACAAGTTGCATCTTTACCAGAAGAAATATCACCGAGTGACTCAATATTGCAGCTTGGTCTTTCTGGA
GGGACTAATGAAGTTTCAAGTGTGGTCGAATGTTCAGTTTCAGCAGAAACTGATGTCAGTGCAACTTATCTGATAAACCAATGGACTGCTGAAGCTAATCAACTGTCTAT
CCCACTGGTTGATGAGGGTTCTACCTCAGCAAAAAAATCGGGTGGGTATATGCCATCACTTCTTTTTGCTCCAAGAATGGGCACTTCAAACCTTCTGATTCAACAGGAGG
TTCTTGAAATTGATAGCAGAAATCAGCTGAGCCAAGAACTATCACCTACTGTAGAATACTCTTTAGGAACTGTAATTGACCAGACAACCAAAAGCGTGTGTTCAGATCAT
CAAGCAAATAATCCTAAAAGGTGCAAATACTTTGGCTGCGAGAAGGGTGCACGAGGAGCATCTGGTCTTTGTATTGGTCATGGAGGTGGACATAGATGCCAGAAACCTGG
GTGCAATAAGGGAGCCGAGAGTCGTACTGCTTACTGTAAAGCTCATGGTGGAGGAAGGAGGTGCCAACATTTAGGGTGCACCAAAAGTGCTGAGGGGAAGACAGAATTTT
GCATTGCTCATGGTGGTGGTAGACGTTGTGGATATTCAGGTGGATGTGCGAAGGCTGCACGTGGAAAGTCGGGCCTTTGTATTAGACATGGTGGGGGAAAACGATGTAAG
ATGGAAGGCTGCACTCGTAGTGCTGAAGGACATGCTGGTCTATGCATTTCTCATGGTGGTGGACGCCGTTGCCAGTATGAACGCTGCACAAAGGGTGCACAAGGAAGTAC
CATGTATTGTAAGGCCCATGGGGGAGGAAACGATGTATATTTGCCGGATGCACAAAAGGTGCTGAAGGGAGTACTCCTCTTTGCAAGGGACATGGTGGGGGGAAGCGTTG
CCTCTTTGATGGGGGTGGGATTTGCCCAAAAAGTGTACATGGGGGCACAAACTTTTGTGTTGCTCATGGTGGTGGAAAGAGGTGTGTGGTGTCAGGATGCACAAAAAGTG
CTCGTGGGCGCACTGATTGCTGTGTCAGACATGGTGGAGGCAAGCGATGCAAATTCGAGAACTGTGGAAAGAGTGCCCAAGGGAGCACAGACTTCTGCAAAGCCCATGGA
GGTGGAAAACGATGCACATGGGGAGAAGGCAAGTGCGAAAAATTTGCAAGGGGTAAGAGTGGTTTATGTGCTGCTCATAGTAGCATGA
mRNA sequenceShow/hide mRNA sequence
AGATTCCATATGAGCCGCCCACATCAAATTCCATTTTCTAAAAAACCAAATTTATTAAATTAATAATTATTTAATTACCACATCCTCTTCCACCACAATTTCTTTTCTTC
TCTTCCACTTGGACTTCGAGTTTTTTGTTCGCTCCTTTTTTCTGTTTTTCTTTCTCTCTTTTTTCCCCCATTTCTCTCTCATTCTCTTACAACGTTTCCTCATGCCTTAG
ATTGCTCCTCTACTTTCTACTCTTCTCCGATTCTCTTCTACATATTGCTTCATCACGTACCTACTATTCTGCTATTGTTGTGTCCAACAATGCGTTTGAATTGATGGAGA
GGCATCATTTCATTGACATTGGTCGTGTTATAAGATGTACCATTTTCAGGGTTGATGAAGCTGAATGAGTTCTCATCAAGGAGCATAAGGAGCTCCCCAGATTTCTGCAT
ATCAACATTACTCGTTCTCATCAAGGGGTTCTATCCAAGAAACGTCATTAGATACACTAGTAATTGATTAAAAAGAAATTGTTGCATTTTTGGTGGGAAGATCTTTGGTT
AGCATATAAAGCTGATGGATAGATCAGATACGTCGGGCTTTGTTAGCGGTGGAGGTAGTAGTAATCTGTTAGAATCTGTTTTGGGTGCTGATATGTACATCTGCTTTTAG
TATTCCTTACACCTTGTTGGCAAGATTATCCTATTAGTTTGAACTTCAGTGTTGAAAGAATGGATTTGAACAAGACTGTAGCACATTATTCTCAAAATGGCGATCTCACA
AAAGATGACAACTTTGGTGACACTACTTTGAGCTTGAATTGTTTTGGCTTTGGAGTAAGAAAATCTTCTGGATGTGAGGTTGCTTTAAATGACCTCAACTTCAACTTCAG
TTACGCCCCAGATGATGGATGTAGACTGGTACTTGGACTTGGTCCAACTCCAAGTGCTAACTGTGATGATTATTACAATGTTGGATATAATAAGACTAAGACACAAGTTG
CATCTTTACCAGAAGAAATATCACCGAGTGACTCAATATTGCAGCTTGGTCTTTCTGGAGGGACTAATGAAGTTTCAAGTGTGGTCGAATGTTCAGTTTCAGCAGAAACT
GATGTCAGTGCAACTTATCTGATAAACCAATGGACTGCTGAAGCTAATCAACTGTCTATCCCACTGGTTGATGAGGGTTCTACCTCAGCAAAAAAATCGGGTGGGTATAT
GCCATCACTTCTTTTTGCTCCAAGAATGGGCACTTCAAACCTTCTGATTCAACAGGAGGTTCTTGAAATTGATAGCAGAAATCAGCTGAGCCAAGAACTATCACCTACTG
TAGAATACTCTTTAGGAACTGTAATTGACCAGACAACCAAAAGCGTGTGTTCAGATCATCAAGCAAATAATCCTAAAAGGTGCAAATACTTTGGCTGCGAGAAGGGTGCA
CGAGGAGCATCTGGTCTTTGTATTGGTCATGGAGGTGGACATAGATGCCAGAAACCTGGGTGCAATAAGGGAGCCGAGAGTCGTACTGCTTACTGTAAAGCTCATGGTGG
AGGAAGGAGGTGCCAACATTTAGGGTGCACCAAAAGTGCTGAGGGGAAGACAGAATTTTGCATTGCTCATGGTGGTGGTAGACGTTGTGGATATTCAGGTGGATGTGCGA
AGGCTGCACGTGGAAAGTCGGGCCTTTGTATTAGACATGGTGGGGGAAAACGATGTAAGATGGAAGGCTGCACTCGTAGTGCTGAAGGACATGCTGGTCTATGCATTTCT
CATGGTGGTGGACGCCGTTGCCAGTATGAACGCTGCACAAAGGGTGCACAAGGAAGTACCATGTATTGTAAGGCCCATGGGGGAGGAAACGATGTATATTTGCCGGATGC
ACAAAAGGTGCTGAAGGGAGTACTCCTCTTTGCAAGGGACATGGTGGGGGGAAGCGTTGCCTCTTTGATGGGGGTGGGATTTGCCCAAAAAGTGTACATGGGGGCACAAA
CTTTTGTGTTGCTCATGGTGGTGGAAAGAGGTGTGTGGTGTCAGGATGCACAAAAAGTGCTCGTGGGCGCACTGATTGCTGTGTCAGACATGGTGGAGGCAAGCGATGCA
AATTCGAGAACTGTGGAAAGAGTGCCCAAGGGAGCACAGACTTCTGCAAAGCCCATGGAGGTGGAAAACGATGCACATGGGGAGAAGGCAAGTGCGAAAAATTTGCAAGG
GGTAAGAGTGGTTTATGTGCTGCTCATAGTAGCATGATCCAAGATCGAGAAACAAACAAGGGTAGCCTGATTGGACCAGGACTTTTCCATGGTCTGGTATCTGCTTCTGC
TGCGTCTACAGTTGGAGATAGCTTGGACCACTATAATACATCTTCTGCAATCAGTTTCATATGTGATTCAATTGATTCTGCAGAGAAGCCTACGAAGCGACATCAACTTA
TACCACCACAGGTATTGGTTCCATCCTCAATGAAATCATCAGCTTCATATTCTAGTTTCTTGAGTACAGAGAAGGGAGAGGAAGATGGGAACGGATATTGTATTGGCAAA
AAATTCCTTGAATATTCAATTCCTGAGGGAAGGGTTCATGGAGGTGGGCTCATGTCATTGCTTGGCGGGCATTTGAAGATGAAGAATATGAATAATGGCATTTAAGGATA
TCTGATCCGAAAGCCACTGGAAATGGAGAAATCCAGGTAATTTAGGAGTTATTTAATAACGCATTGGTATCATCATGTACATAAAACTTCAGTTGGGGATTTGTGAATCG
TGTACTGTGATAATAATGATTAGATTTGTAGTTTTACATTATGGAATTTGTCAGGGCTATCTGATTTAAAACATTATCAGTACTTTCGTGTCAATAGGCTCTTTTATTTG
TATATCGTTATAAAAGCCAAGCCTATGCTTTGGTGATTTGTAGTGGTGGTGTTACTTTCTGAAAAACTTTGTCCACAGCTACTCCAAGTTTTGAGTGTTTGTTGATGCTT
TCTGATCCGTACAGAATTGAGCTATAAATAGCATTTTATTTGGAAAGTAGGAAAAGCATGG
Protein sequenceShow/hide protein sequence
MDLNKTVAHYSQNGDLTKDDNFGDTTLSLNCFGFGVRKSSGCEVALNDLNFNFSYAPDDGCRLVLGLGPTPSANCDDYYNVGYNKTKTQVASLPEEISPSDSILQLGLSG
GTNEVSSVVECSVSAETDVSATYLINQWTAEANQLSIPLVDEGSTSAKKSGGYMPSLLFAPRMGTSNLLIQQEVLEIDSRNQLSQELSPTVEYSLGTVIDQTTKSVCSDH
QANNPKRCKYFGCEKGARGASGLCIGHGGGHRCQKPGCNKGAESRTAYCKAHGGGRRCQHLGCTKSAEGKTEFCIAHGGGRRCGYSGGCAKAARGKSGLCIRHGGGKRCK
MEGCTRSAEGHAGLCISHGGGRRCQYERCTKGAQGSTMYCKAHGGGNDVYLPDAQKVLKGVLLFARDMVGGSVASLMGVGFAQKVYMGAQTFVLLMVVERGVWCQDAQKV
LVGALIAVSDMVEASDANSRTVERVPKGAQTSAKPMEVENDAHGEKASAKNLQGVRVVYVLLIVA