; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi02G020720 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi02G020720
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionProtein of unknown function (DUF1644)
Genome locationchr02:26904546..26911262
RNA-Seq ExpressionLsi02G020720
SyntenyLsi02G020720
Gene Ontology termsNA
InterPro domainsIPR006045 - Cupin 1
IPR011051 - RmlC-like cupin domain superfamily
IPR012866 - Protein of unknown function DUF1644
IPR014710 - RmlC-like jelly roll fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004139654.1 uncharacterized protein LOC101208460 isoform X2 [Cucumis sativus]4.3e-21896.92Show/hide
Query:  VFKMAGVKRRIHNDSDILALHKELDEVSCPICMDHPHNAVLLLCSSHHKGCKPYICDTSHRHSNCFDQFKKLREETRKSPRLSSPLPINPYSFSSP-TNN
        +FKMAGVKRRIHNDSDILALHKELDEVSCPICMDHPHNAVLLLCSSHHKGCKPYICDTSHRHSNCFDQFKKLREETRKSPRLSSPLPINPYSFS+P TNN
Subjt:  VFKMAGVKRRIHNDSDILALHKELDEVSCPICMDHPHNAVLLLCSSHHKGCKPYICDTSHRHSNCFDQFKKLREETRKSPRLSSPLPINPYSFSSP-TNN

Query:  LGLSIDLNEVDDNQNINERNTVASGGLPGVALGDNGTENSNRTVDTNEAGDLDTAGSGSITERVEQEGLDAGNSSEYSNLKCPMCRGAVLGLEVVEEARE
        LGLSIDLNEVDDNQNINERNTVAS GLPG+ALGDNGTENSNRTVDTNEAGD+DTAGSGSITERV+QEGLDAGNSSEYSNLKCPMCRGAVLGLEV+EEARE
Subjt:  LGLSIDLNEVDDNQNINERNTVASGGLPGVALGDNGTENSNRTVDTNEAGDLDTAGSGSITERVEQEGLDAGNSSEYSNLKCPMCRGAVLGLEVVEEARE

Query:  YLNLKKRSCSRETCSFSGNYQELRRHARRVHPTSRPAVIDPSRERAWRRLERQREVGDVVSAIRSAMPGALVVGDYVIENGDGMVAGERDNGTGDVNGPL
        YLNLKKRSCSRETCSFSGNYQELRRHARRVHPTSRPAVIDPSRERAWRRLERQREVGDVVSAIRSAMPGALVVGDYVIENGDGMVAGERDNGTGDVNGPL
Subjt:  YLNLKKRSCSRETCSFSGNYQELRRHARRVHPTSRPAVIDPSRERAWRRLERQREVGDVVSAIRSAMPGALVVGDYVIENGDGMVAGERDNGTGDVNGPL

Query:  LTSFFLFHMFGSVDGAREPRPRSRSWVRHRRSGGGTPVPERRFLWGENLLGLQEDADEDFRIYIGMGDDASPPTRRRRVTRPGSDADQP
        LTSFFLFHMFGSV+GAREPRPRSRSWVRHRRSGGGTPV ERRFLWGENLLGLQED DEDFRIYIGMGDD SPPTRRRRVTRPGSDADQP
Subjt:  LTSFFLFHMFGSVDGAREPRPRSRSWVRHRRSGGGTPVPERRFLWGENLLGLQEDADEDFRIYIGMGDDASPPTRRRRVTRPGSDADQP

XP_004139655.1 uncharacterized protein LOC101208460 isoform X3 [Cucumis sativus]8.1e-21797.15Show/hide
Query:  MAGVKRRIHNDSDILALHKELDEVSCPICMDHPHNAVLLLCSSHHKGCKPYICDTSHRHSNCFDQFKKLREETRKSPRLSSPLPINPYSFSSP-TNNLGL
        MAGVKRRIHNDSDILALHKELDEVSCPICMDHPHNAVLLLCSSHHKGCKPYICDTSHRHSNCFDQFKKLREETRKSPRLSSPLPINPYSFS+P TNNLGL
Subjt:  MAGVKRRIHNDSDILALHKELDEVSCPICMDHPHNAVLLLCSSHHKGCKPYICDTSHRHSNCFDQFKKLREETRKSPRLSSPLPINPYSFSSP-TNNLGL

Query:  SIDLNEVDDNQNINERNTVASGGLPGVALGDNGTENSNRTVDTNEAGDLDTAGSGSITERVEQEGLDAGNSSEYSNLKCPMCRGAVLGLEVVEEAREYLN
        SIDLNEVDDNQNINERNTVAS GLPG+ALGDNGTENSNRTVDTNEAGD+DTAGSGSITERV+QEGLDAGNSSEYSNLKCPMCRGAVLGLEV+EEAREYLN
Subjt:  SIDLNEVDDNQNINERNTVASGGLPGVALGDNGTENSNRTVDTNEAGDLDTAGSGSITERVEQEGLDAGNSSEYSNLKCPMCRGAVLGLEVVEEAREYLN

Query:  LKKRSCSRETCSFSGNYQELRRHARRVHPTSRPAVIDPSRERAWRRLERQREVGDVVSAIRSAMPGALVVGDYVIENGDGMVAGERDNGTGDVNGPLLTS
        LKKRSCSRETCSFSGNYQELRRHARRVHPTSRPAVIDPSRERAWRRLERQREVGDVVSAIRSAMPGALVVGDYVIENGDGMVAGERDNGTGDVNGPLLTS
Subjt:  LKKRSCSRETCSFSGNYQELRRHARRVHPTSRPAVIDPSRERAWRRLERQREVGDVVSAIRSAMPGALVVGDYVIENGDGMVAGERDNGTGDVNGPLLTS

Query:  FFLFHMFGSVDGAREPRPRSRSWVRHRRSGGGTPVPERRFLWGENLLGLQEDADEDFRIYIGMGDDASPPTRRRRVTRPGSDADQP
        FFLFHMFGSV+GAREPRPRSRSWVRHRRSGGGTPV ERRFLWGENLLGLQED DEDFRIYIGMGDD SPPTRRRRVTRPGSDADQP
Subjt:  FFLFHMFGSVDGAREPRPRSRSWVRHRRSGGGTPVPERRFLWGENLLGLQEDADEDFRIYIGMGDDASPPTRRRRVTRPGSDADQP

XP_008447269.1 PREDICTED: uncharacterized protein LOC103489748 isoform X1 [Cucumis melo]1.9e-21896.44Show/hide
Query:  VTYLVFKMAGVKRRIHNDSDILALHKELDEVSCPICMDHPHNAVLLLCSSHHKGCKPYICDTSHRHSNCFDQFKKLREETRKSPRLSSPLPINPYSFSSP
        +TYLVFKMAGVKRRIHNDSDILALHKELDEVSCPICMDHPHNAVLLLCSSHHKGCKPYICDTSHRHSNCFDQFKKLREETRKSPRLSSPLPINPYSFS+P
Subjt:  VTYLVFKMAGVKRRIHNDSDILALHKELDEVSCPICMDHPHNAVLLLCSSHHKGCKPYICDTSHRHSNCFDQFKKLREETRKSPRLSSPLPINPYSFSSP

Query:  -TNNLGLSIDLNEVDDNQNINERNTVASGGLPGVALGDNGTENSNRTVDTNEAGDLDTAGSGSITERVEQEGLDAGNSSEYSNLKCPMCRGAVLGLEVVE
         TNNLGLSIDLNEVDDNQNINERNTVAS GLPGVALGDNGTENSNRTVDTNEAGD+DTAGSGSITERV+Q GLDAGNSSEY NLKCPMCRGAVLGLEV+E
Subjt:  -TNNLGLSIDLNEVDDNQNINERNTVASGGLPGVALGDNGTENSNRTVDTNEAGDLDTAGSGSITERVEQEGLDAGNSSEYSNLKCPMCRGAVLGLEVVE

Query:  EAREYLNLKKRSCSRETCSFSGNYQELRRHARRVHPTSRPAVIDPSRERAWRRLERQREVGDVVSAIRSAMPGALVVGDYVIENGDGMVAGERDNGTGDV
        EAREYLNLKKRSCSRETCSFSGNYQELRRHARRVHP SRPAVIDPSRERAWRRLERQREVGDVVSAIRSAMPGALVVGDYVIENGDG+VAGERDNGTGDV
Subjt:  EAREYLNLKKRSCSRETCSFSGNYQELRRHARRVHPTSRPAVIDPSRERAWRRLERQREVGDVVSAIRSAMPGALVVGDYVIENGDGMVAGERDNGTGDV

Query:  NGPLLTSFFLFHMFGSVDGAREPRPRSRSWVRHRRSGGGTPVPERRFLWGENLLGLQEDADEDFRIYIGMGDDASPPTRRRRVTRPGSDADQP
        NGPLLTSFFLFHMFGSV+GAREPRPRSRSWVRHRRSGGGTPV ERRFLWGENLLGLQEDADEDFRIYIGMGDD SPPTRRRRVTRPGSDADQP
Subjt:  NGPLLTSFFLFHMFGSVDGAREPRPRSRSWVRHRRSGGGTPVPERRFLWGENLLGLQEDADEDFRIYIGMGDDASPPTRRRRVTRPGSDADQP

XP_011659176.1 uncharacterized protein LOC101208460 isoform X1 [Cucumis sativus]2.7e-22096.95Show/hide
Query:  VTYLVFKMAGVKRRIHNDSDILALHKELDEVSCPICMDHPHNAVLLLCSSHHKGCKPYICDTSHRHSNCFDQFKKLREETRKSPRLSSPLPINPYSFSSP
        +TYLVFKMAGVKRRIHNDSDILALHKELDEVSCPICMDHPHNAVLLLCSSHHKGCKPYICDTSHRHSNCFDQFKKLREETRKSPRLSSPLPINPYSFS+P
Subjt:  VTYLVFKMAGVKRRIHNDSDILALHKELDEVSCPICMDHPHNAVLLLCSSHHKGCKPYICDTSHRHSNCFDQFKKLREETRKSPRLSSPLPINPYSFSSP

Query:  -TNNLGLSIDLNEVDDNQNINERNTVASGGLPGVALGDNGTENSNRTVDTNEAGDLDTAGSGSITERVEQEGLDAGNSSEYSNLKCPMCRGAVLGLEVVE
         TNNLGLSIDLNEVDDNQNINERNTVAS GLPG+ALGDNGTENSNRTVDTNEAGD+DTAGSGSITERV+QEGLDAGNSSEYSNLKCPMCRGAVLGLEV+E
Subjt:  -TNNLGLSIDLNEVDDNQNINERNTVASGGLPGVALGDNGTENSNRTVDTNEAGDLDTAGSGSITERVEQEGLDAGNSSEYSNLKCPMCRGAVLGLEVVE

Query:  EAREYLNLKKRSCSRETCSFSGNYQELRRHARRVHPTSRPAVIDPSRERAWRRLERQREVGDVVSAIRSAMPGALVVGDYVIENGDGMVAGERDNGTGDV
        EAREYLNLKKRSCSRETCSFSGNYQELRRHARRVHPTSRPAVIDPSRERAWRRLERQREVGDVVSAIRSAMPGALVVGDYVIENGDGMVAGERDNGTGDV
Subjt:  EAREYLNLKKRSCSRETCSFSGNYQELRRHARRVHPTSRPAVIDPSRERAWRRLERQREVGDVVSAIRSAMPGALVVGDYVIENGDGMVAGERDNGTGDV

Query:  NGPLLTSFFLFHMFGSVDGAREPRPRSRSWVRHRRSGGGTPVPERRFLWGENLLGLQEDADEDFRIYIGMGDDASPPTRRRRVTRPGSDADQP
        NGPLLTSFFLFHMFGSV+GAREPRPRSRSWVRHRRSGGGTPV ERRFLWGENLLGLQED DEDFRIYIGMGDD SPPTRRRRVTRPGSDADQP
Subjt:  NGPLLTSFFLFHMFGSVDGAREPRPRSRSWVRHRRSGGGTPVPERRFLWGENLLGLQEDADEDFRIYIGMGDDASPPTRRRRVTRPGSDADQP

XP_038900082.1 uncharacterized protein LOC120087237 isoform X1 [Benincasa hispida]5.1e-21996.44Show/hide
Query:  VTYLVFKMAGVKRRIHNDSDILALHKELDEVSCPICMDHPHNAVLLLCSSHHKGCKPYICDTSHRHSNCFDQFKKLREETRKSPRLSSPLPINPYSFSS-
        +TYLVFKMAGVKRRIHNDSDILALHKELDEVSCPICMDHPHNAVLLLCSSHHKGCKPYICDTSHRHSNCFDQFKKLREETRKSPRLSSPLPINPYSFSS 
Subjt:  VTYLVFKMAGVKRRIHNDSDILALHKELDEVSCPICMDHPHNAVLLLCSSHHKGCKPYICDTSHRHSNCFDQFKKLREETRKSPRLSSPLPINPYSFSS-

Query:  PTNNLGLSIDLNEVDDNQNINERNTVASGGLPGVALGDNGTENSNRTVDTNEAGDLDTAGSGSITERVEQEGLDAGNSSEYSNLKCPMCRGAVLGLEVVE
        PTNNLGLSIDLNEVDDNQNINERN VAS GLPGVALGDNGTENSNRTVDTNEAGDLDTAGSG ITERVEQEGLDAGNSSEYSNLKCPMCRGAVLGLEV+E
Subjt:  PTNNLGLSIDLNEVDDNQNINERNTVASGGLPGVALGDNGTENSNRTVDTNEAGDLDTAGSGSITERVEQEGLDAGNSSEYSNLKCPMCRGAVLGLEVVE

Query:  EAREYLNLKKRSCSRETCSFSGNYQELRRHARRVHPTSRPAVIDPSRERAWRRLERQREVGDVVSAIRSAMPGALVVGDYVIENGDGMVAGERDNGTGDV
        E REYLNLKKRSCSRETCSFSGNYQELRRHARRVHPT+RP+V DPSRERAWRRLERQREVGDVVSAIRSAMPGALVVGDYVIENGDGMVAGERDNGTGDV
Subjt:  EAREYLNLKKRSCSRETCSFSGNYQELRRHARRVHPTSRPAVIDPSRERAWRRLERQREVGDVVSAIRSAMPGALVVGDYVIENGDGMVAGERDNGTGDV

Query:  NGPLLTSFFLFHMFGSVDGAREPRPRSRSWVRHRRSGGGTPVPERRFLWGENLLGLQEDADEDFRIYIGMGDDASPPTRRRRVTRPGSDADQP
        NGPLLTSFFLFHMFGSVDGAREPRPRSRSWVRHRRSGGGTPVPERRFLWGENLLGLQEDA+EDFRIYIGM +D+SPPTRRRRVTRPGSDADQP
Subjt:  NGPLLTSFFLFHMFGSVDGAREPRPRSRSWVRHRRSGGGTPVPERRFLWGENLLGLQEDADEDFRIYIGMGDDASPPTRRRRVTRPGSDADQP

TrEMBL top hitse value%identityAlignment
A0A0A0K9V2 Uncharacterized protein3.9e-21797.15Show/hide
Query:  MAGVKRRIHNDSDILALHKELDEVSCPICMDHPHNAVLLLCSSHHKGCKPYICDTSHRHSNCFDQFKKLREETRKSPRLSSPLPINPYSFSSP-TNNLGL
        MAGVKRRIHNDSDILALHKELDEVSCPICMDHPHNAVLLLCSSHHKGCKPYICDTSHRHSNCFDQFKKLREETRKSPRLSSPLPINPYSFS+P TNNLGL
Subjt:  MAGVKRRIHNDSDILALHKELDEVSCPICMDHPHNAVLLLCSSHHKGCKPYICDTSHRHSNCFDQFKKLREETRKSPRLSSPLPINPYSFSSP-TNNLGL

Query:  SIDLNEVDDNQNINERNTVASGGLPGVALGDNGTENSNRTVDTNEAGDLDTAGSGSITERVEQEGLDAGNSSEYSNLKCPMCRGAVLGLEVVEEAREYLN
        SIDLNEVDDNQNINERNTVAS GLPG+ALGDNGTENSNRTVDTNEAGD+DTAGSGSITERV+QEGLDAGNSSEYSNLKCPMCRGAVLGLEV+EEAREYLN
Subjt:  SIDLNEVDDNQNINERNTVASGGLPGVALGDNGTENSNRTVDTNEAGDLDTAGSGSITERVEQEGLDAGNSSEYSNLKCPMCRGAVLGLEVVEEAREYLN

Query:  LKKRSCSRETCSFSGNYQELRRHARRVHPTSRPAVIDPSRERAWRRLERQREVGDVVSAIRSAMPGALVVGDYVIENGDGMVAGERDNGTGDVNGPLLTS
        LKKRSCSRETCSFSGNYQELRRHARRVHPTSRPAVIDPSRERAWRRLERQREVGDVVSAIRSAMPGALVVGDYVIENGDGMVAGERDNGTGDVNGPLLTS
Subjt:  LKKRSCSRETCSFSGNYQELRRHARRVHPTSRPAVIDPSRERAWRRLERQREVGDVVSAIRSAMPGALVVGDYVIENGDGMVAGERDNGTGDVNGPLLTS

Query:  FFLFHMFGSVDGAREPRPRSRSWVRHRRSGGGTPVPERRFLWGENLLGLQEDADEDFRIYIGMGDDASPPTRRRRVTRPGSDADQP
        FFLFHMFGSV+GAREPRPRSRSWVRHRRSGGGTPV ERRFLWGENLLGLQED DEDFRIYIGMGDD SPPTRRRRVTRPGSDADQP
Subjt:  FFLFHMFGSVDGAREPRPRSRSWVRHRRSGGGTPVPERRFLWGENLLGLQEDADEDFRIYIGMGDDASPPTRRRRVTRPGSDADQP

A0A1S3BGH8 uncharacterized protein LOC103489748 isoform X19.4e-21996.44Show/hide
Query:  VTYLVFKMAGVKRRIHNDSDILALHKELDEVSCPICMDHPHNAVLLLCSSHHKGCKPYICDTSHRHSNCFDQFKKLREETRKSPRLSSPLPINPYSFSSP
        +TYLVFKMAGVKRRIHNDSDILALHKELDEVSCPICMDHPHNAVLLLCSSHHKGCKPYICDTSHRHSNCFDQFKKLREETRKSPRLSSPLPINPYSFS+P
Subjt:  VTYLVFKMAGVKRRIHNDSDILALHKELDEVSCPICMDHPHNAVLLLCSSHHKGCKPYICDTSHRHSNCFDQFKKLREETRKSPRLSSPLPINPYSFSSP

Query:  -TNNLGLSIDLNEVDDNQNINERNTVASGGLPGVALGDNGTENSNRTVDTNEAGDLDTAGSGSITERVEQEGLDAGNSSEYSNLKCPMCRGAVLGLEVVE
         TNNLGLSIDLNEVDDNQNINERNTVAS GLPGVALGDNGTENSNRTVDTNEAGD+DTAGSGSITERV+Q GLDAGNSSEY NLKCPMCRGAVLGLEV+E
Subjt:  -TNNLGLSIDLNEVDDNQNINERNTVASGGLPGVALGDNGTENSNRTVDTNEAGDLDTAGSGSITERVEQEGLDAGNSSEYSNLKCPMCRGAVLGLEVVE

Query:  EAREYLNLKKRSCSRETCSFSGNYQELRRHARRVHPTSRPAVIDPSRERAWRRLERQREVGDVVSAIRSAMPGALVVGDYVIENGDGMVAGERDNGTGDV
        EAREYLNLKKRSCSRETCSFSGNYQELRRHARRVHP SRPAVIDPSRERAWRRLERQREVGDVVSAIRSAMPGALVVGDYVIENGDG+VAGERDNGTGDV
Subjt:  EAREYLNLKKRSCSRETCSFSGNYQELRRHARRVHPTSRPAVIDPSRERAWRRLERQREVGDVVSAIRSAMPGALVVGDYVIENGDGMVAGERDNGTGDV

Query:  NGPLLTSFFLFHMFGSVDGAREPRPRSRSWVRHRRSGGGTPVPERRFLWGENLLGLQEDADEDFRIYIGMGDDASPPTRRRRVTRPGSDADQP
        NGPLLTSFFLFHMFGSV+GAREPRPRSRSWVRHRRSGGGTPV ERRFLWGENLLGLQEDADEDFRIYIGMGDD SPPTRRRRVTRPGSDADQP
Subjt:  NGPLLTSFFLFHMFGSVDGAREPRPRSRSWVRHRRSGGGTPVPERRFLWGENLLGLQEDADEDFRIYIGMGDDASPPTRRRRVTRPGSDADQP

A0A1S3BH14 uncharacterized protein LOC103489748 isoform X21.5e-21696.4Show/hide
Query:  VFKMAGVKRRIHNDSDILALHKELDEVSCPICMDHPHNAVLLLCSSHHKGCKPYICDTSHRHSNCFDQFKKLREETRKSPRLSSPLPINPYSFSSP-TNN
        +FKMAGVKRRIHNDSDILALHKELDEVSCPICMDHPHNAVLLLCSSHHKGCKPYICDTSHRHSNCFDQFKKLREETRKSPRLSSPLPINPYSFS+P TNN
Subjt:  VFKMAGVKRRIHNDSDILALHKELDEVSCPICMDHPHNAVLLLCSSHHKGCKPYICDTSHRHSNCFDQFKKLREETRKSPRLSSPLPINPYSFSSP-TNN

Query:  LGLSIDLNEVDDNQNINERNTVASGGLPGVALGDNGTENSNRTVDTNEAGDLDTAGSGSITERVEQEGLDAGNSSEYSNLKCPMCRGAVLGLEVVEEARE
        LGLSIDLNEVDDNQNINERNTVAS GLPGVALGDNGTENSNRTVDTNEAGD+DTAGSGSITERV+Q GLDAGNSSEY NLKCPMCRGAVLGLEV+EEARE
Subjt:  LGLSIDLNEVDDNQNINERNTVASGGLPGVALGDNGTENSNRTVDTNEAGDLDTAGSGSITERVEQEGLDAGNSSEYSNLKCPMCRGAVLGLEVVEEARE

Query:  YLNLKKRSCSRETCSFSGNYQELRRHARRVHPTSRPAVIDPSRERAWRRLERQREVGDVVSAIRSAMPGALVVGDYVIENGDGMVAGERDNGTGDVNGPL
        YLNLKKRSCSRETCSFSGNYQELRRHARRVHP SRPAVIDPSRERAWRRLERQREVGDVVSAIRSAMPGALVVGDYVIENGDG+VAGERDNGTGDVNGPL
Subjt:  YLNLKKRSCSRETCSFSGNYQELRRHARRVHPTSRPAVIDPSRERAWRRLERQREVGDVVSAIRSAMPGALVVGDYVIENGDGMVAGERDNGTGDVNGPL

Query:  LTSFFLFHMFGSVDGAREPRPRSRSWVRHRRSGGGTPVPERRFLWGENLLGLQEDADEDFRIYIGMGDDASPPTRRRRVTRPGSDADQP
        LTSFFLFHMFGSV+GAREPRPRSRSWVRHRRSGGGTPV ERRFLWGENLLGLQEDADEDFRIYIGMGDD SPPTRRRRVTRPGSDADQP
Subjt:  LTSFFLFHMFGSVDGAREPRPRSRSWVRHRRSGGGTPVPERRFLWGENLLGLQEDADEDFRIYIGMGDDASPPTRRRRVTRPGSDADQP

A0A5A7TG05 DUF1644 domain-containing protein4.3e-21696.89Show/hide
Query:  MAGVKRRIHNDSDILALHKELDEVSCPICMDHPHNAVLLLCSSHHKGCKPYICDTSHRHSNCFDQFKKLREETRKSPRLSSPLPINPYSFSSP-TNNLGL
        MAGVKRRIHNDSDILALHKELDEVSCPICMDHPHNAVLLLCSSHHKGCKPYICDTSHRHSNCFDQFKKLREETRKSPRLSSPLPINPYSFS+P TNNLGL
Subjt:  MAGVKRRIHNDSDILALHKELDEVSCPICMDHPHNAVLLLCSSHHKGCKPYICDTSHRHSNCFDQFKKLREETRKSPRLSSPLPINPYSFSSP-TNNLGL

Query:  SIDLNEVDDNQNINERNTVASGGLPGVALGDNGTENSNRTVDTNEAGDLDTAGSGSITERVEQEGLDAGNSSEYSNLKCPMCRGAVLGLEVVEEAREYLN
        SIDLNEVDDNQNINERNTV S GLPGVALGDNGTENSNRTVDTNEAGD+DTAGSGSITERV+QEGLDAGNSSEY NLKCPMCRGAVLGLEV+EEAREYLN
Subjt:  SIDLNEVDDNQNINERNTVASGGLPGVALGDNGTENSNRTVDTNEAGDLDTAGSGSITERVEQEGLDAGNSSEYSNLKCPMCRGAVLGLEVVEEAREYLN

Query:  LKKRSCSRETCSFSGNYQELRRHARRVHPTSRPAVIDPSRERAWRRLERQREVGDVVSAIRSAMPGALVVGDYVIENGDGMVAGERDNGTGDVNGPLLTS
        LKKRSCSRETCSFSGNYQELRRHARRVHPTSRPAVIDPSRERAWRRLERQREVGDVVSAIRSAMPGALVVGDYVIENGDG+VAGERDNGTGDVNGPLLTS
Subjt:  LKKRSCSRETCSFSGNYQELRRHARRVHPTSRPAVIDPSRERAWRRLERQREVGDVVSAIRSAMPGALVVGDYVIENGDGMVAGERDNGTGDVNGPLLTS

Query:  FFLFHMFGSVDGAREPRPRSRSWVRHRRSGGGTPVPERRFLWGENLLGLQEDADEDFRIYIGMGDDASPPTRRRRVTRPGSDADQP
        FFLFHMFGSV+GAREPRPRSRSWVRHRRSGGGTPV ERRFLWGENLLGLQEDADEDFRIYIGMGDD SPPTRRRRVTRPGSDADQP
Subjt:  FFLFHMFGSVDGAREPRPRSRSWVRHRRSGGGTPVPERRFLWGENLLGLQEDADEDFRIYIGMGDDASPPTRRRRVTRPGSDADQP

A0A5D3DSI0 DUF1644 domain-containing protein1.1e-21697.15Show/hide
Query:  MAGVKRRIHNDSDILALHKELDEVSCPICMDHPHNAVLLLCSSHHKGCKPYICDTSHRHSNCFDQFKKLREETRKSPRLSSPLPINPYSFSSP-TNNLGL
        MAGVKRRIHNDSDILALHKELDEVSCPICMDHPHNAVLLLCSSHHKGCKPYICDTSHRHSNCFDQFKKLREETRKSPRLSSPLPINPYSFS+P TNNLGL
Subjt:  MAGVKRRIHNDSDILALHKELDEVSCPICMDHPHNAVLLLCSSHHKGCKPYICDTSHRHSNCFDQFKKLREETRKSPRLSSPLPINPYSFSSP-TNNLGL

Query:  SIDLNEVDDNQNINERNTVASGGLPGVALGDNGTENSNRTVDTNEAGDLDTAGSGSITERVEQEGLDAGNSSEYSNLKCPMCRGAVLGLEVVEEAREYLN
        SIDLNEVDDNQNINERNTVAS GLPGVALGDNGTENSNRTVDTNEAGD+DTAGSGSITERV+QEGLDAGNSSEY NLKCPMCRGAVLGLEV+EEAREYLN
Subjt:  SIDLNEVDDNQNINERNTVASGGLPGVALGDNGTENSNRTVDTNEAGDLDTAGSGSITERVEQEGLDAGNSSEYSNLKCPMCRGAVLGLEVVEEAREYLN

Query:  LKKRSCSRETCSFSGNYQELRRHARRVHPTSRPAVIDPSRERAWRRLERQREVGDVVSAIRSAMPGALVVGDYVIENGDGMVAGERDNGTGDVNGPLLTS
        LKKRSCSRETCSFSGNYQELRRHARRVHPTSRPAVIDPSRERAWRRLERQREVGDVVSAIRSAMPGALVVGDYVIENGDG+VAGERDNGTGDVNGPLLTS
Subjt:  LKKRSCSRETCSFSGNYQELRRHARRVHPTSRPAVIDPSRERAWRRLERQREVGDVVSAIRSAMPGALVVGDYVIENGDGMVAGERDNGTGDVNGPLLTS

Query:  FFLFHMFGSVDGAREPRPRSRSWVRHRRSGGGTPVPERRFLWGENLLGLQEDADEDFRIYIGMGDDASPPTRRRRVTRPGSDADQP
        FFLFHMFGSV+GAREPRPRSRSWVRHRRSGGGTPV ERRFLWGENLLGLQEDADEDFRIYIGMGDD SPPTRRRRVTRPGSDADQP
Subjt:  FFLFHMFGSVDGAREPRPRSRSWVRHRRSGGGTPVPERRFLWGENLLGLQEDADEDFRIYIGMGDDASPPTRRRRVTRPGSDADQP

SwissProt top hitse value%identityAlignment
B5KVH4 11S globulin seed storage protein 11.8e-0925.74Show/hide
Query:  LKAMNPKQYFKGAGGSYHKWLPSDYPVLAQSKVGAGALLLHPRGFTIPHYADASKVGYVLQGNNGVTGFIFP----------------------NTSNEE
        L A+ P    +   G    W P ++  L  + V      + P G  +PHY++A ++ Y+ +G  G+TG +FP                         +++
Subjt:  LKAMNPKQYFKGAGGSYHKWLPSDYPVLAQSKVGAGALLLHPRGFTIPHYADASKVGYVLQGNNGVTGFIFP----------------------NTSNEE

Query:  VINLKKGDLIPVPAGIASWWFNNGDSDLEIVFLGET
        + + ++GD+I  PAG+A W +N+G S +  +FL +T
Subjt:  VINLKKGDLIPVPAGIASWWFNNGDSDLEIVFLGET

P04405 Glycinin G25.8e-0825.68Show/hide
Query:  QNLKAMNPKQYFKGAGGSYHKWLPSDYPVLAQSKVGAGALLLHPRGFTIPHYADASKVGYVLQGNNGVTGFIF---------------------PNTSNE
        Q L A+ P    +  GG    W P++ P    + V      L+      P Y +  +  Y+ QG NG+ G IF                     P   ++
Subjt:  QNLKAMNPKQYFKGAGGSYHKWLPSDYPVLAQSKVGAGALLLHPRGFTIPHYADASKVGYVLQGNNGVTGFIF---------------------PNTSNE

Query:  EVINLKKGDLIPVPAGIASWWFNNGDSDLEIVFLGETKTAHVSESEFP
        +V   ++GDLI VP G+A W +NN D+ +  V + +T +      + P
Subjt:  EVINLKKGDLIPVPAGIASWWFNNGDSDLEIVFLGETKTAHVSESEFP

P05190 Legumin type B1.5e-0826.28Show/hide
Query:  NLKAMNPKQYFKGAGGSYHKWLPSDYPVLAQSKVGAGALLLHPRGFTIPHYADASKVGYVLQGNNGVTGFIFPN----------------------TSNE
        N+ A+ P    +   G    W P ++P L  + V      + P G  +P Y+ + ++ Y++QG  GV G   P                        S++
Subjt:  NLKAMNPKQYFKGAGGSYHKWLPSDYPVLAQSKVGAGALLLHPRGFTIPHYADASKVGYVLQGNNGVTGFIFPN----------------------TSNE

Query:  EVINLKKGDLIPVPAGIASWWFNNGDSDLEIVFLGET
        ++   +KGD+I +P+GI  W +NNGD  L  + L +T
Subjt:  EVINLKKGDLIPVPAGIASWWFNNGDSDLEIVFLGET

P09800 Legumin B4.0e-0930.37Show/hide
Query:  QNLKAMNPKQYFKGAGGSYHKWLPSDYPVLAQSKVGAGALLLHP---RGFTIPHYADASKVGYVLQGNNGVTGFIFP-------------------NTSN
        QNL A+ PK  F+   G    W  ++     Q +    A L H    +G  +P +  A  + YV QG  G+ G +FP                      +
Subjt:  QNLKAMNPKQYFKGAGGSYHKWLPSDYPVLAQSKVGAGALLLHP---RGFTIPHYADASKVGYVLQGNNGVTGFIFP-------------------NTSN

Query:  EEVINLKKGDLIPVPAGIASWWFNNGDSDLEIVFL
        +++  LK+GD++ +PAG+A W FNNG S L +V L
Subjt:  EEVINLKKGDLIPVPAGIASWWFNNGDSDLEIVFL

P11828 Glycinin G32.6e-0826.21Show/hide
Query:  QNLKAMNPKQYFKGAGGSYHKWLPSDYPVLAQSKVGAGALLLHPRGFTIPHYADASKVGYVLQGNNGVTGFIF------------------PNTSNEEVI
        Q L A+ P    +  GG    W P++ P    + V      L+      P Y +A +  Y+ QG +G+ G IF                  P   ++++ 
Subjt:  QNLKAMNPKQYFKGAGGSYHKWLPSDYPVLAQSKVGAGALLLHPRGFTIPHYADASKVGYVLQGNNGVTGFIF------------------PNTSNEEVI

Query:  NLKKGDLIPVPAGIASWWFNNGDSDLEIVFLGETKTAHVSESEFP
        + ++GDLI VP G A W +NN D+ +  V L +T +      + P
Subjt:  NLKKGDLIPVPAGIASWWFNNGDSDLEIVFLGETKTAHVSESEFP

Arabidopsis top hitse value%identityAlignment
AT1G68140.1 Protein of unknown function (DUF1644)9.7e-4335.25Show/hide
Query:  KELDEVSCPICMDHPHNAVLLLCSSHHKGCKPYICDTSHRHSNCFDQFKKLREETRKSPRLSSPLPINPYSFSSPTNNLGLSIDLNEVDDNQNINERNTV
        ++ + V C +CM+ PHNAVLLLCSSH KGC+PY+C TS R+SNC DQ+KK                                                  
Subjt:  KELDEVSCPICMDHPHNAVLLLCSSHHKGCKPYICDTSHRHSNCFDQFKKLREETRKSPRLSSPLPINPYSFSSPTNNLGLSIDLNEVDDNQNINERNTV

Query:  ASGGLPGVALGDNGTENSNRTVDTNEAGDLDTAGSGSITERVEQEGLDAGNSSEYSNLKCPMCRGAVLGLEVVEEAREYLNLKKRSCSRETCSFSGNYQE
                                  +  L T+G   I            N SE  NL CP+CRG V G  +V+ AR++LNLKKR C +E C ++G ++E
Subjt:  ASGGLPGVALGDNGTENSNRTVDTNEAGDLDTAGSGSITERVEQEGLDAGNSSEYSNLKCPMCRGAVLGLEVVEEAREYLNLKKRSCSRETCSFSGNYQE

Query:  LRRHARRVHPTSRPAVIDPSRERAWRRLERQREVGDVVSAIRSAMPGALVVGDYVIE----NGDGMVAGERDNGTGDVNG-PLLTSFFLFHMFGS
        LR+H +  HP+++P  +DP  E+ WRRLE + +  DV+S IRS MPG +V GDYVIE    NG     G  D+G     G  L+  F L H FG+
Subjt:  LRRHARRVHPTSRPAVIDPSRERAWRRLERQREVGDVVSAIRSAMPGALVVGDYVIE----NGDGMVAGERDNGTGDVNG-PLLTSFFLFHMFGS

AT1G68140.3 Protein of unknown function (DUF1644)9.7e-4335.25Show/hide
Query:  KELDEVSCPICMDHPHNAVLLLCSSHHKGCKPYICDTSHRHSNCFDQFKKLREETRKSPRLSSPLPINPYSFSSPTNNLGLSIDLNEVDDNQNINERNTV
        ++ + V C +CM+ PHNAVLLLCSSH KGC+PY+C TS R+SNC DQ+KK                                                  
Subjt:  KELDEVSCPICMDHPHNAVLLLCSSHHKGCKPYICDTSHRHSNCFDQFKKLREETRKSPRLSSPLPINPYSFSSPTNNLGLSIDLNEVDDNQNINERNTV

Query:  ASGGLPGVALGDNGTENSNRTVDTNEAGDLDTAGSGSITERVEQEGLDAGNSSEYSNLKCPMCRGAVLGLEVVEEAREYLNLKKRSCSRETCSFSGNYQE
                                  +  L T+G   I            N SE  NL CP+CRG V G  +V+ AR++LNLKKR C +E C ++G ++E
Subjt:  ASGGLPGVALGDNGTENSNRTVDTNEAGDLDTAGSGSITERVEQEGLDAGNSSEYSNLKCPMCRGAVLGLEVVEEAREYLNLKKRSCSRETCSFSGNYQE

Query:  LRRHARRVHPTSRPAVIDPSRERAWRRLERQREVGDVVSAIRSAMPGALVVGDYVIE----NGDGMVAGERDNGTGDVNG-PLLTSFFLFHMFGS
        LR+H +  HP+++P  +DP  E+ WRRLE + +  DV+S IRS MPG +V GDYVIE    NG     G  D+G     G  L+  F L H FG+
Subjt:  LRRHARRVHPTSRPAVIDPSRERAWRRLERQREVGDVVSAIRSAMPGALVVGDYVIE----NGDGMVAGERDNGTGDVNG-PLLTSFFLFHMFGS

AT3G24740.1 Protein of unknown function (DUF1644)5.3e-9751.38Show/hide
Query:  MAGVKRRIHNDSDILALHKELDEVSCPICMDHPHNAVLLLCSSHHKGCKPYICDTSHRHSNCFDQFKKLREETRKSPRLSSPLPINPYSFSSPTNNLGLS
        MAGVKR++  +SD+ ALHKELDEVSCP+CMDHPHNAVLLLCSSH KGC+ YICDTS+RHSNC D+FKKL  E+   P              +P  NL   
Subjt:  MAGVKRRIHNDSDILALHKELDEVSCPICMDHPHNAVLLLCSSHHKGCKPYICDTSHRHSNCFDQFKKLREETRKSPRLSSPLPINPYSFSSPTNNLGLS

Query:  IDLNEVDDNQNINERNTVASGGLPGVALGDNGTENSNRTVDTNEAGDLDTAGSGSITERVEQEGLDAGNSSEYSNLKCPMCRGAVLGLEVVEEAREYLNL
           +   +N+++ E  T +              E+ NR       G    + S     RVE+E      S + +NLKCP+CRG VLG +VVEE R YL+ 
Subjt:  IDLNEVDDNQNINERNTVASGGLPGVALGDNGTENSNRTVDTNEAGDLDTAGSGSITERVEQEGLDAGNSSEYSNLKCPMCRGAVLGLEVVEEAREYLNL

Query:  KKRSCSRETCSFSGNYQELRRHARRVHPTSRPAVIDPSRERAWRRLERQREVGDVVSAIRSAMPGALVVGDYVIENGDGMVAGERDNGTGDVNGPLLTSF
        K RSCSRE+CSF+GNYQ+LRRHARR HPT+RP+  DPSRERAWRRLE QRE GD+VSAIRSAMPGA+VVGDYVIENGD   AGER+ G G     L T+ 
Subjt:  KKRSCSRETCSFSGNYQELRRHARRVHPTSRPAVIDPSRERAWRRLERQREVGDVVSAIRSAMPGALVVGDYVIENGDGMVAGERDNGTGDVNGPLLTSF

Query:  FLFHMFGSVD---------GAREPRPRSRSWVRHRRSGGGTPVPERRFLWGENLLGLQEDA----DEDFRIYIGMGDDASP-PTRRRRVTRPGSDADQP
         LF M GS+D         G      RSR+W  HRRS       +R +LWGENLLGLQ++     DE+FR+    G  ++P P RRRR  RP S  + P
Subjt:  FLFHMFGSVD---------GAREPRPRSRSWVRHRRSGGGTPVPERRFLWGENLLGLQEDA----DEDFRIYIGMGDDASP-PTRRRRVTRPGSDADQP

AT3G24740.2 Protein of unknown function (DUF1644)5.3e-9751.38Show/hide
Query:  MAGVKRRIHNDSDILALHKELDEVSCPICMDHPHNAVLLLCSSHHKGCKPYICDTSHRHSNCFDQFKKLREETRKSPRLSSPLPINPYSFSSPTNNLGLS
        MAGVKR++  +SD+ ALHKELDEVSCP+CMDHPHNAVLLLCSSH KGC+ YICDTS+RHSNC D+FKKL  E+   P              +P  NL   
Subjt:  MAGVKRRIHNDSDILALHKELDEVSCPICMDHPHNAVLLLCSSHHKGCKPYICDTSHRHSNCFDQFKKLREETRKSPRLSSPLPINPYSFSSPTNNLGLS

Query:  IDLNEVDDNQNINERNTVASGGLPGVALGDNGTENSNRTVDTNEAGDLDTAGSGSITERVEQEGLDAGNSSEYSNLKCPMCRGAVLGLEVVEEAREYLNL
           +   +N+++ E  T +              E+ NR       G    + S     RVE+E      S + +NLKCP+CRG VLG +VVEE R YL+ 
Subjt:  IDLNEVDDNQNINERNTVASGGLPGVALGDNGTENSNRTVDTNEAGDLDTAGSGSITERVEQEGLDAGNSSEYSNLKCPMCRGAVLGLEVVEEAREYLNL

Query:  KKRSCSRETCSFSGNYQELRRHARRVHPTSRPAVIDPSRERAWRRLERQREVGDVVSAIRSAMPGALVVGDYVIENGDGMVAGERDNGTGDVNGPLLTSF
        K RSCSRE+CSF+GNYQ+LRRHARR HPT+RP+  DPSRERAWRRLE QRE GD+VSAIRSAMPGA+VVGDYVIENGD   AGER+ G G     L T+ 
Subjt:  KKRSCSRETCSFSGNYQELRRHARRVHPTSRPAVIDPSRERAWRRLERQREVGDVVSAIRSAMPGALVVGDYVIENGDGMVAGERDNGTGDVNGPLLTSF

Query:  FLFHMFGSVD---------GAREPRPRSRSWVRHRRSGGGTPVPERRFLWGENLLGLQEDA----DEDFRIYIGMGDDASP-PTRRRRVTRPGSDADQP
         LF M GS+D         G      RSR+W  HRRS       +R +LWGENLLGLQ++     DE+FR+    G  ++P P RRRR  RP S  + P
Subjt:  FLFHMFGSVD---------GAREPRPRSRSWVRHRRSGGGTPVPERRFLWGENLLGLQEDA----DEDFRIYIGMGDDASP-PTRRRRVTRPGSDADQP

AT3G25910.1 Protein of unknown function (DUF1644)6.3e-5035.77Show/hide
Query:  KELDEVSCPICMDHPHNAVLLLCSSHHKGCKPYICDTSHRHSNCFDQFKKLREETRKSPRLSSPLPINPYSFSSPTNNLGLSIDLNEVDDNQNINERNTV
        KE +E  CP+CM+HPHN +LL+CSS+  GC+PY+CDTSHRHSNCFDQF+K  +E    P LS  L       + PT       ++ +VD +         
Subjt:  KELDEVSCPICMDHPHNAVLLLCSSHHKGCKPYICDTSHRHSNCFDQFKKLREETRKSPRLSSPLPINPYSFSSPTNNLGLSIDLNEVDDNQNINERNTV

Query:  ASGGLPGVALGDNGTENSNRTVDTNEAGDLDTAGSGSITERVEQEGLDAGNSSEYSNLKCPMCRGAVLGLEVVEEAREYLNLKKRSCSRETCSFSGNYQE
        A+  +  V L D G        +  E   ++    G +T   +QE       ++   L CP+CRG +    VV+ AR ++N K RSCS ETC FSG+Y +
Subjt:  ASGGLPGVALGDNGTENSNRTVDTNEAGDLDTAGSGSITERVEQEGLDAGNSSEYSNLKCPMCRGAVLGLEVVEEAREYLNLKKRSCSRETCSFSGNYQE

Query:  LRRHARRVHPTSRPAVIDPSRERAWRRLERQREVGDVVSAIRSAMPGALVVGDYVIENGDGMVAGERDNGTGDVNGPLLTSFFLFHMFGSVDGAREPRPR
        LR+HAR +HP  RP+  DP R+R+WRRLERQ ++GD++S ++S+       G   I N DG +  +           LLT +FL  +F           R
Subjt:  LRRHARRVHPTSRPAVIDPSRERAWRRLERQREVGDVVSAIRSAMPGALVVGDYVIENGDGMVAGERDNGTGDVNGPLLTSFFLFHMFGSVDGAREPRPR

Query:  SRSWVRHRRSGGGTPVPER----RFLWGENLLGLQEDADEDFRIYIGMGDDASPPTRRRRVTRPGSDAD
        S SW    R+   T    R      LWGE+  G    +  D        +  S   RRR   R   D D
Subjt:  SRSWVRHRRSGGGTPVPER----RFLWGENLLGLQEDADEDFRIYIGMGDDASPPTRRRRVTRPGSDAD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAACAAAACTTGAAGGCAATGAATCCAAAACAGTATTTCAAGGGAGCTGGAGGATCATATCACAAATGGCTTCCTTCCGACTATCCCGTCCTTGCTCAGTCCAAAGT
TGGTGCCGGTGCGCTCCTCCTCCACCCTCGAGGCTTCACCATTCCTCACTACGCGGATGCCTCGAAAGTCGGCTACGTTCTTCAAGGTAACAATGGAGTTACCGGATTCA
TCTTTCCAAACACCTCCAATGAAGAAGTGATCAACCTAAAAAAAGGAGACCTAATTCCGGTCCCTGCCGGAATTGCCTCGTGGTGGTTCAACAACGGAGATTCCGATTTG
GAAATTGTCTTTTTGGGCGAAACGAAAACCGCTCATGTCTCTGAATCTGAATTTCCATTTATTGGACAATGTGGCTTGGTTGTTGTGGTTGAAAGCCTTCCTCCTAATGT
CGTTCGGTCGCCGGTGTACGTTGTGGGGCCGTCCGATCAACTGATCTATGTTGCTCGAGGGTTGGGGATGATCCAAATCGTCGGATTGTCGTTGAGCAAAATAGATGTTC
ATGTGGAAATTGGTCAGTTGGTTTTTGTTCCCAAGTACTTTGCTGTTGTGACGTACCTAGTGTTCAAGATGGCTGGTGTGAAACGAAGAATTCATAATGATTCAGATATC
CTTGCTTTGCATAAAGAATTGGATGAAGTCTCCTGCCCTATCTGCATGGACCACCCACATAATGCTGTTCTTCTACTTTGCAGCTCTCACCACAAGGGTTGTAAACCTTA
TATATGTGACACAAGCCATAGGCATTCAAATTGTTTTGATCAATTCAAAAAATTAAGAGAAGAAACTAGGAAAAGTCCACGCTTATCAAGTCCTTTACCAATAAATCCAT
ATAGTTTTAGTAGTCCTACAAACAACTTGGGTTTGAGCATTGATTTGAATGAAGTTGATGATAATCAAAATATAAATGAAAGGAACACTGTCGCATCTGGTGGATTACCT
GGTGTAGCTTTAGGGGATAATGGAACTGAAAATTCTAATAGAACTGTAGACACAAATGAGGCTGGAGATTTGGACACTGCTGGTTCTGGGTCCATAACTGAAAGGGTTGA
GCAAGAAGGTCTGGATGCTGGGAACTCATCTGAGTATTCAAACTTGAAGTGCCCCATGTGCCGAGGAGCTGTGCTAGGCTTGGAAGTTGTAGAAGAAGCAAGAGAATATC
TTAATCTCAAGAAACGAAGTTGCTCCCGTGAAACTTGTTCATTCTCTGGCAACTACCAAGAACTACGCAGGCATGCTAGAAGAGTTCACCCGACGTCAAGGCCTGCTGTC
ATAGACCCATCTAGAGAACGAGCATGGCGACGCTTGGAGAGACAAAGAGAAGTTGGTGATGTTGTTAGTGCCATTCGCTCAGCCATGCCCGGTGCCCTTGTGGTTGGAGA
CTACGTCATTGAAAATGGAGATGGTATGGTGGCAGGTGAGAGAGACAATGGCACAGGTGATGTCAATGGGCCATTGCTGACTAGTTTCTTTCTGTTTCATATGTTTGGGT
CGGTTGATGGTGCTCGAGAGCCAAGACCACGTTCAAGGTCTTGGGTGAGGCATCGACGCTCTGGAGGAGGAACACCTGTACCGGAGCGCCGGTTCCTTTGGGGTGAAAAC
CTTTTGGGATTACAAGAAGATGCAGACGAAGATTTCCGCATATACATTGGTATGGGCGATGATGCATCACCACCTACGAGAAGGAGACGTGTAACCAGGCCTGGATCTGA
TGCAGATCAACCATGA
mRNA sequenceShow/hide mRNA sequence
ATGGAACAAAACTTGAAGGCAATGAATCCAAAACAGTATTTCAAGGGAGCTGGAGGATCATATCACAAATGGCTTCCTTCCGACTATCCCGTCCTTGCTCAGTCCAAAGT
TGGTGCCGGTGCGCTCCTCCTCCACCCTCGAGGCTTCACCATTCCTCACTACGCGGATGCCTCGAAAGTCGGCTACGTTCTTCAAGGTAACAATGGAGTTACCGGATTCA
TCTTTCCAAACACCTCCAATGAAGAAGTGATCAACCTAAAAAAAGGAGACCTAATTCCGGTCCCTGCCGGAATTGCCTCGTGGTGGTTCAACAACGGAGATTCCGATTTG
GAAATTGTCTTTTTGGGCGAAACGAAAACCGCTCATGTCTCTGAATCTGAATTTCCATTTATTGGACAATGTGGCTTGGTTGTTGTGGTTGAAAGCCTTCCTCCTAATGT
CGTTCGGTCGCCGGTGTACGTTGTGGGGCCGTCCGATCAACTGATCTATGTTGCTCGAGGGTTGGGGATGATCCAAATCGTCGGATTGTCGTTGAGCAAAATAGATGTTC
ATGTGGAAATTGGTCAGTTGGTTTTTGTTCCCAAGTACTTTGCTGTTGTGACGTACCTAGTGTTCAAGATGGCTGGTGTGAAACGAAGAATTCATAATGATTCAGATATC
CTTGCTTTGCATAAAGAATTGGATGAAGTCTCCTGCCCTATCTGCATGGACCACCCACATAATGCTGTTCTTCTACTTTGCAGCTCTCACCACAAGGGTTGTAAACCTTA
TATATGTGACACAAGCCATAGGCATTCAAATTGTTTTGATCAATTCAAAAAATTAAGAGAAGAAACTAGGAAAAGTCCACGCTTATCAAGTCCTTTACCAATAAATCCAT
ATAGTTTTAGTAGTCCTACAAACAACTTGGGTTTGAGCATTGATTTGAATGAAGTTGATGATAATCAAAATATAAATGAAAGGAACACTGTCGCATCTGGTGGATTACCT
GGTGTAGCTTTAGGGGATAATGGAACTGAAAATTCTAATAGAACTGTAGACACAAATGAGGCTGGAGATTTGGACACTGCTGGTTCTGGGTCCATAACTGAAAGGGTTGA
GCAAGAAGGTCTGGATGCTGGGAACTCATCTGAGTATTCAAACTTGAAGTGCCCCATGTGCCGAGGAGCTGTGCTAGGCTTGGAAGTTGTAGAAGAAGCAAGAGAATATC
TTAATCTCAAGAAACGAAGTTGCTCCCGTGAAACTTGTTCATTCTCTGGCAACTACCAAGAACTACGCAGGCATGCTAGAAGAGTTCACCCGACGTCAAGGCCTGCTGTC
ATAGACCCATCTAGAGAACGAGCATGGCGACGCTTGGAGAGACAAAGAGAAGTTGGTGATGTTGTTAGTGCCATTCGCTCAGCCATGCCCGGTGCCCTTGTGGTTGGAGA
CTACGTCATTGAAAATGGAGATGGTATGGTGGCAGGTGAGAGAGACAATGGCACAGGTGATGTCAATGGGCCATTGCTGACTAGTTTCTTTCTGTTTCATATGTTTGGGT
CGGTTGATGGTGCTCGAGAGCCAAGACCACGTTCAAGGTCTTGGGTGAGGCATCGACGCTCTGGAGGAGGAACACCTGTACCGGAGCGCCGGTTCCTTTGGGGTGAAAAC
CTTTTGGGATTACAAGAAGATGCAGACGAAGATTTCCGCATATACATTGGTATGGGCGATGATGCATCACCACCTACGAGAAGGAGACGTGTAACCAGGCCTGGATCTGA
TGCAGATCAACCATGAGAGAATTCATCTTCTGTGGTAAACATCTATGCCCTTGAGGCGGGCATCCCAGCATTGATCCGTTGTGTGGGGTCGTCTGGAACATCAAGAAATA
TGTGCTTCCGAGTCCTCAAGGGTTCAGTTCGCCATCATCTTGATGCAAAATACACAGCCAGCATTTGGCTCACACAGGAAGCAAGAAGAGGGAAGACCCGATGAGATGAA
TAAGCATCATCTTTTCGAAGTTCTCAGATGGAATCATAATCCTTTTGCTTTATTTTATTTGAATTTAAGTTCTATATGATTGTAAGTCTCTTCCTAACCTTTGTATATAA
TGGGATGATGATGATTTCATTTGTTTAATTTGGTGTGATGATGGTTTTATTTATATAATCTGTCAAGTTAATAGGATTATATTTATATGTGGTTACTGAATCTTCTGTAT
GTATGCTTGGAAGATTGCATTATTATTTATAAGCTCATGTTTATAGTGACGTCAATGAAAAATAACGTGTTTGTCAA
Protein sequenceShow/hide protein sequence
MEQNLKAMNPKQYFKGAGGSYHKWLPSDYPVLAQSKVGAGALLLHPRGFTIPHYADASKVGYVLQGNNGVTGFIFPNTSNEEVINLKKGDLIPVPAGIASWWFNNGDSDL
EIVFLGETKTAHVSESEFPFIGQCGLVVVVESLPPNVVRSPVYVVGPSDQLIYVARGLGMIQIVGLSLSKIDVHVEIGQLVFVPKYFAVVTYLVFKMAGVKRRIHNDSDI
LALHKELDEVSCPICMDHPHNAVLLLCSSHHKGCKPYICDTSHRHSNCFDQFKKLREETRKSPRLSSPLPINPYSFSSPTNNLGLSIDLNEVDDNQNINERNTVASGGLP
GVALGDNGTENSNRTVDTNEAGDLDTAGSGSITERVEQEGLDAGNSSEYSNLKCPMCRGAVLGLEVVEEAREYLNLKKRSCSRETCSFSGNYQELRRHARRVHPTSRPAV
IDPSRERAWRRLERQREVGDVVSAIRSAMPGALVVGDYVIENGDGMVAGERDNGTGDVNGPLLTSFFLFHMFGSVDGAREPRPRSRSWVRHRRSGGGTPVPERRFLWGEN
LLGLQEDADEDFRIYIGMGDDASPPTRRRRVTRPGSDADQP