; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg012719 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg012719
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionHeat-inducible transcription repressor
Genome locationscaffold1:16822045..16826136
RNA-Seq ExpressionSpg012719
SyntenySpg012719
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7033696.1 hypothetical protein SDJN02_03421, partial [Cucurbita argyrosperma subsp. argyrosperma]5.1e-15871.22Show/hide
Query:  MAYRRRQGITRASTFKEEIHRPSDEFDHNSISSSSSSSSSLAAQAIRASAGQRDSSHSSAYGGTSA-FAPGHLRSKSFAGENVTLKSESKSGFWGVLARK
        MAYRRRQ ITRASTFKEEIH P+D+ DHNSISSSSSSSSSLAAQAIRASA  RDSS+SSA+ G+S+ F+PGH+RSKSFAGEN+T KSESKSGFWGVLARK
Subjt:  MAYRRRQGITRASTFKEEIHRPSDEFDHNSISSSSSSSSSLAAQAIRASAGQRDSSHSSAYGGTSA-FAPGHLRSKSFAGENVTLKSESKSGFWGVLARK

Query:  AKAILEEDDIAIEDETSRFQPINSSTRSQEPCQSTDYNSKKSDNPAR-KGLDAISTSLNPLGDTFEKAFEEGRTIVETKTADI-----KMQIRRKGN-NE
        AKAILEEDDIAIE+ETS FQPI++ST SQEPCQSTD++SKK DNPA+ KGLDAI+TS N LGDT +KA +EGR IV  KT DI     K+QIRRKGN +E
Subjt:  AKAILEEDDIAIEDETSRFQPINSSTRSQEPCQSTDYNSKKSDNPAR-KGLDAISTSLNPLGDTFEKAFEEGRTIVETKTADI-----KMQIRRKGN-NE

Query:  GLYSPANNLWQQPNIQSPEPQMQTHHETQLKASRDVSLALHASSSRLNTKFSNSAKGIEKTNVCPRLECNESVEVVAMATAAKAKLLLRELKTIKADLTF
        GL+   NN WQQPN+ SPEP+MQ HHETQLKASRD                                        VAMATAAKAKLLLRELKTIKA+L F
Subjt:  GLYSPANNLWQQPNIQSPEPQMQTHHETQLKASRDVSLALHASSSRLNTKFSNSAKGIEKTNVCPRLECNESVEVVAMATAAKAKLLLRELKTIKADLTF

Query:  AKERCAQLEEENKILRENREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEVYPISTSPEITKILS
        AKERCAQLEEENKILRENREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTE YP STSPEIT++L 
Subjt:  AKERCAQLEEENKILRENREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEVYPISTSPEITKILS

Query:  NSVSPRSSTSPTSPSALSVNSPATMEDLP---VHPKQDKDD-------------KDSDGEPTPTTAS-EEEGTNKSPLPSSSS
        N  SP SSTSPTSPSA      A MEDLP   VHPK+ KDD             +DS+G+P+PTT S EEEGT  SPLPSS+S
Subjt:  NSVSPRSSTSPTSPSALSVNSPATMEDLP---VHPKQDKDD-------------KDSDGEPTPTTAS-EEEGTNKSPLPSSSS

XP_008463299.1 PREDICTED: uncharacterized protein LOC103501490 isoform X1 [Cucumis melo]4.0e-16374.74Show/hide
Query:  MAYRRRQGITRASTFKEEIHRPSDEFDHNSISSSSSSSSSLAAQAIRASAGQRDSSHSSAYGGTSA-FAPGHLRSKSFAGENVTLKSESKSGFWGVLARK
        MAYRRRQ ITRASTFKEEIH PSDEFDHNSISSSSSSSSSLAAQAIRASA   DSSHSSA+ G S+ F+PGH+RSKSFAGENV  KSESKSGFWGVLARK
Subjt:  MAYRRRQGITRASTFKEEIHRPSDEFDHNSISSSSSSSSSLAAQAIRASAGQRDSSHSSAYGGTSA-FAPGHLRSKSFAGENVTLKSESKSGFWGVLARK

Query:  AKAILEEDDIAIEDETSRFQPINSSTRSQEPCQSTDYNSKKSDNPA-RKGLDAISTSLNPLGDTFEKAFEEGRTIVETKTADI-----KMQIRRKGNN-E
        AKAILEEDDIAIE+E SRFQPIN+S RSQEPCQST+ +SKK+DNPA RKGLDAISTSLN LGDTFEKA+ EGRTIVE KTADI     K+QIR+KGNN E
Subjt:  AKAILEEDDIAIEDETSRFQPINSSTRSQEPCQSTDYNSKKSDNPA-RKGLDAISTSLNPLGDTFEKAFEEGRTIVETKTADI-----KMQIRRKGNN-E

Query:  GLYSPANNLWQQPNIQSPEPQMQTHHETQLKASRDVSLALHASSSRLNTKFSNSAKGIEKTNVCPRLECNESVEVVAMATAAKAKLLLRELKTIKADLTF
        GLY   NN WQQPNIQSPEP MQTHHETQLKASRD                                        VAMATAAKAKLLLRELKTIKADL F
Subjt:  GLYSPANNLWQQPNIQSPEPQMQTHHETQLKASRDVSLALHASSSRLNTKFSNSAKGIEKTNVCPRLECNESVEVVAMATAAKAKLLLRELKTIKADLTF

Query:  AKERCAQLEEENKILRENREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEVYPISTSPEITKILS
        AKERCAQLEEENKILRENREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEVYPISTSPEITK LS
Subjt:  AKERCAQLEEENKILRENREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEVYPISTSPEITKILS

Query:  NSVSPRSSTSPTSPSALSVNSPATMEDLPVHPKQDKD--DKDSDGEPTPTTA-------SEEEGTNKSPLPSSSS
        NSVSPRS TSP+SP  +    P      P+  KQDKD  DKDSD  PT +T         EEEGT   PLPS++S
Subjt:  NSVSPRSSTSPTSPSALSVNSPATMEDLPVHPKQDKD--DKDSDGEPTPTTA-------SEEEGTNKSPLPSSSS

XP_011653735.2 uncharacterized protein LOC101205031 isoform X1 [Cucumis sativus]2.3e-16675.58Show/hide
Query:  MAYRRRQGITRASTFKEEIHRPSDEFDHNSISSSSSSSSSLAAQAIRASAGQRDSSHSSAYGGTSA-FAPGHLRSKSFAGENVTLKSESKSGFWGVLARK
        MAYRRRQ ITRASTFKEEIH PSDE DHNSISSSSSSSSSLAAQAIRASA QRDSSHSSA+ G S+ F+PGHLRSKSFAGEN+  KS+SKSGFWGVLARK
Subjt:  MAYRRRQGITRASTFKEEIHRPSDEFDHNSISSSSSSSSSLAAQAIRASAGQRDSSHSSAYGGTSA-FAPGHLRSKSFAGENVTLKSESKSGFWGVLARK

Query:  AKAILEEDDIAIEDETSRFQPINSSTRSQEPCQSTDYNSKKSDNPA-RKGLDAISTSLNPLGDTFEKAFEEGRTIVETKTADI-----KMQIRRKGNN-E
        AKAILEEDDIAIEDE SRFQPIN+S RSQEPCQST+ +SKK+DNPA RKGLDAISTSLN LGDTFEKA+EEGRTIVE KTADI     K+QIR+KGNN E
Subjt:  AKAILEEDDIAIEDETSRFQPINSSTRSQEPCQSTDYNSKKSDNPA-RKGLDAISTSLNPLGDTFEKAFEEGRTIVETKTADI-----KMQIRRKGNN-E

Query:  GLYSPANNLWQQPNIQSPEPQMQTHHETQLKASRDVSLALHASSSRLNTKFSNSAKGIEKTNVCPRLECNESVEVVAMATAAKAKLLLRELKTIKADLTF
        GLY   NN WQQPNIQSPEP MQTHHETQLKASRD                                        VAMATAAKAKLLLRELKTIKADL F
Subjt:  GLYSPANNLWQQPNIQSPEPQMQTHHETQLKASRDVSLALHASSSRLNTKFSNSAKGIEKTNVCPRLECNESVEVVAMATAAKAKLLLRELKTIKADLTF

Query:  AKERCAQLEEENKILRENREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEVYPISTSPEITKILS
        AKERCAQLEEENKILRENREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEVYPISTSPEITK+LS
Subjt:  AKERCAQLEEENKILRENREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEVYPISTSPEITKILS

Query:  NSVSPRSSTSPTSPSALSVNSPATMEDLPVHPKQDKD--DKDSDGEPTPTTA-------SEEEGTNKSPLPSSSS
        NSVSPRS TSP+SP  +    P      P+  KQDKD  DKDSD  PT +T         EEEGT K+PLPSS+S
Subjt:  NSVSPRSSTSPTSPSALSVNSPATMEDLPVHPKQDKD--DKDSDGEPTPTTA-------SEEEGTNKSPLPSSSS

XP_022978186.1 uncharacterized protein LOC111478244 [Cucurbita maxima]1.4e-16072.75Show/hide
Query:  MAYRRRQGITRASTFKEEIHRPSDEFDHNSISSSSSSSSSLAAQAIRASAGQRDSSHSSAYGGTSA-FAPGHLRSKSFAGENVTLKSESKSGFWGVLARK
        MAYRRRQ ITRASTFKEEIHRP+D+ DHNSISSSSSSSSSLA QAIRASA  RDSS+SSA+ G+S+ F+PGH+RSKSFAGEN+T KSESKSGFWGVLARK
Subjt:  MAYRRRQGITRASTFKEEIHRPSDEFDHNSISSSSSSSSSLAAQAIRASAGQRDSSHSSAYGGTSA-FAPGHLRSKSFAGENVTLKSESKSGFWGVLARK

Query:  AKAILEEDDIAIEDETSRFQPINSSTRSQEPCQSTDYNSKKSDNPAR-KGLDAISTSLNPLGDTFEKAFEEGRTIVETKTADI-----KMQIRRKGN-NE
        AKAILEEDDIAIE+ETS FQPIN+ST SQE CQ TD++SKKSDNPA+ KGLDAI+TSLN LGDT +KA +EGR IVE KT DI     K+QIRRKGN +E
Subjt:  AKAILEEDDIAIEDETSRFQPINSSTRSQEPCQSTDYNSKKSDNPAR-KGLDAISTSLNPLGDTFEKAFEEGRTIVETKTADI-----KMQIRRKGN-NE

Query:  GLYSPANNLWQQPNIQSPEPQMQTHHETQLKASRDVSLALHASSSRLNTKFSNSAKGIEKTNVCPRLECNESVEVVAMATAAKAKLLLRELKTIKADLTF
        GL+   NN WQQPN+ SPEPQMQ HHETQLKASRD                                        VAMATAAKAKLLLRELKTIKA+L F
Subjt:  GLYSPANNLWQQPNIQSPEPQMQTHHETQLKASRDVSLALHASSSRLNTKFSNSAKGIEKTNVCPRLECNESVEVVAMATAAKAKLLLRELKTIKADLTF

Query:  AKERCAQLEEENKILRENREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEVYPISTSPEITKILS
        AKERCAQLEEENKILRENREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTE YP STSPEIT++L 
Subjt:  AKERCAQLEEENKILRENREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEVYPISTSPEITKILS

Query:  NSVSPRSSTSPTSPSALSVNSPATMEDLP---VHPKQDKDD-------KDSDGEPTPTTAS-EEEGTNKSPLPSSSS
        N  SP SSTSPTSPSA      A MEDLP   VHPK++KDD       +DS+G+P+PTT S EEEGT  SPLPSS+S
Subjt:  NSVSPRSSTSPTSPSALSVNSPATMEDLP---VHPKQDKDD-------KDSDGEPTPTTAS-EEEGTNKSPLPSSSS

XP_038882986.1 uncharacterized protein LOC120074073 [Benincasa hispida]2.3e-16676.69Show/hide
Query:  MAYRRRQGITRASTFKEEIHRPSDEFDHNSISSSSSSSSSLAAQAIRASAGQRDSSHSSAYGGTSA-FAPGHLRSKSFAGENVTLKSESKSGFWGVLARK
        MAYRRRQGITRASTFKEEIH  SDEFDHNSISSSSSSSSSLAAQAIRAS   RDSS SSA GG+S+ F+ GH+RSKSFAGE+VT KSESKSGFWGVLARK
Subjt:  MAYRRRQGITRASTFKEEIHRPSDEFDHNSISSSSSSSSSLAAQAIRASAGQRDSSHSSAYGGTSA-FAPGHLRSKSFAGENVTLKSESKSGFWGVLARK

Query:  AKAILEEDDIAIEDETSRFQPINSSTRSQEPCQSTDYNSKKSDNPA-RKGLDAISTSLNPLGDTFEKAFEEGRTIVETKTADI-----KMQIRRKGNN-E
        AKAILEEDDI IE+ETSRFQ IN+ST SQEPCQSTDY+SKKSDNPA RKGLDAISTSLN LGDTFEKAFEEGR+IVETKTADI     K+QIR+KGNN E
Subjt:  AKAILEEDDIAIEDETSRFQPINSSTRSQEPCQSTDYNSKKSDNPA-RKGLDAISTSLNPLGDTFEKAFEEGRTIVETKTADI-----KMQIRRKGNN-E

Query:  GLYSPANNLWQQPNIQSPEPQMQTHHETQLKASRDVSLALHASSSRLNTKFSNSAKGIEKTNVCPRLECNESVEVVAMATAAKAKLLLRELKTIKADLTF
        GL+   NN WQQP+IQSPEP MQTHHETQLKASRD                                        VAMATAAKAKLLLRELKT+KADL F
Subjt:  GLYSPANNLWQQPNIQSPEPQMQTHHETQLKASRDVSLALHASSSRLNTKFSNSAKGIEKTNVCPRLECNESVEVVAMATAAKAKLLLRELKTIKADLTF

Query:  AKERCAQLEEENKILRENREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEVYPISTSPEITKILS
        AKERCAQLEEENKILRENREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLT QDVVY DEGMEEVTEVYPISTSPEITKILS
Subjt:  AKERCAQLEEENKILRENREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEVYPISTSPEITKILS

Query:  NSVSPRSSTSPTSPSALSVNSPATMEDLP-VHPKQDKD-DKDSDGEPTPTTA----SEEEGTNKSPLPSSSS
        +SVSPRS TS  SPS     S A MEDLP V PKQDKD  K SDG P+PTT      EEEGT+K PLPSS+S
Subjt:  NSVSPRSSTSPTSPSALSVNSPATMEDLP-VHPKQDKD-DKDSDGEPTPTTA----SEEEGTNKSPLPSSSS

TrEMBL top hitse value%identityAlignment
A0A0A0KXX4 Uncharacterized protein1.1e-16675.58Show/hide
Query:  MAYRRRQGITRASTFKEEIHRPSDEFDHNSISSSSSSSSSLAAQAIRASAGQRDSSHSSAYGGTSA-FAPGHLRSKSFAGENVTLKSESKSGFWGVLARK
        MAYRRRQ ITRASTFKEEIH PSDE DHNSISSSSSSSSSLAAQAIRASA QRDSSHSSA+ G S+ F+PGHLRSKSFAGEN+  KS+SKSGFWGVLARK
Subjt:  MAYRRRQGITRASTFKEEIHRPSDEFDHNSISSSSSSSSSLAAQAIRASAGQRDSSHSSAYGGTSA-FAPGHLRSKSFAGENVTLKSESKSGFWGVLARK

Query:  AKAILEEDDIAIEDETSRFQPINSSTRSQEPCQSTDYNSKKSDNPA-RKGLDAISTSLNPLGDTFEKAFEEGRTIVETKTADI-----KMQIRRKGNN-E
        AKAILEEDDIAIEDE SRFQPIN+S RSQEPCQST+ +SKK+DNPA RKGLDAISTSLN LGDTFEKA+EEGRTIVE KTADI     K+QIR+KGNN E
Subjt:  AKAILEEDDIAIEDETSRFQPINSSTRSQEPCQSTDYNSKKSDNPA-RKGLDAISTSLNPLGDTFEKAFEEGRTIVETKTADI-----KMQIRRKGNN-E

Query:  GLYSPANNLWQQPNIQSPEPQMQTHHETQLKASRDVSLALHASSSRLNTKFSNSAKGIEKTNVCPRLECNESVEVVAMATAAKAKLLLRELKTIKADLTF
        GLY   NN WQQPNIQSPEP MQTHHETQLKASRD                                        VAMATAAKAKLLLRELKTIKADL F
Subjt:  GLYSPANNLWQQPNIQSPEPQMQTHHETQLKASRDVSLALHASSSRLNTKFSNSAKGIEKTNVCPRLECNESVEVVAMATAAKAKLLLRELKTIKADLTF

Query:  AKERCAQLEEENKILRENREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEVYPISTSPEITKILS
        AKERCAQLEEENKILRENREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEVYPISTSPEITK+LS
Subjt:  AKERCAQLEEENKILRENREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEVYPISTSPEITKILS

Query:  NSVSPRSSTSPTSPSALSVNSPATMEDLPVHPKQDKD--DKDSDGEPTPTTA-------SEEEGTNKSPLPSSSS
        NSVSPRS TSP+SP  +    P      P+  KQDKD  DKDSD  PT +T         EEEGT K+PLPSS+S
Subjt:  NSVSPRSSTSPTSPSALSVNSPATMEDLPVHPKQDKD--DKDSDGEPTPTTA-------SEEEGTNKSPLPSSSS

A0A1S3CJA7 uncharacterized protein LOC103501490 isoform X11.9e-16374.74Show/hide
Query:  MAYRRRQGITRASTFKEEIHRPSDEFDHNSISSSSSSSSSLAAQAIRASAGQRDSSHSSAYGGTSA-FAPGHLRSKSFAGENVTLKSESKSGFWGVLARK
        MAYRRRQ ITRASTFKEEIH PSDEFDHNSISSSSSSSSSLAAQAIRASA   DSSHSSA+ G S+ F+PGH+RSKSFAGENV  KSESKSGFWGVLARK
Subjt:  MAYRRRQGITRASTFKEEIHRPSDEFDHNSISSSSSSSSSLAAQAIRASAGQRDSSHSSAYGGTSA-FAPGHLRSKSFAGENVTLKSESKSGFWGVLARK

Query:  AKAILEEDDIAIEDETSRFQPINSSTRSQEPCQSTDYNSKKSDNPA-RKGLDAISTSLNPLGDTFEKAFEEGRTIVETKTADI-----KMQIRRKGNN-E
        AKAILEEDDIAIE+E SRFQPIN+S RSQEPCQST+ +SKK+DNPA RKGLDAISTSLN LGDTFEKA+ EGRTIVE KTADI     K+QIR+KGNN E
Subjt:  AKAILEEDDIAIEDETSRFQPINSSTRSQEPCQSTDYNSKKSDNPA-RKGLDAISTSLNPLGDTFEKAFEEGRTIVETKTADI-----KMQIRRKGNN-E

Query:  GLYSPANNLWQQPNIQSPEPQMQTHHETQLKASRDVSLALHASSSRLNTKFSNSAKGIEKTNVCPRLECNESVEVVAMATAAKAKLLLRELKTIKADLTF
        GLY   NN WQQPNIQSPEP MQTHHETQLKASRD                                        VAMATAAKAKLLLRELKTIKADL F
Subjt:  GLYSPANNLWQQPNIQSPEPQMQTHHETQLKASRDVSLALHASSSRLNTKFSNSAKGIEKTNVCPRLECNESVEVVAMATAAKAKLLLRELKTIKADLTF

Query:  AKERCAQLEEENKILRENREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEVYPISTSPEITKILS
        AKERCAQLEEENKILRENREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEVYPISTSPEITK LS
Subjt:  AKERCAQLEEENKILRENREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEVYPISTSPEITKILS

Query:  NSVSPRSSTSPTSPSALSVNSPATMEDLPVHPKQDKD--DKDSDGEPTPTTA-------SEEEGTNKSPLPSSSS
        NSVSPRS TSP+SP  +    P      P+  KQDKD  DKDSD  PT +T         EEEGT   PLPS++S
Subjt:  NSVSPRSSTSPTSPSALSVNSPATMEDLPVHPKQDKD--DKDSDGEPTPTTA-------SEEEGTNKSPLPSSSS

A0A5D3C4F3 Actin cytoskeleton-regulatory complex protein pan11.9e-16374.74Show/hide
Query:  MAYRRRQGITRASTFKEEIHRPSDEFDHNSISSSSSSSSSLAAQAIRASAGQRDSSHSSAYGGTSA-FAPGHLRSKSFAGENVTLKSESKSGFWGVLARK
        MAYRRRQ ITRASTFKEEIH PSDEFDHNSISSSSSSSSSLAAQAIRASA   DSSHSSA+ G S+ F+PGH+RSKSFAGENV  KSESKSGFWGVLARK
Subjt:  MAYRRRQGITRASTFKEEIHRPSDEFDHNSISSSSSSSSSLAAQAIRASAGQRDSSHSSAYGGTSA-FAPGHLRSKSFAGENVTLKSESKSGFWGVLARK

Query:  AKAILEEDDIAIEDETSRFQPINSSTRSQEPCQSTDYNSKKSDNPA-RKGLDAISTSLNPLGDTFEKAFEEGRTIVETKTADI-----KMQIRRKGNN-E
        AKAILEEDDIAIE+E SRFQPIN+S RSQEPCQST+ +SKK+DNPA RKGLDAISTSLN LGDTFEKA+ EGRTIVE KTADI     K+QIR+KGNN E
Subjt:  AKAILEEDDIAIEDETSRFQPINSSTRSQEPCQSTDYNSKKSDNPA-RKGLDAISTSLNPLGDTFEKAFEEGRTIVETKTADI-----KMQIRRKGNN-E

Query:  GLYSPANNLWQQPNIQSPEPQMQTHHETQLKASRDVSLALHASSSRLNTKFSNSAKGIEKTNVCPRLECNESVEVVAMATAAKAKLLLRELKTIKADLTF
        GLY   NN WQQPNIQSPEP MQTHHETQLKASRD                                        VAMATAAKAKLLLRELKTIKADL F
Subjt:  GLYSPANNLWQQPNIQSPEPQMQTHHETQLKASRDVSLALHASSSRLNTKFSNSAKGIEKTNVCPRLECNESVEVVAMATAAKAKLLLRELKTIKADLTF

Query:  AKERCAQLEEENKILRENREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEVYPISTSPEITKILS
        AKERCAQLEEENKILRENREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEVYPISTSPEITK LS
Subjt:  AKERCAQLEEENKILRENREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEVYPISTSPEITKILS

Query:  NSVSPRSSTSPTSPSALSVNSPATMEDLPVHPKQDKD--DKDSDGEPTPTTA-------SEEEGTNKSPLPSSSS
        NSVSPRS TSP+SP  +    P      P+  KQDKD  DKDSD  PT +T         EEEGT   PLPS++S
Subjt:  NSVSPRSSTSPTSPSALSVNSPATMEDLPVHPKQDKD--DKDSDGEPTPTTA-------SEEEGTNKSPLPSSSS

A0A6J1GFX8 uncharacterized protein LOC1114535961.2e-15771.91Show/hide
Query:  MAYRRRQGITRASTFKEEIHRPSDEFDHNSISSSSSSSSSLAAQAIRASAGQRDSSHSSAYGGTSA-FAPGHLRSKSFAGENVTLKSESKSGFWGVLARK
        MAYRRRQ ITRASTFKEEIH P+D+ DHNSI   SSSSSSLAAQAIRASA  RDSS+SSA+ G+S+ F+PGH+RSKSFAGEN+T KSESKSGFWGVLARK
Subjt:  MAYRRRQGITRASTFKEEIHRPSDEFDHNSISSSSSSSSSLAAQAIRASAGQRDSSHSSAYGGTSA-FAPGHLRSKSFAGENVTLKSESKSGFWGVLARK

Query:  AKAILEEDDIAIEDETSRFQPINSSTRSQEPCQSTDYNSKKSDNPAR-KGLDAISTSLNPLGDTFEKAFEEGRTIVETKTADI-----KMQIRRKGN-NE
        AKAILEEDDIAIE+ETS FQPIN+ST SQEPCQSTD++SKKSDNPA+ KGLDAI+TS N LGDT +KA +EGR IV  KT DI     K+QIRRKGN +E
Subjt:  AKAILEEDDIAIEDETSRFQPINSSTRSQEPCQSTDYNSKKSDNPAR-KGLDAISTSLNPLGDTFEKAFEEGRTIVETKTADI-----KMQIRRKGN-NE

Query:  GLYSPANNLWQQPNIQSPEPQMQTHHETQLKASRDVSLALHASSSRLNTKFSNSAKGIEKTNVCPRLECNESVEVVAMATAAKAKLLLRELKTIKADLTF
        GL+   NN WQQPN+ SPEP+MQ HHETQLKASRD                                        VAMATAAKAKLLLRELKTIKA+L F
Subjt:  GLYSPANNLWQQPNIQSPEPQMQTHHETQLKASRDVSLALHASSSRLNTKFSNSAKGIEKTNVCPRLECNESVEVVAMATAAKAKLLLRELKTIKADLTF

Query:  AKERCAQLEEENKILRENREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEVYPISTSPEITKILS
        AKERCAQLEEENKILRENREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTE YP STSPEIT++L 
Subjt:  AKERCAQLEEENKILRENREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEVYPISTSPEITKILS

Query:  NSVSPRSSTSPTSPSALSVNSPATMEDLP---VHPKQDKDD-------KDSDGEPTPTTAS-EEEGTNKSPLPSSSS
        N  SP SSTSPTSPSA      A MEDLP   VHPK+ KDD       +DS+G+P+PTT S EEEGT  SPLPSS+S
Subjt:  NSVSPRSSTSPTSPSALSVNSPATMEDLP---VHPKQDKDD-------KDSDGEPTPTTAS-EEEGTNKSPLPSSSS

A0A6J1IM16 uncharacterized protein LOC1114782446.9e-16172.75Show/hide
Query:  MAYRRRQGITRASTFKEEIHRPSDEFDHNSISSSSSSSSSLAAQAIRASAGQRDSSHSSAYGGTSA-FAPGHLRSKSFAGENVTLKSESKSGFWGVLARK
        MAYRRRQ ITRASTFKEEIHRP+D+ DHNSISSSSSSSSSLA QAIRASA  RDSS+SSA+ G+S+ F+PGH+RSKSFAGEN+T KSESKSGFWGVLARK
Subjt:  MAYRRRQGITRASTFKEEIHRPSDEFDHNSISSSSSSSSSLAAQAIRASAGQRDSSHSSAYGGTSA-FAPGHLRSKSFAGENVTLKSESKSGFWGVLARK

Query:  AKAILEEDDIAIEDETSRFQPINSSTRSQEPCQSTDYNSKKSDNPAR-KGLDAISTSLNPLGDTFEKAFEEGRTIVETKTADI-----KMQIRRKGN-NE
        AKAILEEDDIAIE+ETS FQPIN+ST SQE CQ TD++SKKSDNPA+ KGLDAI+TSLN LGDT +KA +EGR IVE KT DI     K+QIRRKGN +E
Subjt:  AKAILEEDDIAIEDETSRFQPINSSTRSQEPCQSTDYNSKKSDNPAR-KGLDAISTSLNPLGDTFEKAFEEGRTIVETKTADI-----KMQIRRKGN-NE

Query:  GLYSPANNLWQQPNIQSPEPQMQTHHETQLKASRDVSLALHASSSRLNTKFSNSAKGIEKTNVCPRLECNESVEVVAMATAAKAKLLLRELKTIKADLTF
        GL+   NN WQQPN+ SPEPQMQ HHETQLKASRD                                        VAMATAAKAKLLLRELKTIKA+L F
Subjt:  GLYSPANNLWQQPNIQSPEPQMQTHHETQLKASRDVSLALHASSSRLNTKFSNSAKGIEKTNVCPRLECNESVEVVAMATAAKAKLLLRELKTIKADLTF

Query:  AKERCAQLEEENKILRENREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEVYPISTSPEITKILS
        AKERCAQLEEENKILRENREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTE YP STSPEIT++L 
Subjt:  AKERCAQLEEENKILRENREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEVYPISTSPEITKILS

Query:  NSVSPRSSTSPTSPSALSVNSPATMEDLP---VHPKQDKDD-------KDSDGEPTPTTAS-EEEGTNKSPLPSSSS
        N  SP SSTSPTSPSA      A MEDLP   VHPK++KDD       +DS+G+P+PTT S EEEGT  SPLPSS+S
Subjt:  NSVSPRSSTSPTSPSALSVNSPATMEDLP---VHPKQDKDD-------KDSDGEPTPTTAS-EEEGTNKSPLPSSSS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G30050.1 unknown protein3.7e-7448.02Show/hide
Query:  MAYRRRQGITRASTFKEEIHRPSDEFDHNSI-----------SSSSSSSSSLAAQAIRASAGQRDSSHSSAYGGTSAFAPGHLRSKSFAGENVTLKSESK
        MAYRRRQGITRASTFKE+I+    + DH  +           S S SS SSLAAQAIRAS        S A G T             A E+   KSES+
Subjt:  MAYRRRQGITRASTFKEEIHRPSDEFDHNSI-----------SSSSSSSSSLAAQAIRASAGQRDSSHSSAYGGTSAFAPGHLRSKSFAGENVTLKSESK

Query:  SGFWGVLARKAKAILEEDDIAIEDETSRFQPINSSTRSQEPCQSTDYNSKKSDNPARKGLDAISTSLNPLGDTFEKAFEEGRTIVETKTADIKMQIRRKG
         GFWG+LA+KAK+ILE+++                 + Q+  Q  D   + S+   RK +D I+TSLN +GD+FEKAFEEGRTIV +       QIRRKG
Subjt:  SGFWGVLARKAKAILEEDDIAIEDETSRFQPINSSTRSQEPCQSTDYNSKKSDNPARKGLDAISTSLNPLGDTFEKAFEEGRTIVETKTADIKMQIRRKG

Query:  NNEGLYSPANNLWQQPNIQSP-EPQMQTH-HETQLKASRDVSLALHASSSRLNTKFSNSAKGIEKTNVCPRLECNESVEVVAMATAAKAKLLLRELKTIK
         ++ + S  NN  Q     SP +P  Q +  E+QLKASRD                                        VAMATAAKAKLLLRELKT+K
Subjt:  NNEGLYSPANNLWQQPNIQSP-EPQMQTH-HETQLKASRDVSLALHASSSRLNTKFSNSAKGIEKTNVCPRLECNESVEVVAMATAAKAKLLLRELKTIK

Query:  ADLTFAKERCAQLEEENKILRENREKGDNR-ADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEVYPISTSPE
        ADL FAKERC+QLEEENK LR+NR+KG+N  ADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVY+DEG+EEV EV     +P 
Subjt:  ADLTFAKERCAQLEEENKILRENREKGDNR-ADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEVYPISTSPE

Query:  ITKILS---------NSVSPRSSTSPTSPSALSVNSPATMEDLPVHPKQDKDDKDSDGEPTPTTASEEEGTNKSPLPSS
        IT+ LS          S+SP S +SP SPS LSV++    +  P+  +Q   +  +   P P        T+    PSS
Subjt:  ITKILS---------NSVSPRSSTSPTSPSALSVNSPATMEDLPVHPKQDKDDKDSDGEPTPTTASEEEGTNKSPLPSS

AT2G30530.1 unknown protein5.9e-6445.59Show/hide
Query:  KEEIHRPSDEFDHNSISS------SSSSSSSLAAQAIRASAGQRDSSHSSAYGG-TSAFAPGHLRSKSFAGENVTLKS--ESKSGFWGVLARKAKAILEE
        KE +  P+    ++S SS      SSS++SSLAA+AIRAS+  RDSS SSAY   +SA  P   +  + A E  ++KS  E K GFWG LA KAKA L+E
Subjt:  KEEIHRPSDEFDHNSISS------SSSSSSSLAAQAIRASAGQRDSSHSSAYGG-TSAFAPGHLRSKSFAGENVTLKS--ESKSGFWGVLARKAKAILEE

Query:  DDIAIEDETSRFQPINSSTRSQEPCQSTDYNSKKSDNPA-RKGLDAISTSLNPLGDTFEKAFEEGRTIVETKTADIKMQIRRKGNNEGLYSPANNLWQQP
        DD     ++ +    +  + +    +      +KS+NP+ ++ LDAI++SLN +G T     EEG T VE +TA I  + R+K            + ++P
Subjt:  DDIAIEDETSRFQPINSSTRSQEPCQSTDYNSKKSDNPA-RKGLDAISTSLNPLGDTFEKAFEEGRTIVETKTADIKMQIRRKGNNEGLYSPANNLWQQP

Query:  NI--QSPEPQMQTHHETQLKASRDVSLALHASSSRLNTKFSNSAKGIEKTNVCPRLECNESVEVVAMATAAKAKLLLRELKTIKADLTFAKERCAQLEEE
        ++      P++Q   E QLKASRD                                        VAMA AAKAKLLLRELK +K+DL FAK+RCAQLEEE
Subjt:  NI--QSPEPQMQTHHETQLKASRDVSLALHASSSRLNTKFSNSAKGIEKTNVCPRLECNESVEVVAMATAAKAKLLLRELKTIKADLTFAKERCAQLEEE

Query:  NKILRENREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEVYPISTSPEITKILSNSVSPRSS
        NK+LRENR       DDDL+RLQLETLLAEKARLAHENSIY REN +LR +VEYHQLTMQDVVY DE  EEVTEVYPI+ S  ++    NS +P  S
Subjt:  NKILRENREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEVYPISTSPEITKILSNSVSPRSS

AT4G02800.1 unknown protein5.1e-2330.13Show/hide
Query:  SFAGENVTLKSESKSGFWGVLARKAKAILEEDDIAIEDETSRFQPINSSTRSQEPCQSTDYNSKKSDNPAR---KGLDAISTSLNPLGDTFEKAFEEGRT
        SF   + ++   S    W  +  +        D+ +E+ +   +P+ ++   +         SK+  N +    KG D++S +L+ L    + A +  R 
Subjt:  SFAGENVTLKSESKSGFWGVLARKAKAILEEDDIAIEDETSRFQPINSSTRSQEPCQSTDYNSKKSDNPAR---KGLDAISTSLNPLGDTFEKAFEEGRT

Query:  IVETKTADIKMQIRRKGNNEGLYSPANNLWQQPNIQSPEPQMQTHHETQLKASRDVSLALHASSSRLNTKFSNSAKGIEKTNVCPRLECNESVEVVAMAT
        + +  +            +E L+S         N+++ + Q Q   E + +         H S        SN  +   K       +  +  + +A++ 
Subjt:  IVETKTADIKMQIRRKGNNEGLYSPANNLWQQPNIQSPEPQMQTHHETQLKASRDVSLALHASSSRLNTKFSNSAKGIEKTNVCPRLECNESVEVVAMAT

Query:  AAKAKLLLRELKTIKADLTFAKERCAQLEEENKILRENREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGM
        AAKA  L RELKTIK+DL+F +ERC  LEEENK LR+   KG    +DDL+RLQLE LLAEKARLA+EN+   REN+ L ++VEYHQ+T QD   L    
Subjt:  AAKAKLLLRELKTIKADLTFAKERCAQLEEENKILRENREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGM

Query:  EEVTEVYPISTS
        E+V + + +  S
Subjt:  EEVTEVYPISTS

AT5G01970.1 unknown protein2.7e-7247.41Show/hide
Query:  MAYRRRQGITRASTFKEEIHRPSDEFDHNSISSSSSSSSSLAAQAIRASAGQRDSSHSSAYGGTSAFAPGHLRSKSFAGENVTLKSESKSGFWGVLARKA
        MAYRRRQGI + +TFKEE+ R                   L  ++I A   +   + SS        A    RSK+F  E          G WGV+A+KA
Subjt:  MAYRRRQGITRASTFKEEIHRPSDEFDHNSISSSSSSSSSLAAQAIRASAGQRDSSHSSAYGGTSAFAPGHLRSKSFAGENVTLKSESKSGFWGVLARKA

Query:  KAILEEDDIAIEDETSRFQPINSSTRSQEPCQS-----TDYNSKKSDNP-ARKGLDAISTSLNPLGDTFEKAFEEGRTIVETKTADI-----KMQIRRKG
        K+++E+D               SS RS    QS     +D   KK DNP  R+GLD +++SLN +GDTFEKAFE+GRT+VE KTADI     K+Q RR+G
Subjt:  KAILEEDDIAIEDETSRFQPINSSTRSQEPCQS-----TDYNSKKSDNP-ARKGLDAISTSLNPLGDTFEKAFEEGRTIVETKTADI-----KMQIRRKG

Query:  -----NNEGLYSPANNLWQQPNIQSPEPQMQTH---HETQLKASRDVSLALHASSSRLNTKFSNSAKGIEKTNVCPRLECNESVEVVAMATAAKAKLLLR
              N+      ++ W+    +SPE  MQ +   HETQLKASRD                                        VAMATAAKAKLLLR
Subjt:  -----NNEGLYSPANNLWQQPNIQSPEPQMQTH---HETQLKASRDVSLALHASSSRLNTKFSNSAKGIEKTNVCPRLECNESVEVVAMATAAKAKLLLR

Query:  ELKTIKADLTFAKERCAQLEEENKILRE-NREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEVYP
        ELKT+KADL FAKERCAQLEEENK LRE +REKG N AD+DLIRLQLE+LLAEKARLAHENS+YARENRFLREIVEYHQLTMQDVVY+DEG EEVT+V  
Subjt:  ELKTIKADLTFAKERCAQLEEENKILRE-NREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEVYP

Query:  ISTSPEITKILSNSVSPRSSTSPT
           SP ++ ++++S + RS + P+
Subjt:  ISTSPEITKILSNSVSPRSSTSPT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTATAGGAGAAGACAAGGAATCACGAGGGCTTCCACTTTCAAAGAAGAGATCCATCGTCCTTCCGACGAGTTCGATCACAATTCCATCTCTTCTTCCTCGTCTTC
CTCTTCTTCTCTTGCCGCTCAGGCGATCCGAGCTTCTGCTGGTCAACGTGATTCTTCGCATTCCTCTGCTTATGGCGGTACCTCCGCTTTTGCCCCTGGCCATTTACGAT
CCAAGAGTTTTGCTGGCGAGAATGTAACACTCAAAAGTGAGTCGAAATCCGGGTTTTGGGGCGTACTGGCTCGGAAAGCTAAGGCCATTCTCGAGGAAGATGATATAGCC
ATAGAAGATGAAACAAGTAGGTTCCAGCCCATTAATAGCTCAACCCGTAGCCAGGAGCCATGTCAGTCTACTGATTACAACTCGAAAAAATCGGACAATCCTGCACGGAA
GGGTTTGGATGCAATCTCAACTTCGCTTAATCCACTTGGGGACACATTTGAAAAGGCATTTGAGGAAGGTCGCACAATTGTGGAAACTAAGACTGCAGACATCAAAATGC
AGATTAGAAGAAAGGGCAACAATGAAGGATTATATTCTCCTGCGAATAATCTATGGCAACAACCGAATATTCAGTCTCCAGAACCACAGATGCAAACACATCATGAAACT
CAACTGAAGGCATCCCGAGACGTAAGCCTGGCTTTACATGCCTCTTCTTCACGGCTTAATACGAAGTTTTCTAATTCTGCAAAAGGAATTGAAAAGACCAATGTCTGTCC
TAGATTAGAATGCAATGAATCCGTCGAAGTTGTGGCGATGGCGACTGCTGCCAAAGCCAAACTATTACTTCGGGAGCTGAAAACCATTAAAGCAGATCTGACTTTTGCTA
AAGAAAGATGTGCACAATTAGAGGAAGAAAACAAAATTCTCAGGGAAAATCGTGAGAAGGGAGACAATCGTGCAGATGATGACTTGATTCGGCTTCAACTCGAGACTCTT
CTGGCTGAGAAGGCTCGTTTGGCGCATGAGAATTCAATATATGCCCGCGAGAACCGTTTCCTAAGGGAAATTGTGGAGTACCACCAACTCACCATGCAGGATGTGGTTTA
CTTAGACGAGGGAATGGAAGAAGTTACAGAAGTTTATCCCATTTCCACATCTCCAGAGATCACCAAAATACTTTCTAATTCTGTTTCACCACGCTCCTCAACCTCTCCAA
CCTCACCTTCAGCTCTCTCTGTGAACTCACCAGCAACCATGGAAGATCTTCCTGTTCATCCAAAACAAGACAAGGATGACAAAGACAGTGATGGAGAGCCAACTCCCACA
ACAGCTTCTGAGGAAGAAGGTACAAATAAATCCCCCTTGCCTTCTAGTTCTTCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCGTATAGGAGAAGACAAGGAATCACGAGGGCTTCCACTTTCAAAGAAGAGATCCATCGTCCTTCCGACGAGTTCGATCACAATTCCATCTCTTCTTCCTCGTCTTC
CTCTTCTTCTCTTGCCGCTCAGGCGATCCGAGCTTCTGCTGGTCAACGTGATTCTTCGCATTCCTCTGCTTATGGCGGTACCTCCGCTTTTGCCCCTGGCCATTTACGAT
CCAAGAGTTTTGCTGGCGAGAATGTAACACTCAAAAGTGAGTCGAAATCCGGGTTTTGGGGCGTACTGGCTCGGAAAGCTAAGGCCATTCTCGAGGAAGATGATATAGCC
ATAGAAGATGAAACAAGTAGGTTCCAGCCCATTAATAGCTCAACCCGTAGCCAGGAGCCATGTCAGTCTACTGATTACAACTCGAAAAAATCGGACAATCCTGCACGGAA
GGGTTTGGATGCAATCTCAACTTCGCTTAATCCACTTGGGGACACATTTGAAAAGGCATTTGAGGAAGGTCGCACAATTGTGGAAACTAAGACTGCAGACATCAAAATGC
AGATTAGAAGAAAGGGCAACAATGAAGGATTATATTCTCCTGCGAATAATCTATGGCAACAACCGAATATTCAGTCTCCAGAACCACAGATGCAAACACATCATGAAACT
CAACTGAAGGCATCCCGAGACGTAAGCCTGGCTTTACATGCCTCTTCTTCACGGCTTAATACGAAGTTTTCTAATTCTGCAAAAGGAATTGAAAAGACCAATGTCTGTCC
TAGATTAGAATGCAATGAATCCGTCGAAGTTGTGGCGATGGCGACTGCTGCCAAAGCCAAACTATTACTTCGGGAGCTGAAAACCATTAAAGCAGATCTGACTTTTGCTA
AAGAAAGATGTGCACAATTAGAGGAAGAAAACAAAATTCTCAGGGAAAATCGTGAGAAGGGAGACAATCGTGCAGATGATGACTTGATTCGGCTTCAACTCGAGACTCTT
CTGGCTGAGAAGGCTCGTTTGGCGCATGAGAATTCAATATATGCCCGCGAGAACCGTTTCCTAAGGGAAATTGTGGAGTACCACCAACTCACCATGCAGGATGTGGTTTA
CTTAGACGAGGGAATGGAAGAAGTTACAGAAGTTTATCCCATTTCCACATCTCCAGAGATCACCAAAATACTTTCTAATTCTGTTTCACCACGCTCCTCAACCTCTCCAA
CCTCACCTTCAGCTCTCTCTGTGAACTCACCAGCAACCATGGAAGATCTTCCTGTTCATCCAAAACAAGACAAGGATGACAAAGACAGTGATGGAGAGCCAACTCCCACA
ACAGCTTCTGAGGAAGAAGGTACAAATAAATCCCCCTTGCCTTCTAGTTCTTCCTAA
Protein sequenceShow/hide protein sequence
MAYRRRQGITRASTFKEEIHRPSDEFDHNSISSSSSSSSSLAAQAIRASAGQRDSSHSSAYGGTSAFAPGHLRSKSFAGENVTLKSESKSGFWGVLARKAKAILEEDDIA
IEDETSRFQPINSSTRSQEPCQSTDYNSKKSDNPARKGLDAISTSLNPLGDTFEKAFEEGRTIVETKTADIKMQIRRKGNNEGLYSPANNLWQQPNIQSPEPQMQTHHET
QLKASRDVSLALHASSSRLNTKFSNSAKGIEKTNVCPRLECNESVEVVAMATAAKAKLLLRELKTIKADLTFAKERCAQLEEENKILRENREKGDNRADDDLIRLQLETL
LAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEVYPISTSPEITKILSNSVSPRSSTSPTSPSALSVNSPATMEDLPVHPKQDKDDKDSDGEPTPT
TASEEEGTNKSPLPSSSS