; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0002965 (gene) of Snake gourd v1 genome

Gene IDTan0002965
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionDNA binding protein, putative isoform 1
Genome locationLG08:69202961..69214261
RNA-Seq ExpressionTan0002965
SyntenyTan0002965
Gene Ontology termsGO:0006383 - transcription by RNA polymerase III (biological process)
GO:0000127 - transcription factor TFIIIC complex (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR001680 - WD40 repeat
IPR015943 - WD40/YVTN repeat-like-containing domain superfamily
IPR017956 - AT hook, DNA-binding motif
IPR036322 - WD40-repeat-containing domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022144681.1 uncharacterized protein LOC111014310 isoform X1 [Momordica charantia]0.0e+0075.38Show/hide
Query:  MEEHQHQVE--------ASRGTSGKKGKKK-PVVRD-KELEKRAKKEAEATLVSEEQPAGRSDDPRIKVSEFDHCVENHFRAIDTIFKLCGEAEDGDDGI
        MEEH H VE        AS  T GK+GK+K PV R  KE   RAKK+       EEQP GR D   IKV EFDHC ENHFRA+DTI +LCGEAEDGD GI
Subjt:  MEEHQHQVE--------ASRGTSGKKGKKK-PVVRD-KELEKRAKKEAEATLVSEEQPAGRSDDPRIKVSEFDHCVENHFRAIDTIFKLCGEAEDGDDGI

Query:  DESDIQRFSSSTIFLREWRYYNYEPKTVKFAGDSSGPGGKDADITINLPQFSSAAVLKDRAPPEAAKSLDFRNFAMYVGGPIWALDWWPQVHERTDSLIK
        DESDIQRFSSS  FLREWR+YNYEPKTVKFA D  G  GKD DITINLPQFSSAAVLK+  P  AA SLD+RNF MYVGGP+WALDW PQV E+TD+LIK
Subjt:  DESDIQRFSSSTIFLREWRYYNYEPKTVKFAGDSSGPGGKDADITINLPQFSSAAVLKDRAPPEAAKSLDFRNFAMYVGGPIWALDWWPQVHERTDSLIK

Query:  CEFIAVSAHPPGSSYHKMGIPLTGRGMVQIWCLVHGTEIQEP-----------TKNTEASSNLSSQQKRPRGRPPGRKKKGESNLPSKPKRPRGRPKKKQ
        CEFIAVSAHPPGSSYHKMG PL GRGMVQIWCLVHGTE  EP            K  E SS+LSSQ KRPRGRPPG KKKG S+LPS+PKRPRGRPKKKQ
Subjt:  CEFIAVSAHPPGSSYHKMGIPLTGRGMVQIWCLVHGTEIQEP-----------TKNTEASSNLSSQQKRPRGRPPGRKKKGESNLPSKPKRPRGRPKKKQ

Query:  VESNDNRGDNYQLVQTLSIECPVGSSNLLDIDG-AKHSEEPVLLGKSVERQRSTLQEDSTFNSEDEVPAQKRRVRRKVGTKNHVDDMGTLTPTENREDGS
          SNDN GDN Q+VQ+LS+E P GSSNLL+IDG  K+SEE +LLG SVERQ+STLQ  ST NS+DE PAQKRRVRRKVGTKNH+DDMGTL  T NREDGS
Subjt:  VESNDNRGDNYQLVQTLSIECPVGSSNLLDIDG-AKHSEEPVLLGKSVERQRSTLQEDSTFNSEDEVPAQKRRVRRKVGTKNHVDDMGTLTPTENREDGS

Query:  NAINLQANDNVISEHSGEDTLLCNNISDNAVSDTSSEFSIPESVALPRVVLCLAHNGKVAWDLKWKPTKACTIKNKHRMGYLAVLLGNGSLEVWEVPFPH
        + I+ Q N+NVISE+SGEDTLLCNNIS NA      EFSIPESVALPRVVLCLAHNGKVAWDLKWKP+ ACT   KHRMGYLAVLLGNGSLEVWE+PFPH
Subjt:  NAINLQANDNVISEHSGEDTLLCNNISDNAVSDTSSEFSIPESVALPRVVLCLAHNGKVAWDLKWKPTKACTIKNKHRMGYLAVLLGNGSLEVWEVPFPH

Query:  VVKAIYSKFNGEGTDPRFMKLKPIFRCSMLRSANTQSIPLTVEWSSTPPNDYLLAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRGVAWAPSESD
        VVKAIYSKFN EGTDPRF+KLKPIFR +ML+SAN QSIPLTVEWSSTPP DYL AGC+DGTVALWKFSANSTCEDTRPLLRFSADTVPIR VAWAP+ESD
Subjt:  VVKAIYSKFNGEGTDPRFMKLKPIFRCSMLRSANTQSIPLTVEWSSTPPNDYLLAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRGVAWAPSESD

Query:  PETANVVLTVGHGGLKFWDLRDPFRPLWDLHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLPKAAYDVPVTGKPFTAIKQKGLHTYFCTSSAIWSAQV
        PE+ANVVLT  HGGLKFWDLRDPFRPLWD+HPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSL KAAYDVPVTGKPFT  KQ+GLH+Y+ +S AIWS QV
Subjt:  PETANVVLTVGHGGLKFWDLRDPFRPLWDLHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLPKAAYDVPVTGKPFTAIKQKGLHTYFCTSSAIWSAQV

Query:  SRQTGMVAYCGADGTVVRFQLTTKAVVKENSRNRTPQFVCDYFTDEQSTITIHTPESDVPVPLKKVANRSEPTPLSMRAILSE-IQSNEGNHKTATVSVS
        SRQTGMVAYC ADG V+RFQLTT+AV K++SRNRTP F+C+Y T+E+S ITIH+P S VP PLKK +N+S+  PLS RAILS+ I+SNEGNHKTAT + S
Subjt:  SRQTGMVAYCGADGTVVRFQLTTKAVVKENSRNRTPQFVCDYFTDEQSTITIHTPESDVPVPLKKVANRSEPTPLSMRAILSE-IQSNEGNHKTATVSVS

Query:  ESEALALRCDDDNSVESGSEDTPM----KNKTQSKCKKKGGVDDQELECSDE----QTNNDRVPSSGDNFENLPPRSVALHRVRWNMNIGSERWLCYGGA
        E+EALA+  D+D SV+SGSEDT M    KN+TQSKCKKK  VD Q LECSDE    QT  D +P SGDNFE  PP+SVALHRVRWNMN GSERWLCYGG 
Subjt:  ESEALALRCDDDNSVESGSEDTPM----KNKTQSKCKKKGGVDDQELECSDE----QTNNDRVPSSGDNFENLPPRSVALHRVRWNMNIGSERWLCYGGA

Query:  AGILRCQEIMLSALDKNLMKKK
        AGI+RCQEI+LS  DK LM+KK
Subjt:  AGILRCQEIMLSALDKNLMKKK

XP_022930748.1 uncharacterized protein LOC111437104 [Cucurbita moschata]0.0e+0077.79Show/hide
Query:  MAMEEHQHQVEASRGTSGKKGKKKPVVR-DKELEKRAKKEAEATLVSEEQPAGRSDDPRIKVSEFDHCVENHFRAIDTIFKLCGEAEDGDDGIDESDIQR
        MAME+  +  EAS GT   KGKKKPV R  KE E RAKK+ EAT V+EEQPAGRSD P IKV EFD+CVENHF+AIDT+ +LCG+ EDG   IDESDIQR
Subjt:  MAMEEHQHQVEASRGTSGKKGKKKPVVR-DKELEKRAKKEAEATLVSEEQPAGRSDDPRIKVSEFDHCVENHFRAIDTIFKLCGEAEDGDDGIDESDIQR

Query:  FSSSTIFLREWRYYNYEPKTVKFAGDSSGPGGKDADITINLPQFSSAAVLKDRAPPEAAKSLDFRNFAMYVGGPIWALDWWPQVHERTDSLIKCEFIAVS
        FSSSTIFLREWRYYNYEPKTVKFA DSSGP  KDADI++NLPQFSSAA LK+ APPEAA S+DFRNF MYVGGPIWALDW PQ HERTDSLIKCE+IAVS
Subjt:  FSSSTIFLREWRYYNYEPKTVKFAGDSSGPGGKDADITINLPQFSSAAVLKDRAPPEAAKSLDFRNFAMYVGGPIWALDWWPQVHERTDSLIKCEFIAVS

Query:  AHPPGSSYHKMGIPLTGRGMVQIWCLVHGTEIQ--EPTKNTEASSNLSSQQKRPRGRPPGRKKKGESNLPSKPKRPRGRPKKKQVESNDNRGDNYQLVQT
        AHPP SSYHKMGIPLTGRGMVQIWC+VHGTE    EPTKN+ ASSNLSSQ ++PRGRPPGRKK   SNLPS+PKRPRGRPKKKQ ESNDN          
Subjt:  AHPPGSSYHKMGIPLTGRGMVQIWCLVHGTEIQ--EPTKNTEASSNLSSQQKRPRGRPPGRKKKGESNLPSKPKRPRGRPKKKQVESNDNRGDNYQLVQT

Query:  LSIECPVGSSNLLDIDGAKHSEEPVLLGKSVERQRSTLQEDSTFNSEDEVPAQKRRVRRKVGTKNHVDDMGTLTPTENREDGSNAINLQANDNVISEHSG
                                     SVE  RSTLQE  T NS+DEV AQK+RVRRKV TKNHVDD+G L+ TENRE+ SNAINLQAN+NVISE+SG
Subjt:  LSIECPVGSSNLLDIDGAKHSEEPVLLGKSVERQRSTLQEDSTFNSEDEVPAQKRRVRRKVGTKNHVDDMGTLTPTENREDGSNAINLQANDNVISEHSG

Query:  EDTLLCNNISDNAVSDTSS-EFSIPESVALPRVVLCLAHNGKVAWDLKWKPTKACTIKNKHRMGYLAVLLGNGSLEVWEVPFPHVVKAIYSKFNGEGTDP
        EDTLLCNN+S+NA  D SS EFSIPESVALPRVVLCLAHNGKVAWDLKWKPT AC  K KHRMGYLAVLLGNGSLEVWEVPFPHV++AIYSK NGEGTDP
Subjt:  EDTLLCNNISDNAVSDTSS-EFSIPESVALPRVVLCLAHNGKVAWDLKWKPTKACTIKNKHRMGYLAVLLGNGSLEVWEVPFPHVVKAIYSKFNGEGTDP

Query:  RFMKLKPIFRCSMLRSANTQSIPLTVEWSSTPPNDYLLAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRGVAWAPSESDPETANVVLTVGHGGLK
        RF+KL P FRCSMLRSAN QSIPLTVEWSSTPP DYLLAGCHDGTVALWKF+ANS CEDTRPLLRFSADTVPIRGVAWAP++SDPE  NV+LT GHGGLK
Subjt:  RFMKLKPIFRCSMLRSANTQSIPLTVEWSSTPPNDYLLAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRGVAWAPSESDPETANVVLTVGHGGLK

Query:  FWDLRDPFRPLWDLHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLPKAAYDVPVTGKPFTAIKQKGLHTYFCTSSAIWSAQVSRQTGMVAYCGADGTV
        FWDLRDPFRPLWDLHPAPR+IYSLDWL DP C ILSFDDGTLRLLSLPKAA DVPVTGKPFT +KQKGLHTY CTS+AIWS QVSRQTGMVAYCGADGTV
Subjt:  FWDLRDPFRPLWDLHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLPKAAYDVPVTGKPFTAIKQKGLHTYFCTSSAIWSAQVSRQTGMVAYCGADGTV

Query:  VRFQLTTKAVVKENSRNRTPQFVCDYFTDEQSTITIHTPESDVPVPLKKVANRSEPTPLSMRAILSEIQSNEGNHKTATVSVSESEALALRCDDDNSVES
        VRFQLTTKAVVKENSRNRTPQFVCDYFT+EQSTITIHTPE DVP PLKK++NR +P PLSMRAILSEIQSNEGNHKTA  S+SE+  LA   DDD +VES
Subjt:  VRFQLTTKAVVKENSRNRTPQFVCDYFTDEQSTITIHTPESDVPVPLKKVANRSEPTPLSMRAILSEIQSNEGNHKTATVSVSESEALALRCDDDNSVES

Query:  GSEDTP----MKNKTQSKCKKKGGVDDQELECSDE----QTNNDRVPSSGDNFENLPPRSVALHRVRWNMNIGSERWLCYGGAAGILRCQEIMLSALDKN
        GSEDTP     K+KTQSKCKKKG  D+QE ECSDE     TNN   P SGD+ ENLPP+SVA+HRVRWNMN GSERWLCYGGAAG+LRCQEI+LSALDK 
Subjt:  GSEDTP----MKNKTQSKCKKKGGVDDQELECSDE----QTNNDRVPSSGDNFENLPPRSVALHRVRWNMNIGSERWLCYGGAAGILRCQEIMLSALDKN

Query:  LMKKK
        LM KK
Subjt:  LMKKK

XP_023005875.1 uncharacterized protein LOC111498749 [Cucurbita maxima]0.0e+0077.88Show/hide
Query:  MAMEEHQHQVEASRGTSGKKGKKKPVVR-DKELEKRAKKEAEATLVSEEQPAGRSDDPRIKVSEFDHCVENHFRAIDTIFKLCGEAEDGDDGIDESDIQR
        MAME+  +  EAS GT   KGKKKPV R  KE E RAKK+ EAT V+EEQ AGRSD P IKV EFDHCVENHF+AIDT+ +LCG+ EDG   IDE DIQR
Subjt:  MAMEEHQHQVEASRGTSGKKGKKKPVVR-DKELEKRAKKEAEATLVSEEQPAGRSDDPRIKVSEFDHCVENHFRAIDTIFKLCGEAEDGDDGIDESDIQR

Query:  FSSSTIFLREWRYYNYEPKTVKFAGDSSGPGGKDADITINLPQFSSAAVLKDRAPPEAAKSLDFRNFAMYVGGPIWALDWWPQVHERTDSLIKCEFIAVS
        FSSSTIFLREWRYYNYEPKTVKFA DS GP  KDADI++NLPQFSSAA LK+ APPEAA SLDFRNF MYVGGPIWALDW PQ HERTDSLIKCE+IAVS
Subjt:  FSSSTIFLREWRYYNYEPKTVKFAGDSSGPGGKDADITINLPQFSSAAVLKDRAPPEAAKSLDFRNFAMYVGGPIWALDWWPQVHERTDSLIKCEFIAVS

Query:  AHPPGSSYHKMGIPLTGRGMVQIWCLVHGTEIQ--EPTKNTEASSNLSSQQKRPRGRPPGRKKKGESNLPSKPKRPRGRPKKKQVESNDNRGDNYQLVQT
        AHPP SSYHKMGIPLTGRGMVQIWC+VHGTE    EPTKN+ ASSNLSSQ ++PRGRPPGRKK   SNLPS+PKRPRGRPKKKQ ESNDN          
Subjt:  AHPPGSSYHKMGIPLTGRGMVQIWCLVHGTEIQ--EPTKNTEASSNLSSQQKRPRGRPPGRKKKGESNLPSKPKRPRGRPKKKQVESNDNRGDNYQLVQT

Query:  LSIECPVGSSNLLDIDGAKHSEEPVLLGKSVERQRSTLQEDSTFNSEDEVPAQKRRVRRKVGTKNHVDDMGTLTPTENREDGSNAINLQANDNVISEHSG
                                     SVE  RSTLQE  T NS+DEV AQK+R+RRKV TKNHVDD+GTL  TENRE+ SNAINLQAN+NVISE+SG
Subjt:  LSIECPVGSSNLLDIDGAKHSEEPVLLGKSVERQRSTLQEDSTFNSEDEVPAQKRRVRRKVGTKNHVDDMGTLTPTENREDGSNAINLQANDNVISEHSG

Query:  EDTLLCNNISDNAVSDTSSEFSIPESVALPRVVLCLAHNGKVAWDLKWKPTKACTIKNKHRMGYLAVLLGNGSLEVWEVPFPHVVKAIYSKFNGEGTDPR
        EDTLLCNN+S+NA  D SS FSIPESVALPRVVLCLAHNGKVAWDLKWKPT AC  K KHRMGYLA+LLGNGSLEVWEVPFPHV++AIYSK NGEGTDPR
Subjt:  EDTLLCNNISDNAVSDTSSEFSIPESVALPRVVLCLAHNGKVAWDLKWKPTKACTIKNKHRMGYLAVLLGNGSLEVWEVPFPHVVKAIYSKFNGEGTDPR

Query:  FMKLKPIFRCSMLRSANTQSIPLTVEWSSTPPNDYLLAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRGVAWAPSESDPETANVVLTVGHGGLKF
        F+KLKP FRCSMLRSAN QSIPLTVEWSSTPP DYLLAGCHDGTVALWKFSANS CEDTRPLLRFSADTVPIRGVAWAP++SDPE  NV+LT GHGGLKF
Subjt:  FMKLKPIFRCSMLRSANTQSIPLTVEWSSTPPNDYLLAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRGVAWAPSESDPETANVVLTVGHGGLKF

Query:  WDLRDPFRPLWDLHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLPKAAYDVPVTGKPFTAIKQKGLHTYFCTSSAIWSAQVSRQTGMVAYCGADGTVV
        WDLRDPFRPLWDLHPAPR+IYSLDWL DPRC ILSFDDGTLRLLSLPKAA DVPVTGKPFT +KQKGLHTY CTS+AIWS QVSRQTGMVAYCGADGTVV
Subjt:  WDLRDPFRPLWDLHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLPKAAYDVPVTGKPFTAIKQKGLHTYFCTSSAIWSAQVSRQTGMVAYCGADGTVV

Query:  RFQLTTKAVVKENSRNRTPQFVCDYFTDEQSTITIHTPESDVPVPLKKVANRSEPTPLSMRAILSEIQSNEGNHKTATVSVSESEALALRCDDDNSVESG
        RFQLTTKAVVKENSRNRTPQFVCDYFT+EQSTITIHTPE DVP PLKK++NR +P PLSMRAILSEIQSNEGNHKTA  S+SE+  LA   DDD +VESG
Subjt:  RFQLTTKAVVKENSRNRTPQFVCDYFTDEQSTITIHTPESDVPVPLKKVANRSEPTPLSMRAILSEIQSNEGNHKTATVSVSESEALALRCDDDNSVESG

Query:  SEDTP----MKNKTQSKCKKKGGVDDQELECSDE----QTNNDRVPSSGDNFENLPPRSVALHRVRWNMNIGSERWLCYGGAAGILRCQEIMLSALDKNL
        SEDTP     K+KTQSKCKKKG  D+QE ECSDE     TNN   P SGD+ ENLPP+SVA+HRVRWNMN GSERWLCYGGAAG+LRCQEI+LSALDK L
Subjt:  SEDTP----MKNKTQSKCKKKGGVDDQELECSDE----QTNNDRVPSSGDNFENLPPRSVALHRVRWNMNIGSERWLCYGGAAGILRCQEIMLSALDKNL

Query:  MKKK
        M KK
Subjt:  MKKK

XP_023539530.1 uncharacterized protein LOC111800171 [Cucurbita pepo subsp. pepo]0.0e+0077.68Show/hide
Query:  MAMEEHQHQVEASRGTSGKKGKKKPVVR-DKELEKRAKKEAEATLVSEEQPAGRSDDPRIKVSEFDHCVENHFRAIDTIFKLCGEAEDGDDGIDESDIQR
        MAME+  +  EAS GT   KGKKKPV R  KE E RAKK+ EAT V+EEQPAGRSD P IK+ EFD+CVENHF+AIDT+ +LCG+ EDG   IDESDIQR
Subjt:  MAMEEHQHQVEASRGTSGKKGKKKPVVR-DKELEKRAKKEAEATLVSEEQPAGRSDDPRIKVSEFDHCVENHFRAIDTIFKLCGEAEDGDDGIDESDIQR

Query:  FSSSTIFLREWRYYNYEPKTVKFAGDSSGPGGKDADITINLPQFSSAAVLKDRAPPEAAKSLDFRNFAMYVGGPIWALDWWPQVHERTDSLIKCEFIAVS
        FSSSTIFLREWRYYNYEPKTVKFA DS GP  KDADI++NLPQFSSAA LK+ APPEAA SLDFRNF MYVGGPIWALDW PQ HERTDSLIKCE+IAVS
Subjt:  FSSSTIFLREWRYYNYEPKTVKFAGDSSGPGGKDADITINLPQFSSAAVLKDRAPPEAAKSLDFRNFAMYVGGPIWALDWWPQVHERTDSLIKCEFIAVS

Query:  AHPPGSSYHKMGIPLTGRGMVQIWCLVHGTEIQ--EPTKNTEASSNLSSQQKRPRGRPPGRKKKGESNLPSKPKRPRGRPKKKQVESNDNRGDNYQLVQT
        AHPP SSYHKMGIPLTGRGMVQIWC+VHGTE    EPTK++ ASSNLSSQ ++PRGRPPGRKK   SNLPS+PKRPRGRPKKKQ E NDN          
Subjt:  AHPPGSSYHKMGIPLTGRGMVQIWCLVHGTEIQ--EPTKNTEASSNLSSQQKRPRGRPPGRKKKGESNLPSKPKRPRGRPKKKQVESNDNRGDNYQLVQT

Query:  LSIECPVGSSNLLDIDGAKHSEEPVLLGKSVERQRSTLQEDSTFNSEDEVPAQKRRVRRKVGTKNHVDDMGTLTPTENREDGSNAINLQANDNVISEHSG
                                     SVE  RSTLQE  T NS+DEV AQK+RVRRKV TKNHVDD+GTL+ TENRE+ SNAINLQAN+NVISE+SG
Subjt:  LSIECPVGSSNLLDIDGAKHSEEPVLLGKSVERQRSTLQEDSTFNSEDEVPAQKRRVRRKVGTKNHVDDMGTLTPTENREDGSNAINLQANDNVISEHSG

Query:  EDTLLCNNISDNAVSDTSS-EFSIPESVALPRVVLCLAHNGKVAWDLKWKPTKACTIKNKHRMGYLAVLLGNGSLEVWEVPFPHVVKAIYSKFNGEGTDP
        EDTLLCNN+S+N   D SS EFSIPESVALPRVVLCLAHNGKVAWDLKWKPT AC  K KHRMGYLAVLLGNGSLEVWEVPFPHV++AIYSK NGEGTDP
Subjt:  EDTLLCNNISDNAVSDTSS-EFSIPESVALPRVVLCLAHNGKVAWDLKWKPTKACTIKNKHRMGYLAVLLGNGSLEVWEVPFPHVVKAIYSKFNGEGTDP

Query:  RFMKLKPIFRCSMLRSANTQSIPLTVEWSSTPPNDYLLAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRGVAWAPSESDPETANVVLTVGHGGLK
        RF+KLKP FRCS+LRSAN QSIPLTVEWSSTPP DYLLAGCHDGTVALWKFSANS CEDTRPLLRFSADTVPIRGVAWAP++SDPE  NV+LT GHGGLK
Subjt:  RFMKLKPIFRCSMLRSANTQSIPLTVEWSSTPPNDYLLAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRGVAWAPSESDPETANVVLTVGHGGLK

Query:  FWDLRDPFRPLWDLHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLPKAAYDVPVTGKPFTAIKQKGLHTYFCTSSAIWSAQVSRQTGMVAYCGADGTV
        FWDLRDPFRPLWDLHPAPR+IYSLDWL DP C ILSFDDGTLRLLSLPKAA DVPVTGKPFT +KQKGLHTY CTS+AIWS QVSRQTGMVAYCGADGTV
Subjt:  FWDLRDPFRPLWDLHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLPKAAYDVPVTGKPFTAIKQKGLHTYFCTSSAIWSAQVSRQTGMVAYCGADGTV

Query:  VRFQLTTKAVVKENSRNRTPQFVCDYFTDEQSTITIHTPESDVPVPLKKVANRSEPTPLSMRAILSEIQSNEGNHKTATVSVSESEALALRCDDDNSVES
        VRFQLTTKAVVKENSRNRTPQFVCDYFT+EQSTITIHTPE DVP PLKK++NR +P PLSMRAILSEIQSNEGNHKTA  S+SE+  LA   DDD +VES
Subjt:  VRFQLTTKAVVKENSRNRTPQFVCDYFTDEQSTITIHTPESDVPVPLKKVANRSEPTPLSMRAILSEIQSNEGNHKTATVSVSESEALALRCDDDNSVES

Query:  GSEDTP----MKNKTQSKCKKKGGVDDQELECSDE----QTNNDRVPSSGDNFENLPPRSVALHRVRWNMNIGSERWLCYGGAAGILRCQEIMLSALDKN
        GSEDTP     K+KTQSKCKKKG  D+QE ECSDE     TNN   P SGD+ ENLPPRSVA+HRVRWNMN GSERWLCYGGAAG+LRCQEI+LSALDK 
Subjt:  GSEDTP----MKNKTQSKCKKKGGVDDQELECSDE----QTNNDRVPSSGDNFENLPPRSVALHRVRWNMNIGSERWLCYGGAAGILRCQEIMLSALDKN

Query:  LMKKK
        LM KK
Subjt:  LMKKK

XP_038903194.1 uncharacterized protein LOC120089853 [Benincasa hispida]0.0e+0076.42Show/hide
Query:  MEEHQHQVEASRGTSGKKGKKKPVVRD-KELEKRAKKEAEATL----VSEEQPAGRSDDPRIKVSEFDHCVENHFRAIDTIFKLCGEAEDGDDGIDESDI
        MEE Q Q + S GTS KKGKKKP  R+ K+ EK A+ +  AT     V++ QP GR D P++KVSEFDHC+ENHF A+DTI +LC EAEDG  GIDESDI
Subjt:  MEEHQHQVEASRGTSGKKGKKKPVVRD-KELEKRAKKEAEATL----VSEEQPAGRSDDPRIKVSEFDHCVENHFRAIDTIFKLCGEAEDGDDGIDESDI

Query:  QRFSSSTIFLREWRYYNYEPKTVKFAGDSSGPGGKDADITINLPQFSSAAVLKDRAPPEAAKSLDFRNFAMYVGGPIWALDWWPQVHERTDSLIKCEFIA
        QRF+SSTIFLREWR+YNYEPK +KFA DS GP GKDADITI LPQFSSAAVLK+ APP A  SLDFRNFAM+VGGP+WALDW PQVHERTDSLIKCEFIA
Subjt:  QRFSSSTIFLREWRYYNYEPKTVKFAGDSSGPGGKDADITINLPQFSSAAVLKDRAPPEAAKSLDFRNFAMYVGGPIWALDWWPQVHERTDSLIKCEFIA

Query:  VSAHPPGSSYHKMGIPLTGRGMVQIWCLVHGTEIQEPTKNTEASSNLSSQQKRPRGRPPGRKKKGESNLPSKPKRPRGRPKKKQVESNDNRGDNYQLVQT
        VSAHPPGSSYHKMGIPLTGRGMVQIWC VHGTE  EPT   E  ++LSSQ KRPRGRP GRKK G S LP +PKRPRGRPKKKQ ESND +GD+  LVQ 
Subjt:  VSAHPPGSSYHKMGIPLTGRGMVQIWCLVHGTEIQEPTKNTEASSNLSSQQKRPRGRPPGRKKKGESNLPSKPKRPRGRPKKKQVESNDNRGDNYQLVQT

Query:  LSIECPVGSSNLLDIDGA-KHSEEPVLLGKSVERQRSTLQEDSTFNSEDEVPAQKRRVRRKVGTKNHVDDMGTLTPTENREDGSNAINLQANDNVISEHS
         SIE PVGSSNLL++DG  K+SE  VLL  SVER+RSTLQE ST NSEDEVPAQKRRVRRK   KNHV D+G L+ TENREDGSNAI+L+AN+NV+ E+S
Subjt:  LSIECPVGSSNLLDIDGA-KHSEEPVLLGKSVERQRSTLQEDSTFNSEDEVPAQKRRVRRKVGTKNHVDDMGTLTPTENREDGSNAINLQANDNVISEHS

Query:  GEDTLLCNNISDNAVSDTSS-EFSIPESVALPRVVLCLAHNGKVAWDLKWKPTKACTIKNKHRMGYLAVLLGNGSLEVWEVPFPHVVKAIYSKFNGEGTD
        GED LLC NIS NAV DTSS EFSIPESVALPRVVLCLAHNGKVAWDLKWKPT A T   K RMGYLAVLLGNGSLEVWEVPFPH VKAIYSKFNGEGTD
Subjt:  GEDTLLCNNISDNAVSDTSS-EFSIPESVALPRVVLCLAHNGKVAWDLKWKPTKACTIKNKHRMGYLAVLLGNGSLEVWEVPFPHVVKAIYSKFNGEGTD

Query:  PRFMKLKPIFRCSMLRSANTQSIPLTVEWSSTPPNDYLLAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRGVAWAPSESDPETANVVLTVGHGGL
        PRF+KLKPIFRCSMLR+ANTQSIPLTVEWS TPP DYLLAGCHDGTVALWKFSANS+CEDTRPLLRFSADTVPIR VAWAPSES  E+ANV+LT GHGGL
Subjt:  PRFMKLKPIFRCSMLRSANTQSIPLTVEWSSTPPNDYLLAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRGVAWAPSESDPETANVVLTVGHGGL

Query:  KFWDLRDPFRPLWDLHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLPKAAYDVPVTGKPFTAIKQKGLHTYFCTSSAIWSAQVSRQTGMVAYCGADGT
        KFWDLRDPFRPLWDLHPAPR+IYSLDWLP+PRCV LSFDDGTLRLLSL KAAYDVPVTG+PFTAIKQKGLHTYFC+S AIWS QVSRQTGMVAYCGADG 
Subjt:  KFWDLRDPFRPLWDLHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLPKAAYDVPVTGKPFTAIKQKGLHTYFCTSSAIWSAQVSRQTGMVAYCGADGT

Query:  VVRFQLTTKAVVKENSRNRTPQFVCDYFTDEQSTITIHTPESDVPVPLKKVANRSEPTPLSMRAILSE-IQSNEGNHKTATVSVSESEALALRCDDDNSV
        VVRFQLTTKA  KENSR+RTP +VC+Y T+E+STITIH+P  ++P  LKK++N+SE  PLSMRAILS+ +QSNEGNHKTAT    E+E+ AL  D D  V
Subjt:  VVRFQLTTKAVVKENSRNRTPQFVCDYFTDEQSTITIHTPESDVPVPLKKVANRSEPTPLSMRAILSE-IQSNEGNHKTATVSVSESEALALRCDDDNSV

Query:  ESGSEDTPM----KNKTQSKCKKKGGVDDQELECSDE-------------QTNNDRVPSSGDNFENLPPRSVALHRVRWNMNIGSERWLCYGGAAGILRC
        ESG EDT M    KN+TQSKCKK  GV++Q+L+CSDE             QT+   VP S D FE+LPP+SVA+HRVRWNMNIGSERWLCYGGAAGILRC
Subjt:  ESGSEDTPM----KNKTQSKCKKKGGVDDQELECSDE-------------QTNNDRVPSSGDNFENLPPRSVALHRVRWNMNIGSERWLCYGGAAGILRC

Query:  QEIMLSALDKNLMKKK
        QEI+LS LD  LMKKK
Subjt:  QEIMLSALDKNLMKKK

TrEMBL top hitse value%identityAlignment
A0A5D3DPQ1 DNA binding protein, putative isoform 10.0e+0072.32Show/hide
Query:  EEHQHQVEASRGTSGKKGKKKPVVRDKELEKRAKKE------------AEATLVSEEQPAGRSDD--PRIKVSEFDHCVENHFRAIDTIFKLCGEAEDGD
        +E +   + ++ TS K  KK P    KE EKRAKK+              +T V+E Q   R +D  P++KVSEFD CVENHFRA+D I +LC EAE+GD
Subjt:  EEHQHQVEASRGTSGKKGKKKPVVRDKELEKRAKKE------------AEATLVSEEQPAGRSDD--PRIKVSEFDHCVENHFRAIDTIFKLCGEAEDGD

Query:  DGIDESDIQRFSSSTIFLREWRYYNYEPKTVKFAGDSSGPGGKDADITINLPQFSSAAVLKDRAPPEAAKSLDFRNFAMYVGGPIWALDWWPQVHERTDS
         GIDESDIQRFSSSTIFLREWR+YNYE KT+KFA DS+GP GKDADITINLPQFSSAAVLK  APP A+ SLDFRNFAM+VGGP+WA+DW PQVH RT+S
Subjt:  DGIDESDIQRFSSSTIFLREWRYYNYEPKTVKFAGDSSGPGGKDADITINLPQFSSAAVLKDRAPPEAAKSLDFRNFAMYVGGPIWALDWWPQVHERTDS

Query:  LIKCEFIAVSAHPPGSSYHKMGIPLTGRGMVQIWCLVHGTEIQEPTKNTEASSNLSSQQKRPRGRPPGRKKKGESNLPSKPKRPRGRPKKKQVESNDNRG
        LIKCEFIAVSAHPPGSSYHKMGIPLTGRGMVQIWCLVHGTE  EP    E  S+LSSQ K+PRGRPPGRKKK  S LPS PKRPRGRPKK+Q ES D +G
Subjt:  LIKCEFIAVSAHPPGSSYHKMGIPLTGRGMVQIWCLVHGTEIQEPTKNTEASSNLSSQQKRPRGRPPGRKKKGESNLPSKPKRPRGRPKKKQVESNDNRG

Query:  DNYQLVQTLSIECPVGSSNLLDIDGA-KHSEEPVLLGKSVERQRSTLQEDSTFNSEDEVPAQKRRVRRKVGTKNHVDDMGTLTPTENREDGSNAINLQAN
        DN QLVQ  S+E PVGSS+LL+IDG  K++E  VLL  +VER+RSTLQE ST NSEDEVPA+KRRVRRKV ++N VDD+G  + TE +EDGS A N +A+
Subjt:  DNYQLVQTLSIECPVGSSNLLDIDGA-KHSEEPVLLGKSVERQRSTLQEDSTFNSEDEVPAQKRRVRRKVGTKNHVDDMGTLTPTENREDGSNAINLQAN

Query:  DNVISEHSGEDTLLCNNISDNAVSDTSS-EFSIPESVALPRVVLCLAHNGKVAWDLKWKPTKACTIKNKHRMGYLAVLLGNGSLEVWEVPFPHVVKAIYS
        +NV SE+SGED LLC +IS+N V D SS EFSIPESVALPRVVLCLAHNGKVAWDLKWKP  ACT   KHRMGYLAVLLGNGSLEVWEVPFPH VK IYS
Subjt:  DNVISEHSGEDTLLCNNISDNAVSDTSS-EFSIPESVALPRVVLCLAHNGKVAWDLKWKPTKACTIKNKHRMGYLAVLLGNGSLEVWEVPFPHVVKAIYS

Query:  KFNGEGTDPRFMKLKPIFRCSMLRSANTQSIPLTVEWSSTPPNDYLLAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRGVAWAPSESDPETANVV
        KFNGEGTDPRF+KLKPIFRCS LR+ANTQSIPLTVEWS  PP DYLLAGCHDGTVALWKFSANS+CEDTRPLLRFSADTVPIR VAWAPSES+ E+ANV+
Subjt:  KFNGEGTDPRFMKLKPIFRCSMLRSANTQSIPLTVEWSSTPPNDYLLAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRGVAWAPSESDPETANVV

Query:  LTVGHGGLKFWDLRDPFRPLWDLHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLPKAAYDVPVTGKPFTAIKQKGLHTYFCTSSAIWSAQVSRQTGMV
        LT GHGGLKFWDLRDPFRPLWDLHPAPR+IYSLDWLP+PRCV LSFDDGTLRLLSL KAA DVP TG+PFTAIKQKGLHTY C+S AIWS QVSRQTGMV
Subjt:  LTVGHGGLKFWDLRDPFRPLWDLHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLPKAAYDVPVTGKPFTAIKQKGLHTYFCTSSAIWSAQVSRQTGMV

Query:  AYCGADGTVVRFQLTTKAVVKENSRNRTPQFVCDYFTDEQSTITIHTPESDVPVPLKKVANRSEPTPLSMRAILSE-IQSNEGNHKTATVSVSESEALAL
        AYCGADG VVRFQLTTKA  KENSR+RTP +VC+Y T+E+S IT  +P  +VP+PLKK++N+SE  PLSMRAILS+ +QSNEGNHKTAT S  E+EA ++
Subjt:  AYCGADGTVVRFQLTTKAVVKENSRNRTPQFVCDYFTDEQSTITIHTPESDVPVPLKKVANRSEPTPLSMRAILSE-IQSNEGNHKTATVSVSESEALAL

Query:  RCDDDNSVESGSEDTPM----KNKTQSKCKKKGGVDDQELECSDEQTNN---------------------DRVPSSGDNFENLPPRSVALHRVRWNMNIG
          D D  VESGSEDTP+    KN+TQ KCKKK GV++ ELEC+ E  ++                     D VPSSGD+FENLPP+SVA+HRVRWNMN+G
Subjt:  RCDDDNSVESGSEDTPM----KNKTQSKCKKKGGVDDQELECSDEQTNN---------------------DRVPSSGDNFENLPPRSVALHRVRWNMNIG

Query:  SERWLCYGGAAGILRCQEIMLSALDKNLMKKK
        SE+WLCYGGA+GILRCQE++LSALD  LMKKK
Subjt:  SERWLCYGGAAGILRCQEIMLSALDKNLMKKK

A0A6J1CU50 uncharacterized protein LOC111014310 isoform X10.0e+0075.38Show/hide
Query:  MEEHQHQVE--------ASRGTSGKKGKKK-PVVRD-KELEKRAKKEAEATLVSEEQPAGRSDDPRIKVSEFDHCVENHFRAIDTIFKLCGEAEDGDDGI
        MEEH H VE        AS  T GK+GK+K PV R  KE   RAKK+       EEQP GR D   IKV EFDHC ENHFRA+DTI +LCGEAEDGD GI
Subjt:  MEEHQHQVE--------ASRGTSGKKGKKK-PVVRD-KELEKRAKKEAEATLVSEEQPAGRSDDPRIKVSEFDHCVENHFRAIDTIFKLCGEAEDGDDGI

Query:  DESDIQRFSSSTIFLREWRYYNYEPKTVKFAGDSSGPGGKDADITINLPQFSSAAVLKDRAPPEAAKSLDFRNFAMYVGGPIWALDWWPQVHERTDSLIK
        DESDIQRFSSS  FLREWR+YNYEPKTVKFA D  G  GKD DITINLPQFSSAAVLK+  P  AA SLD+RNF MYVGGP+WALDW PQV E+TD+LIK
Subjt:  DESDIQRFSSSTIFLREWRYYNYEPKTVKFAGDSSGPGGKDADITINLPQFSSAAVLKDRAPPEAAKSLDFRNFAMYVGGPIWALDWWPQVHERTDSLIK

Query:  CEFIAVSAHPPGSSYHKMGIPLTGRGMVQIWCLVHGTEIQEP-----------TKNTEASSNLSSQQKRPRGRPPGRKKKGESNLPSKPKRPRGRPKKKQ
        CEFIAVSAHPPGSSYHKMG PL GRGMVQIWCLVHGTE  EP            K  E SS+LSSQ KRPRGRPPG KKKG S+LPS+PKRPRGRPKKKQ
Subjt:  CEFIAVSAHPPGSSYHKMGIPLTGRGMVQIWCLVHGTEIQEP-----------TKNTEASSNLSSQQKRPRGRPPGRKKKGESNLPSKPKRPRGRPKKKQ

Query:  VESNDNRGDNYQLVQTLSIECPVGSSNLLDIDG-AKHSEEPVLLGKSVERQRSTLQEDSTFNSEDEVPAQKRRVRRKVGTKNHVDDMGTLTPTENREDGS
          SNDN GDN Q+VQ+LS+E P GSSNLL+IDG  K+SEE +LLG SVERQ+STLQ  ST NS+DE PAQKRRVRRKVGTKNH+DDMGTL  T NREDGS
Subjt:  VESNDNRGDNYQLVQTLSIECPVGSSNLLDIDG-AKHSEEPVLLGKSVERQRSTLQEDSTFNSEDEVPAQKRRVRRKVGTKNHVDDMGTLTPTENREDGS

Query:  NAINLQANDNVISEHSGEDTLLCNNISDNAVSDTSSEFSIPESVALPRVVLCLAHNGKVAWDLKWKPTKACTIKNKHRMGYLAVLLGNGSLEVWEVPFPH
        + I+ Q N+NVISE+SGEDTLLCNNIS NA      EFSIPESVALPRVVLCLAHNGKVAWDLKWKP+ ACT   KHRMGYLAVLLGNGSLEVWE+PFPH
Subjt:  NAINLQANDNVISEHSGEDTLLCNNISDNAVSDTSSEFSIPESVALPRVVLCLAHNGKVAWDLKWKPTKACTIKNKHRMGYLAVLLGNGSLEVWEVPFPH

Query:  VVKAIYSKFNGEGTDPRFMKLKPIFRCSMLRSANTQSIPLTVEWSSTPPNDYLLAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRGVAWAPSESD
        VVKAIYSKFN EGTDPRF+KLKPIFR +ML+SAN QSIPLTVEWSSTPP DYL AGC+DGTVALWKFSANSTCEDTRPLLRFSADTVPIR VAWAP+ESD
Subjt:  VVKAIYSKFNGEGTDPRFMKLKPIFRCSMLRSANTQSIPLTVEWSSTPPNDYLLAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRGVAWAPSESD

Query:  PETANVVLTVGHGGLKFWDLRDPFRPLWDLHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLPKAAYDVPVTGKPFTAIKQKGLHTYFCTSSAIWSAQV
        PE+ANVVLT  HGGLKFWDLRDPFRPLWD+HPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSL KAAYDVPVTGKPFT  KQ+GLH+Y+ +S AIWS QV
Subjt:  PETANVVLTVGHGGLKFWDLRDPFRPLWDLHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLPKAAYDVPVTGKPFTAIKQKGLHTYFCTSSAIWSAQV

Query:  SRQTGMVAYCGADGTVVRFQLTTKAVVKENSRNRTPQFVCDYFTDEQSTITIHTPESDVPVPLKKVANRSEPTPLSMRAILSE-IQSNEGNHKTATVSVS
        SRQTGMVAYC ADG V+RFQLTT+AV K++SRNRTP F+C+Y T+E+S ITIH+P S VP PLKK +N+S+  PLS RAILS+ I+SNEGNHKTAT + S
Subjt:  SRQTGMVAYCGADGTVVRFQLTTKAVVKENSRNRTPQFVCDYFTDEQSTITIHTPESDVPVPLKKVANRSEPTPLSMRAILSE-IQSNEGNHKTATVSVS

Query:  ESEALALRCDDDNSVESGSEDTPM----KNKTQSKCKKKGGVDDQELECSDE----QTNNDRVPSSGDNFENLPPRSVALHRVRWNMNIGSERWLCYGGA
        E+EALA+  D+D SV+SGSEDT M    KN+TQSKCKKK  VD Q LECSDE    QT  D +P SGDNFE  PP+SVALHRVRWNMN GSERWLCYGG 
Subjt:  ESEALALRCDDDNSVESGSEDTPM----KNKTQSKCKKKGGVDDQELECSDE----QTNNDRVPSSGDNFENLPPRSVALHRVRWNMNIGSERWLCYGGA

Query:  AGILRCQEIMLSALDKNLMKKK
        AGI+RCQEI+LS  DK LM+KK
Subjt:  AGILRCQEIMLSALDKNLMKKK

A0A6J1EW91 uncharacterized protein LOC1114371040.0e+0077.79Show/hide
Query:  MAMEEHQHQVEASRGTSGKKGKKKPVVR-DKELEKRAKKEAEATLVSEEQPAGRSDDPRIKVSEFDHCVENHFRAIDTIFKLCGEAEDGDDGIDESDIQR
        MAME+  +  EAS GT   KGKKKPV R  KE E RAKK+ EAT V+EEQPAGRSD P IKV EFD+CVENHF+AIDT+ +LCG+ EDG   IDESDIQR
Subjt:  MAMEEHQHQVEASRGTSGKKGKKKPVVR-DKELEKRAKKEAEATLVSEEQPAGRSDDPRIKVSEFDHCVENHFRAIDTIFKLCGEAEDGDDGIDESDIQR

Query:  FSSSTIFLREWRYYNYEPKTVKFAGDSSGPGGKDADITINLPQFSSAAVLKDRAPPEAAKSLDFRNFAMYVGGPIWALDWWPQVHERTDSLIKCEFIAVS
        FSSSTIFLREWRYYNYEPKTVKFA DSSGP  KDADI++NLPQFSSAA LK+ APPEAA S+DFRNF MYVGGPIWALDW PQ HERTDSLIKCE+IAVS
Subjt:  FSSSTIFLREWRYYNYEPKTVKFAGDSSGPGGKDADITINLPQFSSAAVLKDRAPPEAAKSLDFRNFAMYVGGPIWALDWWPQVHERTDSLIKCEFIAVS

Query:  AHPPGSSYHKMGIPLTGRGMVQIWCLVHGTEIQ--EPTKNTEASSNLSSQQKRPRGRPPGRKKKGESNLPSKPKRPRGRPKKKQVESNDNRGDNYQLVQT
        AHPP SSYHKMGIPLTGRGMVQIWC+VHGTE    EPTKN+ ASSNLSSQ ++PRGRPPGRKK   SNLPS+PKRPRGRPKKKQ ESNDN          
Subjt:  AHPPGSSYHKMGIPLTGRGMVQIWCLVHGTEIQ--EPTKNTEASSNLSSQQKRPRGRPPGRKKKGESNLPSKPKRPRGRPKKKQVESNDNRGDNYQLVQT

Query:  LSIECPVGSSNLLDIDGAKHSEEPVLLGKSVERQRSTLQEDSTFNSEDEVPAQKRRVRRKVGTKNHVDDMGTLTPTENREDGSNAINLQANDNVISEHSG
                                     SVE  RSTLQE  T NS+DEV AQK+RVRRKV TKNHVDD+G L+ TENRE+ SNAINLQAN+NVISE+SG
Subjt:  LSIECPVGSSNLLDIDGAKHSEEPVLLGKSVERQRSTLQEDSTFNSEDEVPAQKRRVRRKVGTKNHVDDMGTLTPTENREDGSNAINLQANDNVISEHSG

Query:  EDTLLCNNISDNAVSDTSS-EFSIPESVALPRVVLCLAHNGKVAWDLKWKPTKACTIKNKHRMGYLAVLLGNGSLEVWEVPFPHVVKAIYSKFNGEGTDP
        EDTLLCNN+S+NA  D SS EFSIPESVALPRVVLCLAHNGKVAWDLKWKPT AC  K KHRMGYLAVLLGNGSLEVWEVPFPHV++AIYSK NGEGTDP
Subjt:  EDTLLCNNISDNAVSDTSS-EFSIPESVALPRVVLCLAHNGKVAWDLKWKPTKACTIKNKHRMGYLAVLLGNGSLEVWEVPFPHVVKAIYSKFNGEGTDP

Query:  RFMKLKPIFRCSMLRSANTQSIPLTVEWSSTPPNDYLLAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRGVAWAPSESDPETANVVLTVGHGGLK
        RF+KL P FRCSMLRSAN QSIPLTVEWSSTPP DYLLAGCHDGTVALWKF+ANS CEDTRPLLRFSADTVPIRGVAWAP++SDPE  NV+LT GHGGLK
Subjt:  RFMKLKPIFRCSMLRSANTQSIPLTVEWSSTPPNDYLLAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRGVAWAPSESDPETANVVLTVGHGGLK

Query:  FWDLRDPFRPLWDLHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLPKAAYDVPVTGKPFTAIKQKGLHTYFCTSSAIWSAQVSRQTGMVAYCGADGTV
        FWDLRDPFRPLWDLHPAPR+IYSLDWL DP C ILSFDDGTLRLLSLPKAA DVPVTGKPFT +KQKGLHTY CTS+AIWS QVSRQTGMVAYCGADGTV
Subjt:  FWDLRDPFRPLWDLHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLPKAAYDVPVTGKPFTAIKQKGLHTYFCTSSAIWSAQVSRQTGMVAYCGADGTV

Query:  VRFQLTTKAVVKENSRNRTPQFVCDYFTDEQSTITIHTPESDVPVPLKKVANRSEPTPLSMRAILSEIQSNEGNHKTATVSVSESEALALRCDDDNSVES
        VRFQLTTKAVVKENSRNRTPQFVCDYFT+EQSTITIHTPE DVP PLKK++NR +P PLSMRAILSEIQSNEGNHKTA  S+SE+  LA   DDD +VES
Subjt:  VRFQLTTKAVVKENSRNRTPQFVCDYFTDEQSTITIHTPESDVPVPLKKVANRSEPTPLSMRAILSEIQSNEGNHKTATVSVSESEALALRCDDDNSVES

Query:  GSEDTP----MKNKTQSKCKKKGGVDDQELECSDE----QTNNDRVPSSGDNFENLPPRSVALHRVRWNMNIGSERWLCYGGAAGILRCQEIMLSALDKN
        GSEDTP     K+KTQSKCKKKG  D+QE ECSDE     TNN   P SGD+ ENLPP+SVA+HRVRWNMN GSERWLCYGGAAG+LRCQEI+LSALDK 
Subjt:  GSEDTP----MKNKTQSKCKKKGGVDDQELECSDE----QTNNDRVPSSGDNFENLPPRSVALHRVRWNMNIGSERWLCYGGAAGILRCQEIMLSALDKN

Query:  LMKKK
        LM KK
Subjt:  LMKKK

A0A6J1F7U5 uncharacterized protein LOC111441649 isoform X10.0e+0076.13Show/hide
Query:  MEEHQHQVEASRGTSGKKGKKKPVVRDKELEKRAKKEAEATLVSEEQPAGRSDDPRIKVSEFDHCVENHFRAIDTIFKLCGEAEDGDDGIDESDIQRFSS
        MEE  HQ EAS GTS KKGKKK V  + E +KRAKK+  AT V+E QP GR DD R+KVSEFDHCVENHFRAID I +L GEAE+G+ G+DESD QRFSS
Subjt:  MEEHQHQVEASRGTSGKKGKKKPVVRDKELEKRAKKEAEATLVSEEQPAGRSDDPRIKVSEFDHCVENHFRAIDTIFKLCGEAEDGDDGIDESDIQRFSS

Query:  STIFLREWRYYNYEPKTVKFAGDSSGPGGKDADITINLPQFSSAAVLKDRAPPEAAKSLDFRNFAMYVGGPIWALDWWPQVHERTDSLIKCEFIAVSAHP
        ST FLREW++YNYEPKTVKF  DS  P GKDADIT+ LPQFSSAAVLK+ APP A  SLDFRNF M+VGGP+WA+DW P VHERTDSLIKCEFIAVSAHP
Subjt:  STIFLREWRYYNYEPKTVKFAGDSSGPGGKDADITINLPQFSSAAVLKDRAPPEAAKSLDFRNFAMYVGGPIWALDWWPQVHERTDSLIKCEFIAVSAHP

Query:  PGSSYHKMGIPLTGRGMVQIWCLVHGTEI--QEPTKNTEASSNLSSQQKRPRGRPPGRKKKGESNLPSKPKRPRGRPKKKQVESN-DNRGDNYQLVQTLS
        PGSSYH MGIPL+GRGMVQIWCLVHGTE    E T  TE   +  SQ KRPRGRPPGRKK G S LPS+PKRPRGRPKKKQ E N DN+  +YQLVQ LS
Subjt:  PGSSYHKMGIPLTGRGMVQIWCLVHGTEI--QEPTKNTEASSNLSSQQKRPRGRPPGRKKKGESNLPSKPKRPRGRPKKKQVESN-DNRGDNYQLVQTLS

Query:  IECPVGSSNLLDIDGAKH-SEEPVLLGKSVERQRSTLQEDSTFNSEDEVPAQKRRVRRKVGTKNHVDDMGTLTPTENREDGSNAINLQANDNVISEHSGE
        +E P  SSNLL+ID   H SE+PV L  SVER  ST++E ST NSEDEVP QKRRVRR   TKNHVDD+GTL+  ENREDGSNA N +AN+NV SE+SGE
Subjt:  IECPVGSSNLLDIDGAKH-SEEPVLLGKSVERQRSTLQEDSTFNSEDEVPAQKRRVRRKVGTKNHVDDMGTLTPTENREDGSNAINLQANDNVISEHSGE

Query:  DTLLCNNISDNAVSDT-SSEFSIPESVALPRVVLCLAHNGKVAWDLKWKPTKACTIKNKHRMGYLAVLLGNGSLEVWEVPFPHVVKAIYSKFNGEGTDPR
        DT LC NIS+ A+ DT S+ FSIPE+VALPR+VLCLAHNGKVAWDLKWKPT A T K K RMGYLAVLLGNGSLEVWEVPFPHVVKAIYSK NGEGTDPR
Subjt:  DTLLCNNISDNAVSDT-SSEFSIPESVALPRVVLCLAHNGKVAWDLKWKPTKACTIKNKHRMGYLAVLLGNGSLEVWEVPFPHVVKAIYSKFNGEGTDPR

Query:  FMKLKPIFRCSMLRSANTQSIPLTVEWSSTPPNDYLLAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRGVAWAPSESDPETANVVLTVGHGGLKF
        F+KLKP FRCSMLRSA+TQSIPLTVEWS TPP DYLLAGCHDGTVALWKFSA+ST EDTRPLLRFSADTVPIR VAWAPSES+PE+ NV+L   HGG+KF
Subjt:  FMKLKPIFRCSMLRSANTQSIPLTVEWSSTPPNDYLLAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRGVAWAPSESDPETANVVLTVGHGGLKF

Query:  WDLRDPFRPLWDLHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLPKAAYDVPVTGKPFTAIKQKGLHTYFCTSSAIWSAQVSRQTGMVAYCGADGTVV
        WDLRDPFRPLWDLHPAPR+IYSLDWLP+PRCV LSFDDGTLRLLSL KAAYDVPVTG+PFTAIKQKGLHTY C+  AIWS QVSRQTGMVAYCGADG VV
Subjt:  WDLRDPFRPLWDLHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLPKAAYDVPVTGKPFTAIKQKGLHTYFCTSSAIWSAQVSRQTGMVAYCGADGTVV

Query:  RFQLTTKAVVKENSRNRTPQFVCDYFTDEQSTITIHTPESDVPVPLKKVANRSEPTPLSMRAILSE-IQSNEGNHKTATVSVSESEALALRCDDDNSVES
        RFQLTTKAV KENSRNRTP FVC+Y T+EQS ITIH+P SDVP+PLKK++N+SE  PLSMRAILS+ +Q NEGN K+AT S  E+E+ AL  DDD  VES
Subjt:  RFQLTTKAVVKENSRNRTPQFVCDYFTDEQSTITIHTPESDVPVPLKKVANRSEPTPLSMRAILSE-IQSNEGNHKTATVSVSESEALALRCDDDNSVES

Query:  GSEDTPM----KNKTQSKCKKKGGVDDQEL----ECSDEQTNNDRVPSSGDNFENLPPRSVALHRVRWNMNIGSERWLCYGGAAGILRCQEIMLSALDKN
        GSEDTPM    KN+TQSK KKK GV +QEL    E SD QT++D VP  G++FEN PP+SVALHR+RWNMNIGSERWL YGGAAGILRCQEI+LSALDK 
Subjt:  GSEDTPM----KNKTQSKCKKKGGVDDQEL----ECSDEQTNNDRVPSSGDNFENLPPRSVALHRVRWNMNIGSERWLCYGGAAGILRCQEIMLSALDKN

Query:  LMKKK
        LM KK
Subjt:  LMKKK

A0A6J1KYL7 uncharacterized protein LOC1114987490.0e+0077.88Show/hide
Query:  MAMEEHQHQVEASRGTSGKKGKKKPVVR-DKELEKRAKKEAEATLVSEEQPAGRSDDPRIKVSEFDHCVENHFRAIDTIFKLCGEAEDGDDGIDESDIQR
        MAME+  +  EAS GT   KGKKKPV R  KE E RAKK+ EAT V+EEQ AGRSD P IKV EFDHCVENHF+AIDT+ +LCG+ EDG   IDE DIQR
Subjt:  MAMEEHQHQVEASRGTSGKKGKKKPVVR-DKELEKRAKKEAEATLVSEEQPAGRSDDPRIKVSEFDHCVENHFRAIDTIFKLCGEAEDGDDGIDESDIQR

Query:  FSSSTIFLREWRYYNYEPKTVKFAGDSSGPGGKDADITINLPQFSSAAVLKDRAPPEAAKSLDFRNFAMYVGGPIWALDWWPQVHERTDSLIKCEFIAVS
        FSSSTIFLREWRYYNYEPKTVKFA DS GP  KDADI++NLPQFSSAA LK+ APPEAA SLDFRNF MYVGGPIWALDW PQ HERTDSLIKCE+IAVS
Subjt:  FSSSTIFLREWRYYNYEPKTVKFAGDSSGPGGKDADITINLPQFSSAAVLKDRAPPEAAKSLDFRNFAMYVGGPIWALDWWPQVHERTDSLIKCEFIAVS

Query:  AHPPGSSYHKMGIPLTGRGMVQIWCLVHGTEIQ--EPTKNTEASSNLSSQQKRPRGRPPGRKKKGESNLPSKPKRPRGRPKKKQVESNDNRGDNYQLVQT
        AHPP SSYHKMGIPLTGRGMVQIWC+VHGTE    EPTKN+ ASSNLSSQ ++PRGRPPGRKK   SNLPS+PKRPRGRPKKKQ ESNDN          
Subjt:  AHPPGSSYHKMGIPLTGRGMVQIWCLVHGTEIQ--EPTKNTEASSNLSSQQKRPRGRPPGRKKKGESNLPSKPKRPRGRPKKKQVESNDNRGDNYQLVQT

Query:  LSIECPVGSSNLLDIDGAKHSEEPVLLGKSVERQRSTLQEDSTFNSEDEVPAQKRRVRRKVGTKNHVDDMGTLTPTENREDGSNAINLQANDNVISEHSG
                                     SVE  RSTLQE  T NS+DEV AQK+R+RRKV TKNHVDD+GTL  TENRE+ SNAINLQAN+NVISE+SG
Subjt:  LSIECPVGSSNLLDIDGAKHSEEPVLLGKSVERQRSTLQEDSTFNSEDEVPAQKRRVRRKVGTKNHVDDMGTLTPTENREDGSNAINLQANDNVISEHSG

Query:  EDTLLCNNISDNAVSDTSSEFSIPESVALPRVVLCLAHNGKVAWDLKWKPTKACTIKNKHRMGYLAVLLGNGSLEVWEVPFPHVVKAIYSKFNGEGTDPR
        EDTLLCNN+S+NA  D SS FSIPESVALPRVVLCLAHNGKVAWDLKWKPT AC  K KHRMGYLA+LLGNGSLEVWEVPFPHV++AIYSK NGEGTDPR
Subjt:  EDTLLCNNISDNAVSDTSSEFSIPESVALPRVVLCLAHNGKVAWDLKWKPTKACTIKNKHRMGYLAVLLGNGSLEVWEVPFPHVVKAIYSKFNGEGTDPR

Query:  FMKLKPIFRCSMLRSANTQSIPLTVEWSSTPPNDYLLAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRGVAWAPSESDPETANVVLTVGHGGLKF
        F+KLKP FRCSMLRSAN QSIPLTVEWSSTPP DYLLAGCHDGTVALWKFSANS CEDTRPLLRFSADTVPIRGVAWAP++SDPE  NV+LT GHGGLKF
Subjt:  FMKLKPIFRCSMLRSANTQSIPLTVEWSSTPPNDYLLAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRGVAWAPSESDPETANVVLTVGHGGLKF

Query:  WDLRDPFRPLWDLHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLPKAAYDVPVTGKPFTAIKQKGLHTYFCTSSAIWSAQVSRQTGMVAYCGADGTVV
        WDLRDPFRPLWDLHPAPR+IYSLDWL DPRC ILSFDDGTLRLLSLPKAA DVPVTGKPFT +KQKGLHTY CTS+AIWS QVSRQTGMVAYCGADGTVV
Subjt:  WDLRDPFRPLWDLHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLPKAAYDVPVTGKPFTAIKQKGLHTYFCTSSAIWSAQVSRQTGMVAYCGADGTVV

Query:  RFQLTTKAVVKENSRNRTPQFVCDYFTDEQSTITIHTPESDVPVPLKKVANRSEPTPLSMRAILSEIQSNEGNHKTATVSVSESEALALRCDDDNSVESG
        RFQLTTKAVVKENSRNRTPQFVCDYFT+EQSTITIHTPE DVP PLKK++NR +P PLSMRAILSEIQSNEGNHKTA  S+SE+  LA   DDD +VESG
Subjt:  RFQLTTKAVVKENSRNRTPQFVCDYFTDEQSTITIHTPESDVPVPLKKVANRSEPTPLSMRAILSEIQSNEGNHKTATVSVSESEALALRCDDDNSVESG

Query:  SEDTP----MKNKTQSKCKKKGGVDDQELECSDE----QTNNDRVPSSGDNFENLPPRSVALHRVRWNMNIGSERWLCYGGAAGILRCQEIMLSALDKNL
        SEDTP     K+KTQSKCKKKG  D+QE ECSDE     TNN   P SGD+ ENLPP+SVA+HRVRWNMN GSERWLCYGGAAG+LRCQEI+LSALDK L
Subjt:  SEDTP----MKNKTQSKCKKKGGVDDQELECSDE----QTNNDRVPSSGDNFENLPPRSVALHRVRWNMNIGSERWLCYGGAAGILRCQEIMLSALDKNL

Query:  MKKK
        M KK
Subjt:  MKKK

SwissProt top hitse value%identityAlignment
Q5RDC3 General transcription factor 3C polypeptide 22.1e-0923.03Show/hide
Query:  AWDLKWKPTKACTIKNKHRMGYLAVLLGNGSLEVWEVPFPHVVKAIYSKFNGEGTDPRFMKLKPIFRCSMLRSANTQSIP-------LTVEWSSTPPNDY
        AW+L   P KA  +    R+G LA+   +G + ++ +P P   +A+ ++   +   P   K++ +   + L+  + Q+         L++ W  T P+ +
Subjt:  AWDLKWKPTKACTIKNKHRMGYLAVLLGNGSLEVWEVPFPHVVKAIYSKFNGEGTDPRFMKLKPIFRCSMLRSANTQSIP-------LTVEWSSTPPNDY

Query:  LLAGCHDGTVALWKFSANSTCEDTR---------PLLRFSADTVPIRGVAWAPSESDPETANVVLTVGHG-GLKFWDLRDPFRPLWDLHPAPRMIYSLDW
        L AG ++G V  W    NS  +  R         P   F A    +R + W  + S     + + + G    +KFWDLR P+ P+  +     +   L W
Subjt:  LLAGCHDGTVALWKFSANSTCEDTR---------PLLRFSADTVPIRGVAWAPSESDPETANVVLTVGHG-GLKFWDLRDPFRPLWDLHPAPRMIYSLDW

Query:  LPDPRCVILSFDDGTLRLLSLPKAAYDVPVTGKPFTAIKQKGLHTYFCT--SSAIWSAQVSRQTGMVAYCGADGTVVRFQLTTKAV----VKENSRNRTP
        L     V ++ D+          A+Y   + G  +      G   YF       +WS   S   G +A     G ++   L   A+    VK     R P
Subjt:  LPDPRCVILSFDDGTLRLLSLPKAAYDVPVTGKPFTAIKQKGLHTYFCT--SSAIWSAQVSRQTGMVAYCGADGTVVRFQLTTKAV----VKENSRNRTP

Query:  QFVCDY--FTDEQSTITIHTPESDVPVPLK
         +  D   + D        +  S VP P K
Subjt:  QFVCDY--FTDEQSTITIHTPESDVPVPLK

Q8BL74 General transcription factor 3C polypeptide 22.1e-0923.15Show/hide
Query:  WDLKWKPTKAC----TIKNK---HRMGYLAVLLGNGSLEVWEVPFPHVVKAIYSKFNGEGTDPRFMKLKPIFRCSMLRSANTQSIP-------LTVEWSS
        WDLK+ P+ A     T++      R+G LA+   +G + ++ +P P   +A+ ++   +   P   K++ +   + L+  + Q+         L++ W  
Subjt:  WDLKWKPTKAC----TIKNK---HRMGYLAVLLGNGSLEVWEVPFPHVVKAIYSKFNGEGTDPRFMKLKPIFRCSMLRSANTQSIP-------LTVEWSS

Query:  TPPNDYLLAGCHDGTVALWKFSANSTCEDTR---------PLLRFSADTVPIRGVAWAPSESDPETANVVLTVGHG-GLKFWDLRDPFRPLWDLHPAPRM
        T P+ +L AG ++G V  W    NS  +  R         P   F A    +R + W  + S     + +++ G    +KFWDLR P+ P   ++   R 
Subjt:  TPPNDYLLAGCHDGTVALWKFSANSTCEDTR---------PLLRFSADTVPIRGVAWAPSESDPETANVVLTVGHG-GLKFWDLRDPFRPLWDLHPAPRM

Query:  IYS-LDWLPDPRCVILSFDDGTLRLLSLPKAAYDVPVTGKPFTAIKQKGLHTYFCT--SSAIWSAQVSRQTGMVAYCGADGTVVRFQLTTKAV----VKE
        + + L WL     V ++ D+          A+Y   + G  +      G   YF       +WS   S   G VA     G ++   L   A     VK+
Subjt:  IYS-LDWLPDPRCVILSFDDGTLRLLSLPKAAYDVPVTGKPFTAIKQKGLHTYFCT--SSAIWSAQVSRQTGMVAYCGADGTVVRFQLTTKAV----VKE

Query:  NSRNRTPQFVCDY--FTDEQSTITIHTPESDVPVPLK
         +  R P +  D   + D        +  S+ P P K
Subjt:  NSRNRTPQFVCDY--FTDEQSTITIHTPESDVPVPLK

Q8WUA4 General transcription factor 3C polypeptide 21.2e-0923.03Show/hide
Query:  AWDLKWKPTKACTIKNKHRMGYLAVLLGNGSLEVWEVPFPHVVKAIYSKFNGEGTDPRFMKLKPIFRCSMLRSANTQSIP-------LTVEWSSTPPNDY
        AW+L   P KA  +    R+G LA+   +G + ++ +P P   +A+ ++   +   P   K++ +   + L+  + Q+         L++ W  T P+ +
Subjt:  AWDLKWKPTKACTIKNKHRMGYLAVLLGNGSLEVWEVPFPHVVKAIYSKFNGEGTDPRFMKLKPIFRCSMLRSANTQSIP-------LTVEWSSTPPNDY

Query:  LLAGCHDGTVALWKFSANSTCEDTR---------PLLRFSADTVPIRGVAWAPSESDPETANVVLTVGHG-GLKFWDLRDPFRPLWDLHPAPRMIYSLDW
        L AG ++G V  W    NS  +  R         P   F A    +R + W  + S     + +++ G    +KFWDLR P+ P+  +     +   L W
Subjt:  LLAGCHDGTVALWKFSANSTCEDTR---------PLLRFSADTVPIRGVAWAPSESDPETANVVLTVGHG-GLKFWDLRDPFRPLWDLHPAPRMIYSLDW

Query:  LPDPRCVILSFDDGTLRLLSLPKAAYDVPVTGKPFTAIKQKGLHTYFCT--SSAIWSAQVSRQTGMVAYCGADGTVVRFQLTTKAV----VKENSRNRTP
        L     V ++ D+          A+Y   + G  +      G   YF       +WS   S   G +A     G ++   L   A+    VK     R P
Subjt:  LPDPRCVILSFDDGTLRLLSLPKAAYDVPVTGKPFTAIKQKGLHTYFCT--SSAIWSAQVSRQTGMVAYCGADGTVVRFQLTTKAV----VKENSRNRTP

Query:  QFVCDY--FTDEQSTITIHTPESDVPVPLK
         +  D   + D        +  S VP P K
Subjt:  QFVCDY--FTDEQSTITIHTPESDVPVPLK

Arabidopsis top hitse value%identityAlignment
AT1G19485.1 Transducin/WD40 repeat-like superfamily protein2.5e-19945.4Show/hide
Query:  DDPRIKVSEFDHCVENHFRAIDTIFKLCGEAEDGDDGIDESDIQRFSSSTIFLREWRYYNYEPKTVKFAGDSSGPGGKDADITINLPQFSSAAVLKDRAP
        D     +S FD+  E+H +A+++I  LCGEA   +  IDE+DI   SSS  FLREWR+YN+EPK+  F  ++          +  LPQFSSA   K +  
Subjt:  DDPRIKVSEFDHCVENHFRAIDTIFKLCGEAEDGDDGIDESDIQRFSSSTIFLREWRYYNYEPKTVKFAGDSSGPGGKDADITINLPQFSSAAVLKDRAP

Query:  PEAAKSLD--FRNFAMYVGGPIWALDWWPQVHERTDSLIKCEFIAVSAHPPGSSYHKMGIPLTGRGMVQIWCLVHGT------EIQEPTKNTEASSNL--
         + + S     ++F M+VGG +WA++W P+VH   D+  KCEF+AV+ HPP S  HK+GIPL GRG++QIWC+++ T      ++ +  K     S    
Subjt:  PEAAKSLD--FRNFAMYVGGPIWALDWWPQVHERTDSLIKCEFIAVSAHPPGSSYHKMGIPLTGRGMVQIWCLVHGT------EIQEPTKNTEASSNL--

Query:  ------SSQQKRPRGRPPGRKKKGESNLPSKPKRPRGRPKKKQ-VESNDNRGDNYQLVQTLSIECPVGSSNLLDIDGAKHSEEPVLLGKSVERQRSTLQE
              +++ K+PRGRP  RK   E+   ++PK+PRGRP+KK   E      D+   V+ LS+  P   ++++     +   E  +    V  + S  Q 
Subjt:  ------SSQQKRPRGRPPGRKKKGESNLPSKPKRPRGRPKKKQ-VESNDNRGDNYQLVQTLSIECPVGSSNLLDIDGAKHSEEPVLLGKSVERQRSTLQE

Query:  DSTFNSEDEVPAQKRRVRRKVGTKNHVDDMGTLTPTENREDGSNAINLQANDNVISEHSGEDTLLCNNISDNAVSDTSSEFSIPESVALPRVVLCLAHNG
         S+ N+  ++P +++R +                 T++ E+    + L+ ++ V                 N  S  SS  S  + VALPRVVLCLAHNG
Subjt:  DSTFNSEDEVPAQKRRVRRKVGTKNHVDDMGTLTPTENREDGSNAINLQANDNVISEHSGEDTLLCNNISDNAVSDTSSEFSIPESVALPRVVLCLAHNG

Query:  KVAWDLKWKPTKACTIKNKHRMGYLAVLLGNGSLEVWEVPFPHVVKAIYSKFNGEGTDPRFMKLKPIFRCSMLRSANTQSIPLTVEWSSTPPNDYLLAGC
        KV WD+KW+P+ A    NKH MGYLAVLLGNGSLEVW+VP P    A+Y       TDPRF+KL P+F+CS L+  +T+SIPLTVEWS+    D+LLAGC
Subjt:  KVAWDLKWKPTKACTIKNKHRMGYLAVLLGNGSLEVWEVPFPHVVKAIYSKFNGEGTDPRFMKLKPIFRCSMLRSANTQSIPLTVEWSSTPPNDYLLAGC

Query:  HDGTVALWKFSANSTCEDTRPLLRFSADTVPIRGVAWAPSESDPETANVVLTVGHGGLKFWDLRDPFRPLWDLHPAPRMIYSLDWLPDPRCVILSFDDGT
        HDGTVALWKFS   + EDTRPLL FSADT PIR VAWAP ESD E+AN+V T GH GLKFWDLRDPFRPLWDLHP PR IYSLDWL DP CV+LSFDDGT
Subjt:  HDGTVALWKFSANSTCEDTRPLLRFSADTVPIRGVAWAPSESDPETANVVLTVGHGGLKFWDLRDPFRPLWDLHPAPRMIYSLDWLPDPRCVILSFDDGT

Query:  LRLLSLPKAAYDVPVTGKPFTAIKQKGLHTYFCTSSAIWSAQVSRQTGMVAYCGADGTVVRFQLTTKAVVKENSRNRTPQFVCDYFTDEQSTITIHTPES
        LR+LSL K AYDVP TG+P+   KQ+GL  Y C++  IWS QVSR TG+ AYC ADG++  F+LTTKAV K+ +RNRTP ++C   T + ST  +H+P  
Subjt:  LRLLSLPKAAYDVPVTGKPFTAIKQKGLHTYFCTSSAIWSAQVSRQTGMVAYCGADGTVVRFQLTTKAVVKENSRNRTPQFVCDYFTDEQSTITIHTPES

Query:  DVPVPLKKVANRSEPTPLSMRAILSEIQSNEGNHKTATVSVSESEALALRCDDDNSVESGSEDTPMKNKTQSKCKKKGGVDDQELE---------CSDEQ
        D+P+ LKK    +      +R++L+E  S   +      +VS+ + LA    +D  +ES SE T   N   +K K K G ++   E         C  E 
Subjt:  DVPVPLKKVANRSEPTPLSMRAILSEIQSNEGNHKTATVSVSESEALALRCDDDNSVESGSEDTPMKNKTQSKCKKKGGVDDQELE---------CSDEQ

Query:  TNNDR-------VPSSGDNFENLPPRSVALHRVRWNMNIGSERWLCYGGAAGILRCQEI
           +          S+G   E  PP+ VA+HRVRWNMN GSERWLCYGGAAGI+RCQEI
Subjt:  TNNDR-------VPSSGDNFENLPPRSVALHRVRWNMNIGSERWLCYGGAAGILRCQEI

AT1G19485.2 Transducin/WD40 repeat-like superfamily protein2.5e-19945.4Show/hide
Query:  DDPRIKVSEFDHCVENHFRAIDTIFKLCGEAEDGDDGIDESDIQRFSSSTIFLREWRYYNYEPKTVKFAGDSSGPGGKDADITINLPQFSSAAVLKDRAP
        D     +S FD+  E+H +A+++I  LCGEA   +  IDE+DI   SSS  FLREWR+YN+EPK+  F  ++          +  LPQFSSA   K +  
Subjt:  DDPRIKVSEFDHCVENHFRAIDTIFKLCGEAEDGDDGIDESDIQRFSSSTIFLREWRYYNYEPKTVKFAGDSSGPGGKDADITINLPQFSSAAVLKDRAP

Query:  PEAAKSLD--FRNFAMYVGGPIWALDWWPQVHERTDSLIKCEFIAVSAHPPGSSYHKMGIPLTGRGMVQIWCLVHGT------EIQEPTKNTEASSNL--
         + + S     ++F M+VGG +WA++W P+VH   D+  KCEF+AV+ HPP S  HK+GIPL GRG++QIWC+++ T      ++ +  K     S    
Subjt:  PEAAKSLD--FRNFAMYVGGPIWALDWWPQVHERTDSLIKCEFIAVSAHPPGSSYHKMGIPLTGRGMVQIWCLVHGT------EIQEPTKNTEASSNL--

Query:  ------SSQQKRPRGRPPGRKKKGESNLPSKPKRPRGRPKKKQ-VESNDNRGDNYQLVQTLSIECPVGSSNLLDIDGAKHSEEPVLLGKSVERQRSTLQE
              +++ K+PRGRP  RK   E+   ++PK+PRGRP+KK   E      D+   V+ LS+  P   ++++     +   E  +    V  + S  Q 
Subjt:  ------SSQQKRPRGRPPGRKKKGESNLPSKPKRPRGRPKKKQ-VESNDNRGDNYQLVQTLSIECPVGSSNLLDIDGAKHSEEPVLLGKSVERQRSTLQE

Query:  DSTFNSEDEVPAQKRRVRRKVGTKNHVDDMGTLTPTENREDGSNAINLQANDNVISEHSGEDTLLCNNISDNAVSDTSSEFSIPESVALPRVVLCLAHNG
         S+ N+  ++P +++R +                 T++ E+    + L+ ++ V                 N  S  SS  S  + VALPRVVLCLAHNG
Subjt:  DSTFNSEDEVPAQKRRVRRKVGTKNHVDDMGTLTPTENREDGSNAINLQANDNVISEHSGEDTLLCNNISDNAVSDTSSEFSIPESVALPRVVLCLAHNG

Query:  KVAWDLKWKPTKACTIKNKHRMGYLAVLLGNGSLEVWEVPFPHVVKAIYSKFNGEGTDPRFMKLKPIFRCSMLRSANTQSIPLTVEWSSTPPNDYLLAGC
        KV WD+KW+P+ A    NKH MGYLAVLLGNGSLEVW+VP P    A+Y       TDPRF+KL P+F+CS L+  +T+SIPLTVEWS+    D+LLAGC
Subjt:  KVAWDLKWKPTKACTIKNKHRMGYLAVLLGNGSLEVWEVPFPHVVKAIYSKFNGEGTDPRFMKLKPIFRCSMLRSANTQSIPLTVEWSSTPPNDYLLAGC

Query:  HDGTVALWKFSANSTCEDTRPLLRFSADTVPIRGVAWAPSESDPETANVVLTVGHGGLKFWDLRDPFRPLWDLHPAPRMIYSLDWLPDPRCVILSFDDGT
        HDGTVALWKFS   + EDTRPLL FSADT PIR VAWAP ESD E+AN+V T GH GLKFWDLRDPFRPLWDLHP PR IYSLDWL DP CV+LSFDDGT
Subjt:  HDGTVALWKFSANSTCEDTRPLLRFSADTVPIRGVAWAPSESDPETANVVLTVGHGGLKFWDLRDPFRPLWDLHPAPRMIYSLDWLPDPRCVILSFDDGT

Query:  LRLLSLPKAAYDVPVTGKPFTAIKQKGLHTYFCTSSAIWSAQVSRQTGMVAYCGADGTVVRFQLTTKAVVKENSRNRTPQFVCDYFTDEQSTITIHTPES
        LR+LSL K AYDVP TG+P+   KQ+GL  Y C++  IWS QVSR TG+ AYC ADG++  F+LTTKAV K+ +RNRTP ++C   T + ST  +H+P  
Subjt:  LRLLSLPKAAYDVPVTGKPFTAIKQKGLHTYFCTSSAIWSAQVSRQTGMVAYCGADGTVVRFQLTTKAVVKENSRNRTPQFVCDYFTDEQSTITIHTPES

Query:  DVPVPLKKVANRSEPTPLSMRAILSEIQSNEGNHKTATVSVSESEALALRCDDDNSVESGSEDTPMKNKTQSKCKKKGGVDDQELE---------CSDEQ
        D+P+ LKK    +      +R++L+E  S   +      +VS+ + LA    +D  +ES SE T   N   +K K K G ++   E         C  E 
Subjt:  DVPVPLKKVANRSEPTPLSMRAILSEIQSNEGNHKTATVSVSESEALALRCDDDNSVESGSEDTPMKNKTQSKCKKKGGVDDQELE---------CSDEQ

Query:  TNNDR-------VPSSGDNFENLPPRSVALHRVRWNMNIGSERWLCYGGAAGILRCQEI
           +          S+G   E  PP+ VA+HRVRWNMN GSERWLCYGGAAGI+RCQEI
Subjt:  TNNDR-------VPSSGDNFENLPPRSVALHRVRWNMNIGSERWLCYGGAAGILRCQEI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAATGGAAGAACATCAGCATCAAGTGGAAGCGTCCAGAGGTACTAGCGGCAAGAAAGGTAAGAAGAAACCAGTGGTTCGGGATAAGGAGCTGGAGAAAAGAGCTAA
GAAGGAAGCGGAGGCTACTTTAGTCAGCGAAGAGCAACCTGCTGGACGTTCTGATGACCCCAGGATCAAGGTTTCGGAGTTTGATCACTGTGTTGAAAATCATTTTAGAG
CCATTGATACAATTTTCAAGCTCTGTGGTGAGGCAGAGGATGGCGATGACGGAATTGACGAAAGTGACATTCAACGATTTTCATCATCCACCATTTTCTTAAGGGAATGG
AGGTACTACAATTATGAACCGAAAACTGTCAAGTTTGCTGGCGACTCGAGTGGCCCTGGGGGAAAGGATGCTGACATCACAATAAACTTACCGCAGTTTTCTTCTGCCGC
TGTTCTAAAGGACAGAGCACCGCCTGAAGCTGCCAAATCTTTGGATTTCAGAAATTTTGCTATGTATGTTGGTGGGCCTATTTGGGCTTTAGATTGGTGGCCTCAAGTTC
ATGAAAGGACAGACTCTCTTATCAAATGTGAGTTTATTGCTGTGTCTGCTCATCCACCTGGCTCTTCTTATCACAAGATGGGTATCCCACTAACTGGAAGAGGTATGGTG
CAAATATGGTGCTTAGTGCATGGCACTGAAATCCAAGAACCGACCAAGAATACTGAGGCATCATCAAACTTATCATCTCAACAAAAGAGGCCTAGAGGAAGGCCACCAGG
GCGTAAGAAAAAGGGGGAATCAAACTTGCCATCTAAACCAAAGAGGCCTAGAGGAAGACCTAAAAAGAAACAAGTAGAATCCAATGATAACAGGGGTGACAACTACCAAC
TTGTCCAGACCCTATCTATTGAATGCCCAGTTGGTTCATCCAACTTGCTTGATATTGATGGTGCCAAACATTCTGAAGAACCTGTATTACTGGGAAAGAGTGTTGAAAGA
CAGAGGAGTACCTTACAAGAAGATTCTACCTTCAATTCTGAAGATGAAGTTCCTGCGCAAAAGAGAAGAGTGAGAAGAAAAGTTGGGACCAAGAATCATGTTGATGACAT
GGGAACGTTAACACCTACAGAGAATCGAGAAGATGGATCCAATGCTATCAATCTTCAGGCAAATGATAATGTTATAAGTGAACATTCTGGGGAAGACACTCTATTATGTA
ATAATATTTCAGATAATGCTGTTTCAGACACCAGTTCTGAATTTTCTATTCCCGAGAGTGTTGCTTTGCCTAGAGTAGTATTGTGCTTAGCTCACAATGGAAAGGTAGCA
TGGGATTTGAAATGGAAGCCAACTAAGGCGTGTACTATCAAAAACAAGCATCGAATGGGTTACCTTGCTGTCTTGCTGGGCAACGGATCTCTAGAAGTCTGGGAGGTTCC
TTTTCCCCATGTAGTGAAGGCCATCTATTCTAAATTCAATGGGGAGGGTACAGACCCTCGCTTTATGAAGTTGAAGCCTATTTTCAGATGCTCGATGTTGAGAAGTGCAA
ATACACAGAGCATCCCTCTGACTGTGGAATGGTCATCAACACCTCCTAATGATTATCTACTCGCTGGATGCCATGATGGAACAGTCGCCTTGTGGAAGTTTTCTGCAAAT
AGTACATGTGAAGATACGAGGCCTTTACTTCGTTTTAGTGCAGATACAGTTCCAATAAGAGGTGTTGCATGGGCACCAAGTGAAAGCGATCCCGAAACTGCAAATGTGGT
GCTTACGGTTGGTCATGGAGGTTTAAAATTTTGGGACCTAAGAGATCCTTTCCGTCCTTTGTGGGACCTTCATCCAGCGCCAAGGATGATATATAGTCTTGATTGGCTTC
CTGATCCTAGATGCGTTATTTTATCCTTTGATGATGGAACCTTGAGACTTCTCAGTTTGCCGAAGGCTGCGTATGATGTTCCAGTAACGGGTAAACCCTTCACAGCAATA
AAACAAAAAGGTTTACACACTTACTTTTGTACATCATCTGCTATCTGGAGTGCTCAAGTGTCAAGGCAGACAGGCATGGTTGCATACTGTGGTGCTGATGGAACTGTTGT
TCGTTTTCAGCTTACAACCAAAGCAGTGGTGAAAGAAAATTCACGTAATCGAACCCCACAGTTTGTATGCGATTACTTTACTGACGAGCAGTCAACTATTACAATCCACA
CTCCAGAATCAGATGTTCCAGTTCCTTTGAAGAAGGTTGCCAACAGATCTGAACCCACGCCATTGTCCATGCGAGCTATTTTATCTGAAATTCAGTCAAATGAAGGAAAT
CACAAAACCGCCACTGTTTCAGTATCAGAAAGTGAAGCATTAGCCCTTCGCTGCGACGATGATAATAGTGTTGAATCTGGATCTGAGGATACACCGATGAAAAACAAAAC
TCAATCAAAGTGCAAGAAGAAGGGTGGAGTGGATGACCAAGAATTGGAATGTAGTGATGAGCAAACGAATAATGACAGAGTGCCTAGTTCGGGTGATAACTTTGAAAATT
TGCCTCCCAGATCAGTTGCATTGCATAGAGTGAGATGGAACATGAACATAGGGAGTGAAAGATGGTTGTGTTATGGTGGAGCAGCTGGAATTCTACGCTGTCAGGAGATC
ATGTTGTCTGCCCTCGATAAGAATTTGATGAAGAAGAAATGA
mRNA sequenceShow/hide mRNA sequence
GGAAGACAAGTAGGCCTAGGGTTTTGCAAACAGAATATTTCGTCTTTTTCTGAGCACGCTTACCGAGACTTCCTTCATCATCCTCTCCATCCACCCTCCTTCGTTTCGAG
CCCAGTTTCGCGCCCCACTGCAAGTTGCCCATCGACCACACAACAGTCAGTCGACACGCTCCGGTCCGACCAACCTCGCAGATCTGTTCGTCACGCCGTCGCCTGCCTTC
AGCCGGAGTTCCGTCCGCGACCGCGACCACACGCAGGAGTCTCGATCGGCGTCAGGCTTGCAGTCGCGACCCACTGTCGACTAACCGAGCTTCACCGCCGCCAAAGTTGA
ACATAACTGGTTAAATTATTATATTCTTAATGGGAGGTCAAGGTTCAAGTTAAAAAAAAAGAAATATTCTTTTTTTAAAATCCCAAAACGACGTGTTTTGCTTGTGAAGA
GTCTGGTAGACTGATGCTTAGGGTTTGAATCTCCGTAAACATATAGCCTTCAACAAATCAAACTTGAAGATGGATGGCTGGGGATTGAATCTCCTCAAGCTTTAATAAAA
ATTGGAATTTTCAAAGCTTTACAGAGAATGGCAATGGAAGAACATCAGCATCAAGTGGAAGCGTCCAGAGGTACTAGCGGCAAGAAAGGTAAGAAGAAACCAGTGGTTCG
GGATAAGGAGCTGGAGAAAAGAGCTAAGAAGGAAGCGGAGGCTACTTTAGTCAGCGAAGAGCAACCTGCTGGACGTTCTGATGACCCCAGGATCAAGGTTTCGGAGTTTG
ATCACTGTGTTGAAAATCATTTTAGAGCCATTGATACAATTTTCAAGCTCTGTGGTGAGGCAGAGGATGGCGATGACGGAATTGACGAAAGTGACATTCAACGATTTTCA
TCATCCACCATTTTCTTAAGGGAATGGAGGTACTACAATTATGAACCGAAAACTGTCAAGTTTGCTGGCGACTCGAGTGGCCCTGGGGGAAAGGATGCTGACATCACAAT
AAACTTACCGCAGTTTTCTTCTGCCGCTGTTCTAAAGGACAGAGCACCGCCTGAAGCTGCCAAATCTTTGGATTTCAGAAATTTTGCTATGTATGTTGGTGGGCCTATTT
GGGCTTTAGATTGGTGGCCTCAAGTTCATGAAAGGACAGACTCTCTTATCAAATGTGAGTTTATTGCTGTGTCTGCTCATCCACCTGGCTCTTCTTATCACAAGATGGGT
ATCCCACTAACTGGAAGAGGTATGGTGCAAATATGGTGCTTAGTGCATGGCACTGAAATCCAAGAACCGACCAAGAATACTGAGGCATCATCAAACTTATCATCTCAACA
AAAGAGGCCTAGAGGAAGGCCACCAGGGCGTAAGAAAAAGGGGGAATCAAACTTGCCATCTAAACCAAAGAGGCCTAGAGGAAGACCTAAAAAGAAACAAGTAGAATCCA
ATGATAACAGGGGTGACAACTACCAACTTGTCCAGACCCTATCTATTGAATGCCCAGTTGGTTCATCCAACTTGCTTGATATTGATGGTGCCAAACATTCTGAAGAACCT
GTATTACTGGGAAAGAGTGTTGAAAGACAGAGGAGTACCTTACAAGAAGATTCTACCTTCAATTCTGAAGATGAAGTTCCTGCGCAAAAGAGAAGAGTGAGAAGAAAAGT
TGGGACCAAGAATCATGTTGATGACATGGGAACGTTAACACCTACAGAGAATCGAGAAGATGGATCCAATGCTATCAATCTTCAGGCAAATGATAATGTTATAAGTGAAC
ATTCTGGGGAAGACACTCTATTATGTAATAATATTTCAGATAATGCTGTTTCAGACACCAGTTCTGAATTTTCTATTCCCGAGAGTGTTGCTTTGCCTAGAGTAGTATTG
TGCTTAGCTCACAATGGAAAGGTAGCATGGGATTTGAAATGGAAGCCAACTAAGGCGTGTACTATCAAAAACAAGCATCGAATGGGTTACCTTGCTGTCTTGCTGGGCAA
CGGATCTCTAGAAGTCTGGGAGGTTCCTTTTCCCCATGTAGTGAAGGCCATCTATTCTAAATTCAATGGGGAGGGTACAGACCCTCGCTTTATGAAGTTGAAGCCTATTT
TCAGATGCTCGATGTTGAGAAGTGCAAATACACAGAGCATCCCTCTGACTGTGGAATGGTCATCAACACCTCCTAATGATTATCTACTCGCTGGATGCCATGATGGAACA
GTCGCCTTGTGGAAGTTTTCTGCAAATAGTACATGTGAAGATACGAGGCCTTTACTTCGTTTTAGTGCAGATACAGTTCCAATAAGAGGTGTTGCATGGGCACCAAGTGA
AAGCGATCCCGAAACTGCAAATGTGGTGCTTACGGTTGGTCATGGAGGTTTAAAATTTTGGGACCTAAGAGATCCTTTCCGTCCTTTGTGGGACCTTCATCCAGCGCCAA
GGATGATATATAGTCTTGATTGGCTTCCTGATCCTAGATGCGTTATTTTATCCTTTGATGATGGAACCTTGAGACTTCTCAGTTTGCCGAAGGCTGCGTATGATGTTCCA
GTAACGGGTAAACCCTTCACAGCAATAAAACAAAAAGGTTTACACACTTACTTTTGTACATCATCTGCTATCTGGAGTGCTCAAGTGTCAAGGCAGACAGGCATGGTTGC
ATACTGTGGTGCTGATGGAACTGTTGTTCGTTTTCAGCTTACAACCAAAGCAGTGGTGAAAGAAAATTCACGTAATCGAACCCCACAGTTTGTATGCGATTACTTTACTG
ACGAGCAGTCAACTATTACAATCCACACTCCAGAATCAGATGTTCCAGTTCCTTTGAAGAAGGTTGCCAACAGATCTGAACCCACGCCATTGTCCATGCGAGCTATTTTA
TCTGAAATTCAGTCAAATGAAGGAAATCACAAAACCGCCACTGTTTCAGTATCAGAAAGTGAAGCATTAGCCCTTCGCTGCGACGATGATAATAGTGTTGAATCTGGATC
TGAGGATACACCGATGAAAAACAAAACTCAATCAAAGTGCAAGAAGAAGGGTGGAGTGGATGACCAAGAATTGGAATGTAGTGATGAGCAAACGAATAATGACAGAGTGC
CTAGTTCGGGTGATAACTTTGAAAATTTGCCTCCCAGATCAGTTGCATTGCATAGAGTGAGATGGAACATGAACATAGGGAGTGAAAGATGGTTGTGTTATGGTGGAGCA
GCTGGAATTCTACGCTGTCAGGAGATCATGTTGTCTGCCCTCGATAAGAATTTGATGAAGAAGAAATGAAACTTGTTTGAAGAGAACAGGCTTTTGCCAAAACGCTTATA
TGGACAGCCAATTTTCATGAAACGACTCGTGCTCTCGCCCAATGCTTTGTACAGTCCTAAGATCAGGTTTGCTTCTACTGGTTAAAGGGAAGCCATTTTGAGTCGCAACC
ATGATCGCAATCGAGCCATGGTAGTGCAGTCATCCTTGAGGGGGCCGAATTATGGGAGCAGCCAAAGGAATAAGAAAACCGAAGACCATTAACTGTGGAAGTTGGTTAAA
AGGAGTTACAAGAGCCTAAGGTTGTTGTGTTCGGTTTGAAACAAACAGATGTCGATTTTTACACGGCGATCAAGATCATGGTGTAGAATAAACTGTATTCTAAAATTAGT
GACACGAAGGTCGTCTCCTTAGCTTATGGAAGTCGGGTTTAGAACGTCGTGGAGCTTGATTACCTGAGGAGGATGATGGGGTTTGTGTTTCATCTTTTAGGGACTTTTTC
ATTTTGGTATTTTGCCTACTTTAACTGCTTCTGTAATATAAATTAAGTTTTTAGATGTTGTTTCCTCCATGAACTTTTTTATATTACTGTGCATAATTTTATGAGTCTCT
TCAAGGATTTGAAATGAAATAATATAATGTACTTGCATATGAACATTGGAGAGGATGCCCAAAAATATTATGGTGTTTGGATTGAGGAGTTCAACACCACTTTGTGTGGC
CTAAGGATATTTCAACTCTAATGGTGCTCTTCACAAATTTAATAAATTCCTCTTCTTTACTTTTA
Protein sequenceShow/hide protein sequence
MAMEEHQHQVEASRGTSGKKGKKKPVVRDKELEKRAKKEAEATLVSEEQPAGRSDDPRIKVSEFDHCVENHFRAIDTIFKLCGEAEDGDDGIDESDIQRFSSSTIFLREW
RYYNYEPKTVKFAGDSSGPGGKDADITINLPQFSSAAVLKDRAPPEAAKSLDFRNFAMYVGGPIWALDWWPQVHERTDSLIKCEFIAVSAHPPGSSYHKMGIPLTGRGMV
QIWCLVHGTEIQEPTKNTEASSNLSSQQKRPRGRPPGRKKKGESNLPSKPKRPRGRPKKKQVESNDNRGDNYQLVQTLSIECPVGSSNLLDIDGAKHSEEPVLLGKSVER
QRSTLQEDSTFNSEDEVPAQKRRVRRKVGTKNHVDDMGTLTPTENREDGSNAINLQANDNVISEHSGEDTLLCNNISDNAVSDTSSEFSIPESVALPRVVLCLAHNGKVA
WDLKWKPTKACTIKNKHRMGYLAVLLGNGSLEVWEVPFPHVVKAIYSKFNGEGTDPRFMKLKPIFRCSMLRSANTQSIPLTVEWSSTPPNDYLLAGCHDGTVALWKFSAN
STCEDTRPLLRFSADTVPIRGVAWAPSESDPETANVVLTVGHGGLKFWDLRDPFRPLWDLHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLPKAAYDVPVTGKPFTAI
KQKGLHTYFCTSSAIWSAQVSRQTGMVAYCGADGTVVRFQLTTKAVVKENSRNRTPQFVCDYFTDEQSTITIHTPESDVPVPLKKVANRSEPTPLSMRAILSEIQSNEGN
HKTATVSVSESEALALRCDDDNSVESGSEDTPMKNKTQSKCKKKGGVDDQELECSDEQTNNDRVPSSGDNFENLPPRSVALHRVRWNMNIGSERWLCYGGAAGILRCQEI
MLSALDKNLMKKK