; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr022873 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr022873
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionDNA binding protein, putative isoform 1
Genome locationtig00000589:2679357..2692728
RNA-Seq ExpressionSgr022873
SyntenySgr022873
Gene Ontology termsGO:0006383 - transcription by RNA polymerase III (biological process)
GO:0000127 - transcription factor TFIIIC complex (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR001680 - WD40 repeat
IPR015943 - WD40/YVTN repeat-like-containing domain superfamily
IPR017956 - AT hook, DNA-binding motif
IPR036322 - WD40-repeat-containing domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7017442.1 General transcription factor 3C polypeptide 2 [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0077.12Show/hide
Query:  VEVEPPAAEAS--TGCKRGKKKPVGREKKEAQGRATKKKAGATSVDEEQSTGRLDGMPIKVSEFDHCVENHFSAIDTISKLCGEAEDGDGGIDESDIQRF
        +E  P  AEAS  T CK+GKKK V     E Q RA KKKAGATSV+E Q TGRLD   +KVSEFDHCVENHF AID I++L GEAE+G+GG+DESD QRF
Subjt:  VEVEPPAAEAS--TGCKRGKKKPVGREKKEAQGRATKKKAGATSVDEEQSTGRLDGMPIKVSEFDHCVENHFSAIDTISKLCGEAEDGDGGIDESDIQRF

Query:  SSSTIFLREWRYYNYESKTVKFASDSRGPEGKDADITINLPQFSSAAVLKNGTPSEATTSLDWRNFVMYVGGPVWALDWCPQVHERTDSLIKCEFIAVSA
        SSST FLREW++YNYE KTVKF SDSR PEGKDADIT+ LPQFSSAAVLKNG P  ATTSLD+RNF+M+VGGPVWA+DWCP VHER+DSLIKCEFIAVSA
Subjt:  SSSTIFLREWRYYNYESKTVKFASDSRGPEGKDADITINLPQFSSAAVLKNGTPSEATTSLDWRNFVMYVGGPVWALDWCPQVHERTDSLIKCEFIAVSA

Query:  HPPGSSYHKMGTPLTGRGMVQIWCLLHGTENHEAEPTNAAKGKSNPKKVEVSSDLSSQPKRPRGRPPGPKKNGASDLPSQPKRPRGRPKKKKEESN-DNM
        HPPGSSYH MG PL+GRGMVQIWCL+HGTE+HE+E T+A + K         SDL SQPKRPRGRPPG KKNGAS LPSQPKRPRGRPK K+EE N DN 
Subjt:  HPPGSSYHKMGTPLTGRGMVQIWCLLHGTENHEAEPTNAAKGKSNPKKVEVSSDLSSQPKRPRGRPPGPKKNGASDLPSQPKRPRGRPKKKKEESN-DNM

Query:  GDNYQFVQALSIEYP----------VVPKNSEELVLLENSVERQKCTLQEVSTCNSEDEVPAQKRRVRRKAGTKNHIDDMGMLTLTENREDGSNGINLQA
          +YQ VQ LS+EYP           VP NSE+ V LENSVER   T++E+STCNSEDEVP QKRRVRR A TKNH+DD+G L+L ENREDGSN  N +A
Subjt:  GDNYQFVQALSIEYP----------VVPKNSEELVLLENSVERQKCTLQEVSTCNSEDEVPAQKRRVRRKAGTKNHIDDMGMLTLTENREDGSNGINLQA

Query:  NENVISKNSGEDTLLCNNISKNAVLDTSSIEFSIPGSVALPRVVLCLAHNGKVAWDLKWKPFNARTSKCKHRMGYLAVLLGNGSLEVWEVPFPHVVKAIY
        NENV S+ SGEDTLLC NIS+ A+LDT S  FSIP +VALPR+VLCLAHNGKVAWDLKWKP NART+KCK RMGYLAVLLGNGSLEVWEVPFPHVVKAIY
Subjt:  NENVISKNSGEDTLLCNNISKNAVLDTSSIEFSIPGSVALPRVVLCLAHNGKVAWDLKWKPFNARTSKCKHRMGYLAVLLGNGSLEVWEVPFPHVVKAIY

Query:  SKFNGEGMDPRFVKLKPIFKCSMLRSANTQSIPLTVEWSSTPPYDYLFAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRAVAWAPSESDPESANV
        SK NGEG DPRFVKLKP F+CSMLRSA+TQSIPLTVEWS TPPYDYL AGCHDGTVALWKFSANST EDTRPLLRFSADTVPIRAVAWAPSES+PES NV
Subjt:  SKFNGEGMDPRFVKLKPIFKCSMLRSANTQSIPLTVEWSSTPPYDYLFAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRAVAWAPSESDPESANV

Query:  VLTAGHGGLKFWDLRDPFRPLWDLHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLLKAAYDVPVTGKPFTGTKQQGLHSYCCSSFAIWSVQVSRQTGM
        +L A HGG+KFWDLRDPFRPLWDLHPAPR+IYSLDWLP+PRCV LSFDDGTLRLLSLLKAAYDVPVTG+PFT  KQ+GLH+YCCS FAIWS+QVSRQTGM
Subjt:  VLTAGHGGLKFWDLRDPFRPLWDLHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLLKAAYDVPVTGKPFTGTKQQGLHSYCCSSFAIWSVQVSRQTGM

Query:  VAYCGADGAVHRFQLTTRAVEKDHSRSRTPHFVCEYLTEEQSTITIHSPASDVPFPLKKLSNKSDPPLSMRAILSDSIQSNEENHKTAMASASENETLAL
        VAYCGADGAV RFQLTT+AV+K++SR+RTPHFVCEYLTEEQS ITIHSPASDVP PLKKLSNKS+ PLSMRAILSDS+Q NE N K+A  SA  NE+ AL
Subjt:  VAYCGADGAVHRFQLTTRAVEKDHSRSRTPHFVCEYLTEEQSTITIHSPASDVPFPLKKLSNKSDPPLSMRAILSDSIQSNEENHKTAMASASENETLAL

Query:  SYGNDVSIESGSEDTLMSIKKRNQTQSKGKKKGVDNQALECSDEPNDAQAKADVLPGSGNDFEIFPPKSVAMHRVRWNMNTGSERWLCYGGAAGILRCQE
         Y +DV +ESGSEDT MSI+ +NQTQSK KKKGV NQ LE S EP+D+Q   DV+PG G  FE FPPKSVA+HR+RWNMN GSERWL YGGAAGILRCQE
Subjt:  SYGNDVSIESGSEDTLMSIKKRNQTQSKGKKKGVDNQALECSDEPNDAQAKADVLPGSGNDFEIFPPKSVAMHRVRWNMNTGSERWLCYGGAAGILRCQE

Query:  IVLSDLDKK
        IVLS LDKK
Subjt:  IVLSDLDKK

XP_022144681.1 uncharacterized protein LOC111014310 isoform X1 [Momordica charantia]0.0e+0083.83Show/hide
Query:  EHHHQVE-VEP--PAAEASTGCKRGKKK-PVGREKKEAQGRATKKKAGATSVDEEQSTGRLDGMPIKVSEFDHCVENHFSAIDTISKLCGEAEDGDGGID
        EHHH VE VEP  PAA  STG KRGK+K PV R+KKEA GRA KKK G  S DEEQ TGRLDG+ IKV EFDHC ENHF A+DTI++LCGEAEDGDGGID
Subjt:  EHHHQVE-VEP--PAAEASTGCKRGKKK-PVGREKKEAQGRATKKKAGATSVDEEQSTGRLDGMPIKVSEFDHCVENHFSAIDTISKLCGEAEDGDGGID

Query:  ESDIQRFSSSTIFLREWRYYNYESKTVKFASDSRGPEGKDADITINLPQFSSAAVLKNGTPSEATTSLDWRNFVMYVGGPVWALDWCPQVHERTDSLIKC
        ESDIQRFSSS  FLREWR+YNYE KTVKFASD RG EGKD DITINLPQFSSAAVLKNGTPS A TSLDWRNFVMYVGGPVWALDWCPQV E+TD+LIKC
Subjt:  ESDIQRFSSSTIFLREWRYYNYESKTVKFASDSRGPEGKDADITINLPQFSSAAVLKNGTPSEATTSLDWRNFVMYVGGPVWALDWCPQVHERTDSLIKC

Query:  EFIAVSAHPPGSSYHKMGTPLTGRGMVQIWCLLHGTENHEAEPTNAAKGKSNPKKVEVSSDLSSQPKRPRGRPPGPKKNGASDLPSQPKRPRGRPKKKKE
        EFIAVSAHPPGSSYHKMGTPL GRGMVQIWCL+HGTENHE EP  A K KS PKK EVSSDLSSQPKRPRGRPPG KK GASDLPSQPKRPRGRPKKK+E
Subjt:  EFIAVSAHPPGSSYHKMGTPLTGRGMVQIWCLLHGTENHEAEPTNAAKGKSNPKKVEVSSDLSSQPKRPRGRPPGPKKNGASDLPSQPKRPRGRPKKKKE

Query:  ESNDNMGDNYQFVQALSIEYPV----------VPKNSEELVLLENSVERQKCTLQEVSTCNSEDEVPAQKRRVRRKAGTKNHIDDMGMLTLTENREDGSN
         SNDNMGDN Q VQ+LS+EYP            PKNSEEL+LL NSVERQK TLQ VSTCNS+DE PAQKRRVRRK GTKNHIDDMG L  T NREDGS+
Subjt:  ESNDNMGDNYQFVQALSIEYPV----------VPKNSEELVLLENSVERQKCTLQEVSTCNSEDEVPAQKRRVRRKAGTKNHIDDMGMLTLTENREDGSN

Query:  GINLQANENVISKNSGEDTLLCNNISKNAVLDTSSIEFSIPGSVALPRVVLCLAHNGKVAWDLKWKPFNARTSKCKHRMGYLAVLLGNGSLEVWEVPFPH
         I+ Q NENVIS+ SGEDTLLCNNISKNA       EFSIP SVALPRVVLCLAHNGKVAWDLKWKP NA T+ CKHRMGYLAVLLGNGSLEVWE+PFPH
Subjt:  GINLQANENVISKNSGEDTLLCNNISKNAVLDTSSIEFSIPGSVALPRVVLCLAHNGKVAWDLKWKPFNARTSKCKHRMGYLAVLLGNGSLEVWEVPFPH

Query:  VVKAIYSKFNGEGMDPRFVKLKPIFKCSMLRSANTQSIPLTVEWSSTPPYDYLFAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRAVAWAPSESD
        VVKAIYSKFN EG DPRFVKLKPIF+ +ML+SAN QSIPLTVEWSSTPPYDYLFAGC+DGTVALWKFSANSTCEDTRPLLRFSADTVPIR VAWAP+ESD
Subjt:  VVKAIYSKFNGEGMDPRFVKLKPIFKCSMLRSANTQSIPLTVEWSSTPPYDYLFAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRAVAWAPSESD

Query:  PESANVVLTAGHGGLKFWDLRDPFRPLWDLHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLLKAAYDVPVTGKPFTGTKQQGLHSYCCSSFAIWSVQV
        PESANVVLTA HGGLKFWDLRDPFRPLWD+HPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLLKAAYDVPVTGKPFTGTKQQGLHSY  SSFAIWSVQV
Subjt:  PESANVVLTAGHGGLKFWDLRDPFRPLWDLHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLLKAAYDVPVTGKPFTGTKQQGLHSYCCSSFAIWSVQV

Query:  SRQTGMVAYCGADGAVHRFQLTTRAVEKDHSRSRTPHFVCEYLTEEQSTITIHSPASDVPFPLKKLSNKSDPPLSMRAILSDSIQSNEENHKTAMASASE
        SRQTGMVAYC ADGAV RFQLTTRAVEKDHSR+RTPHF+CEYLTEE+S ITIHSPAS VPFPLKK SNKSD PLS RAILSDSI+SNE NHKTA A+ASE
Subjt:  SRQTGMVAYCGADGAVHRFQLTTRAVEKDHSRSRTPHFVCEYLTEEQSTITIHSPASDVPFPLKKLSNKSDPPLSMRAILSDSIQSNEENHKTAMASASE

Query:  NETLALSYGNDVSIESGSEDTLMSIKKRNQTQSKGKKKGVDNQALECSDEPNDAQAKADVLPGSGNDFEIFPPKSVAMHRVRWNMNTGSERWLCYGGAAG
        NE LA+ + NDVS++SGSEDTLMS+KK+NQTQSK KKK VD+QALECSDEPNDAQ K D LPGSG++FE FPPKSVA+HRVRWNMNTGSERWLCYGG AG
Subjt:  NETLALSYGNDVSIESGSEDTLMSIKKRNQTQSKGKKKGVDNQALECSDEPNDAQAKADVLPGSGNDFEIFPPKSVAMHRVRWNMNTGSERWLCYGGAAG

Query:  ILRCQEIVLSDLDKK
        I+RCQEIVLSD DKK
Subjt:  ILRCQEIVLSDLDKK

XP_022934485.1 uncharacterized protein LOC111441649 isoform X1 [Cucurbita moschata]0.0e+0077.01Show/hide
Query:  VEVEPPAAEAS--TGCKRGKKKPVGREKKEAQGRATKKKAGATSVDEEQSTGRLDGMPIKVSEFDHCVENHFSAIDTISKLCGEAEDGDGGIDESDIQRF
        +E  P  AEAS  T CK+GKKK V  E  E Q RA KKK GATSV+E Q TGRLD   +KVSEFDHCVENHF AID I++L GEAE+G+GG+DESD QRF
Subjt:  VEVEPPAAEAS--TGCKRGKKKPVGREKKEAQGRATKKKAGATSVDEEQSTGRLDGMPIKVSEFDHCVENHFSAIDTISKLCGEAEDGDGGIDESDIQRF

Query:  SSSTIFLREWRYYNYESKTVKFASDSRGPEGKDADITINLPQFSSAAVLKNGTPSEATTSLDWRNFVMYVGGPVWALDWCPQVHERTDSLIKCEFIAVSA
        SSST FLREW++YNYE KTVKF SDSR PEGKDADIT+ LPQFSSAAVLKNG P  AT SLD+RNF+M+VGGPVWA+DWCP VHERTDSLIKCEFIAVSA
Subjt:  SSSTIFLREWRYYNYESKTVKFASDSRGPEGKDADITINLPQFSSAAVLKNGTPSEATTSLDWRNFVMYVGGPVWALDWCPQVHERTDSLIKCEFIAVSA

Query:  HPPGSSYHKMGTPLTGRGMVQIWCLLHGTENHEAEPTNAAKGKSNPKKVEVSSDLSSQPKRPRGRPPGPKKNGASDLPSQPKRPRGRPKKKKEESN-DNM
        HPPGSSYH MG PL+GRGMVQIWCL+HGTE+HE+E T+A + K         SDL SQPKRPRGRPPG KKNGAS LPSQPKRPRGRPKKK+EE N DN 
Subjt:  HPPGSSYHKMGTPLTGRGMVQIWCLLHGTENHEAEPTNAAKGKSNPKKVEVSSDLSSQPKRPRGRPPGPKKNGASDLPSQPKRPRGRPKKKKEESN-DNM

Query:  GDNYQFVQALSIEYPVVPK----------NSEELVLLENSVERQKCTLQEVSTCNSEDEVPAQKRRVRRKAGTKNHIDDMGMLTLTENREDGSNGINLQA
          +YQ VQ LS+EYP V            NSE+ V LENSVER   T++E+STCNSEDEVP QKRRVRR A TKNH+DD+G L+L ENREDGSN  N +A
Subjt:  GDNYQFVQALSIEYPVVPK----------NSEELVLLENSVERQKCTLQEVSTCNSEDEVPAQKRRVRRKAGTKNHIDDMGMLTLTENREDGSNGINLQA

Query:  NENVISKNSGEDTLLCNNISKNAVLDTSSIEFSIPGSVALPRVVLCLAHNGKVAWDLKWKPFNARTSKCKHRMGYLAVLLGNGSLEVWEVPFPHVVKAIY
        NENV S+ SGEDT LC NIS+ A+LDT S  FSIP +VALPR+VLCLAHNGKVAWDLKWKP NART+KCK RMGYLAVLLGNGSLEVWEVPFPHVVKAIY
Subjt:  NENVISKNSGEDTLLCNNISKNAVLDTSSIEFSIPGSVALPRVVLCLAHNGKVAWDLKWKPFNARTSKCKHRMGYLAVLLGNGSLEVWEVPFPHVVKAIY

Query:  SKFNGEGMDPRFVKLKPIFKCSMLRSANTQSIPLTVEWSSTPPYDYLFAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRAVAWAPSESDPESANV
        SK NGEG DPRFVKLKP F+CSMLRSA+TQSIPLTVEWS TPPYDYL AGCHDGTVALWKFSA+ST EDTRPLLRFSADTVPIRAVAWAPSES+PES NV
Subjt:  SKFNGEGMDPRFVKLKPIFKCSMLRSANTQSIPLTVEWSSTPPYDYLFAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRAVAWAPSESDPESANV

Query:  VLTAGHGGLKFWDLRDPFRPLWDLHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLLKAAYDVPVTGKPFTGTKQQGLHSYCCSSFAIWSVQVSRQTGM
        +L A HGG+KFWDLRDPFRPLWDLHPAPR+IYSLDWLP+PRCV LSFDDGTLRLLSLLKAAYDVPVTG+PFT  KQ+GLH+YCCS FAIWS+QVSRQTGM
Subjt:  VLTAGHGGLKFWDLRDPFRPLWDLHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLLKAAYDVPVTGKPFTGTKQQGLHSYCCSSFAIWSVQVSRQTGM

Query:  VAYCGADGAVHRFQLTTRAVEKDHSRSRTPHFVCEYLTEEQSTITIHSPASDVPFPLKKLSNKSDPPLSMRAILSDSIQSNEENHKTAMASASENETLAL
        VAYCGADGAV RFQLTT+AV+K++SR+RTPHFVCEYLTEEQS ITIHSPASDVP PLKKLSNKS+ PLSMRAILSDS+Q NE N K+A  SA ENE+ AL
Subjt:  VAYCGADGAVHRFQLTTRAVEKDHSRSRTPHFVCEYLTEEQSTITIHSPASDVPFPLKKLSNKSDPPLSMRAILSDSIQSNEENHKTAMASASENETLAL

Query:  SYGNDVSIESGSEDTLMSIKKRNQTQSKGKKKGVDNQALECSDEPNDAQAKADVLPGSGNDFEIFPPKSVAMHRVRWNMNTGSERWLCYGGAAGILRCQE
         Y +DV +ESGSEDT MSI+ +NQTQSK KKKGV NQ LE S EP+D+Q   DV+PG G  FE FPPKSVA+HR+RWNMN GSERWL YGGAAGILRCQE
Subjt:  SYGNDVSIESGSEDTLMSIKKRNQTQSKGKKKGVDNQALECSDEPNDAQAKADVLPGSGNDFEIFPPKSVAMHRVRWNMNTGSERWLCYGGAAGILRCQE

Query:  IVLSDLDKK
        IVLS LDKK
Subjt:  IVLSDLDKK

XP_023528187.1 uncharacterized protein LOC111791176 [Cucurbita pepo subsp. pepo]0.0e+0077.67Show/hide
Query:  VEVEPPAAEAS--TGCKRGKKKPVGREKKEAQGRATKKKAGATSVDEEQSTGRLDGMPIKVSEFDHCVENHFSAIDTISKLCGEAEDGDGGIDESDIQRF
        +E  P  AEAS  T CK+GKKK V  E  E Q RA KKKAGATSV+E Q TGRLD   +KVSEFDHCVENHF AID I++L GEAE+G+GG+DESD QRF
Subjt:  VEVEPPAAEAS--TGCKRGKKKPVGREKKEAQGRATKKKAGATSVDEEQSTGRLDGMPIKVSEFDHCVENHFSAIDTISKLCGEAEDGDGGIDESDIQRF

Query:  SSSTIFLREWRYYNYESKTVKFASDSRGPEGKDADITINLPQFSSAAVLKNGTPSEATTSLDWRNFVMYVGGPVWALDWCPQVHERTDSLIKCEFIAVSA
        SSST FLREW++YNYE KTVKF SDSR PEGKDADIT+ LPQFSSAAVLKNG P  ATTSLD+RNF+M+VGGPVWA+DWCP VHERTDSLIKCEFIAVSA
Subjt:  SSSTIFLREWRYYNYESKTVKFASDSRGPEGKDADITINLPQFSSAAVLKNGTPSEATTSLDWRNFVMYVGGPVWALDWCPQVHERTDSLIKCEFIAVSA

Query:  HPPGSSYHKMGTPLTGRGMVQIWCLLHGTENHEAEPTNAAKGKSNPKKVEVSSDLSSQPKRPRGRPPGPKKNGASDLPSQPKRPRGRPKKKKEESN-DNM
        HPPGSSYH MG PL+GRGMVQIWCL+HGTE+HE+E T+A + K         SDL SQPKRPRGRPPG KKNGAS LPSQPKRPRGRPKKK+EE N DN 
Subjt:  HPPGSSYHKMGTPLTGRGMVQIWCLLHGTENHEAEPTNAAKGKSNPKKVEVSSDLSSQPKRPRGRPPGPKKNGASDLPSQPKRPRGRPKKKKEESN-DNM

Query:  GDNYQFVQALSIEYP----------VVPKNSEELVLLENSVERQKCTLQEVSTCNSEDEVPAQKRRVRRKAGTKNHIDDMGMLTLTENREDGSNGINLQA
          +YQ VQ LS+EYP           VP NSE+ V LENSVER   T++E+STCNSEDEVP QKRRVRR A TKNH+DD+G L+L ENREDG N  N +A
Subjt:  GDNYQFVQALSIEYP----------VVPKNSEELVLLENSVERQKCTLQEVSTCNSEDEVPAQKRRVRRKAGTKNHIDDMGMLTLTENREDGSNGINLQA

Query:  NENVISKNSGEDTLLCNNISKNAVLDTSSIEFSIPGSVALPRVVLCLAHNGKVAWDLKWKPFNARTSKCKHRMGYLAVLLGNGSLEVWEVPFPHVVKAIY
        NENV S+ SGEDTLLC NIS+NA+LDT S  FSIP SVALPR+VLCLAHNGKVAWDLKWKP NART+KCK RMGYLAVLLGNGSLEVWEVPFPHVVKAIY
Subjt:  NENVISKNSGEDTLLCNNISKNAVLDTSSIEFSIPGSVALPRVVLCLAHNGKVAWDLKWKPFNARTSKCKHRMGYLAVLLGNGSLEVWEVPFPHVVKAIY

Query:  SKFNGEGMDPRFVKLKPIFKCSMLRSANTQSIPLTVEWSSTPPYDYLFAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRAVAWAPSESDPESANV
        SK NGEG DPRFV+LKP F+CSMLRSA+TQSIPLTVEWS TPPYDYL AGCHDGTVALWKFSANST EDTRPLLRFSADTVPIRAVAWAPSES+PES NV
Subjt:  SKFNGEGMDPRFVKLKPIFKCSMLRSANTQSIPLTVEWSSTPPYDYLFAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRAVAWAPSESDPESANV

Query:  VLTAGHGGLKFWDLRDPFRPLWDLHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLLKAAYDVPVTGKPFTGTKQQGLHSYCCSSFAIWSVQVSRQTGM
        +L A HGG+KFWDLRDPFRPLWDLHPAPR+IYSLDWLP+PRCV LSFDDGTLRLLSLLKAAYDVPVTG+PFT  KQ+GLH+YCCS FAIWS+QVSRQTGM
Subjt:  VLTAGHGGLKFWDLRDPFRPLWDLHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLLKAAYDVPVTGKPFTGTKQQGLHSYCCSSFAIWSVQVSRQTGM

Query:  VAYCGADGAVHRFQLTTRAVEKDHSRSRTPHFVCEYLTEEQSTITIHSPASDVPFPLKKLSNKSDPPLSMRAILSDSIQSNEENHKTAMASASENETLAL
        VAYCGADGAV RFQLTT+AV+K++SR+RTPHFVCEYLTEEQS ITIHSPASDVP PLKKL+NKS+ PLSMRAILSDS+Q NE N K+A  SA ENE+ AL
Subjt:  VAYCGADGAVHRFQLTTRAVEKDHSRSRTPHFVCEYLTEEQSTITIHSPASDVPFPLKKLSNKSDPPLSMRAILSDSIQSNEENHKTAMASASENETLAL

Query:  SYGNDVSIESGSEDTLMSIKKRNQTQSKGKKKGVDNQALECSDEPNDAQAKADVLPGSGNDFEIFPPKSVAMHRVRWNMNTGSERWLCYGGAAGILRCQE
         Y +DV +ESGSEDT MSI+ +NQTQSK KKKGV NQ LE S EP+D+Q   DV+PGSG+ FE FPPKSVA+HR+RWNMN GSERWLCYGGAAGILRCQE
Subjt:  SYGNDVSIESGSEDTLMSIKKRNQTQSKGKKKGVDNQALECSDEPNDAQAKADVLPGSGNDFEIFPPKSVAMHRVRWNMNTGSERWLCYGGAAGILRCQE

Query:  IVLSDLDKK
        IVLS LDKK
Subjt:  IVLSDLDKK

XP_038903194.1 uncharacterized protein LOC120089853 [Benincasa hispida]0.0e+0077.56Show/hide
Query:  EVEP-PAAEASTGCKRGKKKPVGREKKEAQGRATKKKAG---ATSVDEEQSTGRLDGMPIKVSEFDHCVENHFSAIDTISKLCGEAEDGDGGIDESDIQR
        E++P P     T  K+GKKKP  REKK+++  A  K       TSV++ Q TGRLDG  +KVSEFDHC+ENHF+A+DTI +LC EAE  DGGIDESDIQR
Subjt:  EVEP-PAAEASTGCKRGKKKPVGREKKEAQGRATKKKAG---ATSVDEEQSTGRLDGMPIKVSEFDHCVENHFSAIDTISKLCGEAEDGDGGIDESDIQR

Query:  FSSSTIFLREWRYYNYESKTVKFASDSRGPEGKDADITINLPQFSSAAVLKNGTPSEATTSLDWRNFVMYVGGPVWALDWCPQVHERTDSLIKCEFIAVS
        F+SSTIFLREWR+YNYE K +KFASDSRGPEGKDADITI LPQFSSAAVLKNG P  ATTSLD+RNF M+VGGPVWALDWCPQVHERTDSLIKCEFIAVS
Subjt:  FSSSTIFLREWRYYNYESKTVKFASDSRGPEGKDADITINLPQFSSAAVLKNGTPSEATTSLDWRNFVMYVGGPVWALDWCPQVHERTDSLIKCEFIAVS

Query:  AHPPGSSYHKMGTPLTGRGMVQIWCLLHGTENHEAEPTNAAKGKSNPKKVEVSSDLSSQPKRPRGRPPGPKKNGASDLPSQPKRPRGRPKKKKEESNDNM
        AHPPGSSYHKMG PLTGRGMVQIWC +HGTE++  EPTN           E  +DLSSQPKRPRGRP G KKNGAS LP QPKRPRGRPKKK+EESND  
Subjt:  AHPPGSSYHKMGTPLTGRGMVQIWCLLHGTENHEAEPTNAAKGKSNPKKVEVSSDLSSQPKRPRGRPPGPKKNGASDLPSQPKRPRGRPKKKKEESNDNM

Query:  GDNYQFVQALSIEYPV----------VPKNSEELVLLENSVERQKCTLQEVSTCNSEDEVPAQKRRVRRKAGTKNHIDDMGMLTLTENREDGSNGINLQA
        GD+   VQA SIE PV          VPKNSE +VLLENSVER++ TLQEVSTCNSEDEVPAQKRRVRRK   KNH+ D+GML+LTENREDGSN I+L+A
Subjt:  GDNYQFVQALSIEYPV----------VPKNSEELVLLENSVERQKCTLQEVSTCNSEDEVPAQKRRVRRKAGTKNHIDDMGMLTLTENREDGSNGINLQA

Query:  NENVISKNSGEDTLLCNNISKNAVLDTSSIEFSIPGSVALPRVVLCLAHNGKVAWDLKWKPFNARTSKCKHRMGYLAVLLGNGSLEVWEVPFPHVVKAIY
        NENV+ + SGED LLC NIS NAVLDTSSIEFSIP SVALPRVVLCLAHNGKVAWDLKWKP NA T  CK RMGYLAVLLGNGSLEVWEVPFPH VKAIY
Subjt:  NENVISKNSGEDTLLCNNISKNAVLDTSSIEFSIPGSVALPRVVLCLAHNGKVAWDLKWKPFNARTSKCKHRMGYLAVLLGNGSLEVWEVPFPHVVKAIY

Query:  SKFNGEGMDPRFVKLKPIFKCSMLRSANTQSIPLTVEWSSTPPYDYLFAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRAVAWAPSESDPESANV
        SKFNGEG DPRFVKLKPIF+CSMLR+ANTQSIPLTVEWS TPPYDYL AGCHDGTVALWKFSANS+CEDTRPLLRFSADTVPIRAVAWAPSES  ESANV
Subjt:  SKFNGEGMDPRFVKLKPIFKCSMLRSANTQSIPLTVEWSSTPPYDYLFAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRAVAWAPSESDPESANV

Query:  VLTAGHGGLKFWDLRDPFRPLWDLHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLLKAAYDVPVTGKPFTGTKQQGLHSYCCSSFAIWSVQVSRQTGM
        +LTAGHGGLKFWDLRDPFRPLWDLHPAPR+IYSLDWLP+PRCV LSFDDGTLRLLSLLKAAYDVPVTG+PFT  KQ+GLH+Y CSS+AIWS+QVSRQTGM
Subjt:  VLTAGHGGLKFWDLRDPFRPLWDLHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLLKAAYDVPVTGKPFTGTKQQGLHSYCCSSFAIWSVQVSRQTGM

Query:  VAYCGADGAVHRFQLTTRAVEKDHSRSRTPHFVCEYLTEEQSTITIHSPASDVPFPLKKLSNKSDPPLSMRAILSDSIQSNEENHKTAMASASENETLAL
        VAYCGADGAV RFQLTT+A +K++SR RTPH+VCEYLTEE+STITIHSP  ++PF LKKLSNKS+ PLSMRAILSDS+QSNE NHKTA A A ENE+ AL
Subjt:  VAYCGADGAVHRFQLTTRAVEKDHSRSRTPHFVCEYLTEEQSTITIHSPASDVPFPLKKLSNKSDPPLSMRAILSDSIQSNEENHKTAMASASENETLAL

Query:  SYGNDVSIESGSEDTLMSIKKRNQTQSKGKKKGVDNQALECSDEPN---------DAQAKADVLPGSGNDFEIFPPKSVAMHRVRWNMNTGSERWLCYGG
            DV +ESG EDTLMSIKK+N+TQSK  KKGV+NQ L+CSDEPN         D Q  A V+PGS + FE  PPKSVAMHRVRWNMN GSERWLCYGG
Subjt:  SYGNDVSIESGSEDTLMSIKKRNQTQSKGKKKGVDNQALECSDEPN---------DAQAKADVLPGSGNDFEIFPPKSVAMHRVRWNMNTGSERWLCYGG

Query:  AAGILRCQEIVLSDLDKK
        AAGILRCQEIVLS LD K
Subjt:  AAGILRCQEIVLSDLDKK

TrEMBL top hitse value%identityAlignment
A0A6J1CU50 uncharacterized protein LOC111014310 isoform X10.0e+0083.83Show/hide
Query:  EHHHQVE-VEP--PAAEASTGCKRGKKK-PVGREKKEAQGRATKKKAGATSVDEEQSTGRLDGMPIKVSEFDHCVENHFSAIDTISKLCGEAEDGDGGID
        EHHH VE VEP  PAA  STG KRGK+K PV R+KKEA GRA KKK G  S DEEQ TGRLDG+ IKV EFDHC ENHF A+DTI++LCGEAEDGDGGID
Subjt:  EHHHQVE-VEP--PAAEASTGCKRGKKK-PVGREKKEAQGRATKKKAGATSVDEEQSTGRLDGMPIKVSEFDHCVENHFSAIDTISKLCGEAEDGDGGID

Query:  ESDIQRFSSSTIFLREWRYYNYESKTVKFASDSRGPEGKDADITINLPQFSSAAVLKNGTPSEATTSLDWRNFVMYVGGPVWALDWCPQVHERTDSLIKC
        ESDIQRFSSS  FLREWR+YNYE KTVKFASD RG EGKD DITINLPQFSSAAVLKNGTPS A TSLDWRNFVMYVGGPVWALDWCPQV E+TD+LIKC
Subjt:  ESDIQRFSSSTIFLREWRYYNYESKTVKFASDSRGPEGKDADITINLPQFSSAAVLKNGTPSEATTSLDWRNFVMYVGGPVWALDWCPQVHERTDSLIKC

Query:  EFIAVSAHPPGSSYHKMGTPLTGRGMVQIWCLLHGTENHEAEPTNAAKGKSNPKKVEVSSDLSSQPKRPRGRPPGPKKNGASDLPSQPKRPRGRPKKKKE
        EFIAVSAHPPGSSYHKMGTPL GRGMVQIWCL+HGTENHE EP  A K KS PKK EVSSDLSSQPKRPRGRPPG KK GASDLPSQPKRPRGRPKKK+E
Subjt:  EFIAVSAHPPGSSYHKMGTPLTGRGMVQIWCLLHGTENHEAEPTNAAKGKSNPKKVEVSSDLSSQPKRPRGRPPGPKKNGASDLPSQPKRPRGRPKKKKE

Query:  ESNDNMGDNYQFVQALSIEYPV----------VPKNSEELVLLENSVERQKCTLQEVSTCNSEDEVPAQKRRVRRKAGTKNHIDDMGMLTLTENREDGSN
         SNDNMGDN Q VQ+LS+EYP            PKNSEEL+LL NSVERQK TLQ VSTCNS+DE PAQKRRVRRK GTKNHIDDMG L  T NREDGS+
Subjt:  ESNDNMGDNYQFVQALSIEYPV----------VPKNSEELVLLENSVERQKCTLQEVSTCNSEDEVPAQKRRVRRKAGTKNHIDDMGMLTLTENREDGSN

Query:  GINLQANENVISKNSGEDTLLCNNISKNAVLDTSSIEFSIPGSVALPRVVLCLAHNGKVAWDLKWKPFNARTSKCKHRMGYLAVLLGNGSLEVWEVPFPH
         I+ Q NENVIS+ SGEDTLLCNNISKNA       EFSIP SVALPRVVLCLAHNGKVAWDLKWKP NA T+ CKHRMGYLAVLLGNGSLEVWE+PFPH
Subjt:  GINLQANENVISKNSGEDTLLCNNISKNAVLDTSSIEFSIPGSVALPRVVLCLAHNGKVAWDLKWKPFNARTSKCKHRMGYLAVLLGNGSLEVWEVPFPH

Query:  VVKAIYSKFNGEGMDPRFVKLKPIFKCSMLRSANTQSIPLTVEWSSTPPYDYLFAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRAVAWAPSESD
        VVKAIYSKFN EG DPRFVKLKPIF+ +ML+SAN QSIPLTVEWSSTPPYDYLFAGC+DGTVALWKFSANSTCEDTRPLLRFSADTVPIR VAWAP+ESD
Subjt:  VVKAIYSKFNGEGMDPRFVKLKPIFKCSMLRSANTQSIPLTVEWSSTPPYDYLFAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRAVAWAPSESD

Query:  PESANVVLTAGHGGLKFWDLRDPFRPLWDLHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLLKAAYDVPVTGKPFTGTKQQGLHSYCCSSFAIWSVQV
        PESANVVLTA HGGLKFWDLRDPFRPLWD+HPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLLKAAYDVPVTGKPFTGTKQQGLHSY  SSFAIWSVQV
Subjt:  PESANVVLTAGHGGLKFWDLRDPFRPLWDLHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLLKAAYDVPVTGKPFTGTKQQGLHSYCCSSFAIWSVQV

Query:  SRQTGMVAYCGADGAVHRFQLTTRAVEKDHSRSRTPHFVCEYLTEEQSTITIHSPASDVPFPLKKLSNKSDPPLSMRAILSDSIQSNEENHKTAMASASE
        SRQTGMVAYC ADGAV RFQLTTRAVEKDHSR+RTPHF+CEYLTEE+S ITIHSPAS VPFPLKK SNKSD PLS RAILSDSI+SNE NHKTA A+ASE
Subjt:  SRQTGMVAYCGADGAVHRFQLTTRAVEKDHSRSRTPHFVCEYLTEEQSTITIHSPASDVPFPLKKLSNKSDPPLSMRAILSDSIQSNEENHKTAMASASE

Query:  NETLALSYGNDVSIESGSEDTLMSIKKRNQTQSKGKKKGVDNQALECSDEPNDAQAKADVLPGSGNDFEIFPPKSVAMHRVRWNMNTGSERWLCYGGAAG
        NE LA+ + NDVS++SGSEDTLMS+KK+NQTQSK KKK VD+QALECSDEPNDAQ K D LPGSG++FE FPPKSVA+HRVRWNMNTGSERWLCYGG AG
Subjt:  NETLALSYGNDVSIESGSEDTLMSIKKRNQTQSKGKKKGVDNQALECSDEPNDAQAKADVLPGSGNDFEIFPPKSVAMHRVRWNMNTGSERWLCYGGAAG

Query:  ILRCQEIVLSDLDKK
        I+RCQEIVLSD DKK
Subjt:  ILRCQEIVLSDLDKK

A0A6J1EW91 uncharacterized protein LOC1114371040.0e+0075.81Show/hide
Query:  PAAEASTGCKRGKKKPVGREKKEAQGRATKKKAGATSVDEEQSTGRLDGMPIKVSEFDHCVENHFSAIDTISKLCGEAEDGDGGIDESDIQRFSSSTIFL
        P A   TGC +GKKKPV R+KKE++ RA KKK  ATSV+EEQ  GR DG  IKV EFD+CVENHF AIDT+++LCG+ EDG G IDESDIQRFSSSTIFL
Subjt:  PAAEASTGCKRGKKKPVGREKKEAQGRATKKKAGATSVDEEQSTGRLDGMPIKVSEFDHCVENHFSAIDTISKLCGEAEDGDGGIDESDIQRFSSSTIFL

Query:  REWRYYNYESKTVKFASDSRGPEGKDADITINLPQFSSAAVLKNGTPSEATTSLDWRNFVMYVGGPVWALDWCPQVHERTDSLIKCEFIAVSAHPPGSSY
        REWRYYNYE KTVKFASDS GPE KDADI++NLPQFSSAA LKNG P EA TS+D+RNFVMYVGGP+WALDWCPQ HERTDSLIKCE+IAVSAHPP SSY
Subjt:  REWRYYNYESKTVKFASDSRGPEGKDADITINLPQFSSAAVLKNGTPSEATTSLDWRNFVMYVGGPVWALDWCPQVHERTDSLIKCEFIAVSAHPPGSSY

Query:  HKMGTPLTGRGMVQIWCLLHGTENHEAEPTNAAKGKSNPKKVEVSSDLSSQPKRPRGRPPGPKKNGASDLPSQPKRPRGRPKKKKEESNDNMGDNYQFVQ
        HKMG PLTGRGMVQIWC++HGTE HEAEPT         K    SS+LSSQP++PRGRPPG KKN AS+LPSQPKRPRGRP KKK+ESND          
Subjt:  HKMGTPLTGRGMVQIWCLLHGTENHEAEPTNAAKGKSNPKKVEVSSDLSSQPKRPRGRPPGPKKNGASDLPSQPKRPRGRPKKKKEESNDNMGDNYQFVQ

Query:  ALSIEYPVVPKNSEELVLLENSVERQKCTLQEVSTCNSEDEVPAQKRRVRRKAGTKNHIDDMGMLTLTENREDGSNGINLQANENVISKNSGEDTLLCNN
                            NSVER   TLQEV TCNS+DEV AQK+RVRRK  TKNH+DD+GML+LTENRE+ SN INLQANENVIS+ SGEDTLLCNN
Subjt:  ALSIEYPVVPKNSEELVLLENSVERQKCTLQEVSTCNSEDEVPAQKRRVRRKAGTKNHIDDMGMLTLTENREDGSNGINLQANENVISKNSGEDTLLCNN

Query:  ISKNAVLDTSSIEFSIPGSVALPRVVLCLAHNGKVAWDLKWKPFNARTSKCKHRMGYLAVLLGNGSLEVWEVPFPHVVKAIYSKFNGEGMDPRFVKLKPI
        +S+NA LD SSIEFSIP SVALPRVVLCLAHNGKVAWDLKWKP NA  +KCKHRMGYLAVLLGNGSLEVWEVPFPHV++AIYSK NGEG DPRFVKL P 
Subjt:  ISKNAVLDTSSIEFSIPGSVALPRVVLCLAHNGKVAWDLKWKPFNARTSKCKHRMGYLAVLLGNGSLEVWEVPFPHVVKAIYSKFNGEGMDPRFVKLKPI

Query:  FKCSMLRSANTQSIPLTVEWSSTPPYDYLFAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRAVAWAPSESDPESANVVLTAGHGGLKFWDLRDPF
        F+CSMLRSAN QSIPLTVEWSSTPPYDYL AGCHDGTVALWKF+ANS CEDTRPLLRFSADTVPIR VAWAP++SDPE  NV+LTAGHGGLKFWDLRDPF
Subjt:  FKCSMLRSANTQSIPLTVEWSSTPPYDYLFAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRAVAWAPSESDPESANVVLTAGHGGLKFWDLRDPF

Query:  RPLWDLHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLLKAAYDVPVTGKPFTGTKQQGLHSYCCSSFAIWSVQVSRQTGMVAYCGADGAVHRFQLTTR
        RPLWDLHPAPR+IYSLDWL DP C ILSFDDGTLRLLSL KAA DVPVTGKPFT  KQ+GLH+YCC+S AIWSVQVSRQTGMVAYCGADG V RFQLTT+
Subjt:  RPLWDLHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLLKAAYDVPVTGKPFTGTKQQGLHSYCCSSFAIWSVQVSRQTGMVAYCGADGAVHRFQLTTR

Query:  AVEKDHSRSRTPHFVCEYLTEEQSTITIHSPASDVPFPLKKLSNKSD-PPLSMRAILSDSIQSNEENHKTAMASASENETLALSYGNDVSIESGSEDTLM
        AV K++SR+RTP FVC+Y TEEQSTITIH+P  DVPFPLKK+SN+ D PPLSMRAILS+ IQSNE NHKTA AS SEN TLA    +D ++ESGSEDT  
Subjt:  AVEKDHSRSRTPHFVCEYLTEEQSTITIHSPASDVPFPLKKLSNKSD-PPLSMRAILSDSIQSNEENHKTAMASASENETLALSYGNDVSIESGSEDTLM

Query:  SIKKRNQTQSKGKKKG-VDNQALECSDEPNDAQAKADVLPGSGNDFEIFPPKSVAMHRVRWNMNTGSERWLCYGGAAGILRCQEIVLSDLDKK
         I K+++TQSK KKKG  DNQ  ECSDE ND        PGSG+  E  PPKSVA+HRVRWNMNTGSERWLCYGGAAG+LRCQEI+LS LDKK
Subjt:  SIKKRNQTQSKGKKKG-VDNQALECSDEPNDAQAKADVLPGSGNDFEIFPPKSVAMHRVRWNMNTGSERWLCYGGAAGILRCQEIVLSDLDKK

A0A6J1F7U5 uncharacterized protein LOC111441649 isoform X10.0e+0077.01Show/hide
Query:  VEVEPPAAEAS--TGCKRGKKKPVGREKKEAQGRATKKKAGATSVDEEQSTGRLDGMPIKVSEFDHCVENHFSAIDTISKLCGEAEDGDGGIDESDIQRF
        +E  P  AEAS  T CK+GKKK V  E  E Q RA KKK GATSV+E Q TGRLD   +KVSEFDHCVENHF AID I++L GEAE+G+GG+DESD QRF
Subjt:  VEVEPPAAEAS--TGCKRGKKKPVGREKKEAQGRATKKKAGATSVDEEQSTGRLDGMPIKVSEFDHCVENHFSAIDTISKLCGEAEDGDGGIDESDIQRF

Query:  SSSTIFLREWRYYNYESKTVKFASDSRGPEGKDADITINLPQFSSAAVLKNGTPSEATTSLDWRNFVMYVGGPVWALDWCPQVHERTDSLIKCEFIAVSA
        SSST FLREW++YNYE KTVKF SDSR PEGKDADIT+ LPQFSSAAVLKNG P  AT SLD+RNF+M+VGGPVWA+DWCP VHERTDSLIKCEFIAVSA
Subjt:  SSSTIFLREWRYYNYESKTVKFASDSRGPEGKDADITINLPQFSSAAVLKNGTPSEATTSLDWRNFVMYVGGPVWALDWCPQVHERTDSLIKCEFIAVSA

Query:  HPPGSSYHKMGTPLTGRGMVQIWCLLHGTENHEAEPTNAAKGKSNPKKVEVSSDLSSQPKRPRGRPPGPKKNGASDLPSQPKRPRGRPKKKKEESN-DNM
        HPPGSSYH MG PL+GRGMVQIWCL+HGTE+HE+E T+A + K         SDL SQPKRPRGRPPG KKNGAS LPSQPKRPRGRPKKK+EE N DN 
Subjt:  HPPGSSYHKMGTPLTGRGMVQIWCLLHGTENHEAEPTNAAKGKSNPKKVEVSSDLSSQPKRPRGRPPGPKKNGASDLPSQPKRPRGRPKKKKEESN-DNM

Query:  GDNYQFVQALSIEYPVVPK----------NSEELVLLENSVERQKCTLQEVSTCNSEDEVPAQKRRVRRKAGTKNHIDDMGMLTLTENREDGSNGINLQA
          +YQ VQ LS+EYP V            NSE+ V LENSVER   T++E+STCNSEDEVP QKRRVRR A TKNH+DD+G L+L ENREDGSN  N +A
Subjt:  GDNYQFVQALSIEYPVVPK----------NSEELVLLENSVERQKCTLQEVSTCNSEDEVPAQKRRVRRKAGTKNHIDDMGMLTLTENREDGSNGINLQA

Query:  NENVISKNSGEDTLLCNNISKNAVLDTSSIEFSIPGSVALPRVVLCLAHNGKVAWDLKWKPFNARTSKCKHRMGYLAVLLGNGSLEVWEVPFPHVVKAIY
        NENV S+ SGEDT LC NIS+ A+LDT S  FSIP +VALPR+VLCLAHNGKVAWDLKWKP NART+KCK RMGYLAVLLGNGSLEVWEVPFPHVVKAIY
Subjt:  NENVISKNSGEDTLLCNNISKNAVLDTSSIEFSIPGSVALPRVVLCLAHNGKVAWDLKWKPFNARTSKCKHRMGYLAVLLGNGSLEVWEVPFPHVVKAIY

Query:  SKFNGEGMDPRFVKLKPIFKCSMLRSANTQSIPLTVEWSSTPPYDYLFAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRAVAWAPSESDPESANV
        SK NGEG DPRFVKLKP F+CSMLRSA+TQSIPLTVEWS TPPYDYL AGCHDGTVALWKFSA+ST EDTRPLLRFSADTVPIRAVAWAPSES+PES NV
Subjt:  SKFNGEGMDPRFVKLKPIFKCSMLRSANTQSIPLTVEWSSTPPYDYLFAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRAVAWAPSESDPESANV

Query:  VLTAGHGGLKFWDLRDPFRPLWDLHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLLKAAYDVPVTGKPFTGTKQQGLHSYCCSSFAIWSVQVSRQTGM
        +L A HGG+KFWDLRDPFRPLWDLHPAPR+IYSLDWLP+PRCV LSFDDGTLRLLSLLKAAYDVPVTG+PFT  KQ+GLH+YCCS FAIWS+QVSRQTGM
Subjt:  VLTAGHGGLKFWDLRDPFRPLWDLHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLLKAAYDVPVTGKPFTGTKQQGLHSYCCSSFAIWSVQVSRQTGM

Query:  VAYCGADGAVHRFQLTTRAVEKDHSRSRTPHFVCEYLTEEQSTITIHSPASDVPFPLKKLSNKSDPPLSMRAILSDSIQSNEENHKTAMASASENETLAL
        VAYCGADGAV RFQLTT+AV+K++SR+RTPHFVCEYLTEEQS ITIHSPASDVP PLKKLSNKS+ PLSMRAILSDS+Q NE N K+A  SA ENE+ AL
Subjt:  VAYCGADGAVHRFQLTTRAVEKDHSRSRTPHFVCEYLTEEQSTITIHSPASDVPFPLKKLSNKSDPPLSMRAILSDSIQSNEENHKTAMASASENETLAL

Query:  SYGNDVSIESGSEDTLMSIKKRNQTQSKGKKKGVDNQALECSDEPNDAQAKADVLPGSGNDFEIFPPKSVAMHRVRWNMNTGSERWLCYGGAAGILRCQE
         Y +DV +ESGSEDT MSI+ +NQTQSK KKKGV NQ LE S EP+D+Q   DV+PG G  FE FPPKSVA+HR+RWNMN GSERWL YGGAAGILRCQE
Subjt:  SYGNDVSIESGSEDTLMSIKKRNQTQSKGKKKGVDNQALECSDEPNDAQAKADVLPGSGNDFEIFPPKSVAMHRVRWNMNTGSERWLCYGGAAGILRCQE

Query:  IVLSDLDKK
        IVLS LDKK
Subjt:  IVLSDLDKK

A0A6J1J0H6 uncharacterized protein LOC1114815740.0e+0074.67Show/hide
Query:  VEVEPPAAEAS--TGCKRGKKKPVGREKKEAQGRATKKKAGATSVDEEQSTGRLDGMPIKVSEFDHCVENHFSAIDTISKLCGEAEDGDGGIDESDIQRF
        +E  P  AEAS  T CK+GKKK V  E+     +  KKKAGATSV+E Q TGRLD   +KVSEFDHCVENHF AID I++L GEAE+G+GG+DESD QRF
Subjt:  VEVEPPAAEAS--TGCKRGKKKPVGREKKEAQGRATKKKAGATSVDEEQSTGRLDGMPIKVSEFDHCVENHFSAIDTISKLCGEAEDGDGGIDESDIQRF

Query:  SSSTIFLREWRYYNYESKTVKFASDSRGPEGKDADITINLPQFSSAAVLKNGTPSEATTSLDWRNFVMYVGGPVWALDWCPQVHERTDSLIKCEFIAVSA
        SSST FLREW++YNYE KTVKF SDSR PEGKDADIT+ LPQFSSAAVLKNG P  ATTSLD+RNF+M+VGGPVWA+DWCP VHERTDSLIKCEFIAVSA
Subjt:  SSSTIFLREWRYYNYESKTVKFASDSRGPEGKDADITINLPQFSSAAVLKNGTPSEATTSLDWRNFVMYVGGPVWALDWCPQVHERTDSLIKCEFIAVSA

Query:  HPPGSSYHKMGTPLTGRGMVQIWCLLHGTENHEAEPTNAAKGKSNPKKVEVSSDLSSQPKRPRGRPPGPKKNGASDLPSQPKRPRGRPKKKKEESNDNMG
        HPPGSSYH MG PL+GRGMVQIWCL+HGTE+HE+E TNA + K        +SDL SQPKRPRGRPPG KKNGAS L SQ KRPRGRPKKK+EE NDN  
Subjt:  HPPGSSYHKMGTPLTGRGMVQIWCLLHGTENHEAEPTNAAKGKSNPKKVEVSSDLSSQPKRPRGRPPGPKKNGASDLPSQPKRPRGRPKKKKEESNDNMG

Query:  DNYQFVQALSIEYP----------VVPKNSEELVLLENSVERQKCTLQEVSTCNSEDEVPAQKRRVRRKAGTKNHIDDMGMLTLTENREDGSNGINLQAN
         +YQ VQ LS+EYP           VP NSE+LV LENSVER   T++E+STCNSEDEVP QKRR RR A TKNH+DD+G                    
Subjt:  DNYQFVQALSIEYP----------VVPKNSEELVLLENSVERQKCTLQEVSTCNSEDEVPAQKRRVRRKAGTKNHIDDMGMLTLTENREDGSNGINLQAN

Query:  ENVISKNSGEDTLLCNNISKNAVLDTSSIEFSIPGSVALPRVVLCLAHNGKVAWDLKWKPFNARTSKCKHRMGYLAVLLGNGSLEVWEVPFPHVVKAIYS
                     LC NIS+NA+LDT S  FSIP SVALPR+VLCLAHNGKVAWDLKWKP NART+KCK RMGYLAVLLGNGSLEVWE+PFPHVVKAIYS
Subjt:  ENVISKNSGEDTLLCNNISKNAVLDTSSIEFSIPGSVALPRVVLCLAHNGKVAWDLKWKPFNARTSKCKHRMGYLAVLLGNGSLEVWEVPFPHVVKAIYS

Query:  KFNGEGMDPRFVKLKPIFKCSMLRSANTQSIPLTVEWSSTPPYDYLFAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRAVAWAPSESDPESANVV
          NGEG DPRFVKLKP F+CSMLRSA+TQSIPLTVEWS TPPYDYL AGCHDGTVALWKFSANST EDTRPLLRFSADTVPIRAVAWAPSES+PES NV+
Subjt:  KFNGEGMDPRFVKLKPIFKCSMLRSANTQSIPLTVEWSSTPPYDYLFAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRAVAWAPSESDPESANVV

Query:  LTAGHGGLKFWDLRDPFRPLWDLHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLLKAAYDVPVTGKPFTGTKQQGLHSYCCSSFAIWSVQVSRQTGMV
        L A HGG+KFWDLRDPFRPLWDLHPAPR+IYSLDWLP+PRCV LSFDDGTLRLLSLLKAAYDVPVTG+PFT  KQ+GLH+YCCS FAIWS+QVSRQTGMV
Subjt:  LTAGHGGLKFWDLRDPFRPLWDLHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLLKAAYDVPVTGKPFTGTKQQGLHSYCCSSFAIWSVQVSRQTGMV

Query:  AYCGADGAVHRFQLTTRAVEKDHSRSRTPHFVCEYLTEEQSTITIHSPASDVPFPLKKLSNKSDPPLSMRAILSDSIQSNEENHKTAMASASENETLALS
        AYCGADGAV RFQLTT+AV+K++SR+RTPHFVCEYLTEEQS ITIHSPASDVP PLKKLSNKS+ PLSMRAILSDS+Q NE N K+A  SA ENE+ AL 
Subjt:  AYCGADGAVHRFQLTTRAVEKDHSRSRTPHFVCEYLTEEQSTITIHSPASDVPFPLKKLSNKSDPPLSMRAILSDSIQSNEENHKTAMASASENETLALS

Query:  YGNDVSIESGSEDTLMSIKKRNQTQSKGKKKGVDNQALECSDEPNDAQAKADVLPGSGNDFEIFPPKSVAMHRVRWNMNTGSERWLCYGGAAGILRCQEI
        Y +DV +ESGSEDT MSI+ +NQTQSK KKKGV NQ LE S EP+D+Q   DV+PG G+ FE FPPKSVA+HR+RWNMN GSERWLCYGGAAGILRCQEI
Subjt:  YGNDVSIESGSEDTLMSIKKRNQTQSKGKKKGVDNQALECSDEPNDAQAKADVLPGSGNDFEIFPPKSVAMHRVRWNMNTGSERWLCYGGAAGILRCQEI

Query:  VLSDLDKK
        VLS LDKK
Subjt:  VLSDLDKK

A0A6J1KYL7 uncharacterized protein LOC1114987490.0e+0076.04Show/hide
Query:  PAAEASTGCKRGKKKPVGREKKEAQGRATKKKAGATSVDEEQSTGRLDGMPIKVSEFDHCVENHFSAIDTISKLCGEAEDGDGGIDESDIQRFSSSTIFL
        P A   TGC +GKKKPV R+KKE++ RA KKK  ATSV+EEQ+ GR DG  IKV EFDHCVENHF AIDT+++LCG+ EDG G IDE DIQRFSSSTIFL
Subjt:  PAAEASTGCKRGKKKPVGREKKEAQGRATKKKAGATSVDEEQSTGRLDGMPIKVSEFDHCVENHFSAIDTISKLCGEAEDGDGGIDESDIQRFSSSTIFL

Query:  REWRYYNYESKTVKFASDSRGPEGKDADITINLPQFSSAAVLKNGTPSEATTSLDWRNFVMYVGGPVWALDWCPQVHERTDSLIKCEFIAVSAHPPGSSY
        REWRYYNYE KTVKFASDSRGPE KDADI++NLPQFSSAA LKNG P EA TSLD+RNFVMYVGGP+WALDWCPQ HERTDSLIKCE+IAVSAHPP SSY
Subjt:  REWRYYNYESKTVKFASDSRGPEGKDADITINLPQFSSAAVLKNGTPSEATTSLDWRNFVMYVGGPVWALDWCPQVHERTDSLIKCEFIAVSAHPPGSSY

Query:  HKMGTPLTGRGMVQIWCLLHGTENHEAEPTNAAKGKSNPKKVEVSSDLSSQPKRPRGRPPGPKKNGASDLPSQPKRPRGRPKKKKEESNDNMGDNYQFVQ
        HKMG PLTGRGMVQIWC++HGTE+HEAEPT         K    SS+LSSQP++PRGRPPG KKN AS+LPSQPKRPRGRP KKK+ESND          
Subjt:  HKMGTPLTGRGMVQIWCLLHGTENHEAEPTNAAKGKSNPKKVEVSSDLSSQPKRPRGRPPGPKKNGASDLPSQPKRPRGRPKKKKEESNDNMGDNYQFVQ

Query:  ALSIEYPVVPKNSEELVLLENSVERQKCTLQEVSTCNSEDEVPAQKRRVRRKAGTKNHIDDMGMLTLTENREDGSNGINLQANENVISKNSGEDTLLCNN
                            NSVER   TLQEV TCNS+DEV AQK+R+RRK  TKNH+DD+G L LTENRE+ SN INLQANENVIS+ SGEDTLLCNN
Subjt:  ALSIEYPVVPKNSEELVLLENSVERQKCTLQEVSTCNSEDEVPAQKRRVRRKAGTKNHIDDMGMLTLTENREDGSNGINLQANENVISKNSGEDTLLCNN

Query:  ISKNAVLDTSSIEFSIPGSVALPRVVLCLAHNGKVAWDLKWKPFNARTSKCKHRMGYLAVLLGNGSLEVWEVPFPHVVKAIYSKFNGEGMDPRFVKLKPI
        +S+NA LD SSI FSIP SVALPRVVLCLAHNGKVAWDLKWKP NA  +KCKHRMGYLA+LLGNGSLEVWEVPFPHV++AIYSK NGEG DPRFVKLKP 
Subjt:  ISKNAVLDTSSIEFSIPGSVALPRVVLCLAHNGKVAWDLKWKPFNARTSKCKHRMGYLAVLLGNGSLEVWEVPFPHVVKAIYSKFNGEGMDPRFVKLKPI

Query:  FKCSMLRSANTQSIPLTVEWSSTPPYDYLFAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRAVAWAPSESDPESANVVLTAGHGGLKFWDLRDPF
        F+CSMLRSAN QSIPLTVEWSSTPPYDYL AGCHDGTVALWKFSANS CEDTRPLLRFSADTVPIR VAWAP++SDPE  NV+LTAGHGGLKFWDLRDPF
Subjt:  FKCSMLRSANTQSIPLTVEWSSTPPYDYLFAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRAVAWAPSESDPESANVVLTAGHGGLKFWDLRDPF

Query:  RPLWDLHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLLKAAYDVPVTGKPFTGTKQQGLHSYCCSSFAIWSVQVSRQTGMVAYCGADGAVHRFQLTTR
        RPLWDLHPAPR+IYSLDWL DPRC ILSFDDGTLRLLSL KAA DVPVTGKPFT  KQ+GLH+YCC+S AIWSVQVSRQTGMVAYCGADG V RFQLTT+
Subjt:  RPLWDLHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLLKAAYDVPVTGKPFTGTKQQGLHSYCCSSFAIWSVQVSRQTGMVAYCGADGAVHRFQLTTR

Query:  AVEKDHSRSRTPHFVCEYLTEEQSTITIHSPASDVPFPLKKLSNKSD-PPLSMRAILSDSIQSNEENHKTAMASASENETLALSYGNDVSIESGSEDTLM
        AV K++SR+RTP FVC+Y TEEQSTITIH+P  DVPFPLKK+SN+ D PPLSMRAILS+ IQSNE NHKTA AS SEN TLA  + +D ++ESGSEDT  
Subjt:  AVEKDHSRSRTPHFVCEYLTEEQSTITIHSPASDVPFPLKKLSNKSD-PPLSMRAILSDSIQSNEENHKTAMASASENETLALSYGNDVSIESGSEDTLM

Query:  SIKKRNQTQSKGKKKG-VDNQALECSDEPNDAQAKADVLPGSGNDFEIFPPKSVAMHRVRWNMNTGSERWLCYGGAAGILRCQEIVLSDLDKK
        SI K+++TQSK KKKG  DNQ  ECSDE ND        PGSG+  E  PPKSVA+HRVRWNMNTGSERWLCYGGAAG+LRCQEI+LS LDKK
Subjt:  SIKKRNQTQSKGKKKG-VDNQALECSDEPNDAQAKADVLPGSGNDFEIFPPKSVAMHRVRWNMNTGSERWLCYGGAAGILRCQEIVLSDLDKK

SwissProt top hitse value%identityAlignment
Q5RDC3 General transcription factor 3C polypeptide 27.4e-0536.11Show/hide
Query:  YVGGPVWALDWCPQVHERTDSLIKCEFIAVSAHPPGSSYHKMGTPLTGRGMVQIWCLLHGTENHEAEPTNAA
        + GGP+WALDWCP V E   +    +++A+ + P  +  H +    +G G++Q+W L  GT   E+ P N A
Subjt:  YVGGPVWALDWCPQVHERTDSLIKCEFIAVSAHPPGSSYHKMGTPLTGRGMVQIWCLLHGTENHEAEPTNAA

Q8BL74 General transcription factor 3C polypeptide 24.3e-0534.72Show/hide
Query:  YVGGPVWALDWCPQVHERTDSLIKCEFIAVSAHPPGSSYHKMGTPLTGRGMVQIWCLLHGTENHEAEPTNAA
        + GGP+WALDWCP       S    +++A+ + P  +  H +    +G G++Q+W L  GT   E+ P N A
Subjt:  YVGGPVWALDWCPQVHERTDSLIKCEFIAVSAHPPGSSYHKMGTPLTGRGMVQIWCLLHGTENHEAEPTNAA

Q8WUA4 General transcription factor 3C polypeptide 27.4e-0536.11Show/hide
Query:  YVGGPVWALDWCPQVHERTDSLIKCEFIAVSAHPPGSSYHKMGTPLTGRGMVQIWCLLHGTENHEAEPTNAA
        + GGP+WALDWCP V E   +    +++A+ + P  +  H +    +G G++Q+W L  GT   E+ P N A
Subjt:  YVGGPVWALDWCPQVHERTDSLIKCEFIAVSAHPPGSSYHKMGTPLTGRGMVQIWCLLHGTENHEAEPTNAA

Arabidopsis top hitse value%identityAlignment
AT1G19485.1 Transducin/WD40 repeat-like superfamily protein1.6e-21247.85Show/hide
Query:  LDGMPIKVSEFDHCVENHFSAIDTISKLCGEAEDGDGGIDESDIQRFSSSTIFLREWRYYNYESKTVKFASDS-RGPEGKDADITINLPQFSSAAVLKNG
        +DG    +S FD+  E+H  A+++I+ LCGEA   +  IDE+DI   SSS  FLREWR+YN+E K+  F +++ +  + KD + +  LPQFSSA   K  
Subjt:  LDGMPIKVSEFDHCVENHFSAIDTISKLCGEAEDGDGGIDESDIQRFSSSTIFLREWRYYNYESKTVKFASDS-RGPEGKDADITINLPQFSSAAVLKNG

Query:  TPSEATTSLD--WRNFVMYVGGPVWALDWCPQVHERTDSLIKCEFIAVSAHPPGSSYHKMGTPLTGRGMVQIWCLLHGT-ENHEAEPTNAAKGKSNPKKV
           + ++S     ++FVM+VGG VWA++WCP+VH   D+  KCEF+AV+ HPP S  HK+G PL GRG++QIWC+++ T +    + ++  K  +   + 
Subjt:  TPSEATTSLD--WRNFVMYVGGPVWALDWCPQVHERTDSLIKCEFIAVSAHPPGSSYHKMGTPLTGRGMVQIWCLLHGT-ENHEAEPTNAAKGKSNPKKV

Query:  EVSSDL--SSQPKRPRGRPPGPKKNGASDLPSQPKRPRGRPKKKK-EESNDNMGDNYQFVQALSIEYP---VVPKNSEELVLLENSVERQKC----TLQE
        + S +   +++PK+PRGR   P+K+      ++PK+PRGRP+KK   E    + D+  +V+ALS+ YP   VVP      +L E  V   K     + Q 
Subjt:  EVSSDL--SSQPKRPRGRPPGPKKNGASDLPSQPKRPRGRPKKKK-EESNDNMGDNYQFVQALSIEYP---VVPKNSEELVLLENSVERQKC----TLQE

Query:  VSTCNSEDEVPAQKRRVRRKAGTKNHIDDMGMLTLTENREDGSNGINLQANE---NVISKNSGEDTLLCNNISKNAVLDTSSIEFSIPGSVALPRVVLCL
        +S+ N+  ++P +++R +                 T++ E+    + L+ +E   NV SK S       + IS++               VALPRVVLCL
Subjt:  VSTCNSEDEVPAQKRRVRRKAGTKNHIDDMGMLTLTENREDGSNGINLQANE---NVISKNSGEDTLLCNNISKNAVLDTSSIEFSIPGSVALPRVVLCL

Query:  AHNGKVAWDLKWKPFNARTSKCKHRMGYLAVLLGNGSLEVWEVPFPHVVKAIYSKFNGEGMDPRFVKLKPIFKCSMLRSANTQSIPLTVEWSSTPPYDYL
        AHNGKV WD+KW+P  A  S  KH MGYLAVLLGNGSLEVW+VP P    A+Y        DPRFVKL P+FKCS L+  +T+SIPLTVEWS+    D+L
Subjt:  AHNGKVAWDLKWKPFNARTSKCKHRMGYLAVLLGNGSLEVWEVPFPHVVKAIYSKFNGEGMDPRFVKLKPIFKCSMLRSANTQSIPLTVEWSSTPPYDYL

Query:  FAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRAVAWAPSESDPESANVVLTAGHGGLKFWDLRDPFRPLWDLHPAPRMIYSLDWLPDPRCVILSF
         AGCHDGTVALWKFS   + EDTRPLL FSADT PIRAVAWAP ESD ESAN+V TAGH GLKFWDLRDPFRPLWDLHP PR IYSLDWL DP CV+LSF
Subjt:  FAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRAVAWAPSESDPESANVVLTAGHGGLKFWDLRDPFRPLWDLHPAPRMIYSLDWLPDPRCVILSF

Query:  DDGTLRLLSLLKAAYDVPVTGKPFTGTKQQGLHSYCCSSFAIWSVQVSRQTGMVAYCGADGAVHRFQLTTRAVEKDHSRSRTPHFVCEYLTEEQSTITIH
        DDGTLR+LSL+K AYDVP TG+P+  TKQQGL  Y CS+F IWS+QVSR TG+ AYC ADG++  F+LTT+AVEKD +R+RTPH++C  LT + ST  +H
Subjt:  DDGTLRLLSLLKAAYDVPVTGKPFTGTKQQGLHSYCCSSFAIWSVQVSRQTGMVAYCGADGAVHRFQLTTRAVEKDHSRSRTPHFVCEYLTEEQSTITIH

Query:  SPASDVPFPLKK-LSNKSDPPLSMRAILSDSIQSNEENHKTAMASASENETLALSYGNDVSIESGSEDTLMSIKKRNQTQSKGKKKGVDNQ---ALECSD
        SP  D+P  LKK +    +    +R++L++S      N        S+ + LA ++  D  +ES SE T     K    + K   +  +++   AL C  
Subjt:  SPASDVPFPLKK-LSNKSDPPLSMRAILSDSIQSNEENHKTAMASASENETLALSYGNDVSIESGSEDTLMSIKKRNQTQSKGKKKGVDNQ---ALECSD

Query:  E---PNDAQAKADVLPGSGNDFEIFPPKSVAMHRVRWNMNTGSERWLCYGGAAGILRCQEI
        E     + + KA     +G   E FPPK VAMHRVRWNMN GSERWLCYGGAAGI+RCQEI
Subjt:  E---PNDAQAKADVLPGSGNDFEIFPPKSVAMHRVRWNMNTGSERWLCYGGAAGILRCQEI

AT1G19485.2 Transducin/WD40 repeat-like superfamily protein1.6e-21247.85Show/hide
Query:  LDGMPIKVSEFDHCVENHFSAIDTISKLCGEAEDGDGGIDESDIQRFSSSTIFLREWRYYNYESKTVKFASDS-RGPEGKDADITINLPQFSSAAVLKNG
        +DG    +S FD+  E+H  A+++I+ LCGEA   +  IDE+DI   SSS  FLREWR+YN+E K+  F +++ +  + KD + +  LPQFSSA   K  
Subjt:  LDGMPIKVSEFDHCVENHFSAIDTISKLCGEAEDGDGGIDESDIQRFSSSTIFLREWRYYNYESKTVKFASDS-RGPEGKDADITINLPQFSSAAVLKNG

Query:  TPSEATTSLD--WRNFVMYVGGPVWALDWCPQVHERTDSLIKCEFIAVSAHPPGSSYHKMGTPLTGRGMVQIWCLLHGT-ENHEAEPTNAAKGKSNPKKV
           + ++S     ++FVM+VGG VWA++WCP+VH   D+  KCEF+AV+ HPP S  HK+G PL GRG++QIWC+++ T +    + ++  K  +   + 
Subjt:  TPSEATTSLD--WRNFVMYVGGPVWALDWCPQVHERTDSLIKCEFIAVSAHPPGSSYHKMGTPLTGRGMVQIWCLLHGT-ENHEAEPTNAAKGKSNPKKV

Query:  EVSSDL--SSQPKRPRGRPPGPKKNGASDLPSQPKRPRGRPKKKK-EESNDNMGDNYQFVQALSIEYP---VVPKNSEELVLLENSVERQKC----TLQE
        + S +   +++PK+PRGR   P+K+      ++PK+PRGRP+KK   E    + D+  +V+ALS+ YP   VVP      +L E  V   K     + Q 
Subjt:  EVSSDL--SSQPKRPRGRPPGPKKNGASDLPSQPKRPRGRPKKKK-EESNDNMGDNYQFVQALSIEYP---VVPKNSEELVLLENSVERQKC----TLQE

Query:  VSTCNSEDEVPAQKRRVRRKAGTKNHIDDMGMLTLTENREDGSNGINLQANE---NVISKNSGEDTLLCNNISKNAVLDTSSIEFSIPGSVALPRVVLCL
        +S+ N+  ++P +++R +                 T++ E+    + L+ +E   NV SK S       + IS++               VALPRVVLCL
Subjt:  VSTCNSEDEVPAQKRRVRRKAGTKNHIDDMGMLTLTENREDGSNGINLQANE---NVISKNSGEDTLLCNNISKNAVLDTSSIEFSIPGSVALPRVVLCL

Query:  AHNGKVAWDLKWKPFNARTSKCKHRMGYLAVLLGNGSLEVWEVPFPHVVKAIYSKFNGEGMDPRFVKLKPIFKCSMLRSANTQSIPLTVEWSSTPPYDYL
        AHNGKV WD+KW+P  A  S  KH MGYLAVLLGNGSLEVW+VP P    A+Y        DPRFVKL P+FKCS L+  +T+SIPLTVEWS+    D+L
Subjt:  AHNGKVAWDLKWKPFNARTSKCKHRMGYLAVLLGNGSLEVWEVPFPHVVKAIYSKFNGEGMDPRFVKLKPIFKCSMLRSANTQSIPLTVEWSSTPPYDYL

Query:  FAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRAVAWAPSESDPESANVVLTAGHGGLKFWDLRDPFRPLWDLHPAPRMIYSLDWLPDPRCVILSF
         AGCHDGTVALWKFS   + EDTRPLL FSADT PIRAVAWAP ESD ESAN+V TAGH GLKFWDLRDPFRPLWDLHP PR IYSLDWL DP CV+LSF
Subjt:  FAGCHDGTVALWKFSANSTCEDTRPLLRFSADTVPIRAVAWAPSESDPESANVVLTAGHGGLKFWDLRDPFRPLWDLHPAPRMIYSLDWLPDPRCVILSF

Query:  DDGTLRLLSLLKAAYDVPVTGKPFTGTKQQGLHSYCCSSFAIWSVQVSRQTGMVAYCGADGAVHRFQLTTRAVEKDHSRSRTPHFVCEYLTEEQSTITIH
        DDGTLR+LSL+K AYDVP TG+P+  TKQQGL  Y CS+F IWS+QVSR TG+ AYC ADG++  F+LTT+AVEKD +R+RTPH++C  LT + ST  +H
Subjt:  DDGTLRLLSLLKAAYDVPVTGKPFTGTKQQGLHSYCCSSFAIWSVQVSRQTGMVAYCGADGAVHRFQLTTRAVEKDHSRSRTPHFVCEYLTEEQSTITIH

Query:  SPASDVPFPLKK-LSNKSDPPLSMRAILSDSIQSNEENHKTAMASASENETLALSYGNDVSIESGSEDTLMSIKKRNQTQSKGKKKGVDNQ---ALECSD
        SP  D+P  LKK +    +    +R++L++S      N        S+ + LA ++  D  +ES SE T     K    + K   +  +++   AL C  
Subjt:  SPASDVPFPLKK-LSNKSDPPLSMRAILSDSIQSNEENHKTAMASASENETLALSYGNDVSIESGSEDTLMSIKKRNQTQSKGKKKGVDNQ---ALECSD

Query:  E---PNDAQAKADVLPGSGNDFEIFPPKSVAMHRVRWNMNTGSERWLCYGGAAGILRCQEI
        E     + + KA     +G   E FPPK VAMHRVRWNMN GSERWLCYGGAAGI+RCQEI
Subjt:  E---PNDAQAKADVLPGSGNDFEIFPPKSVAMHRVRWNMNTGSERWLCYGGAAGILRCQEI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAACATCATCATCAAGTGGAAGTGGAGCCTCCGGCAGCGGAAGCATCCACTGGCTGCAAGAGAGGGAAGAAGAAACCAGTGGGTCGGGAAAAGAAGGAAGCGCAGGG
AAGAGCTACGAAGAAGAAGGCGGGGGCAACTTCGGTCGACGAAGAGCAATCTACAGGCCGTTTAGATGGCATGCCGATTAAGGTTTCGGAGTTTGATCATTGTGTTGAAA
ATCATTTTAGCGCCATTGATACAATTTCCAAGCTCTGTGGTGAGGCAGAGGATGGCGATGGCGGAATTGACGAAAGTGACATTCAGAGGTTTTCGTCATCCACCATTTTC
TTAAGGGAATGGAGGTACTATAATTACGAATCGAAAACCGTCAAGTTTGCTAGTGACTCGAGAGGCCCTGAGGGTAAGGATGCTGACATCACGATAAACTTACCACAGTT
TTCTTCTGCTGCTGTTCTGAAGAATGGAACACCATCTGAAGCCACTACATCTCTAGACTGGAGAAATTTTGTTATGTATGTTGGCGGGCCTGTTTGGGCTTTAGATTGGT
GTCCTCAAGTTCATGAAAGAACTGACTCCCTAATCAAATGTGAGTTTATTGCTGTTTCTGCTCATCCACCTGGCTCTTCTTATCACAAGATGGGTACCCCACTCACAGGA
AGAGGTATGGTGCAGATATGGTGCTTACTCCATGGCACTGAAAACCATGAAGCAGAGCCAACCAATGCAGCAAAGGGCAAATCGAACCCTAAAAAGGTTGAGGTATCATC
AGACTTATCATCTCAACCAAAGAGGCCTAGAGGAAGACCACCAGGGCCTAAGAAAAATGGGGCATCAGACTTGCCATCTCAACCGAAGAGGCCTAGAGGAAGACCTAAAA
AGAAAAAAGAAGAATCCAATGATAACATGGGTGACAATTACCAATTTGTTCAGGCCCTTTCTATTGAATACCCAGTTGTCCCTAAAAATTCTGAAGAACTTGTATTACTG
GAAAATAGTGTTGAAAGACAGAAGTGTACCTTACAAGAAGTTTCTACATGCAATTCTGAAGATGAAGTTCCTGCGCAGAAGAGAAGAGTGAGAAGAAAAGCTGGGACTAA
GAATCATATTGATGACATGGGGATGTTAACACTTACAGAAAATCGAGAAGATGGATCCAATGGTATCAATCTTCAGGCAAATGAGAATGTTATAAGCAAAAATTCTGGGG
AAGACACTCTATTATGTAATAATATTTCAAAGAATGCTGTATTAGACACTAGCTCAATTGAATTCTCTATTCCCGGGAGTGTTGCTTTGCCTAGAGTGGTATTGTGCTTA
GCTCACAATGGGAAGGTGGCATGGGATTTGAAATGGAAGCCATTTAATGCGCGTACTAGCAAATGCAAGCACCGAATGGGCTACCTTGCTGTCTTGCTGGGCAACGGATC
TCTAGAAGTCTGGGAGGTCCCTTTTCCTCACGTAGTGAAGGCAATCTATTCTAAATTCAATGGGGAGGGTATGGATCCTCGCTTTGTTAAGTTGAAGCCTATTTTCAAAT
GCTCGATGTTGAGAAGTGCAAATACACAGAGCATCCCTCTGACAGTGGAATGGTCGTCAACACCTCCTTATGATTATCTATTCGCTGGATGCCATGATGGAACAGTTGCC
TTGTGGAAGTTTTCTGCAAATAGTACCTGTGAAGATACAAGGCCTTTACTTCGTTTTAGTGCAGATACAGTTCCCATAAGAGCGGTTGCATGGGCACCAAGTGAAAGCGA
TCCAGAAAGTGCAAATGTGGTACTTACTGCTGGTCATGGAGGTTTAAAGTTTTGGGACCTAAGAGATCCATTCCGTCCTTTGTGGGACCTTCATCCAGCGCCCAGGATGA
TATATAGCCTGGATTGGCTTCCTGATCCTAGATGCGTTATTTTATCCTTTGATGATGGAACATTGAGACTTCTCAGTTTGCTAAAGGCTGCATATGACGTTCCAGTAACC
GGTAAACCCTTCACAGGGACTAAACAACAAGGTTTACATAGTTACTGTTGTTCATCATTTGCTATCTGGAGTGTTCAAGTGTCACGGCAGACAGGCATGGTTGCATACTG
CGGTGCTGATGGAGCTGTTCACCGTTTCCAGCTTACTACCAGAGCGGTGGAGAAAGATCATTCACGTAGTCGAACCCCGCATTTTGTATGCGAATACTTAACTGAGGAGC
AATCAACTATTACAATCCACTCTCCAGCATCAGATGTTCCATTCCCTTTGAAGAAGCTGTCCAACAAATCCGACCCGCCATTGTCCATGCGAGCTATTTTATCTGATTCA
ATTCAGTCAAATGAAGAAAATCACAAGACGGCCATGGCTTCTGCATCGGAAAATGAGACATTAGCCCTTAGCTATGGCAATGATGTCAGTATTGAATCTGGGTCTGAGGA
TACACTGATGTCCATCAAGAAGAGAAACCAAACTCAATCAAAGGGCAAGAAGAAGGGAGTGGATAACCAAGCATTGGAATGTAGTGATGAGCCTAACGATGCACAAGCGA
AAGCTGACGTATTGCCGGGTTCGGGCAATGATTTTGAAATTTTCCCTCCCAAATCAGTTGCAATGCATAGAGTGAGATGGAACATGAACACAGGGAGTGAAAGATGGTTG
TGCTATGGTGGAGCAGCTGGAATTCTACGATGTCAGGAGATTGTGTTGTCTGACCTCGATAAGAAATCTGGTTTGCTTCTGTTGCTTAAAGGGAAGCCATTTCTAATCAG
AACAGTGATCACGGTGCCTTCTTCATCCTTGAGGGGCCAAATTATGGGAGCCGCCAAAAAGAATGACAGAGCATGGGATAACTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAACATCATCATCAAGTGGAAGTGGAGCCTCCGGCAGCGGAAGCATCCACTGGCTGCAAGAGAGGGAAGAAGAAACCAGTGGGTCGGGAAAAGAAGGAAGCGCAGGG
AAGAGCTACGAAGAAGAAGGCGGGGGCAACTTCGGTCGACGAAGAGCAATCTACAGGCCGTTTAGATGGCATGCCGATTAAGGTTTCGGAGTTTGATCATTGTGTTGAAA
ATCATTTTAGCGCCATTGATACAATTTCCAAGCTCTGTGGTGAGGCAGAGGATGGCGATGGCGGAATTGACGAAAGTGACATTCAGAGGTTTTCGTCATCCACCATTTTC
TTAAGGGAATGGAGGTACTATAATTACGAATCGAAAACCGTCAAGTTTGCTAGTGACTCGAGAGGCCCTGAGGGTAAGGATGCTGACATCACGATAAACTTACCACAGTT
TTCTTCTGCTGCTGTTCTGAAGAATGGAACACCATCTGAAGCCACTACATCTCTAGACTGGAGAAATTTTGTTATGTATGTTGGCGGGCCTGTTTGGGCTTTAGATTGGT
GTCCTCAAGTTCATGAAAGAACTGACTCCCTAATCAAATGTGAGTTTATTGCTGTTTCTGCTCATCCACCTGGCTCTTCTTATCACAAGATGGGTACCCCACTCACAGGA
AGAGGTATGGTGCAGATATGGTGCTTACTCCATGGCACTGAAAACCATGAAGCAGAGCCAACCAATGCAGCAAAGGGCAAATCGAACCCTAAAAAGGTTGAGGTATCATC
AGACTTATCATCTCAACCAAAGAGGCCTAGAGGAAGACCACCAGGGCCTAAGAAAAATGGGGCATCAGACTTGCCATCTCAACCGAAGAGGCCTAGAGGAAGACCTAAAA
AGAAAAAAGAAGAATCCAATGATAACATGGGTGACAATTACCAATTTGTTCAGGCCCTTTCTATTGAATACCCAGTTGTCCCTAAAAATTCTGAAGAACTTGTATTACTG
GAAAATAGTGTTGAAAGACAGAAGTGTACCTTACAAGAAGTTTCTACATGCAATTCTGAAGATGAAGTTCCTGCGCAGAAGAGAAGAGTGAGAAGAAAAGCTGGGACTAA
GAATCATATTGATGACATGGGGATGTTAACACTTACAGAAAATCGAGAAGATGGATCCAATGGTATCAATCTTCAGGCAAATGAGAATGTTATAAGCAAAAATTCTGGGG
AAGACACTCTATTATGTAATAATATTTCAAAGAATGCTGTATTAGACACTAGCTCAATTGAATTCTCTATTCCCGGGAGTGTTGCTTTGCCTAGAGTGGTATTGTGCTTA
GCTCACAATGGGAAGGTGGCATGGGATTTGAAATGGAAGCCATTTAATGCGCGTACTAGCAAATGCAAGCACCGAATGGGCTACCTTGCTGTCTTGCTGGGCAACGGATC
TCTAGAAGTCTGGGAGGTCCCTTTTCCTCACGTAGTGAAGGCAATCTATTCTAAATTCAATGGGGAGGGTATGGATCCTCGCTTTGTTAAGTTGAAGCCTATTTTCAAAT
GCTCGATGTTGAGAAGTGCAAATACACAGAGCATCCCTCTGACAGTGGAATGGTCGTCAACACCTCCTTATGATTATCTATTCGCTGGATGCCATGATGGAACAGTTGCC
TTGTGGAAGTTTTCTGCAAATAGTACCTGTGAAGATACAAGGCCTTTACTTCGTTTTAGTGCAGATACAGTTCCCATAAGAGCGGTTGCATGGGCACCAAGTGAAAGCGA
TCCAGAAAGTGCAAATGTGGTACTTACTGCTGGTCATGGAGGTTTAAAGTTTTGGGACCTAAGAGATCCATTCCGTCCTTTGTGGGACCTTCATCCAGCGCCCAGGATGA
TATATAGCCTGGATTGGCTTCCTGATCCTAGATGCGTTATTTTATCCTTTGATGATGGAACATTGAGACTTCTCAGTTTGCTAAAGGCTGCATATGACGTTCCAGTAACC
GGTAAACCCTTCACAGGGACTAAACAACAAGGTTTACATAGTTACTGTTGTTCATCATTTGCTATCTGGAGTGTTCAAGTGTCACGGCAGACAGGCATGGTTGCATACTG
CGGTGCTGATGGAGCTGTTCACCGTTTCCAGCTTACTACCAGAGCGGTGGAGAAAGATCATTCACGTAGTCGAACCCCGCATTTTGTATGCGAATACTTAACTGAGGAGC
AATCAACTATTACAATCCACTCTCCAGCATCAGATGTTCCATTCCCTTTGAAGAAGCTGTCCAACAAATCCGACCCGCCATTGTCCATGCGAGCTATTTTATCTGATTCA
ATTCAGTCAAATGAAGAAAATCACAAGACGGCCATGGCTTCTGCATCGGAAAATGAGACATTAGCCCTTAGCTATGGCAATGATGTCAGTATTGAATCTGGGTCTGAGGA
TACACTGATGTCCATCAAGAAGAGAAACCAAACTCAATCAAAGGGCAAGAAGAAGGGAGTGGATAACCAAGCATTGGAATGTAGTGATGAGCCTAACGATGCACAAGCGA
AAGCTGACGTATTGCCGGGTTCGGGCAATGATTTTGAAATTTTCCCTCCCAAATCAGTTGCAATGCATAGAGTGAGATGGAACATGAACACAGGGAGTGAAAGATGGTTG
TGCTATGGTGGAGCAGCTGGAATTCTACGATGTCAGGAGATTGTGTTGTCTGACCTCGATAAGAAATCTGGTTTGCTTCTGTTGCTTAAAGGGAAGCCATTTCTAATCAG
AACAGTGATCACGGTGCCTTCTTCATCCTTGAGGGGCCAAATTATGGGAGCCGCCAAAAAGAATGACAGAGCATGGGATAACTGA
Protein sequenceShow/hide protein sequence
MEHHHQVEVEPPAAEASTGCKRGKKKPVGREKKEAQGRATKKKAGATSVDEEQSTGRLDGMPIKVSEFDHCVENHFSAIDTISKLCGEAEDGDGGIDESDIQRFSSSTIF
LREWRYYNYESKTVKFASDSRGPEGKDADITINLPQFSSAAVLKNGTPSEATTSLDWRNFVMYVGGPVWALDWCPQVHERTDSLIKCEFIAVSAHPPGSSYHKMGTPLTG
RGMVQIWCLLHGTENHEAEPTNAAKGKSNPKKVEVSSDLSSQPKRPRGRPPGPKKNGASDLPSQPKRPRGRPKKKKEESNDNMGDNYQFVQALSIEYPVVPKNSEELVLL
ENSVERQKCTLQEVSTCNSEDEVPAQKRRVRRKAGTKNHIDDMGMLTLTENREDGSNGINLQANENVISKNSGEDTLLCNNISKNAVLDTSSIEFSIPGSVALPRVVLCL
AHNGKVAWDLKWKPFNARTSKCKHRMGYLAVLLGNGSLEVWEVPFPHVVKAIYSKFNGEGMDPRFVKLKPIFKCSMLRSANTQSIPLTVEWSSTPPYDYLFAGCHDGTVA
LWKFSANSTCEDTRPLLRFSADTVPIRAVAWAPSESDPESANVVLTAGHGGLKFWDLRDPFRPLWDLHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLLKAAYDVPVT
GKPFTGTKQQGLHSYCCSSFAIWSVQVSRQTGMVAYCGADGAVHRFQLTTRAVEKDHSRSRTPHFVCEYLTEEQSTITIHSPASDVPFPLKKLSNKSDPPLSMRAILSDS
IQSNEENHKTAMASASENETLALSYGNDVSIESGSEDTLMSIKKRNQTQSKGKKKGVDNQALECSDEPNDAQAKADVLPGSGNDFEIFPPKSVAMHRVRWNMNTGSERWL
CYGGAAGILRCQEIVLSDLDKKSGLLLLLKGKPFLIRTVITVPSSSLRGQIMGAAKKNDRAWDN