; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS019545 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS019545
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionDNA binding protein, putative isoform 1
Genome locationscaffold729:331624..339581
RNA-Seq ExpressionMS019545
SyntenyMS019545
Gene Ontology termsGO:0006383 - transcription by RNA polymerase III (biological process)
GO:0000127 - transcription factor TFIIIC complex (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR001680 - WD40 repeat
IPR015943 - WD40/YVTN repeat-like-containing domain superfamily
IPR017956 - AT hook, DNA-binding motif
IPR036322 - WD40-repeat-containing domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7017442.1 General transcription factor 3C polypeptide 2 [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0071.1Show/hide
Query:  PPVPAASTGTGGKRGKRKQPVARQKKEAPGRAKKKPGGASADEEQPTGRLDGVGIKVLEFDHCAENHFRAMDTIAELCGEAEDGDGGIDESDIQRFSSSA
        P    AS GT  K+GK+K     + ++   RAKKK G  S +E QPTGRLD   +KV EFDHC ENHFRA+D IAEL GEAE+G+GG+DESD QRFSSS 
Subjt:  PPVPAASTGTGGKRGKRKQPVARQKKEAPGRAKKKPGGASADEEQPTGRLDGVGIKVLEFDHCAENHFRAMDTIAELCGEAEDGDGGIDESDIQRFSSSA

Query:  FFLREWRFYNYEPKTVKFASDLRGSEGKDGDITINLPQFSSAAVLKNGTPSGAATSLDWRNFVMYVGGPVWALDWCPQVLEKTDALIKCEFIAVSAHPPG
         FLREW+FYNYEPKTVKF SD R  EGKD DIT+ LPQFSSAAVLKNG P GA TSLD+RNF+M+VGGPVWA+DWCP V E++D+LIKCEFIAVSAHPPG
Subjt:  FFLREWRFYNYEPKTVKFASDLRGSEGKDGDITINLPQFSSAAVLKNGTPSGAATSLDWRNFVMYVGGPVWALDWCPQVLEKTDALIKCEFIAVSAHPPG

Query:  SSYHKMGTPLIGRGMVQIWCLVHGTENHEPEPAYATKCKSKPKKDEVSSDLSSQPKRPRGRPPGTKKKGASDLPSQPKRPRGRPKKKQEGSN-DNMGDNN
        SSYH MG PL GRGMVQIWCLVHGTE+HE E   AT+CK         SDL SQPKRPRGRPPG KK GAS LPSQPKRPRGRPK KQE  N DN   + 
Subjt:  SSYHKMGTPLIGRGMVQIWCLVHGTENHEPEPAYATKCKSKPKKDEVSSDLSSQPKRPRGRPPGTKKKGASDLPSQPKRPRGRPKKKQEGSN-DNMGDNN

Query:  QIVQSLSVEYPAGSSNLLEIDGDPKNSEELLLLGNSVERQKSTLQAVSTCNSKDEGPAQKRRVRRKVGTKNHIDDMGTLPFTVNREDGSSTISFQENENV
        Q+VQ LSVEYP  SSNLLEID  P NSE+ + L NSVER  ST++ +STCNS+DE P QKRRVRR   TKNH+DD+GTL    NREDGS+  + + NENV
Subjt:  QIVQSLSVEYPAGSSNLLEIDGDPKNSEELLLLGNSVERQKSTLQAVSTCNSKDEGPAQKRRVRRKVGTKNHIDDMGTLPFTVNREDGSSTISFQENENV

Query:  ISEYSGEDTLLCNNISKNA-------EFSIPESVALPRVVLCLAHNGKVAWDLKWKPSNACTTNCKHRMGYLAVLLGNGSLEVWEIPFPHVVKAIYSKFN
         SEYSGEDTLLC NIS+ A        FSIPE+VALPR+VLCLAHNGKVAWDLKWKP+NA TT CK RMGYLAVLLGNGSLEVWE+PFPHVVKAIYSK N
Subjt:  ISEYSGEDTLLCNNISKNA-------EFSIPESVALPRVVLCLAHNGKVAWDLKWKPSNACTTNCKHRMGYLAVLLGNGSLEVWEIPFPHVVKAIYSKFN

Query:  REGTDPRFVKLKPIFRSTMLKSANIQSIPLTVEWSSTPPYDYLFAGCNDGTVALWKFSANSTCEGCFSCPNF-----------------DPESANVVLTA
         EGTDPRFVKLKP FR +ML+SA+ QSIPLTVEWS TPPYDYL AGC+DGTVALWKFSANST E       F                 +PES NV+L A
Subjt:  REGTDPRFVKLKPIFRSTMLKSANIQSIPLTVEWSSTPPYDYLFAGCNDGTVALWKFSANSTCEGCFSCPNF-----------------DPESANVVLTA

Query:  SHGGLKFWDLRDPFRPLWDIHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLLKAAYDVPVTGKPFTGTKQQGLHSYYGSSFAIWSVQVSRQTGMVAYC
        SHGG+KFWDLRDPFRPLWD+HPAPR+IYSLDWLP+PRCV LSFDDGTLRLLSLLKAAYDVPVTG+PFT  KQ+GLH+Y  S FAIWS+QVSRQTGMVAYC
Subjt:  SHGGLKFWDLRDPFRPLWDIHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLLKAAYDVPVTGKPFTGTKQQGLHSYYGSSFAIWSVQVSRQTGMVAYC

Query:  SADGAVLRFQLTTRAVEKDHSRNRTPHFICEYLTEEESAITIHSPASGVPFPLKKASNKSDLPLSFRAILSDSIESNEGNHKTATASASENEALAIYHDN
         ADGAV+RFQLTT+AV+K++SRNRTPHF+CEYLTEE+S ITIHSPAS VP PLKK SNKS+ PLS RAILSDS++ NEGN K+AT SA  NE+ A+ +D+
Subjt:  SADGAVLRFQLTTRAVEKDHSRNRTPHFICEYLTEEESAITIHSPASGVPFPLKKASNKSDLPLSFRAILSDSIESNEGNHKTATASASENEALAIYHDN

Query:  DVSVDSGSEDTLMSMKKKNQTQSKCKKKEVDSQALECSDEPNDAQTKTDELPGSGDNFETFPPKSVALHRVRWNMNTGSERWLCYGGEAGIIRCQEIVLS
        DV V+SGSEDT MS++ KNQTQSK KKK V +Q LE S EP+D+QT  D +PG G++FE FPPKSVALHR+RWNMN GSERWL YGG AGI+RCQEIVLS
Subjt:  DVSVDSGSEDTLMSMKKKNQTQSKCKKKEVDSQALECSDEPNDAQTKTDELPGSGDNFETFPPKSVALHRVRWNMNTGSERWLCYGGEAGIIRCQEIVLS

Query:  DFDKKLMRKK
          DKKLM KK
Subjt:  DFDKKLMRKK

XP_022144681.1 uncharacterized protein LOC111014310 isoform X1 [Momordica charantia]0.0e+0097.46Show/hide
Query:  VEVVEPPVPAASTGTGGKRGKRKQPVARQKKEAPGRAKKKPGGASADEEQPTGRLDGVGIKVLEFDHCAENHFRAMDTIAELCGEAEDGDGGIDESDIQR
        VEVVEPPVPAAST TGGKRGKRKQPVARQKKEAPGRAKKKPGGASADEEQPTGRLDGVGIKVLEFDHCAENHFRAMDTIAELCGEAEDGDGGIDESDIQR
Subjt:  VEVVEPPVPAASTGTGGKRGKRKQPVARQKKEAPGRAKKKPGGASADEEQPTGRLDGVGIKVLEFDHCAENHFRAMDTIAELCGEAEDGDGGIDESDIQR

Query:  FSSSAFFLREWRFYNYEPKTVKFASDLRGSEGKDGDITINLPQFSSAAVLKNGTPSGAATSLDWRNFVMYVGGPVWALDWCPQVLEKTDALIKCEFIAVS
        FSSSAFFLREWRFYNYEPKTVKFASDLRGSEGKDGDITINLPQFSSAAVLKNGTPSGAATSLDWRNFVMYVGGPVWALDWCPQVLEKTDALIKCEFIAVS
Subjt:  FSSSAFFLREWRFYNYEPKTVKFASDLRGSEGKDGDITINLPQFSSAAVLKNGTPSGAATSLDWRNFVMYVGGPVWALDWCPQVLEKTDALIKCEFIAVS

Query:  AHPPGSSYHKMGTPLIGRGMVQIWCLVHGTENHEPEPAYATKCKSKPKKDEVSSDLSSQPKRPRGRPPGTKKKGASDLPSQPKRPRGRPKKKQEGSNDNM
        AHPPGSSYHKMGTPLIGRGMVQIWCLVHGTENHEPEPAYATKCKSKPKKDEVSSDLSSQPKRPRGRPPGTKKKGASDLPSQPKRPRGRPKKKQEGSNDNM
Subjt:  AHPPGSSYHKMGTPLIGRGMVQIWCLVHGTENHEPEPAYATKCKSKPKKDEVSSDLSSQPKRPRGRPPGTKKKGASDLPSQPKRPRGRPKKKQEGSNDNM

Query:  GDNNQIVQSLSVEYPAGSSNLLEIDGDPKNSEELLLLGNSVERQKSTLQAVSTCNSKDEGPAQKRRVRRKVGTKNHIDDMGTLPFTVNREDGSSTISFQE
        GDNNQIVQSLSVEYPAGSSNLLEIDGDPKNSEELLLLGNSVERQKSTLQAVSTCNSKDEGPAQKRRVRRKVGTKNHIDDMGTLPFTVNREDGSSTISFQE
Subjt:  GDNNQIVQSLSVEYPAGSSNLLEIDGDPKNSEELLLLGNSVERQKSTLQAVSTCNSKDEGPAQKRRVRRKVGTKNHIDDMGTLPFTVNREDGSSTISFQE

Query:  NENVISEYSGEDTLLCNNISKNAEFSIPESVALPRVVLCLAHNGKVAWDLKWKPSNACTTNCKHRMGYLAVLLGNGSLEVWEIPFPHVVKAIYSKFNREG
        NENVISEYSGEDTLLCNNISKNAEFSIPESVALPRVVLCLAHNGKVAWDLKWKPSNACTTNCKHRMGYLAVLLGNGSLEVWEIPFPHVVKAIYSKFNREG
Subjt:  NENVISEYSGEDTLLCNNISKNAEFSIPESVALPRVVLCLAHNGKVAWDLKWKPSNACTTNCKHRMGYLAVLLGNGSLEVWEIPFPHVVKAIYSKFNREG

Query:  TDPRFVKLKPIFRSTMLKSANIQSIPLTVEWSSTPPYDYLFAGCNDGTVALWKFSANSTCEGC-----FS-----------CPN-FDPESANVVLTASHG
        TDPRFVKLKPIFRSTMLKSANIQSIPLTVEWSSTPPYDYLFAGCNDGTVALWKFSANSTCE       FS            PN  DPESANVVLTASHG
Subjt:  TDPRFVKLKPIFRSTMLKSANIQSIPLTVEWSSTPPYDYLFAGCNDGTVALWKFSANSTCEGC-----FS-----------CPN-FDPESANVVLTASHG

Query:  GLKFWDLRDPFRPLWDIHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLLKAAYDVPVTGKPFTGTKQQGLHSYYGSSFAIWSVQVSRQTGMVAYCSAD
        GLKFWDLRDPFRPLWDIHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLLKAAYDVPVTGKPFTGTKQQGLHSYYGSSFAIWSVQVSRQTGMVAYCSAD
Subjt:  GLKFWDLRDPFRPLWDIHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLLKAAYDVPVTGKPFTGTKQQGLHSYYGSSFAIWSVQVSRQTGMVAYCSAD

Query:  GAVLRFQLTTRAVEKDHSRNRTPHFICEYLTEEESAITIHSPASGVPFPLKKASNKSDLPLSFRAILSDSIESNEGNHKTATASASENEALAIYHDNDVS
        GAVLRFQLTTRAVEKDHSRNRTPHFICEYLTEEESAITIHSPASGVPFPLKKASNKSDLPLSFRAILSDSIESNEGNHKTATA+ASENEALAIYHDNDVS
Subjt:  GAVLRFQLTTRAVEKDHSRNRTPHFICEYLTEEESAITIHSPASGVPFPLKKASNKSDLPLSFRAILSDSIESNEGNHKTATASASENEALAIYHDNDVS

Query:  VDSGSEDTLMSMKKKNQTQSKCKKKEVDSQALECSDEPNDAQTKTDELPGSGDNFETFPPKSVALHRVRWNMNTGSERWLCYGGEAGIIRCQEIVLSDFD
        VDSGSEDTLMSMKKKNQTQSKCKKKEVDSQALECSDEPNDAQTKTDELPGSGDNFETFPPKSVALHRVRWNMNTGSERWLCYGGEAGIIRCQEIVLSDFD
Subjt:  VDSGSEDTLMSMKKKNQTQSKCKKKEVDSQALECSDEPNDAQTKTDELPGSGDNFETFPPKSVALHRVRWNMNTGSERWLCYGGEAGIIRCQEIVLSDFD

Query:  KKLMRKK
        KKLMRKK
Subjt:  KKLMRKK

XP_022144682.1 uncharacterized protein LOC111014310 isoform X2 [Momordica charantia]0.0e+0097.02Show/hide
Query:  MYVGGPVWALDWCPQVLEKTDALIKCEFIAVSAHPPGSSYHKMGTPLIGRGMVQIWCLVHGTENHEPEPAYATKCKSKPKKDEVSSDLSSQPKRPRGRPP
        MYVGGPVWALDWCPQVLEKTDALIKCEFIAVSAHPPGSSYHKMGTPLIGRGMVQIWCLVHGTENHEPEPAYATKCKSKPKKDEVSSDLSSQPKRPRGRPP
Subjt:  MYVGGPVWALDWCPQVLEKTDALIKCEFIAVSAHPPGSSYHKMGTPLIGRGMVQIWCLVHGTENHEPEPAYATKCKSKPKKDEVSSDLSSQPKRPRGRPP

Query:  GTKKKGASDLPSQPKRPRGRPKKKQEGSNDNMGDNNQIVQSLSVEYPAGSSNLLEIDGDPKNSEELLLLGNSVERQKSTLQAVSTCNSKDEGPAQKRRVR
        GTKKKGASDLPSQPKRPRGRPKKKQEGSNDNMGDNNQIVQSLSVEYPAGSSNLLEIDGDPKNSEELLLLGNSVERQKSTLQAVSTCNSKDEGPAQKRRVR
Subjt:  GTKKKGASDLPSQPKRPRGRPKKKQEGSNDNMGDNNQIVQSLSVEYPAGSSNLLEIDGDPKNSEELLLLGNSVERQKSTLQAVSTCNSKDEGPAQKRRVR

Query:  RKVGTKNHIDDMGTLPFTVNREDGSSTISFQENENVISEYSGEDTLLCNNISKNAEFSIPESVALPRVVLCLAHNGKVAWDLKWKPSNACTTNCKHRMGY
        RKVGTKNHIDDMGTLPFTVNREDGSSTISFQENENVISEYSGEDTLLCNNISKNAEFSIPESVALPRVVLCLAHNGKVAWDLKWKPSNACTTNCKHRMGY
Subjt:  RKVGTKNHIDDMGTLPFTVNREDGSSTISFQENENVISEYSGEDTLLCNNISKNAEFSIPESVALPRVVLCLAHNGKVAWDLKWKPSNACTTNCKHRMGY

Query:  LAVLLGNGSLEVWEIPFPHVVKAIYSKFNREGTDPRFVKLKPIFRSTMLKSANIQSIPLTVEWSSTPPYDYLFAGCNDGTVALWKFSANSTCEGC-----
        LAVLLGNGSLEVWEIPFPHVVKAIYSKFNREGTDPRFVKLKPIFRSTMLKSANIQSIPLTVEWSSTPPYDYLFAGCNDGTVALWKFSANSTCE       
Subjt:  LAVLLGNGSLEVWEIPFPHVVKAIYSKFNREGTDPRFVKLKPIFRSTMLKSANIQSIPLTVEWSSTPPYDYLFAGCNDGTVALWKFSANSTCEGC-----

Query:  FS-----------CPN-FDPESANVVLTASHGGLKFWDLRDPFRPLWDIHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLLKAAYDVPVTGKPFTGTK
        FS            PN  DPESANVVLTASHGGLKFWDLRDPFRPLWDIHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLLKAAYDVPVTGKPFTGTK
Subjt:  FS-----------CPN-FDPESANVVLTASHGGLKFWDLRDPFRPLWDIHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLLKAAYDVPVTGKPFTGTK

Query:  QQGLHSYYGSSFAIWSVQVSRQTGMVAYCSADGAVLRFQLTTRAVEKDHSRNRTPHFICEYLTEEESAITIHSPASGVPFPLKKASNKSDLPLSFRAILS
        QQGLHSYYGSSFAIWSVQVSRQTGMVAYCSADGAVLRFQLTTRAVEKDHSRNRTPHFICEYLTEEESAITIHSPASGVPFPLKKASNKSDLPLSFRAILS
Subjt:  QQGLHSYYGSSFAIWSVQVSRQTGMVAYCSADGAVLRFQLTTRAVEKDHSRNRTPHFICEYLTEEESAITIHSPASGVPFPLKKASNKSDLPLSFRAILS

Query:  DSIESNEGNHKTATASASENEALAIYHDNDVSVDSGSEDTLMSMKKKNQTQSKCKKKEVDSQALECSDEPNDAQTKTDELPGSGDNFETFPPKSVALHRV
        DSIESNEGNHKTATA+ASENEALAIYHDNDVSVDSGSEDTLMSMKKKNQTQSKCKKKEVDSQALECSDEPNDAQTKTDELPGSGDNFETFPPKSVALHRV
Subjt:  DSIESNEGNHKTATASASENEALAIYHDNDVSVDSGSEDTLMSMKKKNQTQSKCKKKEVDSQALECSDEPNDAQTKTDELPGSGDNFETFPPKSVALHRV

Query:  RWNMNTGSERWLCYGGEAGIIRCQEIVLSDFDKKLMRKK
        RWNMNTGSERWLCYGGEAGIIRCQEIVLSDFDKKLMRKK
Subjt:  RWNMNTGSERWLCYGGEAGIIRCQEIVLSDFDKKLMRKK

XP_023528187.1 uncharacterized protein LOC111791176 [Cucurbita pepo subsp. pepo]0.0e+0071.76Show/hide
Query:  PPVPAASTGTGGKRGKRKQPVARQKKEAPGRAKKKPGGASADEEQPTGRLDGVGIKVLEFDHCAENHFRAMDTIAELCGEAEDGDGGIDESDIQRFSSSA
        P    AS GT  K+GK+K   +   +E   RAKKK G  S +E QPTGRLD   +KV EFDHC ENHFRA+D IAEL GEAE+G+GG+DESD QRFSSS 
Subjt:  PPVPAASTGTGGKRGKRKQPVARQKKEAPGRAKKKPGGASADEEQPTGRLDGVGIKVLEFDHCAENHFRAMDTIAELCGEAEDGDGGIDESDIQRFSSSA

Query:  FFLREWRFYNYEPKTVKFASDLRGSEGKDGDITINLPQFSSAAVLKNGTPSGAATSLDWRNFVMYVGGPVWALDWCPQVLEKTDALIKCEFIAVSAHPPG
         FLREW+FYNYEPKTVKF SD R  EGKD DIT+ LPQFSSAAVLKNG P GA TSLD+RNF+M+VGGPVWA+DWCP V E+TD+LIKCEFIAVSAHPPG
Subjt:  FFLREWRFYNYEPKTVKFASDLRGSEGKDGDITINLPQFSSAAVLKNGTPSGAATSLDWRNFVMYVGGPVWALDWCPQVLEKTDALIKCEFIAVSAHPPG

Query:  SSYHKMGTPLIGRGMVQIWCLVHGTENHEPEPAYATKCKSKPKKDEVSSDLSSQPKRPRGRPPGTKKKGASDLPSQPKRPRGRPKKKQEGSN-DNMGDNN
        SSYH MG PL GRGMVQIWCLVHGTE+HE E   AT+CK         SDL SQPKRPRGRPPG KK GAS LPSQPKRPRGRPKKKQE  N DN   + 
Subjt:  SSYHKMGTPLIGRGMVQIWCLVHGTENHEPEPAYATKCKSKPKKDEVSSDLSSQPKRPRGRPPGTKKKGASDLPSQPKRPRGRPKKKQEGSN-DNMGDNN

Query:  QIVQSLSVEYPAGSSNLLEIDGDPKNSEELLLLGNSVERQKSTLQAVSTCNSKDEGPAQKRRVRRKVGTKNHIDDMGTLPFTVNREDGSSTISFQENENV
        Q+VQ LSVEYP  SSNLLEID  P NSE+ + L NSVER  ST++ +STCNS+DE P QKRRVRR   TKNH+DD+GTL    NREDG +  + + NENV
Subjt:  QIVQSLSVEYPAGSSNLLEIDGDPKNSEELLLLGNSVERQKSTLQAVSTCNSKDEGPAQKRRVRRKVGTKNHIDDMGTLPFTVNREDGSSTISFQENENV

Query:  ISEYSGEDTLLCNNISKNA-------EFSIPESVALPRVVLCLAHNGKVAWDLKWKPSNACTTNCKHRMGYLAVLLGNGSLEVWEIPFPHVVKAIYSKFN
         SEYSGEDTLLC NIS+NA        FSIPESVALPR+VLCLAHNGKVAWDLKWKP+NA TT CK RMGYLAVLLGNGSLEVWE+PFPHVVKAIYSK N
Subjt:  ISEYSGEDTLLCNNISKNA-------EFSIPESVALPRVVLCLAHNGKVAWDLKWKPSNACTTNCKHRMGYLAVLLGNGSLEVWEIPFPHVVKAIYSKFN

Query:  REGTDPRFVKLKPIFRSTMLKSANIQSIPLTVEWSSTPPYDYLFAGCNDGTVALWKFSANSTCEGCFSCPNF-----------------DPESANVVLTA
         EGTDPRFV+LKP FR +ML+SA+ QSIPLTVEWS TPPYDYL AGC+DGTVALWKFSANST E       F                 +PES NV+L A
Subjt:  REGTDPRFVKLKPIFRSTMLKSANIQSIPLTVEWSSTPPYDYLFAGCNDGTVALWKFSANSTCEGCFSCPNF-----------------DPESANVVLTA

Query:  SHGGLKFWDLRDPFRPLWDIHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLLKAAYDVPVTGKPFTGTKQQGLHSYYGSSFAIWSVQVSRQTGMVAYC
        SHGG+KFWDLRDPFRPLWD+HPAPR+IYSLDWLP+PRCV LSFDDGTLRLLSLLKAAYDVPVTG+PFT  KQ+GLH+Y  S FAIWS+QVSRQTGMVAYC
Subjt:  SHGGLKFWDLRDPFRPLWDIHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLLKAAYDVPVTGKPFTGTKQQGLHSYYGSSFAIWSVQVSRQTGMVAYC

Query:  SADGAVLRFQLTTRAVEKDHSRNRTPHFICEYLTEEESAITIHSPASGVPFPLKKASNKSDLPLSFRAILSDSIESNEGNHKTATASASENEALAIYHDN
         ADGAV+RFQLTT+AV+K++SRNRTPHF+CEYLTEE+S ITIHSPAS VP PLKK +NKS+ PLS RAILSDS++ NEGN K+AT SA ENE+ A+ +D+
Subjt:  SADGAVLRFQLTTRAVEKDHSRNRTPHFICEYLTEEESAITIHSPASGVPFPLKKASNKSDLPLSFRAILSDSIESNEGNHKTATASASENEALAIYHDN

Query:  DVSVDSGSEDTLMSMKKKNQTQSKCKKKEVDSQALECSDEPNDAQTKTDELPGSGDNFETFPPKSVALHRVRWNMNTGSERWLCYGGEAGIIRCQEIVLS
        DV V+SGSEDT MS++ KNQTQSK KKK V +Q LE S EP+D+QT  D +PGSGD+FE FPPKSVALHR+RWNMN GSERWLCYGG AGI+RCQEIVLS
Subjt:  DVSVDSGSEDTLMSMKKKNQTQSKCKKKEVDSQALECSDEPNDAQTKTDELPGSGDNFETFPPKSVALHRVRWNMNTGSERWLCYGGEAGIIRCQEIVLS

Query:  DFDKKLMRKK
          DKKLM KK
Subjt:  DFDKKLMRKK

XP_038903194.1 uncharacterized protein LOC120089853 [Benincasa hispida]0.0e+0071.88Show/hide
Query:  PVPAASTGTGGKRGKRKQPVARQKKEAPGRAKKKPGG----ASADEEQPTGRLDGVGIKVLEFDHCAENHFRAMDTIAELCGEAEDGDGGIDESDIQRFS
        P P  S GT  K+GK+K P AR+KK++   A+ KPG      S ++ QPTGRLDG  +KV EFDHC ENHF AMDTI ELC EAE  DGGIDESDIQRF+
Subjt:  PVPAASTGTGGKRGKRKQPVARQKKEAPGRAKKKPGG----ASADEEQPTGRLDGVGIKVLEFDHCAENHFRAMDTIAELCGEAEDGDGGIDESDIQRFS

Query:  SSAFFLREWRFYNYEPKTVKFASDLRGSEGKDGDITINLPQFSSAAVLKNGTPSGAATSLDWRNFVMYVGGPVWALDWCPQVLEKTDALIKCEFIAVSAH
        SS  FLREWRFYNYEPK +KFASD RG EGKD DITI LPQFSSAAVLKNG P GA TSLD+RNF M+VGGPVWALDWCPQV E+TD+LIKCEFIAVSAH
Subjt:  SSAFFLREWRFYNYEPKTVKFASDLRGSEGKDGDITINLPQFSSAAVLKNGTPSGAATSLDWRNFVMYVGGPVWALDWCPQVLEKTDALIKCEFIAVSAH

Query:  PPGSSYHKMGTPLIGRGMVQIWCLVHGTENHEPEPAYATKCKSKPKKDEVSSDLSSQPKRPRGRPPGTKKKGASDLPSQPKRPRGRPKKKQEGSNDNMGD
        PPGSSYHKMG PL GRGMVQIWC VHGTE++EP              +E  +DLSSQPKRPRGRP G KK GAS LP QPKRPRGRPKKKQE SND  GD
Subjt:  PPGSSYHKMGTPLIGRGMVQIWCLVHGTENHEPEPAYATKCKSKPKKDEVSSDLSSQPKRPRGRPPGTKKKGASDLPSQPKRPRGRPKKKQEGSNDNMGD

Query:  NNQIVQSLSVEYPAGSSNLLEIDGDPKNSEELLLLGNSVERQKSTLQAVSTCNSKDEGPAQKRRVRRKVGTKNHIDDMGTLPFTVNREDGSSTISFQENE
        +  +VQ+ S+E P GSSNLLE+DG PKNSE ++LL NSVER++STLQ VSTCNS+DE PAQKRRVRRK   KNH+ D+G L  T NREDGS+ IS + NE
Subjt:  NNQIVQSLSVEYPAGSSNLLEIDGDPKNSEELLLLGNSVERQKSTLQAVSTCNSKDEGPAQKRRVRRKVGTKNHIDDMGTLPFTVNREDGSSTISFQENE

Query:  NVISEYSGEDTLLCNNISKNA-------EFSIPESVALPRVVLCLAHNGKVAWDLKWKPSNACTTNCKHRMGYLAVLLGNGSLEVWEIPFPHVVKAIYSK
        NV+ EYSGED LLC NIS NA       EFSIPESVALPRVVLCLAHNGKVAWDLKWKP+NA T NCK RMGYLAVLLGNGSLEVWE+PFPH VKAIYSK
Subjt:  NVISEYSGEDTLLCNNISKNA-------EFSIPESVALPRVVLCLAHNGKVAWDLKWKPSNACTTNCKHRMGYLAVLLGNGSLEVWEIPFPHVVKAIYSK

Query:  FNREGTDPRFVKLKPIFRSTMLKSANIQSIPLTVEWSSTPPYDYLFAGCNDGTVALWKFSANSTCEGCFSCPNFD-----------------PESANVVL
        FN EGTDPRFVKLKPIFR +ML++AN QSIPLTVEWS TPPYDYL AGC+DGTVALWKFSANS+CE       F                   ESANV+L
Subjt:  FNREGTDPRFVKLKPIFRSTMLKSANIQSIPLTVEWSSTPPYDYLFAGCNDGTVALWKFSANSTCEGCFSCPNFD-----------------PESANVVL

Query:  TASHGGLKFWDLRDPFRPLWDIHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLLKAAYDVPVTGKPFTGTKQQGLHSYYGSSFAIWSVQVSRQTGMVA
        TA HGGLKFWDLRDPFRPLWD+HPAPR+IYSLDWLP+PRCV LSFDDGTLRLLSLLKAAYDVPVTG+PFT  KQ+GLH+Y+ SS+AIWS+QVSRQTGMVA
Subjt:  TASHGGLKFWDLRDPFRPLWDIHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLLKAAYDVPVTGKPFTGTKQQGLHSYYGSSFAIWSVQVSRQTGMVA

Query:  YCSADGAVLRFQLTTRAVEKDHSRNRTPHFICEYLTEEESAITIHSPASGVPFPLKKASNKSDLPLSFRAILSDSIESNEGNHKTATASASENEALAIYH
        YC ADGAV+RFQLTT+A +K++SR+RTPH++CEYLTEEES ITIHSP + +PF LKK SNKS+ PLS RAILSDS++SNEGNHKTATA A ENE+ A+  
Subjt:  YCSADGAVLRFQLTTRAVEKDHSRNRTPHFICEYLTEEESAITIHSPASGVPFPLKKASNKSDLPLSFRAILSDSIESNEGNHKTATASASENEALAIYH

Query:  DNDVSVDSGSEDTLMSMKKKNQTQSKCKKKEVDSQALECSDEPN---------DAQTKTDELPGSGDNFETFPPKSVALHRVRWNMNTGSERWLCYGGEA
        D DV V+SG EDTLMS+KKKN+TQSKC KK V++Q L+CSDEPN         D QT    +PGS D FE+ PPKSVA+HRVRWNMN GSERWLCYGG A
Subjt:  DNDVSVDSGSEDTLMSMKKKNQTQSKCKKKEVDSQALECSDEPN---------DAQTKTDELPGSGDNFETFPPKSVALHRVRWNMNTGSERWLCYGGEA

Query:  GIIRCQEIVLSDFDKKLMRKK
        GI+RCQEIVLS  D KLM+KK
Subjt:  GIIRCQEIVLSDFDKKLMRKK

TrEMBL top hitse value%identityAlignment
A0A0A0LGM2 Uncharacterized protein0.0e+0068.29Show/hide
Query:  EVVEPPVPAASTGTGGKRGKRKQPVARQKKEAPGRAKKK----------PGGASADEEQPTGRLDGV--GIKVLEFDHCAENHFRAMDTIAELCGEAEDG
        E  EP   A  T   GK    K+P A++KKE   RAKKK                ++ Q T RLD V   +KV EFD C ENHFRAMD I ELC EAEDG
Subjt:  EVVEPPVPAASTGTGGKRGKRKQPVARQKKEAPGRAKKK----------PGGASADEEQPTGRLDGV--GIKVLEFDHCAENHFRAMDTIAELCGEAEDG

Query:  DGGIDESDIQRFSSSAFFLREWRFYNYEPKTVKFASDLRGSEGKDGDITINLPQFSSAAVLKNGTPSGAATSLDWRNFVMYVGGPVWALDWCPQVLEKTD
        DGGIDESDIQRFSSS  FLREWRFYNYEPKT+KFA+D RG EGKD DITI+LPQFSSAAVLK G P GA+TSLD+RNF M+VGGPVWA+DWCPQV E+T+
Subjt:  DGGIDESDIQRFSSSAFFLREWRFYNYEPKTVKFASDLRGSEGKDGDITINLPQFSSAAVLKNGTPSGAATSLDWRNFVMYVGGPVWALDWCPQVLEKTD

Query:  ALIKCEFIAVSAHPPGSSYHKMGTPLIGRGMVQIWCLVHGTENHEPEPAYATKCKSKPKKDEVSSDLSSQPKRPRGRPPGTKKKGASDLPSQPKRPRGRP
        +LIKCEFIAVSAHPPGSSYHKMG PL GRGMVQIWCLVHGTE++EP               E  SDLSSQPKRPRGRPPG K+KGAS LPSQPKRPRGRP
Subjt:  ALIKCEFIAVSAHPPGSSYHKMGTPLIGRGMVQIWCLVHGTENHEPEPAYATKCKSKPKKDEVSSDLSSQPKRPRGRPPGTKKKGASDLPSQPKRPRGRP

Query:  KKKQEGSND-NMGDNNQIVQSLSVEYPAGSSNLLEIDGDPKNSEELLLLGNSVERQKSTLQAVSTCNSKDEGPAQKRRVRRKVGTKNHIDDMGTLPFTVN
        KK+Q+ SND   GDN Q+VQ  S+E P GSSNLLEIDG PKN+E  +LL N+VER+ STLQ VSTC+S+DE PA+KRRVRRKV  +N +DD+G L     
Subjt:  KKKQEGSND-NMGDNNQIVQSLSVEYPAGSSNLLEIDGDPKNSEELLLLGNSVERQKSTLQAVSTCNSKDEGPAQKRRVRRKVGTKNHIDDMGTLPFTVN

Query:  REDGSSTISFQENENVISEYSGEDTLLCNNISKNA-------EFSIPESVALPRVVLCLAHNGKVAWDLKWKPSNACTTNCKHRMGYLAVLLGNGSLEVW
        +EDGS   + + NENV SEYSGED LLC +IS+N        EFSIPESVALPRVVLCLAHNGKVAWDLKWKP NACT NCKHRMGYLAVLLGNGSLEVW
Subjt:  REDGSSTISFQENENVISEYSGEDTLLCNNISKNA-------EFSIPESVALPRVVLCLAHNGKVAWDLKWKPSNACTTNCKHRMGYLAVLLGNGSLEVW

Query:  EIPFPHVVKAIYSKFNREGTDPRFVKLKPIFRSTMLKSANIQSIPLTVEWSSTPPYDYLFAGCNDGTVALWKFSANSTCEGCFSCPNF------------
        E+PFPH VKAIYSKFN EGTDPRF+KLKPIFR + L++ N QSIPLTVEWS TPPYDYL AGC+DGTVALWKFSANS+CE       F            
Subjt:  EIPFPHVVKAIYSKFNREGTDPRFVKLKPIFRSTMLKSANIQSIPLTVEWSSTPPYDYLFAGCNDGTVALWKFSANSTCEGCFSCPNF------------

Query:  -----DPESANVVLTASHGGLKFWDLRDPFRPLWDIHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLLKAAYDVPVTGKPFTGTKQQGLHSYYGSSFA
             D ESANV+LTA HGGLKFWDLRDPFRPLWD+HPAPR+IYSLDWLP+PRCV LSFDDGTLRLLSLLKAA DVP TG+PFT  KQ+GLH+Y  SS+A
Subjt:  -----DPESANVVLTASHGGLKFWDLRDPFRPLWDIHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLLKAAYDVPVTGKPFTGTKQQGLHSYYGSSFA

Query:  IWSVQVSRQTGMVAYCSADGAVLRFQLTTRAVEKDHSRNRTPHFICEYLTEEESAITIHSPASGVPFPLKKASNKSDLPLSFRAILSDSIESNEGNHKTA
        IWS+QVSRQTGMVAYC ADGAV+RFQLTT+A +K++SR+RTPH++CEYLTEEES IT  SP   VP PLKK SNKS+ PLS RAILSDS++SNE   KTA
Subjt:  IWSVQVSRQTGMVAYCSADGAVLRFQLTTRAVEKDHSRNRTPHFICEYLTEEESAITIHSPASGVPFPLKKASNKSDLPLSFRAILSDSIESNEGNHKTA

Query:  TASASENEALAIYHDNDVSVDSGSEDTLMSMKKKNQTQSKCKKKEVDSQALECSDEPND-----------------AQTKTDELPGSGDNFETFPPKSVA
        TAS  ENEA  I  D DV V+SGSEDTL   KKKN+TQ KC K+ V+   LECSDEP D                 AQ   D LP SGD+FE  PPKSVA
Subjt:  TASASENEALAIYHDNDVSVDSGSEDTLMSMKKKNQTQSKCKKKEVDSQALECSDEPND-----------------AQTKTDELPGSGDNFETFPPKSVA

Query:  LHRVRWNMNTGSERWLCYGGEAGIIRCQEIVLSDFDKKLMRKK
        +HRVRWNMN GSE WLCYGG AGI+RC+EIVLS  D KLM+KK
Subjt:  LHRVRWNMNTGSERWLCYGGEAGIIRCQEIVLSDFDKKLMRKK

A0A5D3DPQ1 DNA binding protein, putative isoform 10.0e+0067.69Show/hide
Query:  EVVEPPVPAASTGTGGKRGKRKQPVARQKKEAPGRAKKKPGGA------------SADEEQPTGRLDGV--GIKVLEFDHCAENHFRAMDTIAELCGEAE
        E  EP   A  T   GK    K+P A++KKE   RAKKK   A            S +E Q T RL+ V   +KV EFD C ENHFRAMD I ELC EAE
Subjt:  EVVEPPVPAASTGTGGKRGKRKQPVARQKKEAPGRAKKKPGGA------------SADEEQPTGRLDGV--GIKVLEFDHCAENHFRAMDTIAELCGEAE

Query:  DGDGGIDESDIQRFSSSAFFLREWRFYNYEPKTVKFASDLRGSEGKDGDITINLPQFSSAAVLKNGTPSGAATSLDWRNFVMYVGGPVWALDWCPQVLEK
        +GDGGIDESDIQRFSSS  FLREWRFYNYE KT+KFA+D  G EGKD DITINLPQFSSAAVLK G P GA+TSLD+RNF M+VGGPVWA+DWCPQV  +
Subjt:  DGDGGIDESDIQRFSSSAFFLREWRFYNYEPKTVKFASDLRGSEGKDGDITINLPQFSSAAVLKNGTPSGAATSLDWRNFVMYVGGPVWALDWCPQVLEK

Query:  TDALIKCEFIAVSAHPPGSSYHKMGTPLIGRGMVQIWCLVHGTENHEPEPAYATKCKSKPKKDEVSSDLSSQPKRPRGRPPGTKKKGASDLPSQPKRPRG
        T++LIKCEFIAVSAHPPGSSYHKMG PL GRGMVQIWCLVHGTEN+EP               E  SDLSSQPK+PRGRPPG KKK AS LPS PKRPRG
Subjt:  TDALIKCEFIAVSAHPPGSSYHKMGTPLIGRGMVQIWCLVHGTENHEPEPAYATKCKSKPKKDEVSSDLSSQPKRPRGRPPGTKKKGASDLPSQPKRPRG

Query:  RPKKKQEGSNDNMGDNNQIVQSLSVEYPAGSSNLLEIDGDPKNSEELLLLGNSVERQKSTLQAVSTCNSKDEGPAQKRRVRRKVGTKNHIDDMGTLPFTV
        RPKK+Q+ S D  GDN Q+VQ  S+E P GSS+LLEIDG PKN+E  +LL N+VER++STLQ VSTCNS+DE PA+KRRVRRKV ++N +DD+G    T 
Subjt:  RPKKKQEGSNDNMGDNNQIVQSLSVEYPAGSSNLLEIDGDPKNSEELLLLGNSVERQKSTLQAVSTCNSKDEGPAQKRRVRRKVGTKNHIDDMGTLPFTV

Query:  NREDGSSTISFQENENVISEYSGEDTLLCNNISKNA-------EFSIPESVALPRVVLCLAHNGKVAWDLKWKPSNACTTNCKHRMGYLAVLLGNGSLEV
         +EDGS   + + +ENV SEYSGED LLC +IS+N        EFSIPESVALPRVVLCLAHNGKVAWDLKWKP NACT NCKHRMGYLAVLLGNGSLEV
Subjt:  NREDGSSTISFQENENVISEYSGEDTLLCNNISKNA-------EFSIPESVALPRVVLCLAHNGKVAWDLKWKPSNACTTNCKHRMGYLAVLLGNGSLEV

Query:  WEIPFPHVVKAIYSKFNREGTDPRFVKLKPIFRSTMLKSANIQSIPLTVEWSSTPPYDYLFAGCNDGTVALWKFSANSTCEGCFSCPNFDP---------
        WE+PFPH VK IYSKFN EGTDPRFVKLKPIFR + L++AN QSIPLTVEWS  PPYDYL AGC+DGTVALWKFSANS+CE       F           
Subjt:  WEIPFPHVVKAIYSKFNREGTDPRFVKLKPIFRSTMLKSANIQSIPLTVEWSSTPPYDYLFAGCNDGTVALWKFSANSTCEGCFSCPNFDP---------

Query:  --------ESANVVLTASHGGLKFWDLRDPFRPLWDIHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLLKAAYDVPVTGKPFTGTKQQGLHSYYGSSF
                ESANV+LTA HGGLKFWDLRDPFRPLWD+HPAPR+IYSLDWLP+PRCV LSFDDGTLRLLSLLKAA DVP TG+PFT  KQ+GLH+Y  SS+
Subjt:  --------ESANVVLTASHGGLKFWDLRDPFRPLWDIHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLLKAAYDVPVTGKPFTGTKQQGLHSYYGSSF

Query:  AIWSVQVSRQTGMVAYCSADGAVLRFQLTTRAVEKDHSRNRTPHFICEYLTEEESAITIHSPASGVPFPLKKASNKSDLPLSFRAILSDSIESNEGNHKT
        AIWS+QVSRQTGMVAYC ADGAV+RFQLTT+A +K++SR+RTPH++CEYLTEEES IT  SP   VP PLKK SNKS+ PLS RAILSDS++SNEGNHKT
Subjt:  AIWSVQVSRQTGMVAYCSADGAVLRFQLTTRAVEKDHSRNRTPHFICEYLTEEESAITIHSPASGVPFPLKKASNKSDLPLSFRAILSDSIESNEGNHKT

Query:  ATASASENEALAIYHDNDVSVDSGSEDTLMSMKKKNQTQSKCKKKEVDSQALECSDEPND-----------------AQTKTDELPGSGDNFETFPPKSV
        ATAS  ENEA +I  D DV V+SGSEDT +S KKKN+TQ KCKKK V++  LEC+ EP D                 A+   D +P SGD+FE  PPKSV
Subjt:  ATASASENEALAIYHDNDVSVDSGSEDTLMSMKKKNQTQSKCKKKEVDSQALECSDEPND-----------------AQTKTDELPGSGDNFETFPPKSV

Query:  ALHRVRWNMNTGSERWLCYGGEAGIIRCQEIVLSDFDKKLMRKK
        A+HRVRWNMN GSE+WLCYGG +GI+RCQE+VLS  D KLM+KK
Subjt:  ALHRVRWNMNTGSERWLCYGGEAGIIRCQEIVLSDFDKKLMRKK

A0A6J1CU50 uncharacterized protein LOC111014310 isoform X10.0e+0097.46Show/hide
Query:  VEVVEPPVPAASTGTGGKRGKRKQPVARQKKEAPGRAKKKPGGASADEEQPTGRLDGVGIKVLEFDHCAENHFRAMDTIAELCGEAEDGDGGIDESDIQR
        VEVVEPPVPAAST TGGKRGKRKQPVARQKKEAPGRAKKKPGGASADEEQPTGRLDGVGIKVLEFDHCAENHFRAMDTIAELCGEAEDGDGGIDESDIQR
Subjt:  VEVVEPPVPAASTGTGGKRGKRKQPVARQKKEAPGRAKKKPGGASADEEQPTGRLDGVGIKVLEFDHCAENHFRAMDTIAELCGEAEDGDGGIDESDIQR

Query:  FSSSAFFLREWRFYNYEPKTVKFASDLRGSEGKDGDITINLPQFSSAAVLKNGTPSGAATSLDWRNFVMYVGGPVWALDWCPQVLEKTDALIKCEFIAVS
        FSSSAFFLREWRFYNYEPKTVKFASDLRGSEGKDGDITINLPQFSSAAVLKNGTPSGAATSLDWRNFVMYVGGPVWALDWCPQVLEKTDALIKCEFIAVS
Subjt:  FSSSAFFLREWRFYNYEPKTVKFASDLRGSEGKDGDITINLPQFSSAAVLKNGTPSGAATSLDWRNFVMYVGGPVWALDWCPQVLEKTDALIKCEFIAVS

Query:  AHPPGSSYHKMGTPLIGRGMVQIWCLVHGTENHEPEPAYATKCKSKPKKDEVSSDLSSQPKRPRGRPPGTKKKGASDLPSQPKRPRGRPKKKQEGSNDNM
        AHPPGSSYHKMGTPLIGRGMVQIWCLVHGTENHEPEPAYATKCKSKPKKDEVSSDLSSQPKRPRGRPPGTKKKGASDLPSQPKRPRGRPKKKQEGSNDNM
Subjt:  AHPPGSSYHKMGTPLIGRGMVQIWCLVHGTENHEPEPAYATKCKSKPKKDEVSSDLSSQPKRPRGRPPGTKKKGASDLPSQPKRPRGRPKKKQEGSNDNM

Query:  GDNNQIVQSLSVEYPAGSSNLLEIDGDPKNSEELLLLGNSVERQKSTLQAVSTCNSKDEGPAQKRRVRRKVGTKNHIDDMGTLPFTVNREDGSSTISFQE
        GDNNQIVQSLSVEYPAGSSNLLEIDGDPKNSEELLLLGNSVERQKSTLQAVSTCNSKDEGPAQKRRVRRKVGTKNHIDDMGTLPFTVNREDGSSTISFQE
Subjt:  GDNNQIVQSLSVEYPAGSSNLLEIDGDPKNSEELLLLGNSVERQKSTLQAVSTCNSKDEGPAQKRRVRRKVGTKNHIDDMGTLPFTVNREDGSSTISFQE

Query:  NENVISEYSGEDTLLCNNISKNAEFSIPESVALPRVVLCLAHNGKVAWDLKWKPSNACTTNCKHRMGYLAVLLGNGSLEVWEIPFPHVVKAIYSKFNREG
        NENVISEYSGEDTLLCNNISKNAEFSIPESVALPRVVLCLAHNGKVAWDLKWKPSNACTTNCKHRMGYLAVLLGNGSLEVWEIPFPHVVKAIYSKFNREG
Subjt:  NENVISEYSGEDTLLCNNISKNAEFSIPESVALPRVVLCLAHNGKVAWDLKWKPSNACTTNCKHRMGYLAVLLGNGSLEVWEIPFPHVVKAIYSKFNREG

Query:  TDPRFVKLKPIFRSTMLKSANIQSIPLTVEWSSTPPYDYLFAGCNDGTVALWKFSANSTCEGC-----FS-----------CPN-FDPESANVVLTASHG
        TDPRFVKLKPIFRSTMLKSANIQSIPLTVEWSSTPPYDYLFAGCNDGTVALWKFSANSTCE       FS            PN  DPESANVVLTASHG
Subjt:  TDPRFVKLKPIFRSTMLKSANIQSIPLTVEWSSTPPYDYLFAGCNDGTVALWKFSANSTCEGC-----FS-----------CPN-FDPESANVVLTASHG

Query:  GLKFWDLRDPFRPLWDIHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLLKAAYDVPVTGKPFTGTKQQGLHSYYGSSFAIWSVQVSRQTGMVAYCSAD
        GLKFWDLRDPFRPLWDIHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLLKAAYDVPVTGKPFTGTKQQGLHSYYGSSFAIWSVQVSRQTGMVAYCSAD
Subjt:  GLKFWDLRDPFRPLWDIHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLLKAAYDVPVTGKPFTGTKQQGLHSYYGSSFAIWSVQVSRQTGMVAYCSAD

Query:  GAVLRFQLTTRAVEKDHSRNRTPHFICEYLTEEESAITIHSPASGVPFPLKKASNKSDLPLSFRAILSDSIESNEGNHKTATASASENEALAIYHDNDVS
        GAVLRFQLTTRAVEKDHSRNRTPHFICEYLTEEESAITIHSPASGVPFPLKKASNKSDLPLSFRAILSDSIESNEGNHKTATA+ASENEALAIYHDNDVS
Subjt:  GAVLRFQLTTRAVEKDHSRNRTPHFICEYLTEEESAITIHSPASGVPFPLKKASNKSDLPLSFRAILSDSIESNEGNHKTATASASENEALAIYHDNDVS

Query:  VDSGSEDTLMSMKKKNQTQSKCKKKEVDSQALECSDEPNDAQTKTDELPGSGDNFETFPPKSVALHRVRWNMNTGSERWLCYGGEAGIIRCQEIVLSDFD
        VDSGSEDTLMSMKKKNQTQSKCKKKEVDSQALECSDEPNDAQTKTDELPGSGDNFETFPPKSVALHRVRWNMNTGSERWLCYGGEAGIIRCQEIVLSDFD
Subjt:  VDSGSEDTLMSMKKKNQTQSKCKKKEVDSQALECSDEPNDAQTKTDELPGSGDNFETFPPKSVALHRVRWNMNTGSERWLCYGGEAGIIRCQEIVLSDFD

Query:  KKLMRKK
        KKLMRKK
Subjt:  KKLMRKK

A0A6J1CUD2 uncharacterized protein LOC111014310 isoform X20.0e+0097.02Show/hide
Query:  MYVGGPVWALDWCPQVLEKTDALIKCEFIAVSAHPPGSSYHKMGTPLIGRGMVQIWCLVHGTENHEPEPAYATKCKSKPKKDEVSSDLSSQPKRPRGRPP
        MYVGGPVWALDWCPQVLEKTDALIKCEFIAVSAHPPGSSYHKMGTPLIGRGMVQIWCLVHGTENHEPEPAYATKCKSKPKKDEVSSDLSSQPKRPRGRPP
Subjt:  MYVGGPVWALDWCPQVLEKTDALIKCEFIAVSAHPPGSSYHKMGTPLIGRGMVQIWCLVHGTENHEPEPAYATKCKSKPKKDEVSSDLSSQPKRPRGRPP

Query:  GTKKKGASDLPSQPKRPRGRPKKKQEGSNDNMGDNNQIVQSLSVEYPAGSSNLLEIDGDPKNSEELLLLGNSVERQKSTLQAVSTCNSKDEGPAQKRRVR
        GTKKKGASDLPSQPKRPRGRPKKKQEGSNDNMGDNNQIVQSLSVEYPAGSSNLLEIDGDPKNSEELLLLGNSVERQKSTLQAVSTCNSKDEGPAQKRRVR
Subjt:  GTKKKGASDLPSQPKRPRGRPKKKQEGSNDNMGDNNQIVQSLSVEYPAGSSNLLEIDGDPKNSEELLLLGNSVERQKSTLQAVSTCNSKDEGPAQKRRVR

Query:  RKVGTKNHIDDMGTLPFTVNREDGSSTISFQENENVISEYSGEDTLLCNNISKNAEFSIPESVALPRVVLCLAHNGKVAWDLKWKPSNACTTNCKHRMGY
        RKVGTKNHIDDMGTLPFTVNREDGSSTISFQENENVISEYSGEDTLLCNNISKNAEFSIPESVALPRVVLCLAHNGKVAWDLKWKPSNACTTNCKHRMGY
Subjt:  RKVGTKNHIDDMGTLPFTVNREDGSSTISFQENENVISEYSGEDTLLCNNISKNAEFSIPESVALPRVVLCLAHNGKVAWDLKWKPSNACTTNCKHRMGY

Query:  LAVLLGNGSLEVWEIPFPHVVKAIYSKFNREGTDPRFVKLKPIFRSTMLKSANIQSIPLTVEWSSTPPYDYLFAGCNDGTVALWKFSANSTCEGC-----
        LAVLLGNGSLEVWEIPFPHVVKAIYSKFNREGTDPRFVKLKPIFRSTMLKSANIQSIPLTVEWSSTPPYDYLFAGCNDGTVALWKFSANSTCE       
Subjt:  LAVLLGNGSLEVWEIPFPHVVKAIYSKFNREGTDPRFVKLKPIFRSTMLKSANIQSIPLTVEWSSTPPYDYLFAGCNDGTVALWKFSANSTCEGC-----

Query:  FS-----------CPN-FDPESANVVLTASHGGLKFWDLRDPFRPLWDIHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLLKAAYDVPVTGKPFTGTK
        FS            PN  DPESANVVLTASHGGLKFWDLRDPFRPLWDIHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLLKAAYDVPVTGKPFTGTK
Subjt:  FS-----------CPN-FDPESANVVLTASHGGLKFWDLRDPFRPLWDIHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLLKAAYDVPVTGKPFTGTK

Query:  QQGLHSYYGSSFAIWSVQVSRQTGMVAYCSADGAVLRFQLTTRAVEKDHSRNRTPHFICEYLTEEESAITIHSPASGVPFPLKKASNKSDLPLSFRAILS
        QQGLHSYYGSSFAIWSVQVSRQTGMVAYCSADGAVLRFQLTTRAVEKDHSRNRTPHFICEYLTEEESAITIHSPASGVPFPLKKASNKSDLPLSFRAILS
Subjt:  QQGLHSYYGSSFAIWSVQVSRQTGMVAYCSADGAVLRFQLTTRAVEKDHSRNRTPHFICEYLTEEESAITIHSPASGVPFPLKKASNKSDLPLSFRAILS

Query:  DSIESNEGNHKTATASASENEALAIYHDNDVSVDSGSEDTLMSMKKKNQTQSKCKKKEVDSQALECSDEPNDAQTKTDELPGSGDNFETFPPKSVALHRV
        DSIESNEGNHKTATA+ASENEALAIYHDNDVSVDSGSEDTLMSMKKKNQTQSKCKKKEVDSQALECSDEPNDAQTKTDELPGSGDNFETFPPKSVALHRV
Subjt:  DSIESNEGNHKTATASASENEALAIYHDNDVSVDSGSEDTLMSMKKKNQTQSKCKKKEVDSQALECSDEPNDAQTKTDELPGSGDNFETFPPKSVALHRV

Query:  RWNMNTGSERWLCYGGEAGIIRCQEIVLSDFDKKLMRKK
        RWNMNTGSERWLCYGGEAGIIRCQEIVLSDFDKKLMRKK
Subjt:  RWNMNTGSERWLCYGGEAGIIRCQEIVLSDFDKKLMRKK

A0A6J1F7U5 uncharacterized protein LOC111441649 isoform X10.0e+0071.1Show/hide
Query:  PPVPAASTGTGGKRGKRKQPVARQKKEAPGRAKKKPGGASADEEQPTGRLDGVGIKVLEFDHCAENHFRAMDTIAELCGEAEDGDGGIDESDIQRFSSSA
        P    AS GT  K+GK+K   +   +E   RAKKK G  S +E QPTGRLD   +KV EFDHC ENHFRA+D IAEL GEAE+G+GG+DESD QRFSSS 
Subjt:  PPVPAASTGTGGKRGKRKQPVARQKKEAPGRAKKKPGGASADEEQPTGRLDGVGIKVLEFDHCAENHFRAMDTIAELCGEAEDGDGGIDESDIQRFSSSA

Query:  FFLREWRFYNYEPKTVKFASDLRGSEGKDGDITINLPQFSSAAVLKNGTPSGAATSLDWRNFVMYVGGPVWALDWCPQVLEKTDALIKCEFIAVSAHPPG
         FLREW+FYNYEPKTVKF SD R  EGKD DIT+ LPQFSSAAVLKNG P GA  SLD+RNF+M+VGGPVWA+DWCP V E+TD+LIKCEFIAVSAHPPG
Subjt:  FFLREWRFYNYEPKTVKFASDLRGSEGKDGDITINLPQFSSAAVLKNGTPSGAATSLDWRNFVMYVGGPVWALDWCPQVLEKTDALIKCEFIAVSAHPPG

Query:  SSYHKMGTPLIGRGMVQIWCLVHGTENHEPEPAYATKCKSKPKKDEVSSDLSSQPKRPRGRPPGTKKKGASDLPSQPKRPRGRPKKKQEGSN-DNMGDNN
        SSYH MG PL GRGMVQIWCLVHGTE+HE E   AT+CK         SDL SQPKRPRGRPPG KK GAS LPSQPKRPRGRPKKKQE  N DN   + 
Subjt:  SSYHKMGTPLIGRGMVQIWCLVHGTENHEPEPAYATKCKSKPKKDEVSSDLSSQPKRPRGRPPGTKKKGASDLPSQPKRPRGRPKKKQEGSN-DNMGDNN

Query:  QIVQSLSVEYPAGSSNLLEIDGDPKNSEELLLLGNSVERQKSTLQAVSTCNSKDEGPAQKRRVRRKVGTKNHIDDMGTLPFTVNREDGSSTISFQENENV
        Q+VQ LSVEYP  SSNLLEID    NSE+ + L NSVER  ST++ +STCNS+DE P QKRRVRR   TKNH+DD+GTL    NREDGS+  + + NENV
Subjt:  QIVQSLSVEYPAGSSNLLEIDGDPKNSEELLLLGNSVERQKSTLQAVSTCNSKDEGPAQKRRVRRKVGTKNHIDDMGTLPFTVNREDGSSTISFQENENV

Query:  ISEYSGEDTLLCNNISKNA-------EFSIPESVALPRVVLCLAHNGKVAWDLKWKPSNACTTNCKHRMGYLAVLLGNGSLEVWEIPFPHVVKAIYSKFN
         SEYSGEDT LC NIS+ A        FSIPE+VALPR+VLCLAHNGKVAWDLKWKP+NA TT CK RMGYLAVLLGNGSLEVWE+PFPHVVKAIYSK N
Subjt:  ISEYSGEDTLLCNNISKNA-------EFSIPESVALPRVVLCLAHNGKVAWDLKWKPSNACTTNCKHRMGYLAVLLGNGSLEVWEIPFPHVVKAIYSKFN

Query:  REGTDPRFVKLKPIFRSTMLKSANIQSIPLTVEWSSTPPYDYLFAGCNDGTVALWKFSANSTCEGCFSCPNF-----------------DPESANVVLTA
         EGTDPRFVKLKP FR +ML+SA+ QSIPLTVEWS TPPYDYL AGC+DGTVALWKFSA+ST E       F                 +PES NV+L A
Subjt:  REGTDPRFVKLKPIFRSTMLKSANIQSIPLTVEWSSTPPYDYLFAGCNDGTVALWKFSANSTCEGCFSCPNF-----------------DPESANVVLTA

Query:  SHGGLKFWDLRDPFRPLWDIHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLLKAAYDVPVTGKPFTGTKQQGLHSYYGSSFAIWSVQVSRQTGMVAYC
        SHGG+KFWDLRDPFRPLWD+HPAPR+IYSLDWLP+PRCV LSFDDGTLRLLSLLKAAYDVPVTG+PFT  KQ+GLH+Y  S FAIWS+QVSRQTGMVAYC
Subjt:  SHGGLKFWDLRDPFRPLWDIHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLLKAAYDVPVTGKPFTGTKQQGLHSYYGSSFAIWSVQVSRQTGMVAYC

Query:  SADGAVLRFQLTTRAVEKDHSRNRTPHFICEYLTEEESAITIHSPASGVPFPLKKASNKSDLPLSFRAILSDSIESNEGNHKTATASASENEALAIYHDN
         ADGAV+RFQLTT+AV+K++SRNRTPHF+CEYLTEE+S ITIHSPAS VP PLKK SNKS+ PLS RAILSDS++ NEGN K+AT SA ENE+ A+ +D+
Subjt:  SADGAVLRFQLTTRAVEKDHSRNRTPHFICEYLTEEESAITIHSPASGVPFPLKKASNKSDLPLSFRAILSDSIESNEGNHKTATASASENEALAIYHDN

Query:  DVSVDSGSEDTLMSMKKKNQTQSKCKKKEVDSQALECSDEPNDAQTKTDELPGSGDNFETFPPKSVALHRVRWNMNTGSERWLCYGGEAGIIRCQEIVLS
        DV V+SGSEDT MS++ KNQTQSK KKK V +Q LE S EP+D+QT  D +PG G++FE FPPKSVALHR+RWNMN GSERWL YGG AGI+RCQEIVLS
Subjt:  DVSVDSGSEDTLMSMKKKNQTQSKCKKKEVDSQALECSDEPNDAQTKTDELPGSGDNFETFPPKSVALHRVRWNMNTGSERWLCYGGEAGIIRCQEIVLS

Query:  DFDKKLMRKK
          DKKLM KK
Subjt:  DFDKKLMRKK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G19485.1 Transducin/WD40 repeat-like superfamily protein2.0e-18843.51Show/hide
Query:  LDGVGIKVLEFDHCAENHFRAMDTIAELCGEAEDGDGGIDESDIQRFSSSAFFLREWRFYNYEPKTVKFASDLRGSEGKDGDITINLPQFSSAAV--LKN
        +DG    +  FD+ AE+H +A+++I +LCGEA   +  IDE+DI   SSS  FLREWR YN+EPK+  F ++   +       +  LPQFSSA    +K 
Subjt:  LDGVGIKVLEFDHCAENHFRAMDTIAELCGEAEDGDGGIDESDIQRFSSSAFFLREWRFYNYEPKTVKFASDLRGSEGKDGDITINLPQFSSAAV--LKN

Query:  GTPSGAATSLDWRNFVMYVGGPVWALDWCPQVLEKTDALIKCEFIAVSAHPPGSSYHKMGTPLIGRGMVQIWCLVHGT--ENHEPEPAYATKCKSKPKKD
             +++    ++FVM+VGG VWA++WCP+V    DA  KCEF+AV+ HPP S  HK+G PLIGRG++QIWC+++ T  ++         K   K +K 
Subjt:  GTPSGAATSLDWRNFVMYVGGPVWALDWCPQVLEKTDALIKCEFIAVSAHPPGSSYHKMGTPLIGRGMVQIWCLVHGT--ENHEPEPAYATKCKSKPKKD

Query:  EVSSDL-SSQPKRPRGRPPGTKKKGASDLPSQPKRPRGRPKKKQEGSND-NMGDNNQIVQSLSVEYPAGSSNLLEIDGDPKNSEELLLLGNSVERQKSTL
               +++PK+PRGRP    +K   +  ++PK+PRGRP+KK        + D+   V++LSV YP  S     +   P        +  +    + + 
Subjt:  EVSSDL-SSQPKRPRGRPPGTKKKGASDLPSQPKRPRGRPKKKQEGSND-NMGDNNQIVQSLSVEYPAGSSNLLEIDGDPKNSEELLLLGNSVERQKSTL

Query:  QAVSTCNSKDEGPAQKRRVRRKVGTKNHIDDMGTLPFTVNREDGSSTISFQENENVISEYSGEDTLLCNNISKNAEFSIPES-VALPRVVLCLAHNGKVA
        Q +S+ N+  + P +++R + K                 + E+  + +  + +E V             N+       I E  VALPRVVLCLAHNGKV 
Subjt:  QAVSTCNSKDEGPAQKRRVRRKVGTKNHIDDMGTLPFTVNREDGSSTISFQENENVISEYSGEDTLLCNNISKNAEFSIPES-VALPRVVLCLAHNGKVA

Query:  WDLKWKPSNACTTNCKHRMGYLAVLLGNGSLEVWEIPFPHVVKAIYSKFNREGTDPRFVKLKPIFRSTMLKSANIQSIPLTVEWSSTPPYDYLFAGCNDG
        WD+KW+PS A  +  KH MGYLAVLLGNGSLEVW++P P    A+Y    +  TDPRFVKL P+F+ + LK  + +SIPLTVEWS+    D+L AGC+DG
Subjt:  WDLKWKPSNACTTNCKHRMGYLAVLLGNGSLEVWEIPFPHVVKAIYSKFNREGTDPRFVKLKPIFRSTMLKSANIQSIPLTVEWSSTPPYDYLFAGCNDG

Query:  TVALWKFSANSTCEG-----------------CFSCPNFDPESANVVLTASHGGLKFWDLRDPFRPLWDIHPAPRMIYSLDWLPDPRCVILSFDDGTLRL
        TVALWKFS   + E                   ++    D ESAN+V TA H GLKFWDLRDPFRPLWD+HP PR IYSLDWL DP CV+LSFDDGTLR+
Subjt:  TVALWKFSANSTCEG-----------------CFSCPNFDPESANVVLTASHGGLKFWDLRDPFRPLWDIHPAPRMIYSLDWLPDPRCVILSFDDGTLRL

Query:  LSLLKAAYDVPVTGKPFTGTKQQGLHSYYGSSFAIWSVQVSRQTGMVAYCSADGAVLRFQLTTRAVEKDHSRNRTPHFICEYLTEEESAITIHSPASGVP
        LSL+K AYDVP TG+P+  TKQQGL  Y  S+F IWS+QVSR TG+ AYC+ADG++  F+LTT+AVEKD +RNRTPH++C  LT ++S   +HSP   +P
Subjt:  LSLLKAAYDVPVTGKPFTGTKQQGLHSYYGSSFAIWSVQVSRQTGMVAYCSADGAVLRFQLTTRAVEKDHSRNRTPHFICEYLTEEESAITIHSPASGVP

Query:  FPLKK-ASNKSDLPLSFRAILSDSIESNEGNHKTATASASENEALAIYHDNDVSVDSGSEDT----LMSMKKKNQTQSKCKKKEVDSQALECSDE---PN
          LKK      +     R++L++S      N        S+ + LA  H  D  ++S SE T      S  KK +  ++ ++ E +S+AL C  E     
Subjt:  FPLKK-ASNKSDLPLSFRAILSDSIESNEGNHKTATASASENEALAIYHDNDVSVDSGSEDT----LMSMKKKNQTQSKCKKKEVDSQALECSDE---PN

Query:  DAQTKTDELPGSGDNFETFPPKSVALHRVRWNMNTGSERWLCYGGEAGIIRCQEI
        + + K      +G   E FPPK VA+HRVRWNMN GSERWLCYGG AGI+RCQEI
Subjt:  DAQTKTDELPGSGDNFETFPPKSVALHRVRWNMNTGSERWLCYGGEAGIIRCQEI

AT1G19485.2 Transducin/WD40 repeat-like superfamily protein2.0e-18843.51Show/hide
Query:  LDGVGIKVLEFDHCAENHFRAMDTIAELCGEAEDGDGGIDESDIQRFSSSAFFLREWRFYNYEPKTVKFASDLRGSEGKDGDITINLPQFSSAAV--LKN
        +DG    +  FD+ AE+H +A+++I +LCGEA   +  IDE+DI   SSS  FLREWR YN+EPK+  F ++   +       +  LPQFSSA    +K 
Subjt:  LDGVGIKVLEFDHCAENHFRAMDTIAELCGEAEDGDGGIDESDIQRFSSSAFFLREWRFYNYEPKTVKFASDLRGSEGKDGDITINLPQFSSAAV--LKN

Query:  GTPSGAATSLDWRNFVMYVGGPVWALDWCPQVLEKTDALIKCEFIAVSAHPPGSSYHKMGTPLIGRGMVQIWCLVHGT--ENHEPEPAYATKCKSKPKKD
             +++    ++FVM+VGG VWA++WCP+V    DA  KCEF+AV+ HPP S  HK+G PLIGRG++QIWC+++ T  ++         K   K +K 
Subjt:  GTPSGAATSLDWRNFVMYVGGPVWALDWCPQVLEKTDALIKCEFIAVSAHPPGSSYHKMGTPLIGRGMVQIWCLVHGT--ENHEPEPAYATKCKSKPKKD

Query:  EVSSDL-SSQPKRPRGRPPGTKKKGASDLPSQPKRPRGRPKKKQEGSND-NMGDNNQIVQSLSVEYPAGSSNLLEIDGDPKNSEELLLLGNSVERQKSTL
               +++PK+PRGRP    +K   +  ++PK+PRGRP+KK        + D+   V++LSV YP  S     +   P        +  +    + + 
Subjt:  EVSSDL-SSQPKRPRGRPPGTKKKGASDLPSQPKRPRGRPKKKQEGSND-NMGDNNQIVQSLSVEYPAGSSNLLEIDGDPKNSEELLLLGNSVERQKSTL

Query:  QAVSTCNSKDEGPAQKRRVRRKVGTKNHIDDMGTLPFTVNREDGSSTISFQENENVISEYSGEDTLLCNNISKNAEFSIPES-VALPRVVLCLAHNGKVA
        Q +S+ N+  + P +++R + K                 + E+  + +  + +E V             N+       I E  VALPRVVLCLAHNGKV 
Subjt:  QAVSTCNSKDEGPAQKRRVRRKVGTKNHIDDMGTLPFTVNREDGSSTISFQENENVISEYSGEDTLLCNNISKNAEFSIPES-VALPRVVLCLAHNGKVA

Query:  WDLKWKPSNACTTNCKHRMGYLAVLLGNGSLEVWEIPFPHVVKAIYSKFNREGTDPRFVKLKPIFRSTMLKSANIQSIPLTVEWSSTPPYDYLFAGCNDG
        WD+KW+PS A  +  KH MGYLAVLLGNGSLEVW++P P    A+Y    +  TDPRFVKL P+F+ + LK  + +SIPLTVEWS+    D+L AGC+DG
Subjt:  WDLKWKPSNACTTNCKHRMGYLAVLLGNGSLEVWEIPFPHVVKAIYSKFNREGTDPRFVKLKPIFRSTMLKSANIQSIPLTVEWSSTPPYDYLFAGCNDG

Query:  TVALWKFSANSTCEG-----------------CFSCPNFDPESANVVLTASHGGLKFWDLRDPFRPLWDIHPAPRMIYSLDWLPDPRCVILSFDDGTLRL
        TVALWKFS   + E                   ++    D ESAN+V TA H GLKFWDLRDPFRPLWD+HP PR IYSLDWL DP CV+LSFDDGTLR+
Subjt:  TVALWKFSANSTCEG-----------------CFSCPNFDPESANVVLTASHGGLKFWDLRDPFRPLWDIHPAPRMIYSLDWLPDPRCVILSFDDGTLRL

Query:  LSLLKAAYDVPVTGKPFTGTKQQGLHSYYGSSFAIWSVQVSRQTGMVAYCSADGAVLRFQLTTRAVEKDHSRNRTPHFICEYLTEEESAITIHSPASGVP
        LSL+K AYDVP TG+P+  TKQQGL  Y  S+F IWS+QVSR TG+ AYC+ADG++  F+LTT+AVEKD +RNRTPH++C  LT ++S   +HSP   +P
Subjt:  LSLLKAAYDVPVTGKPFTGTKQQGLHSYYGSSFAIWSVQVSRQTGMVAYCSADGAVLRFQLTTRAVEKDHSRNRTPHFICEYLTEEESAITIHSPASGVP

Query:  FPLKK-ASNKSDLPLSFRAILSDSIESNEGNHKTATASASENEALAIYHDNDVSVDSGSEDT----LMSMKKKNQTQSKCKKKEVDSQALECSDE---PN
          LKK      +     R++L++S      N        S+ + LA  H  D  ++S SE T      S  KK +  ++ ++ E +S+AL C  E     
Subjt:  FPLKK-ASNKSDLPLSFRAILSDSIESNEGNHKTATASASENEALAIYHDNDVSVDSGSEDT----LMSMKKKNQTQSKCKKKEVDSQALECSDE---PN

Query:  DAQTKTDELPGSGDNFETFPPKSVALHRVRWNMNTGSERWLCYGGEAGIIRCQEI
        + + K      +G   E FPPK VA+HRVRWNMN GSERWLCYGG AGI+RCQEI
Subjt:  DAQTKTDELPGSGDNFETFPPKSVALHRVRWNMNTGSERWLCYGGEAGIIRCQEI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
GTGGAAGTGGTGGAGCCTCCGGTACCGGCAGCATCCACTGGCACTGGCGGCAAGAGGGGGAAGAGGAAGCAACCAGTGGCTCGGCAGAAGAAGGAAGCGCCGGGAAGAGC
TAAGAAGAAGCCGGGGGGAGCTTCGGCCGACGAAGAGCAACCTACAGGTCGCTTAGATGGCGTCGGGATTAAGGTTTTGGAGTTTGATCATTGTGCTGAGAATCATTTTA
GAGCCATGGATACAATTGCCGAGCTCTGTGGCGAGGCAGAGGACGGCGATGGCGGAATTGACGAAAGTGACATTCAGCGGTTTTCATCGTCCGCCTTTTTCTTAAGGGAA
TGGAGGTTCTATAATTATGAACCGAAGACCGTCAAGTTTGCTAGCGACCTGAGAGGCTCTGAGGGTAAGGATGGTGACATCACGATAAACTTACCACAGTTTTCTTCTGC
TGCTGTTCTAAAGAATGGAACACCGTCTGGAGCCGCTACATCTCTGGACTGGAGAAACTTTGTCATGTATGTTGGTGGGCCTGTTTGGGCTTTAGATTGGTGTCCTCAAG
TTCTTGAAAAAACTGATGCCCTTATCAAATGTGAGTTTATTGCTGTTTCTGCTCATCCACCTGGCTCTTCTTATCACAAGATGGGTACCCCACTCATAGGAAGAGGTATG
GTGCAGATATGGTGCTTAGTGCATGGAACTGAAAACCATGAACCAGAGCCGGCCTATGCAACGAAGTGCAAATCGAAGCCTAAAAAAGATGAGGTATCATCAGATTTATC
ATCTCAACCAAAGAGGCCTAGAGGAAGGCCACCAGGAACTAAGAAAAAGGGGGCATCAGACTTGCCATCTCAACCAAAGAGGCCTAGAGGAAGACCTAAAAAGAAACAAG
AGGGATCCAATGATAACATGGGTGACAATAATCAAATTGTTCAGTCCCTTTCTGTTGAATACCCGGCTGGTTCATCCAACTTGCTTGAGATTGATGGCGACCCCAAAAAT
TCTGAAGAACTTCTATTACTGGGAAATAGTGTTGAAAGACAGAAGAGTACATTACAAGCAGTTTCTACATGCAATTCTAAGGATGAAGGTCCTGCGCAGAAGAGAAGAGT
GAGAAGAAAAGTTGGGACTAAGAATCATATTGATGACATGGGAACATTACCATTTACAGTAAATCGAGAAGATGGATCCTCTACTATCAGTTTTCAGGAAAATGAGAATG
TTATAAGTGAATATTCTGGAGAAGATACTCTTTTATGTAATAATATTTCAAAGAATGCTGAATTTTCTATTCCAGAGAGTGTTGCTTTGCCTAGAGTAGTGTTGTGCTTA
GCTCATAATGGAAAGGTAGCATGGGATTTGAAATGGAAGCCATCTAATGCGTGTACTACCAACTGCAAGCACCGAATGGGCTACCTTGCTGTCTTGCTGGGCAACGGATC
TCTAGAAGTCTGGGAGATTCCTTTTCCTCACGTAGTGAAAGCCATCTATTCTAAATTCAATAGGGAGGGTACAGATCCTCGCTTTGTGAAGTTGAAGCCTATTTTCAGGA
GCACGATGTTGAAAAGTGCAAATATACAGAGCATCCCTCTGACTGTGGAATGGTCGTCAACACCTCCCTATGATTATCTATTTGCTGGATGCAATGATGGAACAGTTGCC
TTGTGGAAGTTTTCTGCAAATAGTACCTGTGAAGGTTGTTTTTCTTGCCCGAACTTCGATCCAGAAAGTGCAAATGTGGTGCTTACTGCTAGTCATGGAGGTTTAAAATT
TTGGGACCTAAGAGATCCTTTCCGTCCTTTGTGGGACATTCATCCAGCACCCAGGATGATATATAGTCTGGATTGGCTTCCTGATCCTAGATGCGTTATTTTATCCTTTG
ATGATGGAACATTGAGACTTCTCAGTCTGCTAAAGGCTGCATACGACGTTCCAGTAACTGGTAAACCCTTCACAGGGACAAAACAACAAGGCTTACATAGTTACTATGGT
TCATCATTTGCTATCTGGAGTGTTCAAGTGTCACGGCAGACAGGCATGGTTGCATATTGCAGTGCCGATGGAGCTGTTCTCCGTTTCCAGCTTACTACGAGAGCAGTTGA
GAAAGATCATTCACGTAATCGAACCCCACATTTCATATGTGAATACTTAACTGAGGAGGAATCGGCTATTACAATCCACTCTCCAGCCTCAGGTGTTCCATTCCCTTTGA
AGAAGGCATCCAACAAATCCGACCTGCCACTGTCCTTTCGAGCTATTCTATCTGATTCAATTGAGTCAAATGAAGGAAATCACAAGACGGCCACAGCTTCTGCATCGGAA
AATGAAGCATTAGCCATTTACCACGACAATGATGTCAGCGTTGATTCTGGATCTGAGGATACACTGATGTCCATGAAGAAGAAAAACCAAACTCAATCAAAGTGCAAGAA
GAAGGAAGTGGATAGCCAAGCACTGGAATGTAGCGACGAACCTAATGATGCACAGACAAAAACCGATGAATTGCCTGGTTCGGGGGATAATTTTGAAACTTTCCCTCCAA
AATCAGTTGCATTGCATAGAGTGAGATGGAACATGAACACAGGGAGTGAGAGATGGTTGTGCTATGGTGGAGAAGCTGGAATAATACGTTGTCAGGAGATTGTGTTGTCT
GACTTCGATAAGAAGTTGATGAGGAAGAAG
mRNA sequenceShow/hide mRNA sequence
GTGGAAGTGGTGGAGCCTCCGGTACCGGCAGCATCCACTGGCACTGGCGGCAAGAGGGGGAAGAGGAAGCAACCAGTGGCTCGGCAGAAGAAGGAAGCGCCGGGAAGAGC
TAAGAAGAAGCCGGGGGGAGCTTCGGCCGACGAAGAGCAACCTACAGGTCGCTTAGATGGCGTCGGGATTAAGGTTTTGGAGTTTGATCATTGTGCTGAGAATCATTTTA
GAGCCATGGATACAATTGCCGAGCTCTGTGGCGAGGCAGAGGACGGCGATGGCGGAATTGACGAAAGTGACATTCAGCGGTTTTCATCGTCCGCCTTTTTCTTAAGGGAA
TGGAGGTTCTATAATTATGAACCGAAGACCGTCAAGTTTGCTAGCGACCTGAGAGGCTCTGAGGGTAAGGATGGTGACATCACGATAAACTTACCACAGTTTTCTTCTGC
TGCTGTTCTAAAGAATGGAACACCGTCTGGAGCCGCTACATCTCTGGACTGGAGAAACTTTGTCATGTATGTTGGTGGGCCTGTTTGGGCTTTAGATTGGTGTCCTCAAG
TTCTTGAAAAAACTGATGCCCTTATCAAATGTGAGTTTATTGCTGTTTCTGCTCATCCACCTGGCTCTTCTTATCACAAGATGGGTACCCCACTCATAGGAAGAGGTATG
GTGCAGATATGGTGCTTAGTGCATGGAACTGAAAACCATGAACCAGAGCCGGCCTATGCAACGAAGTGCAAATCGAAGCCTAAAAAAGATGAGGTATCATCAGATTTATC
ATCTCAACCAAAGAGGCCTAGAGGAAGGCCACCAGGAACTAAGAAAAAGGGGGCATCAGACTTGCCATCTCAACCAAAGAGGCCTAGAGGAAGACCTAAAAAGAAACAAG
AGGGATCCAATGATAACATGGGTGACAATAATCAAATTGTTCAGTCCCTTTCTGTTGAATACCCGGCTGGTTCATCCAACTTGCTTGAGATTGATGGCGACCCCAAAAAT
TCTGAAGAACTTCTATTACTGGGAAATAGTGTTGAAAGACAGAAGAGTACATTACAAGCAGTTTCTACATGCAATTCTAAGGATGAAGGTCCTGCGCAGAAGAGAAGAGT
GAGAAGAAAAGTTGGGACTAAGAATCATATTGATGACATGGGAACATTACCATTTACAGTAAATCGAGAAGATGGATCCTCTACTATCAGTTTTCAGGAAAATGAGAATG
TTATAAGTGAATATTCTGGAGAAGATACTCTTTTATGTAATAATATTTCAAAGAATGCTGAATTTTCTATTCCAGAGAGTGTTGCTTTGCCTAGAGTAGTGTTGTGCTTA
GCTCATAATGGAAAGGTAGCATGGGATTTGAAATGGAAGCCATCTAATGCGTGTACTACCAACTGCAAGCACCGAATGGGCTACCTTGCTGTCTTGCTGGGCAACGGATC
TCTAGAAGTCTGGGAGATTCCTTTTCCTCACGTAGTGAAAGCCATCTATTCTAAATTCAATAGGGAGGGTACAGATCCTCGCTTTGTGAAGTTGAAGCCTATTTTCAGGA
GCACGATGTTGAAAAGTGCAAATATACAGAGCATCCCTCTGACTGTGGAATGGTCGTCAACACCTCCCTATGATTATCTATTTGCTGGATGCAATGATGGAACAGTTGCC
TTGTGGAAGTTTTCTGCAAATAGTACCTGTGAAGGTTGTTTTTCTTGCCCGAACTTCGATCCAGAAAGTGCAAATGTGGTGCTTACTGCTAGTCATGGAGGTTTAAAATT
TTGGGACCTAAGAGATCCTTTCCGTCCTTTGTGGGACATTCATCCAGCACCCAGGATGATATATAGTCTGGATTGGCTTCCTGATCCTAGATGCGTTATTTTATCCTTTG
ATGATGGAACATTGAGACTTCTCAGTCTGCTAAAGGCTGCATACGACGTTCCAGTAACTGGTAAACCCTTCACAGGGACAAAACAACAAGGCTTACATAGTTACTATGGT
TCATCATTTGCTATCTGGAGTGTTCAAGTGTCACGGCAGACAGGCATGGTTGCATATTGCAGTGCCGATGGAGCTGTTCTCCGTTTCCAGCTTACTACGAGAGCAGTTGA
GAAAGATCATTCACGTAATCGAACCCCACATTTCATATGTGAATACTTAACTGAGGAGGAATCGGCTATTACAATCCACTCTCCAGCCTCAGGTGTTCCATTCCCTTTGA
AGAAGGCATCCAACAAATCCGACCTGCCACTGTCCTTTCGAGCTATTCTATCTGATTCAATTGAGTCAAATGAAGGAAATCACAAGACGGCCACAGCTTCTGCATCGGAA
AATGAAGCATTAGCCATTTACCACGACAATGATGTCAGCGTTGATTCTGGATCTGAGGATACACTGATGTCCATGAAGAAGAAAAACCAAACTCAATCAAAGTGCAAGAA
GAAGGAAGTGGATAGCCAAGCACTGGAATGTAGCGACGAACCTAATGATGCACAGACAAAAACCGATGAATTGCCTGGTTCGGGGGATAATTTTGAAACTTTCCCTCCAA
AATCAGTTGCATTGCATAGAGTGAGATGGAACATGAACACAGGGAGTGAGAGATGGTTGTGCTATGGTGGAGAAGCTGGAATAATACGTTGTCAGGAGATTGTGTTGTCT
GACTTCGATAAGAAGTTGATGAGGAAGAAG
Protein sequenceShow/hide protein sequence
VEVVEPPVPAASTGTGGKRGKRKQPVARQKKEAPGRAKKKPGGASADEEQPTGRLDGVGIKVLEFDHCAENHFRAMDTIAELCGEAEDGDGGIDESDIQRFSSSAFFLRE
WRFYNYEPKTVKFASDLRGSEGKDGDITINLPQFSSAAVLKNGTPSGAATSLDWRNFVMYVGGPVWALDWCPQVLEKTDALIKCEFIAVSAHPPGSSYHKMGTPLIGRGM
VQIWCLVHGTENHEPEPAYATKCKSKPKKDEVSSDLSSQPKRPRGRPPGTKKKGASDLPSQPKRPRGRPKKKQEGSNDNMGDNNQIVQSLSVEYPAGSSNLLEIDGDPKN
SEELLLLGNSVERQKSTLQAVSTCNSKDEGPAQKRRVRRKVGTKNHIDDMGTLPFTVNREDGSSTISFQENENVISEYSGEDTLLCNNISKNAEFSIPESVALPRVVLCL
AHNGKVAWDLKWKPSNACTTNCKHRMGYLAVLLGNGSLEVWEIPFPHVVKAIYSKFNREGTDPRFVKLKPIFRSTMLKSANIQSIPLTVEWSSTPPYDYLFAGCNDGTVA
LWKFSANSTCEGCFSCPNFDPESANVVLTASHGGLKFWDLRDPFRPLWDIHPAPRMIYSLDWLPDPRCVILSFDDGTLRLLSLLKAAYDVPVTGKPFTGTKQQGLHSYYG
SSFAIWSVQVSRQTGMVAYCSADGAVLRFQLTTRAVEKDHSRNRTPHFICEYLTEEESAITIHSPASGVPFPLKKASNKSDLPLSFRAILSDSIESNEGNHKTATASASE
NEALAIYHDNDVSVDSGSEDTLMSMKKKNQTQSKCKKKEVDSQALECSDEPNDAQTKTDELPGSGDNFETFPPKSVALHRVRWNMNTGSERWLCYGGEAGIIRCQEIVLS
DFDKKLMRKK