; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0015669 (gene) of Snake gourd v1 genome

Gene IDTan0015669
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptiontranscription factor MYB3R-2 isoform X3
Genome locationLG09:69769891..69782949
RNA-Seq ExpressionTan0015669
SyntenyTan0015669
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000978 - RNA polymerase II proximal promoter sequence-specific DNA binding (molecular function)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
InterPro domainsIPR001005 - SANT/Myb domain
IPR009057 - Homeobox-like domain superfamily
IPR017930 - Myb domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6589626.1 Myb-like protein L, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0080.93Show/hide
Query:  MSRRSHDDGGDEKLSAGEEDDEDDAI-DDMEVLRGACRLAGVNPEEYINHPSSSLAAGD-----DSDEVDDFELVRNIRKQFSIADDEQPLSILPPASPD
        MSRRSH DGGD++L A EEDDEDD + DDME LR ACRLAGVN E+YIN   S  AAGD     DSD+VDD EL+RNI+ +FSIA DE+PLSILPP + D
Subjt:  MSRRSHDDGGDEKLSAGEEDDEDDAI-DDMEVLRGACRLAGVNPEEYINHPSSSLAAGD-----DSDEVDDFELVRNIRKQFSIADDEQPLSILPPASPD

Query:  EEEDDFETLRRIQRRFGAYETDILSNKPNQSCDFDVPLEMDSNNTAIERQTYPERSSMLAFEKGSLPKAGLAFIDAIKKNRSQQKFIRSKMIHLEARIEE
        EEEDDFETLR IQRRF AYE+DILSNKP+QSCD D PL+MDS+NT +ER T  ERSSM+AFEKGSLPKA LAFIDAIKKNRSQQKFIRSKMIHLEARIEE
Subjt:  EEEDDFETLRRIQRRFGAYETDILSNKPNQSCDFDVPLEMDSNNTAIERQTYPERSSMLAFEKGSLPKAGLAFIDAIKKNRSQQKFIRSKMIHLEARIEE

Query:  TKKLRKHFKILKDFQGSCRRRTSCALSQTIDPRVQLISAGKPQAKDSSKKDKRLSAMYYGPAENSHVAYYKMALTKFPPSVDRKRWSNVERENLGKGIRQ
         KKLRK FK+LK FQGSCRR+T+CAL+Q +DPRVQLISAGKPQAKDSSKKDKRLSAM YGPAENSHVA Y+ ALTKF P VDRKRWSN ERENLGKGIRQ
Subjt:  TKKLRKHFKILKDFQGSCRRRTSCALSQTIDPRVQLISAGKPQAKDSSKKDKRLSAMYYGPAENSHVAYYKMALTKFPPSVDRKRWSNVERENLGKGIRQ

Query:  QFQEMVLQISVDQISGLQGFLADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEARWLNFEDPLINRNAWTTSEDKNLLFTIQQ
        QFQEMVLQISVDQIS +QGF A+SDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEARWLNFEDPLINRN+WTTSEDKNLLFTIQQ
Subjt:  QFQEMVLQISVDQISGLQGFLADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEARWLNFEDPLINRNAWTTSEDKNLLFTIQQ

Query:  KGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDEKLQSAVAILGVGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRL
        KGLNNWI++AVSLGTNRTPFQCLSRYQRSLNASILK EWTKDED+KL+SAVAI G GDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRL
Subjt:  KGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDEKLQSAVAILGVGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRL

Query:  KIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCKWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQ
        KIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRC+WTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPN+VPLLQEARKIQ
Subjt:  KIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCKWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQ

Query:  KAALISNFVDRESERPALGPTDFRPIPNTNLLCNTDDPKPAPKRNVKTRRMPVSRKNEKSATGDADAPKKRKSNNRRNQADATAQGGIANNVPE--EIKS
        K ALISNFVDRESERPALGPTDFRP+PN++LLCNTDDP+ AP RNV+ RRMPVSR NEKSA G  DAPKKRKSNN+RN+AD TAQ   ANN     E+KS
Subjt:  KAALISNFVDRESERPALGPTDFRPIPNTNLLCNTDDPKPAPKRNVKTRRMPVSRKNEKSATGDADAPKKRKSNNRRNQADATAQGGIANNVPE--EIKS

Query:  IKPQRKRNRHGAYTPKRKGVLELSSNSERCAEQNLDTRSLEVQLNGKE-SEMTNNDCAEAVDENGMEVCENNVAEKLSKRDVCCSEQEGNQNSTGSSGVS
         KPQRKR RHGAYT +RKG  ++  NSERCAEQN DT+SLEVQLN KE +E  N+DC E VDENGMEV EN  AE  S+  VC SEQE NQNSTGSSGVS
Subjt:  IKPQRKRNRHGAYTPKRKGVLELSSNSERCAEQNLDTRSLEVQLNGKE-SEMTNNDCAEAVDENGMEVCENNVAEKLSKRDVCCSEQEGNQNSTGSSGVS

Query:  VLSEMTNDTDEYNPFILPDTALLASSTAGDDIMVRNGESVADKDLDDSNSFSLPLNCLELRTTDSEGVNSYSVGEITDKDDNICKPQGRRKKNSKGSNES
        VLSEMTND DEYNP   PDT LLAS TA D I+   G +VADKDLDDSNSFSLP +CLELRTTDSEGV+SYSV E TDK   +CKPQGRRKKNSK SN+S
Subjt:  VLSEMTNDTDEYNPFILPDTALLASSTAGDDIMVRNGESVADKDLDDSNSFSLPLNCLELRTTDSEGVNSYSVGEITDKDDNICKPQGRRKKNSKGSNES

Query:  QGSLVVSGQQGELERSGTKSPRRHNQSKKRKHNTINTSSLGTLEAVEEVDDCTLLGFLQKRLKR-TTTHEKKVD---STLLEV---DNDPTIASLLEGKL
        Q SL VS QQ E E SG     R NQSKKRKH+  NTS LGT+EAVEEVDDCTL GFLQKRLKR TTTH+KKVD   ST  EV   DNDPT+A LL+ KL
Subjt:  QGSLVVSGQQGELERSGTKSPRRHNQSKKRKHNTINTSSLGTLEAVEEVDDCTLLGFLQKRLKR-TTTHEKKVD---STLLEV---DNDPTIASLLEGKL

Query:  KRKKHLA
        KRKK L+
Subjt:  KRKKHLA

KAG7023314.1 Myb-like protein L [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0080.24Show/hide
Query:  MSRRSHDDGGDEKLSAGEEDDEDDAI-DDMEVLRGACRLAGVNPEEYINHPSSSLAAGD-----DSDEVDDFELVRNIRKQFSIADDEQPLSILPPASPD
        MSRRSH DGGD++L A EEDDEDD + DDME LR ACRLAGVN E+YIN   S  AAGD     DSD+VDD EL+RNI+ +FSIA DE+PLSILPP + D
Subjt:  MSRRSHDDGGDEKLSAGEEDDEDDAI-DDMEVLRGACRLAGVNPEEYINHPSSSLAAGD-----DSDEVDDFELVRNIRKQFSIADDEQPLSILPPASPD

Query:  EEEDDFETLRRIQRRFGAYETDILSNKPNQSCDFDVPLEMDSNNTAIERQTYPERSSMLAFEKGSLPKAGLAFIDAIKKNRSQQKFIRSKMIHLEARIEE
        EEEDDFETLR IQRRF AYE+DILSNKP+QSCD D PL+MDS+NT +ER T  ERSSM+AFEKGSLPKA LAFIDAIKKNRSQQKFIRSKMIHLEARIEE
Subjt:  EEEDDFETLRRIQRRFGAYETDILSNKPNQSCDFDVPLEMDSNNTAIERQTYPERSSMLAFEKGSLPKAGLAFIDAIKKNRSQQKFIRSKMIHLEARIEE

Query:  TKKLRKHFKILKDFQGSCRRRTSCALSQTIDPRVQLISAGKPQAKDSSKKDKRLSAMYYGPAENSHVAYYKMALTKFPPSVDRKRWSNVERENLGKGIRQ
         KKLRK FK+LK FQGSCRR+T+CAL+Q +DPRVQLISAGKPQAKDSSKKDKRLSAM YGPAENSHVA Y+ ALTKF P VDRKRWSN ERENLGKGIRQ
Subjt:  TKKLRKHFKILKDFQGSCRRRTSCALSQTIDPRVQLISAGKPQAKDSSKKDKRLSAMYYGPAENSHVAYYKMALTKFPPSVDRKRWSNVERENLGKGIRQ

Query:  QFQEMVLQISVDQI----------SGLQGFLADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEARWLNFEDPLINRNAWTTSE
        QFQEMVLQISVDQI          S +QGF A+SDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEARWLNFEDPLINRN+WTTSE
Subjt:  QFQEMVLQISVDQI----------SGLQGFLADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEARWLNFEDPLINRNAWTTSE

Query:  DKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDEKLQSAVAILGVGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRG
        DKNLLFTIQQKGLNNWI++AVSLGTNRTPFQCLSRYQRSLNASILK EWTKDED+KL+SAVAI G GDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRG
Subjt:  DKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDEKLQSAVAILGVGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRG

Query:  YFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCKWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEV
        YFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRC+WTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPN+V
Subjt:  YFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCKWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEV

Query:  PLLQEARKIQKAALISNFVDRESERPALGPTDFRPIPNTNLLCNTDDPKPAPKRNVKTRRMPVSRKNEKSATGDADAPKKRKSNNRRNQADATAQGGIAN
        PLLQEARKIQK ALISNFVDRESERPALGPTDFRP+PN++LLCNTDDP+ APKRNV+ RRMPVSR NEKSA G  DAPKKRKSNN+RN+AD TAQ   AN
Subjt:  PLLQEARKIQKAALISNFVDRESERPALGPTDFRPIPNTNLLCNTDDPKPAPKRNVKTRRMPVSRKNEKSATGDADAPKKRKSNNRRNQADATAQGGIAN

Query:  NVPE--EIKSIKPQRKRNRHGAYTPKRKGVLELSSNSERCAEQNLDTRSLEVQLNGKE-SEMTNNDCAEAVDENGMEVCENNVAEKLSKRDVCCSEQEGN
        N     E+KS KPQRKR RHGAYT +RKG  ++  NSERCAEQN DT+SLEVQLN KE +E  N+DC E VDENGMEV EN  AE  S+  VC SEQE N
Subjt:  NVPE--EIKSIKPQRKRNRHGAYTPKRKGVLELSSNSERCAEQNLDTRSLEVQLNGKE-SEMTNNDCAEAVDENGMEVCENNVAEKLSKRDVCCSEQEGN

Query:  QNSTGSSGVSVLSEMTNDTDEYNPFILPDTALLASSTAGDDIMVRNGESVADKDLDDSNSFSLPLNCLELRTTDSEGVNSYSVGEITDKDDNICKPQGRR
        QNSTGSSGVSVLSEMTND DEYNP   PDT LLAS TA D I+   G +VADKDLDDSNSFSLP +CLELRTTDSEGV+SYSV E TDK   +CKPQGRR
Subjt:  QNSTGSSGVSVLSEMTNDTDEYNPFILPDTALLASSTAGDDIMVRNGESVADKDLDDSNSFSLPLNCLELRTTDSEGVNSYSVGEITDKDDNICKPQGRR

Query:  KKNSKGSNESQGSLVVSGQQGELERSGTKSPRRHNQSKKRKHNTINTSSLGTLEAVEEVDDCTLLGFLQKRLKR-TTTHEKKVD---STLLEV---DNDP
        KKNSK SN+SQ SL VS QQ ELE SG     R NQSKKRKH+  NTS LGT+EAVEEVDDCTL GFLQKRLKR TTTH+KKVD   ST  EV   DNDP
Subjt:  KKNSKGSNESQGSLVVSGQQGELERSGTKSPRRHNQSKKRKHNTINTSSLGTLEAVEEVDDCTLLGFLQKRLKR-TTTHEKKVD---STLLEV---DNDP

Query:  TIASLLEGKLKRKKHLA
        T+A LL+ KLKR+K L+
Subjt:  TIASLLEGKLKRKKHLA

XP_022987958.1 uncharacterized protein LOC111485355 isoform X1 [Cucurbita maxima]0.0e+0081.31Show/hide
Query:  MSRRSHDDGGDEKLSAGEEDDEDDAI-DDMEVLRGACRLAGVNPEEYINHPSSSLAAGD-----DSDEVDDFELVRNIRKQFSIADDEQPLSILPPASPD
        MSRRSH DGGD++L A EEDDEDD + DDME LR ACRLAGVN E+Y+N   S  AAGD     DSD+VDD EL+RNI+ +FSIA DEQPLSILPP + D
Subjt:  MSRRSHDDGGDEKLSAGEEDDEDDAI-DDMEVLRGACRLAGVNPEEYINHPSSSLAAGD-----DSDEVDDFELVRNIRKQFSIADDEQPLSILPPASPD

Query:  EEEDDFETLRRIQRRFGAYETDILSNKPNQSCDFDVPLEMDSNNTAIERQTYPERSSMLAFEKGSLPKAGLAFIDAIKKNRSQQKFIRSKMIHLEARIEE
        EEEDDFETLR IQRRF AYE+DILSNKP+QSCD D PL+MDS+NT +ER T  ERSSM+AFEKGSLPKA LAFIDAIKKNRSQQKF+RSKMIHLEARIEE
Subjt:  EEEDDFETLRRIQRRFGAYETDILSNKPNQSCDFDVPLEMDSNNTAIERQTYPERSSMLAFEKGSLPKAGLAFIDAIKKNRSQQKFIRSKMIHLEARIEE

Query:  TKKLRKHFKILKDFQGSCRRRTSCALSQTIDPRVQLISAGKP-QAKDSSKKDKRLSAMYYGPAENSHVAYYKMALTKFPPSVDRKRWSNVERENLGKGIR
         KKLRK FK+LK FQGSCRR+T+CALSQ +DPRVQLISAGKP QAKDSSKKDKRLSAM YGPAENSHVA Y++ALTKF P VDRKRWSN ERENLGKGIR
Subjt:  TKKLRKHFKILKDFQGSCRRRTSCALSQTIDPRVQLISAGKP-QAKDSSKKDKRLSAMYYGPAENSHVAYYKMALTKFPPSVDRKRWSNVERENLGKGIR

Query:  QQFQEMVLQISVDQISGLQGFLADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEARWLNFEDPLINRNAWTTSEDKNLLFTIQ
        QQFQEMVLQISVDQIS +QGF A+SDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEARWLNFEDPLINRN WTTSEDKNLLFTIQ
Subjt:  QQFQEMVLQISVDQISGLQGFLADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEARWLNFEDPLINRNAWTTSEDKNLLFTIQ

Query:  QKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDEKLQSAVAILGVGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSR
        QKGLNNWID+AVSLGTNRTPFQ LSRYQRSLNASILK EWTKDED+KL+SAVAI G GDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSR
Subjt:  QKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDEKLQSAVAILGVGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSR

Query:  LKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCKWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKI
        LKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRC+WTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPN+VPLLQEARKI
Subjt:  LKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCKWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKI

Query:  QKAALISNFVDRESERPALGPTDFRPIPNTNLLCNTDDPKPAPKRNVKTRRMPVSRKNEKSATGDADAPKKRKSNNRRNQADATAQGGIANNVPE--EIK
        QK ALISNFVDRESERPALGPTDFRP+PN++LLCNTDDP+ APKRNV+TRRMPVSR NEKSA G  DAPKKRKSNN+RN+ D TAQ   A+N     E+K
Subjt:  QKAALISNFVDRESERPALGPTDFRPIPNTNLLCNTDDPKPAPKRNVKTRRMPVSRKNEKSATGDADAPKKRKSNNRRNQADATAQGGIANNVPE--EIK

Query:  SIKPQRKRNRHGAYTPKRKGVLELSSNSERCAEQNLDTRSLEVQLNGKE-SEMTNNDCAEAVDENGMEVCENNVAEKLSKRDVCCSEQEGNQNSTGSSGV
        S KPQRKR RHGAYT +RKG  ++  NSERCAEQN DTR+LEVQLN KE +E  N+DC E VDENGMEV EN  AE  S+  VC SEQE NQNSTGSSGV
Subjt:  SIKPQRKRNRHGAYTPKRKGVLELSSNSERCAEQNLDTRSLEVQLNGKE-SEMTNNDCAEAVDENGMEVCENNVAEKLSKRDVCCSEQEGNQNSTGSSGV

Query:  SVLSEMTNDTDEYNPFILPDTALLASSTAGDDIMVRNGESVADKDLDDSNSFSLPLNCLELRTTDSEGVNSYSVGEITDKDDNICKPQGRRKKNSKGSNE
        SVLSEMTND DEYNP  LPDT LLAS TA DDI+   G +VADKDLD SNSFSLP +CLELRTTDSEGV+SYSV E TDK   +CKPQGRRKKNSK SN+
Subjt:  SVLSEMTNDTDEYNPFILPDTALLASSTAGDDIMVRNGESVADKDLDDSNSFSLPLNCLELRTTDSEGVNSYSVGEITDKDDNICKPQGRRKKNSKGSNE

Query:  SQGSLVVSGQQGELERSGTKSPRRHNQSKKRKHNTINTSSLGTLEAVEEVDDCTLLGFLQKRLKR-TTTHEKKVD---STLLEV---DNDPTIASLLEGK
        SQ SL VS QQ ELE SGT    R NQ KKRKH++ NTS LGT+EAVEEVDDCTLLGFLQKRLKR TTTH KKVD   ST  EV   DNDPT+A LL+ K
Subjt:  SQGSLVVSGQQGELERSGTKSPRRHNQSKKRKHNTINTSSLGTLEAVEEVDDCTLLGFLQKRLKR-TTTHEKKVD---STLLEV---DNDPTIASLLEGK

Query:  LKRKKH
        LKRKKH
Subjt:  LKRKKH

XP_023515735.1 uncharacterized protein LOC111779809 isoform X1 [Cucurbita pepo subsp. pepo]0.0e+0081.09Show/hide
Query:  MSRRSHDDGGDEKLSAGEEDDEDDAI-DDMEVLRGACRLAGVNPEEYINHPSSSLAAGD-----DSDEVDDFELVRNIRKQFSIADDEQPLSILPPASPD
        MSRRSH DGGD++L A EEDDEDD + DDME LR ACRLAGVN E+YIN   S  AAGD     DSD+VDD EL+RNI+ +FSIA DEQPLSILPP + D
Subjt:  MSRRSHDDGGDEKLSAGEEDDEDDAI-DDMEVLRGACRLAGVNPEEYINHPSSSLAAGD-----DSDEVDDFELVRNIRKQFSIADDEQPLSILPPASPD

Query:  EEEDDFETLRRIQRRFGAYETDILSNKPNQSCDFDVPLEMDSNNTAIERQTYPERSSMLAFEKGSLPKAGLAFIDAIKKNRSQQKFIRSKMIHLEARIEE
        EEEDDFE LR IQRRF AYE+DILSNKP+QSCD D PL+MDS NT +ER T  ERSSM+AFEKGSLPKA LAFIDAIKKNRSQQKFIRSKMIHLEARIEE
Subjt:  EEEDDFETLRRIQRRFGAYETDILSNKPNQSCDFDVPLEMDSNNTAIERQTYPERSSMLAFEKGSLPKAGLAFIDAIKKNRSQQKFIRSKMIHLEARIEE

Query:  TKKLRKHFKILKDFQGSCRRRTSCALSQTIDPRVQLISAGKPQAKDSSKKDKRLSAMYYGPAENSHVAYYKMALTKFPPSVDRKRWSNVERENLGKGIRQ
         KKLRK FK+LK FQGSCRR+T+CAL+Q +DPRVQLISAGKPQAKDSSKKDKRLS+M YGPAENSHVA Y+ A TKF P VDRKRWSN ERENLGKGIRQ
Subjt:  TKKLRKHFKILKDFQGSCRRRTSCALSQTIDPRVQLISAGKPQAKDSSKKDKRLSAMYYGPAENSHVAYYKMALTKFPPSVDRKRWSNVERENLGKGIRQ

Query:  QFQEMVLQISVDQISGLQGFLADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEARWLNFEDPLINRNAWTTSEDKNLLFTIQQ
        QFQEMVLQISVDQIS +QGF A+SDDLDNILASIK LDITPEKIREFLPKVNWDKLASMYLRGRSGAECEARWLNFEDPLINRN WTTSEDKNLLFTIQQ
Subjt:  QFQEMVLQISVDQISGLQGFLADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEARWLNFEDPLINRNAWTTSEDKNLLFTIQQ

Query:  KGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDEKLQSAVAILGVGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRL
        KGLNNWI++AVSLGTNRTPFQCLSRYQRSLNASILK EWTKDED+KL+SAVA+ G GDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRL
Subjt:  KGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDEKLQSAVAILGVGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRL

Query:  KIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCKWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQ
        KIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRC+WTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPN+VPLLQEARKIQ
Subjt:  KIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCKWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQ

Query:  KAALISNFVDRESERPALGPTDFRPIPNTNLLCNTDDPKPAPKRNVKTRRMPVSRKNEKSATGDADAPKKRKSNNRRNQADATAQGGIANNVPE--EIKS
        K ALISNFVDRESERPALGPTDFRP+PN++LLCNTDDP+ APKRNV+TRRMPVSR NEKSA G  DAPK+RKSNN+RN+AD TAQ    NN     E+KS
Subjt:  KAALISNFVDRESERPALGPTDFRPIPNTNLLCNTDDPKPAPKRNVKTRRMPVSRKNEKSATGDADAPKKRKSNNRRNQADATAQGGIANNVPE--EIKS

Query:  IKPQRKRNRHGAYTPKRKGVLELSSNSERCAEQNLDTRSLEVQLNGKE-SEMTNNDCAEAVDENGMEVCENNVAEKLSKRDVCCSEQEGNQNSTGSSGVS
         KPQRKR RHGAYT +RKG  ++  NSERCAEQN DTRS+EVQLN KE +E  N+DC E VDENGMEV EN  AE  S+  VC SEQE NQNSTGSSGVS
Subjt:  IKPQRKRNRHGAYTPKRKGVLELSSNSERCAEQNLDTRSLEVQLNGKE-SEMTNNDCAEAVDENGMEVCENNVAEKLSKRDVCCSEQEGNQNSTGSSGVS

Query:  VLSEMTNDTDEYNPFILPDTALLASSTAGDDIMVRNGESVADKDLDDSNSFSLPLNCLELRTTDSEGVNSYSVGEITDKDDNICKPQGRRKKNSKGSNES
        VLSEMTND DEYNP  LPDT LLAS TA DDI+   G +VADKDLDDSNSFSLP +CLELRTTDSEGV+SYSV E TDK   +CKPQGRRKKNSK SN+S
Subjt:  VLSEMTNDTDEYNPFILPDTALLASSTAGDDIMVRNGESVADKDLDDSNSFSLPLNCLELRTTDSEGVNSYSVGEITDKDDNICKPQGRRKKNSKGSNES

Query:  QGSLVVSGQQGELERSGTKSPRRHNQSKKRKHNTINTSSLGTLEAVEEVDDCTLLGFLQKRLKR-TTTHEKKVD---STLLEV---DNDPTIASLLEGKL
        Q SL VS QQ ELE SGT    R NQSKKRKH+  NTS LGT+EAVEEVDDCTL GFLQKRLKR TTTH+KKVD   ST  EV   DNDPT+A LL  KL
Subjt:  QGSLVVSGQQGELERSGTKSPRRHNQSKKRKHNTINTSSLGTLEAVEEVDDCTLLGFLQKRLKR-TTTHEKKVD---STLLEV---DNDPTIASLLEGKL

Query:  KRKKH
        KRKKH
Subjt:  KRKKH

XP_023515736.1 uncharacterized protein LOC111779809 isoform X2 [Cucurbita pepo subsp. pepo]0.0e+0081Show/hide
Query:  MSRRSHDDGGDEKLSAGEEDDEDDAI-DDMEVLRGACRLAGVNPEEYINHPSSSLAAGD-----DSDEVDDFELVRNIRKQFSIADDEQPLSILPPASPD
        MSRRSH DGGD++L A EEDDEDD + DDME LR ACRLAGVN E+YIN   S  AAGD     DSD+VDD EL+RNI+ +FSIA DEQPLSILPP + D
Subjt:  MSRRSHDDGGDEKLSAGEEDDEDDAI-DDMEVLRGACRLAGVNPEEYINHPSSSLAAGD-----DSDEVDDFELVRNIRKQFSIADDEQPLSILPPASPD

Query:  EEEDDFETLRRIQRRFGAYETDILSNKPNQSCDFDVPLEMDSNNTAIERQTYPERSSMLAFEKGSLPKAGLAFIDAIKKNRSQQKFIRSKMIHLEARIEE
        EEEDDFE LR IQRRF AYE+DILSNKP+QSCD D PL+MDS NT +ER T  ERSSM+AFEKGSLPKA LAFIDAIKKNRSQQKFIRSKMIHLEARIEE
Subjt:  EEEDDFETLRRIQRRFGAYETDILSNKPNQSCDFDVPLEMDSNNTAIERQTYPERSSMLAFEKGSLPKAGLAFIDAIKKNRSQQKFIRSKMIHLEARIEE

Query:  TKKLRKHFKILKDFQGSCRRRTSCALSQTIDPRVQLISAGKPQAKDSSKKDKRLSAMYYGPAENSHVAYYKMALTKFPPSVDRKRWSNVERENLGKGIRQ
         KKLRK FK+LK FQGSCRR+T+CAL+Q +DPRVQLISAGKPQAKDSSKKDKRLS+M YGPAENSHVA Y+ A TKF P VDRKRWSN ERENLGKGIRQ
Subjt:  TKKLRKHFKILKDFQGSCRRRTSCALSQTIDPRVQLISAGKPQAKDSSKKDKRLSAMYYGPAENSHVAYYKMALTKFPPSVDRKRWSNVERENLGKGIRQ

Query:  QFQEMVLQISVDQISGLQGFLADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEARWLNFEDPLINRNAWTTSEDKNLLFTIQQ
        QFQEMVLQISVDQIS +QGF A+SDDLDNILASIK LDITPEKIREFLPKVNWDKLASMYLRGRSGAECEARWLNFEDPLINRN WTTSEDKNLLFTIQQ
Subjt:  QFQEMVLQISVDQISGLQGFLADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEARWLNFEDPLINRNAWTTSEDKNLLFTIQQ

Query:  KGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDEKLQSAVAILGVGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRL
        KGLNNWI++AVSLGTNRTPFQCLSRYQRSLNASILK EWTKDED+KL+SAVA+ G GDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRL
Subjt:  KGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDEKLQSAVAILGVGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRL

Query:  KIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCKWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQ
        KIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRC+WTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNEC RRWKKLFPN+VPLLQEARKIQ
Subjt:  KIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCKWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQ

Query:  KAALISNFVDRESERPALGPTDFRPIPNTNLLCNTDDPKPAPKRNVKTRRMPVSRKNEKSATGDADAPKKRKSNNRRNQADATAQGGIANNVPE--EIKS
        K ALISNFVDRESERPALGPTDFRP+PN++LLCNTDDP+ APKRNV+TRRMPVSR NEKSA G  DAPK+RKSNN+RN+AD TAQ    NN     E+KS
Subjt:  KAALISNFVDRESERPALGPTDFRPIPNTNLLCNTDDPKPAPKRNVKTRRMPVSRKNEKSATGDADAPKKRKSNNRRNQADATAQGGIANNVPE--EIKS

Query:  IKPQRKRNRHGAYTPKRKGVLELSSNSERCAEQNLDTRSLEVQLNGKE-SEMTNNDCAEAVDENGMEVCENNVAEKLSKRDVCCSEQEGNQNSTGSSGVS
         KPQRKR RHGAYT +RKG  ++  NSERCAEQN DTRS+EVQLN KE +E  N+DC E VDENGMEV EN  AE  S+  VC SEQE NQNSTGSSGVS
Subjt:  IKPQRKRNRHGAYTPKRKGVLELSSNSERCAEQNLDTRSLEVQLNGKE-SEMTNNDCAEAVDENGMEVCENNVAEKLSKRDVCCSEQEGNQNSTGSSGVS

Query:  VLSEMTNDTDEYNPFILPDTALLASSTAGDDIMVRNGESVADKDLDDSNSFSLPLNCLELRTTDSEGVNSYSVGEITDKDDNICKPQGRRKKNSKGSNES
        VLSEMTND DEYNP  LPDT LLAS TA DDI+   G +VADKDLDDSNSFSLP +CLELRTTDSEGV+SYSV E TDK   +CKPQGRRKKNSK SN+S
Subjt:  VLSEMTNDTDEYNPFILPDTALLASSTAGDDIMVRNGESVADKDLDDSNSFSLPLNCLELRTTDSEGVNSYSVGEITDKDDNICKPQGRRKKNSKGSNES

Query:  QGSLVVSGQQGELERSGTKSPRRHNQSKKRKHNTINTSSLGTLEAVEEVDDCTLLGFLQKRLKR-TTTHEKKVD---STLLEV---DNDPTIASLLEGKL
        Q SL VS QQ ELE SGT    R NQSKKRKH+  NTS LGT+EAVEEVDDCTL GFLQKRLKR TTTH+KKVD   ST  EV   DNDPT+A LL  KL
Subjt:  QGSLVVSGQQGELERSGTKSPRRHNQSKKRKHNTINTSSLGTLEAVEEVDDCTLLGFLQKRLKR-TTTHEKKVD---STLLEV---DNDPTIASLLEGKL

Query:  KRKKH
        KRKKH
Subjt:  KRKKH

TrEMBL top hitse value%identityAlignment
A0A6J1C075 uncharacterized protein LOC111007172 isoform X20.0e+0075.42Show/hide
Query:  MSRRSHDDGGDEKLSAGEEDDEDDAID-DMEVLRGACRLAGVNPEEYINHPSSSLAA---------GDDSDEVDDFELVRNIRKQFSIADDEQ-------
        MSRRSH++GGD++LS GEEDDEDD  D DME LR ACRL G NPEEY N P S  A          G DSD+VDD ELVRNIR +FSIA D++       
Subjt:  MSRRSHDDGGDEKLSAGEEDDEDDAID-DMEVLRGACRLAGVNPEEYINHPSSSLAA---------GDDSDEVDDFELVRNIRKQFSIADDEQ-------

Query:  PLSILPPASPDEEEDDFETLRRIQRRFGAYETDILSNKPNQSCDFDVPLEMDSNNTAIERQTYPERSSMLAFEKGSLPKAGLAFIDAIKKNRSQQKFIRS
        PLS LPP SPDEEEDDFETLR IQRRF AYE+D LSN  +QSCDF  PLEMDSN T + RQT   RSSMLA EKG+LPKA LAFIDAIKKNRSQQKFIRS
Subjt:  PLSILPPASPDEEEDDFETLRRIQRRFGAYETDILSNKPNQSCDFDVPLEMDSNNTAIERQTYPERSSMLAFEKGSLPKAGLAFIDAIKKNRSQQKFIRS

Query:  KMIHLEARIEETKKLRKHFKILKDFQGSCRRRTSCALSQTIDPRVQLISAGKPQAKDSSKKDKRLSAMYYGPAENSHVAYYKMALTKFPPSVDRKRWSNV
        KMIHLEARIEE KKLR+ FKILKDFQ SCRRRT   LSQ IDPRVQLISA KPQ KDSSKKDK+LSAMYYGPAENSHVA Y+MALTKFP SV+RK+WSNV
Subjt:  KMIHLEARIEETKKLRKHFKILKDFQGSCRRRTSCALSQTIDPRVQLISAGKPQAKDSSKKDKRLSAMYYGPAENSHVAYYKMALTKFPPSVDRKRWSNV

Query:  ERENLGKGIRQQFQEMVLQISVDQISGLQGFLADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEARWLNFEDPLINRNAWTTS
        ERENLGKGIRQQFQEMVLQIS+DQISGLQGF ADSDDLDNI ASIKDLDITPE+IREFLPKVNWDKLAS YL GRSGAECEARWLNFEDPLINRN W+TS
Subjt:  ERENLGKGIRQQFQEMVLQISVDQISGLQGFLADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEARWLNFEDPLINRNAWTTS

Query:  EDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDEKLQSAVAILGVGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKR
        EDKNLL TIQQKGLNNWI+IAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDED+KL+SAVA+LGVGDWQA+ASTLEGR G QCSNRWKKSLDPARTKR
Subjt:  EDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDEKLQSAVAILGVGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKR

Query:  GYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCKWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNE
        G FTPDED+RLKIAVLLFGPKNWNKKAEF+PGRNQVQCRERWFNCLDPSLR+C+WTEEEDLRLEIAIQEHGYSW KVAACVPSRTDNECRRRWKKLFPNE
Subjt:  GYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCKWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNE

Query:  VPLLQEARKIQKAALISNFVDRESERPALGPTDFRPIPNTNLLCNTDDPKPAPKRNVKTR-RMPVSRKNEKSATGDADAPKKRKSNNRRNQADATAQGGI
        VPLLQEAR+IQKAALISNFVDRESERPALGPTDFRPIPNTNLLCN DDP PAPKRNVK R R+P   + EK+ATG  DAPK+RKSNN  NQ         
Subjt:  VPLLQEARKIQKAALISNFVDRESERPALGPTDFRPIPNTNLLCNTDDPKPAPKRNVKTR-RMPVSRKNEKSATGDADAPKKRKSNNRRNQADATAQGGI

Query:  ANNVPEEIKSIKPQRKRNRHGAYTPKRKGVLELSSNSERCAEQNLDTRSLEVQLNGKESEMTNNDCAEAVDENGMEVCENNVAEKLSKRDVCCSEQEGNQ
        AN   E++ S KPQR++NR+GAYT KRKGVLE   N+E+CAEQNL+T SLEVQLN   S  TN++C E V ENGME CEN VAEKLSK D+   EQE  Q
Subjt:  ANNVPEEIKSIKPQRKRNRHGAYTPKRKGVLELSSNSERCAEQNLDTRSLEVQLNGKESEMTNNDCAEAVDENGMEVCENNVAEKLSKRDVCCSEQEGNQ

Query:  NSTGSSGVSVLSEMTNDTDEYNPFILPDTALLASSTAGDDIMVRNGESVADKDLDDSNSFSLPLNCLELRTTDSEGVNSYSVGEITDKDDNICKPQGRRK
        NSTGSSGVSVLSEM ND DEYNP ILPDTA LA +T  DDI  R    VAD DL DSNSFSLP  CL LRT DSEGV+S SVGE TDK D + K QGRR+
Subjt:  NSTGSSGVSVLSEMTNDTDEYNPFILPDTALLASSTAGDDIMVRNGESVADKDLDDSNSFSLPLNCLELRTTDSEGVNSYSVGEITDKDDNICKPQGRRK

Query:  KNSKGSNESQGSLVVSGQQGELERSGTKSPRRHNQSKKRKHNTINTSSLGTLEAVEEVDDCTLLGFLQKRLKRTTTHEKKVD---STLLEVD---NDPTI
        + ++              + ELERS  K    HNQ KKRKHN+  TS  GTLE VEE DDCTL GFLQK+LK+TTTH++K D   S  LEV+   ND TI
Subjt:  KNSKGSNESQGSLVVSGQQGELERSGTKSPRRHNQSKKRKHNTINTSSLGTLEAVEEVDDCTLLGFLQKRLKRTTTHEKKVD---STLLEVD---NDPTI

Query:  ASLLEGKLKRKKH
        AS L+   K+KKH
Subjt:  ASLLEGKLKRKKH

A0A6J1E2J4 uncharacterized protein LOC111430000 isoform X20.0e+0080.9Show/hide
Query:  MSRRSHDDGGDEKLSAGEEDDEDDAI-DDMEVLRGACRLAGVNPEEYINHPSSSLAAGD-----DSDEVDDFELVRNIRKQFSIADDEQPLSILPPASPD
        MSRRSH DGGD++L A EEDDEDD + DDME LR ACRLAGVN E+ IN   S  AAGD     DSD+VDD EL+RNI+ +FS A DEQPLSILPP + D
Subjt:  MSRRSHDDGGDEKLSAGEEDDEDDAI-DDMEVLRGACRLAGVNPEEYINHPSSSLAAGD-----DSDEVDDFELVRNIRKQFSIADDEQPLSILPPASPD

Query:  EEEDDFETLRRIQRRFGAYETDILSNKPNQSCDFDVPLEMDSNNTAIERQTYPERSSMLAFEKGSLPKAGLAFIDAIKKNRSQQKFIRSKMIHLEARIEE
        EEEDDFETLR IQRRF AYE+DILSNKP+QSCD D PL+MDS+NT + R T  ERSSM+AFEKGSLPKA LAFIDAIKKNRSQQKFIRSKMIHLEARIEE
Subjt:  EEEDDFETLRRIQRRFGAYETDILSNKPNQSCDFDVPLEMDSNNTAIERQTYPERSSMLAFEKGSLPKAGLAFIDAIKKNRSQQKFIRSKMIHLEARIEE

Query:  TKKLRKHFKILKDFQGSCRRRTSCALSQTIDPRVQLISAGKPQAKDSSKKDKRLSAMYYGPAENSHVAYYKMALTKFPPSVDRKRWSNVERENLGKGIRQ
         KKLRK FK+LK FQGSCRR+T+CAL+Q +DPRVQLISAGKPQAKDSSKKDKRLSAM YGPAENSHVA Y+ ALTKF P VDRKRWSN ERENLGKGIRQ
Subjt:  TKKLRKHFKILKDFQGSCRRRTSCALSQTIDPRVQLISAGKPQAKDSSKKDKRLSAMYYGPAENSHVAYYKMALTKFPPSVDRKRWSNVERENLGKGIRQ

Query:  QFQEMVLQISVDQISGLQGFLADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEARWLNFEDPLINRNAWTTSEDKNLLFTIQQ
        QFQEMVLQISVDQIS +QGF A+SDDLDNILASIK LDITPEKIREFLPKVNWDKLA MYL+GRSGAECEARWLNFEDPLINRN+WTTSEDKNLLFTIQQ
Subjt:  QFQEMVLQISVDQISGLQGFLADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEARWLNFEDPLINRNAWTTSEDKNLLFTIQQ

Query:  KGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDEKLQSAVAILGVGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRL
        KGLNNWI++AVSLGTNRTPFQCLSRYQRSLNASILK EWTKDED+KL+SAVAI G GDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRL
Subjt:  KGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDEKLQSAVAILGVGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRL

Query:  KIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCKWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQ
        KIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRC+WTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNEC RRWKKLFPN+VPLLQEARKIQ
Subjt:  KIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCKWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQ

Query:  KAALISNFVDRESERPALGPTDFRPIPNTNLLCNTDDPKPAPKRNVKTRRMPVSRKNEKSATGDADAPKKRKSNNRRNQADATAQGGIANNVPE--EIKS
        K ALISNFVDRESERPALGPTDFRP+PN++LLCNTDDP+ APKRNV+ RRMPVSR NEKSA G  DAPKK KSNN+RNQAD TAQ   ANN     E+KS
Subjt:  KAALISNFVDRESERPALGPTDFRPIPNTNLLCNTDDPKPAPKRNVKTRRMPVSRKNEKSATGDADAPKKRKSNNRRNQADATAQGGIANNVPE--EIKS

Query:  IKPQRKRNRHGAYTPKRKGVLELSSNSERCAEQNLDTRSLEVQLNGKE-SEMTNNDCAEAVDENGMEVCENNVAEKLSKRDVCCSEQEGNQNSTGSSGVS
         KPQRKR RHGAYT +RKG  ++  NSERCAEQN DTRSLEVQLN KE +E  N+DC E VDENGMEV EN  AE  S+  VC SEQE NQNSTGSSGVS
Subjt:  IKPQRKRNRHGAYTPKRKGVLELSSNSERCAEQNLDTRSLEVQLNGKE-SEMTNNDCAEAVDENGMEVCENNVAEKLSKRDVCCSEQEGNQNSTGSSGVS

Query:  VLSEMTNDTDEYNPFILPDTALLASSTAGDDIMVRNGESVADKDLDDSNSFSLPLNCLELRTTDSEGVNSYSVGEITDKDDNICKPQGRRKKNSKGSNES
        VLSEMTND DEYNP   PDT LLAS TA DDI+   G +VADKDLDDSNSFSLP +CLELRTTDSEGV+SYSV E TDK   +CKPQGRRKKNSK SN+S
Subjt:  VLSEMTNDTDEYNPFILPDTALLASSTAGDDIMVRNGESVADKDLDDSNSFSLPLNCLELRTTDSEGVNSYSVGEITDKDDNICKPQGRRKKNSKGSNES

Query:  QGSLVVSGQQGELERSGTKSPRRHNQSKKRKHNTINTSSLGTLEAVEEVDDCTLLGFLQKRLKR-TTTHEKKVD---STLLEV---DNDPTIASLLEGKL
        Q SL VS QQ ELE SG     R NQSKKRKH+  NTS LGT+EAVEEVDDCTL GFLQKRLKR TTTH+KKVD   ST  EV   DNDPT+A LL+ KL
Subjt:  QGSLVVSGQQGELERSGTKSPRRHNQSKKRKHNTINTSSLGTLEAVEEVDDCTLLGFLQKRLKR-TTTHEKKVD---STLLEV---DNDPTIASLLEGKL

Query:  KRKKH
        KRKKH
Subjt:  KRKKH

A0A6J1E6Z7 uncharacterized protein LOC111430000 isoform X10.0e+0081Show/hide
Query:  MSRRSHDDGGDEKLSAGEEDDEDDAI-DDMEVLRGACRLAGVNPEEYINHPSSSLAAGD-----DSDEVDDFELVRNIRKQFSIADDEQPLSILPPASPD
        MSRRSH DGGD++L A EEDDEDD + DDME LR ACRLAGVN E+ IN   S  AAGD     DSD+VDD EL+RNI+ +FS A DEQPLSILPP + D
Subjt:  MSRRSHDDGGDEKLSAGEEDDEDDAI-DDMEVLRGACRLAGVNPEEYINHPSSSLAAGD-----DSDEVDDFELVRNIRKQFSIADDEQPLSILPPASPD

Query:  EEEDDFETLRRIQRRFGAYETDILSNKPNQSCDFDVPLEMDSNNTAIERQTYPERSSMLAFEKGSLPKAGLAFIDAIKKNRSQQKFIRSKMIHLEARIEE
        EEEDDFETLR IQRRF AYE+DILSNKP+QSCD D PL+MDS+NT + R T  ERSSM+AFEKGSLPKA LAFIDAIKKNRSQQKFIRSKMIHLEARIEE
Subjt:  EEEDDFETLRRIQRRFGAYETDILSNKPNQSCDFDVPLEMDSNNTAIERQTYPERSSMLAFEKGSLPKAGLAFIDAIKKNRSQQKFIRSKMIHLEARIEE

Query:  TKKLRKHFKILKDFQGSCRRRTSCALSQTIDPRVQLISAGKPQAKDSSKKDKRLSAMYYGPAENSHVAYYKMALTKFPPSVDRKRWSNVERENLGKGIRQ
         KKLRK FK+LK FQGSCRR+T+CAL+Q +DPRVQLISAGKPQAKDSSKKDKRLSAM YGPAENSHVA Y+ ALTKF P VDRKRWSN ERENLGKGIRQ
Subjt:  TKKLRKHFKILKDFQGSCRRRTSCALSQTIDPRVQLISAGKPQAKDSSKKDKRLSAMYYGPAENSHVAYYKMALTKFPPSVDRKRWSNVERENLGKGIRQ

Query:  QFQEMVLQISVDQISGLQGFLADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEARWLNFEDPLINRNAWTTSEDKNLLFTIQQ
        QFQEMVLQISVDQIS +QGF A+SDDLDNILASIK LDITPEKIREFLPKVNWDKLA MYL+GRSGAECEARWLNFEDPLINRN+WTTSEDKNLLFTIQQ
Subjt:  QFQEMVLQISVDQISGLQGFLADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEARWLNFEDPLINRNAWTTSEDKNLLFTIQQ

Query:  KGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDEKLQSAVAILGVGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRL
        KGLNNWI++AVSLGTNRTPFQCLSRYQRSLNASILK EWTKDED+KL+SAVAI G GDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRL
Subjt:  KGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDEKLQSAVAILGVGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRL

Query:  KIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCKWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQ
        KIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRC+WTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPN+VPLLQEARKIQ
Subjt:  KIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCKWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQ

Query:  KAALISNFVDRESERPALGPTDFRPIPNTNLLCNTDDPKPAPKRNVKTRRMPVSRKNEKSATGDADAPKKRKSNNRRNQADATAQGGIANNVPE--EIKS
        K ALISNFVDRESERPALGPTDFRP+PN++LLCNTDDP+ APKRNV+ RRMPVSR NEKSA G  DAPKK KSNN+RNQAD TAQ   ANN     E+KS
Subjt:  KAALISNFVDRESERPALGPTDFRPIPNTNLLCNTDDPKPAPKRNVKTRRMPVSRKNEKSATGDADAPKKRKSNNRRNQADATAQGGIANNVPE--EIKS

Query:  IKPQRKRNRHGAYTPKRKGVLELSSNSERCAEQNLDTRSLEVQLNGKE-SEMTNNDCAEAVDENGMEVCENNVAEKLSKRDVCCSEQEGNQNSTGSSGVS
         KPQRKR RHGAYT +RKG  ++  NSERCAEQN DTRSLEVQLN KE +E  N+DC E VDENGMEV EN  AE  S+  VC SEQE NQNSTGSSGVS
Subjt:  IKPQRKRNRHGAYTPKRKGVLELSSNSERCAEQNLDTRSLEVQLNGKE-SEMTNNDCAEAVDENGMEVCENNVAEKLSKRDVCCSEQEGNQNSTGSSGVS

Query:  VLSEMTNDTDEYNPFILPDTALLASSTAGDDIMVRNGESVADKDLDDSNSFSLPLNCLELRTTDSEGVNSYSVGEITDKDDNICKPQGRRKKNSKGSNES
        VLSEMTND DEYNP   PDT LLAS TA DDI+   G +VADKDLDDSNSFSLP +CLELRTTDSEGV+SYSV E TDK   +CKPQGRRKKNSK SN+S
Subjt:  VLSEMTNDTDEYNPFILPDTALLASSTAGDDIMVRNGESVADKDLDDSNSFSLPLNCLELRTTDSEGVNSYSVGEITDKDDNICKPQGRRKKNSKGSNES

Query:  QGSLVVSGQQGELERSGTKSPRRHNQSKKRKHNTINTSSLGTLEAVEEVDDCTLLGFLQKRLKR-TTTHEKKVD---STLLEV---DNDPTIASLLEGKL
        Q SL VS QQ ELE SG     R NQSKKRKH+  NTS LGT+EAVEEVDDCTL GFLQKRLKR TTTH+KKVD   ST  EV   DNDPT+A LL+ KL
Subjt:  QGSLVVSGQQGELERSGTKSPRRHNQSKKRKHNTINTSSLGTLEAVEEVDDCTLLGFLQKRLKR-TTTHEKKVD---STLLEV---DNDPTIASLLEGKL

Query:  KRKKH
        KRKKH
Subjt:  KRKKH

A0A6J1JK98 uncharacterized protein LOC111485355 isoform X20.0e+0081.21Show/hide
Query:  MSRRSHDDGGDEKLSAGEEDDEDDAI-DDMEVLRGACRLAGVNPEEYINHPSSSLAAGD-----DSDEVDDFELVRNIRKQFSIADDEQPLSILPPASPD
        MSRRSH DGGD++L A EEDDEDD + DDME LR ACRLAGVN E+Y+N   S  AAGD     DSD+VDD EL+RNI+ +FSIA DEQPLSILPP + D
Subjt:  MSRRSHDDGGDEKLSAGEEDDEDDAI-DDMEVLRGACRLAGVNPEEYINHPSSSLAAGD-----DSDEVDDFELVRNIRKQFSIADDEQPLSILPPASPD

Query:  EEEDDFETLRRIQRRFGAYETDILSNKPNQSCDFDVPLEMDSNNTAIERQTYPERSSMLAFEKGSLPKAGLAFIDAIKKNRSQQKFIRSKMIHLEARIEE
        EEEDDFETLR IQRRF AYE+DILSNKP+QSCD D PL+MDS+NT +ER T  ERSSM+AFEKGSLPKA LAFIDAIKKNRSQQKF+RSKMIHLEARIEE
Subjt:  EEEDDFETLRRIQRRFGAYETDILSNKPNQSCDFDVPLEMDSNNTAIERQTYPERSSMLAFEKGSLPKAGLAFIDAIKKNRSQQKFIRSKMIHLEARIEE

Query:  TKKLRKHFKILKDFQGSCRRRTSCALSQTIDPRVQLISAGKP-QAKDSSKKDKRLSAMYYGPAENSHVAYYKMALTKFPPSVDRKRWSNVERENLGKGIR
         KKLRK FK+LK FQGSCRR+T+CALSQ +DPRVQLISAGKP QAKDSSKKDKRLSAM YGPAENSHVA Y++ALTKF P VDRKRWSN ERENLGKGIR
Subjt:  TKKLRKHFKILKDFQGSCRRRTSCALSQTIDPRVQLISAGKP-QAKDSSKKDKRLSAMYYGPAENSHVAYYKMALTKFPPSVDRKRWSNVERENLGKGIR

Query:  QQFQEMVLQISVDQISGLQGFLADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEARWLNFEDPLINRNAWTTSEDKNLLFTIQ
        QQFQEMVLQISVDQIS +QGF A+SDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEARWLNFEDPLINRN WTTSEDKNLLFTIQ
Subjt:  QQFQEMVLQISVDQISGLQGFLADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEARWLNFEDPLINRNAWTTSEDKNLLFTIQ

Query:  QKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDEKLQSAVAILGVGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSR
        QKGLNNWID+AVSLGTNRTPFQ LSRYQRSLNASILK EWTKDED+KL+SAVAI G GDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSR
Subjt:  QKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDEKLQSAVAILGVGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSR

Query:  LKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCKWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKI
        LKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRC+WTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNEC RRWKKLFPN+VPLLQEARKI
Subjt:  LKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCKWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKI

Query:  QKAALISNFVDRESERPALGPTDFRPIPNTNLLCNTDDPKPAPKRNVKTRRMPVSRKNEKSATGDADAPKKRKSNNRRNQADATAQGGIANNVPE--EIK
        QK ALISNFVDRESERPALGPTDFRP+PN++LLCNTDDP+ APKRNV+TRRMPVSR NEKSA G  DAPKKRKSNN+RN+ D TAQ   A+N     E+K
Subjt:  QKAALISNFVDRESERPALGPTDFRPIPNTNLLCNTDDPKPAPKRNVKTRRMPVSRKNEKSATGDADAPKKRKSNNRRNQADATAQGGIANNVPE--EIK

Query:  SIKPQRKRNRHGAYTPKRKGVLELSSNSERCAEQNLDTRSLEVQLNGKE-SEMTNNDCAEAVDENGMEVCENNVAEKLSKRDVCCSEQEGNQNSTGSSGV
        S KPQRKR RHGAYT +RKG  ++  NSERCAEQN DTR+LEVQLN KE +E  N+DC E VDENGMEV EN  AE  S+  VC SEQE NQNSTGSSGV
Subjt:  SIKPQRKRNRHGAYTPKRKGVLELSSNSERCAEQNLDTRSLEVQLNGKE-SEMTNNDCAEAVDENGMEVCENNVAEKLSKRDVCCSEQEGNQNSTGSSGV

Query:  SVLSEMTNDTDEYNPFILPDTALLASSTAGDDIMVRNGESVADKDLDDSNSFSLPLNCLELRTTDSEGVNSYSVGEITDKDDNICKPQGRRKKNSKGSNE
        SVLSEMTND DEYNP  LPDT LLAS TA DDI+   G +VADKDLD SNSFSLP +CLELRTTDSEGV+SYSV E TDK   +CKPQGRRKKNSK SN+
Subjt:  SVLSEMTNDTDEYNPFILPDTALLASSTAGDDIMVRNGESVADKDLDDSNSFSLPLNCLELRTTDSEGVNSYSVGEITDKDDNICKPQGRRKKNSKGSNE

Query:  SQGSLVVSGQQGELERSGTKSPRRHNQSKKRKHNTINTSSLGTLEAVEEVDDCTLLGFLQKRLKR-TTTHEKKVD---STLLEV---DNDPTIASLLEGK
        SQ SL VS QQ ELE SGT    R NQ KKRKH++ NTS LGT+EAVEEVDDCTLLGFLQKRLKR TTTH KKVD   ST  EV   DNDPT+A LL+ K
Subjt:  SQGSLVVSGQQGELERSGTKSPRRHNQSKKRKHNTINTSSLGTLEAVEEVDDCTLLGFLQKRLKR-TTTHEKKVD---STLLEV---DNDPTIASLLEGK

Query:  LKRKKH
        LKRKKH
Subjt:  LKRKKH

A0A6J1JKV7 uncharacterized protein LOC111485355 isoform X10.0e+0081.31Show/hide
Query:  MSRRSHDDGGDEKLSAGEEDDEDDAI-DDMEVLRGACRLAGVNPEEYINHPSSSLAAGD-----DSDEVDDFELVRNIRKQFSIADDEQPLSILPPASPD
        MSRRSH DGGD++L A EEDDEDD + DDME LR ACRLAGVN E+Y+N   S  AAGD     DSD+VDD EL+RNI+ +FSIA DEQPLSILPP + D
Subjt:  MSRRSHDDGGDEKLSAGEEDDEDDAI-DDMEVLRGACRLAGVNPEEYINHPSSSLAAGD-----DSDEVDDFELVRNIRKQFSIADDEQPLSILPPASPD

Query:  EEEDDFETLRRIQRRFGAYETDILSNKPNQSCDFDVPLEMDSNNTAIERQTYPERSSMLAFEKGSLPKAGLAFIDAIKKNRSQQKFIRSKMIHLEARIEE
        EEEDDFETLR IQRRF AYE+DILSNKP+QSCD D PL+MDS+NT +ER T  ERSSM+AFEKGSLPKA LAFIDAIKKNRSQQKF+RSKMIHLEARIEE
Subjt:  EEEDDFETLRRIQRRFGAYETDILSNKPNQSCDFDVPLEMDSNNTAIERQTYPERSSMLAFEKGSLPKAGLAFIDAIKKNRSQQKFIRSKMIHLEARIEE

Query:  TKKLRKHFKILKDFQGSCRRRTSCALSQTIDPRVQLISAGKP-QAKDSSKKDKRLSAMYYGPAENSHVAYYKMALTKFPPSVDRKRWSNVERENLGKGIR
         KKLRK FK+LK FQGSCRR+T+CALSQ +DPRVQLISAGKP QAKDSSKKDKRLSAM YGPAENSHVA Y++ALTKF P VDRKRWSN ERENLGKGIR
Subjt:  TKKLRKHFKILKDFQGSCRRRTSCALSQTIDPRVQLISAGKP-QAKDSSKKDKRLSAMYYGPAENSHVAYYKMALTKFPPSVDRKRWSNVERENLGKGIR

Query:  QQFQEMVLQISVDQISGLQGFLADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEARWLNFEDPLINRNAWTTSEDKNLLFTIQ
        QQFQEMVLQISVDQIS +QGF A+SDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEARWLNFEDPLINRN WTTSEDKNLLFTIQ
Subjt:  QQFQEMVLQISVDQISGLQGFLADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEARWLNFEDPLINRNAWTTSEDKNLLFTIQ

Query:  QKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDEKLQSAVAILGVGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSR
        QKGLNNWID+AVSLGTNRTPFQ LSRYQRSLNASILK EWTKDED+KL+SAVAI G GDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSR
Subjt:  QKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDEKLQSAVAILGVGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSR

Query:  LKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCKWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKI
        LKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRC+WTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPN+VPLLQEARKI
Subjt:  LKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCKWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKI

Query:  QKAALISNFVDRESERPALGPTDFRPIPNTNLLCNTDDPKPAPKRNVKTRRMPVSRKNEKSATGDADAPKKRKSNNRRNQADATAQGGIANNVPE--EIK
        QK ALISNFVDRESERPALGPTDFRP+PN++LLCNTDDP+ APKRNV+TRRMPVSR NEKSA G  DAPKKRKSNN+RN+ D TAQ   A+N     E+K
Subjt:  QKAALISNFVDRESERPALGPTDFRPIPNTNLLCNTDDPKPAPKRNVKTRRMPVSRKNEKSATGDADAPKKRKSNNRRNQADATAQGGIANNVPE--EIK

Query:  SIKPQRKRNRHGAYTPKRKGVLELSSNSERCAEQNLDTRSLEVQLNGKE-SEMTNNDCAEAVDENGMEVCENNVAEKLSKRDVCCSEQEGNQNSTGSSGV
        S KPQRKR RHGAYT +RKG  ++  NSERCAEQN DTR+LEVQLN KE +E  N+DC E VDENGMEV EN  AE  S+  VC SEQE NQNSTGSSGV
Subjt:  SIKPQRKRNRHGAYTPKRKGVLELSSNSERCAEQNLDTRSLEVQLNGKE-SEMTNNDCAEAVDENGMEVCENNVAEKLSKRDVCCSEQEGNQNSTGSSGV

Query:  SVLSEMTNDTDEYNPFILPDTALLASSTAGDDIMVRNGESVADKDLDDSNSFSLPLNCLELRTTDSEGVNSYSVGEITDKDDNICKPQGRRKKNSKGSNE
        SVLSEMTND DEYNP  LPDT LLAS TA DDI+   G +VADKDLD SNSFSLP +CLELRTTDSEGV+SYSV E TDK   +CKPQGRRKKNSK SN+
Subjt:  SVLSEMTNDTDEYNPFILPDTALLASSTAGDDIMVRNGESVADKDLDDSNSFSLPLNCLELRTTDSEGVNSYSVGEITDKDDNICKPQGRRKKNSKGSNE

Query:  SQGSLVVSGQQGELERSGTKSPRRHNQSKKRKHNTINTSSLGTLEAVEEVDDCTLLGFLQKRLKR-TTTHEKKVD---STLLEV---DNDPTIASLLEGK
        SQ SL VS QQ ELE SGT    R NQ KKRKH++ NTS LGT+EAVEEVDDCTLLGFLQKRLKR TTTH KKVD   ST  EV   DNDPT+A LL+ K
Subjt:  SQGSLVVSGQQGELERSGTKSPRRHNQSKKRKHNTINTSSLGTLEAVEEVDDCTLLGFLQKRLKR-TTTHEKKVD---STLLEV---DNDPTIASLLEGK

Query:  LKRKKH
        LKRKKH
Subjt:  LKRKKH

SwissProt top hitse value%identityAlignment
P46200 Transcriptional activator Myb5.7e-3240.34Show/hide
Query:  KREWTKDEDEKLQSAVAILGVGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCL
        K  WT++EDEKL+  V   G  DW+ +A+ L  RT  QC +RW+K L+P   K G +T +ED R+   V  +GPK W+  A+ L GR   QCRERW N L
Subjt:  KREWTKDEDEKLQSAVAILGVGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCL

Query:  DPSLRRCKWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVP---LLQEARKIQKAALISNF
        +P +++  WTEEED  +  A +  G  WA++A  +P RTDN  +  W      +V     LQE+ K  + A+ ++F
Subjt:  DPSLRRCKWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVP---LLQEARKIQKAALISNF

P91868 snRNA-activating protein complex subunit 4 homolog2.0e-3233.08Show/hide
Query:  VNWDKLASMYLRG-RSGAECEARWLNFEDPLINRNAWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDEKLQS
        V W  +A+   +G R+    +++W N  +P  N+  W+  E + L +  +     +W  +A++LGTNRT +QC+ +Y+  ++     +EW++DED KL +
Subjt:  VNWDKLASMYLRG-RSGAECEARWLNFEDPLINRNAWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDEKLQS

Query:  AVAILGVG---DWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLR-RCKWT
           I  +     W  VA  + GRT  Q   R+  +LD A  K G +T  ED  L  AV  +G K+W K A+ +  RN  QCRERW N L+ S     ++T
Subjt:  AVAILGVG---DWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLR-RCKWT

Query:  EEEDLRLEIAIQEHGY-SWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVD
          ED +L  A++  G  +WAK    +P +T  + RRR+ +L          A K++ AA   N VD
Subjt:  EEEDLRLEIAIQEHGY-SWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVD

Q54NA6 Myb-like protein L6.7e-5733.11Show/hide
Query:  PAENSHVAYYKMALTKFPPSVDRKRWSNVERENLGKGIRQQ-FQEMVLQISVDQIS---------GLQGFLADSDDLDNILASIKDLDITPEKIREFLPK
        PA+N      ++     P     +RW+  E E L KGI+++  Q+ + ++S D++S          +Q    ++++ +N + +  + +   +K     P 
Subjt:  PAENSHVAYYKMALTKFPPSVDRKRWSNVERENLGKGIRQQ-FQEMVLQISVDQIS---------GLQGFLADSDDLDNILASIKDLDITPEKIREFLPK

Query:  VNWDKLASMYLRGRSGAECEARWLNFEDPLINRNAWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDEKLQSA
        +       +    RS  E   RW N +DP IN+  +T  EDK LL   ++   + W  I++ LGTNRTP  C+ RYQRSLN+ ++KREWTK+EDE L   
Subjt:  VNWDKLASMYLRGRSGAECEARWLNFEDPLINRNAWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDEKLQSA

Query:  VAILGVG---DWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCKWTEE
        + +   G   DWQ +   + GRTG QC +RW K+LDP+  K+G ++P+ED  L  AV  +G  NW      + GR  VQCRER+ N LDP L + +WT +
Subjt:  VAILGVG---DWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCKWTEE

Query:  EDLRLEIAIQEHGY-SWAKVAACVPSRTDNECRRRWKKL--FPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPIPNTNLLCNTDDPKPAPKR
        ED RL     + G   W+ VA  + +RTDN+C RRWK+L    N +   QE    +K   +SNF  R+ ER  L   D   I          + K  PK 
Subjt:  EDLRLEIAIQEHGY-SWAKVAACVPSRTDNECRRRWKKL--FPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPIPNTNLLCNTDDPKPAPKR

Query:  NVKTRRMPVSRKNEKSATGDADAPKKRKSNNRRNQA----DATAQGGIANNVPEEI
        N KT+ +        + T         K++N  NQ     D +   GI ++  +E+
Subjt:  NVKTRRMPVSRKNEKSATGDADAPKKRKSNNRRNQA----DATAQGGIANNVPEEI

Q5SXM2 snRNA-activating protein complex subunit 41.5e-4028.02Show/hide
Query:  IKKNRSQQKFIRSKMIHLEARIEETKKLRKHFKILKDFQGSCRRRTSCALSQTIDPRVQLISAGKPQAKDSSKKDKRLSAMYYGPAENSHVAYYKMALTK
        ++ N   Q+ I+ K+   EA +   +   +  ++++D  GS  + T     +++ P   +    KP  KD             GP  N           K
Subjt:  IKKNRSQQKFIRSKMIHLEARIEETKKLRKHFKILKDFQGSCRRRTSCALSQTIDPRVQLISAGKPQAKDSSKKDKRLSAMYYGPAENSHVAYYKMALTK

Query:  FPPSVDRKRWSNVERENLGKGIRQQFQEMVLQISVDQISGL-QGFLADSDDLD---------NILASIKDLDITPEK--IREFLPKVNWDKLASMYLRG-
            +   +W N E+  L K +     + +LQ  + ++  L Q     S +L+              I+D++  PE+  +   L   +W+K++++   G 
Subjt:  FPPSVDRKRWSNVERENLGKGIRQQFQEMVLQISVDQISGL-QGFLADSDDLD---------NILASIKDLDITPEK--IREFLPKVNWDKLASMYLRG-

Query:  RSGAECEARWLNFEDPLINRNAWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDEKLQSAVAILGVGD---WQ
        RS  E    W N E P IN+  W+  E++ L       G   W  IA  LGT+R+ FQCL ++Q+  N ++ ++EWT++ED  L   V  + VG    ++
Subjt:  RSGAECEARWLNFEDPLINRNAWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDEKLQSAVAILGVGD---WQ

Query:  AVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCKWTEEEDLRLEIAIQEHG
         +   +EGR   Q   RW KSLDP   K+GY+ P+ED++L  AV  +G ++W K  E +PGR+  QCR+R+   L  SL++ +W  +E+ +L   I+++G
Subjt:  AVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCKWTEEEDLRLEIAIQEHG

Query:  YS-WAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARK
           WAK+A+ +P R+ ++C  +WK +   +  L +  R+
Subjt:  YS-WAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARK

Q8BP86 snRNA-activating protein complex subunit 47.4e-4034.36Show/hide
Query:  IKDLDITPEK--IREFLPKVNWDKLASMYLRG-RSGAECEARWLNFEDPLINRNAWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSL
        I+D++  PE+  +   L   +W+K++++   G RS  E    W + E P I++  W+T E + L       G   W  +A  LGT+R+ FQCL ++Q+  
Subjt:  IKDLDITPEK--IREFLPKVNWDKLASMYLRG-RSGAECEARWLNFEDPLINRNAWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSL

Query:  NASILKREWTKDEDEKLQSAVAILGVGD---WQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQC
        N ++ ++EWT++ED  L   V  + VG+   ++ +   +EGR   Q   RW KSLDP+  KRG++ P+ED++L  AV  +G ++W K  E +PGR+  QC
Subjt:  NASILKREWTKDEDEKLQSAVAILGVGD---WQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQC

Query:  RERWFNCLDPSLRRCKWTEEEDLRLEIAIQEHGYS-WAKVAACVPSRTDNECRRRWKKL
        R+R+   L  SL++ +W  +E+ +L   I+++G   WA++A+ +P R+ ++C  +WK L
Subjt:  RERWFNCLDPSLRRCKWTEEEDLRLEIAIQEHGYS-WAKVAACVPSRTDNECRRRWKKL

Arabidopsis top hitse value%identityAlignment
AT3G09370.1 myb domain protein 3r-35.1e-2840.82Show/hide
Query:  KREWTKDEDEKLQSAVAILGVGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCL
        K  WT +EDE L+ AV       W+ +A +   RT  QC +RW+K L+P   K G +T +ED ++   V  +GP  W+  A+ LPGR   QCRERW N L
Subjt:  KREWTKDEDEKLQSAVAILGVGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCL

Query:  DPSLRRCKWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRW
        +P + +  WT EE++ L  A + HG  WA++A  +P RTDN  +  W
Subjt:  DPSLRRCKWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRW

AT3G09370.2 myb domain protein 3r-35.1e-2840.82Show/hide
Query:  KREWTKDEDEKLQSAVAILGVGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCL
        K  WT +EDE L+ AV       W+ +A +   RT  QC +RW+K L+P   K G +T +ED ++   V  +GP  W+  A+ LPGR   QCRERW N L
Subjt:  KREWTKDEDEKLQSAVAILGVGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCL

Query:  DPSLRRCKWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRW
        +P + +  WT EE++ L  A + HG  WA++A  +P RTDN  +  W
Subjt:  DPSLRRCKWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRW

AT3G18100.1 myb domain protein 4r14.9e-16443.24Show/hide
Query:  EKLSAGEEDDEDDAIDDMEVLRG-ACRLAGVNPEEYINHPSSSLAAGDDSDEVDDFELVRNIRKQFSIADDEQPLSILPP--ASPDEEEDDFETLRRIQR
        E +  GE   + +  DD E+LR    +LA        + P   L+   DS+  DDFE++R+I+ Q S++ D      LPP   S DEE+D FETLR I+R
Subjt:  EKLSAGEEDDEDDAIDDMEVLRG-ACRLAGVNPEEYINHPSSSLAAGDDSDEVDDFELVRNIRKQFSIADDEQPLSILPP--ASPDEEEDDFETLRRIQR

Query:  RFGAY-----ETDILSN-----KPNQSCDFDVPLEMDSNNTAIER--------QTYPERSSM----LAFEKGSLPKAGLAFIDAIKKNRSQQKFIRSKMI
        RF AY     E   +++     K   + D +   E+ S +   E          T P+   +    +     S P+A  AF+DAI++NR+ QKF+R K+ 
Subjt:  RFGAY-----ETDILSN-----KPNQSCDFDVPLEMDSNNTAIER--------QTYPERSSM----LAFEKGSLPKAGLAFIDAIKKNRSQQKFIRSKMI

Query:  HLEARIEETKKLRKHFKILKDFQGSCRRRTSCALSQTIDPRVQLISAGKPQAKDSSK----------KDKRLSAMYYGPAENSHVAYYKMALTKFPPSVD
         +EA IE+ +K +K+ +I+KDFQ SC+R T  AL Q  DPRV+LIS  K    DSS+           DK++S +  GPAEN  V  Y+MAL K+P SV 
Subjt:  HLEARIEETKKLRKHFKILKDFQGSCRRRTSCALSQTIDPRVQLISAGKPQAKDSSK----------KDKRLSAMYYGPAENSHVAYYKMALTKFPPSVD

Query:  RKRWSNVERENLGKGIRQQFQEMVLQISVDQISGLQGFLADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEARWLNFEDPLIN
        R++WS  E +NL KG++Q+ Q+++L  ++++ S L+G    + D+D I  SI +L+ITPE IR+FLPK+NWD   S+ ++ RS AECEARW++ EDPLIN
Subjt:  RKRWSNVERENLGKGIRQQFQEMVLQISVDQISGLQGFLADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEARWLNFEDPLIN

Query:  RNAWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDEKLQSAVAILGVGDWQAVASTLEGRTGPQCSNRWKKSL
           WT +EDKNLL TI+Q  L +W+DIAVSLGTNRTPFQCL+RYQRSLN SILK+EWT +ED++L++AV + G  DWQ+VA+ L+GRTG QCSNRWKKSL
Subjt:  RNAWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDEKLQSAVAILGVGDWQAVASTLEGRTGPQCSNRWKKSL

Query:  DPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCKWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRW
         P  T++G ++ +ED R+K+AV LFG +NW+K ++F+PGR Q QCRERW NCLDP + R KWTEEED +L  AI EHGYSW+KVA  +  RTDN+C RRW
Subjt:  DPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCKWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRW

Query:  KKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPIPNTNLLCNTDDPKPAPKRNVKTRRMPVSRKNEKSATGDADAPKKRKSNNRRNQADA
        K+L+P++V LLQEAR++QK A + NFVDRESERPAL  +    +P+ +L    D      KR  K ++    R+ ++   G     K    +  R + + 
Subjt:  KKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPIPNTNLLCNTDDPKPAPKRNVKTRRMPVSRKNEKSATGDADAPKKRKSNNRRNQADA

Query:  TAQGGIANNVPEEIKSIKPQRKRNRHGAYTPK--RKGVLELSSNSERCAEQNLDTRSLEVQLNGKESEMTNNDCAEAVDENGME
          +    N   E + +++   +   +    PK  RK V E    +E                NG E  M   +C   + +N  E
Subjt:  TAQGGIANNVPEEIKSIKPQRKRNRHGAYTPK--RKGVLELSSNSERCAEQNLDTRSLEVQLNGKESEMTNNDCAEAVDENGME

AT3G18100.2 myb domain protein 4r13.3e-15247.22Show/hide
Query:  SLPKAGLAFIDAIKKNRSQQKFIRSKMIHLEARIEETKKLRKHFKILKDFQGSCRRRTSCALSQTIDPRVQLISAGKPQAKDSSK----------KDKRL
        S P+A  AF+DAI++NR+ QKF+R K+  +EA IE+ +K +K+ +I+KDFQ SC+R T  AL Q  DPRV+LIS  K    DSS+           DK++
Subjt:  SLPKAGLAFIDAIKKNRSQQKFIRSKMIHLEARIEETKKLRKHFKILKDFQGSCRRRTSCALSQTIDPRVQLISAGKPQAKDSSK----------KDKRL

Query:  SAMYYGPAENSHVAYYKMALTKFPPSVDRKRWSNVERENLGKGIRQQFQEMVLQISVDQISGLQGFLADSDDLDNILASIKDLDITPEKIREFLPKVNWD
        S +  GPAEN  V  Y+MAL K+P SV R++WS  E +NL KG++Q+ Q+++L  ++++ S L+G    + D+D I  SI +L+ITPE IR+FLPK+NWD
Subjt:  SAMYYGPAENSHVAYYKMALTKFPPSVDRKRWSNVERENLGKGIRQQFQEMVLQISVDQISGLQGFLADSDDLDNILASIKDLDITPEKIREFLPKVNWD

Query:  KLASMYLRGRSGAECEARWLNFEDPLINRNAWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDEKLQSAVAIL
           S+ ++ RS AECEARW++ EDPLIN   WT +EDKNLL TI+Q  L +W+DIAVSLGTNRTPFQCL+RYQRSLN SILK+EWT +ED++L++AV + 
Subjt:  KLASMYLRGRSGAECEARWLNFEDPLINRNAWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDEKLQSAVAIL

Query:  GVGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCKWTEEEDLRLEI
        G  DWQ+VA+ L+GRTG QCSNRWKKSL P  T++G ++ +ED R+K+AV LFG +NW+K ++F+PGR Q QCRERW NCLDP + R KWTEEED +L  
Subjt:  GVGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCKWTEEEDLRLEI

Query:  AIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPIPNTNLLCNTDDPKPAPKRNVKTRRMPVS
        AI EHGYSW+KVA  +  RTDN+C RRWK+L+P++V LLQEAR++QK A + NFVDRESERPAL  +    +P+ +L    D      KR  K ++    
Subjt:  AIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPIPNTNLLCNTDDPKPAPKRNVKTRRMPVS

Query:  RKNEKSATGDADAPKKRKSNNRRNQADATAQGGIANNVPEEIKSIKPQRKRNRHGAYTPK--RKGVLELSSNSERCAEQNLDTRSLEVQLNGKESEMTNN
        R+ ++   G     K    +  R + +   +    N   E + +++   +   +    PK  RK V E    +E                NG E  M   
Subjt:  RKNEKSATGDADAPKKRKSNNRRNQADATAQGGIANNVPEEIKSIKPQRKRNRHGAYTPK--RKGVLELSSNSERCAEQNLDTRSLEVQLNGKESEMTNN

Query:  DCAEAVDENGME
        +C   + +N  E
Subjt:  DCAEAVDENGME

AT3G18100.3 myb domain protein 4r11.5e-14139.38Show/hide
Query:  MSRRSHDDGGDEKLSAGEEDDEDDAIDDMEVLRGACRLAGVNPEEYINHPSSSLAAG-------DDSDEVDDFELVRNIRKQFSIADD----EQPLSILP
        M+R S  +  D+     ++D+EDD  +D+E LR AC ++ VN +++ +   S    G        DS+  DDFE++R I+ Q + + D      P   L 
Subjt:  MSRRSHDDGGDEKLSAGEEDDEDDAIDDMEVLRGACRLAGVNPEEYINHPSSSLAAG-------DDSDEVDDFELVRNIRKQFSIADD----EQPLSILP

Query:  PASPDEEEDDFETLRRIQRRFGAYETDILSNKPNQSCDFDVP---LEMDSNNTAIE-----RQTYPERSSMLAFEKGSLPKAGLAFIDAIKKNRSQQKFI
          S  E EDDFE +R I+ +             + S D  +P   L  D  + A E     R+ +    +   F   S  K      + +   ++Q+ + 
Subjt:  PASPDEEEDDFETLRRIQRRFGAYETDILSNKPNQSCDFDVP---LEMDSNNTAIE-----RQTYPERSSMLAFEKGSLPKAGLAFIDAIKKNRSQQKFI

Query:  RSKMIHLEARIEETKKLRKHFKI-------LKDF---------QGSCRRRTSCALSQTIDPRVQLISAGKPQAKDSSK----------KDKRLSAMYYGP
          KM+  + R++ + KL +   +       +++F         Q S R R +  +    DPRV+LIS  K    DSS+           DK++S +  GP
Subjt:  RSKMIHLEARIEETKKLRKHFKI-------LKDF---------QGSCRRRTSCALSQTIDPRVQLISAGKPQAKDSSK----------KDKRLSAMYYGP

Query:  AENSHVAYYKMALTKFPPSVDRKRWSNVERENLGKGIRQQFQEMVLQISVDQISGLQGFLADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYL
        AEN  V  Y+MAL K+P SV R++WS  E +NL KG++Q+ Q+++L  ++++ S L+G    + D+D I  SI +L+ITPE IR+FLPK+NWD   S+ +
Subjt:  AENSHVAYYKMALTKFPPSVDRKRWSNVERENLGKGIRQQFQEMVLQISVDQISGLQGFLADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYL

Query:  RGRSGAECEARWLNFEDPLINRNAWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDEKLQSAVAILGVGDWQA
        + RS AECEARW++ EDPLIN   WT +EDKNLL TI+Q  L +W+DIAVSLGTNRTPFQCL+RYQRSLN SILK+EWT +ED++L++AV + G  DWQ+
Subjt:  RGRSGAECEARWLNFEDPLINRNAWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDEKLQSAVAILGVGDWQA

Query:  VASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCKWTEEEDLRLEIAIQEHGY
        VA+ L+GRTG QCSNRWKKSL P  T++G ++ +ED R+K+AV LFG +NW+K ++F+PGR Q QCRERW NCLDP + R KWTEEED +L  AI EHGY
Subjt:  VASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCKWTEEEDLRLEIAIQEHGY

Query:  SWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPIPNTNLLCNTDDPKPAPKRNVKTRRMPVSRKNEKSA
        SW+KVA  +  RTDN+C RRWK+L+P++V LLQEAR++QK A + NFVDRESERPAL  +    +P+ +L    D      KR  K ++    R+ ++  
Subjt:  SWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPIPNTNLLCNTDDPKPAPKRNVKTRRMPVSRKNEKSA

Query:  TGDADAPKKRKSNNRRNQADATAQGGIANNVPEEIKSIKPQRKRNRHGAYTPK--RKGVLELSSNSERCAEQNLDTRSLEVQLNGKESEMTNNDCAEAVD
         G     K    +  R + +   +    N   E + +++   +   +    PK  RK V E    +E                NG E  M   +C   + 
Subjt:  TGDADAPKKRKSNNRRNQADATAQGGIANNVPEEIKSIKPQRKRNRHGAYTPK--RKGVLELSSNSERCAEQNLDTRSLEVQLNGKESEMTNNDCAEAVD

Query:  ENGME
        +N  E
Subjt:  ENGME


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTCGCCGCAGCCACGACGACGGCGGTGACGAGAAGCTTTCCGCCGGCGAGGAAGACGATGAAGACGATGCCATTGATGACATGGAAGTCCTTCGGGGAGCCTGTAG
GCTTGCTGGAGTTAATCCTGAGGAGTATATTAATCATCCGTCGTCGTCGCTTGCCGCCGGAGATGATTCCGATGAAGTAGATGATTTCGAACTTGTTCGGAATATTCGGA
AGCAGTTCTCGATTGCGGACGATGAGCAGCCGTTGAGTATTCTCCCACCGGCGTCTCCGGACGAGGAGGAAGACGATTTCGAGACGCTTCGTCGGATTCAGCGGCGCTTT
GGGGCGTATGAAACTGATATCTTGAGCAATAAACCCAATCAATCTTGTGACTTTGATGTGCCTCTGGAGATGGATTCTAACAACACAGCTATTGAGAGACAGACATACCC
AGAGAGGTCATCTATGCTAGCCTTTGAAAAGGGAAGCTTGCCAAAGGCTGGATTGGCATTTATTGATGCCATCAAGAAGAATAGGTCCCAGCAGAAGTTTATTCGCAGTA
AGATGATTCACCTTGAAGCTAGAATTGAGGAGACCAAAAAGCTCAGAAAACATTTCAAAATTCTCAAAGATTTCCAAGGTTCATGTAGAAGAAGAACAAGTTGTGCACTG
TCTCAAACGATAGATCCTCGAGTTCAGTTAATATCAGCTGGAAAACCACAGGCTAAGGATTCATCAAAGAAGGACAAACGATTGTCTGCAATGTATTATGGCCCAGCTGA
GAATTCTCATGTTGCATACTACAAAATGGCATTGACAAAGTTTCCACCTTCCGTAGATCGAAAAAGATGGTCCAATGTAGAAAGGGAGAATCTAGGGAAGGGAATACGAC
AACAATTTCAAGAGATGGTGCTTCAGATCTCGGTGGATCAAATTAGTGGGCTACAAGGATTTTTAGCAGATTCAGATGATCTGGATAACATTCTTGCATCAATAAAAGAC
CTTGACATTACTCCCGAAAAGATTAGGGAATTTTTGCCAAAAGTTAATTGGGACAAATTGGCTTCCATGTATCTTCGGGGTCGCTCAGGGGCAGAATGTGAAGCAAGGTG
GTTGAATTTTGAAGATCCCCTAATTAATCGGAATGCATGGACTACAAGTGAGGATAAGAATCTTTTGTTTACTATTCAACAGAAAGGGTTGAATAACTGGATTGACATAG
CAGTTTCATTGGGTACAAACAGAACTCCTTTTCAGTGCTTGTCTCGGTATCAAAGGAGTTTAAACGCATCCATATTAAAGAGGGAGTGGACCAAAGATGAGGATGAGAAA
CTTCAATCTGCTGTTGCTATTCTTGGTGTGGGAGATTGGCAGGCTGTAGCTTCTACTTTAGAAGGACGAACTGGTCCACAGTGCTCTAATAGGTGGAAAAAATCCCTCGA
CCCAGCTAGGACAAAAAGAGGCTACTTCACCCCAGATGAAGACAGTCGCTTGAAAATTGCTGTACTGCTTTTTGGGCCTAAAAATTGGAACAAGAAAGCCGAATTTTTAC
CTGGTCGAAATCAAGTTCAATGCAGAGAAAGATGGTTCAATTGCTTAGATCCTTCCTTGAGAAGATGTAAATGGACAGAAGAGGAGGATTTAAGGCTAGAGATAGCAATT
CAGGAACATGGATATAGCTGGGCTAAGGTAGCTGCATGTGTGCCGTCACGTACAGATAATGAGTGCCGGAGGAGATGGAAGAAGTTGTTTCCCAATGAAGTTCCTTTGCT
CCAGGAAGCTAGAAAGATTCAGAAGGCTGCTCTTATTAGCAACTTTGTTGATAGGGAATCAGAGCGTCCTGCTCTTGGTCCTACTGACTTTCGACCTATACCAAATACAA
ATTTATTATGTAATACTGATGATCCAAAACCTGCCCCGAAAAGAAATGTGAAAACGAGAAGGATGCCAGTGTCAAGGAAAAATGAAAAGAGTGCTACTGGTGATGCTGAT
GCTCCTAAGAAGAGGAAATCAAATAACCGGAGGAATCAAGCTGATGCAACTGCTCAGGGCGGTATTGCAAATAATGTCCCAGAGGAGATTAAATCTATAAAACCTCAAAG
AAAACGAAATAGGCATGGAGCTTATACTCCTAAGAGGAAAGGGGTTCTGGAGCTAAGTTCTAACAGTGAAAGATGTGCTGAACAGAATTTGGACACTCGGAGCCTCGAGG
TGCAGCTGAATGGTAAGGAATCCGAGATGACCAACAATGACTGTGCCGAGGCTGTTGATGAGAATGGTATGGAAGTATGTGAGAACAATGTTGCAGAGAAGCTTTCTAAA
AGAGATGTATGCTGTTCAGAACAAGAAGGAAATCAGAACTCTACAGGATCTTCTGGAGTGTCGGTGTTGTCGGAAATGACTAATGACACGGACGAGTACAATCCCTTTAT
CCTTCCAGATACAGCACTGCTGGCTAGTAGTACTGCTGGGGATGATATTATGGTGAGGAATGGGGAGAGTGTTGCAGACAAAGATCTGGATGACAGTAACAGCTTCTCGT
TACCGCTCAATTGCTTAGAACTCAGGACAACTGACAGTGAAGGTGTCAACAGTTATTCTGTTGGTGAAATTACAGACAAAGACGATAACATTTGCAAGCCCCAAGGCAGA
AGGAAGAAAAATAGTAAAGGATCAAATGAGAGTCAGGGTTCTTTGGTGGTTTCTGGTCAACAAGGGGAGCTGGAGAGGTCGGGGACGAAGTCCCCTCGTCGTCATAATCA
ATCAAAGAAGAGAAAGCATAACACTATAAATACGAGTTCATTAGGAACACTGGAGGCAGTTGAAGAGGTCGATGACTGCACTCTCCTTGGCTTTCTGCAAAAAAGATTGA
AGAGAACAACCACTCATGAAAAGAAAGTTGATAGCACTCTCCTAGAAGTTGATAATGATCCTACTATTGCCTCGCTCCTCGAGGGTAAATTGAAGAGAAAAAAGCATCTA
GCGCCTAGTGGTGGAGGTTAA
mRNA sequenceShow/hide mRNA sequence
AAGACCGCCACAATTCCCTCCGGGGAGCCCCTCTCAACGTTTCGACTGCAACCTTTGAGTCAGTGATTCGTGTCGTACGTTTCGCTCATTCACCTATGTCTCGCCGCAGC
CACGACGACGGCGGTGACGAGAAGCTTTCCGCCGGCGAGGAAGACGATGAAGACGATGCCATTGATGACATGGAAGTCCTTCGGGGAGCCTGTAGGCTTGCTGGAGTTAA
TCCTGAGGAGTATATTAATCATCCGTCGTCGTCGCTTGCCGCCGGAGATGATTCCGATGAAGTAGATGATTTCGAACTTGTTCGGAATATTCGGAAGCAGTTCTCGATTG
CGGACGATGAGCAGCCGTTGAGTATTCTCCCACCGGCGTCTCCGGACGAGGAGGAAGACGATTTCGAGACGCTTCGTCGGATTCAGCGGCGCTTTGGGGCGTATGAAACT
GATATCTTGAGCAATAAACCCAATCAATCTTGTGACTTTGATGTGCCTCTGGAGATGGATTCTAACAACACAGCTATTGAGAGACAGACATACCCAGAGAGGTCATCTAT
GCTAGCCTTTGAAAAGGGAAGCTTGCCAAAGGCTGGATTGGCATTTATTGATGCCATCAAGAAGAATAGGTCCCAGCAGAAGTTTATTCGCAGTAAGATGATTCACCTTG
AAGCTAGAATTGAGGAGACCAAAAAGCTCAGAAAACATTTCAAAATTCTCAAAGATTTCCAAGGTTCATGTAGAAGAAGAACAAGTTGTGCACTGTCTCAAACGATAGAT
CCTCGAGTTCAGTTAATATCAGCTGGAAAACCACAGGCTAAGGATTCATCAAAGAAGGACAAACGATTGTCTGCAATGTATTATGGCCCAGCTGAGAATTCTCATGTTGC
ATACTACAAAATGGCATTGACAAAGTTTCCACCTTCCGTAGATCGAAAAAGATGGTCCAATGTAGAAAGGGAGAATCTAGGGAAGGGAATACGACAACAATTTCAAGAGA
TGGTGCTTCAGATCTCGGTGGATCAAATTAGTGGGCTACAAGGATTTTTAGCAGATTCAGATGATCTGGATAACATTCTTGCATCAATAAAAGACCTTGACATTACTCCC
GAAAAGATTAGGGAATTTTTGCCAAAAGTTAATTGGGACAAATTGGCTTCCATGTATCTTCGGGGTCGCTCAGGGGCAGAATGTGAAGCAAGGTGGTTGAATTTTGAAGA
TCCCCTAATTAATCGGAATGCATGGACTACAAGTGAGGATAAGAATCTTTTGTTTACTATTCAACAGAAAGGGTTGAATAACTGGATTGACATAGCAGTTTCATTGGGTA
CAAACAGAACTCCTTTTCAGTGCTTGTCTCGGTATCAAAGGAGTTTAAACGCATCCATATTAAAGAGGGAGTGGACCAAAGATGAGGATGAGAAACTTCAATCTGCTGTT
GCTATTCTTGGTGTGGGAGATTGGCAGGCTGTAGCTTCTACTTTAGAAGGACGAACTGGTCCACAGTGCTCTAATAGGTGGAAAAAATCCCTCGACCCAGCTAGGACAAA
AAGAGGCTACTTCACCCCAGATGAAGACAGTCGCTTGAAAATTGCTGTACTGCTTTTTGGGCCTAAAAATTGGAACAAGAAAGCCGAATTTTTACCTGGTCGAAATCAAG
TTCAATGCAGAGAAAGATGGTTCAATTGCTTAGATCCTTCCTTGAGAAGATGTAAATGGACAGAAGAGGAGGATTTAAGGCTAGAGATAGCAATTCAGGAACATGGATAT
AGCTGGGCTAAGGTAGCTGCATGTGTGCCGTCACGTACAGATAATGAGTGCCGGAGGAGATGGAAGAAGTTGTTTCCCAATGAAGTTCCTTTGCTCCAGGAAGCTAGAAA
GATTCAGAAGGCTGCTCTTATTAGCAACTTTGTTGATAGGGAATCAGAGCGTCCTGCTCTTGGTCCTACTGACTTTCGACCTATACCAAATACAAATTTATTATGTAATA
CTGATGATCCAAAACCTGCCCCGAAAAGAAATGTGAAAACGAGAAGGATGCCAGTGTCAAGGAAAAATGAAAAGAGTGCTACTGGTGATGCTGATGCTCCTAAGAAGAGG
AAATCAAATAACCGGAGGAATCAAGCTGATGCAACTGCTCAGGGCGGTATTGCAAATAATGTCCCAGAGGAGATTAAATCTATAAAACCTCAAAGAAAACGAAATAGGCA
TGGAGCTTATACTCCTAAGAGGAAAGGGGTTCTGGAGCTAAGTTCTAACAGTGAAAGATGTGCTGAACAGAATTTGGACACTCGGAGCCTCGAGGTGCAGCTGAATGGTA
AGGAATCCGAGATGACCAACAATGACTGTGCCGAGGCTGTTGATGAGAATGGTATGGAAGTATGTGAGAACAATGTTGCAGAGAAGCTTTCTAAAAGAGATGTATGCTGT
TCAGAACAAGAAGGAAATCAGAACTCTACAGGATCTTCTGGAGTGTCGGTGTTGTCGGAAATGACTAATGACACGGACGAGTACAATCCCTTTATCCTTCCAGATACAGC
ACTGCTGGCTAGTAGTACTGCTGGGGATGATATTATGGTGAGGAATGGGGAGAGTGTTGCAGACAAAGATCTGGATGACAGTAACAGCTTCTCGTTACCGCTCAATTGCT
TAGAACTCAGGACAACTGACAGTGAAGGTGTCAACAGTTATTCTGTTGGTGAAATTACAGACAAAGACGATAACATTTGCAAGCCCCAAGGCAGAAGGAAGAAAAATAGT
AAAGGATCAAATGAGAGTCAGGGTTCTTTGGTGGTTTCTGGTCAACAAGGGGAGCTGGAGAGGTCGGGGACGAAGTCCCCTCGTCGTCATAATCAATCAAAGAAGAGAAA
GCATAACACTATAAATACGAGTTCATTAGGAACACTGGAGGCAGTTGAAGAGGTCGATGACTGCACTCTCCTTGGCTTTCTGCAAAAAAGATTGAAGAGAACAACCACTC
ATGAAAAGAAAGTTGATAGCACTCTCCTAGAAGTTGATAATGATCCTACTATTGCCTCGCTCCTCGAGGGTAAATTGAAGAGAAAAAAGCATCTAGCGCCTAGTGGTGGA
GGTTAAAAAAAAGCACAATTTTGTTGCATTTTGAGATATATATTTGAAGCACTAATTTCAATTCACGATATGCTGATGGAGCTCGGGGAAAAGGTTGATTGAAGGCTGTA
CAAATCTTGGCGATTGTGAGAGAGTATCAAATAAAGCTTCCAAGGTTACGCAAGGGCTAAAATGCACAACTATAGGTTTGATTTGATAGTTGGCGAATTTTTGTTTTGTG
ATCATATTCACAGCGAGCAAATTTTGATGAGTTGCTTGGTAGTGATGATCTTTCTCTTTGTATATTTCTCACCACCTTGATTTAAATGAAATCATAGGATGTGTATAGGT
TTTTATTATTTTCTTTATGTATATTTTTTCCCCTTTTTCATAATTACCCTTACCCATTTCCATTGTTAGAAATTCTCAGTATTAGTACATATTATTAAAGCA
Protein sequenceShow/hide protein sequence
MSRRSHDDGGDEKLSAGEEDDEDDAIDDMEVLRGACRLAGVNPEEYINHPSSSLAAGDDSDEVDDFELVRNIRKQFSIADDEQPLSILPPASPDEEEDDFETLRRIQRRF
GAYETDILSNKPNQSCDFDVPLEMDSNNTAIERQTYPERSSMLAFEKGSLPKAGLAFIDAIKKNRSQQKFIRSKMIHLEARIEETKKLRKHFKILKDFQGSCRRRTSCAL
SQTIDPRVQLISAGKPQAKDSSKKDKRLSAMYYGPAENSHVAYYKMALTKFPPSVDRKRWSNVERENLGKGIRQQFQEMVLQISVDQISGLQGFLADSDDLDNILASIKD
LDITPEKIREFLPKVNWDKLASMYLRGRSGAECEARWLNFEDPLINRNAWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDEK
LQSAVAILGVGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCKWTEEEDLRLEIAI
QEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPIPNTNLLCNTDDPKPAPKRNVKTRRMPVSRKNEKSATGDAD
APKKRKSNNRRNQADATAQGGIANNVPEEIKSIKPQRKRNRHGAYTPKRKGVLELSSNSERCAEQNLDTRSLEVQLNGKESEMTNNDCAEAVDENGMEVCENNVAEKLSK
RDVCCSEQEGNQNSTGSSGVSVLSEMTNDTDEYNPFILPDTALLASSTAGDDIMVRNGESVADKDLDDSNSFSLPLNCLELRTTDSEGVNSYSVGEITDKDDNICKPQGR
RKKNSKGSNESQGSLVVSGQQGELERSGTKSPRRHNQSKKRKHNTINTSSLGTLEAVEEVDDCTLLGFLQKRLKRTTTHEKKVDSTLLEVDNDPTIASLLEGKLKRKKHL
APSGGG