; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0009023 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0009023
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionsnRNA-activating protein complex subunit 4
Genome locationchr06:23260696..23276147
RNA-Seq ExpressionPI0009023
SyntenyPI0009023
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000978 - RNA polymerase II proximal promoter sequence-specific DNA binding (molecular function)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
InterPro domainsIPR001005 - SANT/Myb domain
IPR009057 - Homeobox-like domain superfamily
IPR017930 - Myb domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_011650584.1 uncharacterized protein LOC101216287 [Cucumis sativus]0.0e+0090.16Show/hide
Query:  MSLRNHGDEIDVEHLADKEDGVVDEDMEVLQRAYRLVGV------NPRLSSPTVGDADPGSDSDDVDDFELLRNIQNRFSIVADEQVSILPQVSAEEEED
        MSLRNH DEIDVEH ADKEDGVVDEDMEVLQRAYRL GV      NPRLSSP  GDADPGSDSDDVDDFELLR+IQNRFSI+ADEQ    P VSA+EEED
Subjt:  MSLRNHGDEIDVEHLADKEDGVVDEDMEVLQRAYRLVGV------NPRLSSPTVGDADPGSDSDDVDDFELLRNIQNRFSIVADEQVSILPQVSAEEEED

Query:  EFEILRSIQRRFAAYESDTLSNEPNQSRDYVGSLKRDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKKL
        EFE+LRSIQRRFAAYESDTLSN+PNQSRDYVGSLK DSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKKL
Subjt:  EFEILRSIQRRFAAYESDTLSNEPNQSRDYVGSLKRDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKKL

Query:  RKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQAKDSSKKDKRVSGMYYGPAENSHVACYRMAMAKFPRVDRKKWSIVERENLGKGIRQQFQEM
        RKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQAKDSSKKDKR+SGMYYGP ENSHVACYRM +AKFP VDRKKWSIVERENLGKGIRQQFQEM
Subjt:  RKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQAKDSSKKDKRVSGMYYGPAENSHVACYRMAMAKFPRVDRKKWSIVERENLGKGIRQQFQEM

Query:  VLQISVDQISGPQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDNLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKNLLFTIQQKGLNN
        VLQISVDQISGPQG+SGDSDDLDNILASIKDLDIAP+KIREFLPKVNWD LASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDK+LLFTIQQKGLNN
Subjt:  VLQISVDQISGPQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDNLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKNLLFTIQQKGLNN

Query:  WIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVRDWQAVASALEGRAGTQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVL
        WIE+AVSLGTNRTPFQCLSRYQRSLNASILKREWTK+EDD+LRSAVA FGVRDWQAVAS LEGRAGTQCSNRWKKSLDPART++GYFTPDED RLKIAVL
Subjt:  WIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVRDWQAVASALEGRAGTQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVL

Query:  LFGPKIGTRKQNFCLVGIKFNAEKGVWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAAL
        LFGPK   +K  F L G      +  WFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFP+EVPLLQEARKIQKAAL
Subjt:  LFGPKIGTRKQNFCLVGIKFNAEKGVWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAAL

Query:  ISNFVDRETERPALGPADFRPRPNTDLLCNTDGPRPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQEGIANNTSVVPEEVQSSKPQRK
        ISNFVDRETERPALGPADFRPRPNTD LCNTDGP PAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQ GIA NTS VPEEVQSSKPQRK
Subjt:  ISNFVDRETERPALGPADFRPRPNTDLLCNTDGPRPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQEGIANNTSVVPEEVQSSKPQRK

Query:  RNRRGAYTAKRVGVPELRSDSEWCAKQNLDTHSLGLQLNSKESERTNSDCTETVDENIMEVFENKVAEKLTEKNACFSEPEKNQNSTGSSGVSVLSEMTN
        RNRRGAYTAKR+GVPELRSDSEWCAKQNLDT SLGLQLNSKESER+NS+CTETVDENIMEV ENKVAEKLTE+NACFSEPEKNQNSTGSSGVSVLSEMTN
Subjt:  RNRRGAYTAKRVGVPELRSDSEWCAKQNLDTHSLGLQLNSKESERTNSDCTETVDENIMEVFENKVAEKLTEKNACFSEPEKNQNSTGSSGVSVLSEMTN

Query:  DLVDYNPSILTDTTLLASTTVDDIEELKGKSVAERDLDDSNSVSLPHSCLELRTIDCEGVDSYSVDEFTAKSNGVCNPTQGRRKKNSKTSNNSHDYLFLS
        DLVDYNPSILTDTTL ASTTVDDIEELKGKS A+RDLDDSNS SL HSCLELRT+D EGVDSYSVDE+TAKSNGVCNPTQGRRKKNSKTSNNSHD L + 
Subjt:  DLVDYNPSILTDTTLLASTTVDDIEELKGKSVAERDLDDSNSVSLPHSCLELRTIDCEGVDSYSVDEFTAKSNGVCNPTQGRRKKNSKTSNNSHDYLFLS

Query:  RQQIEQETSGTNKPHCHNQSKKREHNNTGTSTLGTLEAVEEVDDCTLVGFLQKRLKRT-MTHNETVDCSSSTPLEV---DNEPTIASFLYKLKRKKHQRP
        RQQI QET GT KP  HNQSKKR+H+NTG STL T EAVEEVDDCTLVGFLQKRLKRT MTHNETVDCSS+ PL+V   DNEPTIASFL KLKRKKHQRP
Subjt:  RQQIEQETSGTNKPHCHNQSKKREHNNTGTSTLGTLEAVEEVDDCTLVGFLQKRLKRT-MTHNETVDCSSSTPLEV---DNEPTIASFLYKLKRKKHQRP

Query:  SGGELN
        SG ELN
Subjt:  SGGELN

XP_023515735.1 uncharacterized protein LOC111779809 isoform X1 [Cucurbita pepo subsp. pepo]0.0e+0077.07Show/hide
Query:  MSLRNHGDEIDVEHLA---DKEDGVVDEDMEVLQRAYRLVGV------NPRLSSPTVGDADPGSDSDDVDDFELLRNIQNRFSIVADEQ-VSILPQVSAE
        MS R+H D  D E  A   D ED +VD+DME L+RA RL GV      NPRLS P  GDA+ GSDSDDVDD ELLRNIQNRFSI ADEQ +SILP V+A+
Subjt:  MSLRNHGDEIDVEHLA---DKEDGVVDEDMEVLQRAYRLVGV------NPRLSSPTVGDADPGSDSDDVDDFELLRNIQNRFSIVADEQ-VSILPQVSAE

Query:  EEEDEFEILRSIQRRFAAYESDTLSNEPNQSRDYVGSLKRDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEE
        EEED+FE+LRSIQRRFAAYESD LSN+P+QS D  G LK DS++  VE  TSS+R SM+AFEKGSLPKAALAF+DAIKKNRSQQKFIRSKMIHLEARIEE
Subjt:  EEEDEFEILRSIQRRFAAYESDTLSNEPNQSRDYVGSLKRDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEE

Query:  NKKLRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQAKDSSKKDKRVSGMYYGPAENSHVACYRMAMAKFPRVDRKKWSIVERENLGKGIRQQ
        NKKLRKR K+LK FQGSC+R+T+CAL+QM+DPRVQLISA KPQAKDSSKKDKR+S M YGPAENSHVACYR A  KF  VDRK+WS  ERENLGKGIRQQ
Subjt:  NKKLRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQAKDSSKKDKRVSGMYYGPAENSHVACYRMAMAKFPRVDRKKWSIVERENLGKGIRQQ

Query:  FQEMVLQISVDQISGPQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDNLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKNLLFTIQQK
        FQEMVLQISVDQIS  QG S +SDDLDNILASIK LDI PEKIREFLPKVNWD LASMYL+GRSGAECEARWLNFEDPLINR+PWTTSEDKNLLFTIQQK
Subjt:  FQEMVLQISVDQISGPQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDNLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKNLLFTIQQK

Query:  GLNNWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVRDWQAVASALEGRAGTQCSNRWKKSLDPARTKRGYFTPDEDSRLK
        GLNNWIE+AVSLGTNRTPFQCLSRYQRSLNASILK EWTKDEDDKLRSAVA+FG  DWQAVAS LEGR G QCSNRWKKSLDPARTKRGYFTPDEDSRLK
Subjt:  GLNNWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVRDWQAVASALEGRAGTQCSNRWKKSLDPARTKRGYFTPDEDSRLK

Query:  IAVLLFGPKIGTRKQNFCLVGIKFNAEKGVWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQ
        IAVLLFGPK   +K  F L G      +  WFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPN+VPLLQEARKIQ
Subjt:  IAVLLFGPKIGTRKQNFCLVGIKFNAEKGVWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQ

Query:  KAALISNFVDRETERPALGPADFRPRPNTDLLCNTDGPRPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQEGIANNTSVVPEEVQSSK
        K ALISNFVDRE+ERPALGP DFRP PN+ LLCNTD P  APKRNV+TR+MPVSRNEKSA GDAPK+RKSN QR + D TAQ    NNTS VP EV+S+K
Subjt:  KAALISNFVDRETERPALGPADFRPRPNTDLLCNTDGPRPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQEGIANNTSVVPEEVQSSK

Query:  PQRKRNRRGAYTAKRVGVPELRSDSEWCAKQNLDTHSLGLQLNSKE-SERTNSDCTETVDENIMEVFENKVAEKLTEKNACFSEPEKNQNSTGSSGVSVL
        PQRKR R GAYT +R G P++  +SE CA+QN DT S+ +QLN KE +ER NSDC ETVDEN MEVFENK AE  +E   CFSE E+NQNSTGSSGVSVL
Subjt:  PQRKRNRRGAYTAKRVGVPELRSDSEWCAKQNLDTHSLGLQLNSKE-SERTNSDCTETVDENIMEVFENKVAEKLTEKNACFSEPEKNQNSTGSSGVSVL

Query:  SEMTNDLVDYNPSILTDTTLLASTTVDDIEELKGKSVAERDLDDSNSVSLPHSCLELRTIDCEGVDSYSVDEFTAKSNGVCNPTQGRRKKNSKTSNNSHD
        SEMTND+ +YNPS L DTTLLAS T DDI E KG +VA++DLDDSNS SLP SCLELRT D EGVDSYSVDEFT KS+GVC P QGRRKKNSK SN S D
Subjt:  SEMTNDLVDYNPSILTDTTLLASTTVDDIEELKGKSVAERDLDDSNSVSLPHSCLELRTIDCEGVDSYSVDEFTAKSNGVCNPTQGRRKKNSKTSNNSHD

Query:  YLFLSRQQIEQETSGTNKPHCHNQSKKREHNNTGTSTLGTLEAVEEVDDCTLVGFLQKRLKR-TMTHNETVDCSSSTPLEV---DNEPTIASFLY-KLKR
         L +S QQ E E SGTN+ H  NQSKKR+H+ T TS LGT+EAVEEVDDCTL GFLQKRLKR T TH++ VD SSSTP EV   DN+PT+A  L  KLKR
Subjt:  YLFLSRQQIEQETSGTNKPHCHNQSKKREHNNTGTSTLGTLEAVEEVDDCTLVGFLQKRLKR-TMTHNETVDCSSSTPLEV---DNEPTIASFLY-KLKR

Query:  KKH
        KKH
Subjt:  KKH

XP_023515736.1 uncharacterized protein LOC111779809 isoform X2 [Cucurbita pepo subsp. pepo]0.0e+0076.97Show/hide
Query:  MSLRNHGDEIDVEHLA---DKEDGVVDEDMEVLQRAYRLVGV------NPRLSSPTVGDADPGSDSDDVDDFELLRNIQNRFSIVADEQ-VSILPQVSAE
        MS R+H D  D E  A   D ED +VD+DME L+RA RL GV      NPRLS P  GDA+ GSDSDDVDD ELLRNIQNRFSI ADEQ +SILP V+A+
Subjt:  MSLRNHGDEIDVEHLA---DKEDGVVDEDMEVLQRAYRLVGV------NPRLSSPTVGDADPGSDSDDVDDFELLRNIQNRFSIVADEQ-VSILPQVSAE

Query:  EEEDEFEILRSIQRRFAAYESDTLSNEPNQSRDYVGSLKRDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEE
        EEED+FE+LRSIQRRFAAYESD LSN+P+QS D  G LK DS++  VE  TSS+R SM+AFEKGSLPKAALAF+DAIKKNRSQQKFIRSKMIHLEARIEE
Subjt:  EEEDEFEILRSIQRRFAAYESDTLSNEPNQSRDYVGSLKRDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEE

Query:  NKKLRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQAKDSSKKDKRVSGMYYGPAENSHVACYRMAMAKFPRVDRKKWSIVERENLGKGIRQQ
        NKKLRKR K+LK FQGSC+R+T+CAL+QM+DPRVQLISA KPQAKDSSKKDKR+S M YGPAENSHVACYR A  KF  VDRK+WS  ERENLGKGIRQQ
Subjt:  NKKLRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQAKDSSKKDKRVSGMYYGPAENSHVACYRMAMAKFPRVDRKKWSIVERENLGKGIRQQ

Query:  FQEMVLQISVDQISGPQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDNLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKNLLFTIQQK
        FQEMVLQISVDQIS  QG S +SDDLDNILASIK LDI PEKIREFLPKVNWD LASMYL+GRSGAECEARWLNFEDPLINR+PWTTSEDKNLLFTIQQK
Subjt:  FQEMVLQISVDQISGPQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDNLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKNLLFTIQQK

Query:  GLNNWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVRDWQAVASALEGRAGTQCSNRWKKSLDPARTKRGYFTPDEDSRLK
        GLNNWIE+AVSLGTNRTPFQCLSRYQRSLNASILK EWTKDEDDKLRSAVA+FG  DWQAVAS LEGR G QCSNRWKKSLDPARTKRGYFTPDEDSRLK
Subjt:  GLNNWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVRDWQAVASALEGRAGTQCSNRWKKSLDPARTKRGYFTPDEDSRLK

Query:  IAVLLFGPKIGTRKQNFCLVGIKFNAEKGVWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQ
        IAVLLFGPK   +K  F L G      +  WFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNEC RRWKKLFPN+VPLLQEARKIQ
Subjt:  IAVLLFGPKIGTRKQNFCLVGIKFNAEKGVWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQ

Query:  KAALISNFVDRETERPALGPADFRPRPNTDLLCNTDGPRPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQEGIANNTSVVPEEVQSSK
        K ALISNFVDRE+ERPALGP DFRP PN+ LLCNTD P  APKRNV+TR+MPVSRNEKSA GDAPK+RKSN QR + D TAQ    NNTS VP EV+S+K
Subjt:  KAALISNFVDRETERPALGPADFRPRPNTDLLCNTDGPRPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQEGIANNTSVVPEEVQSSK

Query:  PQRKRNRRGAYTAKRVGVPELRSDSEWCAKQNLDTHSLGLQLNSKE-SERTNSDCTETVDENIMEVFENKVAEKLTEKNACFSEPEKNQNSTGSSGVSVL
        PQRKR R GAYT +R G P++  +SE CA+QN DT S+ +QLN KE +ER NSDC ETVDEN MEVFENK AE  +E   CFSE E+NQNSTGSSGVSVL
Subjt:  PQRKRNRRGAYTAKRVGVPELRSDSEWCAKQNLDTHSLGLQLNSKE-SERTNSDCTETVDENIMEVFENKVAEKLTEKNACFSEPEKNQNSTGSSGVSVL

Query:  SEMTNDLVDYNPSILTDTTLLASTTVDDIEELKGKSVAERDLDDSNSVSLPHSCLELRTIDCEGVDSYSVDEFTAKSNGVCNPTQGRRKKNSKTSNNSHD
        SEMTND+ +YNPS L DTTLLAS T DDI E KG +VA++DLDDSNS SLP SCLELRT D EGVDSYSVDEFT KS+GVC P QGRRKKNSK SN S D
Subjt:  SEMTNDLVDYNPSILTDTTLLASTTVDDIEELKGKSVAERDLDDSNSVSLPHSCLELRTIDCEGVDSYSVDEFTAKSNGVCNPTQGRRKKNSKTSNNSHD

Query:  YLFLSRQQIEQETSGTNKPHCHNQSKKREHNNTGTSTLGTLEAVEEVDDCTLVGFLQKRLKR-TMTHNETVDCSSSTPLEV---DNEPTIASFLY-KLKR
         L +S QQ E E SGTN+ H  NQSKKR+H+ T TS LGT+EAVEEVDDCTL GFLQKRLKR T TH++ VD SSSTP EV   DN+PT+A  L  KLKR
Subjt:  YLFLSRQQIEQETSGTNKPHCHNQSKKREHNNTGTSTLGTLEAVEEVDDCTLVGFLQKRLKR-TMTHNETVDCSSSTPLEV---DNEPTIASFLY-KLKR

Query:  KKH
        KKH
Subjt:  KKH

XP_038905712.1 uncharacterized protein LOC120091681 isoform X1 [Benincasa hispida]0.0e+0083.43Show/hide
Query:  MSLRNHGDEIDVEHLADKEDGVVDEDMEVLQRAYRLVGV------NPRLSSPTVGDADPGSDSDDVDDFELLRNIQNRFSIVADEQ-VSILPQVSAEEEE
        MS  NHGDE DVE  A+KED VVDEDMEVLQRAYRLVGV      NPRLSSP VGDA+ G DSDD DDFELLRNIQNRFSIV DEQ +S LP VS +EEE
Subjt:  MSLRNHGDEIDVEHLADKEDGVVDEDMEVLQRAYRLVGV------NPRLSSPTVGDADPGSDSDDVDDFELLRNIQNRFSIVADEQ-VSILPQVSAEEEE

Query:  DEFEILRSIQRRFAAYESDTLSNEPNQSRDYVGSLKRDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKK
        DEFE+LRSIQRRFAAYESD LSN+PN+SRDYVGSLK DS + A ESQTSSKRPSM+AFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKK
Subjt:  DEFEILRSIQRRFAAYESDTLSNEPNQSRDYVGSLKRDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKK

Query:  LRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQAKDSSKKDKRVSGMYYGPAENSHVACYRMAMAKFPRVDRKKWSIVERENLGKGIRQQFQE
        LRKRCKILKDFQGSCKR+T+CALSQMIDPRVQLISAAKPQAKDSSKKDKR+SGMYYGPAENSHVACYRMA+AKFPRVDRKKWSIVERENLGKGIRQQFQE
Subjt:  LRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQAKDSSKKDKRVSGMYYGPAENSHVACYRMAMAKFPRVDRKKWSIVERENLGKGIRQQFQE

Query:  MVLQISVDQISGPQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDNLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKNLLFTIQQKGLN
        MVLQISVDQISG QG S DSDDLDNILASIKDLDI PEKIREFLPKVNWD LASMYL GRSGAECEARWLNFEDPLINRDPWTTSEDKNLLFTIQQKGLN
Subjt:  MVLQISVDQISGPQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDNLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKNLLFTIQQKGLN

Query:  NWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVRDWQAVASALEGRAGTQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAV
        NWIEIAVS GTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVRDWQAVAS LEGRAGTQCSNRWKKSLDPARTKRG+FTPDED+RLKIAV
Subjt:  NWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVRDWQAVASALEGRAGTQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAV

Query:  LLFGPKIGTRKQNFCLVGIKFNAEKGVWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAA
        LL GPK   +K  F L G      +  WFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDN+CRRRWKKLFPNEVPLLQEARKIQKAA
Subjt:  LLFGPKIGTRKQNFCLVGIKFNAEKGVWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAA

Query:  LISNFVDRETERPALGPADFRPRPNTDLLCNTDGPRPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQEGIANNTSVVPEEVQSSKPQR
        LISNFVDRE+ERPALGPADFRPR NTD+LC+TD P+PAPKRN KTRKMPVSRNEKSATGDAP+KRKSNYQR Q DATA+ GIANNTS VPEEVQS KP R
Subjt:  LISNFVDRETERPALGPADFRPRPNTDLLCNTDGPRPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQEGIANNTSVVPEEVQSSKPQR

Query:  KRNRRGAYTAKRVGVPELRSDSEWCAKQNLDTHSLGLQLNSKESERTNSDCTETVDENIMEVFENKVAEKLTEKNACFSEPEKNQNSTGSSGVSVLSEMT
        KRNR  A T KR GV EL S ++WCAKQNL+T S+G+QL+SKE E TNSD TETVD N +EVFENK+A+KL+E++  FSEPE+NQNSTGSSGVSVLSEMT
Subjt:  KRNRRGAYTAKRVGVPELRSDSEWCAKQNLDTHSLGLQLNSKESERTNSDCTETVDENIMEVFENKVAEKLTEKNACFSEPEKNQNSTGSSGVSVLSEMT

Query:  NDLVDYNPSILTDTTLLASTTVDDIEELKGKSVAERDLDDSNSVSLPHSCLELRTIDCEGVDSYSVDEFTAKSNGVCNPTQGRRKKNSKTSNNSHDYLFL
        ND+ +YNPSIL DTTLLASTTVDDIEELKGKS A+RDLDDSNS SLP SCLELRTID EGVDSYSVD+ T KS+ VC   QGRRKKNSKTS+ +H+Y FL
Subjt:  NDLVDYNPSILTDTTLLASTTVDDIEELKGKSVAERDLDDSNSVSLPHSCLELRTIDCEGVDSYSVDEFTAKSNGVCNPTQGRRKKNSKTSNNSHDYLFL

Query:  SRQQIEQETSGTNKPHCHNQSKKREHNNTGTSTLGTLEAVEEVDDCTLVGFLQKRLKRTMTHNETVDCSSSTPLEVDNEPT--IASFLYKLKRKKHQRPS
        S QQ+EQE  G N+P   NQSKKR+H++T TS LGTLEAVEEVD+CTLVGFLQKRLK+       VDCSS TPLEVDN+    IASFL KLKRKKHQ PS
Subjt:  SRQQIEQETSGTNKPHCHNQSKKREHNNTGTSTLGTLEAVEEVDDCTLVGFLQKRLKRTMTHNETVDCSSSTPLEVDNEPT--IASFLYKLKRKKHQRPS

Query:  GG
         G
Subjt:  GG

XP_038905717.1 uncharacterized protein LOC120091681 isoform X2 [Benincasa hispida]0.0e+0083.43Show/hide
Query:  MSLRNHGDEIDVEHLADKEDGVVDEDMEVLQRAYRLVGV------NPRLSSPTVGDADPGSDSDDVDDFELLRNIQNRFSIVADEQ-VSILPQVSAEEEE
        MS  NHGDE DVE  A+KED VVDEDMEVLQRAYRLVGV      NPRLSSP VGDA+ G DSDD DDFELLRNIQNRFSIV DEQ +S LP VS +EEE
Subjt:  MSLRNHGDEIDVEHLADKEDGVVDEDMEVLQRAYRLVGV------NPRLSSPTVGDADPGSDSDDVDDFELLRNIQNRFSIVADEQ-VSILPQVSAEEEE

Query:  DEFEILRSIQRRFAAYESDTLSNEPNQSRDYVGSLKRDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKK
        DEFE+LRSIQRRFAAYESD LSN+PN+SRDYVGSLK DS + A ESQTSSKRPSM+AFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKK
Subjt:  DEFEILRSIQRRFAAYESDTLSNEPNQSRDYVGSLKRDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKK

Query:  LRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQAKDSSKKDKRVSGMYYGPAENSHVACYRMAMAKFPRVDRKKWSIVERENLGKGIRQQFQE
        LRKRCKILKDFQGSCKR+T+CALSQMIDPRVQLISAAKPQAKDSSKKDKR+SGMYYGPAENSHVACYRMA+AKFPRVDRKKWSIVERENLGKGIRQQFQE
Subjt:  LRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQAKDSSKKDKRVSGMYYGPAENSHVACYRMAMAKFPRVDRKKWSIVERENLGKGIRQQFQE

Query:  MVLQISVDQISGPQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDNLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKNLLFTIQQKGLN
        MVLQISVDQISG QG S DSDDLDNILASIKDLDI PEKIREFLPKVNWD LASMYL GRSGAECEARWLNFEDPLINRDPWTTSEDKNLLFTIQQKGLN
Subjt:  MVLQISVDQISGPQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDNLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKNLLFTIQQKGLN

Query:  NWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVRDWQAVASALEGRAGTQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAV
        NWIEIAVS GTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVRDWQAVAS LEGRAGTQCSNRWKKSLDPARTKRG+FTPDED+RLKIAV
Subjt:  NWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVRDWQAVASALEGRAGTQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAV

Query:  LLFGPKIGTRKQNFCLVGIKFNAEKGVWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAA
        LL GPK   +K  F L G      +  WFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDN+CRRRWKKLFPNEVPLLQEARKIQKAA
Subjt:  LLFGPKIGTRKQNFCLVGIKFNAEKGVWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAA

Query:  LISNFVDRETERPALGPADFRPRPNTDLLCNTDGPRPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQEGIANNTSVVPEEVQSSKPQR
        LISNFVDRE+ERPALGPADFRPR NTD+LC+TD P+PAPKRN KTRKMPVSRNEKSATGDAP+KRKSNYQR Q DATA+ GIANNTS VPEEVQS KP R
Subjt:  LISNFVDRETERPALGPADFRPRPNTDLLCNTDGPRPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQEGIANNTSVVPEEVQSSKPQR

Query:  KRNRRGAYTAKRVGVPELRSDSEWCAKQNLDTHSLGLQLNSKESERTNSDCTETVDENIMEVFENKVAEKLTEKNACFSEPEKNQNSTGSSGVSVLSEMT
        KRNR  A T KR GV EL S ++WCAKQNL+T S+G+QL+SKE E TNSD TETVD N +EVFENK+A+KL+E++  FSEPE+NQNSTGSSGVSVLSEMT
Subjt:  KRNRRGAYTAKRVGVPELRSDSEWCAKQNLDTHSLGLQLNSKESERTNSDCTETVDENIMEVFENKVAEKLTEKNACFSEPEKNQNSTGSSGVSVLSEMT

Query:  NDLVDYNPSILTDTTLLASTTVDDIEELKGKSVAERDLDDSNSVSLPHSCLELRTIDCEGVDSYSVDEFTAKSNGVCNPTQGRRKKNSKTSNNSHDYLFL
        ND+ +YNPSIL DTTLLASTTVDDIEELKGKS A+RDLDDSNS SLP SCLELRTID EGVDSYSVD+ T KS+ VC   QGRRKKNSKTS+ +H+Y FL
Subjt:  NDLVDYNPSILTDTTLLASTTVDDIEELKGKSVAERDLDDSNSVSLPHSCLELRTIDCEGVDSYSVDEFTAKSNGVCNPTQGRRKKNSKTSNNSHDYLFL

Query:  SRQQIEQETSGTNKPHCHNQSKKREHNNTGTSTLGTLEAVEEVDDCTLVGFLQKRLKRTMTHNETVDCSSSTPLEVDNEPT--IASFLYKLKRKKHQRPS
        S QQ+EQE  G N+P   NQSKKR+H++T TS LGTLEAVEEVD+CTLVGFLQKRLK+       VDCSS TPLEVDN+    IASFL KLKRKKHQ PS
Subjt:  SRQQIEQETSGTNKPHCHNQSKKREHNNTGTSTLGTLEAVEEVDDCTLVGFLQKRLKRTMTHNETVDCSSSTPLEVDNEPT--IASFLYKLKRKKHQRPS

Query:  GG
         G
Subjt:  GG

TrEMBL top hitse value%identityAlignment
A0A0A0L2R2 Uncharacterized protein0.0e+0090.16Show/hide
Query:  MSLRNHGDEIDVEHLADKEDGVVDEDMEVLQRAYRLVGV------NPRLSSPTVGDADPGSDSDDVDDFELLRNIQNRFSIVADEQVSILPQVSAEEEED
        MSLRNH DEIDVEH ADKEDGVVDEDMEVLQRAYRL GV      NPRLSSP  GDADPGSDSDDVDDFELLR+IQNRFSI+ADEQ    P VSA+EEED
Subjt:  MSLRNHGDEIDVEHLADKEDGVVDEDMEVLQRAYRLVGV------NPRLSSPTVGDADPGSDSDDVDDFELLRNIQNRFSIVADEQVSILPQVSAEEEED

Query:  EFEILRSIQRRFAAYESDTLSNEPNQSRDYVGSLKRDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKKL
        EFE+LRSIQRRFAAYESDTLSN+PNQSRDYVGSLK DSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKKL
Subjt:  EFEILRSIQRRFAAYESDTLSNEPNQSRDYVGSLKRDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKKL

Query:  RKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQAKDSSKKDKRVSGMYYGPAENSHVACYRMAMAKFPRVDRKKWSIVERENLGKGIRQQFQEM
        RKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQAKDSSKKDKR+SGMYYGP ENSHVACYRM +AKFP VDRKKWSIVERENLGKGIRQQFQEM
Subjt:  RKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQAKDSSKKDKRVSGMYYGPAENSHVACYRMAMAKFPRVDRKKWSIVERENLGKGIRQQFQEM

Query:  VLQISVDQISGPQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDNLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKNLLFTIQQKGLNN
        VLQISVDQISGPQG+SGDSDDLDNILASIKDLDIAP+KIREFLPKVNWD LASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDK+LLFTIQQKGLNN
Subjt:  VLQISVDQISGPQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDNLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKNLLFTIQQKGLNN

Query:  WIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVRDWQAVASALEGRAGTQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVL
        WIE+AVSLGTNRTPFQCLSRYQRSLNASILKREWTK+EDD+LRSAVA FGVRDWQAVAS LEGRAGTQCSNRWKKSLDPART++GYFTPDED RLKIAVL
Subjt:  WIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVRDWQAVASALEGRAGTQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVL

Query:  LFGPKIGTRKQNFCLVGIKFNAEKGVWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAAL
        LFGPK   +K  F L G      +  WFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFP+EVPLLQEARKIQKAAL
Subjt:  LFGPKIGTRKQNFCLVGIKFNAEKGVWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAAL

Query:  ISNFVDRETERPALGPADFRPRPNTDLLCNTDGPRPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQEGIANNTSVVPEEVQSSKPQRK
        ISNFVDRETERPALGPADFRPRPNTD LCNTDGP PAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQ GIA NTS VPEEVQSSKPQRK
Subjt:  ISNFVDRETERPALGPADFRPRPNTDLLCNTDGPRPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQEGIANNTSVVPEEVQSSKPQRK

Query:  RNRRGAYTAKRVGVPELRSDSEWCAKQNLDTHSLGLQLNSKESERTNSDCTETVDENIMEVFENKVAEKLTEKNACFSEPEKNQNSTGSSGVSVLSEMTN
        RNRRGAYTAKR+GVPELRSDSEWCAKQNLDT SLGLQLNSKESER+NS+CTETVDENIMEV ENKVAEKLTE+NACFSEPEKNQNSTGSSGVSVLSEMTN
Subjt:  RNRRGAYTAKRVGVPELRSDSEWCAKQNLDTHSLGLQLNSKESERTNSDCTETVDENIMEVFENKVAEKLTEKNACFSEPEKNQNSTGSSGVSVLSEMTN

Query:  DLVDYNPSILTDTTLLASTTVDDIEELKGKSVAERDLDDSNSVSLPHSCLELRTIDCEGVDSYSVDEFTAKSNGVCNPTQGRRKKNSKTSNNSHDYLFLS
        DLVDYNPSILTDTTL ASTTVDDIEELKGKS A+RDLDDSNS SL HSCLELRT+D EGVDSYSVDE+TAKSNGVCNPTQGRRKKNSKTSNNSHD L + 
Subjt:  DLVDYNPSILTDTTLLASTTVDDIEELKGKSVAERDLDDSNSVSLPHSCLELRTIDCEGVDSYSVDEFTAKSNGVCNPTQGRRKKNSKTSNNSHDYLFLS

Query:  RQQIEQETSGTNKPHCHNQSKKREHNNTGTSTLGTLEAVEEVDDCTLVGFLQKRLKRT-MTHNETVDCSSSTPLEV---DNEPTIASFLYKLKRKKHQRP
        RQQI QET GT KP  HNQSKKR+H+NTG STL T EAVEEVDDCTLVGFLQKRLKRT MTHNETVDCSS+ PL+V   DNEPTIASFL KLKRKKHQRP
Subjt:  RQQIEQETSGTNKPHCHNQSKKREHNNTGTSTLGTLEAVEEVDDCTLVGFLQKRLKRT-MTHNETVDCSSSTPLEV---DNEPTIASFLYKLKRKKHQRP

Query:  SGGELN
        SG ELN
Subjt:  SGGELN

A0A6J1E2J4 uncharacterized protein LOC111430000 isoform X20.0e+0076.87Show/hide
Query:  MSLRNHGDEIDVEHLA---DKEDGVVDEDMEVLQRAYRLVGV------NPRLSSPTVGDADPGSDSDDVDDFELLRNIQNRFSIVADEQ-VSILPQVSAE
        MS R+H D  D E  A   D ED +VD+DME L+RA RL GV      NPRLS P  GDA+ GSDSDDVDD ELLRNIQNRFS  ADEQ +SILP V+A+
Subjt:  MSLRNHGDEIDVEHLA---DKEDGVVDEDMEVLQRAYRLVGV------NPRLSSPTVGDADPGSDSDDVDDFELLRNIQNRFSIVADEQ-VSILPQVSAE

Query:  EEEDEFEILRSIQRRFAAYESDTLSNEPNQSRDYVGSLKRDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEE
        EEED+FE LRSIQRRFAAYESD LSN+P+QS D  G LK DSD+  V   TSS+R SM+AFEKGSLPKAALAF+DAIKKNRSQQKFIRSKMIHLEARIEE
Subjt:  EEEDEFEILRSIQRRFAAYESDTLSNEPNQSRDYVGSLKRDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEE

Query:  NKKLRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQAKDSSKKDKRVSGMYYGPAENSHVACYRMAMAKFPRVDRKKWSIVERENLGKGIRQQ
        NKKLRKR K+LK FQGSC+R+T+CAL+QM+DPRVQLISA KPQAKDSSKKDKR+S M YGPAENSHVACYR A+ KF  VDRK+WS  ERENLGKGIRQQ
Subjt:  NKKLRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQAKDSSKKDKRVSGMYYGPAENSHVACYRMAMAKFPRVDRKKWSIVERENLGKGIRQQ

Query:  FQEMVLQISVDQISGPQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDNLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKNLLFTIQQK
        FQEMVLQISVDQIS  QG S +SDDLDNILASIK LDI PEKIREFLPKVNWD LA MYLQGRSGAECEARWLNFEDPLINR+ WTTSEDKNLLFTIQQK
Subjt:  FQEMVLQISVDQISGPQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDNLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKNLLFTIQQK

Query:  GLNNWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVRDWQAVASALEGRAGTQCSNRWKKSLDPARTKRGYFTPDEDSRLK
        GLNNWIE+AVSLGTNRTPFQCLSRYQRSLNASILK EWTKDEDDKLRSAVAIFG  DWQAVAS LEGR G QCSNRWKKSLDPARTKRGYFTPDEDSRLK
Subjt:  GLNNWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVRDWQAVASALEGRAGTQCSNRWKKSLDPARTKRGYFTPDEDSRLK

Query:  IAVLLFGPKIGTRKQNFCLVGIKFNAEKGVWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQ
        IAVLLFGPK   +K  F L G      +  WFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNEC RRWKKLFPN+VPLLQEARKIQ
Subjt:  IAVLLFGPKIGTRKQNFCLVGIKFNAEKGVWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQ

Query:  KAALISNFVDRETERPALGPADFRPRPNTDLLCNTDGPRPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQEGIANNTSVVPEEVQSSK
        K ALISNFVDRE+ERPALGP DFRP PN+ LLCNTD P  APKRNV+ R+MPVSRNEKSA GDAPKK KSN QR Q D TAQ   ANNTS VP EV+S+K
Subjt:  KAALISNFVDRETERPALGPADFRPRPNTDLLCNTDGPRPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQEGIANNTSVVPEEVQSSK

Query:  PQRKRNRRGAYTAKRVGVPELRSDSEWCAKQNLDTHSLGLQLNSKE-SERTNSDCTETVDENIMEVFENKVAEKLTEKNACFSEPEKNQNSTGSSGVSVL
        PQRKR R GAYT +R G P++  +SE CA+QN DT SL +QLN KE +ER NSDC ETVDEN MEVFENK AE  +E   CFSE E+NQNSTGSSGVSVL
Subjt:  PQRKRNRRGAYTAKRVGVPELRSDSEWCAKQNLDTHSLGLQLNSKE-SERTNSDCTETVDENIMEVFENKVAEKLTEKNACFSEPEKNQNSTGSSGVSVL

Query:  SEMTNDLVDYNPSILTDTTLLASTTVDDIEELKGKSVAERDLDDSNSVSLPHSCLELRTIDCEGVDSYSVDEFTAKSNGVCNPTQGRRKKNSKTSNNSHD
        SEMTND+ +YNPS   DTTLLAS T DDI E KG +VA++DLDDSNS SLP SCLELRT D EGVDSYSVDEFT KS+GVC P QGRRKKNSK SN S D
Subjt:  SEMTNDLVDYNPSILTDTTLLASTTVDDIEELKGKSVAERDLDDSNSVSLPHSCLELRTIDCEGVDSYSVDEFTAKSNGVCNPTQGRRKKNSKTSNNSHD

Query:  YLFLSRQQIEQETSGTNKPHCHNQSKKREHNNTGTSTLGTLEAVEEVDDCTLVGFLQKRLKR-TMTHNETVDCSSSTPLEV---DNEPTIASFLY-KLKR
         L +S QQ E E SG N+ H  NQSKKR+H+ T TS LGT+EAVEEVDDCTL GFLQKRLKR T TH++ VD SSSTP EV   DN+PT+A  L  KLKR
Subjt:  YLFLSRQQIEQETSGTNKPHCHNQSKKREHNNTGTSTLGTLEAVEEVDDCTLVGFLQKRLKR-TMTHNETVDCSSSTPLEV---DNEPTIASFLY-KLKR

Query:  KKH
        KKH
Subjt:  KKH

A0A6J1E6Z7 uncharacterized protein LOC111430000 isoform X10.0e+0076.97Show/hide
Query:  MSLRNHGDEIDVEHLA---DKEDGVVDEDMEVLQRAYRLVGV------NPRLSSPTVGDADPGSDSDDVDDFELLRNIQNRFSIVADEQ-VSILPQVSAE
        MS R+H D  D E  A   D ED +VD+DME L+RA RL GV      NPRLS P  GDA+ GSDSDDVDD ELLRNIQNRFS  ADEQ +SILP V+A+
Subjt:  MSLRNHGDEIDVEHLA---DKEDGVVDEDMEVLQRAYRLVGV------NPRLSSPTVGDADPGSDSDDVDDFELLRNIQNRFSIVADEQ-VSILPQVSAE

Query:  EEEDEFEILRSIQRRFAAYESDTLSNEPNQSRDYVGSLKRDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEE
        EEED+FE LRSIQRRFAAYESD LSN+P+QS D  G LK DSD+  V   TSS+R SM+AFEKGSLPKAALAF+DAIKKNRSQQKFIRSKMIHLEARIEE
Subjt:  EEEDEFEILRSIQRRFAAYESDTLSNEPNQSRDYVGSLKRDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEE

Query:  NKKLRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQAKDSSKKDKRVSGMYYGPAENSHVACYRMAMAKFPRVDRKKWSIVERENLGKGIRQQ
        NKKLRKR K+LK FQGSC+R+T+CAL+QM+DPRVQLISA KPQAKDSSKKDKR+S M YGPAENSHVACYR A+ KF  VDRK+WS  ERENLGKGIRQQ
Subjt:  NKKLRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQAKDSSKKDKRVSGMYYGPAENSHVACYRMAMAKFPRVDRKKWSIVERENLGKGIRQQ

Query:  FQEMVLQISVDQISGPQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDNLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKNLLFTIQQK
        FQEMVLQISVDQIS  QG S +SDDLDNILASIK LDI PEKIREFLPKVNWD LA MYLQGRSGAECEARWLNFEDPLINR+ WTTSEDKNLLFTIQQK
Subjt:  FQEMVLQISVDQISGPQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDNLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKNLLFTIQQK

Query:  GLNNWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVRDWQAVASALEGRAGTQCSNRWKKSLDPARTKRGYFTPDEDSRLK
        GLNNWIE+AVSLGTNRTPFQCLSRYQRSLNASILK EWTKDEDDKLRSAVAIFG  DWQAVAS LEGR G QCSNRWKKSLDPARTKRGYFTPDEDSRLK
Subjt:  GLNNWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVRDWQAVASALEGRAGTQCSNRWKKSLDPARTKRGYFTPDEDSRLK

Query:  IAVLLFGPKIGTRKQNFCLVGIKFNAEKGVWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQ
        IAVLLFGPK   +K  F L G      +  WFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPN+VPLLQEARKIQ
Subjt:  IAVLLFGPKIGTRKQNFCLVGIKFNAEKGVWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQ

Query:  KAALISNFVDRETERPALGPADFRPRPNTDLLCNTDGPRPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQEGIANNTSVVPEEVQSSK
        K ALISNFVDRE+ERPALGP DFRP PN+ LLCNTD P  APKRNV+ R+MPVSRNEKSA GDAPKK KSN QR Q D TAQ   ANNTS VP EV+S+K
Subjt:  KAALISNFVDRETERPALGPADFRPRPNTDLLCNTDGPRPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQEGIANNTSVVPEEVQSSK

Query:  PQRKRNRRGAYTAKRVGVPELRSDSEWCAKQNLDTHSLGLQLNSKE-SERTNSDCTETVDENIMEVFENKVAEKLTEKNACFSEPEKNQNSTGSSGVSVL
        PQRKR R GAYT +R G P++  +SE CA+QN DT SL +QLN KE +ER NSDC ETVDEN MEVFENK AE  +E   CFSE E+NQNSTGSSGVSVL
Subjt:  PQRKRNRRGAYTAKRVGVPELRSDSEWCAKQNLDTHSLGLQLNSKE-SERTNSDCTETVDENIMEVFENKVAEKLTEKNACFSEPEKNQNSTGSSGVSVL

Query:  SEMTNDLVDYNPSILTDTTLLASTTVDDIEELKGKSVAERDLDDSNSVSLPHSCLELRTIDCEGVDSYSVDEFTAKSNGVCNPTQGRRKKNSKTSNNSHD
        SEMTND+ +YNPS   DTTLLAS T DDI E KG +VA++DLDDSNS SLP SCLELRT D EGVDSYSVDEFT KS+GVC P QGRRKKNSK SN S D
Subjt:  SEMTNDLVDYNPSILTDTTLLASTTVDDIEELKGKSVAERDLDDSNSVSLPHSCLELRTIDCEGVDSYSVDEFTAKSNGVCNPTQGRRKKNSKTSNNSHD

Query:  YLFLSRQQIEQETSGTNKPHCHNQSKKREHNNTGTSTLGTLEAVEEVDDCTLVGFLQKRLKR-TMTHNETVDCSSSTPLEV---DNEPTIASFLY-KLKR
         L +S QQ E E SG N+ H  NQSKKR+H+ T TS LGT+EAVEEVDDCTL GFLQKRLKR T TH++ VD SSSTP EV   DN+PT+A  L  KLKR
Subjt:  YLFLSRQQIEQETSGTNKPHCHNQSKKREHNNTGTSTLGTLEAVEEVDDCTLVGFLQKRLKR-TMTHNETVDCSSSTPLEV---DNEPTIASFLY-KLKR

Query:  KKH
        KKH
Subjt:  KKH

A0A6J1JK98 uncharacterized protein LOC111485355 isoform X20.0e+0076.59Show/hide
Query:  MSLRNHGDEIDVEHLA---DKEDGVVDEDMEVLQRAYRLVG------VNPRLSSPTVGDADPGSDSDDVDDFELLRNIQNRFSIVADEQ-VSILPQVSAE
        MS R+H D  D E  A   D ED +VD+DME L+RA RL G      VNP+LS P  GDA+ GSDSDDVDD ELLRNIQNRFSI ADEQ +SILP V+A+
Subjt:  MSLRNHGDEIDVEHLA---DKEDGVVDEDMEVLQRAYRLVG------VNPRLSSPTVGDADPGSDSDDVDDFELLRNIQNRFSIVADEQ-VSILPQVSAE

Query:  EEEDEFEILRSIQRRFAAYESDTLSNEPNQSRDYVGSLKRDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEE
        EEED+FE LRSIQRRFAAYESD LSN+P+QS D  G LK DSD+  VE  TSS+R SM+AFEKGSLPKAALAF+DAIKKNRSQQKF+RSKMIHLEARIEE
Subjt:  EEEDEFEILRSIQRRFAAYESDTLSNEPNQSRDYVGSLKRDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEE

Query:  NKKLRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKP-QAKDSSKKDKRVSGMYYGPAENSHVACYRMAMAKFPRVDRKKWSIVERENLGKGIRQ
        NKKLRKR K+LK FQGSC+R+T+CALSQM+DPRVQLISA KP QAKDSSKKDKR+S M YGPAENSHVACYR+A+ KF  VDRK+WS  ERENLGKGIRQ
Subjt:  NKKLRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKP-QAKDSSKKDKRVSGMYYGPAENSHVACYRMAMAKFPRVDRKKWSIVERENLGKGIRQ

Query:  QFQEMVLQISVDQISGPQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDNLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKNLLFTIQQ
        QFQEMVLQISVDQIS  QG S +SDDLDNILASIKDLDI PEKIREFLPKVNWD LASMYL+GRSGAECEARWLNFEDPLINR+PWTTSEDKNLLFTIQQ
Subjt:  QFQEMVLQISVDQISGPQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDNLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKNLLFTIQQ

Query:  KGLNNWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVRDWQAVASALEGRAGTQCSNRWKKSLDPARTKRGYFTPDEDSRL
        KGLNNWI++AVSLGTNRTPFQ LSRYQRSLNASILK EWTKDEDDKLRSAVAIFG  DWQAVAS LEGR G QCSNRWKKSLDPARTKRGYFTPDEDSRL
Subjt:  KGLNNWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVRDWQAVASALEGRAGTQCSNRWKKSLDPARTKRGYFTPDEDSRL

Query:  KIAVLLFGPKIGTRKQNFCLVGIKFNAEKGVWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKI
        KIAVLLFGPK   +K  F L G      +  WFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNEC RRWKKLFPN+VPLLQEARKI
Subjt:  KIAVLLFGPKIGTRKQNFCLVGIKFNAEKGVWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKI

Query:  QKAALISNFVDRETERPALGPADFRPRPNTDLLCNTDGPRPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQEGIANNTSVVPEEVQSS
        QK ALISNFVDRE+ERPALGP DFRP PN+ LLCNTD P  APKRNV+TR+MPVSRNEKSA GDAPKKRKSN QR + D TAQ   A+NTS VP EV+S+
Subjt:  QKAALISNFVDRETERPALGPADFRPRPNTDLLCNTDGPRPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQEGIANNTSVVPEEVQSS

Query:  KPQRKRNRRGAYTAKRVGVPELRSDSEWCAKQNLDTHSLGLQLNSKE-SERTNSDCTETVDENIMEVFENKVAEKLTEKNACFSEPEKNQNSTGSSGVSV
        KPQRKR R GAYT +R G P++  +SE CA+QN DT +L +QLN KE +ER NSDC ETVDEN MEVFENK AE  +E   CFSE E+NQNSTGSSGVSV
Subjt:  KPQRKRNRRGAYTAKRVGVPELRSDSEWCAKQNLDTHSLGLQLNSKE-SERTNSDCTETVDENIMEVFENKVAEKLTEKNACFSEPEKNQNSTGSSGVSV

Query:  LSEMTNDLVDYNPSILTDTTLLASTTVDDIEELKGKSVAERDLDDSNSVSLPHSCLELRTIDCEGVDSYSVDEFTAKSNGVCNPTQGRRKKNSKTSNNSH
        LSEMTND+ +YNPS L DTTLLAS T DDI E KG +VA++DLD SNS SLP SCLELRT D EGVDSYSVDEFT KS+ VC P QGRRKKNSK SN S 
Subjt:  LSEMTNDLVDYNPSILTDTTLLASTTVDDIEELKGKSVAERDLDDSNSVSLPHSCLELRTIDCEGVDSYSVDEFTAKSNGVCNPTQGRRKKNSKTSNNSH

Query:  DYLFLSRQQIEQETSGTNKPHCHNQSKKREHNNTGTSTLGTLEAVEEVDDCTLVGFLQKRLKR-TMTHNETVDCSSSTPLEV---DNEPTIASFL-YKLK
        D L +S QQ E E SGTN+ H  NQ KKR+H++T TS LGT+EAVEEVDDCTL+GFLQKRLKR T TH + VD SSST  EV   DN+PT+A  L  KLK
Subjt:  DYLFLSRQQIEQETSGTNKPHCHNQSKKREHNNTGTSTLGTLEAVEEVDDCTLVGFLQKRLKR-TMTHNETVDCSSSTPLEV---DNEPTIASFL-YKLK

Query:  RKKH
        RKKH
Subjt:  RKKH

A0A6J1JKV7 uncharacterized protein LOC111485355 isoform X10.0e+0076.69Show/hide
Query:  MSLRNHGDEIDVEHLA---DKEDGVVDEDMEVLQRAYRLVG------VNPRLSSPTVGDADPGSDSDDVDDFELLRNIQNRFSIVADEQ-VSILPQVSAE
        MS R+H D  D E  A   D ED +VD+DME L+RA RL G      VNP+LS P  GDA+ GSDSDDVDD ELLRNIQNRFSI ADEQ +SILP V+A+
Subjt:  MSLRNHGDEIDVEHLA---DKEDGVVDEDMEVLQRAYRLVG------VNPRLSSPTVGDADPGSDSDDVDDFELLRNIQNRFSIVADEQ-VSILPQVSAE

Query:  EEEDEFEILRSIQRRFAAYESDTLSNEPNQSRDYVGSLKRDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEE
        EEED+FE LRSIQRRFAAYESD LSN+P+QS D  G LK DSD+  VE  TSS+R SM+AFEKGSLPKAALAF+DAIKKNRSQQKF+RSKMIHLEARIEE
Subjt:  EEEDEFEILRSIQRRFAAYESDTLSNEPNQSRDYVGSLKRDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEE

Query:  NKKLRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKP-QAKDSSKKDKRVSGMYYGPAENSHVACYRMAMAKFPRVDRKKWSIVERENLGKGIRQ
        NKKLRKR K+LK FQGSC+R+T+CALSQM+DPRVQLISA KP QAKDSSKKDKR+S M YGPAENSHVACYR+A+ KF  VDRK+WS  ERENLGKGIRQ
Subjt:  NKKLRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKP-QAKDSSKKDKRVSGMYYGPAENSHVACYRMAMAKFPRVDRKKWSIVERENLGKGIRQ

Query:  QFQEMVLQISVDQISGPQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDNLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKNLLFTIQQ
        QFQEMVLQISVDQIS  QG S +SDDLDNILASIKDLDI PEKIREFLPKVNWD LASMYL+GRSGAECEARWLNFEDPLINR+PWTTSEDKNLLFTIQQ
Subjt:  QFQEMVLQISVDQISGPQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDNLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKNLLFTIQQ

Query:  KGLNNWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVRDWQAVASALEGRAGTQCSNRWKKSLDPARTKRGYFTPDEDSRL
        KGLNNWI++AVSLGTNRTPFQ LSRYQRSLNASILK EWTKDEDDKLRSAVAIFG  DWQAVAS LEGR G QCSNRWKKSLDPARTKRGYFTPDEDSRL
Subjt:  KGLNNWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVRDWQAVASALEGRAGTQCSNRWKKSLDPARTKRGYFTPDEDSRL

Query:  KIAVLLFGPKIGTRKQNFCLVGIKFNAEKGVWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKI
        KIAVLLFGPK   +K  F L G      +  WFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPN+VPLLQEARKI
Subjt:  KIAVLLFGPKIGTRKQNFCLVGIKFNAEKGVWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKI

Query:  QKAALISNFVDRETERPALGPADFRPRPNTDLLCNTDGPRPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQEGIANNTSVVPEEVQSS
        QK ALISNFVDRE+ERPALGP DFRP PN+ LLCNTD P  APKRNV+TR+MPVSRNEKSA GDAPKKRKSN QR + D TAQ   A+NTS VP EV+S+
Subjt:  QKAALISNFVDRETERPALGPADFRPRPNTDLLCNTDGPRPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQEGIANNTSVVPEEVQSS

Query:  KPQRKRNRRGAYTAKRVGVPELRSDSEWCAKQNLDTHSLGLQLNSKE-SERTNSDCTETVDENIMEVFENKVAEKLTEKNACFSEPEKNQNSTGSSGVSV
        KPQRKR R GAYT +R G P++  +SE CA+QN DT +L +QLN KE +ER NSDC ETVDEN MEVFENK AE  +E   CFSE E+NQNSTGSSGVSV
Subjt:  KPQRKRNRRGAYTAKRVGVPELRSDSEWCAKQNLDTHSLGLQLNSKE-SERTNSDCTETVDENIMEVFENKVAEKLTEKNACFSEPEKNQNSTGSSGVSV

Query:  LSEMTNDLVDYNPSILTDTTLLASTTVDDIEELKGKSVAERDLDDSNSVSLPHSCLELRTIDCEGVDSYSVDEFTAKSNGVCNPTQGRRKKNSKTSNNSH
        LSEMTND+ +YNPS L DTTLLAS T DDI E KG +VA++DLD SNS SLP SCLELRT D EGVDSYSVDEFT KS+ VC P QGRRKKNSK SN S 
Subjt:  LSEMTNDLVDYNPSILTDTTLLASTTVDDIEELKGKSVAERDLDDSNSVSLPHSCLELRTIDCEGVDSYSVDEFTAKSNGVCNPTQGRRKKNSKTSNNSH

Query:  DYLFLSRQQIEQETSGTNKPHCHNQSKKREHNNTGTSTLGTLEAVEEVDDCTLVGFLQKRLKR-TMTHNETVDCSSSTPLEV---DNEPTIASFL-YKLK
        D L +S QQ E E SGTN+ H  NQ KKR+H++T TS LGT+EAVEEVDDCTL+GFLQKRLKR T TH + VD SSST  EV   DN+PT+A  L  KLK
Subjt:  DYLFLSRQQIEQETSGTNKPHCHNQSKKREHNNTGTSTLGTLEAVEEVDDCTLVGFLQKRLKR-TMTHNETVDCSSSTPLEV---DNEPTIASFL-YKLK

Query:  RKKH
        RKKH
Subjt:  RKKH

SwissProt top hitse value%identityAlignment
P10242 Transcriptional activator Myb5.3e-2233.89Show/hide
Query:  KREWTKDEDDKLRSAVAIFGVRDWQAVASALEGRAGTQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKIGTRKQNFCLVGIKFNAEKGV---W
        K  WT++ED+KL+  V   G  DW+ +A+ L  R   QC +RW+K L+P   K G +T +ED R+   V  +GPK    + +     +K    K     W
Subjt:  KREWTKDEDDKLRSAVAIFGVRDWQAVASALEGRAGTQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKIGTRKQNFCLVGIKFNAEKGV---W

Query:  FNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVP---LLQEARKIQKAALISNF
         N L+P +++  WTEEED  +  A +  G  WA++A  +P RTDN  +  W      +V     LQE+ K  + A+ ++F
Subjt:  FNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVP---LLQEARKIQKAALISNF

P46200 Transcriptional activator Myb5.3e-2233.89Show/hide
Query:  KREWTKDEDDKLRSAVAIFGVRDWQAVASALEGRAGTQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKIGTRKQNFCLVGIKFNAEKGV---W
        K  WT++ED+KL+  V   G  DW+ +A+ L  R   QC +RW+K L+P   K G +T +ED R+   V  +GPK    + +     +K    K     W
Subjt:  KREWTKDEDDKLRSAVAIFGVRDWQAVASALEGRAGTQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKIGTRKQNFCLVGIKFNAEKGV---W

Query:  FNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVP---LLQEARKIQKAALISNF
         N L+P +++  WTEEED  +  A +  G  WA++A  +P RTDN  +  W      +V     LQE+ K  + A+ ++F
Subjt:  FNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVP---LLQEARKIQKAALISNF

P91868 snRNA-activating protein complex subunit 4 homolog2.4e-2229.09Show/hide
Query:  VNWDNLASMYLQG-RSGAECEARWLNFEDPLINRDPWTTSEDKNLLFTIQQKGLNNWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRS
        V W  +A+   +G R+    +++W N  +P  N++ W+  E + L +  +     +W  +A++LGTNRT +QC+ +Y+  ++     +EW++DED KL +
Subjt:  VNWDNLASMYLQG-RSGAECEARWLNFEDPLINRDPWTTSEDKNLLFTIQQKGLNNWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRS

Query:  AVAIFGVR---DWQAVASALEGRAGTQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPK--------IGTRKQNFCLVGIKFNAEKGVWFNCLDP
           I  +     W  VA  + GR   Q   R+  +LD A  K G +T  ED  L  AV  +G K        +  R  + C         +  W N L+ 
Subjt:  AVAIFGVR---DWQAVASALEGRAGTQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPK--------IGTRKQNFCLVGIKFNAEKGVWFNCLDP

Query:  SLRRCE-WTEEEDLRLEIAIQEHGY-SWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVD
        S    E +T  ED +L  A++  G  +WAK    +P +T  + RRR+ +L          A K++ AA   N VD
Subjt:  SLRRCE-WTEEEDLRLEIAIQEHGY-SWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVD

Q54NA6 Myb-like protein L1.5e-4833.95Show/hide
Query:  PAENSHVACYRMAMAKFPRVDR-KKWSIVERENLGKGIRQQ-FQEMVLQISVDQISGP---------QGVSGDSDDLDNILASIKDLDIAPEKIREFLPK
        PA+N      R+     P   + ++W+  E E L KGI+++  Q+ + ++S D++S           Q  S ++++ +N + +  + +   +K     P 
Subjt:  PAENSHVACYRMAMAKFPRVDR-KKWSIVERENLGKGIRQQ-FQEMVLQISVDQISGP---------QGVSGDSDDLDNILASIKDLDIAPEKIREFLPK

Query:  VNWDNLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKNLLFTIQQKGLNNWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSA
        +   +   +    RS  E   RW N +DP IN+ P+T  EDK LL   ++   + W +I++ LGTNRTP  C+ RYQRSLN+ ++KREWTK+ED+ L   
Subjt:  VNWDNLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKNLLFTIQQKGLNNWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSA

Query:  VAI--FGVR-DWQAVASALEGRAGTQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKIGTRKQNFCLVGIKFNAEKGV-----WFNCLDPSLRR
        + +   G R DWQ +   + GR G QC +RW K+LDP+  K+G ++P+ED  L  AV  +G      K N+ L+         V     + N LDP L +
Subjt:  VAI--FGVR-DWQAVASALEGRAGTQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKIGTRKQNFCLVGIKFNAEKGV-----WFNCLDPSLRR

Query:  CEWTEEEDLRLEIAIQEHGY-SWAKVAACVPSRTDNECRRRWKKL--FPNEVPLLQEARKIQKAALISNFVDRETERPAL
          WT +ED RL     + G   W+ VA  + +RTDN+C RRWK+L    N +   QE    +K   +SNF  R+ ER  L
Subjt:  CEWTEEEDLRLEIAIQEHGY-SWAKVAACVPSRTDNECRRRWKKL--FPNEVPLLQEARKIQKAALISNFVDRETERPAL

Q8BP86 snRNA-activating protein complex subunit 47.2e-2725.47Show/hide
Query:  IKKNRSQQKFIRSKMIHLEARIEENKKLRKRCKILKDFQGS-CKRRTSCALSQMIDPRVQLISAAKPQAKDSSKKDKRVSGMYYGPAENSHV-ACYRMAM
        ++ N   Q+ IR K+  +   + +N++ ++  +IL D  G+ C +       + +   + +    KP  KD      +V+G+  GP  N          +
Subjt:  IKKNRSQQKFIRSKMIHLEARIEENKKLRKRCKILKDFQGS-CKRRTSCALSQMIDPRVQLISAAKPQAKDSSKKDKRVSGMYYGPAENSHV-ACYRMAM

Query:  AKFPRVDRKKWSIVERENLGKGIRQQFQEMVLQ---ISVDQISGPQG-VSGDSD------DLDNILASIKDLDIAPEK--IREFLPKVNWDNLASMYLQG
          F ++   KW   E+  L K +     + +LQ   + ++ +   Q  VS + +       +      I+D++  PE+  +   L   +W+ ++++  +G
Subjt:  AKFPRVDRKKWSIVERENLGKGIRQQFQEMVLQ---ISVDQISGPQG-VSGDSD------DLDNILASIKDLDIAPEK--IREFLPKVNWDNLASMYLQG

Query:  -RSGAECEARWLNFEDPLINRDPWTTSEDKNLLFTIQQKGLNNWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVRD---W
         RS  E    W + E P I++  W+T E + L       G   W  +A  LGT+R+ FQCL ++Q+  N ++ ++EWT++ED  L   V    V +   +
Subjt:  -RSGAECEARWLNFEDPLINRDPWTTSEDKNLLFTIQQKGLNNWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVRD---W

Query:  QAVASALEGRAGTQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKIGTRKQNFCLVGIKFNAEKGVWFNCLDPSLRRCEWTEEEDLRLEIAIQE
        + +   +EGR   Q   RW KSLDP+  KRG++ P+ED++L  AV  +G +   + +   + G      +  +   L  SL++  W  +E+ +L   I++
Subjt:  QAVASALEGRAGTQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKIGTRKQNFCLVGIKFNAEKGVWFNCLDPSLRRCEWTEEEDLRLEIAIQE

Query:  HGYS-WAKVAACVPSRTDNECRRRWKKL
        +G   WA++A+ +P R+ ++C  +WK L
Subjt:  HGYS-WAKVAACVPSRTDNECRRRWKKL

Arabidopsis top hitse value%identityAlignment
AT3G09370.1 myb domain protein 3r-32.5e-1935.57Show/hide
Query:  KREWTKDEDDKLRSAVAIFGVRDWQAVASALEGRAGTQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGP-KIGTRKQNFCLVGIKFNAEKGVWFN
        K  WT +ED+ LR AV  F  + W+ +A +   R   QC +RW+K L+P   K G +T +ED ++   V  +GP K     Q+  L G      +  W N
Subjt:  KREWTKDEDDKLRSAVAIFGVRDWQAVASALEGRAGTQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGP-KIGTRKQNFCLVGIKFNAEKGVWFN

Query:  CLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRW
         L+P + +  WT EE++ L  A + HG  WA++A  +P RTDN  +  W
Subjt:  CLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRW

AT3G09370.2 myb domain protein 3r-32.5e-1935.57Show/hide
Query:  KREWTKDEDDKLRSAVAIFGVRDWQAVASALEGRAGTQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGP-KIGTRKQNFCLVGIKFNAEKGVWFN
        K  WT +ED+ LR AV  F  + W+ +A +   R   QC +RW+K L+P   K G +T +ED ++   V  +GP K     Q+  L G      +  W N
Subjt:  KREWTKDEDDKLRSAVAIFGVRDWQAVASALEGRAGTQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGP-KIGTRKQNFCLVGIKFNAEKGVWFN

Query:  CLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRW
         L+P + +  WT EE++ L  A + HG  WA++A  +P RTDN  +  W
Subjt:  CLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRW

AT3G18100.1 myb domain protein 4r17.4e-15245.77Show/hide
Query:  SDSDDVDDFELLRNIQNRFSIVADEQVSILP-QVSAEEEEDEFEILRSIQRRFAAY-----ESDTLSNEPNQSRDYVGSLKRDSDDIAVESQTSSKRP--
        SDS+  DDFE++R+I+++ S+  D  VS+ P  +S +EE+D FE LR+I+RRF+AY     E   +++   + +    S    S +I   S T    P  
Subjt:  SDSDDVDDFELLRNIQNRFSIVADEQVSILP-QVSAEEEEDEFEILRSIQRRFAAY-----ESDTLSNEPNQSRDYVGSLKRDSDDIAVESQTSSKRP--

Query:  ---------------SMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAK
                         +     S P+AA AFVDAI++NR+ QKF+R K+  +EA IE+N+K +K  +I+KDFQ SCKR T  AL Q  DPRV+LIS  K
Subjt:  ---------------SMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAK

Query:  PQAKDSSK----------KDKRVSGMYYGPAENSHVACYRMAMAKFP-RVDRKKWSIVERENLGKGIRQQFQEMVLQISVDQISGPQGVSGDSDDLDNIL
            DSS+           DK++S +  GPAEN  V  YRMA+ K+P  V R+KWS  E +NL KG++Q+ Q+++L  ++++ S    + G + D+D I 
Subjt:  PQAKDSSK----------KDKRVSGMYYGPAENSHVACYRMAMAKFP-RVDRKKWSIVERENLGKGIRQQFQEMVLQISVDQISGPQGVSGDSDDLDNIL

Query:  ASIKDLDIAPEKIREFLPKVNWDNLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKNLLFTIQQKGLNNWIEIAVSLGTNRTPFQCLSRYQRSLN
         SI +L+I PE IR+FLPK+NWD   S+ ++ RS AECEARW++ EDPLIN  PWT +EDKNLL TI+Q  L +W++IAVSLGTNRTPFQCL+RYQRSLN
Subjt:  ASIKDLDIAPEKIREFLPKVNWDNLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKNLLFTIQQKGLNNWIEIAVSLGTNRTPFQCLSRYQRSLN

Query:  ASILKREWTKDEDDKLRSAVAIFGVRDWQAVASALEGRAGTQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKIGTRKQNFCLVGIKFNAEKGV
         SILK+EWT +EDD+LR+AV +FG +DWQ+VA+ L+GR GTQCSNRWKKSL P  T++G ++ +ED R+K+AV LFG +   +   F + G      +  
Subjt:  ASILKREWTKDEDDKLRSAVAIFGVRDWQAVASALEGRAGTQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKIGTRKQNFCLVGIKFNAEKGV

Query:  WFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRETERPALGPADFRPRPNTD
        W NCLDP + R +WTEEED +L  AI EHGYSW+KVA  +  RTDN+C RRWK+L+P++V LLQEAR++QK A + NFVDRE+ERPAL  +     P+  
Subjt:  WFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRETERPALGPADFRPRPNTD

Query:  LLCNTDGPRPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQEGIANNTSVVPEEVQSSKPQR
        L    D      KR  K +K    R         PK+R+   +    D   QE    N +V   E  +   +R
Subjt:  LLCNTDGPRPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQEGIANNTSVVPEEVQSSKPQR

AT3G18100.2 myb domain protein 4r14.1e-14249.64Show/hide
Query:  SLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQAKDSSK----------KDKRV
        S P+AA AFVDAI++NR+ QKF+R K+  +EA IE+N+K +K  +I+KDFQ SCKR T  AL Q  DPRV+LIS  K    DSS+           DK++
Subjt:  SLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQAKDSSK----------KDKRV

Query:  SGMYYGPAENSHVACYRMAMAKFP-RVDRKKWSIVERENLGKGIRQQFQEMVLQISVDQISGPQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWD
        S +  GPAEN  V  YRMA+ K+P  V R+KWS  E +NL KG++Q+ Q+++L  ++++ S    + G + D+D I  SI +L+I PE IR+FLPK+NWD
Subjt:  SGMYYGPAENSHVACYRMAMAKFP-RVDRKKWSIVERENLGKGIRQQFQEMVLQISVDQISGPQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWD

Query:  NLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKNLLFTIQQKGLNNWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIF
           S+ ++ RS AECEARW++ EDPLIN  PWT +EDKNLL TI+Q  L +W++IAVSLGTNRTPFQCL+RYQRSLN SILK+EWT +EDD+LR+AV +F
Subjt:  NLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKNLLFTIQQKGLNNWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIF

Query:  GVRDWQAVASALEGRAGTQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKIGTRKQNFCLVGIKFNAEKGVWFNCLDPSLRRCEWTEEEDLRLE
        G +DWQ+VA+ L+GR GTQCSNRWKKSL P  T++G ++ +ED R+K+AV LFG +   +   F + G      +  W NCLDP + R +WTEEED +L 
Subjt:  GVRDWQAVASALEGRAGTQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKIGTRKQNFCLVGIKFNAEKGVWFNCLDPSLRRCEWTEEEDLRLE

Query:  IAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRETERPALGPADFRPRPNTDLLCNTDGPRPAPKRNVKTRKMPV
         AI EHGYSW+KVA  +  RTDN+C RRWK+L+P++V LLQEAR++QK A + NFVDRE+ERPAL  +     P+  L    D      KR  K +K   
Subjt:  IAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRETERPALGPADFRPRPNTDLLCNTDGPRPAPKRNVKTRKMPV

Query:  SRNEKSATGDAPKKRKSNYQRFQTDATAQEGIANNTSVVPEEVQSSKPQR
         R         PK+R+   +    D   QE    N +V   E  +   +R
Subjt:  SRNEKSATGDAPKKRKSNYQRFQTDATAQEGIANNTSVVPEEVQSSKPQR

AT3G18100.3 myb domain protein 4r12.7e-13041.26Show/hide
Query:  RNHGDEIDVEHLADKEDGVVDEDMEVLQRAYRLVGVN-PRLSSPT-------VGDADPGSDSDDVDDFELLRNIQNRFSIVADEQVSILPQ-----VSAE
        RN   E D +   D+ED  + ED+E L+RA  +  VN  + +S T       VG  +  SDS++ DDFE+LR I+++ +   D   S  P      +S  
Subjt:  RNHGDEIDVEHLADKEDGVVDEDMEVLQRAYRLVGVN-PRLSSPT-------VGDADPGSDSDDVDDFELLRNIQNRFSIVADEQVSILPQ-----VSAE

Query:  EEEDEFEILRSIQRRFAAYESDTLSNEPNQSRDYVGSLKRDSDDIAVESQTSSKR-----PSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLE
        E ED+FE++RSI+ +        LS   + S   +G L  D +D A E+  + +R      +   F   S  K      + +   ++Q+ +   KM+  +
Subjt:  EEEDEFEILRSIQRRFAAYESDTLSNEPNQSRDYVGSLKRDSDDIAVESQTSSKR-----PSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLE

Query:  ARIEENKKLRKRC-----------KILKDFQGSCKRRTSCALS--QMIDPRVQLISAAKPQAKDSSK----------KDKRVSGMYYGPAENSHVACYRM
         R++ + KL +R              L++     K+R+S   +  +M DPRV+LIS  K    DSS+           DK++S +  GPAEN  V  YRM
Subjt:  ARIEENKKLRKRC-----------KILKDFQGSCKRRTSCALS--QMIDPRVQLISAAKPQAKDSSK----------KDKRVSGMYYGPAENSHVACYRM

Query:  AMAKFP-RVDRKKWSIVERENLGKGIRQQFQEMVLQISVDQISGPQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDNLASMYLQGRSGAECEAR
        A+ K+P  V R+KWS  E +NL KG++Q+ Q+++L  ++++ S    + G + D+D I  SI +L+I PE IR+FLPK+NWD   S+ ++ RS AECEAR
Subjt:  AMAKFP-RVDRKKWSIVERENLGKGIRQQFQEMVLQISVDQISGPQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDNLASMYLQGRSGAECEAR

Query:  WLNFEDPLINRDPWTTSEDKNLLFTIQQKGLNNWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVRDWQAVASALEGRAGT
        W++ EDPLIN  PWT +EDKNLL TI+Q  L +W++IAVSLGTNRTPFQCL+RYQRSLN SILK+EWT +EDD+LR+AV +FG +DWQ+VA+ L+GR GT
Subjt:  WLNFEDPLINRDPWTTSEDKNLLFTIQQKGLNNWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVRDWQAVASALEGRAGT

Query:  QCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKIGTRKQNFCLVGIKFNAEKGVWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVP
        QCSNRWKKSL P  T++G ++ +ED R+K+AV LFG +   +   F + G      +  W NCLDP + R +WTEEED +L  AI EHGYSW+KVA  + 
Subjt:  QCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKIGTRKQNFCLVGIKFNAEKGVWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVP

Query:  SRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRETERPALGPADFRPRPNTDLLCNTDGPRPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSN
         RTDN+C RRWK+L+P++V LLQEAR++QK A + NFVDRE+ERPAL  +     P+  L    D      KR  K +K    R         PK+R+  
Subjt:  SRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRETERPALGPADFRPRPNTDLLCNTDGPRPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSN

Query:  YQRFQTDATAQEGIANNTSVVPEEVQSSKPQR
         +    D   QE    N +V   E  +   +R
Subjt:  YQRFQTDATAQEGIANNTSVVPEEVQSSKPQR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTCTCCGCAACCATGGTGATGAAATTGACGTTGAGCATCTTGCTGACAAGGAAGATGGTGTGGTTGATGAGGACATGGAAGTTCTTCAGAGAGCCTATAGGCTTGT
TGGTGTTAATCCCAGGTTGTCATCACCTACTGTCGGTGATGCTGATCCTGGTTCTGATTCTGATGACGTTGATGATTTTGAACTTCTTCGAAATATTCAGAACCGGTTCT
CGATTGTGGCTGATGAGCAGGTGAGTATTCTCCCACAAGTGTCTGCAGAAGAGGAGGAAGATGAATTTGAGATACTTCGTTCGATACAGCGGCGCTTTGCAGCGTATGAA
AGTGATACTTTGAGCAATGAACCCAATCAGTCTCGTGACTATGTTGGGTCTCTGAAGAGGGATTCTGATGACATAGCTGTTGAGAGCCAGACATCCTCTAAAAGGCCATC
CATGCTAGCCTTTGAAAAGGGAAGCTTGCCAAAGGCTGCATTGGCATTTGTTGATGCCATCAAGAAAAATAGGTCCCAGCAGAAGTTTATTCGTAGTAAGATGATTCATC
TTGAAGCTAGAATTGAGGAGAACAAAAAGCTCAGAAAACGTTGCAAAATTCTCAAAGATTTCCAGGGTTCATGTAAGCGAAGAACAAGTTGTGCACTGTCTCAAATGATA
GATCCTCGAGTCCAGTTAATTTCAGCTGCAAAACCACAAGCAAAGGATTCATCAAAGAAGGACAAACGAGTATCTGGAATGTATTATGGCCCGGCTGAGAATTCTCATGT
TGCGTGTTACAGAATGGCAATGGCGAAGTTTCCCCGTGTGGATCGAAAGAAATGGTCCATTGTAGAAAGGGAGAATCTAGGGAAGGGAATAAGACAGCAGTTTCAGGAGA
TGGTGCTTCAGATTTCAGTGGATCAAATCAGTGGGCCACAAGGAGTTTCTGGAGATTCAGACGATTTAGATAACATTCTTGCATCCATAAAAGACCTTGACATCGCTCCT
GAAAAGATTAGGGAATTTCTGCCAAAAGTTAATTGGGACAACTTGGCTTCCATGTATCTTCAGGGTCGCTCAGGGGCAGAATGTGAAGCAAGGTGGTTGAATTTTGAAGA
CCCCCTAATTAATCGAGATCCATGGACTACAAGTGAGGATAAAAATCTTTTGTTTACTATCCAACAGAAGGGGTTGAATAACTGGATTGAGATAGCAGTTTCATTGGGCA
CAAACAGAACTCCTTTTCAGTGCTTGTCTCGGTATCAAAGGAGTTTAAATGCTTCCATATTAAAGAGGGAGTGGACCAAAGATGAGGATGATAAACTTCGATCTGCTGTT
GCTATATTTGGCGTGAGAGATTGGCAGGCAGTAGCTTCTGCTTTGGAAGGACGAGCTGGCACACAGTGCTCTAATAGGTGGAAAAAATCCCTTGACCCAGCTAGGACAAA
AAGAGGTTATTTCACTCCAGATGAAGACAGTCGCTTGAAAATTGCTGTACTGCTCTTTGGGCCCAAAATTGGAACAAGAAAGCAGAATTTTTGCCTGGTCGGAATCAAGT
TCAATGCAGAGAAAGGTGTATGGTTTAATTGTTTAGATCCTTCCTTGAGAAGATGTGAGTGGACAGAAGAGGAGGATTTAAGGCTGGAGATAGCAATTCAGGAACATGGA
TATAGCTGGGCTAAGGTAGCTGCATGTGTGCCATCACGAACAGATAATGAGTGTCGGAGGAGATGGAAGAAGTTATTTCCCAATGAAGTTCCTTTGCTCCAGGAAGCTAG
AAAGATTCAGAAGGCTGCTCTTATTAGCAACTTTGTTGATAGGGAAACAGAGCGTCCTGCTCTTGGTCCCGCTGACTTTCGACCCAGGCCAAATACAGATTTATTATGTA
ACACTGATGGTCCAAGACCTGCCCCAAAAAGAAATGTGAAGACGAGAAAGATGCCAGTGTCAAGGAATGAAAAGAGTGCTACTGGTGATGCTCCAAAGAAGAGGAAATCA
AATTACCAGAGGTTTCAGACTGATGCAACTGCTCAGGAGGGTATTGCAAATAATACCTCTGTTGTCCCAGAAGAGGTTCAATCTTCAAAGCCTCAAAGAAAACGAAATAG
ACGCGGAGCTTATACTGCAAAGAGAGTAGGGGTACCGGAGCTACGTTCTGACAGCGAGTGGTGTGCTAAACAGAATTTGGACACTCACAGCCTTGGGTTGCAGCTGAATA
GTAAGGAATCTGAGAGGACCAACAGTGACTGCACCGAGACTGTTGATGAGAATATTATGGAAGTTTTTGAGAACAAAGTTGCAGAGAAGCTTACTGAAAAAAATGCATGC
TTTTCAGAACCAGAAAAAAATCAAAACTCAACCGGATCTTCTGGAGTCTCGGTATTGTCGGAAATGACTAACGACTTGGTTGACTATAATCCCTCTATCCTCACAGATAC
AACATTGTTGGCCAGTACTACCGTGGATGATATCGAAGAATTGAAGGGGAAGAGTGTTGCAGAAAGGGATCTGGATGACAGTAACAGTGTCTCGTTACCGCACAGTTGCT
TAGAACTCCGGACAATTGACTGTGAAGGTGTAGACAGTTATTCTGTGGACGAATTTACAGCTAAAAGCAATGGGGTTTGCAATCCCACCCAAGGCAGAAGGAAGAAAAAT
AGCAAAACATCAAATAACAGCCATGATTATTTGTTTCTTTCTCGTCAACAAATTGAGCAAGAGACGTCGGGAACTAACAAGCCTCATTGTCACAATCAATCAAAGAAGAG
AGAACATAACAATACAGGTACAAGTACGTTAGGAACATTGGAGGCAGTTGAGGAGGTTGATGACTGCACTCTCGTGGGGTTTTTGCAAAAAAGATTGAAGAGGACAATGA
CACATAACGAGACAGTTGATTGCAGTTCTAGTACTCCTTTAGAAGTTGATAATGAGCCTACCATTGCCTCGTTTCTTTATAAATTGAAGAGAAAAAAGCATCAGCGGCCT
AGTGGTGGTGAGTTAAATTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCTCTCCGCAACCATGGTGATGAAATTGACGTTGAGCATCTTGCTGACAAGGAAGATGGTGTGGTTGATGAGGACATGGAAGTTCTTCAGAGAGCCTATAGGCTTGT
TGGTGTTAATCCCAGGTTGTCATCACCTACTGTCGGTGATGCTGATCCTGGTTCTGATTCTGATGACGTTGATGATTTTGAACTTCTTCGAAATATTCAGAACCGGTTCT
CGATTGTGGCTGATGAGCAGGTGAGTATTCTCCCACAAGTGTCTGCAGAAGAGGAGGAAGATGAATTTGAGATACTTCGTTCGATACAGCGGCGCTTTGCAGCGTATGAA
AGTGATACTTTGAGCAATGAACCCAATCAGTCTCGTGACTATGTTGGGTCTCTGAAGAGGGATTCTGATGACATAGCTGTTGAGAGCCAGACATCCTCTAAAAGGCCATC
CATGCTAGCCTTTGAAAAGGGAAGCTTGCCAAAGGCTGCATTGGCATTTGTTGATGCCATCAAGAAAAATAGGTCCCAGCAGAAGTTTATTCGTAGTAAGATGATTCATC
TTGAAGCTAGAATTGAGGAGAACAAAAAGCTCAGAAAACGTTGCAAAATTCTCAAAGATTTCCAGGGTTCATGTAAGCGAAGAACAAGTTGTGCACTGTCTCAAATGATA
GATCCTCGAGTCCAGTTAATTTCAGCTGCAAAACCACAAGCAAAGGATTCATCAAAGAAGGACAAACGAGTATCTGGAATGTATTATGGCCCGGCTGAGAATTCTCATGT
TGCGTGTTACAGAATGGCAATGGCGAAGTTTCCCCGTGTGGATCGAAAGAAATGGTCCATTGTAGAAAGGGAGAATCTAGGGAAGGGAATAAGACAGCAGTTTCAGGAGA
TGGTGCTTCAGATTTCAGTGGATCAAATCAGTGGGCCACAAGGAGTTTCTGGAGATTCAGACGATTTAGATAACATTCTTGCATCCATAAAAGACCTTGACATCGCTCCT
GAAAAGATTAGGGAATTTCTGCCAAAAGTTAATTGGGACAACTTGGCTTCCATGTATCTTCAGGGTCGCTCAGGGGCAGAATGTGAAGCAAGGTGGTTGAATTTTGAAGA
CCCCCTAATTAATCGAGATCCATGGACTACAAGTGAGGATAAAAATCTTTTGTTTACTATCCAACAGAAGGGGTTGAATAACTGGATTGAGATAGCAGTTTCATTGGGCA
CAAACAGAACTCCTTTTCAGTGCTTGTCTCGGTATCAAAGGAGTTTAAATGCTTCCATATTAAAGAGGGAGTGGACCAAAGATGAGGATGATAAACTTCGATCTGCTGTT
GCTATATTTGGCGTGAGAGATTGGCAGGCAGTAGCTTCTGCTTTGGAAGGACGAGCTGGCACACAGTGCTCTAATAGGTGGAAAAAATCCCTTGACCCAGCTAGGACAAA
AAGAGGTTATTTCACTCCAGATGAAGACAGTCGCTTGAAAATTGCTGTACTGCTCTTTGGGCCCAAAATTGGAACAAGAAAGCAGAATTTTTGCCTGGTCGGAATCAAGT
TCAATGCAGAGAAAGGTGTATGGTTTAATTGTTTAGATCCTTCCTTGAGAAGATGTGAGTGGACAGAAGAGGAGGATTTAAGGCTGGAGATAGCAATTCAGGAACATGGA
TATAGCTGGGCTAAGGTAGCTGCATGTGTGCCATCACGAACAGATAATGAGTGTCGGAGGAGATGGAAGAAGTTATTTCCCAATGAAGTTCCTTTGCTCCAGGAAGCTAG
AAAGATTCAGAAGGCTGCTCTTATTAGCAACTTTGTTGATAGGGAAACAGAGCGTCCTGCTCTTGGTCCCGCTGACTTTCGACCCAGGCCAAATACAGATTTATTATGTA
ACACTGATGGTCCAAGACCTGCCCCAAAAAGAAATGTGAAGACGAGAAAGATGCCAGTGTCAAGGAATGAAAAGAGTGCTACTGGTGATGCTCCAAAGAAGAGGAAATCA
AATTACCAGAGGTTTCAGACTGATGCAACTGCTCAGGAGGGTATTGCAAATAATACCTCTGTTGTCCCAGAAGAGGTTCAATCTTCAAAGCCTCAAAGAAAACGAAATAG
ACGCGGAGCTTATACTGCAAAGAGAGTAGGGGTACCGGAGCTACGTTCTGACAGCGAGTGGTGTGCTAAACAGAATTTGGACACTCACAGCCTTGGGTTGCAGCTGAATA
GTAAGGAATCTGAGAGGACCAACAGTGACTGCACCGAGACTGTTGATGAGAATATTATGGAAGTTTTTGAGAACAAAGTTGCAGAGAAGCTTACTGAAAAAAATGCATGC
TTTTCAGAACCAGAAAAAAATCAAAACTCAACCGGATCTTCTGGAGTCTCGGTATTGTCGGAAATGACTAACGACTTGGTTGACTATAATCCCTCTATCCTCACAGATAC
AACATTGTTGGCCAGTACTACCGTGGATGATATCGAAGAATTGAAGGGGAAGAGTGTTGCAGAAAGGGATCTGGATGACAGTAACAGTGTCTCGTTACCGCACAGTTGCT
TAGAACTCCGGACAATTGACTGTGAAGGTGTAGACAGTTATTCTGTGGACGAATTTACAGCTAAAAGCAATGGGGTTTGCAATCCCACCCAAGGCAGAAGGAAGAAAAAT
AGCAAAACATCAAATAACAGCCATGATTATTTGTTTCTTTCTCGTCAACAAATTGAGCAAGAGACGTCGGGAACTAACAAGCCTCATTGTCACAATCAATCAAAGAAGAG
AGAACATAACAATACAGGTACAAGTACGTTAGGAACATTGGAGGCAGTTGAGGAGGTTGATGACTGCACTCTCGTGGGGTTTTTGCAAAAAAGATTGAAGAGGACAATGA
CACATAACGAGACAGTTGATTGCAGTTCTAGTACTCCTTTAGAAGTTGATAATGAGCCTACCATTGCCTCGTTTCTTTATAAATTGAAGAGAAAAAAGCATCAGCGGCCT
AGTGGTGGTGAGTTAAATTAA
Protein sequenceShow/hide protein sequence
MSLRNHGDEIDVEHLADKEDGVVDEDMEVLQRAYRLVGVNPRLSSPTVGDADPGSDSDDVDDFELLRNIQNRFSIVADEQVSILPQVSAEEEEDEFEILRSIQRRFAAYE
SDTLSNEPNQSRDYVGSLKRDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRCKILKDFQGSCKRRTSCALSQMI
DPRVQLISAAKPQAKDSSKKDKRVSGMYYGPAENSHVACYRMAMAKFPRVDRKKWSIVERENLGKGIRQQFQEMVLQISVDQISGPQGVSGDSDDLDNILASIKDLDIAP
EKIREFLPKVNWDNLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKNLLFTIQQKGLNNWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAV
AIFGVRDWQAVASALEGRAGTQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKIGTRKQNFCLVGIKFNAEKGVWFNCLDPSLRRCEWTEEEDLRLEIAIQEHG
YSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRETERPALGPADFRPRPNTDLLCNTDGPRPAPKRNVKTRKMPVSRNEKSATGDAPKKRKS
NYQRFQTDATAQEGIANNTSVVPEEVQSSKPQRKRNRRGAYTAKRVGVPELRSDSEWCAKQNLDTHSLGLQLNSKESERTNSDCTETVDENIMEVFENKVAEKLTEKNAC
FSEPEKNQNSTGSSGVSVLSEMTNDLVDYNPSILTDTTLLASTTVDDIEELKGKSVAERDLDDSNSVSLPHSCLELRTIDCEGVDSYSVDEFTAKSNGVCNPTQGRRKKN
SKTSNNSHDYLFLSRQQIEQETSGTNKPHCHNQSKKREHNNTGTSTLGTLEAVEEVDDCTLVGFLQKRLKRTMTHNETVDCSSSTPLEVDNEPTIASFLYKLKRKKHQRP
SGGELN