; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0016396 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0016396
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionsnRNA-activating protein complex subunit 4
Genome locationchr06:25346847..25364581
RNA-Seq ExpressionPay0016396
SyntenyPay0016396
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000978 - RNA polymerase II proximal promoter sequence-specific DNA binding (molecular function)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
InterPro domainsIPR001005 - SANT/Myb domain
IPR009057 - Homeobox-like domain superfamily
IPR017930 - Myb domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008452207.1 PREDICTED: snRNA-activating protein complex subunit 4 [Cucumis melo]0.0e+0099.62Show/hide
Query:  MSLHNHVDEIDVEHRADKEDGVVDEDMEVLQRAYRLVGVNPEDYIHPRSSSITAGDADPGSDSDDVDDFELLRDIQNRFSIVADEQPLSTLSPVSADEEE
        MSLHNHVDEIDVEHRADKEDGVVDEDMEVLQRAYRLVGVNPEDYIHPRSSSITAGDADPGSDSDDVDDFELLRDIQNRFSIVADEQPLSTLSPVSADEEE
Subjt:  MSLHNHVDEIDVEHRADKEDGVVDEDMEVLQRAYRLVGVNPEDYIHPRSSSITAGDADPGSDSDDVDDFELLRDIQNRFSIVADEQPLSTLSPVSADEEE

Query:  DEFEMLRSIQRRFAAYESDTLSNKPDQSRDYDGSLKMDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKK
        DEFEMLRSIQRRFAAYESDTLSNKPDQSRDYDGSLKMDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKK
Subjt:  DEFEMLRSIQRRFAAYESDTLSNKPDQSRDYDGSLKMDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKK

Query:  LRKRCKILKDFQCSCKRRTSSALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPAENSHVACHRMALAKFPRVDRKKWSIVERENLGKGIRQQFQE
        LRKRCKILKDFQCSCKRRTSSALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPAENSHVACHRMALAKFPRVDRKKWSIVERENLGKGIRQQFQE
Subjt:  LRKRCKILKDFQCSCKRRTSSALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPAENSHVACHRMALAKFPRVDRKKWSIVERENLGKGIRQQFQE

Query:  MVLQISVDQFSGQQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTASEDKNLLFTIQQKGLN
        MVLQISVDQFSGQQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTASEDKNLLFTIQQKGLN
Subjt:  MVLQISVDQFSGQQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTASEDKNLLFTIQQKGLN

Query:  NWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTKRGYFTPYEDVRLKIAV
        NWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTKRGYFTPYEDVRLKIAV
Subjt:  NWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTKRGYFTPYEDVRLKIAV

Query:  MLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAAL
        MLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAAL
Subjt:  MLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAAL

Query:  ISNFVDRETERPALGPADFRPRPNTDLLCNTDGPRPVPKRNVKTRKTPVSRNEKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKPQRK
        ISNFVDRETERPALGPADFRPRPNTDLLCNTDGPRPVPKRNVKTRKTPVSRNEKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKPQRK
Subjt:  ISNFVDRETERPALGPADFRPRPNTDLLCNTDGPRPVPKRNVKTRKTPVSRNEKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKPQRK

Query:  RNRRGASNAKRIGVPDLRSDSKWCAKQNLDTQSLGLQLNSKESDRTNSDCTETVDENILEVFENKVAEKLTEKIACFSEQKKNQ
        RNRRGASNAKRIGVPDLRSDS+WCAKQNLDTQSLGLQLNSKESDRTNSDCTETVDENILEVFENKVAEKLTEKIACFSEQKK++
Subjt:  RNRRGASNAKRIGVPDLRSDSKWCAKQNLDTQSLGLQLNSKESDRTNSDCTETVDENILEVFENKVAEKLTEKIACFSEQKKNQ

XP_011650584.1 uncharacterized protein LOC101216287 [Cucumis sativus]0.0e+0092.25Show/hide
Query:  MSLHNHVDEIDVEHRADKEDGVVDEDMEVLQRAYRLVGVNPEDYIHPRSSSITAGDADPGSDSDDVDDFELLRDIQNRFSIVADEQPLSTLSPVSADEEE
        MSL NHVDEIDVEH ADKEDGVVDEDMEVLQRAYRL GVNPEDYI+PR SS  AGDADPGSDSDDVDDFELLRDIQNRFSI+ADEQP ST  PVSADEEE
Subjt:  MSLHNHVDEIDVEHRADKEDGVVDEDMEVLQRAYRLVGVNPEDYIHPRSSSITAGDADPGSDSDDVDDFELLRDIQNRFSIVADEQPLSTLSPVSADEEE

Query:  DEFEMLRSIQRRFAAYESDTLSNKPDQSRDYDGSLKMDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKK
        DEFEMLRSIQRRFAAYESDTLSNKP+QSRDY GSLK+DSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKK
Subjt:  DEFEMLRSIQRRFAAYESDTLSNKPDQSRDYDGSLKMDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKK

Query:  LRKRCKILKDFQCSCKRRTSSALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPAENSHVACHRMALAKFPRVDRKKWSIVERENLGKGIRQQFQE
        LRKRCKILKDFQ SCKRRTS ALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGP ENSHVAC+RM LAKFP VDRKKWSIVERENLGKGIRQQFQE
Subjt:  LRKRCKILKDFQCSCKRRTSSALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPAENSHVACHRMALAKFPRVDRKKWSIVERENLGKGIRQQFQE

Query:  MVLQISVDQFSGQQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTASEDKNLLFTIQQKGLN
        MVLQISVDQ SG QG+SGDSDDLDNILASIKDLDIAP+KIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWT SEDK+LLFTIQQKGLN
Subjt:  MVLQISVDQFSGQQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTASEDKNLLFTIQQKGLN

Query:  NWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTKRGYFTPYEDVRLKIAV
        NWIE+AVSLGTNRTPFQCLSRYQRSLNASILKREWTK+EDD+LRSAVA FGVRDWQAVASTLEGRAGTQCSNRWKKSLDPART++GYFTP ED+RLKIAV
Subjt:  NWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTKRGYFTPYEDVRLKIAV

Query:  MLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAAL
        +LFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAAL
Subjt:  MLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAAL

Query:  ISNFVDRETERPALGPADFRPRPNTDLLCNTDGPRPVPKRNVKTRKTPVSRNEKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKPQRK
        ISNFVDRETERPALGPADFRPRPNTD LCNTDGP P PKRNVKTRK PVSRNEKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKPQRK
Subjt:  ISNFVDRETERPALGPADFRPRPNTDLLCNTDGPRPVPKRNVKTRKTPVSRNEKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKPQRK

Query:  RNRRGASNAKRIGVPDLRSDSKWCAKQNLDTQSLGLQLNSKESDRTNSDCTETVDENILEVFENKVAEKLTEKIACFSEQKKNQNSTGSSGVSVLSEMTS
        RNRRGA  AKRIGVP+LRSDS+WCAKQNLDT+SLGLQLNSKES+R+NS+CTETVDENI+EV ENKVAEKLTE+ ACFSE +KNQNSTGSSGVSVLSEMT+
Subjt:  RNRRGASNAKRIGVPDLRSDSKWCAKQNLDTQSLGLQLNSKESDRTNSDCTETVDENILEVFENKVAEKLTEKIACFSEQKKNQNSTGSSGVSVLSEMTS

Query:  GFVDYNPSILTDTTLLASTTVDDIEELKGKSVADRDLDDSNSFSLPHSCLELRTVDSEGVDSYSVDEFTAKSNGVCNPTQGRRKKNGKTSNNSHDNLFLP
          VDYNPSILTDTTL ASTTVDDIEELKGKS ADRDLDDSNSFSL HSCLELRTVDSEGVDSYSVDE+TAKSNGVCNPTQGRRKKN KTSNNSHDNL +P
Subjt:  GFVDYNPSILTDTTLLASTTVDDIEELKGKSVADRDLDDSNSFSLPHSCLELRTVDSEGVDSYSVDEFTAKSNGVCNPTQGRRKKNGKTSNNSHDNLFLP

Query:  SQQIEQETLGTKKPHRHNQSKKRKHNNTGTSTLGTLEAVEEVDDCTLVGFLQKRLKRTAVTHNETVDCRSSTPLNVDDDDNEPTIASFLNKLKRKKHQRP
         QQI QETLGTKKP  HNQSKKRKH+NTG STL T EAVEEVDDCTLVGFLQKRLKRTA+THNETVDC S+ PL VD+DDNEPTIASFLNKLKRKKHQRP
Subjt:  SQQIEQETLGTKKPHRHNQSKKRKHNNTGTSTLGTLEAVEEVDDCTLVGFLQKRLKRTAVTHNETVDCRSSTPLNVDDDDNEPTIASFLNKLKRKKHQRP

Query:  SGCELN
        SG ELN
Subjt:  SGCELN

XP_023515735.1 uncharacterized protein LOC111779809 isoform X1 [Cucurbita pepo subsp. pepo]0.0e+0078.24Show/hide
Query:  MSLHNHVDEIDVEHRA---DKEDGVVDEDMEVLQRAYRLVGVNPEDYIHPRSSSITAGDADPGSDSDDVDDFELLRDIQNRFSIVADEQPLSTLSPVSAD
        MS  +H D  D E  A   D ED +VD+DME L+RA RL GVN EDYI+PR S   AGDA+ GSDSDDVDD ELLR+IQNRFSI ADEQPLS L PV+AD
Subjt:  MSLHNHVDEIDVEHRA---DKEDGVVDEDMEVLQRAYRLVGVNPEDYIHPRSSSITAGDADPGSDSDDVDDFELLRDIQNRFSIVADEQPLSTLSPVSAD

Query:  EEEDEFEMLRSIQRRFAAYESDTLSNKPDQSRDYDGSLKMDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEE
        EEED+FEMLRSIQRRFAAYESD LSNKPDQS D DG LKMDS++  VE  TSS+R SM+AFEKGSLPKAALAF+DAIKKNRSQQKFIRSKMIHLEARIEE
Subjt:  EEEDEFEMLRSIQRRFAAYESDTLSNKPDQSRDYDGSLKMDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEE

Query:  NKKLRKRCKILKDFQCSCKRRTSSALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPAENSHVACHRMALAKFPRVDRKKWSIVERENLGKGIRQQ
        NKKLRKR K+LK FQ SC+R+T+ AL+QM+DPRVQLISA KPQAKDSSKKDKRLS M YGPAENSHVAC+R A  KF  VDRK+WS  ERENLGKGIRQQ
Subjt:  NKKLRKRCKILKDFQCSCKRRTSSALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPAENSHVACHRMALAKFPRVDRKKWSIVERENLGKGIRQQ

Query:  FQEMVLQISVDQFSGQQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTASEDKNLLFTIQQK
        FQEMVLQISVDQ S  QG S +SDDLDNILASIK LDI PEKIREFLPKVNWDKLASMYL+GRSGAECEARWLNFEDPLINR+PWT SEDKNLLFTIQQK
Subjt:  FQEMVLQISVDQFSGQQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTASEDKNLLFTIQQK

Query:  GLNNWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTKRGYFTPYEDVRLK
        GLNNWIE+AVSLGTNRTPFQCLSRYQRSLNASILK EWTKDEDDKLRSAVA+FG  DWQAVASTLEGR G QCSNRWKKSLDPARTKRGYFTP ED RLK
Subjt:  GLNNWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTKRGYFTPYEDVRLK

Query:  IAVMLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQK
        IAV+LFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFP++VPLLQEARKIQK
Subjt:  IAVMLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQK

Query:  AALISNFVDRETERPALGPADFRPRPNTDLLCNTDGPRPVPKRNVKTRKTPVSRNEKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKP
         ALISNFVDRE+ERPALGP DFRP PN+ LLCNTD P   PKRNV+TR+ PVSRNEKSA GDAPK+RKSN QR + D TAQV    NTS VP EV+S+KP
Subjt:  AALISNFVDRETERPALGPADFRPRPNTDLLCNTDGPRPVPKRNVKTRKTPVSRNEKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKP

Query:  QRKRNRRGASNAKRIGVPDLRSDSKWCAKQNLDTQSLGLQLNSKE-SDRTNSDCTETVDENILEVFENKVAEKLTEKIACFSEQKKNQNSTGSSGVSVLS
        QRKR R GA   +R G P +  +S+ CA+QN DT+S+ +QLN KE ++R NSDC ETVDEN +EVFENK AE  +E + CFSEQ++NQNSTGSSGVSVLS
Subjt:  QRKRNRRGASNAKRIGVPDLRSDSKWCAKQNLDTQSLGLQLNSKE-SDRTNSDCTETVDENILEVFENKVAEKLTEKIACFSEQKKNQNSTGSSGVSVLS

Query:  EMTSGFVDYNPSILTDTTLLASTTVDDIEELKGKSVADRDLDDSNSFSLPHSCLELRTVDSEGVDSYSVDEFTAKSNGVCNPTQGRRKKNGKTSNNSHDN
        EMT+   +YNPS L DTTLLAS T DDI E KG +VAD+DLDDSNSFSLP SCLELRT DSEGVDSYSVDEFT KS+GVC P QGRRKKN K SN S D+
Subjt:  EMTSGFVDYNPSILTDTTLLASTTVDDIEELKGKSVADRDLDDSNSFSLPHSCLELRTVDSEGVDSYSVDEFTAKSNGVCNPTQGRRKKNGKTSNNSHDN

Query:  LFLPSQQIEQETLGTKKPHRHNQSKKRKHNNTGTSTLGTLEAVEEVDDCTLVGFLQKRLKRTAVTHNETVDCRSSTPLNVDDDDNEPTIASFLN-KLKRK
        L +  QQ E E  GT + HR NQSKKRKH+ T TS LGT+EAVEEVDDCTL GFLQKRLKRT  TH++ VD  SSTP  VD+DDN+PT+A  LN KLKRK
Subjt:  LFLPSQQIEQETLGTKKPHRHNQSKKRKHNNTGTSTLGTLEAVEEVDDCTLVGFLQKRLKRTAVTHNETVDCRSSTPLNVDDDDNEPTIASFLN-KLKRK

Query:  KH
        KH
Subjt:  KH

XP_038905712.1 uncharacterized protein LOC120091681 isoform X1 [Benincasa hispida]0.0e+0084.52Show/hide
Query:  MSLHNHVDEIDVEHRADKEDGVVDEDMEVLQRAYRLVGVNPEDYIHPRSSSITAGDADPGSDSDDVDDFELLRDIQNRFSIVADEQPLSTLSPVSADEEE
        MS HNH DE DVE  A+KED VVDEDMEVLQRAYRLVGVNPEDYI+PR SS   GDA+ G DSDD DDFELLR+IQNRFSIV DEQPLSTL PVS DEEE
Subjt:  MSLHNHVDEIDVEHRADKEDGVVDEDMEVLQRAYRLVGVNPEDYIHPRSSSITAGDADPGSDSDDVDDFELLRDIQNRFSIVADEQPLSTLSPVSADEEE

Query:  DEFEMLRSIQRRFAAYESDTLSNKPDQSRDYDGSLKMDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKK
        DEFEMLRSIQRRFAAYESD LSNKP++SRDY GSLKMDS + A ESQTSSKRPSM+AFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKK
Subjt:  DEFEMLRSIQRRFAAYESDTLSNKPDQSRDYDGSLKMDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKK

Query:  LRKRCKILKDFQCSCKRRTSSALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPAENSHVACHRMALAKFPRVDRKKWSIVERENLGKGIRQQFQE
        LRKRCKILKDFQ SCKR+T+ ALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPAENSHVAC+RMALAKFPRVDRKKWSIVERENLGKGIRQQFQE
Subjt:  LRKRCKILKDFQCSCKRRTSSALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPAENSHVACHRMALAKFPRVDRKKWSIVERENLGKGIRQQFQE

Query:  MVLQISVDQFSGQQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTASEDKNLLFTIQQKGLN
        MVLQISVDQ SG QG S DSDDLDNILASIKDLDI PEKIREFLPKVNWDKLASMYL GRSGAECEARWLNFEDPLINRDPWT SEDKNLLFTIQQKGLN
Subjt:  MVLQISVDQFSGQQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTASEDKNLLFTIQQKGLN

Query:  NWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTKRGYFTPYEDVRLKIAV
        NWIEIAVS GTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTKRG+FTP ED RLKIAV
Subjt:  NWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTKRGYFTPYEDVRLKIAV

Query:  MLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAAL
        +L GPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDN+CRRRWKKLFP+EVPLLQEARKIQKAAL
Subjt:  MLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAAL

Query:  ISNFVDRETERPALGPADFRPRPNTDLLCNTDGPRPVPKRNVKTRKTPVSRNEKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKPQRK
        ISNFVDRE+ERPALGPADFRPR NTD+LC+TD P+P PKRN KTRK PVSRNEKSATGDAP+KRKSNYQR Q DATA+VGIA NTS VPEEVQS KP RK
Subjt:  ISNFVDRETERPALGPADFRPRPNTDLLCNTDGPRPVPKRNVKTRKTPVSRNEKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKPQRK

Query:  RNRRGASNAKRIGVPDLRSDSKWCAKQNLDTQSLGLQLNSKESDRTNSDCTETVDENILEVFENKVAEKLTEKIACFSEQKKNQNSTGSSGVSVLSEMTS
        RNR  A   KR GV +L S +KWCAKQNL+T+S+G+QL+SKE + TNSD TETVD N LEVFENK+A+KL+E+   FSE ++NQNSTGSSGVSVLSEMT+
Subjt:  RNRRGASNAKRIGVPDLRSDSKWCAKQNLDTQSLGLQLNSKESDRTNSDCTETVDENILEVFENKVAEKLTEKIACFSEQKKNQNSTGSSGVSVLSEMTS

Query:  GFVDYNPSILTDTTLLASTTVDDIEELKGKSVADRDLDDSNSFSLPHSCLELRTVDSEGVDSYSVDEFTAKSNGVCNPTQGRRKKNGKTSNNSHDNLFLP
           +YNPSIL DTTLLASTTVDDIEELKGKS ADRDLDDSNSFSLP SCLELRT+D EGVDSYSVD+ T KS+ VC   QGRRKKN KTS+ +H+  FL 
Subjt:  GFVDYNPSILTDTTLLASTTVDDIEELKGKSVADRDLDDSNSFSLPHSCLELRTVDSEGVDSYSVDEFTAKSNGVCNPTQGRRKKNGKTSNNSHDNLFLP

Query:  SQQIEQETLGTKKPHRHNQSKKRKHNNTGTSTLGTLEAVEEVDDCTLVGFLQKRLKRTAVTHNETVDCRSSTPLNVDDDDNEPTIASFLNKLKRKKHQRP
         QQ+EQE LG  +P   NQSKKRKH++T TS LGTLEAVEEVD+CTLVGFLQKRLK+        VDC S TPL VD+DDN+  IASFLNKLKRKKHQ P
Subjt:  SQQIEQETLGTKKPHRHNQSKKRKHNNTGTSTLGTLEAVEEVDDCTLVGFLQKRLKRTAVTHNETVDCRSSTPLNVDDDDNEPTIASFLNKLKRKKHQRP

Query:  S
        S
Subjt:  S

XP_038905717.1 uncharacterized protein LOC120091681 isoform X2 [Benincasa hispida]0.0e+0084.52Show/hide
Query:  MSLHNHVDEIDVEHRADKEDGVVDEDMEVLQRAYRLVGVNPEDYIHPRSSSITAGDADPGSDSDDVDDFELLRDIQNRFSIVADEQPLSTLSPVSADEEE
        MS HNH DE DVE  A+KED VVDEDMEVLQRAYRLVGVNPEDYI+PR SS   GDA+ G DSDD DDFELLR+IQNRFSIV DEQPLSTL PVS DEEE
Subjt:  MSLHNHVDEIDVEHRADKEDGVVDEDMEVLQRAYRLVGVNPEDYIHPRSSSITAGDADPGSDSDDVDDFELLRDIQNRFSIVADEQPLSTLSPVSADEEE

Query:  DEFEMLRSIQRRFAAYESDTLSNKPDQSRDYDGSLKMDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKK
        DEFEMLRSIQRRFAAYESD LSNKP++SRDY GSLKMDS + A ESQTSSKRPSM+AFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKK
Subjt:  DEFEMLRSIQRRFAAYESDTLSNKPDQSRDYDGSLKMDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKK

Query:  LRKRCKILKDFQCSCKRRTSSALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPAENSHVACHRMALAKFPRVDRKKWSIVERENLGKGIRQQFQE
        LRKRCKILKDFQ SCKR+T+ ALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPAENSHVAC+RMALAKFPRVDRKKWSIVERENLGKGIRQQFQE
Subjt:  LRKRCKILKDFQCSCKRRTSSALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPAENSHVACHRMALAKFPRVDRKKWSIVERENLGKGIRQQFQE

Query:  MVLQISVDQFSGQQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTASEDKNLLFTIQQKGLN
        MVLQISVDQ SG QG S DSDDLDNILASIKDLDI PEKIREFLPKVNWDKLASMYL GRSGAECEARWLNFEDPLINRDPWT SEDKNLLFTIQQKGLN
Subjt:  MVLQISVDQFSGQQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTASEDKNLLFTIQQKGLN

Query:  NWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTKRGYFTPYEDVRLKIAV
        NWIEIAVS GTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTKRG+FTP ED RLKIAV
Subjt:  NWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTKRGYFTPYEDVRLKIAV

Query:  MLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAAL
        +L GPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDN+CRRRWKKLFP+EVPLLQEARKIQKAAL
Subjt:  MLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAAL

Query:  ISNFVDRETERPALGPADFRPRPNTDLLCNTDGPRPVPKRNVKTRKTPVSRNEKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKPQRK
        ISNFVDRE+ERPALGPADFRPR NTD+LC+TD P+P PKRN KTRK PVSRNEKSATGDAP+KRKSNYQR Q DATA+VGIA NTS VPEEVQS KP RK
Subjt:  ISNFVDRETERPALGPADFRPRPNTDLLCNTDGPRPVPKRNVKTRKTPVSRNEKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKPQRK

Query:  RNRRGASNAKRIGVPDLRSDSKWCAKQNLDTQSLGLQLNSKESDRTNSDCTETVDENILEVFENKVAEKLTEKIACFSEQKKNQNSTGSSGVSVLSEMTS
        RNR  A   KR GV +L S +KWCAKQNL+T+S+G+QL+SKE + TNSD TETVD N LEVFENK+A+KL+E+   FSE ++NQNSTGSSGVSVLSEMT+
Subjt:  RNRRGASNAKRIGVPDLRSDSKWCAKQNLDTQSLGLQLNSKESDRTNSDCTETVDENILEVFENKVAEKLTEKIACFSEQKKNQNSTGSSGVSVLSEMTS

Query:  GFVDYNPSILTDTTLLASTTVDDIEELKGKSVADRDLDDSNSFSLPHSCLELRTVDSEGVDSYSVDEFTAKSNGVCNPTQGRRKKNGKTSNNSHDNLFLP
           +YNPSIL DTTLLASTTVDDIEELKGKS ADRDLDDSNSFSLP SCLELRT+D EGVDSYSVD+ T KS+ VC   QGRRKKN KTS+ +H+  FL 
Subjt:  GFVDYNPSILTDTTLLASTTVDDIEELKGKSVADRDLDDSNSFSLPHSCLELRTVDSEGVDSYSVDEFTAKSNGVCNPTQGRRKKNGKTSNNSHDNLFLP

Query:  SQQIEQETLGTKKPHRHNQSKKRKHNNTGTSTLGTLEAVEEVDDCTLVGFLQKRLKRTAVTHNETVDCRSSTPLNVDDDDNEPTIASFLNKLKRKKHQRP
         QQ+EQE LG  +P   NQSKKRKH++T TS LGTLEAVEEVD+CTLVGFLQKRLK+        VDC S TPL VD+DDN+  IASFLNKLKRKKHQ P
Subjt:  SQQIEQETLGTKKPHRHNQSKKRKHNNTGTSTLGTLEAVEEVDDCTLVGFLQKRLKRTAVTHNETVDCRSSTPLNVDDDDNEPTIASFLNKLKRKKHQRP

Query:  S
        S
Subjt:  S

TrEMBL top hitse value%identityAlignment
A0A0A0L2R2 Uncharacterized protein0.0e+0092.25Show/hide
Query:  MSLHNHVDEIDVEHRADKEDGVVDEDMEVLQRAYRLVGVNPEDYIHPRSSSITAGDADPGSDSDDVDDFELLRDIQNRFSIVADEQPLSTLSPVSADEEE
        MSL NHVDEIDVEH ADKEDGVVDEDMEVLQRAYRL GVNPEDYI+PR SS  AGDADPGSDSDDVDDFELLRDIQNRFSI+ADEQP ST  PVSADEEE
Subjt:  MSLHNHVDEIDVEHRADKEDGVVDEDMEVLQRAYRLVGVNPEDYIHPRSSSITAGDADPGSDSDDVDDFELLRDIQNRFSIVADEQPLSTLSPVSADEEE

Query:  DEFEMLRSIQRRFAAYESDTLSNKPDQSRDYDGSLKMDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKK
        DEFEMLRSIQRRFAAYESDTLSNKP+QSRDY GSLK+DSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKK
Subjt:  DEFEMLRSIQRRFAAYESDTLSNKPDQSRDYDGSLKMDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKK

Query:  LRKRCKILKDFQCSCKRRTSSALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPAENSHVACHRMALAKFPRVDRKKWSIVERENLGKGIRQQFQE
        LRKRCKILKDFQ SCKRRTS ALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGP ENSHVAC+RM LAKFP VDRKKWSIVERENLGKGIRQQFQE
Subjt:  LRKRCKILKDFQCSCKRRTSSALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPAENSHVACHRMALAKFPRVDRKKWSIVERENLGKGIRQQFQE

Query:  MVLQISVDQFSGQQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTASEDKNLLFTIQQKGLN
        MVLQISVDQ SG QG+SGDSDDLDNILASIKDLDIAP+KIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWT SEDK+LLFTIQQKGLN
Subjt:  MVLQISVDQFSGQQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTASEDKNLLFTIQQKGLN

Query:  NWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTKRGYFTPYEDVRLKIAV
        NWIE+AVSLGTNRTPFQCLSRYQRSLNASILKREWTK+EDD+LRSAVA FGVRDWQAVASTLEGRAGTQCSNRWKKSLDPART++GYFTP ED+RLKIAV
Subjt:  NWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTKRGYFTPYEDVRLKIAV

Query:  MLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAAL
        +LFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAAL
Subjt:  MLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAAL

Query:  ISNFVDRETERPALGPADFRPRPNTDLLCNTDGPRPVPKRNVKTRKTPVSRNEKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKPQRK
        ISNFVDRETERPALGPADFRPRPNTD LCNTDGP P PKRNVKTRK PVSRNEKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKPQRK
Subjt:  ISNFVDRETERPALGPADFRPRPNTDLLCNTDGPRPVPKRNVKTRKTPVSRNEKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKPQRK

Query:  RNRRGASNAKRIGVPDLRSDSKWCAKQNLDTQSLGLQLNSKESDRTNSDCTETVDENILEVFENKVAEKLTEKIACFSEQKKNQNSTGSSGVSVLSEMTS
        RNRRGA  AKRIGVP+LRSDS+WCAKQNLDT+SLGLQLNSKES+R+NS+CTETVDENI+EV ENKVAEKLTE+ ACFSE +KNQNSTGSSGVSVLSEMT+
Subjt:  RNRRGASNAKRIGVPDLRSDSKWCAKQNLDTQSLGLQLNSKESDRTNSDCTETVDENILEVFENKVAEKLTEKIACFSEQKKNQNSTGSSGVSVLSEMTS

Query:  GFVDYNPSILTDTTLLASTTVDDIEELKGKSVADRDLDDSNSFSLPHSCLELRTVDSEGVDSYSVDEFTAKSNGVCNPTQGRRKKNGKTSNNSHDNLFLP
          VDYNPSILTDTTL ASTTVDDIEELKGKS ADRDLDDSNSFSL HSCLELRTVDSEGVDSYSVDE+TAKSNGVCNPTQGRRKKN KTSNNSHDNL +P
Subjt:  GFVDYNPSILTDTTLLASTTVDDIEELKGKSVADRDLDDSNSFSLPHSCLELRTVDSEGVDSYSVDEFTAKSNGVCNPTQGRRKKNGKTSNNSHDNLFLP

Query:  SQQIEQETLGTKKPHRHNQSKKRKHNNTGTSTLGTLEAVEEVDDCTLVGFLQKRLKRTAVTHNETVDCRSSTPLNVDDDDNEPTIASFLNKLKRKKHQRP
         QQI QETLGTKKP  HNQSKKRKH+NTG STL T EAVEEVDDCTLVGFLQKRLKRTA+THNETVDC S+ PL VD+DDNEPTIASFLNKLKRKKHQRP
Subjt:  SQQIEQETLGTKKPHRHNQSKKRKHNNTGTSTLGTLEAVEEVDDCTLVGFLQKRLKRTAVTHNETVDCRSSTPLNVDDDDNEPTIASFLNKLKRKKHQRP

Query:  SGCELN
        SG ELN
Subjt:  SGCELN

A0A1S3BUG0 snRNA-activating protein complex subunit 40.0e+0099.62Show/hide
Query:  MSLHNHVDEIDVEHRADKEDGVVDEDMEVLQRAYRLVGVNPEDYIHPRSSSITAGDADPGSDSDDVDDFELLRDIQNRFSIVADEQPLSTLSPVSADEEE
        MSLHNHVDEIDVEHRADKEDGVVDEDMEVLQRAYRLVGVNPEDYIHPRSSSITAGDADPGSDSDDVDDFELLRDIQNRFSIVADEQPLSTLSPVSADEEE
Subjt:  MSLHNHVDEIDVEHRADKEDGVVDEDMEVLQRAYRLVGVNPEDYIHPRSSSITAGDADPGSDSDDVDDFELLRDIQNRFSIVADEQPLSTLSPVSADEEE

Query:  DEFEMLRSIQRRFAAYESDTLSNKPDQSRDYDGSLKMDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKK
        DEFEMLRSIQRRFAAYESDTLSNKPDQSRDYDGSLKMDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKK
Subjt:  DEFEMLRSIQRRFAAYESDTLSNKPDQSRDYDGSLKMDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKK

Query:  LRKRCKILKDFQCSCKRRTSSALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPAENSHVACHRMALAKFPRVDRKKWSIVERENLGKGIRQQFQE
        LRKRCKILKDFQCSCKRRTSSALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPAENSHVACHRMALAKFPRVDRKKWSIVERENLGKGIRQQFQE
Subjt:  LRKRCKILKDFQCSCKRRTSSALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPAENSHVACHRMALAKFPRVDRKKWSIVERENLGKGIRQQFQE

Query:  MVLQISVDQFSGQQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTASEDKNLLFTIQQKGLN
        MVLQISVDQFSGQQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTASEDKNLLFTIQQKGLN
Subjt:  MVLQISVDQFSGQQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTASEDKNLLFTIQQKGLN

Query:  NWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTKRGYFTPYEDVRLKIAV
        NWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTKRGYFTPYEDVRLKIAV
Subjt:  NWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTKRGYFTPYEDVRLKIAV

Query:  MLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAAL
        MLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAAL
Subjt:  MLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAAL

Query:  ISNFVDRETERPALGPADFRPRPNTDLLCNTDGPRPVPKRNVKTRKTPVSRNEKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKPQRK
        ISNFVDRETERPALGPADFRPRPNTDLLCNTDGPRPVPKRNVKTRKTPVSRNEKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKPQRK
Subjt:  ISNFVDRETERPALGPADFRPRPNTDLLCNTDGPRPVPKRNVKTRKTPVSRNEKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKPQRK

Query:  RNRRGASNAKRIGVPDLRSDSKWCAKQNLDTQSLGLQLNSKESDRTNSDCTETVDENILEVFENKVAEKLTEKIACFSEQKKNQ
        RNRRGASNAKRIGVPDLRSDS+WCAKQNLDTQSLGLQLNSKESDRTNSDCTETVDENILEVFENKVAEKLTEKIACFSEQKK++
Subjt:  RNRRGASNAKRIGVPDLRSDSKWCAKQNLDTQSLGLQLNSKESDRTNSDCTETVDENILEVFENKVAEKLTEKIACFSEQKKNQ

A0A6J1E2J4 uncharacterized protein LOC111430000 isoform X20.0e+0077.94Show/hide
Query:  MSLHNHVDEIDVEHRA---DKEDGVVDEDMEVLQRAYRLVGVNPEDYIHPRSSSITAGDADPGSDSDDVDDFELLRDIQNRFSIVADEQPLSTLSPVSAD
        MS  +HVD  D E  A   D ED +VD+DME L+RA RL GVN ED I+PR S   AGDA+ GSDSDDVDD ELLR+IQNRFS  ADEQPLS L PV+AD
Subjt:  MSLHNHVDEIDVEHRA---DKEDGVVDEDMEVLQRAYRLVGVNPEDYIHPRSSSITAGDADPGSDSDDVDDFELLRDIQNRFSIVADEQPLSTLSPVSAD

Query:  EEEDEFEMLRSIQRRFAAYESDTLSNKPDQSRDYDGSLKMDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEE
        EEED+FE LRSIQRRFAAYESD LSNKPDQS D DG LKMDSD+  V   TSS+R SM+AFEKGSLPKAALAF+DAIKKNRSQQKFIRSKMIHLEARIEE
Subjt:  EEEDEFEMLRSIQRRFAAYESDTLSNKPDQSRDYDGSLKMDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEE

Query:  NKKLRKRCKILKDFQCSCKRRTSSALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPAENSHVACHRMALAKFPRVDRKKWSIVERENLGKGIRQQ
        NKKLRKR K+LK FQ SC+R+T+ AL+QM+DPRVQLISA KPQAKDSSKKDKRLS M YGPAENSHVAC+R AL KF  VDRK+WS  ERENLGKGIRQQ
Subjt:  NKKLRKRCKILKDFQCSCKRRTSSALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPAENSHVACHRMALAKFPRVDRKKWSIVERENLGKGIRQQ

Query:  FQEMVLQISVDQFSGQQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTASEDKNLLFTIQQK
        FQEMVLQISVDQ S  QG S +SDDLDNILASIK LDI PEKIREFLPKVNWDKLA MYLQGRSGAECEARWLNFEDPLINR+ WT SEDKNLLFTIQQK
Subjt:  FQEMVLQISVDQFSGQQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTASEDKNLLFTIQQK

Query:  GLNNWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTKRGYFTPYEDVRLK
        GLNNWIE+AVSLGTNRTPFQCLSRYQRSLNASILK EWTKDEDDKLRSAVAIFG  DWQAVASTLEGR G QCSNRWKKSLDPARTKRGYFTP ED RLK
Subjt:  GLNNWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTKRGYFTPYEDVRLK

Query:  IAVMLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQK
        IAV+LFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNEC RRWKKLFP++VPLLQEARKIQK
Subjt:  IAVMLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQK

Query:  AALISNFVDRETERPALGPADFRPRPNTDLLCNTDGPRPVPKRNVKTRKTPVSRNEKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKP
         ALISNFVDRE+ERPALGP DFRP PN+ LLCNTD P   PKRNV+ R+ PVSRNEKSA GDAPKK KSN QR Q D TAQV  A NTS VP EV+S+KP
Subjt:  AALISNFVDRETERPALGPADFRPRPNTDLLCNTDGPRPVPKRNVKTRKTPVSRNEKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKP

Query:  QRKRNRRGASNAKRIGVPDLRSDSKWCAKQNLDTQSLGLQLNSKE-SDRTNSDCTETVDENILEVFENKVAEKLTEKIACFSEQKKNQNSTGSSGVSVLS
        QRKR R GA   +R G P +  +S+ CA+QN DT+SL +QLN KE ++R NSDC ETVDEN +EVFENK AE  +E + CFSEQ++NQNSTGSSGVSVLS
Subjt:  QRKRNRRGASNAKRIGVPDLRSDSKWCAKQNLDTQSLGLQLNSKE-SDRTNSDCTETVDENILEVFENKVAEKLTEKIACFSEQKKNQNSTGSSGVSVLS

Query:  EMTSGFVDYNPSILTDTTLLASTTVDDIEELKGKSVADRDLDDSNSFSLPHSCLELRTVDSEGVDSYSVDEFTAKSNGVCNPTQGRRKKNGKTSNNSHDN
        EMT+   +YNPS   DTTLLAS T DDI E KG +VAD+DLDDSNSFSLP SCLELRT DSEGVDSYSVDEFT KS+GVC P QGRRKKN K SN S D+
Subjt:  EMTSGFVDYNPSILTDTTLLASTTVDDIEELKGKSVADRDLDDSNSFSLPHSCLELRTVDSEGVDSYSVDEFTAKSNGVCNPTQGRRKKNGKTSNNSHDN

Query:  LFLPSQQIEQETLGTKKPHRHNQSKKRKHNNTGTSTLGTLEAVEEVDDCTLVGFLQKRLKRTAVTHNETVDCRSSTPLNVDDDDNEPTIASFL-NKLKRK
        L +  QQ E E  G  + HR NQSKKRKH+ T TS LGT+EAVEEVDDCTL GFLQKRLKRT  TH++ VD  SSTP  VD+DDN+PT+A  L +KLKRK
Subjt:  LFLPSQQIEQETLGTKKPHRHNQSKKRKHNNTGTSTLGTLEAVEEVDDCTLVGFLQKRLKRTAVTHNETVDCRSSTPLNVDDDDNEPTIASFL-NKLKRK

Query:  KH
        KH
Subjt:  KH

A0A6J1E6Z7 uncharacterized protein LOC111430000 isoform X10.0e+0078.04Show/hide
Query:  MSLHNHVDEIDVEHRA---DKEDGVVDEDMEVLQRAYRLVGVNPEDYIHPRSSSITAGDADPGSDSDDVDDFELLRDIQNRFSIVADEQPLSTLSPVSAD
        MS  +HVD  D E  A   D ED +VD+DME L+RA RL GVN ED I+PR S   AGDA+ GSDSDDVDD ELLR+IQNRFS  ADEQPLS L PV+AD
Subjt:  MSLHNHVDEIDVEHRA---DKEDGVVDEDMEVLQRAYRLVGVNPEDYIHPRSSSITAGDADPGSDSDDVDDFELLRDIQNRFSIVADEQPLSTLSPVSAD

Query:  EEEDEFEMLRSIQRRFAAYESDTLSNKPDQSRDYDGSLKMDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEE
        EEED+FE LRSIQRRFAAYESD LSNKPDQS D DG LKMDSD+  V   TSS+R SM+AFEKGSLPKAALAF+DAIKKNRSQQKFIRSKMIHLEARIEE
Subjt:  EEEDEFEMLRSIQRRFAAYESDTLSNKPDQSRDYDGSLKMDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEE

Query:  NKKLRKRCKILKDFQCSCKRRTSSALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPAENSHVACHRMALAKFPRVDRKKWSIVERENLGKGIRQQ
        NKKLRKR K+LK FQ SC+R+T+ AL+QM+DPRVQLISA KPQAKDSSKKDKRLS M YGPAENSHVAC+R AL KF  VDRK+WS  ERENLGKGIRQQ
Subjt:  NKKLRKRCKILKDFQCSCKRRTSSALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPAENSHVACHRMALAKFPRVDRKKWSIVERENLGKGIRQQ

Query:  FQEMVLQISVDQFSGQQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTASEDKNLLFTIQQK
        FQEMVLQISVDQ S  QG S +SDDLDNILASIK LDI PEKIREFLPKVNWDKLA MYLQGRSGAECEARWLNFEDPLINR+ WT SEDKNLLFTIQQK
Subjt:  FQEMVLQISVDQFSGQQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTASEDKNLLFTIQQK

Query:  GLNNWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTKRGYFTPYEDVRLK
        GLNNWIE+AVSLGTNRTPFQCLSRYQRSLNASILK EWTKDEDDKLRSAVAIFG  DWQAVASTLEGR G QCSNRWKKSLDPARTKRGYFTP ED RLK
Subjt:  GLNNWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTKRGYFTPYEDVRLK

Query:  IAVMLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQK
        IAV+LFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFP++VPLLQEARKIQK
Subjt:  IAVMLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQK

Query:  AALISNFVDRETERPALGPADFRPRPNTDLLCNTDGPRPVPKRNVKTRKTPVSRNEKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKP
         ALISNFVDRE+ERPALGP DFRP PN+ LLCNTD P   PKRNV+ R+ PVSRNEKSA GDAPKK KSN QR Q D TAQV  A NTS VP EV+S+KP
Subjt:  AALISNFVDRETERPALGPADFRPRPNTDLLCNTDGPRPVPKRNVKTRKTPVSRNEKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKP

Query:  QRKRNRRGASNAKRIGVPDLRSDSKWCAKQNLDTQSLGLQLNSKE-SDRTNSDCTETVDENILEVFENKVAEKLTEKIACFSEQKKNQNSTGSSGVSVLS
        QRKR R GA   +R G P +  +S+ CA+QN DT+SL +QLN KE ++R NSDC ETVDEN +EVFENK AE  +E + CFSEQ++NQNSTGSSGVSVLS
Subjt:  QRKRNRRGASNAKRIGVPDLRSDSKWCAKQNLDTQSLGLQLNSKE-SDRTNSDCTETVDENILEVFENKVAEKLTEKIACFSEQKKNQNSTGSSGVSVLS

Query:  EMTSGFVDYNPSILTDTTLLASTTVDDIEELKGKSVADRDLDDSNSFSLPHSCLELRTVDSEGVDSYSVDEFTAKSNGVCNPTQGRRKKNGKTSNNSHDN
        EMT+   +YNPS   DTTLLAS T DDI E KG +VAD+DLDDSNSFSLP SCLELRT DSEGVDSYSVDEFT KS+GVC P QGRRKKN K SN S D+
Subjt:  EMTSGFVDYNPSILTDTTLLASTTVDDIEELKGKSVADRDLDDSNSFSLPHSCLELRTVDSEGVDSYSVDEFTAKSNGVCNPTQGRRKKNGKTSNNSHDN

Query:  LFLPSQQIEQETLGTKKPHRHNQSKKRKHNNTGTSTLGTLEAVEEVDDCTLVGFLQKRLKRTAVTHNETVDCRSSTPLNVDDDDNEPTIASFL-NKLKRK
        L +  QQ E E  G  + HR NQSKKRKH+ T TS LGT+EAVEEVDDCTL GFLQKRLKRT  TH++ VD  SSTP  VD+DDN+PT+A  L +KLKRK
Subjt:  LFLPSQQIEQETLGTKKPHRHNQSKKRKHNNTGTSTLGTLEAVEEVDDCTLVGFLQKRLKRTAVTHNETVDCRSSTPLNVDDDDNEPTIASFL-NKLKRK

Query:  KH
        KH
Subjt:  KH

A0A6J1JKV7 uncharacterized protein LOC111485355 isoform X10.0e+0077.87Show/hide
Query:  MSLHNHVDEIDVEHRA---DKEDGVVDEDMEVLQRAYRLVGVNPEDYIHPRSSSITAGDADPGSDSDDVDDFELLRDIQNRFSIVADEQPLSTLSPVSAD
        MS  +HVD  D E  A   D ED +VD+DME L+RA RL GVN EDY++P+ S   AGDA+ GSDSDDVDD ELLR+IQNRFSI ADEQPLS L PV+AD
Subjt:  MSLHNHVDEIDVEHRA---DKEDGVVDEDMEVLQRAYRLVGVNPEDYIHPRSSSITAGDADPGSDSDDVDDFELLRDIQNRFSIVADEQPLSTLSPVSAD

Query:  EEEDEFEMLRSIQRRFAAYESDTLSNKPDQSRDYDGSLKMDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEE
        EEED+FE LRSIQRRFAAYESD LSNKPDQS D DG LKMDSD+  VE  TSS+R SM+AFEKGSLPKAALAF+DAIKKNRSQQKF+RSKMIHLEARIEE
Subjt:  EEEDEFEMLRSIQRRFAAYESDTLSNKPDQSRDYDGSLKMDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEE

Query:  NKKLRKRCKILKDFQCSCKRRTSSALSQMIDPRVQLISAAKP-QAKDSSKKDKRLSGMYYGPAENSHVACHRMALAKFPRVDRKKWSIVERENLGKGIRQ
        NKKLRKR K+LK FQ SC+R+T+ ALSQM+DPRVQLISA KP QAKDSSKKDKRLS M YGPAENSHVAC+R+AL KF  VDRK+WS  ERENLGKGIRQ
Subjt:  NKKLRKRCKILKDFQCSCKRRTSSALSQMIDPRVQLISAAKP-QAKDSSKKDKRLSGMYYGPAENSHVACHRMALAKFPRVDRKKWSIVERENLGKGIRQ

Query:  QFQEMVLQISVDQFSGQQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTASEDKNLLFTIQQ
        QFQEMVLQISVDQ S  QG S +SDDLDNILASIKDLDI PEKIREFLPKVNWDKLASMYL+GRSGAECEARWLNFEDPLINR+PWT SEDKNLLFTIQQ
Subjt:  QFQEMVLQISVDQFSGQQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTASEDKNLLFTIQQ

Query:  KGLNNWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTKRGYFTPYEDVRL
        KGLNNWI++AVSLGTNRTPFQ LSRYQRSLNASILK EWTKDEDDKLRSAVAIFG  DWQAVASTLEGR G QCSNRWKKSLDPARTKRGYFTP ED RL
Subjt:  KGLNNWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTKRGYFTPYEDVRL

Query:  KIAVMLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQ
        KIAV+LFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFP++VPLLQEARKIQ
Subjt:  KIAVMLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQ

Query:  KAALISNFVDRETERPALGPADFRPRPNTDLLCNTDGPRPVPKRNVKTRKTPVSRNEKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSK
        K ALISNFVDRE+ERPALGP DFRP PN+ LLCNTD P   PKRNV+TR+ PVSRNEKSA GDAPKKRKSN QR + D TAQV  A NTS VP EV+S+K
Subjt:  KAALISNFVDRETERPALGPADFRPRPNTDLLCNTDGPRPVPKRNVKTRKTPVSRNEKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSK

Query:  PQRKRNRRGASNAKRIGVPDLRSDSKWCAKQNLDTQSLGLQLNSKE-SDRTNSDCTETVDENILEVFENKVAEKLTEKIACFSEQKKNQNSTGSSGVSVL
        PQRKR R GA   +R G P +  +S+ CA+QN DT++L +QLN KE ++R NSDC ETVDEN +EVFENK AE  +E + CFSEQ++NQNSTGSSGVSVL
Subjt:  PQRKRNRRGASNAKRIGVPDLRSDSKWCAKQNLDTQSLGLQLNSKE-SDRTNSDCTETVDENILEVFENKVAEKLTEKIACFSEQKKNQNSTGSSGVSVL

Query:  SEMTSGFVDYNPSILTDTTLLASTTVDDIEELKGKSVADRDLDDSNSFSLPHSCLELRTVDSEGVDSYSVDEFTAKSNGVCNPTQGRRKKNGKTSNNSHD
        SEMT+   +YNPS L DTTLLAS T DDI E KG +VAD+DLD SNSFSLP SCLELRT DSEGVDSYSVDEFT KS+ VC P QGRRKKN K SN S D
Subjt:  SEMTSGFVDYNPSILTDTTLLASTTVDDIEELKGKSVADRDLDDSNSFSLPHSCLELRTVDSEGVDSYSVDEFTAKSNGVCNPTQGRRKKNGKTSNNSHD

Query:  NLFLPSQQIEQETLGTKKPHRHNQSKKRKHNNTGTSTLGTLEAVEEVDDCTLVGFLQKRLKRTAVTHNETVDCRSSTPLNVDDDDNEPTIASFL-NKLKR
        +L +  QQ E E  GT + HR NQ KKRKH++T TS LGT+EAVEEVDDCTL+GFLQKRLKRT  TH + VD  SST   VD+DDN+PT+A  L  KLKR
Subjt:  NLFLPSQQIEQETLGTKKPHRHNQSKKRKHNNTGTSTLGTLEAVEEVDDCTLVGFLQKRLKRTAVTHNETVDCRSSTPLNVDDDDNEPTIASFL-NKLKR

Query:  KKH
        KKH
Subjt:  KKH

SwissProt top hitse value%identityAlignment
P91868 snRNA-activating protein complex subunit 4 homolog2.9e-3129Show/hide
Query:  VNWDKLASMYLQG-RSGAECEARWLNFEDPLINRDPWTASEDKNLLFTIQQKGLNNWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRS
        V W  +A+   +G R+    +++W N  +P  N++ W+  E + L +  +     +W  +A++LGTNRT +QC+ +Y+  ++     +EW++DED KL +
Subjt:  VNWDKLASMYLQG-RSGAECEARWLNFEDPLINRDPWTASEDKNLLFTIQQKGLNNWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRS

Query:  AVAIFGVR---DWQAVASTLEGRAGTQCSNRWKKSLDPARTKRGYFTPYEDVRLKIAVMLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCE-WT
           I  +     W  VA  + GR   Q   R+  +LD A  K G +T  EDV L  AV  +G K+W K A+ +  RN  QCRERW N L+ S    E +T
Subjt:  AVAIFGVR---DWQAVASTLEGRAGTQCSNRWKKSLDPARTKRGYFTPYEDVRLKIAVMLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCE-WT

Query:  EEEDLRLEIAIQEHGY-SWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAALISNFVD------RETERPALGPADFRPRPNTDLLCNTDGP
          ED +L  A++  G  +WAK    +P +T  + RRR+ +L          A K++ AA   N VD      R  E   L   D             +  
Subjt:  EEEDLRLEIAIQEHGY-SWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAALISNFVD------RETERPALGPADFRPRPNTDLLCNTDGP

Query:  RPVPKRNVKTRKTPVSRNEKSATGDAPK-------KRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSK
              N    +TP    ++ +  + P        K K +YQ+ Q      V    N + + +E+ SS+
Subjt:  RPVPKRNVKTRKTPVSRNEKSATGDAPK-------KRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSK

Q08759 Transcriptional activator Myb1.4e-3037.57Show/hide
Query:  LSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTKRGYFTPYEDVRLKIAVMLFGPKNWNKKAEFLPGR
        LS+ +R L     K  WT++ED+KL+  V   G  +W+ +AS L  R   QC +RW+K L+P   K G +T  ED R+   V  +GPK W+  A+ L GR
Subjt:  LSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTKRGYFTPYEDVRLKIAVMLFGPKNWNKKAEFLPGR

Query:  NQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLF---PDEVPLLQEARKIQKAALISNF
           QCRERW N L+P +++  WTEEED  +  A +  G  WA++A  +P RTDN  +  W        ++   LQ + K  +  +++NF
Subjt:  NQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLF---PDEVPLLQEARKIQKAALISNF

Q54NA6 Myb-like protein L2.8e-5536.32Show/hide
Query:  PAENSHVACHRMALAKFPRVDR-KKWSIVERENLGKGIRQQ-FQEMVLQISVDQFSG----------QQGVSGDSDDLDNI-----LASIKDLDIAPEKI
        PA+N      R+     P   + ++W+  E E L KGI+++  Q+ + ++S D+ S           Q+  + ++++ +NI       SIKD        
Subjt:  PAENSHVACHRMALAKFPRVDR-KKWSIVERENLGKGIRQQ-FQEMVLQISVDQFSG----------QQGVSGDSDDLDNI-----LASIKDLDIAPEKI

Query:  REFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTASEDKNLLFTIQQKGLNNWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDED
         + + +V  + L       RS  E   RW N +DP IN+ P+T  EDK LL   ++   + W +I++ LGTNRTP  C+ RYQRSLN+ ++KREWTK+ED
Subjt:  REFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTASEDKNLLFTIQQKGLNNWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDED

Query:  DKLRSAVAI--FGVR-DWQAVASTLEGRAGTQCSNRWKKSLDPARTKRGYFTPYEDVRLKIAVMLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRR
        + L   + +   G R DWQ +   + GR G QC +RW K+LDP+  K+G ++P ED  L  AV  +G  NW      + GR  VQCRER+ N LDP L +
Subjt:  DKLRSAVAI--FGVR-DWQAVASTLEGRAGTQCSNRWKKSLDPARTKRGYFTPYEDVRLKIAVMLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRR

Query:  CEWTEEEDLRLEIAIQEHGY-SWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKI--QKAALISNFVDRETERPAL
          WT +ED RL     + G   W+ VA  + +RTDN+C RRWK+L      L     K+  +K   +SNF  R+ ER  L
Subjt:  CEWTEEEDLRLEIAIQEHGY-SWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKI--QKAALISNFVDRETERPAL

Q5SXM2 snRNA-activating protein complex subunit 45.4e-3827.56Show/hide
Query:  IKKNRSQQKFIRSKMIHLEARIEENKKLRKRCKILKDFQCSCKRRTSSALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPAENSHV-ACHRMALA
        ++ N   Q+ I+ K+      + +N++ ++  ++++D   S  + T     + + P   +    KP  KD      +++G+  GP  N          + 
Subjt:  IKKNRSQQKFIRSKMIHLEARIEENKKLRKRCKILKDFQCSCKRRTSSALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPAENSHV-ACHRMALA

Query:  KFPRVDRKKWSIVERENLGKGIRQQFQEMVLQISVDQFSG-QQGVSGDSDDLD---------NILASIKDLDIAPEK--IREFLPKVNWDKLASMYLQG-
         F  +   KW   E+  L K +     + +LQ  + +     Q  S  S +L+              I+D++  PE+  +   L   +W+K++++  +G 
Subjt:  KFPRVDRKKWSIVERENLGKGIRQQFQEMVLQISVDQFSG-QQGVSGDSDDLD---------NILASIKDLDIAPEK--IREFLPKVNWDKLASMYLQG-

Query:  RSGAECEARWLNFEDPLINRDPWTASEDKNLLFTIQQKGLNNWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVRD---WQ
        RS  E    W N E P IN+  W+  E++ L       G   W +IA  LGT+R+ FQCL ++Q+  N ++ ++EWT++ED  L   V    V     ++
Subjt:  RSGAECEARWLNFEDPLINRDPWTASEDKNLLFTIQQKGLNNWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVRD---WQ

Query:  AVASTLEGRAGTQCSNRWKKSLDPARTKRGYFTPYEDVRLKIAVMLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHG
         +   +EGR   Q   RW KSLDP   K+GY+ P ED +L  AV  +G ++W K  E +PGR+  QCR+R+   L  SL++  W  +E+ +L   I+++G
Subjt:  AVASTLEGRAGTQCSNRWKKSLDPARTKRGYFTPYEDVRLKIAVMLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHG

Query:  YS-WAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARK
           WAK+A+ +P R+ ++C  +WK +   +  L +  R+
Subjt:  YS-WAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARK

Q8BP86 snRNA-activating protein complex subunit 49.2e-3833.59Show/hide
Query:  IKDLDIAPEK--IREFLPKVNWDKLASMYLQG-RSGAECEARWLNFEDPLINRDPWTASEDKNLLFTIQQKGLNNWIEIAVSLGTNRTPFQCLSRYQRSL
        I+D++  PE+  +   L   +W+K++++  +G RS  E    W + E P I++  W+  E + L       G   W  +A  LGT+R+ FQCL ++Q+  
Subjt:  IKDLDIAPEK--IREFLPKVNWDKLASMYLQG-RSGAECEARWLNFEDPLINRDPWTASEDKNLLFTIQQKGLNNWIEIAVSLGTNRTPFQCLSRYQRSL

Query:  NASILKREWTKDEDDKLRSAVAIFGVRD---WQAVASTLEGRAGTQCSNRWKKSLDPARTKRGYFTPYEDVRLKIAVMLFGPKNWNKKAEFLPGRNQVQC
        N ++ ++EWT++ED  L   V    V +   ++ +   +EGR   Q   RW KSLDP+  KRG++ P ED +L  AV  +G ++W K  E +PGR+  QC
Subjt:  NASILKREWTKDEDDKLRSAVAIFGVRD---WQAVASTLEGRAGTQCSNRWKKSLDPARTKRGYFTPYEDVRLKIAVMLFGPKNWNKKAEFLPGRNQVQC

Query:  RERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYS-WAKVAACVPSRTDNECRRRWKKL
        R+R+   L  SL++  W  +E+ +L   I+++G   WA++A+ +P R+ ++C  +WK L
Subjt:  RERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYS-WAKVAACVPSRTDNECRRRWKKL

Arabidopsis top hitse value%identityAlignment
AT3G09370.1 myb domain protein 3r-32.3e-2840.82Show/hide
Query:  KREWTKDEDDKLRSAVAIFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTKRGYFTPYEDVRLKIAVMLFGPKNWNKKAEFLPGRNQVQCRERWFNCL
        K  WT +ED+ LR AV  F  + W+ +A +   R   QC +RW+K L+P   K G +T  ED ++   V  +GP  W+  A+ LPGR   QCRERW N L
Subjt:  KREWTKDEDDKLRSAVAIFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTKRGYFTPYEDVRLKIAVMLFGPKNWNKKAEFLPGRNQVQCRERWFNCL

Query:  DPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRW
        +P + +  WT EE++ L  A + HG  WA++A  +P RTDN  +  W
Subjt:  DPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRW

AT3G09370.2 myb domain protein 3r-32.3e-2840.82Show/hide
Query:  KREWTKDEDDKLRSAVAIFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTKRGYFTPYEDVRLKIAVMLFGPKNWNKKAEFLPGRNQVQCRERWFNCL
        K  WT +ED+ LR AV  F  + W+ +A +   R   QC +RW+K L+P   K G +T  ED ++   V  +GP  W+  A+ LPGR   QCRERW N L
Subjt:  KREWTKDEDDKLRSAVAIFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTKRGYFTPYEDVRLKIAVMLFGPKNWNKKAEFLPGRNQVQCRERWFNCL

Query:  DPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRW
        +P + +  WT EE++ L  A + HG  WA++A  +P RTDN  +  W
Subjt:  DPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRW

AT3G18100.1 myb domain protein 4r12.1e-16248.24Show/hide
Query:  SDSDDVDDFELLRDIQNRFSIVADEQPLSTLSPV--SADEEEDEFEMLRSIQRRFAAY-----ESDTLSNKPDQSRDYDGSLKMDSDDIAVESQTSSKRP
        SDS+  DDFE++R I+++ S+  D     +L P+  S DEE+D FE LR+I+RRF+AY     E   +++   + +    S    S +I   S T    P
Subjt:  SDSDDVDDFELLRDIQNRFSIVADEQPLSTLSPV--SADEEEDEFEMLRSIQRRFAAY-----ESDTLSNKPDQSRDYDGSLKMDSDDIAVESQTSSKRP

Query:  -----------------SMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRCKILKDFQCSCKRRTSSALSQMIDPRVQLISA
                           +     S P+AA AFVDAI++NR+ QKF+R K+  +EA IE+N+K +K  +I+KDFQ SCKR T  AL Q  DPRV+LIS 
Subjt:  -----------------SMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRCKILKDFQCSCKRRTSSALSQMIDPRVQLISA

Query:  AKPQAKDSSK----------KDKRLSGMYYGPAENSHVACHRMALAKFP-RVDRKKWSIVERENLGKGIRQQFQEMVLQISVDQFSGQQGVSGDSDDLDN
         K    DSS+           DK++S +  GPAEN  V  +RMAL K+P  V R+KWS  E +NL KG++Q+ Q+++L  ++++ S  +G    + D+D 
Subjt:  AKPQAKDSSK----------KDKRLSGMYYGPAENSHVACHRMALAKFP-RVDRKKWSIVERENLGKGIRQQFQEMVLQISVDQFSGQQGVSGDSDDLDN

Query:  ILASIKDLDIAPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTASEDKNLLFTIQQKGLNNWIEIAVSLGTNRTPFQCLSRYQRS
        I  SI +L+I PE IR+FLPK+NWD   S+ ++ RS AECEARW++ EDPLIN  PWTA+EDKNLL TI+Q  L +W++IAVSLGTNRTPFQCL+RYQRS
Subjt:  ILASIKDLDIAPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTASEDKNLLFTIQQKGLNNWIEIAVSLGTNRTPFQCLSRYQRS

Query:  LNASILKREWTKDEDDKLRSAVAIFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTKRGYFTPYEDVRLKIAVMLFGPKNWNKKAEFLPGRNQVQCRE
        LN SILK+EWT +EDD+LR+AV +FG +DWQ+VA+ L+GR GTQCSNRWKKSL P  T++G ++  ED R+K+AV LFG +NW+K ++F+PGR Q QCRE
Subjt:  LNASILKREWTKDEDDKLRSAVAIFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTKRGYFTPYEDVRLKIAVMLFGPKNWNKKAEFLPGRNQVQCRE

Query:  RWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAALISNFVDRETERPALGPADFRPRPNT
        RW NCLDP + R +WTEEED +L  AI EHGYSW+KVA  +  RTDN+C RRWK+L+P +V LLQEAR++QK A + NFVDRE+ERPAL  +     P+ 
Subjt:  RWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAALISNFVDRETERPALGPADFRPRPNT

Query:  DLLCNTDGPRPVPKRNVKTRKTPVSRNEKSATGDAPKKRKSNYQRFQTDATAQ
         L    D      KR  K +K+   R         PK+R+   +    D   Q
Subjt:  DLLCNTDGPRPVPKRNVKTRKTPVSRNEKSATGDAPKKRKSNYQRFQTDATAQ

AT3G18100.2 myb domain protein 4r12.3e-15353.03Show/hide
Query:  SLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRCKILKDFQCSCKRRTSSALSQMIDPRVQLISAAKPQAKDSSK----------KDKRL
        S P+AA AFVDAI++NR+ QKF+R K+  +EA IE+N+K +K  +I+KDFQ SCKR T  AL Q  DPRV+LIS  K    DSS+           DK++
Subjt:  SLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRCKILKDFQCSCKRRTSSALSQMIDPRVQLISAAKPQAKDSSK----------KDKRL

Query:  SGMYYGPAENSHVACHRMALAKFP-RVDRKKWSIVERENLGKGIRQQFQEMVLQISVDQFSGQQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWD
        S +  GPAEN  V  +RMAL K+P  V R+KWS  E +NL KG++Q+ Q+++L  ++++ S  +G    + D+D I  SI +L+I PE IR+FLPK+NWD
Subjt:  SGMYYGPAENSHVACHRMALAKFP-RVDRKKWSIVERENLGKGIRQQFQEMVLQISVDQFSGQQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWD

Query:  KLASMYLQGRSGAECEARWLNFEDPLINRDPWTASEDKNLLFTIQQKGLNNWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIF
           S+ ++ RS AECEARW++ EDPLIN  PWTA+EDKNLL TI+Q  L +W++IAVSLGTNRTPFQCL+RYQRSLN SILK+EWT +EDD+LR+AV +F
Subjt:  KLASMYLQGRSGAECEARWLNFEDPLINRDPWTASEDKNLLFTIQQKGLNNWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIF

Query:  GVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTKRGYFTPYEDVRLKIAVMLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEI
        G +DWQ+VA+ L+GR GTQCSNRWKKSL P  T++G ++  ED R+K+AV LFG +NW+K ++F+PGR Q QCRERW NCLDP + R +WTEEED +L  
Subjt:  GVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTKRGYFTPYEDVRLKIAVMLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEI

Query:  AIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAALISNFVDRETERPALGPADFRPRPNTDLLCNTDGPRPVPKRNVKTRKTPVS
        AI EHGYSW+KVA  +  RTDN+C RRWK+L+P +V LLQEAR++QK A + NFVDRE+ERPAL  +     P+  L    D      KR  K +K+   
Subjt:  AIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAALISNFVDRETERPALGPADFRPRPNTDLLCNTDGPRPVPKRNVKTRKTPVS

Query:  RNEKSATGDAPKKRKSNYQRFQTDATAQ
        R         PK+R+   +    D   Q
Subjt:  RNEKSATGDAPKKRKSNYQRFQTDATAQ

AT3G18100.3 myb domain protein 4r12.4e-14243.8Show/hide
Query:  NHVDEIDVEHRADKEDGVVDEDMEVLQRAYRLVGVNPEDYIHPRSSSIT---AGDADPGSDSDDVDDFELLRDIQNRFSIVAD----EQPLSTLSPVSAD
        N + E D +   D+ED  + ED+E L+RA  +  VN  D    ++ SI     G  +  SDS++ DDFE+LR I+++ +   D      P   LS +S  
Subjt:  NHVDEIDVEHRADKEDGVVDEDMEVLQRAYRLVGVNPEDYIHPRSSSIT---AGDADPGSDSDDVDDFELLRDIQNRFSIVAD----EQPLSTLSPVSAD

Query:  EEEDEFEMLRSIQRRFAAYESDTLSNKPDQSRDYDGSLKMDSDDIAVESQTSSKR-----PSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLE
        E ED+FEM+RSI+ +        LS   D S    G L  D +D A E+  + +R      +   F   S  K      + +   ++Q+ +   KM+  +
Subjt:  EEEDEFEMLRSIQRRFAAYESDTLSNKPDQSRDYDGSLKMDSDDIAVESQTSSKR-----PSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLE

Query:  ARIEENKKLRKRC-----------KILKDFQCSCKRRTSSALS--QMIDPRVQLISAAKPQAKDSSK----------KDKRLSGMYYGPAENSHVACHRM
         R++ + KL +R              L++     K+R+S   +  +M DPRV+LIS  K    DSS+           DK++S +  GPAEN  V  +RM
Subjt:  ARIEENKKLRKRC-----------KILKDFQCSCKRRTSSALS--QMIDPRVQLISAAKPQAKDSSK----------KDKRLSGMYYGPAENSHVACHRM

Query:  ALAKFP-RVDRKKWSIVERENLGKGIRQQFQEMVLQISVDQFSGQQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDKLASMYLQGRSGAECEAR
        AL K+P  V R+KWS  E +NL KG++Q+ Q+++L  ++++ S  +G    + D+D I  SI +L+I PE IR+FLPK+NWD   S+ ++ RS AECEAR
Subjt:  ALAKFP-RVDRKKWSIVERENLGKGIRQQFQEMVLQISVDQFSGQQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDKLASMYLQGRSGAECEAR

Query:  WLNFEDPLINRDPWTASEDKNLLFTIQQKGLNNWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVRDWQAVASTLEGRAGT
        W++ EDPLIN  PWTA+EDKNLL TI+Q  L +W++IAVSLGTNRTPFQCL+RYQRSLN SILK+EWT +EDD+LR+AV +FG +DWQ+VA+ L+GR GT
Subjt:  WLNFEDPLINRDPWTASEDKNLLFTIQQKGLNNWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVRDWQAVASTLEGRAGT

Query:  QCSNRWKKSLDPARTKRGYFTPYEDVRLKIAVMLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPS
        QCSNRWKKSL P  T++G ++  ED R+K+AV LFG +NW+K ++F+PGR Q QCRERW NCLDP + R +WTEEED +L  AI EHGYSW+KVA  +  
Subjt:  QCSNRWKKSLDPARTKRGYFTPYEDVRLKIAVMLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPS

Query:  RTDNECRRRWKKLFPDEVPLLQEARKIQKAALISNFVDRETERPALGPADFRPRPNTDLLCNTDGPRPVPKRNVKTRKTPVSRNEKSATGDAPKKRKSNY
        RTDN+C RRWK+L+P +V LLQEAR++QK A + NFVDRE+ERPAL  +     P+  L    D      KR  K +K+   R         PK+R+   
Subjt:  RTDNECRRRWKKLFPDEVPLLQEARKIQKAALISNFVDRETERPALGPADFRPRPNTDLLCNTDGPRPVPKRNVKTRKTPVSRNEKSATGDAPKKRKSNY

Query:  QRFQTDATAQ
        +    D   Q
Subjt:  QRFQTDATAQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTCTCCACAACCATGTTGATGAAATTGACGTTGAGCATCGTGCTGACAAGGAAGATGGTGTGGTTGATGAGGACATGGAAGTTCTTCAGAGAGCCTATAGGCTTGT
TGGTGTTAATCCTGAGGATTATATTCATCCCAGGTCGTCATCAATTACTGCTGGAGACGCTGATCCTGGTTCTGATTCTGATGATGTTGATGATTTTGAACTTCTTCGAG
ATATTCAGAATCGGTTCTCGATTGTGGCTGATGAGCAGCCACTGAGTACTCTTTCACCAGTGTCAGCAGACGAGGAGGAAGATGAATTTGAGATGCTTCGTTCTATACAG
CGGCGCTTTGCAGCGTATGAAAGTGATACTTTGAGCAATAAACCCGATCAGTCTCGTGACTATGATGGGTCTCTGAAGATGGATTCTGACGACATAGCTGTTGAGAGCCA
GACATCCTCAAAAAGGCCATCCATGCTAGCCTTTGAAAAGGGAAGCTTGCCAAAGGCTGCGTTGGCATTTGTTGATGCCATCAAGAAGAATAGGTCCCAGCAGAAGTTTA
TTCGTAGTAAGATGATTCATCTTGAAGCTAGAATAGAGGAGAACAAAAAACTCAGAAAACGTTGCAAAATTCTCAAAGATTTCCAGTGTTCATGTAAGCGAAGAACAAGT
TCTGCACTGTCTCAAATGATAGACCCTCGAGTCCAGTTAATTTCAGCTGCAAAACCACAAGCAAAGGATTCATCAAAGAAGGACAAACGATTATCTGGAATGTATTATGG
CCCGGCTGAGAATTCTCATGTGGCGTGTCACAGAATGGCATTGGCAAAGTTTCCCCGTGTTGATCGAAAGAAATGGTCCATTGTAGAAAGGGAGAATCTTGGGAAGGGAA
TAAGACAGCAGTTTCAGGAGATGGTGCTTCAGATTTCAGTGGATCAATTCAGTGGGCAACAAGGAGTTTCTGGAGATTCAGATGATTTGGATAACATTCTTGCATCAATA
AAAGATCTTGACATCGCTCCTGAAAAGATTAGGGAATTTCTGCCAAAAGTTAATTGGGACAAATTGGCTTCCATGTACCTTCAGGGTCGCTCAGGGGCAGAATGTGAAGC
AAGGTGGTTGAATTTTGAAGACCCCCTAATTAATCGAGATCCATGGACTGCAAGTGAGGATAAAAATCTTTTGTTTACTATCCAACAGAAGGGCTTGAATAACTGGATTG
AGATAGCAGTTTCATTGGGCACAAATAGAACTCCCTTCCAGTGCTTGTCTCGGTATCAAAGGAGTTTAAATGCTTCCATATTAAAGAGGGAGTGGACCAAAGATGAGGAT
GATAAACTTCGATCTGCTGTTGCTATATTTGGCGTGAGAGATTGGCAGGCAGTAGCTTCTACTTTGGAAGGACGAGCTGGCACACAGTGCTCTAATAGGTGGAAAAAATC
CCTTGACCCAGCTAGGACAAAAAGAGGTTATTTCACTCCATATGAAGACGTTCGTTTGAAAATTGCTGTAATGCTCTTTGGGCCCAAAAATTGGAACAAGAAAGCAGAAT
TTTTGCCTGGTCGGAATCAAGTTCAATGCAGAGAAAGATGGTTTAATTGTTTAGATCCTTCCTTGAGAAGATGTGAGTGGACAGAAGAGGAGGATTTAAGGCTGGAGATA
GCAATTCAGGAACATGGATATAGCTGGGCTAAGGTAGCTGCATGTGTGCCATCACGTACAGATAATGAGTGTCGCAGGAGATGGAAGAAGTTATTTCCCGATGAAGTTCC
CTTGCTCCAGGAAGCTAGAAAGATTCAGAAGGCTGCTCTCATTAGCAACTTTGTTGATAGGGAAACAGAGCGTCCTGCTCTTGGTCCTGCTGACTTTCGACCCAGGCCAA
ATACAGATTTATTATGTAATACTGATGGTCCAAGACCTGTCCCAAAAAGAAATGTGAAAACAAGAAAGACGCCCGTGTCAAGGAATGAAAAGAGTGCTACTGGGGATGCT
CCAAAGAAGAGGAAATCAAATTATCAGAGGTTTCAGACGGATGCAACTGCTCAGGTGGGTATTGCATATAATACCTCTTTTGTCCCAGAAGAGGTTCAATCTTCAAAGCC
TCAAAGGAAACGAAATAGACGTGGAGCTTCTAATGCAAAGAGAATAGGGGTACCGGACCTACGTTCTGACAGCAAGTGGTGTGCTAAACAGAATTTGGACACTCAGAGCC
TTGGGTTGCAGCTGAATAGTAAGGAATCTGATAGGACCAACAGTGACTGTACCGAGACTGTCGATGAGAACATTCTGGAAGTTTTTGAGAACAAAGTTGCAGAGAAGCTT
ACTGAAAAAATTGCATGCTTCTCGGAACAAAAAAAAAATCAAAACTCAACCGGATCTTCTGGAGTCTCAGTATTGTCGGAAATGACTAGCGGCTTCGTTGACTATAATCC
CTCTATCCTCACAGATACAACATTGTTAGCTAGTACTACTGTGGATGATATCGAAGAATTGAAGGGCAAGAGTGTTGCAGACAGGGATCTGGATGACAGTAACAGTTTCT
CGTTACCGCACAGTTGCTTAGAACTCCGGACAGTTGACAGTGAAGGCGTAGACAGTTATTCTGTGGACGAATTTACAGCTAAAAGCAATGGGGTTTGCAATCCCACCCAA
GGCAGGAGGAAGAAAAATGGCAAAACATCAAATAACAGCCACGATAATTTGTTTCTTCCTAGTCAACAAATTGAGCAAGAGACATTGGGAACGAAGAAGCCTCATCGTCA
TAATCAATCAAAGAAGAGAAAACATAACAATACAGGTACAAGTACGTTAGGAACATTGGAGGCAGTTGAGGAGGTTGATGACTGCACTCTCGTGGGGTTCTTGCAAAAAA
GATTGAAGAGGACGGCAGTGACACATAACGAGACAGTTGATTGCCGTTCTAGTACTCCTTTAAATGTTGATGATGACGATAATGAGCCTACCATTGCCTCTTTTCTTAAT
AAATTGAAGAGAAAAAAGCATCAGCGGCCTAGTGGTTGTGAGTTAAATTAA
mRNA sequenceShow/hide mRNA sequence
AATTCATTAATTCTTTCTTCCTCCAACCCTACGCACAAAAACTCCTTCCAATTAGTTTTGCGTCTTCAGCCCCTTGCCTACGCTTCAAGCACCTGTTGCCGCCAACCCAT
CGCGCCACCGACTGGCTCTGCCTCCGTTCGATCGTCGGCCATCCAAGCGTATCCGGCAAGTTGCCGGAAACTTCACTCTCCATCTCTCTCTACGACCCGTTCCATCACAG
CCGCGCGTCGCCAATCTCCTGCCAGCCATCGCCACCACTGCCATTCGCCGGATTCCGCGTTCGCCTGCCGTCTTCCACCGGTCTCTGCCTCTATCACCGTATTATCTTGC
GTTTGGTATGTTTTGCTCACACAGCCATGTCTCTCCACAACCATGTTGATGAAATTGACGTTGAGCATCGTGCTGACAAGGAAGATGGTGTGGTTGATGAGGACATGGAA
GTTCTTCAGAGAGCCTATAGGCTTGTTGGTGTTAATCCTGAGGATTATATTCATCCCAGGTCGTCATCAATTACTGCTGGAGACGCTGATCCTGGTTCTGATTCTGATGA
TGTTGATGATTTTGAACTTCTTCGAGATATTCAGAATCGGTTCTCGATTGTGGCTGATGAGCAGCCACTGAGTACTCTTTCACCAGTGTCAGCAGACGAGGAGGAAGATG
AATTTGAGATGCTTCGTTCTATACAGCGGCGCTTTGCAGCGTATGAAAGTGATACTTTGAGCAATAAACCCGATCAGTCTCGTGACTATGATGGGTCTCTGAAGATGGAT
TCTGACGACATAGCTGTTGAGAGCCAGACATCCTCAAAAAGGCCATCCATGCTAGCCTTTGAAAAGGGAAGCTTGCCAAAGGCTGCGTTGGCATTTGTTGATGCCATCAA
GAAGAATAGGTCCCAGCAGAAGTTTATTCGTAGTAAGATGATTCATCTTGAAGCTAGAATAGAGGAGAACAAAAAACTCAGAAAACGTTGCAAAATTCTCAAAGATTTCC
AGTGTTCATGTAAGCGAAGAACAAGTTCTGCACTGTCTCAAATGATAGACCCTCGAGTCCAGTTAATTTCAGCTGCAAAACCACAAGCAAAGGATTCATCAAAGAAGGAC
AAACGATTATCTGGAATGTATTATGGCCCGGCTGAGAATTCTCATGTGGCGTGTCACAGAATGGCATTGGCAAAGTTTCCCCGTGTTGATCGAAAGAAATGGTCCATTGT
AGAAAGGGAGAATCTTGGGAAGGGAATAAGACAGCAGTTTCAGGAGATGGTGCTTCAGATTTCAGTGGATCAATTCAGTGGGCAACAAGGAGTTTCTGGAGATTCAGATG
ATTTGGATAACATTCTTGCATCAATAAAAGATCTTGACATCGCTCCTGAAAAGATTAGGGAATTTCTGCCAAAAGTTAATTGGGACAAATTGGCTTCCATGTACCTTCAG
GGTCGCTCAGGGGCAGAATGTGAAGCAAGGTGGTTGAATTTTGAAGACCCCCTAATTAATCGAGATCCATGGACTGCAAGTGAGGATAAAAATCTTTTGTTTACTATCCA
ACAGAAGGGCTTGAATAACTGGATTGAGATAGCAGTTTCATTGGGCACAAATAGAACTCCCTTCCAGTGCTTGTCTCGGTATCAAAGGAGTTTAAATGCTTCCATATTAA
AGAGGGAGTGGACCAAAGATGAGGATGATAAACTTCGATCTGCTGTTGCTATATTTGGCGTGAGAGATTGGCAGGCAGTAGCTTCTACTTTGGAAGGACGAGCTGGCACA
CAGTGCTCTAATAGGTGGAAAAAATCCCTTGACCCAGCTAGGACAAAAAGAGGTTATTTCACTCCATATGAAGACGTTCGTTTGAAAATTGCTGTAATGCTCTTTGGGCC
CAAAAATTGGAACAAGAAAGCAGAATTTTTGCCTGGTCGGAATCAAGTTCAATGCAGAGAAAGATGGTTTAATTGTTTAGATCCTTCCTTGAGAAGATGTGAGTGGACAG
AAGAGGAGGATTTAAGGCTGGAGATAGCAATTCAGGAACATGGATATAGCTGGGCTAAGGTAGCTGCATGTGTGCCATCACGTACAGATAATGAGTGTCGCAGGAGATGG
AAGAAGTTATTTCCCGATGAAGTTCCCTTGCTCCAGGAAGCTAGAAAGATTCAGAAGGCTGCTCTCATTAGCAACTTTGTTGATAGGGAAACAGAGCGTCCTGCTCTTGG
TCCTGCTGACTTTCGACCCAGGCCAAATACAGATTTATTATGTAATACTGATGGTCCAAGACCTGTCCCAAAAAGAAATGTGAAAACAAGAAAGACGCCCGTGTCAAGGA
ATGAAAAGAGTGCTACTGGGGATGCTCCAAAGAAGAGGAAATCAAATTATCAGAGGTTTCAGACGGATGCAACTGCTCAGGTGGGTATTGCATATAATACCTCTTTTGTC
CCAGAAGAGGTTCAATCTTCAAAGCCTCAAAGGAAACGAAATAGACGTGGAGCTTCTAATGCAAAGAGAATAGGGGTACCGGACCTACGTTCTGACAGCAAGTGGTGTGC
TAAACAGAATTTGGACACTCAGAGCCTTGGGTTGCAGCTGAATAGTAAGGAATCTGATAGGACCAACAGTGACTGTACCGAGACTGTCGATGAGAACATTCTGGAAGTTT
TTGAGAACAAAGTTGCAGAGAAGCTTACTGAAAAAATTGCATGCTTCTCGGAACAAAAAAAAAATCAAAACTCAACCGGATCTTCTGGAGTCTCAGTATTGTCGGAAATG
ACTAGCGGCTTCGTTGACTATAATCCCTCTATCCTCACAGATACAACATTGTTAGCTAGTACTACTGTGGATGATATCGAAGAATTGAAGGGCAAGAGTGTTGCAGACAG
GGATCTGGATGACAGTAACAGTTTCTCGTTACCGCACAGTTGCTTAGAACTCCGGACAGTTGACAGTGAAGGCGTAGACAGTTATTCTGTGGACGAATTTACAGCTAAAA
GCAATGGGGTTTGCAATCCCACCCAAGGCAGGAGGAAGAAAAATGGCAAAACATCAAATAACAGCCACGATAATTTGTTTCTTCCTAGTCAACAAATTGAGCAAGAGACA
TTGGGAACGAAGAAGCCTCATCGTCATAATCAATCAAAGAAGAGAAAACATAACAATACAGGTACAAGTACGTTAGGAACATTGGAGGCAGTTGAGGAGGTTGATGACTG
CACTCTCGTGGGGTTCTTGCAAAAAAGATTGAAGAGGACGGCAGTGACACATAACGAGACAGTTGATTGCCGTTCTAGTACTCCTTTAAATGTTGATGATGACGATAATG
AGCCTACCATTGCCTCTTTTCTTAATAAATTGAAGAGAAAAAAGCATCAGCGGCCTAGTGGTTGTGAGTTAAATTAAGGTAGGATTACGTAGTTTTGTTGCATTTTTGGA
ATACCTCAGCAACATCACATTAAATCTAGGAAGAAGGTTGATGGAAGGCTCTGTAGAAATAAAGCTTCCAAGGCTGCAAATGGCTAAATGCTCAACCGTACGTTTGATTC
TATGGTCGACACATTTTTGTTTTATGATTTAGCTAAGAGGGCTTTGGCAATTTTGTTTTCTTATGGTTGATTTCATCTTTTTTTTTTTTACCTCTCCTAATTTGTTCATT
TTTTTTGAGAAGGTTGGATAATCTTCTCTTTGTACATTTATCTCGTAATCTTAATGAAACTATA
Protein sequenceShow/hide protein sequence
MSLHNHVDEIDVEHRADKEDGVVDEDMEVLQRAYRLVGVNPEDYIHPRSSSITAGDADPGSDSDDVDDFELLRDIQNRFSIVADEQPLSTLSPVSADEEEDEFEMLRSIQ
RRFAAYESDTLSNKPDQSRDYDGSLKMDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRCKILKDFQCSCKRRTS
SALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPAENSHVACHRMALAKFPRVDRKKWSIVERENLGKGIRQQFQEMVLQISVDQFSGQQGVSGDSDDLDNILASI
KDLDIAPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTASEDKNLLFTIQQKGLNNWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDED
DKLRSAVAIFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTKRGYFTPYEDVRLKIAVMLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEI
AIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAALISNFVDRETERPALGPADFRPRPNTDLLCNTDGPRPVPKRNVKTRKTPVSRNEKSATGDA
PKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKPQRKRNRRGASNAKRIGVPDLRSDSKWCAKQNLDTQSLGLQLNSKESDRTNSDCTETVDENILEVFENKVAEKL
TEKIACFSEQKKNQNSTGSSGVSVLSEMTSGFVDYNPSILTDTTLLASTTVDDIEELKGKSVADRDLDDSNSFSLPHSCLELRTVDSEGVDSYSVDEFTAKSNGVCNPTQ
GRRKKNGKTSNNSHDNLFLPSQQIEQETLGTKKPHRHNQSKKRKHNNTGTSTLGTLEAVEEVDDCTLVGFLQKRLKRTAVTHNETVDCRSSTPLNVDDDDNEPTIASFLN
KLKRKKHQRPSGCELN