; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0022521 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0022521
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionsnRNA-activating protein complex subunit 4
Genome locationchr06:17086751..17101823
RNA-Seq ExpressionIVF0022521
SyntenyIVF0022521
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000978 - RNA polymerase II proximal promoter sequence-specific DNA binding (molecular function)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
InterPro domainsIPR001005 - SANT/Myb domain
IPR009057 - Homeobox-like domain superfamily
IPR017930 - Myb domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008452207.1 PREDICTED: snRNA-activating protein complex subunit 4 [Cucumis melo]0.092.35Show/hide
Query:  MSLHNHVDEIDVEHRADKEDGVVDEDMEVLQRAYRLVGVNPEDYIHPRSSSITAGDADPGSDSNDVDDFELLRDIQNRFSIVADEQPLSTLSPVSADEEE
        MSLHNHVDEIDVEHRADKEDGVVDEDMEVLQRAYRLVGVNPEDYIHPRSSSITAGDADPGSDS+DVDDFELLRDIQNRFSIVADEQPLSTLSPVSADEEE
Subjt:  MSLHNHVDEIDVEHRADKEDGVVDEDMEVLQRAYRLVGVNPEDYIHPRSSSITAGDADPGSDSNDVDDFELLRDIQNRFSIVADEQPLSTLSPVSADEEE

Query:  DEFEMLRSIQRRFAAYESDTLSNKPDQSRDYDGSLKMDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKK
        DEFEMLRSIQRRFAAYESDTLSNKPDQSRDYDGSLKMDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKK
Subjt:  DEFEMLRSIQRRFAAYESDTLSNKPDQSRDYDGSLKMDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKK

Query:  LRKRCKILKDFQCSCKRRTSSALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPAENSHVACHRMALAKFPRVDRKKWSIVERENLGKGIRQQFQE
        LRKRCKILKDFQCSCKRRTSSALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPAENSHVACHRMALAKFPRVDRKKWSIVERENLGKGIRQQFQE
Subjt:  LRKRCKILKDFQCSCKRRTSSALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPAENSHVACHRMALAKFPRVDRKKWSIVERENLGKGIRQQFQE

Query:  MVLQISVDQFSGQQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWT-------------KGLN
        MVLQISVDQFSGQQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWT             KGLN
Subjt:  MVLQISVDQFSGQQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWT-------------KGLN

Query:  NWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDNFDLLLLYLA----------------------WKKSLDPARTKRGYFTPYEDVRLKIAV
        NWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDD     +                          WKKSLDPARTKRGYFTPYEDVRLKIAV
Subjt:  NWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDNFDLLLLYLA----------------------WKKSLDPARTKRGYFTPYEDVRLKIAV

Query:  MLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAAL
        MLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAAL
Subjt:  MLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAAL

Query:  ISNFVDRETER----------LPNTDLLCNTDGPRPVPKRNVKTRKTPVSRNGKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKPQRK
        ISNFVDRETER           PNTDLLCNTDGPRPVPKRNVKTRKTPVSRN KSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKPQRK
Subjt:  ISNFVDRETER----------LPNTDLLCNTDGPRPVPKRNVKTRKTPVSRNGKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKPQRK

Query:  RNRRGASNAKRIGVPDLRSDSEWCAKQNLDTQSLGLQLNSKESDRTNSDCTETVDENILEVFENKVAEKLTEKIACFSEQKKNQ
        RNRRGASNAKRIGVPDLRSDSEWCAKQNLDTQSLGLQLNSKESDRTNSDCTETVDENILEVFENKVAEKLTEKIACFSEQKK++
Subjt:  RNRRGASNAKRIGVPDLRSDSEWCAKQNLDTQSLGLQLNSKESDRTNSDCTETVDENILEVFENKVAEKLTEKIACFSEQKKNQ

XP_011650584.1 uncharacterized protein LOC101216287 [Cucumis sativus]0.087.08Show/hide
Query:  MSLHNHVDEIDVEHRADKEDGVVDEDMEVLQRAYRLVGVNPEDYIHPRSSSITAGDADPGSDSNDVDDFELLRDIQNRFSIVADEQPLSTLSPVSADEEE
        MSL NHVDEIDVEH ADKEDGVVDEDMEVLQRAYRL GVNPEDYI+PR SS  AGDADPGSDS+DVDDFELLRDIQNRFSI+ADEQP ST  PVSADEEE
Subjt:  MSLHNHVDEIDVEHRADKEDGVVDEDMEVLQRAYRLVGVNPEDYIHPRSSSITAGDADPGSDSNDVDDFELLRDIQNRFSIVADEQPLSTLSPVSADEEE

Query:  DEFEMLRSIQRRFAAYESDTLSNKPDQSRDYDGSLKMDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKK
        DEFEMLRSIQRRFAAYESDTLSNKP+QSRDY GSLK+DSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKK
Subjt:  DEFEMLRSIQRRFAAYESDTLSNKPDQSRDYDGSLKMDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKK

Query:  LRKRCKILKDFQCSCKRRTSSALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPAENSHVACHRMALAKFPRVDRKKWSIVERENLGKGIRQQFQE
        LRKRCKILKDFQ SCKRRTS ALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGP ENSHVAC+RM LAKFP VDRKKWSIVERENLGKGIRQQFQE
Subjt:  LRKRCKILKDFQCSCKRRTSSALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPAENSHVACHRMALAKFPRVDRKKWSIVERENLGKGIRQQFQE

Query:  MVLQISVDQFSGQQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWT-------------KGLN
        MVLQISVDQ SG QG+SGDSDDLDNILASIKDLDIAP+KIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWT             KGLN
Subjt:  MVLQISVDQFSGQQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWT-------------KGLN

Query:  NWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDNFDLLLLYLA----------------------WKKSLDPARTKRGYFTPYEDVRLKIAV
        NWIE+AVSLGTNRTPFQCLSRYQRSLNASILKREWTK+EDD     +                          WKKSLDPART++GYFTP ED+RLKIAV
Subjt:  NWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDNFDLLLLYLA----------------------WKKSLDPARTKRGYFTPYEDVRLKIAV

Query:  MLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAAL
        +LFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAAL
Subjt:  MLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAAL

Query:  ISNFVDRETER----------LPNTDLLCNTDGPRPVPKRNVKTRKTPVSRNGKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKPQRK
        ISNFVDRETER           PNTD LCNTDGP P PKRNVKTRK PVSRN KSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKPQRK
Subjt:  ISNFVDRETER----------LPNTDLLCNTDGPRPVPKRNVKTRKTPVSRNGKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKPQRK

Query:  RNRRGASNAKRIGVPDLRSDSEWCAKQNLDTQSLGLQLNSKESDRTNSDCTETVDENILEVFENKVAEKLTEKIACFSEQKKNQNSTGSSGVSVLSEMTS
        RNRRGA  AKRIGVP+LRSDSEWCAKQNLDT+SLGLQLNSKES+R+NS+CTETVDENI+EV ENKVAEKLTE+ ACFSE +KNQNSTGSSGVSVLSEMT+
Subjt:  RNRRGASNAKRIGVPDLRSDSEWCAKQNLDTQSLGLQLNSKESDRTNSDCTETVDENILEVFENKVAEKLTEKIACFSEQKKNQNSTGSSGVSVLSEMTS

Query:  GFVDYNPSILTDTTLLASTTVDDIEELKGKSVADRDLDDSNSFSLPHSCLELRTVDSEGVDSYSVDEFTAKSNGVCNPTQGRRKKNGKTSNNSHDNLFLP
          VDYNPSILTDTTL ASTTVDDIEELKGKS ADRDLDDSNSFSL HSCLELRTVDSEGVDSYSVDE+TAKSNGVCNPTQGRRKKN KTSNNSHDNL +P
Subjt:  GFVDYNPSILTDTTLLASTTVDDIEELKGKSVADRDLDDSNSFSLPHSCLELRTVDSEGVDSYSVDEFTAKSNGVCNPTQGRRKKNGKTSNNSHDNLFLP

Query:  SQQIEQETLGTKKPHRHNQSKKRKHNNTGTSTLGTLEAVEEVDDCTLVGFLQKRLKRTAVTHNETVDCSSSTPLNVDDDDNEPTIASFLNKLKRKKHQRP
         QQI QETLGTKKP  HNQSKKRKH+NTG STL T EAVEEVDDCTLVGFLQKRLKRTA+THNETVDCSS+ PL VD+DDNEPTIASFLNKLKRKKHQRP
Subjt:  SQQIEQETLGTKKPHRHNQSKKRKHNNTGTSTLGTLEAVEEVDDCTLVGFLQKRLKRTAVTHNETVDCSSSTPLNVDDDDNEPTIASFLNKLKRKKHQRP

Query:  SGCELN
        SG ELN
Subjt:  SGCELN

XP_023515735.1 uncharacterized protein LOC111779809 isoform X1 [Cucurbita pepo subsp. pepo]0.073.45Show/hide
Query:  MSLHNHVDEIDVEHRA---DKEDGVVDEDMEVLQRAYRLVGVNPEDYIHPRSSSITAGDADPGSDSNDVDDFELLRDIQNRFSIVADEQPLSTLSPVSAD
        MS  +H D  D E  A   D ED +VD+DME L+RA RL GVN EDYI+PR S   AGDA+ GSDS+DVDD ELLR+IQNRFSI ADEQPLS L PV+AD
Subjt:  MSLHNHVDEIDVEHRA---DKEDGVVDEDMEVLQRAYRLVGVNPEDYIHPRSSSITAGDADPGSDSNDVDDFELLRDIQNRFSIVADEQPLSTLSPVSAD

Query:  EEEDEFEMLRSIQRRFAAYESDTLSNKPDQSRDYDGSLKMDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEE
        EEED+FEMLRSIQRRFAAYESD LSNKPDQS D DG LKMDS++  VE  TSS+R SM+AFEKGSLPKAALAF+DAIKKNRSQQKFIRSKMIHLEARIEE
Subjt:  EEEDEFEMLRSIQRRFAAYESDTLSNKPDQSRDYDGSLKMDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEE

Query:  NKKLRKRCKILKDFQCSCKRRTSSALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPAENSHVACHRMALAKFPRVDRKKWSIVERENLGKGIRQQ
        NKKLRKR K+LK FQ SC+R+T+ AL+QM+DPRVQLISA KPQAKDSSKKDKRLS M YGPAENSHVAC+R A  KF  VDRK+WS  ERENLGKGIRQQ
Subjt:  NKKLRKRCKILKDFQCSCKRRTSSALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPAENSHVACHRMALAKFPRVDRKKWSIVERENLGKGIRQQ

Query:  FQEMVLQISVDQFSGQQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWT-------------K
        FQEMVLQISVDQ S  QG S +SDDLDNILASIK LDI PEKIREFLPKVNWDKLASMYL+GRSGAECEARWLNFEDPLINR+PWT             K
Subjt:  FQEMVLQISVDQFSGQQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWT-------------K

Query:  GLNNWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDNFDLLLLYLA----------------------WKKSLDPARTKRGYFTPYEDVRLK
        GLNNWIE+AVSLGTNRTPFQCLSRYQRSLNASILK EWTKDEDD     +                          WKKSLDPARTKRGYFTP ED RLK
Subjt:  GLNNWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDNFDLLLLYLA----------------------WKKSLDPARTKRGYFTPYEDVRLK

Query:  IAVMLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQK
        IAV+LFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFP++VPLLQEARKIQK
Subjt:  IAVMLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQK

Query:  AALISNFVDRETER----------LPNTDLLCNTDGPRPVPKRNVKTRKTPVSRNGKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKP
         ALISNFVDRE+ER          +PN+ LLCNTD P   PKRNV+TR+ PVSRN KSA GDAPK+RKSN QR + D TAQV    NTS VPE V+S+KP
Subjt:  AALISNFVDRETER----------LPNTDLLCNTDGPRPVPKRNVKTRKTPVSRNGKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKP

Query:  QRKRNRRGASNAKRIGVPDLRSDSEWCAKQNLDTQSLGLQLNSKE-SDRTNSDCTETVDENILEVFENKVAEKLTEKIACFSEQKKNQNSTGSSGVSVLS
        QRKR R GA   +R G P +  +SE CA+QN DT+S+ +QLN KE ++R NSDC ETVDEN +EVFENK AE  +E + CFSEQ++NQNSTGSSGVSVLS
Subjt:  QRKRNRRGASNAKRIGVPDLRSDSEWCAKQNLDTQSLGLQLNSKE-SDRTNSDCTETVDENILEVFENKVAEKLTEKIACFSEQKKNQNSTGSSGVSVLS

Query:  EMTSGFVDYNPSILTDTTLLASTTVDDIEELKGKSVADRDLDDSNSFSLPHSCLELRTVDSEGVDSYSVDEFTAKSNGVCNPTQGRRKKNGKTSNNSHDN
        EMT+   +YNPS L DTTLLAS T DDI E KG +VAD+DLDDSNSFSLP SCLELRT DSEGVDSYSVDEFT KS+GVC P QGRRKKN K SN S D+
Subjt:  EMTSGFVDYNPSILTDTTLLASTTVDDIEELKGKSVADRDLDDSNSFSLPHSCLELRTVDSEGVDSYSVDEFTAKSNGVCNPTQGRRKKNGKTSNNSHDN

Query:  LFLPSQQIEQETLGTKKPHRHNQSKKRKHNNTGTSTLGTLEAVEEVDDCTLVGFLQKRLKRTAVTHNETVDCSSSTPLNVDDDDNEPTIASFLN-KLKRK
        L +  QQ E E  GT + HR NQSKKRKH+ T TS LGT+EAVEEVDDCTL GFLQKRLKRT  TH++ VD SSSTP  VD+DDN+PT+A  LN KLKRK
Subjt:  LFLPSQQIEQETLGTKKPHRHNQSKKRKHNNTGTSTLGTLEAVEEVDDCTLVGFLQKRLKRTAVTHNETVDCSSSTPLNVDDDDNEPTIASFLN-KLKRK

Query:  KH
        KH
Subjt:  KH

XP_038905712.1 uncharacterized protein LOC120091681 isoform X1 [Benincasa hispida]0.078.82Show/hide
Query:  MSLHNHVDEIDVEHRADKEDGVVDEDMEVLQRAYRLVGVNPEDYIHPRSSSITAGDADPGSDSNDVDDFELLRDIQNRFSIVADEQPLSTLSPVSADEEE
        MS HNH DE DVE  A+KED VVDEDMEVLQRAYRLVGVNPEDYI+PR SS   GDA+ G DS+D DDFELLR+IQNRFSIV DEQPLSTL PVS DEEE
Subjt:  MSLHNHVDEIDVEHRADKEDGVVDEDMEVLQRAYRLVGVNPEDYIHPRSSSITAGDADPGSDSNDVDDFELLRDIQNRFSIVADEQPLSTLSPVSADEEE

Query:  DEFEMLRSIQRRFAAYESDTLSNKPDQSRDYDGSLKMDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKK
        DEFEMLRSIQRRFAAYESD LSNKP++SRDY GSLKMDS + A ESQTSSKRPSM+AFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKK
Subjt:  DEFEMLRSIQRRFAAYESDTLSNKPDQSRDYDGSLKMDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKK

Query:  LRKRCKILKDFQCSCKRRTSSALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPAENSHVACHRMALAKFPRVDRKKWSIVERENLGKGIRQQFQE
        LRKRCKILKDFQ SCKR+T+ ALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPAENSHVAC+RMALAKFPRVDRKKWSIVERENLGKGIRQQFQE
Subjt:  LRKRCKILKDFQCSCKRRTSSALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPAENSHVACHRMALAKFPRVDRKKWSIVERENLGKGIRQQFQE

Query:  MVLQISVDQFSGQQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWT-------------KGLN
        MVLQISVDQ SG QG S DSDDLDNILASIKDLDI PEKIREFLPKVNWDKLASMYL GRSGAECEARWLNFEDPLINRDPWT             KGLN
Subjt:  MVLQISVDQFSGQQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWT-------------KGLN

Query:  NWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDNFDLLLLYLA----------------------WKKSLDPARTKRGYFTPYEDVRLKIAV
        NWIEIAVS GTNRTPFQCLSRYQRSLNASILKREWTKDEDD     +                          WKKSLDPARTKRG+FTP ED RLKIAV
Subjt:  NWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDNFDLLLLYLA----------------------WKKSLDPARTKRGYFTPYEDVRLKIAV

Query:  MLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAAL
        +L GPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDN+CRRRWKKLFP+EVPLLQEARKIQKAAL
Subjt:  MLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAAL

Query:  ISNFVDRETERLP----------NTDLLCNTDGPRPVPKRNVKTRKTPVSRNGKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKPQRK
        ISNFVDRE+ER            NTD+LC+TD P+P PKRN KTRK PVSRN KSATGDAP+KRKSNYQR Q DATA+VGIA NTS VPEEVQS KP RK
Subjt:  ISNFVDRETERLP----------NTDLLCNTDGPRPVPKRNVKTRKTPVSRNGKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKPQRK

Query:  RNRRGASNAKRIGVPDLRSDSEWCAKQNLDTQSLGLQLNSKESDRTNSDCTETVDENILEVFENKVAEKLTEKIACFSEQKKNQNSTGSSGVSVLSEMTS
        RNR  A   KR GV +L S+ +WCAKQNL+T+S+G+QL+SKE + TNSD TETVD N LEVFENK+A+KL+E+   FSE ++NQNSTGSSGVSVLSEMT+
Subjt:  RNRRGASNAKRIGVPDLRSDSEWCAKQNLDTQSLGLQLNSKESDRTNSDCTETVDENILEVFENKVAEKLTEKIACFSEQKKNQNSTGSSGVSVLSEMTS

Query:  GFVDYNPSILTDTTLLASTTVDDIEELKGKSVADRDLDDSNSFSLPHSCLELRTVDSEGVDSYSVDEFTAKSNGVCNPTQGRRKKNGKTSNNSHDNLFLP
           +YNPSIL DTTLLASTTVDDIEELKGKS ADRDLDDSNSFSLP SCLELRT+D EGVDSYSVD+ T KS+ VC   QGRRKKN KTS+ +H+  FL 
Subjt:  GFVDYNPSILTDTTLLASTTVDDIEELKGKSVADRDLDDSNSFSLPHSCLELRTVDSEGVDSYSVDEFTAKSNGVCNPTQGRRKKNGKTSNNSHDNLFLP

Query:  SQQIEQETLGTKKPHRHNQSKKRKHNNTGTSTLGTLEAVEEVDDCTLVGFLQKRLKRTAVTHNETVDCSSSTPLNVDDDDNEPTIASFLNKLKRKKHQRP
         QQ+EQE LG  +P   NQSKKRKH++T TS LGTLEAVEEVD+CTLVGFLQKRLK+        VDCSS TPL VD+DDN+  IASFLNKLKRKKHQ P
Subjt:  SQQIEQETLGTKKPHRHNQSKKRKHNNTGTSTLGTLEAVEEVDDCTLVGFLQKRLKRTAVTHNETVDCSSSTPLNVDDDDNEPTIASFLNKLKRKKHQRP

Query:  S
        S
Subjt:  S

XP_038905717.1 uncharacterized protein LOC120091681 isoform X2 [Benincasa hispida]0.078.82Show/hide
Query:  MSLHNHVDEIDVEHRADKEDGVVDEDMEVLQRAYRLVGVNPEDYIHPRSSSITAGDADPGSDSNDVDDFELLRDIQNRFSIVADEQPLSTLSPVSADEEE
        MS HNH DE DVE  A+KED VVDEDMEVLQRAYRLVGVNPEDYI+PR SS   GDA+ G DS+D DDFELLR+IQNRFSIV DEQPLSTL PVS DEEE
Subjt:  MSLHNHVDEIDVEHRADKEDGVVDEDMEVLQRAYRLVGVNPEDYIHPRSSSITAGDADPGSDSNDVDDFELLRDIQNRFSIVADEQPLSTLSPVSADEEE

Query:  DEFEMLRSIQRRFAAYESDTLSNKPDQSRDYDGSLKMDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKK
        DEFEMLRSIQRRFAAYESD LSNKP++SRDY GSLKMDS + A ESQTSSKRPSM+AFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKK
Subjt:  DEFEMLRSIQRRFAAYESDTLSNKPDQSRDYDGSLKMDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKK

Query:  LRKRCKILKDFQCSCKRRTSSALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPAENSHVACHRMALAKFPRVDRKKWSIVERENLGKGIRQQFQE
        LRKRCKILKDFQ SCKR+T+ ALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPAENSHVAC+RMALAKFPRVDRKKWSIVERENLGKGIRQQFQE
Subjt:  LRKRCKILKDFQCSCKRRTSSALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPAENSHVACHRMALAKFPRVDRKKWSIVERENLGKGIRQQFQE

Query:  MVLQISVDQFSGQQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWT-------------KGLN
        MVLQISVDQ SG QG S DSDDLDNILASIKDLDI PEKIREFLPKVNWDKLASMYL GRSGAECEARWLNFEDPLINRDPWT             KGLN
Subjt:  MVLQISVDQFSGQQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWT-------------KGLN

Query:  NWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDNFDLLLLYLA----------------------WKKSLDPARTKRGYFTPYEDVRLKIAV
        NWIEIAVS GTNRTPFQCLSRYQRSLNASILKREWTKDEDD     +                          WKKSLDPARTKRG+FTP ED RLKIAV
Subjt:  NWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDNFDLLLLYLA----------------------WKKSLDPARTKRGYFTPYEDVRLKIAV

Query:  MLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAAL
        +L GPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDN+CRRRWKKLFP+EVPLLQEARKIQKAAL
Subjt:  MLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAAL

Query:  ISNFVDRETERLP----------NTDLLCNTDGPRPVPKRNVKTRKTPVSRNGKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKPQRK
        ISNFVDRE+ER            NTD+LC+TD P+P PKRN KTRK PVSRN KSATGDAP+KRKSNYQR Q DATA+VGIA NTS VPEEVQS KP RK
Subjt:  ISNFVDRETERLP----------NTDLLCNTDGPRPVPKRNVKTRKTPVSRNGKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKPQRK

Query:  RNRRGASNAKRIGVPDLRSDSEWCAKQNLDTQSLGLQLNSKESDRTNSDCTETVDENILEVFENKVAEKLTEKIACFSEQKKNQNSTGSSGVSVLSEMTS
        RNR  A   KR GV +L S+ +WCAKQNL+T+S+G+QL+SKE + TNSD TETVD N LEVFENK+A+KL+E+   FSE ++NQNSTGSSGVSVLSEMT+
Subjt:  RNRRGASNAKRIGVPDLRSDSEWCAKQNLDTQSLGLQLNSKESDRTNSDCTETVDENILEVFENKVAEKLTEKIACFSEQKKNQNSTGSSGVSVLSEMTS

Query:  GFVDYNPSILTDTTLLASTTVDDIEELKGKSVADRDLDDSNSFSLPHSCLELRTVDSEGVDSYSVDEFTAKSNGVCNPTQGRRKKNGKTSNNSHDNLFLP
           +YNPSIL DTTLLASTTVDDIEELKGKS ADRDLDDSNSFSLP SCLELRT+D EGVDSYSVD+ T KS+ VC   QGRRKKN KTS+ +H+  FL 
Subjt:  GFVDYNPSILTDTTLLASTTVDDIEELKGKSVADRDLDDSNSFSLPHSCLELRTVDSEGVDSYSVDEFTAKSNGVCNPTQGRRKKNGKTSNNSHDNLFLP

Query:  SQQIEQETLGTKKPHRHNQSKKRKHNNTGTSTLGTLEAVEEVDDCTLVGFLQKRLKRTAVTHNETVDCSSSTPLNVDDDDNEPTIASFLNKLKRKKHQRP
         QQ+EQE LG  +P   NQSKKRKH++T TS LGTLEAVEEVD+CTLVGFLQKRLK+        VDCSS TPL VD+DDN+  IASFLNKLKRKKHQ P
Subjt:  SQQIEQETLGTKKPHRHNQSKKRKHNNTGTSTLGTLEAVEEVDDCTLVGFLQKRLKRTAVTHNETVDCSSSTPLNVDDDDNEPTIASFLNKLKRKKHQRP

Query:  S
        S
Subjt:  S

TrEMBL top hitse value%identityAlignment
A0A0A0L2R2 Uncharacterized protein0.0e+0087.08Show/hide
Query:  MSLHNHVDEIDVEHRADKEDGVVDEDMEVLQRAYRLVGVNPEDYIHPRSSSITAGDADPGSDSNDVDDFELLRDIQNRFSIVADEQPLSTLSPVSADEEE
        MSL NHVDEIDVEH ADKEDGVVDEDMEVLQRAYRL GVNPEDYI+PR SS  AGDADPGSDS+DVDDFELLRDIQNRFSI+ADEQP ST  PVSADEEE
Subjt:  MSLHNHVDEIDVEHRADKEDGVVDEDMEVLQRAYRLVGVNPEDYIHPRSSSITAGDADPGSDSNDVDDFELLRDIQNRFSIVADEQPLSTLSPVSADEEE

Query:  DEFEMLRSIQRRFAAYESDTLSNKPDQSRDYDGSLKMDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKK
        DEFEMLRSIQRRFAAYESDTLSNKP+QSRDY GSLK+DSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKK
Subjt:  DEFEMLRSIQRRFAAYESDTLSNKPDQSRDYDGSLKMDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKK

Query:  LRKRCKILKDFQCSCKRRTSSALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPAENSHVACHRMALAKFPRVDRKKWSIVERENLGKGIRQQFQE
        LRKRCKILKDFQ SCKRRTS ALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGP ENSHVAC+RM LAKFP VDRKKWSIVERENLGKGIRQQFQE
Subjt:  LRKRCKILKDFQCSCKRRTSSALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPAENSHVACHRMALAKFPRVDRKKWSIVERENLGKGIRQQFQE

Query:  MVLQISVDQFSGQQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWT-------------KGLN
        MVLQISVDQ SG QG+SGDSDDLDNILASIKDLDIAP+KIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWT             KGLN
Subjt:  MVLQISVDQFSGQQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWT-------------KGLN

Query:  NWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDNFDLLLLYLA----------------------WKKSLDPARTKRGYFTPYEDVRLKIAV
        NWIE+AVSLGTNRTPFQCLSRYQRSLNASILKREWTK+EDD     +                          WKKSLDPART++GYFTP ED+RLKIAV
Subjt:  NWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDNFDLLLLYLA----------------------WKKSLDPARTKRGYFTPYEDVRLKIAV

Query:  MLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAAL
        +LFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAAL
Subjt:  MLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAAL

Query:  ISNFVDRETER----------LPNTDLLCNTDGPRPVPKRNVKTRKTPVSRNGKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKPQRK
        ISNFVDRETER           PNTD LCNTDGP P PKRNVKTRK PVSRN KSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKPQRK
Subjt:  ISNFVDRETER----------LPNTDLLCNTDGPRPVPKRNVKTRKTPVSRNGKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKPQRK

Query:  RNRRGASNAKRIGVPDLRSDSEWCAKQNLDTQSLGLQLNSKESDRTNSDCTETVDENILEVFENKVAEKLTEKIACFSEQKKNQNSTGSSGVSVLSEMTS
        RNRRGA  AKRIGVP+LRSDSEWCAKQNLDT+SLGLQLNSKES+R+NS+CTETVDENI+EV ENKVAEKLTE+ ACFSE +KNQNSTGSSGVSVLSEMT+
Subjt:  RNRRGASNAKRIGVPDLRSDSEWCAKQNLDTQSLGLQLNSKESDRTNSDCTETVDENILEVFENKVAEKLTEKIACFSEQKKNQNSTGSSGVSVLSEMTS

Query:  GFVDYNPSILTDTTLLASTTVDDIEELKGKSVADRDLDDSNSFSLPHSCLELRTVDSEGVDSYSVDEFTAKSNGVCNPTQGRRKKNGKTSNNSHDNLFLP
          VDYNPSILTDTTL ASTTVDDIEELKGKS ADRDLDDSNSFSL HSCLELRTVDSEGVDSYSVDE+TAKSNGVCNPTQGRRKKN KTSNNSHDNL +P
Subjt:  GFVDYNPSILTDTTLLASTTVDDIEELKGKSVADRDLDDSNSFSLPHSCLELRTVDSEGVDSYSVDEFTAKSNGVCNPTQGRRKKNGKTSNNSHDNLFLP

Query:  SQQIEQETLGTKKPHRHNQSKKRKHNNTGTSTLGTLEAVEEVDDCTLVGFLQKRLKRTAVTHNETVDCSSSTPLNVDDDDNEPTIASFLNKLKRKKHQRP
         QQI QETLGTKKP  HNQSKKRKH+NTG STL T EAVEEVDDCTLVGFLQKRLKRTA+THNETVDCSS+ PL VD+DDNEPTIASFLNKLKRKKHQRP
Subjt:  SQQIEQETLGTKKPHRHNQSKKRKHNNTGTSTLGTLEAVEEVDDCTLVGFLQKRLKRTAVTHNETVDCSSSTPLNVDDDDNEPTIASFLNKLKRKKHQRP

Query:  SGCELN
        SG ELN
Subjt:  SGCELN

A0A1S3BUG0 snRNA-activating protein complex subunit 40.0e+0092.35Show/hide
Query:  MSLHNHVDEIDVEHRADKEDGVVDEDMEVLQRAYRLVGVNPEDYIHPRSSSITAGDADPGSDSNDVDDFELLRDIQNRFSIVADEQPLSTLSPVSADEEE
        MSLHNHVDEIDVEHRADKEDGVVDEDMEVLQRAYRLVGVNPEDYIHPRSSSITAGDADPGSDS+DVDDFELLRDIQNRFSIVADEQPLSTLSPVSADEEE
Subjt:  MSLHNHVDEIDVEHRADKEDGVVDEDMEVLQRAYRLVGVNPEDYIHPRSSSITAGDADPGSDSNDVDDFELLRDIQNRFSIVADEQPLSTLSPVSADEEE

Query:  DEFEMLRSIQRRFAAYESDTLSNKPDQSRDYDGSLKMDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKK
        DEFEMLRSIQRRFAAYESDTLSNKPDQSRDYDGSLKMDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKK
Subjt:  DEFEMLRSIQRRFAAYESDTLSNKPDQSRDYDGSLKMDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKK

Query:  LRKRCKILKDFQCSCKRRTSSALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPAENSHVACHRMALAKFPRVDRKKWSIVERENLGKGIRQQFQE
        LRKRCKILKDFQCSCKRRTSSALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPAENSHVACHRMALAKFPRVDRKKWSIVERENLGKGIRQQFQE
Subjt:  LRKRCKILKDFQCSCKRRTSSALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPAENSHVACHRMALAKFPRVDRKKWSIVERENLGKGIRQQFQE

Query:  MVLQISVDQFSGQQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWT-------------KGLN
        MVLQISVDQFSGQQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWT             KGLN
Subjt:  MVLQISVDQFSGQQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWT-------------KGLN

Query:  NWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDNFDLLLLYLA----------------------WKKSLDPARTKRGYFTPYEDVRLKIAV
        NWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDD     +                          WKKSLDPARTKRGYFTPYEDVRLKIAV
Subjt:  NWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDNFDLLLLYLA----------------------WKKSLDPARTKRGYFTPYEDVRLKIAV

Query:  MLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAAL
        MLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAAL
Subjt:  MLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAAL

Query:  ISNFVDRETER----------LPNTDLLCNTDGPRPVPKRNVKTRKTPVSRNGKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKPQRK
        ISNFVDRETER           PNTDLLCNTDGPRPVPKRNVKTRKTPVSRN KSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKPQRK
Subjt:  ISNFVDRETER----------LPNTDLLCNTDGPRPVPKRNVKTRKTPVSRNGKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKPQRK

Query:  RNRRGASNAKRIGVPDLRSDSEWCAKQNLDTQSLGLQLNSKESDRTNSDCTETVDENILEVFENKVAEKLTEKIACFSEQKKNQ
        RNRRGASNAKRIGVPDLRSDSEWCAKQNLDTQSLGLQLNSKESDRTNSDCTETVDENILEVFENKVAEKLTEKIACFSEQKK++
Subjt:  RNRRGASNAKRIGVPDLRSDSEWCAKQNLDTQSLGLQLNSKESDRTNSDCTETVDENILEVFENKVAEKLTEKIACFSEQKKNQ

A0A6J1E2J4 uncharacterized protein LOC111430000 isoform X20.0e+0073.05Show/hide
Query:  MSLHNHVDEIDVEHRA---DKEDGVVDEDMEVLQRAYRLVGVNPEDYIHPRSSSITAGDADPGSDSNDVDDFELLRDIQNRFSIVADEQPLSTLSPVSAD
        MS  +HVD  D E  A   D ED +VD+DME L+RA RL GVN ED I+PR S   AGDA+ GSDS+DVDD ELLR+IQNRFS  ADEQPLS L PV+AD
Subjt:  MSLHNHVDEIDVEHRA---DKEDGVVDEDMEVLQRAYRLVGVNPEDYIHPRSSSITAGDADPGSDSNDVDDFELLRDIQNRFSIVADEQPLSTLSPVSAD

Query:  EEEDEFEMLRSIQRRFAAYESDTLSNKPDQSRDYDGSLKMDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEE
        EEED+FE LRSIQRRFAAYESD LSNKPDQS D DG LKMDSD+  V   TSS+R SM+AFEKGSLPKAALAF+DAIKKNRSQQKFIRSKMIHLEARIEE
Subjt:  EEEDEFEMLRSIQRRFAAYESDTLSNKPDQSRDYDGSLKMDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEE

Query:  NKKLRKRCKILKDFQCSCKRRTSSALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPAENSHVACHRMALAKFPRVDRKKWSIVERENLGKGIRQQ
        NKKLRKR K+LK FQ SC+R+T+ AL+QM+DPRVQLISA KPQAKDSSKKDKRLS M YGPAENSHVAC+R AL KF  VDRK+WS  ERENLGKGIRQQ
Subjt:  NKKLRKRCKILKDFQCSCKRRTSSALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPAENSHVACHRMALAKFPRVDRKKWSIVERENLGKGIRQQ

Query:  FQEMVLQISVDQFSGQQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWT-------------K
        FQEMVLQISVDQ S  QG S +SDDLDNILASIK LDI PEKIREFLPKVNWDKLA MYLQGRSGAECEARWLNFEDPLINR+ WT             K
Subjt:  FQEMVLQISVDQFSGQQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWT-------------K

Query:  GLNNWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDNFDLLLLYLA----------------------WKKSLDPARTKRGYFTPYEDVRLK
        GLNNWIE+AVSLGTNRTPFQCLSRYQRSLNASILK EWTKDEDD     +                          WKKSLDPARTKRGYFTP ED RLK
Subjt:  GLNNWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDNFDLLLLYLA----------------------WKKSLDPARTKRGYFTPYEDVRLK

Query:  IAVMLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQK
        IAV+LFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNEC RRWKKLFP++VPLLQEARKIQK
Subjt:  IAVMLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQK

Query:  AALISNFVDRETER----------LPNTDLLCNTDGPRPVPKRNVKTRKTPVSRNGKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKP
         ALISNFVDRE+ER          +PN+ LLCNTD P   PKRNV+ R+ PVSRN KSA GDAPKK KSN QR Q D TAQV  A NTS VP EV+S+KP
Subjt:  AALISNFVDRETER----------LPNTDLLCNTDGPRPVPKRNVKTRKTPVSRNGKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKP

Query:  QRKRNRRGASNAKRIGVPDLRSDSEWCAKQNLDTQSLGLQLNSKE-SDRTNSDCTETVDENILEVFENKVAEKLTEKIACFSEQKKNQNSTGSSGVSVLS
        QRKR R GA   +R G P +  +SE CA+QN DT+SL +QLN KE ++R NSDC ETVDEN +EVFENK AE  +E + CFSEQ++NQNSTGSSGVSVLS
Subjt:  QRKRNRRGASNAKRIGVPDLRSDSEWCAKQNLDTQSLGLQLNSKE-SDRTNSDCTETVDENILEVFENKVAEKLTEKIACFSEQKKNQNSTGSSGVSVLS

Query:  EMTSGFVDYNPSILTDTTLLASTTVDDIEELKGKSVADRDLDDSNSFSLPHSCLELRTVDSEGVDSYSVDEFTAKSNGVCNPTQGRRKKNGKTSNNSHDN
        EMT+   +YNPS   DTTLLAS T DDI E KG +VAD+DLDDSNSFSLP SCLELRT DSEGVDSYSVDEFT KS+GVC P QGRRKKN K SN S D+
Subjt:  EMTSGFVDYNPSILTDTTLLASTTVDDIEELKGKSVADRDLDDSNSFSLPHSCLELRTVDSEGVDSYSVDEFTAKSNGVCNPTQGRRKKNGKTSNNSHDN

Query:  LFLPSQQIEQETLGTKKPHRHNQSKKRKHNNTGTSTLGTLEAVEEVDDCTLVGFLQKRLKRTAVTHNETVDCSSSTPLNVDDDDNEPTIASFL-NKLKRK
        L +  QQ E E  G  + HR NQSKKRKH+ T TS LGT+EAVEEVDDCTL GFLQKRLKRT  TH++ VD SSSTP  VD+DDN+PT+A  L +KLKRK
Subjt:  LFLPSQQIEQETLGTKKPHRHNQSKKRKHNNTGTSTLGTLEAVEEVDDCTLVGFLQKRLKRTAVTHNETVDCSSSTPLNVDDDDNEPTIASFL-NKLKRK

Query:  KH
        KH
Subjt:  KH

A0A6J1E6Z7 uncharacterized protein LOC111430000 isoform X10.0e+0073.15Show/hide
Query:  MSLHNHVDEIDVEHRA---DKEDGVVDEDMEVLQRAYRLVGVNPEDYIHPRSSSITAGDADPGSDSNDVDDFELLRDIQNRFSIVADEQPLSTLSPVSAD
        MS  +HVD  D E  A   D ED +VD+DME L+RA RL GVN ED I+PR S   AGDA+ GSDS+DVDD ELLR+IQNRFS  ADEQPLS L PV+AD
Subjt:  MSLHNHVDEIDVEHRA---DKEDGVVDEDMEVLQRAYRLVGVNPEDYIHPRSSSITAGDADPGSDSNDVDDFELLRDIQNRFSIVADEQPLSTLSPVSAD

Query:  EEEDEFEMLRSIQRRFAAYESDTLSNKPDQSRDYDGSLKMDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEE
        EEED+FE LRSIQRRFAAYESD LSNKPDQS D DG LKMDSD+  V   TSS+R SM+AFEKGSLPKAALAF+DAIKKNRSQQKFIRSKMIHLEARIEE
Subjt:  EEEDEFEMLRSIQRRFAAYESDTLSNKPDQSRDYDGSLKMDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEE

Query:  NKKLRKRCKILKDFQCSCKRRTSSALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPAENSHVACHRMALAKFPRVDRKKWSIVERENLGKGIRQQ
        NKKLRKR K+LK FQ SC+R+T+ AL+QM+DPRVQLISA KPQAKDSSKKDKRLS M YGPAENSHVAC+R AL KF  VDRK+WS  ERENLGKGIRQQ
Subjt:  NKKLRKRCKILKDFQCSCKRRTSSALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPAENSHVACHRMALAKFPRVDRKKWSIVERENLGKGIRQQ

Query:  FQEMVLQISVDQFSGQQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWT-------------K
        FQEMVLQISVDQ S  QG S +SDDLDNILASIK LDI PEKIREFLPKVNWDKLA MYLQGRSGAECEARWLNFEDPLINR+ WT             K
Subjt:  FQEMVLQISVDQFSGQQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWT-------------K

Query:  GLNNWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDNFDLLLLYLA----------------------WKKSLDPARTKRGYFTPYEDVRLK
        GLNNWIE+AVSLGTNRTPFQCLSRYQRSLNASILK EWTKDEDD     +                          WKKSLDPARTKRGYFTP ED RLK
Subjt:  GLNNWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDNFDLLLLYLA----------------------WKKSLDPARTKRGYFTPYEDVRLK

Query:  IAVMLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQK
        IAV+LFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFP++VPLLQEARKIQK
Subjt:  IAVMLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQK

Query:  AALISNFVDRETER----------LPNTDLLCNTDGPRPVPKRNVKTRKTPVSRNGKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKP
         ALISNFVDRE+ER          +PN+ LLCNTD P   PKRNV+ R+ PVSRN KSA GDAPKK KSN QR Q D TAQV  A NTS VP EV+S+KP
Subjt:  AALISNFVDRETER----------LPNTDLLCNTDGPRPVPKRNVKTRKTPVSRNGKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKP

Query:  QRKRNRRGASNAKRIGVPDLRSDSEWCAKQNLDTQSLGLQLNSKE-SDRTNSDCTETVDENILEVFENKVAEKLTEKIACFSEQKKNQNSTGSSGVSVLS
        QRKR R GA   +R G P +  +SE CA+QN DT+SL +QLN KE ++R NSDC ETVDEN +EVFENK AE  +E + CFSEQ++NQNSTGSSGVSVLS
Subjt:  QRKRNRRGASNAKRIGVPDLRSDSEWCAKQNLDTQSLGLQLNSKE-SDRTNSDCTETVDENILEVFENKVAEKLTEKIACFSEQKKNQNSTGSSGVSVLS

Query:  EMTSGFVDYNPSILTDTTLLASTTVDDIEELKGKSVADRDLDDSNSFSLPHSCLELRTVDSEGVDSYSVDEFTAKSNGVCNPTQGRRKKNGKTSNNSHDN
        EMT+   +YNPS   DTTLLAS T DDI E KG +VAD+DLDDSNSFSLP SCLELRT DSEGVDSYSVDEFT KS+GVC P QGRRKKN K SN S D+
Subjt:  EMTSGFVDYNPSILTDTTLLASTTVDDIEELKGKSVADRDLDDSNSFSLPHSCLELRTVDSEGVDSYSVDEFTAKSNGVCNPTQGRRKKNGKTSNNSHDN

Query:  LFLPSQQIEQETLGTKKPHRHNQSKKRKHNNTGTSTLGTLEAVEEVDDCTLVGFLQKRLKRTAVTHNETVDCSSSTPLNVDDDDNEPTIASFL-NKLKRK
        L +  QQ E E  G  + HR NQSKKRKH+ T TS LGT+EAVEEVDDCTL GFLQKRLKRT  TH++ VD SSSTP  VD+DDN+PT+A  L +KLKRK
Subjt:  LFLPSQQIEQETLGTKKPHRHNQSKKRKHNNTGTSTLGTLEAVEEVDDCTLVGFLQKRLKRTAVTHNETVDCSSSTPLNVDDDDNEPTIASFL-NKLKRK

Query:  KH
        KH
Subjt:  KH

A0A6J1JKV7 uncharacterized protein LOC111485355 isoform X10.0e+0072.98Show/hide
Query:  MSLHNHVDEIDVEHRA---DKEDGVVDEDMEVLQRAYRLVGVNPEDYIHPRSSSITAGDADPGSDSNDVDDFELLRDIQNRFSIVADEQPLSTLSPVSAD
        MS  +HVD  D E  A   D ED +VD+DME L+RA RL GVN EDY++P+ S   AGDA+ GSDS+DVDD ELLR+IQNRFSI ADEQPLS L PV+AD
Subjt:  MSLHNHVDEIDVEHRA---DKEDGVVDEDMEVLQRAYRLVGVNPEDYIHPRSSSITAGDADPGSDSNDVDDFELLRDIQNRFSIVADEQPLSTLSPVSAD

Query:  EEEDEFEMLRSIQRRFAAYESDTLSNKPDQSRDYDGSLKMDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEE
        EEED+FE LRSIQRRFAAYESD LSNKPDQS D DG LKMDSD+  VE  TSS+R SM+AFEKGSLPKAALAF+DAIKKNRSQQKF+RSKMIHLEARIEE
Subjt:  EEEDEFEMLRSIQRRFAAYESDTLSNKPDQSRDYDGSLKMDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEE

Query:  NKKLRKRCKILKDFQCSCKRRTSSALSQMIDPRVQLISAAKP-QAKDSSKKDKRLSGMYYGPAENSHVACHRMALAKFPRVDRKKWSIVERENLGKGIRQ
        NKKLRKR K+LK FQ SC+R+T+ ALSQM+DPRVQLISA KP QAKDSSKKDKRLS M YGPAENSHVAC+R+AL KF  VDRK+WS  ERENLGKGIRQ
Subjt:  NKKLRKRCKILKDFQCSCKRRTSSALSQMIDPRVQLISAAKP-QAKDSSKKDKRLSGMYYGPAENSHVACHRMALAKFPRVDRKKWSIVERENLGKGIRQ

Query:  QFQEMVLQISVDQFSGQQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWT-------------
        QFQEMVLQISVDQ S  QG S +SDDLDNILASIKDLDI PEKIREFLPKVNWDKLASMYL+GRSGAECEARWLNFEDPLINR+PWT             
Subjt:  QFQEMVLQISVDQFSGQQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWT-------------

Query:  KGLNNWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDNFDLLLLYLA----------------------WKKSLDPARTKRGYFTPYEDVRL
        KGLNNWI++AVSLGTNRTPFQ LSRYQRSLNASILK EWTKDEDD     +                          WKKSLDPARTKRGYFTP ED RL
Subjt:  KGLNNWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDNFDLLLLYLA----------------------WKKSLDPARTKRGYFTPYEDVRL

Query:  KIAVMLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQ
        KIAV+LFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFP++VPLLQEARKIQ
Subjt:  KIAVMLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQ

Query:  KAALISNFVDRETER----------LPNTDLLCNTDGPRPVPKRNVKTRKTPVSRNGKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSK
        K ALISNFVDRE+ER          +PN+ LLCNTD P   PKRNV+TR+ PVSRN KSA GDAPKKRKSN QR + D TAQV  A NTS VP EV+S+K
Subjt:  KAALISNFVDRETER----------LPNTDLLCNTDGPRPVPKRNVKTRKTPVSRNGKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSK

Query:  PQRKRNRRGASNAKRIGVPDLRSDSEWCAKQNLDTQSLGLQLNSKE-SDRTNSDCTETVDENILEVFENKVAEKLTEKIACFSEQKKNQNSTGSSGVSVL
        PQRKR R GA   +R G P +  +SE CA+QN DT++L +QLN KE ++R NSDC ETVDEN +EVFENK AE  +E + CFSEQ++NQNSTGSSGVSVL
Subjt:  PQRKRNRRGASNAKRIGVPDLRSDSEWCAKQNLDTQSLGLQLNSKE-SDRTNSDCTETVDENILEVFENKVAEKLTEKIACFSEQKKNQNSTGSSGVSVL

Query:  SEMTSGFVDYNPSILTDTTLLASTTVDDIEELKGKSVADRDLDDSNSFSLPHSCLELRTVDSEGVDSYSVDEFTAKSNGVCNPTQGRRKKNGKTSNNSHD
        SEMT+   +YNPS L DTTLLAS T DDI E KG +VAD+DLD SNSFSLP SCLELRT DSEGVDSYSVDEFT KS+ VC P QGRRKKN K SN S D
Subjt:  SEMTSGFVDYNPSILTDTTLLASTTVDDIEELKGKSVADRDLDDSNSFSLPHSCLELRTVDSEGVDSYSVDEFTAKSNGVCNPTQGRRKKNGKTSNNSHD

Query:  NLFLPSQQIEQETLGTKKPHRHNQSKKRKHNNTGTSTLGTLEAVEEVDDCTLVGFLQKRLKRTAVTHNETVDCSSSTPLNVDDDDNEPTIASFL-NKLKR
        +L +  QQ E E  GT + HR NQ KKRKH++T TS LGT+EAVEEVDDCTL+GFLQKRLKRT  TH + VD SSST   VD+DDN+PT+A  L  KLKR
Subjt:  NLFLPSQQIEQETLGTKKPHRHNQSKKRKHNNTGTSTLGTLEAVEEVDDCTLVGFLQKRLKRTAVTHNETVDCSSSTPLNVDDDDNEPTIASFL-NKLKR

Query:  KKH
        KKH
Subjt:  KKH

SwissProt top hitse value%identityAlignment
P10242 Transcriptional activator Myb1.2e-2332.37Show/hide
Query:  EARWLNFEDPLINRDPWTKGLNNWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDNFDLLLLYLAWKKSLDPARTKRGYFTPYEDVRLKIAV
        + RW   ED  + +     G ++W  IA  L  NRT  QC  R+Q+ LN  ++K  WTK+ED                                R+   V
Subjt:  EARWLNFEDPLINRDPWTKGLNNWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDNFDLLLLYLAWKKSLDPARTKRGYFTPYEDVRLKIAV

Query:  MLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVP---LLQEARKIQK
          +GPK W+  A+ L GR   QCRERW N L+P +++  WTEEED  +  A +  G  WA++A  +P RTDN  +  W      +V     LQE+ K  +
Subjt:  MLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVP---LLQEARKIQK

Query:  AALISNF
         A+ ++F
Subjt:  AALISNF

Q08759 Transcriptional activator Myb2.1e-2330.43Show/hide
Query:  EARWLNFEDPLINRDPWTKGLNNWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDNFDLLLLYLAWKKSLDPARTKRGYFTPYEDVRLKIAV
        + RW   ED  + +     G   W ++  S   NRT  QC  R+Q+ LN  ++K  WTK+ED                                R+   V
Subjt:  EARWLNFEDPLINRDPWTKGLNNWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDNFDLLLLYLAWKKSLDPARTKRGYFTPYEDVRLKIAV

Query:  MLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLF---PDEVPLLQEARKIQK
          +GPK W+  A+ L GR   QCRERW N L+P +++  WTEEED  +  A +  G  WA++A  +P RTDN  +  W        ++   LQ + K  +
Subjt:  MLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLF---PDEVPLLQEARKIQK

Query:  AALISNF
          +++NF
Subjt:  AALISNF

Q54NA6 Myb-like protein L1.2e-3930.07Show/hide
Query:  PAENSHVACHRMALAKFPRVDR-KKWSIVERENLGKGIRQQ-FQEMVLQISVDQFSG----------QQGVSGDSDDLDNI-----LASIKDLDIAPEKI
        PA+N      R+     P   + ++W+  E E L KGI+++  Q+ + ++S D+ S           Q+  + ++++ +NI       SIKD        
Subjt:  PAENSHVACHRMALAKFPRVDR-KKWSIVERENLGKGIRQQ-FQEMVLQISVDQFSG----------QQGVSGDSDDLDNI-----LASIKDLDIAPEKI

Query:  REFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTK-------------GLNNWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDED
         + + +V  + L       RS  E   RW N +DP IN+ P+TK               + W +I++ LGTNRTP  C+ RYQRSLN+ ++KREWTK+ED
Subjt:  REFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTK-------------GLNNWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDED

Query:  D---------------NFDLLLLYL----------AWKKSLDPARTKRGYFTPYEDVRLKIAVMLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRR
        +               ++  +  Y+           W K+LDP+  K+G ++P ED  L  AV  +G  NW      + GR  VQCRER+ N LDP L +
Subjt:  D---------------NFDLLLLYL----------AWKKSLDPARTKRGYFTPYEDVRLKIAVMLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRR

Query:  CEWTEEEDLRLEIAIQEHGY-SWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKI--QKAALISNFVDRETER--LPNTDLLCNTDGPRPVPKRNV
          WT +ED RL     + G   W+ VA  + +RTDN+C RRWK+L      L     K+  +K   +SNF  R+ ER  L   D++   +   P   +  
Subjt:  CEWTEEEDLRLEIAIQEHGY-SWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKI--QKAALISNFVDRETER--LPNTDLLCNTDGPRPVPKRNV

Query:  KTRKTPVSRNGKSATGDAPKKRKSNYQRFQTDATAQVGI
        KT  +  + +    T +       N      D +   GI
Subjt:  KTRKTPVSRNGKSATGDAPKKRKSNYQRFQTDATAQVGI

Q5SXM2 snRNA-activating protein complex subunit 46.1e-3125.74Show/hide
Query:  IKKNRSQQKFIRSKMIHLEARIEENKKLRKRCKILKDFQCSCKRRTSSALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPAENSHV-ACHRMALA
        ++ N   Q+ I+ K+      + +N++ ++  ++++D   S  + T     + + P   +    KP  KD      +++G+  GP  N          + 
Subjt:  IKKNRSQQKFIRSKMIHLEARIEENKKLRKRCKILKDFQCSCKRRTSSALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPAENSHV-ACHRMALA

Query:  KFPRVDRKKWSIVERENLGKGIRQQFQEMVLQISVDQFSG-QQGVSGDSDDLD---------NILASIKDLDIAPEK--IREFLPKVNWDKLASMYLQG-
         F  +   KW   E+  L K +     + +LQ  + +     Q  S  S +L+              I+D++  PE+  +   L   +W+K++++  +G 
Subjt:  KFPRVDRKKWSIVERENLGKGIRQQFQEMVLQISVDQFSG-QQGVSGDSDDLD---------NILASIKDLDIAPEK--IREFLPKVNWDKLASMYLQG-

Query:  RSGAECEARWLNFEDPLINRDPWTK-------------GLNNWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDED-------------------
        RS  E    W N E P IN+  W++             G   W +IA  LGT+R+ FQCL ++Q+  N ++ ++EWT++ED                   
Subjt:  RSGAECEARWLNFEDPLINRDPWTK-------------GLNNWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDED-------------------

Query:  ------DNFDLLLLYLAWKKSLDPARTKRGYFTPYEDVRLKIAVMLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHG
              +  D + L   W KSLDP   K+GY+ P ED +L  AV  +G ++W K  E +PGR+  QCR+R+   L  SL++  W  +E+ +L   I+++G
Subjt:  ------DNFDLLLLYLAWKKSLDPARTKRGYFTPYEDVRLKIAVMLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHG

Query:  YS-WAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARK
           WAK+A+ +P R+ ++C  +WK +   +  L +  R+
Subjt:  YS-WAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARK

Q8BP86 snRNA-activating protein complex subunit 42.7e-3130.89Show/hide
Query:  IKDLDIAPEK--IREFLPKVNWDKLASMYLQG-RSGAECEARWLNFEDPLINRDPW-------------TKGLNNWIEIAVSLGTNRTPFQCLSRYQRSL
        I+D++  PE+  +   L   +W+K++++  +G RS  E    W + E P I++  W             T G   W  +A  LGT+R+ FQCL ++Q+  
Subjt:  IKDLDIAPEK--IREFLPKVNWDKLASMYLQG-RSGAECEARWLNFEDPLINRDPW-------------TKGLNNWIEIAVSLGTNRTPFQCLSRYQRSL

Query:  NASILKREWTKDED-------------------------DNFDLLLLYLAWKKSLDPARTKRGYFTPYEDVRLKIAVMLFGPKNWNKKAEFLPGRNQVQC
        N ++ ++EWT++ED                         +  D + L   W KSLDP+  KRG++ P ED +L  AV  +G ++W K  E +PGR+  QC
Subjt:  NASILKREWTKDED-------------------------DNFDLLLLYLAWKKSLDPARTKRGYFTPYEDVRLKIAVMLFGPKNWNKKAEFLPGRNQVQC

Query:  RERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYS-WAKVAACVPSRTDNECRRRWKKL
        R+R+   L  SL++  W  +E+ +L   I+++G   WA++A+ +P R+ ++C  +WK L
Subjt:  RERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYS-WAKVAACVPSRTDNECRRRWKKL

Arabidopsis top hitse value%identityAlignment
AT3G18100.1 myb domain protein 4r15.7e-13343.41Show/hide
Query:  SDSNDVDDFELLRDIQNRFSIVADEQPLSTLSPV--SADEEEDEFEMLRSIQRRFAAY-----ESDTLSNKPDQSRDYDGSLKMDSDDIAVESQTSSKRP
        SDS   DDFE++R I+++ S+  D     +L P+  S DEE+D FE LR+I+RRF+AY     E   +++   + +    S    S +I   S T    P
Subjt:  SDSNDVDDFELLRDIQNRFSIVADEQPLSTLSPV--SADEEEDEFEMLRSIQRRFAAY-----ESDTLSNKPDQSRDYDGSLKMDSDDIAVESQTSSKRP

Query:  -----------------SMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRCKILKDFQCSCKRRTSSALSQMIDPRVQLISA
                           +     S P+AA AFVDAI++NR+ QKF+R K+  +EA IE+N+K +K  +I+KDFQ SCKR T  AL Q  DPRV+LIS 
Subjt:  -----------------SMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRCKILKDFQCSCKRRTSSALSQMIDPRVQLISA

Query:  AKPQAKDSSK----------KDKRLSGMYYGPAENSHVACHRMALAKFP-RVDRKKWSIVERENLGKGIRQQFQEMVLQISVDQFSGQQGVSGDSDDLDN
         K    DSS+           DK++S +  GPAEN  V  +RMAL K+P  V R+KWS  E +NL KG++Q+ Q+++L  ++++ S  +G    + D+D 
Subjt:  AKPQAKDSSK----------KDKRLSGMYYGPAENSHVACHRMALAKFP-RVDRKKWSIVERENLGKGIRQQFQEMVLQISVDQFSGQQGVSGDSDDLDN

Query:  ILASIKDLDIAPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWT-------------KGLNNWIEIAVSLGTNRTPFQCLSRYQRS
        I  SI +L+I PE IR+FLPK+NWD   S+ ++ RS AECEARW++ EDPLIN  PWT               L +W++IAVSLGTNRTPFQCL+RYQRS
Subjt:  ILASIKDLDIAPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWT-------------KGLNNWIEIAVSLGTNRTPFQCLSRYQRS

Query:  LNASILKREWTKDEDDNFDLLLLYLA----------------------WKKSLDPARTKRGYFTPYEDVRLKIAVMLFGPKNWNKKAEFLPGRNQVQCRE
        LN SILK+EWT +EDD     +                          WKKSL P  T++G ++  ED R+K+AV LFG +NW+K ++F+PGR Q QCRE
Subjt:  LNASILKREWTKDEDDNFDLLLLYLA----------------------WKKSLDPARTKRGYFTPYEDVRLKIAVMLFGPKNWNKKAEFLPGRNQVQCRE

Query:  RWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAALISNFVDRETER--LPNTDLLCNTDG
        RW NCLDP + R +WTEEED +L  AI EHGYSW+KVA  +  RTDN+C RRWK+L+P +V LLQEAR++QK A + NFVDRE+ER  L  + +L   D 
Subjt:  RWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAALISNFVDRETER--LPNTDLLCNTDG

Query:  PRPVPKRNVKTRKTPVSRNGKSATGDAPKKRKSNYQRFQTDATAQ
               +V  +K   ++  KS     PK+R+   +    D   Q
Subjt:  PRPVPKRNVKTRKTPVSRNGKSATGDAPKKRKSNYQRFQTDATAQ

AT3G18100.2 myb domain protein 4r16.3e-12447.12Show/hide
Query:  SLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRCKILKDFQCSCKRRTSSALSQMIDPRVQLISAAKPQAKDSSK----------KDKRL
        S P+AA AFVDAI++NR+ QKF+R K+  +EA IE+N+K +K  +I+KDFQ SCKR T  AL Q  DPRV+LIS  K    DSS+           DK++
Subjt:  SLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRCKILKDFQCSCKRRTSSALSQMIDPRVQLISAAKPQAKDSSK----------KDKRL

Query:  SGMYYGPAENSHVACHRMALAKFP-RVDRKKWSIVERENLGKGIRQQFQEMVLQISVDQFSGQQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWD
        S +  GPAEN  V  +RMAL K+P  V R+KWS  E +NL KG++Q+ Q+++L  ++++ S  +G    + D+D I  SI +L+I PE IR+FLPK+NWD
Subjt:  SGMYYGPAENSHVACHRMALAKFP-RVDRKKWSIVERENLGKGIRQQFQEMVLQISVDQFSGQQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWD

Query:  KLASMYLQGRSGAECEARWLNFEDPLINRDPWT-------------KGLNNWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDNFDLLLLYL
           S+ ++ RS AECEARW++ EDPLIN  PWT               L +W++IAVSLGTNRTPFQCL+RYQRSLN SILK+EWT +EDD     +   
Subjt:  KLASMYLQGRSGAECEARWLNFEDPLINRDPWT-------------KGLNNWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDNFDLLLLYL

Query:  A----------------------WKKSLDPARTKRGYFTPYEDVRLKIAVMLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEI
                               WKKSL P  T++G ++  ED R+K+AV LFG +NW+K ++F+PGR Q QCRERW NCLDP + R +WTEEED +L  
Subjt:  A----------------------WKKSLDPARTKRGYFTPYEDVRLKIAVMLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEI

Query:  AIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAALISNFVDRETER--LPNTDLLCNTDGPRPVPKRNVKTRKTPVSRNGKSATG
        AI EHGYSW+KVA  +  RTDN+C RRWK+L+P +V LLQEAR++QK A + NFVDRE+ER  L  + +L   D        +V  +K   ++  KS   
Subjt:  AIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAALISNFVDRETER--LPNTDLLCNTDGPRPVPKRNVKTRKTPVSRNGKSATG

Query:  DAPKKRKSNYQRFQTDATAQ
          PK+R+   +    D   Q
Subjt:  DAPKKRKSNYQRFQTDATAQ

AT3G18100.3 myb domain protein 4r18.5e-11339.32Show/hide
Query:  NHVDEIDVEHRADKEDGVVDEDMEVLQRAYRLVGVNPEDYIHPRSSSIT---AGDADPGSDSNDVDDFELLRDIQNRFSIVAD----EQPLSTLSPVSAD
        N + E D +   D+ED  + ED+E L+RA  +  VN  D    ++ SI     G  +  SDS + DDFE+LR I+++ +   D      P   LS +S  
Subjt:  NHVDEIDVEHRADKEDGVVDEDMEVLQRAYRLVGVNPEDYIHPRSSSIT---AGDADPGSDSNDVDDFELLRDIQNRFSIVAD----EQPLSTLSPVSAD

Query:  EEEDEFEMLRSIQRRFAAYESDTLSNKPDQSRDYDGSLKMDSDDIAVESQTSSKR-----PSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLE
        E ED+FEM+RSI+ +        LS   D S    G L  D +D A E+  + +R      +   F   S  K      + +   ++Q+ +   KM+  +
Subjt:  EEEDEFEMLRSIQRRFAAYESDTLSNKPDQSRDYDGSLKMDSDDIAVESQTSSKR-----PSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLE

Query:  ARIEENKKLRKRC-----------KILKDFQCSCKRRTSSALS--QMIDPRVQLISAAKPQAKDSSK----------KDKRLSGMYYGPAENSHVACHRM
         R++ + KL +R              L++     K+R+S   +  +M DPRV+LIS  K    DSS+           DK++S +  GPAEN  V  +RM
Subjt:  ARIEENKKLRKRC-----------KILKDFQCSCKRRTSSALS--QMIDPRVQLISAAKPQAKDSSK----------KDKRLSGMYYGPAENSHVACHRM

Query:  ALAKFP-RVDRKKWSIVERENLGKGIRQQFQEMVLQISVDQFSGQQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDKLASMYLQGRSGAECEAR
        AL K+P  V R+KWS  E +NL KG++Q+ Q+++L  ++++ S  +G    + D+D I  SI +L+I PE IR+FLPK+NWD   S+ ++ RS AECEAR
Subjt:  ALAKFP-RVDRKKWSIVERENLGKGIRQQFQEMVLQISVDQFSGQQGVSGDSDDLDNILASIKDLDIAPEKIREFLPKVNWDKLASMYLQGRSGAECEAR

Query:  WLNFEDPLINRDPWT-------------KGLNNWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDNFDLLLLYLA-----------------
        W++ EDPLIN  PWT               L +W++IAVSLGTNRTPFQCL+RYQRSLN SILK+EWT +EDD     +                     
Subjt:  WLNFEDPLINRDPWT-------------KGLNNWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDNFDLLLLYLA-----------------

Query:  -----WKKSLDPARTKRGYFTPYEDVRLKIAVMLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPS
             WKKSL P  T++G ++  ED R+K+AV LFG +NW+K ++F+PGR Q QCRERW NCLDP + R +WTEEED +L  AI EHGYSW+KVA  +  
Subjt:  -----WKKSLDPARTKRGYFTPYEDVRLKIAVMLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPS

Query:  RTDNECRRRWKKLFPDEVPLLQEARKIQKAALISNFVDRETER--LPNTDLLCNTDGPRPVPKRNVKTRKTPVSRNGKSATGDAPKKRKSNYQRFQTDAT
        RTDN+C RRWK+L+P +V LLQEAR++QK A + NFVDRE+ER  L  + +L   D        +V  +K   ++  KS     PK+R+   +    D  
Subjt:  RTDNECRRRWKKLFPDEVPLLQEARKIQKAALISNFVDRETER--LPNTDLLCNTDGPRPVPKRNVKTRKTPVSRNGKSATGDAPKKRKSNYQRFQTDAT

Query:  AQ
         Q
Subjt:  AQ

AT5G11510.1 myb domain protein 3r-41.6e-2331.66Show/hide
Query:  PKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTKGLNNWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDNFDLLLLYLAWKKSLD
        P+    KL      G +      +W   ED ++ +   +    NW +IA     +RT  QCL R+Q+ LN  ++K  WTK+ED+    +++ L  K    
Subjt:  PKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTKGLNNWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDNFDLLLLYLAWKKSLD

Query:  PARTKRGYFTPYEDVRLKIAVMLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRW
                               +GPK W+  A FLPGR   QCRERW N L+P++ +  WT+EE+L L  A Q +G  WA++   +P R+DN  +  W
Subjt:  PARTKRGYFTPYEDVRLKIAVMLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRW

AT5G11510.2 myb domain protein 3r-41.6e-2331.66Show/hide
Query:  PKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTKGLNNWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDNFDLLLLYLAWKKSLD
        P+    KL      G +      +W   ED ++ +   +    NW +IA     +RT  QCL R+Q+ LN  ++K  WTK+ED+    +++ L  K    
Subjt:  PKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTKGLNNWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDNFDLLLLYLAWKKSLD

Query:  PARTKRGYFTPYEDVRLKIAVMLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRW
                               +GPK W+  A FLPGR   QCRERW N L+P++ +  WT+EE+L L  A Q +G  WA++   +P R+DN  +  W
Subjt:  PARTKRGYFTPYEDVRLKIAVMLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTCTCCACAACCATGTTGATGAAATTGACGTTGAGCATCGTGCTGACAAGGAAGATGGTGTGGTTGATGAGGACATGGAAGTTCTTCAGAGAGCCTATAGGCTTGT
TGGTGTTAATCCTGAGGATTATATTCATCCCAGGTCGTCATCAATTACTGCTGGAGACGCTGATCCTGGTTCTGATTCTAATGATGTTGATGATTTTGAACTTCTTCGAG
ATATTCAGAATCGGTTCTCGATTGTGGCTGATGAGCAGCCACTGAGTACTCTTTCACCAGTGTCAGCAGACGAGGAGGAAGATGAATTTGAGATGCTTCGTTCTATACAG
CGGCGCTTTGCAGCGTATGAAAGTGATACTTTGAGCAATAAACCCGATCAGTCTCGTGACTATGATGGGTCTCTGAAGATGGATTCTGACGACATAGCTGTTGAGAGCCA
GACATCCTCAAAAAGGCCATCCATGCTAGCCTTTGAAAAGGGAAGCTTGCCAAAGGCTGCGTTGGCATTTGTTGATGCCATCAAGAAGAATAGGTCCCAGCAGAAGTTTA
TTCGTAGTAAGATGATTCATCTTGAAGCTAGAATAGAGGAGAACAAAAAACTCAGAAAACGTTGCAAAATTCTCAAAGATTTCCAGTGTTCATGTAAGCGAAGAACAAGT
TCTGCACTGTCTCAAATGATAGACCCTCGAGTCCAGTTAATTTCAGCTGCAAAACCACAAGCAAAGGATTCATCAAAGAAGGACAAACGATTATCTGGAATGTATTATGG
CCCGGCTGAGAATTCTCATGTTGCGTGTCACAGAATGGCATTGGCAAAGTTTCCCCGTGTTGATCGAAAGAAATGGTCCATTGTAGAAAGGGAGAATCTTGGGAAGGGAA
TAAGACAGCAGTTTCAGGAGATGGTGCTTCAGATTTCAGTGGATCAATTCAGTGGGCAACAAGGAGTTTCTGGAGATTCAGATGATTTGGATAACATTCTTGCATCAATA
AAAGATCTTGACATCGCTCCTGAAAAGATTAGGGAATTTCTGCCAAAAGTTAATTGGGACAAATTGGCTTCCATGTACCTTCAGGGTCGCTCAGGGGCAGAATGTGAAGC
AAGGTGGTTGAATTTTGAAGACCCCCTAATTAATCGAGATCCATGGACTAAGGGCTTGAATAACTGGATTGAGATAGCAGTTTCATTGGGCACAAATAGAACTCCCTTCC
AGTGCTTGTCTCGGTATCAAAGGAGTTTAAACGCTTCCATATTAAAGAGGGAGTGGACCAAAGATGAGGATGATAACTTCGATCTGCTGTTGCTATATTTGGCGTGGAAA
AAATCCCTTGACCCAGCTAGGACAAAAAGAGGTTATTTCACTCCATATGAAGACGTTCGTTTGAAAATTGCTGTAATGCTCTTTGGGCCCAAAAATTGGAACAAGAAAGC
AGAATTTTTGCCTGGTCGGAATCAAGTTCAATGCAGAGAAAGATGGTTTAATTGTTTAGATCCTTCCTTGAGAAGATGTGAGTGGACAGAAGAGGAGGATTTAAGGCTGG
AGATAGCAATTCAGGAACATGGATATAGCTGGGCTAAGGTAGCTGCATGTGTGCCATCACGTACAGATAATGAGTGTCGCAGGAGATGGAAGAAGTTATTTCCCGATGAA
GTTCCCTTGCTCCAGGAAGCTAGAAAGATTCAGAAGGCTGCTCTCATTAGCAACTTTGTTGATAGGGAAACAGAGCGTCTGCCAAATACAGATTTATTATGTAATACTGA
TGGTCCAAGACCTGTCCCAAAAAGAAATGTGAAAACAAGAAAGACGCCCGTGTCAAGGAATGGAAAGAGTGCTACTGGGGATGCTCCAAAGAAGAGGAAATCAAATTATC
AGAGGTTTCAGACGGATGCAACTGCTCAGGTGGGTATTGCATATAATACCTCTTTTGTCCCAGAAGAGGTTCAATCTTCAAAGCCTCAAAGGAAACGAAATAGACGTGGA
GCTTCTAATGCAAAGAGAATAGGGGTACCGGACCTACGTTCTGACAGCGAGTGGTGTGCTAAACAGAATTTGGACACTCAGAGCCTTGGGTTGCAGCTGAATAGTAAGGA
ATCTGATAGGACCAACAGTGACTGTACCGAGACTGTCGATGAGAACATTCTGGAAGTTTTTGAGAACAAAGTTGCAGAGAAGCTTACTGAAAAAATTGCATGCTTCTCGG
AACAAAAAAAAAATCAAAACTCAACCGGATCTTCTGGAGTCTCAGTATTGTCGGAAATGACTAGCGGCTTCGTTGACTATAATCCCTCTATCCTCACAGATACAACATTG
TTAGCTAGTACTACTGTGGATGATATCGAAGAATTGAAGGGCAAGAGTGTTGCAGACAGGGATCTGGATGACAGTAACAGTTTCTCGTTACCGCACAGTTGCTTAGAACT
CCGGACAGTTGACAGTGAAGGCGTAGACAGTTATTCTGTGGACGAATTTACAGCTAAAAGCAATGGGGTTTGCAATCCCACCCAAGGCAGGAGGAAGAAAAATGGCAAAA
CATCAAATAACAGCCACGATAATTTGTTTCTTCCTAGTCAACAAATTGAGCAAGAGACATTGGGAACGAAGAAGCCTCATCGTCATAATCAATCAAAGAAGAGAAAACAT
AACAATACAGGTACAAGTACGTTAGGAACATTGGAGGCAGTTGAGGAGGTTGATGACTGCACTCTCGTGGGGTTCTTGCAAAAAAGATTGAAGAGGACGGCAGTGACACA
TAACGAGACAGTTGATTGCAGTTCTAGTACTCCTTTAAATGTTGATGATGACGATAATGAGCCTACCATTGCCTCTTTTCTTAATAAATTGAAGAGAAAAAAGCATCAGC
GGCCTAGTGGTTGTGAGTTAAATTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCTCTCCACAACCATGTTGATGAAATTGACGTTGAGCATCGTGCTGACAAGGAAGATGGTGTGGTTGATGAGGACATGGAAGTTCTTCAGAGAGCCTATAGGCTTGT
TGGTGTTAATCCTGAGGATTATATTCATCCCAGGTCGTCATCAATTACTGCTGGAGACGCTGATCCTGGTTCTGATTCTAATGATGTTGATGATTTTGAACTTCTTCGAG
ATATTCAGAATCGGTTCTCGATTGTGGCTGATGAGCAGCCACTGAGTACTCTTTCACCAGTGTCAGCAGACGAGGAGGAAGATGAATTTGAGATGCTTCGTTCTATACAG
CGGCGCTTTGCAGCGTATGAAAGTGATACTTTGAGCAATAAACCCGATCAGTCTCGTGACTATGATGGGTCTCTGAAGATGGATTCTGACGACATAGCTGTTGAGAGCCA
GACATCCTCAAAAAGGCCATCCATGCTAGCCTTTGAAAAGGGAAGCTTGCCAAAGGCTGCGTTGGCATTTGTTGATGCCATCAAGAAGAATAGGTCCCAGCAGAAGTTTA
TTCGTAGTAAGATGATTCATCTTGAAGCTAGAATAGAGGAGAACAAAAAACTCAGAAAACGTTGCAAAATTCTCAAAGATTTCCAGTGTTCATGTAAGCGAAGAACAAGT
TCTGCACTGTCTCAAATGATAGACCCTCGAGTCCAGTTAATTTCAGCTGCAAAACCACAAGCAAAGGATTCATCAAAGAAGGACAAACGATTATCTGGAATGTATTATGG
CCCGGCTGAGAATTCTCATGTTGCGTGTCACAGAATGGCATTGGCAAAGTTTCCCCGTGTTGATCGAAAGAAATGGTCCATTGTAGAAAGGGAGAATCTTGGGAAGGGAA
TAAGACAGCAGTTTCAGGAGATGGTGCTTCAGATTTCAGTGGATCAATTCAGTGGGCAACAAGGAGTTTCTGGAGATTCAGATGATTTGGATAACATTCTTGCATCAATA
AAAGATCTTGACATCGCTCCTGAAAAGATTAGGGAATTTCTGCCAAAAGTTAATTGGGACAAATTGGCTTCCATGTACCTTCAGGGTCGCTCAGGGGCAGAATGTGAAGC
AAGGTGGTTGAATTTTGAAGACCCCCTAATTAATCGAGATCCATGGACTAAGGGCTTGAATAACTGGATTGAGATAGCAGTTTCATTGGGCACAAATAGAACTCCCTTCC
AGTGCTTGTCTCGGTATCAAAGGAGTTTAAACGCTTCCATATTAAAGAGGGAGTGGACCAAAGATGAGGATGATAACTTCGATCTGCTGTTGCTATATTTGGCGTGGAAA
AAATCCCTTGACCCAGCTAGGACAAAAAGAGGTTATTTCACTCCATATGAAGACGTTCGTTTGAAAATTGCTGTAATGCTCTTTGGGCCCAAAAATTGGAACAAGAAAGC
AGAATTTTTGCCTGGTCGGAATCAAGTTCAATGCAGAGAAAGATGGTTTAATTGTTTAGATCCTTCCTTGAGAAGATGTGAGTGGACAGAAGAGGAGGATTTAAGGCTGG
AGATAGCAATTCAGGAACATGGATATAGCTGGGCTAAGGTAGCTGCATGTGTGCCATCACGTACAGATAATGAGTGTCGCAGGAGATGGAAGAAGTTATTTCCCGATGAA
GTTCCCTTGCTCCAGGAAGCTAGAAAGATTCAGAAGGCTGCTCTCATTAGCAACTTTGTTGATAGGGAAACAGAGCGTCTGCCAAATACAGATTTATTATGTAATACTGA
TGGTCCAAGACCTGTCCCAAAAAGAAATGTGAAAACAAGAAAGACGCCCGTGTCAAGGAATGGAAAGAGTGCTACTGGGGATGCTCCAAAGAAGAGGAAATCAAATTATC
AGAGGTTTCAGACGGATGCAACTGCTCAGGTGGGTATTGCATATAATACCTCTTTTGTCCCAGAAGAGGTTCAATCTTCAAAGCCTCAAAGGAAACGAAATAGACGTGGA
GCTTCTAATGCAAAGAGAATAGGGGTACCGGACCTACGTTCTGACAGCGAGTGGTGTGCTAAACAGAATTTGGACACTCAGAGCCTTGGGTTGCAGCTGAATAGTAAGGA
ATCTGATAGGACCAACAGTGACTGTACCGAGACTGTCGATGAGAACATTCTGGAAGTTTTTGAGAACAAAGTTGCAGAGAAGCTTACTGAAAAAATTGCATGCTTCTCGG
AACAAAAAAAAAATCAAAACTCAACCGGATCTTCTGGAGTCTCAGTATTGTCGGAAATGACTAGCGGCTTCGTTGACTATAATCCCTCTATCCTCACAGATACAACATTG
TTAGCTAGTACTACTGTGGATGATATCGAAGAATTGAAGGGCAAGAGTGTTGCAGACAGGGATCTGGATGACAGTAACAGTTTCTCGTTACCGCACAGTTGCTTAGAACT
CCGGACAGTTGACAGTGAAGGCGTAGACAGTTATTCTGTGGACGAATTTACAGCTAAAAGCAATGGGGTTTGCAATCCCACCCAAGGCAGGAGGAAGAAAAATGGCAAAA
CATCAAATAACAGCCACGATAATTTGTTTCTTCCTAGTCAACAAATTGAGCAAGAGACATTGGGAACGAAGAAGCCTCATCGTCATAATCAATCAAAGAAGAGAAAACAT
AACAATACAGGTACAAGTACGTTAGGAACATTGGAGGCAGTTGAGGAGGTTGATGACTGCACTCTCGTGGGGTTCTTGCAAAAAAGATTGAAGAGGACGGCAGTGACACA
TAACGAGACAGTTGATTGCAGTTCTAGTACTCCTTTAAATGTTGATGATGACGATAATGAGCCTACCATTGCCTCTTTTCTTAATAAATTGAAGAGAAAAAAGCATCAGC
GGCCTAGTGGTTGTGAGTTAAATTAA
Protein sequenceShow/hide protein sequence
MSLHNHVDEIDVEHRADKEDGVVDEDMEVLQRAYRLVGVNPEDYIHPRSSSITAGDADPGSDSNDVDDFELLRDIQNRFSIVADEQPLSTLSPVSADEEEDEFEMLRSIQ
RRFAAYESDTLSNKPDQSRDYDGSLKMDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRCKILKDFQCSCKRRTS
SALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPAENSHVACHRMALAKFPRVDRKKWSIVERENLGKGIRQQFQEMVLQISVDQFSGQQGVSGDSDDLDNILASI
KDLDIAPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTKGLNNWIEIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDNFDLLLLYLAWK
KSLDPARTKRGYFTPYEDVRLKIAVMLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDE
VPLLQEARKIQKAALISNFVDRETERLPNTDLLCNTDGPRPVPKRNVKTRKTPVSRNGKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKPQRKRNRRG
ASNAKRIGVPDLRSDSEWCAKQNLDTQSLGLQLNSKESDRTNSDCTETVDENILEVFENKVAEKLTEKIACFSEQKKNQNSTGSSGVSVLSEMTSGFVDYNPSILTDTTL
LASTTVDDIEELKGKSVADRDLDDSNSFSLPHSCLELRTVDSEGVDSYSVDEFTAKSNGVCNPTQGRRKKNGKTSNNSHDNLFLPSQQIEQETLGTKKPHRHNQSKKRKH
NNTGTSTLGTLEAVEEVDDCTLVGFLQKRLKRTAVTHNETVDCSSSTPLNVDDDDNEPTIASFLNKLKRKKHQRPSGCELN