; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI03G06560 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI03G06560
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionsnRNA-activating protein complex subunit 4
Genome locationChr3:5736632..5748670
RNA-Seq ExpressionCSPI03G06560
SyntenyCSPI03G06560
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000978 - RNA polymerase II proximal promoter sequence-specific DNA binding (molecular function)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
InterPro domainsIPR001005 - SANT/Myb domain
IPR009057 - Homeobox-like domain superfamily
IPR017930 - Myb domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_011650584.1 uncharacterized protein LOC101216287 [Cucumis sativus]0.0e+0098.61Show/hide
Query:  MSLRNHVDEIDVEHPADQEDGVVDEDMEVLQRAYRLAGVNPEDYINPRLSSPAAGDADPGSDSDDVDDFELLRDIQNRFSILADEQPQSTLPPVSADEEE
        MSLRNHVDEIDVEHPAD+EDGVVDEDMEVLQRAYRLAGVNPEDYINPRLSSPAAGDADPGSDSDDVDDFELLRDIQNRFSILADEQPQST  PVSADEEE
Subjt:  MSLRNHVDEIDVEHPADQEDGVVDEDMEVLQRAYRLAGVNPEDYINPRLSSPAAGDADPGSDSDDVDDFELLRDIQNRFSILADEQPQSTLPPVSADEEE

Query:  DEFEMLRSIQRRFAAYESDTLSNKPNQSRDYVGSLKLDSDDIAVESQTSSKRQSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHFEARIEENKK
        DEFEMLRSIQRRFAAYESDTLSNKPNQSRDYVGSLKLDSDDIAVESQTSSKR SMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIH EARIEENKK
Subjt:  DEFEMLRSIQRRFAAYESDTLSNKPNQSRDYVGSLKLDSDDIAVESQTSSKRQSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHFEARIEENKK

Query:  LRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPDENSHVACYRMALAKFPSVDRKKWSIVERENLGKGIRQQFQE
        LRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPDENSHVACYRM LAKFP VDRKKWSIVERENLGKGIRQQFQE
Subjt:  LRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPDENSHVACYRMALAKFPSVDRKKWSIVERENLGKGIRQQFQE

Query:  MVLQISVDQISGPQGISGDSDDLDNILASIKDLDISPDKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKSLLFTIQQKGLN
        MVLQISVDQISGPQGISGDSDDLDNILASIKDLDI+PDKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKSLLFTIQQKGLN
Subjt:  MVLQISVDQISGPQGISGDSDDLDNILASIKDLDISPDKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKSLLFTIQQKGLN

Query:  NWIEMAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDRLRSAVATFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLKIAV
        NWIEMAVSLGTNRTPFQCLSRYQRSLNASILKREWTK+EDDRLRSAVATFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLKIAV
Subjt:  NWIEMAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDRLRSAVATFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLKIAV

Query:  LLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAAL
        LLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAAL
Subjt:  LLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAAL

Query:  ISNFVDRETERPALGPADFRPRPNTDSLCNTDGPIPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKPQRK
        ISNFVDRETERPALGPADFRPRPNTDSLCNTDGPIPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKPQRK
Subjt:  ISNFVDRETERPALGPADFRPRPNTDSLCNTDGPIPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKPQRK

Query:  RNRRGAYTAKRIGVPELRSDSEWCAKQNLDTESLGLQLNSKESERSNSNCTETVDENIMEVLENKVAEKLTEENGCFSEPEKNQNSTGSSGISVLSEMTN
        RNRRGAYTAKRIGVPELRSDSEWCAKQNLDTESLGLQLNSKESERSNSNCTETVDENIMEVLENKVAEKLTEEN CFSEPEKNQNSTGSSG+SVLSEMTN
Subjt:  RNRRGAYTAKRIGVPELRSDSEWCAKQNLDTESLGLQLNSKESERSNSNCTETVDENIMEVLENKVAEKLTEENGCFSEPEKNQNSTGSSGISVLSEMTN

Query:  DLVDYNPSILTDTTLFASTTVDDTEELKGKSAADRDLDDSNSFSLAHSCLELRTVDSEGVDSYSVDEYTAKSDGVCNPTQGRRKKNSKTSNNSHDNLLIP
        DLVDYNPSILTDTTLFASTTVDD EELKGKSAADRDLDDSNSFSLAHSCLELRTVDSEGVDSYSVDEYTAKS+GVCNPTQGRRKKNSKTSNNSHDNLLIP
Subjt:  DLVDYNPSILTDTTLFASTTVDDTEELKGKSAADRDLDDSNSFSLAHSCLELRTVDSEGVDSYSVDEYTAKSDGVCNPTQGRRKKNSKTSNNSHDNLLIP

Query:  RQQIVQETLGTKKPLHDNQSKKRKHSNTGPSTLKTSEAVEEVDDCTLVGFLQKRLKRTAMTHNETVDCSSNAPLKVDNDDNEPTIASFLNKLKRKKHQRP
        RQQIVQETLGTKKPLH NQSKKRKHSNTGPSTLKTSEAVEEVDDCTLVGFLQKRLKRTAMTHNETVDCSSNAPLKVDNDDNEPTIASFLNKLKRKKHQRP
Subjt:  RQQIVQETLGTKKPLHDNQSKKRKHSNTGPSTLKTSEAVEEVDDCTLVGFLQKRLKRTAMTHNETVDCSSNAPLKVDNDDNEPTIASFLNKLKRKKHQRP

Query:  SGDELN
        SGDELN
Subjt:  SGDELN

XP_023515735.1 uncharacterized protein LOC111779809 isoform X1 [Cucurbita pepo subsp. pepo]0.0e+0078.34Show/hide
Query:  MSLRNHVDEIDVEHPA---DQEDGVVDEDMEVLQRAYRLAGVNPEDYINPRLSSPAAGDADPGSDSDDVDDFELLRDIQNRFSILADEQPQSTLPPVSAD
        MS R+H D  D E PA   D ED +VD+DME L+RA RLAGVN EDYINPRLS PAAGDA+ GSDSDDVDD ELLR+IQNRFSI ADEQP S LPPV+AD
Subjt:  MSLRNHVDEIDVEHPA---DQEDGVVDEDMEVLQRAYRLAGVNPEDYINPRLSSPAAGDADPGSDSDDVDDFELLRDIQNRFSILADEQPQSTLPPVSAD

Query:  EEEDEFEMLRSIQRRFAAYESDTLSNKPNQSRDYVGSLKLDSDDIAVESQTSSKRQSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHFEARIEE
        EEED+FEMLRSIQRRFAAYESD LSNKP+QS D  G LK+DS++  VE  TSS+R SM+AFEKGSLPKAALAF+DAIKKNRSQQKFIRSKMIH EARIEE
Subjt:  EEEDEFEMLRSIQRRFAAYESDTLSNKPNQSRDYVGSLKLDSDDIAVESQTSSKRQSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHFEARIEE

Query:  NKKLRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPDENSHVACYRMALAKFPSVDRKKWSIVERENLGKGIRQQ
        NKKLRKR K+LK FQGSC+R+T+CAL+QM+DPRVQLISA KPQAKDSSKKDKRLS M YGP ENSHVACYR A  KF  VDRK+WS  ERENLGKGIRQQ
Subjt:  NKKLRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPDENSHVACYRMALAKFPSVDRKKWSIVERENLGKGIRQQ

Query:  FQEMVLQISVDQISGPQGISGDSDDLDNILASIKDLDISPDKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKSLLFTIQQK
        FQEMVLQISVDQIS  QG S +SDDLDNILASIK LDI+P+KIREFLPKVNWDKLASMYL+GRSGAECEARWLNFEDPLINR+PWTTSEDK+LLFTIQQK
Subjt:  FQEMVLQISVDQISGPQGISGDSDDLDNILASIKDLDISPDKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKSLLFTIQQK

Query:  GLNNWIEMAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDRLRSAVATFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLK
        GLNNWIE+AVSLGTNRTPFQCLSRYQRSLNASILK EWTKDEDD+LRSAVA FG  DWQAVASTLEGR G QCSNRWKKSLDPART++GYFTPDED RLK
Subjt:  GLNNWIEMAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDRLRSAVATFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLK

Query:  IAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQK
        IAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFP++VPLLQEARKIQK
Subjt:  IAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQK

Query:  AALISNFVDRETERPALGPADFRPRPNTDSLCNTDGPIPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKP
         ALISNFVDRE+ERPALGP DFRP PN+  LCNTD P  APKRNV+TR+MPVSRNEKSA GDAPK+RKSN QR + D TAQV    NTS VP EV+S+KP
Subjt:  AALISNFVDRETERPALGPADFRPRPNTDSLCNTDGPIPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKP

Query:  QRKRNRRGAYTAKRIGVPELRSDSEWCAKQNLDTESLGLQLNSKE-SERSNSNCTETVDENIMEVLENKVAEKLTEENGCFSEPEKNQNSTGSSGISVLS
        QRKR R GAYT +R G P++  +SE CA+QN DT S+ +QLN KE +ER NS+C ETVDEN MEV ENK AE  +E   CFSE E+NQNSTGSSG+SVLS
Subjt:  QRKRNRRGAYTAKRIGVPELRSDSEWCAKQNLDTESLGLQLNSKE-SERSNSNCTETVDENIMEVLENKVAEKLTEENGCFSEPEKNQNSTGSSGISVLS

Query:  EMTNDLVDYNPSILTDTTLFASTTVDDTEELKGKSAADRDLDDSNSFSLAHSCLELRTVDSEGVDSYSVDEYTAKSDGVCNPTQGRRKKNSKTSNNSHDN
        EMTND+ +YNPS L DTTL AS T DD  E KG + AD+DLDDSNSFSL  SCLELRT DSEGVDSYSVDE+T KS GVC P QGRRKKNSK SN S D+
Subjt:  EMTNDLVDYNPSILTDTTLFASTTVDDTEELKGKSAADRDLDDSNSFSLAHSCLELRTVDSEGVDSYSVDEYTAKSDGVCNPTQGRRKKNSKTSNNSHDN

Query:  LLIPRQQIVQETLGTKKPLHDNQSKKRKHSNTGPSTLKTSEAVEEVDDCTLVGFLQKRLKRTAMTHNETVDCSSNAPLKVDNDDNEPTIASFLN-KLKRK
         L+  QQ   E  GT +    NQSKKRKHS T  S L T EAVEEVDDCTL GFLQKRLKRT  TH++ VD SS+ P +VDNDDN+PT+A  LN KLKRK
Subjt:  LLIPRQQIVQETLGTKKPLHDNQSKKRKHSNTGPSTLKTSEAVEEVDDCTLVGFLQKRLKRTAMTHNETVDCSSNAPLKVDNDDNEPTIASFLN-KLKRK

Query:  KH
        KH
Subjt:  KH

XP_023515736.1 uncharacterized protein LOC111779809 isoform X2 [Cucurbita pepo subsp. pepo]0.0e+0078.24Show/hide
Query:  MSLRNHVDEIDVEHPA---DQEDGVVDEDMEVLQRAYRLAGVNPEDYINPRLSSPAAGDADPGSDSDDVDDFELLRDIQNRFSILADEQPQSTLPPVSAD
        MS R+H D  D E PA   D ED +VD+DME L+RA RLAGVN EDYINPRLS PAAGDA+ GSDSDDVDD ELLR+IQNRFSI ADEQP S LPPV+AD
Subjt:  MSLRNHVDEIDVEHPA---DQEDGVVDEDMEVLQRAYRLAGVNPEDYINPRLSSPAAGDADPGSDSDDVDDFELLRDIQNRFSILADEQPQSTLPPVSAD

Query:  EEEDEFEMLRSIQRRFAAYESDTLSNKPNQSRDYVGSLKLDSDDIAVESQTSSKRQSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHFEARIEE
        EEED+FEMLRSIQRRFAAYESD LSNKP+QS D  G LK+DS++  VE  TSS+R SM+AFEKGSLPKAALAF+DAIKKNRSQQKFIRSKMIH EARIEE
Subjt:  EEEDEFEMLRSIQRRFAAYESDTLSNKPNQSRDYVGSLKLDSDDIAVESQTSSKRQSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHFEARIEE

Query:  NKKLRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPDENSHVACYRMALAKFPSVDRKKWSIVERENLGKGIRQQ
        NKKLRKR K+LK FQGSC+R+T+CAL+QM+DPRVQLISA KPQAKDSSKKDKRLS M YGP ENSHVACYR A  KF  VDRK+WS  ERENLGKGIRQQ
Subjt:  NKKLRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPDENSHVACYRMALAKFPSVDRKKWSIVERENLGKGIRQQ

Query:  FQEMVLQISVDQISGPQGISGDSDDLDNILASIKDLDISPDKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKSLLFTIQQK
        FQEMVLQISVDQIS  QG S +SDDLDNILASIK LDI+P+KIREFLPKVNWDKLASMYL+GRSGAECEARWLNFEDPLINR+PWTTSEDK+LLFTIQQK
Subjt:  FQEMVLQISVDQISGPQGISGDSDDLDNILASIKDLDISPDKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKSLLFTIQQK

Query:  GLNNWIEMAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDRLRSAVATFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLK
        GLNNWIE+AVSLGTNRTPFQCLSRYQRSLNASILK EWTKDEDD+LRSAVA FG  DWQAVASTLEGR G QCSNRWKKSLDPART++GYFTPDED RLK
Subjt:  GLNNWIEMAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDRLRSAVATFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLK

Query:  IAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQK
        IAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNEC RRWKKLFP++VPLLQEARKIQK
Subjt:  IAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQK

Query:  AALISNFVDRETERPALGPADFRPRPNTDSLCNTDGPIPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKP
         ALISNFVDRE+ERPALGP DFRP PN+  LCNTD P  APKRNV+TR+MPVSRNEKSA GDAPK+RKSN QR + D TAQV    NTS VP EV+S+KP
Subjt:  AALISNFVDRETERPALGPADFRPRPNTDSLCNTDGPIPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKP

Query:  QRKRNRRGAYTAKRIGVPELRSDSEWCAKQNLDTESLGLQLNSKE-SERSNSNCTETVDENIMEVLENKVAEKLTEENGCFSEPEKNQNSTGSSGISVLS
        QRKR R GAYT +R G P++  +SE CA+QN DT S+ +QLN KE +ER NS+C ETVDEN MEV ENK AE  +E   CFSE E+NQNSTGSSG+SVLS
Subjt:  QRKRNRRGAYTAKRIGVPELRSDSEWCAKQNLDTESLGLQLNSKE-SERSNSNCTETVDENIMEVLENKVAEKLTEENGCFSEPEKNQNSTGSSGISVLS

Query:  EMTNDLVDYNPSILTDTTLFASTTVDDTEELKGKSAADRDLDDSNSFSLAHSCLELRTVDSEGVDSYSVDEYTAKSDGVCNPTQGRRKKNSKTSNNSHDN
        EMTND+ +YNPS L DTTL AS T DD  E KG + AD+DLDDSNSFSL  SCLELRT DSEGVDSYSVDE+T KS GVC P QGRRKKNSK SN S D+
Subjt:  EMTNDLVDYNPSILTDTTLFASTTVDDTEELKGKSAADRDLDDSNSFSLAHSCLELRTVDSEGVDSYSVDEYTAKSDGVCNPTQGRRKKNSKTSNNSHDN

Query:  LLIPRQQIVQETLGTKKPLHDNQSKKRKHSNTGPSTLKTSEAVEEVDDCTLVGFLQKRLKRTAMTHNETVDCSSNAPLKVDNDDNEPTIASFLN-KLKRK
         L+  QQ   E  GT +    NQSKKRKHS T  S L T EAVEEVDDCTL GFLQKRLKRT  TH++ VD SS+ P +VDNDDN+PT+A  LN KLKRK
Subjt:  LLIPRQQIVQETLGTKKPLHDNQSKKRKHSNTGPSTLKTSEAVEEVDDCTLVGFLQKRLKRTAMTHNETVDCSSNAPLKVDNDDNEPTIASFLN-KLKRK

Query:  KH
        KH
Subjt:  KH

XP_038905712.1 uncharacterized protein LOC120091681 isoform X1 [Benincasa hispida]0.0e+0084.12Show/hide
Query:  MSLRNHVDEIDVEHPADQEDGVVDEDMEVLQRAYRLAGVNPEDYINPRLSSPAAGDADPGSDSDDVDDFELLRDIQNRFSILADEQPQSTLPPVSADEEE
        MS  NH DE DVE PA++ED VVDEDMEVLQRAYRL GVNPEDYINPRLSSPA GDA+ G DSDD DDFELLR+IQNRFSI+ DEQP STLPPVS DEEE
Subjt:  MSLRNHVDEIDVEHPADQEDGVVDEDMEVLQRAYRLAGVNPEDYINPRLSSPAAGDADPGSDSDDVDDFELLRDIQNRFSILADEQPQSTLPPVSADEEE

Query:  DEFEMLRSIQRRFAAYESDTLSNKPNQSRDYVGSLKLDSDDIAVESQTSSKRQSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHFEARIEENKK
        DEFEMLRSIQRRFAAYESD LSNKPN+SRDYVGSLK+DS + A ESQTSSKR SM+AFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIH EARIEENKK
Subjt:  DEFEMLRSIQRRFAAYESDTLSNKPNQSRDYVGSLKLDSDDIAVESQTSSKRQSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHFEARIEENKK

Query:  LRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPDENSHVACYRMALAKFPSVDRKKWSIVERENLGKGIRQQFQE
        LRKRCKILKDFQGSCKR+T+CALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGP ENSHVACYRMALAKFP VDRKKWSIVERENLGKGIRQQFQE
Subjt:  LRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPDENSHVACYRMALAKFPSVDRKKWSIVERENLGKGIRQQFQE

Query:  MVLQISVDQISGPQGISGDSDDLDNILASIKDLDISPDKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKSLLFTIQQKGLN
        MVLQISVDQISG QG S DSDDLDNILASIKDLDI+P+KIREFLPKVNWDKLASMYL GRSGAECEARWLNFEDPLINRDPWTTSEDK+LLFTIQQKGLN
Subjt:  MVLQISVDQISGPQGISGDSDDLDNILASIKDLDISPDKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKSLLFTIQQKGLN

Query:  NWIEMAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDRLRSAVATFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLKIAV
        NWIE+AVS GTNRTPFQCLSRYQRSLNASILKREWTKDEDD+LRSAVA FGVRDWQAVASTLEGRAGTQCSNRWKKSLDPART++G+FTPDED RLKIAV
Subjt:  NWIEMAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDRLRSAVATFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLKIAV

Query:  LLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAAL
        LL GPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDN+CRRRWKKLFP+EVPLLQEARKIQKAAL
Subjt:  LLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAAL

Query:  ISNFVDRETERPALGPADFRPRPNTDSLCNTDGPIPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKPQRK
        ISNFVDRE+ERPALGPADFRPR NTD LC+TD P PAPKRN KTRKMPVSRNEKSATGDAP+KRKSNYQR Q DATA+VGIA NTS VPEEVQS KP RK
Subjt:  ISNFVDRETERPALGPADFRPRPNTDSLCNTDGPIPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKPQRK

Query:  RNRRGAYTAKRIGVPELRSDSEWCAKQNLDTESLGLQLNSKESERSNSNCTETVDENIMEVLENKVAEKLTEENGCFSEPEKNQNSTGSSGISVLSEMTN
        RNR  A T KR GV EL S ++WCAKQNL+T S+G+QL+SKE E +NS+ TETVD N +EV ENK+A+KL+E +  FSEPE+NQNSTGSSG+SVLSEMTN
Subjt:  RNRRGAYTAKRIGVPELRSDSEWCAKQNLDTESLGLQLNSKESERSNSNCTETVDENIMEVLENKVAEKLTEENGCFSEPEKNQNSTGSSGISVLSEMTN

Query:  DLVDYNPSILTDTTLFASTTVDDTEELKGKSAADRDLDDSNSFSLAHSCLELRTVDSEGVDSYSVDEYTAKSDGVCNPTQGRRKKNSKTSNNSHDNLLIP
        D+ +YNPSIL DTTL ASTTVDD EELKGKS ADRDLDDSNSFSL  SCLELRT+D EGVDSYSVD+ T KS  VC   QGRRKKNSKTS+ +H+   + 
Subjt:  DLVDYNPSILTDTTLFASTTVDDTEELKGKSAADRDLDDSNSFSLAHSCLELRTVDSEGVDSYSVDEYTAKSDGVCNPTQGRRKKNSKTSNNSHDNLLIP

Query:  RQQIVQETLGTKKPLHDNQSKKRKHSNTGPSTLKTSEAVEEVDDCTLVGFLQKRLKRTAMTHNETVDCSSNAPLKVDNDDNEPTIASFLNKLKRKKHQRP
         QQ+ QE LG  +P H NQSKKRKHS+T  S L T EAVEEVD+CTLVGFLQKRLK+        VDCSS  PL+VDNDDN+  IASFLNKLKRKKHQ P
Subjt:  RQQIVQETLGTKKPLHDNQSKKRKHSNTGPSTLKTSEAVEEVDDCTLVGFLQKRLKRTAMTHNETVDCSSNAPLKVDNDDNEPTIASFLNKLKRKKHQRP

Query:  S
        S
Subjt:  S

XP_038905717.1 uncharacterized protein LOC120091681 isoform X2 [Benincasa hispida]0.0e+0084.12Show/hide
Query:  MSLRNHVDEIDVEHPADQEDGVVDEDMEVLQRAYRLAGVNPEDYINPRLSSPAAGDADPGSDSDDVDDFELLRDIQNRFSILADEQPQSTLPPVSADEEE
        MS  NH DE DVE PA++ED VVDEDMEVLQRAYRL GVNPEDYINPRLSSPA GDA+ G DSDD DDFELLR+IQNRFSI+ DEQP STLPPVS DEEE
Subjt:  MSLRNHVDEIDVEHPADQEDGVVDEDMEVLQRAYRLAGVNPEDYINPRLSSPAAGDADPGSDSDDVDDFELLRDIQNRFSILADEQPQSTLPPVSADEEE

Query:  DEFEMLRSIQRRFAAYESDTLSNKPNQSRDYVGSLKLDSDDIAVESQTSSKRQSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHFEARIEENKK
        DEFEMLRSIQRRFAAYESD LSNKPN+SRDYVGSLK+DS + A ESQTSSKR SM+AFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIH EARIEENKK
Subjt:  DEFEMLRSIQRRFAAYESDTLSNKPNQSRDYVGSLKLDSDDIAVESQTSSKRQSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHFEARIEENKK

Query:  LRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPDENSHVACYRMALAKFPSVDRKKWSIVERENLGKGIRQQFQE
        LRKRCKILKDFQGSCKR+T+CALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGP ENSHVACYRMALAKFP VDRKKWSIVERENLGKGIRQQFQE
Subjt:  LRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPDENSHVACYRMALAKFPSVDRKKWSIVERENLGKGIRQQFQE

Query:  MVLQISVDQISGPQGISGDSDDLDNILASIKDLDISPDKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKSLLFTIQQKGLN
        MVLQISVDQISG QG S DSDDLDNILASIKDLDI+P+KIREFLPKVNWDKLASMYL GRSGAECEARWLNFEDPLINRDPWTTSEDK+LLFTIQQKGLN
Subjt:  MVLQISVDQISGPQGISGDSDDLDNILASIKDLDISPDKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKSLLFTIQQKGLN

Query:  NWIEMAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDRLRSAVATFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLKIAV
        NWIE+AVS GTNRTPFQCLSRYQRSLNASILKREWTKDEDD+LRSAVA FGVRDWQAVASTLEGRAGTQCSNRWKKSLDPART++G+FTPDED RLKIAV
Subjt:  NWIEMAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDRLRSAVATFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLKIAV

Query:  LLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAAL
        LL GPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDN+CRRRWKKLFP+EVPLLQEARKIQKAAL
Subjt:  LLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAAL

Query:  ISNFVDRETERPALGPADFRPRPNTDSLCNTDGPIPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKPQRK
        ISNFVDRE+ERPALGPADFRPR NTD LC+TD P PAPKRN KTRKMPVSRNEKSATGDAP+KRKSNYQR Q DATA+VGIA NTS VPEEVQS KP RK
Subjt:  ISNFVDRETERPALGPADFRPRPNTDSLCNTDGPIPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKPQRK

Query:  RNRRGAYTAKRIGVPELRSDSEWCAKQNLDTESLGLQLNSKESERSNSNCTETVDENIMEVLENKVAEKLTEENGCFSEPEKNQNSTGSSGISVLSEMTN
        RNR  A T KR GV EL S ++WCAKQNL+T S+G+QL+SKE E +NS+ TETVD N +EV ENK+A+KL+E +  FSEPE+NQNSTGSSG+SVLSEMTN
Subjt:  RNRRGAYTAKRIGVPELRSDSEWCAKQNLDTESLGLQLNSKESERSNSNCTETVDENIMEVLENKVAEKLTEENGCFSEPEKNQNSTGSSGISVLSEMTN

Query:  DLVDYNPSILTDTTLFASTTVDDTEELKGKSAADRDLDDSNSFSLAHSCLELRTVDSEGVDSYSVDEYTAKSDGVCNPTQGRRKKNSKTSNNSHDNLLIP
        D+ +YNPSIL DTTL ASTTVDD EELKGKS ADRDLDDSNSFSL  SCLELRT+D EGVDSYSVD+ T KS  VC   QGRRKKNSKTS+ +H+   + 
Subjt:  DLVDYNPSILTDTTLFASTTVDDTEELKGKSAADRDLDDSNSFSLAHSCLELRTVDSEGVDSYSVDEYTAKSDGVCNPTQGRRKKNSKTSNNSHDNLLIP

Query:  RQQIVQETLGTKKPLHDNQSKKRKHSNTGPSTLKTSEAVEEVDDCTLVGFLQKRLKRTAMTHNETVDCSSNAPLKVDNDDNEPTIASFLNKLKRKKHQRP
         QQ+ QE LG  +P H NQSKKRKHS+T  S L T EAVEEVD+CTLVGFLQKRLK+        VDCSS  PL+VDNDDN+  IASFLNKLKRKKHQ P
Subjt:  RQQIVQETLGTKKPLHDNQSKKRKHSNTGPSTLKTSEAVEEVDDCTLVGFLQKRLKRTAMTHNETVDCSSNAPLKVDNDDNEPTIASFLNKLKRKKHQRP

Query:  S
        S
Subjt:  S

TrEMBL top hitse value%identityAlignment
A0A0A0L2R2 Uncharacterized protein0.0e+0098.61Show/hide
Query:  MSLRNHVDEIDVEHPADQEDGVVDEDMEVLQRAYRLAGVNPEDYINPRLSSPAAGDADPGSDSDDVDDFELLRDIQNRFSILADEQPQSTLPPVSADEEE
        MSLRNHVDEIDVEHPAD+EDGVVDEDMEVLQRAYRLAGVNPEDYINPRLSSPAAGDADPGSDSDDVDDFELLRDIQNRFSILADEQPQST  PVSADEEE
Subjt:  MSLRNHVDEIDVEHPADQEDGVVDEDMEVLQRAYRLAGVNPEDYINPRLSSPAAGDADPGSDSDDVDDFELLRDIQNRFSILADEQPQSTLPPVSADEEE

Query:  DEFEMLRSIQRRFAAYESDTLSNKPNQSRDYVGSLKLDSDDIAVESQTSSKRQSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHFEARIEENKK
        DEFEMLRSIQRRFAAYESDTLSNKPNQSRDYVGSLKLDSDDIAVESQTSSKR SMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIH EARIEENKK
Subjt:  DEFEMLRSIQRRFAAYESDTLSNKPNQSRDYVGSLKLDSDDIAVESQTSSKRQSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHFEARIEENKK

Query:  LRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPDENSHVACYRMALAKFPSVDRKKWSIVERENLGKGIRQQFQE
        LRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPDENSHVACYRM LAKFP VDRKKWSIVERENLGKGIRQQFQE
Subjt:  LRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPDENSHVACYRMALAKFPSVDRKKWSIVERENLGKGIRQQFQE

Query:  MVLQISVDQISGPQGISGDSDDLDNILASIKDLDISPDKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKSLLFTIQQKGLN
        MVLQISVDQISGPQGISGDSDDLDNILASIKDLDI+PDKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKSLLFTIQQKGLN
Subjt:  MVLQISVDQISGPQGISGDSDDLDNILASIKDLDISPDKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKSLLFTIQQKGLN

Query:  NWIEMAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDRLRSAVATFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLKIAV
        NWIEMAVSLGTNRTPFQCLSRYQRSLNASILKREWTK+EDDRLRSAVATFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLKIAV
Subjt:  NWIEMAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDRLRSAVATFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLKIAV

Query:  LLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAAL
        LLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAAL
Subjt:  LLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAAL

Query:  ISNFVDRETERPALGPADFRPRPNTDSLCNTDGPIPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKPQRK
        ISNFVDRETERPALGPADFRPRPNTDSLCNTDGPIPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKPQRK
Subjt:  ISNFVDRETERPALGPADFRPRPNTDSLCNTDGPIPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKPQRK

Query:  RNRRGAYTAKRIGVPELRSDSEWCAKQNLDTESLGLQLNSKESERSNSNCTETVDENIMEVLENKVAEKLTEENGCFSEPEKNQNSTGSSGISVLSEMTN
        RNRRGAYTAKRIGVPELRSDSEWCAKQNLDTESLGLQLNSKESERSNSNCTETVDENIMEVLENKVAEKLTEEN CFSEPEKNQNSTGSSG+SVLSEMTN
Subjt:  RNRRGAYTAKRIGVPELRSDSEWCAKQNLDTESLGLQLNSKESERSNSNCTETVDENIMEVLENKVAEKLTEENGCFSEPEKNQNSTGSSGISVLSEMTN

Query:  DLVDYNPSILTDTTLFASTTVDDTEELKGKSAADRDLDDSNSFSLAHSCLELRTVDSEGVDSYSVDEYTAKSDGVCNPTQGRRKKNSKTSNNSHDNLLIP
        DLVDYNPSILTDTTLFASTTVDD EELKGKSAADRDLDDSNSFSLAHSCLELRTVDSEGVDSYSVDEYTAKS+GVCNPTQGRRKKNSKTSNNSHDNLLIP
Subjt:  DLVDYNPSILTDTTLFASTTVDDTEELKGKSAADRDLDDSNSFSLAHSCLELRTVDSEGVDSYSVDEYTAKSDGVCNPTQGRRKKNSKTSNNSHDNLLIP

Query:  RQQIVQETLGTKKPLHDNQSKKRKHSNTGPSTLKTSEAVEEVDDCTLVGFLQKRLKRTAMTHNETVDCSSNAPLKVDNDDNEPTIASFLNKLKRKKHQRP
        RQQIVQETLGTKKPLH NQSKKRKHSNTGPSTLKTSEAVEEVDDCTLVGFLQKRLKRTAMTHNETVDCSSNAPLKVDNDDNEPTIASFLNKLKRKKHQRP
Subjt:  RQQIVQETLGTKKPLHDNQSKKRKHSNTGPSTLKTSEAVEEVDDCTLVGFLQKRLKRTAMTHNETVDCSSNAPLKVDNDDNEPTIASFLNKLKRKKHQRP

Query:  SGDELN
        SGDELN
Subjt:  SGDELN

A0A6J1E2J4 uncharacterized protein LOC111430000 isoform X20.0e+0077.94Show/hide
Query:  MSLRNHVDEIDVEHPA---DQEDGVVDEDMEVLQRAYRLAGVNPEDYINPRLSSPAAGDADPGSDSDDVDDFELLRDIQNRFSILADEQPQSTLPPVSAD
        MS R+HVD  D E PA   D ED +VD+DME L+RA RLAGVN ED INPRLS PAAGDA+ GSDSDDVDD ELLR+IQNRFS  ADEQP S LPPV+AD
Subjt:  MSLRNHVDEIDVEHPA---DQEDGVVDEDMEVLQRAYRLAGVNPEDYINPRLSSPAAGDADPGSDSDDVDDFELLRDIQNRFSILADEQPQSTLPPVSAD

Query:  EEEDEFEMLRSIQRRFAAYESDTLSNKPNQSRDYVGSLKLDSDDIAVESQTSSKRQSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHFEARIEE
        EEED+FE LRSIQRRFAAYESD LSNKP+QS D  G LK+DSD+  V   TSS+R SM+AFEKGSLPKAALAF+DAIKKNRSQQKFIRSKMIH EARIEE
Subjt:  EEEDEFEMLRSIQRRFAAYESDTLSNKPNQSRDYVGSLKLDSDDIAVESQTSSKRQSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHFEARIEE

Query:  NKKLRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPDENSHVACYRMALAKFPSVDRKKWSIVERENLGKGIRQQ
        NKKLRKR K+LK FQGSC+R+T+CAL+QM+DPRVQLISA KPQAKDSSKKDKRLS M YGP ENSHVACYR AL KF  VDRK+WS  ERENLGKGIRQQ
Subjt:  NKKLRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPDENSHVACYRMALAKFPSVDRKKWSIVERENLGKGIRQQ

Query:  FQEMVLQISVDQISGPQGISGDSDDLDNILASIKDLDISPDKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKSLLFTIQQK
        FQEMVLQISVDQIS  QG S +SDDLDNILASIK LDI+P+KIREFLPKVNWDKLA MYLQGRSGAECEARWLNFEDPLINR+ WTTSEDK+LLFTIQQK
Subjt:  FQEMVLQISVDQISGPQGISGDSDDLDNILASIKDLDISPDKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKSLLFTIQQK

Query:  GLNNWIEMAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDRLRSAVATFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLK
        GLNNWIE+AVSLGTNRTPFQCLSRYQRSLNASILK EWTKDEDD+LRSAVA FG  DWQAVASTLEGR G QCSNRWKKSLDPART++GYFTPDED RLK
Subjt:  GLNNWIEMAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDRLRSAVATFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLK

Query:  IAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQK
        IAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNEC RRWKKLFP++VPLLQEARKIQK
Subjt:  IAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQK

Query:  AALISNFVDRETERPALGPADFRPRPNTDSLCNTDGPIPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKP
         ALISNFVDRE+ERPALGP DFRP PN+  LCNTD P  APKRNV+ R+MPVSRNEKSA GDAPKK KSN QR Q D TAQV  A NTS VP EV+S+KP
Subjt:  AALISNFVDRETERPALGPADFRPRPNTDSLCNTDGPIPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKP

Query:  QRKRNRRGAYTAKRIGVPELRSDSEWCAKQNLDTESLGLQLNSKE-SERSNSNCTETVDENIMEVLENKVAEKLTEENGCFSEPEKNQNSTGSSGISVLS
        QRKR R GAYT +R G P++  +SE CA+QN DT SL +QLN KE +ER NS+C ETVDEN MEV ENK AE  +E   CFSE E+NQNSTGSSG+SVLS
Subjt:  QRKRNRRGAYTAKRIGVPELRSDSEWCAKQNLDTESLGLQLNSKE-SERSNSNCTETVDENIMEVLENKVAEKLTEENGCFSEPEKNQNSTGSSGISVLS

Query:  EMTNDLVDYNPSILTDTTLFASTTVDDTEELKGKSAADRDLDDSNSFSLAHSCLELRTVDSEGVDSYSVDEYTAKSDGVCNPTQGRRKKNSKTSNNSHDN
        EMTND+ +YNPS   DTTL AS T DD  E KG + AD+DLDDSNSFSL  SCLELRT DSEGVDSYSVDE+T KS GVC P QGRRKKNSK SN S D+
Subjt:  EMTNDLVDYNPSILTDTTLFASTTVDDTEELKGKSAADRDLDDSNSFSLAHSCLELRTVDSEGVDSYSVDEYTAKSDGVCNPTQGRRKKNSKTSNNSHDN

Query:  LLIPRQQIVQETLGTKKPLHDNQSKKRKHSNTGPSTLKTSEAVEEVDDCTLVGFLQKRLKRTAMTHNETVDCSSNAPLKVDNDDNEPTIASFL-NKLKRK
         L+  QQ   E  G  +    NQSKKRKHS T  S L T EAVEEVDDCTL GFLQKRLKRT  TH++ VD SS+ P +VDNDDN+PT+A  L +KLKRK
Subjt:  LLIPRQQIVQETLGTKKPLHDNQSKKRKHSNTGPSTLKTSEAVEEVDDCTLVGFLQKRLKRTAMTHNETVDCSSNAPLKVDNDDNEPTIASFL-NKLKRK

Query:  KH
        KH
Subjt:  KH

A0A6J1E6Z7 uncharacterized protein LOC111430000 isoform X10.0e+0078.04Show/hide
Query:  MSLRNHVDEIDVEHPA---DQEDGVVDEDMEVLQRAYRLAGVNPEDYINPRLSSPAAGDADPGSDSDDVDDFELLRDIQNRFSILADEQPQSTLPPVSAD
        MS R+HVD  D E PA   D ED +VD+DME L+RA RLAGVN ED INPRLS PAAGDA+ GSDSDDVDD ELLR+IQNRFS  ADEQP S LPPV+AD
Subjt:  MSLRNHVDEIDVEHPA---DQEDGVVDEDMEVLQRAYRLAGVNPEDYINPRLSSPAAGDADPGSDSDDVDDFELLRDIQNRFSILADEQPQSTLPPVSAD

Query:  EEEDEFEMLRSIQRRFAAYESDTLSNKPNQSRDYVGSLKLDSDDIAVESQTSSKRQSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHFEARIEE
        EEED+FE LRSIQRRFAAYESD LSNKP+QS D  G LK+DSD+  V   TSS+R SM+AFEKGSLPKAALAF+DAIKKNRSQQKFIRSKMIH EARIEE
Subjt:  EEEDEFEMLRSIQRRFAAYESDTLSNKPNQSRDYVGSLKLDSDDIAVESQTSSKRQSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHFEARIEE

Query:  NKKLRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPDENSHVACYRMALAKFPSVDRKKWSIVERENLGKGIRQQ
        NKKLRKR K+LK FQGSC+R+T+CAL+QM+DPRVQLISA KPQAKDSSKKDKRLS M YGP ENSHVACYR AL KF  VDRK+WS  ERENLGKGIRQQ
Subjt:  NKKLRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPDENSHVACYRMALAKFPSVDRKKWSIVERENLGKGIRQQ

Query:  FQEMVLQISVDQISGPQGISGDSDDLDNILASIKDLDISPDKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKSLLFTIQQK
        FQEMVLQISVDQIS  QG S +SDDLDNILASIK LDI+P+KIREFLPKVNWDKLA MYLQGRSGAECEARWLNFEDPLINR+ WTTSEDK+LLFTIQQK
Subjt:  FQEMVLQISVDQISGPQGISGDSDDLDNILASIKDLDISPDKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKSLLFTIQQK

Query:  GLNNWIEMAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDRLRSAVATFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLK
        GLNNWIE+AVSLGTNRTPFQCLSRYQRSLNASILK EWTKDEDD+LRSAVA FG  DWQAVASTLEGR G QCSNRWKKSLDPART++GYFTPDED RLK
Subjt:  GLNNWIEMAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDRLRSAVATFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLK

Query:  IAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQK
        IAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFP++VPLLQEARKIQK
Subjt:  IAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQK

Query:  AALISNFVDRETERPALGPADFRPRPNTDSLCNTDGPIPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKP
         ALISNFVDRE+ERPALGP DFRP PN+  LCNTD P  APKRNV+ R+MPVSRNEKSA GDAPKK KSN QR Q D TAQV  A NTS VP EV+S+KP
Subjt:  AALISNFVDRETERPALGPADFRPRPNTDSLCNTDGPIPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKP

Query:  QRKRNRRGAYTAKRIGVPELRSDSEWCAKQNLDTESLGLQLNSKE-SERSNSNCTETVDENIMEVLENKVAEKLTEENGCFSEPEKNQNSTGSSGISVLS
        QRKR R GAYT +R G P++  +SE CA+QN DT SL +QLN KE +ER NS+C ETVDEN MEV ENK AE  +E   CFSE E+NQNSTGSSG+SVLS
Subjt:  QRKRNRRGAYTAKRIGVPELRSDSEWCAKQNLDTESLGLQLNSKE-SERSNSNCTETVDENIMEVLENKVAEKLTEENGCFSEPEKNQNSTGSSGISVLS

Query:  EMTNDLVDYNPSILTDTTLFASTTVDDTEELKGKSAADRDLDDSNSFSLAHSCLELRTVDSEGVDSYSVDEYTAKSDGVCNPTQGRRKKNSKTSNNSHDN
        EMTND+ +YNPS   DTTL AS T DD  E KG + AD+DLDDSNSFSL  SCLELRT DSEGVDSYSVDE+T KS GVC P QGRRKKNSK SN S D+
Subjt:  EMTNDLVDYNPSILTDTTLFASTTVDDTEELKGKSAADRDLDDSNSFSLAHSCLELRTVDSEGVDSYSVDEYTAKSDGVCNPTQGRRKKNSKTSNNSHDN

Query:  LLIPRQQIVQETLGTKKPLHDNQSKKRKHSNTGPSTLKTSEAVEEVDDCTLVGFLQKRLKRTAMTHNETVDCSSNAPLKVDNDDNEPTIASFL-NKLKRK
         L+  QQ   E  G  +    NQSKKRKHS T  S L T EAVEEVDDCTL GFLQKRLKRT  TH++ VD SS+ P +VDNDDN+PT+A  L +KLKRK
Subjt:  LLIPRQQIVQETLGTKKPLHDNQSKKRKHSNTGPSTLKTSEAVEEVDDCTLVGFLQKRLKRTAMTHNETVDCSSNAPLKVDNDDNEPTIASFL-NKLKRK

Query:  KH
        KH
Subjt:  KH

A0A6J1JK98 uncharacterized protein LOC111485355 isoform X20.0e+0077.77Show/hide
Query:  MSLRNHVDEIDVEHPA---DQEDGVVDEDMEVLQRAYRLAGVNPEDYINPRLSSPAAGDADPGSDSDDVDDFELLRDIQNRFSILADEQPQSTLPPVSAD
        MS R+HVD  D E PA   D ED +VD+DME L+RA RLAGVN EDY+NP+LS PAAGDA+ GSDSDDVDD ELLR+IQNRFSI ADEQP S LPPV+AD
Subjt:  MSLRNHVDEIDVEHPA---DQEDGVVDEDMEVLQRAYRLAGVNPEDYINPRLSSPAAGDADPGSDSDDVDDFELLRDIQNRFSILADEQPQSTLPPVSAD

Query:  EEEDEFEMLRSIQRRFAAYESDTLSNKPNQSRDYVGSLKLDSDDIAVESQTSSKRQSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHFEARIEE
        EEED+FE LRSIQRRFAAYESD LSNKP+QS D  G LK+DSD+  VE  TSS+R SM+AFEKGSLPKAALAF+DAIKKNRSQQKF+RSKMIH EARIEE
Subjt:  EEEDEFEMLRSIQRRFAAYESDTLSNKPNQSRDYVGSLKLDSDDIAVESQTSSKRQSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHFEARIEE

Query:  NKKLRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKP-QAKDSSKKDKRLSGMYYGPDENSHVACYRMALAKFPSVDRKKWSIVERENLGKGIRQ
        NKKLRKR K+LK FQGSC+R+T+CALSQM+DPRVQLISA KP QAKDSSKKDKRLS M YGP ENSHVACYR+AL KF  VDRK+WS  ERENLGKGIRQ
Subjt:  NKKLRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKP-QAKDSSKKDKRLSGMYYGPDENSHVACYRMALAKFPSVDRKKWSIVERENLGKGIRQ

Query:  QFQEMVLQISVDQISGPQGISGDSDDLDNILASIKDLDISPDKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKSLLFTIQQ
        QFQEMVLQISVDQIS  QG S +SDDLDNILASIKDLDI+P+KIREFLPKVNWDKLASMYL+GRSGAECEARWLNFEDPLINR+PWTTSEDK+LLFTIQQ
Subjt:  QFQEMVLQISVDQISGPQGISGDSDDLDNILASIKDLDISPDKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKSLLFTIQQ

Query:  KGLNNWIEMAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDRLRSAVATFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRL
        KGLNNWI++AVSLGTNRTPFQ LSRYQRSLNASILK EWTKDEDD+LRSAVA FG  DWQAVASTLEGR G QCSNRWKKSLDPART++GYFTPDED RL
Subjt:  KGLNNWIEMAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDRLRSAVATFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRL

Query:  KIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQ
        KIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNEC RRWKKLFP++VPLLQEARKIQ
Subjt:  KIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQ

Query:  KAALISNFVDRETERPALGPADFRPRPNTDSLCNTDGPIPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSK
        K ALISNFVDRE+ERPALGP DFRP PN+  LCNTD P  APKRNV+TR+MPVSRNEKSA GDAPKKRKSN QR + D TAQV  A NTS VP EV+S+K
Subjt:  KAALISNFVDRETERPALGPADFRPRPNTDSLCNTDGPIPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSK

Query:  PQRKRNRRGAYTAKRIGVPELRSDSEWCAKQNLDTESLGLQLNSKE-SERSNSNCTETVDENIMEVLENKVAEKLTEENGCFSEPEKNQNSTGSSGISVL
        PQRKR R GAYT +R G P++  +SE CA+QN DT +L +QLN KE +ER NS+C ETVDEN MEV ENK AE  +E   CFSE E+NQNSTGSSG+SVL
Subjt:  PQRKRNRRGAYTAKRIGVPELRSDSEWCAKQNLDTESLGLQLNSKE-SERSNSNCTETVDENIMEVLENKVAEKLTEENGCFSEPEKNQNSTGSSGISVL

Query:  SEMTNDLVDYNPSILTDTTLFASTTVDDTEELKGKSAADRDLDDSNSFSLAHSCLELRTVDSEGVDSYSVDEYTAKSDGVCNPTQGRRKKNSKTSNNSHD
        SEMTND+ +YNPS L DTTL AS T DD  E KG + AD+DLD SNSFSL  SCLELRT DSEGVDSYSVDE+T KS  VC P QGRRKKNSK SN S D
Subjt:  SEMTNDLVDYNPSILTDTTLFASTTVDDTEELKGKSAADRDLDDSNSFSLAHSCLELRTVDSEGVDSYSVDEYTAKSDGVCNPTQGRRKKNSKTSNNSHD

Query:  NLLIPRQQIVQETLGTKKPLHDNQSKKRKHSNTGPSTLKTSEAVEEVDDCTLVGFLQKRLKRTAMTHNETVDCSSNAPLKVDNDDNEPTIASFL-NKLKR
        + L+  QQ   E  GT +    NQ KKRKHS+T  S L T EAVEEVDDCTL+GFLQKRLKRT  TH + VD SS+   +VDNDDN+PT+A  L  KLKR
Subjt:  NLLIPRQQIVQETLGTKKPLHDNQSKKRKHSNTGPSTLKTSEAVEEVDDCTLVGFLQKRLKRTAMTHNETVDCSSNAPLKVDNDDNEPTIASFL-NKLKR

Query:  KKH
        KKH
Subjt:  KKH

A0A6J1JKV7 uncharacterized protein LOC111485355 isoform X10.0e+0077.87Show/hide
Query:  MSLRNHVDEIDVEHPA---DQEDGVVDEDMEVLQRAYRLAGVNPEDYINPRLSSPAAGDADPGSDSDDVDDFELLRDIQNRFSILADEQPQSTLPPVSAD
        MS R+HVD  D E PA   D ED +VD+DME L+RA RLAGVN EDY+NP+LS PAAGDA+ GSDSDDVDD ELLR+IQNRFSI ADEQP S LPPV+AD
Subjt:  MSLRNHVDEIDVEHPA---DQEDGVVDEDMEVLQRAYRLAGVNPEDYINPRLSSPAAGDADPGSDSDDVDDFELLRDIQNRFSILADEQPQSTLPPVSAD

Query:  EEEDEFEMLRSIQRRFAAYESDTLSNKPNQSRDYVGSLKLDSDDIAVESQTSSKRQSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHFEARIEE
        EEED+FE LRSIQRRFAAYESD LSNKP+QS D  G LK+DSD+  VE  TSS+R SM+AFEKGSLPKAALAF+DAIKKNRSQQKF+RSKMIH EARIEE
Subjt:  EEEDEFEMLRSIQRRFAAYESDTLSNKPNQSRDYVGSLKLDSDDIAVESQTSSKRQSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHFEARIEE

Query:  NKKLRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKP-QAKDSSKKDKRLSGMYYGPDENSHVACYRMALAKFPSVDRKKWSIVERENLGKGIRQ
        NKKLRKR K+LK FQGSC+R+T+CALSQM+DPRVQLISA KP QAKDSSKKDKRLS M YGP ENSHVACYR+AL KF  VDRK+WS  ERENLGKGIRQ
Subjt:  NKKLRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKP-QAKDSSKKDKRLSGMYYGPDENSHVACYRMALAKFPSVDRKKWSIVERENLGKGIRQ

Query:  QFQEMVLQISVDQISGPQGISGDSDDLDNILASIKDLDISPDKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKSLLFTIQQ
        QFQEMVLQISVDQIS  QG S +SDDLDNILASIKDLDI+P+KIREFLPKVNWDKLASMYL+GRSGAECEARWLNFEDPLINR+PWTTSEDK+LLFTIQQ
Subjt:  QFQEMVLQISVDQISGPQGISGDSDDLDNILASIKDLDISPDKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKSLLFTIQQ

Query:  KGLNNWIEMAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDRLRSAVATFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRL
        KGLNNWI++AVSLGTNRTPFQ LSRYQRSLNASILK EWTKDEDD+LRSAVA FG  DWQAVASTLEGR G QCSNRWKKSLDPART++GYFTPDED RL
Subjt:  KGLNNWIEMAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDRLRSAVATFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRL

Query:  KIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQ
        KIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFP++VPLLQEARKIQ
Subjt:  KIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQ

Query:  KAALISNFVDRETERPALGPADFRPRPNTDSLCNTDGPIPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSK
        K ALISNFVDRE+ERPALGP DFRP PN+  LCNTD P  APKRNV+TR+MPVSRNEKSA GDAPKKRKSN QR + D TAQV  A NTS VP EV+S+K
Subjt:  KAALISNFVDRETERPALGPADFRPRPNTDSLCNTDGPIPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSK

Query:  PQRKRNRRGAYTAKRIGVPELRSDSEWCAKQNLDTESLGLQLNSKE-SERSNSNCTETVDENIMEVLENKVAEKLTEENGCFSEPEKNQNSTGSSGISVL
        PQRKR R GAYT +R G P++  +SE CA+QN DT +L +QLN KE +ER NS+C ETVDEN MEV ENK AE  +E   CFSE E+NQNSTGSSG+SVL
Subjt:  PQRKRNRRGAYTAKRIGVPELRSDSEWCAKQNLDTESLGLQLNSKE-SERSNSNCTETVDENIMEVLENKVAEKLTEENGCFSEPEKNQNSTGSSGISVL

Query:  SEMTNDLVDYNPSILTDTTLFASTTVDDTEELKGKSAADRDLDDSNSFSLAHSCLELRTVDSEGVDSYSVDEYTAKSDGVCNPTQGRRKKNSKTSNNSHD
        SEMTND+ +YNPS L DTTL AS T DD  E KG + AD+DLD SNSFSL  SCLELRT DSEGVDSYSVDE+T KS  VC P QGRRKKNSK SN S D
Subjt:  SEMTNDLVDYNPSILTDTTLFASTTVDDTEELKGKSAADRDLDDSNSFSLAHSCLELRTVDSEGVDSYSVDEYTAKSDGVCNPTQGRRKKNSKTSNNSHD

Query:  NLLIPRQQIVQETLGTKKPLHDNQSKKRKHSNTGPSTLKTSEAVEEVDDCTLVGFLQKRLKRTAMTHNETVDCSSNAPLKVDNDDNEPTIASFL-NKLKR
        + L+  QQ   E  GT +    NQ KKRKHS+T  S L T EAVEEVDDCTL+GFLQKRLKRT  TH + VD SS+   +VDNDDN+PT+A  L  KLKR
Subjt:  NLLIPRQQIVQETLGTKKPLHDNQSKKRKHSNTGPSTLKTSEAVEEVDDCTLVGFLQKRLKRTAMTHNETVDCSSNAPLKVDNDDNEPTIASFL-NKLKR

Query:  KKH
        KKH
Subjt:  KKH

SwissProt top hitse value%identityAlignment
P91868 snRNA-activating protein complex subunit 4 homolog3.7e-3131.95Show/hide
Query:  VNWDKLASMYLQG-RSGAECEARWLNFEDPLINRDPWTTSEDKSLLFTIQQKGLNNWIEMAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDRLRS
        V W  +A+   +G R+    +++W N  +P  N++ W+  E + L +  +     +W  +A++LGTNRT +QC+ +Y+  ++     +EW++DED +L +
Subjt:  VNWDKLASMYLQG-RSGAECEARWLNFEDPLINRDPWTTSEDKSLLFTIQQKGLNNWIEMAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDRLRS

Query:  AVATFGVR---DWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCE-WT
              +     W  VA  + GR   Q   R+  +LD A  + G +T  ED+ L  AV  +G K+W K A+ +  RN  QCRERW N L+ S    E +T
Subjt:  AVATFGVR---DWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCE-WT

Query:  EEEDLRLEIAIQEHGY-SWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAALISNFVD
          ED +L  A++  G  +WAK    +P +T  + RRR+ +L          A K++ AA   N VD
Subjt:  EEEDLRLEIAIQEHGY-SWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAALISNFVD

Q08759 Transcriptional activator Myb1.9e-3037.04Show/hide
Query:  LSRYQRSLNASILKREWTKDEDDRLRSAVATFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLKIAVLLFGPKNWNKKAEFLPGR
        LS+ +R L     K  WT++ED++L+  V   G  +W+ +AS L  R   QC +RW+K L+P    KG +T +ED R+   V  +GPK W+  A+ L GR
Subjt:  LSRYQRSLNASILKREWTKDEDDRLRSAVATFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLKIAVLLFGPKNWNKKAEFLPGR

Query:  NQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLF---PDEVPLLQEARKIQKAALISNF
           QCRERW N L+P +++  WTEEED  +  A +  G  WA++A  +P RTDN  +  W        ++   LQ + K  +  +++NF
Subjt:  NQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLF---PDEVPLLQEARKIQKAALISNF

Q54NA6 Myb-like protein L8.3e-5536.75Show/hide
Query:  KKWSIVERENLGKGIRQQ-FQEMVLQISVDQISGP---------QGISGDSDDLDNILASIKDLDISPDKIREFLPKVNWDKLASMYLQGRSGAECEARW
        ++W+  E E L KGI+++  Q+ + ++S D++S           Q  S ++++ +N + +  + +   DK     P +       +    RS  E   RW
Subjt:  KKWSIVERENLGKGIRQQ-FQEMVLQISVDQISGP---------QGISGDSDDLDNILASIKDLDISPDKIREFLPKVNWDKLASMYLQGRSGAECEARW

Query:  LNFEDPLINRDPWTTSEDKSLLFTIQQKGLNNWIEMAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDRLRSAVA--TFGVR-DWQAVASTLEGRA
         N +DP IN+ P+T  EDK LL   ++   + W ++++ LGTNRTP  C+ RYQRSLN+ ++KREWTK+ED+ L   +     G R DWQ +   + GR 
Subjt:  LNFEDPLINRDPWTTSEDKSLLFTIQQKGLNNWIEMAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDRLRSAVA--TFGVR-DWQAVASTLEGRA

Query:  GTQCSNRWKKSLDPARTRKGYFTPDEDIRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGY-SWAKVAAC
        G QC +RW K+LDP+  +KG ++P+ED  L  AV  +G  NW      + GR  VQCRER+ N LDP L +  WT +ED RL     + G   W+ VA  
Subjt:  GTQCSNRWKKSLDPARTRKGYFTPDEDIRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGY-SWAKVAAC

Query:  VPSRTDNECRRRWKKLFPDEVPLLQEARKI--QKAALISNFVDRETERPAL
        + +RTDN+C RRWK+L      L     K+  +K   +SNF  R+ ER  L
Subjt:  VPSRTDNECRRRWKKLFPDEVPLLQEARKI--QKAALISNFVDRETERPAL

Q5SXM2 snRNA-activating protein complex subunit 41.1e-3827.33Show/hide
Query:  IKKNRSQQKFIRSKMIHFEARIEENKKLRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPDENSHV-ACYRMALA
        ++ N   Q+ I+ K+      + +N++ ++  ++++D  GS  + T     + + P   +    KP  KD      +++G+  GP  N          + 
Subjt:  IKKNRSQQKFIRSKMIHFEARIEENKKLRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPDENSHV-ACYRMALA

Query:  KFPSVDRKKWSIVERENLGKGIRQQFQEMVLQISVDQISG-PQGISGDSDDLD---------NILASIKDLDISPDK--IREFLPKVNWDKLASMYLQG-
         F  +   KW   E+  L K +     + +LQ  + ++    Q  S  S +L+              I+D++  P++  +   L   +W+K++++  +G 
Subjt:  KFPSVDRKKWSIVERENLGKGIRQQFQEMVLQISVDQISG-PQGISGDSDDLD---------NILASIKDLDISPDK--IREFLPKVNWDKLASMYLQG-

Query:  RSGAECEARWLNFEDPLINRDPWTTSEDKSLLFTIQQKGLNNWIEMAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDRLRSAVATFGVRD---WQ
        RS  E    W N E P IN+  W+  E++ L       G   W ++A  LGT+R+ FQCL ++Q+  N ++ ++EWT++ED  L   V    V     ++
Subjt:  RSGAECEARWLNFEDPLINRDPWTTSEDKSLLFTIQQKGLNNWIEMAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDRLRSAVATFGVRD---WQ

Query:  AVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHG
         +   +EGR   Q   RW KSLDP   +KGY+ P+ED +L  AV  +G ++W K  E +PGR+  QCR+R+   L  SL++  W  +E+ +L   I+++G
Subjt:  AVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHG

Query:  YS-WAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARK
           WAK+A+ +P R+ ++C  +WK +   +  L +  R+
Subjt:  YS-WAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARK

Q8BP86 snRNA-activating protein complex subunit 44.6e-3732.82Show/hide
Query:  IKDLDISPDK--IREFLPKVNWDKLASMYLQG-RSGAECEARWLNFEDPLINRDPWTTSEDKSLLFTIQQKGLNNWIEMAVSLGTNRTPFQCLSRYQRSL
        I+D++  P++  +   L   +W+K++++  +G RS  E    W + E P I++  W+T E + L       G   W  +A  LGT+R+ FQCL ++Q+  
Subjt:  IKDLDISPDK--IREFLPKVNWDKLASMYLQG-RSGAECEARWLNFEDPLINRDPWTTSEDKSLLFTIQQKGLNNWIEMAVSLGTNRTPFQCLSRYQRSL

Query:  NASILKREWTKDEDDRLRSAVATFGVRD---WQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLKIAVLLFGPKNWNKKAEFLPGRNQVQC
        N ++ ++EWT++ED  L   V    V +   ++ +   +EGR   Q   RW KSLDP+  ++G++ P+ED +L  AV  +G ++W K  E +PGR+  QC
Subjt:  NASILKREWTKDEDDRLRSAVATFGVRD---WQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLKIAVLLFGPKNWNKKAEFLPGRNQVQC

Query:  RERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYS-WAKVAACVPSRTDNECRRRWKKL
        R+R+   L  SL++  W  +E+ +L   I+++G   WA++A+ +P R+ ++C  +WK L
Subjt:  RERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYS-WAKVAACVPSRTDNECRRRWKKL

Arabidopsis top hitse value%identityAlignment
AT3G09370.1 myb domain protein 3r-33.6e-2941.5Show/hide
Query:  KREWTKDEDDRLRSAVATFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCL
        K  WT +ED+ LR AV TF  + W+ +A +   R   QC +RW+K L+P    KG +T +ED ++   V  +GP  W+  A+ LPGR   QCRERW N L
Subjt:  KREWTKDEDDRLRSAVATFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCL

Query:  DPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRW
        +P + +  WT EE++ L  A + HG  WA++A  +P RTDN  +  W
Subjt:  DPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRW

AT3G09370.2 myb domain protein 3r-33.6e-2941.5Show/hide
Query:  KREWTKDEDDRLRSAVATFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCL
        K  WT +ED+ LR AV TF  + W+ +A +   R   QC +RW+K L+P    KG +T +ED ++   V  +GP  W+  A+ LPGR   QCRERW N L
Subjt:  KREWTKDEDDRLRSAVATFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCL

Query:  DPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRW
        +P + +  WT EE++ L  A + HG  WA++A  +P RTDN  +  W
Subjt:  DPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRW

AT3G18100.1 myb domain protein 4r11.4e-16344.37Show/hide
Query:  RNHVDEIDVEHPADQEDGVVDEDMEVLQRAYRLAGVNPEDYINP--RLSSPAAGDADPGSDSDDVDDFELLRDIQNR----------------FSILADE
        RN + E D +   D+ED  + ED+E L+RA  ++ VN + + +    +     G  +  SDS++ DDFE+LR I+++                 S+L+D 
Subjt:  RNHVDEIDVEHPADQEDGVVDEDMEVLQRAYRLAGVNPEDYINP--RLSSPAAGDADPGSDSDDVDDFELLRDIQNR----------------FSILADE

Query:  QPQS------------------TLPPV--SADEEEDEFEMLRSIQRRFAAY-----ESDTLSNKPNQSRDYVGSLKLDSDDIAVESQTSSK---------
        + +                   +LPP+  S DEE+D FE LR+I+RRF+AY     E   +++   + +    S    S +I   S T            
Subjt:  QPQS------------------TLPPV--SADEEEDEFEMLRSIQRRFAAY-----ESDTLSNKPNQSRDYVGSLKLDSDDIAVESQTSSK---------

Query:  --------RQSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHFEARIEENKKLRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQAKD
                +   +     S P+AA AFVDAI++NR+ QKF+R K+   EA IE+N+K +K  +I+KDFQ SCKR T  AL Q  DPRV+LIS  K    D
Subjt:  --------RQSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHFEARIEENKKLRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQAKD

Query:  SSK----------KDKRLSGMYYGPDENSHVACYRMALAKFP-SVDRKKWSIVERENLGKGIRQQFQEMVLQISVDQISGPQGISGDSDDLDNILASIKD
        SS+           DK++S +  GP EN  V  YRMAL K+P SV R+KWS  E +NL KG++Q+ Q+++L  ++++ S    + G + D+D I  SI +
Subjt:  SSK----------KDKRLSGMYYGPDENSHVACYRMALAKFP-SVDRKKWSIVERENLGKGIRQQFQEMVLQISVDQISGPQGISGDSDDLDNILASIKD

Query:  LDISPDKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKSLLFTIQQKGLNNWIEMAVSLGTNRTPFQCLSRYQRSLNASILK
        L+I+P+ IR+FLPK+NWD   S+ ++ RS AECEARW++ EDPLIN  PWT +EDK+LL TI+Q  L +W+++AVSLGTNRTPFQCL+RYQRSLN SILK
Subjt:  LDISPDKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKSLLFTIQQKGLNNWIEMAVSLGTNRTPFQCLSRYQRSLNASILK

Query:  REWTKDEDDRLRSAVATFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLD
        +EWT +EDD+LR+AV  FG +DWQ+VA+ L+GR GTQCSNRWKKSL P  TRKG ++ +ED R+K+AV LFG +NW+K ++F+PGR Q QCRERW NCLD
Subjt:  REWTKDEDDRLRSAVATFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLD

Query:  PSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAALISNFVDRETERPALGPADFRPRPNTDSLCNTD
        P + R +WTEEED +L  AI EHGYSW+KVA  +  RTDN+C RRWK+L+P +V LLQEAR++QK A + NFVDRE+ERPAL  +     P+       D
Subjt:  PSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAALISNFVDRETERPALGPADFRPRPNTDSLCNTD

Query:  GPIPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQ
              KR  K +K    R         PK+R+   +    D   Q
Subjt:  GPIPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQ

AT3G18100.2 myb domain protein 4r13.3e-15252.65Show/hide
Query:  SLPKAALAFVDAIKKNRSQQKFIRSKMIHFEARIEENKKLRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQAKDSSK----------KDKRL
        S P+AA AFVDAI++NR+ QKF+R K+   EA IE+N+K +K  +I+KDFQ SCKR T  AL Q  DPRV+LIS  K    DSS+           DK++
Subjt:  SLPKAALAFVDAIKKNRSQQKFIRSKMIHFEARIEENKKLRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQAKDSSK----------KDKRL

Query:  SGMYYGPDENSHVACYRMALAKFP-SVDRKKWSIVERENLGKGIRQQFQEMVLQISVDQISGPQGISGDSDDLDNILASIKDLDISPDKIREFLPKVNWD
        S +  GP EN  V  YRMAL K+P SV R+KWS  E +NL KG++Q+ Q+++L  ++++ S    + G + D+D I  SI +L+I+P+ IR+FLPK+NWD
Subjt:  SGMYYGPDENSHVACYRMALAKFP-SVDRKKWSIVERENLGKGIRQQFQEMVLQISVDQISGPQGISGDSDDLDNILASIKDLDISPDKIREFLPKVNWD

Query:  KLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKSLLFTIQQKGLNNWIEMAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDRLRSAVATF
           S+ ++ RS AECEARW++ EDPLIN  PWT +EDK+LL TI+Q  L +W+++AVSLGTNRTPFQCL+RYQRSLN SILK+EWT +EDD+LR+AV  F
Subjt:  KLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKSLLFTIQQKGLNNWIEMAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDRLRSAVATF

Query:  GVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEI
        G +DWQ+VA+ L+GR GTQCSNRWKKSL P  TRKG ++ +ED R+K+AV LFG +NW+K ++F+PGR Q QCRERW NCLDP + R +WTEEED +L  
Subjt:  GVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEI

Query:  AIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAALISNFVDRETERPALGPADFRPRPNTDSLCNTDGPIPAPKRNVKTRKMPVS
        AI EHGYSW+KVA  +  RTDN+C RRWK+L+P +V LLQEAR++QK A + NFVDRE+ERPAL  +     P+       D      KR  K +K    
Subjt:  AIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAALISNFVDRETERPALGPADFRPRPNTDSLCNTDGPIPAPKRNVKTRKMPVS

Query:  RNEKSATGDAPKKRKSNYQRFQTDATAQ
        R         PK+R+   +    D   Q
Subjt:  RNEKSATGDAPKKRKSNYQRFQTDATAQ

AT3G18100.3 myb domain protein 4r11.1e-14242.96Show/hide
Query:  RNHVDEIDVEHPADQEDGVVDEDMEVLQRAYRLAGVNPEDYINP--RLSSPAAGDADPGSDSDDVDDFELLRDIQNRFSILAD----EQPQSTLPPVSAD
        RN + E D +   D+ED  + ED+E L+RA  ++ VN + + +    +     G  +  SDS++ DDFE+LR I+++ +   D      P   L  +S  
Subjt:  RNHVDEIDVEHPADQEDGVVDEDMEVLQRAYRLAGVNPEDYINP--RLSSPAAGDADPGSDSDDVDDFELLRDIQNRFSILAD----EQPQSTLPPVSAD

Query:  EEEDEFEMLRSIQRRFAAYESDTLSNKPNQSRDYVGSLKLDSDDIAVESQTSSKR-----QSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHFE
        E ED+FEM+RSI+ +        LS   + S   +G L  D +D A E+  + +R     ++   F   S  K      + +   ++Q+ +   KM+  +
Subjt:  EEEDEFEMLRSIQRRFAAYESDTLSNKPNQSRDYVGSLKLDSDDIAVESQTSSKR-----QSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHFE

Query:  ARIEENKKLRKRC-----------KILKDFQGSCKRRTSCALS--QMIDPRVQLISAAKPQAKDSSK----------KDKRLSGMYYGPDENSHVACYRM
         R++ + KL +R              L++     K+R+S   +  +M DPRV+LIS  K    DSS+           DK++S +  GP EN  V  YRM
Subjt:  ARIEENKKLRKRC-----------KILKDFQGSCKRRTSCALS--QMIDPRVQLISAAKPQAKDSSK----------KDKRLSGMYYGPDENSHVACYRM

Query:  ALAKFP-SVDRKKWSIVERENLGKGIRQQFQEMVLQISVDQISGPQGISGDSDDLDNILASIKDLDISPDKIREFLPKVNWDKLASMYLQGRSGAECEAR
        AL K+P SV R+KWS  E +NL KG++Q+ Q+++L  ++++ S    + G + D+D I  SI +L+I+P+ IR+FLPK+NWD   S+ ++ RS AECEAR
Subjt:  ALAKFP-SVDRKKWSIVERENLGKGIRQQFQEMVLQISVDQISGPQGISGDSDDLDNILASIKDLDISPDKIREFLPKVNWDKLASMYLQGRSGAECEAR

Query:  WLNFEDPLINRDPWTTSEDKSLLFTIQQKGLNNWIEMAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDRLRSAVATFGVRDWQAVASTLEGRAGT
        W++ EDPLIN  PWT +EDK+LL TI+Q  L +W+++AVSLGTNRTPFQCL+RYQRSLN SILK+EWT +EDD+LR+AV  FG +DWQ+VA+ L+GR GT
Subjt:  WLNFEDPLINRDPWTTSEDKSLLFTIQQKGLNNWIEMAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDRLRSAVATFGVRDWQAVASTLEGRAGT

Query:  QCSNRWKKSLDPARTRKGYFTPDEDIRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPS
        QCSNRWKKSL P  TRKG ++ +ED R+K+AV LFG +NW+K ++F+PGR Q QCRERW NCLDP + R +WTEEED +L  AI EHGYSW+KVA  +  
Subjt:  QCSNRWKKSLDPARTRKGYFTPDEDIRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPS

Query:  RTDNECRRRWKKLFPDEVPLLQEARKIQKAALISNFVDRETERPALGPADFRPRPNTDSLCNTDGPIPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNY
        RTDN+C RRWK+L+P +V LLQEAR++QK A + NFVDRE+ERPAL  +     P+       D      KR  K +K    R         PK+R+   
Subjt:  RTDNECRRRWKKLFPDEVPLLQEARKIQKAALISNFVDRETERPALGPADFRPRPNTDSLCNTDGPIPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNY

Query:  QRFQTDATAQ
        +    D   Q
Subjt:  QRFQTDATAQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTCTCCGCAACCATGTTGATGAAATTGACGTTGAGCATCCTGCTGACCAGGAAGATGGTGTGGTTGATGAGGACATGGAAGTTCTTCAGAGAGCCTATAGGCTTGC
TGGTGTTAATCCTGAGGATTATATTAATCCCAGGTTGTCATCACCTGCTGCCGGAGATGCTGATCCTGGTTCTGATTCTGATGATGTTGATGATTTCGAACTTCTTCGAG
ATATTCAGAATCGGTTCTCGATTTTGGCTGATGAGCAGCCGCAGAGTACTCTCCCACCAGTGTCTGCAGACGAGGAGGAAGATGAATTTGAGATGCTTCGTTCAATACAG
CGGCGCTTTGCAGCGTATGAAAGTGATACTTTGAGCAATAAACCCAATCAGTCTCGTGACTATGTTGGATCTCTGAAGTTGGATTCTGACGACATAGCTGTTGAGAGCCA
GACATCCTCAAAACGGCAATCCATGCTAGCCTTTGAAAAGGGAAGCTTGCCAAAGGCTGCGTTGGCATTTGTTGATGCCATCAAGAAGAATAGGTCCCAGCAGAAGTTTA
TTCGTAGTAAGATGATTCATTTTGAAGCTAGAATTGAGGAGAACAAAAAGCTCAGGAAACGTTGCAAAATTCTCAAAGATTTCCAGGGTTCATGTAAGCGAAGAACAAGT
TGTGCACTGTCTCAAATGATAGATCCTCGAGTCCAGTTAATTTCAGCTGCAAAACCACAAGCAAAGGATTCATCAAAGAAGGACAAACGACTATCTGGAATGTATTATGG
CCCGGATGAGAATTCTCATGTTGCGTGTTACAGAATGGCATTGGCAAAGTTCCCCTCTGTTGATCGAAAGAAATGGTCCATTGTAGAAAGGGAGAATCTAGGGAAGGGAA
TAAGACAGCAGTTTCAGGAGATGGTGCTTCAGATTTCAGTGGATCAAATCAGTGGGCCACAAGGAATTTCTGGAGATTCAGATGATTTGGATAACATTCTTGCATCAATA
AAAGACCTTGACATCTCTCCTGACAAGATTAGGGAATTTCTGCCAAAAGTTAATTGGGACAAATTGGCTTCCATGTATCTTCAGGGTCGCTCAGGTGCAGAATGTGAAGC
AAGATGGTTGAATTTTGAAGACCCCCTAATTAATCGAGATCCATGGACTACAAGTGAGGATAAAAGTCTTTTGTTTACTATCCAACAGAAGGGGTTGAATAACTGGATTG
AGATGGCAGTTTCATTGGGCACAAACAGAACTCCTTTTCAGTGCCTGTCTCGGTATCAAAGGAGTTTAAATGCTTCCATATTAAAGAGGGAGTGGACCAAAGATGAGGAT
GATAGACTTCGATCTGCTGTTGCTACATTTGGCGTGAGAGATTGGCAGGCAGTAGCTTCTACTTTGGAAGGACGAGCAGGCACACAGTGCTCTAATAGGTGGAAAAAATC
CCTTGACCCAGCTAGGACAAGAAAAGGTTATTTCACTCCAGATGAAGACATTCGCTTGAAAATTGCTGTACTGCTCTTTGGGCCCAAAAATTGGAACAAGAAAGCAGAGT
TTTTGCCTGGTCGGAATCAAGTTCAATGCAGAGAAAGATGGTTTAATTGTTTAGATCCTTCCTTGAGAAGATGTGAGTGGACAGAAGAGGAGGATTTAAGGCTGGAGATA
GCAATTCAGGAACATGGATATAGCTGGGCTAAGGTAGCTGCATGTGTGCCATCACGTACAGATAATGAGTGTCGGAGGAGATGGAAGAAGTTATTTCCCGATGAAGTTCC
TTTGCTCCAGGAAGCTAGAAAGATTCAGAAGGCTGCTCTTATTAGCAACTTTGTTGATAGGGAAACAGAGCGTCCTGCTCTTGGTCCTGCTGACTTTCGACCCAGGCCAA
ATACAGATTCATTATGTAACACTGATGGTCCAATACCTGCCCCAAAAAGAAATGTGAAAACAAGAAAGATGCCAGTGTCAAGGAATGAAAAGAGTGCTACTGGTGATGCT
CCAAAGAAGAGGAAATCAAATTATCAGAGGTTTCAGACTGATGCAACTGCTCAGGTGGGTATTGCATATAATACCTCTTTTGTCCCAGAAGAGGTTCAATCTTCAAAGCC
TCAAAGAAAACGAAATAGACGTGGAGCTTATACTGCAAAGAGAATAGGGGTACCGGAACTACGTTCTGACAGCGAGTGGTGTGCTAAACAGAATTTGGACACTGAGAGCC
TTGGGTTGCAGCTGAATAGTAAGGAATCTGAGAGGAGCAACAGTAACTGTACCGAGACTGTTGATGAGAATATTATGGAAGTTCTTGAGAACAAAGTTGCAGAGAAGCTT
ACTGAAGAAAATGGATGCTTCTCAGAACCAGAAAAAAATCAAAACTCAACCGGATCTTCTGGAATCTCAGTATTGTCAGAAATGACTAACGACCTGGTTGACTATAATCC
CTCTATCCTCACAGATACAACATTGTTTGCTAGTACTACTGTGGATGATACTGAAGAATTGAAGGGGAAGAGTGCTGCAGACAGGGATTTGGATGACAGTAATAGTTTCT
CGTTAGCACACAGTTGCTTAGAACTCCGGACAGTTGACAGTGAAGGCGTAGACAGTTATTCTGTGGACGAATATACAGCTAAGAGCGATGGGGTTTGTAATCCCACCCAA
GGCAGAAGGAAGAAAAATAGCAAAACATCAAATAACAGCCATGATAATTTGCTTATTCCTCGTCAACAAATTGTGCAAGAGACATTGGGAACGAAGAAGCCTCTTCATGA
TAATCAATCAAAGAAGAGAAAACATAGCAATACAGGTCCAAGTACGTTAAAAACATCGGAGGCAGTTGAGGAGGTTGATGACTGCACTCTCGTGGGGTTCTTGCAAAAAA
GATTGAAGAGGACGGCAATGACACATAACGAGACAGTTGATTGCAGTTCTAATGCTCCTTTAAAAGTTGATAATGACGATAATGAGCCTACCATTGCCTCGTTTCTTAAT
AAATTGAAGAGAAAAAAGCATCAGCGGCCTAGTGGTGATGAGTTAAATTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCTCTCCGCAACCATGTTGATGAAATTGACGTTGAGCATCCTGCTGACCAGGAAGATGGTGTGGTTGATGAGGACATGGAAGTTCTTCAGAGAGCCTATAGGCTTGC
TGGTGTTAATCCTGAGGATTATATTAATCCCAGGTTGTCATCACCTGCTGCCGGAGATGCTGATCCTGGTTCTGATTCTGATGATGTTGATGATTTCGAACTTCTTCGAG
ATATTCAGAATCGGTTCTCGATTTTGGCTGATGAGCAGCCGCAGAGTACTCTCCCACCAGTGTCTGCAGACGAGGAGGAAGATGAATTTGAGATGCTTCGTTCAATACAG
CGGCGCTTTGCAGCGTATGAAAGTGATACTTTGAGCAATAAACCCAATCAGTCTCGTGACTATGTTGGATCTCTGAAGTTGGATTCTGACGACATAGCTGTTGAGAGCCA
GACATCCTCAAAACGGCAATCCATGCTAGCCTTTGAAAAGGGAAGCTTGCCAAAGGCTGCGTTGGCATTTGTTGATGCCATCAAGAAGAATAGGTCCCAGCAGAAGTTTA
TTCGTAGTAAGATGATTCATTTTGAAGCTAGAATTGAGGAGAACAAAAAGCTCAGGAAACGTTGCAAAATTCTCAAAGATTTCCAGGGTTCATGTAAGCGAAGAACAAGT
TGTGCACTGTCTCAAATGATAGATCCTCGAGTCCAGTTAATTTCAGCTGCAAAACCACAAGCAAAGGATTCATCAAAGAAGGACAAACGACTATCTGGAATGTATTATGG
CCCGGATGAGAATTCTCATGTTGCGTGTTACAGAATGGCATTGGCAAAGTTCCCCTCTGTTGATCGAAAGAAATGGTCCATTGTAGAAAGGGAGAATCTAGGGAAGGGAA
TAAGACAGCAGTTTCAGGAGATGGTGCTTCAGATTTCAGTGGATCAAATCAGTGGGCCACAAGGAATTTCTGGAGATTCAGATGATTTGGATAACATTCTTGCATCAATA
AAAGACCTTGACATCTCTCCTGACAAGATTAGGGAATTTCTGCCAAAAGTTAATTGGGACAAATTGGCTTCCATGTATCTTCAGGGTCGCTCAGGTGCAGAATGTGAAGC
AAGATGGTTGAATTTTGAAGACCCCCTAATTAATCGAGATCCATGGACTACAAGTGAGGATAAAAGTCTTTTGTTTACTATCCAACAGAAGGGGTTGAATAACTGGATTG
AGATGGCAGTTTCATTGGGCACAAACAGAACTCCTTTTCAGTGCCTGTCTCGGTATCAAAGGAGTTTAAATGCTTCCATATTAAAGAGGGAGTGGACCAAAGATGAGGAT
GATAGACTTCGATCTGCTGTTGCTACATTTGGCGTGAGAGATTGGCAGGCAGTAGCTTCTACTTTGGAAGGACGAGCAGGCACACAGTGCTCTAATAGGTGGAAAAAATC
CCTTGACCCAGCTAGGACAAGAAAAGGTTATTTCACTCCAGATGAAGACATTCGCTTGAAAATTGCTGTACTGCTCTTTGGGCCCAAAAATTGGAACAAGAAAGCAGAGT
TTTTGCCTGGTCGGAATCAAGTTCAATGCAGAGAAAGATGGTTTAATTGTTTAGATCCTTCCTTGAGAAGATGTGAGTGGACAGAAGAGGAGGATTTAAGGCTGGAGATA
GCAATTCAGGAACATGGATATAGCTGGGCTAAGGTAGCTGCATGTGTGCCATCACGTACAGATAATGAGTGTCGGAGGAGATGGAAGAAGTTATTTCCCGATGAAGTTCC
TTTGCTCCAGGAAGCTAGAAAGATTCAGAAGGCTGCTCTTATTAGCAACTTTGTTGATAGGGAAACAGAGCGTCCTGCTCTTGGTCCTGCTGACTTTCGACCCAGGCCAA
ATACAGATTCATTATGTAACACTGATGGTCCAATACCTGCCCCAAAAAGAAATGTGAAAACAAGAAAGATGCCAGTGTCAAGGAATGAAAAGAGTGCTACTGGTGATGCT
CCAAAGAAGAGGAAATCAAATTATCAGAGGTTTCAGACTGATGCAACTGCTCAGGTGGGTATTGCATATAATACCTCTTTTGTCCCAGAAGAGGTTCAATCTTCAAAGCC
TCAAAGAAAACGAAATAGACGTGGAGCTTATACTGCAAAGAGAATAGGGGTACCGGAACTACGTTCTGACAGCGAGTGGTGTGCTAAACAGAATTTGGACACTGAGAGCC
TTGGGTTGCAGCTGAATAGTAAGGAATCTGAGAGGAGCAACAGTAACTGTACCGAGACTGTTGATGAGAATATTATGGAAGTTCTTGAGAACAAAGTTGCAGAGAAGCTT
ACTGAAGAAAATGGATGCTTCTCAGAACCAGAAAAAAATCAAAACTCAACCGGATCTTCTGGAATCTCAGTATTGTCAGAAATGACTAACGACCTGGTTGACTATAATCC
CTCTATCCTCACAGATACAACATTGTTTGCTAGTACTACTGTGGATGATACTGAAGAATTGAAGGGGAAGAGTGCTGCAGACAGGGATTTGGATGACAGTAATAGTTTCT
CGTTAGCACACAGTTGCTTAGAACTCCGGACAGTTGACAGTGAAGGCGTAGACAGTTATTCTGTGGACGAATATACAGCTAAGAGCGATGGGGTTTGTAATCCCACCCAA
GGCAGAAGGAAGAAAAATAGCAAAACATCAAATAACAGCCATGATAATTTGCTTATTCCTCGTCAACAAATTGTGCAAGAGACATTGGGAACGAAGAAGCCTCTTCATGA
TAATCAATCAAAGAAGAGAAAACATAGCAATACAGGTCCAAGTACGTTAAAAACATCGGAGGCAGTTGAGGAGGTTGATGACTGCACTCTCGTGGGGTTCTTGCAAAAAA
GATTGAAGAGGACGGCAATGACACATAACGAGACAGTTGATTGCAGTTCTAATGCTCCTTTAAAAGTTGATAATGACGATAATGAGCCTACCATTGCCTCGTTTCTTAAT
AAATTGAAGAGAAAAAAGCATCAGCGGCCTAGTGGTGATGAGTTAAATTAA
Protein sequenceShow/hide protein sequence
MSLRNHVDEIDVEHPADQEDGVVDEDMEVLQRAYRLAGVNPEDYINPRLSSPAAGDADPGSDSDDVDDFELLRDIQNRFSILADEQPQSTLPPVSADEEEDEFEMLRSIQ
RRFAAYESDTLSNKPNQSRDYVGSLKLDSDDIAVESQTSSKRQSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHFEARIEENKKLRKRCKILKDFQGSCKRRTS
CALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPDENSHVACYRMALAKFPSVDRKKWSIVERENLGKGIRQQFQEMVLQISVDQISGPQGISGDSDDLDNILASI
KDLDISPDKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKSLLFTIQQKGLNNWIEMAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDED
DRLRSAVATFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEI
AIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAALISNFVDRETERPALGPADFRPRPNTDSLCNTDGPIPAPKRNVKTRKMPVSRNEKSATGDA
PKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKPQRKRNRRGAYTAKRIGVPELRSDSEWCAKQNLDTESLGLQLNSKESERSNSNCTETVDENIMEVLENKVAEKL
TEENGCFSEPEKNQNSTGSSGISVLSEMTNDLVDYNPSILTDTTLFASTTVDDTEELKGKSAADRDLDDSNSFSLAHSCLELRTVDSEGVDSYSVDEYTAKSDGVCNPTQ
GRRKKNSKTSNNSHDNLLIPRQQIVQETLGTKKPLHDNQSKKRKHSNTGPSTLKTSEAVEEVDDCTLVGFLQKRLKRTAMTHNETVDCSSNAPLKVDNDDNEPTIASFLN
KLKRKKHQRPSGDELN