; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G973 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G973
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
DescriptionsnRNA-activating protein complex subunit 4
Genome locationctg1:5795346..5800258
RNA-Seq ExpressionCucsat.G973
SyntenyCucsat.G973
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000978 - RNA polymerase II proximal promoter sequence-specific DNA binding (molecular function)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_011650584.1 uncharacterized protein LOC101216287 [Cucumis sativus]0.099.9Show/hide
Query:  MSLRNHVDEIDVEHPADKEDGVVDEDMEVLQRAYRLAGVNPEDYINPRLSSPAAGDADPGSDSDDVDDFELLRDIQNRFSILADEQPQSTPVSADEEEDE
        MSLRNHVDEIDVEHPADKEDGVVDEDMEVLQRAYRLAGVNPEDYINPRLSSPAAGDADPGSDSDDVDDFELLRDIQNRFSILADEQPQSTPVSADEEEDE
Subjt:  MSLRNHVDEIDVEHPADKEDGVVDEDMEVLQRAYRLAGVNPEDYINPRLSSPAAGDADPGSDSDDVDDFELLRDIQNRFSILADEQPQSTPVSADEEEDE

Query:  FEMLRSIQRRFAAYESDTLSNKPNQSRDYVGSLKLDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKKLR
        FEMLRSIQRRFAAYESDTLSNKPNQSRDYVGSLKLDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKKLR
Subjt:  FEMLRSIQRRFAAYESDTLSNKPNQSRDYVGSLKLDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKKLR

Query:  KRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPDENSHVACYRMALAKFPPVDRKKWSIVERENLGKGIRQQFQEMV
        KRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPDENSHVACYRM LAKFPPVDRKKWSIVERENLGKGIRQQFQEMV
Subjt:  KRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPDENSHVACYRMALAKFPPVDRKKWSIVERENLGKGIRQQFQEMV

Query:  LQISVDQISGPQGISGDSDDLDNILASIKDLDIAPDKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKSLLFTIQQKGLNNW
        LQISVDQISGPQGISGDSDDLDNILASIKDLDIAPDKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKSLLFTIQQKGLNNW
Subjt:  LQISVDQISGPQGISGDSDDLDNILASIKDLDIAPDKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKSLLFTIQQKGLNNW

Query:  IEMAVSLGTNRTPFQCLSRYQRSLNASILKREWTKEEDDRLRSAVATFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLKIAVLL
        IEMAVSLGTNRTPFQCLSRYQRSLNASILKREWTKEEDDRLRSAVATFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLKIAVLL
Subjt:  IEMAVSLGTNRTPFQCLSRYQRSLNASILKREWTKEEDDRLRSAVATFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLKIAVLL

Query:  FGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAALIS
        FGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAALIS
Subjt:  FGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAALIS

Query:  NFVDRETERPALGPADFRPRPNTDSLCNTDGPIPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKPQRKRN
        NFVDRETERPALGPADFRPRPNTDSLCNTDGPIPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKPQRKRN
Subjt:  NFVDRETERPALGPADFRPRPNTDSLCNTDGPIPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKPQRKRN

Query:  RRGAYTAKRIGVPELRSDSEWCAKQNLDTESLGLQLNSKESERSNSNCTETVDENIMEVLENKVAEKLTEENACFSEPEKNQNSTGSSGVSVLSEMTNDL
        RRGAYTAKRIGVPELRSDSEWCAKQNLDTESLGLQLNSKESERSNSNCTETVDENIMEVLENKVAEKLTEENACFSEPEKNQNSTGSSGVSVLSEMTNDL
Subjt:  RRGAYTAKRIGVPELRSDSEWCAKQNLDTESLGLQLNSKESERSNSNCTETVDENIMEVLENKVAEKLTEENACFSEPEKNQNSTGSSGVSVLSEMTNDL

Query:  VDYNPSILTDTTLFASTTVDDIEELKGKSAADRDLDDSNSFSLAHSCLELRTVDSEGVDSYSVDEYTAKSNGVCNPTQGRRKKNSKTSNNSHDNLLIPRQ
        VDYNPSILTDTTLFASTTVDDIEELKGKSAADRDLDDSNSFSLAHSCLELRTVDSEGVDSYSVDEYTAKSNGVCNPTQGRRKKNSKTSNNSHDNLLIPRQ
Subjt:  VDYNPSILTDTTLFASTTVDDIEELKGKSAADRDLDDSNSFSLAHSCLELRTVDSEGVDSYSVDEYTAKSNGVCNPTQGRRKKNSKTSNNSHDNLLIPRQ

Query:  QIVQETLGTKKPLHHNQSKKRKHSNTGPSTLKTSEAVEEVDDCTLVGFLQKRLKRTAMTHNETVDCSSNAPLKVDNDDNEPTIASFLNKLKRKKHQRPSG
        QIVQETLGTKKPLHHNQSKKRKHSNTGPSTLKTSEAVEEVDDCTLVGFLQKRLKRTAMTHNETVDCSSNAPLKVDNDDNEPTIASFLNKLKRKKHQRPSG
Subjt:  QIVQETLGTKKPLHHNQSKKRKHSNTGPSTLKTSEAVEEVDDCTLVGFLQKRLKRTAMTHNETVDCSSNAPLKVDNDDNEPTIASFLNKLKRKKHQRPSG

Query:  DELN
        DELN
Subjt:  DELN

XP_023515735.1 uncharacterized protein LOC111779809 isoform X1 [Cucurbita pepo subsp. pepo]0.078.44Show/hide
Query:  MSLRNHVDEIDVEHPA---DKEDGVVDEDMEVLQRAYRLAGVNPEDYINPRLSSPAAGDADPGSDSDDVDDFELLRDIQNRFSILADEQPQST--PVSAD
        MS R+H D  D E PA   D ED +VD+DME L+RA RLAGVN EDYINPRLS PAAGDA+ GSDSDDVDD ELLR+IQNRFSI ADEQP S   PV+AD
Subjt:  MSLRNHVDEIDVEHPA---DKEDGVVDEDMEVLQRAYRLAGVNPEDYINPRLSSPAAGDADPGSDSDDVDDFELLRDIQNRFSILADEQPQST--PVSAD

Query:  EEEDEFEMLRSIQRRFAAYESDTLSNKPNQSRDYVGSLKLDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEE
        EEED+FEMLRSIQRRFAAYESD LSNKP+QS D  G LK+DS++  VE  TSS+R SM+AFEKGSLPKAALAF+DAIKKNRSQQKFIRSKMIHLEARIEE
Subjt:  EEEDEFEMLRSIQRRFAAYESDTLSNKPNQSRDYVGSLKLDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEE

Query:  NKKLRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPDENSHVACYRMALAKFPPVDRKKWSIVERENLGKGIRQQ
        NKKLRKR K+LK FQGSC+R+T+CAL+QM+DPRVQLISA KPQAKDSSKKDKRLS M YGP ENSHVACYR A  KF PVDRK+WS  ERENLGKGIRQQ
Subjt:  NKKLRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPDENSHVACYRMALAKFPPVDRKKWSIVERENLGKGIRQQ

Query:  FQEMVLQISVDQISGPQGISGDSDDLDNILASIKDLDIAPDKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKSLLFTIQQK
        FQEMVLQISVDQIS  QG S +SDDLDNILASIK LDI P+KIREFLPKVNWDKLASMYL+GRSGAECEARWLNFEDPLINR+PWTTSEDK+LLFTIQQK
Subjt:  FQEMVLQISVDQISGPQGISGDSDDLDNILASIKDLDIAPDKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKSLLFTIQQK

Query:  GLNNWIEMAVSLGTNRTPFQCLSRYQRSLNASILKREWTKEEDDRLRSAVATFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLK
        GLNNWIE+AVSLGTNRTPFQCLSRYQRSLNASILK EWTK+EDD+LRSAVA FG  DWQAVASTLEGR G QCSNRWKKSLDPART++GYFTPDED RLK
Subjt:  GLNNWIEMAVSLGTNRTPFQCLSRYQRSLNASILKREWTKEEDDRLRSAVATFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLK

Query:  IAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQK
        IAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFP++VPLLQEARKIQK
Subjt:  IAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQK

Query:  AALISNFVDRETERPALGPADFRPRPNTDSLCNTDGPIPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKP
         ALISNFVDRE+ERPALGP DFRP PN+  LCNTD P  APKRNV+TR+MPVSRNEKSA GDAPK+RKSN QR + D TAQV    NTS VPE V+S+KP
Subjt:  AALISNFVDRETERPALGPADFRPRPNTDSLCNTDGPIPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKP

Query:  QRKRNRRGAYTAKRIGVPELRSDSEWCAKQNLDTESLGLQLNSKE-SERSNSNCTETVDENIMEVLENKVAEKLTEENACFSEPEKNQNSTGSSGVSVLS
        QRKR R GAYT +R G P++  +SE CA+QN DT S+ +QLN KE +ER NS+C ETVDEN MEV ENK AE  +E   CFSE E+NQNSTGSSGVSVLS
Subjt:  QRKRNRRGAYTAKRIGVPELRSDSEWCAKQNLDTESLGLQLNSKE-SERSNSNCTETVDENIMEVLENKVAEKLTEENACFSEPEKNQNSTGSSGVSVLS

Query:  EMTNDLVDYNPSILTDTTLFASTTVDDIEELKGKSAADRDLDDSNSFSLAHSCLELRTVDSEGVDSYSVDEYTAKSNGVCNPTQGRRKKNSKTSNNSHDN
        EMTND+ +YNPS L DTTL AS T DDI E KG + AD+DLDDSNSFSL  SCLELRT DSEGVDSYSVDE+T KS+GVC P QGRRKKNSK SN S D+
Subjt:  EMTNDLVDYNPSILTDTTLFASTTVDDIEELKGKSAADRDLDDSNSFSLAHSCLELRTVDSEGVDSYSVDEYTAKSNGVCNPTQGRRKKNSKTSNNSHDN

Query:  LLIPRQQIVQETLGTKKPLHHNQSKKRKHSNTGPSTLKTSEAVEEVDDCTLVGFLQKRLKRTAMTHNETVDCSSNAPLKVDNDDNEPTIASFLN-KLKRK
        L +  QQ   E  GT +    NQSKKRKHS T  S L T EAVEEVDDCTL GFLQKRLKRT  TH++ VD SS+ P +VDNDDN+PT+A  LN KLKRK
Subjt:  LLIPRQQIVQETLGTKKPLHHNQSKKRKHSNTGPSTLKTSEAVEEVDDCTLVGFLQKRLKRTAMTHNETVDCSSNAPLKVDNDDNEPTIASFLN-KLKRK

Query:  KH
        KH
Subjt:  KH

XP_023515736.1 uncharacterized protein LOC111779809 isoform X2 [Cucurbita pepo subsp. pepo]0.078.34Show/hide
Query:  MSLRNHVDEIDVEHPA---DKEDGVVDEDMEVLQRAYRLAGVNPEDYINPRLSSPAAGDADPGSDSDDVDDFELLRDIQNRFSILADEQPQST--PVSAD
        MS R+H D  D E PA   D ED +VD+DME L+RA RLAGVN EDYINPRLS PAAGDA+ GSDSDDVDD ELLR+IQNRFSI ADEQP S   PV+AD
Subjt:  MSLRNHVDEIDVEHPA---DKEDGVVDEDMEVLQRAYRLAGVNPEDYINPRLSSPAAGDADPGSDSDDVDDFELLRDIQNRFSILADEQPQST--PVSAD

Query:  EEEDEFEMLRSIQRRFAAYESDTLSNKPNQSRDYVGSLKLDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEE
        EEED+FEMLRSIQRRFAAYESD LSNKP+QS D  G LK+DS++  VE  TSS+R SM+AFEKGSLPKAALAF+DAIKKNRSQQKFIRSKMIHLEARIEE
Subjt:  EEEDEFEMLRSIQRRFAAYESDTLSNKPNQSRDYVGSLKLDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEE

Query:  NKKLRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPDENSHVACYRMALAKFPPVDRKKWSIVERENLGKGIRQQ
        NKKLRKR K+LK FQGSC+R+T+CAL+QM+DPRVQLISA KPQAKDSSKKDKRLS M YGP ENSHVACYR A  KF PVDRK+WS  ERENLGKGIRQQ
Subjt:  NKKLRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPDENSHVACYRMALAKFPPVDRKKWSIVERENLGKGIRQQ

Query:  FQEMVLQISVDQISGPQGISGDSDDLDNILASIKDLDIAPDKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKSLLFTIQQK
        FQEMVLQISVDQIS  QG S +SDDLDNILASIK LDI P+KIREFLPKVNWDKLASMYL+GRSGAECEARWLNFEDPLINR+PWTTSEDK+LLFTIQQK
Subjt:  FQEMVLQISVDQISGPQGISGDSDDLDNILASIKDLDIAPDKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKSLLFTIQQK

Query:  GLNNWIEMAVSLGTNRTPFQCLSRYQRSLNASILKREWTKEEDDRLRSAVATFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLK
        GLNNWIE+AVSLGTNRTPFQCLSRYQRSLNASILK EWTK+EDD+LRSAVA FG  DWQAVASTLEGR G QCSNRWKKSLDPART++GYFTPDED RLK
Subjt:  GLNNWIEMAVSLGTNRTPFQCLSRYQRSLNASILKREWTKEEDDRLRSAVATFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLK

Query:  IAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQK
        IAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRR WKKLFP++VPLLQEARKIQK
Subjt:  IAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQK

Query:  AALISNFVDRETERPALGPADFRPRPNTDSLCNTDGPIPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKP
         ALISNFVDRE+ERPALGP DFRP PN+  LCNTD P  APKRNV+TR+MPVSRNEKSA GDAPK+RKSN QR + D TAQV    NTS VPE V+S+KP
Subjt:  AALISNFVDRETERPALGPADFRPRPNTDSLCNTDGPIPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKP

Query:  QRKRNRRGAYTAKRIGVPELRSDSEWCAKQNLDTESLGLQLNSKE-SERSNSNCTETVDENIMEVLENKVAEKLTEENACFSEPEKNQNSTGSSGVSVLS
        QRKR R GAYT +R G P++  +SE CA+QN DT S+ +QLN KE +ER NS+C ETVDEN MEV ENK AE  +E   CFSE E+NQNSTGSSGVSVLS
Subjt:  QRKRNRRGAYTAKRIGVPELRSDSEWCAKQNLDTESLGLQLNSKE-SERSNSNCTETVDENIMEVLENKVAEKLTEENACFSEPEKNQNSTGSSGVSVLS

Query:  EMTNDLVDYNPSILTDTTLFASTTVDDIEELKGKSAADRDLDDSNSFSLAHSCLELRTVDSEGVDSYSVDEYTAKSNGVCNPTQGRRKKNSKTSNNSHDN
        EMTND+ +YNPS L DTTL AS T DDI E KG + AD+DLDDSNSFSL  SCLELRT DSEGVDSYSVDE+T KS+GVC P QGRRKKNSK SN S D+
Subjt:  EMTNDLVDYNPSILTDTTLFASTTVDDIEELKGKSAADRDLDDSNSFSLAHSCLELRTVDSEGVDSYSVDEYTAKSNGVCNPTQGRRKKNSKTSNNSHDN

Query:  LLIPRQQIVQETLGTKKPLHHNQSKKRKHSNTGPSTLKTSEAVEEVDDCTLVGFLQKRLKRTAMTHNETVDCSSNAPLKVDNDDNEPTIASFLN-KLKRK
        L +  QQ   E  GT +    NQSKKRKHS T  S L T EAVEEVDDCTL GFLQKRLKRT  TH++ VD SS+ P +VDNDDN+PT+A  LN KLKRK
Subjt:  LLIPRQQIVQETLGTKKPLHHNQSKKRKHSNTGPSTLKTSEAVEEVDDCTLVGFLQKRLKRTAMTHNETVDCSSNAPLKVDNDDNEPTIASFLN-KLKRK

Query:  KH
        KH
Subjt:  KH

XP_038905712.1 uncharacterized protein LOC120091681 isoform X1 [Benincasa hispida]0.084.32Show/hide
Query:  MSLRNHVDEIDVEHPADKEDGVVDEDMEVLQRAYRLAGVNPEDYINPRLSSPAAGDADPGSDSDDVDDFELLRDIQNRFSILADEQPQST--PVSADEEE
        MS  NH DE DVE PA+KED VVDEDMEVLQRAYRL GVNPEDYINPRLSSPA GDA+ G DSDD DDFELLR+IQNRFSI+ DEQP ST  PVS DEEE
Subjt:  MSLRNHVDEIDVEHPADKEDGVVDEDMEVLQRAYRLAGVNPEDYINPRLSSPAAGDADPGSDSDDVDDFELLRDIQNRFSILADEQPQST--PVSADEEE

Query:  DEFEMLRSIQRRFAAYESDTLSNKPNQSRDYVGSLKLDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKK
        DEFEMLRSIQRRFAAYESD LSNKPN+SRDYVGSLK+DS + A ESQTSSKRPSM+AFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKK
Subjt:  DEFEMLRSIQRRFAAYESDTLSNKPNQSRDYVGSLKLDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKK

Query:  LRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPDENSHVACYRMALAKFPPVDRKKWSIVERENLGKGIRQQFQE
        LRKRCKILKDFQGSCKR+T+CALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGP ENSHVACYRMALAKFP VDRKKWSIVERENLGKGIRQQFQE
Subjt:  LRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPDENSHVACYRMALAKFPPVDRKKWSIVERENLGKGIRQQFQE

Query:  MVLQISVDQISGPQGISGDSDDLDNILASIKDLDIAPDKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKSLLFTIQQKGLN
        MVLQISVDQISG QG S DSDDLDNILASIKDLDI P+KIREFLPKVNWDKLASMYL GRSGAECEARWLNFEDPLINRDPWTTSEDK+LLFTIQQKGLN
Subjt:  MVLQISVDQISGPQGISGDSDDLDNILASIKDLDIAPDKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKSLLFTIQQKGLN

Query:  NWIEMAVSLGTNRTPFQCLSRYQRSLNASILKREWTKEEDDRLRSAVATFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLKIAV
        NWIE+AVS GTNRTPFQCLSRYQRSLNASILKREWTK+EDD+LRSAVA FGVRDWQAVASTLEGRAGTQCSNRWKKSLDPART++G+FTPDED RLKIAV
Subjt:  NWIEMAVSLGTNRTPFQCLSRYQRSLNASILKREWTKEEDDRLRSAVATFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLKIAV

Query:  LLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAAL
        LL GPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDN+CRRRWKKLFP+EVPLLQEARKIQKAAL
Subjt:  LLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAAL

Query:  ISNFVDRETERPALGPADFRPRPNTDSLCNTDGPIPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKPQRK
        ISNFVDRE+ERPALGPADFRPR NTD LC+TD P PAPKRN KTRKMPVSRNEKSATGDAP+KRKSNYQR Q DATA+VGIA NTS VPEEVQS KP RK
Subjt:  ISNFVDRETERPALGPADFRPRPNTDSLCNTDGPIPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKPQRK

Query:  RNRRGAYTAKRIGVPELRSDSEWCAKQNLDTESLGLQLNSKESERSNSNCTETVDENIMEVLENKVAEKLTEENACFSEPEKNQNSTGSSGVSVLSEMTN
        RNR  A T KR GV EL S+ +WCAKQNL+T S+G+QL+SKE E +NS+ TETVD N +EV ENK+A+KL+E +  FSEPE+NQNSTGSSGVSVLSEMTN
Subjt:  RNRRGAYTAKRIGVPELRSDSEWCAKQNLDTESLGLQLNSKESERSNSNCTETVDENIMEVLENKVAEKLTEENACFSEPEKNQNSTGSSGVSVLSEMTN

Query:  DLVDYNPSILTDTTLFASTTVDDIEELKGKSAADRDLDDSNSFSLAHSCLELRTVDSEGVDSYSVDEYTAKSNGVCNPTQGRRKKNSKTSNNSHDNLLIP
        D+ +YNPSIL DTTL ASTTVDDIEELKGKS ADRDLDDSNSFSL  SCLELRT+D EGVDSYSVD+ T KS+ VC   QGRRKKNSKTS+ +H+   + 
Subjt:  DLVDYNPSILTDTTLFASTTVDDIEELKGKSAADRDLDDSNSFSLAHSCLELRTVDSEGVDSYSVDEYTAKSNGVCNPTQGRRKKNSKTSNNSHDNLLIP

Query:  RQQIVQETLGTKKPLHHNQSKKRKHSNTGPSTLKTSEAVEEVDDCTLVGFLQKRLKRTAMTHNETVDCSSNAPLKVDNDDNEPTIASFLNKLKRKKHQRP
         QQ+ QE LG  +P H NQSKKRKHS+T  S L T EAVEEVD+CTLVGFLQKRLK+        VDCSS  PL+VDNDDN+  IASFLNKLKRKKHQ P
Subjt:  RQQIVQETLGTKKPLHHNQSKKRKHSNTGPSTLKTSEAVEEVDDCTLVGFLQKRLKRTAMTHNETVDCSSNAPLKVDNDDNEPTIASFLNKLKRKKHQRP

Query:  S
        S
Subjt:  S

XP_038905717.1 uncharacterized protein LOC120091681 isoform X2 [Benincasa hispida]0.084.32Show/hide
Query:  MSLRNHVDEIDVEHPADKEDGVVDEDMEVLQRAYRLAGVNPEDYINPRLSSPAAGDADPGSDSDDVDDFELLRDIQNRFSILADEQPQST--PVSADEEE
        MS  NH DE DVE PA+KED VVDEDMEVLQRAYRL GVNPEDYINPRLSSPA GDA+ G DSDD DDFELLR+IQNRFSI+ DEQP ST  PVS DEEE
Subjt:  MSLRNHVDEIDVEHPADKEDGVVDEDMEVLQRAYRLAGVNPEDYINPRLSSPAAGDADPGSDSDDVDDFELLRDIQNRFSILADEQPQST--PVSADEEE

Query:  DEFEMLRSIQRRFAAYESDTLSNKPNQSRDYVGSLKLDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKK
        DEFEMLRSIQRRFAAYESD LSNKPN+SRDYVGSLK+DS + A ESQTSSKRPSM+AFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKK
Subjt:  DEFEMLRSIQRRFAAYESDTLSNKPNQSRDYVGSLKLDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKK

Query:  LRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPDENSHVACYRMALAKFPPVDRKKWSIVERENLGKGIRQQFQE
        LRKRCKILKDFQGSCKR+T+CALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGP ENSHVACYRMALAKFP VDRKKWSIVERENLGKGIRQQFQE
Subjt:  LRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPDENSHVACYRMALAKFPPVDRKKWSIVERENLGKGIRQQFQE

Query:  MVLQISVDQISGPQGISGDSDDLDNILASIKDLDIAPDKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKSLLFTIQQKGLN
        MVLQISVDQISG QG S DSDDLDNILASIKDLDI P+KIREFLPKVNWDKLASMYL GRSGAECEARWLNFEDPLINRDPWTTSEDK+LLFTIQQKGLN
Subjt:  MVLQISVDQISGPQGISGDSDDLDNILASIKDLDIAPDKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKSLLFTIQQKGLN

Query:  NWIEMAVSLGTNRTPFQCLSRYQRSLNASILKREWTKEEDDRLRSAVATFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLKIAV
        NWIE+AVS GTNRTPFQCLSRYQRSLNASILKREWTK+EDD+LRSAVA FGVRDWQAVASTLEGRAGTQCSNRWKKSLDPART++G+FTPDED RLKIAV
Subjt:  NWIEMAVSLGTNRTPFQCLSRYQRSLNASILKREWTKEEDDRLRSAVATFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLKIAV

Query:  LLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAAL
        LL GPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDN+CRRRWKKLFP+EVPLLQEARKIQKAAL
Subjt:  LLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAAL

Query:  ISNFVDRETERPALGPADFRPRPNTDSLCNTDGPIPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKPQRK
        ISNFVDRE+ERPALGPADFRPR NTD LC+TD P PAPKRN KTRKMPVSRNEKSATGDAP+KRKSNYQR Q DATA+VGIA NTS VPEEVQS KP RK
Subjt:  ISNFVDRETERPALGPADFRPRPNTDSLCNTDGPIPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKPQRK

Query:  RNRRGAYTAKRIGVPELRSDSEWCAKQNLDTESLGLQLNSKESERSNSNCTETVDENIMEVLENKVAEKLTEENACFSEPEKNQNSTGSSGVSVLSEMTN
        RNR  A T KR GV EL S+ +WCAKQNL+T S+G+QL+SKE E +NS+ TETVD N +EV ENK+A+KL+E +  FSEPE+NQNSTGSSGVSVLSEMTN
Subjt:  RNRRGAYTAKRIGVPELRSDSEWCAKQNLDTESLGLQLNSKESERSNSNCTETVDENIMEVLENKVAEKLTEENACFSEPEKNQNSTGSSGVSVLSEMTN

Query:  DLVDYNPSILTDTTLFASTTVDDIEELKGKSAADRDLDDSNSFSLAHSCLELRTVDSEGVDSYSVDEYTAKSNGVCNPTQGRRKKNSKTSNNSHDNLLIP
        D+ +YNPSIL DTTL ASTTVDDIEELKGKS ADRDLDDSNSFSL  SCLELRT+D EGVDSYSVD+ T KS+ VC   QGRRKKNSKTS+ +H+   + 
Subjt:  DLVDYNPSILTDTTLFASTTVDDIEELKGKSAADRDLDDSNSFSLAHSCLELRTVDSEGVDSYSVDEYTAKSNGVCNPTQGRRKKNSKTSNNSHDNLLIP

Query:  RQQIVQETLGTKKPLHHNQSKKRKHSNTGPSTLKTSEAVEEVDDCTLVGFLQKRLKRTAMTHNETVDCSSNAPLKVDNDDNEPTIASFLNKLKRKKHQRP
         QQ+ QE LG  +P H NQSKKRKHS+T  S L T EAVEEVD+CTLVGFLQKRLK+        VDCSS  PL+VDNDDN+  IASFLNKLKRKKHQ P
Subjt:  RQQIVQETLGTKKPLHHNQSKKRKHSNTGPSTLKTSEAVEEVDDCTLVGFLQKRLKRTAMTHNETVDCSSNAPLKVDNDDNEPTIASFLNKLKRKKHQRP

Query:  S
        S
Subjt:  S

TrEMBL top hitse value%identityAlignment
A0A0A0L2R2 Uncharacterized protein0.099.9Show/hide
Query:  MSLRNHVDEIDVEHPADKEDGVVDEDMEVLQRAYRLAGVNPEDYINPRLSSPAAGDADPGSDSDDVDDFELLRDIQNRFSILADEQPQSTPVSADEEEDE
        MSLRNHVDEIDVEHPADKEDGVVDEDMEVLQRAYRLAGVNPEDYINPRLSSPAAGDADPGSDSDDVDDFELLRDIQNRFSILADEQPQSTPVSADEEEDE
Subjt:  MSLRNHVDEIDVEHPADKEDGVVDEDMEVLQRAYRLAGVNPEDYINPRLSSPAAGDADPGSDSDDVDDFELLRDIQNRFSILADEQPQSTPVSADEEEDE

Query:  FEMLRSIQRRFAAYESDTLSNKPNQSRDYVGSLKLDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKKLR
        FEMLRSIQRRFAAYESDTLSNKPNQSRDYVGSLKLDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKKLR
Subjt:  FEMLRSIQRRFAAYESDTLSNKPNQSRDYVGSLKLDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKKLR

Query:  KRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPDENSHVACYRMALAKFPPVDRKKWSIVERENLGKGIRQQFQEMV
        KRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPDENSHVACYRM LAKFPPVDRKKWSIVERENLGKGIRQQFQEMV
Subjt:  KRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPDENSHVACYRMALAKFPPVDRKKWSIVERENLGKGIRQQFQEMV

Query:  LQISVDQISGPQGISGDSDDLDNILASIKDLDIAPDKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKSLLFTIQQKGLNNW
        LQISVDQISGPQGISGDSDDLDNILASIKDLDIAPDKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKSLLFTIQQKGLNNW
Subjt:  LQISVDQISGPQGISGDSDDLDNILASIKDLDIAPDKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKSLLFTIQQKGLNNW

Query:  IEMAVSLGTNRTPFQCLSRYQRSLNASILKREWTKEEDDRLRSAVATFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLKIAVLL
        IEMAVSLGTNRTPFQCLSRYQRSLNASILKREWTKEEDDRLRSAVATFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLKIAVLL
Subjt:  IEMAVSLGTNRTPFQCLSRYQRSLNASILKREWTKEEDDRLRSAVATFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLKIAVLL

Query:  FGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAALIS
        FGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAALIS
Subjt:  FGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAALIS

Query:  NFVDRETERPALGPADFRPRPNTDSLCNTDGPIPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKPQRKRN
        NFVDRETERPALGPADFRPRPNTDSLCNTDGPIPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKPQRKRN
Subjt:  NFVDRETERPALGPADFRPRPNTDSLCNTDGPIPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKPQRKRN

Query:  RRGAYTAKRIGVPELRSDSEWCAKQNLDTESLGLQLNSKESERSNSNCTETVDENIMEVLENKVAEKLTEENACFSEPEKNQNSTGSSGVSVLSEMTNDL
        RRGAYTAKRIGVPELRSDSEWCAKQNLDTESLGLQLNSKESERSNSNCTETVDENIMEVLENKVAEKLTEENACFSEPEKNQNSTGSSGVSVLSEMTNDL
Subjt:  RRGAYTAKRIGVPELRSDSEWCAKQNLDTESLGLQLNSKESERSNSNCTETVDENIMEVLENKVAEKLTEENACFSEPEKNQNSTGSSGVSVLSEMTNDL

Query:  VDYNPSILTDTTLFASTTVDDIEELKGKSAADRDLDDSNSFSLAHSCLELRTVDSEGVDSYSVDEYTAKSNGVCNPTQGRRKKNSKTSNNSHDNLLIPRQ
        VDYNPSILTDTTLFASTTVDDIEELKGKSAADRDLDDSNSFSLAHSCLELRTVDSEGVDSYSVDEYTAKSNGVCNPTQGRRKKNSKTSNNSHDNLLIPRQ
Subjt:  VDYNPSILTDTTLFASTTVDDIEELKGKSAADRDLDDSNSFSLAHSCLELRTVDSEGVDSYSVDEYTAKSNGVCNPTQGRRKKNSKTSNNSHDNLLIPRQ

Query:  QIVQETLGTKKPLHHNQSKKRKHSNTGPSTLKTSEAVEEVDDCTLVGFLQKRLKRTAMTHNETVDCSSNAPLKVDNDDNEPTIASFLNKLKRKKHQRPSG
        QIVQETLGTKKPLHHNQSKKRKHSNTGPSTLKTSEAVEEVDDCTLVGFLQKRLKRTAMTHNETVDCSSNAPLKVDNDDNEPTIASFLNKLKRKKHQRPSG
Subjt:  QIVQETLGTKKPLHHNQSKKRKHSNTGPSTLKTSEAVEEVDDCTLVGFLQKRLKRTAMTHNETVDCSSNAPLKVDNDDNEPTIASFLNKLKRKKHQRPSG

Query:  DELN
        DELN
Subjt:  DELN

A0A6J1E2J4 uncharacterized protein LOC111430000 isoform X20.078.04Show/hide
Query:  MSLRNHVDEIDVEHPA---DKEDGVVDEDMEVLQRAYRLAGVNPEDYINPRLSSPAAGDADPGSDSDDVDDFELLRDIQNRFSILADEQPQST--PVSAD
        MS R+HVD  D E PA   D ED +VD+DME L+RA RLAGVN ED INPRLS PAAGDA+ GSDSDDVDD ELLR+IQNRFS  ADEQP S   PV+AD
Subjt:  MSLRNHVDEIDVEHPA---DKEDGVVDEDMEVLQRAYRLAGVNPEDYINPRLSSPAAGDADPGSDSDDVDDFELLRDIQNRFSILADEQPQST--PVSAD

Query:  EEEDEFEMLRSIQRRFAAYESDTLSNKPNQSRDYVGSLKLDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEE
        EEED+FE LRSIQRRFAAYESD LSNKP+QS D  G LK+DSD+  V   TSS+R SM+AFEKGSLPKAALAF+DAIKKNRSQQKFIRSKMIHLEARIEE
Subjt:  EEEDEFEMLRSIQRRFAAYESDTLSNKPNQSRDYVGSLKLDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEE

Query:  NKKLRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPDENSHVACYRMALAKFPPVDRKKWSIVERENLGKGIRQQ
        NKKLRKR K+LK FQGSC+R+T+CAL+QM+DPRVQLISA KPQAKDSSKKDKRLS M YGP ENSHVACYR AL KF PVDRK+WS  ERENLGKGIRQQ
Subjt:  NKKLRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPDENSHVACYRMALAKFPPVDRKKWSIVERENLGKGIRQQ

Query:  FQEMVLQISVDQISGPQGISGDSDDLDNILASIKDLDIAPDKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKSLLFTIQQK
        FQEMVLQISVDQIS  QG S +SDDLDNILASIK LDI P+KIREFLPKVNWDKLA MYLQGRSGAECEARWLNFEDPLINR+ WTTSEDK+LLFTIQQK
Subjt:  FQEMVLQISVDQISGPQGISGDSDDLDNILASIKDLDIAPDKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKSLLFTIQQK

Query:  GLNNWIEMAVSLGTNRTPFQCLSRYQRSLNASILKREWTKEEDDRLRSAVATFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLK
        GLNNWIE+AVSLGTNRTPFQCLSRYQRSLNASILK EWTK+EDD+LRSAVA FG  DWQAVASTLEGR G QCSNRWKKSLDPART++GYFTPDED RLK
Subjt:  GLNNWIEMAVSLGTNRTPFQCLSRYQRSLNASILKREWTKEEDDRLRSAVATFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLK

Query:  IAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQK
        IAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRR WKKLFP++VPLLQEARKIQK
Subjt:  IAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQK

Query:  AALISNFVDRETERPALGPADFRPRPNTDSLCNTDGPIPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKP
         ALISNFVDRE+ERPALGP DFRP PN+  LCNTD P  APKRNV+ R+MPVSRNEKSA GDAPKK KSN QR Q D TAQV  A NTS VPE V+S+KP
Subjt:  AALISNFVDRETERPALGPADFRPRPNTDSLCNTDGPIPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKP

Query:  QRKRNRRGAYTAKRIGVPELRSDSEWCAKQNLDTESLGLQLNSKE-SERSNSNCTETVDENIMEVLENKVAEKLTEENACFSEPEKNQNSTGSSGVSVLS
        QRKR R GAYT +R G P++  +SE CA+QN DT SL +QLN KE +ER NS+C ETVDEN MEV ENK AE  +E   CFSE E+NQNSTGSSGVSVLS
Subjt:  QRKRNRRGAYTAKRIGVPELRSDSEWCAKQNLDTESLGLQLNSKE-SERSNSNCTETVDENIMEVLENKVAEKLTEENACFSEPEKNQNSTGSSGVSVLS

Query:  EMTNDLVDYNPSILTDTTLFASTTVDDIEELKGKSAADRDLDDSNSFSLAHSCLELRTVDSEGVDSYSVDEYTAKSNGVCNPTQGRRKKNSKTSNNSHDN
        EMTND+ +YNPS   DTTL AS T DDI E KG + AD+DLDDSNSFSL  SCLELRT DSEGVDSYSVDE+T KS+GVC P QGRRKKNSK SN S D+
Subjt:  EMTNDLVDYNPSILTDTTLFASTTVDDIEELKGKSAADRDLDDSNSFSLAHSCLELRTVDSEGVDSYSVDEYTAKSNGVCNPTQGRRKKNSKTSNNSHDN

Query:  LLIPRQQIVQETLGTKKPLHHNQSKKRKHSNTGPSTLKTSEAVEEVDDCTLVGFLQKRLKRTAMTHNETVDCSSNAPLKVDNDDNEPTIASFL-NKLKRK
        L +  QQ   E  G  +    NQSKKRKHS T  S L T EAVEEVDDCTL GFLQKRLKRT  TH++ VD SS+ P +VDNDDN+PT+A  L +KLKRK
Subjt:  LLIPRQQIVQETLGTKKPLHHNQSKKRKHSNTGPSTLKTSEAVEEVDDCTLVGFLQKRLKRTAMTHNETVDCSSNAPLKVDNDDNEPTIASFL-NKLKRK

Query:  KH
        KH
Subjt:  KH

A0A6J1E6Z7 uncharacterized protein LOC111430000 isoform X10.078.14Show/hide
Query:  MSLRNHVDEIDVEHPA---DKEDGVVDEDMEVLQRAYRLAGVNPEDYINPRLSSPAAGDADPGSDSDDVDDFELLRDIQNRFSILADEQPQST--PVSAD
        MS R+HVD  D E PA   D ED +VD+DME L+RA RLAGVN ED INPRLS PAAGDA+ GSDSDDVDD ELLR+IQNRFS  ADEQP S   PV+AD
Subjt:  MSLRNHVDEIDVEHPA---DKEDGVVDEDMEVLQRAYRLAGVNPEDYINPRLSSPAAGDADPGSDSDDVDDFELLRDIQNRFSILADEQPQST--PVSAD

Query:  EEEDEFEMLRSIQRRFAAYESDTLSNKPNQSRDYVGSLKLDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEE
        EEED+FE LRSIQRRFAAYESD LSNKP+QS D  G LK+DSD+  V   TSS+R SM+AFEKGSLPKAALAF+DAIKKNRSQQKFIRSKMIHLEARIEE
Subjt:  EEEDEFEMLRSIQRRFAAYESDTLSNKPNQSRDYVGSLKLDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEE

Query:  NKKLRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPDENSHVACYRMALAKFPPVDRKKWSIVERENLGKGIRQQ
        NKKLRKR K+LK FQGSC+R+T+CAL+QM+DPRVQLISA KPQAKDSSKKDKRLS M YGP ENSHVACYR AL KF PVDRK+WS  ERENLGKGIRQQ
Subjt:  NKKLRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPDENSHVACYRMALAKFPPVDRKKWSIVERENLGKGIRQQ

Query:  FQEMVLQISVDQISGPQGISGDSDDLDNILASIKDLDIAPDKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKSLLFTIQQK
        FQEMVLQISVDQIS  QG S +SDDLDNILASIK LDI P+KIREFLPKVNWDKLA MYLQGRSGAECEARWLNFEDPLINR+ WTTSEDK+LLFTIQQK
Subjt:  FQEMVLQISVDQISGPQGISGDSDDLDNILASIKDLDIAPDKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKSLLFTIQQK

Query:  GLNNWIEMAVSLGTNRTPFQCLSRYQRSLNASILKREWTKEEDDRLRSAVATFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLK
        GLNNWIE+AVSLGTNRTPFQCLSRYQRSLNASILK EWTK+EDD+LRSAVA FG  DWQAVASTLEGR G QCSNRWKKSLDPART++GYFTPDED RLK
Subjt:  GLNNWIEMAVSLGTNRTPFQCLSRYQRSLNASILKREWTKEEDDRLRSAVATFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLK

Query:  IAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQK
        IAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFP++VPLLQEARKIQK
Subjt:  IAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQK

Query:  AALISNFVDRETERPALGPADFRPRPNTDSLCNTDGPIPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKP
         ALISNFVDRE+ERPALGP DFRP PN+  LCNTD P  APKRNV+ R+MPVSRNEKSA GDAPKK KSN QR Q D TAQV  A NTS VPE V+S+KP
Subjt:  AALISNFVDRETERPALGPADFRPRPNTDSLCNTDGPIPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKP

Query:  QRKRNRRGAYTAKRIGVPELRSDSEWCAKQNLDTESLGLQLNSKE-SERSNSNCTETVDENIMEVLENKVAEKLTEENACFSEPEKNQNSTGSSGVSVLS
        QRKR R GAYT +R G P++  +SE CA+QN DT SL +QLN KE +ER NS+C ETVDEN MEV ENK AE  +E   CFSE E+NQNSTGSSGVSVLS
Subjt:  QRKRNRRGAYTAKRIGVPELRSDSEWCAKQNLDTESLGLQLNSKE-SERSNSNCTETVDENIMEVLENKVAEKLTEENACFSEPEKNQNSTGSSGVSVLS

Query:  EMTNDLVDYNPSILTDTTLFASTTVDDIEELKGKSAADRDLDDSNSFSLAHSCLELRTVDSEGVDSYSVDEYTAKSNGVCNPTQGRRKKNSKTSNNSHDN
        EMTND+ +YNPS   DTTL AS T DDI E KG + AD+DLDDSNSFSL  SCLELRT DSEGVDSYSVDE+T KS+GVC P QGRRKKNSK SN S D+
Subjt:  EMTNDLVDYNPSILTDTTLFASTTVDDIEELKGKSAADRDLDDSNSFSLAHSCLELRTVDSEGVDSYSVDEYTAKSNGVCNPTQGRRKKNSKTSNNSHDN

Query:  LLIPRQQIVQETLGTKKPLHHNQSKKRKHSNTGPSTLKTSEAVEEVDDCTLVGFLQKRLKRTAMTHNETVDCSSNAPLKVDNDDNEPTIASFL-NKLKRK
        L +  QQ   E  G  +    NQSKKRKHS T  S L T EAVEEVDDCTL GFLQKRLKRT  TH++ VD SS+ P +VDNDDN+PT+A  L +KLKRK
Subjt:  LLIPRQQIVQETLGTKKPLHHNQSKKRKHSNTGPSTLKTSEAVEEVDDCTLVGFLQKRLKRTAMTHNETVDCSSNAPLKVDNDDNEPTIASFL-NKLKRK

Query:  KH
        KH
Subjt:  KH

A0A6J1JK98 uncharacterized protein LOC111485355 isoform X20.077.87Show/hide
Query:  MSLRNHVDEIDVEHPA---DKEDGVVDEDMEVLQRAYRLAGVNPEDYINPRLSSPAAGDADPGSDSDDVDDFELLRDIQNRFSILADEQPQST--PVSAD
        MS R+HVD  D E PA   D ED +VD+DME L+RA RLAGVN EDY+NP+LS PAAGDA+ GSDSDDVDD ELLR+IQNRFSI ADEQP S   PV+AD
Subjt:  MSLRNHVDEIDVEHPA---DKEDGVVDEDMEVLQRAYRLAGVNPEDYINPRLSSPAAGDADPGSDSDDVDDFELLRDIQNRFSILADEQPQST--PVSAD

Query:  EEEDEFEMLRSIQRRFAAYESDTLSNKPNQSRDYVGSLKLDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEE
        EEED+FE LRSIQRRFAAYESD LSNKP+QS D  G LK+DSD+  VE  TSS+R SM+AFEKGSLPKAALAF+DAIKKNRSQQKF+RSKMIHLEARIEE
Subjt:  EEEDEFEMLRSIQRRFAAYESDTLSNKPNQSRDYVGSLKLDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEE

Query:  NKKLRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQ-AKDSSKKDKRLSGMYYGPDENSHVACYRMALAKFPPVDRKKWSIVERENLGKGIRQ
        NKKLRKR K+LK FQGSC+R+T+CALSQM+DPRVQLISA KPQ AKDSSKKDKRLS M YGP ENSHVACYR+AL KF PVDRK+WS  ERENLGKGIRQ
Subjt:  NKKLRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQ-AKDSSKKDKRLSGMYYGPDENSHVACYRMALAKFPPVDRKKWSIVERENLGKGIRQ

Query:  QFQEMVLQISVDQISGPQGISGDSDDLDNILASIKDLDIAPDKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKSLLFTIQQ
        QFQEMVLQISVDQIS  QG S +SDDLDNILASIKDLDI P+KIREFLPKVNWDKLASMYL+GRSGAECEARWLNFEDPLINR+PWTTSEDK+LLFTIQQ
Subjt:  QFQEMVLQISVDQISGPQGISGDSDDLDNILASIKDLDIAPDKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKSLLFTIQQ

Query:  KGLNNWIEMAVSLGTNRTPFQCLSRYQRSLNASILKREWTKEEDDRLRSAVATFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRL
        KGLNNWI++AVSLGTNRTPFQ LSRYQRSLNASILK EWTK+EDD+LRSAVA FG  DWQAVASTLEGR G QCSNRWKKSLDPART++GYFTPDED RL
Subjt:  KGLNNWIEMAVSLGTNRTPFQCLSRYQRSLNASILKREWTKEEDDRLRSAVATFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRL

Query:  KIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQ
        KIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRR WKKLFP++VPLLQEARKIQ
Subjt:  KIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQ

Query:  KAALISNFVDRETERPALGPADFRPRPNTDSLCNTDGPIPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSK
        K ALISNFVDRE+ERPALGP DFRP PN+  LCNTD P  APKRNV+TR+MPVSRNEKSA GDAPKKRKSN QR + D TAQV  A NTS VPE V+S+K
Subjt:  KAALISNFVDRETERPALGPADFRPRPNTDSLCNTDGPIPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSK

Query:  PQRKRNRRGAYTAKRIGVPELRSDSEWCAKQNLDTESLGLQLNSKE-SERSNSNCTETVDENIMEVLENKVAEKLTEENACFSEPEKNQNSTGSSGVSVL
        PQRKR R GAYT +R G P++  +SE CA+QN DT +L +QLN KE +ER NS+C ETVDEN MEV ENK AE  +E   CFSE E+NQNSTGSSGVSVL
Subjt:  PQRKRNRRGAYTAKRIGVPELRSDSEWCAKQNLDTESLGLQLNSKE-SERSNSNCTETVDENIMEVLENKVAEKLTEENACFSEPEKNQNSTGSSGVSVL

Query:  SEMTNDLVDYNPSILTDTTLFASTTVDDIEELKGKSAADRDLDDSNSFSLAHSCLELRTVDSEGVDSYSVDEYTAKSNGVCNPTQGRRKKNSKTSNNSHD
        SEMTND+ +YNPS L DTTL AS T DDI E KG + AD+DLD SNSFSL  SCLELRT DSEGVDSYSVDE+T KS+ VC P QGRRKKNSK SN S D
Subjt:  SEMTNDLVDYNPSILTDTTLFASTTVDDIEELKGKSAADRDLDDSNSFSLAHSCLELRTVDSEGVDSYSVDEYTAKSNGVCNPTQGRRKKNSKTSNNSHD

Query:  NLLIPRQQIVQETLGTKKPLHHNQSKKRKHSNTGPSTLKTSEAVEEVDDCTLVGFLQKRLKRTAMTHNETVDCSSNAPLKVDNDDNEPTIASFLN-KLKR
        +L +  QQ   E  GT +    NQ KKRKHS+T  S L T EAVEEVDDCTL+GFLQKRLKRT  TH + VD SS+   +VDNDDN+PT+A  L  KLKR
Subjt:  NLLIPRQQIVQETLGTKKPLHHNQSKKRKHSNTGPSTLKTSEAVEEVDDCTLVGFLQKRLKRTAMTHNETVDCSSNAPLKVDNDDNEPTIASFLN-KLKR

Query:  KKH
        KKH
Subjt:  KKH

A0A6J1JKV7 uncharacterized protein LOC111485355 isoform X10.077.97Show/hide
Query:  MSLRNHVDEIDVEHPA---DKEDGVVDEDMEVLQRAYRLAGVNPEDYINPRLSSPAAGDADPGSDSDDVDDFELLRDIQNRFSILADEQPQST--PVSAD
        MS R+HVD  D E PA   D ED +VD+DME L+RA RLAGVN EDY+NP+LS PAAGDA+ GSDSDDVDD ELLR+IQNRFSI ADEQP S   PV+AD
Subjt:  MSLRNHVDEIDVEHPA---DKEDGVVDEDMEVLQRAYRLAGVNPEDYINPRLSSPAAGDADPGSDSDDVDDFELLRDIQNRFSILADEQPQST--PVSAD

Query:  EEEDEFEMLRSIQRRFAAYESDTLSNKPNQSRDYVGSLKLDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEE
        EEED+FE LRSIQRRFAAYESD LSNKP+QS D  G LK+DSD+  VE  TSS+R SM+AFEKGSLPKAALAF+DAIKKNRSQQKF+RSKMIHLEARIEE
Subjt:  EEEDEFEMLRSIQRRFAAYESDTLSNKPNQSRDYVGSLKLDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEE

Query:  NKKLRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQ-AKDSSKKDKRLSGMYYGPDENSHVACYRMALAKFPPVDRKKWSIVERENLGKGIRQ
        NKKLRKR K+LK FQGSC+R+T+CALSQM+DPRVQLISA KPQ AKDSSKKDKRLS M YGP ENSHVACYR+AL KF PVDRK+WS  ERENLGKGIRQ
Subjt:  NKKLRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQ-AKDSSKKDKRLSGMYYGPDENSHVACYRMALAKFPPVDRKKWSIVERENLGKGIRQ

Query:  QFQEMVLQISVDQISGPQGISGDSDDLDNILASIKDLDIAPDKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKSLLFTIQQ
        QFQEMVLQISVDQIS  QG S +SDDLDNILASIKDLDI P+KIREFLPKVNWDKLASMYL+GRSGAECEARWLNFEDPLINR+PWTTSEDK+LLFTIQQ
Subjt:  QFQEMVLQISVDQISGPQGISGDSDDLDNILASIKDLDIAPDKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKSLLFTIQQ

Query:  KGLNNWIEMAVSLGTNRTPFQCLSRYQRSLNASILKREWTKEEDDRLRSAVATFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRL
        KGLNNWI++AVSLGTNRTPFQ LSRYQRSLNASILK EWTK+EDD+LRSAVA FG  DWQAVASTLEGR G QCSNRWKKSLDPART++GYFTPDED RL
Subjt:  KGLNNWIEMAVSLGTNRTPFQCLSRYQRSLNASILKREWTKEEDDRLRSAVATFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRL

Query:  KIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQ
        KIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFP++VPLLQEARKIQ
Subjt:  KIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQ

Query:  KAALISNFVDRETERPALGPADFRPRPNTDSLCNTDGPIPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSK
        K ALISNFVDRE+ERPALGP DFRP PN+  LCNTD P  APKRNV+TR+MPVSRNEKSA GDAPKKRKSN QR + D TAQV  A NTS VPE V+S+K
Subjt:  KAALISNFVDRETERPALGPADFRPRPNTDSLCNTDGPIPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSK

Query:  PQRKRNRRGAYTAKRIGVPELRSDSEWCAKQNLDTESLGLQLNSKE-SERSNSNCTETVDENIMEVLENKVAEKLTEENACFSEPEKNQNSTGSSGVSVL
        PQRKR R GAYT +R G P++  +SE CA+QN DT +L +QLN KE +ER NS+C ETVDEN MEV ENK AE  +E   CFSE E+NQNSTGSSGVSVL
Subjt:  PQRKRNRRGAYTAKRIGVPELRSDSEWCAKQNLDTESLGLQLNSKE-SERSNSNCTETVDENIMEVLENKVAEKLTEENACFSEPEKNQNSTGSSGVSVL

Query:  SEMTNDLVDYNPSILTDTTLFASTTVDDIEELKGKSAADRDLDDSNSFSLAHSCLELRTVDSEGVDSYSVDEYTAKSNGVCNPTQGRRKKNSKTSNNSHD
        SEMTND+ +YNPS L DTTL AS T DDI E KG + AD+DLD SNSFSL  SCLELRT DSEGVDSYSVDE+T KS+ VC P QGRRKKNSK SN S D
Subjt:  SEMTNDLVDYNPSILTDTTLFASTTVDDIEELKGKSAADRDLDDSNSFSLAHSCLELRTVDSEGVDSYSVDEYTAKSNGVCNPTQGRRKKNSKTSNNSHD

Query:  NLLIPRQQIVQETLGTKKPLHHNQSKKRKHSNTGPSTLKTSEAVEEVDDCTLVGFLQKRLKRTAMTHNETVDCSSNAPLKVDNDDNEPTIASFLN-KLKR
        +L +  QQ   E  GT +    NQ KKRKHS+T  S L T EAVEEVDDCTL+GFLQKRLKRT  TH + VD SS+   +VDNDDN+PT+A  L  KLKR
Subjt:  NLLIPRQQIVQETLGTKKPLHHNQSKKRKHSNTGPSTLKTSEAVEEVDDCTLVGFLQKRLKRTAMTHNETVDCSSNAPLKVDNDDNEPTIASFLN-KLKR

Query:  KKH
        KKH
Subjt:  KKH

SwissProt top hitse value%identityAlignment
P46200 Transcriptional activator Myb1.4e-3039.2Show/hide
Query:  KREWTKEEDDRLRSAVATFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCL
        K  WT+EED++L+  V   G  DW+ +A+ L  R   QC +RW+K L+P    KG +T +ED R+   V  +GPK W+  A+ L GR   QCRERW N L
Subjt:  KREWTKEEDDRLRSAVATFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCL

Query:  DPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVP---LLQEARKIQKAALISNF
        +P +++  WTEEED  +  A +  G  WA++A  +P RTDN  +  W      +V     LQE+ K  + A+ ++F
Subjt:  DPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVP---LLQEARKIQKAALISNF

Q08759 Transcriptional activator Myb1.1e-3037.57Show/hide
Query:  LSRYQRSLNASILKREWTKEEDDRLRSAVATFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLKIAVLLFGPKNWNKKAEFLPGR
        LS+ +R L     K  WT+EED++L+  V   G  +W+ +AS L  R   QC +RW+K L+P    KG +T +ED R+   V  +GPK W+  A+ L GR
Subjt:  LSRYQRSLNASILKREWTKEEDDRLRSAVATFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLKIAVLLFGPKNWNKKAEFLPGR

Query:  NQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLF---PDEVPLLQEARKIQKAALISNF
           QCRERW N L+P +++  WTEEED  +  A +  G  WA++A  +P RTDN  +  W        ++   LQ + K  +  +++NF
Subjt:  NQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLF---PDEVPLLQEARKIQKAALISNF

Q54NA6 Myb-like protein L2.2e-5537.04Show/hide
Query:  KKWSIVERENLGKGIRQQ-FQEMVLQISVDQISGP---------QGISGDSDDLDNILASIKDLDIAPDKIREFLPKVNWDKLASMYLQGRSGAECEARW
        ++W+  E E L KGI+++  Q+ + ++S D++S           Q  S ++++ +N + +  + +   DK     P +       +    RS  E   RW
Subjt:  KKWSIVERENLGKGIRQQ-FQEMVLQISVDQISGP---------QGISGDSDDLDNILASIKDLDIAPDKIREFLPKVNWDKLASMYLQGRSGAECEARW

Query:  LNFEDPLINRDPWTTSEDKSLLFTIQQKGLNNWIEMAVSLGTNRTPFQCLSRYQRSLNASILKREWTKEEDDRLRSAVA--TFGVR-DWQAVASTLEGRA
         N +DP IN+ P+T  EDK LL   ++   + W ++++ LGTNRTP  C+ RYQRSLN+ ++KREWTKEED+ L   +     G R DWQ +   + GR 
Subjt:  LNFEDPLINRDPWTTSEDKSLLFTIQQKGLNNWIEMAVSLGTNRTPFQCLSRYQRSLNASILKREWTKEEDDRLRSAVA--TFGVR-DWQAVASTLEGRA

Query:  GTQCSNRWKKSLDPARTRKGYFTPDEDIRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGY-SWAKVAAC
        G QC +RW K+LDP+  +KG ++P+ED  L  AV  +G  NW      + GR  VQCRER+ N LDP L +  WT +ED RL     + G   W+ VA  
Subjt:  GTQCSNRWKKSLDPARTRKGYFTPDEDIRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGY-SWAKVAAC

Query:  VPSRTDNECRRRWKKLFPDEVPLLQEARKI--QKAALISNFVDRETERPAL
        + +RTDN+C RRWK+L      L     K+  +K   +SNF  R+ ER  L
Subjt:  VPSRTDNECRRRWKKLFPDEVPLLQEARKI--QKAALISNFVDRETERPAL

Q5SXM2 snRNA-activating protein complex subunit 43.7e-3927.56Show/hide
Query:  IKKNRSQQKFIRSKMIHLEARIEENKKLRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPDENSHV-ACYRMALA
        ++ N   Q+ I+ K+      + +N++ ++  ++++D  GS  + T     + + P   +    KP  KD      +++G+  GP  N          + 
Subjt:  IKKNRSQQKFIRSKMIHLEARIEENKKLRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPDENSHV-ACYRMALA

Query:  KFPPVDRKKWSIVERENLGKGIRQQFQEMVLQISVDQISG-PQGISGDSDDLD---------NILASIKDLDIAPDK--IREFLPKVNWDKLASMYLQG-
         F  +   KW   E+  L K +     + +LQ  + ++    Q  S  S +L+              I+D++  P++  +   L   +W+K++++  +G 
Subjt:  KFPPVDRKKWSIVERENLGKGIRQQFQEMVLQISVDQISG-PQGISGDSDDLD---------NILASIKDLDIAPDK--IREFLPKVNWDKLASMYLQG-

Query:  RSGAECEARWLNFEDPLINRDPWTTSEDKSLLFTIQQKGLNNWIEMAVSLGTNRTPFQCLSRYQRSLNASILKREWTKEEDDRLRSAVATFGVRD---WQ
        RS  E    W N E P IN+  W+  E++ L       G   W ++A  LGT+R+ FQCL ++Q+  N ++ ++EWT+EED  L   V    V     ++
Subjt:  RSGAECEARWLNFEDPLINRDPWTTSEDKSLLFTIQQKGLNNWIEMAVSLGTNRTPFQCLSRYQRSLNASILKREWTKEEDDRLRSAVATFGVRD---WQ

Query:  AVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHG
         +   +EGR   Q   RW KSLDP   +KGY+ P+ED +L  AV  +G ++W K  E +PGR+  QCR+R+   L  SL++  W  +E+ +L   I+++G
Subjt:  AVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHG

Query:  YS-WAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARK
           WAK+A+ +P R+ ++C  +WK +   +  L +  R+
Subjt:  YS-WAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARK

Q8BP86 snRNA-activating protein complex subunit 43.8e-3626.93Show/hide
Query:  IKKNRSQQKFIRSKMIHLEARIEENKKLRKRCKILKDFQGS-CKRRTSCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPDENSHV-ACYRMAL
        ++ N   Q+ IR K+  +   + +N++ ++  +IL D  G+ C +       + +   + +    KP  KD      +++G+  GP  N          +
Subjt:  IKKNRSQQKFIRSKMIHLEARIEENKKLRKRCKILKDFQGS-CKRRTSCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPDENSHV-ACYRMAL

Query:  AKFPPVDRKKWSIVERENLGKGIRQQFQEMVLQ---ISVDQISGPQG-ISGDSD------DLDNILASIKDLDIAPDK--IREFLPKVNWDKLASMYLQG
          F  +   KW   E+  L K +     + +LQ   + ++ +   Q  +S + +       +      I+D++  P++  +   L   +W+K++++  +G
Subjt:  AKFPPVDRKKWSIVERENLGKGIRQQFQEMVLQ---ISVDQISGPQG-ISGDSD------DLDNILASIKDLDIAPDK--IREFLPKVNWDKLASMYLQG

Query:  -RSGAECEARWLNFEDPLINRDPWTTSEDKSLLFTIQQKGLNNWIEMAVSLGTNRTPFQCLSRYQRSLNASILKREWTKEEDDRLRSAVATFGVRD---W
         RS  E    W + E P I++  W+T E + L       G   W  +A  LGT+R+ FQCL ++Q+  N ++ ++EWT+EED  L   V    V +   +
Subjt:  -RSGAECEARWLNFEDPLINRDPWTTSEDKSLLFTIQQKGLNNWIEMAVSLGTNRTPFQCLSRYQRSLNASILKREWTKEEDDRLRSAVATFGVRD---W

Query:  QAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEH
        + +   +EGR   Q   RW KSLDP+  ++G++ P+ED +L  AV  +G ++W K  E +PGR+  QCR+R+   L  SL++  W  +E+ +L   I+++
Subjt:  QAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEH

Query:  GYS-WAKVAACVPSRTDNECRRRWKKL
        G   WA++A+ +P R+ ++C  +WK L
Subjt:  GYS-WAKVAACVPSRTDNECRRRWKKL

Arabidopsis top hitse value%identityAlignment
AT3G09370.1 myb domain protein 3r-32.1e-2942.18Show/hide
Query:  KREWTKEEDDRLRSAVATFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCL
        K  WT EED+ LR AV TF  + W+ +A +   R   QC +RW+K L+P    KG +T +ED ++   V  +GP  W+  A+ LPGR   QCRERW N L
Subjt:  KREWTKEEDDRLRSAVATFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCL

Query:  DPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRW
        +P + +  WT EE++ L  A + HG  WA++A  +P RTDN  +  W
Subjt:  DPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRW

AT3G09370.2 myb domain protein 3r-32.1e-2942.18Show/hide
Query:  KREWTKEEDDRLRSAVATFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCL
        K  WT EED+ LR AV TF  + W+ +A +   R   QC +RW+K L+P    KG +T +ED ++   V  +GP  W+  A+ LPGR   QCRERW N L
Subjt:  KREWTKEEDDRLRSAVATFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCL

Query:  DPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRW
        +P + +  WT EE++ L  A + HG  WA++A  +P RTDN  +  W
Subjt:  DPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRW

AT3G18100.1 myb domain protein 4r14.2e-16344.5Show/hide
Query:  RNHVDEIDVEHPADKEDGVVDEDMEVLQRAYRLAGVNPEDY------INPR----------------------LSSPAAGDADPG------------SDS
        RN + E D +   D+ED  + ED+E L+RA  ++ VN + +      I P                       + S  A   D G            SDS
Subjt:  RNHVDEIDVEHPADKEDGVVDEDMEVLQRAYRLAGVNPEDY------INPR----------------------LSSPAAGDADPG------------SDS

Query:  DDVDDFELLRDIQNRFSILADEQPQSTPVSADEEEDEFEMLRSIQRRFAAY-----ESDTLSNKPNQSRDYVGSLKLDSDDIAVESQTSSKRP-------
        +  DDFE++R I+++ S+  D       +S DEE+D FE LR+I+RRF+AY     E   +++   + +    S    S +I   S T    P       
Subjt:  DDVDDFELLRDIQNRFSILADEQPQSTPVSADEEEDEFEMLRSIQRRFAAY-----ESDTLSNKPNQSRDYVGSLKLDSDDIAVESQTSSKRP-------

Query:  ----------SMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQAKD
                    +     S P+AA AFVDAI++NR+ QKF+R K+  +EA IE+N+K +K  +I+KDFQ SCKR T  AL Q  DPRV+LIS  K    D
Subjt:  ----------SMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQAKD

Query:  SSK----------KDKRLSGMYYGPDENSHVACYRMALAKFP-PVDRKKWSIVERENLGKGIRQQFQEMVLQISVDQISGPQGISGDSDDLDNILASIKD
        SS+           DK++S +  GP EN  V  YRMAL K+P  V R+KWS  E +NL KG++Q+ Q+++L  ++++ S    + G + D+D I  SI +
Subjt:  SSK----------KDKRLSGMYYGPDENSHVACYRMALAKFP-PVDRKKWSIVERENLGKGIRQQFQEMVLQISVDQISGPQGISGDSDDLDNILASIKD

Query:  LDIAPDKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKSLLFTIQQKGLNNWIEMAVSLGTNRTPFQCLSRYQRSLNASILK
        L+I P+ IR+FLPK+NWD   S+ ++ RS AECEARW++ EDPLIN  PWT +EDK+LL TI+Q  L +W+++AVSLGTNRTPFQCL+RYQRSLN SILK
Subjt:  LDIAPDKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKSLLFTIQQKGLNNWIEMAVSLGTNRTPFQCLSRYQRSLNASILK

Query:  REWTKEEDDRLRSAVATFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLD
        +EWT EEDD+LR+AV  FG +DWQ+VA+ L+GR GTQCSNRWKKSL P  TRKG ++ +ED R+K+AV LFG +NW+K ++F+PGR Q QCRERW NCLD
Subjt:  REWTKEEDDRLRSAVATFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLD

Query:  PSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAALISNFVDRETERPALGPADFRPRPNTDSLCNTD
        P + R +WTEEED +L  AI EHGYSW+KVA  +  RTDN+C RRWK+L+P +V LLQEAR++QK A + NFVDRE+ERPAL  +     P+       D
Subjt:  PSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAALISNFVDRETERPALGPADFRPRPNTDSLCNTD

Query:  GPIPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQ
              KR  K +K    R         PK+R+   +    D   Q
Subjt:  GPIPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNYQRFQTDATAQ

AT3G18100.2 myb domain protein 4r14.4e-15252.65Show/hide
Query:  SLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQAKDSSK----------KDKRL
        S P+AA AFVDAI++NR+ QKF+R K+  +EA IE+N+K +K  +I+KDFQ SCKR T  AL Q  DPRV+LIS  K    DSS+           DK++
Subjt:  SLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRCKILKDFQGSCKRRTSCALSQMIDPRVQLISAAKPQAKDSSK----------KDKRL

Query:  SGMYYGPDENSHVACYRMALAKFP-PVDRKKWSIVERENLGKGIRQQFQEMVLQISVDQISGPQGISGDSDDLDNILASIKDLDIAPDKIREFLPKVNWD
        S +  GP EN  V  YRMAL K+P  V R+KWS  E +NL KG++Q+ Q+++L  ++++ S    + G + D+D I  SI +L+I P+ IR+FLPK+NWD
Subjt:  SGMYYGPDENSHVACYRMALAKFP-PVDRKKWSIVERENLGKGIRQQFQEMVLQISVDQISGPQGISGDSDDLDNILASIKDLDIAPDKIREFLPKVNWD

Query:  KLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKSLLFTIQQKGLNNWIEMAVSLGTNRTPFQCLSRYQRSLNASILKREWTKEEDDRLRSAVATF
           S+ ++ RS AECEARW++ EDPLIN  PWT +EDK+LL TI+Q  L +W+++AVSLGTNRTPFQCL+RYQRSLN SILK+EWT EEDD+LR+AV  F
Subjt:  KLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKSLLFTIQQKGLNNWIEMAVSLGTNRTPFQCLSRYQRSLNASILKREWTKEEDDRLRSAVATF

Query:  GVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEI
        G +DWQ+VA+ L+GR GTQCSNRWKKSL P  TRKG ++ +ED R+K+AV LFG +NW+K ++F+PGR Q QCRERW NCLDP + R +WTEEED +L  
Subjt:  GVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEI

Query:  AIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAALISNFVDRETERPALGPADFRPRPNTDSLCNTDGPIPAPKRNVKTRKMPVS
        AI EHGYSW+KVA  +  RTDN+C RRWK+L+P +V LLQEAR++QK A + NFVDRE+ERPAL  +     P+       D      KR  K +K    
Subjt:  AIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAALISNFVDRETERPALGPADFRPRPNTDSLCNTDGPIPAPKRNVKTRKMPVS

Query:  RNEKSATGDAPKKRKSNYQRFQTDATAQ
        R         PK+R+   +    D   Q
Subjt:  RNEKSATGDAPKKRKSNYQRFQTDATAQ

AT3G18100.3 myb domain protein 4r11.8e-14242.96Show/hide
Query:  RNHVDEIDVEHPADKEDGVVDEDMEVLQRAYRLAGVNPEDYINP--RLSSPAAGDADPGSDSDDVDDFELLRDIQNRFSILADEQPQSTP------VSAD
        RN + E D +   D+ED  + ED+E L+RA  ++ VN + + +    +     G  +  SDS++ DDFE+LR I+++ +   D    S P      +S  
Subjt:  RNHVDEIDVEHPADKEDGVVDEDMEVLQRAYRLAGVNPEDYINP--RLSSPAAGDADPGSDSDDVDDFELLRDIQNRFSILADEQPQSTP------VSAD

Query:  EEEDEFEMLRSIQRRFAAYESDTLSNKPNQSRDYVGSLKLDSDDIAVESQTSSKR-----PSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLE
        E ED+FEM+RSI+ +        LS   + S   +G L  D +D A E+  + +R      +   F   S  K      + +   ++Q+ +   KM+  +
Subjt:  EEEDEFEMLRSIQRRFAAYESDTLSNKPNQSRDYVGSLKLDSDDIAVESQTSSKR-----PSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLE

Query:  ARIEENKKLRKRC-----------KILKDFQGSCKRRTSCALS--QMIDPRVQLISAAKPQAKDSSK----------KDKRLSGMYYGPDENSHVACYRM
         R++ + KL +R              L++     K+R+S   +  +M DPRV+LIS  K    DSS+           DK++S +  GP EN  V  YRM
Subjt:  ARIEENKKLRKRC-----------KILKDFQGSCKRRTSCALS--QMIDPRVQLISAAKPQAKDSSK----------KDKRLSGMYYGPDENSHVACYRM

Query:  ALAKFP-PVDRKKWSIVERENLGKGIRQQFQEMVLQISVDQISGPQGISGDSDDLDNILASIKDLDIAPDKIREFLPKVNWDKLASMYLQGRSGAECEAR
        AL K+P  V R+KWS  E +NL KG++Q+ Q+++L  ++++ S    + G + D+D I  SI +L+I P+ IR+FLPK+NWD   S+ ++ RS AECEAR
Subjt:  ALAKFP-PVDRKKWSIVERENLGKGIRQQFQEMVLQISVDQISGPQGISGDSDDLDNILASIKDLDIAPDKIREFLPKVNWDKLASMYLQGRSGAECEAR

Query:  WLNFEDPLINRDPWTTSEDKSLLFTIQQKGLNNWIEMAVSLGTNRTPFQCLSRYQRSLNASILKREWTKEEDDRLRSAVATFGVRDWQAVASTLEGRAGT
        W++ EDPLIN  PWT +EDK+LL TI+Q  L +W+++AVSLGTNRTPFQCL+RYQRSLN SILK+EWT EEDD+LR+AV  FG +DWQ+VA+ L+GR GT
Subjt:  WLNFEDPLINRDPWTTSEDKSLLFTIQQKGLNNWIEMAVSLGTNRTPFQCLSRYQRSLNASILKREWTKEEDDRLRSAVATFGVRDWQAVASTLEGRAGT

Query:  QCSNRWKKSLDPARTRKGYFTPDEDIRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPS
        QCSNRWKKSL P  TRKG ++ +ED R+K+AV LFG +NW+K ++F+PGR Q QCRERW NCLDP + R +WTEEED +L  AI EHGYSW+KVA  +  
Subjt:  QCSNRWKKSLDPARTRKGYFTPDEDIRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPS

Query:  RTDNECRRRWKKLFPDEVPLLQEARKIQKAALISNFVDRETERPALGPADFRPRPNTDSLCNTDGPIPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNY
        RTDN+C RRWK+L+P +V LLQEAR++QK A + NFVDRE+ERPAL  +     P+       D      KR  K +K    R         PK+R+   
Subjt:  RTDNECRRRWKKLFPDEVPLLQEARKIQKAALISNFVDRETERPALGPADFRPRPNTDSLCNTDGPIPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNY

Query:  QRFQTDATAQ
        +    D   Q
Subjt:  QRFQTDATAQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTCTCCGCAACCATGTTGATGAAATTGACGTTGAGCATCCTGCTGACAAGGAAGATGGTGTGGTTGATGAGGACATGGAAGTTCTTCAGAGAGCCTATAGGCTTGC
TGGTGTTAATCCTGAGGATTATATTAATCCCAGGTTGTCATCACCTGCTGCCGGAGATGCTGATCCTGGTTCTGATTCTGATGATGTTGATGATTTCGAACTTCTTCGAG
ATATTCAGAATCGGTTCTCGATTTTGGCTGATGAGCAGCCGCAGAGTACACCAGTGTCTGCAGACGAGGAGGAAGATGAATTTGAGATGCTTCGTTCAATACAGCGGCGC
TTTGCAGCGTATGAAAGTGATACTTTGAGCAATAAACCCAATCAGTCTCGTGACTATGTTGGATCTCTGAAGTTGGATTCTGACGACATAGCTGTTGAGAGCCAGACATC
CTCAAAACGGCCATCCATGCTAGCCTTTGAAAAGGGAAGCTTGCCAAAGGCCGCGTTGGCATTTGTTGATGCCATCAAGAAGAATAGGTCCCAGCAGAAGTTTATTCGTA
GTAAGATGATTCATCTTGAAGCTAGAATTGAGGAGAACAAAAAGCTCAGAAAACGTTGCAAAATTCTCAAAGATTTCCAGGGTTCATGTAAGCGAAGAACAAGTTGTGCA
CTGTCTCAAATGATAGATCCTCGAGTCCAGTTAATTTCAGCTGCAAAACCACAAGCAAAGGATTCATCAAAGAAGGACAAACGACTATCTGGAATGTATTATGGCCCGGA
TGAGAATTCTCATGTTGCGTGTTACAGAATGGCATTGGCAAAGTTCCCCCCTGTTGATCGAAAGAAATGGTCCATTGTAGAAAGGGAGAATCTAGGGAAGGGAATAAGAC
AGCAGTTTCAGGAGATGGTGCTTCAGATTTCAGTGGATCAAATCAGTGGGCCACAAGGAATTTCTGGAGATTCAGATGATTTGGATAACATTCTTGCATCAATAAAAGAC
CTTGACATCGCTCCTGACAAGATTAGGGAATTTCTGCCAAAAGTTAATTGGGACAAATTGGCTTCCATGTATCTTCAGGGTCGCTCAGGTGCAGAATGTGAAGCAAGATG
GTTGAATTTTGAAGACCCCCTAATTAATCGAGATCCATGGACTACAAGTGAGGATAAAAGTCTTTTGTTTACTATCCAACAGAAGGGGTTGAATAACTGGATTGAGATGG
CAGTTTCATTGGGCACAAACAGAACTCCTTTTCAGTGCCTGTCTCGGTATCAAAGAAGTTTAAATGCTTCCATATTAAAGAGGGAGTGGACCAAAGAGGAGGATGATAGA
CTTCGATCTGCTGTTGCTACATTTGGCGTGAGAGATTGGCAGGCAGTAGCTTCTACTTTGGAAGGACGAGCTGGCACACAGTGCTCTAATAGGTGGAAAAAATCCCTTGA
CCCAGCTAGGACAAGAAAAGGTTATTTCACTCCAGATGAAGACATTCGCTTGAAAATTGCTGTACTGCTCTTTGGGCCCAAAAATTGGAACAAGAAAGCAGAGTTTTTGC
CTGGTCGGAATCAAGTTCAATGCAGAGAAAGATGGTTTAATTGTTTAGATCCTTCCTTGAGAAGATGTGAGTGGACAGAAGAGGAGGATTTAAGGCTGGAGATAGCAATT
CAGGAACATGGATATAGCTGGGCTAAGGTAGCTGCATGTGTGCCATCACGTACAGATAATGAGTGTCGGAGGAGATGGAAGAAGTTATTTCCCGATGAAGTTCCTTTGCT
CCAGGAAGCTAGAAAGATTCAGAAGGCTGCTCTTATTAGCAACTTTGTTGATAGGGAAACAGAGCGTCCTGCTCTTGGTCCTGCTGACTTTCGACCCAGGCCAAATACAG
ATTCATTATGTAACACTGATGGTCCAATACCTGCCCCAAAAAGAAATGTGAAAACAAGAAAGATGCCAGTGTCAAGGAATGAAAAGAGTGCTACTGGTGATGCTCCAAAG
AAGAGGAAATCAAATTATCAAAGGTTTCAGACTGATGCAACTGCTCAGGTGGGTATTGCATATAATACCTCTTTTGTCCCAGAAGAGGTTCAATCTTCAAAGCCTCAAAG
AAAACGAAATAGACGTGGAGCTTATACTGCAAAGAGAATAGGGGTACCGGAACTACGTTCTGACAGCGAGTGGTGTGCTAAACAGAATTTGGACACTGAGAGCCTTGGGT
TGCAGCTGAATAGTAAGGAATCTGAGAGGAGCAACAGTAACTGTACCGAGACTGTTGATGAGAATATTATGGAAGTTCTTGAGAACAAAGTTGCAGAGAAGCTTACTGAA
GAAAATGCATGCTTCTCAGAACCAGAAAAAAATCAAAACTCAACCGGATCTTCTGGAGTCTCAGTATTGTCAGAAATGACTAACGACTTGGTTGACTATAATCCCTCTAT
CCTCACAGATACAACATTGTTTGCTAGTACTACTGTGGATGATATTGAAGAATTGAAGGGGAAGAGTGCTGCAGACAGGGATTTGGATGACAGTAATAGTTTCTCGTTAG
CACACAGTTGCTTAGAACTCCGGACAGTTGACAGTGAAGGCGTAGACAGTTATTCTGTGGACGAATATACAGCTAAGAGCAATGGGGTTTGTAATCCCACCCAAGGCAGA
AGGAAGAAAAATAGCAAAACATCAAATAACAGCCATGATAATTTGCTTATTCCTCGTCAACAAATTGTGCAAGAGACATTGGGAACGAAGAAGCCTCTTCATCATAATCA
ATCAAAGAAGAGAAAACATAGCAATACAGGTCCAAGTACGTTAAAAACATCGGAGGCAGTTGAGGAGGTTGATGACTGCACTCTCGTGGGGTTCTTGCAAAAAAGATTGA
AGAGGACGGCAATGACACATAACGAGACAGTTGATTGCAGTTCTAATGCTCCTTTAAAAGTTGATAATGACGATAATGAGCCTACCATTGCCTCGTTTCTTAATAAATTG
AAGAGAAAAAAGCATCAGCGGCCTAGTGGTGATGAGTTAAATTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCTCTCCGCAACCATGTTGATGAAATTGACGTTGAGCATCCTGCTGACAAGGAAGATGGTGTGGTTGATGAGGACATGGAAGTTCTTCAGAGAGCCTATAGGCTTGC
TGGTGTTAATCCTGAGGATTATATTAATCCCAGGTTGTCATCACCTGCTGCCGGAGATGCTGATCCTGGTTCTGATTCTGATGATGTTGATGATTTCGAACTTCTTCGAG
ATATTCAGAATCGGTTCTCGATTTTGGCTGATGAGCAGCCGCAGAGTACACCAGTGTCTGCAGACGAGGAGGAAGATGAATTTGAGATGCTTCGTTCAATACAGCGGCGC
TTTGCAGCGTATGAAAGTGATACTTTGAGCAATAAACCCAATCAGTCTCGTGACTATGTTGGATCTCTGAAGTTGGATTCTGACGACATAGCTGTTGAGAGCCAGACATC
CTCAAAACGGCCATCCATGCTAGCCTTTGAAAAGGGAAGCTTGCCAAAGGCCGCGTTGGCATTTGTTGATGCCATCAAGAAGAATAGGTCCCAGCAGAAGTTTATTCGTA
GTAAGATGATTCATCTTGAAGCTAGAATTGAGGAGAACAAAAAGCTCAGAAAACGTTGCAAAATTCTCAAAGATTTCCAGGGTTCATGTAAGCGAAGAACAAGTTGTGCA
CTGTCTCAAATGATAGATCCTCGAGTCCAGTTAATTTCAGCTGCAAAACCACAAGCAAAGGATTCATCAAAGAAGGACAAACGACTATCTGGAATGTATTATGGCCCGGA
TGAGAATTCTCATGTTGCGTGTTACAGAATGGCATTGGCAAAGTTCCCCCCTGTTGATCGAAAGAAATGGTCCATTGTAGAAAGGGAGAATCTAGGGAAGGGAATAAGAC
AGCAGTTTCAGGAGATGGTGCTTCAGATTTCAGTGGATCAAATCAGTGGGCCACAAGGAATTTCTGGAGATTCAGATGATTTGGATAACATTCTTGCATCAATAAAAGAC
CTTGACATCGCTCCTGACAAGATTAGGGAATTTCTGCCAAAAGTTAATTGGGACAAATTGGCTTCCATGTATCTTCAGGGTCGCTCAGGTGCAGAATGTGAAGCAAGATG
GTTGAATTTTGAAGACCCCCTAATTAATCGAGATCCATGGACTACAAGTGAGGATAAAAGTCTTTTGTTTACTATCCAACAGAAGGGGTTGAATAACTGGATTGAGATGG
CAGTTTCATTGGGCACAAACAGAACTCCTTTTCAGTGCCTGTCTCGGTATCAAAGAAGTTTAAATGCTTCCATATTAAAGAGGGAGTGGACCAAAGAGGAGGATGATAGA
CTTCGATCTGCTGTTGCTACATTTGGCGTGAGAGATTGGCAGGCAGTAGCTTCTACTTTGGAAGGACGAGCTGGCACACAGTGCTCTAATAGGTGGAAAAAATCCCTTGA
CCCAGCTAGGACAAGAAAAGGTTATTTCACTCCAGATGAAGACATTCGCTTGAAAATTGCTGTACTGCTCTTTGGGCCCAAAAATTGGAACAAGAAAGCAGAGTTTTTGC
CTGGTCGGAATCAAGTTCAATGCAGAGAAAGATGGTTTAATTGTTTAGATCCTTCCTTGAGAAGATGTGAGTGGACAGAAGAGGAGGATTTAAGGCTGGAGATAGCAATT
CAGGAACATGGATATAGCTGGGCTAAGGTAGCTGCATGTGTGCCATCACGTACAGATAATGAGTGTCGGAGGAGATGGAAGAAGTTATTTCCCGATGAAGTTCCTTTGCT
CCAGGAAGCTAGAAAGATTCAGAAGGCTGCTCTTATTAGCAACTTTGTTGATAGGGAAACAGAGCGTCCTGCTCTTGGTCCTGCTGACTTTCGACCCAGGCCAAATACAG
ATTCATTATGTAACACTGATGGTCCAATACCTGCCCCAAAAAGAAATGTGAAAACAAGAAAGATGCCAGTGTCAAGGAATGAAAAGAGTGCTACTGGTGATGCTCCAAAG
AAGAGGAAATCAAATTATCAAAGGTTTCAGACTGATGCAACTGCTCAGGTGGGTATTGCATATAATACCTCTTTTGTCCCAGAAGAGGTTCAATCTTCAAAGCCTCAAAG
AAAACGAAATAGACGTGGAGCTTATACTGCAAAGAGAATAGGGGTACCGGAACTACGTTCTGACAGCGAGTGGTGTGCTAAACAGAATTTGGACACTGAGAGCCTTGGGT
TGCAGCTGAATAGTAAGGAATCTGAGAGGAGCAACAGTAACTGTACCGAGACTGTTGATGAGAATATTATGGAAGTTCTTGAGAACAAAGTTGCAGAGAAGCTTACTGAA
GAAAATGCATGCTTCTCAGAACCAGAAAAAAATCAAAACTCAACCGGATCTTCTGGAGTCTCAGTATTGTCAGAAATGACTAACGACTTGGTTGACTATAATCCCTCTAT
CCTCACAGATACAACATTGTTTGCTAGTACTACTGTGGATGATATTGAAGAATTGAAGGGGAAGAGTGCTGCAGACAGGGATTTGGATGACAGTAATAGTTTCTCGTTAG
CACACAGTTGCTTAGAACTCCGGACAGTTGACAGTGAAGGCGTAGACAGTTATTCTGTGGACGAATATACAGCTAAGAGCAATGGGGTTTGTAATCCCACCCAAGGCAGA
AGGAAGAAAAATAGCAAAACATCAAATAACAGCCATGATAATTTGCTTATTCCTCGTCAACAAATTGTGCAAGAGACATTGGGAACGAAGAAGCCTCTTCATCATAATCA
ATCAAAGAAGAGAAAACATAGCAATACAGGTCCAAGTACGTTAAAAACATCGGAGGCAGTTGAGGAGGTTGATGACTGCACTCTCGTGGGGTTCTTGCAAAAAAGATTGA
AGAGGACGGCAATGACACATAACGAGACAGTTGATTGCAGTTCTAATGCTCCTTTAAAAGTTGATAATGACGATAATGAGCCTACCATTGCCTCGTTTCTTAATAAATTG
AAGAGAAAAAAGCATCAGCGGCCTAGTGGTGATGAGTTAAATTAA
Protein sequenceShow/hide protein sequence
MSLRNHVDEIDVEHPADKEDGVVDEDMEVLQRAYRLAGVNPEDYINPRLSSPAAGDADPGSDSDDVDDFELLRDIQNRFSILADEQPQSTPVSADEEEDEFEMLRSIQRR
FAAYESDTLSNKPNQSRDYVGSLKLDSDDIAVESQTSSKRPSMLAFEKGSLPKAALAFVDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRCKILKDFQGSCKRRTSCA
LSQMIDPRVQLISAAKPQAKDSSKKDKRLSGMYYGPDENSHVACYRMALAKFPPVDRKKWSIVERENLGKGIRQQFQEMVLQISVDQISGPQGISGDSDDLDNILASIKD
LDIAPDKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKSLLFTIQQKGLNNWIEMAVSLGTNRTPFQCLSRYQRSLNASILKREWTKEEDDR
LRSAVATFGVRDWQAVASTLEGRAGTQCSNRWKKSLDPARTRKGYFTPDEDIRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAI
QEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARKIQKAALISNFVDRETERPALGPADFRPRPNTDSLCNTDGPIPAPKRNVKTRKMPVSRNEKSATGDAPK
KRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKPQRKRNRRGAYTAKRIGVPELRSDSEWCAKQNLDTESLGLQLNSKESERSNSNCTETVDENIMEVLENKVAEKLTE
ENACFSEPEKNQNSTGSSGVSVLSEMTNDLVDYNPSILTDTTLFASTTVDDIEELKGKSAADRDLDDSNSFSLAHSCLELRTVDSEGVDSYSVDEYTAKSNGVCNPTQGR
RKKNSKTSNNSHDNLLIPRQQIVQETLGTKKPLHHNQSKKRKHSNTGPSTLKTSEAVEEVDDCTLVGFLQKRLKRTAMTHNETVDCSSNAPLKVDNDDNEPTIASFLNKL
KRKKHQRPSGDELN