; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC10G192780 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC10G192780
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
DescriptionsnRNA-activating protein complex subunit 4
Genome locationCicolChr10:10260246..10284897
RNA-Seq ExpressionCcUC10G192780
SyntenyCcUC10G192780
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000978 - RNA polymerase II proximal promoter sequence-specific DNA binding (molecular function)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
InterPro domainsIPR001005 - SANT/Myb domain
IPR009057 - Homeobox-like domain superfamily
IPR017884 - SANT domain
IPR017930 - Myb domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_011650584.1 uncharacterized protein LOC101216287 [Cucumis sativus]0.0e+0082.9Show/hide
Query:  MSHRNHDDEGDVELPVSKEDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSSVAGDANPSSDSDDVDDFELLRIIQNRFSCVADEQPLSTLPPMSLDEEE
        MS RNH DE DVE P  KED VVDEDME L+RAY+  GVNPEDYINPRLSS  AGDA+P SDSDDVDDFELLR IQNRFS +ADEQP ST  P+S DEEE
Subjt:  MSHRNHDDEGDVELPVSKEDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSSVAGDANPSSDSDDVDDFELLRIIQNRFSCVADEQPLSTLPPMSLDEEE

Query:  DEFEMLRSIQRRFAAYESNTLSNKPNQSCDHVGSLKMDFDDTAVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKK
        DEFEMLRSIQRRFAAYES+TLSNKPNQS D+VGSLK+D DD AVESQT+SKRPSMLAFEKGSLPKAALAF+DAIKKNRSQQKFIRSKMIHLEARIEENKK
Subjt:  DEFEMLRSIQRRFAAYESNTLSNKPNQSCDHVGSLKMDFDDTAVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKK

Query:  LRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSAMHYGPAENSHVACYRMALMKFPRVDRKKWSVVERENLGKGIRQQFQE
        LRKR KILKDFQGSCKRRT+CALSQMIDPRVQLISAAKPQAKDSSKKDKRLS M+YGP ENSHVACYRM L KFP VDRKKWS+VERENLGKGIRQQFQE
Subjt:  LRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSAMHYGPAENSHVACYRMALMKFPRVDRKKWSVVERENLGKGIRQQFQE

Query:  MVLQISVDQISGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTNEDKNLLFTIQQKGLN
        MVLQISVDQISG QG S DSDDLDNILASIKDLDI P+KIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTT+EDK+LLFTIQQKGLN
Subjt:  MVLQISVDQISGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTNEDKNLLFTIQQKGLN

Query:  NWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAV
        NWIE+AVS GTNRTPFQCLSRYQRSLNASILKREWTK+EDD+LR+AVATFG+ DWQAVASTLEGR G QCSNRWKKSLDPART++G+FTPDED RLKIAV
Subjt:  NWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAV

Query:  LLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAAL
        LLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDN+CRRRWKKLFP+EVPLLQEARKIQKAAL
Subjt:  LLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAAL

Query:  ISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVSSNEKSATGDALKKRKSNCQRNQADETAQAGIANNTSSVPEEVKSLKPQRK
        ISNFVDRE+ERPALGP DFRPRPNTD LCN+  P PAPKRNVKTRKMPVS NEKSATGDA KKRKSN QR Q D TAQ GIA NTS VPEEV+S KPQRK
Subjt:  ISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVSSNEKSATGDALKKRKSNCQRNQADETAQAGIANNTSSVPEEVKSLKPQRK

Query:  QNRHGAYTAESIGVPELCPNSEWCAKQSVDTQSLGVQLNSKESEGTNSDCIETVDENGMEVFENKVAEKLSERDVCFSEPEAIQNSTGSSGVSVLSEMTN
        +NR GAYTA+ IGVPEL  +SEWCAKQ++DT+SLG+QLNSKESE +NS+C ETVDEN MEV ENKVAEKL+E + CFSEPE  QNSTGSSGVSVLSEMTN
Subjt:  QNRHGAYTAESIGVPELCPNSEWCAKQSVDTQSLGVQLNSKESEGTNSDCIETVDENGMEVFENKVAEKLSERDVCFSEPEAIQNSTGSSGVSVLSEMTN

Query:  DMDEYNPSILPDTTLLASTTVDDLKELKRKSVADRDLDDCNSFSLPCSDLELRTIDNEGVDSYSMDEFTDKSHGVCKPPHTRRKKNSKTSSKSQDHSFVS
        D+ +YNPSIL DTTL ASTTVDD++ELK KS ADRDLDD NSFSL  S LELRT+D+EGVDSYS+DE+T KS+GVC P   RRKKNSKTS+ S D+  + 
Subjt:  DMDEYNPSILPDTTLLASTTVDDLKELKRKSVADRDLDDCNSFSLPCSDLELRTIDNEGVDSYSMDEFTDKSHGVCKPPHTRRKKNSKTSSKSQDHSFVS

Query:  CQQVELER-SPNEPRRHNQSKKRKHNSTNTNL----EAVEEVDDCTLVGFLQKRLKRTT-TNDKKVDCSSSTPLEVDTDDNDPTLASFLNKLKRKKHQPS
         QQ+  E     +P  HNQSKKRKH++T  +     EAVEEVDDCTLVGFLQKRLKRT  T+++ VDCSS+ PL+VD DDN+PT+ASFLNKLKRKKHQ  
Subjt:  CQQVELER-SPNEPRRHNQSKKRKHNSTNTNL----EAVEEVDDCTLVGFLQKRLKRTT-TNDKKVDCSSSTPLEVDTDDNDPTLASFLNKLKRKKHQPS

Query:  SGGELN
        SG ELN
Subjt:  SGGELN

XP_023515735.1 uncharacterized protein LOC111779809 isoform X1 [Cucurbita pepo subsp. pepo]0.0e+0081.84Show/hide
Query:  MSHRNHDDEGDVELPVSK---EDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSSVAGDANPSSDSDDVDDFELLRIIQNRFSCVADEQPLSTLPPMSLD
        MS R+H D GD ELP S+   EDD+VD+DME LRRA +  GVN EDYINPRLS   AGDAN  SDSDDVDD ELLR IQNRFS  ADEQPLS LPP++ D
Subjt:  MSHRNHDDEGDVELPVSK---EDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSSVAGDANPSSDSDDVDDFELLRIIQNRFSCVADEQPLSTLPPMSLD

Query:  EEEDEFEMLRSIQRRFAAYESNTLSNKPNQSCDHVGSLKMDFDDTAVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEE
        EEED+FEMLRSIQRRFAAYES+ LSNKP+QSCD  G LKMD ++T VE  T+S+R SM+AFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEE
Subjt:  EEEDEFEMLRSIQRRFAAYESNTLSNKPNQSCDHVGSLKMDFDDTAVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEE

Query:  NKKLRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSAMHYGPAENSHVACYRMALMKFPRVDRKKWSVVERENLGKGIRQQ
        NKKLRKRFK+LK FQGSC+R+TTCAL+QM+DPRVQLISA KPQAKDSSKKDKRLS+M YGPAENSHVACYR A  KF  VDRK+WS  ERENLGKGIRQQ
Subjt:  NKKLRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSAMHYGPAENSHVACYRMALMKFPRVDRKKWSVVERENLGKGIRQQ

Query:  FQEMVLQISVDQISGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTNEDKNLLFTIQQK
        FQEMVLQISVDQIS +QGFSA+SDDLDNILASIK LDITPEKIREFLPKVNWDKLASMYL+GRSGAECEARWLNFEDPLINR+PWTT+EDKNLLFTIQQK
Subjt:  FQEMVLQISVDQISGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTNEDKNLLFTIQQK

Query:  GLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLK
        GLNNWIE+AVS GTNRTPFQCLSRYQRSLNASILK EWTKDEDDKLR+AVA FG GDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRG+FTPDEDSRLK
Subjt:  GLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLK

Query:  IAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQK
        IAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDN+CRRRWKKLFPN+VPLLQEARKIQK
Subjt:  IAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQK

Query:  AALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVSSNEKSATGDALKKRKSNCQRNQADETAQAGIANNTSSVPEEVKSLKP
         ALISNFVDRESERPALGPTDFRP PN+ LLCN+ DP  APKRNV+TR+MPVS NEKSA GDA K+RKSN QRN+ADETAQ    NNTSSVP EVKS KP
Subjt:  AALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVSSNEKSATGDALKKRKSNCQRNQADETAQAGIANNTSSVPEEVKSLKP

Query:  QRKQNRHGAYTAESIGVPELCPNSEWCAKQSVDTQSLGVQLNSKE-SEGTNSDCIETVDENGMEVFENKVAEKLSERDVCFSEPEAIQNSTGSSGVSVLS
        QRK+ RHGAYT    G P++  NSE CA+Q+ DT+S+ VQLN KE +E  NSDC ETVDENGMEVFENK AE  SE  VCFSE E  QNSTGSSGVSVLS
Subjt:  QRKQNRHGAYTAESIGVPELCPNSEWCAKQSVDTQSLGVQLNSKE-SEGTNSDCIETVDENGMEVFENKVAEKLSERDVCFSEPEAIQNSTGSSGVSVLS

Query:  EMTNDMDEYNPSILPDTTLLASTTVDDLKELKRKSVADRDLDDCNSFSLPCSDLELRTIDNEGVDSYSMDEFTDKSHGVCKPPHTRRKKNSKTSSKSQDH
        EMTNDMDEYNPS LPDTTLLAS T DD+ E K  +VAD+DLDD NSFSLP S LELRT D+EGVDSYS+DEFTDKSHGVCK P  RRKKNSK S+KSQD 
Subjt:  EMTNDMDEYNPSILPDTTLLASTTVDDLKELKRKSVADRDLDDCNSFSLPCSDLELRTIDNEGVDSYSMDEFTDKSHGVCKPPHTRRKKNSKTSSKSQDH

Query:  SFVSCQQVELERS-PNEPRRHNQSKKRKHNSTNTN----LEAVEEVDDCTLVGFLQKRLKR-TTTNDKKVDCSSSTPLEVDTDDNDPTLASFLN-KLKRK
        S VSCQQ ELE S  NE  R NQSKKRKH+ TNT+    +EAVEEVDDCTL GFLQKRLKR TTT+DKKVD SSSTP EVD DDNDPTLA  LN KLKRK
Subjt:  SFVSCQQVELERS-PNEPRRHNQSKKRKHNSTNTN----LEAVEEVDDCTLVGFLQKRLKR-TTTNDKKVDCSSSTPLEVDTDDNDPTLASFLN-KLKRK

Query:  KH
        KH
Subjt:  KH

XP_023515736.1 uncharacterized protein LOC111779809 isoform X2 [Cucurbita pepo subsp. pepo]0.0e+0081.74Show/hide
Query:  MSHRNHDDEGDVELPVSK---EDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSSVAGDANPSSDSDDVDDFELLRIIQNRFSCVADEQPLSTLPPMSLD
        MS R+H D GD ELP S+   EDD+VD+DME LRRA +  GVN EDYINPRLS   AGDAN  SDSDDVDD ELLR IQNRFS  ADEQPLS LPP++ D
Subjt:  MSHRNHDDEGDVELPVSK---EDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSSVAGDANPSSDSDDVDDFELLRIIQNRFSCVADEQPLSTLPPMSLD

Query:  EEEDEFEMLRSIQRRFAAYESNTLSNKPNQSCDHVGSLKMDFDDTAVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEE
        EEED+FEMLRSIQRRFAAYES+ LSNKP+QSCD  G LKMD ++T VE  T+S+R SM+AFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEE
Subjt:  EEEDEFEMLRSIQRRFAAYESNTLSNKPNQSCDHVGSLKMDFDDTAVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEE

Query:  NKKLRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSAMHYGPAENSHVACYRMALMKFPRVDRKKWSVVERENLGKGIRQQ
        NKKLRKRFK+LK FQGSC+R+TTCAL+QM+DPRVQLISA KPQAKDSSKKDKRLS+M YGPAENSHVACYR A  KF  VDRK+WS  ERENLGKGIRQQ
Subjt:  NKKLRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSAMHYGPAENSHVACYRMALMKFPRVDRKKWSVVERENLGKGIRQQ

Query:  FQEMVLQISVDQISGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTNEDKNLLFTIQQK
        FQEMVLQISVDQIS +QGFSA+SDDLDNILASIK LDITPEKIREFLPKVNWDKLASMYL+GRSGAECEARWLNFEDPLINR+PWTT+EDKNLLFTIQQK
Subjt:  FQEMVLQISVDQISGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTNEDKNLLFTIQQK

Query:  GLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLK
        GLNNWIE+AVS GTNRTPFQCLSRYQRSLNASILK EWTKDEDDKLR+AVA FG GDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRG+FTPDEDSRLK
Subjt:  GLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLK

Query:  IAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQK
        IAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDN+C RRWKKLFPN+VPLLQEARKIQK
Subjt:  IAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQK

Query:  AALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVSSNEKSATGDALKKRKSNCQRNQADETAQAGIANNTSSVPEEVKSLKP
         ALISNFVDRESERPALGPTDFRP PN+ LLCN+ DP  APKRNV+TR+MPVS NEKSA GDA K+RKSN QRN+ADETAQ    NNTSSVP EVKS KP
Subjt:  AALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVSSNEKSATGDALKKRKSNCQRNQADETAQAGIANNTSSVPEEVKSLKP

Query:  QRKQNRHGAYTAESIGVPELCPNSEWCAKQSVDTQSLGVQLNSKE-SEGTNSDCIETVDENGMEVFENKVAEKLSERDVCFSEPEAIQNSTGSSGVSVLS
        QRK+ RHGAYT    G P++  NSE CA+Q+ DT+S+ VQLN KE +E  NSDC ETVDENGMEVFENK AE  SE  VCFSE E  QNSTGSSGVSVLS
Subjt:  QRKQNRHGAYTAESIGVPELCPNSEWCAKQSVDTQSLGVQLNSKE-SEGTNSDCIETVDENGMEVFENKVAEKLSERDVCFSEPEAIQNSTGSSGVSVLS

Query:  EMTNDMDEYNPSILPDTTLLASTTVDDLKELKRKSVADRDLDDCNSFSLPCSDLELRTIDNEGVDSYSMDEFTDKSHGVCKPPHTRRKKNSKTSSKSQDH
        EMTNDMDEYNPS LPDTTLLAS T DD+ E K  +VAD+DLDD NSFSLP S LELRT D+EGVDSYS+DEFTDKSHGVCK P  RRKKNSK S+KSQD 
Subjt:  EMTNDMDEYNPSILPDTTLLASTTVDDLKELKRKSVADRDLDDCNSFSLPCSDLELRTIDNEGVDSYSMDEFTDKSHGVCKPPHTRRKKNSKTSSKSQDH

Query:  SFVSCQQVELERS-PNEPRRHNQSKKRKHNSTNTN----LEAVEEVDDCTLVGFLQKRLKR-TTTNDKKVDCSSSTPLEVDTDDNDPTLASFLN-KLKRK
        S VSCQQ ELE S  NE  R NQSKKRKH+ TNT+    +EAVEEVDDCTL GFLQKRLKR TTT+DKKVD SSSTP EVD DDNDPTLA  LN KLKRK
Subjt:  SFVSCQQVELERS-PNEPRRHNQSKKRKHNSTNTN----LEAVEEVDDCTLVGFLQKRLKR-TTTNDKKVDCSSSTPLEVDTDDNDPTLASFLN-KLKRK

Query:  KH
        KH
Subjt:  KH

XP_038905712.1 uncharacterized protein LOC120091681 isoform X1 [Benincasa hispida]0.0e+0085.46Show/hide
Query:  VRCAHSCSSMSHRNHDDEGDVELPVSKEDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSSVAGDANPSSDSDDVDDFELLRIIQNRFSCVADEQPLSTL
        VR AH  ++MS  NH DEGDVELP +KEDDVVDEDME L+RAY+ VGVNPEDYINPRLSS   GDAN   DSDD DDFELLR IQNRFS V DEQPLSTL
Subjt:  VRCAHSCSSMSHRNHDDEGDVELPVSKEDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSSVAGDANPSSDSDDVDDFELLRIIQNRFSCVADEQPLSTL

Query:  PPMSLDEEEDEFEMLRSIQRRFAAYESNTLSNKPNQSCDHVGSLKMDFDDTAVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHL
        PP+SLDEEEDEFEMLRSIQRRFAAYES+ LSNKPN+S D+VGSLKMD  +TA ESQT+SKRPSM+AFEKGSLPKAALAF+DAIKKNRSQQKFIRSKMIHL
Subjt:  PPMSLDEEEDEFEMLRSIQRRFAAYESNTLSNKPNQSCDHVGSLKMDFDDTAVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHL

Query:  EARIEENKKLRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSAMHYGPAENSHVACYRMALMKFPRVDRKKWSVVERENLG
        EARIEENKKLRKR KILKDFQGSCKR+TTCALSQMIDPRVQLISAAKPQAKDSSKKDKRLS M+YGPAENSHVACYRMAL KFPRVDRKKWS+VERENLG
Subjt:  EARIEENKKLRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSAMHYGPAENSHVACYRMALMKFPRVDRKKWSVVERENLG

Query:  KGIRQQFQEMVLQISVDQISGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTNEDKNLL
        KGIRQQFQEMVLQISVDQISGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYL GRSGAECEARWLNFEDPLINRDPWTT+EDKNLL
Subjt:  KGIRQQFQEMVLQISVDQISGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTNEDKNLL

Query:  FTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPD
        FTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLR+AVA FG+ DWQAVASTLEGR G QCSNRWKKSLDPARTKRGHFTPD
Subjt:  FTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPD

Query:  EDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQE
        ED+RLKIAVLL GPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQE
Subjt:  EDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQE

Query:  ARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVSSNEKSATGDALKKRKSNCQRNQADETAQAGIANNTSSVPEE
        ARKIQKAALISNFVDRESERPALGP DFRPR NTD+LC++ DP+PAPKRN KTRKMPVS NEKSATGDA +KRKSN QRNQAD TA+ GIANNTSSVPEE
Subjt:  ARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVSSNEKSATGDALKKRKSNCQRNQADETAQAGIANNTSSVPEE

Query:  VKSLKPQRKQNRHGAYTAESIGVPELCPNSEWCAKQSVDTQSLGVQLNSKESEGTNSDCIETVDENGMEVFENKVAEKLSERDVCFSEPEAIQNSTGSSG
        V+SLKP RK+NRH A T +  GV EL  N +WCAKQ+++T+S+GVQL+SKE E TNSD  ETVD NG+EVFENK+A+KLSERDV FSEPE  QNSTGSSG
Subjt:  VKSLKPQRKQNRHGAYTAESIGVPELCPNSEWCAKQSVDTQSLGVQLNSKESEGTNSDCIETVDENGMEVFENKVAEKLSERDVCFSEPEAIQNSTGSSG

Query:  VSVLSEMTNDMDEYNPSILPDTTLLASTTVDDLKELKRKSVADRDLDDCNSFSLPCSDLELRTIDNEGVDSYSMDEFTDKSHGVCKPPHTRRKKNSKTSS
        VSVLSEMTNDMDEYNPSILPDTTLLASTTVDD++ELK KS ADRDLDD NSFSLP S LELRTID EGVDSYS+D+ TDKSH VCK P  RRKKNSKTS 
Subjt:  VSVLSEMTNDMDEYNPSILPDTTLLASTTVDDLKELKRKSVADRDLDDCNSFSLPCSDLELRTIDNEGVDSYSMDEFTDKSHGVCKPPHTRRKKNSKTSS

Query:  KSQDHSFVSCQQVELER-SPNEPRRHNQSKKRKHNSTNTN----LEAVEEVDDCTLVGFLQKRLKRTTTNDKKVDCSSSTPLEVDTDDNDPTLASFLNKL
        K+ ++SF+SCQQVE ER   NEPR  NQSKKRKH+STNT+    LEAVEEVD+CTLVGFLQKRL       KKVDCSS TPLEVD DDND  +ASFLNKL
Subjt:  KSQDHSFVSCQQVELER-SPNEPRRHNQSKKRKHNSTNTN----LEAVEEVDDCTLVGFLQKRLKRTTTNDKKVDCSSSTPLEVDTDDNDPTLASFLNKL

Query:  KRKKHQPSSGG
        KRKKHQP S G
Subjt:  KRKKHQPSSGG

XP_038905717.1 uncharacterized protein LOC120091681 isoform X2 [Benincasa hispida]0.0e+0085.83Show/hide
Query:  MSHRNHDDEGDVELPVSKEDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSSVAGDANPSSDSDDVDDFELLRIIQNRFSCVADEQPLSTLPPMSLDEEE
        MS  NH DEGDVELP +KEDDVVDEDME L+RAY+ VGVNPEDYINPRLSS   GDAN   DSDD DDFELLR IQNRFS V DEQPLSTLPP+SLDEEE
Subjt:  MSHRNHDDEGDVELPVSKEDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSSVAGDANPSSDSDDVDDFELLRIIQNRFSCVADEQPLSTLPPMSLDEEE

Query:  DEFEMLRSIQRRFAAYESNTLSNKPNQSCDHVGSLKMDFDDTAVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKK
        DEFEMLRSIQRRFAAYES+ LSNKPN+S D+VGSLKMD  +TA ESQT+SKRPSM+AFEKGSLPKAALAF+DAIKKNRSQQKFIRSKMIHLEARIEENKK
Subjt:  DEFEMLRSIQRRFAAYESNTLSNKPNQSCDHVGSLKMDFDDTAVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKK

Query:  LRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSAMHYGPAENSHVACYRMALMKFPRVDRKKWSVVERENLGKGIRQQFQE
        LRKR KILKDFQGSCKR+TTCALSQMIDPRVQLISAAKPQAKDSSKKDKRLS M+YGPAENSHVACYRMAL KFPRVDRKKWS+VERENLGKGIRQQFQE
Subjt:  LRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSAMHYGPAENSHVACYRMALMKFPRVDRKKWSVVERENLGKGIRQQFQE

Query:  MVLQISVDQISGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTNEDKNLLFTIQQKGLN
        MVLQISVDQISGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYL GRSGAECEARWLNFEDPLINRDPWTT+EDKNLLFTIQQKGLN
Subjt:  MVLQISVDQISGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTNEDKNLLFTIQQKGLN

Query:  NWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAV
        NWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLR+AVA FG+ DWQAVASTLEGR G QCSNRWKKSLDPARTKRGHFTPDED+RLKIAV
Subjt:  NWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAV

Query:  LLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAAL
        LL GPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAAL
Subjt:  LLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAAL

Query:  ISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVSSNEKSATGDALKKRKSNCQRNQADETAQAGIANNTSSVPEEVKSLKPQRK
        ISNFVDRESERPALGP DFRPR NTD+LC++ DP+PAPKRN KTRKMPVS NEKSATGDA +KRKSN QRNQAD TA+ GIANNTSSVPEEV+SLKP RK
Subjt:  ISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVSSNEKSATGDALKKRKSNCQRNQADETAQAGIANNTSSVPEEVKSLKPQRK

Query:  QNRHGAYTAESIGVPELCPNSEWCAKQSVDTQSLGVQLNSKESEGTNSDCIETVDENGMEVFENKVAEKLSERDVCFSEPEAIQNSTGSSGVSVLSEMTN
        +NRH A T +  GV EL  N +WCAKQ+++T+S+GVQL+SKE E TNSD  ETVD NG+EVFENK+A+KLSERDV FSEPE  QNSTGSSGVSVLSEMTN
Subjt:  QNRHGAYTAESIGVPELCPNSEWCAKQSVDTQSLGVQLNSKESEGTNSDCIETVDENGMEVFENKVAEKLSERDVCFSEPEAIQNSTGSSGVSVLSEMTN

Query:  DMDEYNPSILPDTTLLASTTVDDLKELKRKSVADRDLDDCNSFSLPCSDLELRTIDNEGVDSYSMDEFTDKSHGVCKPPHTRRKKNSKTSSKSQDHSFVS
        DMDEYNPSILPDTTLLASTTVDD++ELK KS ADRDLDD NSFSLP S LELRTID EGVDSYS+D+ TDKSH VCK P  RRKKNSKTS K+ ++SF+S
Subjt:  DMDEYNPSILPDTTLLASTTVDDLKELKRKSVADRDLDDCNSFSLPCSDLELRTIDNEGVDSYSMDEFTDKSHGVCKPPHTRRKKNSKTSSKSQDHSFVS

Query:  CQQVELER-SPNEPRRHNQSKKRKHNSTNTN----LEAVEEVDDCTLVGFLQKRLKRTTTNDKKVDCSSSTPLEVDTDDNDPTLASFLNKLKRKKHQPSS
        CQQVE ER   NEPR  NQSKKRKH+STNT+    LEAVEEVD+CTLVGFLQKRL       KKVDCSS TPLEVD DDND  +ASFLNKLKRKKHQP S
Subjt:  CQQVELER-SPNEPRRHNQSKKRKHNSTNTN----LEAVEEVDDCTLVGFLQKRLKRTTTNDKKVDCSSSTPLEVDTDDNDPTLASFLNKLKRKKHQPSS

Query:  GG
         G
Subjt:  GG

TrEMBL top hitse value%identityAlignment
A0A0A0L2R2 Uncharacterized protein0.0e+0082.9Show/hide
Query:  MSHRNHDDEGDVELPVSKEDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSSVAGDANPSSDSDDVDDFELLRIIQNRFSCVADEQPLSTLPPMSLDEEE
        MS RNH DE DVE P  KED VVDEDME L+RAY+  GVNPEDYINPRLSS  AGDA+P SDSDDVDDFELLR IQNRFS +ADEQP ST  P+S DEEE
Subjt:  MSHRNHDDEGDVELPVSKEDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSSVAGDANPSSDSDDVDDFELLRIIQNRFSCVADEQPLSTLPPMSLDEEE

Query:  DEFEMLRSIQRRFAAYESNTLSNKPNQSCDHVGSLKMDFDDTAVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKK
        DEFEMLRSIQRRFAAYES+TLSNKPNQS D+VGSLK+D DD AVESQT+SKRPSMLAFEKGSLPKAALAF+DAIKKNRSQQKFIRSKMIHLEARIEENKK
Subjt:  DEFEMLRSIQRRFAAYESNTLSNKPNQSCDHVGSLKMDFDDTAVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKK

Query:  LRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSAMHYGPAENSHVACYRMALMKFPRVDRKKWSVVERENLGKGIRQQFQE
        LRKR KILKDFQGSCKRRT+CALSQMIDPRVQLISAAKPQAKDSSKKDKRLS M+YGP ENSHVACYRM L KFP VDRKKWS+VERENLGKGIRQQFQE
Subjt:  LRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSAMHYGPAENSHVACYRMALMKFPRVDRKKWSVVERENLGKGIRQQFQE

Query:  MVLQISVDQISGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTNEDKNLLFTIQQKGLN
        MVLQISVDQISG QG S DSDDLDNILASIKDLDI P+KIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTT+EDK+LLFTIQQKGLN
Subjt:  MVLQISVDQISGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTNEDKNLLFTIQQKGLN

Query:  NWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAV
        NWIE+AVS GTNRTPFQCLSRYQRSLNASILKREWTK+EDD+LR+AVATFG+ DWQAVASTLEGR G QCSNRWKKSLDPART++G+FTPDED RLKIAV
Subjt:  NWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAV

Query:  LLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAAL
        LLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDN+CRRRWKKLFP+EVPLLQEARKIQKAAL
Subjt:  LLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAAL

Query:  ISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVSSNEKSATGDALKKRKSNCQRNQADETAQAGIANNTSSVPEEVKSLKPQRK
        ISNFVDRE+ERPALGP DFRPRPNTD LCN+  P PAPKRNVKTRKMPVS NEKSATGDA KKRKSN QR Q D TAQ GIA NTS VPEEV+S KPQRK
Subjt:  ISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVSSNEKSATGDALKKRKSNCQRNQADETAQAGIANNTSSVPEEVKSLKPQRK

Query:  QNRHGAYTAESIGVPELCPNSEWCAKQSVDTQSLGVQLNSKESEGTNSDCIETVDENGMEVFENKVAEKLSERDVCFSEPEAIQNSTGSSGVSVLSEMTN
        +NR GAYTA+ IGVPEL  +SEWCAKQ++DT+SLG+QLNSKESE +NS+C ETVDEN MEV ENKVAEKL+E + CFSEPE  QNSTGSSGVSVLSEMTN
Subjt:  QNRHGAYTAESIGVPELCPNSEWCAKQSVDTQSLGVQLNSKESEGTNSDCIETVDENGMEVFENKVAEKLSERDVCFSEPEAIQNSTGSSGVSVLSEMTN

Query:  DMDEYNPSILPDTTLLASTTVDDLKELKRKSVADRDLDDCNSFSLPCSDLELRTIDNEGVDSYSMDEFTDKSHGVCKPPHTRRKKNSKTSSKSQDHSFVS
        D+ +YNPSIL DTTL ASTTVDD++ELK KS ADRDLDD NSFSL  S LELRT+D+EGVDSYS+DE+T KS+GVC P   RRKKNSKTS+ S D+  + 
Subjt:  DMDEYNPSILPDTTLLASTTVDDLKELKRKSVADRDLDDCNSFSLPCSDLELRTIDNEGVDSYSMDEFTDKSHGVCKPPHTRRKKNSKTSSKSQDHSFVS

Query:  CQQVELER-SPNEPRRHNQSKKRKHNSTNTNL----EAVEEVDDCTLVGFLQKRLKRTT-TNDKKVDCSSSTPLEVDTDDNDPTLASFLNKLKRKKHQPS
         QQ+  E     +P  HNQSKKRKH++T  +     EAVEEVDDCTLVGFLQKRLKRT  T+++ VDCSS+ PL+VD DDN+PT+ASFLNKLKRKKHQ  
Subjt:  CQQVELER-SPNEPRRHNQSKKRKHNSTNTNL----EAVEEVDDCTLVGFLQKRLKRTT-TNDKKVDCSSSTPLEVDTDDNDPTLASFLNKLKRKKHQPS

Query:  SGGELN
        SG ELN
Subjt:  SGGELN

A0A6J1E2J4 uncharacterized protein LOC111430000 isoform X20.0e+0081.64Show/hide
Query:  MSHRNHDDEGDVELPVSK---EDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSSVAGDANPSSDSDDVDDFELLRIIQNRFSCVADEQPLSTLPPMSLD
        MS R+H D GD ELP S+   EDD+VD+DME LRRA +  GVN ED INPRLS   AGDAN  SDSDDVDD ELLR IQNRFS  ADEQPLS LPP++ D
Subjt:  MSHRNHDDEGDVELPVSK---EDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSSVAGDANPSSDSDDVDDFELLRIIQNRFSCVADEQPLSTLPPMSLD

Query:  EEEDEFEMLRSIQRRFAAYESNTLSNKPNQSCDHVGSLKMDFDDTAVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEE
        EEED+FE LRSIQRRFAAYES+ LSNKP+QSCD  G LKMD D+T V   T+S+R SM+AFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEE
Subjt:  EEEDEFEMLRSIQRRFAAYESNTLSNKPNQSCDHVGSLKMDFDDTAVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEE

Query:  NKKLRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSAMHYGPAENSHVACYRMALMKFPRVDRKKWSVVERENLGKGIRQQ
        NKKLRKRFK+LK FQGSC+R+TTCAL+QM+DPRVQLISA KPQAKDSSKKDKRLSAM YGPAENSHVACYR AL KF  VDRK+WS  ERENLGKGIRQQ
Subjt:  NKKLRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSAMHYGPAENSHVACYRMALMKFPRVDRKKWSVVERENLGKGIRQQ

Query:  FQEMVLQISVDQISGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTNEDKNLLFTIQQK
        FQEMVLQISVDQIS +QGFSA+SDDLDNILASIK LDITPEKIREFLPKVNWDKLA MYLQGRSGAECEARWLNFEDPLINR+ WTT+EDKNLLFTIQQK
Subjt:  FQEMVLQISVDQISGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTNEDKNLLFTIQQK

Query:  GLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLK
        GLNNWIE+AVS GTNRTPFQCLSRYQRSLNASILK EWTKDEDDKLR+AVA FG GDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRG+FTPDEDSRLK
Subjt:  GLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLK

Query:  IAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQK
        IAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDN+C RRWKKLFPN+VPLLQEARKIQK
Subjt:  IAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQK

Query:  AALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVSSNEKSATGDALKKRKSNCQRNQADETAQAGIANNTSSVPEEVKSLKP
         ALISNFVDRESERPALGPTDFRP PN+ LLCN+ DP  APKRNV+ R+MPVS NEKSA GDA KK KSN QRNQADETAQ   ANNTSSVP EVKS KP
Subjt:  AALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVSSNEKSATGDALKKRKSNCQRNQADETAQAGIANNTSSVPEEVKSLKP

Query:  QRKQNRHGAYTAESIGVPELCPNSEWCAKQSVDTQSLGVQLNSKE-SEGTNSDCIETVDENGMEVFENKVAEKLSERDVCFSEPEAIQNSTGSSGVSVLS
        QRK+ RHGAYT    G P++  NSE CA+Q+ DT+SL VQLN KE +E  NSDC ETVDENGMEVFENK AE  SE  VCFSE E  QNSTGSSGVSVLS
Subjt:  QRKQNRHGAYTAESIGVPELCPNSEWCAKQSVDTQSLGVQLNSKE-SEGTNSDCIETVDENGMEVFENKVAEKLSERDVCFSEPEAIQNSTGSSGVSVLS

Query:  EMTNDMDEYNPSILPDTTLLASTTVDDLKELKRKSVADRDLDDCNSFSLPCSDLELRTIDNEGVDSYSMDEFTDKSHGVCKPPHTRRKKNSKTSSKSQDH
        EMTNDMDEYNPS  PDTTLLAS T DD+ E K  +VAD+DLDD NSFSLP S LELRT D+EGVDSYS+DEFTDKSHGVCK P  RRKKNSK S+KSQD 
Subjt:  EMTNDMDEYNPSILPDTTLLASTTVDDLKELKRKSVADRDLDDCNSFSLPCSDLELRTIDNEGVDSYSMDEFTDKSHGVCKPPHTRRKKNSKTSSKSQDH

Query:  SFVSCQQVELERS-PNEPRRHNQSKKRKHNSTNTN----LEAVEEVDDCTLVGFLQKRLKR-TTTNDKKVDCSSSTPLEVDTDDNDPTLASFL-NKLKRK
        S VSCQQ ELE S  NE  R NQSKKRKH+ TNT+    +EAVEEVDDCTL GFLQKRLKR TTT+DKKVD SSSTP EVD DDNDPTLA  L +KLKRK
Subjt:  SFVSCQQVELERS-PNEPRRHNQSKKRKHNSTNTN----LEAVEEVDDCTLVGFLQKRLKR-TTTNDKKVDCSSSTPLEVDTDDNDPTLASFL-NKLKRK

Query:  KH
        KH
Subjt:  KH

A0A6J1E6Z7 uncharacterized protein LOC111430000 isoform X10.0e+0081.74Show/hide
Query:  MSHRNHDDEGDVELPVSK---EDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSSVAGDANPSSDSDDVDDFELLRIIQNRFSCVADEQPLSTLPPMSLD
        MS R+H D GD ELP S+   EDD+VD+DME LRRA +  GVN ED INPRLS   AGDAN  SDSDDVDD ELLR IQNRFS  ADEQPLS LPP++ D
Subjt:  MSHRNHDDEGDVELPVSK---EDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSSVAGDANPSSDSDDVDDFELLRIIQNRFSCVADEQPLSTLPPMSLD

Query:  EEEDEFEMLRSIQRRFAAYESNTLSNKPNQSCDHVGSLKMDFDDTAVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEE
        EEED+FE LRSIQRRFAAYES+ LSNKP+QSCD  G LKMD D+T V   T+S+R SM+AFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEE
Subjt:  EEEDEFEMLRSIQRRFAAYESNTLSNKPNQSCDHVGSLKMDFDDTAVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEE

Query:  NKKLRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSAMHYGPAENSHVACYRMALMKFPRVDRKKWSVVERENLGKGIRQQ
        NKKLRKRFK+LK FQGSC+R+TTCAL+QM+DPRVQLISA KPQAKDSSKKDKRLSAM YGPAENSHVACYR AL KF  VDRK+WS  ERENLGKGIRQQ
Subjt:  NKKLRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSAMHYGPAENSHVACYRMALMKFPRVDRKKWSVVERENLGKGIRQQ

Query:  FQEMVLQISVDQISGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTNEDKNLLFTIQQK
        FQEMVLQISVDQIS +QGFSA+SDDLDNILASIK LDITPEKIREFLPKVNWDKLA MYLQGRSGAECEARWLNFEDPLINR+ WTT+EDKNLLFTIQQK
Subjt:  FQEMVLQISVDQISGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTNEDKNLLFTIQQK

Query:  GLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLK
        GLNNWIE+AVS GTNRTPFQCLSRYQRSLNASILK EWTKDEDDKLR+AVA FG GDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRG+FTPDEDSRLK
Subjt:  GLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLK

Query:  IAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQK
        IAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDN+CRRRWKKLFPN+VPLLQEARKIQK
Subjt:  IAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQK

Query:  AALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVSSNEKSATGDALKKRKSNCQRNQADETAQAGIANNTSSVPEEVKSLKP
         ALISNFVDRESERPALGPTDFRP PN+ LLCN+ DP  APKRNV+ R+MPVS NEKSA GDA KK KSN QRNQADETAQ   ANNTSSVP EVKS KP
Subjt:  AALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVSSNEKSATGDALKKRKSNCQRNQADETAQAGIANNTSSVPEEVKSLKP

Query:  QRKQNRHGAYTAESIGVPELCPNSEWCAKQSVDTQSLGVQLNSKE-SEGTNSDCIETVDENGMEVFENKVAEKLSERDVCFSEPEAIQNSTGSSGVSVLS
        QRK+ RHGAYT    G P++  NSE CA+Q+ DT+SL VQLN KE +E  NSDC ETVDENGMEVFENK AE  SE  VCFSE E  QNSTGSSGVSVLS
Subjt:  QRKQNRHGAYTAESIGVPELCPNSEWCAKQSVDTQSLGVQLNSKE-SEGTNSDCIETVDENGMEVFENKVAEKLSERDVCFSEPEAIQNSTGSSGVSVLS

Query:  EMTNDMDEYNPSILPDTTLLASTTVDDLKELKRKSVADRDLDDCNSFSLPCSDLELRTIDNEGVDSYSMDEFTDKSHGVCKPPHTRRKKNSKTSSKSQDH
        EMTNDMDEYNPS  PDTTLLAS T DD+ E K  +VAD+DLDD NSFSLP S LELRT D+EGVDSYS+DEFTDKSHGVCK P  RRKKNSK S+KSQD 
Subjt:  EMTNDMDEYNPSILPDTTLLASTTVDDLKELKRKSVADRDLDDCNSFSLPCSDLELRTIDNEGVDSYSMDEFTDKSHGVCKPPHTRRKKNSKTSSKSQDH

Query:  SFVSCQQVELERS-PNEPRRHNQSKKRKHNSTNTN----LEAVEEVDDCTLVGFLQKRLKR-TTTNDKKVDCSSSTPLEVDTDDNDPTLASFL-NKLKRK
        S VSCQQ ELE S  NE  R NQSKKRKH+ TNT+    +EAVEEVDDCTL GFLQKRLKR TTT+DKKVD SSSTP EVD DDNDPTLA  L +KLKRK
Subjt:  SFVSCQQVELERS-PNEPRRHNQSKKRKHNSTNTN----LEAVEEVDDCTLVGFLQKRLKR-TTTNDKKVDCSSSTPLEVDTDDNDPTLASFL-NKLKRK

Query:  KH
        KH
Subjt:  KH

A0A6J1JK98 uncharacterized protein LOC111485355 isoform X20.0e+0081.06Show/hide
Query:  MSHRNHDDEGDVELPVSK---EDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSSVAGDANPSSDSDDVDDFELLRIIQNRFSCVADEQPLSTLPPMSLD
        MS R+H D GD ELP S+   EDD+VD+DME LRRA +  GVN EDY+NP+LS   AGDAN  SDSDDVDD ELLR IQNRFS  ADEQPLS LPP++ D
Subjt:  MSHRNHDDEGDVELPVSK---EDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSSVAGDANPSSDSDDVDDFELLRIIQNRFSCVADEQPLSTLPPMSLD

Query:  EEEDEFEMLRSIQRRFAAYESNTLSNKPNQSCDHVGSLKMDFDDTAVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEE
        EEED+FE LRSIQRRFAAYES+ LSNKP+QSCD  G LKMD D+T VE  T+S+R SM+AFEKGSLPKAALAFIDAIKKNRSQQKF+RSKMIHLEARIEE
Subjt:  EEEDEFEMLRSIQRRFAAYESNTLSNKPNQSCDHVGSLKMDFDDTAVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEE

Query:  NKKLRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKP-QAKDSSKKDKRLSAMHYGPAENSHVACYRMALMKFPRVDRKKWSVVERENLGKGIRQ
        NKKLRKRFK+LK FQGSC+R+TTCALSQM+DPRVQLISA KP QAKDSSKKDKRLSAM YGPAENSHVACYR+AL KF  VDRK+WS  ERENLGKGIRQ
Subjt:  NKKLRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKP-QAKDSSKKDKRLSAMHYGPAENSHVACYRMALMKFPRVDRKKWSVVERENLGKGIRQ

Query:  QFQEMVLQISVDQISGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTNEDKNLLFTIQQ
        QFQEMVLQISVDQIS +QGFSA+SDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYL+GRSGAECEARWLNFEDPLINR+PWTT+EDKNLLFTIQQ
Subjt:  QFQEMVLQISVDQISGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTNEDKNLLFTIQQ

Query:  KGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRL
        KGLNNWI++AVS GTNRTPFQ LSRYQRSLNASILK EWTKDEDDKLR+AVA FG GDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRG+FTPDEDSRL
Subjt:  KGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRL

Query:  KIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQ
        KIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDN+C RRWKKLFPN+VPLLQEARKIQ
Subjt:  KIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQ

Query:  KAALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVSSNEKSATGDALKKRKSNCQRNQADETAQAGIANNTSSVPEEVKSLK
        K ALISNFVDRESERPALGPTDFRP PN+ LLCN+ DP  APKRNV+TR+MPVS NEKSA GDA KKRKSN QRN+ DETAQ   A+NTSSVP EVKS K
Subjt:  KAALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVSSNEKSATGDALKKRKSNCQRNQADETAQAGIANNTSSVPEEVKSLK

Query:  PQRKQNRHGAYTAESIGVPELCPNSEWCAKQSVDTQSLGVQLNSKE-SEGTNSDCIETVDENGMEVFENKVAEKLSERDVCFSEPEAIQNSTGSSGVSVL
        PQRK+ RHGAYT    G P++  NSE CA+Q+ DT++L VQLN KE +E  NSDC ETVDENGMEVFENK AE  SE  VCFSE E  QNSTGSSGVSVL
Subjt:  PQRKQNRHGAYTAESIGVPELCPNSEWCAKQSVDTQSLGVQLNSKE-SEGTNSDCIETVDENGMEVFENKVAEKLSERDVCFSEPEAIQNSTGSSGVSVL

Query:  SEMTNDMDEYNPSILPDTTLLASTTVDDLKELKRKSVADRDLDDCNSFSLPCSDLELRTIDNEGVDSYSMDEFTDKSHGVCKPPHTRRKKNSKTSSKSQD
        SEMTNDMDEYNPS LPDTTLLAS T DD+ E K  +VAD+DLD  NSFSLP S LELRT D+EGVDSYS+DEFTDKSH VCK P  RRKKNSK S+KSQD
Subjt:  SEMTNDMDEYNPSILPDTTLLASTTVDDLKELKRKSVADRDLDDCNSFSLPCSDLELRTIDNEGVDSYSMDEFTDKSHGVCKPPHTRRKKNSKTSSKSQD

Query:  HSFVSCQQVELERS-PNEPRRHNQSKKRKHNSTNTN----LEAVEEVDDCTLVGFLQKRLKR-TTTNDKKVDCSSSTPLEVDTDDNDPTLASFL-NKLKR
         S VSCQQ ELE S  NE  R NQ KKRKH+STNT+    +EAVEEVDDCTL+GFLQKRLKR TTT+ KKVD SSST  EVD DDNDPTLA  L  KLKR
Subjt:  HSFVSCQQVELERS-PNEPRRHNQSKKRKHNSTNTN----LEAVEEVDDCTLVGFLQKRLKR-TTTNDKKVDCSSSTPLEVDTDDNDPTLASFL-NKLKR

Query:  KKH
        KKH
Subjt:  KKH

A0A6J1JKV7 uncharacterized protein LOC111485355 isoform X10.0e+0081.16Show/hide
Query:  MSHRNHDDEGDVELPVSK---EDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSSVAGDANPSSDSDDVDDFELLRIIQNRFSCVADEQPLSTLPPMSLD
        MS R+H D GD ELP S+   EDD+VD+DME LRRA +  GVN EDY+NP+LS   AGDAN  SDSDDVDD ELLR IQNRFS  ADEQPLS LPP++ D
Subjt:  MSHRNHDDEGDVELPVSK---EDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSSVAGDANPSSDSDDVDDFELLRIIQNRFSCVADEQPLSTLPPMSLD

Query:  EEEDEFEMLRSIQRRFAAYESNTLSNKPNQSCDHVGSLKMDFDDTAVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEE
        EEED+FE LRSIQRRFAAYES+ LSNKP+QSCD  G LKMD D+T VE  T+S+R SM+AFEKGSLPKAALAFIDAIKKNRSQQKF+RSKMIHLEARIEE
Subjt:  EEEDEFEMLRSIQRRFAAYESNTLSNKPNQSCDHVGSLKMDFDDTAVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEE

Query:  NKKLRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKP-QAKDSSKKDKRLSAMHYGPAENSHVACYRMALMKFPRVDRKKWSVVERENLGKGIRQ
        NKKLRKRFK+LK FQGSC+R+TTCALSQM+DPRVQLISA KP QAKDSSKKDKRLSAM YGPAENSHVACYR+AL KF  VDRK+WS  ERENLGKGIRQ
Subjt:  NKKLRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKP-QAKDSSKKDKRLSAMHYGPAENSHVACYRMALMKFPRVDRKKWSVVERENLGKGIRQ

Query:  QFQEMVLQISVDQISGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTNEDKNLLFTIQQ
        QFQEMVLQISVDQIS +QGFSA+SDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYL+GRSGAECEARWLNFEDPLINR+PWTT+EDKNLLFTIQQ
Subjt:  QFQEMVLQISVDQISGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTNEDKNLLFTIQQ

Query:  KGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRL
        KGLNNWI++AVS GTNRTPFQ LSRYQRSLNASILK EWTKDEDDKLR+AVA FG GDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRG+FTPDEDSRL
Subjt:  KGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRL

Query:  KIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQ
        KIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDN+CRRRWKKLFPN+VPLLQEARKIQ
Subjt:  KIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQ

Query:  KAALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVSSNEKSATGDALKKRKSNCQRNQADETAQAGIANNTSSVPEEVKSLK
        K ALISNFVDRESERPALGPTDFRP PN+ LLCN+ DP  APKRNV+TR+MPVS NEKSA GDA KKRKSN QRN+ DETAQ   A+NTSSVP EVKS K
Subjt:  KAALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVSSNEKSATGDALKKRKSNCQRNQADETAQAGIANNTSSVPEEVKSLK

Query:  PQRKQNRHGAYTAESIGVPELCPNSEWCAKQSVDTQSLGVQLNSKE-SEGTNSDCIETVDENGMEVFENKVAEKLSERDVCFSEPEAIQNSTGSSGVSVL
        PQRK+ RHGAYT    G P++  NSE CA+Q+ DT++L VQLN KE +E  NSDC ETVDENGMEVFENK AE  SE  VCFSE E  QNSTGSSGVSVL
Subjt:  PQRKQNRHGAYTAESIGVPELCPNSEWCAKQSVDTQSLGVQLNSKE-SEGTNSDCIETVDENGMEVFENKVAEKLSERDVCFSEPEAIQNSTGSSGVSVL

Query:  SEMTNDMDEYNPSILPDTTLLASTTVDDLKELKRKSVADRDLDDCNSFSLPCSDLELRTIDNEGVDSYSMDEFTDKSHGVCKPPHTRRKKNSKTSSKSQD
        SEMTNDMDEYNPS LPDTTLLAS T DD+ E K  +VAD+DLD  NSFSLP S LELRT D+EGVDSYS+DEFTDKSH VCK P  RRKKNSK S+KSQD
Subjt:  SEMTNDMDEYNPSILPDTTLLASTTVDDLKELKRKSVADRDLDDCNSFSLPCSDLELRTIDNEGVDSYSMDEFTDKSHGVCKPPHTRRKKNSKTSSKSQD

Query:  HSFVSCQQVELERS-PNEPRRHNQSKKRKHNSTNTN----LEAVEEVDDCTLVGFLQKRLKR-TTTNDKKVDCSSSTPLEVDTDDNDPTLASFL-NKLKR
         S VSCQQ ELE S  NE  R NQ KKRKH+STNT+    +EAVEEVDDCTL+GFLQKRLKR TTT+ KKVD SSST  EVD DDNDPTLA  L  KLKR
Subjt:  HSFVSCQQVELERS-PNEPRRHNQSKKRKHNSTNTN----LEAVEEVDDCTLVGFLQKRLKR-TTTNDKKVDCSSSTPLEVDTDDNDPTLASFL-NKLKR

Query:  KKH
        KKH
Subjt:  KKH

SwissProt top hitse value%identityAlignment
P10242 Transcriptional activator Myb1.0e-3139.77Show/hide
Query:  KREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCL
        K  WT++ED+KL+  V   G  DW+ +A+ L  RT  QC +RW+K L+P   K G +T +ED R+   V  +GPK W+  A+ L GR   QCRERW N L
Subjt:  KREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCL

Query:  DPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNDCRRRWKKLFPNEVP---LLQEARKIQKAALISNF
        +P +++  WTEEED  +  A +  G  WA++A  +P RTDN  +  W      +V     LQE+ K  + A+ ++F
Subjt:  DPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNDCRRRWKKLFPNEVP---LLQEARKIQKAALISNF

P46200 Transcriptional activator Myb1.0e-3139.77Show/hide
Query:  KREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCL
        K  WT++ED+KL+  V   G  DW+ +A+ L  RT  QC +RW+K L+P   K G +T +ED R+   V  +GPK W+  A+ L GR   QCRERW N L
Subjt:  KREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCL

Query:  DPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNDCRRRWKKLFPNEVP---LLQEARKIQKAALISNF
        +P +++  WTEEED  +  A +  G  WA++A  +P RTDN  +  W      +V     LQE+ K  + A+ ++F
Subjt:  DPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNDCRRRWKKLFPNEVP---LLQEARKIQKAALISNF

Q54NA6 Myb-like protein L6.0e-5633.79Show/hide
Query:  HYGPAENSHVACYRMALMKFPRVDR-KKWSVVERENLGKGIRQQ-FQEMVLQISVDQIS---------GLQGFSADSDDLDNILASIKDLDITPEKIREF
        HY PA+N      R+     P   + ++W+  E E L KGI+++  Q+ + ++S D++S          +Q  S ++++ +N + +  + +   +K    
Subjt:  HYGPAENSHVACYRMALMKFPRVDR-KKWSVVERENLGKGIRQQ-FQEMVLQISVDQIS---------GLQGFSADSDDLDNILASIKDLDITPEKIREF

Query:  LPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTNEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKL
         P +       +    RS  E   RW N +DP IN+ P+T  EDK LL   ++   + W +I++  GTNRTP  C+ RYQRSLN+ ++KREWTK+ED+ L
Subjt:  LPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTNEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKL

Query:  RAAVATFGLG---DWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEW
           +     G   DWQ +   + GRTG QC +RW K+LDP+  K+G ++P+ED  L  AV  +G  NW      + GR  VQCRER+ N LDP L +  W
Subjt:  RAAVATFGLG---DWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEW

Query:  TEEEDLRLEIAIQEHGY-SWAKVAACVPSRTDNDCRRRWKKL--FPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPA
        T +ED RL     + G   W+ VA  + +RTDN C RRWK+L    N +   QE    +K   +SNF  R+ ER  L   D             ++ +  
Subjt:  TEEEDLRLEIAIQEHGY-SWAKVAACVPSRTDNDCRRRWKKL--FPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPA

Query:  PKRNVKTRKMPVSSNEKSATGDALKKRKSNCQRNQ
        PK N KT+ +  +S   ++T       K++   NQ
Subjt:  PKRNVKTRKMPVSSNEKSATGDALKKRKSNCQRNQ

Q5SXM2 snRNA-activating protein complex subunit 45.1e-3927.33Show/hide
Query:  IKKNRSQQKFIRSKMIHLEARIEENKKLRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSAMHYGPAENSHV-ACYRMALM
        ++ N   Q+ I+ K+      + +N++ ++  ++++D  GS  + T     + + P   +    KP  KD             GP  N          + 
Subjt:  IKKNRSQQKFIRSKMIHLEARIEENKKLRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSAMHYGPAENSHV-ACYRMALM

Query:  KFPRVDRKKWSVVERENLGKGIRQQFQEMVLQISVDQISGL-QGFSADSDDLD---------NILASIKDLDITPEK--IREFLPKVNWDKLASMYLQG-
         F  +   KW   E+  L K +     + +LQ  + ++  L Q  S  S +L+              I+D++  PE+  +   L   +W+K++++  +G 
Subjt:  KFPRVDRKKWSVVERENLGKGIRQQFQEMVLQISVDQISGL-QGFSADSDDLD---------NILASIKDLDITPEK--IREFLPKVNWDKLASMYLQG-

Query:  RSGAECEARWLNFEDPLINRDPWTTNEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGD---WQ
        RS  E    W N E P IN+  W+  E++ L       G   W +IA   GT+R+ FQCL ++Q+  N ++ ++EWT++ED  L   V    +G    ++
Subjt:  RSGAECEARWLNFEDPLINRDPWTTNEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGD---WQ

Query:  AVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHG
         +   +EGR   Q   RW KSLDP   K+G++ P+ED++L  AV  +G ++W K  E +PGR+  QCR+R+   L  SL++  W  +E+ +L   I+++G
Subjt:  AVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHG

Query:  YS-WAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARK
           WAK+A+ +P R+ + C  +WK +   +  L +  R+
Subjt:  YS-WAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARK

Q8BP86 snRNA-activating protein complex subunit 45.7e-3833.59Show/hide
Query:  IKDLDITPEK--IREFLPKVNWDKLASMYLQG-RSGAECEARWLNFEDPLINRDPWTTNEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSL
        I+D++  PE+  +   L   +W+K++++  +G RS  E    W + E P I++  W+T E + L       G   W  +A   GT+R+ FQCL ++Q+  
Subjt:  IKDLDITPEK--IREFLPKVNWDKLASMYLQG-RSGAECEARWLNFEDPLINRDPWTTNEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSL

Query:  NASILKREWTKDEDDKLRAAVATFGLGD---WQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQC
        N ++ ++EWT++ED  L   V    +G+   ++ +   +EGR   Q   RW KSLDP+  KRG + P+ED++L  AV  +G ++W K  E +PGR+  QC
Subjt:  NASILKREWTKDEDDKLRAAVATFGLGD---WQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQC

Query:  RERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYS-WAKVAACVPSRTDNDCRRRWKKL
        R+R+   L  SL++  W  +E+ +L   I+++G   WA++A+ +P R+ + C  +WK L
Subjt:  RERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYS-WAKVAACVPSRTDNDCRRRWKKL

Arabidopsis top hitse value%identityAlignment
AT3G09370.1 myb domain protein 3r-32.9e-2942.18Show/hide
Query:  KREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCL
        K  WT +ED+ LR AV TF    W+ +A +   RT  QC +RW+K L+P   K G +T +ED ++   V  +GP  W+  A+ LPGR   QCRERW N L
Subjt:  KREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCL

Query:  DPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNDCRRRW
        +P + +  WT EE++ L  A + HG  WA++A  +P RTDN  +  W
Subjt:  DPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNDCRRRW

AT3G09370.2 myb domain protein 3r-32.9e-2942.18Show/hide
Query:  KREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCL
        K  WT +ED+ LR AV TF    W+ +A +   RT  QC +RW+K L+P   K G +T +ED ++   V  +GP  W+  A+ LPGR   QCRERW N L
Subjt:  KREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCL

Query:  DPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNDCRRRW
        +P + +  WT EE++ L  A + HG  WA++A  +P RTDN  +  W
Subjt:  DPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNDCRRRW

AT3G18100.1 myb domain protein 4r13.2e-16144.28Show/hide
Query:  SSMSHRNHDDEGDVELPVSKEDDVVDEDMEALRRAYKHVGVNPEDYINP--RLSSSVAGDANPSSDSDDVDDFELLRIIQNRFSCVAD------------
        +S+   + DD+ D       E+D + ED+E LRRA     VN + + +    +     G     SDS++ DDFE+LR I+++ +   D            
Subjt:  SSMSHRNHDDEGDVELPVSKEDDVVDEDMEALRRAYKHVGVNPEDYINP--RLSSSVAGDANPSSDSDDVDDFELLRIIQNRFSCVAD------------

Query:  -----------------EQPLS-----TLPPMSL--DEEEDEFEMLRSIQRRFAAYESNTLSNKPNQSCDHVGSLKMDFDDTAVESQTASKRPSMLAF--
                         +  LS     +LPP+ L  DEE+D FE LR+I+RRF+AY+ N  S     +  H    ++   D    S+  S+  +  +F  
Subjt:  -----------------EQPLS-----TLPPMSL--DEEEDEFEMLRSIQRRFAAYESNTLSNKPNQSCDHVGSLKMDFDDTAVESQTASKRPSMLAF--

Query:  ---------------------EKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAA
                                S P+AA AF+DAI++NR+ QKF+R K+  +EA IE+N+K +K  +I+KDFQ SCKR T  AL Q  DPRV+LIS  
Subjt:  ---------------------EKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAA

Query:  KPQAKDSSK----------KDKRLSAMHYGPAENSHVACYRMALMKFP-RVDRKKWSVVERENLGKGIRQQFQEMVLQISVDQISGLQGFSADSDDLDNI
        K    DSS+           DK++S +  GPAEN  V  YRMAL K+P  V R+KWS  E +NL KG++Q+ Q+++L  ++++ S L+G +    D+D I
Subjt:  KPQAKDSSK----------KDKRLSAMHYGPAENSHVACYRMALMKFP-RVDRKKWSVVERENLGKGIRQQFQEMVLQISVDQISGLQGFSADSDDLDNI

Query:  LASIKDLDITPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTNEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSL
          SI +L+ITPE IR+FLPK+NWD   S+ ++ RS AECEARW++ EDPLIN  PWT  EDKNLL TI+Q  L +W++IAVS GTNRTPFQCL+RYQRSL
Subjt:  LASIKDLDITPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRDPWTTNEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSL

Query:  NASILKREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRER
        N SILK+EWT +EDD+LR AV  FG  DWQ+VA+ L+GRTG QCSNRWKKSL P  T++G ++ +ED R+K+AV LFG +NW+K ++F+PGR Q QCRER
Subjt:  NASILKREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRER

Query:  WFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTD
        W NCLDP + R +WTEEED +L  AI EHGYSW+KVA  +  RTDN C RRWK+L+P++V LLQEAR++QK A + NFVDRESERPAL  +     P+  
Subjt:  WFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTD

Query:  LLCNSVDPRPAPKRNVKTRKMPVSSNEKSATGDALKKRKSNCQRN---QADETAQAGIANN
            S++P P      K RK     ++        +K   NC  +   Q +ET      NN
Subjt:  LLCNSVDPRPAPKRNVKTRKMPVSSNEKSATGDALKKRKSNCQRN---QADETAQAGIANN

AT3G18100.2 myb domain protein 4r11.0e-15152.7Show/hide
Query:  SLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSK----------KDKRL
        S P+AA AF+DAI++NR+ QKF+R K+  +EA IE+N+K +K  +I+KDFQ SCKR T  AL Q  DPRV+LIS  K    DSS+           DK++
Subjt:  SLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSK----------KDKRL

Query:  SAMHYGPAENSHVACYRMALMKFP-RVDRKKWSVVERENLGKGIRQQFQEMVLQISVDQISGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWD
        S +  GPAEN  V  YRMAL K+P  V R+KWS  E +NL KG++Q+ Q+++L  ++++ S L+G +    D+D I  SI +L+ITPE IR+FLPK+NWD
Subjt:  SAMHYGPAENSHVACYRMALMKFP-RVDRKKWSVVERENLGKGIRQQFQEMVLQISVDQISGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWD

Query:  KLASMYLQGRSGAECEARWLNFEDPLINRDPWTTNEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATF
           S+ ++ RS AECEARW++ EDPLIN  PWT  EDKNLL TI+Q  L +W++IAVS GTNRTPFQCL+RYQRSLN SILK+EWT +EDD+LR AV  F
Subjt:  KLASMYLQGRSGAECEARWLNFEDPLINRDPWTTNEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATF

Query:  GLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEI
        G  DWQ+VA+ L+GRTG QCSNRWKKSL P  T++G ++ +ED R+K+AV LFG +NW+K ++F+PGR Q QCRERW NCLDP + R +WTEEED +L  
Subjt:  GLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEI

Query:  AIQEHGYSWAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVS
        AI EHGYSW+KVA  +  RTDN C RRWK+L+P++V LLQEAR++QK A + NFVDRESERPAL  +     P+      S++P P      K RK    
Subjt:  AIQEHGYSWAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVS

Query:  SNEKSATGDALKKRKSNCQRN---QADETAQAGIANN
         ++        +K   NC  +   Q +ET      NN
Subjt:  SNEKSATGDALKKRKSNCQRN---QADETAQAGIANN

AT3G18100.3 myb domain protein 4r11.5e-14243.31Show/hide
Query:  SSMSHRNHDDEGDVELPVSKEDDVVDEDMEALRRAYKHVGVNPEDYINP--RLSSSVAGDANPSSDSDDVDDFELLRIIQNRFSCVADEQPLSTLPPMSL
        +S+   + DD+ D       E+D + ED+E LRRA     VN + + +    +     G     SDS++ DDFE+LR I+++ +   D    S+ PPM L
Subjt:  SSMSHRNHDDEGDVELPVSKEDDVVDEDMEALRRAYKHVGVNPEDYINP--RLSSSVAGDANPSSDSDDVDDFELLRIIQNRFSCVADEQPLSTLPPMSL

Query:  -----DEEEDEFEMLRSIQRRFAAYESNTLSNKPNQSCDHVGSLKMDFDDTAVESQTASKR-----PSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRS
              E ED+FEM+RSI+        + LS   + S   +G L  D +D A E+  A +R      +   F   S  K      + +   ++Q+ +   
Subjt:  -----DEEEDEFEMLRSIQRRFAAYESNTLSNKPNQSCDHVGSLKMDFDDTAVESQTASKR-----PSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRS

Query:  KMIHLEARIEENKKLRKRF-----------KILKDFQGSCKRRT--TCALSQMIDPRVQLISAAKPQAKDSSK----------KDKRLSAMHYGPAENSH
        KM+  + R++ + KL +R              L++     K+R+  T    +M DPRV+LIS  K    DSS+           DK++S +  GPAEN  
Subjt:  KMIHLEARIEENKKLRKRF-----------KILKDFQGSCKRRT--TCALSQMIDPRVQLISAAKPQAKDSSK----------KDKRLSAMHYGPAENSH

Query:  VACYRMALMKFP-RVDRKKWSVVERENLGKGIRQQFQEMVLQISVDQISGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLQGRSG
        V  YRMAL K+P  V R+KWS  E +NL KG++Q+ Q+++L  ++++ S L+G +    D+D I  SI +L+ITPE IR+FLPK+NWD   S+ ++ RS 
Subjt:  VACYRMALMKFP-RVDRKKWSVVERENLGKGIRQQFQEMVLQISVDQISGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLQGRSG

Query:  AECEARWLNFEDPLINRDPWTTNEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGDWQAVASTL
        AECEARW++ EDPLIN  PWT  EDKNLL TI+Q  L +W++IAVS GTNRTPFQCL+RYQRSLN SILK+EWT +EDD+LR AV  FG  DWQ+VA+ L
Subjt:  AECEARWLNFEDPLINRDPWTTNEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGDWQAVASTL

Query:  EGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKV
        +GRTG QCSNRWKKSL P  T++G ++ +ED R+K+AV LFG +NW+K ++F+PGR Q QCRERW NCLDP + R +WTEEED +L  AI EHGYSW+KV
Subjt:  EGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKV

Query:  AACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVSSNEKSATGDALK
        A  +  RTDN C RRWK+L+P++V LLQEAR++QK A + NFVDRESERPAL  +     P+      S++P P      K RK     ++        +
Subjt:  AACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVSSNEKSATGDALK

Query:  KRKSNCQRN---QADETAQAGIANN
        K   NC  +   Q +ET      NN
Subjt:  KRKSNCQRN---QADETAQAGIANN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGGCGGCAGTGGCAGAAAACGACACGATAGTGGCTAGTGGAGGATTTGGACGCCCATACGACACCACTGGCCTTGCCCAACTTACGACTGTGAGTTTCAAGCAAGC
GGCAACACTAAGGCGACGGCAAACAACCCACCTACAGATGCAAGTACGTTGTGCTCATTCATGTTCATCCATGTCTCACCGCAACCATGACGATGAAGGTGACGTTGAGC
TTCCTGTCAGCAAGGAAGATGATGTGGTTGATGAGGACATGGAAGCCCTTCGGAGAGCCTATAAGCATGTTGGAGTTAATCCTGAGGATTACATTAATCCTAGGTTGTCA
TCATCTGTTGCAGGAGATGCTAATCCTAGTTCTGATTCTGATGATGTTGATGATTTTGAACTTCTTCGAATTATTCAGAACCGGTTCTCATGTGTGGCTGATGAGCAGCC
GTTGAGTACTCTCCCACCAATGTCCCTAGATGAGGAGGAAGATGAATTTGAGATGCTTCGTTCAATTCAGCGGCGCTTTGCAGCGTACGAAAGTAATACTTTGAGCAATA
AACCCAATCAGTCGTGTGACCATGTTGGGTCTTTGAAGATGGATTTTGATGACACAGCTGTTGAGAGTCAGACAGCCTCAAAAAGGCCATCCATGCTAGCCTTTGAAAAG
GGAAGCTTGCCAAAAGCTGCATTGGCATTTATTGATGCCATCAAGAAAAATAGGTCCCAGCAGAAGTTTATTCGTAGTAAGATGATTCATCTTGAAGCTAGAATTGAGGA
GAACAAAAAGCTCAGAAAACGTTTCAAAATACTCAAAGATTTCCAGGGTTCATGTAAACGGAGAACAACCTGTGCACTGTCTCAAATGATAGATCCTCGAGTCCAGTTGA
TTTCAGCTGCAAAACCACAAGCAAAGGATTCATCAAAGAAGGACAAACGATTATCTGCAATGCATTATGGCCCAGCTGAGAATTCTCATGTTGCATGCTACAGAATGGCA
TTGATGAAGTTTCCTCGTGTAGATCGAAAAAAATGGTCCGTTGTAGAAAGGGAGAATCTAGGGAAGGGAATAAGACAGCAATTTCAGGAGATGGTGCTTCAGATTTCAGT
GGATCAAATTAGTGGGCTACAAGGATTTTCTGCAGATTCAGATGATTTGGATAACATTCTTGCATCAATAAAAGACCTTGACATCACTCCTGAAAAGATTAGGGAATTTC
TGCCAAAAGTTAATTGGGACAAGTTGGCTTCCATGTATCTTCAGGGTCGCTCAGGGGCAGAATGTGAAGCAAGGTGGTTGAATTTCGAAGACCCCCTAATTAATCGGGAT
CCATGGACTACAAATGAGGATAAGAATCTTTTGTTTACTATCCAACAGAAAGGGTTGAATAACTGGATTGAGATAGCAGTTTCGTCGGGTACAAATAGAACTCCTTTTCA
GTGCTTGTCTCGGTATCAAAGGAGTTTAAACGCTTCCATATTAAAGAGGGAGTGGACCAAAGACGAGGATGATAAACTTCGAGCTGCTGTTGCTACATTTGGCCTGGGAG
ATTGGCAGGCAGTAGCTTCTACTTTGGAAGGACGAACTGGCCCACAGTGCTCTAATAGGTGGAAAAAATCCCTTGACCCAGCTAGGACAAAAAGAGGTCATTTCACTCCA
GATGAAGACAGTCGCTTGAAAATTGCTGTACTGCTCTTTGGGCCTAAAAATTGGAACAAGAAAGCAGAATTTTTACCTGGTCGAAATCAAGTTCAATGCAGAGAAAGATG
GTTTAATTGTTTAGATCCTTCCTTGAGAAGATGTGAATGGACAGAAGAGGAGGATTTAAGGCTGGAGATAGCAATTCAGGAACATGGATATAGCTGGGCTAAGGTAGCTG
CATGTGTGCCGTCACGTACAGATAATGATTGTCGGAGGAGATGGAAGAAGTTATTTCCCAATGAAGTTCCGTTACTCCAGGAAGCTAGAAAGATTCAGAAGGCTGCTCTT
ATTAGCAACTTTGTTGATAGGGAATCAGAGCGTCCTGCTCTTGGTCCTACTGACTTTCGACCCAGGCCAAACACAGATTTATTGTGTAATAGTGTTGATCCAAGACCTGC
TCCCAAAAGAAATGTGAAGACGAGAAAGATGCCAGTGTCAAGCAATGAAAAGAGTGCTACTGGTGATGCTCTAAAGAAGAGGAAATCAAATTGCCAGAGGAATCAGGCTG
ATGAAACTGCTCAGGCGGGTATTGCAAATAATACCTCTTCTGTCCCAGAGGAGGTTAAATCTCTAAAGCCTCAAAGAAAACAAAATAGACATGGAGCTTATACCGCAGAA
AGTATAGGGGTTCCGGAGCTATGTCCTAACAGTGAGTGGTGTGCTAAACAGAGTGTGGACACTCAGAGCCTTGGGGTGCAGCTTAATAGTAAGGAATCTGAGGGGACCAA
CAGCGACTGCATCGAGACTGTTGATGAGAATGGTATGGAGGTATTTGAGAACAAAGTTGCAGAGAAACTTTCTGAAAGAGATGTATGTTTTTCAGAACCAGAAGCAATTC
AAAACTCTACCGGATCTTCTGGAGTCTCGGTATTGTCAGAAATGACTAATGACATGGATGAATACAATCCGTCTATCCTTCCAGATACAACATTGTTGGCTAGTACTACT
GTGGATGATCTCAAAGAATTGAAGAGGAAGAGTGTTGCAGACAGAGATCTGGATGACTGTAACAGTTTCTCGTTACCGTGCAGTGACTTGGAACTCAGGACAATCGACAA
TGAAGGTGTGGACAGTTATTCCATGGATGAATTTACAGATAAAAGCCATGGGGTTTGCAAGCCCCCCCATACGAGAAGGAAGAAAAACAGCAAAACATCAAGTAAGAGCC
AGGATCATTCCTTTGTTTCTTGTCAACAAGTGGAGTTGGAGAGGTCCCCAAATGAGCCTCGTCGTCATAATCAATCAAAGAAGAGAAAACATAACAGTACAAATACGAAT
CTGGAGGCAGTTGAGGAGGTTGATGACTGCACACTCGTGGGCTTTCTGCAAAAAAGATTGAAGAGGACGACAACAAATGACAAGAAAGTTGATTGCAGTTCAAGTACTCC
CCTAGAAGTCGATACTGATGACAATGATCCTACCCTTGCCTCGTTTCTTAATAAATTAAAGAGAAAAAAGCATCAGCCGAGTAGTGGTGGTGAGTTAAACTAG
mRNA sequenceShow/hide mRNA sequence
ATGAAGGCGGCAGTGGCAGAAAACGACACGATAGTGGCTAGTGGAGGATTTGGACGCCCATACGACACCACTGGCCTTGCCCAACTTACGACTGTGAGTTTCAAGCAAGC
GGCAACACTAAGGCGACGGCAAACAACCCACCTACAGATGCAAGTACGTTGTGCTCATTCATGTTCATCCATGTCTCACCGCAACCATGACGATGAAGGTGACGTTGAGC
TTCCTGTCAGCAAGGAAGATGATGTGGTTGATGAGGACATGGAAGCCCTTCGGAGAGCCTATAAGCATGTTGGAGTTAATCCTGAGGATTACATTAATCCTAGGTTGTCA
TCATCTGTTGCAGGAGATGCTAATCCTAGTTCTGATTCTGATGATGTTGATGATTTTGAACTTCTTCGAATTATTCAGAACCGGTTCTCATGTGTGGCTGATGAGCAGCC
GTTGAGTACTCTCCCACCAATGTCCCTAGATGAGGAGGAAGATGAATTTGAGATGCTTCGTTCAATTCAGCGGCGCTTTGCAGCGTACGAAAGTAATACTTTGAGCAATA
AACCCAATCAGTCGTGTGACCATGTTGGGTCTTTGAAGATGGATTTTGATGACACAGCTGTTGAGAGTCAGACAGCCTCAAAAAGGCCATCCATGCTAGCCTTTGAAAAG
GGAAGCTTGCCAAAAGCTGCATTGGCATTTATTGATGCCATCAAGAAAAATAGGTCCCAGCAGAAGTTTATTCGTAGTAAGATGATTCATCTTGAAGCTAGAATTGAGGA
GAACAAAAAGCTCAGAAAACGTTTCAAAATACTCAAAGATTTCCAGGGTTCATGTAAACGGAGAACAACCTGTGCACTGTCTCAAATGATAGATCCTCGAGTCCAGTTGA
TTTCAGCTGCAAAACCACAAGCAAAGGATTCATCAAAGAAGGACAAACGATTATCTGCAATGCATTATGGCCCAGCTGAGAATTCTCATGTTGCATGCTACAGAATGGCA
TTGATGAAGTTTCCTCGTGTAGATCGAAAAAAATGGTCCGTTGTAGAAAGGGAGAATCTAGGGAAGGGAATAAGACAGCAATTTCAGGAGATGGTGCTTCAGATTTCAGT
GGATCAAATTAGTGGGCTACAAGGATTTTCTGCAGATTCAGATGATTTGGATAACATTCTTGCATCAATAAAAGACCTTGACATCACTCCTGAAAAGATTAGGGAATTTC
TGCCAAAAGTTAATTGGGACAAGTTGGCTTCCATGTATCTTCAGGGTCGCTCAGGGGCAGAATGTGAAGCAAGGTGGTTGAATTTCGAAGACCCCCTAATTAATCGGGAT
CCATGGACTACAAATGAGGATAAGAATCTTTTGTTTACTATCCAACAGAAAGGGTTGAATAACTGGATTGAGATAGCAGTTTCGTCGGGTACAAATAGAACTCCTTTTCA
GTGCTTGTCTCGGTATCAAAGGAGTTTAAACGCTTCCATATTAAAGAGGGAGTGGACCAAAGACGAGGATGATAAACTTCGAGCTGCTGTTGCTACATTTGGCCTGGGAG
ATTGGCAGGCAGTAGCTTCTACTTTGGAAGGACGAACTGGCCCACAGTGCTCTAATAGGTGGAAAAAATCCCTTGACCCAGCTAGGACAAAAAGAGGTCATTTCACTCCA
GATGAAGACAGTCGCTTGAAAATTGCTGTACTGCTCTTTGGGCCTAAAAATTGGAACAAGAAAGCAGAATTTTTACCTGGTCGAAATCAAGTTCAATGCAGAGAAAGATG
GTTTAATTGTTTAGATCCTTCCTTGAGAAGATGTGAATGGACAGAAGAGGAGGATTTAAGGCTGGAGATAGCAATTCAGGAACATGGATATAGCTGGGCTAAGGTAGCTG
CATGTGTGCCGTCACGTACAGATAATGATTGTCGGAGGAGATGGAAGAAGTTATTTCCCAATGAAGTTCCGTTACTCCAGGAAGCTAGAAAGATTCAGAAGGCTGCTCTT
ATTAGCAACTTTGTTGATAGGGAATCAGAGCGTCCTGCTCTTGGTCCTACTGACTTTCGACCCAGGCCAAACACAGATTTATTGTGTAATAGTGTTGATCCAAGACCTGC
TCCCAAAAGAAATGTGAAGACGAGAAAGATGCCAGTGTCAAGCAATGAAAAGAGTGCTACTGGTGATGCTCTAAAGAAGAGGAAATCAAATTGCCAGAGGAATCAGGCTG
ATGAAACTGCTCAGGCGGGTATTGCAAATAATACCTCTTCTGTCCCAGAGGAGGTTAAATCTCTAAAGCCTCAAAGAAAACAAAATAGACATGGAGCTTATACCGCAGAA
AGTATAGGGGTTCCGGAGCTATGTCCTAACAGTGAGTGGTGTGCTAAACAGAGTGTGGACACTCAGAGCCTTGGGGTGCAGCTTAATAGTAAGGAATCTGAGGGGACCAA
CAGCGACTGCATCGAGACTGTTGATGAGAATGGTATGGAGGTATTTGAGAACAAAGTTGCAGAGAAACTTTCTGAAAGAGATGTATGTTTTTCAGAACCAGAAGCAATTC
AAAACTCTACCGGATCTTCTGGAGTCTCGGTATTGTCAGAAATGACTAATGACATGGATGAATACAATCCGTCTATCCTTCCAGATACAACATTGTTGGCTAGTACTACT
GTGGATGATCTCAAAGAATTGAAGAGGAAGAGTGTTGCAGACAGAGATCTGGATGACTGTAACAGTTTCTCGTTACCGTGCAGTGACTTGGAACTCAGGACAATCGACAA
TGAAGGTGTGGACAGTTATTCCATGGATGAATTTACAGATAAAAGCCATGGGGTTTGCAAGCCCCCCCATACGAGAAGGAAGAAAAACAGCAAAACATCAAGTAAGAGCC
AGGATCATTCCTTTGTTTCTTGTCAACAAGTGGAGTTGGAGAGGTCCCCAAATGAGCCTCGTCGTCATAATCAATCAAAGAAGAGAAAACATAACAGTACAAATACGAAT
CTGGAGGCAGTTGAGGAGGTTGATGACTGCACACTCGTGGGCTTTCTGCAAAAAAGATTGAAGAGGACGACAACAAATGACAAGAAAGTTGATTGCAGTTCAAGTACTCC
CCTAGAAGTCGATACTGATGACAATGATCCTACCCTTGCCTCGTTTCTTAATAAATTAAAGAGAAAAAAGCATCAGCCGAGTAGTGGTGGTGAGTTAAACTAGGAAGGAT
GACATAGTTTTGTTGCATTTTTGGGATATCTGAAGCAATATCACATTGGATCTCCGAAGAAGGTTGATGGAAGGTTGTACAAATAAAGCTTCCAAGGCCGCAAAATGGCG
AAATGTCAACTATACGGTTGATTCTATGGTTGACAAATTTTTGTTTTGTGATTTAGATTAGAGGGCTTTGGCAATTTTCTTTTGTTATGGTTCATTTTCTTTTC
Protein sequenceShow/hide protein sequence
MKAAVAENDTIVASGGFGRPYDTTGLAQLTTVSFKQAATLRRRQTTHLQMQVRCAHSCSSMSHRNHDDEGDVELPVSKEDDVVDEDMEALRRAYKHVGVNPEDYINPRLS
SSVAGDANPSSDSDDVDDFELLRIIQNRFSCVADEQPLSTLPPMSLDEEEDEFEMLRSIQRRFAAYESNTLSNKPNQSCDHVGSLKMDFDDTAVESQTASKRPSMLAFEK
GSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSAMHYGPAENSHVACYRMA
LMKFPRVDRKKWSVVERENLGKGIRQQFQEMVLQISVDQISGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLQGRSGAECEARWLNFEDPLINRD
PWTTNEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTP
DEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAAL
ISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVSSNEKSATGDALKKRKSNCQRNQADETAQAGIANNTSSVPEEVKSLKPQRKQNRHGAYTAE
SIGVPELCPNSEWCAKQSVDTQSLGVQLNSKESEGTNSDCIETVDENGMEVFENKVAEKLSERDVCFSEPEAIQNSTGSSGVSVLSEMTNDMDEYNPSILPDTTLLASTT
VDDLKELKRKSVADRDLDDCNSFSLPCSDLELRTIDNEGVDSYSMDEFTDKSHGVCKPPHTRRKKNSKTSSKSQDHSFVSCQQVELERSPNEPRRHNQSKKRKHNSTNTN
LEAVEEVDDCTLVGFLQKRLKRTTTNDKKVDCSSSTPLEVDTDDNDPTLASFLNKLKRKKHQPSSGGELN