; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi07G005900 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi07G005900
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionsnRNA-activating protein complex subunit 4
Genome locationchr07:6257700..6271220
RNA-Seq ExpressionLsi07G005900
SyntenyLsi07G005900
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000978 - RNA polymerase II proximal promoter sequence-specific DNA binding (molecular function)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
InterPro domainsIPR001005 - SANT/Myb domain
IPR009057 - Homeobox-like domain superfamily
IPR017884 - SANT domain
IPR017930 - Myb domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_011650584.1 uncharacterized protein LOC101216287 [Cucumis sativus]0.0e+0071.61Show/hide
Query:  DTLSNKPNQSCDYVGSLKMDSDNTAVESQIASERPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRFKILKDFQGSCKRR
        DTLSNKPNQS DYVGSLK+DSD+ AVESQ +S+RPSMLAFEKGSLPKAALAF+DAIKKNRSQQKFIRSKMIHLEARIEENKKLRKR KILKDFQGSCKRR
Subjt:  DTLSNKPNQSCDYVGSLKMDSDNTAVESQIASERPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRFKILKDFQGSCKRR

Query:  TSCALSQMIDPRIQLMSAAKPQAKDSSKKPCPITFTTRSLFEQKDKRLSAMYYGPAENSHVACYRMALTKFPRVDRKKWSIVERENLGKGIRQQFQEMVL
        TSCALSQMIDPR+QL+SAAKPQAKDSSK               KDKRLS MYYGP ENSHVACYRM L KFP VDRKKWSIVERENLGKGIRQQFQEMVL
Subjt:  TSCALSQMIDPRIQLMSAAKPQAKDSSKKPCPITFTTRSLFEQKDKRLSAMYYGPAENSHVACYRMALTKFPRVDRKKWSIVERENLGKGIRQQFQEMVL

Query:  QISVDQIRIIFCTLPRSTGIGPSVKDDEIEDENDGLSQMKGSFTVCDPLIDEFELRSGEMLVQKRTLNFDQESTLTLKIRVKVKYDSSYHKISINSSWGS
        QISVDQI                                                                                             
Subjt:  QISVDQIRIIFCTLPRSTGIGPSVKDDEIEDENDGLSQMKGSFTVCDPLIDEFELRSGEMLVQKRTLNFDQESTLTLKIRVKVKYDSSYHKISINSSWGS

Query:  YCRSGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEARYFLTESILIFFEACQLQSASQSGLHYVCDDNRHRHLWLN
           SG QG S DSDDLDNILASIKDLDI P+KIREFLPKVNWDKLASMYL+GRSGAECEAR                                    WLN
Subjt:  YCRSGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEARYFLTESILIFFEACQLQSASQSGLHYVCDDNRHRHLWLN

Query:  FEDPLINRDPWITSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASIIKREWTKDEDDKLRSALAIFGLGDWQAVASTLEGRTGTQCS
        FEDPLINRDPW TSEDK+LLFTIQQKGLNNWIE+AVS GTNRTPFQCLSRYQRSLNASI+KREWTK+EDD+LRSA+A FG+ DWQAVASTLEGR GTQCS
Subjt:  FEDPLINRDPWITSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASIIKREWTKDEDDKLRSALAIFGLGDWQAVASTLEGRTGTQCS

Query:  NRLILFTISDQWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERCVKWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSW
        NR         WKKSLDPART++G+FTPDED RLKIAVLLFGPKNWNKKAEFLPGRNQVQCRER   WFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSW
Subjt:  NRLILFTISDQWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERCVKWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSW

Query:  AKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCDTDDPGPAPKRNVKTRKMPVSRNEKNVTGG
        AKVAACVPSRTDN+CRRRWKKLFP+EVPLLQEARKIQKAALISNFVDRE+ERPALGP DFRPRPNTD LC+TD P PAPKRNVKTRKMPVSRNEK+ TG 
Subjt:  AKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCDTDDPGPAPKRNVKTRKMPVSRNEKNVTGG

Query:  APKKKKSNCQRNQADTTAQVGIANNTSSVPEEVKSLKPQRKRNRHGAYTAKSIGVLELRPNNEWCAKQNLDTRTLGVQLNGKESEWTNSDCTETVDENGM
        APKK+KSN QR Q D TAQVGIA NTS VPEEV+S KPQRKRNR GAYTAK IGV ELR ++EWCAKQNLDT +LG+QLN KESE +NS+CTETVDEN M
Subjt:  APKKKKSNCQRNQADTTAQVGIANNTSSVPEEVKSLKPQRKRNRHGAYTAKSIGVLELRPNNEWCAKQNLDTRTLGVQLNGKESEWTNSDCTETVDENGM

Query:  EVFENKAAEKLSEKDACFSEPEENQNSTTGSSGVSVLSEMTNDMDEYNPSILPDTTLLASTTVDDIKELKGKSDAERNLDDSNSFSLPRSDLELRT-DSE
        EV ENK AEKL+E++ACFSEPE+NQNS TGSSGVSVLSEMTND+ +YNPSIL DTTL ASTTVDDI+ELKGKS A+R+LDDSNSFSL  S LELRT DSE
Subjt:  EVFENKAAEKLSEKDACFSEPEENQNSTTGSSGVSVLSEMTNDMDEYNPSILPDTTLLASTTVDDIKELKGKSDAERNLDDSNSFSLPRSDLELRT-DSE

Query:  GVDSYSVDEFTDKSHGVCKPPQGRRKKNSKTSNKSQDYSFVSCQQVELERSGTNKPRHHNQSEKRKRNSTNTNL----EAVEEVDDCTLVGFLQKRSKKT
        GVDSYSVDE+T KS+GVC P QGRRKKNSKTSN S D   +  QQ+  E  GT KP HHNQS+KRK ++T  +     EAVEEVDDCTLVGFLQKR K+T
Subjt:  GVDSYSVDEFTDKSHGVCKPPQGRRKKNSKTSNKSQDYSFVSCQQVELERSGTNKPRHHNQSEKRKRNSTNTNL----EAVEEVDDCTLVGFLQKRSKKT

Query:  KITNDKKVDCSSNTPLEVDNDDNDLTIASFLNKLKRKKH
         +T+++ VDCSSN PL+VDNDDN+ TIASFLNKLKRKKH
Subjt:  KITNDKKVDCSSNTPLEVDNDDNDLTIASFLNKLKRKKH

XP_023515735.1 uncharacterized protein LOC111779809 isoform X1 [Cucurbita pepo subsp. pepo]0.0e+0070.12Show/hide
Query:  DTLSNKPNQSCDYVGSLKMDSDNTAVESQIASERPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRFKILKDFQGSCKRR
        D LSNKP+QSCD  G LKMDS+NT VE   +SER SM+AFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRFK+LK FQGSC+R+
Subjt:  DTLSNKPNQSCDYVGSLKMDSDNTAVESQIASERPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRFKILKDFQGSCKRR

Query:  TSCALSQMIDPRIQLMSAAKPQAKDSSKKPCPITFTTRSLFEQKDKRLSAMYYGPAENSHVACYRMALTKFPRVDRKKWSIVERENLGKGIRQQFQEMVL
        T+CAL+QM+DPR+QL+SA KPQAKDSSK               KDKRLS+M YGPAENSHVACYR A TKF  VDRK+WS  ERENLGKGIRQQFQEMVL
Subjt:  TSCALSQMIDPRIQLMSAAKPQAKDSSKKPCPITFTTRSLFEQKDKRLSAMYYGPAENSHVACYRMALTKFPRVDRKKWSIVERENLGKGIRQQFQEMVL

Query:  QISVDQIRIIFCTLPRSTGIGPSVKDDEIEDENDGLSQMKGSFTVCDPLIDEFELRSGEMLVQKRTLNFDQESTLTLKIRVKVKYDSSYHKISINSSWGS
        QISVDQI                                                                                             
Subjt:  QISVDQIRIIFCTLPRSTGIGPSVKDDEIEDENDGLSQMKGSFTVCDPLIDEFELRSGEMLVQKRTLNFDQESTLTLKIRVKVKYDSSYHKISINSSWGS

Query:  YCRSGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEARYFLTESILIFFEACQLQSASQSGLHYVCDDNRHRHLWLN
           S +QGFSA+SDDLDNILASIK LDITPEKIREFLPKVNWDKLASMYLRGRSGAECEAR                                    WLN
Subjt:  YCRSGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEARYFLTESILIFFEACQLQSASQSGLHYVCDDNRHRHLWLN

Query:  FEDPLINRDPWITSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASIIKREWTKDEDDKLRSALAIFGLGDWQAVASTLEGRTGTQCS
        FEDPLINR+PW TSEDKNLLFTIQQKGLNNWIE+AVS GTNRTPFQCLSRYQRSLNASI+K EWTKDEDDKLRSA+A+FG GDWQAVASTLEGRTG QCS
Subjt:  FEDPLINRDPWITSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASIIKREWTKDEDDKLRSALAIFGLGDWQAVASTLEGRTGTQCS

Query:  NRLILFTISDQWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERCVKWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSW
        NR         WKKSLDPARTKRG+FTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRER   WFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSW
Subjt:  NRLILFTISDQWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERCVKWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSW

Query:  AKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCDTDDPGPAPKRNVKTRKMPVSRNEKNVTGG
        AKVAACVPSRTDN+CRRRWKKLFPN+VPLLQEARKIQK ALISNFVDRESERPALGPTDFRP PN+ LLC+TDDP  APKRNV+TR+MPVSRNEK+  G 
Subjt:  AKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCDTDDPGPAPKRNVKTRKMPVSRNEKNVTGG

Query:  APKKKKSNCQRNQADTTAQVGIANNTSSVPEEVKSLKPQRKRNRHGAYTAKSIGVLELRPNNEWCAKQNLDTRTLGVQLNGKE-SEWTNSDCTETVDENG
        APK++KSN QRN+AD TAQV   NNTSSVP EVKS KPQRKR RHGAYT +  G  ++  N+E CA+QN DTR++ VQLN KE +E  NSDC ETVDENG
Subjt:  APKKKKSNCQRNQADTTAQVGIANNTSSVPEEVKSLKPQRKRNRHGAYTAKSIGVLELRPNNEWCAKQNLDTRTLGVQLNGKE-SEWTNSDCTETVDENG

Query:  MEVFENKAAEKLSEKDACFSEPEENQNSTTGSSGVSVLSEMTNDMDEYNPSILPDTTLLASTTVDDIKELKGKSDAERNLDDSNSFSLPRSDLELR-TDS
        MEVFENKAAE  SE   CFSE EENQNS TGSSGVSVLSEMTNDMDEYNPS LPDTTLLAS T DDI E KG + A+++LDDSNSFSLP+S LELR TDS
Subjt:  MEVFENKAAEKLSEKDACFSEPEENQNSTTGSSGVSVLSEMTNDMDEYNPSILPDTTLLASTTVDDIKELKGKSDAERNLDDSNSFSLPRSDLELR-TDS

Query:  EGVDSYSVDEFTDKSHGVCKPPQGRRKKNSKTSNKSQDYSFVSCQQVELERSGTNKPRHHNQSEKRKRNSTNTN----LEAVEEVDDCTLVGFLQKRSKK
        EGVDSYSVDEFTDKSHGVCK PQGRRKKNSK SNKSQD S VSCQQ ELE SGTN+    NQS+KRK + TNT+    +EAVEEVDDCTL GFLQKR K+
Subjt:  EGVDSYSVDEFTDKSHGVCKPPQGRRKKNSKTSNKSQDYSFVSCQQVELERSGTNKPRHHNQSEKRKRNSTNTN----LEAVEEVDDCTLVGFLQKRSKK

Query:  TKITNDKKVDCSSNTPLEVDNDDNDLTIASFLN-KLKRKKH
        T  T+DKKVD SS+TP EVDNDDND T+A  LN KLKRKKH
Subjt:  TKITNDKKVDCSSNTPLEVDNDDNDLTIASFLN-KLKRKKH

XP_038905712.1 uncharacterized protein LOC120091681 isoform X1 [Benincasa hispida]0.0e+0074.11Show/hide
Query:  DTLSNKPNQSCDYVGSLKMDSDNTAVESQIASERPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRFKILKDFQGSCKRR
        D LSNKPN+S DYVGSLKMDS NTA ESQ +S+RPSM+AFEKGSLPKAALAF+DAIKKNRSQQKFIRSKMIHLEARIEENKKLRKR KILKDFQGSCKR+
Subjt:  DTLSNKPNQSCDYVGSLKMDSDNTAVESQIASERPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRFKILKDFQGSCKRR

Query:  TSCALSQMIDPRIQLMSAAKPQAKDSSKKPCPITFTTRSLFEQKDKRLSAMYYGPAENSHVACYRMALTKFPRVDRKKWSIVERENLGKGIRQQFQEMVL
        T+CALSQMIDPR+QL+SAAKPQAKDSSK               KDKRLS MYYGPAENSHVACYRMAL KFPRVDRKKWSIVERENLGKGIRQQFQEMVL
Subjt:  TSCALSQMIDPRIQLMSAAKPQAKDSSKKPCPITFTTRSLFEQKDKRLSAMYYGPAENSHVACYRMALTKFPRVDRKKWSIVERENLGKGIRQQFQEMVL

Query:  QISVDQIRIIFCTLPRSTGIGPSVKDDEIEDENDGLSQMKGSFTVCDPLIDEFELRSGEMLVQKRTLNFDQESTLTLKIRVKVKYDSSYHKISINSSWGS
        QISVDQI                                                                                             
Subjt:  QISVDQIRIIFCTLPRSTGIGPSVKDDEIEDENDGLSQMKGSFTVCDPLIDEFELRSGEMLVQKRTLNFDQESTLTLKIRVKVKYDSSYHKISINSSWGS

Query:  YCRSGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEARYFLTESILIFFEACQLQSASQSGLHYVCDDNRHRHLWLN
           SGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYL GRSGAECEAR                                    WLN
Subjt:  YCRSGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEARYFLTESILIFFEACQLQSASQSGLHYVCDDNRHRHLWLN

Query:  FEDPLINRDPWITSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASIIKREWTKDEDDKLRSALAIFGLGDWQAVASTLEGRTGTQCS
        FEDPLINRDPW TSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASI+KREWTKDEDDKLRSA+AIFG+ DWQAVASTLEGR GTQCS
Subjt:  FEDPLINRDPWITSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASIIKREWTKDEDDKLRSALAIFGLGDWQAVASTLEGRTGTQCS

Query:  NRLILFTISDQWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERCVKWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSW
        NR         WKKSLDPARTKRGHFTPDED+RLKIAVLL GPKNWNKKAEFLPGRNQVQCRER   WFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSW
Subjt:  NRLILFTISDQWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERCVKWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSW

Query:  AKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCDTDDPGPAPKRNVKTRKMPVSRNEKNVTGG
        AKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGP DFRPR NTD+LC TDDP PAPKRN KTRKMPVSRNEK+ TG 
Subjt:  AKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCDTDDPGPAPKRNVKTRKMPVSRNEKNVTGG

Query:  APKKKKSNCQRNQADTTAQVGIANNTSSVPEEVKSLKPQRKRNRHGAYTAKSIGVLELRPNNEWCAKQNLDTRTLGVQLNGKESEWTNSDCTETVDENGM
        AP+K+KSN QRNQAD TA+VGIANNTSSVPEEV+SLKP RKRNRH A T K  GVLEL  +N+WCAKQNL+TR++GVQL+ KE E TNSD TETVD NG+
Subjt:  APKKKKSNCQRNQADTTAQVGIANNTSSVPEEVKSLKPQRKRNRHGAYTAKSIGVLELRPNNEWCAKQNLDTRTLGVQLNGKESEWTNSDCTETVDENGM

Query:  EVFENKAAEKLSEKDACFSEPEENQNSTTGSSGVSVLSEMTNDMDEYNPSILPDTTLLASTTVDDIKELKGKSDAERNLDDSNSFSLPRSDLELRT-DSE
        EVFENK A+KLSE+D  FSEPEENQNS TGSSGVSVLSEMTNDMDEYNPSILPDTTLLASTTVDDI+ELKGKS A+R+LDDSNSFSLP S LELRT D E
Subjt:  EVFENKAAEKLSEKDACFSEPEENQNSTTGSSGVSVLSEMTNDMDEYNPSILPDTTLLASTTVDDIKELKGKSDAERNLDDSNSFSLPRSDLELRT-DSE

Query:  GVDSYSVDEFTDKSHGVCKPPQGRRKKNSKTSNKSQDYSFVSCQQVELERSGTNKPRHHNQSEKRKRNSTNTN----LEAVEEVDDCTLVGFLQKRSKKT
        GVDSYSVD+ TDKSH VCK PQGRRKKNSKTS+K+ +YSF+SCQQVE ER G N+PRH NQS+KRK +STNT+    LEAVEEVD+CTLVGFLQKR    
Subjt:  GVDSYSVDEFTDKSHGVCKPPQGRRKKNSKTSNKSQDYSFVSCQQVELERSGTNKPRHHNQSEKRKRNSTNTN----LEAVEEVDDCTLVGFLQKRSKKT

Query:  KITNDKKVDCSSNTPLEVDNDDNDLTIASFLNKLKRKKH
             KKVDCSS TPLEVDNDDND  IASFLNKLKRKKH
Subjt:  KITNDKKVDCSSNTPLEVDNDDNDLTIASFLNKLKRKKH

XP_038905717.1 uncharacterized protein LOC120091681 isoform X2 [Benincasa hispida]0.0e+0074.11Show/hide
Query:  DTLSNKPNQSCDYVGSLKMDSDNTAVESQIASERPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRFKILKDFQGSCKRR
        D LSNKPN+S DYVGSLKMDS NTA ESQ +S+RPSM+AFEKGSLPKAALAF+DAIKKNRSQQKFIRSKMIHLEARIEENKKLRKR KILKDFQGSCKR+
Subjt:  DTLSNKPNQSCDYVGSLKMDSDNTAVESQIASERPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRFKILKDFQGSCKRR

Query:  TSCALSQMIDPRIQLMSAAKPQAKDSSKKPCPITFTTRSLFEQKDKRLSAMYYGPAENSHVACYRMALTKFPRVDRKKWSIVERENLGKGIRQQFQEMVL
        T+CALSQMIDPR+QL+SAAKPQAKDSSK               KDKRLS MYYGPAENSHVACYRMAL KFPRVDRKKWSIVERENLGKGIRQQFQEMVL
Subjt:  TSCALSQMIDPRIQLMSAAKPQAKDSSKKPCPITFTTRSLFEQKDKRLSAMYYGPAENSHVACYRMALTKFPRVDRKKWSIVERENLGKGIRQQFQEMVL

Query:  QISVDQIRIIFCTLPRSTGIGPSVKDDEIEDENDGLSQMKGSFTVCDPLIDEFELRSGEMLVQKRTLNFDQESTLTLKIRVKVKYDSSYHKISINSSWGS
        QISVDQI                                                                                             
Subjt:  QISVDQIRIIFCTLPRSTGIGPSVKDDEIEDENDGLSQMKGSFTVCDPLIDEFELRSGEMLVQKRTLNFDQESTLTLKIRVKVKYDSSYHKISINSSWGS

Query:  YCRSGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEARYFLTESILIFFEACQLQSASQSGLHYVCDDNRHRHLWLN
           SGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYL GRSGAECEAR                                    WLN
Subjt:  YCRSGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEARYFLTESILIFFEACQLQSASQSGLHYVCDDNRHRHLWLN

Query:  FEDPLINRDPWITSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASIIKREWTKDEDDKLRSALAIFGLGDWQAVASTLEGRTGTQCS
        FEDPLINRDPW TSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASI+KREWTKDEDDKLRSA+AIFG+ DWQAVASTLEGR GTQCS
Subjt:  FEDPLINRDPWITSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASIIKREWTKDEDDKLRSALAIFGLGDWQAVASTLEGRTGTQCS

Query:  NRLILFTISDQWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERCVKWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSW
        NR         WKKSLDPARTKRGHFTPDED+RLKIAVLL GPKNWNKKAEFLPGRNQVQCRER   WFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSW
Subjt:  NRLILFTISDQWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERCVKWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSW

Query:  AKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCDTDDPGPAPKRNVKTRKMPVSRNEKNVTGG
        AKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGP DFRPR NTD+LC TDDP PAPKRN KTRKMPVSRNEK+ TG 
Subjt:  AKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCDTDDPGPAPKRNVKTRKMPVSRNEKNVTGG

Query:  APKKKKSNCQRNQADTTAQVGIANNTSSVPEEVKSLKPQRKRNRHGAYTAKSIGVLELRPNNEWCAKQNLDTRTLGVQLNGKESEWTNSDCTETVDENGM
        AP+K+KSN QRNQAD TA+VGIANNTSSVPEEV+SLKP RKRNRH A T K  GVLEL  +N+WCAKQNL+TR++GVQL+ KE E TNSD TETVD NG+
Subjt:  APKKKKSNCQRNQADTTAQVGIANNTSSVPEEVKSLKPQRKRNRHGAYTAKSIGVLELRPNNEWCAKQNLDTRTLGVQLNGKESEWTNSDCTETVDENGM

Query:  EVFENKAAEKLSEKDACFSEPEENQNSTTGSSGVSVLSEMTNDMDEYNPSILPDTTLLASTTVDDIKELKGKSDAERNLDDSNSFSLPRSDLELRT-DSE
        EVFENK A+KLSE+D  FSEPEENQNS TGSSGVSVLSEMTNDMDEYNPSILPDTTLLASTTVDDI+ELKGKS A+R+LDDSNSFSLP S LELRT D E
Subjt:  EVFENKAAEKLSEKDACFSEPEENQNSTTGSSGVSVLSEMTNDMDEYNPSILPDTTLLASTTVDDIKELKGKSDAERNLDDSNSFSLPRSDLELRT-DSE

Query:  GVDSYSVDEFTDKSHGVCKPPQGRRKKNSKTSNKSQDYSFVSCQQVELERSGTNKPRHHNQSEKRKRNSTNTN----LEAVEEVDDCTLVGFLQKRSKKT
        GVDSYSVD+ TDKSH VCK PQGRRKKNSKTS+K+ +YSF+SCQQVE ER G N+PRH NQS+KRK +STNT+    LEAVEEVD+CTLVGFLQKR    
Subjt:  GVDSYSVDEFTDKSHGVCKPPQGRRKKNSKTSNKSQDYSFVSCQQVELERSGTNKPRHHNQSEKRKRNSTNTN----LEAVEEVDDCTLVGFLQKRSKKT

Query:  KITNDKKVDCSSNTPLEVDNDDNDLTIASFLNKLKRKKH
             KKVDCSS TPLEVDNDDND  IASFLNKLKRKKH
Subjt:  KITNDKKVDCSSNTPLEVDNDDNDLTIASFLNKLKRKKH

XP_038905721.1 uncharacterized protein LOC120091681 isoform X3 [Benincasa hispida]0.0e+0073.95Show/hide
Query:  MDSDNTAVESQIASERPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRFKILKDFQGSCKRRTSCALSQMIDPRIQLMSA
        MDS NTA ESQ +S+RPSM+AFEKGSLPKAALAF+DAIKKNRSQQKFIRSKMIHLEARIEENKKLRKR KILKDFQGSCKR+T+CALSQMIDPR+QL+SA
Subjt:  MDSDNTAVESQIASERPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRFKILKDFQGSCKRRTSCALSQMIDPRIQLMSA

Query:  AKPQAKDSSKKPCPITFTTRSLFEQKDKRLSAMYYGPAENSHVACYRMALTKFPRVDRKKWSIVERENLGKGIRQQFQEMVLQISVDQIRIIFCTLPRST
        AKPQAKDSSK               KDKRLS MYYGPAENSHVACYRMAL KFPRVDRKKWSIVERENLGKGIRQQFQEMVLQISVDQI           
Subjt:  AKPQAKDSSKKPCPITFTTRSLFEQKDKRLSAMYYGPAENSHVACYRMALTKFPRVDRKKWSIVERENLGKGIRQQFQEMVLQISVDQIRIIFCTLPRST

Query:  GIGPSVKDDEIEDENDGLSQMKGSFTVCDPLIDEFELRSGEMLVQKRTLNFDQESTLTLKIRVKVKYDSSYHKISINSSWGSYCRSGLQGFSADSDDLDN
                                                                                             SGLQGFSADSDDLDN
Subjt:  GIGPSVKDDEIEDENDGLSQMKGSFTVCDPLIDEFELRSGEMLVQKRTLNFDQESTLTLKIRVKVKYDSSYHKISINSSWGSYCRSGLQGFSADSDDLDN

Query:  ILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEARYFLTESILIFFEACQLQSASQSGLHYVCDDNRHRHLWLNFEDPLINRDPWITSEDKN
        ILASIKDLDITPEKIREFLPKVNWDKLASMYL GRSGAECEAR                                    WLNFEDPLINRDPW TSEDKN
Subjt:  ILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEARYFLTESILIFFEACQLQSASQSGLHYVCDDNRHRHLWLNFEDPLINRDPWITSEDKN

Query:  LLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASIIKREWTKDEDDKLRSALAIFGLGDWQAVASTLEGRTGTQCSNRLILFTISDQWKKSLDP
        LLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASI+KREWTKDEDDKLRSA+AIFG+ DWQAVASTLEGR GTQCSNR         WKKSLDP
Subjt:  LLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASIIKREWTKDEDDKLRSALAIFGLGDWQAVASTLEGRTGTQCSNRLILFTISDQWKKSLDP

Query:  ARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERCVKWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNDCRRR
        ARTKRGHFTPDED+RLKIAVLL GPKNWNKKAEFLPGRNQVQCRER   WFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNDCRRR
Subjt:  ARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERCVKWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNDCRRR

Query:  WKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCDTDDPGPAPKRNVKTRKMPVSRNEKNVTGGAPKKKKSNCQRNQADTTA
        WKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGP DFRPR NTD+LC TDDP PAPKRN KTRKMPVSRNEK+ TG AP+K+KSN QRNQAD TA
Subjt:  WKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCDTDDPGPAPKRNVKTRKMPVSRNEKNVTGGAPKKKKSNCQRNQADTTA

Query:  QVGIANNTSSVPEEVKSLKPQRKRNRHGAYTAKSIGVLELRPNNEWCAKQNLDTRTLGVQLNGKESEWTNSDCTETVDENGMEVFENKAAEKLSEKDACF
        +VGIANNTSSVPEEV+SLKP RKRNRH A T K  GVLEL  +N+WCAKQNL+TR++GVQL+ KE E TNSD TETVD NG+EVFENK A+KLSE+D  F
Subjt:  QVGIANNTSSVPEEVKSLKPQRKRNRHGAYTAKSIGVLELRPNNEWCAKQNLDTRTLGVQLNGKESEWTNSDCTETVDENGMEVFENKAAEKLSEKDACF

Query:  SEPEENQNSTTGSSGVSVLSEMTNDMDEYNPSILPDTTLLASTTVDDIKELKGKSDAERNLDDSNSFSLPRSDLELRT-DSEGVDSYSVDEFTDKSHGVC
        SEPEENQNS TGSSGVSVLSEMTNDMDEYNPSILPDTTLLASTTVDDI+ELKGKS A+R+LDDSNSFSLP S LELRT D EGVDSYSVD+ TDKSH VC
Subjt:  SEPEENQNSTTGSSGVSVLSEMTNDMDEYNPSILPDTTLLASTTVDDIKELKGKSDAERNLDDSNSFSLPRSDLELRT-DSEGVDSYSVDEFTDKSHGVC

Query:  KPPQGRRKKNSKTSNKSQDYSFVSCQQVELERSGTNKPRHHNQSEKRKRNSTNTN----LEAVEEVDDCTLVGFLQKRSKKTKITNDKKVDCSSNTPLEV
        K PQGRRKKNSKTS+K+ +YSF+SCQQVE ER G N+PRH NQS+KRK +STNT+    LEAVEEVD+CTLVGFLQKR         KKVDCSS TPLEV
Subjt:  KPPQGRRKKNSKTSNKSQDYSFVSCQQVELERSGTNKPRHHNQSEKRKRNSTNTN----LEAVEEVDDCTLVGFLQKRSKKTKITNDKKVDCSSNTPLEV

Query:  DNDDNDLTIASFLNKLKRKKH
        DNDDND  IASFLNKLKRKKH
Subjt:  DNDDNDLTIASFLNKLKRKKH

TrEMBL top hitse value%identityAlignment
A0A0A0L2R2 Uncharacterized protein0.0e+0071.61Show/hide
Query:  DTLSNKPNQSCDYVGSLKMDSDNTAVESQIASERPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRFKILKDFQGSCKRR
        DTLSNKPNQS DYVGSLK+DSD+ AVESQ +S+RPSMLAFEKGSLPKAALAF+DAIKKNRSQQKFIRSKMIHLEARIEENKKLRKR KILKDFQGSCKRR
Subjt:  DTLSNKPNQSCDYVGSLKMDSDNTAVESQIASERPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRFKILKDFQGSCKRR

Query:  TSCALSQMIDPRIQLMSAAKPQAKDSSKKPCPITFTTRSLFEQKDKRLSAMYYGPAENSHVACYRMALTKFPRVDRKKWSIVERENLGKGIRQQFQEMVL
        TSCALSQMIDPR+QL+SAAKPQAKDSSK               KDKRLS MYYGP ENSHVACYRM L KFP VDRKKWSIVERENLGKGIRQQFQEMVL
Subjt:  TSCALSQMIDPRIQLMSAAKPQAKDSSKKPCPITFTTRSLFEQKDKRLSAMYYGPAENSHVACYRMALTKFPRVDRKKWSIVERENLGKGIRQQFQEMVL

Query:  QISVDQIRIIFCTLPRSTGIGPSVKDDEIEDENDGLSQMKGSFTVCDPLIDEFELRSGEMLVQKRTLNFDQESTLTLKIRVKVKYDSSYHKISINSSWGS
        QISVDQI                                                                                             
Subjt:  QISVDQIRIIFCTLPRSTGIGPSVKDDEIEDENDGLSQMKGSFTVCDPLIDEFELRSGEMLVQKRTLNFDQESTLTLKIRVKVKYDSSYHKISINSSWGS

Query:  YCRSGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEARYFLTESILIFFEACQLQSASQSGLHYVCDDNRHRHLWLN
           SG QG S DSDDLDNILASIKDLDI P+KIREFLPKVNWDKLASMYL+GRSGAECEAR                                    WLN
Subjt:  YCRSGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEARYFLTESILIFFEACQLQSASQSGLHYVCDDNRHRHLWLN

Query:  FEDPLINRDPWITSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASIIKREWTKDEDDKLRSALAIFGLGDWQAVASTLEGRTGTQCS
        FEDPLINRDPW TSEDK+LLFTIQQKGLNNWIE+AVS GTNRTPFQCLSRYQRSLNASI+KREWTK+EDD+LRSA+A FG+ DWQAVASTLEGR GTQCS
Subjt:  FEDPLINRDPWITSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASIIKREWTKDEDDKLRSALAIFGLGDWQAVASTLEGRTGTQCS

Query:  NRLILFTISDQWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERCVKWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSW
        NR         WKKSLDPART++G+FTPDED RLKIAVLLFGPKNWNKKAEFLPGRNQVQCRER   WFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSW
Subjt:  NRLILFTISDQWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERCVKWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSW

Query:  AKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCDTDDPGPAPKRNVKTRKMPVSRNEKNVTGG
        AKVAACVPSRTDN+CRRRWKKLFP+EVPLLQEARKIQKAALISNFVDRE+ERPALGP DFRPRPNTD LC+TD P PAPKRNVKTRKMPVSRNEK+ TG 
Subjt:  AKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCDTDDPGPAPKRNVKTRKMPVSRNEKNVTGG

Query:  APKKKKSNCQRNQADTTAQVGIANNTSSVPEEVKSLKPQRKRNRHGAYTAKSIGVLELRPNNEWCAKQNLDTRTLGVQLNGKESEWTNSDCTETVDENGM
        APKK+KSN QR Q D TAQVGIA NTS VPEEV+S KPQRKRNR GAYTAK IGV ELR ++EWCAKQNLDT +LG+QLN KESE +NS+CTETVDEN M
Subjt:  APKKKKSNCQRNQADTTAQVGIANNTSSVPEEVKSLKPQRKRNRHGAYTAKSIGVLELRPNNEWCAKQNLDTRTLGVQLNGKESEWTNSDCTETVDENGM

Query:  EVFENKAAEKLSEKDACFSEPEENQNSTTGSSGVSVLSEMTNDMDEYNPSILPDTTLLASTTVDDIKELKGKSDAERNLDDSNSFSLPRSDLELRT-DSE
        EV ENK AEKL+E++ACFSEPE+NQNS TGSSGVSVLSEMTND+ +YNPSIL DTTL ASTTVDDI+ELKGKS A+R+LDDSNSFSL  S LELRT DSE
Subjt:  EVFENKAAEKLSEKDACFSEPEENQNSTTGSSGVSVLSEMTNDMDEYNPSILPDTTLLASTTVDDIKELKGKSDAERNLDDSNSFSLPRSDLELRT-DSE

Query:  GVDSYSVDEFTDKSHGVCKPPQGRRKKNSKTSNKSQDYSFVSCQQVELERSGTNKPRHHNQSEKRKRNSTNTNL----EAVEEVDDCTLVGFLQKRSKKT
        GVDSYSVDE+T KS+GVC P QGRRKKNSKTSN S D   +  QQ+  E  GT KP HHNQS+KRK ++T  +     EAVEEVDDCTLVGFLQKR K+T
Subjt:  GVDSYSVDEFTDKSHGVCKPPQGRRKKNSKTSNKSQDYSFVSCQQVELERSGTNKPRHHNQSEKRKRNSTNTNL----EAVEEVDDCTLVGFLQKRSKKT

Query:  KITNDKKVDCSSNTPLEVDNDDNDLTIASFLNKLKRKKH
         +T+++ VDCSSN PL+VDNDDN+ TIASFLNKLKRKKH
Subjt:  KITNDKKVDCSSNTPLEVDNDDNDLTIASFLNKLKRKKH

A0A6J1E2J4 uncharacterized protein LOC111430000 isoform X20.0e+0070.03Show/hide
Query:  DTLSNKPNQSCDYVGSLKMDSDNTAVESQIASERPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRFKILKDFQGSCKRR
        D LSNKP+QSCD  G LKMDSDNT V    +SER SM+AFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRFK+LK FQGSC+R+
Subjt:  DTLSNKPNQSCDYVGSLKMDSDNTAVESQIASERPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRFKILKDFQGSCKRR

Query:  TSCALSQMIDPRIQLMSAAKPQAKDSSKKPCPITFTTRSLFEQKDKRLSAMYYGPAENSHVACYRMALTKFPRVDRKKWSIVERENLGKGIRQQFQEMVL
        T+CAL+QM+DPR+QL+SA KPQAKDSSK               KDKRLSAM YGPAENSHVACYR ALTKF  VDRK+WS  ERENLGKGIRQQFQEMVL
Subjt:  TSCALSQMIDPRIQLMSAAKPQAKDSSKKPCPITFTTRSLFEQKDKRLSAMYYGPAENSHVACYRMALTKFPRVDRKKWSIVERENLGKGIRQQFQEMVL

Query:  QISVDQIRIIFCTLPRSTGIGPSVKDDEIEDENDGLSQMKGSFTVCDPLIDEFELRSGEMLVQKRTLNFDQESTLTLKIRVKVKYDSSYHKISINSSWGS
        QISVDQI                                                                                             
Subjt:  QISVDQIRIIFCTLPRSTGIGPSVKDDEIEDENDGLSQMKGSFTVCDPLIDEFELRSGEMLVQKRTLNFDQESTLTLKIRVKVKYDSSYHKISINSSWGS

Query:  YCRSGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEARYFLTESILIFFEACQLQSASQSGLHYVCDDNRHRHLWLN
           S +QGFSA+SDDLDNILASIK LDITPEKIREFLPKVNWDKLA MYL+GRSGAECEAR                                    WLN
Subjt:  YCRSGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEARYFLTESILIFFEACQLQSASQSGLHYVCDDNRHRHLWLN

Query:  FEDPLINRDPWITSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASIIKREWTKDEDDKLRSALAIFGLGDWQAVASTLEGRTGTQCS
        FEDPLINR+ W TSEDKNLLFTIQQKGLNNWIE+AVS GTNRTPFQCLSRYQRSLNASI+K EWTKDEDDKLRSA+AIFG GDWQAVASTLEGRTG QCS
Subjt:  FEDPLINRDPWITSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASIIKREWTKDEDDKLRSALAIFGLGDWQAVASTLEGRTGTQCS

Query:  NRLILFTISDQWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERCVKWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSW
        NR         WKKSLDPARTKRG+FTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRER   WFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSW
Subjt:  NRLILFTISDQWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERCVKWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSW

Query:  AKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCDTDDPGPAPKRNVKTRKMPVSRNEKNVTGG
        AKVAACVPSRTDN+C RRWKKLFPN+VPLLQEARKIQK ALISNFVDRESERPALGPTDFRP PN+ LLC+TDDP  APKRNV+ R+MPVSRNEK+  G 
Subjt:  AKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCDTDDPGPAPKRNVKTRKMPVSRNEKNVTGG

Query:  APKKKKSNCQRNQADTTAQVGIANNTSSVPEEVKSLKPQRKRNRHGAYTAKSIGVLELRPNNEWCAKQNLDTRTLGVQLNGKE-SEWTNSDCTETVDENG
        APKK KSN QRNQAD TAQV  ANNTSSVP EVKS KPQRKR RHGAYT +  G  ++  N+E CA+QN DTR+L VQLN KE +E  NSDC ETVDENG
Subjt:  APKKKKSNCQRNQADTTAQVGIANNTSSVPEEVKSLKPQRKRNRHGAYTAKSIGVLELRPNNEWCAKQNLDTRTLGVQLNGKE-SEWTNSDCTETVDENG

Query:  MEVFENKAAEKLSEKDACFSEPEENQNSTTGSSGVSVLSEMTNDMDEYNPSILPDTTLLASTTVDDIKELKGKSDAERNLDDSNSFSLPRSDLELR-TDS
        MEVFENKAAE  SE   CFSE EENQNS TGSSGVSVLSEMTNDMDEYNPS  PDTTLLAS T DDI E KG + A+++LDDSNSFSLP+S LELR TDS
Subjt:  MEVFENKAAEKLSEKDACFSEPEENQNSTTGSSGVSVLSEMTNDMDEYNPSILPDTTLLASTTVDDIKELKGKSDAERNLDDSNSFSLPRSDLELR-TDS

Query:  EGVDSYSVDEFTDKSHGVCKPPQGRRKKNSKTSNKSQDYSFVSCQQVELERSGTNKPRHHNQSEKRKRNSTNTN----LEAVEEVDDCTLVGFLQKRSKK
        EGVDSYSVDEFTDKSHGVCK PQGRRKKNSK SNKSQD S VSCQQ ELE SG N+    NQS+KRK + TNT+    +EAVEEVDDCTL GFLQKR K+
Subjt:  EGVDSYSVDEFTDKSHGVCKPPQGRRKKNSKTSNKSQDYSFVSCQQVELERSGTNKPRHHNQSEKRKRNSTNTN----LEAVEEVDDCTLVGFLQKRSKK

Query:  TKITNDKKVDCSSNTPLEVDNDDNDLTIASFL-NKLKRKKH
        T  T+DKKVD SS+TP EVDNDDND T+A  L +KLKRKKH
Subjt:  TKITNDKKVDCSSNTPLEVDNDDNDLTIASFL-NKLKRKKH

A0A6J1E6Z7 uncharacterized protein LOC111430000 isoform X10.0e+0070.12Show/hide
Query:  DTLSNKPNQSCDYVGSLKMDSDNTAVESQIASERPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRFKILKDFQGSCKRR
        D LSNKP+QSCD  G LKMDSDNT V    +SER SM+AFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRFK+LK FQGSC+R+
Subjt:  DTLSNKPNQSCDYVGSLKMDSDNTAVESQIASERPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRFKILKDFQGSCKRR

Query:  TSCALSQMIDPRIQLMSAAKPQAKDSSKKPCPITFTTRSLFEQKDKRLSAMYYGPAENSHVACYRMALTKFPRVDRKKWSIVERENLGKGIRQQFQEMVL
        T+CAL+QM+DPR+QL+SA KPQAKDSSK               KDKRLSAM YGPAENSHVACYR ALTKF  VDRK+WS  ERENLGKGIRQQFQEMVL
Subjt:  TSCALSQMIDPRIQLMSAAKPQAKDSSKKPCPITFTTRSLFEQKDKRLSAMYYGPAENSHVACYRMALTKFPRVDRKKWSIVERENLGKGIRQQFQEMVL

Query:  QISVDQIRIIFCTLPRSTGIGPSVKDDEIEDENDGLSQMKGSFTVCDPLIDEFELRSGEMLVQKRTLNFDQESTLTLKIRVKVKYDSSYHKISINSSWGS
        QISVDQI                                                                                             
Subjt:  QISVDQIRIIFCTLPRSTGIGPSVKDDEIEDENDGLSQMKGSFTVCDPLIDEFELRSGEMLVQKRTLNFDQESTLTLKIRVKVKYDSSYHKISINSSWGS

Query:  YCRSGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEARYFLTESILIFFEACQLQSASQSGLHYVCDDNRHRHLWLN
           S +QGFSA+SDDLDNILASIK LDITPEKIREFLPKVNWDKLA MYL+GRSGAECEAR                                    WLN
Subjt:  YCRSGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEARYFLTESILIFFEACQLQSASQSGLHYVCDDNRHRHLWLN

Query:  FEDPLINRDPWITSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASIIKREWTKDEDDKLRSALAIFGLGDWQAVASTLEGRTGTQCS
        FEDPLINR+ W TSEDKNLLFTIQQKGLNNWIE+AVS GTNRTPFQCLSRYQRSLNASI+K EWTKDEDDKLRSA+AIFG GDWQAVASTLEGRTG QCS
Subjt:  FEDPLINRDPWITSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASIIKREWTKDEDDKLRSALAIFGLGDWQAVASTLEGRTGTQCS

Query:  NRLILFTISDQWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERCVKWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSW
        NR         WKKSLDPARTKRG+FTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRER   WFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSW
Subjt:  NRLILFTISDQWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERCVKWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSW

Query:  AKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCDTDDPGPAPKRNVKTRKMPVSRNEKNVTGG
        AKVAACVPSRTDN+CRRRWKKLFPN+VPLLQEARKIQK ALISNFVDRESERPALGPTDFRP PN+ LLC+TDDP  APKRNV+ R+MPVSRNEK+  G 
Subjt:  AKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCDTDDPGPAPKRNVKTRKMPVSRNEKNVTGG

Query:  APKKKKSNCQRNQADTTAQVGIANNTSSVPEEVKSLKPQRKRNRHGAYTAKSIGVLELRPNNEWCAKQNLDTRTLGVQLNGKE-SEWTNSDCTETVDENG
        APKK KSN QRNQAD TAQV  ANNTSSVP EVKS KPQRKR RHGAYT +  G  ++  N+E CA+QN DTR+L VQLN KE +E  NSDC ETVDENG
Subjt:  APKKKKSNCQRNQADTTAQVGIANNTSSVPEEVKSLKPQRKRNRHGAYTAKSIGVLELRPNNEWCAKQNLDTRTLGVQLNGKE-SEWTNSDCTETVDENG

Query:  MEVFENKAAEKLSEKDACFSEPEENQNSTTGSSGVSVLSEMTNDMDEYNPSILPDTTLLASTTVDDIKELKGKSDAERNLDDSNSFSLPRSDLELR-TDS
        MEVFENKAAE  SE   CFSE EENQNS TGSSGVSVLSEMTNDMDEYNPS  PDTTLLAS T DDI E KG + A+++LDDSNSFSLP+S LELR TDS
Subjt:  MEVFENKAAEKLSEKDACFSEPEENQNSTTGSSGVSVLSEMTNDMDEYNPSILPDTTLLASTTVDDIKELKGKSDAERNLDDSNSFSLPRSDLELR-TDS

Query:  EGVDSYSVDEFTDKSHGVCKPPQGRRKKNSKTSNKSQDYSFVSCQQVELERSGTNKPRHHNQSEKRKRNSTNTN----LEAVEEVDDCTLVGFLQKRSKK
        EGVDSYSVDEFTDKSHGVCK PQGRRKKNSK SNKSQD S VSCQQ ELE SG N+    NQS+KRK + TNT+    +EAVEEVDDCTL GFLQKR K+
Subjt:  EGVDSYSVDEFTDKSHGVCKPPQGRRKKNSKTSNKSQDYSFVSCQQVELERSGTNKPRHHNQSEKRKRNSTNTN----LEAVEEVDDCTLVGFLQKRSKK

Query:  TKITNDKKVDCSSNTPLEVDNDDNDLTIASFL-NKLKRKKH
        T  T+DKKVD SS+TP EVDNDDND T+A  L +KLKRKKH
Subjt:  TKITNDKKVDCSSNTPLEVDNDDNDLTIASFL-NKLKRKKH

A0A6J1JK98 uncharacterized protein LOC111485355 isoform X20.0e+0069.87Show/hide
Query:  DTLSNKPNQSCDYVGSLKMDSDNTAVESQIASERPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRFKILKDFQGSCKRR
        D LSNKP+QSCD  G LKMDSDNT VE   +SER SM+AFEKGSLPKAALAFIDAIKKNRSQQKF+RSKMIHLEARIEENKKLRKRFK+LK FQGSC+R+
Subjt:  DTLSNKPNQSCDYVGSLKMDSDNTAVESQIASERPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRFKILKDFQGSCKRR

Query:  TSCALSQMIDPRIQLMSAAKP-QAKDSSKKPCPITFTTRSLFEQKDKRLSAMYYGPAENSHVACYRMALTKFPRVDRKKWSIVERENLGKGIRQQFQEMV
        T+CALSQM+DPR+QL+SA KP QAKDSSK               KDKRLSAM YGPAENSHVACYR+ALTKF  VDRK+WS  ERENLGKGIRQQFQEMV
Subjt:  TSCALSQMIDPRIQLMSAAKP-QAKDSSKKPCPITFTTRSLFEQKDKRLSAMYYGPAENSHVACYRMALTKFPRVDRKKWSIVERENLGKGIRQQFQEMV

Query:  LQISVDQIRIIFCTLPRSTGIGPSVKDDEIEDENDGLSQMKGSFTVCDPLIDEFELRSGEMLVQKRTLNFDQESTLTLKIRVKVKYDSSYHKISINSSWG
        LQISVDQI                                                                                            
Subjt:  LQISVDQIRIIFCTLPRSTGIGPSVKDDEIEDENDGLSQMKGSFTVCDPLIDEFELRSGEMLVQKRTLNFDQESTLTLKIRVKVKYDSSYHKISINSSWG

Query:  SYCRSGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEARYFLTESILIFFEACQLQSASQSGLHYVCDDNRHRHLWL
            S +QGFSA+SDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEAR                                    WL
Subjt:  SYCRSGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEARYFLTESILIFFEACQLQSASQSGLHYVCDDNRHRHLWL

Query:  NFEDPLINRDPWITSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASIIKREWTKDEDDKLRSALAIFGLGDWQAVASTLEGRTGTQC
        NFEDPLINR+PW TSEDKNLLFTIQQKGLNNWI++AVS GTNRTPFQ LSRYQRSLNASI+K EWTKDEDDKLRSA+AIFG GDWQAVASTLEGRTG QC
Subjt:  NFEDPLINRDPWITSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASIIKREWTKDEDDKLRSALAIFGLGDWQAVASTLEGRTGTQC

Query:  SNRLILFTISDQWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERCVKWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYS
        SNR         WKKSLDPARTKRG+FTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRER   WFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYS
Subjt:  SNRLILFTISDQWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERCVKWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYS

Query:  WAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCDTDDPGPAPKRNVKTRKMPVSRNEKNVTG
        WAKVAACVPSRTDN+C RRWKKLFPN+VPLLQEARKIQK ALISNFVDRESERPALGPTDFRP PN+ LLC+TDDP  APKRNV+TR+MPVSRNEK+  G
Subjt:  WAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCDTDDPGPAPKRNVKTRKMPVSRNEKNVTG

Query:  GAPKKKKSNCQRNQADTTAQVGIANNTSSVPEEVKSLKPQRKRNRHGAYTAKSIGVLELRPNNEWCAKQNLDTRTLGVQLNGKE-SEWTNSDCTETVDEN
         APKK+KSN QRN+ D TAQV  A+NTSSVP EVKS KPQRKR RHGAYT +  G  ++  N+E CA+QN DTR L VQLN KE +E  NSDC ETVDEN
Subjt:  GAPKKKKSNCQRNQADTTAQVGIANNTSSVPEEVKSLKPQRKRNRHGAYTAKSIGVLELRPNNEWCAKQNLDTRTLGVQLNGKE-SEWTNSDCTETVDEN

Query:  GMEVFENKAAEKLSEKDACFSEPEENQNSTTGSSGVSVLSEMTNDMDEYNPSILPDTTLLASTTVDDIKELKGKSDAERNLDDSNSFSLPRSDLELR-TD
        GMEVFENKAAE  SE   CFSE EENQNS TGSSGVSVLSEMTNDMDEYNPS LPDTTLLAS T DDI E KG + A+++LD SNSFSLP+S LELR TD
Subjt:  GMEVFENKAAEKLSEKDACFSEPEENQNSTTGSSGVSVLSEMTNDMDEYNPSILPDTTLLASTTVDDIKELKGKSDAERNLDDSNSFSLPRSDLELR-TD

Query:  SEGVDSYSVDEFTDKSHGVCKPPQGRRKKNSKTSNKSQDYSFVSCQQVELERSGTNKPRHHNQSEKRKRNSTNTN----LEAVEEVDDCTLVGFLQKRSK
        SEGVDSYSVDEFTDKSH VCK PQGRRKKNSK SNKSQD S VSCQQ ELE SGTN+    NQ +KRK +STNT+    +EAVEEVDDCTL+GFLQKR K
Subjt:  SEGVDSYSVDEFTDKSHGVCKPPQGRRKKNSKTSNKSQDYSFVSCQQVELERSGTNKPRHHNQSEKRKRNSTNTN----LEAVEEVDDCTLVGFLQKRSK

Query:  KTKITNDKKVDCSSNTPLEVDNDDNDLTIASFL-NKLKRKKH
        +T  T+ KKVD SS+T  EVDNDDND T+A  L  KLKRKKH
Subjt:  KTKITNDKKVDCSSNTPLEVDNDDNDLTIASFL-NKLKRKKH

A0A6J1JKV7 uncharacterized protein LOC111485355 isoform X10.0e+0069.96Show/hide
Query:  DTLSNKPNQSCDYVGSLKMDSDNTAVESQIASERPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRFKILKDFQGSCKRR
        D LSNKP+QSCD  G LKMDSDNT VE   +SER SM+AFEKGSLPKAALAFIDAIKKNRSQQKF+RSKMIHLEARIEENKKLRKRFK+LK FQGSC+R+
Subjt:  DTLSNKPNQSCDYVGSLKMDSDNTAVESQIASERPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRFKILKDFQGSCKRR

Query:  TSCALSQMIDPRIQLMSAAKP-QAKDSSKKPCPITFTTRSLFEQKDKRLSAMYYGPAENSHVACYRMALTKFPRVDRKKWSIVERENLGKGIRQQFQEMV
        T+CALSQM+DPR+QL+SA KP QAKDSSK               KDKRLSAM YGPAENSHVACYR+ALTKF  VDRK+WS  ERENLGKGIRQQFQEMV
Subjt:  TSCALSQMIDPRIQLMSAAKP-QAKDSSKKPCPITFTTRSLFEQKDKRLSAMYYGPAENSHVACYRMALTKFPRVDRKKWSIVERENLGKGIRQQFQEMV

Query:  LQISVDQIRIIFCTLPRSTGIGPSVKDDEIEDENDGLSQMKGSFTVCDPLIDEFELRSGEMLVQKRTLNFDQESTLTLKIRVKVKYDSSYHKISINSSWG
        LQISVDQI                                                                                            
Subjt:  LQISVDQIRIIFCTLPRSTGIGPSVKDDEIEDENDGLSQMKGSFTVCDPLIDEFELRSGEMLVQKRTLNFDQESTLTLKIRVKVKYDSSYHKISINSSWG

Query:  SYCRSGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEARYFLTESILIFFEACQLQSASQSGLHYVCDDNRHRHLWL
            S +QGFSA+SDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEAR                                    WL
Subjt:  SYCRSGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEARYFLTESILIFFEACQLQSASQSGLHYVCDDNRHRHLWL

Query:  NFEDPLINRDPWITSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASIIKREWTKDEDDKLRSALAIFGLGDWQAVASTLEGRTGTQC
        NFEDPLINR+PW TSEDKNLLFTIQQKGLNNWI++AVS GTNRTPFQ LSRYQRSLNASI+K EWTKDEDDKLRSA+AIFG GDWQAVASTLEGRTG QC
Subjt:  NFEDPLINRDPWITSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASIIKREWTKDEDDKLRSALAIFGLGDWQAVASTLEGRTGTQC

Query:  SNRLILFTISDQWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERCVKWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYS
        SNR         WKKSLDPARTKRG+FTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRER   WFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYS
Subjt:  SNRLILFTISDQWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERCVKWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYS

Query:  WAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCDTDDPGPAPKRNVKTRKMPVSRNEKNVTG
        WAKVAACVPSRTDN+CRRRWKKLFPN+VPLLQEARKIQK ALISNFVDRESERPALGPTDFRP PN+ LLC+TDDP  APKRNV+TR+MPVSRNEK+  G
Subjt:  WAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCDTDDPGPAPKRNVKTRKMPVSRNEKNVTG

Query:  GAPKKKKSNCQRNQADTTAQVGIANNTSSVPEEVKSLKPQRKRNRHGAYTAKSIGVLELRPNNEWCAKQNLDTRTLGVQLNGKE-SEWTNSDCTETVDEN
         APKK+KSN QRN+ D TAQV  A+NTSSVP EVKS KPQRKR RHGAYT +  G  ++  N+E CA+QN DTR L VQLN KE +E  NSDC ETVDEN
Subjt:  GAPKKKKSNCQRNQADTTAQVGIANNTSSVPEEVKSLKPQRKRNRHGAYTAKSIGVLELRPNNEWCAKQNLDTRTLGVQLNGKE-SEWTNSDCTETVDEN

Query:  GMEVFENKAAEKLSEKDACFSEPEENQNSTTGSSGVSVLSEMTNDMDEYNPSILPDTTLLASTTVDDIKELKGKSDAERNLDDSNSFSLPRSDLELR-TD
        GMEVFENKAAE  SE   CFSE EENQNS TGSSGVSVLSEMTNDMDEYNPS LPDTTLLAS T DDI E KG + A+++LD SNSFSLP+S LELR TD
Subjt:  GMEVFENKAAEKLSEKDACFSEPEENQNSTTGSSGVSVLSEMTNDMDEYNPSILPDTTLLASTTVDDIKELKGKSDAERNLDDSNSFSLPRSDLELR-TD

Query:  SEGVDSYSVDEFTDKSHGVCKPPQGRRKKNSKTSNKSQDYSFVSCQQVELERSGTNKPRHHNQSEKRKRNSTNTN----LEAVEEVDDCTLVGFLQKRSK
        SEGVDSYSVDEFTDKSH VCK PQGRRKKNSK SNKSQD S VSCQQ ELE SGTN+    NQ +KRK +STNT+    +EAVEEVDDCTL+GFLQKR K
Subjt:  SEGVDSYSVDEFTDKSHGVCKPPQGRRKKNSKTSNKSQDYSFVSCQQVELERSGTNKPRHHNQSEKRKRNSTNTN----LEAVEEVDDCTLVGFLQKRSK

Query:  KTKITNDKKVDCSSNTPLEVDNDDNDLTIASFL-NKLKRKKH
        +T  T+ KKVD SS+T  EVDNDDND T+A  L  KLKRKKH
Subjt:  KTKITNDKKVDCSSNTPLEVDNDDNDLTIASFL-NKLKRKKH

SwissProt top hitse value%identityAlignment
P10242 Transcriptional activator Myb5.2e-2836.7Show/hide
Query:  KREWTKDEDDKLRSALAIFGLGDWQAVASTLEGRTGTQCSNRLILFTISDQWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQ
        K  WT++ED+KL+  +   G  DW+ +A+ L  RT  QC +R         W+K L+P   K G +T +ED R+   V  +GPK W+  A+ L GR   Q
Subjt:  KREWTKDEDDKLRSALAIFGLGDWQAVASTLEGRTGTQCSNRLILFTISDQWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQ

Query:  CRERCVKWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNDCRRRWKKLFPNEVP---LLQEARKIQKAALISNF
        CRER   W N L+P +++  WTEEED  +  A +  G  WA++A  +P RTDN  +  W      +V     LQE+ K  + A+ ++F
Subjt:  CRERCVKWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNDCRRRWKKLFPNEVP---LLQEARKIQKAALISNF

P46200 Transcriptional activator Myb5.2e-2836.7Show/hide
Query:  KREWTKDEDDKLRSALAIFGLGDWQAVASTLEGRTGTQCSNRLILFTISDQWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQ
        K  WT++ED+KL+  +   G  DW+ +A+ L  RT  QC +R         W+K L+P   K G +T +ED R+   V  +GPK W+  A+ L GR   Q
Subjt:  KREWTKDEDDKLRSALAIFGLGDWQAVASTLEGRTGTQCSNRLILFTISDQWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQ

Query:  CRERCVKWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNDCRRRWKKLFPNEVP---LLQEARKIQKAALISNF
        CRER   W N L+P +++  WTEEED  +  A +  G  WA++A  +P RTDN  +  W      +V     LQE+ K  + A+ ++F
Subjt:  CRERCVKWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNDCRRRWKKLFPNEVP---LLQEARKIQKAALISNF

Q54NA6 Myb-like protein L1.2e-4840.67Show/hide
Query:  WLNFEDPLINRDPWITSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASIIKREWTKDEDDKLRSALAIFGLG---DWQAVASTLEGR
        W N +DP IN+ P+   EDK LL   ++   + W +I++  GTNRTP  C+ RYQRSLN+ ++KREWTK+ED+ L   + +   G   DWQ +   + GR
Subjt:  WLNFEDPLINRDPWITSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASIIKREWTKDEDDKLRSALAIFGLG---DWQAVASTLEGR

Query:  TGTQCSNRLILFTISDQWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERCVKWFNCLDPSLRRCEWTEEEDLRLEIAIQ
        TG QC +R         W K+LDP+  K+G ++P+ED  L  AV  +G  NW      + GR  VQCRER   + N LDP L +  WT +ED RL     
Subjt:  TGTQCSNRLILFTISDQWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERCVKWFNCLDPSLRRCEWTEEEDLRLEIAIQ

Query:  EHGY-SWAKVAACVPSRTDNDCRRRWKKL--FPNEVPLLQEARKIQKAALISNFVDRESERPALGPTD
        + G   W+ VA  + +RTDN C RRWK+L    N +   QE    +K   +SNF  R+ ER  L   D
Subjt:  EHGY-SWAKVAACVPSRTDNDCRRRWKKL--FPNEVPLLQEARKIQKAALISNFVDRESERPALGPTD

Q5SXM2 snRNA-activating protein complex subunit 41.0e-3132.24Show/hide
Query:  RHLWLNFEDPLINRDPWITSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASIIKREWTKDEDDKLRSALAIFGLGD---WQAVASTL
        R  W N E P IN+  W   E++ L       G   W +IA   GT+R+ FQCL ++Q+  N ++ ++EWT++ED  L   +    +G    ++ +   +
Subjt:  RHLWLNFEDPLINRDPWITSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASIIKREWTKDEDDKLRSALAIFGLGD---WQAVASTL

Query:  EGRTGTQCSNRLILFTISDQWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERCVKWFNCLDPSLRRCEWTEEEDLRLEI
        EGR   Q   R         W KSLDP   K+G++ P+ED++L  AV  +G ++W K  E +PGR+  QCR+R   +   L  SL++  W  +E+ +L  
Subjt:  EGRTGTQCSNRLILFTISDQWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERCVKWFNCLDPSLRRCEWTEEEDLRLEI

Query:  AIQEHGYS-WAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARK
         I+++G   WAK+A+ +P R+ + C  +WK +   +  L +  R+
Subjt:  AIQEHGYS-WAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARK

Q8BP86 snRNA-activating protein complex subunit 42.5e-3032.76Show/hide
Query:  RHLWLNFEDPLINRDPWITSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASIIKREWTKDEDDKLRSALAIFGLGD---WQAVASTL
        R  W + E P I++  W T E + L       G   W  +A   GT+R+ FQCL ++Q+  N ++ ++EWT++ED  L   +    +G+   ++ +   +
Subjt:  RHLWLNFEDPLINRDPWITSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASIIKREWTKDEDDKLRSALAIFGLGD---WQAVASTL

Query:  EGRTGTQCSNRLILFTISDQWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERCVKWFNCLDPSLRRCEWTEEEDLRLEI
        EGR   Q   R         W KSLDP+  KRG + P+ED++L  AV  +G ++W K  E +PGR+  QCR+R ++    L  SL++  W  +E+ +L  
Subjt:  EGRTGTQCSNRLILFTISDQWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERCVKWFNCLDPSLRRCEWTEEEDLRLEI

Query:  AIQEHGYS-WAKVAACVPSRTDNDCRRRWKKL
         I+++G   WA++A+ +P R+ + C  +WK L
Subjt:  AIQEHGYS-WAKVAACVPSRTDNDCRRRWKKL

Arabidopsis top hitse value%identityAlignment
AT3G18100.1 myb domain protein 4r13.2e-13441.58Show/hide
Query:  SLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRFKILKDFQGSCKRRTSCALSQMIDPRIQLMSAAKPQAKDSSK--KPCPITFTTRSLF
        S P+AA AF+DAI++NR+ QKF+R K+  +EA IE+N+K +K  +I+KDFQ SCKR T  AL Q  DPR++L+S  K    DSS+   PC         F
Subjt:  SLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRFKILKDFQGSCKRRTSCALSQMIDPRIQLMSAAKPQAKDSSK--KPCPITFTTRSLF

Query:  EQKDKRLSAMYYGPAENSHVACYRMALTKFP-RVDRKKWSIVERENLGKGIRQQFQEMVLQISVDQIRIIFCTLPRSTGIGPSVKDDEIEDENDGLSQMK
        E  DK++S +  GPAEN  V  YRMAL K+P  V R+KWS  E +NL KG++Q+ Q+++L  ++++                                  
Subjt:  EQKDKRLSAMYYGPAENSHVACYRMALTKFP-RVDRKKWSIVERENLGKGIRQQFQEMVLQISVDQIRIIFCTLPRSTGIGPSVKDDEIEDENDGLSQMK

Query:  GSFTVCDPLIDEFELRSGEMLVQKRTLNFDQESTLTLKIRVKVKYDSSYHKISINSSWGSYCRSGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKV
                                                                       S L+G +    D+D I  SI +L+ITPE IR+FLPK+
Subjt:  GSFTVCDPLIDEFELRSGEMLVQKRTLNFDQESTLTLKIRVKVKYDSSYHKISINSSWGSYCRSGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKV

Query:  NWDKLASMYLRGRSGAECEARYFLTESILIFFEACQLQSASQSGLHYVCDDNRHRHLWLNFEDPLINRDPWITSEDKNLLFTIQQKGLNNWIEIAVSSGT
        NWD   S+ ++ RS AECEAR                                    W++ EDPLIN  PW  +EDKNLL TI+Q  L +W++IAVS GT
Subjt:  NWDKLASMYLRGRSGAECEARYFLTESILIFFEACQLQSASQSGLHYVCDDNRHRHLWLNFEDPLINRDPWITSEDKNLLFTIQQKGLNNWIEIAVSSGT

Query:  NRTPFQCLSRYQRSLNASIIKREWTKDEDDKLRSALAIFGLGDWQAVASTLEGRTGTQCSNRLILFTISDQWKKSLDPARTKRGHFTPDEDSRLKIAVLL
        NRTPFQCL+RYQRSLN SI+K+EWT +EDD+LR+A+ +FG  DWQ+VA+ L+GRTGTQCSNR         WKKSL P  T++G ++ +ED R+K+AV L
Subjt:  NRTPFQCLSRYQRSLNASIIKREWTKDEDDKLRSALAIFGLGDWQAVASTLEGRTGTQCSNRLILFTISDQWKKSLDPARTKRGHFTPDEDSRLKIAVLL

Query:  FGPKNWNKKAEFLPGRNQVQCRERCVKWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAA
        FG +NW+K ++F+PGR Q QCRER   W NCLDP + R +WTEEED +L  AI EHGYSW+KVA  +  RTDN C RRWK+L+P++V LLQEAR++QK A
Subjt:  FGPKNWNKKAEFLPGRNQVQCRERCVKWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAA

Query:  LISNFVDRESERPALGPTDFRPRPNTDLLCDTDDPGPAPKRNVKTRKMPVSRNEKNVTGGAPKKKKSNCQR
         + NFVDRESERPAL  +     P+  L  + D      KR  K +K    R  K    G        C++
Subjt:  LISNFVDRESERPALGPTDFRPRPNTDLLCDTDDPGPAPKRNVKTRKMPVSRNEKNVTGGAPKKKKSNCQR

AT3G18100.2 myb domain protein 4r13.2e-13441.58Show/hide
Query:  SLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRFKILKDFQGSCKRRTSCALSQMIDPRIQLMSAAKPQAKDSSK--KPCPITFTTRSLF
        S P+AA AF+DAI++NR+ QKF+R K+  +EA IE+N+K +K  +I+KDFQ SCKR T  AL Q  DPR++L+S  K    DSS+   PC         F
Subjt:  SLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRFKILKDFQGSCKRRTSCALSQMIDPRIQLMSAAKPQAKDSSK--KPCPITFTTRSLF

Query:  EQKDKRLSAMYYGPAENSHVACYRMALTKFP-RVDRKKWSIVERENLGKGIRQQFQEMVLQISVDQIRIIFCTLPRSTGIGPSVKDDEIEDENDGLSQMK
        E  DK++S +  GPAEN  V  YRMAL K+P  V R+KWS  E +NL KG++Q+ Q+++L  ++++                                  
Subjt:  EQKDKRLSAMYYGPAENSHVACYRMALTKFP-RVDRKKWSIVERENLGKGIRQQFQEMVLQISVDQIRIIFCTLPRSTGIGPSVKDDEIEDENDGLSQMK

Query:  GSFTVCDPLIDEFELRSGEMLVQKRTLNFDQESTLTLKIRVKVKYDSSYHKISINSSWGSYCRSGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKV
                                                                       S L+G +    D+D I  SI +L+ITPE IR+FLPK+
Subjt:  GSFTVCDPLIDEFELRSGEMLVQKRTLNFDQESTLTLKIRVKVKYDSSYHKISINSSWGSYCRSGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKV

Query:  NWDKLASMYLRGRSGAECEARYFLTESILIFFEACQLQSASQSGLHYVCDDNRHRHLWLNFEDPLINRDPWITSEDKNLLFTIQQKGLNNWIEIAVSSGT
        NWD   S+ ++ RS AECEAR                                    W++ EDPLIN  PW  +EDKNLL TI+Q  L +W++IAVS GT
Subjt:  NWDKLASMYLRGRSGAECEARYFLTESILIFFEACQLQSASQSGLHYVCDDNRHRHLWLNFEDPLINRDPWITSEDKNLLFTIQQKGLNNWIEIAVSSGT

Query:  NRTPFQCLSRYQRSLNASIIKREWTKDEDDKLRSALAIFGLGDWQAVASTLEGRTGTQCSNRLILFTISDQWKKSLDPARTKRGHFTPDEDSRLKIAVLL
        NRTPFQCL+RYQRSLN SI+K+EWT +EDD+LR+A+ +FG  DWQ+VA+ L+GRTGTQCSNR         WKKSL P  T++G ++ +ED R+K+AV L
Subjt:  NRTPFQCLSRYQRSLNASIIKREWTKDEDDKLRSALAIFGLGDWQAVASTLEGRTGTQCSNRLILFTISDQWKKSLDPARTKRGHFTPDEDSRLKIAVLL

Query:  FGPKNWNKKAEFLPGRNQVQCRERCVKWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAA
        FG +NW+K ++F+PGR Q QCRER   W NCLDP + R +WTEEED +L  AI EHGYSW+KVA  +  RTDN C RRWK+L+P++V LLQEAR++QK A
Subjt:  FGPKNWNKKAEFLPGRNQVQCRERCVKWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAA

Query:  LISNFVDRESERPALGPTDFRPRPNTDLLCDTDDPGPAPKRNVKTRKMPVSRNEKNVTGGAPKKKKSNCQR
         + NFVDRESERPAL  +     P+  L  + D      KR  K +K    R  K    G        C++
Subjt:  LISNFVDRESERPALGPTDFRPRPNTDLLCDTDDPGPAPKRNVKTRKMPVSRNEKNVTGGAPKKKKSNCQR

AT3G18100.3 myb domain protein 4r12.8e-11438.54Show/hide
Query:  KMIHLEARIEENKKLRKRF-----------KILKDFQGSCKRRTSCALS--QMIDPRIQLMSAAKPQAKDSSK--KPCPITFTTRSLFEQKDKRLSAMYY
        KM+  + R++ + KL +R              L++     K+R+S   +  +M DPR++L+S  K    DSS+   PC         FE  DK++S +  
Subjt:  KMIHLEARIEENKKLRKRF-----------KILKDFQGSCKRRTSCALS--QMIDPRIQLMSAAKPQAKDSSK--KPCPITFTTRSLFEQKDKRLSAMYY

Query:  GPAENSHVACYRMALTKFP-RVDRKKWSIVERENLGKGIRQQFQEMVLQISVDQIRIIFCTLPRSTGIGPSVKDDEIEDENDGLSQMKGSFTVCDPLIDE
        GPAEN  V  YRMAL K+P  V R+KWS  E +NL KG++Q+ Q+++L  ++++                                              
Subjt:  GPAENSHVACYRMALTKFP-RVDRKKWSIVERENLGKGIRQQFQEMVLQISVDQIRIIFCTLPRSTGIGPSVKDDEIEDENDGLSQMKGSFTVCDPLIDE

Query:  FELRSGEMLVQKRTLNFDQESTLTLKIRVKVKYDSSYHKISINSSWGSYCRSGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRG
                                                           S L+G +    D+D I  SI +L+ITPE IR+FLPK+NWD   S+ ++ 
Subjt:  FELRSGEMLVQKRTLNFDQESTLTLKIRVKVKYDSSYHKISINSSWGSYCRSGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRG

Query:  RSGAECEARYFLTESILIFFEACQLQSASQSGLHYVCDDNRHRHLWLNFEDPLINRDPWITSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQ
        RS AECEAR                                    W++ EDPLIN  PW  +EDKNLL TI+Q  L +W++IAVS GTNRTPFQCL+RYQ
Subjt:  RSGAECEARYFLTESILIFFEACQLQSASQSGLHYVCDDNRHRHLWLNFEDPLINRDPWITSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQ

Query:  RSLNASIIKREWTKDEDDKLRSALAIFGLGDWQAVASTLEGRTGTQCSNRLILFTISDQWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEF
        RSLN SI+K+EWT +EDD+LR+A+ +FG  DWQ+VA+ L+GRTGTQCSNR         WKKSL P  T++G ++ +ED R+K+AV LFG +NW+K ++F
Subjt:  RSLNASIIKREWTKDEDDKLRSALAIFGLGDWQAVASTLEGRTGTQCSNRLILFTISDQWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEF

Query:  LPGRNQVQCRERCVKWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESER
        +PGR Q QCRER   W NCLDP + R +WTEEED +L  AI EHGYSW+KVA  +  RTDN C RRWK+L+P++V LLQEAR++QK A + NFVDRESER
Subjt:  LPGRNQVQCRERCVKWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESER

Query:  PALGPTDFRPRPNTDLLCDTDDPGPAPKRNVKTRKMPVSRNEKNVTGGAPKKKKSNCQR
        PAL  +     P+  L  + D      KR  K +K    R  K    G        C++
Subjt:  PALGPTDFRPRPNTDLLCDTDDPGPAPKRNVKTRKMPVSRNEKNVTGGAPKKKKSNCQR

AT5G11510.1 myb domain protein 3r-42.5e-2537.58Show/hide
Query:  EWTKDEDDKLRSALAIFGLGDWQAVASTLEGRTGTQCSNRLILFTISDQWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCR
        +WT +ED+ LR A+  F   +W+ +A   + RT  QC +R         W+K L+P   K G +T +ED  +   +  +GPK W+  A FLPGR   QCR
Subjt:  EWTKDEDDKLRSALAIFGLGDWQAVASTLEGRTGTQCSNRLILFTISDQWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCR

Query:  ERCVKWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNDCRRRW
        ER   W N L+P++ +  WT+EE+L L  A Q +G  WA++   +P R+DN  +  W
Subjt:  ERCVKWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNDCRRRW

AT5G11510.2 myb domain protein 3r-42.5e-2537.58Show/hide
Query:  EWTKDEDDKLRSALAIFGLGDWQAVASTLEGRTGTQCSNRLILFTISDQWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCR
        +WT +ED+ LR A+  F   +W+ +A   + RT  QC +R         W+K L+P   K G +T +ED  +   +  +GPK W+  A FLPGR   QCR
Subjt:  EWTKDEDDKLRSALAIFGLGDWQAVASTLEGRTGTQCSNRLILFTISDQWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCR

Query:  ERCVKWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNDCRRRW
        ER   W N L+P++ +  WT+EE+L L  A Q +G  WA++   +P R+DN  +  W
Subjt:  ERCVKWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNDCRRRW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATACTTTGAGCAATAAACCTAATCAGTCGTGTGACTATGTTGGGTCTCTGAAGATGGATTCTGACAACACAGCTGTTGAGAGTCAGATAGCCTCAGAAAGGCCATC
CATGCTAGCCTTTGAAAAGGGAAGCTTGCCAAAGGCTGCGTTGGCATTTATTGATGCCATTAAGAAGAATAGGTCCCAGCAGAAGTTTATTCGTAGCAAGATGATTCACC
TCGAAGCTAGAATTGAGGAGAACAAGAAGCTCAGAAAACGTTTCAAAATTCTCAAAGATTTCCAAGGTTCATGTAAACGAAGAACAAGCTGTGCACTGTCTCAGATGATA
GATCCTCGAATTCAGTTGATGTCAGCTGCAAAGCCACAAGCAAAGGATTCATCAAAGAAACCCTGCCCTATAACATTCACTACAAGAAGTCTTTTCGAACAAAAGGACAA
ACGATTATCTGCAATGTATTATGGCCCAGCTGAGAATTCTCATGTTGCATGTTACAGAATGGCATTGACGAAATTTCCTCGTGTAGATCGAAAAAAATGGTCCATTGTAG
AAAGGGAGAATCTAGGGAAGGGAATAAGACAGCAATTTCAAGAGATGGTGCTTCAGATTTCAGTGGATCAAATTAGAATTATATTTTGTACATTGCCGAGATCTACTGGG
ATTGGTCCTTCAGTTAAAGATGATGAAATTGAAGACGAAAACGACGGGTTGTCACAGATGAAGGGTAGTTTCACCGTCTGTGACCCACTGATCGATGAATTTGAATTACG
GTCAGGCGAAATGCTTGTTCAGAAGAGAACGCTTAATTTTGACCAAGAATCAACTTTGACGCTGAAGATTAGAGTTAAAGTTAAATACGACTCTAGTTACCATAAAATTA
GTATCAATTCAAGTTGGGGTTCATATTGCAGAAGTGGGCTACAAGGATTTTCTGCAGATTCCGATGATTTGGATAACATTCTTGCATCAATAAAAGACCTTGACATCACT
CCTGAAAAGATTAGGGAATTTCTGCCAAAAGTTAATTGGGACAAATTGGCTTCCATGTATCTTCGGGGTCGCTCAGGGGCAGAATGTGAAGCGAGGTATTTCTTGACCGA
GTCCATATTGATTTTCTTTGAAGCATGCCAACTTCAATCAGCATCGCAGAGTGGTCTTCATTATGTCTGTGATGACAATCGTCACAGACATCTGTGGTTGAATTTTGAAG
ACCCCCTAATTAATCGGGATCCATGGATTACAAGTGAGGATAAGAATCTTTTGTTTACTATCCAACAGAAGGGGTTGAATAACTGGATTGAGATAGCAGTTTCATCGGGT
ACAAACAGAACTCCTTTTCAGTGCTTGTCTCGGTATCAAAGGAGTTTAAACGCTTCCATAATAAAGAGGGAGTGGACCAAAGACGAGGATGATAAACTTCGATCTGCTCT
TGCTATATTTGGCCTGGGAGATTGGCAGGCAGTAGCTTCTACTTTGGAAGGACGAACTGGCACACAGTGCTCTAATAGGTTGATCTTATTTACAATATCTGACCAGTGGA
AAAAATCCCTTGACCCAGCTAGGACAAAAAGAGGTCATTTCACTCCAGATGAAGACAGTCGCTTGAAAATTGCTGTATTGCTCTTTGGGCCTAAAAATTGGAACAAGAAA
GCAGAATTTTTACCTGGTCGGAATCAAGTTCAATGCAGAGAAAGGTGTGTTAAATGGTTTAATTGTTTAGATCCTTCCTTGAGACGATGTGAATGGACGGAAGAGGAGGA
TTTAAGGCTGGAGATAGCAATTCAGGAACATGGATATAGCTGGGCTAAGGTAGCTGCATGTGTGCCGTCACGTACAGATAATGATTGCCGGAGGAGATGGAAGAAGTTAT
TTCCCAATGAAGTTCCTTTGCTCCAGGAAGCTAGAAAGATTCAGAAGGCTGCTCTTATTAGCAACTTTGTTGATAGGGAATCGGAGCGTCCTGCTCTTGGTCCTACTGAC
TTTCGACCCAGGCCAAATACAGATTTATTATGTGATACTGATGATCCAGGACCTGCCCCAAAAAGAAATGTGAAGACGAGAAAGATGCCAGTGTCAAGGAATGAAAAGAA
TGTTACTGGTGGTGCTCCAAAGAAGAAGAAATCAAATTGCCAGAGGAATCAGGCTGATACAACTGCTCAGGTGGGTATTGCAAATAATACGTCTTCCGTCCCAGAGGAGG
TTAAATCTTTAAAACCACAAAGAAAACGAAATAGACATGGAGCTTATACCGCGAAGAGTATAGGGGTTCTGGAGTTACGTCCTAACAATGAGTGGTGTGCTAAACAGAAT
TTGGACACACGGACCCTTGGGGTGCAGCTGAATGGTAAGGAATCTGAGTGGACCAACAGCGACTGCACCGAGACTGTTGATGAGAATGGTATGGAGGTATTTGAGAACAA
AGCTGCAGAGAAGCTTTCTGAAAAAGATGCATGCTTTTCAGAACCAGAAGAAAATCAAAACTCTACCACCGGATCTTCTGGAGTCTCGGTACTGTCAGAAATGACTAATG
ACATGGATGAATATAATCCCTCTATCCTTCCAGATACAACATTGTTGGCTAGTACTACTGTGGATGATATCAAAGAATTGAAGGGGAAGAGTGATGCAGAGAGAAATCTG
GATGACAGTAACAGTTTCTCGTTACCGCGCAGCGACTTAGAACTCAGGACTGACAGTGAAGGTGTGGACAGTTATTCTGTGGACGAATTCACAGATAAAAGCCATGGGGT
TTGCAAGCCCCCCCAAGGCAGAAGGAAGAAAAATAGTAAAACATCAAATAAGAGCCAGGATTATTCATTTGTTTCATGTCAACAAGTGGAGCTGGAGAGGTCGGGGACAA
ACAAGCCTCGTCATCATAATCAATCAGAAAAGAGAAAACGTAACAGTACAAATACAAATCTGGAGGCAGTTGAGGAGGTTGATGACTGTACTCTTGTGGGCTTTTTGCAA
AAAAGATCGAAGAAGACGAAAATCACAAATGACAAGAAAGTTGATTGCAGTTCTAATACTCCTCTAGAAGTTGATAACGACGACAACGATCTTACCATTGCCTCGTTTCT
TAATAAATTGAAGAGAAAAAAGCATTAG
mRNA sequenceShow/hide mRNA sequence
ATGGATACTTTGAGCAATAAACCTAATCAGTCGTGTGACTATGTTGGGTCTCTGAAGATGGATTCTGACAACACAGCTGTTGAGAGTCAGATAGCCTCAGAAAGGCCATC
CATGCTAGCCTTTGAAAAGGGAAGCTTGCCAAAGGCTGCGTTGGCATTTATTGATGCCATTAAGAAGAATAGGTCCCAGCAGAAGTTTATTCGTAGCAAGATGATTCACC
TCGAAGCTAGAATTGAGGAGAACAAGAAGCTCAGAAAACGTTTCAAAATTCTCAAAGATTTCCAAGGTTCATGTAAACGAAGAACAAGCTGTGCACTGTCTCAGATGATA
GATCCTCGAATTCAGTTGATGTCAGCTGCAAAGCCACAAGCAAAGGATTCATCAAAGAAACCCTGCCCTATAACATTCACTACAAGAAGTCTTTTCGAACAAAAGGACAA
ACGATTATCTGCAATGTATTATGGCCCAGCTGAGAATTCTCATGTTGCATGTTACAGAATGGCATTGACGAAATTTCCTCGTGTAGATCGAAAAAAATGGTCCATTGTAG
AAAGGGAGAATCTAGGGAAGGGAATAAGACAGCAATTTCAAGAGATGGTGCTTCAGATTTCAGTGGATCAAATTAGAATTATATTTTGTACATTGCCGAGATCTACTGGG
ATTGGTCCTTCAGTTAAAGATGATGAAATTGAAGACGAAAACGACGGGTTGTCACAGATGAAGGGTAGTTTCACCGTCTGTGACCCACTGATCGATGAATTTGAATTACG
GTCAGGCGAAATGCTTGTTCAGAAGAGAACGCTTAATTTTGACCAAGAATCAACTTTGACGCTGAAGATTAGAGTTAAAGTTAAATACGACTCTAGTTACCATAAAATTA
GTATCAATTCAAGTTGGGGTTCATATTGCAGAAGTGGGCTACAAGGATTTTCTGCAGATTCCGATGATTTGGATAACATTCTTGCATCAATAAAAGACCTTGACATCACT
CCTGAAAAGATTAGGGAATTTCTGCCAAAAGTTAATTGGGACAAATTGGCTTCCATGTATCTTCGGGGTCGCTCAGGGGCAGAATGTGAAGCGAGGTATTTCTTGACCGA
GTCCATATTGATTTTCTTTGAAGCATGCCAACTTCAATCAGCATCGCAGAGTGGTCTTCATTATGTCTGTGATGACAATCGTCACAGACATCTGTGGTTGAATTTTGAAG
ACCCCCTAATTAATCGGGATCCATGGATTACAAGTGAGGATAAGAATCTTTTGTTTACTATCCAACAGAAGGGGTTGAATAACTGGATTGAGATAGCAGTTTCATCGGGT
ACAAACAGAACTCCTTTTCAGTGCTTGTCTCGGTATCAAAGGAGTTTAAACGCTTCCATAATAAAGAGGGAGTGGACCAAAGACGAGGATGATAAACTTCGATCTGCTCT
TGCTATATTTGGCCTGGGAGATTGGCAGGCAGTAGCTTCTACTTTGGAAGGACGAACTGGCACACAGTGCTCTAATAGGTTGATCTTATTTACAATATCTGACCAGTGGA
AAAAATCCCTTGACCCAGCTAGGACAAAAAGAGGTCATTTCACTCCAGATGAAGACAGTCGCTTGAAAATTGCTGTATTGCTCTTTGGGCCTAAAAATTGGAACAAGAAA
GCAGAATTTTTACCTGGTCGGAATCAAGTTCAATGCAGAGAAAGGTGTGTTAAATGGTTTAATTGTTTAGATCCTTCCTTGAGACGATGTGAATGGACGGAAGAGGAGGA
TTTAAGGCTGGAGATAGCAATTCAGGAACATGGATATAGCTGGGCTAAGGTAGCTGCATGTGTGCCGTCACGTACAGATAATGATTGCCGGAGGAGATGGAAGAAGTTAT
TTCCCAATGAAGTTCCTTTGCTCCAGGAAGCTAGAAAGATTCAGAAGGCTGCTCTTATTAGCAACTTTGTTGATAGGGAATCGGAGCGTCCTGCTCTTGGTCCTACTGAC
TTTCGACCCAGGCCAAATACAGATTTATTATGTGATACTGATGATCCAGGACCTGCCCCAAAAAGAAATGTGAAGACGAGAAAGATGCCAGTGTCAAGGAATGAAAAGAA
TGTTACTGGTGGTGCTCCAAAGAAGAAGAAATCAAATTGCCAGAGGAATCAGGCTGATACAACTGCTCAGGTGGGTATTGCAAATAATACGTCTTCCGTCCCAGAGGAGG
TTAAATCTTTAAAACCACAAAGAAAACGAAATAGACATGGAGCTTATACCGCGAAGAGTATAGGGGTTCTGGAGTTACGTCCTAACAATGAGTGGTGTGCTAAACAGAAT
TTGGACACACGGACCCTTGGGGTGCAGCTGAATGGTAAGGAATCTGAGTGGACCAACAGCGACTGCACCGAGACTGTTGATGAGAATGGTATGGAGGTATTTGAGAACAA
AGCTGCAGAGAAGCTTTCTGAAAAAGATGCATGCTTTTCAGAACCAGAAGAAAATCAAAACTCTACCACCGGATCTTCTGGAGTCTCGGTACTGTCAGAAATGACTAATG
ACATGGATGAATATAATCCCTCTATCCTTCCAGATACAACATTGTTGGCTAGTACTACTGTGGATGATATCAAAGAATTGAAGGGGAAGAGTGATGCAGAGAGAAATCTG
GATGACAGTAACAGTTTCTCGTTACCGCGCAGCGACTTAGAACTCAGGACTGACAGTGAAGGTGTGGACAGTTATTCTGTGGACGAATTCACAGATAAAAGCCATGGGGT
TTGCAAGCCCCCCCAAGGCAGAAGGAAGAAAAATAGTAAAACATCAAATAAGAGCCAGGATTATTCATTTGTTTCATGTCAACAAGTGGAGCTGGAGAGGTCGGGGACAA
ACAAGCCTCGTCATCATAATCAATCAGAAAAGAGAAAACGTAACAGTACAAATACAAATCTGGAGGCAGTTGAGGAGGTTGATGACTGTACTCTTGTGGGCTTTTTGCAA
AAAAGATCGAAGAAGACGAAAATCACAAATGACAAGAAAGTTGATTGCAGTTCTAATACTCCTCTAGAAGTTGATAACGACGACAACGATCTTACCATTGCCTCGTTTCT
TAATAAATTGAAGAGAAAAAAGCATTAGCCGGGTAGCGATGGTGAGTAAAACTAGGAAGGATGACAAATAGTTTTGTTGCATTTTTGGGATATCTGAAGCAACATCACAT
TGGAACTCCGAAGAAGGTTAATGGAAGGCTGTACAAATAAAGGTTCCAAGGCCGCAAAATGGCCAAATGCTCAACTATATGGTTGATTCTATGGTTGACAAATTTTTGTT
TCGTGATTTAGCTAAGATGGCTTTGGCAATTTTCTTTTGTTTTGGTTCATTTACTTTTCCAATCTGTTATTATTATTATTTTTTTAAAATCTGAGAAGGTAGTGATAATC
TTCTCTTTGTACATTATTCTCATAATCTTAATGAAATCATAGGAAATGAATACAATGAATAGGTTTTTATT
Protein sequenceShow/hide protein sequence
MDTLSNKPNQSCDYVGSLKMDSDNTAVESQIASERPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRFKILKDFQGSCKRRTSCALSQMI
DPRIQLMSAAKPQAKDSSKKPCPITFTTRSLFEQKDKRLSAMYYGPAENSHVACYRMALTKFPRVDRKKWSIVERENLGKGIRQQFQEMVLQISVDQIRIIFCTLPRSTG
IGPSVKDDEIEDENDGLSQMKGSFTVCDPLIDEFELRSGEMLVQKRTLNFDQESTLTLKIRVKVKYDSSYHKISINSSWGSYCRSGLQGFSADSDDLDNILASIKDLDIT
PEKIREFLPKVNWDKLASMYLRGRSGAECEARYFLTESILIFFEACQLQSASQSGLHYVCDDNRHRHLWLNFEDPLINRDPWITSEDKNLLFTIQQKGLNNWIEIAVSSG
TNRTPFQCLSRYQRSLNASIIKREWTKDEDDKLRSALAIFGLGDWQAVASTLEGRTGTQCSNRLILFTISDQWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKK
AEFLPGRNQVQCRERCVKWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTD
FRPRPNTDLLCDTDDPGPAPKRNVKTRKMPVSRNEKNVTGGAPKKKKSNCQRNQADTTAQVGIANNTSSVPEEVKSLKPQRKRNRHGAYTAKSIGVLELRPNNEWCAKQN
LDTRTLGVQLNGKESEWTNSDCTETVDENGMEVFENKAAEKLSEKDACFSEPEENQNSTTGSSGVSVLSEMTNDMDEYNPSILPDTTLLASTTVDDIKELKGKSDAERNL
DDSNSFSLPRSDLELRTDSEGVDSYSVDEFTDKSHGVCKPPQGRRKKNSKTSNKSQDYSFVSCQQVELERSGTNKPRHHNQSEKRKRNSTNTNLEAVEEVDDCTLVGFLQ
KRSKKTKITNDKKVDCSSNTPLEVDNDDNDLTIASFLNKLKRKKH