; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG05G014580 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG05G014580
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionsnRNA-activating protein complex subunit 4
Genome locationCG_Chr05:24750910..24776529
RNA-Seq ExpressionClCG05G014580
SyntenyClCG05G014580
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000978 - RNA polymerase II proximal promoter sequence-specific DNA binding (molecular function)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
InterPro domainsIPR001005 - SANT/Myb domain
IPR009057 - Homeobox-like domain superfamily
IPR017884 - SANT domain
IPR017930 - Myb domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_011650584.1 uncharacterized protein LOC101216287 [Cucumis sativus]0.0e+0072.62Show/hide
Query:  MSHRNHDDEGDVELPVSKEDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTLPPMSLDEEE
        MS RNH DE DVE P  KED VVDEDME L+RAY+  GVNPEDYINPRLSSP AGDA+P SDSDDVDDFELLR+IQNRFS +ADEQP ST  P+S DEEE
Subjt:  MSHRNHDDEGDVELPVSKEDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTLPPMSLDEEE

Query:  DEFEMLRSIQRRFAAYESDTLSNKPNQLCDHVGSLKMDFDDTAVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKK
        DEFEMLRSIQRRFAAYESDTLSNKPNQ  D+VGSLK+D DD AVESQT+SKRPSMLAFE                                         
Subjt:  DEFEMLRSIQRRFAAYESDTLSNKPNQLCDHVGSLKMDFDDTAVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKK

Query:  LRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRFKILKDFQGS
                                                     KGSLPKAALAF+DAIKKNRSQQKFIRSKMIHLEARIEENKKLRKR KILKDFQGS
Subjt:  LRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRFKILKDFQGS

Query:  CKRRTTCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSAMHYGPAENSHVACYRMALMKFPRVDRKKWSVVERENLGKGIRQQFQEMVLQISVDQISGLQ
        CKRRT+CALSQMIDPRVQLISAAKPQAKDSSKKDKRLS M+YGP ENSHVACYRM L KFP VDRKKWS+VERENLGKGIRQQFQEMVLQISVDQI    
Subjt:  CKRRTTCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSAMHYGPAENSHVACYRMALMKFPRVDRKKWSVVERENLGKGIRQQFQEMVLQISVDQISGLQ

Query:  GFSADSDDLDSILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEASGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYL
                                                             SG QG S DSDDLDNILASIKDLDI P+KIREFLPKVNWDKLASMYL
Subjt:  GFSADSDDLDSILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEASGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYL

Query:  RGRSGAECEARWLNFEDPLINRDPWTTSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGDWQA
        +GRSGAECEARWLNFEDPLINRDPWTTSEDK+LLFTIQQKGLNNWIE+AVS GTNRTPFQCLSRYQRSLNASILKREWTK+EDD+LR+AVATFG+ DWQA
Subjt:  RGRSGAECEARWLNFEDPLINRDPWTTSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGDWQA

Query:  VASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGY
        VASTLEGR G QCSNRWKKSLDPART++G+FTPDED RLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGY
Subjt:  VASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGY

Query:  SWAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVSRNEKSAT
        SWAKVAACVPSRTDN+CRRRWKKLFP+EVPLLQEARKIQKAALISNFVDRE+ERPALGP DFRPRPNTD LCN+  P PAPKRNVKTRKMPVSRNEKSAT
Subjt:  SWAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVSRNEKSAT

Query:  GDAPKKRKSNCQRNQADVTAQAGIANNTSSVPEEVKSLKPQRKQNRHGAYTAE--GVPELCPNSEWCAKQSLDTQSLGVQLSLLESEGTNSDCIETVDEN
        GDAPKKRKSN QR Q D TAQ GIA NTS VPEEV+S KPQRK+NR GAYTA+  GVPEL  +SEWCAKQ+LDT+SLG+QL+  ESE +NS+C ETVDEN
Subjt:  GDAPKKRKSNCQRNQADVTAQAGIANNTSSVPEEVKSLKPQRKQNRHGAYTAE--GVPELCPNSEWCAKQSLDTQSLGVQLSLLESEGTNSDCIETVDEN

Query:  GMEVFENKVAEKLSERDVCFPEPEAIQNSTGSSGVSVLSEMTNDMDEYNPSILPDTTLLASTTVDDLKELKRKSVADRDLDDCNSFSLPCSDLELRTIDK
         MEV ENKVAEKL+E + CF EPE  QNSTGSSGVSVLSEMTND+ +YNPSIL DTTL ASTTVDD++ELK KS ADRDLDD NSFSL  S LELRT+D 
Subjt:  GMEVFENKVAEKLSERDVCFPEPEAIQNSTGSSGVSVLSEMTNDMDEYNPSILPDTTLLASTTVDDLKELKRKSVADRDLDDCNSFSLPCSDLELRTIDK

Query:  EGVDSYSMDEFTDKSHGVCKPPHTRRKKNSKTSSKSQDHSFVSCQQVELERLGTNEPRHHNQSKKRKHNSTNTNL----EAVEEVDDCTLVGFLQKRLKR
        EGVDSYS+DE+T KS+GVC P   RRKKNSKTS+ S D+  +  QQ+  E LGT +P HHNQSKKRKH++T  +     EAVEEVDDCTLVGFLQKRLKR
Subjt:  EGVDSYSMDEFTDKSHGVCKPPHTRRKKNSKTSSKSQDHSFVSCQQVELERLGTNEPRHHNQSKKRKHNSTNTNL----EAVEEVDDCTLVGFLQKRLKR

Query:  TTTTNDKKVDCSSSTPIEVDTDDNDPTLASFLNKLKRKKHQPSSGGD
        T  T+++ VDCSS+ P++VD DDN+PT+ASFLNKLKRKKHQ  SG +
Subjt:  TTTTNDKKVDCSSSTPIEVDTDDNDPTLASFLNKLKRKKHQPSSGGD

XP_023515735.1 uncharacterized protein LOC111779809 isoform X1 [Cucurbita pepo subsp. pepo]0.0e+0071.88Show/hide
Query:  MSHRNHDDEGDVELPVSK---EDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTLPPMSLD
        MS R+H D GD ELP S+   EDD+VD+DME LRRA +  GVN EDYINPRLS P AGDAN  SDSDDVDD ELLRNIQNRFS  ADEQPLS LPP++ D
Subjt:  MSHRNHDDEGDVELPVSK---EDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTLPPMSLD

Query:  EEEDEFEMLRSIQRRFAAYESDTLSNKPNQLCDHVGSLKMDFDDTAVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEE
        EEED+FEMLRSIQRRFAAYESD LSNKP+Q CD  G LKMD ++T VE  T+S+R SM+AFE                                      
Subjt:  EEEDEFEMLRSIQRRFAAYESDTLSNKPNQLCDHVGSLKMDFDDTAVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEE

Query:  NKKLRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRFKILKDF
                                                        KGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRFK+LK F
Subjt:  NKKLRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRFKILKDF

Query:  QGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSAMHYGPAENSHVACYRMALMKFPRVDRKKWSVVERENLGKGIRQQFQEMVLQISVDQIS
        QGSC+R+TTCAL+QM+DPRVQLISA KPQAKDSSKKDKRLS+M YGPAENSHVACYR A  KF  VDRK+WS  ERENLGKGIRQQFQEMVLQISVDQI 
Subjt:  QGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSAMHYGPAENSHVACYRMALMKFPRVDRKKWSVVERENLGKGIRQQFQEMVLQISVDQIS

Query:  GLQGFSADSDDLDSILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEASGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLAS
                                                                S +QGFSA+SDDLDNILASIK LDITPEKIREFLPKVNWDKLAS
Subjt:  GLQGFSADSDDLDSILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEASGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLAS

Query:  MYLRGRSGAECEARWLNFEDPLINRDPWTTSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGD
        MYLRGRSGAECEARWLNFEDPLINR+PWTTSEDKNLLFTIQQKGLNNWIE+AVS GTNRTPFQCLSRYQRSLNASILK EWTKDEDDKLR+AVA FG GD
Subjt:  MYLRGRSGAECEARWLNFEDPLINRDPWTTSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGD

Query:  WQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQE
        WQAVASTLEGRTGPQCSNRWKKSLDPARTKRG+FTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQE
Subjt:  WQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQE

Query:  HGYSWAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVSRNEK
        HGYSWAKVAACVPSRTDN+CRRRWKKLFPN+VPLLQEARKIQK ALISNFVDRESERPALGPTDFRP PN+ LLCN+ DP  APKRNV+TR+MPVSRNEK
Subjt:  HGYSWAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVSRNEK

Query:  SATGDAPKKRKSNCQRNQADVTAQAGIANNTSSVPEEVKSLKPQRKQNRHGAYTA--EGVPELCPNSEWCAKQSLDTQSLGVQLSLLE-SEGTNSDCIET
        SA GDAPK+RKSN QRN+AD TAQ    NNTSSVP EVKS KPQRK+ RHGAYT   +G P++  NSE CA+Q+ DT+S+ VQL+  E +E  NSDC ET
Subjt:  SATGDAPKKRKSNCQRNQADVTAQAGIANNTSSVPEEVKSLKPQRKQNRHGAYTA--EGVPELCPNSEWCAKQSLDTQSLGVQLSLLE-SEGTNSDCIET

Query:  VDENGMEVFENKVAEKLSERDVCFPEPEAIQNSTGSSGVSVLSEMTNDMDEYNPSILPDTTLLASTTVDDLKELKRKSVADRDLDDCNSFSLPCSDLELR
        VDENGMEVFENK AE  SE  VCF E E  QNSTGSSGVSVLSEMTNDMDEYNPS LPDTTLLAS T DD+ E K  +VAD+DLDD NSFSLP S LELR
Subjt:  VDENGMEVFENKVAEKLSERDVCFPEPEAIQNSTGSSGVSVLSEMTNDMDEYNPSILPDTTLLASTTVDDLKELKRKSVADRDLDDCNSFSLPCSDLELR

Query:  TIDKEGVDSYSMDEFTDKSHGVCKPPHTRRKKNSKTSSKSQDHSFVSCQQVELERLGTNEPRHHNQSKKRKHNSTNTN----LEAVEEVDDCTLVGFLQK
        T D EGVDSYS+DEFTDKSHGVCK P  RRKKNSK S+KSQD S VSCQQ ELE  GTNE    NQSKKRKH+ TNT+    +EAVEEVDDCTL GFLQK
Subjt:  TIDKEGVDSYSMDEFTDKSHGVCKPPHTRRKKNSKTSSKSQDHSFVSCQQVELERLGTNEPRHHNQSKKRKHNSTNTN----LEAVEEVDDCTLVGFLQK

Query:  RLKRTTTTNDKKVDCSSSTPIEVDTDDNDPTLASFLN-KLKRKKH
        RLKRTTTT+DKKVD SSSTP EVD DDNDPTLA  LN KLKRKKH
Subjt:  RLKRTTTTNDKKVDCSSSTPIEVDTDDNDPTLASFLN-KLKRKKH

XP_023515736.1 uncharacterized protein LOC111779809 isoform X2 [Cucurbita pepo subsp. pepo]0.0e+0071.79Show/hide
Query:  MSHRNHDDEGDVELPVSK---EDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTLPPMSLD
        MS R+H D GD ELP S+   EDD+VD+DME LRRA +  GVN EDYINPRLS P AGDAN  SDSDDVDD ELLRNIQNRFS  ADEQPLS LPP++ D
Subjt:  MSHRNHDDEGDVELPVSK---EDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTLPPMSLD

Query:  EEEDEFEMLRSIQRRFAAYESDTLSNKPNQLCDHVGSLKMDFDDTAVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEE
        EEED+FEMLRSIQRRFAAYESD LSNKP+Q CD  G LKMD ++T VE  T+S+R SM+AFE                                      
Subjt:  EEEDEFEMLRSIQRRFAAYESDTLSNKPNQLCDHVGSLKMDFDDTAVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEE

Query:  NKKLRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRFKILKDF
                                                        KGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRFK+LK F
Subjt:  NKKLRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRFKILKDF

Query:  QGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSAMHYGPAENSHVACYRMALMKFPRVDRKKWSVVERENLGKGIRQQFQEMVLQISVDQIS
        QGSC+R+TTCAL+QM+DPRVQLISA KPQAKDSSKKDKRLS+M YGPAENSHVACYR A  KF  VDRK+WS  ERENLGKGIRQQFQEMVLQISVDQI 
Subjt:  QGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSAMHYGPAENSHVACYRMALMKFPRVDRKKWSVVERENLGKGIRQQFQEMVLQISVDQIS

Query:  GLQGFSADSDDLDSILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEASGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLAS
                                                                S +QGFSA+SDDLDNILASIK LDITPEKIREFLPKVNWDKLAS
Subjt:  GLQGFSADSDDLDSILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEASGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLAS

Query:  MYLRGRSGAECEARWLNFEDPLINRDPWTTSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGD
        MYLRGRSGAECEARWLNFEDPLINR+PWTTSEDKNLLFTIQQKGLNNWIE+AVS GTNRTPFQCLSRYQRSLNASILK EWTKDEDDKLR+AVA FG GD
Subjt:  MYLRGRSGAECEARWLNFEDPLINRDPWTTSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGD

Query:  WQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQE
        WQAVASTLEGRTGPQCSNRWKKSLDPARTKRG+FTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQE
Subjt:  WQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQE

Query:  HGYSWAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVSRNEK
        HGYSWAKVAACVPSRTDN+C RRWKKLFPN+VPLLQEARKIQK ALISNFVDRESERPALGPTDFRP PN+ LLCN+ DP  APKRNV+TR+MPVSRNEK
Subjt:  HGYSWAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVSRNEK

Query:  SATGDAPKKRKSNCQRNQADVTAQAGIANNTSSVPEEVKSLKPQRKQNRHGAYTA--EGVPELCPNSEWCAKQSLDTQSLGVQLSLLE-SEGTNSDCIET
        SA GDAPK+RKSN QRN+AD TAQ    NNTSSVP EVKS KPQRK+ RHGAYT   +G P++  NSE CA+Q+ DT+S+ VQL+  E +E  NSDC ET
Subjt:  SATGDAPKKRKSNCQRNQADVTAQAGIANNTSSVPEEVKSLKPQRKQNRHGAYTA--EGVPELCPNSEWCAKQSLDTQSLGVQLSLLE-SEGTNSDCIET

Query:  VDENGMEVFENKVAEKLSERDVCFPEPEAIQNSTGSSGVSVLSEMTNDMDEYNPSILPDTTLLASTTVDDLKELKRKSVADRDLDDCNSFSLPCSDLELR
        VDENGMEVFENK AE  SE  VCF E E  QNSTGSSGVSVLSEMTNDMDEYNPS LPDTTLLAS T DD+ E K  +VAD+DLDD NSFSLP S LELR
Subjt:  VDENGMEVFENKVAEKLSERDVCFPEPEAIQNSTGSSGVSVLSEMTNDMDEYNPSILPDTTLLASTTVDDLKELKRKSVADRDLDDCNSFSLPCSDLELR

Query:  TIDKEGVDSYSMDEFTDKSHGVCKPPHTRRKKNSKTSSKSQDHSFVSCQQVELERLGTNEPRHHNQSKKRKHNSTNTN----LEAVEEVDDCTLVGFLQK
        T D EGVDSYS+DEFTDKSHGVCK P  RRKKNSK S+KSQD S VSCQQ ELE  GTNE    NQSKKRKH+ TNT+    +EAVEEVDDCTL GFLQK
Subjt:  TIDKEGVDSYSMDEFTDKSHGVCKPPHTRRKKNSKTSSKSQDHSFVSCQQVELERLGTNEPRHHNQSKKRKHNSTNTN----LEAVEEVDDCTLVGFLQK

Query:  RLKRTTTTNDKKVDCSSSTPIEVDTDDNDPTLASFLN-KLKRKKH
        RLKRTTTT+DKKVD SSSTP EVD DDNDPTLA  LN KLKRKKH
Subjt:  RLKRTTTTNDKKVDCSSSTPIEVDTDDNDPTLASFLN-KLKRKKH

XP_038905712.1 uncharacterized protein LOC120091681 isoform X1 [Benincasa hispida]0.0e+0075.32Show/hide
Query:  VRCAHSCSSMSHRNHDDEGDVELPVSKEDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTL
        VR AH  ++MS  NH DEGDVELP +KEDDVVDEDME L+RAY+ VGVNPEDYINPRLSSP  GDAN   DSDD DDFELLRNIQNRFS V DEQPLSTL
Subjt:  VRCAHSCSSMSHRNHDDEGDVELPVSKEDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTL

Query:  PPMSLDEEEDEFEMLRSIQRRFAAYESDTLSNKPNQLCDHVGSLKMDFDDTAVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHL
        PP+SLDEEEDEFEMLRSIQRRFAAYESD LSNKPN+  D+VGSLKMD  +TA ESQT+SKRPSM+AFE                                
Subjt:  PPMSLDEEEDEFEMLRSIQRRFAAYESDTLSNKPNQLCDHVGSLKMDFDDTAVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHL

Query:  EARIEENKKLRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRF
                                                              KGSLPKAALAF+DAIKKNRSQQKFIRSKMIHLEARIEENKKLRKR 
Subjt:  EARIEENKKLRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRF

Query:  KILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSAMHYGPAENSHVACYRMALMKFPRVDRKKWSVVERENLGKGIRQQFQEMVLQI
        KILKDFQGSCKR+TTCALSQMIDPRVQLISAAKPQAKDSSKKDKRLS M+YGPAENSHVACYRMAL KFPRVDRKKWS+VERENLGKGIRQQFQEMVLQI
Subjt:  KILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSAMHYGPAENSHVACYRMALMKFPRVDRKKWSVVERENLGKGIRQQFQEMVLQI

Query:  SVDQISGLQGFSADSDDLDSILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEASGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVN
        SVDQI                                                         SGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVN
Subjt:  SVDQISGLQGFSADSDDLDSILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEASGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVN

Query:  WDKLASMYLRGRSGAECEARWLNFEDPLINRDPWTTSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVA
        WDKLASMYL GRSGAECEARWLNFEDPLINRDPWTTSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLR+AVA
Subjt:  WDKLASMYLRGRSGAECEARWLNFEDPLINRDPWTTSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVA

Query:  TFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRL
         FG+ DWQAVASTLEGR G QCSNRWKKSLDPARTKRGHFTPDED+RLKIAVLL GPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRL
Subjt:  TFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRL

Query:  EIAIQEHGYSWAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMP
        EIAIQEHGYSWAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGP DFRPR NTD+LC++ DP+PAPKRN KTRKMP
Subjt:  EIAIQEHGYSWAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMP

Query:  VSRNEKSATGDAPKKRKSNCQRNQADVTAQAGIANNTSSVPEEVKSLKPQRKQNRHGAYTAE--GVPELCPNSEWCAKQSLDTQSLGVQLSLLESEGTNS
        VSRNEKSATGDAP+KRKSN QRNQAD TA+ GIANNTSSVPEEV+SLKP RK+NRH A T +  GV EL  N +WCAKQ+L+T+S+GVQLS  E E TNS
Subjt:  VSRNEKSATGDAPKKRKSNCQRNQADVTAQAGIANNTSSVPEEVKSLKPQRKQNRHGAYTAE--GVPELCPNSEWCAKQSLDTQSLGVQLSLLESEGTNS

Query:  DCIETVDENGMEVFENKVAEKLSERDVCFPEPEAIQNSTGSSGVSVLSEMTNDMDEYNPSILPDTTLLASTTVDDLKELKRKSVADRDLDDCNSFSLPCS
        D  ETVD NG+EVFENK+A+KLSERDV F EPE  QNSTGSSGVSVLSEMTNDMDEYNPSILPDTTLLASTTVDD++ELK KS ADRDLDD NSFSLP S
Subjt:  DCIETVDENGMEVFENKVAEKLSERDVCFPEPEAIQNSTGSSGVSVLSEMTNDMDEYNPSILPDTTLLASTTVDDLKELKRKSVADRDLDDCNSFSLPCS

Query:  DLELRTIDKEGVDSYSMDEFTDKSHGVCKPPHTRRKKNSKTSSKSQDHSFVSCQQVELERLGTNEPRHHNQSKKRKHNSTNTN----LEAVEEVDDCTLV
         LELRTID EGVDSYS+D+ TDKSH VCK P  RRKKNSKTS K+ ++SF+SCQQVE ERLG NEPRH NQSKKRKH+STNT+    LEAVEEVD+CTLV
Subjt:  DLELRTIDKEGVDSYSMDEFTDKSHGVCKPPHTRRKKNSKTSSKSQDHSFVSCQQVELERLGTNEPRHHNQSKKRKHNSTNTN----LEAVEEVDDCTLV

Query:  GFLQKRLKRTTTTNDKKVDCSSSTPIEVDTDDNDPTLASFLNKLKRKKHQPSSGG
        GFLQKRL        KKVDCSS TP+EVD DDND  +ASFLNKLKRKKHQP S G
Subjt:  GFLQKRLKRTTTTNDKKVDCSSSTPIEVDTDDNDPTLASFLNKLKRKKHQPSSGG

XP_038905717.1 uncharacterized protein LOC120091681 isoform X2 [Benincasa hispida]0.0e+0075.57Show/hide
Query:  MSHRNHDDEGDVELPVSKEDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTLPPMSLDEEE
        MS  NH DEGDVELP +KEDDVVDEDME L+RAY+ VGVNPEDYINPRLSSP  GDAN   DSDD DDFELLRNIQNRFS V DEQPLSTLPP+SLDEEE
Subjt:  MSHRNHDDEGDVELPVSKEDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTLPPMSLDEEE

Query:  DEFEMLRSIQRRFAAYESDTLSNKPNQLCDHVGSLKMDFDDTAVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKK
        DEFEMLRSIQRRFAAYESD LSNKPN+  D+VGSLKMD  +TA ESQT+SKRPSM+AFE                                         
Subjt:  DEFEMLRSIQRRFAAYESDTLSNKPNQLCDHVGSLKMDFDDTAVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKK

Query:  LRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRFKILKDFQGS
                                                     KGSLPKAALAF+DAIKKNRSQQKFIRSKMIHLEARIEENKKLRKR KILKDFQGS
Subjt:  LRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRFKILKDFQGS

Query:  CKRRTTCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSAMHYGPAENSHVACYRMALMKFPRVDRKKWSVVERENLGKGIRQQFQEMVLQISVDQISGLQ
        CKR+TTCALSQMIDPRVQLISAAKPQAKDSSKKDKRLS M+YGPAENSHVACYRMAL KFPRVDRKKWS+VERENLGKGIRQQFQEMVLQISVDQI    
Subjt:  CKRRTTCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSAMHYGPAENSHVACYRMALMKFPRVDRKKWSVVERENLGKGIRQQFQEMVLQISVDQISGLQ

Query:  GFSADSDDLDSILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEASGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYL
                                                             SGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYL
Subjt:  GFSADSDDLDSILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEASGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYL

Query:  RGRSGAECEARWLNFEDPLINRDPWTTSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGDWQA
         GRSGAECEARWLNFEDPLINRDPWTTSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLR+AVA FG+ DWQA
Subjt:  RGRSGAECEARWLNFEDPLINRDPWTTSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGDWQA

Query:  VASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGY
        VASTLEGR G QCSNRWKKSLDPARTKRGHFTPDED+RLKIAVLL GPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGY
Subjt:  VASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGY

Query:  SWAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVSRNEKSAT
        SWAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGP DFRPR NTD+LC++ DP+PAPKRN KTRKMPVSRNEKSAT
Subjt:  SWAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVSRNEKSAT

Query:  GDAPKKRKSNCQRNQADVTAQAGIANNTSSVPEEVKSLKPQRKQNRHGAYTAE--GVPELCPNSEWCAKQSLDTQSLGVQLSLLESEGTNSDCIETVDEN
        GDAP+KRKSN QRNQAD TA+ GIANNTSSVPEEV+SLKP RK+NRH A T +  GV EL  N +WCAKQ+L+T+S+GVQLS  E E TNSD  ETVD N
Subjt:  GDAPKKRKSNCQRNQADVTAQAGIANNTSSVPEEVKSLKPQRKQNRHGAYTAE--GVPELCPNSEWCAKQSLDTQSLGVQLSLLESEGTNSDCIETVDEN

Query:  GMEVFENKVAEKLSERDVCFPEPEAIQNSTGSSGVSVLSEMTNDMDEYNPSILPDTTLLASTTVDDLKELKRKSVADRDLDDCNSFSLPCSDLELRTIDK
        G+EVFENK+A+KLSERDV F EPE  QNSTGSSGVSVLSEMTNDMDEYNPSILPDTTLLASTTVDD++ELK KS ADRDLDD NSFSLP S LELRTID 
Subjt:  GMEVFENKVAEKLSERDVCFPEPEAIQNSTGSSGVSVLSEMTNDMDEYNPSILPDTTLLASTTVDDLKELKRKSVADRDLDDCNSFSLPCSDLELRTIDK

Query:  EGVDSYSMDEFTDKSHGVCKPPHTRRKKNSKTSSKSQDHSFVSCQQVELERLGTNEPRHHNQSKKRKHNSTNTN----LEAVEEVDDCTLVGFLQKRLKR
        EGVDSYS+D+ TDKSH VCK P  RRKKNSKTS K+ ++SF+SCQQVE ERLG NEPRH NQSKKRKH+STNT+    LEAVEEVD+CTLVGFLQKRL  
Subjt:  EGVDSYSMDEFTDKSHGVCKPPHTRRKKNSKTSSKSQDHSFVSCQQVELERLGTNEPRHHNQSKKRKHNSTNTN----LEAVEEVDDCTLVGFLQKRLKR

Query:  TTTTNDKKVDCSSSTPIEVDTDDNDPTLASFLNKLKRKKHQPSSGG
              KKVDCSS TP+EVD DDND  +ASFLNKLKRKKHQP S G
Subjt:  TTTTNDKKVDCSSSTPIEVDTDDNDPTLASFLNKLKRKKHQPSSGG

TrEMBL top hitse value%identityAlignment
A0A0A0L2R2 Uncharacterized protein0.0e+0072.62Show/hide
Query:  MSHRNHDDEGDVELPVSKEDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTLPPMSLDEEE
        MS RNH DE DVE P  KED VVDEDME L+RAY+  GVNPEDYINPRLSSP AGDA+P SDSDDVDDFELLR+IQNRFS +ADEQP ST  P+S DEEE
Subjt:  MSHRNHDDEGDVELPVSKEDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTLPPMSLDEEE

Query:  DEFEMLRSIQRRFAAYESDTLSNKPNQLCDHVGSLKMDFDDTAVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKK
        DEFEMLRSIQRRFAAYESDTLSNKPNQ  D+VGSLK+D DD AVESQT+SKRPSMLAFE                                         
Subjt:  DEFEMLRSIQRRFAAYESDTLSNKPNQLCDHVGSLKMDFDDTAVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKK

Query:  LRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRFKILKDFQGS
                                                     KGSLPKAALAF+DAIKKNRSQQKFIRSKMIHLEARIEENKKLRKR KILKDFQGS
Subjt:  LRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRFKILKDFQGS

Query:  CKRRTTCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSAMHYGPAENSHVACYRMALMKFPRVDRKKWSVVERENLGKGIRQQFQEMVLQISVDQISGLQ
        CKRRT+CALSQMIDPRVQLISAAKPQAKDSSKKDKRLS M+YGP ENSHVACYRM L KFP VDRKKWS+VERENLGKGIRQQFQEMVLQISVDQI    
Subjt:  CKRRTTCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSAMHYGPAENSHVACYRMALMKFPRVDRKKWSVVERENLGKGIRQQFQEMVLQISVDQISGLQ

Query:  GFSADSDDLDSILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEASGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYL
                                                             SG QG S DSDDLDNILASIKDLDI P+KIREFLPKVNWDKLASMYL
Subjt:  GFSADSDDLDSILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEASGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYL

Query:  RGRSGAECEARWLNFEDPLINRDPWTTSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGDWQA
        +GRSGAECEARWLNFEDPLINRDPWTTSEDK+LLFTIQQKGLNNWIE+AVS GTNRTPFQCLSRYQRSLNASILKREWTK+EDD+LR+AVATFG+ DWQA
Subjt:  RGRSGAECEARWLNFEDPLINRDPWTTSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGDWQA

Query:  VASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGY
        VASTLEGR G QCSNRWKKSLDPART++G+FTPDED RLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGY
Subjt:  VASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGY

Query:  SWAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVSRNEKSAT
        SWAKVAACVPSRTDN+CRRRWKKLFP+EVPLLQEARKIQKAALISNFVDRE+ERPALGP DFRPRPNTD LCN+  P PAPKRNVKTRKMPVSRNEKSAT
Subjt:  SWAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVSRNEKSAT

Query:  GDAPKKRKSNCQRNQADVTAQAGIANNTSSVPEEVKSLKPQRKQNRHGAYTAE--GVPELCPNSEWCAKQSLDTQSLGVQLSLLESEGTNSDCIETVDEN
        GDAPKKRKSN QR Q D TAQ GIA NTS VPEEV+S KPQRK+NR GAYTA+  GVPEL  +SEWCAKQ+LDT+SLG+QL+  ESE +NS+C ETVDEN
Subjt:  GDAPKKRKSNCQRNQADVTAQAGIANNTSSVPEEVKSLKPQRKQNRHGAYTAE--GVPELCPNSEWCAKQSLDTQSLGVQLSLLESEGTNSDCIETVDEN

Query:  GMEVFENKVAEKLSERDVCFPEPEAIQNSTGSSGVSVLSEMTNDMDEYNPSILPDTTLLASTTVDDLKELKRKSVADRDLDDCNSFSLPCSDLELRTIDK
         MEV ENKVAEKL+E + CF EPE  QNSTGSSGVSVLSEMTND+ +YNPSIL DTTL ASTTVDD++ELK KS ADRDLDD NSFSL  S LELRT+D 
Subjt:  GMEVFENKVAEKLSERDVCFPEPEAIQNSTGSSGVSVLSEMTNDMDEYNPSILPDTTLLASTTVDDLKELKRKSVADRDLDDCNSFSLPCSDLELRTIDK

Query:  EGVDSYSMDEFTDKSHGVCKPPHTRRKKNSKTSSKSQDHSFVSCQQVELERLGTNEPRHHNQSKKRKHNSTNTNL----EAVEEVDDCTLVGFLQKRLKR
        EGVDSYS+DE+T KS+GVC P   RRKKNSKTS+ S D+  +  QQ+  E LGT +P HHNQSKKRKH++T  +     EAVEEVDDCTLVGFLQKRLKR
Subjt:  EGVDSYSMDEFTDKSHGVCKPPHTRRKKNSKTSSKSQDHSFVSCQQVELERLGTNEPRHHNQSKKRKHNSTNTNL----EAVEEVDDCTLVGFLQKRLKR

Query:  TTTTNDKKVDCSSSTPIEVDTDDNDPTLASFLNKLKRKKHQPSSGGD
        T  T+++ VDCSS+ P++VD DDN+PT+ASFLNKLKRKKHQ  SG +
Subjt:  TTTTNDKKVDCSSSTPIEVDTDDNDPTLASFLNKLKRKKHQPSSGGD

A0A6J1E2J4 uncharacterized protein LOC111430000 isoform X20.0e+0071.44Show/hide
Query:  MSHRNHDDEGDVELPVSK---EDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTLPPMSLD
        MS R+H D GD ELP S+   EDD+VD+DME LRRA +  GVN ED INPRLS P AGDAN  SDSDDVDD ELLRNIQNRFS  ADEQPLS LPP++ D
Subjt:  MSHRNHDDEGDVELPVSK---EDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTLPPMSLD

Query:  EEEDEFEMLRSIQRRFAAYESDTLSNKPNQLCDHVGSLKMDFDDTAVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEE
        EEED+FE LRSIQRRFAAYESD LSNKP+Q CD  G LKMD D+T V   T+S+R SM+AFE                                      
Subjt:  EEEDEFEMLRSIQRRFAAYESDTLSNKPNQLCDHVGSLKMDFDDTAVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEE

Query:  NKKLRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRFKILKDF
                                                        KGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRFK+LK F
Subjt:  NKKLRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRFKILKDF

Query:  QGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSAMHYGPAENSHVACYRMALMKFPRVDRKKWSVVERENLGKGIRQQFQEMVLQISVDQIS
        QGSC+R+TTCAL+QM+DPRVQLISA KPQAKDSSKKDKRLSAM YGPAENSHVACYR AL KF  VDRK+WS  ERENLGKGIRQQFQEMVLQISVDQI 
Subjt:  QGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSAMHYGPAENSHVACYRMALMKFPRVDRKKWSVVERENLGKGIRQQFQEMVLQISVDQIS

Query:  GLQGFSADSDDLDSILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEASGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLAS
                                                                S +QGFSA+SDDLDNILASIK LDITPEKIREFLPKVNWDKLA 
Subjt:  GLQGFSADSDDLDSILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEASGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLAS

Query:  MYLRGRSGAECEARWLNFEDPLINRDPWTTSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGD
        MYL+GRSGAECEARWLNFEDPLINR+ WTTSEDKNLLFTIQQKGLNNWIE+AVS GTNRTPFQCLSRYQRSLNASILK EWTKDEDDKLR+AVA FG GD
Subjt:  MYLRGRSGAECEARWLNFEDPLINRDPWTTSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGD

Query:  WQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQE
        WQAVASTLEGRTGPQCSNRWKKSLDPARTKRG+FTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQE
Subjt:  WQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQE

Query:  HGYSWAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVSRNEK
        HGYSWAKVAACVPSRTDN+C RRWKKLFPN+VPLLQEARKIQK ALISNFVDRESERPALGPTDFRP PN+ LLCN+ DP  APKRNV+ R+MPVSRNEK
Subjt:  HGYSWAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVSRNEK

Query:  SATGDAPKKRKSNCQRNQADVTAQAGIANNTSSVPEEVKSLKPQRKQNRHGAYTA--EGVPELCPNSEWCAKQSLDTQSLGVQLSLLE-SEGTNSDCIET
        SA GDAPKK KSN QRNQAD TAQ   ANNTSSVP EVKS KPQRK+ RHGAYT   +G P++  NSE CA+Q+ DT+SL VQL+  E +E  NSDC ET
Subjt:  SATGDAPKKRKSNCQRNQADVTAQAGIANNTSSVPEEVKSLKPQRKQNRHGAYTA--EGVPELCPNSEWCAKQSLDTQSLGVQLSLLE-SEGTNSDCIET

Query:  VDENGMEVFENKVAEKLSERDVCFPEPEAIQNSTGSSGVSVLSEMTNDMDEYNPSILPDTTLLASTTVDDLKELKRKSVADRDLDDCNSFSLPCSDLELR
        VDENGMEVFENK AE  SE  VCF E E  QNSTGSSGVSVLSEMTNDMDEYNPS  PDTTLLAS T DD+ E K  +VAD+DLDD NSFSLP S LELR
Subjt:  VDENGMEVFENKVAEKLSERDVCFPEPEAIQNSTGSSGVSVLSEMTNDMDEYNPSILPDTTLLASTTVDDLKELKRKSVADRDLDDCNSFSLPCSDLELR

Query:  TIDKEGVDSYSMDEFTDKSHGVCKPPHTRRKKNSKTSSKSQDHSFVSCQQVELERLGTNEPRHHNQSKKRKHNSTNTN----LEAVEEVDDCTLVGFLQK
        T D EGVDSYS+DEFTDKSHGVCK P  RRKKNSK S+KSQD S VSCQQ ELE  G NE    NQSKKRKH+ TNT+    +EAVEEVDDCTL GFLQK
Subjt:  TIDKEGVDSYSMDEFTDKSHGVCKPPHTRRKKNSKTSSKSQDHSFVSCQQVELERLGTNEPRHHNQSKKRKHNSTNTN----LEAVEEVDDCTLVGFLQK

Query:  RLKRTTTTNDKKVDCSSSTPIEVDTDDNDPTLASFL-NKLKRKKH
        RLKRTTTT+DKKVD SSSTP EVD DDNDPTLA  L +KLKRKKH
Subjt:  RLKRTTTTNDKKVDCSSSTPIEVDTDDNDPTLASFL-NKLKRKKH

A0A6J1E6Z7 uncharacterized protein LOC111430000 isoform X10.0e+0071.53Show/hide
Query:  MSHRNHDDEGDVELPVSK---EDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTLPPMSLD
        MS R+H D GD ELP S+   EDD+VD+DME LRRA +  GVN ED INPRLS P AGDAN  SDSDDVDD ELLRNIQNRFS  ADEQPLS LPP++ D
Subjt:  MSHRNHDDEGDVELPVSK---EDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTLPPMSLD

Query:  EEEDEFEMLRSIQRRFAAYESDTLSNKPNQLCDHVGSLKMDFDDTAVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEE
        EEED+FE LRSIQRRFAAYESD LSNKP+Q CD  G LKMD D+T V   T+S+R SM+AFE                                      
Subjt:  EEEDEFEMLRSIQRRFAAYESDTLSNKPNQLCDHVGSLKMDFDDTAVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEE

Query:  NKKLRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRFKILKDF
                                                        KGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRFK+LK F
Subjt:  NKKLRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRFKILKDF

Query:  QGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSAMHYGPAENSHVACYRMALMKFPRVDRKKWSVVERENLGKGIRQQFQEMVLQISVDQIS
        QGSC+R+TTCAL+QM+DPRVQLISA KPQAKDSSKKDKRLSAM YGPAENSHVACYR AL KF  VDRK+WS  ERENLGKGIRQQFQEMVLQISVDQI 
Subjt:  QGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSAMHYGPAENSHVACYRMALMKFPRVDRKKWSVVERENLGKGIRQQFQEMVLQISVDQIS

Query:  GLQGFSADSDDLDSILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEASGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLAS
                                                                S +QGFSA+SDDLDNILASIK LDITPEKIREFLPKVNWDKLA 
Subjt:  GLQGFSADSDDLDSILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEASGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLAS

Query:  MYLRGRSGAECEARWLNFEDPLINRDPWTTSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGD
        MYL+GRSGAECEARWLNFEDPLINR+ WTTSEDKNLLFTIQQKGLNNWIE+AVS GTNRTPFQCLSRYQRSLNASILK EWTKDEDDKLR+AVA FG GD
Subjt:  MYLRGRSGAECEARWLNFEDPLINRDPWTTSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGD

Query:  WQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQE
        WQAVASTLEGRTGPQCSNRWKKSLDPARTKRG+FTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQE
Subjt:  WQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQE

Query:  HGYSWAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVSRNEK
        HGYSWAKVAACVPSRTDN+CRRRWKKLFPN+VPLLQEARKIQK ALISNFVDRESERPALGPTDFRP PN+ LLCN+ DP  APKRNV+ R+MPVSRNEK
Subjt:  HGYSWAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVSRNEK

Query:  SATGDAPKKRKSNCQRNQADVTAQAGIANNTSSVPEEVKSLKPQRKQNRHGAYTA--EGVPELCPNSEWCAKQSLDTQSLGVQLSLLE-SEGTNSDCIET
        SA GDAPKK KSN QRNQAD TAQ   ANNTSSVP EVKS KPQRK+ RHGAYT   +G P++  NSE CA+Q+ DT+SL VQL+  E +E  NSDC ET
Subjt:  SATGDAPKKRKSNCQRNQADVTAQAGIANNTSSVPEEVKSLKPQRKQNRHGAYTA--EGVPELCPNSEWCAKQSLDTQSLGVQLSLLE-SEGTNSDCIET

Query:  VDENGMEVFENKVAEKLSERDVCFPEPEAIQNSTGSSGVSVLSEMTNDMDEYNPSILPDTTLLASTTVDDLKELKRKSVADRDLDDCNSFSLPCSDLELR
        VDENGMEVFENK AE  SE  VCF E E  QNSTGSSGVSVLSEMTNDMDEYNPS  PDTTLLAS T DD+ E K  +VAD+DLDD NSFSLP S LELR
Subjt:  VDENGMEVFENKVAEKLSERDVCFPEPEAIQNSTGSSGVSVLSEMTNDMDEYNPSILPDTTLLASTTVDDLKELKRKSVADRDLDDCNSFSLPCSDLELR

Query:  TIDKEGVDSYSMDEFTDKSHGVCKPPHTRRKKNSKTSSKSQDHSFVSCQQVELERLGTNEPRHHNQSKKRKHNSTNTN----LEAVEEVDDCTLVGFLQK
        T D EGVDSYS+DEFTDKSHGVCK P  RRKKNSK S+KSQD S VSCQQ ELE  G NE    NQSKKRKH+ TNT+    +EAVEEVDDCTL GFLQK
Subjt:  TIDKEGVDSYSMDEFTDKSHGVCKPPHTRRKKNSKTSSKSQDHSFVSCQQVELERLGTNEPRHHNQSKKRKHNSTNTN----LEAVEEVDDCTLVGFLQK

Query:  RLKRTTTTNDKKVDCSSSTPIEVDTDDNDPTLASFL-NKLKRKKH
        RLKRTTTT+DKKVD SSSTP EVD DDNDPTLA  L +KLKRKKH
Subjt:  RLKRTTTTNDKKVDCSSSTPIEVDTDDNDPTLASFL-NKLKRKKH

A0A6J1JK98 uncharacterized protein LOC111485355 isoform X20.0e+0071.35Show/hide
Query:  MSHRNHDDEGDVELPVSK---EDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTLPPMSLD
        MS R+H D GD ELP S+   EDD+VD+DME LRRA +  GVN EDY+NP+LS P AGDAN  SDSDDVDD ELLRNIQNRFS  ADEQPLS LPP++ D
Subjt:  MSHRNHDDEGDVELPVSK---EDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTLPPMSLD

Query:  EEEDEFEMLRSIQRRFAAYESDTLSNKPNQLCDHVGSLKMDFDDTAVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEE
        EEED+FE LRSIQRRFAAYESD LSNKP+Q CD  G LKMD D+T VE  T+S+R SM+AFEKGSLPKAALAFIDAIKKNRSQQKF+RSKMIHLEARIEE
Subjt:  EEEDEFEMLRSIQRRFAAYESDTLSNKPNQLCDHVGSLKMDFDDTAVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEE

Query:  NKKLRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRFKILKDF
        NKKLRKRFK+LK FQGSC+R+TTCALSQM+DPRVQLISA KPQ                                                         
Subjt:  NKKLRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRFKILKDF

Query:  QGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSAMHYGPAENSHVACYRMALMKFPRVDRKKWSVVERENLGKGIRQQFQEMVLQISVDQIS
                                    QAKDSSKKDKRLSAM YGPAENSHVACYR+AL KF  VDRK+WS  ERENLGKGIRQQFQEMVLQISVDQI 
Subjt:  QGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSAMHYGPAENSHVACYRMALMKFPRVDRKKWSVVERENLGKGIRQQFQEMVLQISVDQIS

Query:  GLQGFSADSDDLDSILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEASGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLAS
                                                                S +QGFSA+SDDLDNILASIKDLDITPEKIREFLPKVNWDKLAS
Subjt:  GLQGFSADSDDLDSILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEASGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLAS

Query:  MYLRGRSGAECEARWLNFEDPLINRDPWTTSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGD
        MYLRGRSGAECEARWLNFEDPLINR+PWTTSEDKNLLFTIQQKGLNNWI++AVS GTNRTPFQ LSRYQRSLNASILK EWTKDEDDKLR+AVA FG GD
Subjt:  MYLRGRSGAECEARWLNFEDPLINRDPWTTSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGD

Query:  WQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQE
        WQAVASTLEGRTGPQCSNRWKKSLDPARTKRG+FTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQE
Subjt:  WQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQE

Query:  HGYSWAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVSRNEK
        HGYSWAKVAACVPSRTDN+C RRWKKLFPN+VPLLQEARKIQK ALISNFVDRESERPALGPTDFRP PN+ LLCN+ DP  APKRNV+TR+MPVSRNEK
Subjt:  HGYSWAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVSRNEK

Query:  SATGDAPKKRKSNCQRNQADVTAQAGIANNTSSVPEEVKSLKPQRKQNRHGAYTA--EGVPELCPNSEWCAKQSLDTQSLGVQLSLLE-SEGTNSDCIET
        SA GDAPKKRKSN QRN+ D TAQ   A+NTSSVP EVKS KPQRK+ RHGAYT   +G P++  NSE CA+Q+ DT++L VQL+  E +E  NSDC ET
Subjt:  SATGDAPKKRKSNCQRNQADVTAQAGIANNTSSVPEEVKSLKPQRKQNRHGAYTA--EGVPELCPNSEWCAKQSLDTQSLGVQLSLLE-SEGTNSDCIET

Query:  VDENGMEVFENKVAEKLSERDVCFPEPEAIQNSTGSSGVSVLSEMTNDMDEYNPSILPDTTLLASTTVDDLKELKRKSVADRDLDDCNSFSLPCSDLELR
        VDENGMEVFENK AE  SE  VCF E E  QNSTGSSGVSVLSEMTNDMDEYNPS LPDTTLLAS T DD+ E K  +VAD+DLD  NSFSLP S LELR
Subjt:  VDENGMEVFENKVAEKLSERDVCFPEPEAIQNSTGSSGVSVLSEMTNDMDEYNPSILPDTTLLASTTVDDLKELKRKSVADRDLDDCNSFSLPCSDLELR

Query:  TIDKEGVDSYSMDEFTDKSHGVCKPPHTRRKKNSKTSSKSQDHSFVSCQQVELERLGTNEPRHHNQSKKRKHNSTNTN----LEAVEEVDDCTLVGFLQK
        T D EGVDSYS+DEFTDKSH VCK P  RRKKNSK S+KSQD S VSCQQ ELE  GTNE    NQ KKRKH+STNT+    +EAVEEVDDCTL+GFLQK
Subjt:  TIDKEGVDSYSMDEFTDKSHGVCKPPHTRRKKNSKTSSKSQDHSFVSCQQVELERLGTNEPRHHNQSKKRKHNSTNTN----LEAVEEVDDCTLVGFLQK

Query:  RLKRTTTTNDKKVDCSSSTPIEVDTDDNDPTLASFL-NKLKRKKH
        RLKRTTTT+ KKVD SSST  EVD DDNDPTLA  L  KLKRKKH
Subjt:  RLKRTTTTNDKKVDCSSSTPIEVDTDDNDPTLASFL-NKLKRKKH

A0A6J1JKV7 uncharacterized protein LOC111485355 isoform X10.0e+0071.44Show/hide
Query:  MSHRNHDDEGDVELPVSK---EDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTLPPMSLD
        MS R+H D GD ELP S+   EDD+VD+DME LRRA +  GVN EDY+NP+LS P AGDAN  SDSDDVDD ELLRNIQNRFS  ADEQPLS LPP++ D
Subjt:  MSHRNHDDEGDVELPVSK---EDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTLPPMSLD

Query:  EEEDEFEMLRSIQRRFAAYESDTLSNKPNQLCDHVGSLKMDFDDTAVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEE
        EEED+FE LRSIQRRFAAYESD LSNKP+Q CD  G LKMD D+T VE  T+S+R SM+AFEKGSLPKAALAFIDAIKKNRSQQKF+RSKMIHLEARIEE
Subjt:  EEEDEFEMLRSIQRRFAAYESDTLSNKPNQLCDHVGSLKMDFDDTAVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEE

Query:  NKKLRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRFKILKDF
        NKKLRKRFK+LK FQGSC+R+TTCALSQM+DPRVQLISA KPQ                                                         
Subjt:  NKKLRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRFKILKDF

Query:  QGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSAMHYGPAENSHVACYRMALMKFPRVDRKKWSVVERENLGKGIRQQFQEMVLQISVDQIS
                                    QAKDSSKKDKRLSAM YGPAENSHVACYR+AL KF  VDRK+WS  ERENLGKGIRQQFQEMVLQISVDQI 
Subjt:  QGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSAMHYGPAENSHVACYRMALMKFPRVDRKKWSVVERENLGKGIRQQFQEMVLQISVDQIS

Query:  GLQGFSADSDDLDSILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEASGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLAS
                                                                S +QGFSA+SDDLDNILASIKDLDITPEKIREFLPKVNWDKLAS
Subjt:  GLQGFSADSDDLDSILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEASGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLAS

Query:  MYLRGRSGAECEARWLNFEDPLINRDPWTTSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGD
        MYLRGRSGAECEARWLNFEDPLINR+PWTTSEDKNLLFTIQQKGLNNWI++AVS GTNRTPFQ LSRYQRSLNASILK EWTKDEDDKLR+AVA FG GD
Subjt:  MYLRGRSGAECEARWLNFEDPLINRDPWTTSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGD

Query:  WQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQE
        WQAVASTLEGRTGPQCSNRWKKSLDPARTKRG+FTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQE
Subjt:  WQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQE

Query:  HGYSWAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVSRNEK
        HGYSWAKVAACVPSRTDN+CRRRWKKLFPN+VPLLQEARKIQK ALISNFVDRESERPALGPTDFRP PN+ LLCN+ DP  APKRNV+TR+MPVSRNEK
Subjt:  HGYSWAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVSRNEK

Query:  SATGDAPKKRKSNCQRNQADVTAQAGIANNTSSVPEEVKSLKPQRKQNRHGAYTA--EGVPELCPNSEWCAKQSLDTQSLGVQLSLLE-SEGTNSDCIET
        SA GDAPKKRKSN QRN+ D TAQ   A+NTSSVP EVKS KPQRK+ RHGAYT   +G P++  NSE CA+Q+ DT++L VQL+  E +E  NSDC ET
Subjt:  SATGDAPKKRKSNCQRNQADVTAQAGIANNTSSVPEEVKSLKPQRKQNRHGAYTA--EGVPELCPNSEWCAKQSLDTQSLGVQLSLLE-SEGTNSDCIET

Query:  VDENGMEVFENKVAEKLSERDVCFPEPEAIQNSTGSSGVSVLSEMTNDMDEYNPSILPDTTLLASTTVDDLKELKRKSVADRDLDDCNSFSLPCSDLELR
        VDENGMEVFENK AE  SE  VCF E E  QNSTGSSGVSVLSEMTNDMDEYNPS LPDTTLLAS T DD+ E K  +VAD+DLD  NSFSLP S LELR
Subjt:  VDENGMEVFENKVAEKLSERDVCFPEPEAIQNSTGSSGVSVLSEMTNDMDEYNPSILPDTTLLASTTVDDLKELKRKSVADRDLDDCNSFSLPCSDLELR

Query:  TIDKEGVDSYSMDEFTDKSHGVCKPPHTRRKKNSKTSSKSQDHSFVSCQQVELERLGTNEPRHHNQSKKRKHNSTNTN----LEAVEEVDDCTLVGFLQK
        T D EGVDSYS+DEFTDKSH VCK P  RRKKNSK S+KSQD S VSCQQ ELE  GTNE    NQ KKRKH+STNT+    +EAVEEVDDCTL+GFLQK
Subjt:  TIDKEGVDSYSMDEFTDKSHGVCKPPHTRRKKNSKTSSKSQDHSFVSCQQVELERLGTNEPRHHNQSKKRKHNSTNTN----LEAVEEVDDCTLVGFLQK

Query:  RLKRTTTTNDKKVDCSSSTPIEVDTDDNDPTLASFL-NKLKRKKH
        RLKRTTTT+ KKVD SSST  EVD DDNDPTLA  L  KLKRKKH
Subjt:  RLKRTTTTNDKKVDCSSSTPIEVDTDDNDPTLASFL-NKLKRKKH

SwissProt top hitse value%identityAlignment
P46200 Transcriptional activator Myb1.3e-3139.77Show/hide
Query:  KREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCL
        K  WT++ED+KL+  V   G  DW+ +A+ L  RT  QC +RW+K L+P   K G +T +ED R+   V  +GPK W+  A+ L GR   QCRERW N L
Subjt:  KREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCL

Query:  DPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNDCRRRWKKLFPNEVP---LLQEARKIQKAALISNF
        +P +++  WTEEED  +  A +  G  WA++A  +P RTDN  +  W      +V     LQE+ K  + A+ ++F
Subjt:  DPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNDCRRRWKKLFPNEVP---LLQEARKIQKAALISNF

P91868 snRNA-activating protein complex subunit 4 homolog1.0e-3133.08Show/hide
Query:  VNWDKLASMYLRG-RSGAECEARWLNFEDPLINRDPWTTSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRA
        V W  +A+   +G R+    +++W N  +P  N++ W+  E + L +  +     +W  +A++ GTNRT +QC+ +Y+  ++     +EW++DED KL A
Subjt:  VNWDKLASMYLRG-RSGAECEARWLNFEDPLINRDPWTTSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRA

Query:  AVATFGLG---DWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCE-WT
              +     W  VA  + GRT  Q   R+  +LD A  K G +T  ED  L  AV  +G K+W K A+ +  RN  QCRERW N L+ S    E +T
Subjt:  AVATFGLG---DWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCE-WT

Query:  EEEDLRLEIAIQEHGY-SWAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAALISNFVD
          ED +L  A++  G  +WAK    +P +T    RRR+ +L          A K++ AA   N VD
Subjt:  EEEDLRLEIAIQEHGY-SWAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAALISNFVD

Q54NA6 Myb-like protein L1.6e-5338.27Show/hide
Query:  RSGAECEARWLNFEDPLINRDPWTTSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLG---DWQ
        RS  E   RW N +DP IN+ P+T  EDK LL   ++   + W +I++  GTNRTP  C+ RYQRSLN+ ++KREWTK+ED+ L   +     G   DWQ
Subjt:  RSGAECEARWLNFEDPLINRDPWTTSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLG---DWQ

Query:  AVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHG
         +   + GRTG QC +RW K+LDP+  K+G ++P+ED  L  AV  +G  NW      + GR  VQCRER+ N LDP L +  WT +ED RL     + G
Subjt:  AVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHG

Query:  Y-SWAKVAACVPSRTDNDCRRRWKKL--FPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKM------
           W+ VA  + +RTDN C RRWK+L    N +   QE    +K   +SNF  R+ ER  L   D             ++ +  PK N KT+ +      
Subjt:  Y-SWAKVAACVPSRTDNDCRRRWKKL--FPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKM------

Query:  ---PVSRNEKSATGDAPKKRKSNC
           P + N K+   D    + ++C
Subjt:  ---PVSRNEKSATGDAPKKRKSNC

Q5SXM2 snRNA-activating protein complex subunit 42.9e-3933.09Show/hide
Query:  IKDLDITPEK--IREFLPKVNWDKLASMYLRG-RSGAECEARWLNFEDPLINRDPWTTSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSL
        I+D++  PE+  +   L   +W+K++++   G RS  E    W N E P IN+  W+  E++ L       G   W +IA   GT+R+ FQCL ++Q+  
Subjt:  IKDLDITPEK--IREFLPKVNWDKLASMYLRG-RSGAECEARWLNFEDPLINRDPWTTSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSL

Query:  NASILKREWTKDEDDKLRAAVATFGLGD---WQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQC
        N ++ ++EWT++ED  L   V    +G    ++ +   +EGR   Q   RW KSLDP   K+G++ P+ED++L  AV  +G ++W K  E +PGR+  QC
Subjt:  NASILKREWTKDEDDKLRAAVATFGLGD---WQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQC

Query:  RERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYS-WAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARK
        R+R+   L  SL++  W  +E+ +L   I+++G   WAK+A+ +P R+ + C  +WK +   +  L +  R+
Subjt:  RERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYS-WAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARK

Q8BP86 snRNA-activating protein complex subunit 49.3e-3833.59Show/hide
Query:  IKDLDITPEK--IREFLPKVNWDKLASMYLRG-RSGAECEARWLNFEDPLINRDPWTTSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSL
        I+D++  PE+  +   L   +W+K++++   G RS  E    W + E P I++  W+T E + L       G   W  +A   GT+R+ FQCL ++Q+  
Subjt:  IKDLDITPEK--IREFLPKVNWDKLASMYLRG-RSGAECEARWLNFEDPLINRDPWTTSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSL

Query:  NASILKREWTKDEDDKLRAAVATFGLGD---WQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQC
        N ++ ++EWT++ED  L   V    +G+   ++ +   +EGR   Q   RW KSLDP+  KRG + P+ED++L  AV  +G ++W K  E +PGR+  QC
Subjt:  NASILKREWTKDEDDKLRAAVATFGLGD---WQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQC

Query:  RERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYS-WAKVAACVPSRTDNDCRRRWKKL
        R+R+   L  SL++  W  +E+ +L   I+++G   WA++A+ +P R+ + C  +WK L
Subjt:  RERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYS-WAKVAACVPSRTDNDCRRRWKKL

Arabidopsis top hitse value%identityAlignment
AT3G09370.1 myb domain protein 3r-33.6e-2942.18Show/hide
Query:  KREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCL
        K  WT +ED+ LR AV TF    W+ +A +   RT  QC +RW+K L+P   K G +T +ED ++   V  +GP  W+  A+ LPGR   QCRERW N L
Subjt:  KREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCL

Query:  DPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNDCRRRW
        +P + +  WT EE++ L  A + HG  WA++A  +P RTDN  +  W
Subjt:  DPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNDCRRRW

AT3G09370.2 myb domain protein 3r-33.6e-2942.18Show/hide
Query:  KREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCL
        K  WT +ED+ LR AV TF    W+ +A +   RT  QC +RW+K L+P   K G +T +ED ++   V  +GP  W+  A+ LPGR   QCRERW N L
Subjt:  KREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCL

Query:  DPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNDCRRRW
        +P + +  WT EE++ L  A + HG  WA++A  +P RTDN  +  W
Subjt:  DPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNDCRRRW

AT3G18100.1 myb domain protein 4r11.3e-15641.37Show/hide
Query:  SSMSHRNHDDEGDVELPVSKEDDVVDEDMEALRRAYKHVGVNPEDYINP--RLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTLPPMSL
        +S+   + DD+ D       E+D + ED+E LRRA     VN + + +    +     G     SDS++ DDFE+LR I+++ +   D    S+ PPM L
Subjt:  SSMSHRNHDDEGDVELPVSKEDDVVDEDMEALRRAYKHVGVNPEDYINP--RLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTLPPMSL

Query:  -----DEEEDEFEMLRSIQRRFAAYESDTLSNKPNQLCDHVGSLKMDFDDTAVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFI-----RS
              E ED+FEM+RSI+ + +   S  +S  P  L D       D +D A E+  A +R                    A K   S+ KF+     + 
Subjt:  -----DEEEDEFEMLRSIQRRFAAYESDTLSNKPNQLCDHVGSLKMDFDDTAVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFI-----RS

Query:  KMIHLEARIEENKKLRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKK
        K +H  +  E + ++  R    + F    K   T   S+ +          +     ++  S P+AA AF+DAI++NR+ QKF+R K+  +EA IE+N+K
Subjt:  KMIHLEARIEENKKLRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKK

Query:  LRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSK----------KDKRLSAMHYGPAENSHVACYRMALMKFP-RVDRKKWSVVEREN
         +K  +I+KDFQ SCKR T  AL Q  DPRV+LIS  K    DSS+           DK++S +  GPAEN  V  YRMAL K+P  V R+KWS  E +N
Subjt:  LRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSK----------KDKRLSAMHYGPAENSHVACYRMALMKFP-RVDRKKWSVVEREN

Query:  LGKGIRQQFQEMVLQISVDQISGLQGFSADSDDLDSILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEASGLQGFSADSDDLDNILASIKD
        L KG++Q+ Q+++L  ++++ S L+G +    D+D+I  SI +L+ITPE IR+FLPK+NWD L                                  IKD
Subjt:  LGKGIRQQFQEMVLQISVDQISGLQGFSADSDDLDSILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEASGLQGFSADSDDLDNILASIKD

Query:  LDITPEKIREFLPKVNWDKLASMYLRGRSGAECEARWLNFEDPLINRDPWTTSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILK
                                   RS AECEARW++ EDPLIN  PWT +EDKNLL TI+Q  L +W++IAVS GTNRTPFQCL+RYQRSLN SILK
Subjt:  LDITPEKIREFLPKVNWDKLASMYLRGRSGAECEARWLNFEDPLINRDPWTTSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILK

Query:  REWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLD
        +EWT +EDD+LR AV  FG  DWQ+VA+ L+GRTG QCSNRWKKSL P  T++G ++ +ED R+K+AV LFG +NW+K ++F+PGR Q QCRERW NCLD
Subjt:  REWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLD

Query:  PSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCNSV
        P + R +WTEEED +L  AI EHGYSW+KVA  +  RTDN C RRWK+L+P++V LLQEAR++QK A + NFVDRESERPAL  +     P+      S+
Subjt:  PSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCNSV

Query:  DPRPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNCQRNQADVTAQ
        +P P    +V  +K   ++ +KS     PK+R+   +    DV  Q
Subjt:  DPRPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNCQRNQADVTAQ

AT3G18100.2 myb domain protein 4r12.5e-14748.39Show/hide
Query:  SSKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSK----------K
        ++  S P+AA AF+DAI++NR+ QKF+R K+  +EA IE+N+K +K  +I+KDFQ SCKR T  AL Q  DPRV+LIS  K    DSS+           
Subjt:  SSKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSK----------K

Query:  DKRLSAMHYGPAENSHVACYRMALMKFP-RVDRKKWSVVERENLGKGIRQQFQEMVLQISVDQISGLQGFSADSDDLDSILASIKDLDITPEKIREFLPK
        DK++S +  GPAEN  V  YRMAL K+P  V R+KWS  E +NL KG++Q+ Q+++L  ++++ S L+G +    D+D+I  SI +L+ITPE IR+FLPK
Subjt:  DKRLSAMHYGPAENSHVACYRMALMKFP-RVDRKKWSVVERENLGKGIRQQFQEMVLQISVDQISGLQGFSADSDDLDSILASIKDLDITPEKIREFLPK

Query:  VNWDKLASMYLRGRSGAECEASGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEARWLNFEDPLINRDPWTTSEDKN
        +NWD L                                  IKD                           RS AECEARW++ EDPLIN  PWT +EDKN
Subjt:  VNWDKLASMYLRGRSGAECEASGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEARWLNFEDPLINRDPWTTSEDKN

Query:  LLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFT
        LL TI+Q  L +W++IAVS GTNRTPFQCL+RYQRSLN SILK+EWT +EDD+LR AV  FG  DWQ+VA+ L+GRTG QCSNRWKKSL P  T++G ++
Subjt:  LLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFT

Query:  PDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNDCRRRWKKLFPNEVPLL
         +ED R+K+AV LFG +NW+K ++F+PGR Q QCRERW NCLDP + R +WTEEED +L  AI EHGYSW+KVA  +  RTDN C RRWK+L+P++V LL
Subjt:  PDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNDCRRRWKKLFPNEVPLL

Query:  QEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNCQRNQADVTAQ
        QEAR++QK A + NFVDRESERPAL  +     P+      S++P P    +V  +K   ++ +KS     PK+R+   +    DV  Q
Subjt:  QEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNCQRNQADVTAQ

AT3G18100.3 myb domain protein 4r11.3e-13238.13Show/hide
Query:  SSMSHRNHDDEGDVELPVSKEDDVVDEDMEALRRAYKHVGVNPEDYINP--RLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTLPPMSL
        +S+   + DD+ D       E+D + ED+E LRRA     VN + + +    +     G     SDS++ DDFE+LR I+++ +   D    S+ PPM L
Subjt:  SSMSHRNHDDEGDVELPVSKEDDVVDEDMEALRRAYKHVGVNPEDYINP--RLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTLPPMSL

Query:  -----DEEEDEFEMLRSIQRRFAAYESDTLSNKPNQLCDHVGSLKMDFDDTAVESQTASKR-----PSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRS
              E ED+FEM+RSI+ + +   S  +S  P  L D       D +D A E+  A +R      +   F   S  K      + +   ++Q+ +   
Subjt:  -----DEEEDEFEMLRSIQRRFAAYESDTLSNKPNQLCDHVGSLKMDFDDTAVESQTASKR-----PSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRS

Query:  KMIHLEARIEENKKLRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEEN-K
        KM+  + R++ + KL +R                            L+ + + +                                   H+   +EEN +
Subjt:  KMIHLEARIEENKKLRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEEN-K

Query:  KLRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSK----------KDKRLSAMHYGPAENSHVACYRMALMKFP-RVDRKKWSVVERE
        KL++R         S + R T    +M DPRV+LIS  K    DSS+           DK++S +  GPAEN  V  YRMAL K+P  V R+KWS  E +
Subjt:  KLRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSK----------KDKRLSAMHYGPAENSHVACYRMALMKFP-RVDRKKWSVVERE

Query:  NLGKGIRQQFQEMVLQISVDQISGLQGFSADSDDLDSILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEASGLQGFSADSDDLDNILASIK
        NL KG++Q+ Q+++L  ++++ S L+G +    D+D+I  SI +L+ITPE IR+FLPK+NWD L                                  IK
Subjt:  NLGKGIRQQFQEMVLQISVDQISGLQGFSADSDDLDSILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEASGLQGFSADSDDLDNILASIK

Query:  DLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEARWLNFEDPLINRDPWTTSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASIL
        D                           RS AECEARW++ EDPLIN  PWT +EDKNLL TI+Q  L +W++IAVS GTNRTPFQCL+RYQRSLN SIL
Subjt:  DLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEARWLNFEDPLINRDPWTTSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASIL

Query:  KREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCL
        K+EWT +EDD+LR AV  FG  DWQ+VA+ L+GRTG QCSNRWKKSL P  T++G ++ +ED R+K+AV LFG +NW+K ++F+PGR Q QCRERW NCL
Subjt:  KREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCL

Query:  DPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCNS
        DP + R +WTEEED +L  AI EHGYSW+KVA  +  RTDN C RRWK+L+P++V LLQEAR++QK A + NFVDRESERPAL  +     P+      S
Subjt:  DPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCNS

Query:  VDPRPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNCQRNQADVTAQ
        ++P P    +V  +K   ++ +KS     PK+R+   +    DV  Q
Subjt:  VDPRPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNCQRNQADVTAQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
GGGTGCCTCGTTTTGCATGCAGCCTCCACCCTCTCTCTCCTGCTGCCACACACACACACTCTCTCTCCCTCGCCGATCTCTCTCCTGTTTGGCTCTCTCTCTCTCGGTCT
CTCTCCTTCGCCGGCACACTCTTTCGCTCGGCGTCTCGCTCGGCGTTTTCATCTCGTGACATCACTTTCTCTCTCGGGTCTCGCCGTCTCGCTTTCGCTAGACCCTCTCT
CTCTCCCTCTCGGCAGTACGTTTCCCCCTCCCTTTCTCTCTCGATTTTACTCGGCCGGCAGCACCTCCTTAACATGCTTCCCTCTTAGCTACTCTCCCTCACGCCGCCGT
TGTGCCGCTGCCGCCGGATTTCTCTCTCCGCCGCCACCGCTGCCTTTTTGCAATGATTCAGTCAGGGTACGTTGTGCTCATTCATGTTCATCCATGTCTCACCGAAACCA
TGACGATGAAGGTGACGTTGAGCTTCCTGTCAGCAAGGAAGATGATGTGGTTGATGAGGACATGGAAGCCCTTCGGAGAGCCTATAAGCATGTTGGAGTTAATCCTGAGG
ATTACATTAATCCTAGGTTGTCATCACCTGTTGCCGGAGATGCTAATCCTAGTTCTGATTCTGATGATGTTGATGATTTTGAACTTCTTCGAAATATTCAGAATCGGTTC
TCATGTGTGGCTGATGAGCAGCCGTTGAGTACTCTCCCACCAATGTCCCTAGACGAGGAGGAAGATGAATTTGAGATGCTTCGTTCAATTCAGCGGCGCTTTGCAGCGTA
CGAAAGTGATACTTTGAGCAATAAACCCAATCAGTTGTGCGACCATGTTGGGTCTTTGAAGATGGATTTTGATGACACAGCTGTTGAGAGTCAGACAGCCTCAAAAAGGC
CATCCATGCTAGCCTTTGAAAAGGGAAGCTTGCCAAAAGCTGCATTGGCATTTATTGATGCCATCAAGAAAAATAGGTCCCAGCAGAAGTTTATTCGTAGTAAGATGATT
CATCTTGAAGCTAGAATTGAGGAGAACAAAAAGCTCAGAAAACGTTTCAAAATTCTCAAAGATTTCCAGGGTTCATGTAAACGGAGAACAACCTGTGCACTGTCTCAAAT
GATAGATCCTCGAGTCCAGTTGATTTCAGCTGCAAAACCACAAGCAAAGGATTCATCAAAGGGAAGCTTGCCAAAAGCTGCATTGGCATTTATTGATGCCATCAAGAAAA
ATAGGTCCCAGCAGAAGTTTATTCGTAGTAAGATGATTCATCTTGAAGCTAGAATTGAGGAGAACAAAAAGCTCAGAAAACGTTTCAAAATTCTCAAAGATTTCCAGGGT
TCATGTAAACGGAGAACAACCTGTGCACTGTCTCAAATGATAGATCCTCGAGTCCAGTTGATTTCAGCTGCAAAACCACAAGCAAAGGATTCATCAAAGAAGGACAAACG
ATTATCTGCAATGCATTATGGCCCAGCTGAGAATTCTCATGTTGCATGCTACAGAATGGCATTAATGAAGTTTCCTCGTGTAGATCGAAAAAAATGGTCCGTTGTAGAAA
GGGAGAATCTAGGGAAGGGAATAAGACAACAATTTCAGGAGATGGTGCTTCAGATTTCAGTGGATCAAATTAGTGGGCTACAAGGATTTTCTGCAGATTCAGATGATTTG
GATAGCATTCTTGCATCAATAAAAGACCTTGACATCACTCCTGAAAAGATTAGGGAATTTCTGCCAAAAGTTAATTGGGACAAATTGGCTTCCATGTATCTTCGGGGTCG
CTCAGGGGCAGAATGTGAAGCGAGTGGGCTACAAGGATTCTCTGCAGATTCAGATGATTTGGATAACATTCTTGCATCAATAAAAGACCTTGACATCACTCCTGAAAAGA
TTAGGGAATTTCTGCCAAAAGTTAATTGGGACAAATTGGCTTCCATGTATCTTCGGGGTCGCTCAGGGGCAGAATGTGAAGCGAGGTGGTTGAATTTCGAAGACCCCCTA
ATTAATCGGGATCCATGGACTACAAGTGAGGATAAGAATCTTTTGTTTACTATCCAACAGAAGGGGTTGAATAACTGGATTGAGATAGCAGTTTCGTCGGGTACAAATAG
AACTCCTTTTCAGTGCTTGTCTCGGTATCAAAGGAGTTTAAACGCTTCCATATTAAAGAGGGAGTGGACCAAAGACGAGGATGATAAACTTCGAGCTGCTGTTGCTACAT
TTGGCCTGGGAGATTGGCAGGCAGTAGCTTCTACTTTGGAAGGACGAACTGGCCCACAGTGCTCTAATAGGTGGAAAAAATCCCTTGACCCAGCTAGGACAAAAAGAGGT
CATTTCACTCCAGATGAAGACAGTCGCTTGAAAATTGCTGTACTGCTCTTTGGGCCTAAAAATTGGAACAAGAAAGCAGAATTTTTACCTGGTCGAAATCAAGTTCAATG
CAGAGAAAGATGGTTTAATTGTTTAGATCCTTCCTTGAGAAGATGCGAATGGACAGAAGAGGAGGATTTAAGGCTGGAGATAGCAATTCAGGAACATGGATATAGCTGGG
CTAAGGTAGCTGCATGTGTGCCGTCACGTACAGATAATGATTGTCGGAGGAGATGGAAGAAGTTATTTCCCAATGAAGTTCCGTTACTCCAGGAAGCTAGAAAGATTCAG
AAGGCTGCTCTTATTAGCAACTTTGTTGATAGGGAATCAGAGCGTCCTGCTCTTGGTCCTACTGACTTTCGACCCAGGCCAAACACAGATTTATTGTGTAATAGTGTTGA
TCCAAGACCTGCCCCAAAAAGAAATGTGAAGACGAGAAAGATGCCAGTGTCAAGGAATGAAAAGAGTGCTACTGGTGATGCTCCAAAGAAGAGGAAATCAAATTGCCAGA
GGAATCAGGCTGATGTAACTGCTCAGGCGGGTATTGCAAATAATACCTCTTCTGTCCCAGAGGAGGTTAAATCTCTAAAGCCTCAAAGAAAACAAAATAGACATGGAGCT
TATACCGCAGAAGGGGTTCCGGAGCTATGTCCTAACAGTGAGTGGTGTGCTAAACAGAGTTTGGACACTCAGAGCCTTGGGGTGCAGCTGTCGCTGTTGGAATCTGAGGG
GACCAACAGCGACTGCATCGAGACTGTTGATGAGAATGGTATGGAGGTATTTGAGAACAAAGTTGCAGAGAAACTTTCTGAAAGAGATGTATGTTTTCCAGAACCAGAAG
CAATTCAAAACTCTACCGGATCTTCTGGAGTCTCGGTATTGTCAGAAATGACTAATGACATGGATGAATACAATCCGTCTATCCTTCCAGATACAACATTGTTGGCTAGT
ACTACTGTGGATGATCTCAAAGAATTGAAGAGGAAGAGTGTTGCAGACAGAGATCTGGATGACTGTAACAGTTTCTCGTTACCGTGCAGTGACTTGGAACTCAGGACAAT
CGACAAGGAAGGTGTGGACAGTTATTCCATGGATGAATTTACAGATAAAAGCCATGGGGTTTGCAAGCCCCCCCATACCAGAAGGAAGAAAAACAGCAAAACATCAAGTA
AGAGCCAGGATCATTCCTTTGTTTCTTGTCAACAAGTGGAGTTGGAGAGGTTGGGGACAAATGAGCCTCGTCATCATAATCAATCAAAGAAGAGAAAACATAACAGTACA
AATACGAATCTGGAGGCAGTTGAGGAGGTTGATGACTGCACACTCGTGGGCTTTTTGCAAAAAAGATTGAAGAGGACGACGACAACAAATGACAAGAAAGTTGATTGCAG
TTCAAGTACTCCCATAGAAGTTGATACTGATGACAACGATCCTACCCTTGCCTCGTTTCTTAATAAATTAAAGAGAAAAAAGCATCAGCCGAGTAGTGGTGGTGACAACA
TCACATTGGATCTCCGAAGAAGGTTGATGGAAGGTTGTACAAATAAAGCTTCCAAGGCCGCAAAATGGCGAAATCTTCAACTATACGAGGCTTTGGCAATTTTCTTTTGT
TATGGTTCATTTTCTTTTCCAATTTTGTTATTATTATTATTTTTTTCTGAGAAG
mRNA sequenceShow/hide mRNA sequence
GGGTGCCTCGTTTTGCATGCAGCCTCCACCCTCTCTCTCCTGCTGCCACACACACACACTCTCTCTCCCTCGCCGATCTCTCTCCTGTTTGGCTCTCTCTCTCTCGGTCT
CTCTCCTTCGCCGGCACACTCTTTCGCTCGGCGTCTCGCTCGGCGTTTTCATCTCGTGACATCACTTTCTCTCTCGGGTCTCGCCGTCTCGCTTTCGCTAGACCCTCTCT
CTCTCCCTCTCGGCAGTACGTTTCCCCCTCCCTTTCTCTCTCGATTTTACTCGGCCGGCAGCACCTCCTTAACATGCTTCCCTCTTAGCTACTCTCCCTCACGCCGCCGT
TGTGCCGCTGCCGCCGGATTTCTCTCTCCGCCGCCACCGCTGCCTTTTTGCAATGATTCAGTCAGGGTACGTTGTGCTCATTCATGTTCATCCATGTCTCACCGAAACCA
TGACGATGAAGGTGACGTTGAGCTTCCTGTCAGCAAGGAAGATGATGTGGTTGATGAGGACATGGAAGCCCTTCGGAGAGCCTATAAGCATGTTGGAGTTAATCCTGAGG
ATTACATTAATCCTAGGTTGTCATCACCTGTTGCCGGAGATGCTAATCCTAGTTCTGATTCTGATGATGTTGATGATTTTGAACTTCTTCGAAATATTCAGAATCGGTTC
TCATGTGTGGCTGATGAGCAGCCGTTGAGTACTCTCCCACCAATGTCCCTAGACGAGGAGGAAGATGAATTTGAGATGCTTCGTTCAATTCAGCGGCGCTTTGCAGCGTA
CGAAAGTGATACTTTGAGCAATAAACCCAATCAGTTGTGCGACCATGTTGGGTCTTTGAAGATGGATTTTGATGACACAGCTGTTGAGAGTCAGACAGCCTCAAAAAGGC
CATCCATGCTAGCCTTTGAAAAGGGAAGCTTGCCAAAAGCTGCATTGGCATTTATTGATGCCATCAAGAAAAATAGGTCCCAGCAGAAGTTTATTCGTAGTAAGATGATT
CATCTTGAAGCTAGAATTGAGGAGAACAAAAAGCTCAGAAAACGTTTCAAAATTCTCAAAGATTTCCAGGGTTCATGTAAACGGAGAACAACCTGTGCACTGTCTCAAAT
GATAGATCCTCGAGTCCAGTTGATTTCAGCTGCAAAACCACAAGCAAAGGATTCATCAAAGGGAAGCTTGCCAAAAGCTGCATTGGCATTTATTGATGCCATCAAGAAAA
ATAGGTCCCAGCAGAAGTTTATTCGTAGTAAGATGATTCATCTTGAAGCTAGAATTGAGGAGAACAAAAAGCTCAGAAAACGTTTCAAAATTCTCAAAGATTTCCAGGGT
TCATGTAAACGGAGAACAACCTGTGCACTGTCTCAAATGATAGATCCTCGAGTCCAGTTGATTTCAGCTGCAAAACCACAAGCAAAGGATTCATCAAAGAAGGACAAACG
ATTATCTGCAATGCATTATGGCCCAGCTGAGAATTCTCATGTTGCATGCTACAGAATGGCATTAATGAAGTTTCCTCGTGTAGATCGAAAAAAATGGTCCGTTGTAGAAA
GGGAGAATCTAGGGAAGGGAATAAGACAACAATTTCAGGAGATGGTGCTTCAGATTTCAGTGGATCAAATTAGTGGGCTACAAGGATTTTCTGCAGATTCAGATGATTTG
GATAGCATTCTTGCATCAATAAAAGACCTTGACATCACTCCTGAAAAGATTAGGGAATTTCTGCCAAAAGTTAATTGGGACAAATTGGCTTCCATGTATCTTCGGGGTCG
CTCAGGGGCAGAATGTGAAGCGAGTGGGCTACAAGGATTCTCTGCAGATTCAGATGATTTGGATAACATTCTTGCATCAATAAAAGACCTTGACATCACTCCTGAAAAGA
TTAGGGAATTTCTGCCAAAAGTTAATTGGGACAAATTGGCTTCCATGTATCTTCGGGGTCGCTCAGGGGCAGAATGTGAAGCGAGGTGGTTGAATTTCGAAGACCCCCTA
ATTAATCGGGATCCATGGACTACAAGTGAGGATAAGAATCTTTTGTTTACTATCCAACAGAAGGGGTTGAATAACTGGATTGAGATAGCAGTTTCGTCGGGTACAAATAG
AACTCCTTTTCAGTGCTTGTCTCGGTATCAAAGGAGTTTAAACGCTTCCATATTAAAGAGGGAGTGGACCAAAGACGAGGATGATAAACTTCGAGCTGCTGTTGCTACAT
TTGGCCTGGGAGATTGGCAGGCAGTAGCTTCTACTTTGGAAGGACGAACTGGCCCACAGTGCTCTAATAGGTGGAAAAAATCCCTTGACCCAGCTAGGACAAAAAGAGGT
CATTTCACTCCAGATGAAGACAGTCGCTTGAAAATTGCTGTACTGCTCTTTGGGCCTAAAAATTGGAACAAGAAAGCAGAATTTTTACCTGGTCGAAATCAAGTTCAATG
CAGAGAAAGATGGTTTAATTGTTTAGATCCTTCCTTGAGAAGATGCGAATGGACAGAAGAGGAGGATTTAAGGCTGGAGATAGCAATTCAGGAACATGGATATAGCTGGG
CTAAGGTAGCTGCATGTGTGCCGTCACGTACAGATAATGATTGTCGGAGGAGATGGAAGAAGTTATTTCCCAATGAAGTTCCGTTACTCCAGGAAGCTAGAAAGATTCAG
AAGGCTGCTCTTATTAGCAACTTTGTTGATAGGGAATCAGAGCGTCCTGCTCTTGGTCCTACTGACTTTCGACCCAGGCCAAACACAGATTTATTGTGTAATAGTGTTGA
TCCAAGACCTGCCCCAAAAAGAAATGTGAAGACGAGAAAGATGCCAGTGTCAAGGAATGAAAAGAGTGCTACTGGTGATGCTCCAAAGAAGAGGAAATCAAATTGCCAGA
GGAATCAGGCTGATGTAACTGCTCAGGCGGGTATTGCAAATAATACCTCTTCTGTCCCAGAGGAGGTTAAATCTCTAAAGCCTCAAAGAAAACAAAATAGACATGGAGCT
TATACCGCAGAAGGGGTTCCGGAGCTATGTCCTAACAGTGAGTGGTGTGCTAAACAGAGTTTGGACACTCAGAGCCTTGGGGTGCAGCTGTCGCTGTTGGAATCTGAGGG
GACCAACAGCGACTGCATCGAGACTGTTGATGAGAATGGTATGGAGGTATTTGAGAACAAAGTTGCAGAGAAACTTTCTGAAAGAGATGTATGTTTTCCAGAACCAGAAG
CAATTCAAAACTCTACCGGATCTTCTGGAGTCTCGGTATTGTCAGAAATGACTAATGACATGGATGAATACAATCCGTCTATCCTTCCAGATACAACATTGTTGGCTAGT
ACTACTGTGGATGATCTCAAAGAATTGAAGAGGAAGAGTGTTGCAGACAGAGATCTGGATGACTGTAACAGTTTCTCGTTACCGTGCAGTGACTTGGAACTCAGGACAAT
CGACAAGGAAGGTGTGGACAGTTATTCCATGGATGAATTTACAGATAAAAGCCATGGGGTTTGCAAGCCCCCCCATACCAGAAGGAAGAAAAACAGCAAAACATCAAGTA
AGAGCCAGGATCATTCCTTTGTTTCTTGTCAACAAGTGGAGTTGGAGAGGTTGGGGACAAATGAGCCTCGTCATCATAATCAATCAAAGAAGAGAAAACATAACAGTACA
AATACGAATCTGGAGGCAGTTGAGGAGGTTGATGACTGCACACTCGTGGGCTTTTTGCAAAAAAGATTGAAGAGGACGACGACAACAAATGACAAGAAAGTTGATTGCAG
TTCAAGTACTCCCATAGAAGTTGATACTGATGACAACGATCCTACCCTTGCCTCGTTTCTTAATAAATTAAAGAGAAAAAAGCATCAGCCGAGTAGTGGTGGTGACAACA
TCACATTGGATCTCCGAAGAAGGTTGATGGAAGGTTGTACAAATAAAGCTTCCAAGGCCGCAAAATGGCGAAATCTTCAACTATACGAGGCTTTGGCAATTTTCTTTTGT
TATGGTTCATTTTCTTTTCCAATTTTGTTATTATTATTATTTTTTTCTGAGAAG
Protein sequenceShow/hide protein sequence
GCLVLHAASTLSLLLPHTHTLSPSPISLLFGSLSLGLSPSPAHSFARRLARRFHLVTSLSLSGLAVSLSLDPLSLPLGSTFPPPFLSRFYSAGSTSLTCFPLSYSPSRRR
CAAAAGFLSPPPPLPFCNDSVRVRCAHSCSSMSHRNHDDEGDVELPVSKEDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSPVAGDANPSSDSDDVDDFELLRNIQNRF
SCVADEQPLSTLPPMSLDEEEDEFEMLRSIQRRFAAYESDTLSNKPNQLCDHVGSLKMDFDDTAVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMI
HLEARIEENKKLRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRFKILKDFQG
SCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSAMHYGPAENSHVACYRMALMKFPRVDRKKWSVVERENLGKGIRQQFQEMVLQISVDQISGLQGFSADSDDL
DSILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEASGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEARWLNFEDPL
INRDPWTTSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRG
HFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQ
KAALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNCQRNQADVTAQAGIANNTSSVPEEVKSLKPQRKQNRHGA
YTAEGVPELCPNSEWCAKQSLDTQSLGVQLSLLESEGTNSDCIETVDENGMEVFENKVAEKLSERDVCFPEPEAIQNSTGSSGVSVLSEMTNDMDEYNPSILPDTTLLAS
TTVDDLKELKRKSVADRDLDDCNSFSLPCSDLELRTIDKEGVDSYSMDEFTDKSHGVCKPPHTRRKKNSKTSSKSQDHSFVSCQQVELERLGTNEPRHHNQSKKRKHNST
NTNLEAVEEVDDCTLVGFLQKRLKRTTTTNDKKVDCSSSTPIEVDTDDNDPTLASFLNKLKRKKHQPSSGGDNITLDLRRRLMEGCTNKASKAAKWRNLQLYEALAIFFC
YGSFSFPILLLLLFFSEK