; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc05G16440 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc05G16440
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionsnRNA-activating protein complex subunit 4
Genome locationClcChr05:23369983..23406461
RNA-Seq ExpressionClc05G16440
SyntenyClc05G16440
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000978 - RNA polymerase II proximal promoter sequence-specific DNA binding (molecular function)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
InterPro domainsIPR001005 - SANT/Myb domain
IPR009057 - Homeobox-like domain superfamily
IPR017884 - SANT domain
IPR017930 - Myb domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_011650584.1 uncharacterized protein LOC101216287 [Cucumis sativus]0.0e+0082.1Show/hide
Query:  MSHRNHDDEGDVELPVSKEDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTLPPMSLDEEE
        MS RNH DE DVE P  KED VVDEDME L+RAY+  GVNPEDYINPRLSSP AGDA+P SDSDDVDDFELLR+IQNRFS +ADEQP ST  P+S DEEE
Subjt:  MSHRNHDDEGDVELPVSKEDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTLPPMSLDEEE

Query:  DEFEMLRSIQRRFAAYESDTLSNKPNQLCDHVGSLKMDFDDTAVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKK
        DEFEMLRSIQRRFAAYESDTLSNKPNQ  D+VGSLK+D DD AVESQT+SKRPSMLAFEKGSLPKAALAF+DAIKKNRSQQKFIRSKMIHLEARIEENKK
Subjt:  DEFEMLRSIQRRFAAYESDTLSNKPNQLCDHVGSLKMDFDDTAVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKK

Query:  LRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSAMHYGPAENSHVACYRMALMKFPRVDRKKWSVVERENLGKGIRQQFQE
        LRKR KILKDFQGSCKRRT+CALSQMIDPRVQLISAAKPQAKDSSKKDKRLS M+YGP ENSHVACYRM L KFP VDRKKWS+VERENLGKGIRQQFQE
Subjt:  LRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSAMHYGPAENSHVACYRMALMKFPRVDRKKWSVVERENLGKGIRQQFQE

Query:  MVLQISVDQISGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEASMSTSISIAEYRWLNFEDPLINRDPWTTSEDKN
        MVLQISVDQISG QG S DSDDLDNILASIKDLDI P+KIREFLPKVNWDKLASMYL+GRSGAECEA           RWLNFEDPLINRDPWTTSEDK+
Subjt:  MVLQISVDQISGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEASMSTSISIAEYRWLNFEDPLINRDPWTTSEDKN

Query:  LLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFT
        LLFTIQQKGLNNWIE+AVS GTNRTPFQCLSRYQRSLNASILKREWTK+EDD+LR+AVATFG+ DWQAVASTLEGR G QCSNRWKKSLDPART++G+FT
Subjt:  LLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFT

Query:  PDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNDCRRRWKKLFPNEVPLL
        PDED RLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSW KVAACVPSRTDN+CRRRWKKLFP+EVPLL
Subjt:  PDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNDCRRRWKKLFPNEVPLL

Query:  QEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNCQRNQADVTAQAGIANNTSSVP
        QEARKIQKAALISNFVDRE+ERPALGP DFRPRPNTD LCN+  P PAPKRNVKTRKMPVSRNEKSATGDAPKKRKSN QR Q D TAQ GIA NTS VP
Subjt:  QEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNCQRNQADVTAQAGIANNTSSVP

Query:  EEVKSLKPQRKQNRHGAYTAE--GVPELCPNSEWCAKQSLDTQSLGVQLSLLESEGTNSDCIETVDENGMEVFENKVAEKLSERDVCFPEPEAIQNSTGS
        EEV+S KPQRK+NR GAYTA+  GVPEL  +SEWCAKQ+LDT+SLG+QL+  ESE +NS+C ETVDEN MEV ENKVAEKL+E + CF EPE  QNSTGS
Subjt:  EEVKSLKPQRKQNRHGAYTAE--GVPELCPNSEWCAKQSLDTQSLGVQLSLLESEGTNSDCIETVDENGMEVFENKVAEKLSERDVCFPEPEAIQNSTGS

Query:  SGVSVLSEMTNDMDEYNPSILPDTTLLASTTVDDLKELKRKSVADRDLDDCNSFSLPCSDLELRTIDKEGVDSYSMDEFTDKSHGVCKPPHTRRKKNSKT
        SGVSVLSEMTND+ +YNPSIL DTTL ASTTVDD++ELK KS ADRDLDD NSFSL  S LELRT+D EGVDSYS+DE+T KS+GVC P   RRKKNSKT
Subjt:  SGVSVLSEMTNDMDEYNPSILPDTTLLASTTVDDLKELKRKSVADRDLDDCNSFSLPCSDLELRTIDKEGVDSYSMDEFTDKSHGVCKPPHTRRKKNSKT

Query:  SSKSQDHSFVSCQQVELERLGTNEPRHHNQSKKRKHNSTNTNL----EAVEEVDDCTLVGFLQKRLKRTTTTNDKKVDCSSSTPIEVDTDDNDPTLASFL
        S+ S D+  +  QQ+  E LGT +P HHNQSKKRKH++T  +     EAVEEVDDCTLVGFLQKRLKRT  T+++ VDCSS+ P++VD DDN+PT+ASFL
Subjt:  SSKSQDHSFVSCQQVELERLGTNEPRHHNQSKKRKHNSTNTNL----EAVEEVDDCTLVGFLQKRLKRTTTTNDKKVDCSSSTPIEVDTDDNDPTLASFL

Query:  NKLKRKKHQPSSGGELN
        NKLKRKKHQ  SG ELN
Subjt:  NKLKRKKHQPSSGGELN

XP_023515735.1 uncharacterized protein LOC111779809 isoform X1 [Cucurbita pepo subsp. pepo]0.0e+0081.15Show/hide
Query:  MSHRNHDDEGDVELPVSK---EDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTLPPMSLD
        MS R+H D GD ELP S+   EDD+VD+DME LRRA +  GVN EDYINPRLS P AGDAN  SDSDDVDD ELLRNIQNRFS  ADEQPLS LPP++ D
Subjt:  MSHRNHDDEGDVELPVSK---EDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTLPPMSLD

Query:  EEEDEFEMLRSIQRRFAAYESDTLSNKPNQLCDHVGSLKMDFDDTAVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEE
        EEED+FEMLRSIQRRFAAYESD LSNKP+Q CD  G LKMD ++T VE  T+S+R SM+AFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEE
Subjt:  EEEDEFEMLRSIQRRFAAYESDTLSNKPNQLCDHVGSLKMDFDDTAVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEE

Query:  NKKLRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSAMHYGPAENSHVACYRMALMKFPRVDRKKWSVVERENLGKGIRQQ
        NKKLRKRFK+LK FQGSC+R+TTCAL+QM+DPRVQLISA KPQAKDSSKKDKRLS+M YGPAENSHVACYR A  KF  VDRK+WS  ERENLGKGIRQQ
Subjt:  NKKLRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSAMHYGPAENSHVACYRMALMKFPRVDRKKWSVVERENLGKGIRQQ

Query:  FQEMVLQISVDQISGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEASMSTSISIAEYRWLNFEDPLINRDPWTTSE
        FQEMVLQISVDQIS +QGFSA+SDDLDNILASIK LDITPEKIREFLPKVNWDKLASMYLRGRSGAECEA           RWLNFEDPLINR+PWTTSE
Subjt:  FQEMVLQISVDQISGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEASMSTSISIAEYRWLNFEDPLINRDPWTTSE

Query:  DKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRG
        DKNLLFTIQQKGLNNWIE+AVS GTNRTPFQCLSRYQRSLNASILK EWTKDEDDKLR+AVA FG GDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRG
Subjt:  DKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRG

Query:  HFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNDCRRRWKKLFPNEV
        +FTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSW KVAACVPSRTDN+CRRRWKKLFPN+V
Subjt:  HFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNDCRRRWKKLFPNEV

Query:  PLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNCQRNQADVTAQAGIANNTS
        PLLQEARKIQK ALISNFVDRESERPALGPTDFRP PN+ LLCN+ DP  APKRNV+TR+MPVSRNEKSA GDAPK+RKSN QRN+AD TAQ    NNTS
Subjt:  PLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNCQRNQADVTAQAGIANNTS

Query:  SVPEEVKSLKPQRKQNRHGAYTA--EGVPELCPNSEWCAKQSLDTQSLGVQLSLLE-SEGTNSDCIETVDENGMEVFENKVAEKLSERDVCFPEPEAIQN
        SVP EVKS KPQRK+ RHGAYT   +G P++  NSE CA+Q+ DT+S+ VQL+  E +E  NSDC ETVDENGMEVFENK AE  SE  VCF E E  QN
Subjt:  SVPEEVKSLKPQRKQNRHGAYTA--EGVPELCPNSEWCAKQSLDTQSLGVQLSLLE-SEGTNSDCIETVDENGMEVFENKVAEKLSERDVCFPEPEAIQN

Query:  STGSSGVSVLSEMTNDMDEYNPSILPDTTLLASTTVDDLKELKRKSVADRDLDDCNSFSLPCSDLELRTIDKEGVDSYSMDEFTDKSHGVCKPPHTRRKK
        STGSSGVSVLSEMTNDMDEYNPS LPDTTLLAS T DD+ E K  +VAD+DLDD NSFSLP S LELRT D EGVDSYS+DEFTDKSHGVCK P  RRKK
Subjt:  STGSSGVSVLSEMTNDMDEYNPSILPDTTLLASTTVDDLKELKRKSVADRDLDDCNSFSLPCSDLELRTIDKEGVDSYSMDEFTDKSHGVCKPPHTRRKK

Query:  NSKTSSKSQDHSFVSCQQVELERLGTNEPRHHNQSKKRKHNSTNTN----LEAVEEVDDCTLVGFLQKRLKRTTTTNDKKVDCSSSTPIEVDTDDNDPTL
        NSK S+KSQD S VSCQQ ELE  GTNE    NQSKKRKH+ TNT+    +EAVEEVDDCTL GFLQKRLKRTTTT+DKKVD SSSTP EVD DDNDPTL
Subjt:  NSKTSSKSQDHSFVSCQQVELERLGTNEPRHHNQSKKRKHNSTNTN----LEAVEEVDDCTLVGFLQKRLKRTTTTNDKKVDCSSSTPIEVDTDDNDPTL

Query:  ASFLN-KLKRKKH
        A  LN KLKRKKH
Subjt:  ASFLN-KLKRKKH

XP_023515736.1 uncharacterized protein LOC111779809 isoform X2 [Cucurbita pepo subsp. pepo]0.0e+0081.05Show/hide
Query:  MSHRNHDDEGDVELPVSK---EDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTLPPMSLD
        MS R+H D GD ELP S+   EDD+VD+DME LRRA +  GVN EDYINPRLS P AGDAN  SDSDDVDD ELLRNIQNRFS  ADEQPLS LPP++ D
Subjt:  MSHRNHDDEGDVELPVSK---EDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTLPPMSLD

Query:  EEEDEFEMLRSIQRRFAAYESDTLSNKPNQLCDHVGSLKMDFDDTAVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEE
        EEED+FEMLRSIQRRFAAYESD LSNKP+Q CD  G LKMD ++T VE  T+S+R SM+AFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEE
Subjt:  EEEDEFEMLRSIQRRFAAYESDTLSNKPNQLCDHVGSLKMDFDDTAVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEE

Query:  NKKLRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSAMHYGPAENSHVACYRMALMKFPRVDRKKWSVVERENLGKGIRQQ
        NKKLRKRFK+LK FQGSC+R+TTCAL+QM+DPRVQLISA KPQAKDSSKKDKRLS+M YGPAENSHVACYR A  KF  VDRK+WS  ERENLGKGIRQQ
Subjt:  NKKLRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSAMHYGPAENSHVACYRMALMKFPRVDRKKWSVVERENLGKGIRQQ

Query:  FQEMVLQISVDQISGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEASMSTSISIAEYRWLNFEDPLINRDPWTTSE
        FQEMVLQISVDQIS +QGFSA+SDDLDNILASIK LDITPEKIREFLPKVNWDKLASMYLRGRSGAECEA           RWLNFEDPLINR+PWTTSE
Subjt:  FQEMVLQISVDQISGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEASMSTSISIAEYRWLNFEDPLINRDPWTTSE

Query:  DKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRG
        DKNLLFTIQQKGLNNWIE+AVS GTNRTPFQCLSRYQRSLNASILK EWTKDEDDKLR+AVA FG GDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRG
Subjt:  DKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRG

Query:  HFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNDCRRRWKKLFPNEV
        +FTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSW KVAACVPSRTDN+C RRWKKLFPN+V
Subjt:  HFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNDCRRRWKKLFPNEV

Query:  PLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNCQRNQADVTAQAGIANNTS
        PLLQEARKIQK ALISNFVDRESERPALGPTDFRP PN+ LLCN+ DP  APKRNV+TR+MPVSRNEKSA GDAPK+RKSN QRN+AD TAQ    NNTS
Subjt:  PLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNCQRNQADVTAQAGIANNTS

Query:  SVPEEVKSLKPQRKQNRHGAYTA--EGVPELCPNSEWCAKQSLDTQSLGVQLSLLE-SEGTNSDCIETVDENGMEVFENKVAEKLSERDVCFPEPEAIQN
        SVP EVKS KPQRK+ RHGAYT   +G P++  NSE CA+Q+ DT+S+ VQL+  E +E  NSDC ETVDENGMEVFENK AE  SE  VCF E E  QN
Subjt:  SVPEEVKSLKPQRKQNRHGAYTA--EGVPELCPNSEWCAKQSLDTQSLGVQLSLLE-SEGTNSDCIETVDENGMEVFENKVAEKLSERDVCFPEPEAIQN

Query:  STGSSGVSVLSEMTNDMDEYNPSILPDTTLLASTTVDDLKELKRKSVADRDLDDCNSFSLPCSDLELRTIDKEGVDSYSMDEFTDKSHGVCKPPHTRRKK
        STGSSGVSVLSEMTNDMDEYNPS LPDTTLLAS T DD+ E K  +VAD+DLDD NSFSLP S LELRT D EGVDSYS+DEFTDKSHGVCK P  RRKK
Subjt:  STGSSGVSVLSEMTNDMDEYNPSILPDTTLLASTTVDDLKELKRKSVADRDLDDCNSFSLPCSDLELRTIDKEGVDSYSMDEFTDKSHGVCKPPHTRRKK

Query:  NSKTSSKSQDHSFVSCQQVELERLGTNEPRHHNQSKKRKHNSTNTN----LEAVEEVDDCTLVGFLQKRLKRTTTTNDKKVDCSSSTPIEVDTDDNDPTL
        NSK S+KSQD S VSCQQ ELE  GTNE    NQSKKRKH+ TNT+    +EAVEEVDDCTL GFLQKRLKRTTTT+DKKVD SSSTP EVD DDNDPTL
Subjt:  NSKTSSKSQDHSFVSCQQVELERLGTNEPRHHNQSKKRKHNSTNTN----LEAVEEVDDCTLVGFLQKRLKRTTTTNDKKVDCSSSTPIEVDTDDNDPTL

Query:  ASFLN-KLKRKKH
        A  LN KLKRKKH
Subjt:  ASFLN-KLKRKKH

XP_038905712.1 uncharacterized protein LOC120091681 isoform X1 [Benincasa hispida]0.0e+0084.95Show/hide
Query:  VRCAHSCSSMSHRNHDDEGDVELPVSKEDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTL
        VR AH  ++MS  NH DEGDVELP +KEDDVVDEDME L+RAY+ VGVNPEDYINPRLSSP  GDAN   DSDD DDFELLRNIQNRFS V DEQPLSTL
Subjt:  VRCAHSCSSMSHRNHDDEGDVELPVSKEDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTL

Query:  PPMSLDEEEDEFEMLRSIQRRFAAYESDTLSNKPNQLCDHVGSLKMDFDDTAVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHL
        PP+SLDEEEDEFEMLRSIQRRFAAYESD LSNKPN+  D+VGSLKMD  +TA ESQT+SKRPSM+AFEKGSLPKAALAF+DAIKKNRSQQKFIRSKMIHL
Subjt:  PPMSLDEEEDEFEMLRSIQRRFAAYESDTLSNKPNQLCDHVGSLKMDFDDTAVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHL

Query:  EARIEENKKLRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSAMHYGPAENSHVACYRMALMKFPRVDRKKWSVVERENLG
        EARIEENKKLRKR KILKDFQGSCKR+TTCALSQMIDPRVQLISAAKPQAKDSSKKDKRLS M+YGPAENSHVACYRMAL KFPRVDRKKWS+VERENLG
Subjt:  EARIEENKKLRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSAMHYGPAENSHVACYRMALMKFPRVDRKKWSVVERENLG

Query:  KGIRQQFQEMVLQISVDQISGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEASMSTSISIAEYRWLNFEDPLINRD
        KGIRQQFQEMVLQISVDQISGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYL GRSGAECEA           RWLNFEDPLINRD
Subjt:  KGIRQQFQEMVLQISVDQISGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEASMSTSISIAEYRWLNFEDPLINRD

Query:  PWTTSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDP
        PWTTSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLR+AVA FG+ DWQAVASTLEGR G QCSNRWKKSLDP
Subjt:  PWTTSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDP

Query:  ARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNDCRRRWKK
        ARTKRGHFTPDED+RLKIAVLL GPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSW KVAACVPSRTDNDCRRRWKK
Subjt:  ARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNDCRRRWKK

Query:  LFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNCQRNQADVTAQAG
        LFPNEVPLLQEARKIQKAALISNFVDRESERPALGP DFRPR NTD+LC++ DP+PAPKRN KTRKMPVSRNEKSATGDAP+KRKSN QRNQAD TA+ G
Subjt:  LFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNCQRNQADVTAQAG

Query:  IANNTSSVPEEVKSLKPQRKQNRHGAYTAE--GVPELCPNSEWCAKQSLDTQSLGVQLSLLESEGTNSDCIETVDENGMEVFENKVAEKLSERDVCFPEP
        IANNTSSVPEEV+SLKP RK+NRH A T +  GV EL  N +WCAKQ+L+T+S+GVQLS  E E TNSD  ETVD NG+EVFENK+A+KLSERDV F EP
Subjt:  IANNTSSVPEEVKSLKPQRKQNRHGAYTAE--GVPELCPNSEWCAKQSLDTQSLGVQLSLLESEGTNSDCIETVDENGMEVFENKVAEKLSERDVCFPEP

Query:  EAIQNSTGSSGVSVLSEMTNDMDEYNPSILPDTTLLASTTVDDLKELKRKSVADRDLDDCNSFSLPCSDLELRTIDKEGVDSYSMDEFTDKSHGVCKPPH
        E  QNSTGSSGVSVLSEMTNDMDEYNPSILPDTTLLASTTVDD++ELK KS ADRDLDD NSFSLP S LELRTID EGVDSYS+D+ TDKSH VCK P 
Subjt:  EAIQNSTGSSGVSVLSEMTNDMDEYNPSILPDTTLLASTTVDDLKELKRKSVADRDLDDCNSFSLPCSDLELRTIDKEGVDSYSMDEFTDKSHGVCKPPH

Query:  TRRKKNSKTSSKSQDHSFVSCQQVELERLGTNEPRHHNQSKKRKHNSTNTN----LEAVEEVDDCTLVGFLQKRLKRTTTTNDKKVDCSSSTPIEVDTDD
         RRKKNSKTS K+ ++SF+SCQQVE ERLG NEPRH NQSKKRKH+STNT+    LEAVEEVD+CTLVGFLQKRL        KKVDCSS TP+EVD DD
Subjt:  TRRKKNSKTSSKSQDHSFVSCQQVELERLGTNEPRHHNQSKKRKHNSTNTN----LEAVEEVDDCTLVGFLQKRLKRTTTTNDKKVDCSSSTPIEVDTDD

Query:  NDPTLASFLNKLKRKKHQPSSGG
        ND  +ASFLNKLKRKKHQP S G
Subjt:  NDPTLASFLNKLKRKKHQPSSGG

XP_038905717.1 uncharacterized protein LOC120091681 isoform X2 [Benincasa hispida]0.0e+0085.31Show/hide
Query:  MSHRNHDDEGDVELPVSKEDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTLPPMSLDEEE
        MS  NH DEGDVELP +KEDDVVDEDME L+RAY+ VGVNPEDYINPRLSSP  GDAN   DSDD DDFELLRNIQNRFS V DEQPLSTLPP+SLDEEE
Subjt:  MSHRNHDDEGDVELPVSKEDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTLPPMSLDEEE

Query:  DEFEMLRSIQRRFAAYESDTLSNKPNQLCDHVGSLKMDFDDTAVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKK
        DEFEMLRSIQRRFAAYESD LSNKPN+  D+VGSLKMD  +TA ESQT+SKRPSM+AFEKGSLPKAALAF+DAIKKNRSQQKFIRSKMIHLEARIEENKK
Subjt:  DEFEMLRSIQRRFAAYESDTLSNKPNQLCDHVGSLKMDFDDTAVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKK

Query:  LRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSAMHYGPAENSHVACYRMALMKFPRVDRKKWSVVERENLGKGIRQQFQE
        LRKR KILKDFQGSCKR+TTCALSQMIDPRVQLISAAKPQAKDSSKKDKRLS M+YGPAENSHVACYRMAL KFPRVDRKKWS+VERENLGKGIRQQFQE
Subjt:  LRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSAMHYGPAENSHVACYRMALMKFPRVDRKKWSVVERENLGKGIRQQFQE

Query:  MVLQISVDQISGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEASMSTSISIAEYRWLNFEDPLINRDPWTTSEDKN
        MVLQISVDQISGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYL GRSGAECEA           RWLNFEDPLINRDPWTTSEDKN
Subjt:  MVLQISVDQISGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEASMSTSISIAEYRWLNFEDPLINRDPWTTSEDKN

Query:  LLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFT
        LLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLR+AVA FG+ DWQAVASTLEGR G QCSNRWKKSLDPARTKRGHFT
Subjt:  LLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFT

Query:  PDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNDCRRRWKKLFPNEVPLL
        PDED+RLKIAVLL GPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSW KVAACVPSRTDNDCRRRWKKLFPNEVPLL
Subjt:  PDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNDCRRRWKKLFPNEVPLL

Query:  QEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNCQRNQADVTAQAGIANNTSSVP
        QEARKIQKAALISNFVDRESERPALGP DFRPR NTD+LC++ DP+PAPKRN KTRKMPVSRNEKSATGDAP+KRKSN QRNQAD TA+ GIANNTSSVP
Subjt:  QEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNCQRNQADVTAQAGIANNTSSVP

Query:  EEVKSLKPQRKQNRHGAYTAE--GVPELCPNSEWCAKQSLDTQSLGVQLSLLESEGTNSDCIETVDENGMEVFENKVAEKLSERDVCFPEPEAIQNSTGS
        EEV+SLKP RK+NRH A T +  GV EL  N +WCAKQ+L+T+S+GVQLS  E E TNSD  ETVD NG+EVFENK+A+KLSERDV F EPE  QNSTGS
Subjt:  EEVKSLKPQRKQNRHGAYTAE--GVPELCPNSEWCAKQSLDTQSLGVQLSLLESEGTNSDCIETVDENGMEVFENKVAEKLSERDVCFPEPEAIQNSTGS

Query:  SGVSVLSEMTNDMDEYNPSILPDTTLLASTTVDDLKELKRKSVADRDLDDCNSFSLPCSDLELRTIDKEGVDSYSMDEFTDKSHGVCKPPHTRRKKNSKT
        SGVSVLSEMTNDMDEYNPSILPDTTLLASTTVDD++ELK KS ADRDLDD NSFSLP S LELRTID EGVDSYS+D+ TDKSH VCK P  RRKKNSKT
Subjt:  SGVSVLSEMTNDMDEYNPSILPDTTLLASTTVDDLKELKRKSVADRDLDDCNSFSLPCSDLELRTIDKEGVDSYSMDEFTDKSHGVCKPPHTRRKKNSKT

Query:  SSKSQDHSFVSCQQVELERLGTNEPRHHNQSKKRKHNSTNTN----LEAVEEVDDCTLVGFLQKRLKRTTTTNDKKVDCSSSTPIEVDTDDNDPTLASFL
        S K+ ++SF+SCQQVE ERLG NEPRH NQSKKRKH+STNT+    LEAVEEVD+CTLVGFLQKRL        KKVDCSS TP+EVD DDND  +ASFL
Subjt:  SSKSQDHSFVSCQQVELERLGTNEPRHHNQSKKRKHNSTNTN----LEAVEEVDDCTLVGFLQKRLKRTTTTNDKKVDCSSSTPIEVDTDDNDPTLASFL

Query:  NKLKRKKHQPSSGG
        NKLKRKKHQP S G
Subjt:  NKLKRKKHQPSSGG

TrEMBL top hitse value%identityAlignment
A0A0A0L2R2 Uncharacterized protein0.0e+0082.1Show/hide
Query:  MSHRNHDDEGDVELPVSKEDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTLPPMSLDEEE
        MS RNH DE DVE P  KED VVDEDME L+RAY+  GVNPEDYINPRLSSP AGDA+P SDSDDVDDFELLR+IQNRFS +ADEQP ST  P+S DEEE
Subjt:  MSHRNHDDEGDVELPVSKEDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTLPPMSLDEEE

Query:  DEFEMLRSIQRRFAAYESDTLSNKPNQLCDHVGSLKMDFDDTAVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKK
        DEFEMLRSIQRRFAAYESDTLSNKPNQ  D+VGSLK+D DD AVESQT+SKRPSMLAFEKGSLPKAALAF+DAIKKNRSQQKFIRSKMIHLEARIEENKK
Subjt:  DEFEMLRSIQRRFAAYESDTLSNKPNQLCDHVGSLKMDFDDTAVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKK

Query:  LRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSAMHYGPAENSHVACYRMALMKFPRVDRKKWSVVERENLGKGIRQQFQE
        LRKR KILKDFQGSCKRRT+CALSQMIDPRVQLISAAKPQAKDSSKKDKRLS M+YGP ENSHVACYRM L KFP VDRKKWS+VERENLGKGIRQQFQE
Subjt:  LRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSAMHYGPAENSHVACYRMALMKFPRVDRKKWSVVERENLGKGIRQQFQE

Query:  MVLQISVDQISGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEASMSTSISIAEYRWLNFEDPLINRDPWTTSEDKN
        MVLQISVDQISG QG S DSDDLDNILASIKDLDI P+KIREFLPKVNWDKLASMYL+GRSGAECEA           RWLNFEDPLINRDPWTTSEDK+
Subjt:  MVLQISVDQISGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEASMSTSISIAEYRWLNFEDPLINRDPWTTSEDKN

Query:  LLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFT
        LLFTIQQKGLNNWIE+AVS GTNRTPFQCLSRYQRSLNASILKREWTK+EDD+LR+AVATFG+ DWQAVASTLEGR G QCSNRWKKSLDPART++G+FT
Subjt:  LLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFT

Query:  PDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNDCRRRWKKLFPNEVPLL
        PDED RLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSW KVAACVPSRTDN+CRRRWKKLFP+EVPLL
Subjt:  PDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNDCRRRWKKLFPNEVPLL

Query:  QEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNCQRNQADVTAQAGIANNTSSVP
        QEARKIQKAALISNFVDRE+ERPALGP DFRPRPNTD LCN+  P PAPKRNVKTRKMPVSRNEKSATGDAPKKRKSN QR Q D TAQ GIA NTS VP
Subjt:  QEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNCQRNQADVTAQAGIANNTSSVP

Query:  EEVKSLKPQRKQNRHGAYTAE--GVPELCPNSEWCAKQSLDTQSLGVQLSLLESEGTNSDCIETVDENGMEVFENKVAEKLSERDVCFPEPEAIQNSTGS
        EEV+S KPQRK+NR GAYTA+  GVPEL  +SEWCAKQ+LDT+SLG+QL+  ESE +NS+C ETVDEN MEV ENKVAEKL+E + CF EPE  QNSTGS
Subjt:  EEVKSLKPQRKQNRHGAYTAE--GVPELCPNSEWCAKQSLDTQSLGVQLSLLESEGTNSDCIETVDENGMEVFENKVAEKLSERDVCFPEPEAIQNSTGS

Query:  SGVSVLSEMTNDMDEYNPSILPDTTLLASTTVDDLKELKRKSVADRDLDDCNSFSLPCSDLELRTIDKEGVDSYSMDEFTDKSHGVCKPPHTRRKKNSKT
        SGVSVLSEMTND+ +YNPSIL DTTL ASTTVDD++ELK KS ADRDLDD NSFSL  S LELRT+D EGVDSYS+DE+T KS+GVC P   RRKKNSKT
Subjt:  SGVSVLSEMTNDMDEYNPSILPDTTLLASTTVDDLKELKRKSVADRDLDDCNSFSLPCSDLELRTIDKEGVDSYSMDEFTDKSHGVCKPPHTRRKKNSKT

Query:  SSKSQDHSFVSCQQVELERLGTNEPRHHNQSKKRKHNSTNTNL----EAVEEVDDCTLVGFLQKRLKRTTTTNDKKVDCSSSTPIEVDTDDNDPTLASFL
        S+ S D+  +  QQ+  E LGT +P HHNQSKKRKH++T  +     EAVEEVDDCTLVGFLQKRLKRT  T+++ VDCSS+ P++VD DDN+PT+ASFL
Subjt:  SSKSQDHSFVSCQQVELERLGTNEPRHHNQSKKRKHNSTNTNL----EAVEEVDDCTLVGFLQKRLKRTTTTNDKKVDCSSSTPIEVDTDDNDPTLASFL

Query:  NKLKRKKHQPSSGGELN
        NKLKRKKHQ  SG ELN
Subjt:  NKLKRKKHQPSSGGELN

A0A6J1E2J4 uncharacterized protein LOC111430000 isoform X20.0e+0080.65Show/hide
Query:  MSHRNHDDEGDVELPVSK---EDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTLPPMSLD
        MS R+H D GD ELP S+   EDD+VD+DME LRRA +  GVN ED INPRLS P AGDAN  SDSDDVDD ELLRNIQNRFS  ADEQPLS LPP++ D
Subjt:  MSHRNHDDEGDVELPVSK---EDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTLPPMSLD

Query:  EEEDEFEMLRSIQRRFAAYESDTLSNKPNQLCDHVGSLKMDFDDTAVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEE
        EEED+FE LRSIQRRFAAYESD LSNKP+Q CD  G LKMD D+T V   T+S+R SM+AFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEE
Subjt:  EEEDEFEMLRSIQRRFAAYESDTLSNKPNQLCDHVGSLKMDFDDTAVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEE

Query:  NKKLRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSAMHYGPAENSHVACYRMALMKFPRVDRKKWSVVERENLGKGIRQQ
        NKKLRKRFK+LK FQGSC+R+TTCAL+QM+DPRVQLISA KPQAKDSSKKDKRLSAM YGPAENSHVACYR AL KF  VDRK+WS  ERENLGKGIRQQ
Subjt:  NKKLRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSAMHYGPAENSHVACYRMALMKFPRVDRKKWSVVERENLGKGIRQQ

Query:  FQEMVLQISVDQISGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEASMSTSISIAEYRWLNFEDPLINRDPWTTSE
        FQEMVLQISVDQIS +QGFSA+SDDLDNILASIK LDITPEKIREFLPKVNWDKLA MYL+GRSGAECEA           RWLNFEDPLINR+ WTTSE
Subjt:  FQEMVLQISVDQISGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEASMSTSISIAEYRWLNFEDPLINRDPWTTSE

Query:  DKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRG
        DKNLLFTIQQKGLNNWIE+AVS GTNRTPFQCLSRYQRSLNASILK EWTKDEDDKLR+AVA FG GDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRG
Subjt:  DKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRG

Query:  HFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNDCRRRWKKLFPNEV
        +FTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSW KVAACVPSRTDN+C RRWKKLFPN+V
Subjt:  HFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNDCRRRWKKLFPNEV

Query:  PLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNCQRNQADVTAQAGIANNTS
        PLLQEARKIQK ALISNFVDRESERPALGPTDFRP PN+ LLCN+ DP  APKRNV+ R+MPVSRNEKSA GDAPKK KSN QRNQAD TAQ   ANNTS
Subjt:  PLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNCQRNQADVTAQAGIANNTS

Query:  SVPEEVKSLKPQRKQNRHGAYTA--EGVPELCPNSEWCAKQSLDTQSLGVQLSLLE-SEGTNSDCIETVDENGMEVFENKVAEKLSERDVCFPEPEAIQN
        SVP EVKS KPQRK+ RHGAYT   +G P++  NSE CA+Q+ DT+SL VQL+  E +E  NSDC ETVDENGMEVFENK AE  SE  VCF E E  QN
Subjt:  SVPEEVKSLKPQRKQNRHGAYTA--EGVPELCPNSEWCAKQSLDTQSLGVQLSLLE-SEGTNSDCIETVDENGMEVFENKVAEKLSERDVCFPEPEAIQN

Query:  STGSSGVSVLSEMTNDMDEYNPSILPDTTLLASTTVDDLKELKRKSVADRDLDDCNSFSLPCSDLELRTIDKEGVDSYSMDEFTDKSHGVCKPPHTRRKK
        STGSSGVSVLSEMTNDMDEYNPS  PDTTLLAS T DD+ E K  +VAD+DLDD NSFSLP S LELRT D EGVDSYS+DEFTDKSHGVCK P  RRKK
Subjt:  STGSSGVSVLSEMTNDMDEYNPSILPDTTLLASTTVDDLKELKRKSVADRDLDDCNSFSLPCSDLELRTIDKEGVDSYSMDEFTDKSHGVCKPPHTRRKK

Query:  NSKTSSKSQDHSFVSCQQVELERLGTNEPRHHNQSKKRKHNSTNTN----LEAVEEVDDCTLVGFLQKRLKRTTTTNDKKVDCSSSTPIEVDTDDNDPTL
        NSK S+KSQD S VSCQQ ELE  G NE    NQSKKRKH+ TNT+    +EAVEEVDDCTL GFLQKRLKRTTTT+DKKVD SSSTP EVD DDNDPTL
Subjt:  NSKTSSKSQDHSFVSCQQVELERLGTNEPRHHNQSKKRKHNSTNTN----LEAVEEVDDCTLVGFLQKRLKRTTTTNDKKVDCSSSTPIEVDTDDNDPTL

Query:  ASFL-NKLKRKKH
        A  L +KLKRKKH
Subjt:  ASFL-NKLKRKKH

A0A6J1E6Z7 uncharacterized protein LOC111430000 isoform X10.0e+0080.75Show/hide
Query:  MSHRNHDDEGDVELPVSK---EDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTLPPMSLD
        MS R+H D GD ELP S+   EDD+VD+DME LRRA +  GVN ED INPRLS P AGDAN  SDSDDVDD ELLRNIQNRFS  ADEQPLS LPP++ D
Subjt:  MSHRNHDDEGDVELPVSK---EDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTLPPMSLD

Query:  EEEDEFEMLRSIQRRFAAYESDTLSNKPNQLCDHVGSLKMDFDDTAVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEE
        EEED+FE LRSIQRRFAAYESD LSNKP+Q CD  G LKMD D+T V   T+S+R SM+AFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEE
Subjt:  EEEDEFEMLRSIQRRFAAYESDTLSNKPNQLCDHVGSLKMDFDDTAVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEE

Query:  NKKLRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSAMHYGPAENSHVACYRMALMKFPRVDRKKWSVVERENLGKGIRQQ
        NKKLRKRFK+LK FQGSC+R+TTCAL+QM+DPRVQLISA KPQAKDSSKKDKRLSAM YGPAENSHVACYR AL KF  VDRK+WS  ERENLGKGIRQQ
Subjt:  NKKLRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSAMHYGPAENSHVACYRMALMKFPRVDRKKWSVVERENLGKGIRQQ

Query:  FQEMVLQISVDQISGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEASMSTSISIAEYRWLNFEDPLINRDPWTTSE
        FQEMVLQISVDQIS +QGFSA+SDDLDNILASIK LDITPEKIREFLPKVNWDKLA MYL+GRSGAECEA           RWLNFEDPLINR+ WTTSE
Subjt:  FQEMVLQISVDQISGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEASMSTSISIAEYRWLNFEDPLINRDPWTTSE

Query:  DKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRG
        DKNLLFTIQQKGLNNWIE+AVS GTNRTPFQCLSRYQRSLNASILK EWTKDEDDKLR+AVA FG GDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRG
Subjt:  DKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRG

Query:  HFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNDCRRRWKKLFPNEV
        +FTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSW KVAACVPSRTDN+CRRRWKKLFPN+V
Subjt:  HFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNDCRRRWKKLFPNEV

Query:  PLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNCQRNQADVTAQAGIANNTS
        PLLQEARKIQK ALISNFVDRESERPALGPTDFRP PN+ LLCN+ DP  APKRNV+ R+MPVSRNEKSA GDAPKK KSN QRNQAD TAQ   ANNTS
Subjt:  PLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNCQRNQADVTAQAGIANNTS

Query:  SVPEEVKSLKPQRKQNRHGAYTA--EGVPELCPNSEWCAKQSLDTQSLGVQLSLLE-SEGTNSDCIETVDENGMEVFENKVAEKLSERDVCFPEPEAIQN
        SVP EVKS KPQRK+ RHGAYT   +G P++  NSE CA+Q+ DT+SL VQL+  E +E  NSDC ETVDENGMEVFENK AE  SE  VCF E E  QN
Subjt:  SVPEEVKSLKPQRKQNRHGAYTA--EGVPELCPNSEWCAKQSLDTQSLGVQLSLLE-SEGTNSDCIETVDENGMEVFENKVAEKLSERDVCFPEPEAIQN

Query:  STGSSGVSVLSEMTNDMDEYNPSILPDTTLLASTTVDDLKELKRKSVADRDLDDCNSFSLPCSDLELRTIDKEGVDSYSMDEFTDKSHGVCKPPHTRRKK
        STGSSGVSVLSEMTNDMDEYNPS  PDTTLLAS T DD+ E K  +VAD+DLDD NSFSLP S LELRT D EGVDSYS+DEFTDKSHGVCK P  RRKK
Subjt:  STGSSGVSVLSEMTNDMDEYNPSILPDTTLLASTTVDDLKELKRKSVADRDLDDCNSFSLPCSDLELRTIDKEGVDSYSMDEFTDKSHGVCKPPHTRRKK

Query:  NSKTSSKSQDHSFVSCQQVELERLGTNEPRHHNQSKKRKHNSTNTN----LEAVEEVDDCTLVGFLQKRLKRTTTTNDKKVDCSSSTPIEVDTDDNDPTL
        NSK S+KSQD S VSCQQ ELE  G NE    NQSKKRKH+ TNT+    +EAVEEVDDCTL GFLQKRLKRTTTT+DKKVD SSSTP EVD DDNDPTL
Subjt:  NSKTSSKSQDHSFVSCQQVELERLGTNEPRHHNQSKKRKHNSTNTN----LEAVEEVDDCTLVGFLQKRLKRTTTTNDKKVDCSSSTPIEVDTDDNDPTL

Query:  ASFL-NKLKRKKH
        A  L +KLKRKKH
Subjt:  ASFL-NKLKRKKH

A0A6J1JK98 uncharacterized protein LOC111485355 isoform X20.0e+0080.37Show/hide
Query:  MSHRNHDDEGDVELPVSK---EDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTLPPMSLD
        MS R+H D GD ELP S+   EDD+VD+DME LRRA +  GVN EDY+NP+LS P AGDAN  SDSDDVDD ELLRNIQNRFS  ADEQPLS LPP++ D
Subjt:  MSHRNHDDEGDVELPVSK---EDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTLPPMSLD

Query:  EEEDEFEMLRSIQRRFAAYESDTLSNKPNQLCDHVGSLKMDFDDTAVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEE
        EEED+FE LRSIQRRFAAYESD LSNKP+Q CD  G LKMD D+T VE  T+S+R SM+AFEKGSLPKAALAFIDAIKKNRSQQKF+RSKMIHLEARIEE
Subjt:  EEEDEFEMLRSIQRRFAAYESDTLSNKPNQLCDHVGSLKMDFDDTAVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEE

Query:  NKKLRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKP-QAKDSSKKDKRLSAMHYGPAENSHVACYRMALMKFPRVDRKKWSVVERENLGKGIRQ
        NKKLRKRFK+LK FQGSC+R+TTCALSQM+DPRVQLISA KP QAKDSSKKDKRLSAM YGPAENSHVACYR+AL KF  VDRK+WS  ERENLGKGIRQ
Subjt:  NKKLRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKP-QAKDSSKKDKRLSAMHYGPAENSHVACYRMALMKFPRVDRKKWSVVERENLGKGIRQ

Query:  QFQEMVLQISVDQISGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEASMSTSISIAEYRWLNFEDPLINRDPWTTS
        QFQEMVLQISVDQIS +QGFSA+SDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEA           RWLNFEDPLINR+PWTTS
Subjt:  QFQEMVLQISVDQISGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEASMSTSISIAEYRWLNFEDPLINRDPWTTS

Query:  EDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKR
        EDKNLLFTIQQKGLNNWI++AVS GTNRTPFQ LSRYQRSLNASILK EWTKDEDDKLR+AVA FG GDWQAVASTLEGRTGPQCSNRWKKSLDPARTKR
Subjt:  EDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKR

Query:  GHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNDCRRRWKKLFPNE
        G+FTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSW KVAACVPSRTDN+C RRWKKLFPN+
Subjt:  GHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNDCRRRWKKLFPNE

Query:  VPLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNCQRNQADVTAQAGIANNT
        VPLLQEARKIQK ALISNFVDRESERPALGPTDFRP PN+ LLCN+ DP  APKRNV+TR+MPVSRNEKSA GDAPKKRKSN QRN+ D TAQ   A+NT
Subjt:  VPLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNCQRNQADVTAQAGIANNT

Query:  SSVPEEVKSLKPQRKQNRHGAYTA--EGVPELCPNSEWCAKQSLDTQSLGVQLSLLE-SEGTNSDCIETVDENGMEVFENKVAEKLSERDVCFPEPEAIQ
        SSVP EVKS KPQRK+ RHGAYT   +G P++  NSE CA+Q+ DT++L VQL+  E +E  NSDC ETVDENGMEVFENK AE  SE  VCF E E  Q
Subjt:  SSVPEEVKSLKPQRKQNRHGAYTA--EGVPELCPNSEWCAKQSLDTQSLGVQLSLLE-SEGTNSDCIETVDENGMEVFENKVAEKLSERDVCFPEPEAIQ

Query:  NSTGSSGVSVLSEMTNDMDEYNPSILPDTTLLASTTVDDLKELKRKSVADRDLDDCNSFSLPCSDLELRTIDKEGVDSYSMDEFTDKSHGVCKPPHTRRK
        NSTGSSGVSVLSEMTNDMDEYNPS LPDTTLLAS T DD+ E K  +VAD+DLD  NSFSLP S LELRT D EGVDSYS+DEFTDKSH VCK P  RRK
Subjt:  NSTGSSGVSVLSEMTNDMDEYNPSILPDTTLLASTTVDDLKELKRKSVADRDLDDCNSFSLPCSDLELRTIDKEGVDSYSMDEFTDKSHGVCKPPHTRRK

Query:  KNSKTSSKSQDHSFVSCQQVELERLGTNEPRHHNQSKKRKHNSTNTN----LEAVEEVDDCTLVGFLQKRLKRTTTTNDKKVDCSSSTPIEVDTDDNDPT
        KNSK S+KSQD S VSCQQ ELE  GTNE    NQ KKRKH+STNT+    +EAVEEVDDCTL+GFLQKRLKRTTTT+ KKVD SSST  EVD DDNDPT
Subjt:  KNSKTSSKSQDHSFVSCQQVELERLGTNEPRHHNQSKKRKHNSTNTN----LEAVEEVDDCTLVGFLQKRLKRTTTTNDKKVDCSSSTPIEVDTDDNDPT

Query:  LASFL-NKLKRKKH
        LA  L  KLKRKKH
Subjt:  LASFL-NKLKRKKH

A0A6J1JKV7 uncharacterized protein LOC111485355 isoform X10.0e+0080.47Show/hide
Query:  MSHRNHDDEGDVELPVSK---EDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTLPPMSLD
        MS R+H D GD ELP S+   EDD+VD+DME LRRA +  GVN EDY+NP+LS P AGDAN  SDSDDVDD ELLRNIQNRFS  ADEQPLS LPP++ D
Subjt:  MSHRNHDDEGDVELPVSK---EDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTLPPMSLD

Query:  EEEDEFEMLRSIQRRFAAYESDTLSNKPNQLCDHVGSLKMDFDDTAVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEE
        EEED+FE LRSIQRRFAAYESD LSNKP+Q CD  G LKMD D+T VE  T+S+R SM+AFEKGSLPKAALAFIDAIKKNRSQQKF+RSKMIHLEARIEE
Subjt:  EEEDEFEMLRSIQRRFAAYESDTLSNKPNQLCDHVGSLKMDFDDTAVESQTASKRPSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEE

Query:  NKKLRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKP-QAKDSSKKDKRLSAMHYGPAENSHVACYRMALMKFPRVDRKKWSVVERENLGKGIRQ
        NKKLRKRFK+LK FQGSC+R+TTCALSQM+DPRVQLISA KP QAKDSSKKDKRLSAM YGPAENSHVACYR+AL KF  VDRK+WS  ERENLGKGIRQ
Subjt:  NKKLRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKP-QAKDSSKKDKRLSAMHYGPAENSHVACYRMALMKFPRVDRKKWSVVERENLGKGIRQ

Query:  QFQEMVLQISVDQISGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEASMSTSISIAEYRWLNFEDPLINRDPWTTS
        QFQEMVLQISVDQIS +QGFSA+SDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEA           RWLNFEDPLINR+PWTTS
Subjt:  QFQEMVLQISVDQISGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEASMSTSISIAEYRWLNFEDPLINRDPWTTS

Query:  EDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKR
        EDKNLLFTIQQKGLNNWI++AVS GTNRTPFQ LSRYQRSLNASILK EWTKDEDDKLR+AVA FG GDWQAVASTLEGRTGPQCSNRWKKSLDPARTKR
Subjt:  EDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKR

Query:  GHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNDCRRRWKKLFPNE
        G+FTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSW KVAACVPSRTDN+CRRRWKKLFPN+
Subjt:  GHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNDCRRRWKKLFPNE

Query:  VPLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNCQRNQADVTAQAGIANNT
        VPLLQEARKIQK ALISNFVDRESERPALGPTDFRP PN+ LLCN+ DP  APKRNV+TR+MPVSRNEKSA GDAPKKRKSN QRN+ D TAQ   A+NT
Subjt:  VPLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNCQRNQADVTAQAGIANNT

Query:  SSVPEEVKSLKPQRKQNRHGAYTA--EGVPELCPNSEWCAKQSLDTQSLGVQLSLLE-SEGTNSDCIETVDENGMEVFENKVAEKLSERDVCFPEPEAIQ
        SSVP EVKS KPQRK+ RHGAYT   +G P++  NSE CA+Q+ DT++L VQL+  E +E  NSDC ETVDENGMEVFENK AE  SE  VCF E E  Q
Subjt:  SSVPEEVKSLKPQRKQNRHGAYTA--EGVPELCPNSEWCAKQSLDTQSLGVQLSLLE-SEGTNSDCIETVDENGMEVFENKVAEKLSERDVCFPEPEAIQ

Query:  NSTGSSGVSVLSEMTNDMDEYNPSILPDTTLLASTTVDDLKELKRKSVADRDLDDCNSFSLPCSDLELRTIDKEGVDSYSMDEFTDKSHGVCKPPHTRRK
        NSTGSSGVSVLSEMTNDMDEYNPS LPDTTLLAS T DD+ E K  +VAD+DLD  NSFSLP S LELRT D EGVDSYS+DEFTDKSH VCK P  RRK
Subjt:  NSTGSSGVSVLSEMTNDMDEYNPSILPDTTLLASTTVDDLKELKRKSVADRDLDDCNSFSLPCSDLELRTIDKEGVDSYSMDEFTDKSHGVCKPPHTRRK

Query:  KNSKTSSKSQDHSFVSCQQVELERLGTNEPRHHNQSKKRKHNSTNTN----LEAVEEVDDCTLVGFLQKRLKRTTTTNDKKVDCSSSTPIEVDTDDNDPT
        KNSK S+KSQD S VSCQQ ELE  GTNE    NQ KKRKH+STNT+    +EAVEEVDDCTL+GFLQKRLKRTTTT+ KKVD SSST  EVD DDNDPT
Subjt:  KNSKTSSKSQDHSFVSCQQVELERLGTNEPRHHNQSKKRKHNSTNTN----LEAVEEVDDCTLVGFLQKRLKRTTTTNDKKVDCSSSTPIEVDTDDNDPT

Query:  LASFL-NKLKRKKH
        LA  L  KLKRKKH
Subjt:  LASFL-NKLKRKKH

SwissProt top hitse value%identityAlignment
P10242 Transcriptional activator Myb5.2e-3139.2Show/hide
Query:  KREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCL
        K  WT++ED+KL+  V   G  DW+ +A+ L  RT  QC +RW+K L+P   K G +T +ED R+   V  +GPK W+  A+ L GR   QCRERW N L
Subjt:  KREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCL

Query:  DPSLRRCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNDCRRRWKKLFPNEVP---LLQEARKIQKAALISNF
        +P +++  WTEEED  +  A +  G  W ++A  +P RTDN  +  W      +V     LQE+ K  + A+ ++F
Subjt:  DPSLRRCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNDCRRRWKKLFPNEVP---LLQEARKIQKAALISNF

P46200 Transcriptional activator Myb5.2e-3139.2Show/hide
Query:  KREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCL
        K  WT++ED+KL+  V   G  DW+ +A+ L  RT  QC +RW+K L+P   K G +T +ED R+   V  +GPK W+  A+ L GR   QCRERW N L
Subjt:  KREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCL

Query:  DPSLRRCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNDCRRRWKKLFPNEVP---LLQEARKIQKAALISNF
        +P +++  WTEEED  +  A +  G  W ++A  +P RTDN  +  W      +V     LQE+ K  + A+ ++F
Subjt:  DPSLRRCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNDCRRRWKKLFPNEVP---LLQEARKIQKAALISNF

Q54NA6 Myb-like protein L2.6e-5433.04Show/hide
Query:  HYGPAENSHVACYRMALMKFPRVDR-KKWSVVERENLGKGIRQQ-FQEMVLQISVDQIS---------GLQGFSADSDDLDNILASIKDLDITPEKIREF
        HY PA+N      R+     P   + ++W+  E E L KGI+++  Q+ + ++S D++S          +Q  S ++++ +N + +  + +   +K    
Subjt:  HYGPAENSHVACYRMALMKFPRVDR-KKWSVVERENLGKGIRQQ-FQEMVLQISVDQIS---------GLQGFSADSDDLDNILASIKDLDITPEKIREF

Query:  LPKVNWDKLASMYLRGRSGAECEASMSTSISIAEYRWLNFEDPLINRDPWTTSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILK
         P +   ++ S          C  S++ S   A  RW N +DP IN+ P+T  EDK LL   ++   + W +I++  GTNRTP  C+ RYQRSLN+ ++K
Subjt:  LPKVNWDKLASMYLRGRSGAECEASMSTSISIAEYRWLNFEDPLINRDPWTTSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILK

Query:  REWTKDEDDKLRAAVATFGLG---DWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFN
        REWTK+ED+ L   +     G   DWQ +   + GRTG QC +RW K+LDP+  K+G ++P+ED  L  AV  +G  NW      + GR  VQCRER+ N
Subjt:  REWTKDEDDKLRAAVATFGLG---DWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFN

Query:  CLDPSLRRCEWTEEEDLRLEIAIQEHGY-SWTKVAACVPSRTDNDCRRRWKKL--FPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTD
         LDP L +  WT +ED RL     + G   W+ VA  + +RTDN C RRWK+L    N +   QE    +K   +SNF  R+ ER  L   D        
Subjt:  CLDPSLRRCEWTEEEDLRLEIAIQEHGY-SWTKVAACVPSRTDNDCRRRWKKL--FPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTD

Query:  LLCNSVDPRPAPKRNVKTRKM---------PVSRNEKSATGDAPKKRKSNC
             ++ +  PK N KT+ +         P + N K+   D    + ++C
Subjt:  LLCNSVDPRPAPKRNVKTRKM---------PVSRNEKSATGDAPKKRKSNC

Q5SXM2 snRNA-activating protein complex subunit 42.4e-3626.5Show/hide
Query:  IKKNRSQQKFIRSKMIHLEARIEENKKLRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSAMHYGPAENSHV-ACYRMALM
        ++ N   Q+ I+ K+      + +N++ ++  ++++D  GS  + T     + + P   +    KP  KD             GP  N          + 
Subjt:  IKKNRSQQKFIRSKMIHLEARIEENKKLRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSAMHYGPAENSHV-ACYRMALM

Query:  KFPRVDRKKWSVVERENLGKGIRQQFQEMVLQISVDQISGL-QGFSADSDDLD---------NILASIKDLDITPEK--IREFLPKVNWDKLASMYLRGR
         F  +   KW   E+  L K +     + +LQ  + ++  L Q  S  S +L+              I+D++  PE+  +   L   +W+K++++   G 
Subjt:  KFPRVDRKKWSVVERENLGKGIRQQFQEMVLQISVDQISGL-QGFSADSDDLD---------NILASIKDLDITPEK--IREFLPKVNWDKLASMYLRGR

Query:  SGAECEASMSTSISIAEYRWLNFEDPLINRDPWTTSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVAT
          AE          I ++ W N E P IN+  W+  E++ L       G   W +IA   GT+R+ FQCL ++Q+  N ++ ++EWT++ED  L   V  
Subjt:  SGAECEASMSTSISIAEYRWLNFEDPLINRDPWTTSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVAT

Query:  FGLGD---WQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDL
          +G    ++ +   +EGR   Q   RW KSLDP   K+G++ P+ED++L  AV  +G ++W K  E +PGR+  QCR+R+   L  SL++  W  +E+ 
Subjt:  FGLGD---WQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDL

Query:  RLEIAIQEHGYS-WTKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARK
        +L   I+++G   W K+A+ +P R+ + C  +WK +   +  L +  R+
Subjt:  RLEIAIQEHGYS-WTKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARK

Q8BP86 snRNA-activating protein complex subunit 42.7e-3531.97Show/hide
Query:  IKDLDITPEK--IREFLPKVNWDKLASMYLRGRSGAECEASMSTSISIAEYRWLNFEDPLINRDPWTTSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPF
        I+D++  PE+  +   L   +W+K++++   G   AE          I ++ W + E P I++  W+T E + L       G   W  +A   GT+R+ F
Subjt:  IKDLDITPEK--IREFLPKVNWDKLASMYLRGRSGAECEASMSTSISIAEYRWLNFEDPLINRDPWTTSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPF

Query:  QCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGD---WQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAE
        QCL ++Q+  N ++ ++EWT++ED  L   V    +G+   ++ +   +EGR   Q   RW KSLDP+  KRG + P+ED++L  AV  +G ++W K  E
Subjt:  QCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGD---WQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAE

Query:  FLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYS-WTKVAACVPSRTDNDCRRRWKKL
         +PGR+  QCR+R+   L  SL++  W  +E+ +L   I+++G   W ++A+ +P R+ + C  +WK L
Subjt:  FLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYS-WTKVAACVPSRTDNDCRRRWKKL

Arabidopsis top hitse value%identityAlignment
AT3G09370.1 myb domain protein 3r-31.4e-2841.5Show/hide
Query:  KREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCL
        K  WT +ED+ LR AV TF    W+ +A +   RT  QC +RW+K L+P   K G +T +ED ++   V  +GP  W+  A+ LPGR   QCRERW N L
Subjt:  KREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCL

Query:  DPSLRRCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNDCRRRW
        +P + +  WT EE++ L  A + HG  W ++A  +P RTDN  +  W
Subjt:  DPSLRRCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNDCRRRW

AT3G09370.2 myb domain protein 3r-31.4e-2841.5Show/hide
Query:  KREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCL
        K  WT +ED+ LR AV TF    W+ +A +   RT  QC +RW+K L+P   K G +T +ED ++   V  +GP  W+  A+ LPGR   QCRERW N L
Subjt:  KREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCL

Query:  DPSLRRCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNDCRRRW
        +P + +  WT EE++ L  A + HG  W ++A  +P RTDN  +  W
Subjt:  DPSLRRCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNDCRRRW

AT3G18100.1 myb domain protein 4r15.5e-16146.03Show/hide
Query:  GDVELPVSKEDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTLPPMSL--DEEEDEFEMLR
        G  E+P   E+   ++D E LR     +  + +     R S P  G  +  SDS+  DDFE++R+I+++ S   D     +LPP+ L  DEE+D FE LR
Subjt:  GDVELPVSKEDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTLPPMSL--DEEEDEFEMLR

Query:  SIQRRFAAYES-DTLSNKPNQLCDHVGSLKMDFDDTAVESQTASKRPSMLAF-----------------------EKGSLPKAALAFIDAIKKNRSQQKF
        +I+RRF+AY++ D+     N    H    ++   D    S+  S+  +  +F                          S P+AA AF+DAI++NR+ QKF
Subjt:  SIQRRFAAYES-DTLSNKPNQLCDHVGSLKMDFDDTAVESQTASKRPSMLAF-----------------------EKGSLPKAALAFIDAIKKNRSQQKF

Query:  IRSKMIHLEARIEENKKLRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSK----------KDKRLSAMHYGPAENSHVACYRMALMK
        +R K+  +EA IE+N+K +K  +I+KDFQ SCKR T  AL Q  DPRV+LIS  K    DSS+           DK++S +  GPAEN  V  YRMAL K
Subjt:  IRSKMIHLEARIEENKKLRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSK----------KDKRLSAMHYGPAENSHVACYRMALMK

Query:  FP-RVDRKKWSVVERENLGKGIRQQFQEMVLQISVDQISGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEASMSTS
        +P  V R+KWS  E +NL KG++Q+ Q+++L  ++++ S L+G +    D+D I  SI +L+ITPE IR+FLPK+NWD   S+ ++ RS AECEA     
Subjt:  FP-RVDRKKWSVVERENLGKGIRQQFQEMVLQISVDQISGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEASMSTS

Query:  ISIAEYRWLNFEDPLINRDPWTTSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGDWQAVAST
              RW++ EDPLIN  PWT +EDKNLL TI+Q  L +W++IAVS GTNRTPFQCL+RYQRSLN SILK+EWT +EDD+LR AV  FG  DWQ+VA+ 
Subjt:  ISIAEYRWLNFEDPLINRDPWTTSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGDWQAVAST

Query:  LEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWTK
        L+GRTG QCSNRWKKSL P  T++G ++ +ED R+K+AV LFG +NW+K ++F+PGR Q QCRERW NCLDP + R +WTEEED +L  AI EHGYSW+K
Subjt:  LEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWTK

Query:  VAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVSRNEKSATGDAP
        VA  +  RTDN C RRWK+L+P++V LLQEAR++QK A + NFVDRESERPAL  +     P+      S++P P    +V  +K   ++ +KS     P
Subjt:  VAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVSRNEKSATGDAP

Query:  KKRKSNCQRNQADVTAQ
        K+R+   +    DV  Q
Subjt:  KKRKSNCQRNQADVTAQ

AT3G18100.2 myb domain protein 4r18.8e-15152.32Show/hide
Query:  SLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSK----------KDKRL
        S P+AA AF+DAI++NR+ QKF+R K+  +EA IE+N+K +K  +I+KDFQ SCKR T  AL Q  DPRV+LIS  K    DSS+           DK++
Subjt:  SLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSK----------KDKRL

Query:  SAMHYGPAENSHVACYRMALMKFP-RVDRKKWSVVERENLGKGIRQQFQEMVLQISVDQISGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWD
        S +  GPAEN  V  YRMAL K+P  V R+KWS  E +NL KG++Q+ Q+++L  ++++ S L+G +    D+D I  SI +L+ITPE IR+FLPK+NWD
Subjt:  SAMHYGPAENSHVACYRMALMKFP-RVDRKKWSVVERENLGKGIRQQFQEMVLQISVDQISGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWD

Query:  KLASMYLRGRSGAECEASMSTSISIAEYRWLNFEDPLINRDPWTTSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDE
           S+ ++ RS AECEA           RW++ EDPLIN  PWT +EDKNLL TI+Q  L +W++IAVS GTNRTPFQCL+RYQRSLN SILK+EWT +E
Subjt:  KLASMYLRGRSGAECEASMSTSISIAEYRWLNFEDPLINRDPWTTSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDE

Query:  DDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCE
        DD+LR AV  FG  DWQ+VA+ L+GRTG QCSNRWKKSL P  T++G ++ +ED R+K+AV LFG +NW+K ++F+PGR Q QCRERW NCLDP + R +
Subjt:  DDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCE

Query:  WTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPK
        WTEEED +L  AI EHGYSW+KVA  +  RTDN C RRWK+L+P++V LLQEAR++QK A + NFVDRESERPAL  +     P+      S++P P   
Subjt:  WTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPK

Query:  RNVKTRKMPVSRNEKSATGDAPKKRKSNCQRNQADVTAQ
         +V  +K   ++ +KS     PK+R+   +    DV  Q
Subjt:  RNVKTRKMPVSRNEKSATGDAPKKRKSNCQRNQADVTAQ

AT3G18100.3 myb domain protein 4r19.7e-14243.05Show/hide
Query:  SSMSHRNHDDEGDVELPVSKEDDVVDEDMEALRRAYKHVGVNPEDYINP--RLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTLPPMSL
        +S+   + DD+ D       E+D + ED+E LRRA     VN + + +    +     G     SDS++ DDFE+LR I+++ +   D    S+ PPM L
Subjt:  SSMSHRNHDDEGDVELPVSKEDDVVDEDMEALRRAYKHVGVNPEDYINP--RLSSPVAGDANPSSDSDDVDDFELLRNIQNRFSCVADEQPLSTLPPMSL

Query:  -----DEEEDEFEMLRSIQRRFAAYESDTLSNKPNQLCDHVGSLKMDFDDTAVESQTASKR-----PSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRS
              E ED+FEM+RSI+ + +   S  +S  P  L D       D +D A E+  A +R      +   F   S  K      + +   ++Q+ +   
Subjt:  -----DEEEDEFEMLRSIQRRFAAYESDTLSNKPNQLCDHVGSLKMDFDDTAVESQTASKR-----PSMLAFEKGSLPKAALAFIDAIKKNRSQQKFIRS

Query:  KMIHLEARIEENKKLRKRF-----------KILKDFQGSCKRRT--TCALSQMIDPRVQLISAAKPQAKDSSK----------KDKRLSAMHYGPAENSH
        KM+  + R++ + KL +R              L++     K+R+  T    +M DPRV+LIS  K    DSS+           DK++S +  GPAEN  
Subjt:  KMIHLEARIEENKKLRKRF-----------KILKDFQGSCKRRT--TCALSQMIDPRVQLISAAKPQAKDSSK----------KDKRLSAMHYGPAENSH

Query:  VACYRMALMKFP-RVDRKKWSVVERENLGKGIRQQFQEMVLQISVDQISGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSG
        V  YRMAL K+P  V R+KWS  E +NL KG++Q+ Q+++L  ++++ S L+G +    D+D I  SI +L+ITPE IR+FLPK+NWD   S+ ++ RS 
Subjt:  VACYRMALMKFP-RVDRKKWSVVERENLGKGIRQQFQEMVLQISVDQISGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSG

Query:  AECEASMSTSISIAEYRWLNFEDPLINRDPWTTSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFG
        AECEA           RW++ EDPLIN  PWT +EDKNLL TI+Q  L +W++IAVS GTNRTPFQCL+RYQRSLN SILK+EWT +EDD+LR AV  FG
Subjt:  AECEASMSTSISIAEYRWLNFEDPLINRDPWTTSEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFG

Query:  LGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIA
          DWQ+VA+ L+GRTG QCSNRWKKSL P  T++G ++ +ED R+K+AV LFG +NW+K ++F+PGR Q QCRERW NCLDP + R +WTEEED +L  A
Subjt:  LGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIA

Query:  IQEHGYSWTKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVSR
        I EHGYSW+KVA  +  RTDN C RRWK+L+P++V LLQEAR++QK A + NFVDRESERPAL  +     P+      S++P P    +V  +K   ++
Subjt:  IQEHGYSWTKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVSR

Query:  NEKSATGDAPKKRKSNCQRNQADVTAQ
         +KS     PK+R+   +    DV  Q
Subjt:  NEKSATGDAPKKRKSNCQRNQADVTAQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCTCAAATTTTCCAAGTAGCAAGCGAGTTCGGGATGCTGTGCCTACGCAAGAACCTCGTCTGGGCTGTCTTTGGATAAAGAAATGGTTGGTCAGTGAACCTTACAG
AGGAGTCAAGAGCCGCTCGTCCAAACTGGCAACGAGGCCCCTACTTCTGAGTTCAAGGCTCGGGCCCCATGTGGTCCCCGCTAAGCTTGTCACTGACACTGTCGATGGCA
CCAATCCCCCTACGGAAGCTAGTTTTAAGTTCAAGAGGGCCAAGACTGAGGTTTTGCCCAAGTTAGTTTCTACCTTAGAAGCGAGTAAGTTGGTTAGTAAAGGCGCATGG
GGCCTTTTGACTAGTGTAGTGGATACCAGGGAAGTTAAGGTCACCTTCTCATCCAAACGAATTGTTACAGATTACCCATATGTTTTTCCAGAAGACCTCCTAGGGTTGAC
ACCGTATCAGGAGATAGACTTCACCATCGAGTTGGAGCCAAGAACAACTCCTATCTCTAAACCCCCATATAGGATAGCCTTAGCAGAACTGAAGGAGCTCGAGATTCAAT
TGCAGGAGTTGCTCGATAAGGGTTTCATACATCCAAGGAGCTATAGTTTTCTCCAAGATTCATCTTCGTTCAAGTCTTCATCGAGATATCTTGGTTCATTCCAAGATGGA
GGTAGAAAACGAGGAGCATTTGCGAAAGAAGTCTTTCTTAGTGCAATACTGTCAACTGCCCTTAAAGCGTGCGAGCACCTAGCTACTAAAGTTGCGAGATTGAAAGGCAT
TCATGTAAGCTGTCACTCTGTTCCCTCTCATAGATGGATTTCTCAGAGTAGATCCTGTAGGGATTTACTTCTGTTTGGAATAAAACATGTGAGGACAGTTTTCAGGACCT
TAAACAAAAGCTCGTTACTGCACCGATTCTTACTTTACCAGATGGTTCAGGAAGTTTCATTTTACATTGATGCCTTTAAAAAAAGGGATGAGATGTGTGCTGATGCAACA
CGTGACTTTAAGCGTGCAGGAATTGCAATTGCGGTGGGAGAAGTTGCATCACACAGTGAGAGTAAGCTCAATAGCGCTGCTCAACATTCCTCCCATTTCAGGGATAACAT
CGGGGTTAATCTACAAAATTTCATCTATTGTTCATCTTCTTCTCAACCTTCATTCCTTCTCAAATTCACCACAAATTTGGACCCACCATCCAATCTAAGCCTAGAGAATA
GTAGAGAAGACTCTTGTGGTGGTCTACAATCAAGTGGTGCTGAAATTCATCAAGAATTGAAGGAGAATTTGAAGCTTCAAGGGTATTCTCCATCGAATGTAGTGCCGAAA
TTAGACAACAACCTTAACAAACAAGTGTTGAATGATGATTTTGATAACATTATTGAGATGGTTAAAGCAGCACAACAACATAGTAACCGAGACTGGTCTGGTGCTCATGC
AATCTGTGCAAGGTCTGGTATTTGGAGGATAGCGAGTTGTAGAGATCAAGCCATTAGAAGCGAAGACAGAAGCTGTGTGAGACAGGTCTCACTAGGAAAGAACAACGACA
CTGGGGCGTGGAGTCAGACGAATCTCGCTGGTGTCGCTAGTCGCTTTTTGTCTCACAAATTTCAGCTAGGTCTCGCTGGTCTTGCTTGTCTTGCTGGTCTCGCTAGTCAG
GTAGCAAGCGAATCTAGGGTGCTTAGGCTACACAGCAACTCGTCTGGACCTACCACCTGTGTGTCTGCTGTCCTCACACTTTCTCCTAGGTTAAGAGGGTACCTGGGAGG
GGGTATGACACCTCCGGTCGACGTTCTTTACATCGTTCGACGCTCCATTCGACTTTCTCTACCGCTGCTCGCTGAAATTTTGGGTTTTGGAAAATATTGTAGTTTTGGGG
GAGAGAGAAGATCGAGATTTGAAGATTCGACCAAGGAAGAGGTCGAGATTTCAACCTTCGACCAAGGAGTCGGCTTTCGCAGAGAGATTCCCATCCCGTCGCGTCGTCTC
GAGCCTTCACCTGCCGTTGCTACTACCACAGAGATTCCCTGTCGCTTAAATGTGTTGTCCCGGCGACCAAAAGCACCTGTGACTAAGAATACTTCATTTTCTCCAAATTT
GCGAGGGGTGCCTCGTTTTGCATGCAGCCTCCACCCTCTCTCTCCTGCTGCCACACACACACACTCTCTCTCCCTCGCCGATCTCTCTCCTGTTTGGCTCTCTCTCTCTC
GGTCTCTCTCCTTCGCCGGCACACTCTTTCGCTCGGCGTCTCGCTCGGCGTTTTCATCTCGTGACATCACTTTCTCTCTCGGGTCTCGCCGTCTCGCTTTCGCTAGACCC
TCTCTCTCTCCCTCTCGGCACTACTCTCCCTCACGCCGCCGTTGTGCCGCTGCCGCCGGATTTCTCTCTCCGCCGCCACCGCTGCCTTTTTGCAATGATTCAGTCAGGGT
ACGTTGTGCTCATTCATGTTCATCCATGTCTCACCGAAACCATGACGATGAAGGTGACGTTGAGCTTCCTGTCAGCAAGGAAGATGATGTGGTTGATGAGGACATGGAAG
CCCTTCGGAGAGCCTATAAGCATGTTGGAGTTAATCCTGAGGATTACATTAATCCTAGGTTGTCATCACCTGTTGCCGGAGATGCTAATCCTAGTTCTGATTCTGATGAT
GTTGATGATTTTGAACTTCTTCGAAATATTCAGAATCGGTTCTCATGTGTGGCTGATGAGCAGCCGTTGAGTACTCTCCCACCAATGTCCCTAGACGAGGAGGAAGATGA
ATTTGAGATGCTTCGTTCAATTCAGCGGCGCTTTGCAGCGTACGAAAGTGATACTTTGAGCAATAAACCCAATCAGTTGTGCGACCATGTTGGGTCTTTGAAGATGGATT
TTGATGACACAGCTGTTGAGAGTCAGACAGCCTCAAAAAGGCCATCCATGCTAGCCTTTGAAAAGGGAAGCTTGCCAAAAGCTGCATTGGCATTTATTGATGCCATCAAG
AAAAATAGGTCCCAGCAGAAGTTTATTCGTAGTAAGATGATTCATCTTGAAGCTAGAATTGAGGAGAACAAAAAGCTCAGAAAACGTTTCAAAATTCTCAAAGATTTCCA
GGGTTCATGTAAACGGAGAACAACCTGTGCACTGTCTCAAATGATAGATCCTCGAGTCCAGTTGATTTCAGCTGCAAAACCACAAGCAAAGGATTCATCAAAGAAGGACA
AACGATTATCTGCAATGCATTATGGCCCAGCTGAGAATTCTCATGTTGCATGCTACAGAATGGCATTAATGAAGTTTCCTCGTGTAGATCGAAAAAAATGGTCCGTTGTA
GAAAGGGAGAATCTAGGGAAGGGAATAAGACAACAATTTCAGGAGATGGTGCTTCAGATTTCAGTGGATCAAATTAGTGGGCTACAAGGATTCTCTGCAGATTCAGATGA
TTTGGATAACATTCTTGCATCAATAAAAGACCTTGACATCACTCCTGAAAAGATTAGGGAATTTCTGCCAAAAGTTAATTGGGACAAATTGGCTTCCATGTATCTTCGGG
GTCGCTCAGGGGCAGAATGTGAAGCGAGCATGTCAACTTCAATCAGCATCGCAGAGTACAGGTGGTTGAATTTCGAAGACCCCCTAATTAATCGGGATCCATGGACTACA
AGTGAGGATAAGAATCTTTTGTTTACTATCCAACAGAAGGGGTTGAATAACTGGATTGAGATAGCAGTTTCGTCGGGTACAAATAGAACTCCTTTTCAGTGCTTGTCTCG
GTATCAAAGGAGTTTAAACGCTTCCATATTAAAGAGGGAGTGGACCAAAGACGAGGATGATAAACTTCGAGCTGCTGTTGCTACATTTGGCCTGGGAGATTGGCAGGCAG
TAGCTTCTACTTTGGAAGGACGAACTGGCCCACAGTGCTCTAATAGGTGGAAAAAATCCCTTGACCCAGCTAGGACAAAAAGAGGTCATTTCACTCCAGATGAAGACAGT
CGCTTGAAAATTGCTGTACTGCTCTTTGGGCCTAAAAATTGGAACAAGAAAGCAGAATTTTTACCTGGTCGAAATCAAGTTCAATGCAGAGAAAGATGGTTTAATTGTTT
AGATCCTTCCTTGAGAAGATGCGAATGGACAGAAGAGGAGGATTTAAGGCTGGAGATAGCAATTCAGGAACATGGATATAGCTGGACTAAGGTAGCTGCATGTGTGCCGT
CACGTACAGATAATGATTGTCGGAGGAGATGGAAGAAGTTATTTCCCAATGAAGTTCCGTTACTCCAGGAAGCTAGAAAGATTCAGAAGGCTGCTCTTATTAGCAACTTT
GTTGATAGGGAATCAGAGCGTCCTGCTCTTGGTCCTACTGACTTTCGACCCAGGCCAAACACAGATTTATTGTGTAATAGTGTTGATCCAAGACCTGCCCCAAAAAGAAA
TGTGAAGACGAGAAAGATGCCAGTGTCAAGGAATGAAAAGAGTGCTACTGGTGATGCTCCAAAGAAGAGGAAATCAAATTGCCAGAGGAATCAGGCTGATGTAACTGCTC
AGGCGGGTATTGCAAATAATACCTCTTCTGTCCCAGAGGAGGTTAAATCTCTAAAGCCTCAAAGAAAACAAAATAGACATGGAGCTTATACCGCAGAAGGGGTTCCGGAG
CTATGTCCTAACAGTGAGTGGTGTGCTAAACAGAGTTTGGACACTCAGAGCCTTGGGGTGCAGCTGTCGCTGTTGGAATCTGAGGGGACCAACAGCGACTGCATCGAGAC
TGTTGATGAGAATGGTATGGAGGTATTTGAGAACAAAGTTGCAGAGAAACTTTCTGAAAGAGATGTATGTTTTCCAGAACCAGAAGCAATTCAAAACTCTACCGGATCTT
CTGGAGTCTCGGTATTGTCAGAAATGACTAATGACATGGATGAATACAATCCGTCTATCCTTCCAGATACAACATTGTTGGCTAGTACTACTGTGGATGATCTCAAAGAA
TTGAAGAGGAAGAGTGTTGCAGACAGAGATCTGGATGACTGTAACAGTTTCTCGTTACCGTGCAGTGACTTGGAACTCAGGACAATCGACAAGGAAGGTGTGGACAGTTA
TTCCATGGATGAATTTACAGATAAAAGCCATGGGGTTTGCAAGCCCCCCCATACCAGAAGGAAGAAAAACAGCAAAACATCAAGTAAGAGCCAGGATCATTCCTTTGTTT
CTTGTCAACAAGTGGAGTTGGAGAGGTTGGGGACAAATGAGCCTCGTCATCATAATCAATCAAAGAAGAGAAAACATAACAGTACAAATACGAATCTGGAGGCAGTTGAG
GAGGTTGATGACTGCACACTCGTGGGCTTTTTGCAAAAAAGATTGAAGAGGACGACGACAACAAATGACAAGAAAGTTGATTGCAGTTCAAGTACTCCCATAGAAGTTGA
TACTGATGACAACGATCCTACCCTTGCCTCGTTTCTTAATAAATTAAAGAGAAAAAAGCATCAGCCGAGTAGTGGTGGTGAGTTAAACTAG
mRNA sequenceShow/hide mRNA sequence
ATGTTCTCAAATTTTCCAAGTAGCAAGCGAGTTCGGGATGCTGTGCCTACGCAAGAACCTCGTCTGGGCTGTCTTTGGATAAAGAAATGGTTGGTCAGTGAACCTTACAG
AGGAGTCAAGAGCCGCTCGTCCAAACTGGCAACGAGGCCCCTACTTCTGAGTTCAAGGCTCGGGCCCCATGTGGTCCCCGCTAAGCTTGTCACTGACACTGTCGATGGCA
CCAATCCCCCTACGGAAGCTAGTTTTAAGTTCAAGAGGGCCAAGACTGAGGTTTTGCCCAAGTTAGTTTCTACCTTAGAAGCGAGTAAGTTGGTTAGTAAAGGCGCATGG
GGCCTTTTGACTAGTGTAGTGGATACCAGGGAAGTTAAGGTCACCTTCTCATCCAAACGAATTGTTACAGATTACCCATATGTTTTTCCAGAAGACCTCCTAGGGTTGAC
ACCGTATCAGGAGATAGACTTCACCATCGAGTTGGAGCCAAGAACAACTCCTATCTCTAAACCCCCATATAGGATAGCCTTAGCAGAACTGAAGGAGCTCGAGATTCAAT
TGCAGGAGTTGCTCGATAAGGGTTTCATACATCCAAGGAGCTATAGTTTTCTCCAAGATTCATCTTCGTTCAAGTCTTCATCGAGATATCTTGGTTCATTCCAAGATGGA
GGTAGAAAACGAGGAGCATTTGCGAAAGAAGTCTTTCTTAGTGCAATACTGTCAACTGCCCTTAAAGCGTGCGAGCACCTAGCTACTAAAGTTGCGAGATTGAAAGGCAT
TCATGTAAGCTGTCACTCTGTTCCCTCTCATAGATGGATTTCTCAGAGTAGATCCTGTAGGGATTTACTTCTGTTTGGAATAAAACATGTGAGGACAGTTTTCAGGACCT
TAAACAAAAGCTCGTTACTGCACCGATTCTTACTTTACCAGATGGTTCAGGAAGTTTCATTTTACATTGATGCCTTTAAAAAAAGGGATGAGATGTGTGCTGATGCAACA
CGTGACTTTAAGCGTGCAGGAATTGCAATTGCGGTGGGAGAAGTTGCATCACACAGTGAGAGTAAGCTCAATAGCGCTGCTCAACATTCCTCCCATTTCAGGGATAACAT
CGGGGTTAATCTACAAAATTTCATCTATTGTTCATCTTCTTCTCAACCTTCATTCCTTCTCAAATTCACCACAAATTTGGACCCACCATCCAATCTAAGCCTAGAGAATA
GTAGAGAAGACTCTTGTGGTGGTCTACAATCAAGTGGTGCTGAAATTCATCAAGAATTGAAGGAGAATTTGAAGCTTCAAGGGTATTCTCCATCGAATGTAGTGCCGAAA
TTAGACAACAACCTTAACAAACAAGTGTTGAATGATGATTTTGATAACATTATTGAGATGGTTAAAGCAGCACAACAACATAGTAACCGAGACTGGTCTGGTGCTCATGC
AATCTGTGCAAGGTCTGGTATTTGGAGGATAGCGAGTTGTAGAGATCAAGCCATTAGAAGCGAAGACAGAAGCTGTGTGAGACAGGTCTCACTAGGAAAGAACAACGACA
CTGGGGCGTGGAGTCAGACGAATCTCGCTGGTGTCGCTAGTCGCTTTTTGTCTCACAAATTTCAGCTAGGTCTCGCTGGTCTTGCTTGTCTTGCTGGTCTCGCTAGTCAG
GTAGCAAGCGAATCTAGGGTGCTTAGGCTACACAGCAACTCGTCTGGACCTACCACCTGTGTGTCTGCTGTCCTCACACTTTCTCCTAGGTTAAGAGGGTACCTGGGAGG
GGGTATGACACCTCCGGTCGACGTTCTTTACATCGTTCGACGCTCCATTCGACTTTCTCTACCGCTGCTCGCTGAAATTTTGGGTTTTGGAAAATATTGTAGTTTTGGGG
GAGAGAGAAGATCGAGATTTGAAGATTCGACCAAGGAAGAGGTCGAGATTTCAACCTTCGACCAAGGAGTCGGCTTTCGCAGAGAGATTCCCATCCCGTCGCGTCGTCTC
GAGCCTTCACCTGCCGTTGCTACTACCACAGAGATTCCCTGTCGCTTAAATGTGTTGTCCCGGCGACCAAAAGCACCTGTGACTAAGAATACTTCATTTTCTCCAAATTT
GCGAGGGGTGCCTCGTTTTGCATGCAGCCTCCACCCTCTCTCTCCTGCTGCCACACACACACACTCTCTCTCCCTCGCCGATCTCTCTCCTGTTTGGCTCTCTCTCTCTC
GGTCTCTCTCCTTCGCCGGCACACTCTTTCGCTCGGCGTCTCGCTCGGCGTTTTCATCTCGTGACATCACTTTCTCTCTCGGGTCTCGCCGTCTCGCTTTCGCTAGACCC
TCTCTCTCTCCCTCTCGGCACTACTCTCCCTCACGCCGCCGTTGTGCCGCTGCCGCCGGATTTCTCTCTCCGCCGCCACCGCTGCCTTTTTGCAATGATTCAGTCAGGGT
ACGTTGTGCTCATTCATGTTCATCCATGTCTCACCGAAACCATGACGATGAAGGTGACGTTGAGCTTCCTGTCAGCAAGGAAGATGATGTGGTTGATGAGGACATGGAAG
CCCTTCGGAGAGCCTATAAGCATGTTGGAGTTAATCCTGAGGATTACATTAATCCTAGGTTGTCATCACCTGTTGCCGGAGATGCTAATCCTAGTTCTGATTCTGATGAT
GTTGATGATTTTGAACTTCTTCGAAATATTCAGAATCGGTTCTCATGTGTGGCTGATGAGCAGCCGTTGAGTACTCTCCCACCAATGTCCCTAGACGAGGAGGAAGATGA
ATTTGAGATGCTTCGTTCAATTCAGCGGCGCTTTGCAGCGTACGAAAGTGATACTTTGAGCAATAAACCCAATCAGTTGTGCGACCATGTTGGGTCTTTGAAGATGGATT
TTGATGACACAGCTGTTGAGAGTCAGACAGCCTCAAAAAGGCCATCCATGCTAGCCTTTGAAAAGGGAAGCTTGCCAAAAGCTGCATTGGCATTTATTGATGCCATCAAG
AAAAATAGGTCCCAGCAGAAGTTTATTCGTAGTAAGATGATTCATCTTGAAGCTAGAATTGAGGAGAACAAAAAGCTCAGAAAACGTTTCAAAATTCTCAAAGATTTCCA
GGGTTCATGTAAACGGAGAACAACCTGTGCACTGTCTCAAATGATAGATCCTCGAGTCCAGTTGATTTCAGCTGCAAAACCACAAGCAAAGGATTCATCAAAGAAGGACA
AACGATTATCTGCAATGCATTATGGCCCAGCTGAGAATTCTCATGTTGCATGCTACAGAATGGCATTAATGAAGTTTCCTCGTGTAGATCGAAAAAAATGGTCCGTTGTA
GAAAGGGAGAATCTAGGGAAGGGAATAAGACAACAATTTCAGGAGATGGTGCTTCAGATTTCAGTGGATCAAATTAGTGGGCTACAAGGATTCTCTGCAGATTCAGATGA
TTTGGATAACATTCTTGCATCAATAAAAGACCTTGACATCACTCCTGAAAAGATTAGGGAATTTCTGCCAAAAGTTAATTGGGACAAATTGGCTTCCATGTATCTTCGGG
GTCGCTCAGGGGCAGAATGTGAAGCGAGCATGTCAACTTCAATCAGCATCGCAGAGTACAGGTGGTTGAATTTCGAAGACCCCCTAATTAATCGGGATCCATGGACTACA
AGTGAGGATAAGAATCTTTTGTTTACTATCCAACAGAAGGGGTTGAATAACTGGATTGAGATAGCAGTTTCGTCGGGTACAAATAGAACTCCTTTTCAGTGCTTGTCTCG
GTATCAAAGGAGTTTAAACGCTTCCATATTAAAGAGGGAGTGGACCAAAGACGAGGATGATAAACTTCGAGCTGCTGTTGCTACATTTGGCCTGGGAGATTGGCAGGCAG
TAGCTTCTACTTTGGAAGGACGAACTGGCCCACAGTGCTCTAATAGGTGGAAAAAATCCCTTGACCCAGCTAGGACAAAAAGAGGTCATTTCACTCCAGATGAAGACAGT
CGCTTGAAAATTGCTGTACTGCTCTTTGGGCCTAAAAATTGGAACAAGAAAGCAGAATTTTTACCTGGTCGAAATCAAGTTCAATGCAGAGAAAGATGGTTTAATTGTTT
AGATCCTTCCTTGAGAAGATGCGAATGGACAGAAGAGGAGGATTTAAGGCTGGAGATAGCAATTCAGGAACATGGATATAGCTGGACTAAGGTAGCTGCATGTGTGCCGT
CACGTACAGATAATGATTGTCGGAGGAGATGGAAGAAGTTATTTCCCAATGAAGTTCCGTTACTCCAGGAAGCTAGAAAGATTCAGAAGGCTGCTCTTATTAGCAACTTT
GTTGATAGGGAATCAGAGCGTCCTGCTCTTGGTCCTACTGACTTTCGACCCAGGCCAAACACAGATTTATTGTGTAATAGTGTTGATCCAAGACCTGCCCCAAAAAGAAA
TGTGAAGACGAGAAAGATGCCAGTGTCAAGGAATGAAAAGAGTGCTACTGGTGATGCTCCAAAGAAGAGGAAATCAAATTGCCAGAGGAATCAGGCTGATGTAACTGCTC
AGGCGGGTATTGCAAATAATACCTCTTCTGTCCCAGAGGAGGTTAAATCTCTAAAGCCTCAAAGAAAACAAAATAGACATGGAGCTTATACCGCAGAAGGGGTTCCGGAG
CTATGTCCTAACAGTGAGTGGTGTGCTAAACAGAGTTTGGACACTCAGAGCCTTGGGGTGCAGCTGTCGCTGTTGGAATCTGAGGGGACCAACAGCGACTGCATCGAGAC
TGTTGATGAGAATGGTATGGAGGTATTTGAGAACAAAGTTGCAGAGAAACTTTCTGAAAGAGATGTATGTTTTCCAGAACCAGAAGCAATTCAAAACTCTACCGGATCTT
CTGGAGTCTCGGTATTGTCAGAAATGACTAATGACATGGATGAATACAATCCGTCTATCCTTCCAGATACAACATTGTTGGCTAGTACTACTGTGGATGATCTCAAAGAA
TTGAAGAGGAAGAGTGTTGCAGACAGAGATCTGGATGACTGTAACAGTTTCTCGTTACCGTGCAGTGACTTGGAACTCAGGACAATCGACAAGGAAGGTGTGGACAGTTA
TTCCATGGATGAATTTACAGATAAAAGCCATGGGGTTTGCAAGCCCCCCCATACCAGAAGGAAGAAAAACAGCAAAACATCAAGTAAGAGCCAGGATCATTCCTTTGTTT
CTTGTCAACAAGTGGAGTTGGAGAGGTTGGGGACAAATGAGCCTCGTCATCATAATCAATCAAAGAAGAGAAAACATAACAGTACAAATACGAATCTGGAGGCAGTTGAG
GAGGTTGATGACTGCACACTCGTGGGCTTTTTGCAAAAAAGATTGAAGAGGACGACGACAACAAATGACAAGAAAGTTGATTGCAGTTCAAGTACTCCCATAGAAGTTGA
TACTGATGACAACGATCCTACCCTTGCCTCGTTTCTTAATAAATTAAAGAGAAAAAAGCATCAGCCGAGTAGTGGTGGTGAGTTAAACTAG
Protein sequenceShow/hide protein sequence
MFSNFPSSKRVRDAVPTQEPRLGCLWIKKWLVSEPYRGVKSRSSKLATRPLLLSSRLGPHVVPAKLVTDTVDGTNPPTEASFKFKRAKTEVLPKLVSTLEASKLVSKGAW
GLLTSVVDTREVKVTFSSKRIVTDYPYVFPEDLLGLTPYQEIDFTIELEPRTTPISKPPYRIALAELKELEIQLQELLDKGFIHPRSYSFLQDSSSFKSSSRYLGSFQDG
GRKRGAFAKEVFLSAILSTALKACEHLATKVARLKGIHVSCHSVPSHRWISQSRSCRDLLLFGIKHVRTVFRTLNKSSLLHRFLLYQMVQEVSFYIDAFKKRDEMCADAT
RDFKRAGIAIAVGEVASHSESKLNSAAQHSSHFRDNIGVNLQNFIYCSSSSQPSFLLKFTTNLDPPSNLSLENSREDSCGGLQSSGAEIHQELKENLKLQGYSPSNVVPK
LDNNLNKQVLNDDFDNIIEMVKAAQQHSNRDWSGAHAICARSGIWRIASCRDQAIRSEDRSCVRQVSLGKNNDTGAWSQTNLAGVASRFLSHKFQLGLAGLACLAGLASQ
VASESRVLRLHSNSSGPTTCVSAVLTLSPRLRGYLGGGMTPPVDVLYIVRRSIRLSLPLLAEILGFGKYCSFGGERRSRFEDSTKEEVEISTFDQGVGFRREIPIPSRRL
EPSPAVATTTEIPCRLNVLSRRPKAPVTKNTSFSPNLRGVPRFACSLHPLSPAATHTHSLSLADLSPVWLSLSRSLSFAGTLFRSASRSAFSSRDITFSLGSRRLAFARP
SLSPSRHYSPSRRRCAAAAGFLSPPPPLPFCNDSVRVRCAHSCSSMSHRNHDDEGDVELPVSKEDDVVDEDMEALRRAYKHVGVNPEDYINPRLSSPVAGDANPSSDSDD
VDDFELLRNIQNRFSCVADEQPLSTLPPMSLDEEEDEFEMLRSIQRRFAAYESDTLSNKPNQLCDHVGSLKMDFDDTAVESQTASKRPSMLAFEKGSLPKAALAFIDAIK
KNRSQQKFIRSKMIHLEARIEENKKLRKRFKILKDFQGSCKRRTTCALSQMIDPRVQLISAAKPQAKDSSKKDKRLSAMHYGPAENSHVACYRMALMKFPRVDRKKWSVV
ERENLGKGIRQQFQEMVLQISVDQISGLQGFSADSDDLDNILASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEASMSTSISIAEYRWLNFEDPLINRDPWTT
SEDKNLLFTIQQKGLNNWIEIAVSSGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRAAVATFGLGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGHFTPDEDS
RLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNDCRRRWKKLFPNEVPLLQEARKIQKAALISNF
VDRESERPALGPTDFRPRPNTDLLCNSVDPRPAPKRNVKTRKMPVSRNEKSATGDAPKKRKSNCQRNQADVTAQAGIANNTSSVPEEVKSLKPQRKQNRHGAYTAEGVPE
LCPNSEWCAKQSLDTQSLGVQLSLLESEGTNSDCIETVDENGMEVFENKVAEKLSERDVCFPEPEAIQNSTGSSGVSVLSEMTNDMDEYNPSILPDTTLLASTTVDDLKE
LKRKSVADRDLDDCNSFSLPCSDLELRTIDKEGVDSYSMDEFTDKSHGVCKPPHTRRKKNSKTSSKSQDHSFVSCQQVELERLGTNEPRHHNQSKKRKHNSTNTNLEAVE
EVDDCTLVGFLQKRLKRTTTTNDKKVDCSSSTPIEVDTDDNDPTLASFLNKLKRKKHQPSSGGELN