CuGenDBv2

Chy2G035170 (gene) of Cucumber (hystrix) v1 genome

Gene ID: Chy2G035170
Organism: Cucumis hystrix (Cucumber (hystrix) v1)
Description: WD repeat-containing protein VIP3
Genome location: chrH02:9113618..9120302
RNA-Seq Expression: Chy2G035170
Synteny: Chy2G035170
Gene Ontology terms:
  GO:0051568 - histone H3-K4 methylation (biological process)
  GO:0016021 - integral component of membrane (cellular component)
  GO:0016593 - Cdc73/Paf1 complex (cellular component)
  GO:0005515 - protein binding (molecular function)
InterPro domains:
  IPR001680 - WD40 repeat
  IPR015943 - WD40/YVTN repeat-like-containing domain superfamily
  IPR019775 - WD40 repeat, conserved site
  IPR020472 - G-protein beta WD-40 repeat
  IPR024977 - Anaphase-promoting complex subunit 4, WD40 domain
  IPR036322 - WD40-repeat-containing domain superfamily
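
The genome location listed above follows a `chromosome:start..end` notation. As a minimal sketch of how such a string could be split and its span computed, the Python below uses two hypothetical helpers, `parse_location` and `span_length`, which are not part of any CuGenDB tool. It assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return chrom, start, end


def span_length(start: int, end: int) -> int:
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    return end - start + 1


chrom, start, end = parse_location("chrH02:9113618..9120302")
print(chrom, start, end, span_length(start, end))
```

If the coordinates were instead 0-based and half-open, the `+ 1` in `span_length` would need to be dropped.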


Homology
GenBank top hits (e-value, % identity, alignment)
KAA0055802.1 ras-associated and pleckstrin-like proteiny domains-containing protein 1 [Cucumis melo var. makuwa] (e-value: 0.0; identity: 91.51%)
Query:  MEPNSDPPPFWPPSPPIRRRRSYSPPFISLPVLIILLPTLALILLFFAIRPLLSLINQVYKPSSVKKSWDSFNVFLVLFAIICGIFSRRNDDVPTTADGD
        MEPNSDPPPFW PSPPIRR RSYSPPFISLP+LIILLPTLALILLFFAIRPLLSL NQVYKP+SVKKSWDSFNVFLVL AIICGIF+RRNDDVPTTAD D
Subjt:  MEPNSDPPPFWPPSPPIRRRRSYSPPFISLPVLIILLPTLALILLFFAIRPLLSLINQVYKPSSVKKSWDSFNVFLVLFAIICGIFSRRNDDVPTTADGD

Query:  SRASDQMTVVDTGRVKVNGDSESSQQWFGFSERRFSDPTGRAPVTTRLRRNSSYPDLRQESLWGNGDDSNNQFRFFDDFEINKFRSRSFVYRTRGNEREE
        SR SDQ TVVDTG VKVNGDSE SQ+WFGFSERRFSDP GRAPVTTRLRRNSSYPDLRQESLW NG+D  NQFRFFDDFEINK+RSRSFVYRTRGNEREE
Subjt:  SRASDQMTVVDTGRVKVNGDSESSQQWFGFSERRFSDPTGRAPVTTRLRRNSSYPDLRQESLWGNGDDSNNQFRFFDDFEINKFRSRSFVYRTRGNEREE

Query:  SPA----IPIDSFVVSSSPAPEKIKSQSPNPPPPPPPPLPVTQRKPRRTYQNIQKKEEIPENKAEFTPPPPPPLPPRAVIPPSPVRVRLEEKFGKSVRKK
        SPA    IP+DSFV +SSPAPEKIKSQSPNPPPPPPPPLPVTQRKPRRTYQNIQKKEEIPENKAEFTPPPPPPLPPRAVIPPSPVRVRLEEKFGKSVRKK
Subjt:  SPA----IPIDSFVVSSSPAPEKIKSQSPNPPPPPPPPLPVTQRKPRRTYQNIQKKEEIPENKAEFTPPPPPPLPPRAVIPPSPVRVRLEEKFGKSVRKK

Query:  TNVKKEIAMALASLYRKRKRKQKTKDAYVGDRRSPTEQRPPPPPPPPPPSFFRIFKKSSKNKRVHSESPPPPPPPPPPVPSSSRSTKKKIQIPLPPSPPP
        TNVKKEIAMALASLYRKRKRKQKTKDAY  DRRSPTEQRPPPPPPPPPPSFFRIFKKSSKNKRVHSESPPPPPPPPPP P SSRSTK+KIQIPLPPSPPP
Subjt:  TNVKKEIAMALASLYRKRKRKQKTKDAYVGDRRSPTEQRPPPPPPPPPPSFFRIFKKSSKNKRVHSESPPPPPPPPPPVPSSSRSTKKKIQIPLPPSPPP

Query:  PPPAQQRNSTTYRRPPLPTSVRNSYIENPSINSREKSPTPTIPPPPPPPPSFKTTTDVKSTVGSDTVGSRSSETSRCGSPDPDNVNTSASGGAGVGSVFC
        PPP+QQRNSTT RRPPLPTSVRNSYIEN SINSR KSP  TIPPPPPPPPSFKTTTDVKSTVGSDTVGSRSSETSRCGSPDP+NVN+ AS GAGVG VFC
Subjt:  PPPAQQRNSTTYRRPPLPTSVRNSYIENPSINSREKSPTPTIPPPPPPPPSFKTTTDVKSTVGSDTVGSRSSETSRCGSPDPDNVNTSASGGAGVGSVFC

Query:  PSPDVNVKAANFIARLRGEWRLEKMNSALE
        PSPDVNVKAANFIARLR EWRLEKMNS  E
Subjt:  PSPDVNVKAANFIARLRGEWRLEKMNSALE

KAE8653439.1 hypothetical protein Csa_006892 [Cucumis sativus] (e-value: 0.0; identity: 79.63%)
Query:  MEPNSDPPPFWPPSPPIRRRRSYSPPFISLPVLIILLPTLALILLFFAIRPLLSLINQVYKPSSVKKSWDSFNVFLVLFAIICGIFSRRNDDVPTTADGD
        MEPNSDPPPFWPPSPPI RRRSYSPPFISLPVLIILLPTLALILLFFAIRPLLSLINQVYKPSSVKKSWDSFNVFLVLFAIICGIFSRRNDDVPTTADGD
Subjt:  MEPNSDPPPFWPPSPPIRRRRSYSPPFISLPVLIILLPTLALILLFFAIRPLLSLINQVYKPSSVKKSWDSFNVFLVLFAIICGIFSRRNDDVPTTADGD

Query:  SRASDQMTVVDTGRVKVNGDSESSQQWFGFSERRFSDPTGRAPVTTRLRRNSSYPDLRQESLWGNGDDSNNQFRFFDDFEINKFRSRSFVYRTRGNEREE
        +R SDQMTVVDTG VKVNGDSESSQQWFGFSERRFSDPTGRAPVTTRLRRNSSYPDLRQESLWGNGDDSNNQFRFFDDFEINKFRSRSFVYRTRGNEREE
Subjt:  SRASDQMTVVDTGRVKVNGDSESSQQWFGFSERRFSDPTGRAPVTTRLRRNSSYPDLRQESLWGNGDDSNNQFRFFDDFEINKFRSRSFVYRTRGNEREE

Query:  SPAIPIDSFVVSSSPAPEKIKSQSPNPPPPPPPPLPVTQRKPRRTYQNIQKKEEIPENKAEFTPPPPPPLPPRAVIPPSPVRVRLEEKFGKSVRKKTNVK
        SPAIP+DSFVV+SSPAPEK+KSQSPNPPPPPPPPLPVTQRKPRRTYQNIQKKEEIPENKAEFTPPPPPPLPPR VIPPSPVRVRLEEKFGKSVRKKTNVK
Subjt:  SPAIPIDSFVVSSSPAPEKIKSQSPNPPPPPPPPLPVTQRKPRRTYQNIQKKEEIPENKAEFTPPPPPPLPPRAVIPPSPVRVRLEEKFGKSVRKKTNVK

Query:  KEIAMALASLYRKRKRKQKTKDAYVGDRRSPTEQRPPPPPPPPPPSFFRIFKKSSKNKRVHSESPPPPPPPPPPVPSSSRSTKKKIQIPLPPSPPPPPPA
        KEIAMALASL                                                                                          
Subjt:  KEIAMALASLYRKRKRKQKTKDAYVGDRRSPTEQRPPPPPPPPPPSFFRIFKKSSKNKRVHSESPPPPPPPPPPVPSSSRSTKKKIQIPLPPSPPPPPPA

Query:  QQRNSTTYRRPPLPTSVRNSYIENPSINSREKSPTPTIPPPPPPPPSFKTTTDVKSTVGSDTVGSRSSETSRCGSPDPDNVNTSASGGAGVGSVFCPSPD
                                     REKS TPTIPPPPPPPPSFKTTTDVKSTVGSDTVGSRSSETSRCGSPDP+NVNTSASGGAGVGSVFCPSPD
Subjt:  QQRNSTTYRRPPLPTSVRNSYIENPSINSREKSPTPTIPPPPPPPPSFKTTTDVKSTVGSDTVGSRSSETSRCGSPDPDNVNTSASGGAGVGSVFCPSPD

Query:  VNVKAANFIARLRGEWRLEKMNSA---------------------------------------------------------LENRDGNDPTCSRAHCENT
        VNVKAANFIARLRGEWRLEKMNS                                                          LENRDGNDPTCSRAHCENT
Subjt:  VNVKAANFIARLRGEWRLEKMNSA---------------------------------------------------------LENRDGNDPTCSRAHCENT

Query:  DQMKLAGLKSVENAHEESVWAATWVPATDTRPSLLLTGSLDETVKLWKSDELDLERTNTGHCLGVVSVAAHPSGFIAASASLDSFVRVFEVDSNSTIATL
        DQMKLAGLKSVENAHEESVWAATWVPATDTRPSLLLTGSLDETVKLWKSDELDLERTNTGHCLGVVSVAAHPSGFIAASASLDSFVRVFEVDSNSTIATL
Subjt:  DQMKLAGLKSVENAHEESVWAATWVPATDTRPSLLLTGSLDETVKLWKSDELDLERTNTGHCLGVVSVAAHPSGFIAASASLDSFVRVFEVDSNSTIATL

Query:  EAPPSEVWQMRFNPEGTMLAVAGGGSASIKLWDTNTWKLAATLSIPRPEGPKPTDKTASKKFVLSVAWSIDGRRLACGSMDGTISVFDVARAKFLHHLEG
        EAPPSEVWQMRFNPEGTMLAVAGGGSASIKLWDTNTWKLAATLSIPRPEGPKPTDKTASKKFVLSVAWSIDGRRLACGSMDGTISVFDVARAKFLHHLEG
Subjt:  EAPPSEVWQMRFNPEGTMLAVAGGGSASIKLWDTNTWKLAATLSIPRPEGPKPTDKTASKKFVLSVAWSIDGRRLACGSMDGTISVFDVARAKFLHHLEG

Query:  HFMPVRSLVYSPVEPRLLFSASDDAHVHMYDAEGKTLIGAMSGHSSWVLSVDASPDGAAVATGSSDRTVRLWDLNMRTAVQTMTNHSDQVWGVAFRPPGG
        HFMPVRSLVYSPVEPRLLFSASDDAHVHMYDAEGKTLIGAMSGHSSWVLSVDASPDGAAVATGSSDRTVRLWDLNMRTAVQTMTNHSDQVWGVAFRPPGG
Subjt:  HFMPVRSLVYSPVEPRLLFSASDDAHVHMYDAEGKTLIGAMSGHSSWVLSVDASPDGAAVATGSSDRTVRLWDLNMRTAVQTMTNHSDQVWGVAFRPPGG

Query:  VGVRSVRLASVSDDKSISLYDYS
        VGVRS RLASVSDDKSISLYDYS
Subjt:  VGVRSVRLASVSDDKSISLYDYS

KAG6590017.1 WD repeat-containing protein VIP3, partial [Cucurbita argyrosperma subsp. sororia] (e-value: 0.0; identity: 75.03%)
Query:  MEPNSDPPPFWPPSPPIRRRRSYSPPFISLPVLIILLPTLALILLFFAIRPLLSLINQVYKPSSVKKSWDSFNVFLVLFAIICGIFSRRNDDVPTTAD-G
        MEPNS  P  W  +PP+RRRRS SPP ISLPVLIILLPTLALILLFFAIR LLSL NQV+ P+SVKKSWDSFNVFLVLFAIICGI+ RR DD  T  D G
Subjt:  MEPNSDPPPFWPPSPPIRRRRSYSPPFISLPVLIILLPTLALILLFFAIRPLLSLINQVYKPSSVKKSWDSFNVFLVLFAIICGIFSRRNDDVPTTAD-G

Query:  DSRASDQMTVVDTGRVKVNGDSESSQQWFGFSERRFSDPTGRAPVTTRLRRNSSYPDLRQESLWGNGDDSNNQFRFFDDFEINKFRSRSFVYRTRGNERE
        +S  S +  V       VNGDSE SQ+ F F+ERRF D   R P T  +       DLRQES   NG+D N +F FFDDFE NKFRSRSF +R RG E E
Subjt:  DSRASDQMTVVDTGRVKVNGDSESSQQWFGFSERRFSDPTGRAPVTTRLRRNSSYPDLRQESLWGNGDDSNNQFRFFDDFEINKFRSRSFVYRTRGNERE

Query:  ESPA----IPIDSFVVSSSPAPEKIKSQSPNPPPPPPPPLPVTQRKPRRTYQNIQKKEEIPENKAEF----TPPPPPPLPPRAVIPPSPVRVRLEEKFGK
        ESPA    IP+D+FV +SSPA  +++S +P PPPPPPPP PVT+RK R+T Q  ++  E+  N AE     +PPPPPPLPPR VIPPSP+RVRLEEKF K
Subjt:  ESPA----IPIDSFVVSSSPAPEKIKSQSPNPPPPPPPPLPVTQRKPRRTYQNIQKKEEIPENKAEF----TPPPPPPLPPRAVIPPSPVRVRLEEKFGK

Query:  SVRKKTNVKKEIAMALASLYRKRKRKQKTKDAYVGDRRSPTEQRPPPPPPPPPP---SFFRIFKKSSKNKRVHSESPPPPPPPPPPVPSSSRSTKKKIQI
        S RKKTNVKKEIA+ALASLYRKRK KQK +D Y GDR SPT+QRPPPPPPPPPP   SFFRIFKK +KNK+  SES P PP P   V SSSRSTKK+ QI
Subjt:  SVRKKTNVKKEIAMALASLYRKRKRKQKTKDAYVGDRRSPTEQRPPPPPPPPPP---SFFRIFKKSSKNKRVHSESPPPPPPPPPPVPSSSRSTKKKIQI

Query:  PLPPSPPPPPPAQQRNSTTYRRPPLPTSVRNSYIENPSINSREKSPTPTIPPPPPPPPSFKTTTDVKSTVGSDTVGSRSSETSRCGSPDPDNVNTSAS--
        PLPPSPPPPP ++Q+NST   RPPLP  V+NS +EN  INS  ++P+ TIPPP      +KTTTDVKS V  DTVGSRSSET  CGSP+PD+V +S++  
Subjt:  PLPPSPPPPPPAQQRNSTTYRRPPLPTSVRNSYIENPSINSREKSPTPTIPPPPPPPPSFKTTTDVKSTVGSDTVGSRSSETSRCGSPDPDNVNTSAS--

Query:  ---------GGAGVGSVFCPSPDVNVKAANFIARLRGEWRLEKMNSALENRDGNDPTCSRAHCENTDQMKLAGLKSVENAHEESVWAATWVPATDTRPSL
                 G  GVG VFCPSPDVN+KAANFIARLRGEWRLEKMNS   +     P   R H +  D MKLAGLKSVENAHEESVWAATW+PATD RPSL
Subjt:  ---------GGAGVGSVFCPSPDVNVKAANFIARLRGEWRLEKMNSALENRDGNDPTCSRAHCENTDQMKLAGLKSVENAHEESVWAATWVPATDTRPSL

Query:  LLTGSLDETVKLWKSDELDLERTNTGHCLGVVSVAAHPSGFIAASASLDSFVRVFEVDSNSTIATLEAPPSEVWQMRFNPEGTMLAVAGGGSASIKLWDT
        L+TGSLDETVKLWKSDELDL+RTNTGHCLGVVSVAAHPSG+IAASASLDSFVR+FEVDSNSTIATLEAPPSEVWQMRFNPEGTMLAVAGGGSASIK+WDT
Subjt:  LLTGSLDETVKLWKSDELDLERTNTGHCLGVVSVAAHPSGFIAASASLDSFVRVFEVDSNSTIATLEAPPSEVWQMRFNPEGTMLAVAGGGSASIKLWDT

Query:  NTWKLAATLSIPRPEGPKPTDKTASKKFVLSVAWSIDGRRLACGSMDGTISVFDVARAKFLHHLEGHFMPVRSLVYSPVEPRLLFSASDDAHVHMYDAEG
        +TWKLAATLSIPRPEGPKPTDKTASKKFVLSVAWS+DGRRLACGSMDGTISVFDVARAKFLHHLEGHFMPVRSLVYSPVEPRLLFSASDDAHVHMYD EG
Subjt:  NTWKLAATLSIPRPEGPKPTDKTASKKFVLSVAWSIDGRRLACGSMDGTISVFDVARAKFLHHLEGHFMPVRSLVYSPVEPRLLFSASDDAHVHMYDAEG

Query:  KTLIGAMSGHSSWVLSVDASPDGAAVATGSSDRTVRLWDLNMRTAVQTMTNHSDQVWGVAFRPPGGVGVRSVRLASVSDDKSISLYDYS
        KTLIGAMSGHSSWVLSVDASPDGAAVATGSSDRTVRLWDLNMRTAVQTMTNHSDQVWGVAFRPPG  GVRS RLASVSDDKSISLYDYS
Subjt:  KTLIGAMSGHSSWVLSVDASPDGAAVATGSSDRTVRLWDLNMRTAVQTMTNHSDQVWGVAFRPPGGVGVRSVRLASVSDDKSISLYDYS

XP_008451154.1 PREDICTED: LOW QUALITY PROTEIN: ras-associated and pleckstrin homology domains-containing protein 1 [Cucumis melo] (e-value: 4.36e-303; identity: 83.01%)
Query:  MEPNSDPPPFWPPSPPIRRRRSYSPPFISLPVLIILLPTLALILLFFAIRPLLSLINQVYKPSSVKKSWDSFNVFLVLFAIICGIFSRRNDDVPTTADGD
        MEPNSDPPPFW PSPPIRR RSYSPPFISLP+LIILLPTLALILLFFAIRPLLSL NQVYKP+SVKKSWDSFNVFLVL AIICGIF+RRNDDVPTTAD D
Subjt:  MEPNSDPPPFWPPSPPIRRRRSYSPPFISLPVLIILLPTLALILLFFAIRPLLSLINQVYKPSSVKKSWDSFNVFLVLFAIICGIFSRRNDDVPTTADGD

Query:  SRASDQMTVVDTGRVKVNGDSESSQQWFGFSERRFSDPTGRAPVTTRLRRNSSYPDLRQESLWGNGDDSNNQFRFFDDFEINKFRSRSFVYRTRGNEREE
        SR SDQ TVVDTG VKVNGDSE SQ+WFGFSERRFSDP GRAPVTTRLRRNSSYPDLRQESLW NG+D  NQFRFFDDFEINK+RSRSFVYRTRGNEREE
Subjt:  SRASDQMTVVDTGRVKVNGDSESSQQWFGFSERRFSDPTGRAPVTTRLRRNSSYPDLRQESLWGNGDDSNNQFRFFDDFEINKFRSRSFVYRTRGNEREE

Query:  SPA----IPIDSFVVSSSPAPEKIKSQSPNPPPPPPPPLPVTQRKPRRTYQNIQKKEEIPENKAEFTPPPPPPLPPRAVIPPSPVRVRLEEKFGKSVRKK
        SPA    IP+DSFV +SSPAPEKIKSQSPNPPPPPPPPLPVTQRKPRRTYQNIQKKEEIPENKAEFTPPPPPPLPPRAVIPPSPVRVRLEEKFGKSVRKK
Subjt:  SPA----IPIDSFVVSSSPAPEKIKSQSPNPPPPPPPPLPVTQRKPRRTYQNIQKKEEIPENKAEFTPPPPPPLPPRAVIPPSPVRVRLEEKFGKSVRKK

Query:  TNVKKEIAMALASLYRKRKRKQKTKDAYVGD---------RRSPTEQRPPPPPPPPPPSFFRIFKKSSKNKRVHSESPPPPPPPPPPVPSSSRSTKKKIQ
        TNVKKEIAMALASLYRKRKRKQK K               RR P  +  P           RIFKKSSKNKRVHSESPPPPPPPPPP P SSRSTK+KIQ
Subjt:  TNVKKEIAMALASLYRKRKRKQKTKDAYVGD---------RRSPTEQRPPPPPPPPPPSFFRIFKKSSKNKRVHSESPPPPPPPPPPVPSSSRSTKKKIQ

Query:  IPLPPSPPPPPPAQQRNSTTYRRPPLPTSVRNSYIENPSINSREKSPTPTIPPPPPPPPSFKTTTDVKSTVGSDTVGSRSSETSRCGSPDPDNVNTSASG
        IPLPPSPPPPPP+QQRNSTT RRPPLPTSVRNSYIEN SINSR KSP  TIPPPPPPPPSFKTTTDVKSTVGSDTVGSRSSETSRCGSPDP+NVN+ AS 
Subjt:  IPLPPSPPPPPPAQQRNSTTYRRPPLPTSVRNSYIENPSINSREKSPTPTIPPPPPPPPSFKTTTDVKSTVGSDTVGSRSSETSRCGSPDPDNVNTSASG

Query:  GAGVGSVFCPSPDVNVKAANFIARLRGEWRLEKMNSALEN-RDGNDPTCSRAHCENTDQMKLAGL
        GAGVG VFCPSPDVNVKAANFIARLR EWRLEKMNS  E  R G  P         T   K+  L
Subjt:  GAGVGSVFCPSPDVNVKAANFIARLRGEWRLEKMNSALEN-RDGNDPTCSRAHCENTDQMKLAGL

XP_031742598.1 formin-like protein 20 [Cucumis sativus] (e-value: 2.03e-307; identity: 87.69%)
Query:  MEPNSDPPPFWPPSPPIRRRRSYSPPFISLPVLIILLPTLALILLFFAIRPLLSLINQVYKPSSVKKSWDSFNVFLVLFAIICGIFSRRNDDVPTTADGD
        MEPNSDPPPFWPPSPPI RRRSYSPPFISLPVLIILLPTLALILLFFAIRPLLSLINQVYKPSSVKKSWDSFNVFLVLFAIICGIFSRRNDDVPTTADGD
Subjt:  MEPNSDPPPFWPPSPPIRRRRSYSPPFISLPVLIILLPTLALILLFFAIRPLLSLINQVYKPSSVKKSWDSFNVFLVLFAIICGIFSRRNDDVPTTADGD

Query:  SRASDQMTVVDTGRVKVNGDSESSQQWFGFSERRFSDPTGRAPVTTRLRRNSSYPDLRQESLWGNGDDSNNQFRFFDDFEINKFRSRSFVYRTRGNEREE
        +R SDQMTVVDTG VKVNGDSESSQQWFGFSERRFSDPTGRAPVTTRLRRNSSYPDLRQESLWGNGDDSNNQFRFFDDFEINKFRSRSFVYRTRGNEREE
Subjt:  SRASDQMTVVDTGRVKVNGDSESSQQWFGFSERRFSDPTGRAPVTTRLRRNSSYPDLRQESLWGNGDDSNNQFRFFDDFEINKFRSRSFVYRTRGNEREE

Query:  SPAIPIDSFVVSSSPAPEKIKSQSPNPPPPPPPPLPVTQRKPRRTYQNIQKKEEIPENKAEFTPPPPPPLPPRAVIPPSPVRVRLEEKFGKSVRKKTNVK
        SPAIP+DSFVV+SSPAPEK+KSQSPNPPPPPPPPLPVTQRKPRRTYQNIQKKEEIPENKAEFTPPPPPPLPPR VIPPSPVRVRLEEKFGKSVRKKTNVK
Subjt:  SPAIPIDSFVVSSSPAPEKIKSQSPNPPPPPPPPLPVTQRKPRRTYQNIQKKEEIPENKAEFTPPPPPPLPPRAVIPPSPVRVRLEEKFGKSVRKKTNVK

Query:  KEIAMALASLYRKRKRKQKTKDAYVGDRRSPTEQRPPPPPPPPPPSFFRIFKKSSKNKRVHSESPPPPPPPPPPVPSSSRSTKKKIQIPLPPSPPPPPP-
        KEIAMALASL                                                 VHSESPPPPPPPPPPVPSSSRSTKKKIQIP PPSPPPPPP 
Subjt:  KEIAMALASLYRKRKRKQKTKDAYVGDRRSPTEQRPPPPPPPPPPSFFRIFKKSSKNKRVHSESPPPPPPPPPPVPSSSRSTKKKIQIPLPPSPPPPPP-

Query:  -AQQRNSTTYRRPPLPTSVRNSYIENPSINSREKSPTPTIPPPPPPPPSFKTTTDVKSTVGSDTVGSRSSETSRCGSPDPDNVNTSASGGAGVGSVFCPS
         AQQRNSTT RRPPLPTSVRNSYIENPSINSREKS TPTIPPPPPPPPSFKTTTDVKSTVGSDTVGSRSSETSRCGSPDP+NVNTSASGGAGVGSVFCPS
Subjt:  -AQQRNSTTYRRPPLPTSVRNSYIENPSINSREKSPTPTIPPPPPPPPSFKTTTDVKSTVGSDTVGSRSSETSRCGSPDPDNVNTSASGGAGVGSVFCPS

Query:  PDVNVKAANFIARLRGEWRLEKMNSALE
        PDVNVKAANFIARLRGEWRLEKMNS  E
Subjt:  PDVNVKAANFIARLRGEWRLEKMNSALE

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0LYY7 Uncharacterized protein (e-value: 1.3e-277; identity: 95.71%)
Query:  MEPNSDPPPFWPPSPPIRRRRSYSPPFISLPVLIILLPTLALILLFFAIRPLLSLINQVYKPSSVKKSWDSFNVFLVLFAIICGIFSRRNDDVPTTADGD
        MEPNSDPPPFWPPSPPI RRRSYSPPFISLPVLIILLPTLALILLFFAIRPLLSLINQVYKPSSVKKSWDSFNVFLVLFAIICGIFSRRNDDVPTTADGD
Subjt:  MEPNSDPPPFWPPSPPIRRRRSYSPPFISLPVLIILLPTLALILLFFAIRPLLSLINQVYKPSSVKKSWDSFNVFLVLFAIICGIFSRRNDDVPTTADGD

Query:  SRASDQMTVVDTGRVKVNGDSESSQQWFGFSERRFSDPTGRAPVTTRLRRNSSYPDLRQESLWGNGDDSNNQFRFFDDFEINKFRSRSFVYRTRGNEREE
        +R SDQMTVVDTG VKVNGDSESSQQWFGFSERRFSDPTGRAPVTTRLRRNSSYPDLRQESLWGNGDDSNNQFRFFDDFEINKFRSRSFVYRTRGNEREE
Subjt:  SRASDQMTVVDTGRVKVNGDSESSQQWFGFSERRFSDPTGRAPVTTRLRRNSSYPDLRQESLWGNGDDSNNQFRFFDDFEINKFRSRSFVYRTRGNEREE

Query:  SPAIPIDSFVVSSSPAPEKIKSQSPNPPPPPPPPLPVTQRKPRRTYQNIQKKEEIPENKAEFTPPPPPPLPPRAVIPPSPVRVRLEEKFGKSVRKKTNVK
        SPAIP+DSFVV+SSPAPEK+KSQSPNPPPPPPPPLPVTQRKPRRTYQNIQKKEEIPENKAEFTPPPPPPLPPR VIPPSPVRVRLEEKFGKSVRKKTNVK
Subjt:  SPAIPIDSFVVSSSPAPEKIKSQSPNPPPPPPPPLPVTQRKPRRTYQNIQKKEEIPENKAEFTPPPPPPLPPRAVIPPSPVRVRLEEKFGKSVRKKTNVK

Query:  KEIAMALASLYRKRKRKQKTKDAYVGDRRSPTEQRPPPPPPPPPPSFFRIFKKSSKNKRVHSESPPPPPPPPPPVPSSSRSTKKKIQIPLPPS--PPPPP
        KEIAMALASLYRKRKRKQKTKDAY GDRRSPTEQRPPPPPPPPPPSF RIFKKSSKNKRVHSESPPPPPPPPPPVPSSSRSTKKKIQIP PPS  PPPPP
Subjt:  KEIAMALASLYRKRKRKQKTKDAYVGDRRSPTEQRPPPPPPPPPPSFFRIFKKSSKNKRVHSESPPPPPPPPPPVPSSSRSTKKKIQIPLPPS--PPPPP

Query:  PAQQRNSTTYRRPPLPTSVRNSYIENPSINSREKSPTPTIPPPPPPPPSFKTTTDVKSTVGSDTVGSRSSETSRCGSPDPDNVNTSASGGAGVGSVFCPS
        PAQQRNSTT RRPPLPTSVRNSYIENPSINSREKS TPTIPPPPPPPPSFKTTTDVKSTVGSDTVGSRSSETSRCGSPDP+NVNTSASGGAGVGSVFCPS
Subjt:  PAQQRNSTTYRRPPLPTSVRNSYIENPSINSREKSPTPTIPPPPPPPPSFKTTTDVKSTVGSDTVGSRSSETSRCGSPDPDNVNTSASGGAGVGSVFCPS

Query:  PDVNVKAANFIARLRGEWRLEKMNSALE-NRDGNDP
        PDVNVKAANFIARLRGEWRLEKMNS  E  R G  P
Subjt:  PDVNVKAANFIARLRGEWRLEKMNSALE-NRDGNDP

A0A1S3BRX6 LOW QUALITY PROTEIN: ras-associated and pleckstrin homology domains-containing protein 1 (e-value: 7.5e-241; identity: 86.64%)
Query:  MEPNSDPPPFWPPSPPIRRRRSYSPPFISLPVLIILLPTLALILLFFAIRPLLSLINQVYKPSSVKKSWDSFNVFLVLFAIICGIFSRRNDDVPTTADGD
        MEPNSDPPPFW PSPPIRR RSYSPPFISLP+LIILLPTLALILLFFAIRPLLSL NQVYKP+SVKKSWDSFNVFLVL AIICGIF+RRNDDVPTTAD D
Subjt:  MEPNSDPPPFWPPSPPIRRRRSYSPPFISLPVLIILLPTLALILLFFAIRPLLSLINQVYKPSSVKKSWDSFNVFLVLFAIICGIFSRRNDDVPTTADGD

Query:  SRASDQMTVVDTGRVKVNGDSESSQQWFGFSERRFSDPTGRAPVTTRLRRNSSYPDLRQESLWGNGDDSNNQFRFFDDFEINKFRSRSFVYRTRGNEREE
        SR SDQ TVVDTG VKVNGDSE SQ+WFGFSERRFSDP GRAPVTTRLRRNSSYPDLRQESLW NG+D  NQFRFFDDFEINK+RSRSFVYRTRGNEREE
Subjt:  SRASDQMTVVDTGRVKVNGDSESSQQWFGFSERRFSDPTGRAPVTTRLRRNSSYPDLRQESLWGNGDDSNNQFRFFDDFEINKFRSRSFVYRTRGNEREE

Query:  SPA----IPIDSFVVSSSPAPEKIKSQSPNPPPPPPPPLPVTQRKPRRTYQNIQKKEEIPENKAEFTPPPPPPLPPRAVIPPSPVRVRLEEKFGKSVRKK
        SPA    IP+DSFV +SSPAPEKIKSQSPNPPPPPPPPLPVTQRKPRRTYQNIQKKEEIPENKAEFTPPPPPPLPPRAVIPPSPVRVRLEEKFGKSVRKK
Subjt:  SPA----IPIDSFVVSSSPAPEKIKSQSPNPPPPPPPPLPVTQRKPRRTYQNIQKKEEIPENKAEFTPPPPPPLPPRAVIPPSPVRVRLEEKFGKSVRKK

Query:  TNVKKEIAMALASLYRKRKRKQKTKDAYVG-DRRSPTEQRPPPPPPPPPPSFFRIFKKSSKNKRVHSESPPPPPPPPPPVPSSSRSTKKKIQIPLPPSPP
        TNVKKEIAMALASLYRKRKRKQK K      D   P       P     PS  RIFKKSSKNKRVHSESPPPPPPPPPP P SSRSTK+KIQIPLPPSPP
Subjt:  TNVKKEIAMALASLYRKRKRKQKTKDAYVG-DRRSPTEQRPPPPPPPPPPSFFRIFKKSSKNKRVHSESPPPPPPPPPPVPSSSRSTKKKIQIPLPPSPP

Query:  PPPPAQQRNSTTYRRPPLPTSVRNSYIENPSINSREKSPTPTIPPPPPPPPSFKTTTDVKSTVGSDTVGSRSSETSRCGSPDPDNVNTSASGGAGVGSVF
        PPPP+QQRNSTT RRPPLPTSVRNSYIEN SINSR KSP  TIPPPPPPPPSFKTTTDVKSTVGSDTVGSRSSETSRCGSPDP+NVN+ AS GAGVG VF
Subjt:  PPPPAQQRNSTTYRRPPLPTSVRNSYIENPSINSREKSPTPTIPPPPPPPPSFKTTTDVKSTVGSDTVGSRSSETSRCGSPDPDNVNTSASGGAGVGSVF

Query:  CPSPDVNVKAANFIARLRGEWRLEKMNSALE-NRDGNDP
        CPSPDVNVKAANFIARLR EWRLEKMNS  E  R G  P
Subjt:  CPSPDVNVKAANFIARLRGEWRLEKMNSALE-NRDGNDP

A0A5A7UIX1 Ras-associated and pleckstrin-like proteiny domains-containing protein 1 (e-value: 3.8e-261; identity: 90.71%)
Query:  MEPNSDPPPFWPPSPPIRRRRSYSPPFISLPVLIILLPTLALILLFFAIRPLLSLINQVYKPSSVKKSWDSFNVFLVLFAIICGIFSRRNDDVPTTADGD
        MEPNSDPPPFW PSPPIRR RSYSPPFISLP+LIILLPTLALILLFFAIRPLLSL NQVYKP+SVKKSWDSFNVFLVL AIICGIF+RRNDDVPTTAD D
Subjt:  MEPNSDPPPFWPPSPPIRRRRSYSPPFISLPVLIILLPTLALILLFFAIRPLLSLINQVYKPSSVKKSWDSFNVFLVLFAIICGIFSRRNDDVPTTADGD

Query:  SRASDQMTVVDTGRVKVNGDSESSQQWFGFSERRFSDPTGRAPVTTRLRRNSSYPDLRQESLWGNGDDSNNQFRFFDDFEINKFRSRSFVYRTRGNEREE
        SR SDQ TVVDTG VKVNGDSE SQ+WFGFSERRFSDP GRAPVTTRLRRNSSYPDLRQESLW NG+D  NQFRFFDDFEINK+RSRSFVYRTRGNEREE
Subjt:  SRASDQMTVVDTGRVKVNGDSESSQQWFGFSERRFSDPTGRAPVTTRLRRNSSYPDLRQESLWGNGDDSNNQFRFFDDFEINKFRSRSFVYRTRGNEREE

Query:  SPA----IPIDSFVVSSSPAPEKIKSQSPNPPPPPPPPLPVTQRKPRRTYQNIQKKEEIPENKAEFTPPPPPPLPPRAVIPPSPVRVRLEEKFGKSVRKK
        SPA    IP+DSFV +SSPAPEKIKSQSPNPPPPPPPPLPVTQRKPRRTYQNIQKKEEIPENKAEFTPPPPPPLPPRAVIPPSPVRVRLEEKFGKSVRKK
Subjt:  SPA----IPIDSFVVSSSPAPEKIKSQSPNPPPPPPPPLPVTQRKPRRTYQNIQKKEEIPENKAEFTPPPPPPLPPRAVIPPSPVRVRLEEKFGKSVRKK

Query:  TNVKKEIAMALASLYRKRKRKQKTKDAYVGDRRSPTEQRPPPPPPPPPPSFFRIFKKSSKNKRVHSESPPPPPPPPPPVPSSSRSTKKKIQIPLPPSPPP
        TNVKKEIAMALASLYRKRKRKQKTKDAY  DRRSPTEQRPPPPPPPPPPSFFRIFKKSSKNKRVHSESPPPPPPPPPP P SSRSTK+KIQIPLPPSPPP
Subjt:  TNVKKEIAMALASLYRKRKRKQKTKDAYVGDRRSPTEQRPPPPPPPPPPSFFRIFKKSSKNKRVHSESPPPPPPPPPPVPSSSRSTKKKIQIPLPPSPPP

Query:  PPPAQQRNSTTYRRPPLPTSVRNSYIENPSINSREKSPTPTIPPPPPPPPSFKTTTDVKSTVGSDTVGSRSSETSRCGSPDPDNVNTSASGGAGVGSVFC
        PPP+QQRNSTT RRPPLPTSVRNSYIEN SINSR KSP  TIPPPPPPPPSFKTTTDVKSTVGSDTVGSRSSETSRCGSPDP+NVN+ AS GAGVG VFC
Subjt:  PPPAQQRNSTTYRRPPLPTSVRNSYIENPSINSREKSPTPTIPPPPPPPPSFKTTTDVKSTVGSDTVGSRSSETSRCGSPDPDNVNTSASGGAGVGSVFC

Query:  PSPDVNVKAANFIARLRGEWRLEKMNSALE-NRDGNDP
        PSPDVNVKAANFIARLR EWRLEKMNS  E  R G  P
Subjt:  PSPDVNVKAANFIARLRGEWRLEKMNSALE-NRDGNDP

A0A6J1EH96 uncharacterized protein LOC111434169 (e-value: 1.1e-196; identity: 73.26%)
Query:  MEPNSDPPPFWPPSPPIRRRRSYSPPFISLPVLIILLPTLALILLFFAIRPLLSLINQVYKPSSVKKSWDSFNVFLVLFAIICGIFSRRNDDVPTTADGD
        MEPNSDPPP+W P P + RRRS SPPFISLPVLIILLPTLALI+LFFAIRPLLSL  Q+++P+SVKKSWDSFNVFL+L AIICGIF+RRNDDVPT AD D
Subjt:  MEPNSDPPPFWPPSPPIRRRRSYSPPFISLPVLIILLPTLALILLFFAIRPLLSLINQVYKPSSVKKSWDSFNVFLVLFAIICGIFSRRNDDVPTTADGD

Query:  -SRASDQMTVVDTGRVKVNGDSESSQQWFGFSERRFSDPTGRAPVTT-RLRRNSSYPDLRQESLWGNGDDSNNQFRFFDDFEINKFRSRSFVYRTRGNER
         SR SD+ T VD   VKVNGD E  QQWFGF+ERRFSD +GR P T  RLRRNSSYPDLRQES    G+D  NQFRF+DDFEINKFRSRSFVYRTRG+E 
Subjt:  -SRASDQMTVVDTGRVKVNGDSESSQQWFGFSERRFSDPTGRAPVTT-RLRRNSSYPDLRQESLWGNGDDSNNQFRFFDDFEINKFRSRSFVYRTRGNER

Query:  EESPA----IPIDSFVVSSSPAPEKIKSQSPNPPPPPPPPLPVTQRKPRRTYQNIQKKEEIPE---NKAEF----TPPPPPPLPPRAVIPPSPVRVRLEE
        EESPA    IP+DSFV +SSP P+++KS   NPPPPPPPPLPVTQRK RRTYQ IQ+KEE+ E   N AEF    TPP PPPLPPR VIPPSPVRVRLEE
Subjt:  EESPA----IPIDSFVVSSSPAPEKIKSQSPNPPPPPPPPLPVTQRKPRRTYQNIQKKEEIPE---NKAEF----TPPPPPPLPPRAVIPPSPVRVRLEE

Query:  KFGKSVRKKTNVKKEIAMALASLYRKRKRKQKTKDAYVGDRRSPTEQR--PPPPPPPPPPSFFR-IFKKSSKNKRVHSESPPPPPPPPPPVPSSSRSTKK
        +FG+S RKKTNVKKEIAMALASLYRKRK+KQK K+ Y GDRRSPTEQR  PPPPPPPPPPS FR +FKKS+KNKR+HSES PPPPPPPPPVP SSRSTKK
Subjt:  KFGKSVRKKTNVKKEIAMALASLYRKRKRKQKTKDAYVGDRRSPTEQR--PPPPPPPPPPSFFR-IFKKSSKNKRVHSESPPPPPPPPPPVPSSSRSTKK

Query:  KIQIPLPPSPPPPPPAQQRNSTTYRRPPLPTSVRNSYIENPSINSREKSPTPTIPPPPPPPPSFKTTTDVKSTVGSDTVGSRSSETSRCGSPDPDNVNTS
        KIQIP    PPP P  +QRNST  RRPPLP    N  IEN  INS  +SP+ TIPPPPPPPP FKTTTDVKST   +T GSRSSETSRCGSP+P  V  S
Subjt:  KIQIPLPPSPPPPPPAQQRNSTTYRRPPLPTSVRNSYIENPSINSREKSPTPTIPPPPPPPPSFKTTTDVKSTVGSDTVGSRSSETSRCGSPDPDNVNTS

Query:  A-----------SGGAGVGSVFCPSPDVNVKAANFIARLRGEWRLEKMNSALE-NRDGNDP
        +           +GG GVGSVFCPSPDVN+KA NFIARLRGEWRLEKMNS  E  R G  P
Subjt:  A-----------SGGAGVGSVFCPSPDVNVKAANFIARLRGEWRLEKMNSALE-NRDGNDP

B9RK45 WD_REPEATS_REGION domain-containing protein (e-value: 3.9e-229; identity: 55.07%)
Query:  SLPVLIILLPTLALILLFFAIRPLLSLINQVY---KPSSVKKSWDSFNVFLVLFAIICGIFSRRNDDVPTTADGDSRASDQMTVVDTGRVKVNGDSESSQ
        SLP+LI+L   L  IL FF I  L+S  +  Y   +PS+VKKSWDS NVFLVLFAI+CGIF+RRNDD  +   GD   S  +   ++   K    + S+ 
Subjt:  SLPVLIILLPTLALILLFFAIRPLLSLINQVY---KPSSVKKSWDSFNVFLVLFAIICGIFSRRNDDVPTTADGDSRASDQMTVVDTGRVKVNGDSESSQ

Query:  QWFGFSERRFSDPTGRAPVTTRLRRNSSYPDLRQESLWGNGDDSNNQFRFFDDFEINKFRSRSFVYRTRGNEREES-----------PAIPIDSFVVSS-
          +   + +F+  T   P+    R +SSYPDLRQESLW +GDD  ++FRFFDDFE++KFRS  + +    + R +              IP+D++V+ S 
Subjt:  QWFGFSERRFSDPTGRAPVTTRLRRNSSYPDLRQESLWGNGDDSNNQFRFFDDFEINKFRSRSFVYRTRGNEREES-----------PAIPIDSFVVSS-

Query:  ---SPAPEKIKSQSPNPPPPPPPPLP---VTQRKPRRTYQNIQKKEEIPENKAEFTPPPPPPLPPRAVIPPSPVRVRLEEKFGKSVRKKTNVKKEIAMAL
           SPAP    + +P PPPPPPPP P       K RR+Y+ + ++E+  ++      PPP P PP    P  PV  R ++K+ ++         E+    
Subjt:  ---SPAPEKIKSQSPNPPPPPPPPLP---VTQRKPRRTYQNIQKKEEIPENKAEFTPPPPPPLPPRAVIPPSPVRVRLEEKFGKSVRKKTNVKKEIAMAL

Query:  ASLYRKRKRKQ--------------KTKDAYVGDRRSPTEQRPPPPPPPPPPSFFRIFKKSSKNKRVHSESPPPPPPPPPPVPSSSRSTKKKIQIPLPPS
          LY + +R++                K+A V +      +   PPPPPP  +     K+ SK K       PPPPPPPPP PS+               
Subjt:  ASLYRKRKRKQ--------------KTKDAYVGDRRSPTEQRPPPPPPPPPPSFFRIFKKSSKNKRVHSESPPPPPPPPPPVPSSSRSTKKKIQIPLPPS

Query:  PPPPPPAQQRNST-TYRRPPLPTSVRNSYIENPSINSREKSPTPTIPPPPPPPPSFKTTTDVKSTVGSDTVGSRSSETSRCGSPDPDNVNTSASGGAGV-
         PP  P Q+RN T T  RPPLPT V N+     ++NS  +SP   +PPPPPPPP        K  V  D V  RS+ +SRC SP+ + V+  ++    + 
Subjt:  PPPPPPAQQRNST-TYRRPPLPTSVRNSYIENPSINSREKSPTPTIPPPPPPPPSFKTTTDVKSTVGSDTVGSRSSETSRCGSPDPDNVNTSASGGAGV-

Query:  --GSVFCPSPDVNVKAANFIARLRGEWRLEKMNSALENRDGNDPTCSRAHCENTDQMKLAGLKSVENAHEESVWAATWVPATDTRPSLLLTGSLDETVKL
          GSVFC SPDVN+KA +FIARLRGEWRLEK+NS      G  P               +GL+S+ENAH+ESVWAATWVPAT TR +LLLTGSLDETVKL
Subjt:  --GSVFCPSPDVNVKAANFIARLRGEWRLEKMNSALENRDGNDPTCSRAHCENTDQMKLAGLKSVENAHEESVWAATWVPATDTRPSLLLTGSLDETVKL

Query:  WKSDELDLERTNTGHCLGVVSVAAHPSGFIAASASLDSFVRVFEVDSNSTIATLEAPPSEVWQMRFNPEGTMLAVAGGGSASIKLWDTNTWKLAATLSIP
        W SDEL+LERTNTGHCLGVVSVAAHPSG IAASASLDSFVRVF+VD+N+TIATLE+PPSEVWQM+F+P+GT LAVAGGGSAS+ LWDT TWK  A+LS+P
Subjt:  WKSDELDLERTNTGHCLGVVSVAAHPSGFIAASASLDSFVRVFEVDSNSTIATLEAPPSEVWQMRFNPEGTMLAVAGGGSASIKLWDTNTWKLAATLSIP

Query:  RPEGPKPTDKTASKKFVLSVAWSIDGRRLACGSMDGTISVFDVARAKFLHHLEGHFMPVRSLVYSPVEPRLLFSASDDAHVHMYDAEGKTLIGAMSGHSS
        RPEGPKP+DK +SKKFVLSVAWS DG+RLACGSMDGTISVFDVARAKFLHHLEGHFMPVRSLVYSP++PR+LFSASDDAHVHMYD+EGK+LI AMSGH+S
Subjt:  RPEGPKPTDKTASKKFVLSVAWSIDGRRLACGSMDGTISVFDVARAKFLHHLEGHFMPVRSLVYSPVEPRLLFSASDDAHVHMYDAEGKTLIGAMSGHSS

Query:  WVLSVDASPDGAAVATGSSDRTVRLWDLNMRTAVQTMTNHSDQVWGVAFRPPGGVGVRSVRLASVSDDKSISLYDYS
        WVLSVDASPDGAA+ATGSSDRTVRLWDLNMR AVQTM+NHSDQVW VAFRPPGG G R+ RLASVSDDKSISLY YS
Subjt:  WVLSVDASPDGAAVATGSSDRTVRLWDLNMRTAVQTMTNHSDQVWGVAFRPPGGVGVRSVRLASVSDDKSISLYDYS

SwissProt top hits (e-value, % identity, alignment)
Q5ZJH5 WD repeat-containing protein 61 (e-value: 1.0e-61; identity: 39.1%)
Query:  ENAHEESVWAATWVPATDTRPSLLLTGSLDETVKLWK--SDELDLERTNTGHCLGVVSVAAHPSGFIAASASLDSFVRVFEVDSNSTIATLEAPPSEVWQ
        E AH++++W+  W          +++GSLD+ VK+WK   ++LDL+ T  GH LGVVSV    +G IAAS+SLD+ +R++++++   I +++A P + W 
Subjt:  ENAHEESVWAATWVPATDTRPSLLLTGSLDETVKLWK--SDELDLERTNTGHCLGVVSVAAHPSGFIAASASLDSFVRVFEVDSNSTIATLEAPPSEVWQ

Query:  MRFNPEGTMLAVAGGGSASIKLWDTNTWKLAATLSIPRPEGPKPTDKTASKKFVLSVAWSIDGRRLACGSMDGTISVFDVARAKFLHHLEGHFMPVRSLV
        + F+P+   LA  G     + ++   T             G K        KF+LS+A+S DG+ LA G++DG I++FD+A  K LH LEGH MP+RSL 
Subjt:  MRFNPEGTMLAVAGGGSASIKLWDTNTWKLAATLSIPRPEGPKPTDKTASKKFVLSVAWSIDGRRLACGSMDGTISVFDVARAKFLHHLEGHFMPVRSLV

Query:  YSPVEPRLLFSASDDAHVHMYDAEGKTLIGAMSGHSSWVLSVDASPDGAAVATGSSDRTVRLWDLNMRTAVQTMTNHSDQVWGVAFRPPGGVGVRSVRLA
        +SP + +LL +ASDD ++ +YD +   L G +SGH SWVL+V   PD     + SSD++V++WD   RT V T  +H DQVWGV +   G       ++ 
Subjt:  YSPVEPRLLFSASDDAHVHMYDAEGKTLIGAMSGHSSWVLSVDASPDGAAVATGSSDRTVRLWDLNMRTAVQTMTNHSDQVWGVAFRPPGGVGVRSVRLA

Query:  SVSDDKSISLYD
        SV DD+ I +YD
Subjt:  SVSDDKSISLYD

Q6GMD2 WD repeat-containing protein 61 (e-value: 1.6e-62; identity: 40.38%)
Query:  ENAHEESVWAATWVPATDTRPSLLLTGSLDETVKLWK-SDE-LDLERTNTGHCLGVVSVAAHPSGFIAASASLDSFVRVFEVDSNSTIATLEAPPSEVWQ
        E+AHE+++W+  W   ++    L+++GSLD+ VK+WK SDE L+++    GH LGVVSV   PSG I AS+SLD+ +R+++++S   I +++A P + W 
Subjt:  ENAHEESVWAATWVPATDTRPSLLLTGSLDETVKLWK-SDE-LDLERTNTGHCLGVVSVAAHPSGFIAASASLDSFVRVFEVDSNSTIATLEAPPSEVWQ

Query:  MRFNPEGTMLAVAGGGSASIKLWDTNTWKLAATLSIPRPEGPKPTDKTASKKFVLSVAWSIDGRRLACGSMDGTISVFDVARAKFLHHLEGHFMPVRSLV
        + F+P+   LA  G     + ++   T             G K        KF+LS+A+S DG+ LA G++DG I++FD+A  K LH LEGH MP+RSL 
Subjt:  MRFNPEGTMLAVAGGGSASIKLWDTNTWKLAATLSIPRPEGPKPTDKTASKKFVLSVAWSIDGRRLACGSMDGTISVFDVARAKFLHHLEGHFMPVRSLV

Query:  YSPVEPRLLFSASDDAHVHMYDAEGKTLIGAMSGHSSWVLSVDASPDGAAVATGSSDRTVRLWDLNMRTAVQTMTNHSDQVWGVAFRPPGGVGVRSVRLA
        +S  + +LL +ASDD ++ +YD +  +L   +SGH SWVL+V  SPD A   + SSD++V++WD++ RT V T  +H DQVWGV +   G       ++ 
Subjt:  YSPVEPRLLFSASDDAHVHMYDAEGKTLIGAMSGHSSWVLSVDASPDGAAVATGSSDRTVRLWDLNMRTAVQTMTNHSDQVWGVAFRPPGGVGVRSVRLA

Query:  SVSDDKSISLYD
        SV DD+ I +YD
Subjt:  SVSDDKSISLYD

Q6P5M2 WD repeat-containing protein 61 (e-value: 2.1e-62; identity: 39.74%)
Query:  ENAHEESVWAATWVPATDTRPSLLLTGSLDETVKLWK-SDE-LDLERTNTGHCLGVVSVAAHPSGFIAASASLDSFVRVFEVDSNSTIATLEAPPSEVWQ
        E+AHE+++W A W  +       ++TGSLD+ VK+WK SDE L+L+ T  GH LGVVSV    +G IAAS+SLD+ +R++++++   I +++A P + W 
Subjt:  ENAHEESVWAATWVPATDTRPSLLLTGSLDETVKLWK-SDE-LDLERTNTGHCLGVVSVAAHPSGFIAASASLDSFVRVFEVDSNSTIATLEAPPSEVWQ

Query:  MRFNPEGTMLAVAGGGSASIKLWDTNTWKLAATLSIPRPEGPKPTDKTASKKFVLSVAWSIDGRRLACGSMDGTISVFDVARAKFLHHLEGHFMPVRSLV
        + F+P+   +A  G     + ++   +             G K        KF+LS+A+S DG+ LA G++DG I++FD+A  K LH LEGH MP+RSL 
Subjt:  MRFNPEGTMLAVAGGGSASIKLWDTNTWKLAATLSIPRPEGPKPTDKTASKKFVLSVAWSIDGRRLACGSMDGTISVFDVARAKFLHHLEGHFMPVRSLV

Query:  YSPVEPRLLFSASDDAHVHMYDAEGKTLIGAMSGHSSWVLSVDASPDGAAVATGSSDRTVRLWDLNMRTAVQTMTNHSDQVWGVAFRPPGGVGVRSVRLA
        +SP + +LL +ASDD ++ +YD +   L G +SGH SWVLSV  SPD     + SSD+++++WD + R+ V T  +H DQVW V + P G       ++ 
Subjt:  YSPVEPRLLFSASDDAHVHMYDAEGKTLIGAMSGHSSWVLSVDASPDGAAVATGSSDRTVRLWDLNMRTAVQTMTNHSDQVWGVAFRPPGGVGVRSVRLA

Query:  SVSDDKSISLYD
        S  DD++I +YD
Subjt:  SVSDDKSISLYD

Q6PBD6 WD repeat-containing protein 61 (e-value: 1.9e-63; identity: 40.71%)
Query:  ENAHEESVWAATWVPATDTRPSLLLTGSLDETVKLWK-SDE-LDLERTNTGHCLGVVSVAAHPSGFIAASASLDSFVRVFEVDSNSTIATLEAPPSEVWQ
        E+AHE+++W+  W   ++    L+++GSLD+ VK+WK SDE L+L+ T  GH LGVVSV   PSG I AS+SLD+ +R+++++S   I  ++A P + W 
Subjt:  ENAHEESVWAATWVPATDTRPSLLLTGSLDETVKLWK-SDE-LDLERTNTGHCLGVVSVAAHPSGFIAASASLDSFVRVFEVDSNSTIATLEAPPSEVWQ

Query:  MRFNPEGTMLAVAGGGSASIKLWDTNTWKLAATLSIPRPEGPKPTDKTASKKFVLSVAWSIDGRRLACGSMDGTISVFDVARAKFLHHLEGHFMPVRSLV
        + F+P+   LA  G     + ++   T             G K        KF+LS+A+S DG+ LA G++DG I++FD+A  K LH LEGH MP+RSL 
Subjt:  MRFNPEGTMLAVAGGGSASIKLWDTNTWKLAATLSIPRPEGPKPTDKTASKKFVLSVAWSIDGRRLACGSMDGTISVFDVARAKFLHHLEGHFMPVRSLV

Query:  YSPVEPRLLFSASDDAHVHMYDAEGKTLIGAMSGHSSWVLSVDASPDGAAVATGSSDRTVRLWDLNMRTAVQTMTNHSDQVWGVAFRPPGGVGVRSVRLA
        +SP + +LL +ASDD ++ +Y+ +  +L   +SGH SWVL+V  SPD     + SSD++V++WD++ RT V T  +H DQVWGV +   G       ++ 
Subjt:  YSPVEPRLLFSASDDAHVHMYDAEGKTLIGAMSGHSSWVLSVDASPDGAAVATGSSDRTVRLWDLNMRTAVQTMTNHSDQVWGVAFRPPGGVGVRSVRLA

Query:  SVSDDKSISLYD
        SV+DD+ I +YD
Subjt:  SVSDDKSISLYD

Q9SZQ5 WD repeat-containing protein VIP3 (e-value: 2.8e-152; identity: 79.44%)
Query:  MKLAGLKSVENAHEESVWAATWVPATDTRPSLLLTGSLDETVKLWKSDELDLERTNTGHCLGVVSVAAHPSGFIAASASLDSFVRVFEVDSNSTIATLEA
        MKLAGLKS+ENAHE+SVWAATWVPAT+ RP+LLLTGSLDETVKLW+ DELDL RTNTGH LGV ++AAHPSG IAAS+S+DSFVRVF+VD+N+TIA LEA
Subjt:  MKLAGLKSVENAHEESVWAATWVPATDTRPSLLLTGSLDETVKLWKSDELDLERTNTGHCLGVVSVAAHPSGFIAASASLDSFVRVFEVDSNSTIATLEA

Query:  PPSEVWQMRFNPEGTMLAVAGGGSASIKLWDTNTWKLAATLSIPRPEGPKPTDKTASKKFVLSVAWSIDGRRLACGSMDGTISVFDVARAKFLHHLEGHF
        PPSEVW M+F P+GT+LAVAGG SAS+KLWDT +W+L +TLSIPRP+ PKP+DKT+SKKFVLSVAWS +G+RLACGSMDGTI VFDV R+K LH LEGH 
Subjt:  PPSEVWQMRFNPEGTMLAVAGGGSASIKLWDTNTWKLAATLSIPRPEGPKPTDKTASKKFVLSVAWSIDGRRLACGSMDGTISVFDVARAKFLHHLEGHF

Query:  MPVRSLVYSPVEPRLLFSASDDAHVHMYDAEGKTLIGAMSGHSSWVLSVDASPDGAAVATGSSDRTVRLWDLNMRTAVQTMTNHSDQVWGVAFRPPGGVG
        MPVRSLV+SPV+PR+LFS SDD HV+M+DAEGKTL+G+MSGH+SWVLSVDASPDG A+ATGSSDRTVRLWDL MR A+QTM+NH+DQVW VAFRPPGG G
Subjt:  MPVRSLVYSPVEPRLLFSASDDAHVHMYDAEGKTLIGAMSGHSSWVLSVDASPDGAAVATGSSDRTVRLWDLNMRTAVQTMTNHSDQVWGVAFRPPGGVG

Query:  VRSVRLASVSDDKSISLYDYS
        VR+ RLASVSDDKS+SLYDYS
Subjt:  VRSVRLASVSDDKSISLYDYS

Arabidopsis top hits (e-value, % identity, alignment)
AT1G72790.1 hydroxyproline-rich glycoprotein family protein (e-value: 2.0e-36; identity: 31.25%)
Query:  PFWPPS----PPIRRRRSYSPPFISLPVLIILLPTLALILLFFAIRPLLSLINQVYKPSSVKKSWDSFNVFLVLFAIICGIFSR--RNDDVPTTADGD--
        PFW  S       RR  S      ++   I    T A++++ F I P  S ++Q+++P  V+KSWD  N  LVLFA++CG  SR   ND+     + D  
Subjt:  PFWPPS----PPIRRRRSYSPPFISLPVLIILLPTLALILLFFAIRPLLSLINQVYKPSSVKKSWDSFNVFLVLFAIICGIFSR--RNDDVPTTADGD--

Query:  SRASDQMTVVDTGRVKVNGDSESSQQWFGFSERRFSDPTGRAPVTTRLRRNSSYPDLRQESLWGNGDDSNNQFRFFDDFEINKFRSRS----FVYRTRGN
        ++ S   +++D  R +V+    + + W         D T      +RLR  SSYPDLR         +++ ++RF+DD  +++ R       +  ++  N
Subjt:  SRASDQMTVVDTGRVKVNGDSESSQQWFGFSERRFSDPTGRAPVTTRLRRNSSYPDLRQESLWGNGDDSNNQFRFFDDFEINKFRSRS----FVYRTRGN

Query:  EREESPAIPID------------SFVVSSSPAPEKIK---------------------SQSPNPPPPPPPPLPVTQRKPRRTYQNIQKKEEIPENK---A
          EE    P D            S V +     EK++                       SP P PP PPP    +RK  R YQ++  +EE  E     A
Subjt:  EREESPAIPID------------SFVVSSSPAPEKIK---------------------SQSPNPPPPPPPPLPVTQRKPRRTYQNIQKKEEIPENK---A

Query:  EFTPPPPPPLPPRAVIPPSPVRVRLEEKFGKSVRKKTNVKKEIAMALASLYRKRKRKQKTKDA--YVGDRRSPTEQRPPPPPPPPPPSFFRIF-KKSSKN
          TP PPP                + +K  K  +KK    K+  +AL    +K+K++Q++ D    +     P    PPPPPPPPPP F  +F  K  K+
Subjt:  EFTPPPPPPLPPRAVIPPSPVRVRLEEKFGKSVRKKTNVKKEIAMALASLYRKRKRKQKTKDA--YVGDRRSPTEQRPPPPPPPPPPSFFRIF-KKSSKN

Query:  KRVHSESPPPPPPPPPPVPSSSRSTKKKIQIPLPPSPPPPPPAQQRNSTTYRRPPLPTSVRNSYIENPSINSREKSPTPTIPPPPPPPPSFKTTTDVKST
        K+ +S  PPPPPPPPP     SR++  K++           P + R S      P P +    Y+   S     +SP   IPPPPPPPP FK     K  
Subjt:  KRVHSESPPPPPPPPPPVPSSSRSTKKKIQIPLPPSPPPPPPAQQRNSTTYRRPPLPTSVRNSYIENPSINSREKSPTPTIPPPPPPPPSFKTTTDVKST

Query:  VGSDTVGSRSSETSRCGSPDPDNVNTSASGGAGVGSVFCPSPDVNVKAANFIARLRGEWRLEKMNSALENRDGNDP
           D V   S  +     PD  +V  SA      GS+FCPSPDV+ KA +FIAR R   +LEKMNS    R    P
Subjt:  VGSDTVGSRSSETSRCGSPDPDNVNTSASGGAGVGSVFCPSPDVNVKAANFIARLRGEWRLEKMNSALENRDGNDP

AT2G41500.1 WD-40 repeat family protein / small nuclear ribonucleoprotein Prp4p-related (e-value: 3.9e-24; identity: 28.87%)
Query:  NRDGND-PTCSRAHCENTDQM-KLAGLKSVENAHEESVWAATWVPATDTRPSLLLTGSLDETVKLWKSDELDLERTNTGHCLGVVSVAAHPSGFIAASAS
        +RDG    TCS +      +M ++    +V   H+E      + P  D     L T S D T KLWK+D   L +T  GH   +  VA HPSG    + S
Subjt:  NRDGND-PTCSRAHCENTDQM-KLAGLKSVENAHEESVWAATWVPATDTRPSLLLTGSLDETVKLWKSDELDLERTNTGHCLGVVSVAAHPSGFIAASAS

Query:  LDSFVRVFEVDSNSTIATLEAPPSEVWQMRFNPEGTMLAVAGGGSASIKLWDTNTWKLAATLSIPRPEGPKPTDKTASKKFVLSVAWSIDGRRLACGSMD
         D   R++++++ + +   E     V+ + F  +G + A  G  S + ++WD  T +                      K V SV +S +G  LA G  D
Subjt:  LDSFVRVFEVDSNSTIATLEAPPSEVWQMRFNPEGTMLAVAGGGSASIKLWDTNTWKLAATLSIPRPEGPKPTDKTASKKFVLSVAWSIDGRRLACGSMD

Query:  GTISVFDVARAKFLHHLEGHFMPVRSLVYSPVEPRLLFSASDDAHVHMYDAEGKTLIGAMSGHSSWVLSVDASPDGAAVATGSSDRTVRLW
            ++D+   K L+ +  H   V  + Y P E   L +AS D  V+++     +L+ +++GH S V S+D + D + +AT S DRT++LW
Subjt:  GTISVFDVARAKFLHHLEGHFMPVRSLVYSPVEPRLLFSASDDAHVHMYDAEGKTLIGAMSGHSSWVLSVDASPDGAAVATGSSDRTVRLW

AT4G29830.1 Transducin/WD40 repeat-like superfamily protein (e value: 2.0e-153; %identity: 79.44)
Query:  MKLAGLKSVENAHEESVWAATWVPATDTRPSLLLTGSLDETVKLWKSDELDLERTNTGHCLGVVSVAAHPSGFIAASASLDSFVRVFEVDSNSTIATLEA
        MKLAGLKS+ENAHE+SVWAATWVPAT+ RP+LLLTGSLDETVKLW+ DELDL RTNTGH LGV ++AAHPSG IAAS+S+DSFVRVF+VD+N+TIA LEA
Subjt:  MKLAGLKSVENAHEESVWAATWVPATDTRPSLLLTGSLDETVKLWKSDELDLERTNTGHCLGVVSVAAHPSGFIAASASLDSFVRVFEVDSNSTIATLEA

Query:  PPSEVWQMRFNPEGTMLAVAGGGSASIKLWDTNTWKLAATLSIPRPEGPKPTDKTASKKFVLSVAWSIDGRRLACGSMDGTISVFDVARAKFLHHLEGHF
        PPSEVW M+F P+GT+LAVAGG SAS+KLWDT +W+L +TLSIPRP+ PKP+DKT+SKKFVLSVAWS +G+RLACGSMDGTI VFDV R+K LH LEGH 
Subjt:  PPSEVWQMRFNPEGTMLAVAGGGSASIKLWDTNTWKLAATLSIPRPEGPKPTDKTASKKFVLSVAWSIDGRRLACGSMDGTISVFDVARAKFLHHLEGHF

Query:  MPVRSLVYSPVEPRLLFSASDDAHVHMYDAEGKTLIGAMSGHSSWVLSVDASPDGAAVATGSSDRTVRLWDLNMRTAVQTMTNHSDQVWGVAFRPPGGVG
        MPVRSLV+SPV+PR+LFS SDD HV+M+DAEGKTL+G+MSGH+SWVLSVDASPDG A+ATGSSDRTVRLWDL MR A+QTM+NH+DQVW VAFRPPGG G
Subjt:  MPVRSLVYSPVEPRLLFSASDDAHVHMYDAEGKTLIGAMSGHSSWVLSVDASPDGAAVATGSSDRTVRLWDLNMRTAVQTMTNHSDQVWGVAFRPPGGVG

Query:  VRSVRLASVSDDKSISLYDYS
        VR+ RLASVSDDKS+SLYDYS
Subjt:  VRSVRLASVSDDKSISLYDYS

AT5G07740.1 actin binding (e value: 3.4e-12; %identity: 32.95)
Query:  PAPEKIKSQSPN------PPPPPPPPLPVTQRKPRRTYQNIQKKEEIPENKAEFTPPPPPPLP-------PRAVIPPSPVRVRLEEKFGKSVRKKTNVKK
        P P    S+ PN      PPPPPPPP                   E P +     PPPPPPLP          V+PP P                   K 
Subjt:  PAPEKIKSQSPN------PPPPPPPPLPVTQRKPRRTYQNIQKKEEIPENKAEFTPPPPPPLP-------PRAVIPPSPVRVRLEEKFGKSVRKKTNVKK

Query:  EIAMALASLYRKRKRKQKTKDAYVGDRRSPTEQRPPPPPPPPPPSFFRIFKKSSKNKRVHSESPPPPPPPPPPVPSSSRSTKKKIQIPLPPSPPPPPP--
          A ALA              A     ++PT     P PPPPPP+++ + +KSS  +     SPPPPPPPPP       S ++  +  LPP PPPPPP  
Subjt:  EIAMALASLYRKRKRKQKTKDAYVGDRRSPTEQRPPPPPPPPPPSFFRIFKKSSKNKRVHSESPPPPPPPPPPVPSSSRSTKKKIQIPLPPSPPPPPP--

Query:  -AQQRNSTTYRRPPLPTSVRNSYIENPSINSREKSPTPTIPPPPPPPPSFKTTTDVKS
         + +RNS T   PP P     S   + +  + E   T + PPPPPPPP F      K+
Subjt:  -AQQRNSTTYRRPPLPTSVRNSYIENPSINSREKSPTPTIPPPPPPPPSFKTTTDVKS

AT5G57070.1 hydroxyproline-rich glycoprotein family protein (e value: 2.1e-62; %identity: 37.01)
Query:  NSDPPPFWP--PSPPIRRRRSYSPPFISLPVLI-ILLPTLALILLFFAIRPLLSLINQVYKPSSVKKSWDSFNVFLVLFAIICGIFSRRNDDVPTT----
        +  PP  WP   S    RRRS S P I +P +I +    + L+ + F +   LS+ +Q+ +P+SVK+ WDS NV LV+FAI+CG+ +RRNDD  ++    
Subjt:  NSDPPPFWP--PSPPIRRRRSYSPPFISLPVLI-ILLPTLALILLFFAIRPLLSLINQVYKPSSVKKSWDSFNVFLVLFAIICGIFSRRNDDVPTT----

Query:  ------ADGDSRASDQMTVVDTGRVKVNGDSESSQQWF-------------GFSERRFS---DPTGRAPVTTRLRRNSSYPDLRQESLWGNGDDSNNQFR
                G +  + +MTV +  ++  +  S  S+QWF               S R FS     TG  P+    R +SSYPDLRQ      GD    +FR
Subjt:  ------ADGDSRASDQMTVVDTGRVKVNGDSESSQQWF-------------GFSERRFS---DPTGRAPVTTRLRRNSSYPDLRQESLWGNGDDSNNQFR

Query:  FFDDFEINKFRSRSF--------VYRTRGNEREESP-AIPIDSFVVSSSPAPEKIKSQSPNPPPPPPPPLPV-TQRKPRRTYQNIQKK---EEIPENKAE
        F+DDFEI+K+RS+          + +T   E E  P  I ID+FVV  S  P+    Q P  PPPPPPP PV   +KPRRT+++++ +   E    ++ +
Subjt:  FFDDFEINKFRSRSF--------VYRTRGNEREESP-AIPIDSFVVSSSPAPEKIKSQSPNPPPPPPPPLPV-TQRKPRRTYQNIQKK---EEIPENKAE

Query:  F---------TPPPPPPLPPRAVIPPSPVRVRLEEKFGKSVRKKTNVKKEIAMALASLY---RKRKRKQKTK-------DAYVGDRRSPTEQR-------
        F          PPPPPP PP+ +I  +P R     K G   R+K+N  KEI M  ASLY   +K+K+ QK+K          V D   P + +       
Subjt:  F---------TPPPPPPLPPRAVIPPSPVRVRLEEKFGKSVRKKTNVKKEIAMALASLY---RKRKRKQKTK-------DAYVGDRRSPTEQR-------

Query:  PPPPPPPPPPS-------FFRIFKKSSK-NKRVHSESPPPPPPPPPPVPSSSRSTKKKIQIPLPPSPPPPPPAQQRNSTTYRRPPLPTSVRNSYIENPSI
        PPPPPPPPPP        F+ +FKK  K NK++HS   PPPPPPP       R T+           P  PP + ++     RPP PT  +N   E    
Subjt:  PPPPPPPPPPS-------FFRIFKKSSK-NKRVHSESPPPPPPPPPPVPSSSRSTKKKIQIPLPPSPPPPPPAQQRNSTTYRRPPLPTSVRNSYIENPSI

Query:  NSREKSPTPTIPPPPPPPPSFKTTTDVKSTVGSDTVGSRSSETSRCGSPDPD---------------NVNTSASGGAGVGSVFCPSPDVNVKAANFIARL
        N+ + SP   I PPPPPPP F+    +K  V  D    RS+++SRC SP+ +                V T A+   G    FCPSPDV+ KA NFIARL
Subjt:  NSREKSPTPTIPPPPPPPPSFKTTTDVKSTVGSDTVGSRSSETSRCGSPDPD---------------NVNTSASGGAGVGSVFCPSPDVNVKAANFIARL

Query:  RGEWRLEKMNSALENR
        R EWRL+K+NS    R
Subjt:  RGEWRLEKMNSALENR


Sequences
CDS sequence
ATGGAGCCAAATAGCGACCCACCACCATTCTGGCCACCGTCGCCACCCATCCGCCGTCGTAGATCCTACTCTCCGCCCTTCATTTCTTTGCCGGTTTTGATTATTCTCTT
ACCCACGCTAGCTTTGATTCTCTTGTTCTTTGCAATCCGTCCGCTTCTTTCACTCATCAATCAAGTTTACAAACCCAGTTCCGTCAAGAAAAGCTGGGATTCCTTCAATG
TTTTCCTTGTTCTCTTTGCGATTATCTGCGGCATTTTCTCTAGACGAAACGACGACGTTCCGACTACCGCCGACGGAGATAGCCGTGCTTCTGATCAAATGACGGTTGTT
GATACTGGTCGAGTTAAGGTGAACGGAGACTCGGAGTCATCGCAGCAGTGGTTCGGATTTTCGGAGAGGAGATTTTCCGATCCGACGGGGAGAGCGCCGGTGACGACGAG
ATTGAGGAGGAATAGCTCTTATCCAGATCTGCGGCAGGAATCGCTATGGGGGAATGGTGATGATAGTAACAATCAGTTCCGATTTTTTGATGATTTTGAAATAAACAAGT
TTCGTTCGCGATCCTTCGTGTATCGGACGCGAGGAAATGAAAGAGAAGAATCTCCGGCGATTCCGATTGATTCGTTTGTGGTGAGTTCCTCGCCGGCGCCTGAGAAGATA
AAGTCTCAGTCTCCCAATCCTCCGCCTCCACCACCGCCGCCACTGCCGGTAACTCAGAGGAAACCGAGACGGACGTATCAGAATATTCAGAAAAAGGAGGAAATTCCTGA
GAATAAAGCTGAATTCACGCCGCCGCCTCCACCGCCACTTCCGCCGCGAGCAGTTATCCCACCGTCGCCGGTTCGTGTTCGATTGGAGGAGAAATTTGGAAAGAGTGTAC
GGAAAAAGACGAATGTTAAAAAAGAAATAGCAATGGCGTTGGCTTCACTGTATAGGAAGAGAAAGAGAAAACAAAAAACTAAAGATGCCTACGTCGGCGACCGACGATCT
CCAACCGAACAACGACCGCCGCCGCCACCCCCACCTCCGCCTCCTTCCTTCTTTCGCATATTCAAAAAATCCAGCAAAAACAAGAGAGTTCACTCCGAATCTCCCCCACC
GCCGCCTCCGCCACCACCGCCTGTCCCATCATCATCCCGTTCAACAAAGAAGAAAATCCAGATCCCGCTTCCACCTTCACCACCGCCACCTCCACCAGCACAGCAGCGAA
ACTCCACCACTTATCGCCGACCACCTCTACCAACGAGCGTCCGTAATTCCTACATCGAAAACCCCAGCATTAACAGCCGAGAAAAGAGTCCCACACCAACTATTCCACCG
CCTCCGCCTCCGCCTCCTTCGTTCAAAACGACGACAGACGTGAAATCCACAGTTGGAAGCGACACTGTCGGGTCACGAAGCTCCGAAACATCTCGTTGCGGATCTCCGGA
TCCAGATAACGTGAATACGTCGGCCAGCGGCGGGGCCGGAGTGGGGTCGGTGTTCTGTCCCAGCCCGGACGTTAACGTAAAAGCTGCAAATTTCATCGCGAGGTTGAGAG
GTGAATGGAGGCTGGAGAAGATGAATTCGGCACTTGAGAATAGAGATGGTAACGATCCGACGTGCTCTAGGGCTCACTGTGAAAACACTGACCAAATGAAACTCGCCGGT
CTCAAATCCGTTGAAAACGCTCACGAAGAGTCCGTATGGGCGGCCACTTGGGTTCCGGCCACCGACACTCGTCCTTCTCTCCTCCTCACCGGCTCCCTCGACGAGACTGT
CAAGCTATGGAAGTCCGATGAGCTCGACTTGGAACGCACCAACACTGGCCACTGCCTCGGCGTCGTCTCTGTTGCTGCTCATCCTTCCGGTTTCATTGCTGCCTCTGCTT
CCCTCGACAGTTTTGTTCGTGTCTTTGAAGTCGATTCCAACTCCACTATCGCCACTCTTGAGGCTCCTCCTTCTGAAGTCTGGCAAATGCGCTTCAATCCCGAGGGTACC
ATGTTGGCAGTTGCTGGTGGAGGTAGCGCATCAATTAAGCTTTGGGACACAAACACATGGAAACTGGCTGCAACTCTATCAATTCCTCGTCCGGAAGGTCCTAAGCCCAC
CGATAAAACTGCTAGCAAGAAGTTTGTACTTTCAGTAGCATGGAGCATTGACGGGAGAAGACTCGCTTGTGGTTCAATGGACGGGACCATTTCAGTCTTTGATGTAGCTC
GTGCCAAGTTTCTACACCACTTGGAAGGCCACTTCATGCCAGTGAGATCCCTCGTGTATTCACCAGTGGAGCCACGACTACTCTTTTCTGCCTCTGATGATGCTCATGTC
CACATGTATGACGCAGAGGGTAAAACTCTAATTGGAGCGATGTCAGGCCATTCAAGTTGGGTATTGAGCGTTGATGCAAGCCCTGATGGCGCTGCTGTTGCTACCGGTTC
AAGCGACAGAACTGTTAGGCTTTGGGATCTCAACATGAGGACTGCTGTTCAGACAATGACTAACCATAGTGATCAGGTCTGGGGGGTTGCATTTCGACCACCTGGTGGGG
TTGGCGTGCGATCTGTTCGACTTGCTAGTGTGTCTGATGACAAGAGTATCTCCTTGTATGACTATTCTTGA
mRNA sequence
ATGGAGCCAAATAGCGACCCACCACCATTCTGGCCACCGTCGCCACCCATCCGCCGTCGTAGATCCTACTCTCCGCCCTTCATTTCTTTGCCGGTTTTGATTATTCTCTT
ACCCACGCTAGCTTTGATTCTCTTGTTCTTTGCAATCCGTCCGCTTCTTTCACTCATCAATCAAGTTTACAAACCCAGTTCCGTCAAGAAAAGCTGGGATTCCTTCAATG
TTTTCCTTGTTCTCTTTGCGATTATCTGCGGCATTTTCTCTAGACGAAACGACGACGTTCCGACTACCGCCGACGGAGATAGCCGTGCTTCTGATCAAATGACGGTTGTT
GATACTGGTCGAGTTAAGGTGAACGGAGACTCGGAGTCATCGCAGCAGTGGTTCGGATTTTCGGAGAGGAGATTTTCCGATCCGACGGGGAGAGCGCCGGTGACGACGAG
ATTGAGGAGGAATAGCTCTTATCCAGATCTGCGGCAGGAATCGCTATGGGGGAATGGTGATGATAGTAACAATCAGTTCCGATTTTTTGATGATTTTGAAATAAACAAGT
TTCGTTCGCGATCCTTCGTGTATCGGACGCGAGGAAATGAAAGAGAAGAATCTCCGGCGATTCCGATTGATTCGTTTGTGGTGAGTTCCTCGCCGGCGCCTGAGAAGATA
AAGTCTCAGTCTCCCAATCCTCCGCCTCCACCACCGCCGCCACTGCCGGTAACTCAGAGGAAACCGAGACGGACGTATCAGAATATTCAGAAAAAGGAGGAAATTCCTGA
GAATAAAGCTGAATTCACGCCGCCGCCTCCACCGCCACTTCCGCCGCGAGCAGTTATCCCACCGTCGCCGGTTCGTGTTCGATTGGAGGAGAAATTTGGAAAGAGTGTAC
GGAAAAAGACGAATGTTAAAAAAGAAATAGCAATGGCGTTGGCTTCACTGTATAGGAAGAGAAAGAGAAAACAAAAAACTAAAGATGCCTACGTCGGCGACCGACGATCT
CCAACCGAACAACGACCGCCGCCGCCACCCCCACCTCCGCCTCCTTCCTTCTTTCGCATATTCAAAAAATCCAGCAAAAACAAGAGAGTTCACTCCGAATCTCCCCCACC
GCCGCCTCCGCCACCACCGCCTGTCCCATCATCATCCCGTTCAACAAAGAAGAAAATCCAGATCCCGCTTCCACCTTCACCACCGCCACCTCCACCAGCACAGCAGCGAA
ACTCCACCACTTATCGCCGACCACCTCTACCAACGAGCGTCCGTAATTCCTACATCGAAAACCCCAGCATTAACAGCCGAGAAAAGAGTCCCACACCAACTATTCCACCG
CCTCCGCCTCCGCCTCCTTCGTTCAAAACGACGACAGACGTGAAATCCACAGTTGGAAGCGACACTGTCGGGTCACGAAGCTCCGAAACATCTCGTTGCGGATCTCCGGA
TCCAGATAACGTGAATACGTCGGCCAGCGGCGGGGCCGGAGTGGGGTCGGTGTTCTGTCCCAGCCCGGACGTTAACGTAAAAGCTGCAAATTTCATCGCGAGGTTGAGAG
GTGAATGGAGGCTGGAGAAGATGAATTCGGCACTTGAGAATAGAGATGGTAACGATCCGACGTGCTCTAGGGCTCACTGTGAAAACACTGACCAAATGAAACTCGCCGGT
CTCAAATCCGTTGAAAACGCTCACGAAGAGTCCGTATGGGCGGCCACTTGGGTTCCGGCCACCGACACTCGTCCTTCTCTCCTCCTCACCGGCTCCCTCGACGAGACTGT
CAAGCTATGGAAGTCCGATGAGCTCGACTTGGAACGCACCAACACTGGCCACTGCCTCGGCGTCGTCTCTGTTGCTGCTCATCCTTCCGGTTTCATTGCTGCCTCTGCTT
CCCTCGACAGTTTTGTTCGTGTCTTTGAAGTCGATTCCAACTCCACTATCGCCACTCTTGAGGCTCCTCCTTCTGAAGTCTGGCAAATGCGCTTCAATCCCGAGGGTACC
ATGTTGGCAGTTGCTGGTGGAGGTAGCGCATCAATTAAGCTTTGGGACACAAACACATGGAAACTGGCTGCAACTCTATCAATTCCTCGTCCGGAAGGTCCTAAGCCCAC
CGATAAAACTGCTAGCAAGAAGTTTGTACTTTCAGTAGCATGGAGCATTGACGGGAGAAGACTCGCTTGTGGTTCAATGGACGGGACCATTTCAGTCTTTGATGTAGCTC
GTGCCAAGTTTCTACACCACTTGGAAGGCCACTTCATGCCAGTGAGATCCCTCGTGTATTCACCAGTGGAGCCACGACTACTCTTTTCTGCCTCTGATGATGCTCATGTC
CACATGTATGACGCAGAGGGTAAAACTCTAATTGGAGCGATGTCAGGCCATTCAAGTTGGGTATTGAGCGTTGATGCAAGCCCTGATGGCGCTGCTGTTGCTACCGGTTC
AAGCGACAGAACTGTTAGGCTTTGGGATCTCAACATGAGGACTGCTGTTCAGACAATGACTAACCATAGTGATCAGGTCTGGGGGGTTGCATTTCGACCACCTGGTGGGG
TTGGCGTGCGATCTGTTCGACTTGCTAGTGTGTCTGATGACAAGAGTATCTCCTTGTATGACTATTCTTGA
Protein sequence
MEPNSDPPPFWPPSPPIRRRRSYSPPFISLPVLIILLPTLALILLFFAIRPLLSLINQVYKPSSVKKSWDSFNVFLVLFAIICGIFSRRNDDVPTTADGDSRASDQMTVV
DTGRVKVNGDSESSQQWFGFSERRFSDPTGRAPVTTRLRRNSSYPDLRQESLWGNGDDSNNQFRFFDDFEINKFRSRSFVYRTRGNEREESPAIPIDSFVVSSSPAPEKI
KSQSPNPPPPPPPPLPVTQRKPRRTYQNIQKKEEIPENKAEFTPPPPPPLPPRAVIPPSPVRVRLEEKFGKSVRKKTNVKKEIAMALASLYRKRKRKQKTKDAYVGDRRS
PTEQRPPPPPPPPPPSFFRIFKKSSKNKRVHSESPPPPPPPPPPVPSSSRSTKKKIQIPLPPSPPPPPPAQQRNSTTYRRPPLPTSVRNSYIENPSINSREKSPTPTIPP
PPPPPPSFKTTTDVKSTVGSDTVGSRSSETSRCGSPDPDNVNTSASGGAGVGSVFCPSPDVNVKAANFIARLRGEWRLEKMNSALENRDGNDPTCSRAHCENTDQMKLAG
LKSVENAHEESVWAATWVPATDTRPSLLLTGSLDETVKLWKSDELDLERTNTGHCLGVVSVAAHPSGFIAASASLDSFVRVFEVDSNSTIATLEAPPSEVWQMRFNPEGT
MLAVAGGGSASIKLWDTNTWKLAATLSIPRPEGPKPTDKTASKKFVLSVAWSIDGRRLACGSMDGTISVFDVARAKFLHHLEGHFMPVRSLVYSPVEPRLLFSASDDAHV
HMYDAEGKTLIGAMSGHSSWVLSVDASPDGAAVATGSSDRTVRLWDLNMRTAVQTMTNHSDQVWGVAFRPPGGVGVRSVRLASVSDDKSISLYDYS