CuGenDBv2

Clc05G04150 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc05G04150
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: GTD-binding domain-containing protein
Genome location: ClcChr05:2990569..2994451
RNA-Seq Expression: Clc05G04150
Synteny: Clc05G04150
Gene Ontology terms:
  GO:0006511 - ubiquitin-dependent protein catabolic process (biological process)
  GO:0016579 - protein deubiquitination (biological process)
  GO:0016021 - integral component of membrane (cellular component)
  GO:0004843 - thiol-dependent ubiquitin-specific protease activity (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
  GO:0080115 - myosin XI tail binding (molecular function)
InterPro domains: IPR007656 - GTD-binding domain


Homology
GenBank top hits (columns: e value, % identity, alignment)
XP_004134377.1 uncharacterized protein LOC101204513 [Cucumis sativus] | e value: 0.0e+00 | % identity: 82.39
Query:  MACEAIQLWTFNGLVAAFLDLGIAFLLLCASSLVFFTSKFLALFGLCLPCPCDGVFGNLSSDHCFQKLLVDRSSRKISSVLHSTRKKFPLNSVWDEPKCC
        MACEAI+LWTFNGLVAAFLDLGIAFLLL ASSLVFFTSKFLALFGLCLPCPCDG+FGNLSSDHCFQKLLVDRSSRKISSV+HSTR+KFPL+S+ D PKCC
Subjt:  MACEAIQLWTFNGLVAAFLDLGIAFLLLCASSLVFFTSKFLALFGLCLPCPCDGVFGNLSSDHCFQKLLVDRSSRKISSVLHSTRKKFPLNSVWDEPKCC

Query:  FKSMLVHERNVKEAHVELEGETSSSSSFKTRSQQGMIYRDFPSVNELHCGGGVDRRKVISVSPNDISQSDVELEDLSHSPSSFSGFGDHNTE-DGFFSVD
         KSMLVHERNVK   VELEGE S SSSFK RS Q M+Y D+PSVNELHCG G DRRKVIS S   ISQ+DVELEDLS SPSSFSGFG+ NTE DGFFSVD
Subjt:  FKSMLVHERNVKEAHVELEGETSSSSSFKTRSQQGMIYRDFPSVNELHCGGGVDRRKVISVSPNDISQSDVELEDLSHSPSSFSGFGDHNTE-DGFFSVD

Query:  SGDEREASSDNSDQYKVFPDLELDDSYDEKICAEMYEASVGEAGNRCRGELCLDGNESDTIKLLEQALEEEQTARATLYLELEKERSAAATAADEAMAMI
        SGDERE SSDNSDQYKVFPDLELDDS DEKICAEM EASV EAGN CR EL LDGNESDTIK LEQALEEEQ+ RA LYLELEKERSAAATAADEAMAMI
Subjt:  SGDEREASSDNSDQYKVFPDLELDDSYDEKICAEMYEASVGEAGNRCRGELCLDGNESDTIKLLEQALEEEQTARATLYLELEKERSAAATAADEAMAMI

Query:  LRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEIEAFRKSFFDDDGVDVDMLDSEFTPARTPSFPYPTEDPSHMLQCMNRS
        LRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEIEA R SFF+ DGV VDMLDSE TP R PSF YPTEDP     C+N  
Subjt:  LRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEIEAFRKSFFDDDGVDVDMLDSEFTPARTPSFPYPTEDPSHMLQCMNRS

Query:  TRDEQDSNYKIHSLQYEIPSVESRNLTFEFGEESPFIQADEIADAAKARGMLLHQVADNFKGSEEIDYELQGKGMVEDEKLYTVPGEVNELDPYLQSNES
                 K HSLQ+EIPSV S+ LTFEFGEESP I ADE ADAAKARGMLL QV D +KGSEEIDYELQGK MVEDE LY VPG+V EL+PYLQSNES
Subjt:  TRDEQDSNYKIHSLQYEIPSVESRNLTFEFGEESPFIQADEIADAAKARGMLLHQVADNFKGSEEIDYELQGKGMVEDEKLYTVPGEVNELDPYLQSNES

Query:  NGLGKVEKCTELIADEQEKVDEVSYDESALAKTILPCLEYNLEKIGDHQKMRTRDLDSVNATDPHPHDIHVIEDEARISNEAIANASEEPLVNSTSSIPV
        N LGKVEKCTELIADEQE V EVSYD  A AKT LPC E N    GDHQ  RTRDL SVN TDPH HDIHV+EDEA+ SNEA+ NASEEPLVN TS+IP 
Subjt:  NGLGKVEKCTELIADEQEKVDEVSYDESALAKTILPCLEYNLEKIGDHQKMRTRDLDSVNATDPHPHDIHVIEDEARISNEAIANASEEPLVNSTSSIPV

Query:  KCDSPSFSLLQSELEIPRTSSDASGRFPPMARSRSNSVRSELRRNSMSAVDYERSKIGNEVEWLRGRLKIVQEGREKLKFSVEYKENENNQLQLLENIAN
        KCDSPSFSLLQ+EL+  R+SSDASGRFPP+ARSRS+S+RS+LRRNSMSAVDYERSKIGNEVEWLRGRLKIVQEGREKLKFSVE+KE E+NQ QLLENI N
Subjt:  KCDSPSFSLLQSELEIPRTSSDASGRFPPMARSRSNSVRSELRRNSMSAVDYERSKIGNEVEWLRGRLKIVQEGREKLKFSVEYKENENNQLQLLENIAN

Query:  HLREIRQLTDPRKASLQAPLPPSSKDVSKKRCWRSSSLSIHRSS
          REIRQLTDP KASLQAPLPPSSKDVSKKRCWRSSSLS+HRSS
Subjt:  HLREIRQLTDPRKASLQAPLPPSSKDVSKKRCWRSSSLSIHRSS

XP_016898951.1 PREDICTED: ubiquitin carboxyl-terminal hydrolase 2 [Cucumis melo] | e value: 0.0e+00 | % identity: 82.49
Query:  MACEAIQLWTFNGLVAAFLDLGIAFLLLCASSLVFFTSKFLALFGLCLPCPCDGVFGNLSSDHCFQKLLVDRSSRKISSVLHSTRKKFPLNSVWDEPKCC
        MACEAI+LWTFNGLVAAFLDLGIAFLLLCASSLVFFTSKFLALFGLCLPCPCDG+FGNLS DHCFQKLLVD SSRKISSV+HSTR+KFPL+S+ DEPKCC
Subjt:  MACEAIQLWTFNGLVAAFLDLGIAFLLLCASSLVFFTSKFLALFGLCLPCPCDGVFGNLSSDHCFQKLLVDRSSRKISSVLHSTRKKFPLNSVWDEPKCC

Query:  FKSMLVHERNVKEAHVELEGETSSSSSFKTRSQQGMIYRDFPSVNELHCGGGVDRRKVISVSPNDISQSDVELEDLSHSPSSFSGFGDHNTE-DGFFSVD
        FKSMLVHER+VKEA V+LEGE S SSS K RS Q MIY D+PSVNELHC  G D RKVIS SPN I QSDVELEDLS SPSSFSGFGD NTE DGFFSVD
Subjt:  FKSMLVHERNVKEAHVELEGETSSSSSFKTRSQQGMIYRDFPSVNELHCGGGVDRRKVISVSPNDISQSDVELEDLSHSPSSFSGFGDHNTE-DGFFSVD

Query:  SGDEREASSDNSDQYKVFPDLELDDSYDEKICAEMYEASVGEAGNRCRGELCLDGNESDTIKLLEQALEEEQTARATLYLELEKERSAAATAADEAMAMI
        SGDEREASSDNSDQYKVFP LELDDS DEKICAEM EASV EAGN CR +LCLDGNESDTIK LEQALEEEQ+ RATLY+ELEKERSAAATAADEAMAMI
Subjt:  SGDEREASSDNSDQYKVFPDLELDDSYDEKICAEMYEASVGEAGNRCRGELCLDGNESDTIKLLEQALEEEQTARATLYLELEKERSAAATAADEAMAMI

Query:  LRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEIEAFRKSFFDDDGVDVDMLDSEFTPARTPSFPYPTEDPSHMLQCMNRS
        LRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKE+EA R SFF+ DGV VDMLDSE TP R PSF YPTEDP     C N S
Subjt:  LRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEIEAFRKSFFDDDGVDVDMLDSEFTPARTPSFPYPTEDPSHMLQCMNRS

Query:  TRDEQDSNYKIHSLQYEIPSVESRNLTFEFGEESPFIQADEIADAAKARGMLLHQVADNFKGSEEIDYELQGKGMVEDEKLYTVPGEVNELDPYLQSNES
                   HSLQ+EIPSVES+ LTFEF EESP I+ADEIADAAKAR +LL QV DNFKGSEEIDYELQGKGM+EDE LY VPG+V EL PYLQSNES
Subjt:  TRDEQDSNYKIHSLQYEIPSVESRNLTFEFGEESPFIQADEIADAAKARGMLLHQVADNFKGSEEIDYELQGKGMVEDEKLYTVPGEVNELDPYLQSNES

Query:  NGLGKVEKCTELIADEQEKVDEVSYDESALAKTILPCLEYNLEKIGDHQKMRTRDLDSVNATDPHPHDIHVIEDEARISNEAIANASEEPLVNSTSSIPV
        NGLGKV+KCTELIADEQEKVD+VSYD  ALAKTILPC E N    GDHQ  RTRDL SVN TDPHPHDIHV+EDEA+ SNEA+ N SEEPLVN TS+IPV
Subjt:  NGLGKVEKCTELIADEQEKVDEVSYDESALAKTILPCLEYNLEKIGDHQKMRTRDLDSVNATDPHPHDIHVIEDEARISNEAIANASEEPLVNSTSSIPV

Query:  KCDSPSFSLLQSELEIPRTSSDASGRFPPMARSRSNSVRSELRRNSMSAVDYERSKIGNEVEWLRGRLKIVQEGREKLKFSVEYKENENNQLQLLENIAN
        KCDSPSFSLLQSEL+I R+SSDA+GRFPP+ARSRS+S++S+LRRNSMSAVDYERSKIGNEVEWLRGRLKIVQEGREKLKFSVE+KE E+NQ QLLENI N
Subjt:  KCDSPSFSLLQSELEIPRTSSDASGRFPPMARSRSNSVRSELRRNSMSAVDYERSKIGNEVEWLRGRLKIVQEGREKLKFSVEYKENENNQLQLLENIAN

Query:  HLREIRQLTDPRKASLQAPLPPSSKDVSKKR
        HLREIRQLTDP KASL APLPPSSK    K+
Subjt:  HLREIRQLTDPRKASLQAPLPPSSKDVSKKR

XP_022980235.1 uncharacterized protein LOC111479667 isoform X1 [Cucurbita maxima] | e value: 0.0e+00 | % identity: 78.9
Query:  MACEAIQLWTFNGLVAAFLDLGIAFLLLCASSLVFFTSKFLALFGLCLPCPCDGVFGNLSSDHCFQKLLVDRSSRKISSVLHSTRKKFPLNSVWD-EPKC
        MACEAIQLWTFNGLVAAFLDLGIAFLLLCA+SLVFFTSKFLALFG CLPCPCDG+FG+L SDHCFQKLLVD SS+KISSVLHSTR+KFPL+S+WD EPKC
Subjt:  MACEAIQLWTFNGLVAAFLDLGIAFLLLCASSLVFFTSKFLALFGLCLPCPCDGVFGNLSSDHCFQKLLVDRSSRKISSVLHSTRKKFPLNSVWD-EPKC

Query:  CFKSMLVHERNVKEAHVELEGETSSSSSFKTRSQQGMIYRDFPSVNELHCGGGVDRRKVISVSPNDISQSDVELEDLSHSPSSFSGFGDHNTEDGFFSVD
        CFKSM VH+RNVKEA VE + E S  S FKTRS +GMIY D  ++NE    GGV  RK+ SVSPND+ QSDVELEDL HSPSSF GFGD+N EDGFFSVD
Subjt:  CFKSMLVHERNVKEAHVELEGETSSSSSFKTRSQQGMIYRDFPSVNELHCGGGVDRRKVISVSPNDISQSDVELEDLSHSPSSFSGFGDHNTEDGFFSVD

Query:  SGDEREASSDNSDQYKVFPDLELDDSYDEKICAEMYEASVGEAGNRCRGELCLDGNESDTIKLLEQALEEEQTARATLYLELEKERSAAATAADEAMAMI
        SGDE EAS DNS+QYKVFPDLELDDSYDEKICAEMY AS  EA N CRGE CLDGNESD IKLLEQ+LEEEQ ARATLYLELEKERSAAATAADEAMAMI
Subjt:  SGDEREASSDNSDQYKVFPDLELDDSYDEKICAEMYEASVGEAGNRCRGELCLDGNESDTIKLLEQALEEEQTARATLYLELEKERSAAATAADEAMAMI

Query:  LRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEIEAFRKSFFDDDGVDVDMLDSEFTPARTPSFPYPTEDPSHMLQCMNRS
        LRLQEEKASIEM+ARQYQRMIEEKTAYDAEEMSILKEILVRRE+EMHFL+KE+ AFR+SFFD+ GV VDMLD+EFTP   PS PYPTEDPSHMLQC+N S
Subjt:  LRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEIEAFRKSFFDDDGVDVDMLDSEFTPARTPSFPYPTEDPSHMLQCMNRS

Query:  TRDEQDSNYKIHSLQYEIPSVESRNLTFEFGEESPFIQADEIADAAKARGMLLHQVADNFKGSEEIDYELQGKGMVEDEKLYTVPGEVNELDPYLQSNES
          D+Q          +E+PSVESRNL FEFGEESP IQA E ADAAKARGMLLHQVADNF+G EEID ELQGKGMVED+ LY VPGEVNEL+PY +SN S
Subjt:  TRDEQDSNYKIHSLQYEIPSVESRNLTFEFGEESPFIQADEIADAAKARGMLLHQVADNFKGSEEIDYELQGKGMVEDEKLYTVPGEVNELDPYLQSNES

Query:  NGLGKVEKCTELIADEQEKVDEVSYDESALAKTILPCLEYNLEKIGDHQKMRTRDLDSVNATDPHPHDIHVIEDEARISNEAIANASEEPLVNSTSSIPV
        N LGKVE+CTE   DEQEKV + S D    A+T LPC+EYNLEK  DHQK  TRD  SVN TD  PHDIHVIEDEAR+ NEA ANA EE LVN +SSIPV
Subjt:  NGLGKVEKCTELIADEQEKVDEVSYDESALAKTILPCLEYNLEKIGDHQKMRTRDLDSVNATDPHPHDIHVIEDEARISNEAIANASEEPLVNSTSSIPV

Query:  KCDSPSFSLLQSELEIPRTSSDASGRFPPMARSRSNSVRSELRRNSMSAVDYERSKIGNEVEWLRGRLKIVQEGREKLKFSVEYKENENNQLQLLENIAN
         CDS SFSLLQ+EL+I R+SSDA+GRFPPMARSRSNS+RSELRRNSMSAVDYERSKIGNEVEWLRGRLKIVQE REK K SVE KE ENNQLQLLENI  
Subjt:  KCDSPSFSLLQSELEIPRTSSDASGRFPPMARSRSNSVRSELRRNSMSAVDYERSKIGNEVEWLRGRLKIVQEGREKLKFSVEYKENENNQLQLLENIAN

Query:  HLREIRQLTDPRKASLQAPLPPSSKDVSKKRCWRSSSLSIHRSS
             + L+DP KA+LQAPLPPSSKDVSKKRCWRSSSL IHRSS
Subjt:  HLREIRQLTDPRKASLQAPLPPSSKDVSKKRCWRSSSLSIHRSS

XP_038893600.1 uncharacterized protein LOC120082482 isoform X1 [Benincasa hispida] | e value: 0.0e+00 | % identity: 90.44
Query:  MACEAIQLWTFNGLVAAFLDLGIAFLLLCASSLVFFTSKFLALFGLCLPCPCDGVFGNLSSDHCFQKLLVDRSSRKISSVLHSTRKKFPLNSVWDEPKCC
        MACEAI+LWTFNGLVAAFLDLGIAFLLLCAS+LVFFTSKFLALFGLCLPCPCDG+FGNLSSDHCFQKLLVDRSSRKISSVL STRKKFPL+SVWDEPKCC
Subjt:  MACEAIQLWTFNGLVAAFLDLGIAFLLLCASSLVFFTSKFLALFGLCLPCPCDGVFGNLSSDHCFQKLLVDRSSRKISSVLHSTRKKFPLNSVWDEPKCC

Query:  FKSMLVHERNVKEAHVELEGETSSSSSFKTRSQQGMIYRDFPSVNELHCGGGVDRRKVISVSPNDISQSDVELEDLSHSPSSFSGFGDHNTEDGFFSVDS
        FKS+LVHERNVKEAHVELEGE S  SSFK+RS QGM+Y DFPSVN+L  GGG+D RKVIS S N+ISQSDVELEDLSHSPSSFSGFGD+NTEDGFFSVDS
Subjt:  FKSMLVHERNVKEAHVELEGETSSSSSFKTRSQQGMIYRDFPSVNELHCGGGVDRRKVISVSPNDISQSDVELEDLSHSPSSFSGFGDHNTEDGFFSVDS

Query:  GDEREASSDNSDQYKVFPDLELDDSYDEKICAEMYEASVGEAGNRCRGELCLDGNESDTIKLLEQALEEEQTARATLYLELEKERSAAATAADEAMAMIL
        GDEREAS DNSDQYKVFP+LELDDSYD KICAEMYEASV EAGNRCRGELCLDGNESDTIKLLEQALEEEQTARATLYLELEKERSAAATAADEAMAMIL
Subjt:  GDEREASSDNSDQYKVFPDLELDDSYDEKICAEMYEASVGEAGNRCRGELCLDGNESDTIKLLEQALEEEQTARATLYLELEKERSAAATAADEAMAMIL

Query:  RLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEIEAFRKSFFDDDGVDVDMLDSEFTPARTPSFPYPTEDPSHMLQCMNRST
        RLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKE+EAFRKSFFDDDG  VDMLDSEFTPARTP  PYPTEDPSHMLQC+NRS 
Subjt:  RLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEIEAFRKSFFDDDGVDVDMLDSEFTPARTPSFPYPTEDPSHMLQCMNRST

Query:  RDEQDSNYKIHSLQYEIPSVESRNLTFEFGEESPFIQADEIADAAKARGMLLHQVADNFKGSEEIDYELQGKGMVEDEKLYTVPGEVNELDPYLQSNESN
        RDE+ +NYKIHSLQY IPSVESRNLTFEFGEES  I ADE+A AAKARGMLLHQVADNFK SEEIDYELQGKGM+EDEKLY VPGEVNELDPYLQSNESN
Subjt:  RDEQDSNYKIHSLQYEIPSVESRNLTFEFGEESPFIQADEIADAAKARGMLLHQVADNFKGSEEIDYELQGKGMVEDEKLYTVPGEVNELDPYLQSNESN

Query:  GLGKVEKCTELIADEQEKVDEVSYDESALAKTILPCLEYNLEKIGDHQKMRTRDLDSVNATDPHPHDIHVIEDEARISNEAIANASEEPLVNSTSSIPVK
        GLGKVEKCTE+IADEQEKVDEVSYD+ AL KTI PCLEYNLEK GDHQKMRTRD+DSVN TDPHPHDIHV+EDEARI NEAIANASEEP VN TSSIPVK
Subjt:  GLGKVEKCTELIADEQEKVDEVSYDESALAKTILPCLEYNLEKIGDHQKMRTRDLDSVNATDPHPHDIHVIEDEARISNEAIANASEEPLVNSTSSIPVK

Query:  CDSPSFSLLQSELEIPRTSSDASGRFPPMARSRSNSVRSELRRNSMSAVDYERSKIGNEVEWLRGRLKIVQEGREKLKFSVEYKENENNQLQLLENIANH
        CDSPSF LLQSELEI RTSSDA+ RFPPMARSRSNS+RSELRRNSMSAVDYERSKIGNEVEWLRGRLKIVQEGREKLKFSVE+KE ENNQLQLLENI NH
Subjt:  CDSPSFSLLQSELEIPRTSSDASGRFPPMARSRSNSVRSELRRNSMSAVDYERSKIGNEVEWLRGRLKIVQEGREKLKFSVEYKENENNQLQLLENIANH

Query:  LREIRQLTDPRKASLQAPLPPSSKDVSKKRCWRSSSLSIHRSS
        LREI QL DP K +LQAPLPPSSKDVSKKRCWRSSSLS+HRSS
Subjt:  LREIRQLTDPRKASLQAPLPPSSKDVSKKRCWRSSSLSIHRSS

XP_038893608.1 uncharacterized protein LOC120082482 isoform X2 [Benincasa hispida] | e value: 0.0e+00 | % identity: 86.68
Query:  MACEAIQLWTFNGLVAAFLDLGIAFLLLCASSLVFFTSKFLALFGLCLPCPCDGVFGNLSSDHCFQKLLVDRSSRKISSVLHSTRKKFPLNSVWDEPKCC
        MACEAI+LWTFNGLVAAFLDLGIAFLLLCAS+LVFFTSKFLALFGLCLPCPCDG+FGNLSSDHCFQKLLVDRSSRKISSVL STRKKFPL+SVWDEPKCC
Subjt:  MACEAIQLWTFNGLVAAFLDLGIAFLLLCASSLVFFTSKFLALFGLCLPCPCDGVFGNLSSDHCFQKLLVDRSSRKISSVLHSTRKKFPLNSVWDEPKCC

Query:  FKSMLVHERNVKEAHVELEGETSSSSSFKTRSQQGMIYRDFPSVNELHCGGGVDRRKVISVSPNDISQSDVELEDLSHSPSSFSGFGDHNTEDGFFSVDS
        FKS+LVHERNVKEAHVELEGE S  SSFK+RS QGM+Y DFPSVN+L  GGG+D RKVIS S N+ISQSDVELEDLSHSPSSFSGFGD+NTEDGFFSVDS
Subjt:  FKSMLVHERNVKEAHVELEGETSSSSSFKTRSQQGMIYRDFPSVNELHCGGGVDRRKVISVSPNDISQSDVELEDLSHSPSSFSGFGDHNTEDGFFSVDS

Query:  GDEREASSDNSDQYKVFPDLELDDSYDEKICAEMYEASVGEAGNRCRGELCLDGNESDTIKLLEQALEEEQTARATLYLELEKERSAAATAADEAMAMIL
        G                               EMYEASV EAGNRCRGELCLDGNESDTIKLLEQALEEEQTARATLYLELEKERSAAATAADEAMAMIL
Subjt:  GDEREASSDNSDQYKVFPDLELDDSYDEKICAEMYEASVGEAGNRCRGELCLDGNESDTIKLLEQALEEEQTARATLYLELEKERSAAATAADEAMAMIL

Query:  RLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEIEAFRKSFFDDDGVDVDMLDSEFTPARTPSFPYPTEDPSHMLQCMNRST
        RLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKE+EAFRKSFFDDDG  VDMLDSEFTPARTP  PYPTEDPSHMLQC+NRS 
Subjt:  RLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEIEAFRKSFFDDDGVDVDMLDSEFTPARTPSFPYPTEDPSHMLQCMNRST

Query:  RDEQDSNYKIHSLQYEIPSVESRNLTFEFGEESPFIQADEIADAAKARGMLLHQVADNFKGSEEIDYELQGKGMVEDEKLYTVPGEVNELDPYLQSNESN
        RDE+ +NYKIHSLQY IPSVESRNLTFEFGEES  I ADE+A AAKARGMLLHQVADNFK SEEIDYELQGKGM+EDEKLY VPGEVNELDPYLQSNESN
Subjt:  RDEQDSNYKIHSLQYEIPSVESRNLTFEFGEESPFIQADEIADAAKARGMLLHQVADNFKGSEEIDYELQGKGMVEDEKLYTVPGEVNELDPYLQSNESN

Query:  GLGKVEKCTELIADEQEKVDEVSYDESALAKTILPCLEYNLEKIGDHQKMRTRDLDSVNATDPHPHDIHVIEDEARISNEAIANASEEPLVNSTSSIPVK
        GLGKVEKCTE+IADEQEKVDEVSYD+ AL KTI PCLEYNLEK GDHQKMRTRD+DSVN TDPHPHDIHV+EDEARI NEAIANASEEP VN TSSIPVK
Subjt:  GLGKVEKCTELIADEQEKVDEVSYDESALAKTILPCLEYNLEKIGDHQKMRTRDLDSVNATDPHPHDIHVIEDEARISNEAIANASEEPLVNSTSSIPVK

Query:  CDSPSFSLLQSELEIPRTSSDASGRFPPMARSRSNSVRSELRRNSMSAVDYERSKIGNEVEWLRGRLKIVQEGREKLKFSVEYKENENNQLQLLENIANH
        CDSPSF LLQSELEI RTSSDA+ RFPPMARSRSNS+RSELRRNSMSAVDYERSKIGNEVEWLRGRLKIVQEGREKLKFSVE+KE ENNQLQLLENI NH
Subjt:  CDSPSFSLLQSELEIPRTSSDASGRFPPMARSRSNSVRSELRRNSMSAVDYERSKIGNEVEWLRGRLKIVQEGREKLKFSVEYKENENNQLQLLENIANH

Query:  LREIRQLTDPRKASLQAPLPPSSKDVSKKRCWRSSSLSIHRSS
        LREI QL DP K +LQAPLPPSSKDVSKKRCWRSSSLS+HRSS
Subjt:  LREIRQLTDPRKASLQAPLPPSSKDVSKKRCWRSSSLSIHRSS

TrEMBL top hits (columns: e value, % identity, alignment)
A0A0A0L4N9 GTD-binding domain-containing protein | e value: 0.0e+00 | % identity: 82.39
Query:  MACEAIQLWTFNGLVAAFLDLGIAFLLLCASSLVFFTSKFLALFGLCLPCPCDGVFGNLSSDHCFQKLLVDRSSRKISSVLHSTRKKFPLNSVWDEPKCC
        MACEAI+LWTFNGLVAAFLDLGIAFLLL ASSLVFFTSKFLALFGLCLPCPCDG+FGNLSSDHCFQKLLVDRSSRKISSV+HSTR+KFPL+S+ D PKCC
Subjt:  MACEAIQLWTFNGLVAAFLDLGIAFLLLCASSLVFFTSKFLALFGLCLPCPCDGVFGNLSSDHCFQKLLVDRSSRKISSVLHSTRKKFPLNSVWDEPKCC

Query:  FKSMLVHERNVKEAHVELEGETSSSSSFKTRSQQGMIYRDFPSVNELHCGGGVDRRKVISVSPNDISQSDVELEDLSHSPSSFSGFGDHNTE-DGFFSVD
         KSMLVHERNVK   VELEGE S SSSFK RS Q M+Y D+PSVNELHCG G DRRKVIS S   ISQ+DVELEDLS SPSSFSGFG+ NTE DGFFSVD
Subjt:  FKSMLVHERNVKEAHVELEGETSSSSSFKTRSQQGMIYRDFPSVNELHCGGGVDRRKVISVSPNDISQSDVELEDLSHSPSSFSGFGDHNTE-DGFFSVD

Query:  SGDEREASSDNSDQYKVFPDLELDDSYDEKICAEMYEASVGEAGNRCRGELCLDGNESDTIKLLEQALEEEQTARATLYLELEKERSAAATAADEAMAMI
        SGDERE SSDNSDQYKVFPDLELDDS DEKICAEM EASV EAGN CR EL LDGNESDTIK LEQALEEEQ+ RA LYLELEKERSAAATAADEAMAMI
Subjt:  SGDEREASSDNSDQYKVFPDLELDDSYDEKICAEMYEASVGEAGNRCRGELCLDGNESDTIKLLEQALEEEQTARATLYLELEKERSAAATAADEAMAMI

Query:  LRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEIEAFRKSFFDDDGVDVDMLDSEFTPARTPSFPYPTEDPSHMLQCMNRS
        LRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEIEA R SFF+ DGV VDMLDSE TP R PSF YPTEDP     C+N  
Subjt:  LRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEIEAFRKSFFDDDGVDVDMLDSEFTPARTPSFPYPTEDPSHMLQCMNRS

Query:  TRDEQDSNYKIHSLQYEIPSVESRNLTFEFGEESPFIQADEIADAAKARGMLLHQVADNFKGSEEIDYELQGKGMVEDEKLYTVPGEVNELDPYLQSNES
                 K HSLQ+EIPSV S+ LTFEFGEESP I ADE ADAAKARGMLL QV D +KGSEEIDYELQGK MVEDE LY VPG+V EL+PYLQSNES
Subjt:  TRDEQDSNYKIHSLQYEIPSVESRNLTFEFGEESPFIQADEIADAAKARGMLLHQVADNFKGSEEIDYELQGKGMVEDEKLYTVPGEVNELDPYLQSNES

Query:  NGLGKVEKCTELIADEQEKVDEVSYDESALAKTILPCLEYNLEKIGDHQKMRTRDLDSVNATDPHPHDIHVIEDEARISNEAIANASEEPLVNSTSSIPV
        N LGKVEKCTELIADEQE V EVSYD  A AKT LPC E N    GDHQ  RTRDL SVN TDPH HDIHV+EDEA+ SNEA+ NASEEPLVN TS+IP 
Subjt:  NGLGKVEKCTELIADEQEKVDEVSYDESALAKTILPCLEYNLEKIGDHQKMRTRDLDSVNATDPHPHDIHVIEDEARISNEAIANASEEPLVNSTSSIPV

Query:  KCDSPSFSLLQSELEIPRTSSDASGRFPPMARSRSNSVRSELRRNSMSAVDYERSKIGNEVEWLRGRLKIVQEGREKLKFSVEYKENENNQLQLLENIAN
        KCDSPSFSLLQ+EL+  R+SSDASGRFPP+ARSRS+S+RS+LRRNSMSAVDYERSKIGNEVEWLRGRLKIVQEGREKLKFSVE+KE E+NQ QLLENI N
Subjt:  KCDSPSFSLLQSELEIPRTSSDASGRFPPMARSRSNSVRSELRRNSMSAVDYERSKIGNEVEWLRGRLKIVQEGREKLKFSVEYKENENNQLQLLENIAN

Query:  HLREIRQLTDPRKASLQAPLPPSSKDVSKKRCWRSSSLSIHRSS
          REIRQLTDP KASLQAPLPPSSKDVSKKRCWRSSSLS+HRSS
Subjt:  HLREIRQLTDPRKASLQAPLPPSSKDVSKKRCWRSSSLSIHRSS

A0A1S4DSI6 Ubiquitinyl hydrolase 1 | e value: 0.0e+00 | % identity: 82.49
Query:  MACEAIQLWTFNGLVAAFLDLGIAFLLLCASSLVFFTSKFLALFGLCLPCPCDGVFGNLSSDHCFQKLLVDRSSRKISSVLHSTRKKFPLNSVWDEPKCC
        MACEAI+LWTFNGLVAAFLDLGIAFLLLCASSLVFFTSKFLALFGLCLPCPCDG+FGNLS DHCFQKLLVD SSRKISSV+HSTR+KFPL+S+ DEPKCC
Subjt:  MACEAIQLWTFNGLVAAFLDLGIAFLLLCASSLVFFTSKFLALFGLCLPCPCDGVFGNLSSDHCFQKLLVDRSSRKISSVLHSTRKKFPLNSVWDEPKCC

Query:  FKSMLVHERNVKEAHVELEGETSSSSSFKTRSQQGMIYRDFPSVNELHCGGGVDRRKVISVSPNDISQSDVELEDLSHSPSSFSGFGDHNTE-DGFFSVD
        FKSMLVHER+VKEA V+LEGE S SSS K RS Q MIY D+PSVNELHC  G D RKVIS SPN I QSDVELEDLS SPSSFSGFGD NTE DGFFSVD
Subjt:  FKSMLVHERNVKEAHVELEGETSSSSSFKTRSQQGMIYRDFPSVNELHCGGGVDRRKVISVSPNDISQSDVELEDLSHSPSSFSGFGDHNTE-DGFFSVD

Query:  SGDEREASSDNSDQYKVFPDLELDDSYDEKICAEMYEASVGEAGNRCRGELCLDGNESDTIKLLEQALEEEQTARATLYLELEKERSAAATAADEAMAMI
        SGDEREASSDNSDQYKVFP LELDDS DEKICAEM EASV EAGN CR +LCLDGNESDTIK LEQALEEEQ+ RATLY+ELEKERSAAATAADEAMAMI
Subjt:  SGDEREASSDNSDQYKVFPDLELDDSYDEKICAEMYEASVGEAGNRCRGELCLDGNESDTIKLLEQALEEEQTARATLYLELEKERSAAATAADEAMAMI

Query:  LRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEIEAFRKSFFDDDGVDVDMLDSEFTPARTPSFPYPTEDPSHMLQCMNRS
        LRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKE+EA R SFF+ DGV VDMLDSE TP R PSF YPTEDP     C N S
Subjt:  LRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEIEAFRKSFFDDDGVDVDMLDSEFTPARTPSFPYPTEDPSHMLQCMNRS

Query:  TRDEQDSNYKIHSLQYEIPSVESRNLTFEFGEESPFIQADEIADAAKARGMLLHQVADNFKGSEEIDYELQGKGMVEDEKLYTVPGEVNELDPYLQSNES
                   HSLQ+EIPSVES+ LTFEF EESP I+ADEIADAAKAR +LL QV DNFKGSEEIDYELQGKGM+EDE LY VPG+V EL PYLQSNES
Subjt:  TRDEQDSNYKIHSLQYEIPSVESRNLTFEFGEESPFIQADEIADAAKARGMLLHQVADNFKGSEEIDYELQGKGMVEDEKLYTVPGEVNELDPYLQSNES

Query:  NGLGKVEKCTELIADEQEKVDEVSYDESALAKTILPCLEYNLEKIGDHQKMRTRDLDSVNATDPHPHDIHVIEDEARISNEAIANASEEPLVNSTSSIPV
        NGLGKV+KCTELIADEQEKVD+VSYD  ALAKTILPC E N    GDHQ  RTRDL SVN TDPHPHDIHV+EDEA+ SNEA+ N SEEPLVN TS+IPV
Subjt:  NGLGKVEKCTELIADEQEKVDEVSYDESALAKTILPCLEYNLEKIGDHQKMRTRDLDSVNATDPHPHDIHVIEDEARISNEAIANASEEPLVNSTSSIPV

Query:  KCDSPSFSLLQSELEIPRTSSDASGRFPPMARSRSNSVRSELRRNSMSAVDYERSKIGNEVEWLRGRLKIVQEGREKLKFSVEYKENENNQLQLLENIAN
        KCDSPSFSLLQSEL+I R+SSDA+GRFPP+ARSRS+S++S+LRRNSMSAVDYERSKIGNEVEWLRGRLKIVQEGREKLKFSVE+KE E+NQ QLLENI N
Subjt:  KCDSPSFSLLQSELEIPRTSSDASGRFPPMARSRSNSVRSELRRNSMSAVDYERSKIGNEVEWLRGRLKIVQEGREKLKFSVEYKENENNQLQLLENIAN

Query:  HLREIRQLTDPRKASLQAPLPPSSKDVSKKR
        HLREIRQLTDP KASL APLPPSSK    K+
Subjt:  HLREIRQLTDPRKASLQAPLPPSSKDVSKKR

A0A6J1EFQ3 uncharacterized protein LOC111432110 isoform X1 | e value: 6.9e-305 | % identity: 77.21
Query:  MACEAIQLWTFNGLVAAFLDLGIAFLLLCASSLVFFTSKFLALFGLCLPCPCDGVFGNLSSDHCFQKLLVDRSSRKISSVLHSTRKKFPLNSVWD-EPKC
        MACEA+QLWTFNGLVAAFLDLGIAFLLLCA+SLVFFTSKFLALFG CLPCPCDG+FGNL SDHCFQKLLVD SS++ISSVLHSTR+KFPL+S+WD EPKC
Subjt:  MACEAIQLWTFNGLVAAFLDLGIAFLLLCASSLVFFTSKFLALFGLCLPCPCDGVFGNLSSDHCFQKLLVDRSSRKISSVLHSTRKKFPLNSVWD-EPKC

Query:  CFKSMLVHERNVKEAHVELEGETSSSSSFKTRSQQGMIYRDFPSVNELHCGGGVDRRKVISVSPNDISQSDVELEDLSHSPSSFSGFGDHNTEDGFFSVD
        CFKSM VH+RN KE  VE +GE S  S FKTRS +GMIY D  +VNE    GGV  RK+ SVSPND+ QSDVELEDL HSPSS  GFGD+N EDGFFSVD
Subjt:  CFKSMLVHERNVKEAHVELEGETSSSSSFKTRSQQGMIYRDFPSVNELHCGGGVDRRKVISVSPNDISQSDVELEDLSHSPSSFSGFGDHNTEDGFFSVD

Query:  SGDEREASSDNSDQYKVFPDLELDDSYDEKICAEMYEASVGEAGNRCRGELCLDGNESDTIKLLEQALEEEQTARATLYLELEKERSAAATAADEAMAMI
        SGDE EAS DNS+QYKVFPDLELDDSYDEKICAEMY AS  E  N CRGELCLDGNESD IKLL Q+LEEEQ ARATLYLELEKERSAAATAADEAMAMI
Subjt:  SGDEREASSDNSDQYKVFPDLELDDSYDEKICAEMYEASVGEAGNRCRGELCLDGNESDTIKLLEQALEEEQTARATLYLELEKERSAAATAADEAMAMI

Query:  LRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEIEAFRKSFFDDDGVDVDMLDSEFTPARTPSFPYPTEDPSHMLQCMNRS
        LRLQEEKA IEM+ARQYQRMIEEKTAYDAEEMSILKEILVRRE+E HFL+KE+EAFRKS FD+DGV VDMLDSEFTP   PS P PTEDPSH+LQC+N S
Subjt:  LRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEIEAFRKSFFDDDGVDVDMLDSEFTPARTPSFPYPTEDPSHMLQCMNRS

Query:  TRDEQDSNYKIHSLQYEIPSVESRNLTFEFGEESPFIQADEIADAAKARGMLLHQVADNFKGSEEIDYELQGKGMVEDEKLYTVPGEVNELDPYLQSNE-
          D+Q          +E+PSVESRNL FEFGEESP IQA E ADAAKA G+LLHQVAD F+ SEEID ELQGKGMVED+ +Y VPGEVNEL+PY +SN  
Subjt:  TRDEQDSNYKIHSLQYEIPSVESRNLTFEFGEESPFIQADEIADAAKARGMLLHQVADNFKGSEEIDYELQGKGMVEDEKLYTVPGEVNELDPYLQSNE-

Query:  -SNGLGKVEKCTELIADEQEKVDEVSYDESALAKTILPCLEYNLEKIGDHQKMRTRDLDSVNATDPHPHDIHVIEDEARISNEAIANASEEPLVNSTSSI
         SNGLGKVE+CTEL ADEQEKV + S+D  A A+  LPC+EYNLEK G+HQK  TRD  SVN TD  PHDIHVIEDE  + NEA ANA +E  VN  S I
Subjt:  -SNGLGKVEKCTELIADEQEKVDEVSYDESALAKTILPCLEYNLEKIGDHQKMRTRDLDSVNATDPHPHDIHVIEDEARISNEAIANASEEPLVNSTSSI

Query:  PVKCDSPSFSLLQSELEIPRTSSDASGRFPPMARSRSNSVRSELRRNSMSAVDYERSKIGNEVEWLRGRLKIVQEGREKLKFSVEYKENENNQLQLLENI
        PV CDS SFSLLQ+ L+I R+SSDA+GRFPPM RSRSNS+R ELRRNSMSAVDYERSKIGNEVEWLRGRLKIVQE REKLKFSVE KE E NQLQLLENI
Subjt:  PVKCDSPSFSLLQSELEIPRTSSDASGRFPPMARSRSNSVRSELRRNSMSAVDYERSKIGNEVEWLRGRLKIVQEGREKLKFSVEYKENENNQLQLLENI

Query:  ANHLREIRQLTDPRKASLQAPLPPSSKDVSKKRCWRSSSLSIHRSS
               + L+DP KA+LQAPLPPSSKDVSKKRCWRSSSLSIHRSS
Subjt:  ANHLREIRQLTDPRKASLQAPLPPSSKDVSKKRCWRSSSLSIHRSS

A0A6J1FVE4 probable myosin-binding protein 5 | e value: 4.3e-307 | % identity: 76.84
Query:  MACEAIQLWTFNGLVAAFLDLGIAFLLLCASSLVFFTSKFLALFGLCLPCPCDGVFGNLSSDHCFQKLLVDRSSRKISSVLHSTRKKFPLNSVWD-EPKC
        MACEAIQ WTFNGLVAAFLDLGIA+L+LCASSLVFFTSKFLALFGLCLPCPCDG+FGNLSSDHCF K+LVDRSS+K+SSV+HS R+K PLNS+ D EPKC
Subjt:  MACEAIQLWTFNGLVAAFLDLGIAFLLLCASSLVFFTSKFLALFGLCLPCPCDGVFGNLSSDHCFQKLLVDRSSRKISSVLHSTRKKFPLNSVWD-EPKC

Query:  CFKSMLVHERNVKEAHVELEGETSSSSSFKTRSQQGMIYRDFPSVNELHCGGGVDRRKVISVSPNDISQSDVELEDLSHSPSSFSGFGDHNTEDGFFSVD
        CFKS L+HERN  EAHVE EGE S  S FKTRS Q MIY DF SV E  C G VD +KV SVSPNDISQ D  +EDLS SPSSFSGFGD+NTEDGFFSVD
Subjt:  CFKSMLVHERNVKEAHVELEGETSSSSSFKTRSQQGMIYRDFPSVNELHCGGGVDRRKVISVSPNDISQSDVELEDLSHSPSSFSGFGDHNTEDGFFSVD

Query:  SGDER-EASSDNSDQYKVFPDLELDDSYDEKICAEMYEASVGEAGNRCRGELCLDGNESDTIKLLEQALEEEQTARATLYLELEKERSAAATAADEAMAM
        SGDER E+SSD+SD+ KVFP        DEKI AE + AS  EAGN CRGELCLDGNESDTIKLLEQALEEEQ ARATLYLELEKERSAAATA DEAMAM
Subjt:  SGDER-EASSDNSDQYKVFPDLELDDSYDEKICAEMYEASVGEAGNRCRGELCLDGNESDTIKLLEQALEEEQTARATLYLELEKERSAAATAADEAMAM

Query:  ILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEIEAFRKSFFDDDGVDVDMLDSEFTPARTPSFPYPTEDPSHMLQCMNR
        ILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKE+EAF+KSFF+DDGVDVDMLD E TP   PSF Y ++ PSHMLQC++R
Subjt:  ILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEIEAFRKSFFDDDGVDVDMLDSEFTPARTPSFPYPTEDPSHMLQCMNR

Query:  STRDEQDSNYKIHSLQYEIPSVESRNLTFEFGEESPFIQADEIADAAKARGMLLHQVADNFKGSEEIDYELQGKGMVEDEKLYTVPGEVNELDPYLQSNE
        S RD+QD+NY   S QYEIPS+ESR LTFEF +ESPFI +DE ADAA+A GMLLHQ ADNF   EE D ELQ +GMVEDE LY + GEVNEL+PYLQSN 
Subjt:  STRDEQDSNYKIHSLQYEIPSVESRNLTFEFGEESPFIQADEIADAAKARGMLLHQVADNFKGSEEIDYELQGKGMVEDEKLYTVPGEVNELDPYLQSNE

Query:  SNGLGKVEKCTEL--IADEQEKVDEVSYDESALAKTILPCLEYNLEKIGDHQKMRTRDLDSVNATDPHPHDIHVIEDEARISNEAIANASEEPLVNSTSS
        SNGL KVEK TEL  IADE+EKVDEVSYD  A AKTI PC+EYNLEK GDHQK + +DLD +   DP  HDIHVI++EA   ++  A+AS+EP++N TSS
Subjt:  SNGLGKVEKCTEL--IADEQEKVDEVSYDESALAKTILPCLEYNLEKIGDHQKMRTRDLDSVNATDPHPHDIHVIEDEARISNEAIANASEEPLVNSTSS

Query:  IPVKCDSPSFSLLQSELEIPRTSSDASGRFPPMARSRSNSVRSELRRNSMSAVDYERSKIGNEVEWLRGRLKIVQEGREKLKFSVEYKENENNQLQLLEN
         P  C SPSFSLLQSEL+I R++SDASGRFPP ARSRSN +RS+LRRNSMSAVDYERSKIG+EVE LR RLKIVQE REKL+FS+E+K+ ENNQLQLLE+
Subjt:  IPVKCDSPSFSLLQSELEIPRTSSDASGRFPPMARSRSNSVRSELRRNSMSAVDYERSKIGNEVEWLRGRLKIVQEGREKLKFSVEYKENENNQLQLLEN

Query:  IANHLREIRQLTDPRKASLQAPLPPSSKDVSKKRCWRSSSLSIHRSS
        I NHLREIR LTDP KA LQAP PPSSK VSKKRCWRSSSLSIHRSS
Subjt:  IANHLREIRQLTDPRKASLQAPLPPSSKDVSKKRCWRSSSLSIHRSS

A0A6J1IVQ3 uncharacterized protein LOC111479667 isoform X1 | e value: 0.0e+00 | % identity: 78.9
Query:  MACEAIQLWTFNGLVAAFLDLGIAFLLLCASSLVFFTSKFLALFGLCLPCPCDGVFGNLSSDHCFQKLLVDRSSRKISSVLHSTRKKFPLNSVWD-EPKC
        MACEAIQLWTFNGLVAAFLDLGIAFLLLCA+SLVFFTSKFLALFG CLPCPCDG+FG+L SDHCFQKLLVD SS+KISSVLHSTR+KFPL+S+WD EPKC
Subjt:  MACEAIQLWTFNGLVAAFLDLGIAFLLLCASSLVFFTSKFLALFGLCLPCPCDGVFGNLSSDHCFQKLLVDRSSRKISSVLHSTRKKFPLNSVWD-EPKC

Query:  CFKSMLVHERNVKEAHVELEGETSSSSSFKTRSQQGMIYRDFPSVNELHCGGGVDRRKVISVSPNDISQSDVELEDLSHSPSSFSGFGDHNTEDGFFSVD
        CFKSM VH+RNVKEA VE + E S  S FKTRS +GMIY D  ++NE    GGV  RK+ SVSPND+ QSDVELEDL HSPSSF GFGD+N EDGFFSVD
Subjt:  CFKSMLVHERNVKEAHVELEGETSSSSSFKTRSQQGMIYRDFPSVNELHCGGGVDRRKVISVSPNDISQSDVELEDLSHSPSSFSGFGDHNTEDGFFSVD

Query:  SGDEREASSDNSDQYKVFPDLELDDSYDEKICAEMYEASVGEAGNRCRGELCLDGNESDTIKLLEQALEEEQTARATLYLELEKERSAAATAADEAMAMI
        SGDE EAS DNS+QYKVFPDLELDDSYDEKICAEMY AS  EA N CRGE CLDGNESD IKLLEQ+LEEEQ ARATLYLELEKERSAAATAADEAMAMI
Subjt:  SGDEREASSDNSDQYKVFPDLELDDSYDEKICAEMYEASVGEAGNRCRGELCLDGNESDTIKLLEQALEEEQTARATLYLELEKERSAAATAADEAMAMI

Query:  LRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEIEAFRKSFFDDDGVDVDMLDSEFTPARTPSFPYPTEDPSHMLQCMNRS
        LRLQEEKASIEM+ARQYQRMIEEKTAYDAEEMSILKEILVRRE+EMHFL+KE+ AFR+SFFD+ GV VDMLD+EFTP   PS PYPTEDPSHMLQC+N S
Subjt:  LRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEIEAFRKSFFDDDGVDVDMLDSEFTPARTPSFPYPTEDPSHMLQCMNRS

Query:  TRDEQDSNYKIHSLQYEIPSVESRNLTFEFGEESPFIQADEIADAAKARGMLLHQVADNFKGSEEIDYELQGKGMVEDEKLYTVPGEVNELDPYLQSNES
          D+Q          +E+PSVESRNL FEFGEESP IQA E ADAAKARGMLLHQVADNF+G EEID ELQGKGMVED+ LY VPGEVNEL+PY +SN S
Subjt:  TRDEQDSNYKIHSLQYEIPSVESRNLTFEFGEESPFIQADEIADAAKARGMLLHQVADNFKGSEEIDYELQGKGMVEDEKLYTVPGEVNELDPYLQSNES

Query:  NGLGKVEKCTELIADEQEKVDEVSYDESALAKTILPCLEYNLEKIGDHQKMRTRDLDSVNATDPHPHDIHVIEDEARISNEAIANASEEPLVNSTSSIPV
        N LGKVE+CTE   DEQEKV + S D    A+T LPC+EYNLEK  DHQK  TRD  SVN TD  PHDIHVIEDEAR+ NEA ANA EE LVN +SSIPV
Subjt:  NGLGKVEKCTELIADEQEKVDEVSYDESALAKTILPCLEYNLEKIGDHQKMRTRDLDSVNATDPHPHDIHVIEDEARISNEAIANASEEPLVNSTSSIPV

Query:  KCDSPSFSLLQSELEIPRTSSDASGRFPPMARSRSNSVRSELRRNSMSAVDYERSKIGNEVEWLRGRLKIVQEGREKLKFSVEYKENENNQLQLLENIAN
         CDS SFSLLQ+EL+I R+SSDA+GRFPPMARSRSNS+RSELRRNSMSAVDYERSKIGNEVEWLRGRLKIVQE REK K SVE KE ENNQLQLLENI  
Subjt:  KCDSPSFSLLQSELEIPRTSSDASGRFPPMARSRSNSVRSELRRNSMSAVDYERSKIGNEVEWLRGRLKIVQEGREKLKFSVEYKENENNQLQLLENIAN

Query:  HLREIRQLTDPRKASLQAPLPPSSKDVSKKRCWRSSSLSIHRSS
             + L+DP KA+LQAPLPPSSKDVSKKRCWRSSSL IHRSS
Subjt:  HLREIRQLTDPRKASLQAPLPPSSKDVSKKRCWRSSSLSIHRSS

SwissProt top hits (columns: e value, % identity, alignment)
F4HVS6 Probable myosin-binding protein 6 | e value: 2.4e-12 | % identity: 32.34
Query:  VSPNDISQSDVELEDL--SHSPSSFSGFGDHNTEDGFFSVDSGDEREASSDNSDQYKVFPDLELDDSYDEKICAEMYEASVGEAGNRCRGELCLDGNESD
        VS N +S+++ E +D+    +PS   G       + FF +   D    S+ NS ++ V        S  + +  +   AS              D     
Subjt:  VSPNDISQSDVELEDL--SHSPSSFSGFGDHNTEDGFFSVDSGDEREASSDNSDQYKVFPDLELDDSYDEKICAEMYEASVGEAGNRCRGELCLDGNESD

Query:  TIKLLEQALEEEQTARATLYLELEKERSAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEIEAFRKS
         +  L++ +  ++ +   LY+EL++ERSA+A AA+EAMAMI RLQ EKA+++M+A QYQRM++E+  YD E +  +   L +RE EM  LE E E +R+ 
Subjt:  TIKLLEQALEEEQTARATLYLELEKERSAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEIEAFRKS

Query:  F
        +
Subjt:  F

F4HXQ7 Myosin-binding protein 1 | e value: 1.8e-12 | % identity: 50
Query:  LEQALEEEQTARATLYLELEKERSAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEIEAFR
        L++ ++ ++     LY ELE+ERSA+A A ++AMAMI RLQEEKAS +M+A Q  RM+EE+  YD E +  L ++LV RE+ +  LE EIE FR
Subjt:  LEQALEEEQTARATLYLELEKERSAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEIEAFR

Q0WNW4 Myosin-binding protein 3 | e value: 4.3e-17 | % identity: 48.54
Query:  GNESDTIKLLEQALEEEQTARATLYLELEKERSAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEIE
        G+   TI+ L + +  EQ A   LY ELE+ERSA+A +A++ MAMI RLQEEKA ++M+A QYQRM+EE+  YD E + +L  ++V+RE+E   L++E+E
Subjt:  GNESDTIKLLEQALEEEQTARATLYLELEKERSAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEIE

Query:  AFR
         +R
Subjt:  AFR

Q9CAC4 Myosin-binding protein 2 | e value: 1.2e-16 | % identity: 52.53
Query:  TIKLLEQALEEEQTARATLYLELEKERSAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEIEAFRK
        T+  L+  L+EE+ A   LY ELE ER+A+A AA E MAMI RL EEKA+++M+A QYQRM+EE+  +D E + +L E++V RE+E   LEKE+E +RK
Subjt:  TIKLLEQALEEEQTARATLYLELEKERSAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEIEAFRK

Q9LMC8 Probable myosin-binding protein 5 | e value: 9.8e-14 | % identity: 27.47
Query:  EAHVELEGETSSSSSFKTRSQQ----GMIYR---DFPSVNELHCGGGVDRRKVISVSPNDISQSDVELEDLSHSPSSFSGFGDHNTEDGFFSVDSGDERE
        E   ++E  T++S+    R Q+    G I +   D P  N     G       +S +   +  S+++  DL     + +  G         S+D+ D+R 
Subjt:  EAHVELEGETSSSSSFKTRSQQ----GMIYR---DFPSVNELHCGGGVDRRKVISVSPNDISQSDVELEDLSHSPSSFSGFGDHNTEDGFFSVDSGDERE

Query:  ASSDNSDQYKVFPDLELDDSYDEK---ICAEMYEASVGEAGNRCRGELCLDGNESDTIKLLEQALEEEQTARATLYLELEKERSAAATAADEAMAMILRL
         S         F  + L DS           M ++ V + G        LDG+    ++ L + +  ++ +   LY+EL++ERSA+A AA+ AMAMI RL
Subjt:  ASSDNSDQYKVFPDLELDDSYDEK---ICAEMYEASVGEAGNRCRGELCLDGNESDTIKLLEQALEEEQTARATLYLELEKERSAAATAADEAMAMILRL

Query:  QEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEIEAFRKSF---FDDDGVDVDMLDSEFTPARTPSFPYPTEDPSHMLQCMNRS
        Q EKA+++M+A QYQRM++E+  YD E +  +  +LV+RE EM  LE  IE +R  +    ++ G   + LD E  P      P  + +    L+ M  S
Subjt:  QEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEIEAFRKSF---FDDDGVDVDMLDSEFTPARTPSFPYPTEDPSHMLQCMNRS

Query:  TRDEQDSNYKIHSLQYEIPSVESRNLTFEFGEESPFIQADEIADAAKARGMLLHQVADNFKGSE
          D   +N  +      I   E  N + +        +  E  +A +++G LL Q++D    SE
Subjt:  TRDEQDSNYKIHSLQYEIPSVESRNLTFEFGEESPFIQADEIADAAKARGMLLHQVADNFKGSE

Arabidopsis top hits (columns: e value, % identity, alignment)
AT1G04890.1 Protein of unknown function, DUF593 | e value: 3.4e-30 | % identity: 31.21
Query:  ESDTIKLLEQALEEEQTARATLYLELEKERSAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEIEAF
        E  +++ LE+ L+EE+ ARAT+ +EL+KERSAAA+AADEAMAMI RLQ+EKA+IEM+ARQ+QR++EE++ +DAEEM ILK+IL+RRERE HFLEKE+EA+
Subjt:  ESDTIKLLEQALEEEQTARATLYLELEKERSAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEIEAF

Query:  RKSFFDDDGVDVDMLDSEFTPARTPSFPYPTEDPSHMLQCMNRSTRDEQDSNYKIHSLQYEIPSVESRNLTFEFGEESPFIQADEIADAAKARGMLLHQV
        R+   + + ++  ++  +  P           +P H                                                +  D  + R +L+   
Subjt:  RKSFFDDDGVDVDMLDSEFTPARTPSFPYPTEDPSHMLQCMNRSTRDEQDSNYKIHSLQYEIPSVESRNLTFEFGEESPFIQADEIADAAKARGMLLHQV

Query:  ADNFKGSEEIDYELQGKGMVEDEKLYTVPGEVNELDPYLQSNESNGLGKVEKCTELIADEQEKVDEVSYDESALAKTILPCLEYNLEKIGDHQKMRTRDL
               +E+D                  G V ++ PY +       G  +K  +L   +     EV+Y                          R R  
Subjt:  ADNFKGSEEIDYELQGKGMVEDEKLYTVPGEVNELDPYLQSNESNGLGKVEKCTELIADEQEKVDEVSYDESALAKTILPCLEYNLEKIGDHQKMRTRDL

Query:  DSVNATDPHPHDIHVIEDEARISNEAIANASEEPLVNSTSSIPVKCDSPSFSLLQSELEIPRTSSDASGRFPPMARSRSNSVRSE-LRRNSMSAVDYERS
                   D+++++DE         N S++  +  +S        P  SL ++ + +    S  + + PP+ R R  S+ S   RR SMSAVDYER 
Subjt:  DSVNATDPHPHDIHVIEDEARISNEAIANASEEPLVNSTSSIPVKCDSPSFSLLQSELEIPRTSSDASGRFPPMARSRSNSVRSE-LRRNSMSAVDYERS

Query:  KIGNEVEWLRGRLKIVQEGREKL
        KI NEVE LR RLK VQE RE+L
Subjt:  KIGNEVEWLRGRLKIVQEGREKL

AT4G13160.1 Protein of unknown function, DUF593 | e value: 4.2e-28 | % identity: 64.15
Query:  DTIKLLEQALEEEQTARATLYLELEKERSAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEIEAFRK
        D ++LLE A+E+E+ A+A L +ELE+ER+A+A+AADEAMAMILRLQ +KAS+EM+ +QY+RMI+EK AYD EEM+ILKEIL +RERE HFLEKE+E ++ 
Subjt:  DTIKLLEQALEEEQTARATLYLELEKERSAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEIEAFRK

Query:  SFFDDD
           DDD
Subjt:  SFFDDD

AT4G13160.1 Protein of unknown function, DUF593 | e value: 2.9e-05 | % identity: 45.07
Query:  TFNGLVAAFLDLGIAFLLLCASSLVFFTSKFLALFGLCLPCPCDGVFGNLSSDHCFQKLLVDRSSRKISSV
        TF G++ AF++L  A+ LLC S+ VF TSK L    L +PC      G  +SD C QKLL D   R I  V
Subjt:  TFNGLVAAFLDLGIAFLLLCASSLVFFTSKFLALFGLCLPCPCDGVFGNLSSDHCFQKLLVDRSSRKISSV

AT4G13630.1 Protein of unknown function, DUF593 | e value: 1.5e-62 | % identity: 34.27
Query:  MACEAIQLWTFNGLVAAFLDLGIAFLLLCASSLVFFTSKFLALFGLCLPCPCDGVFGNLSSDHCFQKLLVDRSSRKISSVLHSTRKKFPLNSV-WDEPKC
        M C+ ++ WTF GLVAAF+DL +AF LLCAS +V+ TSKFL LFGL LPCPCDG++       CFQ+ L +   +KISSV  S + + P +S+ ++  K 
Subjt:  MACEAIQLWTFNGLVAAFLDLGIAFLLLCASSLVFFTSKFLALFGLCLPCPCDGVFGNLSSDHCFQKLLVDRSSRKISSVLHSTRKKFPLNSV-WDEPKC

Query:  CFKSMLVHERNVKEAHVELEGETSSSSSFKTRSQQGMIYRDFPSVNEL-HCGGGVDRRKVISVSPNDISQSDVELEDLSHSPSSFSGFGDHNTED-GFFS
                +R  +   V+LE E SS++               PSV +  +   G D     S+         V+ + LS   S + GF +H     G  S
Subjt:  CFKSMLVHERNVKEAHVELEGETSSSSSFKTRSQQGMIYRDFPSVNEL-HCGGGVDRRKVISVSPNDISQSDVELEDLSHSPSSFSGFGDHNTED-GFFS

Query:  VD-SGDEREASSDNS-DQYKVFPDLELDDSYDEKICAEMYEASVGEAGNRCRGELCLDGNESDTIKLLEQALEEEQTARATLYLELEKERSAAATAADEA
         + S DE +    NS D  K   D+ L  S    +        VGE      G     G    T+++ EQ L EE+ ARA+L LELEKER+AAA+AADEA
Subjt:  VD-SGDEREASSDNS-DQYKVFPDLELDDSYDEKICAEMYEASVGEAGNRCRGELCLDGNESDTIKLLEQALEEEQTARATLYLELEKERSAAATAADEA

Query:  MAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEIEAFRKSFFDDDGVDVDMLDSEFTPARTP-SFPYPTEDPSHMLQ
        + MILRLQEEKASIEM+ARQYQRMIEEK+A+DAEEMSILKEIL+RRERE HFLEKE++ +R+ F + +            P  TP S P   E     LQ
Subjt:  MAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEIEAFRKSFFDDDGVDVDMLDSEFTPARTP-SFPYPTEDPSHMLQ

Query:  CMNRSTRDEQDSNYKIHSLQYEIPSVESRNLTFEFGEESPFIQADEIADAAKARGMLLHQVADNFKGSEEIDYELQGKGMVEDEKLYTVPGEVNELDPYL
           + T    D      S  +EI + +  N   +     PF   D ++D                         ++ K   E E LY             
Subjt:  CMNRSTRDEQDSNYKIHSLQYEIPSVESRNLTFEFGEESPFIQADEIADAAKARGMLLHQVADNFKGSEEIDYELQGKGMVEDEKLYTVPGEVNELDPYL

Query:  QSNESNGLGKVEKCTELIADEQEKVDEVSYDESALAKTILPCLEYNLEKIGDHQKMRTRDLDSVNATDPHPHDIHVIEDEARISNEAIANASEEPLVNST
                              E V   S  + A++K +                      +  +  D H HDIHV+ DE                 ++ 
Subjt:  QSNESNGLGKVEKCTELIADEQEKVDEVSYDESALAKTILPCLEYNLEKIGDHQKMRTRDLDSVNATDPHPHDIHVIEDEARISNEAIANASEEPLVNST

Query:  SSIPVKCDSPSFSLLQSELEIPRTSSDASGRFPPMARSRSNSVRSELRRNSMSAVDYERSKIGNEVEWLRGRLKIVQEGREKLKFSVEYKENENNQLQLL
          + V  D  +  L   +L+  ++ SD S  FP   + +SN + + +RRNSMSA+DYER KI +EV  LRGRL+ VQ+GREK+ FS   K+   +Q+Q  
Subjt:  SSIPVKCDSPSFSLLQSELEIPRTSSDASGRFPPMARSRSNSVRSELRRNSMSAVDYERSKIGNEVEWLRGRLKIVQEGREKLKFSVEYKENENNQLQLL

Query:  ENIANHLREIRQ
         +  +   E R+
Subjt:  ENIANHLREIRQ

AT4G13630.2 Protein of unknown function, DUF593; e value 1.5e-62; %identity 34.27
Query:  MACEAIQLWTFNGLVAAFLDLGIAFLLLCASSLVFFTSKFLALFGLCLPCPCDGVFGNLSSDHCFQKLLVDRSSRKISSVLHSTRKKFPLNSV-WDEPKC
        M C+ ++ WTF GLVAAF+DL +AF LLCAS +V+ TSKFL LFGL LPCPCDG++       CFQ+ L +   +KISSV  S + + P +S+ ++  K 
Subjt:  MACEAIQLWTFNGLVAAFLDLGIAFLLLCASSLVFFTSKFLALFGLCLPCPCDGVFGNLSSDHCFQKLLVDRSSRKISSVLHSTRKKFPLNSV-WDEPKC

Query:  CFKSMLVHERNVKEAHVELEGETSSSSSFKTRSQQGMIYRDFPSVNEL-HCGGGVDRRKVISVSPNDISQSDVELEDLSHSPSSFSGFGDHNTED-GFFS
                +R  +   V+LE E SS++               PSV +  +   G D     S+         V+ + LS   S + GF +H     G  S
Subjt:  CFKSMLVHERNVKEAHVELEGETSSSSSFKTRSQQGMIYRDFPSVNEL-HCGGGVDRRKVISVSPNDISQSDVELEDLSHSPSSFSGFGDHNTED-GFFS

Query:  VD-SGDEREASSDNS-DQYKVFPDLELDDSYDEKICAEMYEASVGEAGNRCRGELCLDGNESDTIKLLEQALEEEQTARATLYLELEKERSAAATAADEA
         + S DE +    NS D  K   D+ L  S    +        VGE      G     G    T+++ EQ L EE+ ARA+L LELEKER+AAA+AADEA
Subjt:  VD-SGDEREASSDNS-DQYKVFPDLELDDSYDEKICAEMYEASVGEAGNRCRGELCLDGNESDTIKLLEQALEEEQTARATLYLELEKERSAAATAADEA

Query:  MAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEIEAFRKSFFDDDGVDVDMLDSEFTPARTP-SFPYPTEDPSHMLQ
        + MILRLQEEKASIEM+ARQYQRMIEEK+A+DAEEMSILKEIL+RRERE HFLEKE++ +R+ F + +            P  TP S P   E     LQ
Subjt:  MAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEIEAFRKSFFDDDGVDVDMLDSEFTPARTP-SFPYPTEDPSHMLQ

Query:  CMNRSTRDEQDSNYKIHSLQYEIPSVESRNLTFEFGEESPFIQADEIADAAKARGMLLHQVADNFKGSEEIDYELQGKGMVEDEKLYTVPGEVNELDPYL
           + T    D      S  +EI + +  N   +     PF   D ++D                         ++ K   E E LY             
Subjt:  CMNRSTRDEQDSNYKIHSLQYEIPSVESRNLTFEFGEESPFIQADEIADAAKARGMLLHQVADNFKGSEEIDYELQGKGMVEDEKLYTVPGEVNELDPYL

Query:  QSNESNGLGKVEKCTELIADEQEKVDEVSYDESALAKTILPCLEYNLEKIGDHQKMRTRDLDSVNATDPHPHDIHVIEDEARISNEAIANASEEPLVNST
                              E V   S  + A++K +                      +  +  D H HDIHV+ DE                 ++ 
Subjt:  QSNESNGLGKVEKCTELIADEQEKVDEVSYDESALAKTILPCLEYNLEKIGDHQKMRTRDLDSVNATDPHPHDIHVIEDEARISNEAIANASEEPLVNST

Query:  SSIPVKCDSPSFSLLQSELEIPRTSSDASGRFPPMARSRSNSVRSELRRNSMSAVDYERSKIGNEVEWLRGRLKIVQEGREKLKFSVEYKENENNQLQLL
          + V  D  +  L   +L+  ++ SD S  FP   + +SN + + +RRNSMSA+DYER KI +EV  LRGRL+ VQ+GREK+ FS   K+   +Q+Q  
Subjt:  SSIPVKCDSPSFSLLQSELEIPRTSSDASGRFPPMARSRSNSVRSELRRNSMSAVDYERSKIGNEVEWLRGRLKIVQEGREKLKFSVEYKENENNQLQLL

Query:  ENIANHLREIRQ
         +  +   E R+
Subjt:  ENIANHLREIRQ

AT5G16720.1 Protein of unknown function, DUF593; e value 3.0e-18; %identity 48.54
Query:  GNESDTIKLLEQALEEEQTARATLYLELEKERSAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEIE
        G+   TI+ L + +  EQ A   LY ELE+ERSA+A +A++ MAMI RLQEEKA ++M+A QYQRM+EE+  YD E + +L  ++V+RE+E   L++E+E
Subjt:  GNESDTIKLLEQALEEEQTARATLYLELEKERSAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEIE

Query:  AFR
         +R
Subjt:  AFR
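
The %identity values in the hit rows can be approximated from the middle line of each alignment block, where a letter marks an identical residue, `+` a conservative substitution, and a blank a mismatch or gap. The following is a minimal Python sketch, assuming that BLAST-style convention applies to the blocks shown here. The function name `identity_stats` is hypothetical, and the reported %identity presumably covers the whole alignment, so a single block will not necessarily reproduce it.

```python
def identity_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identical, positive, aligned_length) for one alignment block."""
    # Trailing blanks of the midline are often lost in copied text; pad them back.
    midline = midline.ljust(len(query))
    # A letter in the midline that equals the query residue counts as an identity.
    identical = sum(1 for q, m in zip(query, midline) if m.isalpha() and m == q)
    # '+' marks a conservative substitution; positives include identities.
    positive = identical + midline.count("+")
    return identical, positive, len(query)


# Last block of the AT5G16720.1 alignment: Query "AFR", midline " +R".
identical, positive, length = identity_stats("AFR", " +R")
print(f"{identical}/{length} identical, {positive}/{length} positive")
```

Summing the three counts over all blocks of one hit and dividing identities by the total aligned length gives a figure comparable to the %identity column.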


Sequences
CDS sequence
ATGGTCCGTGCAAAATTGACTCGCTTCTCTAAGCTCCGTTATGATCCTCTTCCTCACACGACGTTACTAAATCGAACTTTATGGACATTTATTAGGATTCTTGTAGATTC
AGTAAGGGACTCGTTCGATTTTCTGAACTTTTCACCGGCGGAAACGGGATTCGTCGTTGATACGGTGTTTATATTAGCGACTAGAAAAGTAGCGGCTGCATTGGATATTG
AAGAAATGGCTTGTGAAGCTATACAACTATGGACATTTAATGGATTAGTGGCTGCATTTCTTGATCTTGGTATAGCTTTTTTATTATTATGTGCATCGAGTCTTGTTTTC
TTTACATCCAAATTTCTTGCACTGTTTGGATTATGTTTGCCTTGCCCTTGTGATGGGGTATTTGGGAACCTTAGTAGTGATCACTGCTTCCAAAAGTTACTTGTGGATCG
TTCGTCCAGAAAAATATCTTCAGTCCTACATTCGACTAGAAAAAAGTTCCCATTAAATTCCGTGTGGGATGAGCCAAAATGTTGTTTTAAGTCAATGTTGGTGCACGAGA
GGAATGTCAAGGAGGCACATGTTGAATTGGAAGGTGAAACATCAAGTAGTTCCTCTTTTAAAACCAGATCACAACAAGGTATGATTTATAGAGACTTTCCCAGTGTCAAC
GAATTGCATTGTGGAGGTGGTGTGGATCGCAGGAAGGTCATATCAGTGTCTCCGAACGACATTTCACAGTCAGACGTGGAACTGGAAGACCTTTCTCATTCTCCTTCAAG
CTTCAGTGGATTTGGGGATCACAATACCGAGGATGGCTTCTTTTCTGTTGATTCTGGAGATGAAAGGGAGGCTTCATCAGACAACAGCGATCAATATAAAGTATTTCCCG
ATCTTGAACTAGATGATTCTTATGATGAGAAAATATGCGCAGAGATGTATGAAGCATCTGTTGGAGAGGCTGGGAACAGGTGCAGAGGGGAGTTATGTTTGGATGGTAAT
GAGAGTGATACAATCAAACTATTGGAGCAAGCACTTGAAGAAGAGCAGACAGCACGTGCTACCCTGTACCTGGAGCTTGAGAAAGAGAGAAGTGCTGCTGCCACCGCTGC
CGATGAGGCCATGGCTATGATATTGCGTCTACAGGAAGAGAAGGCATCAATAGAAATGGATGCTAGGCAATACCAGAGGATGATAGAGGAGAAGACTGCTTATGATGCGG
AAGAAATGAGTATTCTTAAAGAAATTCTAGTGAGGAGAGAGCGGGAAATGCATTTCTTAGAGAAGGAAATTGAAGCTTTTCGGAAAAGTTTCTTTGACGATGATGGAGTG
GATGTTGATATGCTTGACTCAGAATTTACACCTGCAAGGACCCCTTCTTTTCCTTATCCAACTGAAGATCCATCGCATATGCTTCAATGCATGAATAGATCCACTAGAGA
CGAGCAAGATTCCAATTACAAAATACATTCTTTGCAATATGAGATACCATCAGTGGAGTCACGAAACTTAACTTTCGAGTTTGGGGAAGAGTCACCATTTATTCAAGCGG
ATGAAATTGCTGATGCTGCAAAAGCTAGGGGAATGTTGTTGCACCAAGTTGCTGATAACTTCAAGGGCAGTGAAGAGATTGACTATGAGTTACAAGGAAAGGGCATGGTA
GAGGATGAGAAGTTATACACTGTACCAGGAGAAGTAAATGAACTGGATCCATATCTGCAAAGCAATGAGTCAAATGGTTTAGGTAAGGTTGAGAAATGCACAGAATTAAT
TGCAGATGAACAAGAAAAAGTTGACGAAGTTTCATATGACGAATCAGCATTGGCTAAAACAATTCTTCCTTGTCTTGAGTACAATTTGGAAAAGATTGGTGACCATCAAA
AGATGCGGACAAGAGATCTTGACTCTGTGAATGCTACAGATCCCCATCCTCATGATATTCATGTCATTGAAGATGAAGCTAGAATTTCAAATGAAGCAATTGCTAATGCA
AGTGAAGAACCATTGGTTAACAGTACCTCGAGTATTCCAGTTAAATGTGACAGTCCATCCTTCAGCTTGTTGCAGAGTGAACTAGAAATCCCAAGAACCAGCTCAGATGC
CTCTGGAAGATTTCCACCAATGGCTCGTTCTCGAAGCAATTCCGTGCGTTCTGAATTGCGTAGAAATTCGATGTCTGCTGTCGATTATGAAAGGTCAAAAATTGGCAATG
AAGTTGAGTGGCTCAGAGGAAGGCTAAAGATTGTTCAAGAGGGAAGAGAAAAACTCAAGTTCTCTGTGGAGTACAAAGAGAATGAGAACAATCAGTTGCAACTTTTAGAG
AACATAGCAAATCACCTTCGCGAAATCCGACAACTAACGGATCCTAGAAAGGCATCTTTGCAGGCTCCATTGCCTCCATCCTCTAAGGATGTGTCAAAGAAACGTTGCTG
GCGAAGCTCATCCTTGAGCATTCATAGGAGCAGCTAG
mRNA sequence
ATGGTCCGTGCAAAATTGACTCGCTTCTCTAAGCTCCGTTATGATCCTCTTCCTCACACGACGTTACTAAATCGAACTTTATGGACATTTATTAGGATTCTTGTAGATTC
AGTAAGGGACTCGTTCGATTTTCTGAACTTTTCACCGGCGGAAACGGGATTCGTCGTTGATACGGTGTTTATATTAGCGACTAGAAAAGTAGCGGCTGCATTGGATATTG
AAGAAATGGCTTGTGAAGCTATACAACTATGGACATTTAATGGATTAGTGGCTGCATTTCTTGATCTTGGTATAGCTTTTTTATTATTATGTGCATCGAGTCTTGTTTTC
TTTACATCCAAATTTCTTGCACTGTTTGGATTATGTTTGCCTTGCCCTTGTGATGGGGTATTTGGGAACCTTAGTAGTGATCACTGCTTCCAAAAGTTACTTGTGGATCG
TTCGTCCAGAAAAATATCTTCAGTCCTACATTCGACTAGAAAAAAGTTCCCATTAAATTCCGTGTGGGATGAGCCAAAATGTTGTTTTAAGTCAATGTTGGTGCACGAGA
GGAATGTCAAGGAGGCACATGTTGAATTGGAAGGTGAAACATCAAGTAGTTCCTCTTTTAAAACCAGATCACAACAAGGTATGATTTATAGAGACTTTCCCAGTGTCAAC
GAATTGCATTGTGGAGGTGGTGTGGATCGCAGGAAGGTCATATCAGTGTCTCCGAACGACATTTCACAGTCAGACGTGGAACTGGAAGACCTTTCTCATTCTCCTTCAAG
CTTCAGTGGATTTGGGGATCACAATACCGAGGATGGCTTCTTTTCTGTTGATTCTGGAGATGAAAGGGAGGCTTCATCAGACAACAGCGATCAATATAAAGTATTTCCCG
ATCTTGAACTAGATGATTCTTATGATGAGAAAATATGCGCAGAGATGTATGAAGCATCTGTTGGAGAGGCTGGGAACAGGTGCAGAGGGGAGTTATGTTTGGATGGTAAT
GAGAGTGATACAATCAAACTATTGGAGCAAGCACTTGAAGAAGAGCAGACAGCACGTGCTACCCTGTACCTGGAGCTTGAGAAAGAGAGAAGTGCTGCTGCCACCGCTGC
CGATGAGGCCATGGCTATGATATTGCGTCTACAGGAAGAGAAGGCATCAATAGAAATGGATGCTAGGCAATACCAGAGGATGATAGAGGAGAAGACTGCTTATGATGCGG
AAGAAATGAGTATTCTTAAAGAAATTCTAGTGAGGAGAGAGCGGGAAATGCATTTCTTAGAGAAGGAAATTGAAGCTTTTCGGAAAAGTTTCTTTGACGATGATGGAGTG
GATGTTGATATGCTTGACTCAGAATTTACACCTGCAAGGACCCCTTCTTTTCCTTATCCAACTGAAGATCCATCGCATATGCTTCAATGCATGAATAGATCCACTAGAGA
CGAGCAAGATTCCAATTACAAAATACATTCTTTGCAATATGAGATACCATCAGTGGAGTCACGAAACTTAACTTTCGAGTTTGGGGAAGAGTCACCATTTATTCAAGCGG
ATGAAATTGCTGATGCTGCAAAAGCTAGGGGAATGTTGTTGCACCAAGTTGCTGATAACTTCAAGGGCAGTGAAGAGATTGACTATGAGTTACAAGGAAAGGGCATGGTA
GAGGATGAGAAGTTATACACTGTACCAGGAGAAGTAAATGAACTGGATCCATATCTGCAAAGCAATGAGTCAAATGGTTTAGGTAAGGTTGAGAAATGCACAGAATTAAT
TGCAGATGAACAAGAAAAAGTTGACGAAGTTTCATATGACGAATCAGCATTGGCTAAAACAATTCTTCCTTGTCTTGAGTACAATTTGGAAAAGATTGGTGACCATCAAA
AGATGCGGACAAGAGATCTTGACTCTGTGAATGCTACAGATCCCCATCCTCATGATATTCATGTCATTGAAGATGAAGCTAGAATTTCAAATGAAGCAATTGCTAATGCA
AGTGAAGAACCATTGGTTAACAGTACCTCGAGTATTCCAGTTAAATGTGACAGTCCATCCTTCAGCTTGTTGCAGAGTGAACTAGAAATCCCAAGAACCAGCTCAGATGC
CTCTGGAAGATTTCCACCAATGGCTCGTTCTCGAAGCAATTCCGTGCGTTCTGAATTGCGTAGAAATTCGATGTCTGCTGTCGATTATGAAAGGTCAAAAATTGGCAATG
AAGTTGAGTGGCTCAGAGGAAGGCTAAAGATTGTTCAAGAGGGAAGAGAAAAACTCAAGTTCTCTGTGGAGTACAAAGAGAATGAGAACAATCAGTTGCAACTTTTAGAG
AACATAGCAAATCACCTTCGCGAAATCCGACAACTAACGGATCCTAGAAAGGCATCTTTGCAGGCTCCATTGCCTCCATCCTCTAAGGATGTGTCAAAGAAACGTTGCTG
GCGAAGCTCATCCTTGAGCATTCATAGGAGCAGCTAGCTAATTTGAAACCCTCGCCCAGAAGAAACAATAAGAATAAAACAAAACACCATAGCAGAACATGATACAGCAT
CCTCCACGTTAGAAATTCTAGTCTTTTTTCCCCCACTGCCATACAAGACTACAAGTCATGTAAATTAGGAATAACTTGACAGATTATGTCTTATAGTTTGACAGATTATA
CCTTCTGTACATTCTCTAAAAAAGTTTTGATGCGACTGATAGAATGTAGTTAATCATATTTATAACTGTTAGATGCATGAGAATGAG
Protein sequence
MVRAKLTRFSKLRYDPLPHTTLLNRTLWTFIRILVDSVRDSFDFLNFSPAETGFVVDTVFILATRKVAAALDIEEMACEAIQLWTFNGLVAAFLDLGIAFLLLCASSLVF
FTSKFLALFGLCLPCPCDGVFGNLSSDHCFQKLLVDRSSRKISSVLHSTRKKFPLNSVWDEPKCCFKSMLVHERNVKEAHVELEGETSSSSSFKTRSQQGMIYRDFPSVN
ELHCGGGVDRRKVISVSPNDISQSDVELEDLSHSPSSFSGFGDHNTEDGFFSVDSGDEREASSDNSDQYKVFPDLELDDSYDEKICAEMYEASVGEAGNRCRGELCLDGN
ESDTIKLLEQALEEEQTARATLYLELEKERSAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEIEAFRKSFFDDDGV
DVDMLDSEFTPARTPSFPYPTEDPSHMLQCMNRSTRDEQDSNYKIHSLQYEIPSVESRNLTFEFGEESPFIQADEIADAAKARGMLLHQVADNFKGSEEIDYELQGKGMV
EDEKLYTVPGEVNELDPYLQSNESNGLGKVEKCTELIADEQEKVDEVSYDESALAKTILPCLEYNLEKIGDHQKMRTRDLDSVNATDPHPHDIHVIEDEARISNEAIANA
SEEPLVNSTSSIPVKCDSPSFSLLQSELEIPRTSSDASGRFPPMARSRSNSVRSELRRNSMSAVDYERSKIGNEVEWLRGRLKIVQEGREKLKFSVEYKENENNQLQLLE
NIANHLREIRQLTDPRKASLQAPLPPSSKDVSKKRCWRSSSLSIHRSS
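
The CDS and protein sequences listed above should agree under the standard genetic code. The following is a minimal Python sketch for checking this on fragments of the CDS. The `translate` helper and the codon table construction are illustrative and not part of CuGenDB, and the standard code (NCBI translation table 1) is an assumption.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0; '*' marks a stop codon."""
    cds = cds.upper().replace("\n", "")
    if len(cds) % 3:
        raise ValueError("CDS length is not a multiple of 3")
    # Codons containing non-ACGT characters are reported as 'X'.
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3))


# First 30 nt and last 12 nt of the CDS sequence shown above.
print(translate("ATGGTCCGTGCAAAATTGACTCGCTTCTCT"))  # MVRAKLTRFS
print(translate("AGGAGCAGCTAG"))                    # RSS*
```

Both fragments match the start (MVRAKLTRFS) and end (RSS) of the protein sequence. Passing the full CDS, joined across lines, should give the listed protein followed by a terminal `*`.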