; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc06G04660 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc06G04660
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionPentatricopeptide repeat-containing protein
Genome locationClcChr06:4818560..4826953
RNA-Seq ExpressionClc06G04660
SyntenyClc06G04660
Gene Ontology termsGO:0016020 - membrane (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR012419 - Cas1p 10 TM acyl transferase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF4354295.1 hypothetical protein G4B88_007424 [Cannabis sativa]0.0e+0055.7Show/hide
Query:  MALLGPVTHAQVSCLIGFVTIVVAWIYSEYLVFKKKSISSKVRHSDIGLAEVESHTGKEDERAVLLEGGAIKSIPSGVRIPPSFSTFIRFIMLEESFLIE
        M +   VT  QVS L+G V ++VAWIY+E L +KK S+S K RHSDI L  V     K+D+RAVLLE G +++      I  S S+ +RF+++EESFL+E
Subjt:  MALLGPVTHAQVSCLIGFVTIVVAWIYSEYLVFKKKSISSKVRHSDIGLAEVESHTGKEDERAVLLEGGAIKSIPSGVRIPPSFSTFIRFIMLEESFLIE

Query:  NRMLLRAASELGLLLAYFYICDRTDIFGTATK-----------------------------------------------------VIFLMYHYFNAREIY
        NR+ LRA SE G  L Y+Y+CDRTD F ++ K                                                     V+FLMYHYF A+EIY
Subjt:  NRMLLRAASELGLLLAYFYICDRTDIFGTATK-----------------------------------------------------VIFLMYHYFNAREIY

Query:  NVGRVCVAAYVWMTGFGNFSYYYARKDFSLSRFAQLH-----------VILH--------LPLAHFFSLVVYGIIGILNKYNQIRAVIAVKFFVCFLVVI
        N  RV +AAYVWMTGFGNFSYYY RKDFS++RFAQ+            V+L+         P+   F+L+VYG +GILNK+N+  +VIA+K   CFLVVI
Subjt:  NVGRVCVAAYVWMTGFGNFSYYYARKDFSLSRFAQLH-----------VILH--------LPLAHFFSLVVYGIIGILNKYNQIRAVIAVKFFVCFLVVI

Query:  LVWEIPGVFDVLWEPFTFLLGYKDPNRLEEHLPLMYEWHFRTGLDRYIWILGMIYAYYYSTVENWIEKLDEAKLKLRILIKTTIVVTSATAAYLWFEYIF
        +VWEIPGVF+++W P TFLLGYKDP    ++ PL  EW+FR+GLDRYIWI+GMIYAYY+ TVE W+EKL+EA +K R+ IK  I   + T  YLWFEYI+
Subjt:  LVWEIPGVFDVLWEPFTFLLGYKDPNRLEEHLPLMYEWHFRTGLDRYIWILGMIYAYYYSTVENWIEKLDEAKLKLRILIKTTIVVTSATAAYLWFEYIF

Query:  KLDSITYNKYHPYTSWIPITAYICIRNVSQSTRSYTLTLFGWIGKISLDTYISQFHIWLRSNAPDAQPKKLLIIIPNYPLLNFMFTTIVFMTISYRLSEL
        KLD +TYNKYHPYTSWIPIT YIC+RNV+Q  R Y+LTLF W+GKI+L+TYISQ HIWLRS  PD QPK LL +IPNYP+LNFM TT +++ ISYRL +L
Subjt:  KLDSITYNKYHPYTSWIPITAYICIRNVSQSTRSYTLTLFGWIGKISLDTYISQFHIWLRSNAPDAQPKKLLIIIPNYPLLNFMFTTIVFMTISYRLSEL

Query:  TNTFKMAFVPSKDNKRIMHNMITAAAIMGILYSFSFVFLKVLPML-------------------------------------------------------
        TNT K+AFVPSKD+KR++HNMIT AAI   LY+ SFVFL V  ML                                                       
Subjt:  TNTFKMAFVPSKDNKRIMHNMITAAAIMGILYSFSFVFLKVLPML-------------------------------------------------------

Query:  --------------------IQKPLMPFVL--NSSPIINPRD-PNKQHL----LRESDVLKRLKKDRNLASALEFFSAIANSNAFKHTASTYKVMIERLG
                            + KP +PFVL  N++PI N  D P  Q +    L E DVL++LK++R+++SA+E+F ++ANS AFKHT  TY  MIE+LG
Subjt:  --------------------IQKPLMPFVL--NSSPIINPRD-PNKQHL----LRESDVLKRLKKDRNLASALEFFSAIANSNAFKHTASTYKVMIERLG

Query:  RECEMDVVQYILQQMKMDGISCCEDLFVCIINSYKRVGSAEQALKMFYRIGEFGCKPTVKIYNHLLDALLSENRFQIINPLYTNMKKDGLIPNVFTYNIL
         + +MD VQY+LQQMK++GI C EDLFV +INSYKR GSAEQALKMFYRI EFGCKPTV+IYNH+LDALL ENRF +INP+Y NMKKDG  PN+FTYNIL
Subjt:  RECEMDVVQYILQQMKMDGISCCEDLFVCIINSYKRVGSAEQALKMFYRIGEFGCKPTVKIYNHLLDALLSENRFQIINPLYTNMKKDGLIPNVFTYNIL

Query:  LKALCKNDRVDAAHKLFVEMSSKGCPPDVVSYTTMVSSLCKAGKIDDARELAGTFKPSVPVYNALIDGMCKEGRIEVAIKLLGEMMDNGVDPNVVSYSSI
        LKALCKNDRVD A KL  EMS KGC PD VSYTT+VSSLCK GK+++A+ LA  F+P +PVYNALI+G+CKE +I+   +LL EM+D G+DPNV++YS+I
Subjt:  LKALCKNDRVDAAHKLFVEMSSKGCPPDVVSYTTMVSSLCKAGKIDDARELAGTFKPSVPVYNALIDGMCKEGRIEVAIKLLGEMMDNGVDPNVVSYSSI

Query:  INSLCNSGNVELAFALLARMFLRGCDANIHTFTPLIKGCFMRGKLYEALDLWKLMIQDGCEPNVVAYNTLIHGLCKSGTLVEALQVCDQMQRSGCLPNVT
        I+SL + G+V L+FA+LA+MF++GC  N+HTFT LIKG F+ G+ +EAL LW  MI DG EPNVVAYNTLIHGLC SG +  A+ V  QM+RSGC PNV 
Subjt:  INSLCNSGNVELAFALLARMFLRGCDANIHTFTPLIKGCFMRGKLYEALDLWKLMIQDGCEPNVVAYNTLIHGLCKSGTLVEALQVCDQMQRSGCLPNVT

Query:  TYSILIDGFAKSGHLVGASETWNRMISHGCRPNVVAYTCMVDVLCKNLMFDQANSIMEKMILEGCTPNTVTFNTFIKGLCGNGRVEWAMKVLDRM-QGHG
         YS LIDGFAK G LVGASETWNRM++ GCRPNVVAYTCMVDVLC+N MF++A  ++EKM  +GC PNT+TFNT IKGLCG+GR +WA++VLD M   +G
Subjt:  TYSILIDGFAKSGHLVGASETWNRMISHGCRPNVVAYTCMVDVLCKNLMFDQANSIMEKMILEGCTPNTVTFNTFIKGLCGNGRVEWAMKVLDRM-QGHG

Query:  CLPNITTYNELLDALFRMNKYEEAFGLFQEIEERNLQPNLVTYNTILYGFSRAGMLGEALQLFGKVLVGGTAPDPITYNTMIHAYCKQGKVKIAAQLVER
        CLPNITTYNELLD  F++ +++EAF L +++EE+ ++ NLVTYNTIL GF  AG+  +ALQL GK++V G   D ITYNT+++AYCKQGKV+ A QL+ER
Subjt:  CLPNITTYNELLDALFRMNKYEEAFGLFQEIEERNLQPNLVTYNTILYGFSRAGMLGEALQLFGKVLVGGTAPDPITYNTMIHAYCKQGKVKIAAQLVER

Query:  VSSMKEWHPDIITYTSLIWGACNWINIEEAIAFLYKAINQGICPNFATWNALVRCFFDSLGHMGPIHILDDIL
        V    EWHPD++TYTSL+WG C+ I ++EA+  L+K +++ I PN ATWN LVRC FDSLGH+GPI+I+D+IL
Subjt:  VSSMKEWHPDIITYTSLIWGACNWINIEEAIAFLYKAINQGICPNFATWNALVRCFFDSLGHMGPIHILDDIL

RXH80110.1 hypothetical protein DVH24_041257 [Malus domestica]0.0e+0057.82Show/hide
Query:  MALLGPVTHAQVSCLIGFVTIVVAWIYSEYLVFKKKSISSKVRHSDIGLAEVESHTGKEDERAVLLEGGAIKSIPSGVRIPPSFSTFIRFIMLEESFLIE
        M +  PVT AQVS L+G + I+ AWIY+E+L +KK S+SSK+ +S++ L E+E    KED+RAVLLEGG ++S     RIP  +S   RF++++ SFL++
Subjt:  MALLGPVTHAQVSCLIGFVTIVVAWIYSEYLVFKKKSISSKVRHSDIGLAEVESHTGKEDERAVLLEGGAIKSIPSGVRIPPSFSTFIRFIMLEESFLIE

Query:  NRMLLRAASELGLLLAYFYICDRTDIFGTATK------------------------------------------------------VIFLMYHYFNAREI
        NR+ LRA SE  +LL+Y Y+CDRTD F  +TK                                                      V+FL+YHYF A EI
Subjt:  NRMLLRAASELGLLLAYFYICDRTDIFGTATK------------------------------------------------------VIFLMYHYFNAREI

Query:  YNVGRVCVAAYVWMTGFGNFSYYYARKDFSLSRFAQLHVILHLPLAHFFSLVVYGIIGILNKYNQIRAVIAVKFFVCFLVVILVWEIPGVFDVLWEPFTF
        YN  RV +AAYVWMTGFGNFSYYY RKDFSL+ +   ++    P+   F+L+VYG +GI+NKYN+  +VIAVK   CFLVVILVWEIPGVF++LW PFTF
Subjt:  YNVGRVCVAAYVWMTGFGNFSYYYARKDFSLSRFAQLHVILHLPLAHFFSLVVYGIIGILNKYNQIRAVIAVKFFVCFLVVILVWEIPGVFDVLWEPFTF

Query:  LLGYKDPNRLEEHLPLMYEWHFRTGLDRYIWILGMIYAYYYSTVENWIEKLDEAKLKLRILIKTTIVVTSATAAYLWFEYIFKLDSITYNKYHPYTSWIP
        LLGY DP   +  LPL++EW FR+GLDRYIWI+GMIYAYY+ TVE W+EKL+E++ K +  IK+ + + + T  YLWFEYI+K+D +TYNKYHPYTSWIP
Subjt:  LLGYKDPNRLEEHLPLMYEWHFRTGLDRYIWILGMIYAYYYSTVENWIEKLDEAKLKLRILIKTTIVVTSATAAYLWFEYIFKLDSITYNKYHPYTSWIP

Query:  ITAYICIRNVSQSTRSYTLTLFGWIGKISLDTYISQFHIWLRSNAPDAQPKKLLIIIPNYPLLNFMFTTIVFMTISYRLSELTNTFKMAFVPSKDNKRIM
        IT YIC+RNV+Q  RSY+LTLF W+GKI+L+TYI QFH+WLRS+ PDAQPK LL +IP+YP+LNFM T+ +++ IS RL +LTNT K+AFVPSKDNKR++
Subjt:  ITAYICIRNVSQSTRSYTLTLFGWIGKISLDTYISQFHIWLRSNAPDAQPKKLLIIIPNYPLLNFMFTTIVFMTISYRLSELTNTFKMAFVPSKDNKRIM

Query:  HNMITAAAIMGILYSFSFVFLKVLPML----------------------------------IQKPLMPFVLN-SSPII--NPR---DPNK-QHLLRESDV
         N++TA A+   LY  SFVFLKV  M+                                  + KP +PFVLN ++PI+  NP+   +PN+ Q  L+ESDV
Subjt:  HNMITAAAIMGILYSFSFVFLKVLPML----------------------------------IQKPLMPFVLN-SSPII--NPR---DPNK-QHLLRESDV

Query:  LKRLKKDRNLASALEFFSAIANSNAFKHTASTYKVMIERLGRECEMDVVQYILQQMKMDGISCCEDLFVCIINSYKRVGSAEQALKMFYRIGEFGCKPTV
        L+RLK + +L SALE+F +IANS AF+HT  TY  MIE+LGR+CEMD VQY+L QMK++G+ C E+LF+C+I+SY+R G AEQALKMFYRI EFGCK TV
Subjt:  LKRLKKDRNLASALEFFSAIANSNAFKHTASTYKVMIERLGRECEMDVVQYILQQMKMDGISCCEDLFVCIINSYKRVGSAEQALKMFYRIGEFGCKPTV

Query:  KIYNHLLDALLSENRFQIINPLYTNMKKDGLIPNVFTYNILLKALCKNDRVDAAHKLFVEMSSKGCPPDVVSYTTMVSSLCKAGKIDDARELAGTFKPSV
        KIYNHLLDALLSENRFQ+INP+Y+NMKKDG+ PNV+TYNILLKALCKNDRVD AHKL VEMS KGC PD VSYTT+VS+LC+ GK+++ARELAG F+P V
Subjt:  KIYNHLLDALLSENRFQIINPLYTNMKKDGLIPNVFTYNILLKALCKNDRVDAAHKLFVEMSSKGCPPDVVSYTTMVSSLCKAGKIDDARELAGTFKPSV

Query:  PVYNALIDGMCKEGRIEVAIKLLGEMMDNGVDPNVVSYSSIINSLCNSGNVELAFALLARMFLRGCDANIHTFTPLIKGCFMRGKLYEALDLWKLMIQDG
        PVYNAL++G+CKE +IE A+KLL EM+D G++PNV++YS+IINSL ++ NVE A A+LA+M +RGC  N+HTFT LIKG F+ G+++EALDLW  MI + 
Subjt:  PVYNALIDGMCKEGRIEVAIKLLGEMMDNGVDPNVVSYSSIINSLCNSGNVELAFALLARMFLRGCDANIHTFTPLIKGCFMRGKLYEALDLWKLMIQDG

Query:  CEPNVVAYNTLIHGLCKSGTLVEALQVCDQMQRSGCLPNVTTYSILIDGFAKSGHLVGASETWNRMISHGCRPNVVAYTCMVDVLCKNLMFDQANSIMEK
         +PN++AY TL+HGLC +G + +A+ VC +M ++GC PNVTTYS LIDGFAK G LVGAS+TWN M++ GCRPNV+AYTCM+DVLC+N MF QA+S++E 
Subjt:  CEPNVVAYNTLIHGLCKSGTLVEALQVCDQMQRSGCLPNVTTYSILIDGFAKSGHLVGASETWNRMISHGCRPNVVAYTCMVDVLCKNLMFDQANSIMEK

Query:  MILEGCTPNTVTFNTFIKGLCGNGRVEWAMKVLDRMQGHGCLPNITTYNELLDALFRMNKYEEAFGLFQEIEERNLQPNLVTYNTILYGFSRAGMLGEAL
        M  EGC PNTVTFNTFIKGLC +G+V+WA+ +LD+M+ +GCLPNITTYNELLD LF++N++EEA+G+ +EIEER +  NLVTYNTIL GF  AGM  EA+
Subjt:  MILEGCTPNTVTFNTFIKGLCGNGRVEWAMKVLDRMQGHGCLPNITTYNELLDALFRMNKYEEAFGLFQEIEERNLQPNLVTYNTILYGFSRAGMLGEAL

Query:  QLFGKVLVGGTAPDPITYNTMIHAYCKQGKVKIAAQLVERVSSMKEWHPDIITYTSLIWGACNWINIEEAIAFLYK
        QL GK+LV GT PD ITYNT+I+A CK+G +  A QL +R+ + KEW PD++ YTSL+WG CNW+ ++EA+ +L K
Subjt:  QLFGKVLVGGTAPDPITYNTMIHAYCKQGKVKIAAQLVERVSSMKEWHPDIITYTSLIWGACNWINIEEAIAFLYK

XP_016902287.1 PREDICTED: pentatricopeptide repeat-containing protein At3g48810 [Cucumis melo]0.0e+0091.72Show/hide
Query:  LPMLIQKPLMPFVLNSSPIINPRDPNKQHLLRESDVLKRLKKDRNLASALEFFSAIANSNAFKHTASTYKVMIERLGRECEMDVVQYILQQMKMDGISCC
        L M IQKPL+PFVLNS+PIINPRDPNKQ LL+ESDVLKRLK DRNL+SAL FF+AIANSNAF+HTASTY+VMIE+LGRECEMDVVQYILQQMKMDGI+CC
Subjt:  LPMLIQKPLMPFVLNSSPIINPRDPNKQHLLRESDVLKRLKKDRNLASALEFFSAIANSNAFKHTASTYKVMIERLGRECEMDVVQYILQQMKMDGISCC

Query:  EDLFVCIINSYKRVGSAEQALKMFYRIGEFGCKPTVKIYNHLLDALLSENRFQIINPLYTNMKKDGLIPNVFTYNILLKALCKNDRVDAAHKLFVEMSSK
        EDLF+CIIN YKRVGSAEQALK+FYRIGEFGCKPTVKIYNHLLDALLSEN+FQ+INPLY NMKK GLIPNVFTYNILLKALCKN RVDAAHKLFVEMS+K
Subjt:  EDLFVCIINSYKRVGSAEQALKMFYRIGEFGCKPTVKIYNHLLDALLSENRFQIINPLYTNMKKDGLIPNVFTYNILLKALCKNDRVDAAHKLFVEMSSK

Query:  GCPPDVVSYTTMVSSLCKAGKIDDARELAGTFKPSVPVYNALIDGMCKEGRIEVAIKLLGEMMDNGVDPNVVSYSSIINSLCNSGNVELAFALLARMFLR
        GCPPD V+YTTMVSSLCKAGKIDDARELAG FKP+VPVYNALIDGMCKEGRIEVAIKLLGEMMDNGVDPNVVSYSSIINSLC  GNVELAFALLA+MF R
Subjt:  GCPPDVVSYTTMVSSLCKAGKIDDARELAGTFKPSVPVYNALIDGMCKEGRIEVAIKLLGEMMDNGVDPNVVSYSSIINSLCNSGNVELAFALLARMFLR

Query:  GCDANIHTFTPLIKGCFMRGKLYEALDLWKLMIQDGCEPNVVAYNTLIHGLCKSGTLVEALQVCDQMQRSGCLPNVTTYSILIDGFAKSGHLVGASETWN
        GCDANIHTFTPLIKGCFMRGKLYEALDLWKLMI+DGCEPNVVAYNTLIHGLC SG+L EALQV DQMQRSGC+PNVTTYSILIDGFAKSG LVGASETWN
Subjt:  GCDANIHTFTPLIKGCFMRGKLYEALDLWKLMIQDGCEPNVVAYNTLIHGLCKSGTLVEALQVCDQMQRSGCLPNVTTYSILIDGFAKSGHLVGASETWN

Query:  RMISHGCRPNVVAYTCMVDVLCKNLMFDQANSIMEKMILEGCTPNTVTFNTFIKGLCGNGRVEWAMKVLDRMQGHGCLPNITTYNELLDALFRMNKYEEA
        RMISHGCRPNVVAYTCMVDVLCKN MFDQANS++EKM LEGCTPNT+TFNTFIKGLCGNGRVEWAMK+L+RMQGHGCLPNITTYNELLDALFRMNKYEEA
Subjt:  RMISHGCRPNVVAYTCMVDVLCKNLMFDQANSIMEKMILEGCTPNTVTFNTFIKGLCGNGRVEWAMKVLDRMQGHGCLPNITTYNELLDALFRMNKYEEA

Query:  FGLFQEIEERNLQPNLVTYNTILYGFSRAGMLGEALQLFGKVLVGGTAPDPITYNTMIHAYCKQGKVKIAAQLVERVSSMKEWHPDIITYTSLIWGACNW
        FGLFQEIEERNLQPNLVTYNTILYGFSRAGM+GEALQLFGK LV GTAPD ITYNTMIHAYCKQGKVKIAAQLVERVSSMKEW PDIITYT LIWGAC W
Subjt:  FGLFQEIEERNLQPNLVTYNTILYGFSRAGMLGEALQLFGKVLVGGTAPDPITYNTMIHAYCKQGKVKIAAQLVERVSSMKEWHPDIITYTSLIWGACNW

Query:  INIEEAIAFLYKAINQGICPNFATWNALVRCFFDSLGHMGPIHILDDILRKG
        INIEEAIAFL KAINQGICPNFATWNAL+RCFFDSLGHMGPIHILDDILRKG
Subjt:  INIEEAIAFLYKAINQGICPNFATWNALVRCFFDSLGHMGPIHILDDILRKG

XP_031742794.1 LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At3g48810-like [Cucumis sativus]0.0e+0091.56Show/hide
Query:  LPMLIQKPLMPFVLNSSPIINPRDPNKQHLLRESDVLKRLKKDRNLASALEFFSAIANSNAFKHTASTYKVMIERLGRECEMDVVQYILQQMKMDGISCC
        L M IQKPL+PFVLNS+PIINPRDPNKQ LL+ESDVLKRLK DRNL+S L FFSAIANSNAF+HTASTY+VMIERLGRECEMD+VQYILQQMKMDGI+CC
Subjt:  LPMLIQKPLMPFVLNSSPIINPRDPNKQHLLRESDVLKRLKKDRNLASALEFFSAIANSNAFKHTASTYKVMIERLGRECEMDVVQYILQQMKMDGISCC

Query:  EDLFVCIINSYKRVGSAEQALKMFYRIGEFGCKPTVKIYNHLLDALLSENRFQIINPLYTNMKKDGLIPNVFTYNILLKALCKNDRVDAAHKLFVEMSSK
        EDLF+CIIN YKRVGSAEQALKMFYRIGEFGCKPTV+IYNHLLDALLSEN+FQ+INPLYTNMKKDGLIPNVFTYNILLKALC NDRVDAAHKLFVEMS+K
Subjt:  EDLFVCIINSYKRVGSAEQALKMFYRIGEFGCKPTVKIYNHLLDALLSENRFQIINPLYTNMKKDGLIPNVFTYNILLKALCKNDRVDAAHKLFVEMSSK

Query:  GCPPDVVSYTTMVSSLCKAGKIDDARELAGTFKPSVPVYNALIDGMCKEGRIEVAIKLLGEMMDNGVDPNVVSYSSIINSLCNSGNVELAFALLARMFLR
        GCPPD V+YTTMVSSLCKAGKIDDARELAG FKPSVPVYNAL  GMCK+ RIEVAIKLLGEMMDNGVDPNVVSYS IINSLC SGNVELAFA  A+MFLR
Subjt:  GCPPDVVSYTTMVSSLCKAGKIDDARELAGTFKPSVPVYNALIDGMCKEGRIEVAIKLLGEMMDNGVDPNVVSYSSIINSLCNSGNVELAFALLARMFLR

Query:  GCDANIHTFTPLIKGCFMRGKLYEALDLWKLMIQDGCEPNVVAYNTLIHGLCKSGTLVEALQVCDQMQRSGCLPNVTTYSILIDGFAKSGHLVGASETWN
        GCDANIHTFTPLIKGCFMRGKLYEALDLWKLMIQDGCEPNVVAYNTLIHGLC +G+L EALQVCDQMQRSGCLPNVTTYSILIDGFAKSG LVGASETWN
Subjt:  GCDANIHTFTPLIKGCFMRGKLYEALDLWKLMIQDGCEPNVVAYNTLIHGLCKSGTLVEALQVCDQMQRSGCLPNVTTYSILIDGFAKSGHLVGASETWN

Query:  RMISHGCRPNVVAYTCMVDVLCKNLMFDQANSIMEKMILEGCTPNTVTFNTFIKGLCGNGRVEWAMKVLDRMQGHGCLPNITTYNELLDALFRMNKYEEA
        RMISHGCRPNVV YTCMVDVLCKN MFDQANS++EKM LEGCTPNT+TFNTFIKGLCGNGRVEWAMK+L+RMQGHGCLPNITTYNELLDALFRMNKYEEA
Subjt:  RMISHGCRPNVVAYTCMVDVLCKNLMFDQANSIMEKMILEGCTPNTVTFNTFIKGLCGNGRVEWAMKVLDRMQGHGCLPNITTYNELLDALFRMNKYEEA

Query:  FGLFQEIEERNLQPNLVTYNTILYGFSRAGMLGEALQLFGKVLVGGTAPDPITYNTMIHAYCKQGKVKIAAQLVERVSSMKEWHPDIITYTSLIWGACNW
        FGLFQEIE RNLQPNLVTYNT+LYGFSRAGM+GEALQLFGK LV GTAPD ITYNTMIHAYCKQGKVKIAAQLVERVSSMKEWHPDIITYTSLIWGACNW
Subjt:  FGLFQEIEERNLQPNLVTYNTILYGFSRAGMLGEALQLFGKVLVGGTAPDPITYNTMIHAYCKQGKVKIAAQLVERVSSMKEWHPDIITYTSLIWGACNW

Query:  INIEEAIAFLYKAINQGICPNFATWNALVRCFFDSLGHMGPIHILDDILRKG
        +NIEEA+AFL KAINQGICPNFATWNALVRCFFDSLGHMGPIHILDDILRKG
Subjt:  INIEEAIAFLYKAINQGICPNFATWNALVRCFFDSLGHMGPIHILDDILRKG

XP_038875460.1 pentatricopeptide repeat-containing protein At3g48810 [Benincasa hispida]0.0e+0094.17Show/hide
Query:  LPMLIQKPLMPFVLNSSPIINPRDPNKQHLLRESDVLKRLKKDRNLASALEFFSAIANSNAFKHTASTYKVMIERLGRECEMDVVQYILQQMKMDGISCC
        L + IQKPLMPFVLNS+PIINPRD NKQHLL+ESDVLKRLKKDRNLASALEFFSAIANSNAFKHTASTYKVMIE+LG ECEMDVVQYILQQMKMDGISCC
Subjt:  LPMLIQKPLMPFVLNSSPIINPRDPNKQHLLRESDVLKRLKKDRNLASALEFFSAIANSNAFKHTASTYKVMIERLGRECEMDVVQYILQQMKMDGISCC

Query:  EDLFVCIINSYKRVGSAEQALKMFYRIGEFGCKPTVKIYNHLLDALLSENRFQIINPLYTNMKKDGLIPNVFTYNILLKALCKNDRVDAAHKLFVEMSSK
        EDLFVCIINSYKRVGSAEQALKMFYRI EFGCKPTVKIYNHLLDALLSENRFQ+INPLYTNMKKDGLIPNVFTYNILLKALCKNDRVDAAHKLFVEMSSK
Subjt:  EDLFVCIINSYKRVGSAEQALKMFYRIGEFGCKPTVKIYNHLLDALLSENRFQIINPLYTNMKKDGLIPNVFTYNILLKALCKNDRVDAAHKLFVEMSSK

Query:  GCPPDVVSYTTMVSSLCKAGKIDDARELAGTFKPSVPVYNALIDGMCKEGRIEVAIKLLGEMMDNGVDPNVVSYSSIINSLCNSGNVELAFALLARMFLR
        GCPPDVVSYTTMVSSLCK GKIDDARELAG FKPSVP+YNALI GMCKEGRI+VAIKLLGEMMDNGVDPNVVSYSSIINSLC+SGNVELAFALLA+MF R
Subjt:  GCPPDVVSYTTMVSSLCKAGKIDDARELAGTFKPSVPVYNALIDGMCKEGRIEVAIKLLGEMMDNGVDPNVVSYSSIINSLCNSGNVELAFALLARMFLR

Query:  GCDANIHTFTPLIKGCFMRGKLYEALDLWKLMIQDGCEPNVVAYNTLIHGLCKSGTLVEALQVCDQMQRSGCLPNVTTYSILIDGFAKSGHLVGASETWN
        GCDANIHTFTPLIKGCFMRGK YEALDLWKLMIQDGC PNVVAYNTLIHGLC SGTL EALQV DQMQR+GCLPNV TYS LIDGFAKSG LVGASETWN
Subjt:  GCDANIHTFTPLIKGCFMRGKLYEALDLWKLMIQDGCEPNVVAYNTLIHGLCKSGTLVEALQVCDQMQRSGCLPNVTTYSILIDGFAKSGHLVGASETWN

Query:  RMISHGCRPNVVAYTCMVDVLCKNLMFDQANSIMEKMILEGCTPNTVTFNTFIKGLCGNGRVEWAMKVLDRMQGHGCLPNITTYNELLDALFRMNKYEEA
        RMISHGCRPNVVAYTCMV+VLCKN MFDQANSIMEKMIL GCTPNTVTFNTFIKGLCGNGRVEWAMKVLDRMQGHG LPNITTYNELLDALFRMNKYEEA
Subjt:  RMISHGCRPNVVAYTCMVDVLCKNLMFDQANSIMEKMILEGCTPNTVTFNTFIKGLCGNGRVEWAMKVLDRMQGHGCLPNITTYNELLDALFRMNKYEEA

Query:  FGLFQEIEERNLQPNLVTYNTILYGFSRAGMLGEALQLFGKVLVGGTAPDPITYNTMIHAYCKQGKVKIAAQLVERVSSMKEWHPDIITYTSLIWGACNW
        FGLFQEIEER+LQPNLVTYNTILYGFSRAGMLGEALQLFGKVLV GT PDPITYNT+IHAYCKQGKVKIAAQLVERVSSM EWHPDIITYTSLIWGACNW
Subjt:  FGLFQEIEERNLQPNLVTYNTILYGFSRAGMLGEALQLFGKVLVGGTAPDPITYNTMIHAYCKQGKVKIAAQLVERVSSMKEWHPDIITYTSLIWGACNW

Query:  INIEEAIAFLYKAINQGICPNFATWNALVRCFFDSLGHMGPIHILDDILRKG
        INIEEAIAFL+KAINQGICPNFATWN LVRCFFDSLGHMGPIHILDDILRKG
Subjt:  INIEEAIAFLYKAINQGICPNFATWNALVRCFFDSLGHMGPIHILDDILRKG

TrEMBL top hitse value%identityAlignment
A0A0A0KCK1 Uncharacterized protein0.0e+0092.48Show/hide
Query:  LPMLIQKPLMPFVLNSSPIINPRDPNKQHLLRESDVLKRLKKDRNLASALEFFSAIANSNAFKHTASTYKVMIERLGRECEMDVVQYILQQMKMDGISCC
        L M IQKPL+PFVLNS+PIINPRDPNKQ LL+ESDVLKRLK DRNL+S L FFSAIANSNAF+HTASTY+VMIERLGRECEMD+VQYILQQMKMDGI+CC
Subjt:  LPMLIQKPLMPFVLNSSPIINPRDPNKQHLLRESDVLKRLKKDRNLASALEFFSAIANSNAFKHTASTYKVMIERLGRECEMDVVQYILQQMKMDGISCC

Query:  EDLFVCIINSYKRVGSAEQALKMFYRIGEFGCKPTVKIYNHLLDALLSENRFQIINPLYTNMKKDGLIPNVFTYNILLKALCKNDRVDAAHKLFVEMSSK
        EDLF+CIIN YKRVGSAEQALKMFYRIGEFGCKPTV+IYNHLLDALLSEN+FQ+INPLYTNMKKDGLIPNVFTYNILLKALCKNDRVDAAHKLFVEMS+K
Subjt:  EDLFVCIINSYKRVGSAEQALKMFYRIGEFGCKPTVKIYNHLLDALLSENRFQIINPLYTNMKKDGLIPNVFTYNILLKALCKNDRVDAAHKLFVEMSSK

Query:  GCPPDVVSYTTMVSSLCKAGKIDDARELAGTFKPSVPVYNALIDGMCKEGRIEVAIKLLGEMMDNGVDPNVVSYSSIINSLCNSGNVELAFALLARMFLR
        GCPPD V+YTTMVSSLCKAGKIDDARELAG FKPSVPVYNALIDGMCKEGRIEVAIKLLGEMMDNGVDPNVVSYS IINSLC SGNVELAFAL A+MFLR
Subjt:  GCPPDVVSYTTMVSSLCKAGKIDDARELAGTFKPSVPVYNALIDGMCKEGRIEVAIKLLGEMMDNGVDPNVVSYSSIINSLCNSGNVELAFALLARMFLR

Query:  GCDANIHTFTPLIKGCFMRGKLYEALDLWKLMIQDGCEPNVVAYNTLIHGLCKSGTLVEALQVCDQMQRSGCLPNVTTYSILIDGFAKSGHLVGASETWN
        GCDANIHTFTPLIKGCFMRGKLYEALDLWKLMIQDGCEPNVVAYNTLIHGLC +G+L EALQVCDQMQRSGCLPNVTTYSILIDGFAKSG LVGASETWN
Subjt:  GCDANIHTFTPLIKGCFMRGKLYEALDLWKLMIQDGCEPNVVAYNTLIHGLCKSGTLVEALQVCDQMQRSGCLPNVTTYSILIDGFAKSGHLVGASETWN

Query:  RMISHGCRPNVVAYTCMVDVLCKNLMFDQANSIMEKMILEGCTPNTVTFNTFIKGLCGNGRVEWAMKVLDRMQGHGCLPNITTYNELLDALFRMNKYEEA
        RMISHGCRPNVV YTCMVDVLCKN MFDQANS++EKM LEGCTPNT+TFNTFIKGLCGNGRVEWAMK+L+RMQGHGCLPNITTYNELLDALFRMNKYEEA
Subjt:  RMISHGCRPNVVAYTCMVDVLCKNLMFDQANSIMEKMILEGCTPNTVTFNTFIKGLCGNGRVEWAMKVLDRMQGHGCLPNITTYNELLDALFRMNKYEEA

Query:  FGLFQEIEERNLQPNLVTYNTILYGFSRAGMLGEALQLFGKVLVGGTAPDPITYNTMIHAYCKQGKVKIAAQLVERVSSMKEWHPDIITYTSLIWGACNW
        FGLFQEIE RNLQPNLVTYNT+LYGFSRAGM+GEALQLFGK LV GTAPD ITYNTMIHAYCKQGKVKIAAQLVERVSSMKEWHPDIITYTSLIWGACNW
Subjt:  FGLFQEIEERNLQPNLVTYNTILYGFSRAGMLGEALQLFGKVLVGGTAPDPITYNTMIHAYCKQGKVKIAAQLVERVSSMKEWHPDIITYTSLIWGACNW

Query:  INIEEAIAFLYKAINQGICPNFATWNALVRCFFDSLGHMGPIHILDDILRKG
        +NIEEA+AFL KAINQGICPNFATWNALVRCFFDSLGHMGPIHILDDILRKG
Subjt:  INIEEAIAFLYKAINQGICPNFATWNALVRCFFDSLGHMGPIHILDDILRKG

A0A1S4E236 pentatricopeptide repeat-containing protein At3g488100.0e+0091.72Show/hide
Query:  LPMLIQKPLMPFVLNSSPIINPRDPNKQHLLRESDVLKRLKKDRNLASALEFFSAIANSNAFKHTASTYKVMIERLGRECEMDVVQYILQQMKMDGISCC
        L M IQKPL+PFVLNS+PIINPRDPNKQ LL+ESDVLKRLK DRNL+SAL FF+AIANSNAF+HTASTY+VMIE+LGRECEMDVVQYILQQMKMDGI+CC
Subjt:  LPMLIQKPLMPFVLNSSPIINPRDPNKQHLLRESDVLKRLKKDRNLASALEFFSAIANSNAFKHTASTYKVMIERLGRECEMDVVQYILQQMKMDGISCC

Query:  EDLFVCIINSYKRVGSAEQALKMFYRIGEFGCKPTVKIYNHLLDALLSENRFQIINPLYTNMKKDGLIPNVFTYNILLKALCKNDRVDAAHKLFVEMSSK
        EDLF+CIIN YKRVGSAEQALK+FYRIGEFGCKPTVKIYNHLLDALLSEN+FQ+INPLY NMKK GLIPNVFTYNILLKALCKN RVDAAHKLFVEMS+K
Subjt:  EDLFVCIINSYKRVGSAEQALKMFYRIGEFGCKPTVKIYNHLLDALLSENRFQIINPLYTNMKKDGLIPNVFTYNILLKALCKNDRVDAAHKLFVEMSSK

Query:  GCPPDVVSYTTMVSSLCKAGKIDDARELAGTFKPSVPVYNALIDGMCKEGRIEVAIKLLGEMMDNGVDPNVVSYSSIINSLCNSGNVELAFALLARMFLR
        GCPPD V+YTTMVSSLCKAGKIDDARELAG FKP+VPVYNALIDGMCKEGRIEVAIKLLGEMMDNGVDPNVVSYSSIINSLC  GNVELAFALLA+MF R
Subjt:  GCPPDVVSYTTMVSSLCKAGKIDDARELAGTFKPSVPVYNALIDGMCKEGRIEVAIKLLGEMMDNGVDPNVVSYSSIINSLCNSGNVELAFALLARMFLR

Query:  GCDANIHTFTPLIKGCFMRGKLYEALDLWKLMIQDGCEPNVVAYNTLIHGLCKSGTLVEALQVCDQMQRSGCLPNVTTYSILIDGFAKSGHLVGASETWN
        GCDANIHTFTPLIKGCFMRGKLYEALDLWKLMI+DGCEPNVVAYNTLIHGLC SG+L EALQV DQMQRSGC+PNVTTYSILIDGFAKSG LVGASETWN
Subjt:  GCDANIHTFTPLIKGCFMRGKLYEALDLWKLMIQDGCEPNVVAYNTLIHGLCKSGTLVEALQVCDQMQRSGCLPNVTTYSILIDGFAKSGHLVGASETWN

Query:  RMISHGCRPNVVAYTCMVDVLCKNLMFDQANSIMEKMILEGCTPNTVTFNTFIKGLCGNGRVEWAMKVLDRMQGHGCLPNITTYNELLDALFRMNKYEEA
        RMISHGCRPNVVAYTCMVDVLCKN MFDQANS++EKM LEGCTPNT+TFNTFIKGLCGNGRVEWAMK+L+RMQGHGCLPNITTYNELLDALFRMNKYEEA
Subjt:  RMISHGCRPNVVAYTCMVDVLCKNLMFDQANSIMEKMILEGCTPNTVTFNTFIKGLCGNGRVEWAMKVLDRMQGHGCLPNITTYNELLDALFRMNKYEEA

Query:  FGLFQEIEERNLQPNLVTYNTILYGFSRAGMLGEALQLFGKVLVGGTAPDPITYNTMIHAYCKQGKVKIAAQLVERVSSMKEWHPDIITYTSLIWGACNW
        FGLFQEIEERNLQPNLVTYNTILYGFSRAGM+GEALQLFGK LV GTAPD ITYNTMIHAYCKQGKVKIAAQLVERVSSMKEW PDIITYT LIWGAC W
Subjt:  FGLFQEIEERNLQPNLVTYNTILYGFSRAGMLGEALQLFGKVLVGGTAPDPITYNTMIHAYCKQGKVKIAAQLVERVSSMKEWHPDIITYTSLIWGACNW

Query:  INIEEAIAFLYKAINQGICPNFATWNALVRCFFDSLGHMGPIHILDDILRKG
        INIEEAIAFL KAINQGICPNFATWNAL+RCFFDSLGHMGPIHILDDILRKG
Subjt:  INIEEAIAFLYKAINQGICPNFATWNALVRCFFDSLGHMGPIHILDDILRKG

A0A498IAB1 Cas1_AcylT domain-containing protein0.0e+0057.82Show/hide
Query:  MALLGPVTHAQVSCLIGFVTIVVAWIYSEYLVFKKKSISSKVRHSDIGLAEVESHTGKEDERAVLLEGGAIKSIPSGVRIPPSFSTFIRFIMLEESFLIE
        M +  PVT AQVS L+G + I+ AWIY+E+L +KK S+SSK+ +S++ L E+E    KED+RAVLLEGG ++S     RIP  +S   RF++++ SFL++
Subjt:  MALLGPVTHAQVSCLIGFVTIVVAWIYSEYLVFKKKSISSKVRHSDIGLAEVESHTGKEDERAVLLEGGAIKSIPSGVRIPPSFSTFIRFIMLEESFLIE

Query:  NRMLLRAASELGLLLAYFYICDRTDIFGTATK------------------------------------------------------VIFLMYHYFNAREI
        NR+ LRA SE  +LL+Y Y+CDRTD F  +TK                                                      V+FL+YHYF A EI
Subjt:  NRMLLRAASELGLLLAYFYICDRTDIFGTATK------------------------------------------------------VIFLMYHYFNAREI

Query:  YNVGRVCVAAYVWMTGFGNFSYYYARKDFSLSRFAQLHVILHLPLAHFFSLVVYGIIGILNKYNQIRAVIAVKFFVCFLVVILVWEIPGVFDVLWEPFTF
        YN  RV +AAYVWMTGFGNFSYYY RKDFSL+ +   ++    P+   F+L+VYG +GI+NKYN+  +VIAVK   CFLVVILVWEIPGVF++LW PFTF
Subjt:  YNVGRVCVAAYVWMTGFGNFSYYYARKDFSLSRFAQLHVILHLPLAHFFSLVVYGIIGILNKYNQIRAVIAVKFFVCFLVVILVWEIPGVFDVLWEPFTF

Query:  LLGYKDPNRLEEHLPLMYEWHFRTGLDRYIWILGMIYAYYYSTVENWIEKLDEAKLKLRILIKTTIVVTSATAAYLWFEYIFKLDSITYNKYHPYTSWIP
        LLGY DP   +  LPL++EW FR+GLDRYIWI+GMIYAYY+ TVE W+EKL+E++ K +  IK+ + + + T  YLWFEYI+K+D +TYNKYHPYTSWIP
Subjt:  LLGYKDPNRLEEHLPLMYEWHFRTGLDRYIWILGMIYAYYYSTVENWIEKLDEAKLKLRILIKTTIVVTSATAAYLWFEYIFKLDSITYNKYHPYTSWIP

Query:  ITAYICIRNVSQSTRSYTLTLFGWIGKISLDTYISQFHIWLRSNAPDAQPKKLLIIIPNYPLLNFMFTTIVFMTISYRLSELTNTFKMAFVPSKDNKRIM
        IT YIC+RNV+Q  RSY+LTLF W+GKI+L+TYI QFH+WLRS+ PDAQPK LL +IP+YP+LNFM T+ +++ IS RL +LTNT K+AFVPSKDNKR++
Subjt:  ITAYICIRNVSQSTRSYTLTLFGWIGKISLDTYISQFHIWLRSNAPDAQPKKLLIIIPNYPLLNFMFTTIVFMTISYRLSELTNTFKMAFVPSKDNKRIM

Query:  HNMITAAAIMGILYSFSFVFLKVLPML----------------------------------IQKPLMPFVLN-SSPII--NPR---DPNK-QHLLRESDV
         N++TA A+   LY  SFVFLKV  M+                                  + KP +PFVLN ++PI+  NP+   +PN+ Q  L+ESDV
Subjt:  HNMITAAAIMGILYSFSFVFLKVLPML----------------------------------IQKPLMPFVLN-SSPII--NPR---DPNK-QHLLRESDV

Query:  LKRLKKDRNLASALEFFSAIANSNAFKHTASTYKVMIERLGRECEMDVVQYILQQMKMDGISCCEDLFVCIINSYKRVGSAEQALKMFYRIGEFGCKPTV
        L+RLK + +L SALE+F +IANS AF+HT  TY  MIE+LGR+CEMD VQY+L QMK++G+ C E+LF+C+I+SY+R G AEQALKMFYRI EFGCK TV
Subjt:  LKRLKKDRNLASALEFFSAIANSNAFKHTASTYKVMIERLGRECEMDVVQYILQQMKMDGISCCEDLFVCIINSYKRVGSAEQALKMFYRIGEFGCKPTV

Query:  KIYNHLLDALLSENRFQIINPLYTNMKKDGLIPNVFTYNILLKALCKNDRVDAAHKLFVEMSSKGCPPDVVSYTTMVSSLCKAGKIDDARELAGTFKPSV
        KIYNHLLDALLSENRFQ+INP+Y+NMKKDG+ PNV+TYNILLKALCKNDRVD AHKL VEMS KGC PD VSYTT+VS+LC+ GK+++ARELAG F+P V
Subjt:  KIYNHLLDALLSENRFQIINPLYTNMKKDGLIPNVFTYNILLKALCKNDRVDAAHKLFVEMSSKGCPPDVVSYTTMVSSLCKAGKIDDARELAGTFKPSV

Query:  PVYNALIDGMCKEGRIEVAIKLLGEMMDNGVDPNVVSYSSIINSLCNSGNVELAFALLARMFLRGCDANIHTFTPLIKGCFMRGKLYEALDLWKLMIQDG
        PVYNAL++G+CKE +IE A+KLL EM+D G++PNV++YS+IINSL ++ NVE A A+LA+M +RGC  N+HTFT LIKG F+ G+++EALDLW  MI + 
Subjt:  PVYNALIDGMCKEGRIEVAIKLLGEMMDNGVDPNVVSYSSIINSLCNSGNVELAFALLARMFLRGCDANIHTFTPLIKGCFMRGKLYEALDLWKLMIQDG

Query:  CEPNVVAYNTLIHGLCKSGTLVEALQVCDQMQRSGCLPNVTTYSILIDGFAKSGHLVGASETWNRMISHGCRPNVVAYTCMVDVLCKNLMFDQANSIMEK
         +PN++AY TL+HGLC +G + +A+ VC +M ++GC PNVTTYS LIDGFAK G LVGAS+TWN M++ GCRPNV+AYTCM+DVLC+N MF QA+S++E 
Subjt:  CEPNVVAYNTLIHGLCKSGTLVEALQVCDQMQRSGCLPNVTTYSILIDGFAKSGHLVGASETWNRMISHGCRPNVVAYTCMVDVLCKNLMFDQANSIMEK

Query:  MILEGCTPNTVTFNTFIKGLCGNGRVEWAMKVLDRMQGHGCLPNITTYNELLDALFRMNKYEEAFGLFQEIEERNLQPNLVTYNTILYGFSRAGMLGEAL
        M  EGC PNTVTFNTFIKGLC +G+V+WA+ +LD+M+ +GCLPNITTYNELLD LF++N++EEA+G+ +EIEER +  NLVTYNTIL GF  AGM  EA+
Subjt:  MILEGCTPNTVTFNTFIKGLCGNGRVEWAMKVLDRMQGHGCLPNITTYNELLDALFRMNKYEEAFGLFQEIEERNLQPNLVTYNTILYGFSRAGMLGEAL

Query:  QLFGKVLVGGTAPDPITYNTMIHAYCKQGKVKIAAQLVERVSSMKEWHPDIITYTSLIWGACNWINIEEAIAFLYK
        QL GK+LV GT PD ITYNT+I+A CK+G +  A QL +R+ + KEW PD++ YTSL+WG CNW+ ++EA+ +L K
Subjt:  QLFGKVLVGGTAPDPITYNTMIHAYCKQGKVKIAAQLVERVSSMKEWHPDIITYTSLIWGACNWINIEEAIAFLYK

A0A5D3BV38 Pentatricopeptide repeat-containing protein0.0e+0091.72Show/hide
Query:  LPMLIQKPLMPFVLNSSPIINPRDPNKQHLLRESDVLKRLKKDRNLASALEFFSAIANSNAFKHTASTYKVMIERLGRECEMDVVQYILQQMKMDGISCC
        L M IQKPL+PFVLNS+PIINPRDPNKQ LL+ESDVLKRLK DRNL+SAL FF+AIANSNAF+HTASTY+VMIE+LGRECEMDVVQYILQQMKMDGI+CC
Subjt:  LPMLIQKPLMPFVLNSSPIINPRDPNKQHLLRESDVLKRLKKDRNLASALEFFSAIANSNAFKHTASTYKVMIERLGRECEMDVVQYILQQMKMDGISCC

Query:  EDLFVCIINSYKRVGSAEQALKMFYRIGEFGCKPTVKIYNHLLDALLSENRFQIINPLYTNMKKDGLIPNVFTYNILLKALCKNDRVDAAHKLFVEMSSK
        EDLF+CIIN YKRVGSAEQALK+FYRIGEFGCKPTVKIYNHLLDALLSEN+FQ+INPLY NMKK GLIPNVFTYNILLKALCKN RVDAAHKLFVEMS+K
Subjt:  EDLFVCIINSYKRVGSAEQALKMFYRIGEFGCKPTVKIYNHLLDALLSENRFQIINPLYTNMKKDGLIPNVFTYNILLKALCKNDRVDAAHKLFVEMSSK

Query:  GCPPDVVSYTTMVSSLCKAGKIDDARELAGTFKPSVPVYNALIDGMCKEGRIEVAIKLLGEMMDNGVDPNVVSYSSIINSLCNSGNVELAFALLARMFLR
        GCPPD V+YTTMVSSLCKAGKIDDARELAG FKP+VPVYNALIDGMCKEGRIEVAIKLLGEMMDNGVDPNVVSYSSIINSLC  GNVELAFALLA+MF R
Subjt:  GCPPDVVSYTTMVSSLCKAGKIDDARELAGTFKPSVPVYNALIDGMCKEGRIEVAIKLLGEMMDNGVDPNVVSYSSIINSLCNSGNVELAFALLARMFLR

Query:  GCDANIHTFTPLIKGCFMRGKLYEALDLWKLMIQDGCEPNVVAYNTLIHGLCKSGTLVEALQVCDQMQRSGCLPNVTTYSILIDGFAKSGHLVGASETWN
        GCDANIHTFTPLIKGCFMRGKLYEALDLWKLMI+DGCEPNVVAYNTLIHGLC SG+L EALQV DQMQRSGC+PNVTTYSILIDGFAKSG LVGASETWN
Subjt:  GCDANIHTFTPLIKGCFMRGKLYEALDLWKLMIQDGCEPNVVAYNTLIHGLCKSGTLVEALQVCDQMQRSGCLPNVTTYSILIDGFAKSGHLVGASETWN

Query:  RMISHGCRPNVVAYTCMVDVLCKNLMFDQANSIMEKMILEGCTPNTVTFNTFIKGLCGNGRVEWAMKVLDRMQGHGCLPNITTYNELLDALFRMNKYEEA
        RMISHGCRPNVVAYTCMVDVLCKN MFDQANS++EKM LEGCTPNT+TFNTFIKGLCGNGRVEWAMK+L+RMQGHGCLPNITTYNELLDALFRMNKYEEA
Subjt:  RMISHGCRPNVVAYTCMVDVLCKNLMFDQANSIMEKMILEGCTPNTVTFNTFIKGLCGNGRVEWAMKVLDRMQGHGCLPNITTYNELLDALFRMNKYEEA

Query:  FGLFQEIEERNLQPNLVTYNTILYGFSRAGMLGEALQLFGKVLVGGTAPDPITYNTMIHAYCKQGKVKIAAQLVERVSSMKEWHPDIITYTSLIWGACNW
        FGLFQEIEERNLQPNLVTYNTILYGFSRAGM+GEALQLFGK LV GTAPD ITYNTMIHAYCKQGKVKIAAQLVERVSSMKEW PDIITYT LIWGAC W
Subjt:  FGLFQEIEERNLQPNLVTYNTILYGFSRAGMLGEALQLFGKVLVGGTAPDPITYNTMIHAYCKQGKVKIAAQLVERVSSMKEWHPDIITYTSLIWGACNW

Query:  INIEEAIAFLYKAINQGICPNFATWNALVRCFFDSLGHMGPIHILDDILRKG
        INIEEAIAFL KAINQGICPNFATWNAL+RCFFDSLGHMGPIHILDDILRKG
Subjt:  INIEEAIAFLYKAINQGICPNFATWNALVRCFFDSLGHMGPIHILDDILRKG

A0A7J6E770 Cas1_AcylT domain-containing protein0.0e+0055.7Show/hide
Query:  MALLGPVTHAQVSCLIGFVTIVVAWIYSEYLVFKKKSISSKVRHSDIGLAEVESHTGKEDERAVLLEGGAIKSIPSGVRIPPSFSTFIRFIMLEESFLIE
        M +   VT  QVS L+G V ++VAWIY+E L +KK S+S K RHSDI L  V     K+D+RAVLLE G +++      I  S S+ +RF+++EESFL+E
Subjt:  MALLGPVTHAQVSCLIGFVTIVVAWIYSEYLVFKKKSISSKVRHSDIGLAEVESHTGKEDERAVLLEGGAIKSIPSGVRIPPSFSTFIRFIMLEESFLIE

Query:  NRMLLRAASELGLLLAYFYICDRTDIFGTATK-----------------------------------------------------VIFLMYHYFNAREIY
        NR+ LRA SE G  L Y+Y+CDRTD F ++ K                                                     V+FLMYHYF A+EIY
Subjt:  NRMLLRAASELGLLLAYFYICDRTDIFGTATK-----------------------------------------------------VIFLMYHYFNAREIY

Query:  NVGRVCVAAYVWMTGFGNFSYYYARKDFSLSRFAQLH-----------VILH--------LPLAHFFSLVVYGIIGILNKYNQIRAVIAVKFFVCFLVVI
        N  RV +AAYVWMTGFGNFSYYY RKDFS++RFAQ+            V+L+         P+   F+L+VYG +GILNK+N+  +VIA+K   CFLVVI
Subjt:  NVGRVCVAAYVWMTGFGNFSYYYARKDFSLSRFAQLH-----------VILH--------LPLAHFFSLVVYGIIGILNKYNQIRAVIAVKFFVCFLVVI

Query:  LVWEIPGVFDVLWEPFTFLLGYKDPNRLEEHLPLMYEWHFRTGLDRYIWILGMIYAYYYSTVENWIEKLDEAKLKLRILIKTTIVVTSATAAYLWFEYIF
        +VWEIPGVF+++W P TFLLGYKDP    ++ PL  EW+FR+GLDRYIWI+GMIYAYY+ TVE W+EKL+EA +K R+ IK  I   + T  YLWFEYI+
Subjt:  LVWEIPGVFDVLWEPFTFLLGYKDPNRLEEHLPLMYEWHFRTGLDRYIWILGMIYAYYYSTVENWIEKLDEAKLKLRILIKTTIVVTSATAAYLWFEYIF

Query:  KLDSITYNKYHPYTSWIPITAYICIRNVSQSTRSYTLTLFGWIGKISLDTYISQFHIWLRSNAPDAQPKKLLIIIPNYPLLNFMFTTIVFMTISYRLSEL
        KLD +TYNKYHPYTSWIPIT YIC+RNV+Q  R Y+LTLF W+GKI+L+TYISQ HIWLRS  PD QPK LL +IPNYP+LNFM TT +++ ISYRL +L
Subjt:  KLDSITYNKYHPYTSWIPITAYICIRNVSQSTRSYTLTLFGWIGKISLDTYISQFHIWLRSNAPDAQPKKLLIIIPNYPLLNFMFTTIVFMTISYRLSEL

Query:  TNTFKMAFVPSKDNKRIMHNMITAAAIMGILYSFSFVFLKVLPML-------------------------------------------------------
        TNT K+AFVPSKD+KR++HNMIT AAI   LY+ SFVFL V  ML                                                       
Subjt:  TNTFKMAFVPSKDNKRIMHNMITAAAIMGILYSFSFVFLKVLPML-------------------------------------------------------

Query:  --------------------IQKPLMPFVL--NSSPIINPRD-PNKQHL----LRESDVLKRLKKDRNLASALEFFSAIANSNAFKHTASTYKVMIERLG
                            + KP +PFVL  N++PI N  D P  Q +    L E DVL++LK++R+++SA+E+F ++ANS AFKHT  TY  MIE+LG
Subjt:  --------------------IQKPLMPFVL--NSSPIINPRD-PNKQHL----LRESDVLKRLKKDRNLASALEFFSAIANSNAFKHTASTYKVMIERLG

Query:  RECEMDVVQYILQQMKMDGISCCEDLFVCIINSYKRVGSAEQALKMFYRIGEFGCKPTVKIYNHLLDALLSENRFQIINPLYTNMKKDGLIPNVFTYNIL
         + +MD VQY+LQQMK++GI C EDLFV +INSYKR GSAEQALKMFYRI EFGCKPTV+IYNH+LDALL ENRF +INP+Y NMKKDG  PN+FTYNIL
Subjt:  RECEMDVVQYILQQMKMDGISCCEDLFVCIINSYKRVGSAEQALKMFYRIGEFGCKPTVKIYNHLLDALLSENRFQIINPLYTNMKKDGLIPNVFTYNIL

Query:  LKALCKNDRVDAAHKLFVEMSSKGCPPDVVSYTTMVSSLCKAGKIDDARELAGTFKPSVPVYNALIDGMCKEGRIEVAIKLLGEMMDNGVDPNVVSYSSI
        LKALCKNDRVD A KL  EMS KGC PD VSYTT+VSSLCK GK+++A+ LA  F+P +PVYNALI+G+CKE +I+   +LL EM+D G+DPNV++YS+I
Subjt:  LKALCKNDRVDAAHKLFVEMSSKGCPPDVVSYTTMVSSLCKAGKIDDARELAGTFKPSVPVYNALIDGMCKEGRIEVAIKLLGEMMDNGVDPNVVSYSSI

Query:  INSLCNSGNVELAFALLARMFLRGCDANIHTFTPLIKGCFMRGKLYEALDLWKLMIQDGCEPNVVAYNTLIHGLCKSGTLVEALQVCDQMQRSGCLPNVT
        I+SL + G+V L+FA+LA+MF++GC  N+HTFT LIKG F+ G+ +EAL LW  MI DG EPNVVAYNTLIHGLC SG +  A+ V  QM+RSGC PNV 
Subjt:  INSLCNSGNVELAFALLARMFLRGCDANIHTFTPLIKGCFMRGKLYEALDLWKLMIQDGCEPNVVAYNTLIHGLCKSGTLVEALQVCDQMQRSGCLPNVT

Query:  TYSILIDGFAKSGHLVGASETWNRMISHGCRPNVVAYTCMVDVLCKNLMFDQANSIMEKMILEGCTPNTVTFNTFIKGLCGNGRVEWAMKVLDRM-QGHG
         YS LIDGFAK G LVGASETWNRM++ GCRPNVVAYTCMVDVLC+N MF++A  ++EKM  +GC PNT+TFNT IKGLCG+GR +WA++VLD M   +G
Subjt:  TYSILIDGFAKSGHLVGASETWNRMISHGCRPNVVAYTCMVDVLCKNLMFDQANSIMEKMILEGCTPNTVTFNTFIKGLCGNGRVEWAMKVLDRM-QGHG

Query:  CLPNITTYNELLDALFRMNKYEEAFGLFQEIEERNLQPNLVTYNTILYGFSRAGMLGEALQLFGKVLVGGTAPDPITYNTMIHAYCKQGKVKIAAQLVER
        CLPNITTYNELLD  F++ +++EAF L +++EE+ ++ NLVTYNTIL GF  AG+  +ALQL GK++V G   D ITYNT+++AYCKQGKV+ A QL+ER
Subjt:  CLPNITTYNELLDALFRMNKYEEAFGLFQEIEERNLQPNLVTYNTILYGFSRAGMLGEALQLFGKVLVGGTAPDPITYNTMIHAYCKQGKVKIAAQLVER

Query:  VSSMKEWHPDIITYTSLIWGACNWINIEEAIAFLYKAINQGICPNFATWNALVRCFFDSLGHMGPIHILDDIL
        V    EWHPD++TYTSL+WG C+ I ++EA+  L+K +++ I PN ATWN LVRC FDSLGH+GPI+I+D+IL
Subjt:  VSSMKEWHPDIITYTSLIWGACNWINIEEAIAFLYKAINQGICPNFATWNALVRCFFDSLGHMGPIHILDDIL

SwissProt top hitse value%identityAlignment
Q0WW17 Protein REDUCED WALL ACETYLATION 21.3e-15450.27Show/hide
Query:  MALLGPVTHAQVSCLIGFVTIVVAWIYSEYLVFKKKSISSKVRHSDIGLAEVESHTGKEDERAVLLE-GGAIKSIPSGVRIPPSFSTFIRFIMLEESFLI
        MA   PVT   +S + G V ++VAW+YSEYL + K S+S+K RHSD+ L E+     KED++A+L+E GG ++S     + P + S  IRF++L+ESFL+
Subjt:  MALLGPVTHAQVSCLIGFVTIVVAWIYSEYLVFKKKSISSKVRHSDIGLAEVESHTGKEDERAVLLE-GGAIKSIPSGVRIPPSFSTFIRFIMLEESFLI

Query:  ENRMLLRAASELGLLLAYFYICDRTDIFGTATK-----------------------------------------------------VIFLMYHYFNAREI
        ENR+ LRA  E  +L+ YFYICDRTD+F ++ K                                                     V+FLMYHYF A E 
Subjt:  ENRMLLRAASELGLLLAYFYICDRTDIFGTATK-----------------------------------------------------VIFLMYHYFNAREI

Query:  YNVGRVCVAAYVWMTGFGNFSYYYARKDFSLSRFAQLH-----------VILH--------LPLAHFFSLVVYGIIGILNKYNQIRAVIAVKFFVCFLVV
        YN  RV +A YVWMTGFGNFSYYY RKDFSL+RFAQ+            ++L+         P+   F+L+VYG +GI++KYN++ +VIA KFF CF+VV
Subjt:  YNVGRVCVAAYVWMTGFGNFSYYYARKDFSLSRFAQLH-----------VILH--------LPLAHFFSLVVYGIIGILNKYNQIRAVIAVKFFVCFLVV

Query:  ILVWEIPGVFDVLWEPFTFLLGYKDPNRLEEHLPLMYEWHFRTGLDRYIWILGMIYAYYYSTVENWIEKLDEAKLKLRILIKTTIVVTSATAAYLWFEYI
        I+VWEIPGVF+ +W PFT L+GY DP   +  LPL++EWHFR+GLDRYIWI+GM+YAYY+ TVE+W++KL+EA++K R+ IKT++ + + T  Y W+EYI
Subjt:  ILVWEIPGVFDVLWEPFTFLLGYKDPNRLEEHLPLMYEWHFRTGLDRYIWILGMIYAYYYSTVENWIEKLDEAKLKLRILIKTTIVVTSATAAYLWFEYI

Query:  FKLDSITYNKYHPYTSWIPITAYICIRNVSQSTRSYTLTLFGWIGKISLDTYISQFHIWLRSNAPDAQPKKLLIIIPNYPLLNFMFTTIVFMTISYRLSE
        +K+D +TYNKYHPYTSWIPIT YIC+RN++QS R Y+LTL  W+GKI+L+TYISQFHIWLRS  PD QPK LL ++P+YPLLNFM TT +++ ISYRL E
Subjt:  FKLDSITYNKYHPYTSWIPITAYICIRNVSQSTRSYTLTLFGWIGKISLDTYISQFHIWLRSNAPDAQPKKLLIIIPNYPLLNFMFTTIVFMTISYRLSE

Query:  LTNTFKMAFVPSKDNKRIMHNMITAAAIMGILYSFSFVFLKVLPMLI
        LTNT K AF+P+KD+KR+++N I+A  I   LY FSF+ + +   L+
Subjt:  LTNTFKMAFVPSKDNKRIMHNMITAAAIMGILYSFSFVFLKVLPMLI

Q66GQ5 Protein REDUCED WALL ACETYLATION 36.0e-14750.46Show/hide
Query:  MALLGPVTHAQVSCLIGFVTIVVAWIYSEYLVFKKKSISSKVRHSDIGLAEVESHTGKEDERAVLLEGGAIKSIPSGVRIPPSFSTFIRFIMLEESFLIE
        MA   P+T  QVS L+G + + +AWIYSE+L +K+ S+ SKV HSD  L E+     KEDE  VLLEGG  +S+ +     P  +  IRF+ LE+SFLIE
Subjt:  MALLGPVTHAQVSCLIGFVTIVVAWIYSEYLVFKKKSISSKVRHSDIGLAEVESHTGKEDERAVLLEGGAIKSIPSGVRIPPSFSTFIRFIMLEESFLIE

Query:  NRMLLRAASELGLLLAYFYICDRTDIFGTATK-----------------------------------------------------VIFLMYHYFNAREIY
        NR  LRA +E G +L YFYI DRT + G + K                                                     V+FLMYHYF A EIY
Subjt:  NRMLLRAASELGLLLAYFYICDRTDIFGTATK-----------------------------------------------------VIFLMYHYFNAREIY

Query:  NVGRVCVAAYVWMTGFGNFSYYYARKDFSLSRFAQLHVILHL-------------------PLAHFFSLVVYGIIGILNKYNQIRAVIAVKFFVCFLVVI
        N  RV +AAYVWMTGFGNFSYYY RKDFSL+RF Q+   L+L                   P+   F+L+VYG +GI ++YN+I +V+A+K   CFLVVI
Subjt:  NVGRVCVAAYVWMTGFGNFSYYYARKDFSLSRFAQLHVILHL-------------------PLAHFFSLVVYGIIGILNKYNQIRAVIAVKFFVCFLVVI

Query:  LVWEIPGVFDVLWEPFTFLLGYKDPNRLEEHLPLMYEWHFRTGLDRYIWILGMIYAYYYSTVENWIEKLDEAKLKLRILIKTTIVVTSATAAYLWFEYIF
        ++WEIPGVF++ W P TFLLGY DP + E  LPL++EWHFR+GLDRYIWI+GMIYAY++ TVE W+EKL+E   K ++ IKT+I+  S+   YLW+EYI+
Subjt:  LVWEIPGVFDVLWEPFTFLLGYKDPNRLEEHLPLMYEWHFRTGLDRYIWILGMIYAYYYSTVENWIEKLDEAKLKLRILIKTTIVVTSATAAYLWFEYIF

Query:  KLDSITYNKYHPYTSWIPITAYICIRNVSQSTRSYTLTLFGWIGKISLDTYISQFHIWLRSNAPDAQPKKLLIIIPNYPLLNFMFTTIVFMTISYRLSEL
        KLD +TYNKYHPYTSWIPIT YIC+RN +Q  R++++TLF W+GKI+L+TYISQFHIWLRSN P+ QPK LL IIP YP+LNFM  T +++ +S+RL EL
Subjt:  KLDSITYNKYHPYTSWIPITAYICIRNVSQSTRSYTLTLFGWIGKISLDTYISQFHIWLRSNAPDAQPKKLLIIIPNYPLLNFMFTTIVFMTISYRLSEL

Query:  TNTFKMAFVPSKDNKRIMHNMITAAAIMGILYSFSFVFLKV
        TNT K  F+P+KD+KR++HN++  AAI   LY  S + L++
Subjt:  TNTFKMAFVPSKDNKRIMHNMITAAAIMGILYSFSFVFLKV

Q8L7C8 Protein REDUCED WALL ACETYLATION 11.1e-14049.53Show/hide
Query:  GPVTHAQVSCLIGFVTIVVAWIYSEYLVFKKKSISSKVRHSDIGLAEVESHTGKEDERAVLLEGGAIKSIPSGVRIPPSFSTFIRFIMLEESFLIENRML
        GP+T  QVS L+G + I V WIYSE L ++K  +  K  HSD  L E+     K+D++A LLEGG  +S           +  IRF+ +E+SFL+E+R  
Subjt:  GPVTHAQVSCLIGFVTIVVAWIYSEYLVFKKKSISSKVRHSDIGLAEVESHTGKEDERAVLLEGGAIKSIPSGVRIPPSFSTFIRFIMLEESFLIENRML

Query:  LRAASELGLLLAYFYICDRTDIFGTATK-----------------------------------------------------VIFLMYHYFNAREIYNVGR
        LRA SE G +L YFYICDRT++ G +TK                                                     V+FLMYHYF A EIYN  R
Subjt:  LRAASELGLLLAYFYICDRTDIFGTATK-----------------------------------------------------VIFLMYHYFNAREIYNVGR

Query:  VCVAAYVWMTGFGNFSYYYARKDFSLSRFAQLH-----------VILH--------LPLAHFFSLVVYGIIGILNKYNQIRAVIAVKFFVCFLVVILVWE
        + +AAYVWMTGFGNFSYYY RKDFS++RFAQ+            ++L+         P+   F+L+VYG +GI +KYN+I +V+A+K F CFLVV L+WE
Subjt:  VCVAAYVWMTGFGNFSYYYARKDFSLSRFAQLH-----------VILH--------LPLAHFFSLVVYGIIGILNKYNQIRAVIAVKFFVCFLVVILVWE

Query:  IPGVFDVLWEPFTFLLGYKDPNRLEEHLPLMYEWHFRTGLDRYIWILGMIYAYYYSTVENWIEKLDEAKLKLRILIKTTIVVTSATAAYLWFEYIFKLDS
        IPG F++ W P TFLLGY DP + + H   ++EWHFR+GLDRYIWI+GMIYAYY+ TVE W+EKL++ + K R+ IK  IV  +    Y+W+E I+KLD 
Subjt:  IPGVFDVLWEPFTFLLGYKDPNRLEEHLPLMYEWHFRTGLDRYIWILGMIYAYYYSTVENWIEKLDEAKLKLRILIKTTIVVTSATAAYLWFEYIFKLDS

Query:  ITYNKYHPYTSWIPITAYICIRNVSQSTRSYTLTLFGWIGKISLDTYISQFHIWLRSNAPDAQPKKLLIIIPNYPLLNFMFTTIVFMTISYRLSELTNTF
         +YN YHPYTSWIPIT YIC+RN +   RS +LTLF W+GKI+L+TYISQFHIWLRSN PD QPK LL IIP YP+LNFM TT +++ +S+RL ELTNT 
Subjt:  ITYNKYHPYTSWIPITAYICIRNVSQSTRSYTLTLFGWIGKISLDTYISQFHIWLRSNAPDAQPKKLLIIIPNYPLLNFMFTTIVFMTISYRLSELTNTF

Query:  KMAFVPSKDNKRIMHNMITAAAIMGILYSFSFVFLKV
        K  FVP+KDNKR+  N I   AI   LY FSFV L++
Subjt:  KMAFVPSKDNKRIMHNMITAAAIMGILYSFSFVFLKV

Q9FXG3 Protein REDUCED WALL ACETYLATION 44.3e-14549.91Show/hide
Query:  MALLGPVTHAQVSCLIGFVTIVVAWIYSEYLVFKKKSISSKVRHSDIGLAEVESHTGKEDERAVLLEGGAIKSIPSGVRIPPSFSTFIRFIMLEESFLIE
        M +  P+T  QVS L+G + +++AW+YSE+L +++ S  +KV HSD  L E+E  T KEDE  VL+EGG  +S  S     P  +  IRF+ LE+SFL+E
Subjt:  MALLGPVTHAQVSCLIGFVTIVVAWIYSEYLVFKKKSISSKVRHSDIGLAEVESHTGKEDERAVLLEGGAIKSIPSGVRIPPSFSTFIRFIMLEESFLIE

Query:  NRMLLRAASELGLLLAYFYICDRTDIFGTATK-----------------------------------------------------VIFLMYHYFNAREIY
        NR  LRA +E G +L YFYICDRT + G + K                                                     V+FLMYHYF A E Y
Subjt:  NRMLLRAASELGLLLAYFYICDRTDIFGTATK-----------------------------------------------------VIFLMYHYFNAREIY

Query:  NVGRVCVAAYVWMTGFGNFSYYYARKDFSLSRFAQLH-----------VILH--------LPLAHFFSLVVYGIIGILNKYNQIRAVIAVKFFVCFLVVI
        N  RV +A YVWMTGFGNFSYYY RKDFSL+RF Q+            +IL+         P+   F+L+VYG +GI ++YN+I +V+A+K   CFLVVI
Subjt:  NVGRVCVAAYVWMTGFGNFSYYYARKDFSLSRFAQLH-----------VILH--------LPLAHFFSLVVYGIIGILNKYNQIRAVIAVKFFVCFLVVI

Query:  LVWEIPGVFDVLWEPFTFLLGYKDPNRLEEHLPLMYEWHFRTGLDRYIWILGMIYAYYYSTVENWIEKLDEAKLKLRILIKTTIVVTSATAAYLWFEYIF
        L+WEIPGVF++ W P  FLLGY DP + +  LP ++EWHFR+GLDRYIWI+GMIYAY++ TVE W+EKL+E   K R+ IKT+I+  S+ A YLW+EYI+
Subjt:  LVWEIPGVFDVLWEPFTFLLGYKDPNRLEEHLPLMYEWHFRTGLDRYIWILGMIYAYYYSTVENWIEKLDEAKLKLRILIKTTIVVTSATAAYLWFEYIF

Query:  KLDSITYNKYHPYTSWIPITAYICIRNVSQSTRSYTLTLFGWIGKISLDTYISQFHIWLRSNAPDAQPKKLLIIIPNYPLLNFMFTTIVFMTISYRLSEL
        KLD +TYNKYHPYTSWIPIT YIC+RN +Q  R ++LTLF W+GKI+L+TYISQFHIWLRS+ P+ QPK LL IIP YP+LNFM TT +++ +S RL EL
Subjt:  KLDSITYNKYHPYTSWIPITAYICIRNVSQSTRSYTLTLFGWIGKISLDTYISQFHIWLRSNAPDAQPKKLLIIIPNYPLLNFMFTTIVFMTISYRLSEL

Query:  TNTFKMAFVPSKDNKRIMHNMITAAAIMGILYSFSFVFLKV
        TNT K  F+P+KD+KR++HN+I  AAI   LY    + L +
Subjt:  TNTFKMAFVPSKDNKRIMHNMITAAAIMGILYSFSFVFLKV

Q9M302 Pentatricopeptide repeat-containing protein At3g488106.5e-20252.74Show/hide
Query:  LPMLIQKPLMPFVLNSSPIIN---PRDPNKQHLLRESDVLKRLKKDRNLASALEFFSAIANSNAFKHTASTYKVMIERLGRECEMDVVQYILQQMKMDGI
        L + +QKPL+PFVLN++  +N      PN    ++E DV+KRL+++  +  AL FF +IANSN FKHT  T++VMI +L  + ++D VQY+LQQMK+ G 
Subjt:  LPMLIQKPLMPFVLNSSPIIN---PRDPNKQHLLRESDVLKRLKKDRNLASALEFFSAIANSNAFKHTASTYKVMIERLGRECEMDVVQYILQQMKMDGI

Query:  SCCEDLFVCIINSYKRVGSAEQALKMFYRIGEFGCKPTVKIYNHLLDALLSENRFQIINPLYTNMKKDGLIPNVFTYNILLKALCKNDRVDAAHKLFVEM
         C EDLF+ +I+ Y++VG AE+A++MFYRI EFGC P+VKIYNH+LD LL ENR Q+I  +Y +MK+DG  PNVFTYN+LLKALCKN++VD A KL VEM
Subjt:  SCCEDLFVCIINSYKRVGSAEQALKMFYRIGEFGCKPTVKIYNHLLDALLSENRFQIINPLYTNMKKDGLIPNVFTYNILLKALCKNDRVDAAHKLFVEM

Query:  SSKGCPPDVVSYTTMVSSLCKAGKIDDARELAGTFKPSVPVYNALIDGMCKEGRIEVAIKLLGEMMDNGVDPNVVSYSSIINSLCNSGNVELAFALLARM
        S+KGC PD VSYTT++SS+C+ G + + RELA  F+P V VYNALI+G+CKE   + A +L+ EM++ G+ PNV+SYS++IN LCNSG +ELAF+ L +M
Subjt:  SSKGCPPDVVSYTTMVSSLCKAGKIDDARELAGTFKPSVPVYNALIDGMCKEGRIEVAIKLLGEMMDNGVDPNVVSYSSIINSLCNSGNVELAFALLARM

Query:  FLRGCDANIHTFTPLIKGCFMRGKLYEALDLWKLMIQD-GCEPNVVAYNTLIHGLCKSGTLVEALQVCDQMQRSGCLPNVTTYSILIDGFAKSGHLVGAS
          RGC  NI+T + L+KGCF+RG  ++ALDLW  MI+  G +PNVVAYNTL+ G C  G +V+A+ V   M+  GC PN+ TY  LI+GFAK G L GA 
Subjt:  FLRGCDANIHTFTPLIKGCFMRGKLYEALDLWKLMIQD-GCEPNVVAYNTLIHGLCKSGTLVEALQVCDQMQRSGCLPNVTTYSILIDGFAKSGHLVGAS

Query:  ETWNRMISHGCRPNVVAYTCMVDVLCKNLMFDQANSIMEKMILEGCTPNTVTFNTFIKGLCGNGRVEWAMKVLDRM-QGHGCLPNITTYNELLDALFRMN
          WN+M++ GC PNVV YT MV+ LC++  F +A S++E M  E C P+  TFN FIKGLC  GR++WA KV  +M Q H C PNI TYNELLD L + N
Subjt:  ETWNRMISHGCRPNVVAYTCMVDVLCKNLMFDQANSIMEKMILEGCTPNTVTFNTFIKGLCGNGRVEWAMKVLDRM-QGHGCLPNITTYNELLDALFRMN

Query:  KYEEAFGLFQEIEERNLQPNLVTYNTILYGFSRAGMLGEALQLFGKVLVGGTAPDPITYNTMIHAYCKQGKVKIAAQLVERVS-SMKEWHPDIITYTSLI
        + EEA+GL +EI  R ++ +  TYNT+L+G   AG+ G ALQL GK++V G +PD IT N +I AYCKQGK + AAQ+++ VS   ++W PD+I+YT++I
Subjt:  KYEEAFGLFQEIEERNLQPNLVTYNTILYGFSRAGMLGEALQLFGKVLVGGTAPDPITYNTMIHAYCKQGKVKIAAQLVERVS-SMKEWHPDIITYTSLI

Query:  WGACNWINIEEAIAFLYKAINQGICPNFATWNALVRCFFDSLGHMGPIHILDDILR
        WG C     E+ +  L + I+ GI P+ ATW+ L+ CF           ILDDI+R
Subjt:  WGACNWINIEEAIAFLYKAINQGICPNFATWNALVRCFFDSLGHMGPIHILDDILR

Arabidopsis top hitse value%identityAlignment
AT2G34410.1 O-acetyltransferase family protein4.3e-14850.46Show/hide
Query:  MALLGPVTHAQVSCLIGFVTIVVAWIYSEYLVFKKKSISSKVRHSDIGLAEVESHTGKEDERAVLLEGGAIKSIPSGVRIPPSFSTFIRFIMLEESFLIE
        MA   P+T  QVS L+G + + +AWIYSE+L +K+ S+ SKV HSD  L E+     KEDE  VLLEGG  +S+ +     P  +  IRF+ LE+SFLIE
Subjt:  MALLGPVTHAQVSCLIGFVTIVVAWIYSEYLVFKKKSISSKVRHSDIGLAEVESHTGKEDERAVLLEGGAIKSIPSGVRIPPSFSTFIRFIMLEESFLIE

Query:  NRMLLRAASELGLLLAYFYICDRTDIFGTATK-----------------------------------------------------VIFLMYHYFNAREIY
        NR  LRA +E G +L YFYI DRT + G + K                                                     V+FLMYHYF A EIY
Subjt:  NRMLLRAASELGLLLAYFYICDRTDIFGTATK-----------------------------------------------------VIFLMYHYFNAREIY

Query:  NVGRVCVAAYVWMTGFGNFSYYYARKDFSLSRFAQLHVILHL-------------------PLAHFFSLVVYGIIGILNKYNQIRAVIAVKFFVCFLVVI
        N  RV +AAYVWMTGFGNFSYYY RKDFSL+RF Q+   L+L                   P+   F+L+VYG +GI ++YN+I +V+A+K   CFLVVI
Subjt:  NVGRVCVAAYVWMTGFGNFSYYYARKDFSLSRFAQLHVILHL-------------------PLAHFFSLVVYGIIGILNKYNQIRAVIAVKFFVCFLVVI

Query:  LVWEIPGVFDVLWEPFTFLLGYKDPNRLEEHLPLMYEWHFRTGLDRYIWILGMIYAYYYSTVENWIEKLDEAKLKLRILIKTTIVVTSATAAYLWFEYIF
        ++WEIPGVF++ W P TFLLGY DP + E  LPL++EWHFR+GLDRYIWI+GMIYAY++ TVE W+EKL+E   K ++ IKT+I+  S+   YLW+EYI+
Subjt:  LVWEIPGVFDVLWEPFTFLLGYKDPNRLEEHLPLMYEWHFRTGLDRYIWILGMIYAYYYSTVENWIEKLDEAKLKLRILIKTTIVVTSATAAYLWFEYIF

Query:  KLDSITYNKYHPYTSWIPITAYICIRNVSQSTRSYTLTLFGWIGKISLDTYISQFHIWLRSNAPDAQPKKLLIIIPNYPLLNFMFTTIVFMTISYRLSEL
        KLD +TYNKYHPYTSWIPIT YIC+RN +Q  R++++TLF W+GKI+L+TYISQFHIWLRSN P+ QPK LL IIP YP+LNFM  T +++ +S+RL EL
Subjt:  KLDSITYNKYHPYTSWIPITAYICIRNVSQSTRSYTLTLFGWIGKISLDTYISQFHIWLRSNAPDAQPKKLLIIIPNYPLLNFMFTTIVFMTISYRLSEL

Query:  TNTFKMAFVPSKDNKRIMHNMITAAAIMGILYSFSFVFLKV
        TNT K  F+P+KD+KR++HN++  AAI   LY  S + L++
Subjt:  TNTFKMAFVPSKDNKRIMHNMITAAAIMGILYSFSFVFLKV

AT3G06550.1 O-acetyltransferase family protein9.5e-15650.27Show/hide
Query:  MALLGPVTHAQVSCLIGFVTIVVAWIYSEYLVFKKKSISSKVRHSDIGLAEVESHTGKEDERAVLLE-GGAIKSIPSGVRIPPSFSTFIRFIMLEESFLI
        MA   PVT   +S + G V ++VAW+YSEYL + K S+S+K RHSD+ L E+     KED++A+L+E GG ++S     + P + S  IRF++L+ESFL+
Subjt:  MALLGPVTHAQVSCLIGFVTIVVAWIYSEYLVFKKKSISSKVRHSDIGLAEVESHTGKEDERAVLLE-GGAIKSIPSGVRIPPSFSTFIRFIMLEESFLI

Query:  ENRMLLRAASELGLLLAYFYICDRTDIFGTATK-----------------------------------------------------VIFLMYHYFNAREI
        ENR+ LRA  E  +L+ YFYICDRTD+F ++ K                                                     V+FLMYHYF A E 
Subjt:  ENRMLLRAASELGLLLAYFYICDRTDIFGTATK-----------------------------------------------------VIFLMYHYFNAREI

Query:  YNVGRVCVAAYVWMTGFGNFSYYYARKDFSLSRFAQLH-----------VILH--------LPLAHFFSLVVYGIIGILNKYNQIRAVIAVKFFVCFLVV
        YN  RV +A YVWMTGFGNFSYYY RKDFSL+RFAQ+            ++L+         P+   F+L+VYG +GI++KYN++ +VIA KFF CF+VV
Subjt:  YNVGRVCVAAYVWMTGFGNFSYYYARKDFSLSRFAQLH-----------VILH--------LPLAHFFSLVVYGIIGILNKYNQIRAVIAVKFFVCFLVV

Query:  ILVWEIPGVFDVLWEPFTFLLGYKDPNRLEEHLPLMYEWHFRTGLDRYIWILGMIYAYYYSTVENWIEKLDEAKLKLRILIKTTIVVTSATAAYLWFEYI
        I+VWEIPGVF+ +W PFT L+GY DP   +  LPL++EWHFR+GLDRYIWI+GM+YAYY+ TVE+W++KL+EA++K R+ IKT++ + + T  Y W+EYI
Subjt:  ILVWEIPGVFDVLWEPFTFLLGYKDPNRLEEHLPLMYEWHFRTGLDRYIWILGMIYAYYYSTVENWIEKLDEAKLKLRILIKTTIVVTSATAAYLWFEYI

Query:  FKLDSITYNKYHPYTSWIPITAYICIRNVSQSTRSYTLTLFGWIGKISLDTYISQFHIWLRSNAPDAQPKKLLIIIPNYPLLNFMFTTIVFMTISYRLSE
        +K+D +TYNKYHPYTSWIPIT YIC+RN++QS R Y+LTL  W+GKI+L+TYISQFHIWLRS  PD QPK LL ++P+YPLLNFM TT +++ ISYRL E
Subjt:  FKLDSITYNKYHPYTSWIPITAYICIRNVSQSTRSYTLTLFGWIGKISLDTYISQFHIWLRSNAPDAQPKKLLIIIPNYPLLNFMFTTIVFMTISYRLSE

Query:  LTNTFKMAFVPSKDNKRIMHNMITAAAIMGILYSFSFVFLKVLPMLI
        LTNT K AF+P+KD+KR+++N I+A  I   LY FSF+ + +   L+
Subjt:  LTNTFKMAFVPSKDNKRIMHNMITAAAIMGILYSFSFVFLKVLPMLI

AT3G06550.2 O-acetyltransferase family protein5.6e-15650.09Show/hide
Query:  MALLGPVTHAQVSCLIGFVTIVVAWIYSEYLVFKKKSISSKVRHSDIGLAEVESHTGKEDERAVLLE-GGAIKSIPSGVRIPPSFSTFIRFIMLEESFLI
        MA   PVT   +S + G V ++VAW+YSEYL + K S+S+K RHSD+ L E+     KED++A+L+E GG ++S     + P + S  IRF++L+ESFL+
Subjt:  MALLGPVTHAQVSCLIGFVTIVVAWIYSEYLVFKKKSISSKVRHSDIGLAEVESHTGKEDERAVLLE-GGAIKSIPSGVRIPPSFSTFIRFIMLEESFLI

Query:  ENRMLLRAASELGLLLAYFYICDRTDIFGTATK-----------------------------------------------------VIFLMYHYFNAREI
        ENR+ LRA  E  +L+ YFYICDRTD+F ++ K                                                     V+FLMYHYF A E 
Subjt:  ENRMLLRAASELGLLLAYFYICDRTDIFGTATK-----------------------------------------------------VIFLMYHYFNAREI

Query:  YNVGRVCVAAYVWMTGFGNFSYYYARKDFSLSRFAQLH-----------VILH--------LPLAHFFSLVVYGIIGILNKYNQIRAVIAVKFFVCFLVV
        YN  RV +A YVWMTGFGNFSYYY RKDFSL+RFAQ+            ++L+         P+   F+L+VYG +GI++KYN++ +VIA KFF CF+VV
Subjt:  YNVGRVCVAAYVWMTGFGNFSYYYARKDFSLSRFAQLH-----------VILH--------LPLAHFFSLVVYGIIGILNKYNQIRAVIAVKFFVCFLVV

Query:  ILVWEIPGVFDVLWEPFTFLLGYKDPNRLEEHLPLMYEWHFRTGLDRYIWILGMIYAYYYSTVENWIEKLDEAKLKLRILIKTTIVVTSATAAYLWFEYI
        I+VWEIPGVF+ +W PFT L+GY DP   +  LPL++EWHFR+GLDRYIWI+GM+YAYY+ TVE+W++KL+EA++K R+ IKT++ + + T  Y W+EYI
Subjt:  ILVWEIPGVFDVLWEPFTFLLGYKDPNRLEEHLPLMYEWHFRTGLDRYIWILGMIYAYYYSTVENWIEKLDEAKLKLRILIKTTIVVTSATAAYLWFEYI

Query:  FKLDSITYNKYHPYTSWIPITAYICIRNVSQSTRSYTLTLFGWIGKISLDTYISQFHIWLRSNAPDAQPKKLLIIIPNYPLLNFMFTTIVFMTISYRLSE
        +K+D +TYNKYHPYTSWIPIT YIC+RN++QS R Y+LTL  W+GKI+L+TYISQFHIWLRS  PD QPK LL ++P+YPLLNFM TT +++ ISYRL E
Subjt:  FKLDSITYNKYHPYTSWIPITAYICIRNVSQSTRSYTLTLFGWIGKISLDTYISQFHIWLRSNAPDAQPKKLLIIIPNYPLLNFMFTTIVFMTISYRLSE

Query:  LTNTFKMAFVPSKDNKRIMHNMITAAAIMGILYSFSFVFLKVLPMLIQK
        LTNT K AF+P+KD+KR+++N I+A  I   LY FSF+ + +   L+ +
Subjt:  LTNTFKMAFVPSKDNKRIMHNMITAAAIMGILYSFSFVFLKVLPMLIQK

AT3G06550.3 O-acetyltransferase family protein8.9e-15450.09Show/hide
Query:  MALLGPVTHAQVSCLIGFVTIVVAWIYSEYLVFKKKSISSKVRHSDIGLAEVESHTGKEDERAVLLE-GGAIKSIPSGVRIPPSFSTFIRFIMLEESFLI
        MA   PVT   +S + G V ++VAW+YSEYL + K S+S+K  HSD+ L E+     KED++A+L+E GG ++S     + P + S  IRF++L+ESFL+
Subjt:  MALLGPVTHAQVSCLIGFVTIVVAWIYSEYLVFKKKSISSKVRHSDIGLAEVESHTGKEDERAVLLE-GGAIKSIPSGVRIPPSFSTFIRFIMLEESFLI

Query:  ENRMLLRAASELGLLLAYFYICDRTDIFGTATK-----------------------------------------------------VIFLMYHYFNAREI
        ENR+ LRA  E  +L+ YFYICDRTD+F ++ K                                                     V+FLMYHYF A E 
Subjt:  ENRMLLRAASELGLLLAYFYICDRTDIFGTATK-----------------------------------------------------VIFLMYHYFNAREI

Query:  YNVGRVCVAAYVWMTGFGNFSYYYARKDFSLSRFAQLH-----------VILH--------LPLAHFFSLVVYGIIGILNKYNQIRAVIAVKFFVCFLVV
        YN  RV +A YVWMTGFGNFSYYY RKDFSL+RFAQ+            ++L+         P+   F+L+VYG +GI++KYN++ +VIA KFF CF+VV
Subjt:  YNVGRVCVAAYVWMTGFGNFSYYYARKDFSLSRFAQLH-----------VILH--------LPLAHFFSLVVYGIIGILNKYNQIRAVIAVKFFVCFLVV

Query:  ILVWEIPGVFDVLWEPFTFLLGYKDPNRLEEHLPLMYEWHFRTGLDRYIWILGMIYAYYYSTVENWIEKLDEAKLKLRILIKTTIVVTSATAAYLWFEYI
        I+VWEIPGVF+ +W PFT L+GY DP   +  LPL++EWHFR+GLDRYIWI+GM+YAYY+ TVE+W++KL+EA++K R+ IKT++ + + T  Y W+EYI
Subjt:  ILVWEIPGVFDVLWEPFTFLLGYKDPNRLEEHLPLMYEWHFRTGLDRYIWILGMIYAYYYSTVENWIEKLDEAKLKLRILIKTTIVVTSATAAYLWFEYI

Query:  FKLDSITYNKYHPYTSWIPITAYICIRNVSQSTRSYTLTLFGWIGKISLDTYISQFHIWLRSNAPDAQPKKLLIIIPNYPLLNFMFTTIVFMTISYRLSE
        +K+D +TYNKYHPYTSWIPIT YIC+RN++QS R Y+LTL  W+GKI+L+TYISQFHIWLRS  PD QPK LL ++P+YPLLNFM TT +++ ISYRL E
Subjt:  FKLDSITYNKYHPYTSWIPITAYICIRNVSQSTRSYTLTLFGWIGKISLDTYISQFHIWLRSNAPDAQPKKLLIIIPNYPLLNFMFTTIVFMTISYRLSE

Query:  LTNTFKMAFVPSKDNKRIMHNMITAAAIMGILYSFSFVFLKVLPMLI
        LTNT K AF+P+KD+KR+++N I+A  I   LY FSF+ + +   L+
Subjt:  LTNTFKMAFVPSKDNKRIMHNMITAAAIMGILYSFSFVFLKVLPMLI

AT3G48810.1 Pentatricopeptide repeat (PPR) superfamily protein4.6e-20352.74Show/hide
Query:  LPMLIQKPLMPFVLNSSPIIN---PRDPNKQHLLRESDVLKRLKKDRNLASALEFFSAIANSNAFKHTASTYKVMIERLGRECEMDVVQYILQQMKMDGI
        L + +QKPL+PFVLN++  +N      PN    ++E DV+KRL+++  +  AL FF +IANSN FKHT  T++VMI +L  + ++D VQY+LQQMK+ G 
Subjt:  LPMLIQKPLMPFVLNSSPIIN---PRDPNKQHLLRESDVLKRLKKDRNLASALEFFSAIANSNAFKHTASTYKVMIERLGRECEMDVVQYILQQMKMDGI

Query:  SCCEDLFVCIINSYKRVGSAEQALKMFYRIGEFGCKPTVKIYNHLLDALLSENRFQIINPLYTNMKKDGLIPNVFTYNILLKALCKNDRVDAAHKLFVEM
         C EDLF+ +I+ Y++VG AE+A++MFYRI EFGC P+VKIYNH+LD LL ENR Q+I  +Y +MK+DG  PNVFTYN+LLKALCKN++VD A KL VEM
Subjt:  SCCEDLFVCIINSYKRVGSAEQALKMFYRIGEFGCKPTVKIYNHLLDALLSENRFQIINPLYTNMKKDGLIPNVFTYNILLKALCKNDRVDAAHKLFVEM

Query:  SSKGCPPDVVSYTTMVSSLCKAGKIDDARELAGTFKPSVPVYNALIDGMCKEGRIEVAIKLLGEMMDNGVDPNVVSYSSIINSLCNSGNVELAFALLARM
        S+KGC PD VSYTT++SS+C+ G + + RELA  F+P V VYNALI+G+CKE   + A +L+ EM++ G+ PNV+SYS++IN LCNSG +ELAF+ L +M
Subjt:  SSKGCPPDVVSYTTMVSSLCKAGKIDDARELAGTFKPSVPVYNALIDGMCKEGRIEVAIKLLGEMMDNGVDPNVVSYSSIINSLCNSGNVELAFALLARM

Query:  FLRGCDANIHTFTPLIKGCFMRGKLYEALDLWKLMIQD-GCEPNVVAYNTLIHGLCKSGTLVEALQVCDQMQRSGCLPNVTTYSILIDGFAKSGHLVGAS
          RGC  NI+T + L+KGCF+RG  ++ALDLW  MI+  G +PNVVAYNTL+ G C  G +V+A+ V   M+  GC PN+ TY  LI+GFAK G L GA 
Subjt:  FLRGCDANIHTFTPLIKGCFMRGKLYEALDLWKLMIQD-GCEPNVVAYNTLIHGLCKSGTLVEALQVCDQMQRSGCLPNVTTYSILIDGFAKSGHLVGAS

Query:  ETWNRMISHGCRPNVVAYTCMVDVLCKNLMFDQANSIMEKMILEGCTPNTVTFNTFIKGLCGNGRVEWAMKVLDRM-QGHGCLPNITTYNELLDALFRMN
          WN+M++ GC PNVV YT MV+ LC++  F +A S++E M  E C P+  TFN FIKGLC  GR++WA KV  +M Q H C PNI TYNELLD L + N
Subjt:  ETWNRMISHGCRPNVVAYTCMVDVLCKNLMFDQANSIMEKMILEGCTPNTVTFNTFIKGLCGNGRVEWAMKVLDRM-QGHGCLPNITTYNELLDALFRMN

Query:  KYEEAFGLFQEIEERNLQPNLVTYNTILYGFSRAGMLGEALQLFGKVLVGGTAPDPITYNTMIHAYCKQGKVKIAAQLVERVS-SMKEWHPDIITYTSLI
        + EEA+GL +EI  R ++ +  TYNT+L+G   AG+ G ALQL GK++V G +PD IT N +I AYCKQGK + AAQ+++ VS   ++W PD+I+YT++I
Subjt:  KYEEAFGLFQEIEERNLQPNLVTYNTILYGFSRAGMLGEALQLFGKVLVGGTAPDPITYNTMIHAYCKQGKVKIAAQLVERVS-SMKEWHPDIITYTSLI

Query:  WGACNWINIEEAIAFLYKAINQGICPNFATWNALVRCFFDSLGHMGPIHILDDILR
        WG C     E+ +  L + I+ GI P+ ATW+ L+ CF           ILDDI+R
Subjt:  WGACNWINIEEAIAFLYKAINQGICPNFATWNALVRCFFDSLGHMGPIHILDDILR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTTACTAGGCCCAGTGACTCACGCTCAGGTTTCGTGTCTTATCGGATTTGTAACCATTGTCGTTGCGTGGATTTATTCGGAGTACTTAGTATTTAAAAAGAAGTC
GATCTCTTCCAAAGTAAGGCACTCTGATATTGGCTTGGCTGAGGTAGAAAGTCATACTGGAAAGGAAGATGAGAGAGCTGTTTTATTAGAAGGGGGGGCCATCAAGTCTA
TACCTTCTGGAGTTCGAATTCCACCTTCCTTCTCTACTTTTATCAGGTTCATTATGCTGGAAGAGTCATTCTTGATAGAAAATCGAATGTTACTCAGAGCAGCATCTGAA
CTTGGTCTTCTTTTAGCATACTTCTACATTTGTGACCGTACTGACATATTTGGAACAGCAACGAAGGTAATTTTTCTGATGTATCATTACTTTAATGCAAGAGAAATTTA
CAATGTGGGTCGCGTATGTGTTGCCGCCTACGTGTGGATGACTGGATTTGGGAATTTTTCATATTACTATGCTCGTAAAGACTTTAGCCTCTCTAGATTTGCTCAGTTAC
ATGTTATACTACATCTGCCCCTTGCACACTTTTTCTCTCTCGTGGTATATGGTATCATTGGTATTCTGAACAAGTACAATCAGATTAGAGCCGTCATTGCTGTGAAATTT
TTTGTTTGCTTCTTGGTCGTTATCCTAGTATGGGAAATACCTGGAGTGTTTGACGTGCTTTGGGAACCCTTTACCTTCCTTCTAGGGTATAAAGATCCAAATCGATTGGA
AGAACACTTGCCTCTTATGTATGAGTGGCACTTCCGCACGGGGCTTGATCGCTATATTTGGATATTAGGCATGATATATGCATACTATTATTCAACTGTTGAGAACTGGA
TTGAAAAGTTAGATGAAGCAAAATTGAAGCTCAGGATTCTGATCAAAACAACAATTGTTGTGACATCCGCAACGGCCGCATATCTATGGTTTGAGTACATATTCAAATTG
GACAGTATTACTTACAACAAATATCATCCTTACACGTCTTGGATTCCCATCACGGCCTACATATGTATACGAAATGTCAGCCAGAGTACCCGTAGCTATACATTGACCCT
TTTCGGTTGGATTGGAAAGATTTCACTGGACACATACATCTCGCAGTTCCATATTTGGTTAAGATCAAATGCTCCGGATGCCCAACCGAAGAAGTTGCTCATTATCATCC
CAAATTATCCTTTATTGAACTTTATGTTTACAACTATCGTATTTATGACGATTTCCTACAGGCTATCTGAGTTGACAAATACATTTAAAATGGCTTTTGTGCCTAGCAAG
GACAATAAGCGCATTATGCATAATATGATTACCGCAGCTGCAATCATGGGCATCTTATACTCCTTCTCATTTGTATTCCTTAAAGTTCTTCCAATGTTGATTCAGAAACC
CTTAATGCCGTTCGTGCTAAATTCCAGCCCCATCATCAATCCCCGTGATCCCAATAAACAACACCTTCTGCGAGAATCCGATGTGCTCAAACGATTGAAAAAGGATCGGA
ATCTTGCGTCGGCCTTAGAATTCTTCAGCGCCATAGCCAATTCAAATGCCTTCAAGCACACCGCATCGACGTACAAAGTCATGATCGAGAGGCTCGGGCGTGAATGTGAA
ATGGATGTGGTGCAGTACATTTTACAGCAAATGAAGATGGATGGAATTAGTTGCTGTGAAGATCTGTTCGTATGTATAATCAATAGTTACAAACGCGTTGGCTCTGCTGA
GCAGGCGCTTAAAATGTTCTATCGGATTGGAGAATTCGGGTGCAAGCCGACGGTGAAGATATACAATCATCTTTTGGATGCATTGCTGAGTGAAAACAGGTTTCAGATAA
TTAATCCGTTGTATACTAATATGAAGAAAGATGGGTTAATTCCTAATGTCTTTACATATAATATACTTTTGAAAGCGCTGTGCAAAAATGATCGGGTTGATGCTGCACAC
AAGCTGTTTGTCGAAATGTCAAGTAAGGGCTGCCCACCTGACGTGGTTAGTTATACGACTATGGTGTCTTCACTGTGTAAAGCAGGTAAGATTGATGATGCTAGAGAGTT
AGCGGGAACATTTAAACCGAGTGTTCCTGTTTATAATGCTTTGATTGATGGGATGTGTAAAGAAGGAAGAATTGAGGTAGCGATTAAGTTGTTGGGTGAAATGATGGATA
ATGGAGTTGATCCCAATGTGGTATCGTATTCGAGTATTATAAATTCTCTTTGCAATTCTGGGAATGTTGAATTGGCTTTTGCATTATTGGCTCGAATGTTTTTGAGAGGT
TGTGATGCCAATATCCACACCTTTACGCCATTGATAAAGGGTTGTTTCATGAGAGGAAAATTATATGAAGCTCTTGACTTGTGGAAGCTTATGATACAAGATGGTTGTGA
GCCAAATGTAGTAGCTTACAACACTCTGATTCATGGTCTTTGCAAGAGTGGGACTTTGGTAGAAGCCTTACAAGTATGTGATCAGATGCAGAGGAGTGGCTGCCTCCCCA
ACGTCACTACTTATAGCATTCTAATTGATGGTTTTGCTAAAAGTGGTCATTTGGTTGGTGCGTCTGAGACATGGAATAGGATGATATCCCATGGCTGTCGTCCTAATGTG
GTGGCCTATACTTGCATGGTGGATGTTCTTTGTAAAAACTTGATGTTTGACCAAGCCAATTCAATTATGGAGAAGATGATTCTTGAAGGTTGTACTCCAAATACTGTAAC
ATTCAACACGTTTATCAAAGGTTTGTGTGGAAATGGAAGAGTAGAATGGGCAATGAAAGTGCTTGACCGAATGCAAGGACATGGGTGTCTTCCTAATATCACAACTTACA
ACGAGTTACTGGATGCTCTCTTCAGGATGAACAAGTATGAAGAGGCTTTTGGGCTATTTCAGGAAATTGAAGAAAGAAATTTACAGCCGAACTTAGTGACATATAATACC
ATTTTATATGGTTTTTCTCGTGCTGGCATGCTTGGGGAAGCGTTGCAGCTTTTTGGTAAAGTACTTGTGGGGGGAACTGCCCCTGATCCCATCACGTATAATACAATGAT
CCATGCCTATTGCAAGCAGGGTAAGGTTAAGATCGCAGCTCAATTGGTGGAACGAGTTAGTTCTATGAAGGAGTGGCATCCAGATATCATTACATACACTAGCCTCATTT
GGGGTGCCTGCAATTGGATAAATATAGAAGAAGCCATTGCGTTTCTTTACAAGGCAATTAATCAAGGAATCTGTCCCAACTTTGCCACATGGAATGCATTAGTTCGGTGT
TTCTTCGACTCTTTAGGTCATATGGGACCAATTCACATTCTGGATGATATTCTGAGAAAGGGATAA
mRNA sequenceShow/hide mRNA sequence
CCGGCCAAGGCTGTGCCACATTCTCGCCGCCGGAAGAAACGCCGGCAGTTTTCTCAAAATCCTAAACCAAATTCATCTCAAATTCACGCCCAACACAAACATGAAGTTTT
CTGTACTGGAGATCTTCGGCGTTGGCAGCGTTTTCTCTCGAATCGGTTTCACTCTGAAACTTCAACGTTAGAAGCTCTTTAATTTGTGGTTATGGTTCCATTTATTGTTT
AGAGCTTTCTTCTTTCTTTGATTTACTCTTCTTCTTTCTCTGCTACATATATATACTTCCAATGGAACATTTCTCTTCACTCATCCTCTTCTGTTTTCTTCTTTCTATTC
TCTTTCCCTCGAATCGTTATTGCGAGCGCCATAACGCAATTGGCAATGCTTCTTATTGTCTTGTAAGCATTCCTTCTGATCGACTACTTCCATTTCGGATTTTTTCGGTA
ACTCATAGCAATGGCTTTACTAGGCCCAGTGACTCACGCTCAGGTTTCGTGTCTTATCGGATTTGTAACCATTGTCGTTGCGTGGATTTATTCGGAGTACTTAGTATTTA
AAAAGAAGTCGATCTCTTCCAAAGTAAGGCACTCTGATATTGGCTTGGCTGAGGTAGAAAGTCATACTGGAAAGGAAGATGAGAGAGCTGTTTTATTAGAAGGGGGGGCC
ATCAAGTCTATACCTTCTGGAGTTCGAATTCCACCTTCCTTCTCTACTTTTATCAGGTTCATTATGCTGGAAGAGTCATTCTTGATAGAAAATCGAATGTTACTCAGAGC
AGCATCTGAACTTGGTCTTCTTTTAGCATACTTCTACATTTGTGACCGTACTGACATATTTGGAACAGCAACGAAGGTAATTTTTCTGATGTATCATTACTTTAATGCAA
GAGAAATTTACAATGTGGGTCGCGTATGTGTTGCCGCCTACGTGTGGATGACTGGATTTGGGAATTTTTCATATTACTATGCTCGTAAAGACTTTAGCCTCTCTAGATTT
GCTCAGTTACATGTTATACTACATCTGCCCCTTGCACACTTTTTCTCTCTCGTGGTATATGGTATCATTGGTATTCTGAACAAGTACAATCAGATTAGAGCCGTCATTGC
TGTGAAATTTTTTGTTTGCTTCTTGGTCGTTATCCTAGTATGGGAAATACCTGGAGTGTTTGACGTGCTTTGGGAACCCTTTACCTTCCTTCTAGGGTATAAAGATCCAA
ATCGATTGGAAGAACACTTGCCTCTTATGTATGAGTGGCACTTCCGCACGGGGCTTGATCGCTATATTTGGATATTAGGCATGATATATGCATACTATTATTCAACTGTT
GAGAACTGGATTGAAAAGTTAGATGAAGCAAAATTGAAGCTCAGGATTCTGATCAAAACAACAATTGTTGTGACATCCGCAACGGCCGCATATCTATGGTTTGAGTACAT
ATTCAAATTGGACAGTATTACTTACAACAAATATCATCCTTACACGTCTTGGATTCCCATCACGGCCTACATATGTATACGAAATGTCAGCCAGAGTACCCGTAGCTATA
CATTGACCCTTTTCGGTTGGATTGGAAAGATTTCACTGGACACATACATCTCGCAGTTCCATATTTGGTTAAGATCAAATGCTCCGGATGCCCAACCGAAGAAGTTGCTC
ATTATCATCCCAAATTATCCTTTATTGAACTTTATGTTTACAACTATCGTATTTATGACGATTTCCTACAGGCTATCTGAGTTGACAAATACATTTAAAATGGCTTTTGT
GCCTAGCAAGGACAATAAGCGCATTATGCATAATATGATTACCGCAGCTGCAATCATGGGCATCTTATACTCCTTCTCATTTGTATTCCTTAAAGTTCTTCCAATGTTGA
TTCAGAAACCCTTAATGCCGTTCGTGCTAAATTCCAGCCCCATCATCAATCCCCGTGATCCCAATAAACAACACCTTCTGCGAGAATCCGATGTGCTCAAACGATTGAAA
AAGGATCGGAATCTTGCGTCGGCCTTAGAATTCTTCAGCGCCATAGCCAATTCAAATGCCTTCAAGCACACCGCATCGACGTACAAAGTCATGATCGAGAGGCTCGGGCG
TGAATGTGAAATGGATGTGGTGCAGTACATTTTACAGCAAATGAAGATGGATGGAATTAGTTGCTGTGAAGATCTGTTCGTATGTATAATCAATAGTTACAAACGCGTTG
GCTCTGCTGAGCAGGCGCTTAAAATGTTCTATCGGATTGGAGAATTCGGGTGCAAGCCGACGGTGAAGATATACAATCATCTTTTGGATGCATTGCTGAGTGAAAACAGG
TTTCAGATAATTAATCCGTTGTATACTAATATGAAGAAAGATGGGTTAATTCCTAATGTCTTTACATATAATATACTTTTGAAAGCGCTGTGCAAAAATGATCGGGTTGA
TGCTGCACACAAGCTGTTTGTCGAAATGTCAAGTAAGGGCTGCCCACCTGACGTGGTTAGTTATACGACTATGGTGTCTTCACTGTGTAAAGCAGGTAAGATTGATGATG
CTAGAGAGTTAGCGGGAACATTTAAACCGAGTGTTCCTGTTTATAATGCTTTGATTGATGGGATGTGTAAAGAAGGAAGAATTGAGGTAGCGATTAAGTTGTTGGGTGAA
ATGATGGATAATGGAGTTGATCCCAATGTGGTATCGTATTCGAGTATTATAAATTCTCTTTGCAATTCTGGGAATGTTGAATTGGCTTTTGCATTATTGGCTCGAATGTT
TTTGAGAGGTTGTGATGCCAATATCCACACCTTTACGCCATTGATAAAGGGTTGTTTCATGAGAGGAAAATTATATGAAGCTCTTGACTTGTGGAAGCTTATGATACAAG
ATGGTTGTGAGCCAAATGTAGTAGCTTACAACACTCTGATTCATGGTCTTTGCAAGAGTGGGACTTTGGTAGAAGCCTTACAAGTATGTGATCAGATGCAGAGGAGTGGC
TGCCTCCCCAACGTCACTACTTATAGCATTCTAATTGATGGTTTTGCTAAAAGTGGTCATTTGGTTGGTGCGTCTGAGACATGGAATAGGATGATATCCCATGGCTGTCG
TCCTAATGTGGTGGCCTATACTTGCATGGTGGATGTTCTTTGTAAAAACTTGATGTTTGACCAAGCCAATTCAATTATGGAGAAGATGATTCTTGAAGGTTGTACTCCAA
ATACTGTAACATTCAACACGTTTATCAAAGGTTTGTGTGGAAATGGAAGAGTAGAATGGGCAATGAAAGTGCTTGACCGAATGCAAGGACATGGGTGTCTTCCTAATATC
ACAACTTACAACGAGTTACTGGATGCTCTCTTCAGGATGAACAAGTATGAAGAGGCTTTTGGGCTATTTCAGGAAATTGAAGAAAGAAATTTACAGCCGAACTTAGTGAC
ATATAATACCATTTTATATGGTTTTTCTCGTGCTGGCATGCTTGGGGAAGCGTTGCAGCTTTTTGGTAAAGTACTTGTGGGGGGAACTGCCCCTGATCCCATCACGTATA
ATACAATGATCCATGCCTATTGCAAGCAGGGTAAGGTTAAGATCGCAGCTCAATTGGTGGAACGAGTTAGTTCTATGAAGGAGTGGCATCCAGATATCATTACATACACT
AGCCTCATTTGGGGTGCCTGCAATTGGATAAATATAGAAGAAGCCATTGCGTTTCTTTACAAGGCAATTAATCAAGGAATCTGTCCCAACTTTGCCACATGGAATGCATT
AGTTCGGTGTTTCTTCGACTCTTTAGGTCATATGGGACCAATTCACATTCTGGATGATATTCTGAGAAAGGGATAA
Protein sequenceShow/hide protein sequence
MALLGPVTHAQVSCLIGFVTIVVAWIYSEYLVFKKKSISSKVRHSDIGLAEVESHTGKEDERAVLLEGGAIKSIPSGVRIPPSFSTFIRFIMLEESFLIENRMLLRAASE
LGLLLAYFYICDRTDIFGTATKVIFLMYHYFNAREIYNVGRVCVAAYVWMTGFGNFSYYYARKDFSLSRFAQLHVILHLPLAHFFSLVVYGIIGILNKYNQIRAVIAVKF
FVCFLVVILVWEIPGVFDVLWEPFTFLLGYKDPNRLEEHLPLMYEWHFRTGLDRYIWILGMIYAYYYSTVENWIEKLDEAKLKLRILIKTTIVVTSATAAYLWFEYIFKL
DSITYNKYHPYTSWIPITAYICIRNVSQSTRSYTLTLFGWIGKISLDTYISQFHIWLRSNAPDAQPKKLLIIIPNYPLLNFMFTTIVFMTISYRLSELTNTFKMAFVPSK
DNKRIMHNMITAAAIMGILYSFSFVFLKVLPMLIQKPLMPFVLNSSPIINPRDPNKQHLLRESDVLKRLKKDRNLASALEFFSAIANSNAFKHTASTYKVMIERLGRECE
MDVVQYILQQMKMDGISCCEDLFVCIINSYKRVGSAEQALKMFYRIGEFGCKPTVKIYNHLLDALLSENRFQIINPLYTNMKKDGLIPNVFTYNILLKALCKNDRVDAAH
KLFVEMSSKGCPPDVVSYTTMVSSLCKAGKIDDARELAGTFKPSVPVYNALIDGMCKEGRIEVAIKLLGEMMDNGVDPNVVSYSSIINSLCNSGNVELAFALLARMFLRG
CDANIHTFTPLIKGCFMRGKLYEALDLWKLMIQDGCEPNVVAYNTLIHGLCKSGTLVEALQVCDQMQRSGCLPNVTTYSILIDGFAKSGHLVGASETWNRMISHGCRPNV
VAYTCMVDVLCKNLMFDQANSIMEKMILEGCTPNTVTFNTFIKGLCGNGRVEWAMKVLDRMQGHGCLPNITTYNELLDALFRMNKYEEAFGLFQEIEERNLQPNLVTYNT
ILYGFSRAGMLGEALQLFGKVLVGGTAPDPITYNTMIHAYCKQGKVKIAAQLVERVSSMKEWHPDIITYTSLIWGACNWINIEEAIAFLYKAINQGICPNFATWNALVRC
FFDSLGHMGPIHILDDILRKG