CuGenDBv2

Cla97C10G190730 (gene) of Watermelon (97103) v2.5 genome

Gene ID: Cla97C10G190730
Organism: Citrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
Description: Pentatricopeptide repeat-containing protein
Genome location: Cla97Chr10:9121166..9138453
RNA-Seq Expression: Cla97C10G190730
Synteny: Cla97C10G190730
Gene Ontology terms:
  GO:0005739 - mitochondrion (cellular component)
  GO:0005515 - protein binding (molecular function)
InterPro domains:
  IPR002885 - Pentatricopeptide repeat
  IPR011990 - Tetratricopeptide-like helical domain superfamily
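
The genome location above follows a `chromosome:start..end` pattern. As a minimal sketch, the snippet below splits such a string and reports the span length. The function name `parse_location` is hypothetical, and the length assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

def parse_location(loc: str):
    """Split 'chrom:start..end' into (chrom, start, end, span_length)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if match is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    # Assumption: 1-based, inclusive coordinates, so the span is end - start + 1.
    return chrom, start, end, end - start + 1

print(parse_location("Cla97Chr10:9121166..9138453"))
```

Under that assumption the gene spans 17288 bp on Cla97Chr10.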


Homology
GenBank top hits (columns: e value, % identity, alignment)
GAY56118.1 hypothetical protein CUMW_169390 [Citrus unshiu] (e value: 8.0e-146; identity: 45.84%)
Query:  VVLRLYSARRSCDRRNLFARISPLGDPELSVVPVLNQWIEEGRKIKDFELRRIVRDLRTSRRYGQALEVSAIEKHALISREKNACIVSEWMCSKGLFSLT
        +  R Y A +   R NL++RISPLGDP++S+ PVL+QW+ EG+KI + EL+R++R LR+ +R+  AL+                  VSEWM  +GL + +
Subjt:  VVLRLYSARRSCDRRNLFARISPLGDPELSVVPVLNQWIEEGRKIKDFELRRIVRDLRTSRRYGQALEVSAIEKHALISREKNACIVSEWMCSKGLFSLT

Query:  TRDFAVQLDLIGRVRGLDSAEKYFGSVSNQEEIGKLYGALLNCYVREGLVDKSLAHMQKMKEMGFASSPLCYNDIMCLYLKTGQADKVPNVLSEMKENGV
          D AVQLDLIG+VRGL+SAE YF S+++++++ KLYGALLNCYVREGLVD+SL+ MQKMKEMG   S L YN IMCLY  TGQ +K+P+VL +MKENGV
Subjt:  TRDFAVQLDLIGRVRGLDSAEKYFGSVSNQEEIGKLYGALLNCYVREGLVDKSLAHMQKMKEMGFASSPLCYNDIMCLYLKTGQADKVPNVLSEMKENGV

Query:  LPDNFSYRICISSYGAKSDVISMEKVLKEMESQTHISMDWTTYSMVASFFIKASMHEKAISYLRKCED----------------------------KLWA
         PDNFSYRICI+SYGA+S++ SME VL+EMESQ+HISMDW TYS VA+++I A + EKAI YL+KCED                            K W 
Subjt:  LPDNFSYRICISSYGAKSDVISMEKVLKEMESQTHISMDWTTYSMVASFFIKASMHEKAISYLRKCED----------------------------KLWA

Query:  SSGAEEGRPIGSDD----PSQIYISKTESCISLLSTLISFLFLFE-RLP---------------------------------------------------
            +  + +  D      S + I + E    +L    S  + ++ R+P                                                   
Subjt:  SSGAEEGRPIGSDD----PSQIYISKTESCISLLSTLISFLFLFE-RLP---------------------------------------------------

Query:  --------------KFSKPPTKM---------------------------------------ARDRNGE------LSTKRGSSDNDEPGEREGEDNGAPS
                      KF +P   +                                       A  R+G+       S K    D D+    +  D  +  
Subjt:  --------------KFSKPPTKM---------------------------------------ARDRNGE------LSTKRGSSDNDEPGEREGEDNGAPS

Query:  KRSSGRNSEKD-----PENSGSDVDEGRK-----RSKSRRKSRDSSDSDEKHDKKRGKSSRRK---GKSRRRYSSSEEDSDSEDTESDSSMYDSDSGHSD
          ++G    +D      + +G++ +E ++     +SKS+ KSR+SSDS+ + D+K  KS + K   GK +R  S S EDSDS          DSDS    
Subjt:  KRSSGRNSEKD-----PENSGSDVDEGRK-----RSKSRRKSRDSSDSDEKHDKKRGKSSRRK---GKSRRRYSSSEEDSDSEDTESDSSMYDSDSGHSD

Query:  AESASNSSGSEGDSESEEERRRRKRKERRRRRDKERERKRRRKEKEKKRRKKEREEEKRRKEKKKKEKKERGKKGAVTNSWGKYGIIKETDMWNKRPEFT
        + S+S  S SE DSE E  R +RK K+RRR R++ERERKRR++EKEKKRR+KE+ +E +R  KKKKEK ERGKKGAVTN WGKYGII+ETDMWNKRPEFT
Subjt:  AESASNSSGSEGDSESEEERRRRKRKERRRRRDKERERKRRRKEKEKKRRKKEREEEKRRKEKKKKEKKERGKKGAVTNSWGKYGIIKETDMWNKRPEFT

Query:  AWLAEIKKVNLESLANWEEKQMFKEFMEDHNTATFPSKKYYNLDAYYQRKIQKDMKKGHKKVLAGERTVFDDEEQRRQELLIEREKHKEEQVEALKRSMQ
        AWLAE+K+VNLESL NWEEKQMFK+FMEDHNTATFPSKKYYNLDAY++ K++K++KKG  KV   ERTVF+DEEQRR EL   REK KEE+VEALKRSMQ
Subjt:  AWLAEIKKVNLESLANWEEKQMFKEFMEDHNTATFPSKKYYNLDAYYQRKIQKDMKKGHKKVLAGERTVFDDEEQRRQELLIEREKHKEEQVEALKRSMQ

Query:  TGMAQAMKEQARLREEMAYQYKLGNFEVS
        +GMAQAMKEQA+L+EEMAYQYK+GNFE +
Subjt:  TGMAQAMKEQARLREEMAYQYKLGNFEVS

XP_004133704.1 vicilin-like seed storage protein At2g18540 [Cucumis sativus] (e value: 2.2e-143; identity: 90.83%)
Query:  MARDRNGELSTKRGSSDNDEPGEREGEDNGAPSKRSSGRNSEKDPENSGSDVDEGRKRSKSRRKSRDSSDSDEKHDKKRGKSSRRKGKSRRRYSSSEEDS
        MARDRNGELSTKRGSSDNDEPGE+ GEDN APSKR SG++SEKD ENSG D DEGRKR+KSRRKSR++ DSDEKH K+RG+SSRRKGKSRRRYS+SEEDS
Subjt:  MARDRNGELSTKRGSSDNDEPGEREGEDNGAPSKRSSGRNSEKDPENSGSDVDEGRKRSKSRRKSRDSSDSDEKHDKKRGKSSRRKGKSRRRYSSSEEDS

Query:  DSEDTESDSSMYDSDSGHSDAESASNSSGSEGDSESEEERRRRKRKERRRRRDKERERKRRRKEKEKKRRKKEREEEKRRKEKKKKEKKERGKKGAVTNS
        DSE+TESDSSMYDSDSGHSD+ESAS+SSGSE DSESE ERR+RKRKERRRRR+KERERKRRRKEKEKKR KKE+EEEKRRKEKKKKEKKERGKKGAVTNS
Subjt:  DSEDTESDSSMYDSDSGHSDAESASNSSGSEGDSESEEERRRRKRKERRRRRDKERERKRRRKEKEKKRRKKEREEEKRRKEKKKKEKKERGKKGAVTNS

Query:  WGKYGIIKETDMWNKRPEFTAWLAEIKKVNLESLANWEEKQMFKEFMEDHNTATFPSKKYYNLDAYYQRKIQKDMKKGHKKVLAGERTVFDDEEQRRQEL
        WGK+GIIKETDMWNKRPEFTAWLAEIKKVNLESLANWEEKQMFKEFMEDHNTATFPSKKYYNLDAYYQRKIQKDMKKG KKV+AGERTVFDDEEQRRQEL
Subjt:  WGKYGIIKETDMWNKRPEFTAWLAEIKKVNLESLANWEEKQMFKEFMEDHNTATFPSKKYYNLDAYYQRKIQKDMKKGHKKVLAGERTVFDDEEQRRQEL

Query:  LIEREKHKEEQVEALKRSMQTGMAQAMKEQARLREEMAYQYKLGNFEVS
        LIEREKHKEEQVE LKRSMQTGMAQAMKEQARLREEMAYQYKLGNFE +
Subjt:  LIEREKHKEEQVEALKRSMQTGMAQAMKEQARLREEMAYQYKLGNFEVS

XP_008452265.2 PREDICTED: LOW QUALITY PROTEIN: vicilin-like seed storage protein At2g18540 [Cucumis melo] (e value: 5.5e-147; identity: 93.12%)
Query:  MARDRNGELSTKRGSSDNDEPGEREGEDNGAPSKRSSGRNSEKDPENSGSDVDEGRKRSKSRRKSRDSSDSDEKHDKKRGKSSRRKGKSRRRYSSSEEDS
        MARDRNGEL+TKRGSSD+DEPGEREGEDN APSKR SG++SEKD ENSGSD DEGRKR+KSRRKSR++SDSDEKH K+RG+SSRRKGKSRRRYSSSEEDS
Subjt:  MARDRNGELSTKRGSSDNDEPGEREGEDNGAPSKRSSGRNSEKDPENSGSDVDEGRKRSKSRRKSRDSSDSDEKHDKKRGKSSRRKGKSRRRYSSSEEDS

Query:  DSEDTESDSSMYDSDSGHSDAESASNSSGSEGDSESEEERRRRKRKERRRRRDKERERKRRRKEKEKKRRKKEREEEKRRKEKKKKEKKERGKKGAVTNS
        DSEDTESDSSMYDS+SG+SD+ESAS+SSGSE DSESEEERRRRKRKERRRRR+KERERKRRRKEKEKKRRKK REEEKRRKEKKKKEKKERGKKGAVTNS
Subjt:  DSEDTESDSSMYDSDSGHSDAESASNSSGSEGDSESEEERRRRKRKERRRRRDKERERKRRRKEKEKKRRKKEREEEKRRKEKKKKEKKERGKKGAVTNS

Query:  WGKYGIIKETDMWNKRPEFTAWLAEIKKVNLESLANWEEKQMFKEFMEDHNTATFPSKKYYNLDAYYQRKIQKDMKKGHKKVLAGERTVFDDEEQRRQEL
        WGKYGIIKETDMWNKRPEFTAWLAEIKKVNLESLANWEEKQMFKEFMEDHNTATFPSKKYYNLDAYYQRKIQKDMKKGHKKV+AGERTVFDDEEQRRQEL
Subjt:  WGKYGIIKETDMWNKRPEFTAWLAEIKKVNLESLANWEEKQMFKEFMEDHNTATFPSKKYYNLDAYYQRKIQKDMKKGHKKVLAGERTVFDDEEQRRQEL

Query:  LIEREKHKEEQVEALKRSMQTGMAQAMKEQARLREEMAYQYKLGNFEVS
        LIEREKHKEEQVEALKRSMQTGMAQAMKEQARLREEMAYQYKLGNFE +
Subjt:  LIEREKHKEEQVEALKRSMQTGMAQAMKEQARLREEMAYQYKLGNFEVS

XP_038903584.1 pentatricopeptide repeat-containing protein At4g21705, mitochondrial-like isoform X1 [Benincasa hispida] (e value: 2.5e-139; identity: 85.38%)
Query:  MAAASAIFKIFRRSSSGCTRTARTETDAFYFVVLRLYSARRSCDRRNLFARISPLGDPELSVVPVLNQWIEEGRKIKDFELRRIVRDLRTSRRYGQALEV
        MAAASA+FKI  R SSGCTRT RTETDA+ FV+LR YSARRSC+RRNLFARISPLGDPEL+VVPVLNQWIEEGRKIK+FELRRIV DLRT RRYGQALE 
Subjt:  MAAASAIFKIFRRSSSGCTRTARTETDAFYFVVLRLYSARRSCDRRNLFARISPLGDPELSVVPVLNQWIEEGRKIKDFELRRIVRDLRTSRRYGQALEV

Query:  SAIEKHALISREKNACIVSEWMCSKGLFSLTTRDFAVQLDLIGRVRGLDSAEKYFGSVSNQEEIGKLYGALLNCYVREGLVDKSLAHMQKMKEMGFASSP
                         VSEWMCSKGLFSLTTRDFA+QLDLI RVRGLDSAEKYFGSVSNQEEIGKLYGALLNCYVREGLVDKSLAHMQKMKEMGFASS 
Subjt:  SAIEKHALISREKNACIVSEWMCSKGLFSLTTRDFAVQLDLIGRVRGLDSAEKYFGSVSNQEEIGKLYGALLNCYVREGLVDKSLAHMQKMKEMGFASSP

Query:  LCYNDIMCLYLKTGQADKVPNVLSEMKENGVLPDNFSYRICISSYGAKSDVISMEKVLKEMESQTHISMDWTTYSMVASFFIKASMHEKAISYLRKCEDK
        LCYNDIMCLYL TGQADKVPNVLSEMKENGVLPDNFSYRICISSYGA+SDVI+MEKVLKEME QTHISMDW TYSMVA+FFIKA MHEKA+SYLRKCEDK
Subjt:  LCYNDIMCLYLKTGQADKVPNVLSEMKENGVLPDNFSYRICISSYGAKSDVISMEKVLKEMESQTHISMDWTTYSMVASFFIKASMHEKAISYLRKCEDK

Query:  L
        +
Subjt:  L

XP_038906271.1 vicilin-like seed storage protein At2g18540 [Benincasa hispida] (e value: 6.7e-153; identity: 95.7%)
Query:  MARDRNGELSTKRGSSDNDEPGEREGEDNGAPSKRSSGRNSEKDPENSGSDVDEGRKRSKSRRKSRDSSDSDEKHDKKRGKSSRRKGKSRRRYSSSEEDS
        MARDRNGELSTKRGSSDN+EPGEREGEDNGAPSK SSGRNS+KD ENSGSD DEGRKR KSR KSRDS DSDEKHDKKRGKSSRRKGKSRRRYSSSEEDS
Subjt:  MARDRNGELSTKRGSSDNDEPGEREGEDNGAPSKRSSGRNSEKDPENSGSDVDEGRKRSKSRRKSRDSSDSDEKHDKKRGKSSRRKGKSRRRYSSSEEDS

Query:  DSEDTESDSSMYDSDSGHSDAESASNSSGSEGDSESEEERRRRKRKERRRRRDKERERKRRRKEKEKKRRKKEREEEKRRKEKKKKEKKERGKKGAVTNS
        DSEDTESDSSMYDSDSGHSDAESASNSSGSEGDS+SEEERRRRKRKERRRRRDKERERKRRRKEKEKKRR+KEREEEKRRKEKKKKEKKERGKKGAVTNS
Subjt:  DSEDTESDSSMYDSDSGHSDAESASNSSGSEGDSESEEERRRRKRKERRRRRDKERERKRRRKEKEKKRRKKEREEEKRRKEKKKKEKKERGKKGAVTNS

Query:  WGKYGIIKETDMWNKRPEFTAWLAEIKKVNLESLANWEEKQMFKEFMEDHNTATFPSKKYYNLDAYYQRKIQKDMKKGHKKVLAGERTVFDDEEQRRQEL
        WGKYGIIKETDMWNKRPEFTAWLAEIKKVNLESLANWEEKQMFKEFMEDHNTATFPSKKYYNLDAYYQRKIQKDMKKGHKK++ GERTVFDDEEQRRQEL
Subjt:  WGKYGIIKETDMWNKRPEFTAWLAEIKKVNLESLANWEEKQMFKEFMEDHNTATFPSKKYYNLDAYYQRKIQKDMKKGHKKVLAGERTVFDDEEQRRQEL

Query:  LIEREKHKEEQVEALKRSMQTGMAQAMKEQARLREEMAYQYKLGNFEVS
        LIEREKHKEEQVEALKRSMQTGMAQAMKEQARLREEMAYQYKLGNFE +
Subjt:  LIEREKHKEEQVEALKRSMQTGMAQAMKEQARLREEMAYQYKLGNFEVS

TrEMBL top hits (columns: e value, % identity, alignment)
A0A0A0L7Y2 Uncharacterized protein (e value: 2.1e-139; identity: 84.39%)
Query:  MAAASAIFKIFRRSSSGCTRTARTETDAFYFVVLRLYSARRSCDRRNLFARISPLGDPELSVVPVLNQWIEEGRKIKDFELRRIVRDLRTSRRYGQALEV
        MAAASA+FKI  RSSSGCTRT R ETDAF FV LRLYS RRSCDRRNL+ARISPLGDPE +VVPVLNQWIEEGR IKDFELRRIVRDLRT RRY QALE 
Subjt:  MAAASAIFKIFRRSSSGCTRTARTETDAFYFVVLRLYSARRSCDRRNLFARISPLGDPELSVVPVLNQWIEEGRKIKDFELRRIVRDLRTSRRYGQALEV

Query:  SAIEKHALISREKNACIVSEWMCSKGLFSLTTRDFAVQLDLIGRVRGLDSAEKYFGSVSNQEEIGKLYGALLNCYVREGLVDKSLAHMQKMKEMGFASSP
                         VSEWMCSKGLFSLTTRDFA+QLDLIG+VRGLDSAEKYFGSVSNQ+EIGKLYGALLNCYVREGL+DKSLAHMQKMKEMG ASSP
Subjt:  SAIEKHALISREKNACIVSEWMCSKGLFSLTTRDFAVQLDLIGRVRGLDSAEKYFGSVSNQEEIGKLYGALLNCYVREGLVDKSLAHMQKMKEMGFASSP

Query:  LCYNDIMCLYLKTGQADKVPNVLSEMKENGVLPDNFSYRICISSYGAKSDVISMEKVLKEMESQTHISMDWTTYSMVASFFIKASMHEKAISYLRKCEDK
        LCYNDIMCLYL TGQADKVPNVLSEMKENGVLPDNFSYRICISSYGA+SDVISME VLKEME QTHISMDWTTYSMVA FFIKA MH+KA++YLRKCEDK
Subjt:  LCYNDIMCLYLKTGQADKVPNVLSEMKENGVLPDNFSYRICISSYGAKSDVISMEKVLKEMESQTHISMDWTTYSMVASFFIKASMHEKAISYLRKCEDK

Query:  L
        +
Subjt:  L

A0A1S3BU92 LOW QUALITY PROTEIN: vicilin-like seed storage protein At2g18540 (e value: 2.7e-147; identity: 93.12%)
Query:  MARDRNGELSTKRGSSDNDEPGEREGEDNGAPSKRSSGRNSEKDPENSGSDVDEGRKRSKSRRKSRDSSDSDEKHDKKRGKSSRRKGKSRRRYSSSEEDS
        MARDRNGEL+TKRGSSD+DEPGEREGEDN APSKR SG++SEKD ENSGSD DEGRKR+KSRRKSR++SDSDEKH K+RG+SSRRKGKSRRRYSSSEEDS
Subjt:  MARDRNGELSTKRGSSDNDEPGEREGEDNGAPSKRSSGRNSEKDPENSGSDVDEGRKRSKSRRKSRDSSDSDEKHDKKRGKSSRRKGKSRRRYSSSEEDS

Query:  DSEDTESDSSMYDSDSGHSDAESASNSSGSEGDSESEEERRRRKRKERRRRRDKERERKRRRKEKEKKRRKKEREEEKRRKEKKKKEKKERGKKGAVTNS
        DSEDTESDSSMYDS+SG+SD+ESAS+SSGSE DSESEEERRRRKRKERRRRR+KERERKRRRKEKEKKRRKK REEEKRRKEKKKKEKKERGKKGAVTNS
Subjt:  DSEDTESDSSMYDSDSGHSDAESASNSSGSEGDSESEEERRRRKRKERRRRRDKERERKRRRKEKEKKRRKKEREEEKRRKEKKKKEKKERGKKGAVTNS

Query:  WGKYGIIKETDMWNKRPEFTAWLAEIKKVNLESLANWEEKQMFKEFMEDHNTATFPSKKYYNLDAYYQRKIQKDMKKGHKKVLAGERTVFDDEEQRRQEL
        WGKYGIIKETDMWNKRPEFTAWLAEIKKVNLESLANWEEKQMFKEFMEDHNTATFPSKKYYNLDAYYQRKIQKDMKKGHKKV+AGERTVFDDEEQRRQEL
Subjt:  WGKYGIIKETDMWNKRPEFTAWLAEIKKVNLESLANWEEKQMFKEFMEDHNTATFPSKKYYNLDAYYQRKIQKDMKKGHKKVLAGERTVFDDEEQRRQEL

Query:  LIEREKHKEEQVEALKRSMQTGMAQAMKEQARLREEMAYQYKLGNFEVS
        LIEREKHKEEQVEALKRSMQTGMAQAMKEQARLREEMAYQYKLGNFE +
Subjt:  LIEREKHKEEQVEALKRSMQTGMAQAMKEQARLREEMAYQYKLGNFEVS

A0A1S4DZ16 pentatricopeptide repeat-containing protein At4g21705, mitochondrial-like (e value: 1.2e-134; identity: 82.06%)
Query:  MAAASAIFKIFRRSSSGCTRTARTETDAFYFVVLRLYSARRSCDRRNLFARISPLGDPELSVVPVLNQWIEEGRKIKDFELRRIVRDLRTSRRYGQALEV
        MAAASA+FKI  RSSSGCTRT R ETDAF FV LRLYS RRSC+RR L+A ISPLGDP+ SVVPVLNQWI+EGRKIKDFELRRIVRDLRT RRY QALE 
Subjt:  MAAASAIFKIFRRSSSGCTRTARTETDAFYFVVLRLYSARRSCDRRNLFARISPLGDPELSVVPVLNQWIEEGRKIKDFELRRIVRDLRTSRRYGQALEV

Query:  SAIEKHALISREKNACIVSEWMCSKGLFSLTTRDFAVQLDLIGRVRGLDSAEKYFGSVSNQEEIGKLYGALLNCYVREGLVDKSLAHMQKMKEMGFASSP
                         VSEWMCSKG FSLTTRDFA+QLDLIG+VRGLDSAEKYFGSVS Q+EIGKLYG+LLNCYVREGL+DKSLAHMQKMKEMGFASSP
Subjt:  SAIEKHALISREKNACIVSEWMCSKGLFSLTTRDFAVQLDLIGRVRGLDSAEKYFGSVSNQEEIGKLYGALLNCYVREGLVDKSLAHMQKMKEMGFASSP

Query:  LCYNDIMCLYLKTGQADKVPNVLSEMKENGVLPDNFSYRICISSYGAKSDVISMEKVLKEMESQTHISMDWTTYSMVASFFIKASMHEKAISYLRKCEDK
        LCYNDIMCLYL TGQADKVPNVLSEMKENGVLPDNFSYRICISSYGA+SDVISME VLKEMESQTHISMDW TYSMVA FFIK  MH+KA +YLRKCED+
Subjt:  LCYNDIMCLYLKTGQADKVPNVLSEMKENGVLPDNFSYRICISSYGAKSDVISMEKVLKEMESQTHISMDWTTYSMVASFFIKASMHEKAISYLRKCEDK

Query:  L
        +
Subjt:  L

A0A2H5PUS1 Uncharacterized protein (e value: 3.9e-146; identity: 45.84%)
Query:  VVLRLYSARRSCDRRNLFARISPLGDPELSVVPVLNQWIEEGRKIKDFELRRIVRDLRTSRRYGQALEVSAIEKHALISREKNACIVSEWMCSKGLFSLT
        +  R Y A +   R NL++RISPLGDP++S+ PVL+QW+ EG+KI + EL+R++R LR+ +R+  AL+                  VSEWM  +GL + +
Subjt:  VVLRLYSARRSCDRRNLFARISPLGDPELSVVPVLNQWIEEGRKIKDFELRRIVRDLRTSRRYGQALEVSAIEKHALISREKNACIVSEWMCSKGLFSLT

Query:  TRDFAVQLDLIGRVRGLDSAEKYFGSVSNQEEIGKLYGALLNCYVREGLVDKSLAHMQKMKEMGFASSPLCYNDIMCLYLKTGQADKVPNVLSEMKENGV
          D AVQLDLIG+VRGL+SAE YF S+++++++ KLYGALLNCYVREGLVD+SL+ MQKMKEMG   S L YN IMCLY  TGQ +K+P+VL +MKENGV
Subjt:  TRDFAVQLDLIGRVRGLDSAEKYFGSVSNQEEIGKLYGALLNCYVREGLVDKSLAHMQKMKEMGFASSPLCYNDIMCLYLKTGQADKVPNVLSEMKENGV

Query:  LPDNFSYRICISSYGAKSDVISMEKVLKEMESQTHISMDWTTYSMVASFFIKASMHEKAISYLRKCED----------------------------KLWA
         PDNFSYRICI+SYGA+S++ SME VL+EMESQ+HISMDW TYS VA+++I A + EKAI YL+KCED                            K W 
Subjt:  LPDNFSYRICISSYGAKSDVISMEKVLKEMESQTHISMDWTTYSMVASFFIKASMHEKAISYLRKCED----------------------------KLWA

Query:  SSGAEEGRPIGSDD----PSQIYISKTESCISLLSTLISFLFLFE-RLP---------------------------------------------------
            +  + +  D      S + I + E    +L    S  + ++ R+P                                                   
Subjt:  SSGAEEGRPIGSDD----PSQIYISKTESCISLLSTLISFLFLFE-RLP---------------------------------------------------

Query:  --------------KFSKPPTKM---------------------------------------ARDRNGE------LSTKRGSSDNDEPGEREGEDNGAPS
                      KF +P   +                                       A  R+G+       S K    D D+    +  D  +  
Subjt:  --------------KFSKPPTKM---------------------------------------ARDRNGE------LSTKRGSSDNDEPGEREGEDNGAPS

Query:  KRSSGRNSEKD-----PENSGSDVDEGRK-----RSKSRRKSRDSSDSDEKHDKKRGKSSRRK---GKSRRRYSSSEEDSDSEDTESDSSMYDSDSGHSD
          ++G    +D      + +G++ +E ++     +SKS+ KSR+SSDS+ + D+K  KS + K   GK +R  S S EDSDS          DSDS    
Subjt:  KRSSGRNSEKD-----PENSGSDVDEGRK-----RSKSRRKSRDSSDSDEKHDKKRGKSSRRK---GKSRRRYSSSEEDSDSEDTESDSSMYDSDSGHSD

Query:  AESASNSSGSEGDSESEEERRRRKRKERRRRRDKERERKRRRKEKEKKRRKKEREEEKRRKEKKKKEKKERGKKGAVTNSWGKYGIIKETDMWNKRPEFT
        + S+S  S SE DSE E  R +RK K+RRR R++ERERKRR++EKEKKRR+KE+ +E +R  KKKKEK ERGKKGAVTN WGKYGII+ETDMWNKRPEFT
Subjt:  AESASNSSGSEGDSESEEERRRRKRKERRRRRDKERERKRRRKEKEKKRRKKEREEEKRRKEKKKKEKKERGKKGAVTNSWGKYGIIKETDMWNKRPEFT

Query:  AWLAEIKKVNLESLANWEEKQMFKEFMEDHNTATFPSKKYYNLDAYYQRKIQKDMKKGHKKVLAGERTVFDDEEQRRQELLIEREKHKEEQVEALKRSMQ
        AWLAE+K+VNLESL NWEEKQMFK+FMEDHNTATFPSKKYYNLDAY++ K++K++KKG  KV   ERTVF+DEEQRR EL   REK KEE+VEALKRSMQ
Subjt:  AWLAEIKKVNLESLANWEEKQMFKEFMEDHNTATFPSKKYYNLDAYYQRKIQKDMKKGHKKVLAGERTVFDDEEQRRQELLIEREKHKEEQVEALKRSMQ

Query:  TGMAQAMKEQARLREEMAYQYKLGNFEVS
        +GMAQAMKEQA+L+EEMAYQYK+GNFE +
Subjt:  TGMAQAMKEQARLREEMAYQYKLGNFEVS

A0A6J1JQ72 pentatricopeptide repeat-containing protein At4g21705, mitochondrial-like (e value: 4.4e-134; identity: 82.06%)
Query:  MAAASAIFKIFRRSSSGCTRTARTETDAFYFVVLRLYSARRSCDRRNLFARISPLGDPELSVVPVLNQWIEEGRKIKDFELRRIVRDLRTSRRYGQALEV
        +AAA A+FKI R  SSG TRTARTETDAF FV LRLYSARR+C+RRNLFARISPLG PELSVVP+L+QWI+EGR IKDFELRRIVRDLR  RRYGQALE 
Subjt:  MAAASAIFKIFRRSSSGCTRTARTETDAFYFVVLRLYSARRSCDRRNLFARISPLGDPELSVVPVLNQWIEEGRKIKDFELRRIVRDLRTSRRYGQALEV

Query:  SAIEKHALISREKNACIVSEWMCSKGLFSLTTRDFAVQLDLIGRVRGLDSAEKYFGSVSNQEEIGKLYGALLNCYVREGLVDKSLAHMQKMKEMGFASSP
                         VSEWM SKGLFS TTRDFAVQLDLIGRV+GLDSAEKYF SVSNQEE+GKLYGALLNCYVREGLVDK+L+HMQKMKEMGFASSP
Subjt:  SAIEKHALISREKNACIVSEWMCSKGLFSLTTRDFAVQLDLIGRVRGLDSAEKYFGSVSNQEEIGKLYGALLNCYVREGLVDKSLAHMQKMKEMGFASSP

Query:  LCYNDIMCLYLKTGQADKVPNVLSEMKENGVLPDNFSYRICISSYGAKSDVISMEKVLKEMESQTHISMDWTTYSMVASFFIKASMHEKAISYLRKCEDK
        LCYNDIMCLYL TGQ DKVPNVLSEMKENGVLPDN+SYRICISSYGA+SD+I M KVLKEMESQTHISMDWTTYSMVA+FFIKA MHEKA+SYLRKCEDK
Subjt:  LCYNDIMCLYLKTGQADKVPNVLSEMKENGVLPDNFSYRICISSYGAKSDVISMEKVLKEMESQTHISMDWTTYSMVASFFIKASMHEKAISYLRKCEDK

Query:  L
        +
Subjt:  L

SwissProt top hits (columns: e value, % identity, alignment)
O22714 Pentatricopeptide repeat-containing protein At1g60770 (e value: 1.3e-34; identity: 31.62%)
Query:  LFARISPLGDPELSVVPVLNQWIEEGRKIKDFELRRIVRDLRTSRRYGQALEVSAIEKHALISREKNACIVSEWMCSKGLFSLTTRDFAVQLDLIGRVRG
        L+ R+   G  E+ V   LNQ+++  + +  +E+   ++ LR    Y  AL+                  +SE M  +G+ + T  D A+ LDL+ + R 
Subjt:  LFARISPLGDPELSVVPVLNQWIEEGRKIKDFELRRIVRDLRTSRRYGQALEVSAIEKHALISREKNACIVSEWMCSKGLFSLTTRDFAVQLDLIGRVRG

Query:  LDSAEKYFGSVSNQEEIGKLYGALLNCYVREGLVDKSLAHMQKMKEMGFASSPLCYNDIMCLYLKTGQADKVPNVLSEMKENGVLPDNFSYRICISSYGA
        + + E YF  +    +    YG+LLNCY +E L +K+   + KMKE+    S + YN +M LY KTG+ +KVP ++ E+K   V+PD+++Y + + +  A
Subjt:  LDSAEKYFGSVSNQEEIGKLYGALLNCYVREGLVDKSLAHMQKMKEMGFASSPLCYNDIMCLYLKTGQADKVPNVLSEMKENGVLPDNFSYRICISSYGA

Query:  KSDVISMEKVLKEMESQTHISMDWTTYSMVASFFIKASMHEKAISYLRKCEDK
         +D+  +E+V++EM     ++ DWTTYS +AS ++ A + +KA   L++ E K
Subjt:  KSDVISMEKVLKEMESQTHISMDWTTYSMVASFFIKASMHEKAISYLRKCEDK

Q84JR3 Pentatricopeptide repeat-containing protein At4g21705, mitochondrial (e value: 1.8e-60; identity: 44.73%)
Query:  VVLRLYSARRSCDRRNLFARISPLGDPELSVVPVLNQWIEEGRKIKDFELRRIVRDLRTSRRYGQALEVSAIEKHALISREKNACIVSEWMCSKGLFSLT
        +  R Y   R   +  L+++ISPLGDP+ SV P L  W++ G+K+   EL RIV DLR  +R+  ALE                  VS+WM   G+   +
Subjt:  VVLRLYSARRSCDRRNLFARISPLGDPELSVVPVLNQWIEEGRKIKDFELRRIVRDLRTSRRYGQALEVSAIEKHALISREKNACIVSEWMCSKGLFSLT

Query:  TRDFAVQLDLIGRVRGLDSAEKYFGSVSNQEEIGKLYGALLNCYVREGLVDKSLAHMQKMKEMGFASSPLCYNDIMCLYLKTGQADKVPNVLSEMKENGV
          + AV LDLIGRV G  +AE+YF ++  Q +  K YGALLNCYVR+  V+KSL H +KMKEMGF +S L YN+IMCLY   GQ +KVP VL EMKE  V
Subjt:  TRDFAVQLDLIGRVRGLDSAEKYFGSVSNQEEIGKLYGALLNCYVREGLVDKSLAHMQKMKEMGFASSPLCYNDIMCLYLKTGQADKVPNVLSEMKENGV

Query:  LPDNFSYRICISSYGAKSDVISMEKVLKEMESQTHISMDWTTYSMVASFFIKASMHEKAISYLRKCEDKLWASSG
         PDN+SYRICI+++GA  D+  +   L++ME +  I+MDW TY++ A F+I     ++A+  L+  E++L    G
Subjt:  LPDNFSYRICISSYGAKSDVISMEKVLKEMESQTHISMDWTTYSMVASFFIKASMHEKAISYLRKCEDKLWASSG

Q8LPS6 Pentatricopeptide repeat-containing protein At1g02150 (e value: 3.2e-41; identity: 37.22%)
Query:  YSARRSCDRRNLFARISPLGDPELSVVPVLNQWIEEGRKIKDFELRRIVRDLRTSRRYGQALEVSAIEKHALISREKNACIVSEWMCSKG-LFSLTTRDF
        Y  R       ++ +IS +  PEL    VLNQW + GRK+  +EL R+V++LR  +R  QALE                  V +WM ++G  F L+  D 
Subjt:  YSARRSCDRRNLFARISPLGDPELSVVPVLNQWIEEGRKIKDFELRRIVRDLRTSRRYGQALEVSAIEKHALISREKNACIVSEWMCSKG-LFSLTTRDF

Query:  AVQLDLIGRVRGLDSAEKYFGSVSNQEEIGKLYGALLNCYVREGLVDKSLAHMQKMKEMGFASSPLCYNDIMCLYLKTGQADKVPNVLSEMKENGVLPDN
        A+QLDLIG+VRG+  AE++F  +    +  ++YG+LLN YVR    +K+ A +  M++ G+A  PL +N +M LY+   + DKV  ++ EMK+  +  D 
Subjt:  AVQLDLIGRVRGLDSAEKYFGSVSNQEEIGKLYGALLNCYVREGLVDKSLAHMQKMKEMGFASSPLCYNDIMCLYLKTGQADKVPNVLSEMKENGVLPDN

Query:  FSYRICISSYGAKSDVISMEKVLKEMESQTHISMDWTTYSMVASFFIKASMHEKAISYLRKCEDKL
        +SY I +SS G+   V  ME V ++M+S   I  +WTT+S +A+ +IK    EKA   LRK E ++
Subjt:  FSYRICISSYGAKSDVISMEKVLKEMESQTHISMDWTTYSMVASFFIKASMHEKAISYLRKCEDKL

Q9FZ24 Pentatricopeptide repeat-containing protein At1g02370, mitochondrial (e value: 1.3e-34; identity: 32.17%)
Query:  RRNLFARISPLGDPELSVVPVLNQWIEEGRKIKDFELRRIVRDLRTSRRYGQALEVSAIEKHALISREKNACIVSEWMCSKGLFSLTTRDFAVQLDLIGR
        +R L+ ++S L     +V   LNQ+I EG  ++  +L R  + LR  RR   A E                  + +WM  K   + +  D A+ LDLIG+
Subjt:  RRNLFARISPLGDPELSVVPVLNQWIEEGRKIKDFELRRIVRDLRTSRRYGQALEVSAIEKHALISREKNACIVSEWMCSKGLFSLTTRDFAVQLDLIGR

Query:  VRGLDSAEKYFGSVS-NQEEIGKLYGALLNCYVREGLVDKSLAHMQKMKEMGFASSPLCYNDIMCLYLKTGQADKVPNVLSEMKENGVLPDNFSYRICIS
         +GL++AE YF ++  + +     YGAL+NCY  E   +K+ AH + M E+ F ++ L +N++M +Y++  Q +KVP ++  MK+ G+ P   +Y I + 
Subjt:  VRGLDSAEKYFGSVS-NQEEIGKLYGALLNCYVREGLVDKSLAHMQKMKEMGFASSPLCYNDIMCLYLKTGQADKVPNVLSEMKENGVLPDNFSYRICIS

Query:  SYGAKSDVISMEKVLKEMESQTHISMDWTTYSMVASFFIKASMHEKAISYLRKCEDKL
        S G+ +D+  +EK++ EM   +     W T+S +A+ + KA ++EKA S L+  E+K+
Subjt:  SYGAKSDVISMEKVLKEMESQTHISMDWTTYSMVASFFIKASMHEKAISYLRKCEDKL

Q9SKU6 Pentatricopeptide repeat-containing protein At2g20710, mitochondrial (e value: 2.8e-40; identity: 35.89%)
Query:  RISPLGDPELSVVPVLNQWIEEGRKIKDFELRRIVRDLRTSRRYGQALEVSAIEKHALISREKNACIVSEWMCSKGLFSLTTRDFAVQLDLIGRVRGLDS
        R++  GDP  S++ VL+ W+++G  +K  EL  I++ LR   R+  AL+                  +S+WM    +  ++  D A++LDLI +V GL  
Subjt:  RISPLGDPELSVVPVLNQWIEEGRKIKDFELRRIVRDLRTSRRYGQALEVSAIEKHALISREKNACIVSEWMCSKGLFSLTTRDFAVQLDLIGRVRGLDS

Query:  AEKYFGSVSNQEEIGKLYGALLNCYVREGLVDKSLAHMQKMKEMGFASSPLCYNDIMCLYLKTGQADKVPNVLSEMKENGVLPDNFSYRICISSYGAKSD
        AEK+F ++  +     LYGALLNCY  + ++ K+    Q+MKE+GF    L YN ++ LY++TG+   V  +L EM++  V PD F+    + +Y   SD
Subjt:  AEKYFGSVSNQEEIGKLYGALLNCYVREGLVDKSLAHMQKMKEMGFASSPLCYNDIMCLYLKTGQADKVPNVLSEMKENGVLPDNFSYRICISSYGAKSD

Query:  VISMEKVLKEMESQTHISMDWTTYSMVASFFIKASMHEKAISYLRKCE
        V  MEK L   E+   + +DW TY+  A+ +IKA + EKA+  LRK E
Subjt:  VISMEKVLKEMESQTHISMDWTTYSMVASFFIKASMHEKAISYLRKCE

Arabidopsis top hits (columns: e value, % identity, alignment)
AT1G02150.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 2.3e-42; identity: 37.22%)
Query:  YSARRSCDRRNLFARISPLGDPELSVVPVLNQWIEEGRKIKDFELRRIVRDLRTSRRYGQALEVSAIEKHALISREKNACIVSEWMCSKG-LFSLTTRDF
        Y  R       ++ +IS +  PEL    VLNQW + GRK+  +EL R+V++LR  +R  QALE                  V +WM ++G  F L+  D 
Subjt:  YSARRSCDRRNLFARISPLGDPELSVVPVLNQWIEEGRKIKDFELRRIVRDLRTSRRYGQALEVSAIEKHALISREKNACIVSEWMCSKG-LFSLTTRDF

Query:  AVQLDLIGRVRGLDSAEKYFGSVSNQEEIGKLYGALLNCYVREGLVDKSLAHMQKMKEMGFASSPLCYNDIMCLYLKTGQADKVPNVLSEMKENGVLPDN
        A+QLDLIG+VRG+  AE++F  +    +  ++YG+LLN YVR    +K+ A +  M++ G+A  PL +N +M LY+   + DKV  ++ EMK+  +  D 
Subjt:  AVQLDLIGRVRGLDSAEKYFGSVSNQEEIGKLYGALLNCYVREGLVDKSLAHMQKMKEMGFASSPLCYNDIMCLYLKTGQADKVPNVLSEMKENGVLPDN

Query:  FSYRICISSYGAKSDVISMEKVLKEMESQTHISMDWTTYSMVASFFIKASMHEKAISYLRKCEDKL
        +SY I +SS G+   V  ME V ++M+S   I  +WTT+S +A+ +IK    EKA   LRK E ++
Subjt:  FSYRICISSYGAKSDVISMEKVLKEMESQTHISMDWTTYSMVASFFIKASMHEKAISYLRKCEDKL

AT1G60770.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 9.4e-36; identity: 31.62%)
Query:  LFARISPLGDPELSVVPVLNQWIEEGRKIKDFELRRIVRDLRTSRRYGQALEVSAIEKHALISREKNACIVSEWMCSKGLFSLTTRDFAVQLDLIGRVRG
        L+ R+   G  E+ V   LNQ+++  + +  +E+   ++ LR    Y  AL+                  +SE M  +G+ + T  D A+ LDL+ + R 
Subjt:  LFARISPLGDPELSVVPVLNQWIEEGRKIKDFELRRIVRDLRTSRRYGQALEVSAIEKHALISREKNACIVSEWMCSKGLFSLTTRDFAVQLDLIGRVRG

Query:  LDSAEKYFGSVSNQEEIGKLYGALLNCYVREGLVDKSLAHMQKMKEMGFASSPLCYNDIMCLYLKTGQADKVPNVLSEMKENGVLPDNFSYRICISSYGA
        + + E YF  +    +    YG+LLNCY +E L +K+   + KMKE+    S + YN +M LY KTG+ +KVP ++ E+K   V+PD+++Y + + +  A
Subjt:  LDSAEKYFGSVSNQEEIGKLYGALLNCYVREGLVDKSLAHMQKMKEMGFASSPLCYNDIMCLYLKTGQADKVPNVLSEMKENGVLPDNFSYRICISSYGA

Query:  KSDVISMEKVLKEMESQTHISMDWTTYSMVASFFIKASMHEKAISYLRKCEDK
         +D+  +E+V++EM     ++ DWTTYS +AS ++ A + +KA   L++ E K
Subjt:  KSDVISMEKVLKEMESQTHISMDWTTYSMVASFFIKASMHEKAISYLRKCEDK

AT2G20710.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 2.0e-41; identity: 35.89%)
Query:  RISPLGDPELSVVPVLNQWIEEGRKIKDFELRRIVRDLRTSRRYGQALEVSAIEKHALISREKNACIVSEWMCSKGLFSLTTRDFAVQLDLIGRVRGLDS
        R++  GDP  S++ VL+ W+++G  +K  EL  I++ LR   R+  AL+                  +S+WM    +  ++  D A++LDLI +V GL  
Subjt:  RISPLGDPELSVVPVLNQWIEEGRKIKDFELRRIVRDLRTSRRYGQALEVSAIEKHALISREKNACIVSEWMCSKGLFSLTTRDFAVQLDLIGRVRGLDS

Query:  AEKYFGSVSNQEEIGKLYGALLNCYVREGLVDKSLAHMQKMKEMGFASSPLCYNDIMCLYLKTGQADKVPNVLSEMKENGVLPDNFSYRICISSYGAKSD
        AEK+F ++  +     LYGALLNCY  + ++ K+    Q+MKE+GF    L YN ++ LY++TG+   V  +L EM++  V PD F+    + +Y   SD
Subjt:  AEKYFGSVSNQEEIGKLYGALLNCYVREGLVDKSLAHMQKMKEMGFASSPLCYNDIMCLYLKTGQADKVPNVLSEMKENGVLPDNFSYRICISSYGAKSD

Query:  VISMEKVLKEMESQTHISMDWTTYSMVASFFIKASMHEKAISYLRKCE
        V  MEK L   E+   + +DW TY+  A+ +IKA + EKA+  LRK E
Subjt:  VISMEKVLKEMESQTHISMDWTTYSMVASFFIKASMHEKAISYLRKCE

AT4G21705.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 1.3e-61; identity: 44.73%)
Query:  VVLRLYSARRSCDRRNLFARISPLGDPELSVVPVLNQWIEEGRKIKDFELRRIVRDLRTSRRYGQALEVSAIEKHALISREKNACIVSEWMCSKGLFSLT
        +  R Y   R   +  L+++ISPLGDP+ SV P L  W++ G+K+   EL RIV DLR  +R+  ALE                  VS+WM   G+   +
Subjt:  VVLRLYSARRSCDRRNLFARISPLGDPELSVVPVLNQWIEEGRKIKDFELRRIVRDLRTSRRYGQALEVSAIEKHALISREKNACIVSEWMCSKGLFSLT

Query:  TRDFAVQLDLIGRVRGLDSAEKYFGSVSNQEEIGKLYGALLNCYVREGLVDKSLAHMQKMKEMGFASSPLCYNDIMCLYLKTGQADKVPNVLSEMKENGV
          + AV LDLIGRV G  +AE+YF ++  Q +  K YGALLNCYVR+  V+KSL H +KMKEMGF +S L YN+IMCLY   GQ +KVP VL EMKE  V
Subjt:  TRDFAVQLDLIGRVRGLDSAEKYFGSVSNQEEIGKLYGALLNCYVREGLVDKSLAHMQKMKEMGFASSPLCYNDIMCLYLKTGQADKVPNVLSEMKENGV

Query:  LPDNFSYRICISSYGAKSDVISMEKVLKEMESQTHISMDWTTYSMVASFFIKASMHEKAISYLRKCEDKLWASSG
         PDN+SYRICI+++GA  D+  +   L++ME +  I+MDW TY++ A F+I     ++A+  L+  E++L    G
Subjt:  LPDNFSYRICISSYGAKSDVISMEKVLKEMESQTHISMDWTTYSMVASFFIKASMHEKAISYLRKCEDKLWASSG

AT5G53800.1 unknown protein (e value: 1.9e-60; identity: 55%)
Query:  STKRGSSDNDEPGEREGEDNGAPSKRSSGRNSEKDPENSGSDVDEGRKRSKSRRKSRDSSDSDEKHDKKRGKSSRRKGKSRRRYSSSEEDSDSEDTESDS
        S + G+   +   E E  D    S R     S KD   SG + D  R+R +S+R+++D +DS  +   + G  S ++ + R R    +  SD + + S  
Subjt:  STKRGSSDNDEPGEREGEDNGAPSKRSSGRNSEKDPENSGSDVDEGRKRSKSRRKSRDSSDSDEKHDKKRGKSSRRKGKSRRRYSSSEEDSDSEDTESDS

Query:  SMYDSDSGHSDAESASNSSGSEG-DSESEEERRRRKRKERRRRRDKERERKRRRKEKEKKRR-KKEREEEKRRKEKKKKEKKERGKKGAVTNSWGKYGII
           D  S  SD+ES S S  S+  +SESE+ERRRRKRK R+ R ++E+ERKRRR+EK+KK+R K +++ +K+RKEKKKK K E+ KKGAVT SWGKYGII
Subjt:  SMYDSDSGHSDAESASNSSGSEG-DSESEEERRRRKRKERRRRRDKERERKRRRKEKEKKRR-KKEREEEKRRKEKKKKEKKERGKKGAVTNSWGKYGII

Query:  KETDMWNKRPEFTAWLAEIKKVNLESLANWEEKQMFKEFMEDHNTATFPSKKYYNLDAYYQRKIQKDMKKGHKKVLAGERTVFDDEEQRRQELLIEREKH
        +ETDMWNKRPEFTAWL E+KKVNLESL  WEEK+MFK+FMEDHNT TF SKKYY++D YY+ K++K+MKKG KK    ERTVF+DEEQRR E+   RE+ 
Subjt:  KETDMWNKRPEFTAWLAEIKKVNLESLANWEEKQMFKEFMEDHNTATFPSKKYYNLDAYYQRKIQKDMKKGHKKVLAGERTVFDDEEQRRQELLIEREKH

Query:  KEEQVEALKRSMQTGMAQAMKEQARLREEMAYQYKLGNFE
        KEE+V ALKRSM+ GMAQAMKEQARL+EEM Y YK+G+ E
Subjt:  KEEQVEALKRSMQTGMAQAMKEQARLREEMAYQYKLGNFE
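
In the alignment blocks above, the unlabeled middle line repeats a residue where the two sequences agree, shows `+` for a similar substitution, and is blank otherwise. As a rough sketch, identities can be counted from one Query/middle-line pair as below. The function name `midline_stats` is hypothetical, the `Query:` prefix and its matching indentation are assumed to have been stripped first, and a single block will not reproduce the full-alignment % identity listed with each hit.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Count identities and positives in one BLAST-style alignment block."""
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    # A letter on the midline that equals the query residue marks an identity.
    identities = sum(
        1 for q, m in zip(query, midline) if m.isalpha() and m == q
    )
    # '+' marks a conservative (similar) substitution.
    similar = midline.count("+")
    length = len(query)
    return {
        "length": length,
        "identities": identities,
        "positives": identities + similar,
        "percent_identity": round(100 * identities / length, 2),
    }

# First eight columns of the last AT5G53800.1 block shown above.
print(midline_stats("KEEQVEAL", "KEE+V AL"))
```

For these eight columns the sketch counts 6 identities and 7 positives, which gives 75.0% identity over the fragment.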


Sequences
CDS sequence
ATGGCGGCGGCTTCAGCGATTTTCAAAATCTTCAGGAGGTCATCTTCAGGCTGCACTAGAACAGCAAGAACAGAGACAGATGCATTTTATTTTGTGGTGTTGAGATTATA
CAGCGCAAGACGAAGCTGCGATCGAAGAAACCTCTTTGCGAGGATTAGTCCTCTTGGTGACCCTGAGCTTAGTGTAGTGCCGGTTCTTAATCAATGGATTGAGGAAGGCA
GGAAGATCAAGGATTTTGAGCTCCGGAGAATCGTTCGCGACCTTCGTACTTCCCGGCGGTATGGGCAAGCCCTTGAGGTGAGCGCAATTGAAAAGCACGCCCTAATTTCC
CGTGAAAAGAATGCTTGTATTGTGTCTGAATGGATGTGTAGCAAGGGACTTTTTTCCCTCACAACAAGAGACTTTGCCGTGCAGCTTGATTTGATTGGCCGAGTTCGTGG
GTTGGATTCTGCAGAGAAGTATTTTGGCAGTGTTTCTAACCAAGAGGAAATTGGTAAACTCTATGGTGCTCTTCTAAATTGTTATGTCAGGGAAGGCCTTGTAGATAAGT
CCCTTGCTCATATGCAGAAGATGAAGGAGATGGGTTTTGCTTCCTCTCCCCTCTGCTACAATGATATAATGTGTCTATATTTGAAAACTGGCCAGGCTGATAAGGTTCCA
AATGTACTGTCTGAAATGAAGGAGAATGGTGTTCTTCCTGACAATTTTAGCTATAGAATTTGTATCAGTTCTTATGGAGCTAAGTCTGATGTAATCAGTATGGAAAAGGT
CTTGAAAGAAATGGAGAGTCAAACTCACATATCCATGGATTGGACTACTTATTCAATGGTTGCCAGTTTTTTCATCAAAGCCAGTATGCATGAGAAAGCAATTAGTTATC
TTCGGAAATGTGAGGACAAGCTTTGGGCTTCCTCTGGAGCCGAGGAGGGAAGGCCCATCGGTTCAGATGATCCGTCACAAATATATATCTCTAAAACGGAATCTTGCATC
TCGCTGCTTTCCACTCTCATTTCCTTTCTTTTCTTGTTCGAACGTCTCCCTAAATTCTCTAAACCCCCAACTAAAATGGCTAGAGATCGAAATGGTGAGCTATCTACTAA
GAGAGGAAGTTCCGATAACGACGAGCCCGGGGAGCGAGAAGGAGAAGACAACGGTGCTCCCTCAAAGAGAAGCTCCGGTAGGAATTCCGAGAAAGATCCTGAAAATTCGG
GCTCCGATGTGGACGAAGGACGAAAACGCAGCAAATCAAGAAGAAAATCACGAGACAGTTCGGATTCCGATGAGAAACATGACAAGAAGCGTGGAAAGAGTTCGAGAAGA
AAGGGCAAATCCCGTAGACGATACAGTAGTAGTGAGGAGGATTCTGATTCCGAGGATACCGAATCTGATAGCTCGATGTATGATTCGGATTCGGGGCACTCCGACGCGGA
ATCGGCTTCTAATAGTTCGGGTTCGGAAGGCGATAGCGAGAGCGAGGAGGAGAGGAGGAGGAGGAAGAGGAAGGAAAGGAGGAGGAGAAGAGACAAGGAGAGAGAGCGGA
AGAGGAGGAGAAAAGAGAAGGAGAAGAAGAGGAGGAAAAAAGAGAGGGAGGAAGAGAAAAGAAGGAAGGAGAAGAAAAAGAAGGAGAAGAAGGAACGAGGAAAGAAAGGG
GCAGTGACGAATTCGTGGGGAAAGTACGGGATTATCAAAGAAACTGATATGTGGAACAAACGACCGGAGTTCACTGCATGGTTGGCTGAAATTAAAAAGGTGAACTTGGA
AAGCCTAGCCAATTGGGAAGAGAAGCAAATGTTTAAGGAATTCATGGAGGACCATAACACGGCCACTTTTCCATCCAAAAAGTACTACAATCTTGATGCCTATTACCAGC
GTAAAATACAAAAAGATATGAAGAAAGGCCACAAAAAGGTTTTGGCAGGGGAACGCACTGTGTTTGATGATGAAGAACAGAGGAGGCAAGAACTACTCATAGAACGTGAA
AAGCACAAGGAAGAACAAGTGGAAGCCTTGAAGCGCTCTATGCAGACTGGAATGGCCCAAGCAATGAAAGAACAAGCTCGACTCAGGGAGGAGATGGCTTACCAGTATAA
ACTTGGAAACTTTGAGGTCAGCGTCCCTCCCCACCCATCACCAGCTACTGCATCCGCCAACGCACACAGCAGTCCGCTGCCGCTCCTACCTGAAGCAGTCTCAGCGTCTG
CCCCCATCCTCCCAACGAACTCGCCTCTGCTCTTCCCTCTCACGCTCTCTCGCCTCTCTGTGCAATCCGAGTTACTTAACTCTAAGGATATGGGAGGCAGAGGAGTCATT
GGTGACAAATGGTCCATGAGGGTTCTCTGGGTCTGTGCTCTAGGAAGTGCAATCGGTTTGTATATGGTTGCTCAAGAAAGACAATTACAAAACAGGCAAAGAATGTTGGC
TGAGAGTCTCAAAGATGCAGAATCAGGAAGCAGTGGTGAAAATGTGTAG
mRNA sequence
ATGGCGGCGGCTTCAGCGATTTTCAAAATCTTCAGGAGGTCATCTTCAGGCTGCACTAGAACAGCAAGAACAGAGACAGATGCATTTTATTTTGTGGTGTTGAGATTATA
CAGCGCAAGACGAAGCTGCGATCGAAGAAACCTCTTTGCGAGGATTAGTCCTCTTGGTGACCCTGAGCTTAGTGTAGTGCCGGTTCTTAATCAATGGATTGAGGAAGGCA
GGAAGATCAAGGATTTTGAGCTCCGGAGAATCGTTCGCGACCTTCGTACTTCCCGGCGGTATGGGCAAGCCCTTGAGGTGAGCGCAATTGAAAAGCACGCCCTAATTTCC
CGTGAAAAGAATGCTTGTATTGTGTCTGAATGGATGTGTAGCAAGGGACTTTTTTCCCTCACAACAAGAGACTTTGCCGTGCAGCTTGATTTGATTGGCCGAGTTCGTGG
GTTGGATTCTGCAGAGAAGTATTTTGGCAGTGTTTCTAACCAAGAGGAAATTGGTAAACTCTATGGTGCTCTTCTAAATTGTTATGTCAGGGAAGGCCTTGTAGATAAGT
CCCTTGCTCATATGCAGAAGATGAAGGAGATGGGTTTTGCTTCCTCTCCCCTCTGCTACAATGATATAATGTGTCTATATTTGAAAACTGGCCAGGCTGATAAGGTTCCA
AATGTACTGTCTGAAATGAAGGAGAATGGTGTTCTTCCTGACAATTTTAGCTATAGAATTTGTATCAGTTCTTATGGAGCTAAGTCTGATGTAATCAGTATGGAAAAGGT
CTTGAAAGAAATGGAGAGTCAAACTCACATATCCATGGATTGGACTACTTATTCAATGGTTGCCAGTTTTTTCATCAAAGCCAGTATGCATGAGAAAGCAATTAGTTATC
TTCGGAAATGTGAGGACAAGCTTTGGGCTTCCTCTGGAGCCGAGGAGGGAAGGCCCATCGGTTCAGATGATCCGTCACAAATATATATCTCTAAAACGGAATCTTGCATC
TCGCTGCTTTCCACTCTCATTTCCTTTCTTTTCTTGTTCGAACGTCTCCCTAAATTCTCTAAACCCCCAACTAAAATGGCTAGAGATCGAAATGGTGAGCTATCTACTAA
GAGAGGAAGTTCCGATAACGACGAGCCCGGGGAGCGAGAAGGAGAAGACAACGGTGCTCCCTCAAAGAGAAGCTCCGGTAGGAATTCCGAGAAAGATCCTGAAAATTCGG
GCTCCGATGTGGACGAAGGACGAAAACGCAGCAAATCAAGAAGAAAATCACGAGACAGTTCGGATTCCGATGAGAAACATGACAAGAAGCGTGGAAAGAGTTCGAGAAGA
AAGGGCAAATCCCGTAGACGATACAGTAGTAGTGAGGAGGATTCTGATTCCGAGGATACCGAATCTGATAGCTCGATGTATGATTCGGATTCGGGGCACTCCGACGCGGA
ATCGGCTTCTAATAGTTCGGGTTCGGAAGGCGATAGCGAGAGCGAGGAGGAGAGGAGGAGGAGGAAGAGGAAGGAAAGGAGGAGGAGAAGAGACAAGGAGAGAGAGCGGA
AGAGGAGGAGAAAAGAGAAGGAGAAGAAGAGGAGGAAAAAAGAGAGGGAGGAAGAGAAAAGAAGGAAGGAGAAGAAAAAGAAGGAGAAGAAGGAACGAGGAAAGAAAGGG
GCAGTGACGAATTCGTGGGGAAAGTACGGGATTATCAAAGAAACTGATATGTGGAACAAACGACCGGAGTTCACTGCATGGTTGGCTGAAATTAAAAAGGTGAACTTGGA
AAGCCTAGCCAATTGGGAAGAGAAGCAAATGTTTAAGGAATTCATGGAGGACCATAACACGGCCACTTTTCCATCCAAAAAGTACTACAATCTTGATGCCTATTACCAGC
GTAAAATACAAAAAGATATGAAGAAAGGCCACAAAAAGGTTTTGGCAGGGGAACGCACTGTGTTTGATGATGAAGAACAGAGGAGGCAAGAACTACTCATAGAACGTGAA
AAGCACAAGGAAGAACAAGTGGAAGCCTTGAAGCGCTCTATGCAGACTGGAATGGCCCAAGCAATGAAAGAACAAGCTCGACTCAGGGAGGAGATGGCTTACCAGTATAA
ACTTGGAAACTTTGAGGTCAGCGTCCCTCCCCACCCATCACCAGCTACTGCATCCGCCAACGCACACAGCAGTCCGCTGCCGCTCCTACCTGAAGCAGTCTCAGCGTCTG
CCCCCATCCTCCCAACGAACTCGCCTCTGCTCTTCCCTCTCACGCTCTCTCGCCTCTCTGTGCAATCCGAGTTACTTAACTCTAAGGATATGGGAGGCAGAGGAGTCATT
GGTGACAAATGGTCCATGAGGGTTCTCTGGGTCTGTGCTCTAGGAAGTGCAATCGGTTTGTATATGGTTGCTCAAGAAAGACAATTACAAAACAGGCAAAGAATGTTGGC
TGAGAGTCTCAAAGATGCAGAATCAGGAAGCAGTGGTGAAAATGTGTAG
Protein sequence
MAAASAIFKIFRRSSSGCTRTARTETDAFYFVVLRLYSARRSCDRRNLFARISPLGDPELSVVPVLNQWIEEGRKIKDFELRRIVRDLRTSRRYGQALEVSAIEKHALIS
REKNACIVSEWMCSKGLFSLTTRDFAVQLDLIGRVRGLDSAEKYFGSVSNQEEIGKLYGALLNCYVREGLVDKSLAHMQKMKEMGFASSPLCYNDIMCLYLKTGQADKVP
NVLSEMKENGVLPDNFSYRICISSYGAKSDVISMEKVLKEMESQTHISMDWTTYSMVASFFIKASMHEKAISYLRKCEDKLWASSGAEEGRPIGSDDPSQIYISKTESCI
SLLSTLISFLFLFERLPKFSKPPTKMARDRNGELSTKRGSSDNDEPGEREGEDNGAPSKRSSGRNSEKDPENSGSDVDEGRKRSKSRRKSRDSSDSDEKHDKKRGKSSRR
KGKSRRRYSSSEEDSDSEDTESDSSMYDSDSGHSDAESASNSSGSEGDSESEEERRRRKRKERRRRRDKERERKRRRKEKEKKRRKKEREEEKRRKEKKKKEKKERGKKG
AVTNSWGKYGIIKETDMWNKRPEFTAWLAEIKKVNLESLANWEEKQMFKEFMEDHNTATFPSKKYYNLDAYYQRKIQKDMKKGHKKVLAGERTVFDDEEQRRQELLIERE
KHKEEQVEALKRSMQTGMAQAMKEQARLREEMAYQYKLGNFEVSVPPHPSPATASANAHSSPLPLLPEAVSASAPILPTNSPLLFPLTLSRLSVQSELLNSKDMGGRGVI
GDKWSMRVLWVCALGSAIGLYMVAQERQLQNRQRMLAESLKDAESGSSGENV
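
The protein sequence corresponds to the CDS read codon by codon. The sketch below illustrates this with the standard genetic code on the first and last few codons of the CDS above. The function name `translate` is hypothetical, and the code assumes an uppercase DNA string with no ambiguity codes.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a DNA coding sequence; '*' marks a stop codon."""
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First six codons and last four codons of the CDS listed above.
print(translate("ATGGCGGCGGCTTCAGCG"))
print(translate("GAAAATGTGTAG"))
```

The two fragments translate to `MAAASA` and `ENV*`, matching the start and end of the protein sequence, with the final `TAG` read as the stop codon.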