; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc02G02850 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc02G02850
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionPentatricopeptide repeat-containing protein
Genome locationClcChr02:2521241..2523976
RNA-Seq ExpressionClc02G02850
SyntenyClc02G02850
Gene Ontology termsGO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR032867 - DYW domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0059191.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]0.0e+0091.04Show/hide
Query:  MLSLTLSHSLQPFLPRDLHFPLQNCVTERESKQLHSLSLKTGSLNHPSVSSRLLALYADPRINNLEYAQSLFDWIPKPTLVSWNMLIKCYIENQRSNDAI
        MLSLTLSHSLQPFLP +LHFPLQNC TERE+ QLH+LS+KT SLNHPSVSS LLALYA P INNL+YAQSLFDWI KPTLVSWN+LIKCYIENQRSNDAI
Subjt:  MLSLTLSHSLQPFLPRDLHFPLQNCVTERESKQLHSLSLKTGSLNHPSVSSRLLALYADPRINNLEYAQSLFDWIPKPTLVSWNMLIKCYIENQRSNDAI

Query:  ALFCKLLCELLPDSFTLPCVLKGCARLTALQEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIEMAL
        ALFCKLLC+ +PDSFTLPCVLKGCARL ALQEGKQIHGL+LKIGFGVDKFVLSSLVSMYSKCGEIE+CRKVFDRMEDKD+VSWNSLIDGYARCGEIE+AL
Subjt:  ALFCKLLCELLPDSFTLPCVLKGCARLTALQEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIEMAL

Query:  ELFEEMPERDSFSWTILVDGLSKSGKLETARDVFNRMPIRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVTWNSMITGYELNKQFTQALKLFEIML
        E+FEEMPE+DSFSWTIL+DGLSKSGKLE ARDVF+RMPIRNSVSWNAMINGYMKAGD NTA+ELFDQMPER+LVTWNSMITGYE NKQFT+ALKLFE+ML
Subjt:  ELFEEMPERDSFSWTILVDGLSKSGKLETARDVFNRMPIRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVTWNSMITGYELNKQFTQALKLFEIML

Query:  REDISPNYATILGALSAASGLVSLGKGRWVHSYMVKNGFKTDGVLGTLLIEMYSKCGSVKSALRVFQSVPKKKLGHWTAIIVGLGMHGLVEQALELFDDM
        REDISPNY TILGA+SAASGLVSLG GRWVHSY+VKNGFKTDGVLGTLLIEMYSKCGSVKSALRVF+ + KKKLGHWT+IIVGLGMHGLVEQ LELFD+M
Subjt:  REDISPNYATILGALSAASGLVSLGKGRWVHSYMVKNGFKTDGVLGTLLIEMYSKCGSVKSALRVFQSVPKKKLGHWTAIIVGLGMHGLVEQALELFDDM

Query:  CRTGLQPHAITFIGVLNACSHAGFAEDAHRYFKMMTDDYGIKPSIEHYGCLIDVLCRAGYLEEAKDTIERMPTKANKVIWTSLLSASRKHGNIRMGEYAA
        CRTGLQPHAITFIGVLNACSHAGFAEDAHRYFKMMT DYGIKP+IEHYGCLIDVLCRAGYLEEAKDTI+RMP KANKVIWTSLLS SRKHGNIRMGEYAA
Subjt:  CRTGLQPHAITFIGVLNACSHAGFAEDAHRYFKMMTDDYGIKPSIEHYGCLIDVLCRAGYLEEAKDTIERMPTKANKVIWTSLLSASRKHGNIRMGEYAA

Query:  HHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREMMKRKGIRKDPGCSSIEHQGSIHEFIVGDKSHPQTEEIYIKLCEMKEKLNVVGHVPDTTQVLLCL
         HLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREMMK K IRKDPGCSSIEHQGSIHEFIVGDKSHPQTEEIYIKL EMK+KLNV GH+PDT+QVLLCL
Subjt:  HHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREMMKRKGIRKDPGCSSIEHQGSIHEFIVGDKSHPQTEEIYIKLCEMKEKLNVVGHVPDTTQVLLCL

Query:  EEDNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVAKLVSHIYNHEIIIRDGSRFHHFKSGSCSCKDFW
        EEDNEKEAELETHSERLAIAFGLL+IKHGSPIRIIKNLRICNDCHAV KL+SHIYN EIIIRDGSRFHHFKSGSCSCKDFW
Subjt:  EEDNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVAKLVSHIYNHEIIIRDGSRFHHFKSGSCSCKDFW

TYK19328.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]0.0e+0091.34Show/hide
Query:  MLSLTLSHSLQPFLPRDLHFPLQNCVTERESKQLHSLSLKTGSLNHPSVSSRLLALYADPRINNLEYAQSLFDWIPKPTLVSWNMLIKCYIENQRSNDAI
        MLSLTLSHSLQPFLP +LHFPLQNC TERE+ QLH+LS+KT SLNHPSVSS LLALYA P INNL+YAQSLFDWI KPTLVSWN+LIKCYIENQRSNDAI
Subjt:  MLSLTLSHSLQPFLPRDLHFPLQNCVTERESKQLHSLSLKTGSLNHPSVSSRLLALYADPRINNLEYAQSLFDWIPKPTLVSWNMLIKCYIENQRSNDAI

Query:  ALFCKLLCELLPDSFTLPCVLKGCARLTALQEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIEMAL
        ALFCKLLC+ +PDSFTLPCVLKGCARL ALQEGKQIHGL+LKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKD+VSWNSLIDGYARCGEIE+AL
Subjt:  ALFCKLLCELLPDSFTLPCVLKGCARLTALQEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIEMAL

Query:  ELFEEMPERDSFSWTILVDGLSKSGKLETARDVFNRMPIRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVTWNSMITGYELNKQFTQALKLFEIML
        E+FEEMPE+DSFSWTIL+DGLSKSGKLE AR VF+RMPIRNSVSWNAMINGYMKAGD NTA+ELFDQMPER+LVTWNSMITGYE NKQFT+ALKLFE+ML
Subjt:  ELFEEMPERDSFSWTILVDGLSKSGKLETARDVFNRMPIRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVTWNSMITGYELNKQFTQALKLFEIML

Query:  REDISPNYATILGALSAASGLVSLGKGRWVHSYMVKNGFKTDGVLGTLLIEMYSKCGSVKSALRVFQSVPKKKLGHWTAIIVGLGMHGLVEQALELFDDM
        REDISPNY TILGA+SAASGLVSLG GRWVHSY+VKNGFKTDGVLGTLLIEMYSKCGSVKSALRVF+ + KKKLGHWT+IIVGLGMHGLVEQ LELFD+M
Subjt:  REDISPNYATILGALSAASGLVSLGKGRWVHSYMVKNGFKTDGVLGTLLIEMYSKCGSVKSALRVFQSVPKKKLGHWTAIIVGLGMHGLVEQALELFDDM

Query:  CRTGLQPHAITFIGVLNACSHAGFAEDAHRYFKMMTDDYGIKPSIEHYGCLIDVLCRAGYLEEAKDTIERMPTKANKVIWTSLLSASRKHGNIRMGEYAA
        CRTGLQPHAITFIGVLNACSHAGFAEDAHRYFKMMT DYGIKP+IEHYGCLIDVLCRAGYLEEAKDTIERMP KANKVIWTSLLS SRKHGNIRMGEYAA
Subjt:  CRTGLQPHAITFIGVLNACSHAGFAEDAHRYFKMMTDDYGIKPSIEHYGCLIDVLCRAGYLEEAKDTIERMPTKANKVIWTSLLSASRKHGNIRMGEYAA

Query:  HHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREMMKRKGIRKDPGCSSIEHQGSIHEFIVGDKSHPQTEEIYIKLCEMKEKLNVVGHVPDTTQVLLCL
         HLIDLAPDTTGCYVILSNMYAA GLWEKVRQVREMMK KGIRKDPGCSSIEHQGSIHEFIVGDKSHPQTEEIYIKL EMK+KLNV GH+PDT+QVLLCL
Subjt:  HHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREMMKRKGIRKDPGCSSIEHQGSIHEFIVGDKSHPQTEEIYIKLCEMKEKLNVVGHVPDTTQVLLCL

Query:  EEDNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVAKLVSHIYNHEIIIRDGSRFHHFKSGSCSCKDFW
        EEDNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAV KL+SHIYN EIIIRDGSRFHHFKSGSCSCKDFW
Subjt:  EEDNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVAKLVSHIYNHEIIIRDGSRFHHFKSGSCSCKDFW

XP_004144616.1 pentatricopeptide repeat-containing protein At5g48910 [Cucumis sativus]0.0e+0091.34Show/hide
Query:  MLSLTLSHSLQPFLPRDLHFPLQNCVTERESKQLHSLSLKTGSLNHPSVSSRLLALYADPRINNLEYAQSLFDWIPKPTLVSWNMLIKCYIENQRSNDAI
        MLS TLSHSLQPFLP +LHFPLQNC TERE+ QLH+LS+KT SLNHPSVSSRLLALYADPRINNL+YA SLFDWI +PTLVSWN+LIKCYIENQRSNDAI
Subjt:  MLSLTLSHSLQPFLPRDLHFPLQNCVTERESKQLHSLSLKTGSLNHPSVSSRLLALYADPRINNLEYAQSLFDWIPKPTLVSWNMLIKCYIENQRSNDAI

Query:  ALFCKLLCELLPDSFTLPCVLKGCARLTALQEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIEMAL
        ALFCKLLC+ +PDSFTLPCVLKGCARL ALQEGKQIHGL+LKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKD+VSWNSLIDGYARCGEIE+AL
Subjt:  ALFCKLLCELLPDSFTLPCVLKGCARLTALQEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIEMAL

Query:  ELFEEMPERDSFSWTILVDGLSKSGKLETARDVFNRMPIRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVTWNSMITGYELNKQFTQALKLFEIML
        E+FEEMPE+DSFSWTIL+DGLSKSGKLE ARDVF+RMPIRNSVSWNAMINGYMKAGD NTA+ELFDQMPER+LVTWNSMITGYE NKQFT+ALKLFE+ML
Subjt:  ELFEEMPERDSFSWTILVDGLSKSGKLETARDVFNRMPIRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVTWNSMITGYELNKQFTQALKLFEIML

Query:  REDISPNYATILGALSAASGLVSLGKGRWVHSYMVKNGFKTDGVLGTLLIEMYSKCGSVKSALRVFQSVPKKKLGHWTAIIVGLGMHGLVEQALELFDDM
        REDISPNY TILGA+SAASG+VSLG GRWVHSY+VK+GFKTDGVLGTLLIEMYSKCGSVKSALRVF+S+PKKKLGHWT++IVGLGMHGLVEQ LELFD+M
Subjt:  REDISPNYATILGALSAASGLVSLGKGRWVHSYMVKNGFKTDGVLGTLLIEMYSKCGSVKSALRVFQSVPKKKLGHWTAIIVGLGMHGLVEQALELFDDM

Query:  CRTGLQPHAITFIGVLNACSHAGFAEDAHRYFKMMTDDYGIKPSIEHYGCLIDVLCRAGYLEEAKDTIERMPTKANKVIWTSLLSASRKHGNIRMGEYAA
        CRTGL+PHAITFIGVLNACSHAGFAEDAHRYFKMMT DYGIKPSIEHYGCLIDVLCRAG+LEEAKDTIERMP KANKVIWTSLLS SRKHGNIRMGEYAA
Subjt:  CRTGLQPHAITFIGVLNACSHAGFAEDAHRYFKMMTDDYGIKPSIEHYGCLIDVLCRAGYLEEAKDTIERMPTKANKVIWTSLLSASRKHGNIRMGEYAA

Query:  HHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREMMKRKGIRKDPGCSSIEHQGSIHEFIVGDKSHPQTEEIYIKLCEMKEKLNVVGHVPDTTQVLLCL
         HLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREMMK+KG++KDPGCSSIEHQGSIHEFIVGDKSHPQTEEIYIKLCEMK+KLNV GH+PDTTQVLLCL
Subjt:  HHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREMMKRKGIRKDPGCSSIEHQGSIHEFIVGDKSHPQTEEIYIKLCEMKEKLNVVGHVPDTTQVLLCL

Query:  EEDNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVAKLVSHIYNHEIIIRDGSRFHHFKSGSCSCKDFW
        EEDNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAV KL+SHIYN EIIIRDGSRFHHFKSGSCSCKDFW
Subjt:  EEDNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVAKLVSHIYNHEIIIRDGSRFHHFKSGSCSCKDFW

XP_008462083.1 PREDICTED: pentatricopeptide repeat-containing protein At5g48910-like [Cucumis melo]0.0e+0091.19Show/hide
Query:  MLSLTLSHSLQPFLPRDLHFPLQNCVTERESKQLHSLSLKTGSLNHPSVSSRLLALYADPRINNLEYAQSLFDWIPKPTLVSWNMLIKCYIENQRSNDAI
        MLSLTLSHSLQPFLP +LHFPLQNC TERE+ QLH+LS+KT SLNHPSVSS LLALYA P INNL+YAQSLFDWI KPTLVSWN+LIKCYIENQRSNDAI
Subjt:  MLSLTLSHSLQPFLPRDLHFPLQNCVTERESKQLHSLSLKTGSLNHPSVSSRLLALYADPRINNLEYAQSLFDWIPKPTLVSWNMLIKCYIENQRSNDAI

Query:  ALFCKLLCELLPDSFTLPCVLKGCARLTALQEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIEMAL
        ALFCKLLC+ +PDSFTLPCVLKGCARL ALQEGKQIHGL+LKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKD+VSWNSLIDGYARCGEIE+AL
Subjt:  ALFCKLLCELLPDSFTLPCVLKGCARLTALQEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIEMAL

Query:  ELFEEMPERDSFSWTILVDGLSKSGKLETARDVFNRMPIRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVTWNSMITGYELNKQFTQALKLFEIML
        E+FEEMPE+DSFSWTIL+DGLSKSGKLE AR VF+RMPIRNSVSWNAMINGYMKAG  NTA+ELFDQMPER+LVTWNSMITGYE NKQFT+ALKLFE+ML
Subjt:  ELFEEMPERDSFSWTILVDGLSKSGKLETARDVFNRMPIRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVTWNSMITGYELNKQFTQALKLFEIML

Query:  REDISPNYATILGALSAASGLVSLGKGRWVHSYMVKNGFKTDGVLGTLLIEMYSKCGSVKSALRVFQSVPKKKLGHWTAIIVGLGMHGLVEQALELFDDM
        REDISPNY TILGA+SAASGLVSLG GRWVHSY+VKNGFKTDGVLGTLLIEMYSKCGSVKSALRVF+ + KKKLGHWT+IIVGLGMHGLVEQ LELFD+M
Subjt:  REDISPNYATILGALSAASGLVSLGKGRWVHSYMVKNGFKTDGVLGTLLIEMYSKCGSVKSALRVFQSVPKKKLGHWTAIIVGLGMHGLVEQALELFDDM

Query:  CRTGLQPHAITFIGVLNACSHAGFAEDAHRYFKMMTDDYGIKPSIEHYGCLIDVLCRAGYLEEAKDTIERMPTKANKVIWTSLLSASRKHGNIRMGEYAA
        CRTGLQPHAITFIGVLNACSHAGFAEDAHRYFKMMT DYGIKP+IEHYGCLIDVLCRAGYLEEAKDTIERMP KANKVIWTSLLS SRKHGNIRMGEYAA
Subjt:  CRTGLQPHAITFIGVLNACSHAGFAEDAHRYFKMMTDDYGIKPSIEHYGCLIDVLCRAGYLEEAKDTIERMPTKANKVIWTSLLSASRKHGNIRMGEYAA

Query:  HHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREMMKRKGIRKDPGCSSIEHQGSIHEFIVGDKSHPQTEEIYIKLCEMKEKLNVVGHVPDTTQVLLCL
         HLIDLAPDTTGCYVILSNMYAA GLWEKVRQVREMMK KGIRKDPGCSSIEHQGSIHEFIVGDKSHPQTEEIYIKL EMK+KLNV GH+PDT+QVLLCL
Subjt:  HHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREMMKRKGIRKDPGCSSIEHQGSIHEFIVGDKSHPQTEEIYIKLCEMKEKLNVVGHVPDTTQVLLCL

Query:  EEDNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVAKLVSHIYNHEIIIRDGSRFHHFKSGSCSCKDFW
        EEDNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAV KL+SHIYN EIIIRDGSRFHHFKSGSCSCKDFW
Subjt:  EEDNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVAKLVSHIYNHEIIIRDGSRFHHFKSGSCSCKDFW

XP_038887831.1 pentatricopeptide repeat-containing protein At5g48910-like [Benincasa hispida]0.0e+0091.92Show/hide
Query:  MLSLTLSHSLQPFLPRDLHFPLQNCVTERESKQLHSLSLKTGSLNHPSVSSRLLALYADPRINNLEYAQSLFDWIPKPTLVSWNMLIKCYIENQRSNDAI
        ML+LTLSHSLQPF+PR+LHFPLQNC TERE+KQLH+LSLK GSLNHPSVSSRLLALYADPRINNLEYAQSLFDWI KPTLVSWN+LIKCYIE+QRSNDAI
Subjt:  MLSLTLSHSLQPFLPRDLHFPLQNCVTERESKQLHSLSLKTGSLNHPSVSSRLLALYADPRINNLEYAQSLFDWIPKPTLVSWNMLIKCYIENQRSNDAI

Query:  ALFCKLLCELLPDSFTLPCVLKGCARLTALQEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIEMAL
        ALFCK LCE LPDSFTLPCVLKGC+RL ALQEGKQIHGL+LKIGFGVDKFVLSSLVSMY+KCGEIELCRKVFDRMED+DIVSWNSLIDGYARCGEIE+AL
Subjt:  ALFCKLLCELLPDSFTLPCVLKGCARLTALQEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIEMAL

Query:  ELFEEMPERDSFSWTILVDGLSKSGKLETARDVFNRMPIRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVTWNSMITGYELNKQFTQALKLFEIML
        +L EEMPE+DS SWTILVDGLSKSGKLE ARDVF++MP RNSVSWNAMINGYMKAG+FNTARELFDQMPERNLVTWNSMI+GYELNKQFTQALKL E ML
Subjt:  ELFEEMPERDSFSWTILVDGLSKSGKLETARDVFNRMPIRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVTWNSMITGYELNKQFTQALKLFEIML

Query:  REDISPNYATILGALSAASGLVSLGKGRWVHSYMVKNGFKTDGVLGTLLIEMYSKCGSVKSALRVFQSVPKKKLGHWTAIIVGLGMHGLVEQALELFDDM
        REDISPNY TILGALSAASGLVSLGKGRWVHSY+VKNGF+T+GVLGT LIEMYSKCGSV+SAL VFQS+P+KKLGHWTAIIVGLGMHGLVEQ LELFD+M
Subjt:  REDISPNYATILGALSAASGLVSLGKGRWVHSYMVKNGFKTDGVLGTLLIEMYSKCGSVKSALRVFQSVPKKKLGHWTAIIVGLGMHGLVEQALELFDDM

Query:  CRTGLQPHAITFIGVLNACSHAGFAEDAHRYFKMMTDDYGIKPSIEHYGCLIDVLCRAGYLEEAKDTIERMPTKANKVIWTSLLSASRKHGNIRMGEYAA
        CRTGLQPHAITFIGVLNACSHAGFAEDAHRYFKMMTDDYGIKPSIEHYGCLIDVLCRAGYLEEAKDTIERMP KANKVIW SLLS SRKHGNIRMGEYAA
Subjt:  CRTGLQPHAITFIGVLNACSHAGFAEDAHRYFKMMTDDYGIKPSIEHYGCLIDVLCRAGYLEEAKDTIERMPTKANKVIWTSLLSASRKHGNIRMGEYAA

Query:  HHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREMMKRKGIRKDPGCSSIEHQGSIHEFIVGDKSHPQTEEIYIKLCEMKEKLNVVGHVPDTTQVLLCL
        HHLIDLAPDTTGCYVILSNMYAAAGLWEKV QVREMMK+KGIRKDPGCSSIEHQGS+HEFIVGDKSHPQT+EIYIKLCEMKEKL+  GHVPDTTQVLLCL
Subjt:  HHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREMMKRKGIRKDPGCSSIEHQGSIHEFIVGDKSHPQTEEIYIKLCEMKEKLNVVGHVPDTTQVLLCL

Query:  EEDNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVAKLVSHIYNHEIIIRDGSRFHHFKSGSCSCKDFW
        EEDN+KEAELETHSERLAIAFGLLNI HGSPIRIIKNLRICNDCHAV KLVSHIYN EIIIRDGSRFHHFKSGSCSCKDFW
Subjt:  EEDNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVAKLVSHIYNHEIIIRDGSRFHHFKSGSCSCKDFW

TrEMBL top hitse value%identityAlignment
A0A1S3CG66 pentatricopeptide repeat-containing protein At5g48910-like0.0e+0091.19Show/hide
Query:  MLSLTLSHSLQPFLPRDLHFPLQNCVTERESKQLHSLSLKTGSLNHPSVSSRLLALYADPRINNLEYAQSLFDWIPKPTLVSWNMLIKCYIENQRSNDAI
        MLSLTLSHSLQPFLP +LHFPLQNC TERE+ QLH+LS+KT SLNHPSVSS LLALYA P INNL+YAQSLFDWI KPTLVSWN+LIKCYIENQRSNDAI
Subjt:  MLSLTLSHSLQPFLPRDLHFPLQNCVTERESKQLHSLSLKTGSLNHPSVSSRLLALYADPRINNLEYAQSLFDWIPKPTLVSWNMLIKCYIENQRSNDAI

Query:  ALFCKLLCELLPDSFTLPCVLKGCARLTALQEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIEMAL
        ALFCKLLC+ +PDSFTLPCVLKGCARL ALQEGKQIHGL+LKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKD+VSWNSLIDGYARCGEIE+AL
Subjt:  ALFCKLLCELLPDSFTLPCVLKGCARLTALQEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIEMAL

Query:  ELFEEMPERDSFSWTILVDGLSKSGKLETARDVFNRMPIRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVTWNSMITGYELNKQFTQALKLFEIML
        E+FEEMPE+DSFSWTIL+DGLSKSGKLE AR VF+RMPIRNSVSWNAMINGYMKAG  NTA+ELFDQMPER+LVTWNSMITGYE NKQFT+ALKLFE+ML
Subjt:  ELFEEMPERDSFSWTILVDGLSKSGKLETARDVFNRMPIRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVTWNSMITGYELNKQFTQALKLFEIML

Query:  REDISPNYATILGALSAASGLVSLGKGRWVHSYMVKNGFKTDGVLGTLLIEMYSKCGSVKSALRVFQSVPKKKLGHWTAIIVGLGMHGLVEQALELFDDM
        REDISPNY TILGA+SAASGLVSLG GRWVHSY+VKNGFKTDGVLGTLLIEMYSKCGSVKSALRVF+ + KKKLGHWT+IIVGLGMHGLVEQ LELFD+M
Subjt:  REDISPNYATILGALSAASGLVSLGKGRWVHSYMVKNGFKTDGVLGTLLIEMYSKCGSVKSALRVFQSVPKKKLGHWTAIIVGLGMHGLVEQALELFDDM

Query:  CRTGLQPHAITFIGVLNACSHAGFAEDAHRYFKMMTDDYGIKPSIEHYGCLIDVLCRAGYLEEAKDTIERMPTKANKVIWTSLLSASRKHGNIRMGEYAA
        CRTGLQPHAITFIGVLNACSHAGFAEDAHRYFKMMT DYGIKP+IEHYGCLIDVLCRAGYLEEAKDTIERMP KANKVIWTSLLS SRKHGNIRMGEYAA
Subjt:  CRTGLQPHAITFIGVLNACSHAGFAEDAHRYFKMMTDDYGIKPSIEHYGCLIDVLCRAGYLEEAKDTIERMPTKANKVIWTSLLSASRKHGNIRMGEYAA

Query:  HHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREMMKRKGIRKDPGCSSIEHQGSIHEFIVGDKSHPQTEEIYIKLCEMKEKLNVVGHVPDTTQVLLCL
         HLIDLAPDTTGCYVILSNMYAA GLWEKVRQVREMMK KGIRKDPGCSSIEHQGSIHEFIVGDKSHPQTEEIYIKL EMK+KLNV GH+PDT+QVLLCL
Subjt:  HHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREMMKRKGIRKDPGCSSIEHQGSIHEFIVGDKSHPQTEEIYIKLCEMKEKLNVVGHVPDTTQVLLCL

Query:  EEDNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVAKLVSHIYNHEIIIRDGSRFHHFKSGSCSCKDFW
        EEDNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAV KL+SHIYN EIIIRDGSRFHHFKSGSCSCKDFW
Subjt:  EEDNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVAKLVSHIYNHEIIIRDGSRFHHFKSGSCSCKDFW

A0A5A7V0C2 Pentatricopeptide repeat-containing protein0.0e+0091.04Show/hide
Query:  MLSLTLSHSLQPFLPRDLHFPLQNCVTERESKQLHSLSLKTGSLNHPSVSSRLLALYADPRINNLEYAQSLFDWIPKPTLVSWNMLIKCYIENQRSNDAI
        MLSLTLSHSLQPFLP +LHFPLQNC TERE+ QLH+LS+KT SLNHPSVSS LLALYA P INNL+YAQSLFDWI KPTLVSWN+LIKCYIENQRSNDAI
Subjt:  MLSLTLSHSLQPFLPRDLHFPLQNCVTERESKQLHSLSLKTGSLNHPSVSSRLLALYADPRINNLEYAQSLFDWIPKPTLVSWNMLIKCYIENQRSNDAI

Query:  ALFCKLLCELLPDSFTLPCVLKGCARLTALQEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIEMAL
        ALFCKLLC+ +PDSFTLPCVLKGCARL ALQEGKQIHGL+LKIGFGVDKFVLSSLVSMYSKCGEIE+CRKVFDRMEDKD+VSWNSLIDGYARCGEIE+AL
Subjt:  ALFCKLLCELLPDSFTLPCVLKGCARLTALQEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIEMAL

Query:  ELFEEMPERDSFSWTILVDGLSKSGKLETARDVFNRMPIRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVTWNSMITGYELNKQFTQALKLFEIML
        E+FEEMPE+DSFSWTIL+DGLSKSGKLE ARDVF+RMPIRNSVSWNAMINGYMKAGD NTA+ELFDQMPER+LVTWNSMITGYE NKQFT+ALKLFE+ML
Subjt:  ELFEEMPERDSFSWTILVDGLSKSGKLETARDVFNRMPIRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVTWNSMITGYELNKQFTQALKLFEIML

Query:  REDISPNYATILGALSAASGLVSLGKGRWVHSYMVKNGFKTDGVLGTLLIEMYSKCGSVKSALRVFQSVPKKKLGHWTAIIVGLGMHGLVEQALELFDDM
        REDISPNY TILGA+SAASGLVSLG GRWVHSY+VKNGFKTDGVLGTLLIEMYSKCGSVKSALRVF+ + KKKLGHWT+IIVGLGMHGLVEQ LELFD+M
Subjt:  REDISPNYATILGALSAASGLVSLGKGRWVHSYMVKNGFKTDGVLGTLLIEMYSKCGSVKSALRVFQSVPKKKLGHWTAIIVGLGMHGLVEQALELFDDM

Query:  CRTGLQPHAITFIGVLNACSHAGFAEDAHRYFKMMTDDYGIKPSIEHYGCLIDVLCRAGYLEEAKDTIERMPTKANKVIWTSLLSASRKHGNIRMGEYAA
        CRTGLQPHAITFIGVLNACSHAGFAEDAHRYFKMMT DYGIKP+IEHYGCLIDVLCRAGYLEEAKDTI+RMP KANKVIWTSLLS SRKHGNIRMGEYAA
Subjt:  CRTGLQPHAITFIGVLNACSHAGFAEDAHRYFKMMTDDYGIKPSIEHYGCLIDVLCRAGYLEEAKDTIERMPTKANKVIWTSLLSASRKHGNIRMGEYAA

Query:  HHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREMMKRKGIRKDPGCSSIEHQGSIHEFIVGDKSHPQTEEIYIKLCEMKEKLNVVGHVPDTTQVLLCL
         HLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREMMK K IRKDPGCSSIEHQGSIHEFIVGDKSHPQTEEIYIKL EMK+KLNV GH+PDT+QVLLCL
Subjt:  HHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREMMKRKGIRKDPGCSSIEHQGSIHEFIVGDKSHPQTEEIYIKLCEMKEKLNVVGHVPDTTQVLLCL

Query:  EEDNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVAKLVSHIYNHEIIIRDGSRFHHFKSGSCSCKDFW
        EEDNEKEAELETHSERLAIAFGLL+IKHGSPIRIIKNLRICNDCHAV KL+SHIYN EIIIRDGSRFHHFKSGSCSCKDFW
Subjt:  EEDNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVAKLVSHIYNHEIIIRDGSRFHHFKSGSCSCKDFW

A0A5D3D6Y7 Pentatricopeptide repeat-containing protein0.0e+0091.34Show/hide
Query:  MLSLTLSHSLQPFLPRDLHFPLQNCVTERESKQLHSLSLKTGSLNHPSVSSRLLALYADPRINNLEYAQSLFDWIPKPTLVSWNMLIKCYIENQRSNDAI
        MLSLTLSHSLQPFLP +LHFPLQNC TERE+ QLH+LS+KT SLNHPSVSS LLALYA P INNL+YAQSLFDWI KPTLVSWN+LIKCYIENQRSNDAI
Subjt:  MLSLTLSHSLQPFLPRDLHFPLQNCVTERESKQLHSLSLKTGSLNHPSVSSRLLALYADPRINNLEYAQSLFDWIPKPTLVSWNMLIKCYIENQRSNDAI

Query:  ALFCKLLCELLPDSFTLPCVLKGCARLTALQEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIEMAL
        ALFCKLLC+ +PDSFTLPCVLKGCARL ALQEGKQIHGL+LKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKD+VSWNSLIDGYARCGEIE+AL
Subjt:  ALFCKLLCELLPDSFTLPCVLKGCARLTALQEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIEMAL

Query:  ELFEEMPERDSFSWTILVDGLSKSGKLETARDVFNRMPIRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVTWNSMITGYELNKQFTQALKLFEIML
        E+FEEMPE+DSFSWTIL+DGLSKSGKLE AR VF+RMPIRNSVSWNAMINGYMKAGD NTA+ELFDQMPER+LVTWNSMITGYE NKQFT+ALKLFE+ML
Subjt:  ELFEEMPERDSFSWTILVDGLSKSGKLETARDVFNRMPIRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVTWNSMITGYELNKQFTQALKLFEIML

Query:  REDISPNYATILGALSAASGLVSLGKGRWVHSYMVKNGFKTDGVLGTLLIEMYSKCGSVKSALRVFQSVPKKKLGHWTAIIVGLGMHGLVEQALELFDDM
        REDISPNY TILGA+SAASGLVSLG GRWVHSY+VKNGFKTDGVLGTLLIEMYSKCGSVKSALRVF+ + KKKLGHWT+IIVGLGMHGLVEQ LELFD+M
Subjt:  REDISPNYATILGALSAASGLVSLGKGRWVHSYMVKNGFKTDGVLGTLLIEMYSKCGSVKSALRVFQSVPKKKLGHWTAIIVGLGMHGLVEQALELFDDM

Query:  CRTGLQPHAITFIGVLNACSHAGFAEDAHRYFKMMTDDYGIKPSIEHYGCLIDVLCRAGYLEEAKDTIERMPTKANKVIWTSLLSASRKHGNIRMGEYAA
        CRTGLQPHAITFIGVLNACSHAGFAEDAHRYFKMMT DYGIKP+IEHYGCLIDVLCRAGYLEEAKDTIERMP KANKVIWTSLLS SRKHGNIRMGEYAA
Subjt:  CRTGLQPHAITFIGVLNACSHAGFAEDAHRYFKMMTDDYGIKPSIEHYGCLIDVLCRAGYLEEAKDTIERMPTKANKVIWTSLLSASRKHGNIRMGEYAA

Query:  HHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREMMKRKGIRKDPGCSSIEHQGSIHEFIVGDKSHPQTEEIYIKLCEMKEKLNVVGHVPDTTQVLLCL
         HLIDLAPDTTGCYVILSNMYAA GLWEKVRQVREMMK KGIRKDPGCSSIEHQGSIHEFIVGDKSHPQTEEIYIKL EMK+KLNV GH+PDT+QVLLCL
Subjt:  HHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREMMKRKGIRKDPGCSSIEHQGSIHEFIVGDKSHPQTEEIYIKLCEMKEKLNVVGHVPDTTQVLLCL

Query:  EEDNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVAKLVSHIYNHEIIIRDGSRFHHFKSGSCSCKDFW
        EEDNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAV KL+SHIYN EIIIRDGSRFHHFKSGSCSCKDFW
Subjt:  EEDNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVAKLVSHIYNHEIIIRDGSRFHHFKSGSCSCKDFW

A0A6J1C6D9 pentatricopeptide repeat-containing protein At5g48910-like0.0e+0088.84Show/hide
Query:  MLSLTLSHSLQPFLPRDLHFPLQNCVTERESKQLHSLSLKTGSLNHPSVSSRLLALYADPRINNLEYAQSLFDWIPKPTLVSWNMLIKCYIENQRSNDAI
        MLSL LSHSL PFLPR+LHFPLQNC TERE KQLH+LSLKTGS NHPS+SSRLLALY DPRINNLEYA+SLFDWI +PTLVSWN+L+KCY+ENQRSNDAI
Subjt:  MLSLTLSHSLQPFLPRDLHFPLQNCVTERESKQLHSLSLKTGSLNHPSVSSRLLALYADPRINNLEYAQSLFDWIPKPTLVSWNMLIKCYIENQRSNDAI

Query:  ALFCKLLCELLPDSFTLPCVLKGCARLTALQEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIEMAL
        +LFC+LL E +PDSFTLPCVLKGCARL+AL EGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKD+VSWNSLIDGYARCG+IE+AL
Subjt:  ALFCKLLCELLPDSFTLPCVLKGCARLTALQEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIEMAL

Query:  ELFEEMPERDSFSWTILVDGLSKSGKLETARDVFNRMPIRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVTWNSMITGYELNKQFTQALKLFEIML
        E+F+EMPERDSFSWTILVDGLSKSGKLETARDVF+RMP RNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVTWNSMITGYELN+QF+QALKLFE+ML
Subjt:  ELFEEMPERDSFSWTILVDGLSKSGKLETARDVFNRMPIRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVTWNSMITGYELNKQFTQALKLFEIML

Query:  REDISPNYATILGALSAASGLVSLGKGRWVHSYMVKNGFKTDGVLGTLLIEMYSKCGSVKSALRVFQSVPKKKLGHWTAIIVGLGMHGLVEQALELFDDM
        RE+ISPN+ATILGALSAASGLVS GKGRWVHS++VKNGF+TDGVLGT LIEMYSKCGS+ SALRVF+S+PKKKLGHWTAIIVGLGMHGLV Q LELFD+M
Subjt:  REDISPNYATILGALSAASGLVSLGKGRWVHSYMVKNGFKTDGVLGTLLIEMYSKCGSVKSALRVFQSVPKKKLGHWTAIIVGLGMHGLVEQALELFDDM

Query:  CRTGLQPHAITFIGVLNACSHAGFAEDAHRYFKMMTDDYGIKPSIEHYGCLIDVLCRAGYLEEAKDTIERMPTKANKVIWTSLLSASRKHGNIRMGEYAA
        CR GL+PHAITFIG+LNACSHAGFA+DA+ YFKMM DDYGI+PSIEHYGCLIDVLCRAG LEEAK+TIERMP K NKVIW SLLS SRKHGNIRMGEYAA
Subjt:  CRTGLQPHAITFIGVLNACSHAGFAEDAHRYFKMMTDDYGIKPSIEHYGCLIDVLCRAGYLEEAKDTIERMPTKANKVIWTSLLSASRKHGNIRMGEYAA

Query:  HHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREMMKRKGIRKDPGCSSIEHQGSIHEFIVGDKSHPQTEEIYIKLCEMKEKLNVVGHVPDTTQVLLCL
        HHLIDLAPDTTGCY+ILSNMYA AGLWEKVRQVREMMK+KGIRKDPGCSSIEHQGS+HEFIVGD+SHPQTEEIYIKL EMKEKLNV GHVPDTTQVLLCL
Subjt:  HHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREMMKRKGIRKDPGCSSIEHQGSIHEFIVGDKSHPQTEEIYIKLCEMKEKLNVVGHVPDTTQVLLCL

Query:  EEDNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVAKLVSHIYNHEIIIRDGSRFHHFKSGSCSCKDFW
        E+DNEKE+ELETHSERLAIAFGL+NIKHG+P+RIIKNLRICNDCH V+KL+SHIYN EIIIRDGSRFHHFKSGSCSCKDFW
Subjt:  EEDNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVAKLVSHIYNHEIIIRDGSRFHHFKSGSCSCKDFW

A0A6J1GM70 pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like0.0e+0089.57Show/hide
Query:  MLSLTLSHSLQPFLPRDLHFPLQNCVTERESKQLHSLSLKTGSLNHPSVSSRLLALYADPRINNLEYAQSLFDWIPKPTLVSWNMLIKCYIENQRSNDAI
        M SLTLSHSLQPF P +LHFPLQNC TERE+KQ H+LSLKTGSLNHPS+S RLLALYA+PRINNLEYAQSLFDWI KPTLVSWNMLIKCYIENQRSNDAI
Subjt:  MLSLTLSHSLQPFLPRDLHFPLQNCVTERESKQLHSLSLKTGSLNHPSVSSRLLALYADPRINNLEYAQSLFDWIPKPTLVSWNMLIKCYIENQRSNDAI

Query:  ALFCKLLCELLPDSFTLPCVLKGCARLTALQEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIEMAL
        ALFCKLLCE +PDSFTLPCVLKGCARL+ALQEGKQIHGLILKIG GVDKFVLSSLV+MYSKCGEIELCRKVFDRMEDKD+VSWNSLIDGYARCGEIE+AL
Subjt:  ALFCKLLCELLPDSFTLPCVLKGCARLTALQEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIEMAL

Query:  ELFEEMPERDSFSWTILVDGLSKSGKLETARDVFNRMPIRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVTWNSMITGYELNKQFTQALKLFEIML
        ELF+EMPE+D+FSWTILVDGLSKSGKL+ ARDVF+RMP RNSVSWNAMINGYMKAG FNTARELFD+MPERN V+WNSMITGYELNKQFTQALKLFE+ML
Subjt:  ELFEEMPERDSFSWTILVDGLSKSGKLETARDVFNRMPIRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVTWNSMITGYELNKQFTQALKLFEIML

Query:  REDISPNYATILGALSAASGLVSLGKGRWVHSYMVKNGFKTDGVLGTLLIEMYSKCGSVKSALRVFQSVPKKKLGHWTAIIVGLGMHGLVEQALELFDDM
         EDISPN+AT+LGA SAASGL SLG GRWVHSY+VKN FKTDGVLGT LIEMYSKCGS+K ALRVFQS+PKKKLGHWTAIIVGLGMHGLVEQ LELFD+M
Subjt:  REDISPNYATILGALSAASGLVSLGKGRWVHSYMVKNGFKTDGVLGTLLIEMYSKCGSVKSALRVFQSVPKKKLGHWTAIIVGLGMHGLVEQALELFDDM

Query:  CRTGLQPHAITFIGVLNACSHAGFAEDAHRYFKMMTDDYGIKPSIEHYGCLIDVLCRAGYLEEAKDTIERMPTKANKVIWTSLLSASRKHGNIRMGEYAA
        CRTGL+PHAITFIGVLNACSHAGFA++A RYFK MTDD+GI+PSIEHYGCLID LCRAGYLEEAKDTIERMP KAN VIW SLLS SRKHG+ RMGEYAA
Subjt:  CRTGLQPHAITFIGVLNACSHAGFAEDAHRYFKMMTDDYGIKPSIEHYGCLIDVLCRAGYLEEAKDTIERMPTKANKVIWTSLLSASRKHGNIRMGEYAA

Query:  HHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREMMKRKGIRKDPGCSSIEHQGSIHEFIVGDKSHPQTEEIYIKLCEMKEKLNVVGHVPDTTQVLLCL
        HHL+DLAPDTTGCYVILSNMYAA GLWEKVRQVREMMK+KGIRKDPGCSSIEHQGSIHEFIVGD+SHPQTEEIY+KL EMKEKLNV GHVPDTTQVLLCL
Subjt:  HHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREMMKRKGIRKDPGCSSIEHQGSIHEFIVGDKSHPQTEEIYIKLCEMKEKLNVVGHVPDTTQVLLCL

Query:  EEDNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVAKLVSHIYNHEIIIRDGSRFHHFKSGSCSCKDFW
        EEDNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAV K +S IYN EIIIRDGSRFHHFKSGSCSCKDFW
Subjt:  EEDNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVAKLVSHIYNHEIIIRDGSRFHHFKSGSCSCKDFW

SwissProt top hitse value%identityAlignment
O82380 Pentatricopeptide repeat-containing protein At2g29760, chloroplastic1.4e-15938.89Show/hide
Query:  LQNCVTERESKQLHSLSLKTGSLNHPSVSSRLLALYADPRINNLEYAQSLFDWIPKPTLVSWNMLIKCYIENQRSNDAIALFCKLLCE--LLPDSFTLPC
        ++ CV+ R+ KQ H   ++TG+ + P  +S+L A+ A     +LEYA+ +FD IPKP   +WN LI+ Y        +I  F  ++ E    P+ +T P 
Subjt:  LQNCVTERESKQLHSLSLKTGSLNHPSVSSRLLALYADPRINNLEYAQSLFDWIPKPTLVSWNMLIKCYIENQRSNDAIALFCKLLCE--LLPDSFTLPC

Query:  VLKGCARLTALQEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIEMALELFEEMPERD---------
        ++K  A +++L  G+ +HG+ +K   G D FV +SL+  Y  CG+++   KVF  +++KD+VSWNS+I+G+ + G  + ALELF++M   D         
Subjt:  VLKGCARLTALQEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIEMALELFEEMPERD---------

Query:  ---------------------------SFSWTI---LVDGLSKSGKLETARDVFNRMPIRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVTWNSMI
                                   + + T+   ++D  +K G +E A+ +F+ M  +++V+W  M++GY  + D+  ARE+ + MP++++V WN++I
Subjt:  ---------------------------SFSWTI---LVDGLSKSGKLETARDVFNRMPIRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVTWNSMI

Query:  TGYELNKQFTQALKLF-EIMLREDISPNYATILGALSAASGLVSLGKGRWVHSYMVKNGFKTDGVLGTLLIEMYSKCGSVKSALRVFQSVPKKKLGHWTA
        + YE N +  +AL +F E+ L++++  N  T++  LSA + + +L  GRW+HSY+ K+G + +  + + LI MYSKCG ++ +  VF SV K+ +  W+A
Subjt:  TGYELNKQFTQALKLF-EIMLREDISPNYATILGALSAASGLVSLGKGRWVHSYMVKNGFKTDGVLGTLLIEMYSKCGSVKSALRVFQSVPKKKLGHWTA

Query:  IIVGLGMHGLVEQALELFDDMCRTGLQPHAITFIGVLNACSHAGFAEDAHRYFKMMTDDYGIKPSIEHYGCLIDVLCRAGYLEEAKDTIERMPTKANKVI
        +I GL MHG   +A+++F  M    ++P+ +TF  V  ACSH G  ++A   F  M  +YGI P  +HY C++DVL R+GYLE+A   IE MP   +  +
Subjt:  IIVGLGMHGLVEQALELFDDMCRTGLQPHAITFIGVLNACSHAGFAEDAHRYFKMMTDDYGIKPSIEHYGCLIDVLCRAGYLEEAKDTIERMPTKANKVI

Query:  WTSLLSASRKHGNIRMGEYAAHHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREMMKRKGIRKDPGCSSIEHQGSIHEFIVGDKSHPQTEEIYIKLCE
        W +LL A + H N+ + E A   L++L P   G +V+LSN+YA  G WE V ++R+ M+  G++K+PGCSSIE  G IHEF+ GD +HP +E++Y KL E
Subjt:  WTSLLSASRKHGNIRMGEYAAHHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREMMKRKGIRKDPGCSSIEHQGSIHEFIVGDKSHPQTEEIYIKLCE

Query:  MKEKLNVVGHVPDTTQVLLCLEEDNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVAKLVSHIYNHEIIIRDGSRFHHFKSGSCSCKD
        + EKL   G+ P+ +QVL  +EE+  KE  L  HSE+LAI +GL++ +    IR+IKNLR+C DCH+VAKL+S +Y+ EII+RD  RFHHF++G CSC D
Subjt:  MKEKLNVVGHVPDTTQVLLCLEEDNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVAKLVSHIYNHEIIIRDGSRFHHFKSGSCSCKD

Query:  FW
        FW
Subjt:  FW

Q9FI80 Pentatricopeptide repeat-containing protein At5g489101.5e-15641.75Show/hide
Query:  PRDLHFPLQNCVTERESKQLHSLSLKTGSLNHPSVSSRLLALYA--DPRINNLEYAQSLFDWIPKPTLVSWNMLIKCYIENQRSND--AIALFCKLLCE-
        P  L   + NC T R+  Q+H++ +K+G +     ++ +L   A  D    +L+YA  +F+ +P+    SWN +I+ + E+       AI LF +++ + 
Subjt:  PRDLHFPLQNCVTERESKQLHSLSLKTGSLNHPSVSSRLLALYA--DPRINNLEYAQSLFDWIPKPTLVSWNMLIKCYIENQRSND--AIALFCKLLCE-

Query:  -LLPDSFTLPCVLKGCARLTALQEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIEMALELFEEMPE
         + P+ FT P VLK CA+   +QEGKQIHGL LK GFG D+FV+S+LV MY  CG ++  R +F                              ++ + E
Subjt:  -LLPDSFTLPCVLKGCARLTALQEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIEMALELFEEMPE

Query:  RDSFSWTILVDGLSKSGKLETARDVFNRMPIRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVTWNSMITGYELNKQFTQALKLFEIMLREDISPNY
        +D     ++ D   + G++               V WN MI+GYM+ GD   AR LFD+M +R++V+WN+MI+GY LN  F  A+++F  M + DI PNY
Subjt:  RDSFSWTILVDGLSKSGKLETARDVFNRMPIRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVTWNSMITGYELNKQFTQALKLFEIMLREDISPNY

Query:  ATILGALSAASGLVSLGKGRWVHSYMVKNGFKTDGVLGTLLIEMYSKCGSVKSALRVFQSVPKKKLGHWTAIIVGLGMHGLVEQALELFDDMCRTGLQPH
         T++  L A S L SL  G W+H Y   +G + D VLG+ LI+MYSKCG ++ A+ VF+ +P++ +  W+A+I G  +HG    A++ F  M + G++P 
Subjt:  ATILGALSAASGLVSLGKGRWVHSYMVKNGFKTDGVLGTLLIEMYSKCGSVKSALRVFQSVPKKKLGHWTAIIVGLGMHGLVEQALELFDDMCRTGLQPH

Query:  AITFIGVLNACSHAGFAEDAHRYFKMMTDDYGIKPSIEHYGCLIDVLCRAGYLEEAKDTIERMPTKANKVIWTSLLSASRKHGNIRMGEYAAHHLIDLAP
         + +I +L ACSH G  E+  RYF  M    G++P IEHYGC++D+L R+G L+EA++ I  MP K + VIW +LL A R  GN+ MG+  A+ L+D+ P
Subjt:  AITFIGVLNACSHAGFAEDAHRYFKMMTDDYGIKPSIEHYGCLIDVLCRAGYLEEAKDTIERMPTKANKVIWTSLLSASRKHGNIRMGEYAAHHLIDLAP

Query:  DTTGCYVILSNMYAAAGLWEKVRQVREMMKRKGIRKDPGCSSIEHQGSIHEFIVGDKSHPQTEEIYIKLCEMKEKLNVVGHVPDTTQVLLCLEEDNEKEA
          +G YV LSNMYA+ G W +V ++R  MK K IRKDPGCS I+  G +HEF+V D SHP+ +EI   L E+ +KL + G+ P TTQVLL LEE+ +KE 
Subjt:  DTTGCYVILSNMYAAAGLWEKVRQVREMMKRKGIRKDPGCSSIEHQGSIHEFIVGDKSHPQTEEIYIKLCEMKEKLNVVGHVPDTTQVLLCLEEDNEKEA

Query:  ELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVAKLVSHIYNHEIIIRDGSRFHHFKSGSCSCKDFW
         L  HSE++A AFGL++   G PIRI+KNLRIC DCH+  KL+S +Y  +I +RD  RFHHF+ GSCSC D+W
Subjt:  ELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVAKLVSHIYNHEIIIRDGSRFHHFKSGSCSCKDFW

Q9FJY7 Pentatricopeptide repeat-containing protein At5g665201.4e-14338.82Show/hide
Query:  LQNCVTERESKQLHSLSLKTGSLNHPSVSSRLLALYADPRINN-LEYAQSLFDWIPKPTLVSWNMLIKCYIENQRSNDAIALFCKLLCELLP-DSFTLPC
        LQ C  + E KQ+H+  LKTG +      ++ L+       ++ L YAQ +FD   +P    WN++I+ +  +     ++ L+ ++LC   P +++T P 
Subjt:  LQNCVTERESKQLHSLSLKTGSLNHPSVSSRLLALYADPRINN-LEYAQSLFDWIPKPTLVSWNMLIKCYIENQRSNDAIALFCKLLCELLP-DSFTLPC

Query:  VLKGCARLTALQEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIEMALELFEEMPERDSFSWTILVD
        +LK C+ L+A +E  QIH  I K+G+  D + ++SL++ Y+  G  +L   +FDR+ + D VSWNS+I GY + G++++AL LF +M E           
Subjt:  VLKGCARLTALQEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIEMALELFEEMPERDSFSWTILVD

Query:  GLSKSGKLETARDVFNRMPIRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVTWNSMITGYELNKQFTQALKLFEIMLREDISPNYATILGALSAAS
                            +N++SW  MI+GY++A                            ++NK   +AL+LF  M   D+ P+  ++  ALSA +
Subjt:  GLSKSGKLETARDVFNRMPIRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVTWNSMITGYELNKQFTQALKLFEIMLREDISPNYATILGALSAAS

Query:  GLVSLGKGRWVHSYMVKNGFKTDGVLGTLLIEMYSKCGSVKSALRVFQSVPKKKLGHWTAIIVGLGMHGLVEQALELFDDMCRTGLQPHAITFIGVLNAC
         L +L +G+W+HSY+ K   + D VLG +LI+MY+KCG ++ AL VF+++ KK +  WTA+I G   HG   +A+  F +M + G++P+ ITF  VL AC
Subjt:  GLVSLGKGRWVHSYMVKNGFKTDGVLGTLLIEMYSKCGSVKSALRVFQSVPKKKLGHWTAIIVGLGMHGLVEQALELFDDMCRTGLQPHAITFIGVLNAC

Query:  SHAGFAEDAHRYFKMMTDDYGIKPSIEHYGCLIDVLCRAGYLEEAKDTIERMPTKANKVIWTSLLSASRKHGNIRMGEYAAHHLIDLAPDTTGCYVILSN
        S+ G  E+    F  M  DY +KP+IEHYGC++D+L RAG L+EAK  I+ MP K N VIW +LL A R H NI +GE     LI + P   G YV  +N
Subjt:  SHAGFAEDAHRYFKMMTDDYGIKPSIEHYGCLIDVLCRAGYLEEAKDTIERMPTKANKVIWTSLLSASRKHGNIRMGEYAAHHLIDLAPDTTGCYVILSN

Query:  MYAAAGLWEKVRQVREMMKRKGIRKDPGCSSIEHQGSIHEFIVGDKSHPQTEEIYIKLCEMKEKLNVVGHVPDTTQVLLCLEEDNEKEAELETHSERLAI
        ++A    W+K  + R +MK +G+ K PGCS+I  +G+ HEF+ GD+SHP+ E+I  K   M+ KL   G+VP+  ++LL L +D+E+EA +  HSE+LAI
Subjt:  MYAAAGLWEKVRQVREMMKRKGIRKDPGCSSIEHQGSIHEFIVGDKSHPQTEEIYIKLCEMKEKLNVVGHVPDTTQVLLCLEEDNEKEAELETHSERLAI

Query:  AFGLLNIKHGSPIRIIKNLRICNDCHAVAKLVSHIYNHEIIIRDGSRFHHFKSGSCSCKDFW
         +GL+  K G+ IRI+KNLR+C DCH V KL+S IY  +I++RD +RFHHF+ G CSC D+W
Subjt:  AFGLLNIKHGSPIRIIKNLRICNDCHAVAKLVSHIYNHEIIIRDGSRFHHFKSGSCSCKDFW

Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic7.8e-15840.83Show/hide
Query:  LQNCVTERESKQLHSLSLKTGSLNHPSVSSRLLAL-YADPRINNLEYAQSLFDWIPKPTLVSWNMLIKCYIENQRSNDAIALF-CKLLCELLPDSFTLPC
        L NC T +  + +H+  +K G  N     S+L+      P    L YA S+F  I +P L+ WN + + +  +     A+ L+ C +   LLP+S+T P 
Subjt:  LQNCVTERESKQLHSLSLKTGSLNHPSVSSRLLAL-YADPRINNLEYAQSLFDWIPKPTLVSWNMLIKCYIENQRSNDAIALF-CKLLCELLPDSFTLPC

Query:  VLKGCARLTALQEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIEMALELFEEMPERDSFSWTILVD
        VLK CA+  A +EG+QIHG +LK+G  +D +V +SL+SMY + G +E   KVFD+   +D+VS+ +LI GYA  G IE A +LF+E+P +D  SW  ++ 
Subjt:  VLKGCARLTALQEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIEMALELFEEMPERDSFSWTILVD

Query:  GLSKSGKLETARDVFNRMPIRN-------------------SVS-------W-------------NAMINGYMKAGDFNTARELFDQMPERNLVTWNSMI
        G +++G  + A ++F  M   N                   S+        W             NA+I+ Y K G+  TA  LF+++P +++++WN++I
Subjt:  GLSKSGKLETARDVFNRMPIRN-------------------SVS-------W-------------NAMINGYMKAGDFNTARELFDQMPERNLVTWNSMI

Query:  TGYELNKQFTQALKLFEIMLREDISPNYATILGALSAASGLVSLGKGRWVHSYMVK--NGFKTDGVLGTLLIEMYSKCGSVKSALRVFQSVPKKKLGHWT
         GY     + +AL LF+ MLR   +PN  T+L  L A + L ++  GRW+H Y+ K   G      L T LI+MY+KCG +++A +VF S+  K L  W 
Subjt:  TGYELNKQFTQALKLFEIMLREDISPNYATILGALSAASGLVSLGKGRWVHSYMVK--NGFKTDGVLGTLLIEMYSKCGSVKSALRVFQSVPKKKLGHWT

Query:  AIIVGLGMHGLVEQALELFDDMCRTGLQPHAITFIGVLNACSHAGFAEDAHRYFKMMTDDYGIKPSIEHYGCLIDVLCRAGYLEEAKDTIERMPTKANKV
        A+I G  MHG  + + +LF  M + G+QP  ITF+G+L+ACSH+G  +     F+ MT DY + P +EHYGC+ID+L  +G  +EA++ I  M  + + V
Subjt:  AIIVGLGMHGLVEQALELFDDMCRTGLQPHAITFIGVLNACSHAGFAEDAHRYFKMMTDDYGIKPSIEHYGCLIDVLCRAGYLEEAKDTIERMPTKANKV

Query:  IWTSLLSASRKHGNIRMGEYAAHHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREMMKRKGIRKDPGCSSIEHQGSIHEFIVGDKSHPQTEEIYIKLC
        IW SLL A + HGN+ +GE  A +LI + P+  G YV+LSN+YA+AG W +V + R ++  KG++K PGCSSIE    +HEFI+GDK HP+  EIY  L 
Subjt:  IWTSLLSASRKHGNIRMGEYAAHHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREMMKRKGIRKDPGCSSIEHQGSIHEFIVGDKSHPQTEEIYIKLC

Query:  EMKEKLNVVGHVPDTTQVLLCLEEDNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVAKLVSHIYNHEIIIRDGSRFHHFKSGSCSCK
        EM+  L   G VPDT++VL  +EE+  KE  L  HSE+LAIAFGL++ K G+ + I+KNLR+C +CH   KL+S IY  EII RD +RFHHF+ G CSC 
Subjt:  EMKEKLNVVGHVPDTTQVLLCLEEDNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVAKLVSHIYNHEIIIRDGSRFHHFKSGSCSCK

Query:  DFW
        D+W
Subjt:  DFW

Q9LUJ2 Pentatricopeptide repeat-containing protein At3g226902.0e-14537.91Show/hide
Query:  FPLQNCVTER---ESKQLHSLSLKTGSLNHPSVSSRLLALYADPRINNLEYAQSLFDWIPKPTLVSWNMLIKCYIENQRSNDAIALFCKLLC--ELLPDS
        F L  C   R      Q+H L +K G      V + L+  YA+     L+ A+ +FD + +  +VSW  +I  Y     + DA+ LF +++   E+ P+S
Subjt:  FPLQNCVTER---ESKQLHSLSLKTGSLNHPSVSSRLLALYADPRINNLEYAQSLFDWIPKPTLVSWNMLIKCYIENQRSNDAIALFCKLLC--ELLPDS

Query:  FTLPCVLKGCARLTALQEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIEMALELFEEM------PE
         T+ CV+  CA+L  L+ G++++  I   G  V+  ++S+LV MY KC  I++ +++FD     ++   N++   Y R G    AL +F  M      P+
Subjt:  FTLPCVLKGCARLTALQEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIEMALELFEEM------PE

Query:  RDSF-----------------------------SW----TILVDGLSKSGKLETARDVFNRMPIRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVT
        R S                              SW      L+D   K  + +TA  +F+RM  +  V+WN+++ GY++ G+ + A E F+ MPE+N+V+
Subjt:  RDSF-----------------------------SW----TILVDGLSKSGKLETARDVFNRMPIRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVT

Query:  WNSMITGYELNKQFTQALKLF-EIMLREDISPNYATILGALSAASGLVSLGKGRWVHSYMVKNGFKTDGVLGTLLIEMYSKCGSVKSALRVFQSVPKKKL
        WN++I+G      F +A+++F  +  +E ++ +  T++   SA   L +L   +W++ Y+ KNG + D  LGT L++M+S+CG  +SA+ +F S+  + +
Subjt:  WNSMITGYELNKQFTQALKLF-EIMLREDISPNYATILGALSAASGLVSLGKGRWVHSYMVKNGFKTDGVLGTLLIEMYSKCGSVKSALRVFQSVPKKKL

Query:  GHWTAIIVGLGMHGLVEQALELFDDMCRTGLQPHAITFIGVLNACSHAGFAEDAHRYFKMMTDDYGIKPSIEHYGCLIDVLCRAGYLEEAKDTIERMPTK
          WTA I  + M G  E+A+ELFDDM   GL+P  + F+G L ACSH G  +     F  M   +G+ P   HYGC++D+L RAG LEEA   IE MP +
Subjt:  GHWTAIIVGLGMHGLVEQALELFDDMCRTGLQPHAITFIGVLNACSHAGFAEDAHRYFKMMTDDYGIKPSIEHYGCLIDVLCRAGYLEEAKDTIERMPTK

Query:  ANKVIWTSLLSASRKHGNIRMGEYAAHHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREMMKRKGIRKDPGCSSIEHQGSIHEFIVGDKSHPQTEEIY
         N VIW SLL+A R  GN+ M  YAA  +  LAP+ TG YV+LSN+YA+AG W  + +VR  MK KG+RK PG SSI+ +G  HEF  GD+SHP+   I 
Subjt:  ANKVIWTSLLSASRKHGNIRMGEYAAHHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREMMKRKGIRKDPGCSSIEHQGSIHEFIVGDKSHPQTEEIY

Query:  IKLCEMKEKLNVVGHVPDTTQVLLCLEEDNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVAKLVSHIYNHEIIIRDGSRFHHFKSGS
          L E+ ++ + +GHVPD + VL+ ++E  EK   L  HSE+LA+A+GL++   G+ IRI+KNLR+C+DCH+ AK  S +YN EII+RD +RFH+ + G 
Subjt:  IKLCEMKEKLNVVGHVPDTTQVLLCLEEDNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVAKLVSHIYNHEIIIRDGSRFHHFKSGS

Query:  CSCKDFW
        CSC DFW
Subjt:  CSCKDFW

Arabidopsis top hitse value%identityAlignment
AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein5.6e-15940.83Show/hide
Query:  LQNCVTERESKQLHSLSLKTGSLNHPSVSSRLLAL-YADPRINNLEYAQSLFDWIPKPTLVSWNMLIKCYIENQRSNDAIALF-CKLLCELLPDSFTLPC
        L NC T +  + +H+  +K G  N     S+L+      P    L YA S+F  I +P L+ WN + + +  +     A+ L+ C +   LLP+S+T P 
Subjt:  LQNCVTERESKQLHSLSLKTGSLNHPSVSSRLLAL-YADPRINNLEYAQSLFDWIPKPTLVSWNMLIKCYIENQRSNDAIALF-CKLLCELLPDSFTLPC

Query:  VLKGCARLTALQEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIEMALELFEEMPERDSFSWTILVD
        VLK CA+  A +EG+QIHG +LK+G  +D +V +SL+SMY + G +E   KVFD+   +D+VS+ +LI GYA  G IE A +LF+E+P +D  SW  ++ 
Subjt:  VLKGCARLTALQEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIEMALELFEEMPERDSFSWTILVD

Query:  GLSKSGKLETARDVFNRMPIRN-------------------SVS-------W-------------NAMINGYMKAGDFNTARELFDQMPERNLVTWNSMI
        G +++G  + A ++F  M   N                   S+        W             NA+I+ Y K G+  TA  LF+++P +++++WN++I
Subjt:  GLSKSGKLETARDVFNRMPIRN-------------------SVS-------W-------------NAMINGYMKAGDFNTARELFDQMPERNLVTWNSMI

Query:  TGYELNKQFTQALKLFEIMLREDISPNYATILGALSAASGLVSLGKGRWVHSYMVK--NGFKTDGVLGTLLIEMYSKCGSVKSALRVFQSVPKKKLGHWT
         GY     + +AL LF+ MLR   +PN  T+L  L A + L ++  GRW+H Y+ K   G      L T LI+MY+KCG +++A +VF S+  K L  W 
Subjt:  TGYELNKQFTQALKLFEIMLREDISPNYATILGALSAASGLVSLGKGRWVHSYMVK--NGFKTDGVLGTLLIEMYSKCGSVKSALRVFQSVPKKKLGHWT

Query:  AIIVGLGMHGLVEQALELFDDMCRTGLQPHAITFIGVLNACSHAGFAEDAHRYFKMMTDDYGIKPSIEHYGCLIDVLCRAGYLEEAKDTIERMPTKANKV
        A+I G  MHG  + + +LF  M + G+QP  ITF+G+L+ACSH+G  +     F+ MT DY + P +EHYGC+ID+L  +G  +EA++ I  M  + + V
Subjt:  AIIVGLGMHGLVEQALELFDDMCRTGLQPHAITFIGVLNACSHAGFAEDAHRYFKMMTDDYGIKPSIEHYGCLIDVLCRAGYLEEAKDTIERMPTKANKV

Query:  IWTSLLSASRKHGNIRMGEYAAHHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREMMKRKGIRKDPGCSSIEHQGSIHEFIVGDKSHPQTEEIYIKLC
        IW SLL A + HGN+ +GE  A +LI + P+  G YV+LSN+YA+AG W +V + R ++  KG++K PGCSSIE    +HEFI+GDK HP+  EIY  L 
Subjt:  IWTSLLSASRKHGNIRMGEYAAHHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREMMKRKGIRKDPGCSSIEHQGSIHEFIVGDKSHPQTEEIYIKLC

Query:  EMKEKLNVVGHVPDTTQVLLCLEEDNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVAKLVSHIYNHEIIIRDGSRFHHFKSGSCSCK
        EM+  L   G VPDT++VL  +EE+  KE  L  HSE+LAIAFGL++ K G+ + I+KNLR+C +CH   KL+S IY  EII RD +RFHHF+ G CSC 
Subjt:  EMKEKLNVVGHVPDTTQVLLCLEEDNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVAKLVSHIYNHEIIIRDGSRFHHFKSGSCSCK

Query:  DFW
        D+W
Subjt:  DFW

AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.0e-16038.89Show/hide
Query:  LQNCVTERESKQLHSLSLKTGSLNHPSVSSRLLALYADPRINNLEYAQSLFDWIPKPTLVSWNMLIKCYIENQRSNDAIALFCKLLCE--LLPDSFTLPC
        ++ CV+ R+ KQ H   ++TG+ + P  +S+L A+ A     +LEYA+ +FD IPKP   +WN LI+ Y        +I  F  ++ E    P+ +T P 
Subjt:  LQNCVTERESKQLHSLSLKTGSLNHPSVSSRLLALYADPRINNLEYAQSLFDWIPKPTLVSWNMLIKCYIENQRSNDAIALFCKLLCE--LLPDSFTLPC

Query:  VLKGCARLTALQEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIEMALELFEEMPERD---------
        ++K  A +++L  G+ +HG+ +K   G D FV +SL+  Y  CG+++   KVF  +++KD+VSWNS+I+G+ + G  + ALELF++M   D         
Subjt:  VLKGCARLTALQEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIEMALELFEEMPERD---------

Query:  ---------------------------SFSWTI---LVDGLSKSGKLETARDVFNRMPIRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVTWNSMI
                                   + + T+   ++D  +K G +E A+ +F+ M  +++V+W  M++GY  + D+  ARE+ + MP++++V WN++I
Subjt:  ---------------------------SFSWTI---LVDGLSKSGKLETARDVFNRMPIRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVTWNSMI

Query:  TGYELNKQFTQALKLF-EIMLREDISPNYATILGALSAASGLVSLGKGRWVHSYMVKNGFKTDGVLGTLLIEMYSKCGSVKSALRVFQSVPKKKLGHWTA
        + YE N +  +AL +F E+ L++++  N  T++  LSA + + +L  GRW+HSY+ K+G + +  + + LI MYSKCG ++ +  VF SV K+ +  W+A
Subjt:  TGYELNKQFTQALKLF-EIMLREDISPNYATILGALSAASGLVSLGKGRWVHSYMVKNGFKTDGVLGTLLIEMYSKCGSVKSALRVFQSVPKKKLGHWTA

Query:  IIVGLGMHGLVEQALELFDDMCRTGLQPHAITFIGVLNACSHAGFAEDAHRYFKMMTDDYGIKPSIEHYGCLIDVLCRAGYLEEAKDTIERMPTKANKVI
        +I GL MHG   +A+++F  M    ++P+ +TF  V  ACSH G  ++A   F  M  +YGI P  +HY C++DVL R+GYLE+A   IE MP   +  +
Subjt:  IIVGLGMHGLVEQALELFDDMCRTGLQPHAITFIGVLNACSHAGFAEDAHRYFKMMTDDYGIKPSIEHYGCLIDVLCRAGYLEEAKDTIERMPTKANKVI

Query:  WTSLLSASRKHGNIRMGEYAAHHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREMMKRKGIRKDPGCSSIEHQGSIHEFIVGDKSHPQTEEIYIKLCE
        W +LL A + H N+ + E A   L++L P   G +V+LSN+YA  G WE V ++R+ M+  G++K+PGCSSIE  G IHEF+ GD +HP +E++Y KL E
Subjt:  WTSLLSASRKHGNIRMGEYAAHHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREMMKRKGIRKDPGCSSIEHQGSIHEFIVGDKSHPQTEEIYIKLCE

Query:  MKEKLNVVGHVPDTTQVLLCLEEDNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVAKLVSHIYNHEIIIRDGSRFHHFKSGSCSCKD
        + EKL   G+ P+ +QVL  +EE+  KE  L  HSE+LAI +GL++ +    IR+IKNLR+C DCH+VAKL+S +Y+ EII+RD  RFHHF++G CSC D
Subjt:  MKEKLNVVGHVPDTTQVLLCLEEDNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVAKLVSHIYNHEIIIRDGSRFHHFKSGSCSCKD

Query:  FW
        FW
Subjt:  FW

AT3G22690.1 CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012881), Pentatricopeptide repeat (InterPro:IPR002885)2.7e-14537.82Show/hide
Query:  FPLQNCVTER---ESKQLHSLSLKTGSLNHPSVSSRLLALYADPRINNLEYAQSLFDWIPKPTLVSWNMLIKCYIENQRSNDAIALFCKLLC--ELLPDS
        F L  C   R      Q+H L +K G      V + L+  YA+     L+ A+ +FD + +  +VSW  +I  Y     + DA+ LF +++   E+ P+S
Subjt:  FPLQNCVTER---ESKQLHSLSLKTGSLNHPSVSSRLLALYADPRINNLEYAQSLFDWIPKPTLVSWNMLIKCYIENQRSNDAIALFCKLLC--ELLPDS

Query:  FTLPCVLKGCARLTALQEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIEMALELFEEM------PE
         T+ CV+  CA+L  L+ G++++  I   G  V+  ++S+LV MY KC  I++ +++FD     ++   N++   Y R G    AL +F  M      P+
Subjt:  FTLPCVLKGCARLTALQEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIEMALELFEEM------PE

Query:  RDSF-----------------------------SW----TILVDGLSKSGKLETARDVFNRMPIRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVT
        R S                              SW      L+D   K  + +TA  +F+RM  +  V+WN+++ GY++ G+ + A E F+ MPE+N+V+
Subjt:  RDSF-----------------------------SW----TILVDGLSKSGKLETARDVFNRMPIRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVT

Query:  WNSMITGYELNKQFTQALKLF-EIMLREDISPNYATILGALSAASGLVSLGKGRWVHSYMVKNGFKTDGVLGTLLIEMYSKCGSVKSALRVFQSVPKKKL
        WN++I+G      F +A+++F  +  +E ++ +  T++   SA   L +L   +W++ Y+ KNG + D  LGT L++M+S+CG  +SA+ +F S+  + +
Subjt:  WNSMITGYELNKQFTQALKLF-EIMLREDISPNYATILGALSAASGLVSLGKGRWVHSYMVKNGFKTDGVLGTLLIEMYSKCGSVKSALRVFQSVPKKKL

Query:  GHWTAIIVGLGMHGLVEQALELFDDMCRTGLQPHAITFIGVLNACSHAGFAEDAHRYFKMMTDDYGIKPSIEHYGCLIDVLCRAGYLEEAKDTIERMPTK
          WTA I  + M G  E+A+ELFDDM   GL+P  + F+G L ACSH G  +     F  M   +G+ P   HYGC++D+L RAG LEEA   IE MP +
Subjt:  GHWTAIIVGLGMHGLVEQALELFDDMCRTGLQPHAITFIGVLNACSHAGFAEDAHRYFKMMTDDYGIKPSIEHYGCLIDVLCRAGYLEEAKDTIERMPTK

Query:  ANKVIWTSLLSASRKHGNIRMGEYAAHHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREMMKRKGIRKDPGCSSIEHQGSIHEFIVGDKSHPQTEEIY
         N VIW SLL+A R  GN+ M  YAA  +  LAP+ TG YV+LSN+YA+AG W  + +VR  MK KG+RK PG SSI+ +G  HEF  GD+SHP+   I 
Subjt:  ANKVIWTSLLSASRKHGNIRMGEYAAHHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREMMKRKGIRKDPGCSSIEHQGSIHEFIVGDKSHPQTEEIY

Query:  IKLCEMKEKLNVVGHVPDTTQVLLCLEEDNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVAKLVSHIYNHEIIIRDGSRFHHFKSGS
          L E+ ++ + +GHVPD + VL+ ++E  EK   L  HSE+LA+A+GL++   G+ IRI+KNLR+C+DCH+ AK  S +YN EII+RD +RFH+ + G 
Subjt:  IKLCEMKEKLNVVGHVPDTTQVLLCLEEDNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVAKLVSHIYNHEIIIRDGSRFHHFKSGS

Query:  CSCKDF
        CSC DF
Subjt:  CSCKDF

AT3G22690.2 INVOLVED IN: photosystem II assembly, regulation of chlorophyll biosynthetic process, photosystem I assembly, thylakoid membrane organization, RNA modification1.4e-14637.91Show/hide
Query:  FPLQNCVTER---ESKQLHSLSLKTGSLNHPSVSSRLLALYADPRINNLEYAQSLFDWIPKPTLVSWNMLIKCYIENQRSNDAIALFCKLLC--ELLPDS
        F L  C   R      Q+H L +K G      V + L+  YA+     L+ A+ +FD + +  +VSW  +I  Y     + DA+ LF +++   E+ P+S
Subjt:  FPLQNCVTER---ESKQLHSLSLKTGSLNHPSVSSRLLALYADPRINNLEYAQSLFDWIPKPTLVSWNMLIKCYIENQRSNDAIALFCKLLC--ELLPDS

Query:  FTLPCVLKGCARLTALQEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIEMALELFEEM------PE
         T+ CV+  CA+L  L+ G++++  I   G  V+  ++S+LV MY KC  I++ +++FD     ++   N++   Y R G    AL +F  M      P+
Subjt:  FTLPCVLKGCARLTALQEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIEMALELFEEM------PE

Query:  RDSF-----------------------------SW----TILVDGLSKSGKLETARDVFNRMPIRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVT
        R S                              SW      L+D   K  + +TA  +F+RM  +  V+WN+++ GY++ G+ + A E F+ MPE+N+V+
Subjt:  RDSF-----------------------------SW----TILVDGLSKSGKLETARDVFNRMPIRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVT

Query:  WNSMITGYELNKQFTQALKLF-EIMLREDISPNYATILGALSAASGLVSLGKGRWVHSYMVKNGFKTDGVLGTLLIEMYSKCGSVKSALRVFQSVPKKKL
        WN++I+G      F +A+++F  +  +E ++ +  T++   SA   L +L   +W++ Y+ KNG + D  LGT L++M+S+CG  +SA+ +F S+  + +
Subjt:  WNSMITGYELNKQFTQALKLF-EIMLREDISPNYATILGALSAASGLVSLGKGRWVHSYMVKNGFKTDGVLGTLLIEMYSKCGSVKSALRVFQSVPKKKL

Query:  GHWTAIIVGLGMHGLVEQALELFDDMCRTGLQPHAITFIGVLNACSHAGFAEDAHRYFKMMTDDYGIKPSIEHYGCLIDVLCRAGYLEEAKDTIERMPTK
          WTA I  + M G  E+A+ELFDDM   GL+P  + F+G L ACSH G  +     F  M   +G+ P   HYGC++D+L RAG LEEA   IE MP +
Subjt:  GHWTAIIVGLGMHGLVEQALELFDDMCRTGLQPHAITFIGVLNACSHAGFAEDAHRYFKMMTDDYGIKPSIEHYGCLIDVLCRAGYLEEAKDTIERMPTK

Query:  ANKVIWTSLLSASRKHGNIRMGEYAAHHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREMMKRKGIRKDPGCSSIEHQGSIHEFIVGDKSHPQTEEIY
         N VIW SLL+A R  GN+ M  YAA  +  LAP+ TG YV+LSN+YA+AG W  + +VR  MK KG+RK PG SSI+ +G  HEF  GD+SHP+   I 
Subjt:  ANKVIWTSLLSASRKHGNIRMGEYAAHHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREMMKRKGIRKDPGCSSIEHQGSIHEFIVGDKSHPQTEEIY

Query:  IKLCEMKEKLNVVGHVPDTTQVLLCLEEDNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVAKLVSHIYNHEIIIRDGSRFHHFKSGS
          L E+ ++ + +GHVPD + VL+ ++E  EK   L  HSE+LA+A+GL++   G+ IRI+KNLR+C+DCH+ AK  S +YN EII+RD +RFH+ + G 
Subjt:  IKLCEMKEKLNVVGHVPDTTQVLLCLEEDNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVAKLVSHIYNHEIIIRDGSRFHHFKSGS

Query:  CSCKDFW
        CSC DFW
Subjt:  CSCKDFW

AT5G48910.1 Pentatricopeptide repeat (PPR) superfamily protein1.0e-15741.75Show/hide
Query:  PRDLHFPLQNCVTERESKQLHSLSLKTGSLNHPSVSSRLLALYA--DPRINNLEYAQSLFDWIPKPTLVSWNMLIKCYIENQRSND--AIALFCKLLCE-
        P  L   + NC T R+  Q+H++ +K+G +     ++ +L   A  D    +L+YA  +F+ +P+    SWN +I+ + E+       AI LF +++ + 
Subjt:  PRDLHFPLQNCVTERESKQLHSLSLKTGSLNHPSVSSRLLALYA--DPRINNLEYAQSLFDWIPKPTLVSWNMLIKCYIENQRSND--AIALFCKLLCE-

Query:  -LLPDSFTLPCVLKGCARLTALQEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIEMALELFEEMPE
         + P+ FT P VLK CA+   +QEGKQIHGL LK GFG D+FV+S+LV MY  CG ++  R +F                              ++ + E
Subjt:  -LLPDSFTLPCVLKGCARLTALQEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIEMALELFEEMPE

Query:  RDSFSWTILVDGLSKSGKLETARDVFNRMPIRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVTWNSMITGYELNKQFTQALKLFEIMLREDISPNY
        +D     ++ D   + G++               V WN MI+GYM+ GD   AR LFD+M +R++V+WN+MI+GY LN  F  A+++F  M + DI PNY
Subjt:  RDSFSWTILVDGLSKSGKLETARDVFNRMPIRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVTWNSMITGYELNKQFTQALKLFEIMLREDISPNY

Query:  ATILGALSAASGLVSLGKGRWVHSYMVKNGFKTDGVLGTLLIEMYSKCGSVKSALRVFQSVPKKKLGHWTAIIVGLGMHGLVEQALELFDDMCRTGLQPH
         T++  L A S L SL  G W+H Y   +G + D VLG+ LI+MYSKCG ++ A+ VF+ +P++ +  W+A+I G  +HG    A++ F  M + G++P 
Subjt:  ATILGALSAASGLVSLGKGRWVHSYMVKNGFKTDGVLGTLLIEMYSKCGSVKSALRVFQSVPKKKLGHWTAIIVGLGMHGLVEQALELFDDMCRTGLQPH

Query:  AITFIGVLNACSHAGFAEDAHRYFKMMTDDYGIKPSIEHYGCLIDVLCRAGYLEEAKDTIERMPTKANKVIWTSLLSASRKHGNIRMGEYAAHHLIDLAP
         + +I +L ACSH G  E+  RYF  M    G++P IEHYGC++D+L R+G L+EA++ I  MP K + VIW +LL A R  GN+ MG+  A+ L+D+ P
Subjt:  AITFIGVLNACSHAGFAEDAHRYFKMMTDDYGIKPSIEHYGCLIDVLCRAGYLEEAKDTIERMPTKANKVIWTSLLSASRKHGNIRMGEYAAHHLIDLAP

Query:  DTTGCYVILSNMYAAAGLWEKVRQVREMMKRKGIRKDPGCSSIEHQGSIHEFIVGDKSHPQTEEIYIKLCEMKEKLNVVGHVPDTTQVLLCLEEDNEKEA
          +G YV LSNMYA+ G W +V ++R  MK K IRKDPGCS I+  G +HEF+V D SHP+ +EI   L E+ +KL + G+ P TTQVLL LEE+ +KE 
Subjt:  DTTGCYVILSNMYAAAGLWEKVRQVREMMKRKGIRKDPGCSSIEHQGSIHEFIVGDKSHPQTEEIYIKLCEMKEKLNVVGHVPDTTQVLLCLEEDNEKEA

Query:  ELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVAKLVSHIYNHEIIIRDGSRFHHFKSGSCSCKDFW
         L  HSE++A AFGL++   G PIRI+KNLRIC DCH+  KL+S +Y  +I +RD  RFHHF+ GSCSC D+W
Subjt:  ELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVAKLVSHIYNHEIIIRDGSRFHHFKSGSCSCKDFW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTGTCTCTTACACTTTCACATTCCCTTCAACCATTTCTCCCTCGCGACCTTCATTTTCCTCTTCAAAACTGCGTAACTGAACGAGAATCAAAGCAACTCCACTCTCT
CTCGCTCAAAACAGGCTCCTTGAATCACCCTTCAGTATCTTCTCGTCTTTTGGCTCTCTATGCAGATCCCAGAATCAACAATCTCGAATATGCTCAGTCCCTTTTCGACT
GGATTCCAAAACCCACTTTGGTTTCTTGGAATATGCTCATCAAGTGCTACATCGAGAACCAACGTTCAAATGATGCCATTGCGTTGTTCTGCAAATTGCTCTGTGAGTTA
TTGCCTGATTCTTTTACATTGCCTTGCGTTCTAAAGGGTTGTGCTCGATTGACTGCATTACAGGAAGGGAAACAGATTCATGGGTTGATATTGAAAATTGGGTTTGGTGT
GGATAAGTTTGTTTTGAGTAGTTTAGTTAGTATGTATTCTAAATGTGGTGAGATTGAGCTGTGTAGGAAAGTGTTTGATCGAATGGAAGATAAGGATATAGTATCATGGA
ATTCTTTGATTGATGGATATGCTAGATGTGGTGAAATTGAAATGGCACTGGAGTTGTTTGAAGAAATGCCAGAGAGAGATTCTTTTTCATGGACTATTCTGGTTGACGGT
CTTTCGAAAAGCGGGAAGCTTGAGACTGCAAGAGACGTGTTCAATCGAATGCCTATTCGAAATTCTGTATCTTGGAATGCTATGATTAATGGCTACATGAAAGCTGGGGA
TTTTAACACGGCACGAGAATTATTCGATCAGATGCCAGAGAGAAACCTTGTTACATGGAATTCAATGATAACTGGATATGAACTGAACAAGCAGTTTACACAAGCCTTGA
AGCTGTTTGAGATCATGTTGAGAGAAGATATATCACCCAATTATGCCACTATCCTTGGAGCTCTTTCTGCAGCCTCAGGATTGGTTAGTCTTGGTAAGGGAAGATGGGTT
CATTCCTATATGGTGAAAAATGGTTTCAAAACAGATGGTGTGCTCGGCACGTTGCTGATAGAAATGTACTCCAAGTGTGGCAGCGTTAAGAGTGCCCTCAGAGTTTTTCA
GTCTGTACCCAAAAAGAAATTGGGGCATTGGACGGCTATAATTGTAGGCTTGGGAATGCATGGATTGGTAGAGCAAGCTCTTGAGCTATTTGATGATATGTGCAGAACTG
GATTGCAGCCTCATGCTATTACTTTTATCGGAGTGTTGAATGCTTGTAGTCATGCAGGATTTGCAGAAGATGCACATCGGTACTTTAAAATGATGACAGATGATTATGGA
ATCAAGCCCTCTATCGAACACTATGGTTGCTTGATTGATGTTCTGTGTCGAGCTGGATATCTGGAAGAGGCAAAGGATACCATTGAGAGAATGCCCACTAAAGCAAACAA
AGTAATTTGGACGAGTCTACTAAGTGCTTCAAGGAAACATGGAAACATAAGAATGGGGGAATATGCAGCTCATCATCTGATTGATTTAGCACCGGATACTACTGGATGTT
ATGTTATTCTTTCGAACATGTATGCCGCAGCTGGCTTGTGGGAGAAAGTTCGTCAAGTAAGAGAAATGATGAAGAGAAAAGGAATCAGAAAGGATCCAGGATGCAGTTCC
ATTGAGCATCAAGGTTCAATTCATGAATTCATCGTGGGAGATAAGTCACATCCCCAAACTGAAGAGATATACATCAAACTGTGTGAGATGAAAGAGAAATTGAATGTAGT
GGGGCATGTTCCCGACACGACTCAAGTTCTGTTATGTCTTGAAGAGGATAATGAGAAAGAAGCAGAACTTGAAACCCACAGTGAGAGGTTGGCTATAGCTTTTGGGCTTC
TTAATATCAAGCATGGAAGTCCTATCCGGATCATAAAGAATCTTCGTATTTGCAACGATTGCCATGCTGTGGCTAAACTTGTTTCTCATATTTATAATCATGAGATCATT
ATCAGAGATGGTAGTCGATTCCATCACTTTAAAAGTGGGTCTTGTTCTTGTAAAGATTTTTGGTAA
mRNA sequenceShow/hide mRNA sequence
AAATGTTCAGAATTTACTTTGATCCTTATTTATCACTTTATCGATCAATATCTCAATTGGGTATTCACAACCGAGCTCCATTCTTTGATTTCTAGTTCGATTCAATCGAA
TTCGAATGCTGTCTCTTACACTTTCACATTCCCTTCAACCATTTCTCCCTCGCGACCTTCATTTTCCTCTTCAAAACTGCGTAACTGAACGAGAATCAAAGCAACTCCAC
TCTCTCTCGCTCAAAACAGGCTCCTTGAATCACCCTTCAGTATCTTCTCGTCTTTTGGCTCTCTATGCAGATCCCAGAATCAACAATCTCGAATATGCTCAGTCCCTTTT
CGACTGGATTCCAAAACCCACTTTGGTTTCTTGGAATATGCTCATCAAGTGCTACATCGAGAACCAACGTTCAAATGATGCCATTGCGTTGTTCTGCAAATTGCTCTGTG
AGTTATTGCCTGATTCTTTTACATTGCCTTGCGTTCTAAAGGGTTGTGCTCGATTGACTGCATTACAGGAAGGGAAACAGATTCATGGGTTGATATTGAAAATTGGGTTT
GGTGTGGATAAGTTTGTTTTGAGTAGTTTAGTTAGTATGTATTCTAAATGTGGTGAGATTGAGCTGTGTAGGAAAGTGTTTGATCGAATGGAAGATAAGGATATAGTATC
ATGGAATTCTTTGATTGATGGATATGCTAGATGTGGTGAAATTGAAATGGCACTGGAGTTGTTTGAAGAAATGCCAGAGAGAGATTCTTTTTCATGGACTATTCTGGTTG
ACGGTCTTTCGAAAAGCGGGAAGCTTGAGACTGCAAGAGACGTGTTCAATCGAATGCCTATTCGAAATTCTGTATCTTGGAATGCTATGATTAATGGCTACATGAAAGCT
GGGGATTTTAACACGGCACGAGAATTATTCGATCAGATGCCAGAGAGAAACCTTGTTACATGGAATTCAATGATAACTGGATATGAACTGAACAAGCAGTTTACACAAGC
CTTGAAGCTGTTTGAGATCATGTTGAGAGAAGATATATCACCCAATTATGCCACTATCCTTGGAGCTCTTTCTGCAGCCTCAGGATTGGTTAGTCTTGGTAAGGGAAGAT
GGGTTCATTCCTATATGGTGAAAAATGGTTTCAAAACAGATGGTGTGCTCGGCACGTTGCTGATAGAAATGTACTCCAAGTGTGGCAGCGTTAAGAGTGCCCTCAGAGTT
TTTCAGTCTGTACCCAAAAAGAAATTGGGGCATTGGACGGCTATAATTGTAGGCTTGGGAATGCATGGATTGGTAGAGCAAGCTCTTGAGCTATTTGATGATATGTGCAG
AACTGGATTGCAGCCTCATGCTATTACTTTTATCGGAGTGTTGAATGCTTGTAGTCATGCAGGATTTGCAGAAGATGCACATCGGTACTTTAAAATGATGACAGATGATT
ATGGAATCAAGCCCTCTATCGAACACTATGGTTGCTTGATTGATGTTCTGTGTCGAGCTGGATATCTGGAAGAGGCAAAGGATACCATTGAGAGAATGCCCACTAAAGCA
AACAAAGTAATTTGGACGAGTCTACTAAGTGCTTCAAGGAAACATGGAAACATAAGAATGGGGGAATATGCAGCTCATCATCTGATTGATTTAGCACCGGATACTACTGG
ATGTTATGTTATTCTTTCGAACATGTATGCCGCAGCTGGCTTGTGGGAGAAAGTTCGTCAAGTAAGAGAAATGATGAAGAGAAAAGGAATCAGAAAGGATCCAGGATGCA
GTTCCATTGAGCATCAAGGTTCAATTCATGAATTCATCGTGGGAGATAAGTCACATCCCCAAACTGAAGAGATATACATCAAACTGTGTGAGATGAAAGAGAAATTGAAT
GTAGTGGGGCATGTTCCCGACACGACTCAAGTTCTGTTATGTCTTGAAGAGGATAATGAGAAAGAAGCAGAACTTGAAACCCACAGTGAGAGGTTGGCTATAGCTTTTGG
GCTTCTTAATATCAAGCATGGAAGTCCTATCCGGATCATAAAGAATCTTCGTATTTGCAACGATTGCCATGCTGTGGCTAAACTTGTTTCTCATATTTATAATCATGAGA
TCATTATCAGAGATGGTAGTCGATTCCATCACTTTAAAAGTGGGTCTTGTTCTTGTAAAGATTTTTGGTAACTTCATCTCTTCTAAGTTGGTGGTAAATCTTTTCTTCCT
GCTCGAGCAACTAGCATTGTGTTCACCTAATCCTTGTCCTCATGATTTCAGGTTTCTTCATTTTCCGGGGACGAAGTTGGTGGCGTAAGCCCATGCATTGTTGGCAACAG
GATCTGCAACATGGTCAA
Protein sequenceShow/hide protein sequence
MLSLTLSHSLQPFLPRDLHFPLQNCVTERESKQLHSLSLKTGSLNHPSVSSRLLALYADPRINNLEYAQSLFDWIPKPTLVSWNMLIKCYIENQRSNDAIALFCKLLCEL
LPDSFTLPCVLKGCARLTALQEGKQIHGLILKIGFGVDKFVLSSLVSMYSKCGEIELCRKVFDRMEDKDIVSWNSLIDGYARCGEIEMALELFEEMPERDSFSWTILVDG
LSKSGKLETARDVFNRMPIRNSVSWNAMINGYMKAGDFNTARELFDQMPERNLVTWNSMITGYELNKQFTQALKLFEIMLREDISPNYATILGALSAASGLVSLGKGRWV
HSYMVKNGFKTDGVLGTLLIEMYSKCGSVKSALRVFQSVPKKKLGHWTAIIVGLGMHGLVEQALELFDDMCRTGLQPHAITFIGVLNACSHAGFAEDAHRYFKMMTDDYG
IKPSIEHYGCLIDVLCRAGYLEEAKDTIERMPTKANKVIWTSLLSASRKHGNIRMGEYAAHHLIDLAPDTTGCYVILSNMYAAAGLWEKVRQVREMMKRKGIRKDPGCSS
IEHQGSIHEFIVGDKSHPQTEEIYIKLCEMKEKLNVVGHVPDTTQVLLCLEEDNEKEAELETHSERLAIAFGLLNIKHGSPIRIIKNLRICNDCHAVAKLVSHIYNHEII
IRDGSRFHHFKSGSCSCKDFW