; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc06G11610 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc06G11610
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionPentatricopeptide repeat-containing protein
Genome locationClcChr06:21874515..21881053
RNA-Seq ExpressionClc06G11610
SyntenyClc06G11610
Gene Ontology termsGO:0006338 - chromatin remodeling (biological process)
GO:0000228 - nuclear chromosome (cellular component)
GO:0070603 - SWI/SNF superfamily-type complex (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0046726.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]0.0e+0088.41Show/hide
Query:  SFEQKHFDCACAFSEVFQFFKLSQSLHMLPLSIIRRVQFISRHFSSSLQLLPIPLRISKLTKKSCIEYLRNCKSMDQLKQIQSQIFRIGLEGDRDTINKL
        SFE KHFDCACA S++FQFFK SQS HMLPLSIIRRVQFISRHFSSS  L+ +P+RISK TKKSCIEYLRNCKSM+QLK+IQSQIFRIGLEGDRD INKL
Subjt:  SFEQKHFDCACAFSEVFQFFKLSQSLHMLPLSIIRRVQFISRHFSSSLQLLPIPLRISKLTKKSCIEYLRNCKSMDQLKQIQSQIFRIGLEGDRDTINKL

Query:  MAFCTDSSLGNLRYAERIFNYIQDPSLFVYNVMVKVYAKRRIFRKVILLFQQLREDELWPDNFTYPFVLKAIGCLRDVRQGEKVHGFVVKTGMDFDNYVC
        MAFC D SLGNLRYAE+IF+Y+QDPSLFVYNVMVK+YAKR + RKV+LLFQQLRED+LWPDNFTYPFVLKAIGCLRDV QGEK+HGFVVKTGM+ DNYVC
Subjt:  MAFCTDSSLGNLRYAERIFNYIQDPSLFVYNVMVKVYAKRRIFRKVILLFQQLREDELWPDNFTYPFVLKAIGCLRDVRQGEKVHGFVVKTGMDFDNYVC

Query:  NSLMDMYSELGNVENAKKLFDKMTTKDSVSWNVMIAGYVRCRRFEDAISTFREMQQLSNEEPGEATVVSTLSACTALKNMELGEEIHNYVRKELGFTTII
        NSLMDMYSELGNVENAKKLFD+MTT+DSVSWNVMI+GYV CRRFEDAI+TFREMQQ  NE+P EATVVSTLSACTALKN+ELG+EIHNYVRKELGFT  I
Subjt:  NSLMDMYSELGNVENAKKLFDKMTTKDSVSWNVMIAGYVRCRRFEDAISTFREMQQLSNEEPGEATVVSTLSACTALKNMELGEEIHNYVRKELGFTTII

Query:  NNALLDMYAKCGCLNIARNIFDEMPMKNVICWTSMISGYINCGDLGEARDLFDRSPVKDVVLWTAMINGYVQFHHFDEAVALFREMQIRNVKPDKFTVVT
        +NALLDMYAKCGCLNI+RNIFDEMPMKNVICWTSMISGYINCGDL EARDLFD+SPV+DVVLWTAMINGYVQFHHFD+AVALFREMQI+ VKPDKFTVVT
Subjt:  NNALLDMYAKCGCLNIARNIFDEMPMKNVICWTSMISGYINCGDLGEARDLFDRSPVKDVVLWTAMINGYVQFHHFDEAVALFREMQIRNVKPDKFTVVT

Query:  LLTGCAQLGALEQGKWIHGYLDENRITMDVVVGTALIEMYSKCGCVDKSLEIFYELEDKDTASWTSIICGLAMNGKTGEALRLFSEMELVGAKPDDITFI
        LLTGCAQLGALEQGKWIHGYLDENRIT+DVVVGTALIEMYSKCGCVDKSLEIFYELEDKDTASWTSIICGLAMNGKT EALRLFSEMELVGAKPDDITFI
Subjt:  LLTGCAQLGALEQGKWIHGYLDENRITMDVVVGTALIEMYSKCGCVDKSLEIFYELEDKDTASWTSIICGLAMNGKTGEALRLFSEMELVGAKPDDITFI

Query:  GVLSACSHGGLVEEGRRLFNSMKKVYRIEPKVEHYGCVIDLLGRAGLLDEAEELIQGIPDENGEIVVPLYGALLSACRIHNNVDMGERLANKLVNIESCD
        GVLSACSHGGLVEEGRR FNSMKKVYRIEPKVEHYGCV+DLLGRAGLLDEAEELIQ I  EN EIVV LYGALLSACRIHNNVDMGERLA KLVNIE CD
Subjt:  GVLSACSHGGLVEEGRRLFNSMKKVYRIEPKVEHYGCVIDLLGRAGLLDEAEELIQGIPDENGEIVVPLYGALLSACRIHNNVDMGERLANKLVNIESCD

Query:  SSIHTLLANIYASVDRWEDAKKVRRKMKELGVKKMPGCSSIEVDGIVHEFLVGDPSHPKMIEICSMLDRVTGQLLGSKESQLEGVMPLYKDTQYCSFVE
        SSIH LLANIYAS DRWEDAKKVRRKMKELGVKKMPGCSSIEVDGIVHEFLVGDPSHP+ IEI SML+RV+ QLLG KESQL   M    DTQ+C+FVE
Subjt:  SSIHTLLANIYASVDRWEDAKKVRRKMKELGVKKMPGCSSIEVDGIVHEFLVGDPSHPKMIEICSMLDRVTGQLLGSKESQLEGVMPLYKDTQYCSFVE

TYK14505.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]0.0e+0088.41Show/hide
Query:  SFEQKHFDCACAFSEVFQFFKLSQSLHMLPLSIIRRVQFISRHFSSSLQLLPIPLRISKLTKKSCIEYLRNCKSMDQLKQIQSQIFRIGLEGDRDTINKL
        SFE KHFDCACA S++FQFFK SQS HMLPLSIIRRVQFISRHFSSS  L+ +P+RISK TKKSCIEYLRNCKSM+QLK+IQSQIFRIGLEGDRD INKL
Subjt:  SFEQKHFDCACAFSEVFQFFKLSQSLHMLPLSIIRRVQFISRHFSSSLQLLPIPLRISKLTKKSCIEYLRNCKSMDQLKQIQSQIFRIGLEGDRDTINKL

Query:  MAFCTDSSLGNLRYAERIFNYIQDPSLFVYNVMVKVYAKRRIFRKVILLFQQLREDELWPDNFTYPFVLKAIGCLRDVRQGEKVHGFVVKTGMDFDNYVC
        MAFC D SLGNLRYAE+IF+Y+QDPSLFVYNVMVK+YAKR + RKV+LLFQQLRED+LWPDNFTYPFVLKAIGCLRDV QGEK+HGFVVKTGM+ DNYVC
Subjt:  MAFCTDSSLGNLRYAERIFNYIQDPSLFVYNVMVKVYAKRRIFRKVILLFQQLREDELWPDNFTYPFVLKAIGCLRDVRQGEKVHGFVVKTGMDFDNYVC

Query:  NSLMDMYSELGNVENAKKLFDKMTTKDSVSWNVMIAGYVRCRRFEDAISTFREMQQLSNEEPGEATVVSTLSACTALKNMELGEEIHNYVRKELGFTTII
        NSLMDMYSELGNVENAKKLFD+MTT+DSVSWNVMI+GYV CRRFEDAI+TFREMQQ  NE+P EATVVSTLSACTALKN+ELG+EIHNYVRKELGFT  I
Subjt:  NSLMDMYSELGNVENAKKLFDKMTTKDSVSWNVMIAGYVRCRRFEDAISTFREMQQLSNEEPGEATVVSTLSACTALKNMELGEEIHNYVRKELGFTTII

Query:  NNALLDMYAKCGCLNIARNIFDEMPMKNVICWTSMISGYINCGDLGEARDLFDRSPVKDVVLWTAMINGYVQFHHFDEAVALFREMQIRNVKPDKFTVVT
        +NALLDMYAKCGCLNI+RNIFDEMPMKNVICWTSMISGYINCGDL EARDLFD+SPV+DVVLWTAMINGYVQFHHFD+AVALFREMQI+ VKPDKFTVVT
Subjt:  NNALLDMYAKCGCLNIARNIFDEMPMKNVICWTSMISGYINCGDLGEARDLFDRSPVKDVVLWTAMINGYVQFHHFDEAVALFREMQIRNVKPDKFTVVT

Query:  LLTGCAQLGALEQGKWIHGYLDENRITMDVVVGTALIEMYSKCGCVDKSLEIFYELEDKDTASWTSIICGLAMNGKTGEALRLFSEMELVGAKPDDITFI
        LLTGCAQLGALEQGKWIHGYLDENRIT+DVVVGTALIEMYSKCGCVDKSLEIFYELEDKDTASWTSIICGLAMNGKT EALRLFSEMELVGAKPDDITFI
Subjt:  LLTGCAQLGALEQGKWIHGYLDENRITMDVVVGTALIEMYSKCGCVDKSLEIFYELEDKDTASWTSIICGLAMNGKTGEALRLFSEMELVGAKPDDITFI

Query:  GVLSACSHGGLVEEGRRLFNSMKKVYRIEPKVEHYGCVIDLLGRAGLLDEAEELIQGIPDENGEIVVPLYGALLSACRIHNNVDMGERLANKLVNIESCD
        GVLSACSHGGLVEEGRR FNSMKKVYRIEPKVEHYGCV+DLLGRAGLLDEAEELIQ I  EN EIVV LYGALLSACRIHNNVDMGERLA KLVNIE CD
Subjt:  GVLSACSHGGLVEEGRRLFNSMKKVYRIEPKVEHYGCVIDLLGRAGLLDEAEELIQGIPDENGEIVVPLYGALLSACRIHNNVDMGERLANKLVNIESCD

Query:  SSIHTLLANIYASVDRWEDAKKVRRKMKELGVKKMPGCSSIEVDGIVHEFLVGDPSHPKMIEICSMLDRVTGQLLGSKESQLEGVMPLYKDTQYCSFVE
        SSIH LLANIYAS DRWEDAKKVRRKMKELGVKKMPGCSSIEVDGIVHEFLVGDPSHP+ IEI SML+RV+ QLLG KESQL   M    DTQ+C+FVE
Subjt:  SSIHTLLANIYASVDRWEDAKKVRRKMKELGVKKMPGCSSIEVDGIVHEFLVGDPSHPKMIEICSMLDRVTGQLLGSKESQLEGVMPLYKDTQYCSFVE

XP_008464984.1 PREDICTED: pentatricopeptide repeat-containing protein At1g31430 [Cucumis melo]0.0e+0088.84Show/hide
Query:  MLPLSIIRRVQFISRHFSSSLQLLPIPLRISKLTKKSCIEYLRNCKSMDQLKQIQSQIFRIGLEGDRDTINKLMAFCTDSSLGNLRYAERIFNYIQDPSL
        MLPLSIIRRVQFISRHFSSS  L+ +P+RISK TKKSCIEYLRNCKSM+QLK+IQSQIFRIGLEGDRD INKLMAFC D SLGNLRYAE+IF+Y+QDPSL
Subjt:  MLPLSIIRRVQFISRHFSSSLQLLPIPLRISKLTKKSCIEYLRNCKSMDQLKQIQSQIFRIGLEGDRDTINKLMAFCTDSSLGNLRYAERIFNYIQDPSL

Query:  FVYNVMVKVYAKRRIFRKVILLFQQLREDELWPDNFTYPFVLKAIGCLRDVRQGEKVHGFVVKTGMDFDNYVCNSLMDMYSELGNVENAKKLFDKMTTKD
        FVYNVMVK+YAKR + RKV+LLFQQLRED+LWPDNFTYPFVLKAIGCLRDV QGEK+HGFVVKTGM+ DNYVCNSLMDMYSELGNVENAKKLFD+MTT+D
Subjt:  FVYNVMVKVYAKRRIFRKVILLFQQLREDELWPDNFTYPFVLKAIGCLRDVRQGEKVHGFVVKTGMDFDNYVCNSLMDMYSELGNVENAKKLFDKMTTKD

Query:  SVSWNVMIAGYVRCRRFEDAISTFREMQQLSNEEPGEATVVSTLSACTALKNMELGEEIHNYVRKELGFTTIINNALLDMYAKCGCLNIARNIFDEMPMK
        SVSWNVMI+GYV CRRFEDAI+TFREMQQ  NE+P EATVVSTLSACTALKN+ELG+EIHNYVRKELGFT  I+NALLDMYAKCGCLNI+RNIFDEMPMK
Subjt:  SVSWNVMIAGYVRCRRFEDAISTFREMQQLSNEEPGEATVVSTLSACTALKNMELGEEIHNYVRKELGFTTIINNALLDMYAKCGCLNIARNIFDEMPMK

Query:  NVICWTSMISGYINCGDLGEARDLFDRSPVKDVVLWTAMINGYVQFHHFDEAVALFREMQIRNVKPDKFTVVTLLTGCAQLGALEQGKWIHGYLDENRIT
        NVICWTSMISGYINCGDL EARDLFD+SPV+DVVLWTAMINGYVQFHHFD+AVALFREMQI+ VKPDKFTVVTLLTGCAQLGALEQGKWIHGYLDENRIT
Subjt:  NVICWTSMISGYINCGDLGEARDLFDRSPVKDVVLWTAMINGYVQFHHFDEAVALFREMQIRNVKPDKFTVVTLLTGCAQLGALEQGKWIHGYLDENRIT

Query:  MDVVVGTALIEMYSKCGCVDKSLEIFYELEDKDTASWTSIICGLAMNGKTGEALRLFSEMELVGAKPDDITFIGVLSACSHGGLVEEGRRLFNSMKKVYR
        +DVVVGTALIEMYSKCGCVDKSLEIFYELEDKDTASWTSIICGLAMNGKT EALRLFSEMELVGAKPDDITFIGVLSACSHGGLVEEGRR FNSMKKVYR
Subjt:  MDVVVGTALIEMYSKCGCVDKSLEIFYELEDKDTASWTSIICGLAMNGKTGEALRLFSEMELVGAKPDDITFIGVLSACSHGGLVEEGRRLFNSMKKVYR

Query:  IEPKVEHYGCVIDLLGRAGLLDEAEELIQGIPDENGEIVVPLYGALLSACRIHNNVDMGERLANKLVNIESCDSSIHTLLANIYASVDRWEDAKKVRRKM
        IEPKVEHYGCV+DLLGRAGLLDEAEELIQ I  EN EIVV LYGALLSACRIHNNVDMGERLA KLVNIE CDSSIH LLANIYAS DRWEDAKKVRRKM
Subjt:  IEPKVEHYGCVIDLLGRAGLLDEAEELIQGIPDENGEIVVPLYGALLSACRIHNNVDMGERLANKLVNIESCDSSIHTLLANIYASVDRWEDAKKVRRKM

Query:  KELGVKKMPGCSSIEVDGIVHEFLVGDPSHPKMIEICSMLDRVTGQLLGSKESQLEGVMPLYKDTQYCSFVE
        KELGVKKMPGCSSIEVDGIVHEFLVGDPSHP+ IEI SML+RV+ QLLG KESQL   M    DTQ+C+FVE
Subjt:  KELGVKKMPGCSSIEVDGIVHEFLVGDPSHPKMIEICSMLDRVTGQLLGSKESQLEGVMPLYKDTQYCSFVE

XP_031736069.1 pentatricopeptide repeat-containing protein At1g31430 [Cucumis sativus]0.0e+0091.22Show/hide
Query:  MLPLSIIRRVQFISRHFSSSLQLLPIPLRISKLTKKSCIEYLRNCKSMDQLKQIQSQIFRIGLEGDRDTINKLMAFCTDSSLGNLRYAERIFNYIQDPSL
        MLPLSIIRRVQFISRHFSSS  L+P+ LRISKLTKKSCIE LRNCKSMDQLKQIQSQIFRIGLEGDRDTINKLMAFC DSSLGNLRYAE+IFNY+QDPSL
Subjt:  MLPLSIIRRVQFISRHFSSSLQLLPIPLRISKLTKKSCIEYLRNCKSMDQLKQIQSQIFRIGLEGDRDTINKLMAFCTDSSLGNLRYAERIFNYIQDPSL

Query:  FVYNVMVKVYAKRRIFRKVILLFQQLREDELWPDNFTYPFVLKAIGCLRDVRQGEKVHGFVVKTGMDFDNYVCNSLMDMYSELGNVENAKKLFDKMTTKD
        FVYNVMVK+YAKR I RKV+LLFQQLRED LWPD FTYPFVLKAIGCLRDVRQGEKV GF+VKTGMD DNYV NSL+DMY EL NVENAKKLFD+MTT+D
Subjt:  FVYNVMVKVYAKRRIFRKVILLFQQLREDELWPDNFTYPFVLKAIGCLRDVRQGEKVHGFVVKTGMDFDNYVCNSLMDMYSELGNVENAKKLFDKMTTKD

Query:  SVSWNVMIAGYVRCRRFEDAISTFREMQQLSNEEPGEATVVSTLSACTALKNMELGEEIHNYVRKELGFTTIINNALLDMYAKCGCLNIARNIFDEMPMK
        SVSWNVMI+GYVRCRRFEDAI+TFREMQQ  NE+P EATVVSTLSACTALKN+ELG+EIHNYVRKELGFTT I+NALLDMYAKCGCLNIARNIFDEM MK
Subjt:  SVSWNVMIAGYVRCRRFEDAISTFREMQQLSNEEPGEATVVSTLSACTALKNMELGEEIHNYVRKELGFTTIINNALLDMYAKCGCLNIARNIFDEMPMK

Query:  NVICWTSMISGYINCGDLGEARDLFDRSPVKDVVLWTAMINGYVQFHHFDEAVALFREMQIRNVKPDKFTVVTLLTGCAQLGALEQGKWIHGYLDENRIT
        NVICWTSMISGYINCGDL EARDLFD+SPV+DVVLWTAMINGYVQFHHFD+AVALFREMQI+ VKPDKFTVVTLLTGCAQLGALEQGKWIHGYLDENRIT
Subjt:  NVICWTSMISGYINCGDLGEARDLFDRSPVKDVVLWTAMINGYVQFHHFDEAVALFREMQIRNVKPDKFTVVTLLTGCAQLGALEQGKWIHGYLDENRIT

Query:  MDVVVGTALIEMYSKCGCVDKSLEIFYELEDKDTASWTSIICGLAMNGKTGEALRLFSEMELVGAKPDDITFIGVLSACSHGGLVEEGRRLFNSMKKVYR
        MDVVVGTALIEMYSKCGCVDKSLEIFYELEDKDTASWTSIICGLAMNGKT EALRLFSEME VGAKPDDITFIGVLSACSHGGLVEEGRR FNSMKKV+R
Subjt:  MDVVVGTALIEMYSKCGCVDKSLEIFYELEDKDTASWTSIICGLAMNGKTGEALRLFSEMELVGAKPDDITFIGVLSACSHGGLVEEGRRLFNSMKKVYR

Query:  IEPKVEHYGCVIDLLGRAGLLDEAEELIQGIPDENGEIVVPLYGALLSACRIHNNVDMGERLANKLVNIESCDSSIHTLLANIYASVDRWEDAKKVRRKM
        IEPKVEHYGCVIDLLGRAGLLDEAEELIQ IP EN EIVVPLYGALLSACRIHNNVDMGERLA KL NIESCDSSIHTLLANIYASVDRWEDAKKVRRKM
Subjt:  IEPKVEHYGCVIDLLGRAGLLDEAEELIQGIPDENGEIVVPLYGALLSACRIHNNVDMGERLANKLVNIESCDSSIHTLLANIYASVDRWEDAKKVRRKM

Query:  KELGVKKMPGCSSIEVDGIVHEFLVGDPSHPKMIEICSMLDRVTGQLLGSKESQLEGVMPLYKDTQYCSFVE
        KELGVKKMPGCS IEVDGIVHEFLVGDPSHP+M+EICSML+RVTGQLLG KESQ+E VMPLYKDTQ+C+FVE
Subjt:  KELGVKKMPGCSSIEVDGIVHEFLVGDPSHPKMIEICSMLDRVTGQLLGSKESQLEGVMPLYKDTQYCSFVE

XP_038880251.1 pentatricopeptide repeat-containing protein At1g31430 [Benincasa hispida]0.0e+0093.31Show/hide
Query:  MLPLSIIRRVQFISRHFSSSLQLLPIPLRISKLTKKSCIEYLRNCKSMDQLKQIQSQIFRIGLEGDRDTINKLMAFCTDSSLGNLRYAERIFNYIQDPSL
        MLPLS+IRRVQFISRHFSSSL L+PIPLRISKLTKKSCIEYLRNCKSMDQLKQIQSQIFRIGLEGD+DTI+K M FC DSSLGNL YAER+F+YIQ+PSL
Subjt:  MLPLSIIRRVQFISRHFSSSLQLLPIPLRISKLTKKSCIEYLRNCKSMDQLKQIQSQIFRIGLEGDRDTINKLMAFCTDSSLGNLRYAERIFNYIQDPSL

Query:  FVYNVMVKVYAKRRIFRKVILLFQQLREDELWPDNFTYPFVLKAIGCLRDVRQGEKVHGFVVKTGMDFDNYVCNSLMDMYSELGNVENAKKLFDKMTTKD
         VYNVMVKVYAKR IF KVILLFQQLREDELWPDNFTYPFVLKAIGCLRDVRQGEKVHGFVVKTGMDFDNYVCNSLMDMYS+LGNVENAKKLFDKMTT+D
Subjt:  FVYNVMVKVYAKRRIFRKVILLFQQLREDELWPDNFTYPFVLKAIGCLRDVRQGEKVHGFVVKTGMDFDNYVCNSLMDMYSELGNVENAKKLFDKMTTKD

Query:  SVSWNVMIAGYVRCRRFEDAISTFREMQQLSNEEPGEATVVSTLSACTALKNMELGEEIHNYVRKELGFTTIINNALLDMYAKCGCLNIARNIFDEMPMK
        SVSWNVMI+GYVRCRRFEDAI TFREMQQ SNE+P EATVVSTLSAC ALKN+ELGEEIH+YVRKELGFTTIINNALLDMYAKCGCLNIARNIFDEMP+K
Subjt:  SVSWNVMIAGYVRCRRFEDAISTFREMQQLSNEEPGEATVVSTLSACTALKNMELGEEIHNYVRKELGFTTIINNALLDMYAKCGCLNIARNIFDEMPMK

Query:  NVICWTSMISGYINCGDLGEARDLFDRSPVKDVVLWTAMINGYVQFHHFDEAVALFREMQIRNVKPDKFTVVTLLTGCAQLGALEQGKWIHGYLDENRIT
        NVICWTSMISGYINCGDL EARDLFDRSPVKDVVLWTAMINGYVQFHHFDEAVALFREMQIR VKPDKFTVVTLLTGCAQLGALEQGKWIHGYLDENRIT
Subjt:  NVICWTSMISGYINCGDLGEARDLFDRSPVKDVVLWTAMINGYVQFHHFDEAVALFREMQIRNVKPDKFTVVTLLTGCAQLGALEQGKWIHGYLDENRIT

Query:  MDVVVGTALIEMYSKCGCVDKSLEIFYELEDKDTASWTSIICGLAMNGKTGEALRLFSEMELVGAKPDDITFIGVLSACSHGGLVEEGRRLFNSMKKVYR
        MDVVVGTALIEMYSKCGCVDKSLEIFY+L++KDTASWTSIICGLAMNGKTGEALRLFSEME VGAKPDDITFIGVLSACSHGGLVEEGR  FN MKKVYR
Subjt:  MDVVVGTALIEMYSKCGCVDKSLEIFYELEDKDTASWTSIICGLAMNGKTGEALRLFSEMELVGAKPDDITFIGVLSACSHGGLVEEGRRLFNSMKKVYR

Query:  IEPKVEHYGCVIDLLGRAGLLDEAEELIQGIPDENGEIVVPLYGALLSACRIHNNVDMGERLANKLVNIESCDSSIHTLLANIYASVDRWEDAKKVRRKM
        IEPKVEHYGCVIDLLGRAGLLDEAEELIQGIP+ENGEIVVPLYGALLSAC+IHNNVDMGERLA KLV+IESCDSSIHTLLA+IYASVDRWEDAKKVRRKM
Subjt:  IEPKVEHYGCVIDLLGRAGLLDEAEELIQGIPDENGEIVVPLYGALLSACRIHNNVDMGERLANKLVNIESCDSSIHTLLANIYASVDRWEDAKKVRRKM

Query:  KELGVKKMPGCSSIEVDGIVHEFLVGDPSHPKMIEICSMLDRVTGQLLGSKESQLEGVMPLYKDTQYCSFVEF
        KELGVKKMPGCSSIEVDGIVHEFLVGDPSHP+MIEICSMLD V GQLLGSKESQLE VMPLYKDT+YCSFVEF
Subjt:  KELGVKKMPGCSSIEVDGIVHEFLVGDPSHPKMIEICSMLDRVTGQLLGSKESQLEGVMPLYKDTQYCSFVEF

TrEMBL top hitse value%identityAlignment
A0A0A0LWP8 Uncharacterized protein0.0e+0090.94Show/hide
Query:  KHFDCACAFSEVFQFFKLSQSLHMLPLSIIRRVQFISRHFSSSLQLLPIPLRISKLTKKSCIEYLRNCKSMDQLKQIQSQIFRIGLEGDRDTINKLMAFC
        KHFDCA AFSE+FQFFK SQS HMLPLSIIRRVQFISRHFSSS  L+P+ LRISKLTKKSCIE LRNCKSMDQLKQIQSQIFRIGLEGDRDTINKLMAFC
Subjt:  KHFDCACAFSEVFQFFKLSQSLHMLPLSIIRRVQFISRHFSSSLQLLPIPLRISKLTKKSCIEYLRNCKSMDQLKQIQSQIFRIGLEGDRDTINKLMAFC

Query:  TDSSLGNLRYAERIFNYIQDPSLFVYNVMVKVYAKRRIFRKVILLFQQLREDELWPDNFTYPFVLKAIGCLRDVRQGEKVHGFVVKTGMDFDNYVCNSLM
         DSSLGNLRYAE+IFNY+QDPSLFVYNVMVK+YAKR I RKV+LLFQQLRED LWPD FTYPFVLKAIGCLRDVRQGEKV GF+VKTGMD DNYV NSL+
Subjt:  TDSSLGNLRYAERIFNYIQDPSLFVYNVMVKVYAKRRIFRKVILLFQQLREDELWPDNFTYPFVLKAIGCLRDVRQGEKVHGFVVKTGMDFDNYVCNSLM

Query:  DMYSELGNVENAKKLFDKMTTKDSVSWNVMIAGYVRCRRFEDAISTFREMQQLSNEEPGEATVVSTLSACTALKNMELGEEIHNYVRKELGFTTIINNAL
        DMY EL NVENAKKLFD+MTT+DSVSWNVMI+GYVRCRRFEDAI+TFREMQQ  NE+P EATVVSTLSACTALKN+ELG+EIHNYVRKELGFTT I+NAL
Subjt:  DMYSELGNVENAKKLFDKMTTKDSVSWNVMIAGYVRCRRFEDAISTFREMQQLSNEEPGEATVVSTLSACTALKNMELGEEIHNYVRKELGFTTIINNAL

Query:  LDMYAKCGCLNIARNIFDEMPMKNVICWTSMISGYINCGDLGEARDLFDRSPVKDVVLWTAMINGYVQFHHFDEAVALFREMQIRNVKPDKFTVVTLLTG
        LDMYAKCGCLNIARNIFDEM MKNVICWTSMISGYINCGDL EARDLFD+SPV+DVVLWTAMINGYVQFHHFD+AVALFREMQI+ VKPDKFTVVTLLTG
Subjt:  LDMYAKCGCLNIARNIFDEMPMKNVICWTSMISGYINCGDLGEARDLFDRSPVKDVVLWTAMINGYVQFHHFDEAVALFREMQIRNVKPDKFTVVTLLTG

Query:  CAQLGALEQGKWIHGYLDENRITMDVVVGTALIEMYSKCGCVDKSLEIFYELEDKDTASWTSIICGLAMNGKTGEALRLFSEMELVGAKPDDITFIGVLS
        CAQLGALEQGKWIHGYLDENRITMDVVVGTALIEMYSKCGCVDKSLEIFYELEDKDTASWTSIICGLAMNGKT EALRLFSEME VGAKPDDITFIGVLS
Subjt:  CAQLGALEQGKWIHGYLDENRITMDVVVGTALIEMYSKCGCVDKSLEIFYELEDKDTASWTSIICGLAMNGKTGEALRLFSEMELVGAKPDDITFIGVLS

Query:  ACSHGGLVEEGRRLFNSMKKVYRIEPKVEHYGCVIDLLGRAGLLDEAEELIQGIPDENGEIVVPLYGALLSACRIHNNVDMGERLANKLVNIESCDSSIH
        ACSHGGLVEEGRR FNSMKKV+RIEPKVEHYGCVIDLLGRAGLLDEAEELIQ IP EN EIVVPLYGALLSACRIHNNVDMGERLA KL NIESCDSSIH
Subjt:  ACSHGGLVEEGRRLFNSMKKVYRIEPKVEHYGCVIDLLGRAGLLDEAEELIQGIPDENGEIVVPLYGALLSACRIHNNVDMGERLANKLVNIESCDSSIH

Query:  TLLANIYASVDRWEDAKKVRRKMKELGVKKMPGCSSIEVDGIVHEFLVGDPSHPKMIEICSMLDRVTGQLLGSKESQLEGVMPLYKDTQYCSFVE
        TLLANIYASVDRWEDAKKVRRKMKELGVKKMPGCS IEVDGIVHEFLVGDPSHP+M+EICSML+RVTGQLLG KESQ+E VMPLYKDTQ+C+FVE
Subjt:  TLLANIYASVDRWEDAKKVRRKMKELGVKKMPGCSSIEVDGIVHEFLVGDPSHPKMIEICSMLDRVTGQLLGSKESQLEGVMPLYKDTQYCSFVE

A0A1S3CPD2 pentatricopeptide repeat-containing protein At1g314300.0e+0088.84Show/hide
Query:  MLPLSIIRRVQFISRHFSSSLQLLPIPLRISKLTKKSCIEYLRNCKSMDQLKQIQSQIFRIGLEGDRDTINKLMAFCTDSSLGNLRYAERIFNYIQDPSL
        MLPLSIIRRVQFISRHFSSS  L+ +P+RISK TKKSCIEYLRNCKSM+QLK+IQSQIFRIGLEGDRD INKLMAFC D SLGNLRYAE+IF+Y+QDPSL
Subjt:  MLPLSIIRRVQFISRHFSSSLQLLPIPLRISKLTKKSCIEYLRNCKSMDQLKQIQSQIFRIGLEGDRDTINKLMAFCTDSSLGNLRYAERIFNYIQDPSL

Query:  FVYNVMVKVYAKRRIFRKVILLFQQLREDELWPDNFTYPFVLKAIGCLRDVRQGEKVHGFVVKTGMDFDNYVCNSLMDMYSELGNVENAKKLFDKMTTKD
        FVYNVMVK+YAKR + RKV+LLFQQLRED+LWPDNFTYPFVLKAIGCLRDV QGEK+HGFVVKTGM+ DNYVCNSLMDMYSELGNVENAKKLFD+MTT+D
Subjt:  FVYNVMVKVYAKRRIFRKVILLFQQLREDELWPDNFTYPFVLKAIGCLRDVRQGEKVHGFVVKTGMDFDNYVCNSLMDMYSELGNVENAKKLFDKMTTKD

Query:  SVSWNVMIAGYVRCRRFEDAISTFREMQQLSNEEPGEATVVSTLSACTALKNMELGEEIHNYVRKELGFTTIINNALLDMYAKCGCLNIARNIFDEMPMK
        SVSWNVMI+GYV CRRFEDAI+TFREMQQ  NE+P EATVVSTLSACTALKN+ELG+EIHNYVRKELGFT  I+NALLDMYAKCGCLNI+RNIFDEMPMK
Subjt:  SVSWNVMIAGYVRCRRFEDAISTFREMQQLSNEEPGEATVVSTLSACTALKNMELGEEIHNYVRKELGFTTIINNALLDMYAKCGCLNIARNIFDEMPMK

Query:  NVICWTSMISGYINCGDLGEARDLFDRSPVKDVVLWTAMINGYVQFHHFDEAVALFREMQIRNVKPDKFTVVTLLTGCAQLGALEQGKWIHGYLDENRIT
        NVICWTSMISGYINCGDL EARDLFD+SPV+DVVLWTAMINGYVQFHHFD+AVALFREMQI+ VKPDKFTVVTLLTGCAQLGALEQGKWIHGYLDENRIT
Subjt:  NVICWTSMISGYINCGDLGEARDLFDRSPVKDVVLWTAMINGYVQFHHFDEAVALFREMQIRNVKPDKFTVVTLLTGCAQLGALEQGKWIHGYLDENRIT

Query:  MDVVVGTALIEMYSKCGCVDKSLEIFYELEDKDTASWTSIICGLAMNGKTGEALRLFSEMELVGAKPDDITFIGVLSACSHGGLVEEGRRLFNSMKKVYR
        +DVVVGTALIEMYSKCGCVDKSLEIFYELEDKDTASWTSIICGLAMNGKT EALRLFSEMELVGAKPDDITFIGVLSACSHGGLVEEGRR FNSMKKVYR
Subjt:  MDVVVGTALIEMYSKCGCVDKSLEIFYELEDKDTASWTSIICGLAMNGKTGEALRLFSEMELVGAKPDDITFIGVLSACSHGGLVEEGRRLFNSMKKVYR

Query:  IEPKVEHYGCVIDLLGRAGLLDEAEELIQGIPDENGEIVVPLYGALLSACRIHNNVDMGERLANKLVNIESCDSSIHTLLANIYASVDRWEDAKKVRRKM
        IEPKVEHYGCV+DLLGRAGLLDEAEELIQ I  EN EIVV LYGALLSACRIHNNVDMGERLA KLVNIE CDSSIH LLANIYAS DRWEDAKKVRRKM
Subjt:  IEPKVEHYGCVIDLLGRAGLLDEAEELIQGIPDENGEIVVPLYGALLSACRIHNNVDMGERLANKLVNIESCDSSIHTLLANIYASVDRWEDAKKVRRKM

Query:  KELGVKKMPGCSSIEVDGIVHEFLVGDPSHPKMIEICSMLDRVTGQLLGSKESQLEGVMPLYKDTQYCSFVE
        KELGVKKMPGCSSIEVDGIVHEFLVGDPSHP+ IEI SML+RV+ QLLG KESQL   M    DTQ+C+FVE
Subjt:  KELGVKKMPGCSSIEVDGIVHEFLVGDPSHPKMIEICSMLDRVTGQLLGSKESQLEGVMPLYKDTQYCSFVE

A0A5A7TXP7 Pentatricopeptide repeat-containing protein0.0e+0088.41Show/hide
Query:  SFEQKHFDCACAFSEVFQFFKLSQSLHMLPLSIIRRVQFISRHFSSSLQLLPIPLRISKLTKKSCIEYLRNCKSMDQLKQIQSQIFRIGLEGDRDTINKL
        SFE KHFDCACA S++FQFFK SQS HMLPLSIIRRVQFISRHFSSS  L+ +P+RISK TKKSCIEYLRNCKSM+QLK+IQSQIFRIGLEGDRD INKL
Subjt:  SFEQKHFDCACAFSEVFQFFKLSQSLHMLPLSIIRRVQFISRHFSSSLQLLPIPLRISKLTKKSCIEYLRNCKSMDQLKQIQSQIFRIGLEGDRDTINKL

Query:  MAFCTDSSLGNLRYAERIFNYIQDPSLFVYNVMVKVYAKRRIFRKVILLFQQLREDELWPDNFTYPFVLKAIGCLRDVRQGEKVHGFVVKTGMDFDNYVC
        MAFC D SLGNLRYAE+IF+Y+QDPSLFVYNVMVK+YAKR + RKV+LLFQQLRED+LWPDNFTYPFVLKAIGCLRDV QGEK+HGFVVKTGM+ DNYVC
Subjt:  MAFCTDSSLGNLRYAERIFNYIQDPSLFVYNVMVKVYAKRRIFRKVILLFQQLREDELWPDNFTYPFVLKAIGCLRDVRQGEKVHGFVVKTGMDFDNYVC

Query:  NSLMDMYSELGNVENAKKLFDKMTTKDSVSWNVMIAGYVRCRRFEDAISTFREMQQLSNEEPGEATVVSTLSACTALKNMELGEEIHNYVRKELGFTTII
        NSLMDMYSELGNVENAKKLFD+MTT+DSVSWNVMI+GYV CRRFEDAI+TFREMQQ  NE+P EATVVSTLSACTALKN+ELG+EIHNYVRKELGFT  I
Subjt:  NSLMDMYSELGNVENAKKLFDKMTTKDSVSWNVMIAGYVRCRRFEDAISTFREMQQLSNEEPGEATVVSTLSACTALKNMELGEEIHNYVRKELGFTTII

Query:  NNALLDMYAKCGCLNIARNIFDEMPMKNVICWTSMISGYINCGDLGEARDLFDRSPVKDVVLWTAMINGYVQFHHFDEAVALFREMQIRNVKPDKFTVVT
        +NALLDMYAKCGCLNI+RNIFDEMPMKNVICWTSMISGYINCGDL EARDLFD+SPV+DVVLWTAMINGYVQFHHFD+AVALFREMQI+ VKPDKFTVVT
Subjt:  NNALLDMYAKCGCLNIARNIFDEMPMKNVICWTSMISGYINCGDLGEARDLFDRSPVKDVVLWTAMINGYVQFHHFDEAVALFREMQIRNVKPDKFTVVT

Query:  LLTGCAQLGALEQGKWIHGYLDENRITMDVVVGTALIEMYSKCGCVDKSLEIFYELEDKDTASWTSIICGLAMNGKTGEALRLFSEMELVGAKPDDITFI
        LLTGCAQLGALEQGKWIHGYLDENRIT+DVVVGTALIEMYSKCGCVDKSLEIFYELEDKDTASWTSIICGLAMNGKT EALRLFSEMELVGAKPDDITFI
Subjt:  LLTGCAQLGALEQGKWIHGYLDENRITMDVVVGTALIEMYSKCGCVDKSLEIFYELEDKDTASWTSIICGLAMNGKTGEALRLFSEMELVGAKPDDITFI

Query:  GVLSACSHGGLVEEGRRLFNSMKKVYRIEPKVEHYGCVIDLLGRAGLLDEAEELIQGIPDENGEIVVPLYGALLSACRIHNNVDMGERLANKLVNIESCD
        GVLSACSHGGLVEEGRR FNSMKKVYRIEPKVEHYGCV+DLLGRAGLLDEAEELIQ I  EN EIVV LYGALLSACRIHNNVDMGERLA KLVNIE CD
Subjt:  GVLSACSHGGLVEEGRRLFNSMKKVYRIEPKVEHYGCVIDLLGRAGLLDEAEELIQGIPDENGEIVVPLYGALLSACRIHNNVDMGERLANKLVNIESCD

Query:  SSIHTLLANIYASVDRWEDAKKVRRKMKELGVKKMPGCSSIEVDGIVHEFLVGDPSHPKMIEICSMLDRVTGQLLGSKESQLEGVMPLYKDTQYCSFVE
        SSIH LLANIYAS DRWEDAKKVRRKMKELGVKKMPGCSSIEVDGIVHEFLVGDPSHP+ IEI SML+RV+ QLLG KESQL   M    DTQ+C+FVE
Subjt:  SSIHTLLANIYASVDRWEDAKKVRRKMKELGVKKMPGCSSIEVDGIVHEFLVGDPSHPKMIEICSMLDRVTGQLLGSKESQLEGVMPLYKDTQYCSFVE

A0A5D3CSK6 Pentatricopeptide repeat-containing protein0.0e+0088.41Show/hide
Query:  SFEQKHFDCACAFSEVFQFFKLSQSLHMLPLSIIRRVQFISRHFSSSLQLLPIPLRISKLTKKSCIEYLRNCKSMDQLKQIQSQIFRIGLEGDRDTINKL
        SFE KHFDCACA S++FQFFK SQS HMLPLSIIRRVQFISRHFSSS  L+ +P+RISK TKKSCIEYLRNCKSM+QLK+IQSQIFRIGLEGDRD INKL
Subjt:  SFEQKHFDCACAFSEVFQFFKLSQSLHMLPLSIIRRVQFISRHFSSSLQLLPIPLRISKLTKKSCIEYLRNCKSMDQLKQIQSQIFRIGLEGDRDTINKL

Query:  MAFCTDSSLGNLRYAERIFNYIQDPSLFVYNVMVKVYAKRRIFRKVILLFQQLREDELWPDNFTYPFVLKAIGCLRDVRQGEKVHGFVVKTGMDFDNYVC
        MAFC D SLGNLRYAE+IF+Y+QDPSLFVYNVMVK+YAKR + RKV+LLFQQLRED+LWPDNFTYPFVLKAIGCLRDV QGEK+HGFVVKTGM+ DNYVC
Subjt:  MAFCTDSSLGNLRYAERIFNYIQDPSLFVYNVMVKVYAKRRIFRKVILLFQQLREDELWPDNFTYPFVLKAIGCLRDVRQGEKVHGFVVKTGMDFDNYVC

Query:  NSLMDMYSELGNVENAKKLFDKMTTKDSVSWNVMIAGYVRCRRFEDAISTFREMQQLSNEEPGEATVVSTLSACTALKNMELGEEIHNYVRKELGFTTII
        NSLMDMYSELGNVENAKKLFD+MTT+DSVSWNVMI+GYV CRRFEDAI+TFREMQQ  NE+P EATVVSTLSACTALKN+ELG+EIHNYVRKELGFT  I
Subjt:  NSLMDMYSELGNVENAKKLFDKMTTKDSVSWNVMIAGYVRCRRFEDAISTFREMQQLSNEEPGEATVVSTLSACTALKNMELGEEIHNYVRKELGFTTII

Query:  NNALLDMYAKCGCLNIARNIFDEMPMKNVICWTSMISGYINCGDLGEARDLFDRSPVKDVVLWTAMINGYVQFHHFDEAVALFREMQIRNVKPDKFTVVT
        +NALLDMYAKCGCLNI+RNIFDEMPMKNVICWTSMISGYINCGDL EARDLFD+SPV+DVVLWTAMINGYVQFHHFD+AVALFREMQI+ VKPDKFTVVT
Subjt:  NNALLDMYAKCGCLNIARNIFDEMPMKNVICWTSMISGYINCGDLGEARDLFDRSPVKDVVLWTAMINGYVQFHHFDEAVALFREMQIRNVKPDKFTVVT

Query:  LLTGCAQLGALEQGKWIHGYLDENRITMDVVVGTALIEMYSKCGCVDKSLEIFYELEDKDTASWTSIICGLAMNGKTGEALRLFSEMELVGAKPDDITFI
        LLTGCAQLGALEQGKWIHGYLDENRIT+DVVVGTALIEMYSKCGCVDKSLEIFYELEDKDTASWTSIICGLAMNGKT EALRLFSEMELVGAKPDDITFI
Subjt:  LLTGCAQLGALEQGKWIHGYLDENRITMDVVVGTALIEMYSKCGCVDKSLEIFYELEDKDTASWTSIICGLAMNGKTGEALRLFSEMELVGAKPDDITFI

Query:  GVLSACSHGGLVEEGRRLFNSMKKVYRIEPKVEHYGCVIDLLGRAGLLDEAEELIQGIPDENGEIVVPLYGALLSACRIHNNVDMGERLANKLVNIESCD
        GVLSACSHGGLVEEGRR FNSMKKVYRIEPKVEHYGCV+DLLGRAGLLDEAEELIQ I  EN EIVV LYGALLSACRIHNNVDMGERLA KLVNIE CD
Subjt:  GVLSACSHGGLVEEGRRLFNSMKKVYRIEPKVEHYGCVIDLLGRAGLLDEAEELIQGIPDENGEIVVPLYGALLSACRIHNNVDMGERLANKLVNIESCD

Query:  SSIHTLLANIYASVDRWEDAKKVRRKMKELGVKKMPGCSSIEVDGIVHEFLVGDPSHPKMIEICSMLDRVTGQLLGSKESQLEGVMPLYKDTQYCSFVE
        SSIH LLANIYAS DRWEDAKKVRRKMKELGVKKMPGCSSIEVDGIVHEFLVGDPSHP+ IEI SML+RV+ QLLG KESQL   M    DTQ+C+FVE
Subjt:  SSIHTLLANIYASVDRWEDAKKVRRKMKELGVKKMPGCSSIEVDGIVHEFLVGDPSHPKMIEICSMLDRVTGQLLGSKESQLEGVMPLYKDTQYCSFVE

A0A6J1CTE5 pentatricopeptide repeat-containing protein At1g314300.0e+0086.22Show/hide
Query:  MLPLSIIRRVQFISRHFSSSLQLLPIPLRISKLTKKSCIEYLRNCKSMDQLKQIQSQIFRIGLEGDRDTINKLMAFCTDSSLGNLRYAERIFNYIQDPSL
        MLPLSI+RR++ ISRHFSS+L  +PI     KLTKKSCI+YLRNCKSMDQ KQIQ+ IFRIGLE DRDT+NK MAFC D SLGNLRYAER+F+YIQ+P L
Subjt:  MLPLSIIRRVQFISRHFSSSLQLLPIPLRISKLTKKSCIEYLRNCKSMDQLKQIQSQIFRIGLEGDRDTINKLMAFCTDSSLGNLRYAERIFNYIQDPSL

Query:  FVYNVMVKVYAKRRIFRKVILLFQQLREDELWPDNFTYPFVLKAIGCLRDVRQGEKVHGFVVKTGMDFDNYVCNSLMDMYSELGNVENAKKLFDKMTTKD
        FVYNVMVK YAKR IFRKVILLFQ+L+ED LWPDNFTYPFVLKAIGCLRD+  GEK+HGFVVKTGMDFDNYVCNSLMDMY+ELG +E A+KLFD+M TKD
Subjt:  FVYNVMVKVYAKRRIFRKVILLFQQLREDELWPDNFTYPFVLKAIGCLRDVRQGEKVHGFVVKTGMDFDNYVCNSLMDMYSELGNVENAKKLFDKMTTKD

Query:  SVSWNVMIAGYVRCRRFEDAISTFREMQQLSNEEPGEATVVSTLSACTALKNMELGEEIHNYVRKELGFTTIINNALLDMYAKCGCLNIARNIFDEMPMK
        SVSWNV+I+GYVRCRRFEDAI+TF+EMQ+ SN +PGEA +VSTLSACTALKN+ELGEEIHNYVR ELGFTTIINNALLDMYAKCG LNIAR +FDEMPMK
Subjt:  SVSWNVMIAGYVRCRRFEDAISTFREMQQLSNEEPGEATVVSTLSACTALKNMELGEEIHNYVRKELGFTTIINNALLDMYAKCGCLNIARNIFDEMPMK

Query:  NVICWTSMISGYINCGDLGEARDLFDRSPVKDVVLWTAMINGYVQFHHFDEAVALFREMQIRNVKPDKFTVVTLLTGCAQLGALEQGKWIHGYLDENRIT
        NVICWTSMISGYINCGDL EARDLFD SPV+DVVLWTAMINGYVQF+HFDEAVALFR+MQI+ VKPDKFTVV LLTGCAQLGALEQGKW+HGYLDENRIT
Subjt:  NVICWTSMISGYINCGDLGEARDLFDRSPVKDVVLWTAMINGYVQFHHFDEAVALFREMQIRNVKPDKFTVVTLLTGCAQLGALEQGKWIHGYLDENRIT

Query:  MDVVVGTALIEMYSKCGCVDKSLEIFYELEDKDTASWTSIICGLAMNGKTGEALRLFSEMELVGAKPDDITFIGVLSACSHGGLVEEGRRLFNSMKKVYR
        MD VVGTALIEMYSKCGCV+KSLEIFYELE+KDTASWTSIICGLAMNGKTGEALRLFSEMELVGAKPDDITFIGVLSACSHGGLVEEGRR FN MKKVYR
Subjt:  MDVVVGTALIEMYSKCGCVDKSLEIFYELEDKDTASWTSIICGLAMNGKTGEALRLFSEMELVGAKPDDITFIGVLSACSHGGLVEEGRRLFNSMKKVYR

Query:  IEPKVEHYGCVIDLLGRAGLLDEAEELIQGIPDENGEIVVPLYGALLSACRIHNNVDMGERLANKLVNIESCDSSIHTLLANIYASVDRWEDAKKVRRKM
        IEPKVEHYGCVIDLLGRAGLLDEAEELIQ IP+EN +I+VPLYGALLSACRIHNNVDMGERL+ KLVN ESCDSSIHTLLANIYAS DRWEDAKKVRRKM
Subjt:  IEPKVEHYGCVIDLLGRAGLLDEAEELIQGIPDENGEIVVPLYGALLSACRIHNNVDMGERLANKLVNIESCDSSIHTLLANIYASVDRWEDAKKVRRKM

Query:  KELGVKKMPGCSSIEVDGIVHEFLVGDPSHPKMIEICSMLDRVTGQLLGSKESQ--LEGVMPLYKDTQYCSFVEF
        KELGVKKMPGCSSIEVDGIVHEFLVGDPSHP+MIEICSMLDRVTG LLGSKE Q   E V+PL  DTQ+ S VEF
Subjt:  KELGVKKMPGCSSIEVDGIVHEFLVGDPSHPKMIEICSMLDRVTGQLLGSKESQ--LEGVMPLYKDTQYCSFVEF

SwissProt top hitse value%identityAlignment
O82380 Pentatricopeptide repeat-containing protein At2g29760, chloroplastic5.3e-13336.71Show/hide
Query:  IEYLRNCKSMDQLKQIQSQIFRIGLEGDRDTINKLMAFCTDSSLGNLRYAERIFNYIQDPSLFVYNVMVKVYAK-RRIFRKVILLFQQLREDELWPDNFT
        I  +  C S+ QLKQ    + R G   D  + +KL A    SS  +L YA ++F+ I  P+ F +N +++ YA        +      + E + +P+ +T
Subjt:  IEYLRNCKSMDQLKQIQSQIFRIGLEGDRDTINKLMAFCTDSSLGNLRYAERIFNYIQDPSLFVYNVMVKVYAK-RRIFRKVILLFQQLREDELWPDNFT

Query:  YPFVLKAIGCLRDVRQGEKVHGFVVKTGMDFDNYVCNSLMDMYSELGNVENAKKLFDKMTTKDSVSWNVMIAGYVRCRRFEDAISTFREMQQLSNEEPGE
        +PF++KA   +  +  G+ +HG  VK+ +  D +V NSL+  Y   G++++A K+F  +  KD VSWN MI G+V+    + A+  F++M+   + +   
Subjt:  YPFVLKAIGCLRDVRQGEKVHGFVVKTGMDFDNYVCNSLMDMYSELGNVENAKKLFDKMTTKDSVSWNVMIAGYVRCRRFEDAISTFREMQQLSNEEPGE

Query:  ATVVSTLSACTALKNMELGEEIHNYVRK-ELGFTTIINNALLDMYAKCGCLNIARNIFDEMPMKNVICWTSMISGYINCGDLGEARDLFDRSPVKDVVLW
         T+V  LSAC  ++N+E G ++ +Y+ +  +     + NA+LDMY KCG +  A+ +FD M  K+ + WT+M+ GY    D   AR++ +  P KD+V W
Subjt:  ATVVSTLSACTALKNMELGEEIHNYVRK-ELGFTTIINNALLDMYAKCGCLNIARNIFDEMPMKNVICWTSMISGYINCGDLGEARDLFDRSPVKDVVLW

Query:  TAMINGYVQFHHFDEAVALFREMQI-RNVKPDKFTVVTLLTGCAQLGALEQGKWIHGYLDENRITMDVVVGTALIEMYSKCGCVDKSLEIFYELEDKDTA
         A+I+ Y Q    +EA+ +F E+Q+ +N+K ++ T+V+ L+ CAQ+GALE G+WIH Y+ ++ I M+  V +ALI MYSKCG ++KS E+F  +E +D  
Subjt:  TAMINGYVQFHHFDEAVALFREMQI-RNVKPDKFTVVTLLTGCAQLGALEQGKWIHGYLDENRITMDVVVGTALIEMYSKCGCVDKSLEIFYELEDKDTA

Query:  SWTSIICGLAMNGKTGEALRLFSEMELVGAKPDDITFIGVLSACSHGGLVEEGRRLFNSMKKVYRIEPKVEHYGCVIDLLGRAGLLDEAEELIQGIPDEN
         W+++I GLAM+G   EA+ +F +M+    KP+ +TF  V  ACSH GLV+E   LF+ M+  Y I P+ +HY C++D+LGR+G L++A + I+ +P   
Subjt:  SWTSIICGLAMNGKTGEALRLFSEMELVGAKPDDITFIGVLSACSHGGLVEEGRRLFNSMKKVYRIEPKVEHYGCVIDLLGRAGLLDEAEELIQGIPDEN

Query:  GEIVVPLYGALLSACRIHNNVDMGERLANKLVNIESCDSSIHTLLANIYASVDRWEDAKKVRRKMKELGVKKMPGCSSIEVDGIVHEFLVGDPSHPKMIE
           V   +GALL AC+IH N+++ E    +L+ +E  +   H LL+NIYA + +WE+  ++R+ M+  G+KK PGCSSIE+DG++HEFL GD +HP   +
Subjt:  GEIVVPLYGALLSACRIHNNVDMGERLANKLVNIESCDSSIHTLLANIYASVDRWEDAKKVRRKMKELGVKKMPGCSSIEVDGIVHEFLVGDPSHPKMIE

Query:  ICSMLDRVTGQLLGS-KESQLEGVMPLYKDTQ
        +   L  V  +L  +  E ++  V+ + ++ +
Subjt:  ICSMLDRVTGQLLGS-KESQLEGVMPLYKDTQ

Q9C866 Pentatricopeptide repeat-containing protein At1g314309.4e-19958.69Show/hide
Query:  IQDPSLFVYNVMVKVYAKRRIFRKVILLFQQLREDELWPDNFTYPFVLKAIGCLRDVRQGEKVHGFVVKTGMDFDNYVCNSLMDMYSELGNVENAKKLFD
        +Q PSL +YN M+K  A  + F KV+ LF +LR   L+PDNFT P VLK+IG LR V +GEKVHG+ VK G++FD+YV NSLM MY+ LG +E   K+FD
Subjt:  IQDPSLFVYNVMVKVYAKRRIFRKVILLFQQLREDELWPDNFTYPFVLKAIGCLRDVRQGEKVHGFVVKTGMDFDNYVCNSLMDMYSELGNVENAKKLFD

Query:  KMTTKDSVSWNVMIAGYVRCRRFEDAISTFREMQQLSNEEPGEATVVSTLSACTALKNMELGEEIHNYVRKELGFTTIINNALLDMYAKCGCLNIARNIF
        +M  +D VSWN +I+ YV   RFEDAI  F+ M Q SN +  E T+VSTLSAC+ALKN+E+GE I+ +V  E   +  I NAL+DM+ KCGCL+ AR +F
Subjt:  KMTTKDSVSWNVMIAGYVRCRRFEDAISTFREMQQLSNEEPGEATVVSTLSACTALKNMELGEEIHNYVRKELGFTTIINNALLDMYAKCGCLNIARNIF

Query:  DEMPMKNVICWTSMISGYINCGDLGEARDLFDRSPVKDVVLWTAMINGYVQFHHFDEAVALFREMQIRNVKPDKFTVVTLLTGCAQLGALEQGKWIHGYL
        D M  KNV CWTSM+ GY++ G + EAR LF+RSPVKDVVLWTAM+NGYVQF+ FDEA+ LFR MQ   ++PD F +V+LLTGCAQ GALEQGKWIHGY+
Subjt:  DEMPMKNVICWTSMISGYINCGDLGEARDLFDRSPVKDVVLWTAMINGYVQFHHFDEAVALFREMQIRNVKPDKFTVVTLLTGCAQLGALEQGKWIHGYL

Query:  DENRITMDVVVGTALIEMYSKCGCVDKSLEIFYELEDKDTASWTSIICGLAMNGKTGEALRLFSEMELVGAKPDDITFIGVLSACSHGGLVEEGRRLFNS
        +ENR+T+D VVGTAL++MY+KCGC++ +LE+FYE++++DTASWTS+I GLAMNG +G AL L+ EME VG + D ITF+ VL+AC+HGG V EGR++F+S
Subjt:  DENRITMDVVVGTALIEMYSKCGCVDKSLEIFYELEDKDTASWTSIICGLAMNGKTGEALRLFSEMELVGAKPDDITFIGVLSACSHGGLVEEGRRLFNS

Query:  MKKVYRIEPKVEHYGCVIDLLGRAGLLDEAEELIQGIPDENGEIVVPLYGALLSACRIHNNVDMGERLANKLVNIESCDSSIHTLLANIYASVDRWEDAK
        M + + ++PK EH  C+IDLL RAGLLDEAEELI  +  E+ E +VP+Y +LLSA R + NV + ER+A KL  +E  DSS HTLLA++YAS +RWED  
Subjt:  MKKVYRIEPKVEHYGCVIDLLGRAGLLDEAEELIQGIPDENGEIVVPLYGALLSACRIHNNVDMGERLANKLVNIESCDSSIHTLLANIYASVDRWEDAK

Query:  KVRRKMKELGVKKMPGCSSIEVDGIVHEFLVGDP--SHPKMIEICSMLDRVTGQLLGSKESQLE
         VRRKMK+LG++K PGCSSIE+DG+ HEF+VGD   SHPKM EI SML + T  +L  +  +++
Subjt:  KVRRKMKELGVKKMPGCSSIEVDGIVHEFLVGDP--SHPKMIEICSMLDRVTGQLLGSKESQLE

Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic5.7e-12736.88Show/hide
Query:  IEYLRNCKSMDQLKQIQSQIFRIGLEGDRDTINKLMAFCTDS-SLGNLRYAERIFNYIQDPSLFVYNVMVKVYAKRRIFRKVILLFQQLREDELWPDNFT
        +  L NCK++  L+ I +Q+ +IGL      ++KL+ FC  S     L YA  +F  IQ+P+L ++N M + +A        + L+  +    L P+++T
Subjt:  IEYLRNCKSMDQLKQIQSQIFRIGLEGDRDTINKLMAFCTDS-SLGNLRYAERIFNYIQDPSLFVYNVMVKVYAKRRIFRKVILLFQQLREDELWPDNFT

Query:  YPFVLKAIGCLRDVRQGEKVHGFVVKTGMDFDNYVCNSLMDM-------------------------------YSELGNVENAKKLFDKMTTKDSVSWNV
        +PFVLK+    +  ++G+++HG V+K G D D YV  SL+ M                               Y+  G +ENA+KLFD++  KD VSWN 
Subjt:  YPFVLKAIGCLRDVRQGEKVHGFVVKTGMDFDNYVCNSLMDM-------------------------------YSELGNVENAKKLFDKMTTKDSVSWNV

Query:  MIAGYVRCRRFEDAISTFREMQQLSNEEPGEATVVSTLSACTALKNMELGEEIHNYVRKE-LGFTTIINNALLDMYAKCGCLNIARNIFDEMPMKNVICW
        MI+GY     +++A+  F++M + +N  P E+T+V+ +SAC    ++ELG ++H ++     G    I NAL+D+Y+KCG L  A               
Subjt:  MIAGYVRCRRFEDAISTFREMQQLSNEEPGEATVVSTLSACTALKNMELGEEIHNYVRKE-LGFTTIINNALLDMYAKCGCLNIARNIFDEMPMKNVICW

Query:  TSMISGYINCGDLGEARDLFDRSPVKDVVLWTAMINGYVQFHHFDEAVALFREMQIRNVKPDKFTVVTLLTGCAQLGALEQGKWIHGYLDE--NRITMDV
                 CG       LF+R P KDV+ W  +I GY   + + EA+ LF+EM      P+  T++++L  CA LGA++ G+WIH Y+D+    +T   
Subjt:  TSMISGYINCGDLGEARDLFDRSPVKDVVLWTAMINGYVQFHHFDEAVALFREMQIRNVKPDKFTVVTLLTGCAQLGALEQGKWIHGYLDE--NRITMDV

Query:  VVGTALIEMYSKCGCVDKSLEIFYELEDKDTASWTSIICGLAMNGKTGEALRLFSEMELVGAKPDDITFIGVLSACSHGGLVEEGRRLFNSMKKVYRIEP
         + T+LI+MY+KCG ++ + ++F  +  K  +SW ++I G AM+G+   +  LFS M  +G +PDDITF+G+LSACSH G+++ GR +F +M + Y++ P
Subjt:  VVGTALIEMYSKCGCVDKSLEIFYELEDKDTASWTSIICGLAMNGKTGEALRLFSEMELVGAKPDDITFIGVLSACSHGGLVEEGRRLFNSMKKVYRIEP

Query:  KVEHYGCVIDLLGRAGLLDEAEELIQGIPDENGEIVVPLYGALLSACRIHNNVDMGERLANKLVNIESCDSSIHTLLANIYASVDRWEDAKKVRRKMKEL
        K+EHYGC+IDLLG +GL  EAEE+I  +  E   ++   + +LL AC++H NV++GE  A  L+ IE  +   + LL+NIYAS  RW +  K R  + + 
Subjt:  KVEHYGCVIDLLGRAGLLDEAEELIQGIPDENGEIVVPLYGALLSACRIHNNVDMGERLANKLVNIESCDSSIHTLLANIYASVDRWEDAKKVRRKMKEL

Query:  GVKKMPGCSSIEVDGIVHEFLVGDPSHPKMIEICSMLDRV
        G+KK+PGCSSIE+D +VHEF++GD  HP+  EI  ML+ +
Subjt:  GVKKMPGCSSIEVDGIVHEFLVGDPSHPKMIEICSMLDRV

Q9LSB8 Putative pentatricopeptide repeat-containing protein At3g159303.4e-14040.03Show/hide
Query:  FSSSLQLLPIPLRISKLTKKSCIEYLR------NCKSMDQLKQIQSQIFRIGLEGDRDTINKLMAFCTDSSLGNLRYAERIFNYIQDPSLFVYNVMVKVY
        F+S L +    L +S +T+    +Y R       CK+ DQ KQ+ SQ    G+  +     KL  F      G++ YA ++F  I +P + V+N M+K +
Subjt:  FSSSLQLLPIPLRISKLTKKSCIEYLR------NCKSMDQLKQIQSQIFRIGLEGDRDTINKLMAFCTDSSLGNLRYAERIFNYIQDPSLFVYNVMVKVY

Query:  AKRRIFRKVILLFQQLREDELWPDNFTYPFVLKAIGCLRD---VRQGEKVHGFVVKTGMDFDNYVCNSLMDMYSELGNVENAKKLFDKMTTKDSVSWNVM
        +K     + + L+  + ++ + PD+ T+PF+L   G  RD   +  G+K+H  VVK G+  + YV N+L+ MYS  G ++ A+ +FD+   +D  SWN+M
Subjt:  AKRRIFRKVILLFQQLREDELWPDNFTYPFVLKAIGCLRD---VRQGEKVHGFVVKTGMDFDNYVCNSLMDMYSELGNVENAKKLFDKMTTKDSVSWNVM

Query:  IAGYVRCRRFEDAISTFREMQQLSNEEPGEATVVSTLSACTALKNMELGEEIHNYVRK-ELGFTTIINNALLDMYAKCGCLNIARNIFDEMPMKNVICWT
        I+GY R + +E++I    EM++ +   P   T++  LSAC+ +K+ +L + +H YV + +   +  + NAL++ YA CG ++IA  IF  M  ++VI WT
Subjt:  IAGYVRCRRFEDAISTFREMQQLSNEEPGEATVVSTLSACTALKNMELGEEIHNYVRK-ELGFTTIINNALLDMYAKCGCLNIARNIFDEMPMKNVICWT

Query:  SMISGYINCGDLGEARDLFDRSPVKDVVLWTAMINGYVQFHHFDEAVALFREMQIRNVKPDKFTVVTLLTGCAQLGALEQGKWIHGYLDENRITMDVVVG
        S++ GY+  G+L  AR  FD+ PV+D + WT MI+GY++   F+E++ +FREMQ   + PD+FT+V++LT CA LG+LE G+WI  Y+D+N+I  DVVVG
Subjt:  SMISGYINCGDLGEARDLFDRSPVKDVVLWTAMINGYVQFHHFDEAVALFREMQIRNVKPDKFTVVTLLTGCAQLGALEQGKWIHGYLDENRITMDVVVG

Query:  TALIEMYSKCGCVDKSLEIFYELEDKDTASWTSIICGLAMNGKTGEALRLFSEMELVGAKPDDITFIGVLSACSHGGLVEEGRRLFNSMKKVYRIEPKVE
         ALI+MY KCGC +K+ ++F++++ +D  +WT+++ GLA NG+  EA+++F +M+ +  +PDDIT++GVLSAC+H G+V++ R+ F  M+  +RIEP + 
Subjt:  TALIEMYSKCGCVDKSLEIFYELEDKDTASWTSIICGLAMNGKTGEALRLFSEMELVGAKPDDITFIGVLSACSHGGLVEEGRRLFNSMKKVYRIEPKVE

Query:  HYGCVIDLLGRAGLLDEAEELIQGIPDENGEIVVPLYGALLSACRIHNNVDMGERLANKLVNIESCDSSIHTLLANIYASVDRWEDAKKVRRKMKELGVK
        HYGC++D+LGRAGL+ EA E+++ +P     IV   +GALL A R+HN+  M E  A K++ +E  + +++ LL NIYA   RW+D ++VRRK+ ++ +K
Subjt:  HYGCVIDLLGRAGLLDEAEELIQGIPDENGEIVVPLYGALLSACRIHNNVDMGERLANKLVNIESCDSSIHTLLANIYASVDRWEDAKKVRRKMKELGVK

Query:  KMPGCSSIEVDGIVHEFLVGDPSHPKMIEICSMLDRV
        K PG S IEV+G  HEF+ GD SH +  EI   L+ +
Subjt:  KMPGCSSIEVDGIVHEFLVGDPSHPKMIEICSMLDRV

Q9SJZ3 Pentatricopeptide repeat-containing protein At2g22410, mitochondrial7.3e-14341.09Show/hide
Query:  IEYLRNCKSMDQLKQIQSQIFRIGLEGDRDTINKLMAFCTDSSLGNLRYAERIFNYIQDPSLFVYNVMVKVYAKRRIFRKVILLFQQLRED---ELWPDN
        +  L  CK +  LKQIQ+Q+   GL  D    ++L+AFC  S    L Y+ +I   I++P++F +NV ++ +++    ++  LL++Q+      E  PD+
Subjt:  IEYLRNCKSMDQLKQIQSQIFRIGLEGDRDTINKLMAFCTDSSLGNLRYAERIFNYIQDPSLFVYNVMVKVYAKRRIFRKVILLFQQLRED---ELWPDN

Query:  FTYPFVLKAIGCLRDVRQGEKVHGFVVKTGMDFDNYVCNSLMDMYSELGNVENAKKLFDKMTTKDSVSWNVMIAGYVRCRRFEDAISTFREMQQLSNEEP
        FTYP + K    LR    G  + G V+K  ++  ++V N+ + M++  G++ENA+K+FD+   +D VSWN +I GY +    E AI  ++ M+     +P
Subjt:  FTYPFVLKAIGCLRDVRQGEKVHGFVVKTGMDFDNYVCNSLMDMYSELGNVENAKKLFDKMTTKDSVSWNVMIAGYVRCRRFEDAISTFREMQQLSNEEP

Query:  GEATVVSTLSACTALKNMELGEEIHNYVRKE-LGFTTIINNALLDMYAKCGCLNIARNIFDEMPMKNVICWTSMISGYINCGDLGEARDLFDRSPVKDVV
         + T++  +S+C+ L ++  G+E + YV++  L  T  + NAL+DM++KCG ++ AR IFD +  + ++ WT+MISGY  CG L  +R LFD    KDVV
Subjt:  GEATVVSTLSACTALKNMELGEEIHNYVRKE-LGFTTIINNALLDMYAKCGCLNIARNIFDEMPMKNVICWTSMISGYINCGDLGEARDLFDRSPVKDVV

Query:  LWTAMINGYVQFHHFDEAVALFREMQIRNVKPDKFTVVTLLTGCAQLGALEQGKWIHGYLDENRITMDVVVGTALIEMYSKCGCVDKSLEIFYELEDKDT
        LW AMI G VQ     +A+ALF+EMQ  N KPD+ T++  L+ C+QLGAL+ G WIH Y+++  ++++V +GT+L++MY+KCG + ++L +F+ ++ +++
Subjt:  LWTAMINGYVQFHHFDEAVALFREMQIRNVKPDKFTVVTLLTGCAQLGALEQGKWIHGYLDENRITMDVVVGTALIEMYSKCGCVDKSLEIFYELEDKDT

Query:  ASWTSIICGLAMNGKTGEALRLFSEMELVGAKPDDITFIGVLSACSHGGLVEEGRRLFNSMKKVYRIEPKVEHYGCVIDLLGRAGLLDEAEELIQGIPDE
         ++T+II GLA++G    A+  F+EM   G  PD+ITFIG+LSAC HGG+++ GR  F+ MK  + + P+++HY  ++DLLGRAGLL+EA+ L++ +P E
Subjt:  ASWTSIICGLAMNGKTGEALRLFSEMELVGAKPDDITFIGVLSACSHGGLVEEGRRLFNSMKKVYRIEPKVEHYGCVIDLLGRAGLLDEAEELIQGIPDE

Query:  NGEIVVPLYGALLSACRIHNNVDMGERLANKLVNIESCDSSIHTLLANIYASVDRWEDAKKVRRKMKELGVKKMPGCSSIEVDGIVHEFLVGDPSHPKMI
            V   +GALL  CR+H NV++GE+ A KL+ ++  DS I+ LL  +Y   + WEDAK+ RR M E GV+K+PGCSSIEV+GIV EF+V D S P+  
Subjt:  NGEIVVPLYGALLSACRIHNNVDMGERLANKLVNIESCDSSIHTLLANIYASVDRWEDAKKVRRKMKELGVKKMPGCSSIEVDGIVHEFLVGDPSHPKMI

Query:  EICSML
        +I   L
Subjt:  EICSML

Arabidopsis top hitse value%identityAlignment
AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein4.0e-12836.88Show/hide
Query:  IEYLRNCKSMDQLKQIQSQIFRIGLEGDRDTINKLMAFCTDS-SLGNLRYAERIFNYIQDPSLFVYNVMVKVYAKRRIFRKVILLFQQLREDELWPDNFT
        +  L NCK++  L+ I +Q+ +IGL      ++KL+ FC  S     L YA  +F  IQ+P+L ++N M + +A        + L+  +    L P+++T
Subjt:  IEYLRNCKSMDQLKQIQSQIFRIGLEGDRDTINKLMAFCTDS-SLGNLRYAERIFNYIQDPSLFVYNVMVKVYAKRRIFRKVILLFQQLREDELWPDNFT

Query:  YPFVLKAIGCLRDVRQGEKVHGFVVKTGMDFDNYVCNSLMDM-------------------------------YSELGNVENAKKLFDKMTTKDSVSWNV
        +PFVLK+    +  ++G+++HG V+K G D D YV  SL+ M                               Y+  G +ENA+KLFD++  KD VSWN 
Subjt:  YPFVLKAIGCLRDVRQGEKVHGFVVKTGMDFDNYVCNSLMDM-------------------------------YSELGNVENAKKLFDKMTTKDSVSWNV

Query:  MIAGYVRCRRFEDAISTFREMQQLSNEEPGEATVVSTLSACTALKNMELGEEIHNYVRKE-LGFTTIINNALLDMYAKCGCLNIARNIFDEMPMKNVICW
        MI+GY     +++A+  F++M + +N  P E+T+V+ +SAC    ++ELG ++H ++     G    I NAL+D+Y+KCG L  A               
Subjt:  MIAGYVRCRRFEDAISTFREMQQLSNEEPGEATVVSTLSACTALKNMELGEEIHNYVRKE-LGFTTIINNALLDMYAKCGCLNIARNIFDEMPMKNVICW

Query:  TSMISGYINCGDLGEARDLFDRSPVKDVVLWTAMINGYVQFHHFDEAVALFREMQIRNVKPDKFTVVTLLTGCAQLGALEQGKWIHGYLDE--NRITMDV
                 CG       LF+R P KDV+ W  +I GY   + + EA+ LF+EM      P+  T++++L  CA LGA++ G+WIH Y+D+    +T   
Subjt:  TSMISGYINCGDLGEARDLFDRSPVKDVVLWTAMINGYVQFHHFDEAVALFREMQIRNVKPDKFTVVTLLTGCAQLGALEQGKWIHGYLDE--NRITMDV

Query:  VVGTALIEMYSKCGCVDKSLEIFYELEDKDTASWTSIICGLAMNGKTGEALRLFSEMELVGAKPDDITFIGVLSACSHGGLVEEGRRLFNSMKKVYRIEP
         + T+LI+MY+KCG ++ + ++F  +  K  +SW ++I G AM+G+   +  LFS M  +G +PDDITF+G+LSACSH G+++ GR +F +M + Y++ P
Subjt:  VVGTALIEMYSKCGCVDKSLEIFYELEDKDTASWTSIICGLAMNGKTGEALRLFSEMELVGAKPDDITFIGVLSACSHGGLVEEGRRLFNSMKKVYRIEP

Query:  KVEHYGCVIDLLGRAGLLDEAEELIQGIPDENGEIVVPLYGALLSACRIHNNVDMGERLANKLVNIESCDSSIHTLLANIYASVDRWEDAKKVRRKMKEL
        K+EHYGC+IDLLG +GL  EAEE+I  +  E   ++   + +LL AC++H NV++GE  A  L+ IE  +   + LL+NIYAS  RW +  K R  + + 
Subjt:  KVEHYGCVIDLLGRAGLLDEAEELIQGIPDENGEIVVPLYGALLSACRIHNNVDMGERLANKLVNIESCDSSIHTLLANIYASVDRWEDAKKVRRKMKEL

Query:  GVKKMPGCSSIEVDGIVHEFLVGDPSHPKMIEICSMLDRV
        G+KK+PGCSSIE+D +VHEF++GD  HP+  EI  ML+ +
Subjt:  GVKKMPGCSSIEVDGIVHEFLVGDPSHPKMIEICSMLDRV

AT1G31430.1 Pentatricopeptide repeat (PPR-like) superfamily protein6.7e-20058.69Show/hide
Query:  IQDPSLFVYNVMVKVYAKRRIFRKVILLFQQLREDELWPDNFTYPFVLKAIGCLRDVRQGEKVHGFVVKTGMDFDNYVCNSLMDMYSELGNVENAKKLFD
        +Q PSL +YN M+K  A  + F KV+ LF +LR   L+PDNFT P VLK+IG LR V +GEKVHG+ VK G++FD+YV NSLM MY+ LG +E   K+FD
Subjt:  IQDPSLFVYNVMVKVYAKRRIFRKVILLFQQLREDELWPDNFTYPFVLKAIGCLRDVRQGEKVHGFVVKTGMDFDNYVCNSLMDMYSELGNVENAKKLFD

Query:  KMTTKDSVSWNVMIAGYVRCRRFEDAISTFREMQQLSNEEPGEATVVSTLSACTALKNMELGEEIHNYVRKELGFTTIINNALLDMYAKCGCLNIARNIF
        +M  +D VSWN +I+ YV   RFEDAI  F+ M Q SN +  E T+VSTLSAC+ALKN+E+GE I+ +V  E   +  I NAL+DM+ KCGCL+ AR +F
Subjt:  KMTTKDSVSWNVMIAGYVRCRRFEDAISTFREMQQLSNEEPGEATVVSTLSACTALKNMELGEEIHNYVRKELGFTTIINNALLDMYAKCGCLNIARNIF

Query:  DEMPMKNVICWTSMISGYINCGDLGEARDLFDRSPVKDVVLWTAMINGYVQFHHFDEAVALFREMQIRNVKPDKFTVVTLLTGCAQLGALEQGKWIHGYL
        D M  KNV CWTSM+ GY++ G + EAR LF+RSPVKDVVLWTAM+NGYVQF+ FDEA+ LFR MQ   ++PD F +V+LLTGCAQ GALEQGKWIHGY+
Subjt:  DEMPMKNVICWTSMISGYINCGDLGEARDLFDRSPVKDVVLWTAMINGYVQFHHFDEAVALFREMQIRNVKPDKFTVVTLLTGCAQLGALEQGKWIHGYL

Query:  DENRITMDVVVGTALIEMYSKCGCVDKSLEIFYELEDKDTASWTSIICGLAMNGKTGEALRLFSEMELVGAKPDDITFIGVLSACSHGGLVEEGRRLFNS
        +ENR+T+D VVGTAL++MY+KCGC++ +LE+FYE++++DTASWTS+I GLAMNG +G AL L+ EME VG + D ITF+ VL+AC+HGG V EGR++F+S
Subjt:  DENRITMDVVVGTALIEMYSKCGCVDKSLEIFYELEDKDTASWTSIICGLAMNGKTGEALRLFSEMELVGAKPDDITFIGVLSACSHGGLVEEGRRLFNS

Query:  MKKVYRIEPKVEHYGCVIDLLGRAGLLDEAEELIQGIPDENGEIVVPLYGALLSACRIHNNVDMGERLANKLVNIESCDSSIHTLLANIYASVDRWEDAK
        M + + ++PK EH  C+IDLL RAGLLDEAEELI  +  E+ E +VP+Y +LLSA R + NV + ER+A KL  +E  DSS HTLLA++YAS +RWED  
Subjt:  MKKVYRIEPKVEHYGCVIDLLGRAGLLDEAEELIQGIPDENGEIVVPLYGALLSACRIHNNVDMGERLANKLVNIESCDSSIHTLLANIYASVDRWEDAK

Query:  KVRRKMKELGVKKMPGCSSIEVDGIVHEFLVGDP--SHPKMIEICSMLDRVTGQLLGSKESQLE
         VRRKMK+LG++K PGCSSIE+DG+ HEF+VGD   SHPKM EI SML + T  +L  +  +++
Subjt:  KVRRKMKELGVKKMPGCSSIEVDGIVHEFLVGDP--SHPKMIEICSMLDRVTGQLLGSKESQLE

AT2G22410.1 SLOW GROWTH 15.2e-14441.09Show/hide
Query:  IEYLRNCKSMDQLKQIQSQIFRIGLEGDRDTINKLMAFCTDSSLGNLRYAERIFNYIQDPSLFVYNVMVKVYAKRRIFRKVILLFQQLRED---ELWPDN
        +  L  CK +  LKQIQ+Q+   GL  D    ++L+AFC  S    L Y+ +I   I++P++F +NV ++ +++    ++  LL++Q+      E  PD+
Subjt:  IEYLRNCKSMDQLKQIQSQIFRIGLEGDRDTINKLMAFCTDSSLGNLRYAERIFNYIQDPSLFVYNVMVKVYAKRRIFRKVILLFQQLRED---ELWPDN

Query:  FTYPFVLKAIGCLRDVRQGEKVHGFVVKTGMDFDNYVCNSLMDMYSELGNVENAKKLFDKMTTKDSVSWNVMIAGYVRCRRFEDAISTFREMQQLSNEEP
        FTYP + K    LR    G  + G V+K  ++  ++V N+ + M++  G++ENA+K+FD+   +D VSWN +I GY +    E AI  ++ M+     +P
Subjt:  FTYPFVLKAIGCLRDVRQGEKVHGFVVKTGMDFDNYVCNSLMDMYSELGNVENAKKLFDKMTTKDSVSWNVMIAGYVRCRRFEDAISTFREMQQLSNEEP

Query:  GEATVVSTLSACTALKNMELGEEIHNYVRKE-LGFTTIINNALLDMYAKCGCLNIARNIFDEMPMKNVICWTSMISGYINCGDLGEARDLFDRSPVKDVV
         + T++  +S+C+ L ++  G+E + YV++  L  T  + NAL+DM++KCG ++ AR IFD +  + ++ WT+MISGY  CG L  +R LFD    KDVV
Subjt:  GEATVVSTLSACTALKNMELGEEIHNYVRKE-LGFTTIINNALLDMYAKCGCLNIARNIFDEMPMKNVICWTSMISGYINCGDLGEARDLFDRSPVKDVV

Query:  LWTAMINGYVQFHHFDEAVALFREMQIRNVKPDKFTVVTLLTGCAQLGALEQGKWIHGYLDENRITMDVVVGTALIEMYSKCGCVDKSLEIFYELEDKDT
        LW AMI G VQ     +A+ALF+EMQ  N KPD+ T++  L+ C+QLGAL+ G WIH Y+++  ++++V +GT+L++MY+KCG + ++L +F+ ++ +++
Subjt:  LWTAMINGYVQFHHFDEAVALFREMQIRNVKPDKFTVVTLLTGCAQLGALEQGKWIHGYLDENRITMDVVVGTALIEMYSKCGCVDKSLEIFYELEDKDT

Query:  ASWTSIICGLAMNGKTGEALRLFSEMELVGAKPDDITFIGVLSACSHGGLVEEGRRLFNSMKKVYRIEPKVEHYGCVIDLLGRAGLLDEAEELIQGIPDE
         ++T+II GLA++G    A+  F+EM   G  PD+ITFIG+LSAC HGG+++ GR  F+ MK  + + P+++HY  ++DLLGRAGLL+EA+ L++ +P E
Subjt:  ASWTSIICGLAMNGKTGEALRLFSEMELVGAKPDDITFIGVLSACSHGGLVEEGRRLFNSMKKVYRIEPKVEHYGCVIDLLGRAGLLDEAEELIQGIPDE

Query:  NGEIVVPLYGALLSACRIHNNVDMGERLANKLVNIESCDSSIHTLLANIYASVDRWEDAKKVRRKMKELGVKKMPGCSSIEVDGIVHEFLVGDPSHPKMI
            V   +GALL  CR+H NV++GE+ A KL+ ++  DS I+ LL  +Y   + WEDAK+ RR M E GV+K+PGCSSIEV+GIV EF+V D S P+  
Subjt:  NGEIVVPLYGALLSACRIHNNVDMGERLANKLVNIESCDSSIHTLLANIYASVDRWEDAKKVRRKMKELGVKKMPGCSSIEVDGIVHEFLVGDPSHPKMI

Query:  EICSML
        +I   L
Subjt:  EICSML

AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.7e-13436.71Show/hide
Query:  IEYLRNCKSMDQLKQIQSQIFRIGLEGDRDTINKLMAFCTDSSLGNLRYAERIFNYIQDPSLFVYNVMVKVYAK-RRIFRKVILLFQQLREDELWPDNFT
        I  +  C S+ QLKQ    + R G   D  + +KL A    SS  +L YA ++F+ I  P+ F +N +++ YA        +      + E + +P+ +T
Subjt:  IEYLRNCKSMDQLKQIQSQIFRIGLEGDRDTINKLMAFCTDSSLGNLRYAERIFNYIQDPSLFVYNVMVKVYAK-RRIFRKVILLFQQLREDELWPDNFT

Query:  YPFVLKAIGCLRDVRQGEKVHGFVVKTGMDFDNYVCNSLMDMYSELGNVENAKKLFDKMTTKDSVSWNVMIAGYVRCRRFEDAISTFREMQQLSNEEPGE
        +PF++KA   +  +  G+ +HG  VK+ +  D +V NSL+  Y   G++++A K+F  +  KD VSWN MI G+V+    + A+  F++M+   + +   
Subjt:  YPFVLKAIGCLRDVRQGEKVHGFVVKTGMDFDNYVCNSLMDMYSELGNVENAKKLFDKMTTKDSVSWNVMIAGYVRCRRFEDAISTFREMQQLSNEEPGE

Query:  ATVVSTLSACTALKNMELGEEIHNYVRK-ELGFTTIINNALLDMYAKCGCLNIARNIFDEMPMKNVICWTSMISGYINCGDLGEARDLFDRSPVKDVVLW
         T+V  LSAC  ++N+E G ++ +Y+ +  +     + NA+LDMY KCG +  A+ +FD M  K+ + WT+M+ GY    D   AR++ +  P KD+V W
Subjt:  ATVVSTLSACTALKNMELGEEIHNYVRK-ELGFTTIINNALLDMYAKCGCLNIARNIFDEMPMKNVICWTSMISGYINCGDLGEARDLFDRSPVKDVVLW

Query:  TAMINGYVQFHHFDEAVALFREMQI-RNVKPDKFTVVTLLTGCAQLGALEQGKWIHGYLDENRITMDVVVGTALIEMYSKCGCVDKSLEIFYELEDKDTA
         A+I+ Y Q    +EA+ +F E+Q+ +N+K ++ T+V+ L+ CAQ+GALE G+WIH Y+ ++ I M+  V +ALI MYSKCG ++KS E+F  +E +D  
Subjt:  TAMINGYVQFHHFDEAVALFREMQI-RNVKPDKFTVVTLLTGCAQLGALEQGKWIHGYLDENRITMDVVVGTALIEMYSKCGCVDKSLEIFYELEDKDTA

Query:  SWTSIICGLAMNGKTGEALRLFSEMELVGAKPDDITFIGVLSACSHGGLVEEGRRLFNSMKKVYRIEPKVEHYGCVIDLLGRAGLLDEAEELIQGIPDEN
         W+++I GLAM+G   EA+ +F +M+    KP+ +TF  V  ACSH GLV+E   LF+ M+  Y I P+ +HY C++D+LGR+G L++A + I+ +P   
Subjt:  SWTSIICGLAMNGKTGEALRLFSEMELVGAKPDDITFIGVLSACSHGGLVEEGRRLFNSMKKVYRIEPKVEHYGCVIDLLGRAGLLDEAEELIQGIPDEN

Query:  GEIVVPLYGALLSACRIHNNVDMGERLANKLVNIESCDSSIHTLLANIYASVDRWEDAKKVRRKMKELGVKKMPGCSSIEVDGIVHEFLVGDPSHPKMIE
           V   +GALL AC+IH N+++ E    +L+ +E  +   H LL+NIYA + +WE+  ++R+ M+  G+KK PGCSSIE+DG++HEFL GD +HP   +
Subjt:  GEIVVPLYGALLSACRIHNNVDMGERLANKLVNIESCDSSIHTLLANIYASVDRWEDAKKVRRKMKELGVKKMPGCSSIEVDGIVHEFLVGDPSHPKMIE

Query:  ICSMLDRVTGQLLGS-KESQLEGVMPLYKDTQ
        +   L  V  +L  +  E ++  V+ + ++ +
Subjt:  ICSMLDRVTGQLLGS-KESQLEGVMPLYKDTQ

AT3G15930.1 Pentatricopeptide repeat (PPR) superfamily protein2.4e-14140.03Show/hide
Query:  FSSSLQLLPIPLRISKLTKKSCIEYLR------NCKSMDQLKQIQSQIFRIGLEGDRDTINKLMAFCTDSSLGNLRYAERIFNYIQDPSLFVYNVMVKVY
        F+S L +    L +S +T+    +Y R       CK+ DQ KQ+ SQ    G+  +     KL  F      G++ YA ++F  I +P + V+N M+K +
Subjt:  FSSSLQLLPIPLRISKLTKKSCIEYLR------NCKSMDQLKQIQSQIFRIGLEGDRDTINKLMAFCTDSSLGNLRYAERIFNYIQDPSLFVYNVMVKVY

Query:  AKRRIFRKVILLFQQLREDELWPDNFTYPFVLKAIGCLRD---VRQGEKVHGFVVKTGMDFDNYVCNSLMDMYSELGNVENAKKLFDKMTTKDSVSWNVM
        +K     + + L+  + ++ + PD+ T+PF+L   G  RD   +  G+K+H  VVK G+  + YV N+L+ MYS  G ++ A+ +FD+   +D  SWN+M
Subjt:  AKRRIFRKVILLFQQLREDELWPDNFTYPFVLKAIGCLRD---VRQGEKVHGFVVKTGMDFDNYVCNSLMDMYSELGNVENAKKLFDKMTTKDSVSWNVM

Query:  IAGYVRCRRFEDAISTFREMQQLSNEEPGEATVVSTLSACTALKNMELGEEIHNYVRK-ELGFTTIINNALLDMYAKCGCLNIARNIFDEMPMKNVICWT
        I+GY R + +E++I    EM++ +   P   T++  LSAC+ +K+ +L + +H YV + +   +  + NAL++ YA CG ++IA  IF  M  ++VI WT
Subjt:  IAGYVRCRRFEDAISTFREMQQLSNEEPGEATVVSTLSACTALKNMELGEEIHNYVRK-ELGFTTIINNALLDMYAKCGCLNIARNIFDEMPMKNVICWT

Query:  SMISGYINCGDLGEARDLFDRSPVKDVVLWTAMINGYVQFHHFDEAVALFREMQIRNVKPDKFTVVTLLTGCAQLGALEQGKWIHGYLDENRITMDVVVG
        S++ GY+  G+L  AR  FD+ PV+D + WT MI+GY++   F+E++ +FREMQ   + PD+FT+V++LT CA LG+LE G+WI  Y+D+N+I  DVVVG
Subjt:  SMISGYINCGDLGEARDLFDRSPVKDVVLWTAMINGYVQFHHFDEAVALFREMQIRNVKPDKFTVVTLLTGCAQLGALEQGKWIHGYLDENRITMDVVVG

Query:  TALIEMYSKCGCVDKSLEIFYELEDKDTASWTSIICGLAMNGKTGEALRLFSEMELVGAKPDDITFIGVLSACSHGGLVEEGRRLFNSMKKVYRIEPKVE
         ALI+MY KCGC +K+ ++F++++ +D  +WT+++ GLA NG+  EA+++F +M+ +  +PDDIT++GVLSAC+H G+V++ R+ F  M+  +RIEP + 
Subjt:  TALIEMYSKCGCVDKSLEIFYELEDKDTASWTSIICGLAMNGKTGEALRLFSEMELVGAKPDDITFIGVLSACSHGGLVEEGRRLFNSMKKVYRIEPKVE

Query:  HYGCVIDLLGRAGLLDEAEELIQGIPDENGEIVVPLYGALLSACRIHNNVDMGERLANKLVNIESCDSSIHTLLANIYASVDRWEDAKKVRRKMKELGVK
        HYGC++D+LGRAGL+ EA E+++ +P     IV   +GALL A R+HN+  M E  A K++ +E  + +++ LL NIYA   RW+D ++VRRK+ ++ +K
Subjt:  HYGCVIDLLGRAGLLDEAEELIQGIPDENGEIVVPLYGALLSACRIHNNVDMGERLANKLVNIESCDSSIHTLLANIYASVDRWEDAKKVRRKMKELGVK

Query:  KMPGCSSIEVDGIVHEFLVGDPSHPKMIEICSMLDRV
        K PG S IEV+G  HEF+ GD SH +  EI   L+ +
Subjt:  KMPGCSSIEVDGIVHEFLVGDPSHPKMIEICSMLDRV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTTGTGTCGAAAACTTGTTTGGGATTGTCCGTGTGGGGGAGTTCAATATTACAAGCTTGAGGAAGTTGACCGAGTATATCATTTCTTAGCCGGTTTAAACTCCAA
ATTCGATGCTGTGCGAAGTCGAATATTGGGACAAAAACCAACACCGACCTTGATGGAAGTCTGTTCAGAAGTTTGGTTTGAGGAAGATAGAACGAGTGCTATGAATACTG
TGGTTTGTGAACACTGTAAGAAACCATGGCACACGAAATATCAGTGTTGGAAGCTACATGGTCGGCCTCCGAATGGTAAACGCCGACCTCCAAACAACAAACCCAACCAG
GCCTTGGTAAGTGACTCGGAACCACAACGTCAAGAGAACTACTCAGCTGACAACGGTACCGCTTCTCTTGGGGCAATGGCACATTCAAGTATCTCTCCATCCTTGAGTCT
ACTCAGTATTACTGGCCAGAAACCCTGGATTCTTGATTCAGGAGCTACATACCATTTGACTGGAACTTCTGACAATTTCCTCTCTTATCATCCATGTGCTGCTTTCTCAC
CCGATGATGTTCTTTTTCAGGACTTGAGCTCGGGGAAGACGATTGGCACTGCCCAGCACAATAGGGGACTCTATTTCCTTAATGGTGATACTTCCTCTAGGCACAGTTCT
AGGGCTAGTCTGCTATCTTCCTATTTTTCAACTTCTGAAACTGACTGTATCGTTCACCAGAGCTCATGTGCCTATACGCCCCAACAAAATGGAGTGGCTGAAAGAAAAAA
CTGTCAATTCCTTGAAGTTGCTCAGTCTTTCATGCTTTCAGCCTCCCTTCCATCTTACCTATGGGGAGATGTAGTATTGACTGCAGCTCATCTTATAAATCGGATGCCTT
CTCGCGTTCTCCACTTCCAAATCCCTCTTGACTACCTCAAATTGTCTTATCCTACCGCTCGCCTTATACCTGATGTCCTTCTCCGAGTGTTTGGGTGTACAGCATTTGTC
CACAGCTTTAGTCCAAACCAAACTAAGTTCTCCCCTCGGGCCCAGAAATGTGTCTTCGTTGGATATCCCCTCCACCAACGTGGTTATAAATGTTTCCATCCCACTTCCCG
GAAATACTTCATCTCTATGGATATCACCTTCCTAGAAGATCAACCTTTCTTTCCCAATCTTAGAAAGGAAATACCGGCCCCTACTGATATGTCGGCTCCGGTCCAATCAT
CCGAACCGACACAAGCCCAAGGTACCACTGATCCTGATAATAATACTATTTGCGCTGAAAATGTTTGTGTTGAAAACGATATTGTTGACCTGACAAAACTTCCGGTAGAA
GATGGTAAAGATGATATGACAGAAAATAACCAGGTTGCTGAAAGTGATGTAGTGTTAACAATAGTTAAGGAGAATGAAAAAGGGATAATACCTCAAAACCCTACAACTGG
AGAAGAGCTCGACAAGCCAGGAGAGTGTGATACCAATCTAGACCTGCCCATTGCTTTAAGAAAGGGCACAAGATCTTGCACCAGGTATCCTATGTATAGCTTCCTCTCTT
ATAATAATTTGTCTTCTAAGTTTAGAGCATTTACTGTTAGTCTTGGTACTGTAACCATACCGAAAAACATATATGTGGCTATGGAAATCCCTGAATGGAAGGTTATTGTT
ATTGAAGAAATGAGAGCTCTTGAGAAAAATCATACTTGGGAACTTGTTGCTCTTCCGAAAGGGCATAAACCAGTTGGATGTAAATGGGTATTCACAACAAAGTATAAATC
TGACGAAACTCTGGACAGATATAAAGTCAGGCTTGTAGCAAAAGGGTTCACTCAGAATTTTAGGGTAGACTATTATGAGACTTTTTCTCCTGTCGCAAAGTTGAACACAA
TCAGAGTCCTTCTGTCAGTTGCAATTTCCGTTCTGATTGTATATGTTGATGACATCGTCTTATCTGGAGATGATACTACTGAAATCAACAAGTTAAAACAGAAAATGGCA
GACGAATTTGAAATCAAAGACCTAGGAAATCTGAAGTATTTTCTTAGGATGGAGGCTCCCTACGAAGAACACATGAAAGCTGTAAAAAGGATTCTGAGATATTTGAAAAG
CACACCAGGAAAAGGATTGATGTTTAGAAAATCCGACAGAAAATACATCGAAACCTATACAGACTCTGACTGGGCAGGATCAATAATTGACAGAAAATCTACATCTGGGT
ATTACACCTTTTTTGGGGGTAATCTTGTAACTTGGAGAAGTAAGAAGCAAAATGTGGTTGCTAGAAGCAGTGCTGAAGCAGAATACAGAGCTATGAGTCATAGAATCTGT
GAAGAAATCTGGTTGAAGAAAGTTCTATCTGATCTTCACCAAAGCAGTGAACTACCCATGAAACTCTATTGTGACAAAAAGGCTGCTATTAGTATTGCAAACAATCTGGT
TCAGCACGATAGAACAAAACATGTGGAGATCGATAGGCACTTTATAAAGGAGAAACTAGAAAACGGAAGCATTTGCATTCCTTATATTCCTTCCAGCCAACAAGTTGTAG
ATGTTCTTACCAAGGGGTTACTTAAGCAGAGCTTTGATGCATGTTCATTCGAGCAGAAGCATTTTGATTGTGCCTGTGCTTTTTCAGAGGTCTTCCAATTCTTCAAACTC
TCACAAAGCTTGCACATGCTTCCTCTATCAATCATTCGAAGAGTCCAGTTTATTTCCCGCCATTTTTCTTCAAGTCTGCAATTACTTCCAATTCCTCTCCGAATATCCAA
ACTAACAAAGAAATCATGCATCGAGTATCTCCGGAACTGCAAGTCCATGGATCAACTCAAACAAATTCAGAGTCAGATCTTTCGAATTGGTCTCGAAGGAGACCGAGACA
CAATAAACAAATTGATGGCATTCTGCACAGACTCATCTCTTGGCAACTTGCGGTATGCAGAGAGGATATTCAATTACATACAAGATCCATCTCTGTTTGTTTATAATGTG
ATGGTTAAAGTGTATGCCAAAAGGCGAATTTTCAGAAAAGTCATTTTGCTATTTCAACAGTTGAGGGAGGATGAATTGTGGCCCGATAATTTTACTTACCCATTTGTTCT
GAAAGCTATTGGTTGCTTAAGGGACGTGAGGCAAGGTGAAAAGGTTCATGGCTTTGTGGTGAAAACAGGAATGGATTTTGATAATTATGTTTGTAATTCACTTATGGATA
TGTATTCTGAATTGGGCAATGTTGAGAATGCTAAGAAGTTATTTGACAAAATGACGACTAAAGATTCGGTTTCTTGGAATGTTATGATTGCTGGGTATGTTAGGTGTCGG
AGATTTGAAGATGCTATTAGTACATTTAGGGAAATGCAGCAATTGAGCAATGAGGAACCTGGTGAAGCTACTGTAGTTAGCACTCTTTCTGCTTGTACAGCACTGAAAAA
TATGGAGCTTGGGGAGGAAATTCACAACTATGTTAGAAAGGAGCTTGGTTTTACCACTATAATCAACAACGCATTGTTAGATATGTATGCAAAATGTGGGTGTTTAAATA
TTGCCCGCAATATATTTGATGAAATGCCTATGAAAAATGTAATTTGTTGGACTAGCATGATTTCTGGCTACATAAACTGTGGTGATTTAGGAGAAGCTAGAGACTTGTTT
GACAGAAGTCCAGTTAAAGATGTTGTTTTGTGGACAGCTATGATAAATGGGTATGTGCAGTTCCACCATTTTGATGAAGCTGTGGCCCTGTTTCGCGAAATGCAAATTCG
AAATGTAAAACCAGATAAGTTCACAGTGGTCACTCTCCTCACAGGTTGTGCTCAGTTGGGAGCTCTAGAACAAGGGAAATGGATTCATGGATACCTAGATGAGAACAGAA
TAACAATGGATGTTGTTGTTGGTACTGCGCTCATTGAAATGTATTCCAAATGTGGATGTGTAGATAAATCATTAGAAATTTTCTATGAGTTAGAGGATAAGGACACAGCA
TCTTGGACGTCGATTATTTGTGGTCTTGCCATGAATGGTAAGACGGGTGAAGCACTTAGGTTGTTCTCAGAAATGGAACTTGTGGGGGCTAAACCTGATGATATCACCTT
CATTGGTGTTTTAAGTGCCTGTAGTCATGGTGGGTTGGTTGAGGAAGGGCGTAGGTTATTCAACTCGATGAAAAAGGTCTACCGAATTGAGCCGAAGGTAGAACACTATG
GGTGTGTAATTGACCTCCTTGGTAGAGCTGGGCTATTGGATGAAGCAGAGGAACTGATACAGGGGATTCCCGATGAAAATGGCGAAATTGTAGTTCCACTCTATGGTGCT
TTGCTTAGTGCTTGCAGAATCCACAACAATGTTGACATGGGTGAAAGACTAGCCAATAAACTGGTTAACATTGAATCATGTGATTCTAGCATTCATACACTTCTTGCCAA
TATATACGCTTCTGTTGATAGGTGGGAAGATGCAAAGAAAGTGAGAAGGAAAATGAAAGAACTCGGAGTGAAGAAGATGCCTGGGTGTAGTTCGATTGAGGTTGATGGCA
TTGTTCACGAGTTTCTTGTTGGAGATCCATCTCATCCGAAAATGATAGAGATATGCTCCATGTTGGATAGAGTGACTGGGCAATTACTAGGATCAAAGGAATCTCAGCTT
GAAGGAGTGATGCCACTTTACAAGGACACTCAATACTGCAGTTTTGTAGAATTTAAGGAGATTTACTATGAGGGAAGAAAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGATTTGTGTCGAAAACTTGTTTGGGATTGTCCGTGTGGGGGAGTTCAATATTACAAGCTTGAGGAAGTTGACCGAGTATATCATTTCTTAGCCGGTTTAAACTCCAA
ATTCGATGCTGTGCGAAGTCGAATATTGGGACAAAAACCAACACCGACCTTGATGGAAGTCTGTTCAGAAGTTTGGTTTGAGGAAGATAGAACGAGTGCTATGAATACTG
TGGTTTGTGAACACTGTAAGAAACCATGGCACACGAAATATCAGTGTTGGAAGCTACATGGTCGGCCTCCGAATGGTAAACGCCGACCTCCAAACAACAAACCCAACCAG
GCCTTGGTAAGTGACTCGGAACCACAACGTCAAGAGAACTACTCAGCTGACAACGGTACCGCTTCTCTTGGGGCAATGGCACATTCAAGTATCTCTCCATCCTTGAGTCT
ACTCAGTATTACTGGCCAGAAACCCTGGATTCTTGATTCAGGAGCTACATACCATTTGACTGGAACTTCTGACAATTTCCTCTCTTATCATCCATGTGCTGCTTTCTCAC
CCGATGATGTTCTTTTTCAGGACTTGAGCTCGGGGAAGACGATTGGCACTGCCCAGCACAATAGGGGACTCTATTTCCTTAATGGTGATACTTCCTCTAGGCACAGTTCT
AGGGCTAGTCTGCTATCTTCCTATTTTTCAACTTCTGAAACTGACTGTATCGTTCACCAGAGCTCATGTGCCTATACGCCCCAACAAAATGGAGTGGCTGAAAGAAAAAA
CTGTCAATTCCTTGAAGTTGCTCAGTCTTTCATGCTTTCAGCCTCCCTTCCATCTTACCTATGGGGAGATGTAGTATTGACTGCAGCTCATCTTATAAATCGGATGCCTT
CTCGCGTTCTCCACTTCCAAATCCCTCTTGACTACCTCAAATTGTCTTATCCTACCGCTCGCCTTATACCTGATGTCCTTCTCCGAGTGTTTGGGTGTACAGCATTTGTC
CACAGCTTTAGTCCAAACCAAACTAAGTTCTCCCCTCGGGCCCAGAAATGTGTCTTCGTTGGATATCCCCTCCACCAACGTGGTTATAAATGTTTCCATCCCACTTCCCG
GAAATACTTCATCTCTATGGATATCACCTTCCTAGAAGATCAACCTTTCTTTCCCAATCTTAGAAAGGAAATACCGGCCCCTACTGATATGTCGGCTCCGGTCCAATCAT
CCGAACCGACACAAGCCCAAGGTACCACTGATCCTGATAATAATACTATTTGCGCTGAAAATGTTTGTGTTGAAAACGATATTGTTGACCTGACAAAACTTCCGGTAGAA
GATGGTAAAGATGATATGACAGAAAATAACCAGGTTGCTGAAAGTGATGTAGTGTTAACAATAGTTAAGGAGAATGAAAAAGGGATAATACCTCAAAACCCTACAACTGG
AGAAGAGCTCGACAAGCCAGGAGAGTGTGATACCAATCTAGACCTGCCCATTGCTTTAAGAAAGGGCACAAGATCTTGCACCAGGTATCCTATGTATAGCTTCCTCTCTT
ATAATAATTTGTCTTCTAAGTTTAGAGCATTTACTGTTAGTCTTGGTACTGTAACCATACCGAAAAACATATATGTGGCTATGGAAATCCCTGAATGGAAGGTTATTGTT
ATTGAAGAAATGAGAGCTCTTGAGAAAAATCATACTTGGGAACTTGTTGCTCTTCCGAAAGGGCATAAACCAGTTGGATGTAAATGGGTATTCACAACAAAGTATAAATC
TGACGAAACTCTGGACAGATATAAAGTCAGGCTTGTAGCAAAAGGGTTCACTCAGAATTTTAGGGTAGACTATTATGAGACTTTTTCTCCTGTCGCAAAGTTGAACACAA
TCAGAGTCCTTCTGTCAGTTGCAATTTCCGTTCTGATTGTATATGTTGATGACATCGTCTTATCTGGAGATGATACTACTGAAATCAACAAGTTAAAACAGAAAATGGCA
GACGAATTTGAAATCAAAGACCTAGGAAATCTGAAGTATTTTCTTAGGATGGAGGCTCCCTACGAAGAACACATGAAAGCTGTAAAAAGGATTCTGAGATATTTGAAAAG
CACACCAGGAAAAGGATTGATGTTTAGAAAATCCGACAGAAAATACATCGAAACCTATACAGACTCTGACTGGGCAGGATCAATAATTGACAGAAAATCTACATCTGGGT
ATTACACCTTTTTTGGGGGTAATCTTGTAACTTGGAGAAGTAAGAAGCAAAATGTGGTTGCTAGAAGCAGTGCTGAAGCAGAATACAGAGCTATGAGTCATAGAATCTGT
GAAGAAATCTGGTTGAAGAAAGTTCTATCTGATCTTCACCAAAGCAGTGAACTACCCATGAAACTCTATTGTGACAAAAAGGCTGCTATTAGTATTGCAAACAATCTGGT
TCAGCACGATAGAACAAAACATGTGGAGATCGATAGGCACTTTATAAAGGAGAAACTAGAAAACGGAAGCATTTGCATTCCTTATATTCCTTCCAGCCAACAAGTTGTAG
ATGTTCTTACCAAGGGGTTACTTAAGCAGAGCTTTGATGCATGTTCATTCGAGCAGAAGCATTTTGATTGTGCCTGTGCTTTTTCAGAGGTCTTCCAATTCTTCAAACTC
TCACAAAGCTTGCACATGCTTCCTCTATCAATCATTCGAAGAGTCCAGTTTATTTCCCGCCATTTTTCTTCAAGTCTGCAATTACTTCCAATTCCTCTCCGAATATCCAA
ACTAACAAAGAAATCATGCATCGAGTATCTCCGGAACTGCAAGTCCATGGATCAACTCAAACAAATTCAGAGTCAGATCTTTCGAATTGGTCTCGAAGGAGACCGAGACA
CAATAAACAAATTGATGGCATTCTGCACAGACTCATCTCTTGGCAACTTGCGGTATGCAGAGAGGATATTCAATTACATACAAGATCCATCTCTGTTTGTTTATAATGTG
ATGGTTAAAGTGTATGCCAAAAGGCGAATTTTCAGAAAAGTCATTTTGCTATTTCAACAGTTGAGGGAGGATGAATTGTGGCCCGATAATTTTACTTACCCATTTGTTCT
GAAAGCTATTGGTTGCTTAAGGGACGTGAGGCAAGGTGAAAAGGTTCATGGCTTTGTGGTGAAAACAGGAATGGATTTTGATAATTATGTTTGTAATTCACTTATGGATA
TGTATTCTGAATTGGGCAATGTTGAGAATGCTAAGAAGTTATTTGACAAAATGACGACTAAAGATTCGGTTTCTTGGAATGTTATGATTGCTGGGTATGTTAGGTGTCGG
AGATTTGAAGATGCTATTAGTACATTTAGGGAAATGCAGCAATTGAGCAATGAGGAACCTGGTGAAGCTACTGTAGTTAGCACTCTTTCTGCTTGTACAGCACTGAAAAA
TATGGAGCTTGGGGAGGAAATTCACAACTATGTTAGAAAGGAGCTTGGTTTTACCACTATAATCAACAACGCATTGTTAGATATGTATGCAAAATGTGGGTGTTTAAATA
TTGCCCGCAATATATTTGATGAAATGCCTATGAAAAATGTAATTTGTTGGACTAGCATGATTTCTGGCTACATAAACTGTGGTGATTTAGGAGAAGCTAGAGACTTGTTT
GACAGAAGTCCAGTTAAAGATGTTGTTTTGTGGACAGCTATGATAAATGGGTATGTGCAGTTCCACCATTTTGATGAAGCTGTGGCCCTGTTTCGCGAAATGCAAATTCG
AAATGTAAAACCAGATAAGTTCACAGTGGTCACTCTCCTCACAGGTTGTGCTCAGTTGGGAGCTCTAGAACAAGGGAAATGGATTCATGGATACCTAGATGAGAACAGAA
TAACAATGGATGTTGTTGTTGGTACTGCGCTCATTGAAATGTATTCCAAATGTGGATGTGTAGATAAATCATTAGAAATTTTCTATGAGTTAGAGGATAAGGACACAGCA
TCTTGGACGTCGATTATTTGTGGTCTTGCCATGAATGGTAAGACGGGTGAAGCACTTAGGTTGTTCTCAGAAATGGAACTTGTGGGGGCTAAACCTGATGATATCACCTT
CATTGGTGTTTTAAGTGCCTGTAGTCATGGTGGGTTGGTTGAGGAAGGGCGTAGGTTATTCAACTCGATGAAAAAGGTCTACCGAATTGAGCCGAAGGTAGAACACTATG
GGTGTGTAATTGACCTCCTTGGTAGAGCTGGGCTATTGGATGAAGCAGAGGAACTGATACAGGGGATTCCCGATGAAAATGGCGAAATTGTAGTTCCACTCTATGGTGCT
TTGCTTAGTGCTTGCAGAATCCACAACAATGTTGACATGGGTGAAAGACTAGCCAATAAACTGGTTAACATTGAATCATGTGATTCTAGCATTCATACACTTCTTGCCAA
TATATACGCTTCTGTTGATAGGTGGGAAGATGCAAAGAAAGTGAGAAGGAAAATGAAAGAACTCGGAGTGAAGAAGATGCCTGGGTGTAGTTCGATTGAGGTTGATGGCA
TTGTTCACGAGTTTCTTGTTGGAGATCCATCTCATCCGAAAATGATAGAGATATGCTCCATGTTGGATAGAGTGACTGGGCAATTACTAGGATCAAAGGAATCTCAGCTT
GAAGGAGTGATGCCACTTTACAAGGACACTCAATACTGCAGTTTTGTAGAATTTAAGGAGATTTACTATGAGGGAAGAAAGTGA
Protein sequenceShow/hide protein sequence
MDLCRKLVWDCPCGGVQYYKLEEVDRVYHFLAGLNSKFDAVRSRILGQKPTPTLMEVCSEVWFEEDRTSAMNTVVCEHCKKPWHTKYQCWKLHGRPPNGKRRPPNNKPNQ
ALVSDSEPQRQENYSADNGTASLGAMAHSSISPSLSLLSITGQKPWILDSGATYHLTGTSDNFLSYHPCAAFSPDDVLFQDLSSGKTIGTAQHNRGLYFLNGDTSSRHSS
RASLLSSYFSTSETDCIVHQSSCAYTPQQNGVAERKNCQFLEVAQSFMLSASLPSYLWGDVVLTAAHLINRMPSRVLHFQIPLDYLKLSYPTARLIPDVLLRVFGCTAFV
HSFSPNQTKFSPRAQKCVFVGYPLHQRGYKCFHPTSRKYFISMDITFLEDQPFFPNLRKEIPAPTDMSAPVQSSEPTQAQGTTDPDNNTICAENVCVENDIVDLTKLPVE
DGKDDMTENNQVAESDVVLTIVKENEKGIIPQNPTTGEELDKPGECDTNLDLPIALRKGTRSCTRYPMYSFLSYNNLSSKFRAFTVSLGTVTIPKNIYVAMEIPEWKVIV
IEEMRALEKNHTWELVALPKGHKPVGCKWVFTTKYKSDETLDRYKVRLVAKGFTQNFRVDYYETFSPVAKLNTIRVLLSVAISVLIVYVDDIVLSGDDTTEINKLKQKMA
DEFEIKDLGNLKYFLRMEAPYEEHMKAVKRILRYLKSTPGKGLMFRKSDRKYIETYTDSDWAGSIIDRKSTSGYYTFFGGNLVTWRSKKQNVVARSSAEAEYRAMSHRIC
EEIWLKKVLSDLHQSSELPMKLYCDKKAAISIANNLVQHDRTKHVEIDRHFIKEKLENGSICIPYIPSSQQVVDVLTKGLLKQSFDACSFEQKHFDCACAFSEVFQFFKL
SQSLHMLPLSIIRRVQFISRHFSSSLQLLPIPLRISKLTKKSCIEYLRNCKSMDQLKQIQSQIFRIGLEGDRDTINKLMAFCTDSSLGNLRYAERIFNYIQDPSLFVYNV
MVKVYAKRRIFRKVILLFQQLREDELWPDNFTYPFVLKAIGCLRDVRQGEKVHGFVVKTGMDFDNYVCNSLMDMYSELGNVENAKKLFDKMTTKDSVSWNVMIAGYVRCR
RFEDAISTFREMQQLSNEEPGEATVVSTLSACTALKNMELGEEIHNYVRKELGFTTIINNALLDMYAKCGCLNIARNIFDEMPMKNVICWTSMISGYINCGDLGEARDLF
DRSPVKDVVLWTAMINGYVQFHHFDEAVALFREMQIRNVKPDKFTVVTLLTGCAQLGALEQGKWIHGYLDENRITMDVVVGTALIEMYSKCGCVDKSLEIFYELEDKDTA
SWTSIICGLAMNGKTGEALRLFSEMELVGAKPDDITFIGVLSACSHGGLVEEGRRLFNSMKKVYRIEPKVEHYGCVIDLLGRAGLLDEAEELIQGIPDENGEIVVPLYGA
LLSACRIHNNVDMGERLANKLVNIESCDSSIHTLLANIYASVDRWEDAKKVRRKMKELGVKKMPGCSSIEVDGIVHEFLVGDPSHPKMIEICSMLDRVTGQLLGSKESQL
EGVMPLYKDTQYCSFVEFKEIYYEGRK