; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi01G020330 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi01G020330
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationchr01:28052382..28060220
RNA-Seq ExpressionLsi01G020330
SyntenyLsi01G020330
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_016900631.1 PREDICTED: pentatricopeptide repeat-containing protein At3g49740 isoform X1 [Cucumis melo]0.0e+0093.07Show/hide
Query:  QLKRSSRYFDSLQLFTQIHSSHCFNIKPDHYNLSTTLAVCANFRDIAFGSQLHGYAVRSGLKFYPHVANTILSLYAKTEDFVSLKRGFQEIEKPDVYSWT
        +LKRSSRY DSLQLFTQIHSS+C NIKPDHYNLSTTLAVCANFRDIAFGSQLHGYA+RSGLKFYPHVANT+LSLY+K EDFVSLKRGFQEIEKPDVYSWT
Subjt:  QLKRSSRYFDSLQLFTQIHSSHCFNIKPDHYNLSTTLAVCANFRDIAFGSQLHGYAVRSGLKFYPHVANTILSLYAKTEDFVSLKRGFQEIEKPDVYSWT

Query:  TLLSACTKLGHVEYADEMFDTMPKGNVACWNAMITGSAESGHDWIAMNTFYEMHKMGVKPDNYSFACILSLCTKEIKDLGRQVHSLVIKAGYLSKTSVIN
        TLLSAC K+GH+EYA EMFD MPKGNVACWNAMITGSAESGHDW+AMNTFYEMHKMGVKPDNYSFACILSLCTKEI+DLGRQVHS VIKAGYL KTSVIN
Subjt:  TLLSACTKLGHVEYADEMFDTMPKGNVACWNAMITGSAESGHDWIAMNTFYEMHKMGVKPDNYSFACILSLCTKEIKDLGRQVHSLVIKAGYLSKTSVIN

Query:  ALITMYFSIENLEDAYEVFEGTEAEVHDQITYNVMIDGLVCIRRNEEALIMFKDMKRACLSPTELTFVSIMSSCSFIRVGQQVHSQAIKLGFESFTLVGN
        ALITMYFSIENLEDAYEVFEGTE+EVHDQITYNVMIDGLVCIRRNEEALIMFKDMKRACLSPTELTFVSIMSSCS IRV QQVHSQAIKLGFESFTLVGN
Subjt:  ALITMYFSIENLEDAYEVFEGTEAEVHDQITYNVMIDGLVCIRRNEEALIMFKDMKRACLSPTELTFVSIMSSCSFIRVGQQVHSQAIKLGFESFTLVGN

Query:  STITMYSSCGEFQAANTVFQTLRKKDLISWNAIISSYVQGNFGKSAVLAFLQMQRTGIGPDEFTFGSLLGVSEFMEIVEMAHAFVYKNGLILVIEILNAL
        STITMYSSCGEFQAAN VFQ L +KDLISWNAIISSYVQGNFGKSAVLAFLQMQRTGIGPDEFTFGSLLGVSEF+EI+EM HAFVYKNGLILVIEILNAL
Subjt:  STITMYSSCGEFQAANTVFQTLRKKDLISWNAIISSYVQGNFGKSAVLAFLQMQRTGIGPDEFTFGSLLGVSEFMEIVEMAHAFVYKNGLILVIEILNAL

Query:  VSAYARCRKITQSHQVFSEINSKNLISWNTVIYGFLLNGLPLQALGHFSKLIMSKLKPSTFTLSIVLSICANISTLDIGKQIHGYILRSGNFSETSICNG
        VSAYA+CRK+ QSHQVFSEINSKNLISWNTVIYGFLLNGLP QAL HFSKLIMSKLKPSTFTLSIVLSICANISTLDIGKQIHGYILRSGN SETS+CNG
Subjt:  VSAYARCRKITQSHQVFSEINSKNLISWNTVIYGFLLNGLPLQALGHFSKLIMSKLKPSTFTLSIVLSICANISTLDIGKQIHGYILRSGNFSETSICNG

Query:  LITMYSKCGVLDCSLRIFNVMIERDIVSWNSVISAYAQHGQGKEAVDCFKAMQDMSSIMPDQATFTTILSACSHAGLVDEACQILDTMLIDYHVVPSVDQ
        LITMYSKCG+L  SL+ FNVMIERDIVSWNS+ISAYAQHGQGKEAV CFKAM+DM SIMPDQATFTTILSACSHAGLV+EACQILDTMLIDYHVVPS+DQ
Subjt:  LITMYSKCGVLDCSLRIFNVMIERDIVSWNSVISAYAQHGQGKEAVDCFKAMQDMSSIMPDQATFTTILSACSHAGLVDEACQILDTMLIDYHVVPSVDQ

Query:  LSCIVDLIGRSGYIDQAESVIESAQYGDHTHVWWALFSACAAHENLRLGRIVARILLEKERENPSVYVVLSNIYASAGCWEEAANVRELIKKTGSLKQPG
        LSCIVDLIGRSGYIDQAESVIESAQYG+HTHVWWALFSACAAHENLRLGRIVARILLEKERENPSVYVVLSNIYASAGCWEEAANVRELIKKTGS+KQPG
Subjt:  LSCIVDLIGRSGYIDQAESVIESAQYGDHTHVWWALFSACAAHENLRLGRIVARILLEKERENPSVYVVLSNIYASAGCWEEAANVRELIKKTGSLKQPG

Query:  CSWIRIS
        CSWI  S
Subjt:  CSWIRIS

XP_016900638.1 PREDICTED: pentatricopeptide repeat-containing protein At3g49740 isoform X2 [Cucumis melo]0.0e+0093.32Show/hide
Query:  QLKRSSRYFDSLQLFTQIHSSHCFNIKPDHYNLSTTLAVCANFRDIAFGSQLHGYAVRSGLKFYPHVANTILSLYAKTEDFVSLKRGFQEIEKPDVYSWT
        +LKRSSRY DSLQLFTQIHSS+C NIKPDHYNLSTTLAVCANFRDIAFGSQLHGYA+RSGLKFYPHVANT+LSLY+K EDFVSLKRGFQEIEKPDVYSWT
Subjt:  QLKRSSRYFDSLQLFTQIHSSHCFNIKPDHYNLSTTLAVCANFRDIAFGSQLHGYAVRSGLKFYPHVANTILSLYAKTEDFVSLKRGFQEIEKPDVYSWT

Query:  TLLSACTKLGHVEYADEMFDTMPKGNVACWNAMITGSAESGHDWIAMNTFYEMHKMGVKPDNYSFACILSLCTKEIKDLGRQVHSLVIKAGYLSKTSVIN
        TLLSAC K+GH+EYA EMFD MPKGNVACWNAMITGSAESGHDW+AMNTFYEMHKMGVKPDNYSFACILSLCTKEI+DLGRQVHS VIKAGYL KTSVIN
Subjt:  TLLSACTKLGHVEYADEMFDTMPKGNVACWNAMITGSAESGHDWIAMNTFYEMHKMGVKPDNYSFACILSLCTKEIKDLGRQVHSLVIKAGYLSKTSVIN

Query:  ALITMYFSIENLEDAYEVFEGTEAEVHDQITYNVMIDGLVCIRRNEEALIMFKDMKRACLSPTELTFVSIMSSCSFIRVGQQVHSQAIKLGFESFTLVGN
        ALITMYFSIENLEDAYEVFEGTE+EVHDQITYNVMIDGLVCIRRNEEALIMFKDMKRACLSPTELTFVSIMSSCS IRV QQVHSQAIKLGFESFTLVGN
Subjt:  ALITMYFSIENLEDAYEVFEGTEAEVHDQITYNVMIDGLVCIRRNEEALIMFKDMKRACLSPTELTFVSIMSSCSFIRVGQQVHSQAIKLGFESFTLVGN

Query:  STITMYSSCGEFQAANTVFQTLRKKDLISWNAIISSYVQGNFGKSAVLAFLQMQRTGIGPDEFTFGSLLGVSEFMEIVEMAHAFVYKNGLILVIEILNAL
        STITMYSSCGEFQAAN VFQ L +KDLISWNAIISSYVQGNFGKSAVLAFLQMQRTGIGPDEFTFGSLLGVSEF+EI+EM HAFVYKNGLILVIEILNAL
Subjt:  STITMYSSCGEFQAANTVFQTLRKKDLISWNAIISSYVQGNFGKSAVLAFLQMQRTGIGPDEFTFGSLLGVSEFMEIVEMAHAFVYKNGLILVIEILNAL

Query:  VSAYARCRKITQSHQVFSEINSKNLISWNTVIYGFLLNGLPLQALGHFSKLIMSKLKPSTFTLSIVLSICANISTLDIGKQIHGYILRSGNFSETSICNG
        VSAYA+CRK+ QSHQVFSEINSKNLISWNTVIYGFLLNGLP QAL HFSKLIMSKLKPSTFTLSIVLSICANISTLDIGKQIHGYILRSGN SETS+CNG
Subjt:  VSAYARCRKITQSHQVFSEINSKNLISWNTVIYGFLLNGLPLQALGHFSKLIMSKLKPSTFTLSIVLSICANISTLDIGKQIHGYILRSGNFSETSICNG

Query:  LITMYSKCGVLDCSLRIFNVMIERDIVSWNSVISAYAQHGQGKEAVDCFKAMQDMSSIMPDQATFTTILSACSHAGLVDEACQILDTMLIDYHVVPSVDQ
        LITMYSKCG+L  SL+ FNVMIERDIVSWNS+ISAYAQHGQGKEAV CFKAM+DM SIMPDQATFTTILSACSHAGLV+EACQILDTMLIDYHVVPS+DQ
Subjt:  LITMYSKCGVLDCSLRIFNVMIERDIVSWNSVISAYAQHGQGKEAVDCFKAMQDMSSIMPDQATFTTILSACSHAGLVDEACQILDTMLIDYHVVPSVDQ

Query:  LSCIVDLIGRSGYIDQAESVIESAQYGDHTHVWWALFSACAAHENLRLGRIVARILLEKERENPSVYVVLSNIYASAGCWEEAANVRELIKKTGSLKQPG
        LSCIVDLIGRSGYIDQAESVIESAQYG+HTHVWWALFSACAAHENLRLGRIVARILLEKERENPSVYVVLSNIYASAGCWEEAANVRELIKKTGS+KQPG
Subjt:  LSCIVDLIGRSGYIDQAESVIESAQYGDHTHVWWALFSACAAHENLRLGRIVARILLEKERENPSVYVVLSNIYASAGCWEEAANVRELIKKTGSLKQPG

Query:  CSWI
        CSWI
Subjt:  CSWI

XP_016900640.1 PREDICTED: pentatricopeptide repeat-containing protein At3g49740 isoform X3 [Cucumis melo]0.0e+0093.33Show/hide
Query:  QLKRSSRYFDSLQLFTQIHSSHCFNIKPDHYNLSTTLAVCANFRDIAFGSQLHGYAVRSGLKFYPHVANTILSLYAKTEDFVSLKRGFQEIEKPDVYSWT
        +LKRSSRY DSLQLFTQIHSS+C NIKPDHYNLSTTLAVCANFRDIAFGSQLHGYA+RSGLKFYPHVANT+LSLY+K EDFVSLKRGFQEIEKPDVYSWT
Subjt:  QLKRSSRYFDSLQLFTQIHSSHCFNIKPDHYNLSTTLAVCANFRDIAFGSQLHGYAVRSGLKFYPHVANTILSLYAKTEDFVSLKRGFQEIEKPDVYSWT

Query:  TLLSACTKLGHVEYADEMFDTMPKGNVACWNAMITGSAESGHDWIAMNTFYEMHKMGVKPDNYSFACILSLCTKEIKDLGRQVHSLVIKAGYLSKTSVIN
        TLLSAC K+GH+EYA EMFD MPKGNVACWNAMITGSAESGHDW+AMNTFYEMHKMGVKPDNYSFACILSLCTKEI+DLGRQVHS VIKAGYL KTSVIN
Subjt:  TLLSACTKLGHVEYADEMFDTMPKGNVACWNAMITGSAESGHDWIAMNTFYEMHKMGVKPDNYSFACILSLCTKEIKDLGRQVHSLVIKAGYLSKTSVIN

Query:  ALITMYFSIENLEDAYEVFEGTEAEVHDQITYNVMIDGLVCIRRNEEALIMFKDMKRACLSPTELTFVSIMSSCSFIRVGQQVHSQAIKLGFESFTLVGN
        ALITMYFSIENLEDAYEVFEGTE+EVHDQITYNVMIDGLVCIRRNEEALIMFKDMKRACLSPTELTFVSIMSSCS IRV QQVHSQAIKLGFESFTLVGN
Subjt:  ALITMYFSIENLEDAYEVFEGTEAEVHDQITYNVMIDGLVCIRRNEEALIMFKDMKRACLSPTELTFVSIMSSCSFIRVGQQVHSQAIKLGFESFTLVGN

Query:  STITMYSSCGEFQAANTVFQTLRKKDLISWNAIISSYVQGNFGKSAVLAFLQMQRTGIGPDEFTFGSLLGVSEFMEIVEMAHAFVYKNGLILVIEILNAL
        STITMYSSCGEFQAAN VFQ L +KDLISWNAIISSYVQGNFGKSAVLAFLQMQRTGIGPDEFTFGSLLGVSEF+EI+EM HAFVYKNGLILVIEILNAL
Subjt:  STITMYSSCGEFQAANTVFQTLRKKDLISWNAIISSYVQGNFGKSAVLAFLQMQRTGIGPDEFTFGSLLGVSEFMEIVEMAHAFVYKNGLILVIEILNAL

Query:  VSAYARCRKITQSHQVFSEINSKNLISWNTVIYGFLLNGLPLQALGHFSKLIMSKLKPSTFTLSIVLSICANISTLDIGKQIHGYILRSGNFSETSICNG
        VSAYA+CRK+ QSHQVFSEINSKNLISWNTVIYGFLLNGLP QAL HFSKLIMSKLKPSTFTLSIVLSICANISTLDIGKQIHGYILRSGN SETS+CNG
Subjt:  VSAYARCRKITQSHQVFSEINSKNLISWNTVIYGFLLNGLPLQALGHFSKLIMSKLKPSTFTLSIVLSICANISTLDIGKQIHGYILRSGNFSETSICNG

Query:  LITMYSKCGVLDCSLRIFNVMIERDIVSWNSVISAYAQHGQGKEAVDCFKAMQDMSSIMPDQATFTTILSACSHAGLVDEACQILDTMLIDYHVVPSVDQ
        LITMYSKCG+L  SL+ FNVMIERDIVSWNS+ISAYAQHGQGKEAV CFKAM+DM SIMPDQATFTTILSACSHAGLV+EACQILDTMLIDYHVVPS+DQ
Subjt:  LITMYSKCGVLDCSLRIFNVMIERDIVSWNSVISAYAQHGQGKEAVDCFKAMQDMSSIMPDQATFTTILSACSHAGLVDEACQILDTMLIDYHVVPSVDQ

Query:  LSCIVDLIGRSGYIDQAESVIESAQYGDHTHVWWALFSACAAHENLRLGRIVARILLEKERENPSVYVVLSNIYASAGCWEEAANVRELIKKTGSLKQPG
        LSCIVDLIGRSGYIDQAESVIESAQYG+HTHVWWALFSACAAHENLRLGRIVARILLEKERENPSVYVVLSNIYASAGCWEEAANVRELIKKTGS+KQPG
Subjt:  LSCIVDLIGRSGYIDQAESVIESAQYGDHTHVWWALFSACAAHENLRLGRIVARILLEKERENPSVYVVLSNIYASAGCWEEAANVRELIKKTGSLKQPG

Query:  CSWIR
        CSWIR
Subjt:  CSWIR

XP_038899339.1 pentatricopeptide repeat-containing protein At3g49740 isoform X1 [Benincasa hispida]0.0e+0092.9Show/hide
Query:  QLKRSSRYFDSLQLFTQIHSSHCFNIKPDHYNLSTTLAVCANFRDIAFGSQLHGYAVRSGLKFYPHVANTILSLYAKTEDFVSLKRGFQEIEKPDVYSWT
        +LK SSRYFDSLQLFTQIHSSHCFNIKPDHYNLST LAVCANFRDI FGSQLHGYA+RSG KFYPHVANTILSLYAKTED  SLKRGFQEIEKPDVYSWT
Subjt:  QLKRSSRYFDSLQLFTQIHSSHCFNIKPDHYNLSTTLAVCANFRDIAFGSQLHGYAVRSGLKFYPHVANTILSLYAKTEDFVSLKRGFQEIEKPDVYSWT

Query:  TLLSACTKLGHVEYADEMFDTMPKGNVACWNAMITGSAESGHDWIAMNTFYEMHKMGVKPDNYSFACILSLCTKEIKDLGRQVHSLVIKAGYLSKTSVIN
        TLLSACTKLGH+EYADE+FD MPK NVACWNAMITG AESGHDW+A+NTFYEMHKMGVKPDNYSFACILSLCTKEI+DLGRQVHSLVIKAGYL KTSVIN
Subjt:  TLLSACTKLGHVEYADEMFDTMPKGNVACWNAMITGSAESGHDWIAMNTFYEMHKMGVKPDNYSFACILSLCTKEIKDLGRQVHSLVIKAGYLSKTSVIN

Query:  ALITMYFSIENLEDAYEVFEGTEAEVHDQITYNVMIDGLVCIRRNEEALIMFKDMKRACLSPTELTFVSIMSSCSFIRVGQQVHSQAIKLGFESFTLVGN
        ALITMYFSIENLEDAYEVFEGTEA+V DQITYNVMIDGL C+RR EEALIMF DMKRACLSPTELTFVSIMSSCSFIRV QQVHSQAIKLGFESFTLVGN
Subjt:  ALITMYFSIENLEDAYEVFEGTEAEVHDQITYNVMIDGLVCIRRNEEALIMFKDMKRACLSPTELTFVSIMSSCSFIRVGQQVHSQAIKLGFESFTLVGN

Query:  STITMYSSCGEFQAANTVFQTLRKKDLISWNAIISSYVQGNFGKSAVLAFLQMQRTGIGPDEFTFGSLLGVSEFMEIVEMAHAFVYKNGLILVIEILNAL
        STITMYSSCGEFQAAN VFQ LR+KDL+SWNAIISSY+QGNFGKSAVLAFLQMQRTGIGPDEFTFGSLLGVSEF+EIVEM HAFV+KNGLILVIEILNAL
Subjt:  STITMYSSCGEFQAANTVFQTLRKKDLISWNAIISSYVQGNFGKSAVLAFLQMQRTGIGPDEFTFGSLLGVSEFMEIVEMAHAFVYKNGLILVIEILNAL

Query:  VSAYARCRKITQSHQVFSEINSKNLISWNTVIYGFLLNGLPLQALGHFSKLIMSKLKPSTFTLSIVLSICANISTLDIGKQIHGYILRSGNFSETSICNG
        VSAYA+CRKI QSHQVFSEINSKNLISWN+VI GFLLNGLPLQAL HFSKLIMSKLKPSTFTLSIVLSICANISTLDIGKQIHGYILRSGNFSETSICNG
Subjt:  VSAYARCRKITQSHQVFSEINSKNLISWNTVIYGFLLNGLPLQALGHFSKLIMSKLKPSTFTLSIVLSICANISTLDIGKQIHGYILRSGNFSETSICNG

Query:  LITMYSKCGVLDCSLRIFNVMIERDIVSWNSVISAYAQHGQGKEAVDCFKAMQDMSSIMPDQATFTTILSACSHAGLVDEACQILDTMLIDYHVVPSVDQ
        LITMYSKCG+LD SL+IFNVMIERDIVSWNSVISAYAQHG+GKEAVDCFKAMQD+ SIMPDQATFTTILSACSHAGLVDEACQILDTMLIDYHVVPSVDQ
Subjt:  LITMYSKCGVLDCSLRIFNVMIERDIVSWNSVISAYAQHGQGKEAVDCFKAMQDMSSIMPDQATFTTILSACSHAGLVDEACQILDTMLIDYHVVPSVDQ

Query:  LSCIVDLIGRSGYIDQAESVIESAQYGDHTHVWWALFSACAAHENLRLGRIVARILLEKERENPSVYVVLSNIYASAGCWEEAANVRELIKKTGSLKQPG
        LSCIVDLIGR GYIDQAESVIESA+YG+HTHVWWALFSACAAHENLRLGRIVA ILLEKERENPSVYVVLSNIYASAGCWEEAANVR+LIKKTG++KQPG
Subjt:  LSCIVDLIGRSGYIDQAESVIESAQYGDHTHVWWALFSACAAHENLRLGRIVARILLEKERENPSVYVVLSNIYASAGCWEEAANVRELIKKTGSLKQPG

Query:  CSWI
        CSWI
Subjt:  CSWI

XP_038899340.1 pentatricopeptide repeat-containing protein At3g49740 isoform X2 [Benincasa hispida]0.0e+0092.91Show/hide
Query:  QLKRSSRYFDSLQLFTQIHSSHCFNIKPDHYNLSTTLAVCANFRDIAFGSQLHGYAVRSGLKFYPHVANTILSLYAKTEDFVSLKRGFQEIEKPDVYSWT
        +LK SSRYFDSLQLFTQIHSSHCFNIKPDHYNLST LAVCANFRDI FGSQLHGYA+RSG KFYPHVANTILSLYAKTED  SLKRGFQEIEKPDVYSWT
Subjt:  QLKRSSRYFDSLQLFTQIHSSHCFNIKPDHYNLSTTLAVCANFRDIAFGSQLHGYAVRSGLKFYPHVANTILSLYAKTEDFVSLKRGFQEIEKPDVYSWT

Query:  TLLSACTKLGHVEYADEMFDTMPKGNVACWNAMITGSAESGHDWIAMNTFYEMHKMGVKPDNYSFACILSLCTKEIKDLGRQVHSLVIKAGYLSKTSVIN
        TLLSACTKLGH+EYADE+FD MPK NVACWNAMITG AESGHDW+A+NTFYEMHKMGVKPDNYSFACILSLCTKEI+DLGRQVHSLVIKAGYL KTSVIN
Subjt:  TLLSACTKLGHVEYADEMFDTMPKGNVACWNAMITGSAESGHDWIAMNTFYEMHKMGVKPDNYSFACILSLCTKEIKDLGRQVHSLVIKAGYLSKTSVIN

Query:  ALITMYFSIENLEDAYEVFEGTEAEVHDQITYNVMIDGLVCIRRNEEALIMFKDMKRACLSPTELTFVSIMSSCSFIRVGQQVHSQAIKLGFESFTLVGN
        ALITMYFSIENLEDAYEVFEGTEA+V DQITYNVMIDGL C+RR EEALIMF DMKRACLSPTELTFVSIMSSCSFIRV QQVHSQAIKLGFESFTLVGN
Subjt:  ALITMYFSIENLEDAYEVFEGTEAEVHDQITYNVMIDGLVCIRRNEEALIMFKDMKRACLSPTELTFVSIMSSCSFIRVGQQVHSQAIKLGFESFTLVGN

Query:  STITMYSSCGEFQAANTVFQTLRKKDLISWNAIISSYVQGNFGKSAVLAFLQMQRTGIGPDEFTFGSLLGVSEFMEIVEMAHAFVYKNGLILVIEILNAL
        STITMYSSCGEFQAAN VFQ LR+KDL+SWNAIISSY+QGNFGKSAVLAFLQMQRTGIGPDEFTFGSLLGVSEF+EIVEM HAFV+KNGLILVIEILNAL
Subjt:  STITMYSSCGEFQAANTVFQTLRKKDLISWNAIISSYVQGNFGKSAVLAFLQMQRTGIGPDEFTFGSLLGVSEFMEIVEMAHAFVYKNGLILVIEILNAL

Query:  VSAYARCRKITQSHQVFSEINSKNLISWNTVIYGFLLNGLPLQALGHFSKLIMSKLKPSTFTLSIVLSICANISTLDIGKQIHGYILRSGNFSETSICNG
        VSAYA+CRKI QSHQVFSEINSKNLISWN+VI GFLLNGLPLQAL HFSKLIMSKLKPSTFTLSIVLSICANISTLDIGKQIHGYILRSGNFSETSICNG
Subjt:  VSAYARCRKITQSHQVFSEINSKNLISWNTVIYGFLLNGLPLQALGHFSKLIMSKLKPSTFTLSIVLSICANISTLDIGKQIHGYILRSGNFSETSICNG

Query:  LITMYSKCGVLDCSLRIFNVMIERDIVSWNSVISAYAQHGQGKEAVDCFKAMQDMSSIMPDQATFTTILSACSHAGLVDEACQILDTMLIDYHVVPSVDQ
        LITMYSKCG+LD SL+IFNVMIERDIVSWNSVISAYAQHG+GKEAVDCFKAMQD+ SIMPDQATFTTILSACSHAGLVDEACQILDTMLIDYHVVPSVDQ
Subjt:  LITMYSKCGVLDCSLRIFNVMIERDIVSWNSVISAYAQHGQGKEAVDCFKAMQDMSSIMPDQATFTTILSACSHAGLVDEACQILDTMLIDYHVVPSVDQ

Query:  LSCIVDLIGRSGYIDQAESVIESAQYGDHTHVWWALFSACAAHENLRLGRIVARILLEKERENPSVYVVLSNIYASAGCWEEAANVRELIKKTGSLKQPG
        LSCIVDLIGR GYIDQAESVIESA+YG+HTHVWWALFSACAAHENLRLGRIVA ILLEKERENPSVYVVLSNIYASAGCWEEAANVR+LIKKTG++KQPG
Subjt:  LSCIVDLIGRSGYIDQAESVIESAQYGDHTHVWWALFSACAAHENLRLGRIVARILLEKERENPSVYVVLSNIYASAGCWEEAANVRELIKKTGSLKQPG

Query:  CSWIR
        CSWIR
Subjt:  CSWIR

TrEMBL top hitse value%identityAlignment
A0A0A0L107 Uncharacterized protein0.0e+0092.62Show/hide
Query:  QLKRSSRYFDSLQLFTQIHSSHCFNIKPDHYNLSTTLAVCANFRDIAFGSQLHGYAVRSGLKFYPHVANTILSLYAKTEDFVSLKRGFQEIEKPDVYSWT
        +LKRSSRY DSLQLFTQIHSSHCFNIKPDHYNLSTTLAVCANFRDIAFGSQLHGYA+RSGLKFYPHVANT+LSLYAK EDFVSLKRGFQEIEKPDVYSWT
Subjt:  QLKRSSRYFDSLQLFTQIHSSHCFNIKPDHYNLSTTLAVCANFRDIAFGSQLHGYAVRSGLKFYPHVANTILSLYAKTEDFVSLKRGFQEIEKPDVYSWT

Query:  TLLSACTKLGHVEYADEMFDTMPKGNVACWNAMITGSAESGHDWIAMNTFYEMHKMGVKPDNYSFACILSLCTKEIKDLGRQVHSLVIKAGYLSKTSVIN
        TLLSACTK+GH+EYA EMFD MPKGNVACWNAMITGSAESG DW+AMNTFYEMHKMGVKPDNYSFACILSLCTKEI+DLGRQVHS VIKAGYL KTSV+N
Subjt:  TLLSACTKLGHVEYADEMFDTMPKGNVACWNAMITGSAESGHDWIAMNTFYEMHKMGVKPDNYSFACILSLCTKEIKDLGRQVHSLVIKAGYLSKTSVIN

Query:  ALITMYFSIENLEDAYEVFEGTEAEVHDQITYNVMIDGLVCIRRNEEALIMFKDMKRACLSPTELTFVSIMSSCSFIRVGQQVHSQAIKLGFESFTLVGN
        ALITMYFSIENLEDAYEVFEGTE+EV DQITYNVMIDGLVC+RRNEEALIMFKDMKRACLSPTELTFVSIMSSCS I+V QQVH QAIKLGFESFTLVGN
Subjt:  ALITMYFSIENLEDAYEVFEGTEAEVHDQITYNVMIDGLVCIRRNEEALIMFKDMKRACLSPTELTFVSIMSSCSFIRVGQQVHSQAIKLGFESFTLVGN

Query:  STITMYSSCGEFQAANTVFQTLRKKDLISWNAIISSYVQGNFGKSAVLAFLQMQRTGIGPDEFTFGSLLGVSEFMEIVEMAHAFVYKNGLILVIEILNAL
        STITMY+SCGEFQAAN VFQ L +KDLISWNAIISSYVQGNFGKSAVLAFLQMQRTGIGPDEFTFGSLLGVSEF+EIVEM HA+VYKNGLIL+IEILNAL
Subjt:  STITMYSSCGEFQAANTVFQTLRKKDLISWNAIISSYVQGNFGKSAVLAFLQMQRTGIGPDEFTFGSLLGVSEFMEIVEMAHAFVYKNGLILVIEILNAL

Query:  VSAYARCRKITQSHQVFSEINSKNLISWNTVIYGFLLNGLPLQALGHFSKLIMSKLKPSTFTLSIVLSICANISTLDIGKQIHGYILRSGNFSETSICNG
        VSAYA+CRK+ QS QVFSEINSKN+ISWNTVIYGFLLNGLPLQAL HFSKLIMSKLKPSTFTLSIVLSICANISTLDIGKQIHGYILRSGN SETS+CNG
Subjt:  VSAYARCRKITQSHQVFSEINSKNLISWNTVIYGFLLNGLPLQALGHFSKLIMSKLKPSTFTLSIVLSICANISTLDIGKQIHGYILRSGNFSETSICNG

Query:  LITMYSKCGVLDCSLRIFNVMIERDIVSWNSVISAYAQHGQGKEAVDCFKAMQDMSSIMPDQATFTTILSACSHAGLVDEACQILDTMLIDYHVVPSVDQ
        LITMYSKCG+L  SLR FNVMIERDIVSWNS+ISAYAQHGQGKEAVDCFKAMQDM SIMPDQATFTTILSACSHAGLV+EACQILD MLIDY  VPSVDQ
Subjt:  LITMYSKCGVLDCSLRIFNVMIERDIVSWNSVISAYAQHGQGKEAVDCFKAMQDMSSIMPDQATFTTILSACSHAGLVDEACQILDTMLIDYHVVPSVDQ

Query:  LSCIVDLIGRSGYIDQAESVIESAQYGDHTHVWWALFSACAAHENLRLGRIVARILLEKERENPSVYVVLSNIYASAGCWEEAANVRELIKKTGSLKQPG
        LSCIVDLIGRSGYIDQAESVIESAQYG+HTHVWWALFSACAAHENLRLGRIVARILLEKER+NPSVYVVLSNIYASAGCWEEAANVRELIKKTGS+KQPG
Subjt:  LSCIVDLIGRSGYIDQAESVIESAQYGDHTHVWWALFSACAAHENLRLGRIVARILLEKERENPSVYVVLSNIYASAGCWEEAANVRELIKKTGSLKQPG

Query:  CSWIR
        CSWIR
Subjt:  CSWIR

A0A1S4DXC2 pentatricopeptide repeat-containing protein At3g49740 isoform X10.0e+0093.07Show/hide
Query:  QLKRSSRYFDSLQLFTQIHSSHCFNIKPDHYNLSTTLAVCANFRDIAFGSQLHGYAVRSGLKFYPHVANTILSLYAKTEDFVSLKRGFQEIEKPDVYSWT
        +LKRSSRY DSLQLFTQIHSS+C NIKPDHYNLSTTLAVCANFRDIAFGSQLHGYA+RSGLKFYPHVANT+LSLY+K EDFVSLKRGFQEIEKPDVYSWT
Subjt:  QLKRSSRYFDSLQLFTQIHSSHCFNIKPDHYNLSTTLAVCANFRDIAFGSQLHGYAVRSGLKFYPHVANTILSLYAKTEDFVSLKRGFQEIEKPDVYSWT

Query:  TLLSACTKLGHVEYADEMFDTMPKGNVACWNAMITGSAESGHDWIAMNTFYEMHKMGVKPDNYSFACILSLCTKEIKDLGRQVHSLVIKAGYLSKTSVIN
        TLLSAC K+GH+EYA EMFD MPKGNVACWNAMITGSAESGHDW+AMNTFYEMHKMGVKPDNYSFACILSLCTKEI+DLGRQVHS VIKAGYL KTSVIN
Subjt:  TLLSACTKLGHVEYADEMFDTMPKGNVACWNAMITGSAESGHDWIAMNTFYEMHKMGVKPDNYSFACILSLCTKEIKDLGRQVHSLVIKAGYLSKTSVIN

Query:  ALITMYFSIENLEDAYEVFEGTEAEVHDQITYNVMIDGLVCIRRNEEALIMFKDMKRACLSPTELTFVSIMSSCSFIRVGQQVHSQAIKLGFESFTLVGN
        ALITMYFSIENLEDAYEVFEGTE+EVHDQITYNVMIDGLVCIRRNEEALIMFKDMKRACLSPTELTFVSIMSSCS IRV QQVHSQAIKLGFESFTLVGN
Subjt:  ALITMYFSIENLEDAYEVFEGTEAEVHDQITYNVMIDGLVCIRRNEEALIMFKDMKRACLSPTELTFVSIMSSCSFIRVGQQVHSQAIKLGFESFTLVGN

Query:  STITMYSSCGEFQAANTVFQTLRKKDLISWNAIISSYVQGNFGKSAVLAFLQMQRTGIGPDEFTFGSLLGVSEFMEIVEMAHAFVYKNGLILVIEILNAL
        STITMYSSCGEFQAAN VFQ L +KDLISWNAIISSYVQGNFGKSAVLAFLQMQRTGIGPDEFTFGSLLGVSEF+EI+EM HAFVYKNGLILVIEILNAL
Subjt:  STITMYSSCGEFQAANTVFQTLRKKDLISWNAIISSYVQGNFGKSAVLAFLQMQRTGIGPDEFTFGSLLGVSEFMEIVEMAHAFVYKNGLILVIEILNAL

Query:  VSAYARCRKITQSHQVFSEINSKNLISWNTVIYGFLLNGLPLQALGHFSKLIMSKLKPSTFTLSIVLSICANISTLDIGKQIHGYILRSGNFSETSICNG
        VSAYA+CRK+ QSHQVFSEINSKNLISWNTVIYGFLLNGLP QAL HFSKLIMSKLKPSTFTLSIVLSICANISTLDIGKQIHGYILRSGN SETS+CNG
Subjt:  VSAYARCRKITQSHQVFSEINSKNLISWNTVIYGFLLNGLPLQALGHFSKLIMSKLKPSTFTLSIVLSICANISTLDIGKQIHGYILRSGNFSETSICNG

Query:  LITMYSKCGVLDCSLRIFNVMIERDIVSWNSVISAYAQHGQGKEAVDCFKAMQDMSSIMPDQATFTTILSACSHAGLVDEACQILDTMLIDYHVVPSVDQ
        LITMYSKCG+L  SL+ FNVMIERDIVSWNS+ISAYAQHGQGKEAV CFKAM+DM SIMPDQATFTTILSACSHAGLV+EACQILDTMLIDYHVVPS+DQ
Subjt:  LITMYSKCGVLDCSLRIFNVMIERDIVSWNSVISAYAQHGQGKEAVDCFKAMQDMSSIMPDQATFTTILSACSHAGLVDEACQILDTMLIDYHVVPSVDQ

Query:  LSCIVDLIGRSGYIDQAESVIESAQYGDHTHVWWALFSACAAHENLRLGRIVARILLEKERENPSVYVVLSNIYASAGCWEEAANVRELIKKTGSLKQPG
        LSCIVDLIGRSGYIDQAESVIESAQYG+HTHVWWALFSACAAHENLRLGRIVARILLEKERENPSVYVVLSNIYASAGCWEEAANVRELIKKTGS+KQPG
Subjt:  LSCIVDLIGRSGYIDQAESVIESAQYGDHTHVWWALFSACAAHENLRLGRIVARILLEKERENPSVYVVLSNIYASAGCWEEAANVRELIKKTGSLKQPG

Query:  CSWIRIS
        CSWI  S
Subjt:  CSWIRIS

A0A1S4DXD5 pentatricopeptide repeat-containing protein At3g49740 isoform X20.0e+0093.32Show/hide
Query:  QLKRSSRYFDSLQLFTQIHSSHCFNIKPDHYNLSTTLAVCANFRDIAFGSQLHGYAVRSGLKFYPHVANTILSLYAKTEDFVSLKRGFQEIEKPDVYSWT
        +LKRSSRY DSLQLFTQIHSS+C NIKPDHYNLSTTLAVCANFRDIAFGSQLHGYA+RSGLKFYPHVANT+LSLY+K EDFVSLKRGFQEIEKPDVYSWT
Subjt:  QLKRSSRYFDSLQLFTQIHSSHCFNIKPDHYNLSTTLAVCANFRDIAFGSQLHGYAVRSGLKFYPHVANTILSLYAKTEDFVSLKRGFQEIEKPDVYSWT

Query:  TLLSACTKLGHVEYADEMFDTMPKGNVACWNAMITGSAESGHDWIAMNTFYEMHKMGVKPDNYSFACILSLCTKEIKDLGRQVHSLVIKAGYLSKTSVIN
        TLLSAC K+GH+EYA EMFD MPKGNVACWNAMITGSAESGHDW+AMNTFYEMHKMGVKPDNYSFACILSLCTKEI+DLGRQVHS VIKAGYL KTSVIN
Subjt:  TLLSACTKLGHVEYADEMFDTMPKGNVACWNAMITGSAESGHDWIAMNTFYEMHKMGVKPDNYSFACILSLCTKEIKDLGRQVHSLVIKAGYLSKTSVIN

Query:  ALITMYFSIENLEDAYEVFEGTEAEVHDQITYNVMIDGLVCIRRNEEALIMFKDMKRACLSPTELTFVSIMSSCSFIRVGQQVHSQAIKLGFESFTLVGN
        ALITMYFSIENLEDAYEVFEGTE+EVHDQITYNVMIDGLVCIRRNEEALIMFKDMKRACLSPTELTFVSIMSSCS IRV QQVHSQAIKLGFESFTLVGN
Subjt:  ALITMYFSIENLEDAYEVFEGTEAEVHDQITYNVMIDGLVCIRRNEEALIMFKDMKRACLSPTELTFVSIMSSCSFIRVGQQVHSQAIKLGFESFTLVGN

Query:  STITMYSSCGEFQAANTVFQTLRKKDLISWNAIISSYVQGNFGKSAVLAFLQMQRTGIGPDEFTFGSLLGVSEFMEIVEMAHAFVYKNGLILVIEILNAL
        STITMYSSCGEFQAAN VFQ L +KDLISWNAIISSYVQGNFGKSAVLAFLQMQRTGIGPDEFTFGSLLGVSEF+EI+EM HAFVYKNGLILVIEILNAL
Subjt:  STITMYSSCGEFQAANTVFQTLRKKDLISWNAIISSYVQGNFGKSAVLAFLQMQRTGIGPDEFTFGSLLGVSEFMEIVEMAHAFVYKNGLILVIEILNAL

Query:  VSAYARCRKITQSHQVFSEINSKNLISWNTVIYGFLLNGLPLQALGHFSKLIMSKLKPSTFTLSIVLSICANISTLDIGKQIHGYILRSGNFSETSICNG
        VSAYA+CRK+ QSHQVFSEINSKNLISWNTVIYGFLLNGLP QAL HFSKLIMSKLKPSTFTLSIVLSICANISTLDIGKQIHGYILRSGN SETS+CNG
Subjt:  VSAYARCRKITQSHQVFSEINSKNLISWNTVIYGFLLNGLPLQALGHFSKLIMSKLKPSTFTLSIVLSICANISTLDIGKQIHGYILRSGNFSETSICNG

Query:  LITMYSKCGVLDCSLRIFNVMIERDIVSWNSVISAYAQHGQGKEAVDCFKAMQDMSSIMPDQATFTTILSACSHAGLVDEACQILDTMLIDYHVVPSVDQ
        LITMYSKCG+L  SL+ FNVMIERDIVSWNS+ISAYAQHGQGKEAV CFKAM+DM SIMPDQATFTTILSACSHAGLV+EACQILDTMLIDYHVVPS+DQ
Subjt:  LITMYSKCGVLDCSLRIFNVMIERDIVSWNSVISAYAQHGQGKEAVDCFKAMQDMSSIMPDQATFTTILSACSHAGLVDEACQILDTMLIDYHVVPSVDQ

Query:  LSCIVDLIGRSGYIDQAESVIESAQYGDHTHVWWALFSACAAHENLRLGRIVARILLEKERENPSVYVVLSNIYASAGCWEEAANVRELIKKTGSLKQPG
        LSCIVDLIGRSGYIDQAESVIESAQYG+HTHVWWALFSACAAHENLRLGRIVARILLEKERENPSVYVVLSNIYASAGCWEEAANVRELIKKTGS+KQPG
Subjt:  LSCIVDLIGRSGYIDQAESVIESAQYGDHTHVWWALFSACAAHENLRLGRIVARILLEKERENPSVYVVLSNIYASAGCWEEAANVRELIKKTGSLKQPG

Query:  CSWI
        CSWI
Subjt:  CSWI

A0A1S4DY49 pentatricopeptide repeat-containing protein At3g49740 isoform X30.0e+0093.33Show/hide
Query:  QLKRSSRYFDSLQLFTQIHSSHCFNIKPDHYNLSTTLAVCANFRDIAFGSQLHGYAVRSGLKFYPHVANTILSLYAKTEDFVSLKRGFQEIEKPDVYSWT
        +LKRSSRY DSLQLFTQIHSS+C NIKPDHYNLSTTLAVCANFRDIAFGSQLHGYA+RSGLKFYPHVANT+LSLY+K EDFVSLKRGFQEIEKPDVYSWT
Subjt:  QLKRSSRYFDSLQLFTQIHSSHCFNIKPDHYNLSTTLAVCANFRDIAFGSQLHGYAVRSGLKFYPHVANTILSLYAKTEDFVSLKRGFQEIEKPDVYSWT

Query:  TLLSACTKLGHVEYADEMFDTMPKGNVACWNAMITGSAESGHDWIAMNTFYEMHKMGVKPDNYSFACILSLCTKEIKDLGRQVHSLVIKAGYLSKTSVIN
        TLLSAC K+GH+EYA EMFD MPKGNVACWNAMITGSAESGHDW+AMNTFYEMHKMGVKPDNYSFACILSLCTKEI+DLGRQVHS VIKAGYL KTSVIN
Subjt:  TLLSACTKLGHVEYADEMFDTMPKGNVACWNAMITGSAESGHDWIAMNTFYEMHKMGVKPDNYSFACILSLCTKEIKDLGRQVHSLVIKAGYLSKTSVIN

Query:  ALITMYFSIENLEDAYEVFEGTEAEVHDQITYNVMIDGLVCIRRNEEALIMFKDMKRACLSPTELTFVSIMSSCSFIRVGQQVHSQAIKLGFESFTLVGN
        ALITMYFSIENLEDAYEVFEGTE+EVHDQITYNVMIDGLVCIRRNEEALIMFKDMKRACLSPTELTFVSIMSSCS IRV QQVHSQAIKLGFESFTLVGN
Subjt:  ALITMYFSIENLEDAYEVFEGTEAEVHDQITYNVMIDGLVCIRRNEEALIMFKDMKRACLSPTELTFVSIMSSCSFIRVGQQVHSQAIKLGFESFTLVGN

Query:  STITMYSSCGEFQAANTVFQTLRKKDLISWNAIISSYVQGNFGKSAVLAFLQMQRTGIGPDEFTFGSLLGVSEFMEIVEMAHAFVYKNGLILVIEILNAL
        STITMYSSCGEFQAAN VFQ L +KDLISWNAIISSYVQGNFGKSAVLAFLQMQRTGIGPDEFTFGSLLGVSEF+EI+EM HAFVYKNGLILVIEILNAL
Subjt:  STITMYSSCGEFQAANTVFQTLRKKDLISWNAIISSYVQGNFGKSAVLAFLQMQRTGIGPDEFTFGSLLGVSEFMEIVEMAHAFVYKNGLILVIEILNAL

Query:  VSAYARCRKITQSHQVFSEINSKNLISWNTVIYGFLLNGLPLQALGHFSKLIMSKLKPSTFTLSIVLSICANISTLDIGKQIHGYILRSGNFSETSICNG
        VSAYA+CRK+ QSHQVFSEINSKNLISWNTVIYGFLLNGLP QAL HFSKLIMSKLKPSTFTLSIVLSICANISTLDIGKQIHGYILRSGN SETS+CNG
Subjt:  VSAYARCRKITQSHQVFSEINSKNLISWNTVIYGFLLNGLPLQALGHFSKLIMSKLKPSTFTLSIVLSICANISTLDIGKQIHGYILRSGNFSETSICNG

Query:  LITMYSKCGVLDCSLRIFNVMIERDIVSWNSVISAYAQHGQGKEAVDCFKAMQDMSSIMPDQATFTTILSACSHAGLVDEACQILDTMLIDYHVVPSVDQ
        LITMYSKCG+L  SL+ FNVMIERDIVSWNS+ISAYAQHGQGKEAV CFKAM+DM SIMPDQATFTTILSACSHAGLV+EACQILDTMLIDYHVVPS+DQ
Subjt:  LITMYSKCGVLDCSLRIFNVMIERDIVSWNSVISAYAQHGQGKEAVDCFKAMQDMSSIMPDQATFTTILSACSHAGLVDEACQILDTMLIDYHVVPSVDQ

Query:  LSCIVDLIGRSGYIDQAESVIESAQYGDHTHVWWALFSACAAHENLRLGRIVARILLEKERENPSVYVVLSNIYASAGCWEEAANVRELIKKTGSLKQPG
        LSCIVDLIGRSGYIDQAESVIESAQYG+HTHVWWALFSACAAHENLRLGRIVARILLEKERENPSVYVVLSNIYASAGCWEEAANVRELIKKTGS+KQPG
Subjt:  LSCIVDLIGRSGYIDQAESVIESAQYGDHTHVWWALFSACAAHENLRLGRIVARILLEKERENPSVYVVLSNIYASAGCWEEAANVRELIKKTGSLKQPG

Query:  CSWIR
        CSWIR
Subjt:  CSWIR

A0A5D3DVT9 Pentatricopeptide repeat-containing protein0.0e+0093.33Show/hide
Query:  QLKRSSRYFDSLQLFTQIHSSHCFNIKPDHYNLSTTLAVCANFRDIAFGSQLHGYAVRSGLKFYPHVANTILSLYAKTEDFVSLKRGFQEIEKPDVYSWT
        +LKRSSRY DSLQLFTQIHSS+C NIKPDHYNLSTTLAVCANFRDIAFGSQLHGYA+RSGLKFYPHVANT+LSLY+K EDFVSLKRGFQEIEKPDVYSWT
Subjt:  QLKRSSRYFDSLQLFTQIHSSHCFNIKPDHYNLSTTLAVCANFRDIAFGSQLHGYAVRSGLKFYPHVANTILSLYAKTEDFVSLKRGFQEIEKPDVYSWT

Query:  TLLSACTKLGHVEYADEMFDTMPKGNVACWNAMITGSAESGHDWIAMNTFYEMHKMGVKPDNYSFACILSLCTKEIKDLGRQVHSLVIKAGYLSKTSVIN
        TLLSAC K+GH+EYA EMFD MPKGNVACWNAMITGSAESGHDW+AMNTFYEMHKMGVKPDNYSFACILSLCTKEI+DLGRQVHS VIKAGYL KTSVIN
Subjt:  TLLSACTKLGHVEYADEMFDTMPKGNVACWNAMITGSAESGHDWIAMNTFYEMHKMGVKPDNYSFACILSLCTKEIKDLGRQVHSLVIKAGYLSKTSVIN

Query:  ALITMYFSIENLEDAYEVFEGTEAEVHDQITYNVMIDGLVCIRRNEEALIMFKDMKRACLSPTELTFVSIMSSCSFIRVGQQVHSQAIKLGFESFTLVGN
        ALITMYFSIENLEDAYEVFEGTE+EVHDQITYNVMIDGLVCIRRNEEALIMFKDMKRACLSPTELTFVSIMSSCS IRV QQVHSQAIKLGFESFTLVGN
Subjt:  ALITMYFSIENLEDAYEVFEGTEAEVHDQITYNVMIDGLVCIRRNEEALIMFKDMKRACLSPTELTFVSIMSSCSFIRVGQQVHSQAIKLGFESFTLVGN

Query:  STITMYSSCGEFQAANTVFQTLRKKDLISWNAIISSYVQGNFGKSAVLAFLQMQRTGIGPDEFTFGSLLGVSEFMEIVEMAHAFVYKNGLILVIEILNAL
        STITMYSSCGEFQAAN VFQ L +KDLISWNAIISSYVQGNFGKSAVLAFLQMQRTGIGPDEFTFGSLLGVSEF+EI+EM HAFVYKNGLILVIEILNAL
Subjt:  STITMYSSCGEFQAANTVFQTLRKKDLISWNAIISSYVQGNFGKSAVLAFLQMQRTGIGPDEFTFGSLLGVSEFMEIVEMAHAFVYKNGLILVIEILNAL

Query:  VSAYARCRKITQSHQVFSEINSKNLISWNTVIYGFLLNGLPLQALGHFSKLIMSKLKPSTFTLSIVLSICANISTLDIGKQIHGYILRSGNFSETSICNG
        VSAYA+CRK+ QSHQVFSEINSKNLISWNTVIYGFLLNGLP QAL HFSKLIMSKLKPSTFTLSIVLSICANISTLDIGKQIHGYILRSGN SETS+CNG
Subjt:  VSAYARCRKITQSHQVFSEINSKNLISWNTVIYGFLLNGLPLQALGHFSKLIMSKLKPSTFTLSIVLSICANISTLDIGKQIHGYILRSGNFSETSICNG

Query:  LITMYSKCGVLDCSLRIFNVMIERDIVSWNSVISAYAQHGQGKEAVDCFKAMQDMSSIMPDQATFTTILSACSHAGLVDEACQILDTMLIDYHVVPSVDQ
        LITMYSKCG+L  SL+ FNVMIERDIVSWNS+ISAYAQHGQGKEAV CFKAM+DM SIMPDQATFTTILSACSHAGLV+EACQILDTMLIDYHVVPS+DQ
Subjt:  LITMYSKCGVLDCSLRIFNVMIERDIVSWNSVISAYAQHGQGKEAVDCFKAMQDMSSIMPDQATFTTILSACSHAGLVDEACQILDTMLIDYHVVPSVDQ

Query:  LSCIVDLIGRSGYIDQAESVIESAQYGDHTHVWWALFSACAAHENLRLGRIVARILLEKERENPSVYVVLSNIYASAGCWEEAANVRELIKKTGSLKQPG
        LSCIVDLIGRSGYIDQAESVIESAQYG+HTHVWWALFSACAAHENLRLGRIVARILLEKERENPSVYVVLSNIYASAGCWEEAANVRELIKKTGS+KQPG
Subjt:  LSCIVDLIGRSGYIDQAESVIESAQYGDHTHVWWALFSACAAHENLRLGRIVARILLEKERENPSVYVVLSNIYASAGCWEEAANVRELIKKTGSLKQPG

Query:  CSWIR
        CSWIR
Subjt:  CSWIR

SwissProt top hitse value%identityAlignment
Q9LU94 Putative pentatricopeptide repeat-containing protein At3g259705.8e-9834.87Show/hide
Query:  DVYSWTTLLSACTKLGHVEYADEMFDTMPKGNVACWNAMITGSAESGHDWIAMNTFYEMHKMGVKPDNYSFACIL-SLCTKEIKDLGRQVHSLVIKAGYL
        D+Y    +L +  K G + YA+ +FD MPK +   WN MI+G    G    A   F  M + G   D YSF+ +L  + + +  DLG QVH LVIK GY 
Subjt:  DVYSWTTLLSACTKLGHVEYADEMFDTMPKGNVACWNAMITGSAESGHDWIAMNTFYEMHKMGVKPDNYSFACIL-SLCTKEIKDLGRQVHSLVIKAGYL

Query:  SKTSVINALITMYFSIENLEDAYEVFEGTEAEVHDQITYNVMIDGLVCIRRNEEA--LIMFKDMKRACL--SPTELTFVSIMSSCSFIRVGQQVHSQAIK
            V ++L+ MY   E +EDA+E F+  E    + +++N +I G V +R  + A  L+   +MK A    + T    ++++    F  + +QVH++ +K
Subjt:  SKTSVINALITMYFSIENLEDAYEVFEGTEAEVHDQITYNVMIDGLVCIRRNEEA--LIMFKDMKRACL--SPTELTFVSIMSSCSFIRVGQQVHSQAIK

Query:  LGFESFTLVGNSTITMYSSCGEFQAANTVFQTL-RKKDLISWNAIISSYVQGNFGKSAVLAFLQMQRTGIGPDEFTFGSLLGV---SEFMEIVEMAHAFV
        LG +    + N+ I+ Y+ CG    A  VF  L   KDLISWN++I+ + +    +SA   F+QMQR  +  D +T+  LL      E     +  H  V
Subjt:  LGFESFTLVGNSTITMYSSCGEFQAANTVFQTL-RKKDLISWNAIISSYVQGNFGKSAVLAFLQMQRTGIGPDEFTFGSLLGV---SEFMEIVEMAHAFV

Query:  YKNGLILVIEILNALVSAYARCRKITQSH--QVFSEINSKNLISWNTVIYGFLLNGLPLQALGHFSKLIMSKLKPSTFTLSIVLSICANISTLDIGKQIH
         K GL  V    NAL+S Y +    T      +F  + SK+LISWN++I GF   GL   A+  FS L  S++K   +  S +L  C++++TL +G+QIH
Subjt:  YKNGLILVIEILNALVSAYARCRKITQSH--QVFSEINSKNLISWNTVIYGFLLNGLPLQALGHFSKLIMSKLKPSTFTLSIVLSICANISTLDIGKQIH

Query:  GYILRSGNFSETSICNGLITMYSKCGVLDCSLRIF-NVMIERDIVSWNSVISAYAQHGQGKEAVDCFKAMQDMSSIMPDQATFTTILSACSHAGLVDEAC
            +SG  S   + + LI MYSKCG+++ + + F  +  +   V+WN++I  YAQHG G+ ++D F  M +  ++  D  TFT IL+ACSH GL+ E  
Subjt:  GYILRSGNFSETSICNGLITMYSKCGVLDCSLRIF-NVMIERDIVSWNSVISAYAQHGQGKEAVDCFKAMQDMSSIMPDQATFTTILSACSHAGLVDEAC

Query:  QILDTMLIDYHVVPSVDQLSCIVDLIGRSGYIDQAESVIESAQYGDHTHVWWALFSACAAHENLRLGRIVARILLEKERENPSVYVVLSNIYASAGCWEE
        ++L+ M   Y + P ++  +  VDL+GR+G +++A+ +IES        V       C A   + +   VA  LLE E E+   YV LS++Y+    WEE
Subjt:  QILDTMLIDYHVVPSVDQLSCIVDLIGRSGYIDQAESVIESAQYGDHTHVWWALFSACAAHENLRLGRIVARILLEKERENPSVYVVLSNIYASAGCWEE

Query:  AANVRELIKKTGSLKQPGCSWIRISNSI
         A+V++++K+ G  K PG SWI I N +
Subjt:  AANVRELIKKTGSLKQPGCSWIRISNSI

Q9M2Y4 Pentatricopeptide repeat-containing protein At3g497403.3e-21051.41Show/hide
Query:  LKRSSRYFDSLQLFTQIHSSHCFNIKPDHYNLSTTLAVCANFRDIAFGSQLHGYAVRSGLKFYPHVANTILSLYAKTEDFVSLKRGFQEIEKPDVYSWTT
        L RS    ++L+LF  +H   C  ++PD Y++S  +    + RD  FG Q+H YA+RSGL  + HV+NT+LSLY +  +  SLK+ F EI++PDVYSWTT
Subjt:  LKRSSRYFDSLQLFTQIHSSHCFNIKPDHYNLSTTLAVCANFRDIAFGSQLHGYAVRSGLKFYPHVANTILSLYAKTEDFVSLKRGFQEIEKPDVYSWTT

Query:  LLSACTKLGHVEYADEMFDTMP-KGNVACWNAMITGSAESGHDWIAMNTFYEMHKMGVKPDNYSFACILSLCTKEIKDLGRQVHSLVIKAGYLSKTSVIN
        LLSA  KLG +EYA E+FD MP + +VA WNAMITG  ESG+   ++  F EMHK+GV+ D + FA ILS+C     D G+QVHSLVIKAG+   +SV+N
Subjt:  LLSACTKLGHVEYADEMFDTMP-KGNVACWNAMITGSAESGHDWIAMNTFYEMHKMGVKPDNYSFACILSLCTKEIKDLGRQVHSLVIKAGYLSKTSVIN

Query:  ALITMYFSIENLEDAYEVFEGTEAEVHDQITYNVMIDGLVCIRRNEEALIMFKDMKRACLSPTELTFVSIMSSCSFIRVGQQVHSQAIKLGFESFTLVGN
        ALITMYF+ + + DA  VFE T+  V DQ+T+NV+IDGL   +R +E+L++F+ M  A L PT+LTFVS+M SCS   +G QVH  AIK G+E +TLV N
Subjt:  ALITMYFSIENLEDAYEVFEGTEAEVHDQITYNVMIDGLVCIRRNEEALIMFKDMKRACLSPTELTFVSIMSSCSFIRVGQQVHSQAIKLGFESFTLVGN

Query:  STITMYSSCGEFQAANTVFQTLRKKDLISWNAIISSYVQGNFGKSAVLAFLQMQRTGIGPDEFTFGSLLGVSEFMEIVEMAHAFVYKNGLILVIEILNAL
        +T+TMYSS  +F AA+ VF++L +KDL++WN +ISSY Q   GKSA+  + +M   G+ PDEFTFGSLL  S  ++++EM  A + K GL   IEI NAL
Subjt:  STITMYSSCGEFQAANTVFQTLRKKDLISWNAIISSYVQGNFGKSAVLAFLQMQRTGIGPDEFTFGSLLGVSEFMEIVEMAHAFVYKNGLILVIEILNAL

Query:  VSAYARCRKITQSHQVFSEINSKNLISWNTVIYGFLLNGLPLQALGHFSKLIMSKLK--PSTFTLSIVLSICANISTLDIGKQIHGYILRSGNFSETSIC
        +SAY++  +I ++  +F     KNLISWN +I GF  NG P + L  FS L+ S+++  P  +TLS +LSIC + S+L +G Q H Y+LR G F ET I 
Subjt:  VSAYARCRKITQSHQVFSEINSKNLISWNTVIYGFLLNGLPLQALGHFSKLIMSKLK--PSTFTLSIVLSICANISTLDIGKQIHGYILRSGNFSETSIC

Query:  NGLITMYSKCGVLDCSLRIFNVMIERDIVSWNSVISAYAQHGQGKEAVDCFKAMQDMSSIMPDQATFTTILSACSHAGLVDEACQILDTMLIDYHVVPSV
        N LI MYS+CG +  SL +FN M E+D+VSWNS+ISAY++HG+G+ AV+ +K MQD   ++PD ATF+ +LSACSHAGLV+E  +I ++M+  + V+ +V
Subjt:  NGLITMYSKCGVLDCSLRIFNVMIERDIVSWNSVISAYAQHGQGKEAVDCFKAMQDMSSIMPDQATFTTILSACSHAGLVDEACQILDTMLIDYHVVPSV

Query:  DQLSCIVDLIGRSGYIDQAESVIESAQ--YGDHTHVWWALFSACAAHENLRLGRIVARILLEKERENPSVYVVLSNIYASAGCWEEAANVRELIKKTGSL
        D  SC+VDL+GR+G++D+AES+++ ++   G    VWWALFSACAAH +L+LG++VA++L+EKE+++PSVYV LSNIYA AG W+EA   R  I   G++
Subjt:  DQLSCIVDLIGRSGYIDQAESVIESAQ--YGDHTHVWWALFSACAAHENLRLGRIVARILLEKERENPSVYVVLSNIYASAGCWEEAANVRELIKKTGSL

Query:  KQPGCSWIRI
        KQ GCSW+R+
Subjt:  KQPGCSWIRI

Q9SN39 Pentatricopeptide repeat-containing protein DOT4, chloroplastic4.0e-9933.01Show/hide
Query:  TKLGHVEYADEMFDTMPKGNVACWNAMITGSAESGHDWIAMNTFYEMHKMGVKPDNYSFACI-LSLCTKEIKDLGRQVHSLVIKAGYLSKTSVINALITM
        T  G ++ A  +FD +       WN ++   A+SG    ++  F +M   GV+ D+Y+F+C+  S  +      G Q+H  ++K+G+  + SV N+L+  
Subjt:  TKLGHVEYADEMFDTMPKGNVACWNAMITGSAESGHDWIAMNTFYEMHKMGVKPDNYSFACI-LSLCTKEIKDLGRQVHSLVIKAGYLSKTSVINALITM

Query:  YFSIENLEDAYEVFEGTEAEVHDQITYNVMIDGLVCIRRNEEALIMFKDMKRACLSPTELTFVSIMSSCS---FIRVGQQVHSQAIKLGFESFTLVGNST
        Y   + ++ A +VF+  E    D I++N +I+G V     E+ L +F  M  + +     T VS+ + C+    I +G+ VHS  +K  F       N+ 
Subjt:  YFSIENLEDAYEVFEGTEAEVHDQITYNVMIDGLVCIRRNEEALIMFKDMKRACLSPTELTFVSIMSSCS---FIRVGQQVHSQAIKLGFESFTLVGNST

Query:  ITMYSSCGEFQAANTVFQTLRKKDLISWNAIISSYVQGNFGKSAVLAFLQMQRTGIGPDEFTFGSLLGVSEFMEIVE---MAHAFVYKNGLILVIEILNA
        + MYS CG+  +A  VF+ +  + ++S+ ++I+ Y +      AV  F +M+  GI PD +T  ++L       +++     H ++ +N L   I + NA
Subjt:  ITMYSSCGEFQAANTVFQTLRKKDLISWNAIISSYVQGNFGKSAVLAFLQMQRTGIGPDEFTFGSLLGVSEFMEIVE---MAHAFVYKNGLILVIEILNA

Query:  LVSAYARCRKITQSHQVFSEINSKNLISWNTVIYGFLLNGLPLQALGHFSKLIMSK-LKPSTFTLSIVLSICANISTLDIGKQIHGYILRSGNFSETSIC
        L+  YA+C  + ++  VFSE+  K++ISWNT+I G+  N    +AL  F+ L+  K   P   T++ VL  CA++S  D G++IHGYI+R+G FS+  + 
Subjt:  LVSAYARCRKITQSHQVFSEINSKNLISWNTVIYGFLLNGLPLQALGHFSKLIMSK-LKPSTFTLSIVLSICANISTLDIGKQIHGYILRSGNFSETSIC

Query:  NGLITMYSKCGVLDCSLRIFNVMIERDIVSWNSVISAYAQHGQGKEAVDCFKAMQDMSSIMPDQATFTTILSACSHAGLVDEACQILDTMLIDYHVVPSV
        N L+ MY+KCG L  +  +F+ +  +D+VSW  +I+ Y  HG GKEA+  F  M+  + I  D+ +F ++L ACSH+GLVDE  +  + M  +  + P+V
Subjt:  NGLITMYSKCGVLDCSLRIFNVMIERDIVSWNSVISAYAQHGQGKEAVDCFKAMQDMSSIMPDQATFTTILSACSHAGLVDEACQILDTMLIDYHVVPSV

Query:  DQLSCIVDLIGRSGYIDQAESVIESAQYGDHTHVWWALFSACAAHENLRLGRIVARILLEKERENPSVYVVLSNIYASAGCWEEAANVRELIKKTGSLKQ
        +  +CIVD++ R+G + +A   IE+        +W AL   C  H +++L   VA  + E E EN   YV+++NIYA A  WE+   +R+ I + G  K 
Subjt:  DQLSCIVDLIGRSGYIDQAESVIESAQYGDHTHVWWALFSACAAHENLRLGRIVARILLEKERENPSVYVVLSNIYASAGCWEEAANVRELIKKTGSLKQ

Query:  PGCSWIRISNSI
        PGCSWI I   +
Subjt:  PGCSWIRISNSI

Q9SVP7 Pentatricopeptide repeat-containing protein At4g136501.3e-10029.24Show/hide
Query:  LRRRLHSRAAATSISGEFRQLKRSSRYFDSLQLFTQIHSSHCFNIKPDHYNLSTTLAVCANFRDIAFGSQLHGYAVRSGLKFYPHVANTILSLYAKTEDF
        LR + HS   A  ISG    L ++    ++++LF  +   +   I P  Y  S+ L+ C     +  G QLHG  ++ G     +V N ++SLY      
Subjt:  LRRRLHSRAAATSISGEFRQLKRSSRYFDSLQLFTQIHSSHCFNIKPDHYNLSTTLAVCANFRDIAFGSQLHGYAVRSGLKFYPHVANTILSLYAKTEDF

Query:  VSLKRGFQEIEKPDVYSWTTLLSACTKLGHVEYADEMFDTMPKGNVACWNAMITGSAESGHDWIAMNTFYEMHKMGVKPDNYSFACILSLCTKE-IKDLG
                                   LG++  A+ +F  M + +   +N +I G ++ G+   AM  F  MH  G++PD+ + A ++  C+ +     G
Subjt:  VSLKRGFQEIEKPDVYSWTTLLSACTKLGHVEYADEMFDTMPKGNVACWNAMITGSAESGHDWIAMNTFYEMHKMGVKPDNYSFACILSLCTKE-IKDLG

Query:  RQVHSLVIKAGYLSKTSVINALITMYFSIENLEDAYEVFEGTEAEVHDQITYNVMIDGLVCIRRNEEALIMFKDMKRACLSPTELTFVSIMSSC---SFI
        +Q+H+   K G+ S   +  AL+ +Y    ++E A + F   E EV + + +NVM+     +     +  +F+ M+   + P + T+ SI+ +C     +
Subjt:  RQVHSLVIKAGYLSKTSVINALITMYFSIENLEDAYEVFEGTEAEVHDQITYNVMIDGLVCIRRNEEALIMFKDMKRACLSPTELTFVSIMSSC---SFI

Query:  RVGQQVHSQAIKLGFESFTLVGNSTITMYSSCGEFQAANTVFQTLRKKDLISWNAIISSYVQGNFGKSAVLAFLQMQRTGIGPDEFTFGSLLGVSEFMEI
         +G+Q+HSQ IK  F+    V +  I MY+  G+   A  +      KD++SW  +I+ Y Q NF   A+  F QM   GI  DE    + +     ++ 
Subjt:  RVGQQVHSQAIKLGFESFTLVGNSTITMYSSCGEFQAANTVFQTLRKKDLISWNAIISSYVQGNFGKSAVLAFLQMQRTGIGPDEFTFGSLLGVSEFMEI

Query:  V---EMAHAFVYKNGLILVIEILNALVSAYARCRKITQSHQVFSEINSKNLISWNTVIYGFLLNGLPLQALGHFSKLIMSKLKPSTFTLSIVLSICANIS
        +   +  HA    +G    +   NALV+ Y+RC KI +S+  F +  + + I+WN ++ GF  +G   +AL  F ++    +  + FT    +   +  +
Subjt:  V---EMAHAFVYKNGLILVIEILNALVSAYARCRKITQSHQVFSEINSKNLISWNTVIYGFLLNGLPLQALGHFSKLIMSKLKPSTFTLSIVLSICANIS

Query:  TLDIGKQIHGYILRSGNFSETSICNGLITMYSKCGVLDCSLRIFNVMIERDIVSWNSVISAYAQHGQGKEAVDCFKAMQDMSSIMPDQATFTTILSACSH
         +  GKQ+H  I ++G  SET +CN LI+MY+KCG +  + + F  +  ++ VSWN++I+AY++HG G EA+D F  M   S++ P+  T   +LSACSH
Subjt:  TLDIGKQIHGYILRSGNFSETSICNGLITMYSKCGVLDCSLRIFNVMIERDIVSWNSVISAYAQHGQGKEAVDCFKAMQDMSSIMPDQATFTTILSACSH

Query:  AGLVDEACQILDTMLIDYHVVPSVDQLSCIVDLIGRSGYIDQAESVIESAQYGDHTHVWWALFSACAAHENLRLGRIVARILLEKERENPSVYVVLSNIY
         GLVD+     ++M  +Y + P  +   C+VD++ R+G + +A+  I+         VW  L SAC  H+N+ +G   A  LLE E E+ + YV+LSN+Y
Subjt:  AGLVDEACQILDTMLIDYHVVPSVDQLSCIVDLIGRSGYIDQAESVIESAQYGDHTHVWWALFSACAAHENLRLGRIVARILLEKERENPSVYVVLSNIY

Query:  ASAGCWEEAANVRELIKKTGSLKQPGCSWIRISNSIAWPSLAGTDKHGLKTRATRGVEPMTTCSRGQMGLVKDSNKELVE---------IFLNHAKIFGS
        A +  W+     R+ +K+ G  K+PG SWI + NSI      G   H L        + +T     ++G V+D    L E         IF++  K+  S
Subjt:  ASAGCWEEAANVRELIKKTGSLKQPGCSWIRISNSIAWPSLAGTDKHGLKTRATRGVEPMTTCSRGQMGLVKDSNKELVE---------IFLNHAKIFGS

Query:  FVFISRP
        F  +S P
Subjt:  FVFISRP

Q9ZUW3 Pentatricopeptide repeat-containing protein At2g276101.6e-10030.58Show/hide
Query:  RSSRYFDSLQLFTQIHSSHCFNIKPDHYNLSTTLAVCANFRDIAFGSQLHGYAVRSGLKFYPHVANTILSLYAKTEDFVSLKRGFQEIEKPDVYSWTTLL
        R  R  ++ +LF  IH      ++ D    S+ L V A   D  FG QLH   ++ G      V  +++  Y K  +F   ++ F E+++ +V +WTTL+
Subjt:  RSSRYFDSLQLFTQIHSSHCFNIKPDHYNLSTTLAVCANFRDIAFGSQLHGYAVRSGLKFYPHVANTILSLYAKTEDFVSLKRGFQEIEKPDVYSWTTLL

Query:  SACTKLGHVEYADEMFDTMPKGNVACWNAMITGSAESGHDWIAMNTFYEMHKMGVKPDNYSFACILSLCTKE-IKDLGRQVHSLVIKAGYLSKTSVINAL
        S                               G A +  +   +  F  M   G +P++++FA  L +  +E +   G QVH++V+K G      V N+L
Subjt:  SACTKLGHVEYADEMFDTMPKGNVACWNAMITGSAESGHDWIAMNTFYEMHKMGVKPDNYSFACILSLCTKE-IKDLGRQVHSLVIKAGYLSKTSVINAL

Query:  ITMYFSIENLEDAYEVFEGTEAEVHDQITYNVMIDGLVCIRRNEEALIMFKDMKRACLSPTELTFVSIMSSCS---FIRVGQQVHSQAIKLGFESFTLVG
        I +Y    N+  A  +F+ T  EV   +T+N MI G      + EAL MF  M+   +  +E +F S++  C+    +R  +Q+H   +K GF     + 
Subjt:  ITMYFSIENLEDAYEVFEGTEAEVHDQITYNVMIDGLVCIRRNEEALIMFKDMKRACLSPTELTFVSIMSSCS---FIRVGQQVHSQAIKLGFESFTLVG

Query:  NSTITMYSSCGEFQAANTVFQTLR-KKDLISWNAIISSYVQGNFGKSAVLAFLQMQRTGIGPDEFTFGSLLGVSEFMEIVEMAHAFVYKNGLILVIEILN
         + +  YS C     A  +F+ +    +++SW A+IS ++Q +  + AV  F +M+R G+ P+EFT+  +L     +   E+ HA V K        +  
Subjt:  NSTITMYSSCGEFQAANTVFQTLR-KKDLISWNAIISSYVQGNFGKSAVLAFLQMQRTGIGPDEFTFGSLLGVSEFMEIVEMAHAFVYKNGLILVIEILN

Query:  ALVSAYARCRKITQSHQVFSEINSKNLISWNTVIYGFLLNGLPLQALGHFSKLIMSKLKPSTFTLSIVLSIC-ANISTLDIGKQIHGYILRSGNFSETSI
        AL+ AY +  K+ ++ +VFS I+ K++++W+ ++ G+   G    A+  F +L    +KP+ FT S +L++C A  +++  GKQ HG+ ++S   S   +
Subjt:  ALVSAYARCRKITQSHQVFSEINSKNLISWNTVIYGFLLNGLPLQALGHFSKLIMSKLKPSTFTLSIVLSIC-ANISTLDIGKQIHGYILRSGNFSETSI

Query:  CNGLITMYSKCGVLDCSLRIFNVMIERDIVSWNSVISAYAQHGQGKEAVDCFKAMQDMSSIMPDQATFTTILSACSHAGLVDEACQILDTMLIDYHVVPS
         + L+TMY+K G ++ +  +F    E+D+VSWNS+IS YAQHGQ  +A+D FK M+    +  D  TF  + +AC+HAGLV+E  +  D M+ D  + P+
Subjt:  CNGLITMYSKCGVLDCSLRIFNVMIERDIVSWNSVISAYAQHGQGKEAVDCFKAMQDMSSIMPDQATFTTILSACSHAGLVDEACQILDTMLIDYHVVPS

Query:  VDQLSCIVDLIGRSGYIDQAESVIESAQYGDHTHVWWALFSACAAHENLRLGRIVARILLEKERENPSVYVVLSNIYASAGCWEEAANVRELIKKTGSLK
         +  SC+VDL  R+G +++A  VIE+      + +W  + +AC  H+   LGR+ A  ++  + E+ + YV+LSN+YA +G W+E A VR+L+ +    K
Subjt:  VDQLSCIVDLIGRSGYIDQAESVIESAQYGDHTHVWWALFSACAAHENLRLGRIVARILLEKERENPSVYVVLSNIYASAGCWEEAANVRELIKKTGSLK

Query:  QPGCSWIRISNSIAWPSLAGTDKHGLKTRATRGVEPMTTCSRGQMGLVKDSNKELVEIFLNH
        +PG SWI + N   +  LAG   H LK +    +E ++T  +  +G   D++  L +I   H
Subjt:  QPGCSWIRISNSIAWPSLAGTDKHGLKTRATRGVEPMTTCSRGQMGLVKDSNKELVEIFLNH

Arabidopsis top hitse value%identityAlignment
AT2G27610.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.2e-10130.58Show/hide
Query:  RSSRYFDSLQLFTQIHSSHCFNIKPDHYNLSTTLAVCANFRDIAFGSQLHGYAVRSGLKFYPHVANTILSLYAKTEDFVSLKRGFQEIEKPDVYSWTTLL
        R  R  ++ +LF  IH      ++ D    S+ L V A   D  FG QLH   ++ G      V  +++  Y K  +F   ++ F E+++ +V +WTTL+
Subjt:  RSSRYFDSLQLFTQIHSSHCFNIKPDHYNLSTTLAVCANFRDIAFGSQLHGYAVRSGLKFYPHVANTILSLYAKTEDFVSLKRGFQEIEKPDVYSWTTLL

Query:  SACTKLGHVEYADEMFDTMPKGNVACWNAMITGSAESGHDWIAMNTFYEMHKMGVKPDNYSFACILSLCTKE-IKDLGRQVHSLVIKAGYLSKTSVINAL
        S                               G A +  +   +  F  M   G +P++++FA  L +  +E +   G QVH++V+K G      V N+L
Subjt:  SACTKLGHVEYADEMFDTMPKGNVACWNAMITGSAESGHDWIAMNTFYEMHKMGVKPDNYSFACILSLCTKE-IKDLGRQVHSLVIKAGYLSKTSVINAL

Query:  ITMYFSIENLEDAYEVFEGTEAEVHDQITYNVMIDGLVCIRRNEEALIMFKDMKRACLSPTELTFVSIMSSCS---FIRVGQQVHSQAIKLGFESFTLVG
        I +Y    N+  A  +F+ T  EV   +T+N MI G      + EAL MF  M+   +  +E +F S++  C+    +R  +Q+H   +K GF     + 
Subjt:  ITMYFSIENLEDAYEVFEGTEAEVHDQITYNVMIDGLVCIRRNEEALIMFKDMKRACLSPTELTFVSIMSSCS---FIRVGQQVHSQAIKLGFESFTLVG

Query:  NSTITMYSSCGEFQAANTVFQTLR-KKDLISWNAIISSYVQGNFGKSAVLAFLQMQRTGIGPDEFTFGSLLGVSEFMEIVEMAHAFVYKNGLILVIEILN
         + +  YS C     A  +F+ +    +++SW A+IS ++Q +  + AV  F +M+R G+ P+EFT+  +L     +   E+ HA V K        +  
Subjt:  NSTITMYSSCGEFQAANTVFQTLR-KKDLISWNAIISSYVQGNFGKSAVLAFLQMQRTGIGPDEFTFGSLLGVSEFMEIVEMAHAFVYKNGLILVIEILN

Query:  ALVSAYARCRKITQSHQVFSEINSKNLISWNTVIYGFLLNGLPLQALGHFSKLIMSKLKPSTFTLSIVLSIC-ANISTLDIGKQIHGYILRSGNFSETSI
        AL+ AY +  K+ ++ +VFS I+ K++++W+ ++ G+   G    A+  F +L    +KP+ FT S +L++C A  +++  GKQ HG+ ++S   S   +
Subjt:  ALVSAYARCRKITQSHQVFSEINSKNLISWNTVIYGFLLNGLPLQALGHFSKLIMSKLKPSTFTLSIVLSIC-ANISTLDIGKQIHGYILRSGNFSETSI

Query:  CNGLITMYSKCGVLDCSLRIFNVMIERDIVSWNSVISAYAQHGQGKEAVDCFKAMQDMSSIMPDQATFTTILSACSHAGLVDEACQILDTMLIDYHVVPS
         + L+TMY+K G ++ +  +F    E+D+VSWNS+IS YAQHGQ  +A+D FK M+    +  D  TF  + +AC+HAGLV+E  +  D M+ D  + P+
Subjt:  CNGLITMYSKCGVLDCSLRIFNVMIERDIVSWNSVISAYAQHGQGKEAVDCFKAMQDMSSIMPDQATFTTILSACSHAGLVDEACQILDTMLIDYHVVPS

Query:  VDQLSCIVDLIGRSGYIDQAESVIESAQYGDHTHVWWALFSACAAHENLRLGRIVARILLEKERENPSVYVVLSNIYASAGCWEEAANVRELIKKTGSLK
         +  SC+VDL  R+G +++A  VIE+      + +W  + +AC  H+   LGR+ A  ++  + E+ + YV+LSN+YA +G W+E A VR+L+ +    K
Subjt:  VDQLSCIVDLIGRSGYIDQAESVIESAQYGDHTHVWWALFSACAAHENLRLGRIVARILLEKERENPSVYVVLSNIYASAGCWEEAANVRELIKKTGSLK

Query:  QPGCSWIRISNSIAWPSLAGTDKHGLKTRATRGVEPMTTCSRGQMGLVKDSNKELVEIFLNH
        +PG SWI + N   +  LAG   H LK +    +E ++T  +  +G   D++  L +I   H
Subjt:  QPGCSWIRISNSIAWPSLAGTDKHGLKTRATRGVEPMTTCSRGQMGLVKDSNKELVEIFLNH

AT3G25970.1 Pentatricopeptide repeat (PPR) superfamily protein4.1e-9934.87Show/hide
Query:  DVYSWTTLLSACTKLGHVEYADEMFDTMPKGNVACWNAMITGSAESGHDWIAMNTFYEMHKMGVKPDNYSFACIL-SLCTKEIKDLGRQVHSLVIKAGYL
        D+Y    +L +  K G + YA+ +FD MPK +   WN MI+G    G    A   F  M + G   D YSF+ +L  + + +  DLG QVH LVIK GY 
Subjt:  DVYSWTTLLSACTKLGHVEYADEMFDTMPKGNVACWNAMITGSAESGHDWIAMNTFYEMHKMGVKPDNYSFACIL-SLCTKEIKDLGRQVHSLVIKAGYL

Query:  SKTSVINALITMYFSIENLEDAYEVFEGTEAEVHDQITYNVMIDGLVCIRRNEEA--LIMFKDMKRACL--SPTELTFVSIMSSCSFIRVGQQVHSQAIK
            V ++L+ MY   E +EDA+E F+  E    + +++N +I G V +R  + A  L+   +MK A    + T    ++++    F  + +QVH++ +K
Subjt:  SKTSVINALITMYFSIENLEDAYEVFEGTEAEVHDQITYNVMIDGLVCIRRNEEA--LIMFKDMKRACL--SPTELTFVSIMSSCSFIRVGQQVHSQAIK

Query:  LGFESFTLVGNSTITMYSSCGEFQAANTVFQTL-RKKDLISWNAIISSYVQGNFGKSAVLAFLQMQRTGIGPDEFTFGSLLGV---SEFMEIVEMAHAFV
        LG +    + N+ I+ Y+ CG    A  VF  L   KDLISWN++I+ + +    +SA   F+QMQR  +  D +T+  LL      E     +  H  V
Subjt:  LGFESFTLVGNSTITMYSSCGEFQAANTVFQTL-RKKDLISWNAIISSYVQGNFGKSAVLAFLQMQRTGIGPDEFTFGSLLGV---SEFMEIVEMAHAFV

Query:  YKNGLILVIEILNALVSAYARCRKITQSH--QVFSEINSKNLISWNTVIYGFLLNGLPLQALGHFSKLIMSKLKPSTFTLSIVLSICANISTLDIGKQIH
         K GL  V    NAL+S Y +    T      +F  + SK+LISWN++I GF   GL   A+  FS L  S++K   +  S +L  C++++TL +G+QIH
Subjt:  YKNGLILVIEILNALVSAYARCRKITQSH--QVFSEINSKNLISWNTVIYGFLLNGLPLQALGHFSKLIMSKLKPSTFTLSIVLSICANISTLDIGKQIH

Query:  GYILRSGNFSETSICNGLITMYSKCGVLDCSLRIF-NVMIERDIVSWNSVISAYAQHGQGKEAVDCFKAMQDMSSIMPDQATFTTILSACSHAGLVDEAC
            +SG  S   + + LI MYSKCG+++ + + F  +  +   V+WN++I  YAQHG G+ ++D F  M +  ++  D  TFT IL+ACSH GL+ E  
Subjt:  GYILRSGNFSETSICNGLITMYSKCGVLDCSLRIF-NVMIERDIVSWNSVISAYAQHGQGKEAVDCFKAMQDMSSIMPDQATFTTILSACSHAGLVDEAC

Query:  QILDTMLIDYHVVPSVDQLSCIVDLIGRSGYIDQAESVIESAQYGDHTHVWWALFSACAAHENLRLGRIVARILLEKERENPSVYVVLSNIYASAGCWEE
        ++L+ M   Y + P ++  +  VDL+GR+G +++A+ +IES        V       C A   + +   VA  LLE E E+   YV LS++Y+    WEE
Subjt:  QILDTMLIDYHVVPSVDQLSCIVDLIGRSGYIDQAESVIESAQYGDHTHVWWALFSACAAHENLRLGRIVARILLEKERENPSVYVVLSNIYASAGCWEE

Query:  AANVRELIKKTGSLKQPGCSWIRISNSI
         A+V++++K+ G  K PG SWI I N +
Subjt:  AANVRELIKKTGSLKQPGCSWIRISNSI

AT3G49740.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.4e-21151.41Show/hide
Query:  LKRSSRYFDSLQLFTQIHSSHCFNIKPDHYNLSTTLAVCANFRDIAFGSQLHGYAVRSGLKFYPHVANTILSLYAKTEDFVSLKRGFQEIEKPDVYSWTT
        L RS    ++L+LF  +H   C  ++PD Y++S  +    + RD  FG Q+H YA+RSGL  + HV+NT+LSLY +  +  SLK+ F EI++PDVYSWTT
Subjt:  LKRSSRYFDSLQLFTQIHSSHCFNIKPDHYNLSTTLAVCANFRDIAFGSQLHGYAVRSGLKFYPHVANTILSLYAKTEDFVSLKRGFQEIEKPDVYSWTT

Query:  LLSACTKLGHVEYADEMFDTMP-KGNVACWNAMITGSAESGHDWIAMNTFYEMHKMGVKPDNYSFACILSLCTKEIKDLGRQVHSLVIKAGYLSKTSVIN
        LLSA  KLG +EYA E+FD MP + +VA WNAMITG  ESG+   ++  F EMHK+GV+ D + FA ILS+C     D G+QVHSLVIKAG+   +SV+N
Subjt:  LLSACTKLGHVEYADEMFDTMP-KGNVACWNAMITGSAESGHDWIAMNTFYEMHKMGVKPDNYSFACILSLCTKEIKDLGRQVHSLVIKAGYLSKTSVIN

Query:  ALITMYFSIENLEDAYEVFEGTEAEVHDQITYNVMIDGLVCIRRNEEALIMFKDMKRACLSPTELTFVSIMSSCSFIRVGQQVHSQAIKLGFESFTLVGN
        ALITMYF+ + + DA  VFE T+  V DQ+T+NV+IDGL   +R +E+L++F+ M  A L PT+LTFVS+M SCS   +G QVH  AIK G+E +TLV N
Subjt:  ALITMYFSIENLEDAYEVFEGTEAEVHDQITYNVMIDGLVCIRRNEEALIMFKDMKRACLSPTELTFVSIMSSCSFIRVGQQVHSQAIKLGFESFTLVGN

Query:  STITMYSSCGEFQAANTVFQTLRKKDLISWNAIISSYVQGNFGKSAVLAFLQMQRTGIGPDEFTFGSLLGVSEFMEIVEMAHAFVYKNGLILVIEILNAL
        +T+TMYSS  +F AA+ VF++L +KDL++WN +ISSY Q   GKSA+  + +M   G+ PDEFTFGSLL  S  ++++EM  A + K GL   IEI NAL
Subjt:  STITMYSSCGEFQAANTVFQTLRKKDLISWNAIISSYVQGNFGKSAVLAFLQMQRTGIGPDEFTFGSLLGVSEFMEIVEMAHAFVYKNGLILVIEILNAL

Query:  VSAYARCRKITQSHQVFSEINSKNLISWNTVIYGFLLNGLPLQALGHFSKLIMSKLK--PSTFTLSIVLSICANISTLDIGKQIHGYILRSGNFSETSIC
        +SAY++  +I ++  +F     KNLISWN +I GF  NG P + L  FS L+ S+++  P  +TLS +LSIC + S+L +G Q H Y+LR G F ET I 
Subjt:  VSAYARCRKITQSHQVFSEINSKNLISWNTVIYGFLLNGLPLQALGHFSKLIMSKLK--PSTFTLSIVLSICANISTLDIGKQIHGYILRSGNFSETSIC

Query:  NGLITMYSKCGVLDCSLRIFNVMIERDIVSWNSVISAYAQHGQGKEAVDCFKAMQDMSSIMPDQATFTTILSACSHAGLVDEACQILDTMLIDYHVVPSV
        N LI MYS+CG +  SL +FN M E+D+VSWNS+ISAY++HG+G+ AV+ +K MQD   ++PD ATF+ +LSACSHAGLV+E  +I ++M+  + V+ +V
Subjt:  NGLITMYSKCGVLDCSLRIFNVMIERDIVSWNSVISAYAQHGQGKEAVDCFKAMQDMSSIMPDQATFTTILSACSHAGLVDEACQILDTMLIDYHVVPSV

Query:  DQLSCIVDLIGRSGYIDQAESVIESAQ--YGDHTHVWWALFSACAAHENLRLGRIVARILLEKERENPSVYVVLSNIYASAGCWEEAANVRELIKKTGSL
        D  SC+VDL+GR+G++D+AES+++ ++   G    VWWALFSACAAH +L+LG++VA++L+EKE+++PSVYV LSNIYA AG W+EA   R  I   G++
Subjt:  DQLSCIVDLIGRSGYIDQAESVIESAQ--YGDHTHVWWALFSACAAHENLRLGRIVARILLEKERENPSVYVVLSNIYASAGCWEEAANVRELIKKTGSL

Query:  KQPGCSWIRI
        KQ GCSW+R+
Subjt:  KQPGCSWIRI

AT4G13650.1 Pentatricopeptide repeat (PPR) superfamily protein8.9e-10229.24Show/hide
Query:  LRRRLHSRAAATSISGEFRQLKRSSRYFDSLQLFTQIHSSHCFNIKPDHYNLSTTLAVCANFRDIAFGSQLHGYAVRSGLKFYPHVANTILSLYAKTEDF
        LR + HS   A  ISG    L ++    ++++LF  +   +   I P  Y  S+ L+ C     +  G QLHG  ++ G     +V N ++SLY      
Subjt:  LRRRLHSRAAATSISGEFRQLKRSSRYFDSLQLFTQIHSSHCFNIKPDHYNLSTTLAVCANFRDIAFGSQLHGYAVRSGLKFYPHVANTILSLYAKTEDF

Query:  VSLKRGFQEIEKPDVYSWTTLLSACTKLGHVEYADEMFDTMPKGNVACWNAMITGSAESGHDWIAMNTFYEMHKMGVKPDNYSFACILSLCTKE-IKDLG
                                   LG++  A+ +F  M + +   +N +I G ++ G+   AM  F  MH  G++PD+ + A ++  C+ +     G
Subjt:  VSLKRGFQEIEKPDVYSWTTLLSACTKLGHVEYADEMFDTMPKGNVACWNAMITGSAESGHDWIAMNTFYEMHKMGVKPDNYSFACILSLCTKE-IKDLG

Query:  RQVHSLVIKAGYLSKTSVINALITMYFSIENLEDAYEVFEGTEAEVHDQITYNVMIDGLVCIRRNEEALIMFKDMKRACLSPTELTFVSIMSSC---SFI
        +Q+H+   K G+ S   +  AL+ +Y    ++E A + F   E EV + + +NVM+     +     +  +F+ M+   + P + T+ SI+ +C     +
Subjt:  RQVHSLVIKAGYLSKTSVINALITMYFSIENLEDAYEVFEGTEAEVHDQITYNVMIDGLVCIRRNEEALIMFKDMKRACLSPTELTFVSIMSSC---SFI

Query:  RVGQQVHSQAIKLGFESFTLVGNSTITMYSSCGEFQAANTVFQTLRKKDLISWNAIISSYVQGNFGKSAVLAFLQMQRTGIGPDEFTFGSLLGVSEFMEI
         +G+Q+HSQ IK  F+    V +  I MY+  G+   A  +      KD++SW  +I+ Y Q NF   A+  F QM   GI  DE    + +     ++ 
Subjt:  RVGQQVHSQAIKLGFESFTLVGNSTITMYSSCGEFQAANTVFQTLRKKDLISWNAIISSYVQGNFGKSAVLAFLQMQRTGIGPDEFTFGSLLGVSEFMEI

Query:  V---EMAHAFVYKNGLILVIEILNALVSAYARCRKITQSHQVFSEINSKNLISWNTVIYGFLLNGLPLQALGHFSKLIMSKLKPSTFTLSIVLSICANIS
        +   +  HA    +G    +   NALV+ Y+RC KI +S+  F +  + + I+WN ++ GF  +G   +AL  F ++    +  + FT    +   +  +
Subjt:  V---EMAHAFVYKNGLILVIEILNALVSAYARCRKITQSHQVFSEINSKNLISWNTVIYGFLLNGLPLQALGHFSKLIMSKLKPSTFTLSIVLSICANIS

Query:  TLDIGKQIHGYILRSGNFSETSICNGLITMYSKCGVLDCSLRIFNVMIERDIVSWNSVISAYAQHGQGKEAVDCFKAMQDMSSIMPDQATFTTILSACSH
         +  GKQ+H  I ++G  SET +CN LI+MY+KCG +  + + F  +  ++ VSWN++I+AY++HG G EA+D F  M   S++ P+  T   +LSACSH
Subjt:  TLDIGKQIHGYILRSGNFSETSICNGLITMYSKCGVLDCSLRIFNVMIERDIVSWNSVISAYAQHGQGKEAVDCFKAMQDMSSIMPDQATFTTILSACSH

Query:  AGLVDEACQILDTMLIDYHVVPSVDQLSCIVDLIGRSGYIDQAESVIESAQYGDHTHVWWALFSACAAHENLRLGRIVARILLEKERENPSVYVVLSNIY
         GLVD+     ++M  +Y + P  +   C+VD++ R+G + +A+  I+         VW  L SAC  H+N+ +G   A  LLE E E+ + YV+LSN+Y
Subjt:  AGLVDEACQILDTMLIDYHVVPSVDQLSCIVDLIGRSGYIDQAESVIESAQYGDHTHVWWALFSACAAHENLRLGRIVARILLEKERENPSVYVVLSNIY

Query:  ASAGCWEEAANVRELIKKTGSLKQPGCSWIRISNSIAWPSLAGTDKHGLKTRATRGVEPMTTCSRGQMGLVKDSNKELVE---------IFLNHAKIFGS
        A +  W+     R+ +K+ G  K+PG SWI + NSI      G   H L        + +T     ++G V+D    L E         IF++  K+  S
Subjt:  ASAGCWEEAANVRELIKKTGSLKQPGCSWIRISNSIAWPSLAGTDKHGLKTRATRGVEPMTTCSRGQMGLVKDSNKELVE---------IFLNHAKIFGS

Query:  FVFISRP
        F  +S P
Subjt:  FVFISRP

AT4G18750.1 Pentatricopeptide repeat (PPR) superfamily protein2.9e-10033.01Show/hide
Query:  TKLGHVEYADEMFDTMPKGNVACWNAMITGSAESGHDWIAMNTFYEMHKMGVKPDNYSFACI-LSLCTKEIKDLGRQVHSLVIKAGYLSKTSVINALITM
        T  G ++ A  +FD +       WN ++   A+SG    ++  F +M   GV+ D+Y+F+C+  S  +      G Q+H  ++K+G+  + SV N+L+  
Subjt:  TKLGHVEYADEMFDTMPKGNVACWNAMITGSAESGHDWIAMNTFYEMHKMGVKPDNYSFACI-LSLCTKEIKDLGRQVHSLVIKAGYLSKTSVINALITM

Query:  YFSIENLEDAYEVFEGTEAEVHDQITYNVMIDGLVCIRRNEEALIMFKDMKRACLSPTELTFVSIMSSCS---FIRVGQQVHSQAIKLGFESFTLVGNST
        Y   + ++ A +VF+  E    D I++N +I+G V     E+ L +F  M  + +     T VS+ + C+    I +G+ VHS  +K  F       N+ 
Subjt:  YFSIENLEDAYEVFEGTEAEVHDQITYNVMIDGLVCIRRNEEALIMFKDMKRACLSPTELTFVSIMSSCS---FIRVGQQVHSQAIKLGFESFTLVGNST

Query:  ITMYSSCGEFQAANTVFQTLRKKDLISWNAIISSYVQGNFGKSAVLAFLQMQRTGIGPDEFTFGSLLGVSEFMEIVE---MAHAFVYKNGLILVIEILNA
        + MYS CG+  +A  VF+ +  + ++S+ ++I+ Y +      AV  F +M+  GI PD +T  ++L       +++     H ++ +N L   I + NA
Subjt:  ITMYSSCGEFQAANTVFQTLRKKDLISWNAIISSYVQGNFGKSAVLAFLQMQRTGIGPDEFTFGSLLGVSEFMEIVE---MAHAFVYKNGLILVIEILNA

Query:  LVSAYARCRKITQSHQVFSEINSKNLISWNTVIYGFLLNGLPLQALGHFSKLIMSK-LKPSTFTLSIVLSICANISTLDIGKQIHGYILRSGNFSETSIC
        L+  YA+C  + ++  VFSE+  K++ISWNT+I G+  N    +AL  F+ L+  K   P   T++ VL  CA++S  D G++IHGYI+R+G FS+  + 
Subjt:  LVSAYARCRKITQSHQVFSEINSKNLISWNTVIYGFLLNGLPLQALGHFSKLIMSK-LKPSTFTLSIVLSICANISTLDIGKQIHGYILRSGNFSETSIC

Query:  NGLITMYSKCGVLDCSLRIFNVMIERDIVSWNSVISAYAQHGQGKEAVDCFKAMQDMSSIMPDQATFTTILSACSHAGLVDEACQILDTMLIDYHVVPSV
        N L+ MY+KCG L  +  +F+ +  +D+VSW  +I+ Y  HG GKEA+  F  M+  + I  D+ +F ++L ACSH+GLVDE  +  + M  +  + P+V
Subjt:  NGLITMYSKCGVLDCSLRIFNVMIERDIVSWNSVISAYAQHGQGKEAVDCFKAMQDMSSIMPDQATFTTILSACSHAGLVDEACQILDTMLIDYHVVPSV

Query:  DQLSCIVDLIGRSGYIDQAESVIESAQYGDHTHVWWALFSACAAHENLRLGRIVARILLEKERENPSVYVVLSNIYASAGCWEEAANVRELIKKTGSLKQ
        +  +CIVD++ R+G + +A   IE+        +W AL   C  H +++L   VA  + E E EN   YV+++NIYA A  WE+   +R+ I + G  K 
Subjt:  DQLSCIVDLIGRSGYIDQAESVIESAQYGDHTHVWWALFSACAAHENLRLGRIVARILLEKERENPSVYVVLSNIYASAGCWEEAANVRELIKKTGSLKQ

Query:  PGCSWIRISNSI
        PGCSWI I   +
Subjt:  PGCSWIRISNSI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGAGAAAAACCCCCTCCCTCCTCCCTCACGTTTTCTCCTTCTCCGGCCACCTTCTTCATCTCCGGCGTTCTCCTTCTCCAACGGTGACAGACCCAGGCGAACCCAC
GTCGTCTTCTTGCTCCACCCGACCATTCGACACCCATGCGGCAACGGTTTCGGCAGCGGCGGTGCGAGCTCTTCAACGAACAGGAGGGTGGCCCACGCACGAACGAGGCT
CCCACCTTCGACGGCGTCTGCACTCACGAGCAGCAGCAACTTCGATTTCCGGAGAATTCCGGCAGCTCAAGCGTTCAAGTCGCTACTTCGACTCTTTGCAACTCTTCACT
CAAATCCATTCATCGCATTGCTTCAATATAAAGCCTGACCACTACAATCTCTCCACTACACTTGCTGTTTGTGCCAACTTTCGCGACATTGCCTTCGGGTCTCAACTCCA
TGGTTATGCTGTTCGATCTGGGCTCAAATTCTACCCTCATGTTGCAAATACCATTCTTTCGCTTTATGCCAAAACAGAGGATTTTGTGTCTTTGAAAAGGGGTTTCCAAG
AGATTGAGAAACCAGATGTTTATTCTTGGACTACACTGTTGTCAGCTTGTACAAAATTGGGTCATGTTGAATATGCAGATGAGATGTTTGATACAATGCCAAAGGGTAAT
GTTGCATGTTGGAATGCTATGATAACTGGGAGTGCAGAAAGTGGACATGATTGGATTGCCATGAACACCTTTTATGAAATGCACAAAATGGGCGTTAAGCCAGATAATTA
CTCTTTTGCTTGTATCTTGAGTTTGTGTACCAAGGAAATTAAAGATTTGGGAAGACAAGTGCATTCTTTGGTGATTAAAGCTGGATATCTTAGCAAAACTTCTGTGATTA
ATGCTTTGATTACTATGTATTTCAGTATTGAGAACCTCGAGGATGCCTATGAGGTTTTTGAGGGAACTGAAGCTGAGGTTCATGATCAGATTACTTATAATGTAATGATA
GACGGCCTGGTCTGCATAAGAAGGAATGAGGAGGCCTTGATTATGTTCAAAGATATGAAAAGGGCATGTCTAAGTCCTACTGAGCTCACCTTTGTGAGCATTATGAGCTC
ATGTTCATTTATACGGGTTGGCCAACAAGTGCACTCCCAAGCCATTAAGCTAGGGTTTGAATCTTTTACTTTAGTAGGAAATTCAACCATAACCATGTACTCTTCTTGTG
GGGAGTTTCAAGCAGCAAATACAGTTTTCCAGACGCTAAGAAAGAAAGATCTCATCTCATGGAATGCCATAATCTCGAGCTATGTCCAAGGGAATTTTGGCAAATCAGCT
GTTCTTGCTTTTCTACAAATGCAGAGGACTGGAATTGGGCCAGATGAGTTTACGTTTGGAAGCTTATTAGGAGTTTCAGAGTTCATGGAGATAGTGGAAATGGCTCACGC
CTTTGTATATAAAAATGGGTTGATCCTCGTAATCGAAATTTTAAATGCATTAGTTTCTGCATACGCGAGGTGTAGGAAGATAACACAGTCTCATCAAGTATTTAGTGAAA
TCAATTCAAAAAATTTAATCTCTTGGAACACTGTCATTTATGGATTTCTGTTAAATGGGCTTCCATTGCAAGCATTGGGGCATTTTTCTAAGCTTATAATGTCAAAGCTC
AAGCCAAGCACATTTACACTCAGCATTGTCCTGAGCATTTGTGCCAACATTTCAACCTTGGACATTGGAAAACAAATTCATGGTTACATTCTCAGATCGGGGAATTTCTC
AGAAACTTCTATATGTAATGGCCTTATTACAATGTATTCTAAATGTGGGGTGTTAGATTGTTCTTTGAGAATTTTTAATGTCATGATTGAAAGGGATATTGTATCATGGA
ATTCTGTAATATCTGCTTATGCACAACATGGGCAGGGGAAGGAAGCTGTGGATTGTTTCAAGGCTATGCAAGACATGTCCTCAATTATGCCTGATCAAGCCACATTCACT
ACTATTCTTTCAGCTTGCAGCCATGCAGGATTAGTTGACGAAGCCTGTCAGATTTTGGATACAATGTTGATAGATTATCATGTTGTTCCTAGTGTGGATCAATTATCTTG
CATTGTCGACCTTATAGGTCGTTCGGGGTATATTGATCAAGCTGAAAGTGTAATAGAAAGTGCACAATATGGAGACCATACACACGTTTGGTGGGCATTGTTTAGCGCCT
GTGCAGCACATGAAAACTTAAGGTTGGGAAGAATTGTTGCGAGAATCCTTCTAGAGAAAGAACGTGAGAATCCATCTGTGTATGTGGTTCTGTCAAATATATATGCCAGT
GCTGGGTGTTGGGAAGAAGCAGCCAATGTGAGGGAATTGATTAAGAAAACTGGTTCACTGAAACAACCAGGCTGCAGTTGGATCAGAATTTCTAATTCCATTGCTTGGCC
AAGCCTAGCAGGAACAGATAAACATGGCTTGAAGACTCGTGCAACAAGAGGAGTCGAGCCCATGACCACTTGTTCTAGGGGACAGATGGGTTTGGTAAAAGATTCAAACA
AGGAACTTGTGGAAATTTTTTTAAATCATGCCAAAATTTTTGGGAGCTTTGTCTTCATTTCTCGACCTCTTGCTTCTAAAGGATGCCCTCCCTCAACTGGATGCTGGTTT
AGCCAAGTTATCCGATTTGAACTGGATGAGGAGGAACAATCGATAAAAAATTTGGAAACTATAGATGGAGGTGTTGAAGACGAAGGCGCTGCAGATACAGAAGAGAAAGA
TACTCATGGGCAACTTGATTCGAAGAAATTTGAAAATCTTAATGCAAATAGTGAAGACGTGAAGAACAACGACAAGGAAGAAGAAGAAGAAGCTTACGACAATATTCAAG
AAGAAATAGACGATCACAGTGAAGAAATTGCAAAAGCAAAAACCCAAAATGGGCTATAG
mRNA sequenceShow/hide mRNA sequence
ATGGCGAGAAAAACCCCCTCCCTCCTCCCTCACGTTTTCTCCTTCTCCGGCCACCTTCTTCATCTCCGGCGTTCTCCTTCTCCAACGGTGACAGACCCAGGCGAACCCAC
GTCGTCTTCTTGCTCCACCCGACCATTCGACACCCATGCGGCAACGGTTTCGGCAGCGGCGGTGCGAGCTCTTCAACGAACAGGAGGGTGGCCCACGCACGAACGAGGCT
CCCACCTTCGACGGCGTCTGCACTCACGAGCAGCAGCAACTTCGATTTCCGGAGAATTCCGGCAGCTCAAGCGTTCAAGTCGCTACTTCGACTCTTTGCAACTCTTCACT
CAAATCCATTCATCGCATTGCTTCAATATAAAGCCTGACCACTACAATCTCTCCACTACACTTGCTGTTTGTGCCAACTTTCGCGACATTGCCTTCGGGTCTCAACTCCA
TGGTTATGCTGTTCGATCTGGGCTCAAATTCTACCCTCATGTTGCAAATACCATTCTTTCGCTTTATGCCAAAACAGAGGATTTTGTGTCTTTGAAAAGGGGTTTCCAAG
AGATTGAGAAACCAGATGTTTATTCTTGGACTACACTGTTGTCAGCTTGTACAAAATTGGGTCATGTTGAATATGCAGATGAGATGTTTGATACAATGCCAAAGGGTAAT
GTTGCATGTTGGAATGCTATGATAACTGGGAGTGCAGAAAGTGGACATGATTGGATTGCCATGAACACCTTTTATGAAATGCACAAAATGGGCGTTAAGCCAGATAATTA
CTCTTTTGCTTGTATCTTGAGTTTGTGTACCAAGGAAATTAAAGATTTGGGAAGACAAGTGCATTCTTTGGTGATTAAAGCTGGATATCTTAGCAAAACTTCTGTGATTA
ATGCTTTGATTACTATGTATTTCAGTATTGAGAACCTCGAGGATGCCTATGAGGTTTTTGAGGGAACTGAAGCTGAGGTTCATGATCAGATTACTTATAATGTAATGATA
GACGGCCTGGTCTGCATAAGAAGGAATGAGGAGGCCTTGATTATGTTCAAAGATATGAAAAGGGCATGTCTAAGTCCTACTGAGCTCACCTTTGTGAGCATTATGAGCTC
ATGTTCATTTATACGGGTTGGCCAACAAGTGCACTCCCAAGCCATTAAGCTAGGGTTTGAATCTTTTACTTTAGTAGGAAATTCAACCATAACCATGTACTCTTCTTGTG
GGGAGTTTCAAGCAGCAAATACAGTTTTCCAGACGCTAAGAAAGAAAGATCTCATCTCATGGAATGCCATAATCTCGAGCTATGTCCAAGGGAATTTTGGCAAATCAGCT
GTTCTTGCTTTTCTACAAATGCAGAGGACTGGAATTGGGCCAGATGAGTTTACGTTTGGAAGCTTATTAGGAGTTTCAGAGTTCATGGAGATAGTGGAAATGGCTCACGC
CTTTGTATATAAAAATGGGTTGATCCTCGTAATCGAAATTTTAAATGCATTAGTTTCTGCATACGCGAGGTGTAGGAAGATAACACAGTCTCATCAAGTATTTAGTGAAA
TCAATTCAAAAAATTTAATCTCTTGGAACACTGTCATTTATGGATTTCTGTTAAATGGGCTTCCATTGCAAGCATTGGGGCATTTTTCTAAGCTTATAATGTCAAAGCTC
AAGCCAAGCACATTTACACTCAGCATTGTCCTGAGCATTTGTGCCAACATTTCAACCTTGGACATTGGAAAACAAATTCATGGTTACATTCTCAGATCGGGGAATTTCTC
AGAAACTTCTATATGTAATGGCCTTATTACAATGTATTCTAAATGTGGGGTGTTAGATTGTTCTTTGAGAATTTTTAATGTCATGATTGAAAGGGATATTGTATCATGGA
ATTCTGTAATATCTGCTTATGCACAACATGGGCAGGGGAAGGAAGCTGTGGATTGTTTCAAGGCTATGCAAGACATGTCCTCAATTATGCCTGATCAAGCCACATTCACT
ACTATTCTTTCAGCTTGCAGCCATGCAGGATTAGTTGACGAAGCCTGTCAGATTTTGGATACAATGTTGATAGATTATCATGTTGTTCCTAGTGTGGATCAATTATCTTG
CATTGTCGACCTTATAGGTCGTTCGGGGTATATTGATCAAGCTGAAAGTGTAATAGAAAGTGCACAATATGGAGACCATACACACGTTTGGTGGGCATTGTTTAGCGCCT
GTGCAGCACATGAAAACTTAAGGTTGGGAAGAATTGTTGCGAGAATCCTTCTAGAGAAAGAACGTGAGAATCCATCTGTGTATGTGGTTCTGTCAAATATATATGCCAGT
GCTGGGTGTTGGGAAGAAGCAGCCAATGTGAGGGAATTGATTAAGAAAACTGGTTCACTGAAACAACCAGGCTGCAGTTGGATCAGAATTTCTAATTCCATTGCTTGGCC
AAGCCTAGCAGGAACAGATAAACATGGCTTGAAGACTCGTGCAACAAGAGGAGTCGAGCCCATGACCACTTGTTCTAGGGGACAGATGGGTTTGGTAAAAGATTCAAACA
AGGAACTTGTGGAAATTTTTTTAAATCATGCCAAAATTTTTGGGAGCTTTGTCTTCATTTCTCGACCTCTTGCTTCTAAAGGATGCCCTCCCTCAACTGGATGCTGGTTT
AGCCAAGTTATCCGATTTGAACTGGATGAGGAGGAACAATCGATAAAAAATTTGGAAACTATAGATGGAGGTGTTGAAGACGAAGGCGCTGCAGATACAGAAGAGAAAGA
TACTCATGGGCAACTTGATTCGAAGAAATTTGAAAATCTTAATGCAAATAGTGAAGACGTGAAGAACAACGACAAGGAAGAAGAAGAAGAAGCTTACGACAATATTCAAG
AAGAAATAGACGATCACAGTGAAGAAATTGCAAAAGCAAAAACCCAAAATGGGCTATAGAGATCAGGAAGAAAATCGACGAGTGATAATTCTTGTTTAATTTCTTCTAAT
TAGAACAACACTCAAATCATTTAGATAATTCTTCTCCAACAAACTCAAAACCAATTACACAAGAATGTATAAAACTCCCATACGATGTAATATTTAGGAGCAAGGGAACA
ATAAAACAAAACCATATATTGAATCTTTGAATATTTAATTAATATTCTATATTAATTTAGATATTAACAATCACAAAATTGATTAGATACAGTTAGAGATAATTAGGGAC
AAAAACAAAAAGAAGAATAAATAAGTGAAGTTGCAAAAAACAGAAAATACATATAGTGTATGATCCTAGAAATGATGAGTTTTAAGAAAGTTTTGCCAAAGCCTTTTGAT
AATGTTCAGACTTGAGTTTTTCCAACTTGCTAGAGAGTTTGGCCTGCTTGTATTCATATTCACTTCTACAAATTTCCACTTCCCCATATTTCCTCTTCAAATCTCTAACT
CTTTCACTATCCATCAAATCTATGTCCACTCCTTCCTTGTAAAGAAACTCAACAAATTCTTTGTTCATGAAATCATTCGACAAATCAAAATTCATTCCCACCGTTGATAT
CAT
Protein sequenceShow/hide protein sequence
MARKTPSLLPHVFSFSGHLLHLRRSPSPTVTDPGEPTSSSCSTRPFDTHAATVSAAAVRALQRTGGWPTHERGSHLRRRLHSRAAATSISGEFRQLKRSSRYFDSLQLFT
QIHSSHCFNIKPDHYNLSTTLAVCANFRDIAFGSQLHGYAVRSGLKFYPHVANTILSLYAKTEDFVSLKRGFQEIEKPDVYSWTTLLSACTKLGHVEYADEMFDTMPKGN
VACWNAMITGSAESGHDWIAMNTFYEMHKMGVKPDNYSFACILSLCTKEIKDLGRQVHSLVIKAGYLSKTSVINALITMYFSIENLEDAYEVFEGTEAEVHDQITYNVMI
DGLVCIRRNEEALIMFKDMKRACLSPTELTFVSIMSSCSFIRVGQQVHSQAIKLGFESFTLVGNSTITMYSSCGEFQAANTVFQTLRKKDLISWNAIISSYVQGNFGKSA
VLAFLQMQRTGIGPDEFTFGSLLGVSEFMEIVEMAHAFVYKNGLILVIEILNALVSAYARCRKITQSHQVFSEINSKNLISWNTVIYGFLLNGLPLQALGHFSKLIMSKL
KPSTFTLSIVLSICANISTLDIGKQIHGYILRSGNFSETSICNGLITMYSKCGVLDCSLRIFNVMIERDIVSWNSVISAYAQHGQGKEAVDCFKAMQDMSSIMPDQATFT
TILSACSHAGLVDEACQILDTMLIDYHVVPSVDQLSCIVDLIGRSGYIDQAESVIESAQYGDHTHVWWALFSACAAHENLRLGRIVARILLEKERENPSVYVVLSNIYAS
AGCWEEAANVRELIKKTGSLKQPGCSWIRISNSIAWPSLAGTDKHGLKTRATRGVEPMTTCSRGQMGLVKDSNKELVEIFLNHAKIFGSFVFISRPLASKGCPPSTGCWF
SQVIRFELDEEEQSIKNLETIDGGVEDEGAADTEEKDTHGQLDSKKFENLNANSEDVKNNDKEEEEEAYDNIQEEIDDHSEEIAKAKTQNGL