CuGenDBv2

Sed0005910 (gene) of Chayote v1 genome

Gene ID: Sed0005910
Organism: Sechium edule (Chayote v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: LG05:32408505..32410753
RNA-Seq Expression: Sed0005910
Synteny: Sed0005910
Gene Ontology terms:
GO:0009451 - RNA modification (biological process)
GO:0043231 - intracellular membrane-bounded organelle (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR032867 - DYW domain


Homology
GenBank top hits (e value, % identity, alignment)
KAG7011986.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 5.2e-263; identity: 89.46%)
Query:  LLLQIRTFEVNEQALSLFKEMYGLGFLPDEFTLGSVLRGCAGLRSLRAGQEVHACLMKCGFEMNSVVGSSLAHMYMKSGSLSDGVKLIKSMPIRNVVAWN
        ++  +  FE NEQAL LF+EMYGLG LPDEFTLGSVLRGCAGLRSL AGQEVHACLMKCGFE+N VVGSSLAHMYMKSGSLSDG KLIKSMPIRNVVAWN
Subjt:  LLLQIRTFEVNEQALSLFKEMYGLGFLPDEFTLGSVLRGCAGLRSLRAGQEVHACLMKCGFEMNSVVGSSLAHMYMKSGSLSDGVKLIKSMPIRNVVAWN

Query:  TLIAGNAQNGCLEEVLHQYRMMKMAGFRPDKITFVSVISACSELATLGQGQQIHAEVIKAGTGSALAVISSLVSLYSRSGCLEDSVKVFSDREDIDVVLW
        TLIAG AQNGC EEVL+QY MMKMAGFRPDKITFVSVISACSELATLGQGQQIHAEVIKAG GS +AV+SSL+S+YSRSGCLEDSVK F DRED DVVLW
Subjt:  TLIAGNAQNGCLEEVLHQYRMMKMAGFRPDKITFVSVISACSELATLGQGQQIHAEVIKAGTGSALAVISSLVSLYSRSGCLEDSVKVFSDREDIDVVLW

Query:  SSMIAAYGFHGRGEEAIELFHQMEELKIEANEVAFLSLLYACSHCGLKEKGTEYLDLMVKKYKLKPRIEHYTCVVDLLGRAGCLVEAEAMVRSMPVKADG
        S+MIAAYGFHGRGEEAIELFHQMEELK+EANEV FLSLLYACSHCGLKEKGTEYLDLMV++YKLKPRIEHYTCVVDLLGRAG L EAE M+RSMPVKADG
Subjt:  SSMIAAYGFHGRGEEAIELFHQMEELKIEANEVAFLSLLYACSHCGLKEKGTEYLDLMVKKYKLKPRIEHYTCVVDLLGRAGCLVEAEAMVRSMPVKADG

Query:  IIWKTLLSACKLHKEAEMAKRISEEVLKLHPLDAASYVLLSNIHASARNWSDVSEIRKAMRDRNVRKEPGISWLELKNVVHQFSMADNSHPQHLEIDSYL
        IIWKTLLSACKLHK AEMAKRISEE+LKL PLDAASYVLLSNIHASARNW DVSEIRKAMRDRNV+KEPGISWLELKN+VHQFSM D SHPQ+LEIDSYL
Subjt:  IIWKTLLSACKLHKEAEMAKRISEEVLKLHPLDAASYVLLSNIHASARNWSDVSEIRKAMRDRNVRKEPGISWLELKNVVHQFSMADNSHPQHLEIDSYL

Query:  KELMSEMKLHGYVPDISSVLHDMDNEEKEYNLAHHSEKLAIAFALMNTPEGVPIRVMKNLRVCSDCHDAIKCISKIRNREIIVRDASRFHHFKDGQCSCG
        KELMSEMKLHGYVPDI SVLHDMDNEEKEYNLAHHSEK AIAFALMN PEGVPIRVMKNLRVC+DCH+AIKCISKIRNREIIVRD SRFHHFKDG+CSCG
Subjt:  KELMSEMKLHGYVPDISSVLHDMDNEEKEYNLAHHSEKLAIAFALMNTPEGVPIRVMKNLRVCSDCHDAIKCISKIRNREIIVRDASRFHHFKDGQCSCG

Query:  NYW
        NYW
Subjt:  NYW

KGN43638.1 hypothetical protein Csa_017212 [Cucumis sativus] (e value: 1.2e-270; identity: 84.73%)
Query:  MVRPISLFSFASIMYSSRLAFWRKTSSFF----------IHFQSSFELLLQIRTFEVNEQALSLFKEMYGLGFLPDEFTLGSVLRGCAGLRSLRAGQEVH
        MVRPIS+FS +SIM+ +RLAFWRKT  FF          +HFQSSF+LLLQIRTFE N+QALSLFKEMYGLGFLPDEFTLGSVLRGCAGLRSL AGQEVH
Subjt:  MVRPISLFSFASIMYSSRLAFWRKTSSFF----------IHFQSSFELLLQIRTFEVNEQALSLFKEMYGLGFLPDEFTLGSVLRGCAGLRSLRAGQEVH

Query:  ACLMKCGFEMNSVVGSSLAHMYMKSGSLSDGVKLIKSMPIRNVVAWNTLIAGNAQNGCLEEVLHQYRMMKMAGFRPDKITFVSVISACSELATLGQGQQI
        ACL+KCGFE++SVVGSSLAHMY+KSGSLSDG KLIKSMPIR VVAWNTLIAG AQNGC EEVL+QY MMKMAGFRPDKITFVSV+SACSELATLGQGQQI
Subjt:  ACLMKCGFEMNSVVGSSLAHMYMKSGSLSDGVKLIKSMPIRNVVAWNTLIAGNAQNGCLEEVLHQYRMMKMAGFRPDKITFVSVISACSELATLGQGQQI

Query:  HAEVIKAGTGSALAVISSLVSLYSRSGCLEDSVKVFSDREDIDVVLWSSMIAAYGFHGRGEEAIELFHQMEELKIEANEVAFLSLLYACSHCGLKEKGTE
        HAEVIKAG  S LAV+SSL+S+YSRSGCLEDS+K F DRE+ DVVLWSSMIAAYGFHGRGEEA+ELFHQME+LK+EANEV FLSLLYACSH GLKEKGTE
Subjt:  HAEVIKAGTGSALAVISSLVSLYSRSGCLEDSVKVFSDREDIDVVLWSSMIAAYGFHGRGEEAIELFHQMEELKIEANEVAFLSLLYACSHCGLKEKGTE

Query:  YLDLMVKKYKLKPRIEHYTCVVDLLGRAGCLVEAEAMVRSMPVKADGIIWKTLLSACKLHKEAEMAKRISEEVLKLHPLDAASYVLLSNIHASARNWSDV
        Y DLMVKKYKLKPRIEHYTCVVDLLGRAG L EAE M+RSMPV+ DGIIWKTLL+ACKLHKEAEMA+RISEE++KL PLDAASYVLLSNIHASARNW +V
Subjt:  YLDLMVKKYKLKPRIEHYTCVVDLLGRAGCLVEAEAMVRSMPVKADGIIWKTLLSACKLHKEAEMAKRISEEVLKLHPLDAASYVLLSNIHASARNWSDV

Query:  SEIRKAMRDRNVRKEPGISWLELKNVVHQFSMADNSHPQHLEIDSYLKELMSEMKLHGYVPDISSVLHDMDNEEKEYNLAHHSEKLAIAFALMNTPEGVP
        S+IRKAMRDR+VRKEPGISWLELKN+VHQFSM D SHPQ+ EID YLKELMSE+K HGYVP++ SVLHDMDNEEKEYNLAHHSEK AIAFALMNT E VP
Subjt:  SEIRKAMRDRNVRKEPGISWLELKNVVHQFSMADNSHPQHLEIDSYLKELMSEMKLHGYVPDISSVLHDMDNEEKEYNLAHHSEKLAIAFALMNTPEGVP

Query:  IRVMKNLRVCSDCHDAIKCISKIRNREIIVRDASRFHHFKDGQCSCGNYW
        IRVMKNLRVC DCH+AIKCIS+IRNREIIVRDASRFHHFKDG+CSCGNYW
Subjt:  IRVMKNLRVCSDCHDAIKCISKIRNREIIVRDASRFHHFKDGQCSCGNYW

XP_022147887.1 pentatricopeptide repeat-containing protein At2g41080 isoform X2 [Momordica charantia] (e value: 9.6e-265; identity: 81.63%)
Query:  MVRPISLFSFASIMYSSRLAFWRKTSSFFIHF--------------QSSFE------------LLLQIRTFEVNEQALSLFKEMYGLGFLPDEFTLGSVL
        MVRPISL   A I++ +RL FWR+ +SFF H+              Q  F+            ++  +  FE NEQALSLF+EMYGLGFLPDEFTLGSVL
Subjt:  MVRPISLFSFASIMYSSRLAFWRKTSSFFIHF--------------QSSFE------------LLLQIRTFEVNEQALSLFKEMYGLGFLPDEFTLGSVL

Query:  RGCAGLRSLRAGQEVHACLMKCGFEMNSVVGSSLAHMYMKSGSLSDGVKLIKSMPIRNVVAWNTLIAGNAQNGCLEEVLHQYRMMKMAGFRPDKITFVSV
        RGCAGLRS+RAGQEVHACLMKCGFE+N VVGSS+AHMYMKSGSLSDG K+IKSMP RNVVAWNTLIAG AQNGC EEVL+QY MMKMAGFRPDKITFVSV
Subjt:  RGCAGLRSLRAGQEVHACLMKCGFEMNSVVGSSLAHMYMKSGSLSDGVKLIKSMPIRNVVAWNTLIAGNAQNGCLEEVLHQYRMMKMAGFRPDKITFVSV

Query:  ISACSELATLGQGQQIHAEVIKAGTGSALAVISSLVSLYSRSGCLEDSVKVFSDREDIDVVLWSSMIAAYGFHGRGEEAIELFHQMEELKIEANEVAFLS
        ISACSELATLGQGQQIHAE IKAG GS +AVISSL+S+YSRSGCL+DSVK F DRED DVVLWS+MIAAYGFHGRGEE IELFHQMEELK+EANEV FLS
Subjt:  ISACSELATLGQGQQIHAEVIKAGTGSALAVISSLVSLYSRSGCLEDSVKVFSDREDIDVVLWSSMIAAYGFHGRGEEAIELFHQMEELKIEANEVAFLS

Query:  LLYACSHCGLKEKGTEYLDLMVKKYKLKPRIEHYTCVVDLLGRAGCLVEAEAMVRSMPVKADGIIWKTLLSACKLHKEAEMAKRISEEVLKLHPLDAASY
        LLYACSHCGLKEKGTEYLDLMVKKYKLKPRIEHYTCVVDLLGRAG L EAEA +RSMPVKADGIIWKTLLSACK+HK+AEMAKRISE++LKL PLDAASY
Subjt:  LLYACSHCGLKEKGTEYLDLMVKKYKLKPRIEHYTCVVDLLGRAGCLVEAEAMVRSMPVKADGIIWKTLLSACKLHKEAEMAKRISEEVLKLHPLDAASY

Query:  VLLSNIHASARNWSDVSEIRKAMRDRNVRKEPGISWLELKNVVHQFSMADNSHPQHLEIDSYLKELMSEMKLHGYVPDISSVLHDMDNEEKEYNLAHHSE
        VLLSNIHASARNW DVSEIRKAMRDR VRKEPGISWLELK+ VHQF+M+D SHP++LEI+ YLKELM+EMKL GYVPDI SVLHDMDNEEKEYNLAHHSE
Subjt:  VLLSNIHASARNWSDVSEIRKAMRDRNVRKEPGISWLELKNVVHQFSMADNSHPQHLEIDSYLKELMSEMKLHGYVPDISSVLHDMDNEEKEYNLAHHSE

Query:  KLAIAFALMNTPEGVPIRVMKNLRVCSDCHDAIKCISKIRNREIIVRDASRFHHFKDGQCSCGNYW
        KLAIAFALMNTPE  PIRVMKNLRVCSDCHDAIKCISKIRNREIIVRDASRFHHF+DG+CSCGNYW
Subjt:  KLAIAFALMNTPEGVPIRVMKNLRVCSDCHDAIKCISKIRNREIIVRDASRFHHFKDGQCSCGNYW

XP_022953052.1 pentatricopeptide repeat-containing protein At2g41080 [Cucurbita moschata] (e value: 5.2e-263; identity: 89.46%)
Query:  LLLQIRTFEVNEQALSLFKEMYGLGFLPDEFTLGSVLRGCAGLRSLRAGQEVHACLMKCGFEMNSVVGSSLAHMYMKSGSLSDGVKLIKSMPIRNVVAWN
        ++  +  FE NEQAL LF+EMYGLG LPDEFTLGSVLRGCAGLRSL AGQEVHACLMKCGFE+N VVGSSLAHMYMKSGSLSDG KLIKSMPIRNVVAWN
Subjt:  LLLQIRTFEVNEQALSLFKEMYGLGFLPDEFTLGSVLRGCAGLRSLRAGQEVHACLMKCGFEMNSVVGSSLAHMYMKSGSLSDGVKLIKSMPIRNVVAWN

Query:  TLIAGNAQNGCLEEVLHQYRMMKMAGFRPDKITFVSVISACSELATLGQGQQIHAEVIKAGTGSALAVISSLVSLYSRSGCLEDSVKVFSDREDIDVVLW
        TLIAG AQNGC EEVL+QY MMKMAGFRPDKITFVSVISACSELATLGQGQQIHAEVIKAG GS +AV+SSL+S+YSRSGCLEDSVK F DRED DVVLW
Subjt:  TLIAGNAQNGCLEEVLHQYRMMKMAGFRPDKITFVSVISACSELATLGQGQQIHAEVIKAGTGSALAVISSLVSLYSRSGCLEDSVKVFSDREDIDVVLW

Query:  SSMIAAYGFHGRGEEAIELFHQMEELKIEANEVAFLSLLYACSHCGLKEKGTEYLDLMVKKYKLKPRIEHYTCVVDLLGRAGCLVEAEAMVRSMPVKADG
        S+MIAAYGFHGRGEEAIELFHQMEELK+EANEV FLSLLYACSHCGLKEKGTEYLDLMV++YKLKPRIEHYTCVVDLLGRAG L EAE M+RSMPVKADG
Subjt:  SSMIAAYGFHGRGEEAIELFHQMEELKIEANEVAFLSLLYACSHCGLKEKGTEYLDLMVKKYKLKPRIEHYTCVVDLLGRAGCLVEAEAMVRSMPVKADG

Query:  IIWKTLLSACKLHKEAEMAKRISEEVLKLHPLDAASYVLLSNIHASARNWSDVSEIRKAMRDRNVRKEPGISWLELKNVVHQFSMADNSHPQHLEIDSYL
        IIWKTLLSACKLHK AEMAKRISEE+LKL PLDAASYVLLSNIHASARNW DVSEIRKAMRDRNV+KEPGISWLELKN+VHQFSM D SHPQ+LEIDSYL
Subjt:  IIWKTLLSACKLHKEAEMAKRISEEVLKLHPLDAASYVLLSNIHASARNWSDVSEIRKAMRDRNVRKEPGISWLELKNVVHQFSMADNSHPQHLEIDSYL

Query:  KELMSEMKLHGYVPDISSVLHDMDNEEKEYNLAHHSEKLAIAFALMNTPEGVPIRVMKNLRVCSDCHDAIKCISKIRNREIIVRDASRFHHFKDGQCSCG
        KELMSEMKLHGYVPDI SVLHDMDNEEKEYNLAHHSEK AIAFALMN PEGVPIRVMKNLRVC+DCH+AIKCISKIRNREIIVRD SRFHHFKDG+CSCG
Subjt:  KELMSEMKLHGYVPDISSVLHDMDNEEKEYNLAHHSEKLAIAFALMNTPEGVPIRVMKNLRVCSDCHDAIKCISKIRNREIIVRDASRFHHFKDGQCSCG

Query:  NYW
        NYW
Subjt:  NYW

XP_022969026.1 pentatricopeptide repeat-containing protein At2g41080 [Cucurbita maxima] (e value: 2.8e-264; identity: 89.86%)
Query:  LLLQIRTFEVNEQALSLFKEMYGLGFLPDEFTLGSVLRGCAGLRSLRAGQEVHACLMKCGFEMNSVVGSSLAHMYMKSGSLSDGVKLIKSMPIRNVVAWN
        ++  +  FE NEQALSLF+EMYGLGFLPDEFTLGSVLRGCAGLRSL AGQEVHACLMKCGFE+N VVGSSLAHMYMKSGSLSDG KLIKSMPIRNVVAWN
Subjt:  LLLQIRTFEVNEQALSLFKEMYGLGFLPDEFTLGSVLRGCAGLRSLRAGQEVHACLMKCGFEMNSVVGSSLAHMYMKSGSLSDGVKLIKSMPIRNVVAWN

Query:  TLIAGNAQNGCLEEVLHQYRMMKMAGFRPDKITFVSVISACSELATLGQGQQIHAEVIKAGTGSALAVISSLVSLYSRSGCLEDSVKVFSDREDIDVVLW
        TLIAG AQNGC EEVL+QY MMKMAGFRPDKITFVSVISACSELATLGQGQQIHAEVIKAG GS +AV+SSL+S+YSRSGCLEDSVK F DRED DVVLW
Subjt:  TLIAGNAQNGCLEEVLHQYRMMKMAGFRPDKITFVSVISACSELATLGQGQQIHAEVIKAGTGSALAVISSLVSLYSRSGCLEDSVKVFSDREDIDVVLW

Query:  SSMIAAYGFHGRGEEAIELFHQMEELKIEANEVAFLSLLYACSHCGLKEKGTEYLDLMVKKYKLKPRIEHYTCVVDLLGRAGCLVEAEAMVRSMPVKADG
        S+MIAAYGFHGRGEEAIELFHQMEELK+EANEV FLSLLYACSHCGLKEKGTEYLDLMV++YKLKPRIEHYTCVVDLLGRAG L EAE M+RSMPVKADG
Subjt:  SSMIAAYGFHGRGEEAIELFHQMEELKIEANEVAFLSLLYACSHCGLKEKGTEYLDLMVKKYKLKPRIEHYTCVVDLLGRAGCLVEAEAMVRSMPVKADG

Query:  IIWKTLLSACKLHKEAEMAKRISEEVLKLHPLDAASYVLLSNIHASARNWSDVSEIRKAMRDRNVRKEPGISWLELKNVVHQFSMADNSHPQHLEIDSYL
        IIWKTLLSACKLHK+AEMAKRISEE+LKL PLDAASYVLLSNIHASARNW DVSEIRKAMRDRNV+KEPGISWLELKN+VHQFSM D SHPQ+LEIDSYL
Subjt:  IIWKTLLSACKLHKEAEMAKRISEEVLKLHPLDAASYVLLSNIHASARNWSDVSEIRKAMRDRNVRKEPGISWLELKNVVHQFSMADNSHPQHLEIDSYL

Query:  KELMSEMKLHGYVPDISSVLHDMDNEEKEYNLAHHSEKLAIAFALMNTPEGVPIRVMKNLRVCSDCHDAIKCISKIRNREIIVRDASRFHHFKDGQCSCG
        KELMSEMKLHGYVPDI SVLHDMDNEEKEYNLAHHSEK AIAFALMN PEGVPIRVMKNLRVC+DCH+AIKCISKIRNREIIVRD SRFHHFKDG+CSCG
Subjt:  KELMSEMKLHGYVPDISSVLHDMDNEEKEYNLAHHSEKLAIAFALMNTPEGVPIRVMKNLRVCSDCHDAIKCISKIRNREIIVRDASRFHHFKDGQCSCG

Query:  NYW
        NYW
Subjt:  NYW

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K2D8 DYW_deaminase domain-containing protein (e value: 5.6e-271; identity: 84.73%)
Query:  MVRPISLFSFASIMYSSRLAFWRKTSSFF----------IHFQSSFELLLQIRTFEVNEQALSLFKEMYGLGFLPDEFTLGSVLRGCAGLRSLRAGQEVH
        MVRPIS+FS +SIM+ +RLAFWRKT  FF          +HFQSSF+LLLQIRTFE N+QALSLFKEMYGLGFLPDEFTLGSVLRGCAGLRSL AGQEVH
Subjt:  MVRPISLFSFASIMYSSRLAFWRKTSSFF----------IHFQSSFELLLQIRTFEVNEQALSLFKEMYGLGFLPDEFTLGSVLRGCAGLRSLRAGQEVH

Query:  ACLMKCGFEMNSVVGSSLAHMYMKSGSLSDGVKLIKSMPIRNVVAWNTLIAGNAQNGCLEEVLHQYRMMKMAGFRPDKITFVSVISACSELATLGQGQQI
        ACL+KCGFE++SVVGSSLAHMY+KSGSLSDG KLIKSMPIR VVAWNTLIAG AQNGC EEVL+QY MMKMAGFRPDKITFVSV+SACSELATLGQGQQI
Subjt:  ACLMKCGFEMNSVVGSSLAHMYMKSGSLSDGVKLIKSMPIRNVVAWNTLIAGNAQNGCLEEVLHQYRMMKMAGFRPDKITFVSVISACSELATLGQGQQI

Query:  HAEVIKAGTGSALAVISSLVSLYSRSGCLEDSVKVFSDREDIDVVLWSSMIAAYGFHGRGEEAIELFHQMEELKIEANEVAFLSLLYACSHCGLKEKGTE
        HAEVIKAG  S LAV+SSL+S+YSRSGCLEDS+K F DRE+ DVVLWSSMIAAYGFHGRGEEA+ELFHQME+LK+EANEV FLSLLYACSH GLKEKGTE
Subjt:  HAEVIKAGTGSALAVISSLVSLYSRSGCLEDSVKVFSDREDIDVVLWSSMIAAYGFHGRGEEAIELFHQMEELKIEANEVAFLSLLYACSHCGLKEKGTE

Query:  YLDLMVKKYKLKPRIEHYTCVVDLLGRAGCLVEAEAMVRSMPVKADGIIWKTLLSACKLHKEAEMAKRISEEVLKLHPLDAASYVLLSNIHASARNWSDV
        Y DLMVKKYKLKPRIEHYTCVVDLLGRAG L EAE M+RSMPV+ DGIIWKTLL+ACKLHKEAEMA+RISEE++KL PLDAASYVLLSNIHASARNW +V
Subjt:  YLDLMVKKYKLKPRIEHYTCVVDLLGRAGCLVEAEAMVRSMPVKADGIIWKTLLSACKLHKEAEMAKRISEEVLKLHPLDAASYVLLSNIHASARNWSDV

Query:  SEIRKAMRDRNVRKEPGISWLELKNVVHQFSMADNSHPQHLEIDSYLKELMSEMKLHGYVPDISSVLHDMDNEEKEYNLAHHSEKLAIAFALMNTPEGVP
        S+IRKAMRDR+VRKEPGISWLELKN+VHQFSM D SHPQ+ EID YLKELMSE+K HGYVP++ SVLHDMDNEEKEYNLAHHSEK AIAFALMNT E VP
Subjt:  SEIRKAMRDRNVRKEPGISWLELKNVVHQFSMADNSHPQHLEIDSYLKELMSEMKLHGYVPDISSVLHDMDNEEKEYNLAHHSEKLAIAFALMNTPEGVP

Query:  IRVMKNLRVCSDCHDAIKCISKIRNREIIVRDASRFHHFKDGQCSCGNYW
        IRVMKNLRVC DCH+AIKCIS+IRNREIIVRDASRFHHFKDG+CSCGNYW
Subjt:  IRVMKNLRVCSDCHDAIKCISKIRNREIIVRDASRFHHFKDGQCSCGNYW

A0A6J1D3K7 pentatricopeptide repeat-containing protein At2g41080 isoform X2 (e value: 4.6e-265; identity: 81.63%)
Query:  MVRPISLFSFASIMYSSRLAFWRKTSSFFIHF--------------QSSFE------------LLLQIRTFEVNEQALSLFKEMYGLGFLPDEFTLGSVL
        MVRPISL   A I++ +RL FWR+ +SFF H+              Q  F+            ++  +  FE NEQALSLF+EMYGLGFLPDEFTLGSVL
Subjt:  MVRPISLFSFASIMYSSRLAFWRKTSSFFIHF--------------QSSFE------------LLLQIRTFEVNEQALSLFKEMYGLGFLPDEFTLGSVL

Query:  RGCAGLRSLRAGQEVHACLMKCGFEMNSVVGSSLAHMYMKSGSLSDGVKLIKSMPIRNVVAWNTLIAGNAQNGCLEEVLHQYRMMKMAGFRPDKITFVSV
        RGCAGLRS+RAGQEVHACLMKCGFE+N VVGSS+AHMYMKSGSLSDG K+IKSMP RNVVAWNTLIAG AQNGC EEVL+QY MMKMAGFRPDKITFVSV
Subjt:  RGCAGLRSLRAGQEVHACLMKCGFEMNSVVGSSLAHMYMKSGSLSDGVKLIKSMPIRNVVAWNTLIAGNAQNGCLEEVLHQYRMMKMAGFRPDKITFVSV

Query:  ISACSELATLGQGQQIHAEVIKAGTGSALAVISSLVSLYSRSGCLEDSVKVFSDREDIDVVLWSSMIAAYGFHGRGEEAIELFHQMEELKIEANEVAFLS
        ISACSELATLGQGQQIHAE IKAG GS +AVISSL+S+YSRSGCL+DSVK F DRED DVVLWS+MIAAYGFHGRGEE IELFHQMEELK+EANEV FLS
Subjt:  ISACSELATLGQGQQIHAEVIKAGTGSALAVISSLVSLYSRSGCLEDSVKVFSDREDIDVVLWSSMIAAYGFHGRGEEAIELFHQMEELKIEANEVAFLS

Query:  LLYACSHCGLKEKGTEYLDLMVKKYKLKPRIEHYTCVVDLLGRAGCLVEAEAMVRSMPVKADGIIWKTLLSACKLHKEAEMAKRISEEVLKLHPLDAASY
        LLYACSHCGLKEKGTEYLDLMVKKYKLKPRIEHYTCVVDLLGRAG L EAEA +RSMPVKADGIIWKTLLSACK+HK+AEMAKRISE++LKL PLDAASY
Subjt:  LLYACSHCGLKEKGTEYLDLMVKKYKLKPRIEHYTCVVDLLGRAGCLVEAEAMVRSMPVKADGIIWKTLLSACKLHKEAEMAKRISEEVLKLHPLDAASY

Query:  VLLSNIHASARNWSDVSEIRKAMRDRNVRKEPGISWLELKNVVHQFSMADNSHPQHLEIDSYLKELMSEMKLHGYVPDISSVLHDMDNEEKEYNLAHHSE
        VLLSNIHASARNW DVSEIRKAMRDR VRKEPGISWLELK+ VHQF+M+D SHP++LEI+ YLKELM+EMKL GYVPDI SVLHDMDNEEKEYNLAHHSE
Subjt:  VLLSNIHASARNWSDVSEIRKAMRDRNVRKEPGISWLELKNVVHQFSMADNSHPQHLEIDSYLKELMSEMKLHGYVPDISSVLHDMDNEEKEYNLAHHSE

Query:  KLAIAFALMNTPEGVPIRVMKNLRVCSDCHDAIKCISKIRNREIIVRDASRFHHFKDGQCSCGNYW
        KLAIAFALMNTPE  PIRVMKNLRVCSDCHDAIKCISKIRNREIIVRDASRFHHF+DG+CSCGNYW
Subjt:  KLAIAFALMNTPEGVPIRVMKNLRVCSDCHDAIKCISKIRNREIIVRDASRFHHFKDGQCSCGNYW

A0A6J1D3P8 pentatricopeptide repeat-containing protein At2g41080 isoform X1 (e value: 1.3e-259; identity: 87.87%)
Query:  LLLQIRTFEVNEQALSLFKEMYGLGFLPDEFTLGSVLRGCAGLRSLRAGQEVHACLMKCGFEMNSVVGSSLAHMYMKSGSLSDGVKLIKSMPIRNVVAWN
        ++  +  FE NEQALSLF+EMYGLGFLPDEFTLGSVLRGCAGLRS+RAGQEVHACLMKCGFE+N VVGSS+AHMYMKSGSLSDG K+IKSMP RNVVAWN
Subjt:  LLLQIRTFEVNEQALSLFKEMYGLGFLPDEFTLGSVLRGCAGLRSLRAGQEVHACLMKCGFEMNSVVGSSLAHMYMKSGSLSDGVKLIKSMPIRNVVAWN

Query:  TLIAGNAQNGCLEEVLHQYRMMKMAGFRPDKITFVSVISACSELATLGQGQQIHAEVIKAGTGSALAVISSLVSLYSRSGCLEDSVKVFSDREDIDVVLW
        TLIAG AQNGC EEVL+QY MMKMAGFRPDKITFVSVISACSELATLGQGQQIHAE IKAG GS +AVISSL+S+YSRSGCL+DSVK F DRED DVVLW
Subjt:  TLIAGNAQNGCLEEVLHQYRMMKMAGFRPDKITFVSVISACSELATLGQGQQIHAEVIKAGTGSALAVISSLVSLYSRSGCLEDSVKVFSDREDIDVVLW

Query:  SSMIAAYGFHGRGEEAIELFHQMEELKIEANEVAFLSLLYACSHCGLKEKGTEYLDLMVKKYKLKPRIEHYTCVVDLLGRAGCLVEAEAMVRSMPVKADG
        S+MIAAYGFHGRGEE IELFHQMEELK+EANEV FLSLLYACSHCGLKEKGTEYLDLMVKKYKLKPRIEHYTCVVDLLGRAG L EAEA +RSMPVKADG
Subjt:  SSMIAAYGFHGRGEEAIELFHQMEELKIEANEVAFLSLLYACSHCGLKEKGTEYLDLMVKKYKLKPRIEHYTCVVDLLGRAGCLVEAEAMVRSMPVKADG

Query:  IIWKTLLSACKLHKEAEMAKRISEEVLKLHPLDAASYVLLSNIHASARNWSDVSEIRKAMRDRNVRKEPGISWLELKNVVHQFSMADNSHPQHLEIDSYL
        IIWKTLLSACK+HK+AEMAKRISE++LKL PLDAASYVLLSNIHASARNW DVSEIRKAMRDR VRKEPGISWLELK+ VHQF+M+D SHP++LEI+ YL
Subjt:  IIWKTLLSACKLHKEAEMAKRISEEVLKLHPLDAASYVLLSNIHASARNWSDVSEIRKAMRDRNVRKEPGISWLELKNVVHQFSMADNSHPQHLEIDSYL

Query:  KELMSEMKLHGYVPDISSVLHDMDNEEKEYNLAHHSEKLAIAFALMNTPEGVPIRVMKNLRVCSDCHDAIKCISKIRNREIIVRDASRFHHFKDGQCSCG
        KELM+EMKL GYVPDI SVLHDMDNEEKEYNLAHHSEKLAIAFALMNTPE  PIRVMKNLRVCSDCHDAIKCISKIRNREIIVRDASRFHHF+DG+CSCG
Subjt:  KELMSEMKLHGYVPDISSVLHDMDNEEKEYNLAHHSEKLAIAFALMNTPEGVPIRVMKNLRVCSDCHDAIKCISKIRNREIIVRDASRFHHFKDGQCSCG

Query:  NYW
        NYW
Subjt:  NYW

A0A6J1GM53 pentatricopeptide repeat-containing protein At2g41080 (e value: 2.5e-263; identity: 89.46%)
Query:  LLLQIRTFEVNEQALSLFKEMYGLGFLPDEFTLGSVLRGCAGLRSLRAGQEVHACLMKCGFEMNSVVGSSLAHMYMKSGSLSDGVKLIKSMPIRNVVAWN
        ++  +  FE NEQAL LF+EMYGLG LPDEFTLGSVLRGCAGLRSL AGQEVHACLMKCGFE+N VVGSSLAHMYMKSGSLSDG KLIKSMPIRNVVAWN
Subjt:  LLLQIRTFEVNEQALSLFKEMYGLGFLPDEFTLGSVLRGCAGLRSLRAGQEVHACLMKCGFEMNSVVGSSLAHMYMKSGSLSDGVKLIKSMPIRNVVAWN

Query:  TLIAGNAQNGCLEEVLHQYRMMKMAGFRPDKITFVSVISACSELATLGQGQQIHAEVIKAGTGSALAVISSLVSLYSRSGCLEDSVKVFSDREDIDVVLW
        TLIAG AQNGC EEVL+QY MMKMAGFRPDKITFVSVISACSELATLGQGQQIHAEVIKAG GS +AV+SSL+S+YSRSGCLEDSVK F DRED DVVLW
Subjt:  TLIAGNAQNGCLEEVLHQYRMMKMAGFRPDKITFVSVISACSELATLGQGQQIHAEVIKAGTGSALAVISSLVSLYSRSGCLEDSVKVFSDREDIDVVLW

Query:  SSMIAAYGFHGRGEEAIELFHQMEELKIEANEVAFLSLLYACSHCGLKEKGTEYLDLMVKKYKLKPRIEHYTCVVDLLGRAGCLVEAEAMVRSMPVKADG
        S+MIAAYGFHGRGEEAIELFHQMEELK+EANEV FLSLLYACSHCGLKEKGTEYLDLMV++YKLKPRIEHYTCVVDLLGRAG L EAE M+RSMPVKADG
Subjt:  SSMIAAYGFHGRGEEAIELFHQMEELKIEANEVAFLSLLYACSHCGLKEKGTEYLDLMVKKYKLKPRIEHYTCVVDLLGRAGCLVEAEAMVRSMPVKADG

Query:  IIWKTLLSACKLHKEAEMAKRISEEVLKLHPLDAASYVLLSNIHASARNWSDVSEIRKAMRDRNVRKEPGISWLELKNVVHQFSMADNSHPQHLEIDSYL
        IIWKTLLSACKLHK AEMAKRISEE+LKL PLDAASYVLLSNIHASARNW DVSEIRKAMRDRNV+KEPGISWLELKN+VHQFSM D SHPQ+LEIDSYL
Subjt:  IIWKTLLSACKLHKEAEMAKRISEEVLKLHPLDAASYVLLSNIHASARNWSDVSEIRKAMRDRNVRKEPGISWLELKNVVHQFSMADNSHPQHLEIDSYL

Query:  KELMSEMKLHGYVPDISSVLHDMDNEEKEYNLAHHSEKLAIAFALMNTPEGVPIRVMKNLRVCSDCHDAIKCISKIRNREIIVRDASRFHHFKDGQCSCG
        KELMSEMKLHGYVPDI SVLHDMDNEEKEYNLAHHSEK AIAFALMN PEGVPIRVMKNLRVC+DCH+AIKCISKIRNREIIVRD SRFHHFKDG+CSCG
Subjt:  KELMSEMKLHGYVPDISSVLHDMDNEEKEYNLAHHSEKLAIAFALMNTPEGVPIRVMKNLRVCSDCHDAIKCISKIRNREIIVRDASRFHHFKDGQCSCG

Query:  NYW
        NYW
Subjt:  NYW

A0A6J1HWJ8 pentatricopeptide repeat-containing protein At2g41080 (e value: 1.3e-264; identity: 89.86%)
Query:  LLLQIRTFEVNEQALSLFKEMYGLGFLPDEFTLGSVLRGCAGLRSLRAGQEVHACLMKCGFEMNSVVGSSLAHMYMKSGSLSDGVKLIKSMPIRNVVAWN
        ++  +  FE NEQALSLF+EMYGLGFLPDEFTLGSVLRGCAGLRSL AGQEVHACLMKCGFE+N VVGSSLAHMYMKSGSLSDG KLIKSMPIRNVVAWN
Subjt:  LLLQIRTFEVNEQALSLFKEMYGLGFLPDEFTLGSVLRGCAGLRSLRAGQEVHACLMKCGFEMNSVVGSSLAHMYMKSGSLSDGVKLIKSMPIRNVVAWN

Query:  TLIAGNAQNGCLEEVLHQYRMMKMAGFRPDKITFVSVISACSELATLGQGQQIHAEVIKAGTGSALAVISSLVSLYSRSGCLEDSVKVFSDREDIDVVLW
        TLIAG AQNGC EEVL+QY MMKMAGFRPDKITFVSVISACSELATLGQGQQIHAEVIKAG GS +AV+SSL+S+YSRSGCLEDSVK F DRED DVVLW
Subjt:  TLIAGNAQNGCLEEVLHQYRMMKMAGFRPDKITFVSVISACSELATLGQGQQIHAEVIKAGTGSALAVISSLVSLYSRSGCLEDSVKVFSDREDIDVVLW

Query:  SSMIAAYGFHGRGEEAIELFHQMEELKIEANEVAFLSLLYACSHCGLKEKGTEYLDLMVKKYKLKPRIEHYTCVVDLLGRAGCLVEAEAMVRSMPVKADG
        S+MIAAYGFHGRGEEAIELFHQMEELK+EANEV FLSLLYACSHCGLKEKGTEYLDLMV++YKLKPRIEHYTCVVDLLGRAG L EAE M+RSMPVKADG
Subjt:  SSMIAAYGFHGRGEEAIELFHQMEELKIEANEVAFLSLLYACSHCGLKEKGTEYLDLMVKKYKLKPRIEHYTCVVDLLGRAGCLVEAEAMVRSMPVKADG

Query:  IIWKTLLSACKLHKEAEMAKRISEEVLKLHPLDAASYVLLSNIHASARNWSDVSEIRKAMRDRNVRKEPGISWLELKNVVHQFSMADNSHPQHLEIDSYL
        IIWKTLLSACKLHK+AEMAKRISEE+LKL PLDAASYVLLSNIHASARNW DVSEIRKAMRDRNV+KEPGISWLELKN+VHQFSM D SHPQ+LEIDSYL
Subjt:  IIWKTLLSACKLHKEAEMAKRISEEVLKLHPLDAASYVLLSNIHASARNWSDVSEIRKAMRDRNVRKEPGISWLELKNVVHQFSMADNSHPQHLEIDSYL

Query:  KELMSEMKLHGYVPDISSVLHDMDNEEKEYNLAHHSEKLAIAFALMNTPEGVPIRVMKNLRVCSDCHDAIKCISKIRNREIIVRDASRFHHFKDGQCSCG
        KELMSEMKLHGYVPDI SVLHDMDNEEKEYNLAHHSEK AIAFALMN PEGVPIRVMKNLRVC+DCH+AIKCISKIRNREIIVRD SRFHHFKDG+CSCG
Subjt:  KELMSEMKLHGYVPDISSVLHDMDNEEKEYNLAHHSEKLAIAFALMNTPEGVPIRVMKNLRVCSDCHDAIKCISKIRNREIIVRDASRFHHFKDGQCSCG

Query:  NYW
        NYW
Subjt:  NYW

SwissProt top hits (e value, % identity, alignment)
A8MQA3 Pentatricopeptide repeat-containing protein At4g21065 (e value: 3.9e-120; identity: 44.11%)
Query:  ALSLFKEMYGLGFL-PDEFTLGSVLRGCAGLRSLRAGQEVHACLMKCGFEMNSVVGSSLAHMYMKSGSLSDGVKLIKSMPIRNVVAWNTLIAGNAQNGCL
        A SL++EM   G + PD  T   +++    +  +R G+ +H+ +++ GF     V +SL H+Y   G ++   K+   MP +++VAWN++I G A+NG  
Subjt:  ALSLFKEMYGLGFL-PDEFTLGSVLRGCAGLRSLRAGQEVHACLMKCGFEMNSVVGSSLAHMYMKSGSLSDGVKLIKSMPIRNVVAWNTLIAGNAQNGCL

Query:  EEVLHQYRMMKMAGFRPDKITFVSVISACSELATLGQGQQIHAEVIKAGTGSALAVISSLVSLYSRSGCLEDSVKVFSDREDIDVVLWSSMIAAYGFHGR
        EE L  Y  M   G +PD  T VS++SAC+++  L  G+++H  +IK G    L   + L+ LY+R G +E++  +F +  D + V W+S+I     +G 
Subjt:  EEVLHQYRMMKMAGFRPDKITFVSVISACSELATLGQGQQIHAEVIKAGTGSALAVISSLVSLYSRSGCLEDSVKVFSDREDIDVVLWSSMIAAYGFHGR

Query:  GEEAIELFHQMEELK-IEANEVAFLSLLYACSHCGLKEKGTEYLDLMVKKYKLKPRIEHYTCVVDLLGRAGCLVEAEAMVRSMPVKADGIIWKTLLSACK
        G+EAIELF  ME  + +   E+ F+ +LYACSHCG+ ++G EY   M ++YK++PRIEH+ C+VDLL RAG + +A   ++SMP++ + +IW+TLL AC 
Subjt:  GEEAIELFHQMEELK-IEANEVAFLSLLYACSHCGLKEKGTEYLDLMVKKYKLKPRIEHYTCVVDLLGRAGCLVEAEAMVRSMPVKADGIIWKTLLSACK

Query:  LHKEAEMAKRISEEVLKLHPLDAASYVLLSNIHASARNWSDVSEIRKAMRDRNVRKEPGISWLELKNVVHQFSMADNSHPQHLEIDSYLKELMSEMKLHG
        +H ++++A+    ++L+L P  +  YVLLSN++AS + WSDV +IRK M    V+K PG S +E+ N VH+F M D SHPQ   I + LKE+   ++  G
Subjt:  LHKEAEMAKRISEEVLKLHPLDAASYVLLSNIHASARNWSDVSEIRKAMRDRNVRKEPGISWLELKNVVHQFSMADNSHPQHLEIDSYLKELMSEMKLHG

Query:  YVPDISSVLHDMDNEEKEYNLAHHSEKLAIAFALMNTPEGVPIRVMKNLRVCSDCHDAIKCISKIRNREIIVRDASRFHHFKDGQCSCGNYW
        YVP IS+V  D++ EEKE  + +HSEK+AIAF L++TPE  PI V+KNLRVC+DCH AIK +SK+ NREI+VRD SRFHHFK+G CSC +YW
Subjt:  YVPDISSVLHDMDNEEKEYNLAHHSEKLAIAFALMNTPEGVPIRVMKNLRVCSDCHDAIKCISKIRNREIIVRDASRFHHFKDGQCSCGNYW

Q8S9M4 Pentatricopeptide repeat-containing protein At2g41080 (e value: 2.6e-196; identity: 66.2%)
Query:  FEVNEQALSLFKEMYGLGFLPDEFTLGSVLRGCAGLRSLRAGQEVHACLMKCGFEMNSVVGSSLAHMYMKSGSLSDGVKLIKSMPIRNVVAWNTLIAGNA
        FE NE+ LSLF+EM+GLGF PDE+TLGSV  G AGLRS+  GQ++H   +K G E++ VV SSLAHMYM++G L DG  +I+SMP+RN+VAWNTLI GNA
Subjt:  FEVNEQALSLFKEMYGLGFLPDEFTLGSVLRGCAGLRSLRAGQEVHACLMKCGFEMNSVVGSSLAHMYMKSGSLSDGVKLIKSMPIRNVVAWNTLIAGNA

Query:  QNGCLEEVLHQYRMMKMAGFRPDKITFVSVISACSELATLGQGQQIHAEVIKAGTGSALAVISSLVSLYSRSGCLEDSVKVFSDREDIDVVLWSSMIAAY
        QNGC E VL+ Y+MMK++G RP+KITFV+V+S+CS+LA  GQGQQIHAE IK G  S +AV+SSL+S+YS+ GCL D+ K FS+RED D V+WSSMI+AY
Subjt:  QNGCLEEVLHQYRMMKMAGFRPDKITFVSVISACSELATLGQGQQIHAEVIKAGTGSALAVISSLVSLYSRSGCLEDSVKVFSDREDIDVVLWSSMIAAY

Query:  GFHGRGEEAIELFHQM-EELKIEANEVAFLSLLYACSHCGLKEKGTEYLDLMVKKYKLKPRIEHYTCVVDLLGRAGCLVEAEAMVRSMPVKADGIIWKTL
        GFHG+G+EAIELF+ M E+  +E NEVAFL+LLYACSH GLK+KG E  D+MV+KY  KP ++HYTCVVDLLGRAGCL +AEA++RSMP+K D +IWKTL
Subjt:  GFHGRGEEAIELFHQM-EELKIEANEVAFLSLLYACSHCGLKEKGTEYLDLMVKKYKLKPRIEHYTCVVDLLGRAGCLVEAEAMVRSMPVKADGIIWKTL

Query:  LSACKLHKEAEMAKRISEEVLKLHPLDAASYVLLSNIHASARNWSDVSEIRKAMRDRNVRKEPGISWLELKNVVHQFSMADNSHPQHLEIDSYLKELMSE
        LSAC +HK AEMA+R+ +E+L++ P D+A YVLL+N+HASA+ W DVSE+RK+MRD+NV+KE GISW E K  VHQF M D S  +  EI SYLKEL  E
Subjt:  LSACKLHKEAEMAKRISEEVLKLHPLDAASYVLLSNIHASARNWSDVSEIRKAMRDRNVRKEPGISWLELKNVVHQFSMADNSHPQHLEIDSYLKELMSE

Query:  MKLHGYVPDISSVLHDMDNEEKEYNLAHHSEKLAIAFALMNTPEGVPIRVMKNLRVCSDCHDAIKCISKIRNREIIVRDASRFHHFKDGQCSCGNYW
        MKL GY PD +SVLHDMD EEKE +L  HSEKLA+AFALM  PEG PIR++KNLRVCSDCH A K IS I+NREI +RD SRFHHF +G+CSCG+YW
Subjt:  MKLHGYVPDISSVLHDMDNEEKEYNLAHHSEKLAIAFALMNTPEGVPIRVMKNLRVCSDCHDAIKCISKIRNREIIVRDASRFHHFKDGQCSCGNYW

Q9LIQ7 Pentatricopeptide repeat-containing protein At3g24000, mitochondrial (e value: 3.2e-122; identity: 44.08%)
Query:  ALSLFKEMYGLGFLPDEFTLGSVLRGCAGLRSLRAGQEVHACLMKCGFEMNSVVGSSLAHMYMKSGSLSDGVKLIKSMPIRNVVAWNTLIAGNAQNGCLE
        AL  F +M   G+ P+EFTL SV++  A  R    G ++H   +KCGF+ N  VGS+L  +Y + G + D   +  ++  RN V+WN LIAG+A+    E
Subjt:  ALSLFKEMYGLGFLPDEFTLGSVLRGCAGLRSLRAGQEVHACLMKCGFEMNSVVGSSLAHMYMKSGSLSDGVKLIKSMPIRNVVAWNTLIAGNAQNGCLE

Query:  EVLHQYRMMKMAGFRPDKITFVSVISACSELATLGQGQQIHAEVIKAGTGSALAVISSLVSLYSRSGCLEDSVKVFSDREDIDVVLWSSMIAAYGFHGRG
        + L  ++ M   GFRP   ++ S+  ACS    L QG+ +HA +IK+G        ++L+ +Y++SG + D+ K+F      DVV W+S++ AY  HG G
Subjt:  EVLHQYRMMKMAGFRPDKITFVSVISACSELATLGQGQQIHAEVIKAGTGSALAVISSLVSLYSRSGCLEDSVKVFSDREDIDVVLWSSMIAAYGFHGRG

Query:  EEAIELFHQMEELKIEANEVAFLSLLYACSHCGLKEKGTEYLDLMVKKYKLKPRIEHYTCVVDLLGRAGCLVEAEAMVRSMPVKADGIIWKTLLSACKLH
        +EA+  F +M  + I  NE++FLS+L ACSH GL ++G  Y +LM KK  + P   HY  VVDLLGRAG L  A   +  MP++    IWK LL+AC++H
Subjt:  EEAIELFHQMEELKIEANEVAFLSLLYACSHCGLKEKGTEYLDLMVKKYKLKPRIEHYTCVVDLLGRAGCLVEAEAMVRSMPVKADGIIWKTLLSACKLH

Query:  KEAEMAKRISEEVLKLHPLDAASYVLLSNIHASARNWSDVSEIRKAMRDRNVRKEPGISWLELKNVVHQFSMADNSHPQHLEIDSYLKELMSEMKLHGYV
        K  E+    +E V +L P D   +V+L NI+AS   W+D + +RK M++  V+KEP  SW+E++N +H F   D  HPQ  EI    +E+++++K  GYV
Subjt:  KEAEMAKRISEEVLKLHPLDAASYVLLSNIHASARNWSDVSEIRKAMRDRNVRKEPGISWLELKNVVHQFSMADNSHPQHLEIDSYLKELMSEMKLHGYV

Query:  PDISSVLHDMDNEEKEYNLAHHSEKLAIAFALMNTPEGVPIRVMKNLRVCSDCHDAIKCISKIRNREIIVRDASRFHHFKDGQCSCGNYW
        PD S V+  +D +E+E NL +HSEK+A+AFAL+NTP G  I + KN+RVC DCH AIK  SK+  REIIVRD +RFHHFKDG CSC +YW
Subjt:  PDISSVLHDMDNEEKEYNLAHHSEKLAIAFALMNTPEGVPIRVMKNLRVCSDCHDAIKCISKIRNREIIVRDASRFHHFKDGQCSCGNYW

Q9LW63 Putative pentatricopeptide repeat-containing protein At3g23330 (e value: 2.3e-120; identity: 42.89%)
Query:  EQALSLFKEMYGLGFLPDEFTLGSVLRGCAGLRSLRAGQEVHACLMKCGFEMNSVVGSSLAHMYMKSGSLSDGVKLIKSMPIRNVVAWNTLIAGNAQNGC
        E AL + +EM      PD FTL SVL   +    +  G+E+H  +++ G + +  +GSSL  MY KS  + D  ++   +  R+ ++WN+L+AG  QNG 
Subjt:  EQALSLFKEMYGLGFLPDEFTLGSVLRGCAGLRSLRAGQEVHACLMKCGFEMNSVVGSSLAHMYMKSGSLSDGVKLIKSMPIRNVVAWNTLIAGNAQNGC

Query:  LEEVLHQYRMMKMAGFRPDKITFVSVISACSELATLGQGQQIHAEVIKAGTGSALAVISSLVSLYSRSGCLEDSVKVFSDREDIDVVLWSSMIAAYGFHG
          E L  +R M  A  +P  + F SVI AC+ LATL  G+Q+H  V++ G GS + + S+LV +YS+ G ++ + K+F     +D V W+++I  +  HG
Subjt:  LEEVLHQYRMMKMAGFRPDKITFVSVISACSELATLGQGQQIHAEVIKAGTGSALAVISSLVSLYSRSGCLEDSVKVFSDREDIDVVLWSSMIAAYGFHG

Query:  RGEEAIELFHQMEELKIEANEVAFLSLLYACSHCGLKEKGTEYLDLMVKKYKLKPRIEHYTCVVDLLGRAGCLVEAEAMVRSMPVKADGIIWKTLLSACK
         G EA+ LF +M+   ++ N+VAF+++L ACSH GL ++   Y + M K Y L   +EHY  V DLLGRAG L EA   +  M V+  G +W TLLS+C 
Subjt:  RGEEAIELFHQMEELKIEANEVAFLSLLYACSHCGLKEKGTEYLDLMVKKYKLKPRIEHYTCVVDLLGRAGCLVEAEAMVRSMPVKADGIIWKTLLSACK

Query:  LHKEAEMAKRISEEVLKLHPLDAASYVLLSNIHASARNWSDVSEIRKAMRDRNVRKEPGISWLELKNVVHQFSMADNSHPQHLEIDSYLKELMSEMKLHG
        +HK  E+A++++E++  +   +  +YVL+ N++AS   W +++++R  MR + +RK+P  SW+E+KN  H F   D SHP   +I+ +LK +M +M+  G
Subjt:  LHKEAEMAKRISEEVLKLHPLDAASYVLLSNIHASARNWSDVSEIRKAMRDRNVRKEPGISWLELKNVVHQFSMADNSHPQHLEIDSYLKELMSEMKLHG

Query:  YVPDISSVLHDMDNEEKEYNLAHHSEKLAIAFALMNTPEGVPIRVMKNLRVCSDCHDAIKCISKIRNREIIVRDASRFHHFKDGQCSCGNYW
        YV D S VLHD+D E K   L  HSE+LA+AF ++NT  G  IRV KN+R+C+DCH AIK ISKI  REIIVRD SRFHHF  G CSCG+YW
Subjt:  YVPDISSVLHDMDNEEKEYNLAHHSEKLAIAFALMNTPEGVPIRVMKNLRVCSDCHDAIKCISKIRNREIIVRDASRFHHFKDGQCSCGNYW

Q9ZUW3 Pentatricopeptide repeat-containing protein At2g27610 (e value: 9.7e-119; identity: 41.5%)
Query:  EQALSLFKEMYGLGFLPDEFTLGSVLRGCAGLRSLRAGQEVHACLMKCGFEMNSVVGSSLAHMYMKSGSLSDGVKLIKSMPIRNVVAWNTLIAGNAQNGC
        E+A+ LF EM   G  P+EFT   +L     +    +  EVHA ++K  +E +S VG++L   Y+K G + +  K+   +  +++VAW+ ++AG AQ G 
Subjt:  EQALSLFKEMYGLGFLPDEFTLGSVLRGCAGLRSLRAGQEVHACLMKCGFEMNSVVGSSLAHMYMKSGSLSDGVKLIKSMPIRNVVAWNTLIAGNAQNGC

Query:  LEEVLHQYRMMKMAGFRPDKITFVSVISACSEL-ATLGQGQQIHAEVIKAGTGSALAVISSLVSLYSRSGCLEDSVKVFSDREDIDVVLWSSMIAAYGFH
         E  +  +  +   G +P++ TF S+++ C+   A++GQG+Q H   IK+   S+L V S+L+++Y++ G +E + +VF  + + D+V W+SMI+ Y  H
Subjt:  LEEVLHQYRMMKMAGFRPDKITFVSVISACSEL-ATLGQGQQIHAEVIKAGTGSALAVISSLVSLYSRSGCLEDSVKVFSDREDIDVVLWSSMIAAYGFH

Query:  GRGEEAIELFHQMEELKIEANEVAFLSLLYACSHCGLKEKGTEYLDLMVKKYKLKPRIEHYTCVVDLLGRAGCLVEAEAMVRSMPVKADGIIWKTLLSAC
        G+  +A+++F +M++ K++ + V F+ +  AC+H GL E+G +Y D+MV+  K+ P  EH +C+VDL  RAG L +A  ++ +MP  A   IW+T+L+AC
Subjt:  GRGEEAIELFHQMEELKIEANEVAFLSLLYACSHCGLKEKGTEYLDLMVKKYKLKPRIEHYTCVVDLLGRAGCLVEAEAMVRSMPVKADGIIWKTLLSAC

Query:  KLHKEAEMAKRISEEVLKLHPLDAASYVLLSNIHASARNWSDVSEIRKAMRDRNVRKEPGISWLELKNVVHQFSMADNSHPQHLEIDSYLKELMSEMKLH
        ++HK+ E+ +  +E+++ + P D+A+YVLLSN++A + +W + +++RK M +RNV+KEPG SW+E+KN  + F   D SHP   +I   L++L + +K  
Subjt:  KLHKEAEMAKRISEEVLKLHPLDAASYVLLSNIHASARNWSDVSEIRKAMRDRNVRKEPGISWLELKNVVHQFSMADNSHPQHLEIDSYLKELMSEMKLH

Query:  GYVPDISSVLHDMDNEEKEYNLAHHSEKLAIAFALMNTPEGVPIRVMKNLRVCSDCHDAIKCISKIRNREIIVRDASRFHHF-KDGQCSCGNYW
        GY PD S VL D+D+E KE  LA HSE+LAIAF L+ TP+G P+ ++KNLRVC DCH  IK I+KI  REI+VRD++RFHHF  DG CSCG++W
Subjt:  GYVPDISSVLHDMDNEEKEYNLAHHSEKLAIAFALMNTPEGVPIRVMKNLRVCSDCHDAIKCISKIRNREIIVRDASRFHHF-KDGQCSCGNYW

Arabidopsis top hits (e value, % identity, alignment)
AT2G27610.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 6.9e-120; identity: 41.5%)
Query:  EQALSLFKEMYGLGFLPDEFTLGSVLRGCAGLRSLRAGQEVHACLMKCGFEMNSVVGSSLAHMYMKSGSLSDGVKLIKSMPIRNVVAWNTLIAGNAQNGC
        E+A+ LF EM   G  P+EFT   +L     +    +  EVHA ++K  +E +S VG++L   Y+K G + +  K+   +  +++VAW+ ++AG AQ G 
Subjt:  EQALSLFKEMYGLGFLPDEFTLGSVLRGCAGLRSLRAGQEVHACLMKCGFEMNSVVGSSLAHMYMKSGSLSDGVKLIKSMPIRNVVAWNTLIAGNAQNGC

Query:  LEEVLHQYRMMKMAGFRPDKITFVSVISACSEL-ATLGQGQQIHAEVIKAGTGSALAVISSLVSLYSRSGCLEDSVKVFSDREDIDVVLWSSMIAAYGFH
         E  +  +  +   G +P++ TF S+++ C+   A++GQG+Q H   IK+   S+L V S+L+++Y++ G +E + +VF  + + D+V W+SMI+ Y  H
Subjt:  LEEVLHQYRMMKMAGFRPDKITFVSVISACSEL-ATLGQGQQIHAEVIKAGTGSALAVISSLVSLYSRSGCLEDSVKVFSDREDIDVVLWSSMIAAYGFH

Query:  GRGEEAIELFHQMEELKIEANEVAFLSLLYACSHCGLKEKGTEYLDLMVKKYKLKPRIEHYTCVVDLLGRAGCLVEAEAMVRSMPVKADGIIWKTLLSAC
        G+  +A+++F +M++ K++ + V F+ +  AC+H GL E+G +Y D+MV+  K+ P  EH +C+VDL  RAG L +A  ++ +MP  A   IW+T+L+AC
Subjt:  GRGEEAIELFHQMEELKIEANEVAFLSLLYACSHCGLKEKGTEYLDLMVKKYKLKPRIEHYTCVVDLLGRAGCLVEAEAMVRSMPVKADGIIWKTLLSAC

Query:  KLHKEAEMAKRISEEVLKLHPLDAASYVLLSNIHASARNWSDVSEIRKAMRDRNVRKEPGISWLELKNVVHQFSMADNSHPQHLEIDSYLKELMSEMKLH
        ++HK+ E+ +  +E+++ + P D+A+YVLLSN++A + +W + +++RK M +RNV+KEPG SW+E+KN  + F   D SHP   +I   L++L + +K  
Subjt:  KLHKEAEMAKRISEEVLKLHPLDAASYVLLSNIHASARNWSDVSEIRKAMRDRNVRKEPGISWLELKNVVHQFSMADNSHPQHLEIDSYLKELMSEMKLH

Query:  GYVPDISSVLHDMDNEEKEYNLAHHSEKLAIAFALMNTPEGVPIRVMKNLRVCSDCHDAIKCISKIRNREIIVRDASRFHHF-KDGQCSCGNYW
        GY PD S VL D+D+E KE  LA HSE+LAIAF L+ TP+G P+ ++KNLRVC DCH  IK I+KI  REI+VRD++RFHHF  DG CSCG++W
Subjt:  GYVPDISSVLHDMDNEEKEYNLAHHSEKLAIAFALMNTPEGVPIRVMKNLRVCSDCHDAIKCISKIRNREIIVRDASRFHHF-KDGQCSCGNYW

AT2G41080.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 1.8e-197; identity: 66.2%)
Query:  FEVNEQALSLFKEMYGLGFLPDEFTLGSVLRGCAGLRSLRAGQEVHACLMKCGFEMNSVVGSSLAHMYMKSGSLSDGVKLIKSMPIRNVVAWNTLIAGNA
        FE NE+ LSLF+EM+GLGF PDE+TLGSV  G AGLRS+  GQ++H   +K G E++ VV SSLAHMYM++G L DG  +I+SMP+RN+VAWNTLI GNA
Subjt:  FEVNEQALSLFKEMYGLGFLPDEFTLGSVLRGCAGLRSLRAGQEVHACLMKCGFEMNSVVGSSLAHMYMKSGSLSDGVKLIKSMPIRNVVAWNTLIAGNA

Query:  QNGCLEEVLHQYRMMKMAGFRPDKITFVSVISACSELATLGQGQQIHAEVIKAGTGSALAVISSLVSLYSRSGCLEDSVKVFSDREDIDVVLWSSMIAAY
        QNGC E VL+ Y+MMK++G RP+KITFV+V+S+CS+LA  GQGQQIHAE IK G  S +AV+SSL+S+YS+ GCL D+ K FS+RED D V+WSSMI+AY
Subjt:  QNGCLEEVLHQYRMMKMAGFRPDKITFVSVISACSELATLGQGQQIHAEVIKAGTGSALAVISSLVSLYSRSGCLEDSVKVFSDREDIDVVLWSSMIAAY

Query:  GFHGRGEEAIELFHQM-EELKIEANEVAFLSLLYACSHCGLKEKGTEYLDLMVKKYKLKPRIEHYTCVVDLLGRAGCLVEAEAMVRSMPVKADGIIWKTL
        GFHG+G+EAIELF+ M E+  +E NEVAFL+LLYACSH GLK+KG E  D+MV+KY  KP ++HYTCVVDLLGRAGCL +AEA++RSMP+K D +IWKTL
Subjt:  GFHGRGEEAIELFHQM-EELKIEANEVAFLSLLYACSHCGLKEKGTEYLDLMVKKYKLKPRIEHYTCVVDLLGRAGCLVEAEAMVRSMPVKADGIIWKTL

Query:  LSACKLHKEAEMAKRISEEVLKLHPLDAASYVLLSNIHASARNWSDVSEIRKAMRDRNVRKEPGISWLELKNVVHQFSMADNSHPQHLEIDSYLKELMSE
        LSAC +HK AEMA+R+ +E+L++ P D+A YVLL+N+HASA+ W DVSE+RK+MRD+NV+KE GISW E K  VHQF M D S  +  EI SYLKEL  E
Subjt:  LSACKLHKEAEMAKRISEEVLKLHPLDAASYVLLSNIHASARNWSDVSEIRKAMRDRNVRKEPGISWLELKNVVHQFSMADNSHPQHLEIDSYLKELMSE

Query:  MKLHGYVPDISSVLHDMDNEEKEYNLAHHSEKLAIAFALMNTPEGVPIRVMKNLRVCSDCHDAIKCISKIRNREIIVRDASRFHHFKDGQCSCGNYW
        MKL GY PD +SVLHDMD EEKE +L  HSEKLA+AFALM  PEG PIR++KNLRVCSDCH A K IS I+NREI +RD SRFHHF +G+CSCG+YW
Subjt:  MKLHGYVPDISSVLHDMDNEEKEYNLAHHSEKLAIAFALMNTPEGVPIRVMKNLRVCSDCHDAIKCISKIRNREIIVRDASRFHHFKDGQCSCGNYW

AT3G23330.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 1.6e-121; identity: 42.89%)
Query:  EQALSLFKEMYGLGFLPDEFTLGSVLRGCAGLRSLRAGQEVHACLMKCGFEMNSVVGSSLAHMYMKSGSLSDGVKLIKSMPIRNVVAWNTLIAGNAQNGC
        E AL + +EM      PD FTL SVL   +    +  G+E+H  +++ G + +  +GSSL  MY KS  + D  ++   +  R+ ++WN+L+AG  QNG 
Subjt:  EQALSLFKEMYGLGFLPDEFTLGSVLRGCAGLRSLRAGQEVHACLMKCGFEMNSVVGSSLAHMYMKSGSLSDGVKLIKSMPIRNVVAWNTLIAGNAQNGC

Query:  LEEVLHQYRMMKMAGFRPDKITFVSVISACSELATLGQGQQIHAEVIKAGTGSALAVISSLVSLYSRSGCLEDSVKVFSDREDIDVVLWSSMIAAYGFHG
          E L  +R M  A  +P  + F SVI AC+ LATL  G+Q+H  V++ G GS + + S+LV +YS+ G ++ + K+F     +D V W+++I  +  HG
Subjt:  LEEVLHQYRMMKMAGFRPDKITFVSVISACSELATLGQGQQIHAEVIKAGTGSALAVISSLVSLYSRSGCLEDSVKVFSDREDIDVVLWSSMIAAYGFHG

Query:  RGEEAIELFHQMEELKIEANEVAFLSLLYACSHCGLKEKGTEYLDLMVKKYKLKPRIEHYTCVVDLLGRAGCLVEAEAMVRSMPVKADGIIWKTLLSACK
         G EA+ LF +M+   ++ N+VAF+++L ACSH GL ++   Y + M K Y L   +EHY  V DLLGRAG L EA   +  M V+  G +W TLLS+C 
Subjt:  RGEEAIELFHQMEELKIEANEVAFLSLLYACSHCGLKEKGTEYLDLMVKKYKLKPRIEHYTCVVDLLGRAGCLVEAEAMVRSMPVKADGIIWKTLLSACK

Query:  LHKEAEMAKRISEEVLKLHPLDAASYVLLSNIHASARNWSDVSEIRKAMRDRNVRKEPGISWLELKNVVHQFSMADNSHPQHLEIDSYLKELMSEMKLHG
        +HK  E+A++++E++  +   +  +YVL+ N++AS   W +++++R  MR + +RK+P  SW+E+KN  H F   D SHP   +I+ +LK +M +M+  G
Subjt:  LHKEAEMAKRISEEVLKLHPLDAASYVLLSNIHASARNWSDVSEIRKAMRDRNVRKEPGISWLELKNVVHQFSMADNSHPQHLEIDSYLKELMSEMKLHG

Query:  YVPDISSVLHDMDNEEKEYNLAHHSEKLAIAFALMNTPEGVPIRVMKNLRVCSDCHDAIKCISKIRNREIIVRDASRFHHFKDGQCSCGNYW
        YV D S VLHD+D E K   L  HSE+LA+AF ++NT  G  IRV KN+R+C+DCH AIK ISKI  REIIVRD SRFHHF  G CSCG+YW
Subjt:  YVPDISSVLHDMDNEEKEYNLAHHSEKLAIAFALMNTPEGVPIRVMKNLRVCSDCHDAIKCISKIRNREIIVRDASRFHHFKDGQCSCGNYW

AT3G49710.1 Pentatricopeptide repeat (PPR) superfamily protein; e value 4.5e-119; %identity 44.29
Query:  QALSLFKEMYGLGFLPDEFTLGSVLRGCAGLRSLRAGQEVHACLMKCGFEMNSVVGSSLAHMYMKSG---SLSDGVKLIKSMPIRNVVAWNTLIAGNAQN
        +AL+L+KEM   GF  D FTL SVL     L  L  G++ H  L+K GF  NS VGS L   Y K G    + D  K+ + +   ++V WNT+I+G + N
Subjt:  QALSLFKEMYGLGFLPDEFTLGSVLRGCAGLRSLRAGQEVHACLMKCGFEMNSVVGSSLAHMYMKSG---SLSDGVKLIKSMPIRNVVAWNTLIAGNAQN

Query:  GCL-EEVLHQYRMMKMAGFRPDKITFVSVISACSELATLGQGQQIHAEVIKAGTGS-ALAVISSLVSLYSRSGCLEDSVKVFSDREDIDVVLWSSMIAAY
          L EE +  +R M+  G RPD  +FV V SACS L++  Q +QIH   IK+   S  ++V ++L+SLY +SG L+D+  VF    +++ V ++ MI  Y
Subjt:  GCL-EEVLHQYRMMKMAGFRPDKITFVSVISACSELATLGQGQQIHAEVIKAGTGS-ALAVISSLVSLYSRSGCLEDSVKVFSDREDIDVVLWSSMIAAY

Query:  GFHGRGEEAIELFHQMEELKIEANEVAFLSLLYACSHCGLKEKGTEYLDLMVKKYKLKPRIEHYTCVVDLLGRAGCLVEAEAMVRSMPVKADGIIWKTLL
          HG G EA+ L+ +M +  I  N++ F+++L AC+HCG  ++G EY + M + +K++P  EHY+C++DLLGRAG L EAE  + +MP K   + W  LL
Subjt:  GFHGRGEEAIELFHQMEELKIEANEVAFLSLLYACSHCGLKEKGTEYLDLMVKKYKLKPRIEHYTCVVDLLGRAGCLVEAEAMVRSMPVKADGIIWKTLL

Query:  SACKLHKEAEMAKRISEEVLKLHPLDAASYVLLSNIHASARNWSDVSEIRKAMRDRNVRKEPGISWLELKNVVHQFSMADNSHPQHLEIDSYLKELMSEM
         AC+ HK   +A+R + E++ + PL A  YV+L+N++A AR W +++ +RK+MR + +RK+PG SW+E+K   H F   D SHP   E++ YL+E+M +M
Subjt:  SACKLHKEAEMAKRISEEVLKLHPLDAASYVLLSNIHASARNWSDVSEIRKAMRDRNVRKEPGISWLELKNVVHQFSMADNSHPQHLEIDSYLKELMSEM

Query:  KLHGYVPD--ISSVLHDMDNE-EKEYNLAHHSEKLAIAFALMNTPEGVPIRVMKNLRVCSDCHDAIKCISKIRNREIIVRDASRFHHFKDGQCSCGNYW
        K  GYV D   + V  D   E ++E  L HHSEKLA+AF LM+T +G  + V+KNLR+C DCH+AIK +S +  REIIVRD  RFH FKDG+CSCG+YW
Subjt:  KLHGYVPD--ISSVLHDMDNE-EKEYNLAHHSEKLAIAFALMNTPEGVPIRVMKNLRVCSDCHDAIKCISKIRNREIIVRDASRFHHFKDGQCSCGNYW

AT4G21065.1 Tetratricopeptide repeat (TPR)-like superfamily protein; e value 2.8e-121; %identity 44.11
Query:  ALSLFKEMYGLGFL-PDEFTLGSVLRGCAGLRSLRAGQEVHACLMKCGFEMNSVVGSSLAHMYMKSGSLSDGVKLIKSMPIRNVVAWNTLIAGNAQNGCL
        A SL++EM   G + PD  T   +++    +  +R G+ +H+ +++ GF     V +SL H+Y   G ++   K+   MP +++VAWN++I G A+NG  
Subjt:  ALSLFKEMYGLGFL-PDEFTLGSVLRGCAGLRSLRAGQEVHACLMKCGFEMNSVVGSSLAHMYMKSGSLSDGVKLIKSMPIRNVVAWNTLIAGNAQNGCL

Query:  EEVLHQYRMMKMAGFRPDKITFVSVISACSELATLGQGQQIHAEVIKAGTGSALAVISSLVSLYSRSGCLEDSVKVFSDREDIDVVLWSSMIAAYGFHGR
        EE L  Y  M   G +PD  T VS++SAC+++  L  G+++H  +IK G    L   + L+ LY+R G +E++  +F +  D + V W+S+I     +G 
Subjt:  EEVLHQYRMMKMAGFRPDKITFVSVISACSELATLGQGQQIHAEVIKAGTGSALAVISSLVSLYSRSGCLEDSVKVFSDREDIDVVLWSSMIAAYGFHGR

Query:  GEEAIELFHQMEELK-IEANEVAFLSLLYACSHCGLKEKGTEYLDLMVKKYKLKPRIEHYTCVVDLLGRAGCLVEAEAMVRSMPVKADGIIWKTLLSACK
        G+EAIELF  ME  + +   E+ F+ +LYACSHCG+ ++G EY   M ++YK++PRIEH+ C+VDLL RAG + +A   ++SMP++ + +IW+TLL AC 
Subjt:  GEEAIELFHQMEELK-IEANEVAFLSLLYACSHCGLKEKGTEYLDLMVKKYKLKPRIEHYTCVVDLLGRAGCLVEAEAMVRSMPVKADGIIWKTLLSACK

Query:  LHKEAEMAKRISEEVLKLHPLDAASYVLLSNIHASARNWSDVSEIRKAMRDRNVRKEPGISWLELKNVVHQFSMADNSHPQHLEIDSYLKELMSEMKLHG
        +H ++++A+    ++L+L P  +  YVLLSN++AS + WSDV +IRK M    V+K PG S +E+ N VH+F M D SHPQ   I + LKE+   ++  G
Subjt:  LHKEAEMAKRISEEVLKLHPLDAASYVLLSNIHASARNWSDVSEIRKAMRDRNVRKEPGISWLELKNVVHQFSMADNSHPQHLEIDSYLKELMSEMKLHG

Query:  YVPDISSVLHDMDNEEKEYNLAHHSEKLAIAFALMNTPEGVPIRVMKNLRVCSDCHDAIKCISKIRNREIIVRDASRFHHFKDGQCSCGNYW
        YVP IS+V  D++ EEKE  + +HSEK+AIAF L++TPE  PI V+KNLRVC+DCH AIK +SK+ NREI+VRD SRFHHFK+G CSC +YW
Subjt:  YVPDISSVLHDMDNEEKEYNLAHHSEKLAIAFALMNTPEGVPIRVMKNLRVCSDCHDAIKCISKIRNREIIVRDASRFHHFKDGQCSCGNYW


Sequences
CDS sequence
ATGGTCAGACCCATCTCTCTTTTCTCATTTGCTTCAATCATGTATAGCTCTAGACTCGCTTTTTGGAGGAAAACAAGTTCATTCTTTATTCATTTCCAATCATCTTTTGA
ACTTTTACTCCAAATTCGGACATTTGAAGTTAATGAGCAGGCTTTGAGTTTGTTCAAAGAAATGTATGGATTGGGTTTTTTGCCTGACGAGTTCACATTGGGCAGTGTGT
TGAGAGGTTGTGCTGGTTTGAGGTCTTTACGTGCAGGTCAAGAGGTTCATGCTTGTCTGATGAAATGTGGGTTCGAAATGAATTCGGTGGTGGGAAGTTCTTTAGCTCAT
ATGTATATGAAGTCTGGTAGTTTATCTGATGGAGTGAAGTTGATTAAATCAATGCCGATTCGTAATGTAGTTGCTTGGAATACTCTTATTGCTGGAAACGCTCAAAATGG
GTGTTTAGAAGAAGTGTTGCATCAGTACCGTATGATGAAAATGGCAGGCTTTCGACCGGATAAAATAACATTCGTGAGTGTAATAAGTGCGTGTTCGGAACTGGCAACAT
TGGGACAAGGTCAGCAGATTCATGCTGAAGTGATCAAAGCTGGAACTGGCTCAGCTCTAGCAGTCATCAGCTCATTGGTTAGCTTGTACTCACGATCTGGGTGTCTAGAG
GACTCTGTGAAAGTCTTTTCGGATCGTGAAGATATCGATGTTGTGTTATGGAGTTCTATGATTGCAGCTTATGGATTCCATGGGAGAGGAGAGGAAGCTATTGAGCTGTT
CCATCAAATGGAAGAGTTGAAAATAGAGGCAAATGAAGTGGCATTCTTGAGTTTGCTTTATGCTTGTAGCCACTGTGGATTAAAGGAGAAAGGAACTGAGTATTTAGATT
TGATGGTGAAGAAGTATAAACTCAAACCTAGAATTGAACACTATACGTGTGTTGTCGATCTGCTCGGTCGGGCTGGCTGCTTGGTGGAAGCAGAGGCTATGGTAAGATCC
ATGCCAGTAAAAGCAGATGGCATCATATGGAAAACTTTATTATCAGCCTGCAAACTCCACAAGGAAGCCGAAATGGCCAAACGAATTTCTGAAGAAGTTCTAAAGCTTCA
TCCACTGGATGCTGCTTCTTATGTGCTGCTTTCAAACATCCATGCCTCTGCTAGAAACTGGTCTGATGTTTCCGAGATTCGGAAAGCCATGAGGGATAGGAACGTGAGGA
AGGAGCCTGGCATAAGTTGGTTAGAACTCAAAAATGTAGTTCACCAATTTAGCATGGCTGACAATTCTCACCCACAACATCTGGAGATTGATTCATATTTGAAAGAACTA
ATGTCTGAAATGAAGCTACATGGTTATGTGCCCGACATAAGCTCGGTTTTGCATGATATGGACAATGAAGAAAAAGAATACAATTTGGCACATCACAGTGAGAAGTTAGC
TATTGCTTTTGCTCTGATGAACACTCCCGAGGGTGTCCCGATTCGGGTGATGAAGAACTTACGGGTCTGTAGTGATTGTCACGATGCTATTAAGTGCATATCAAAGATCA
GAAACAGAGAGATTATTGTTAGAGATGCAAGTAGATTTCACCATTTCAAGGATGGTCAATGTTCTTGTGGTAATTATTGGTAG
mRNA sequence
ATGGTCAGACCCATCTCTCTTTTCTCATTTGCTTCAATCATGTATAGCTCTAGACTCGCTTTTTGGAGGAAAACAAGTTCATTCTTTATTCATTTCCAATCATCTTTTGA
ACTTTTACTCCAAATTCGGACATTTGAAGTTAATGAGCAGGCTTTGAGTTTGTTCAAAGAAATGTATGGATTGGGTTTTTTGCCTGACGAGTTCACATTGGGCAGTGTGT
TGAGAGGTTGTGCTGGTTTGAGGTCTTTACGTGCAGGTCAAGAGGTTCATGCTTGTCTGATGAAATGTGGGTTCGAAATGAATTCGGTGGTGGGAAGTTCTTTAGCTCAT
ATGTATATGAAGTCTGGTAGTTTATCTGATGGAGTGAAGTTGATTAAATCAATGCCGATTCGTAATGTAGTTGCTTGGAATACTCTTATTGCTGGAAACGCTCAAAATGG
GTGTTTAGAAGAAGTGTTGCATCAGTACCGTATGATGAAAATGGCAGGCTTTCGACCGGATAAAATAACATTCGTGAGTGTAATAAGTGCGTGTTCGGAACTGGCAACAT
TGGGACAAGGTCAGCAGATTCATGCTGAAGTGATCAAAGCTGGAACTGGCTCAGCTCTAGCAGTCATCAGCTCATTGGTTAGCTTGTACTCACGATCTGGGTGTCTAGAG
GACTCTGTGAAAGTCTTTTCGGATCGTGAAGATATCGATGTTGTGTTATGGAGTTCTATGATTGCAGCTTATGGATTCCATGGGAGAGGAGAGGAAGCTATTGAGCTGTT
CCATCAAATGGAAGAGTTGAAAATAGAGGCAAATGAAGTGGCATTCTTGAGTTTGCTTTATGCTTGTAGCCACTGTGGATTAAAGGAGAAAGGAACTGAGTATTTAGATT
TGATGGTGAAGAAGTATAAACTCAAACCTAGAATTGAACACTATACGTGTGTTGTCGATCTGCTCGGTCGGGCTGGCTGCTTGGTGGAAGCAGAGGCTATGGTAAGATCC
ATGCCAGTAAAAGCAGATGGCATCATATGGAAAACTTTATTATCAGCCTGCAAACTCCACAAGGAAGCCGAAATGGCCAAACGAATTTCTGAAGAAGTTCTAAAGCTTCA
TCCACTGGATGCTGCTTCTTATGTGCTGCTTTCAAACATCCATGCCTCTGCTAGAAACTGGTCTGATGTTTCCGAGATTCGGAAAGCCATGAGGGATAGGAACGTGAGGA
AGGAGCCTGGCATAAGTTGGTTAGAACTCAAAAATGTAGTTCACCAATTTAGCATGGCTGACAATTCTCACCCACAACATCTGGAGATTGATTCATATTTGAAAGAACTA
ATGTCTGAAATGAAGCTACATGGTTATGTGCCCGACATAAGCTCGGTTTTGCATGATATGGACAATGAAGAAAAAGAATACAATTTGGCACATCACAGTGAGAAGTTAGC
TATTGCTTTTGCTCTGATGAACACTCCCGAGGGTGTCCCGATTCGGGTGATGAAGAACTTACGGGTCTGTAGTGATTGTCACGATGCTATTAAGTGCATATCAAAGATCA
GAAACAGAGAGATTATTGTTAGAGATGCAAGTAGATTTCACCATTTCAAGGATGGTCAATGTTCTTGTGGTAATTATTGGTAGTGGAAGATTAACCCAGTTGAATTGCGG
CCAACAAGAACAACATAAGTGTGGGTCCTTGATCAAAAGGTCATAGATTCGAATCTCCCCATTGTAATTGAACGAGTTGAGTGCTATTATGACAGCCTAGTTTGGAAACC
CCAAAGTACCCTAAAATGAAGGGCTATTCTTTAGATTTGAGACTTTTGCTGGCAACCTGACCACCGATCATGGCAGGAGCGTGACCTCCCAATCGGTTTCGTAGAATTTG
AATTCACGACTTCTAAGTCCTGAGCACATACATCATTTCGGTTAAGCTATGTTTTTGTTGGCCGAGCTGAATTGAATCTGCAAAAAGATGTGGAAAGGCCAAGGGTGGCG
CCTATTAGTGCACTTGTGGCTATTTATGCCCAACTTCTGCGCAGCACAACTTTGGAAAACG
Protein sequence
MVRPISLFSFASIMYSSRLAFWRKTSSFFIHFQSSFELLLQIRTFEVNEQALSLFKEMYGLGFLPDEFTLGSVLRGCAGLRSLRAGQEVHACLMKCGFEMNSVVGSSLAH
MYMKSGSLSDGVKLIKSMPIRNVVAWNTLIAGNAQNGCLEEVLHQYRMMKMAGFRPDKITFVSVISACSELATLGQGQQIHAEVIKAGTGSALAVISSLVSLYSRSGCLE
DSVKVFSDREDIDVVLWSSMIAAYGFHGRGEEAIELFHQMEELKIEANEVAFLSLLYACSHCGLKEKGTEYLDLMVKKYKLKPRIEHYTCVVDLLGRAGCLVEAEAMVRS
MPVKADGIIWKTLLSACKLHKEAEMAKRISEEVLKLHPLDAASYVLLSNIHASARNWSDVSEIRKAMRDRNVRKEPGISWLELKNVVHQFSMADNSHPQHLEIDSYLKEL
MSEMKLHGYVPDISSVLHDMDNEEKEYNLAHHSEKLAIAFALMNTPEGVPIRVMKNLRVCSDCHDAIKCISKIRNREIIVRDASRFHHFKDGQCSCGNYW
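
The CDS and protein sequences listed above are related by translation. The following is a minimal sketch, not part of the original record, that translates short fragments copied from the start and end of the CDS and compares them with the corresponding ends of the protein sequence. It assumes the standard nuclear genetic code; the function name `translate` and the variable names are illustrative only.

```python
from itertools import product

# Standard genetic code (assumed here), written in the compact
# T/C/A/G ordering: the first base varies slowest, the third fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Whitespace is ignored and trailing bases that do not fill a codon are
    dropped. Ambiguous bases (e.g. N) are not handled and raise KeyError.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 33 and last 18 nucleotides of the CDS sequence listed above.
cds_start = "ATGGTCAGACCCATCTCTCTTTTCTCATTTGCT"
cds_end = "TGTGGTAATTATTGGTAG"

print(translate(cds_start))  # MVRPISLFSFA
print(translate(cds_end))    # CGNYW
```

Both outputs match the first eleven and last five residues of the protein sequence above. The same function could be applied to the full CDS after joining its lines into one string.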