; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS005988 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS005988
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationscaffold254:2986887..2988344
RNA-Seq ExpressionMS005988
SyntenyMS005988
Gene Ontology termsGO:0016125 - sterol metabolic process (biological process)
GO:0019287 - isopentenyl diphosphate biosynthetic process, mevalonate pathway (biological process)
GO:0019288 - isopentenyl diphosphate biosynthetic process, methylerythritol 4-phosphate pathway (biological process)
GO:0048364 - root development (biological process)
GO:0050790 - regulation of catalytic activity (biological process)
GO:0005739 - mitochondrion (cellular component)
GO:0003729 - mRNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0034046 - poly(G) binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7018711.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma]6.8e-25488.45Show/hide
Query:  FLYNHLVNMYAKLDHPDSAELVLGLAPCRSVVTWTALIAGSVQNGHFASALLYFSHMLSDCVRPNDFTFPCALKASTSLRMAMSGKQIHALAVKEGLIND
        FLYNHLVNMYAKLD  +SAEL+L LAPCRSVVTWT+LIAGSVQNG F SAL +FS MLSDCVRPNDFTFPC  KAST LRMAM+GKQ+HALAVKEGLIND
Subjt:  FLYNHLVNMYAKLDHPDSAELVLGLAPCRSVVTWTALIAGSVQNGHFASALLYFSHMLSDCVRPNDFTFPCALKASTSLRMAMSGKQIHALAVKEGLIND

Query:  VFVGCSTFDMYGKLGLLEDASKVFVEMPHRNLETWNAYISNSVLHGRPEDSVIAFLELLRAGGSPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGFEQ
        VFVGCS FDMY KLGLL+DA K+FVEMPHRNLETWNAYISNSVLHGRPEDS IAF+ELLRAGG PDSITFCAFLNACSDKLGLEPGCQLHGFIIRSG  Q
Subjt:  VFVGCSTFDMYGKLGLLEDASKVFVEMPHRNLETWNAYISNSVLHGRPEDSVIAFLELLRAGGSPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGFEQ

Query:  NVSVSNGLIDFYGKCGEVECSGMVFDRMGERNAVSWSSLIAAYIQNNEEEKACCLFLQARKEDIKPIDFMVSSVLCACAGLSGIELGRSVQALAVKACVE
        NVS+SNGLIDFYGKCGEV CS ++FDRMGERN+VSWSSLIAAY+QNNEEEKACCLFL+ARKEDIKP DFMVSSVLCA AGLS IELGRSVQALAVKACVE
Subjt:  NVSVSNGLIDFYGKCGEVECSGMVFDRMGERNAVSWSSLIAAYIQNNEEEKACCLFLQARKEDIKPIDFMVSSVLCACAGLSGIELGRSVQALAVKACVE

Query:  ENIFVGSALVDMYGKCGSIDEAERAFKEMPEKNLVSWNTLLGGYAHQGHADKAVALLEEMTSAAGMAPSYVSLVCALSACSRAGDLKRGMQIFESMKARY
        ENIFVGSALVDMYGKCGSID+AE+AF EMPE+NLVSWN LLGGYAHQG+ADKAVALLEEM SA G+AP+YVSLVCALSACSRAGDLK GMQIFESMKARY
Subjt:  ENIFVGSALVDMYGKCGSIDEAERAFKEMPEKNLVSWNTLLGGYAHQGHADKAVALLEEMTSAAGMAPSYVSLVCALSACSRAGDLKRGMQIFESMKARY

Query:  NVEPGPEHYASLVDLLGRAGMVECAYDFIKNMPFSPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAAT
         +EPGPEHYA LVDL GRAGMVECAYDFI+ MPF PTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNM AAT
Subjt:  NVEPGPEHYASLVDLLGRAGMVECAYDFIKNMPFSPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAAT

XP_022137756.1 pentatricopeptide repeat-containing protein At4g14850 [Momordica charantia]1.8e-28399.18Show/hide
Query:  FLYNHLVNMYAKLDHPDSAELVLGLAPCRSVVTWTALIAGSVQNGHFASALLYFSHMLSDCVRPNDFTFPCALKASTSLRMAMSGKQIHALAVKEGLIND
        FLYNHLVNMYAKLDHPDSAELVLGLAPCRSVVTWTALIAGSVQNGHF+SALLYFSHMLSDCVRPNDFTFPCALKASTSLRMAMSGKQIHALAVKEGLIND
Subjt:  FLYNHLVNMYAKLDHPDSAELVLGLAPCRSVVTWTALIAGSVQNGHFASALLYFSHMLSDCVRPNDFTFPCALKASTSLRMAMSGKQIHALAVKEGLIND

Query:  VFVGCSTFDMYGKLGLLEDASKVFVEMPHRNLETWNAYISNSVLHGRPEDSVIAFLELLRAGGSPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGFEQ
        VFVGCSTFDMY KLGLLEDASKVFVEMPHRNLETWNAYISNSV HGRPEDSVIAFLELLRAGGSPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGFEQ
Subjt:  VFVGCSTFDMYGKLGLLEDASKVFVEMPHRNLETWNAYISNSVLHGRPEDSVIAFLELLRAGGSPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGFEQ

Query:  NVSVSNGLIDFYGKCGEVECSGMVFDRMGERNAVSWSSLIAAYIQNNEEEKACCLFLQARKEDIKPIDFMVSSVLCACAGLSGIELGRSVQALAVKACVE
        NVSVSNGLIDFYGKCGEVECSGMVFDRMGERNAVSWSSLIAAYIQNNEEEKACCLFLQARKEDIKPIDFMVSSVLCACAGLSGIELGRSVQALAVKACVE
Subjt:  NVSVSNGLIDFYGKCGEVECSGMVFDRMGERNAVSWSSLIAAYIQNNEEEKACCLFLQARKEDIKPIDFMVSSVLCACAGLSGIELGRSVQALAVKACVE

Query:  ENIFVGSALVDMYGKCGSIDEAERAFKEMPEKNLVSWNTLLGGYAHQGHADKAVALLEEMTSAAGMAPSYVSLVCALSACSRAGDLKRGMQIFESMKARY
        ENIFVGSALVDMYGKCGSIDEAERAFKEMP+KNLVSWNTLLGGYAHQGHADKAVALLEEMTSAAGMAPSYVSLVCALSACSRAGDLKRGMQIFESMKARY
Subjt:  ENIFVGSALVDMYGKCGSIDEAERAFKEMPEKNLVSWNTLLGGYAHQGHADKAVALLEEMTSAAGMAPSYVSLVCALSACSRAGDLKRGMQIFESMKARY

Query:  NVEPGPEHYASLVDLLGRAGMVECAYDFIKNMPFSPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATG
        NVEPGPEHYASLVDLLGRAGMVECAYDFIKNMPFSPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATG
Subjt:  NVEPGPEHYASLVDLLGRAGMVECAYDFIKNMPFSPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATG

XP_022956070.1 pentatricopeptide repeat-containing protein At4g14850 [Cucurbita moschata]1.2e-25388.25Show/hide
Query:  FLYNHLVNMYAKLDHPDSAELVLGLAPCRSVVTWTALIAGSVQNGHFASALLYFSHMLSDCVRPNDFTFPCALKASTSLRMAMSGKQIHALAVKEGLIND
        FLYNHLVNMYAKLD  +SAEL+L LAPCRSVVTWT+LIAGSVQNG FASALL+FS MLSDCVRPNDFTFPC  KAST LRMAM+GKQ+HALAVKEGLIND
Subjt:  FLYNHLVNMYAKLDHPDSAELVLGLAPCRSVVTWTALIAGSVQNGHFASALLYFSHMLSDCVRPNDFTFPCALKASTSLRMAMSGKQIHALAVKEGLIND

Query:  VFVGCSTFDMYGKLGLLEDASKVFVEMPHRNLETWNAYISNSVLHGRPEDSVIAFLELLRAGGSPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGFEQ
        VFVGCS FDMY KLGLL+DA K+FVEMPHRNLETWNAYISNSVLHGRPEDS IAF+ELLRAGG PDSITFCAFLNACSDKLGLEPGCQLHGFIIRSG  Q
Subjt:  VFVGCSTFDMYGKLGLLEDASKVFVEMPHRNLETWNAYISNSVLHGRPEDSVIAFLELLRAGGSPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGFEQ

Query:  NVSVSNGLIDFYGKCGEVECSGMVFDRMGERNAVSWSSLIAAYIQNNEEEKACCLFLQARKEDIKPIDFMVSSVLCACAGLSGIELGRSVQALAVKACVE
        NVS+SNGLIDFYGKCGEV CS ++FDRMGERN+VSWSSLIAAY+QNNEEEKACCLFL+ARKEDIKP DFMVSSVLCA AGLS IELGRSVQALAVKACV+
Subjt:  NVSVSNGLIDFYGKCGEVECSGMVFDRMGERNAVSWSSLIAAYIQNNEEEKACCLFLQARKEDIKPIDFMVSSVLCACAGLSGIELGRSVQALAVKACVE

Query:  ENIFVGSALVDMYGKCGSIDEAERAFKEMPEKNLVSWNTLLGGYAHQGHADKAVALLEEMTSAAGMAPSYVSLVCALSACSRAGDLKRGMQIFESMKARY
        ENIFVGSALVDMYGKCGSID+AE+AF EMPE+NLVSWN LLGGYAHQG+ADKAVALL++M S  G+APSYVSLVCALSACSRAGDLK GMQIFESMKARY
Subjt:  ENIFVGSALVDMYGKCGSIDEAERAFKEMPEKNLVSWNTLLGGYAHQGHADKAVALLEEMTSAAGMAPSYVSLVCALSACSRAGDLKRGMQIFESMKARY

Query:  NVEPGPEHYASLVDLLGRAGMVECAYDFIKNMPFSPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAAT
         +EPGPEHYA LVDL GRAGMVECAYDFI+ MPF PTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNM AAT
Subjt:  NVEPGPEHYASLVDLLGRAGMVECAYDFIKNMPFSPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAAT

XP_022979420.1 pentatricopeptide repeat-containing protein At4g14850 [Cucurbita maxima]1.1e-25489.28Show/hide
Query:  FLYNHLVNMYAKLDHPDSAELVLGLAPCRSVVTWTALIAGSVQNGHFASALLYFSHMLSDCVRPNDFTFPCALKASTSLRMAMSGKQIHALAVKEGLIND
        FLYNHLVNMYAKLD  +SAEL+L LAPCRSVVTWT+LIAGSVQNG F+SALL+FS MLSDCVRPNDFTFPC LKAST LRMAM+GKQ+HALAVKEGLIND
Subjt:  FLYNHLVNMYAKLDHPDSAELVLGLAPCRSVVTWTALIAGSVQNGHFASALLYFSHMLSDCVRPNDFTFPCALKASTSLRMAMSGKQIHALAVKEGLIND

Query:  VFVGCSTFDMYGKLGLLEDASKVFVEMPHRNLETWNAYISNSVLHGRPEDSVIAFLELLRAGGSPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGFEQ
        VFVGCS FDMY KLGLL+DA K+FVEMPHRNLETWNAYISNSVLHGRPEDS IAF+ELLRAGG PDSITFCAFLNACSDKLGLEPGCQLHGFIIRSG  Q
Subjt:  VFVGCSTFDMYGKLGLLEDASKVFVEMPHRNLETWNAYISNSVLHGRPEDSVIAFLELLRAGGSPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGFEQ

Query:  NVSVSNGLIDFYGKCGEVECSGMVFDRMGERNAVSWSSLIAAYIQNNEEEKACCLFLQARKEDIKPIDFMVSSVLCACAGLSGIELGRSVQALAVKACVE
        NVS+SNGLIDFYGKCGEV CS ++FDRMGERN+VSWSSLIAAY+QNNEEEKACCLFL+ARKE IKP DFMVSSVLCA AGLS IELGRSVQALAVKACVE
Subjt:  NVSVSNGLIDFYGKCGEVECSGMVFDRMGERNAVSWSSLIAAYIQNNEEEKACCLFLQARKEDIKPIDFMVSSVLCACAGLSGIELGRSVQALAVKACVE

Query:  ENIFVGSALVDMYGKCGSIDEAERAFKEMPEKNLVSWNTLLGGYAHQGHADKAVALLEEMTSAAGMAPSYVSLVCALSACSRAGDLKRGMQIFESMKARY
        ENIFVGSALVDMYGKCGSIDEAERAF EMPE+NLVSWN+LLGGYAHQG ADKAVALLEEM SA G+APSYVSLVCALSACSRAGDLK GMQIFESMKARY
Subjt:  ENIFVGSALVDMYGKCGSIDEAERAFKEMPEKNLVSWNTLLGGYAHQGHADKAVALLEEMTSAAGMAPSYVSLVCALSACSRAGDLKRGMQIFESMKARY

Query:  NVEPGPEHYASLVDLLGRAGMVECAYDFIKNMPFSPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAAT
         +EPGPEHYA LVDL GRAGMVECAYDFI+ MPF PTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNM AAT
Subjt:  NVEPGPEHYASLVDLLGRAGMVECAYDFIKNMPFSPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAAT

XP_038881355.1 pentatricopeptide repeat-containing protein At4g14850 [Benincasa hispida]3.1e-25489.3Show/hide
Query:  FLYNHLVNMYAKLDHPDSAELVLGLAPCRSVVTWTALIAGSVQNGHFASALLYFSHMLSDCVRPNDFTFPCALKASTSLRMAMSGKQIHALAVKEGLIND
        FLYNHLVNMYAK DH +SA+L+L LAPCRSVVTWTALIAGSVQNG FASALL+FS MLSDCVRPNDFTFPC LKAST LRMAM+GKQ+HALAVKEGLIND
Subjt:  FLYNHLVNMYAKLDHPDSAELVLGLAPCRSVVTWTALIAGSVQNGHFASALLYFSHMLSDCVRPNDFTFPCALKASTSLRMAMSGKQIHALAVKEGLIND

Query:  VFVGCSTFDMYGKLGLLEDASKVFVEMPHRNLETWNAYISNSVLHGRPEDSVIAFLELLRAGGSPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGFEQ
        VFVGCS FDMY KLGLL DA K+F EMPHRNLET NAYISNSVLHGRPEDS IAF+ELLR G  PDSITFCAF NACSDKLGL PGCQLHGFIIRSG+ Q
Subjt:  VFVGCSTFDMYGKLGLLEDASKVFVEMPHRNLETWNAYISNSVLHGRPEDSVIAFLELLRAGGSPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGFEQ

Query:  NVSVSNGLIDFYGKCGEVECSGMVFDRMGERNAVSWSSLIAAYIQNNEEEKACCLFLQARKEDIKPIDFMVSSVLCACAGLSGIELGRSVQALAVKACVE
        NVSVSNGLIDFYGKCGEVECS MVFDRMGERN+VSWSSLIAAY+QNNEEEKA CLFL+ARKEDIKP DFMVSSVLCACAGLS IE GRSVQALAVKACVE
Subjt:  NVSVSNGLIDFYGKCGEVECSGMVFDRMGERNAVSWSSLIAAYIQNNEEEKACCLFLQARKEDIKPIDFMVSSVLCACAGLSGIELGRSVQALAVKACVE

Query:  ENIFVGSALVDMYGKCGSIDEAERAFKEMPEKNLVSWNTLLGGYAHQGHADKAVALLEEMTSAAGMAPSYVSLVCALSACSRAGDLKRGMQIFESMKARY
        ENIFVGSALVDMYGKCGSI EAE+AF EMPE+NLVSWN LLGGYAHQGHADKAVALLEEM S AG++PSYVSLVCALSACSRAGDLK GMQIFESMKARY
Subjt:  ENIFVGSALVDMYGKCGSIDEAERAFKEMPEKNLVSWNTLLGGYAHQGHADKAVALLEEMTSAAGMAPSYVSLVCALSACSRAGDLKRGMQIFESMKARY

Query:  NVEPGPEHYASLVDLLGRAGMVECAYDFIKNMPFSPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATG
         +EPGPEHYA LVDLLGRAGMVECAYDFIK MPF PTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATG
Subjt:  NVEPGPEHYASLVDLLGRAGMVECAYDFIKNMPFSPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATG

TrEMBL top hitse value%identityAlignment
A0A0A0L4T8 Uncharacterized protein7.4e-25488.68Show/hide
Query:  FLYNHLVNMYAKLDHPDSAELVLGLAPCRSVVTWTALIAGSVQNGHFASALLYFSHMLSDCVRPNDFTFPCALKASTSLRMAMSGKQIHALAVKEGLIND
        FLYNHLVNMYAKLDH +SA+L+L LAPCRSVVTWTALIAGSVQNG F SALL+FS MLSDCVRPNDFTFPC LKAST LRM  +GKQ+HALAVKEGLIND
Subjt:  FLYNHLVNMYAKLDHPDSAELVLGLAPCRSVVTWTALIAGSVQNGHFASALLYFSHMLSDCVRPNDFTFPCALKASTSLRMAMSGKQIHALAVKEGLIND

Query:  VFVGCSTFDMYGKLGLLEDASKVFVEMPHRNLETWNAYISNSVLHGRPEDSVIAFLELLRAGGSPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGFEQ
        VFVGCS FDMY KLG L DA KVF EMPHRNLETWNAYISNSVLHGRPEDSVIAF+ELLR GG PDSITFCAFLNACSDKLGL PGCQLHGFIIRSG+ Q
Subjt:  VFVGCSTFDMYGKLGLLEDASKVFVEMPHRNLETWNAYISNSVLHGRPEDSVIAFLELLRAGGSPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGFEQ

Query:  NVSVSNGLIDFYGKCGEVECSGMVFDRMGERNAVSWSSLIAAYIQNNEEEKACCLFLQARKEDIKPIDFMVSSVLCACAGLSGIELGRSVQALAVKACVE
        NVSVSNGLIDFYGKCGEVECS MVFDRMGERN+VSWSSLIAAY+QNNEEEKA CLFL+ARKEDI+P DFMVSSVLCACAGLS IE GRSVQALAVKACVE
Subjt:  NVSVSNGLIDFYGKCGEVECSGMVFDRMGERNAVSWSSLIAAYIQNNEEEKACCLFLQARKEDIKPIDFMVSSVLCACAGLSGIELGRSVQALAVKACVE

Query:  ENIFVGSALVDMYGKCGSIDEAERAFKEMPEKNLVSWNTLLGGYAHQGHADKAVALLEEMTSAAGMAPSYVSLVCALSACSRAGDLKRGMQIFESMKARY
        +NIFV SALVDMYGKCGSID AE+AF  MPE+NLVSWN LLGGYAHQGHA+KAVALLEEMTSAAG+ PSYVSL+CALSACSRAGDLK GM+IFESMK RY
Subjt:  ENIFVGSALVDMYGKCGSIDEAERAFKEMPEKNLVSWNTLLGGYAHQGHADKAVALLEEMTSAAGMAPSYVSLVCALSACSRAGDLKRGMQIFESMKARY

Query:  NVEPGPEHYASLVDLLGRAGMVECAYDFIKNMPFSPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATG
         VEPGPEHYA LVDLLGRAGMVECAYDFIK MPF PTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATG
Subjt:  NVEPGPEHYASLVDLLGRAGMVECAYDFIKNMPFSPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATG

A0A5A7U206 Pentatricopeptide repeat-containing protein1.3e-25087.45Show/hide
Query:  FLYNHLVNMYAKLDHPDSAELVLGLAPCRSVVTWTALIAGSVQNGHFASALLYFSHMLSDCVRPNDFTFPCALKASTSLRMAMSGKQIHALAVKEGLIND
        FLYNHLVNMYAKLDH +SA+L+L LAPCRSVVTWTALIAGSVQNG F SALL+FS MLSDCVRPNDFTFPC LKAST LRM M+GKQ+HALAVKEGLIND
Subjt:  FLYNHLVNMYAKLDHPDSAELVLGLAPCRSVVTWTALIAGSVQNGHFASALLYFSHMLSDCVRPNDFTFPCALKASTSLRMAMSGKQIHALAVKEGLIND

Query:  VFVGCSTFDMYGKLGLLEDASKVFVEMPHRNLETWNAYISNSVLHGRPEDSVIAFLELLRAGGSPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGFEQ
        VFVGCS FDMY KLG L DA K+F EMP RNLETWNAYI+NSVLHGRPEDS IAF+ELLR G  PDSITFCAFLNACSDKLGL PGCQLHGF+IRSG+ Q
Subjt:  VFVGCSTFDMYGKLGLLEDASKVFVEMPHRNLETWNAYISNSVLHGRPEDSVIAFLELLRAGGSPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGFEQ

Query:  NVSVSNGLIDFYGKCGEVECSGMVFDRMGERNAVSWSSLIAAYIQNNEEEKACCLFLQARKEDIKPIDFMVSSVLCACAGLSGIELGRSVQALAVKACVE
        NVSVSNGLIDFYGKCGEVECS MVFDRMGERN+VSWSSLIAAY+QNNEEEKA CLFL+ARKEDI+P DFMVSSVLCACAGLS IE GRSVQALAVKACVE
Subjt:  NVSVSNGLIDFYGKCGEVECSGMVFDRMGERNAVSWSSLIAAYIQNNEEEKACCLFLQARKEDIKPIDFMVSSVLCACAGLSGIELGRSVQALAVKACVE

Query:  ENIFVGSALVDMYGKCGSIDEAERAFKEMPEKNLVSWNTLLGGYAHQGHADKAVALLEEMTSAAGMAPSYVSLVCALSACSRAGDLKRGMQIFESMKARY
        +NIFV SALVDMYGKCGSID A +AF  MPE+NLVSWN LLGGYAHQGHA+KAVALLEEMTSAAG+ PSYVSL+CALSACSRAGDLK GM+IFESMK RY
Subjt:  ENIFVGSALVDMYGKCGSIDEAERAFKEMPEKNLVSWNTLLGGYAHQGHADKAVALLEEMTSAAGMAPSYVSLVCALSACSRAGDLKRGMQIFESMKARY

Query:  NVEPGPEHYASLVDLLGRAGMVECAYDFIKNMPFSPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATG
         VEPGPEHYA LVDLLGRAGMVECAYDFIK MPF PTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATG
Subjt:  NVEPGPEHYASLVDLLGRAGMVECAYDFIKNMPFSPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATG

A0A6J1C7M0 pentatricopeptide repeat-containing protein At4g148508.9e-28499.18Show/hide
Query:  FLYNHLVNMYAKLDHPDSAELVLGLAPCRSVVTWTALIAGSVQNGHFASALLYFSHMLSDCVRPNDFTFPCALKASTSLRMAMSGKQIHALAVKEGLIND
        FLYNHLVNMYAKLDHPDSAELVLGLAPCRSVVTWTALIAGSVQNGHF+SALLYFSHMLSDCVRPNDFTFPCALKASTSLRMAMSGKQIHALAVKEGLIND
Subjt:  FLYNHLVNMYAKLDHPDSAELVLGLAPCRSVVTWTALIAGSVQNGHFASALLYFSHMLSDCVRPNDFTFPCALKASTSLRMAMSGKQIHALAVKEGLIND

Query:  VFVGCSTFDMYGKLGLLEDASKVFVEMPHRNLETWNAYISNSVLHGRPEDSVIAFLELLRAGGSPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGFEQ
        VFVGCSTFDMY KLGLLEDASKVFVEMPHRNLETWNAYISNSV HGRPEDSVIAFLELLRAGGSPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGFEQ
Subjt:  VFVGCSTFDMYGKLGLLEDASKVFVEMPHRNLETWNAYISNSVLHGRPEDSVIAFLELLRAGGSPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGFEQ

Query:  NVSVSNGLIDFYGKCGEVECSGMVFDRMGERNAVSWSSLIAAYIQNNEEEKACCLFLQARKEDIKPIDFMVSSVLCACAGLSGIELGRSVQALAVKACVE
        NVSVSNGLIDFYGKCGEVECSGMVFDRMGERNAVSWSSLIAAYIQNNEEEKACCLFLQARKEDIKPIDFMVSSVLCACAGLSGIELGRSVQALAVKACVE
Subjt:  NVSVSNGLIDFYGKCGEVECSGMVFDRMGERNAVSWSSLIAAYIQNNEEEKACCLFLQARKEDIKPIDFMVSSVLCACAGLSGIELGRSVQALAVKACVE

Query:  ENIFVGSALVDMYGKCGSIDEAERAFKEMPEKNLVSWNTLLGGYAHQGHADKAVALLEEMTSAAGMAPSYVSLVCALSACSRAGDLKRGMQIFESMKARY
        ENIFVGSALVDMYGKCGSIDEAERAFKEMP+KNLVSWNTLLGGYAHQGHADKAVALLEEMTSAAGMAPSYVSLVCALSACSRAGDLKRGMQIFESMKARY
Subjt:  ENIFVGSALVDMYGKCGSIDEAERAFKEMPEKNLVSWNTLLGGYAHQGHADKAVALLEEMTSAAGMAPSYVSLVCALSACSRAGDLKRGMQIFESMKARY

Query:  NVEPGPEHYASLVDLLGRAGMVECAYDFIKNMPFSPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATG
        NVEPGPEHYASLVDLLGRAGMVECAYDFIKNMPFSPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATG
Subjt:  NVEPGPEHYASLVDLLGRAGMVECAYDFIKNMPFSPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATG

A0A6J1GWT9 pentatricopeptide repeat-containing protein At4g148505.6e-25488.25Show/hide
Query:  FLYNHLVNMYAKLDHPDSAELVLGLAPCRSVVTWTALIAGSVQNGHFASALLYFSHMLSDCVRPNDFTFPCALKASTSLRMAMSGKQIHALAVKEGLIND
        FLYNHLVNMYAKLD  +SAEL+L LAPCRSVVTWT+LIAGSVQNG FASALL+FS MLSDCVRPNDFTFPC  KAST LRMAM+GKQ+HALAVKEGLIND
Subjt:  FLYNHLVNMYAKLDHPDSAELVLGLAPCRSVVTWTALIAGSVQNGHFASALLYFSHMLSDCVRPNDFTFPCALKASTSLRMAMSGKQIHALAVKEGLIND

Query:  VFVGCSTFDMYGKLGLLEDASKVFVEMPHRNLETWNAYISNSVLHGRPEDSVIAFLELLRAGGSPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGFEQ
        VFVGCS FDMY KLGLL+DA K+FVEMPHRNLETWNAYISNSVLHGRPEDS IAF+ELLRAGG PDSITFCAFLNACSDKLGLEPGCQLHGFIIRSG  Q
Subjt:  VFVGCSTFDMYGKLGLLEDASKVFVEMPHRNLETWNAYISNSVLHGRPEDSVIAFLELLRAGGSPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGFEQ

Query:  NVSVSNGLIDFYGKCGEVECSGMVFDRMGERNAVSWSSLIAAYIQNNEEEKACCLFLQARKEDIKPIDFMVSSVLCACAGLSGIELGRSVQALAVKACVE
        NVS+SNGLIDFYGKCGEV CS ++FDRMGERN+VSWSSLIAAY+QNNEEEKACCLFL+ARKEDIKP DFMVSSVLCA AGLS IELGRSVQALAVKACV+
Subjt:  NVSVSNGLIDFYGKCGEVECSGMVFDRMGERNAVSWSSLIAAYIQNNEEEKACCLFLQARKEDIKPIDFMVSSVLCACAGLSGIELGRSVQALAVKACVE

Query:  ENIFVGSALVDMYGKCGSIDEAERAFKEMPEKNLVSWNTLLGGYAHQGHADKAVALLEEMTSAAGMAPSYVSLVCALSACSRAGDLKRGMQIFESMKARY
        ENIFVGSALVDMYGKCGSID+AE+AF EMPE+NLVSWN LLGGYAHQG+ADKAVALL++M S  G+APSYVSLVCALSACSRAGDLK GMQIFESMKARY
Subjt:  ENIFVGSALVDMYGKCGSIDEAERAFKEMPEKNLVSWNTLLGGYAHQGHADKAVALLEEMTSAAGMAPSYVSLVCALSACSRAGDLKRGMQIFESMKARY

Query:  NVEPGPEHYASLVDLLGRAGMVECAYDFIKNMPFSPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAAT
         +EPGPEHYA LVDL GRAGMVECAYDFI+ MPF PTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNM AAT
Subjt:  NVEPGPEHYASLVDLLGRAGMVECAYDFIKNMPFSPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAAT

A0A6J1IQR6 pentatricopeptide repeat-containing protein At4g148505.1e-25589.28Show/hide
Query:  FLYNHLVNMYAKLDHPDSAELVLGLAPCRSVVTWTALIAGSVQNGHFASALLYFSHMLSDCVRPNDFTFPCALKASTSLRMAMSGKQIHALAVKEGLIND
        FLYNHLVNMYAKLD  +SAEL+L LAPCRSVVTWT+LIAGSVQNG F+SALL+FS MLSDCVRPNDFTFPC LKAST LRMAM+GKQ+HALAVKEGLIND
Subjt:  FLYNHLVNMYAKLDHPDSAELVLGLAPCRSVVTWTALIAGSVQNGHFASALLYFSHMLSDCVRPNDFTFPCALKASTSLRMAMSGKQIHALAVKEGLIND

Query:  VFVGCSTFDMYGKLGLLEDASKVFVEMPHRNLETWNAYISNSVLHGRPEDSVIAFLELLRAGGSPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGFEQ
        VFVGCS FDMY KLGLL+DA K+FVEMPHRNLETWNAYISNSVLHGRPEDS IAF+ELLRAGG PDSITFCAFLNACSDKLGLEPGCQLHGFIIRSG  Q
Subjt:  VFVGCSTFDMYGKLGLLEDASKVFVEMPHRNLETWNAYISNSVLHGRPEDSVIAFLELLRAGGSPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGFEQ

Query:  NVSVSNGLIDFYGKCGEVECSGMVFDRMGERNAVSWSSLIAAYIQNNEEEKACCLFLQARKEDIKPIDFMVSSVLCACAGLSGIELGRSVQALAVKACVE
        NVS+SNGLIDFYGKCGEV CS ++FDRMGERN+VSWSSLIAAY+QNNEEEKACCLFL+ARKE IKP DFMVSSVLCA AGLS IELGRSVQALAVKACVE
Subjt:  NVSVSNGLIDFYGKCGEVECSGMVFDRMGERNAVSWSSLIAAYIQNNEEEKACCLFLQARKEDIKPIDFMVSSVLCACAGLSGIELGRSVQALAVKACVE

Query:  ENIFVGSALVDMYGKCGSIDEAERAFKEMPEKNLVSWNTLLGGYAHQGHADKAVALLEEMTSAAGMAPSYVSLVCALSACSRAGDLKRGMQIFESMKARY
        ENIFVGSALVDMYGKCGSIDEAERAF EMPE+NLVSWN+LLGGYAHQG ADKAVALLEEM SA G+APSYVSLVCALSACSRAGDLK GMQIFESMKARY
Subjt:  ENIFVGSALVDMYGKCGSIDEAERAFKEMPEKNLVSWNTLLGGYAHQGHADKAVALLEEMTSAAGMAPSYVSLVCALSACSRAGDLKRGMQIFESMKARY

Query:  NVEPGPEHYASLVDLLGRAGMVECAYDFIKNMPFSPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAAT
         +EPGPEHYA LVDL GRAGMVECAYDFI+ MPF PTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNM AAT
Subjt:  NVEPGPEHYASLVDLLGRAGMVECAYDFIKNMPFSPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAAT

SwissProt top hitse value%identityAlignment
P0C898 Putative pentatricopeptide repeat-containing protein At3g151301.2e-8837.58Show/hide
Query:  NHLVNMYAKLDHPDSAELVLGLAPCRSVVTWTALIAGSVQNGHFASALLYFSHMLSDCVRPNDFTFPCALKASTSLRMAMSGKQIHALAVKEGLINDVFV
        N+L++MY K   P  A  V    P R+VV+W+AL++G V NG    +L  FS M    + PN+FTF   LKA   L     G QIH   +K G    V V
Subjt:  NHLVNMYAKLDHPDSAELVLGLAPCRSVVTWTALIAGSVQNGHFASALLYFSHMLSDCVRPNDFTFPCALKASTSLRMAMSGKQIHALAVKEGLINDVFV

Query:  GCSTFDMYGKLGLLEDASKVFVEMPHRNLETWNAYISNSVLHGRPEDSVIAFLELLRAG--GSPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGFE--
        G S  DMY K G + +A KVF  +  R+L +WNA I+  V  G    ++  F  +  A     PD  T  + L ACS    +  G Q+HGF++RSGF   
Subjt:  GCSTFDMYGKLGLLEDASKVFVEMPHRNLETWNAYISNSVLHGRPEDSVIAFLELLRAG--GSPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGFE--

Query:  QNVSVSNGLIDFYGKCGEVECSGMVFDRMGERNAVSWSSLIAAYIQNNEEEKACCLFLQARKEDIKPIDFMVSSVLCACAGLSGIELGRSVQALAVKACV
         + +++  L+D Y KCG +  +   FD++ E+  +SWSSLI  Y Q  E  +A  LF + ++ + +   F +SS++   A  + +  G+ +QALAVK   
Subjt:  QNVSVSNGLIDFYGKCGEVECSGMVFDRMGERNAVSWSSLIAAYIQNNEEEKACCLFLQARKEDIKPIDFMVSSVLCACAGLSGIELGRSVQALAVKACV

Query:  EENIFVGSALVDMYGKCGSIDEAERAFKEMPEKNLVSWNTLLGGYAHQGHADKAVALLEEMTSAAGMAPSYVSLVCALSACSRAGDLKRGMQIFESMKAR
             V +++VDMY KCG +DEAE+ F EM  K+++SW  ++ GY   G   K+V +  EM     + P  V  +  LSACS +G +K G ++F  +   
Subjt:  EENIFVGSALVDMYGKCGSIDEAERAFKEMPEKNLVSWNTLLGGYAHQGHADKAVALLEEMTSAAGMAPSYVSLVCALSACSRAGDLKRGMQIFESMKAR

Query:  YNVEPGPEHYASLVDLLGRAGMVECAYDFIKNMPFSPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATG
        + ++P  EHYA +VDLLGRAG ++ A   I  MP  P + IW  LL  CR+HG  ELGK   + L  +D K+  N+V++SN++   G
Subjt:  YNVEPGPEHYASLVDLLGRAGMVECAYDFIKNMPFSPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATG

Q0WSH6 Pentatricopeptide repeat-containing protein At4g148501.4e-18864.89Show/hide
Query:  FLYNHLVNMYAKLDHPDSAELVLGLAPCRSVVTWTALIAGSVQNGHFASALLYFSHMLSDCVRPNDFTFPCALKASTSLRMAMSGKQIHALAVKEGLIND
        FL N+L+NMY+KLDHP+SA LVL L P R+VV+WT+LI+G  QNGHF++AL+ F  M  + V PNDFTFPCA KA  SLR+ ++GKQIHALAVK G I D
Subjt:  FLYNHLVNMYAKLDHPDSAELVLGLAPCRSVVTWTALIAGSVQNGHFASALLYFSHMLSDCVRPNDFTFPCALKASTSLRMAMSGKQIHALAVKEGLIND

Query:  VFVGCSTFDMYGKLGLLEDASKVFVEMPHRNLETWNAYISNSVLHGRPEDSVIAFLELLRAGGSPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGFEQ
        VFVGCS FDMY K  L +DA K+F E+P RNLETWNA+ISNSV  GRP +++ AF+E  R  G P+SITFCAFLNACSD L L  G QLHG ++RSGF+ 
Subjt:  VFVGCSTFDMYGKLGLLEDASKVFVEMPHRNLETWNAYISNSVLHGRPEDSVIAFLELLRAGGSPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGFEQ

Query:  NVSVSNGLIDFYGKCGEVECSGMVFDRMGERNAVSWSSLIAAYIQNNEEEKACCLFLQARKEDIKPIDFMVSSVLCACAGLSGIELGRSVQALAVKACVE
        +VSV NGLIDFYGKC ++  S ++F  MG +NAVSW SL+AAY+QN+E+EKA  L+L++RK+ ++  DFM+SSVL ACAG++G+ELGRS+ A AVKACVE
Subjt:  NVSVSNGLIDFYGKCGEVECSGMVFDRMGERNAVSWSSLIAAYIQNNEEEKACCLFLQARKEDIKPIDFMVSSVLCACAGLSGIELGRSVQALAVKACVE

Query:  ENIFVGSALVDMYGKCGSIDEAERAFKEMPEKNLVSWNTLLGGYAHQGHADKAVALLEEMT-SAAGMAPSYVSLVCALSACSRAGDLKRGMQIFESMKAR
          IFVGSALVDMYGKCG I+++E+AF EMPEKNLV+ N+L+GGYAHQG  D A+AL EEM     G  P+Y++ V  LSACSRAG ++ GM+IF+SM++ 
Subjt:  ENIFVGSALVDMYGKCGSIDEAERAFKEMPEKNLVSWNTLLGGYAHQGHADKAVALLEEMT-SAAGMAPSYVSLVCALSACSRAGDLKRGMQIFESMKAR

Query:  YNVEPGPEHYASLVDLLGRAGMVECAYDFIKNMPFSPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATG
        Y +EPG EHY+ +VD+LGRAGMVE AY+FIK MP  PTIS+WGAL  ACRMHGKP+LG LAAE LF+LDPKDSGNHV+LSN FAA G
Subjt:  YNVEPGPEHYASLVDLLGRAGMVECAYDFIKNMPFSPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATG

Q3E6Q1 Pentatricopeptide repeat-containing protein At1g11290, chloroplastic1.3e-9336.57Show/hide
Query:  FLYNHLVNMYAKLDHPDSAELVLGLAPCRSVVTWTALIAGSVQNGHFASALLYFSHMLSDCVRPNDFTFPCALKASTSLRMAMSGKQIHALAVKEGLIND
        F    L NMYAK    + A  V    P R +V+W  ++AG  QNG    AL     M  + ++P+  T    L A ++LR+   GK+IH  A++ G  + 
Subjt:  FLYNHLVNMYAKLDHPDSAELVLGLAPCRSVVTWTALIAGSVQNGHFASALLYFSHMLSDCVRPNDFTFPCALKASTSLRMAMSGKQIHALAVKEGLIND

Query:  VFVGCSTFDMYGKLGLLEDASKVFVEMPHRNLETWNAYISNSVLHGRPEDSVIAFLELLRAGGSPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGFEQ
        V +  +  DMY K G LE A ++F  M  RN+ +WN+ I   V +  P+++++ F ++L  G  P  ++    L+AC+D   LE G  +H   +  G ++
Subjt:  VFVGCSTFDMYGKLGLLEDASKVFVEMPHRNLETWNAYISNSVLHGRPEDSVIAFLELLRAGGSPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGFEQ

Query:  NVSVSNGLIDFYGKCGEVECSGMVFDRMGERNAVSWSSLIAAYIQNNEEEKACCLFLQARKEDIKPIDFMVSSVLCACAGLSGIELGRSVQALAVKACVE
        NVSV N LI  Y KC EV+ +  +F ++  R  VSW+++I  + QN     A   F Q R   +KP  F   SV+ A A LS     + +  + +++C++
Subjt:  NVSVSNGLIDFYGKCGEVECSGMVFDRMGERNAVSWSSLIAAYIQNNEEEKACCLFLQARKEDIKPIDFMVSSVLCACAGLSGIELGRSVQALAVKACVE

Query:  ENIFVGSALVDMYGKCGSIDEAERAFKEMPEKNLVSWNTLLGGYAHQGHADKAVALLEEMTSAAGMAPSYVSLVCALSACSRAGDLKRGMQIFESMKARY
        +N+FV +ALVDMY KCG+I  A   F  M E+++ +WN ++ GY   G    A+ L EEM     + P+ V+ +  +SACS +G ++ G++ F  MK  Y
Subjt:  ENIFVGSALVDMYGKCGSIDEAERAFKEMPEKNLVSWNTLLGGYAHQGHADKAVALLEEMTSAAGMAPSYVSLVCALSACSRAGDLKRGMQIFESMKARY

Query:  NVEPGPEHYASLVDLLGRAGMVECAYDFIKNMPFSPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAA
        ++E   +HY ++VDLLGRAG +  A+DFI  MP  P ++++GA+LGAC++H      + AAE+LFEL+P D G HV+L+N++ A
Subjt:  NVEPGPEHYASLVDLLGRAGMVECAYDFIKNMPFSPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAA

Q9FIB2 Putative pentatricopeptide repeat-containing protein At5g099501.8e-9239.22Show/hide
Query:  NHLVNMYAKLDHPDSAELVLGLAPCRSVVTWTALIAGSVQNGHFASALLYFSHMLSDCVRPNDFTFPCALKASTSLRMAMSGKQIHALAVKEGLINDVFV
        N LVNMYAK      A  V      +  V+W ++I G  QNG F  A+  +  M    + P  FT   +L +  SL+ A  G+QIH  ++K G+  +V V
Subjt:  NHLVNMYAKLDHPDSAELVLGLAPCRSVVTWTALIAGSVQNGHFASALLYFSHMLSDCVRPNDFTFPCALKASTSLRMAMSGKQIHALAVKEGLINDVFV

Query:  GCSTFDMYGKLGLLEDASKVFVEMPHRNLETWNAYISNSVLHGRP-EDSVIAFLELLRAGGSPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGFEQNV
          +   +Y + G L +  K+F  MP  +  +WN+ I       R   ++V+ FL   RAG   + ITF + L+A S     E G Q+HG  +++      
Subjt:  GCSTFDMYGKLGLLEDASKVFVEMPHRNLETWNAYISNSVLHGRP-EDSVIAFLELLRAGGSPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGFEQNV

Query:  SVSNGLIDFYGKCGEVECSGMVFDRMGE-RNAVSWSSLIAAYIQNNEEEKACCLFLQARKEDIKPIDFMVSSVLCACAGLSGIELGRSVQALAVKACVEE
        +  N LI  YGKCGE++    +F RM E R+ V+W+S+I+ YI N    KA  L     +   +   FM ++VL A A ++ +E G  V A +V+AC+E 
Subjt:  SVSNGLIDFYGKCGEVECSGMVFDRMGE-RNAVSWSSLIAAYIQNNEEEKACCLFLQARKEDIKPIDFMVSSVLCACAGLSGIELGRSVQALAVKACVEE

Query:  NIFVGSALVDMYGKCGSIDEAERAFKEMPEKNLVSWNTLLGGYAHQGHADKAVALLEEMTSAAGMAPSYVSLVCALSACSRAGDLKRGMQIFESMKARYN
        ++ VGSALVDMY KCG +D A R F  MP +N  SWN+++ GYA  G  ++A+ L E M       P +V+ V  LSACS AG L+ G + FESM   Y 
Subjt:  NIFVGSALVDMYGKCGSIDEAERAFKEMPEKNLVSWNTLLGGYAHQGHADKAVALLEEMTSAAGMAPSYVSLVCALSACSRAGDLKRGMQIFESMKARYN

Query:  VEPGPEHYASLVDLLGRAGMVECAYDFIKNMPFSPTISIWGALLGA-CRMHG-KPELGKLAAEKLFELDPKDSGNHVVLSNMFAATG
        + P  EH++ + D+LGRAG ++   DFI+ MP  P + IW  +LGA CR +G K ELGK AAE LF+L+P+++ N+V+L NM+AA G
Subjt:  VEPGPEHYASLVDLLGRAGMVECAYDFIKNMPFSPTISIWGALLGA-CRMHG-KPELGKLAAEKLFELDPKDSGNHVVLSNMFAATG

Q9SIT7 Pentatricopeptide repeat-containing protein At2g136001.8e-9233.59Show/hide
Query:  FLYNHLVNMYAKLDHPDSAELVLGLAPCRSVVTWTALIAGSVQNGHFASALLYFSHMLSDCVRPNDFTFPCALKASTSLRMAMSGKQIHALAVKEGLIND
        + +N +V    KL   D A+ +    P R   TW ++++G  Q+     AL YF+ M  +    N+++F   L A + L     G Q+H+L  K   ++D
Subjt:  FLYNHLVNMYAKLDHPDSAELVLGLAPCRSVVTWTALIAGSVQNGHFASALLYFSHMLSDCVRPNDFTFPCALKASTSLRMAMSGKQIHALAVKEGLIND

Query:  VFVGCSTFDMYGKLGLLEDASKVFVEMPHRNLETWNAYISNSVLHGRPEDSVIAFLELLRAGGSPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSG-FE
        V++G +  DMY K G + DA +VF EM  RN+ +WN+ I+    +G   +++  F  +L +   PD +T  + ++AC+    ++ G ++HG ++++    
Subjt:  VFVGCSTFDMYGKLGLLEDASKVFVEMPHRNLETWNAYISNSVLHGRPEDSVIAFLELLRAGGSPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSG-FE

Query:  QNVSVSNGLIDFYGKCGEVECSGMVFD-------------------------------RMGERNAVSWSSLIAAYIQNNEEEKACCLFLQARKEDIKPID
         ++ +SN  +D Y KC  ++ +  +FD                               +M ERN VSW++LIA Y QN E E+A  LF   ++E + P  
Subjt:  QNVSVSNGLIDFYGKCGEVECSGMVFD-------------------------------RMGERNAVSWSSLIAAYIQNNEEEKACCLFLQARKEDIKPID

Query:  FMVSSVLCACAGLSGIELGRSVQALAVK------ACVEENIFVGSALVDMYGKCGSIDEAERAFKEMPEKNLVSWNTLLGGYAHQGHADKAVALLEEMTS
        +  +++L ACA L+ + LG       +K      +  E++IFVG++L+DMY KCG ++E    F++M E++ VSWN ++ G+A  G+ ++A+ L  EM  
Subjt:  FMVSSVLCACAGLSGIELGRSVQALAVK------ACVEENIFVGSALVDMYGKCGSIDEAERAFKEMPEKNLVSWNTLLGGYAHQGHADKAVALLEEMTS

Query:  AAGMAPSYVSLVCALSACSRAGDLKRGMQIFESMKARYNVEPGPEHYASLVDLLGRAGMVECAYDFIKNMPFSPTISIWGALLGACRMHGKPELGKLAAE
         +G  P +++++  LSAC  AG ++ G   F SM   + V P  +HY  +VDLLGRAG +E A   I+ MP  P   IWG+LL AC++H    LGK  AE
Subjt:  AAGMAPSYVSLVCALSACSRAGDLKRGMQIFESMKARYNVEPGPEHYASLVDLLGRAGMVECAYDFIKNMPFSPTISIWGALLGACRMHGKPELGKLAAE

Query:  KLFELDPKDSGNHVVLSNMFAATG
        KL E++P +SG +V+LSNM+A  G
Subjt:  KLFELDPKDSGNHVVLSNMFAATG

Arabidopsis top hitse value%identityAlignment
AT1G11290.1 Pentatricopeptide repeat (PPR) superfamily protein9.0e-9536.57Show/hide
Query:  FLYNHLVNMYAKLDHPDSAELVLGLAPCRSVVTWTALIAGSVQNGHFASALLYFSHMLSDCVRPNDFTFPCALKASTSLRMAMSGKQIHALAVKEGLIND
        F    L NMYAK    + A  V    P R +V+W  ++AG  QNG    AL     M  + ++P+  T    L A ++LR+   GK+IH  A++ G  + 
Subjt:  FLYNHLVNMYAKLDHPDSAELVLGLAPCRSVVTWTALIAGSVQNGHFASALLYFSHMLSDCVRPNDFTFPCALKASTSLRMAMSGKQIHALAVKEGLIND

Query:  VFVGCSTFDMYGKLGLLEDASKVFVEMPHRNLETWNAYISNSVLHGRPEDSVIAFLELLRAGGSPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGFEQ
        V +  +  DMY K G LE A ++F  M  RN+ +WN+ I   V +  P+++++ F ++L  G  P  ++    L+AC+D   LE G  +H   +  G ++
Subjt:  VFVGCSTFDMYGKLGLLEDASKVFVEMPHRNLETWNAYISNSVLHGRPEDSVIAFLELLRAGGSPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGFEQ

Query:  NVSVSNGLIDFYGKCGEVECSGMVFDRMGERNAVSWSSLIAAYIQNNEEEKACCLFLQARKEDIKPIDFMVSSVLCACAGLSGIELGRSVQALAVKACVE
        NVSV N LI  Y KC EV+ +  +F ++  R  VSW+++I  + QN     A   F Q R   +KP  F   SV+ A A LS     + +  + +++C++
Subjt:  NVSVSNGLIDFYGKCGEVECSGMVFDRMGERNAVSWSSLIAAYIQNNEEEKACCLFLQARKEDIKPIDFMVSSVLCACAGLSGIELGRSVQALAVKACVE

Query:  ENIFVGSALVDMYGKCGSIDEAERAFKEMPEKNLVSWNTLLGGYAHQGHADKAVALLEEMTSAAGMAPSYVSLVCALSACSRAGDLKRGMQIFESMKARY
        +N+FV +ALVDMY KCG+I  A   F  M E+++ +WN ++ GY   G    A+ L EEM     + P+ V+ +  +SACS +G ++ G++ F  MK  Y
Subjt:  ENIFVGSALVDMYGKCGSIDEAERAFKEMPEKNLVSWNTLLGGYAHQGHADKAVALLEEMTSAAGMAPSYVSLVCALSACSRAGDLKRGMQIFESMKARY

Query:  NVEPGPEHYASLVDLLGRAGMVECAYDFIKNMPFSPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAA
        ++E   +HY ++VDLLGRAG +  A+DFI  MP  P ++++GA+LGAC++H      + AAE+LFEL+P D G HV+L+N++ A
Subjt:  NVEPGPEHYASLVDLLGRAGMVECAYDFIKNMPFSPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAA

AT2G13600.1 Pentatricopeptide repeat (PPR) superfamily protein1.3e-9333.59Show/hide
Query:  FLYNHLVNMYAKLDHPDSAELVLGLAPCRSVVTWTALIAGSVQNGHFASALLYFSHMLSDCVRPNDFTFPCALKASTSLRMAMSGKQIHALAVKEGLIND
        + +N +V    KL   D A+ +    P R   TW ++++G  Q+     AL YF+ M  +    N+++F   L A + L     G Q+H+L  K   ++D
Subjt:  FLYNHLVNMYAKLDHPDSAELVLGLAPCRSVVTWTALIAGSVQNGHFASALLYFSHMLSDCVRPNDFTFPCALKASTSLRMAMSGKQIHALAVKEGLIND

Query:  VFVGCSTFDMYGKLGLLEDASKVFVEMPHRNLETWNAYISNSVLHGRPEDSVIAFLELLRAGGSPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSG-FE
        V++G +  DMY K G + DA +VF EM  RN+ +WN+ I+    +G   +++  F  +L +   PD +T  + ++AC+    ++ G ++HG ++++    
Subjt:  VFVGCSTFDMYGKLGLLEDASKVFVEMPHRNLETWNAYISNSVLHGRPEDSVIAFLELLRAGGSPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSG-FE

Query:  QNVSVSNGLIDFYGKCGEVECSGMVFD-------------------------------RMGERNAVSWSSLIAAYIQNNEEEKACCLFLQARKEDIKPID
         ++ +SN  +D Y KC  ++ +  +FD                               +M ERN VSW++LIA Y QN E E+A  LF   ++E + P  
Subjt:  QNVSVSNGLIDFYGKCGEVECSGMVFD-------------------------------RMGERNAVSWSSLIAAYIQNNEEEKACCLFLQARKEDIKPID

Query:  FMVSSVLCACAGLSGIELGRSVQALAVK------ACVEENIFVGSALVDMYGKCGSIDEAERAFKEMPEKNLVSWNTLLGGYAHQGHADKAVALLEEMTS
        +  +++L ACA L+ + LG       +K      +  E++IFVG++L+DMY KCG ++E    F++M E++ VSWN ++ G+A  G+ ++A+ L  EM  
Subjt:  FMVSSVLCACAGLSGIELGRSVQALAVK------ACVEENIFVGSALVDMYGKCGSIDEAERAFKEMPEKNLVSWNTLLGGYAHQGHADKAVALLEEMTS

Query:  AAGMAPSYVSLVCALSACSRAGDLKRGMQIFESMKARYNVEPGPEHYASLVDLLGRAGMVECAYDFIKNMPFSPTISIWGALLGACRMHGKPELGKLAAE
         +G  P +++++  LSAC  AG ++ G   F SM   + V P  +HY  +VDLLGRAG +E A   I+ MP  P   IWG+LL AC++H    LGK  AE
Subjt:  AAGMAPSYVSLVCALSACSRAGDLKRGMQIFESMKARYNVEPGPEHYASLVDLLGRAGMVECAYDFIKNMPFSPTISIWGALLGACRMHGKPELGKLAAE

Query:  KLFELDPKDSGNHVVLSNMFAATG
        KL E++P +SG +V+LSNM+A  G
Subjt:  KLFELDPKDSGNHVVLSNMFAATG

AT3G15130.1 Tetratricopeptide repeat (TPR)-like superfamily protein8.7e-9037.58Show/hide
Query:  NHLVNMYAKLDHPDSAELVLGLAPCRSVVTWTALIAGSVQNGHFASALLYFSHMLSDCVRPNDFTFPCALKASTSLRMAMSGKQIHALAVKEGLINDVFV
        N+L++MY K   P  A  V    P R+VV+W+AL++G V NG    +L  FS M    + PN+FTF   LKA   L     G QIH   +K G    V V
Subjt:  NHLVNMYAKLDHPDSAELVLGLAPCRSVVTWTALIAGSVQNGHFASALLYFSHMLSDCVRPNDFTFPCALKASTSLRMAMSGKQIHALAVKEGLINDVFV

Query:  GCSTFDMYGKLGLLEDASKVFVEMPHRNLETWNAYISNSVLHGRPEDSVIAFLELLRAG--GSPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGFE--
        G S  DMY K G + +A KVF  +  R+L +WNA I+  V  G    ++  F  +  A     PD  T  + L ACS    +  G Q+HGF++RSGF   
Subjt:  GCSTFDMYGKLGLLEDASKVFVEMPHRNLETWNAYISNSVLHGRPEDSVIAFLELLRAG--GSPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGFE--

Query:  QNVSVSNGLIDFYGKCGEVECSGMVFDRMGERNAVSWSSLIAAYIQNNEEEKACCLFLQARKEDIKPIDFMVSSVLCACAGLSGIELGRSVQALAVKACV
         + +++  L+D Y KCG +  +   FD++ E+  +SWSSLI  Y Q  E  +A  LF + ++ + +   F +SS++   A  + +  G+ +QALAVK   
Subjt:  QNVSVSNGLIDFYGKCGEVECSGMVFDRMGERNAVSWSSLIAAYIQNNEEEKACCLFLQARKEDIKPIDFMVSSVLCACAGLSGIELGRSVQALAVKACV

Query:  EENIFVGSALVDMYGKCGSIDEAERAFKEMPEKNLVSWNTLLGGYAHQGHADKAVALLEEMTSAAGMAPSYVSLVCALSACSRAGDLKRGMQIFESMKAR
             V +++VDMY KCG +DEAE+ F EM  K+++SW  ++ GY   G   K+V +  EM     + P  V  +  LSACS +G +K G ++F  +   
Subjt:  EENIFVGSALVDMYGKCGSIDEAERAFKEMPEKNLVSWNTLLGGYAHQGHADKAVALLEEMTSAAGMAPSYVSLVCALSACSRAGDLKRGMQIFESMKAR

Query:  YNVEPGPEHYASLVDLLGRAGMVECAYDFIKNMPFSPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATG
        + ++P  EHYA +VDLLGRAG ++ A   I  MP  P + IW  LL  CR+HG  ELGK   + L  +D K+  N+V++SN++   G
Subjt:  YNVEPGPEHYASLVDLLGRAGMVECAYDFIKNMPFSPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATG

AT4G14850.1 Pentatricopeptide repeat (PPR) superfamily protein9.7e-19064.89Show/hide
Query:  FLYNHLVNMYAKLDHPDSAELVLGLAPCRSVVTWTALIAGSVQNGHFASALLYFSHMLSDCVRPNDFTFPCALKASTSLRMAMSGKQIHALAVKEGLIND
        FL N+L+NMY+KLDHP+SA LVL L P R+VV+WT+LI+G  QNGHF++AL+ F  M  + V PNDFTFPCA KA  SLR+ ++GKQIHALAVK G I D
Subjt:  FLYNHLVNMYAKLDHPDSAELVLGLAPCRSVVTWTALIAGSVQNGHFASALLYFSHMLSDCVRPNDFTFPCALKASTSLRMAMSGKQIHALAVKEGLIND

Query:  VFVGCSTFDMYGKLGLLEDASKVFVEMPHRNLETWNAYISNSVLHGRPEDSVIAFLELLRAGGSPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGFEQ
        VFVGCS FDMY K  L +DA K+F E+P RNLETWNA+ISNSV  GRP +++ AF+E  R  G P+SITFCAFLNACSD L L  G QLHG ++RSGF+ 
Subjt:  VFVGCSTFDMYGKLGLLEDASKVFVEMPHRNLETWNAYISNSVLHGRPEDSVIAFLELLRAGGSPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGFEQ

Query:  NVSVSNGLIDFYGKCGEVECSGMVFDRMGERNAVSWSSLIAAYIQNNEEEKACCLFLQARKEDIKPIDFMVSSVLCACAGLSGIELGRSVQALAVKACVE
        +VSV NGLIDFYGKC ++  S ++F  MG +NAVSW SL+AAY+QN+E+EKA  L+L++RK+ ++  DFM+SSVL ACAG++G+ELGRS+ A AVKACVE
Subjt:  NVSVSNGLIDFYGKCGEVECSGMVFDRMGERNAVSWSSLIAAYIQNNEEEKACCLFLQARKEDIKPIDFMVSSVLCACAGLSGIELGRSVQALAVKACVE

Query:  ENIFVGSALVDMYGKCGSIDEAERAFKEMPEKNLVSWNTLLGGYAHQGHADKAVALLEEMT-SAAGMAPSYVSLVCALSACSRAGDLKRGMQIFESMKAR
          IFVGSALVDMYGKCG I+++E+AF EMPEKNLV+ N+L+GGYAHQG  D A+AL EEM     G  P+Y++ V  LSACSRAG ++ GM+IF+SM++ 
Subjt:  ENIFVGSALVDMYGKCGSIDEAERAFKEMPEKNLVSWNTLLGGYAHQGHADKAVALLEEMT-SAAGMAPSYVSLVCALSACSRAGDLKRGMQIFESMKAR

Query:  YNVEPGPEHYASLVDLLGRAGMVECAYDFIKNMPFSPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATG
        Y +EPG EHY+ +VD+LGRAGMVE AY+FIK MP  PTIS+WGAL  ACRMHGKP+LG LAAE LF+LDPKDSGNHV+LSN FAA G
Subjt:  YNVEPGPEHYASLVDLLGRAGMVECAYDFIKNMPFSPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATG

AT5G09950.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.3e-9339.22Show/hide
Query:  NHLVNMYAKLDHPDSAELVLGLAPCRSVVTWTALIAGSVQNGHFASALLYFSHMLSDCVRPNDFTFPCALKASTSLRMAMSGKQIHALAVKEGLINDVFV
        N LVNMYAK      A  V      +  V+W ++I G  QNG F  A+  +  M    + P  FT   +L +  SL+ A  G+QIH  ++K G+  +V V
Subjt:  NHLVNMYAKLDHPDSAELVLGLAPCRSVVTWTALIAGSVQNGHFASALLYFSHMLSDCVRPNDFTFPCALKASTSLRMAMSGKQIHALAVKEGLINDVFV

Query:  GCSTFDMYGKLGLLEDASKVFVEMPHRNLETWNAYISNSVLHGRP-EDSVIAFLELLRAGGSPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGFEQNV
          +   +Y + G L +  K+F  MP  +  +WN+ I       R   ++V+ FL   RAG   + ITF + L+A S     E G Q+HG  +++      
Subjt:  GCSTFDMYGKLGLLEDASKVFVEMPHRNLETWNAYISNSVLHGRP-EDSVIAFLELLRAGGSPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGFEQNV

Query:  SVSNGLIDFYGKCGEVECSGMVFDRMGE-RNAVSWSSLIAAYIQNNEEEKACCLFLQARKEDIKPIDFMVSSVLCACAGLSGIELGRSVQALAVKACVEE
        +  N LI  YGKCGE++    +F RM E R+ V+W+S+I+ YI N    KA  L     +   +   FM ++VL A A ++ +E G  V A +V+AC+E 
Subjt:  SVSNGLIDFYGKCGEVECSGMVFDRMGE-RNAVSWSSLIAAYIQNNEEEKACCLFLQARKEDIKPIDFMVSSVLCACAGLSGIELGRSVQALAVKACVEE

Query:  NIFVGSALVDMYGKCGSIDEAERAFKEMPEKNLVSWNTLLGGYAHQGHADKAVALLEEMTSAAGMAPSYVSLVCALSACSRAGDLKRGMQIFESMKARYN
        ++ VGSALVDMY KCG +D A R F  MP +N  SWN+++ GYA  G  ++A+ L E M       P +V+ V  LSACS AG L+ G + FESM   Y 
Subjt:  NIFVGSALVDMYGKCGSIDEAERAFKEMPEKNLVSWNTLLGGYAHQGHADKAVALLEEMTSAAGMAPSYVSLVCALSACSRAGDLKRGMQIFESMKARYN

Query:  VEPGPEHYASLVDLLGRAGMVECAYDFIKNMPFSPTISIWGALLGA-CRMHG-KPELGKLAAEKLFELDPKDSGNHVVLSNMFAATG
        + P  EH++ + D+LGRAG ++   DFI+ MP  P + IW  +LGA CR +G K ELGK AAE LF+L+P+++ N+V+L NM+AA G
Subjt:  VEPGPEHYASLVDLLGRAGMVECAYDFIKNMPFSPTISIWGALLGA-CRMHG-KPELGKLAAEKLFELDPKDSGNHVVLSNMFAATG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
TTCCTCTACAACCACCTTGTGAATATGTACGCCAAACTCGATCATCCTGACTCAGCCGAACTCGTCCTCGGACTCGCCCCTTGCCGGTCCGTCGTCACTTGGACCGCCCT
CATCGCCGGCTCCGTCCAAAACGGCCATTTTGCTTCTGCTCTACTTTACTTCTCCCACATGCTAAGCGACTGTGTTCGCCCCAATGATTTCACCTTCCCTTGCGCTCTCA
AAGCTTCCACTTCCCTTCGCATGGCCATGTCGGGCAAACAGATACACGCACTTGCGGTTAAGGAGGGACTAATAAACGATGTCTTCGTTGGGTGCAGCACCTTCGACATG
TACGGTAAACTGGGTCTTCTCGAGGACGCATCCAAGGTGTTTGTTGAAATGCCTCACCGAAACCTCGAAACGTGGAATGCGTATATATCCAATTCCGTGCTCCATGGGCG
GCCTGAAGATTCCGTCATTGCATTTCTTGAGCTACTTCGTGCTGGTGGGAGCCCTGATTCCATAACATTCTGTGCTTTTCTCAATGCTTGTTCAGACAAACTGGGCTTGG
AGCCTGGATGTCAGCTACATGGATTCATAATTCGAAGTGGTTTTGAGCAGAATGTCTCTGTTTCAAATGGGTTGATTGATTTTTATGGGAAATGTGGGGAGGTCGAATGT
TCCGGGATGGTTTTCGACAGAATGGGAGAGCGGAATGCCGTCTCGTGGTCCTCTCTGATAGCTGCTTACATTCAAAACAACGAGGAAGAGAAGGCTTGCTGCTTATTCTT
GCAAGCGAGGAAAGAAGACATCAAACCAATTGATTTTATGGTATCGAGTGTGCTTTGTGCCTGTGCTGGCCTTTCAGGAATCGAGTTGGGGAGGTCAGTTCAAGCGCTAG
CGGTCAAGGCTTGTGTAGAGGAGAACATCTTTGTTGGGAGTGCACTGGTTGACATGTATGGAAAATGTGGAAGCATTGATGAAGCAGAGCGAGCCTTCAAGGAGATGCCA
GAGAAAAATTTGGTGTCTTGGAATACACTGTTGGGCGGATATGCACACCAAGGACACGCAGACAAGGCCGTGGCATTGCTCGAGGAGATGACATCGGCAGCAGGCATGGC
ACCGAGTTACGTGAGTTTGGTCTGTGCATTATCAGCTTGCAGTAGAGCAGGAGATTTGAAGAGGGGGATGCAGATTTTTGAGTCCATGAAAGCAAGGTACAATGTAGAGC
CAGGGCCAGAGCATTACGCTAGCTTGGTGGACTTGCTTGGGCGTGCTGGAATGGTGGAGTGTGCGTATGATTTTATAAAGAACATGCCATTCTCTCCAACAATCTCAATC
TGGGGTGCTCTGTTAGGGGCTTGTCGAATGCATGGGAAGCCAGAGTTGGGAAAGTTAGCCGCTGAGAAGCTGTTTGAACTTGATCCAAAAGACTCTGGAAACCACGTTGT
GCTGTCCAATATGTTTGCTGCAACTGGC
mRNA sequenceShow/hide mRNA sequence
TTCCTCTACAACCACCTTGTGAATATGTACGCCAAACTCGATCATCCTGACTCAGCCGAACTCGTCCTCGGACTCGCCCCTTGCCGGTCCGTCGTCACTTGGACCGCCCT
CATCGCCGGCTCCGTCCAAAACGGCCATTTTGCTTCTGCTCTACTTTACTTCTCCCACATGCTAAGCGACTGTGTTCGCCCCAATGATTTCACCTTCCCTTGCGCTCTCA
AAGCTTCCACTTCCCTTCGCATGGCCATGTCGGGCAAACAGATACACGCACTTGCGGTTAAGGAGGGACTAATAAACGATGTCTTCGTTGGGTGCAGCACCTTCGACATG
TACGGTAAACTGGGTCTTCTCGAGGACGCATCCAAGGTGTTTGTTGAAATGCCTCACCGAAACCTCGAAACGTGGAATGCGTATATATCCAATTCCGTGCTCCATGGGCG
GCCTGAAGATTCCGTCATTGCATTTCTTGAGCTACTTCGTGCTGGTGGGAGCCCTGATTCCATAACATTCTGTGCTTTTCTCAATGCTTGTTCAGACAAACTGGGCTTGG
AGCCTGGATGTCAGCTACATGGATTCATAATTCGAAGTGGTTTTGAGCAGAATGTCTCTGTTTCAAATGGGTTGATTGATTTTTATGGGAAATGTGGGGAGGTCGAATGT
TCCGGGATGGTTTTCGACAGAATGGGAGAGCGGAATGCCGTCTCGTGGTCCTCTCTGATAGCTGCTTACATTCAAAACAACGAGGAAGAGAAGGCTTGCTGCTTATTCTT
GCAAGCGAGGAAAGAAGACATCAAACCAATTGATTTTATGGTATCGAGTGTGCTTTGTGCCTGTGCTGGCCTTTCAGGAATCGAGTTGGGGAGGTCAGTTCAAGCGCTAG
CGGTCAAGGCTTGTGTAGAGGAGAACATCTTTGTTGGGAGTGCACTGGTTGACATGTATGGAAAATGTGGAAGCATTGATGAAGCAGAGCGAGCCTTCAAGGAGATGCCA
GAGAAAAATTTGGTGTCTTGGAATACACTGTTGGGCGGATATGCACACCAAGGACACGCAGACAAGGCCGTGGCATTGCTCGAGGAGATGACATCGGCAGCAGGCATGGC
ACCGAGTTACGTGAGTTTGGTCTGTGCATTATCAGCTTGCAGTAGAGCAGGAGATTTGAAGAGGGGGATGCAGATTTTTGAGTCCATGAAAGCAAGGTACAATGTAGAGC
CAGGGCCAGAGCATTACGCTAGCTTGGTGGACTTGCTTGGGCGTGCTGGAATGGTGGAGTGTGCGTATGATTTTATAAAGAACATGCCATTCTCTCCAACAATCTCAATC
TGGGGTGCTCTGTTAGGGGCTTGTCGAATGCATGGGAAGCCAGAGTTGGGAAAGTTAGCCGCTGAGAAGCTGTTTGAACTTGATCCAAAAGACTCTGGAAACCACGTTGT
GCTGTCCAATATGTTTGCTGCAACTGGC
Protein sequenceShow/hide protein sequence
FLYNHLVNMYAKLDHPDSAELVLGLAPCRSVVTWTALIAGSVQNGHFASALLYFSHMLSDCVRPNDFTFPCALKASTSLRMAMSGKQIHALAVKEGLINDVFVGCSTFDM
YGKLGLLEDASKVFVEMPHRNLETWNAYISNSVLHGRPEDSVIAFLELLRAGGSPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGFEQNVSVSNGLIDFYGKCGEVEC
SGMVFDRMGERNAVSWSSLIAAYIQNNEEEKACCLFLQARKEDIKPIDFMVSSVLCACAGLSGIELGRSVQALAVKACVEENIFVGSALVDMYGKCGSIDEAERAFKEMP
EKNLVSWNTLLGGYAHQGHADKAVALLEEMTSAAGMAPSYVSLVCALSACSRAGDLKRGMQIFESMKARYNVEPGPEHYASLVDLLGRAGMVECAYDFIKNMPFSPTISI
WGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATG