CuGenDBv2

Lsi04G016060 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi04G016060
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: chr04:23562641..23567462
RNA-Seq Expression: Lsi04G016060
Synteny: Lsi04G016060
Gene Ontology terms:
  GO:0005789 - endoplasmic reticulum membrane (cellular component)
  GO:0005886 - plasma membrane (cellular component)
  GO:0016021 - integral component of membrane (cellular component)
  GO:0005515 - protein binding (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
  IPR002885 - Pentatricopeptide repeat
  IPR011990 - Tetratricopeptide-like helical domain superfamily
  IPR019164 - Transmembrane protein 147
  IPR032867 - DYW domain


Homology
GenBank top hits (hit, e value, % identity, alignment)
KAA0057172.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] | e value: 0.0e+00 | % identity: 87.29
Query:  MYELGAIASKISNISQLRQLHGHFVLNSLHSHNYWVSVLLIICTRLHAHPAYAASIFTSSPSPNASVYSCMLKYYSRMGAHNEVVSLFRCMQSLDLRPQS
        MYEL A+ASKISNI QLR  HGH V NSLHSHNYWVS+LL+IC RLHAHPAY  SIFTSSPSP+ASVYSCMLKYYSRMGAHN+VVSLF+CM SLDLRPQ 
Subjt:  MYELGAIASKISNISQLRQLHGHFVLNSLHSHNYWVSVLLIICTRLHAHPAYAASIFTSSPSPNASVYSCMLKYYSRMGAHNEVVSLFRCMQSLDLRPQS

Query:  FVYIYLIKLAGKYGNIFHANVMKLGHVDDHFIRNAILDMYAKYGQVDLARKLFEQMAERTLADWNSMISGCWKSGNETDAVMLFNMMPARNIITWTAMVT
        FVYIYLIK AGK GN+FHA V+KLGH+DD FIRNAILDMY K GQVDLARKLFEQMAE+TL DWNSMISGCWKSGNET+AVMLFNMMPARNIITWT+MVT
Subjt:  FVYIYLIKLAGKYGNIFHANVMKLGHVDDHFIRNAILDMYAKYGQVDLARKLFEQMAERTLADWNSMISGCWKSGNETDAVMLFNMMPARNIITWTAMVT

Query:  GYAKMGDLDSARRYFDEMPERSVVSWNAMLSAYAQNECAEEALKLFHQMLKEGITPDDTTWVATISSCSSIGNPTLADAILRKINQKHSILNNFAKTALL
        GYAK+GDL+SARRYFDEMPERSVVSWNAM SAYAQ EC +EALKLFHQMLKEGITPDDTTW  TISSCSSIG+PTLAD+ILR INQKH +LN+F +TALL
Subjt:  GYAKMGDLDSARRYFDEMPERSVVSWNAMLSAYAQNECAEEALKLFHQMLKEGITPDDTTWVATISSCSSIGNPTLADAILRKINQKHSILNNFAKTALL

Query:  DMHAKFGNLEIARNIFDELGGQRNAVTWNIMISAYTRVGKVSLARELFDNMPKRDVVSWNSMIAGYAQNGESAKSIELFQEMICCLDIQPDEVTIASVLS
        DMHAKFGNLEIARNIFDELG QRN V WN+MISAYTRVGK+SLARELFDNMPKRDVVSWNSMIAGYAQNGE+A SIELF+EMI C DIQPDEVTIASVLS
Subjt:  DMHAKFGNLEIARNIFDELGGQRNAVTWNIMISAYTRVGKVSLARELFDNMPKRDVVSWNSMIAGYAQNGESAKSIELFQEMICCLDIQPDEVTIASVLS

Query:  ACGHIGALKLSNWVLDIVREKNIKLGISGFNSLIFLYSKCGCVPDAHRIFQTMEKRDVVSFNTLISGLAANGHGKEAIKLVLTMEEEGIEPDHVTYIGVL
        ACGHIGALKL  WVLDIVREKNIKLGISGFNSLIF+YSKCG V DAHRIFQTME RDVVSFNTLISG AANGHGKEAIKLVLTMEEEGIEPDHVTYIGVL
Subjt:  ACGHIGALKLSNWVLDIVREKNIKLGISGFNSLIFLYSKCGCVPDAHRIFQTMEKRDVVSFNTLISGLAANGHGKEAIKLVLTMEEEGIEPDHVTYIGVL

Query:  TACSHAGLLKEGKNVFKSIKAPAVDHYACMVDLLGRAGELDEAKILIQSMPMEPHAGVYGSLLNASRIHKRVELGELAANKLFELEPENPGNYVLLSNIY
        TACSHAGLL EGKNVFKSIKAP VDHYACMVDLLGRAGELDEAK+LIQSMPM+PH GVYGSLLNASRIHKRV LGELAA+KLFELEP+NPGNYVLLSNIY
Subjt:  TACSHAGLLKEGKNVFKSIKAPAVDHYACMVDLLGRAGELDEAKILIQSMPMEPHAGVYGSLLNASRIHKRVELGELAANKLFELEPENPGNYVLLSNIY

Query:  ASAGRWEDVKKVREKMRNGGVKKSVGMSWVEYRGQVHKFIVGDRSHELSKDIYRLLAELERKMKRVGFVADKSCALRDVEEEEKEEMLGTHSEKLAICFA
        AS+GRWEDVK+VRE MR  G++K VGMSWVEY+GQVHKFIVGDRSHE SKDIY+LLAELERKMKRVGFVADKSCALRDVEEEEKEEMLGTHSEKLAICFA
Subjt:  ASAGRWEDVKKVREKMRNGGVKKSVGMSWVEYRGQVHKFIVGDRSHELSKDIYRLLAELERKMKRVGFVADKSCALRDVEEEEKEEMLGTHSEKLAICFA

Query:  LLISEVGTPIRV-----LC------------------------RSEYDTLGTSVKAALVYLGTALVKLVCLATFLNVSENDSFDLYQELLKALIGLIDVA
        LLISEVGTPIRV     +C                        RSEYDTLGTSVKAALVYLGTALVKLVCLATFLNVSENDSFD YQELLKALIG IDVA
Subjt:  LLISEVGTPIRV-----LC------------------------RSEYDTLGTSVKAALVYLGTALVKLVCLATFLNVSENDSFDLYQELLKALIGLIDVA

Query:  GLYFALTQLTYRNISQNHKFQAVGLGWAFADSVLHRLAPLWVGARGLEFTWDYILQGLEANANLVLSISLAALGSLMWLRKNKPKTLIPIIYICALIVAT
        GLYFALTQLTYRNISQNHKFQAVGLGWAFADSVLHRLAPLWVGARGLEFTWDYILQGLEANANLVLSISLAALGSLMWLRKNKPKTLIPIIY+CALIVAT
Subjt:  GLYFALTQLTYRNISQNHKFQAVGLGWAFADSVLHRLAPLWVGARGLEFTWDYILQGLEANANLVLSISLAALGSLMWLRKNKPKTLIPIIYICALIVAT

Query:  MPSIT
        MPSIT
Subjt:  MPSIT

KAG6608509.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia] | e value: 0.0e+00 | % identity: 86.45
Query:  MYELGAIASKISNISQLRQLHGHFVLNSLHSHNYWVSVLLIICTRLHAHPAYAASIFTSSPSPNASVYSCMLKYYSRMGAHNEVVSLFRCMQSLDLRPQS
        MY+L AIA+KISNISQLRQLH H VLNSL S NYWVS+LL ICTRLHAHP+YAASIFTSSP PNASVYSCMLKYYSRMGAH+EVVSLFRCMQ LDLRP  
Subjt:  MYELGAIASKISNISQLRQLHGHFVLNSLHSHNYWVSVLLIICTRLHAHPAYAASIFTSSPSPNASVYSCMLKYYSRMGAHNEVVSLFRCMQSLDLRPQS

Query:  FVYIYLIKLAGKYGNIFHANVMKLGHVDDHFIRNAILDMYAKYGQVDLARKLFEQMAERTLADWNSMISGCWKSGNETDAVMLFNMMPARNIITWTAMVT
         VYIYLIKLAGK GN+FHA+V+KLGH+DDHFIRNA+LDMYAKYGQVDLARKLFEQM  RTLADWNSMISGCW SGNE DAVMLFNMMP RN I+WTAMVT
Subjt:  FVYIYLIKLAGKYGNIFHANVMKLGHVDDHFIRNAILDMYAKYGQVDLARKLFEQMAERTLADWNSMISGCWKSGNETDAVMLFNMMPARNIITWTAMVT

Query:  GYAKMGDLDSARRYFDEMPERSVVSWNAMLSAYAQNECAEEALKLFHQMLKEGITPDDTTWVATISSCSSIGNPTLADAILRKINQKHSILNNFAKTALL
        GYAK  DL+SARRYFDEMPE+SVVSWNAMLSAYAQNECAEEALKLFH+MLKEGITPDDTTWVA ISSCSSIGNP LAD++L KINQKH ILNN+ KTALL
Subjt:  GYAKMGDLDSARRYFDEMPERSVVSWNAMLSAYAQNECAEEALKLFHQMLKEGITPDDTTWVATISSCSSIGNPTLADAILRKINQKHSILNNFAKTALL

Query:  DMHAKFGNLEIARNIFDELGGQRNAVTWNIMISAYTRVGKVSLARELFDNMPKRDVVSWNSMIAGYAQNGESAKSIELFQEMICCLDIQPDEVTIASVLS
        DMHAKFGNLEIAR IFDELGGQRNAVTWN+MISAYTR GK+SLARELFDNMPKRDVVSWNSMIAGYAQNGESA SI LF+EMI C DIQPDEVTIASVLS
Subjt:  DMHAKFGNLEIARNIFDELGGQRNAVTWNIMISAYTRVGKVSLARELFDNMPKRDVVSWNSMIAGYAQNGESAKSIELFQEMICCLDIQPDEVTIASVLS

Query:  ACGHIGALKLSNWVLDIVREKNIKLGISGFNSLIFLYSKCGCVPDAHRIFQTMEKRDVVSFNTLISGLAANGHGKEAIKLVLTMEEEGIEPDHVTYIGVL
        ACGHIGALK S WVL+IV+EKNIK GISGFNSLIF+YSKCG V DAHRIFQ M  +DVV+FNTLISG AANGHGK+AIKL+LTMEEEGIEPDHVTYIGVL
Subjt:  ACGHIGALKLSNWVLDIVREKNIKLGISGFNSLIFLYSKCGCVPDAHRIFQTMEKRDVVSFNTLISGLAANGHGKEAIKLVLTMEEEGIEPDHVTYIGVL

Query:  TACSHAGLLKEGKNVFKSIKAPAVDHYACMVDLLGRAGELDEAKILIQSMPMEPHAGVYGSLLNASRIHKRVELGELAANKLFELEPENPGNYVLLSNIY
        TACSHAG+LKEGKN+FKSIKAP VDHYACMVDLLGRAGELDEAKILI+SMPM+PHAGVYGSLLN SRIHKRVELGELAANKL ELEP+NPGNY+LLSNIY
Subjt:  TACSHAGLLKEGKNVFKSIKAPAVDHYACMVDLLGRAGELDEAKILIQSMPMEPHAGVYGSLLNASRIHKRVELGELAANKLFELEPENPGNYVLLSNIY

Query:  ASAGRWEDVKKVREKMRNGGVKKSVGMSWVEYRGQVHKFIVGDRSHELSKDIYRLLAELERKMKRVGFVADKSCALRDVEEEEKEEMLGTHSEKLAICFA
        ASAGRWEDV++VREKMR GGVKKSVGMSWVEY+GQ+H F VGDRSHE SKDIYRLLAELERKMKRVGFV DKSCALRDVEEEEKEEMLGTHSEKLAICFA
Subjt:  ASAGRWEDVKKVREKMRNGGVKKSVGMSWVEYRGQVHKFIVGDRSHELSKDIYRLLAELERKMKRVGFVADKSCALRDVEEEEKEEMLGTHSEKLAICFA

Query:  LLISEVGTPIRVLCR--------------------------------SEYDTLGTSVKAALVYLGTALVKLVCLATFLNVSENDSFDLYQELLKALIGLI
        LL+SEVGTPIRV+                                  SEYDTLGTSVKAALVYLGTALVKLVCLATFLNVSENDSFDLYQELLKALIGLI
Subjt:  LLISEVGTPIRVLCR--------------------------------SEYDTLGTSVKAALVYLGTALVKLVCLATFLNVSENDSFDLYQELLKALIGLI

Query:  DVAGLYFALTQLTYRNISQNHKFQAVGLGWAFADSVLHRLAPLWVGARGLEFTWDYILQGLEANANLVLSISLAALGSLMWLRKNKPKTLIPIIYICALI
        DVAGLYFALTQLTYRNISQNHKFQAVGLGWAFADSVLHRLAPLW+GARGLEFTWDYILQGLEANANLVLSISLAALGSLMWLRKNKPK LIPIIYICALI
Subjt:  DVAGLYFALTQLTYRNISQNHKFQAVGLGWAFADSVLHRLAPLWVGARGLEFTWDYILQGLEANANLVLSISLAALGSLMWLRKNKPKTLIPIIYICALI

Query:  VATMPSIT
        VATMPSIT
Subjt:  VATMPSIT

XP_004147828.2 pentatricopeptide repeat-containing protein At1g14470, partial [Cucumis sativus] | e value: 0.0e+00 | % identity: 89.2
Query:  MYELGAIASKISNISQLRQLHGHFVLNSLHSHNYWVSVLLIICTRLHAHPAYAASIFTSSPSPNASVYSCMLKYYSRMGAHNEVVSLFRCMQSLDLRPQS
        MYEL A+ASKISNI QLRQ HGH V NSLHSHNYWVS+LLI CTRLHAHPAY  SIFTSSPSP+ASVYSCMLKYYSRMGAHN+VVSLF+C  SL+LRPQ 
Subjt:  MYELGAIASKISNISQLRQLHGHFVLNSLHSHNYWVSVLLIICTRLHAHPAYAASIFTSSPSPNASVYSCMLKYYSRMGAHNEVVSLFRCMQSLDLRPQS

Query:  FVYIYLIKLAGKYGNIFHANVMKLGHVDDHFIRNAILDMYAKYGQVDLARKLFEQMAERTLADWNSMISGCWKSGNETDAVMLFNMMPARNIITWTAMVT
        FVYIYLIKLAGK GN+FHA V+KLGH+DDHFIRNAILDMYAK GQVDLAR LFEQMAERTLADWNSMISGCWKSGNET+AV+LFNMMPARNIITWT+MVT
Subjt:  FVYIYLIKLAGKYGNIFHANVMKLGHVDDHFIRNAILDMYAKYGQVDLARKLFEQMAERTLADWNSMISGCWKSGNETDAVMLFNMMPARNIITWTAMVT

Query:  GYAKMGDLDSARRYFDEMPERSVVSWNAMLSAYAQNECAEEALKLFHQMLKEGITPDDTTWVATISSCSSIGNPTLADAILRKINQKHSILNNFAKTALL
        GYAKMGDL+SARRYFDEMPERSVVSWNAM SAYAQ EC +EAL LFHQML+EGITPDDTTWV TISSCSSIG+PTLAD+ILR I+QKH +LN+F KTALL
Subjt:  GYAKMGDLDSARRYFDEMPERSVVSWNAMLSAYAQNECAEEALKLFHQMLKEGITPDDTTWVATISSCSSIGNPTLADAILRKINQKHSILNNFAKTALL

Query:  DMHAKFGNLEIARNIFDELGGQRNAVTWNIMISAYTRVGKVSLARELFDNMPKRDVVSWNSMIAGYAQNGESAKSIELFQEMICCLDIQPDEVTIASVLS
        DMHAKFGNLEIARNIFDELG QRNAVTWNIMISAYTRVGK+SLARELFDNMPKRDVVSWNSMIAGYAQNGESA SIELF+EMI C+DIQPDEVTIASVLS
Subjt:  DMHAKFGNLEIARNIFDELGGQRNAVTWNIMISAYTRVGKVSLARELFDNMPKRDVVSWNSMIAGYAQNGESAKSIELFQEMICCLDIQPDEVTIASVLS

Query:  ACGHIGALKLSNWVLDIVREKNIKLGISGFNSLIFLYSKCGCVPDAHRIFQTMEKRDVVSFNTLISGLAANGHGKEAIKLVLTMEEEGIEPDHVTYIGVL
        ACGHIGALKLS WVLDIVREKNIKLGISGFNSLIF+YSKCG V DAHRIFQTM  RDVVSFNTLISG AANGHGKEAIKLVLTMEEEGIEPDHVTYIGVL
Subjt:  ACGHIGALKLSNWVLDIVREKNIKLGISGFNSLIFLYSKCGCVPDAHRIFQTMEKRDVVSFNTLISGLAANGHGKEAIKLVLTMEEEGIEPDHVTYIGVL

Query:  TACSHAGLLKEGKNVFKSIKAPAVDHYACMVDLLGRAGELDEAKILIQSMPMEPHAGVYGSLLNASRIHKRVELGELAANKLFELEPENPGNYVLLSNIY
        TACSHAGLL EGKNVFKSI+AP VDHYACMVDLLGRAGELDEAK+LIQSMPM+PHAGVYGSLLNASRIHKRV LGELAA+KLFELEP+N GNYVLLSNIY
Subjt:  TACSHAGLLKEGKNVFKSIKAPAVDHYACMVDLLGRAGELDEAKILIQSMPMEPHAGVYGSLLNASRIHKRVELGELAANKLFELEPENPGNYVLLSNIY

Query:  ASAGRWEDVKKVREKMRNGGVKKSVGMSWVEYRGQVHKFIVGDRSHELSKDIYRLLAELERKMKRVGFVADKSCALRDVEEEEKEEMLGTHSEKLAICFA
        AS GRWEDVK+VRE M+ GG+KKSVGMSWVEY+GQVHKF VGDRSHE SKDIY+LLAELERKMKRVGFVADKSCALRDVEEEEKEEMLGTHSEKLAICFA
Subjt:  ASAGRWEDVKKVREKMRNGGVKKSVGMSWVEYRGQVHKFIVGDRSHELSKDIYRLLAELERKMKRVGFVADKSCALRDVEEEEKEEMLGTHSEKLAICFA

Query:  LLISEVGTPIRVL
        LLISEVGT IRV+
Subjt:  LLISEVGTPIRVL

XP_008454670.1 PREDICTED: pentatricopeptide repeat-containing protein At1g14470 [Cucumis melo] | e value: 0.0e+00 | % identity: 88.36
Query:  MYELGAIASKISNISQLRQLHGHFVLNSLHSHNYWVSVLLIICTRLHAHPAYAASIFTSSPSPNASVYSCMLKYYSRMGAHNEVVSLFRCMQSLDLRPQS
        MYEL A+ASKISNI QLR  HGH V NSLHSHNYWVS+LL+IC RLHAHPAY  SIFTSSPSP+ASVYSCMLKYYSRMGAHN+VVSLF+CM SLDLRPQ 
Subjt:  MYELGAIASKISNISQLRQLHGHFVLNSLHSHNYWVSVLLIICTRLHAHPAYAASIFTSSPSPNASVYSCMLKYYSRMGAHNEVVSLFRCMQSLDLRPQS

Query:  FVYIYLIKLAGKYGNIFHANVMKLGHVDDHFIRNAILDMYAKYGQVDLARKLFEQMAERTLADWNSMISGCWKSGNETDAVMLFNMMPARNIITWTAMVT
        FVYIYLIK AGK GN+FHA V+KLGH+DD FIRNAILDMY K GQVDLARKLFEQMAE+TL DWNSMISGCWKSGNET+AVMLFNMMPARNIITWT+MVT
Subjt:  FVYIYLIKLAGKYGNIFHANVMKLGHVDDHFIRNAILDMYAKYGQVDLARKLFEQMAERTLADWNSMISGCWKSGNETDAVMLFNMMPARNIITWTAMVT

Query:  GYAKMGDLDSARRYFDEMPERSVVSWNAMLSAYAQNECAEEALKLFHQMLKEGITPDDTTWVATISSCSSIGNPTLADAILRKINQKHSILNNFAKTALL
        GYAK+GDL+SARRYFDEMPERSVVSWNAM SAYAQ EC +EALKLFHQMLKEGITPDDTTW  TISSCSSIG+PTLAD+ILR INQKH +LN+F +TALL
Subjt:  GYAKMGDLDSARRYFDEMPERSVVSWNAMLSAYAQNECAEEALKLFHQMLKEGITPDDTTWVATISSCSSIGNPTLADAILRKINQKHSILNNFAKTALL

Query:  DMHAKFGNLEIARNIFDELGGQRNAVTWNIMISAYTRVGKVSLARELFDNMPKRDVVSWNSMIAGYAQNGESAKSIELFQEMICCLDIQPDEVTIASVLS
        DMHAKFGNLEIARNIFDELG QRN V WN+MISAYTRVGK+SLARELFDNMPKRDVVSWNSMIAGYAQNGE+A SIELF+EMI C DIQPDEVTIASVLS
Subjt:  DMHAKFGNLEIARNIFDELGGQRNAVTWNIMISAYTRVGKVSLARELFDNMPKRDVVSWNSMIAGYAQNGESAKSIELFQEMICCLDIQPDEVTIASVLS

Query:  ACGHIGALKLSNWVLDIVREKNIKLGISGFNSLIFLYSKCGCVPDAHRIFQTMEKRDVVSFNTLISGLAANGHGKEAIKLVLTMEEEGIEPDHVTYIGVL
        ACGHIGALKL  WVLDIVREKNIKLGISGFNSLIF+YSKCG V DAHRIFQTME RDVVSFNTLISG AANGHGKEAIKLVLTMEEEGIEPDHVTYIGVL
Subjt:  ACGHIGALKLSNWVLDIVREKNIKLGISGFNSLIFLYSKCGCVPDAHRIFQTMEKRDVVSFNTLISGLAANGHGKEAIKLVLTMEEEGIEPDHVTYIGVL

Query:  TACSHAGLLKEGKNVFKSIKAPAVDHYACMVDLLGRAGELDEAKILIQSMPMEPHAGVYGSLLNASRIHKRVELGELAANKLFELEPENPGNYVLLSNIY
        TACSHAGLL EGKNVFKSIKAP VDHYACMVDLLGRAGELDEAK+LIQSMPM+PH GVYGSLLNASRIHKRV LGELAA+KLFELEP+NPGNYVLLSNIY
Subjt:  TACSHAGLLKEGKNVFKSIKAPAVDHYACMVDLLGRAGELDEAKILIQSMPMEPHAGVYGSLLNASRIHKRVELGELAANKLFELEPENPGNYVLLSNIY

Query:  ASAGRWEDVKKVREKMRNGGVKKSVGMSWVEYRGQVHKFIVGDRSHELSKDIYRLLAELERKMKRVGFVADKSCALRDVEEEEKEEMLGTHSEKLAICFA
        AS+GRWEDVK+VRE MR  G++K VGMSWVEY+GQVHKFIVGDRSHE SKDIY+LLAELERKMKRVGFVADKSCALRDVEEEEKEEMLGTHSEKLAICFA
Subjt:  ASAGRWEDVKKVREKMRNGGVKKSVGMSWVEYRGQVHKFIVGDRSHELSKDIYRLLAELERKMKRVGFVADKSCALRDVEEEEKEEMLGTHSEKLAICFA

Query:  LLISEVGTPIRVL
        LLISEVGTPIRV+
Subjt:  LLISEVGTPIRVL

XP_038898615.1 pentatricopeptide repeat-containing protein At1g14470 [Benincasa hispida] | e value: 0.0e+00 | % identity: 91.87
Query:  MYELGAIASKISNISQLRQLHGHFVLNSLHSHNYWVSVLLIICTRLHAHPAYAASIFTSSPSPNASVYSCMLKYYSRMGAHNEVVSLFRCMQSLDLRPQS
        M ELGAIASKISNI QLRQLHGH VLNSLHSHNYWVS+LLI CTRLHAHPAY ASIFTSSPSPN SVYSCMLKYYSRMGAHNEVVSLFRCMQSLDLRPQ 
Subjt:  MYELGAIASKISNISQLRQLHGHFVLNSLHSHNYWVSVLLIICTRLHAHPAYAASIFTSSPSPNASVYSCMLKYYSRMGAHNEVVSLFRCMQSLDLRPQS

Query:  FVYIYLIKLAGKYGNIFHANVMKLGHVDDHFIRNAILDMYAKYGQVDLARKLFEQMAERTLADWNSMISGCWKSGNETDAVMLFNMMPARNIITWTAMVT
        FVYIYLIKLAGK GN+FHA V+KLGHVDD FIRNAILDMYAKYGQVDLARKLF QMAERTLADWNSMISGCWKSGNETDAVMLFN MP RNIITWTAMVT
Subjt:  FVYIYLIKLAGKYGNIFHANVMKLGHVDDHFIRNAILDMYAKYGQVDLARKLFEQMAERTLADWNSMISGCWKSGNETDAVMLFNMMPARNIITWTAMVT

Query:  GYAKMGDLDSARRYFDEMPERSVVSWNAMLSAYAQNECAEEALKLFHQMLKEGITPDDTTWVATISSCSSIGNPTLADAILRKINQKHSILNNFAKTALL
        GYAKMGDL++ARRYFDEMPERSVVSWNA+LSAYAQN CAEEALKLFH+MLKEGITPDDTTWVATISSCSSI NPTLAD+ILRKINQ+H+ILN+F KTALL
Subjt:  GYAKMGDLDSARRYFDEMPERSVVSWNAMLSAYAQNECAEEALKLFHQMLKEGITPDDTTWVATISSCSSIGNPTLADAILRKINQKHSILNNFAKTALL

Query:  DMHAKFGNLEIARNIFDELGGQRNAVTWNIMISAYTRVGKVSLARELFDNMPKRDVVSWNSMIAGYAQNGESAKSIELFQEMICCLDIQPDEVTIASVLS
        DMHAKFGNLEIARNIFDELGGQRN VTWN+MISAY RVGK+SLA+ELFDNMPKRDVVSWNSMI GYAQNGESAKSIELF+EMI C DIQPDEVTIASVLS
Subjt:  DMHAKFGNLEIARNIFDELGGQRNAVTWNIMISAYTRVGKVSLARELFDNMPKRDVVSWNSMIAGYAQNGESAKSIELFQEMICCLDIQPDEVTIASVLS

Query:  ACGHIGALKLSNWVLDIVREKNIKLGISGFNSLIFLYSKCGCVPDAHRIFQTMEKRDVVSFNTLISGLAANGHGKEAIKLVLTMEEEGIEPDHVTYIGVL
        ACGHIGALKLS WVLDIVREKNIKLG+SGFNSLIF+YSKCG V DAHRIFQTME RDVVSFNTLISG AANGHGKEAIKLVLTMEEEGIEPDHVTYIGVL
Subjt:  ACGHIGALKLSNWVLDIVREKNIKLGISGFNSLIFLYSKCGCVPDAHRIFQTMEKRDVVSFNTLISGLAANGHGKEAIKLVLTMEEEGIEPDHVTYIGVL

Query:  TACSHAGLLKEGKNVFKSIKAPAVDHYACMVDLLGRAGELDEAKILIQSMPMEPHAGVYGSLLNASRIHKRVELGELAANKLFELEPENPGNYVLLSNIY
        TACSHAGLLKEGKNVFKSIKAP VDHYACMVDLLGRAGELDEAK+L+Q MPMEPHAGV+GSLLNASRIHKRVELGELAA+KLFELEP+NPGNYVLLSNIY
Subjt:  TACSHAGLLKEGKNVFKSIKAPAVDHYACMVDLLGRAGELDEAKILIQSMPMEPHAGVYGSLLNASRIHKRVELGELAANKLFELEPENPGNYVLLSNIY

Query:  ASAGRWEDVKKVREKMRNGGVKKSVGMSWVEYRGQVHKFIVGDRSHELSKDIYRLLAELERKMKRVGFVADKSCALRDVEEEEKEEMLGTHSEKLAICFA
        ASAGRWEDVKK+REKMR GGVKKSVGMSWVEY+GQVHKFIVGDRSHE SKDIYRLLAELERKMK  GFV DKSCALRDVEEEEKEEMLGTHSEKLAICFA
Subjt:  ASAGRWEDVKKVREKMRNGGVKKSVGMSWVEYRGQVHKFIVGDRSHELSKDIYRLLAELERKMKRVGFVADKSCALRDVEEEEKEEMLGTHSEKLAICFA

Query:  LLISEVGTPIRVL
        LLISEVGTPIRV+
Subjt:  LLISEVGTPIRVL

TrEMBL top hits (hit, e value, % identity, alignment)
A0A1S3BZ59 pentatricopeptide repeat-containing protein At1g14470 | e value: 0.0e+00 | % identity: 88.36
Query:  MYELGAIASKISNISQLRQLHGHFVLNSLHSHNYWVSVLLIICTRLHAHPAYAASIFTSSPSPNASVYSCMLKYYSRMGAHNEVVSLFRCMQSLDLRPQS
        MYEL A+ASKISNI QLR  HGH V NSLHSHNYWVS+LL+IC RLHAHPAY  SIFTSSPSP+ASVYSCMLKYYSRMGAHN+VVSLF+CM SLDLRPQ 
Subjt:  MYELGAIASKISNISQLRQLHGHFVLNSLHSHNYWVSVLLIICTRLHAHPAYAASIFTSSPSPNASVYSCMLKYYSRMGAHNEVVSLFRCMQSLDLRPQS

Query:  FVYIYLIKLAGKYGNIFHANVMKLGHVDDHFIRNAILDMYAKYGQVDLARKLFEQMAERTLADWNSMISGCWKSGNETDAVMLFNMMPARNIITWTAMVT
        FVYIYLIK AGK GN+FHA V+KLGH+DD FIRNAILDMY K GQVDLARKLFEQMAE+TL DWNSMISGCWKSGNET+AVMLFNMMPARNIITWT+MVT
Subjt:  FVYIYLIKLAGKYGNIFHANVMKLGHVDDHFIRNAILDMYAKYGQVDLARKLFEQMAERTLADWNSMISGCWKSGNETDAVMLFNMMPARNIITWTAMVT

Query:  GYAKMGDLDSARRYFDEMPERSVVSWNAMLSAYAQNECAEEALKLFHQMLKEGITPDDTTWVATISSCSSIGNPTLADAILRKINQKHSILNNFAKTALL
        GYAK+GDL+SARRYFDEMPERSVVSWNAM SAYAQ EC +EALKLFHQMLKEGITPDDTTW  TISSCSSIG+PTLAD+ILR INQKH +LN+F +TALL
Subjt:  GYAKMGDLDSARRYFDEMPERSVVSWNAMLSAYAQNECAEEALKLFHQMLKEGITPDDTTWVATISSCSSIGNPTLADAILRKINQKHSILNNFAKTALL

Query:  DMHAKFGNLEIARNIFDELGGQRNAVTWNIMISAYTRVGKVSLARELFDNMPKRDVVSWNSMIAGYAQNGESAKSIELFQEMICCLDIQPDEVTIASVLS
        DMHAKFGNLEIARNIFDELG QRN V WN+MISAYTRVGK+SLARELFDNMPKRDVVSWNSMIAGYAQNGE+A SIELF+EMI C DIQPDEVTIASVLS
Subjt:  DMHAKFGNLEIARNIFDELGGQRNAVTWNIMISAYTRVGKVSLARELFDNMPKRDVVSWNSMIAGYAQNGESAKSIELFQEMICCLDIQPDEVTIASVLS

Query:  ACGHIGALKLSNWVLDIVREKNIKLGISGFNSLIFLYSKCGCVPDAHRIFQTMEKRDVVSFNTLISGLAANGHGKEAIKLVLTMEEEGIEPDHVTYIGVL
        ACGHIGALKL  WVLDIVREKNIKLGISGFNSLIF+YSKCG V DAHRIFQTME RDVVSFNTLISG AANGHGKEAIKLVLTMEEEGIEPDHVTYIGVL
Subjt:  ACGHIGALKLSNWVLDIVREKNIKLGISGFNSLIFLYSKCGCVPDAHRIFQTMEKRDVVSFNTLISGLAANGHGKEAIKLVLTMEEEGIEPDHVTYIGVL

Query:  TACSHAGLLKEGKNVFKSIKAPAVDHYACMVDLLGRAGELDEAKILIQSMPMEPHAGVYGSLLNASRIHKRVELGELAANKLFELEPENPGNYVLLSNIY
        TACSHAGLL EGKNVFKSIKAP VDHYACMVDLLGRAGELDEAK+LIQSMPM+PH GVYGSLLNASRIHKRV LGELAA+KLFELEP+NPGNYVLLSNIY
Subjt:  TACSHAGLLKEGKNVFKSIKAPAVDHYACMVDLLGRAGELDEAKILIQSMPMEPHAGVYGSLLNASRIHKRVELGELAANKLFELEPENPGNYVLLSNIY

Query:  ASAGRWEDVKKVREKMRNGGVKKSVGMSWVEYRGQVHKFIVGDRSHELSKDIYRLLAELERKMKRVGFVADKSCALRDVEEEEKEEMLGTHSEKLAICFA
        AS+GRWEDVK+VRE MR  G++K VGMSWVEY+GQVHKFIVGDRSHE SKDIY+LLAELERKMKRVGFVADKSCALRDVEEEEKEEMLGTHSEKLAICFA
Subjt:  ASAGRWEDVKKVREKMRNGGVKKSVGMSWVEYRGQVHKFIVGDRSHELSKDIYRLLAELERKMKRVGFVADKSCALRDVEEEEKEEMLGTHSEKLAICFA

Query:  LLISEVGTPIRVL
        LLISEVGTPIRV+
Subjt:  LLISEVGTPIRVL

A0A5D3DV14 Transmembrane protein 147 | e value: 0.0e+00 | % identity: 87.29
Query:  MYELGAIASKISNISQLRQLHGHFVLNSLHSHNYWVSVLLIICTRLHAHPAYAASIFTSSPSPNASVYSCMLKYYSRMGAHNEVVSLFRCMQSLDLRPQS
        MYEL A+ASKISNI QLR  HGH V NSLHSHNYWVS+LL+IC RLHAHPAY  SIFTSSPSP+ASVYSCMLKYYSRMGAHN+VVSLF+CM SLDLRPQ 
Subjt:  MYELGAIASKISNISQLRQLHGHFVLNSLHSHNYWVSVLLIICTRLHAHPAYAASIFTSSPSPNASVYSCMLKYYSRMGAHNEVVSLFRCMQSLDLRPQS

Query:  FVYIYLIKLAGKYGNIFHANVMKLGHVDDHFIRNAILDMYAKYGQVDLARKLFEQMAERTLADWNSMISGCWKSGNETDAVMLFNMMPARNIITWTAMVT
        FVYIYLIK AGK GN+FHA V+KLGH+DD FIRNAILDMY K GQVDLARKLFEQMAE+TL DWNSMISGCWKSGNET+AVMLFNMMPARNIITWT+MVT
Subjt:  FVYIYLIKLAGKYGNIFHANVMKLGHVDDHFIRNAILDMYAKYGQVDLARKLFEQMAERTLADWNSMISGCWKSGNETDAVMLFNMMPARNIITWTAMVT

Query:  GYAKMGDLDSARRYFDEMPERSVVSWNAMLSAYAQNECAEEALKLFHQMLKEGITPDDTTWVATISSCSSIGNPTLADAILRKINQKHSILNNFAKTALL
        GYAK+GDL+SARRYFDEMPERSVVSWNAM SAYAQ EC +EALKLFHQMLKEGITPDDTTW  TISSCSSIG+PTLAD+ILR INQKH +LN+F +TALL
Subjt:  GYAKMGDLDSARRYFDEMPERSVVSWNAMLSAYAQNECAEEALKLFHQMLKEGITPDDTTWVATISSCSSIGNPTLADAILRKINQKHSILNNFAKTALL

Query:  DMHAKFGNLEIARNIFDELGGQRNAVTWNIMISAYTRVGKVSLARELFDNMPKRDVVSWNSMIAGYAQNGESAKSIELFQEMICCLDIQPDEVTIASVLS
        DMHAKFGNLEIARNIFDELG QRN V WN+MISAYTRVGK+SLARELFDNMPKRDVVSWNSMIAGYAQNGE+A SIELF+EMI C DIQPDEVTIASVLS
Subjt:  DMHAKFGNLEIARNIFDELGGQRNAVTWNIMISAYTRVGKVSLARELFDNMPKRDVVSWNSMIAGYAQNGESAKSIELFQEMICCLDIQPDEVTIASVLS

Query:  ACGHIGALKLSNWVLDIVREKNIKLGISGFNSLIFLYSKCGCVPDAHRIFQTMEKRDVVSFNTLISGLAANGHGKEAIKLVLTMEEEGIEPDHVTYIGVL
        ACGHIGALKL  WVLDIVREKNIKLGISGFNSLIF+YSKCG V DAHRIFQTME RDVVSFNTLISG AANGHGKEAIKLVLTMEEEGIEPDHVTYIGVL
Subjt:  ACGHIGALKLSNWVLDIVREKNIKLGISGFNSLIFLYSKCGCVPDAHRIFQTMEKRDVVSFNTLISGLAANGHGKEAIKLVLTMEEEGIEPDHVTYIGVL

Query:  TACSHAGLLKEGKNVFKSIKAPAVDHYACMVDLLGRAGELDEAKILIQSMPMEPHAGVYGSLLNASRIHKRVELGELAANKLFELEPENPGNYVLLSNIY
        TACSHAGLL EGKNVFKSIKAP VDHYACMVDLLGRAGELDEAK+LIQSMPM+PH GVYGSLLNASRIHKRV LGELAA+KLFELEP+NPGNYVLLSNIY
Subjt:  TACSHAGLLKEGKNVFKSIKAPAVDHYACMVDLLGRAGELDEAKILIQSMPMEPHAGVYGSLLNASRIHKRVELGELAANKLFELEPENPGNYVLLSNIY

Query:  ASAGRWEDVKKVREKMRNGGVKKSVGMSWVEYRGQVHKFIVGDRSHELSKDIYRLLAELERKMKRVGFVADKSCALRDVEEEEKEEMLGTHSEKLAICFA
        AS+GRWEDVK+VRE MR  G++K VGMSWVEY+GQVHKFIVGDRSHE SKDIY+LLAELERKMKRVGFVADKSCALRDVEEEEKEEMLGTHSEKLAICFA
Subjt:  ASAGRWEDVKKVREKMRNGGVKKSVGMSWVEYRGQVHKFIVGDRSHELSKDIYRLLAELERKMKRVGFVADKSCALRDVEEEEKEEMLGTHSEKLAICFA

Query:  LLISEVGTPIRV-----LC------------------------RSEYDTLGTSVKAALVYLGTALVKLVCLATFLNVSENDSFDLYQELLKALIGLIDVA
        LLISEVGTPIRV     +C                        RSEYDTLGTSVKAALVYLGTALVKLVCLATFLNVSENDSFD YQELLKALIG IDVA
Subjt:  LLISEVGTPIRV-----LC------------------------RSEYDTLGTSVKAALVYLGTALVKLVCLATFLNVSENDSFDLYQELLKALIGLIDVA

Query:  GLYFALTQLTYRNISQNHKFQAVGLGWAFADSVLHRLAPLWVGARGLEFTWDYILQGLEANANLVLSISLAALGSLMWLRKNKPKTLIPIIYICALIVAT
        GLYFALTQLTYRNISQNHKFQAVGLGWAFADSVLHRLAPLWVGARGLEFTWDYILQGLEANANLVLSISLAALGSLMWLRKNKPKTLIPIIY+CALIVAT
Subjt:  GLYFALTQLTYRNISQNHKFQAVGLGWAFADSVLHRLAPLWVGARGLEFTWDYILQGLEANANLVLSISLAALGSLMWLRKNKPKTLIPIIYICALIVAT

Query:  MPSIT
        MPSIT
Subjt:  MPSIT

A0A6J1BWI0 pentatricopeptide repeat-containing protein At1g14470-like isoform X1 | e value: 0.0e+00 | % identity: 88.22
Query:  MYELGAIASKISNISQLRQLHGHFVLNSLHSHNYWVSVLLIICTRLHAHPAYAASIFTSSPSPNASVYSCMLKYYSRMGAHNEVVSLFRCMQSLDLRPQS
        M +LGAI +KIS ISQLRQLH H VLNSLHSHNYWVS+L+ +CT L AHPAYAASIFTSS SP+ SVYSCMLKYYSRMGAHNEVVS+FRCMQ LDLRP  
Subjt:  MYELGAIASKISNISQLRQLHGHFVLNSLHSHNYWVSVLLIICTRLHAHPAYAASIFTSSPSPNASVYSCMLKYYSRMGAHNEVVSLFRCMQSLDLRPQS

Query:  FVYIYLIKLAGKYGNIFHANVMKLGHVDDHFIRNAILDMYAKYGQVDLARKLFEQMAERTLADWNSMISGCWKSGNETDAVMLFNMMPARNIITWTAMVT
        FVYIYLIKLAGK GN+FHA V+KLGH+DDHF+RNAILDMYAKYGQVDLARKLFEQMA+RTLADWNSMISG WKSG E DAVMLFNMMPARNIITWTAMVT
Subjt:  FVYIYLIKLAGKYGNIFHANVMKLGHVDDHFIRNAILDMYAKYGQVDLARKLFEQMAERTLADWNSMISGCWKSGNETDAVMLFNMMPARNIITWTAMVT

Query:  GYAKMGDLDSARRYFDEMPERSVVSWNAMLSAYAQNECAEEALKLFHQMLKEGITPDDTTWVATISSCSSIGNPTLADAILRKINQKHSILNNFAKTALL
        GYAKM DL+SARRYFD+MPERSVVSWNAMLSAYAQNECAEEALKLF QMLKEGI PDDTTWVA ISSCSS+GNP+LAD I+RKINQKH ILNNF KTALL
Subjt:  GYAKMGDLDSARRYFDEMPERSVVSWNAMLSAYAQNECAEEALKLFHQMLKEGITPDDTTWVATISSCSSIGNPTLADAILRKINQKHSILNNFAKTALL

Query:  DMHAKFGNLEIARNIFDELGGQRNAVTWNIMISAYTRVGKVSLARELFDNMPKRDVVSWNSMIAGYAQNGESAKSIELFQEMICCLDIQPDEVTIASVLS
        DMHAKFGNLEIAR IFDELGGQRNAVTWNIMISAYTR GK+ LARELFDNMPKRDVVSWNSMIAGYAQNGESA SIELF+EMI C DIQPDEVTIASVLS
Subjt:  DMHAKFGNLEIARNIFDELGGQRNAVTWNIMISAYTRVGKVSLARELFDNMPKRDVVSWNSMIAGYAQNGESAKSIELFQEMICCLDIQPDEVTIASVLS

Query:  ACGHIGALKLSNWVLDIVREKNIKLGISGFNSLIFLYSKCGCVPDAHRIFQTMEKRDVVSFNTLISGLAANGHGKEAIKLVLTMEEEGIEPDHVTYIGVL
        ACGHIGA KLS  V++IV+EKNIKLGIS FNSLIF+YSKCG +  AHRIFQ M  RDVV+FNTLISG AANG GKEAI+LVL MEEEGIEPDHVTYIGVL
Subjt:  ACGHIGALKLSNWVLDIVREKNIKLGISGFNSLIFLYSKCGCVPDAHRIFQTMEKRDVVSFNTLISGLAANGHGKEAIKLVLTMEEEGIEPDHVTYIGVL

Query:  TACSHAGLLKEGKNVFKSIKAPAVDHYACMVDLLGRAGELDEAKILIQSMPMEPHAGVYGSLLNASRIHKRVELGELAANKLFELEPENPGNYVLLSNIY
        TACSHAGLLKEGKN+FKSIKAP VDHYACMVDLLGRAGELDEAKILIQSMPM+PHAGVYGSLLNASRIHKRVELGELAANKLFELEP+NPGNYVLLSNIY
Subjt:  TACSHAGLLKEGKNVFKSIKAPAVDHYACMVDLLGRAGELDEAKILIQSMPMEPHAGVYGSLLNASRIHKRVELGELAANKLFELEPENPGNYVLLSNIY

Query:  ASAGRWEDVKKVREKMRNGGVKKSVGMSWVEYRGQVHKFIVGDRSHELSKDIYRLLAELERKMKRVGFVADKSCALRDVEEEEKEEMLGTHSEKLAICFA
        ASAGRWEDVK VREKM+  GVKKSVGMSWVEY+GQVHKFIVGDRSHE S+DIYRLLAELERKMKRVGFVADKSCALRDVEEEEKEEM+GTHSEKLAICFA
Subjt:  ASAGRWEDVKKVREKMRNGGVKKSVGMSWVEYRGQVHKFIVGDRSHELSKDIYRLLAELERKMKRVGFVADKSCALRDVEEEEKEEMLGTHSEKLAICFA

Query:  LLISEVGTPIRVL
        LL+SEVGTPIRV+
Subjt:  LLISEVGTPIRVL

A0A6J1FMC6 pentatricopeptide repeat-containing protein At1g14470 | e value: 0.0e+00 | % identity: 87.8
Query:  MYELGAIASKISNISQLRQLHGHFVLNSLHSHNYWVSVLLIICTRLHAHPAYAASIFTSSPSPNASVYSCMLKYYSRMGAHNEVVSLFRCMQSLDLRPQS
        MY+L AIA+KISNISQLRQLH H VLNSL S NYWVS+LL ICTRLHAHP+YAASIFTSSP PNASVYSCMLKYYSRMGAH+EVVSLFRCMQ LDLRP  
Subjt:  MYELGAIASKISNISQLRQLHGHFVLNSLHSHNYWVSVLLIICTRLHAHPAYAASIFTSSPSPNASVYSCMLKYYSRMGAHNEVVSLFRCMQSLDLRPQS

Query:  FVYIYLIKLAGKYGNIFHANVMKLGHVDDHFIRNAILDMYAKYGQVDLARKLFEQMAERTLADWNSMISGCWKSGNETDAVMLFNMMPARNIITWTAMVT
         VYIYLIKLAGK GN+FHA+V+KLGH+DDHFIRNA+LDMYAKYGQVDLARKLFEQM  RTLADWNSMISGCW SGNE DAVMLFNMMP RN I+WTAMVT
Subjt:  FVYIYLIKLAGKYGNIFHANVMKLGHVDDHFIRNAILDMYAKYGQVDLARKLFEQMAERTLADWNSMISGCWKSGNETDAVMLFNMMPARNIITWTAMVT

Query:  GYAKMGDLDSARRYFDEMPERSVVSWNAMLSAYAQNECAEEALKLFHQMLKEGITPDDTTWVATISSCSSIGNPTLADAILRKINQKHSILNNFAKTALL
        GYAK  DL+SARRYFDEMPE+SVVSWNAMLSAYAQNECAEEALKLFH+MLKEGITPDDTTWVA ISSCSSIGNP LAD++L KINQKH ILNN+ KTALL
Subjt:  GYAKMGDLDSARRYFDEMPERSVVSWNAMLSAYAQNECAEEALKLFHQMLKEGITPDDTTWVATISSCSSIGNPTLADAILRKINQKHSILNNFAKTALL

Query:  DMHAKFGNLEIARNIFDELGGQRNAVTWNIMISAYTRVGKVSLARELFDNMPKRDVVSWNSMIAGYAQNGESAKSIELFQEMICCLDIQPDEVTIASVLS
        DMHAKFGNLEIAR IFDELGGQRNAVTWN+MISAYTR GK+SLARELFDNMPKRDVVSWNSMIAGYAQNGESA SI LF+EMI C DIQPDEVTIASVLS
Subjt:  DMHAKFGNLEIARNIFDELGGQRNAVTWNIMISAYTRVGKVSLARELFDNMPKRDVVSWNSMIAGYAQNGESAKSIELFQEMICCLDIQPDEVTIASVLS

Query:  ACGHIGALKLSNWVLDIVREKNIKLGISGFNSLIFLYSKCGCVPDAHRIFQTMEKRDVVSFNTLISGLAANGHGKEAIKLVLTMEEEGIEPDHVTYIGVL
        ACGHIGALK S WVL+IV+EKNIK GISGFNSLIF+YSKCG V DAHRIFQ M  +DVV+FNTLISG AANGHGK+AIKL+LTMEEEGIEPDHVTYIGVL
Subjt:  ACGHIGALKLSNWVLDIVREKNIKLGISGFNSLIFLYSKCGCVPDAHRIFQTMEKRDVVSFNTLISGLAANGHGKEAIKLVLTMEEEGIEPDHVTYIGVL

Query:  TACSHAGLLKEGKNVFKSIKAPAVDHYACMVDLLGRAGELDEAKILIQSMPMEPHAGVYGSLLNASRIHKRVELGELAANKLFELEPENPGNYVLLSNIY
        TACSHAG+LKEGKN+FKSIKAP VDHYACMVDLLGRAGELDEAKILI+SMPM+PHAGVYGSLLN SRIHKRVELGELAANKL ELEP+NPGNY+LLSNIY
Subjt:  TACSHAGLLKEGKNVFKSIKAPAVDHYACMVDLLGRAGELDEAKILIQSMPMEPHAGVYGSLLNASRIHKRVELGELAANKLFELEPENPGNYVLLSNIY

Query:  ASAGRWEDVKKVREKMRNGGVKKSVGMSWVEYRGQVHKFIVGDRSHELSKDIYRLLAELERKMKRVGFVADKSCALRDVEEEEKEEMLGTHSEKLAICFA
        ASAGRWEDV++VREKMR GGVKKSVGMSWVEY+GQ+H F VGDRSHE SKDIYRLLAELERKMKRVGFV DKSCALRDVEEEEKEEMLGTHSEKLAICFA
Subjt:  ASAGRWEDVKKVREKMRNGGVKKSVGMSWVEYRGQVHKFIVGDRSHELSKDIYRLLAELERKMKRVGFVADKSCALRDVEEEEKEEMLGTHSEKLAICFA

Query:  LLISEVGTPIRVL
        LL+SEVGTPIRV+
Subjt:  LLISEVGTPIRVL

A0A6J1IYP6 pentatricopeptide repeat-containing protein At1g14470 | e value: 0.0e+00 | % identity: 87.8
Query:  MYELGAIASKISNISQLRQLHGHFVLNSLHSHNYWVSVLLIICTRLHAHPAYAASIFTSSPSPNASVYSCMLKYYSRMGAHNEVVSLFRCMQSLDLRPQS
        MY+L AIA+KISNISQLRQLH H VLNSL S NYWVS+LL ICTRLHAHPAYAASIFTSSP PNASVYSCMLKYYSRMGAH+EVVSLFRCMQ LDLRP  
Subjt:  MYELGAIASKISNISQLRQLHGHFVLNSLHSHNYWVSVLLIICTRLHAHPAYAASIFTSSPSPNASVYSCMLKYYSRMGAHNEVVSLFRCMQSLDLRPQS

Query:  FVYIYLIKLAGKYGNIFHANVMKLGHVDDHFIRNAILDMYAKYGQVDLARKLFEQMAERTLADWNSMISGCWKSGNETDAVMLFNMMPARNIITWTAMVT
         VYIYLIKLAGK GN+FHA+V+KLGH+DDHFIRN ILDMYAKYGQVDLARKLFEQM  RTLADWNSMISGCW SGNE DAVMLF+MMP RN I+WTAMVT
Subjt:  FVYIYLIKLAGKYGNIFHANVMKLGHVDDHFIRNAILDMYAKYGQVDLARKLFEQMAERTLADWNSMISGCWKSGNETDAVMLFNMMPARNIITWTAMVT

Query:  GYAKMGDLDSARRYFDEMPERSVVSWNAMLSAYAQNECAEEALKLFHQMLKEGITPDDTTWVATISSCSSIGNPTLADAILRKINQKHSILNNFAKTALL
        GYAKM DL++ARRYFDEMPE+SVVSWNAMLSAYAQNECAEEALKLFH+MLKEGITPDDTTWVA ISSCSSIGN +LAD++L KINQKH ILNNF KTALL
Subjt:  GYAKMGDLDSARRYFDEMPERSVVSWNAMLSAYAQNECAEEALKLFHQMLKEGITPDDTTWVATISSCSSIGNPTLADAILRKINQKHSILNNFAKTALL

Query:  DMHAKFGNLEIARNIFDELGGQRNAVTWNIMISAYTRVGKVSLARELFDNMPKRDVVSWNSMIAGYAQNGESAKSIELFQEMICCLDIQPDEVTIASVLS
        DMHAKFGNLEIAR IFDELGGQRNAVTWN+MISAYTR GK+SLARELFDNMPKRDVVSWNSMIAGYAQNGESA SI LF+EMI C DIQPDEVTIASVLS
Subjt:  DMHAKFGNLEIARNIFDELGGQRNAVTWNIMISAYTRVGKVSLARELFDNMPKRDVVSWNSMIAGYAQNGESAKSIELFQEMICCLDIQPDEVTIASVLS

Query:  ACGHIGALKLSNWVLDIVREKNIKLGISGFNSLIFLYSKCGCVPDAHRIFQTMEKRDVVSFNTLISGLAANGHGKEAIKLVLTMEEEGIEPDHVTYIGVL
        ACGHIGALK S WVL+IV+EKNIK GISGFNSLIF+YSKCG V DAHRIFQ M  +DVV+FNTLISG AANGHGK+AIKL+LTMEEEGIEPDHVTYIGVL
Subjt:  ACGHIGALKLSNWVLDIVREKNIKLGISGFNSLIFLYSKCGCVPDAHRIFQTMEKRDVVSFNTLISGLAANGHGKEAIKLVLTMEEEGIEPDHVTYIGVL

Query:  TACSHAGLLKEGKNVFKSIKAPAVDHYACMVDLLGRAGELDEAKILIQSMPMEPHAGVYGSLLNASRIHKRVELGELAANKLFELEPENPGNYVLLSNIY
        TACSHAG+LKEGKN+FKSIKAP VDHYACMVDLLGRAGELD+AKILI+SMPM+PHAGVYGSLLNASRIHKRVELGELAANKL ELEP+NPGNY+LLSNIY
Subjt:  TACSHAGLLKEGKNVFKSIKAPAVDHYACMVDLLGRAGELDEAKILIQSMPMEPHAGVYGSLLNASRIHKRVELGELAANKLFELEPENPGNYVLLSNIY

Query:  ASAGRWEDVKKVREKMRNGGVKKSVGMSWVEYRGQVHKFIVGDRSHELSKDIYRLLAELERKMKRVGFVADKSCALRDVEEEEKEEMLGTHSEKLAICFA
        ASAGRWEDV++VREKMR GGVKKSVGMSWVEY+GQ+H F VGDRSHE SKDIYRLLAELERKMKRVGFV DKSCALRDVEEEEKEEMLGTHSEKLAICFA
Subjt:  ASAGRWEDVKKVREKMRNGGVKKSVGMSWVEYRGQVHKFIVGDRSHELSKDIYRLLAELERKMKRVGFVADKSCALRDVEEEEKEEMLGTHSEKLAICFA

Query:  LLISEVGTPIRVL
        LL+SEVGTPIRV+
Subjt:  LLISEVGTPIRVL

SwissProt top hits (hit, e value, % identity, alignment)
Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic | e value: 3.6e-124 | % identity: 34.4
Query:  ISQLRQLHGHFVLNSLHSHNYWVSVLLIICTRLHAH---PAYAASIFTSSPSPNASVYSCMLKYYSRMGAHNEVVSLFRCMQSLDLRPQSFVYIYLIK--
        +  LR +H   +   LH+ NY +S L+  C  L  H     YA S+F +   PN  +++ M + ++        + L+ CM SL L P S+ + +++K  
Subjt:  ISQLRQLHGHFVLNSLHSHNYWVSVLLIICTRLHAH---PAYAASIFTSSPSPNASVYSCMLKYYSRMGAHNEVVSLFRCMQSLDLRPQSFVYIYLIK--

Query:  ---LAGKYGNIFHANVMKLGHVDDHFIRNAILDMYAKYGQVDLARKLFEQMAERTLADWNSMISGCWKSGNETDAVMLFNMMPARNIITWTAMVTGYAKM
            A K G   H +V+KLG   D ++  +++ MY + G+++ A K+F++                                P R+++++TA++ GYA  
Subjt:  ---LAGKYGNIFHANVMKLGHVDDHFIRNAILDMYAKYGQVDLARKLFEQMAERTLADWNSMISGCWKSGNETDAVMLFNMMPARNIITWTAMVTGYAKM

Query:  GDLDSARRYFDEMPERSVVSWNAMLSAYAQNECAEEALKLFHQMLKEGITPDDTTWVATISSCSSIGNPTLADAILRKINQKHSILNNFAKTALLDMHAK
        G +++A++ FDE+P + VVSWNAM+S YA+    +EAL+LF  M+K  + PD++T V  +S+C                                   A+
Subjt:  GDLDSARRYFDEMPERSVVSWNAMLSAYAQNECAEEALKLFHQMLKEGITPDDTTWVATISSCSSIGNPTLADAILRKINQKHSILNNFAKTALLDMHAK

Query:  FGNLEIARNI---FDELGGQRNAVTWNIMISAYTRVGKVSLARELFDNMPKRDVVSWNSMIAGYAQNGESAKSIELFQEMICCLDIQPDEVTIASVLSAC
         G++E+ R +    D+ G   N    N +I  Y++ G++  A  LF+ +P +DV+SWN++I GY       +++ LFQEM+   +  P++VT+ S+L AC
Subjt:  FGNLEIARNI---FDELGGQRNAVTWNIMISAYTRVGKVSLARELFDNMPKRDVVSWNSMIAGYAQNGESAKSIELFQEMICCLDIQPDEVTIASVLSAC

Query:  GHIGALKLSNWVLDIV--REKNIKLGISGFNSLIFLYSKCGCVPDAHRIFQTMEKRDVVSFNTLISGLAANGHGKEAIKLVLTMEEEGIEPDHVTYIGVL
         H+GA+ +  W+   +  R K +    S   SLI +Y+KCG +  AH++F ++  + + S+N +I G A +G    +  L   M + GI+PD +T++G+L
Subjt:  GHIGALKLSNWVLDIV--REKNIKLGISGFNSLIFLYSKCGCVPDAHRIFQTMEKRDVVSFNTLISGLAANGHGKEAIKLVLTMEEEGIEPDHVTYIGVL

Query:  TACSHAGLLKEGKNVFKSI-----KAPAVDHYACMVDLLGRAGELDEAKILIQSMPMEPHAGVYGSLLNASRIHKRVELGELAANKLFELEPENPGNYVL
        +ACSH+G+L  G+++F+++       P ++HY CM+DLLG +G   EA+ +I  M MEP   ++ SLL A ++H  VELGE  A  L ++EPENPG+YVL
Subjt:  TACSHAGLLKEGKNVFKSI-----KAPAVDHYACMVDLLGRAGELDEAKILIQSMPMEPHAGVYGSLLNASRIHKRVELGELAANKLFELEPENPGNYVL

Query:  LSNIYASAGRWEDVKKVREKMRNGGVKKSVGMSWVEYRGQVHKFIVGDRSHELSKDIYRLLAELERKMKRVGFVADKSCALRDVEEEEKEEMLGTHSEKL
        LSNIYASAGRW +V K R  + + G+KK  G S +E    VH+FI+GD+ H  +++IY +L E+E  +++ GFV D S  L+++EEE KE  L  HSEKL
Subjt:  LSNIYASAGRWEDVKKVREKMRNGGVKKSVGMSWVEYRGQVHKFIVGDRSHELSKDIYRLLAELERKMKRVGFVADKSCALRDVEEEEKEEMLGTHSEKL

Query:  AICFALLISEVGTPIRVL
        AI F L+ ++ GT + ++
Subjt:  AICFALLISEVGTPIRVL

Q9LUJ2 Pentatricopeptide repeat-containing protein At3g22690 | e value: 3.9e-123 | % identity: 35.33
Query:  AYAASIFTSSPS-PNASVYSCMLKYYSRMGAHNEVVSLFRCMQSLDLRPQSFVYIYLIKLAGK-----YGNIFHANVMKLGHVDDHFIRNAILDMYAKYG
        ++A  +F +S S     +Y+ +++ Y+  G  NE + LF  M +  + P  + + + +    K      G   H  ++K+G+  D F++N+++  YA+ G
Subjt:  AYAASIFTSSPS-PNASVYSCMLKYYSRMGAHNEVVSLFRCMQSLDLRPQSFVYIYLIKLAGK-----YGNIFHANVMKLGHVDDHFIRNAILDMYAKYG

Query:  QVDLARKLFEQMAERTLADWNSMISGCWKSGNETDAVMLFNMM-----PARNIITWTAMVTGYAKMGDL-------------------------------
        ++D ARK+F++M+ER +  W SMI G  +     DAV LF  M        N +T   +++  AK+ DL                               
Subjt:  QVDLARKLFEQMAERTLADWNSMISGCWKSGNETDAVMLFNMM-----PARNIITWTAMVTGYAKMGDL-------------------------------

Query:  ----DSARRYFDEMPERSVVSWNAMLSAYAQNECAEEALKLFHQMLKEGITPDDTTWVATISSCSSIGNPTLADAILRKINQKHSILNNFAK-----TAL
            D A+R FDE    ++   NAM S Y +     EAL +F+ M+  G+ PD  + ++ ISSCS + N      +  K    + + N F        AL
Subjt:  ----DSARRYFDEMPERSVVSWNAMLSAYAQNECAEEALKLFHQMLKEGITPDDTTWVATISSCSSIGNPTLADAILRKINQKHSILNNFAK-----TAL

Query:  LDMHAKFGNLEIARNIFDELGGQRNAVTWNIMISAYTRVGKVSLARELFDNMPKRDVVSWNSMIAGYAQNGESAKSIELFQEMICCLDIQPDEVTIASVL
        +DM+ K    + A  IFD +   +  VTWN +++ Y   G+V  A E F+ MP++++VSWN++I+G  Q     ++IE+F  M     +  D VT+ S+ 
Subjt:  LDMHAKFGNLEIARNIFDELGGQRNAVTWNIMISAYTRVGKVSLARELFDNMPKRDVVSWNSMIAGYAQNGESAKSIELFQEMICCLDIQPDEVTIASVL

Query:  SACGHIGALKLSNWVLDIVREKNIKLGISGFNSLIFLYSKCGCVPDAHRIFQTMEKRDVVSFNTLISGLAANGHGKEAIKLVLTMEEEGIEPDHVTYIGV
        SACGH+GAL L+ W+   + +  I+L +    +L+ ++S+CG    A  IF ++  RDV ++   I  +A  G+ + AI+L   M E+G++PD V ++G 
Subjt:  SACGHIGALKLSNWVLDIVREKNIKLGISGFNSLIFLYSKCGCVPDAHRIFQTMEKRDVVSFNTLISGLAANGHGKEAIKLVLTMEEEGIEPDHVTYIGV

Query:  LTACSHAGLLKEGKNVFKSI-----KAPAVDHYACMVDLLGRAGELDEAKILIQSMPMEPHAGVYGSLLNASRIHKRVELGELAANKLFELEPENPGNYV
        LTACSH GL+++GK +F S+      +P   HY CMVDLLGRAG L+EA  LI+ MPMEP+  ++ SLL A R+   VE+   AA K+  L PE  G+YV
Subjt:  LTACSHAGLLKEGKNVFKSI-----KAPAVDHYACMVDLLGRAGELDEAKILIQSMPMEPHAGVYGSLLNASRIHKRVELGELAANKLFELEPENPGNYV

Query:  LLSNIYASAGRWEDVKKVREKMRNGGVKKSVGMSWVEYRGQVHKFIVGDRSHELSKDIYRLLAELERKMKRVGFVADKSCALRDVEEEEKEEMLGTHSEK
        LLSN+YASAGRW D+ KVR  M+  G++K  G S ++ RG+ H+F  GD SH    +I  +L E+ ++   +G V D S  L DV+E+EK  ML  HSEK
Subjt:  LLSNIYASAGRWEDVKKVREKMRNGGVKKSVGMSWVEYRGQVHKFIVGDRSHELSKDIYRLLAELERKMKRVGFVADKSCALRDVEEEEKEEMLGTHSEK

Query:  LAICFALLISEVGTPIRVL
        LA+ + L+ S  GT IR++
Subjt:  LAICFALLISEVGTPIRVL

Q9M9R6 Pentatricopeptide repeat-containing protein At1g14470 | e value: 1.1e-160 | %identity: 52.63
Query:  LGAIASKISNISQLRQLHGH-FVLNSLHSHNYWVSVLLIICTRLHAHPAYAASIFTSSPSPNASVYSCMLKYYSRMGAHNEVVSLFRCMQSLDLRPQSFV
        L AIAS+     QL Q+H    V NSL   +YW S ++  CTRL A   Y   IF S   PN  V + M KY+S+M   N+V+ L+       + P +F 
Subjt:  LGAIASKISNISQLRQLHGH-FVLNSLHSHNYWVSVLLIICTRLHAHPAYAASIFTSSPSPNASVYSCMLKYYSRMGAHNEVVSLFRCMQSLDLRPQSFV

Query:  YIYLIKLAGKYGNIFHANVMKLGHVDDHFIRNAILDMYAKYGQVDLARKLFEQMAERTLADWNSMISGCWKSGNETDAVMLFNMMPARNIITWTAMVTGY
        +  +IK AG++G +F A V KLG   D ++RN I+DMY K+  V+ ARK+F+Q+++R  +DWN MISG WK GN+ +A  LF+MMP  ++++WT M+TG+
Subjt:  YIYLIKLAGKYGNIFHANVMKLGHVDDHFIRNAILDMYAKYGQVDLARKLFEQMAERTLADWNSMISGCWKSGNETDAVMLFNMMPARNIITWTAMVTGY

Query:  AKMGDLDSARRYFDEMPERSVVSWNAMLSAYAQNECAEEALKLFHQMLKEGITPDDTTWVATISSCSSIGNPTLADAILRKINQKHSILNNFAKTALLDM
        AK+ DL++AR+YFD MPE+SVVSWNAMLS YAQN   E+AL+LF+ ML+ G+ P++TTWV  IS+CS   +P+L  ++++ I++K   LN F KTALLDM
Subjt:  AKMGDLDSARRYFDEMPERSVVSWNAMLSAYAQNECAEEALKLFHQMLKEGITPDDTTWVATISSCSSIGNPTLADAILRKINQKHSILNNFAKTALLDM

Query:  HAKFGNLEIARNIFDELGGQRNAVTWNIMISAYTRVGKVSLARELFDNMPKRDVVSWNSMIAGYAQNGESAKSIELFQEMICCLDIQPDEVTIASVLSAC
        HAK  +++ AR IF+ELG QRN VTWN MIS YTR+G +S AR+LFD MPKR+VVSWNS+IAGYA NG++A +IE F++MI   D +PDEVT+ SVLSAC
Subjt:  HAKFGNLEIARNIFDELGGQRNAVTWNIMISAYTRVGKVSLARELFDNMPKRDVVSWNSMIAGYAQNGESAKSIELFQEMICCLDIQPDEVTIASVLSAC

Query:  GHIGALKLSNWVLDIVREKNIKLGISGFNSLIFLYSKCGCVPDAHRIFQTMEKRDVVSFNTLISGLAANGHGKEAIKLVLTMEEEGIEPDHVTYIGVLTA
        GH+  L+L + ++D +R+  IKL  SG+ SLIF+Y++ G + +A R+F  M++RDVVS+NTL +  AANG G E + L+  M++EGIEPD VTY  VLTA
Subjt:  GHIGALKLSNWVLDIVREKNIKLGISGFNSLIFLYSKCGCVPDAHRIFQTMEKRDVVSFNTLISGLAANGHGKEAIKLVLTMEEEGIEPDHVTYIGVLTA

Query:  CSHAGLLKEGKNVFKSIKAPAVDHYACMVDLL
        C+ AGLLKEG+ +FKSI+ P  DHYACM DLL
Subjt:  CSHAGLLKEGKNVFKSIKAPAVDHYACMVDLL

Q9SIT7 Pentatricopeptide repeat-containing protein At2g13600 | e value: 3.4e-119 | %identity: 37.01
Query:  KLAGKYGNIFHANVMKLGHVDDHFIRNAILDMYAKYGQVDLARKLFEQMAERTLADWNSMISGCWKSGNETDAVMLFNMMPARNIITWTAMVTG------
        KL+  Y    HA+V+K G  ++ FI+N ++D Y+K G ++  R++F++M +R +  WNS+++G  K G   +A  LF  MP R+  TW +MV+G      
Subjt:  KLAGKYGNIFHANVMKLGHVDDHFIRNAILDMYAKYGQVDLARKLFEQMAERTLADWNSMISGCWKSGNETDAVMLFNMMPARNIITWTAMVTG------

Query:  ----------------------------------------------------------------YAKMGDLDSARRYFDEMPERSVVSWNAMLSAYAQNE
                                                                        Y+K G+++ A+R FDEM +R+VVSWN++++ + QN 
Subjt:  ----------------------------------------------------------------YAKMGDLDSARRYFDEMPERSVVSWNAMLSAYAQNE

Query:  CAEEALKLFHQMLKEGITPDDTTWVATISSCSSIGNPTLADAILRKINQKHSILNN-FAKTALLDMHAKFGNLEIARNIFDELGGQRNAVTWNIMISAYT
         A EAL +F  ML+  + PD+ T  + IS+C+S+    +   +  ++ +   + N+     A +DM+AK   ++ AR IFD +   RN +    MIS Y 
Subjt:  CAEEALKLFHQMLKEGITPDDTTWVATISSCSSIGNPTLADAILRKINQKHSILNN-FAKTALLDMHAKFGNLEIARNIFDELGGQRNAVTWNIMISAYT

Query:  RVGKVSLARELFDNMPKRDVVSWNSMIAGYAQNGESAKSIELF----QEMICCLDIQPDEVTIASVLSACGHIGALKLSNWVLDIVREKNIKL------G
               AR +F  M +R+VVSWN++IAGY QNGE+ +++ LF    +E +C     P   + A++L AC  +  L L       V +   K        
Subjt:  RVGKVSLARELFDNMPKRDVVSWNSMIAGYAQNGESAKSIELF----QEMICCLDIQPDEVTIASVLSACGHIGALKLSNWVLDIVREKNIKL------G

Query:  ISGFNSLIFLYSKCGCVPDAHRIFQTMEKRDVVSFNTLISGLAANGHGKEAIKLVLTMEEEGIEPDHVTYIGVLTACSHAGLLKEGKNVFKSIK-----A
        I   NSLI +Y KCGCV + + +F+ M +RD VS+N +I G A NG+G EA++L   M E G +PDH+T IGVL+AC HAG ++EG++ F S+      A
Subjt:  ISGFNSLIFLYSKCGCVPDAHRIFQTMEKRDVVSFNTLISGLAANGHGKEAIKLVLTMEEEGIEPDHVTYIGVLTACSHAGLLKEGKNVFKSIK-----A

Query:  PAVDHYACMVDLLGRAGELDEAKILIQSMPMEPHAGVYGSLLNASRIHKRVELGELAANKLFELEPENPGNYVLLSNIYASAGRWEDVKKVREKMRNGGV
        P  DHY CMVDLLGRAG L+EAK +I+ MPM+P + ++GSLL A ++H+ + LG+  A KL E+EP N G YVLLSN+YA  G+WEDV  VR+ MR  GV
Subjt:  PAVDHYACMVDLLGRAGELDEAKILIQSMPMEPHAGVYGSLLNASRIHKRVELGELAANKLFELEPENPGNYVLLSNIYASAGRWEDVKKVREKMRNGGV

Query:  KKSVGMSWVEYRGQVHKFIVGDRSHELSKDIYRLLAELERKMK
         K  G SW++ +G  H F+V D+SH   K I+ LL  L  +M+
Subjt:  KKSVGMSWVEYRGQVHKFIVGDRSHELSKDIYRLLAELERKMK

Q9SY02 Pentatricopeptide repeat-containing protein At4g02750 | e value: 3.7e-129 | %identity: 39.35
Query:  NAILDMYAKYGQVDLARKLFEQMAERTLADWNSMISGCWKSGNETDAVMLFNMMPARNIITWTAMVTGYAKMGDLDSARRYFDEMPERSVVSWNAMLSAY
        N ++  Y + G+ +LARKLF++M ER L  WN MI G  ++ N   A  LF +MP R++ +W  M++GYA+ G +D AR  FD MPE++ VSWNA+LSAY
Subjt:  NAILDMYAKYGQVDLARKLFEQMAERTLADWNSMISGCWKSGNETDAVMLFNMMPARNIITWTAMVTGYAKMGDLDSARRYFDEMPERSVVSWNAMLSAY

Query:  AQNECAEEALKLF----------------------------------------------------------HQMLKEGITPDDTTWVATISSCSSIGNPT
         QN   EEA  LF                                                           Q+  E    D  TW A +S    I N  
Subjt:  AQNECAEEALKLF----------------------------------------------------------HQMLKEGITPDDTTWVATISSCSSIGNPT

Query:  LADAILRKINQKHSILNNFAKTALLDMHAKFGNLEIARNIFDELGGQRNAVTWNIMISAYTRVGKVSLARELFDNMPKRDVVSWNSMIAGYAQNGESAKS
        + +A  R++  K    N  +  A+L  + +   +E+A+ +FD +   RN  TWN MI+ Y + GK+S A+ LFD MPKRD VSW +MIAGY+Q+G S ++
Subjt:  LADAILRKINQKHSILNNFAKTALLDMHAKFGNLEIARNIFDELGGQRNAVTWNIMISAYTRVGKVSLARELFDNMPKRDVVSWNSMIAGYAQNGESAKS

Query:  IELFQEMICCLDIQPDEVTIASVLSACGHIGALKLSNWVLDIVREKNIKLGISGFNSLIFLYSKCGCVPDAHRIFQTMEKRDVVSFNTLISGLAANGHGK
        + LF +M      + +  + +S LS C  + AL+L   +   + +   + G    N+L+ +Y KCG + +A+ +F+ M  +D+VS+NT+I+G + +G G+
Subjt:  IELFQEMICCLDIQPDEVTIASVLSACGHIGALKLSNWVLDIVREKNIKLGISGFNSLIFLYSKCGCVPDAHRIFQTMEKRDVVSFNTLISGLAANGHGK

Query:  EAIKLVLTMEEEGIEPDHVTYIGVLTACSHAGLLKEGKNVFKSIK-----APAVDHYACMVDLLGRAGELDEAKILIQSMPMEPHAGVYGSLLNASRIHK
         A++   +M+ EG++PD  T + VL+ACSH GL+ +G+  F ++       P   HYACMVDLLGRAG L++A  L+++MP EP A ++G+LL ASR+H 
Subjt:  EAIKLVLTMEEEGIEPDHVTYIGVLTACSHAGLLKEGKNVFKSIK-----APAVDHYACMVDLLGRAGELDEAKILIQSMPMEPHAGVYGSLLNASRIHK

Query:  RVELGELAANKLFELEPENPGNYVLLSNIYASAGRWEDVKKVREKMRNGGVKKSVGMSWVEYRGQVHKFIVGDRSHELSKDIYRLLAELERKMKRVGFVA
          EL E AA+K+F +EPEN G YVLLSN+YAS+GRW DV K+R +MR+ GVKK  G SW+E + + H F VGD  H    +I+  L EL+ +MK+ G+V+
Subjt:  RVELGELAANKLFELEPENPGNYVLLSNIYASAGRWEDVKKVREKMRNGGVKKSVGMSWVEYRGQVHKFIVGDRSHELSKDIYRLLAELERKMKRVGFVA

Query:  DKSCALRDVEEEEKEEMLGTHSEKLAICFALLISEVGTPIRVL
          S  L DVEEEEKE M+  HSE+LA+ + ++    G PIRV+
Subjt:  DKSCALRDVEEEEKEEMLGTHSEKLAICFALLISEVGTPIRVL

Arabidopsis top hits | e value | %identity | Alignment
AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein | e value: 2.5e-125 | %identity: 34.4
Query:  ISQLRQLHGHFVLNSLHSHNYWVSVLLIICTRLHAH---PAYAASIFTSSPSPNASVYSCMLKYYSRMGAHNEVVSLFRCMQSLDLRPQSFVYIYLIK--
        +  LR +H   +   LH+ NY +S L+  C  L  H     YA S+F +   PN  +++ M + ++        + L+ CM SL L P S+ + +++K  
Subjt:  ISQLRQLHGHFVLNSLHSHNYWVSVLLIICTRLHAH---PAYAASIFTSSPSPNASVYSCMLKYYSRMGAHNEVVSLFRCMQSLDLRPQSFVYIYLIK--

Query:  ---LAGKYGNIFHANVMKLGHVDDHFIRNAILDMYAKYGQVDLARKLFEQMAERTLADWNSMISGCWKSGNETDAVMLFNMMPARNIITWTAMVTGYAKM
            A K G   H +V+KLG   D ++  +++ MY + G+++ A K+F++                                P R+++++TA++ GYA  
Subjt:  ---LAGKYGNIFHANVMKLGHVDDHFIRNAILDMYAKYGQVDLARKLFEQMAERTLADWNSMISGCWKSGNETDAVMLFNMMPARNIITWTAMVTGYAKM

Query:  GDLDSARRYFDEMPERSVVSWNAMLSAYAQNECAEEALKLFHQMLKEGITPDDTTWVATISSCSSIGNPTLADAILRKINQKHSILNNFAKTALLDMHAK
        G +++A++ FDE+P + VVSWNAM+S YA+    +EAL+LF  M+K  + PD++T V  +S+C                                   A+
Subjt:  GDLDSARRYFDEMPERSVVSWNAMLSAYAQNECAEEALKLFHQMLKEGITPDDTTWVATISSCSSIGNPTLADAILRKINQKHSILNNFAKTALLDMHAK

Query:  FGNLEIARNI---FDELGGQRNAVTWNIMISAYTRVGKVSLARELFDNMPKRDVVSWNSMIAGYAQNGESAKSIELFQEMICCLDIQPDEVTIASVLSAC
         G++E+ R +    D+ G   N    N +I  Y++ G++  A  LF+ +P +DV+SWN++I GY       +++ LFQEM+   +  P++VT+ S+L AC
Subjt:  FGNLEIARNI---FDELGGQRNAVTWNIMISAYTRVGKVSLARELFDNMPKRDVVSWNSMIAGYAQNGESAKSIELFQEMICCLDIQPDEVTIASVLSAC

Query:  GHIGALKLSNWVLDIV--REKNIKLGISGFNSLIFLYSKCGCVPDAHRIFQTMEKRDVVSFNTLISGLAANGHGKEAIKLVLTMEEEGIEPDHVTYIGVL
         H+GA+ +  W+   +  R K +    S   SLI +Y+KCG +  AH++F ++  + + S+N +I G A +G    +  L   M + GI+PD +T++G+L
Subjt:  GHIGALKLSNWVLDIV--REKNIKLGISGFNSLIFLYSKCGCVPDAHRIFQTMEKRDVVSFNTLISGLAANGHGKEAIKLVLTMEEEGIEPDHVTYIGVL

Query:  TACSHAGLLKEGKNVFKSI-----KAPAVDHYACMVDLLGRAGELDEAKILIQSMPMEPHAGVYGSLLNASRIHKRVELGELAANKLFELEPENPGNYVL
        +ACSH+G+L  G+++F+++       P ++HY CM+DLLG +G   EA+ +I  M MEP   ++ SLL A ++H  VELGE  A  L ++EPENPG+YVL
Subjt:  TACSHAGLLKEGKNVFKSI-----KAPAVDHYACMVDLLGRAGELDEAKILIQSMPMEPHAGVYGSLLNASRIHKRVELGELAANKLFELEPENPGNYVL

Query:  LSNIYASAGRWEDVKKVREKMRNGGVKKSVGMSWVEYRGQVHKFIVGDRSHELSKDIYRLLAELERKMKRVGFVADKSCALRDVEEEEKEEMLGTHSEKL
        LSNIYASAGRW +V K R  + + G+KK  G S +E    VH+FI+GD+ H  +++IY +L E+E  +++ GFV D S  L+++EEE KE  L  HSEKL
Subjt:  LSNIYASAGRWEDVKKVREKMRNGGVKKSVGMSWVEYRGQVHKFIVGDRSHELSKDIYRLLAELERKMKRVGFVADKSCALRDVEEEEKEEMLGTHSEKL

Query:  AICFALLISEVGTPIRVL
        AI F L+ ++ GT + ++
Subjt:  AICFALLISEVGTPIRVL

AT1G14470.1 Pentatricopeptide repeat (PPR) superfamily protein | e value: 7.5e-162 | %identity: 52.63
Query:  LGAIASKISNISQLRQLHGH-FVLNSLHSHNYWVSVLLIICTRLHAHPAYAASIFTSSPSPNASVYSCMLKYYSRMGAHNEVVSLFRCMQSLDLRPQSFV
        L AIAS+     QL Q+H    V NSL   +YW S ++  CTRL A   Y   IF S   PN  V + M KY+S+M   N+V+ L+       + P +F 
Subjt:  LGAIASKISNISQLRQLHGH-FVLNSLHSHNYWVSVLLIICTRLHAHPAYAASIFTSSPSPNASVYSCMLKYYSRMGAHNEVVSLFRCMQSLDLRPQSFV

Query:  YIYLIKLAGKYGNIFHANVMKLGHVDDHFIRNAILDMYAKYGQVDLARKLFEQMAERTLADWNSMISGCWKSGNETDAVMLFNMMPARNIITWTAMVTGY
        +  +IK AG++G +F A V KLG   D ++RN I+DMY K+  V+ ARK+F+Q+++R  +DWN MISG WK GN+ +A  LF+MMP  ++++WT M+TG+
Subjt:  YIYLIKLAGKYGNIFHANVMKLGHVDDHFIRNAILDMYAKYGQVDLARKLFEQMAERTLADWNSMISGCWKSGNETDAVMLFNMMPARNIITWTAMVTGY

Query:  AKMGDLDSARRYFDEMPERSVVSWNAMLSAYAQNECAEEALKLFHQMLKEGITPDDTTWVATISSCSSIGNPTLADAILRKINQKHSILNNFAKTALLDM
        AK+ DL++AR+YFD MPE+SVVSWNAMLS YAQN   E+AL+LF+ ML+ G+ P++TTWV  IS+CS   +P+L  ++++ I++K   LN F KTALLDM
Subjt:  AKMGDLDSARRYFDEMPERSVVSWNAMLSAYAQNECAEEALKLFHQMLKEGITPDDTTWVATISSCSSIGNPTLADAILRKINQKHSILNNFAKTALLDM

Query:  HAKFGNLEIARNIFDELGGQRNAVTWNIMISAYTRVGKVSLARELFDNMPKRDVVSWNSMIAGYAQNGESAKSIELFQEMICCLDIQPDEVTIASVLSAC
        HAK  +++ AR IF+ELG QRN VTWN MIS YTR+G +S AR+LFD MPKR+VVSWNS+IAGYA NG++A +IE F++MI   D +PDEVT+ SVLSAC
Subjt:  HAKFGNLEIARNIFDELGGQRNAVTWNIMISAYTRVGKVSLARELFDNMPKRDVVSWNSMIAGYAQNGESAKSIELFQEMICCLDIQPDEVTIASVLSAC

Query:  GHIGALKLSNWVLDIVREKNIKLGISGFNSLIFLYSKCGCVPDAHRIFQTMEKRDVVSFNTLISGLAANGHGKEAIKLVLTMEEEGIEPDHVTYIGVLTA
        GH+  L+L + ++D +R+  IKL  SG+ SLIF+Y++ G + +A R+F  M++RDVVS+NTL +  AANG G E + L+  M++EGIEPD VTY  VLTA
Subjt:  GHIGALKLSNWVLDIVREKNIKLGISGFNSLIFLYSKCGCVPDAHRIFQTMEKRDVVSFNTLISGLAANGHGKEAIKLVLTMEEEGIEPDHVTYIGVLTA

Query:  CSHAGLLKEGKNVFKSIKAPAVDHYACMVDLL
        C+ AGLLKEG+ +FKSI+ P  DHYACM DLL
Subjt:  CSHAGLLKEGKNVFKSIKAPAVDHYACMVDLL

AT3G22690.1 CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012881), Pentatricopeptide repeat (InterPro:IPR002885) | e value: 2.8e-124 | %identity: 35.33
Query:  AYAASIFTSSPS-PNASVYSCMLKYYSRMGAHNEVVSLFRCMQSLDLRPQSFVYIYLIKLAGK-----YGNIFHANVMKLGHVDDHFIRNAILDMYAKYG
        ++A  +F +S S     +Y+ +++ Y+  G  NE + LF  M +  + P  + + + +    K      G   H  ++K+G+  D F++N+++  YA+ G
Subjt:  AYAASIFTSSPS-PNASVYSCMLKYYSRMGAHNEVVSLFRCMQSLDLRPQSFVYIYLIKLAGK-----YGNIFHANVMKLGHVDDHFIRNAILDMYAKYG

Query:  QVDLARKLFEQMAERTLADWNSMISGCWKSGNETDAVMLFNMM-----PARNIITWTAMVTGYAKMGDL-------------------------------
        ++D ARK+F++M+ER +  W SMI G  +     DAV LF  M        N +T   +++  AK+ DL                               
Subjt:  QVDLARKLFEQMAERTLADWNSMISGCWKSGNETDAVMLFNMM-----PARNIITWTAMVTGYAKMGDL-------------------------------

Query:  ----DSARRYFDEMPERSVVSWNAMLSAYAQNECAEEALKLFHQMLKEGITPDDTTWVATISSCSSIGNPTLADAILRKINQKHSILNNFAK-----TAL
            D A+R FDE    ++   NAM S Y +     EAL +F+ M+  G+ PD  + ++ ISSCS + N      +  K    + + N F        AL
Subjt:  ----DSARRYFDEMPERSVVSWNAMLSAYAQNECAEEALKLFHQMLKEGITPDDTTWVATISSCSSIGNPTLADAILRKINQKHSILNNFAK-----TAL

Query:  LDMHAKFGNLEIARNIFDELGGQRNAVTWNIMISAYTRVGKVSLARELFDNMPKRDVVSWNSMIAGYAQNGESAKSIELFQEMICCLDIQPDEVTIASVL
        +DM+ K    + A  IFD +   +  VTWN +++ Y   G+V  A E F+ MP++++VSWN++I+G  Q     ++IE+F  M     +  D VT+ S+ 
Subjt:  LDMHAKFGNLEIARNIFDELGGQRNAVTWNIMISAYTRVGKVSLARELFDNMPKRDVVSWNSMIAGYAQNGESAKSIELFQEMICCLDIQPDEVTIASVL

Query:  SACGHIGALKLSNWVLDIVREKNIKLGISGFNSLIFLYSKCGCVPDAHRIFQTMEKRDVVSFNTLISGLAANGHGKEAIKLVLTMEEEGIEPDHVTYIGV
        SACGH+GAL L+ W+   + +  I+L +    +L+ ++S+CG    A  IF ++  RDV ++   I  +A  G+ + AI+L   M E+G++PD V ++G 
Subjt:  SACGHIGALKLSNWVLDIVREKNIKLGISGFNSLIFLYSKCGCVPDAHRIFQTMEKRDVVSFNTLISGLAANGHGKEAIKLVLTMEEEGIEPDHVTYIGV

Query:  LTACSHAGLLKEGKNVFKSI-----KAPAVDHYACMVDLLGRAGELDEAKILIQSMPMEPHAGVYGSLLNASRIHKRVELGELAANKLFELEPENPGNYV
        LTACSH GL+++GK +F S+      +P   HY CMVDLLGRAG L+EA  LI+ MPMEP+  ++ SLL A R+   VE+   AA K+  L PE  G+YV
Subjt:  LTACSHAGLLKEGKNVFKSI-----KAPAVDHYACMVDLLGRAGELDEAKILIQSMPMEPHAGVYGSLLNASRIHKRVELGELAANKLFELEPENPGNYV

Query:  LLSNIYASAGRWEDVKKVREKMRNGGVKKSVGMSWVEYRGQVHKFIVGDRSHELSKDIYRLLAELERKMKRVGFVADKSCALRDVEEEEKEEMLGTHSEK
        LLSN+YASAGRW D+ KVR  M+  G++K  G S ++ RG+ H+F  GD SH    +I  +L E+ ++   +G V D S  L DV+E+EK  ML  HSEK
Subjt:  LLSNIYASAGRWEDVKKVREKMRNGGVKKSVGMSWVEYRGQVHKFIVGDRSHELSKDIYRLLAELERKMKRVGFVADKSCALRDVEEEEKEEMLGTHSEK

Query:  LAICFALLISEVGTPIRVL
        LA+ + L+ S  GT IR++
Subjt:  LAICFALLISEVGTPIRVL

AT3G22690.2 INVOLVED IN: photosystem II assembly, regulation of chlorophyll biosynthetic process, photosystem I assembly, thylakoid membrane organization, RNA modification | e value: 2.8e-124 | %identity: 35.33
Query:  AYAASIFTSSPS-PNASVYSCMLKYYSRMGAHNEVVSLFRCMQSLDLRPQSFVYIYLIKLAGK-----YGNIFHANVMKLGHVDDHFIRNAILDMYAKYG
        ++A  +F +S S     +Y+ +++ Y+  G  NE + LF  M +  + P  + + + +    K      G   H  ++K+G+  D F++N+++  YA+ G
Subjt:  AYAASIFTSSPS-PNASVYSCMLKYYSRMGAHNEVVSLFRCMQSLDLRPQSFVYIYLIKLAGK-----YGNIFHANVMKLGHVDDHFIRNAILDMYAKYG

Query:  QVDLARKLFEQMAERTLADWNSMISGCWKSGNETDAVMLFNMM-----PARNIITWTAMVTGYAKMGDL-------------------------------
        ++D ARK+F++M+ER +  W SMI G  +     DAV LF  M        N +T   +++  AK+ DL                               
Subjt:  QVDLARKLFEQMAERTLADWNSMISGCWKSGNETDAVMLFNMM-----PARNIITWTAMVTGYAKMGDL-------------------------------

Query:  ----DSARRYFDEMPERSVVSWNAMLSAYAQNECAEEALKLFHQMLKEGITPDDTTWVATISSCSSIGNPTLADAILRKINQKHSILNNFAK-----TAL
            D A+R FDE    ++   NAM S Y +     EAL +F+ M+  G+ PD  + ++ ISSCS + N      +  K    + + N F        AL
Subjt:  ----DSARRYFDEMPERSVVSWNAMLSAYAQNECAEEALKLFHQMLKEGITPDDTTWVATISSCSSIGNPTLADAILRKINQKHSILNNFAK-----TAL

Query:  LDMHAKFGNLEIARNIFDELGGQRNAVTWNIMISAYTRVGKVSLARELFDNMPKRDVVSWNSMIAGYAQNGESAKSIELFQEMICCLDIQPDEVTIASVL
        +DM+ K    + A  IFD +   +  VTWN +++ Y   G+V  A E F+ MP++++VSWN++I+G  Q     ++IE+F  M     +  D VT+ S+ 
Subjt:  LDMHAKFGNLEIARNIFDELGGQRNAVTWNIMISAYTRVGKVSLARELFDNMPKRDVVSWNSMIAGYAQNGESAKSIELFQEMICCLDIQPDEVTIASVL

Query:  SACGHIGALKLSNWVLDIVREKNIKLGISGFNSLIFLYSKCGCVPDAHRIFQTMEKRDVVSFNTLISGLAANGHGKEAIKLVLTMEEEGIEPDHVTYIGV
        SACGH+GAL L+ W+   + +  I+L +    +L+ ++S+CG    A  IF ++  RDV ++   I  +A  G+ + AI+L   M E+G++PD V ++G 
Subjt:  SACGHIGALKLSNWVLDIVREKNIKLGISGFNSLIFLYSKCGCVPDAHRIFQTMEKRDVVSFNTLISGLAANGHGKEAIKLVLTMEEEGIEPDHVTYIGV

Query:  LTACSHAGLLKEGKNVFKSI-----KAPAVDHYACMVDLLGRAGELDEAKILIQSMPMEPHAGVYGSLLNASRIHKRVELGELAANKLFELEPENPGNYV
        LTACSH GL+++GK +F S+      +P   HY CMVDLLGRAG L+EA  LI+ MPMEP+  ++ SLL A R+   VE+   AA K+  L PE  G+YV
Subjt:  LTACSHAGLLKEGKNVFKSI-----KAPAVDHYACMVDLLGRAGELDEAKILIQSMPMEPHAGVYGSLLNASRIHKRVELGELAANKLFELEPENPGNYV

Query:  LLSNIYASAGRWEDVKKVREKMRNGGVKKSVGMSWVEYRGQVHKFIVGDRSHELSKDIYRLLAELERKMKRVGFVADKSCALRDVEEEEKEEMLGTHSEK
        LLSN+YASAGRW D+ KVR  M+  G++K  G S ++ RG+ H+F  GD SH    +I  +L E+ ++   +G V D S  L DV+E+EK  ML  HSEK
Subjt:  LLSNIYASAGRWEDVKKVREKMRNGGVKKSVGMSWVEYRGQVHKFIVGDRSHELSKDIYRLLAELERKMKRVGFVADKSCALRDVEEEEKEEMLGTHSEK

Query:  LAICFALLISEVGTPIRVL
        LA+ + L+ S  GT IR++
Subjt:  LAICFALLISEVGTPIRVL

AT4G02750.1 Tetratricopeptide repeat (TPR)-like superfamily protein | e value: 2.6e-130 | %identity: 39.35
Query:  NAILDMYAKYGQVDLARKLFEQMAERTLADWNSMISGCWKSGNETDAVMLFNMMPARNIITWTAMVTGYAKMGDLDSARRYFDEMPERSVVSWNAMLSAY
        N ++  Y + G+ +LARKLF++M ER L  WN MI G  ++ N   A  LF +MP R++ +W  M++GYA+ G +D AR  FD MPE++ VSWNA+LSAY
Subjt:  NAILDMYAKYGQVDLARKLFEQMAERTLADWNSMISGCWKSGNETDAVMLFNMMPARNIITWTAMVTGYAKMGDLDSARRYFDEMPERSVVSWNAMLSAY

Query:  AQNECAEEALKLF----------------------------------------------------------HQMLKEGITPDDTTWVATISSCSSIGNPT
         QN   EEA  LF                                                           Q+  E    D  TW A +S    I N  
Subjt:  AQNECAEEALKLF----------------------------------------------------------HQMLKEGITPDDTTWVATISSCSSIGNPT

Query:  LADAILRKINQKHSILNNFAKTALLDMHAKFGNLEIARNIFDELGGQRNAVTWNIMISAYTRVGKVSLARELFDNMPKRDVVSWNSMIAGYAQNGESAKS
        + +A  R++  K    N  +  A+L  + +   +E+A+ +FD +   RN  TWN MI+ Y + GK+S A+ LFD MPKRD VSW +MIAGY+Q+G S ++
Subjt:  LADAILRKINQKHSILNNFAKTALLDMHAKFGNLEIARNIFDELGGQRNAVTWNIMISAYTRVGKVSLARELFDNMPKRDVVSWNSMIAGYAQNGESAKS

Query:  IELFQEMICCLDIQPDEVTIASVLSACGHIGALKLSNWVLDIVREKNIKLGISGFNSLIFLYSKCGCVPDAHRIFQTMEKRDVVSFNTLISGLAANGHGK
        + LF +M      + +  + +S LS C  + AL+L   +   + +   + G    N+L+ +Y KCG + +A+ +F+ M  +D+VS+NT+I+G + +G G+
Subjt:  IELFQEMICCLDIQPDEVTIASVLSACGHIGALKLSNWVLDIVREKNIKLGISGFNSLIFLYSKCGCVPDAHRIFQTMEKRDVVSFNTLISGLAANGHGK

Query:  EAIKLVLTMEEEGIEPDHVTYIGVLTACSHAGLLKEGKNVFKSIK-----APAVDHYACMVDLLGRAGELDEAKILIQSMPMEPHAGVYGSLLNASRIHK
         A++   +M+ EG++PD  T + VL+ACSH GL+ +G+  F ++       P   HYACMVDLLGRAG L++A  L+++MP EP A ++G+LL ASR+H 
Subjt:  EAIKLVLTMEEEGIEPDHVTYIGVLTACSHAGLLKEGKNVFKSIK-----APAVDHYACMVDLLGRAGELDEAKILIQSMPMEPHAGVYGSLLNASRIHK

Query:  RVELGELAANKLFELEPENPGNYVLLSNIYASAGRWEDVKKVREKMRNGGVKKSVGMSWVEYRGQVHKFIVGDRSHELSKDIYRLLAELERKMKRVGFVA
          EL E AA+K+F +EPEN G YVLLSN+YAS+GRW DV K+R +MR+ GVKK  G SW+E + + H F VGD  H    +I+  L EL+ +MK+ G+V+
Subjt:  RVELGELAANKLFELEPENPGNYVLLSNIYASAGRWEDVKKVREKMRNGGVKKSVGMSWVEYRGQVHKFIVGDRSHELSKDIYRLLAELERKMKRVGFVA

Query:  DKSCALRDVEEEEKEEMLGTHSEKLAICFALLISEVGTPIRVL
          S  L DVEEEEKE M+  HSE+LA+ + ++    G PIRV+
Subjt:  DKSCALRDVEEEEKEEMLGTHSEKLAICFALLISEVGTPIRVL


Sequences
CDS sequence
ATGTATGAATTGGGCGCCATAGCTTCCAAAATAAGCAATATAAGTCAGTTAAGACAGCTTCATGGGCATTTTGTTCTCAATTCCTTACATTCTCACAATTACTGGGTTTC
TGTGCTCCTCATCATCTGTACCCGTCTTCACGCTCATCCTGCCTATGCGGCTTCTATTTTTACCTCCTCGCCGTCCCCCAATGCTTCTGTTTACAGTTGTATGCTCAAAT
ATTACTCCCGCATGGGTGCGCACAATGAGGTGGTTTCCCTCTTCAGATGTATGCAGTCTTTAGATCTCAGGCCCCAGTCATTTGTTTATATATACTTGATCAAGTTAGCT
GGGAAATATGGCAATATTTTCCATGCTAATGTCATGAAGTTGGGTCATGTTGATGACCATTTCATCCGTAATGCTATCTTGGATATGTATGCAAAATATGGCCAAGTCGA
TCTTGCCAGGAAGTTGTTTGAGCAAATGGCTGAAAGAACTTTAGCGGACTGGAATTCGATGATCTCTGGCTGTTGGAAATCAGGAAATGAAACTGATGCGGTCATGCTGT
TTAATATGATGCCTGCTAGGAATATTATCACATGGACTGCCATGGTTACTGGGTATGCCAAGATGGGGGACTTGGACAGTGCTAGAAGGTATTTTGATGAGATGCCAGAG
AGAAGTGTAGTCTCATGGAATGCAATGCTATCAGCTTATGCTCAAAATGAATGTGCAGAAGAGGCTTTGAAATTGTTCCATCAAATGCTGAAAGAAGGGATCACTCCTGA
TGATACAACATGGGTTGCTACAATTTCATCATGCTCTTCCATCGGCAATCCTACCCTTGCTGATGCAATTCTAAGGAAGATCAACCAAAAGCATAGCATTTTGAATAATT
TTGCCAAGACGGCTTTACTTGACATGCATGCAAAATTTGGTAACCTTGAAATTGCTAGAAATATCTTTGACGAATTGGGAGGTCAGAGGAATGCTGTTACTTGGAATATC
ATGATCTCAGCATATACGAGGGTAGGAAAAGTGTCATTAGCTCGAGAGTTGTTTGATAATATGCCAAAAAGAGATGTTGTTTCTTGGAATTCGATGATTGCTGGTTATGC
ACAAAATGGAGAGTCAGCCAAGTCAATTGAGCTCTTTCAAGAAATGATTTGTTGTCTGGACATACAGCCGGATGAGGTTACCATAGCTAGTGTTTTGTCTGCCTGTGGAC
ATATTGGGGCTCTAAAATTGAGTAACTGGGTTCTAGATATCGTTCGAGAGAAAAACATTAAGTTGGGGATCTCAGGATTCAATTCTTTAATATTCCTGTACTCTAAATGT
GGATGTGTGCCAGATGCCCATAGGATATTCCAAACTATGGAGAAAAGAGATGTTGTTTCTTTCAATACGCTAATTTCAGGATTGGCAGCCAATGGTCATGGGAAGGAAGC
TATCAAGTTAGTATTAACAATGGAGGAAGAAGGCATTGAACCAGACCATGTCACATACATTGGTGTTTTGACTGCATGTAGCCATGCAGGGCTGCTGAAAGAAGGTAAAA
ACGTCTTTAAGTCAATTAAAGCACCTGCTGTGGATCATTATGCTTGTATGGTTGATTTATTAGGAAGAGCAGGTGAATTAGATGAAGCCAAAATATTGATTCAATCTATG
CCGATGGAACCCCATGCTGGTGTTTATGGCTCTTTGTTAAATGCCAGTCGAATTCACAAGAGAGTTGAGTTGGGAGAACTTGCTGCTAACAAGCTCTTTGAGCTTGAACC
TGAAAATCCTGGAAATTATGTTTTACTTTCTAATATATATGCCTCGGCTGGAAGATGGGAAGATGTTAAAAAGGTTAGAGAGAAGATGAGGAACGGAGGTGTGAAGAAAT
CAGTTGGGATGAGTTGGGTGGAATATAGGGGTCAAGTGCATAAGTTCATTGTGGGCGATAGATCACATGAACTATCAAAAGATATCTATAGATTATTGGCTGAACTTGAA
AGGAAGATGAAGAGGGTTGGCTTTGTAGCTGATAAAAGTTGTGCACTTCGAGATGTTGAGGAGGAAGAGAAGGAAGAAATGCTGGGAACTCACAGTGAGAAGTTGGCCAT
TTGTTTTGCTCTCCTTATCAGTGAAGTGGGGACACCAATTAGAGTGCTGTGCAGATCTGAGTATGACACACTGGGAACATCAGTCAAAGCTGCACTTGTTTATCTTGGAA
CTGCCTTAGTAAAGCTTGTATGCCTTGCAACCTTTCTTAACGTGTCAGAGAATGACTCCTTTGACCTATATCAGGAACTGTTGAAAGCGCTTATCGGTTTGATTGATGTC
GCTGGACTTTACTTTGCTTTGACCCAGTTGACTTACCGGAACATATCTCAAAACCATAAGTTTCAGGCCGTTGGACTGGGTTGGGCATTTGCTGATTCTGTTCTGCATAG
ATTGGCACCACTATGGGTCGGGGCCAGAGGACTGGAGTTTACTTGGGATTACATTTTGCAGGGCCTTGAAGCTAATGCAAATCTGGTGTTGAGTATATCTCTAGCTGCAT
TGGGATCTTTGATGTGGCTTCGGAAGAACAAGCCCAAGACTCTAATTCCCATAATATATATCTGTGCATTGATCGTGGCTACCATGCCATCCATTACAAGACTCAAAACT
TGCAATAATCTTACCTATTTAGATGAGCCTTTTGCTGAATCTTTCTCCTTGCAAATTAGTTGGTTTTCCAGCAATTTGCAGGGTTTCAAGCTCCTTAGTTGCTATTCAAA
ATTCAGATATATAGAGAAGATGCATAATGTTAATACCCACTGCACATTGTGTTTTCAGCTACTTAAGGCGAGGAATGGGCTGGCACTTCCCAAAGGTGGTGGGATTTGA
mRNA sequence
ATGTATGAATTGGGCGCCATAGCTTCCAAAATAAGCAATATAAGTCAGTTAAGACAGCTTCATGGGCATTTTGTTCTCAATTCCTTACATTCTCACAATTACTGGGTTTC
TGTGCTCCTCATCATCTGTACCCGTCTTCACGCTCATCCTGCCTATGCGGCTTCTATTTTTACCTCCTCGCCGTCCCCCAATGCTTCTGTTTACAGTTGTATGCTCAAAT
ATTACTCCCGCATGGGTGCGCACAATGAGGTGGTTTCCCTCTTCAGATGTATGCAGTCTTTAGATCTCAGGCCCCAGTCATTTGTTTATATATACTTGATCAAGTTAGCT
GGGAAATATGGCAATATTTTCCATGCTAATGTCATGAAGTTGGGTCATGTTGATGACCATTTCATCCGTAATGCTATCTTGGATATGTATGCAAAATATGGCCAAGTCGA
TCTTGCCAGGAAGTTGTTTGAGCAAATGGCTGAAAGAACTTTAGCGGACTGGAATTCGATGATCTCTGGCTGTTGGAAATCAGGAAATGAAACTGATGCGGTCATGCTGT
TTAATATGATGCCTGCTAGGAATATTATCACATGGACTGCCATGGTTACTGGGTATGCCAAGATGGGGGACTTGGACAGTGCTAGAAGGTATTTTGATGAGATGCCAGAG
AGAAGTGTAGTCTCATGGAATGCAATGCTATCAGCTTATGCTCAAAATGAATGTGCAGAAGAGGCTTTGAAATTGTTCCATCAAATGCTGAAAGAAGGGATCACTCCTGA
TGATACAACATGGGTTGCTACAATTTCATCATGCTCTTCCATCGGCAATCCTACCCTTGCTGATGCAATTCTAAGGAAGATCAACCAAAAGCATAGCATTTTGAATAATT
TTGCCAAGACGGCTTTACTTGACATGCATGCAAAATTTGGTAACCTTGAAATTGCTAGAAATATCTTTGACGAATTGGGAGGTCAGAGGAATGCTGTTACTTGGAATATC
ATGATCTCAGCATATACGAGGGTAGGAAAAGTGTCATTAGCTCGAGAGTTGTTTGATAATATGCCAAAAAGAGATGTTGTTTCTTGGAATTCGATGATTGCTGGTTATGC
ACAAAATGGAGAGTCAGCCAAGTCAATTGAGCTCTTTCAAGAAATGATTTGTTGTCTGGACATACAGCCGGATGAGGTTACCATAGCTAGTGTTTTGTCTGCCTGTGGAC
ATATTGGGGCTCTAAAATTGAGTAACTGGGTTCTAGATATCGTTCGAGAGAAAAACATTAAGTTGGGGATCTCAGGATTCAATTCTTTAATATTCCTGTACTCTAAATGT
GGATGTGTGCCAGATGCCCATAGGATATTCCAAACTATGGAGAAAAGAGATGTTGTTTCTTTCAATACGCTAATTTCAGGATTGGCAGCCAATGGTCATGGGAAGGAAGC
TATCAAGTTAGTATTAACAATGGAGGAAGAAGGCATTGAACCAGACCATGTCACATACATTGGTGTTTTGACTGCATGTAGCCATGCAGGGCTGCTGAAAGAAGGTAAAA
ACGTCTTTAAGTCAATTAAAGCACCTGCTGTGGATCATTATGCTTGTATGGTTGATTTATTAGGAAGAGCAGGTGAATTAGATGAAGCCAAAATATTGATTCAATCTATG
CCGATGGAACCCCATGCTGGTGTTTATGGCTCTTTGTTAAATGCCAGTCGAATTCACAAGAGAGTTGAGTTGGGAGAACTTGCTGCTAACAAGCTCTTTGAGCTTGAACC
TGAAAATCCTGGAAATTATGTTTTACTTTCTAATATATATGCCTCGGCTGGAAGATGGGAAGATGTTAAAAAGGTTAGAGAGAAGATGAGGAACGGAGGTGTGAAGAAAT
CAGTTGGGATGAGTTGGGTGGAATATAGGGGTCAAGTGCATAAGTTCATTGTGGGCGATAGATCACATGAACTATCAAAAGATATCTATAGATTATTGGCTGAACTTGAA
AGGAAGATGAAGAGGGTTGGCTTTGTAGCTGATAAAAGTTGTGCACTTCGAGATGTTGAGGAGGAAGAGAAGGAAGAAATGCTGGGAACTCACAGTGAGAAGTTGGCCAT
TTGTTTTGCTCTCCTTATCAGTGAAGTGGGGACACCAATTAGAGTGCTGTGCAGATCTGAGTATGACACACTGGGAACATCAGTCAAAGCTGCACTTGTTTATCTTGGAA
CTGCCTTAGTAAAGCTTGTATGCCTTGCAACCTTTCTTAACGTGTCAGAGAATGACTCCTTTGACCTATATCAGGAACTGTTGAAAGCGCTTATCGGTTTGATTGATGTC
GCTGGACTTTACTTTGCTTTGACCCAGTTGACTTACCGGAACATATCTCAAAACCATAAGTTTCAGGCCGTTGGACTGGGTTGGGCATTTGCTGATTCTGTTCTGCATAG
ATTGGCACCACTATGGGTCGGGGCCAGAGGACTGGAGTTTACTTGGGATTACATTTTGCAGGGCCTTGAAGCTAATGCAAATCTGGTGTTGAGTATATCTCTAGCTGCAT
TGGGATCTTTGATGTGGCTTCGGAAGAACAAGCCCAAGACTCTAATTCCCATAATATATATCTGTGCATTGATCGTGGCTACCATGCCATCCATTACAAGACTCAAAACT
TGCAATAATCTTACCTATTTAGATGAGCCTTTTGCTGAATCTTTCTCCTTGCAAATTAGTTGGTTTTCCAGCAATTTGCAGGGTTTCAAGCTCCTTAGTTGCTATTCAAA
ATTCAGATATATAGAGAAGATGCATAATGTTAATACCCACTGCACATTGTGTTTTCAGCTACTTAAGGCGAGGAATGGGCTGGCACTTCCCAAAGGTGGTGGGATTTGA
Protein sequence
MYELGAIASKISNISQLRQLHGHFVLNSLHSHNYWVSVLLIICTRLHAHPAYAASIFTSSPSPNASVYSCMLKYYSRMGAHNEVVSLFRCMQSLDLRPQSFVYIYLIKLA
GKYGNIFHANVMKLGHVDDHFIRNAILDMYAKYGQVDLARKLFEQMAERTLADWNSMISGCWKSGNETDAVMLFNMMPARNIITWTAMVTGYAKMGDLDSARRYFDEMPE
RSVVSWNAMLSAYAQNECAEEALKLFHQMLKEGITPDDTTWVATISSCSSIGNPTLADAILRKINQKHSILNNFAKTALLDMHAKFGNLEIARNIFDELGGQRNAVTWNI
MISAYTRVGKVSLARELFDNMPKRDVVSWNSMIAGYAQNGESAKSIELFQEMICCLDIQPDEVTIASVLSACGHIGALKLSNWVLDIVREKNIKLGISGFNSLIFLYSKC
GCVPDAHRIFQTMEKRDVVSFNTLISGLAANGHGKEAIKLVLTMEEEGIEPDHVTYIGVLTACSHAGLLKEGKNVFKSIKAPAVDHYACMVDLLGRAGELDEAKILIQSM
PMEPHAGVYGSLLNASRIHKRVELGELAANKLFELEPENPGNYVLLSNIYASAGRWEDVKKVREKMRNGGVKKSVGMSWVEYRGQVHKFIVGDRSHELSKDIYRLLAELE
RKMKRVGFVADKSCALRDVEEEEKEEMLGTHSEKLAICFALLISEVGTPIRVLCRSEYDTLGTSVKAALVYLGTALVKLVCLATFLNVSENDSFDLYQELLKALIGLIDV
AGLYFALTQLTYRNISQNHKFQAVGLGWAFADSVLHRLAPLWVGARGLEFTWDYILQGLEANANLVLSISLAALGSLMWLRKNKPKTLIPIIYICALIVATMPSITRLKT
CNNLTYLDEPFAESFSLQISWFSSNLQGFKLLSCYSKFRYIEKMHNVNTHCTLCFQLLKARNGLALPKGGGI
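
The protein sequence above corresponds to a codon-by-codon translation of the CDS sequence. The following is a minimal sketch of that relationship, assuming the standard nuclear genetic code; the names `CODON_TABLE` and `translate` are illustrative and are not part of the database or any of its tools.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are ignored; codons containing
    characters other than A/C/G/T are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First nine codons of the CDS listed above.
print(translate("ATGTATGAATTGGGCGCCATAGCTTCC"))  # prints MYELGAIAS
```

Passing the full CDS sequence (line breaks included) to `translate` should give a string that can be compared directly against the protein sequence listed above.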