; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS018201 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS018201
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationscaffold56:491284..492753
RNA-Seq ExpressionMS018201
SyntenyMS018201
Gene Ontology termsGO:1900865 - chloroplast RNA modification (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6590206.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]2.0e-23783.06Show/hide
Query:  VTNPNPLNFNPFLGAFVASIAPENGLFLYNQMLRHPSSSSHNHYTFTYALKACSSLHQTHKGLEIHGHLVKSGHLSDIFIQNSLLHFYIVDGDVPSASRI
        +TNP P  FNP LGA V S+APENGLFLYNQMLRHP  SSHNHYTFTYALKAC  LH+THKGLEIH  L+KSGHLSDIFIQNSLLHFYIVDGDVPSASR+
Subjt:  VTNPNPLNFNPFLGAFVASIAPENGLFLYNQMLRHPSSSSHNHYTFTYALKACSSLHQTHKGLEIHGHLVKSGHLSDIFIQNSLLHFYIVDGDVPSASRI

Query:  FASMPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVRPNSATLVTALSACSSLGYLKLGKAIHGLRLRSVDEESVSLDNALLDFYVKCGSLRSAENLFV
        F S+PDPDVVSWTSIISGLSKLGF+EEALGKFLSMNV PNSATLV+ALSACSSL  +K+GKAIHGL+LRS++EESV+LDNALLDFYV+CGSLR A+NLF 
Subjt:  FASMPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVRPNSATLVTALSACSSLGYLKLGKAIHGLRLRSVDEESVSLDNALLDFYVKCGSLRSAENLFV

Query:  KMPNRDVVSWTTIIGGYAQSGLCEEAVRIFQNMVQEGEAEPNEATLINVLSACSSMSALNLGQWIHSYIISRPDVILDGNIGNALINMYVKCGSMEMAVR
        +MP RDVVSWTT+IGGYA +GLCEEAVR+FQNMV   EA PNEATLINVLSACSSMSAL+LGQW+HSYI SR DVI+DGNIGNALINMYVKCGSM+ A+ 
Subjt:  KMPNRDVVSWTTIIGGYAQSGLCEEAVRIFQNMVQEGEAEPNEATLINVLSACSSMSALNLGQWIHSYIISRPDVILDGNIGNALINMYVKCGSMEMAVR

Query:  IFKTVEHRDIISWSTVITGLAMNGLGKQAFVLFSLMLVHGVSPDDITFLGLLSGCSHGGLINQGLMVFKAMKDVYNVAPQMRHYGCMVDMYGRAGLLDEA
        IFKTVEH+DIISWST+I+GLAMNG GKQAF LFSLMLVHG++PD ITFL LLS CSHGGLINQGLMVF+AMKDVYNVAP+MRHY CMVDMYG+AGLLDEA
Subjt:  IFKTVEHRDIISWSTVITGLAMNGLGKQAFVLFSLMLVHGVSPDDITFLGLLSGCSHGGLINQGLMVFKAMKDVYNVAPQMRHYGCMVDMYGRAGLLDEA

Query:  EAFMKEMPVEAEGPVWGALLNACQIHGNEKKYEKVREWLVRTRSKGVTVGTLALLSNTYASCDRWKDANEVRDTMRSRGLKKMAGCSWIE
        EAF+KEMPVEAEGPVWGALL+ACQ+HGNE +YEKVR+WL+   SK +TVGT ALLSNTYASCDRW DANEVRD MRSRGLKKMAGCSW E
Subjt:  EAFMKEMPVEAEGPVWGALLNACQIHGNEKKYEKVREWLVRTRSKGVTVGTLALLSNTYASCDRWKDANEVRDTMRSRGLKKMAGCSWIE

XP_011660133.1 pentatricopeptide repeat-containing protein At1g08070, chloroplastic isoform X1 [Cucumis sativus]2.6e-23782.86Show/hide
Query:  VTNPNPLNFNPFLGAFVASIAPENGLFLYNQMLRHPSSSSHNHYTFTYALKACSSLHQTHKGLEIHGHLVKSGHLSDIFIQNSLLHFYIVDGDVPSASRI
        +TNP P  FNP LG+ V SI PENGLFLYNQMLR+P  SSHNH+TFTYALKAC  LHQT KGLEIH HL+KSGHLSDIFIQNSLLHFYI+DGDV SAS I
Subjt:  VTNPNPLNFNPFLGAFVASIAPENGLFLYNQMLRHPSSSSHNHYTFTYALKACSSLHQTHKGLEIHGHLVKSGHLSDIFIQNSLLHFYIVDGDVPSASRI

Query:  FASMPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVRPNSATLVTALSACSSLGYLKLGKAIHGLRLRSVDEESVSLDNALLDFYVKCGSLRSAENLFV
        F S+PDPDVVSWTSIISGLSKLGFE+EAL KFLSMNVRPNS TLVTALSACSSL  LKLGKAIHGLR+R+++EE+V L+NALLDFYV+C  LRSAENLF 
Subjt:  FASMPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVRPNSATLVTALSACSSLGYLKLGKAIHGLRLRSVDEESVSLDNALLDFYVKCGSLRSAENLFV

Query:  KMPNRDVVSWTTIIGGYAQSGLCEEAVRIFQNMVQEGEAEPNEATLINVLSACSSMSALNLGQWIHSYIISRPDVILDGNIGNALINMYVKCGSMEMAVR
        KMP RDVVSWTT+IGGYAQSGLCEEAVR+FQNMV  GEA PNEATL+NVLSACSS+SAL+LGQW+HSYI SR DVI+DGN+GNALINMYVKCG+MEMA+ 
Subjt:  KMPNRDVVSWTTIIGGYAQSGLCEEAVRIFQNMVQEGEAEPNEATLINVLSACSSMSALNLGQWIHSYIISRPDVILDGNIGNALINMYVKCGSMEMAVR

Query:  IFKTVEHRDIISWSTVITGLAMNGLGKQAFVLFSLMLVHGVSPDDITFLGLLSGCSHGGLINQGLMVFKAMKDVYNVAPQMRHYGCMVDMYGRAGLLDEA
        IFK +EH+DI+SWST+I+GLAMNGLGKQAFVLFSLMLVHGVSPDDITFLGLLS CSHGGLINQG+MVF+AMKDVYN++PQMRHY CMVDMYG+AGLLDEA
Subjt:  IFKTVEHRDIISWSTVITGLAMNGLGKQAFVLFSLMLVHGVSPDDITFLGLLSGCSHGGLINQGLMVFKAMKDVYNVAPQMRHYGCMVDMYGRAGLLDEA

Query:  EAFMKEMPVEAEGPVWGALLNACQIHGNEKKYEKVREWLVRTRSKGVTVGTLALLSNTYASCDRWKDANEVRDTMRSRGLKKMAGCSWIE
        EAF+KEMP+EAEGPVWGALL+ACQ+HGNEKKYEKVREWL+   SKGVTVGT ALLSNTYA CDRW DAN+VR  MRSRGLKKMAG SWIE
Subjt:  EAFMKEMPVEAEGPVWGALLNACQIHGNEKKYEKVREWLVRTRSKGVTVGTLALLSNTYASCDRWKDANEVRDTMRSRGLKKMAGCSWIE

XP_022154335.1 pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like [Momordica charantia]2.7e-28299.39Show/hide
Query:  VTNPNPLNFNPFLGAFVASIAPENGLFLYNQMLRHPSSSSHNHYTFTYALKACSSLHQTHKGLEIHGHLVKSGHLSDIFIQNSLLHFYIVDGDVPSASRI
        VTNPNPLNFNPFLGAFVASIAPENGLFLYNQMLRHPSSSSHNHYTFTYALKACSSLHQTHKGLEIHGHLVKSGHLSDIFIQNSLLHFYIVDGDVPSASRI
Subjt:  VTNPNPLNFNPFLGAFVASIAPENGLFLYNQMLRHPSSSSHNHYTFTYALKACSSLHQTHKGLEIHGHLVKSGHLSDIFIQNSLLHFYIVDGDVPSASRI

Query:  FASMPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVRPNSATLVTALSACSSLGYLKLGKAIHGLRLRSVDEESVSLDNALLDFYVKCGSLRSAENLFV
        FASMPDPDVVSWTSIISGLSKLGFEEEAL KFLSMNVRPNSATLVTALSACSSLGYLKLGKAIHGLRLRSVDEESVSLDNALLDFYVKCGSLRSAENLFV
Subjt:  FASMPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVRPNSATLVTALSACSSLGYLKLGKAIHGLRLRSVDEESVSLDNALLDFYVKCGSLRSAENLFV

Query:  KMPNRDVVSWTTIIGGYAQSGLCEEAVRIFQNMVQEGEAEPNEATLINVLSACSSMSALNLGQWIHSYIISRPDVILDGNIGNALINMYVKCGSMEMAVR
        KMPNRDVVSWTTIIGGYAQSGLCEEAVRIFQNMVQEGEAEPNEATLINVLSACSSMSALNLGQWIHSYIISRPDVIL+GNIGNALINMYVKCGSMEMAVR
Subjt:  KMPNRDVVSWTTIIGGYAQSGLCEEAVRIFQNMVQEGEAEPNEATLINVLSACSSMSALNLGQWIHSYIISRPDVILDGNIGNALINMYVKCGSMEMAVR

Query:  IFKTVEHRDIISWSTVITGLAMNGLGKQAFVLFSLMLVHGVSPDDITFLGLLSGCSHGGLINQGLMVFKAMKDVYNVAPQMRHYGCMVDMYGRAGLLDEA
        IFKTVEHRDIIS STVITGLAMNGLGKQAFVLFSLMLVHGVSPDDITFLGLLSGCSHGGLINQGLMVFKAMKDVYNVAPQMRHYGCMVDMYGRAGLLDEA
Subjt:  IFKTVEHRDIISWSTVITGLAMNGLGKQAFVLFSLMLVHGVSPDDITFLGLLSGCSHGGLINQGLMVFKAMKDVYNVAPQMRHYGCMVDMYGRAGLLDEA

Query:  EAFMKEMPVEAEGPVWGALLNACQIHGNEKKYEKVREWLVRTRSKGVTVGTLALLSNTYASCDRWKDANEVRDTMRSRGLKKMAGCSWIE
        EAFMKEMPVEAEGPVWGALLNACQIHGNEKKYEKVREWLVRTRSKGVTVGTLALLSNTYASCDRWKDANEVRDTMRSRGLKKMAGCSWIE
Subjt:  EAFMKEMPVEAEGPVWGALLNACQIHGNEKKYEKVREWLVRTRSKGVTVGTLALLSNTYASCDRWKDANEVRDTMRSRGLKKMAGCSWIE

XP_022960642.1 pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like isoform X1 [Cucurbita moschata]6.9e-23883.27Show/hide
Query:  VTNPNPLNFNPFLGAFVASIAPENGLFLYNQMLRHPSSSSHNHYTFTYALKACSSLHQTHKGLEIHGHLVKSGHLSDIFIQNSLLHFYIVDGDVPSASRI
        +TNP P  FNP LGA V S+APENGLFLYNQMLRHP  SSHNHYTFTYALKAC  LH+THKGLEIH  L+KSGHLSDIFIQNSLLHFYIVDGDVPSASR+
Subjt:  VTNPNPLNFNPFLGAFVASIAPENGLFLYNQMLRHPSSSSHNHYTFTYALKACSSLHQTHKGLEIHGHLVKSGHLSDIFIQNSLLHFYIVDGDVPSASRI

Query:  FASMPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVRPNSATLVTALSACSSLGYLKLGKAIHGLRLRSVDEESVSLDNALLDFYVKCGSLRSAENLFV
        F S+PDPDVVSWTSIISGLSKLGF+EEALGKFLSMNV PNSATLV+ALSACSSL  +K+GKAIHGL+LRS++EESV+LDNALLDFYV+CGSLR A+NLF 
Subjt:  FASMPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVRPNSATLVTALSACSSLGYLKLGKAIHGLRLRSVDEESVSLDNALLDFYVKCGSLRSAENLFV

Query:  KMPNRDVVSWTTIIGGYAQSGLCEEAVRIFQNMVQEGEAEPNEATLINVLSACSSMSALNLGQWIHSYIISRPDVILDGNIGNALINMYVKCGSMEMAVR
        +MP RDVVSWTT+IGGYA +GLCEEAVR+FQNMV   EA PNEATLINVLSACSSMSAL+LGQW+HSYI SR DVI+DGNIGNALINMYVKCGSM+ A+ 
Subjt:  KMPNRDVVSWTTIIGGYAQSGLCEEAVRIFQNMVQEGEAEPNEATLINVLSACSSMSALNLGQWIHSYIISRPDVILDGNIGNALINMYVKCGSMEMAVR

Query:  IFKTVEHRDIISWSTVITGLAMNGLGKQAFVLFSLMLVHGVSPDDITFLGLLSGCSHGGLINQGLMVFKAMKDVYNVAPQMRHYGCMVDMYGRAGLLDEA
        IFKTVEH+DIISWST+I+GLAMNG GKQAF LFSLMLVHG++PD ITFL LLS CSHGGLINQGLMVF+AMKDVYNVAP+MRHY CMVDMYG+AGLLDEA
Subjt:  IFKTVEHRDIISWSTVITGLAMNGLGKQAFVLFSLMLVHGVSPDDITFLGLLSGCSHGGLINQGLMVFKAMKDVYNVAPQMRHYGCMVDMYGRAGLLDEA

Query:  EAFMKEMPVEAEGPVWGALLNACQIHGNEKKYEKVREWLVRTRSKGVTVGTLALLSNTYASCDRWKDANEVRDTMRSRGLKKMAGCSWIE
        EAF+KEMPVEAEGPVWGALL+ACQ+HGNE +YEKVR+WL+   SK +TVGT ALLSNTYASCDRW DANEVRD MRSRGLKKMAGCSWIE
Subjt:  EAFMKEMPVEAEGPVWGALLNACQIHGNEKKYEKVREWLVRTRSKGVTVGTLALLSNTYASCDRWKDANEVRDTMRSRGLKKMAGCSWIE

XP_038878297.1 pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like [Benincasa hispida]1.1e-23883.2Show/hide
Query:  NPNPLNFNPFLGAFVASIAPENGLFLYNQMLRHPSSSSHNHYTFTYALKACSSLHQTHKGLEIHGHLVKSGHLSDIFIQNSLLHFYIVDGDVPSASRIFA
        NP P   NP LG+ V S++PENGLFLYNQMLR+P  SSHNH+TFTYALKAC  LH+T KGLEIH HL+KSGHLSDIF+QNSLLHFYI+DGDVPSASRIF 
Subjt:  NPNPLNFNPFLGAFVASIAPENGLFLYNQMLRHPSSSSHNHYTFTYALKACSSLHQTHKGLEIHGHLVKSGHLSDIFIQNSLLHFYIVDGDVPSASRIFA

Query:  SMPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVRPNSATLVTALSACSSLGYLKLGKAIHGLRLRSVDEESVSLDNALLDFYVKCGSLRSAENLFVKM
        S+PDPDV+SWTSIISGLSKLGFE+EALGKFLSMNVRPNS TLVTALSACSSL  LKLGKAIHGLRLRS++EE+VSLDNALLDFYV+CG LRSAE LF +M
Subjt:  SMPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVRPNSATLVTALSACSSLGYLKLGKAIHGLRLRSVDEESVSLDNALLDFYVKCGSLRSAENLFVKM

Query:  PNRDVVSWTTIIGGYAQSGLCEEAVRIFQNMVQEGEAEPNEATLINVLSACSSMSALNLGQWIHSYIISRPDVILDGNIGNALINMYVKCGSMEMAVRIF
        P RDVVSWTT+IGGYAQ GLCEEAVR+FQNMV  GEA PNEATLINVLSACSS+SAL+LGQW+HSYI SR DVI+DGN+GNALINMYVKCG+MEMA+ IF
Subjt:  PNRDVVSWTTIIGGYAQSGLCEEAVRIFQNMVQEGEAEPNEATLINVLSACSSMSALNLGQWIHSYIISRPDVILDGNIGNALINMYVKCGSMEMAVRIF

Query:  KTVEHRDIISWSTVITGLAMNGLGKQAFVLFSLMLVHGVSPDDITFLGLLSGCSHGGLINQGLMVFKAMKDVYNVAPQMRHYGCMVDMYGRAGLLDEAEA
        K +EH+DIISWST+I+GLAMNGLG QAF LFSLMLVHG+SPDDITFL LLS CSHGGLINQGLMVF+AMKDVYN++PQMRHY CMVD+YG+AGLLDEAEA
Subjt:  KTVEHRDIISWSTVITGLAMNGLGKQAFVLFSLMLVHGVSPDDITFLGLLSGCSHGGLINQGLMVFKAMKDVYNVAPQMRHYGCMVDMYGRAGLLDEAEA

Query:  FMKEMPVEAEGPVWGALLNACQIHGNEKKYEKVREWLVRTRSKGVTVGTLALLSNTYASCDRWKDANEVRDTMRSRGLKKMAGCSWIE
        F+KEMP+EAEG VWGALL+ACQIHGNEKKYEKV+EWL+   SKGVTVGT ALLSNTYASCDRW DANEVRDTMRS+GLKKMAGCSWIE
Subjt:  FMKEMPVEAEGPVWGALLNACQIHGNEKKYEKVREWLVRTRSKGVTVGTLALLSNTYASCDRWKDANEVRDTMRSRGLKKMAGCSWIE

TrEMBL top hitse value%identityAlignment
A0A0A0LXJ1 Uncharacterized protein1.3e-23782.86Show/hide
Query:  VTNPNPLNFNPFLGAFVASIAPENGLFLYNQMLRHPSSSSHNHYTFTYALKACSSLHQTHKGLEIHGHLVKSGHLSDIFIQNSLLHFYIVDGDVPSASRI
        +TNP P  FNP LG+ V SI PENGLFLYNQMLR+P  SSHNH+TFTYALKAC  LHQT KGLEIH HL+KSGHLSDIFIQNSLLHFYI+DGDV SAS I
Subjt:  VTNPNPLNFNPFLGAFVASIAPENGLFLYNQMLRHPSSSSHNHYTFTYALKACSSLHQTHKGLEIHGHLVKSGHLSDIFIQNSLLHFYIVDGDVPSASRI

Query:  FASMPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVRPNSATLVTALSACSSLGYLKLGKAIHGLRLRSVDEESVSLDNALLDFYVKCGSLRSAENLFV
        F S+PDPDVVSWTSIISGLSKLGFE+EAL KFLSMNVRPNS TLVTALSACSSL  LKLGKAIHGLR+R+++EE+V L+NALLDFYV+C  LRSAENLF 
Subjt:  FASMPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVRPNSATLVTALSACSSLGYLKLGKAIHGLRLRSVDEESVSLDNALLDFYVKCGSLRSAENLFV

Query:  KMPNRDVVSWTTIIGGYAQSGLCEEAVRIFQNMVQEGEAEPNEATLINVLSACSSMSALNLGQWIHSYIISRPDVILDGNIGNALINMYVKCGSMEMAVR
        KMP RDVVSWTT+IGGYAQSGLCEEAVR+FQNMV  GEA PNEATL+NVLSACSS+SAL+LGQW+HSYI SR DVI+DGN+GNALINMYVKCG+MEMA+ 
Subjt:  KMPNRDVVSWTTIIGGYAQSGLCEEAVRIFQNMVQEGEAEPNEATLINVLSACSSMSALNLGQWIHSYIISRPDVILDGNIGNALINMYVKCGSMEMAVR

Query:  IFKTVEHRDIISWSTVITGLAMNGLGKQAFVLFSLMLVHGVSPDDITFLGLLSGCSHGGLINQGLMVFKAMKDVYNVAPQMRHYGCMVDMYGRAGLLDEA
        IFK +EH+DI+SWST+I+GLAMNGLGKQAFVLFSLMLVHGVSPDDITFLGLLS CSHGGLINQG+MVF+AMKDVYN++PQMRHY CMVDMYG+AGLLDEA
Subjt:  IFKTVEHRDIISWSTVITGLAMNGLGKQAFVLFSLMLVHGVSPDDITFLGLLSGCSHGGLINQGLMVFKAMKDVYNVAPQMRHYGCMVDMYGRAGLLDEA

Query:  EAFMKEMPVEAEGPVWGALLNACQIHGNEKKYEKVREWLVRTRSKGVTVGTLALLSNTYASCDRWKDANEVRDTMRSRGLKKMAGCSWIE
        EAF+KEMP+EAEGPVWGALL+ACQ+HGNEKKYEKVREWL+   SKGVTVGT ALLSNTYA CDRW DAN+VR  MRSRGLKKMAG SWIE
Subjt:  EAFMKEMPVEAEGPVWGALLNACQIHGNEKKYEKVREWLVRTRSKGVTVGTLALLSNTYASCDRWKDANEVRDTMRSRGLKKMAGCSWIE

A0A1S4DY27 pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like isoform X15.3e-23682.65Show/hide
Query:  VTNPNPLNFNPFLGAFVASIAPENGLFLYNQMLRHPSSSSHNHYTFTYALKACSSLHQTHKGLEIHGHLVKSGHLSDIFIQNSLLHFYIVDGDVPSASRI
        +TNP P  FNP LG+ V SI+PENGLFLYNQML +P  SSHNH+TFTYALKAC  LHQT KGLEIH HL+KSGHLSDIFIQNSLLHFYI+ GDV SAS I
Subjt:  VTNPNPLNFNPFLGAFVASIAPENGLFLYNQMLRHPSSSSHNHYTFTYALKACSSLHQTHKGLEIHGHLVKSGHLSDIFIQNSLLHFYIVDGDVPSASRI

Query:  FASMPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVRPNSATLVTALSACSSLGYLKLGKAIHGLRLRSVDEESVSLDNALLDFYVKCGSLRSAENLFV
        F S+P+PDVVSWTSIISG SKLGFE+EALGKFLSMNVRPNS TLVTALSACSSL  LKLGKAIHGLRLR+++EE+VSL+NALLDFYV+C  LRSAENLF 
Subjt:  FASMPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVRPNSATLVTALSACSSLGYLKLGKAIHGLRLRSVDEESVSLDNALLDFYVKCGSLRSAENLFV

Query:  KMPNRDVVSWTTIIGGYAQSGLCEEAVRIFQNMVQEGEAEPNEATLINVLSACSSMSALNLGQWIHSYIISRPDVILDGNIGNALINMYVKCGSMEMAVR
        KM  RDVVSWTT+IGGYAQSGLCEEAVR+FQNMV  GEA PNEATL+NVLSACSS+SAL+LGQW+HSYI SR DVI+DGN+GNALINMYVKCG+MEMA+ 
Subjt:  KMPNRDVVSWTTIIGGYAQSGLCEEAVRIFQNMVQEGEAEPNEATLINVLSACSSMSALNLGQWIHSYIISRPDVILDGNIGNALINMYVKCGSMEMAVR

Query:  IFKTVEHRDIISWSTVITGLAMNGLGKQAFVLFSLMLVHGVSPDDITFLGLLSGCSHGGLINQGLMVFKAMKDVYNVAPQMRHYGCMVDMYGRAGLLDEA
        IFK +EH+DIISWSTVI+GLAMNGLGKQAFVLFSLMLVHG+SPDDITFLGLLS CSHGGLINQG+MVF+AMKDVYN++PQ+RHY CMVDMYG+AGLLDEA
Subjt:  IFKTVEHRDIISWSTVITGLAMNGLGKQAFVLFSLMLVHGVSPDDITFLGLLSGCSHGGLINQGLMVFKAMKDVYNVAPQMRHYGCMVDMYGRAGLLDEA

Query:  EAFMKEMPVEAEGPVWGALLNACQIHGNEKKYEKVREWLVRTRSKGVTVGTLALLSNTYASCDRWKDANEVRDTMRSRGLKKMAGCSWIE
        EAF+KEMP+EAEGPVWGALL+ACQIHGNEKKYEKVRE L+   SKGVTVG  ALLSNTYASCDRW DAN+VR  MRSRGLKKMAGCSWIE
Subjt:  EAFMKEMPVEAEGPVWGALLNACQIHGNEKKYEKVREWLVRTRSKGVTVGTLALLSNTYASCDRWKDANEVRDTMRSRGLKKMAGCSWIE

A0A5D3D5L6 Pentatricopeptide repeat-containing protein5.3e-23682.65Show/hide
Query:  VTNPNPLNFNPFLGAFVASIAPENGLFLYNQMLRHPSSSSHNHYTFTYALKACSSLHQTHKGLEIHGHLVKSGHLSDIFIQNSLLHFYIVDGDVPSASRI
        +TNP P  FNP LG+ V SI+PENGLFLYNQML +P  SSHNH+TFTYALKAC  LHQT KGLEIH HL+KSGHLSDIFIQNSLLHFYI++GDV SAS I
Subjt:  VTNPNPLNFNPFLGAFVASIAPENGLFLYNQMLRHPSSSSHNHYTFTYALKACSSLHQTHKGLEIHGHLVKSGHLSDIFIQNSLLHFYIVDGDVPSASRI

Query:  FASMPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVRPNSATLVTALSACSSLGYLKLGKAIHGLRLRSVDEESVSLDNALLDFYVKCGSLRSAENLFV
        F S+P+PDVVSWTSIISGLSKLGFE+EALGKFLSMNVRPNS TLVTALSACSSL  LKLGKAIHGLRLR+++EE+VSL+NALLDFYV+C  LRSAENLF 
Subjt:  FASMPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVRPNSATLVTALSACSSLGYLKLGKAIHGLRLRSVDEESVSLDNALLDFYVKCGSLRSAENLFV

Query:  KMPNRDVVSWTTIIGGYAQSGLCEEAVRIFQNMVQEGEAEPNEATLINVLSACSSMSALNLGQWIHSYIISRPDVILDGNIGNALINMYVKCGSMEMAVR
        KM  RDVVSWTT+IGGYAQSGLCEEAVR+FQNMV  GEA PNEATL+NVLSACSS+SAL+LGQW+HSYI SR DVI+DGN+GNALINMYVKCG+MEMA+ 
Subjt:  KMPNRDVVSWTTIIGGYAQSGLCEEAVRIFQNMVQEGEAEPNEATLINVLSACSSMSALNLGQWIHSYIISRPDVILDGNIGNALINMYVKCGSMEMAVR

Query:  IFKTVEHRDIISWSTVITGLAMNGLGKQAFVLFSLMLVHGVSPDDITFLGLLSGCSHGGLINQGLMVFKAMKDVYNVAPQMRHYGCMVDMYGRAGLLDEA
        IF  +EH+DIISWSTVI+GLAMNGLGKQAFVLFSLMLVHG+SPDDITFLGLLS CSHGGLINQG+MVF+AMKDVYN++PQ+RHY CMVDMYG+AGLLDEA
Subjt:  IFKTVEHRDIISWSTVITGLAMNGLGKQAFVLFSLMLVHGVSPDDITFLGLLSGCSHGGLINQGLMVFKAMKDVYNVAPQMRHYGCMVDMYGRAGLLDEA

Query:  EAFMKEMPVEAEGPVWGALLNACQIHGNEKKYEKVREWLVRTRSKGVTVGTLALLSNTYASCDRWKDANEVRDTMRSRGLKKMAGCSWIE
        EAF+KEMP+EAEGPVWGALL+ACQIHGNEKKYEKVRE L+   SKGVTVG  ALLSNTYASCDRW DAN+VR  MRSRGLKKMAGCSWIE
Subjt:  EAFMKEMPVEAEGPVWGALLNACQIHGNEKKYEKVREWLVRTRSKGVTVGTLALLSNTYASCDRWKDANEVRDTMRSRGLKKMAGCSWIE

A0A6J1DJC1 pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like1.3e-28299.39Show/hide
Query:  VTNPNPLNFNPFLGAFVASIAPENGLFLYNQMLRHPSSSSHNHYTFTYALKACSSLHQTHKGLEIHGHLVKSGHLSDIFIQNSLLHFYIVDGDVPSASRI
        VTNPNPLNFNPFLGAFVASIAPENGLFLYNQMLRHPSSSSHNHYTFTYALKACSSLHQTHKGLEIHGHLVKSGHLSDIFIQNSLLHFYIVDGDVPSASRI
Subjt:  VTNPNPLNFNPFLGAFVASIAPENGLFLYNQMLRHPSSSSHNHYTFTYALKACSSLHQTHKGLEIHGHLVKSGHLSDIFIQNSLLHFYIVDGDVPSASRI

Query:  FASMPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVRPNSATLVTALSACSSLGYLKLGKAIHGLRLRSVDEESVSLDNALLDFYVKCGSLRSAENLFV
        FASMPDPDVVSWTSIISGLSKLGFEEEAL KFLSMNVRPNSATLVTALSACSSLGYLKLGKAIHGLRLRSVDEESVSLDNALLDFYVKCGSLRSAENLFV
Subjt:  FASMPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVRPNSATLVTALSACSSLGYLKLGKAIHGLRLRSVDEESVSLDNALLDFYVKCGSLRSAENLFV

Query:  KMPNRDVVSWTTIIGGYAQSGLCEEAVRIFQNMVQEGEAEPNEATLINVLSACSSMSALNLGQWIHSYIISRPDVILDGNIGNALINMYVKCGSMEMAVR
        KMPNRDVVSWTTIIGGYAQSGLCEEAVRIFQNMVQEGEAEPNEATLINVLSACSSMSALNLGQWIHSYIISRPDVIL+GNIGNALINMYVKCGSMEMAVR
Subjt:  KMPNRDVVSWTTIIGGYAQSGLCEEAVRIFQNMVQEGEAEPNEATLINVLSACSSMSALNLGQWIHSYIISRPDVILDGNIGNALINMYVKCGSMEMAVR

Query:  IFKTVEHRDIISWSTVITGLAMNGLGKQAFVLFSLMLVHGVSPDDITFLGLLSGCSHGGLINQGLMVFKAMKDVYNVAPQMRHYGCMVDMYGRAGLLDEA
        IFKTVEHRDIIS STVITGLAMNGLGKQAFVLFSLMLVHGVSPDDITFLGLLSGCSHGGLINQGLMVFKAMKDVYNVAPQMRHYGCMVDMYGRAGLLDEA
Subjt:  IFKTVEHRDIISWSTVITGLAMNGLGKQAFVLFSLMLVHGVSPDDITFLGLLSGCSHGGLINQGLMVFKAMKDVYNVAPQMRHYGCMVDMYGRAGLLDEA

Query:  EAFMKEMPVEAEGPVWGALLNACQIHGNEKKYEKVREWLVRTRSKGVTVGTLALLSNTYASCDRWKDANEVRDTMRSRGLKKMAGCSWIE
        EAFMKEMPVEAEGPVWGALLNACQIHGNEKKYEKVREWLVRTRSKGVTVGTLALLSNTYASCDRWKDANEVRDTMRSRGLKKMAGCSWIE
Subjt:  EAFMKEMPVEAEGPVWGALLNACQIHGNEKKYEKVREWLVRTRSKGVTVGTLALLSNTYASCDRWKDANEVRDTMRSRGLKKMAGCSWIE

A0A6J1H7Z9 pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like isoform X13.3e-23883.27Show/hide
Query:  VTNPNPLNFNPFLGAFVASIAPENGLFLYNQMLRHPSSSSHNHYTFTYALKACSSLHQTHKGLEIHGHLVKSGHLSDIFIQNSLLHFYIVDGDVPSASRI
        +TNP P  FNP LGA V S+APENGLFLYNQMLRHP  SSHNHYTFTYALKAC  LH+THKGLEIH  L+KSGHLSDIFIQNSLLHFYIVDGDVPSASR+
Subjt:  VTNPNPLNFNPFLGAFVASIAPENGLFLYNQMLRHPSSSSHNHYTFTYALKACSSLHQTHKGLEIHGHLVKSGHLSDIFIQNSLLHFYIVDGDVPSASRI

Query:  FASMPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVRPNSATLVTALSACSSLGYLKLGKAIHGLRLRSVDEESVSLDNALLDFYVKCGSLRSAENLFV
        F S+PDPDVVSWTSIISGLSKLGF+EEALGKFLSMNV PNSATLV+ALSACSSL  +K+GKAIHGL+LRS++EESV+LDNALLDFYV+CGSLR A+NLF 
Subjt:  FASMPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVRPNSATLVTALSACSSLGYLKLGKAIHGLRLRSVDEESVSLDNALLDFYVKCGSLRSAENLFV

Query:  KMPNRDVVSWTTIIGGYAQSGLCEEAVRIFQNMVQEGEAEPNEATLINVLSACSSMSALNLGQWIHSYIISRPDVILDGNIGNALINMYVKCGSMEMAVR
        +MP RDVVSWTT+IGGYA +GLCEEAVR+FQNMV   EA PNEATLINVLSACSSMSAL+LGQW+HSYI SR DVI+DGNIGNALINMYVKCGSM+ A+ 
Subjt:  KMPNRDVVSWTTIIGGYAQSGLCEEAVRIFQNMVQEGEAEPNEATLINVLSACSSMSALNLGQWIHSYIISRPDVILDGNIGNALINMYVKCGSMEMAVR

Query:  IFKTVEHRDIISWSTVITGLAMNGLGKQAFVLFSLMLVHGVSPDDITFLGLLSGCSHGGLINQGLMVFKAMKDVYNVAPQMRHYGCMVDMYGRAGLLDEA
        IFKTVEH+DIISWST+I+GLAMNG GKQAF LFSLMLVHG++PD ITFL LLS CSHGGLINQGLMVF+AMKDVYNVAP+MRHY CMVDMYG+AGLLDEA
Subjt:  IFKTVEHRDIISWSTVITGLAMNGLGKQAFVLFSLMLVHGVSPDDITFLGLLSGCSHGGLINQGLMVFKAMKDVYNVAPQMRHYGCMVDMYGRAGLLDEA

Query:  EAFMKEMPVEAEGPVWGALLNACQIHGNEKKYEKVREWLVRTRSKGVTVGTLALLSNTYASCDRWKDANEVRDTMRSRGLKKMAGCSWIE
        EAF+KEMPVEAEGPVWGALL+ACQ+HGNE +YEKVR+WL+   SK +TVGT ALLSNTYASCDRW DANEVRD MRSRGLKKMAGCSWIE
Subjt:  EAFMKEMPVEAEGPVWGALLNACQIHGNEKKYEKVREWLVRTRSKGVTVGTLALLSNTYASCDRWKDANEVRDTMRSRGLKKMAGCSWIE

SwissProt top hitse value%identityAlignment
O82380 Pentatricopeptide repeat-containing protein At2g29760, chloroplastic2.7e-9135.11Show/hide
Query:  VTNPNPLNFNPFLGAFVASIAPENGLFLYNQMLRHPSSSSHNHYTFTYALKACSSLHQTHKGLEIHGHLVKSGHLSDIFIQNSLLHFYIVDGDVPSASRI
        +  PN   +N  + A+ +   P   ++ +  M+   S    N YTF + +KA + +     G  +HG  VKS   SD+F+ NSL+H Y   GD+ SA ++
Subjt:  VTNPNPLNFNPFLGAFVASIAPENGLFLYNQMLRHPSSSSHNHYTFTYALKACSSLHQTHKGLEIHGHLVKSGHLSDIFIQNSLLHFYIVDGDVPSASRI

Query:  FASMPDPDVVSWTSIISGLSKLGFEEEALGKFLSM---NVRPNSATLVTALSACSSLGYLKLGKAIHGLRLRSVDEESVSLDNALLDFYVKCGSLRSAEN
        F ++ + DVVSW S+I+G  + G  ++AL  F  M   +V+ +  T+V  LSAC+ +  L+ G+ +      +    +++L NA+LD Y KCGS+  A+ 
Subjt:  FASMPDPDVVSWTSIISGLSKLGFEEEALGKFLSM---NVRPNSATLVTALSACSSLGYLKLGKAIHGLRLRSVDEESVSLDNALLDFYVKCGSLRSAEN

Query:  LFVKMPNRDVVSWTTIIGGYA-------------------------------QSGLCEEAVRIFQNMVQEGEAEPNEATLINVLSACSSMSALNLGQWIH
        LF  M  +D V+WTT++ GYA                               Q+G   EA+ +F  +  +   + N+ TL++ LSAC+ + AL LG+WIH
Subjt:  LFVKMPNRDVVSWTTIIGGYA-------------------------------QSGLCEEAVRIFQNMVQEGEAEPNEATLINVLSACSSMSALNLGQWIH

Query:  SYIISRPDVILDGNIGNALINMYVKCGSMEMAVRIFKTVEHRDIISWSTVITGLAMNGLGKQAFVLFSLMLVHGVSPDDITFLGLLSGCSHGGLINQGLM
        SY I +  + ++ ++ +ALI+MY KCG +E +  +F +VE RD+  WS +I GLAM+G G +A  +F  M    V P+ +TF  +   CSH GL+++   
Subjt:  SYIISRPDVILDGNIGNALINMYVKCGSMEMAVRIFKTVEHRDIISWSTVITGLAMNGLGKQAFVLFSLMLVHGVSPDDITFLGLLSGCSHGGLINQGLM

Query:  VFKAMKDVYNVAPQMRHYGCMVDMYGRAGLLDEAEAFMKEMPVEAEGPVWGALLNACQIHGNEKKYEKVREWLVRTRSKGVTVGTLALLSNTYASCDRWK
        +F  M+  Y + P+ +HY C+VD+ GR+G L++A  F++ MP+     VWGALL AC+IH N    E     L+    +    G   LLSN YA   +W+
Subjt:  VFKAMKDVYNVAPQMRHYGCMVDMYGRAGLLDEAEAFMKEMPVEAEGPVWGALLNACQIHGNEKKYEKVREWLVRTRSKGVTVGTLALLSNTYASCDRWK

Query:  DANEVRDTMRSRGLKKMAGCSWIE
        + +E+R  MR  GLKK  GCS IE
Subjt:  DANEVRDTMRSRGLKKMAGCSWIE

Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic4.1e-10036.76Show/hide
Query:  VTNPNPLNFNPFLGAFVASIAPENGLFLYNQMLRHPSSSSHNHYTFTYALKACSSLHQTHKGLEIHGHLVKSGHLSDIFIQNSLLHFYIVDGD-------
        +  PN L +N        S  P + L LY  M+        N YTF + LK+C+      +G +IHGH++K G   D+++  SL+  Y+ +G        
Subjt:  VTNPNPLNFNPFLGAFVASIAPENGLFLYNQMLRHPSSSSHNHYTFTYALKACSSLHQTHKGLEIHGHLVKSGHLSDIFIQNSLLHFYIVDGD-------

Query:  ------------------------VPSASRIFASMPDPDVVSWTSIISGLSKLGFEEEALGKFLSM---NVRPNSATLVTALSACSSLGYLKLGKAIHGL
                                + +A ++F  +P  DVVSW ++ISG ++ G  +EAL  F  M   NVRP+ +T+VT +SAC+  G ++LG+ +H  
Subjt:  ------------------------VPSASRIFASMPDPDVVSWTSIISGLSKLGFEEEALGKFLSM---NVRPNSATLVTALSACSSLGYLKLGKAIHGL

Query:  RLRSVDEESVSLDNALLDFYVKCGSLRSAENLFVKMPNRDVVSWTTIIGGYAQSGLCEEAVRIFQNMVQEGEAEPNEATLINVLSACSSMSALNLGQWIH
                ++ + NAL+D Y KCG L +A  LF ++P +DV+SW T+IGGY    L +EA+ +FQ M++ GE  PN+ T++++L AC+ + A+++G+WIH
Subjt:  RLRSVDEESVSLDNALLDFYVKCGSLRSAENLFVKMPNRDVVSWTTIIGGYAQSGLCEEAVRIFQNMVQEGEAEPNEATLINVLSACSSMSALNLGQWIH

Query:  SYIISR-PDVILDGNIGNALINMYVKCGSMEMAVRIFKTVEHRDIISWSTVITGLAMNGLGKQAFVLFSLMLVHGVSPDDITFLGLLSGCSHGGLINQGL
         YI  R   V    ++  +LI+MY KCG +E A ++F ++ H+ + SW+ +I G AM+G    +F LFS M   G+ PDDITF+GLLS CSH G+++ G 
Subjt:  SYIISR-PDVILDGNIGNALINMYVKCGSMEMAVRIFKTVEHRDIISWSTVITGLAMNGLGKQAFVLFSLMLVHGVSPDDITFLGLLSGCSHGGLINQGL

Query:  MVFKAMKDVYNVAPQMRHYGCMVDMYGRAGLLDEAEAFMKEMPVEAEGPVWGALLNACQIHGNEKKYEKVREWLVRTRSKGVTVGTLALLSNTYASCDRW
         +F+ M   Y + P++ HYGCM+D+ G +GL  EAE  +  M +E +G +W +LL AC++HGN +  E   E L++   +    G+  LLSN YAS  RW
Subjt:  MVFKAMKDVYNVAPQMRHYGCMVDMYGRAGLLDEAEAFMKEMPVEAEGPVWGALLNACQIHGNEKKYEKVREWLVRTRSKGVTVGTLALLSNTYASCDRW

Query:  KDANEVRDTMRSRGLKKMAGCSWIE
         +  + R  +  +G+KK+ GCS IE
Subjt:  KDANEVRDTMRSRGLKKMAGCSWIE

Q9SIT7 Pentatricopeptide repeat-containing protein At2g136005.0e-9035.44Show/hide
Query:  FNPFLGAFVASIAPENGLFLYNQMLRHPSSSSHNHYTFTYALKACSSLHQTHKGLEIHGHLVKSGHLSDIFIQNSLLHFYIVDGDVPSASRIFASMPDPD
        +N  +  F      E  L  +  M  H      N Y+F   L ACS L+  +KG+++H  + KS  LSD++I ++L+  Y   G+V  A R+F  M D +
Subjt:  FNPFLGAFVASIAPENGLFLYNQMLRHPSSSSHNHYTFTYALKACSSLHQTHKGLEIHGHLVKSGHLSDIFIQNSLLHFYIVDGDVPSASRIFASMPDPD

Query:  VVSWTSIISGLSKLGFEEEALGKF---LSMNVRPNSATLVTALSACSSLGYLKLGKAIHGLRLRSVD-EESVSLDNALLDFYVKC---------------
        VVSW S+I+   + G   EAL  F   L   V P+  TL + +SAC+SL  +K+G+ +HG  +++      + L NA +D Y KC               
Subjt:  VVSWTSIISGLSKLGFEEEALGKF---LSMNVRPNSATLVTALSACSSLGYLKLGKAIHGLRLRSVD-EESVSLDNALLDFYVKC---------------

Query:  ----------------GSLRSAENLFVKMPNRDVVSWTTIIGGYAQSGLCEEAVRIFQNMVQEGEAEPNEATLINVLSACSSMSALNLGQWIHSYIISRP
                         S ++A  +F KM  R+VVSW  +I GY Q+G  EEA+ +F  + +E    P   +  N+L AC+ ++ L+LG   H +++   
Subjt:  ----------------GSLRSAENLFVKMPNRDVVSWTTIIGGYAQSGLCEEAVRIFQNMVQEGEAEPNEATLINVLSACSSMSALNLGQWIHSYIISRP

Query:  DVILDGN-----IGNALINMYVKCGSMEMAVRIFKTVEHRDIISWSTVITGLAMNGLGKQAFVLFSLMLVHGVSPDDITFLGLLSGCSHGGLINQGLMVF
             G      +GN+LI+MYVKCG +E    +F+ +  RD +SW+ +I G A NG G +A  LF  ML  G  PD IT +G+LS C H G + +G   F
Subjt:  DVILDGN-----IGNALINMYVKCGSMEMAVRIFKTVEHRDIISWSTVITGLAMNGLGKQAFVLFSLMLVHGVSPDDITFLGLLSGCSHGGLINQGLMVF

Query:  KAMKDVYNVAPQMRHYGCMVDMYGRAGLLDEAEAFMKEMPVEAEGPVWGALLNACQIHGNEKKYEKVREWLVRTRSKGVTVGTLALLSNTYASCDRWKDA
         +M   + VAP   HY CMVD+ GRAG L+EA++ ++EMP++ +  +WG+LL AC++H N    + V E L+         G   LLSN YA   +W+D 
Subjt:  KAMKDVYNVAPQMRHYGCMVDMYGRAGLLDEAEAFMKEMPVEAEGPVWGALLNACQIHGNEKKYEKVREWLVRTRSKGVTVGTLALLSNTYASCDRWKDA

Query:  NEVRDTMRSRGLKKMAGCSWIE
          VR +MR  G+ K  GCSWI+
Subjt:  NEVRDTMRSRGLKKMAGCSWIE

Q9SJZ3 Pentatricopeptide repeat-containing protein At2g22410, mitochondrial7.5e-9435.31Show/hide
Query:  VTNPNPLNFNPFLGAFVASIAPENGLFLYNQMLRHPSSSSH-NHYTFTYALKACSSLHQTHKGLEIHGHLVKSGHLSDIFIQNSLLHFYIVDGDVPSASR
        + NPN  ++N  +  F  S  P+    LY QMLRH    S  +H+T+    K C+ L  +  G  I GH++K        + N+ +H +   GD+ +A +
Subjt:  VTNPNPLNFNPFLGAFVASIAPENGLFLYNQMLRHPSSSSH-NHYTFTYALKACSSLHQTHKGLEIHGHLVKSGHLSDIFIQNSLLHFYIVDGDVPSASR

Query:  IFASMPDPDVVSWTSIISGLSKLGFEEEALGKFLSM---NVRPNSATLVTALSACSSLGYLKLGKAIHGLRLRSVDEESVSLDNALLDFYVKCGSLRSAE
        +F   P  D+VSW  +I+G  K+G  E+A+  +  M    V+P+  T++  +S+CS LG L  GK  +     +    ++ L NAL+D + KCG +  A 
Subjt:  IFASMPDPDVVSWTSIISGLSKLGFEEEALGKFLSM---NVRPNSATLVTALSACSSLGYLKLGKAIHGLRLRSVDEESVSLDNALLDFYVKCGSLRSAE

Query:  NLFVKMPNRDVVSWTTIIGGYAQSGLCEEAVRIFQNM------------------------------VQEGEAEPNEATLINVLSACSSMSALNLGQWIH
         +F  +  R +VSWTT+I GYA+ GL + + ++F +M                              +Q    +P+E T+I+ LSACS + AL++G WIH
Subjt:  NLFVKMPNRDVVSWTTIIGGYAQSGLCEEAVRIFQNM------------------------------VQEGEAEPNEATLINVLSACSSMSALNLGQWIH

Query:  SYIISRPDVILDGNIGNALINMYVKCGSMEMAVRIFKTVEHRDIISWSTVITGLAMNGLGKQAFVLFSLMLVHGVSPDDITFLGLLSGCSHGGLINQGLM
         Y I +  + L+  +G +L++MY KCG++  A+ +F  ++ R+ ++++ +I GLA++G    A   F+ M+  G++PD+ITF+GLLS C HGG+I  G  
Subjt:  SYIISRPDVILDGNIGNALINMYVKCGSMEMAVRIFKTVEHRDIISWSTVITGLAMNGLGKQAFVLFSLMLVHGVSPDDITFLGLLSGCSHGGLINQGLM

Query:  VFKAMKDVYNVAPQMRHYGCMVDMYGRAGLLDEAEAFMKEMPVEAEGPVWGALLNACQIHGNEKKYEKVREWLVRTRSKGVTVGTLALLSNTYASCDRWK
         F  MK  +N+ PQ++HY  MVD+ GRAGLL+EA+  M+ MP+EA+  VWGALL  C++HGN +  EK  + L+         G   LL   Y   + W+
Subjt:  VFKAMKDVYNVAPQMRHYGCMVDMYGRAGLLDEAEAFMKEMPVEAEGPVWGALLNACQIHGNEKKYEKVREWLVRTRSKGVTVGTLALLSNTYASCDRWK

Query:  DANEVRDTMRSRGLKKMAGCSWIE
        DA   R  M  RG++K+ GCS IE
Subjt:  DANEVRDTMRSRGLKKMAGCSWIE

Q9SZK1 Pentatricopeptide repeat-containing protein At4g380101.4e-9237.78Show/hide
Query:  NFNPFLGAFVASIAPENGLFLYNQMLRHPSSSSHNHYTFTYALKACSSLHQTHKGLEIHGHLVKSGHLSDIFIQNSLLHFYIVDGDVPSASRIFASMPDP
        ++N  L ++     P   +F Y   +   +  S + +TF    KAC       +G +IHG + K G   DI++QNSL+HFY V G+  +A ++F  MP  
Subjt:  NFNPFLGAFVASIAPENGLFLYNQMLRHPSSSSHNHYTFTYALKACSSLHQTHKGLEIHGHLVKSGHLSDIFIQNSLLHFYIVDGDVPSASRIFASMPDP

Query:  DVVSWTSIISGLSKLGFEEEALGKFLSMNVRPNSATLVTALSACSSLGYLKLGKAIHGLRLRSVDEESVSLDNALLDFYVKCGSLRSAENLFVKMPNRDV
        DVVSWT II+G ++ G  +EAL  F  M+V PN AT V  L +   +G L LGK IHGL L+     S+   NAL+D YVKC  L  A  +F ++  +D 
Subjt:  DVVSWTSIISGLSKLGFEEEALGKFLSMNVRPNSATLVTALSACSSLGYLKLGKAIHGLRLRSVDEESVSLDNALLDFYVKCGSLRSAENLFVKMPNRDV

Query:  VSWTTIIGGYAQSGLCEEAVRIFQNMVQEGEAEPNEATLINVLSACSSMSALNLGQWIHSYIISRPDVILDGNIGNALINMYVKCGSMEMAVRIFKTVEH
        VSW ++I G       +EA+ +F  M      +P+   L +VLSAC+S+ A++ G+W+H YI++   +  D +IG A+++MY KCG +E A+ IF  +  
Subjt:  VSWTTIIGGYAQSGLCEEAVRIFQNMVQEGEAEPNEATLINVLSACSSMSALNLGQWIHSYIISRPDVILDGNIGNALINMYVKCGSMEMAVRIFKTVEH

Query:  RDIISWSTVITGLAMNGLGKQAFVLFSLMLVHGVSPDDITFLGLLSGCSHGGLINQGLMVFKAMKD-VYNVAPQMRHYGCMVDMYGRAGLLDEAEAFMKE
        +++ +W+ ++ GLA++G G ++   F  M+  G  P+ +TFL  L+ C H GL+++G   F  MK   YN+ P++ HYGCM+D+  RAGLLDEA   +K 
Subjt:  RDIISWSTVITGLAMNGLGKQAFVLFSLMLVHGVSPDDITFLGLLSGCSHGGLINQGLMVFKAMKD-VYNVAPQMRHYGCMVDMYGRAGLLDEAEAFMKE

Query:  MPVEAEGPVWGALLNACQIHGN--EKKYEKVREWL-VRTRSKGVTVGTLALLSNTYASCDRWKDANEVRDTMRSRGLKKMAGCSWIE
        MPV+ +  + GA+L+AC+  G   E   E +  +L +     GV V    LLSN +A+  RW D   +R  M+ +G+ K+ G S+IE
Subjt:  MPVEAEGPVWGALLNACQIHGN--EKKYEKVREWL-VRTRSKGVTVGTLALLSNTYASCDRWKDANEVRDTMRSRGLKKMAGCSWIE

Arabidopsis top hitse value%identityAlignment
AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.9e-10136.76Show/hide
Query:  VTNPNPLNFNPFLGAFVASIAPENGLFLYNQMLRHPSSSSHNHYTFTYALKACSSLHQTHKGLEIHGHLVKSGHLSDIFIQNSLLHFYIVDGD-------
        +  PN L +N        S  P + L LY  M+        N YTF + LK+C+      +G +IHGH++K G   D+++  SL+  Y+ +G        
Subjt:  VTNPNPLNFNPFLGAFVASIAPENGLFLYNQMLRHPSSSSHNHYTFTYALKACSSLHQTHKGLEIHGHLVKSGHLSDIFIQNSLLHFYIVDGD-------

Query:  ------------------------VPSASRIFASMPDPDVVSWTSIISGLSKLGFEEEALGKFLSM---NVRPNSATLVTALSACSSLGYLKLGKAIHGL
                                + +A ++F  +P  DVVSW ++ISG ++ G  +EAL  F  M   NVRP+ +T+VT +SAC+  G ++LG+ +H  
Subjt:  ------------------------VPSASRIFASMPDPDVVSWTSIISGLSKLGFEEEALGKFLSM---NVRPNSATLVTALSACSSLGYLKLGKAIHGL

Query:  RLRSVDEESVSLDNALLDFYVKCGSLRSAENLFVKMPNRDVVSWTTIIGGYAQSGLCEEAVRIFQNMVQEGEAEPNEATLINVLSACSSMSALNLGQWIH
                ++ + NAL+D Y KCG L +A  LF ++P +DV+SW T+IGGY    L +EA+ +FQ M++ GE  PN+ T++++L AC+ + A+++G+WIH
Subjt:  RLRSVDEESVSLDNALLDFYVKCGSLRSAENLFVKMPNRDVVSWTTIIGGYAQSGLCEEAVRIFQNMVQEGEAEPNEATLINVLSACSSMSALNLGQWIH

Query:  SYIISR-PDVILDGNIGNALINMYVKCGSMEMAVRIFKTVEHRDIISWSTVITGLAMNGLGKQAFVLFSLMLVHGVSPDDITFLGLLSGCSHGGLINQGL
         YI  R   V    ++  +LI+MY KCG +E A ++F ++ H+ + SW+ +I G AM+G    +F LFS M   G+ PDDITF+GLLS CSH G+++ G 
Subjt:  SYIISR-PDVILDGNIGNALINMYVKCGSMEMAVRIFKTVEHRDIISWSTVITGLAMNGLGKQAFVLFSLMLVHGVSPDDITFLGLLSGCSHGGLINQGL

Query:  MVFKAMKDVYNVAPQMRHYGCMVDMYGRAGLLDEAEAFMKEMPVEAEGPVWGALLNACQIHGNEKKYEKVREWLVRTRSKGVTVGTLALLSNTYASCDRW
         +F+ M   Y + P++ HYGCM+D+ G +GL  EAE  +  M +E +G +W +LL AC++HGN +  E   E L++   +    G+  LLSN YAS  RW
Subjt:  MVFKAMKDVYNVAPQMRHYGCMVDMYGRAGLLDEAEAFMKEMPVEAEGPVWGALLNACQIHGNEKKYEKVREWLVRTRSKGVTVGTLALLSNTYASCDRW

Query:  KDANEVRDTMRSRGLKKMAGCSWIE
         +  + R  +  +G+KK+ GCS IE
Subjt:  KDANEVRDTMRSRGLKKMAGCSWIE

AT2G13600.1 Pentatricopeptide repeat (PPR) superfamily protein3.6e-9135.44Show/hide
Query:  FNPFLGAFVASIAPENGLFLYNQMLRHPSSSSHNHYTFTYALKACSSLHQTHKGLEIHGHLVKSGHLSDIFIQNSLLHFYIVDGDVPSASRIFASMPDPD
        +N  +  F      E  L  +  M  H      N Y+F   L ACS L+  +KG+++H  + KS  LSD++I ++L+  Y   G+V  A R+F  M D +
Subjt:  FNPFLGAFVASIAPENGLFLYNQMLRHPSSSSHNHYTFTYALKACSSLHQTHKGLEIHGHLVKSGHLSDIFIQNSLLHFYIVDGDVPSASRIFASMPDPD

Query:  VVSWTSIISGLSKLGFEEEALGKF---LSMNVRPNSATLVTALSACSSLGYLKLGKAIHGLRLRSVD-EESVSLDNALLDFYVKC---------------
        VVSW S+I+   + G   EAL  F   L   V P+  TL + +SAC+SL  +K+G+ +HG  +++      + L NA +D Y KC               
Subjt:  VVSWTSIISGLSKLGFEEEALGKF---LSMNVRPNSATLVTALSACSSLGYLKLGKAIHGLRLRSVD-EESVSLDNALLDFYVKC---------------

Query:  ----------------GSLRSAENLFVKMPNRDVVSWTTIIGGYAQSGLCEEAVRIFQNMVQEGEAEPNEATLINVLSACSSMSALNLGQWIHSYIISRP
                         S ++A  +F KM  R+VVSW  +I GY Q+G  EEA+ +F  + +E    P   +  N+L AC+ ++ L+LG   H +++   
Subjt:  ----------------GSLRSAENLFVKMPNRDVVSWTTIIGGYAQSGLCEEAVRIFQNMVQEGEAEPNEATLINVLSACSSMSALNLGQWIHSYIISRP

Query:  DVILDGN-----IGNALINMYVKCGSMEMAVRIFKTVEHRDIISWSTVITGLAMNGLGKQAFVLFSLMLVHGVSPDDITFLGLLSGCSHGGLINQGLMVF
             G      +GN+LI+MYVKCG +E    +F+ +  RD +SW+ +I G A NG G +A  LF  ML  G  PD IT +G+LS C H G + +G   F
Subjt:  DVILDGN-----IGNALINMYVKCGSMEMAVRIFKTVEHRDIISWSTVITGLAMNGLGKQAFVLFSLMLVHGVSPDDITFLGLLSGCSHGGLINQGLMVF

Query:  KAMKDVYNVAPQMRHYGCMVDMYGRAGLLDEAEAFMKEMPVEAEGPVWGALLNACQIHGNEKKYEKVREWLVRTRSKGVTVGTLALLSNTYASCDRWKDA
         +M   + VAP   HY CMVD+ GRAG L+EA++ ++EMP++ +  +WG+LL AC++H N    + V E L+         G   LLSN YA   +W+D 
Subjt:  KAMKDVYNVAPQMRHYGCMVDMYGRAGLLDEAEAFMKEMPVEAEGPVWGALLNACQIHGNEKKYEKVREWLVRTRSKGVTVGTLALLSNTYASCDRWKDA

Query:  NEVRDTMRSRGLKKMAGCSWIE
          VR +MR  G+ K  GCSWI+
Subjt:  NEVRDTMRSRGLKKMAGCSWIE

AT2G22410.1 SLOW GROWTH 15.3e-9535.31Show/hide
Query:  VTNPNPLNFNPFLGAFVASIAPENGLFLYNQMLRHPSSSSH-NHYTFTYALKACSSLHQTHKGLEIHGHLVKSGHLSDIFIQNSLLHFYIVDGDVPSASR
        + NPN  ++N  +  F  S  P+    LY QMLRH    S  +H+T+    K C+ L  +  G  I GH++K        + N+ +H +   GD+ +A +
Subjt:  VTNPNPLNFNPFLGAFVASIAPENGLFLYNQMLRHPSSSSH-NHYTFTYALKACSSLHQTHKGLEIHGHLVKSGHLSDIFIQNSLLHFYIVDGDVPSASR

Query:  IFASMPDPDVVSWTSIISGLSKLGFEEEALGKFLSM---NVRPNSATLVTALSACSSLGYLKLGKAIHGLRLRSVDEESVSLDNALLDFYVKCGSLRSAE
        +F   P  D+VSW  +I+G  K+G  E+A+  +  M    V+P+  T++  +S+CS LG L  GK  +     +    ++ L NAL+D + KCG +  A 
Subjt:  IFASMPDPDVVSWTSIISGLSKLGFEEEALGKFLSM---NVRPNSATLVTALSACSSLGYLKLGKAIHGLRLRSVDEESVSLDNALLDFYVKCGSLRSAE

Query:  NLFVKMPNRDVVSWTTIIGGYAQSGLCEEAVRIFQNM------------------------------VQEGEAEPNEATLINVLSACSSMSALNLGQWIH
         +F  +  R +VSWTT+I GYA+ GL + + ++F +M                              +Q    +P+E T+I+ LSACS + AL++G WIH
Subjt:  NLFVKMPNRDVVSWTTIIGGYAQSGLCEEAVRIFQNM------------------------------VQEGEAEPNEATLINVLSACSSMSALNLGQWIH

Query:  SYIISRPDVILDGNIGNALINMYVKCGSMEMAVRIFKTVEHRDIISWSTVITGLAMNGLGKQAFVLFSLMLVHGVSPDDITFLGLLSGCSHGGLINQGLM
         Y I +  + L+  +G +L++MY KCG++  A+ +F  ++ R+ ++++ +I GLA++G    A   F+ M+  G++PD+ITF+GLLS C HGG+I  G  
Subjt:  SYIISRPDVILDGNIGNALINMYVKCGSMEMAVRIFKTVEHRDIISWSTVITGLAMNGLGKQAFVLFSLMLVHGVSPDDITFLGLLSGCSHGGLINQGLM

Query:  VFKAMKDVYNVAPQMRHYGCMVDMYGRAGLLDEAEAFMKEMPVEAEGPVWGALLNACQIHGNEKKYEKVREWLVRTRSKGVTVGTLALLSNTYASCDRWK
         F  MK  +N+ PQ++HY  MVD+ GRAGLL+EA+  M+ MP+EA+  VWGALL  C++HGN +  EK  + L+         G   LL   Y   + W+
Subjt:  VFKAMKDVYNVAPQMRHYGCMVDMYGRAGLLDEAEAFMKEMPVEAEGPVWGALLNACQIHGNEKKYEKVREWLVRTRSKGVTVGTLALLSNTYASCDRWK

Query:  DANEVRDTMRSRGLKKMAGCSWIE
        DA   R  M  RG++K+ GCS IE
Subjt:  DANEVRDTMRSRGLKKMAGCSWIE

AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.9e-9235.11Show/hide
Query:  VTNPNPLNFNPFLGAFVASIAPENGLFLYNQMLRHPSSSSHNHYTFTYALKACSSLHQTHKGLEIHGHLVKSGHLSDIFIQNSLLHFYIVDGDVPSASRI
        +  PN   +N  + A+ +   P   ++ +  M+   S    N YTF + +KA + +     G  +HG  VKS   SD+F+ NSL+H Y   GD+ SA ++
Subjt:  VTNPNPLNFNPFLGAFVASIAPENGLFLYNQMLRHPSSSSHNHYTFTYALKACSSLHQTHKGLEIHGHLVKSGHLSDIFIQNSLLHFYIVDGDVPSASRI

Query:  FASMPDPDVVSWTSIISGLSKLGFEEEALGKFLSM---NVRPNSATLVTALSACSSLGYLKLGKAIHGLRLRSVDEESVSLDNALLDFYVKCGSLRSAEN
        F ++ + DVVSW S+I+G  + G  ++AL  F  M   +V+ +  T+V  LSAC+ +  L+ G+ +      +    +++L NA+LD Y KCGS+  A+ 
Subjt:  FASMPDPDVVSWTSIISGLSKLGFEEEALGKFLSM---NVRPNSATLVTALSACSSLGYLKLGKAIHGLRLRSVDEESVSLDNALLDFYVKCGSLRSAEN

Query:  LFVKMPNRDVVSWTTIIGGYA-------------------------------QSGLCEEAVRIFQNMVQEGEAEPNEATLINVLSACSSMSALNLGQWIH
        LF  M  +D V+WTT++ GYA                               Q+G   EA+ +F  +  +   + N+ TL++ LSAC+ + AL LG+WIH
Subjt:  LFVKMPNRDVVSWTTIIGGYA-------------------------------QSGLCEEAVRIFQNMVQEGEAEPNEATLINVLSACSSMSALNLGQWIH

Query:  SYIISRPDVILDGNIGNALINMYVKCGSMEMAVRIFKTVEHRDIISWSTVITGLAMNGLGKQAFVLFSLMLVHGVSPDDITFLGLLSGCSHGGLINQGLM
        SY I +  + ++ ++ +ALI+MY KCG +E +  +F +VE RD+  WS +I GLAM+G G +A  +F  M    V P+ +TF  +   CSH GL+++   
Subjt:  SYIISRPDVILDGNIGNALINMYVKCGSMEMAVRIFKTVEHRDIISWSTVITGLAMNGLGKQAFVLFSLMLVHGVSPDDITFLGLLSGCSHGGLINQGLM

Query:  VFKAMKDVYNVAPQMRHYGCMVDMYGRAGLLDEAEAFMKEMPVEAEGPVWGALLNACQIHGNEKKYEKVREWLVRTRSKGVTVGTLALLSNTYASCDRWK
        +F  M+  Y + P+ +HY C+VD+ GR+G L++A  F++ MP+     VWGALL AC+IH N    E     L+    +    G   LLSN YA   +W+
Subjt:  VFKAMKDVYNVAPQMRHYGCMVDMYGRAGLLDEAEAFMKEMPVEAEGPVWGALLNACQIHGNEKKYEKVREWLVRTRSKGVTVGTLALLSNTYASCDRWK

Query:  DANEVRDTMRSRGLKKMAGCSWIE
        + +E+R  MR  GLKK  GCS IE
Subjt:  DANEVRDTMRSRGLKKMAGCSWIE

AT4G38010.1 Pentatricopeptide repeat (PPR-like) superfamily protein1.0e-9337.78Show/hide
Query:  NFNPFLGAFVASIAPENGLFLYNQMLRHPSSSSHNHYTFTYALKACSSLHQTHKGLEIHGHLVKSGHLSDIFIQNSLLHFYIVDGDVPSASRIFASMPDP
        ++N  L ++     P   +F Y   +   +  S + +TF    KAC       +G +IHG + K G   DI++QNSL+HFY V G+  +A ++F  MP  
Subjt:  NFNPFLGAFVASIAPENGLFLYNQMLRHPSSSSHNHYTFTYALKACSSLHQTHKGLEIHGHLVKSGHLSDIFIQNSLLHFYIVDGDVPSASRIFASMPDP

Query:  DVVSWTSIISGLSKLGFEEEALGKFLSMNVRPNSATLVTALSACSSLGYLKLGKAIHGLRLRSVDEESVSLDNALLDFYVKCGSLRSAENLFVKMPNRDV
        DVVSWT II+G ++ G  +EAL  F  M+V PN AT V  L +   +G L LGK IHGL L+     S+   NAL+D YVKC  L  A  +F ++  +D 
Subjt:  DVVSWTSIISGLSKLGFEEEALGKFLSMNVRPNSATLVTALSACSSLGYLKLGKAIHGLRLRSVDEESVSLDNALLDFYVKCGSLRSAENLFVKMPNRDV

Query:  VSWTTIIGGYAQSGLCEEAVRIFQNMVQEGEAEPNEATLINVLSACSSMSALNLGQWIHSYIISRPDVILDGNIGNALINMYVKCGSMEMAVRIFKTVEH
        VSW ++I G       +EA+ +F  M      +P+   L +VLSAC+S+ A++ G+W+H YI++   +  D +IG A+++MY KCG +E A+ IF  +  
Subjt:  VSWTTIIGGYAQSGLCEEAVRIFQNMVQEGEAEPNEATLINVLSACSSMSALNLGQWIHSYIISRPDVILDGNIGNALINMYVKCGSMEMAVRIFKTVEH

Query:  RDIISWSTVITGLAMNGLGKQAFVLFSLMLVHGVSPDDITFLGLLSGCSHGGLINQGLMVFKAMKD-VYNVAPQMRHYGCMVDMYGRAGLLDEAEAFMKE
        +++ +W+ ++ GLA++G G ++   F  M+  G  P+ +TFL  L+ C H GL+++G   F  MK   YN+ P++ HYGCM+D+  RAGLLDEA   +K 
Subjt:  RDIISWSTVITGLAMNGLGKQAFVLFSLMLVHGVSPDDITFLGLLSGCSHGGLINQGLMVFKAMKD-VYNVAPQMRHYGCMVDMYGRAGLLDEAEAFMKE

Query:  MPVEAEGPVWGALLNACQIHGN--EKKYEKVREWL-VRTRSKGVTVGTLALLSNTYASCDRWKDANEVRDTMRSRGLKKMAGCSWIE
        MPV+ +  + GA+L+AC+  G   E   E +  +L +     GV V    LLSN +A+  RW D   +R  M+ +G+ K+ G S+IE
Subjt:  MPVEAEGPVWGALLNACQIHGN--EKKYEKVREWL-VRTRSKGVTVGTLALLSNTYASCDRWKDANEVRDTMRSRGLKKMAGCSWIE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
GTCACAAATCCGAACCCTCTTAACTTCAACCCTTTCCTCGGCGCTTTTGTAGCTTCCATCGCTCCTGAAAATGGCCTCTTCCTCTACAACCAAATGCTTCGCCACCCATC
TTCATCTTCCCACAACCACTACACCTTCACCTACGCCCTCAAAGCCTGTTCCTCGCTCCATCAAACCCACAAGGGCCTCGAAATCCACGGCCATCTAGTTAAATCTGGCC
ACCTTTCCGACATCTTCATCCAAAATTCACTGCTCCATTTCTACATTGTCGATGGCGATGTTCCTTCTGCTTCTCGAATCTTCGCGTCCATGCCTGACCCAGATGTGGTT
TCCTGGACATCCATCATTTCGGGGCTCTCCAAGCTGGGTTTTGAGGAGGAAGCTCTAGGTAAGTTCTTGTCTATGAATGTGAGGCCTAATTCTGCTACTCTTGTCACTGC
TTTATCTGCTTGTTCTAGTCTTGGATATCTCAAGCTTGGGAAAGCTATACATGGGCTGAGATTGCGGAGTGTGGATGAGGAAAGCGTTAGTTTGGACAATGCCCTTCTGG
ACTTTTACGTCAAATGTGGGTCCTTGAGGAGTGCAGAGAACCTGTTTGTTAAAATGCCTAATAGAGACGTAGTGTCATGGACTACAATAATTGGGGGTTATGCTCAGAGT
GGATTGTGTGAAGAGGCTGTGAGGATCTTCCAAAACATGGTCCAAGAGGGAGAGGCTGAGCCCAATGAGGCCACTCTCATTAATGTATTATCTGCTTGTTCTTCCATGTC
TGCTCTGAATTTGGGTCAATGGATTCATTCCTATATCATCTCTAGGCCTGATGTGATACTTGATGGAAATATCGGAAATGCTTTAATAAACATGTATGTCAAATGTGGTA
GCATGGAAATGGCAGTTAGGATCTTCAAAACTGTTGAACACAGGGATATCATATCATGGAGCACAGTCATAACTGGGTTAGCCATGAATGGCCTAGGCAAGCAAGCTTTT
GTTCTGTTCTCACTCATGCTAGTTCATGGCGTTTCTCCAGACGACATAACATTTCTTGGCCTGTTATCTGGATGCAGCCATGGTGGGCTGATCAATCAAGGCTTGATGGT
GTTTAAAGCCATGAAAGATGTTTATAATGTTGCACCTCAGATGAGGCATTATGGCTGCATGGTGGACATGTATGGAAGGGCTGGACTTTTAGATGAAGCAGAGGCATTCA
TGAAGGAGATGCCTGTGGAAGCAGAAGGCCCAGTATGGGGAGCTCTGCTGAATGCTTGCCAGATTCATGGGAATGAGAAGAAGTATGAGAAGGTTAGGGAATGGCTGGTT
AGAACTAGAAGCAAGGGGGTGACAGTGGGAACTTTGGCTTTGTTGTCAAATACTTATGCAAGTTGTGACAGATGGAAGGATGCTAATGAAGTGAGAGACACCATGAGAAG
TAGAGGGTTGAAGAAAATGGCGGGATGTAGTTGGATCGAA
mRNA sequenceShow/hide mRNA sequence
GTCACAAATCCGAACCCTCTTAACTTCAACCCTTTCCTCGGCGCTTTTGTAGCTTCCATCGCTCCTGAAAATGGCCTCTTCCTCTACAACCAAATGCTTCGCCACCCATC
TTCATCTTCCCACAACCACTACACCTTCACCTACGCCCTCAAAGCCTGTTCCTCGCTCCATCAAACCCACAAGGGCCTCGAAATCCACGGCCATCTAGTTAAATCTGGCC
ACCTTTCCGACATCTTCATCCAAAATTCACTGCTCCATTTCTACATTGTCGATGGCGATGTTCCTTCTGCTTCTCGAATCTTCGCGTCCATGCCTGACCCAGATGTGGTT
TCCTGGACATCCATCATTTCGGGGCTCTCCAAGCTGGGTTTTGAGGAGGAAGCTCTAGGTAAGTTCTTGTCTATGAATGTGAGGCCTAATTCTGCTACTCTTGTCACTGC
TTTATCTGCTTGTTCTAGTCTTGGATATCTCAAGCTTGGGAAAGCTATACATGGGCTGAGATTGCGGAGTGTGGATGAGGAAAGCGTTAGTTTGGACAATGCCCTTCTGG
ACTTTTACGTCAAATGTGGGTCCTTGAGGAGTGCAGAGAACCTGTTTGTTAAAATGCCTAATAGAGACGTAGTGTCATGGACTACAATAATTGGGGGTTATGCTCAGAGT
GGATTGTGTGAAGAGGCTGTGAGGATCTTCCAAAACATGGTCCAAGAGGGAGAGGCTGAGCCCAATGAGGCCACTCTCATTAATGTATTATCTGCTTGTTCTTCCATGTC
TGCTCTGAATTTGGGTCAATGGATTCATTCCTATATCATCTCTAGGCCTGATGTGATACTTGATGGAAATATCGGAAATGCTTTAATAAACATGTATGTCAAATGTGGTA
GCATGGAAATGGCAGTTAGGATCTTCAAAACTGTTGAACACAGGGATATCATATCATGGAGCACAGTCATAACTGGGTTAGCCATGAATGGCCTAGGCAAGCAAGCTTTT
GTTCTGTTCTCACTCATGCTAGTTCATGGCGTTTCTCCAGACGACATAACATTTCTTGGCCTGTTATCTGGATGCAGCCATGGTGGGCTGATCAATCAAGGCTTGATGGT
GTTTAAAGCCATGAAAGATGTTTATAATGTTGCACCTCAGATGAGGCATTATGGCTGCATGGTGGACATGTATGGAAGGGCTGGACTTTTAGATGAAGCAGAGGCATTCA
TGAAGGAGATGCCTGTGGAAGCAGAAGGCCCAGTATGGGGAGCTCTGCTGAATGCTTGCCAGATTCATGGGAATGAGAAGAAGTATGAGAAGGTTAGGGAATGGCTGGTT
AGAACTAGAAGCAAGGGGGTGACAGTGGGAACTTTGGCTTTGTTGTCAAATACTTATGCAAGTTGTGACAGATGGAAGGATGCTAATGAAGTGAGAGACACCATGAGAAG
TAGAGGGTTGAAGAAAATGGCGGGATGTAGTTGGATCGAA
Protein sequenceShow/hide protein sequence
VTNPNPLNFNPFLGAFVASIAPENGLFLYNQMLRHPSSSSHNHYTFTYALKACSSLHQTHKGLEIHGHLVKSGHLSDIFIQNSLLHFYIVDGDVPSASRIFASMPDPDVV
SWTSIISGLSKLGFEEEALGKFLSMNVRPNSATLVTALSACSSLGYLKLGKAIHGLRLRSVDEESVSLDNALLDFYVKCGSLRSAENLFVKMPNRDVVSWTTIIGGYAQS
GLCEEAVRIFQNMVQEGEAEPNEATLINVLSACSSMSALNLGQWIHSYIISRPDVILDGNIGNALINMYVKCGSMEMAVRIFKTVEHRDIISWSTVITGLAMNGLGKQAF
VLFSLMLVHGVSPDDITFLGLLSGCSHGGLINQGLMVFKAMKDVYNVAPQMRHYGCMVDMYGRAGLLDEAEAFMKEMPVEAEGPVWGALLNACQIHGNEKKYEKVREWLV
RTRSKGVTVGTLALLSNTYASCDRWKDANEVRDTMRSRGLKKMAGCSWIE