; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

BhiUN452G8 (gene) of Wax gourd (B227) v1 genome

Gene IDBhiUN452G8
OrganismBenincasa hispida cv. B227 (Wax gourd (B227) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationContig452:60444..62251
RNA-Seq ExpressionBhiUN452G8
SyntenyBhiUN452G8
Gene Ontology termsGO:0009451 - RNA modification (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004139593.1 pentatricopeptide repeat-containing protein At1g05750, chloroplastic [Cucumis sativus]1.3e-26386.45Show/hide
Query:  MSSSPSYTAIPSQLQQYPNPPSSIPLSNPTKLNFPRSPNSSHRNISSKFPANSIDPIVLWTSSLARYCRNGQLSEAATEFTRMRLAGVEPNHVTFITLLS
        MSS PS+TA PSQLQ  P  PSSIPLSNPTKLNFPRSPNS HRNISSKF  NS+DPIVLWTSSLARYCRNGQLSEAA EFTRMRLAGVEPNH+TFITLLS
Subjt:  MSSSPSYTAIPSQLQQYPNPPSSIPLSNPTKLNFPRSPNSSHRNISSKFPANSIDPIVLWTSSLARYCRNGQLSEAATEFTRMRLAGVEPNHVTFITLLS

Query:  GCTDFPSESLFFGSSLHGYARKLGLDTGHVMVGTALVDMYAKCAQSRLARKVFDYLGMKNSVTWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALIN
         C DFPSES FF SSLHGYA K GLDTGHVMVGTAL+DMY+KCAQ   ARKVF  LG+KNSV+WNTML+G+ RNGEIELAI LFDEMPTRDAISWTALIN
Subjt:  GCTDFPSESLFFGSSLHGYARKLGLDTGHVMVGTALVDMYAKCAQSRLARKVFDYLGMKNSVTWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALIN

Query:  GLLKQGFSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLVDMYSRCGCIEFARQVFEKMPKRTLVSWNSII
        GLLK G+SEQALECFHQMQ SG+  DYVSIIAVLAACADLGALTLGLWV+RFVM QEFKDNI+ISNSL+DMYSRCGCIEFARQVF KM KRTLVSWNSII
Subjt:  GLLKQGFSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLVDMYSRCGCIEFARQVFEKMPKRTLVSWNSII

Query:  VGFAVNGFADESLEFFDAMQNEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRIHKITPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLG
        VGFAVNGFADESLEFF AMQ EGFKPDGVSYTGALTACSHAGLVNKGLELFDNMK +HKITPRIEHYGCIVDLYGRAGRLEDALN+IEEMPMKPNEVVLG
Subjt:  VGFAVNGFADESLEFFDAMQNEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRIHKITPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLG

Query:  SLLAACRTYGDVSLAEKLMKHLSKLDPRGDSNYVLLSNIYAAIGRWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHADADNIYSMLELLF
        SLLAACRT+GDV+LAE+LMKHL KLDP GD+ YVLLSNIYAAIG+W+GAN VRRTMKARGVQKKPG SSVEIDGKVHEFVAGD YHADADNIYSML+LL 
Subjt:  SLLAACRTYGDVSLAEKLMKHLSKLDPRGDSNYVLLSNIYAAIGRWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHADADNIYSMLELLF

Query:  HELKIYGYVPDTNIILNTKEFSKD
        HELK+ GYVP ++ ILNTKE +KD
Subjt:  HELKIYGYVPDTNIILNTKEFSKD

XP_008458940.1 PREDICTED: pentatricopeptide repeat-containing protein At1g05750, chloroplastic [Cucumis melo]9.6e-26286.1Show/hide
Query:  MSSSPSYTAIPSQLQQYPNPPSSIPLSNPTKLNFPRSPNSSHRNISSKFPANSIDPIVLWTSSLARYCRNGQLSEAATEFTRMRLAGVEPNHVTFITLLS
        MSS PS+ A PSQLQQ P+  SSIPLSNPTK+NFPRSP S H NI SKF ANS+ PIV WTSS+ARYC NGQL EAA EFTRMRLAGVEPNH+TFITLLS
Subjt:  MSSSPSYTAIPSQLQQYPNPPSSIPLSNPTKLNFPRSPNSSHRNISSKFPANSIDPIVLWTSSLARYCRNGQLSEAATEFTRMRLAGVEPNHVTFITLLS

Query:  GCTDFPSESLFFGSSLHGYARKLGLDTGHVMVGTALVDMYAKCAQSRLARKVFDYLGMKNSVTWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALIN
        GC DFPSES FF SSLHGYA K GLDTGHVMVGTAL+DMY+KC+Q  LA+KVFDYLG+KNSV+WNTML+G+ RNGEIELAI LFDEMPTRDAISWTALIN
Subjt:  GCTDFPSESLFFGSSLHGYARKLGLDTGHVMVGTALVDMYAKCAQSRLARKVFDYLGMKNSVTWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALIN

Query:  GLLKQGFSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLVDMYSRCGCIEFARQVFEKMPKRTLVSWNSII
        GLLK G+SEQALECFHQMQ SG+  DYVSIIAVLAACADLGALT GLWVNRFVMQQEFKDN+RISNSL+DMYSRCGCIEFARQVF KM KRTLVSWNSII
Subjt:  GLLKQGFSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLVDMYSRCGCIEFARQVFEKMPKRTLVSWNSII

Query:  VGFAVNGFADESLEFFDAMQNEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRIHKITPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLG
        VGFA NGFADESLEFF AMQ EGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKR+HKITP IEHYGCIVDLYGRAGRLEDA NVIEEMPMKPNEVVLG
Subjt:  VGFAVNGFADESLEFFDAMQNEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRIHKITPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLG

Query:  SLLAACRTYGDVSLAEKLMKHLSKLDPRGDSNYVLLSNIYAAIGRWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHADADNIYSMLELLF
        SLLAACRT+GDV LAE+LMKH+ KLD  GDSNYVLLSNIYAAIG+WEGANKVRRTMKARGVQKK G SSVEIDGKVHEFVAGDKYHADADNIYSML+LLF
Subjt:  SLLAACRTYGDVSLAEKLMKHLSKLDPRGDSNYVLLSNIYAAIGRWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHADADNIYSMLELLF

Query:  HELKIYGYVPDTNIILNTKEFSKDH
        HELK+ GYVPDT+IILNTK+ +KDH
Subjt:  HELKIYGYVPDTNIILNTKEFSKDH

XP_022142716.1 pentatricopeptide repeat-containing protein At1g05750, chloroplastic [Momordica charantia]1.9e-27087.81Show/hide
Query:  MSSSPSYTAIPSQLQQYPNPPSSIPLSNPTKLNFPRSPNSSHRNISSKFPANSIDPIVLWTSSLARYCRNGQLSEAATEFTRMRLAGVEPNHVTFITLLS
        MSS P+ TA   QLQQYPNP +SIPL NP  +NFPRS NSS+R+ISSK   NSIDPIVLWTSS+ARYCRNGQL+EAA EFT MRLAGVEPNHVT ITLLS
Subjt:  MSSSPSYTAIPSQLQQYPNPPSSIPLSNPTKLNFPRSPNSSHRNISSKFPANSIDPIVLWTSSLARYCRNGQLSEAATEFTRMRLAGVEPNHVTFITLLS

Query:  GCTDFPSESLFFGSSLHGYARKLGLDTGHVMVGTALVDMYAKCAQSRLARKVFDYLGMKNSVTWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALIN
        GC DFPSESL+FGSSLHGYARKLGLDT HVMVGT++VDMYAKCAQ  LAR+VFDYL MKNSV+WNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALIN
Subjt:  GCTDFPSESLFFGSSLHGYARKLGLDTGHVMVGTALVDMYAKCAQSRLARKVFDYLGMKNSVTWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALIN

Query:  GLLKQGFSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLVDMYSRCGCIEFARQVFEKMPKRTLVSWNSII
        GLLKQG+SEQALECFHQMQCSGI+PDYVSIIAVLAACADLG LTLGLWVNRFVMQQEFKDNIRISNSL+DMYSRCGCI FARQVFE+M KRTLVSWNSII
Subjt:  GLLKQGFSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLVDMYSRCGCIEFARQVFEKMPKRTLVSWNSII

Query:  VGFAVNGFADESLEFFDAMQNEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRIHKITPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLG
        VG+A NGFADESLEFFDAMQ EGFKPD VSYTGALTACSHAGLVNKGLELFDNMKR+H+I PRIEHYGCIVDLYGRAGRLEDAL+VIE+MPMKPNEVVLG
Subjt:  VGFAVNGFADESLEFFDAMQNEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRIHKITPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLG

Query:  SLLAACRTYGDVSLAEKLMKHLSKLDPRGDSNYVLLSNIYAAIGRWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHADADNIYSMLELLF
        SLLAACRT+GDVSLAE+LMKHLSKLDP GDSNYVLLSNIYAAIG+WEGANKVRRTMKARGVQKKPG SSVEIDGKVHEFVAGDKYHADAD+IYSML+LL 
Subjt:  SLLAACRTYGDVSLAEKLMKHLSKLDPRGDSNYVLLSNIYAAIGRWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHADADNIYSMLELLF

Query:  HELKIYGYVPDTNIILNTKEFSKDH
        HELKI G VP+T   LNTKE SKDH
Subjt:  HELKIYGYVPDTNIILNTKEFSKDH

XP_022967078.1 pentatricopeptide repeat-containing protein At1g05750, chloroplastic [Cucurbita maxima]2.9e-25883.49Show/hide
Query:  MSSSPSYTAIP--SQLQQYPNPPSSIPLSNPTKLNFPRSPNSSHRNISSKFPANSIDPIVLWTSSLARYCRNGQLSEAATEFTRMRLAGVEPNHVTFITL
        MSS PS+T IP   QLQQY NPPS IP SNP+ L+FPR+PNSS          N I PIVLWTSS+ARYCRN QL+EAA EFTRMRLAGVEPNH+TFITL
Subjt:  MSSSPSYTAIP--SQLQQYPNPPSSIPLSNPTKLNFPRSPNSSHRNISSKFPANSIDPIVLWTSSLARYCRNGQLSEAATEFTRMRLAGVEPNHVTFITL

Query:  LSGCTDFPSESLFFGSSLHGYARKLGLDTGHVMVGTALVDMYAKCAQSRLARKVFDYLGMKNSVTWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTAL
        LSGC DFPS SL FG+SLHGY RKLGLDTGHVMVGTAL+ MYAKCAQ  LAR VFDYL MKNSVTWNTMLDGY RNGEIELAI+LFDEMPTRDAISWTAL
Subjt:  LSGCTDFPSESLFFGSSLHGYARKLGLDTGHVMVGTALVDMYAKCAQSRLARKVFDYLGMKNSVTWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTAL

Query:  INGLLKQGFSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLVDMYSRCGCIEFARQVFEKMPKRTLVSWNS
        ING LKQG+SEQALECFH+MQCSGIEPDYVSIIAVLAACADLGAL+ GLWVNRF+MQQEFKDNIRISNSL+DMYSRCGCIEFARQVF+KM K TLVSWNS
Subjt:  INGLLKQGFSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLVDMYSRCGCIEFARQVFEKMPKRTLVSWNS

Query:  IIVGFAVNGFADESLEFFDAMQNEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRIHKITPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVV
        +IVGFA+NGFADESLEFFDAMQ EGF  DGVSYTGALTACSHAGLVNKGLELFDNMKR+H+ITPRIEHYGCIVDLY RAGRL++ALNVIE MPMKPNEVV
Subjt:  IIVGFAVNGFADESLEFFDAMQNEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRIHKITPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVV

Query:  LGSLLAACRTYGDVSLAEKLMKHLSKLDPRGDSNYVLLSNIYAAIGRWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHADADNIYSMLEL
        LGSLLAACRT+GDVSLAE+L+K+L +LDP GDS+YVLLSNIYAA+GRWEGANKVRRTMKARGVQKKPG SS+EIDGKVHEFVAGDKYH DADNIYSMLE+
Subjt:  LGSLLAACRTYGDVSLAEKLMKHLSKLDPRGDSNYVLLSNIYAAIGRWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHADADNIYSMLEL

Query:  LFHELKIYGYVPDTNIILNTKEFSKDH
        LFHELKIYGYVP+T   +N  E SK++
Subjt:  LFHELKIYGYVPDTNIILNTKEFSKDH

XP_038877228.1 pentatricopeptide repeat-containing protein At1g05750, chloroplastic [Benincasa hispida]4.8e-309100Show/hide
Query:  MSSSPSYTAIPSQLQQYPNPPSSIPLSNPTKLNFPRSPNSSHRNISSKFPANSIDPIVLWTSSLARYCRNGQLSEAATEFTRMRLAGVEPNHVTFITLLS
        MSSSPSYTAIPSQLQQYPNPPSSIPLSNPTKLNFPRSPNSSHRNISSKFPANSIDPIVLWTSSLARYCRNGQLSEAATEFTRMRLAGVEPNHVTFITLLS
Subjt:  MSSSPSYTAIPSQLQQYPNPPSSIPLSNPTKLNFPRSPNSSHRNISSKFPANSIDPIVLWTSSLARYCRNGQLSEAATEFTRMRLAGVEPNHVTFITLLS

Query:  GCTDFPSESLFFGSSLHGYARKLGLDTGHVMVGTALVDMYAKCAQSRLARKVFDYLGMKNSVTWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALIN
        GCTDFPSESLFFGSSLHGYARKLGLDTGHVMVGTALVDMYAKCAQSRLARKVFDYLGMKNSVTWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALIN
Subjt:  GCTDFPSESLFFGSSLHGYARKLGLDTGHVMVGTALVDMYAKCAQSRLARKVFDYLGMKNSVTWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALIN

Query:  GLLKQGFSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLVDMYSRCGCIEFARQVFEKMPKRTLVSWNSII
        GLLKQGFSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLVDMYSRCGCIEFARQVFEKMPKRTLVSWNSII
Subjt:  GLLKQGFSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLVDMYSRCGCIEFARQVFEKMPKRTLVSWNSII

Query:  VGFAVNGFADESLEFFDAMQNEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRIHKITPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLG
        VGFAVNGFADESLEFFDAMQNEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRIHKITPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLG
Subjt:  VGFAVNGFADESLEFFDAMQNEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRIHKITPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLG

Query:  SLLAACRTYGDVSLAEKLMKHLSKLDPRGDSNYVLLSNIYAAIGRWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHADADNIYSMLELLF
        SLLAACRTYGDVSLAEKLMKHLSKLDPRGDSNYVLLSNIYAAIGRWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHADADNIYSMLELLF
Subjt:  SLLAACRTYGDVSLAEKLMKHLSKLDPRGDSNYVLLSNIYAAIGRWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHADADNIYSMLELLF

Query:  HELKIYGYVPDTNIILNTKEFSKDH
        HELKIYGYVPDTNIILNTKEFSKDH
Subjt:  HELKIYGYVPDTNIILNTKEFSKDH

TrEMBL top hitse value%identityAlignment
A0A0A0LYD6 Uncharacterized protein6.5e-26486.45Show/hide
Query:  MSSSPSYTAIPSQLQQYPNPPSSIPLSNPTKLNFPRSPNSSHRNISSKFPANSIDPIVLWTSSLARYCRNGQLSEAATEFTRMRLAGVEPNHVTFITLLS
        MSS PS+TA PSQLQ  P  PSSIPLSNPTKLNFPRSPNS HRNISSKF  NS+DPIVLWTSSLARYCRNGQLSEAA EFTRMRLAGVEPNH+TFITLLS
Subjt:  MSSSPSYTAIPSQLQQYPNPPSSIPLSNPTKLNFPRSPNSSHRNISSKFPANSIDPIVLWTSSLARYCRNGQLSEAATEFTRMRLAGVEPNHVTFITLLS

Query:  GCTDFPSESLFFGSSLHGYARKLGLDTGHVMVGTALVDMYAKCAQSRLARKVFDYLGMKNSVTWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALIN
         C DFPSES FF SSLHGYA K GLDTGHVMVGTAL+DMY+KCAQ   ARKVF  LG+KNSV+WNTML+G+ RNGEIELAI LFDEMPTRDAISWTALIN
Subjt:  GCTDFPSESLFFGSSLHGYARKLGLDTGHVMVGTALVDMYAKCAQSRLARKVFDYLGMKNSVTWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALIN

Query:  GLLKQGFSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLVDMYSRCGCIEFARQVFEKMPKRTLVSWNSII
        GLLK G+SEQALECFHQMQ SG+  DYVSIIAVLAACADLGALTLGLWV+RFVM QEFKDNI+ISNSL+DMYSRCGCIEFARQVF KM KRTLVSWNSII
Subjt:  GLLKQGFSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLVDMYSRCGCIEFARQVFEKMPKRTLVSWNSII

Query:  VGFAVNGFADESLEFFDAMQNEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRIHKITPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLG
        VGFAVNGFADESLEFF AMQ EGFKPDGVSYTGALTACSHAGLVNKGLELFDNMK +HKITPRIEHYGCIVDLYGRAGRLEDALN+IEEMPMKPNEVVLG
Subjt:  VGFAVNGFADESLEFFDAMQNEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRIHKITPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLG

Query:  SLLAACRTYGDVSLAEKLMKHLSKLDPRGDSNYVLLSNIYAAIGRWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHADADNIYSMLELLF
        SLLAACRT+GDV+LAE+LMKHL KLDP GD+ YVLLSNIYAAIG+W+GAN VRRTMKARGVQKKPG SSVEIDGKVHEFVAGD YHADADNIYSML+LL 
Subjt:  SLLAACRTYGDVSLAEKLMKHLSKLDPRGDSNYVLLSNIYAAIGRWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHADADNIYSMLELLF

Query:  HELKIYGYVPDTNIILNTKEFSKD
        HELK+ GYVP ++ ILNTKE +KD
Subjt:  HELKIYGYVPDTNIILNTKEFSKD

A0A1S3C956 pentatricopeptide repeat-containing protein At1g05750, chloroplastic4.7e-26286.1Show/hide
Query:  MSSSPSYTAIPSQLQQYPNPPSSIPLSNPTKLNFPRSPNSSHRNISSKFPANSIDPIVLWTSSLARYCRNGQLSEAATEFTRMRLAGVEPNHVTFITLLS
        MSS PS+ A PSQLQQ P+  SSIPLSNPTK+NFPRSP S H NI SKF ANS+ PIV WTSS+ARYC NGQL EAA EFTRMRLAGVEPNH+TFITLLS
Subjt:  MSSSPSYTAIPSQLQQYPNPPSSIPLSNPTKLNFPRSPNSSHRNISSKFPANSIDPIVLWTSSLARYCRNGQLSEAATEFTRMRLAGVEPNHVTFITLLS

Query:  GCTDFPSESLFFGSSLHGYARKLGLDTGHVMVGTALVDMYAKCAQSRLARKVFDYLGMKNSVTWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALIN
        GC DFPSES FF SSLHGYA K GLDTGHVMVGTAL+DMY+KC+Q  LA+KVFDYLG+KNSV+WNTML+G+ RNGEIELAI LFDEMPTRDAISWTALIN
Subjt:  GCTDFPSESLFFGSSLHGYARKLGLDTGHVMVGTALVDMYAKCAQSRLARKVFDYLGMKNSVTWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALIN

Query:  GLLKQGFSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLVDMYSRCGCIEFARQVFEKMPKRTLVSWNSII
        GLLK G+SEQALECFHQMQ SG+  DYVSIIAVLAACADLGALT GLWVNRFVMQQEFKDN+RISNSL+DMYSRCGCIEFARQVF KM KRTLVSWNSII
Subjt:  GLLKQGFSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLVDMYSRCGCIEFARQVFEKMPKRTLVSWNSII

Query:  VGFAVNGFADESLEFFDAMQNEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRIHKITPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLG
        VGFA NGFADESLEFF AMQ EGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKR+HKITP IEHYGCIVDLYGRAGRLEDA NVIEEMPMKPNEVVLG
Subjt:  VGFAVNGFADESLEFFDAMQNEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRIHKITPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLG

Query:  SLLAACRTYGDVSLAEKLMKHLSKLDPRGDSNYVLLSNIYAAIGRWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHADADNIYSMLELLF
        SLLAACRT+GDV LAE+LMKH+ KLD  GDSNYVLLSNIYAAIG+WEGANKVRRTMKARGVQKK G SSVEIDGKVHEFVAGDKYHADADNIYSML+LLF
Subjt:  SLLAACRTYGDVSLAEKLMKHLSKLDPRGDSNYVLLSNIYAAIGRWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHADADNIYSMLELLF

Query:  HELKIYGYVPDTNIILNTKEFSKDH
        HELK+ GYVPDT+IILNTK+ +KDH
Subjt:  HELKIYGYVPDTNIILNTKEFSKDH

A0A5A7UJB6 Pentatricopeptide repeat-containing protein4.7e-26286.1Show/hide
Query:  MSSSPSYTAIPSQLQQYPNPPSSIPLSNPTKLNFPRSPNSSHRNISSKFPANSIDPIVLWTSSLARYCRNGQLSEAATEFTRMRLAGVEPNHVTFITLLS
        MSS PS+ A PSQLQQ P+  SSIPLSNPTK+NFPRSP S H NI SKF ANS+ PIV WTSS+ARYC NGQL EAA EFTRMRLAGVEPNH+TFITLLS
Subjt:  MSSSPSYTAIPSQLQQYPNPPSSIPLSNPTKLNFPRSPNSSHRNISSKFPANSIDPIVLWTSSLARYCRNGQLSEAATEFTRMRLAGVEPNHVTFITLLS

Query:  GCTDFPSESLFFGSSLHGYARKLGLDTGHVMVGTALVDMYAKCAQSRLARKVFDYLGMKNSVTWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALIN
        GC DFPSES FF SSLHGYA K GLDTGHVMVGTAL+DMY+KC+Q  LA+KVFDYLG+KNSV+WNTML+G+ RNGEIELAI LFDEMPTRDAISWTALIN
Subjt:  GCTDFPSESLFFGSSLHGYARKLGLDTGHVMVGTALVDMYAKCAQSRLARKVFDYLGMKNSVTWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALIN

Query:  GLLKQGFSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLVDMYSRCGCIEFARQVFEKMPKRTLVSWNSII
        GLLK G+SEQALECFHQMQ SG+  DYVSIIAVLAACADLGALT GLWVNRFVMQQEFKDN+RISNSL+DMYSRCGCIEFARQVF KM KRTLVSWNSII
Subjt:  GLLKQGFSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLVDMYSRCGCIEFARQVFEKMPKRTLVSWNSII

Query:  VGFAVNGFADESLEFFDAMQNEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRIHKITPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLG
        VGFA NGFADESLEFF AMQ EGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKR+HKITP IEHYGCIVDLYGRAGRLEDA NVIEEMPMKPNEVVLG
Subjt:  VGFAVNGFADESLEFFDAMQNEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRIHKITPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLG

Query:  SLLAACRTYGDVSLAEKLMKHLSKLDPRGDSNYVLLSNIYAAIGRWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHADADNIYSMLELLF
        SLLAACRT+GDV LAE+LMKH+ KLD  GDSNYVLLSNIYAAIG+WEGANKVRRTMKARGVQKK G SSVEIDGKVHEFVAGDKYHADADNIYSML+LLF
Subjt:  SLLAACRTYGDVSLAEKLMKHLSKLDPRGDSNYVLLSNIYAAIGRWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHADADNIYSMLELLF

Query:  HELKIYGYVPDTNIILNTKEFSKDH
        HELK+ GYVPDT+IILNTK+ +KDH
Subjt:  HELKIYGYVPDTNIILNTKEFSKDH

A0A6J1CN07 pentatricopeptide repeat-containing protein At1g05750, chloroplastic9.4e-27187.81Show/hide
Query:  MSSSPSYTAIPSQLQQYPNPPSSIPLSNPTKLNFPRSPNSSHRNISSKFPANSIDPIVLWTSSLARYCRNGQLSEAATEFTRMRLAGVEPNHVTFITLLS
        MSS P+ TA   QLQQYPNP +SIPL NP  +NFPRS NSS+R+ISSK   NSIDPIVLWTSS+ARYCRNGQL+EAA EFT MRLAGVEPNHVT ITLLS
Subjt:  MSSSPSYTAIPSQLQQYPNPPSSIPLSNPTKLNFPRSPNSSHRNISSKFPANSIDPIVLWTSSLARYCRNGQLSEAATEFTRMRLAGVEPNHVTFITLLS

Query:  GCTDFPSESLFFGSSLHGYARKLGLDTGHVMVGTALVDMYAKCAQSRLARKVFDYLGMKNSVTWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALIN
        GC DFPSESL+FGSSLHGYARKLGLDT HVMVGT++VDMYAKCAQ  LAR+VFDYL MKNSV+WNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALIN
Subjt:  GCTDFPSESLFFGSSLHGYARKLGLDTGHVMVGTALVDMYAKCAQSRLARKVFDYLGMKNSVTWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALIN

Query:  GLLKQGFSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLVDMYSRCGCIEFARQVFEKMPKRTLVSWNSII
        GLLKQG+SEQALECFHQMQCSGI+PDYVSIIAVLAACADLG LTLGLWVNRFVMQQEFKDNIRISNSL+DMYSRCGCI FARQVFE+M KRTLVSWNSII
Subjt:  GLLKQGFSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLVDMYSRCGCIEFARQVFEKMPKRTLVSWNSII

Query:  VGFAVNGFADESLEFFDAMQNEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRIHKITPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLG
        VG+A NGFADESLEFFDAMQ EGFKPD VSYTGALTACSHAGLVNKGLELFDNMKR+H+I PRIEHYGCIVDLYGRAGRLEDAL+VIE+MPMKPNEVVLG
Subjt:  VGFAVNGFADESLEFFDAMQNEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRIHKITPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLG

Query:  SLLAACRTYGDVSLAEKLMKHLSKLDPRGDSNYVLLSNIYAAIGRWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHADADNIYSMLELLF
        SLLAACRT+GDVSLAE+LMKHLSKLDP GDSNYVLLSNIYAAIG+WEGANKVRRTMKARGVQKKPG SSVEIDGKVHEFVAGDKYHADAD+IYSML+LL 
Subjt:  SLLAACRTYGDVSLAEKLMKHLSKLDPRGDSNYVLLSNIYAAIGRWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHADADNIYSMLELLF

Query:  HELKIYGYVPDTNIILNTKEFSKDH
        HELKI G VP+T   LNTKE SKDH
Subjt:  HELKIYGYVPDTNIILNTKEFSKDH

A0A6J1HU31 pentatricopeptide repeat-containing protein At1g05750, chloroplastic1.4e-25883.49Show/hide
Query:  MSSSPSYTAIP--SQLQQYPNPPSSIPLSNPTKLNFPRSPNSSHRNISSKFPANSIDPIVLWTSSLARYCRNGQLSEAATEFTRMRLAGVEPNHVTFITL
        MSS PS+T IP   QLQQY NPPS IP SNP+ L+FPR+PNSS          N I PIVLWTSS+ARYCRN QL+EAA EFTRMRLAGVEPNH+TFITL
Subjt:  MSSSPSYTAIP--SQLQQYPNPPSSIPLSNPTKLNFPRSPNSSHRNISSKFPANSIDPIVLWTSSLARYCRNGQLSEAATEFTRMRLAGVEPNHVTFITL

Query:  LSGCTDFPSESLFFGSSLHGYARKLGLDTGHVMVGTALVDMYAKCAQSRLARKVFDYLGMKNSVTWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTAL
        LSGC DFPS SL FG+SLHGY RKLGLDTGHVMVGTAL+ MYAKCAQ  LAR VFDYL MKNSVTWNTMLDGY RNGEIELAI+LFDEMPTRDAISWTAL
Subjt:  LSGCTDFPSESLFFGSSLHGYARKLGLDTGHVMVGTALVDMYAKCAQSRLARKVFDYLGMKNSVTWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTAL

Query:  INGLLKQGFSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLVDMYSRCGCIEFARQVFEKMPKRTLVSWNS
        ING LKQG+SEQALECFH+MQCSGIEPDYVSIIAVLAACADLGAL+ GLWVNRF+MQQEFKDNIRISNSL+DMYSRCGCIEFARQVF+KM K TLVSWNS
Subjt:  INGLLKQGFSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLVDMYSRCGCIEFARQVFEKMPKRTLVSWNS

Query:  IIVGFAVNGFADESLEFFDAMQNEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRIHKITPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVV
        +IVGFA+NGFADESLEFFDAMQ EGF  DGVSYTGALTACSHAGLVNKGLELFDNMKR+H+ITPRIEHYGCIVDLY RAGRL++ALNVIE MPMKPNEVV
Subjt:  IIVGFAVNGFADESLEFFDAMQNEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRIHKITPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVV

Query:  LGSLLAACRTYGDVSLAEKLMKHLSKLDPRGDSNYVLLSNIYAAIGRWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHADADNIYSMLEL
        LGSLLAACRT+GDVSLAE+L+K+L +LDP GDS+YVLLSNIYAA+GRWEGANKVRRTMKARGVQKKPG SS+EIDGKVHEFVAGDKYH DADNIYSMLE+
Subjt:  LGSLLAACRTYGDVSLAEKLMKHLSKLDPRGDSNYVLLSNIYAAIGRWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHADADNIYSMLEL

Query:  LFHELKIYGYVPDTNIILNTKEFSKDH
        LFHELKIYGYVP+T   +N  E SK++
Subjt:  LFHELKIYGYVPDTNIILNTKEFSKDH

SwissProt top hitse value%identityAlignment
O23337 Pentatricopeptide repeat-containing protein At4g148201.1e-9837.45Show/hide
Query:  IVLWTSSLARYCRNGQLSEAATEFTRMRLAGVEPNHVTFITLLSGCTDFPSESLFFGSSLHGY--ARKLGLDTGHVMVGTALVDMYAKCAQSRLARKVFD
        +V W + + RYCR G + EA   F  M+ + V P+ +    ++S C    + ++ +  +++ +     + +DT H++  TALV MYA      +AR+ F 
Subjt:  IVLWTSSLARYCRNGQLSEAATEFTRMRLAGVEPNHVTFITLLSGCTDFPSESLFFGSSLHGY--ARKLGLDTGHVMVGTALVDMYAKCAQSRLARKVFD

Query:  YLGMKNSVTWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALINGLLKQGFSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVM
         + ++N      M+ GY++ G ++ A  +FD+   +D + WT +I+  ++  + ++AL  F +M CSGI+PD VS+ +V++ACA+LG L    WV+  + 
Subjt:  YLGMKNSVTWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALINGLLKQGFSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVM

Query:  QQEFKDNIRISNSLVDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQNEGFKPDGVSYTGALTACSHAGLVNKGLELFDNM
            +  + I+N+L++MY++CG ++  R VFEKMP+R +VSW+S+I   +++G A ++L  F  M+ E  +P+ V++ G L  CSH+GLV +G ++F +M
Subjt:  QQEFKDNIRISNSLVDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQNEGFKPDGVSYTGALTACSHAGLVNKGLELFDNM

Query:  KRIHKITPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLGSLLAACRTYGDVSLAEKLMKHLSKLDPRGDSNYVLLSNIYAAIGRWEGANKVRR
           + ITP++EHYGC+VDL+GRA  L +AL VIE MP+  N V+ GSL++ACR +G++ L +   K + +L+P  D   VL+SNIYA   RWE    +RR
Subjt:  KRIHKITPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLGSLLAACRTYGDVSLAEKLMKHLSKLDPRGDSNYVLLSNIYAAIGRWEGANKVRR

Query:  TMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHADADNIYSMLELLFHELKIYGYVPDTNIIL
         M+ + V K+ G S ++ +GK HEF+ GDK H  ++ IY+ L+ +  +LK+ GYVPD   +L
Subjt:  TMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHADADNIYSMLELLFHELKIYGYVPDTNIIL

O82380 Pentatricopeptide repeat-containing protein At2g29760, chloroplastic3.0e-10136.77Show/hide
Query:  IVLWTSSLARYCRNGQLSEAATEFTRMRLAGVEPNHVTFITLLSGCTDFPSESLFFGSSLHGYARKLGLDTGHVMVGTALVDMYAKCAQSRLARKVFDYL
        +V W S +  + + G   +A   F +M    V+ +HVT + +LS C      +L FG  +  Y  +  ++  ++ +  A++DMY KC     A+++FD +
Subjt:  IVLWTSSLARYCRNGQLSEAATEFTRMRLAGVEPNHVTFITLLSGCTDFPSESLFFGSSLHGYARKLGLDTGHVMVGTALVDMYAKCAQSRLARKVFDYL

Query:  GMKNSVTWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALINGLLKQGFSEQALECFHQMQC-SGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQ
          K++VTW TMLDGY  + + E A ++ + MP +D ++W ALI+   + G   +AL  FH++Q    ++ + +++++ L+ACA +GAL LG W++ ++ +
Subjt:  GMKNSVTWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALINGLLKQGFSEQALECFHQMQC-SGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQ

Query:  QEFKDNIRISNSLVDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQNEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMK
           + N  ++++L+ MYS+CG +E +R+VF  + KR +  W+++I G A++G  +E+++ F  MQ    KP+GV++T    ACSH GLV++   LF  M+
Subjt:  QEFKDNIRISNSLVDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQNEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMK

Query:  RIHKITPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLGSLLAACRTYGDVSLAEKLMKHLSKLDPRGDSNYVLLSNIYAAIGRWEGANKVRRT
          + I P  +HY CIVD+ GR+G LE A+  IE MP+ P+  V G+LL AC+ + +++LAE     L +L+PR D  +VLLSNIYA +G+WE  +++R+ 
Subjt:  RIHKITPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLGSLLAACRTYGDVSLAEKLMKHLSKLDPRGDSNYVLLSNIYAAIGRWEGANKVRRT

Query:  MKARGVQKKPGCSSVEIDGKVHEFVAGDKYHADADNIYSMLELLFHELKIYGYVPDTNIILNTKE
        M+  G++K+PGCSS+EIDG +HEF++GD  H  ++ +Y  L  +  +LK  GY P+ + +L   E
Subjt:  MKARGVQKKPGCSSVEIDGKVHEFVAGDKYHADADNIYSMLELLFHELKIYGYVPDTNIILNTKE

Q9LUJ2 Pentatricopeptide repeat-containing protein At3g226902.0e-10838.92Show/hide
Query:  RNISSKFPANSIDPIVLWTSSLARYCRNGQLSEAATEFTRMRLAGVEPNHVTFITLLSGCTDFPSESLFFGSSLHGYARKLGLDTGHVMVGTALVDMYAK
        + +  ++ A+++D   L  +  + Y R G   EA   F  M  +GV P+ ++ ++ +S C+     ++ +G S HGY  + G ++    +  AL+DMY K
Subjt:  RNISSKFPANSIDPIVLWTSSLARYCRNGQLSEAATEFTRMRLAGVEPNHVTFITLLSGCTDFPSESLFFGSSLHGYARKLGLDTGHVMVGTALVDMYAK

Query:  CAQSRLARKVFDYLGMKNSVTWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALINGLLKQGFSEQALECFHQMQC-SGIEPDYVSIIAVLAACADLG
        C +   A ++FD +  K  VTWN+++ GY  NGE++ A + F+ MP ++ +SW  +I+GL++    E+A+E F  MQ   G+  D V+++++ +AC  LG
Subjt:  CAQSRLARKVFDYLGMKNSVTWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALINGLLKQGFSEQALECFHQMQC-SGIEPDYVSIIAVLAACADLG

Query:  ALTLGLWVNRFVMQQEFKDNIRISNSLVDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQNEGFKPDGVSYTGALTACSHA
        AL L  W+  ++ +   + ++R+  +LVDM+SRCG  E A  +F  +  R + +W + I   A+ G A+ ++E FD M  +G KPDGV++ GALTACSH 
Subjt:  ALTLGLWVNRFVMQQEFKDNIRISNSLVDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQNEGFKPDGVSYTGALTACSHA

Query:  GLVNKGLELFDNMKRIHKITPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLGSLLAACRTYGDVSLAEKLMKHLSKLDPRGDSNYVLLSNIYA
        GLV +G E+F +M ++H ++P   HYGC+VDL GRAG LE+A+ +IE+MPM+PN+V+  SLLAACR  G+V +A    + +  L P    +YVLLSN+YA
Subjt:  GLVNKGLELFDNMKRIHKITPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLGSLLAACRTYGDVSLAEKLMKHLSKLDPRGDSNYVLLSNIYA

Query:  AIGRWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHADADNIYSMLELLFHELKIYGYVPD-TNIILNTKEFSK
        + GRW    KVR +MK +G++K PG SS++I GK HEF +GD+ H +  NI +ML+ +       G+VPD +N++++  E  K
Subjt:  AIGRWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHADADNIYSMLELLFHELKIYGYVPD-TNIILNTKEFSK

Q9MA50 Pentatricopeptide repeat-containing protein At1g05750, chloroplastic2.2e-16861.18Show/hide
Query:  VLWTSSLARYCRNGQLSEAATEFTRMRLAGVEPNHVTFITLLSGCTDFPSESLFFGSSLHGYARKLGLDTGHVMVGTALVDMYAKCAQSRLARKVFDYLG
        V WTS +    RNG+L+EAA EF+ M LAGVEPNH+TFI LLSGC DF S S   G  LHGYA KLGLD  HVMVGTA++ MY+K  + + AR VFDY+ 
Subjt:  VLWTSSLARYCRNGQLSEAATEFTRMRLAGVEPNHVTFITLLSGCTDFPSESLFFGSSLHGYARKLGLDTGHVMVGTALVDMYAKCAQSRLARKVFDYLG

Query:  MKNSVTWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALINGLLKQGFSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQE
         KNSVTWNTM+DGY R+G+++ A  +FD+MP RD ISWTA+ING +K+G+ E+AL  F +MQ SG++PDYV+IIA L AC +LGAL+ GLWV+R+V+ Q+
Subjt:  MKNSVTWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALINGLLKQGFSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQE

Query:  FKDNIRISNSLVDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQNEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRI
        FK+N+R+SNSL+D+Y RCGC+EFARQVF  M KRT+VSWNS+IVGFA NG A ESL +F  MQ +GFKPD V++TGALTACSH GLV +GL  F  MK  
Subjt:  FKDNIRISNSLVDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQNEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRI

Query:  HKITPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLGSLLAACRTYG-DVSLAEKLMKHLSKLDPRGDSNYVLLSNIYAAIGRWEGANKVRRTM
        ++I+PRIEHYGC+VDLY RAGRLEDAL +++ MPMKPNEVV+GSLLAAC  +G ++ LAE+LMKHL+ L+ +  SNYV+LSN+YAA G+WEGA+K+RR M
Subjt:  HKITPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLGSLLAACRTYG-DVSLAEKLMKHLSKLDPRGDSNYVLLSNIYAAIGRWEGANKVRRTM

Query:  KARGVQKKPGCSSVEIDGKVHEFVAGDKYHADADNIYSMLELLFHELKIYGYVPDT
        K  G++K+PG SS+EID  +H F+AGD  H +   I  +LEL+  +L++ G V +T
Subjt:  KARGVQKKPGCSSVEIDGKVHEFVAGDKYHADADNIYSMLELLFHELKIYGYVPDT

Q9SZT8 Pentatricopeptide repeat-containing protein ELI1, chloroplastic2.3e-10138.64Show/hide
Query:  SIDP-IVLWTSSLARYCRNGQLSEAATEFTRMRLAGVEPNHVTFITLLSGCTDFPSESLFFGSSLHGYARKLGLDTGHVMVGTALVDMYAKCAQSRLARK
        +IDP + L+T+++     NG   +A   + ++  + + PN  TF +LL  C      S   G  +H +  K GL      V T LVD+YAK      A+K
Subjt:  SIDP-IVLWTSSLARYCRNGQLSEAATEFTRMRLAGVEPNHVTFITLLSGCTDFPSESLFFGSSLHGYARKLGLDTGHVMVGTALVDMYAKCAQSRLARK

Query:  VFDYLGMKNSVTWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALINGLLKQGFSEQALECFHQMQCSG-IEPDYVSIIAVLAACADLGALTLGLWVN
        VFD +  ++ V+   M+  Y + G +E A  LFD M  RD +SW  +I+G  + GF   AL  F ++   G  +PD ++++A L+AC+ +GAL  G W++
Subjt:  VFDYLGMKNSVTWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALINGLLKQGFSEQALECFHQMQCSG-IEPDYVSIIAVLAACADLGALTLGLWVN

Query:  RFVMQQEFKDNIRISNSLVDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQN-EGFKPDGVSYTGALTACSHAGLVNKGLE
         FV     + N+++   L+DMYS+CG +E A  VF   P++ +V+WN++I G+A++G++ ++L  F+ MQ   G +P  +++ G L AC+HAGLVN+G+ 
Subjt:  RFVMQQEFKDNIRISNSLVDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQN-EGFKPDGVSYTGALTACSHAGLVNKGLE

Query:  LFDNMKRIHKITPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLGSLLAACRTYGDVSLAEKLMKHLSKLDPRGDSNYVLLSNIYAAIGRWEGA
        +F++M + + I P+IEHYGC+V L GRAG+L+ A   I+ M M  + V+  S+L +C+ +GD  L +++ ++L  L+ +    YVLLSNIYA++G +EG 
Subjt:  LFDNMKRIHKITPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLGSLLAACRTYGDVSLAEKLMKHLSKLDPRGDSNYVLLSNIYAAIGRWEGA

Query:  NKVRRTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHADADNIYSMLELLFHELKIYGYVPDTNIILNTKE
         KVR  MK +G+ K+PG S++EI+ KVHEF AGD+ H+ +  IY+ML  +   +K +GYVP+TN +L   E
Subjt:  NKVRRTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHADADNIYSMLELLFHELKIYGYVPDTNIILNTKE

Arabidopsis top hitse value%identityAlignment
AT1G05750.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.6e-16961.18Show/hide
Query:  VLWTSSLARYCRNGQLSEAATEFTRMRLAGVEPNHVTFITLLSGCTDFPSESLFFGSSLHGYARKLGLDTGHVMVGTALVDMYAKCAQSRLARKVFDYLG
        V WTS +    RNG+L+EAA EF+ M LAGVEPNH+TFI LLSGC DF S S   G  LHGYA KLGLD  HVMVGTA++ MY+K  + + AR VFDY+ 
Subjt:  VLWTSSLARYCRNGQLSEAATEFTRMRLAGVEPNHVTFITLLSGCTDFPSESLFFGSSLHGYARKLGLDTGHVMVGTALVDMYAKCAQSRLARKVFDYLG

Query:  MKNSVTWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALINGLLKQGFSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQE
         KNSVTWNTM+DGY R+G+++ A  +FD+MP RD ISWTA+ING +K+G+ E+AL  F +MQ SG++PDYV+IIA L AC +LGAL+ GLWV+R+V+ Q+
Subjt:  MKNSVTWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALINGLLKQGFSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQE

Query:  FKDNIRISNSLVDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQNEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRI
        FK+N+R+SNSL+D+Y RCGC+EFARQVF  M KRT+VSWNS+IVGFA NG A ESL +F  MQ +GFKPD V++TGALTACSH GLV +GL  F  MK  
Subjt:  FKDNIRISNSLVDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQNEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRI

Query:  HKITPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLGSLLAACRTYG-DVSLAEKLMKHLSKLDPRGDSNYVLLSNIYAAIGRWEGANKVRRTM
        ++I+PRIEHYGC+VDLY RAGRLEDAL +++ MPMKPNEVV+GSLLAAC  +G ++ LAE+LMKHL+ L+ +  SNYV+LSN+YAA G+WEGA+K+RR M
Subjt:  HKITPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLGSLLAACRTYG-DVSLAEKLMKHLSKLDPRGDSNYVLLSNIYAAIGRWEGANKVRRTM

Query:  KARGVQKKPGCSSVEIDGKVHEFVAGDKYHADADNIYSMLELLFHELKIYGYVPDT
        K  G++K+PG SS+EID  +H F+AGD  H +   I  +LEL+  +L++ G V +T
Subjt:  KARGVQKKPGCSSVEIDGKVHEFVAGDKYHADADNIYSMLELLFHELKIYGYVPDT

AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.2e-10236.77Show/hide
Query:  IVLWTSSLARYCRNGQLSEAATEFTRMRLAGVEPNHVTFITLLSGCTDFPSESLFFGSSLHGYARKLGLDTGHVMVGTALVDMYAKCAQSRLARKVFDYL
        +V W S +  + + G   +A   F +M    V+ +HVT + +LS C      +L FG  +  Y  +  ++  ++ +  A++DMY KC     A+++FD +
Subjt:  IVLWTSSLARYCRNGQLSEAATEFTRMRLAGVEPNHVTFITLLSGCTDFPSESLFFGSSLHGYARKLGLDTGHVMVGTALVDMYAKCAQSRLARKVFDYL

Query:  GMKNSVTWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALINGLLKQGFSEQALECFHQMQC-SGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQ
          K++VTW TMLDGY  + + E A ++ + MP +D ++W ALI+   + G   +AL  FH++Q    ++ + +++++ L+ACA +GAL LG W++ ++ +
Subjt:  GMKNSVTWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALINGLLKQGFSEQALECFHQMQC-SGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQ

Query:  QEFKDNIRISNSLVDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQNEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMK
           + N  ++++L+ MYS+CG +E +R+VF  + KR +  W+++I G A++G  +E+++ F  MQ    KP+GV++T    ACSH GLV++   LF  M+
Subjt:  QEFKDNIRISNSLVDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQNEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMK

Query:  RIHKITPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLGSLLAACRTYGDVSLAEKLMKHLSKLDPRGDSNYVLLSNIYAAIGRWEGANKVRRT
          + I P  +HY CIVD+ GR+G LE A+  IE MP+ P+  V G+LL AC+ + +++LAE     L +L+PR D  +VLLSNIYA +G+WE  +++R+ 
Subjt:  RIHKITPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLGSLLAACRTYGDVSLAEKLMKHLSKLDPRGDSNYVLLSNIYAAIGRWEGANKVRRT

Query:  MKARGVQKKPGCSSVEIDGKVHEFVAGDKYHADADNIYSMLELLFHELKIYGYVPDTNIILNTKE
        M+  G++K+PGCSS+EIDG +HEF++GD  H  ++ +Y  L  +  +LK  GY P+ + +L   E
Subjt:  MKARGVQKKPGCSSVEIDGKVHEFVAGDKYHADADNIYSMLELLFHELKIYGYVPDTNIILNTKE

AT3G22690.1 CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012881), Pentatricopeptide repeat (InterPro:IPR002885)1.4e-10938.92Show/hide
Query:  RNISSKFPANSIDPIVLWTSSLARYCRNGQLSEAATEFTRMRLAGVEPNHVTFITLLSGCTDFPSESLFFGSSLHGYARKLGLDTGHVMVGTALVDMYAK
        + +  ++ A+++D   L  +  + Y R G   EA   F  M  +GV P+ ++ ++ +S C+     ++ +G S HGY  + G ++    +  AL+DMY K
Subjt:  RNISSKFPANSIDPIVLWTSSLARYCRNGQLSEAATEFTRMRLAGVEPNHVTFITLLSGCTDFPSESLFFGSSLHGYARKLGLDTGHVMVGTALVDMYAK

Query:  CAQSRLARKVFDYLGMKNSVTWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALINGLLKQGFSEQALECFHQMQC-SGIEPDYVSIIAVLAACADLG
        C +   A ++FD +  K  VTWN+++ GY  NGE++ A + F+ MP ++ +SW  +I+GL++    E+A+E F  MQ   G+  D V+++++ +AC  LG
Subjt:  CAQSRLARKVFDYLGMKNSVTWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALINGLLKQGFSEQALECFHQMQC-SGIEPDYVSIIAVLAACADLG

Query:  ALTLGLWVNRFVMQQEFKDNIRISNSLVDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQNEGFKPDGVSYTGALTACSHA
        AL L  W+  ++ +   + ++R+  +LVDM+SRCG  E A  +F  +  R + +W + I   A+ G A+ ++E FD M  +G KPDGV++ GALTACSH 
Subjt:  ALTLGLWVNRFVMQQEFKDNIRISNSLVDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQNEGFKPDGVSYTGALTACSHA

Query:  GLVNKGLELFDNMKRIHKITPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLGSLLAACRTYGDVSLAEKLMKHLSKLDPRGDSNYVLLSNIYA
        GLV +G E+F +M ++H ++P   HYGC+VDL GRAG LE+A+ +IE+MPM+PN+V+  SLLAACR  G+V +A    + +  L P    +YVLLSN+YA
Subjt:  GLVNKGLELFDNMKRIHKITPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLGSLLAACRTYGDVSLAEKLMKHLSKLDPRGDSNYVLLSNIYA

Query:  AIGRWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHADADNIYSMLELLFHELKIYGYVPD-TNIILNTKEFSK
        + GRW    KVR +MK +G++K PG SS++I GK HEF +GD+ H +  NI +ML+ +       G+VPD +N++++  E  K
Subjt:  AIGRWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHADADNIYSMLELLFHELKIYGYVPD-TNIILNTKEFSK

AT3G22690.2 INVOLVED IN: photosystem II assembly, regulation of chlorophyll biosynthetic process, photosystem I assembly, thylakoid membrane organization, RNA modification1.4e-10938.92Show/hide
Query:  RNISSKFPANSIDPIVLWTSSLARYCRNGQLSEAATEFTRMRLAGVEPNHVTFITLLSGCTDFPSESLFFGSSLHGYARKLGLDTGHVMVGTALVDMYAK
        + +  ++ A+++D   L  +  + Y R G   EA   F  M  +GV P+ ++ ++ +S C+     ++ +G S HGY  + G ++    +  AL+DMY K
Subjt:  RNISSKFPANSIDPIVLWTSSLARYCRNGQLSEAATEFTRMRLAGVEPNHVTFITLLSGCTDFPSESLFFGSSLHGYARKLGLDTGHVMVGTALVDMYAK

Query:  CAQSRLARKVFDYLGMKNSVTWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALINGLLKQGFSEQALECFHQMQC-SGIEPDYVSIIAVLAACADLG
        C +   A ++FD +  K  VTWN+++ GY  NGE++ A + F+ MP ++ +SW  +I+GL++    E+A+E F  MQ   G+  D V+++++ +AC  LG
Subjt:  CAQSRLARKVFDYLGMKNSVTWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALINGLLKQGFSEQALECFHQMQC-SGIEPDYVSIIAVLAACADLG

Query:  ALTLGLWVNRFVMQQEFKDNIRISNSLVDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQNEGFKPDGVSYTGALTACSHA
        AL L  W+  ++ +   + ++R+  +LVDM+SRCG  E A  +F  +  R + +W + I   A+ G A+ ++E FD M  +G KPDGV++ GALTACSH 
Subjt:  ALTLGLWVNRFVMQQEFKDNIRISNSLVDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQNEGFKPDGVSYTGALTACSHA

Query:  GLVNKGLELFDNMKRIHKITPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLGSLLAACRTYGDVSLAEKLMKHLSKLDPRGDSNYVLLSNIYA
        GLV +G E+F +M ++H ++P   HYGC+VDL GRAG LE+A+ +IE+MPM+PN+V+  SLLAACR  G+V +A    + +  L P    +YVLLSN+YA
Subjt:  GLVNKGLELFDNMKRIHKITPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLGSLLAACRTYGDVSLAEKLMKHLSKLDPRGDSNYVLLSNIYA

Query:  AIGRWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHADADNIYSMLELLFHELKIYGYVPD-TNIILNTKEFSK
        + GRW    KVR +MK +G++K PG SS++I GK HEF +GD+ H +  NI +ML+ +       G+VPD +N++++  E  K
Subjt:  AIGRWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHADADNIYSMLELLFHELKIYGYVPD-TNIILNTKEFSK

AT4G37380.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.7e-10238.64Show/hide
Query:  SIDP-IVLWTSSLARYCRNGQLSEAATEFTRMRLAGVEPNHVTFITLLSGCTDFPSESLFFGSSLHGYARKLGLDTGHVMVGTALVDMYAKCAQSRLARK
        +IDP + L+T+++     NG   +A   + ++  + + PN  TF +LL  C      S   G  +H +  K GL      V T LVD+YAK      A+K
Subjt:  SIDP-IVLWTSSLARYCRNGQLSEAATEFTRMRLAGVEPNHVTFITLLSGCTDFPSESLFFGSSLHGYARKLGLDTGHVMVGTALVDMYAKCAQSRLARK

Query:  VFDYLGMKNSVTWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALINGLLKQGFSEQALECFHQMQCSG-IEPDYVSIIAVLAACADLGALTLGLWVN
        VFD +  ++ V+   M+  Y + G +E A  LFD M  RD +SW  +I+G  + GF   AL  F ++   G  +PD ++++A L+AC+ +GAL  G W++
Subjt:  VFDYLGMKNSVTWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALINGLLKQGFSEQALECFHQMQCSG-IEPDYVSIIAVLAACADLGALTLGLWVN

Query:  RFVMQQEFKDNIRISNSLVDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQN-EGFKPDGVSYTGALTACSHAGLVNKGLE
         FV     + N+++   L+DMYS+CG +E A  VF   P++ +V+WN++I G+A++G++ ++L  F+ MQ   G +P  +++ G L AC+HAGLVN+G+ 
Subjt:  RFVMQQEFKDNIRISNSLVDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQN-EGFKPDGVSYTGALTACSHAGLVNKGLE

Query:  LFDNMKRIHKITPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLGSLLAACRTYGDVSLAEKLMKHLSKLDPRGDSNYVLLSNIYAAIGRWEGA
        +F++M + + I P+IEHYGC+V L GRAG+L+ A   I+ M M  + V+  S+L +C+ +GD  L +++ ++L  L+ +    YVLLSNIYA++G +EG 
Subjt:  LFDNMKRIHKITPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLGSLLAACRTYGDVSLAEKLMKHLSKLDPRGDSNYVLLSNIYAAIGRWEGA

Query:  NKVRRTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHADADNIYSMLELLFHELKIYGYVPDTNIILNTKE
         KVR  MK +G+ K+PG S++EI+ KVHEF AGD+ H+ +  IY+ML  +   +K +GYVP+TN +L   E
Subjt:  NKVRRTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHADADNIYSMLELLFHELKIYGYVPDTNIILNTKE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCAGCAGTCCTTCGTACACCGCCATTCCATCCCAACTCCAACAATATCCTAATCCGCCATCTTCAATCCCACTTTCAAACCCAACAAAACTCAACTTCCCC
CGATCTCCCAATTCCTCACATCGCAATATCTCCTCCAAATTCCCCGCCAATTCTATTGACCCCATTGTTCTATGGACCTCTTCTCTTGCTCGCTACTGCCGAAAC
GGCCAATTATCCGAAGCCGCCACAGAGTTTACACGCATGAGACTCGCCGGAGTTGAGCCGAATCACGTCACATTCATTACCCTTCTCTCCGGGTGTACTGATTTT
CCTTCAGAAAGCCTCTTCTTCGGCTCTTCTCTTCATGGCTACGCCCGTAAACTTGGTTTGGATACAGGGCATGTAATGGTGGGGACTGCTCTTGTTGATATGTAT
GCCAAATGTGCTCAATCGCGTCTTGCTAGGAAGGTTTTTGATTACCTGGGTATGAAAAATTCTGTCACTTGGAACACGATGCTCGATGGTTACACGAGGAATGGA
GAAATTGAGTTGGCCATTGACCTGTTTGATGAAATGCCTACAAGAGATGCGATTTCTTGGACGGCTTTGATTAACGGTCTTTTGAAACAGGGGTTCTCTGAACAA
GCATTGGAGTGCTTCCATCAGATGCAATGTTCAGGTATCGAGCCTGATTATGTGTCTATAATTGCTGTTCTTGCTGCGTGTGCTGATTTAGGCGCACTTACTTTG
GGGTTGTGGGTTAATCGGTTTGTTATGCAGCAGGAGTTTAAGGATAATATTAGGATAAGTAATTCTTTGGTAGATATGTATTCTCGATGTGGATGTATTGAGTTT
GCCCGCCAAGTGTTTGAGAAAATGCCCAAGCGAACTTTGGTATCTTGGAACTCTATCATTGTGGGGTTTGCAGTTAATGGATTTGCAGATGAATCTCTGGAGTTT
TTTGATGCAATGCAAAACGAAGGATTCAAACCAGATGGAGTTAGCTACACGGGAGCTCTTACTGCGTGTAGTCATGCTGGTTTAGTGAACAAAGGTCTAGAATTA
TTCGATAACATGAAGAGGATACACAAAATTACTCCCAGGATTGAGCATTATGGATGTATTGTCGACCTCTATGGTCGTGCAGGGAGGCTAGAGGATGCATTGAAT
GTGATCGAGGAAATGCCCATGAAACCGAATGAAGTCGTGCTGGGATCGTTGCTAGCGGCTTGCAGGACTTATGGTGATGTGAGTCTGGCTGAAAAGTTAATGAAA
CATCTCTCTAAGTTAGATCCACGGGGCGATTCGAATTATGTGCTCCTTTCAAACATATATGCAGCAATTGGGAGGTGGGAAGGTGCTAACAAGGTCAGGCGTACA
ATGAAAGCCCGAGGTGTGCAGAAAAAACCAGGTTGTAGTTCTGTTGAGATTGATGGTAAGGTTCATGAGTTTGTTGCTGGTGATAAATACCATGCTGATGCAGAC
AATATTTACTCAATGTTAGAGTTGTTGTTTCATGAACTGAAGATATATGGTTATGTTCCTGATACGAATATCATTCTGAATACCAAAGAATTTAGTAAAGACCAT
TGA
mRNA sequenceShow/hide mRNA sequence
TATTAAATGAGTCATTGATCATAAAAATCTGTTCCCGCTCTATCCCGAACGGTTCTTGGAACGATGAGCAGCAGTCCTTCGTACACCGCCATTCCATCCCAACTC
CAACAATATCCTAATCCGCCATCTTCAATCCCACTTTCAAACCCAACAAAACTCAACTTCCCCCGATCTCCCAATTCCTCACATCGCAATATCTCCTCCAAATTC
CCCGCCAATTCTATTGACCCCATTGTTCTATGGACCTCTTCTCTTGCTCGCTACTGCCGAAACGGCCAATTATCCGAAGCCGCCACAGAGTTTACACGCATGAGA
CTCGCCGGAGTTGAGCCGAATCACGTCACATTCATTACCCTTCTCTCCGGGTGTACTGATTTTCCTTCAGAAAGCCTCTTCTTCGGCTCTTCTCTTCATGGCTAC
GCCCGTAAACTTGGTTTGGATACAGGGCATGTAATGGTGGGGACTGCTCTTGTTGATATGTATGCCAAATGTGCTCAATCGCGTCTTGCTAGGAAGGTTTTTGAT
TACCTGGGTATGAAAAATTCTGTCACTTGGAACACGATGCTCGATGGTTACACGAGGAATGGAGAAATTGAGTTGGCCATTGACCTGTTTGATGAAATGCCTACA
AGAGATGCGATTTCTTGGACGGCTTTGATTAACGGTCTTTTGAAACAGGGGTTCTCTGAACAAGCATTGGAGTGCTTCCATCAGATGCAATGTTCAGGTATCGAG
CCTGATTATGTGTCTATAATTGCTGTTCTTGCTGCGTGTGCTGATTTAGGCGCACTTACTTTGGGGTTGTGGGTTAATCGGTTTGTTATGCAGCAGGAGTTTAAG
GATAATATTAGGATAAGTAATTCTTTGGTAGATATGTATTCTCGATGTGGATGTATTGAGTTTGCCCGCCAAGTGTTTGAGAAAATGCCCAAGCGAACTTTGGTA
TCTTGGAACTCTATCATTGTGGGGTTTGCAGTTAATGGATTTGCAGATGAATCTCTGGAGTTTTTTGATGCAATGCAAAACGAAGGATTCAAACCAGATGGAGTT
AGCTACACGGGAGCTCTTACTGCGTGTAGTCATGCTGGTTTAGTGAACAAAGGTCTAGAATTATTCGATAACATGAAGAGGATACACAAAATTACTCCCAGGATT
GAGCATTATGGATGTATTGTCGACCTCTATGGTCGTGCAGGGAGGCTAGAGGATGCATTGAATGTGATCGAGGAAATGCCCATGAAACCGAATGAAGTCGTGCTG
GGATCGTTGCTAGCGGCTTGCAGGACTTATGGTGATGTGAGTCTGGCTGAAAAGTTAATGAAACATCTCTCTAAGTTAGATCCACGGGGCGATTCGAATTATGTG
CTCCTTTCAAACATATATGCAGCAATTGGGAGGTGGGAAGGTGCTAACAAGGTCAGGCGTACAATGAAAGCCCGAGGTGTGCAGAAAAAACCAGGTTGTAGTTCT
GTTGAGATTGATGGTAAGGTTCATGAGTTTGTTGCTGGTGATAAATACCATGCTGATGCAGACAATATTTACTCAATGTTAGAGTTGTTGTTTCATGAACTGAAG
ATATATGGTTATGTTCCTGATACGAATATCATTCTGAATACCAAAGAATTTAGTAAAGACCATTGAAGCTTATTCGATCTAGTAATTCTCTTATATTATGTATCC
ATCTAAAGCTTTGATCAATAGCAATCTGAGAAATTATGAAATAGACAAAGGTAAATGGATGCTTAGGTTCTGCAGGCTACAAGGTATATTACTTGATTCGTATTT
GTCATCACATTTGAAAACGGAAG
Protein sequenceShow/hide protein sequence
MSSSPSYTAIPSQLQQYPNPPSSIPLSNPTKLNFPRSPNSSHRNISSKFPANSIDPIVLWTSSLARYCRNGQLSEAATEFTRMRLAGVEPNHVTFITLLSGCTDF
PSESLFFGSSLHGYARKLGLDTGHVMVGTALVDMYAKCAQSRLARKVFDYLGMKNSVTWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALINGLLKQGFSEQ
ALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLVDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEF
FDAMQNEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRIHKITPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLGSLLAACRTYGDVSLAEKLMK
HLSKLDPRGDSNYVLLSNIYAAIGRWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHADADNIYSMLELLFHELKIYGYVPDTNIILNTKEFSKDH