CuGenDBv2

Lsi02G011870 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi02G011870
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: chr02:14917823..14919400
RNA-Seq Expression: Lsi02G011870
Synteny: Lsi02G011870
Gene Ontology terms:
  GO:0009451 - RNA modification (biological process)
  GO:0009507 - chloroplast (cellular component)
  GO:0016021 - integral component of membrane (cellular component)
  GO:0003723 - RNA binding (molecular function)
  GO:0005515 - protein binding (molecular function)
InterPro domains:
  IPR002885 - Pentatricopeptide repeat
  IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology
GenBank top hits | e value | % identity | Alignment
XP_004139593.1 pentatricopeptide repeat-containing protein At1g05750, chloroplastic [Cucumis sativus] | 7.1e-273 | 88.93
Query:  MSSIPSHIAIPSQHQQCPNPPSSIPLSNPTKLNFPRSPNSSHRNISSKFTANSIDPIVLWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLS
        MSSIPSH A PSQ Q  P  PSSIPLSNPTKLNFPRSPNS HRNISSKF  NS+DPIVLWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNH+TFITLLS
Subjt:  MSSIPSHIAIPSQHQQCPNPPSSIPLSNPTKLNFPRSPNSSHRNISSKFTANSIDPIVLWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLS

Query:  GCADFPSESLFFGSLLHGYARKFGLDTGHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALIN
         CADFPSES FF S LHGYA K+GLDTGHVMVGTALIDMY+KCAQLG ARKVF  LGVKNSVSWNTML+G+ RNGEIELA+ LFDEMPTRDAISWTALIN
Subjt:  GCADFPSESLFFGSLLHGYARKFGLDTGHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALIN

Query:  GLFKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSII
        GL KHGYSEQALECFHQMQ SG+  DYVSIIAVLAACADLGALTLGLWV+RFVM QEFKDNI+ISNSLIDMYSRCGCIEFARQVF KM KRTLVSWNSII
Subjt:  GLFKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSII

Query:  VGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACTHAGLVNKGLELFDNMKSVHRITPRIEHYGCIVDLYGRAGRLDDALNVIEEMPMKPNEVVLG
        VGFAVNGFADESLEFF AMQKEGFKPDGVSYTGALTAC+HAGLVNKGLELFDNMKSVH+ITPRIEHYGCIVDLYGRAGRL+DALN+IEEMPMKPNEVVLG
Subjt:  VGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACTHAGLVNKGLELFDNMKSVHRITPRIEHYGCIVDLYGRAGRLDDALNVIEEMPMKPNEVVLG

Query:  SLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVSGDKYHADADSIYSMLELLF
        SLLAACRTHGDV+LAERLMKHLFKLDP GD+ YVLLSNIYAAIGKW+GAN VRRTMKARGVQKKPG SSVEIDGKVHEFV+GD YHADAD+IYSML+LL 
Subjt:  SLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVSGDKYHADADSIYSMLELLF

Query:  HELKICGYVPDTDTILNTKESSKD
        HELK+CGYVP +DTILNTKES+KD
Subjt:  HELKICGYVPDTDTILNTKESSKD

XP_008458940.1 PREDICTED: pentatricopeptide repeat-containing protein At1g05750, chloroplastic [Cucumis melo] | 1.1e-270 | 88.57
Query:  MSSIPSHIAIPSQHQQCPNPPSSIPLSNPTKLNFPRSPNSSHRNISSKFTANSIDPIVLWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLS
        MSSIPSHIA PSQ QQ P+  SSIPLSNPTK+NFPRSP S H NI SKFTANS+ PIV WTSS+ARYC NGQL EAAAEFTRMRLAGVEPNH+TFITLLS
Subjt:  MSSIPSHIAIPSQHQQCPNPPSSIPLSNPTKLNFPRSPNSSHRNISSKFTANSIDPIVLWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLS

Query:  GCADFPSESLFFGSLLHGYARKFGLDTGHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALIN
        GCADFPSES FF S LHGYA KFGLDTGHVMVGTALIDMY+KC+QLGLA+KVFDYLGVKNSVSWNTML+G+ RNGEIELA+ LFDEMPTRDAISWTALIN
Subjt:  GCADFPSESLFFGSLLHGYARKFGLDTGHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALIN

Query:  GLFKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSII
        GL KHGYSEQALECFHQMQ SG+  DYVSIIAVLAACADLGALT GLWVNRFVMQQEFKDN+RISNSLIDMYSRCGCIEFARQVF KM KRTLVSWNSII
Subjt:  GLFKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSII

Query:  VGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACTHAGLVNKGLELFDNMKSVHRITPRIEHYGCIVDLYGRAGRLDDALNVIEEMPMKPNEVVLG
        VGFA NGFADESLEFF AMQKEGFKPDGVSYTGALTAC+HAGLVNKGLELFDNMK VH+ITP IEHYGCIVDLYGRAGRL+DA NVIEEMPMKPNEVVLG
Subjt:  VGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACTHAGLVNKGLELFDNMKSVHRITPRIEHYGCIVDLYGRAGRLDDALNVIEEMPMKPNEVVLG

Query:  SLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVSGDKYHADADSIYSMLELLF
        SLLAACRTHGDV LAERLMKH+FKLD  GDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKK G SSVEIDGKVHEFV+GDKYHADAD+IYSML+LLF
Subjt:  SLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVSGDKYHADADSIYSMLELLF

Query:  HELKICGYVPDTDTILNTKESSKDH
        HELK+CGYVPDTD ILNTK+S+KDH
Subjt:  HELKICGYVPDTDTILNTKESSKDH

XP_022142716.1 pentatricopeptide repeat-containing protein At1g05750, chloroplastic [Momordica charantia] | 7.6e-275 | 88.76
Query:  MSSIPSHIAIPSQHQQCPNPPSSIPLSNPTKLNFPRSPNSSHRNISSKFTANSIDPIVLWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLS
        MSSIP++ A   Q QQ PNP +SIPL NP  +NFPRS NSS+R+ISSK T NSIDPIVLWTSS+ARYCRNGQL+EAAAEFT MRLAGVEPNHVT ITLLS
Subjt:  MSSIPSHIAIPSQHQQCPNPPSSIPLSNPTKLNFPRSPNSSHRNISSKFTANSIDPIVLWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLS

Query:  GCADFPSESLFFGSLLHGYARKFGLDTGHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALIN
        GCADFPSESL+FGS LHGYARK GLDT HVMVGT+++DMYAKCAQLGLAR+VFDYL +KNSVSWNTMLDGYTRNGEIELA+DLFDEMPTRDAISWTALIN
Subjt:  GCADFPSESLFFGSLLHGYARKFGLDTGHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALIN

Query:  GLFKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSII
        GL K GYSEQALECFHQMQCSGI+PDYVSIIAVLAACADLG LTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCI FARQVFE+M KRTLVSWNSII
Subjt:  GLFKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSII

Query:  VGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACTHAGLVNKGLELFDNMKSVHRITPRIEHYGCIVDLYGRAGRLDDALNVIEEMPMKPNEVVLG
        VG+A NGFADESLEFFDAMQKEGFKPD VSYTGALTAC+HAGLVNKGLELFDNMK VHRI PRIEHYGCIVDLYGRAGRL+DAL+VIE+MPMKPNEVVLG
Subjt:  VGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACTHAGLVNKGLELFDNMKSVHRITPRIEHYGCIVDLYGRAGRLDDALNVIEEMPMKPNEVVLG

Query:  SLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVSGDKYHADADSIYSMLELLF
        SLLAACRTHGDVSLAERLMKHL KLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPG SSVEIDGKVHEFV+GDKYHADADSIYSML+LL 
Subjt:  SLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVSGDKYHADADSIYSMLELLF

Query:  HELKICGYVPDTDTILNTKESSKDH
        HELKICG VP+T+T LNTKESSKDH
Subjt:  HELKICGYVPDTDTILNTKESSKDH

XP_022967078.1 pentatricopeptide repeat-containing protein At1g05750, chloroplastic [Cucurbita maxima] | 4.0e-260 | 83.87
Query:  MSSIPSHIAIPSQH--QQCPNPPSSIPLSNPTKLNFPRSPNSSHRNISSKFTANSIDPIVLWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITL
        MSS+PSH  IP Q   QQ  NPPS IP SNP+ L+FPR+PNSS          N I PIVLWTSS+ARYCRN QL+EAAAEFTRMRLAGVEPNH+TFITL
Subjt:  MSSIPSHIAIPSQH--QQCPNPPSSIPLSNPTKLNFPRSPNSSHRNISSKFTANSIDPIVLWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITL

Query:  LSGCADFPSESLFFGSLLHGYARKFGLDTGHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTAL
        LSGCADFPS SL FG+ LHGY RK GLDTGHVMVGTALI MYAKCAQLGLAR VFDYL +KNSV+WNTMLDGY RNGEIELA++LFDEMPTRDAISWTAL
Subjt:  LSGCADFPSESLFFGSLLHGYARKFGLDTGHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTAL

Query:  INGLFKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNS
        ING  K GYSEQALECFH+MQCSGIEPDYVSIIAVLAACADLGAL+ GLWVNRF+MQQEFKDNIRISNSLIDMYSRCGCIEFARQVF+KM K TLVSWNS
Subjt:  INGLFKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNS

Query:  IIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACTHAGLVNKGLELFDNMKSVHRITPRIEHYGCIVDLYGRAGRLDDALNVIEEMPMKPNEVV
        +IVGFA+NGFADESLEFFDAMQKEGF  DGVSYTGALTAC+HAGLVNKGLELFDNMK VHRITPRIEHYGCIVDLY RAGRLD+ALNVIE MPMKPNEVV
Subjt:  IIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACTHAGLVNKGLELFDNMKSVHRITPRIEHYGCIVDLYGRAGRLDDALNVIEEMPMKPNEVV

Query:  LGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVSGDKYHADADSIYSMLEL
        LGSLLAACRTHGDVSLAERL+K+LF+LDPGGDS+YVLLSNIYAA+G+WEGANKVRRTMKARGVQKKPG SS+EIDGKVHEFV+GDKYH DAD+IYSMLE+
Subjt:  LGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVSGDKYHADADSIYSMLEL

Query:  LFHELKICGYVPDTDTILNTKESSKDH
        LFHELKI GYVP+T T +N  ESSK++
Subjt:  LFHELKICGYVPDTDTILNTKESSKDH

XP_038877228.1 pentatricopeptide repeat-containing protein At1g05750, chloroplastic [Benincasa hispida] | 7.1e-289 | 92.95
Query:  MSSIPSHIAIPSQHQQCPNPPSSIPLSNPTKLNFPRSPNSSHRNISSKFTANSIDPIVLWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLS
        MSS PS+ AIPSQ QQ PNPPSSIPLSNPTKLNFPRSPNSSHRNISSKF ANSIDPIVLWTSSLARYCRNGQLSEAA EFTRMRLAGVEPNHVTFITLLS
Subjt:  MSSIPSHIAIPSQHQQCPNPPSSIPLSNPTKLNFPRSPNSSHRNISSKFTANSIDPIVLWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLS

Query:  GCADFPSESLFFGSLLHGYARKFGLDTGHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALIN
        GC DFPSESLFFGS LHGYARK GLDTGHVMVGTAL+DMYAKCAQ  LARKVFDYLG+KNSV+WNTMLDGYTRNGEIELA+DLFDEMPTRDAISWTALIN
Subjt:  GCADFPSESLFFGSLLHGYARKFGLDTGHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALIN

Query:  GLFKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSII
        GL K G+SEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSL+DMYSRCGCIEFARQVFEKMPKRTLVSWNSII
Subjt:  GLFKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSII

Query:  VGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACTHAGLVNKGLELFDNMKSVHRITPRIEHYGCIVDLYGRAGRLDDALNVIEEMPMKPNEVVLG
        VGFAVNGFADESLEFFDAMQ EGFKPDGVSYTGALTAC+HAGLVNKGLELFDNMK +H+ITPRIEHYGCIVDLYGRAGRL+DALNVIEEMPMKPNEVVLG
Subjt:  VGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACTHAGLVNKGLELFDNMKSVHRITPRIEHYGCIVDLYGRAGRLDDALNVIEEMPMKPNEVVLG

Query:  SLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVSGDKYHADADSIYSMLELLF
        SLLAACRT+GDVSLAE+LMKHL KLDP GDSNYVLLSNIYAAIG+WEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFV+GDKYHADAD+IYSMLELLF
Subjt:  SLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVSGDKYHADADSIYSMLELLF

Query:  HELKICGYVPDTDTILNTKESSKDH
        HELKI GYVPDT+ ILNTKE SKDH
Subjt:  HELKICGYVPDTDTILNTKESSKDH

TrEMBL top hits | e value | % identity | Alignment
A0A0A0LYD6 Uncharacterized protein | 3.4e-273 | 88.93
Query:  MSSIPSHIAIPSQHQQCPNPPSSIPLSNPTKLNFPRSPNSSHRNISSKFTANSIDPIVLWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLS
        MSSIPSH A PSQ Q  P  PSSIPLSNPTKLNFPRSPNS HRNISSKF  NS+DPIVLWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNH+TFITLLS
Subjt:  MSSIPSHIAIPSQHQQCPNPPSSIPLSNPTKLNFPRSPNSSHRNISSKFTANSIDPIVLWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLS

Query:  GCADFPSESLFFGSLLHGYARKFGLDTGHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALIN
         CADFPSES FF S LHGYA K+GLDTGHVMVGTALIDMY+KCAQLG ARKVF  LGVKNSVSWNTML+G+ RNGEIELA+ LFDEMPTRDAISWTALIN
Subjt:  GCADFPSESLFFGSLLHGYARKFGLDTGHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALIN

Query:  GLFKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSII
        GL KHGYSEQALECFHQMQ SG+  DYVSIIAVLAACADLGALTLGLWV+RFVM QEFKDNI+ISNSLIDMYSRCGCIEFARQVF KM KRTLVSWNSII
Subjt:  GLFKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSII

Query:  VGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACTHAGLVNKGLELFDNMKSVHRITPRIEHYGCIVDLYGRAGRLDDALNVIEEMPMKPNEVVLG
        VGFAVNGFADESLEFF AMQKEGFKPDGVSYTGALTAC+HAGLVNKGLELFDNMKSVH+ITPRIEHYGCIVDLYGRAGRL+DALN+IEEMPMKPNEVVLG
Subjt:  VGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACTHAGLVNKGLELFDNMKSVHRITPRIEHYGCIVDLYGRAGRLDDALNVIEEMPMKPNEVVLG

Query:  SLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVSGDKYHADADSIYSMLELLF
        SLLAACRTHGDV+LAERLMKHLFKLDP GD+ YVLLSNIYAAIGKW+GAN VRRTMKARGVQKKPG SSVEIDGKVHEFV+GD YHADAD+IYSML+LL 
Subjt:  SLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVSGDKYHADADSIYSMLELLF

Query:  HELKICGYVPDTDTILNTKESSKD
        HELK+CGYVP +DTILNTKES+KD
Subjt:  HELKICGYVPDTDTILNTKESSKD

A0A1S3C956 pentatricopeptide repeat-containing protein At1g05750, chloroplastic | 5.5e-271 | 88.57
Query:  MSSIPSHIAIPSQHQQCPNPPSSIPLSNPTKLNFPRSPNSSHRNISSKFTANSIDPIVLWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLS
        MSSIPSHIA PSQ QQ P+  SSIPLSNPTK+NFPRSP S H NI SKFTANS+ PIV WTSS+ARYC NGQL EAAAEFTRMRLAGVEPNH+TFITLLS
Subjt:  MSSIPSHIAIPSQHQQCPNPPSSIPLSNPTKLNFPRSPNSSHRNISSKFTANSIDPIVLWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLS

Query:  GCADFPSESLFFGSLLHGYARKFGLDTGHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALIN
        GCADFPSES FF S LHGYA KFGLDTGHVMVGTALIDMY+KC+QLGLA+KVFDYLGVKNSVSWNTML+G+ RNGEIELA+ LFDEMPTRDAISWTALIN
Subjt:  GCADFPSESLFFGSLLHGYARKFGLDTGHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALIN

Query:  GLFKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSII
        GL KHGYSEQALECFHQMQ SG+  DYVSIIAVLAACADLGALT GLWVNRFVMQQEFKDN+RISNSLIDMYSRCGCIEFARQVF KM KRTLVSWNSII
Subjt:  GLFKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSII

Query:  VGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACTHAGLVNKGLELFDNMKSVHRITPRIEHYGCIVDLYGRAGRLDDALNVIEEMPMKPNEVVLG
        VGFA NGFADESLEFF AMQKEGFKPDGVSYTGALTAC+HAGLVNKGLELFDNMK VH+ITP IEHYGCIVDLYGRAGRL+DA NVIEEMPMKPNEVVLG
Subjt:  VGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACTHAGLVNKGLELFDNMKSVHRITPRIEHYGCIVDLYGRAGRLDDALNVIEEMPMKPNEVVLG

Query:  SLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVSGDKYHADADSIYSMLELLF
        SLLAACRTHGDV LAERLMKH+FKLD  GDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKK G SSVEIDGKVHEFV+GDKYHADAD+IYSML+LLF
Subjt:  SLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVSGDKYHADADSIYSMLELLF

Query:  HELKICGYVPDTDTILNTKESSKDH
        HELK+CGYVPDTD ILNTK+S+KDH
Subjt:  HELKICGYVPDTDTILNTKESSKDH

A0A5A7UJB6 Pentatricopeptide repeat-containing protein | 5.5e-271 | 88.57
Query:  MSSIPSHIAIPSQHQQCPNPPSSIPLSNPTKLNFPRSPNSSHRNISSKFTANSIDPIVLWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLS
        MSSIPSHIA PSQ QQ P+  SSIPLSNPTK+NFPRSP S H NI SKFTANS+ PIV WTSS+ARYC NGQL EAAAEFTRMRLAGVEPNH+TFITLLS
Subjt:  MSSIPSHIAIPSQHQQCPNPPSSIPLSNPTKLNFPRSPNSSHRNISSKFTANSIDPIVLWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLS

Query:  GCADFPSESLFFGSLLHGYARKFGLDTGHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALIN
        GCADFPSES FF S LHGYA KFGLDTGHVMVGTALIDMY+KC+QLGLA+KVFDYLGVKNSVSWNTML+G+ RNGEIELA+ LFDEMPTRDAISWTALIN
Subjt:  GCADFPSESLFFGSLLHGYARKFGLDTGHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALIN

Query:  GLFKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSII
        GL KHGYSEQALECFHQMQ SG+  DYVSIIAVLAACADLGALT GLWVNRFVMQQEFKDN+RISNSLIDMYSRCGCIEFARQVF KM KRTLVSWNSII
Subjt:  GLFKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSII

Query:  VGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACTHAGLVNKGLELFDNMKSVHRITPRIEHYGCIVDLYGRAGRLDDALNVIEEMPMKPNEVVLG
        VGFA NGFADESLEFF AMQKEGFKPDGVSYTGALTAC+HAGLVNKGLELFDNMK VH+ITP IEHYGCIVDLYGRAGRL+DA NVIEEMPMKPNEVVLG
Subjt:  VGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACTHAGLVNKGLELFDNMKSVHRITPRIEHYGCIVDLYGRAGRLDDALNVIEEMPMKPNEVVLG

Query:  SLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVSGDKYHADADSIYSMLELLF
        SLLAACRTHGDV LAERLMKH+FKLD  GDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKK G SSVEIDGKVHEFV+GDKYHADAD+IYSML+LLF
Subjt:  SLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVSGDKYHADADSIYSMLELLF

Query:  HELKICGYVPDTDTILNTKESSKDH
        HELK+CGYVPDTD ILNTK+S+KDH
Subjt:  HELKICGYVPDTDTILNTKESSKDH

A0A6J1CN07 pentatricopeptide repeat-containing protein At1g05750, chloroplastic | 3.7e-275 | 88.76
Query:  MSSIPSHIAIPSQHQQCPNPPSSIPLSNPTKLNFPRSPNSSHRNISSKFTANSIDPIVLWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLS
        MSSIP++ A   Q QQ PNP +SIPL NP  +NFPRS NSS+R+ISSK T NSIDPIVLWTSS+ARYCRNGQL+EAAAEFT MRLAGVEPNHVT ITLLS
Subjt:  MSSIPSHIAIPSQHQQCPNPPSSIPLSNPTKLNFPRSPNSSHRNISSKFTANSIDPIVLWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLS

Query:  GCADFPSESLFFGSLLHGYARKFGLDTGHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALIN
        GCADFPSESL+FGS LHGYARK GLDT HVMVGT+++DMYAKCAQLGLAR+VFDYL +KNSVSWNTMLDGYTRNGEIELA+DLFDEMPTRDAISWTALIN
Subjt:  GCADFPSESLFFGSLLHGYARKFGLDTGHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALIN

Query:  GLFKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSII
        GL K GYSEQALECFHQMQCSGI+PDYVSIIAVLAACADLG LTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCI FARQVFE+M KRTLVSWNSII
Subjt:  GLFKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSII

Query:  VGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACTHAGLVNKGLELFDNMKSVHRITPRIEHYGCIVDLYGRAGRLDDALNVIEEMPMKPNEVVLG
        VG+A NGFADESLEFFDAMQKEGFKPD VSYTGALTAC+HAGLVNKGLELFDNMK VHRI PRIEHYGCIVDLYGRAGRL+DAL+VIE+MPMKPNEVVLG
Subjt:  VGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACTHAGLVNKGLELFDNMKSVHRITPRIEHYGCIVDLYGRAGRLDDALNVIEEMPMKPNEVVLG

Query:  SLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVSGDKYHADADSIYSMLELLF
        SLLAACRTHGDVSLAERLMKHL KLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPG SSVEIDGKVHEFV+GDKYHADADSIYSML+LL 
Subjt:  SLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVSGDKYHADADSIYSMLELLF

Query:  HELKICGYVPDTDTILNTKESSKDH
        HELKICG VP+T+T LNTKESSKDH
Subjt:  HELKICGYVPDTDTILNTKESSKDH

A0A6J1HU31 pentatricopeptide repeat-containing protein At1g05750, chloroplastic | 2.0e-260 | 83.87
Query:  MSSIPSHIAIPSQH--QQCPNPPSSIPLSNPTKLNFPRSPNSSHRNISSKFTANSIDPIVLWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITL
        MSS+PSH  IP Q   QQ  NPPS IP SNP+ L+FPR+PNSS          N I PIVLWTSS+ARYCRN QL+EAAAEFTRMRLAGVEPNH+TFITL
Subjt:  MSSIPSHIAIPSQH--QQCPNPPSSIPLSNPTKLNFPRSPNSSHRNISSKFTANSIDPIVLWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITL

Query:  LSGCADFPSESLFFGSLLHGYARKFGLDTGHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTAL
        LSGCADFPS SL FG+ LHGY RK GLDTGHVMVGTALI MYAKCAQLGLAR VFDYL +KNSV+WNTMLDGY RNGEIELA++LFDEMPTRDAISWTAL
Subjt:  LSGCADFPSESLFFGSLLHGYARKFGLDTGHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTAL

Query:  INGLFKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNS
        ING  K GYSEQALECFH+MQCSGIEPDYVSIIAVLAACADLGAL+ GLWVNRF+MQQEFKDNIRISNSLIDMYSRCGCIEFARQVF+KM K TLVSWNS
Subjt:  INGLFKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNS

Query:  IIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACTHAGLVNKGLELFDNMKSVHRITPRIEHYGCIVDLYGRAGRLDDALNVIEEMPMKPNEVV
        +IVGFA+NGFADESLEFFDAMQKEGF  DGVSYTGALTAC+HAGLVNKGLELFDNMK VHRITPRIEHYGCIVDLY RAGRLD+ALNVIE MPMKPNEVV
Subjt:  IIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACTHAGLVNKGLELFDNMKSVHRITPRIEHYGCIVDLYGRAGRLDDALNVIEEMPMKPNEVV

Query:  LGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVSGDKYHADADSIYSMLEL
        LGSLLAACRTHGDVSLAERL+K+LF+LDPGGDS+YVLLSNIYAA+G+WEGANKVRRTMKARGVQKKPG SS+EIDGKVHEFV+GDKYH DAD+IYSMLE+
Subjt:  LGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVSGDKYHADADSIYSMLEL

Query:  LFHELKICGYVPDTDTILNTKESSKDH
        LFHELKI GYVP+T T +N  ESSK++
Subjt:  LFHELKICGYVPDTDTILNTKESSKDH

SwissProt top hits | e value | % identity | Alignment
O23337 Pentatricopeptide repeat-containing protein At4g14820 | 6.8e-101 | 38.01
Query:  IVLWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLSGC---ADFPSESLFFGSLLHGYARKFGLDTGHVMVGTALIDMYAKCAQLGLARKVF
        +V W + + RYCR G + EA   F  M+ + V P+ +    ++S C    +       +  L+    R   +DT H++  TAL+ MYA    + +AR+ F
Subjt:  IVLWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLSGC---ADFPSESLFFGSLLHGYARKFGLDTGHVMVGTALIDMYAKCAQLGLARKVF

Query:  DYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALINGLFKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFV
          + V+N      M+ GY++ G ++ A  +FD+   +D + WT +I+   +  Y ++AL  F +M CSGI+PD VS+ +V++ACA+LG L    WV+  +
Subjt:  DYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALINGLFKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFV

Query:  MQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACTHAGLVNKGLELFDN
             +  + I+N+LI+MY++CG ++  R VFEKMP+R +VSW+S+I   +++G A ++L  F  M++E  +P+ V++ G L  C+H+GLV +G ++F +
Subjt:  MQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACTHAGLVNKGLELFDN

Query:  MKSVHRITPRIEHYGCIVDLYGRAGRLDDALNVIEEMPMKPNEVVLGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVR
        M   + ITP++EHYGC+VDL+GRA  L +AL VIE MP+  N V+ GSL++ACR HG++ L +   K + +L+P  D   VL+SNIYA   +WE    +R
Subjt:  MKSVHRITPRIEHYGCIVDLYGRAGRLDDALNVIEEMPMKPNEVVLGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVR

Query:  RTMKARGVQKKPGCSSVEIDGKVHEFVSGDKYHADADSIYSMLELLFHELKICGYVPDTDTIL
        R M+ + V K+ G S ++ +GK HEF+ GDK H  ++ IY+ L+ +  +LK+ GYVPD  ++L
Subjt:  RTMKARGVQKKPGCSSVEIDGKVHEFVSGDKYHADADSIYSMLELLFHELKICGYVPDTDTIL

O82380 Pentatricopeptide repeat-containing protein At2g29760, chloroplastic | 1.9e-103 | 36.76
Query:  FTANSIDPIVLWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLSGCADFPSESLFFGSLLHGYARKFGLDTGHVMVGTALIDMYAKCAQLGL
        FT      +V W S +  + + G   +A   F +M    V+ +HVT + +LS CA     +L FG  +  Y  +  ++  ++ +  A++DMY KC  +  
Subjt:  FTANSIDPIVLWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLSGCADFPSESLFFGSLLHGYARKFGLDTGHVMVGTALIDMYAKCAQLGL

Query:  ARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALINGLFKHGYSEQALECFHQMQC-SGIEPDYVSIIAVLAACADLGALTLGL
        A+++FD +  K++V+W TMLDGY  + + E A ++ + MP +D ++W ALI+   ++G   +AL  FH++Q    ++ + +++++ L+ACA +GAL LG 
Subjt:  ARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALINGLFKHGYSEQALECFHQMQC-SGIEPDYVSIIAVLAACADLGALTLGL

Query:  WVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACTHAGLVNKG
        W++ ++ +   + N  ++++LI MYS+CG +E +R+VF  + KR +  W+++I G A++G  +E+++ F  MQ+   KP+GV++T    AC+H GLV++ 
Subjt:  WVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACTHAGLVNKG

Query:  LELFDNMKSVHRITPRIEHYGCIVDLYGRAGRLDDALNVIEEMPMKPNEVVLGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWE
          LF  M+S + I P  +HY CIVD+ GR+G L+ A+  IE MP+ P+  V G+LL AC+ H +++LAE     L +L+P  D  +VLLSNIYA +GKWE
Subjt:  LELFDNMKSVHRITPRIEHYGCIVDLYGRAGRLDDALNVIEEMPMKPNEVVLGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWE

Query:  GANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVSGDKYHADADSIYSMLELLFHELKICGYVPDTDTILNTKESSK
          +++R+ M+  G++K+PGCSS+EIDG +HEF+SGD  H  ++ +Y  L  +  +LK  GY P+   +L   E  +
Subjt:  GANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVSGDKYHADADSIYSMLELLFHELKICGYVPDTDTILNTKESSK

Q9LUJ2 Pentatricopeptide repeat-containing protein At3g22690 | 5.9e-105 | 38.11
Query:  RNISSKFTANSIDPIVLWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLSGCADFPSESLFFGSLLHGYARKFGLDTGHVMVGTALIDMYAK
        + +  ++ A+++D   L  +  + Y R G   EA   F  M  +GV P+ ++ ++ +S C+     ++ +G   HGY  + G ++    +  ALIDMY K
Subjt:  RNISSKFTANSIDPIVLWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLSGCADFPSESLFFGSLLHGYARKFGLDTGHVMVGTALIDMYAK

Query:  CAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALINGLFKHGYSEQALECFHQMQC-SGIEPDYVSIIAVLAACADLG
        C +   A ++FD +  K  V+WN+++ GY  NGE++ A + F+ MP ++ +SW  +I+GL +    E+A+E F  MQ   G+  D V+++++ +AC  LG
Subjt:  CAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALINGLFKHGYSEQALECFHQMQC-SGIEPDYVSIIAVLAACADLG

Query:  ALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACTHA
        AL L  W+  ++ +   + ++R+  +L+DM+SRCG  E A  +F  +  R + +W + I   A+ G A+ ++E FD M ++G KPDGV++ GALTAC+H 
Subjt:  ALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACTHA

Query:  GLVNKGLELFDNMKSVHRITPRIEHYGCIVDLYGRAGRLDDALNVIEEMPMKPNEVVLGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYA
        GLV +G E+F +M  +H ++P   HYGC+VDL GRAG L++A+ +IE+MPM+PN+V+  SLLAACR  G+V +A    + +  L P    +YVLLSN+YA
Subjt:  GLVNKGLELFDNMKSVHRITPRIEHYGCIVDLYGRAGRLDDALNVIEEMPMKPNEVVLGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYA

Query:  AIGKWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVSGDKYHADADSIYSMLELLFHELKICGYVPDTDTIL
        + G+W    KVR +MK +G++K PG SS++I GK HEF SGD+ H +  +I +ML+ +       G+VPD   +L
Subjt:  AIGKWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVSGDKYHADADSIYSMLELLFHELKICGYVPDTDTIL

Q9MA50 Pentatricopeptide repeat-containing protein At1g05750, chloroplastic | 7.6e-169 | 59.54
Query:  SHRNISS----KFTANSIDPIVLWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLSGCADFPSESLFFGSLLHGYARKFGLDTGHVMVGTAL
        +H+N ++    +   ++ +  V WTS +    RNG+L+EAA EF+ M LAGVEPNH+TFI LLSGC DF S S   G LLHGYA K GLD  HVMVGTA+
Subjt:  SHRNISS----KFTANSIDPIVLWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLSGCADFPSESLFFGSLLHGYARKFGLDTGHVMVGTAL

Query:  IDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALINGLFKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAA
        I MY+K  +   AR VFDY+  KNSV+WNTM+DGY R+G+++ A  +FD+MP RD ISWTA+ING  K GY E+AL  F +MQ SG++PDYV+IIA L A
Subjt:  IDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALINGLFKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAA

Query:  CADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALT
        C +LGAL+ GLWV+R+V+ Q+FK+N+R+SNSLID+Y RCGC+EFARQVF  M KRT+VSWNS+IVGFA NG A ESL +F  MQ++GFKPD V++TGALT
Subjt:  CADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALT

Query:  ACTHAGLVNKGLELFDNMKSVHRITPRIEHYGCIVDLYGRAGRLDDALNVIEEMPMKPNEVVLGSLLAACRTHG-DVSLAERLMKHLFKLDPGGDSNYVL
        AC+H GLV +GL  F  MK  +RI+PRIEHYGC+VDLY RAGRL+DAL +++ MPMKPNEVV+GSLLAAC  HG ++ LAERLMKHL  L+    SNYV+
Subjt:  ACTHAGLVNKGLELFDNMKSVHRITPRIEHYGCIVDLYGRAGRLDDALNVIEEMPMKPNEVVLGSLLAACRTHG-DVSLAERLMKHLFKLDPGGDSNYVL

Query:  LSNIYAAIGKWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVSGDKYHADADSIYSMLELLFHELKICGYVPDT
        LSN+YAA GKWEGA+K+RR MK  G++K+PG SS+EID  +H F++GD  H +   I  +LEL+  +L++ G V +T
Subjt:  LSNIYAAIGKWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVSGDKYHADADSIYSMLELLFHELKICGYVPDT

Q9SZT8 Pentatricopeptide repeat-containing protein ELI1, chloroplastic | 1.1e-103 | 39.03
Query:  SIDP-IVLWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLSGCADFPSESLFFGSLLHGYARKFGLDTGHVMVGTALIDMYAKCAQLGLARK
        +IDP + L+T+++     NG   +A   + ++  + + PN  TF +LL  C      S   G L+H +  KFGL      V T L+D+YAK   +  A+K
Subjt:  SIDP-IVLWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLSGCADFPSESLFFGSLLHGYARKFGLDTGHVMVGTALIDMYAKCAQLGLARK

Query:  VFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALINGLFKHGYSEQALECFHQMQCSG-IEPDYVSIIAVLAACADLGALTLGLWVN
        VFD +  ++ VS   M+  Y + G +E A  LFD M  RD +SW  +I+G  +HG+   AL  F ++   G  +PD ++++A L+AC+ +GAL  G W++
Subjt:  VFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALINGLFKHGYSEQALECFHQMQCSG-IEPDYVSIIAVLAACADLGALTLGLWVN

Query:  RFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQK-EGFKPDGVSYTGALTACTHAGLVNKGLE
         FV     + N+++   LIDMYS+CG +E A  VF   P++ +V+WN++I G+A++G++ ++L  F+ MQ   G +P  +++ G L AC HAGLVN+G+ 
Subjt:  RFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQK-EGFKPDGVSYTGALTACTHAGLVNKGLE

Query:  LFDNMKSVHRITPRIEHYGCIVDLYGRAGRLDDALNVIEEMPMKPNEVVLGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGA
        +F++M   + I P+IEHYGC+V L GRAG+L  A   I+ M M  + V+  S+L +C+ HGD  L + + ++L  L+      YVLLSNIYA++G +EG 
Subjt:  LFDNMKSVHRITPRIEHYGCIVDLYGRAGRLDDALNVIEEMPMKPNEVVLGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGA

Query:  NKVRRTMKARGVQKKPGCSSVEIDGKVHEFVSGDKYHADADSIYSMLELLFHELKICGYVPDTDTILNTKESSK
         KVR  MK +G+ K+PG S++EI+ KVHEF +GD+ H+ +  IY+ML  +   +K  GYVP+T+T+L   E ++
Subjt:  NKVRRTMKARGVQKKPGCSSVEIDGKVHEFVSGDKYHADADSIYSMLELLFHELKICGYVPDTDTILNTKESSK

Arabidopsis top hits | e value | % identity | Alignment
AT1G05750.1 Tetratricopeptide repeat (TPR)-like superfamily protein | 5.4e-170 | 59.54
Query:  SHRNISS----KFTANSIDPIVLWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLSGCADFPSESLFFGSLLHGYARKFGLDTGHVMVGTAL
        +H+N ++    +   ++ +  V WTS +    RNG+L+EAA EF+ M LAGVEPNH+TFI LLSGC DF S S   G LLHGYA K GLD  HVMVGTA+
Subjt:  SHRNISS----KFTANSIDPIVLWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLSGCADFPSESLFFGSLLHGYARKFGLDTGHVMVGTAL

Query:  IDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALINGLFKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAA
        I MY+K  +   AR VFDY+  KNSV+WNTM+DGY R+G+++ A  +FD+MP RD ISWTA+ING  K GY E+AL  F +MQ SG++PDYV+IIA L A
Subjt:  IDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALINGLFKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAA

Query:  CADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALT
        C +LGAL+ GLWV+R+V+ Q+FK+N+R+SNSLID+Y RCGC+EFARQVF  M KRT+VSWNS+IVGFA NG A ESL +F  MQ++GFKPD V++TGALT
Subjt:  CADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALT

Query:  ACTHAGLVNKGLELFDNMKSVHRITPRIEHYGCIVDLYGRAGRLDDALNVIEEMPMKPNEVVLGSLLAACRTHG-DVSLAERLMKHLFKLDPGGDSNYVL
        AC+H GLV +GL  F  MK  +RI+PRIEHYGC+VDLY RAGRL+DAL +++ MPMKPNEVV+GSLLAAC  HG ++ LAERLMKHL  L+    SNYV+
Subjt:  ACTHAGLVNKGLELFDNMKSVHRITPRIEHYGCIVDLYGRAGRLDDALNVIEEMPMKPNEVVLGSLLAACRTHG-DVSLAERLMKHLFKLDPGGDSNYVL

Query:  LSNIYAAIGKWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVSGDKYHADADSIYSMLELLFHELKICGYVPDT
        LSN+YAA GKWEGA+K+RR MK  G++K+PG SS+EID  +H F++GD  H +   I  +LEL+  +L++ G V +T
Subjt:  LSNIYAAIGKWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVSGDKYHADADSIYSMLELLFHELKICGYVPDT

AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein | 1.4e-104 | 36.76
Query:  FTANSIDPIVLWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLSGCADFPSESLFFGSLLHGYARKFGLDTGHVMVGTALIDMYAKCAQLGL
        FT      +V W S +  + + G   +A   F +M    V+ +HVT + +LS CA     +L FG  +  Y  +  ++  ++ +  A++DMY KC  +  
Subjt:  FTANSIDPIVLWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLSGCADFPSESLFFGSLLHGYARKFGLDTGHVMVGTALIDMYAKCAQLGL

Query:  ARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALINGLFKHGYSEQALECFHQMQC-SGIEPDYVSIIAVLAACADLGALTLGL
        A+++FD +  K++V+W TMLDGY  + + E A ++ + MP +D ++W ALI+   ++G   +AL  FH++Q    ++ + +++++ L+ACA +GAL LG 
Subjt:  ARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALINGLFKHGYSEQALECFHQMQC-SGIEPDYVSIIAVLAACADLGALTLGL

Query:  WVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACTHAGLVNKG
        W++ ++ +   + N  ++++LI MYS+CG +E +R+VF  + KR +  W+++I G A++G  +E+++ F  MQ+   KP+GV++T    AC+H GLV++ 
Subjt:  WVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACTHAGLVNKG

Query:  LELFDNMKSVHRITPRIEHYGCIVDLYGRAGRLDDALNVIEEMPMKPNEVVLGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWE
          LF  M+S + I P  +HY CIVD+ GR+G L+ A+  IE MP+ P+  V G+LL AC+ H +++LAE     L +L+P  D  +VLLSNIYA +GKWE
Subjt:  LELFDNMKSVHRITPRIEHYGCIVDLYGRAGRLDDALNVIEEMPMKPNEVVLGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWE

Query:  GANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVSGDKYHADADSIYSMLELLFHELKICGYVPDTDTILNTKESSK
          +++R+ M+  G++K+PGCSS+EIDG +HEF+SGD  H  ++ +Y  L  +  +LK  GY P+   +L   E  +
Subjt:  GANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVSGDKYHADADSIYSMLELLFHELKICGYVPDTDTILNTKESSK

AT3G22690.1 CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012881), Pentatricopeptide repeat (InterPro:IPR002885) | 4.2e-106 | 38.11
Query:  RNISSKFTANSIDPIVLWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLSGCADFPSESLFFGSLLHGYARKFGLDTGHVMVGTALIDMYAK
        + +  ++ A+++D   L  +  + Y R G   EA   F  M  +GV P+ ++ ++ +S C+     ++ +G   HGY  + G ++    +  ALIDMY K
Subjt:  RNISSKFTANSIDPIVLWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLSGCADFPSESLFFGSLLHGYARKFGLDTGHVMVGTALIDMYAK

Query:  CAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALINGLFKHGYSEQALECFHQMQC-SGIEPDYVSIIAVLAACADLG
        C +   A ++FD +  K  V+WN+++ GY  NGE++ A + F+ MP ++ +SW  +I+GL +    E+A+E F  MQ   G+  D V+++++ +AC  LG
Subjt:  CAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALINGLFKHGYSEQALECFHQMQC-SGIEPDYVSIIAVLAACADLG

Query:  ALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACTHA
        AL L  W+  ++ +   + ++R+  +L+DM+SRCG  E A  +F  +  R + +W + I   A+ G A+ ++E FD M ++G KPDGV++ GALTAC+H 
Subjt:  ALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACTHA

Query:  GLVNKGLELFDNMKSVHRITPRIEHYGCIVDLYGRAGRLDDALNVIEEMPMKPNEVVLGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYA
        GLV +G E+F +M  +H ++P   HYGC+VDL GRAG L++A+ +IE+MPM+PN+V+  SLLAACR  G+V +A    + +  L P    +YVLLSN+YA
Subjt:  GLVNKGLELFDNMKSVHRITPRIEHYGCIVDLYGRAGRLDDALNVIEEMPMKPNEVVLGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYA

Query:  AIGKWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVSGDKYHADADSIYSMLELLFHELKICGYVPDTDTIL
        + G+W    KVR +MK +G++K PG SS++I GK HEF SGD+ H +  +I +ML+ +       G+VPD   +L
Subjt:  AIGKWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVSGDKYHADADSIYSMLELLFHELKICGYVPDTDTIL

AT3G22690.2 INVOLVED IN: photosystem II assembly, regulation of chlorophyll biosynthetic process, photosystem I assembly, thylakoid membrane organization, RNA modification | e value: 4.2e-106 | %identity: 38.11
Query:  RNISSKFTANSIDPIVLWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLSGCADFPSESLFFGSLLHGYARKFGLDTGHVMVGTALIDMYAK
        + +  ++ A+++D   L  +  + Y R G   EA   F  M  +GV P+ ++ ++ +S C+     ++ +G   HGY  + G ++    +  ALIDMY K
Subjt:  RNISSKFTANSIDPIVLWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLSGCADFPSESLFFGSLLHGYARKFGLDTGHVMVGTALIDMYAK

Query:  CAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALINGLFKHGYSEQALECFHQMQC-SGIEPDYVSIIAVLAACADLG
        C +   A ++FD +  K  V+WN+++ GY  NGE++ A + F+ MP ++ +SW  +I+GL +    E+A+E F  MQ   G+  D V+++++ +AC  LG
Subjt:  CAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALINGLFKHGYSEQALECFHQMQC-SGIEPDYVSIIAVLAACADLG

Query:  ALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACTHA
        AL L  W+  ++ +   + ++R+  +L+DM+SRCG  E A  +F  +  R + +W + I   A+ G A+ ++E FD M ++G KPDGV++ GALTAC+H 
Subjt:  ALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACTHA

Query:  GLVNKGLELFDNMKSVHRITPRIEHYGCIVDLYGRAGRLDDALNVIEEMPMKPNEVVLGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYA
        GLV +G E+F +M  +H ++P   HYGC+VDL GRAG L++A+ +IE+MPM+PN+V+  SLLAACR  G+V +A    + +  L P    +YVLLSN+YA
Subjt:  GLVNKGLELFDNMKSVHRITPRIEHYGCIVDLYGRAGRLDDALNVIEEMPMKPNEVVLGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYA

Query:  AIGKWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVSGDKYHADADSIYSMLELLFHELKICGYVPDTDTIL
        + G+W    KVR +MK +G++K PG SS++I GK HEF SGD+ H +  +I +ML+ +       G+VPD   +L
Subjt:  AIGKWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVSGDKYHADADSIYSMLELLFHELKICGYVPDTDTIL

AT4G37380.1 Tetratricopeptide repeat (TPR)-like superfamily protein | e value: 7.9e-105 | %identity: 39.03
Query:  SIDP-IVLWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLSGCADFPSESLFFGSLLHGYARKFGLDTGHVMVGTALIDMYAKCAQLGLARK
        +IDP + L+T+++     NG   +A   + ++  + + PN  TF +LL  C      S   G L+H +  KFGL      V T L+D+YAK   +  A+K
Subjt:  SIDP-IVLWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLSGCADFPSESLFFGSLLHGYARKFGLDTGHVMVGTALIDMYAKCAQLGLARK

Query:  VFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALINGLFKHGYSEQALECFHQMQCSG-IEPDYVSIIAVLAACADLGALTLGLWVN
        VFD +  ++ VS   M+  Y + G +E A  LFD M  RD +SW  +I+G  +HG+   AL  F ++   G  +PD ++++A L+AC+ +GAL  G W++
Subjt:  VFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALINGLFKHGYSEQALECFHQMQCSG-IEPDYVSIIAVLAACADLGALTLGLWVN

Query:  RFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQK-EGFKPDGVSYTGALTACTHAGLVNKGLE
         FV     + N+++   LIDMYS+CG +E A  VF   P++ +V+WN++I G+A++G++ ++L  F+ MQ   G +P  +++ G L AC HAGLVN+G+ 
Subjt:  RFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQK-EGFKPDGVSYTGALTACTHAGLVNKGLE

Query:  LFDNMKSVHRITPRIEHYGCIVDLYGRAGRLDDALNVIEEMPMKPNEVVLGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGA
        +F++M   + I P+IEHYGC+V L GRAG+L  A   I+ M M  + V+  S+L +C+ HGD  L + + ++L  L+      YVLLSNIYA++G +EG 
Subjt:  LFDNMKSVHRITPRIEHYGCIVDLYGRAGRLDDALNVIEEMPMKPNEVVLGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGA

Query:  NKVRRTMKARGVQKKPGCSSVEIDGKVHEFVSGDKYHADADSIYSMLELLFHELKICGYVPDTDTILNTKESSK
         KVR  MK +G+ K+PG S++EI+ KVHEF +GD+ H+ +  IY+ML  +   +K  GYVP+T+T+L   E ++
Subjt:  NKVRRTMKARGVQKKPGCSSVEIDGKVHEFVSGDKYHADADSIYSMLELLFHELKICGYVPDTDTILNTKESSK


Sequences
CDS sequence
ATGAGCAGCATTCCTTCGCACATCGCCATTCCATCCCAACACCAACAATGTCCTAATCCGCCATCTTCAATTCCACTTTCAAACCCAACAAAACTCAACTTCCCCCGCTC
TCCCAATTCCTCACATCGCAATATCTCTTCCAAATTCACCGCCAATTCTATTGACCCCATTGTTCTATGGACTTCTTCTCTTGCTCGCTACTGCCGCAACGGCCAGTTAT
CCGAAGCCGCCGCAGAGTTTACGCGCATGAGACTCGCCGGAGTTGAGCCGAACCACGTCACATTCATTACACTTCTCTCCGGCTGTGCTGATTTTCCGTCAGAAAGCCTC
TTCTTCGGCTCTTTGCTTCATGGTTACGCCCGTAAATTTGGTTTGGATACAGGGCATGTAATGGTGGGGACTGCTTTGATTGATATGTATGCCAAATGTGCTCAATTGGG
TCTTGCTAGGAAGGTTTTTGATTACTTAGGCGTGAAAAACTCTGTCTCTTGGAACACGATGCTCGATGGTTACACGAGGAATGGAGAGATTGAGTTGGCACTTGACCTGT
TTGATGAAATGCCTACAAGAGATGCAATTTCTTGGACGGCTTTGATTAACGGTCTTTTCAAACATGGGTACTCTGAACAAGCATTGGAGTGCTTCCATCAGATGCAATGC
TCGGGTATAGAGCCTGATTATGTGTCTATAATTGCTGTTCTCGCTGCGTGTGCTGATTTGGGCGCGCTTACTTTGGGGTTGTGGGTTAATCGGTTTGTTATGCAGCAGGA
GTTTAAGGATAATATAAGGATAAGTAATTCCTTGATAGATATGTATTCTCGATGTGGATGTATTGAGTTTGCACGCCAAGTGTTTGAGAAAATGCCCAAGCGAACTTTGG
TATCTTGGAACTCTATCATTGTGGGGTTTGCAGTTAATGGGTTTGCAGATGAATCTCTGGAGTTTTTTGATGCGATGCAGAAGGAGGGATTCAAGCCAGATGGAGTTAGC
TACACGGGAGCTCTTACTGCATGTACCCATGCTGGCTTAGTGAACAAGGGCTTGGAATTGTTTGATAACATGAAGAGCGTACACAGAATTACTCCCAGGATTGAGCATTA
TGGATGTATTGTCGACCTCTATGGTCGTGCAGGGAGGTTAGACGATGCATTGAATGTGATTGAGGAAATGCCGATGAAACCGAATGAAGTCGTGTTGGGGTCGTTGCTGG
CTGCCTGCAGGACTCATGGTGATGTGAGCCTGGCTGAAAGGTTAATGAAACATCTCTTTAAGTTAGATCCAGGAGGCGATTCAAATTATGTGCTTCTTTCAAACATATAT
GCAGCAATTGGGAAGTGGGAAGGAGCTAACAAGGTCAGGAGAACGATGAAAGCCCGAGGCGTGCAGAAAAAACCGGGTTGTAGTTCTGTTGAGATTGATGGTAAGGTTCA
TGAGTTTGTTTCTGGTGACAAATACCATGCTGATGCAGACAGTATTTACTCAATGTTAGAGCTGTTGTTTCATGAATTAAAGATATGTGGCTATGTTCCTGATACCGACA
CCATTCTGAATACCAAAGAATCTAGTAAAGACCATTGA
mRNA sequence
ATGAGCAGCATTCCTTCGCACATCGCCATTCCATCCCAACACCAACAATGTCCTAATCCGCCATCTTCAATTCCACTTTCAAACCCAACAAAACTCAACTTCCCCCGCTC
TCCCAATTCCTCACATCGCAATATCTCTTCCAAATTCACCGCCAATTCTATTGACCCCATTGTTCTATGGACTTCTTCTCTTGCTCGCTACTGCCGCAACGGCCAGTTAT
CCGAAGCCGCCGCAGAGTTTACGCGCATGAGACTCGCCGGAGTTGAGCCGAACCACGTCACATTCATTACACTTCTCTCCGGCTGTGCTGATTTTCCGTCAGAAAGCCTC
TTCTTCGGCTCTTTGCTTCATGGTTACGCCCGTAAATTTGGTTTGGATACAGGGCATGTAATGGTGGGGACTGCTTTGATTGATATGTATGCCAAATGTGCTCAATTGGG
TCTTGCTAGGAAGGTTTTTGATTACTTAGGCGTGAAAAACTCTGTCTCTTGGAACACGATGCTCGATGGTTACACGAGGAATGGAGAGATTGAGTTGGCACTTGACCTGT
TTGATGAAATGCCTACAAGAGATGCAATTTCTTGGACGGCTTTGATTAACGGTCTTTTCAAACATGGGTACTCTGAACAAGCATTGGAGTGCTTCCATCAGATGCAATGC
TCGGGTATAGAGCCTGATTATGTGTCTATAATTGCTGTTCTCGCTGCGTGTGCTGATTTGGGCGCGCTTACTTTGGGGTTGTGGGTTAATCGGTTTGTTATGCAGCAGGA
GTTTAAGGATAATATAAGGATAAGTAATTCCTTGATAGATATGTATTCTCGATGTGGATGTATTGAGTTTGCACGCCAAGTGTTTGAGAAAATGCCCAAGCGAACTTTGG
TATCTTGGAACTCTATCATTGTGGGGTTTGCAGTTAATGGGTTTGCAGATGAATCTCTGGAGTTTTTTGATGCGATGCAGAAGGAGGGATTCAAGCCAGATGGAGTTAGC
TACACGGGAGCTCTTACTGCATGTACCCATGCTGGCTTAGTGAACAAGGGCTTGGAATTGTTTGATAACATGAAGAGCGTACACAGAATTACTCCCAGGATTGAGCATTA
TGGATGTATTGTCGACCTCTATGGTCGTGCAGGGAGGTTAGACGATGCATTGAATGTGATTGAGGAAATGCCGATGAAACCGAATGAAGTCGTGTTGGGGTCGTTGCTGG
CTGCCTGCAGGACTCATGGTGATGTGAGCCTGGCTGAAAGGTTAATGAAACATCTCTTTAAGTTAGATCCAGGAGGCGATTCAAATTATGTGCTTCTTTCAAACATATAT
GCAGCAATTGGGAAGTGGGAAGGAGCTAACAAGGTCAGGAGAACGATGAAAGCCCGAGGCGTGCAGAAAAAACCGGGTTGTAGTTCTGTTGAGATTGATGGTAAGGTTCA
TGAGTTTGTTTCTGGTGACAAATACCATGCTGATGCAGACAGTATTTACTCAATGTTAGAGCTGTTGTTTCATGAATTAAAGATATGTGGCTATGTTCCTGATACCGACA
CCATTCTGAATACCAAAGAATCTAGTAAAGACCATTGA
Protein sequence
MSSIPSHIAIPSQHQQCPNPPSSIPLSNPTKLNFPRSPNSSHRNISSKFTANSIDPIVLWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLSGCADFPSESL
FFGSLLHGYARKFGLDTGHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALINGLFKHGYSEQALECFHQMQC
SGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVS
YTGALTACTHAGLVNKGLELFDNMKSVHRITPRIEHYGCIVDLYGRAGRLDDALNVIEEMPMKPNEVVLGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIY
AAIGKWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVSGDKYHADADSIYSMLELLFHELKICGYVPDTDTILNTKESSKDH
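
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code applies to this gene; the names `CODON_TABLE` and `translate` are hypothetical helpers introduced here for illustration and are not part of CuGenDB.

```python
from itertools import product

# Standard genetic code, built in TCAG order (assumption: the standard
# nuclear code is the appropriate table for this gene).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so the wrapped
    sequence lines can be pasted in as they appear on the page.
    Codons containing characters other than T/C/A/G become 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First and last codons of the CDS listed above.
cds_start = "ATGAGCAGCATTCCTTCGCACATCGCC"
cds_end = "GAATCTAGTAAAGACCATTGA"
print(translate(cds_start))  # MSSIPSHIA
print(translate(cds_end))    # ESSKDH
```

Passing the full CDS text to `translate` and comparing the result with the protein sequence (with line breaks removed) gives a quick consistency check between the two records.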