CuGenDBv2

CcUC01G011630 (gene) of Watermelon (PI 537277) v1 genome

Gene ID: CcUC01G011630
Organism: Citrullus colocynthis (Watermelon (PI 537277) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: CicolChr01:19090222..19091799
RNA-Seq Expression: CcUC01G011630
Synteny: CcUC01G011630
Gene Ontology terms:
GO:0009451 - RNA modification (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domains:
IPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
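
The genome location in this record is written as a sequence name, a colon, and a start and end coordinate separated by two dots. The following is a minimal sketch of how such a string could be split and the span length derived. The function name `parse_location` is hypothetical, and the sketch assumes the coordinates are 1-based and inclusive at both ends, which this page does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match["seqid"], int(match["start"]), int(match["end"])


seqid, start, end = parse_location("CicolChr01:19090222..19091799")

# Assumption: coordinates are 1-based and inclusive, so the span
# covers (end - start + 1) bases.
span_length = end - start + 1
print(seqid, start, end, span_length)
```

Under that assumption the gene spans 1578 bases on CicolChr01. If the coordinates were 0-based or half-open, the length would differ by one.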


Homology
GenBank top hits (e value, % identity, alignment)
XP_004139593.1 pentatricopeptide repeat-containing protein At1g05750, chloroplastic [Cucumis sativus] (e value: 3.0e-271; % identity: 88.36)
Query:  MSSIPSHTAIPSQLQQYPNPPSPIPLSNPTKLNFPRSPNSSHHNISSKFTANSIDPIVQWTSSLAHYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLS
        MSSIPSHTA PSQLQ  P  PS IPLSNPTKLNFPRSPNS H NISSKF  NS+DPIV WTSSLA YCRNGQLSEAAAEFTRMRLAGVEPNH+TFITLLS
Subjt:  MSSIPSHTAIPSQLQQYPNPPSPIPLSNPTKLNFPRSPNSSHHNISSKFTANSIDPIVQWTSSLAHYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLS

Query:  GCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALIN
         CADFPSES FF SSLHGYA K+GLDT HVMVGTALIDMY+KCAQLG ARKVF  LGVKNSVSWNTML+G+ RNGEIELA+ LFDEMPTRDAISWTALIN
Subjt:  GCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALIN

Query:  GLLKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSII
        GLLKHGYSEQALECFHQMQ SG+  DYVSIIAVLAACADLGALTLGLWV+RFVM QEFKDNI+ISNSLIDMYSRCGCIEFARQVF KM KRTLVSWNSII
Subjt:  GLLKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSII

Query:  VGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLG
        VGFAVNGFADESLEFF AMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMK VH++TPRIEHYGCIVDLYGRAGRLEDALN+IEEMPMKPNEVVLG
Subjt:  VGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLG

Query:  SLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRQTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLF
        SLLAACRTHGDV+LAERLMKHLFKLDP GD+ YVLLSNIYAAIGKW+GAN VR+TMKARGVQKKPG SSVEIDGKVHEFVAGD YH DAD+IYSML+LL 
Subjt:  SLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRQTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLF

Query:  HELKICGYVPNTDTIMNTKESSKD
        HELK+CGYVP +DTI+NTKES+KD
Subjt:  HELKICGYVPNTDTIMNTKESSKD
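
The middle line of each alignment block marks how the two sequences compare at each position. The sketch below counts identities in a single block. It assumes the usual BLAST midline convention, in which a letter marks an identical residue, `+` a conservative substitution, and a blank a mismatch. The helper `block_identity` is hypothetical, and the result covers only one block, so it will not match the % identity column, which the search tool reports for the full alignment.

```python
def block_identity(query: str, midline: str) -> tuple[int, int]:
    """Return (identical_positions, aligned_length) for one alignment block."""
    # Pad the midline in case trailing spaces were trimmed from the text.
    midline = midline.ljust(len(query))
    # A position is identical when the midline repeats the query residue.
    identical = sum(1 for q, m in zip(query, midline) if m == q and q != "-")
    return identical, len(query)


# Final block of the first GenBank hit, copied from the alignment above.
query = "HELKICGYVPNTDTIMNTKESSKD"
midline = "HELK+CGYVP +DTI+NTKES+KD"

identical, length = block_identity(query, midline)
percent = 100.0 * identical / length
print(f"{identical}/{length} identical ({percent:.2f}%)")
```

For this block, 19 of the 24 aligned positions are identical. Summing the counts over all blocks of a hit would give a figure comparable to the reported % identity.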

XP_008458940.1 PREDICTED: pentatricopeptide repeat-containing protein At1g05750, chloroplastic [Cucumis melo] (e value: 1.6e-269; % identity: 88.36)
Query:  MSSIPSHTAIPSQLQQYPNPPSPIPLSNPTKLNFPRSPNSSHHNISSKFTANSIDPIVQWTSSLAHYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLS
        MSSIPSH A PSQLQQ P+  S IPLSNPTK+NFPRSP S H NI SKFTANS+ PIVQWTSS+A YC NGQL EAAAEFTRMRLAGVEPNH+TFITLLS
Subjt:  MSSIPSHTAIPSQLQQYPNPPSPIPLSNPTKLNFPRSPNSSHHNISSKFTANSIDPIVQWTSSLAHYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLS

Query:  GCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALIN
        GCADFPSES FF SSLHGYA KFGLDT HVMVGTALIDMY+KC+QLGLA+KVFDYLGVKNSVSWNTML+G+ RNGEIELA+ LFDEMPTRDAISWTALIN
Subjt:  GCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALIN

Query:  GLLKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSII
        GLLKHGYSEQALECFHQMQ SG+  DYVSIIAVLAACADLGALT GLWVNRFVMQQEFKDN+RISNSLIDMYSRCGCIEFARQVF KM KRTLVSWNSII
Subjt:  GLLKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSII

Query:  VGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLG
        VGFA NGFADESLEFF AMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRVH++TP IEHYGCIVDLYGRAGRLEDA NVIEEMPMKPNEVVLG
Subjt:  VGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLG

Query:  SLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRQTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLF
        SLLAACRTHGDV LAERLMKH+FKLD  GDSNYVLLSNIYAAIGKWEGANKVR+TMKARGVQKK G SSVEIDGKVHEFVAGDKYH DAD+IYSML+LLF
Subjt:  SLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRQTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLF

Query:  HELKICGYVPNTDTIMNTKESSKD
        HELK+CGYVP+TD I+NTK+S+KD
Subjt:  HELKICGYVPNTDTIMNTKESSKD

XP_022142716.1 pentatricopeptide repeat-containing protein At1g05750, chloroplastic [Momordica charantia] (e value: 7.6e-275; % identity: 88.93)
Query:  MSSIPSHTAIPSQLQQYPNPPSPIPLSNPTKLNFPRSPNSSHHNISSKFTANSIDPIVQWTSSLAHYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLS
        MSSIP++TA   QLQQYPNP + IPL NP  +NFPRS NSS+ +ISSK T NSIDPIV WTSS+A YCRNGQL+EAAAEFT MRLAGVEPNHVT ITLLS
Subjt:  MSSIPSHTAIPSQLQQYPNPPSPIPLSNPTKLNFPRSPNSSHHNISSKFTANSIDPIVQWTSSLAHYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLS

Query:  GCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALIN
        GCADFPSESL+FGSSLHGYARK GLDT HVMVGT+++DMYAKCAQLGLAR+VFDYL +KNSVSWNTMLDGYTRNGEIELA+DLFDEMPTRDAISWTALIN
Subjt:  GCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALIN

Query:  GLLKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSII
        GLLK GYSEQALECFHQMQCSGI+PDYVSIIAVLAACADLG LTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCI FARQVFE+M KRTLVSWNSII
Subjt:  GLLKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSII

Query:  VGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLG
        VG+A NGFADESLEFFDAMQKEGFKPD VSYTGALTACSHAGLVNKGLELFDNMKRVHR+ PRIEHYGCIVDLYGRAGRLEDAL+VIE+MPMKPNEVVLG
Subjt:  VGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLG

Query:  SLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRQTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLF
        SLLAACRTHGDVSLAERLMKHL KLDPGGDSNYVLLSNIYAAIGKWEGANKVR+TMKARGVQKKPG SSVEIDGKVHEFVAGDKYH DADSIYSML+LL 
Subjt:  SLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRQTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLF

Query:  HELKICGYVPNTDTIMNTKESSKD
        HELKICG VP T+T +NTKESSKD
Subjt:  HELKICGYVPNTDTIMNTKESSKD

XP_022967078.1 pentatricopeptide repeat-containing protein At1g05750, chloroplastic [Cucurbita maxima] (e value: 2.5e-262; % identity: 84.79)
Query:  MSSIPSHTAIP--SQLQQYPNPPSPIPLSNPTKLNFPRSPNSSHHNISSKFTANSIDPIVQWTSSLAHYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITL
        MSS+PSHT IP   QLQQY NPPSPIP SNP+ L+FPR+PNSS          N I PIV WTSS+A YCRN QL+EAAAEFTRMRLAGVEPNH+TFITL
Subjt:  MSSIPSHTAIP--SQLQQYPNPPSPIPLSNPTKLNFPRSPNSSHHNISSKFTANSIDPIVQWTSSLAHYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITL

Query:  LSGCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTAL
        LSGCADFPS SL FG+SLHGY RK GLDT HVMVGTALI MYAKCAQLGLAR VFDYL +KNSV+WNTMLDGY RNGEIELA++LFDEMPTRDAISWTAL
Subjt:  LSGCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTAL

Query:  INGLLKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNS
        ING LK GYSEQALECFH+MQCSGIEPDYVSIIAVLAACADLGAL+ GLWVNRF+MQQEFKDNIRISNSLIDMYSRCGCIEFARQVF+KM K TLVSWNS
Subjt:  INGLLKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNS

Query:  IIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVV
        +IVGFA+NGFADESLEFFDAMQKEGF  DGVSYTGALTACSHAGLVNKGLELFDNMKRVHR+TPRIEHYGCIVDLY RAGRL++ALNVIE MPMKPNEVV
Subjt:  IIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVV

Query:  LGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRQTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLEL
        LGSLLAACRTHGDVSLAERL+K+LF+LDPGGDS+YVLLSNIYAA+G+WEGANKVR+TMKARGVQKKPG SS+EIDGKVHEFVAGDKYH DAD+IYSMLE+
Subjt:  LGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRQTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLEL

Query:  LFHELKICGYVPNTDTIMNTKESSKD
        LFHELKI GYVP T T MN  ESSK+
Subjt:  LFHELKICGYVPNTDTIMNTKESSKD

XP_038877228.1 pentatricopeptide repeat-containing protein At1g05750, chloroplastic [Benincasa hispida] (e value: 6.0e-288; % identity: 92.75)
Query:  MSSIPSHTAIPSQLQQYPNPPSPIPLSNPTKLNFPRSPNSSHHNISSKFTANSIDPIVQWTSSLAHYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLS
        MSS PS+TAIPSQLQQYPNPPS IPLSNPTKLNFPRSPNSSH NISSKF ANSIDPIV WTSSLA YCRNGQLSEAA EFTRMRLAGVEPNHVTFITLLS
Subjt:  MSSIPSHTAIPSQLQQYPNPPSPIPLSNPTKLNFPRSPNSSHHNISSKFTANSIDPIVQWTSSLAHYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLS

Query:  GCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALIN
        GC DFPSESLFFGSSLHGYARK GLDT HVMVGTAL+DMYAKCAQ  LARKVFDYLG+KNSV+WNTMLDGYTRNGEIELA+DLFDEMPTRDAISWTALIN
Subjt:  GCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALIN

Query:  GLLKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSII
        GLLK G+SEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSL+DMYSRCGCIEFARQVFEKMPKRTLVSWNSII
Subjt:  GLLKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSII

Query:  VGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLG
        VGFAVNGFADESLEFFDAMQ EGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKR+H++TPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLG
Subjt:  VGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLG

Query:  SLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRQTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLF
        SLLAACRT+GDVSLAE+LMKHL KLDP GDSNYVLLSNIYAAIG+WEGANKVR+TMKARGVQKKPGCSSVEIDGKVHEFVAGDKYH DAD+IYSMLELLF
Subjt:  SLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRQTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLF

Query:  HELKICGYVPNTDTIMNTKESSKD
        HELKI GYVP+T+ I+NTKE SKD
Subjt:  HELKICGYVPNTDTIMNTKESSKD

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LYD6 Uncharacterized protein (e value: 1.4e-271; % identity: 88.36)
Query:  MSSIPSHTAIPSQLQQYPNPPSPIPLSNPTKLNFPRSPNSSHHNISSKFTANSIDPIVQWTSSLAHYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLS
        MSSIPSHTA PSQLQ  P  PS IPLSNPTKLNFPRSPNS H NISSKF  NS+DPIV WTSSLA YCRNGQLSEAAAEFTRMRLAGVEPNH+TFITLLS
Subjt:  MSSIPSHTAIPSQLQQYPNPPSPIPLSNPTKLNFPRSPNSSHHNISSKFTANSIDPIVQWTSSLAHYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLS

Query:  GCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALIN
         CADFPSES FF SSLHGYA K+GLDT HVMVGTALIDMY+KCAQLG ARKVF  LGVKNSVSWNTML+G+ RNGEIELA+ LFDEMPTRDAISWTALIN
Subjt:  GCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALIN

Query:  GLLKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSII
        GLLKHGYSEQALECFHQMQ SG+  DYVSIIAVLAACADLGALTLGLWV+RFVM QEFKDNI+ISNSLIDMYSRCGCIEFARQVF KM KRTLVSWNSII
Subjt:  GLLKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSII

Query:  VGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLG
        VGFAVNGFADESLEFF AMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMK VH++TPRIEHYGCIVDLYGRAGRLEDALN+IEEMPMKPNEVVLG
Subjt:  VGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLG

Query:  SLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRQTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLF
        SLLAACRTHGDV+LAERLMKHLFKLDP GD+ YVLLSNIYAAIGKW+GAN VR+TMKARGVQKKPG SSVEIDGKVHEFVAGD YH DAD+IYSML+LL 
Subjt:  SLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRQTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLF

Query:  HELKICGYVPNTDTIMNTKESSKD
        HELK+CGYVP +DTI+NTKES+KD
Subjt:  HELKICGYVPNTDTIMNTKESSKD

A0A1S3C956 pentatricopeptide repeat-containing protein At1g05750, chloroplastic (e value: 7.9e-270; % identity: 88.36)
Query:  MSSIPSHTAIPSQLQQYPNPPSPIPLSNPTKLNFPRSPNSSHHNISSKFTANSIDPIVQWTSSLAHYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLS
        MSSIPSH A PSQLQQ P+  S IPLSNPTK+NFPRSP S H NI SKFTANS+ PIVQWTSS+A YC NGQL EAAAEFTRMRLAGVEPNH+TFITLLS
Subjt:  MSSIPSHTAIPSQLQQYPNPPSPIPLSNPTKLNFPRSPNSSHHNISSKFTANSIDPIVQWTSSLAHYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLS

Query:  GCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALIN
        GCADFPSES FF SSLHGYA KFGLDT HVMVGTALIDMY+KC+QLGLA+KVFDYLGVKNSVSWNTML+G+ RNGEIELA+ LFDEMPTRDAISWTALIN
Subjt:  GCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALIN

Query:  GLLKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSII
        GLLKHGYSEQALECFHQMQ SG+  DYVSIIAVLAACADLGALT GLWVNRFVMQQEFKDN+RISNSLIDMYSRCGCIEFARQVF KM KRTLVSWNSII
Subjt:  GLLKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSII

Query:  VGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLG
        VGFA NGFADESLEFF AMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRVH++TP IEHYGCIVDLYGRAGRLEDA NVIEEMPMKPNEVVLG
Subjt:  VGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLG

Query:  SLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRQTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLF
        SLLAACRTHGDV LAERLMKH+FKLD  GDSNYVLLSNIYAAIGKWEGANKVR+TMKARGVQKK G SSVEIDGKVHEFVAGDKYH DAD+IYSML+LLF
Subjt:  SLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRQTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLF

Query:  HELKICGYVPNTDTIMNTKESSKD
        HELK+CGYVP+TD I+NTK+S+KD
Subjt:  HELKICGYVPNTDTIMNTKESSKD

A0A5A7UJB6 Pentatricopeptide repeat-containing protein (e value: 7.9e-270; % identity: 88.36)
Query:  MSSIPSHTAIPSQLQQYPNPPSPIPLSNPTKLNFPRSPNSSHHNISSKFTANSIDPIVQWTSSLAHYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLS
        MSSIPSH A PSQLQQ P+  S IPLSNPTK+NFPRSP S H NI SKFTANS+ PIVQWTSS+A YC NGQL EAAAEFTRMRLAGVEPNH+TFITLLS
Subjt:  MSSIPSHTAIPSQLQQYPNPPSPIPLSNPTKLNFPRSPNSSHHNISSKFTANSIDPIVQWTSSLAHYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLS

Query:  GCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALIN
        GCADFPSES FF SSLHGYA KFGLDT HVMVGTALIDMY+KC+QLGLA+KVFDYLGVKNSVSWNTML+G+ RNGEIELA+ LFDEMPTRDAISWTALIN
Subjt:  GCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALIN

Query:  GLLKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSII
        GLLKHGYSEQALECFHQMQ SG+  DYVSIIAVLAACADLGALT GLWVNRFVMQQEFKDN+RISNSLIDMYSRCGCIEFARQVF KM KRTLVSWNSII
Subjt:  GLLKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSII

Query:  VGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLG
        VGFA NGFADESLEFF AMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRVH++TP IEHYGCIVDLYGRAGRLEDA NVIEEMPMKPNEVVLG
Subjt:  VGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLG

Query:  SLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRQTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLF
        SLLAACRTHGDV LAERLMKH+FKLD  GDSNYVLLSNIYAAIGKWEGANKVR+TMKARGVQKK G SSVEIDGKVHEFVAGDKYH DAD+IYSML+LLF
Subjt:  SLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRQTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLF

Query:  HELKICGYVPNTDTIMNTKESSKD
        HELK+CGYVP+TD I+NTK+S+KD
Subjt:  HELKICGYVPNTDTIMNTKESSKD

A0A6J1CN07 pentatricopeptide repeat-containing protein At1g05750, chloroplastic (e value: 3.7e-275; % identity: 88.93)
Query:  MSSIPSHTAIPSQLQQYPNPPSPIPLSNPTKLNFPRSPNSSHHNISSKFTANSIDPIVQWTSSLAHYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLS
        MSSIP++TA   QLQQYPNP + IPL NP  +NFPRS NSS+ +ISSK T NSIDPIV WTSS+A YCRNGQL+EAAAEFT MRLAGVEPNHVT ITLLS
Subjt:  MSSIPSHTAIPSQLQQYPNPPSPIPLSNPTKLNFPRSPNSSHHNISSKFTANSIDPIVQWTSSLAHYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLS

Query:  GCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALIN
        GCADFPSESL+FGSSLHGYARK GLDT HVMVGT+++DMYAKCAQLGLAR+VFDYL +KNSVSWNTMLDGYTRNGEIELA+DLFDEMPTRDAISWTALIN
Subjt:  GCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALIN

Query:  GLLKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSII
        GLLK GYSEQALECFHQMQCSGI+PDYVSIIAVLAACADLG LTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCI FARQVFE+M KRTLVSWNSII
Subjt:  GLLKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSII

Query:  VGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLG
        VG+A NGFADESLEFFDAMQKEGFKPD VSYTGALTACSHAGLVNKGLELFDNMKRVHR+ PRIEHYGCIVDLYGRAGRLEDAL+VIE+MPMKPNEVVLG
Subjt:  VGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLG

Query:  SLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRQTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLF
        SLLAACRTHGDVSLAERLMKHL KLDPGGDSNYVLLSNIYAAIGKWEGANKVR+TMKARGVQKKPG SSVEIDGKVHEFVAGDKYH DADSIYSML+LL 
Subjt:  SLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRQTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLF

Query:  HELKICGYVPNTDTIMNTKESSKD
        HELKICG VP T+T +NTKESSKD
Subjt:  HELKICGYVPNTDTIMNTKESSKD

A0A6J1HU31 pentatricopeptide repeat-containing protein At1g05750, chloroplastic (e value: 1.2e-262; % identity: 84.79)
Query:  MSSIPSHTAIP--SQLQQYPNPPSPIPLSNPTKLNFPRSPNSSHHNISSKFTANSIDPIVQWTSSLAHYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITL
        MSS+PSHT IP   QLQQY NPPSPIP SNP+ L+FPR+PNSS          N I PIV WTSS+A YCRN QL+EAAAEFTRMRLAGVEPNH+TFITL
Subjt:  MSSIPSHTAIP--SQLQQYPNPPSPIPLSNPTKLNFPRSPNSSHHNISSKFTANSIDPIVQWTSSLAHYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITL

Query:  LSGCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTAL
        LSGCADFPS SL FG+SLHGY RK GLDT HVMVGTALI MYAKCAQLGLAR VFDYL +KNSV+WNTMLDGY RNGEIELA++LFDEMPTRDAISWTAL
Subjt:  LSGCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTAL

Query:  INGLLKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNS
        ING LK GYSEQALECFH+MQCSGIEPDYVSIIAVLAACADLGAL+ GLWVNRF+MQQEFKDNIRISNSLIDMYSRCGCIEFARQVF+KM K TLVSWNS
Subjt:  INGLLKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNS

Query:  IIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVV
        +IVGFA+NGFADESLEFFDAMQKEGF  DGVSYTGALTACSHAGLVNKGLELFDNMKRVHR+TPRIEHYGCIVDLY RAGRL++ALNVIE MPMKPNEVV
Subjt:  IIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVV

Query:  LGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRQTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLEL
        LGSLLAACRTHGDVSLAERL+K+LF+LDPGGDS+YVLLSNIYAA+G+WEGANKVR+TMKARGVQKKPG SS+EIDGKVHEFVAGDKYH DAD+IYSMLE+
Subjt:  LGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRQTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLEL

Query:  LFHELKICGYVPNTDTIMNTKESSKD
        LFHELKI GYVP T T MN  ESSK+
Subjt:  LFHELKICGYVPNTDTIMNTKESSKD

SwissProt top hits (e value, % identity, alignment)
O82380 Pentatricopeptide repeat-containing protein At2g29760, chloroplastic (e value: 7.2e-103; % identity: 36.34)
Query:  FTANSIDPIVQWTSSLAHYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLSGCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGL
        FT      +V W S +  + + G   +A   F +M    V+ +HVT + +LS CA     +L FG  +  Y  +  ++  ++ +  A++DMY KC  +  
Subjt:  FTANSIDPIVQWTSSLAHYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLSGCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGL

Query:  ARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQC-SGIEPDYVSIIAVLAACADLGALTLGL
        A+++FD +  K++V+W TMLDGY  + + E A ++ + MP +D ++W ALI+   ++G   +AL  FH++Q    ++ + +++++ L+ACA +GAL LG 
Subjt:  ARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQC-SGIEPDYVSIIAVLAACADLGALTLGL

Query:  WVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKG
        W++ ++ +   + N  ++++LI MYS+CG +E +R+VF  + KR +  W+++I G A++G  +E+++ F  MQ+   KP+GV++T    ACSH GLV++ 
Subjt:  WVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKG

Query:  LELFDNMKRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWE
          LF  M+  + + P  +HY CIVD+ GR+G LE A+  IE MP+ P+  V G+LL AC+ H +++LAE     L +L+P  D  +VLLSNIYA +GKWE
Subjt:  LELFDNMKRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWE

Query:  GANKVRQTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLFHELKICGYVPNTDTIMNTKESSK
          +++R+ M+  G++K+PGCSS+EIDG +HEF++GD  H  ++ +Y  L  +  +LK  GY P    ++   E  +
Subjt:  GANKVRQTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLFHELKICGYVPNTDTIMNTKESSK

Q9LUJ2 Pentatricopeptide repeat-containing protein At3g22690 (e value: 2.8e-107; % identity: 39.7)
Query:  AHYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLSGCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSW
        ++Y R G   EA   F  M  +GV P+ ++ ++ +S C+     ++ +G S HGY  + G ++W   +  ALIDMY KC +   A ++FD +  K  V+W
Subjt:  AHYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLSGCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSW

Query:  NTMLDGYTRNGEIELALDLFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQC-SGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIR
        N+++ GY  NGE++ A + F+ MP ++ +SW  +I+GL++    E+A+E F  MQ   G+  D V+++++ +AC  LGAL L  W+  ++ +   + ++R
Subjt:  NTMLDGYTRNGEIELALDLFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQC-SGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIR

Query:  ISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRVHRVTPR
        +  +L+DM+SRCG  E A  +F  +  R + +W + I   A+ G A+ ++E FD M ++G KPDGV++ GALTACSH GLV +G E+F +M ++H V+P 
Subjt:  ISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRVHRVTPR

Query:  IEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRQTMKARGVQK
          HYGC+VDL GRAG LE+A+ +IE+MPM+PN+V+  SLLAACR  G+V +A    + +  L P    +YVLLSN+YA+ G+W    KVR +MK +G++K
Subjt:  IEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRQTMKARGVQK

Query:  KPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLFHELKICGYVPN-TDTIMNTKESSK
         PG SS++I GK HEF +GD+ H +  +I +ML+ +       G+VP+ ++ +M+  E  K
Subjt:  KPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLFHELKICGYVPN-TDTIMNTKESSK

Q9MA50 Pentatricopeptide repeat-containing protein At1g05750, chloroplastic (e value: 2.9e-168; % identity: 60.17)
Query:  SPNSSHHNISSKFTANSIDPIVQWTSSLAHYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLSGCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTAL
        +P    HN S+  T       V WTS +    RNG+L+EAA EF+ M LAGVEPNH+TFI LLSGC DF S S   G  LHGYA K GLD  HVMVGTA+
Subjt:  SPNSSHHNISSKFTANSIDPIVQWTSSLAHYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLSGCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTAL

Query:  IDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAA
        I MY+K  +   AR VFDY+  KNSV+WNTM+DGY R+G+++ A  +FD+MP RD ISWTA+ING +K GY E+AL  F +MQ SG++PDYV+IIA L A
Subjt:  IDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAA

Query:  CADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALT
        C +LGAL+ GLWV+R+V+ Q+FK+N+R+SNSLID+Y RCGC+EFARQVF  M KRT+VSWNS+IVGFA NG A ESL +F  MQ++GFKPD V++TGALT
Subjt:  CADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALT

Query:  ACSHAGLVNKGLELFDNMKRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLGSLLAACRTHG-DVSLAERLMKHLFKLDPGGDSNYVL
        ACSH GLV +GL  F  MK  +R++PRIEHYGC+VDLY RAGRLEDAL +++ MPMKPNEVV+GSLLAAC  HG ++ LAERLMKHL  L+    SNYV+
Subjt:  ACSHAGLVNKGLELFDNMKRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLGSLLAACRTHG-DVSLAERLMKHLFKLDPGGDSNYVL

Query:  LSNIYAAIGKWEGANKVRQTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLFHELKICGYVPNT
        LSN+YAA GKWEGA+K+R+ MK  G++K+PG SS+EID  +H F+AGD  H +   I  +LEL+  +L++ G V  T
Subjt:  LSNIYAAIGKWEGANKVRQTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLFHELKICGYVPNT

Q9SIT7 Pentatricopeptide repeat-containing protein At2g13600 (e value: 6.8e-101; % identity: 40.09)
Query:  IVQWTSSLAHYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLSGCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGLARKVFDYL
        +V W S +  + +NG   EA   F  M  + VEP+ VT  +++S CA     ++  G  +HG   K       +++  A +DMYAKC+++  AR +FD +
Subjt:  IVQWTSSLAHYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLSGCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGLARKVFDYL

Query:  GVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQ
         ++N ++  +M+ GY      + A  +F +M  R+ +SW ALI G  ++G +E+AL  F  ++   + P + S   +L ACADL  L LG+  +  V++ 
Subjt:  GVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQ

Query:  EFK------DNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLEL
         FK      D+I + NSLIDMY +CGC+E    VF KM +R  VSWN++I+GFA NG+ +E+LE F  M + G KPD ++  G L+AC HAG V +G   
Subjt:  EFK------DNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLEL

Query:  FDNMKRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGAN
        F +M R   V P  +HY C+VDL GRAG LE+A ++IEEMPM+P+ V+ GSLLAAC+ H +++L + + + L +++P     YVLLSN+YA +GKWE   
Subjt:  FDNMKRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGAN

Query:  KVRQTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLFHELK
         VR++M+  GV K+PGCS ++I G  H F+  DK H     I+S+L++L  E++
Subjt:  KVRQTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLFHELK

Q9SZT8 Pentatricopeptide repeat-containing protein ELI1, chloroplastic (e value: 2.5e-103; % identity: 38.61)
Query:  SIDP-IVQWTSSLAHYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLSGCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGLARK
        +IDP +  +T+++     NG   +A   + ++  + + PN  TF +LL  C      S   G  +H +  KFGL      V T L+D+YAK   +  A+K
Subjt:  SIDP-IVQWTSSLAHYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLSGCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGLARK

Query:  VFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQCSG-IEPDYVSIIAVLAACADLGALTLGLWVN
        VFD +  ++ VS   M+  Y + G +E A  LFD M  RD +SW  +I+G  +HG+   AL  F ++   G  +PD ++++A L+AC+ +GAL  G W++
Subjt:  VFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQCSG-IEPDYVSIIAVLAACADLGALTLGLWVN

Query:  RFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQK-EGFKPDGVSYTGALTACSHAGLVNKGLE
         FV     + N+++   LIDMYS+CG +E A  VF   P++ +V+WN++I G+A++G++ ++L  F+ MQ   G +P  +++ G L AC+HAGLVN+G+ 
Subjt:  RFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQK-EGFKPDGVSYTGALTACSHAGLVNKGLE

Query:  LFDNMKRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGA
        +F++M + + + P+IEHYGC+V L GRAG+L+ A   I+ M M  + V+  S+L +C+ HGD  L + + ++L  L+      YVLLSNIYA++G +EG 
Subjt:  LFDNMKRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGA

Query:  NKVRQTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLFHELKICGYVPNTDTIMNTKESSK
         KVR  MK +G+ K+PG S++EI+ KVHEF AGD+ H+ +  IY+ML  +   +K  GYVPNT+T++   E ++
Subjt:  NKVRQTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLFHELKICGYVPNTDTIMNTKESSK

Arabidopsis top hits (e value, % identity, alignment)
AT1G05750.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 2.1e-169; % identity: 60.17)
Query:  SPNSSHHNISSKFTANSIDPIVQWTSSLAHYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLSGCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTAL
        +P    HN S+  T       V WTS +    RNG+L+EAA EF+ M LAGVEPNH+TFI LLSGC DF S S   G  LHGYA K GLD  HVMVGTA+
Subjt:  SPNSSHHNISSKFTANSIDPIVQWTSSLAHYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLSGCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTAL

Query:  IDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAA
        I MY+K  +   AR VFDY+  KNSV+WNTM+DGY R+G+++ A  +FD+MP RD ISWTA+ING +K GY E+AL  F +MQ SG++PDYV+IIA L A
Subjt:  IDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAA

Query:  CADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALT
        C +LGAL+ GLWV+R+V+ Q+FK+N+R+SNSLID+Y RCGC+EFARQVF  M KRT+VSWNS+IVGFA NG A ESL +F  MQ++GFKPD V++TGALT
Subjt:  CADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALT

Query:  ACSHAGLVNKGLELFDNMKRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLGSLLAACRTHG-DVSLAERLMKHLFKLDPGGDSNYVL
        ACSH GLV +GL  F  MK  +R++PRIEHYGC+VDLY RAGRLEDAL +++ MPMKPNEVV+GSLLAAC  HG ++ LAERLMKHL  L+    SNYV+
Subjt:  ACSHAGLVNKGLELFDNMKRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLGSLLAACRTHG-DVSLAERLMKHLFKLDPGGDSNYVL

Query:  LSNIYAAIGKWEGANKVRQTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLFHELKICGYVPNT
        LSN+YAA GKWEGA+K+R+ MK  G++K+PG SS+EID  +H F+AGD  H +   I  +LEL+  +L++ G V  T
Subjt:  LSNIYAAIGKWEGANKVRQTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLFHELKICGYVPNT

AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 5.1e-104; % identity: 36.34)
Query:  FTANSIDPIVQWTSSLAHYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLSGCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGL
        FT      +V W S +  + + G   +A   F +M    V+ +HVT + +LS CA     +L FG  +  Y  +  ++  ++ +  A++DMY KC  +  
Subjt:  FTANSIDPIVQWTSSLAHYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLSGCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGL

Query:  ARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQC-SGIEPDYVSIIAVLAACADLGALTLGL
        A+++FD +  K++V+W TMLDGY  + + E A ++ + MP +D ++W ALI+   ++G   +AL  FH++Q    ++ + +++++ L+ACA +GAL LG 
Subjt:  ARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQC-SGIEPDYVSIIAVLAACADLGALTLGL

Query:  WVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKG
        W++ ++ +   + N  ++++LI MYS+CG +E +R+VF  + KR +  W+++I G A++G  +E+++ F  MQ+   KP+GV++T    ACSH GLV++ 
Subjt:  WVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKG

Query:  LELFDNMKRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWE
          LF  M+  + + P  +HY CIVD+ GR+G LE A+  IE MP+ P+  V G+LL AC+ H +++LAE     L +L+P  D  +VLLSNIYA +GKWE
Subjt:  LELFDNMKRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWE

Query:  GANKVRQTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLFHELKICGYVPNTDTIMNTKESSK
          +++R+ M+  G++K+PGCSS+EIDG +HEF++GD  H  ++ +Y  L  +  +LK  GY P    ++   E  +
Subjt:  GANKVRQTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLFHELKICGYVPNTDTIMNTKESSK

AT3G22690.1 CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012881), Pentatricopeptide repeat (InterPro:IPR002885) (e value: 2.0e-108; % identity: 39.7)
Query:  AHYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLSGCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSW
        ++Y R G   EA   F  M  +GV P+ ++ ++ +S C+     ++ +G S HGY  + G ++W   +  ALIDMY KC +   A ++FD +  K  V+W
Subjt:  AHYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLSGCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSW

Query:  NTMLDGYTRNGEIELALDLFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQC-SGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIR
        N+++ GY  NGE++ A + F+ MP ++ +SW  +I+GL++    E+A+E F  MQ   G+  D V+++++ +AC  LGAL L  W+  ++ +   + ++R
Subjt:  NTMLDGYTRNGEIELALDLFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQC-SGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIR

Query:  ISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRVHRVTPR
        +  +L+DM+SRCG  E A  +F  +  R + +W + I   A+ G A+ ++E FD M ++G KPDGV++ GALTACSH GLV +G E+F +M ++H V+P 
Subjt:  ISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRVHRVTPR

Query:  IEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRQTMKARGVQK
          HYGC+VDL GRAG LE+A+ +IE+MPM+PN+V+  SLLAACR  G+V +A    + +  L P    +YVLLSN+YA+ G+W    KVR +MK +G++K
Subjt:  IEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRQTMKARGVQK

Query:  KPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLFHELKICGYVPN-TDTIMNTKESSK
         PG SS++I GK HEF +GD+ H +  +I +ML+ +       G+VP+ ++ +M+  E  K
Subjt:  KPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLFHELKICGYVPN-TDTIMNTKESSK

AT3G22690.2 INVOLVED IN: photosystem II assembly, regulation of chlorophyll biosynthetic process, photosystem I assembly, thylakoid membrane organization, RNA modification    e value: 2.0e-108    %identity: 39.7
Query:  AHYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLSGCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSW
        ++Y R G   EA   F  M  +GV P+ ++ ++ +S C+     ++ +G S HGY  + G ++W   +  ALIDMY KC +   A ++FD +  K  V+W
Subjt:  AHYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLSGCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSW

Query:  NTMLDGYTRNGEIELALDLFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQC-SGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIR
        N+++ GY  NGE++ A + F+ MP ++ +SW  +I+GL++    E+A+E F  MQ   G+  D V+++++ +AC  LGAL L  W+  ++ +   + ++R
Subjt:  NTMLDGYTRNGEIELALDLFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQC-SGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIR

Query:  ISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRVHRVTPR
        +  +L+DM+SRCG  E A  +F  +  R + +W + I   A+ G A+ ++E FD M ++G KPDGV++ GALTACSH GLV +G E+F +M ++H V+P 
Subjt:  ISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRVHRVTPR

Query:  IEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRQTMKARGVQK
          HYGC+VDL GRAG LE+A+ +IE+MPM+PN+V+  SLLAACR  G+V +A    + +  L P    +YVLLSN+YA+ G+W    KVR +MK +G++K
Subjt:  IEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRQTMKARGVQK

Query:  KPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLFHELKICGYVPN-TDTIMNTKESSK
         PG SS++I GK HEF +GD+ H +  +I +ML+ +       G+VP+ ++ +M+  E  K
Subjt:  KPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLFHELKICGYVPN-TDTIMNTKESSK

AT4G37380.1 Tetratricopeptide repeat (TPR)-like superfamily protein    e value: 1.8e-104    %identity: 38.61
Query:  SIDP-IVQWTSSLAHYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLSGCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGLARK
        +IDP +  +T+++     NG   +A   + ++  + + PN  TF +LL  C      S   G  +H +  KFGL      V T L+D+YAK   +  A+K
Subjt:  SIDP-IVQWTSSLAHYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLSGCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGLARK

Query:  VFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQCSG-IEPDYVSIIAVLAACADLGALTLGLWVN
        VFD +  ++ VS   M+  Y + G +E A  LFD M  RD +SW  +I+G  +HG+   AL  F ++   G  +PD ++++A L+AC+ +GAL  G W++
Subjt:  VFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQCSG-IEPDYVSIIAVLAACADLGALTLGLWVN

Query:  RFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQK-EGFKPDGVSYTGALTACSHAGLVNKGLE
         FV     + N+++   LIDMYS+CG +E A  VF   P++ +V+WN++I G+A++G++ ++L  F+ MQ   G +P  +++ G L AC+HAGLVN+G+ 
Subjt:  RFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQK-EGFKPDGVSYTGALTACSHAGLVNKGLE

Query:  LFDNMKRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGA
        +F++M + + + P+IEHYGC+V L GRAG+L+ A   I+ M M  + V+  S+L +C+ HGD  L + + ++L  L+      YVLLSNIYA++G +EG 
Subjt:  LFDNMKRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGA

Query:  NKVRQTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLFHELKICGYVPNTDTIMNTKESSK
         KVR  MK +G+ K+PG S++EI+ KVHEF AGD+ H+ +  IY+ML  +   +K  GYVPNT+T++   E ++
Subjt:  NKVRQTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLFHELKICGYVPNTDTIMNTKESSK


Sequences
CDS sequence
ATGAGCAGCATTCCTTCTCACACCGCCATTCCATCACAACTCCAACAATATCCCAATCCGCCTTCTCCAATCCCACTTTCAAATCCAACAAAACTCAACTTCCCC
CGCTCTCCCAATTCTTCACATCACAATATCTCCTCCAAATTCACCGCCAATTCTATTGACCCCATTGTTCAATGGACCTCTTCTCTTGCTCACTACTGCCGCAAT
GGCCAATTATCCGAAGCCGCCGCAGAGTTTACACGCATGAGACTCGCCGGAGTTGAGCCAAACCACGTCACATTCATTACCCTTCTCTCCGGCTGCGCTGATTTT
CCATCAGAAAGCCTCTTCTTCGGCTCTTCCCTTCATGGCTACGCCCGTAAATTTGGCTTGGATACATGGCATGTAATGGTGGGGACTGCTTTGATTGATATGTAT
GCCAAATGTGCTCAATTGGGTCTTGCTAGGAAGGTTTTTGATTACCTAGGCGTGAAGAACTCTGTCTCTTGGAACACGATGCTCGATGGTTACACAAGGAATGGA
GAGATTGAATTGGCACTTGACCTGTTTGATGAAATGCCTACAAGAGATGCGATTTCTTGGACGGCTTTGATTAACGGTCTCTTGAAACATGGATACTCTGAACAA
GCATTGGAGTGCTTCCATCAGATGCAATGCTCCGGTATCGAGCCTGATTATGTGTCTATAATTGCTGTTCTCGCTGCGTGTGCTGATTTGGGCGCGCTTACTTTA
GGGTTATGGGTTAATAGGTTTGTTATGCAGCAGGAGTTTAAGGATAATATTAGGATAAGTAATTCCTTGATAGATATGTATTCTCGATGTGGATGTATTGAGTTT
GCCCGCCAAGTGTTTGAGAAAATGCCCAAGCGAACTTTGGTATCTTGGAACTCTATCATTGTGGGGTTTGCAGTTAATGGATTTGCTGATGAATCTCTGGAGTTC
TTTGATGCAATGCAGAAGGAAGGATTCAAGCCAGATGGAGTAAGCTACACGGGAGCTCTTACTGCATGTAGCCATGCTGGCTTAGTGAATAAGGGCCTCGAATTG
TTTGATAACATGAAGAGGGTACACAGAGTTACTCCCAGGATTGAGCATTATGGATGTATTGTTGACCTCTATGGTCGTGCAGGAAGGTTAGAGGATGCATTGAAT
GTGATTGAGGAAATGCCGATGAAACCGAATGAAGTTGTGTTGGGGTCGTTGTTGGCTGCCTGCAGGACTCATGGTGATGTGAGCCTGGCTGAAAGGTTAATGAAA
CATCTCTTTAAGTTGGATCCAGGAGGCGATTCAAATTATGTGCTTCTTTCAAACATATATGCAGCAATTGGAAAGTGGGAAGGTGCTAACAAGGTTAGGCAAACG
ATGAAAGCTCGAGGCGTGCAGAAAAAACCGGGTTGTAGTTCTGTTGAAATTGATGGTAAGGTTCATGAGTTTGTTGCTGGTGACAAATACCATACTGATGCAGAC
AGTATTTACTCAATGTTGGAGCTGCTGTTTCATGAACTAAAGATATGTGGCTATGTTCCTAATACTGATACCATTATGAATACCAAAGAATCTAGTAAAGATCGT
TGA
mRNA sequence
ATGAGCAGCATTCCTTCTCACACCGCCATTCCATCACAACTCCAACAATATCCCAATCCGCCTTCTCCAATCCCACTTTCAAATCCAACAAAACTCAACTTCCCC
CGCTCTCCCAATTCTTCACATCACAATATCTCCTCCAAATTCACCGCCAATTCTATTGACCCCATTGTTCAATGGACCTCTTCTCTTGCTCACTACTGCCGCAAT
GGCCAATTATCCGAAGCCGCCGCAGAGTTTACACGCATGAGACTCGCCGGAGTTGAGCCAAACCACGTCACATTCATTACCCTTCTCTCCGGCTGCGCTGATTTT
CCATCAGAAAGCCTCTTCTTCGGCTCTTCCCTTCATGGCTACGCCCGTAAATTTGGCTTGGATACATGGCATGTAATGGTGGGGACTGCTTTGATTGATATGTAT
GCCAAATGTGCTCAATTGGGTCTTGCTAGGAAGGTTTTTGATTACCTAGGCGTGAAGAACTCTGTCTCTTGGAACACGATGCTCGATGGTTACACAAGGAATGGA
GAGATTGAATTGGCACTTGACCTGTTTGATGAAATGCCTACAAGAGATGCGATTTCTTGGACGGCTTTGATTAACGGTCTCTTGAAACATGGATACTCTGAACAA
GCATTGGAGTGCTTCCATCAGATGCAATGCTCCGGTATCGAGCCTGATTATGTGTCTATAATTGCTGTTCTCGCTGCGTGTGCTGATTTGGGCGCGCTTACTTTA
GGGTTATGGGTTAATAGGTTTGTTATGCAGCAGGAGTTTAAGGATAATATTAGGATAAGTAATTCCTTGATAGATATGTATTCTCGATGTGGATGTATTGAGTTT
GCCCGCCAAGTGTTTGAGAAAATGCCCAAGCGAACTTTGGTATCTTGGAACTCTATCATTGTGGGGTTTGCAGTTAATGGATTTGCTGATGAATCTCTGGAGTTC
TTTGATGCAATGCAGAAGGAAGGATTCAAGCCAGATGGAGTAAGCTACACGGGAGCTCTTACTGCATGTAGCCATGCTGGCTTAGTGAATAAGGGCCTCGAATTG
TTTGATAACATGAAGAGGGTACACAGAGTTACTCCCAGGATTGAGCATTATGGATGTATTGTTGACCTCTATGGTCGTGCAGGAAGGTTAGAGGATGCATTGAAT
GTGATTGAGGAAATGCCGATGAAACCGAATGAAGTTGTGTTGGGGTCGTTGTTGGCTGCCTGCAGGACTCATGGTGATGTGAGCCTGGCTGAAAGGTTAATGAAA
CATCTCTTTAAGTTGGATCCAGGAGGCGATTCAAATTATGTGCTTCTTTCAAACATATATGCAGCAATTGGAAAGTGGGAAGGTGCTAACAAGGTTAGGCAAACG
ATGAAAGCTCGAGGCGTGCAGAAAAAACCGGGTTGTAGTTCTGTTGAAATTGATGGTAAGGTTCATGAGTTTGTTGCTGGTGACAAATACCATACTGATGCAGAC
AGTATTTACTCAATGTTGGAGCTGCTGTTTCATGAACTAAAGATATGTGGCTATGTTCCTAATACTGATACCATTATGAATACCAAAGAATCTAGTAAAGATCGT
TGA
Protein sequence
MSSIPSHTAIPSQLQQYPNPPSPIPLSNPTKLNFPRSPNSSHHNISSKFTANSIDPIVQWTSSLAHYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLSGCADF
PSESLFFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALINGLLKHGYSEQ
ALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEF
FDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLGSLLAACRTHGDVSLAERLMK
HLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRQTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLFHELKICGYVPNTDTIMNTKESSKDR