; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc01G11660 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc01G11660
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionPentatricopeptide repeat-containing protein
Genome locationClcChr01:18244473..18246176
RNA-Seq ExpressionClc01G11660
SyntenyClc01G11660
Gene Ontology termsGO:0009451 - RNA modification (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004139593.1 pentatricopeptide repeat-containing protein At1g05750, chloroplastic [Cucumis sativus]1.6e-26987.98Show/hide
Query:  MSSIPSHTAIPSQHQQYPNPPSPIPLSNPTNLNFPRSPNSSHHNISSKFTANSIDPIVQWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLS
        MSSIPSHTA PSQ Q  P  PS IPLSNPT LNFPRSPNS H NISSKF  NS+DPIV WTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNH+TFITLLS
Subjt:  MSSIPSHTAIPSQHQQYPNPPSPIPLSNPTNLNFPRSPNSSHHNISSKFTANSIDPIVQWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLS

Query:  GCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALIN
         CADFPSES FF SSLHGYA K+GLDT HVMVGTALIDMY+KCAQLG ARKVF  LGVKNSVSWNTML+G+ RNGEIELA+ LFDEMPTRDAISWTALIN
Subjt:  GCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALIN

Query:  GLLKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSII
        GLLKHGYSEQALECFHQMQ SG+  DYVSIIAVLAACADLGALTLGLWV+RFVM QEFKDNI+ISNSLIDMYSRCGCIEFARQVF KM KRTLVSWNSII
Subjt:  GLLKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSII

Query:  VGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLG
        VGFAVNGFADESLEFF AMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMK VH++TPRIEHYGCIVDLYGRAGRLEDALN+IEEMPMKPNEVVLG
Subjt:  VGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLG

Query:  SLLAACRTHGDVSIAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLF
        SLLAACRTHGDV++AERLMKHLFKLDP GD+ YVLLSNIYAAIGKW+GAN VRRTMKARGVQKKPG SSVEIDGKVHEFVAGD YH DAD+IYSML+LL 
Subjt:  SLLAACRTHGDVSIAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLF

Query:  HELKICGYVPDTDIIMNTKESSKD
        HELK+CGYVP +D I+NTKES+KD
Subjt:  HELKICGYVPDTDIIMNTKESSKD

XP_008458940.1 PREDICTED: pentatricopeptide repeat-containing protein At1g05750, chloroplastic [Cucumis melo]9.6e-27088.55Show/hide
Query:  MSSIPSHTAIPSQHQQYPNPPSPIPLSNPTNLNFPRSPNSSHHNISSKFTANSIDPIVQWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLS
        MSSIPSH A PSQ QQ P+  S IPLSNPT +NFPRSP S H NI SKFTANS+ PIVQWTSS+ARYC NGQL EAAAEFTRMRLAGVEPNH+TFITLLS
Subjt:  MSSIPSHTAIPSQHQQYPNPPSPIPLSNPTNLNFPRSPNSSHHNISSKFTANSIDPIVQWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLS

Query:  GCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALIN
        GCADFPSES FF SSLHGYA KFGLDT HVMVGTALIDMY+KC+QLGLA+KVFDYLGVKNSVSWNTML+G+ RNGEIELA+ LFDEMPTRDAISWTALIN
Subjt:  GCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALIN

Query:  GLLKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSII
        GLLKHGYSEQALECFHQMQ SG+  DYVSIIAVLAACADLGALT GLWVNRFVMQQEFKDN+RISNSLIDMYSRCGCIEFARQVF KM KRTLVSWNSII
Subjt:  GLLKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSII

Query:  VGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLG
        VGFA NGFADESLEFF AMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRVH++TP IEHYGCIVDLYGRAGRLEDA NVIEEMPMKPNEVVLG
Subjt:  VGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLG

Query:  SLLAACRTHGDVSIAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLF
        SLLAACRTHGDV +AERLMKH+FKLD  GDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKK G SSVEIDGKVHEFVAGDKYH DAD+IYSML+LLF
Subjt:  SLLAACRTHGDVSIAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLF

Query:  HELKICGYVPDTDIIMNTKESSKD
        HELK+CGYVPDTDII+NTK+S+KD
Subjt:  HELKICGYVPDTDIIMNTKESSKD

XP_022142716.1 pentatricopeptide repeat-containing protein At1g05750, chloroplastic [Momordica charantia]3.8e-27488.74Show/hide
Query:  MSSIPSHTAIPSQHQQYPNPPSPIPLSNPTNLNFPRSPNSSHHNISSKFTANSIDPIVQWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLS
        MSSIP++TA   Q QQYPNP + IPL NP  +NFPRS NSS+ +ISSK T NSIDPIV WTSS+ARYCRNGQL+EAAAEFT MRLAGVEPNHVT ITLLS
Subjt:  MSSIPSHTAIPSQHQQYPNPPSPIPLSNPTNLNFPRSPNSSHHNISSKFTANSIDPIVQWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLS

Query:  GCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALIN
        GCADFPSESL+FGSSLHGYARK GLDT HVMVGT+++DMYAKCAQLGLAR+VFDYL +KNSVSWNTMLDGYTRNGEIELA+DLFDEMPTRDAISWTALIN
Subjt:  GCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALIN

Query:  GLLKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSII
        GLLK GYSEQALECFHQMQCSGI+PDYVSIIAVLAACADLG LTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCI FARQVFE+M KRTLVSWNSII
Subjt:  GLLKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSII

Query:  VGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLG
        VG+A NGFADESLEFFDAMQKEGFKPD VSYTGALTACSHAGLVNKGLELFDNMKRVHR+ PRIEHYGCIVDLYGRAGRLEDAL+VIE+MPMKPNEVVLG
Subjt:  VGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLG

Query:  SLLAACRTHGDVSIAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLF
        SLLAACRTHGDVS+AERLMKHL KLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPG SSVEIDGKVHEFVAGDKYH DADSIYSML+LL 
Subjt:  SLLAACRTHGDVSIAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLF

Query:  HELKICGYVPDTDIIMNTKESSKD
        HELKICG VP+T+  +NTKESSKD
Subjt:  HELKICGYVPDTDIIMNTKESSKD

XP_022967078.1 pentatricopeptide repeat-containing protein At1g05750, chloroplastic [Cucurbita maxima]1.9e-26284.79Show/hide
Query:  MSSIPSHTAIPSQH--QQYPNPPSPIPLSNPTNLNFPRSPNSSHHNISSKFTANSIDPIVQWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITL
        MSS+PSHT IP Q   QQY NPPSPIP SNP+NL+FPR+PNSS          N I PIV WTSS+ARYCRN QL+EAAAEFTRMRLAGVEPNH+TFITL
Subjt:  MSSIPSHTAIPSQH--QQYPNPPSPIPLSNPTNLNFPRSPNSSHHNISSKFTANSIDPIVQWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITL

Query:  LSGCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTAL
        LSGCADFPS SL FG+SLHGY RK GLDT HVMVGTALI MYAKCAQLGLAR VFDYL +KNSV+WNTMLDGY RNGEIELA++LFDEMPTRDAISWTAL
Subjt:  LSGCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTAL

Query:  INGLLKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNS
        ING LK GYSEQALECFH+MQCSGIEPDYVSIIAVLAACADLGAL+ GLWVNRF+MQQEFKDNIRISNSLIDMYSRCGCIEFARQVF+KM K TLVSWNS
Subjt:  INGLLKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNS

Query:  IIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVV
        +IVGFA+NGFADESLEFFDAMQKEGF  DGVSYTGALTACSHAGLVNKGLELFDNMKRVHR+TPRIEHYGCIVDLY RAGRL++ALNVIE MPMKPNEVV
Subjt:  IIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVV

Query:  LGSLLAACRTHGDVSIAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLEL
        LGSLLAACRTHGDVS+AERL+K+LF+LDPGGDS+YVLLSNIYAA+G+WEGANKVRRTMKARGVQKKPG SS+EIDGKVHEFVAGDKYH DAD+IYSMLE+
Subjt:  LGSLLAACRTHGDVSIAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLEL

Query:  LFHELKICGYVPDTDIIMNTKESSKD
        LFHELKI GYVP+T   MN  ESSK+
Subjt:  LFHELKICGYVPDTDIIMNTKESSKD

XP_038877228.1 pentatricopeptide repeat-containing protein At1g05750, chloroplastic [Benincasa hispida]3.5e-28892.94Show/hide
Query:  MSSIPSHTAIPSQHQQYPNPPSPIPLSNPTNLNFPRSPNSSHHNISSKFTANSIDPIVQWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLS
        MSS PS+TAIPSQ QQYPNPPS IPLSNPT LNFPRSPNSSH NISSKF ANSIDPIV WTSSLARYCRNGQLSEAA EFTRMRLAGVEPNHVTFITLLS
Subjt:  MSSIPSHTAIPSQHQQYPNPPSPIPLSNPTNLNFPRSPNSSHHNISSKFTANSIDPIVQWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLS

Query:  GCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALIN
        GC DFPSESLFFGSSLHGYARK GLDT HVMVGTAL+DMYAKCAQ  LARKVFDYLG+KNSV+WNTMLDGYTRNGEIELA+DLFDEMPTRDAISWTALIN
Subjt:  GCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALIN

Query:  GLLKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSII
        GLLK G+SEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSL+DMYSRCGCIEFARQVFEKMPKRTLVSWNSII
Subjt:  GLLKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSII

Query:  VGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLG
        VGFAVNGFADESLEFFDAMQ EGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKR+H++TPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLG
Subjt:  VGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLG

Query:  SLLAACRTHGDVSIAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLF
        SLLAACRT+GDVS+AE+LMKHL KLDP GDSNYVLLSNIYAAIG+WEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYH DAD+IYSMLELLF
Subjt:  SLLAACRTHGDVSIAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLF

Query:  HELKICGYVPDTDIIMNTKESSKD
        HELKI GYVPDT+II+NTKE SKD
Subjt:  HELKICGYVPDTDIIMNTKESSKD

TrEMBL top hitse value%identityAlignment
A0A0A0LYD6 Uncharacterized protein7.9e-27087.98Show/hide
Query:  MSSIPSHTAIPSQHQQYPNPPSPIPLSNPTNLNFPRSPNSSHHNISSKFTANSIDPIVQWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLS
        MSSIPSHTA PSQ Q  P  PS IPLSNPT LNFPRSPNS H NISSKF  NS+DPIV WTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNH+TFITLLS
Subjt:  MSSIPSHTAIPSQHQQYPNPPSPIPLSNPTNLNFPRSPNSSHHNISSKFTANSIDPIVQWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLS

Query:  GCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALIN
         CADFPSES FF SSLHGYA K+GLDT HVMVGTALIDMY+KCAQLG ARKVF  LGVKNSVSWNTML+G+ RNGEIELA+ LFDEMPTRDAISWTALIN
Subjt:  GCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALIN

Query:  GLLKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSII
        GLLKHGYSEQALECFHQMQ SG+  DYVSIIAVLAACADLGALTLGLWV+RFVM QEFKDNI+ISNSLIDMYSRCGCIEFARQVF KM KRTLVSWNSII
Subjt:  GLLKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSII

Query:  VGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLG
        VGFAVNGFADESLEFF AMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMK VH++TPRIEHYGCIVDLYGRAGRLEDALN+IEEMPMKPNEVVLG
Subjt:  VGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLG

Query:  SLLAACRTHGDVSIAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLF
        SLLAACRTHGDV++AERLMKHLFKLDP GD+ YVLLSNIYAAIGKW+GAN VRRTMKARGVQKKPG SSVEIDGKVHEFVAGD YH DAD+IYSML+LL 
Subjt:  SLLAACRTHGDVSIAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLF

Query:  HELKICGYVPDTDIIMNTKESSKD
        HELK+CGYVP +D I+NTKES+KD
Subjt:  HELKICGYVPDTDIIMNTKESSKD

A0A1S3C956 pentatricopeptide repeat-containing protein At1g05750, chloroplastic4.6e-27088.55Show/hide
Query:  MSSIPSHTAIPSQHQQYPNPPSPIPLSNPTNLNFPRSPNSSHHNISSKFTANSIDPIVQWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLS
        MSSIPSH A PSQ QQ P+  S IPLSNPT +NFPRSP S H NI SKFTANS+ PIVQWTSS+ARYC NGQL EAAAEFTRMRLAGVEPNH+TFITLLS
Subjt:  MSSIPSHTAIPSQHQQYPNPPSPIPLSNPTNLNFPRSPNSSHHNISSKFTANSIDPIVQWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLS

Query:  GCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALIN
        GCADFPSES FF SSLHGYA KFGLDT HVMVGTALIDMY+KC+QLGLA+KVFDYLGVKNSVSWNTML+G+ RNGEIELA+ LFDEMPTRDAISWTALIN
Subjt:  GCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALIN

Query:  GLLKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSII
        GLLKHGYSEQALECFHQMQ SG+  DYVSIIAVLAACADLGALT GLWVNRFVMQQEFKDN+RISNSLIDMYSRCGCIEFARQVF KM KRTLVSWNSII
Subjt:  GLLKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSII

Query:  VGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLG
        VGFA NGFADESLEFF AMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRVH++TP IEHYGCIVDLYGRAGRLEDA NVIEEMPMKPNEVVLG
Subjt:  VGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLG

Query:  SLLAACRTHGDVSIAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLF
        SLLAACRTHGDV +AERLMKH+FKLD  GDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKK G SSVEIDGKVHEFVAGDKYH DAD+IYSML+LLF
Subjt:  SLLAACRTHGDVSIAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLF

Query:  HELKICGYVPDTDIIMNTKESSKD
        HELK+CGYVPDTDII+NTK+S+KD
Subjt:  HELKICGYVPDTDIIMNTKESSKD

A0A5A7UJB6 Pentatricopeptide repeat-containing protein4.6e-27088.55Show/hide
Query:  MSSIPSHTAIPSQHQQYPNPPSPIPLSNPTNLNFPRSPNSSHHNISSKFTANSIDPIVQWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLS
        MSSIPSH A PSQ QQ P+  S IPLSNPT +NFPRSP S H NI SKFTANS+ PIVQWTSS+ARYC NGQL EAAAEFTRMRLAGVEPNH+TFITLLS
Subjt:  MSSIPSHTAIPSQHQQYPNPPSPIPLSNPTNLNFPRSPNSSHHNISSKFTANSIDPIVQWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLS

Query:  GCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALIN
        GCADFPSES FF SSLHGYA KFGLDT HVMVGTALIDMY+KC+QLGLA+KVFDYLGVKNSVSWNTML+G+ RNGEIELA+ LFDEMPTRDAISWTALIN
Subjt:  GCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALIN

Query:  GLLKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSII
        GLLKHGYSEQALECFHQMQ SG+  DYVSIIAVLAACADLGALT GLWVNRFVMQQEFKDN+RISNSLIDMYSRCGCIEFARQVF KM KRTLVSWNSII
Subjt:  GLLKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSII

Query:  VGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLG
        VGFA NGFADESLEFF AMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRVH++TP IEHYGCIVDLYGRAGRLEDA NVIEEMPMKPNEVVLG
Subjt:  VGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLG

Query:  SLLAACRTHGDVSIAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLF
        SLLAACRTHGDV +AERLMKH+FKLD  GDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKK G SSVEIDGKVHEFVAGDKYH DAD+IYSML+LLF
Subjt:  SLLAACRTHGDVSIAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLF

Query:  HELKICGYVPDTDIIMNTKESSKD
        HELK+CGYVPDTDII+NTK+S+KD
Subjt:  HELKICGYVPDTDIIMNTKESSKD

A0A6J1CN07 pentatricopeptide repeat-containing protein At1g05750, chloroplastic1.8e-27488.74Show/hide
Query:  MSSIPSHTAIPSQHQQYPNPPSPIPLSNPTNLNFPRSPNSSHHNISSKFTANSIDPIVQWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLS
        MSSIP++TA   Q QQYPNP + IPL NP  +NFPRS NSS+ +ISSK T NSIDPIV WTSS+ARYCRNGQL+EAAAEFT MRLAGVEPNHVT ITLLS
Subjt:  MSSIPSHTAIPSQHQQYPNPPSPIPLSNPTNLNFPRSPNSSHHNISSKFTANSIDPIVQWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLS

Query:  GCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALIN
        GCADFPSESL+FGSSLHGYARK GLDT HVMVGT+++DMYAKCAQLGLAR+VFDYL +KNSVSWNTMLDGYTRNGEIELA+DLFDEMPTRDAISWTALIN
Subjt:  GCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALIN

Query:  GLLKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSII
        GLLK GYSEQALECFHQMQCSGI+PDYVSIIAVLAACADLG LTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCI FARQVFE+M KRTLVSWNSII
Subjt:  GLLKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSII

Query:  VGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLG
        VG+A NGFADESLEFFDAMQKEGFKPD VSYTGALTACSHAGLVNKGLELFDNMKRVHR+ PRIEHYGCIVDLYGRAGRLEDAL+VIE+MPMKPNEVVLG
Subjt:  VGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLG

Query:  SLLAACRTHGDVSIAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLF
        SLLAACRTHGDVS+AERLMKHL KLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPG SSVEIDGKVHEFVAGDKYH DADSIYSML+LL 
Subjt:  SLLAACRTHGDVSIAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLF

Query:  HELKICGYVPDTDIIMNTKESSKD
        HELKICG VP+T+  +NTKESSKD
Subjt:  HELKICGYVPDTDIIMNTKESSKD

A0A6J1HU31 pentatricopeptide repeat-containing protein At1g05750, chloroplastic9.4e-26384.79Show/hide
Query:  MSSIPSHTAIPSQH--QQYPNPPSPIPLSNPTNLNFPRSPNSSHHNISSKFTANSIDPIVQWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITL
        MSS+PSHT IP Q   QQY NPPSPIP SNP+NL+FPR+PNSS          N I PIV WTSS+ARYCRN QL+EAAAEFTRMRLAGVEPNH+TFITL
Subjt:  MSSIPSHTAIPSQH--QQYPNPPSPIPLSNPTNLNFPRSPNSSHHNISSKFTANSIDPIVQWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITL

Query:  LSGCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTAL
        LSGCADFPS SL FG+SLHGY RK GLDT HVMVGTALI MYAKCAQLGLAR VFDYL +KNSV+WNTMLDGY RNGEIELA++LFDEMPTRDAISWTAL
Subjt:  LSGCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTAL

Query:  INGLLKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNS
        ING LK GYSEQALECFH+MQCSGIEPDYVSIIAVLAACADLGAL+ GLWVNRF+MQQEFKDNIRISNSLIDMYSRCGCIEFARQVF+KM K TLVSWNS
Subjt:  INGLLKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNS

Query:  IIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVV
        +IVGFA+NGFADESLEFFDAMQKEGF  DGVSYTGALTACSHAGLVNKGLELFDNMKRVHR+TPRIEHYGCIVDLY RAGRL++ALNVIE MPMKPNEVV
Subjt:  IIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVV

Query:  LGSLLAACRTHGDVSIAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLEL
        LGSLLAACRTHGDVS+AERL+K+LF+LDPGGDS+YVLLSNIYAA+G+WEGANKVRRTMKARGVQKKPG SS+EIDGKVHEFVAGDKYH DAD+IYSMLE+
Subjt:  LGSLLAACRTHGDVSIAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLEL

Query:  LFHELKICGYVPDTDIIMNTKESSKD
        LFHELKI GYVP+T   MN  ESSK+
Subjt:  LFHELKICGYVPDTDIIMNTKESSKD

SwissProt top hitse value%identityAlignment
O23337 Pentatricopeptide repeat-containing protein At4g148205.2e-10137.02Show/hide
Query:  IVQWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLSGCADFPSESLFFGSSLHGY--ARKFGLDTWHVMVGTALIDMYAKCAQLGLARKVFD
        +V W + + RYCR G + EA   F  M+ + V P+ +    ++S C    + ++ +  +++ +       +DT H++  TAL+ MYA    + +AR+ F 
Subjt:  IVQWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLSGCADFPSESLFFGSSLHGY--ARKFGLDTWHVMVGTALIDMYAKCAQLGLARKVFD

Query:  YLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVM
         + V+N      M+ GY++ G ++ A  +FD+   +D + WT +I+  ++  Y ++AL  F +M CSGI+PD VS+ +V++ACA+LG L    WV+  + 
Subjt:  YLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVM

Query:  QQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNM
            +  + I+N+LI+MY++CG ++  R VFEKMP+R +VSW+S+I   +++G A ++L  F  M++E  +P+ V++ G L  CSH+GLV +G ++F +M
Subjt:  QQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNM

Query:  KRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLGSLLAACRTHGDVSIAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRR
           + +TP++EHYGC+VDL+GRA  L +AL VIE MP+  N V+ GSL++ACR HG++ + +   K + +L+P  D   VL+SNIYA   +WE    +RR
Subjt:  KRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLGSLLAACRTHGDVSIAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRR

Query:  TMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLFHELKICGYVPDT-DIIMNTKESSK
         M+ + V K+ G S ++ +GK HEF+ GDK H  ++ IY+ L+ +  +LK+ GYVPD   ++++ +E  K
Subjt:  TMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLFHELKICGYVPDT-DIIMNTKESSK

O82380 Pentatricopeptide repeat-containing protein At2g29760, chloroplastic9.5e-10336.13Show/hide
Query:  FTANSIDPIVQWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLSGCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGL
        FT      +V W S +  + + G   +A   F +M    V+ +HVT + +LS CA     +L FG  +  Y  +  ++  ++ +  A++DMY KC  +  
Subjt:  FTANSIDPIVQWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLSGCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGL

Query:  ARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQC-SGIEPDYVSIIAVLAACADLGALTLGL
        A+++FD +  K++V+W TMLDGY  + + E A ++ + MP +D ++W ALI+   ++G   +AL  FH++Q    ++ + +++++ L+ACA +GAL LG 
Subjt:  ARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQC-SGIEPDYVSIIAVLAACADLGALTLGL

Query:  WVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKG
        W++ ++ +   + N  ++++LI MYS+CG +E +R+VF  + KR +  W+++I G A++G  +E+++ F  MQ+   KP+GV++T    ACSH GLV++ 
Subjt:  WVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKG

Query:  LELFDNMKRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLGSLLAACRTHGDVSIAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWE
          LF  M+  + + P  +HY CIVD+ GR+G LE A+  IE MP+ P+  V G+LL AC+ H ++++AE     L +L+P  D  +VLLSNIYA +GKWE
Subjt:  LELFDNMKRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLGSLLAACRTHGDVSIAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWE

Query:  GANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLFHELKICGYVPDTDIIMNTKESSK
          +++R+ M+  G++K+PGCSS+EIDG +HEF++GD  H  ++ +Y  L  +  +LK  GY P+   ++   E  +
Subjt:  GANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLFHELKICGYVPDTDIIMNTKESSK

Q9LUJ2 Pentatricopeptide repeat-containing protein At3g226905.7e-10839.91Show/hide
Query:  ARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLSGCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSW
        + Y R G   EA   F  M  +GV P+ ++ ++ +S C+     ++ +G S HGY  + G ++W   +  ALIDMY KC +   A ++FD +  K  V+W
Subjt:  ARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLSGCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSW

Query:  NTMLDGYTRNGEIELALDLFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQC-SGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIR
        N+++ GY  NGE++ A + F+ MP ++ +SW  +I+GL++    E+A+E F  MQ   G+  D V+++++ +AC  LGAL L  W+  ++ +   + ++R
Subjt:  NTMLDGYTRNGEIELALDLFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQC-SGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIR

Query:  ISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRVHRVTPR
        +  +L+DM+SRCG  E A  +F  +  R + +W + I   A+ G A+ ++E FD M ++G KPDGV++ GALTACSH GLV +G E+F +M ++H V+P 
Subjt:  ISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRVHRVTPR

Query:  IEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLGSLLAACRTHGDVSIAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQK
          HYGC+VDL GRAG LE+A+ +IE+MPM+PN+V+  SLLAACR  G+V +A    + +  L P    +YVLLSN+YA+ G+W    KVR +MK +G++K
Subjt:  IEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLGSLLAACRTHGDVSIAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQK

Query:  KPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLFHELKICGYVPD-TDIIMNTKESSK
         PG SS++I GK HEF +GD+ H +  +I +ML+ +       G+VPD ++++M+  E  K
Subjt:  KPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLFHELKICGYVPD-TDIIMNTKESSK

Q9MA50 Pentatricopeptide repeat-containing protein At1g05750, chloroplastic9.9e-16960.17Show/hide
Query:  SPNSSHHNISSKFTANSIDPIVQWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLSGCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTAL
        +P    HN S+  T       V WTS +    RNG+L+EAA EF+ M LAGVEPNH+TFI LLSGC DF S S   G  LHGYA K GLD  HVMVGTA+
Subjt:  SPNSSHHNISSKFTANSIDPIVQWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLSGCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTAL

Query:  IDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAA
        I MY+K  +   AR VFDY+  KNSV+WNTM+DGY R+G+++ A  +FD+MP RD ISWTA+ING +K GY E+AL  F +MQ SG++PDYV+IIA L A
Subjt:  IDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAA

Query:  CADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALT
        C +LGAL+ GLWV+R+V+ Q+FK+N+R+SNSLID+Y RCGC+EFARQVF  M KRT+VSWNS+IVGFA NG A ESL +F  MQ++GFKPD V++TGALT
Subjt:  CADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALT

Query:  ACSHAGLVNKGLELFDNMKRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLGSLLAACRTHG-DVSIAERLMKHLFKLDPGGDSNYVL
        ACSH GLV +GL  F  MK  +R++PRIEHYGC+VDLY RAGRLEDAL +++ MPMKPNEVV+GSLLAAC  HG ++ +AERLMKHL  L+    SNYV+
Subjt:  ACSHAGLVNKGLELFDNMKRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLGSLLAACRTHG-DVSIAERLMKHLFKLDPGGDSNYVL

Query:  LSNIYAAIGKWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLFHELKICGYVPDT
        LSN+YAA GKWEGA+K+RR MK  G++K+PG SS+EID  +H F+AGD  H +   I  +LEL+  +L++ G V +T
Subjt:  LSNIYAAIGKWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLFHELKICGYVPDT

Q9SIT7 Pentatricopeptide repeat-containing protein At2g136008.9e-10139.87Show/hide
Query:  IVQWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLSGCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGLARKVFDYL
        +V W S +  + +NG   EA   F  M  + VEP+ VT  +++S CA     ++  G  +HG   K       +++  A +DMYAKC+++  AR +FD +
Subjt:  IVQWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLSGCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGLARKVFDYL

Query:  GVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQ
         ++N ++  +M+ GY      + A  +F +M  R+ +SW ALI G  ++G +E+AL  F  ++   + P + S   +L ACADL  L LG+  +  V++ 
Subjt:  GVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQ

Query:  EFK------DNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLEL
         FK      D+I + NSLIDMY +CGC+E    VF KM +R  VSWN++I+GFA NG+ +E+LE F  M + G KPD ++  G L+AC HAG V +G   
Subjt:  EFK------DNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLEL

Query:  FDNMKRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLGSLLAACRTHGDVSIAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGAN
        F +M R   V P  +HY C+VDL GRAG LE+A ++IEEMPM+P+ V+ GSLLAAC+ H ++++ + + + L +++P     YVLLSN+YA +GKWE   
Subjt:  FDNMKRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLGSLLAACRTHGDVSIAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGAN

Query:  KVRRTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLFHELK
         VR++M+  GV K+PGCS ++I G  H F+  DK H     I+S+L++L  E++
Subjt:  KVRRTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLFHELK

Arabidopsis top hitse value%identityAlignment
AT1G05750.1 Tetratricopeptide repeat (TPR)-like superfamily protein7.1e-17060.17Show/hide
Query:  SPNSSHHNISSKFTANSIDPIVQWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLSGCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTAL
        +P    HN S+  T       V WTS +    RNG+L+EAA EF+ M LAGVEPNH+TFI LLSGC DF S S   G  LHGYA K GLD  HVMVGTA+
Subjt:  SPNSSHHNISSKFTANSIDPIVQWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLSGCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTAL

Query:  IDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAA
        I MY+K  +   AR VFDY+  KNSV+WNTM+DGY R+G+++ A  +FD+MP RD ISWTA+ING +K GY E+AL  F +MQ SG++PDYV+IIA L A
Subjt:  IDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAA

Query:  CADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALT
        C +LGAL+ GLWV+R+V+ Q+FK+N+R+SNSLID+Y RCGC+EFARQVF  M KRT+VSWNS+IVGFA NG A ESL +F  MQ++GFKPD V++TGALT
Subjt:  CADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALT

Query:  ACSHAGLVNKGLELFDNMKRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLGSLLAACRTHG-DVSIAERLMKHLFKLDPGGDSNYVL
        ACSH GLV +GL  F  MK  +R++PRIEHYGC+VDLY RAGRLEDAL +++ MPMKPNEVV+GSLLAAC  HG ++ +AERLMKHL  L+    SNYV+
Subjt:  ACSHAGLVNKGLELFDNMKRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLGSLLAACRTHG-DVSIAERLMKHLFKLDPGGDSNYVL

Query:  LSNIYAAIGKWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLFHELKICGYVPDT
        LSN+YAA GKWEGA+K+RR MK  G++K+PG SS+EID  +H F+AGD  H +   I  +LEL+  +L++ G V +T
Subjt:  LSNIYAAIGKWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLFHELKICGYVPDT

AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein6.7e-10436.13Show/hide
Query:  FTANSIDPIVQWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLSGCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGL
        FT      +V W S +  + + G   +A   F +M    V+ +HVT + +LS CA     +L FG  +  Y  +  ++  ++ +  A++DMY KC  +  
Subjt:  FTANSIDPIVQWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLSGCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGL

Query:  ARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQC-SGIEPDYVSIIAVLAACADLGALTLGL
        A+++FD +  K++V+W TMLDGY  + + E A ++ + MP +D ++W ALI+   ++G   +AL  FH++Q    ++ + +++++ L+ACA +GAL LG 
Subjt:  ARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQC-SGIEPDYVSIIAVLAACADLGALTLGL

Query:  WVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKG
        W++ ++ +   + N  ++++LI MYS+CG +E +R+VF  + KR +  W+++I G A++G  +E+++ F  MQ+   KP+GV++T    ACSH GLV++ 
Subjt:  WVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKG

Query:  LELFDNMKRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLGSLLAACRTHGDVSIAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWE
          LF  M+  + + P  +HY CIVD+ GR+G LE A+  IE MP+ P+  V G+LL AC+ H ++++AE     L +L+P  D  +VLLSNIYA +GKWE
Subjt:  LELFDNMKRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLGSLLAACRTHGDVSIAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWE

Query:  GANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLFHELKICGYVPDTDIIMNTKESSK
          +++R+ M+  G++K+PGCSS+EIDG +HEF++GD  H  ++ +Y  L  +  +LK  GY P+   ++   E  +
Subjt:  GANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLFHELKICGYVPDTDIIMNTKESSK

AT3G22690.1 CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012881), Pentatricopeptide repeat (InterPro:IPR002885)4.1e-10939.91Show/hide
Query:  ARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLSGCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSW
        + Y R G   EA   F  M  +GV P+ ++ ++ +S C+     ++ +G S HGY  + G ++W   +  ALIDMY KC +   A ++FD +  K  V+W
Subjt:  ARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLSGCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSW

Query:  NTMLDGYTRNGEIELALDLFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQC-SGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIR
        N+++ GY  NGE++ A + F+ MP ++ +SW  +I+GL++    E+A+E F  MQ   G+  D V+++++ +AC  LGAL L  W+  ++ +   + ++R
Subjt:  NTMLDGYTRNGEIELALDLFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQC-SGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIR

Query:  ISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRVHRVTPR
        +  +L+DM+SRCG  E A  +F  +  R + +W + I   A+ G A+ ++E FD M ++G KPDGV++ GALTACSH GLV +G E+F +M ++H V+P 
Subjt:  ISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRVHRVTPR

Query:  IEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLGSLLAACRTHGDVSIAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQK
          HYGC+VDL GRAG LE+A+ +IE+MPM+PN+V+  SLLAACR  G+V +A    + +  L P    +YVLLSN+YA+ G+W    KVR +MK +G++K
Subjt:  IEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLGSLLAACRTHGDVSIAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQK

Query:  KPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLFHELKICGYVPD-TDIIMNTKESSK
         PG SS++I GK HEF +GD+ H +  +I +ML+ +       G+VPD ++++M+  E  K
Subjt:  KPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLFHELKICGYVPD-TDIIMNTKESSK

AT3G22690.2 INVOLVED IN: photosystem II assembly, regulation of chlorophyll biosynthetic process, photosystem I assembly, thylakoid membrane organization, RNA modification4.1e-10939.91Show/hide
Query:  ARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLSGCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSW
        + Y R G   EA   F  M  +GV P+ ++ ++ +S C+     ++ +G S HGY  + G ++W   +  ALIDMY KC +   A ++FD +  K  V+W
Subjt:  ARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLSGCADFPSESLFFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSW

Query:  NTMLDGYTRNGEIELALDLFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQC-SGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIR
        N+++ GY  NGE++ A + F+ MP ++ +SW  +I+GL++    E+A+E F  MQ   G+  D V+++++ +AC  LGAL L  W+  ++ +   + ++R
Subjt:  NTMLDGYTRNGEIELALDLFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQC-SGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIR

Query:  ISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRVHRVTPR
        +  +L+DM+SRCG  E A  +F  +  R + +W + I   A+ G A+ ++E FD M ++G KPDGV++ GALTACSH GLV +G E+F +M ++H V+P 
Subjt:  ISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRVHRVTPR

Query:  IEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLGSLLAACRTHGDVSIAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQK
          HYGC+VDL GRAG LE+A+ +IE+MPM+PN+V+  SLLAACR  G+V +A    + +  L P    +YVLLSN+YA+ G+W    KVR +MK +G++K
Subjt:  IEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLGSLLAACRTHGDVSIAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQK

Query:  KPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLFHELKICGYVPD-TDIIMNTKESSK
         PG SS++I GK HEF +GD+ H +  +I +ML+ +       G+VPD ++++M+  E  K
Subjt:  KPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLFHELKICGYVPD-TDIIMNTKESSK

AT4G14820.1 Pentatricopeptide repeat (PPR) superfamily protein3.7e-10237.02Show/hide
Query:  IVQWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLSGCADFPSESLFFGSSLHGY--ARKFGLDTWHVMVGTALIDMYAKCAQLGLARKVFD
        +V W + + RYCR G + EA   F  M+ + V P+ +    ++S C    + ++ +  +++ +       +DT H++  TAL+ MYA    + +AR+ F 
Subjt:  IVQWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLSGCADFPSESLFFGSSLHGY--ARKFGLDTWHVMVGTALIDMYAKCAQLGLARKVFD

Query:  YLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVM
         + V+N      M+ GY++ G ++ A  +FD+   +D + WT +I+  ++  Y ++AL  F +M CSGI+PD VS+ +V++ACA+LG L    WV+  + 
Subjt:  YLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVM

Query:  QQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNM
            +  + I+N+LI+MY++CG ++  R VFEKMP+R +VSW+S+I   +++G A ++L  F  M++E  +P+ V++ G L  CSH+GLV +G ++F +M
Subjt:  QQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNM

Query:  KRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLGSLLAACRTHGDVSIAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRR
           + +TP++EHYGC+VDL+GRA  L +AL VIE MP+  N V+ GSL++ACR HG++ + +   K + +L+P  D   VL+SNIYA   +WE    +RR
Subjt:  KRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLGSLLAACRTHGDVSIAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRR

Query:  TMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLFHELKICGYVPDT-DIIMNTKESSK
         M+ + V K+ G S ++ +GK HEF+ GDK H  ++ IY+ L+ +  +LK+ GYVPD   ++++ +E  K
Subjt:  TMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLFHELKICGYVPDT-DIIMNTKESSK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCAGCATTCCTTCTCACACTGCCATTCCATCACAACACCAACAATATCCCAATCCGCCTTCTCCAATCCCACTTTCAAATCCAACAAACCTCAACTTCCCCCGCTC
TCCCAATTCCTCACATCACAATATCTCCTCCAAATTCACCGCCAATTCTATTGACCCCATTGTTCAATGGACCTCTTCTCTTGCTCGCTACTGTCGCAATGGCCAATTAT
CCGAAGCCGCCGCAGAGTTTACACGCATGAGACTCGCCGGAGTTGAGCCAAACCACGTCACATTCATTACCCTTCTCTCCGGCTGTGCTGATTTTCCATCAGAAAGCCTC
TTCTTCGGCTCTTCCCTTCATGGCTACGCCCGTAAATTTGGCTTGGATACATGGCATGTAATGGTGGGGACTGCTTTGATTGATATGTATGCCAAATGTGCTCAATTGGG
TCTTGCTAGGAAAGTTTTTGATTACCTAGGCGTGAAGAACTCTGTCTCTTGGAACACGATGCTCGATGGTTACACAAGGAATGGAGAGATTGAATTGGCACTTGACCTGT
TTGATGAAATGCCTACAAGAGATGCGATTTCTTGGACGGCTTTGATTAACGGTCTCTTGAAACATGGGTACTCTGAACAAGCATTGGAGTGCTTCCATCAGATGCAATGC
TCCGGTATCGAGCCTGATTATGTGTCTATAATTGCTGTTCTCGCTGCGTGTGCTGATTTGGGCGCGCTTACTTTAGGGTTATGGGTTAATAGGTTTGTTATGCAGCAGGA
GTTTAAGGATAATATTAGGATAAGTAATTCCTTGATTGATATGTATTCTCGATGTGGATGTATTGAGTTTGCCCGCCAAGTGTTTGAGAAAATGCCCAAGCGAACTTTGG
TATCTTGGAACTCTATCATTGTGGGGTTTGCAGTTAATGGATTTGCTGATGAATCTCTGGAGTTCTTTGATGCAATGCAGAAGGAAGGATTCAAGCCAGATGGAGTAAGC
TACACGGGAGCTCTTACTGCATGTAGCCATGCTGGCTTAGTGAATAAGGGCCTCGAATTGTTTGATAACATGAAGAGGGTACACAGAGTTACTCCCAGGATTGAGCATTA
TGGATGTATTGTCGACCTCTATGGTCGTGCAGGGAGGTTAGAGGATGCATTGAATGTGATTGAGGAAATGCCGATGAAACCGAATGAAGTTGTGTTGGGGTCGTTGTTGG
CTGCTTGCAGGACTCATGGTGATGTGAGCATAGCTGAAAGGTTAATGAAACATCTCTTTAAGTTGGATCCAGGAGGCGATTCAAATTATGTGCTTCTTTCAAACATATAT
GCAGCAATTGGAAAGTGGGAAGGTGCTAACAAGGTTAGGCGAACGATGAAAGCTCGAGGCGTGCAGAAAAAACCGGGTTGTAGTTCTGTTGAAATTGATGGTAAGGTTCA
TGAGTTTGTTGCTGGTGACAAATACCATACTGATGCAGACAGTATTTACTCAATGTTGGAGCTGTTGTTTCATGAACTAAAGATATGTGGCTATGTTCCTGATACTGATA
TCATTATGAATACCAAAGAATCTAGTAAAGATAGTTGA
mRNA sequenceShow/hide mRNA sequence
TATTAAATGGGTCATTCATCATAGAAATCCGTTCCCGCTCTATCCCAAACGGCTGTTGGAACGATGAGCAGCATTCCTTCTCACACTGCCATTCCATCACAACACCAACA
ATATCCCAATCCGCCTTCTCCAATCCCACTTTCAAATCCAACAAACCTCAACTTCCCCCGCTCTCCCAATTCCTCACATCACAATATCTCCTCCAAATTCACCGCCAATT
CTATTGACCCCATTGTTCAATGGACCTCTTCTCTTGCTCGCTACTGTCGCAATGGCCAATTATCCGAAGCCGCCGCAGAGTTTACACGCATGAGACTCGCCGGAGTTGAG
CCAAACCACGTCACATTCATTACCCTTCTCTCCGGCTGTGCTGATTTTCCATCAGAAAGCCTCTTCTTCGGCTCTTCCCTTCATGGCTACGCCCGTAAATTTGGCTTGGA
TACATGGCATGTAATGGTGGGGACTGCTTTGATTGATATGTATGCCAAATGTGCTCAATTGGGTCTTGCTAGGAAAGTTTTTGATTACCTAGGCGTGAAGAACTCTGTCT
CTTGGAACACGATGCTCGATGGTTACACAAGGAATGGAGAGATTGAATTGGCACTTGACCTGTTTGATGAAATGCCTACAAGAGATGCGATTTCTTGGACGGCTTTGATT
AACGGTCTCTTGAAACATGGGTACTCTGAACAAGCATTGGAGTGCTTCCATCAGATGCAATGCTCCGGTATCGAGCCTGATTATGTGTCTATAATTGCTGTTCTCGCTGC
GTGTGCTGATTTGGGCGCGCTTACTTTAGGGTTATGGGTTAATAGGTTTGTTATGCAGCAGGAGTTTAAGGATAATATTAGGATAAGTAATTCCTTGATTGATATGTATT
CTCGATGTGGATGTATTGAGTTTGCCCGCCAAGTGTTTGAGAAAATGCCCAAGCGAACTTTGGTATCTTGGAACTCTATCATTGTGGGGTTTGCAGTTAATGGATTTGCT
GATGAATCTCTGGAGTTCTTTGATGCAATGCAGAAGGAAGGATTCAAGCCAGATGGAGTAAGCTACACGGGAGCTCTTACTGCATGTAGCCATGCTGGCTTAGTGAATAA
GGGCCTCGAATTGTTTGATAACATGAAGAGGGTACACAGAGTTACTCCCAGGATTGAGCATTATGGATGTATTGTCGACCTCTATGGTCGTGCAGGGAGGTTAGAGGATG
CATTGAATGTGATTGAGGAAATGCCGATGAAACCGAATGAAGTTGTGTTGGGGTCGTTGTTGGCTGCTTGCAGGACTCATGGTGATGTGAGCATAGCTGAAAGGTTAATG
AAACATCTCTTTAAGTTGGATCCAGGAGGCGATTCAAATTATGTGCTTCTTTCAAACATATATGCAGCAATTGGAAAGTGGGAAGGTGCTAACAAGGTTAGGCGAACGAT
GAAAGCTCGAGGCGTGCAGAAAAAACCGGGTTGTAGTTCTGTTGAAATTGATGGTAAGGTTCATGAGTTTGTTGCTGGTGACAAATACCATACTGATGCAGACAGTATTT
ACTCAATGTTGGAGCTGTTGTTTCATGAACTAAAGATATGTGGCTATGTTCCTGATACTGATATCATTATGAATACCAAAGAATCTAGTAAAGATAGTTGAAGCTTATTT
TGTGAGTTATTCGATCTAGTAAATCTCTTATTTTAGTAAATCCCTTATTTTATT
Protein sequenceShow/hide protein sequence
MSSIPSHTAIPSQHQQYPNPPSPIPLSNPTNLNFPRSPNSSHHNISSKFTANSIDPIVQWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHVTFITLLSGCADFPSESL
FFGSSLHGYARKFGLDTWHVMVGTALIDMYAKCAQLGLARKVFDYLGVKNSVSWNTMLDGYTRNGEIELALDLFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQC
SGIEPDYVSIIAVLAACADLGALTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVS
YTGALTACSHAGLVNKGLELFDNMKRVHRVTPRIEHYGCIVDLYGRAGRLEDALNVIEEMPMKPNEVVLGSLLAACRTHGDVSIAERLMKHLFKLDPGGDSNYVLLSNIY
AAIGKWEGANKVRRTMKARGVQKKPGCSSVEIDGKVHEFVAGDKYHTDADSIYSMLELLFHELKICGYVPDTDIIMNTKESSKDS