CuGenDBv2

Spg019955 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg019955
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: scaffold5:30563381..30564946
RNA-Seq Expression: Spg019955
Synteny: Spg019955
Gene Ontology terms:
GO:0009451 - RNA modification (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domains:
IPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
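
The genome location above follows a "sequence:start..end" pattern. A minimal sketch of reading such a string and deriving the span length is given below. The names `Location` and `parse_location` are hypothetical and not part of CuGenDB, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval as written in the record (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive at both ends.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string into a Location."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError(f"start is greater than end in: {text!r}")
    return Location(seqid, start, end)


if __name__ == "__main__":
    loc = parse_location("scaffold5:30563381..30564946")
    print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the interval for Spg019955 spans 1566 bp on scaffold5.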


Homology
GenBank top hits | e value | % identity | Alignment
XP_004139593.1 pentatricopeptide repeat-containing protein At1g05750, chloroplastic [Cucumis sativus] | 2.9e-258 | 84.64
Query:  MSSVPAQTAIPTQLQQYPNSPSPIPLSNSAQINFPRSANSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLS
        MSS+P+ TA P+QLQ  P +PS IPLSN  ++NFPRS NS +RNISSK   NS+DPIVLWTSS+ARYCRNGQL++AA EFTRMRLAGVEPNHITFITLLS
Subjt:  MSSVPAQTAIPTQLQQYPNSPSPIPLSNSAQINFPRSANSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLS

Query:  GCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCAQLGPARKVFDCLGMKNSVSWNTLLDAYMRNGEIELAIDLFDEMPTRDAISWTALIN
         CADFPSES FF SSLHGYA K+GLDTGHVMVGTALIDMY+KCAQLG ARKVF  LG+KNSVSWNT+L+ +MRNGEIELAI LFDEMPTRDAISWTALIN
Subjt:  GCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCAQLGPARKVFDCLGMKNSVSWNTLLDAYMRNGEIELAIDLFDEMPTRDAISWTALIN

Query:  GLVKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGTLTLGLWVNQFVMQQEFKDNIRISNSLIDMYSRCGSIEFARQVFEKMPKRTLVSWNSII
        GL+K GYSEQALECFHQMQ SG+  DYVSIIAVLAACADLG LTLGLWV++FVM QEFKDNI+ISNSLIDMYSRCG IEFARQVF KM KRTLVSWNSII
Subjt:  GLVKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGTLTLGLWVNQFVMQQEFKDNIRISNSLIDMYSRCGSIEFARQVFEKMPKRTLVSWNSII

Query:  VGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGPKLFDNMKRVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPMKPNEVVLG
        VGFAVNGFADESLEFF AMQKEGFKPDGVSYTGALTACSHAGLVNKG +LFDNMK VHKITPRIEHYGCIVDLYGRAGRLEDALN+IE MPMKPNEVVLG
Subjt:  VGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGPKLFDNMKRVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPMKPNEVVLG

Query:  SLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLY
        SLLAACRTHGDV LAERLMKHLFKLDP GD+ YVLLSNIYAAIGKW+GAN VRRTMKARGVQKKPG+SS+EIDGKVHEFVAGD YHA+AD IYSML+LL 
Subjt:  SLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLY

Query:  HELKICGYAPETDTFMNPKES
        HELK+CGY P +DT +N KES
Subjt:  HELKICGYAPETDTFMNPKES

XP_008458940.1 PREDICTED: pentatricopeptide repeat-containing protein At1g05750, chloroplastic [Cucumis melo] | 1.6e-253 | 84.26
Query:  MSSVPAQTAIPTQLQQYPNSPSPIPLSNSAQINFPRSANSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLS
        MSS+P+  A P+QLQQ P+  S IPLSN  ++NFPRS  S + NI SK T NS+ PIV WTSSIARYC NGQL +AA EFTRMRLAGVEPNHITFITLLS
Subjt:  MSSVPAQTAIPTQLQQYPNSPSPIPLSNSAQINFPRSANSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLS

Query:  GCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCAQLGPARKVFDCLGMKNSVSWNTLLDAYMRNGEIELAIDLFDEMPTRDAISWTALIN
        GCADFPSES FF SSLHGYA KFGLDTGHVMVGTALIDMY+KC+QLG A+KVFD LG+KNSVSWNT+L+ +MRNGEIELAI LFDEMPTRDAISWTALIN
Subjt:  GCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCAQLGPARKVFDCLGMKNSVSWNTLLDAYMRNGEIELAIDLFDEMPTRDAISWTALIN

Query:  GLVKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGTLTLGLWVNQFVMQQEFKDNIRISNSLIDMYSRCGSIEFARQVFEKMPKRTLVSWNSII
        GL+K GYSEQALECFHQMQ SG+  DYVSIIAVLAACADLG LT GLWVN+FVMQQEFKDN+RISNSLIDMYSRCG IEFARQVF KM KRTLVSWNSII
Subjt:  GLVKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGTLTLGLWVNQFVMQQEFKDNIRISNSLIDMYSRCGSIEFARQVFEKMPKRTLVSWNSII

Query:  VGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGPKLFDNMKRVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPMKPNEVVLG
        VGFA NGFADESLEFF AMQKEGFKPDGVSYTGALTACSHAGLVNKG +LFDNMKRVHKITP IEHYGCIVDLYGRAGRLEDA NVIE MPMKPNEVVLG
Subjt:  VGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGPKLFDNMKRVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPMKPNEVVLG

Query:  SLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLY
        SLLAACRTHGDVRLAERLMKH+FKLD  GDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKK G+SS+EIDGKVHEFVAGDKYHA+AD IYSML+LL+
Subjt:  SLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLY

Query:  HELKICGYAPETDTFMNPKES
        HELK+CGY P+TD  +N K+S
Subjt:  HELKICGYAPETDTFMNPKES

XP_022142716.1 pentatricopeptide repeat-containing protein At1g05750, chloroplastic [Momordica charantia] | 8.1e-269 | 87.91
Query:  MSSVPAQTAIPTQLQQYPNSPSPIPLSNSAQINFPRSANSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLS
        MSS+PA TA   QLQQYPN  + IPL N   INFPRS NSSNR+ISSKST NSIDPIVLWTSSIARYCRNGQLA+AA EFT MRLAGVEPNH+T ITLLS
Subjt:  MSSVPAQTAIPTQLQQYPNSPSPIPLSNSAQINFPRSANSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLS

Query:  GCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCAQLGPARKVFDCLGMKNSVSWNTLLDAYMRNGEIELAIDLFDEMPTRDAISWTALIN
        GCADFPSESL+FGSSLHGYARK GLDT HVMVGT+++DMYAKCAQLG AR+VFD L MKNSVSWNT+LD Y RNGEIELAIDLFDEMPTRDAISWTALIN
Subjt:  GCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCAQLGPARKVFDCLGMKNSVSWNTLLDAYMRNGEIELAIDLFDEMPTRDAISWTALIN

Query:  GLVKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGTLTLGLWVNQFVMQQEFKDNIRISNSLIDMYSRCGSIEFARQVFEKMPKRTLVSWNSII
        GL+KQGYSEQALECFHQMQCSGI+PDYVSIIAVLAACADLGTLTLGLWVN+FVMQQEFKDNIRISNSLIDMYSRCG I FARQVFE+M KRTLVSWNSII
Subjt:  GLVKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGTLTLGLWVNQFVMQQEFKDNIRISNSLIDMYSRCGSIEFARQVFEKMPKRTLVSWNSII

Query:  VGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGPKLFDNMKRVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPMKPNEVVLG
        VG+A NGFADESLEFFDAMQKEGFKPD VSYTGALTACSHAGLVNKG +LFDNMKRVH+I PRIEHYGCIVDLYGRAGRLEDAL+VIE+MPMKPNEVVLG
Subjt:  VGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGPKLFDNMKRVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPMKPNEVVLG

Query:  SLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLY
        SLLAACRTHGDV LAERLMKHL KLDP GDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSS+EIDGKVHEFVAGDKYHA+AD IYSML+LL 
Subjt:  SLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLY

Query:  HELKICGYAPETDTFMNPKES
        HELKICG  PET+TF+N KES
Subjt:  HELKICGYAPETDTFMNPKES

XP_022967078.1 pentatricopeptide repeat-containing protein At1g05750, chloroplastic [Cucurbita maxima] | 7.3e-254 | 83.56
Query:  MSSVPAQTAIP--TQLQQYPNSPSPIPLSNSAQINFPRSANSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITL
        MSSVP+ T IP   QLQQY N PSPIP SN + ++FPR+ NSS          N I PIVLWTSSIARYCRN QLA+AA EFTRMRLAGVEPNHITFITL
Subjt:  MSSVPAQTAIP--TQLQQYPNSPSPIPLSNSAQINFPRSANSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITL

Query:  LSGCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCAQLGPARKVFDCLGMKNSVSWNTLLDAYMRNGEIELAIDLFDEMPTRDAISWTAL
        LSGCADFPS SL FG+SLHGY RK GLDTGHVMVGTALI MYAKCAQLG AR VFD L MKNSV+WNT+LD YMRNGEIELAI+LFDEMPTRDAISWTAL
Subjt:  LSGCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCAQLGPARKVFDCLGMKNSVSWNTLLDAYMRNGEIELAIDLFDEMPTRDAISWTAL

Query:  INGLVKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGTLTLGLWVNQFVMQQEFKDNIRISNSLIDMYSRCGSIEFARQVFEKMPKRTLVSWNS
        ING +KQGYSEQALECFH+MQCSGIEPDYVSIIAVLAACADLG L+ GLWVN+F+MQQEFKDNIRISNSLIDMYSRCG IEFARQVF+KM K TLVSWNS
Subjt:  INGLVKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGTLTLGLWVNQFVMQQEFKDNIRISNSLIDMYSRCGSIEFARQVFEKMPKRTLVSWNS

Query:  IIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGPKLFDNMKRVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPMKPNEVV
        +IVGFA+NGFADESLEFFDAMQKEGF  DGVSYTGALTACSHAGLVNKG +LFDNMKRVH+ITPRIEHYGCIVDLY RAGRL++ALNVIE MPMKPNEVV
Subjt:  IIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGPKLFDNMKRVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPMKPNEVV

Query:  LGSLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLEL
        LGSLLAACRTHGDV LAERL+K+LF+LDP GDS+YVLLSNIYAA+G+WEGANKVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYH +AD IYSMLE+
Subjt:  LGSLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLEL

Query:  LYHELKICGYAPETDTFMNPKES
        L+HELKI GY PET TFMN  ES
Subjt:  LYHELKICGYAPETDTFMNPKES

XP_038877228.1 pentatricopeptide repeat-containing protein At1g05750, chloroplastic [Benincasa hispida] | 4.6e-272 | 88.65
Query:  MSSVPAQTAIPTQLQQYPNSPSPIPLSNSAQINFPRSANSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLS
        MSS P+ TAIP+QLQQYPN PS IPLSN  ++NFPRS NSS+RNISSK   NSIDPIVLWTSS+ARYCRNGQL++AATEFTRMRLAGVEPNH+TFITLLS
Subjt:  MSSVPAQTAIPTQLQQYPNSPSPIPLSNSAQINFPRSANSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLS

Query:  GCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCAQLGPARKVFDCLGMKNSVSWNTLLDAYMRNGEIELAIDLFDEMPTRDAISWTALIN
        GC DFPSESLFFGSSLHGYARK GLDTGHVMVGTAL+DMYAKCAQ   ARKVFD LGMKNSV+WNT+LD Y RNGEIELAIDLFDEMPTRDAISWTALIN
Subjt:  GCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCAQLGPARKVFDCLGMKNSVSWNTLLDAYMRNGEIELAIDLFDEMPTRDAISWTALIN

Query:  GLVKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGTLTLGLWVNQFVMQQEFKDNIRISNSLIDMYSRCGSIEFARQVFEKMPKRTLVSWNSII
        GL+KQG+SEQALECFHQMQCSGIEPDYVSIIAVLAACADLG LTLGLWVN+FVMQQEFKDNIRISNSL+DMYSRCG IEFARQVFEKMPKRTLVSWNSII
Subjt:  GLVKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGTLTLGLWVNQFVMQQEFKDNIRISNSLIDMYSRCGSIEFARQVFEKMPKRTLVSWNSII

Query:  VGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGPKLFDNMKRVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPMKPNEVVLG
        VGFAVNGFADESLEFFDAMQ EGFKPDGVSYTGALTACSHAGLVNKG +LFDNMKR+HKITPRIEHYGCIVDLYGRAGRLEDALNVIE MPMKPNEVVLG
Subjt:  VGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGPKLFDNMKRVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPMKPNEVVLG

Query:  SLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLY
        SLLAACRT+GDV LAE+LMKHL KLDPRGDSNYVLLSNIYAAIG+WEGANKVRRTMKARGVQKKPG SS+EIDGKVHEFVAGDKYHA+AD IYSMLELL+
Subjt:  SLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLY

Query:  HELKICGYAPETDTFMNPKE
        HELKI GY P+T+  +N KE
Subjt:  HELKICGYAPETDTFMNPKE

TrEMBL top hits | e value | % identity | Alignment
A0A0A0LYD6 Uncharacterized protein | 1.4e-258 | 84.64
Query:  MSSVPAQTAIPTQLQQYPNSPSPIPLSNSAQINFPRSANSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLS
        MSS+P+ TA P+QLQ  P +PS IPLSN  ++NFPRS NS +RNISSK   NS+DPIVLWTSS+ARYCRNGQL++AA EFTRMRLAGVEPNHITFITLLS
Subjt:  MSSVPAQTAIPTQLQQYPNSPSPIPLSNSAQINFPRSANSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLS

Query:  GCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCAQLGPARKVFDCLGMKNSVSWNTLLDAYMRNGEIELAIDLFDEMPTRDAISWTALIN
         CADFPSES FF SSLHGYA K+GLDTGHVMVGTALIDMY+KCAQLG ARKVF  LG+KNSVSWNT+L+ +MRNGEIELAI LFDEMPTRDAISWTALIN
Subjt:  GCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCAQLGPARKVFDCLGMKNSVSWNTLLDAYMRNGEIELAIDLFDEMPTRDAISWTALIN

Query:  GLVKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGTLTLGLWVNQFVMQQEFKDNIRISNSLIDMYSRCGSIEFARQVFEKMPKRTLVSWNSII
        GL+K GYSEQALECFHQMQ SG+  DYVSIIAVLAACADLG LTLGLWV++FVM QEFKDNI+ISNSLIDMYSRCG IEFARQVF KM KRTLVSWNSII
Subjt:  GLVKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGTLTLGLWVNQFVMQQEFKDNIRISNSLIDMYSRCGSIEFARQVFEKMPKRTLVSWNSII

Query:  VGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGPKLFDNMKRVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPMKPNEVVLG
        VGFAVNGFADESLEFF AMQKEGFKPDGVSYTGALTACSHAGLVNKG +LFDNMK VHKITPRIEHYGCIVDLYGRAGRLEDALN+IE MPMKPNEVVLG
Subjt:  VGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGPKLFDNMKRVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPMKPNEVVLG

Query:  SLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLY
        SLLAACRTHGDV LAERLMKHLFKLDP GD+ YVLLSNIYAAIGKW+GAN VRRTMKARGVQKKPG+SS+EIDGKVHEFVAGD YHA+AD IYSML+LL 
Subjt:  SLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLY

Query:  HELKICGYAPETDTFMNPKES
        HELK+CGY P +DT +N KES
Subjt:  HELKICGYAPETDTFMNPKES

A0A1S3C956 pentatricopeptide repeat-containing protein At1g05750, chloroplastic | 7.9e-254 | 84.26
Query:  MSSVPAQTAIPTQLQQYPNSPSPIPLSNSAQINFPRSANSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLS
        MSS+P+  A P+QLQQ P+  S IPLSN  ++NFPRS  S + NI SK T NS+ PIV WTSSIARYC NGQL +AA EFTRMRLAGVEPNHITFITLLS
Subjt:  MSSVPAQTAIPTQLQQYPNSPSPIPLSNSAQINFPRSANSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLS

Query:  GCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCAQLGPARKVFDCLGMKNSVSWNTLLDAYMRNGEIELAIDLFDEMPTRDAISWTALIN
        GCADFPSES FF SSLHGYA KFGLDTGHVMVGTALIDMY+KC+QLG A+KVFD LG+KNSVSWNT+L+ +MRNGEIELAI LFDEMPTRDAISWTALIN
Subjt:  GCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCAQLGPARKVFDCLGMKNSVSWNTLLDAYMRNGEIELAIDLFDEMPTRDAISWTALIN

Query:  GLVKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGTLTLGLWVNQFVMQQEFKDNIRISNSLIDMYSRCGSIEFARQVFEKMPKRTLVSWNSII
        GL+K GYSEQALECFHQMQ SG+  DYVSIIAVLAACADLG LT GLWVN+FVMQQEFKDN+RISNSLIDMYSRCG IEFARQVF KM KRTLVSWNSII
Subjt:  GLVKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGTLTLGLWVNQFVMQQEFKDNIRISNSLIDMYSRCGSIEFARQVFEKMPKRTLVSWNSII

Query:  VGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGPKLFDNMKRVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPMKPNEVVLG
        VGFA NGFADESLEFF AMQKEGFKPDGVSYTGALTACSHAGLVNKG +LFDNMKRVHKITP IEHYGCIVDLYGRAGRLEDA NVIE MPMKPNEVVLG
Subjt:  VGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGPKLFDNMKRVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPMKPNEVVLG

Query:  SLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLY
        SLLAACRTHGDVRLAERLMKH+FKLD  GDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKK G+SS+EIDGKVHEFVAGDKYHA+AD IYSML+LL+
Subjt:  SLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLY

Query:  HELKICGYAPETDTFMNPKES
        HELK+CGY P+TD  +N K+S
Subjt:  HELKICGYAPETDTFMNPKES

A0A5A7UJB6 Pentatricopeptide repeat-containing protein | 7.9e-254 | 84.26
Query:  MSSVPAQTAIPTQLQQYPNSPSPIPLSNSAQINFPRSANSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLS
        MSS+P+  A P+QLQQ P+  S IPLSN  ++NFPRS  S + NI SK T NS+ PIV WTSSIARYC NGQL +AA EFTRMRLAGVEPNHITFITLLS
Subjt:  MSSVPAQTAIPTQLQQYPNSPSPIPLSNSAQINFPRSANSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLS

Query:  GCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCAQLGPARKVFDCLGMKNSVSWNTLLDAYMRNGEIELAIDLFDEMPTRDAISWTALIN
        GCADFPSES FF SSLHGYA KFGLDTGHVMVGTALIDMY+KC+QLG A+KVFD LG+KNSVSWNT+L+ +MRNGEIELAI LFDEMPTRDAISWTALIN
Subjt:  GCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCAQLGPARKVFDCLGMKNSVSWNTLLDAYMRNGEIELAIDLFDEMPTRDAISWTALIN

Query:  GLVKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGTLTLGLWVNQFVMQQEFKDNIRISNSLIDMYSRCGSIEFARQVFEKMPKRTLVSWNSII
        GL+K GYSEQALECFHQMQ SG+  DYVSIIAVLAACADLG LT GLWVN+FVMQQEFKDN+RISNSLIDMYSRCG IEFARQVF KM KRTLVSWNSII
Subjt:  GLVKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGTLTLGLWVNQFVMQQEFKDNIRISNSLIDMYSRCGSIEFARQVFEKMPKRTLVSWNSII

Query:  VGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGPKLFDNMKRVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPMKPNEVVLG
        VGFA NGFADESLEFF AMQKEGFKPDGVSYTGALTACSHAGLVNKG +LFDNMKRVHKITP IEHYGCIVDLYGRAGRLEDA NVIE MPMKPNEVVLG
Subjt:  VGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGPKLFDNMKRVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPMKPNEVVLG

Query:  SLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLY
        SLLAACRTHGDVRLAERLMKH+FKLD  GDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKK G+SS+EIDGKVHEFVAGDKYHA+AD IYSML+LL+
Subjt:  SLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLY

Query:  HELKICGYAPETDTFMNPKES
        HELK+CGY P+TD  +N K+S
Subjt:  HELKICGYAPETDTFMNPKES

A0A6J1CN07 pentatricopeptide repeat-containing protein At1g05750, chloroplastic | 3.9e-269 | 87.91
Query:  MSSVPAQTAIPTQLQQYPNSPSPIPLSNSAQINFPRSANSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLS
        MSS+PA TA   QLQQYPN  + IPL N   INFPRS NSSNR+ISSKST NSIDPIVLWTSSIARYCRNGQLA+AA EFT MRLAGVEPNH+T ITLLS
Subjt:  MSSVPAQTAIPTQLQQYPNSPSPIPLSNSAQINFPRSANSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLS

Query:  GCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCAQLGPARKVFDCLGMKNSVSWNTLLDAYMRNGEIELAIDLFDEMPTRDAISWTALIN
        GCADFPSESL+FGSSLHGYARK GLDT HVMVGT+++DMYAKCAQLG AR+VFD L MKNSVSWNT+LD Y RNGEIELAIDLFDEMPTRDAISWTALIN
Subjt:  GCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCAQLGPARKVFDCLGMKNSVSWNTLLDAYMRNGEIELAIDLFDEMPTRDAISWTALIN

Query:  GLVKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGTLTLGLWVNQFVMQQEFKDNIRISNSLIDMYSRCGSIEFARQVFEKMPKRTLVSWNSII
        GL+KQGYSEQALECFHQMQCSGI+PDYVSIIAVLAACADLGTLTLGLWVN+FVMQQEFKDNIRISNSLIDMYSRCG I FARQVFE+M KRTLVSWNSII
Subjt:  GLVKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGTLTLGLWVNQFVMQQEFKDNIRISNSLIDMYSRCGSIEFARQVFEKMPKRTLVSWNSII

Query:  VGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGPKLFDNMKRVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPMKPNEVVLG
        VG+A NGFADESLEFFDAMQKEGFKPD VSYTGALTACSHAGLVNKG +LFDNMKRVH+I PRIEHYGCIVDLYGRAGRLEDAL+VIE+MPMKPNEVVLG
Subjt:  VGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGPKLFDNMKRVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPMKPNEVVLG

Query:  SLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLY
        SLLAACRTHGDV LAERLMKHL KLDP GDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSS+EIDGKVHEFVAGDKYHA+AD IYSML+LL 
Subjt:  SLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLY

Query:  HELKICGYAPETDTFMNPKES
        HELKICG  PET+TF+N KES
Subjt:  HELKICGYAPETDTFMNPKES

A0A6J1HU31 pentatricopeptide repeat-containing protein At1g05750, chloroplastic | 3.5e-254 | 83.56
Query:  MSSVPAQTAIP--TQLQQYPNSPSPIPLSNSAQINFPRSANSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITL
        MSSVP+ T IP   QLQQY N PSPIP SN + ++FPR+ NSS          N I PIVLWTSSIARYCRN QLA+AA EFTRMRLAGVEPNHITFITL
Subjt:  MSSVPAQTAIP--TQLQQYPNSPSPIPLSNSAQINFPRSANSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITL

Query:  LSGCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCAQLGPARKVFDCLGMKNSVSWNTLLDAYMRNGEIELAIDLFDEMPTRDAISWTAL
        LSGCADFPS SL FG+SLHGY RK GLDTGHVMVGTALI MYAKCAQLG AR VFD L MKNSV+WNT+LD YMRNGEIELAI+LFDEMPTRDAISWTAL
Subjt:  LSGCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCAQLGPARKVFDCLGMKNSVSWNTLLDAYMRNGEIELAIDLFDEMPTRDAISWTAL

Query:  INGLVKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGTLTLGLWVNQFVMQQEFKDNIRISNSLIDMYSRCGSIEFARQVFEKMPKRTLVSWNS
        ING +KQGYSEQALECFH+MQCSGIEPDYVSIIAVLAACADLG L+ GLWVN+F+MQQEFKDNIRISNSLIDMYSRCG IEFARQVF+KM K TLVSWNS
Subjt:  INGLVKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGTLTLGLWVNQFVMQQEFKDNIRISNSLIDMYSRCGSIEFARQVFEKMPKRTLVSWNS

Query:  IIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGPKLFDNMKRVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPMKPNEVV
        +IVGFA+NGFADESLEFFDAMQKEGF  DGVSYTGALTACSHAGLVNKG +LFDNMKRVH+ITPRIEHYGCIVDLY RAGRL++ALNVIE MPMKPNEVV
Subjt:  IIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGPKLFDNMKRVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPMKPNEVV

Query:  LGSLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLEL
        LGSLLAACRTHGDV LAERL+K+LF+LDP GDS+YVLLSNIYAA+G+WEGANKVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYH +AD IYSMLE+
Subjt:  LGSLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLEL

Query:  LYHELKICGYAPETDTFMNPKES
        L+HELKI GY PET TFMN  ES
Subjt:  LYHELKICGYAPETDTFMNPKES

SwissProt top hits | e value | % identity | Alignment
O23337 Pentatricopeptide repeat-containing protein At4g14820 | 2.4e-98 | 37.64
Query:  IVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLSGCADFPSESLFFGSSLHGY--ARKFGLDTGHVMVGTALIDMYAKCAQLGPARKVFD
        +V W + I RYCR G + +A   F  M+ + V P+ +    ++S C    + ++ +  +++ +       +DT H++  TAL+ MYA    +  AR+ F 
Subjt:  IVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLSGCADFPSESLFFGSSLHGY--ARKFGLDTGHVMVGTALIDMYAKCAQLGPARKVFD

Query:  CLGMKNSVSWNTLLDAYMRNGEIELAIDLFDEMPTRDAISWTALINGLVKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGTLTLGLWVNQFVM
         + ++N      ++  Y + G ++ A  +FD+   +D + WT +I+  V+  Y ++AL  F +M CSGI+PD VS+ +V++ACA+LG L    WV+  + 
Subjt:  CLGMKNSVSWNTLLDAYMRNGEIELAIDLFDEMPTRDAISWTALINGLVKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGTLTLGLWVNQFVM

Query:  QQEFKDNIRISNSLIDMYSRCGSIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGPKLFDNM
            +  + I+N+LI+MY++CG ++  R VFEKMP+R +VSW+S+I   +++G A ++L  F  M++E  +P+ V++ G L  CSH+GLV +G K+F +M
Subjt:  QQEFKDNIRISNSLIDMYSRCGSIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGPKLFDNM

Query:  KRVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPMKPNEVVLGSLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGANKVRR
           + ITP++EHYGC+VDL+GRA  L +AL VIE MP+  N V+ GSL++ACR HG++ L +   K + +L+P  D   VL+SNIYA   +WE    +RR
Subjt:  KRVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPMKPNEVVLGSLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGANKVRR

Query:  TMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLYHELKICGYAPE
         M+ + V K+ G S I+ +GK HEF+ GDK H  ++ IY+ L+ +  +LK+ GY P+
Subjt:  TMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLYHELKICGYAPE

O82380 Pentatricopeptide repeat-containing protein At2g29760, chloroplastic | 3.9e-101 | 37.28
Query:  IVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLSGCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCAQLGPARKVFDCL
        +V W S I  + + G    A   F +M    V+ +H+T + +LS CA     +L FG  +  Y  +  ++  ++ +  A++DMY KC  +  A+++FD +
Subjt:  IVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLSGCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCAQLGPARKVFDCL

Query:  GMKNSVSWNTLLDAYMRNGEIELAIDLFDEMPTRDAISWTALINGLVKQGYSEQALECFHQMQC-SGIEPDYVSIIAVLAACADLGTLTLGLWVNQFVMQ
          K++V+W T+LD Y  + + E A ++ + MP +D ++W ALI+   + G   +AL  FH++Q    ++ + +++++ L+ACA +G L LG W++ ++ +
Subjt:  GMKNSVSWNTLLDAYMRNGEIELAIDLFDEMPTRDAISWTALINGLVKQGYSEQALECFHQMQC-SGIEPDYVSIIAVLAACADLGTLTLGLWVNQFVMQ

Query:  QEFKDNIRISNSLIDMYSRCGSIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGPKLFDNMK
           + N  ++++LI MYS+CG +E +R+VF  + KR +  W+++I G A++G  +E+++ F  MQ+   KP+GV++T    ACSH GLV++   LF  M+
Subjt:  QEFKDNIRISNSLIDMYSRCGSIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGPKLFDNMK

Query:  RVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPMKPNEVVLGSLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGANKVRRT
          + I P  +HY CIVD+ GR+G LE A+  IE MP+ P+  V G+LL AC+ H ++ LAE     L +L+PR D  +VLLSNIYA +GKWE  +++R+ 
Subjt:  RVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPMKPNEVVLGSLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGANKVRRT

Query:  MKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLYHELKICGYAPE
        M+  G++K+PG SSIEIDG +HEF++GD  H  ++ +Y  L  +  +LK  GY PE
Subjt:  MKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLYHELKICGYAPE

Q9LUJ2 Pentatricopeptide repeat-containing protein At3g22690 | 1.6e-102 | 38.99
Query:  LWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLSGCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCAQLGPARKVFDCLGM
        L  +  + Y R G   +A   F  M  +GV P+ I+ ++ +S C+     ++ +G S HGY  + G ++    +  ALIDMY KC +   A ++FD +  
Subjt:  LWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLSGCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCAQLGPARKVFDCLGM

Query:  KNSVSWNTLLDAYMRNGEIELAIDLFDEMPTRDAISWTALINGLVKQGYSEQALECFHQMQC-SGIEPDYVSIIAVLAACADLGTLTLGLWVNQFVMQQE
        K  V+WN+++  Y+ NGE++ A + F+ MP ++ +SW  +I+GLV+    E+A+E F  MQ   G+  D V+++++ +AC  LG L L  W+  ++ +  
Subjt:  KNSVSWNTLLDAYMRNGEIELAIDLFDEMPTRDAISWTALINGLVKQGYSEQALECFHQMQC-SGIEPDYVSIIAVLAACADLGTLTLGLWVNQFVMQQE

Query:  FKDNIRISNSLIDMYSRCGSIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGPKLFDNMKRV
         + ++R+  +L+DM+SRCG  E A  +F  +  R + +W + I   A+ G A+ ++E FD M ++G KPDGV++ GALTACSH GLV +G ++F +M ++
Subjt:  FKDNIRISNSLIDMYSRCGSIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGPKLFDNMKRV

Query:  HKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPMKPNEVVLGSLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGANKVRRTMK
        H ++P   HYGC+VDL GRAG LE+A+ +IE MPM+PN+V+  SLLAACR  G+V +A    + +  L P    +YVLLSN+YA+ G+W    KVR +MK
Subjt:  HKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPMKPNEVVLGSLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGANKVRRTMK

Query:  ARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLYHELKICGYAPE
         +G++K PG SSI+I GK HEF +GD+ H     I +ML+ +       G+ P+
Subjt:  ARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLYHELKICGYAPE

Q9MA50 Pentatricopeptide repeat-containing protein At1g05750, chloroplastic | 1.6e-166 | 60
Query:  NSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLSGCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALID
        N +N  I  +  +++ +  V WTS I    RNG+LA+AA EF+ M LAGVEPNHITFI LLSGC DF S S   G  LHGYA K GLD  HVMVGTA+I 
Subjt:  NSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLSGCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALID

Query:  MYAKCAQLGPARKVFDCLGMKNSVSWNTLLDAYMRNGEIELAIDLFDEMPTRDAISWTALINGLVKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACA
        MY+K  +   AR VFD +  KNSV+WNT++D YMR+G+++ A  +FD+MP RD ISWTA+ING VK+GY E+AL  F +MQ SG++PDYV+IIA L AC 
Subjt:  MYAKCAQLGPARKVFDCLGMKNSVSWNTLLDAYMRNGEIELAIDLFDEMPTRDAISWTALINGLVKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACA

Query:  DLGTLTLGLWVNQFVMQQEFKDNIRISNSLIDMYSRCGSIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTAC
        +LG L+ GLWV+++V+ Q+FK+N+R+SNSLID+Y RCG +EFARQVF  M KRT+VSWNS+IVGFA NG A ESL +F  MQ++GFKPD V++TGALTAC
Subjt:  DLGTLTLGLWVNQFVMQQEFKDNIRISNSLIDMYSRCGSIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTAC

Query:  SHAGLVNKGPKLFDNMKRVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPMKPNEVVLGSLLAACRTHG-DVRLAERLMKHLFKLDPRGDSNYVLLS
        SH GLV +G + F  MK  ++I+PRIEHYGC+VDLY RAGRLEDAL +++ MPMKPNEVV+GSLLAAC  HG ++ LAERLMKHL  L+ +  SNYV+LS
Subjt:  SHAGLVNKGPKLFDNMKRVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPMKPNEVVLGSLLAACRTHG-DVRLAERLMKHLFKLDPRGDSNYVLLS

Query:  NIYAAIGKWEGANKVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLYHELKICGYAPET
        N+YAA GKWEGA+K+RR MK  G++K+PGFSSIEID  +H F+AGD  H     I  +LEL+  +L++ G   ET
Subjt:  NIYAAIGKWEGANKVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLYHELKICGYAPET

Q9SZT8 Pentatricopeptide repeat-containing protein ELI1, chloroplastic | 1.4e-101 | 38.85
Query:  SIDP-IVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLSGCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCAQLGPARK
        +IDP + L+T++I     NG    A   + ++  + + PN  TF +LL  C      S   G  +H +  KFGL      V T L+D+YAK   +  A+K
Subjt:  SIDP-IVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLSGCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCAQLGPARK

Query:  VFDCLGMKNSVSWNTLLDAYMRNGEIELAIDLFDEMPTRDAISWTALINGLVKQGYSEQALECFHQMQCSG-IEPDYVSIIAVLAACADLGTLTLGLWVN
        VFD +  ++ VS   ++  Y + G +E A  LFD M  RD +SW  +I+G  + G+   AL  F ++   G  +PD ++++A L+AC+ +G L  G W++
Subjt:  VFDCLGMKNSVSWNTLLDAYMRNGEIELAIDLFDEMPTRDAISWTALINGLVKQGYSEQALECFHQMQCSG-IEPDYVSIIAVLAACADLGTLTLGLWVN

Query:  QFVMQQEFKDNIRISNSLIDMYSRCGSIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQK-EGFKPDGVSYTGALTACSHAGLVNKGPK
         FV     + N+++   LIDMYS+CGS+E A  VF   P++ +V+WN++I G+A++G++ ++L  F+ MQ   G +P  +++ G L AC+HAGLVN+G +
Subjt:  QFVMQQEFKDNIRISNSLIDMYSRCGSIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQK-EGFKPDGVSYTGALTACSHAGLVNKGPK

Query:  LFDNMKRVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPMKPNEVVLGSLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGA
        +F++M + + I P+IEHYGC+V L GRAG+L+ A   I+ M M  + V+  S+L +C+ HGD  L + + ++L  L+ +    YVLLSNIYA++G +EG 
Subjt:  LFDNMKRVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPMKPNEVVLGSLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGA

Query:  NKVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLYHELKICGYAPETDTFMNPKE
         KVR  MK +G+ K+PG S+IEI+ KVHEF AGD+ H+ +  IY+ML  +   +K  GY P T+T +   E
Subjt:  NKVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLYHELKICGYAPETDTFMNPKE

Arabidopsis top hits | e value | % identity | Alignment
AT1G05750.1 Tetratricopeptide repeat (TPR)-like superfamily protein | 1.1e-167 | 60
Query:  NSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLSGCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALID
        N +N  I  +  +++ +  V WTS I    RNG+LA+AA EF+ M LAGVEPNHITFI LLSGC DF S S   G  LHGYA K GLD  HVMVGTA+I 
Subjt:  NSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLSGCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALID

Query:  MYAKCAQLGPARKVFDCLGMKNSVSWNTLLDAYMRNGEIELAIDLFDEMPTRDAISWTALINGLVKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACA
        MY+K  +   AR VFD +  KNSV+WNT++D YMR+G+++ A  +FD+MP RD ISWTA+ING VK+GY E+AL  F +MQ SG++PDYV+IIA L AC 
Subjt:  MYAKCAQLGPARKVFDCLGMKNSVSWNTLLDAYMRNGEIELAIDLFDEMPTRDAISWTALINGLVKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACA

Query:  DLGTLTLGLWVNQFVMQQEFKDNIRISNSLIDMYSRCGSIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTAC
        +LG L+ GLWV+++V+ Q+FK+N+R+SNSLID+Y RCG +EFARQVF  M KRT+VSWNS+IVGFA NG A ESL +F  MQ++GFKPD V++TGALTAC
Subjt:  DLGTLTLGLWVNQFVMQQEFKDNIRISNSLIDMYSRCGSIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTAC

Query:  SHAGLVNKGPKLFDNMKRVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPMKPNEVVLGSLLAACRTHG-DVRLAERLMKHLFKLDPRGDSNYVLLS
        SH GLV +G + F  MK  ++I+PRIEHYGC+VDLY RAGRLEDAL +++ MPMKPNEVV+GSLLAAC  HG ++ LAERLMKHL  L+ +  SNYV+LS
Subjt:  SHAGLVNKGPKLFDNMKRVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPMKPNEVVLGSLLAACRTHG-DVRLAERLMKHLFKLDPRGDSNYVLLS

Query:  NIYAAIGKWEGANKVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLYHELKICGYAPET
        N+YAA GKWEGA+K+RR MK  G++K+PGFSSIEID  +H F+AGD  H     I  +LEL+  +L++ G   ET
Subjt:  NIYAAIGKWEGANKVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLYHELKICGYAPET

AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein | 2.8e-102 | 37.28
Query:  IVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLSGCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCAQLGPARKVFDCL
        +V W S I  + + G    A   F +M    V+ +H+T + +LS CA     +L FG  +  Y  +  ++  ++ +  A++DMY KC  +  A+++FD +
Subjt:  IVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLSGCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCAQLGPARKVFDCL

Query:  GMKNSVSWNTLLDAYMRNGEIELAIDLFDEMPTRDAISWTALINGLVKQGYSEQALECFHQMQC-SGIEPDYVSIIAVLAACADLGTLTLGLWVNQFVMQ
          K++V+W T+LD Y  + + E A ++ + MP +D ++W ALI+   + G   +AL  FH++Q    ++ + +++++ L+ACA +G L LG W++ ++ +
Subjt:  GMKNSVSWNTLLDAYMRNGEIELAIDLFDEMPTRDAISWTALINGLVKQGYSEQALECFHQMQC-SGIEPDYVSIIAVLAACADLGTLTLGLWVNQFVMQ

Query:  QEFKDNIRISNSLIDMYSRCGSIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGPKLFDNMK
           + N  ++++LI MYS+CG +E +R+VF  + KR +  W+++I G A++G  +E+++ F  MQ+   KP+GV++T    ACSH GLV++   LF  M+
Subjt:  QEFKDNIRISNSLIDMYSRCGSIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGPKLFDNMK

Query:  RVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPMKPNEVVLGSLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGANKVRRT
          + I P  +HY CIVD+ GR+G LE A+  IE MP+ P+  V G+LL AC+ H ++ LAE     L +L+PR D  +VLLSNIYA +GKWE  +++R+ 
Subjt:  RVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPMKPNEVVLGSLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGANKVRRT

Query:  MKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLYHELKICGYAPE
        M+  G++K+PG SSIEIDG +HEF++GD  H  ++ +Y  L  +  +LK  GY PE
Subjt:  MKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLYHELKICGYAPE

AT3G22690.1 CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012881), Pentatricopeptide repeat (InterPro:IPR002885) | 1.1e-103 | 38.99
Query:  LWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLSGCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCAQLGPARKVFDCLGM
        L  +  + Y R G   +A   F  M  +GV P+ I+ ++ +S C+     ++ +G S HGY  + G ++    +  ALIDMY KC +   A ++FD +  
Subjt:  LWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLSGCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCAQLGPARKVFDCLGM

Query:  KNSVSWNTLLDAYMRNGEIELAIDLFDEMPTRDAISWTALINGLVKQGYSEQALECFHQMQC-SGIEPDYVSIIAVLAACADLGTLTLGLWVNQFVMQQE
        K  V+WN+++  Y+ NGE++ A + F+ MP ++ +SW  +I+GLV+    E+A+E F  MQ   G+  D V+++++ +AC  LG L L  W+  ++ +  
Subjt:  KNSVSWNTLLDAYMRNGEIELAIDLFDEMPTRDAISWTALINGLVKQGYSEQALECFHQMQC-SGIEPDYVSIIAVLAACADLGTLTLGLWVNQFVMQQE

Query:  FKDNIRISNSLIDMYSRCGSIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGPKLFDNMKRV
         + ++R+  +L+DM+SRCG  E A  +F  +  R + +W + I   A+ G A+ ++E FD M ++G KPDGV++ GALTACSH GLV +G ++F +M ++
Subjt:  FKDNIRISNSLIDMYSRCGSIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGPKLFDNMKRV

Query:  HKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPMKPNEVVLGSLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGANKVRRTMK
        H ++P   HYGC+VDL GRAG LE+A+ +IE MPM+PN+V+  SLLAACR  G+V +A    + +  L P    +YVLLSN+YA+ G+W    KVR +MK
Subjt:  HKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPMKPNEVVLGSLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGANKVRRTMK

Query:  ARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLYHELKICGYAPE
         +G++K PG SSI+I GK HEF +GD+ H     I +ML+ +       G+ P+
Subjt:  ARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLYHELKICGYAPE

AT3G22690.2 INVOLVED IN: photosystem II assembly, regulation of chlorophyll biosynthetic process, photosystem I assembly, thylakoid membrane organization, RNA modification  e value: 1.1e-103  %identity: 38.99
Query:  LWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLSGCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCAQLGPARKVFDCLGM
        L  +  + Y R G   +A   F  M  +GV P+ I+ ++ +S C+     ++ +G S HGY  + G ++    +  ALIDMY KC +   A ++FD +  
Subjt:  LWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLSGCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCAQLGPARKVFDCLGM

Query:  KNSVSWNTLLDAYMRNGEIELAIDLFDEMPTRDAISWTALINGLVKQGYSEQALECFHQMQC-SGIEPDYVSIIAVLAACADLGTLTLGLWVNQFVMQQE
        K  V+WN+++  Y+ NGE++ A + F+ MP ++ +SW  +I+GLV+    E+A+E F  MQ   G+  D V+++++ +AC  LG L L  W+  ++ +  
Subjt:  KNSVSWNTLLDAYMRNGEIELAIDLFDEMPTRDAISWTALINGLVKQGYSEQALECFHQMQC-SGIEPDYVSIIAVLAACADLGTLTLGLWVNQFVMQQE

Query:  FKDNIRISNSLIDMYSRCGSIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGPKLFDNMKRV
         + ++R+  +L+DM+SRCG  E A  +F  +  R + +W + I   A+ G A+ ++E FD M ++G KPDGV++ GALTACSH GLV +G ++F +M ++
Subjt:  FKDNIRISNSLIDMYSRCGSIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGPKLFDNMKRV

Query:  HKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPMKPNEVVLGSLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGANKVRRTMK
        H ++P   HYGC+VDL GRAG LE+A+ +IE MPM+PN+V+  SLLAACR  G+V +A    + +  L P    +YVLLSN+YA+ G+W    KVR +MK
Subjt:  HKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPMKPNEVVLGSLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGANKVRRTMK

Query:  ARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLYHELKICGYAPE
         +G++K PG SSI+I GK HEF +GD+ H     I +ML+ +       G+ P+
Subjt:  ARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLYHELKICGYAPE

AT4G37380.1 Tetratricopeptide repeat (TPR)-like superfamily protein  e value: 9.6e-103  %identity: 38.85
Query:  SIDP-IVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLSGCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCAQLGPARK
        +IDP + L+T++I     NG    A   + ++  + + PN  TF +LL  C      S   G  +H +  KFGL      V T L+D+YAK   +  A+K
Subjt:  SIDP-IVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLSGCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCAQLGPARK

Query:  VFDCLGMKNSVSWNTLLDAYMRNGEIELAIDLFDEMPTRDAISWTALINGLVKQGYSEQALECFHQMQCSG-IEPDYVSIIAVLAACADLGTLTLGLWVN
        VFD +  ++ VS   ++  Y + G +E A  LFD M  RD +SW  +I+G  + G+   AL  F ++   G  +PD ++++A L+AC+ +G L  G W++
Subjt:  VFDCLGMKNSVSWNTLLDAYMRNGEIELAIDLFDEMPTRDAISWTALINGLVKQGYSEQALECFHQMQCSG-IEPDYVSIIAVLAACADLGTLTLGLWVN

Query:  QFVMQQEFKDNIRISNSLIDMYSRCGSIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQK-EGFKPDGVSYTGALTACSHAGLVNKGPK
         FV     + N+++   LIDMYS+CGS+E A  VF   P++ +V+WN++I G+A++G++ ++L  F+ MQ   G +P  +++ G L AC+HAGLVN+G +
Subjt:  QFVMQQEFKDNIRISNSLIDMYSRCGSIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQK-EGFKPDGVSYTGALTACSHAGLVNKGPK

Query:  LFDNMKRVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPMKPNEVVLGSLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGA
        +F++M + + I P+IEHYGC+V L GRAG+L+ A   I+ M M  + V+  S+L +C+ HGD  L + + ++L  L+ +    YVLLSNIYA++G +EG 
Subjt:  LFDNMKRVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPMKPNEVVLGSLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGA

Query:  NKVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLYHELKICGYAPETDTFMNPKE
         KVR  MK +G+ K+PG S+IEI+ KVHEF AGD+ H+ +  IY+ML  +   +K  GY P T+T +   E
Subjt:  NKVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLYHELKICGYAPETDTFMNPKE


Sequences
CDS sequence
ATGAGCAGCGTTCCTGCGCAGACAGCCATTCCAACCCAGCTCCAACAATATCCTAATTCTCCTTCTCCAATCCCACTTTCGAACTCAGCACAAATCAACTTCCCTCGCTC
TGCCAATTCCTCAAATCGCAATATCTCCTCCAAATCCACTCGCAATTCTATCGACCCCATTGTTCTATGGACCTCTTCTATTGCTCGCTACTGCCGCAACGGCCAATTAG
CCGATGCCGCCACAGAGTTTACGAGGATGAGACTCGCCGGAGTTGAGCCGAACCACATCACATTCATCACGCTTCTCTCCGGCTGTGCTGATTTTCCGTCAGAAAGCCTC
TTCTTCGGCTCTTCGCTTCATGGTTACGCCCGAAAATTTGGATTGGATACAGGGCATGTAATGGTGGGGACTGCTCTGATTGATATGTATGCCAAATGCGCTCAATTGGG
TCCTGCTAGGAAGGTTTTTGATTGCCTGGGCATGAAAAACTCTGTCTCTTGGAACACGTTGCTCGATGCTTACATGAGGAATGGAGAGATTGAGTTGGCCATTGACCTGT
TTGATGAAATGCCTACAAGAGATGCGATTTCTTGGACGGCTTTGATTAATGGTCTTGTGAAACAGGGATACTCCGAACAAGCATTGGAGTGCTTCCATCAGATGCAATGC
TCGGGTATTGAGCCTGATTATGTGTCTATAATTGCTGTTCTTGCTGCGTGTGCTGATTTGGGCACGCTTACTTTGGGGTTATGGGTTAATCAGTTCGTTATGCAGCAGGA
GTTTAAGGATAATATTAGGATAAGTAATTCCTTGATAGATATGTATTCTCGATGTGGATCTATTGAGTTTGCCCGCCAAGTGTTTGAGAAAATGCCCAAGCGAACTTTGG
TGTCTTGGAACTCCATTATTGTGGGGTTTGCTGTTAATGGGTTTGCAGATGAATCTCTGGAGTTTTTTGATGCAATGCAGAAGGAAGGATTCAAGCCAGATGGAGTTAGC
TACACGGGAGCTCTTACTGCATGTAGCCATGCTGGCTTAGTGAATAAGGGGCCCAAATTGTTTGATAACATGAAGAGGGTACATAAAATTACTCCAAGGATTGAGCATTA
TGGGTGCATTGTTGACCTCTATGGCCGTGCAGGGAGGTTGGAGGATGCATTGAATGTGATCGAGAGAATGCCGATGAAACCGAATGAAGTTGTACTGGGGTCGCTGCTGG
CTGCCTGCAGAACTCATGGTGATGTGAGACTGGCTGAAAGGTTAATGAAACATCTCTTTAAGTTGGACCCTAGAGGCGATTCAAATTACGTGCTTCTTTCGAACATATAT
GCAGCGATTGGGAAATGGGAAGGTGCTAATAAGGTGAGGAGAACAATGAAGGCTCGAGGTGTGCAGAAAAAGCCAGGGTTTAGTTCAATTGAGATTGATGGTAAGGTTCA
TGAGTTTGTTGCTGGTGATAAATACCATGCTAATGCAGACTGTATTTATTCAATGTTAGAGCTGTTGTATCATGAACTAAAGATATGTGGCTATGCTCCTGAAACTGATA
CTTTTATGAATCCTAAAGAATCTTAG
mRNA sequence
ATGAGCAGCGTTCCTGCGCAGACAGCCATTCCAACCCAGCTCCAACAATATCCTAATTCTCCTTCTCCAATCCCACTTTCGAACTCAGCACAAATCAACTTCCCTCGCTC
TGCCAATTCCTCAAATCGCAATATCTCCTCCAAATCCACTCGCAATTCTATCGACCCCATTGTTCTATGGACCTCTTCTATTGCTCGCTACTGCCGCAACGGCCAATTAG
CCGATGCCGCCACAGAGTTTACGAGGATGAGACTCGCCGGAGTTGAGCCGAACCACATCACATTCATCACGCTTCTCTCCGGCTGTGCTGATTTTCCGTCAGAAAGCCTC
TTCTTCGGCTCTTCGCTTCATGGTTACGCCCGAAAATTTGGATTGGATACAGGGCATGTAATGGTGGGGACTGCTCTGATTGATATGTATGCCAAATGCGCTCAATTGGG
TCCTGCTAGGAAGGTTTTTGATTGCCTGGGCATGAAAAACTCTGTCTCTTGGAACACGTTGCTCGATGCTTACATGAGGAATGGAGAGATTGAGTTGGCCATTGACCTGT
TTGATGAAATGCCTACAAGAGATGCGATTTCTTGGACGGCTTTGATTAATGGTCTTGTGAAACAGGGATACTCCGAACAAGCATTGGAGTGCTTCCATCAGATGCAATGC
TCGGGTATTGAGCCTGATTATGTGTCTATAATTGCTGTTCTTGCTGCGTGTGCTGATTTGGGCACGCTTACTTTGGGGTTATGGGTTAATCAGTTCGTTATGCAGCAGGA
GTTTAAGGATAATATTAGGATAAGTAATTCCTTGATAGATATGTATTCTCGATGTGGATCTATTGAGTTTGCCCGCCAAGTGTTTGAGAAAATGCCCAAGCGAACTTTGG
TGTCTTGGAACTCCATTATTGTGGGGTTTGCTGTTAATGGGTTTGCAGATGAATCTCTGGAGTTTTTTGATGCAATGCAGAAGGAAGGATTCAAGCCAGATGGAGTTAGC
TACACGGGAGCTCTTACTGCATGTAGCCATGCTGGCTTAGTGAATAAGGGGCCCAAATTGTTTGATAACATGAAGAGGGTACATAAAATTACTCCAAGGATTGAGCATTA
TGGGTGCATTGTTGACCTCTATGGCCGTGCAGGGAGGTTGGAGGATGCATTGAATGTGATCGAGAGAATGCCGATGAAACCGAATGAAGTTGTACTGGGGTCGCTGCTGG
CTGCCTGCAGAACTCATGGTGATGTGAGACTGGCTGAAAGGTTAATGAAACATCTCTTTAAGTTGGACCCTAGAGGCGATTCAAATTACGTGCTTCTTTCGAACATATAT
GCAGCGATTGGGAAATGGGAAGGTGCTAATAAGGTGAGGAGAACAATGAAGGCTCGAGGTGTGCAGAAAAAGCCAGGGTTTAGTTCAATTGAGATTGATGGTAAGGTTCA
TGAGTTTGTTGCTGGTGATAAATACCATGCTAATGCAGACTGTATTTATTCAATGTTAGAGCTGTTGTATCATGAACTAAAGATATGTGGCTATGCTCCTGAAACTGATA
CTTTTATGAATCCTAAAGAATCTTAG
Protein sequence
MSSVPAQTAIPTQLQQYPNSPSPIPLSNSAQINFPRSANSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLSGCADFPSESL
FFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCAQLGPARKVFDCLGMKNSVSWNTLLDAYMRNGEIELAIDLFDEMPTRDAISWTALINGLVKQGYSEQALECFHQMQC
SGIEPDYVSIIAVLAACADLGTLTLGLWVNQFVMQQEFKDNIRISNSLIDMYSRCGSIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDGVS
YTGALTACSHAGLVNKGPKLFDNMKRVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPMKPNEVVLGSLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIY
AAIGKWEGANKVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLYHELKICGYAPETDTFMNPKES
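
The protein sequence above should correspond to a direct translation of the CDS. The following is a minimal sketch of how that could be checked, assuming the standard genetic code (NCBI translation table 1) applies to this gene. The function name `translate` and the variable names are hypothetical and are not part of CuGenDB. Only the first 30 nucleotides of the CDS are used here; pasting the full CDS gives the same kind of check against the full protein.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1),
# with codons enumerated in TCAG order for each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so a sequence pasted
    as several lines can be passed in directly. Codons containing
    characters other than A, C, G, T are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the Spg019955 CDS listed above.
cds_start = "ATGAGCAGCGTTCCTGCGCAGACAGCCATT"
print(translate(cds_start))  # MSSVPAQTAI
```

The printed peptide matches the first ten residues of the protein sequence listed above (MSSVPAQTAI).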