; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0025714 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0025714
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationchr10:18301241..18302809
RNA-Seq ExpressionLag0025714
SyntenyLag0025714
Gene Ontology termsGO:0009451 - RNA modification (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004139593.1 pentatricopeptide repeat-containing protein At1g05750, chloroplastic [Cucumis sativus]2.8e-26185.41Show/hide
Query:  MSSVPAQTAIPTQLQQYPNSPSPIPLSNSTKINFPRSANSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLS
        MSS+P+ TA P+QLQ  P +PS IPLSN TK+NFPRS NS +RNISSK   NS+DPIVLWTSS+ARYCRNGQL++AA EFTRMRLAGVEPNHITFITLLS
Subjt:  MSSVPAQTAIPTQLQQYPNSPSPIPLSNSTKINFPRSANSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLS

Query:  GCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCSQLGPARKVFDSLGMKNSVSWNTMLDAYMRNGEIELAIDLFDEMPTRDAISWTALIN
         CADFPSES FF SSLHGYA K+GLDTGHVMVGTALIDMY+KC+QLG ARKVF +LG+KNSVSWNTML+ +MRNGEIELAI LFDEMPTRDAISWTALIN
Subjt:  GCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCSQLGPARKVFDSLGMKNSVSWNTMLDAYMRNGEIELAIDLFDEMPTRDAISWTALIN

Query:  GLVKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGTLTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSII
        GL+K GYSEQALECFHQMQ SG+  DYVSIIAVLAACADLG LTLGLWV+RFVM QEFKDNI+ISNSLIDMYSRCGCIEFARQVF KM KRTLVSWNSII
Subjt:  GLVKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGTLTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSII

Query:  VGFAVNGFADESLEFFDAMQKEGFKPDEVSYTGALTACSHAGLVNKGLELFDNMKRVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPVKPNEVVLG
        VGFAVNGFADESLEFF AMQKEGFKPD VSYTGALTACSHAGLVNKGLELFDNMK VHKITPRIEHYGCIVDLYGRAGRLEDALN+IE MP+KPNEVVLG
Subjt:  VGFAVNGFADESLEFFDAMQKEGFKPDEVSYTGALTACSHAGLVNKGLELFDNMKRVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPVKPNEVVLG

Query:  SLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLY
        SLLAACRTHGDV LAERLMKHLFKLDP GD+ YVLLSNIYAAIGKW+GAN VRRTMKARGVQKKPG+SS+EIDGKVHEFVAGD YHA+AD IYSML+LL 
Subjt:  SLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLY

Query:  HELKICGYAPETDTFMNPKES
        HELK+CGY P +DT +N KES
Subjt:  HELKICGYAPETDTFMNPKES

XP_008458940.1 PREDICTED: pentatricopeptide repeat-containing protein At1g05750, chloroplastic [Cucumis melo]7.1e-25785.41Show/hide
Query:  MSSVPAQTAIPTQLQQYPNSPSPIPLSNSTKINFPRSANSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLS
        MSS+P+  A P+QLQQ P+  S IPLSN TK+NFPRS  S + NI SK T NS+ PIV WTSSIARYC NGQL +AA EFTRMRLAGVEPNHITFITLLS
Subjt:  MSSVPAQTAIPTQLQQYPNSPSPIPLSNSTKINFPRSANSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLS

Query:  GCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCSQLGPARKVFDSLGMKNSVSWNTMLDAYMRNGEIELAIDLFDEMPTRDAISWTALIN
        GCADFPSES FF SSLHGYA KFGLDTGHVMVGTALIDMY+KCSQLG A+KVFD LG+KNSVSWNTML+ +MRNGEIELAI LFDEMPTRDAISWTALIN
Subjt:  GCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCSQLGPARKVFDSLGMKNSVSWNTMLDAYMRNGEIELAIDLFDEMPTRDAISWTALIN

Query:  GLVKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGTLTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSII
        GL+K GYSEQALECFHQMQ SG+  DYVSIIAVLAACADLG LT GLWVNRFVMQQEFKDN+RISNSLIDMYSRCGCIEFARQVF KM KRTLVSWNSII
Subjt:  GLVKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGTLTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSII

Query:  VGFAVNGFADESLEFFDAMQKEGFKPDEVSYTGALTACSHAGLVNKGLELFDNMKRVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPVKPNEVVLG
        VGFA NGFADESLEFF AMQKEGFKPD VSYTGALTACSHAGLVNKGLELFDNMKRVHKITP IEHYGCIVDLYGRAGRLEDA NVIE MP+KPNEVVLG
Subjt:  VGFAVNGFADESLEFFDAMQKEGFKPDEVSYTGALTACSHAGLVNKGLELFDNMKRVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPVKPNEVVLG

Query:  SLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLY
        SLLAACRTHGDVRLAERLMKH+FKLD  GDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKK G+SS+EIDGKVHEFVAGDKYHA+AD IYSML+LL+
Subjt:  SLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLY

Query:  HELKICGYAPETDTFMNPKES
        HELK+CGY P+TD  +N K+S
Subjt:  HELKICGYAPETDTFMNPKES

XP_022142716.1 pentatricopeptide repeat-containing protein At1g05750, chloroplastic [Momordica charantia]3.9e-27188.48Show/hide
Query:  MSSVPAQTAIPTQLQQYPNSPSPIPLSNSTKINFPRSANSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLS
        MSS+PA TA   QLQQYPN  + IPL N   INFPRS NSSNR+ISSKST NSIDPIVLWTSSIARYCRNGQLA+AA EFT MRLAGVEPNH+T ITLLS
Subjt:  MSSVPAQTAIPTQLQQYPNSPSPIPLSNSTKINFPRSANSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLS

Query:  GCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCSQLGPARKVFDSLGMKNSVSWNTMLDAYMRNGEIELAIDLFDEMPTRDAISWTALIN
        GCADFPSESL+FGSSLHGYARK GLDT HVMVGT+++DMYAKC+QLG AR+VFD L MKNSVSWNTMLD Y RNGEIELAIDLFDEMPTRDAISWTALIN
Subjt:  GCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCSQLGPARKVFDSLGMKNSVSWNTMLDAYMRNGEIELAIDLFDEMPTRDAISWTALIN

Query:  GLVKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGTLTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSII
        GL+KQGYSEQALECFHQMQCSGI+PDYVSIIAVLAACADLGTLTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCI FARQVFE+M KRTLVSWNSII
Subjt:  GLVKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGTLTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSII

Query:  VGFAVNGFADESLEFFDAMQKEGFKPDEVSYTGALTACSHAGLVNKGLELFDNMKRVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPVKPNEVVLG
        VG+A NGFADESLEFFDAMQKEGFKPD VSYTGALTACSHAGLVNKGLELFDNMKRVH+I PRIEHYGCIVDLYGRAGRLEDAL+VIE+MP+KPNEVVLG
Subjt:  VGFAVNGFADESLEFFDAMQKEGFKPDEVSYTGALTACSHAGLVNKGLELFDNMKRVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPVKPNEVVLG

Query:  SLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLY
        SLLAACRTHGDV LAERLMKHL KLDP GDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSS+EIDGKVHEFVAGDKYHA+AD IYSML+LL 
Subjt:  SLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLY

Query:  HELKICGYAPETDTFMNPKES
        HELKICG  PET+TF+N KES
Subjt:  HELKICGYAPETDTFMNPKES

XP_022967078.1 pentatricopeptide repeat-containing protein At1g05750, chloroplastic [Cucurbita maxima]1.7e-25583.94Show/hide
Query:  MSSVPAQTAIP--TQLQQYPNSPSPIPLSNSTKINFPRSANSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITL
        MSSVP+ T IP   QLQQY N PSPIP SN + ++FPR+ NSS          N I PIVLWTSSIARYCRN QLA+AA EFTRMRLAGVEPNHITFITL
Subjt:  MSSVPAQTAIP--TQLQQYPNSPSPIPLSNSTKINFPRSANSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITL

Query:  LSGCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCSQLGPARKVFDSLGMKNSVSWNTMLDAYMRNGEIELAIDLFDEMPTRDAISWTAL
        LSGCADFPS SL FG+SLHGY RK GLDTGHVMVGTALI MYAKC+QLG AR VFD L MKNSV+WNTMLD YMRNGEIELAI+LFDEMPTRDAISWTAL
Subjt:  LSGCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCSQLGPARKVFDSLGMKNSVSWNTMLDAYMRNGEIELAIDLFDEMPTRDAISWTAL

Query:  INGLVKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGTLTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNS
        ING +KQGYSEQALECFH+MQCSGIEPDYVSIIAVLAACADLG L+ GLWVNRF+MQQEFKDNIRISNSLIDMYSRCGCIEFARQVF+KM K TLVSWNS
Subjt:  INGLVKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGTLTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNS

Query:  IIVGFAVNGFADESLEFFDAMQKEGFKPDEVSYTGALTACSHAGLVNKGLELFDNMKRVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPVKPNEVV
        +IVGFA+NGFADESLEFFDAMQKEGF  D VSYTGALTACSHAGLVNKGLELFDNMKRVH+ITPRIEHYGCIVDLY RAGRL++ALNVIE MP+KPNEVV
Subjt:  IIVGFAVNGFADESLEFFDAMQKEGFKPDEVSYTGALTACSHAGLVNKGLELFDNMKRVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPVKPNEVV

Query:  LGSLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLEL
        LGSLLAACRTHGDV LAERL+K+LF+LDP GDS+YVLLSNIYAA+G+WEGANKVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYH +AD IYSMLE+
Subjt:  LGSLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLEL

Query:  LYHELKICGYAPETDTFMNPKES
        L+HELKI GY PET TFMN  ES
Subjt:  LYHELKICGYAPETDTFMNPKES

XP_038877228.1 pentatricopeptide repeat-containing protein At1g05750, chloroplastic [Benincasa hispida]1.3e-27489.42Show/hide
Query:  MSSVPAQTAIPTQLQQYPNSPSPIPLSNSTKINFPRSANSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLS
        MSS P+ TAIP+QLQQYPN PS IPLSN TK+NFPRS NSS+RNISSK   NSIDPIVLWTSS+ARYCRNGQL++AATEFTRMRLAGVEPNH+TFITLLS
Subjt:  MSSVPAQTAIPTQLQQYPNSPSPIPLSNSTKINFPRSANSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLS

Query:  GCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCSQLGPARKVFDSLGMKNSVSWNTMLDAYMRNGEIELAIDLFDEMPTRDAISWTALIN
        GC DFPSESLFFGSSLHGYARK GLDTGHVMVGTAL+DMYAKC+Q   ARKVFD LGMKNSV+WNTMLD Y RNGEIELAIDLFDEMPTRDAISWTALIN
Subjt:  GCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCSQLGPARKVFDSLGMKNSVSWNTMLDAYMRNGEIELAIDLFDEMPTRDAISWTALIN

Query:  GLVKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGTLTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSII
        GL+KQG+SEQALECFHQMQCSGIEPDYVSIIAVLAACADLG LTLGLWVNRFVMQQEFKDNIRISNSL+DMYSRCGCIEFARQVFEKMPKRTLVSWNSII
Subjt:  GLVKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGTLTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSII

Query:  VGFAVNGFADESLEFFDAMQKEGFKPDEVSYTGALTACSHAGLVNKGLELFDNMKRVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPVKPNEVVLG
        VGFAVNGFADESLEFFDAMQ EGFKPD VSYTGALTACSHAGLVNKGLELFDNMKR+HKITPRIEHYGCIVDLYGRAGRLEDALNVIE MP+KPNEVVLG
Subjt:  VGFAVNGFADESLEFFDAMQKEGFKPDEVSYTGALTACSHAGLVNKGLELFDNMKRVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPVKPNEVVLG

Query:  SLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLY
        SLLAACRT+GDV LAE+LMKHL KLDPRGDSNYVLLSNIYAAIG+WEGANKVRRTMKARGVQKKPG SS+EIDGKVHEFVAGDKYHA+AD IYSMLELL+
Subjt:  SLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLY

Query:  HELKICGYAPETDTFMNPKE
        HELKI GY P+T+  +N KE
Subjt:  HELKICGYAPETDTFMNPKE

TrEMBL top hitse value%identityAlignment
A0A0A0LYD6 Uncharacterized protein1.3e-26185.41Show/hide
Query:  MSSVPAQTAIPTQLQQYPNSPSPIPLSNSTKINFPRSANSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLS
        MSS+P+ TA P+QLQ  P +PS IPLSN TK+NFPRS NS +RNISSK   NS+DPIVLWTSS+ARYCRNGQL++AA EFTRMRLAGVEPNHITFITLLS
Subjt:  MSSVPAQTAIPTQLQQYPNSPSPIPLSNSTKINFPRSANSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLS

Query:  GCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCSQLGPARKVFDSLGMKNSVSWNTMLDAYMRNGEIELAIDLFDEMPTRDAISWTALIN
         CADFPSES FF SSLHGYA K+GLDTGHVMVGTALIDMY+KC+QLG ARKVF +LG+KNSVSWNTML+ +MRNGEIELAI LFDEMPTRDAISWTALIN
Subjt:  GCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCSQLGPARKVFDSLGMKNSVSWNTMLDAYMRNGEIELAIDLFDEMPTRDAISWTALIN

Query:  GLVKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGTLTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSII
        GL+K GYSEQALECFHQMQ SG+  DYVSIIAVLAACADLG LTLGLWV+RFVM QEFKDNI+ISNSLIDMYSRCGCIEFARQVF KM KRTLVSWNSII
Subjt:  GLVKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGTLTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSII

Query:  VGFAVNGFADESLEFFDAMQKEGFKPDEVSYTGALTACSHAGLVNKGLELFDNMKRVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPVKPNEVVLG
        VGFAVNGFADESLEFF AMQKEGFKPD VSYTGALTACSHAGLVNKGLELFDNMK VHKITPRIEHYGCIVDLYGRAGRLEDALN+IE MP+KPNEVVLG
Subjt:  VGFAVNGFADESLEFFDAMQKEGFKPDEVSYTGALTACSHAGLVNKGLELFDNMKRVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPVKPNEVVLG

Query:  SLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLY
        SLLAACRTHGDV LAERLMKHLFKLDP GD+ YVLLSNIYAAIGKW+GAN VRRTMKARGVQKKPG+SS+EIDGKVHEFVAGD YHA+AD IYSML+LL 
Subjt:  SLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLY

Query:  HELKICGYAPETDTFMNPKES
        HELK+CGY P +DT +N KES
Subjt:  HELKICGYAPETDTFMNPKES

A0A1S3C956 pentatricopeptide repeat-containing protein At1g05750, chloroplastic3.4e-25785.41Show/hide
Query:  MSSVPAQTAIPTQLQQYPNSPSPIPLSNSTKINFPRSANSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLS
        MSS+P+  A P+QLQQ P+  S IPLSN TK+NFPRS  S + NI SK T NS+ PIV WTSSIARYC NGQL +AA EFTRMRLAGVEPNHITFITLLS
Subjt:  MSSVPAQTAIPTQLQQYPNSPSPIPLSNSTKINFPRSANSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLS

Query:  GCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCSQLGPARKVFDSLGMKNSVSWNTMLDAYMRNGEIELAIDLFDEMPTRDAISWTALIN
        GCADFPSES FF SSLHGYA KFGLDTGHVMVGTALIDMY+KCSQLG A+KVFD LG+KNSVSWNTML+ +MRNGEIELAI LFDEMPTRDAISWTALIN
Subjt:  GCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCSQLGPARKVFDSLGMKNSVSWNTMLDAYMRNGEIELAIDLFDEMPTRDAISWTALIN

Query:  GLVKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGTLTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSII
        GL+K GYSEQALECFHQMQ SG+  DYVSIIAVLAACADLG LT GLWVNRFVMQQEFKDN+RISNSLIDMYSRCGCIEFARQVF KM KRTLVSWNSII
Subjt:  GLVKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGTLTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSII

Query:  VGFAVNGFADESLEFFDAMQKEGFKPDEVSYTGALTACSHAGLVNKGLELFDNMKRVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPVKPNEVVLG
        VGFA NGFADESLEFF AMQKEGFKPD VSYTGALTACSHAGLVNKGLELFDNMKRVHKITP IEHYGCIVDLYGRAGRLEDA NVIE MP+KPNEVVLG
Subjt:  VGFAVNGFADESLEFFDAMQKEGFKPDEVSYTGALTACSHAGLVNKGLELFDNMKRVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPVKPNEVVLG

Query:  SLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLY
        SLLAACRTHGDVRLAERLMKH+FKLD  GDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKK G+SS+EIDGKVHEFVAGDKYHA+AD IYSML+LL+
Subjt:  SLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLY

Query:  HELKICGYAPETDTFMNPKES
        HELK+CGY P+TD  +N K+S
Subjt:  HELKICGYAPETDTFMNPKES

A0A5A7UJB6 Pentatricopeptide repeat-containing protein3.4e-25785.41Show/hide
Query:  MSSVPAQTAIPTQLQQYPNSPSPIPLSNSTKINFPRSANSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLS
        MSS+P+  A P+QLQQ P+  S IPLSN TK+NFPRS  S + NI SK T NS+ PIV WTSSIARYC NGQL +AA EFTRMRLAGVEPNHITFITLLS
Subjt:  MSSVPAQTAIPTQLQQYPNSPSPIPLSNSTKINFPRSANSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLS

Query:  GCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCSQLGPARKVFDSLGMKNSVSWNTMLDAYMRNGEIELAIDLFDEMPTRDAISWTALIN
        GCADFPSES FF SSLHGYA KFGLDTGHVMVGTALIDMY+KCSQLG A+KVFD LG+KNSVSWNTML+ +MRNGEIELAI LFDEMPTRDAISWTALIN
Subjt:  GCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCSQLGPARKVFDSLGMKNSVSWNTMLDAYMRNGEIELAIDLFDEMPTRDAISWTALIN

Query:  GLVKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGTLTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSII
        GL+K GYSEQALECFHQMQ SG+  DYVSIIAVLAACADLG LT GLWVNRFVMQQEFKDN+RISNSLIDMYSRCGCIEFARQVF KM KRTLVSWNSII
Subjt:  GLVKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGTLTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSII

Query:  VGFAVNGFADESLEFFDAMQKEGFKPDEVSYTGALTACSHAGLVNKGLELFDNMKRVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPVKPNEVVLG
        VGFA NGFADESLEFF AMQKEGFKPD VSYTGALTACSHAGLVNKGLELFDNMKRVHKITP IEHYGCIVDLYGRAGRLEDA NVIE MP+KPNEVVLG
Subjt:  VGFAVNGFADESLEFFDAMQKEGFKPDEVSYTGALTACSHAGLVNKGLELFDNMKRVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPVKPNEVVLG

Query:  SLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLY
        SLLAACRTHGDVRLAERLMKH+FKLD  GDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKK G+SS+EIDGKVHEFVAGDKYHA+AD IYSML+LL+
Subjt:  SLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLY

Query:  HELKICGYAPETDTFMNPKES
        HELK+CGY P+TD  +N K+S
Subjt:  HELKICGYAPETDTFMNPKES

A0A6J1CN07 pentatricopeptide repeat-containing protein At1g05750, chloroplastic1.9e-27188.48Show/hide
Query:  MSSVPAQTAIPTQLQQYPNSPSPIPLSNSTKINFPRSANSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLS
        MSS+PA TA   QLQQYPN  + IPL N   INFPRS NSSNR+ISSKST NSIDPIVLWTSSIARYCRNGQLA+AA EFT MRLAGVEPNH+T ITLLS
Subjt:  MSSVPAQTAIPTQLQQYPNSPSPIPLSNSTKINFPRSANSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLS

Query:  GCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCSQLGPARKVFDSLGMKNSVSWNTMLDAYMRNGEIELAIDLFDEMPTRDAISWTALIN
        GCADFPSESL+FGSSLHGYARK GLDT HVMVGT+++DMYAKC+QLG AR+VFD L MKNSVSWNTMLD Y RNGEIELAIDLFDEMPTRDAISWTALIN
Subjt:  GCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCSQLGPARKVFDSLGMKNSVSWNTMLDAYMRNGEIELAIDLFDEMPTRDAISWTALIN

Query:  GLVKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGTLTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSII
        GL+KQGYSEQALECFHQMQCSGI+PDYVSIIAVLAACADLGTLTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCI FARQVFE+M KRTLVSWNSII
Subjt:  GLVKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGTLTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSII

Query:  VGFAVNGFADESLEFFDAMQKEGFKPDEVSYTGALTACSHAGLVNKGLELFDNMKRVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPVKPNEVVLG
        VG+A NGFADESLEFFDAMQKEGFKPD VSYTGALTACSHAGLVNKGLELFDNMKRVH+I PRIEHYGCIVDLYGRAGRLEDAL+VIE+MP+KPNEVVLG
Subjt:  VGFAVNGFADESLEFFDAMQKEGFKPDEVSYTGALTACSHAGLVNKGLELFDNMKRVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPVKPNEVVLG

Query:  SLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLY
        SLLAACRTHGDV LAERLMKHL KLDP GDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSS+EIDGKVHEFVAGDKYHA+AD IYSML+LL 
Subjt:  SLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLY

Query:  HELKICGYAPETDTFMNPKES
        HELKICG  PET+TF+N KES
Subjt:  HELKICGYAPETDTFMNPKES

A0A6J1HU31 pentatricopeptide repeat-containing protein At1g05750, chloroplastic8.5e-25683.94Show/hide
Query:  MSSVPAQTAIP--TQLQQYPNSPSPIPLSNSTKINFPRSANSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITL
        MSSVP+ T IP   QLQQY N PSPIP SN + ++FPR+ NSS          N I PIVLWTSSIARYCRN QLA+AA EFTRMRLAGVEPNHITFITL
Subjt:  MSSVPAQTAIP--TQLQQYPNSPSPIPLSNSTKINFPRSANSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITL

Query:  LSGCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCSQLGPARKVFDSLGMKNSVSWNTMLDAYMRNGEIELAIDLFDEMPTRDAISWTAL
        LSGCADFPS SL FG+SLHGY RK GLDTGHVMVGTALI MYAKC+QLG AR VFD L MKNSV+WNTMLD YMRNGEIELAI+LFDEMPTRDAISWTAL
Subjt:  LSGCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCSQLGPARKVFDSLGMKNSVSWNTMLDAYMRNGEIELAIDLFDEMPTRDAISWTAL

Query:  INGLVKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGTLTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNS
        ING +KQGYSEQALECFH+MQCSGIEPDYVSIIAVLAACADLG L+ GLWVNRF+MQQEFKDNIRISNSLIDMYSRCGCIEFARQVF+KM K TLVSWNS
Subjt:  INGLVKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGTLTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNS

Query:  IIVGFAVNGFADESLEFFDAMQKEGFKPDEVSYTGALTACSHAGLVNKGLELFDNMKRVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPVKPNEVV
        +IVGFA+NGFADESLEFFDAMQKEGF  D VSYTGALTACSHAGLVNKGLELFDNMKRVH+ITPRIEHYGCIVDLY RAGRL++ALNVIE MP+KPNEVV
Subjt:  IIVGFAVNGFADESLEFFDAMQKEGFKPDEVSYTGALTACSHAGLVNKGLELFDNMKRVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPVKPNEVV

Query:  LGSLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLEL
        LGSLLAACRTHGDV LAERL+K+LF+LDP GDS+YVLLSNIYAA+G+WEGANKVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYH +AD IYSMLE+
Subjt:  LGSLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLEL

Query:  LYHELKICGYAPETDTFMNPKES
        L+HELKI GY PET TFMN  ES
Subjt:  LYHELKICGYAPETDTFMNPKES

SwissProt top hitse value%identityAlignment
O23337 Pentatricopeptide repeat-containing protein At4g148202.8e-9938.07Show/hide
Query:  IVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLSGCADFPSESLFFGSSLHGY--ARKFGLDTGHVMVGTALIDMYAKCSQLGPARKVFD
        +V W + I RYCR G + +A   F  M+ + V P+ +    ++S C    + ++ +  +++ +       +DT H++  TAL+ MYA    +  AR+ F 
Subjt:  IVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLSGCADFPSESLFFGSSLHGY--ARKFGLDTGHVMVGTALIDMYAKCSQLGPARKVFD

Query:  SLGMKNSVSWNTMLDAYMRNGEIELAIDLFDEMPTRDAISWTALINGLVKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGTLTLGLWVNRFVM
         + ++N      M+  Y + G ++ A  +FD+   +D + WT +I+  V+  Y ++AL  F +M CSGI+PD VS+ +V++ACA+LG L    WV+  + 
Subjt:  SLGMKNSVSWNTMLDAYMRNGEIELAIDLFDEMPTRDAISWTALINGLVKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGTLTLGLWVNRFVM

Query:  QQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDEVSYTGALTACSHAGLVNKGLELFDNM
            +  + I+N+LI+MY++CG ++  R VFEKMP+R +VSW+S+I   +++G A ++L  F  M++E  +P+EV++ G L  CSH+GLV +G ++F +M
Subjt:  QQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDEVSYTGALTACSHAGLVNKGLELFDNM

Query:  KRVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPVKPNEVVLGSLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGANKVRR
           + ITP++EHYGC+VDL+GRA  L +AL VIE MPV  N V+ GSL++ACR HG++ L +   K + +L+P  D   VL+SNIYA   +WE    +RR
Subjt:  KRVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPVKPNEVVLGSLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGANKVRR

Query:  TMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLYHELKICGYAPE
         M+ + V K+ G S I+ +GK HEF+ GDK H  ++ IY+ L+ +  +LK+ GY P+
Subjt:  TMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLYHELKICGYAPE

O82380 Pentatricopeptide repeat-containing protein At2g29760, chloroplastic2.6e-10037.28Show/hide
Query:  IVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLSGCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCSQLGPARKVFDSL
        +V W S I  + + G    A   F +M    V+ +H+T + +LS CA     +L FG  +  Y  +  ++  ++ +  A++DMY KC  +  A+++FD++
Subjt:  IVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLSGCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCSQLGPARKVFDSL

Query:  GMKNSVSWNTMLDAYMRNGEIELAIDLFDEMPTRDAISWTALINGLVKQGYSEQALECFHQMQC-SGIEPDYVSIIAVLAACADLGTLTLGLWVNRFVMQ
          K++V+W TMLD Y  + + E A ++ + MP +D ++W ALI+   + G   +AL  FH++Q    ++ + +++++ L+ACA +G L LG W++ ++ +
Subjt:  GMKNSVSWNTMLDAYMRNGEIELAIDLFDEMPTRDAISWTALINGLVKQGYSEQALECFHQMQC-SGIEPDYVSIIAVLAACADLGTLTLGLWVNRFVMQ

Query:  QEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDEVSYTGALTACSHAGLVNKGLELFDNMK
           + N  ++++LI MYS+CG +E +R+VF  + KR +  W+++I G A++G  +E+++ F  MQ+   KP+ V++T    ACSH GLV++   LF  M+
Subjt:  QEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDEVSYTGALTACSHAGLVNKGLELFDNMK

Query:  RVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPVKPNEVVLGSLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGANKVRRT
          + I P  +HY CIVD+ GR+G LE A+  IE MP+ P+  V G+LL AC+ H ++ LAE     L +L+PR D  +VLLSNIYA +GKWE  +++R+ 
Subjt:  RVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPVKPNEVVLGSLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGANKVRRT

Query:  MKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLYHELKICGYAPE
        M+  G++K+PG SSIEIDG +HEF++GD  H  ++ +Y  L  +  +LK  GY PE
Subjt:  MKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLYHELKICGYAPE

Q9LUJ2 Pentatricopeptide repeat-containing protein At3g226902.3e-10138.77Show/hide
Query:  LWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLSGCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCSQLGPARKVFDSLGM
        L  +  + Y R G   +A   F  M  +GV P+ I+ ++ +S C+     ++ +G S HGY  + G ++    +  ALIDMY KC +   A ++FD +  
Subjt:  LWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLSGCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCSQLGPARKVFDSLGM

Query:  KNSVSWNTMLDAYMRNGEIELAIDLFDEMPTRDAISWTALINGLVKQGYSEQALECFHQMQC-SGIEPDYVSIIAVLAACADLGTLTLGLWVNRFVMQQE
        K  V+WN+++  Y+ NGE++ A + F+ MP ++ +SW  +I+GLV+    E+A+E F  MQ   G+  D V+++++ +AC  LG L L  W+  ++ +  
Subjt:  KNSVSWNTMLDAYMRNGEIELAIDLFDEMPTRDAISWTALINGLVKQGYSEQALECFHQMQC-SGIEPDYVSIIAVLAACADLGTLTLGLWVNRFVMQQE

Query:  FKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDEVSYTGALTACSHAGLVNKGLELFDNMKRV
         + ++R+  +L+DM+SRCG  E A  +F  +  R + +W + I   A+ G A+ ++E FD M ++G KPD V++ GALTACSH GLV +G E+F +M ++
Subjt:  FKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDEVSYTGALTACSHAGLVNKGLELFDNMKRV

Query:  HKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPVKPNEVVLGSLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGANKVRRTMK
        H ++P   HYGC+VDL GRAG LE+A+ +IE MP++PN+V+  SLLAACR  G+V +A    + +  L P    +YVLLSN+YA+ G+W    KVR +MK
Subjt:  HKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPVKPNEVVLGSLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGANKVRRTMK

Query:  ARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLYHELKICGYAPE
         +G++K PG SSI+I GK HEF +GD+ H     I +ML+ +       G+ P+
Subjt:  ARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLYHELKICGYAPE

Q9MA50 Pentatricopeptide repeat-containing protein At1g05750, chloroplastic1.7e-16860.63Show/hide
Query:  NSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLSGCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALID
        N +N  I  +  +++ +  V WTS I    RNG+LA+AA EF+ M LAGVEPNHITFI LLSGC DF S S   G  LHGYA K GLD  HVMVGTA+I 
Subjt:  NSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLSGCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALID

Query:  MYAKCSQLGPARKVFDSLGMKNSVSWNTMLDAYMRNGEIELAIDLFDEMPTRDAISWTALINGLVKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACA
        MY+K  +   AR VFD +  KNSV+WNTM+D YMR+G+++ A  +FD+MP RD ISWTA+ING VK+GY E+AL  F +MQ SG++PDYV+IIA L AC 
Subjt:  MYAKCSQLGPARKVFDSLGMKNSVSWNTMLDAYMRNGEIELAIDLFDEMPTRDAISWTALINGLVKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACA

Query:  DLGTLTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDEVSYTGALTAC
        +LG L+ GLWV+R+V+ Q+FK+N+R+SNSLID+Y RCGC+EFARQVF  M KRT+VSWNS+IVGFA NG A ESL +F  MQ++GFKPD V++TGALTAC
Subjt:  DLGTLTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDEVSYTGALTAC

Query:  SHAGLVNKGLELFDNMKRVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPVKPNEVVLGSLLAACRTHG-DVRLAERLMKHLFKLDPRGDSNYVLLS
        SH GLV +GL  F  MK  ++I+PRIEHYGC+VDLY RAGRLEDAL +++ MP+KPNEVV+GSLLAAC  HG ++ LAERLMKHL  L+ +  SNYV+LS
Subjt:  SHAGLVNKGLELFDNMKRVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPVKPNEVVLGSLLAACRTHG-DVRLAERLMKHLFKLDPRGDSNYVLLS

Query:  NIYAAIGKWEGANKVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLYHELKICGYAPET
        N+YAA GKWEGA+K+RR MK  G++K+PGFSSIEID  +H F+AGD  H     I  +LEL+  +L++ G   ET
Subjt:  NIYAAIGKWEGANKVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLYHELKICGYAPET

Q9SZT8 Pentatricopeptide repeat-containing protein ELI1, chloroplastic8.0e-10238.64Show/hide
Query:  SIDP-IVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLSGCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCSQLGPARK
        +IDP + L+T++I     NG    A   + ++  + + PN  TF +LL  C      S   G  +H +  KFGL      V T L+D+YAK   +  A+K
Subjt:  SIDP-IVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLSGCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCSQLGPARK

Query:  VFDSLGMKNSVSWNTMLDAYMRNGEIELAIDLFDEMPTRDAISWTALINGLVKQGYSEQALECFHQMQCSG-IEPDYVSIIAVLAACADLGTLTLGLWVN
        VFD +  ++ VS   M+  Y + G +E A  LFD M  RD +SW  +I+G  + G+   AL  F ++   G  +PD ++++A L+AC+ +G L  G W++
Subjt:  VFDSLGMKNSVSWNTMLDAYMRNGEIELAIDLFDEMPTRDAISWTALINGLVKQGYSEQALECFHQMQCSG-IEPDYVSIIAVLAACADLGTLTLGLWVN

Query:  RFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQK-EGFKPDEVSYTGALTACSHAGLVNKGLE
         FV     + N+++   LIDMYS+CG +E A  VF   P++ +V+WN++I G+A++G++ ++L  F+ MQ   G +P ++++ G L AC+HAGLVN+G+ 
Subjt:  RFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQK-EGFKPDEVSYTGALTACSHAGLVNKGLE

Query:  LFDNMKRVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPVKPNEVVLGSLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGA
        +F++M + + I P+IEHYGC+V L GRAG+L+ A   I+ M +  + V+  S+L +C+ HGD  L + + ++L  L+ +    YVLLSNIYA++G +EG 
Subjt:  LFDNMKRVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPVKPNEVVLGSLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGA

Query:  NKVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLYHELKICGYAPETDTFMNPKE
         KVR  MK +G+ K+PG S+IEI+ KVHEF AGD+ H+ +  IY+ML  +   +K  GY P T+T +   E
Subjt:  NKVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLYHELKICGYAPETDTFMNPKE

Arabidopsis top hitse value%identityAlignment
AT1G05750.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.2e-16960.63Show/hide
Query:  NSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLSGCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALID
        N +N  I  +  +++ +  V WTS I    RNG+LA+AA EF+ M LAGVEPNHITFI LLSGC DF S S   G  LHGYA K GLD  HVMVGTA+I 
Subjt:  NSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLSGCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALID

Query:  MYAKCSQLGPARKVFDSLGMKNSVSWNTMLDAYMRNGEIELAIDLFDEMPTRDAISWTALINGLVKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACA
        MY+K  +   AR VFD +  KNSV+WNTM+D YMR+G+++ A  +FD+MP RD ISWTA+ING VK+GY E+AL  F +MQ SG++PDYV+IIA L AC 
Subjt:  MYAKCSQLGPARKVFDSLGMKNSVSWNTMLDAYMRNGEIELAIDLFDEMPTRDAISWTALINGLVKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACA

Query:  DLGTLTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDEVSYTGALTAC
        +LG L+ GLWV+R+V+ Q+FK+N+R+SNSLID+Y RCGC+EFARQVF  M KRT+VSWNS+IVGFA NG A ESL +F  MQ++GFKPD V++TGALTAC
Subjt:  DLGTLTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDEVSYTGALTAC

Query:  SHAGLVNKGLELFDNMKRVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPVKPNEVVLGSLLAACRTHG-DVRLAERLMKHLFKLDPRGDSNYVLLS
        SH GLV +GL  F  MK  ++I+PRIEHYGC+VDLY RAGRLEDAL +++ MP+KPNEVV+GSLLAAC  HG ++ LAERLMKHL  L+ +  SNYV+LS
Subjt:  SHAGLVNKGLELFDNMKRVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPVKPNEVVLGSLLAACRTHG-DVRLAERLMKHLFKLDPRGDSNYVLLS

Query:  NIYAAIGKWEGANKVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLYHELKICGYAPET
        N+YAA GKWEGA+K+RR MK  G++K+PGFSSIEID  +H F+AGD  H     I  +LEL+  +L++ G   ET
Subjt:  NIYAAIGKWEGANKVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLYHELKICGYAPET

AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.8e-10137.28Show/hide
Query:  IVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLSGCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCSQLGPARKVFDSL
        +V W S I  + + G    A   F +M    V+ +H+T + +LS CA     +L FG  +  Y  +  ++  ++ +  A++DMY KC  +  A+++FD++
Subjt:  IVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLSGCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCSQLGPARKVFDSL

Query:  GMKNSVSWNTMLDAYMRNGEIELAIDLFDEMPTRDAISWTALINGLVKQGYSEQALECFHQMQC-SGIEPDYVSIIAVLAACADLGTLTLGLWVNRFVMQ
          K++V+W TMLD Y  + + E A ++ + MP +D ++W ALI+   + G   +AL  FH++Q    ++ + +++++ L+ACA +G L LG W++ ++ +
Subjt:  GMKNSVSWNTMLDAYMRNGEIELAIDLFDEMPTRDAISWTALINGLVKQGYSEQALECFHQMQC-SGIEPDYVSIIAVLAACADLGTLTLGLWVNRFVMQ

Query:  QEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDEVSYTGALTACSHAGLVNKGLELFDNMK
           + N  ++++LI MYS+CG +E +R+VF  + KR +  W+++I G A++G  +E+++ F  MQ+   KP+ V++T    ACSH GLV++   LF  M+
Subjt:  QEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDEVSYTGALTACSHAGLVNKGLELFDNMK

Query:  RVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPVKPNEVVLGSLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGANKVRRT
          + I P  +HY CIVD+ GR+G LE A+  IE MP+ P+  V G+LL AC+ H ++ LAE     L +L+PR D  +VLLSNIYA +GKWE  +++R+ 
Subjt:  RVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPVKPNEVVLGSLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGANKVRRT

Query:  MKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLYHELKICGYAPE
        M+  G++K+PG SSIEIDG +HEF++GD  H  ++ +Y  L  +  +LK  GY PE
Subjt:  MKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLYHELKICGYAPE

AT3G22690.1 CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012881), Pentatricopeptide repeat (InterPro:IPR002885)1.6e-10238.77Show/hide
Query:  LWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLSGCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCSQLGPARKVFDSLGM
        L  +  + Y R G   +A   F  M  +GV P+ I+ ++ +S C+     ++ +G S HGY  + G ++    +  ALIDMY KC +   A ++FD +  
Subjt:  LWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLSGCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCSQLGPARKVFDSLGM

Query:  KNSVSWNTMLDAYMRNGEIELAIDLFDEMPTRDAISWTALINGLVKQGYSEQALECFHQMQC-SGIEPDYVSIIAVLAACADLGTLTLGLWVNRFVMQQE
        K  V+WN+++  Y+ NGE++ A + F+ MP ++ +SW  +I+GLV+    E+A+E F  MQ   G+  D V+++++ +AC  LG L L  W+  ++ +  
Subjt:  KNSVSWNTMLDAYMRNGEIELAIDLFDEMPTRDAISWTALINGLVKQGYSEQALECFHQMQC-SGIEPDYVSIIAVLAACADLGTLTLGLWVNRFVMQQE

Query:  FKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDEVSYTGALTACSHAGLVNKGLELFDNMKRV
         + ++R+  +L+DM+SRCG  E A  +F  +  R + +W + I   A+ G A+ ++E FD M ++G KPD V++ GALTACSH GLV +G E+F +M ++
Subjt:  FKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDEVSYTGALTACSHAGLVNKGLELFDNMKRV

Query:  HKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPVKPNEVVLGSLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGANKVRRTMK
        H ++P   HYGC+VDL GRAG LE+A+ +IE MP++PN+V+  SLLAACR  G+V +A    + +  L P    +YVLLSN+YA+ G+W    KVR +MK
Subjt:  HKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPVKPNEVVLGSLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGANKVRRTMK

Query:  ARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLYHELKICGYAPE
         +G++K PG SSI+I GK HEF +GD+ H     I +ML+ +       G+ P+
Subjt:  ARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLYHELKICGYAPE

AT3G22690.2 INVOLVED IN: photosystem II assembly, regulation of chlorophyll biosynthetic process, photosystem I assembly, thylakoid membrane organization, RNA modification1.6e-10238.77Show/hide
Query:  LWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLSGCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCSQLGPARKVFDSLGM
        L  +  + Y R G   +A   F  M  +GV P+ I+ ++ +S C+     ++ +G S HGY  + G ++    +  ALIDMY KC +   A ++FD +  
Subjt:  LWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLSGCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCSQLGPARKVFDSLGM

Query:  KNSVSWNTMLDAYMRNGEIELAIDLFDEMPTRDAISWTALINGLVKQGYSEQALECFHQMQC-SGIEPDYVSIIAVLAACADLGTLTLGLWVNRFVMQQE
        K  V+WN+++  Y+ NGE++ A + F+ MP ++ +SW  +I+GLV+    E+A+E F  MQ   G+  D V+++++ +AC  LG L L  W+  ++ +  
Subjt:  KNSVSWNTMLDAYMRNGEIELAIDLFDEMPTRDAISWTALINGLVKQGYSEQALECFHQMQC-SGIEPDYVSIIAVLAACADLGTLTLGLWVNRFVMQQE

Query:  FKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDEVSYTGALTACSHAGLVNKGLELFDNMKRV
         + ++R+  +L+DM+SRCG  E A  +F  +  R + +W + I   A+ G A+ ++E FD M ++G KPD V++ GALTACSH GLV +G E+F +M ++
Subjt:  FKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDEVSYTGALTACSHAGLVNKGLELFDNMKRV

Query:  HKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPVKPNEVVLGSLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGANKVRRTMK
        H ++P   HYGC+VDL GRAG LE+A+ +IE MP++PN+V+  SLLAACR  G+V +A    + +  L P    +YVLLSN+YA+ G+W    KVR +MK
Subjt:  HKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPVKPNEVVLGSLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGANKVRRTMK

Query:  ARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLYHELKICGYAPE
         +G++K PG SSI+I GK HEF +GD+ H     I +ML+ +       G+ P+
Subjt:  ARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLYHELKICGYAPE

AT4G37380.1 Tetratricopeptide repeat (TPR)-like superfamily protein5.7e-10338.64Show/hide
Query:  SIDP-IVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLSGCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCSQLGPARK
        +IDP + L+T++I     NG    A   + ++  + + PN  TF +LL  C      S   G  +H +  KFGL      V T L+D+YAK   +  A+K
Subjt:  SIDP-IVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLSGCADFPSESLFFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCSQLGPARK

Query:  VFDSLGMKNSVSWNTMLDAYMRNGEIELAIDLFDEMPTRDAISWTALINGLVKQGYSEQALECFHQMQCSG-IEPDYVSIIAVLAACADLGTLTLGLWVN
        VFD +  ++ VS   M+  Y + G +E A  LFD M  RD +SW  +I+G  + G+   AL  F ++   G  +PD ++++A L+AC+ +G L  G W++
Subjt:  VFDSLGMKNSVSWNTMLDAYMRNGEIELAIDLFDEMPTRDAISWTALINGLVKQGYSEQALECFHQMQCSG-IEPDYVSIIAVLAACADLGTLTLGLWVN

Query:  RFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQK-EGFKPDEVSYTGALTACSHAGLVNKGLE
         FV     + N+++   LIDMYS+CG +E A  VF   P++ +V+WN++I G+A++G++ ++L  F+ MQ   G +P ++++ G L AC+HAGLVN+G+ 
Subjt:  RFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQK-EGFKPDEVSYTGALTACSHAGLVNKGLE

Query:  LFDNMKRVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPVKPNEVVLGSLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGA
        +F++M + + I P+IEHYGC+V L GRAG+L+ A   I+ M +  + V+  S+L +C+ HGD  L + + ++L  L+ +    YVLLSNIYA++G +EG 
Subjt:  LFDNMKRVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPVKPNEVVLGSLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIYAAIGKWEGA

Query:  NKVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLYHELKICGYAPETDTFMNPKE
         KVR  MK +G+ K+PG S+IEI+ KVHEF AGD+ H+ +  IY+ML  +   +K  GY P T+T +   E
Subjt:  NKVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLYHELKICGYAPETDTFMNPKE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCAGCGTTCCTGCGCAGACAGCCATTCCAACCCAACTCCAACAATATCCTAATTCTCCTTCTCCAATCCCACTTTCGAACTCAACAAAAATCAACTTCCCTCGCTC
TGCCAATTCCTCAAATCGCAATATCTCCTCCAAATCCACTCGCAATTCTATCGATCCCATTGTTCTATGGACCTCTTCTATTGCTCGCTACTGCCGCAACGGGCAATTAG
CCGATGCCGCCACAGAGTTTACGAGGATGAGACTCGCCGGAGTTGAGCCGAACCACATCACATTCATCACGCTTCTCTCCGGCTGTGCTGATTTTCCGTCAGAAAGCCTC
TTCTTCGGCTCTTCGCTTCATGGTTACGCCCGAAAATTTGGATTGGATACAGGGCATGTAATGGTGGGGACTGCTCTGATTGATATGTATGCCAAATGTTCTCAATTGGG
TCCTGCTAGGAAGGTTTTTGATTCCCTGGGCATGAAAAACTCTGTCTCTTGGAACACGATGCTCGATGCTTACATGAGGAATGGGGAGATTGAGTTGGCCATTGACTTGT
TTGATGAAATGCCTACAAGAGATGCGATTTCTTGGACGGCTTTGATTAATGGTCTTGTGAAACAGGGATACTCCGAACAAGCATTGGAGTGCTTCCATCAGATGCAATGC
TCGGGTATTGAGCCTGATTATGTGTCTATAATTGCTGTTCTTGCTGCGTGTGCTGATTTGGGCACCCTTACTTTGGGGTTATGGGTTAATCGGTTCGTTATGCAGCAGGA
GTTTAAGGATAACATTAGGATAAGTAATTCCTTGATAGATATGTATTCTCGATGTGGATGTATTGAGTTTGCCCGCCAAGTGTTTGAGAAAATGCCCAAGCGAACTTTGG
TGTCTTGGAACTCCATTATTGTGGGGTTTGCTGTTAATGGGTTTGCAGATGAATCTCTGGAGTTTTTTGATGCAATGCAGAAGGAAGGATTCAAGCCAGATGAAGTTAGC
TACACGGGAGCTCTTACTGCGTGTAGCCATGCTGGCTTAGTGAATAAGGGGCTCGAATTGTTTGATAACATGAAGAGGGTACATAAAATTACTCCAAGGATTGAGCATTA
TGGGTGCATTGTCGACCTCTATGGCCGTGCAGGGAGGTTGGAGGATGCGTTGAACGTGATCGAGAGAATGCCGGTGAAACCGAATGAAGTTGTACTGGGGTCGCTGCTGG
CTGCCTGCAGAACTCATGGTGATGTGAGACTGGCTGAAAGGTTAATGAAACATCTCTTTAAGTTGGACCCTAGAGGCGATTCAAATTACGTGCTTCTTTCGAACATATAT
GCAGCGATTGGGAAGTGGGAAGGTGCTAATAAGGTGAGGAGAACAATGAAGGCTCGAGGTGTGCAGAAAAAGCCAGGGTTTAGTTCAATTGAGATTGATGGTAAGGTTCA
TGAGTTTGTTGCTGGTGATAAATACCATGCTAATGCAGACTGTATTTATTCAATGTTAGAGCTATTGTATCATGAACTAAAGATATGTGGCTATGCTCCTGAAACTGATA
CTTTTATGAATCCTAAAGAATCTTTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGCAGCGTTCCTGCGCAGACAGCCATTCCAACCCAACTCCAACAATATCCTAATTCTCCTTCTCCAATCCCACTTTCGAACTCAACAAAAATCAACTTCCCTCGCTC
TGCCAATTCCTCAAATCGCAATATCTCCTCCAAATCCACTCGCAATTCTATCGATCCCATTGTTCTATGGACCTCTTCTATTGCTCGCTACTGCCGCAACGGGCAATTAG
CCGATGCCGCCACAGAGTTTACGAGGATGAGACTCGCCGGAGTTGAGCCGAACCACATCACATTCATCACGCTTCTCTCCGGCTGTGCTGATTTTCCGTCAGAAAGCCTC
TTCTTCGGCTCTTCGCTTCATGGTTACGCCCGAAAATTTGGATTGGATACAGGGCATGTAATGGTGGGGACTGCTCTGATTGATATGTATGCCAAATGTTCTCAATTGGG
TCCTGCTAGGAAGGTTTTTGATTCCCTGGGCATGAAAAACTCTGTCTCTTGGAACACGATGCTCGATGCTTACATGAGGAATGGGGAGATTGAGTTGGCCATTGACTTGT
TTGATGAAATGCCTACAAGAGATGCGATTTCTTGGACGGCTTTGATTAATGGTCTTGTGAAACAGGGATACTCCGAACAAGCATTGGAGTGCTTCCATCAGATGCAATGC
TCGGGTATTGAGCCTGATTATGTGTCTATAATTGCTGTTCTTGCTGCGTGTGCTGATTTGGGCACCCTTACTTTGGGGTTATGGGTTAATCGGTTCGTTATGCAGCAGGA
GTTTAAGGATAACATTAGGATAAGTAATTCCTTGATAGATATGTATTCTCGATGTGGATGTATTGAGTTTGCCCGCCAAGTGTTTGAGAAAATGCCCAAGCGAACTTTGG
TGTCTTGGAACTCCATTATTGTGGGGTTTGCTGTTAATGGGTTTGCAGATGAATCTCTGGAGTTTTTTGATGCAATGCAGAAGGAAGGATTCAAGCCAGATGAAGTTAGC
TACACGGGAGCTCTTACTGCGTGTAGCCATGCTGGCTTAGTGAATAAGGGGCTCGAATTGTTTGATAACATGAAGAGGGTACATAAAATTACTCCAAGGATTGAGCATTA
TGGGTGCATTGTCGACCTCTATGGCCGTGCAGGGAGGTTGGAGGATGCGTTGAACGTGATCGAGAGAATGCCGGTGAAACCGAATGAAGTTGTACTGGGGTCGCTGCTGG
CTGCCTGCAGAACTCATGGTGATGTGAGACTGGCTGAAAGGTTAATGAAACATCTCTTTAAGTTGGACCCTAGAGGCGATTCAAATTACGTGCTTCTTTCGAACATATAT
GCAGCGATTGGGAAGTGGGAAGGTGCTAATAAGGTGAGGAGAACAATGAAGGCTCGAGGTGTGCAGAAAAAGCCAGGGTTTAGTTCAATTGAGATTGATGGTAAGGTTCA
TGAGTTTGTTGCTGGTGATAAATACCATGCTAATGCAGACTGTATTTATTCAATGTTAGAGCTATTGTATCATGAACTAAAGATATGTGGCTATGCTCCTGAAACTGATA
CTTTTATGAATCCTAAAGAATCTTTGTAA
Protein sequenceShow/hide protein sequence
MSSVPAQTAIPTQLQQYPNSPSPIPLSNSTKINFPRSANSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLADAATEFTRMRLAGVEPNHITFITLLSGCADFPSESL
FFGSSLHGYARKFGLDTGHVMVGTALIDMYAKCSQLGPARKVFDSLGMKNSVSWNTMLDAYMRNGEIELAIDLFDEMPTRDAISWTALINGLVKQGYSEQALECFHQMQC
SGIEPDYVSIIAVLAACADLGTLTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFEKMPKRTLVSWNSIIVGFAVNGFADESLEFFDAMQKEGFKPDEVS
YTGALTACSHAGLVNKGLELFDNMKRVHKITPRIEHYGCIVDLYGRAGRLEDALNVIERMPVKPNEVVLGSLLAACRTHGDVRLAERLMKHLFKLDPRGDSNYVLLSNIY
AAIGKWEGANKVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHANADCIYSMLELLYHELKICGYAPETDTFMNPKESL