; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0004444 (gene) of Chayote v1 genome

Gene IDSed0004444
OrganismSechium edule (Chayote v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationLG11:27691635..27693312
RNA-Seq ExpressionSed0004444
SyntenySed0004444
Gene Ontology termsGO:0009451 - RNA modification (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7012718.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma]6.1e-24580.39Show/hide
Query:  MNSISAHTAIPTRFR-PDSNPTSPTKIIFPRSANSSNHKFTANSIDPIVVWTSSIARHCRNGQLAEAAAELTRMRLAGVEPNHITFITLISGCADFPSRS
        M+S+ AHTAIP +F+    + ++P+ + FPR  NSS      N I PIV+WTSSIAR+CRN QLAEAAAE TRMRLAGVEPNHITFITL+SGCADFPS S
Subjt:  MNSISAHTAIPTRFR-PDSNPTSPTKIIFPRSANSSNHKFTANSIDPIVVWTSSIARHCRNGQLAEAAAELTRMRLAGVEPNHITFITLISGCADFPSRS

Query:  LYFGSSLHGYARKFGLDTGHVMVGTALIGMYSKFGQLGLGRKVFDYLGMKNSVTWNTMLDGYMRNGEVEFALNLFDEMPTRDAISWTALINGLLKQGYSE
        L+FG+SLHGY RK GLDTGHVMVGTALI MY+K  QLGL R VFDYL MKNSVTWNTMLDGYMRNGE+E A+ LFDEMPTRDAISWTALING LKQGYSE
Subjt:  LYFGSSLHGYARKFGLDTGHVMVGTALIGMYSKFGQLGLGRKVFDYLGMKNSVTWNTMLDGYMRNGEVEFALNLFDEMPTRDAISWTALINGLLKQGYSE

Query:  QALECFHLMQCSGVEPDYVSVIAVLAACADLGTLTLGLWVNRFVMQQEFKGNIRVNNSLIDMYSRCGCIGFARQVFEKMPNRTLVSWNSIIVGFAVNSYA
        QAL+CFH MQCSG+EPDYVS+IAVLAACADLG L+ GLWVNRF+MQQEFK NIR++NSLIDMYSRCGCI F+RQVF+KMP RTLVSWNS+IVGFA+N +A
Subjt:  QALECFHLMQCSGVEPDYVSVIAVLAACADLGTLTLGLWVNRFVMQQEFKGNIRVNNSLIDMYSRCGCIGFARQVFEKMPNRTLVSWNSIIVGFAVNSYA

Query:  EESLEFFDAMQREGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRMHRITPRIEHYGCIVDLYGRAGRLEEALSVIEKMPLKPNEVVLGSLLAACRTH
        +ESLEFFDAMQ+EGFK DGVSYTGALTACSHAGLVNKGLELFDNMKR+HRITPRIEHYGCIVDLY RAGRL+EAL+VIE MP+KPNEVVLGSLLAACRTH
Subjt:  EESLEFFDAMQREGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRMHRITPRIEHYGCIVDLYGRAGRLEEALSVIEKMPLKPNEVVLGSLLAACRTH

Query:  GDVSLAERLMKHLFKLDPGGDSNYVLLSNIYASIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVSGDKYHDDADNIYSMLGLLFHELEICGYV
        GDVSLAERL+K+LF+LDPGGDS+YVLLSNIYA++G+WEGAN VRRTMKARGVQKKPGFSS+EIDGKVHEFV+GDKYH DADNIYSML +LFHEL+ICGYV
Subjt:  GDVSLAERLMKHLFKLDPGGDSNYVLLSNIYASIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVSGDKYHDDADNIYSMLGLLFHELEICGYV

Query:  PEK------DESSKD
        PE       +ESSK+
Subjt:  PEK------DESSKD

XP_022142716.1 pentatricopeptide repeat-containing protein At1g05750, chloroplastic [Momordica charantia]4.1e-24981.68Show/hide
Query:  MNSISAHTAIPTRFRPDSNPT------SPTKIIFPRSANSSNH----KFTANSIDPIVVWTSSIARHCRNGQLAEAAAELTRMRLAGVEPNHITFITLIS
        M+SI A+TA   + +   NP       +P  I FPRS NSSN     K T NSIDPIV+WTSSIAR+CRNGQLAEAAAE T MRLAGVEPNH+T ITL+S
Subjt:  MNSISAHTAIPTRFRPDSNPT------SPTKIIFPRSANSSNH----KFTANSIDPIVVWTSSIARHCRNGQLAEAAAELTRMRLAGVEPNHITFITLIS

Query:  GCADFPSRSLYFGSSLHGYARKFGLDTGHVMVGTALIGMYSKFGQLGLGRKVFDYLGMKNSVTWNTMLDGYMRNGEVEFALNLFDEMPTRDAISWTALIN
        GCADFPS SLYFGSSLHGYARK GLDT HVMVGT+++ MY+K  QLGL R+VFDYL MKNSV+WNTMLDGY RNGE+E A++LFDEMPTRDAISWTALIN
Subjt:  GCADFPSRSLYFGSSLHGYARKFGLDTGHVMVGTALIGMYSKFGQLGLGRKVFDYLGMKNSVTWNTMLDGYMRNGEVEFALNLFDEMPTRDAISWTALIN

Query:  GLLKQGYSEQALECFHLMQCSGVEPDYVSVIAVLAACADLGTLTLGLWVNRFVMQQEFKGNIRVNNSLIDMYSRCGCIGFARQVFEKMPNRTLVSWNSII
        GLLKQGYSEQALECFH MQCSG++PDYVS+IAVLAACADLGTLTLGLWVNRFVMQQEFK NIR++NSLIDMYSRCGCIGFARQVFE+M  RTLVSWNSII
Subjt:  GLLKQGYSEQALECFHLMQCSGVEPDYVSVIAVLAACADLGTLTLGLWVNRFVMQQEFKGNIRVNNSLIDMYSRCGCIGFARQVFEKMPNRTLVSWNSII

Query:  VGFAVNSYAEESLEFFDAMQREGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRMHRITPRIEHYGCIVDLYGRAGRLEEALSVIEKMPLKPNEVVLG
        VG+A N +A+ESLEFFDAMQ+EGFKPD VSYTGALTACSHAGLVNKGLELFDNMKR+HRI PRIEHYGCIVDLYGRAGRLE+AL VIEKMP+KPNEVVLG
Subjt:  VGFAVNSYAEESLEFFDAMQREGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRMHRITPRIEHYGCIVDLYGRAGRLEEALSVIEKMPLKPNEVVLG

Query:  SLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYASIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVSGDKYHDDADNIYSMLGLLF
        SLLAACRTHGDVSLAERLMKHL KLDPGGDSNYVLLSNIYA+IGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFV+GDKYH DAD+IYSML LL 
Subjt:  SLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYASIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVSGDKYHDDADNIYSMLGLLF

Query:  HELEICGYVPEKD------ESSKD
        HEL+ICG VPE +      ESSKD
Subjt:  HELEICGYVPEKD------ESSKD

XP_022945035.1 pentatricopeptide repeat-containing protein At1g05750, chloroplastic [Cucurbita moschata]2.3e-24480.39Show/hide
Query:  MNSISAHTAIPTRFR-PDSNPTSPTKIIFPRSANSSNHKFTANSIDPIVVWTSSIARHCRNGQLAEAAAELTRMRLAGVEPNHITFITLISGCADFPSRS
        M+S+ AHTAIP +F+    + ++P+ + FPR  NSS      N I PIV+WTSSIAR+CRN QL EAAAE TRMRLAGVEPNHITFITL+SGCADFPS S
Subjt:  MNSISAHTAIPTRFR-PDSNPTSPTKIIFPRSANSSNHKFTANSIDPIVVWTSSIARHCRNGQLAEAAAELTRMRLAGVEPNHITFITLISGCADFPSRS

Query:  LYFGSSLHGYARKFGLDTGHVMVGTALIGMYSKFGQLGLGRKVFDYLGMKNSVTWNTMLDGYMRNGEVEFALNLFDEMPTRDAISWTALINGLLKQGYSE
        L+FG+SLHGY RK GLDTGHVMVGTALI MY+K  QLGL R VFDYL MKNSVTWNTMLDGYMRNGE+E A+ LFDEMPTRDAISWTALING LKQGYSE
Subjt:  LYFGSSLHGYARKFGLDTGHVMVGTALIGMYSKFGQLGLGRKVFDYLGMKNSVTWNTMLDGYMRNGEVEFALNLFDEMPTRDAISWTALINGLLKQGYSE

Query:  QALECFHLMQCSGVEPDYVSVIAVLAACADLGTLTLGLWVNRFVMQQEFKGNIRVNNSLIDMYSRCGCIGFARQVFEKMPNRTLVSWNSIIVGFAVNSYA
        QAL+CFH MQCSG+EPDYVS+IAVLAACADLG L+ GLWVNRF+MQQEFK NIR++NSLIDMYSRCGCI FARQVF+KMP RTLVSWNS+IVGFA N +A
Subjt:  QALECFHLMQCSGVEPDYVSVIAVLAACADLGTLTLGLWVNRFVMQQEFKGNIRVNNSLIDMYSRCGCIGFARQVFEKMPNRTLVSWNSIIVGFAVNSYA

Query:  EESLEFFDAMQREGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRMHRITPRIEHYGCIVDLYGRAGRLEEALSVIEKMPLKPNEVVLGSLLAACRTH
        +ESLEFFDAMQ+EGFK DGVSYTGALTACSHAGLVNKGLELFDNMKR+HRITPRIEHYGCIVDLY RAGRL+EAL+VIE MP+KPNEVVLGSLLAACRTH
Subjt:  EESLEFFDAMQREGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRMHRITPRIEHYGCIVDLYGRAGRLEEALSVIEKMPLKPNEVVLGSLLAACRTH

Query:  GDVSLAERLMKHLFKLDPGGDSNYVLLSNIYASIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVSGDKYHDDADNIYSMLGLLFHELEICGYV
        GDVSLAERL+K+LF+LDPGGDS+YVLLSNIYA++G+WEGAN VRRTMKARGVQKKPGFSS+EIDGKVHEFV+GDKYH DADNIYSML +LFHEL+ICGYV
Subjt:  GDVSLAERLMKHLFKLDPGGDSNYVLLSNIYASIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVSGDKYHDDADNIYSMLGLLFHELEICGYV

Query:  PEK------DESSKD
        PE       +ESSK+
Subjt:  PEK------DESSKD

XP_023521260.1 pentatricopeptide repeat-containing protein At1g05750, chloroplastic [Cucurbita pepo subsp. pepo]1.6e-24580.58Show/hide
Query:  MNSISAHTAIPTRFR-PDSNPTSPTKIIFPRSANSSNHKFTANSIDPIVVWTSSIARHCRNGQLAEAAAELTRMRLAGVEPNHITFITLISGCADFPSRS
        M+S+ AHTA+P +F+    + ++P+ + FPR  NSS      N I PIV+WTSSIAR+CRN QL EAAAE TRMRLAGVEPNHITFITL+SGCADFPS S
Subjt:  MNSISAHTAIPTRFR-PDSNPTSPTKIIFPRSANSSNHKFTANSIDPIVVWTSSIARHCRNGQLAEAAAELTRMRLAGVEPNHITFITLISGCADFPSRS

Query:  LYFGSSLHGYARKFGLDTGHVMVGTALIGMYSKFGQLGLGRKVFDYLGMKNSVTWNTMLDGYMRNGEVEFALNLFDEMPTRDAISWTALINGLLKQGYSE
        L+FG+SLHGY RK GLDTGHVMVGTALI MY+K  QLGL R VFDYL MKNSVTWNTMLDGYMRNGE+E A+ LFDEMPTRDAISWTALING LKQGYSE
Subjt:  LYFGSSLHGYARKFGLDTGHVMVGTALIGMYSKFGQLGLGRKVFDYLGMKNSVTWNTMLDGYMRNGEVEFALNLFDEMPTRDAISWTALINGLLKQGYSE

Query:  QALECFHLMQCSGVEPDYVSVIAVLAACADLGTLTLGLWVNRFVMQQEFKGNIRVNNSLIDMYSRCGCIGFARQVFEKMPNRTLVSWNSIIVGFAVNSYA
        QALECFH MQCSG+EPDYVS+IAVLAACADLG L+ GLWVNRF+MQQEFK NIR++NSLIDMYSRCGCI FARQVF+KMP RTLVSWNS+IVGFA+N +A
Subjt:  QALECFHLMQCSGVEPDYVSVIAVLAACADLGTLTLGLWVNRFVMQQEFKGNIRVNNSLIDMYSRCGCIGFARQVFEKMPNRTLVSWNSIIVGFAVNSYA

Query:  EESLEFFDAMQREGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRMHRITPRIEHYGCIVDLYGRAGRLEEALSVIEKMPLKPNEVVLGSLLAACRTH
        +ESLEFFDAMQ+EGFK DGVSYTGALTACSHAGLVNKGLELFDNMKR+HRITPRIEHYGCIVDLY RAGRL+EAL+VIE MP+KPNEVVLGSLLAACRTH
Subjt:  EESLEFFDAMQREGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRMHRITPRIEHYGCIVDLYGRAGRLEEALSVIEKMPLKPNEVVLGSLLAACRTH

Query:  GDVSLAERLMKHLFKLDPGGDSNYVLLSNIYASIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVSGDKYHDDADNIYSMLGLLFHELEICGYV
        GDVSLAERL+K+LF+LDPGGDS+YVLLSNIYA++G+WEGANKVRRTMKARGVQKKPGFSS+EIDGKVHEFV+GDKYH DADNIYSML +LFHEL+ICGYV
Subjt:  GDVSLAERLMKHLFKLDPGGDSNYVLLSNIYASIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVSGDKYHDDADNIYSMLGLLFHELEICGYV

Query:  PEK------DESSKD
        PE       +ESSK+
Subjt:  PEK------DESSKD

XP_038877228.1 pentatricopeptide repeat-containing protein At1g05750, chloroplastic [Benincasa hispida]8.2e-25081.49Show/hide
Query:  MNSISAHTAIPTRFRPDSNPTS------PTKIIFPRSANSSNH----KFTANSIDPIVVWTSSIARHCRNGQLAEAAAELTRMRLAGVEPNHITFITLIS
        M+S  ++TAIP++ +   NP S      PTK+ FPRS NSS+     KF ANSIDPIV+WTSS+AR+CRNGQL+EAA E TRMRLAGVEPNH+TFITL+S
Subjt:  MNSISAHTAIPTRFRPDSNPTS------PTKIIFPRSANSSNH----KFTANSIDPIVVWTSSIARHCRNGQLAEAAAELTRMRLAGVEPNHITFITLIS

Query:  GCADFPSRSLYFGSSLHGYARKFGLDTGHVMVGTALIGMYSKFGQLGLGRKVFDYLGMKNSVTWNTMLDGYMRNGEVEFALNLFDEMPTRDAISWTALIN
        GC DFPS SL+FGSSLHGYARK GLDTGHVMVGTAL+ MY+K  Q  L RKVFDYLGMKNSVTWNTMLDGY RNGE+E A++LFDEMPTRDAISWTALIN
Subjt:  GCADFPSRSLYFGSSLHGYARKFGLDTGHVMVGTALIGMYSKFGQLGLGRKVFDYLGMKNSVTWNTMLDGYMRNGEVEFALNLFDEMPTRDAISWTALIN

Query:  GLLKQGYSEQALECFHLMQCSGVEPDYVSVIAVLAACADLGTLTLGLWVNRFVMQQEFKGNIRVNNSLIDMYSRCGCIGFARQVFEKMPNRTLVSWNSII
        GLLKQG+SEQALECFH MQCSG+EPDYVS+IAVLAACADLG LTLGLWVNRFVMQQEFK NIR++NSL+DMYSRCGCI FARQVFEKMP RTLVSWNSII
Subjt:  GLLKQGYSEQALECFHLMQCSGVEPDYVSVIAVLAACADLGTLTLGLWVNRFVMQQEFKGNIRVNNSLIDMYSRCGCIGFARQVFEKMPNRTLVSWNSII

Query:  VGFAVNSYAEESLEFFDAMQREGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRMHRITPRIEHYGCIVDLYGRAGRLEEALSVIEKMPLKPNEVVLG
        VGFAVN +A+ESLEFFDAMQ EGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKR+H+ITPRIEHYGCIVDLYGRAGRLE+AL+VIE+MP+KPNEVVLG
Subjt:  VGFAVNSYAEESLEFFDAMQREGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRMHRITPRIEHYGCIVDLYGRAGRLEEALSVIEKMPLKPNEVVLG

Query:  SLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYASIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVSGDKYHDDADNIYSMLGLLF
        SLLAACRT+GDVSLAE+LMKHL KLDP GDSNYVLLSNIYA+IG+WEGANKVRRTMKARGVQKKPG SSVEIDGKVHEFV+GDKYH DADNIYSML LLF
Subjt:  SLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYASIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVSGDKYHDDADNIYSMLGLLF

Query:  HELEICGYVPEKD------ESSKD
        HEL+I GYVP+ +      E SKD
Subjt:  HELEICGYVPEKD------ESSKD

TrEMBL top hitse value%identityAlignment
A0A0A0LYD6 Uncharacterized protein9.2e-23978.82Show/hide
Query:  MNSISAHTAIPTR-----FRPDSNPTS-PTKIIFPRSANSSNH----KFTANSIDPIVVWTSSIARHCRNGQLAEAAAELTRMRLAGVEPNHITFITLIS
        M+SI +HTA P++     F P S P S PTK+ FPRS NS +     KF  NS+DPIV+WTSS+AR+CRNGQL+EAAAE TRMRLAGVEPNHITFITL+S
Subjt:  MNSISAHTAIPTR-----FRPDSNPTS-PTKIIFPRSANSSNH----KFTANSIDPIVVWTSSIARHCRNGQLAEAAAELTRMRLAGVEPNHITFITLIS

Query:  GCADFPSRSLYFGSSLHGYARKFGLDTGHVMVGTALIGMYSKFGQLGLGRKVFDYLGMKNSVTWNTMLDGYMRNGEVEFALNLFDEMPTRDAISWTALIN
         CADFPS S +F SSLHGYA K+GLDTGHVMVGTALI MYSK  QLG  RKVF  LG+KNSV+WNTML+G+MRNGE+E A+ LFDEMPTRDAISWTALIN
Subjt:  GCADFPSRSLYFGSSLHGYARKFGLDTGHVMVGTALIGMYSKFGQLGLGRKVFDYLGMKNSVTWNTMLDGYMRNGEVEFALNLFDEMPTRDAISWTALIN

Query:  GLLKQGYSEQALECFHLMQCSGVEPDYVSVIAVLAACADLGTLTLGLWVNRFVMQQEFKGNIRVNNSLIDMYSRCGCIGFARQVFEKMPNRTLVSWNSII
        GLLK GYSEQALECFH MQ SGV  DYVS+IAVLAACADLG LTLGLWV+RFVM QEFK NI+++NSLIDMYSRCGCI FARQVF KM  RTLVSWNSII
Subjt:  GLLKQGYSEQALECFHLMQCSGVEPDYVSVIAVLAACADLGTLTLGLWVNRFVMQQEFKGNIRVNNSLIDMYSRCGCIGFARQVFEKMPNRTLVSWNSII

Query:  VGFAVNSYAEESLEFFDAMQREGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRMHRITPRIEHYGCIVDLYGRAGRLEEALSVIEKMPLKPNEVVLG
        VGFAVN +A+ESLEFF AMQ+EGFKPDGVSYTGALTACSHAGLVNKGLELFDNMK +H+ITPRIEHYGCIVDLYGRAGRLE+AL++IE+MP+KPNEVVLG
Subjt:  VGFAVNSYAEESLEFFDAMQREGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRMHRITPRIEHYGCIVDLYGRAGRLEEALSVIEKMPLKPNEVVLG

Query:  SLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYASIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVSGDKYHDDADNIYSMLGLLF
        SLLAACRTHGDV+LAERLMKHLFKLDP GD+ YVLLSNIYA+IGKW+GAN VRRTMKARGVQKKPG+SSVEIDGKVHEFV+GD YH DADNIYSML LL 
Subjt:  SLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYASIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVSGDKYHDDADNIYSMLGLLF

Query:  HELEICGYVPEKD------ESSKD
        HEL++CGYVP  D      ES+KD
Subjt:  HELEICGYVPEKD------ESSKD

A0A5A7UJB6 Pentatricopeptide repeat-containing protein1.6e-23879.5Show/hide
Query:  MNSISAHTAIPTRF-RPDSNP---TSPTKIIFPRSANSSN----HKFTANSIDPIVVWTSSIARHCRNGQLAEAAAELTRMRLAGVEPNHITFITLISGC
        M+SI +H A P++  +P S+    ++PTK+ FPRS  S +     KFTANS+ PIV WTSSIAR+C NGQL EAAAE TRMRLAGVEPNHITFITL+SGC
Subjt:  MNSISAHTAIPTRF-RPDSNP---TSPTKIIFPRSANSSN----HKFTANSIDPIVVWTSSIARHCRNGQLAEAAAELTRMRLAGVEPNHITFITLISGC

Query:  ADFPSRSLYFGSSLHGYARKFGLDTGHVMVGTALIGMYSKFGQLGLGRKVFDYLGMKNSVTWNTMLDGYMRNGEVEFALNLFDEMPTRDAISWTALINGL
        ADFPS S +F SSLHGYA KFGLDTGHVMVGTALI MYSK  QLGL +KVFDYLG+KNSV+WNTML+G+MRNGE+E A+ LFDEMPTRDAISWTALINGL
Subjt:  ADFPSRSLYFGSSLHGYARKFGLDTGHVMVGTALIGMYSKFGQLGLGRKVFDYLGMKNSVTWNTMLDGYMRNGEVEFALNLFDEMPTRDAISWTALINGL

Query:  LKQGYSEQALECFHLMQCSGVEPDYVSVIAVLAACADLGTLTLGLWVNRFVMQQEFKGNIRVNNSLIDMYSRCGCIGFARQVFEKMPNRTLVSWNSIIVG
        LK GYSEQALECFH MQ SGV  DYVS+IAVLAACADLG LT GLWVNRFVMQQEFK N+R++NSLIDMYSRCGCI FARQVF KM  RTLVSWNSIIVG
Subjt:  LKQGYSEQALECFHLMQCSGVEPDYVSVIAVLAACADLGTLTLGLWVNRFVMQQEFKGNIRVNNSLIDMYSRCGCIGFARQVFEKMPNRTLVSWNSIIVG

Query:  FAVNSYAEESLEFFDAMQREGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRMHRITPRIEHYGCIVDLYGRAGRLEEALSVIEKMPLKPNEVVLGSL
        FA N +A+ESLEFF AMQ+EGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKR+H+ITP IEHYGCIVDLYGRAGRLE+A +VIE+MP+KPNEVVLGSL
Subjt:  FAVNSYAEESLEFFDAMQREGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRMHRITPRIEHYGCIVDLYGRAGRLEEALSVIEKMPLKPNEVVLGSL

Query:  LAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYASIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVSGDKYHDDADNIYSMLGLLFHE
        LAACRTHGDV LAERLMKH+FKLD  GDSNYVLLSNIYA+IGKWEGANKVRRTMKARGVQKK G+SSVEIDGKVHEFV+GDKYH DADNIYSML LLFHE
Subjt:  LAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYASIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVSGDKYHDDADNIYSMLGLLFHE

Query:  LEICGYVPEKD------ESSKD
        L++CGYVP+ D      +S+KD
Subjt:  LEICGYVPEKD------ESSKD

A0A6J1CN07 pentatricopeptide repeat-containing protein At1g05750, chloroplastic2.0e-24981.68Show/hide
Query:  MNSISAHTAIPTRFRPDSNPT------SPTKIIFPRSANSSNH----KFTANSIDPIVVWTSSIARHCRNGQLAEAAAELTRMRLAGVEPNHITFITLIS
        M+SI A+TA   + +   NP       +P  I FPRS NSSN     K T NSIDPIV+WTSSIAR+CRNGQLAEAAAE T MRLAGVEPNH+T ITL+S
Subjt:  MNSISAHTAIPTRFRPDSNPT------SPTKIIFPRSANSSNH----KFTANSIDPIVVWTSSIARHCRNGQLAEAAAELTRMRLAGVEPNHITFITLIS

Query:  GCADFPSRSLYFGSSLHGYARKFGLDTGHVMVGTALIGMYSKFGQLGLGRKVFDYLGMKNSVTWNTMLDGYMRNGEVEFALNLFDEMPTRDAISWTALIN
        GCADFPS SLYFGSSLHGYARK GLDT HVMVGT+++ MY+K  QLGL R+VFDYL MKNSV+WNTMLDGY RNGE+E A++LFDEMPTRDAISWTALIN
Subjt:  GCADFPSRSLYFGSSLHGYARKFGLDTGHVMVGTALIGMYSKFGQLGLGRKVFDYLGMKNSVTWNTMLDGYMRNGEVEFALNLFDEMPTRDAISWTALIN

Query:  GLLKQGYSEQALECFHLMQCSGVEPDYVSVIAVLAACADLGTLTLGLWVNRFVMQQEFKGNIRVNNSLIDMYSRCGCIGFARQVFEKMPNRTLVSWNSII
        GLLKQGYSEQALECFH MQCSG++PDYVS+IAVLAACADLGTLTLGLWVNRFVMQQEFK NIR++NSLIDMYSRCGCIGFARQVFE+M  RTLVSWNSII
Subjt:  GLLKQGYSEQALECFHLMQCSGVEPDYVSVIAVLAACADLGTLTLGLWVNRFVMQQEFKGNIRVNNSLIDMYSRCGCIGFARQVFEKMPNRTLVSWNSII

Query:  VGFAVNSYAEESLEFFDAMQREGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRMHRITPRIEHYGCIVDLYGRAGRLEEALSVIEKMPLKPNEVVLG
        VG+A N +A+ESLEFFDAMQ+EGFKPD VSYTGALTACSHAGLVNKGLELFDNMKR+HRI PRIEHYGCIVDLYGRAGRLE+AL VIEKMP+KPNEVVLG
Subjt:  VGFAVNSYAEESLEFFDAMQREGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRMHRITPRIEHYGCIVDLYGRAGRLEEALSVIEKMPLKPNEVVLG

Query:  SLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYASIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVSGDKYHDDADNIYSMLGLLF
        SLLAACRTHGDVSLAERLMKHL KLDPGGDSNYVLLSNIYA+IGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFV+GDKYH DAD+IYSML LL 
Subjt:  SLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYASIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVSGDKYHDDADNIYSMLGLLF

Query:  HELEICGYVPEKD------ESSKD
        HEL+ICG VPE +      ESSKD
Subjt:  HELEICGYVPEKD------ESSKD

A0A6J1FZR8 pentatricopeptide repeat-containing protein At1g05750, chloroplastic1.1e-24480.39Show/hide
Query:  MNSISAHTAIPTRFR-PDSNPTSPTKIIFPRSANSSNHKFTANSIDPIVVWTSSIARHCRNGQLAEAAAELTRMRLAGVEPNHITFITLISGCADFPSRS
        M+S+ AHTAIP +F+    + ++P+ + FPR  NSS      N I PIV+WTSSIAR+CRN QL EAAAE TRMRLAGVEPNHITFITL+SGCADFPS S
Subjt:  MNSISAHTAIPTRFR-PDSNPTSPTKIIFPRSANSSNHKFTANSIDPIVVWTSSIARHCRNGQLAEAAAELTRMRLAGVEPNHITFITLISGCADFPSRS

Query:  LYFGSSLHGYARKFGLDTGHVMVGTALIGMYSKFGQLGLGRKVFDYLGMKNSVTWNTMLDGYMRNGEVEFALNLFDEMPTRDAISWTALINGLLKQGYSE
        L+FG+SLHGY RK GLDTGHVMVGTALI MY+K  QLGL R VFDYL MKNSVTWNTMLDGYMRNGE+E A+ LFDEMPTRDAISWTALING LKQGYSE
Subjt:  LYFGSSLHGYARKFGLDTGHVMVGTALIGMYSKFGQLGLGRKVFDYLGMKNSVTWNTMLDGYMRNGEVEFALNLFDEMPTRDAISWTALINGLLKQGYSE

Query:  QALECFHLMQCSGVEPDYVSVIAVLAACADLGTLTLGLWVNRFVMQQEFKGNIRVNNSLIDMYSRCGCIGFARQVFEKMPNRTLVSWNSIIVGFAVNSYA
        QAL+CFH MQCSG+EPDYVS+IAVLAACADLG L+ GLWVNRF+MQQEFK NIR++NSLIDMYSRCGCI FARQVF+KMP RTLVSWNS+IVGFA N +A
Subjt:  QALECFHLMQCSGVEPDYVSVIAVLAACADLGTLTLGLWVNRFVMQQEFKGNIRVNNSLIDMYSRCGCIGFARQVFEKMPNRTLVSWNSIIVGFAVNSYA

Query:  EESLEFFDAMQREGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRMHRITPRIEHYGCIVDLYGRAGRLEEALSVIEKMPLKPNEVVLGSLLAACRTH
        +ESLEFFDAMQ+EGFK DGVSYTGALTACSHAGLVNKGLELFDNMKR+HRITPRIEHYGCIVDLY RAGRL+EAL+VIE MP+KPNEVVLGSLLAACRTH
Subjt:  EESLEFFDAMQREGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRMHRITPRIEHYGCIVDLYGRAGRLEEALSVIEKMPLKPNEVVLGSLLAACRTH

Query:  GDVSLAERLMKHLFKLDPGGDSNYVLLSNIYASIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVSGDKYHDDADNIYSMLGLLFHELEICGYV
        GDVSLAERL+K+LF+LDPGGDS+YVLLSNIYA++G+WEGAN VRRTMKARGVQKKPGFSS+EIDGKVHEFV+GDKYH DADNIYSML +LFHEL+ICGYV
Subjt:  GDVSLAERLMKHLFKLDPGGDSNYVLLSNIYASIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVSGDKYHDDADNIYSMLGLLFHELEICGYV

Query:  PEK------DESSKD
        PE       +ESSK+
Subjt:  PEK------DESSKD

A0A6J1HU31 pentatricopeptide repeat-containing protein At1g05750, chloroplastic6.1e-24379.5Show/hide
Query:  MNSISAHTAIPTRFRPD--SNPTS------PTKIIFPRSANSSNHKFTANSIDPIVVWTSSIARHCRNGQLAEAAAELTRMRLAGVEPNHITFITLISGC
        M+S+ +HT IP +F+    SNP S      P+ + FPR+ NSS      N I PIV+WTSSIAR+CRN QLAEAAAE TRMRLAGVEPNHITFITL+SGC
Subjt:  MNSISAHTAIPTRFRPD--SNPTS------PTKIIFPRSANSSNHKFTANSIDPIVVWTSSIARHCRNGQLAEAAAELTRMRLAGVEPNHITFITLISGC

Query:  ADFPSRSLYFGSSLHGYARKFGLDTGHVMVGTALIGMYSKFGQLGLGRKVFDYLGMKNSVTWNTMLDGYMRNGEVEFALNLFDEMPTRDAISWTALINGL
        ADFPS SL+FG+SLHGY RK GLDTGHVMVGTALI MY+K  QLGL R VFDYL MKNSVTWNTMLDGYMRNGE+E A+ LFDEMPTRDAISWTALING 
Subjt:  ADFPSRSLYFGSSLHGYARKFGLDTGHVMVGTALIGMYSKFGQLGLGRKVFDYLGMKNSVTWNTMLDGYMRNGEVEFALNLFDEMPTRDAISWTALINGL

Query:  LKQGYSEQALECFHLMQCSGVEPDYVSVIAVLAACADLGTLTLGLWVNRFVMQQEFKGNIRVNNSLIDMYSRCGCIGFARQVFEKMPNRTLVSWNSIIVG
        LKQGYSEQALECFH MQCSG+EPDYVS+IAVLAACADLG L+ GLWVNRF+MQQEFK NIR++NSLIDMYSRCGCI FARQVF+KM   TLVSWNS+IVG
Subjt:  LKQGYSEQALECFHLMQCSGVEPDYVSVIAVLAACADLGTLTLGLWVNRFVMQQEFKGNIRVNNSLIDMYSRCGCIGFARQVFEKMPNRTLVSWNSIIVG

Query:  FAVNSYAEESLEFFDAMQREGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRMHRITPRIEHYGCIVDLYGRAGRLEEALSVIEKMPLKPNEVVLGSL
        FA+N +A+ESLEFFDAMQ+EGF  DGVSYTGALTACSHAGLVNKGLELFDNMKR+HRITPRIEHYGCIVDLY RAGRL+EAL+VIE MP+KPNEVVLGSL
Subjt:  FAVNSYAEESLEFFDAMQREGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRMHRITPRIEHYGCIVDLYGRAGRLEEALSVIEKMPLKPNEVVLGSL

Query:  LAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYASIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVSGDKYHDDADNIYSMLGLLFHE
        LAACRTHGDVSLAERL+K+LF+LDPGGDS+YVLLSNIYA++G+WEGANKVRRTMKARGVQKKPGFSS+EIDGKVHEFV+GDKYH DADNIYSML +LFHE
Subjt:  LAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYASIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVSGDKYHDDADNIYSMLGLLFHE

Query:  LEICGYVPEK------DESSKD
        L+I GYVPE       +ESSK+
Subjt:  LEICGYVPEK------DESSKD

SwissProt top hitse value%identityAlignment
O23337 Pentatricopeptide repeat-containing protein At4g148202.2e-9636.31Show/hide
Query:  NSSNHKFTANSIDPIVVWTSSIARHCRNGQLAEAAAELTRMRLAGVEPNHITFITLISGCADFPSRSLYFGSSLHGY--ARKFGLDTGHVMVGTALIGMY
        N + + F   S   +V W + I R+CR G + EA      M+ + V P+ +    ++S C    + ++ +  +++ +       +DT H++  TAL+ MY
Subjt:  NSSNHKFTANSIDPIVVWTSSIARHCRNGQLAEAAAELTRMRLAGVEPNHITFITLISGCADFPSRSLYFGSSLHGY--ARKFGLDTGHVMVGTALIGMY

Query:  SKFGQLGLGRKVFDYLGMKNSVTWNTMLDGYMRNGEVEFALNLFDEMPTRDAISWTALINGLLKQGYSEQALECFHLMQCSGVEPDYVSVIAVLAACADL
        +  G + + R+ F  + ++N      M+ GY + G ++ A  +FD+   +D + WT +I+  ++  Y ++AL  F  M CSG++PD VS+ +V++ACA+L
Subjt:  SKFGQLGLGRKVFDYLGMKNSVTWNTMLDGYMRNGEVEFALNLFDEMPTRDAISWTALINGLLKQGYSEQALECFHLMQCSGVEPDYVSVIAVLAACADL

Query:  GTLTLGLWVNRFVMQQEFKGNIRVNNSLIDMYSRCGCIGFARQVFEKMPNRTLVSWNSIIVGFAVNSYAEESLEFFDAMQREGFKPDGVSYTGALTACSH
        G L    WV+  +     +  + +NN+LI+MY++CG +   R VFEKMP R +VSW+S+I   +++  A ++L  F  M++E  +P+ V++ G L  CSH
Subjt:  GTLTLGLWVNRFVMQQEFKGNIRVNNSLIDMYSRCGCIGFARQVFEKMPNRTLVSWNSIIVGFAVNSYAEESLEFFDAMQREGFKPDGVSYTGALTACSH

Query:  AGLVNKGLELFDNMKRMHRITPRIEHYGCIVDLYGRAGRLEEALSVIEKMPLKPNEVVLGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIY
        +GLV +G ++F +M   + ITP++EHYGC+VDL+GRA  L EAL VIE MP+  N V+ GSL++ACR HG++ L +   K + +L+P  D   VL+SNIY
Subjt:  AGLVNKGLELFDNMKRMHRITPRIEHYGCIVDLYGRAGRLEEALSVIEKMPLKPNEVVLGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIY

Query:  ASIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVSGDKYHDDADNIYSMLGLLFHELEICGYVPE
        A   +WE    +RR M+ + V K+ G S ++ +GK HEF+ GDK H  ++ IY+ L  +  +L++ GYVP+
Subjt:  ASIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVSGDKYHDDADNIYSMLGLLFHELEICGYVPE

O82380 Pentatricopeptide repeat-containing protein At2g29760, chloroplastic7.0e-9536.21Show/hide
Query:  FTANSIDPIVVWTSSIARHCRNGQLAEAAAELTRMRLAGVEPNHITFITLISGCADFPSRSLYFGSSLHGYARKFGLDTGHVMVGTALIGMYSKFGQLGL
        FT      +V W S I    + G   +A     +M    V+ +H+T + ++S CA    R+L FG  +  Y  +  ++  ++ +  A++ MY+K G +  
Subjt:  FTANSIDPIVVWTSSIARHCRNGQLAEAAAELTRMRLAGVEPNHITFITLISGCADFPSRSLYFGSSLHGYARKFGLDTGHVMVGTALIGMYSKFGQLGL

Query:  GRKVFDYLGMKNSVTWNTMLDGYMRNGEVEFALNLFDEMPTRDAISWTALINGLLKQGYSEQALECFHLMQC-SGVEPDYVSVIAVLAACADLGTLTLGL
         +++FD +  K++VTW TMLDGY  + + E A  + + MP +D ++W ALI+   + G   +AL  FH +Q    ++ + +++++ L+ACA +G L LG 
Subjt:  GRKVFDYLGMKNSVTWNTMLDGYMRNGEVEFALNLFDEMPTRDAISWTALINGLLKQGYSEQALECFHLMQC-SGVEPDYVSVIAVLAACADLGTLTLGL

Query:  WVNRFVMQQEFKGNIRVNNSLIDMYSRCGCIGFARQVFEKMPNRTLVSWNSIIVGFAVNSYAEESLEFFDAMQREGFKPDGVSYTGALTACSHAGLVNKG
        W++ ++ +   + N  V ++LI MYS+CG +  +R+VF  +  R +  W+++I G A++    E+++ F  MQ    KP+GV++T    ACSH GLV++ 
Subjt:  WVNRFVMQQEFKGNIRVNNSLIDMYSRCGCIGFARQVFEKMPNRTLVSWNSIIVGFAVNSYAEESLEFFDAMQREGFKPDGVSYTGALTACSHAGLVNKG

Query:  LELFDNMKRMHRITPRIEHYGCIVDLYGRAGRLEEALSVIEKMPLKPNEVVLGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYASIGKWE
          LF  M+  + I P  +HY CIVD+ GR+G LE+A+  IE MP+ P+  V G+LL AC+ H +++LAE     L +L+P  D  +VLLSNIYA +GKWE
Subjt:  LELFDNMKRMHRITPRIEHYGCIVDLYGRAGRLEEALSVIEKMPLKPNEVVLGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYASIGKWE

Query:  GANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVSGDKYHDDADNIYSMLGLLFHELEICGYVPE
          +++R+ M+  G++K+PG SS+EIDG +HEF+SGD  H  ++ +Y  L  +  +L+  GY PE
Subjt:  GANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVSGDKYHDDADNIYSMLGLLFHELEICGYVPE

Q9LUJ2 Pentatricopeptide repeat-containing protein At3g226908.6e-10140.4Show/hide
Query:  ARHCRNGQLAEAAAELTRMRLAGVEPNHITFITLISGCADFPSRSLYFGSSLHGYARKFGLDTGHVMVGTALIGMYSKFGQLGLGRKVFDYLGMKNSVTW
        + + R G   EA      M  +GV P+ I+ ++ IS C+    R++ +G S HGY  + G ++    +  ALI MY K  +     ++FD +  K  VTW
Subjt:  ARHCRNGQLAEAAAELTRMRLAGVEPNHITFITLISGCADFPSRSLYFGSSLHGYARKFGLDTGHVMVGTALIGMYSKFGQLGLGRKVFDYLGMKNSVTW

Query:  NTMLDGYMRNGEVEFALNLFDEMPTRDAISWTALINGLLKQGYSEQALECFHLMQC-SGVEPDYVSVIAVLAACADLGTLTLGLWVNRFVMQQEFKGNIR
        N+++ GY+ NGEV+ A   F+ MP ++ +SW  +I+GL++    E+A+E F  MQ   GV  D V+++++ +AC  LG L L  W+  ++ +   + ++R
Subjt:  NTMLDGYMRNGEVEFALNLFDEMPTRDAISWTALINGLLKQGYSEQALECFHLMQC-SGVEPDYVSVIAVLAACADLGTLTLGLWVNRFVMQQEFKGNIR

Query:  VNNSLIDMYSRCGCIGFARQVFEKMPNRTLVSWNSIIVGFAVNSYAEESLEFFDAMQREGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRMHRITPR
        +  +L+DM+SRCG    A  +F  + NR + +W + I   A+   AE ++E FD M  +G KPDGV++ GALTACSH GLV +G E+F +M ++H ++P 
Subjt:  VNNSLIDMYSRCGCIGFARQVFEKMPNRTLVSWNSIIVGFAVNSYAEESLEFFDAMQREGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRMHRITPR

Query:  IEHYGCIVDLYGRAGRLEEALSVIEKMPLKPNEVVLGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYASIGKWEGANKVRRTMKARGVQK
          HYGC+VDL GRAG LEEA+ +IE MP++PN+V+  SLLAACR  G+V +A    + +  L P    +YVLLSN+YAS G+W    KVR +MK +G++K
Subjt:  IEHYGCIVDLYGRAGRLEEALSVIEKMPLKPNEVVLGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYASIGKWEGANKVRRTMKARGVQK

Query:  KPGFSSVEIDGKVHEFVSGDKYHDDADNIYSMLGLLFHELEICGYVPE
         PG SS++I GK HEF SGD+ H +  NI +ML  +       G+VP+
Subjt:  KPGFSSVEIDGKVHEFVSGDKYHDDADNIYSMLGLLFHELEICGYVPE

Q9MA50 Pentatricopeptide repeat-containing protein At1g05750, chloroplastic3.1e-16759.75Show/hide
Query:  TSPTKIIFPRSANSSNHKFTANSIDPIVVWTSSIARHCRNGQLAEAAAELTRMRLAGVEPNHITFITLISGCADFPSRSLYFGSSLHGYARKFGLDTGHV
        TSP  I     AN    +   ++ +  V WTS I    RNG+LAEAA E + M LAGVEPNHITFI L+SGC DF S S   G  LHGYA K GLD  HV
Subjt:  TSPTKIIFPRSANSSNHKFTANSIDPIVVWTSSIARHCRNGQLAEAAAELTRMRLAGVEPNHITFITLISGCADFPSRSLYFGSSLHGYARKFGLDTGHV

Query:  MVGTALIGMYSKFGQLGLGRKVFDYLGMKNSVTWNTMLDGYMRNGEVEFALNLFDEMPTRDAISWTALINGLLKQGYSEQALECFHLMQCSGVEPDYVSV
        MVGTA+IGMYSK G+    R VFDY+  KNSVTWNTM+DGYMR+G+V+ A  +FD+MP RD ISWTA+ING +K+GY E+AL  F  MQ SGV+PDYV++
Subjt:  MVGTALIGMYSKFGQLGLGRKVFDYLGMKNSVTWNTMLDGYMRNGEVEFALNLFDEMPTRDAISWTALINGLLKQGYSEQALECFHLMQCSGVEPDYVSV

Query:  IAVLAACADLGTLTLGLWVNRFVMQQEFKGNIRVNNSLIDMYSRCGCIGFARQVFEKMPNRTLVSWNSIIVGFAVNSYAEESLEFFDAMQREGFKPDGVS
        IA L AC +LG L+ GLWV+R+V+ Q+FK N+RV+NSLID+Y RCGC+ FARQVF  M  RT+VSWNS+IVGFA N  A ESL +F  MQ +GFKPD V+
Subjt:  IAVLAACADLGTLTLGLWVNRFVMQQEFKGNIRVNNSLIDMYSRCGCIGFARQVFEKMPNRTLVSWNSIIVGFAVNSYAEESLEFFDAMQREGFKPDGVS

Query:  YTGALTACSHAGLVNKGLELFDNMKRMHRITPRIEHYGCIVDLYGRAGRLEEALSVIEKMPLKPNEVVLGSLLAACRTHG-DVSLAERLMKHLFKLDPGG
        +TGALTACSH GLV +GL  F  MK  +RI+PRIEHYGC+VDLY RAGRLE+AL +++ MP+KPNEVV+GSLLAAC  HG ++ LAERLMKHL  L+   
Subjt:  YTGALTACSHAGLVNKGLELFDNMKRMHRITPRIEHYGCIVDLYGRAGRLEEALSVIEKMPLKPNEVVLGSLLAACRTHG-DVSLAERLMKHLFKLDPGG

Query:  DSNYVLLSNIYASIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVSGDKYHDDADNIYSMLGLLFHELEICGYVPE
         SNYV+LSN+YA+ GKWEGA+K+RR MK  G++K+PGFSS+EID  +H F++GD  H +   I  +L L+  +L + G V E
Subjt:  DSNYVLLSNIYASIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVSGDKYHDDADNIYSMLGLLFHELEICGYVPE

Q9SZT8 Pentatricopeptide repeat-containing protein ELI1, chloroplastic5.4e-9537.53Show/hide
Query:  SIDP-IVVWTSSIARHCRNGQLAEAAAELTRMRLAGVEPNHITFITLISGCADFPSRSLYFGSSLHGYARKFGLDTGHVMVGTALIGMYSKFGQLGLGRK
        +IDP + ++T++I     NG   +A     ++  + + PN  TF +L+  C      S   G  +H +  KFGL      V T L+ +Y+K G +   +K
Subjt:  SIDP-IVVWTSSIARHCRNGQLAEAAAELTRMRLAGVEPNHITFITLISGCADFPSRSLYFGSSLHGYARKFGLDTGHVMVGTALIGMYSKFGQLGLGRK

Query:  VFDYLGMKNSVTWNTMLDGYMRNGEVEFALNLFDEMPTRDAISWTALINGLLKQGYSEQALECFHLMQCSG-VEPDYVSVIAVLAACADLGTLTLGLWVN
        VFD +  ++ V+   M+  Y + G VE A  LFD M  RD +SW  +I+G  + G+   AL  F  +   G  +PD ++V+A L+AC+ +G L  G W++
Subjt:  VFDYLGMKNSVTWNTMLDGYMRNGEVEFALNLFDEMPTRDAISWTALINGLLKQGYSEQALECFHLMQCSG-VEPDYVSVIAVLAACADLGTLTLGLWVN

Query:  RFVMQQEFKGNIRVNNSLIDMYSRCGCIGFARQVFEKMPNRTLVSWNSIIVGFAVNSYAEESLEFFDAMQR-EGFKPDGVSYTGALTACSHAGLVNKGLE
         FV     + N++V   LIDMYS+CG +  A  VF   P + +V+WN++I G+A++ Y++++L  F+ MQ   G +P  +++ G L AC+HAGLVN+G+ 
Subjt:  RFVMQQEFKGNIRVNNSLIDMYSRCGCIGFARQVFEKMPNRTLVSWNSIIVGFAVNSYAEESLEFFDAMQR-EGFKPDGVSYTGALTACSHAGLVNKGLE

Query:  LFDNMKRMHRITPRIEHYGCIVDLYGRAGRLEEALSVIEKMPLKPNEVVLGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYASIGKWEGA
        +F++M + + I P+IEHYGC+V L GRAG+L+ A   I+ M +  + V+  S+L +C+ HGD  L + + ++L  L+      YVLLSNIYAS+G +EG 
Subjt:  LFDNMKRMHRITPRIEHYGCIVDLYGRAGRLEEALSVIEKMPLKPNEVVLGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYASIGKWEGA

Query:  NKVRRTMKARGVQKKPGFSSVEIDGKVHEFVSGDKYHDDADNIYSMLGLLFHELEICGYVPEKDESSKD
         KVR  MK +G+ K+PG S++EI+ KVHEF +GD+ H  +  IY+ML  +   ++  GYVP  +   +D
Subjt:  NKVRRTMKARGVQKKPGFSSVEIDGKVHEFVSGDKYHDDADNIYSMLGLLFHELEICGYVPEKDESSKD

Arabidopsis top hitse value%identityAlignment
AT1G05750.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.2e-16859.75Show/hide
Query:  TSPTKIIFPRSANSSNHKFTANSIDPIVVWTSSIARHCRNGQLAEAAAELTRMRLAGVEPNHITFITLISGCADFPSRSLYFGSSLHGYARKFGLDTGHV
        TSP  I     AN    +   ++ +  V WTS I    RNG+LAEAA E + M LAGVEPNHITFI L+SGC DF S S   G  LHGYA K GLD  HV
Subjt:  TSPTKIIFPRSANSSNHKFTANSIDPIVVWTSSIARHCRNGQLAEAAAELTRMRLAGVEPNHITFITLISGCADFPSRSLYFGSSLHGYARKFGLDTGHV

Query:  MVGTALIGMYSKFGQLGLGRKVFDYLGMKNSVTWNTMLDGYMRNGEVEFALNLFDEMPTRDAISWTALINGLLKQGYSEQALECFHLMQCSGVEPDYVSV
        MVGTA+IGMYSK G+    R VFDY+  KNSVTWNTM+DGYMR+G+V+ A  +FD+MP RD ISWTA+ING +K+GY E+AL  F  MQ SGV+PDYV++
Subjt:  MVGTALIGMYSKFGQLGLGRKVFDYLGMKNSVTWNTMLDGYMRNGEVEFALNLFDEMPTRDAISWTALINGLLKQGYSEQALECFHLMQCSGVEPDYVSV

Query:  IAVLAACADLGTLTLGLWVNRFVMQQEFKGNIRVNNSLIDMYSRCGCIGFARQVFEKMPNRTLVSWNSIIVGFAVNSYAEESLEFFDAMQREGFKPDGVS
        IA L AC +LG L+ GLWV+R+V+ Q+FK N+RV+NSLID+Y RCGC+ FARQVF  M  RT+VSWNS+IVGFA N  A ESL +F  MQ +GFKPD V+
Subjt:  IAVLAACADLGTLTLGLWVNRFVMQQEFKGNIRVNNSLIDMYSRCGCIGFARQVFEKMPNRTLVSWNSIIVGFAVNSYAEESLEFFDAMQREGFKPDGVS

Query:  YTGALTACSHAGLVNKGLELFDNMKRMHRITPRIEHYGCIVDLYGRAGRLEEALSVIEKMPLKPNEVVLGSLLAACRTHG-DVSLAERLMKHLFKLDPGG
        +TGALTACSH GLV +GL  F  MK  +RI+PRIEHYGC+VDLY RAGRLE+AL +++ MP+KPNEVV+GSLLAAC  HG ++ LAERLMKHL  L+   
Subjt:  YTGALTACSHAGLVNKGLELFDNMKRMHRITPRIEHYGCIVDLYGRAGRLEEALSVIEKMPLKPNEVVLGSLLAACRTHG-DVSLAERLMKHLFKLDPGG

Query:  DSNYVLLSNIYASIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVSGDKYHDDADNIYSMLGLLFHELEICGYVPE
         SNYV+LSN+YA+ GKWEGA+K+RR MK  G++K+PGFSS+EID  +H F++GD  H +   I  +L L+  +L + G V E
Subjt:  DSNYVLLSNIYASIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVSGDKYHDDADNIYSMLGLLFHELEICGYVPE

AT3G22690.1 CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012881), Pentatricopeptide repeat (InterPro:IPR002885)6.1e-10240.4Show/hide
Query:  ARHCRNGQLAEAAAELTRMRLAGVEPNHITFITLISGCADFPSRSLYFGSSLHGYARKFGLDTGHVMVGTALIGMYSKFGQLGLGRKVFDYLGMKNSVTW
        + + R G   EA      M  +GV P+ I+ ++ IS C+    R++ +G S HGY  + G ++    +  ALI MY K  +     ++FD +  K  VTW
Subjt:  ARHCRNGQLAEAAAELTRMRLAGVEPNHITFITLISGCADFPSRSLYFGSSLHGYARKFGLDTGHVMVGTALIGMYSKFGQLGLGRKVFDYLGMKNSVTW

Query:  NTMLDGYMRNGEVEFALNLFDEMPTRDAISWTALINGLLKQGYSEQALECFHLMQC-SGVEPDYVSVIAVLAACADLGTLTLGLWVNRFVMQQEFKGNIR
        N+++ GY+ NGEV+ A   F+ MP ++ +SW  +I+GL++    E+A+E F  MQ   GV  D V+++++ +AC  LG L L  W+  ++ +   + ++R
Subjt:  NTMLDGYMRNGEVEFALNLFDEMPTRDAISWTALINGLLKQGYSEQALECFHLMQC-SGVEPDYVSVIAVLAACADLGTLTLGLWVNRFVMQQEFKGNIR

Query:  VNNSLIDMYSRCGCIGFARQVFEKMPNRTLVSWNSIIVGFAVNSYAEESLEFFDAMQREGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRMHRITPR
        +  +L+DM+SRCG    A  +F  + NR + +W + I   A+   AE ++E FD M  +G KPDGV++ GALTACSH GLV +G E+F +M ++H ++P 
Subjt:  VNNSLIDMYSRCGCIGFARQVFEKMPNRTLVSWNSIIVGFAVNSYAEESLEFFDAMQREGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRMHRITPR

Query:  IEHYGCIVDLYGRAGRLEEALSVIEKMPLKPNEVVLGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYASIGKWEGANKVRRTMKARGVQK
          HYGC+VDL GRAG LEEA+ +IE MP++PN+V+  SLLAACR  G+V +A    + +  L P    +YVLLSN+YAS G+W    KVR +MK +G++K
Subjt:  IEHYGCIVDLYGRAGRLEEALSVIEKMPLKPNEVVLGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYASIGKWEGANKVRRTMKARGVQK

Query:  KPGFSSVEIDGKVHEFVSGDKYHDDADNIYSMLGLLFHELEICGYVPE
         PG SS++I GK HEF SGD+ H +  NI +ML  +       G+VP+
Subjt:  KPGFSSVEIDGKVHEFVSGDKYHDDADNIYSMLGLLFHELEICGYVPE

AT3G22690.2 INVOLVED IN: photosystem II assembly, regulation of chlorophyll biosynthetic process, photosystem I assembly, thylakoid membrane organization, RNA modification6.1e-10240.4Show/hide
Query:  ARHCRNGQLAEAAAELTRMRLAGVEPNHITFITLISGCADFPSRSLYFGSSLHGYARKFGLDTGHVMVGTALIGMYSKFGQLGLGRKVFDYLGMKNSVTW
        + + R G   EA      M  +GV P+ I+ ++ IS C+    R++ +G S HGY  + G ++    +  ALI MY K  +     ++FD +  K  VTW
Subjt:  ARHCRNGQLAEAAAELTRMRLAGVEPNHITFITLISGCADFPSRSLYFGSSLHGYARKFGLDTGHVMVGTALIGMYSKFGQLGLGRKVFDYLGMKNSVTW

Query:  NTMLDGYMRNGEVEFALNLFDEMPTRDAISWTALINGLLKQGYSEQALECFHLMQC-SGVEPDYVSVIAVLAACADLGTLTLGLWVNRFVMQQEFKGNIR
        N+++ GY+ NGEV+ A   F+ MP ++ +SW  +I+GL++    E+A+E F  MQ   GV  D V+++++ +AC  LG L L  W+  ++ +   + ++R
Subjt:  NTMLDGYMRNGEVEFALNLFDEMPTRDAISWTALINGLLKQGYSEQALECFHLMQC-SGVEPDYVSVIAVLAACADLGTLTLGLWVNRFVMQQEFKGNIR

Query:  VNNSLIDMYSRCGCIGFARQVFEKMPNRTLVSWNSIIVGFAVNSYAEESLEFFDAMQREGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRMHRITPR
        +  +L+DM+SRCG    A  +F  + NR + +W + I   A+   AE ++E FD M  +G KPDGV++ GALTACSH GLV +G E+F +M ++H ++P 
Subjt:  VNNSLIDMYSRCGCIGFARQVFEKMPNRTLVSWNSIIVGFAVNSYAEESLEFFDAMQREGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRMHRITPR

Query:  IEHYGCIVDLYGRAGRLEEALSVIEKMPLKPNEVVLGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYASIGKWEGANKVRRTMKARGVQK
          HYGC+VDL GRAG LEEA+ +IE MP++PN+V+  SLLAACR  G+V +A    + +  L P    +YVLLSN+YAS G+W    KVR +MK +G++K
Subjt:  IEHYGCIVDLYGRAGRLEEALSVIEKMPLKPNEVVLGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYASIGKWEGANKVRRTMKARGVQK

Query:  KPGFSSVEIDGKVHEFVSGDKYHDDADNIYSMLGLLFHELEICGYVPE
         PG SS++I GK HEF SGD+ H +  NI +ML  +       G+VP+
Subjt:  KPGFSSVEIDGKVHEFVSGDKYHDDADNIYSMLGLLFHELEICGYVPE

AT4G14820.1 Pentatricopeptide repeat (PPR) superfamily protein1.6e-9736.31Show/hide
Query:  NSSNHKFTANSIDPIVVWTSSIARHCRNGQLAEAAAELTRMRLAGVEPNHITFITLISGCADFPSRSLYFGSSLHGY--ARKFGLDTGHVMVGTALIGMY
        N + + F   S   +V W + I R+CR G + EA      M+ + V P+ +    ++S C    + ++ +  +++ +       +DT H++  TAL+ MY
Subjt:  NSSNHKFTANSIDPIVVWTSSIARHCRNGQLAEAAAELTRMRLAGVEPNHITFITLISGCADFPSRSLYFGSSLHGY--ARKFGLDTGHVMVGTALIGMY

Query:  SKFGQLGLGRKVFDYLGMKNSVTWNTMLDGYMRNGEVEFALNLFDEMPTRDAISWTALINGLLKQGYSEQALECFHLMQCSGVEPDYVSVIAVLAACADL
        +  G + + R+ F  + ++N      M+ GY + G ++ A  +FD+   +D + WT +I+  ++  Y ++AL  F  M CSG++PD VS+ +V++ACA+L
Subjt:  SKFGQLGLGRKVFDYLGMKNSVTWNTMLDGYMRNGEVEFALNLFDEMPTRDAISWTALINGLLKQGYSEQALECFHLMQCSGVEPDYVSVIAVLAACADL

Query:  GTLTLGLWVNRFVMQQEFKGNIRVNNSLIDMYSRCGCIGFARQVFEKMPNRTLVSWNSIIVGFAVNSYAEESLEFFDAMQREGFKPDGVSYTGALTACSH
        G L    WV+  +     +  + +NN+LI+MY++CG +   R VFEKMP R +VSW+S+I   +++  A ++L  F  M++E  +P+ V++ G L  CSH
Subjt:  GTLTLGLWVNRFVMQQEFKGNIRVNNSLIDMYSRCGCIGFARQVFEKMPNRTLVSWNSIIVGFAVNSYAEESLEFFDAMQREGFKPDGVSYTGALTACSH

Query:  AGLVNKGLELFDNMKRMHRITPRIEHYGCIVDLYGRAGRLEEALSVIEKMPLKPNEVVLGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIY
        +GLV +G ++F +M   + ITP++EHYGC+VDL+GRA  L EAL VIE MP+  N V+ GSL++ACR HG++ L +   K + +L+P  D   VL+SNIY
Subjt:  AGLVNKGLELFDNMKRMHRITPRIEHYGCIVDLYGRAGRLEEALSVIEKMPLKPNEVVLGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIY

Query:  ASIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVSGDKYHDDADNIYSMLGLLFHELEICGYVPE
        A   +WE    +RR M+ + V K+ G S ++ +GK HEF+ GDK H  ++ IY+ L  +  +L++ GYVP+
Subjt:  ASIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVSGDKYHDDADNIYSMLGLLFHELEICGYVPE

AT4G37380.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.8e-9637.53Show/hide
Query:  SIDP-IVVWTSSIARHCRNGQLAEAAAELTRMRLAGVEPNHITFITLISGCADFPSRSLYFGSSLHGYARKFGLDTGHVMVGTALIGMYSKFGQLGLGRK
        +IDP + ++T++I     NG   +A     ++  + + PN  TF +L+  C      S   G  +H +  KFGL      V T L+ +Y+K G +   +K
Subjt:  SIDP-IVVWTSSIARHCRNGQLAEAAAELTRMRLAGVEPNHITFITLISGCADFPSRSLYFGSSLHGYARKFGLDTGHVMVGTALIGMYSKFGQLGLGRK

Query:  VFDYLGMKNSVTWNTMLDGYMRNGEVEFALNLFDEMPTRDAISWTALINGLLKQGYSEQALECFHLMQCSG-VEPDYVSVIAVLAACADLGTLTLGLWVN
        VFD +  ++ V+   M+  Y + G VE A  LFD M  RD +SW  +I+G  + G+   AL  F  +   G  +PD ++V+A L+AC+ +G L  G W++
Subjt:  VFDYLGMKNSVTWNTMLDGYMRNGEVEFALNLFDEMPTRDAISWTALINGLLKQGYSEQALECFHLMQCSG-VEPDYVSVIAVLAACADLGTLTLGLWVN

Query:  RFVMQQEFKGNIRVNNSLIDMYSRCGCIGFARQVFEKMPNRTLVSWNSIIVGFAVNSYAEESLEFFDAMQR-EGFKPDGVSYTGALTACSHAGLVNKGLE
         FV     + N++V   LIDMYS+CG +  A  VF   P + +V+WN++I G+A++ Y++++L  F+ MQ   G +P  +++ G L AC+HAGLVN+G+ 
Subjt:  RFVMQQEFKGNIRVNNSLIDMYSRCGCIGFARQVFEKMPNRTLVSWNSIIVGFAVNSYAEESLEFFDAMQR-EGFKPDGVSYTGALTACSHAGLVNKGLE

Query:  LFDNMKRMHRITPRIEHYGCIVDLYGRAGRLEEALSVIEKMPLKPNEVVLGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYASIGKWEGA
        +F++M + + I P+IEHYGC+V L GRAG+L+ A   I+ M +  + V+  S+L +C+ HGD  L + + ++L  L+      YVLLSNIYAS+G +EG 
Subjt:  LFDNMKRMHRITPRIEHYGCIVDLYGRAGRLEEALSVIEKMPLKPNEVVLGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYASIGKWEGA

Query:  NKVRRTMKARGVQKKPGFSSVEIDGKVHEFVSGDKYHDDADNIYSMLGLLFHELEICGYVPEKDESSKD
         KVR  MK +G+ K+PG S++EI+ KVHEF +GD+ H  +  IY+ML  +   ++  GYVP  +   +D
Subjt:  NKVRRTMKARGVQKKPGFSSVEIDGKVHEFVSGDKYHDDADNIYSMLGLLFHELEICGYVPEKDESSKD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACAGCATTTCAGCGCACACCGCGATTCCGACCCGATTCCGACCCGATTCCAACCCGACTTCCCCAACCAAAATCATCTTCCCTCGCTCTGCCAATTCCTCAAATCA
CAAATTTACCGCAAATTCTATTGACCCAATTGTTGTATGGACCTCTTCGATTGCTCGCCACTGCCGCAACGGCCAATTGGCCGAAGCCGCCGCCGAGCTTACGCGCATGA
GGCTCGCCGGAGTTGAGCCAAACCACATCACATTCATTACACTCATCTCCGGCTGCGCTGATTTTCCCTCACGCAGCCTCTACTTCGGCTCTTCGCTTCACGGTTACGCC
CGTAAGTTCGGGTTGGATACAGGGCATGTAATGGTGGGCACTGCTCTAATTGGTATGTATTCTAAATTTGGTCAGTTGGGTCTTGGTAGGAAGGTTTTTGATTATCTGGG
TATGAAAAACTCCGTCACTTGGAACACTATGCTTGATGGGTACATGAGGAATGGAGAGGTTGAGTTTGCCCTCAACCTGTTCGATGAAATGCCTACGAGAGATGCGATTT
CTTGGACGGCTTTGATTAATGGTCTTTTGAAACAGGGGTACTCGGAACAAGCATTGGAGTGTTTTCATCTCATGCAATGCTCGGGTGTTGAGCCTGATTATGTGTCTGTA
ATTGCTGTTCTTGCTGCTTGTGCTGATTTGGGTACACTTACTTTAGGGTTATGGGTTAATCGGTTCGTTATGCAGCAGGAATTTAAGGGTAATATTAGGGTAAATAATTC
TTTGATAGATATGTATTCTCGTTGTGGATGTATTGGGTTTGCCCGCCAAGTGTTTGAGAAAATGCCCAACCGAACTTTAGTGTCTTGGAACTCTATTATTGTGGGATTTG
CTGTTAATAGTTATGCAGAGGAGTCTCTGGAGTTTTTTGATGCAATGCAAAGGGAAGGATTCAAGCCAGATGGGGTTAGCTACACAGGCGCTCTTACGGCGTGTAGCCAT
GCGGGATTAGTGAATAAGGGGCTGGAATTGTTTGATAACATGAAGAGAATGCACAGAATTACTCCTAGGATTGAGCATTATGGGTGCATTGTTGACCTCTACGGCCGCGC
AGGAAGGTTGGAGGAAGCATTGAGTGTCATCGAGAAGATGCCGTTGAAACCTAATGAAGTTGTACTCGGGTCGCTACTGGCTGCCTGCAGGACTCATGGTGATGTGAGTC
TGGCTGAAAGGTTGATGAAACATCTCTTTAAGTTGGACCCTGGAGGCGATTCGAATTATGTGCTCCTTTCGAACATATACGCATCAATTGGGAAGTGGGAAGGTGCTAAC
AAGGTCCGGAGAACAATGAAAGCTCGAGGCGTGCAGAAAAAACCCGGGTTTAGTTCTGTAGAGATTGATGGTAAGGTTCACGAGTTTGTTTCTGGTGATAAATACCATGA
TGATGCAGACAATATTTACTCGATGTTAGGGTTGTTGTTTCATGAACTAGAGATATGTGGCTATGTTCCTGAAAAAGATGAATCTAGTAAAGATCAATGA
mRNA sequenceShow/hide mRNA sequence
GTTTTCTCTCTTGACTTAATTCTAAGTATCCTCTTTCCCAAAATGATCTTATAAATCAATTTTCTTTAAAGAAATATCTCATCCTTCCTTACAAACTCCCCTTATTTTAA
TATTAAATCCCGTTCCCGCTCTATCCCCAACCGGAGCGATGAACAGCATTTCAGCGCACACCGCGATTCCGACCCGATTCCGACCCGATTCCAACCCGACTTCCCCAACC
AAAATCATCTTCCCTCGCTCTGCCAATTCCTCAAATCACAAATTTACCGCAAATTCTATTGACCCAATTGTTGTATGGACCTCTTCGATTGCTCGCCACTGCCGCAACGG
CCAATTGGCCGAAGCCGCCGCCGAGCTTACGCGCATGAGGCTCGCCGGAGTTGAGCCAAACCACATCACATTCATTACACTCATCTCCGGCTGCGCTGATTTTCCCTCAC
GCAGCCTCTACTTCGGCTCTTCGCTTCACGGTTACGCCCGTAAGTTCGGGTTGGATACAGGGCATGTAATGGTGGGCACTGCTCTAATTGGTATGTATTCTAAATTTGGT
CAGTTGGGTCTTGGTAGGAAGGTTTTTGATTATCTGGGTATGAAAAACTCCGTCACTTGGAACACTATGCTTGATGGGTACATGAGGAATGGAGAGGTTGAGTTTGCCCT
CAACCTGTTCGATGAAATGCCTACGAGAGATGCGATTTCTTGGACGGCTTTGATTAATGGTCTTTTGAAACAGGGGTACTCGGAACAAGCATTGGAGTGTTTTCATCTCA
TGCAATGCTCGGGTGTTGAGCCTGATTATGTGTCTGTAATTGCTGTTCTTGCTGCTTGTGCTGATTTGGGTACACTTACTTTAGGGTTATGGGTTAATCGGTTCGTTATG
CAGCAGGAATTTAAGGGTAATATTAGGGTAAATAATTCTTTGATAGATATGTATTCTCGTTGTGGATGTATTGGGTTTGCCCGCCAAGTGTTTGAGAAAATGCCCAACCG
AACTTTAGTGTCTTGGAACTCTATTATTGTGGGATTTGCTGTTAATAGTTATGCAGAGGAGTCTCTGGAGTTTTTTGATGCAATGCAAAGGGAAGGATTCAAGCCAGATG
GGGTTAGCTACACAGGCGCTCTTACGGCGTGTAGCCATGCGGGATTAGTGAATAAGGGGCTGGAATTGTTTGATAACATGAAGAGAATGCACAGAATTACTCCTAGGATT
GAGCATTATGGGTGCATTGTTGACCTCTACGGCCGCGCAGGAAGGTTGGAGGAAGCATTGAGTGTCATCGAGAAGATGCCGTTGAAACCTAATGAAGTTGTACTCGGGTC
GCTACTGGCTGCCTGCAGGACTCATGGTGATGTGAGTCTGGCTGAAAGGTTGATGAAACATCTCTTTAAGTTGGACCCTGGAGGCGATTCGAATTATGTGCTCCTTTCGA
ACATATACGCATCAATTGGGAAGTGGGAAGGTGCTAACAAGGTCCGGAGAACAATGAAAGCTCGAGGCGTGCAGAAAAAACCCGGGTTTAGTTCTGTAGAGATTGATGGT
AAGGTTCACGAGTTTGTTTCTGGTGATAAATACCATGATGATGCAGACAATATTTACTCGATGTTAGGGTTGTTGTTTCATGAACTAGAGATATGTGGCTATGTTCCTGA
AAAAGATGAATCTAGTAAAGATCAATGA
Protein sequenceShow/hide protein sequence
MNSISAHTAIPTRFRPDSNPTSPTKIIFPRSANSSNHKFTANSIDPIVVWTSSIARHCRNGQLAEAAAELTRMRLAGVEPNHITFITLISGCADFPSRSLYFGSSLHGYA
RKFGLDTGHVMVGTALIGMYSKFGQLGLGRKVFDYLGMKNSVTWNTMLDGYMRNGEVEFALNLFDEMPTRDAISWTALINGLLKQGYSEQALECFHLMQCSGVEPDYVSV
IAVLAACADLGTLTLGLWVNRFVMQQEFKGNIRVNNSLIDMYSRCGCIGFARQVFEKMPNRTLVSWNSIIVGFAVNSYAEESLEFFDAMQREGFKPDGVSYTGALTACSH
AGLVNKGLELFDNMKRMHRITPRIEHYGCIVDLYGRAGRLEEALSVIEKMPLKPNEVVLGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYASIGKWEGAN
KVRRTMKARGVQKKPGFSSVEIDGKVHEFVSGDKYHDDADNIYSMLGLLFHELEICGYVPEKDESSKDQ