CuGenDBv2

Moc03g23100 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc03g23100
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Pentatricopeptide repeat-containing protein
Genome location: chr3:16345800..16347377
RNA-Seq Expression: Moc03g23100
Synteny: Moc03g23100
Gene Ontology terms:
GO:0009451 - RNA modification (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domains:
IPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
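
The Genome location field above uses a `seqid:start..end` notation. As a minimal sketch, the snippet below splits such a string into its parts and computes the span length. It assumes the coordinates are 1-based and inclusive, which this page does not state; `Region` and `parse_region` are hypothetical helper names, not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class Region(NamedTuple):
    """A genomic interval; coordinates ASSUMED 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive coordinates: both endpoints count toward the length.
        return self.end - self.start + 1


def parse_region(text: str) -> Region:
    """Parse a 'seqid:start..end' string such as 'chr3:16345800..16347377'."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"not a seqid:start..end string: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError(f"start {start} is greater than end {end}")
    return Region(seqid, start, end)


region = parse_region("chr3:16345800..16347377")
print(region.seqid, region.start, region.end, region.length)
```

If the coordinates were instead 0-based and half-open, the length would be `end - start`, so the convention is worth confirming before relying on the number.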


Homology
GenBank top hits | e value | % identity | Alignment
XP_004139593.1 pentatricopeptide repeat-containing protein At1g05750, chloroplastic [Cucumis sativus] | e value: 3.1e-252 | % identity: 82.63
Query:  MSSIPANTAAAVQLQQYPNPATSIPLPNPETINFPRSENSSNRHISSKSTGNSIDPIVLWTSSIARYCRNGQLAEAAAEFTSMRLAGVEPNHVTLITLLS
        MSSIP++TA   QLQ  P   +SIPL NP  +NFPRS NS +R+ISSK   NS+DPIVLWTSS+ARYCRNGQL+EAAAEFT MRLAGVEPNH+T ITLLS
Subjt:  MSSIPANTAAAVQLQQYPNPATSIPLPNPETINFPRSENSSNRHISSKSTGNSIDPIVLWTSSIARYCRNGQLAEAAAEFTSMRLAGVEPNHVTLITLLS

Query:  GCADFPSESLYFGSSLHGYARKLGLDTAHVMVGTSIVDMYAKCAQLGLARRVFDYLRMKNSVSWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALIN
         CADFPSES +F SSLHGYA K GLDT HVMVGT+++DMY+KCAQLG AR+VF  L +KNSVSWNTML+G+ RNGEIELAI LFDEMPTRDAISWTALIN
Subjt:  GCADFPSESLYFGSSLHGYARKLGLDTAHVMVGTSIVDMYAKCAQLGLARRVFDYLRMKNSVSWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALIN

Query:  GLLKQGYSEQALECFHQMQCSGIKPDYVSIIAVLAACADLGTLTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIGFARQVFERMSKRTLVSWNSII
        GLLK GYSEQALECFHQMQ SG+  DYVSIIAVLAACADLG LTLGLWV+RFVM QEFKDNI+ISNSLIDMYSRCGCI FARQVF +M+KRTLVSWNSII
Subjt:  GLLKQGYSEQALECFHQMQCSGIKPDYVSIIAVLAACADLGTLTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIGFARQVFERMSKRTLVSWNSII

Query:  VGYAANGFADESLEFFDAMQKEGFKPDAVSYTGALTACSHAGLVNKGLELFDNMKRVHRIIPRIEHYGCIVDLYGRAGRLEDALDVIEKMPMKPNEVVLG
        VG+A NGFADESLEFF AMQKEGFKPD VSYTGALTACSHAGLVNKGLELFDNMK VH+I PRIEHYGCIVDLYGRAGRLEDAL++IE+MPMKPNEVVLG
Subjt:  VGYAANGFADESLEFFDAMQKEGFKPDAVSYTGALTACSHAGLVNKGLELFDNMKRVHRIIPRIEHYGCIVDLYGRAGRLEDALDVIEKMPMKPNEVVLG

Query:  SLLAACRTHGDVSLAERLMKHLSKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADADSIYSMLQLLV
        SLLAACRTHGDV+LAERLMKHL KLDP GD+ YVLLSNIYAAIGKW+GAN VRRTMKARGVQKKPG+SSVEIDGKVHEFVAGD YHADAD+IYSML LL 
Subjt:  SLLAACRTHGDVSLAERLMKHLSKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADADSIYSMLQLLV

Query:  HELKICGSVPETETFLNTKESSKD
        HELK+CG VP ++T LNTKES+KD
Subjt:  HELKICGSVPETETFLNTKESSKD

XP_022142716.1 pentatricopeptide repeat-containing protein At1g05750, chloroplastic [Momordica charantia] | e value: 2.9e-306 | % identity: 100
Query:  MSSIPANTAAAVQLQQYPNPATSIPLPNPETINFPRSENSSNRHISSKSTGNSIDPIVLWTSSIARYCRNGQLAEAAAEFTSMRLAGVEPNHVTLITLLS
        MSSIPANTAAAVQLQQYPNPATSIPLPNPETINFPRSENSSNRHISSKSTGNSIDPIVLWTSSIARYCRNGQLAEAAAEFTSMRLAGVEPNHVTLITLLS
Subjt:  MSSIPANTAAAVQLQQYPNPATSIPLPNPETINFPRSENSSNRHISSKSTGNSIDPIVLWTSSIARYCRNGQLAEAAAEFTSMRLAGVEPNHVTLITLLS

Query:  GCADFPSESLYFGSSLHGYARKLGLDTAHVMVGTSIVDMYAKCAQLGLARRVFDYLRMKNSVSWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALIN
        GCADFPSESLYFGSSLHGYARKLGLDTAHVMVGTSIVDMYAKCAQLGLARRVFDYLRMKNSVSWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALIN
Subjt:  GCADFPSESLYFGSSLHGYARKLGLDTAHVMVGTSIVDMYAKCAQLGLARRVFDYLRMKNSVSWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALIN

Query:  GLLKQGYSEQALECFHQMQCSGIKPDYVSIIAVLAACADLGTLTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIGFARQVFERMSKRTLVSWNSII
        GLLKQGYSEQALECFHQMQCSGIKPDYVSIIAVLAACADLGTLTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIGFARQVFERMSKRTLVSWNSII
Subjt:  GLLKQGYSEQALECFHQMQCSGIKPDYVSIIAVLAACADLGTLTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIGFARQVFERMSKRTLVSWNSII

Query:  VGYAANGFADESLEFFDAMQKEGFKPDAVSYTGALTACSHAGLVNKGLELFDNMKRVHRIIPRIEHYGCIVDLYGRAGRLEDALDVIEKMPMKPNEVVLG
        VGYAANGFADESLEFFDAMQKEGFKPDAVSYTGALTACSHAGLVNKGLELFDNMKRVHRIIPRIEHYGCIVDLYGRAGRLEDALDVIEKMPMKPNEVVLG
Subjt:  VGYAANGFADESLEFFDAMQKEGFKPDAVSYTGALTACSHAGLVNKGLELFDNMKRVHRIIPRIEHYGCIVDLYGRAGRLEDALDVIEKMPMKPNEVVLG

Query:  SLLAACRTHGDVSLAERLMKHLSKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADADSIYSMLQLLV
        SLLAACRTHGDVSLAERLMKHLSKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADADSIYSMLQLLV
Subjt:  SLLAACRTHGDVSLAERLMKHLSKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADADSIYSMLQLLV

Query:  HELKICGSVPETETFLNTKESSKDH
        HELKICGSVPETETFLNTKESSKDH
Subjt:  HELKICGSVPETETFLNTKESSKDH

XP_022967078.1 pentatricopeptide repeat-containing protein At1g05750, chloroplastic [Cucurbita maxima] | e value: 8.2e-253 | % identity: 81.97
Query:  MSSIPANTAA--AVQLQQYPNPATSIPLPNPETINFPRSENSSNRHISSKSTGNSIDPIVLWTSSIARYCRNGQLAEAAAEFTSMRLAGVEPNHVTLITL
        MSS+P++T      QLQQY NP + IP  NP  ++FPR+ NSS          N I PIVLWTSSIARYCRN QLAEAAAEFT MRLAGVEPNH+T ITL
Subjt:  MSSIPANTAA--AVQLQQYPNPATSIPLPNPETINFPRSENSSNRHISSKSTGNSIDPIVLWTSSIARYCRNGQLAEAAAEFTSMRLAGVEPNHVTLITL

Query:  LSGCADFPSESLYFGSSLHGYARKLGLDTAHVMVGTSIVDMYAKCAQLGLARRVFDYLRMKNSVSWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTAL
        LSGCADFPS SL+FG+SLHGY RKLGLDT HVMVGT+++ MYAKCAQLGLAR VFDYL MKNSV+WNTMLDGY RNGEIELAI+LFDEMPTRDAISWTAL
Subjt:  LSGCADFPSESLYFGSSLHGYARKLGLDTAHVMVGTSIVDMYAKCAQLGLARRVFDYLRMKNSVSWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTAL

Query:  INGLLKQGYSEQALECFHQMQCSGIKPDYVSIIAVLAACADLGTLTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIGFARQVFERMSKRTLVSWNS
        ING LKQGYSEQALECFH+MQCSGI+PDYVSIIAVLAACADLG L+ GLWVNRF+MQQEFKDNIRISNSLIDMYSRCGCI FARQVF++MSK TLVSWNS
Subjt:  INGLLKQGYSEQALECFHQMQCSGIKPDYVSIIAVLAACADLGTLTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIGFARQVFERMSKRTLVSWNS

Query:  IIVGYAANGFADESLEFFDAMQKEGFKPDAVSYTGALTACSHAGLVNKGLELFDNMKRVHRIIPRIEHYGCIVDLYGRAGRLEDALDVIEKMPMKPNEVV
        +IVG+A NGFADESLEFFDAMQKEGF  D VSYTGALTACSHAGLVNKGLELFDNMKRVHRI PRIEHYGCIVDLY RAGRL++AL+VIE MPMKPNEVV
Subjt:  IIVGYAANGFADESLEFFDAMQKEGFKPDAVSYTGALTACSHAGLVNKGLELFDNMKRVHRIIPRIEHYGCIVDLYGRAGRLEDALDVIEKMPMKPNEVV

Query:  LGSLLAACRTHGDVSLAERLMKHLSKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADADSIYSMLQL
        LGSLLAACRTHGDVSLAERL+K+L +LDPGGDS+YVLLSNIYAA+G+WEGANKVRRTMKARGVQKKPGFSS+EIDGKVHEFVAGDKYH DAD+IYSML++
Subjt:  LGSLLAACRTHGDVSLAERLMKHLSKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADADSIYSMLQL

Query:  LVHELKICGSVPETETFLNTKESSKDH
        L HELKI G VPET TF+N  ESSK++
Subjt:  LVHELKICGSVPETETFLNTKESSKDH

XP_023521260.1 pentatricopeptide repeat-containing protein At1g05750, chloroplastic [Cucurbita pepo subsp. pepo] | e value: 2.0e-251 | % identity: 82.16
Query:  MSSIPANTAA--AVQLQQYPNPATSIPLPNPETINFPRSENSSNRHISSKSTGNSIDPIVLWTSSIARYCRNGQLAEAAAEFTSMRLAGVEPNHVTLITL
        MSS+PA+TA     QLQQY N        NP  +NFPR  NSS          N I PIVLWTSSIARYCRN QL EAAAEFT MRLAGVEPNH+T ITL
Subjt:  MSSIPANTAA--AVQLQQYPNPATSIPLPNPETINFPRSENSSNRHISSKSTGNSIDPIVLWTSSIARYCRNGQLAEAAAEFTSMRLAGVEPNHVTLITL

Query:  LSGCADFPSESLYFGSSLHGYARKLGLDTAHVMVGTSIVDMYAKCAQLGLARRVFDYLRMKNSVSWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTAL
        LSGCADFPS SL+FG+SLHGY RKLGLDT HVMVGT+++ MYAKCAQLGLAR VFDYL MKNSV+WNTMLDGY RNGEIELAI+LFDEMPTRDAISWTAL
Subjt:  LSGCADFPSESLYFGSSLHGYARKLGLDTAHVMVGTSIVDMYAKCAQLGLARRVFDYLRMKNSVSWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTAL

Query:  INGLLKQGYSEQALECFHQMQCSGIKPDYVSIIAVLAACADLGTLTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIGFARQVFERMSKRTLVSWNS
        ING LKQGYSEQALECFH+MQCSGI+PDYVSIIAVLAACADLG L+ GLWVNRF+MQQEFKDNIRISNSLIDMYSRCGCI FARQVF++M KRTLVSWNS
Subjt:  INGLLKQGYSEQALECFHQMQCSGIKPDYVSIIAVLAACADLGTLTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIGFARQVFERMSKRTLVSWNS

Query:  IIVGYAANGFADESLEFFDAMQKEGFKPDAVSYTGALTACSHAGLVNKGLELFDNMKRVHRIIPRIEHYGCIVDLYGRAGRLEDALDVIEKMPMKPNEVV
        +IVG+A NGFADESLEFFDAMQKEGFK D VSYTGALTACSHAGLVNKGLELFDNMKRVHRI PRIEHYGCIVDLY RAGRL++AL+VIE MPMKPNEVV
Subjt:  IIVGYAANGFADESLEFFDAMQKEGFKPDAVSYTGALTACSHAGLVNKGLELFDNMKRVHRIIPRIEHYGCIVDLYGRAGRLEDALDVIEKMPMKPNEVV

Query:  LGSLLAACRTHGDVSLAERLMKHLSKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADADSIYSMLQL
        LGSLLAACRTHGDVSLAERL+K+L +LDPGGDS+YVLLSNIYAA+G+WEGANKVRRTMKARGVQKKPGFSS+EIDGKVHEFVAGDKYHADAD+IYSML++
Subjt:  LGSLLAACRTHGDVSLAERLMKHLSKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADADSIYSMLQL

Query:  LVHELKICGSVPETETFLNTKESSKDH
        L HELKICG VPET T +N  ESSK++
Subjt:  LVHELKICGSVPETETFLNTKESSKDH

XP_038877228.1 pentatricopeptide repeat-containing protein At1g05750, chloroplastic [Benincasa hispida] | e value: 1.9e-270 | % identity: 87.81
Query:  MSSIPANTAAAVQLQQYPNPATSIPLPNPETINFPRSENSSNRHISSKSTGNSIDPIVLWTSSIARYCRNGQLAEAAAEFTSMRLAGVEPNHVTLITLLS
        MSS P+ TA   QLQQYPNP +SIPL NP  +NFPRS NSS+R+ISSK   NSIDPIVLWTSS+ARYCRNGQL+EAA EFT MRLAGVEPNHVT ITLLS
Subjt:  MSSIPANTAAAVQLQQYPNPATSIPLPNPETINFPRSENSSNRHISSKSTGNSIDPIVLWTSSIARYCRNGQLAEAAAEFTSMRLAGVEPNHVTLITLLS

Query:  GCADFPSESLYFGSSLHGYARKLGLDTAHVMVGTSIVDMYAKCAQLGLARRVFDYLRMKNSVSWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALIN
        GC DFPSESL+FGSSLHGYARKLGLDT HVMVGT++VDMYAKCAQ  LAR+VFDYL MKNSV+WNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALIN
Subjt:  GCADFPSESLYFGSSLHGYARKLGLDTAHVMVGTSIVDMYAKCAQLGLARRVFDYLRMKNSVSWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALIN

Query:  GLLKQGYSEQALECFHQMQCSGIKPDYVSIIAVLAACADLGTLTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIGFARQVFERMSKRTLVSWNSII
        GLLKQG+SEQALECFHQMQCSGI+PDYVSIIAVLAACADLG LTLGLWVNRFVMQQEFKDNIRISNSL+DMYSRCGCI FARQVFE+M KRTLVSWNSII
Subjt:  GLLKQGYSEQALECFHQMQCSGIKPDYVSIIAVLAACADLGTLTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIGFARQVFERMSKRTLVSWNSII

Query:  VGYAANGFADESLEFFDAMQKEGFKPDAVSYTGALTACSHAGLVNKGLELFDNMKRVHRIIPRIEHYGCIVDLYGRAGRLEDALDVIEKMPMKPNEVVLG
        VG+A NGFADESLEFFDAMQ EGFKPD VSYTGALTACSHAGLVNKGLELFDNMKR+H+I PRIEHYGCIVDLYGRAGRLEDAL+VIE+MPMKPNEVVLG
Subjt:  VGYAANGFADESLEFFDAMQKEGFKPDAVSYTGALTACSHAGLVNKGLELFDNMKRVHRIIPRIEHYGCIVDLYGRAGRLEDALDVIEKMPMKPNEVVLG

Query:  SLLAACRTHGDVSLAERLMKHLSKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADADSIYSMLQLLV
        SLLAACRT+GDVSLAE+LMKHLSKLDP GDSNYVLLSNIYAAIG+WEGANKVRRTMKARGVQKKPG SSVEIDGKVHEFVAGDKYHADAD+IYSML+LL 
Subjt:  SLLAACRTHGDVSLAERLMKHLSKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADADSIYSMLQLLV

Query:  HELKICGSVPETETFLNTKESSKDH
        HELKI G VP+T   LNTKE SKDH
Subjt:  HELKICGSVPETETFLNTKESSKDH
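
The alignment blocks above follow the usual BLAST-style layout, where the middle line repeats the residue at identical positions, shows `+` for a conservative substitution, and is blank at a mismatch. Assuming that convention holds for this page, the sketch below counts identities and positives for one alignment segment. `segment_stats` is a hypothetical helper name, and the input is the final segment of the Cucumis sativus hit. The result describes that segment only; the % identity reported for a hit is computed over the whole alignment.

```python
def segment_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identical, positives, aligned_length) for one alignment segment.

    ASSUMES a BLAST-style middle line: a letter marks an identical residue,
    '+' marks a conservative substitution, a space marks a mismatch or gap.
    """
    aligned_length = len(query)
    # Trailing spaces are often stripped from the middle line; pad it back out.
    midline = midline.ljust(aligned_length)[:aligned_length]
    identical = sum(1 for ch in midline if ch.isalpha())
    positives = identical + midline.count("+")
    return identical, positives, aligned_length


query = "HELKICGSVPETETFLNTKESSKD"
midline = "HELK+CG VP ++T LNTKES+KD"

identical, positives, length = segment_stats(query, midline)
print(f"identities: {identical}/{length} ({100 * identical / length:.2f}%)")
print(f"positives:  {positives}/{length} ({100 * positives / length:.2f}%)")
```

Summing the per-segment counts over every segment of a hit and dividing once at the end gives the whole-alignment figure; averaging per-segment percentages would weight short segments too heavily.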

TrEMBL top hits | e value | % identity | Alignment
A0A0A0LYD6 Uncharacterized protein | e value: 1.5e-252 | % identity: 82.63
Query:  MSSIPANTAAAVQLQQYPNPATSIPLPNPETINFPRSENSSNRHISSKSTGNSIDPIVLWTSSIARYCRNGQLAEAAAEFTSMRLAGVEPNHVTLITLLS
        MSSIP++TA   QLQ  P   +SIPL NP  +NFPRS NS +R+ISSK   NS+DPIVLWTSS+ARYCRNGQL+EAAAEFT MRLAGVEPNH+T ITLLS
Subjt:  MSSIPANTAAAVQLQQYPNPATSIPLPNPETINFPRSENSSNRHISSKSTGNSIDPIVLWTSSIARYCRNGQLAEAAAEFTSMRLAGVEPNHVTLITLLS

Query:  GCADFPSESLYFGSSLHGYARKLGLDTAHVMVGTSIVDMYAKCAQLGLARRVFDYLRMKNSVSWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALIN
         CADFPSES +F SSLHGYA K GLDT HVMVGT+++DMY+KCAQLG AR+VF  L +KNSVSWNTML+G+ RNGEIELAI LFDEMPTRDAISWTALIN
Subjt:  GCADFPSESLYFGSSLHGYARKLGLDTAHVMVGTSIVDMYAKCAQLGLARRVFDYLRMKNSVSWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALIN

Query:  GLLKQGYSEQALECFHQMQCSGIKPDYVSIIAVLAACADLGTLTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIGFARQVFERMSKRTLVSWNSII
        GLLK GYSEQALECFHQMQ SG+  DYVSIIAVLAACADLG LTLGLWV+RFVM QEFKDNI+ISNSLIDMYSRCGCI FARQVF +M+KRTLVSWNSII
Subjt:  GLLKQGYSEQALECFHQMQCSGIKPDYVSIIAVLAACADLGTLTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIGFARQVFERMSKRTLVSWNSII

Query:  VGYAANGFADESLEFFDAMQKEGFKPDAVSYTGALTACSHAGLVNKGLELFDNMKRVHRIIPRIEHYGCIVDLYGRAGRLEDALDVIEKMPMKPNEVVLG
        VG+A NGFADESLEFF AMQKEGFKPD VSYTGALTACSHAGLVNKGLELFDNMK VH+I PRIEHYGCIVDLYGRAGRLEDAL++IE+MPMKPNEVVLG
Subjt:  VGYAANGFADESLEFFDAMQKEGFKPDAVSYTGALTACSHAGLVNKGLELFDNMKRVHRIIPRIEHYGCIVDLYGRAGRLEDALDVIEKMPMKPNEVVLG

Query:  SLLAACRTHGDVSLAERLMKHLSKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADADSIYSMLQLLV
        SLLAACRTHGDV+LAERLMKHL KLDP GD+ YVLLSNIYAAIGKW+GAN VRRTMKARGVQKKPG+SSVEIDGKVHEFVAGD YHADAD+IYSML LL 
Subjt:  SLLAACRTHGDVSLAERLMKHLSKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADADSIYSMLQLLV

Query:  HELKICGSVPETETFLNTKESSKD
        HELK+CG VP ++T LNTKES+KD
Subjt:  HELKICGSVPETETFLNTKESSKD

A0A5A7UJB6 Pentatricopeptide repeat-containing protein | e value: 3.7e-251 | % identity: 82.67
Query:  MSSIPANTAAAVQLQQYPNPATSIPLPNPETINFPRSENSSNRHISSKSTGNSIDPIVLWTSSIARYCRNGQLAEAAAEFTSMRLAGVEPNHVTLITLLS
        MSSIP++ A+  QLQQ   P++SIPL NP  +NFPRS  S + +I SK T NS+ PIV WTSSIARYC NGQL EAAAEFT MRLAGVEPNH+T ITLLS
Subjt:  MSSIPANTAAAVQLQQYPNPATSIPLPNPETINFPRSENSSNRHISSKSTGNSIDPIVLWTSSIARYCRNGQLAEAAAEFTSMRLAGVEPNHVTLITLLS

Query:  GCADFPSESLYFGSSLHGYARKLGLDTAHVMVGTSIVDMYAKCAQLGLARRVFDYLRMKNSVSWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALIN
        GCADFPSES +F SSLHGYA K GLDT HVMVGT+++DMY+KC+QLGLA++VFDYL +KNSVSWNTML+G+ RNGEIELAI LFDEMPTRDAISWTALIN
Subjt:  GCADFPSESLYFGSSLHGYARKLGLDTAHVMVGTSIVDMYAKCAQLGLARRVFDYLRMKNSVSWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALIN

Query:  GLLKQGYSEQALECFHQMQCSGIKPDYVSIIAVLAACADLGTLTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIGFARQVFERMSKRTLVSWNSII
        GLLK GYSEQALECFHQMQ SG+  DYVSIIAVLAACADLG LT GLWVNRFVMQQEFKDN+RISNSLIDMYSRCGCI FARQVF +M+KRTLVSWNSII
Subjt:  GLLKQGYSEQALECFHQMQCSGIKPDYVSIIAVLAACADLGTLTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIGFARQVFERMSKRTLVSWNSII

Query:  VGYAANGFADESLEFFDAMQKEGFKPDAVSYTGALTACSHAGLVNKGLELFDNMKRVHRIIPRIEHYGCIVDLYGRAGRLEDALDVIEKMPMKPNEVVLG
        VG+A NGFADESLEFF AMQKEGFKPD VSYTGALTACSHAGLVNKGLELFDNMKRVH+I P IEHYGCIVDLYGRAGRLEDA +VIE+MPMKPNEVVLG
Subjt:  VGYAANGFADESLEFFDAMQKEGFKPDAVSYTGALTACSHAGLVNKGLELFDNMKRVHRIIPRIEHYGCIVDLYGRAGRLEDALDVIEKMPMKPNEVVLG

Query:  SLLAACRTHGDVSLAERLMKHLSKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADADSIYSMLQLLV
        SLLAACRTHGDV LAERLMKH+ KLD  GDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKK G+SSVEIDGKVHEFVAGDKYHADAD+IYSML LL 
Subjt:  SLLAACRTHGDVSLAERLMKHLSKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADADSIYSMLQLLV

Query:  HELKICGSVPETETFLNTKESSKDH
        HELK+CG VP+T+  LNTK+S+KDH
Subjt:  HELKICGSVPETETFLNTKESSKDH

A0A6J1CN07 pentatricopeptide repeat-containing protein At1g05750, chloroplastic | e value: 1.4e-306 | % identity: 100
Query:  MSSIPANTAAAVQLQQYPNPATSIPLPNPETINFPRSENSSNRHISSKSTGNSIDPIVLWTSSIARYCRNGQLAEAAAEFTSMRLAGVEPNHVTLITLLS
        MSSIPANTAAAVQLQQYPNPATSIPLPNPETINFPRSENSSNRHISSKSTGNSIDPIVLWTSSIARYCRNGQLAEAAAEFTSMRLAGVEPNHVTLITLLS
Subjt:  MSSIPANTAAAVQLQQYPNPATSIPLPNPETINFPRSENSSNRHISSKSTGNSIDPIVLWTSSIARYCRNGQLAEAAAEFTSMRLAGVEPNHVTLITLLS

Query:  GCADFPSESLYFGSSLHGYARKLGLDTAHVMVGTSIVDMYAKCAQLGLARRVFDYLRMKNSVSWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALIN
        GCADFPSESLYFGSSLHGYARKLGLDTAHVMVGTSIVDMYAKCAQLGLARRVFDYLRMKNSVSWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALIN
Subjt:  GCADFPSESLYFGSSLHGYARKLGLDTAHVMVGTSIVDMYAKCAQLGLARRVFDYLRMKNSVSWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALIN

Query:  GLLKQGYSEQALECFHQMQCSGIKPDYVSIIAVLAACADLGTLTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIGFARQVFERMSKRTLVSWNSII
        GLLKQGYSEQALECFHQMQCSGIKPDYVSIIAVLAACADLGTLTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIGFARQVFERMSKRTLVSWNSII
Subjt:  GLLKQGYSEQALECFHQMQCSGIKPDYVSIIAVLAACADLGTLTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIGFARQVFERMSKRTLVSWNSII

Query:  VGYAANGFADESLEFFDAMQKEGFKPDAVSYTGALTACSHAGLVNKGLELFDNMKRVHRIIPRIEHYGCIVDLYGRAGRLEDALDVIEKMPMKPNEVVLG
        VGYAANGFADESLEFFDAMQKEGFKPDAVSYTGALTACSHAGLVNKGLELFDNMKRVHRIIPRIEHYGCIVDLYGRAGRLEDALDVIEKMPMKPNEVVLG
Subjt:  VGYAANGFADESLEFFDAMQKEGFKPDAVSYTGALTACSHAGLVNKGLELFDNMKRVHRIIPRIEHYGCIVDLYGRAGRLEDALDVIEKMPMKPNEVVLG

Query:  SLLAACRTHGDVSLAERLMKHLSKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADADSIYSMLQLLV
        SLLAACRTHGDVSLAERLMKHLSKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADADSIYSMLQLLV
Subjt:  SLLAACRTHGDVSLAERLMKHLSKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADADSIYSMLQLLV

Query:  HELKICGSVPETETFLNTKESSKDH
        HELKICGSVPETETFLNTKESSKDH
Subjt:  HELKICGSVPETETFLNTKESSKDH

A0A6J1FZR8 pentatricopeptide repeat-containing protein At1g05750, chloroplastic | e value: 1.7e-251 | % identity: 81.97
Query:  MSSIPANTAA--AVQLQQYPNPATSIPLPNPETINFPRSENSSNRHISSKSTGNSIDPIVLWTSSIARYCRNGQLAEAAAEFTSMRLAGVEPNHVTLITL
        MSS+PA+TA     QLQQY N        NP  +NFPR  NSS          N I PIVLWTSSIARYCRN QL EAAAEFT MRLAGVEPNH+T ITL
Subjt:  MSSIPANTAA--AVQLQQYPNPATSIPLPNPETINFPRSENSSNRHISSKSTGNSIDPIVLWTSSIARYCRNGQLAEAAAEFTSMRLAGVEPNHVTLITL

Query:  LSGCADFPSESLYFGSSLHGYARKLGLDTAHVMVGTSIVDMYAKCAQLGLARRVFDYLRMKNSVSWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTAL
        LSGCADFPS SL+FG+SLHGY RKLGLDT HVMVGT+++ MYAKCAQLGLAR VFDYL MKNSV+WNTMLDGY RNGEIELAI+LFDEMPTRDAISWTAL
Subjt:  LSGCADFPSESLYFGSSLHGYARKLGLDTAHVMVGTSIVDMYAKCAQLGLARRVFDYLRMKNSVSWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTAL

Query:  INGLLKQGYSEQALECFHQMQCSGIKPDYVSIIAVLAACADLGTLTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIGFARQVFERMSKRTLVSWNS
        ING LKQGYSEQAL+CFH+MQCSGI+PDYVSIIAVLAACADLG L+ GLWVNRF+MQQEFKDNIRISNSLIDMYSRCGCI FARQVF++M KRTLVSWNS
Subjt:  INGLLKQGYSEQALECFHQMQCSGIKPDYVSIIAVLAACADLGTLTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIGFARQVFERMSKRTLVSWNS

Query:  IIVGYAANGFADESLEFFDAMQKEGFKPDAVSYTGALTACSHAGLVNKGLELFDNMKRVHRIIPRIEHYGCIVDLYGRAGRLEDALDVIEKMPMKPNEVV
        +IVG+A NGFADESLEFFDAMQKEGFK D VSYTGALTACSHAGLVNKGLELFDNMKRVHRI PRIEHYGCIVDLY RAGRL++AL+VIE MPMKPNEVV
Subjt:  IIVGYAANGFADESLEFFDAMQKEGFKPDAVSYTGALTACSHAGLVNKGLELFDNMKRVHRIIPRIEHYGCIVDLYGRAGRLEDALDVIEKMPMKPNEVV

Query:  LGSLLAACRTHGDVSLAERLMKHLSKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADADSIYSMLQL
        LGSLLAACRTHGDVSLAERL+K+L +LDPGGDS+YVLLSNIYAA+G+WEGAN VRRTMKARGVQKKPGFSS+EIDGKVHEFVAGDKYHADAD+IYSML++
Subjt:  LGSLLAACRTHGDVSLAERLMKHLSKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADADSIYSMLQL

Query:  LVHELKICGSVPETETFLNTKESSKDH
        L HELKICG VPET TF+N  ESSK++
Subjt:  LVHELKICGSVPETETFLNTKESSKDH

A0A6J1HU31 pentatricopeptide repeat-containing protein At1g05750, chloroplastic | e value: 3.9e-253 | % identity: 81.97
Query:  MSSIPANTAA--AVQLQQYPNPATSIPLPNPETINFPRSENSSNRHISSKSTGNSIDPIVLWTSSIARYCRNGQLAEAAAEFTSMRLAGVEPNHVTLITL
        MSS+P++T      QLQQY NP + IP  NP  ++FPR+ NSS          N I PIVLWTSSIARYCRN QLAEAAAEFT MRLAGVEPNH+T ITL
Subjt:  MSSIPANTAA--AVQLQQYPNPATSIPLPNPETINFPRSENSSNRHISSKSTGNSIDPIVLWTSSIARYCRNGQLAEAAAEFTSMRLAGVEPNHVTLITL

Query:  LSGCADFPSESLYFGSSLHGYARKLGLDTAHVMVGTSIVDMYAKCAQLGLARRVFDYLRMKNSVSWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTAL
        LSGCADFPS SL+FG+SLHGY RKLGLDT HVMVGT+++ MYAKCAQLGLAR VFDYL MKNSV+WNTMLDGY RNGEIELAI+LFDEMPTRDAISWTAL
Subjt:  LSGCADFPSESLYFGSSLHGYARKLGLDTAHVMVGTSIVDMYAKCAQLGLARRVFDYLRMKNSVSWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTAL

Query:  INGLLKQGYSEQALECFHQMQCSGIKPDYVSIIAVLAACADLGTLTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIGFARQVFERMSKRTLVSWNS
        ING LKQGYSEQALECFH+MQCSGI+PDYVSIIAVLAACADLG L+ GLWVNRF+MQQEFKDNIRISNSLIDMYSRCGCI FARQVF++MSK TLVSWNS
Subjt:  INGLLKQGYSEQALECFHQMQCSGIKPDYVSIIAVLAACADLGTLTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIGFARQVFERMSKRTLVSWNS

Query:  IIVGYAANGFADESLEFFDAMQKEGFKPDAVSYTGALTACSHAGLVNKGLELFDNMKRVHRIIPRIEHYGCIVDLYGRAGRLEDALDVIEKMPMKPNEVV
        +IVG+A NGFADESLEFFDAMQKEGF  D VSYTGALTACSHAGLVNKGLELFDNMKRVHRI PRIEHYGCIVDLY RAGRL++AL+VIE MPMKPNEVV
Subjt:  IIVGYAANGFADESLEFFDAMQKEGFKPDAVSYTGALTACSHAGLVNKGLELFDNMKRVHRIIPRIEHYGCIVDLYGRAGRLEDALDVIEKMPMKPNEVV

Query:  LGSLLAACRTHGDVSLAERLMKHLSKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADADSIYSMLQL
        LGSLLAACRTHGDVSLAERL+K+L +LDPGGDS+YVLLSNIYAA+G+WEGANKVRRTMKARGVQKKPGFSS+EIDGKVHEFVAGDKYH DAD+IYSML++
Subjt:  LGSLLAACRTHGDVSLAERLMKHLSKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADADSIYSMLQL

Query:  LVHELKICGSVPETETFLNTKESSKDH
        L HELKI G VPET TF+N  ESSK++
Subjt:  LVHELKICGSVPETETFLNTKESSKDH

SwissProt top hits | e value | % identity | Alignment
O82380 Pentatricopeptide repeat-containing protein At2g29760, chloroplastic | e value: 9.2e-98 | % identity: 36.54
Query:  IVLWTSSIARYCRNGQLAEAAAEFTSMRLAGVEPNHVTLITLLSGCADFPSESLYFGSSLHGYARKLGLDTAHVMVGTSIVDMYAKCAQLGLARRVFDYL
        +V W S I  + + G   +A   F  M    V+ +HVT++ +LS CA     +L FG  +  Y  +  ++  ++ +  +++DMY KC  +  A+R+FD +
Subjt:  IVLWTSSIARYCRNGQLAEAAAEFTSMRLAGVEPNHVTLITLLSGCADFPSESLYFGSSLHGYARKLGLDTAHVMVGTSIVDMYAKCAQLGLARRVFDYL

Query:  RMKNSVSWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALINGLLKQGYSEQALECFHQMQC-SGIKPDYVSIIAVLAACADLGTLTLGLWVNRFVMQ
          K++V+W TMLDGY  + + E A ++ + MP +D ++W ALI+   + G   +AL  FH++Q    +K + +++++ L+ACA +G L LG W++ ++ +
Subjt:  RMKNSVSWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALINGLLKQGYSEQALECFHQMQC-SGIKPDYVSIIAVLAACADLGTLTLGLWVNRFVMQ

Query:  QEFKDNIRISNSLIDMYSRCGCIGFARQVFERMSKRTLVSWNSIIVGYAANGFADESLEFFDAMQKEGFKPDAVSYTGALTACSHAGLVNKGLELFDNMK
           + N  ++++LI MYS+CG +  +R+VF  + KR +  W+++I G A +G  +E+++ F  MQ+   KP+ V++T    ACSH GLV++   LF  M+
Subjt:  QEFKDNIRISNSLIDMYSRCGCIGFARQVFERMSKRTLVSWNSIIVGYAANGFADESLEFFDAMQKEGFKPDAVSYTGALTACSHAGLVNKGLELFDNMK

Query:  RVHRIIPRIEHYGCIVDLYGRAGRLEDALDVIEKMPMKPNEVVLGSLLAACRTHGDVSLAERLMKHLSKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRT
          + I+P  +HY CIVD+ GR+G LE A+  IE MP+ P+  V G+LL AC+ H +++LAE     L +L+P  D  +VLLSNIYA +GKWE  +++R+ 
Subjt:  RVHRIIPRIEHYGCIVDLYGRAGRLEDALDVIEKMPMKPNEVVLGSLLAACRTHGDVSLAERLMKHLSKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRT

Query:  MKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADADSIYSMLQLLVHELKICGSVPETETFLNTKESSK
        M+  G++K+PG SS+EIDG +HEF++GD  H  ++ +Y  L  ++ +LK  G  PE    L   E  +
Subjt:  MKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADADSIYSMLQLLVHELKICGSVPETETFLNTKESSK

Q9LSB8 Putative pentatricopeptide repeat-containing protein At3g15930 | e value: 2.2e-99 | % identity: 36.58
Query:  DPIVLWTSSIARYCRNGQLAEAAAEFTSMRLAGVEPNHVTLITLLSGCADFPSESLYFGSSLHGYARKLGLDTAHVMVGTSIVDMYAKCAQLGLARRVFD
        + +  W   I+ Y R  +  E+      M    V P  VTL+ +LS C+    + L     +H Y  +   + + + +  ++V+ YA C ++ +A R+F 
Subjt:  DPIVLWTSSIARYCRNGQLAEAAAEFTSMRLAGVEPNHVTLITLLSGCADFPSESLYFGSSLHGYARKLGLDTAHVMVGTSIVDMYAKCAQLGLARRVFD

Query:  YLRMKNSVSWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALINGLLKQGYSEQALECFHQMQCSGIKPDYVSIIAVLAACADLGTLTLGLWVNRFVM
         ++ ++ +SW +++ GY   G ++LA   FD+MP RD ISWT +I+G L+ G   ++LE F +MQ +G+ PD  ++++VL ACA LG+L +G W+  ++ 
Subjt:  YLRMKNSVSWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALINGLLKQGYSEQALECFHQMQCSGIKPDYVSIIAVLAACADLGTLTLGLWVNRFVM

Query:  QQEFKDNIRISNSLIDMYSRCGCIGFARQVFERMSKRTLVSWNSIIVGYAANGFADESLEFFDAMQKEGFKPDAVSYTGALTACSHAGLVNKGLELFDNM
        + + K+++ + N+LIDMY +CGC   A++VF  M +R   +W +++VG A NG   E+++ F  MQ    +PD ++Y G L+AC+H+G+V++  + F  M
Subjt:  QQEFKDNIRISNSLIDMYSRCGCIGFARQVFERMSKRTLVSWNSIIVGYAANGFADESLEFFDAMQKEGFKPDAVSYTGALTACSHAGLVNKGLELFDNM

Query:  KRVHRIIPRIEHYGCIVDLYGRAGRLEDALDVIEKMPMKPNEVVLGSLLAACRTHGDVSLAERLMKHLSKLDPGGDSNYVLLSNIYAAIGKWEGANKVRR
        +  HRI P + HYGC+VD+ GRAG +++A +++ KMPM PN +V G+LL A R H D  +AE   K + +L+P   + Y LL NIYA   +W+   +VRR
Subjt:  KRVHRIIPRIEHYGCIVDLYGRAGRLEDALDVIEKMPMKPNEVVLGSLLAACRTHGDVSLAERLMKHLSKLDPGGDSNYVLLSNIYAAIGKWEGANKVRR

Query:  TMKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADADSIYSMLQLLVHELKICGSVPETETFL
         +    ++K PGFS +E++G  HEFVAGDK H  ++ IY  L+ L  E      +P+T   L
Subjt:  TMKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADADSIYSMLQLLVHELKICGSVPETETFL

Q9LUJ2 Pentatricopeptide repeat-containing protein At3g22690 | e value: 1.2e-100 | % identity: 38.13
Query:  LWTSSIARYCRNGQLAEAAAEFTSMRLAGVEPNHVTLITLLSGCADFPSESLYFGSSLHGYARKLGLDTAHVMVGTSIVDMYAKCAQLGLARRVFDYLRM
        L  +  + Y R G   EA   F  M  +GV P+ +++++ +S C+     ++ +G S HGY  + G ++    +  +++DMY KC +   A R+FD +  
Subjt:  LWTSSIARYCRNGQLAEAAAEFTSMRLAGVEPNHVTLITLLSGCADFPSESLYFGSSLHGYARKLGLDTAHVMVGTSIVDMYAKCAQLGLARRVFDYLRM

Query:  KNSVSWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALINGLLKQGYSEQALECFHQMQC-SGIKPDYVSIIAVLAACADLGTLTLGLWVNRFVMQQE
        K  V+WN+++ GY  NGE++ A + F+ MP ++ +SW  +I+GL++    E+A+E F  MQ   G+  D V+++++ +AC  LG L L  W+  ++ +  
Subjt:  KNSVSWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALINGLLKQGYSEQALECFHQMQC-SGIKPDYVSIIAVLAACADLGTLTLGLWVNRFVMQQE

Query:  FKDNIRISNSLIDMYSRCGCIGFARQVFERMSKRTLVSWNSIIVGYAANGFADESLEFFDAMQKEGFKPDAVSYTGALTACSHAGLVNKGLELFDNMKRV
         + ++R+  +L+DM+SRCG    A  +F  ++ R + +W + I   A  G A+ ++E FD M ++G KPD V++ GALTACSH GLV +G E+F +M ++
Subjt:  FKDNIRISNSLIDMYSRCGCIGFARQVFERMSKRTLVSWNSIIVGYAANGFADESLEFFDAMQKEGFKPDAVSYTGALTACSHAGLVNKGLELFDNMKRV

Query:  HRIIPRIEHYGCIVDLYGRAGRLEDALDVIEKMPMKPNEVVLGSLLAACRTHGDVSLAERLMKHLSKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMK
        H + P   HYGC+VDL GRAG LE+A+ +IE MPM+PN+V+  SLLAACR  G+V +A    + +  L P    +YVLLSN+YA+ G+W    KVR +MK
Subjt:  HRIIPRIEHYGCIVDLYGRAGRLEDALDVIEKMPMKPNEVVLGSLLAACRTHGDVSLAERLMKHLSKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMK

Query:  ARGVQKKPGFSSVEIDGKVHEFVAGDKYHADADSIYSMLQLLVHELKICGSVPETETFL
         +G++K PG SS++I GK HEF +GD+ H +  +I +ML  +       G VP+    L
Subjt:  ARGVQKKPGFSSVEIDGKVHEFVAGDKYHADADSIYSMLQLLVHELKICGSVPETETFL

Q9MA50 Pentatricopeptide repeat-containing protein At1g05750, chloroplastic | e value: 1.2e-169 | % identity: 60.92
Query:  ENSSNRHISSKSTGNSIDPIVLWTSSIARYCRNGQLAEAAAEFTSMRLAGVEPNHVTLITLLSGCADFPSESLYFGSSLHGYARKLGLDTAHVMVGTSIV
        +N +N  I   +   S +  V WTS I    RNG+LAEAA EF+ M LAGVEPNH+T I LLSGC DF S S   G  LHGYA KLGLD  HVMVGT+I+
Subjt:  ENSSNRHISSKSTGNSIDPIVLWTSSIARYCRNGQLAEAAAEFTSMRLAGVEPNHVTLITLLSGCADFPSESLYFGSSLHGYARKLGLDTAHVMVGTSIV

Query:  DMYAKCAQLGLARRVFDYLRMKNSVSWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALINGLLKQGYSEQALECFHQMQCSGIKPDYVSIIAVLAAC
         MY+K  +   AR VFDY+  KNSV+WNTM+DGY R+G+++ A  +FD+MP RD ISWTA+ING +K+GY E+AL  F +MQ SG+KPDYV+IIA L AC
Subjt:  DMYAKCAQLGLARRVFDYLRMKNSVSWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALINGLLKQGYSEQALECFHQMQCSGIKPDYVSIIAVLAAC

Query:  ADLGTLTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIGFARQVFERMSKRTLVSWNSIIVGYAANGFADESLEFFDAMQKEGFKPDAVSYTGALTA
         +LG L+ GLWV+R+V+ Q+FK+N+R+SNSLID+Y RCGC+ FARQVF  M KRT+VSWNS+IVG+AANG A ESL +F  MQ++GFKPDAV++TGALTA
Subjt:  ADLGTLTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIGFARQVFERMSKRTLVSWNSIIVGYAANGFADESLEFFDAMQKEGFKPDAVSYTGALTA

Query:  CSHAGLVNKGLELFDNMKRVHRIIPRIEHYGCIVDLYGRAGRLEDALDVIEKMPMKPNEVVLGSLLAACRTHG-DVSLAERLMKHLSKLDPGGDSNYVLL
        CSH GLV +GL  F  MK  +RI PRIEHYGC+VDLY RAGRLEDAL +++ MPMKPNEVV+GSLLAAC  HG ++ LAERLMKHL+ L+    SNYV+L
Subjt:  CSHAGLVNKGLELFDNMKRVHRIIPRIEHYGCIVDLYGRAGRLEDALDVIEKMPMKPNEVVLGSLLAACRTHG-DVSLAERLMKHLSKLDPGGDSNYVLL

Query:  SNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADADSIYSMLQLLVHELKICGSVPET
        SN+YAA GKWEGA+K+RR MK  G++K+PGFSS+EID  +H F+AGD  H +   I  +L+L+  +L++ G V ET
Subjt:  SNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADADSIYSMLQLLVHELKICGSVPET

Q9SIT7 Pentatricopeptide repeat-containing protein At2g13600 | e value: 2.7e-97 | % identity: 39.21
Query:  IVLWTSSIARYCRNGQLAEAAAEFTSMRLAGVEPNHVTLITLLSGCADFPSESLYFGSSLHGYARKLGLDTAHVMVGTSIVDMYAKCAQLGLARRVFDYL
        +V W S I  + +NG   EA   F  M  + VEP+ VTL +++S CA     ++  G  +HG   K       +++  + VDMYAKC+++  AR +FD +
Subjt:  IVLWTSSIARYCRNGQLAEAAAEFTSMRLAGVEPNHVTLITLLSGCADFPSESLYFGSSLHGYARKLGLDTAHVMVGTSIVDMYAKCAQLGLARRVFDYL

Query:  RMKNSVSWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALINGLLKQGYSEQALECFHQMQCSGIKPDYVSIIAVLAACADLGTLTLGLWVNRFVMQQ
         ++N ++  +M+ GY      + A  +F +M  R+ +SW ALI G  + G +E+AL  F  ++   + P + S   +L ACADL  L LG+  +  V++ 
Subjt:  RMKNSVSWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALINGLLKQGYSEQALECFHQMQCSGIKPDYVSIIAVLAACADLGTLTLGLWVNRFVMQQ

Query:  EFK------DNIRISNSLIDMYSRCGCIGFARQVFERMSKRTLVSWNSIIVGYAANGFADESLEFFDAMQKEGFKPDAVSYTGALTACSHAGLVNKGLEL
         FK      D+I + NSLIDMY +CGC+     VF +M +R  VSWN++I+G+A NG+ +E+LE F  M + G KPD ++  G L+AC HAG V +G   
Subjt:  EFK------DNIRISNSLIDMYSRCGCIGFARQVFERMSKRTLVSWNSIIVGYAANGFADESLEFFDAMQKEGFKPDAVSYTGALTACSHAGLVNKGLEL

Query:  FDNMKRVHRIIPRIEHYGCIVDLYGRAGRLEDALDVIEKMPMKPNEVVLGSLLAACRTHGDVSLAERLMKHLSKLDPGGDSNYVLLSNIYAAIGKWEGAN
        F +M R   + P  +HY C+VDL GRAG LE+A  +IE+MPM+P+ V+ GSLLAAC+ H +++L + + + L +++P     YVLLSN+YA +GKWE   
Subjt:  FDNMKRVHRIIPRIEHYGCIVDLYGRAGRLEDALDVIEKMPMKPNEVVLGSLLAACRTHGDVSLAERLMKHLSKLDPGGDSNYVLLSNIYAAIGKWEGAN

Query:  KVRRTMKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADADSIYSMLQLLVHELK
         VR++M+  GV K+PG S ++I G  H F+  DK H     I+S+L +L+ E++
Subjt:  KVRRTMKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADADSIYSMLQLLVHELK

Arabidopsis top hits | e value | % identity | Alignment
AT1G05750.1 Tetratricopeptide repeat (TPR)-like superfamily protein | e value: 8.3e-171 | % identity: 60.92
Query:  ENSSNRHISSKSTGNSIDPIVLWTSSIARYCRNGQLAEAAAEFTSMRLAGVEPNHVTLITLLSGCADFPSESLYFGSSLHGYARKLGLDTAHVMVGTSIV
        +N +N  I   +   S +  V WTS I    RNG+LAEAA EF+ M LAGVEPNH+T I LLSGC DF S S   G  LHGYA KLGLD  HVMVGT+I+
Subjt:  ENSSNRHISSKSTGNSIDPIVLWTSSIARYCRNGQLAEAAAEFTSMRLAGVEPNHVTLITLLSGCADFPSESLYFGSSLHGYARKLGLDTAHVMVGTSIV

Query:  DMYAKCAQLGLARRVFDYLRMKNSVSWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALINGLLKQGYSEQALECFHQMQCSGIKPDYVSIIAVLAAC
         MY+K  +   AR VFDY+  KNSV+WNTM+DGY R+G+++ A  +FD+MP RD ISWTA+ING +K+GY E+AL  F +MQ SG+KPDYV+IIA L AC
Subjt:  DMYAKCAQLGLARRVFDYLRMKNSVSWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALINGLLKQGYSEQALECFHQMQCSGIKPDYVSIIAVLAAC

Query:  ADLGTLTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIGFARQVFERMSKRTLVSWNSIIVGYAANGFADESLEFFDAMQKEGFKPDAVSYTGALTA
         +LG L+ GLWV+R+V+ Q+FK+N+R+SNSLID+Y RCGC+ FARQVF  M KRT+VSWNS+IVG+AANG A ESL +F  MQ++GFKPDAV++TGALTA
Subjt:  ADLGTLTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIGFARQVFERMSKRTLVSWNSIIVGYAANGFADESLEFFDAMQKEGFKPDAVSYTGALTA

Query:  CSHAGLVNKGLELFDNMKRVHRIIPRIEHYGCIVDLYGRAGRLEDALDVIEKMPMKPNEVVLGSLLAACRTHG-DVSLAERLMKHLSKLDPGGDSNYVLL
        CSH GLV +GL  F  MK  +RI PRIEHYGC+VDLY RAGRLEDAL +++ MPMKPNEVV+GSLLAAC  HG ++ LAERLMKHL+ L+    SNYV+L
Subjt:  CSHAGLVNKGLELFDNMKRVHRIIPRIEHYGCIVDLYGRAGRLEDALDVIEKMPMKPNEVVLGSLLAACRTHG-DVSLAERLMKHLSKLDPGGDSNYVLL

Query:  SNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADADSIYSMLQLLVHELKICGSVPET
        SN+YAA GKWEGA+K+RR MK  G++K+PGFSS+EID  +H F+AGD  H +   I  +L+L+  +L++ G V ET
Subjt:  SNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADADSIYSMLQLLVHELKICGSVPET

AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein | e value: 6.5e-99 | % identity: 36.54
Query:  IVLWTSSIARYCRNGQLAEAAAEFTSMRLAGVEPNHVTLITLLSGCADFPSESLYFGSSLHGYARKLGLDTAHVMVGTSIVDMYAKCAQLGLARRVFDYL
        +V W S I  + + G   +A   F  M    V+ +HVT++ +LS CA     +L FG  +  Y  +  ++  ++ +  +++DMY KC  +  A+R+FD +
Subjt:  IVLWTSSIARYCRNGQLAEAAAEFTSMRLAGVEPNHVTLITLLSGCADFPSESLYFGSSLHGYARKLGLDTAHVMVGTSIVDMYAKCAQLGLARRVFDYL

Query:  RMKNSVSWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALINGLLKQGYSEQALECFHQMQC-SGIKPDYVSIIAVLAACADLGTLTLGLWVNRFVMQ
          K++V+W TMLDGY  + + E A ++ + MP +D ++W ALI+   + G   +AL  FH++Q    +K + +++++ L+ACA +G L LG W++ ++ +
Subjt:  RMKNSVSWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALINGLLKQGYSEQALECFHQMQC-SGIKPDYVSIIAVLAACADLGTLTLGLWVNRFVMQ

Query:  QEFKDNIRISNSLIDMYSRCGCIGFARQVFERMSKRTLVSWNSIIVGYAANGFADESLEFFDAMQKEGFKPDAVSYTGALTACSHAGLVNKGLELFDNMK
           + N  ++++LI MYS+CG +  +R+VF  + KR +  W+++I G A +G  +E+++ F  MQ+   KP+ V++T    ACSH GLV++   LF  M+
Subjt:  QEFKDNIRISNSLIDMYSRCGCIGFARQVFERMSKRTLVSWNSIIVGYAANGFADESLEFFDAMQKEGFKPDAVSYTGALTACSHAGLVNKGLELFDNMK

Query:  RVHRIIPRIEHYGCIVDLYGRAGRLEDALDVIEKMPMKPNEVVLGSLLAACRTHGDVSLAERLMKHLSKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRT
          + I+P  +HY CIVD+ GR+G LE A+  IE MP+ P+  V G+LL AC+ H +++LAE     L +L+P  D  +VLLSNIYA +GKWE  +++R+ 
Subjt:  RVHRIIPRIEHYGCIVDLYGRAGRLEDALDVIEKMPMKPNEVVLGSLLAACRTHGDVSLAERLMKHLSKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRT

Query:  MKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADADSIYSMLQLLVHELKICGSVPETETFLNTKESSK
        M+  G++K+PG SS+EIDG +HEF++GD  H  ++ +Y  L  ++ +LK  G  PE    L   E  +
Subjt:  MKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADADSIYSMLQLLVHELKICGSVPETETFLNTKESSK

AT3G15930.1 Pentatricopeptide repeat (PPR) superfamily protein | e value: 1.5e-100 | % identity: 36.58
Query:  DPIVLWTSSIARYCRNGQLAEAAAEFTSMRLAGVEPNHVTLITLLSGCADFPSESLYFGSSLHGYARKLGLDTAHVMVGTSIVDMYAKCAQLGLARRVFD
        + +  W   I+ Y R  +  E+      M    V P  VTL+ +LS C+    + L     +H Y  +   + + + +  ++V+ YA C ++ +A R+F 
Subjt:  DPIVLWTSSIARYCRNGQLAEAAAEFTSMRLAGVEPNHVTLITLLSGCADFPSESLYFGSSLHGYARKLGLDTAHVMVGTSIVDMYAKCAQLGLARRVFD

Query:  YLRMKNSVSWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALINGLLKQGYSEQALECFHQMQCSGIKPDYVSIIAVLAACADLGTLTLGLWVNRFVM
         ++ ++ +SW +++ GY   G ++LA   FD+MP RD ISWT +I+G L+ G   ++LE F +MQ +G+ PD  ++++VL ACA LG+L +G W+  ++ 
Subjt:  YLRMKNSVSWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALINGLLKQGYSEQALECFHQMQCSGIKPDYVSIIAVLAACADLGTLTLGLWVNRFVM

Query:  QQEFKDNIRISNSLIDMYSRCGCIGFARQVFERMSKRTLVSWNSIIVGYAANGFADESLEFFDAMQKEGFKPDAVSYTGALTACSHAGLVNKGLELFDNM
        + + K+++ + N+LIDMY +CGC   A++VF  M +R   +W +++VG A NG   E+++ F  MQ    +PD ++Y G L+AC+H+G+V++  + F  M
Subjt:  QQEFKDNIRISNSLIDMYSRCGCIGFARQVFERMSKRTLVSWNSIIVGYAANGFADESLEFFDAMQKEGFKPDAVSYTGALTACSHAGLVNKGLELFDNM

Query:  KRVHRIIPRIEHYGCIVDLYGRAGRLEDALDVIEKMPMKPNEVVLGSLLAACRTHGDVSLAERLMKHLSKLDPGGDSNYVLLSNIYAAIGKWEGANKVRR
        +  HRI P + HYGC+VD+ GRAG +++A +++ KMPM PN +V G+LL A R H D  +AE   K + +L+P   + Y LL NIYA   +W+   +VRR
Subjt:  KRVHRIIPRIEHYGCIVDLYGRAGRLEDALDVIEKMPMKPNEVVLGSLLAACRTHGDVSLAERLMKHLSKLDPGGDSNYVLLSNIYAAIGKWEGANKVRR

Query:  TMKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADADSIYSMLQLLVHELKICGSVPETETFL
         +    ++K PGFS +E++G  HEFVAGDK H  ++ IY  L+ L  E      +P+T   L
Subjt:  TMKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADADSIYSMLQLLVHELKICGSVPETETFL

AT3G22690.1 CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012881), Pentatricopeptide repeat (InterPro:IPR002885)    e value: 8.2e-102    %identity: 38.13
Query:  LWTSSIARYCRNGQLAEAAAEFTSMRLAGVEPNHVTLITLLSGCADFPSESLYFGSSLHGYARKLGLDTAHVMVGTSIVDMYAKCAQLGLARRVFDYLRM
        L  +  + Y R G   EA   F  M  +GV P+ +++++ +S C+     ++ +G S HGY  + G ++    +  +++DMY KC +   A R+FD +  
Subjt:  LWTSSIARYCRNGQLAEAAAEFTSMRLAGVEPNHVTLITLLSGCADFPSESLYFGSSLHGYARKLGLDTAHVMVGTSIVDMYAKCAQLGLARRVFDYLRM

Query:  KNSVSWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALINGLLKQGYSEQALECFHQMQC-SGIKPDYVSIIAVLAACADLGTLTLGLWVNRFVMQQE
        K  V+WN+++ GY  NGE++ A + F+ MP ++ +SW  +I+GL++    E+A+E F  MQ   G+  D V+++++ +AC  LG L L  W+  ++ +  
Subjt:  KNSVSWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALINGLLKQGYSEQALECFHQMQC-SGIKPDYVSIIAVLAACADLGTLTLGLWVNRFVMQQE

Query:  FKDNIRISNSLIDMYSRCGCIGFARQVFERMSKRTLVSWNSIIVGYAANGFADESLEFFDAMQKEGFKPDAVSYTGALTACSHAGLVNKGLELFDNMKRV
         + ++R+  +L+DM+SRCG    A  +F  ++ R + +W + I   A  G A+ ++E FD M ++G KPD V++ GALTACSH GLV +G E+F +M ++
Subjt:  FKDNIRISNSLIDMYSRCGCIGFARQVFERMSKRTLVSWNSIIVGYAANGFADESLEFFDAMQKEGFKPDAVSYTGALTACSHAGLVNKGLELFDNMKRV

Query:  HRIIPRIEHYGCIVDLYGRAGRLEDALDVIEKMPMKPNEVVLGSLLAACRTHGDVSLAERLMKHLSKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMK
        H + P   HYGC+VDL GRAG LE+A+ +IE MPM+PN+V+  SLLAACR  G+V +A    + +  L P    +YVLLSN+YA+ G+W    KVR +MK
Subjt:  HRIIPRIEHYGCIVDLYGRAGRLEDALDVIEKMPMKPNEVVLGSLLAACRTHGDVSLAERLMKHLSKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMK

Query:  ARGVQKKPGFSSVEIDGKVHEFVAGDKYHADADSIYSMLQLLVHELKICGSVPETETFL
         +G++K PG SS++I GK HEF +GD+ H +  +I +ML  +       G VP+    L
Subjt:  ARGVQKKPGFSSVEIDGKVHEFVAGDKYHADADSIYSMLQLLVHELKICGSVPETETFL

AT3G22690.2 INVOLVED IN: photosystem II assembly, regulation of chlorophyll biosynthetic process, photosystem I assembly, thylakoid membrane organization, RNA modification    e value: 8.2e-102    %identity: 38.13
Query:  LWTSSIARYCRNGQLAEAAAEFTSMRLAGVEPNHVTLITLLSGCADFPSESLYFGSSLHGYARKLGLDTAHVMVGTSIVDMYAKCAQLGLARRVFDYLRM
        L  +  + Y R G   EA   F  M  +GV P+ +++++ +S C+     ++ +G S HGY  + G ++    +  +++DMY KC +   A R+FD +  
Subjt:  LWTSSIARYCRNGQLAEAAAEFTSMRLAGVEPNHVTLITLLSGCADFPSESLYFGSSLHGYARKLGLDTAHVMVGTSIVDMYAKCAQLGLARRVFDYLRM

Query:  KNSVSWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALINGLLKQGYSEQALECFHQMQC-SGIKPDYVSIIAVLAACADLGTLTLGLWVNRFVMQQE
        K  V+WN+++ GY  NGE++ A + F+ MP ++ +SW  +I+GL++    E+A+E F  MQ   G+  D V+++++ +AC  LG L L  W+  ++ +  
Subjt:  KNSVSWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALINGLLKQGYSEQALECFHQMQC-SGIKPDYVSIIAVLAACADLGTLTLGLWVNRFVMQQE

Query:  FKDNIRISNSLIDMYSRCGCIGFARQVFERMSKRTLVSWNSIIVGYAANGFADESLEFFDAMQKEGFKPDAVSYTGALTACSHAGLVNKGLELFDNMKRV
         + ++R+  +L+DM+SRCG    A  +F  ++ R + +W + I   A  G A+ ++E FD M ++G KPD V++ GALTACSH GLV +G E+F +M ++
Subjt:  FKDNIRISNSLIDMYSRCGCIGFARQVFERMSKRTLVSWNSIIVGYAANGFADESLEFFDAMQKEGFKPDAVSYTGALTACSHAGLVNKGLELFDNMKRV

Query:  HRIIPRIEHYGCIVDLYGRAGRLEDALDVIEKMPMKPNEVVLGSLLAACRTHGDVSLAERLMKHLSKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMK
        H + P   HYGC+VDL GRAG LE+A+ +IE MPM+PN+V+  SLLAACR  G+V +A    + +  L P    +YVLLSN+YA+ G+W    KVR +MK
Subjt:  HRIIPRIEHYGCIVDLYGRAGRLEDALDVIEKMPMKPNEVVLGSLLAACRTHGDVSLAERLMKHLSKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMK

Query:  ARGVQKKPGFSSVEIDGKVHEFVAGDKYHADADSIYSMLQLLVHELKICGSVPETETFL
         +G++K PG SS++I GK HEF +GD+ H +  +I +ML  +       G VP+    L
Subjt:  ARGVQKKPGFSSVEIDGKVHEFVAGDKYHADADSIYSMLQLLVHELKICGSVPETETFL


Sequences
CDS sequence
ATGAGCAGCATTCCGGCGAACACCGCCGCCGCAGTCCAACTCCAACAATATCCTAATCCGGCTACTTCAATCCCACTTCCAAACCCGGAAACAATCAACTTCCCTCGCTC
TGAAAATTCCTCAAATCGCCATATCTCCTCCAAATCCACCGGCAATTCTATCGATCCCATCGTTCTATGGACCTCTTCCATCGCTCGCTACTGCCGCAATGGCCAATTAG
CCGAAGCGGCCGCAGAGTTTACAAGCATGAGACTCGCCGGAGTTGAGCCGAACCACGTCACGTTGATTACGCTTCTCTCCGGCTGTGCTGATTTTCCGTCAGAAAGCCTC
TACTTCGGCTCTTCGCTCCATGGCTACGCCCGTAAATTAGGTTTGGATACAGCGCATGTAATGGTGGGGACTTCTATCGTTGATATGTATGCCAAATGTGCTCAACTGGG
TCTTGCTAGGAGGGTTTTCGATTACCTGAGGATGAAAAACTCTGTTTCTTGGAACACGATGCTCGATGGTTACACGAGGAACGGGGAGATTGAGCTGGCGATTGACCTGT
TTGATGAAATGCCTACGAGAGATGCGATTTCTTGGACGGCTTTGATTAATGGTCTCTTGAAACAGGGGTACTCTGAACAAGCATTGGAGTGCTTCCATCAGATGCAGTGC
TCAGGTATCAAACCTGATTATGTGTCTATAATTGCTGTTCTTGCTGCGTGTGCTGATTTGGGAACGCTTACTTTGGGGTTATGGGTTAATCGGTTCGTTATGCAGCAGGA
GTTCAAGGATAATATTAGGATTAGTAATTCCTTGATAGATATGTATTCTCGATGTGGGTGCATCGGATTTGCCCGCCAAGTGTTTGAGAGAATGTCTAAGCGAACTTTGG
TGTCTTGGAACTCCATTATTGTGGGGTATGCTGCTAATGGGTTTGCAGATGAATCTCTGGAGTTTTTTGATGCAATGCAGAAGGAGGGATTCAAGCCAGATGCAGTTAGC
TATACGGGAGCTCTTACTGCGTGTAGTCATGCTGGCTTAGTGAATAAGGGGTTGGAATTGTTTGATAACATGAAGAGGGTGCACAGAATTATTCCCAGGATCGAGCATTA
TGGATGCATTGTCGACCTCTATGGCCGTGCAGGGAGGTTGGAGGATGCGTTGGATGTGATCGAGAAAATGCCGATGAAACCGAATGAAGTTGTGCTTGGATCGTTGCTGG
CTGCCTGCAGGACTCATGGCGATGTGAGCTTGGCTGAAAGGTTGATGAAACATCTCTCTAAGTTGGACCCGGGAGGCGATTCAAATTATGTGCTCCTTTCGAACATTTAT
GCAGCAATTGGAAAGTGGGAAGGGGCTAACAAGGTCAGGAGAACGATGAAAGCCCGAGGTGTGCAGAAAAAACCGGGGTTTAGTTCCGTTGAGATTGATGGTAAGGTTCA
TGAGTTTGTTGCTGGTGACAAATACCATGCTGATGCAGACAGTATTTATTCTATGTTACAGCTGCTTGTTCATGAACTAAAGATATGTGGCTCTGTCCCTGAAACTGAAA
CCTTTCTGAATACCAAAGAATCTAGTAAAGACCATTGA
mRNA sequence
ATGAGCAGCATTCCGGCGAACACCGCCGCCGCAGTCCAACTCCAACAATATCCTAATCCGGCTACTTCAATCCCACTTCCAAACCCGGAAACAATCAACTTCCCTCGCTC
TGAAAATTCCTCAAATCGCCATATCTCCTCCAAATCCACCGGCAATTCTATCGATCCCATCGTTCTATGGACCTCTTCCATCGCTCGCTACTGCCGCAATGGCCAATTAG
CCGAAGCGGCCGCAGAGTTTACAAGCATGAGACTCGCCGGAGTTGAGCCGAACCACGTCACGTTGATTACGCTTCTCTCCGGCTGTGCTGATTTTCCGTCAGAAAGCCTC
TACTTCGGCTCTTCGCTCCATGGCTACGCCCGTAAATTAGGTTTGGATACAGCGCATGTAATGGTGGGGACTTCTATCGTTGATATGTATGCCAAATGTGCTCAACTGGG
TCTTGCTAGGAGGGTTTTCGATTACCTGAGGATGAAAAACTCTGTTTCTTGGAACACGATGCTCGATGGTTACACGAGGAACGGGGAGATTGAGCTGGCGATTGACCTGT
TTGATGAAATGCCTACGAGAGATGCGATTTCTTGGACGGCTTTGATTAATGGTCTCTTGAAACAGGGGTACTCTGAACAAGCATTGGAGTGCTTCCATCAGATGCAGTGC
TCAGGTATCAAACCTGATTATGTGTCTATAATTGCTGTTCTTGCTGCGTGTGCTGATTTGGGAACGCTTACTTTGGGGTTATGGGTTAATCGGTTCGTTATGCAGCAGGA
GTTCAAGGATAATATTAGGATTAGTAATTCCTTGATAGATATGTATTCTCGATGTGGGTGCATCGGATTTGCCCGCCAAGTGTTTGAGAGAATGTCTAAGCGAACTTTGG
TGTCTTGGAACTCCATTATTGTGGGGTATGCTGCTAATGGGTTTGCAGATGAATCTCTGGAGTTTTTTGATGCAATGCAGAAGGAGGGATTCAAGCCAGATGCAGTTAGC
TATACGGGAGCTCTTACTGCGTGTAGTCATGCTGGCTTAGTGAATAAGGGGTTGGAATTGTTTGATAACATGAAGAGGGTGCACAGAATTATTCCCAGGATCGAGCATTA
TGGATGCATTGTCGACCTCTATGGCCGTGCAGGGAGGTTGGAGGATGCGTTGGATGTGATCGAGAAAATGCCGATGAAACCGAATGAAGTTGTGCTTGGATCGTTGCTGG
CTGCCTGCAGGACTCATGGCGATGTGAGCTTGGCTGAAAGGTTGATGAAACATCTCTCTAAGTTGGACCCGGGAGGCGATTCAAATTATGTGCTCCTTTCGAACATTTAT
GCAGCAATTGGAAAGTGGGAAGGGGCTAACAAGGTCAGGAGAACGATGAAAGCCCGAGGTGTGCAGAAAAAACCGGGGTTTAGTTCCGTTGAGATTGATGGTAAGGTTCA
TGAGTTTGTTGCTGGTGACAAATACCATGCTGATGCAGACAGTATTTATTCTATGTTACAGCTGCTTGTTCATGAACTAAAGATATGTGGCTCTGTCCCTGAAACTGAAA
CCTTTCTGAATACCAAAGAATCTAGTAAAGACCATTGA
Protein sequence
MSSIPANTAAAVQLQQYPNPATSIPLPNPETINFPRSENSSNRHISSKSTGNSIDPIVLWTSSIARYCRNGQLAEAAAEFTSMRLAGVEPNHVTLITLLSGCADFPSESL
YFGSSLHGYARKLGLDTAHVMVGTSIVDMYAKCAQLGLARRVFDYLRMKNSVSWNTMLDGYTRNGEIELAIDLFDEMPTRDAISWTALINGLLKQGYSEQALECFHQMQC
SGIKPDYVSIIAVLAACADLGTLTLGLWVNRFVMQQEFKDNIRISNSLIDMYSRCGCIGFARQVFERMSKRTLVSWNSIIVGYAANGFADESLEFFDAMQKEGFKPDAVS
YTGALTACSHAGLVNKGLELFDNMKRVHRIIPRIEHYGCIVDLYGRAGRLEDALDVIEKMPMKPNEVVLGSLLAACRTHGDVSLAERLMKHLSKLDPGGDSNYVLLSNIY
AAIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADADSIYSMLQLLVHELKICGSVPETETFLNTKESSKDH