; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr019505 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr019505
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationtig00153348:330899..335416
RNA-Seq ExpressionSgr019505
SyntenySgr019505
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7019600.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma]9.4e-16474.5Show/hide
Query:  MGSKAMFKWAKTVMPSHVEQLIRAERDINKALLIFDSATAEYTNGFKHDLNTFRLMISKLVSANQFRLAETLLDRMKEEKFDVTEDIFLTICRAYGRIHK
        MGSKAMFKWAKTV P+HVEQL++AERDINKALLIFDSATAEYTNGFKHDLNTFRLMI KLVSANQFRLAETLLDRMKEEKFDVTEDIFL+ICRAYGR+H+
Subjt:  MGSKAMFKWAKTVMPSHVEQLIRAERDINKALLIFDSATAEYTNGFKHDLNTFRLMISKLVSANQFRLAETLLDRMKEEKFDVTEDIFLTICRAYGRIHK

Query:  PLDSIRVFHKMQDFHCKPTEKSYITVFAILVEENQLKLALRFYRYMRKMGIPPSVASLNVLIKAFCKNRGTMDKAMHVFNEMSNHGCEPDSYSYGTLING
        PLDSIRVFHKMQDFHCKPTEKSYI+VFAILVEENQLKLA RFYRYMRK+GIPP+VASLNVLIKA CKN GTMDKAM++F EMSN GCEPDSY+YGTLING
Subjt:  PLDSIRVFHKMQDFHCKPTEKSYITVFAILVEENQLKLALRFYRYMRKMGIPPSVASLNVLIKAFCKNRGTMDKAMHVFNEMSNHGCEPDSYSYGTLING

Query:  LCRLGKIVEAKELLQEMETKGCSPSVVTYTSLIHGLCQLNNVDEAMRLLEDMMSKGIEPNVFTYSSLMDGFCKAGQSLQLE-------------------
        LCR G IVEAKELLQEME KGCSPSV+TYTS+IHGLCQLNNVDEAM LLEDMMSKGIEPNVFTYSSLMDGFCKAG SL+                     
Subjt:  LCRLGKIVEAKELLQEMETKGCSPSVVTYTSLIHGLCQLNNVDEAMRLLEDMMSKGIEPNVFTYSSLMDGFCKAGQSLQLE-------------------

Query:  -----ASWKGKLNEALEILDRMKLQGLTPDAGL-----------------------------------------THNRVIHGLCTIDESNRAFQLYLSVQ
                +GKLNEALEILDRMKLQGLTPDAGL                                         THNRVIHGLCT+++SNRAFQLYLSV 
Subjt:  -----ASWKGKLNEALEILDRMKLQGLTPDAGL-----------------------------------------THNRVIHGLCTIDESNRAFQLYLSVQ

Query:  TRDI
        TR I
Subjt:  TRDI

XP_022139830.1 pentatricopeptide repeat-containing protein At5g46100 isoform X1 [Momordica charantia]5.5e-16475.12Show/hide
Query:  MGSKAMFKWAKTVMPSHVEQLIRAERDINKALLIFDSATAEYTNGFKHDLNTFRLMISKLVSANQFRLAETLLDRMKEEKFDVTEDIFLTICRAYGRIHK
        MGSKAMFKWAKTV PSHVEQLI+AERDINKALLIFDSAT+EY NGFKHDLNTFRLMISKLVSANQFR AETLLDRM EEKFDVTEDIFLTICRAYGR+HK
Subjt:  MGSKAMFKWAKTVMPSHVEQLIRAERDINKALLIFDSATAEYTNGFKHDLNTFRLMISKLVSANQFRLAETLLDRMKEEKFDVTEDIFLTICRAYGRIHK

Query:  PLDSIRVFHKMQDFHCKPTEKSYITVFAILVEENQLKLALRFYRYMRKMGIPPSVASLNVLIKAFCKNRGTMDKAMHVFNEMSNHGCEPDSYSYGTLING
        PLDSIR+FHKM+DF CKPTEKSYITVFAILVEENQLKLALRFYRYMRKMG PP+VASLNVLIKAFCKN GTMDKAMH+  EMSNHGCEPDSY+YGTLING
Subjt:  PLDSIRVFHKMQDFHCKPTEKSYITVFAILVEENQLKLALRFYRYMRKMGIPPSVASLNVLIKAFCKNRGTMDKAMHVFNEMSNHGCEPDSYSYGTLING

Query:  LCRLGKIVEAKELLQEMETKGCSPSVVTYTSLIHGLCQLNNVDEAMRLLEDMMSKGIEPNVFTYSSLMDGFCKAGQS------LQLEASWK---------
        LC+LGKIVEAKELLQEMETKGCSPSVVTYTSLIHGLCQLNNVDEA+ LLEDMM KGIEPNVFTYSSLMDGFCKAG S      L+L    +         
Subjt:  LCRLGKIVEAKELLQEMETKGCSPSVVTYTSLIHGLCQLNNVDEAMRLLEDMMSKGIEPNVFTYSSLMDGFCKAGQS------LQLEASWK---------

Query:  ---------GKLNEALEILDRMKLQGLTPDAGL-----------------------------------------THNRVIHGLCTIDESNRAFQLYLSVQ
                 GKLNEALEILDRMKLQGL PDAGL                                         THNRVI GLCTI++S+RAFQLYLSVQ
Subjt:  ---------GKLNEALEILDRMKLQGLTPDAGL-----------------------------------------THNRVIHGLCTIDESNRAFQLYLSVQ

Query:  TRDIVV
        TR I +
Subjt:  TRDIVV

XP_022139832.1 pentatricopeptide repeat-containing protein At5g46100 isoform X2 [Momordica charantia]5.5e-16475.12Show/hide
Query:  MGSKAMFKWAKTVMPSHVEQLIRAERDINKALLIFDSATAEYTNGFKHDLNTFRLMISKLVSANQFRLAETLLDRMKEEKFDVTEDIFLTICRAYGRIHK
        MGSKAMFKWAKTV PSHVEQLI+AERDINKALLIFDSAT+EY NGFKHDLNTFRLMISKLVSANQFR AETLLDRM EEKFDVTEDIFLTICRAYGR+HK
Subjt:  MGSKAMFKWAKTVMPSHVEQLIRAERDINKALLIFDSATAEYTNGFKHDLNTFRLMISKLVSANQFRLAETLLDRMKEEKFDVTEDIFLTICRAYGRIHK

Query:  PLDSIRVFHKMQDFHCKPTEKSYITVFAILVEENQLKLALRFYRYMRKMGIPPSVASLNVLIKAFCKNRGTMDKAMHVFNEMSNHGCEPDSYSYGTLING
        PLDSIR+FHKM+DF CKPTEKSYITVFAILVEENQLKLALRFYRYMRKMG PP+VASLNVLIKAFCKN GTMDKAMH+  EMSNHGCEPDSY+YGTLING
Subjt:  PLDSIRVFHKMQDFHCKPTEKSYITVFAILVEENQLKLALRFYRYMRKMGIPPSVASLNVLIKAFCKNRGTMDKAMHVFNEMSNHGCEPDSYSYGTLING

Query:  LCRLGKIVEAKELLQEMETKGCSPSVVTYTSLIHGLCQLNNVDEAMRLLEDMMSKGIEPNVFTYSSLMDGFCKAGQS------LQLEASWK---------
        LC+LGKIVEAKELLQEMETKGCSPSVVTYTSLIHGLCQLNNVDEA+ LLEDMM KGIEPNVFTYSSLMDGFCKAG S      L+L    +         
Subjt:  LCRLGKIVEAKELLQEMETKGCSPSVVTYTSLIHGLCQLNNVDEAMRLLEDMMSKGIEPNVFTYSSLMDGFCKAGQS------LQLEASWK---------

Query:  ---------GKLNEALEILDRMKLQGLTPDAGL-----------------------------------------THNRVIHGLCTIDESNRAFQLYLSVQ
                 GKLNEALEILDRMKLQGL PDAGL                                         THNRVI GLCTI++S+RAFQLYLSVQ
Subjt:  ---------GKLNEALEILDRMKLQGLTPDAGL-----------------------------------------THNRVIHGLCTIDESNRAFQLYLSVQ

Query:  TRDIVV
        TR I +
Subjt:  TRDIVV

XP_022927208.1 pentatricopeptide repeat-containing protein At5g46100 [Cucurbita moschata]9.4e-16474.5Show/hide
Query:  MGSKAMFKWAKTVMPSHVEQLIRAERDINKALLIFDSATAEYTNGFKHDLNTFRLMISKLVSANQFRLAETLLDRMKEEKFDVTEDIFLTICRAYGRIHK
        MGSKAMFKWAKTV P+HVEQL++AERDINKALLIFDSATAEYTNGFKHDLNTFRLMI KLVSANQFRLAETLLDRMKEEKFDVTEDIFL+ICRAYGR+H+
Subjt:  MGSKAMFKWAKTVMPSHVEQLIRAERDINKALLIFDSATAEYTNGFKHDLNTFRLMISKLVSANQFRLAETLLDRMKEEKFDVTEDIFLTICRAYGRIHK

Query:  PLDSIRVFHKMQDFHCKPTEKSYITVFAILVEENQLKLALRFYRYMRKMGIPPSVASLNVLIKAFCKNRGTMDKAMHVFNEMSNHGCEPDSYSYGTLING
        PLDSIRVFHKMQDFHCKPTEKSYI+VFAILVEENQLKLA RFYRYMRK+GIPP+VASLNVLIKA CKN GTMDKAM++F EMSN GCEPDSY+YGTLING
Subjt:  PLDSIRVFHKMQDFHCKPTEKSYITVFAILVEENQLKLALRFYRYMRKMGIPPSVASLNVLIKAFCKNRGTMDKAMHVFNEMSNHGCEPDSYSYGTLING

Query:  LCRLGKIVEAKELLQEMETKGCSPSVVTYTSLIHGLCQLNNVDEAMRLLEDMMSKGIEPNVFTYSSLMDGFCKAGQSLQLE-------------------
        LCR G IVEAKELLQEME KGCSPSV+TYTS+IHGLCQLNNVDEAM LLEDMMSKGIEPNVFTYSSLMDGFCKAG SL+                     
Subjt:  LCRLGKIVEAKELLQEMETKGCSPSVVTYTSLIHGLCQLNNVDEAMRLLEDMMSKGIEPNVFTYSSLMDGFCKAGQSLQLE-------------------

Query:  -----ASWKGKLNEALEILDRMKLQGLTPDAGL-----------------------------------------THNRVIHGLCTIDESNRAFQLYLSVQ
                +GKLNEALEILDRMKLQGLTPDAGL                                         THNRVIHGLCT+++SNRAFQLYLSV 
Subjt:  -----ASWKGKLNEALEILDRMKLQGLTPDAGL-----------------------------------------THNRVIHGLCTIDESNRAFQLYLSVQ

Query:  TRDI
        TR I
Subjt:  TRDI

XP_023519166.1 pentatricopeptide repeat-containing protein At5g46100 [Cucurbita pepo subsp. pepo]2.1e-16374.75Show/hide
Query:  MGSKAMFKWAKTVMPSHVEQLIRAERDINKALLIFDSATAEYTNGFKHDLNTFRLMISKLVSANQFRLAETLLDRMKEEKFDVTEDIFLTICRAYGRIHK
        MGSKAMFKWAKTV P+HVEQLI+AERDINKALLIFDSATAEYTNGFKHDLNTFRLMI KLVSANQFRLAETLLDRMKEEKFDVTEDIFL+ICRAYGRIH+
Subjt:  MGSKAMFKWAKTVMPSHVEQLIRAERDINKALLIFDSATAEYTNGFKHDLNTFRLMISKLVSANQFRLAETLLDRMKEEKFDVTEDIFLTICRAYGRIHK

Query:  PLDSIRVFHKMQDFHCKPTEKSYITVFAILVEENQLKLALRFYRYMRKMGIPPSVASLNVLIKAFCKNRGTMDKAMHVFNEMSNHGCEPDSYSYGTLING
        PLDSIRVFHKMQDFHCKPTEKSYI+VFAILVEENQL LA RFYRYMRK+GIPP+VASLNVLIKA CKN GTMDKAM++F EMSN GCEPDSY+YGTLING
Subjt:  PLDSIRVFHKMQDFHCKPTEKSYITVFAILVEENQLKLALRFYRYMRKMGIPPSVASLNVLIKAFCKNRGTMDKAMHVFNEMSNHGCEPDSYSYGTLING

Query:  LCRLGKIVEAKELLQEMETKGCSPSVVTYTSLIHGLCQLNNVDEAMRLLEDMMSKGIEPNVFTYSSLMDGFCKAGQSLQLE-------------------
        LCR G IVEAKELLQEME KGCSPSV+TYTS+IHGLCQLNNVDEAM LLEDMMSKGIEPNVFTYSSLMDGFCKAG SL+                     
Subjt:  LCRLGKIVEAKELLQEMETKGCSPSVVTYTSLIHGLCQLNNVDEAMRLLEDMMSKGIEPNVFTYSSLMDGFCKAGQSLQLE-------------------

Query:  -----ASWKGKLNEALEILDRMKLQGLTPDAGL-----------------------------------------THNRVIHGLCTIDESNRAFQLYLSVQ
                +GKLNEALEILDRMKLQGLTPDAGL                                         THNRVIHGLCT+++SNRAFQLYLSV 
Subjt:  -----ASWKGKLNEALEILDRMKLQGLTPDAGL-----------------------------------------THNRVIHGLCTIDESNRAFQLYLSVQ

Query:  TRDI
        TR I
Subjt:  TRDI

TrEMBL top hitse value%identityAlignment
A0A0A0LRZ4 Uncharacterized protein2.7e-15671.67Show/hide
Query:  MGSKAMFKWAKTVMPSHVEQLIRAERDINKALLIFDSATAEYTNGFKHDLNTFRLMISKLVSANQFRLAETLLDRMKEEKFDVTEDIFLTICRAYGRIHK
        MGSKAMFKWAKTV P+HV+QLI+AERDI KAL+IFDSATAEY NGFKHDLNTF LMISKL+SANQFRLAETLLDRMKEEK DVTEDI L+ICRAYGRIHK
Subjt:  MGSKAMFKWAKTVMPSHVEQLIRAERDINKALLIFDSATAEYTNGFKHDLNTFRLMISKLVSANQFRLAETLLDRMKEEKFDVTEDIFLTICRAYGRIHK

Query:  PLDSIRVFHKMQDFHCKPTEKSYITVFAILVEENQLKLALRFYRYMRKMGIPPSVASLNVLIKAFCKNRGTMDKAMHVFNEMSNHGCEPDSYSYGTLING
        PLDSIRVFHKMQDFHCKPTEKSYI+V AILVEENQLK A RFYR MRKMGIPP+V SLNVLIKAFCKN GTMDKAMH+F  MSNHGCEPDSY+YGTLING
Subjt:  PLDSIRVFHKMQDFHCKPTEKSYITVFAILVEENQLKLALRFYRYMRKMGIPPSVASLNVLIKAFCKNRGTMDKAMHVFNEMSNHGCEPDSYSYGTLING

Query:  LCRLGKIVEAKELLQEMETKGCSPSVVTYTSLIHGLCQLNNVDEAMRLLEDMMSKGIEPNVFTYSSLMDGFCKAGQS--------LQLEASWK-------
        LCR   IVEAKELLQEMETKGCSPSVVTYTS+IHGLCQLNNVDEAMRLLEDM  K IEPNVFTYSSLMDGFCK G S        L ++   +       
Subjt:  LCRLGKIVEAKELLQEMETKGCSPSVVTYTSLIHGLCQLNNVDEAMRLLEDMMSKGIEPNVFTYSSLMDGFCKAGQS--------LQLEASWK-------

Query:  ---------GKLNEALEILDRMKLQGLTPDAGL-----------------------------------------THNRVIHGLCTIDESNRAFQLYLSVQ
                 GK+NEALEI DRMKLQG  PDAGL                                         THNRVIHGLCTI+ SNRAFQLYLSV 
Subjt:  ---------GKLNEALEILDRMKLQGLTPDAGL-----------------------------------------THNRVIHGLCTIDESNRAFQLYLSVQ

Query:  TRDIVV
        TR I +
Subjt:  TRDIVV

A0A6J1CDW1 pentatricopeptide repeat-containing protein At5g46100 isoform X12.7e-16475.12Show/hide
Query:  MGSKAMFKWAKTVMPSHVEQLIRAERDINKALLIFDSATAEYTNGFKHDLNTFRLMISKLVSANQFRLAETLLDRMKEEKFDVTEDIFLTICRAYGRIHK
        MGSKAMFKWAKTV PSHVEQLI+AERDINKALLIFDSAT+EY NGFKHDLNTFRLMISKLVSANQFR AETLLDRM EEKFDVTEDIFLTICRAYGR+HK
Subjt:  MGSKAMFKWAKTVMPSHVEQLIRAERDINKALLIFDSATAEYTNGFKHDLNTFRLMISKLVSANQFRLAETLLDRMKEEKFDVTEDIFLTICRAYGRIHK

Query:  PLDSIRVFHKMQDFHCKPTEKSYITVFAILVEENQLKLALRFYRYMRKMGIPPSVASLNVLIKAFCKNRGTMDKAMHVFNEMSNHGCEPDSYSYGTLING
        PLDSIR+FHKM+DF CKPTEKSYITVFAILVEENQLKLALRFYRYMRKMG PP+VASLNVLIKAFCKN GTMDKAMH+  EMSNHGCEPDSY+YGTLING
Subjt:  PLDSIRVFHKMQDFHCKPTEKSYITVFAILVEENQLKLALRFYRYMRKMGIPPSVASLNVLIKAFCKNRGTMDKAMHVFNEMSNHGCEPDSYSYGTLING

Query:  LCRLGKIVEAKELLQEMETKGCSPSVVTYTSLIHGLCQLNNVDEAMRLLEDMMSKGIEPNVFTYSSLMDGFCKAGQS------LQLEASWK---------
        LC+LGKIVEAKELLQEMETKGCSPSVVTYTSLIHGLCQLNNVDEA+ LLEDMM KGIEPNVFTYSSLMDGFCKAG S      L+L    +         
Subjt:  LCRLGKIVEAKELLQEMETKGCSPSVVTYTSLIHGLCQLNNVDEAMRLLEDMMSKGIEPNVFTYSSLMDGFCKAGQS------LQLEASWK---------

Query:  ---------GKLNEALEILDRMKLQGLTPDAGL-----------------------------------------THNRVIHGLCTIDESNRAFQLYLSVQ
                 GKLNEALEILDRMKLQGL PDAGL                                         THNRVI GLCTI++S+RAFQLYLSVQ
Subjt:  ---------GKLNEALEILDRMKLQGLTPDAGL-----------------------------------------THNRVIHGLCTIDESNRAFQLYLSVQ

Query:  TRDIVV
        TR I +
Subjt:  TRDIVV

A0A6J1CF14 pentatricopeptide repeat-containing protein At5g46100 isoform X22.7e-16475.12Show/hide
Query:  MGSKAMFKWAKTVMPSHVEQLIRAERDINKALLIFDSATAEYTNGFKHDLNTFRLMISKLVSANQFRLAETLLDRMKEEKFDVTEDIFLTICRAYGRIHK
        MGSKAMFKWAKTV PSHVEQLI+AERDINKALLIFDSAT+EY NGFKHDLNTFRLMISKLVSANQFR AETLLDRM EEKFDVTEDIFLTICRAYGR+HK
Subjt:  MGSKAMFKWAKTVMPSHVEQLIRAERDINKALLIFDSATAEYTNGFKHDLNTFRLMISKLVSANQFRLAETLLDRMKEEKFDVTEDIFLTICRAYGRIHK

Query:  PLDSIRVFHKMQDFHCKPTEKSYITVFAILVEENQLKLALRFYRYMRKMGIPPSVASLNVLIKAFCKNRGTMDKAMHVFNEMSNHGCEPDSYSYGTLING
        PLDSIR+FHKM+DF CKPTEKSYITVFAILVEENQLKLALRFYRYMRKMG PP+VASLNVLIKAFCKN GTMDKAMH+  EMSNHGCEPDSY+YGTLING
Subjt:  PLDSIRVFHKMQDFHCKPTEKSYITVFAILVEENQLKLALRFYRYMRKMGIPPSVASLNVLIKAFCKNRGTMDKAMHVFNEMSNHGCEPDSYSYGTLING

Query:  LCRLGKIVEAKELLQEMETKGCSPSVVTYTSLIHGLCQLNNVDEAMRLLEDMMSKGIEPNVFTYSSLMDGFCKAGQS------LQLEASWK---------
        LC+LGKIVEAKELLQEMETKGCSPSVVTYTSLIHGLCQLNNVDEA+ LLEDMM KGIEPNVFTYSSLMDGFCKAG S      L+L    +         
Subjt:  LCRLGKIVEAKELLQEMETKGCSPSVVTYTSLIHGLCQLNNVDEAMRLLEDMMSKGIEPNVFTYSSLMDGFCKAGQS------LQLEASWK---------

Query:  ---------GKLNEALEILDRMKLQGLTPDAGL-----------------------------------------THNRVIHGLCTIDESNRAFQLYLSVQ
                 GKLNEALEILDRMKLQGL PDAGL                                         THNRVI GLCTI++S+RAFQLYLSVQ
Subjt:  ---------GKLNEALEILDRMKLQGLTPDAGL-----------------------------------------THNRVIHGLCTIDESNRAFQLYLSVQ

Query:  TRDIVV
        TR I +
Subjt:  TRDIVV

A0A6J1EKD3 pentatricopeptide repeat-containing protein At5g461004.6e-16474.5Show/hide
Query:  MGSKAMFKWAKTVMPSHVEQLIRAERDINKALLIFDSATAEYTNGFKHDLNTFRLMISKLVSANQFRLAETLLDRMKEEKFDVTEDIFLTICRAYGRIHK
        MGSKAMFKWAKTV P+HVEQL++AERDINKALLIFDSATAEYTNGFKHDLNTFRLMI KLVSANQFRLAETLLDRMKEEKFDVTEDIFL+ICRAYGR+H+
Subjt:  MGSKAMFKWAKTVMPSHVEQLIRAERDINKALLIFDSATAEYTNGFKHDLNTFRLMISKLVSANQFRLAETLLDRMKEEKFDVTEDIFLTICRAYGRIHK

Query:  PLDSIRVFHKMQDFHCKPTEKSYITVFAILVEENQLKLALRFYRYMRKMGIPPSVASLNVLIKAFCKNRGTMDKAMHVFNEMSNHGCEPDSYSYGTLING
        PLDSIRVFHKMQDFHCKPTEKSYI+VFAILVEENQLKLA RFYRYMRK+GIPP+VASLNVLIKA CKN GTMDKAM++F EMSN GCEPDSY+YGTLING
Subjt:  PLDSIRVFHKMQDFHCKPTEKSYITVFAILVEENQLKLALRFYRYMRKMGIPPSVASLNVLIKAFCKNRGTMDKAMHVFNEMSNHGCEPDSYSYGTLING

Query:  LCRLGKIVEAKELLQEMETKGCSPSVVTYTSLIHGLCQLNNVDEAMRLLEDMMSKGIEPNVFTYSSLMDGFCKAGQSLQLE-------------------
        LCR G IVEAKELLQEME KGCSPSV+TYTS+IHGLCQLNNVDEAM LLEDMMSKGIEPNVFTYSSLMDGFCKAG SL+                     
Subjt:  LCRLGKIVEAKELLQEMETKGCSPSVVTYTSLIHGLCQLNNVDEAMRLLEDMMSKGIEPNVFTYSSLMDGFCKAGQSLQLE-------------------

Query:  -----ASWKGKLNEALEILDRMKLQGLTPDAGL-----------------------------------------THNRVIHGLCTIDESNRAFQLYLSVQ
                +GKLNEALEILDRMKLQGLTPDAGL                                         THNRVIHGLCT+++SNRAFQLYLSV 
Subjt:  -----ASWKGKLNEALEILDRMKLQGLTPDAGL-----------------------------------------THNRVIHGLCTIDESNRAFQLYLSVQ

Query:  TRDI
        TR I
Subjt:  TRDI

A0A6J1KLZ3 pentatricopeptide repeat-containing protein At5g461002.3e-16374.75Show/hide
Query:  MGSKAMFKWAKTVMPSHVEQLIRAERDINKALLIFDSATAEYTNGFKHDLNTFRLMISKLVSANQFRLAETLLDRMKEEKFDVTEDIFLTICRAYGRIHK
        MGSKAMFKWAKTV P+HVEQLI+AERDINKALLIFDSATAEYTNGFKHDLNTFRLMI KLVSANQFRLAETLLDRMKEEK DVTEDIFL+ICRAYGRIH+
Subjt:  MGSKAMFKWAKTVMPSHVEQLIRAERDINKALLIFDSATAEYTNGFKHDLNTFRLMISKLVSANQFRLAETLLDRMKEEKFDVTEDIFLTICRAYGRIHK

Query:  PLDSIRVFHKMQDFHCKPTEKSYITVFAILVEENQLKLALRFYRYMRKMGIPPSVASLNVLIKAFCKNRGTMDKAMHVFNEMSNHGCEPDSYSYGTLING
        PLDSIRVFHKMQDFHCKPTEKSYI+VFAILVEENQLKLA RFYRYMRK+GIPP+VASLNVLIKA CKN GTMDKAM++F EMSN GCEPDSY+YGTLING
Subjt:  PLDSIRVFHKMQDFHCKPTEKSYITVFAILVEENQLKLALRFYRYMRKMGIPPSVASLNVLIKAFCKNRGTMDKAMHVFNEMSNHGCEPDSYSYGTLING

Query:  LCRLGKIVEAKELLQEMETKGCSPSVVTYTSLIHGLCQLNNVDEAMRLLEDMMSKGIEPNVFTYSSLMDGFCKAGQSLQLE-------------------
        LCR G IVEAKELLQEME KGCSPSVVTYTS+IHGLCQLNNVDEAM LLEDMMSKGIEPNVFTYSSLMDGFCKAG SL+                     
Subjt:  LCRLGKIVEAKELLQEMETKGCSPSVVTYTSLIHGLCQLNNVDEAMRLLEDMMSKGIEPNVFTYSSLMDGFCKAGQSLQLE-------------------

Query:  -----ASWKGKLNEALEILDRMKLQGLTPDAGL-----------------------------------------THNRVIHGLCTIDESNRAFQLYLSVQ
                +GK+NEALEILDRMKLQGLTPDAGL                                         THNRVIHGLCT+++SNRAFQLYLSV 
Subjt:  -----ASWKGKLNEALEILDRMKLQGLTPDAGL-----------------------------------------THNRVIHGLCTIDESNRAFQLYLSVQ

Query:  TRDI
        TR I
Subjt:  TRDI

SwissProt top hitse value%identityAlignment
O49436 Pentatricopeptide repeat-containing protein At4g200905.4e-4535.53Show/hide
Query:  TAEYTNGFKHDLNTFRLMISKLVSANQFRLAETLLDRMKEEKFDVTEDIFLTICRAYGRIHKPLDSIRVFHKMQD-FHCKPTEKSYITVFAILVEENQLK
        +A     FK   +T   MI    ++  F   E LL R++ E   + E  F+ + RAYG+ H P  ++ +FH+M D F CK + KS+ +V  +++ E    
Subjt:  TAEYTNGFKHDLNTFRLMISKLVSANQFRLAETLLDRMKEEKFDVTEDIFLTICRAYGRIHKPLDSIRVFHKMQD-FHCKPTEKSYITVFAILVEENQLK

Query:  LALRFYRYM----RKMGIPPSVASLNVLIKAFCKNRGTMDKAMHVFNEMSNHGCEPDSYSYGTLINGLCRLGKIVEAKELLQEMETKGCSPSVVTYTSLI
          L FY Y+      M I P+  S N++IKA CK R  +D+A+ VF  M    C PD Y+Y TL++GLC+  +I EA  LL EM+++GCSPS V Y  LI
Subjt:  LALRFYRYM----RKMGIPPSVASLNVLIKAFCKNRGTMDKAMHVFNEMSNHGCEPDSYSYGTLINGLCRLGKIVEAKELLQEMETKGCSPSVVTYTSLI

Query:  HGLCQLNNVDEAMRLLEDMMSKGIEPNVFTYSSLMDGFCKAGQSLQLEASWKGKLNEALEILDRMKLQGLTPDAGLTHNRVIHGLCTIDESNRAFQLYLS
         GLC+  ++    +L+++M  KG  PN  TY++L+ G C            KGKL++A+ +L+RM      P+  +T+  +I+GL     +  A +L  S
Subjt:  HGLCQLNNVDEAMRLLEDMMSKGIEPNVFTYSSLMDGFCKAGQSLQLEASWKGKLNEALEILDRMKLQGLTPDAGLTHNRVIHGLCTIDESNRAFQLYLS

Query:  VQTR
        ++ R
Subjt:  VQTR

Q9CA58 Putative pentatricopeptide repeat-containing protein At1g745803.3e-4231.78Show/hide
Query:  VMPSHVEQLIRAERDINKALLIFDSATAEYTNGFKHDLNTFRLMISKLVSANQFRLAETLLDRMKEEKFD-VTEDIFLTICRAYGRIHKPLDSIRVFHKM
        ++P HV  +I+ ++D  KAL +F+S   E   GFKH L+T+R +I KL    +F   E +L  M+E   + + E +++   + YGR  K  +++ VF +M
Subjt:  VMPSHVEQLIRAERDINKALLIFDSATAEYTNGFKHDLNTFRLMISKLVSANQFRLAETLLDRMKEEKFD-VTEDIFLTICRAYGRIHKPLDSIRVFHKM

Query:  QDFHCKPTEKSYITVFAILVEENQLKLALRFYRYMRKMGIPPSVASLNVLIKAFCKNRGTMDKAMHVFNEMSNHGCEPDSYSYGTLINGLCRLGKIVEAK
          + C+PT  SY  + ++LV+      A + Y  MR  GI P V S  + +K+FCK       A+ + N MS+ GCE +  +Y T++ G        E  
Subjt:  QDFHCKPTEKSYITVFAILVEENQLKLALRFYRYMRKMGIPPSVASLNVLIKAFCKNRGTMDKAMHVFNEMSNHGCEPDSYSYGTLINGLCRLGKIVEAK

Query:  ELLQEMETKGCSPSVVTYTSLIHGLCQLNNVDEAMRLLEDMMSKGIEPNVFTYSSLMDGFCKAGQSLQLEASWKGKLNEALEILDRMKLQGLTPDAGLTH
        EL  +M   G S  + T+  L+  LC+  +V E  +LL+ ++ +G+ PN+FTY+  + G C+           +G+L+ A+ ++  +  QG  PD  +T+
Subjt:  ELLQEMETKGCSPSVVTYTSLIHGLCQLNNVDEAMRLLEDMMSKGIEPNVFTYSSLMDGFCKAGQSLQLEASWKGKLNEALEILDRMKLQGLTPDAGLTH

Query:  NRVIHGLCTIDESNRAFQLYL
        N +I+GLC   +   A ++YL
Subjt:  NRVIHGLCTIDESNRAFQLYL

Q9FMF6 Pentatricopeptide repeat-containing protein At5g64320, mitochondrial4.7e-4129.97Show/hide
Query:  VMPSHVEQLIRAERDINKALLIFDSATAEYTNGFKHDLNTFRLMISKLVSANQFRLAETLLDRMKEEKFDVTEDIFLTICRAYGRIHKPLDSIRVFHKMQ
        + P  + +L+    +++ ++ +F    ++  NG++H  + ++++I KL +  +F+  + LL +MK+E     E +F++I R Y +   P  + R+  +M+
Subjt:  VMPSHVEQLIRAERDINKALLIFDSATAEYTNGFKHDLNTFRLMISKLVSANQFRLAETLLDRMKEEKFDVTEDIFLTICRAYGRIHKPLDSIRVFHKMQ

Query:  D-FHCKPTEKSYITVFAILVEENQLKLALRFYRYMRKMGIPPSVASLNVLIKAFCKNRGTMDKAMHVFNEMSNHGCEPDSYSYGTLINGLCRLGKIVEAK
        + + C+PT KSY  V  ILV  N  K+A   +  M    IPP++ +  V++KAFC     +D A+ +  +M+ HGC P+S  Y TLI+ L +  ++ EA 
Subjt:  D-FHCKPTEKSYITVFAILVEENQLKLALRFYRYMRKMGIPPSVASLNVLIKAFCKNRGTMDKAMHVFNEMSNHGCEPDSYSYGTLINGLCRLGKIVEAK

Query:  ELLQEMETKGCSPSVVTYTSLIHGLCQLNNVDEAMRLLEDMMSKGIEPNVFTYSSLMDGFCKAGQ--------------------SLQLEASWKGKLNEA
        +LL+EM   GC P   T+  +I GLC+ + ++EA +++  M+ +G  P+  TY  LM+G CK G+                    +L       G+L++A
Subjt:  ELLQEMETKGCSPSVVTYTSLIHGLCQLNNVDEAMRLLEDMMSKGIEPNVFTYSSLMDGFCKAGQ--------------------SLQLEASWKGKLNEA

Query:  LEIL-DRMKLQGLTPDAGLTHNRVIHG
          +L D +   G+ PD   T+N +I+G
Subjt:  LEIL-DRMKLQGLTPDAGLTHNRVIHG

Q9FNL2 Pentatricopeptide repeat-containing protein At5g461001.6e-11357.67Show/hide
Query:  MGSKA-MFKWAKTVMPSHVEQLIRAERDINKALLIFDSATAEYTNGFKHDLNTFRLMISKLVSANQFRLAETLLDRMKEEKFDVTEDIFLTICRAYGRIH
        MGSK  MFKW+K + PS V +L+RAE+D+ K++ +FDSATAEY NG+ HD ++F  M+ +LVSAN+F+ AE L+ RMK E   V+EDI L+ICR YGR+H
Subjt:  MGSKA-MFKWAKTVMPSHVEQLIRAERDINKALLIFDSATAEYTNGFKHDLNTFRLMISKLVSANQFRLAETLLDRMKEEKFDVTEDIFLTICRAYGRIH

Query:  KPLDSIRVFHKMQDFHCKPTEKSYITVFAILVEENQLKLALRFYRYMRKMGIPPSVASLNVLIKAFCKNRGTMDKAMHVFNEMSNHGCEPDSYSYGTLIN
        +P DS+RVFHKM+DF C P++K+Y+TV AILVEENQL LA +FY+ MR++G+PP+VASLNVLIKA C+N GT+D  + +F EM   GC+PDSY+YGTLI+
Subjt:  KPLDSIRVFHKMQDFHCKPTEKSYITVFAILVEENQLKLALRFYRYMRKMGIPPSVASLNVLIKAFCKNRGTMDKAMHVFNEMSNHGCEPDSYSYGTLIN

Query:  GLCRLGKIVEAKELLQEMETKGCSPSVVTYTSLIHGLCQLNNVDEAMRLLEDMMSKGIEPNVFTYSSLMDGFCKAGQSLQLEASW---------------
        GLCR G+I EAK+L  EM  K C+P+VVTYTSLI+GLC   NVDEAMR LE+M SKGIEPNVFTYSSLMDG CK G+SLQ    +               
Subjt:  GLCRLGKIVEAKELLQEMETKGCSPSVVTYTSLIHGLCQLNNVDEAMRLLEDMMSKGIEPNVFTYSSLMDGFCKAGQSLQLEASW---------------

Query:  ---------KGKLNEALEILDRMKLQGLTPDAGLTHNRVIHGLCTIDESNRA
                 + K+ EA+E+LDRM LQGL PDAGL + +VI G C I +   A
Subjt:  ---------KGKLNEALEILDRMKLQGLTPDAGLTHNRVIHGLCTIDESNRA

Q9M302 Pentatricopeptide repeat-containing protein At3g488101.1e-4231.96Show/hide
Query:  TVMPSHVE-------QLIRAERDINKALLIFDSATAEYTNGFKHDLNTFRLMISKLVSANQFRLAETLLDRMKEEKFDVTEDIFLTICRAYGRIHKPLDS
        T  P+H E       + +R E  +  AL  F S     +N FKH   TF +MI KL    Q    + LL +MK + F  +ED+F+++   Y ++     +
Subjt:  TVMPSHVE-------QLIRAERDINKALLIFDSATAEYTNGFKHDLNTFRLMISKLVSANQFRLAETLLDRMKEEKFDVTEDIFLTICRAYGRIHKPLDS

Query:  IRVFHKMQDFHCKPTEKSYITVFAILVEENQLKLALRFYRYMRKMGIPPSVASLNVLIKAFCKNRGTMDKAMHVFNEMSNHGCEPDSYSYGTLINGLCRL
        + +F+++++F C P+ K Y  V   L+ EN++++    YR M++ G  P+V + NVL+KA CKN   +D A  +  EMSN GC PD+ SY T+I+ +C +
Subjt:  IRVFHKMQDFHCKPTEKSYITVFAILVEENQLKLALRFYRYMRKMGIPPSVASLNVLIKAFCKNRGTMDKAMHVFNEMSNHGCEPDSYSYGTLINGLCRL

Query:  GKIVEAKELLQEMETKGCSPSVVTYTSLIHGLCQLNNVDEAMRLLEDMMSKGIEPNVFTYSSLMDGFCKAGQ------------------------SLQL
        G + E +EL +  E     P V  Y +LI+GLC+ ++   A  L+ +M+ KGI PNV +YS+L++  C +GQ                        SL  
Subjt:  GKIVEAKELLQEMETKGCSPSVVTYTSLIHGLCQLNNVDEAMRLLEDMMSKGIEPNVFTYSSLMDGFCKAGQ------------------------SLQL

Query:  EASWKGKLNEALEILDRM-KLQGLTPDAGLTHNRVIHGLCT
            +G   +AL++ ++M +  GL P+  + +N ++ G C+
Subjt:  EASWKGKLNEALEILDRM-KLQGLTPDAGLTHNRVIHGLCT

Arabidopsis top hitse value%identityAlignment
AT1G74580.1 Pentatricopeptide repeat (PPR) superfamily protein2.3e-4331.78Show/hide
Query:  VMPSHVEQLIRAERDINKALLIFDSATAEYTNGFKHDLNTFRLMISKLVSANQFRLAETLLDRMKEEKFD-VTEDIFLTICRAYGRIHKPLDSIRVFHKM
        ++P HV  +I+ ++D  KAL +F+S   E   GFKH L+T+R +I KL    +F   E +L  M+E   + + E +++   + YGR  K  +++ VF +M
Subjt:  VMPSHVEQLIRAERDINKALLIFDSATAEYTNGFKHDLNTFRLMISKLVSANQFRLAETLLDRMKEEKFD-VTEDIFLTICRAYGRIHKPLDSIRVFHKM

Query:  QDFHCKPTEKSYITVFAILVEENQLKLALRFYRYMRKMGIPPSVASLNVLIKAFCKNRGTMDKAMHVFNEMSNHGCEPDSYSYGTLINGLCRLGKIVEAK
          + C+PT  SY  + ++LV+      A + Y  MR  GI P V S  + +K+FCK       A+ + N MS+ GCE +  +Y T++ G        E  
Subjt:  QDFHCKPTEKSYITVFAILVEENQLKLALRFYRYMRKMGIPPSVASLNVLIKAFCKNRGTMDKAMHVFNEMSNHGCEPDSYSYGTLINGLCRLGKIVEAK

Query:  ELLQEMETKGCSPSVVTYTSLIHGLCQLNNVDEAMRLLEDMMSKGIEPNVFTYSSLMDGFCKAGQSLQLEASWKGKLNEALEILDRMKLQGLTPDAGLTH
        EL  +M   G S  + T+  L+  LC+  +V E  +LL+ ++ +G+ PN+FTY+  + G C+           +G+L+ A+ ++  +  QG  PD  +T+
Subjt:  ELLQEMETKGCSPSVVTYTSLIHGLCQLNNVDEAMRLLEDMMSKGIEPNVFTYSSLMDGFCKAGQSLQLEASWKGKLNEALEILDRMKLQGLTPDAGLTH

Query:  NRVIHGLCTIDESNRAFQLYL
        N +I+GLC   +   A ++YL
Subjt:  NRVIHGLCTIDESNRAFQLYL

AT3G48810.1 Pentatricopeptide repeat (PPR) superfamily protein8.0e-4431.96Show/hide
Query:  TVMPSHVE-------QLIRAERDINKALLIFDSATAEYTNGFKHDLNTFRLMISKLVSANQFRLAETLLDRMKEEKFDVTEDIFLTICRAYGRIHKPLDS
        T  P+H E       + +R E  +  AL  F S     +N FKH   TF +MI KL    Q    + LL +MK + F  +ED+F+++   Y ++     +
Subjt:  TVMPSHVE-------QLIRAERDINKALLIFDSATAEYTNGFKHDLNTFRLMISKLVSANQFRLAETLLDRMKEEKFDVTEDIFLTICRAYGRIHKPLDS

Query:  IRVFHKMQDFHCKPTEKSYITVFAILVEENQLKLALRFYRYMRKMGIPPSVASLNVLIKAFCKNRGTMDKAMHVFNEMSNHGCEPDSYSYGTLINGLCRL
        + +F+++++F C P+ K Y  V   L+ EN++++    YR M++ G  P+V + NVL+KA CKN   +D A  +  EMSN GC PD+ SY T+I+ +C +
Subjt:  IRVFHKMQDFHCKPTEKSYITVFAILVEENQLKLALRFYRYMRKMGIPPSVASLNVLIKAFCKNRGTMDKAMHVFNEMSNHGCEPDSYSYGTLINGLCRL

Query:  GKIVEAKELLQEMETKGCSPSVVTYTSLIHGLCQLNNVDEAMRLLEDMMSKGIEPNVFTYSSLMDGFCKAGQ------------------------SLQL
        G + E +EL +  E     P V  Y +LI+GLC+ ++   A  L+ +M+ KGI PNV +YS+L++  C +GQ                        SL  
Subjt:  GKIVEAKELLQEMETKGCSPSVVTYTSLIHGLCQLNNVDEAMRLLEDMMSKGIEPNVFTYSSLMDGFCKAGQ------------------------SLQL

Query:  EASWKGKLNEALEILDRM-KLQGLTPDAGLTHNRVIHGLCT
            +G   +AL++ ++M +  GL P+  + +N ++ G C+
Subjt:  EASWKGKLNEALEILDRM-KLQGLTPDAGLTHNRVIHGLCT

AT4G20090.1 Pentatricopeptide repeat (PPR) superfamily protein3.8e-4635.53Show/hide
Query:  TAEYTNGFKHDLNTFRLMISKLVSANQFRLAETLLDRMKEEKFDVTEDIFLTICRAYGRIHKPLDSIRVFHKMQD-FHCKPTEKSYITVFAILVEENQLK
        +A     FK   +T   MI    ++  F   E LL R++ E   + E  F+ + RAYG+ H P  ++ +FH+M D F CK + KS+ +V  +++ E    
Subjt:  TAEYTNGFKHDLNTFRLMISKLVSANQFRLAETLLDRMKEEKFDVTEDIFLTICRAYGRIHKPLDSIRVFHKMQD-FHCKPTEKSYITVFAILVEENQLK

Query:  LALRFYRYM----RKMGIPPSVASLNVLIKAFCKNRGTMDKAMHVFNEMSNHGCEPDSYSYGTLINGLCRLGKIVEAKELLQEMETKGCSPSVVTYTSLI
          L FY Y+      M I P+  S N++IKA CK R  +D+A+ VF  M    C PD Y+Y TL++GLC+  +I EA  LL EM+++GCSPS V Y  LI
Subjt:  LALRFYRYM----RKMGIPPSVASLNVLIKAFCKNRGTMDKAMHVFNEMSNHGCEPDSYSYGTLINGLCRLGKIVEAKELLQEMETKGCSPSVVTYTSLI

Query:  HGLCQLNNVDEAMRLLEDMMSKGIEPNVFTYSSLMDGFCKAGQSLQLEASWKGKLNEALEILDRMKLQGLTPDAGLTHNRVIHGLCTIDESNRAFQLYLS
         GLC+  ++    +L+++M  KG  PN  TY++L+ G C            KGKL++A+ +L+RM      P+  +T+  +I+GL     +  A +L  S
Subjt:  HGLCQLNNVDEAMRLLEDMMSKGIEPNVFTYSSLMDGFCKAGQSLQLEASWKGKLNEALEILDRMKLQGLTPDAGLTHNRVIHGLCTIDESNRAFQLYLS

Query:  VQTR
        ++ R
Subjt:  VQTR

AT5G46100.1 Pentatricopeptide repeat (PPR) superfamily protein1.1e-11457.67Show/hide
Query:  MGSKA-MFKWAKTVMPSHVEQLIRAERDINKALLIFDSATAEYTNGFKHDLNTFRLMISKLVSANQFRLAETLLDRMKEEKFDVTEDIFLTICRAYGRIH
        MGSK  MFKW+K + PS V +L+RAE+D+ K++ +FDSATAEY NG+ HD ++F  M+ +LVSAN+F+ AE L+ RMK E   V+EDI L+ICR YGR+H
Subjt:  MGSKA-MFKWAKTVMPSHVEQLIRAERDINKALLIFDSATAEYTNGFKHDLNTFRLMISKLVSANQFRLAETLLDRMKEEKFDVTEDIFLTICRAYGRIH

Query:  KPLDSIRVFHKMQDFHCKPTEKSYITVFAILVEENQLKLALRFYRYMRKMGIPPSVASLNVLIKAFCKNRGTMDKAMHVFNEMSNHGCEPDSYSYGTLIN
        +P DS+RVFHKM+DF C P++K+Y+TV AILVEENQL LA +FY+ MR++G+PP+VASLNVLIKA C+N GT+D  + +F EM   GC+PDSY+YGTLI+
Subjt:  KPLDSIRVFHKMQDFHCKPTEKSYITVFAILVEENQLKLALRFYRYMRKMGIPPSVASLNVLIKAFCKNRGTMDKAMHVFNEMSNHGCEPDSYSYGTLIN

Query:  GLCRLGKIVEAKELLQEMETKGCSPSVVTYTSLIHGLCQLNNVDEAMRLLEDMMSKGIEPNVFTYSSLMDGFCKAGQSLQLEASW---------------
        GLCR G+I EAK+L  EM  K C+P+VVTYTSLI+GLC   NVDEAMR LE+M SKGIEPNVFTYSSLMDG CK G+SLQ    +               
Subjt:  GLCRLGKIVEAKELLQEMETKGCSPSVVTYTSLIHGLCQLNNVDEAMRLLEDMMSKGIEPNVFTYSSLMDGFCKAGQSLQLEASW---------------

Query:  ---------KGKLNEALEILDRMKLQGLTPDAGLTHNRVIHGLCTIDESNRA
                 + K+ EA+E+LDRM LQGL PDAGL + +VI G C I +   A
Subjt:  ---------KGKLNEALEILDRMKLQGLTPDAGLTHNRVIHGLCTIDESNRA

AT5G64320.1 Pentatricopeptide repeat (PPR) superfamily protein3.3e-4229.97Show/hide
Query:  VMPSHVEQLIRAERDINKALLIFDSATAEYTNGFKHDLNTFRLMISKLVSANQFRLAETLLDRMKEEKFDVTEDIFLTICRAYGRIHKPLDSIRVFHKMQ
        + P  + +L+    +++ ++ +F    ++  NG++H  + ++++I KL +  +F+  + LL +MK+E     E +F++I R Y +   P  + R+  +M+
Subjt:  VMPSHVEQLIRAERDINKALLIFDSATAEYTNGFKHDLNTFRLMISKLVSANQFRLAETLLDRMKEEKFDVTEDIFLTICRAYGRIHKPLDSIRVFHKMQ

Query:  D-FHCKPTEKSYITVFAILVEENQLKLALRFYRYMRKMGIPPSVASLNVLIKAFCKNRGTMDKAMHVFNEMSNHGCEPDSYSYGTLINGLCRLGKIVEAK
        + + C+PT KSY  V  ILV  N  K+A   +  M    IPP++ +  V++KAFC     +D A+ +  +M+ HGC P+S  Y TLI+ L +  ++ EA 
Subjt:  D-FHCKPTEKSYITVFAILVEENQLKLALRFYRYMRKMGIPPSVASLNVLIKAFCKNRGTMDKAMHVFNEMSNHGCEPDSYSYGTLINGLCRLGKIVEAK

Query:  ELLQEMETKGCSPSVVTYTSLIHGLCQLNNVDEAMRLLEDMMSKGIEPNVFTYSSLMDGFCKAGQ--------------------SLQLEASWKGKLNEA
        +LL+EM   GC P   T+  +I GLC+ + ++EA +++  M+ +G  P+  TY  LM+G CK G+                    +L       G+L++A
Subjt:  ELLQEMETKGCSPSVVTYTSLIHGLCQLNNVDEAMRLLEDMMSKGIEPNVFTYSSLMDGFCKAGQ--------------------SLQLEASWKGKLNEA

Query:  LEIL-DRMKLQGLTPDAGLTHNRVIHG
          +L D +   G+ PD   T+N +I+G
Subjt:  LEIL-DRMKLQGLTPDAGLTHNRVIHG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCAGTAAAGCTATGTTTAAATGGGCAAAAACAGTCATGCCTTCTCACGTTGAGCAACTAATCCGAGCGGAACGAGACATAAACAAGGCACTTCTCATATTTGACTC
TGCAACAGCTGAGTATACAAATGGTTTTAAGCACGATCTCAATACTTTTAGGCTCATGATTAGTAAATTAGTTTCTGCAAACCAGTTCAGGTTAGCAGAAACACTTCTTG
ATAGAATGAAGGAAGAGAAATTTGATGTCACCGAGGATATATTTCTCACTATTTGTAGGGCTTATGGTCGTATCCATAAGCCATTGGACTCCATTCGAGTTTTCCACAAA
ATGCAGGATTTCCATTGCAAGCCTACAGAGAAATCTTACATTACAGTGTTTGCCATTCTTGTTGAAGAAAACCAATTAAAATTGGCTCTAAGGTTTTATAGGTATATGAG
AAAAATGGGTATTCCCCCAAGTGTAGCTTCTCTCAATGTTCTAATCAAAGCCTTTTGCAAGAATAGAGGAACCATGGATAAAGCAATGCACGTATTTAATGAAATGTCTA
ATCATGGGTGTGAACCTGATTCATATTCATATGGAACTCTGATCAATGGGTTATGTAGATTAGGAAAGATTGTTGAAGCAAAAGAATTATTGCAGGAGATGGAGACCAAA
GGTTGTTCACCTTCTGTTGTCACCTACACTTCGCTGATACATGGTTTGTGTCAGTTGAACAATGTGGATGAAGCCATGCGATTACTTGAAGATATGATGAGCAAGGGTAT
CGAACCTAACGTGTTTACTTACAGTTCCCTAATGGATGGATTTTGCAAGGCTGGTCAGTCTTTGCAGCTAGAGGCCTCTTGGAAAGGAAAACTAAATGAAGCTTTAGAGA
TTCTCGACAGAATGAAACTTCAAGGTTTGACACCAGATGCTGGGTTGACTCACAACAGAGTAATTCACGGTCTCTGCACTATTGACGAGTCAAATCGTGCATTTCAGTTG
TATCTTAGCGTGCAGACTCGCGACATTGTTGTTGGTAGCCTTTGGTGTAAAGCCATGTCTAGCGCAAAACCACCTGCATTCACCATGCATGGTCGGCTGACATTCTATGT
TTACTCATTAGCATTTCGGAAGAAATTAAGATTCTCTGATCCAAACCTGAGTCCAGATTACTGTGATTGTTGTGTTGTCCAGAGTAGAGCTATAGCCATCCATCAGTCAT
TTACCCATCAAGATTTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGCAGTAAAGCTATGTTTAAATGGGCAAAAACAGTCATGCCTTCTCACGTTGAGCAACTAATCCGAGCGGAACGAGACATAAACAAGGCACTTCTCATATTTGACTC
TGCAACAGCTGAGTATACAAATGGTTTTAAGCACGATCTCAATACTTTTAGGCTCATGATTAGTAAATTAGTTTCTGCAAACCAGTTCAGGTTAGCAGAAACACTTCTTG
ATAGAATGAAGGAAGAGAAATTTGATGTCACCGAGGATATATTTCTCACTATTTGTAGGGCTTATGGTCGTATCCATAAGCCATTGGACTCCATTCGAGTTTTCCACAAA
ATGCAGGATTTCCATTGCAAGCCTACAGAGAAATCTTACATTACAGTGTTTGCCATTCTTGTTGAAGAAAACCAATTAAAATTGGCTCTAAGGTTTTATAGGTATATGAG
AAAAATGGGTATTCCCCCAAGTGTAGCTTCTCTCAATGTTCTAATCAAAGCCTTTTGCAAGAATAGAGGAACCATGGATAAAGCAATGCACGTATTTAATGAAATGTCTA
ATCATGGGTGTGAACCTGATTCATATTCATATGGAACTCTGATCAATGGGTTATGTAGATTAGGAAAGATTGTTGAAGCAAAAGAATTATTGCAGGAGATGGAGACCAAA
GGTTGTTCACCTTCTGTTGTCACCTACACTTCGCTGATACATGGTTTGTGTCAGTTGAACAATGTGGATGAAGCCATGCGATTACTTGAAGATATGATGAGCAAGGGTAT
CGAACCTAACGTGTTTACTTACAGTTCCCTAATGGATGGATTTTGCAAGGCTGGTCAGTCTTTGCAGCTAGAGGCCTCTTGGAAAGGAAAACTAAATGAAGCTTTAGAGA
TTCTCGACAGAATGAAACTTCAAGGTTTGACACCAGATGCTGGGTTGACTCACAACAGAGTAATTCACGGTCTCTGCACTATTGACGAGTCAAATCGTGCATTTCAGTTG
TATCTTAGCGTGCAGACTCGCGACATTGTTGTTGGTAGCCTTTGGTGTAAAGCCATGTCTAGCGCAAAACCACCTGCATTCACCATGCATGGTCGGCTGACATTCTATGT
TTACTCATTAGCATTTCGGAAGAAATTAAGATTCTCTGATCCAAACCTGAGTCCAGATTACTGTGATTGTTGTGTTGTCCAGAGTAGAGCTATAGCCATCCATCAGTCAT
TTACCCATCAAGATTTTTGA
Protein sequenceShow/hide protein sequence
MGSKAMFKWAKTVMPSHVEQLIRAERDINKALLIFDSATAEYTNGFKHDLNTFRLMISKLVSANQFRLAETLLDRMKEEKFDVTEDIFLTICRAYGRIHKPLDSIRVFHK
MQDFHCKPTEKSYITVFAILVEENQLKLALRFYRYMRKMGIPPSVASLNVLIKAFCKNRGTMDKAMHVFNEMSNHGCEPDSYSYGTLINGLCRLGKIVEAKELLQEMETK
GCSPSVVTYTSLIHGLCQLNNVDEAMRLLEDMMSKGIEPNVFTYSSLMDGFCKAGQSLQLEASWKGKLNEALEILDRMKLQGLTPDAGLTHNRVIHGLCTIDESNRAFQL
YLSVQTRDIVVGSLWCKAMSSAKPPAFTMHGRLTFYVYSLAFRKKLRFSDPNLSPDYCDCCVVQSRAIAIHQSFTHQDF