; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10002117 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10002117
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationChr11:3569206..3570301
RNA-Seq ExpressionHG10002117
SyntenyHG10002117
Gene Ontology termsGO:0000373 - Group II intron splicing (biological process)
GO:0009658 - chloroplast organization (biological process)
GO:0003723 - RNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR044190 - Protein THYLAKOID ASSEMBLY 8-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044536.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]1.1e-11990.91Show/hide
Query:  MKSTLMGRLQLHFPQLGLRQNLTNSSLHCCTAAPPPNIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSTSKMEDVINTKLSRLLKADLFDALTELQ
        MKSTLMGRLQLHFPQLGLRQNLTN SLHCCTAAPPPNIICGLRKGL+KPLGRSRVPS EAIQAVQSLKLAK TSKMEDVINTKLSRLLKADLFDALTELQ
Subjt:  MKSTLMGRLQLHFPQLGLRQNLTNSSLHCCTAAPPPNIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSTSKMEDVINTKLSRLLKADLFDALTELQ

Query:  RQNELELSLQVFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMAEEVFHKLKKDGLEPDTRAFNEMMGAYLQVDMIERAVETYELMKASGCTPDELTFKI
        RQNELELSLQVFKFMQNEEWYEPDLRLYHGMI+MMGKNKMIEMAEEVFHKL+KDGLEPDTRAFNEMMGAYLQVDMIERA +TY LM ASGCTPD+LTFKI
Subjt:  RQNELELSLQVFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMAEEVFHKLKKDGLEPDTRAFNEMMGAYLQVDMIERAVETYELMKASGCTPDELTFKI

Query:  LIKNLEKFREEFAAVVKKECNEYLDSPEKFLNDVEHKLTMKG
        LIKNLEKF+EEFA VVKK+C+EYLD+P+KF ND   KLT KG
Subjt:  LIKNLEKFREEFAAVVKKECNEYLDSPEKFLNDVEHKLTMKG

XP_008454079.1 PREDICTED: pentatricopeptide repeat-containing protein At1g62350-like [Cucumis melo]5.5e-11990.5Show/hide
Query:  MKSTLMGRLQLHFPQLGLRQNLTNSSLHCCTAAPPPNIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSTSKMEDVINTKLSRLLKADLFDALTELQ
        MKSTLMGRLQLHFP+LGLRQNLTN SLHCCTAAPPPNIICGLRKGL+KPLGRSRVPS EAIQAVQSLKLAK TSKMEDVINTKLSRLLKADLFDALTELQ
Subjt:  MKSTLMGRLQLHFPQLGLRQNLTNSSLHCCTAAPPPNIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSTSKMEDVINTKLSRLLKADLFDALTELQ

Query:  RQNELELSLQVFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMAEEVFHKLKKDGLEPDTRAFNEMMGAYLQVDMIERAVETYELMKASGCTPDELTFKI
        RQNELELSLQVFKFMQNEEWYEPDLRLYHGMI+MMGKNKMIEMAEEVFHKL+KDGLEPDTRAFNEMMGAYLQVDMIERA +TY LM ASGCTPD+LTFKI
Subjt:  RQNELELSLQVFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMAEEVFHKLKKDGLEPDTRAFNEMMGAYLQVDMIERAVETYELMKASGCTPDELTFKI

Query:  LIKNLEKFREEFAAVVKKECNEYLDSPEKFLNDVEHKLTMKG
        LIKNLEKF+EEFA VVKK+C EYLD+P+KF ND   KLT KG
Subjt:  LIKNLEKFREEFAAVVKKECNEYLDSPEKFLNDVEHKLTMKG

XP_022980054.1 pentatricopeptide repeat-containing protein At1g62350-like [Cucurbita maxima]9.8e-11688.98Show/hide
Query:  MKSTLMGRLQLHFPQLGLRQNLTNSSLHCCTAAPPPNIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSTSKMEDVINTKLSRLLKADLFDALTELQ
        MKSTLMGRLQLH PQLG RQNLTN +LHCCTA PPPNIICGLRKG RKPLG+SRVPSTE+IQAVQSLKLAKS SKMEDVIN+KLSRLLKADLFDAL ELQ
Subjt:  MKSTLMGRLQLHFPQLGLRQNLTNSSLHCCTAAPPPNIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSTSKMEDVINTKLSRLLKADLFDALTELQ

Query:  RQNELELSLQVFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMAEEVFHKLKKDGLEPDTRAFNEMMGAYLQVDMIERAVETYELMKASGCTPDELTFKI
        RQNELELSLQVFKFM+NEEWYEPDL LYH MI MMGKNKMIEMAEEVFH  K+DGLEPDTRAFNEMMGAYLQVDM+ERAVETYELMKASGCTPD+LTFKI
Subjt:  RQNELELSLQVFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMAEEVFHKLKKDGLEPDTRAFNEMMGAYLQVDMIERAVETYELMKASGCTPDELTFKI

Query:  LIKNLEKFREEFAAVVKKECNEYLDSPEKFLNDVEHKLTMKGQIL
        LIKNLE+FREEFAAVVKKEC E+LDSPEKFL DVE KL MK QIL
Subjt:  LIKNLEKFREEFAAVVKKECNEYLDSPEKFLNDVEHKLTMKGQIL

XP_023527118.1 pentatricopeptide repeat-containing protein At1g62350-like [Cucurbita pepo subsp. pepo]2.6e-11688.98Show/hide
Query:  MKSTLMGRLQLHFPQLGLRQNLTNSSLHCCTAAPPPNIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSTSKMEDVINTKLSRLLKADLFDALTELQ
        MKSTLMGRLQLHFPQLG RQNLTN +LHCCTA PPPNIICGLRKG RKPLG+SRVPSTE+IQAVQSLKLAKS SKMEDVIN+KLSRLLKADLFDAL ELQ
Subjt:  MKSTLMGRLQLHFPQLGLRQNLTNSSLHCCTAAPPPNIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSTSKMEDVINTKLSRLLKADLFDALTELQ

Query:  RQNELELSLQVFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMAEEVFHKLKKDGLEPDTRAFNEMMGAYLQVDMIERAVETYELMKASGCTPDELTFKI
        RQNELELSLQVFKFM+NEEWYEPDL LYH MI MMGKNKMIEMAEEVFH+LK+DGLEPDTRAFNEMMGAY+QVDM+ERAVETYELMKASGC PD+LTFKI
Subjt:  RQNELELSLQVFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMAEEVFHKLKKDGLEPDTRAFNEMMGAYLQVDMIERAVETYELMKASGCTPDELTFKI

Query:  LIKNLEKFREEFAAVVKKECNEYLDSPEKFLNDVEHKLTMKGQIL
        LIKNLE+FREEFAAVVKKEC E+LDSPEKFL DVE KL MK QIL
Subjt:  LIKNLEKFREEFAAVVKKECNEYLDSPEKFLNDVEHKLTMKGQIL

XP_038878176.1 pentatricopeptide repeat-containing protein At1g62350-like [Benincasa hispida]3.7e-12393.06Show/hide
Query:  MKSTLMGRLQLHFPQLGLRQNLTNSSLHCCTAAPPPNIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSTSKMEDVINTKLSRLLKADLFDALTELQ
        MKSTLMG LQ HFPQLGLRQ+LTN SLHCCTAAPPPNIICGLRKGLRKPLGRSRVPS EAIQAVQSLKLAKSTSKMEDVIN+KLSRLLKADLFDALTELQ
Subjt:  MKSTLMGRLQLHFPQLGLRQNLTNSSLHCCTAAPPPNIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSTSKMEDVINTKLSRLLKADLFDALTELQ

Query:  RQNELELSLQVFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMAEEVFHKLKKDGLEPDTRAFNEMMGAYLQVDMIERAVETYELMKASGCTPDELTFKI
        RQNELELSLQVFKF+QNEEWYEPDLRLYHGMILM GKNKMIEMAEE+FHKLKKDGLEPD RAFNEMMGAYLQVDM+ERAVETYELMKASGCTPD+LTFKI
Subjt:  RQNELELSLQVFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMAEEVFHKLKKDGLEPDTRAFNEMMGAYLQVDMIERAVETYELMKASGCTPDELTFKI

Query:  LIKNLEKFREEFAAVVKKECNEYLDSPEKFLNDVEHKLTMKGQIL
        LIKNLEKFREEFAAVVKKECN YLDSPEKFLNDVE K T KG+IL
Subjt:  LIKNLEKFREEFAAVVKKECNEYLDSPEKFLNDVEHKLTMKGQIL

TrEMBL top hitse value%identityAlignment
A0A0A0KU23 Uncharacterized protein3.4e-11487.76Show/hide
Query:  MKSTLMGRLQLHFPQLGLRQNLTNSSLHCCTAAPPPNIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSTSKMEDVINTKLSRLLKADLFDALTELQ
        MKSTL+G LQLHF QLGLRQNLTN SL C TAAPPPNIICGLRKG  +PLG SRVPS EAIQAVQSLKLAKSTSKMEDVINTKL RLLKADLFDAL+ELQ
Subjt:  MKSTLMGRLQLHFPQLGLRQNLTNSSLHCCTAAPPPNIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSTSKMEDVINTKLSRLLKADLFDALTELQ

Query:  RQNELELSLQVFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMAEEVFHKLKKDGLEPDTRAFNEMMGAYLQVDMIERAVETYELMKASGCTPDELTFKI
        RQNELELSLQVFKFMQNEEW+EPDLRLYHGMI++MGKNKMIEMAEEVFHKL+KDGLEPDTRAFNEMMGAYLQVDMIERAVETY LM ASGCTPDELTFKI
Subjt:  RQNELELSLQVFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMAEEVFHKLKKDGLEPDTRAFNEMMGAYLQVDMIERAVETYELMKASGCTPDELTFKI

Query:  LIKNLEKFREEFAAVVKKECNEYLDSPEKFLNDVEHKLTMKGQIL
        LIKNLEKFREEFA VVKK+CNEYLD+P+KF ND   KLT K +IL
Subjt:  LIKNLEKFREEFAAVVKKECNEYLDSPEKFLNDVEHKLTMKGQIL

A0A1S3BX97 pentatricopeptide repeat-containing protein At1g62350-like2.7e-11990.5Show/hide
Query:  MKSTLMGRLQLHFPQLGLRQNLTNSSLHCCTAAPPPNIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSTSKMEDVINTKLSRLLKADLFDALTELQ
        MKSTLMGRLQLHFP+LGLRQNLTN SLHCCTAAPPPNIICGLRKGL+KPLGRSRVPS EAIQAVQSLKLAK TSKMEDVINTKLSRLLKADLFDALTELQ
Subjt:  MKSTLMGRLQLHFPQLGLRQNLTNSSLHCCTAAPPPNIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSTSKMEDVINTKLSRLLKADLFDALTELQ

Query:  RQNELELSLQVFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMAEEVFHKLKKDGLEPDTRAFNEMMGAYLQVDMIERAVETYELMKASGCTPDELTFKI
        RQNELELSLQVFKFMQNEEWYEPDLRLYHGMI+MMGKNKMIEMAEEVFHKL+KDGLEPDTRAFNEMMGAYLQVDMIERA +TY LM ASGCTPD+LTFKI
Subjt:  RQNELELSLQVFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMAEEVFHKLKKDGLEPDTRAFNEMMGAYLQVDMIERAVETYELMKASGCTPDELTFKI

Query:  LIKNLEKFREEFAAVVKKECNEYLDSPEKFLNDVEHKLTMKG
        LIKNLEKF+EEFA VVKK+C EYLD+P+KF ND   KLT KG
Subjt:  LIKNLEKFREEFAAVVKKECNEYLDSPEKFLNDVEHKLTMKG

A0A5A7TQZ8 Pentatricopeptide repeat-containing protein5.4e-12090.91Show/hide
Query:  MKSTLMGRLQLHFPQLGLRQNLTNSSLHCCTAAPPPNIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSTSKMEDVINTKLSRLLKADLFDALTELQ
        MKSTLMGRLQLHFPQLGLRQNLTN SLHCCTAAPPPNIICGLRKGL+KPLGRSRVPS EAIQAVQSLKLAK TSKMEDVINTKLSRLLKADLFDALTELQ
Subjt:  MKSTLMGRLQLHFPQLGLRQNLTNSSLHCCTAAPPPNIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSTSKMEDVINTKLSRLLKADLFDALTELQ

Query:  RQNELELSLQVFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMAEEVFHKLKKDGLEPDTRAFNEMMGAYLQVDMIERAVETYELMKASGCTPDELTFKI
        RQNELELSLQVFKFMQNEEWYEPDLRLYHGMI+MMGKNKMIEMAEEVFHKL+KDGLEPDTRAFNEMMGAYLQVDMIERA +TY LM ASGCTPD+LTFKI
Subjt:  RQNELELSLQVFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMAEEVFHKLKKDGLEPDTRAFNEMMGAYLQVDMIERAVETYELMKASGCTPDELTFKI

Query:  LIKNLEKFREEFAAVVKKECNEYLDSPEKFLNDVEHKLTMKG
        LIKNLEKF+EEFA VVKK+C+EYLD+P+KF ND   KLT KG
Subjt:  LIKNLEKFREEFAAVVKKECNEYLDSPEKFLNDVEHKLTMKG

A0A5D3CZ95 Pentatricopeptide repeat-containing protein2.7e-11990.5Show/hide
Query:  MKSTLMGRLQLHFPQLGLRQNLTNSSLHCCTAAPPPNIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSTSKMEDVINTKLSRLLKADLFDALTELQ
        MKSTLMGRLQLHFP+LGLRQNLTN SLHCCTAAPPPNIICGLRKGL+KPLGRSRVPS EAIQAVQSLKLAK TSKMEDVINTKLSRLLKADLFDALTELQ
Subjt:  MKSTLMGRLQLHFPQLGLRQNLTNSSLHCCTAAPPPNIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSTSKMEDVINTKLSRLLKADLFDALTELQ

Query:  RQNELELSLQVFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMAEEVFHKLKKDGLEPDTRAFNEMMGAYLQVDMIERAVETYELMKASGCTPDELTFKI
        RQNELELSLQVFKFMQNEEWYEPDLRLYHGMI+MMGKNKMIEMAEEVFHKL+KDGLEPDTRAFNEMMGAYLQVDMIERA +TY LM ASGCTPD+LTFKI
Subjt:  RQNELELSLQVFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMAEEVFHKLKKDGLEPDTRAFNEMMGAYLQVDMIERAVETYELMKASGCTPDELTFKI

Query:  LIKNLEKFREEFAAVVKKECNEYLDSPEKFLNDVEHKLTMKG
        LIKNLEKF+EEFA VVKK+C EYLD+P+KF ND   KLT KG
Subjt:  LIKNLEKFREEFAAVVKKECNEYLDSPEKFLNDVEHKLTMKG

A0A6J1IY98 pentatricopeptide repeat-containing protein At1g62350-like4.7e-11688.98Show/hide
Query:  MKSTLMGRLQLHFPQLGLRQNLTNSSLHCCTAAPPPNIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSTSKMEDVINTKLSRLLKADLFDALTELQ
        MKSTLMGRLQLH PQLG RQNLTN +LHCCTA PPPNIICGLRKG RKPLG+SRVPSTE+IQAVQSLKLAKS SKMEDVIN+KLSRLLKADLFDAL ELQ
Subjt:  MKSTLMGRLQLHFPQLGLRQNLTNSSLHCCTAAPPPNIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSTSKMEDVINTKLSRLLKADLFDALTELQ

Query:  RQNELELSLQVFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMAEEVFHKLKKDGLEPDTRAFNEMMGAYLQVDMIERAVETYELMKASGCTPDELTFKI
        RQNELELSLQVFKFM+NEEWYEPDL LYH MI MMGKNKMIEMAEEVFH  K+DGLEPDTRAFNEMMGAYLQVDM+ERAVETYELMKASGCTPD+LTFKI
Subjt:  RQNELELSLQVFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMAEEVFHKLKKDGLEPDTRAFNEMMGAYLQVDMIERAVETYELMKASGCTPDELTFKI

Query:  LIKNLEKFREEFAAVVKKECNEYLDSPEKFLNDVEHKLTMKGQIL
        LIKNLE+FREEFAAVVKKEC E+LDSPEKFL DVE KL MK QIL
Subjt:  LIKNLEKFREEFAAVVKKECNEYLDSPEKFLNDVEHKLTMKGQIL

SwissProt top hitse value%identityAlignment
A7LN87 Pentatricopeptide repeat-containing protein PPR5, chloroplastic4.4e-1026.37Show/hide
Query:  STEAIQAVQSL--KLAKSTSKMEDVINTKLSRLLKADLFDALTELQRQNELELSLQVFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMAEEVFHKLKKD
        + EA + V+SL  + A    ++  V++  +  +     F    EL R++     L VF++MQ + WY  D  +Y  +I +MG+   I MA  +F +++  
Subjt:  STEAIQAVQSL--KLAKSTSKMEDVINTKLSRLLKADLFDALTELQRQNELELSLQVFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMAEEVFHKLKKD

Query:  GLEPDTRAFNEMMGAYL----QVDMIERAVETYELMKA-SGCTPDELTFKILIKNLEKFRE-EFAAVVKKECNEYLDSPEKF
        G +PDT  +N ++GA+L    +   + +A+  +E MK    C P  +T+ IL++   +  + +   ++ K+ +E + SP+ +
Subjt:  GLEPDTRAFNEMMGAYL----QVDMIERAVETYELMKA-SGCTPDELTFKILIKNLEKFRE-EFAAVVKKECNEYLDSPEKF

Q1PFH7 Pentatricopeptide repeat-containing protein At1g623504.8e-1730.97Show/hide
Query:  STEAIQAVQSLKLAKSTS-KMEDVINTKLSRLLKADLFDALTELQRQNELELSLQVFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMAEEVFHKLKKDG
        S E + A + LK  ++ S +++  I + +SRLLK+DL   L E QRQN++ L +++++ ++ E WY PD+  Y  M++M+ +NK ++  ++V+  LKK+ 
Subjt:  STEAIQAVQSLKLAKSTS-KMEDVINTKLSRLLKADLFDALTELQRQNELELSLQVFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMAEEVFHKLKKDG

Query:  LEPDTRAFNEMMGAYLQVDMIERAVETYELMKASGCTPDELTFKILIKNLEKFRE
        +  D   F +++  +L  ++   A+  Y  M+ S   P  L F++++K L  + E
Subjt:  LEPDTRAFNEMMGAYLQVDMIERAVETYELMKASGCTPDELTFKILIKNLEKFRE

Q9FKC3 Pentatricopeptide repeat-containing protein At5g48730, chloroplastic2.6e-1028.1Show/hide
Query:  SRVPSTEAIQAVQSLKLAKSTSKMEDVINTKLSRLLKADLFDALTELQRQNELELSLQVFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMAEEVFHKLK
        +++ S +AI  +   +  KS  + +      L R +   L + +T L+     E ++QVF+ ++ + WY+P++ +Y  +I+M+GK K  E A E+F ++ 
Subjt:  SRVPSTEAIQAVQSLKLAKSTSKMEDVINTKLSRLLKADLFDALTELQRQNELELSLQVFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMAEEVFHKLK

Query:  KDGLEPDTRAFNEMMGAYLQVDMIERAVETYELMKAS-GCTPDELTFKILIKN
         +G   +   +  ++ AY +    + A    E MK+S  C PD  T+ ILIK+
Subjt:  KDGLEPDTRAFNEMMGAYLQVDMIERAVETYELMKAS-GCTPDELTFKILIKN

Q9LVW6 Protein THYLAKOID ASSEMBLY 8, chloroplastic1.8e-1132.22Show/hide
Query:  NIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSTSKMEDVINTKLSRLLKADLFDALTELQRQNELELSLQVFKFMQNEEWYEP-DLRLYHGMILMM
        +I CG R   R PL + R+ STEAIQ++QSLK A  T     +    L RL+K+DL   L EL RQ+   L++ V   ++ E  Y P DL LY  ++  +
Subjt:  NIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSTSKMEDVINTKLSRLLKADLFDALTELQRQNELELSLQVFKFMQNEEWYEP-DLRLYHGMILMM

Query:  GKNKMIEMAEEVFHKLKKDGLEPDTRAFNEMMGAYLQVDMIERAVETYELMKASG-----CTPDELTFKILIKNLEKFRE
         +NK  +  + +  ++       D +A  +++ A +  +  E  V  Y LM+ SG        DE   ++L K L +  E
Subjt:  GKNKMIEMAEEVFHKLKKDGLEPDTRAFNEMMGAYLQVDMIERAVETYELMKASG-----CTPDELTFKILIKNLEKFRE

Q9STF9 Protein THYLAKOID ASSEMBLY 8-like, chloroplastic1.0e-1934.16Show/hide
Query:  RKPLGR-SRVPSTEAIQAVQSLK-LAKSTSKMEDVINTKLSRLLKADLFDALTELQRQNELELSLQVFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMA
        R PL R  ++   EA+  +  LK L +   K++  I T + RLLK D+   + EL+RQ E  L++++F+ +Q +EWY+PD+ +Y  +I+ + K+K ++ A
Subjt:  RKPLGR-SRVPSTEAIQAVQSLK-LAKSTSKMEDVINTKLSRLLKADLFDALTELQRQNELELSLQVFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMA

Query:  EEVFHKLKKDGLEPDTRAFNEMMGAYLQVDMIERAVETYELMKASGCTPDELTFKILIKNL
          ++ K+KK+ L PD++ + E++  +L+      A+  YE M  S   P+EL F++L+K L
Subjt:  EEVFHKLKKDGLEPDTRAFNEMMGAYLQVDMIERAVETYELMKASGCTPDELTFKILIKNL

Arabidopsis top hitse value%identityAlignment
AT1G62350.1 Pentatricopeptide repeat (PPR) superfamily protein3.4e-1830.97Show/hide
Query:  STEAIQAVQSLKLAKSTS-KMEDVINTKLSRLLKADLFDALTELQRQNELELSLQVFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMAEEVFHKLKKDG
        S E + A + LK  ++ S +++  I + +SRLLK+DL   L E QRQN++ L +++++ ++ E WY PD+  Y  M++M+ +NK ++  ++V+  LKK+ 
Subjt:  STEAIQAVQSLKLAKSTS-KMEDVINTKLSRLLKADLFDALTELQRQNELELSLQVFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMAEEVFHKLKKDG

Query:  LEPDTRAFNEMMGAYLQVDMIERAVETYELMKASGCTPDELTFKILIKNLEKFRE
        +  D   F +++  +L  ++   A+  Y  M+ S   P  L F++++K L  + E
Subjt:  LEPDTRAFNEMMGAYLQVDMIERAVETYELMKASGCTPDELTFKILIKNLEKFRE

AT3G27750.1 FUNCTIONS IN: molecular_function unknown1.3e-1232.22Show/hide
Query:  NIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSTSKMEDVINTKLSRLLKADLFDALTELQRQNELELSLQVFKFMQNEEWYEP-DLRLYHGMILMM
        +I CG R   R PL + R+ STEAIQ++QSLK A  T     +    L RL+K+DL   L EL RQ+   L++ V   ++ E  Y P DL LY  ++  +
Subjt:  NIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSTSKMEDVINTKLSRLLKADLFDALTELQRQNELELSLQVFKFMQNEEWYEP-DLRLYHGMILMM

Query:  GKNKMIEMAEEVFHKLKKDGLEPDTRAFNEMMGAYLQVDMIERAVETYELMKASG-----CTPDELTFKILIKNLEKFRE
         +NK  +  + +  ++       D +A  +++ A +  +  E  V  Y LM+ SG        DE   ++L K L +  E
Subjt:  GKNKMIEMAEEVFHKLKKDGLEPDTRAFNEMMGAYLQVDMIERAVETYELMKASG-----CTPDELTFKILIKNLEKFRE

AT3G46870.1 Pentatricopeptide repeat (PPR) superfamily protein7.4e-2134.16Show/hide
Query:  RKPLGR-SRVPSTEAIQAVQSLK-LAKSTSKMEDVINTKLSRLLKADLFDALTELQRQNELELSLQVFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMA
        R PL R  ++   EA+  +  LK L +   K++  I T + RLLK D+   + EL+RQ E  L++++F+ +Q +EWY+PD+ +Y  +I+ + K+K ++ A
Subjt:  RKPLGR-SRVPSTEAIQAVQSLK-LAKSTSKMEDVINTKLSRLLKADLFDALTELQRQNELELSLQVFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMA

Query:  EEVFHKLKKDGLEPDTRAFNEMMGAYLQVDMIERAVETYELMKASGCTPDELTFKILIKNL
          ++ K+KK+ L PD++ + E++  +L+      A+  YE M  S   P+EL F++L+K L
Subjt:  EEVFHKLKKDGLEPDTRAFNEMMGAYLQVDMIERAVETYELMKASGCTPDELTFKILIKNL

AT5G09320.1 Vacuolar sorting protein 9 (VPS9) domain1.4e-2433.82Show/hide
Query:  RKPLGRSRVPSTEAIQAVQSLKLA----------------KSTSKMEDVINTKLSRLLKADLFDALTELQRQNELELSLQVFKFMQNEEWYEPDLRLYHG
        RKPL R R+ S EAIQAVQ+LK A                 S++ ++ VI +K  RLLK D+   L EL RQNE  L+L+VF+ ++ E WY+P +R+Y  
Subjt:  RKPLGRSRVPSTEAIQAVQSLKLA----------------KSTSKMEDVINTKLSRLLKADLFDALTELQRQNELELSLQVFKFMQNEEWYEPDLRLYHG

Query:  MILMMGKNKMIEMAEEVFHKLKKD-GLEPDTRAFNEMMGAYLQVDMIERAVETYELMKASGCTPDELTFKILIKNLEKFRE-EFAAVVKKECNEYLDSPE
        MI +M  N ++E    ++  +K + GL  +   FN ++   L   + +  ++ Y  M++ G  PD  +F++L+  LE   E   +A+V+++ +EY     
Subjt:  MILMMGKNKMIEMAEEVFHKLKKD-GLEPDTRAFNEMMGAYLQVDMIERAVETYELMKASGCTPDELTFKILIKNLEKFRE-EFAAVVKKECNEYLDSPE

Query:  KFLNDVE
        +F+ + E
Subjt:  KFLNDVE

AT5G48730.1 Pentatricopeptide repeat (PPR) superfamily protein1.8e-1128.1Show/hide
Query:  SRVPSTEAIQAVQSLKLAKSTSKMEDVINTKLSRLLKADLFDALTELQRQNELELSLQVFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMAEEVFHKLK
        +++ S +AI  +   +  KS  + +      L R +   L + +T L+     E ++QVF+ ++ + WY+P++ +Y  +I+M+GK K  E A E+F ++ 
Subjt:  SRVPSTEAIQAVQSLKLAKSTSKMEDVINTKLSRLLKADLFDALTELQRQNELELSLQVFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMAEEVFHKLK

Query:  KDGLEPDTRAFNEMMGAYLQVDMIERAVETYELMKAS-GCTPDELTFKILIKN
         +G   +   +  ++ AY +    + A    E MK+S  C PD  T+ ILIK+
Subjt:  KDGLEPDTRAFNEMMGAYLQVDMIERAVETYELMKAS-GCTPDELTFKILIKN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAATCTACTCTAATGGGGCGTCTTCAACTCCATTTTCCTCAATTGGGTCTTCGCCAAAACCTCACGAACTCAAGCCTCCATTGTTGTACGGCAGCTCCACCTCCAAA
TATCATCTGTGGCCTCAGAAAGGGCTTGAGGAAGCCCTTAGGGAGGTCAAGGGTGCCCTCCACTGAGGCTATTCAAGCAGTTCAGTCCCTCAAGCTCGCTAAATCCACCT
CCAAAATGGAAGACGTTATCAATACCAAGCTCAGCAGATTGCTGAAAGCAGACTTGTTTGATGCTCTGACTGAATTACAGAGGCAAAATGAACTGGAATTATCGCTTCAG
GTCTTCAAATTTATGCAAAATGAAGAATGGTATGAGCCAGATTTAAGGTTGTACCATGGGATGATTCTGATGATGGGAAAGAACAAAATGATTGAAATGGCTGAAGAGGT
CTTCCATAAGTTAAAAAAGGATGGGTTAGAACCAGACACAAGAGCTTTTAATGAGATGATGGGAGCATATTTGCAAGTGGACATGATCGAAAGAGCAGTTGAGACATATG
AATTAATGAAAGCATCAGGTTGTACTCCAGATGAACTGACTTTCAAGATCTTGATCAAGAATCTCGAGAAATTTAGGGAAGAATTTGCTGCAGTAGTGAAGAAAGAATGT
AATGAGTACTTGGATTCTCCTGAGAAGTTCCTCAACGATGTTGAACATAAACTGACGATGAAAGGTCAAATTCTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAAATCTACTCTAATGGGGCGTCTTCAACTCCATTTTCCTCAATTGGGTCTTCGCCAAAACCTCACGAACTCAAGCCTCCATTGTTGTACGGCAGCTCCACCTCCAAA
TATCATCTGTGGCCTCAGAAAGGGCTTGAGGAAGCCCTTAGGGAGGTCAAGGGTGCCCTCCACTGAGGCTATTCAAGCAGTTCAGTCCCTCAAGCTCGCTAAATCCACCT
CCAAAATGGAAGACGTTATCAATACCAAGCTCAGCAGATTGCTGAAAGCAGACTTGTTTGATGCTCTGACTGAATTACAGAGGCAAAATGAACTGGAATTATCGCTTCAG
GTCTTCAAATTTATGCAAAATGAAGAATGGTATGAGCCAGATTTAAGGTTGTACCATGGGATGATTCTGATGATGGGAAAGAACAAAATGATTGAAATGGCTGAAGAGGT
CTTCCATAAGTTAAAAAAGGATGGGTTAGAACCAGACACAAGAGCTTTTAATGAGATGATGGGAGCATATTTGCAAGTGGACATGATCGAAAGAGCAGTTGAGACATATG
AATTAATGAAAGCATCAGGTTGTACTCCAGATGAACTGACTTTCAAGATCTTGATCAAGAATCTCGAGAAATTTAGGGAAGAATTTGCTGCAGTAGTGAAGAAAGAATGT
AATGAGTACTTGGATTCTCCTGAGAAGTTCCTCAACGATGTTGAACATAAACTGACGATGAAAGGTCAAATTCTTTAA
Protein sequenceShow/hide protein sequence
MKSTLMGRLQLHFPQLGLRQNLTNSSLHCCTAAPPPNIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSTSKMEDVINTKLSRLLKADLFDALTELQRQNELELSLQ
VFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMAEEVFHKLKKDGLEPDTRAFNEMMGAYLQVDMIERAVETYELMKASGCTPDELTFKILIKNLEKFREEFAAVVKKEC
NEYLDSPEKFLNDVEHKLTMKGQIL