; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi07G003530 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi07G003530
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationchr07:3757909..3759766
RNA-Seq ExpressionLsi07G003530
SyntenyLsi07G003530
Gene Ontology termsGO:0000373 - Group II intron splicing (biological process)
GO:0009658 - chloroplast organization (biological process)
GO:0003723 - RNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR044190 - Protein THYLAKOID ASSEMBLY 8-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044536.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]1.6e-11990.91Show/hide
Query:  MKSTLMGRLQLHFPQLGLRQNLTNSSLHCCTAAPPPNIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSTSKMEDVINTKLSRLLKADLFDALTELQ
        MKSTLMGRLQLHFPQLGLRQNLTN SLHCCTAAPPPNIICGLRKGL+KPLGRSRVPS EAIQAVQSLKLAK TSKMEDVINTKLSRLLKADLFDALTELQ
Subjt:  MKSTLMGRLQLHFPQLGLRQNLTNSSLHCCTAAPPPNIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSTSKMEDVINTKLSRLLKADLFDALTELQ

Query:  RQNELELSLQVFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMAEEVFHKLKKDGLEPDTRAFNEMMGAYLQVDMIERAVETYELMKASGCTPDELTFKI
        RQNELELSLQVFKFMQNEEWYEPDLRLYHGMI+MMGKNKMIEMAEEVFHKL+KDGLEPDTRAFNEMMGAYLQVDMIERA +TY LM ASGCTPD+LTFKI
Subjt:  RQNELELSLQVFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMAEEVFHKLKKDGLEPDTRAFNEMMGAYLQVDMIERAVETYELMKASGCTPDELTFKI

Query:  LIKNLEKFREEFAAVVKKECNEYLDSPEKFLNDVEHKLTMKG
        LIKNLEKF+EEFA VVKK+C+EYLD+P+KF ND   KLT KG
Subjt:  LIKNLEKFREEFAAVVKKECNEYLDSPEKFLNDVEHKLTMKG

XP_008454079.1 PREDICTED: pentatricopeptide repeat-containing protein At1g62350-like [Cucumis melo]8.0e-11990.5Show/hide
Query:  MKSTLMGRLQLHFPQLGLRQNLTNSSLHCCTAAPPPNIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSTSKMEDVINTKLSRLLKADLFDALTELQ
        MKSTLMGRLQLHFP+LGLRQNLTN SLHCCTAAPPPNIICGLRKGL+KPLGRSRVPS EAIQAVQSLKLAK TSKMEDVINTKLSRLLKADLFDALTELQ
Subjt:  MKSTLMGRLQLHFPQLGLRQNLTNSSLHCCTAAPPPNIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSTSKMEDVINTKLSRLLKADLFDALTELQ

Query:  RQNELELSLQVFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMAEEVFHKLKKDGLEPDTRAFNEMMGAYLQVDMIERAVETYELMKASGCTPDELTFKI
        RQNELELSLQVFKFMQNEEWYEPDLRLYHGMI+MMGKNKMIEMAEEVFHKL+KDGLEPDTRAFNEMMGAYLQVDMIERA +TY LM ASGCTPD+LTFKI
Subjt:  RQNELELSLQVFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMAEEVFHKLKKDGLEPDTRAFNEMMGAYLQVDMIERAVETYELMKASGCTPDELTFKI

Query:  LIKNLEKFREEFAAVVKKECNEYLDSPEKFLNDVEHKLTMKG
        LIKNLEKF+EEFA VVKK+C EYLD+P+KF ND   KLT KG
Subjt:  LIKNLEKFREEFAAVVKKECNEYLDSPEKFLNDVEHKLTMKG

XP_022980054.1 pentatricopeptide repeat-containing protein At1g62350-like [Cucurbita maxima]1.1e-11588.98Show/hide
Query:  MKSTLMGRLQLHFPQLGLRQNLTNSSLHCCTAAPPPNIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSTSKMEDVINTKLSRLLKADLFDALTELQ
        MKSTLMGRLQLH PQLG RQNLTN +LHCCTA PPPNIICGLRKG RKPLG+SRVPSTE+IQAVQSLKLAKS SKMEDVIN+KLSRLLKADLFDAL ELQ
Subjt:  MKSTLMGRLQLHFPQLGLRQNLTNSSLHCCTAAPPPNIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSTSKMEDVINTKLSRLLKADLFDALTELQ

Query:  RQNELELSLQVFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMAEEVFHKLKKDGLEPDTRAFNEMMGAYLQVDMIERAVETYELMKASGCTPDELTFKI
        RQNELELSLQVFKFM+NEEWYEPDL LYH MI MMGKNKMIEMAEEVFH  K+DGLEPDTRAFNEMMGAYLQVDM+ERAVETYELMKASGCTPD+LTFKI
Subjt:  RQNELELSLQVFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMAEEVFHKLKKDGLEPDTRAFNEMMGAYLQVDMIERAVETYELMKASGCTPDELTFKI

Query:  LIKNLEKFREEFAAVVKKECNEYLDSPEKFLNDVEHKLTMKGQIL
        LIKNLE+FREEFAAVVKKEC E+LDSPEKFL DVE KL MK QIL
Subjt:  LIKNLEKFREEFAAVVKKECNEYLDSPEKFLNDVEHKLTMKGQIL

XP_023527118.1 pentatricopeptide repeat-containing protein At1g62350-like [Cucurbita pepo subsp. pepo]2.8e-11688.98Show/hide
Query:  MKSTLMGRLQLHFPQLGLRQNLTNSSLHCCTAAPPPNIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSTSKMEDVINTKLSRLLKADLFDALTELQ
        MKSTLMGRLQLHFPQLG RQNLTN +LHCCTA PPPNIICGLRKG RKPLG+SRVPSTE+IQAVQSLKLAKS SKMEDVIN+KLSRLLKADLFDAL ELQ
Subjt:  MKSTLMGRLQLHFPQLGLRQNLTNSSLHCCTAAPPPNIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSTSKMEDVINTKLSRLLKADLFDALTELQ

Query:  RQNELELSLQVFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMAEEVFHKLKKDGLEPDTRAFNEMMGAYLQVDMIERAVETYELMKASGCTPDELTFKI
        RQNELELSLQVFKFM+NEEWYEPDL LYH MI MMGKNKMIEMAEEVFH+LK+DGLEPDTRAFNEMMGAY+QVDM+ERAVETYELMKASGC PD+LTFKI
Subjt:  RQNELELSLQVFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMAEEVFHKLKKDGLEPDTRAFNEMMGAYLQVDMIERAVETYELMKASGCTPDELTFKI

Query:  LIKNLEKFREEFAAVVKKECNEYLDSPEKFLNDVEHKLTMKGQIL
        LIKNLE+FREEFAAVVKKEC E+LDSPEKFL DVE KL MK QIL
Subjt:  LIKNLEKFREEFAAVVKKECNEYLDSPEKFLNDVEHKLTMKGQIL

XP_038878176.1 pentatricopeptide repeat-containing protein At1g62350-like [Benincasa hispida]4.1e-12393.06Show/hide
Query:  MKSTLMGRLQLHFPQLGLRQNLTNSSLHCCTAAPPPNIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSTSKMEDVINTKLSRLLKADLFDALTELQ
        MKSTLMG LQ HFPQLGLRQ+LTN SLHCCTAAPPPNIICGLRKGLRKPLGRSRVPS EAIQAVQSLKLAKSTSKMEDVIN+KLSRLLKADLFDALTELQ
Subjt:  MKSTLMGRLQLHFPQLGLRQNLTNSSLHCCTAAPPPNIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSTSKMEDVINTKLSRLLKADLFDALTELQ

Query:  RQNELELSLQVFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMAEEVFHKLKKDGLEPDTRAFNEMMGAYLQVDMIERAVETYELMKASGCTPDELTFKI
        RQNELELSLQVFKF+QNEEWYEPDLRLYHGMILM GKNKMIEMAEE+FHKLKKDGLEPD RAFNEMMGAYLQVDM+ERAVETYELMKASGCTPD+LTFKI
Subjt:  RQNELELSLQVFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMAEEVFHKLKKDGLEPDTRAFNEMMGAYLQVDMIERAVETYELMKASGCTPDELTFKI

Query:  LIKNLEKFREEFAAVVKKECNEYLDSPEKFLNDVEHKLTMKGQIL
        LIKNLEKFREEFAAVVKKECN YLDSPEKFLNDVE K T KG+IL
Subjt:  LIKNLEKFREEFAAVVKKECNEYLDSPEKFLNDVEHKLTMKGQIL

TrEMBL top hitse value%identityAlignment
A0A0A0KU23 Uncharacterized protein3.8e-11487.76Show/hide
Query:  MKSTLMGRLQLHFPQLGLRQNLTNSSLHCCTAAPPPNIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSTSKMEDVINTKLSRLLKADLFDALTELQ
        MKSTL+G LQLHF QLGLRQNLTN SL C TAAPPPNIICGLRKG  +PLG SRVPS EAIQAVQSLKLAKSTSKMEDVINTKL RLLKADLFDAL+ELQ
Subjt:  MKSTLMGRLQLHFPQLGLRQNLTNSSLHCCTAAPPPNIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSTSKMEDVINTKLSRLLKADLFDALTELQ

Query:  RQNELELSLQVFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMAEEVFHKLKKDGLEPDTRAFNEMMGAYLQVDMIERAVETYELMKASGCTPDELTFKI
        RQNELELSLQVFKFMQNEEW+EPDLRLYHGMI++MGKNKMIEMAEEVFHKL+KDGLEPDTRAFNEMMGAYLQVDMIERAVETY LM ASGCTPDELTFKI
Subjt:  RQNELELSLQVFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMAEEVFHKLKKDGLEPDTRAFNEMMGAYLQVDMIERAVETYELMKASGCTPDELTFKI

Query:  LIKNLEKFREEFAAVVKKECNEYLDSPEKFLNDVEHKLTMKGQIL
        LIKNLEKFREEFA VVKK+CNEYLD+P+KF ND   KLT K +IL
Subjt:  LIKNLEKFREEFAAVVKKECNEYLDSPEKFLNDVEHKLTMKGQIL

A0A1S3BX97 pentatricopeptide repeat-containing protein At1g62350-like3.9e-11990.5Show/hide
Query:  MKSTLMGRLQLHFPQLGLRQNLTNSSLHCCTAAPPPNIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSTSKMEDVINTKLSRLLKADLFDALTELQ
        MKSTLMGRLQLHFP+LGLRQNLTN SLHCCTAAPPPNIICGLRKGL+KPLGRSRVPS EAIQAVQSLKLAK TSKMEDVINTKLSRLLKADLFDALTELQ
Subjt:  MKSTLMGRLQLHFPQLGLRQNLTNSSLHCCTAAPPPNIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSTSKMEDVINTKLSRLLKADLFDALTELQ

Query:  RQNELELSLQVFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMAEEVFHKLKKDGLEPDTRAFNEMMGAYLQVDMIERAVETYELMKASGCTPDELTFKI
        RQNELELSLQVFKFMQNEEWYEPDLRLYHGMI+MMGKNKMIEMAEEVFHKL+KDGLEPDTRAFNEMMGAYLQVDMIERA +TY LM ASGCTPD+LTFKI
Subjt:  RQNELELSLQVFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMAEEVFHKLKKDGLEPDTRAFNEMMGAYLQVDMIERAVETYELMKASGCTPDELTFKI

Query:  LIKNLEKFREEFAAVVKKECNEYLDSPEKFLNDVEHKLTMKG
        LIKNLEKF+EEFA VVKK+C EYLD+P+KF ND   KLT KG
Subjt:  LIKNLEKFREEFAAVVKKECNEYLDSPEKFLNDVEHKLTMKG

A0A5A7TQZ8 Pentatricopeptide repeat-containing protein7.8e-12090.91Show/hide
Query:  MKSTLMGRLQLHFPQLGLRQNLTNSSLHCCTAAPPPNIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSTSKMEDVINTKLSRLLKADLFDALTELQ
        MKSTLMGRLQLHFPQLGLRQNLTN SLHCCTAAPPPNIICGLRKGL+KPLGRSRVPS EAIQAVQSLKLAK TSKMEDVINTKLSRLLKADLFDALTELQ
Subjt:  MKSTLMGRLQLHFPQLGLRQNLTNSSLHCCTAAPPPNIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSTSKMEDVINTKLSRLLKADLFDALTELQ

Query:  RQNELELSLQVFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMAEEVFHKLKKDGLEPDTRAFNEMMGAYLQVDMIERAVETYELMKASGCTPDELTFKI
        RQNELELSLQVFKFMQNEEWYEPDLRLYHGMI+MMGKNKMIEMAEEVFHKL+KDGLEPDTRAFNEMMGAYLQVDMIERA +TY LM ASGCTPD+LTFKI
Subjt:  RQNELELSLQVFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMAEEVFHKLKKDGLEPDTRAFNEMMGAYLQVDMIERAVETYELMKASGCTPDELTFKI

Query:  LIKNLEKFREEFAAVVKKECNEYLDSPEKFLNDVEHKLTMKG
        LIKNLEKF+EEFA VVKK+C+EYLD+P+KF ND   KLT KG
Subjt:  LIKNLEKFREEFAAVVKKECNEYLDSPEKFLNDVEHKLTMKG

A0A5D3CZ95 Pentatricopeptide repeat-containing protein3.9e-11990.5Show/hide
Query:  MKSTLMGRLQLHFPQLGLRQNLTNSSLHCCTAAPPPNIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSTSKMEDVINTKLSRLLKADLFDALTELQ
        MKSTLMGRLQLHFP+LGLRQNLTN SLHCCTAAPPPNIICGLRKGL+KPLGRSRVPS EAIQAVQSLKLAK TSKMEDVINTKLSRLLKADLFDALTELQ
Subjt:  MKSTLMGRLQLHFPQLGLRQNLTNSSLHCCTAAPPPNIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSTSKMEDVINTKLSRLLKADLFDALTELQ

Query:  RQNELELSLQVFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMAEEVFHKLKKDGLEPDTRAFNEMMGAYLQVDMIERAVETYELMKASGCTPDELTFKI
        RQNELELSLQVFKFMQNEEWYEPDLRLYHGMI+MMGKNKMIEMAEEVFHKL+KDGLEPDTRAFNEMMGAYLQVDMIERA +TY LM ASGCTPD+LTFKI
Subjt:  RQNELELSLQVFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMAEEVFHKLKKDGLEPDTRAFNEMMGAYLQVDMIERAVETYELMKASGCTPDELTFKI

Query:  LIKNLEKFREEFAAVVKKECNEYLDSPEKFLNDVEHKLTMKG
        LIKNLEKF+EEFA VVKK+C EYLD+P+KF ND   KLT KG
Subjt:  LIKNLEKFREEFAAVVKKECNEYLDSPEKFLNDVEHKLTMKG

A0A6J1IY98 pentatricopeptide repeat-containing protein At1g62350-like5.2e-11688.98Show/hide
Query:  MKSTLMGRLQLHFPQLGLRQNLTNSSLHCCTAAPPPNIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSTSKMEDVINTKLSRLLKADLFDALTELQ
        MKSTLMGRLQLH PQLG RQNLTN +LHCCTA PPPNIICGLRKG RKPLG+SRVPSTE+IQAVQSLKLAKS SKMEDVIN+KLSRLLKADLFDAL ELQ
Subjt:  MKSTLMGRLQLHFPQLGLRQNLTNSSLHCCTAAPPPNIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSTSKMEDVINTKLSRLLKADLFDALTELQ

Query:  RQNELELSLQVFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMAEEVFHKLKKDGLEPDTRAFNEMMGAYLQVDMIERAVETYELMKASGCTPDELTFKI
        RQNELELSLQVFKFM+NEEWYEPDL LYH MI MMGKNKMIEMAEEVFH  K+DGLEPDTRAFNEMMGAYLQVDM+ERAVETYELMKASGCTPD+LTFKI
Subjt:  RQNELELSLQVFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMAEEVFHKLKKDGLEPDTRAFNEMMGAYLQVDMIERAVETYELMKASGCTPDELTFKI

Query:  LIKNLEKFREEFAAVVKKECNEYLDSPEKFLNDVEHKLTMKGQIL
        LIKNLE+FREEFAAVVKKEC E+LDSPEKFL DVE KL MK QIL
Subjt:  LIKNLEKFREEFAAVVKKECNEYLDSPEKFLNDVEHKLTMKGQIL

SwissProt top hitse value%identityAlignment
A7LN87 Pentatricopeptide repeat-containing protein PPR5, chloroplastic4.9e-1026.37Show/hide
Query:  STEAIQAVQSL--KLAKSTSKMEDVINTKLSRLLKADLFDALTELQRQNELELSLQVFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMAEEVFHKLKKD
        + EA + V+SL  + A    ++  V++  +  +     F    EL R++     L VF++MQ + WY  D  +Y  +I +MG+   I MA  +F +++  
Subjt:  STEAIQAVQSL--KLAKSTSKMEDVINTKLSRLLKADLFDALTELQRQNELELSLQVFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMAEEVFHKLKKD

Query:  GLEPDTRAFNEMMGAYL----QVDMIERAVETYELMKA-SGCTPDELTFKILIKNLEKFRE-EFAAVVKKECNEYLDSPEKF
        G +PDT  +N ++GA+L    +   + +A+  +E MK    C P  +T+ IL++   +  + +   ++ K+ +E + SP+ +
Subjt:  GLEPDTRAFNEMMGAYL----QVDMIERAVETYELMKA-SGCTPDELTFKILIKNLEKFRE-EFAAVVKKECNEYLDSPEKF

Q1PFH7 Pentatricopeptide repeat-containing protein At1g623505.4e-1730.97Show/hide
Query:  STEAIQAVQSLKLAKSTS-KMEDVINTKLSRLLKADLFDALTELQRQNELELSLQVFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMAEEVFHKLKKDG
        S E + A + LK  ++ S +++  I + +SRLLK+DL   L E QRQN++ L +++++ ++ E WY PD+  Y  M++M+ +NK ++  ++V+  LKK+ 
Subjt:  STEAIQAVQSLKLAKSTS-KMEDVINTKLSRLLKADLFDALTELQRQNELELSLQVFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMAEEVFHKLKKDG

Query:  LEPDTRAFNEMMGAYLQVDMIERAVETYELMKASGCTPDELTFKILIKNLEKFRE
        +  D   F +++  +L  ++   A+  Y  M+ S   P  L F++++K L  + E
Subjt:  LEPDTRAFNEMMGAYLQVDMIERAVETYELMKASGCTPDELTFKILIKNLEKFRE

Q9FKC3 Pentatricopeptide repeat-containing protein At5g48730, chloroplastic2.9e-1028.1Show/hide
Query:  SRVPSTEAIQAVQSLKLAKSTSKMEDVINTKLSRLLKADLFDALTELQRQNELELSLQVFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMAEEVFHKLK
        +++ S +AI  +   +  KS  + +      L R +   L + +T L+     E ++QVF+ ++ + WY+P++ +Y  +I+M+GK K  E A E+F ++ 
Subjt:  SRVPSTEAIQAVQSLKLAKSTSKMEDVINTKLSRLLKADLFDALTELQRQNELELSLQVFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMAEEVFHKLK

Query:  KDGLEPDTRAFNEMMGAYLQVDMIERAVETYELMKAS-GCTPDELTFKILIKN
         +G   +   +  ++ AY +    + A    E MK+S  C PD  T+ ILIK+
Subjt:  KDGLEPDTRAFNEMMGAYLQVDMIERAVETYELMKAS-GCTPDELTFKILIKN

Q9LVW6 Protein THYLAKOID ASSEMBLY 8, chloroplastic2.0e-1132.22Show/hide
Query:  NIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSTSKMEDVINTKLSRLLKADLFDALTELQRQNELELSLQVFKFMQNEEWYEP-DLRLYHGMILMM
        +I CG R   R PL + R+ STEAIQ++QSLK A  T     +    L RL+K+DL   L EL RQ+   L++ V   ++ E  Y P DL LY  ++  +
Subjt:  NIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSTSKMEDVINTKLSRLLKADLFDALTELQRQNELELSLQVFKFMQNEEWYEP-DLRLYHGMILMM

Query:  GKNKMIEMAEEVFHKLKKDGLEPDTRAFNEMMGAYLQVDMIERAVETYELMKASG-----CTPDELTFKILIKNLEKFRE
         +NK  +  + +  ++       D +A  +++ A +  +  E  V  Y LM+ SG        DE   ++L K L +  E
Subjt:  GKNKMIEMAEEVFHKLKKDGLEPDTRAFNEMMGAYLQVDMIERAVETYELMKASG-----CTPDELTFKILIKNLEKFRE

Q9STF9 Protein THYLAKOID ASSEMBLY 8-like, chloroplastic1.2e-1934.16Show/hide
Query:  RKPLGR-SRVPSTEAIQAVQSLK-LAKSTSKMEDVINTKLSRLLKADLFDALTELQRQNELELSLQVFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMA
        R PL R  ++   EA+  +  LK L +   K++  I T + RLLK D+   + EL+RQ E  L++++F+ +Q +EWY+PD+ +Y  +I+ + K+K ++ A
Subjt:  RKPLGR-SRVPSTEAIQAVQSLK-LAKSTSKMEDVINTKLSRLLKADLFDALTELQRQNELELSLQVFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMA

Query:  EEVFHKLKKDGLEPDTRAFNEMMGAYLQVDMIERAVETYELMKASGCTPDELTFKILIKNL
          ++ K+KK+ L PD++ + E++  +L+      A+  YE M  S   P+EL F++L+K L
Subjt:  EEVFHKLKKDGLEPDTRAFNEMMGAYLQVDMIERAVETYELMKASGCTPDELTFKILIKNL

Arabidopsis top hitse value%identityAlignment
AT1G62350.1 Pentatricopeptide repeat (PPR) superfamily protein3.8e-1830.97Show/hide
Query:  STEAIQAVQSLKLAKSTS-KMEDVINTKLSRLLKADLFDALTELQRQNELELSLQVFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMAEEVFHKLKKDG
        S E + A + LK  ++ S +++  I + +SRLLK+DL   L E QRQN++ L +++++ ++ E WY PD+  Y  M++M+ +NK ++  ++V+  LKK+ 
Subjt:  STEAIQAVQSLKLAKSTS-KMEDVINTKLSRLLKADLFDALTELQRQNELELSLQVFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMAEEVFHKLKKDG

Query:  LEPDTRAFNEMMGAYLQVDMIERAVETYELMKASGCTPDELTFKILIKNLEKFRE
        +  D   F +++  +L  ++   A+  Y  M+ S   P  L F++++K L  + E
Subjt:  LEPDTRAFNEMMGAYLQVDMIERAVETYELMKASGCTPDELTFKILIKNLEKFRE

AT3G27750.1 FUNCTIONS IN: molecular_function unknown1.4e-1232.22Show/hide
Query:  NIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSTSKMEDVINTKLSRLLKADLFDALTELQRQNELELSLQVFKFMQNEEWYEP-DLRLYHGMILMM
        +I CG R   R PL + R+ STEAIQ++QSLK A  T     +    L RL+K+DL   L EL RQ+   L++ V   ++ E  Y P DL LY  ++  +
Subjt:  NIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSTSKMEDVINTKLSRLLKADLFDALTELQRQNELELSLQVFKFMQNEEWYEP-DLRLYHGMILMM

Query:  GKNKMIEMAEEVFHKLKKDGLEPDTRAFNEMMGAYLQVDMIERAVETYELMKASG-----CTPDELTFKILIKNLEKFRE
         +NK  +  + +  ++       D +A  +++ A +  +  E  V  Y LM+ SG        DE   ++L K L +  E
Subjt:  GKNKMIEMAEEVFHKLKKDGLEPDTRAFNEMMGAYLQVDMIERAVETYELMKASG-----CTPDELTFKILIKNLEKFRE

AT3G46870.1 Pentatricopeptide repeat (PPR) superfamily protein8.2e-2134.16Show/hide
Query:  RKPLGR-SRVPSTEAIQAVQSLK-LAKSTSKMEDVINTKLSRLLKADLFDALTELQRQNELELSLQVFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMA
        R PL R  ++   EA+  +  LK L +   K++  I T + RLLK D+   + EL+RQ E  L++++F+ +Q +EWY+PD+ +Y  +I+ + K+K ++ A
Subjt:  RKPLGR-SRVPSTEAIQAVQSLK-LAKSTSKMEDVINTKLSRLLKADLFDALTELQRQNELELSLQVFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMA

Query:  EEVFHKLKKDGLEPDTRAFNEMMGAYLQVDMIERAVETYELMKASGCTPDELTFKILIKNL
          ++ K+KK+ L PD++ + E++  +L+      A+  YE M  S   P+EL F++L+K L
Subjt:  EEVFHKLKKDGLEPDTRAFNEMMGAYLQVDMIERAVETYELMKASGCTPDELTFKILIKNL

AT5G09320.1 Vacuolar sorting protein 9 (VPS9) domain1.6e-2433.82Show/hide
Query:  RKPLGRSRVPSTEAIQAVQSLKLA----------------KSTSKMEDVINTKLSRLLKADLFDALTELQRQNELELSLQVFKFMQNEEWYEPDLRLYHG
        RKPL R R+ S EAIQAVQ+LK A                 S++ ++ VI +K  RLLK D+   L EL RQNE  L+L+VF+ ++ E WY+P +R+Y  
Subjt:  RKPLGRSRVPSTEAIQAVQSLKLA----------------KSTSKMEDVINTKLSRLLKADLFDALTELQRQNELELSLQVFKFMQNEEWYEPDLRLYHG

Query:  MILMMGKNKMIEMAEEVFHKLKKD-GLEPDTRAFNEMMGAYLQVDMIERAVETYELMKASGCTPDELTFKILIKNLEKFRE-EFAAVVKKECNEYLDSPE
        MI +M  N ++E    ++  +K + GL  +   FN ++   L   + +  ++ Y  M++ G  PD  +F++L+  LE   E   +A+V+++ +EY     
Subjt:  MILMMGKNKMIEMAEEVFHKLKKD-GLEPDTRAFNEMMGAYLQVDMIERAVETYELMKASGCTPDELTFKILIKNLEKFRE-EFAAVVKKECNEYLDSPE

Query:  KFLNDVE
        +F+ + E
Subjt:  KFLNDVE

AT5G48730.1 Pentatricopeptide repeat (PPR) superfamily protein2.0e-1128.1Show/hide
Query:  SRVPSTEAIQAVQSLKLAKSTSKMEDVINTKLSRLLKADLFDALTELQRQNELELSLQVFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMAEEVFHKLK
        +++ S +AI  +   +  KS  + +      L R +   L + +T L+     E ++QVF+ ++ + WY+P++ +Y  +I+M+GK K  E A E+F ++ 
Subjt:  SRVPSTEAIQAVQSLKLAKSTSKMEDVINTKLSRLLKADLFDALTELQRQNELELSLQVFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMAEEVFHKLK

Query:  KDGLEPDTRAFNEMMGAYLQVDMIERAVETYELMKAS-GCTPDELTFKILIKN
         +G   +   +  ++ AY +    + A    E MK+S  C PD  T+ ILIK+
Subjt:  KDGLEPDTRAFNEMMGAYLQVDMIERAVETYELMKAS-GCTPDELTFKILIKN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGTGCTGCGTTAGGGCAAGGCAGTGAGAGAAGGTTGAATTGCAGGCTGAGTCAAATTCGGGCGAGTTGGGCGGTTGAGATGAAATCTACTCTAATGGGGCGTCTTCA
ACTCCATTTTCCTCAATTGGGTCTTCGCCAAAACCTCACGAACTCAAGCCTCCATTGTTGTACGGCAGCTCCACCTCCAAATATCATCTGTGGCCTCAGAAAGGGCTTGA
GGAAGCCCTTAGGGAGGTCAAGGGTGCCCTCCACTGAGGCTATTCAAGCAGTTCAGTCCCTCAAGCTCGCTAAATCCACCTCCAAAATGGAAGACGTTATCAATACCAAG
CTCAGCAGATTGCTGAAAGCAGACTTGTTTGATGCTCTGACTGAATTACAGAGGCAAAATGAACTGGAATTATCGCTTCAGGTCTTCAAATTTATGCAAAATGAAGAATG
GTATGAGCCAGATTTAAGGTTGTACCATGGGATGATTCTGATGATGGGAAAGAACAAAATGATTGAAATGGCTGAAGAGGTCTTCCATAAGTTAAAAAAGGATGGGTTAG
AACCAGACACAAGAGCTTTTAATGAGATGATGGGAGCATATTTGCAAGTGGACATGATCGAAAGAGCAGTTGAGACATATGAATTAATGAAAGCATCAGGTTGTACTCCA
GATGAACTGACTTTCAAGATCTTGATCAAGAATCTCGAGAAATTTAGGGAAGAATTTGCTGCAGTAGTGAAGAAAGAATGTAATGAGTACTTGGATTCTCCTGAGAAGTT
CCTCAACGATGTTGAACATAAACTGACGATGAAAGGTCAAATTCTTTAA
mRNA sequenceShow/hide mRNA sequence
TACCGAAGTCGCAACGCCGAGCGCGTACAGTACAACCACACTTCCAACCACATCCTTTCCGGACCGAGTCACGGCGCCGATTCCGGCGATTCCGGCAATTCCAACCGGAA
CTCGGAGCATCACTTTTTTTCCGACGTTCTTCGCCACAGCCTCTGCGAATAATTCGAGCGCCCAATGCTTGAAGCGGATCGGGCCTAGGGTTTCGCCTTCACGGTCCGAG
TCATCGCGATTGACGGCTCGGAGCGCTGAGTCCACCTGGAGGCGAGTCGGGGGAGAGGGAAGCCAGTTGACGATTAGTGGATGCGGCCGGACGACGGTGAGGAGGTGGTG
AAGGTGGTCGGCGGCGGCGCAGAGTTGGTAGGGGAAAACGCCGTCGTATGCGTGCTGCGTTAGGGCAAGGCAGTGAGAGAAGGTTGAATTGCAGGCTGAGTCAAATTCGG
GCGAGTTGGGCGGTTGAGATGAAATCTACTCTAATGGGGCGTCTTCAACTCCATTTTCCTCAATTGGGTCTTCGCCAAAACCTCACGAACTCAAGCCTCCATTGTTGTAC
GGCAGCTCCACCTCCAAATATCATCTGTGGCCTCAGAAAGGGCTTGAGGAAGCCCTTAGGGAGGTCAAGGGTGCCCTCCACTGAGGCTATTCAAGCAGTTCAGTCCCTCA
AGCTCGCTAAATCCACCTCCAAAATGGAAGACGTTATCAATACCAAGCTCAGCAGATTGCTGAAAGCAGACTTGTTTGATGCTCTGACTGAATTACAGAGGCAAAATGAA
CTGGAATTATCGCTTCAGGTCTTCAAATTTATGCAAAATGAAGAATGGTATGAGCCAGATTTAAGGTTGTACCATGGGATGATTCTGATGATGGGAAAGAACAAAATGAT
TGAAATGGCTGAAGAGGTCTTCCATAAGTTAAAAAAGGATGGGTTAGAACCAGACACAAGAGCTTTTAATGAGATGATGGGAGCATATTTGCAAGTGGACATGATCGAAA
GAGCAGTTGAGACATATGAATTAATGAAAGCATCAGGTTGTACTCCAGATGAACTGACTTTCAAGATCTTGATCAAGAATCTCGAGAAATTTAGGGAAGAATTTGCTGCA
GTAGTGAAGAAAGAATGTAATGAGTACTTGGATTCTCCTGAGAAGTTCCTCAACGATGTTGAACATAAACTGACGATGAAAGGTCAAATTCTTTAA
Protein sequenceShow/hide protein sequence
MRAALGQGSERRLNCRLSQIRASWAVEMKSTLMGRLQLHFPQLGLRQNLTNSSLHCCTAAPPPNIICGLRKGLRKPLGRSRVPSTEAIQAVQSLKLAKSTSKMEDVINTK
LSRLLKADLFDALTELQRQNELELSLQVFKFMQNEEWYEPDLRLYHGMILMMGKNKMIEMAEEVFHKLKKDGLEPDTRAFNEMMGAYLQVDMIERAVETYELMKASGCTP
DELTFKILIKNLEKFREEFAAVVKKECNEYLDSPEKFLNDVEHKLTMKGQIL