CuGenDBv2

HG10007795 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10007795
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: SAP domain-containing protein
Genome location: Chr10:13553201..13554644
RNA-Seq Expression: HG10007795
Synteny: HG10007795
Gene Ontology terms:
  GO:0006807 - nitrogen compound metabolic process (biological process)
  GO:0006979 - response to oxidative stress (biological process)
  GO:0016310 - phosphorylation (biological process)
  GO:0044238 - primary metabolic process (biological process)
  GO:0044260 - cellular macromolecule metabolic process (biological process)
  GO:0098869 - cellular oxidant detoxification (biological process)
  GO:0004601 - peroxidase activity (molecular function)
  GO:0005515 - protein binding (molecular function)
  GO:0016301 - kinase activity (molecular function)
  GO:0020037 - heme binding (molecular function)
InterPro domains: IPR011990 - Tetratricopeptide-like helical domain superfamily
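The genome location string in the record above can be split into its parts programmatically. The following is a minimal sketch, not part of the database itself: the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re


def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'Chr10:13553201..13554644' style string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", loc.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError(f"start {start} is greater than end {end}")
    return seqid, start, end


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("Chr10:13553201..13554644")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the gene spans 1444 bp on Chr10.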


Homology
GenBank top hits (e value, % identity, alignment)
KAA0038380.1 Pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] (e value: 5.4e-125; identity: 92.06%)
Query:  MSKFLLSHAHLLTLPHKHHSFSLNHGVVPIRSVLSAPEKRGRKKRQSR-QQQLHPKDDDSTALEKALRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
        MSK LLSHAHLLTLP+ H SFSLNHG++PIRSVLSAP+KRGRKKRQSR QQQL  KDDDST+LE +LRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
Subjt:  MSKFLLSHAHLLTLPHKHHSFSLNHGVVPIRSVLSAPEKRGRKKRQSR-QQQLHPKDDDSTALEKALRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS

Query:  PGPRSFHGLVVSHTLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQAWLILTEELLRNKYLEDANEVFLKGAK
        PGPRSFHGLVVSHTLNGDTEGAMQSLRRELS+GLRPLHETFVALVRLFGSKGLA RGLEILAAME+LNYDIRQAWLILTEEL+RNKYLEDAN+VFLKGAK
Subjt:  PGPRSFHGLVVSHTLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQAWLILTEELLRNKYLEDANEVFLKGAK

Query:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ
         GLRATDKIYDL+IEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ
Subjt:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ

KAG7020856.1 putative pentatricopeptide repeat-containing protein [Cucurbita argyrosperma subsp. argyrosperma] (e value: 3.2e-125; identity: 91.12%)
Query:  MSKFLLSHAHLLTLPHKHHSFSLNHGVV-PIRSVLSAPEKRGRKKRQSRQQQLHPKDDDSTALEKALRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
        MSKFLLSH+ LLTLPHKHHSFSL++GV+ PIRSVLS  EKRGRKKRQSRQQQL  KDDDST LEK+LRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
Subjt:  MSKFLLSHAHLLTLPHKHHSFSLNHGVV-PIRSVLSAPEKRGRKKRQSRQQQLHPKDDDSTALEKALRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS

Query:  PGPRSFHGLVVSHTLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQAWLILTEELLRNKYLEDANEVFLKGAK
        PGPRSFHGLVVSH LN D EGAMQSLR+ELS GLRPLHETFVALVRLFG+KGLATRGLEILAAMEKLNYDIRQAWLIL EEL++NKYLEDAN+VFLKGAK
Subjt:  PGPRSFHGLVVSHTLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQAWLILTEELLRNKYLEDANEVFLKGAK

Query:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQVMYLTIG
        GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQVMYL+IG
Subjt:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQVMYLTIG

XP_008443747.1 PREDICTED: uncharacterized protein LOC103487261 isoform X2 [Cucumis melo] (e value: 5.4e-125; identity: 92.06%)
Query:  MSKFLLSHAHLLTLPHKHHSFSLNHGVVPIRSVLSAPEKRGRKKRQSR-QQQLHPKDDDSTALEKALRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
        MSK LLSHAHLLTLP+ H SFSLNHG++PIRSVLSAP+KRGRKKRQSR QQQL  KDDDST+LE +LRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
Subjt:  MSKFLLSHAHLLTLPHKHHSFSLNHGVVPIRSVLSAPEKRGRKKRQSR-QQQLHPKDDDSTALEKALRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS

Query:  PGPRSFHGLVVSHTLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQAWLILTEELLRNKYLEDANEVFLKGAK
        PGPRSFHGLVVSHTLNGDTEGAMQSLRRELS+GLRPLHETFVALVRLFGSKGLA RGLEILAAME+LNYDIRQAWLILTEEL+RNKYLEDAN+VFLKGAK
Subjt:  PGPRSFHGLVVSHTLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQAWLILTEELLRNKYLEDANEVFLKGAK

Query:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ
         GLRATDKIYDL+IEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ
Subjt:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ

XP_011660243.1 uncharacterized protein LOC101209618 isoform X1 [Cucumis sativus] (e value: 3.2e-125; identity: 92.46%)
Query:  MSKFLLSHAHLLTLPHKHHSFSLNHGVVPIRSVLSAPEKRGRKKRQSR-QQQLHPKDDDSTALEKALRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
        MSKFLLSHAHLLTLP  H SFSLNHG++PIRSVLSAP+KRGRKKRQSR QQQL PKD+DST+LE +LRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
Subjt:  MSKFLLSHAHLLTLPHKHHSFSLNHGVVPIRSVLSAPEKRGRKKRQSR-QQQLHPKDDDSTALEKALRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS

Query:  PGPRSFHGLVVSHTLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQAWLILTEELLRNKYLEDANEVFLKGAK
        PGPRSFHGLVVSHTLNGDTEGAMQSLRRELSAGL PLHETFVALVRLFGSKGLA RGLEILAAMEKLNYDIRQAWLILTEEL+R+KYLEDAN+VFLKGAK
Subjt:  PGPRSFHGLVVSHTLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQAWLILTEELLRNKYLEDANEVFLKGAK

Query:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ
         GLRATDKIYDL+IEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ
Subjt:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ

XP_038879291.1 uncharacterized protein LOC120071230 [Benincasa hispida] (e value: 2.1e-129; identity: 95.63%)
Query:  MSKFLLSHAHLLTLPHKHHSFSLNHGVVPIRSVLSAPEKRGRKKRQSR-QQQLHPKDDDSTALEKALRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
        MSKFLLSHAHLLTLP+KHHSFSLNHGVVPIRSVLSAP+KRGRKKRQ+R QQQLH KD DSTALEK+LRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
Subjt:  MSKFLLSHAHLLTLPHKHHSFSLNHGVVPIRSVLSAPEKRGRKKRQSR-QQQLHPKDDDSTALEKALRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS

Query:  PGPRSFHGLVVSHTLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQAWLILTEELLRNKYLEDANEVFLKGAK
        PGPRSFHGLVVSH LNGDTEGAMQSLRRELSAGL PLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQAWLILTEEL+RNKYLEDAN+VFLKGAK
Subjt:  PGPRSFHGLVVSHTLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQAWLILTEELLRNKYLEDANEVFLKGAK

Query:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ
        GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ
Subjt:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ

TrEMBL top hits (e value, % identity, alignment)
A0A1S3B8T6 uncharacterized protein LOC103487261 isoform X1 (e value: 2.6e-125; identity: 92.06%)
Query:  MSKFLLSHAHLLTLPHKHHSFSLNHGVVPIRSVLSAPEKRGRKKRQSR-QQQLHPKDDDSTALEKALRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
        MSK LLSHAHLLTLP+ H SFSLNHG++PIRSVLSAP+KRGRKKRQSR QQQL  KDDDST+LE +LRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
Subjt:  MSKFLLSHAHLLTLPHKHHSFSLNHGVVPIRSVLSAPEKRGRKKRQSR-QQQLHPKDDDSTALEKALRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS

Query:  PGPRSFHGLVVSHTLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQAWLILTEELLRNKYLEDANEVFLKGAK
        PGPRSFHGLVVSHTLNGDTEGAMQSLRRELS+GLRPLHETFVALVRLFGSKGLA RGLEILAAME+LNYDIRQAWLILTEEL+RNKYLEDAN+VFLKGAK
Subjt:  PGPRSFHGLVVSHTLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQAWLILTEELLRNKYLEDANEVFLKGAK

Query:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ
         GLRATDKIYDL+IEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ
Subjt:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ

A0A1S3B9H7 uncharacterized protein LOC103487261 isoform X2 (e value: 2.6e-125; identity: 92.06%)
Query:  MSKFLLSHAHLLTLPHKHHSFSLNHGVVPIRSVLSAPEKRGRKKRQSR-QQQLHPKDDDSTALEKALRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
        MSK LLSHAHLLTLP+ H SFSLNHG++PIRSVLSAP+KRGRKKRQSR QQQL  KDDDST+LE +LRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
Subjt:  MSKFLLSHAHLLTLPHKHHSFSLNHGVVPIRSVLSAPEKRGRKKRQSR-QQQLHPKDDDSTALEKALRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS

Query:  PGPRSFHGLVVSHTLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQAWLILTEELLRNKYLEDANEVFLKGAK
        PGPRSFHGLVVSHTLNGDTEGAMQSLRRELS+GLRPLHETFVALVRLFGSKGLA RGLEILAAME+LNYDIRQAWLILTEEL+RNKYLEDAN+VFLKGAK
Subjt:  PGPRSFHGLVVSHTLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQAWLILTEELLRNKYLEDANEVFLKGAK

Query:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ
         GLRATDKIYDL+IEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ
Subjt:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ

A0A5A7T4U0 Pentatricopeptide repeat-containing protein (e value: 2.6e-125; identity: 92.06%)
Query:  MSKFLLSHAHLLTLPHKHHSFSLNHGVVPIRSVLSAPEKRGRKKRQSR-QQQLHPKDDDSTALEKALRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
        MSK LLSHAHLLTLP+ H SFSLNHG++PIRSVLSAP+KRGRKKRQSR QQQL  KDDDST+LE +LRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
Subjt:  MSKFLLSHAHLLTLPHKHHSFSLNHGVVPIRSVLSAPEKRGRKKRQSR-QQQLHPKDDDSTALEKALRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS

Query:  PGPRSFHGLVVSHTLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQAWLILTEELLRNKYLEDANEVFLKGAK
        PGPRSFHGLVVSHTLNGDTEGAMQSLRRELS+GLRPLHETFVALVRLFGSKGLA RGLEILAAME+LNYDIRQAWLILTEEL+RNKYLEDAN+VFLKGAK
Subjt:  PGPRSFHGLVVSHTLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQAWLILTEELLRNKYLEDANEVFLKGAK

Query:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ
         GLRATDKIYDL+IEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ
Subjt:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ

A0A6J1EQ88 uncharacterized protein LOC111436825 isoform X1 (e value: 6.1e-122; identity: 90.87%)
Query:  MSKFLLSHAHLLTLPHKHHSFSLNHGVV-PIRSVLSAPEKRGRKKRQSRQQQLHPKDDDSTALEKALRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
        MSKFLLSH++LLTLPHKHHSFSL++GV  PIRSVLS  EKRGRKKRQSRQQQL  KDDDST  EK+LRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
Subjt:  MSKFLLSHAHLLTLPHKHHSFSLNHGVV-PIRSVLSAPEKRGRKKRQSRQQQLHPKDDDSTALEKALRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS

Query:  PGPRSFHGLVVSHTLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQAWLILTEELLRNKYLEDANEVFLKGAK
        PGPRSFHGLVVSH LN D EGAMQSLR+ELS GLRPLHETFVALVRLFG+KGLATRGLEILAAMEKLNYDIRQAWLIL EEL++NKYLEDAN+VFLKGAK
Subjt:  PGPRSFHGLVVSHTLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQAWLILTEELLRNKYLEDANEVFLKGAK

Query:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ
        GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ
Subjt:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ

A0A6J1L2D9 uncharacterized protein LOC111499221 isoform X1 (e value: 2.5e-120; identity: 90.48%)
Query:  MSKFLLSHAHLLTLPHKHHSFSLNHGVV-PIRSVLSAPEKRGRKKRQSRQQQLHPKDDDSTALEKALRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
        MSKFLLSH+ LLTLPHKHHSFSL++ V+ PIRSVLS  EKRGRKKRQSRQQQL  KD DST LEK+LRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
Subjt:  MSKFLLSHAHLLTLPHKHHSFSLNHGVV-PIRSVLSAPEKRGRKKRQSRQQQLHPKDDDSTALEKALRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS

Query:  PGPRSFHGLVVSHTLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQAWLILTEELLRNKYLEDANEVFLKGAK
        PGPRSFHGLVVSH LN D EGAMQSLR+ELS GLRPLHETFVALVRLFG+KGLATRGLEILAAMEKLNYDIRQAWLIL EEL++NKYLEDAN+VFLKGAK
Subjt:  PGPRSFHGLVVSHTLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQAWLILTEELLRNKYLEDANEVFLKGAK

Query:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ
        GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ
Subjt:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ

SwissProt top hits (e value, % identity, alignment)
O04504 Pentatricopeptide repeat-containing protein At1g09820 (e value: 4.9e-04; identity: 23.95%)
Query:  VIYDMVAAGLSPGPRSFHGLVVSHTLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRG--LEILAAMEKLNYDIRQAWLILTEELL----
        V+ +MV   +SP   +F+ L+     + +  G+M+  +  L   ++P   ++ +L+      GL   G   E ++  +K+     Q  LI    L+    
Subjt:  VIYDMVAAGLSPGPRSFHGLVVSHTLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRG--LEILAAMEKLNYDIRQAWLILTEELL----

Query:  RNKYLEDANEVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLS
        +N  L++A ++F      G   T ++Y++LI+  CK G   +   +  EME  G +     +NCL++
Subjt:  RNKYLEDANEVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLS

O64624 Pentatricopeptide repeat-containing protein At2g18940, chloroplastic (e value: 1.3e-04; identity: 28.69%)
Query:  SAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQ-AWLILTEELLRNKYLEDANEVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEI
        S G  P   T+ AL+++FG  G+ T  L +L  ME+ +       +  L    +R  + ++A  V     K G+      Y  +I+   KAG    AL++
Subjt:  SAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQ-AWLILTEELLRNKYLEDANEVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEI

Query:  SYEMEAAGRMATTFHFNCLLSV
         Y M+ AG +  T  +N +LS+
Subjt:  SYEMEAAGRMATTFHFNCLLSV

Q0WLC6 Pentatricopeptide repeat-containing protein MRL1, chloroplastic (e value: 6.5e-04; identity: 22.89%)
Query:  DVIYDMVAAGLSPGPRSFHGLVVSHTLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQAWL---ILTEELLRN
        +V + M  +G+     +F  L+      G    A  +     S  ++P    F AL+   G  G   R  ++LA M+   + I    +    L +     
Subjt:  DVIYDMVAAGLSPGPRSFHGLVVSHTLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQAWL---ILTEELLRN

Query:  KYLEDANEVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSV
          +E A EV+    K G+R T ++Y + +    K+GD   A  I  +M+          F+ L+ V
Subjt:  KYLEDANEVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSV

Q9XIM8 Pentatricopeptide repeat-containing protein At2g15980 (e value: 8.4e-04; identity: 25.95%)
Query:  GVSDVIYD---MVAAGLSPGPRSFHGLVVSHTLNGDTEGAMQSLRR-ELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAME--KLNYDIRQAWLILT
        G+ DV  D    +   + P   +F+ ++VS    G+TE   +  R  E   G  P   ++  L+  + ++GL +   ++   M+   + YDI  A+  + 
Subjt:  GVSDVIYD---MVAAGLSPGPRSFHGLVVSHTLNGDTEGAMQSLRR-ELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAME--KLNYDIRQAWLILT

Query:  EELLRNKYLEDANEVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAG
          L  N  +  A E+F      G+  T   Y+ L+   CKAGD  + L +  EM+  G
Subjt:  EELLRNKYLEDANEVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAG

Arabidopsis top hits (e value, % identity, alignment)
AT2G15980.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 6.0e-05; identity: 25.95%)
Query:  GVSDVIYD---MVAAGLSPGPRSFHGLVVSHTLNGDTEGAMQSLRR-ELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAME--KLNYDIRQAWLILT
        G+ DV  D    +   + P   +F+ ++VS    G+TE   +  R  E   G  P   ++  L+  + ++GL +   ++   M+   + YDI  A+  + 
Subjt:  GVSDVIYD---MVAAGLSPGPRSFHGLVVSHTLNGDTEGAMQSLRR-ELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAME--KLNYDIRQAWLILT

Query:  EELLRNKYLEDANEVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAG
          L  N  +  A E+F      G+  T   Y+ L+   CKAGD  + L +  EM+  G
Subjt:  EELLRNKYLEDANEVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAG

AT2G17140.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 1.0e-04; identity: 26.17%)
Query:  VSDVIYDMVAAGLSPGPRSFHGLVVSHTLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLN-YDIRQAWLILTEELLRN
        VS +  DMV  G++P   +F+ L+ +   +   + A +        G +P   TF  LVR +   GL  +GLE+L AME       +  +  +     R 
Subjt:  VSDVIYDMVAAGLSPGPRSFHGLVVSHTLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLN-YDIRQAWLILTEELLRN

Query:  KYLEDANEVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEME
           +D+ ++  K  + GL      ++  I   CK G   +A  I  +ME
Subjt:  KYLEDANEVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEME

AT2G18940.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 9.3e-06; identity: 28.69%)
Query:  SAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQ-AWLILTEELLRNKYLEDANEVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEI
        S G  P   T+ AL+++FG  G+ T  L +L  ME+ +       +  L    +R  + ++A  V     K G+      Y  +I+   KAG    AL++
Subjt:  SAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQ-AWLILTEELLRNKYLEDANEVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEI

Query:  SYEMEAAGRMATTFHFNCLLSV
         Y M+ AG +  T  +N +LS+
Subjt:  SYEMEAAGRMATTFHFNCLLSV

AT3G04260.1 plastid transcriptionally active 3 (e value: 2.1e-90; identity: 71.37%)
Query:  GVVPIRSVLSAPEKRGRKKRQSRQQQLHPKDDD--------STALEKALRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSPGPRSFHGLVVSHTLNGD
        G+  IR  +SAPEK+ R++R+ ++      DD          +ALE++LR TFM+ELM+RARN D  GVS+VIYDM+AAGLSPGPRSFHGLVV+H LNGD
Subjt:  GVVPIRSVLSAPEKRGRKKRQSRQQQLHPKDDD--------STALEKALRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSPGPRSFHGLVVSHTLNGD

Query:  TEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQAWLILTEELLRNKYLEDANEVFLKGAKGGLRATDKIYDLLIEEDC
         +GAM SLR+EL AG RPL ET +ALVRL GSKG ATRGLEILAAMEKL YDIRQAWLIL EEL+R  +LEDAN+VFLKGA+GG+RATD++YDL+IEEDC
Subjt:  TEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQAWLILTEELLRNKYLEDANEVFLKGAKGGLRATDKIYDLLIEEDC

Query:  KAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ
        KAGDHSNAL+ISYEMEAAGRMATTFHFNCLLSVQ
Subjt:  KAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ

AT4G34830.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 4.6e-05; identity: 22.89%)
Query:  DVIYDMVAAGLSPGPRSFHGLVVSHTLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQAWL---ILTEELLRN
        +V + M  +G+     +F  L+      G    A  +     S  ++P    F AL+   G  G   R  ++LA M+   + I    +    L +     
Subjt:  DVIYDMVAAGLSPGPRSFHGLVVSHTLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQAWL---ILTEELLRN

Query:  KYLEDANEVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSV
          +E A EV+    K G+R T ++Y + +    K+GD   A  I  +M+          F+ L+ V
Subjt:  KYLEDANEVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSV
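The % identity reported for each hit can be approximated from the alignment blocks themselves. In the layout used on this page, the middle line of each block shows a letter where query and subject are identical, `+` for a conservative substitution, and a space otherwise. The sketch below counts the letters in that middle line; the helper names are hypothetical, and it assumes the 8-character label column (`Query:  `) seen above and that identity is defined as identical columns divided by aligned columns, gaps included.

```python
PREFIX_WIDTH = 8  # width of the "Query:  " label column in this page's layout


def count_block(query_line: str, mid_line: str) -> tuple[int, int]:
    """Return (identical_columns, aligned_columns) for one alignment block."""
    query = query_line[PREFIX_WIDTH:].rstrip("\n")
    # The middle line may have lost trailing spaces; pad it back to query width.
    mid = mid_line[PREFIX_WIDTH:].rstrip("\n").ljust(len(query))
    identical = sum(1 for ch in mid[: len(query)] if ch.isalpha())
    return identical, len(query)


def percent_identity(blocks: list[tuple[str, str]]) -> float:
    """Combine several (query_line, mid_line) blocks into one percentage."""
    identical = aligned = 0
    for query_line, mid_line in blocks:
        i, n = count_block(query_line, mid_line)
        identical += i
        aligned += n
    return 100.0 * identical / aligned


# Toy block, not taken from the record: 5 identical columns out of 6.
print(round(percent_identity([("Query:  MSKFLL", "        MSK LL")]), 2))
```

Feeding all blocks of one hit to `percent_identity` should give a value close to the figure listed for that hit, provided the whitespace in the middle lines survived copying.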


Sequences
CDS sequence
ATGTCCAAATTCCTGCTCTCTCACGCTCACCTTCTCACCCTTCCCCACAAACACCATTCCTTTTCTCTCAACCATGGCGTCGTTCCCATCCGCTCAGTCCTATCTGCTCC
GGAGAAGCGAGGTAGAAAGAAGCGGCAGTCGCGGCAGCAACAATTACACCCAAAGGACGACGATTCCACTGCACTTGAGAAGGCCCTCCGCTTCACTTTCATGGAGGAAC
TCATGGACCGCGCTAGAAACCACGATCCCCTTGGCGTTTCTGATGTCATTTACGATATGGTTGCCGCTGGATTGAGCCCTGGACCTCGCTCGTTCCATGGATTAGTTGTT
TCTCATACTCTCAATGGTGATACTGAGGGAGCGATGCAATCTCTGAGAAGGGAATTAAGTGCTGGACTTCGTCCTCTTCACGAAACGTTTGTTGCATTAGTTCGGTTATT
TGGTTCCAAGGGTCTTGCTACTAGAGGCTTAGAAATCCTTGCAGCCATGGAGAAATTGAATTATGACATCCGTCAAGCATGGCTCATTCTTACTGAGGAACTCCTAAGGA
ACAAATATTTAGAAGACGCCAATGAAGTGTTCTTAAAGGGTGCCAAAGGGGGTCTCAGAGCCACCGACAAGATTTATGATCTTCTGATTGAGGAAGATTGTAAAGCCGGG
GATCATTCAAATGCCTTAGAGATCTCATATGAAATGGAGGCTGCCGGGCGGATGGCAACGACCTTTCATTTCAATTGCCTTCTTAGTGTCCAGGTGATGTATCTTACTAT
TGGATAA
mRNA sequence
ATGTCCAAATTCCTGCTCTCTCACGCTCACCTTCTCACCCTTCCCCACAAACACCATTCCTTTTCTCTCAACCATGGCGTCGTTCCCATCCGCTCAGTCCTATCTGCTCC
GGAGAAGCGAGGTAGAAAGAAGCGGCAGTCGCGGCAGCAACAATTACACCCAAAGGACGACGATTCCACTGCACTTGAGAAGGCCCTCCGCTTCACTTTCATGGAGGAAC
TCATGGACCGCGCTAGAAACCACGATCCCCTTGGCGTTTCTGATGTCATTTACGATATGGTTGCCGCTGGATTGAGCCCTGGACCTCGCTCGTTCCATGGATTAGTTGTT
TCTCATACTCTCAATGGTGATACTGAGGGAGCGATGCAATCTCTGAGAAGGGAATTAAGTGCTGGACTTCGTCCTCTTCACGAAACGTTTGTTGCATTAGTTCGGTTATT
TGGTTCCAAGGGTCTTGCTACTAGAGGCTTAGAAATCCTTGCAGCCATGGAGAAATTGAATTATGACATCCGTCAAGCATGGCTCATTCTTACTGAGGAACTCCTAAGGA
ACAAATATTTAGAAGACGCCAATGAAGTGTTCTTAAAGGGTGCCAAAGGGGGTCTCAGAGCCACCGACAAGATTTATGATCTTCTGATTGAGGAAGATTGTAAAGCCGGG
GATCATTCAAATGCCTTAGAGATCTCATATGAAATGGAGGCTGCCGGGCGGATGGCAACGACCTTTCATTTCAATTGCCTTCTTAGTGTCCAGGTGATGTATCTTACTAT
TGGATAA
Protein sequence
MSKFLLSHAHLLTLPHKHHSFSLNHGVVPIRSVLSAPEKRGRKKRQSRQQQLHPKDDDSTALEKALRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSPGPRSFHGLVV
SHTLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQAWLILTEELLRNKYLEDANEVFLKGAKGGLRATDKIYDLLIEEDCKAG
DHSNALEISYEMEAAGRMATTFHFNCLLSVQVMYLTIG
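As a consistency check between the CDS and protein sequences listed above, the CDS can be translated with the standard genetic code. This is a minimal sketch under the assumption that the standard nuclear code applies; `translate` is a hypothetical helper, not a CuGenDB tool. It strips whitespace so a sequence can be pasted with its line breaks, and stops at the first stop codon.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        # Raises KeyError for codons with characters other than A, C, G, T.
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nt of the CDS above.
print(translate("ATGTCCAAATTCCTGCTCTCTCACGCTCAC"))  # MSKFLLSHAH
```

The printed peptide matches the first ten residues of the protein sequence; translating the full CDS would be expected to reproduce the whole protein if the two records are consistent.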