CuGenDBv2

Carg16460 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene ID: Carg16460
Organism: Cucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Description: SAP domain-containing protein
Genome location: Carg_Chr12:7623449..7624778
RNA-Seq Expression: Carg16460
Synteny: Carg16460
Gene Ontology terms:
GO:0006807 - nitrogen compound metabolic process (biological process)
GO:0006979 - response to oxidative stress (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0044260 - cellular macromolecule metabolic process (biological process)
GO:0098869 - cellular oxidant detoxification (biological process)
GO:0004601 - peroxidase activity (molecular function)
GO:0005515 - protein binding (molecular function)
GO:0020037 - heme binding (molecular function)
InterPro domains: IPR011990 - Tetratricopeptide-like helical domain superfamily
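
The genome location field above can be split into a sequence name and coordinates. A minimal sketch, assuming the `name:start..end` layout shown in this record and assuming the coordinates are 1-based and inclusive (the page does not state this); `parse_location` is a hypothetical helper name, not part of any CuGenDB tool:

```python
import re

def parse_location(text):
    """Split 'name:start..end' into (name, start, end, span).

    Assumption: coordinates are 1-based and inclusive, so the
    span is end - start + 1.
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    name = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return name, start, end, end - start + 1

# Location string copied from this record.
print(parse_location("Carg_Chr12:7623449..7624778"))
# -> ('Carg_Chr12', 7623449, 7624778, 1330)
```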


Homology
GenBank top hits (e value, % identity, alignment)
KAG6588298.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia] (e value: 2.6e-135; identity: 99.6%)
Query:  MSKFLLSHSCLLTLPHKHHSFSLHNGVLPPIRSVLSTEKRGRKKRQSRQQQLQQKDDDSTVLEKSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSP
        MSKFLLSHSCLLTLPHKHHSFSLHNGVLPP+RSVLSTEKRGRKKRQSRQQQLQQKDDDSTVLEKSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSP
Subjt:  MSKFLLSHSCLLTLPHKHHSFSLHNGVLPPIRSVLSTEKRGRKKRQSRQQQLQQKDDDSTVLEKSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSP

Query:  GPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQAWLILIEELVKNKYLEDANKVFLKGAKG
        GPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQAWLILIEELVKNKYLEDANKVFLKGAKG
Subjt:  GPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQAWLILIEELVKNKYLEDANKVFLKGAKG

Query:  GLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ
        GLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ
Subjt:  GLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ

KAG7020856.1 putative pentatricopeptide repeat-containing protein [Cucurbita argyrosperma subsp. argyrosperma] (e value: 1.7e-139; identity: 100%)
Query:  MSKFLLSHSCLLTLPHKHHSFSLHNGVLPPIRSVLSTEKRGRKKRQSRQQQLQQKDDDSTVLEKSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSP
        MSKFLLSHSCLLTLPHKHHSFSLHNGVLPPIRSVLSTEKRGRKKRQSRQQQLQQKDDDSTVLEKSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSP
Subjt:  MSKFLLSHSCLLTLPHKHHSFSLHNGVLPPIRSVLSTEKRGRKKRQSRQQQLQQKDDDSTVLEKSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSP

Query:  GPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQAWLILIEELVKNKYLEDANKVFLKGAKG
        GPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQAWLILIEELVKNKYLEDANKVFLKGAKG
Subjt:  GPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQAWLILIEELVKNKYLEDANKVFLKGAKG

Query:  GLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQVMYLSIG
        GLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQVMYLSIG
Subjt:  GLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQVMYLSIG

XP_022930357.1 uncharacterized protein LOC111436825 isoform X1 [Cucurbita moschata] (e value: 1.9e-133; identity: 98.8%)
Query:  MSKFLLSHSCLLTLPHKHHSFSLHNGVLPPIRSVLSTEKRGRKKRQSRQQQLQQKDDDSTVLEKSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSP
        MSKFLLSHS LLTLPHKHHSFSLHNGV PPIRSVLSTEKRGRKKRQSRQQQLQQKDDDSTV EKSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSP
Subjt:  MSKFLLSHSCLLTLPHKHHSFSLHNGVLPPIRSVLSTEKRGRKKRQSRQQQLQQKDDDSTVLEKSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSP

Query:  GPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQAWLILIEELVKNKYLEDANKVFLKGAKG
        GPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQAWLILIEELVKNKYLEDANKVFLKGAKG
Subjt:  GPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQAWLILIEELVKNKYLEDANKVFLKGAKG

Query:  GLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ
        GLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ
Subjt:  GLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ

XP_023006519.1 uncharacterized protein LOC111499221 isoform X1 [Cucurbita maxima] (e value: 6.4e-134; identity: 99.2%)
Query:  MSKFLLSHSCLLTLPHKHHSFSLHNGVLPPIRSVLSTEKRGRKKRQSRQQQLQQKDDDSTVLEKSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSP
        MSKFLLSHSCLLTLPHKHHSFSLHN VLPPIRSVLSTEKRGRKKRQSRQQQLQQKD DSTVLEKSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSP
Subjt:  MSKFLLSHSCLLTLPHKHHSFSLHNGVLPPIRSVLSTEKRGRKKRQSRQQQLQQKDDDSTVLEKSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSP

Query:  GPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQAWLILIEELVKNKYLEDANKVFLKGAKG
        GPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQAWLILIEELVKNKYLEDANKVFLKGAKG
Subjt:  GPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQAWLILIEELVKNKYLEDANKVFLKGAKG

Query:  GLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ
        GLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ
Subjt:  GLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ

XP_023531019.1 uncharacterized protein LOC111793400 isoform X1 [Cucurbita pepo subsp. pepo] (e value: 1.4e-133; identity: 98.81%)
Query:  MSKFLLSHSCLLTLPHKHHSFSLHNG-VLPPIRSVLSTEKRGRKKRQSRQQQLQQKDDDSTVLEKSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
        MSKFLLSHSCLLTLPHKHHSFSLHNG +LPPIRSVLSTEKRGRKKRQSRQQQLQQKDDDSTVLEKSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
Subjt:  MSKFLLSHSCLLTLPHKHHSFSLHNG-VLPPIRSVLSTEKRGRKKRQSRQQQLQQKDDDSTVLEKSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS

Query:  PGPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQAWLILIEELVKNKYLEDANKVFLKGAK
        PGPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRP+HETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQAWLILIEELVKNKYLEDANKVFLKGAK
Subjt:  PGPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQAWLILIEELVKNKYLEDANKVFLKGAK

Query:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ
        GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ
Subjt:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ
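
The % identity values listed for these hits appear consistent with the number of identical columns divided by the alignment length. For XP_022930357.1, for instance, the middle line shows 3 non-identical columns out of 251, which gives 98.8. A minimal sketch of that calculation, assuming the middle line repeats the residue where query and subject are identical and uses `+` or a space otherwise; `percent_identity` is a hypothetical helper, not a function of BLAST or of this database:

```python
def percent_identity(query, midline):
    """Percent of alignment columns marked identical on the middle line.

    Assumptions: a column is identical when the middle line repeats the
    query residue; '+' (similar) and ' ' (different) do not count; gap
    characters ('-') in the query never count as identical.
    """
    if not query:
        raise ValueError("empty alignment")
    # zip() stops at the shorter string, so a middle line whose trailing
    # spaces were stripped still scores correctly against len(query).
    identical = sum(
        1 for q, m in zip(query, midline) if q == m and q != "-"
    )
    return 100.0 * identical / len(query)

# First ten columns of the XP_022930357.1 alignment in this record:
# the tenth column (C in the query) is blank on the middle line.
print(percent_identity("MSKFLLSHSC", "MSKFLLSHS "))  # -> 90.0
```

To score a full hit, the query and middle-line segments of all of its blocks would be concatenated before calling the function.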

TrEMBL top hits (e value, % identity, alignment)
A0A1S3B8T6 uncharacterized protein LOC103487261 isoform X1 (e value: 5.0e-116; identity: 88.54%)
Query:  MSKFLLSHSCLLTLPHKHHSFSLHNGVLPPIRSVLST-EKRGRKKRQSR-QQQLQQKDDDSTVLEKSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGL
        MSK LLSH+ LLTLP+ H SFSL++G+L PIRSVLS  +KRGRKKRQSR QQQLQ KDDDST LE SLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGL
Subjt:  MSKFLLSHSCLLTLPHKHHSFSLHNGVLPPIRSVLST-EKRGRKKRQSR-QQQLQQKDDDSTVLEKSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGL

Query:  SPGPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQAWLILIEELVKNKYLEDANKVFLKGA
        SPGPRSFHGLVVSH LN D EGAMQSLR+ELS+GLRPLHETFVALVRLFG+KGLA RGLEILAAME+LNYDIRQAWLIL EELV+NKYLEDANKVFLKGA
Subjt:  SPGPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQAWLILIEELVKNKYLEDANKVFLKGA

Query:  KGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ
        K GLRATDKIYDL+IEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ
Subjt:  KGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ

A0A1S3B9H7 uncharacterized protein LOC103487261 isoform X2 (e value: 5.0e-116; identity: 88.54%)
Query:  MSKFLLSHSCLLTLPHKHHSFSLHNGVLPPIRSVLST-EKRGRKKRQSR-QQQLQQKDDDSTVLEKSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGL
        MSK LLSH+ LLTLP+ H SFSL++G+L PIRSVLS  +KRGRKKRQSR QQQLQ KDDDST LE SLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGL
Subjt:  MSKFLLSHSCLLTLPHKHHSFSLHNGVLPPIRSVLST-EKRGRKKRQSR-QQQLQQKDDDSTVLEKSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGL

Query:  SPGPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQAWLILIEELVKNKYLEDANKVFLKGA
        SPGPRSFHGLVVSH LN D EGAMQSLR+ELS+GLRPLHETFVALVRLFG+KGLA RGLEILAAME+LNYDIRQAWLIL EELV+NKYLEDANKVFLKGA
Subjt:  SPGPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQAWLILIEELVKNKYLEDANKVFLKGA

Query:  KGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ
        K GLRATDKIYDL+IEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ
Subjt:  KGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ

A0A5A7T4U0 Pentatricopeptide repeat-containing protein (e value: 5.0e-116; identity: 88.54%)
Query:  MSKFLLSHSCLLTLPHKHHSFSLHNGVLPPIRSVLST-EKRGRKKRQSR-QQQLQQKDDDSTVLEKSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGL
        MSK LLSH+ LLTLP+ H SFSL++G+L PIRSVLS  +KRGRKKRQSR QQQLQ KDDDST LE SLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGL
Subjt:  MSKFLLSHSCLLTLPHKHHSFSLHNGVLPPIRSVLST-EKRGRKKRQSR-QQQLQQKDDDSTVLEKSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGL

Query:  SPGPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQAWLILIEELVKNKYLEDANKVFLKGA
        SPGPRSFHGLVVSH LN D EGAMQSLR+ELS+GLRPLHETFVALVRLFG+KGLA RGLEILAAME+LNYDIRQAWLIL EELV+NKYLEDANKVFLKGA
Subjt:  SPGPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQAWLILIEELVKNKYLEDANKVFLKGA

Query:  KGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ
        K GLRATDKIYDL+IEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ
Subjt:  KGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ

A0A6J1EQ88 uncharacterized protein LOC111436825 isoform X1 (e value: 9.0e-134; identity: 98.8%)
Query:  MSKFLLSHSCLLTLPHKHHSFSLHNGVLPPIRSVLSTEKRGRKKRQSRQQQLQQKDDDSTVLEKSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSP
        MSKFLLSHS LLTLPHKHHSFSLHNGV PPIRSVLSTEKRGRKKRQSRQQQLQQKDDDSTV EKSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSP
Subjt:  MSKFLLSHSCLLTLPHKHHSFSLHNGVLPPIRSVLSTEKRGRKKRQSRQQQLQQKDDDSTVLEKSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSP

Query:  GPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQAWLILIEELVKNKYLEDANKVFLKGAKG
        GPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQAWLILIEELVKNKYLEDANKVFLKGAKG
Subjt:  GPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQAWLILIEELVKNKYLEDANKVFLKGAKG

Query:  GLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ
        GLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ
Subjt:  GLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ

A0A6J1L2D9 uncharacterized protein LOC111499221 isoform X1 (e value: 3.1e-134; identity: 99.2%)
Query:  MSKFLLSHSCLLTLPHKHHSFSLHNGVLPPIRSVLSTEKRGRKKRQSRQQQLQQKDDDSTVLEKSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSP
        MSKFLLSHSCLLTLPHKHHSFSLHN VLPPIRSVLSTEKRGRKKRQSRQQQLQQKD DSTVLEKSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSP
Subjt:  MSKFLLSHSCLLTLPHKHHSFSLHNGVLPPIRSVLSTEKRGRKKRQSRQQQLQQKDDDSTVLEKSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSP

Query:  GPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQAWLILIEELVKNKYLEDANKVFLKGAKG
        GPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQAWLILIEELVKNKYLEDANKVFLKGAKG
Subjt:  GPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQAWLILIEELVKNKYLEDANKVFLKGAKG

Query:  GLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ
        GLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ
Subjt:  GLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ

SwissProt top hits (e value, % identity, alignment)
O04504 Pentatricopeptide repeat-containing protein At1g09820 (e value: 7.6e-05; identity: 22.84%)
Query:  VIYDMVAAGLSPGPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQ-AWLILIEELVKNKYL
        V+ +MV   +SP   +F+ L+     + +  G+M+  ++ L   ++P   ++ +L+    N G  +  + +   M           +  LI    KN  L
Subjt:  VIYDMVAAGLSPGPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQ-AWLILIEELVKNKYL

Query:  EDANKVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLS
        ++A  +F      G   T ++Y++LI+  CK G   +   +  EME  G +     +NCL++
Subjt:  EDANKVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLS

O64624 Pentatricopeptide repeat-containing protein At2g18940, chloroplastic (e value: 9.0e-06; identity: 28.9%)
Query:  LGVSDVIYDMVAAGLSPGPRSFHGLVVSHVLNADAEGAMQSLRKEL-----STGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQ-AWLIL
        LGV D   +M + GL      F     S VL+A A   +    KE      S G  P   T+ AL+++FG  G+ T  L +L  ME+ +       +  L
Subjt:  LGVSDVIYDMVAAGLSPGPRSFHGLVVSHVLNADAEGAMQSLRKEL-----STGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQ-AWLIL

Query:  IEELVKNKYLEDANKVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSV
        +   V+  + ++A  V     K G+      Y  +I+   KAG    AL++ Y M+ AG +  T  +N +LS+
Subjt:  IEELVKNKYLEDANKVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSV

Q0WPZ6 Pentatricopeptide repeat-containing protein At2g17140 (e value: 3.8e-04; identity: 26.17%)
Query:  VSDVIYDMVAAGLSPGPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLN-YDIRQAWLILIEELVKN
        VS +  DMV  G++P   +F+ L+ +   ++  + A +   +    G +P   TF  LVR +   GL  +GLE+L AME       +  +  ++    + 
Subjt:  VSDVIYDMVAAGLSPGPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLN-YDIRQAWLILIEELVKN

Query:  KYLEDANKVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEME
           +D+ K+  K  + GL      ++  I   CK G   +A  I  +ME
Subjt:  KYLEDANKVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEME

Q9SAK0 Pentatricopeptide repeat-containing protein At1g79490, mitochondrial (e value: 2.8e-07; identity: 25.86%)
Query:  RNHDPLGVSDVIYDMVAAGLSPGPRSF--HGLVVSHVLNAD-AEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDI-RQAW
        +  D +G+  +  +MV    S G  SF  +  V+ ++  A+  E A    +K   +G +   +T+  L+ LF NKGL  +  EI  +MEK +  +    +
Subjt:  RNHDPLGVSDVIYDMVAAGLSPGPRSF--HGLVVSHVLNAD-AEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDI-RQAW

Query:  LILIEELVKNKYLEDANKVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLL
         ++I  L K+  L+ A K+F +  +  LR +  ++  L++   KAG    ++++  EM+  G   +   F  L+
Subjt:  LILIEELVKNKYLEDANKVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLL

Arabidopsis top hits (e value, % identity, alignment)
AT1G09820.1 Pentatricopeptide repeat (PPR-like) superfamily protein (e value: 5.4e-06; identity: 22.84%)
Query:  VIYDMVAAGLSPGPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQ-AWLILIEELVKNKYL
        V+ +MV   +SP   +F+ L+     + +  G+M+  ++ L   ++P   ++ +L+    N G  +  + +   M           +  LI    KN  L
Subjt:  VIYDMVAAGLSPGPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQ-AWLILIEELVKNKYL

Query:  EDANKVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLS
        ++A  +F      G   T ++Y++LI+  CK G   +   +  EME  G +     +NCL++
Subjt:  EDANKVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLS

AT1G79490.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 2.0e-08; identity: 25.86%)
Query:  RNHDPLGVSDVIYDMVAAGLSPGPRSF--HGLVVSHVLNAD-AEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDI-RQAW
        +  D +G+  +  +MV    S G  SF  +  V+ ++  A+  E A    +K   +G +   +T+  L+ LF NKGL  +  EI  +MEK +  +    +
Subjt:  RNHDPLGVSDVIYDMVAAGLSPGPRSF--HGLVVSHVLNAD-AEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDI-RQAW

Query:  LILIEELVKNKYLEDANKVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLL
         ++I  L K+  L+ A K+F +  +  LR +  ++  L++   KAG    ++++  EM+  G   +   F  L+
Subjt:  LILIEELVKNKYLEDANKVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLL

AT2G17140.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 2.7e-05; identity: 26.17%)
Query:  VSDVIYDMVAAGLSPGPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLN-YDIRQAWLILIEELVKN
        VS +  DMV  G++P   +F+ L+ +   ++  + A +   +    G +P   TF  LVR +   GL  +GLE+L AME       +  +  ++    + 
Subjt:  VSDVIYDMVAAGLSPGPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLN-YDIRQAWLILIEELVKN

Query:  KYLEDANKVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEME
           +D+ K+  K  + GL      ++  I   CK G   +A  I  +ME
Subjt:  KYLEDANKVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEME

AT2G18940.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 6.4e-07; identity: 28.9%)
Query:  LGVSDVIYDMVAAGLSPGPRSFHGLVVSHVLNADAEGAMQSLRKEL-----STGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQ-AWLIL
        LGV D   +M + GL      F     S VL+A A   +    KE      S G  P   T+ AL+++FG  G+ T  L +L  ME+ +       +  L
Subjt:  LGVSDVIYDMVAAGLSPGPRSFHGLVVSHVLNADAEGAMQSLRKEL-----STGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQ-AWLIL

Query:  IEELVKNKYLEDANKVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSV
        +   V+  + ++A  V     K G+      Y  +I+   KAG    AL++ Y M+ AG +  T  +N +LS+
Subjt:  IEELVKNKYLEDANKVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSV

AT3G04260.1 plastid transcriptionally active 3 (e value: 1.8e-86; identity: 70.48%)
Query:  SVLSTEKRGRKKRQSRQQQLQQKDDD--------STVLEKSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSPGPRSFHGLVVSHVLNADAEGAMQS
        S+ + EK+ R++R+ ++    + DD          + LE+SLR TFM+ELM+RARN D  GVS+VIYDM+AAGLSPGPRSFHGLVV+H LN D +GAM S
Subjt:  SVLSTEKRGRKKRQSRQQQLQQKDDD--------STVLEKSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSPGPRSFHGLVVSHVLNADAEGAMQS

Query:  LRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQAWLILIEELVKNKYLEDANKVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSN
        LRKEL  G RPL ET +ALVRL G+KG ATRGLEILAAMEKL YDIRQAWLIL+EEL++  +LEDANKVFLKGA+GG+RATD++YDL+IEEDCKAGDHSN
Subjt:  LRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQAWLILIEELVKNKYLEDANKVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSN

Query:  ALEISYEMEAAGRMATTFHFNCLLSVQ
        AL+ISYEMEAAGRMATTFHFNCLLSVQ
Subjt:  ALEISYEMEAAGRMATTFHFNCLLSVQ


Sequences
CDS sequence
ATGTCCAAATTCCTGCTCTCTCACTCCTGCCTTCTCACCCTTCCCCACAAGCACCATTCCTTTTCCCTCCACAATGGCGTCCTCCCCCCCATCCGCTCAGTTCTCTCTAC
TGAGAAGCGGGGTAGAAAGAAGCGGCAGTCGCGGCAGCAACAATTGCAACAAAAGGACGATGATTCTACTGTGCTTGAGAAGTCCCTTCGCTTCACTTTCATGGAGGAAC
TCATGGACCGCGCTAGAAACCACGATCCACTTGGCGTTTCTGATGTCATTTATGATATGGTTGCCGCTGGATTGAGCCCTGGTCCTCGCTCCTTCCATGGCTTGGTTGTT
TCACATGTTCTTAATGCTGATGCTGAGGGAGCGATGCAATCTCTGAGAAAGGAACTAAGTACTGGACTTCGACCCCTTCACGAAACGTTTGTTGCATTAGTTCGGTTATT
TGGTAACAAGGGTCTTGCTACTAGAGGCTTAGAAATCCTTGCAGCCATGGAGAAATTGAACTATGACATTCGCCAAGCTTGGCTCATTCTTATTGAGGAACTCGTAAAGA
ACAAATATTTAGAAGACGCCAATAAAGTGTTCTTAAAGGGGGCCAAAGGGGGCCTCAGAGCCACGGACAAGATTTACGATCTTCTAATTGAGGAAGACTGTAAAGCTGGG
GACCATTCAAATGCCTTGGAGATTTCATATGAAATGGAGGCTGCTGGGCGGATGGCAACAACCTTCCATTTCAATTGCCTTCTCAGTGTCCAGGTGATGTATCTTAGTAT
TGGATAA
mRNA sequence
ATTATCGTCTCTCCCTCAGCACCCGCCTTCTTTCTTCTCTTCGCGGTGGCTCTGTTTCTTATCCAAATTCCAGCAGCCATCAATGTCCAAATTCCTGCTCTCTCACTCCT
GCCTTCTCACCCTTCCCCACAAGCACCATTCCTTTTCCCTCCACAATGGCGTCCTCCCCCCCATCCGCTCAGTTCTCTCTACTGAGAAGCGGGGTAGAAAGAAGCGGCAG
TCGCGGCAGCAACAATTGCAACAAAAGGACGATGATTCTACTGTGCTTGAGAAGTCCCTTCGCTTCACTTTCATGGAGGAACTCATGGACCGCGCTAGAAACCACGATCC
ACTTGGCGTTTCTGATGTCATTTATGATATGGTTGCCGCTGGATTGAGCCCTGGTCCTCGCTCCTTCCATGGCTTGGTTGTTTCACATGTTCTTAATGCTGATGCTGAGG
GAGCGATGCAATCTCTGAGAAAGGAACTAAGTACTGGACTTCGACCCCTTCACGAAACGTTTGTTGCATTAGTTCGGTTATTTGGTAACAAGGGTCTTGCTACTAGAGGC
TTAGAAATCCTTGCAGCCATGGAGAAATTGAACTATGACATTCGCCAAGCTTGGCTCATTCTTATTGAGGAACTCGTAAAGAACAAATATTTAGAAGACGCCAATAAAGT
GTTCTTAAAGGGGGCCAAAGGGGGCCTCAGAGCCACGGACAAGATTTACGATCTTCTAATTGAGGAAGACTGTAAAGCTGGGGACCATTCAAATGCCTTGGAGATTTCAT
ATGAAATGGAGGCTGCTGGGCGGATGGCAACAACCTTCCATTTCAATTGCCTTCTCAGTGTCCAGGTGATGTATCTTAGTATTGGATAA
Protein sequence
MSKFLLSHSCLLTLPHKHHSFSLHNGVLPPIRSVLSTEKRGRKKRQSRQQQLQQKDDDSTVLEKSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSPGPRSFHGLVV
SHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQAWLILIEELVKNKYLEDANKVFLKGAKGGLRATDKIYDLLIEEDCKAG
DHSNALEISYEMEAAGRMATTFHFNCLLSVQVMYLSIG
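
The CDS and protein sequences above can be cross-checked by translating the CDS. A minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1) applies to this gene; `translate` is a hypothetical helper written for illustration, and the two inputs are short excerpts copied from the start and end of the CDS listed above rather than the full sequence:

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # KeyError on non-ACGT characters
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nt of the CDS: matches the start of the protein sequence.
print(translate("ATGTCCAAATTCCTGCTCTCTCACTCCTGC"))  # -> MSKFLLSHSC
# Last 24 nt of the CDS, including the TAA stop codon.
print(translate("GTGATGTATCTTAGTATTGGATAA"))        # -> VMYLSIG
```

Joining the CDS lines into one string and passing it to `translate` would allow a comparison against the full protein sequence in the same way.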