CuGenDBv2

Spg029629 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg029629
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: SAP domain-containing protein
Genome location: scaffold2:21506897..21508366
RNA-Seq Expression: Spg029629
Synteny: Spg029629
Gene Ontology terms:
GO:0006807 - nitrogen compound metabolic process (biological process)
GO:0006979 - response to oxidative stress (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0044260 - cellular macromolecule metabolic process (biological process)
GO:0098869 - cellular oxidant detoxification (biological process)
GO:0004601 - peroxidase activity (molecular function)
GO:0005515 - protein binding (molecular function)
GO:0020037 - heme binding (molecular function)
InterPro domains: IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAA0038380.1 Pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] (e value: 1.6e-121; % identity: 90.48)
Query:  MSKFLLSHTHLLTLPHIHHSFSLNNGVVPIRSVLSAPEKRGRKKRQSR-QQQLHQKDDDSTALEKGLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
        MSK LLSH HLLTLP+ H SFSLN+G++PIRSVLSAP+KRGRKKRQSR QQQL  KDDDST+LE  LRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
Subjt:  MSKFLLSHTHLLTLPHIHHSFSLNNGVVPIRSVLSAPEKRGRKKRQSR-QQQLHQKDDDSTALEKGLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS

Query:  PGPRSFHGLVVSHSLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLASRGLEILAAMEKLNYDIRQAWLILIEELVRNKYLEDANQVFLKGAK
        PGPRSFHGLVVSH+LNGDTEGAMQSLRRELS+GLRPLHETFVALVRLFGSKGLA+RGLEILAAME+LNYDIRQAWLIL EELVRNKYLEDAN+VFLKGAK
Subjt:  PGPRSFHGLVVSHSLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLASRGLEILAAMEKLNYDIRQAWLILIEELVRNKYLEDANQVFLKGAK

Query:  GGLRATGKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ
         GLRAT KIYDL+IEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ
Subjt:  GGLRATGKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ

KAG7020856.1 putative pentatricopeptide repeat-containing protein [Cucurbita argyrosperma subsp. argyrosperma] (e value: 1.6e-124; % identity: 91.51)
Query:  MSKFLLSHTHLLTLPHIHHSFSLNNGVV-PIRSVLSAPEKRGRKKRQSRQQQLHQKDDDSTALEKGLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
        MSKFLLSH+ LLTLPH HHSFSL+NGV+ PIRSVLS  EKRGRKKRQSRQQQL QKDDDST LEK LRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
Subjt:  MSKFLLSHTHLLTLPHIHHSFSLNNGVV-PIRSVLSAPEKRGRKKRQSRQQQLHQKDDDSTALEKGLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS

Query:  PGPRSFHGLVVSHSLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLASRGLEILAAMEKLNYDIRQAWLILIEELVRNKYLEDANQVFLKGAK
        PGPRSFHGLVVSH LN D EGAMQSLR+ELS GLRPLHETFVALVRLFG+KGLA+RGLEILAAMEKLNYDIRQAWLILIEELV+NKYLEDAN+VFLKGAK
Subjt:  PGPRSFHGLVVSHSLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLASRGLEILAAMEKLNYDIRQAWLILIEELVRNKYLEDANQVFLKGAK

Query:  GGLRATGKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQVMYLTIG
        GGLRAT KIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQVMYL+IG
Subjt:  GGLRATGKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQVMYLTIG

XP_008443746.1 PREDICTED: uncharacterized protein LOC103487261 isoform X1 [Cucumis melo] (e value: 1.6e-121; % identity: 90.48)
Query:  MSKFLLSHTHLLTLPHIHHSFSLNNGVVPIRSVLSAPEKRGRKKRQSR-QQQLHQKDDDSTALEKGLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
        MSK LLSH HLLTLP+ H SFSLN+G++PIRSVLSAP+KRGRKKRQSR QQQL  KDDDST+LE  LRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
Subjt:  MSKFLLSHTHLLTLPHIHHSFSLNNGVVPIRSVLSAPEKRGRKKRQSR-QQQLHQKDDDSTALEKGLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS

Query:  PGPRSFHGLVVSHSLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLASRGLEILAAMEKLNYDIRQAWLILIEELVRNKYLEDANQVFLKGAK
        PGPRSFHGLVVSH+LNGDTEGAMQSLRRELS+GLRPLHETFVALVRLFGSKGLA+RGLEILAAME+LNYDIRQAWLIL EELVRNKYLEDAN+VFLKGAK
Subjt:  PGPRSFHGLVVSHSLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLASRGLEILAAMEKLNYDIRQAWLILIEELVRNKYLEDANQVFLKGAK

Query:  GGLRATGKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ
         GLRAT KIYDL+IEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ
Subjt:  GGLRATGKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ

XP_008443747.1 PREDICTED: uncharacterized protein LOC103487261 isoform X2 [Cucumis melo] (e value: 1.6e-121; % identity: 90.48)
Query:  MSKFLLSHTHLLTLPHIHHSFSLNNGVVPIRSVLSAPEKRGRKKRQSR-QQQLHQKDDDSTALEKGLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
        MSK LLSH HLLTLP+ H SFSLN+G++PIRSVLSAP+KRGRKKRQSR QQQL  KDDDST+LE  LRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
Subjt:  MSKFLLSHTHLLTLPHIHHSFSLNNGVVPIRSVLSAPEKRGRKKRQSR-QQQLHQKDDDSTALEKGLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS

Query:  PGPRSFHGLVVSHSLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLASRGLEILAAMEKLNYDIRQAWLILIEELVRNKYLEDANQVFLKGAK
        PGPRSFHGLVVSH+LNGDTEGAMQSLRRELS+GLRPLHETFVALVRLFGSKGLA+RGLEILAAME+LNYDIRQAWLIL EELVRNKYLEDAN+VFLKGAK
Subjt:  PGPRSFHGLVVSHSLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLASRGLEILAAMEKLNYDIRQAWLILIEELVRNKYLEDANQVFLKGAK

Query:  GGLRATGKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ
         GLRAT KIYDL+IEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ
Subjt:  GGLRATGKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ

XP_038879291.1 uncharacterized protein LOC120071230 [Benincasa hispida] (e value: 5.4e-125; % identity: 93.65)
Query:  MSKFLLSHTHLLTLPHIHHSFSLNNGVVPIRSVLSAPEKRGRKKRQSR-QQQLHQKDDDSTALEKGLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
        MSKFLLSH HLLTLP+ HHSFSLN+GVVPIRSVLSAP+KRGRKKRQ+R QQQLH KD DSTALEK LRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
Subjt:  MSKFLLSHTHLLTLPHIHHSFSLNNGVVPIRSVLSAPEKRGRKKRQSR-QQQLHQKDDDSTALEKGLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS

Query:  PGPRSFHGLVVSHSLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLASRGLEILAAMEKLNYDIRQAWLILIEELVRNKYLEDANQVFLKGAK
        PGPRSFHGLVVSH LNGDTEGAMQSLRRELSAGL PLHETFVALVRLFGSKGLA+RGLEILAAMEKLNYDIRQAWLIL EELVRNKYLEDAN+VFLKGAK
Subjt:  PGPRSFHGLVVSHSLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLASRGLEILAAMEKLNYDIRQAWLILIEELVRNKYLEDANQVFLKGAK

Query:  GGLRATGKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ
        GGLRAT KIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ
Subjt:  GGLRATGKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ
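
The % identity figure reported for each hit can be reproduced from its alignment blocks. The sketch below assumes the usual BLAST midline convention (a letter marks an identical residue, "+" a conservative substitution, a space a mismatch or gap) and assumes that identity is taken over all alignment columns, gaps included. The function name `percent_identity` and its input layout are illustrative, not part of CuGenDB.

```python
def percent_identity(blocks):
    """Return % identity for one hit.

    blocks: list of (query_line, midline) pairs copied from the record,
    e.g. ("Query:  MSKFL", "        MSK L"). Hypothetical helper.
    """
    prefix = len("Query:  ")  # the midline is indented by the same width
    identical = 0
    columns = 0
    for query_line, midline in blocks:
        query = query_line[prefix:].rstrip()
        # Trailing mismatch columns may have been stripped as whitespace,
        # so pad the midline back to the width of the query segment.
        mid = midline[prefix:].ljust(len(query))[:len(query)]
        columns += len(query)
        # Only letters count as identities; "+" and spaces do not.
        identical += sum(1 for ch in mid if ch.isalpha())
    if columns == 0:
        raise ValueError("no alignment columns supplied")
    return round(100.0 * identical / columns, 2)
```

Passing all Query/midline pairs of one hit, in order, gives a value that can be compared with the % identity listed for that hit.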

TrEMBL top hits (e value, % identity, alignment)
A0A1S3B8T6 uncharacterized protein LOC103487261 isoform X1 (e value: 7.9e-122; % identity: 90.48)
Query:  MSKFLLSHTHLLTLPHIHHSFSLNNGVVPIRSVLSAPEKRGRKKRQSR-QQQLHQKDDDSTALEKGLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
        MSK LLSH HLLTLP+ H SFSLN+G++PIRSVLSAP+KRGRKKRQSR QQQL  KDDDST+LE  LRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
Subjt:  MSKFLLSHTHLLTLPHIHHSFSLNNGVVPIRSVLSAPEKRGRKKRQSR-QQQLHQKDDDSTALEKGLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS

Query:  PGPRSFHGLVVSHSLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLASRGLEILAAMEKLNYDIRQAWLILIEELVRNKYLEDANQVFLKGAK
        PGPRSFHGLVVSH+LNGDTEGAMQSLRRELS+GLRPLHETFVALVRLFGSKGLA+RGLEILAAME+LNYDIRQAWLIL EELVRNKYLEDAN+VFLKGAK
Subjt:  PGPRSFHGLVVSHSLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLASRGLEILAAMEKLNYDIRQAWLILIEELVRNKYLEDANQVFLKGAK

Query:  GGLRATGKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ
         GLRAT KIYDL+IEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ
Subjt:  GGLRATGKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ

A0A1S3B9H7 uncharacterized protein LOC103487261 isoform X2 (e value: 7.9e-122; % identity: 90.48)
Query:  MSKFLLSHTHLLTLPHIHHSFSLNNGVVPIRSVLSAPEKRGRKKRQSR-QQQLHQKDDDSTALEKGLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
        MSK LLSH HLLTLP+ H SFSLN+G++PIRSVLSAP+KRGRKKRQSR QQQL  KDDDST+LE  LRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
Subjt:  MSKFLLSHTHLLTLPHIHHSFSLNNGVVPIRSVLSAPEKRGRKKRQSR-QQQLHQKDDDSTALEKGLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS

Query:  PGPRSFHGLVVSHSLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLASRGLEILAAMEKLNYDIRQAWLILIEELVRNKYLEDANQVFLKGAK
        PGPRSFHGLVVSH+LNGDTEGAMQSLRRELS+GLRPLHETFVALVRLFGSKGLA+RGLEILAAME+LNYDIRQAWLIL EELVRNKYLEDAN+VFLKGAK
Subjt:  PGPRSFHGLVVSHSLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLASRGLEILAAMEKLNYDIRQAWLILIEELVRNKYLEDANQVFLKGAK

Query:  GGLRATGKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ
         GLRAT KIYDL+IEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ
Subjt:  GGLRATGKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ

A0A5A7T4U0 Pentatricopeptide repeat-containing protein (e value: 7.9e-122; % identity: 90.48)
Query:  MSKFLLSHTHLLTLPHIHHSFSLNNGVVPIRSVLSAPEKRGRKKRQSR-QQQLHQKDDDSTALEKGLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
        MSK LLSH HLLTLP+ H SFSLN+G++PIRSVLSAP+KRGRKKRQSR QQQL  KDDDST+LE  LRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
Subjt:  MSKFLLSHTHLLTLPHIHHSFSLNNGVVPIRSVLSAPEKRGRKKRQSR-QQQLHQKDDDSTALEKGLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS

Query:  PGPRSFHGLVVSHSLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLASRGLEILAAMEKLNYDIRQAWLILIEELVRNKYLEDANQVFLKGAK
        PGPRSFHGLVVSH+LNGDTEGAMQSLRRELS+GLRPLHETFVALVRLFGSKGLA+RGLEILAAME+LNYDIRQAWLIL EELVRNKYLEDAN+VFLKGAK
Subjt:  PGPRSFHGLVVSHSLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLASRGLEILAAMEKLNYDIRQAWLILIEELVRNKYLEDANQVFLKGAK

Query:  GGLRATGKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ
         GLRAT KIYDL+IEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ
Subjt:  GGLRATGKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ

A0A6J1EQ88 uncharacterized protein LOC111436825 isoform X1 (e value: 3.0e-121; % identity: 91.27)
Query:  MSKFLLSHTHLLTLPHIHHSFSLNNGVV-PIRSVLSAPEKRGRKKRQSRQQQLHQKDDDSTALEKGLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
        MSKFLLSH++LLTLPH HHSFSL+NGV  PIRSVLS  EKRGRKKRQSRQQQL QKDDDST  EK LRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
Subjt:  MSKFLLSHTHLLTLPHIHHSFSLNNGVV-PIRSVLSAPEKRGRKKRQSRQQQLHQKDDDSTALEKGLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS

Query:  PGPRSFHGLVVSHSLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLASRGLEILAAMEKLNYDIRQAWLILIEELVRNKYLEDANQVFLKGAK
        PGPRSFHGLVVSH LN D EGAMQSLR+ELS GLRPLHETFVALVRLFG+KGLA+RGLEILAAMEKLNYDIRQAWLILIEELV+NKYLEDAN+VFLKGAK
Subjt:  PGPRSFHGLVVSHSLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLASRGLEILAAMEKLNYDIRQAWLILIEELVRNKYLEDANQVFLKGAK

Query:  GGLRATGKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ
        GGLRAT KIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ
Subjt:  GGLRATGKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ

A0A6J1L2D9 uncharacterized protein LOC111499221 isoform X1 (e value: 1.3e-119; % identity: 90.87)
Query:  MSKFLLSHTHLLTLPHIHHSFSLNNGVV-PIRSVLSAPEKRGRKKRQSRQQQLHQKDDDSTALEKGLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
        MSKFLLSH+ LLTLPH HHSFSL+N V+ PIRSVLS  EKRGRKKRQSRQQQL QKD DST LEK LRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
Subjt:  MSKFLLSHTHLLTLPHIHHSFSLNNGVV-PIRSVLSAPEKRGRKKRQSRQQQLHQKDDDSTALEKGLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS

Query:  PGPRSFHGLVVSHSLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLASRGLEILAAMEKLNYDIRQAWLILIEELVRNKYLEDANQVFLKGAK
        PGPRSFHGLVVSH LN D EGAMQSLR+ELS GLRPLHETFVALVRLFG+KGLA+RGLEILAAMEKLNYDIRQAWLILIEELV+NKYLEDAN+VFLKGAK
Subjt:  PGPRSFHGLVVSHSLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLASRGLEILAAMEKLNYDIRQAWLILIEELVRNKYLEDANQVFLKGAK

Query:  GGLRATGKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ
        GGLRAT KIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ
Subjt:  GGLRATGKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ

SwissProt top hits (e value, % identity, alignment)
O04504 Pentatricopeptide repeat-containing protein At1g09820 (e value: 3.8e-04; % identity: 22.22)
Query:  VIYDMVAAGLSPGPRSFHGLVVSHSLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLASRGLEILAAMEKLNYDIRQ-AWLILIEELVRNKYL
        V+ +MV   +SP   +F+ L+     + +  G+M+  +  L   ++P   ++ +L+    + G  S  + +   M           +  LI    +N  L
Subjt:  VIYDMVAAGLSPGPRSFHGLVVSHSLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLASRGLEILAAMEKLNYDIRQ-AWLILIEELVRNKYL

Query:  EDANQVFLKGAKGGLRATGKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLS
        ++A  +F      G   T ++Y++LI+  CK G   +   +  EME  G +     +NCL++
Subjt:  EDANQVFLKGAKGGLRATGKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLS

O64624 Pentatricopeptide repeat-containing protein At2g18940, chloroplastic (e value: 4.5e-05; % identity: 28.69)
Query:  SAGLRPLHETFVALVRLFGSKGLASRGLEILAAMEKLNYDIRQ-AWLILIEELVRNKYLEDANQVFLKGAKGGLRATGKIYDLLIEEDCKAGDHSNALEI
        S G  P   T+ AL+++FG  G+ +  L +L  ME+ +       +  L+   VR  + ++A  V     K G+      Y  +I+   KAG    AL++
Subjt:  SAGLRPLHETFVALVRLFGSKGLASRGLEILAAMEKLNYDIRQ-AWLILIEELVRNKYLEDANQVFLKGAKGGLRATGKIYDLLIEEDCKAGDHSNALEI

Query:  SYEMEAAGRMATTFHFNCLLSV
         Y M+ AG +  T  +N +LS+
Subjt:  SYEMEAAGRMATTFHFNCLLSV

Q0WPZ6 Pentatricopeptide repeat-containing protein At2g17140 (e value: 8.4e-04; % identity: 26.17)
Query:  VSDVIYDMVAAGLSPGPRSFHGLVVSHSLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLASRGLEILAAMEKLN-YDIRQAWLILIEELVRN
        VS +  DMV  G++P   +F+ L+ +   +   + A +        G +P   TF  LVR +   GL  +GLE+L AME       +  +  ++    R 
Subjt:  VSDVIYDMVAAGLSPGPRSFHGLVVSHSLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLASRGLEILAAMEKLN-YDIRQAWLILIEELVRN

Query:  KYLEDANQVFLKGAKGGLRATGKIYDLLIEEDCKAGDHSNALEISYEME
           +D+ ++  K  + GL      ++  I   CK G   +A  I  +ME
Subjt:  KYLEDANQVFLKGAKGGLRATGKIYDLLIEEDCKAGDHSNALEISYEME

Q9FIX3 Pentatricopeptide repeat-containing protein At5g39710 (e value: 3.8e-04; % identity: 21.11)
Query:  LEKGLR---FTFMEELMDRARNHDPLGVSDVIYDMVAAGLSPGPRSFHGLVVSHSLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLASRGLE
        +EKG++    T+   +             D+  +M+  GL P   ++  L+ ++ + GD E A+Q     +  G+ P   T+  L+     +        
Subjt:  LEKGLR---FTFMEELMDRARNHDPLGVSDVIYDMVAAGLSPGPRSFHGLVVSHSLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLASRGLE

Query:  ILAAM---EKLNYDIRQAWLI-------------LIEELVRNKYLEDANQVFLKGAKGGLRATGKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATT
        +L  +   E +  D+    LI             LI+       + +A+QVF        +  G  Y+++I   C+AGD   A  +  EM  +G +  T
Subjt:  ILAAM---EKLNYDIRQAWLI-------------LIEELVRNKYLEDANQVFLKGAKGGLRATGKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATT

Arabidopsis top hits (e value, % identity, alignment)
AT1G09820.1 Pentatricopeptide repeat (PPR-like) superfamily protein (e value: 2.7e-05; % identity: 22.22)
Query:  VIYDMVAAGLSPGPRSFHGLVVSHSLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLASRGLEILAAMEKLNYDIRQ-AWLILIEELVRNKYL
        V+ +MV   +SP   +F+ L+     + +  G+M+  +  L   ++P   ++ +L+    + G  S  + +   M           +  LI    +N  L
Subjt:  VIYDMVAAGLSPGPRSFHGLVVSHSLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLASRGLEILAAMEKLNYDIRQ-AWLILIEELVRNKYL

Query:  EDANQVFLKGAKGGLRATGKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLS
        ++A  +F      G   T ++Y++LI+  CK G   +   +  EME  G +     +NCL++
Subjt:  EDANQVFLKGAKGGLRATGKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLS

AT1G79540.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 1.3e-04; % identity: 24.27)
Query:  KRGRKKRQSRQQQLHQKDDDSTALEKGLR-FTFMEELMDRARNHDPLGVSDVIYDMVAAGLSPGPRSFHGLVVSHSLNGDTEGAMQSLRRELSAGLRPLH
        K GR        +L +KD        GLR ++ + + + RAR +      ++  +M+   + P    +  L+   S  G  E A++ L    S G+ P  
Subjt:  KRGRKKRQSRQQQLHQKDDDSTALEKGLR-FTFMEELMDRARNHDPLGVSDVIYDMVAAGLSPGPRSFHGLVVSHSLNGDTEGAMQSLRRELSAGLRPLH

Query:  ETFVALVRLFGSKGLASRGLEI-LAAMEKLNYDIRQAWLILIEELVRNKYLEDANQVFLKGAKGGLRATGKIYDLLIEEDCKAGDHSNALEISYEMEAAG
          + A+++    +GL   G  + L   E  ++       ILI  + RN  + +A ++F +  K G   +   ++ LI+  CK+G+   A  + ++ME  G
Subjt:  ETFVALVRLFGSKGLASRGLEI-LAAMEKLNYDIRQAWLILIEELVRNKYLEDANQVFLKGAKGGLRATGKIYDLLIEEDCKAGDHSNALEISYEMEAAG

Query:  RMATTF
        R A+ F
Subjt:  RMATTF

AT2G17140.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 6.0e-05; % identity: 26.17)
Query:  VSDVIYDMVAAGLSPGPRSFHGLVVSHSLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLASRGLEILAAMEKLN-YDIRQAWLILIEELVRN
        VS +  DMV  G++P   +F+ L+ +   +   + A +        G +P   TF  LVR +   GL  +GLE+L AME       +  +  ++    R 
Subjt:  VSDVIYDMVAAGLSPGPRSFHGLVVSHSLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLASRGLEILAAMEKLN-YDIRQAWLILIEELVRN

Query:  KYLEDANQVFLKGAKGGLRATGKIYDLLIEEDCKAGDHSNALEISYEME
           +D+ ++  K  + GL      ++  I   CK G   +A  I  +ME
Subjt:  KYLEDANQVFLKGAKGGLRATGKIYDLLIEEDCKAGDHSNALEISYEME

AT2G18940.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 3.2e-06; % identity: 28.69)
Query:  SAGLRPLHETFVALVRLFGSKGLASRGLEILAAMEKLNYDIRQ-AWLILIEELVRNKYLEDANQVFLKGAKGGLRATGKIYDLLIEEDCKAGDHSNALEI
        S G  P   T+ AL+++FG  G+ +  L +L  ME+ +       +  L+   VR  + ++A  V     K G+      Y  +I+   KAG    AL++
Subjt:  SAGLRPLHETFVALVRLFGSKGLASRGLEILAAMEKLNYDIRQ-AWLILIEELVRNKYLEDANQVFLKGAKGGLRATGKIYDLLIEEDCKAGDHSNALEI

Query:  SYEMEAAGRMATTFHFNCLLSV
         Y M+ AG +  T  +N +LS+
Subjt:  SYEMEAAGRMATTFHFNCLLSV

AT3G04260.1 plastid transcriptionally active 3 (e value: 2.3e-89; % identity: 70.51)
Query:  GVVPIRSVLSAPEKRGRKKRQSRQQQLHQKDDD--------STALEKGLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSPGPRSFHGLVVSHSLNGD
        G+  IR  +SAPEK+ R++R+ ++    + DD          +ALE+ LR TFM+ELM+RARN D  GVS+VIYDM+AAGLSPGPRSFHGLVV+H+LNGD
Subjt:  GVVPIRSVLSAPEKRGRKKRQSRQQQLHQKDDD--------STALEKGLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSPGPRSFHGLVVSHSLNGD

Query:  TEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLASRGLEILAAMEKLNYDIRQAWLILIEELVRNKYLEDANQVFLKGAKGGLRATGKIYDLLIEEDC
         +GAM SLR+EL AG RPL ET +ALVRL GSKG A+RGLEILAAMEKL YDIRQAWLIL+EEL+R  +LEDAN+VFLKGA+GG+RAT ++YDL+IEEDC
Subjt:  TEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLASRGLEILAAMEKLNYDIRQAWLILIEELVRNKYLEDANQVFLKGAKGGLRATGKIYDLLIEEDC

Query:  KAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ
        KAGDHSNAL+ISYEMEAAGRMATTFHFNCLLSVQ
Subjt:  KAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQ


Sequences
CDS sequence
ATGTCCAAATTCCTCCTCTCTCACACCCACCTTCTCACCCTTCCCCACATACACCATTCCTTCTCTCTCAACAATGGCGTTGTCCCCATCCGCTCAGTCCTCTCGGCTCC
AGAGAAGCGAGGCAGAAAGAAGCGCCAGTCGCGGCAGCAACAATTACACCAAAAGGACGACGATTCCACTGCGCTCGAGAAGGGCCTTCGCTTCACTTTCATGGAGGAAC
TCATGGACCGCGCCAGAAACCACGACCCCCTCGGCGTTTCTGATGTCATTTACGATATGGTTGCCGCTGGATTGAGCCCTGGACCTCGCTCGTTCCATGGCTTGGTTGTT
TCTCATTCTCTCAATGGTGATACTGAGGGAGCGATGCAATCTCTGAGAAGGGAACTAAGTGCTGGACTTCGTCCTCTTCATGAAACTTTTGTTGCATTAGTTCGGTTGTT
TGGTTCCAAGGGTCTTGCTTCTAGAGGCTTAGAAATCCTTGCAGCCATGGAGAAATTGAATTATGACATTCGCCAAGCTTGGCTCATTCTTATTGAGGAACTTGTAAGGA
ACAAATATTTAGAAGACGCCAATCAAGTGTTCTTAAAGGGTGCTAAAGGGGGCCTCAGAGCCACGGGCAAGATTTATGATCTTCTAATTGAGGAAGACTGTAAAGCCGGG
GATCATTCAAATGCCTTAGAGATCTCATATGAAATGGAGGCTGCTGGGCGGATGGCGACAACCTTTCATTTCAATTGCCTTCTTAGTGTCCAGGTGATGTATCTTACTAT
TGGATAA
mRNA sequence
ATGTCCAAATTCCTCCTCTCTCACACCCACCTTCTCACCCTTCCCCACATACACCATTCCTTCTCTCTCAACAATGGCGTTGTCCCCATCCGCTCAGTCCTCTCGGCTCC
AGAGAAGCGAGGCAGAAAGAAGCGCCAGTCGCGGCAGCAACAATTACACCAAAAGGACGACGATTCCACTGCGCTCGAGAAGGGCCTTCGCTTCACTTTCATGGAGGAAC
TCATGGACCGCGCCAGAAACCACGACCCCCTCGGCGTTTCTGATGTCATTTACGATATGGTTGCCGCTGGATTGAGCCCTGGACCTCGCTCGTTCCATGGCTTGGTTGTT
TCTCATTCTCTCAATGGTGATACTGAGGGAGCGATGCAATCTCTGAGAAGGGAACTAAGTGCTGGACTTCGTCCTCTTCATGAAACTTTTGTTGCATTAGTTCGGTTGTT
TGGTTCCAAGGGTCTTGCTTCTAGAGGCTTAGAAATCCTTGCAGCCATGGAGAAATTGAATTATGACATTCGCCAAGCTTGGCTCATTCTTATTGAGGAACTTGTAAGGA
ACAAATATTTAGAAGACGCCAATCAAGTGTTCTTAAAGGGTGCTAAAGGGGGCCTCAGAGCCACGGGCAAGATTTATGATCTTCTAATTGAGGAAGACTGTAAAGCCGGG
GATCATTCAAATGCCTTAGAGATCTCATATGAAATGGAGGCTGCTGGGCGGATGGCGACAACCTTTCATTTCAATTGCCTTCTTAGTGTCCAGGTGATGTATCTTACTAT
TGGATAA
Protein sequence
MSKFLLSHTHLLTLPHIHHSFSLNNGVVPIRSVLSAPEKRGRKKRQSRQQQLHQKDDDSTALEKGLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSPGPRSFHGLVV
SHSLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLASRGLEILAAMEKLNYDIRQAWLILIEELVRNKYLEDANQVFLKGAKGGLRATGKIYDLLIEEDCKAG
DHSNALEISYEMEAAGRMATTFHFNCLLSVQVMYLTIG
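
The protein sequence can be checked against the CDS by translating it codon by codon. The following is a minimal sketch that assumes the standard nuclear genetic code (NCBI translation table 1). The function name `translate` is illustrative. Because the CDS is wrapped at a line width that is not a multiple of three, the lines have to be joined before the sequence is split into codons.

```python
# Standard genetic code (NCBI table 1), bases in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def translate(cds):
    """Translate a CDS string (line breaks allowed) up to the first stop codon."""
    seq = "".join(cds.split()).upper()  # join wrapped lines, drop whitespace
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE.get(seq[i:i + 3], "X")  # "X" for ambiguous codons
        if residue == "*":  # stop codon ends the translation
            break
        protein.append(residue)
    return "".join(protein)
```

Calling `translate` on the CDS lines above, pasted as one multi-line string, and comparing the result with the joined protein lines indicates whether the two sequences agree.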