CuGenDBv2

Lag0010521 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0010521
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Pentatricopeptide repeat-containing protein, putative
Genome location: chr1:459365..460284
RNA-Seq Expression: Lag0010521
Synteny: Lag0010521
Gene Ontology terms:
GO:0016021 - integral component of membrane (cellular component)
GO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAG6583948.1 Pentatricopeptide repeat-containing protein DOT4, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia] | e value: 1.4e-29 | % identity: 80.9
Query:  SSQNSNLDLDNYCSILQLCAEQKSIRDGRRVHSITESNGVLIDGILGAKLVFMYVKCGDLKEGRMIFDKLSEKKVSFISILYA-YSHFG
        SSQNSNLDLD YC ILQLCAEQKSIRDGRRVHSI ESN V+IDGILGAKLVFMYVKCGDL+EGRMIFDKLSEKKV   +++ + YS  G
Subjt:  SSQNSNLDLDNYCSILQLCAEQKSIRDGRRVHSITESNGVLIDGILGAKLVFMYVKCGDLKEGRMIFDKLSEKKVSFISILYA-YSHFG

KAG7019566.1 Pentatricopeptide repeat-containing protein DOT4, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma] | e value: 1.9e-29 | % identity: 79.78
Query:  SSQNSNLDLDNYCSILQLCAEQKSIRDGRRVHSITESNGVLIDGILGAKLVFMYVKCGDLKEGRMIFDKLSEKKVSFISILYA-YSHFG
        SSQNSNLDLD YC ILQLCAEQKSIRDGRRVHSI ESN V+IDGILGAKL+FMYVKCGDL+EGRMIFDKLSEKKV   +++ + YS  G
Subjt:  SSQNSNLDLDNYCSILQLCAEQKSIRDGRRVHSITESNGVLIDGILGAKLVFMYVKCGDLKEGRMIFDKLSEKKVSFISILYA-YSHFG

XP_022927496.1 pentatricopeptide repeat-containing protein DOT4, chloroplastic [Cucurbita moschata] | e value: 1.4e-29 | % identity: 80.9
Query:  SSQNSNLDLDNYCSILQLCAEQKSIRDGRRVHSITESNGVLIDGILGAKLVFMYVKCGDLKEGRMIFDKLSEKKVSFISILYA-YSHFG
        SSQNSNLDLD YC ILQLCAEQKSIRDGRRVHSI ESN V+IDGILGAKLVFMYVKCGDL+EGRMIFDKLSEKKV   +++ + YS  G
Subjt:  SSQNSNLDLDNYCSILQLCAEQKSIRDGRRVHSITESNGVLIDGILGAKLVFMYVKCGDLKEGRMIFDKLSEKKVSFISILYA-YSHFG

XP_023000778.1 pentatricopeptide repeat-containing protein DOT4, chloroplastic [Cucurbita maxima] | e value: 1.4e-29 | % identity: 80.9
Query:  SSQNSNLDLDNYCSILQLCAEQKSIRDGRRVHSITESNGVLIDGILGAKLVFMYVKCGDLKEGRMIFDKLSEKKVSFISILYA-YSHFG
        SSQNSNLDLD YC ILQLCAEQKSIRDGRRVHSI ESN V+IDGILGAKLVFMYVKCGDL+EGRMIFDKLSEKKV   +++ + YS  G
Subjt:  SSQNSNLDLDNYCSILQLCAEQKSIRDGRRVHSITESNGVLIDGILGAKLVFMYVKCGDLKEGRMIFDKLSEKKVSFISILYA-YSHFG

XP_023519235.1 pentatricopeptide repeat-containing protein DOT4, chloroplastic [Cucurbita pepo subsp. pepo] | e value: 1.9e-29 | % identity: 92
Query:  SSQNSNLDLDNYCSILQLCAEQKSIRDGRRVHSITESNGVLIDGILGAKLVFMYVKCGDLKEGRMIFDKLSEKKV
        SSQNSNLDLD YC ILQLCAEQKSIRDGRRVHSI ESN V+IDGILGAKLVFMYVKCGDL+EGRMIFDKLSEKKV
Subjt:  SSQNSNLDLDNYCSILQLCAEQKSIRDGRRVHSITESNGVLIDGILGAKLVFMYVKCGDLKEGRMIFDKLSEKKV

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LV30 DYW_deaminase domain-containing protein | e value: 3.2e-27 | % identity: 74.16
Query:  SSQNSNLDLDNYCSILQLCAEQKSIRDGRRVHSITESNGVLIDGILGAKLVFMYVKCGDLKEGRMIFDKLSEKKVSFISILYA-YSHFG
        SSQNSN DL  YCSILQLCAE+KSIRDGRRV SI ES+GV+IDGILG KLVFMYVKCGDLKEGRM+FDKLSE K+   +++ + YS  G
Subjt:  SSQNSNLDLDNYCSILQLCAEQKSIRDGRRVHSITESNGVLIDGILGAKLVFMYVKCGDLKEGRMIFDKLSEKKVSFISILYA-YSHFG

A0A1S3B857 pentatricopeptide repeat-containing protein DOT4, chloroplastic | e value: 1.3e-28 | % identity: 88
Query:  SSQNSNLDLDNYCSILQLCAEQKSIRDGRRVHSITESNGVLIDGILGAKLVFMYVKCGDLKEGRMIFDKLSEKKV
        SSQNSN DLD +CSILQLCAE+KSIRDGRRVHSI ES+GV+IDGILG KLVFMYVKCGDLKEGRMIFDKLSE KV
Subjt:  SSQNSNLDLDNYCSILQLCAEQKSIRDGRRVHSITESNGVLIDGILGAKLVFMYVKCGDLKEGRMIFDKLSEKKV

A0A5A7UPC9 Pentatricopeptide repeat-containing protein DOT4 | e value: 1.3e-28 | % identity: 88
Query:  SSQNSNLDLDNYCSILQLCAEQKSIRDGRRVHSITESNGVLIDGILGAKLVFMYVKCGDLKEGRMIFDKLSEKKV
        SSQNSN DLD +CSILQLCAE+KSIRDGRRVHSI ES+GV+IDGILG KLVFMYVKCGDLKEGRMIFDKLSE KV
Subjt:  SSQNSNLDLDNYCSILQLCAEQKSIRDGRRVHSITESNGVLIDGILGAKLVFMYVKCGDLKEGRMIFDKLSEKKV

A0A6J1EHU9 pentatricopeptide repeat-containing protein DOT4, chloroplastic | e value: 7.0e-30 | % identity: 80.9
Query:  SSQNSNLDLDNYCSILQLCAEQKSIRDGRRVHSITESNGVLIDGILGAKLVFMYVKCGDLKEGRMIFDKLSEKKVSFISILYA-YSHFG
        SSQNSNLDLD YC ILQLCAEQKSIRDGRRVHSI ESN V+IDGILGAKLVFMYVKCGDL+EGRMIFDKLSEKKV   +++ + YS  G
Subjt:  SSQNSNLDLDNYCSILQLCAEQKSIRDGRRVHSITESNGVLIDGILGAKLVFMYVKCGDLKEGRMIFDKLSEKKVSFISILYA-YSHFG

A0A6J1KNK7 pentatricopeptide repeat-containing protein DOT4, chloroplastic | e value: 7.0e-30 | % identity: 80.9
Query:  SSQNSNLDLDNYCSILQLCAEQKSIRDGRRVHSITESNGVLIDGILGAKLVFMYVKCGDLKEGRMIFDKLSEKKVSFISILYA-YSHFG
        SSQNSNLDLD YC ILQLCAEQKSIRDGRRVHSI ESN V+IDGILGAKLVFMYVKCGDL+EGRMIFDKLSEKKV   +++ + YS  G
Subjt:  SSQNSNLDLDNYCSILQLCAEQKSIRDGRRVHSITESNGVLIDGILGAKLVFMYVKCGDLKEGRMIFDKLSEKKVSFISILYA-YSHFG

SwissProt top hits (e value, % identity, alignment)
O23169 Pentatricopeptide repeat-containing protein At4g37170 | e value: 1.9e-08 | % identity: 29.27
Query:  YCSILQLCAEQKSIRDGRRVHSITESNGVLIDGILGAKLVFMYVKCGDLKEGRMIFDKLSEKKV-SFISILYAYSHFGLLDE
        YC+++Q+C++ +++ +G++VH    ++G +   ++  +L+ MY KCG L + R +FD++  + + S+  ++  Y+  GLL+E
Subjt:  YCSILQLCAEQKSIRDGRRVHSITESNGVLIDGILGAKLVFMYVKCGDLKEGRMIFDKLSEKKV-SFISILYAYSHFGLLDE

P93011 Pentatricopeptide repeat-containing protein At2g33760 | e value: 5.0e-09 | % identity: 29.84
Query:  FSFSSSTSSVSRRQPPLIFINLFLHVAHPSTSSQNSNLDLDNYCSILQLCAEQKSIRDGRRVHSITESNGVLIDGILGAKLVFMYVKCGDLKEGRMIFDK
        F F+S   S S+ + PL  +  +  +   + S  N       + S+++ CA+  ++R G+ VH     +G  +D  + A LV  Y KCGD++  R +FD+
Subjt:  FSFSSSTSSVSRRQPPLIFINLFLHVAHPSTSSQNSNLDLDNYCSILQLCAEQKSIRDGRRVHSITESNGVLIDGILGAKLVFMYVKCGDLKEGRMIFDK

Query:  LSEKK-VSFISILYAYSHFGLLDE
        + EK  V++ S++  +   GL DE
Subjt:  LSEKK-VSFISILYAYSHFGLLDE

Q0WNP3 Pentatricopeptide repeat-containing protein At4g18520, chloroplastic | e value: 2.6e-10 | % identity: 38.27
Query:  CSILQLCAEQKSIRDGRRVHSITESNGVLIDGILGAKLVFMYVKCGDLKEGRMIFDKLSEKK-VSFISILYAYSHFGLLDE
        CSIL+ C+E+K++R GR+VHS+     +  D  +G  L+ MY KCG++ + R +FD +S +  V++ SI+ A++  G  +E
Subjt:  CSILQLCAEQKSIRDGRRVHSITESNGVLIDGILGAKLVFMYVKCGDLKEGRMIFDKLSEKK-VSFISILYAYSHFGLLDE

Q9CA56 Pentatricopeptide repeat-containing protein At1g74600, chloroplastic | e value: 1.1e-08 | % identity: 34.91
Query:  INLFLHVAHPSTSSQNSNLDLDNYCSILQLCAEQKSIRDGRRVHSITESNGVLIDGILGAKLVFMYVKCGDLKEGRMIFDKLSE-KKVSFISILYAYSHF
        I LF  +    TS   S L      ++L +C+   S+  G+ +H  T   G+     LG+ LV MY KCG LK  R ++D+L E   VS  S++  YS  
Subjt:  INLFLHVAHPSTSSQNSNLDLDNYCSILQLCAEQKSIRDGRRVHSITESNGVLIDGILGAKLVFMYVKCGDLKEGRMIFDKLSE-KKVSFISILYAYSHF

Query:  GLLDEG
        GL+ +G
Subjt:  GLLDEG

Q9SN39 Pentatricopeptide repeat-containing protein DOT4, chloroplastic | e value: 1.1e-13 | % identity: 46.05
Query:  NLDLDNYCSILQLCAEQKSIRDGRRVHSITESNGVLIDGILGAKLVFMYVKCGDLKEGRMIFDKLSEKKVSFISIL
        ++D    CS+LQLCA+ KS++DG+ V +    NG +ID  LG+KL  MY  CGDLKE   +FD++  +K  F +IL
Subjt:  NLDLDNYCSILQLCAEQKSIRDGRRVHSITESNGVLIDGILGAKLVFMYVKCGDLKEGRMIFDKLSEKKVSFISIL

Arabidopsis top hits (e value, % identity, alignment)
AT1G74600.1 pentatricopeptide (PPR) repeat-containing protein | e value: 7.9e-10 | % identity: 34.91
Query:  INLFLHVAHPSTSSQNSNLDLDNYCSILQLCAEQKSIRDGRRVHSITESNGVLIDGILGAKLVFMYVKCGDLKEGRMIFDKLSE-KKVSFISILYAYSHF
        I LF  +    TS   S L      ++L +C+   S+  G+ +H  T   G+     LG+ LV MY KCG LK  R ++D+L E   VS  S++  YS  
Subjt:  INLFLHVAHPSTSSQNSNLDLDNYCSILQLCAEQKSIRDGRRVHSITESNGVLIDGILGAKLVFMYVKCGDLKEGRMIFDKLSE-KKVSFISILYAYSHF

Query:  GLLDEG
        GL+ +G
Subjt:  GLLDEG

AT2G33760.1 Pentatricopeptide repeat (PPR) superfamily protein | e value: 3.5e-10 | % identity: 29.84
Query:  FSFSSSTSSVSRRQPPLIFINLFLHVAHPSTSSQNSNLDLDNYCSILQLCAEQKSIRDGRRVHSITESNGVLIDGILGAKLVFMYVKCGDLKEGRMIFDK
        F F+S   S S+ + PL  +  +  +   + S  N       + S+++ CA+  ++R G+ VH     +G  +D  + A LV  Y KCGD++  R +FD+
Subjt:  FSFSSSTSSVSRRQPPLIFINLFLHVAHPSTSSQNSNLDLDNYCSILQLCAEQKSIRDGRRVHSITESNGVLIDGILGAKLVFMYVKCGDLKEGRMIFDK

Query:  LSEKK-VSFISILYAYSHFGLLDE
        + EK  V++ S++  +   GL DE
Subjt:  LSEKK-VSFISILYAYSHFGLLDE

AT4G18520.1 Pentatricopeptide repeat (PPR) superfamily protein | e value: 1.9e-11 | % identity: 38.27
Query:  CSILQLCAEQKSIRDGRRVHSITESNGVLIDGILGAKLVFMYVKCGDLKEGRMIFDKLSEKK-VSFISILYAYSHFGLLDE
        CSIL+ C+E+K++R GR+VHS+     +  D  +G  L+ MY KCG++ + R +FD +S +  V++ SI+ A++  G  +E
Subjt:  CSILQLCAEQKSIRDGRRVHSITESNGVLIDGILGAKLVFMYVKCGDLKEGRMIFDKLSEKK-VSFISILYAYSHFGLLDE

AT4G18750.1 Pentatricopeptide repeat (PPR) superfamily protein | e value: 8.2e-15 | % identity: 46.05
Query:  NLDLDNYCSILQLCAEQKSIRDGRRVHSITESNGVLIDGILGAKLVFMYVKCGDLKEGRMIFDKLSEKKVSFISIL
        ++D    CS+LQLCA+ KS++DG+ V +    NG +ID  LG+KL  MY  CGDLKE   +FD++  +K  F +IL
Subjt:  NLDLDNYCSILQLCAEQKSIRDGRRVHSITESNGVLIDGILGAKLVFMYVKCGDLKEGRMIFDKLSEKKVSFISIL

AT4G37170.1 Pentatricopeptide repeat (PPR) superfamily protein | e value: 1.3e-09 | % identity: 29.27
Query:  YCSILQLCAEQKSIRDGRRVHSITESNGVLIDGILGAKLVFMYVKCGDLKEGRMIFDKLSEKKV-SFISILYAYSHFGLLDE
        YC+++Q+C++ +++ +G++VH    ++G +   ++  +L+ MY KCG L + R +FD++  + + S+  ++  Y+  GLL+E
Subjt:  YCSILQLCAEQKSIRDGRRVHSITESNGVLIDGILGAKLVFMYVKCGDLKEGRMIFDKLSEKKV-SFISILYAYSHFGLLDE
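
The % identity figures listed for the hits above appear to follow the usual BLAST convention: in each alignment, the middle row repeats the residue where query and subject are identical, shows `+` for a conservative substitution, and is blank for a mismatch or gap. The sketch below is a minimal illustration under that assumption; the function name `percent_identity` is hypothetical and not part of CuGenDB. It uses the query and middle rows of the XP_023519235.1 (Cucurbita pepo) alignment, copied from the listing above.

```python
def percent_identity(query_row: str, midline: str) -> float:
    """Percent identity from a BLAST-style alignment.

    query_row: aligned query residues (gaps included as '-').
    midline:   middle row of the alignment; a letter marks an identical
               position, '+' a conservative substitution, ' ' a mismatch/gap.
    Assumes both rows cover the same alignment columns.
    """
    aligned_length = len(query_row)
    # Pad the midline in case trailing blanks were trimmed from the text.
    midline = midline.ljust(aligned_length)
    identical = sum(1 for ch in midline[:aligned_length] if ch.isalpha())
    return round(100 * identical / aligned_length, 2)


# Rows copied from the XP_023519235.1 alignment (prefix labels removed).
QUERY = "SSQNSNLDLDNYCSILQLCAEQKSIRDGRRVHSITESNGVLIDGILGAKLVFMYVKCGDLKEGRMIFDKLSEKKV"
MIDLINE = "SSQNSNLDLD YC ILQLCAEQKSIRDGRRVHSI ESN V+IDGILGAKLVFMYVKCGDL+EGRMIFDKLSEKKV"

identity = percent_identity(QUERY, MIDLINE)
print(identity)
```

The middle row is used rather than a direct query-versus-subject comparison because the `Subjt:` rows in this listing repeat the query text; the calculation also depends on the middle row's spacing being preserved exactly.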


Sequences
CDS sequence
ATGAGAACCGGCCGGTTCGGGCACACAACTTACCGCAAGATCCCTAGCGCCGCCTGCCTCCATTGTCTTCTTCATTTTCACGCAAGATGCCATCACGATTCACGCCAATG
CCTCGATTTTTCTTCTTCTTCGTTTTCGCGGCAGCCTCCATTTTTCTTCTTCTTCGTTTTCGCGCCGCTTGTCTCATTGCCTCTCGGTGTTCTTCGTTTTTCATTTTCTT
CTTCTACATCTTCAGTTTCACGCCGCCAGCCGCCTCTCATTTTTATTAATCTTTTTCTTCACGTCGCCCACCCCTCGACAAGCTCTCAAAATTCCAACCTTGACTTGGAT
AATTATTGCTCCATCTTGCAGCTATGTGCTGAACAAAAATCGATACGAGATGGAAGAAGGGTTCATTCTATAACTGAGTCCAATGGGGTTCTGATAGATGGAATCTTGGG
GGCGAAACTAGTTTTTATGTATGTAAAATGTGGGGATTTAAAAGAAGGGAGGATGATTTTTGATAAACTATCTGAAAAGAAGGTATCCTTCATTTCAATTCTTTATGCCT
ATAGCCATTTTGGATTGCTTGATGAAGGATGA
mRNA sequence
ATGAGAACCGGCCGGTTCGGGCACACAACTTACCGCAAGATCCCTAGCGCCGCCTGCCTCCATTGTCTTCTTCATTTTCACGCAAGATGCCATCACGATTCACGCCAATG
CCTCGATTTTTCTTCTTCTTCGTTTTCGCGGCAGCCTCCATTTTTCTTCTTCTTCGTTTTCGCGCCGCTTGTCTCATTGCCTCTCGGTGTTCTTCGTTTTTCATTTTCTT
CTTCTACATCTTCAGTTTCACGCCGCCAGCCGCCTCTCATTTTTATTAATCTTTTTCTTCACGTCGCCCACCCCTCGACAAGCTCTCAAAATTCCAACCTTGACTTGGAT
AATTATTGCTCCATCTTGCAGCTATGTGCTGAACAAAAATCGATACGAGATGGAAGAAGGGTTCATTCTATAACTGAGTCCAATGGGGTTCTGATAGATGGAATCTTGGG
GGCGAAACTAGTTTTTATGTATGTAAAATGTGGGGATTTAAAAGAAGGGAGGATGATTTTTGATAAACTATCTGAAAAGAAGGTATCCTTCATTTCAATTCTTTATGCCT
ATAGCCATTTTGGATTGCTTGATGAAGGATGA
Protein sequence
MRTGRFGHTTYRKIPSAACLHCLLHFHARCHHDSRQCLDFSSSSFSRQPPFFFFFVFAPLVSLPLGVLRFSFSSSTSSVSRRQPPLIFINLFLHVAHPSTSSQNSNLDLD
NYCSILQLCAEQKSIRDGRRVHSITESNGVLIDGILGAKLVFMYVKCGDLKEGRMIFDKLSEKKVSFISILYAYSHFGLLDEG
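
The protein sequence can be checked against the CDS by translating it codon by codon. The following is a minimal sketch, assuming the standard nuclear genetic code; the `translate` helper and the constant names are hypothetical. Only the first 20 codons and the final three codons of the CDS above are used here, to keep the example short.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering
# (third base varies fastest); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 nucleotides and last 9 nucleotides of the CDS listed above.
CDS_START = "ATGAGAACCGGCCGGTTCGGGCACACAACTTACCGCAAGATCCCTAGCGCCGCCTGCCTC"
CDS_END = "GAAGGATGA"

print(translate(CDS_START))  # compare with the start of the protein sequence
print(translate(CDS_END))    # compare with the end of the protein sequence
```

Passing the full CDS (with line breaks removed) to `translate` should give the complete protein for comparison with the listed sequence.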