; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0024112 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0024112
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationchr10:562664..563029
RNA-Seq ExpressionLag0024112
SyntenyLag0024112
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137435.1 pentatricopeptide repeat-containing protein At1g74600, chloroplastic [Momordica charantia]6.2e-4789.19Show/hide
Query:  MFGKQVCSLAVRNGFFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGENLMALDLFNRMCSGFLEPNSLTFSSVLTACAALE
        MFGKQ+ SL VRNG  FVNGYVRAGMIDLFAKDSSF DALRVF+DVDCENVVCWNAIVSAAVRNGEN +ALDLFN MCSGFLEPNS TFSSVLTACAA+E
Subjt:  MFGKQVCSLAVRNGFFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGENLMALDLFNRMCSGFLEPNSLTFSSVLTACAALE

Query:  DLEFGKRVQGR
        DLEFGKRVQGR
Subjt:  DLEFGKRVQGR

XP_022923751.1 pentatricopeptide repeat-containing protein At1g74600, chloroplastic [Cucurbita moschata]1.4e-4688.29Show/hide
Query:  MFGKQVCSLAVRNGFFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGENLMALDLFNRMCSGFLEPNSLTFSSVLTACAALE
        MFGKQV SLAVRNG FFVNGYVRAGMIDLFAKDSSFLDALRVFHD+ CENVVCWNAIVSAAVRNGEN MALDL+N MC G LEPNS TFSSVLTACAALE
Subjt:  MFGKQVCSLAVRNGFFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGENLMALDLFNRMCSGFLEPNSLTFSSVLTACAALE

Query:  DLEFGKRVQGR
          EFGKRVQG+
Subjt:  DLEFGKRVQGR

XP_023001341.1 pentatricopeptide repeat-containing protein At1g74600, chloroplastic [Cucurbita maxima]8.1e-4789.19Show/hide
Query:  MFGKQVCSLAVRNGFFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGENLMALDLFNRMCSGFLEPNSLTFSSVLTACAALE
        MFGKQV SLAVRNG FFVNGYVRAGMIDLFAK+SSFLDALRVF DVDCENVVCWNAIVSAAVRNGEN MALDL+N MC GFLEPNS TFSSVLTACAALE
Subjt:  MFGKQVCSLAVRNGFFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGENLMALDLFNRMCSGFLEPNSLTFSSVLTACAALE

Query:  DLEFGKRVQGR
          EFGKRVQG+
Subjt:  DLEFGKRVQGR

XP_023519257.1 pentatricopeptide repeat-containing protein At1g74600, chloroplastic [Cucurbita pepo subsp. pepo]3.3e-4890.99Show/hide
Query:  MFGKQVCSLAVRNGFFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGENLMALDLFNRMCSGFLEPNSLTFSSVLTACAALE
        MFGKQV SLAVRNG FFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGEN MALDL+N MC GFLEPNS TFSSVLTACAALE
Subjt:  MFGKQVCSLAVRNGFFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGENLMALDLFNRMCSGFLEPNSLTFSSVLTACAALE

Query:  DLEFGKRVQGR
          EFGKRVQG+
Subjt:  DLEFGKRVQGR

XP_038893557.1 pentatricopeptide repeat-containing protein At1g74600, chloroplastic [Benincasa hispida]1.4e-5196.4Show/hide
Query:  MFGKQVCSLAVRNGFFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGENLMALDLFNRMCSGFLEPNSLTFSSVLTACAALE
        MFGKQV SLAVRNG FFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGENLMALDLFN MCSGFLEPNS TFSSVLTACAALE
Subjt:  MFGKQVCSLAVRNGFFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGENLMALDLFNRMCSGFLEPNSLTFSSVLTACAALE

Query:  DLEFGKRVQGR
        DLEFGKRVQGR
Subjt:  DLEFGKRVQGR

TrEMBL top hitse value%identityAlignment
A0A5A7T3B5 Pentatricopeptide repeat-containing protein3.7e-4587.39Show/hide
Query:  MFGKQVCSLAVRNGFFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGENLMALDLFNRMCSGFLEPNSLTFSSVLTACAALE
        MFGKQV SLAVRNG FF NGYVR  MIDLFAKDS FLDALRVFHDVDC NVVCWNAIVSAAV NGE LMALDLFNRMCS FLEPNS TFSSVLTAC+AL+
Subjt:  MFGKQVCSLAVRNGFFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGENLMALDLFNRMCSGFLEPNSLTFSSVLTACAALE

Query:  DLEFGKRVQGR
        DLEFGK VQGR
Subjt:  DLEFGKRVQGR

A0A5D3BIJ5 Pentatricopeptide repeat-containing protein3.7e-4587.39Show/hide
Query:  MFGKQVCSLAVRNGFFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGENLMALDLFNRMCSGFLEPNSLTFSSVLTACAALE
        MFGKQV SLAVRNG FF NGYVR  MIDLFAKDS FLDALRVFHDVDC NVVCWNAIVSAAV NGE LMALDLFNRMCS FLEPNS TFSSVLTAC+AL+
Subjt:  MFGKQVCSLAVRNGFFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGENLMALDLFNRMCSGFLEPNSLTFSSVLTACAALE

Query:  DLEFGKRVQGR
        DLEFGK VQGR
Subjt:  DLEFGKRVQGR

A0A6J1C6M8 pentatricopeptide repeat-containing protein At1g74600, chloroplastic3.0e-4789.19Show/hide
Query:  MFGKQVCSLAVRNGFFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGENLMALDLFNRMCSGFLEPNSLTFSSVLTACAALE
        MFGKQ+ SL VRNG  FVNGYVRAGMIDLFAKDSSF DALRVF+DVDCENVVCWNAIVSAAVRNGEN +ALDLFN MCSGFLEPNS TFSSVLTACAA+E
Subjt:  MFGKQVCSLAVRNGFFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGENLMALDLFNRMCSGFLEPNSLTFSSVLTACAALE

Query:  DLEFGKRVQGR
        DLEFGKRVQGR
Subjt:  DLEFGKRVQGR

A0A6J1E7L2 pentatricopeptide repeat-containing protein At1g74600, chloroplastic6.7e-4788.29Show/hide
Query:  MFGKQVCSLAVRNGFFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGENLMALDLFNRMCSGFLEPNSLTFSSVLTACAALE
        MFGKQV SLAVRNG FFVNGYVRAGMIDLFAKDSSFLDALRVFHD+ CENVVCWNAIVSAAVRNGEN MALDL+N MC G LEPNS TFSSVLTACAALE
Subjt:  MFGKQVCSLAVRNGFFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGENLMALDLFNRMCSGFLEPNSLTFSSVLTACAALE

Query:  DLEFGKRVQGR
          EFGKRVQG+
Subjt:  DLEFGKRVQGR

A0A6J1KIC5 pentatricopeptide repeat-containing protein At1g74600, chloroplastic3.9e-4789.19Show/hide
Query:  MFGKQVCSLAVRNGFFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGENLMALDLFNRMCSGFLEPNSLTFSSVLTACAALE
        MFGKQV SLAVRNG FFVNGYVRAGMIDLFAK+SSFLDALRVF DVDCENVVCWNAIVSAAVRNGEN MALDL+N MC GFLEPNS TFSSVLTACAALE
Subjt:  MFGKQVCSLAVRNGFFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGENLMALDLFNRMCSGFLEPNSLTFSSVLTACAALE

Query:  DLEFGKRVQGR
          EFGKRVQG+
Subjt:  DLEFGKRVQGR

SwissProt top hitse value%identityAlignment
Q9CA56 Pentatricopeptide repeat-containing protein At1g74600, chloroplastic1.9e-2245.08Show/hide
Query:  MFGKQVCSLAVRNGFFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGENLMALDLFNRMCSGFLEPNSLTFSSVLTACAALE
        +F + VC   ++ G+FF    V + +ID+F+K+  F DA +VF D    NV CWN I++ A+RN       DLF+ MC GF +P+S T+SSVL ACA+LE
Subjt:  MFGKQVCSLAVRNGFFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGENLMALDLFNRMCSGFLEPNSLTFSSVLTACAALE

Query:  DLEFGKRVQGRGLNV-AEKMFL
         L FGK VQ R +   AE +F+
Subjt:  DLEFGKRVQGRGLNV-AEKMFL

Q9LJI9 Pentatricopeptide repeat-containing protein At3g286601.1e-1132.69Show/hide
Query:  GKQVCSLAVRNGFFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGENLMALDLFNRMCSGFLEPNSLTFSSVLTACAALEDL
        GKQ+    V+NG F  +G+V+ G++ ++ +D    DA +VF ++   +VV W+ +++  VR G     L++F  M    +EP+  + ++ LTACA +  L
Subjt:  GKQVCSLAVRNGFFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGENLMALDLFNRMCSGFLEPNSLTFSSVLTACAALEDL

Query:  EFGK
          GK
Subjt:  EFGK

Q9LJJ1 Putative pentatricopeptide repeat-containing protein At3g286401.1e-1133.65Show/hide
Query:  GKQVCSLAVRNGFFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGENLMALDLFNRMCSGFLEPNSLTFSSVLTACAALEDL
        GKQ+    V+NG F  + +V+ G++ ++ +D   LDA +VF ++   +VV W+ +++  VR G     L++F  M    LEP+  + ++ LTACA +  L
Subjt:  GKQVCSLAVRNGFFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGENLMALDLFNRMCSGFLEPNSLTFSSVLTACAALEDL

Query:  EFGK
          GK
Subjt:  EFGK

Q9S7F4 Putative pentatricopeptide repeat-containing protein At2g015103.9e-1235.24Show/hide
Query:  MFGKQVCSLAVRNGFFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGENLMALDLFNRMCSGFLEPNSLTFSSVLTACAALE
        + GKQ+ +  +R+G    N +  +G++D++AK  S  DA++VF ++   N V WNA++SA   NG+   A+  F +M    L+P+S++   VLTAC+   
Subjt:  MFGKQVCSLAVRNGFFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGENLMALDLFNRMCSGFLEPNSLTFSSVLTACAALE

Query:  DLEFG
         +E G
Subjt:  DLEFG

Q9SIT7 Pentatricopeptide repeat-containing protein At2g136001.5e-1435.78Show/hide
Query:  GKQVCSLAVRNGFFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGENLMALDLFNRMCSGFLEPNSLTFSSVLTACAALEDL
        G QV SL  ++  F  + Y+ + ++D+++K  +  DA RVF ++   NVV WN++++   +NG  + ALD+F  M    +EP+ +T +SV++ACA+L  +
Subjt:  GKQVCSLAVRNGFFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGENLMALDLFNRMCSGFLEPNSLTFSSVLTACAALEDL

Query:  EFGKRVQGR
        + G+ V GR
Subjt:  EFGKRVQGR

Arabidopsis top hitse value%identityAlignment
AT1G74600.1 pentatricopeptide (PPR) repeat-containing protein1.3e-2345.08Show/hide
Query:  MFGKQVCSLAVRNGFFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGENLMALDLFNRMCSGFLEPNSLTFSSVLTACAALE
        +F + VC   ++ G+FF    V + +ID+F+K+  F DA +VF D    NV CWN I++ A+RN       DLF+ MC GF +P+S T+SSVL ACA+LE
Subjt:  MFGKQVCSLAVRNGFFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGENLMALDLFNRMCSGFLEPNSLTFSSVLTACAALE

Query:  DLEFGKRVQGRGLNV-AEKMFL
         L FGK VQ R +   AE +F+
Subjt:  DLEFGKRVQGRGLNV-AEKMFL

AT2G13600.1 Pentatricopeptide repeat (PPR) superfamily protein1.0e-1535.78Show/hide
Query:  GKQVCSLAVRNGFFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGENLMALDLFNRMCSGFLEPNSLTFSSVLTACAALEDL
        G QV SL  ++  F  + Y+ + ++D+++K  +  DA RVF ++   NVV WN++++   +NG  + ALD+F  M    +EP+ +T +SV++ACA+L  +
Subjt:  GKQVCSLAVRNGFFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGENLMALDLFNRMCSGFLEPNSLTFSSVLTACAALEDL

Query:  EFGKRVQGR
        + G+ V GR
Subjt:  EFGKRVQGR

AT3G02010.1 Pentatricopeptide repeat (PPR) superfamily protein2.8e-1335.24Show/hide
Query:  MFGKQVCSLAVRNGFFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGENLMALDLFNRMCSGFLEPNSLTFSSVLTACAALE
        + GKQ+ +  +R+G    N +  +G++D++AK  S  DA++VF ++   N V WNA++SA   NG+   A+  F +M    L+P+S++   VLTAC+   
Subjt:  MFGKQVCSLAVRNGFFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGENLMALDLFNRMCSGFLEPNSLTFSSVLTACAALE

Query:  DLEFG
         +E G
Subjt:  DLEFG

AT3G28640.1 Tetratricopeptide repeat (TPR)-like superfamily protein8.2e-1333.65Show/hide
Query:  GKQVCSLAVRNGFFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGENLMALDLFNRMCSGFLEPNSLTFSSVLTACAALEDL
        GKQ+    V+NG F  + +V+ G++ ++ +D   LDA +VF ++   +VV W+ +++  VR G     L++F  M    LEP+  + ++ LTACA +  L
Subjt:  GKQVCSLAVRNGFFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGENLMALDLFNRMCSGFLEPNSLTFSSVLTACAALEDL

Query:  EFGK
          GK
Subjt:  EFGK

AT3G28660.1 Tetratricopeptide repeat (TPR)-like superfamily protein8.2e-1332.69Show/hide
Query:  GKQVCSLAVRNGFFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGENLMALDLFNRMCSGFLEPNSLTFSSVLTACAALEDL
        GKQ+    V+NG F  +G+V+ G++ ++ +D    DA +VF ++   +VV W+ +++  VR G     L++F  M    +EP+  + ++ LTACA +  L
Subjt:  GKQVCSLAVRNGFFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGENLMALDLFNRMCSGFLEPNSLTFSSVLTACAALEDL

Query:  EFGK
          GK
Subjt:  EFGK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTGGTAAGCAGGTTTGTTCACTTGCGGTGAGAAATGGGTTTTTTTTTGTTAATGGTTATGTTCGAGCTGGGATGATTGATTTGTTTGCAAAAGATTCTAGTTTTCT
GGATGCTCTAAGGGTGTTTCATGATGTTGATTGTGAGAATGTGGTGTGCTGGAATGCTATTGTCTCTGCAGCTGTAAGAAATGGAGAGAATTTGATGGCTTTGGATCTTT
TCAATCGAATGTGTAGTGGGTTTCTGGAGCCTAATAGTCTTACATTTTCTAGTGTTCTAACTGCGTGTGCTGCACTTGAAGATCTTGAATTCGGGAAAAGAGTTCAAGGG
AGAGGATTAAATGTGGCGGAGAAGATGTTTTTGTAG
mRNA sequenceShow/hide mRNA sequence
ATGTTTGGTAAGCAGGTTTGTTCACTTGCGGTGAGAAATGGGTTTTTTTTTGTTAATGGTTATGTTCGAGCTGGGATGATTGATTTGTTTGCAAAAGATTCTAGTTTTCT
GGATGCTCTAAGGGTGTTTCATGATGTTGATTGTGAGAATGTGGTGTGCTGGAATGCTATTGTCTCTGCAGCTGTAAGAAATGGAGAGAATTTGATGGCTTTGGATCTTT
TCAATCGAATGTGTAGTGGGTTTCTGGAGCCTAATAGTCTTACATTTTCTAGTGTTCTAACTGCGTGTGCTGCACTTGAAGATCTTGAATTCGGGAAAAGAGTTCAAGGG
AGAGGATTAAATGTGGCGGAGAAGATGTTTTTGTAG
Protein sequenceShow/hide protein sequence
MFGKQVCSLAVRNGFFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGENLMALDLFNRMCSGFLEPNSLTFSSVLTACAALEDLEFGKRVQG
RGLNVAEKMFL