; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg029023 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg029023
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationscaffold5:16161087..16170052
RNA-Seq ExpressionSpg029023
SyntenySpg029023
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008233 - peptidase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6605288.1 S-adenosylmethionine carrier 1, chloroplastic/mitochondrial, partial [Cucurbita argyrosperma subsp. sororia]3.4e-2836.58Show/hide
Query:  SVSFHAEEEILAFTVSEKVN-----------------------------------------RKFRRSLVIAFCGFDPEGLDDILFTTTIKS-DIRIDLSP
        S+S HAEEEILAFT+ ++++                                            +RS++I F  + PEG +DIL TTT  + D  I +  
Subjt:  SVSFHAEEEILAFTVSEKVN-----------------------------------------RKFRRSLVIAFCGFDPEGLDDILFTTTIKS-DIRIDLSP

Query:  SWSCAPRFRIRRNAKSNYSTLHL-INPVDFFTPPAMSRLSLNAQAVVERPHIVKLKVWRYDSIGDHVHNNHRLYLST--------------GKFLVACVV
        S   A  FRIRR   +N+  L + +NP+  F PPA   L +  Q VVE   +V LK+W+YD+  ++V  N RL +                GKFLVACVV
Subjt:  SWSCAPRFRIRRNAKSNYSTLHL-INPVDFFTPPAMSRLSLNAQAVVERPHIVKLKVWRYDSIGDHVHNNHRLYLST--------------GKFLVACVV

Query:  KRREAKKRIFELFPEEKFDHSTVCTLVYYIRVYSIADSRHSSLVSGVEMKAGKVVHA
         R++A ++IF+LF  E+ D ST+CTLVY IRVYSIAD     ++    +KAG  V+A
Subjt:  KRREAKKRIFELFPEEKFDHSTVCTLVYYIRVYSIADSRHSSLVSGVEMKAGKVVHA

KAG7035246.1 hypothetical protein SDJN02_02041, partial [Cucurbita argyrosperma subsp. argyrosperma]2.1e-2544.32Show/hide
Query:  RKFRRSLVIAFCGFDPEGLDDILFTTTIKS-DIRIDLSPSWSCAPRFRIRRNAKSNYSTLHL-INPVDFFTPPAMSRLSLNAQAVVERPHIVKLKVWRYD
        +  +RS++I F  + PEG +DIL TTT  + D  I +  S   A  FRIRR   +N+  L + +NP+  F PPA   L +  Q VVE   +V LK+W+YD
Subjt:  RKFRRSLVIAFCGFDPEGLDDILFTTTIKS-DIRIDLSPSWSCAPRFRIRRNAKSNYSTLHL-INPVDFFTPPAMSRLSLNAQAVVERPHIVKLKVWRYD

Query:  SIGDHVHNNHRLYLST--------------GKFLVACVVKRREAKKRIFELFPEEKFDHSTVCTLVYYIRVYSIAD
        +  D+V  N RL +                GKFLVACVV R++A ++IF+LF  E+ D ST+CTLVY IRVYSIAD
Subjt:  SIGDHVHNNHRLYLST--------------GKFLVACVVKRREAKKRIFELFPEEKFDHSTVCTLVYYIRVYSIAD

XP_022947738.1 uncharacterized protein LOC111451512 [Cucurbita moschata]5.3e-2936.96Show/hide
Query:  SVSFHAEEEILAFTVSEKVN-----------------------------------------RKFRRSLVIAFCGFDPEGLDDILFTTTIKS-DIRIDLSP
        S+S HAEEEILAFT+ ++++                                         +  +RS++I F  + PEG +DIL TTT  + D  I +  
Subjt:  SVSFHAEEEILAFTVSEKVN-----------------------------------------RKFRRSLVIAFCGFDPEGLDDILFTTTIKS-DIRIDLSP

Query:  SWSCAPRFRIRRNAKSNYSTLHL-INPVDFFTPPAMSRLSLNAQAVVERPHIVKLKVWRYDSIGDHVHNNHRLYLST--------------GKFLVACVV
        S   A  FRIRR   +N+  L + +NP+  F PPA   L +  Q VVE   +V LK+W+YD+  D+V  N RL +                GKFLVACVV
Subjt:  SWSCAPRFRIRRNAKSNYSTLHL-INPVDFFTPPAMSRLSLNAQAVVERPHIVKLKVWRYDSIGDHVHNNHRLYLST--------------GKFLVACVV

Query:  KRREAKKRIFELFPEEKFDHSTVCTLVYYIRVYSIADSRHSSLVSGVEMKAGKVVHA
         R++A ++IF+LF  E+ D ST+CTLVY IRVYSIAD     ++    +KAG  V+A
Subjt:  KRREAKKRIFELFPEEKFDHSTVCTLVYYIRVYSIADSRHSSLVSGVEMKAGKVVHA

XP_023007549.1 uncharacterized protein LOC111500010 [Cucurbita maxima]5.9e-2836.58Show/hide
Query:  SVSFHAEEEILAFTVSEKVN-----------------------------------------RKFRRSLVIAFCGFDPEGLDDILFTTTIKS-DIRIDLSP
        S+S HAEEEILAFT+ ++++                                         +  +RS++I F  + PEG +DIL TTT  + D  I +  
Subjt:  SVSFHAEEEILAFTVSEKVN-----------------------------------------RKFRRSLVIAFCGFDPEGLDDILFTTTIKS-DIRIDLSP

Query:  SWSCAPRFRIRRNAKSNYSTLHL-INPVDFFTPPAMSRLSLNAQAVVERPHIVKLKVWRYDSIGDHVHNNHRLYLST--------------GKFLVACVV
        S   A  FRIRR   +N+  L + +NP+  F PPA   L +  Q VVE   +V LK+W+YD+   +V  N RL +                GKFLVACVV
Subjt:  SWSCAPRFRIRRNAKSNYSTLHL-INPVDFFTPPAMSRLSLNAQAVVERPHIVKLKVWRYDSIGDHVHNNHRLYLST--------------GKFLVACVV

Query:  KRREAKKRIFELFPEEKFDHSTVCTLVYYIRVYSIADSRHSSLVSGVEMKAGKVVHA
         R++A ++IF+LF  E+ D ST+CTLVY IRVYSIAD     ++    +KAG  V+A
Subjt:  KRREAKKRIFELFPEEKFDHSTVCTLVYYIRVYSIADSRHSSLVSGVEMKAGKVVHA

XP_023534521.1 uncharacterized protein LOC111796064 [Cucurbita pepo subsp. pepo]5.3e-2936.96Show/hide
Query:  SVSFHAEEEILAFTVSEKVN-----------------------------------------RKFRRSLVIAFCGFDPEGLDDILFTTTIKS-DIRIDLSP
        S+S HAEEEILAFT+ ++++                                         +  +RS++I F  + PEG +DIL TTT  + D  I +  
Subjt:  SVSFHAEEEILAFTVSEKVN-----------------------------------------RKFRRSLVIAFCGFDPEGLDDILFTTTIKS-DIRIDLSP

Query:  SWSCAPRFRIRRNAKSNYSTLHL-INPVDFFTPPAMSRLSLNAQAVVERPHIVKLKVWRYDSIGDHVHNNHRLYLST--------------GKFLVACVV
        S   A  FRIRR   +N+  L + +NP+  F PPA   L +  Q VVE   +V LK+W+YD+  D+V  N RL +                GKFLVACVV
Subjt:  SWSCAPRFRIRRNAKSNYSTLHL-INPVDFFTPPAMSRLSLNAQAVVERPHIVKLKVWRYDSIGDHVHNNHRLYLST--------------GKFLVACVV

Query:  KRREAKKRIFELFPEEKFDHSTVCTLVYYIRVYSIADSRHSSLVSGVEMKAGKVVHA
         R++A ++IF+LF  E+ D ST+CTLVY IRVYSIAD     ++    +KAG  V+A
Subjt:  KRREAKKRIFELFPEEKFDHSTVCTLVYYIRVYSIADSRHSSLVSGVEMKAGKVVHA

TrEMBL top hitse value%identityAlignment
A0A6J1D906 Reverse transcriptase4.7e-2373.08Show/hide
Query:  MSATKQLAKTHVERLVEIEEQLLFLREIPDSVRYVEHRIEEISEKLGEIDAVNARIDGLPVQELALRVETLESKTTKP
        MS TKQL+K+HV+RLVEIEEQLL+LRE+PD +R +E R++E SEK GEIDAVNARIDGLP+Q++A+RVETLESK T+P
Subjt:  MSATKQLAKTHVERLVEIEEQLLFLREIPDSVRYVEHRIEEISEKLGEIDAVNARIDGLPVQELALRVETLESKTTKP

A0A6J1DK29 uncharacterized protein LOC1110218291.6e-2373.08Show/hide
Query:  MSATKQLAKTHVERLVEIEEQLLFLREIPDSVRYVEHRIEEISEKLGEIDAVNARIDGLPVQELALRVETLESKTTKP
        MSATKQL+K+HV+RLVEIEEQLL+LRE+PDS+R +E R++E SEK GEIDAVNAR+DGLP+Q++A+RVET ESK T+P
Subjt:  MSATKQLAKTHVERLVEIEEQLLFLREIPDSVRYVEHRIEEISEKLGEIDAVNARIDGLPVQELALRVETLESKTTKP

A0A6J1DLQ6 uncharacterized protein LOC1110223207.0e-1968.83Show/hide
Query:  MSATKQLAKTHVERLVEIEEQLLFLREIPDSVRYVEHRIEEISEKLGEIDAVNARIDGLPVQELALRVETLESKTTK
        MS TKQL K+H++RLVEIEE+LLFLREIPD++RYVE R++EIS K   ID VNARIDGL ++EL LRVETLE K  +
Subjt:  MSATKQLAKTHVERLVEIEEQLLFLREIPDSVRYVEHRIEEISEKLGEIDAVNARIDGLPVQELALRVETLESKTTK

A0A6J1G7S8 uncharacterized protein LOC1114515122.6e-2936.96Show/hide
Query:  SVSFHAEEEILAFTVSEKVN-----------------------------------------RKFRRSLVIAFCGFDPEGLDDILFTTTIKS-DIRIDLSP
        S+S HAEEEILAFT+ ++++                                         +  +RS++I F  + PEG +DIL TTT  + D  I +  
Subjt:  SVSFHAEEEILAFTVSEKVN-----------------------------------------RKFRRSLVIAFCGFDPEGLDDILFTTTIKS-DIRIDLSP

Query:  SWSCAPRFRIRRNAKSNYSTLHL-INPVDFFTPPAMSRLSLNAQAVVERPHIVKLKVWRYDSIGDHVHNNHRLYLST--------------GKFLVACVV
        S   A  FRIRR   +N+  L + +NP+  F PPA   L +  Q VVE   +V LK+W+YD+  D+V  N RL +                GKFLVACVV
Subjt:  SWSCAPRFRIRRNAKSNYSTLHL-INPVDFFTPPAMSRLSLNAQAVVERPHIVKLKVWRYDSIGDHVHNNHRLYLST--------------GKFLVACVV

Query:  KRREAKKRIFELFPEEKFDHSTVCTLVYYIRVYSIADSRHSSLVSGVEMKAGKVVHA
         R++A ++IF+LF  E+ D ST+CTLVY IRVYSIAD     ++    +KAG  V+A
Subjt:  KRREAKKRIFELFPEEKFDHSTVCTLVYYIRVYSIADSRHSSLVSGVEMKAGKVVHA

A0A6J1L591 uncharacterized protein LOC1115000102.8e-2836.58Show/hide
Query:  SVSFHAEEEILAFTVSEKVN-----------------------------------------RKFRRSLVIAFCGFDPEGLDDILFTTTIKS-DIRIDLSP
        S+S HAEEEILAFT+ ++++                                         +  +RS++I F  + PEG +DIL TTT  + D  I +  
Subjt:  SVSFHAEEEILAFTVSEKVN-----------------------------------------RKFRRSLVIAFCGFDPEGLDDILFTTTIKS-DIRIDLSP

Query:  SWSCAPRFRIRRNAKSNYSTLHL-INPVDFFTPPAMSRLSLNAQAVVERPHIVKLKVWRYDSIGDHVHNNHRLYLST--------------GKFLVACVV
        S   A  FRIRR   +N+  L + +NP+  F PPA   L +  Q VVE   +V LK+W+YD+   +V  N RL +                GKFLVACVV
Subjt:  SWSCAPRFRIRRNAKSNYSTLHL-INPVDFFTPPAMSRLSLNAQAVVERPHIVKLKVWRYDSIGDHVHNNHRLYLST--------------GKFLVACVV

Query:  KRREAKKRIFELFPEEKFDHSTVCTLVYYIRVYSIADSRHSSLVSGVEMKAGKVVHA
         R++A ++IF+LF  E+ D ST+CTLVY IRVYSIAD     ++    +KAG  V+A
Subjt:  KRREAKKRIFELFPEEKFDHSTVCTLVYYIRVYSIADSRHSSLVSGVEMKAGKVVHA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTCTCTACGGAGCTCGACGTTCGACCCTTGACCATTTGTACCCAAGGCCTCGAACTGTGCATCACCCCTTGGCCATTTGTACTCAAGATGAGGATTTTGGATGTTGT
TGGATCTGTTTCTTTCCATGCTGAAGAAGAAATTCTTGCTTTTACAGTATCTGAAAAGGTGAATAGAAAATTCAGGCGTAGCTTGGTCATAGCATTTTGTGGTTTTGATC
CAGAAGGGTTGGATGATATACTTTTTACCACCACCATTAAATCAGATATTAGAATCGATTTAAGTCCAAGTTGGAGTTGTGCACCAAGATTTCGGATTCGTCGAAATGCT
AAAAGTAATTACTCGACGTTGCATTTGATTAACCCTGTTGACTTCTTTACTCCACCCGCGATGTCGAGATTGTCTCTGAATGCTCAAGCTGTGGTGGAGCGCCCCCATAT
TGTGAAACTAAAAGTTTGGCGATATGATTCGATAGGCGACCATGTGCATAACAACCATCGTTTATATCTCTCCACGGGCAAGTTTCTGGTTGCTTGTGTCGTTAAGAGGA
GAGAAGCTAAGAAGAGGATCTTCGAATTATTTCCTGAGGAAAAGTTCGATCACTCCACAGTTTGCACGCTTGTATATTATATACGCGTGTATTCTATAGCAGATTCCAGA
CATAGTTCACTTGTCTCGGGAGTAGAAATGAAAGCTGGGAAGGTTGTGCATGCATTTACCATACTGAGCTCAGATTTATCGAATCTTCGACATGGAACTTGTGGGAACCG
TGAGACTATACCTTCTCCAGGCTCTGGATGGAAGCATGCCACGCTGGTTGTCTTGGGCATGCTTGTCCAGGGCGAGTTGTGTCTATGGCCGCATGCCCAAACCATCCTCA
ACCACCATGTTTTAGAGTCATTGGTACCAGAGCTTAGTTTGCTCCTTGGCTTTCGAGAAGTTACAATCATGTCGGCGACTAAACAGTTGGCTAAGACGCATGTCGAGCGA
CTTGTTGAGATAGAGGAACAGTTGCTTTTCCTGAGGGAAATACCGGACTCGGTACGATACGTTGAACATCGAATTGAAGAGATCTCCGAGAAGCTAGGTGAGATCGATGC
GGTGAATGCCCGAATCGATGGGCTACCCGTACAAGAGTTGGCATTGAGGGTTGAGACCCTAGAAAGCAAGACTACGAAGCCTGTTTTGGGGGCGTATGACTCCTCGCGAT
GTCGCATGTTCACGCCTGCCAGACTAAAAATGCCTCATAATGTACTATAG
mRNA sequenceShow/hide mRNA sequence
ATGCTCTCTACGGAGCTCGACGTTCGACCCTTGACCATTTGTACCCAAGGCCTCGAACTGTGCATCACCCCTTGGCCATTTGTACTCAAGATGAGGATTTTGGATGTTGT
TGGATCTGTTTCTTTCCATGCTGAAGAAGAAATTCTTGCTTTTACAGTATCTGAAAAGGTGAATAGAAAATTCAGGCGTAGCTTGGTCATAGCATTTTGTGGTTTTGATC
CAGAAGGGTTGGATGATATACTTTTTACCACCACCATTAAATCAGATATTAGAATCGATTTAAGTCCAAGTTGGAGTTGTGCACCAAGATTTCGGATTCGTCGAAATGCT
AAAAGTAATTACTCGACGTTGCATTTGATTAACCCTGTTGACTTCTTTACTCCACCCGCGATGTCGAGATTGTCTCTGAATGCTCAAGCTGTGGTGGAGCGCCCCCATAT
TGTGAAACTAAAAGTTTGGCGATATGATTCGATAGGCGACCATGTGCATAACAACCATCGTTTATATCTCTCCACGGGCAAGTTTCTGGTTGCTTGTGTCGTTAAGAGGA
GAGAAGCTAAGAAGAGGATCTTCGAATTATTTCCTGAGGAAAAGTTCGATCACTCCACAGTTTGCACGCTTGTATATTATATACGCGTGTATTCTATAGCAGATTCCAGA
CATAGTTCACTTGTCTCGGGAGTAGAAATGAAAGCTGGGAAGGTTGTGCATGCATTTACCATACTGAGCTCAGATTTATCGAATCTTCGACATGGAACTTGTGGGAACCG
TGAGACTATACCTTCTCCAGGCTCTGGATGGAAGCATGCCACGCTGGTTGTCTTGGGCATGCTTGTCCAGGGCGAGTTGTGTCTATGGCCGCATGCCCAAACCATCCTCA
ACCACCATGTTTTAGAGTCATTGGTACCAGAGCTTAGTTTGCTCCTTGGCTTTCGAGAAGTTACAATCATGTCGGCGACTAAACAGTTGGCTAAGACGCATGTCGAGCGA
CTTGTTGAGATAGAGGAACAGTTGCTTTTCCTGAGGGAAATACCGGACTCGGTACGATACGTTGAACATCGAATTGAAGAGATCTCCGAGAAGCTAGGTGAGATCGATGC
GGTGAATGCCCGAATCGATGGGCTACCCGTACAAGAGTTGGCATTGAGGGTTGAGACCCTAGAAAGCAAGACTACGAAGCCTGTTTTGGGGGCGTATGACTCCTCGCGAT
GTCGCATGTTCACGCCTGCCAGACTAAAAATGCCTCATAATGTACTATAG
Protein sequenceShow/hide protein sequence
MLSTELDVRPLTICTQGLELCITPWPFVLKMRILDVVGSVSFHAEEEILAFTVSEKVNRKFRRSLVIAFCGFDPEGLDDILFTTTIKSDIRIDLSPSWSCAPRFRIRRNA
KSNYSTLHLINPVDFFTPPAMSRLSLNAQAVVERPHIVKLKVWRYDSIGDHVHNNHRLYLSTGKFLVACVVKRREAKKRIFELFPEEKFDHSTVCTLVYYIRVYSIADSR
HSSLVSGVEMKAGKVVHAFTILSSDLSNLRHGTCGNRETIPSPGSGWKHATLVVLGMLVQGELCLWPHAQTILNHHVLESLVPELSLLLGFREVTIMSATKQLAKTHVER
LVEIEEQLLFLREIPDSVRYVEHRIEEISEKLGEIDAVNARIDGLPVQELALRVETLESKTTKPVLGAYDSSRCRMFTPARLKMPHNVL