; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg020386 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg020386
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationscaffold1:34274419..34281832
RNA-Seq ExpressionSpg020386
SyntenySpg020386
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0090304 - nucleic acid metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016740 - transferase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0052218.1 uncharacterized protein E6C27_scaffold207G00290 [Cucumis melo var. makuwa]5.3e-3139.26Show/hide
Query:  PQIEQPGPAAEPVTV-------DAIQAISTGFSSKAGGSLVFSAVFEHSGVYKMHKKLYLVSCRVFENEEFDVYVINFGSVLRKLWVLEVRPEAIWYRVH
        P I + G   +P+ V       + ++  ST F SK+GG LVFS                                            L + P      +H
Subjt:  PQIEQPGPAAEPVTV-------DAIQAISTGFSSKAGGSLVFSAVFEHSGVYKMHKKLYLVSCRVFENEEFDVYVINFGSVLRKLWVLEVRPEAIWYRVH

Query:  TGR-DRA-----PVPNTLPTSAKSSRTSSNV------GSVQIELAEYGLSIVESES---------------------------------RVMPPRTGRRR
        TG+ D+      PVP+TLPTSA+SS+++S+        SV +E+  Y  S V   S                                  VMPPRT RRR
Subjt:  TGR-DRA-----PVPNTLPTSAKSSRTSSNV------GSVQIELAEYGLSIVESES---------------------------------RVMPPRTGRRR

Query:  RQNQDGTQDPTQSQSESGSSTREVIMRQGSERFARSAQEIGRPERAGPSDSEKMYGIERLKKLEAAVFEGSTDPADAEVWLNMLENASSDEGFPERRR
        RQNQDG Q PTQ QSE GSST       GSERFARSAQEI RPERA PSD +KMYGIE+LKKL A VF+GSTDPADAEVWLNMLE        P+ ++
Subjt:  RQNQDGTQDPTQSQSESGSSTREVIMRQGSERFARSAQEIGRPERAGPSDSEKMYGIERLKKLEAAVFEGSTDPADAEVWLNMLENASSDEGFPERRR

KAA0056353.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]9.7e-3362.5Show/hide
Query:  PVPNTLPTSAKSSRTSSNVGSVQIELAE-------------YGLSIVESESRVMPPRTGRRRRQNQDGTQDPTQSQSESGSSTREVIMRQGSERFARSAQ
        PVP+TLPTSA+SS ++S+ GSV I   +              G+        VMPPRT +R RQNQDGTQDPTQ QSE GSST       GSERF+RSAQ
Subjt:  PVPNTLPTSAKSSRTSSNVGSVQIELAE-------------YGLSIVESESRVMPPRTGRRRRQNQDGTQDPTQSQSESGSSTREVIMRQGSERFARSAQ

Query:  EIGRPERAGPSDSEKMYGIERLKKLEAAVFEGSTDPADAEVWLN
        EIGRPE+AGPSD EKMYGIERLKKLEA VF+GSTD ADAEVW N
Subjt:  EIGRPERAGPSDSEKMYGIERLKKLEAAVFEGSTDPADAEVWLN

TYK01041.1 uncharacterized protein E5676_scaffold264G00470 [Cucumis melo var. makuwa]2.5e-3353.76Show/hide
Query:  PVPNTLPTSAKSSRTSSNVGSVQIELAEYGLSIVESESR--------------------------VMPPRTGRRRRQNQDGTQDPTQSQSESGSSTREVI
        PVP+TLPTS +SS ++S+  + Q+    + +  +  ES                           VMPPRT RR +QNQD TQDPTQ QSE GSST    
Subjt:  PVPNTLPTSAKSSRTSSNVGSVQIELAEYGLSIVESESR--------------------------VMPPRTGRRRRQNQDGTQDPTQSQSESGSSTREVI

Query:  MRQGSERFARSAQEIGRPERAGPSDSEKMYGIERLKKLEAAVFEGSTDPADAEVWLNMLENASSDEGFPERRR
           GSERFARSAQEIGRPER GPSD EKMYGIERLKKL A VFEGSTDPA+AEVWLNMLE        P++R+
Subjt:  MRQGSERFARSAQEIGRPERAGPSDSEKMYGIERLKKLEAAVFEGSTDPADAEVWLNMLENASSDEGFPERRR

XP_008448403.1 PREDICTED: uncharacterized protein LOC103490604 [Cucumis melo]6.9e-3172.9Show/hide
Query:  MPPRTGRRRRQNQDGTQDPTQSQSESGSSTREVIMRQGSERFARSAQEIGRPERAGPSDSEKMYGIERLKKLEAAVFEGSTDPADAEVWLNMLENASSDE
        MPPRT RR +QNQD TQDPTQ QSE GSST       GSERFARSAQEIGRPER GPSD EKMYGIERLKKL A VFEGSTDPA+AEVWLNMLE      
Subjt:  MPPRTGRRRRQNQDGTQDPTQSQSESGSSTREVIMRQGSERFARSAQEIGRPERAGPSDSEKMYGIERLKKLEAAVFEGSTDPADAEVWLNMLENASSDE

Query:  GFPERRR
          P++R+
Subjt:  GFPERRR

XP_016901625.1 PREDICTED: uncharacterized protein LOC107991320 [Cucumis melo]4.1e-3172.9Show/hide
Query:  MPPRTGRRRRQNQDGTQDPTQSQSESGSSTREVIMRQGSERFARSAQEIGRPERAGPSDSEKMYGIERLKKLEAAVFEGSTDPADAEVWLNMLENASSDE
        MPPRT +R RQNQDGTQDPTQ QSE GSST       GSERF+RSAQEIGRPE+AGPSD EKMYGIERLKKLEA VF+GSTD ADAEVWLNMLE      
Subjt:  MPPRTGRRRRQNQDGTQDPTQSQSESGSSTREVIMRQGSERFARSAQEIGRPERAGPSDSEKMYGIERLKKLEAAVFEGSTDPADAEVWLNMLENASSDE

Query:  GFPERRR
          P+ R+
Subjt:  GFPERRR

TrEMBL top hitse value%identityAlignment
A0A1S3BJ07 uncharacterized protein LOC1034906043.4e-3172.9Show/hide
Query:  MPPRTGRRRRQNQDGTQDPTQSQSESGSSTREVIMRQGSERFARSAQEIGRPERAGPSDSEKMYGIERLKKLEAAVFEGSTDPADAEVWLNMLENASSDE
        MPPRT RR +QNQD TQDPTQ QSE GSST       GSERFARSAQEIGRPER GPSD EKMYGIERLKKL A VFEGSTDPA+AEVWLNMLE      
Subjt:  MPPRTGRRRRQNQDGTQDPTQSQSESGSSTREVIMRQGSERFARSAQEIGRPERAGPSDSEKMYGIERLKKLEAAVFEGSTDPADAEVWLNMLENASSDE

Query:  GFPERRR
          P++R+
Subjt:  GFPERRR

A0A1S4E0X6 uncharacterized protein LOC1079913202.0e-3172.9Show/hide
Query:  MPPRTGRRRRQNQDGTQDPTQSQSESGSSTREVIMRQGSERFARSAQEIGRPERAGPSDSEKMYGIERLKKLEAAVFEGSTDPADAEVWLNMLENASSDE
        MPPRT +R RQNQDGTQDPTQ QSE GSST       GSERF+RSAQEIGRPE+AGPSD EKMYGIERLKKLEA VF+GSTD ADAEVWLNMLE      
Subjt:  MPPRTGRRRRQNQDGTQDPTQSQSESGSSTREVIMRQGSERFARSAQEIGRPERAGPSDSEKMYGIERLKKLEAAVFEGSTDPADAEVWLNMLENASSDE

Query:  GFPERRR
          P+ R+
Subjt:  GFPERRR

A0A5A7UAH6 CCHC-type domain-containing protein2.6e-3139.26Show/hide
Query:  PQIEQPGPAAEPVTV-------DAIQAISTGFSSKAGGSLVFSAVFEHSGVYKMHKKLYLVSCRVFENEEFDVYVINFGSVLRKLWVLEVRPEAIWYRVH
        P I + G   +P+ V       + ++  ST F SK+GG LVFS                                            L + P      +H
Subjt:  PQIEQPGPAAEPVTV-------DAIQAISTGFSSKAGGSLVFSAVFEHSGVYKMHKKLYLVSCRVFENEEFDVYVINFGSVLRKLWVLEVRPEAIWYRVH

Query:  TGR-DRA-----PVPNTLPTSAKSSRTSSNV------GSVQIELAEYGLSIVESES---------------------------------RVMPPRTGRRR
        TG+ D+      PVP+TLPTSA+SS+++S+        SV +E+  Y  S V   S                                  VMPPRT RRR
Subjt:  TGR-DRA-----PVPNTLPTSAKSSRTSSNV------GSVQIELAEYGLSIVESES---------------------------------RVMPPRTGRRR

Query:  RQNQDGTQDPTQSQSESGSSTREVIMRQGSERFARSAQEIGRPERAGPSDSEKMYGIERLKKLEAAVFEGSTDPADAEVWLNMLENASSDEGFPERRR
        RQNQDG Q PTQ QSE GSST       GSERFARSAQEI RPERA PSD +KMYGIE+LKKL A VF+GSTDPADAEVWLNMLE        P+ ++
Subjt:  RQNQDGTQDPTQSQSESGSSTREVIMRQGSERFARSAQEIGRPERAGPSDSEKMYGIERLKKLEAAVFEGSTDPADAEVWLNMLENASSDEGFPERRR

A0A5A7UKD3 DNA/RNA polymerases superfamily protein4.7e-3362.5Show/hide
Query:  PVPNTLPTSAKSSRTSSNVGSVQIELAE-------------YGLSIVESESRVMPPRTGRRRRQNQDGTQDPTQSQSESGSSTREVIMRQGSERFARSAQ
        PVP+TLPTSA+SS ++S+ GSV I   +              G+        VMPPRT +R RQNQDGTQDPTQ QSE GSST       GSERF+RSAQ
Subjt:  PVPNTLPTSAKSSRTSSNVGSVQIELAE-------------YGLSIVESESRVMPPRTGRRRRQNQDGTQDPTQSQSESGSSTREVIMRQGSERFARSAQ

Query:  EIGRPERAGPSDSEKMYGIERLKKLEAAVFEGSTDPADAEVWLN
        EIGRPE+AGPSD EKMYGIERLKKLEA VF+GSTD ADAEVW N
Subjt:  EIGRPERAGPSDSEKMYGIERLKKLEAAVFEGSTDPADAEVWLN

A0A5D3BMJ4 Retrotrans_gag domain-containing protein1.2e-3353.76Show/hide
Query:  PVPNTLPTSAKSSRTSSNVGSVQIELAEYGLSIVESESR--------------------------VMPPRTGRRRRQNQDGTQDPTQSQSESGSSTREVI
        PVP+TLPTS +SS ++S+  + Q+    + +  +  ES                           VMPPRT RR +QNQD TQDPTQ QSE GSST    
Subjt:  PVPNTLPTSAKSSRTSSNVGSVQIELAEYGLSIVESESR--------------------------VMPPRTGRRRRQNQDGTQDPTQSQSESGSSTREVI

Query:  MRQGSERFARSAQEIGRPERAGPSDSEKMYGIERLKKLEAAVFEGSTDPADAEVWLNMLENASSDEGFPERRR
           GSERFARSAQEIGRPER GPSD EKMYGIERLKKL A VFEGSTDPA+AEVWLNMLE        P++R+
Subjt:  MRQGSERFARSAQEIGRPERAGPSDSEKMYGIERLKKLEAAVFEGSTDPADAEVWLNMLENASSDEGFPERRR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGTCGCTCGCCGCCGACGGTAAAGCTTCGCTCTCTCTCACGTGTGGGTTTCTTTGTTTCTTCCCCTCGCCCGTTCTCACTCTCTCTCTCTCTTATGTATCAGTTGC
CGCCGTCACCTTCTCCGTTGAGAACCGCCGCACGTCGATTTTTACCTCACGCCCGCCGTCTGCTTCACCACGCGTCGCCGTCTATTCGCCGTCGCACACGAGCCGCCGCC
GATCTGTTCATTGCCTTTCAGATCTGTCCGTCGCTGCCGTTCGATTTTTAAGCTTTGCTTCTAAAGAATTAGTTGGtagtggttccgctttagagatcatgcctccccgt
gaacgaggccgtggaagaggtcgtggaaaaggccgtggtaggggtcgtacagcccctgaagcagttgtgccaccattggggcatggagataatctaccagaagatccgca
aattgaacagccgggacctgcagcagaacctgtcacagtggatgccatccaggcaattagcacaggattttcgagcaaagcaggaggatctctggttttctctgctgttt
ttgagcattctggggtgtacaagatgcataagaagctctatttggtaagttgtagagtgttcgaaaatgaagagtttgatgtgtatgtcattaattttggttcagtattg
aggaagttatgggtgttggaagttaggcccgaggctatatggtaccgtgtgcacacaggtagagatcgagctccggtgcctaatacactgccaacgtctgctaaaagttc
cagaacaagttccaacgttgggtcagttcagatagaactagcggagtatggtctgagtattgtcgagagtgagtccagagtcatgccaccacgtaccggcagacgacgca
ggcagaatcaggacgggacgcaggatcctacccaaagtcaatctgaaagtggatccagtacccgagaggtcataatgaggcaggggagtgagcgatttgctagatctgct
caggagatcggtaggccagagagagcagggcctagtgattcggaaaagatgtatgggatagaacggttgaagaagttagaagccgcagtgtttgagggttccacggatcc
agctgacgccgaggtctggttgaatatgttggagaatgcttcgagtgatgaaggatttcctgagaggaggcgagag
mRNA sequenceShow/hide mRNA sequence
ATGGAGTCGCTCGCCGCCGACGGTAAAGCTTCGCTCTCTCTCACGTGTGGGTTTCTTTGTTTCTTCCCCTCGCCCGTTCTCACTCTCTCTCTCTCTTATGTATCAGTTGC
CGCCGTCACCTTCTCCGTTGAGAACCGCCGCACGTCGATTTTTACCTCACGCCCGCCGTCTGCTTCACCACGCGTCGCCGTCTATTCGCCGTCGCACACGAGCCGCCGCC
GATCTGTTCATTGCCTTTCAGATCTGTCCGTCGCTGCCGTTCGATTTTTAAGCTTTGCTTCTAAAGAATTAGTTGGtagtggttccgctttagagatcatgcctccccgt
gaacgaggccgtggaagaggtcgtggaaaaggccgtggtaggggtcgtacagcccctgaagcagttgtgccaccattggggcatggagataatctaccagaagatccgca
aattgaacagccgggacctgcagcagaacctgtcacagtggatgccatccaggcaattagcacaggattttcgagcaaagcaggaggatctctggttttctctgctgttt
ttgagcattctggggtgtacaagatgcataagaagctctatttggtaagttgtagagtgttcgaaaatgaagagtttgatgtgtatgtcattaattttggttcagtattg
aggaagttatgggtgttggaagttaggcccgaggctatatggtaccgtgtgcacacaggtagagatcgagctccggtgcctaatacactgccaacgtctgctaaaagttc
cagaacaagttccaacgttgggtcagttcagatagaactagcggagtatggtctgagtattgtcgagagtgagtccagagtcatgccaccacgtaccggcagacgacgca
ggcagaatcaggacgggacgcaggatcctacccaaagtcaatctgaaagtggatccagtacccgagaggtcataatgaggcaggggagtgagcgatttgctagatctgct
caggagatcggtaggccagagagagcagggcctagtgattcggaaaagatgtatgggatagaacggttgaagaagttagaagccgcagtgtttgagggttccacggatcc
agctgacgccgaggtctggttgaatatgttggagaatgcttcgagtgatgaaggatttcctgagaggaggcgagag
Protein sequenceShow/hide protein sequence
MESLAADGKASLSLTCGFLCFFPSPVLTLSLSYVSVAAVTFSVENRRTSIFTSRPPSASPRVAVYSPSHTSRRRSVHCLSDLSVAAVRFLSFASKELVGSGSALEIMPPR
ERGRGRGRGKGRGRGRTAPEAVVPPLGHGDNLPEDPQIEQPGPAAEPVTVDAIQAISTGFSSKAGGSLVFSAVFEHSGVYKMHKKLYLVSCRVFENEEFDVYVINFGSVL
RKLWVLEVRPEAIWYRVHTGRDRAPVPNTLPTSAKSSRTSSNVGSVQIELAEYGLSIVESESRVMPPRTGRRRRQNQDGTQDPTQSQSESGSSTREVIMRQGSERFARSA
QEIGRPERAGPSDSEKMYGIERLKKLEAAVFEGSTDPADAEVWLNMLENASSDEGFPERRRE