CuGenDBv2

Spg002304 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg002304
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Gag-protease polyprotein
Genome location: scaffold1:30174361..30179718
RNA-Seq Expression: Spg002304
Synteny: Spg002304
Gene Ontology terms:
GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0008233 - peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains: NA
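
The genome location above follows a `seqid:start..end` pattern. As a minimal sketch, the snippet below splits such a string and computes the span of the locus. The helper name `parse_location` is hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


seqid, start, end = parse_location("scaffold1:30174361..30179718")
# Assumption: coordinates are 1-based and inclusive (GFF-style).
span = end - start + 1
print(seqid, start, end, span)
```

Under that coordinate assumption the locus spans 5358 bp.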


Homology
GenBank top hits (e value, % identity, alignment)
KAA0026081.1 pol protein [Cucumis melo var. makuwa] (e value: 6.8e-12, % identity: 75.56)
Query:  RDCPLICMGESGQYTDSISLSFWGQDRVGSWKHNHTRWNSLIPDF
        RDCPLIC GESGQ  DSI LSF GQDR+GSW+HN+TRWN L+P F
Subjt:  RDCPLICMGESGQYTDSISLSFWGQDRVGSWKHNHTRWNSLIPDF

KAA0026081.1 pol protein [Cucumis melo var. makuwa] (e value: 6.6e+01, % identity: 33.33)
Query:  GQKRKHEQTTTNLQRSQHSSESSRQKTQRDKQEGNG--NDKPKCNSCGRQHWGQCMAGKGVCFKCHQEGHMANFQPTAALVSSS
        GQKRK E     + +         Q+ +R+         + P C  CGR H G C+AG GVCF+C Q GH  +  P   + ++S
Subjt:  GQKRKHEQTTTNLQRSQHSSESSRQKTQRDKQEGNG--NDKPKCNSCGRQHWGQCMAGKGVCFKCHQEGHMANFQPTAALVSSS

KAA0026081.1 pol protein [Cucumis melo var. makuwa] (e value: 4.6e-08, % identity: 80.56)
Query:  MFTRDCPLICMGESGQYTDSISLSFWGQDRVGSWKH
        M+ RDCPLIC GESGQ  DSI LSFWGQDRVG WKH
Subjt:  MFTRDCPLICMGESGQYTDSISLSFWGQDRVGSWKH

KAA0047821.1 uncharacterized protein E6C27_scaffold133G00730 [Cucumis melo var. makuwa] (e value: 1.1e-12, % identity: 67.31)
Query:  DCPLICMGESGQYTDSISLSFWGQDRVGSWKHNHTRWNSLIPDFEEVDECSL
        DCPLI   ESGQ  DS+ LSFWGQDRV SW+HNHTRWNSL+P F E+ + S+
Subjt:  DCPLICMGESGQYTDSISLSFWGQDRVGSWKHNHTRWNSLIPDFEEVDECSL

KAA0049822.1 reverse transcriptase [Cucumis melo var. makuwa] (e value: 4.6e-08, % identity: 80.56)
Query:  MFTRDCPLICMGESGQYTDSISLSFWGQDRVGSWKH
        M+ RDCPLIC GESGQ  DSI LSFWGQDRVG WKH
Subjt:  MFTRDCPLICMGESGQYTDSISLSFWGQDRVGSWKH

TYK05792.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 3.9e-15, % identity: 76)
Query:  MFTRDCPLICMGESGQYTDSISLSFWGQDRVGSWKHNHTRWNSLIPDFEE
        M+ RDCPL C GESGQ  DSI LSFWGQDRVGSW+HNHTRWNS IP F +
Subjt:  MFTRDCPLICMGESGQYTDSISLSFWGQDRVGSWKHNHTRWNSLIPDFEE

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SPG1 Pol protein (e value: 3.3e-12, % identity: 75.56)
Query:  RDCPLICMGESGQYTDSISLSFWGQDRVGSWKHNHTRWNSLIPDF
        RDCPLIC GESGQ  DSI LSF GQDR+GSW+HN+TRWN L+P F
Subjt:  RDCPLICMGESGQYTDSISLSFWGQDRVGSWKHNHTRWNSLIPDF

A0A5A7SPG1 Pol protein (e value: 3.2e+01, % identity: 33.33)
Query:  GQKRKHEQTTTNLQRSQHSSESSRQKTQRDKQEGNG--NDKPKCNSCGRQHWGQCMAGKGVCFKCHQEGHMANFQPTAALVSSS
        GQKRK E     + +         Q+ +R+         + P C  CGR H G C+AG GVCF+C Q GH  +  P   + ++S
Subjt:  GQKRKHEQTTTNLQRSQHSSESSRQKTQRDKQEGNG--NDKPKCNSCGRQHWGQCMAGKGVCFKCHQEGHMANFQPTAALVSSS

A0A5A7SPG1 Pol protein (e value: 2.2e-08, % identity: 80.56)
Query:  MFTRDCPLICMGESGQYTDSISLSFWGQDRVGSWKH
        M+ RDCPLIC GESGQ  DSI LSFWGQDRVG WKH
Subjt:  MFTRDCPLICMGESGQYTDSISLSFWGQDRVGSWKH

A0A5A7U2P4 Integrase catalytic domain-containing protein (e value: 5.1e-13, % identity: 67.31)
Query:  DCPLICMGESGQYTDSISLSFWGQDRVGSWKHNHTRWNSLIPDFEEVDECSL
        DCPLI   ESGQ  DS+ LSFWGQDRV SW+HNHTRWNSL+P F E+ + S+
Subjt:  DCPLICMGESGQYTDSISLSFWGQDRVGSWKHNHTRWNSLIPDFEEVDECSL

A0A5D3C3J6 Gag/pol protein (e value: 1.9e-15, % identity: 76)
Query:  MFTRDCPLICMGESGQYTDSISLSFWGQDRVGSWKHNHTRWNSLIPDFEE
        M+ RDCPL C GESGQ  DSI LSFWGQDRVGSW+HNHTRWNS IP F +
Subjt:  MFTRDCPLICMGESGQYTDSISLSFWGQDRVGSWKHNHTRWNSLIPDFEE

A0A5D3CBX3 Reverse transcriptase (e value: 2.2e-08, % identity: 80.56)
Query:  MFTRDCPLICMGESGQYTDSISLSFWGQDRVGSWKH
        M+ RDCPLIC GESGQ  DSI LSFWGQDRVG WKH
Subjt:  MFTRDCPLICMGESGQYTDSISLSFWGQDRVGSWKH

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found
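
The % identity values listed for the hits above can be related to the alignments themselves. In BLAST-style pairwise output, the middle line repeats the residue where query and subject are identical, shows `+` for a conservative substitution, and is blank otherwise. The sketch below counts the identical columns and divides by the alignment length. The function name `percent_identity` is hypothetical, and the example uses the first GenBank alignment from this page.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent of alignment columns where the midline repeats the query residue."""
    # Pad in case trailing blanks were stripped from the midline.
    midline = midline.ljust(len(query))
    identical = sum(
        1 for q, m in zip(query, midline) if m.isalpha() and m == q
    )
    return 100.0 * identical / len(query)


# First KAA0026081.1 alignment: query line and the match line beneath it.
query = "RDCPLICMGESGQYTDSISLSFWGQDRVGSWKHNHTRWNSLIPDF"
midline = "RDCPLIC GESGQ  DSI LSF GQDR+GSW+HN+TRWN L+P F"

identity = percent_identity(query, midline)
print(f"{identity:.2f}")  # 75.56
```

Here 34 of 45 columns are identical, which agrees with the 75.56 reported for that hit.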

Sequences
CDS sequence
ATGTTCACGAGGGATTGTCCTTTGATTTGTATGGGTGAGAGTGGTCAGTACACTGACTCAATAAGCCTATCATTTTGGGGACAAGACCGAGTGGGGAGCTGGAAACATAA
TCACACAAGATGGAATTCACTCATTCCGGACTTTGAGGAAGTAGATGAGTGTTCCCTTAAGTGGTGTCTCCGGGTCTTGAACAAAGGGCCCTACCCAGTCACTGACCCGA
GAGGGATTTCTATTTGGTGGTTAGACCACAAACAGCGTCGAGACGCCACAGACTTTAGTCTTCGTGTGAATTTTGCGTCGGCGTCGAGACGCCAGGGGCAGCGTCTCTAC
GGCGTCCCTAGTTCGAGCTTGCTGCTTTCTTCTCGGCCTTGTCTCGCTTCATCTCGGTATGGGGCTCCAATCTTCGGCTTTTGGTGTCGTTCTTCAGGACCCCTAGGGTG
GGGTGAAAGGCATCCCATTCATTCTCTCTCAGAGAATTCCCCTGAAGGCTCCCACCAGTCTTCTGCCACTAGAAGTCTCAGAGTCATACCGGTCCTAGTTGGTGATAAGC
TTGAAGTCAATGCAACTCTAGTAGCCAAGGAGTCGGAGCTCAACGCAGGACAGAAAAGGAAACACGAGCAGACAACTACCAACCTCCAGCGATCTCAACACTCATCCGAA
AGTTCTAGACAGAAAACTCAGCGTGACAAACAAGAGGGCAACGGTAACGATAAACCGAAGTGCAACTCTTGTGGAAGACAACATTGGGGTCAGTGCATGGCAGGGAAGGG
TGTGTGTTTTAAATGTCACCAGGAAGGGCATATGGCAAATTTTCAGCCAACAGCAGCCCTTGTTTCTTCTTCTCGGCCGGCAATCACAGCAGTCTTCATCTCCGGCGATC
CTTCTTCCGACGAGTGGTTGCACGGCGGCGGCGCTCGCGATTCCAACCGCAACCCTCAAGTTCAACAGCAGCAGTGTCTTCTTCGTGGTTCAGGCGGTAGCTTCGTTCGG
CAACTTGTGACAGCAAGCAGCGGCGCGGCAGACACGACTCACAGCGGGCATTCGCGGGTGTTGGCAGCGTCGGTTTCAGCAAGGTGCAGTGGCGCGTGTTCTTTTTCGAT
GAGCTCCCTTTAG
mRNA sequence
ATGTTCACGAGGGATTGTCCTTTGATTTGTATGGGTGAGAGTGGTCAGTACACTGACTCAATAAGCCTATCATTTTGGGGACAAGACCGAGTGGGGAGCTGGAAACATAA
TCACACAAGATGGAATTCACTCATTCCGGACTTTGAGGAAGTAGATGAGTGTTCCCTTAAGTGGTGTCTCCGGGTCTTGAACAAAGGGCCCTACCCAGTCACTGACCCGA
GAGGGATTTCTATTTGGTGGTTAGACCACAAACAGCGTCGAGACGCCACAGACTTTAGTCTTCGTGTGAATTTTGCGTCGGCGTCGAGACGCCAGGGGCAGCGTCTCTAC
GGCGTCCCTAGTTCGAGCTTGCTGCTTTCTTCTCGGCCTTGTCTCGCTTCATCTCGGTATGGGGCTCCAATCTTCGGCTTTTGGTGTCGTTCTTCAGGACCCCTAGGGTG
GGGTGAAAGGCATCCCATTCATTCTCTCTCAGAGAATTCCCCTGAAGGCTCCCACCAGTCTTCTGCCACTAGAAGTCTCAGAGTCATACCGGTCCTAGTTGGTGATAAGC
TTGAAGTCAATGCAACTCTAGTAGCCAAGGAGTCGGAGCTCAACGCAGGACAGAAAAGGAAACACGAGCAGACAACTACCAACCTCCAGCGATCTCAACACTCATCCGAA
AGTTCTAGACAGAAAACTCAGCGTGACAAACAAGAGGGCAACGGTAACGATAAACCGAAGTGCAACTCTTGTGGAAGACAACATTGGGGTCAGTGCATGGCAGGGAAGGG
TGTGTGTTTTAAATGTCACCAGGAAGGGCATATGGCAAATTTTCAGCCAACAGCAGCCCTTGTTTCTTCTTCTCGGCCGGCAATCACAGCAGTCTTCATCTCCGGCGATC
CTTCTTCCGACGAGTGGTTGCACGGCGGCGGCGCTCGCGATTCCAACCGCAACCCTCAAGTTCAACAGCAGCAGTGTCTTCTTCGTGGTTCAGGCGGTAGCTTCGTTCGG
CAACTTGTGACAGCAAGCAGCGGCGCGGCAGACACGACTCACAGCGGGCATTCGCGGGTGTTGGCAGCGTCGGTTTCAGCAAGGTGCAGTGGCGCGTGTTCTTTTTCGAT
GAGCTCCCTTTAG
Protein sequence
MFTRDCPLICMGESGQYTDSISLSFWGQDRVGSWKHNHTRWNSLIPDFEEVDECSLKWCLRVLNKGPYPVTDPRGISIWWLDHKQRRDATDFSLRVNFASASRRQGQRLY
GVPSSSLLLSSRPCLASSRYGAPIFGFWCRSSGPLGWGERHPIHSLSENSPEGSHQSSATRSLRVIPVLVGDKLEVNATLVAKESELNAGQKRKHEQTTTNLQRSQHSSE
SSRQKTQRDKQEGNGNDKPKCNSCGRQHWGQCMAGKGVCFKCHQEGHMANFQPTAALVSSSRPAITAVFISGDPSSDEWLHGGGARDSNRNPQVQQQQCLLRGSGGSFVR
QLVTASSGAADTTHSGHSRVLAASVSARCSGACSFSMSSL
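
The protein sequence above starts with the residues obtained by translating the CDS with the standard genetic code. The sketch below illustrates that relationship on the first 39 nucleotides of the CDS. The names `CODON_TABLE` and `translate` are hypothetical, and the use of the standard nuclear code is an assumption, since the record does not state which translation table was applied.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 39 nucleotides of the Spg002304 CDS.
prefix = "ATGTTCACGAGGGATTGTCCTTTGATTTGTATGGGTGAG"
print(translate(prefix))  # MFTRDCPLICMGE
```

Passing the full CDS (with line breaks left in) to `translate` should work the same way, because whitespace is removed before codons are read.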