; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg004707 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg004707
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionGag/pol protein
Genome locationscaffold5:19067324..19069412
RNA-Seq ExpressionSpg004707
SyntenySpg004707
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0026081.1 pol protein [Cucumis melo var. makuwa]4.0e-1276.6Show/hide
Query:  MRDCPLICMGESGQYAISISLPFWGQDRVGSWEHNRTRWNSLLPDFR
        MRDCPLIC GESGQ   SI L F GQDR+GSWEHN TRWN LLP FR
Subjt:  MRDCPLICMGESGQYAISISLPFWGQDRVGSWEHNRTRWNSLLPDFR

KAA0047821.1 uncharacterized protein E6C27_scaffold133G00730 [Cucumis melo var. makuwa]2.6e-1176.09Show/hide
Query:  DCPLICMGESGQYAISISLPFWGQDRVGSWEHNRTRWNSLLPDFRE
        DCPLI   ESGQ A S+ L FWGQDRV SWEHN TRWNSLLP FRE
Subjt:  DCPLICMGESGQYAISISLPFWGQDRVGSWEHNRTRWNSLLPDFRE

KAA0049822.1 reverse transcriptase [Cucumis melo var. makuwa]3.2e-0676.47Show/hide
Query:  MRDCPLICMGESGQYAISISLPFWGQDRVGSWEH
        +RDCPLIC GESGQ A SI L FWGQDRVG W+H
Subjt:  MRDCPLICMGESGQYAISISLPFWGQDRVGSWEH

TYK05792.1 gag/pol protein [Cucumis melo var. makuwa]2.7e-1370Show/hide
Query:  MRDCPLICMGESGQYAISISLPFWGQDRVGSWEHNRTRWNSLLPDFREAV
        +RDCPL C GESGQ   SI L FWGQDRVGSWEHN TRWNS +P FR+ +
Subjt:  MRDCPLICMGESGQYAISISLPFWGQDRVGSWEHNRTRWNSLLPDFREAV

TYK08698.1 reverse transcriptase [Cucumis melo var. makuwa]3.2e-0676.47Show/hide
Query:  MRDCPLICMGESGQYAISISLPFWGQDRVGSWEH
        +RDCPLIC GESGQ A SI L FWGQDRVG W+H
Subjt:  MRDCPLICMGESGQYAISISLPFWGQDRVGSWEH

TrEMBL top hitse value%identityAlignment
A0A5A7SPG1 Pol protein1.9e-1276.6Show/hide
Query:  MRDCPLICMGESGQYAISISLPFWGQDRVGSWEHNRTRWNSLLPDFR
        MRDCPLIC GESGQ   SI L F GQDR+GSWEHN TRWN LLP FR
Subjt:  MRDCPLICMGESGQYAISISLPFWGQDRVGSWEHNRTRWNSLLPDFR

A0A5A7U202 Reverse transcriptase1.6e-0676.47Show/hide
Query:  MRDCPLICMGESGQYAISISLPFWGQDRVGSWEH
        +RDCPLIC GESGQ A SI L FWGQDRVG W+H
Subjt:  MRDCPLICMGESGQYAISISLPFWGQDRVGSWEH

A0A5A7U2P4 Integrase catalytic domain-containing protein1.2e-1176.09Show/hide
Query:  DCPLICMGESGQYAISISLPFWGQDRVGSWEHNRTRWNSLLPDFRE
        DCPLI   ESGQ A S+ L FWGQDRV SWEHN TRWNSLLP FRE
Subjt:  DCPLICMGESGQYAISISLPFWGQDRVGSWEHNRTRWNSLLPDFRE

A0A5D3C3J6 Gag/pol protein1.3e-1370Show/hide
Query:  MRDCPLICMGESGQYAISISLPFWGQDRVGSWEHNRTRWNSLLPDFREAV
        +RDCPL C GESGQ   SI L FWGQDRVGSWEHN TRWNS +P FR+ +
Subjt:  MRDCPLICMGESGQYAISISLPFWGQDRVGSWEHNRTRWNSLLPDFREAV

A0A5D3CBX3 Reverse transcriptase1.6e-0676.47Show/hide
Query:  MRDCPLICMGESGQYAISISLPFWGQDRVGSWEH
        +RDCPLIC GESGQ A SI L FWGQDRVG W+H
Subjt:  MRDCPLICMGESGQYAISISLPFWGQDRVGSWEH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGGATTGTCCTTTGATTTGTATGGGTGAGAGTGGCCAATACGCCATCTCAATAAGCCTACCATTTTGGGGACAAGACCGAGTGGGGAGCTGGGAACATAATCGTAC
AAGATGGAATTCACTCCTTCCCGACTTTAGGGAAGCAGTTGGTGTTACGGACACTCGTGAAGGACTAGCTAGTCGATATTGGTCTATATCCGTGGACACAGAAAATATGT
CTGCAGAGAGAAGAGTGCAACTGTCCATTAGGTCCCACTGCGAAGAAATCAACAAGGTGTCGCTGCGTTTTCGTTCGTTGGAGCATCGTTGGCGAAGAACGGTCAAGTCT
ACAACGAAGATGAAATTTGGAGAAATCGCCGCTGAAATCAAGGCCAAAACCGAAGATCAAGCTGCTGTTTTTGCCCAGATTGAGCTCCAGACGCAACAGCGTTGGGACGC
TACTCCAAAAGTGTCTCGACGCTATCCCGATACCGCAGGTGCTCAGGTAAGGAAATCGCACAGCGTCAGGACGCTGCGTTGCAAGCGTCGGGACGCTGCGTTGCAAGCGT
CGGGACGCTGCGTTGCAAGCGTCGGGACGCTGCGTTGCAAGCGTCGGGACGCTGCGTTGCAAGCGTCTCGACGCTGTCACGATGACGGTGCAAATTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGGGATTGTCCTTTGATTTGTATGGGTGAGAGTGGCCAATACGCCATCTCAATAAGCCTACCATTTTGGGGACAAGACCGAGTGGGGAGCTGGGAACATAATCGTAC
AAGATGGAATTCACTCCTTCCCGACTTTAGGGAAGCAGTTGGTGTTACGGACACTCGTGAAGGACTAGCTAGTCGATATTGGTCTATATCCGTGGACACAGAAAATATGT
CTGCAGAGAGAAGAGTGCAACTGTCCATTAGGTCCCACTGCGAAGAAATCAACAAGGTGTCGCTGCGTTTTCGTTCGTTGGAGCATCGTTGGCGAAGAACGGTCAAGTCT
ACAACGAAGATGAAATTTGGAGAAATCGCCGCTGAAATCAAGGCCAAAACCGAAGATCAAGCTGCTGTTTTTGCCCAGATTGAGCTCCAGACGCAACAGCGTTGGGACGC
TACTCCAAAAGTGTCTCGACGCTATCCCGATACCGCAGGTGCTCAGGTAAGGAAATCGCACAGCGTCAGGACGCTGCGTTGCAAGCGTCGGGACGCTGCGTTGCAAGCGT
CGGGACGCTGCGTTGCAAGCGTCGGGACGCTGCGTTGCAAGCGTCGGGACGCTGCGTTGCAAGCGTCTCGACGCTGTCACGATGACGGTGCAAATTAG
Protein sequenceShow/hide protein sequence
MRDCPLICMGESGQYAISISLPFWGQDRVGSWEHNRTRWNSLLPDFREAVGVTDTREGLASRYWSISVDTENMSAERRVQLSIRSHCEEINKVSLRFRSLEHRWRRTVKS
TTKMKFGEIAAEIKAKTEDQAAVFAQIELQTQQRWDATPKVSRRYPDTAGAQVRKSHSVRTLRCKRRDAALQASGRCVASVGTLRCKRRDAALQASRRCHDDGAN