; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg002822 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg002822
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionGag/pol protein
Genome locationscaffold6:2721067..2727694
RNA-Seq ExpressionSpg002822
SyntenySpg002822
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0026081.1 pol protein [Cucumis melo var. makuwa]3.2e-1275Show/hide
Query:  RIRDCPLICMGESGQYADSISLPLWGQDRMGSWEHNRTRWNSLLPDFR
        R+RDCPLIC GESGQ  DSI L   GQDR+GSWEHN TRWN LLP FR
Subjt:  RIRDCPLICMGESGQYADSISLPLWGQDRMGSWEHNRTRWNSLLPDFR

KAA0047821.1 uncharacterized protein E6C27_scaffold133G00730 [Cucumis melo var. makuwa]2.1e-1160.71Show/hide
Query:  DCPLICMGESGQYADSISLPLWGQDRMGSWEHNRTRWNSLLPDFREADESGVTDTR
        DCPLI   ESGQ ADS+ L  WGQDR+ SWEHN TRWNSLLP FRE  +  + + +
Subjt:  DCPLICMGESGQYADSISLPLWGQDRMGSWEHNRTRWNSLLPDFREADESGVTDTR

KAA0049822.1 reverse transcriptase [Cucumis melo var. makuwa]3.5e-0676.47Show/hide
Query:  IRDCPLICMGESGQYADSISLPLWGQDRMGSWEH
        IRDCPLIC GESGQ ADSI L  WGQDR+G W+H
Subjt:  IRDCPLICMGESGQYADSISLPLWGQDRMGSWEH

TYK05792.1 gag/pol protein [Cucumis melo var. makuwa]6.5e-1372.92Show/hide
Query:  IRDCPLICMGESGQYADSISLPLWGQDRMGSWEHNRTRWNSLLPDFRE
        IRDCPL C GESGQ  DSI L  WGQDR+GSWEHN TRWNS +P FR+
Subjt:  IRDCPLICMGESGQYADSISLPLWGQDRMGSWEHNRTRWNSLLPDFRE

TYK08698.1 reverse transcriptase [Cucumis melo var. makuwa]3.5e-0676.47Show/hide
Query:  IRDCPLICMGESGQYADSISLPLWGQDRMGSWEH
        IRDCPLIC GESGQ ADSI L  WGQDR+G W+H
Subjt:  IRDCPLICMGESGQYADSISLPLWGQDRMGSWEH

TrEMBL top hitse value%identityAlignment
A0A5A7SPG1 Pol protein1.6e-1275Show/hide
Query:  RIRDCPLICMGESGQYADSISLPLWGQDRMGSWEHNRTRWNSLLPDFR
        R+RDCPLIC GESGQ  DSI L   GQDR+GSWEHN TRWN LLP FR
Subjt:  RIRDCPLICMGESGQYADSISLPLWGQDRMGSWEHNRTRWNSLLPDFR

A0A5A7U202 Reverse transcriptase1.7e-0676.47Show/hide
Query:  IRDCPLICMGESGQYADSISLPLWGQDRMGSWEH
        IRDCPLIC GESGQ ADSI L  WGQDR+G W+H
Subjt:  IRDCPLICMGESGQYADSISLPLWGQDRMGSWEH

A0A5A7U2P4 Integrase catalytic domain-containing protein1.0e-1160.71Show/hide
Query:  DCPLICMGESGQYADSISLPLWGQDRMGSWEHNRTRWNSLLPDFREADESGVTDTR
        DCPLI   ESGQ ADS+ L  WGQDR+ SWEHN TRWNSLLP FRE  +  + + +
Subjt:  DCPLICMGESGQYADSISLPLWGQDRMGSWEHNRTRWNSLLPDFREADESGVTDTR

A0A5D3C3J6 Gag/pol protein3.2e-1372.92Show/hide
Query:  IRDCPLICMGESGQYADSISLPLWGQDRMGSWEHNRTRWNSLLPDFRE
        IRDCPL C GESGQ  DSI L  WGQDR+GSWEHN TRWNS +P FR+
Subjt:  IRDCPLICMGESGQYADSISLPLWGQDRMGSWEHNRTRWNSLLPDFRE

A0A5D3CBX3 Reverse transcriptase1.7e-0676.47Show/hide
Query:  IRDCPLICMGESGQYADSISLPLWGQDRMGSWEH
        IRDCPLIC GESGQ ADSI L  WGQDR+G W+H
Subjt:  IRDCPLICMGESGQYADSISLPLWGQDRMGSWEH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTCTTGAACTATATGTAGCGACTGAAGGATTGGAATTCCATAATTCCGTAAAGCGGAAGCGTGATTGGGACCGATTCTGTGAACACCACCACTATGGCTACACCGG
TATGACTCTGAGACTTCTAGAGGCAGGAGACTTGTGGGAGCCTTTGGGAGAATTCTCTGAGAAAGTTCCTCTCGGGCCAGGAGAGGACGGCGCGCCTTTGTTCAATCCCC
GGAATCAGCCCTTAAGGGAACACACATCTACTTATCCCAATAGGGGAAGGAGTGAATTCCATCTTGTACTGTTATGTTCCCAGCCTCCATTCGGTCTTGCCCCTGAAATG
GATACCCCCACCCGCATGTCTCCTACATGGATGCTCTGGATCATTGCATCTGTATCGAATACAAGGTGGGCCGTATCACATAGTGTCACCAGGATAAGAGATTGTCCTTT
GATTTGTATGGGTGAGAGTGGCCAATACGCCGACTCAATAAGCCTACCACTTTGGGGACAAGACCGAATGGGGAGCTGGGAACATAATCGTACAAGATGGAATTCACTCC
TTCCCGACTTTAGGGAAGCAGATGAGTCTGGTGTTACGGACACTCGTGAAGGACTAACTAGTCGATATTGGTCTATATCCGTGGACACAGAAAATATGTCTGCAGTGAGA
AGAGTGCAACTATCCATTAGGTCCCACTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGTCTTGAACTATATGTAGCGACTGAAGGATTGGAATTCCATAATTCCGTAAAGCGGAAGCGTGATTGGGACCGATTCTGTGAACACCACCACTATGGCTACACCGG
TATGACTCTGAGACTTCTAGAGGCAGGAGACTTGTGGGAGCCTTTGGGAGAATTCTCTGAGAAAGTTCCTCTCGGGCCAGGAGAGGACGGCGCGCCTTTGTTCAATCCCC
GGAATCAGCCCTTAAGGGAACACACATCTACTTATCCCAATAGGGGAAGGAGTGAATTCCATCTTGTACTGTTATGTTCCCAGCCTCCATTCGGTCTTGCCCCTGAAATG
GATACCCCCACCCGCATGTCTCCTACATGGATGCTCTGGATCATTGCATCTGTATCGAATACAAGGTGGGCCGTATCACATAGTGTCACCAGGATAAGAGATTGTCCTTT
GATTTGTATGGGTGAGAGTGGCCAATACGCCGACTCAATAAGCCTACCACTTTGGGGACAAGACCGAATGGGGAGCTGGGAACATAATCGTACAAGATGGAATTCACTCC
TTCCCGACTTTAGGGAAGCAGATGAGTCTGGTGTTACGGACACTCGTGAAGGACTAACTAGTCGATATTGGTCTATATCCGTGGACACAGAAAATATGTCTGCAGTGAGA
AGAGTGCAACTATCCATTAGGTCCCACTGA
Protein sequenceShow/hide protein sequence
MSLELYVATEGLEFHNSVKRKRDWDRFCEHHHYGYTGMTLRLLEAGDLWEPLGEFSEKVPLGPGEDGAPLFNPRNQPLREHTSTYPNRGRSEFHLVLLCSQPPFGLAPEM
DTPTRMSPTWMLWIIASVSNTRWAVSHSVTRIRDCPLICMGESGQYADSISLPLWGQDRMGSWEHNRTRWNSLLPDFREADESGVTDTREGLTSRYWSISVDTENMSAVR
RVQLSIRSH