; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg004592 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg004592
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionGag/pol protein
Genome locationscaffold9:8878837..8885673
RNA-Seq ExpressionSpg004592
SyntenySpg004592
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0026081.1 pol protein [Cucumis melo var. makuwa]7.1e-1177.78Show/hide
Query:  DYPLICMGESGQIADSISLPFWGQDRVGSWEHNNTRWNSLLPDFR
        D PLIC GESGQI DSI L F GQDR+GSWEHN TRWN LLP FR
Subjt:  DYPLICMGESGQIADSISLPFWGQDRVGSWEHNNTRWNSLLPDFR

KAA0047821.1 uncharacterized protein E6C27_scaffold133G00730 [Cucumis melo var. makuwa]7.1e-1177.78Show/hide
Query:  DYPLICMGESGQIADSISLPFWGQDRVGSWEHNNTRWNSLLPDFR
        D PLI   ESGQIADS+ L FWGQDRV SWEHN+TRWNSLLP FR
Subjt:  DYPLICMGESGQIADSISLPFWGQDRVGSWEHNNTRWNSLLPDFR

KAA0049822.1 reverse transcriptase [Cucumis melo var. makuwa]3.1e-0675Show/hide
Query:  MFATDYPLICMGESGQIADSISLPFWGQDRVGSWEH
        M+  D PLIC GESGQIADSI L FWGQDRVG W+H
Subjt:  MFATDYPLICMGESGQIADSISLPFWGQDRVGSWEH

TYK05792.1 gag/pol protein [Cucumis melo var. makuwa]4.4e-1373.47Show/hide
Query:  MFATDYPLICMGESGQIADSISLPFWGQDRVGSWEHNNTRWNSLLPDFR
        M+  D PL C GESGQI DSI L FWGQDRVGSWEHN+TRWNS +P FR
Subjt:  MFATDYPLICMGESGQIADSISLPFWGQDRVGSWEHNNTRWNSLLPDFR

TYK08698.1 reverse transcriptase [Cucumis melo var. makuwa]3.1e-0675Show/hide
Query:  MFATDYPLICMGESGQIADSISLPFWGQDRVGSWEH
        M+  D PLIC GESGQIADSI L FWGQDRVG W+H
Subjt:  MFATDYPLICMGESGQIADSISLPFWGQDRVGSWEH

TrEMBL top hitse value%identityAlignment
A0A5A7SPG1 Pol protein3.4e-1177.78Show/hide
Query:  DYPLICMGESGQIADSISLPFWGQDRVGSWEHNNTRWNSLLPDFR
        D PLIC GESGQI DSI L F GQDR+GSWEHN TRWN LLP FR
Subjt:  DYPLICMGESGQIADSISLPFWGQDRVGSWEHNNTRWNSLLPDFR

A0A5A7U202 Reverse transcriptase1.5e-0675Show/hide
Query:  MFATDYPLICMGESGQIADSISLPFWGQDRVGSWEH
        M+  D PLIC GESGQIADSI L FWGQDRVG W+H
Subjt:  MFATDYPLICMGESGQIADSISLPFWGQDRVGSWEH

A0A5A7U2P4 Integrase catalytic domain-containing protein3.4e-1177.78Show/hide
Query:  DYPLICMGESGQIADSISLPFWGQDRVGSWEHNNTRWNSLLPDFR
        D PLI   ESGQIADS+ L FWGQDRV SWEHN+TRWNSLLP FR
Subjt:  DYPLICMGESGQIADSISLPFWGQDRVGSWEHNNTRWNSLLPDFR

A0A5D3C3J6 Gag/pol protein2.1e-1373.47Show/hide
Query:  MFATDYPLICMGESGQIADSISLPFWGQDRVGSWEHNNTRWNSLLPDFR
        M+  D PL C GESGQI DSI L FWGQDRVGSWEHN+TRWNS +P FR
Subjt:  MFATDYPLICMGESGQIADSISLPFWGQDRVGSWEHNNTRWNSLLPDFR

A0A5D3CBX3 Reverse transcriptase1.5e-0675Show/hide
Query:  MFATDYPLICMGESGQIADSISLPFWGQDRVGSWEH
        M+  D PLIC GESGQIADSI L FWGQDRVG W+H
Subjt:  MFATDYPLICMGESGQIADSISLPFWGQDRVGSWEH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCGCAACGGATTATCCTTTGATTTGTATGGGTGAGAGTGGCCAGATCGCCGACTCAATAAGCTTACCATTTTGGGGACAAGACCGAGTGGGGAGCTGGGAACATAA
TAACACAAGATGGAATTCACTCCTTCCCGACTTTAGGTGCGAACACCACCACTATGGCTACACCGGTATGACTCTGAGACTTCTAGAGGCAGGAGACTTGTGGGAGCATT
TGGGAGAATTCTCTGAGAGTAGGAGAATGAATGGGATACCTTTCACCCCACTCAGGGGTCCTATATATAGTAAAGATATTGTTATAGGCAATTTTGGACCACCCGACGCA
CAAGGAGCAGACGAGGACGTCCGGGCAAAAATAGGGCTGGGAGACCGACCCAGAGGAAGATCCGACCAAAGGGCCGGGCCAACTTGGCCCGACCCATATGAGGCTGAGGT
CGACCATTCGGCTCGCTTGCGTGGGCTGAGCTCGGTCACCTCCTCTCGGTCCCTGATGCTTCTAGCCGCCCCGGTTTCGCCTAGTTTGTCCCGAAACGCCTCCGAATTCC
TAAAAACCCTAGGAGCACGAGCAGCATCGGAGGCGGTGTGGCTAGCACCACACCGGTGTGCAGGTTTTCTCTTTTGCAGGCCACGTCTTCCCCCGCTCTCAAACAAATTC
ACTGTCGATTATCACGAGGACGAGGACATCCGGACAGAAATAGGACTAGGAGACCGCCCCAGAGGAAGAACCGACCAAAGGGCTGGGCCAACTTGCCTTCTTTCGACCCC
TGATGCCTCTGGCCCCCCTGGTTCCGCCTAG
mRNA sequenceShow/hide mRNA sequence
ATGTTCGCAACGGATTATCCTTTGATTTGTATGGGTGAGAGTGGCCAGATCGCCGACTCAATAAGCTTACCATTTTGGGGACAAGACCGAGTGGGGAGCTGGGAACATAA
TAACACAAGATGGAATTCACTCCTTCCCGACTTTAGGTGCGAACACCACCACTATGGCTACACCGGTATGACTCTGAGACTTCTAGAGGCAGGAGACTTGTGGGAGCATT
TGGGAGAATTCTCTGAGAGTAGGAGAATGAATGGGATACCTTTCACCCCACTCAGGGGTCCTATATATAGTAAAGATATTGTTATAGGCAATTTTGGACCACCCGACGCA
CAAGGAGCAGACGAGGACGTCCGGGCAAAAATAGGGCTGGGAGACCGACCCAGAGGAAGATCCGACCAAAGGGCCGGGCCAACTTGGCCCGACCCATATGAGGCTGAGGT
CGACCATTCGGCTCGCTTGCGTGGGCTGAGCTCGGTCACCTCCTCTCGGTCCCTGATGCTTCTAGCCGCCCCGGTTTCGCCTAGTTTGTCCCGAAACGCCTCCGAATTCC
TAAAAACCCTAGGAGCACGAGCAGCATCGGAGGCGGTGTGGCTAGCACCACACCGGTGTGCAGGTTTTCTCTTTTGCAGGCCACGTCTTCCCCCGCTCTCAAACAAATTC
ACTGTCGATTATCACGAGGACGAGGACATCCGGACAGAAATAGGACTAGGAGACCGCCCCAGAGGAAGAACCGACCAAAGGGCTGGGCCAACTTGCCTTCTTTCGACCCC
TGATGCCTCTGGCCCCCCTGGTTCCGCCTAG
Protein sequenceShow/hide protein sequence
MFATDYPLICMGESGQIADSISLPFWGQDRVGSWEHNNTRWNSLLPDFRCEHHHYGYTGMTLRLLEAGDLWEHLGEFSESRRMNGIPFTPLRGPIYSKDIVIGNFGPPDA
QGADEDVRAKIGLGDRPRGRSDQRAGPTWPDPYEAEVDHSARLRGLSSVTSSRSLMLLAAPVSPSLSRNASEFLKTLGARAASEAVWLAPHRCAGFLFCRPRLPPLSNKF
TVDYHEDEDIRTEIGLGDRPRGRTDQRAGPTCLLSTPDASGPPGSA