; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg000203 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg000203
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionReverse transcriptase
Genome locationscaffold6:23977978..23991580
RNA-Seq ExpressionSpg000203
SyntenySpg000203
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0058280.1 uncharacterized protein E6C27_scaffold274G006090 [Cucumis melo var. makuwa]1.7e-3152.85Show/hide
Query:  VNLWCFSRFEL-HRRDQESTGFSSKAGGSLVFSDVLSILGI---TEKLESVLSDFGAFLDKLV------GHLVFISSDIEPEAIWYRVHTGYIPLSTLSV
        VNLW F  F L +    +STGFSSK GG LVFSD LSIL I    +KL  VL    + L+ L         L   + +++   I+        PLSTLSV
Subjt:  VNLWCFSRFEL-HRRDQESTGFSSKAGGSLVFSDVLSILGI---TEKLESVLSDFGAFLDKLV------GHLVFISSDIEPEAIWYRVHTGYIPLSTLSV

Query:  LRDNDAVVEIELPVPDTLPTSAESSRSSSSTWLELYIESV------TFQTR--------FSVSIGSTGIVRGDDVCWLHVVSRAKLAGGPGGG
        LRDN+ V  IELPVPD LPTSAESS S+SSTWLELY ESV      +F  +        F+VS+GSTGIVRGDDVCWLH V RAK  GGPGGG
Subjt:  LRDNDAVVEIELPVPDTLPTSAESSRSSSSTWLELYIESV------TFQTR--------FSVSIGSTGIVRGDDVCWLHVVSRAKLAGGPGGG

KAA0062409.1 reverse transcriptase [Cucumis melo var. makuwa]2.4e-3355.56Show/hide
Query:  TGFSSKAGGSLVFSDVLSILGITEKLESVLSDF-----GAFLDKLVGHLVFI-SSDIEPEAIWYRVHTGYIPLSTLSVLRDNDAVVEIELPVPDTLPTSA
        TGFSSK GG LVFSD LSIL + +  + +  D      G +   L G+   + S +++   I+        PLSTLSVLRDNDAVVEIELPVPDTLPTSA
Subjt:  TGFSSKAGGSLVFSDVLSILGITEKLESVLSDF-----GAFLDKLVGHLVFI-SSDIEPEAIWYRVHTGYIPLSTLSVLRDNDAVVEIELPVPDTLPTSA

Query:  ESSRSSSSTWLELYIESV--------------TFQTRFSVSIGSTGIVRGDDVCWLHVVSRAKLAGGPGGG
        ESS S+SSTWLELY ESV                   FSV +GSTGIVRGDDVCWLH V RAK  GGPGGG
Subjt:  ESSRSSSSTWLELYIESV--------------TFQTRFSVSIGSTGIVRGDDVCWLHVVSRAKLAGGPGGG

TYK01618.1 uncharacterized protein E5676_scaffold451G002250 [Cucumis melo var. makuwa]5.8e-3253.51Show/hide
Query:  VNLW--CFSRFELHRRDQESTGFSSKAGGSLVFSDVLSILGITEKLESVLSDFGAFLDKLVGHLVFISSDIEPEAIWYRVHTGYIPLSTLSVLRDNDAVV
        VNLW   FS         ES GFSSK GG LVFSD LSIL   E L+S    +  +L+ L   L     +           T   PLSTLSVLRDNDAVV
Subjt:  VNLW--CFSRFELHRRDQESTGFSSKAGGSLVFSDVLSILGITEKLESVLSDFGAFLDKLVGHLVFISSDIEPEAIWYRVHTGYIPLSTLSVLRDNDAVV

Query:  EIELPVPDTLPTSAESSRSSSSTWLELYIESV--------------TFQTRFSVSIGSTGIVRGDDVCWLHVVSRAKLAGGPGGG
        EIEL VP+TLPTSAESS S+SSTWLELY ESV                   F VS+G TGIVRG DVCWLH V +AK AGGPGGG
Subjt:  EIELPVPDTLPTSAESSRSSSSTWLELYIESV--------------TFQTRFSVSIGSTGIVRGDDVCWLHVVSRAKLAGGPGGG

TYK08801.1 hypothetical protein E5676_scaffold796G00040 [Cucumis melo var. makuwa]1.4e-3343.8Show/hide
Query:  EHFRERVKEEEEEEFLGLRLKKIGKSSCKASHCRICVQPVNLWCFSRFELHRRDQESTGFSSKAGGSLVFSDVLSILGITEKLESVLSDF----------
        EHF ERVK EEEEEF GLRLKKIGKSSCKA+HCRI       W             STGFSSK  G LVFSD LSIL + +  + +  D           
Subjt:  EHFRERVKEEEEEEFLGLRLKKIGKSSCKASHCRICVQPVNLWCFSRFELHRRDQESTGFSSKAGGSLVFSDVLSILGITEKLESVLSDF----------

Query:  ---GAFLDKLVGHL--VFISSDIEPEAIWYRVH-----TGYIPLSTLSVLRDNDAVV----------------------------------------EIE
           G       G L  + IS D   E +  R++         PLSTLSVLRDNDAVV                                        EIE
Subjt:  ---GAFLDKLVGHL--VFISSDIEPEAIWYRVH-----TGYIPLSTLSVLRDNDAVV----------------------------------------EIE

Query:  LPVPDTLPTSAESSRSSSSTWLELYIE------SVTFQTRFSVSIGSTGIVRGDDVCWLHVVSRAKLAGGPGGG
        L VPDTLPTSAESS S+SSTWLE   E       V   +R    + S GIVR +DVCWLH V RAK+AGG GGG
Subjt:  LPVPDTLPTSAESSRSSSSTWLELYIE------SVTFQTRFSVSIGSTGIVRGDDVCWLHVVSRAKLAGGPGGG

TYK26572.1 putative Retrotransposon protein [Cucumis melo var. makuwa]2.4e-3355.56Show/hide
Query:  TGFSSKAGGSLVFSDVLSILGITEKLESVLSDF-----GAFLDKLVGHLVFI-SSDIEPEAIWYRVHTGYIPLSTLSVLRDNDAVVEIELPVPDTLPTSA
        TGFSSK GG LVFSD LSIL + +  + +  D      G +   L G+   + S +++   I+        PLSTLSVLRDNDAVVEIELPVPDTLPTSA
Subjt:  TGFSSKAGGSLVFSDVLSILGITEKLESVLSDF-----GAFLDKLVGHLVFI-SSDIEPEAIWYRVHTGYIPLSTLSVLRDNDAVVEIELPVPDTLPTSA

Query:  ESSRSSSSTWLELYIESV--------------TFQTRFSVSIGSTGIVRGDDVCWLHVVSRAKLAGGPGGG
        ESS S+SSTWLELY ESV                   FSV +GSTGIVRGDDVCWLH V RAK  GGPGGG
Subjt:  ESSRSSSSTWLELYIESV--------------TFQTRFSVSIGSTGIVRGDDVCWLHVVSRAKLAGGPGGG

TrEMBL top hitse value%identityAlignment
A0A5A7UT17 CCHC-type domain-containing protein8.2e-3252.85Show/hide
Query:  VNLWCFSRFEL-HRRDQESTGFSSKAGGSLVFSDVLSILGI---TEKLESVLSDFGAFLDKLV------GHLVFISSDIEPEAIWYRVHTGYIPLSTLSV
        VNLW F  F L +    +STGFSSK GG LVFSD LSIL I    +KL  VL    + L+ L         L   + +++   I+        PLSTLSV
Subjt:  VNLWCFSRFEL-HRRDQESTGFSSKAGGSLVFSDVLSILGI---TEKLESVLSDFGAFLDKLV------GHLVFISSDIEPEAIWYRVHTGYIPLSTLSV

Query:  LRDNDAVVEIELPVPDTLPTSAESSRSSSSTWLELYIESV------TFQTR--------FSVSIGSTGIVRGDDVCWLHVVSRAKLAGGPGGG
        LRDN+ V  IELPVPD LPTSAESS S+SSTWLELY ESV      +F  +        F+VS+GSTGIVRGDDVCWLH V RAK  GGPGGG
Subjt:  LRDNDAVVEIELPVPDTLPTSAESSRSSSSTWLELYIESV------TFQTR--------FSVSIGSTGIVRGDDVCWLHVVSRAKLAGGPGGG

A0A5A7V9N3 Reverse transcriptase1.1e-3355.56Show/hide
Query:  TGFSSKAGGSLVFSDVLSILGITEKLESVLSDF-----GAFLDKLVGHLVFI-SSDIEPEAIWYRVHTGYIPLSTLSVLRDNDAVVEIELPVPDTLPTSA
        TGFSSK GG LVFSD LSIL + +  + +  D      G +   L G+   + S +++   I+        PLSTLSVLRDNDAVVEIELPVPDTLPTSA
Subjt:  TGFSSKAGGSLVFSDVLSILGITEKLESVLSDF-----GAFLDKLVGHLVFI-SSDIEPEAIWYRVHTGYIPLSTLSVLRDNDAVVEIELPVPDTLPTSA

Query:  ESSRSSSSTWLELYIESV--------------TFQTRFSVSIGSTGIVRGDDVCWLHVVSRAKLAGGPGGG
        ESS S+SSTWLELY ESV                   FSV +GSTGIVRGDDVCWLH V RAK  GGPGGG
Subjt:  ESSRSSSSTWLELYIESV--------------TFQTRFSVSIGSTGIVRGDDVCWLHVVSRAKLAGGPGGG

A0A5D3BRI2 Uncharacterized protein2.8e-3253.51Show/hide
Query:  VNLW--CFSRFELHRRDQESTGFSSKAGGSLVFSDVLSILGITEKLESVLSDFGAFLDKLVGHLVFISSDIEPEAIWYRVHTGYIPLSTLSVLRDNDAVV
        VNLW   FS         ES GFSSK GG LVFSD LSIL   E L+S    +  +L+ L   L     +           T   PLSTLSVLRDNDAVV
Subjt:  VNLW--CFSRFELHRRDQESTGFSSKAGGSLVFSDVLSILGITEKLESVLSDFGAFLDKLVGHLVFISSDIEPEAIWYRVHTGYIPLSTLSVLRDNDAVV

Query:  EIELPVPDTLPTSAESSRSSSSTWLELYIESV--------------TFQTRFSVSIGSTGIVRGDDVCWLHVVSRAKLAGGPGGG
        EIEL VP+TLPTSAESS S+SSTWLELY ESV                   F VS+G TGIVRG DVCWLH V +AK AGGPGGG
Subjt:  EIELPVPDTLPTSAESSRSSSSTWLELYIESV--------------TFQTRFSVSIGSTGIVRGDDVCWLHVVSRAKLAGGPGGG

A0A5D3CAC6 Uncharacterized protein6.7e-3443.8Show/hide
Query:  EHFRERVKEEEEEEFLGLRLKKIGKSSCKASHCRICVQPVNLWCFSRFELHRRDQESTGFSSKAGGSLVFSDVLSILGITEKLESVLSDF----------
        EHF ERVK EEEEEF GLRLKKIGKSSCKA+HCRI       W             STGFSSK  G LVFSD LSIL + +  + +  D           
Subjt:  EHFRERVKEEEEEEFLGLRLKKIGKSSCKASHCRICVQPVNLWCFSRFELHRRDQESTGFSSKAGGSLVFSDVLSILGITEKLESVLSDF----------

Query:  ---GAFLDKLVGHL--VFISSDIEPEAIWYRVH-----TGYIPLSTLSVLRDNDAVV----------------------------------------EIE
           G       G L  + IS D   E +  R++         PLSTLSVLRDNDAVV                                        EIE
Subjt:  ---GAFLDKLVGHL--VFISSDIEPEAIWYRVH-----TGYIPLSTLSVLRDNDAVV----------------------------------------EIE

Query:  LPVPDTLPTSAESSRSSSSTWLELYIE------SVTFQTRFSVSIGSTGIVRGDDVCWLHVVSRAKLAGGPGGG
        L VPDTLPTSAESS S+SSTWLE   E       V   +R    + S GIVR +DVCWLH V RAK+AGG GGG
Subjt:  LPVPDTLPTSAESSRSSSSTWLELYIE------SVTFQTRFSVSIGSTGIVRGDDVCWLHVVSRAKLAGGPGGG

A0A5D3DSY7 Putative Retrotransposon protein1.1e-3355.56Show/hide
Query:  TGFSSKAGGSLVFSDVLSILGITEKLESVLSDF-----GAFLDKLVGHLVFI-SSDIEPEAIWYRVHTGYIPLSTLSVLRDNDAVVEIELPVPDTLPTSA
        TGFSSK GG LVFSD LSIL + +  + +  D      G +   L G+   + S +++   I+        PLSTLSVLRDNDAVVEIELPVPDTLPTSA
Subjt:  TGFSSKAGGSLVFSDVLSILGITEKLESVLSDF-----GAFLDKLVGHLVFI-SSDIEPEAIWYRVHTGYIPLSTLSVLRDNDAVVEIELPVPDTLPTSA

Query:  ESSRSSSSTWLELYIESV--------------TFQTRFSVSIGSTGIVRGDDVCWLHVVSRAKLAGGPGGG
        ESS S+SSTWLELY ESV                   FSV +GSTGIVRGDDVCWLH V RAK  GGPGGG
Subjt:  ESSRSSSSTWLELYIESV--------------TFQTRFSVSIGSTGIVRGDDVCWLHVVSRAKLAGGPGGG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCAAAGCACGAGGAGGGTATGAATATTTCATCAGTTTCATTGATGATTATTCAAGAGCACTTTAGGGAGAGAGTGAAGGAGGAGGAAGAAGAAGAATTTCTTGGTTT
GAGGTTGAAGAAGATAGGAAAAAGCTCATGCAAAGCCAGCCATTGCAGAATCTGTGTTCAACCTGTCAACCTCTGGTGTTTTAGTCGGTTTGAGCTACACAGAAGGGATC
AAGAGAGCACAGGATTTTCGAGCAAAGCAAGAGGATCTCTAGTTTTCTCTGATGTTTTGAGCATTCTGGGGCCAAGAGGAAATAGGTCGAGTCTACAAACCGGGGAACTA
GCTAAGACAGTAGAGATCGAGCTCCCGGTGCCTGATACACTGCCAACGTCTGCTGAAAGTTCTAGATCAAGCTCCAGTTACTTTCCAGACCAGGTTATGTCAGGGCTGCA
CCATGTGTTCAGCGTTTCAATAGGGTCAACAGGGATCGTTAGAGGTGACGATGTCTGTTGGCTTCACGCCGTCTCTCGGGCTAAGCTAGCAGGTGGTCCGGGAGGGGAGC
ACTTTAGGGAGAGAGTGAAGGAGGAGGAAGAAGAAGAATTTCTTGGTTTGAGGTTGAAGAAGATAGGAAAAAGCTCATGCAAAGCCAGCCATTGCAGAATCTGTGTTCAA
CCTGTCAACCTCTGGTGTTTTAGTCGGTTTGAGCTACACAGAAGGGATCAAGAGAGCACAGGATTTTCGAGCAAAGCAGGAGGATCTCTAGTTTTCTCTGATGTTTTGAG
CATTCTGGGGATAACAGAGAAGCTCGAATCTGTGTTGTCTGATTTCGGGGCATTTCTGGACAAATTGGTTGGGCATCTTGTTTTCATTTCTTCAGATATTGAGCCCGAGG
CTATATGGTACCGTGTGCACACAGGTTATATTCCGTTGTCGACGTTGAGTGTACTCCGTGACAACGATGCTGTCGTAGAGATCGAGCTCCCGGTGCCTGATACACTGCCA
ACGTCTGCTGAAAGTTCTAGATCAAGCTCCAGTACGTGGTTGGAGTTGTATATTGAGTCCGTTACTTTCCAGACCAGGTTCAGCGTTTCAATAGGGTCAACAGGGATCGT
TAGAGGTGACGATGTCTGTTGGCTTCACGTCGTCTCTCGGGCTAAGCTAGCAGGTGGTCCGGGAGGGGGAAAAGGTCTTGGCCTTTACCTATATATAGGACCCCTGGGGT
GGGGTGAAAGGGATCCCATTGTCTCTCCTTCTCTCAGAGATTTCTCTCAAAAGACTCCCACAAGTCTCATGCTTCAAGAAGTCTTAGAGATATACCGGTGTAGCCGATTG
ATGGTGTTCGCACAAGAGGGTTGCTGCGTTTTCAATCTTATTGGCAAGAAAAGGCGAATTGATCAAGGCTTTCTACAAAAGTATGTTTTCTATGAAATCCTTGATCACTA
G
mRNA sequenceShow/hide mRNA sequence
ATGTCAAAGCACGAGGAGGGTATGAATATTTCATCAGTTTCATTGATGATTATTCAAGAGCACTTTAGGGAGAGAGTGAAGGAGGAGGAAGAAGAAGAATTTCTTGGTTT
GAGGTTGAAGAAGATAGGAAAAAGCTCATGCAAAGCCAGCCATTGCAGAATCTGTGTTCAACCTGTCAACCTCTGGTGTTTTAGTCGGTTTGAGCTACACAGAAGGGATC
AAGAGAGCACAGGATTTTCGAGCAAAGCAAGAGGATCTCTAGTTTTCTCTGATGTTTTGAGCATTCTGGGGCCAAGAGGAAATAGGTCGAGTCTACAAACCGGGGAACTA
GCTAAGACAGTAGAGATCGAGCTCCCGGTGCCTGATACACTGCCAACGTCTGCTGAAAGTTCTAGATCAAGCTCCAGTTACTTTCCAGACCAGGTTATGTCAGGGCTGCA
CCATGTGTTCAGCGTTTCAATAGGGTCAACAGGGATCGTTAGAGGTGACGATGTCTGTTGGCTTCACGCCGTCTCTCGGGCTAAGCTAGCAGGTGGTCCGGGAGGGGAGC
ACTTTAGGGAGAGAGTGAAGGAGGAGGAAGAAGAAGAATTTCTTGGTTTGAGGTTGAAGAAGATAGGAAAAAGCTCATGCAAAGCCAGCCATTGCAGAATCTGTGTTCAA
CCTGTCAACCTCTGGTGTTTTAGTCGGTTTGAGCTACACAGAAGGGATCAAGAGAGCACAGGATTTTCGAGCAAAGCAGGAGGATCTCTAGTTTTCTCTGATGTTTTGAG
CATTCTGGGGATAACAGAGAAGCTCGAATCTGTGTTGTCTGATTTCGGGGCATTTCTGGACAAATTGGTTGGGCATCTTGTTTTCATTTCTTCAGATATTGAGCCCGAGG
CTATATGGTACCGTGTGCACACAGGTTATATTCCGTTGTCGACGTTGAGTGTACTCCGTGACAACGATGCTGTCGTAGAGATCGAGCTCCCGGTGCCTGATACACTGCCA
ACGTCTGCTGAAAGTTCTAGATCAAGCTCCAGTACGTGGTTGGAGTTGTATATTGAGTCCGTTACTTTCCAGACCAGGTTCAGCGTTTCAATAGGGTCAACAGGGATCGT
TAGAGGTGACGATGTCTGTTGGCTTCACGTCGTCTCTCGGGCTAAGCTAGCAGGTGGTCCGGGAGGGGGAAAAGGTCTTGGCCTTTACCTATATATAGGACCCCTGGGGT
GGGGTGAAAGGGATCCCATTGTCTCTCCTTCTCTCAGAGATTTCTCTCAAAAGACTCCCACAAGTCTCATGCTTCAAGAAGTCTTAGAGATATACCGGTGTAGCCGATTG
ATGGTGTTCGCACAAGAGGGTTGCTGCGTTTTCAATCTTATTGGCAAGAAAAGGCGAATTGATCAAGGCTTTCTACAAAAGTATGTTTTCTATGAAATCCTTGATCACTA
G
Protein sequenceShow/hide protein sequence
MSKHEEGMNISSVSLMIIQEHFRERVKEEEEEEFLGLRLKKIGKSSCKASHCRICVQPVNLWCFSRFELHRRDQESTGFSSKARGSLVFSDVLSILGPRGNRSSLQTGEL
AKTVEIELPVPDTLPTSAESSRSSSSYFPDQVMSGLHHVFSVSIGSTGIVRGDDVCWLHAVSRAKLAGGPGGEHFRERVKEEEEEEFLGLRLKKIGKSSCKASHCRICVQ
PVNLWCFSRFELHRRDQESTGFSSKAGGSLVFSDVLSILGITEKLESVLSDFGAFLDKLVGHLVFISSDIEPEAIWYRVHTGYIPLSTLSVLRDNDAVVEIELPVPDTLP
TSAESSRSSSSTWLELYIESVTFQTRFSVSIGSTGIVRGDDVCWLHVVSRAKLAGGPGGGKGLGLYLYIGPLGWGERDPIVSPSLRDFSQKTPTSLMLQEVLEIYRCSRL
MVFAQEGCCVFNLIGKKRRIDQGFLQKYVFYEILDH