CuGenDBv2

CaUC11G210560 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene ID: CaUC11G210560
Organism: Citrullus amarus (Watermelon (USVL246-FR2) v1)
Description: Reverse transcriptase
Genome location: Ciama_Chr11:22275654..22276022
RNA-Seq Expression: CaUC11G210560
Synteny: CaUC11G210560
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0071897 - DNA biosynthetic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003887 - DNA-directed DNA polymerase activity (molecular function)
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
WP_217833156.1 hypothetical protein, partial [Synechococcus sp. PCC 7002]; e value: 5.1e-57; identity: 92.62%
Query:  MDFEKAGEKRLLELNEMEEFRAQAYENAKLYKERTARWHDKKITPRTFLPGQRVLLFNSRLRLFPGKLRTRWSGPFIIVKVSPHGAVELQGNDGTTFKVN
        MDFEKAGEKRLLELNEMEEFRAQAYENAKLYK+RTARWHDKKITP TFLP QR+LLFNSRLRLFPGKLRTRWSGPFIIVKVSPHGAVELQGN+GTTFKVN
Subjt:  MDFEKAGEKRLLELNEMEEFRAQAYENAKLYKERTARWHDKKITPRTFLPGQRVLLFNSRLRLFPGKLRTRWSGPFIIVKVSPHGAVELQGNDGTTFKVN

Query:  GQRLKHYIGDEERGFENLAFIA
        G RLKHYIGDEER  ENLAF A
Subjt:  GQRLKHYIGDEERGFENLAFIA

XP_030498073.1 uncharacterized protein LOC115713732 [Cannabis sativa]; e value: 4.5e-37; identity: 67.57%
Query:  MDFEKAGEKRLLELNEMEEFRAQAYENAKLYKERTARWHDKKITPRTFLPGQRVLLFNSRLRLFPGKLRTRWSGPFIIVKVSPHGAVELQGNDGTTFKVN
        M+ + AGEKRLL+LNE+EEFR +AYENAK+YKERT +WHD+ +  + F PGQ+VLLFNSRL+LFPGKL++RWSGPF +VKV P+GAVEL+G+   TFKVN
Subjt:  MDFEKAGEKRLLELNEMEEFRAQAYENAKLYKERTARWHDKKITPRTFLPGQRVLLFNSRLRLFPGKLRTRWSGPFIIVKVSPHGAVELQGNDGTTFKVN

Query:  GQRLKHYIGDE
        GQRLK Y+G +
Subjt:  GQRLKHYIGDE

XP_030502743.1 uncharacterized protein LOC115717916 [Cannabis sativa]; e value: 5.9e-37; identity: 68.81%
Query:  MDFEKAGEKRLLELNEMEEFRAQAYENAKLYKERTARWHDKKITPRTFLPGQRVLLFNSRLRLFPGKLRTRWSGPFIIVKVSPHGAVELQGNDGTTFKVN
        MD + AG+KRLL+L+E+EEFR +AYENAK+YKERT RWHD+ +  + F PGQ+VLLFNSRL+LFPGKL++RWSGPF +VKV P+GAVEL+G    TFKVN
Subjt:  MDFEKAGEKRLLELNEMEEFRAQAYENAKLYKERTARWHDKKITPRTFLPGQRVLLFNSRLRLFPGKLRTRWSGPFIIVKVSPHGAVELQGNDGTTFKVN

Query:  GQRLKHYIG
        GQRLK Y+G
Subjt:  GQRLKHYIG

XP_038885822.1 uncharacterized protein LOC120076116 [Benincasa hispida]; e value: 4.2e-43; identity: 76.92%
Query:  EKAGEKRLLELNEMEEFRAQAYENAKLYKERTARWHDKKITPRTFLPGQRVLLFNSRLRLFPGKLRTRWSGPFIIVKVSPHGAVELQGNDGTTFKVNGQR
        +K GEKRLLEL E+EEF  QAYENAKLYKER ARWHDKKI   TF  GQ VLLFNSRLRLFP KLRTRW GPF++VK SPHGAVE+QG DG  FKVNGQR
Subjt:  EKAGEKRLLELNEMEEFRAQAYENAKLYKERTARWHDKKITPRTFLPGQRVLLFNSRLRLFPGKLRTRWSGPFIIVKVSPHGAVELQGNDGTTFKVNGQR

Query:  LKHYIGDEERGFENLAF
        L+HY GDEER  ENL F
Subjt:  LKHYIGDEERGFENLAF

XP_038885946.1 uncharacterized protein LOC120076251 [Benincasa hispida]; e value: 1.5e-40; identity: 73.87%
Query:  DFEKAGEKRLLELNEMEEFRAQAYENAKLYKERTARWHDKKITPRTFLPGQRVLLFNSRLRLFPGKLRTRWSGPFIIVKVSPHGAVELQGNDGTTFKVNG
        D + AG+ RLL+LNEMEEF+ QAYEN+K+YKERT +WHD  I PR FLPGQRVLLFNSRLRLFPGKL++RW GPF+I  V+P+ AVEL G DGTTFKVN 
Subjt:  DFEKAGEKRLLELNEMEEFRAQAYENAKLYKERTARWHDKKITPRTFLPGQRVLLFNSRLRLFPGKLRTRWSGPFIIVKVSPHGAVELQGNDGTTFKVNG

Query:  QRLKHYIGDEE
        QRLKHY GDEE
Subjt:  QRLKHYIGDEE

TrEMBL top hits (e value, % identity, alignment)
A0A1S4DFS8 uncharacterized protein LOC107829365; e value: 7.0e-36; identity: 68.75%
Query:  MDFEKAGEKRLLELNEMEEFRAQAYENAKLYKERTARWHDKKITPRTFLPGQRVLLFNSRLRLFPGKLRTRWSGPFIIVKVSPHGAVELQGNDGT-TFKV
        MD + AGEKRLL+LNE++EFR  AYENAKLYK +T RWHDK I  R F PGQ VLLFNSRL+LFPGKL++RWSGPF++V V PHGAVEL+    T TF V
Subjt:  MDFEKAGEKRLLELNEMEEFRAQAYENAKLYKERTARWHDKKITPRTFLPGQRVLLFNSRLRLFPGKLRTRWSGPFIIVKVSPHGAVELQGNDGT-TFKV

Query:  NGQRLKHYIGDE
        NGQR+KHY G +
Subjt:  NGQRLKHYIGDE

A0A1U7YP28 uncharacterized protein LOC104249065; e value: 7.0e-36; identity: 67.27%
Query:  MDFEKAGEKRLLELNEMEEFRAQAYENAKLYKERTARWHDKKITPRTFLPGQRVLLFNSRLRLFPGKLRTRWSGPFIIVKVSPHGAVELQG-NDGTTFKV
        MD + AGEK+L++LNE++EFR  +YENAKLYKE+T RWHDK I PR F PGQ+VLLFNSRLRLFPGKL++RWSGPF +V+V+P+GA+EL+  N G  F V
Subjt:  MDFEKAGEKRLLELNEMEEFRAQAYENAKLYKERTARWHDKKITPRTFLPGQRVLLFNSRLRLFPGKLRTRWSGPFIIVKVSPHGAVELQG-NDGTTFKV

Query:  NGQRLKHYIG
        NG R+KHY G
Subjt:  NGQRLKHYIG

A0A2G9H400 Reverse transcriptase; e value: 4.1e-36; identity: 70.64%
Query:  DFEKAGEKRLLELNEMEEFRAQAYENAKLYKERTARWHDKKITPRTFLPGQRVLLFNSRLRLFPGKLRTRWSGPFIIVKVSPHGAVELQG-NDGTTFKVN
        D + AGEKRLL+LNE++EFR QAYENAK+YKE+T RWHDKKI  R F PGQ VLLFNSRL+LFPGKL++RWSGPF I +V PHGAVEL+  N    FKVN
Subjt:  DFEKAGEKRLLELNEMEEFRAQAYENAKLYKERTARWHDKKITPRTFLPGQRVLLFNSRLRLFPGKLRTRWSGPFIIVKVSPHGAVELQG-NDGTTFKVN

Query:  GQRLKHYIG
         QR+KHY G
Subjt:  GQRLKHYIG

A0A6P4D1N6 uncharacterized protein LOC107484509; e value: 3.1e-36; identity: 66.96%
Query:  MDFEKAGEKRLLELNEMEEFRAQAYENAKLYKERTARWHDKKITPRTFLPGQRVLLFNSRLRLFPGKLRTRWSGPFIIVKVSPHGAVELQGN-DGTTFKV
        +D + AGEKRLL+LNE+EEFR +AYENA++YKER  RWHDK+I+ RTF PGQRVLLFNSRL++FPGKLR+RW+GP+ I+KVSPHG VEL       TF  
Subjt:  MDFEKAGEKRLLELNEMEEFRAQAYENAKLYKERTARWHDKKITPRTFLPGQRVLLFNSRLRLFPGKLRTRWSGPFIIVKVSPHGAVELQGN-DGTTFKV

Query:  NGQRLKHYIGDE
        NG R+KHY G E
Subjt:  NGQRLKHYIGDE

A0A6P6SHX4 uncharacterized protein LOC113691659; e value: 1.2e-35; identity: 66.96%
Query:  MDFEKAGEKRLLELNEMEEFRAQAYENAKLYKERTARWHDKKITPRTFLPGQRVLLFNSRLRLFPGKLRTRWSGPFIIVKVSPHGAVELQGNDGTTFKVN
        MDF  AGEKRLLELNE+EE R  AYENAK+YKE+   WHDK I P+ F  GQ VLLFNSRLRLFPGKL++RWSGPF + +V P+GAVE++G +G  FKVN
Subjt:  MDFEKAGEKRLLELNEMEEFRAQAYENAKLYKERTARWHDKKITPRTFLPGQRVLLFNSRLRLFPGKLRTRWSGPFIIVKVSPHGAVELQGNDGTTFKVN

Query:  GQRLKHYIGDEE
        GQRLK Y+  E+
Subjt:  GQRLKHYIGDEE

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found
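
The % identity values listed with the hits can be cross-checked against the alignment rows. The sketch below assumes the usual BLAST-style middle row, in which a letter marks an identical residue, "+" a conservative substitution, and a space a mismatch, and it assumes that % identity is the number of identical positions divided by the alignment length. The function name `percent_identity` is hypothetical; the rows are copied from the first GenBank hit (WP_217833156.1) with the leading indentation removed.

```python
def percent_identity(query_rows, midline_rows):
    """Return identities / alignment length as a percentage.

    Assumes a BLAST-style middle row: letters are identical residues,
    '+' and spaces are non-identical positions.
    """
    aligned_length = sum(len(row) for row in query_rows)
    identical = sum(ch.isalpha() for row in midline_rows for ch in row)
    return 100.0 * identical / aligned_length


# Query and middle rows of the WP_217833156.1 alignment (two blocks).
QUERY_ROWS = [
    "MDFEKAGEKRLLELNEMEEFRAQAYENAKLYKERTARWHDKKITPRTFLPGQRVLLFNSRLRLFPGKLRTRWSGPFIIVKVSPHGAVELQGNDGTTFKVN",
    "GQRLKHYIGDEERGFENLAFIA",
]
MIDLINE_ROWS = [
    "MDFEKAGEKRLLELNEMEEFRAQAYENAKLYK+RTARWHDKKITP TFLP QR+LLFNSRLRLFPGKLRTRWSGPFIIVKVSPHGAVELQGN+GTTFKVN",
    "G RLKHYIGDEER  ENLAF A",
]

print(f"{percent_identity(QUERY_ROWS, MIDLINE_ROWS):.2f}")  # 92.62
```

Under these assumptions the middle row holds 113 identical positions out of 122 aligned, which reproduces the 92.62 reported for this hit.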

Sequences
CDS sequence
ATGGATTTCGAGAAAGCCGGTGAGAAACGCCTCTTAGAACTCAATGAGATGGAGGAGTTCCGTGCTCAAGCTTATGAGAATGCCAAACTTTACAAAGAGCGCACTGCGAG
ATGGCATGACAAGAAGATCACCCCACGGACCTTCCTTCCAGGACAAAGAGTATTACTTTTTAACTCACGTTTACGCTTGTTTCCAGGAAAGCTTAGGACACGATGGTCGG
GACCCTTTATCATTGTCAAGGTATCCCCACACGGAGCCGTGGAACTACAAGGCAACGATGGAACAACCTTCAAAGTGAATGGTCAACGATTGAAGCACTACATCGGTGAT
GAAGAACGCGGATTTGAGAACCTGGCTTTCATTGCATGA
mRNA sequence
ATGGATTTCGAGAAAGCCGGTGAGAAACGCCTCTTAGAACTCAATGAGATGGAGGAGTTCCGTGCTCAAGCTTATGAGAATGCCAAACTTTACAAAGAGCGCACTGCGAG
ATGGCATGACAAGAAGATCACCCCACGGACCTTCCTTCCAGGACAAAGAGTATTACTTTTTAACTCACGTTTACGCTTGTTTCCAGGAAAGCTTAGGACACGATGGTCGG
GACCCTTTATCATTGTCAAGGTATCCCCACACGGAGCCGTGGAACTACAAGGCAACGATGGAACAACCTTCAAAGTGAATGGTCAACGATTGAAGCACTACATCGGTGAT
GAAGAACGCGGATTTGAGAACCTGGCTTTCATTGCATGA
Protein sequence
MDFEKAGEKRLLELNEMEEFRAQAYENAKLYKERTARWHDKKITPRTFLPGQRVLLFNSRLRLFPGKLRTRWSGPFIIVKVSPHGAVELQGNDGTTFKVNGQRLKHYIGD
EERGFENLAFIA
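
The three sequences and the genome location can be checked against each other. The sketch below is a minimal consistency check, not part of the database: it assumes the standard genetic code and assumes the genome coordinates are 1-based and inclusive. The names `translate`, `CODON_TABLE`, and `SPAN_LENGTH` are hypothetical.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# CDS and protein exactly as listed on this page, one literal per line.
CDS = (
    "ATGGATTTCGAGAAAGCCGGTGAGAAACGCCTCTTAGAACTCAATGAGATGGAGGAGTTCCGTGCTCAAGCTTATGAGAATGCCAAACTTTACAAAGAGCGCACTGCGAG"
    "ATGGCATGACAAGAAGATCACCCCACGGACCTTCCTTCCAGGACAAAGAGTATTACTTTTTAACTCACGTTTACGCTTGTTTCCAGGAAAGCTTAGGACACGATGGTCGG"
    "GACCCTTTATCATTGTCAAGGTATCCCCACACGGAGCCGTGGAACTACAAGGCAACGATGGAACAACCTTCAAAGTGAATGGTCAACGATTGAAGCACTACATCGGTGAT"
    "GAAGAACGCGGATTTGAGAACCTGGCTTTCATTGCATGA"
)
PROTEIN = (
    "MDFEKAGEKRLLELNEMEEFRAQAYENAKLYKERTARWHDKKITPRTFLPGQRVLLFNSRLRLFPGKLRTRWSGPFIIVKVSPHGAVELQGNDGTTFKVNGQRLKHYIGD"
    "EERGFENLAFIA"
)

# Ciama_Chr11:22275654..22276022, assumed 1-based inclusive.
SPAN_LENGTH = 22276022 - 22275654 + 1


def translate(cds):
    """Translate a CDS codon by codon; '*' marks a stop codon."""
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


print(len(CDS), SPAN_LENGTH)             # 369 369
print(translate(CDS) == PROTEIN + "*")   # True
```

The CDS length equals the genomic span (369 bp), and the mRNA sequence is identical to the CDS, which is consistent with a single-exon gene model without annotated UTRs. The translation reproduces the 122-residue protein followed by a TGA stop codon.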