CuGenDBv2

ClCG09G015555 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG09G015555
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: Retrotransposon protein
Genome location: CG_Chr09:30232937..30234403
RNA-Seq Expression: ClCG09G015555
Synteny: ClCG09G015555
Gene Ontology terms: GO:0046872 - metal ion binding (molecular function)
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
ADN34114.1 retrotransposon protein [Cucumis melo subsp. melo] | e value: 7.5e-49 | % identity: 36.54
Query:  SIGCLEPTRCVDIEEMVAIFLHILAHDVKNRVVQRNFSRSGEMVSRYFNAILKAVLQFHIILLKKPEQIANICTDERWKWFQNCLGALDDSYIKVNVSAV
        +I  L  T  VD+EEMVA+FLHILAHDVK+RV++R F RSGE +SR+FN +L AV++ H  LLKKP+ + N CTD+RW+WF+NCLGALD +YIKVNV A 
Subjt:  SIGCLEPTRCVDIEEMVAIFLHILAHDVKNRVVQRNFSRSGEMVSRYFNAILKAVLQFHIILLKKPEQIANICTDERWKWFQNCLGALDDSYIKVNVSAV

Query:  DQRRYRMRKGEIATNILAVCDQKGEFVFVFPG--------------------------------------------------------------------
        D+ RYR RKGE+ATN+L VCD KG+FV+V  G                                                                    
Subjt:  DQRRYRMRKGEIATNILAVCDQKGEFVFVFPG--------------------------------------------------------------------

Query:  -NMR----------------------------------------------IGAMLDAPDYENSASVEVDEDH-------IEFVESSNELTKFRNDLAVEI
         NM+                                              I   +   D E++   EVD  H       I ++E+SNE +++R++LA E 
Subjt:  -NMR----------------------------------------------IGAMLDAPDYENSASVEVDEDH-------IEFVESSNELTKFRNDLAVEI

Query:  FIECIMVGNSKRTKHVWSKVEDAKLVEALLYLVEI-DWRSDNGMFQPGYLQHL
            IM  +S+  KH W+K E+A LVE L+ LV    WRSDNG F+PGYL  L
Subjt:  FIECIMVGNSKRTKHVWSKVEDAKLVEALLYLVEI-DWRSDNGMFQPGYLQHL

KAA0036474.1 retrotransposon protein [Cucumis melo var. makuwa] | e value: 6.4e-48 | % identity: 36.34
Query:  MVRSIGCLEPTRCVDIEEMVAIFLHILAHDVKNRVVQRNFSRSGEMVSRYFNAILKAVLQFHIILLKKPEQIANICTDERWKWFQNCLGALDDSYIKVNV
        ++R++  L  T  VD+EEMVA+FLHI AHDVKNRV+QR F RSGE VSR+FN +L AVL+ +  L+K+P  + + C D+RWK F+NCLGALD +YIKVNV
Subjt:  MVRSIGCLEPTRCVDIEEMVAIFLHILAHDVKNRVVQRNFSRSGEMVSRYFNAILKAVLQFHIILLKKPEQIANICTDERWKWFQNCLGALDDSYIKVNV

Query:  SAVDQRRYRMRKGEIATNILAVCDQKGEFVFVFPGNMRIGA----MLDAPDYENSASVE-----------------------------------------
         A D+  +R RKGEIATN+L VCD KG+FV+V  G     A    + DA   EN   V                                          
Subjt:  SAVDQRRYRMRKGEIATNILAVCDQKGEFVFVFPGNMRIGA----MLDAPDYENSASVE-----------------------------------------

Query:  -----------------------------------------VDEDH-------------IEFVESSNELTKFRNDLAVEIFIECIMVGNSKRTKHVWSKV
                                                  DED              I+++E++NE +++R+DLA  +FI+  M  +++  +HVW++ 
Subjt:  -----------------------------------------VDEDH-------------IEFVESSNELTKFRNDLAVEIFIECIMVGNSKRTKHVWSKV

Query:  EDAKLVEALLYLVEI-DWRSDNGMFQPGYLQHL
        E+  LVE L+ LV +  W+SDNG F+ GYL  L
Subjt:  EDAKLVEALLYLVEI-DWRSDNGMFQPGYLQHL

KAA0041970.1 putative nuclease HARBI1 [Cucumis melo var. makuwa] | e value: 7.5e-49 | % identity: 43.46
Query:  MVRSIGCLEPTRCVDIEEMVAIFLHILAHDVKNRVVQRNFSRSGEMVSRYFNAILKAVLQFHIILLKKPEQIANICTDERWKWFQNCLGALDDSYIKVNV
        M+R  G LE T+ VD+EEMVAIFLHI+AHD+KNRV +R+F+RSGE +SR+FNA+L  VL+ H ILLK+P+ + + C+ E+W+WFQN LGALD ++IKVNV
Subjt:  MVRSIGCLEPTRCVDIEEMVAIFLHILAHDVKNRVVQRNFSRSGEMVSRYFNAILKAVLQFHIILLKKPEQIANICTDERWKWFQNCLGALDDSYIKVNV

Query:  SAVDQRRYRMRKGEIATNILAVCDQKGEFVFVFPG------NMRIGAMLDAPDYENSASVEVDEDH-------------------IEFVESSNELTKFRN
        S  D+RRYR RKG+I  N+L VC Q GEF+FV  G      ++R+  + DA        V   + +                   I    + N + +   
Subjt:  SAVDQRRYRMRKGEIATNILAVCDQKGEFVFVFPG------NMRIGAMLDAPDYENSASVEVDEDH-------------------IEFVESSNELTKFRN

Query:  DLAVEIFIECIMVGNSKRTKHVWSKVEDAKLVEALLYLVEI-DWRSDNGMFQPGYLQHLE
         L    F E  +    +  KH W+ +E+  LVE LL LVE   WR DNG F+ GYL  ++
Subjt:  DLAVEIFIECIMVGNSKRTKHVWSKVEDAKLVEALLYLVEI-DWRSDNGMFQPGYLQHLE

KAA0047074.1 retrotransposon protein [Cucumis melo var. makuwa] | e value: 4.1e-47 | % identity: 50.98
Query:  MVRSIGCLEPTRCVDIEEMVAIFLHILAHDVKNRVVQRNFSRSGEMVSRYFNAILKAVLQFHIILLKKPEQIANICTDERWKWFQNCLGALDDSYIKVNV
        ++R+I  L  T  VD+EEMVA+FLHILAHDVKNRV+QR F RS E +SR+FN +L AV++ H  LLKKP+ + N CTD+RW+WF+NCL AL+ +YIKVNV
Subjt:  MVRSIGCLEPTRCVDIEEMVAIFLHILAHDVKNRVVQRNFSRSGEMVSRYFNAILKAVLQFHIILLKKPEQIANICTDERWKWFQNCLGALDDSYIKVNV

Query:  SAVDQRRYRMRKGEIATNILAVCDQKGEFVFVFPG------NMRIGAMLDAPDYENSASV-------EVDEDH-------IEFVESSNELTKFRNDLAVE
         A D+ RYR RKGE+ATN+  +CD KG+FV+V  G      + RI  + DA    N   V       EVD  H       I ++E+SNE +++R+DLA E
Subjt:  SAVDQRRYRMRKGEIATNILAVCDQKGEFVFVFPG------NMRIGAMLDAPDYENSASV-------EVDEDH-------IEFVESSNELTKFRNDLAVE

Query:  IFIE
        +F E
Subjt:  IFIE

TYK02751.1 putative nuclease HARBI1 [Cucumis melo var. makuwa] | e value: 6.4e-48 | % identity: 44.96
Query:  MVRSIGCLEPTRCVDIEEMVAIFLHILAHDVKNRVVQRNFSRSGEMVSRYFNAILKAVLQFHIILLKKPEQIANICTDERWKWFQNCLGALDDSYIKVNV
        ++R++  L     VD+EEMVA+FLHI+AHDVKNRV+QR F RSGE +SR+FN +L  V++ H  LLKKP+ + N CTD+RW+WF+NCLGALD +YIKVNV
Subjt:  MVRSIGCLEPTRCVDIEEMVAIFLHILAHDVKNRVVQRNFSRSGEMVSRYFNAILKAVLQFHIILLKKPEQIANICTDERWKWFQNCLGALDDSYIKVNV

Query:  SAVDQRRYRMRKGEIATNILAVCDQKGEFVFVFPG------NMRIGAMLDAPDYENSASV-------------EVDEDHIEFVESSNELTKFRN-----D
         A D+ RYR  KGE+ATN+L +CD KG+FV+V  G      + RI  + DA    N   V              VD     +      L ++R       
Subjt:  SAVDQRRYRMRKGEIATNILAVCDQKGEFVFVFPG------NMRIGAMLDAPDYENSASV-------------EVDEDHIEFVESSNELTKFRN-----D

Query:  LAVEIFIECIMVGNSKRTKHVWSKVEDAKLVEALLYLVE-IDWRSDNGMFQPGYLQHL
         + E F       N K +    +K E+A LVE L+ LV    WRSDNG F PGYL  L
Subjt:  LAVEIFIECIMVGNSKRTKHVWSKVEDAKLVEALLYLVE-IDWRSDNGMFQPGYLQHL

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SYW1 Retrotransposon protein | e value: 3.1e-48 | % identity: 36.34
Query:  MVRSIGCLEPTRCVDIEEMVAIFLHILAHDVKNRVVQRNFSRSGEMVSRYFNAILKAVLQFHIILLKKPEQIANICTDERWKWFQNCLGALDDSYIKVNV
        ++R++  L  T  VD+EEMVA+FLHI AHDVKNRV+QR F RSGE VSR+FN +L AVL+ +  L+K+P  + + C D+RWK F+NCLGALD +YIKVNV
Subjt:  MVRSIGCLEPTRCVDIEEMVAIFLHILAHDVKNRVVQRNFSRSGEMVSRYFNAILKAVLQFHIILLKKPEQIANICTDERWKWFQNCLGALDDSYIKVNV

Query:  SAVDQRRYRMRKGEIATNILAVCDQKGEFVFVFPGNMRIGA----MLDAPDYENSASVE-----------------------------------------
         A D+  +R RKGEIATN+L VCD KG+FV+V  G     A    + DA   EN   V                                          
Subjt:  SAVDQRRYRMRKGEIATNILAVCDQKGEFVFVFPGNMRIGA----MLDAPDYENSASVE-----------------------------------------

Query:  -----------------------------------------VDEDH-------------IEFVESSNELTKFRNDLAVEIFIECIMVGNSKRTKHVWSKV
                                                  DED              I+++E++NE +++R+DLA  +FI+  M  +++  +HVW++ 
Subjt:  -----------------------------------------VDEDH-------------IEFVESSNELTKFRNDLAVEIFIECIMVGNSKRTKHVWSKV

Query:  EDAKLVEALLYLVEI-DWRSDNGMFQPGYLQHL
        E+  LVE L+ LV +  W+SDNG F+ GYL  L
Subjt:  EDAKLVEALLYLVEI-DWRSDNGMFQPGYLQHL

A0A5A7TF70 Putative nuclease HARBI1 | e value: 3.6e-49 | % identity: 43.46
Query:  MVRSIGCLEPTRCVDIEEMVAIFLHILAHDVKNRVVQRNFSRSGEMVSRYFNAILKAVLQFHIILLKKPEQIANICTDERWKWFQNCLGALDDSYIKVNV
        M+R  G LE T+ VD+EEMVAIFLHI+AHD+KNRV +R+F+RSGE +SR+FNA+L  VL+ H ILLK+P+ + + C+ E+W+WFQN LGALD ++IKVNV
Subjt:  MVRSIGCLEPTRCVDIEEMVAIFLHILAHDVKNRVVQRNFSRSGEMVSRYFNAILKAVLQFHIILLKKPEQIANICTDERWKWFQNCLGALDDSYIKVNV

Query:  SAVDQRRYRMRKGEIATNILAVCDQKGEFVFVFPG------NMRIGAMLDAPDYENSASVEVDEDH-------------------IEFVESSNELTKFRN
        S  D+RRYR RKG+I  N+L VC Q GEF+FV  G      ++R+  + DA        V   + +                   I    + N + +   
Subjt:  SAVDQRRYRMRKGEIATNILAVCDQKGEFVFVFPG------NMRIGAMLDAPDYENSASVEVDEDH-------------------IEFVESSNELTKFRN

Query:  DLAVEIFIECIMVGNSKRTKHVWSKVEDAKLVEALLYLVEI-DWRSDNGMFQPGYLQHLE
         L    F E  +    +  KH W+ +E+  LVE LL LVE   WR DNG F+ GYL  ++
Subjt:  DLAVEIFIECIMVGNSKRTKHVWSKVEDAKLVEALLYLVEI-DWRSDNGMFQPGYLQHLE

A0A5D3BSN2 Putative nuclease HARBI1 | e value: 3.1e-48 | % identity: 44.96
Query:  MVRSIGCLEPTRCVDIEEMVAIFLHILAHDVKNRVVQRNFSRSGEMVSRYFNAILKAVLQFHIILLKKPEQIANICTDERWKWFQNCLGALDDSYIKVNV
        ++R++  L     VD+EEMVA+FLHI+AHDVKNRV+QR F RSGE +SR+FN +L  V++ H  LLKKP+ + N CTD+RW+WF+NCLGALD +YIKVNV
Subjt:  MVRSIGCLEPTRCVDIEEMVAIFLHILAHDVKNRVVQRNFSRSGEMVSRYFNAILKAVLQFHIILLKKPEQIANICTDERWKWFQNCLGALDDSYIKVNV

Query:  SAVDQRRYRMRKGEIATNILAVCDQKGEFVFVFPG------NMRIGAMLDAPDYENSASV-------------EVDEDHIEFVESSNELTKFRN-----D
         A D+ RYR  KGE+ATN+L +CD KG+FV+V  G      + RI  + DA    N   V              VD     +      L ++R       
Subjt:  SAVDQRRYRMRKGEIATNILAVCDQKGEFVFVFPG------NMRIGAMLDAPDYENSASV-------------EVDEDHIEFVESSNELTKFRN-----D

Query:  LAVEIFIECIMVGNSKRTKHVWSKVEDAKLVEALLYLVE-IDWRSDNGMFQPGYLQHL
         + E F       N K +    +K E+A LVE L+ LV    WRSDNG F PGYL  L
Subjt:  LAVEIFIECIMVGNSKRTKHVWSKVEDAKLVEALLYLVE-IDWRSDNGMFQPGYLQHL

A0A5D3BZB1 Retrotransposon protein | e value: 2.0e-47 | % identity: 50.98
Query:  MVRSIGCLEPTRCVDIEEMVAIFLHILAHDVKNRVVQRNFSRSGEMVSRYFNAILKAVLQFHIILLKKPEQIANICTDERWKWFQNCLGALDDSYIKVNV
        ++R+I  L  T  VD+EEMVA+FLHILAHDVKNRV+QR F RS E +SR+FN +L AV++ H  LLKKP+ + N CTD+RW+WF+NCL AL+ +YIKVNV
Subjt:  MVRSIGCLEPTRCVDIEEMVAIFLHILAHDVKNRVVQRNFSRSGEMVSRYFNAILKAVLQFHIILLKKPEQIANICTDERWKWFQNCLGALDDSYIKVNV

Query:  SAVDQRRYRMRKGEIATNILAVCDQKGEFVFVFPG------NMRIGAMLDAPDYENSASV-------EVDEDH-------IEFVESSNELTKFRNDLAVE
         A D+ RYR RKGE+ATN+  +CD KG+FV+V  G      + RI  + DA    N   V       EVD  H       I ++E+SNE +++R+DLA E
Subjt:  SAVDQRRYRMRKGEIATNILAVCDQKGEFVFVFPG------NMRIGAMLDAPDYENSASV-------EVDEDH-------IEFVESSNELTKFRNDLAVE

Query:  IFIE
        +F E
Subjt:  IFIE

E5GCB5 Retrotransposon protein | e value: 3.6e-49 | % identity: 36.54
Query:  SIGCLEPTRCVDIEEMVAIFLHILAHDVKNRVVQRNFSRSGEMVSRYFNAILKAVLQFHIILLKKPEQIANICTDERWKWFQNCLGALDDSYIKVNVSAV
        +I  L  T  VD+EEMVA+FLHILAHDVK+RV++R F RSGE +SR+FN +L AV++ H  LLKKP+ + N CTD+RW+WF+NCLGALD +YIKVNV A 
Subjt:  SIGCLEPTRCVDIEEMVAIFLHILAHDVKNRVVQRNFSRSGEMVSRYFNAILKAVLQFHIILLKKPEQIANICTDERWKWFQNCLGALDDSYIKVNVSAV

Query:  DQRRYRMRKGEIATNILAVCDQKGEFVFVFPG--------------------------------------------------------------------
        D+ RYR RKGE+ATN+L VCD KG+FV+V  G                                                                    
Subjt:  DQRRYRMRKGEIATNILAVCDQKGEFVFVFPG--------------------------------------------------------------------

Query:  -NMR----------------------------------------------IGAMLDAPDYENSASVEVDEDH-------IEFVESSNELTKFRNDLAVEI
         NM+                                              I   +   D E++   EVD  H       I ++E+SNE +++R++LA E 
Subjt:  -NMR----------------------------------------------IGAMLDAPDYENSASVEVDEDH-------IEFVESSNELTKFRNDLAVEI

Query:  FIECIMVGNSKRTKHVWSKVEDAKLVEALLYLVEI-DWRSDNGMFQPGYLQHL
            IM  +S+  KH W+K E+A LVE L+ LV    WRSDNG F+PGYL  L
Subjt:  FIECIMVGNSKRTKHVWSKVEDAKLVEALLYLVEI-DWRSDNGMFQPGYLQHL

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G43722.1 unknown protein | e value: 2.7e-12 | % identity: 31.85
Query:  LEPTRCVDIEEMVAIFLHILAHDVKNRVVQRNFSRSGEMVSRYFNAILKAVLQFHIILLKKPEQ------IANICTDER-WKWFQNCLGALDDSYIKVNV
        L+PT  + IEE VA+FL I  H+   R V   F R+ E V R F  +L A        ++ P +         +  D+R W +F   +GA+D +++ V V
Subjt:  LEPTRCVDIEEMVAIFLHILAHDVKNRVVQRNFSRSGEMVSRYFNAILKAVLQFHIILLKKPEQ------IANICTDER-WKWFQNCLGALDDSYIKVNV

Query:  SAVDQRRYRMRKGEIATNILAVCDQKGEFVFVFPG
            Q  Y  R    + NI+A+CD K  F +++ G
Subjt:  SAVDQRRYRMRKGEIATNILAVCDQKGEFVFVFPG

AT5G28950.1 unknown protein | e value: 2.6e-07 | % identity: 32.32
Query:  WKWFQNCLGALDDSYIKVNVSAVDQRRYRMRKGEIATNILAVCDQKGEFVFV---FPGNMRIGAMLDAPDYENSASVEVDEDHIEFVESSNELTKFRND
        + +F++C+GA+DD++I   VS      +R RKG+I+ N+LA C+   EF++V   + G+     +L+     NS  + V E+     ES+ E+ +  ND
Subjt:  WKWFQNCLGALDDSYIKVNVSAVDQRRYRMRKGEIATNILAVCDQKGEFVFV---FPGNMRIGAMLDAPDYENSASVEVDEDHIEFVESSNELTKFRND

AT5G41980.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912) | e value: 2.9e-14 | % identity: 31.11
Query:  MVRSIGCLEPTRCVDIEEMVAIFLHILAHDVKNRVVQRNFSRSGEMVSRYFNAILKAVLQFHIILLKKPEQIANICTDERWKWFQNCLGALDDSYIKVNV
        ++++ G L  T  + IE  +AIFL I+ H+++ R VQ  F  SGE +SR+FN +L AV+       +       +  D+   +F++C+G +D  +I V V
Subjt:  MVRSIGCLEPTRCVDIEEMVAIFLHILAHDVKNRVVQRNFSRSGEMVSRYFNAILKAVLQFHIILLKKPEQIANICTDERWKWFQNCLGALDDSYIKVNV

Query:  SAVDQRRYRMRKGEIATNILAVCDQKGEFVFVFPG
           +Q  +R   G +  N+LA       F +V  G
Subjt:  SAVDQRRYRMRKGEIATNILAVCDQKGEFVFVFPG


Sequences
CDS sequence
ATGGTACGGTCGATTGGTTGTTTGGAACCAACAAGATGTGTGGACATCGAAGAGATGGTAGCGATATTCCTACACATTCTTGCGCATGATGTCAAGAATCGAGTGGTACA
GAGGAACTTTTCAAGATCCGGCGAGATGGTTTCGAGATATTTCAACGCAATTCTTAAGGCAGTTTTACAATTTCACATTATTTTGTTGAAAAAACCGGAACAAATCGCAA
ATATATGCACCGATGAAAGGTGGAAGTGGTTTCAGAATTGTTTAGGTGCGTTAGACGACTCATACATTAAAGTGAATGTTAGTGCCGTCGATCAACGTCGATATAGGATG
AGGAAGGGAGAAATTGCCACAAACATTCTTGCAGTGTGTGATCAAAAGGGGGAGTTCGTCTTTGTTTTCCCAGGGAACATGAGAATCGGTGCAATGCTTGACGCACCCGA
TTACGAGAATTCTGCATCAGTTGAAGTTGATGAGGATCATATTGAATTTGTTGAATCCTCGAACGAATTGACAAAGTTCAGGAATGACTTGGCGGTCGAAATATTTATTG
AATGTATAATGGTAGGTAATAGTAAGAGGACTAAGCACGTATGGTCTAAGGTGGAAGACGCTAAGTTGGTGGAAGCCCTACTCTATTTGGTAGAGATTGATTGGAGGTCG
GACAATGGGATGTTTCAACCCGGATACTTGCAGCACCTAGAGTGA
mRNA sequence
ATGGTACGGTCGATTGGTTGTTTGGAACCAACAAGATGTGTGGACATCGAAGAGATGGTAGCGATATTCCTACACATTCTTGCGCATGATGTCAAGAATCGAGTGGTACA
GAGGAACTTTTCAAGATCCGGCGAGATGGTTTCGAGATATTTCAACGCAATTCTTAAGGCAGTTTTACAATTTCACATTATTTTGTTGAAAAAACCGGAACAAATCGCAA
ATATATGCACCGATGAAAGGTGGAAGTGGTTTCAGAATTGTTTAGGTGCGTTAGACGACTCATACATTAAAGTGAATGTTAGTGCCGTCGATCAACGTCGATATAGGATG
AGGAAGGGAGAAATTGCCACAAACATTCTTGCAGTGTGTGATCAAAAGGGGGAGTTCGTCTTTGTTTTCCCAGGGAACATGAGAATCGGTGCAATGCTTGACGCACCCGA
TTACGAGAATTCTGCATCAGTTGAAGTTGATGAGGATCATATTGAATTTGTTGAATCCTCGAACGAATTGACAAAGTTCAGGAATGACTTGGCGGTCGAAATATTTATTG
AATGTATAATGGTAGGTAATAGTAAGAGGACTAAGCACGTATGGTCTAAGGTGGAAGACGCTAAGTTGGTGGAAGCCCTACTCTATTTGGTAGAGATTGATTGGAGGTCG
GACAATGGGATGTTTCAACCCGGATACTTGCAGCACCTAGAGTGA
Protein sequence
MVRSIGCLEPTRCVDIEEMVAIFLHILAHDVKNRVVQRNFSRSGEMVSRYFNAILKAVLQFHIILLKKPEQIANICTDERWKWFQNCLGALDDSYIKVNVSAVDQRRYRM
RKGEIATNILAVCDQKGEFVFVFPGNMRIGAMLDAPDYENSASVEVDEDHIEFVESSNELTKFRNDLAVEIFIECIMVGNSKRTKHVWSKVEDAKLVEALLYLVEIDWRS
DNGMFQPGYLQHLE