; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmUC10G191690 (gene) of Watermelon (USVL531) v1 genome

Gene IDCmUC10G191690
OrganismCitrullus mucosospermus (Watermelon (USVL531) v1)
DescriptionRetrotransposon protein
Genome locationCmU531Chr10:21549344..21551248
RNA-Seq ExpressionCmUC10G191690
SyntenyCmUC10G191690
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0033290.1 putative nuclease HARBI1 [Cucumis melo var. makuwa]1.7e-3352.8Show/hide
Query:  MLRSTGCLEPTRCLDVEEMVAIFLHILAHNVKNQVIHINFSRSNETVLRYFNTVLRAVFQICTALLKKLEPITS---------------ASDDTYIKVNV
        +LR+T  L  T  +DVEEMVA+FLHILAH++KN++I   F RS ETV R+FN VL +V ++   LLKK + +T+               A DDTYIKVNV
Subjt:  MLRSTGCLEPTRCLDVEEMVAIFLHILAHNVKNQVIHINFSRSNETVLRYFNTVLRAVFQICTALLKKLEPITS---------------ASDDTYIKVNV

Query:  SAFDPPGYRTRNREITINVLAICNQNGEFVFVLLGWEESAANSRVFRDAILRSYRLRVLKG
        SA D P Y TR  E+ INVL +C+  G+FVFVL GWE SAA+SR+ RDAI R   L+V KG
Subjt:  SAFDPPGYRTRNREITINVLAICNQNGEFVFVLLGWEESAANSRVFRDAILRSYRLRVLKG

KAA0050107.1 putative nuclease HARBI1 [Cucumis melo var. makuwa]2.6e-3452.8Show/hide
Query:  MLRSTGCLEPTRCLDVEEMVAIFLHILAHNVKNQVIHINFSRSNETVLRYFNTVLRAVFQICTALLKKLEPIT---------------SASDDTYIKVNV
        MLR+ G LE T+ +DVEEMVAIFLHI+AH+VKN+V   +F+RS ETV R+FN VL AV ++   LLK+ +P+T                A   T+IKVNV
Subjt:  MLRSTGCLEPTRCLDVEEMVAIFLHILAHNVKNQVIHINFSRSNETVLRYFNTVLRAVFQICTALLKKLEPIT---------------SASDDTYIKVNV

Query:  SAFDPPGYRTRNREITINVLAICNQNGEFVFVLLGWEESAANSRVFRDAILRSYRLRVLKG
        S  D P YR+R  +IT NVL +C+QNGEF+FV+ GWE SA++SRV RDA+ R   L+V KG
Subjt:  SAFDPPGYRTRNREITINVLAICNQNGEFVFVLLGWEESAANSRVFRDAILRSYRLRVLKG

TYK08067.1 retrotransposon protein [Cucumis melo var. makuwa]2.0e-3456.08Show/hide
Query:  MLRSTGCLEPTRCLDVEEMVAIFLHILAHNVKNQVIHINFSRSNETVLRYFNTVLRAVFQICTALLKKLEPIT--SASDDTYIKVNVSAFDPPGYRTRNR
        MLR+ G LE T+ +DVEEM AIFLHI+AH+VKN+V   +F+RS+ TV R+FN VL AV +I   LLK+ + +T   A D T+IKVNVS  D P YR+R  
Subjt:  MLRSTGCLEPTRCLDVEEMVAIFLHILAHNVKNQVIHINFSRSNETVLRYFNTVLRAVFQICTALLKKLEPIT--SASDDTYIKVNVSAFDPPGYRTRNR

Query:  EITINVLAICNQNGEFVFVLLGWEESAANSRVFRDAILRSYRLRVLKG
        +IT NVL +C+QNGEF+FV+ GWE SA++SRV RD + R   L+V KG
Subjt:  EITINVLAICNQNGEFVFVLLGWEESAANSRVFRDAILRSYRLRVLKG

XP_008455792.1 PREDICTED: putative nuclease HARBI1 [Cucumis melo]4.4e-3454.04Show/hide
Query:  MLRSTGCLEPTRCLDVEEMVAIFLHILAHNVKNQVIHINFSRSNETVLRYFNTVLRAVFQICTALLKKLEPITS---------------ASDDTYIKVNV
        +LR+T  L  T  +DVEEMVA+FLHILAH+VKN++I   F RS ETV R+FN VL A F++   LLKK +P+T+               A D TYIKVNV
Subjt:  MLRSTGCLEPTRCLDVEEMVAIFLHILAHNVKNQVIHINFSRSNETVLRYFNTVLRAVFQICTALLKKLEPITS---------------ASDDTYIKVNV

Query:  SAFDPPGYRTRNREITINVLAICNQNGEFVFVLLGWEESAANSRVFRDAILRSYRLRVLKG
        SA D P YRTR  E+  NVL  C+  G+FVFVL GWE SAA+SR+ RDAI R   L+V KG
Subjt:  SAFDPPGYRTRNREITINVLAICNQNGEFVFVLLGWEESAANSRVFRDAILRSYRLRVLKG

XP_038875111.1 uncharacterized protein LOC120067643 [Benincasa hispida]1.2e-3455.28Show/hide
Query:  MLRSTGCLEPTRCLDVEEMVAIFLHILAHNVKNQVIHINFSRSNETVLRYFNTVLRAVFQICTALLKKLEPITS---------------ASDDTYIKVNV
        MLR+     PT+C+D++EMVAIFLHIL H+VKN+V+   F+ S ETV R+F  VL  V Q+   LLKK EPITS               A DDTYIKVNV
Subjt:  MLRSTGCLEPTRCLDVEEMVAIFLHILAHNVKNQVIHINFSRSNETVLRYFNTVLRAVFQICTALLKKLEPITS---------------ASDDTYIKVNV

Query:  SAFDPPGYRTRNREITINVLAICNQNGEFVFVLLGWEESAANSRVFRDAILRSYRLRVLKG
        SA D   YRTR  EI  NVLAIC+   EF+FVL  WE S ANSRV RDAI R + L+V KG
Subjt:  SAFDPPGYRTRNREITINVLAICNQNGEFVFVLLGWEESAANSRVFRDAILRSYRLRVLKG

TrEMBL top hitse value%identityAlignment
A0A1S3C1U8 putative nuclease HARBI12.1e-3454.04Show/hide
Query:  MLRSTGCLEPTRCLDVEEMVAIFLHILAHNVKNQVIHINFSRSNETVLRYFNTVLRAVFQICTALLKKLEPITS---------------ASDDTYIKVNV
        +LR+T  L  T  +DVEEMVA+FLHILAH+VKN++I   F RS ETV R+FN VL A F++   LLKK +P+T+               A D TYIKVNV
Subjt:  MLRSTGCLEPTRCLDVEEMVAIFLHILAHNVKNQVIHINFSRSNETVLRYFNTVLRAVFQICTALLKKLEPITS---------------ASDDTYIKVNV

Query:  SAFDPPGYRTRNREITINVLAICNQNGEFVFVLLGWEESAANSRVFRDAILRSYRLRVLKG
        SA D P YRTR  E+  NVL  C+  G+FVFVL GWE SAA+SR+ RDAI R   L+V KG
Subjt:  SAFDPPGYRTRNREITINVLAICNQNGEFVFVLLGWEESAANSRVFRDAILRSYRLRVLKG

A0A5A7SQU2 Putative nuclease HARBI18.1e-3452.8Show/hide
Query:  MLRSTGCLEPTRCLDVEEMVAIFLHILAHNVKNQVIHINFSRSNETVLRYFNTVLRAVFQICTALLKKLEPITS---------------ASDDTYIKVNV
        +LR+T  L  T  +DVEEMVA+FLHILAH++KN++I   F RS ETV R+FN VL +V ++   LLKK + +T+               A DDTYIKVNV
Subjt:  MLRSTGCLEPTRCLDVEEMVAIFLHILAHNVKNQVIHINFSRSNETVLRYFNTVLRAVFQICTALLKKLEPITS---------------ASDDTYIKVNV

Query:  SAFDPPGYRTRNREITINVLAICNQNGEFVFVLLGWEESAANSRVFRDAILRSYRLRVLKG
        SA D P Y TR  E+ INVL +C+  G+FVFVL GWE SAA+SR+ RDAI R   L+V KG
Subjt:  SAFDPPGYRTRNREITINVLAICNQNGEFVFVLLGWEESAANSRVFRDAILRSYRLRVLKG

A0A5A7U6W3 Putative nuclease HARBI11.3e-3452.8Show/hide
Query:  MLRSTGCLEPTRCLDVEEMVAIFLHILAHNVKNQVIHINFSRSNETVLRYFNTVLRAVFQICTALLKKLEPIT---------------SASDDTYIKVNV
        MLR+ G LE T+ +DVEEMVAIFLHI+AH+VKN+V   +F+RS ETV R+FN VL AV ++   LLK+ +P+T                A   T+IKVNV
Subjt:  MLRSTGCLEPTRCLDVEEMVAIFLHILAHNVKNQVIHINFSRSNETVLRYFNTVLRAVFQICTALLKKLEPIT---------------SASDDTYIKVNV

Query:  SAFDPPGYRTRNREITINVLAICNQNGEFVFVLLGWEESAANSRVFRDAILRSYRLRVLKG
        S  D P YR+R  +IT NVL +C+QNGEF+FV+ GWE SA++SRV RDA+ R   L+V KG
Subjt:  SAFDPPGYRTRNREITINVLAICNQNGEFVFVLLGWEESAANSRVFRDAILRSYRLRVLKG

A0A5D3BXH4 Putative nuclease HARBI11.1e-3350.29Show/hide
Query:  MLRSTGCLEPTRCLDVEEMVAIFLHILAHNVKNQVIHINFSRSNETVLRYFNTVLRAVFQICTALLKKLEPITS---------------ASDDTYIKVNV
        +L +T  L     +DVEEMVA+FLHIL H+VKN++I   F RS ETV R+FN VL A  ++   LLKKL+P+T+               A DDTYIKVNV
Subjt:  MLRSTGCLEPTRCLDVEEMVAIFLHILAHNVKNQVIHINFSRSNETVLRYFNTVLRAVFQICTALLKKLEPITS---------------ASDDTYIKVNV

Query:  SAFDPPGYRTRNREITINVLAICNQNGEFVFVLLGWEESAANSRVFRDAILRSYRLRVLKGDLNTYGRRWRTL
        SA D P Y+TR  E+  NVL +C+  G+FVFVL GWE SAA+SR+ RDAI R   L+V KG       RW+ L
Subjt:  SAFDPPGYRTRNREITINVLAICNQNGEFVFVLLGWEESAANSRVFRDAILRSYRLRVLKGDLNTYGRRWRTL

A0A5D3C7X6 Retrotransposon protein9.6e-3556.08Show/hide
Query:  MLRSTGCLEPTRCLDVEEMVAIFLHILAHNVKNQVIHINFSRSNETVLRYFNTVLRAVFQICTALLKKLEPIT--SASDDTYIKVNVSAFDPPGYRTRNR
        MLR+ G LE T+ +DVEEM AIFLHI+AH+VKN+V   +F+RS+ TV R+FN VL AV +I   LLK+ + +T   A D T+IKVNVS  D P YR+R  
Subjt:  MLRSTGCLEPTRCLDVEEMVAIFLHILAHNVKNQVIHINFSRSNETVLRYFNTVLRAVFQICTALLKKLEPIT--SASDDTYIKVNVSAFDPPGYRTRNR

Query:  EITINVLAICNQNGEFVFVLLGWEESAANSRVFRDAILRSYRLRVLKG
        +IT NVL +C+QNGEF+FV+ GWE SA++SRV RD + R   L+V KG
Subjt:  EITINVLAICNQNGEFVFVLLGWEESAANSRVFRDAILRSYRLRVLKG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G43722.1 unknown protein3.8e-0726.28Show/hide
Query:  MLRSTGCLEPTRCLDVEEMVAIFLHILAHNVKNQVIHINFSRSNETVLRYFNTVLRAV----------------------FQICTALLKKLEPITSASDD
        ML++   L+PT  + +EE VA+FL I  HN   + + + F R+ ETV R F  VL A                        Q+             A D 
Subjt:  MLRSTGCLEPTRCLDVEEMVAIFLHILAHNVKNQVIHINFSRSNETVLRYFNTVLRAV----------------------FQICTALLKKLEPITSASDD

Query:  TYIKVNVSAFDPPGYRTRNREITINVLAICNQNGEFVFVLLGWEESAANSRVFRDA
        T++ V V       Y  R+   ++N++AIC+    F ++  G   S  ++ V + A
Subjt:  TYIKVNVSAFDPPGYRTRNREITINVLAICNQNGEFVFVLLGWEESAANSRVFRDA

AT5G28730.1 unknown protein3.5e-0529.85Show/hide
Query:  LEPTRCLDVEEMVAIFLHILAHNVKNQVIHINFSRSNETVLRYFNTVLRAVFQICTALL--KKLEPITSAS----DDTYIKVNVSAFDPPGYRTRNREIT
        L+ +  + ++E VAIFL I A N   + I + F  + ET+ R F+ VL+A+ ++    +  +K+E + + S    DDT  +      D  G        +
Subjt:  LEPTRCLDVEEMVAIFLHILAHNVKNQVIHINFSRSNETVLRYFNTVLRAVFQICTALL--KKLEPITSAS----DDTYIKVNVSAFDPPGYRTRNREIT

Query:  INVLAICNQNGEFVFVLLGWEESAANSRVFRDAI
         NVLAIC+ +  F +  +G   S  ++RV   AI
Subjt:  INVLAICNQNGEFVFVLLGWEESAANSRVFRDAI

AT5G28950.1 unknown protein3.6e-1043.02Show/hide
Query:  TALLKKLEPITSASDDTYIKVNVSAFDPPGYRTRNREITINVLAICNQNGEFVFVLLGWEESAANSRVFRDAILR-SYRLRVLKGD
        T L    +    A DDT+I   VS    P +R R  +I+ N+LA CN + EF++VL GWE SA +S+V  DA+ R S RL V + D
Subjt:  TALLKKLEPITSASDDTYIKVNVSAFDPPGYRTRNREITINVLAICNQNGEFVFVLLGWEESAANSRVFRDAILR-SYRLRVLKGD

AT5G41980.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912)6.4e-1532.7Show/hide
Query:  MLRSTGCLEPTRCLDVEEMVAIFLHILAHNVKNQVIHINFSRSNETVLRYFNTVLRAVFQICTALLK-KLEPITSASDDTYIK------------VNVSA
        +L++ G L  T  + +E  +AIFL I+ HN++ + +   F  S ET+ R+FN VL AV  I     +      T  +DD Y K            V V  
Subjt:  MLRSTGCLEPTRCLDVEEMVAIFLHILAHNVKNQVIHINFSRSNETVLRYFNTVLRAVFQICTALLK-KLEPITSASDDTYIK------------VNVSA

Query:  FDPPGYRTRNREITINVLAICNQNGEFVFVLLGWEESAANSRVFRDAILRSYRLRVLKG
         +   +R  N  +T NVLA  + +  F +VL GWE SA++ +V   A+ R  +L+V +G
Subjt:  FDPPGYRTRNREITINVLAICNQNGEFVFVLLGWEESAANSRVFRDAILRSYRLRVLKG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTAAGGTCAACTGGTTGTTTGGAACCAACTAGATGTTTGGACGTGGAAGAGATGGTTGCGATATTCCTACACATTCTTGCACACAATGTTAAGAATCAAGTGATACA
CATAAACTTTTCGAGGTCTAACGAGACTGTTTTGAGATATTTCAACACAGTTCTTAGAGCAGTCTTTCAAATTTGCACCGCTTTATTGAAAAAACTAGAACCAATCACAA
GTGCGTCAGATGACACATACATTAAAGTGAATGTTAGTGCATTCGATCCACCTGGATATAGGACGAGAAATAGAGAGATCACCATAAACGTTCTTGCGATCTGTAACCAA
AATGGGGAGTTCGTCTTCGTTCTGCTAGGGTGGGAAGAGTCTGCAGCTAATTCAAGGGTTTTTAGGGATGCAATTTTGCGATCGTACAGATTGAGGGTTCTGAAGGGAGA
TCTAAACACGTATGGTCGAAGGTGGAGAACGCTAAGTTGGTGGAAGGCTTATTGTACTTGGTGGAAACCGGTTGAAGGTCCAACAATGGAACGTTTGAACCAGGATACCT
ACATCACTCGGAGCGAATTCTACATGAGAAAGTGCCCAGAGTCATCCCACTGCGAAGGAAATGTGGAACAAGTCATTCTCTCTACCGTATTTGGGAAAGACAGAGCAGTA
GGACAATCAAGTGAGGCCCTGCACGTGATGGCAATGAATGCATTTAGAGAGTTTGAAGATGAGATTCAGCTTGGATCACAGGAATATCACACACATGAGGTTCGCGAGAC
AGAATAA
mRNA sequenceShow/hide mRNA sequence
ATGCTAAGGTCAACTGGTTGTTTGGAACCAACTAGATGTTTGGACGTGGAAGAGATGGTTGCGATATTCCTACACATTCTTGCACACAATGTTAAGAATCAAGTGATACA
CATAAACTTTTCGAGGTCTAACGAGACTGTTTTGAGATATTTCAACACAGTTCTTAGAGCAGTCTTTCAAATTTGCACCGCTTTATTGAAAAAACTAGAACCAATCACAA
GTGCGTCAGATGACACATACATTAAAGTGAATGTTAGTGCATTCGATCCACCTGGATATAGGACGAGAAATAGAGAGATCACCATAAACGTTCTTGCGATCTGTAACCAA
AATGGGGAGTTCGTCTTCGTTCTGCTAGGGTGGGAAGAGTCTGCAGCTAATTCAAGGGTTTTTAGGGATGCAATTTTGCGATCGTACAGATTGAGGGTTCTGAAGGGAGA
TCTAAACACGTATGGTCGAAGGTGGAGAACGCTAAGTTGGTGGAAGGCTTATTGTACTTGGTGGAAACCGGTTGAAGGTCCAACAATGGAACGTTTGAACCAGGATACCT
ACATCACTCGGAGCGAATTCTACATGAGAAAGTGCCCAGAGTCATCCCACTGCGAAGGAAATGTGGAACAAGTCATTCTCTCTACCGTATTTGGGAAAGACAGAGCAGTA
GGACAATCAAGTGAGGCCCTGCACGTGATGGCAATGAATGCATTTAGAGAGTTTGAAGATGAGATTCAGCTTGGATCACAGGAATATCACACACATGAGGTTCGCGAGAC
AGAATAA
Protein sequenceShow/hide protein sequence
MLRSTGCLEPTRCLDVEEMVAIFLHILAHNVKNQVIHINFSRSNETVLRYFNTVLRAVFQICTALLKKLEPITSASDDTYIKVNVSAFDPPGYRTRNREITINVLAICNQ
NGEFVFVLLGWEESAANSRVFRDAILRSYRLRVLKGDLNTYGRRWRTLSWWKAYCTWWKPVEGPTMERLNQDTYITRSEFYMRKCPESSHCEGNVEQVILSTVFGKDRAV
GQSSEALHVMAMNAFREFEDEIQLGSQEYHTHEVRETE