CuGenDBv2

Cla97C10G192770 (gene) of Watermelon (97103) v2.5 genome

Gene ID: Cla97C10G192770
Organism: Citrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
Description: Retrotransposon protein
Genome location: Cla97Chr10:20677200..20679542
RNA-Seq Expression: Cla97C10G192770
Synteny: Cla97C10G192770
Gene Ontology terms: GO:0110165 - cellular anatomical structure (cellular component)
                     GO:0005488 - binding (molecular function)
InterPro domains: NA


Homology
GenBank top hits    e value    %identity    Alignment
KAA0056666.1 putative nuclease HARBI1 [Cucumis melo var. makuwa]    4.8e-08    58.93
Query:  YMHRWTMKMVSGALDDTYIKVN----DKSRWEGSTADSKVLRDAISQPYGLRVPKG
        +M    + MVSGALD TYIKVN    D+ RWEGS +DS+VLRDA+S+P G++ PKG
Subjt:  YMHRWTMKMVSGALDDTYIKVN----DKSRWEGSTADSKVLRDAISQPYGLRVPKG

XP_038877407.1 uncharacterized protein LOC120069696 [Benincasa hispida]    6.8e-10    68.42
Query:  RHSNLQKEKYGLEFRRRKEVVNVIYNIEGLTED-DVALINFLVTDIQKTDCFLAVPD
        R ++ QK+KY LEF R+KEVVN IYNI+GL ED  V LI+ +VTDIQKTDCFLAVP+
Subjt:  RHSNLQKEKYGLEFRRRKEVVNVIYNIEGLTED-DVALINFLVTDIQKTDCFLAVPD

XP_038889345.1 putative nuclease HARBI1 [Benincasa hispida]    1.9e-20    37.97
Query:  MDGRGRDGCNIPTHSCTRCQESSGTQKL---------------------SRIDHKYMHRWTMKMVSGALDDTYIKVN----DKSR---------------
        M GR RDGC+IPTHSCT CQESSGT KL                     +R DHKYMH+W M++VSGALD TYIKVN    ++ R               
Subjt:  MDGRGRDGCNIPTHSCTRCQESSGTQKL---------------------SRIDHKYMHRWTMKMVSGALDDTYIKVN----DKSR---------------

Query:  -------------WEGSTADSKVLRDAISQPYGLRVPKGSSNMRFSLGYKTITQLRFARQNHHYIKMEWMKIRHS---NLQKEKYGL
                     WEGS ADS+VLR A+S+PY LRVPK  ++  + +G +        ++ H  ++  +  ++HS   N+ +  +GL
Subjt:  -------------WEGSTADSKVLRDAISQPYGLRVPKGSSNMRFSLGYKTITQLRFARQNHHYIKMEWMKIRHS---NLQKEKYGL

XP_038895773.1 uncharacterized protein LOC120083935 [Benincasa hispida]    4.7e-11    60.53
Query:  QLRFARQNHHYIKMEWMKIRHSNLQKEKYGLEFRRRKEVVNVIYNIEGLTEDD-VALINFLVTDIQKTDCFLAVPD
        ++R   Q+ H  +   M  R ++ QKEKY LEF RRKEVVN IYNI+GL EDD V LI+ LVTDIQKT+CFLAVP+
Subjt:  QLRFARQNHHYIKMEWMKIRHSNLQKEKYGLEFRRRKEVVNVIYNIEGLTEDD-VALINFLVTDIQKTDCFLAVPD

XP_038896380.1 uncharacterized protein LOC120084641 [Benincasa hispida]    8.0e-11    71.93
Query:  RHSNLQKEKYGLEFRRRKEVVNVIYNIEGLTEDD-VALINFLVTDIQKTDCFLAVPD
        R ++ QKEKY LEF RRKEVVN IY+I+GL EDD V  I+ LVTDIQKTDCFLAVP+
Subjt:  RHSNLQKEKYGLEFRRRKEVVNVIYNIEGLTEDD-VALINFLVTDIQKTDCFLAVPD

TrEMBL top hits    e value    %identity    Alignment
A0A1S3CI20 uncharacterized protein LOC103501200    6.4e-06    63.04
Query:  SGALDDTYIKVN----DKSRWEGSTADSKVLRDAISQPYGLRVPKG
        S ALD TYIKVN    D+ RWEGS +DS+VLRDA+S+P G++ PKG
Subjt:  SGALDDTYIKVN----DKSRWEGSTADSKVLRDAISQPYGLRVPKG

A0A5A7THQ9 Retrotransposon protein    1.1e-05    56.6
Query:  RWTMKMVSGALDDTYIKVN----DKSRWEGSTADSKVLRDAISQPYGLRVPKG
        RW    +S ALD TYIKVN    D++ WEGS ADS++L DA+S+P G++VPKG
Subjt:  RWTMKMVSGALDDTYIKVN----DKSRWEGSTADSKVLRDAISQPYGLRVPKG

A0A5A7UQS6 Putative nuclease HARBI1    2.3e-08    58.93
Query:  YMHRWTMKMVSGALDDTYIKVN----DKSRWEGSTADSKVLRDAISQPYGLRVPKG
        +M    + MVSGALD TYIKVN    D+ RWEGS +DS+VLRDA+S+P G++ PKG
Subjt:  YMHRWTMKMVSGALDDTYIKVN----DKSRWEGSTADSKVLRDAISQPYGLRVPKG

A0A5D3CS46 Putative nuclease HARBI1    1.1e-05    57.41
Query:  GALDDTYIKVN----DKSRWEGSTADSKVLRDAISQPYGLRVPKGSSNMRFSLG
        GALDD YIKVN    D+ RWEGSTAD ++LRDAIS+    +VPKG  ++ F  G
Subjt:  GALDDTYIKVN----DKSRWEGSTADSKVLRDAISQPYGLRVPKGSSNMRFSLG

A0A5D3DJR9 Retrotransposon protein    8.3e-06    59.62
Query:  MVSGALDDTYIKVN---DKSRWEGSTADSKVLRDAISQPYGLRVPKGSSNMR
        MVSGALD T++KVN       WEGS +DS+VLRDA+S+P GL+VPKG   +R
Subjt:  MVSGALDDTYIKVN---DKSRWEGSTADSKVLRDAISQPYGLRVPKGSSNMR

SwissProt top hits    e value    %identity    Alignment
No hits found
Arabidopsis top hits    e value    %identity    Alignment
No hits found

Sequences
CDS sequence
ATGGATGGACGTGGAAGAGATGGTTGCAATATTCCTACACATTCTTGCACACGATGTCAAGAATCGAGTGGTACGCAAAAACTTTCGAGAATCGATCACAAGTACATGCA
CCGATGGACGATGAAAATGGTTTCAGGTGCGTTAGATGACACATACATTAAAGTGAATGACAAGTCAAGGTGGGAAGGGTCTACAGCCGATTCAAAGGTTCTTAGGGATG
CGATTTCGCAACCATATGGATTGAGGGTTCCGAAGGGGAGTTCGAATATGAGATTTAGCTTGGGTTACAAGACTATCACACAATTGAGGTTCGCCAGACAGAATCACCAT
TACATCAAGATGGAATGGATGAAGATTCGACATAGCAATCTACAGAAGGAAAAGTATGGGTTGGAGTTTCGGCGTCGAAAGGAAGTAGTAAATGTCATATACAACATTGA
GGGTTTGACTGAGGATGATGTCGCCCTTATTAACTTCCTTGTCACAGACATTCAAAAGACAGATTGCTTCCTTGCAGTACCAGATAGGGTTGGCAGCGGGGGTGGGGCAG
GGCGGGCGGGTGGGGGTTCTCCGTCCCCGTCCCGTGGGGGGAATTTACCCCATCCCCACCTCTGTCCCCGCCAAGGCAAGGACGGGGGCGGGGATTCCCCATCGAGGAAA
TGGGTCCCCTCGGGGACCCATTCCCCAAGATAG
mRNA sequence
ATGGATGGACGTGGAAGAGATGGTTGCAATATTCCTACACATTCTTGCACACGATGTCAAGAATCGAGTGGTACGCAAAAACTTTCGAGAATCGATCACAAGTACATGCA
CCGATGGACGATGAAAATGGTTTCAGGTGCGTTAGATGACACATACATTAAAGTGAATGACAAGTCAAGGTGGGAAGGGTCTACAGCCGATTCAAAGGTTCTTAGGGATG
CGATTTCGCAACCATATGGATTGAGGGTTCCGAAGGGGAGTTCGAATATGAGATTTAGCTTGGGTTACAAGACTATCACACAATTGAGGTTCGCCAGACAGAATCACCAT
TACATCAAGATGGAATGGATGAAGATTCGACATAGCAATCTACAGAAGGAAAAGTATGGGTTGGAGTTTCGGCGTCGAAAGGAAGTAGTAAATGTCATATACAACATTGA
GGGTTTGACTGAGGATGATGTCGCCCTTATTAACTTCCTTGTCACAGACATTCAAAAGACAGATTGCTTCCTTGCAGTACCAGATAGGGTTGGCAGCGGGGGTGGGGCAG
GGCGGGCGGGTGGGGGTTCTCCGTCCCCGTCCCGTGGGGGGAATTTACCCCATCCCCACCTCTGTCCCCGCCAAGGCAAGGACGGGGGCGGGGATTCCCCATCGAGGAAA
TGGGTCCCCTCGGGGACCCATTCCCCAAGATAG
Protein sequence
MDGRGRDGCNIPTHSCTRCQESSGTQKLSRIDHKYMHRWTMKMVSGALDDTYIKVNDKSRWEGSTADSKVLRDAISQPYGLRVPKGSSNMRFSLGYKTITQLRFARQNHH
YIKMEWMKIRHSNLQKEKYGLEFRRRKEVVNVIYNIEGLTEDDVALINFLVTDIQKTDCFLAVPDRVGSGGGAGRAGGGSPSPSRGGNLPHPHLCPRQGKDGGGDSPSRK
WVPSGTHSPR
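
The CDS and protein sequences listed above can be cross-checked by translating the CDS in frame. The following is a minimal sketch, not part of the database record: it assumes the standard genetic code (NCBI translation table 1), and the `translate` helper and the variable names are illustrative. The two fragments are the first 30 nucleotides and the final 33-nucleotide line of the CDS as listed; the final line is assumed to begin on a codon boundary.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated
# with bases in T, C, A, G order. Assumed here; not stated in the record.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# Fragments copied from the CDS sequence of Cla97C10G192770.
cds_start = "ATGGATGGACGTGGAAGAGATGGTTGCAAT"
cds_end = "TGGGTCCCCTCGGGGACCCATTCCCCAAGATAG"

print(translate(cds_start))  # compare with the start of the protein sequence
print(translate(cds_end))    # compare with the end of the protein sequence
```

Passing the full CDS (all lines concatenated) to `translate` should reproduce the complete protein sequence if the record is internally consistent.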