; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc04G04700 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc04G04700
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionRetrotransposon protein
Genome locationClcChr04:16956655..16958984
RNA-Seq ExpressionClc04G04700
SyntenyClc04G04700
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0033290.1 putative nuclease HARBI1 [Cucumis melo var. makuwa]3.7e-3354.05Show/hide
Query:  VEVVSGWEGSAAGLRVLRDAVSRPYRLRVPK--------------GFLAPYRGERYHLSDWRGVGNASTTVREFFNMKYSSTRNVITCRNTKRYRASVWV
        V V+SGWEGSAA  R+LRDA+SR   L+VPK              GFLAPYRGERYHLS+W G  NA TT REFFNMK+SS RNVI        + + W+
Subjt:  VEVVSGWEGSAAGLRVLRDAVSRPYRLRVPK--------------GFLAPYRGERYHLSDWRGVGNASTTVREFFNMKYSSTRNVITCRNTKRYRASVWV

Query:  AKGAVG-FRGKPFYPIQVQCRTITACCLIHNLIIRKMGIGAMLDASDE
         KG     RGK +YP+ VQCRTI  CCL+HNLI R+M    ++D  DE
Subjt:  AKGAVG-FRGKPFYPIQVQCRTITACCLIHNLIIRKMGIGAMLDASDE

KAA0035620.1 retrotransposon protein [Cucumis melo var. makuwa]2.6e-3153.38Show/hide
Query:  VEVVSGWEGSAAGLRVLRDAVSRPYRLRVPK--------------GFLAPYRGERYHLSDWRGVGNASTTVREFFNMKYSSTRNVITCRNTKRYRASVWV
        V V++GWEGSAA  R+LRDA+SRP RL+VPK              GFLAPYRG+RYHL +WRG  NA +T +EFFNMK+SS RNVI          +  V
Subjt:  VEVVSGWEGSAAGLRVLRDAVSRPYRLRVPK--------------GFLAPYRGERYHLSDWRGVGNASTTVREFFNMKYSSTRNVITCRNTKRYRASVWV

Query:  AKGAVG-FRGKPFYPIQVQCRTITACCLIHNLIIRKMGIGAMLDASDE
         KG     RGK +YP++VQCRTI ACCL+HNLI R+M    + D  DE
Subjt:  AKGAVG-FRGKPFYPIQVQCRTITACCLIHNLIIRKMGIGAMLDASDE

KAA0046727.1 retrotransposon protein [Cucumis melo var. makuwa]2.6e-3153.38Show/hide
Query:  VEVVSGWEGSAAGLRVLRDAVSRPYRLRVPK--------------GFLAPYRGERYHLSDWRGVGNASTTVREFFNMKYSSTRNVITCRNTKRYRASVWV
        V V++GWEGSAA  R+LRDA+SRP RL+VPK              GFLAPYRG+RYHL +WRG  NA +T +EFFNMK+SS RNVI          +  V
Subjt:  VEVVSGWEGSAAGLRVLRDAVSRPYRLRVPK--------------GFLAPYRGERYHLSDWRGVGNASTTVREFFNMKYSSTRNVITCRNTKRYRASVWV

Query:  AKGAVG-FRGKPFYPIQVQCRTITACCLIHNLIIRKMGIGAMLDASDE
         KG     RGK +YP++VQCRTI ACCL+HNLI R+M    + D  DE
Subjt:  AKGAVG-FRGKPFYPIQVQCRTITACCLIHNLIIRKMGIGAMLDASDE

KAA0062747.1 retrotransposon protein [Cucumis melo var. makuwa]5.3e-3254.05Show/hide
Query:  VEVVSGWEGSAAGLRVLRDAVSRPYRLRVPK--------------GFLAPYRGERYHLSDWRGVGNASTTVREFFNMKYSSTRNVITCRNTKRYRASVWV
        V V+ GWEGSAA  R+LRDA+SR   L+VPK              GFLAPYRGERYHLS+WRG  NA TT REFFNMK+SS+RNVI          +  +
Subjt:  VEVVSGWEGSAAGLRVLRDAVSRPYRLRVPK--------------GFLAPYRGERYHLSDWRGVGNASTTVREFFNMKYSSTRNVITCRNTKRYRASVWV

Query:  AKGAVG-FRGKPFYPIQVQCRTITACCLIHNLIIRKMGIGAMLDASDE
         KG     RGK +YP+ VQCRTI ACCL+HNLI R+M    ++D  DE
Subjt:  AKGAVG-FRGKPFYPIQVQCRTITACCLIHNLIIRKMGIGAMLDASDE

XP_038886674.1 putative nuclease HARBI1 [Benincasa hispida]4.8e-4169.92Show/hide
Query:  VEVVSGWEGSAAGLRVLRDAVSRPYRLRVPKGFLAPYRGERYHLSDWRGVGNASTTVREFFNMKYSSTRNVITCRNTKRYRASVWVAKGAVGFRGKPFYP
        V V+ GWEGSA   RVLR+ VSRPY LRVPKGFLA YR ERYHLSDWRGVGNASTT REFFNMK+SS RNVI      R     W        RGK FYP
Subjt:  VEVVSGWEGSAAGLRVLRDAVSRPYRLRVPKGFLAPYRGERYHLSDWRGVGNASTTVREFFNMKYSSTRNVITCRNTKRYRASVWVAKGAVGFRGKPFYP

Query:  IQVQCRTITACCLIHNLIIRKMGIGAMLDASDE
        IQVQCRTIT C LIHNLI R+MGIGAMLD  DE
Subjt:  IQVQCRTITACCLIHNLIIRKMGIGAMLDASDE

TrEMBL top hitse value%identityAlignment
A0A5A7SNV3 Retrotransposon protein2.2e-3153.38Show/hide
Query:  VEVVSGWEGSAAGLRVLRDAVSRPYRLRVPK--------------GFLAPYRGERYHLSDWRGVGNASTTVREFFNMKYSSTRNVITCRNTKRYRASVWV
        V V++GWEGSAA  R+LRDA+SRP RL+VPK              GFLAPYRG+RYHL  WRG  NA +T +EFFNMK+SS RNVI          +  V
Subjt:  VEVVSGWEGSAAGLRVLRDAVSRPYRLRVPK--------------GFLAPYRGERYHLSDWRGVGNASTTVREFFNMKYSSTRNVITCRNTKRYRASVWV

Query:  AKGAVG-FRGKPFYPIQVQCRTITACCLIHNLIIRKMGIGAMLDASDE
         KG     RGK +YP++VQCRTI ACCL+HNLI R+M    + D  DE
Subjt:  AKGAVG-FRGKPFYPIQVQCRTITACCLIHNLIIRKMGIGAMLDASDE

A0A5A7SQU2 Putative nuclease HARBI11.8e-3354.05Show/hide
Query:  VEVVSGWEGSAAGLRVLRDAVSRPYRLRVPK--------------GFLAPYRGERYHLSDWRGVGNASTTVREFFNMKYSSTRNVITCRNTKRYRASVWV
        V V+SGWEGSAA  R+LRDA+SR   L+VPK              GFLAPYRGERYHLS+W G  NA TT REFFNMK+SS RNVI        + + W+
Subjt:  VEVVSGWEGSAAGLRVLRDAVSRPYRLRVPK--------------GFLAPYRGERYHLSDWRGVGNASTTVREFFNMKYSSTRNVITCRNTKRYRASVWV

Query:  AKGAVG-FRGKPFYPIQVQCRTITACCLIHNLIIRKMGIGAMLDASDE
         KG     RGK +YP+ VQCRTI  CCL+HNLI R+M    ++D  DE
Subjt:  AKGAVG-FRGKPFYPIQVQCRTITACCLIHNLIIRKMGIGAMLDASDE

A0A5A7TXW1 Retrotransposon protein1.3e-3153.38Show/hide
Query:  VEVVSGWEGSAAGLRVLRDAVSRPYRLRVPK--------------GFLAPYRGERYHLSDWRGVGNASTTVREFFNMKYSSTRNVITCRNTKRYRASVWV
        V V++GWEGSAA  R+LRDA+SRP RL+VPK              GFLAPYRG+RYHL +WRG  NA +T +EFFNMK+SS RNVI          +  V
Subjt:  VEVVSGWEGSAAGLRVLRDAVSRPYRLRVPK--------------GFLAPYRGERYHLSDWRGVGNASTTVREFFNMKYSSTRNVITCRNTKRYRASVWV

Query:  AKGAVG-FRGKPFYPIQVQCRTITACCLIHNLIIRKMGIGAMLDASDE
         KG     RGK +YP++VQCRTI ACCL+HNLI R+M    + D  DE
Subjt:  AKGAVG-FRGKPFYPIQVQCRTITACCLIHNLIIRKMGIGAMLDASDE

A0A5D3BDX0 Retrotransposon protein1.3e-3153.38Show/hide
Query:  VEVVSGWEGSAAGLRVLRDAVSRPYRLRVPK--------------GFLAPYRGERYHLSDWRGVGNASTTVREFFNMKYSSTRNVITCRNTKRYRASVWV
        V V++GWEGSAA  R+LRDA+SRP RL+VPK              GFLAPYRG+RYHL +WRG  NA +T +EFFNMK+SS RNVI          +  V
Subjt:  VEVVSGWEGSAAGLRVLRDAVSRPYRLRVPK--------------GFLAPYRGERYHLSDWRGVGNASTTVREFFNMKYSSTRNVITCRNTKRYRASVWV

Query:  AKGAVG-FRGKPFYPIQVQCRTITACCLIHNLIIRKMGIGAMLDASDE
         KG     RGK +YP++VQCRTI ACCL+HNLI R+M    + D  DE
Subjt:  AKGAVG-FRGKPFYPIQVQCRTITACCLIHNLIIRKMGIGAMLDASDE

A0A5D3DG22 Retrotransposon protein2.6e-3254.05Show/hide
Query:  VEVVSGWEGSAAGLRVLRDAVSRPYRLRVPK--------------GFLAPYRGERYHLSDWRGVGNASTTVREFFNMKYSSTRNVITCRNTKRYRASVWV
        V V+ GWEGSAA  R+LRDA+SR   L+VPK              GFLAPYRGERYHLS+WRG  NA TT REFFNMK+SS+RNVI          +  +
Subjt:  VEVVSGWEGSAAGLRVLRDAVSRPYRLRVPK--------------GFLAPYRGERYHLSDWRGVGNASTTVREFFNMKYSSTRNVITCRNTKRYRASVWV

Query:  AKGAVG-FRGKPFYPIQVQCRTITACCLIHNLIIRKMGIGAMLDASDE
         KG     RGK +YP+ VQCRTI ACCL+HNLI R+M    ++D  DE
Subjt:  AKGAVG-FRGKPFYPIQVQCRTITACCLIHNLIIRKMGIGAMLDASDE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G35695.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912)6.5e-1233.61Show/hide
Query:  VEVVSGWEGSAAGLRVLRDAVSRPYRLRV----PKGFLAPYRGERYHLSDWRGVGNASTTVREFFNMKYSSTRNVITCRNTKRYRASVWVAKGAVGFRGK
        + V+SGWEGSA   RVL DA+ + Y +         FLAP+RG RYHL ++ G      T  E FN+++ S RNVI           ++ ++ A+ F+  
Subjt:  VEVVSGWEGSAAGLRVLRDAVSRPYRLRV----PKGFLAPYRGERYHLSDWRGVGNASTTVREFFNMKYSSTRNVITCRNTKRYRASVWVAKGAVGFRGK

Query:  PFYPIQVQCRTITACCLIHNLI
        P +  + Q   +  C  +HN +
Subjt:  PFYPIQVQCRTITACCLIHNLI

AT5G41980.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912)1.1e-0626.92Show/hide
Query:  VVSGWEGSAAGLRVLRDAVSRPYRLRVPK--------------GFLAPYRGERYHLSDWRGVGNASTTVREFFNMKYSSTRNVITCRNTKRYRASVWVAK
        V++GWEGSA+  +VL  A++R  +L+VP+              GF+APY G            N+    +E FN ++      I       +R    + +
Subjt:  VVSGWEGSAAGLRVLRDAVSRPYRLRVPK--------------GFLAPYRGERYHLSDWRGVGNASTTVREFFNMKYSSTRNVITCRNTKRYRASVWVAK

Query:  GAVGFRGKPFYPIQVQCRTITACCLIHNLI
                P YP+Q Q + + A C +HN +
Subjt:  GAVGFRGKPFYPIQVQCRTITACCLIHNLI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCATCGATGGAAGGTGGAAGTGGTTTCAGGGTGGGAAGGGTCTGCAGCCGGTTTAAGGGTTCTTAGGGATGCGGTTTCGCGACCATACAGATTGAGAGTTCCAAAGGG
ATTCTTGGCGCCGTACAGAGGGGAACGATACCATCTCTCCGATTGGCGTGGAGTAGGAAACGCATCGACAACTGTAAGAGAATTTTTCAACATGAAATATTCTTCGACAA
GGAATGTGATAACTTGTAGAAATACAAAACGTTATCGAGCGAGCGTTTGGGTTGCTAAAGGGGCGGTGGGCTTCAGGGGAAAACCATTCTACCCAATACAAGTTCAATGT
CGAACAATAACTGCATGTTGCCTTATTCACAACTTGATCATAAGGAAGATGGGAATCGGTGCAATGCTTGACGCGTCCGATGAAGCATTGCACCGATTCAGGGAATTCTG
CATCAGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCATCGATGGAAGGTGGAAGTGGTTTCAGGGTGGGAAGGGTCTGCAGCCGGTTTAAGGGTTCTTAGGGATGCGGTTTCGCGACCATACAGATTGAGAGTTCCAAAGGG
ATTCTTGGCGCCGTACAGAGGGGAACGATACCATCTCTCCGATTGGCGTGGAGTAGGAAACGCATCGACAACTGTAAGAGAATTTTTCAACATGAAATATTCTTCGACAA
GGAATGTGATAACTTGTAGAAATACAAAACGTTATCGAGCGAGCGTTTGGGTTGCTAAAGGGGCGGTGGGCTTCAGGGGAAAACCATTCTACCCAATACAAGTTCAATGT
CGAACAATAACTGCATGTTGCCTTATTCACAACTTGATCATAAGGAAGATGGGAATCGGTGCAATGCTTGACGCGTCCGATGAAGCATTGCACCGATTCAGGGAATTCTG
CATCAGTTGA
Protein sequenceShow/hide protein sequence
MHRWKVEVVSGWEGSAAGLRVLRDAVSRPYRLRVPKGFLAPYRGERYHLSDWRGVGNASTTVREFFNMKYSSTRNVITCRNTKRYRASVWVAKGAVGFRGKPFYPIQVQC
RTITACCLIHNLIIRKMGIGAMLDASDEALHRFREFCIS