; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc06G08770 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc06G08770
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionTy3-gypsy retrotransposon protein
Genome locationClcChr06:10429867..10430410
RNA-Seq ExpressionClc06G08770
SyntenyClc06G08770
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022848903.1 uncharacterized protein LOC111371244 [Olea europaea var. sylvestris]2.6e-1539.45Show/hide
Query:  NLSLSSMLQLTVSPTEGYGVVLGTGGLVWAAGICKGLLLSISDLSIIHDLLLLPLGLADVILGVAWQERLGKIEFDYRSSVMNFCVGDWWVELWGDQSLV
        ++ L++ LQ+   PT GYG+++GTG  V   G+CKG+++S+  + ++ D L L LG  DVILG+ W E +GK++ D+ +  M   VG   V L GD SL 
Subjt:  NLSLSSMLQLTVSPTEGYGVVLGTGGLVWAAGICKGLLLSISDLSIIHDLLLLPLGLADVILGVAWQERLGKIEFDYRSSVMNFCVGDWWVELWGDQSLV

Query:  RSQVSLRSI
        ++Q S++++
Subjt:  RSQVSLRSI

XP_024030016.1 uncharacterized protein LOC112094120 [Morus notabilis]4.4e-1539.26Show/hide
Query:  TEMLKDVEQGKD----LENTAT-KMANLSLSSMLQLTVSPTEGYGVVLGTGGLVWAAGICKGLLLSISDLSIIHDLLLLPLGLADVILGVAWQERLGKIE
        T  LK V +GK+    +++ AT    ++ L   ++LTV     YGVV+GTG  V   G+C+G+++S+ D+ I+ D L L LG +D+ILG+ W E LG + 
Subjt:  TEMLKDVEQGKD----LENTAT-KMANLSLSSMLQLTVSPTEGYGVVLGTGGLVWAAGICKGLLLSISDLSIIHDLLLLPLGLADVILGVAWQERLGKIE

Query:  FDYRSSVMNFCVGDWWVELWGDQSLVRSQVSLRSI
         ++RS  M F VG+  V L GD  + R+ VSL+++
Subjt:  FDYRSSVMNFCVGDWWVELWGDQSLVRSQVSLRSI

XP_031737572.1 uncharacterized protein LOC116402461 [Cucumis sativus]5.8e-2358.82Show/hide
Query:  LQLTVSPTEGYGVVLGTGGLVWAAGICKGLLLSISDLSIIHDLLLLPLGLADVILGVAWQERLGKIEFDYRSSVMNFCVGDWWVELWGDQSLVRSQVSLR
        L+++V   + YGVVLGTGG+V A G+CK + L I++LSI H+ L LPLG  DVILGV W E LGK+ FDY+ S M F  G+W V L GD+SLVRSQVSL+
Subjt:  LQLTVSPTEGYGVVLGTGGLVWAAGICKGLLLSISDLSIIHDLLLLPLGLADVILGVAWQERLGKIEFDYRSSVMNFCVGDWWVELWGDQSLVRSQVSLR

Query:  SI
        S+
Subjt:  SI

XP_038904464.1 uncharacterized protein LOC120090832 [Benincasa hispida]1.7e-1948.11Show/hide
Query:  LSSMLQLTVSPTEGYGVVLGTGGLVWAAGICKGLLLSISDLSIIHDLLLLPLGLADVILGVAWQERLGKIEFDYRSSVMNFCVGDWWVELWGDQSLVRSQ
        L S L+L   PT  YG++LG G  V   G+CKG++L++S+L+II+DL  LPLG  DV+LGV W   LG++E D+ +S M F +GDW V L G+++L+++Q
Subjt:  LSSMLQLTVSPTEGYGVVLGTGGLVWAAGICKGLLLSISDLSIIHDLLLLPLGLADVILGVAWQERLGKIEFDYRSSVMNFCVGDWWVELWGDQSLVRSQ

Query:  VSLRSI
        +SL+S+
Subjt:  VSLRSI

XP_038907170.1 uncharacterized protein LOC120092972 [Benincasa hispida]9.5e-1849.06Show/hide
Query:  LSSMLQLTVSPTEGYGVVLGTGGLVWAAGICKGLLLSISDLSIIHDLLLLPLGLADVILGVAWQERLGKIEFDYRSSVMNFCVGDWWVELWGDQSLVRSQ
        L S L L  SPT   G++LGTG  V   GICKG++L++ +L+II+D   +PLG ADV++GV W   LG++E D+ +S M+F VG+  V L  D+SLV+SQ
Subjt:  LSSMLQLTVSPTEGYGVVLGTGGLVWAAGICKGLLLSISDLSIIHDLLLLPLGLADVILGVAWQERLGKIEFDYRSSVMNFCVGDWWVELWGDQSLVRSQ

Query:  VSLRSI
        +SL+S+
Subjt:  VSLRSI

TrEMBL top hitse value%identityAlignment
A0A5D3BD16 Ty3/gypsy retrotransposon protein1.5e-1340.19Show/hide
Query:  NLSLSSMLQLTVSPTEGYGVVLGTGGLVWAAGICKGLLLSISDLSIIHDLLLLPLGLADVILGVAWQERLGKIEFDYRSSVMNFCVGDWWVELWGDQSLV
        +L L   L+L ++ T  YGV++G+G  V   GICKG+ + +  +SI+ D L L LG  D++LG+ W ++ G +  D+++  M F VGD  V L GD SL 
Subjt:  NLSLSSMLQLTVSPTEGYGVVLGTGGLVWAAGICKGLLLSISDLSIIHDLLLLPLGLADVILGVAWQERLGKIEFDYRSSVMNFCVGDWWVELWGDQSLV

Query:  RSQVSLR
        R ++SL+
Subjt:  RSQVSLR

A0A5D3CTU6 Ty3-gypsy retrotransposon protein3.7e-1535.36Show/hide
Query:  VQDREDMLPEEGEPTEMLKDVEQGKDLENTATKMANLSLSSM----------------------------------------LQLTVSPTEGYGVVLGTG
        VQ+RED+     E T+ + +  + K  E    ++ANLSL+S+                                        L++++   + YG+VLGT 
Subjt:  VQDREDMLPEEGEPTEMLKDVEQGKDLENTATKMANLSLSSM----------------------------------------LQLTVSPTEGYGVVLGTG

Query:  GLVWAAGICKGLLLSISDLSIIHDLLLLPLGLADVILGVAWQERLGKIEFDYRSSVMNFCVGDWW-----------VELWG
        G+V AAG+CK + L+I++LSI HD L LPLG ADVILGV W E LGK+ F Y+ S M F +G+W            VE WG
Subjt:  GLVWAAGICKGLLLSISDLSIIHDLLLLPLGLADVILGVAWQERLGKIEFDYRSSVMNFCVGDWW-----------VELWG

A0A6C0T5F8 Retrotrans_gag domain-containing protein9.0e-1442.74Show/hide
Query:  ENTATKMANLSLSSMLQLTVSPTEGYGVVLGTGGLVWAAGICKGLLLSISDLSIIHDLLLLPLGLADVILGVAWQERLGKIEFDYRSSVMNFCVGDWWVE
        EN A       L   ++  + P E +GV+LG G  +  AGIC+ L + I  + I+ D LLL LG  D+ILG+ W ERLG+I  ++R  +M F   +  VE
Subjt:  ENTATKMANLSLSSMLQLTVSPTEGYGVVLGTGGLVWAAGICKGLLLSISDLSIIHDLLLLPLGLADVILGVAWQERLGKIEFDYRSSVMNFCVGDWWVE

Query:  LWGDQSLVRSQVSLRSI
        L GD SL  SQVSL++I
Subjt:  LWGDQSLVRSQVSLRSI

A0A803PBL9 Uncharacterized protein2.4e-1442.86Show/hide
Query:  VSPTEGYGVVLGTGGLVWAAGICKGLLLSISDLSIIHDLLLLPLGLADVILGVAWQERLGKIEFDYRSSVMNFCVGDWWVELWGDQSLVRSQVSLRSI
        ++ T GYG++LGTG  V A G+C  + L + ++ ++ D L L LG  DVILGV W E LG ++ ++R+ VM F  G  WV L GD SL +S ++L+++
Subjt:  VSPTEGYGVVLGTGGLVWAAGICKGLLLSISDLSIIHDLLLLPLGLADVILGVAWQERLGKIEFDYRSSVMNFCVGDWWVELWGDQSLVRSQVSLRSI

A0A803PSM5 Uncharacterized protein8.2e-1545Show/hide
Query:  LTVSPTEGYGVVLGTGGLVWAAGICKGLLLSISDLSIIHDLLLLPLGLADVILGVAWQERLGKIEFDYRSSVMNFCVGDWWVELWGDQSLVRSQVSLRSI
        + ++ T GYG++LGTG  V A G+C  + L +  L ++ D L L LG ADVILGV W E LG ++ ++R+ VM F     WV L GD SL +SQ+SL+++
Subjt:  LTVSPTEGYGVVLGTGGLVWAAGICKGLLLSISDLSIIHDLLLLPLGLADVILGVAWQERLGKIEFDYRSSVMNFCVGDWWVELWGDQSLVRSQVSLRSI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTGTACAGGATAGGGAAGACATGTTGCCGGAGGAAGGGGAGCCGACGGAAATGCTAAAGGATGTTGAACAAGGGAAGGATCTTGAAAACACAGCAACCAAGATGGC
AAATTTATCTCTGAGTTCCATGCTCCAATTAACGGTGTCTCCTACGGAAGGTTATGGGGTTGTCTTAGGCACCGGGGGATTAGTTTGGGCAGCTGGTATTTGCAAAGGTC
TGTTACTCTCCATCTCTGATCTTTCCATAATTCATGATCTTTTGCTGCTGCCATTAGGCCTTGCTGATGTGATTTTGGGGGTCGCGTGGCAAGAGAGACTTGGAAAAATT
GAGTTTGATTACCGATCCTCTGTCATGAATTTTTGTGTGGGTGACTGGTGGGTGGAATTATGGGGAGACCAGAGTTTAGTGAGATCACAGGTTTCGCTGCGATCAATATG
A
mRNA sequenceShow/hide mRNA sequence
ATGTTTGTACAGGATAGGGAAGACATGTTGCCGGAGGAAGGGGAGCCGACGGAAATGCTAAAGGATGTTGAACAAGGGAAGGATCTTGAAAACACAGCAACCAAGATGGC
AAATTTATCTCTGAGTTCCATGCTCCAATTAACGGTGTCTCCTACGGAAGGTTATGGGGTTGTCTTAGGCACCGGGGGATTAGTTTGGGCAGCTGGTATTTGCAAAGGTC
TGTTACTCTCCATCTCTGATCTTTCCATAATTCATGATCTTTTGCTGCTGCCATTAGGCCTTGCTGATGTGATTTTGGGGGTCGCGTGGCAAGAGAGACTTGGAAAAATT
GAGTTTGATTACCGATCCTCTGTCATGAATTTTTGTGTGGGTGACTGGTGGGTGGAATTATGGGGAGACCAGAGTTTAGTGAGATCACAGGTTTCGCTGCGATCAATATG
A
Protein sequenceShow/hide protein sequence
MFVQDREDMLPEEGEPTEMLKDVEQGKDLENTATKMANLSLSSMLQLTVSPTEGYGVVLGTGGLVWAAGICKGLLLSISDLSIIHDLLLLPLGLADVILGVAWQERLGKI
EFDYRSSVMNFCVGDWWVELWGDQSLVRSQVSLRSI