CuGenDBv2

ClCG04G007680 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG04G007680
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: Ty3-gypsy retrotransposon protein
Genome location: CG_Chr04:22648404..22655219
RNA-Seq Expression: ClCG04G007680
Synteny: ClCG04G007680
Gene Ontology terms: NA
InterPro domains: NA
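
The genome location field above follows a `seqid:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not say so), the string can be split and the locus span computed as follows. The helper name `parse_location` is hypothetical and not part of CuGenDB.

```python
import re


def parse_location(location: str):
    """Split 'seqid:start..end' into (seqid, start, end).

    Hypothetical helper; assumes 1-based inclusive coordinates,
    which is an assumption and not stated in the record.
    """
    match = re.fullmatch(r"(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match["seqid"], int(match["start"]), int(match["end"])


seqid, start, end = parse_location("CG_Chr04:22648404..22655219")
# Under the inclusive-coordinate assumption the span is end - start + 1.
span = end - start + 1
print(seqid, start, end, span)
```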


Homology
GenBank top hits | e value | % identity
KAE8647113.1 hypothetical protein Csa_021721 [Cucumis sativus] | 3.4e-13 | 46.74
Query:  NKEGLRAADHLIAHVEKTDYPQAGG--SSTTITPTKDISGKSNENQPYRRLTEEEMRIKKEKGICFKCDGKFSFGHRYKKKELQFMFIQDGE
        ++ G +A+D +      T  P      SSTTIT +++   KS  +QPYRR+T+ E+RIKKEKG+CF+CD KFS GHR K++EL  + +Q+GE
Subjt:  NKEGLRAADHLIAHVEKTDYPQAGG--SSTTITPTKDISGKSNENQPYRRLTEEEMRIKKEKGICFKCDGKFSFGHRYKKKELQFMFIQDGE

TYK19390.1 Transposon Ty3-I Gag-Pol polyprotein [Cucumis melo var. makuwa] | 1.4e-06 | 29.32
Query:  ISWDNPPRGGGSLEWTIMDKGKNVAKSSSWESKKDDEGKEGIRAPRSKEVPLFDMRLRKLEEKSVREYRRCFEQYSAGLKELEEKALESKFLIEEDQ---
        IS+D P     +L+W    + +N  K + W + K     E  R+ R  E  L+       +  +V EYR  F++  A L +L +K LE    +E  +   
Subjt:  ISWDNPPRGGGSLEWTIMDKGKNVAKSSSWESKKDDEGKEGIRAPRSKEVPLFDMRLRKLEEKSVREYRRCFEQYSAGLKELEEKALESKFLIEEDQ---

Query:  --KSNKEGLRAADHLIAHVEKTDYPQAGGSSTTITPTKDISGKSNENQPYRRLTEEEMRIKKEKGICFKCDGKFSFGHRYKKKELQFMFIQ
          +  +  L+A    I   E   Y  AGGS+         +G++ +  P +RL++ E + K+EKG+CFKCD K+  GH+ K+ E++ + I+
Subjt:  --KSNKEGLRAADHLIAHVEKTDYPQAGGSSTTITPTKDISGKSNENQPYRRLTEEEMRIKKEKGICFKCDGKFSFGHRYKKKELQFMFIQ

XP_022154744.1 uncharacterized protein LOC111021922 [Momordica charantia] | 2.2e-07 | 34.21
Query:  SKEVPLFDMRLRKLEEKSVREYRRCFEQYSAGLKELEEKALESKFLIEEDQKSNKEGLRAADHLIAHVEKTDYPQAGGSSTTITPTKDISGKS-------
        SK+  L    L   +E +V EYR+ FE +SA L ++ E  +E   +      S  +G R     +A+  KT  P+   S T  T T  +SGKS       
Subjt:  SKEVPLFDMRLRKLEEKSVREYRRCFEQYSAGLKELEEKALESKFLIEEDQKSNKEGLRAADHLIAHVEKTDYPQAGGSSTTITPTKDISGKS-------

Query:  NENQPYRRLTEEEMRIKKEKGICFKCDGKFSFGHRYKKKELQFMFIQDGEGV
         + Q  ++LTE E + +K+KG+CF+ + K+S GHR K +EL+   + D EG+
Subjt:  NENQPYRRLTEEEMRIKKEKGICFKCDGKFSFGHRYKKKELQFMFIQDGEGV

XP_024017591.1 uncharacterized protein LOC112090471 [Morus notabilis] | 2.2e-07 | 29.69
Query:  SKEVPLFDMRLRKLEEKSVREYRRCFEQYSAGLKELEEKALESKF-------LIEEDQKSNKEGLRAADHLIAHVE------------------------
        ++E  L +  L   +E +VREYRR FE  +A L E+ E+ LES F       +  E +     GL         VE                        
Subjt:  SKEVPLFDMRLRKLEEKSVREYRRCFEQYSAGLKELEEKALESKF-------LIEEDQKSNKEGLRAADHLIAHVE------------------------

Query:  -------KTDYPQAGGSSTTITPT-----KDISGKSNEN----QPYRRLTEEEMRIKKEKGICFKCDGKFSFGHRYKKKELQFMFIQDGEGV
               ++     GGS  T T         IS K+ E+     PYRRLT+ +++ K+EKG+C++CDGK+SFG+R   +ELQ + +++ + V
Subjt:  -------KTDYPQAGGSSTTITPT-----KDISGKSNEN----QPYRRLTEEEMRIKKEKGICFKCDGKFSFGHRYKKKELQFMFIQDGEGV

XP_031737572.1 uncharacterized protein LOC116402461 [Cucumis sativus] | 1.5e-08 | 52.46
Query:  ITPTKDISGKSNENQPYRRLTEEEMRIKKEKGICFKCDGKFSFGHRYKKKELQFMFIQDGE
        +T  ++   KS  +QPYR++T+ EMRIKKEKG CF+CD KFS  HR K++EL  + +Q+GE
Subjt:  ITPTKDISGKSNENQPYRRLTEEEMRIKKEKGICFKCDGKFSFGHRYKKKELQFMFIQDGE
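
The % identity column can be cross-checked against the alignment midline, where a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch. The sketch below applies this to the XP_031737572.1 alignment above, assuming identity is computed as identical columns divided by alignment length. The function name `percent_identity` is hypothetical.

```python
def percent_identity(midline: str, aligned_length: int) -> float:
    """Percent identity from a BLAST-style midline.

    Letters in the midline mark identical residues; '+' and spaces do not count.
    Assumes identity = identical columns / alignment length (an assumption).
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / aligned_length


# Query and midline copied from the XP_031737572.1 alignment (no gap columns).
query = "ITPTKDISGKSNENQPYRRLTEEEMRIKKEKGICFKCDGKFSFGHRYKKKELQFMFIQDGE"
midline = "+T  ++   KS  +QPYR++T+ EMRIKKEKG CF+CD KFS  HR K++EL  + +Q+GE"

pct = percent_identity(midline, len(query))
print(f"{pct:.2f}")  # compare with the 52.46 reported for this hit
```

For alignments that contain gap characters (`-`), the alignment length should be the number of columns including gaps, not the ungapped query length.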

TrEMBL top hits | e value | % identity
A0A5D3BJ44 Retrotransposon protein, putative, unclassified | 9.8e-06 | 30.07
Query:  LRKLEEKSVREYRRCFEQYSAGLKELEEKALESKF-------LIEEDQKSNKEGLRAADHLIAHVEKTDYPQAGGSST-----------TITPTKDISGK
        LR  +E SV EYR  F++  A L ++ EK +E  F       +  E      +GL     +   +  +  P  G  +T           TIT     + +
Subjt:  LRKLEEKSVREYRRCFEQYSAGLKELEEKALESKF-------LIEEDQKSNKEGLRAADHLIAHVEKTDYPQAGGSST-----------TITPTKDISGK

Query:  SNENQPYRRLTEEEMRIKKEKGICFKCDGKFSFGHRYKKKELQ--FMFIQDGE
          +   +RRL + E + +KEKG+CF+C+ K+S  H+ K KEL+   MF+  GE
Subjt:  SNENQPYRRLTEEEMRIKKEKGICFKCDGKFSFGHRYKKKELQ--FMFIQDGE

A0A5D3C895 Ty3-gypsy retrotransposon protein | 8.8e-07 | 31.54
Query:  EEKSVREYRRCFEQYSAGLKELEEKALESKFLIEEDQKSNKE-----------GLRAADHLIAHVEKTD-----------YPQAGGSSTTITPTKDISGK
        +E SV EY + FE+ S  L E+ E+ LE  F    D    KE            + AA+     +E++             P    S+T     K + G 
Subjt:  EEKSVREYRRCFEQYSAGLKELEEKALESKFLIEEDQKSNKE-----------GLRAADHLIAHVEKTD-----------YPQAGGSSTTITPTKDISGK

Query:  SNENQPYRRLTEEEMRIKKEKGICFKCDGKFSFGHRYKKKELQFMFIQD
             PYRR T+ E++ +KEKG+C++CD  FS GHR K K+L    + D
Subjt:  SNENQPYRRLTEEEMRIKKEKGICFKCDGKFSFGHRYKKKELQFMFIQD

A0A5D3D7H2 Transposon Ty3-I Gag-Pol polyprotein | 6.8e-07 | 29.32
Query:  ISWDNPPRGGGSLEWTIMDKGKNVAKSSSWESKKDDEGKEGIRAPRSKEVPLFDMRLRKLEEKSVREYRRCFEQYSAGLKELEEKALESKFLIEEDQ---
        IS+D P     +L+W    + +N  K + W + K     E  R+ R  E  L+       +  +V EYR  F++  A L +L +K LE    +E  +   
Subjt:  ISWDNPPRGGGSLEWTIMDKGKNVAKSSSWESKKDDEGKEGIRAPRSKEVPLFDMRLRKLEEKSVREYRRCFEQYSAGLKELEEKALESKFLIEEDQ---

Query:  --KSNKEGLRAADHLIAHVEKTDYPQAGGSSTTITPTKDISGKSNENQPYRRLTEEEMRIKKEKGICFKCDGKFSFGHRYKKKELQFMFIQ
          +  +  L+A    I   E   Y  AGGS+         +G++ +  P +RL++ E + K+EKG+CFKCD K+  GH+ K+ E++ + I+
Subjt:  --KSNKEGLRAADHLIAHVEKTDYPQAGGSSTTITPTKDISGKSNENQPYRRLTEEEMRIKKEKGICFKCDGKFSFGHRYKKKELQFMFIQ

A0A6J1DN22 Reverse transcriptase | 1.0e-07 | 34.21
Query:  SKEVPLFDMRLRKLEEKSVREYRRCFEQYSAGLKELEEKALESKFLIEEDQKSNKEGLRAADHLIAHVEKTDYPQAGGSSTTITPTKDISGKS-------
        SK+  L    L   +E +V EYR+ FE +SA L ++ E  +E   +      S  +G R     +A+  KT  P+   S T  T T  +SGKS       
Subjt:  SKEVPLFDMRLRKLEEKSVREYRRCFEQYSAGLKELEEKALESKFLIEEDQKSNKEGLRAADHLIAHVEKTDYPQAGGSSTTITPTKDISGKS-------

Query:  NENQPYRRLTEEEMRIKKEKGICFKCDGKFSFGHRYKKKELQFMFIQDGEGV
         + Q  ++LTE E + +K+KG+CF+ + K+S GHR K +EL+   + D EG+
Subjt:  NENQPYRRLTEEEMRIKKEKGICFKCDGKFSFGHRYKKKELQFMFIQDGEGV

W9R6D8 Uncharacterized protein | 1.5e-06 | 27.69
Query:  DEGKEGIRA-PRSKEVPLFDMRLRKLEEKSVREYRRCFEQYSAGLKELEEKALESKF----------------------LIEEDQKSNKEGLRAADHLIA
        ++G++ IR     KE  L +  L   +E SVR+YRR FE  +A L+++ E+ LES F                      ++E  Q+  +  L        
Subjt:  DEGKEGIRA-PRSKEVPLFDMRLRKLEEKSVREYRRCFEQYSAGLKELEEKALESKF----------------------LIEEDQKSNKEGLRAADHLIA

Query:  HVEKTDYPQAGGSSTTI----------TPT---KDISGKSNENQ-----PYRRLTEEEMRIKKEKGICFKCDGKFSFGHRYKKKELQFMFIQDGE
           ++ + +  G  TT            PT   K +  +   NQ     P+RR+ + E++ K+EKG+C++CDGK+  GHR   +ELQ + +++ E
Subjt:  HVEKTDYPQAGGSSTTI----------TPT---KDISGKSNENQ-----PYRRLTEEEMRIKKEKGICFKCDGKFSFGHRYKKKELQFMFIQDGE

SwissProt top hits | e value | % identity
No hits found
Arabidopsis top hits | e value | % identity
No hits found

Sequences
CDS sequence
ATGGCCACCCAGATGGTGTTTTTGGCTGATGATAATTCCCTCATAGGCTGGTTAGCTGAGTTGGATGCCTTTCAAAGCAAACTTGCTGAGAATTATGGTGTAGCT
CAATCTTTTGCTTGTGTTGTACCTTCAACCATCCAAATACTCATCAAAGCCCCCAGTAAGATTTGCAAGGACATTGAAAAATTTATGCATAACTCCTTTTGGGAA
GGGGCGGTGGAAGAAGGATGTGGTTTCTGCTTGTACCCTTCGACACAAGGTTATTGCTTGCAAGCCAAGGCCTACCAATTTAGTATCAAAGTAGTCAATCTTGGG
TATGGTATGGTGGGAAAATCGGAGAGCAGAGTAATTACACTGGAGGAGAAGCTATCAGAGGTGGGTGAGAAACAAGAGGCGTTGGAAAACAAGATGGAGTCAGGC
TTCGTTGAGTTGGCCGACAAGATCGATACCCTGACTAGATACTTACGCGCACCTCTAAAAATCTCATGGGACAACCCGCCAAGAGGAGGAGGATCATTAGAATGG
ACGATAATGGATAAAGGGAAAAATGTTGCCAAATCGTCGTCTTGGGAATCAAAGAAAGATGATGAAGGAAAGGAAGGGATACGTGCACCTAGGAGTAAAGAAGTT
CCCCTGTTTGATATGAGATTGCGCAAGTTGGAAGAGAAGTCAGTGAGGGAATATCGCCGGTGTTTTGAGCAATATTCGGCTGGATTGAAAGAGCTGGAGGAGAAA
GCCTTAGAAAGTAAGTTTTTGATTGAGGAAGACCAGAAGTCCAACAAAGAAGGATTAAGGGCGGCGGATCATCTAATAGCCCACGTGGAAAAAACGGACTATCCA
CAAGCAGGGGGTTCATCCACAACGATTACTCCAACGAAAGATATTAGTGGCAAGAGCAACGAAAATCAACCATACCGGCGTTTGACCGAAGAGGAGATGAGGATC
AAGAAGGAGAAAGGAATATGTTTCAAGTGTGACGGAAAGTTTAGTTTCGGCCACCGCTACAAAAAGAAAGAATTACAGTTCATGTTCATTCAAGATGGTGAAGGC
GTCTAG
mRNA sequence
ATGGCCACCCAGATGGTGTTTTTGGCTGATGATAATTCCCTCATAGGCTGGTTAGCTGAGTTGGATGCCTTTCAAAGCAAACTTGCTGAGAATTATGGTGTAGCT
CAATCTTTTGCTTGTGTTGTACCTTCAACCATCCAAATACTCATCAAAGCCCCCAGTAAGATTTGCAAGGACATTGAAAAATTTATGCATAACTCCTTTTGGGAA
GGGGCGGTGGAAGAAGGATGTGGTTTCTGCTTGTACCCTTCGACACAAGGTTATTGCTTGCAAGCCAAGGCCTACCAATTTAGTATCAAAGTAGTCAATCTTGGG
TATGGTATGGTGGGAAAATCGGAGAGCAGAGTAATTACACTGGAGGAGAAGCTATCAGAGGTGGGTGAGAAACAAGAGGCGTTGGAAAACAAGATGGAGTCAGGC
TTCGTTGAGTTGGCCGACAAGATCGATACCCTGACTAGATACTTACGCGCACCTCTAAAAATCTCATGGGACAACCCGCCAAGAGGAGGAGGATCATTAGAATGG
ACGATAATGGATAAAGGGAAAAATGTTGCCAAATCGTCGTCTTGGGAATCAAAGAAAGATGATGAAGGAAAGGAAGGGATACGTGCACCTAGGAGTAAAGAAGTT
CCCCTGTTTGATATGAGATTGCGCAAGTTGGAAGAGAAGTCAGTGAGGGAATATCGCCGGTGTTTTGAGCAATATTCGGCTGGATTGAAAGAGCTGGAGGAGAAA
GCCTTAGAAAGTAAGTTTTTGATTGAGGAAGACCAGAAGTCCAACAAAGAAGGATTAAGGGCGGCGGATCATCTAATAGCCCACGTGGAAAAAACGGACTATCCA
CAAGCAGGGGGTTCATCCACAACGATTACTCCAACGAAAGATATTAGTGGCAAGAGCAACGAAAATCAACCATACCGGCGTTTGACCGAAGAGGAGATGAGGATC
AAGAAGGAGAAAGGAATATGTTTCAAGTGTGACGGAAAGTTTAGTTTCGGCCACCGCTACAAAAAGAAAGAATTACAGTTCATGTTCATTCAAGATGGTGAAGGC
GTCTAG
Protein sequence
MATQMVFLADDNSLIGWLAELDAFQSKLAENYGVAQSFACVVPSTIQILIKAPSKICKDIEKFMHNSFWEGAVEEGCGFCLYPSTQGYCLQAKAYQFSIKVVNLG
YGMVGKSESRVITLEEKLSEVGEKQEALENKMESGFVELADKIDTLTRYLRAPLKISWDNPPRGGGSLEWTIMDKGKNVAKSSSWESKKDDEGKEGIRAPRSKEV
PLFDMRLRKLEEKSVREYRRCFEQYSAGLKELEEKALESKFLIEEDQKSNKEGLRAADHLIAHVEKTDYPQAGGSSTTITPTKDISGKSNENQPYRRLTEEEMRI
KKEKGICFKCDGKFSFGHRYKKKELQFMFIQDGEGV
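
The protein sequence can be checked against the CDS by translating it codon by codon. The following is a minimal sketch, assuming the standard genetic code; the helper name `translate` is hypothetical. To keep the example short, it translates only the first 30 and last 18 nucleotides of the CDS listed above, rather than the full sequence.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G (last base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nt and last 18 nt of the CDS shown in this record.
cds_start = "ATGGCCACCCAGATGGTGTTTTTGGCTGAT"
cds_end = "GATGGTGAAGGCGTCTAG"

print(translate(cds_start))  # should match the start of the protein sequence
print(translate(cds_end))    # should match the end of the protein sequence
```

Concatenating the CDS lines and passing the whole string to `translate` extends the same check to the entire protein.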