CuGenDBv2

ClCG01G011645 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG01G011645
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: Retrotransposon protein
Genome location: CG_Chr01:19493539..19494309
RNA-Seq Expression: ClCG01G011645
Synteny: ClCG01G011645
Gene Ontology terms: NA
InterPro domains: NA
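The genome location above follows a `chromosome:start..end` pattern. The sketch below shows one way to split such a string into its parts and derive the span length. The `Location` type and `parse_location` function are hypothetical names introduced here for illustration, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval (hypothetical helper type, not part of CuGenDB)."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'chrom:start..end' string into a Location."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end = match.groups()
    return Location(chrom, int(start), int(end))


loc = parse_location("CG_Chr01:19493539..19494309")
print(loc.chrom, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the span for this gene works out to 771 bp.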


Homology
GenBank top hits (e value, % identity, alignment)
KAA0067273.1 retrotransposon protein [Cucumis melo var. makuwa] (e value: 6.9e-25; % identity: 37.3)
Query:  MLGLGYSGFGWNAERKCIDCEAKIFDTCVKSHPSAKELRHKSFPYYDDLAIVFGKDRATGSHATTTAKVGFEPVMEEENEDILNNQSLDFENFYIPDPPF
        M+G   S FGWN E+KCI+ +  +FD  VK HP+A+ L +KSFPY+ DL I+FG+DRATG    T  ++G +   + E +D+     ++ E+F IP+P  
Subjt:  MLGLGYSGFGWNAERKCIDCEAKIFDTCVKSHPSAKELRHKSFPYYDDLAIVFGKDRATGSHATTTAKVGFEPVMEEENEDILNNQSLDFENFYIPDPPF

Query:  GSSPMSDDIPTTPSGRGSE--SSLPSRSRRSRISSIGEYNEVVREGFQLLTKSIDDIAQWLVMNKDLARRRCRELYAELQSIPGL
           P  +D+ +TP+    +  SS PS+ RRS    +    +  R   +  +K I  IA W     ++     + LYAELQ+IPG+
Subjt:  GSSPMSDDIPTTPSGRGSE--SSLPSRSRRSRISSIGEYNEVVREGFQLLTKSIDDIAQWLVMNKDLARRRCRELYAELQSIPGL

TYK07921.1 hypothetical protein E5676_scaffold265G00330 [Cucumis melo var. makuwa] (e value: 1.4e-25; % identity: 37.84)
Query:  MLGLGYSGFGWNAERKCIDCEAKIFDTCVKSHPSAKELRHKSFPYYDDLAIVFGKDRATGSHATTTAKVGFEPVMEEENEDILNNQSLDFENFYIPDPPF
        M+G   SGFGWN  +KCI+ E  +FD  VK HP+A+ L +K FPY+ DL +VFG+DRATG    T  ++  +   + E +D+     ++ E+F IP+P  
Subjt:  MLGLGYSGFGWNAERKCIDCEAKIFDTCVKSHPSAKELRHKSFPYYDDLAIVFGKDRATGSHATTTAKVGFEPVMEEENEDILNNQSLDFENFYIPDPPF

Query:  GSSPMSDDIPTTPSG--RGSESSLPSRSRRSRISSIGEYNEVVREGFQLLTKSIDDIAQWLVMNKDLARRRCRELYAELQSIPGL
           P  +D+P+TP+     + SS PS+ RR   S  G+  +  R   +  +K I  IA W     ++     + LYAELQ+IPG+
Subjt:  GSSPMSDDIPTTPSG--RGSESSLPSRSRRSRISSIGEYNEVVREGFQLLTKSIDDIAQWLVMNKDLARRRCRELYAELQSIPGL

TYK10886.1 retrotransposon protein [Cucumis melo var. makuwa] (e value: 4.5e-24; % identity: 37.04)
Query:  MLGLGYSGFGWNAERKCIDCEAKIFDTCVKSHPSAKELRHKSFPYYDDLAIVFGKDRATGSHATTTAKVGFEPVMEEENEDILNNQSLDFENFYIPDPPF
        M+G   S FGWN E+KCI+ +  +FD  VK HP+A+ L +KSFPY+ DL I+FG+DRATG    T  ++G +   + E +D+     ++ E+F IP+P  
Subjt:  MLGLGYSGFGWNAERKCIDCEAKIFDTCVKSHPSAKELRHKSFPYYDDLAIVFGKDRATGSHATTTAKVGFEPVMEEENEDILNNQSLDFENFYIPDPPF

Query:  GSSPMSDDIPTTPSGRGSE--SSLPSRSRRSRISSIGEYNEVVREGF----QLLTKSIDDIAQWLVMNKDLARRRCRELYAELQSIPGL
           P  +D+ +TP+    +  SS PS+ RRS       Y+E + + F    +  +K I  I  W     ++     + LYAELQ+IPG+
Subjt:  GSSPMSDDIPTTPSGRGSE--SSLPSRSRRSRISSIGEYNEVVREGF----QLLTKSIDDIAQWLVMNKDLARRRCRELYAELQSIPGL

XP_008460440.1 PREDICTED: uncharacterized protein LOC103499248 [Cucumis melo] (e value: 3.4e-24; % identity: 36.96)
Query:  LGLGYSGFGWNAERKCIDCEAKIFDTCVKSHPSAKELRHKSFPYYDDLAIVFGKDRATGSHATTTAKVGFEPVMEEENEDILNNQSLDFENFYIPDPPFG
        +G   S FGWN E+KCI+ +  +FD  VK HP+A+ L +KSFPY+ DL I+FG+DRATG    T  ++G +   + E +D+     ++ E+F IP+P   
Subjt:  LGLGYSGFGWNAERKCIDCEAKIFDTCVKSHPSAKELRHKSFPYYDDLAIVFGKDRATGSHATTTAKVGFEPVMEEENEDILNNQSLDFENFYIPDPPFG

Query:  SSPMSDDIPTTPSGRGSE--SSLPSRSRRSRISSIGEYNEVVREGFQLLTKSIDDIAQWLVMNKDLARRRCRELYAELQSIPGL
          P  +D+ +TP+    +  SS PS+ RRS    +    +  R   +  +K I  IA W     ++     + LYAELQ+IPG+
Subjt:  SSPMSDDIPTTPSGRGSE--SSLPSRSRRSRISSIGEYNEVVREGFQLLTKSIDDIAQWLVMNKDLARRRCRELYAELQSIPGL

XP_022158565.1 uncharacterized protein LOC111025018 [Momordica charantia] (e value: 2.1e-21; % identity: 33.68)
Query:  GFGWNAERKCIDCEAKIFDTCVKSHPSAKELRHKSFPYYDDLAIVFGKDRATGSHATTTAKVGFEPVMEEENEDILNNQSLDFENFYIPDPPFGSS---P
        GFGWN + KCI+ E ++FD  VKSHP+AK LR+K  P+YDDL + FGKDRATG++      +            I  +   + ++FYIPDPP  ++    
Subjt:  GFGWNAERKCIDCEAKIFDTCVKSHPSAKELRHKSFPYYDDLAIVFGKDRATGSHATTTAKVGFEPVMEEENEDILNNQSLDFENFYIPDPPFGSS---P

Query:  MSDDIPTTPSGRGSESSLPSRSRRSRISSIGEYNEVVREGFQLLTKSIDDIAQWLVMNKDLARRRCRELYAELQSIPGL-SNAVKPLCGLATT
        + +D+P TP+ + +  +    S+R R     E  +VVR   ++ T  ++ +A W    ++    R + ++ +L+ IP L +N    L G+  T
Subjt:  MSDDIPTTPSGRGSESSLPSRSRRSRISSIGEYNEVVREGFQLLTKSIDDIAQWLVMNKDLARRRCRELYAELQSIPGL-SNAVKPLCGLATT

TrEMBL top hits (e value, % identity, alignment)
A0A1S3CC17 uncharacterized protein LOC103499248 (e value: 1.7e-24; % identity: 36.96)
Query:  LGLGYSGFGWNAERKCIDCEAKIFDTCVKSHPSAKELRHKSFPYYDDLAIVFGKDRATGSHATTTAKVGFEPVMEEENEDILNNQSLDFENFYIPDPPFG
        +G   S FGWN E+KCI+ +  +FD  VK HP+A+ L +KSFPY+ DL I+FG+DRATG    T  ++G +   + E +D+     ++ E+F IP+P   
Subjt:  LGLGYSGFGWNAERKCIDCEAKIFDTCVKSHPSAKELRHKSFPYYDDLAIVFGKDRATGSHATTTAKVGFEPVMEEENEDILNNQSLDFENFYIPDPPFG

Query:  SSPMSDDIPTTPSGRGSE--SSLPSRSRRSRISSIGEYNEVVREGFQLLTKSIDDIAQWLVMNKDLARRRCRELYAELQSIPGL
          P  +D+ +TP+    +  SS PS+ RRS    +    +  R   +  +K I  IA W     ++     + LYAELQ+IPG+
Subjt:  SSPMSDDIPTTPSGRGSE--SSLPSRSRRSRISSIGEYNEVVREGFQLLTKSIDDIAQWLVMNKDLARRRCRELYAELQSIPGL

A0A5A7VGQ0 Retrotransposon protein (e value: 3.4e-25; % identity: 37.3)
Query:  MLGLGYSGFGWNAERKCIDCEAKIFDTCVKSHPSAKELRHKSFPYYDDLAIVFGKDRATGSHATTTAKVGFEPVMEEENEDILNNQSLDFENFYIPDPPF
        M+G   S FGWN E+KCI+ +  +FD  VK HP+A+ L +KSFPY+ DL I+FG+DRATG    T  ++G +   + E +D+     ++ E+F IP+P  
Subjt:  MLGLGYSGFGWNAERKCIDCEAKIFDTCVKSHPSAKELRHKSFPYYDDLAIVFGKDRATGSHATTTAKVGFEPVMEEENEDILNNQSLDFENFYIPDPPF

Query:  GSSPMSDDIPTTPSGRGSE--SSLPSRSRRSRISSIGEYNEVVREGFQLLTKSIDDIAQWLVMNKDLARRRCRELYAELQSIPGL
           P  +D+ +TP+    +  SS PS+ RRS    +    +  R   +  +K I  IA W     ++     + LYAELQ+IPG+
Subjt:  GSSPMSDDIPTTPSGRGSE--SSLPSRSRRSRISSIGEYNEVVREGFQLLTKSIDDIAQWLVMNKDLARRRCRELYAELQSIPGL

A0A5D3C7T4 Uncharacterized protein (e value: 6.8e-26; % identity: 37.84)
Query:  MLGLGYSGFGWNAERKCIDCEAKIFDTCVKSHPSAKELRHKSFPYYDDLAIVFGKDRATGSHATTTAKVGFEPVMEEENEDILNNQSLDFENFYIPDPPF
        M+G   SGFGWN  +KCI+ E  +FD  VK HP+A+ L +K FPY+ DL +VFG+DRATG    T  ++  +   + E +D+     ++ E+F IP+P  
Subjt:  MLGLGYSGFGWNAERKCIDCEAKIFDTCVKSHPSAKELRHKSFPYYDDLAIVFGKDRATGSHATTTAKVGFEPVMEEENEDILNNQSLDFENFYIPDPPF

Query:  GSSPMSDDIPTTPSG--RGSESSLPSRSRRSRISSIGEYNEVVREGFQLLTKSIDDIAQWLVMNKDLARRRCRELYAELQSIPGL
           P  +D+P+TP+     + SS PS+ RR   S  G+  +  R   +  +K I  IA W     ++     + LYAELQ+IPG+
Subjt:  GSSPMSDDIPTTPSG--RGSESSLPSRSRRSRISSIGEYNEVVREGFQLLTKSIDDIAQWLVMNKDLARRRCRELYAELQSIPGL

A0A5D3CKC1 Retrotransposon protein (e value: 2.2e-24; % identity: 37.04)
Query:  MLGLGYSGFGWNAERKCIDCEAKIFDTCVKSHPSAKELRHKSFPYYDDLAIVFGKDRATGSHATTTAKVGFEPVMEEENEDILNNQSLDFENFYIPDPPF
        M+G   S FGWN E+KCI+ +  +FD  VK HP+A+ L +KSFPY+ DL I+FG+DRATG    T  ++G +   + E +D+     ++ E+F IP+P  
Subjt:  MLGLGYSGFGWNAERKCIDCEAKIFDTCVKSHPSAKELRHKSFPYYDDLAIVFGKDRATGSHATTTAKVGFEPVMEEENEDILNNQSLDFENFYIPDPPF

Query:  GSSPMSDDIPTTPSGRGSE--SSLPSRSRRSRISSIGEYNEVVREGF----QLLTKSIDDIAQWLVMNKDLARRRCRELYAELQSIPGL
           P  +D+ +TP+    +  SS PS+ RRS       Y+E + + F    +  +K I  I  W     ++     + LYAELQ+IPG+
Subjt:  GSSPMSDDIPTTPSGRGSE--SSLPSRSRRSRISSIGEYNEVVREGF----QLLTKSIDDIAQWLVMNKDLARRRCRELYAELQSIPGL

A0A6J1DW73 uncharacterized protein LOC111025018 (e value: 1.0e-21; % identity: 33.68)
Query:  GFGWNAERKCIDCEAKIFDTCVKSHPSAKELRHKSFPYYDDLAIVFGKDRATGSHATTTAKVGFEPVMEEENEDILNNQSLDFENFYIPDPPFGSS---P
        GFGWN + KCI+ E ++FD  VKSHP+AK LR+K  P+YDDL + FGKDRATG++      +            I  +   + ++FYIPDPP  ++    
Subjt:  GFGWNAERKCIDCEAKIFDTCVKSHPSAKELRHKSFPYYDDLAIVFGKDRATGSHATTTAKVGFEPVMEEENEDILNNQSLDFENFYIPDPPFGSS---P

Query:  MSDDIPTTPSGRGSESSLPSRSRRSRISSIGEYNEVVREGFQLLTKSIDDIAQWLVMNKDLARRRCRELYAELQSIPGL-SNAVKPLCGLATT
        + +D+P TP+ + +  +    S+R R     E  +VVR   ++ T  ++ +A W    ++    R + ++ +L+ IP L +N    L G+  T
Subjt:  MSDDIPTTPSGRGSESSLPSRSRRSRISSIGEYNEVVREGFQLLTKSIDDIAQWLVMNKDLARRRCRELYAELQSIPGL-SNAVKPLCGLATT

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G30140.1 unknown protein (e value: 2.0e-06; % identity: 31.21)
Query:  SGFGWNAERKCIDCEAKIFDTCVKSHPSAKELRHKSFPYYDDLAIVFGKDRATGSHA-----TTTAKV--------GFEPVMEEENEDILNNQSLDFENF
        SGFGW+ E K      +++   +K+HP+ K ++ +S  +++DL I+FG   ATGS A     +T  ++        G E V ++EN +    +  +F   
Subjt:  SGFGWNAERKCIDCEAKIFDTCVKSHPSAKELRHKSFPYYDDLAIVFGKDRATGSHA-----TTTAKV--------GFEPVMEEENEDILNNQSLDFENF

Query:  YIPDPPFGSSPMSDDIPTTPSGRGSESSLP-SRSRRSRISS
        +     + +SP + D PTT  GR SE  LP  R++  R +S
Subjt:  YIPDPPFGSSPMSDDIPTTPSGRGSESSLP-SRSRRSRISS

AT2G24960.1 unknown protein (e value: 1.3e-05; % identity: 31.25)
Query:  GFGWNAERKCIDCEAKIFDTCVKSHPSAKELRHKSFPYYDDLAIVF------GKDRATGSHATTTAKVGFEPVMEEENED
        GF W+  R  I  +  ++D+ +K HP A+  R KS P Y+DL  +F      G D      A  T++       +E+N D
Subjt:  GFGWNAERKCIDCEAKIFDTCVKSHPSAKELRHKSFPYYDDLAIVF------GKDRATGSHATTTAKVGFEPVMEEENED

AT2G24960.2 unknown protein (e value: 2.4e-07; % identity: 32.86)
Query:  LGYSGFGWNAERKCIDCEAKIFDTCVKSHPSAKELRHKSFPYYDDLAIVFGKDRATGSHATTTAKVGFEP
        L  +GF W+A R  +  +  I++T +++HP A+  R K+ P Y +L  +FGK+ + G +  T     F+P
Subjt:  LGYSGFGWNAERKCIDCEAKIFDTCVKSHPSAKELRHKSFPYYDDLAIVFGKDRATGSHATTTAKVGFEP

AT5G27260.1 unknown protein (e value: 1.3e-08; % identity: 24.35)
Query:  SGFGWNAERKCIDCEAKIFDTCVKSHPSAKELRHKSFPYYDDLAIVFGKDRATGSHATTTAKVGFEPVMEEENEDILNNQSLDFENFYIPDPPFGSSPMS
        SGFGW+   K      +++   +K+HP+ K+LR+ +F ++D+L I+FG+  ATG +A        + +     E+       DF+N Y  D         
Subjt:  SGFGWNAERKCIDCEAKIFDTCVKSHPSAKELRHKSFPYYDDLAIVFGKDRATGSHATTTAKVGFEPVMEEENEDILNNQSLDFENFYIPDPPFGSSPMS

Query:  DDIPTTPSGRGSESSLPSRSRRSRISSIGEYNEVVREGFQLLTKSIDDIAQWLVMNKDLARRRCRELYAELQSIPGLSNAVKPLCGLATTMKY
           P    G      LP R R     S  +  E       +++  I DI Q         +R  R+     Q    + +A+K +  L   ++Y
Subjt:  DDIPTTPSGRGSESSLPSRSRRSRISSIGEYNEVVREGFQLLTKSIDDIAQWLVMNKDLARRRCRELYAELQSIPGLSNAVKPLCGLATTMKY


Sequences
CDS sequence
ATGTTGGGCCTAGGCTATAGTGGGTTTGGTTGGAACGCGGAGCGCAAATGTATTGACTGTGAGGCGAAGATATTTGACACATGTGTCAAGAGTCATCCGAGTGCAAAAGA
ACTACGCCATAAGTCATTTCCGTACTATGACGACTTGGCCATCGTATTCGGCAAAGACAGAGCCACAGGGAGTCATGCAACCACCACTGCAAAGGTTGGATTTGAACCTG
TTATGGAAGAGGAAAACGAGGACATCCTGAATAACCAATCCCTAGACTTTGAGAACTTCTATATTCCTGATCCACCTTTTGGCAGCTCGCCCATGTCAGACGACATTCCA
ACTACCCCCAGCGGTAGAGGGTCTGAGAGTAGCTTGCCATCAAGGAGTAGGAGGTCCCGAATTTCCTCGATTGGAGAGTACAACGAGGTGGTTCGTGAGGGATTCCAACT
TCTGACGAAGTCCATTGACGACATTGCACAGTGGCTTGTCATGAACAAGGACCTGGCAAGGCGTCGTTGTCGAGAACTATACGCTGAGCTACAATCCATTCCTGGTCTAT
CCAATGCTGTTAAGCCACTTTGTGGACTAGCCACCACAATGAAGTACGACTATTTCATGCAAGTCCTCGGGCGACCACAGGATCCAGCACCATGA
mRNA sequence
ATGTTGGGCCTAGGCTATAGTGGGTTTGGTTGGAACGCGGAGCGCAAATGTATTGACTGTGAGGCGAAGATATTTGACACATGTGTCAAGAGTCATCCGAGTGCAAAAGA
ACTACGCCATAAGTCATTTCCGTACTATGACGACTTGGCCATCGTATTCGGCAAAGACAGAGCCACAGGGAGTCATGCAACCACCACTGCAAAGGTTGGATTTGAACCTG
TTATGGAAGAGGAAAACGAGGACATCCTGAATAACCAATCCCTAGACTTTGAGAACTTCTATATTCCTGATCCACCTTTTGGCAGCTCGCCCATGTCAGACGACATTCCA
ACTACCCCCAGCGGTAGAGGGTCTGAGAGTAGCTTGCCATCAAGGAGTAGGAGGTCCCGAATTTCCTCGATTGGAGAGTACAACGAGGTGGTTCGTGAGGGATTCCAACT
TCTGACGAAGTCCATTGACGACATTGCACAGTGGCTTGTCATGAACAAGGACCTGGCAAGGCGTCGTTGTCGAGAACTATACGCTGAGCTACAATCCATTCCTGGTCTAT
CCAATGCTGTTAAGCCACTTTGTGGACTAGCCACCACAATGAAGTACGACTATTTCATGCAAGTCCTCGGGCGACCACAGGATCCAGCACCATGA
Protein sequence
MLGLGYSGFGWNAERKCIDCEAKIFDTCVKSHPSAKELRHKSFPYYDDLAIVFGKDRATGSHATTTAKVGFEPVMEEENEDILNNQSLDFENFYIPDPPFGSSPMSDDIP
TTPSGRGSESSLPSRSRRSRISSIGEYNEVVREGFQLLTKSIDDIAQWLVMNKDLARRRCRELYAELQSIPGLSNAVKPLCGLATTMKYDYFMQVLGRPQDPAP
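
The protein sequence above corresponds to the CDS read codon by codon up to the terminal stop. As a minimal sketch of that relationship, the Python below translates a CDS string with the standard genetic code. The `translate` function and `CODON_TABLE` name are hypothetical helpers introduced for illustration, and the sketch assumes the standard nuclear code applies and that the input contains only unambiguous A, C, G, T bases.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (first base varies slowest). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 24 bases of the CDS listed above.
print(translate("ATGTTGGGCCTAGGCTATAGTGGG"))  # prints MLGLGYSG
```

Passing the full CDS (with its line breaks, which the function strips) should give a string that can be compared directly against the protein sequence listed on this page.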