; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc01G11725 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc01G11725
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionRetrotransposon protein
Genome locationClcChr01:18568370..18569140
RNA-Seq ExpressionClc01G11725
SyntenyClc01G11725
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0067273.1 retrotransposon protein [Cucumis melo var. makuwa]4.1e-2537.3Show/hide
Query:  MLGLGYSGFGWNAERKCIDCEAKIFDTCVKSHPSAKELRHKSFPYYDDLAIVFGKDRATGSHATTTAKVGFEPVMEEENEDILNNQSLDFENFYIPDPPF
        M+G   S FGWN E+KCI+ +  +FD  VK HP+A+ L +KSFPY+ DL I+FG+DRATG    T  ++G +   + E +D+     ++ E+F IP+P  
Subjt:  MLGLGYSGFGWNAERKCIDCEAKIFDTCVKSHPSAKELRHKSFPYYDDLAIVFGKDRATGSHATTTAKVGFEPVMEEENEDILNNQSLDFENFYIPDPPF

Query:  GSSPMSDDIPTTPSGRGSE--SSLPSRSRRSRISSIGEYNEVVREGFQLLTKSIDDIAQWPVMNKDLARRRCRELYAELQSIPGL
           P  +D+ +TP+    +  SS PS+ RRS    +    +  R   +  +K I  IA W     ++     + LYAELQ+IPG+
Subjt:  GSSPMSDDIPTTPSGRGSE--SSLPSRSRRSRISSIGEYNEVVREGFQLLTKSIDDIAQWPVMNKDLARRRCRELYAELQSIPGL

TYK07921.1 hypothetical protein E5676_scaffold265G00330 [Cucumis melo var. makuwa]6.3e-2637.84Show/hide
Query:  MLGLGYSGFGWNAERKCIDCEAKIFDTCVKSHPSAKELRHKSFPYYDDLAIVFGKDRATGSHATTTAKVGFEPVMEEENEDILNNQSLDFENFYIPDPPF
        M+G   SGFGWN  +KCI+ E  +FD  VK HP+A+ L +K FPY+ DL +VFG+DRATG    T  ++  +   + E +D+     ++ E+F IP+P  
Subjt:  MLGLGYSGFGWNAERKCIDCEAKIFDTCVKSHPSAKELRHKSFPYYDDLAIVFGKDRATGSHATTTAKVGFEPVMEEENEDILNNQSLDFENFYIPDPPF

Query:  GSSPMSDDIPTTPSG--RGSESSLPSRSRRSRISSIGEYNEVVREGFQLLTKSIDDIAQWPVMNKDLARRRCRELYAELQSIPGL
           P  +D+P+TP+     + SS PS+ RR   S  G+  +  R   +  +K I  IA W     ++     + LYAELQ+IPG+
Subjt:  GSSPMSDDIPTTPSG--RGSESSLPSRSRRSRISSIGEYNEVVREGFQLLTKSIDDIAQWPVMNKDLARRRCRELYAELQSIPGL

TYK10886.1 retrotransposon protein [Cucumis melo var. makuwa]2.6e-2437.04Show/hide
Query:  MLGLGYSGFGWNAERKCIDCEAKIFDTCVKSHPSAKELRHKSFPYYDDLAIVFGKDRATGSHATTTAKVGFEPVMEEENEDILNNQSLDFENFYIPDPPF
        M+G   S FGWN E+KCI+ +  +FD  VK HP+A+ L +KSFPY+ DL I+FG+DRATG    T  ++G +   + E +D+     ++ E+F IP+P  
Subjt:  MLGLGYSGFGWNAERKCIDCEAKIFDTCVKSHPSAKELRHKSFPYYDDLAIVFGKDRATGSHATTTAKVGFEPVMEEENEDILNNQSLDFENFYIPDPPF

Query:  GSSPMSDDIPTTPSGRGSE--SSLPSRSRRSRISSIGEYNEVVREGF----QLLTKSIDDIAQWPVMNKDLARRRCRELYAELQSIPGL
           P  +D+ +TP+    +  SS PS+ RRS       Y+E + + F    +  +K I  I  W     ++     + LYAELQ+IPG+
Subjt:  GSSPMSDDIPTTPSGRGSE--SSLPSRSRRSRISSIGEYNEVVREGF----QLLTKSIDDIAQWPVMNKDLARRRCRELYAELQSIPGL

XP_008460440.1 PREDICTED: uncharacterized protein LOC103499248 [Cucumis melo]1.5e-2436.96Show/hide
Query:  LGLGYSGFGWNAERKCIDCEAKIFDTCVKSHPSAKELRHKSFPYYDDLAIVFGKDRATGSHATTTAKVGFEPVMEEENEDILNNQSLDFENFYIPDPPFG
        +G   S FGWN E+KCI+ +  +FD  VK HP+A+ L +KSFPY+ DL I+FG+DRATG    T  ++G +   + E +D+     ++ E+F IP+P   
Subjt:  LGLGYSGFGWNAERKCIDCEAKIFDTCVKSHPSAKELRHKSFPYYDDLAIVFGKDRATGSHATTTAKVGFEPVMEEENEDILNNQSLDFENFYIPDPPFG

Query:  SSPMSDDIPTTPSGRGSE--SSLPSRSRRSRISSIGEYNEVVREGFQLLTKSIDDIAQWPVMNKDLARRRCRELYAELQSIPGL
          P  +D+ +TP+    +  SS PS+ RRS    +    +  R   +  +K I  IA W     ++     + LYAELQ+IPG+
Subjt:  SSPMSDDIPTTPSGRGSE--SSLPSRSRRSRISSIGEYNEVVREGFQLLTKSIDDIAQWPVMNKDLARRRCRELYAELQSIPGL

XP_022158565.1 uncharacterized protein LOC111025018 [Momordica charantia]3.4e-2432.58Show/hide
Query:  GFGWNAERKCIDCEAKIFDTCVKSHPSAKELRHKSFPYYDDLAIVFGKDRATGSHATTTAKVGFEPVMEEENEDILNNQSLDFENFYIPDPPFGSS---P
        GFGWN + KCI+ E ++FD  VKSHP+AK LR+K  P+YDDL + FGKDRATG++      +            I  +   + ++FYIPDPP  ++    
Subjt:  GFGWNAERKCIDCEAKIFDTCVKSHPSAKELRHKSFPYYDDLAIVFGKDRATGSHATTTAKVGFEPVMEEENEDILNNQSLDFENFYIPDPPFGSS---P

Query:  MSDDIPTTPSGRGSESSLPSRSRRSRISSIGEYNEVVREGFQLLTKSIDDIAQWPVMNKDLARRRCRELYAELQSIPGL-SNAVKPLCGLATT-------
        + +D+P TP+ + +  +    S+R R     E  +VVR   ++ T  ++ +A WP   ++    R + ++ +L+ IP L +N    L G+  T       
Subjt:  MSDDIPTTPSGRGSESSLPSRSRRSRISSIGEYNEVVREGFQLLTKSIDDIAQWPVMNKDLARRRCRELYAELQSIPGL-SNAVKPLCGLATT-------

Query:  -------MKYDYCMQVLGRPQ
                K  +CMQ+LG+ +
Subjt:  -------MKYDYCMQVLGRPQ

TrEMBL top hitse value%identityAlignment
A0A1S3CC17 uncharacterized protein LOC1034992487.5e-2536.96Show/hide
Query:  LGLGYSGFGWNAERKCIDCEAKIFDTCVKSHPSAKELRHKSFPYYDDLAIVFGKDRATGSHATTTAKVGFEPVMEEENEDILNNQSLDFENFYIPDPPFG
        +G   S FGWN E+KCI+ +  +FD  VK HP+A+ L +KSFPY+ DL I+FG+DRATG    T  ++G +   + E +D+     ++ E+F IP+P   
Subjt:  LGLGYSGFGWNAERKCIDCEAKIFDTCVKSHPSAKELRHKSFPYYDDLAIVFGKDRATGSHATTTAKVGFEPVMEEENEDILNNQSLDFENFYIPDPPFG

Query:  SSPMSDDIPTTPSGRGSE--SSLPSRSRRSRISSIGEYNEVVREGFQLLTKSIDDIAQWPVMNKDLARRRCRELYAELQSIPGL
          P  +D+ +TP+    +  SS PS+ RRS    +    +  R   +  +K I  IA W     ++     + LYAELQ+IPG+
Subjt:  SSPMSDDIPTTPSGRGSE--SSLPSRSRRSRISSIGEYNEVVREGFQLLTKSIDDIAQWPVMNKDLARRRCRELYAELQSIPGL

A0A5A7VGQ0 Retrotransposon protein2.0e-2537.3Show/hide
Query:  MLGLGYSGFGWNAERKCIDCEAKIFDTCVKSHPSAKELRHKSFPYYDDLAIVFGKDRATGSHATTTAKVGFEPVMEEENEDILNNQSLDFENFYIPDPPF
        M+G   S FGWN E+KCI+ +  +FD  VK HP+A+ L +KSFPY+ DL I+FG+DRATG    T  ++G +   + E +D+     ++ E+F IP+P  
Subjt:  MLGLGYSGFGWNAERKCIDCEAKIFDTCVKSHPSAKELRHKSFPYYDDLAIVFGKDRATGSHATTTAKVGFEPVMEEENEDILNNQSLDFENFYIPDPPF

Query:  GSSPMSDDIPTTPSGRGSE--SSLPSRSRRSRISSIGEYNEVVREGFQLLTKSIDDIAQWPVMNKDLARRRCRELYAELQSIPGL
           P  +D+ +TP+    +  SS PS+ RRS    +    +  R   +  +K I  IA W     ++     + LYAELQ+IPG+
Subjt:  GSSPMSDDIPTTPSGRGSE--SSLPSRSRRSRISSIGEYNEVVREGFQLLTKSIDDIAQWPVMNKDLARRRCRELYAELQSIPGL

A0A5D3C7T4 Uncharacterized protein3.0e-2637.84Show/hide
Query:  MLGLGYSGFGWNAERKCIDCEAKIFDTCVKSHPSAKELRHKSFPYYDDLAIVFGKDRATGSHATTTAKVGFEPVMEEENEDILNNQSLDFENFYIPDPPF
        M+G   SGFGWN  +KCI+ E  +FD  VK HP+A+ L +K FPY+ DL +VFG+DRATG    T  ++  +   + E +D+     ++ E+F IP+P  
Subjt:  MLGLGYSGFGWNAERKCIDCEAKIFDTCVKSHPSAKELRHKSFPYYDDLAIVFGKDRATGSHATTTAKVGFEPVMEEENEDILNNQSLDFENFYIPDPPF

Query:  GSSPMSDDIPTTPSG--RGSESSLPSRSRRSRISSIGEYNEVVREGFQLLTKSIDDIAQWPVMNKDLARRRCRELYAELQSIPGL
           P  +D+P+TP+     + SS PS+ RR   S  G+  +  R   +  +K I  IA W     ++     + LYAELQ+IPG+
Subjt:  GSSPMSDDIPTTPSG--RGSESSLPSRSRRSRISSIGEYNEVVREGFQLLTKSIDDIAQWPVMNKDLARRRCRELYAELQSIPGL

A0A5D3CKC1 Retrotransposon protein1.3e-2437.04Show/hide
Query:  MLGLGYSGFGWNAERKCIDCEAKIFDTCVKSHPSAKELRHKSFPYYDDLAIVFGKDRATGSHATTTAKVGFEPVMEEENEDILNNQSLDFENFYIPDPPF
        M+G   S FGWN E+KCI+ +  +FD  VK HP+A+ L +KSFPY+ DL I+FG+DRATG    T  ++G +   + E +D+     ++ E+F IP+P  
Subjt:  MLGLGYSGFGWNAERKCIDCEAKIFDTCVKSHPSAKELRHKSFPYYDDLAIVFGKDRATGSHATTTAKVGFEPVMEEENEDILNNQSLDFENFYIPDPPF

Query:  GSSPMSDDIPTTPSGRGSE--SSLPSRSRRSRISSIGEYNEVVREGF----QLLTKSIDDIAQWPVMNKDLARRRCRELYAELQSIPGL
           P  +D+ +TP+    +  SS PS+ RRS       Y+E + + F    +  +K I  I  W     ++     + LYAELQ+IPG+
Subjt:  GSSPMSDDIPTTPSGRGSE--SSLPSRSRRSRISSIGEYNEVVREGF----QLLTKSIDDIAQWPVMNKDLARRRCRELYAELQSIPGL

A0A6J1DW73 uncharacterized protein LOC1110250181.7e-2432.58Show/hide
Query:  GFGWNAERKCIDCEAKIFDTCVKSHPSAKELRHKSFPYYDDLAIVFGKDRATGSHATTTAKVGFEPVMEEENEDILNNQSLDFENFYIPDPPFGSS---P
        GFGWN + KCI+ E ++FD  VKSHP+AK LR+K  P+YDDL + FGKDRATG++      +            I  +   + ++FYIPDPP  ++    
Subjt:  GFGWNAERKCIDCEAKIFDTCVKSHPSAKELRHKSFPYYDDLAIVFGKDRATGSHATTTAKVGFEPVMEEENEDILNNQSLDFENFYIPDPPFGSS---P

Query:  MSDDIPTTPSGRGSESSLPSRSRRSRISSIGEYNEVVREGFQLLTKSIDDIAQWPVMNKDLARRRCRELYAELQSIPGL-SNAVKPLCGLATT-------
        + +D+P TP+ + +  +    S+R R     E  +VVR   ++ T  ++ +A WP   ++    R + ++ +L+ IP L +N    L G+  T       
Subjt:  MSDDIPTTPSGRGSESSLPSRSRRSRISSIGEYNEVVREGFQLLTKSIDDIAQWPVMNKDLARRRCRELYAELQSIPGL-SNAVKPLCGLATT-------

Query:  -------MKYDYCMQVLGRPQ
                K  +CMQ+LG+ +
Subjt:  -------MKYDYCMQVLGRPQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G30140.1 unknown protein2.0e-0631.21Show/hide
Query:  SGFGWNAERKCIDCEAKIFDTCVKSHPSAKELRHKSFPYYDDLAIVFGKDRATGSHA-----TTTAKV--------GFEPVMEEENEDILNNQSLDFENF
        SGFGW+ E K      +++   +K+HP+ K ++ +S  +++DL I+FG   ATGS A     +T  ++        G E V ++EN +    +  +F   
Subjt:  SGFGWNAERKCIDCEAKIFDTCVKSHPSAKELRHKSFPYYDDLAIVFGKDRATGSHA-----TTTAKV--------GFEPVMEEENEDILNNQSLDFENF

Query:  YIPDPPFGSSPMSDDIPTTPSGRGSESSLP-SRSRRSRISS
        +     + +SP + D PTT  GR SE  LP  R++  R +S
Subjt:  YIPDPPFGSSPMSDDIPTTPSGRGSESSLP-SRSRRSRISS

AT2G24960.1 unknown protein1.3e-0531.25Show/hide
Query:  GFGWNAERKCIDCEAKIFDTCVKSHPSAKELRHKSFPYYDDLAIVF------GKDRATGSHATTTAKVGFEPVMEEENED
        GF W+  R  I  +  ++D+ +K HP A+  R KS P Y+DL  +F      G D      A  T++       +E+N D
Subjt:  GFGWNAERKCIDCEAKIFDTCVKSHPSAKELRHKSFPYYDDLAIVF------GKDRATGSHATTTAKVGFEPVMEEENED

AT2G24960.2 unknown protein2.4e-0732.86Show/hide
Query:  LGYSGFGWNAERKCIDCEAKIFDTCVKSHPSAKELRHKSFPYYDDLAIVFGKDRATGSHATTTAKVGFEP
        L  +GF W+A R  +  +  I++T +++HP A+  R K+ P Y +L  +FGK+ + G +  T     F+P
Subjt:  LGYSGFGWNAERKCIDCEAKIFDTCVKSHPSAKELRHKSFPYYDDLAIVFGKDRATGSHATTTAKVGFEP

AT5G27260.1 unknown protein1.3e-0824.35Show/hide
Query:  SGFGWNAERKCIDCEAKIFDTCVKSHPSAKELRHKSFPYYDDLAIVFGKDRATGSHATTTAKVGFEPVMEEENEDILNNQSLDFENFYIPDPPFGSSPMS
        SGFGW+   K      +++   +K+HP+ K+LR+ +F ++D+L I+FG+  ATG +A        + +     E+       DF+N Y  D         
Subjt:  SGFGWNAERKCIDCEAKIFDTCVKSHPSAKELRHKSFPYYDDLAIVFGKDRATGSHATTTAKVGFEPVMEEENEDILNNQSLDFENFYIPDPPFGSSPMS

Query:  DDIPTTPSGRGSESSLPSRSRRSRISSIGEYNEVVREGFQLLTKSIDDIAQWPVMNKDLARRRCRELYAELQSIPGLSNAVKPLCGLATTMKY
           P    G      LP R R     S  +  E       +++  I DI Q         +R  R+     Q    + +A+K +  L   ++Y
Subjt:  DDIPTTPSGRGSESSLPSRSRRSRISSIGEYNEVVREGFQLLTKSIDDIAQWPVMNKDLARRRCRELYAELQSIPGLSNAVKPLCGLATTMKY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGGGCCTAGGCTATAGTGGGTTTGGTTGGAACGCGGAGCGCAAATGTATTGACTGTGAGGCGAAGATATTTGACACATGTGTCAAGAGTCATCCGAGTGCA
AAAGAACTACGCCATAAGTCATTTCCGTACTATGACGACTTGGCCATCGTATTCGGCAAAGACAGAGCCACAGGGAGTCATGCAACCACCACTGCAAAGGTTGGA
TTTGAACCTGTTATGGAAGAGGAAAACGAGGACATCCTGAATAACCAATCCCTAGACTTTGAGAACTTCTATATTCCTGATCCACCTTTTGGCAGCTCGCCCATG
TCAGACGACATTCCAACTACCCCCAGCGGTAGAGGGTCTGAGAGTAGCTTGCCATCAAGGAGTAGGAGGTCCCGAATTTCCTCGATTGGAGAGTACAACGAGGTG
GTTCGTGAGGGATTCCAACTTCTGACGAAGTCCATTGACGACATTGCACAGTGGCCTGTCATGAACAAGGACCTGGCAAGGCGTCGTTGTCGAGAACTATACGCT
GAGCTACAATCCATTCCTGGTCTATCCAATGCTGTTAAGCCACTTTGTGGACTAGCCACCACAATGAAGTACGACTATTGCATGCAAGTCCTCGGGCGACCACAG
GATCCAGCACCATGA
mRNA sequenceShow/hide mRNA sequence
ATGTTGGGCCTAGGCTATAGTGGGTTTGGTTGGAACGCGGAGCGCAAATGTATTGACTGTGAGGCGAAGATATTTGACACATGTGTCAAGAGTCATCCGAGTGCA
AAAGAACTACGCCATAAGTCATTTCCGTACTATGACGACTTGGCCATCGTATTCGGCAAAGACAGAGCCACAGGGAGTCATGCAACCACCACTGCAAAGGTTGGA
TTTGAACCTGTTATGGAAGAGGAAAACGAGGACATCCTGAATAACCAATCCCTAGACTTTGAGAACTTCTATATTCCTGATCCACCTTTTGGCAGCTCGCCCATG
TCAGACGACATTCCAACTACCCCCAGCGGTAGAGGGTCTGAGAGTAGCTTGCCATCAAGGAGTAGGAGGTCCCGAATTTCCTCGATTGGAGAGTACAACGAGGTG
GTTCGTGAGGGATTCCAACTTCTGACGAAGTCCATTGACGACATTGCACAGTGGCCTGTCATGAACAAGGACCTGGCAAGGCGTCGTTGTCGAGAACTATACGCT
GAGCTACAATCCATTCCTGGTCTATCCAATGCTGTTAAGCCACTTTGTGGACTAGCCACCACAATGAAGTACGACTATTGCATGCAAGTCCTCGGGCGACCACAG
GATCCAGCACCATGA
Protein sequenceShow/hide protein sequence
MLGLGYSGFGWNAERKCIDCEAKIFDTCVKSHPSAKELRHKSFPYYDDLAIVFGKDRATGSHATTTAKVGFEPVMEEENEDILNNQSLDFENFYIPDPPFGSSPM
SDDIPTTPSGRGSESSLPSRSRRSRISSIGEYNEVVREGFQLLTKSIDDIAQWPVMNKDLARRRCRELYAELQSIPGLSNAVKPLCGLATTMKYDYCMQVLGRPQ
DPAP