; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG07G004600 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG07G004600
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionTy3-gypsy retrotransposon protein
Genome locationCG_Chr07:5854545..5855845
RNA-Seq ExpressionClCG07G004600
SyntenyClCG07G004600
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039074.1 keratin, type I cytoskeletal 10-like [Cucumis melo var. makuwa]3.2e-2165.52Show/hide
Query:  TKPVEMETTTNLVIDEGTSAPCGSNNRKVPLFDMRLCKLEEPILKGEVEENVDWWLHRVERYFVVNRLIERDKLDATMLCLEGEALD
        T+  EME     V++EG   PC  N R +PLF MRL KLE PI KGE EENVD WLH VERYFVVNRL ERDKL+A +LCLE EALD
Subjt:  TKPVEMETTTNLVIDEGTSAPCGSNNRKVPLFDMRLCKLEEPILKGEVEENVDWWLHRVERYFVVNRLIERDKLDATMLCLEGEALD

KAE8647113.1 hypothetical protein Csa_021721 [Cucumis sativus]4.2e-2156.38Show/hide
Query:  YQRLTDTEIQSMKEKGICFKCDGKFSVRHRCPNRELNVFVVQEVEDLSDLPEET--EVTAGNEKVVETEITTLSLNSLAGFDSPKTLKVKGEIQ
        Y+R+TD E++  KEKG+CF+CD KFS+ HRC  RELN+ VVQE EDLSD  ++   E+     K V T+I  LS+NSL GF+SPKT+K+KGEI+
Subjt:  YQRLTDTEIQSMKEKGICFKCDGKFSVRHRCPNRELNVFVVQEVEDLSDLPEET--EVTAGNEKVVETEITTLSLNSLAGFDSPKTLKVKGEIQ

XP_024017591.1 uncharacterized protein LOC112090471 [Morus notabilis]5.9e-2327.34Show/hide
Query:  EKMSELTSKVSELEGKFDSRFAEAENLEMLMHKMDTWAQGVREETSSEQTKTDKRTKPVEMETTTNLVIDEGTSAPCGSNNRKVPLFDMRLCKLEEPILK
        E M E   K+  LE          + LE+L+ +MD  ++  R   + E  +TD      +   TT            GS        + R  ++E P+  
Subjt:  EKMSELTSKVSELEGKFDSRFAEAENLEMLMHKMDTWAQGVREETSSEQTKTDKRTKPVEMETTTNLVIDEGTSAPCGSNNRKVPLFDMRLCKLEEPILK

Query:  GEVEENVDWWLHRVERYFVVNRLIERDKLDATMLCLEGEALDW-----------KLVEVPTTAVEKVSTFRSGGQTRK---VDEVATGDDYQEVSTTMA-
        G   EN D W+ R ERYF +NRL +R+KLD  ++ LEGEAL W              E+    +E+  + + G    K   + +  T  +Y+     +A 
Subjt:  GEVEENVDWWLHRVERYFVVNRLIERDKLDATMLCLEGEALDW-----------KLVEVPTTAVEKVSTFRSGGQTRK---VDEVATGDDYQEVSTTMA-

Query:  -----------------------------------QIIEDDLAIEEDKRAGRPTQAQI----LW----------------------------VKLGQPSH
                                           +I+E    +EE  +  R  + Q+     W                            V +G PS 
Subjt:  -----------------------------------QIIEDDLAIEEDKRAGRPTQAQI----LW----------------------------VKLGQPSH

Query:  -QHRMRGNSHQMTSYQRLTDTEIQSMKEKGICFKCDGKFSVRHRCPNRELNVFVVQEVEDLSDLPEETEVTAGNEKVVETEITTLSLNSLAGFDSPKTLK
           + + ++  M  Y+RLTD +IQ+ +EKG+C++CDGK+S  +RCPNREL V +V+E +++++  EE      +E+VVE     LSLNS+ GF SPKT+K
Subjt:  -QHRMRGNSHQMTSYQRLTDTEIQSMKEKGICFKCDGKFSVRHRCPNRELNVFVVQEVEDLSDLPEETEVTAGNEKVVETEITTLSLNSLAGFDSPKTLK

Query:  VKGEIQ
        +KG ++
Subjt:  VKGEIQ

XP_024030016.1 uncharacterized protein LOC112094120 [Morus notabilis]5.7e-1846.49Show/hide
Query:  VKLGQPS-HQHRMRGNSHQMTSYQRLTDTEIQSMKEKGICFKCDGKFSVRHRCPNRELNVFVVQEVEDLSDLPEETEVTAGNEKVVETEITTLSLNSLAG
        V +G+PS +  +   ++     Y+RLTD EI++ KEKG+C++CDGK+S RHRCPNREL V +V E E++++  EE      +E+VVE     LSLNS+ G
Subjt:  VKLGQPS-HQHRMRGNSHQMTSYQRLTDTEIQSMKEKGICFKCDGKFSVRHRCPNRELNVFVVQEVEDLSDLPEETEVTAGNEKVVETEITTLSLNSLAG

Query:  FDSPKTLKVKGEIQ
          SPKT+K+KG ++
Subjt:  FDSPKTLKVKGEIQ

XP_038904464.1 uncharacterized protein LOC120090832 [Benincasa hispida]4.7e-2828.83Show/hide
Query:  MEARVIEVEEKMSELTSKVSELEGKFDSRFAEAENL--EMLMHKMDTWAQGVREETSSEQTKTDKR---TKPVEMETTTNLVIDEG--TSAPCGSNNRKV
        MEAR+  VEE   +LT     +E + D RF E + +   M++ K++   + + ++         K+    K  E  ++   V+D G   +       R+V
Subjt:  MEARVIEVEEKMSELTSKVSELEGKFDSRFAEAENL--EMLMHKMDTWAQGVREETSSEQTKTDKR---TKPVEMETTTNLVIDEG--TSAPCGSNNRKV

Query:  PLFDMRLCKLEEPILKGEVEENVDWWLHRVERYFVVNRLIERDKLDATMLCLEGEALDWKLVEVPTTAVEKVSTFRS-----------------------
         LFDMRL KLE PI KGE+ E+   W HRVERYFVVNRL E+DK++A +LCLEGEAL+W   E   T +   + F++                       
Subjt:  PLFDMRLCKLEEPILKGEVEENVDWWLHRVERYFVVNRLIERDKLDATMLCLEGEALDWKLVEVPTTAVEKVSTFRS-----------------------

Query:  ---------------GGQTRKVDEVATG-------DDYQ-----------EVSTTMAQIIEDDLAIEEDKR----AGRP---------------------
                       G      DE+          +D Q           +    MAQIIED   +EE +R     G P                     
Subjt:  ---------------GGQTRKVDEVATG-------DDYQ-----------EVSTTMAQIIEDDLAIEEDKR----AGRP---------------------

Query:  -------TQAQILWVKL------GQPSHQHRMR---GNSHQMTSYQRLTDTEIQSMKEKGICFKCDGKFSVRHRCPNRELNVFVVQEVEDLSDLPEE---
               TQ+    + L      G PS+   ++   G S  + +++RL+D ++Q+ ++KG+C++C+ K++  HRC  +EL++ +    E+ ++  EE   
Subjt:  -------TQAQILWVKL------GQPSHQHRMR---GNSHQMTSYQRLTDTEIQSMKEKGICFKCDGKFSVRHRCPNRELNVFVVQEVEDLSDLPEE---

Query:  ------TEVTAGNEKVVETEITTLSLNSLAGFDSPKTLKVKGEI
              T + +   K  + E   LSLNSLA  DSP+T+KV+G I
Subjt:  ------TEVTAGNEKVVETEITTLSLNSLAGFDSPKTLKVKGEI

TrEMBL top hitse value%identityAlignment
A0A087H8D5 Uncharacterized protein3.4e-1631.8Show/hide
Query:  KLEEPILKGEVEENVDWWLHRVERYFVVNRLIERDKLDATMLCLEGEALDW--------KLVEVPTTAVEKVSTFRSGGQTRKVDEVATGDDYQEVSTTM
        KLE P   G   EN   W+ +VE+YF +    E  KL A  +C + +AL W          +         +S F S   T     + T      V    
Subjt:  KLEEPILKGEVEENVDWWLHRVERYFVVNRLIERDKLDATMLCLEGEALDW--------KLVEVPTTAVEKVSTFRSGGQTRKVDEVATGDDYQEVSTTM

Query:  AQII-------EDDLAIE-EDKRAGRPTQAQILWVKLGQPSHQHRMRGNSHQMTS-------------YQRLTDTEIQSMKEKGICFKCDGKFSVRHRCP
         Q I       E  LA     K +G P  ++ +      PSH +    N+ +  S             ++RLT TEI   K  G+CF+CD K+SVRH CP
Subjt:  AQII-------EDDLAIE-EDKRAGRPTQAQILWVKLGQPSHQHRMRGNSHQMTS-------------YQRLTDTEIQSMKEKGICFKCDGKFSVRHRCP

Query:  NRELNVFVVQEVEDLSDLPEETEVTAGNEKVVE--TEITTLSLNSLAGFDSPKTLKVKGEI
          EL V +VQ   D S++  E +    +  VV+   E   LSLNSL G  SP+T+K+KG+I
Subjt:  NRELNVFVVQEVEDLSDLPEETEVTAGNEKVVE--TEITTLSLNSLAGFDSPKTLKVKGEI

A0A2I0VWY1 Putative mitochondrial protein2.6e-1629.32Show/hide
Query:  EENVDWWLHRVERYFVVNRLIERDKLDATMLCLEGEALDW-KLVE----------------------VPTTAVEKVSTFRSGG---QTRKVDEVATGDDY
        E NVD W+H+VERYF VN L+E ++L A  +CLEG A  W K ++                       P    E+       G   + RK  E   GD  
Subjt:  EENVDWWLHRVERYFVVNRLIERDKLDATMLCLEGEALDW-KLVE----------------------VPTTAVEKVSTFRSGG---QTRKVDEVATGDDY

Query:  QEVSTTMAQIIEDDL--AIEEDKRAGRPTQAQ--ILWVKLGQPSHQHRMRGNSHQMTSYQRLTDTEIQSMKEKGICFKCDGKFSVRHRCPNRELNVFVVQ
           ++T+       L   I +  +  RP   +  +   +L +        G +    S+++LT+ E+Q  + KG+CF+C+ KF   HRC +R L    V 
Subjt:  QEVSTTMAQIIEDDL--AIEEDKRAGRPTQAQ--ILWVKLGQPSHQHRMRGNSHQMTSYQRLTDTEIQSMKEKGICFKCDGKFSVRHRCPNRELNVFVVQ

Query:  EVEDLSDLPEETEVTAGNEKVVETEITTLSLNSLAGFDSPKTLKVKGEI
            + + PEE + +   +   + E+  +SLNS+ GF    T+KVKG+I
Subjt:  EVEDLSDLPEETEVTAGNEKVVETEITTLSLNSLAGFDSPKTLKVKGEI

A0A2I0WN12 Putative mitochondrial protein3.1e-1727.24Show/hide
Query:  DMRLCKLEEPILKGEVEENVDWWLHRVERYFVVNRLIERDKLDATMLCLEGEALDW--------------------------------------------
        D R  KL+ P+ +   E NVD W+H+VERYF VN L+E ++L A  +CLEG A  W                                            
Subjt:  DMRLCKLEEPILKGEVEENVDWWLHRVERYFVVNRLIERDKLDATMLCLEGEALDW--------------------------------------------

Query:  -----KLVEVPTTAVEKVSTFRSGG--------QTRKVDEVATGDDYQEVSTTMAQIIEDDLAIEEDKR---AGRPTQAQILWVKLGQPSHQ------HR
             K  E     +E +S    GG        + R   +V    D +E +  +AQ++E+        R   +G PT+    ++    PS          
Subjt:  -----KLVEVPTTAVEKVSTFRSGG--------QTRKVDEVATGDDYQEVSTTMAQIIEDDLAIEEDKR---AGRPTQAQILWVKLGQPSHQ------HR

Query:  MRGNSHQMTSYQRLTDTEIQSMKEKGICFKCDGKFSVRHRCPNRELNVFVVQEVEDLSDLPEETEVTAGNEKVVETEITTLSLNSLAGFDSPKTLKVKGE
          G +    S+++LT+ E+Q  + KG+CF+C+ KF   HRC +R L    V     + + PEE + +   +   + E+  +SLNS+ GF    T+KVKG+
Subjt:  MRGNSHQMTSYQRLTDTEIQSMKEKGICFKCDGKFSVRHRCPNRELNVFVVQEVEDLSDLPEETEVTAGNEKVVETEITTLSLNSLAGFDSPKTLKVKGE

Query:  I
        I
Subjt:  I

A0A5A7T6F9 Keratin, type I cytoskeletal 10-like1.6e-2165.52Show/hide
Query:  TKPVEMETTTNLVIDEGTSAPCGSNNRKVPLFDMRLCKLEEPILKGEVEENVDWWLHRVERYFVVNRLIERDKLDATMLCLEGEALD
        T+  EME     V++EG   PC  N R +PLF MRL KLE PI KGE EENVD WLH VERYFVVNRL ERDKL+A +LCLE EALD
Subjt:  TKPVEMETTTNLVIDEGTSAPCGSNNRKVPLFDMRLCKLEEPILKGEVEENVDWWLHRVERYFVVNRLIERDKLDATMLCLEGEALD

A0A6J1DN22 Reverse transcriptase3.1e-1723.98Show/hide
Query:  EARVIEVEEKMSELTSKVSELEGKFDSRFAEAENL-EMLMHKMDTWAQGVREETSSEQTKTDKRTKPVEMETTTNLVIDEGTSAPCGSNNRKVPLFDMRL
        E  V  ++E + +++S++ E   + +++  E     E    K+D +   V +  S      +  +  ++ +        +G ++   +    V     + 
Subjt:  EARVIEVEEKMSELTSKVSELEGKFDSRFAEAENL-EMLMHKMDTWAQGVREETSSEQTKTDKRTKPVEMETTTNLVIDEGTSAPCGSNNRKVPLFDMRL

Query:  CKLEEPILKGEVEENVDWWLHRVERYFVVNRLIERDKLDATMLCLEGEALDW-----------KLVEVPTTAVEKVSTFRSGGQTRK---VDEVATGDDY
         K+E P+  G   ++ + WL R ERYF +N L E +K+  T++  EG A+ W               + T   E+ S  +      +   + +  T  +Y
Subjt:  CKLEEPILKGEVEENVDWWLHRVERYFVVNRLIERDKLDATMLCLEGEALDW-----------KLVEVPTTAVEKVSTFRSGGQTRK---VDEVATGDDY

Query:  QEV----STTMAQIIEDDLAIEEDKRAGRPTQAQILWVKLGQ---------PSHQHRMRGNSH-------QMTSYQRLTDTEIQSMKEKGICFKCDGKFS
        ++     S ++  I ED + I    R G P+Q         +          +    + G S+       +  + ++LT+TE Q  K+KG+CF+ + K+S
Subjt:  QEV----STTMAQIIEDDLAIEEDKRAGRPTQAQILWVKLGQ---------PSHQHRMRGNSH-------QMTSYQRLTDTEIQSMKEKGICFKCDGKFS

Query:  VRHRCPNRELNVFVVQEVEDLS-DLPEETEVTAGNEKVVETEITTLSLNSLAGFDSPKTLKVKGEIQ
        + HRC N+EL VFVV + E +  D  E    T G E  +  E+  L+LN++ GF +P T+K++G I+
Subjt:  VRHRCPNRELNVFVVQEVEDLS-DLPEETEVTAGNEKVVETEITTLSLNSLAGFDSPKTLKVKGEIQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGCCAGAGTAATAGAAGTGGAAGAGAAAATGAGTGAATTGACATCTAAGGTATCAGAATTGGAAGGGAAATTTGACTCACGTTTTGCAGAAGCTGAGAAT
CTGGAAATGTTGATGCACAAAATGGATACGTGGGCCCAAGGAGTGAGAGAGGAAACGTCATCGGAACAAACCAAAACAGATAAAAGGACGAAACCAGTAGAGATG
GAAACAACTACTAACTTGGTAATTGACGAAGGAACGAGTGCTCCATGTGGGTCGAATAATCGCAAGGTTCCGTTGTTCGACATGCGGCTATGCAAGTTGGAGGAA
CCGATTCTCAAAGGAGAAGTAGAAGAAAACGTTGATTGGTGGCTTCATCGAGTGGAGAGGTATTTTGTGGTGAATCGACTGATAGAAAGAGACAAACTCGATGCT
ACAATGCTATGTTTGGAGGGGGAAGCCCTGGATTGGAAGTTGGTTGAAGTTCCGACAACTGCTGTGGAAAAGGTTTCGACCTTCAGATCGGGAGGACAAACACGC
AAGGTTGATGAAGTTGCAACAGGAGACGACTATCAGGAAGTATCGACGACAATGGCCCAAATAATTGAAGACGACTTGGCAATAGAGGAGGATAAAAGGGCTGGC
AGGCCAACACAAGCCCAAATACTATGGGTAAAATTAGGCCAACCAAGTCACCAACATCGAATGCGGGGAAATTCACACCAGATGACGTCGTACCAGCGATTAACA
GACACTGAGATACAAAGTATGAAAGAAAAGGGGATTTGTTTCAAATGTGACGGCAAATTTAGTGTTAGACATCGATGCCCTAATCGTGAGTTGAATGTGTTCGTG
GTGCAAGAGGTCGAGGATTTGAGTGACTTACCAGAAGAAACGGAAGTAACGGCCGGCAATGAGAAAGTTGTTGAAACGGAAATAACAACCCTCTCCCTAAATTCG
CTGGCTGGATTTGATTCTCCAAAAACGTTGAAGGTGAAGGGAGAAATTCAA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGCCAGAGTAATAGAAGTGGAAGAGAAAATGAGTGAATTGACATCTAAGGTATCAGAATTGGAAGGGAAATTTGACTCACGTTTTGCAGAAGCTGAGAAT
CTGGAAATGTTGATGCACAAAATGGATACGTGGGCCCAAGGAGTGAGAGAGGAAACGTCATCGGAACAAACCAAAACAGATAAAAGGACGAAACCAGTAGAGATG
GAAACAACTACTAACTTGGTAATTGACGAAGGAACGAGTGCTCCATGTGGGTCGAATAATCGCAAGGTTCCGTTGTTCGACATGCGGCTATGCAAGTTGGAGGAA
CCGATTCTCAAAGGAGAAGTAGAAGAAAACGTTGATTGGTGGCTTCATCGAGTGGAGAGGTATTTTGTGGTGAATCGACTGATAGAAAGAGACAAACTCGATGCT
ACAATGCTATGTTTGGAGGGGGAAGCCCTGGATTGGAAGTTGGTTGAAGTTCCGACAACTGCTGTGGAAAAGGTTTCGACCTTCAGATCGGGAGGACAAACACGC
AAGGTTGATGAAGTTGCAACAGGAGACGACTATCAGGAAGTATCGACGACAATGGCCCAAATAATTGAAGACGACTTGGCAATAGAGGAGGATAAAAGGGCTGGC
AGGCCAACACAAGCCCAAATACTATGGGTAAAATTAGGCCAACCAAGTCACCAACATCGAATGCGGGGAAATTCACACCAGATGACGTCGTACCAGCGATTAACA
GACACTGAGATACAAAGTATGAAAGAAAAGGGGATTTGTTTCAAATGTGACGGCAAATTTAGTGTTAGACATCGATGCCCTAATCGTGAGTTGAATGTGTTCGTG
GTGCAAGAGGTCGAGGATTTGAGTGACTTACCAGAAGAAACGGAAGTAACGGCCGGCAATGAGAAAGTTGTTGAAACGGAAATAACAACCCTCTCCCTAAATTCG
CTGGCTGGATTTGATTCTCCAAAAACGTTGAAGGTGAAGGGAGAAATTCAA
Protein sequenceShow/hide protein sequence
MEARVIEVEEKMSELTSKVSELEGKFDSRFAEAENLEMLMHKMDTWAQGVREETSSEQTKTDKRTKPVEMETTTNLVIDEGTSAPCGSNNRKVPLFDMRLCKLEE
PILKGEVEENVDWWLHRVERYFVVNRLIERDKLDATMLCLEGEALDWKLVEVPTTAVEKVSTFRSGGQTRKVDEVATGDDYQEVSTTMAQIIEDDLAIEEDKRAG
RPTQAQILWVKLGQPSHQHRMRGNSHQMTSYQRLTDTEIQSMKEKGICFKCDGKFSVRHRCPNRELNVFVVQEVEDLSDLPEETEVTAGNEKVVETEITTLSLNS
LAGFDSPKTLKVKGEIQ