CuGenDBv2

ClCG08G005202 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG08G005202
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: Retrotransposon protein
Genome location: CG_Chr08:16423318..16424335
RNA-Seq Expression: ClCG08G005202
Synteny: ClCG08G005202
Gene Ontology terms: NA
InterPro domains: IPR024752 - Myb/SANT-like domain
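
The genome location listed for this gene follows a seqid:start..end pattern. A minimal Python sketch for splitting it and deriving the span length is shown here. It assumes the coordinates are 1-based and inclusive (the page does not state the convention), and `parse_location` is a hypothetical helper name, not part of any CuGenDB tool.

```python
import re


def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognised location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


seqid, start, end = parse_location("CG_Chr08:16423318..16424335")
# Assumption: 1-based, inclusive coordinates, so the length is end - start + 1.
span = end - start + 1
print(seqid, start, end, span)  # CG_Chr08 16423318 16424335 1018
```

If the coordinates were instead 0-based and half-open, the span would be `end - start`.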


Homology
GenBank top hits (e value, % identity, alignment)
XP_038877407.1 uncharacterized protein LOC120069696 [Benincasa hispida] (e value: 8.5e-63; % identity: 59.11)
Query:  RSDNGTFRPGYLQHLERMLHEKVPGCALNQNIIECKVRSLKKQYNVVSEMLSQSGFGWNEMFKCVQ-----------SHPSAKGMWNKSFPHYDELSTVF
        RS  G F       L      K P  ALNQN IECKVRSLKKQYN +SEMLSQSGF WNE FKCVQ           SHP+AKGMWNK FPHYD+LST  
Subjt:  RSDNGTFRPGYLQHLERMLHEKVPGCALNQNIIECKVRSLKKQYNVVSEMLSQSGFGWNEMFKCVQ-----------SHPSAKGMWNKSFPHYDELSTVF

Query:  VKEEQSQDCHTLEVHQTKSPLNQDGIDEEPTEQSIGRAM----------------------YLPSHLEA-DTYMGRLASWQKENYELEFGRRKEVVNAIY
               DCHT EV Q +S LNQD IDEEPTEQS GR                         + S +E   T+MGRLASWQK+ YELEFGR+KEVVNAIY
Subjt:  VKEEQSQDCHTLEVHQTKSPLNQDGIDEEPTEQSIGRAM----------------------YLPSHLEA-DTYMGRLASWQKENYELEFGRRKEVVNAIY

Query:  NINGLNEDDQVTLIDLFVIDIQKINCFLGVLEHARKRYCLCLLGRNM
        NI+GL+ED QVTLIDL V DIQK +CFL V EHA KRYCL LLGRNM
Subjt:  NINGLNEDDQVTLIDLFVIDIQKINCFLGVLEHARKRYCLCLLGRNM

XP_038887234.1 uncharacterized protein LOC120077425 [Benincasa hispida] (e value: 4.2e-78; % identity: 65.99)
Query:  RSDNGTFRPGYLQHLERMLHEKVPGCALNQNIIECKVRSLKKQYNVVSEMLSQSGFGWNEMFKCVQ-----------SHPSAKGMWNKSFPHYDELSTVF
        RSDNGTFRPGYLQHLE++LHEKVPGCALN+N IECKVRSLKKQYN VSEMLSQSGF WNE FKCVQ           SHP+AKGMW K FPHYD+LS VF
Subjt:  RSDNGTFRPGYLQHLERMLHEKVPGCALNQNIIECKVRSLKKQYNVVSEMLSQSGFGWNEMFKCVQ-----------SHPSAKGMWNKSFPHYDELSTVF

Query:  VKEEQSQDCHTLEVHQTKSPLNQDGIDEEPTEQSIGRAM----------------------YLPSHLE-ADTYMGRLASWQKENYELEFGRRKEVVNAIY
         K+    DCHT EV QT+SPLNQD IDEEP EQS GRA                        + S +E   T+MGRLASWQ E YELE    KEVVNAIY
Subjt:  VKEEQSQDCHTLEVHQTKSPLNQDGIDEEPTEQSIGRAM----------------------YLPSHLE-ADTYMGRLASWQKENYELEFGRRKEVVNAIY

Query:  NINGLNEDDQVTLIDLFVIDIQKINCFLGVLEHARKRYCLCLLGRNM
        NI+ L E+DQVTLIDL V DIQK +CFL V EHARKRYCL LLGRNM
Subjt:  NINGLNEDDQVTLIDLFVIDIQKINCFLGVLEHARKRYCLCLLGRNM

XP_038892629.1 uncharacterized protein At2g29880-like [Benincasa hispida] (e value: 7.7e-48; % identity: 70.07)
Query:  RSDNGTFRPGYLQHLERMLHEKVPGCALNQNIIECKVRSLKKQYNVVSEMLSQSGFGWNEMFKCVQ-----------SHPSAKGMWNKSFPHYDELSTVF
        R DNGTFRPGYLQHLE++LHEKVPGCALN N IECKVRSLKKQYN VSEMLSQSG GWNE FKCV            SHP+AK MWNK FPHYD+LST+F
Subjt:  RSDNGTFRPGYLQHLERMLHEKVPGCALNQNIIECKVRSLKKQYNVVSEMLSQSGFGWNEMFKCVQ-----------SHPSAKGMWNKSFPHYDELSTVF

Query:  VKEE---QS------QDCHTLEVHQTKSPLNQDGIDEEPTEQSIGRA
         K+    QS       DCHT EV QT+SPLNQD IDEEP EQS GRA
Subjt:  VKEE---QS------QDCHTLEVHQTKSPLNQDGIDEEPTEQSIGRA

XP_038895773.1 uncharacterized protein LOC120083935 [Benincasa hispida] (e value: 1.2e-69; % identity: 65.35)
Query:  RSDNGTFRPGYLQHLERMLHEKVPGCALNQNIIECKVRSLKKQYNVVSEMLSQSGFGWNEMFKCVQ-----------SHPSAKGMWNKSFPHYDELSTVF
        RSDNGTFR  YLQHLER+ HEKV GCALNQN IECKVRSLKKQ N VSEMLSQSGF WNE FKCVQ           SHP+AKGMWNK FPHYD+LSTVF
Subjt:  RSDNGTFRPGYLQHLERMLHEKVPGCALNQNIIECKVRSLKKQYNVVSEMLSQSGFGWNEMFKCVQ-----------SHPSAKGMWNKSFPHYDELSTVF

Query:  VK----EEQSQDCHTLEVHQTKSPLNQDGIDEEPTEQSIGRAMYLPSHLEADTYMGRLASWQKENYELEFGRRKEVVNAIYNINGLNEDDQVTLIDLFVI
         K     + S+D + +  +  +         E   E  +G       H    T+MGRLASWQKE YELEFGRRKEVVNAIYNI+GL+EDDQVTLIDL V 
Subjt:  VK----EEQSQDCHTLEVHQTKSPLNQDGIDEEPTEQSIGRAMYLPSHLEADTYMGRLASWQKENYELEFGRRKEVVNAIYNINGLNEDDQVTLIDLFVI

Query:  DIQKINCFLGVLEHARKRYCLCLLGRNM
        DIQK NCFL V EHARKRYCL LLGRNM
Subjt:  DIQKINCFLGVLEHARKRYCLCLLGRNM

XP_038896380.1 uncharacterized protein LOC120084641 [Benincasa hispida] (e value: 1.2e-80; % identity: 67.21)
Query:  RSDNGTFRPGYLQHLERMLHEKVPGCALNQNIIECKVRSLKKQYNVVSEMLSQSGFGWNEMFKCVQ-----------SHPSAKGMWNKSFPHYDELSTVF
        RSDNGTFR GYLQ+LER+LHEKVPGCALNQN IECKVRSLKKQYN VSEMLSQSGFGWNE FKCVQ           SH +AKGMWNKSF HYD+LSTVF
Subjt:  RSDNGTFRPGYLQHLERMLHEKVPGCALNQNIIECKVRSLKKQYNVVSEMLSQSGFGWNEMFKCVQ-----------SHPSAKGMWNKSFPHYDELSTVF

Query:  VKEEQSQDCHTLEVHQTKSPLNQDGIDEEPTEQSIGRAMYLPSHLEA-----------------------DTYMGRLASWQKENYELEFGRRKEVVNAIY
         K+    +CHT EV Q +SPLNQD IDEEP EQS GRA  L                              T+MGRLASWQKE YELEFGRRKEVVNAIY
Subjt:  VKEEQSQDCHTLEVHQTKSPLNQDGIDEEPTEQSIGRAMYLPSHLEA-----------------------DTYMGRLASWQKENYELEFGRRKEVVNAIY

Query:  NINGLNEDDQVTLIDLFVIDIQKINCFLGVLEHARKRYCLCLLGRNM
        +I+GL+EDDQVT IDL V DIQK +CFL V EHARKRYCL LL RNM
Subjt:  NINGLNEDDQVTLIDLFVIDIQKINCFLGVLEHARKRYCLCLLGRNM

TrEMBL top hits (e value, % identity, alignment)
A0A1S3B4L3 uncharacterized protein LOC103485953 (e value: 4.2e-23; % identity: 30.86)
Query:  RSDNGTFRPGYLQHLERMLHEKVPGCALNQ-NIIECKVRSLKKQYNVVSEML--SQSGFGWNEMFKC-----------VQSHPSAKGMWNKSFPHYDELS
        RSDNGTF+PGYL  L+RM+ EK+PG  + + + I+C V+SLKK Y+ ++EM   S SGFGWNE F+C           ++SHP+AKG+ +KSFP+YD+LS
Subjt:  RSDNGTFRPGYLQHLERMLHEKVPGCALNQ-NIIECKVRSLKKQYNVVSEML--SQSGFGWNEMFKC-----------VQSHPSAKGMWNKSFPHYDELS

Query:  TVFVKEE---------QSQDCHTLEVHQTKSPLNQDGIDEEPTEQSIGRAM------------------------------------YLPSHLE-ADTYM
         VF K+           +   +   +     PL     ++ PT  S G  M                                     + S +E  +  +
Subjt:  TVFVKEE---------QSQDCHTLEVHQTKSPLNQDGIDEEPTEQSIGRAM------------------------------------YLPSHLE-ADTYM

Query:  GRLASWQKENYELEFGRRKEVVNAIYNINGLNEDDQVTLIDLFVIDIQKINCFLGVLEHARKRYCLCLL
          +A W KE   +E   R +VV  + +I  L   D+  L+ +    ++ I  FL +    +  YC  LL
Subjt:  GRLASWQKENYELEFGRRKEVVNAIYNINGLNEDDQVTLIDLFVIDIQKINCFLGVLEHARKRYCLCLL

A0A5A7U0H7 Retrotransposon protein (e value: 4.2e-23; % identity: 30.86)
Query:  RSDNGTFRPGYLQHLERMLHEKVPGCALNQ-NIIECKVRSLKKQYNVVSEML--SQSGFGWNEMFKC-----------VQSHPSAKGMWNKSFPHYDELS
        RSDNGTF+PGYL  L+RM+ EK+PG  + + + I+C V+SLKK Y+ ++EM   S SGFGWNE F+C           ++SHP+AKG+ +KSFP+YD+LS
Subjt:  RSDNGTFRPGYLQHLERMLHEKVPGCALNQ-NIIECKVRSLKKQYNVVSEML--SQSGFGWNEMFKC-----------VQSHPSAKGMWNKSFPHYDELS

Query:  TVFVKEE---------QSQDCHTLEVHQTKSPLNQDGIDEEPTEQSIGRAM------------------------------------YLPSHLE-ADTYM
         VF K+           +   +   +     PL     ++ PT  S G  M                                     + S +E  +  +
Subjt:  TVFVKEE---------QSQDCHTLEVHQTKSPLNQDGIDEEPTEQSIGRAM------------------------------------YLPSHLE-ADTYM

Query:  GRLASWQKENYELEFGRRKEVVNAIYNINGLNEDDQVTLIDLFVIDIQKINCFLGVLEHARKRYCLCLL
          +A W KE   +E   R +VV  + +I  L   D+  L+ +    ++ I  FL +    +  YC  LL
Subjt:  GRLASWQKENYELEFGRRKEVVNAIYNINGLNEDDQVTLIDLFVIDIQKINCFLGVLEHARKRYCLCLL

A0A5A7U216 Retrotransposon protein (e value: 6.0e-22; % identity: 30.89)
Query:  RSDNGTFRPGYLQHLERMLHEKVPGCALNQNIIECKVRSLKKQYNVVSEMLSQ--SGFGWNEMFKCV---------QSHPSAKGMWNKSFPHYDELSTVF
        RSDNGTFRPGYL  L RM+  K+PGC ++ + I+ +++ +K+ ++ ++EM     SGFGWN+  KC+          SHP+AKG+ NKSF HYDELS VF
Subjt:  RSDNGTFRPGYLQHLERMLHEKVPGCALNQNIIECKVRSLKKQYNVVSEMLSQ--SGFGWNEMFKCV---------QSHPSAKGMWNKSFPHYDELSTVF

Query:  VKEEQSQD-CHTLEVHQTKSPLNQDGIDEEPTEQSIGRAMYLPS-HLEADTYM-GRLASWQKENYELEFGR--------------------RKEVVNAIY
         K+  +     +     +  P   D    +    +    MY P  ++  D  M  R A W      +E+G                     R+E+V  + 
Subjt:  VKEEQSQD-CHTLEVHQTKSPLNQDGIDEEPTEQSIGRAMYLPS-HLEADTYM-GRLASWQKENYELEFGR--------------------RKEVVNAIY

Query:  NINGLNEDDQVTLIDLFVIDIQKINCFLGVLEHARKRYCLCLLGRN
         I  L   D+  L+ + + ++  +  FL V ++    YC  +L  N
Subjt:  NINGLNEDDQVTLIDLFVIDIQKINCFLGVLEHARKRYCLCLLGRN

A0A5D3CH30 Retrotransposon protein (e value: 2.3e-21; % identity: 30.38)
Query:  RSDNGTFRPGYLQHLERMLHEKVPGCALNQNIIECKVRSLKKQYNVVSEMLSQ--SGFGWNEMFKCV---------QSHPSAKGMWNKSFPHYDELSTVF
        RSDNGTFRPGYL  L RM+  K+PGC ++ + I+ +++ +K+ ++ ++EM     SGFGWN+  KC+          SHP+ KG+ NKSF HYDELS VF
Subjt:  RSDNGTFRPGYLQHLERMLHEKVPGCALNQNIIECKVRSLKKQYNVVSEMLSQ--SGFGWNEMFKCV---------QSHPSAKGMWNKSFPHYDELSTVF

Query:  VKEE-------------QSQDCHTLEVHQTKSPLNQDGIDEEPTEQSIGRAMYLPSHLE-ADTYMGRLASWQKENYELEFGRRKEVVNAIYNINGLNEDD
         K+               +    T  V + ++     G   + T  +I     + + +E  +  + R+A W     +     R+E+V  +  I  L   D
Subjt:  VKEE-------------QSQDCHTLEVHQTKSPLNQDGIDEEPTEQSIGRAMYLPSHLE-ADTYMGRLASWQKENYELEFGRRKEVVNAIYNINGLNEDD

Query:  QVTLIDLFVIDIQKINCFLGVLEHARKRYCLCLLGRN
        +  L+ + + ++  +  FL V ++ +  YC  +L  N
Subjt:  QVTLIDLFVIDIQKINCFLGVLEHARKRYCLCLLGRN

A0A6V7P4Z9 Myb_DNA-bind_3 domain-containing protein (e value: 4.3e-20; % identity: 30.38)
Query:  RSDNGTFRPGYLQHLERMLHEKVPGCAL-NQNIIECKVRSLKKQYNVVSEML--SQSGFGWNEMFKCVQ-----------SHPSAKGMWNKSFPHYDELS
        R+DNGTFR GYLQ LER +HEK+P C L     IE + +  K+QYN + EML  S SGFGW++  KCV+           SHP+A G+  KSFP+++ELS
Subjt:  RSDNGTFRPGYLQHLERMLHEKVPGCAL-NQNIIECKVRSLKKQYNVVSEML--SQSGFGWNEMFKCVQ-----------SHPSAKGMWNKSFPHYDELS

Query:  TVFVK--------EEQSQDCHTLEVHQTKSPLNQDGIDEE-----------PTEQSIGRAMYLPSHLEA--DTYMGRLASWQKENYELEFG---------
         VF K        E  +     +E  +T    +    DEE           PT  S G++        +  D  +    +    + E  FG         
Subjt:  TVFVK--------EEQSQDCHTLEVHQTKSPLNQDGIDEE-----------PTEQSIGRAMYLPSHLEA--DTYMGRLASWQKENYELEFG---------

Query:  ------------RRKEVVNAIYNINGLNEDDQVTLIDLFVIDIQKINCFLGVLEHARKRY
                    R++++   +  + GL + D++   D+ + D  K+  F G+   +RK Y
Subjt:  ------------RRKEVVNAIYNINGLNEDDQVTLIDLFVIDIQKINCFLGVLEHARKRY

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT4G02550.1 unknown protein (e value: 9.6e-04; % identity: 27.64)
Query:  KVRSLKKQYNVVSEMLSQSGFGWN---EMFKC---------VQSHPSAKGMWNKSFPHYDELSTVF-----------VKEEQS---QDCHTLEVHQTKSP
        +++++KK+Y V+ ++LS+ GF WN   +M  C         +  +P AK    K    Y+EL TV            VK+E S    D    E      P
Subjt:  KVRSLKKQYNVVSEMLSQSGFGWN---EMFKC---------VQSHPSAKGMWNKSFPHYDELSTVF-----------VKEEQS---QDCHTLEVHQTKSP

Query:  L--NQDGIDEEPTEQSIGRAMYL
        L  +++  D + TE   G + Y+
Subjt:  L--NQDGIDEEPTEQSIGRAMYL

AT4G02550.3 unknown protein (e value: 9.6e-04; % identity: 27.64)
Query:  KVRSLKKQYNVVSEMLSQSGFGWN---EMFKC---------VQSHPSAKGMWNKSFPHYDELSTVF-----------VKEEQS---QDCHTLEVHQTKSP
        +++++KK+Y V+ ++LS+ GF WN   +M  C         +  +P AK    K    Y+EL TV            VK+E S    D    E      P
Subjt:  KVRSLKKQYNVVSEMLSQSGFGWN---EMFKC---------VQSHPSAKGMWNKSFPHYDELSTVF-----------VKEEQS---QDCHTLEVHQTKSP

Query:  L--NQDGIDEEPTEQSIGRAMYL
        L  +++  D + TE   G + Y+
Subjt:  L--NQDGIDEEPTEQSIGRAMYL


Sequences
CDS sequence
ATGCAGCATAATGGCAGGTCCGACAATGGGACGTTTCGACCAGGATACCTACAGCACTTGGAGCGAATGCTGCATGAGAAAGTGCCCGGGTGCGCATTGAACCAGAACAT
CATTGAGTGCAAGGTGAGGAGTCTGAAGAAACAATACAACGTAGTATCAGAGATGTTAAGTCAGTCGGGGTTCGGGTGGAATGAGATGTTCAAATGTGTCCAGAGTCATC
CTAGTGCGAAGGGGATGTGGAACAAGTCATTCCCCCATTACGATGAACTTTCCACCGTATTTGTGAAAGAAGAGCAGTCACAGGACTGTCACACACTTGAGGTTCACCAG
ACAAAATCACCATTAAATCAAGATGGAATAGATGAAGAGCCAACAGAGCAATCTATAGGTAGAGCGATGTACCTGCCAAGTCATCTCGAGGCAGACACATACATGGGAAG
ACTTGCATCGTGGCAGAAGGAAAATTATGAGTTGGAGTTTGGGCGTCGGAAGGAAGTAGTAAACGCCATATACAACATTAATGGCTTGAATGAGGATGATCAGGTCACCC
TTATTGACCTCTTTGTCATAGACATTCAGAAGATAAATTGTTTCCTTGGAGTACTGGAACATGCACGGAAGAGGTATTGCCTCTGTCTACTAGGACGAAACATGTAG
mRNA sequence
ATGCAGCATAATGGCAGGTCCGACAATGGGACGTTTCGACCAGGATACCTACAGCACTTGGAGCGAATGCTGCATGAGAAAGTGCCCGGGTGCGCATTGAACCAGAACAT
CATTGAGTGCAAGGTGAGGAGTCTGAAGAAACAATACAACGTAGTATCAGAGATGTTAAGTCAGTCGGGGTTCGGGTGGAATGAGATGTTCAAATGTGTCCAGAGTCATC
CTAGTGCGAAGGGGATGTGGAACAAGTCATTCCCCCATTACGATGAACTTTCCACCGTATTTGTGAAAGAAGAGCAGTCACAGGACTGTCACACACTTGAGGTTCACCAG
ACAAAATCACCATTAAATCAAGATGGAATAGATGAAGAGCCAACAGAGCAATCTATAGGTAGAGCGATGTACCTGCCAAGTCATCTCGAGGCAGACACATACATGGGAAG
ACTTGCATCGTGGCAGAAGGAAAATTATGAGTTGGAGTTTGGGCGTCGGAAGGAAGTAGTAAACGCCATATACAACATTAATGGCTTGAATGAGGATGATCAGGTCACCC
TTATTGACCTCTTTGTCATAGACATTCAGAAGATAAATTGTTTCCTTGGAGTACTGGAACATGCACGGAAGAGGTATTGCCTCTGTCTACTAGGACGAAACATGTAG
Protein sequence
MQHNGRSDNGTFRPGYLQHLERMLHEKVPGCALNQNIIECKVRSLKKQYNVVSEMLSQSGFGWNEMFKCVQSHPSAKGMWNKSFPHYDELSTVFVKEEQSQDCHTLEVHQ
TKSPLNQDGIDEEPTEQSIGRAMYLPSHLEADTYMGRLASWQKENYELEFGRRKEVVNAIYNINGLNEDDQVTLIDLFVIDIQKINCFLGVLEHARKRYCLCLLGRNM
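
As a sanity check on the sequences listed for this gene, the CDS can be translated and compared against the protein sequence. The Python sketch below is illustrative only: `translate` and `CODON_TABLE` are hypothetical names, the standard nuclear genetic code is assumed, and only the first 60 nt and the last 9 nt of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed here).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 nt of the CDS and the first 20 residues of the protein, from the record.
cds_start = "ATGCAGCATAATGGCAGGTCCGACAATGGGACGTTTCGACCAGGATACCTACAGCACTTG"
protein_start = "MQHNGRSDNGTFRPGYLQHL"
print(translate(cds_start) == protein_start)  # True

# Last 9 nt of the CDS: two codons followed by the TAG stop codon.
print(translate("AACATGTAG"))  # NM
```

Passing the full CDS (line breaks included) to `translate` and comparing the result with the full protein sequence extends the same check to the whole record.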