; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG03G012020 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG03G012020
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionRetrotrans_gag domain-containing protein
Genome locationCG_Chr03:24078359..24079286
RNA-Seq ExpressionClCG03G012020
SyntenyClCG03G012020
Gene Ontology termsGO:0044237 - cellular metabolic process (biological process)
GO:0016020 - membrane (cellular component)
GO:0016740 - transferase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035612.1 No apical meristem (NAM) protein [Cucumis melo var. makuwa]9.2e-2239.91Show/hide
Query:  MLMAFSDKNKAGFLRGSIQKPRSQANLLSAWECNNGIIASWILNSISEDIVASIVDNGS----------VKEVWDELPNHVKQDNGTQIYQLHKE---LV
        MLMA S +NKAGF+ G IQKP S   LL AW CNN I+ASWILNS+S++I ASI+  GS            +    L    +Q     I     +   L 
Subjt:  MLMAFSDKNKAGFLRGSIQKPRSQANLLSAWECNNGIIASWILNSISEDIVASIVDNGS----------VKEVWDELPNHVKQDNGTQIYQLHKE---LV

Query:  TTRQVDSQKATS-DSNRKMDNRCPPLICLNYGIKGHNLHRCYKLQSYPLGYKFQSSNCSAGGESLNSLALKSTSIPIAATITSSTPNIFSSLNVSQYSQL
        TT+ +    A S D NRK +       C   GIKGH   +CYK   YP GYK ++SN        N +A  +      +T  + +P+ FSSLN  QYSQL
Subjt:  TTRQVDSQKATS-DSNRKMDNRCPPLICLNYGIKGHNLHRCYKLQSYPLGYKFQSSNCSAGGESLNSLALKSTSIPIAATITSSTPNIFSSLNVSQYSQL

Query:  MDMHFSHLHAAKLNPSLL
        M +  +HL AA   P  L
Subjt:  MDMHFSHLHAAKLNPSLL

KAA0054564.1 uncharacterized protein E6C27_scaffold24G003560 [Cucumis melo var. makuwa]1.3e-2059.34Show/hide
Query:  MLMAFSDKNKAGFLRGSIQKPRSQANLLSAWECNNGIIASWILNSISEDIVASIVDNGSVKEVWDELPNHVKQDNGTQIYQLHKELVTTRQ
        M +  S KNK GF+  SI+KP S+ NLLSAW+CN+ +IASWI+NSIS++I AS+V +GSVKEVWDEL    +Q NG  IYQL ++LVT  Q
Subjt:  MLMAFSDKNKAGFLRGSIQKPRSQANLLSAWECNNGIIASWILNSISEDIVASIVDNGSVKEVWDELPNHVKQDNGTQIYQLHKELVTTRQ

KAA0065480.1 Cysteine-rich RLK (receptor-like protein kinase) 8 [Cucumis melo var. makuwa]8.9e-2532.68Show/hide
Query:  MLMAFSDKNKAGFLRGSIQKPRSQANLLSAWECNNGIIASWILNSISEDIVASIVDNGSVKEVWDELPNHVKQDNGTQIYQLHKELVTTRQ---------
        MLMA S +NKAGF+ G IQKP S   LL AW CNN I+ASWILNS+S++I ASI+  GS+KE+WDEL    KQ NG  IYQL KE VT RQ         
Subjt:  MLMAFSDKNKAGFLRGSIQKPRSQANLLSAWECNNGIIASWILNSISEDIVASIVDNGSVKEVWDELPNHVKQDNGTQIYQLHKELVTTRQ---------

Query:  ---------------------------------------------------------------------------------------------VDSQKAT
                                                                                                       S   +
Subjt:  ---------------------------------------------------------------------------------------------VDSQKAT

Query:  SDSNRKMDNRCPPLICLNYGIKGHNLHRCYKLQSYPLGYKFQSSNCSAGGESLNSLALKSTSIPIAATITSS---TPNIFSSLNVSQYSQLMDMHFSHLH
        +D NRK +       C   GIKGH   +CYK   YP GYK ++SN      S+ +    S +  +A T +++   +P+ FSSLN  QYSQLM +  +HL 
Subjt:  SDSNRKMDNRCPPLICLNYGIKGHNLHRCYKLQSYPLGYKFQSSNCSAGGESLNSLALKSTSIPIAATITSS---TPNIFSSLNVSQYSQLMDMHFSHLH

Query:  AAKLNP
        AA   P
Subjt:  AAKLNP

XP_022145891.1 uncharacterized protein LOC111015239 [Momordica charantia]5.8e-2430.38Show/hide
Query:  MLMAFSDKNKAGFLRGSIQKPRSQANLLSAWECNNGIIASWILNSISEDIVASIVDNGSVKEVWDELPNHVKQDNGTQIYQLHKELVTTRQ---------
        ML+A S KNK GF+ G+I+KP    NLL+AW+CNN II SWI+NS+S++I ASI+  GS K++WDEL    +Q +  +I+QL KELVTT Q         
Subjt:  MLMAFSDKNKAGFLRGSIQKPRSQANLLSAWECNNGIIASWILNSISEDIVASIVDNGSVKEVWDELPNHVKQDNGTQIYQLHKELVTTRQ---------

Query:  ----------------VD------------------------------------------------------------------------------SQKA
                        +D                                                                              S++ 
Subjt:  ----------------VD------------------------------------------------------------------------------SQKA

Query:  TSDSNRKMDNRCPPLICLNYGIKGHNLHRCYKLQSYPLGYKFQSSNCSAG------------GESLNSLALKS---TSIPIAATITSSTPNIFSSLNVSQ
        ++   R+ DNR     C + G++GH + +CYKL  YP GY+  +     G            G   N ++ K+   TS P     ++S+P  F+SLN SQ
Subjt:  TSDSNRKMDNRCPPLICLNYGIKGHNLHRCYKLQSYPLGYKFQSSNCSAG------------GESLNSLALKS---TSIPIAATITSSTPNIFSSLNVSQ

Query:  YSQLMDMHFSHLHAAK
        YSQLM+M  SHL AAK
Subjt:  YSQLMDMHFSHLHAAK

XP_038888312.1 uncharacterized protein LOC120078158 [Benincasa hispida]1.6e-1853.85Show/hide
Query:  MLMAFSDKNKAGFLRGSIQKPRSQANLLSAWECNNGIIASWILNSISEDIVASIVDNGSVKEVWDELPNHVKQDNGTQIYQLHKELVTTRQ
        M +  S KNK GF+ G+I+KP  + +L SAW CNN +I SWI+NS+S++I  S+V  GSVKE+WDEL     Q NG  IYQL K+L TT Q
Subjt:  MLMAFSDKNKAGFLRGSIQKPRSQANLLSAWECNNGIIASWILNSISEDIVASIVDNGSVKEVWDELPNHVKQDNGTQIYQLHKELVTTRQ

TrEMBL top hitse value%identityAlignment
A0A5A7VE66 Cysteine-rich RLK (Receptor-like protein kinase) 84.3e-2532.68Show/hide
Query:  MLMAFSDKNKAGFLRGSIQKPRSQANLLSAWECNNGIIASWILNSISEDIVASIVDNGSVKEVWDELPNHVKQDNGTQIYQLHKELVTTRQ---------
        MLMA S +NKAGF+ G IQKP S   LL AW CNN I+ASWILNS+S++I ASI+  GS+KE+WDEL    KQ NG  IYQL KE VT RQ         
Subjt:  MLMAFSDKNKAGFLRGSIQKPRSQANLLSAWECNNGIIASWILNSISEDIVASIVDNGSVKEVWDELPNHVKQDNGTQIYQLHKELVTTRQ---------

Query:  ---------------------------------------------------------------------------------------------VDSQKAT
                                                                                                       S   +
Subjt:  ---------------------------------------------------------------------------------------------VDSQKAT

Query:  SDSNRKMDNRCPPLICLNYGIKGHNLHRCYKLQSYPLGYKFQSSNCSAGGESLNSLALKSTSIPIAATITSS---TPNIFSSLNVSQYSQLMDMHFSHLH
        +D NRK +       C   GIKGH   +CYK   YP GYK ++SN      S+ +    S +  +A T +++   +P+ FSSLN  QYSQLM +  +HL 
Subjt:  SDSNRKMDNRCPPLICLNYGIKGHNLHRCYKLQSYPLGYKFQSSNCSAGGESLNSLALKSTSIPIAATITSS---TPNIFSSLNVSQYSQLMDMHFSHLH

Query:  AAKLNP
        AA   P
Subjt:  AAKLNP

A0A5D3D9Z1 Retrotrans_gag domain-containing protein6.4e-2159.34Show/hide
Query:  MLMAFSDKNKAGFLRGSIQKPRSQANLLSAWECNNGIIASWILNSISEDIVASIVDNGSVKEVWDELPNHVKQDNGTQIYQLHKELVTTRQ
        M +  S KNK GF+  SI+KP S+ NLLSAW+CN+ +IASWI+NSIS++I AS+V +GSVKEVWDEL    +Q NG  IYQL ++LVT  Q
Subjt:  MLMAFSDKNKAGFLRGSIQKPRSQANLLSAWECNNGIIASWILNSISEDIVASIVDNGSVKEVWDELPNHVKQDNGTQIYQLHKELVTTRQ

A0A5D3DEP5 Putative glycosyltransferase2.4e-1559.76Show/hide
Query:  MLMAFSDKNKAGFLRGSIQKPRSQANLLSAWECNNGIIASWILNSISEDIVASIVDNGSVKEVWDELPNHVKQDNGTQIYQL
        MLMA S +NKA F+   I+KP  +  LL AW CN  IIASWILNS+S++I ASIV  GSVKE+W+EL    KQ NG  IYQL
Subjt:  MLMAFSDKNKAGFLRGSIQKPRSQANLLSAWECNNGIIASWILNSISEDIVASIVDNGSVKEVWDELPNHVKQDNGTQIYQL

A0A5D3E5P0 No apical meristem (NAM) protein4.4e-2239.91Show/hide
Query:  MLMAFSDKNKAGFLRGSIQKPRSQANLLSAWECNNGIIASWILNSISEDIVASIVDNGS----------VKEVWDELPNHVKQDNGTQIYQLHKE---LV
        MLMA S +NKAGF+ G IQKP S   LL AW CNN I+ASWILNS+S++I ASI+  GS            +    L    +Q     I     +   L 
Subjt:  MLMAFSDKNKAGFLRGSIQKPRSQANLLSAWECNNGIIASWILNSISEDIVASIVDNGS----------VKEVWDELPNHVKQDNGTQIYQLHKE---LV

Query:  TTRQVDSQKATS-DSNRKMDNRCPPLICLNYGIKGHNLHRCYKLQSYPLGYKFQSSNCSAGGESLNSLALKSTSIPIAATITSSTPNIFSSLNVSQYSQL
        TT+ +    A S D NRK +       C   GIKGH   +CYK   YP GYK ++SN        N +A  +      +T  + +P+ FSSLN  QYSQL
Subjt:  TTRQVDSQKATS-DSNRKMDNRCPPLICLNYGIKGHNLHRCYKLQSYPLGYKFQSSNCSAGGESLNSLALKSTSIPIAATITSSTPNIFSSLNVSQYSQL

Query:  MDMHFSHLHAAKLNPSLL
        M +  +HL AA   P  L
Subjt:  MDMHFSHLHAAKLNPSLL

A0A6J1CXR2 uncharacterized protein LOC1110152392.8e-2430.38Show/hide
Query:  MLMAFSDKNKAGFLRGSIQKPRSQANLLSAWECNNGIIASWILNSISEDIVASIVDNGSVKEVWDELPNHVKQDNGTQIYQLHKELVTTRQ---------
        ML+A S KNK GF+ G+I+KP    NLL+AW+CNN II SWI+NS+S++I ASI+  GS K++WDEL    +Q +  +I+QL KELVTT Q         
Subjt:  MLMAFSDKNKAGFLRGSIQKPRSQANLLSAWECNNGIIASWILNSISEDIVASIVDNGSVKEVWDELPNHVKQDNGTQIYQLHKELVTTRQ---------

Query:  ----------------VD------------------------------------------------------------------------------SQKA
                        +D                                                                              S++ 
Subjt:  ----------------VD------------------------------------------------------------------------------SQKA

Query:  TSDSNRKMDNRCPPLICLNYGIKGHNLHRCYKLQSYPLGYKFQSSNCSAG------------GESLNSLALKS---TSIPIAATITSSTPNIFSSLNVSQ
        ++   R+ DNR     C + G++GH + +CYKL  YP GY+  +     G            G   N ++ K+   TS P     ++S+P  F+SLN SQ
Subjt:  TSDSNRKMDNRCPPLICLNYGIKGHNLHRCYKLQSYPLGYKFQSSNCSAG------------GESLNSLALKS---TSIPIAATITSSTPNIFSSLNVSQ

Query:  YSQLMDMHFSHLHAAK
        YSQLM+M  SHL AAK
Subjt:  YSQLMDMHFSHLHAAK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).1.5e-0629.27Show/hide
Query:  KAGFLRGSIQKPRSQANLLSAWECNNGIIASWILNSISEDIVASIVDNGSVKEVWDELPNHVKQDNGTQIYQLHKELVTTRQ
        K GF+ G++ KP   + L   WE  N ++  W++NS+++ ++ S++   +  ++W++L          +IYQL + L T RQ
Subjt:  KAGFLRGSIQKPRSQANLLSAWECNNGIIASWILNSISEDIVASIVDNGSVKEVWDELPNHVKQDNGTQIYQLHKELVTTRQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTAATGGCGTTTTCTGATAAAAACAAAGCAGGTTTCCTTCGTGGATCTATCCAGAAGCCGAGATCTCAAGCGAATTTACTTTCTGCTTGGGAATGTAACAAT
GGCATTATAGCCTCGTGGATATTGAACTCTATTTCTGAGGATATTGTTGCAAGCATTGTCGACAATGGATCAGTGAAGGAAGTTTGGGATGAACTTCCCAATCAT
GTCAAGCAAGATAATGGTACTCAAATTTATCAGTTACACAAGGAATTGGTTACAACACGTCAAGTTGATTCTCAAAAAGCCACTTCTGACAGTAATCGCAAAATG
GATAATCGATGCCCTCCTTTAATCTGCTTGAATTATGGTATCAAAGGACATAACCTCCACCGTTGTTACAAATTGCAAAGTTATCCTCTAGGTTATAAGTTTCAA
TCTTCGAATTGCTCTGCTGGTGGTGAATCTTTGAATTCCCTTGCTCTTAAGAGTACTTCTATTCCTATTGCTGCTACAATTACTTCAAGTACTCCAAACATTTTC
TCAAGCTTGAATGTTTCACAATATAGTCAGCTCATGGATATGCATTTTTCTCATCTTCATGCTGCTAAGCTGAATCCATCGCTACTGCTCCGGTTGCTGCTCATG
TAA
mRNA sequenceShow/hide mRNA sequence
ATGTTAATGGCGTTTTCTGATAAAAACAAAGCAGGTTTCCTTCGTGGATCTATCCAGAAGCCGAGATCTCAAGCGAATTTACTTTCTGCTTGGGAATGTAACAAT
GGCATTATAGCCTCGTGGATATTGAACTCTATTTCTGAGGATATTGTTGCAAGCATTGTCGACAATGGATCAGTGAAGGAAGTTTGGGATGAACTTCCCAATCAT
GTCAAGCAAGATAATGGTACTCAAATTTATCAGTTACACAAGGAATTGGTTACAACACGTCAAGTTGATTCTCAAAAAGCCACTTCTGACAGTAATCGCAAAATG
GATAATCGATGCCCTCCTTTAATCTGCTTGAATTATGGTATCAAAGGACATAACCTCCACCGTTGTTACAAATTGCAAAGTTATCCTCTAGGTTATAAGTTTCAA
TCTTCGAATTGCTCTGCTGGTGGTGAATCTTTGAATTCCCTTGCTCTTAAGAGTACTTCTATTCCTATTGCTGCTACAATTACTTCAAGTACTCCAAACATTTTC
TCAAGCTTGAATGTTTCACAATATAGTCAGCTCATGGATATGCATTTTTCTCATCTTCATGCTGCTAAGCTGAATCCATCGCTACTGCTCCGGTTGCTGCTCATG
TAA
Protein sequenceShow/hide protein sequence
MLMAFSDKNKAGFLRGSIQKPRSQANLLSAWECNNGIIASWILNSISEDIVASIVDNGSVKEVWDELPNHVKQDNGTQIYQLHKELVTTRQVDSQKATSDSNRKM
DNRCPPLICLNYGIKGHNLHRCYKLQSYPLGYKFQSSNCSAGGESLNSLALKSTSIPIAATITSSTPNIFSSLNVSQYSQLMDMHFSHLHAAKLNPSLLLRLLLM