CuGenDBv2

ClCG05G013225 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG05G013225
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: Retrotransposon protein
Genome location: CG_Chr05:18631257..18631962
RNA-Seq Expression: ClCG05G013225
Synteny: ClCG05G013225
Gene Ontology terms: NA
InterPro domains: IPR024752 - Myb/SANT-like domain


Homology
GenBank top hits | e value | % identity | Alignment
KAA0035275.1 retrotransposon protein [Cucumis melo var. makuwa] | e value 8.1e-39 | % identity 51.68
Query:  RVVKHIWTDEEDRILVECLVQFVQSRHWRADNGTFRPGFLANILRMVQQRILGCSIQVSPHLESKVRTLKRQYNMIVEMLGLGCSGFGWNAERKCINYEA
        RV +H+WT EE+R LVECL++ V    W++DNGTFR G+LA ++RM+ +++ GC ++ +  ++ +++TLKR +  I EM G  CSGFGWN E KCI  E 
Subjt:  RVVKHIWTDEEDRILVECLVQFVQSRHWRADNGTFRPGFLANILRMVQQRILGCSIQVSPHLESKVRTLKRQYNMIVEMLGLGCSGFGWNAERKCINYEA

Query:  EIFD--ASSHPSAKRLRHKSFSFYDDLAIVFSKDRATGRRATTTAEVGS
        E+FD    SHP+AKR  +K FS+YD+L  VF +DRATGR A T A+VGS
Subjt:  EIFD--ASSHPSAKRLRHKSFSFYDDLAIVFSKDRATGRRATTTAEVGS

KAA0050293.1 retrotransposon protein [Cucumis melo var. makuwa] | e value 4.7e-39 | % identity 46.51
Query:  RVVKHIWTDEEDRILVECLVQFVQSRHWRADNGTFRPGFLANILRMVQQRILGCSIQVSPHLESKVRTLKRQYNMIVEMLGLGCSGFGWNAERKCINYEA
        R  +H+WT EE+  LVECL++ V    W++DNGTFRPG+LA ++RM+ +++ GC ++ +  ++ +++TLKR +  I EM GL CSGFGWN E KCI  E 
Subjt:  RVVKHIWTDEEDRILVECLVQFVQSRHWRADNGTFRPGFLANILRMVQQRILGCSIQVSPHLESKVRTLKRQYNMIVEMLGLGCSGFGWNAERKCINYEA

Query:  EIFD--ASSHPSAKRLRHKSFSFYDDLAIVFSKDRATGRRATTTAEVGSEPVVDKENEDILNNQSLDFENLY
        E+FD    SHP AK L +K F +YD+L  VF +DRATGR A T A+VGS    D  +   + + + DF  +Y
Subjt:  EIFD--ASSHPSAKRLRHKSFSFYDDLAIVFSKDRATGRRATTTAEVGSEPVVDKENEDILNNQSLDFENLY

KAA0060539.1 retrotransposon protein [Cucumis melo var. makuwa] | e value 1.1e-38 | % identity 51.01
Query:  RVVKHIWTDEEDRILVECLVQFVQSRHWRADNGTFRPGFLANILRMVQQRILGCSIQVSPHLESKVRTLKRQYNMIVEMLGLGCSGFGWNAERKCINYEA
        R  +H+WT EE+  LVECL++ V  R W++DNGTFRPG+LA ++RM+ +++ GC ++ +  ++ +++TLKR +  I EM G  CSGFGW+ E KCI  E 
Subjt:  RVVKHIWTDEEDRILVECLVQFVQSRHWRADNGTFRPGFLANILRMVQQRILGCSIQVSPHLESKVRTLKRQYNMIVEMLGLGCSGFGWNAERKCINYEA

Query:  EIFD--ASSHPSAKRLRHKSFSFYDDLAIVFSKDRATGRRATTTAEVGS
        E+FD    SHP+AK L +KSF +YD+L  VF +DRATGR A T A+VGS
Subjt:  EIFD--ASSHPSAKRLRHKSFSFYDDLAIVFSKDRATGRRATTTAEVGS

KAA0067583.1 retrotransposon protein [Cucumis melo var. makuwa] | e value 1.1e-38 | % identity 50.34
Query:  RVVKHIWTDEEDRILVECLVQFVQSRHWRADNGTFRPGFLANILRMVQQRILGCSIQVSPHLESKVRTLKRQYNMIVEMLGLGCSGFGWNAERKCINYEA
        R  +H+WT EE+  +VECL++ V    W++DNGTFRPG+LA ++RM+ +++ GC ++ +  ++ +++TLKR +  I EM G  CSGFGWN E KCI  E 
Subjt:  RVVKHIWTDEEDRILVECLVQFVQSRHWRADNGTFRPGFLANILRMVQQRILGCSIQVSPHLESKVRTLKRQYNMIVEMLGLGCSGFGWNAERKCINYEA

Query:  EIFD--ASSHPSAKRLRHKSFSFYDDLAIVFSKDRATGRRATTTAEVGS
        E+FD    SHP+AK L +K FS+YD+L  VF +DRATGR A T A+VGS
Subjt:  EIFD--ASSHPSAKRLRHKSFSFYDDLAIVFSKDRATGRRATTTAEVGS

XP_008441954.1 PREDICTED: uncharacterized protein LOC103485953 [Cucumis melo] | e value 6.2e-39 | % identity 49.42
Query:  SRARVVKHIWTDEEDRILVECLVQFVQSRHWRADNGTFRPGFLANILRMVQQRILGCSIQVSPHLESKVRTLKRQYNMIVEMLGLGCSGFGWNAERKCIN
        S +R  KH WT EE+   VECLV+ V S  WR+DNGTF+PG+LA + RM+ +++ G +IQ S  ++  V++LK+ Y+ I EM G  CSGFGWN E +CI 
Subjt:  SRARVVKHIWTDEEDRILVECLVQFVQSRHWRADNGTFRPGFLANILRMVQQRILGCSIQVSPHLESKVRTLKRQYNMIVEMLGLGCSGFGWNAERKCIN

Query:  YEAEIFDA--SSHPSAKRLRHKSFSFYDDLAIVFSKDRATGRRATTTAEVGSE---------PVVDKENEDI
         E ++FD+   SHP+AK L HKSF +YDDL+ VF KDRATG R+ T   VGS          P+ D  +EDI
Subjt:  YEAEIFDA--SSHPSAKRLRHKSFSFYDDLAIVFSKDRATGRRATTTAEVGSE---------PVVDKENEDI

TrEMBL top hits | e value | % identity | Alignment
A0A1S3B4L3 uncharacterized protein LOC103485953 | e value 3.0e-39 | % identity 49.42
Query:  SRARVVKHIWTDEEDRILVECLVQFVQSRHWRADNGTFRPGFLANILRMVQQRILGCSIQVSPHLESKVRTLKRQYNMIVEMLGLGCSGFGWNAERKCIN
        S +R  KH WT EE+   VECLV+ V S  WR+DNGTF+PG+LA + RM+ +++ G +IQ S  ++  V++LK+ Y+ I EM G  CSGFGWN E +CI 
Subjt:  SRARVVKHIWTDEEDRILVECLVQFVQSRHWRADNGTFRPGFLANILRMVQQRILGCSIQVSPHLESKVRTLKRQYNMIVEMLGLGCSGFGWNAERKCIN

Query:  YEAEIFDA--SSHPSAKRLRHKSFSFYDDLAIVFSKDRATGRRATTTAEVGSE---------PVVDKENEDI
         E ++FD+   SHP+AK L HKSF +YDDL+ VF KDRATG R+ T   VGS          P+ D  +EDI
Subjt:  YEAEIFDA--SSHPSAKRLRHKSFSFYDDLAIVFSKDRATGRRATTTAEVGSE---------PVVDKENEDI

A0A5A7T1E6 Retrotransposon protein | e value 3.9e-39 | % identity 51.68
Query:  RVVKHIWTDEEDRILVECLVQFVQSRHWRADNGTFRPGFLANILRMVQQRILGCSIQVSPHLESKVRTLKRQYNMIVEMLGLGCSGFGWNAERKCINYEA
        RV +H+WT EE+R LVECL++ V    W++DNGTFR G+LA ++RM+ +++ GC ++ +  ++ +++TLKR +  I EM G  CSGFGWN E KCI  E 
Subjt:  RVVKHIWTDEEDRILVECLVQFVQSRHWRADNGTFRPGFLANILRMVQQRILGCSIQVSPHLESKVRTLKRQYNMIVEMLGLGCSGFGWNAERKCINYEA

Query:  EIFD--ASSHPSAKRLRHKSFSFYDDLAIVFSKDRATGRRATTTAEVGS
        E+FD    SHP+AKR  +K FS+YD+L  VF +DRATGR A T A+VGS
Subjt:  EIFD--ASSHPSAKRLRHKSFSFYDDLAIVFSKDRATGRRATTTAEVGS

A0A5A7U0H7 Retrotransposon protein | e value 3.0e-39 | % identity 49.42
Query:  SRARVVKHIWTDEEDRILVECLVQFVQSRHWRADNGTFRPGFLANILRMVQQRILGCSIQVSPHLESKVRTLKRQYNMIVEMLGLGCSGFGWNAERKCIN
        S +R  KH WT EE+   VECLV+ V S  WR+DNGTF+PG+LA + RM+ +++ G +IQ S  ++  V++LK+ Y+ I EM G  CSGFGWN E +CI 
Subjt:  SRARVVKHIWTDEEDRILVECLVQFVQSRHWRADNGTFRPGFLANILRMVQQRILGCSIQVSPHLESKVRTLKRQYNMIVEMLGLGCSGFGWNAERKCIN

Query:  YEAEIFDA--SSHPSAKRLRHKSFSFYDDLAIVFSKDRATGRRATTTAEVGSE---------PVVDKENEDI
         E ++FD+   SHP+AK L HKSF +YDDL+ VF KDRATG R+ T   VGS          P+ D  +EDI
Subjt:  YEAEIFDA--SSHPSAKRLRHKSFSFYDDLAIVFSKDRATGRRATTTAEVGSE---------PVVDKENEDI

A0A5A7U308 Retrotransposon protein | e value 2.3e-39 | % identity 46.51
Query:  RVVKHIWTDEEDRILVECLVQFVQSRHWRADNGTFRPGFLANILRMVQQRILGCSIQVSPHLESKVRTLKRQYNMIVEMLGLGCSGFGWNAERKCINYEA
        R  +H+WT EE+  LVECL++ V    W++DNGTFRPG+LA ++RM+ +++ GC ++ +  ++ +++TLKR +  I EM GL CSGFGWN E KCI  E 
Subjt:  RVVKHIWTDEEDRILVECLVQFVQSRHWRADNGTFRPGFLANILRMVQQRILGCSIQVSPHLESKVRTLKRQYNMIVEMLGLGCSGFGWNAERKCINYEA

Query:  EIFD--ASSHPSAKRLRHKSFSFYDDLAIVFSKDRATGRRATTTAEVGSEPVVDKENEDILNNQSLDFENLY
        E+FD    SHP AK L +K F +YD+L  VF +DRATGR A T A+VGS    D  +   + + + DF  +Y
Subjt:  EIFD--ASSHPSAKRLRHKSFSFYDDLAIVFSKDRATGRRATTTAEVGSEPVVDKENEDILNNQSLDFENLY

A0A5A7V480 Retrotransposon protein | e value 5.1e-39 | % identity 51.01
Query:  RVVKHIWTDEEDRILVECLVQFVQSRHWRADNGTFRPGFLANILRMVQQRILGCSIQVSPHLESKVRTLKRQYNMIVEMLGLGCSGFGWNAERKCINYEA
        R  +H+WT EE+  LVECL++ V  R W++DNGTFRPG+LA ++RM+ +++ GC ++ +  ++ +++TLKR +  I EM G  CSGFGW+ E KCI  E 
Subjt:  RVVKHIWTDEEDRILVECLVQFVQSRHWRADNGTFRPGFLANILRMVQQRILGCSIQVSPHLESKVRTLKRQYNMIVEMLGLGCSGFGWNAERKCINYEA

Query:  EIFD--ASSHPSAKRLRHKSFSFYDDLAIVFSKDRATGRRATTTAEVGS
        E+FD    SHP+AK L +KSF +YD+L  VF +DRATGR A T A+VGS
Subjt:  EIFD--ASSHPSAKRLRHKSFSFYDDLAIVFSKDRATGRRATTTAEVGS

SwissProt top hits | e value | % identity | Alignment
No hits found

Arabidopsis top hits | e value | % identity | Alignment
AT5G27260.1 unknown protein | e value 9.6e-14 | % identity 27.55
Query:  WTDEEDRILVECLVQFVQSRHWRADNGTFRPGFLANILRMVQQRILGCSIQVSPHLESKVRTLKRQYNMIVEMLGLGCSGFGWNAERKCINYEAEIFD--
        W+ EE ++LV+ LV+ + + +WR  NGT           M +     C  +   H  S+++ LK QY   +++     SGFGW+   K      E++   
Subjt:  WTDEEDRILVECLVQFVQSRHWRADNGTFRPGFLANILRMVQQRILGCSIQVSPHLESKVRTLKRQYNMIVEMLGLGCSGFGWNAERKCINYEAEIFD--

Query:  ASSHPSAKRLRHKSFSFYDDLAIVFSKDRATGRRATTTAEVGSEPVVDKENEDILNNQSLDFENLY---------IPEHRSPALPH--QMTFQLPP
          +HP+ K+LR+ +F F+D+L I+F +  ATG+ A    +  ++ +  +  E+       DF+N+Y           EH +P + H    + +LPP
Subjt:  ASSHPSAKRLRHKSFSFYDDLAIVFSKDRATGRRATTTAEVGSEPVVDKENEDILNNQSLDFENLY---------IPEHRSPALPH--QMTFQLPP


Sequences
CDS sequence
ATGGAAGCATCCGGATCACGCGCAAGAGTCGTAAAACATATTTGGACGGATGAGGAGGACAGAATCCTTGTGGAGTGTTTGGTCCAGTTTGTGCAGTCTAGACACTGGCG
AGCTGATAACGGGACTTTTCGACCTGGATTCCTAGCAAACATACTACGGATGGTGCAGCAGAGGATTCTGGGGTGTTCCATACAGGTAAGCCCACATTTGGAGTCAAAGG
TTAGGACGTTGAAGAGACAGTACAACATGATCGTTGAAATGTTGGGCCTAGGATGTAGTGGGTTTGGTTGGAATGCGGAGCGCAAATGTATTAACTATGAGGCGGAGATA
TTTGACGCGTCGAGTCATCCGAGTGCAAAAAGACTGCGCCATAAGTCATTTTCGTTCTATGACGACTTGGCCATTGTATTTAGCAAAGACAGAGCCACAGGGCGTCGTGC
AACCACCACTGCAGAGGTCGGATCTGAACCAGTTGTGGACAAGGAGAACGAAGACATCTTGAATAACCAGTCCCTGGACTTTGAGAACTTATATATTCCTGAGCACCGTT
CACCAGCTCTCCCACATCAGATGACATTCCAACTACCCCCAACGGTAGAGGGTCTGGGAGTAGCATGA
mRNA sequence
ATGGAAGCATCCGGATCACGCGCAAGAGTCGTAAAACATATTTGGACGGATGAGGAGGACAGAATCCTTGTGGAGTGTTTGGTCCAGTTTGTGCAGTCTAGACACTGGCG
AGCTGATAACGGGACTTTTCGACCTGGATTCCTAGCAAACATACTACGGATGGTGCAGCAGAGGATTCTGGGGTGTTCCATACAGGTAAGCCCACATTTGGAGTCAAAGG
TTAGGACGTTGAAGAGACAGTACAACATGATCGTTGAAATGTTGGGCCTAGGATGTAGTGGGTTTGGTTGGAATGCGGAGCGCAAATGTATTAACTATGAGGCGGAGATA
TTTGACGCGTCGAGTCATCCGAGTGCAAAAAGACTGCGCCATAAGTCATTTTCGTTCTATGACGACTTGGCCATTGTATTTAGCAAAGACAGAGCCACAGGGCGTCGTGC
AACCACCACTGCAGAGGTCGGATCTGAACCAGTTGTGGACAAGGAGAACGAAGACATCTTGAATAACCAGTCCCTGGACTTTGAGAACTTATATATTCCTGAGCACCGTT
CACCAGCTCTCCCACATCAGATGACATTCCAACTACCCCCAACGGTAGAGGGTCTGGGAGTAGCATGA
Protein sequence
MEASGSRARVVKHIWTDEEDRILVECLVQFVQSRHWRADNGTFRPGFLANILRMVQQRILGCSIQVSPHLESKVRTLKRQYNMIVEMLGLGCSGFGWNAERKCINYEAEI
FDASSHPSAKRLRHKSFSFYDDLAIVFSKDRATGRRATTTAEVGSEPVVDKENEDILNNQSLDFENLYIPEHRSPALPHQMTFQLPPTVEGLGVA
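
The CDS and protein sequences above can be cross-checked against each other. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene. The names `CODON_TABLE`, `translate`, `CDS`, and `PROTEIN` are illustrative and are not part of CuGenDB.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1),
# with codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an upper-case CDS, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the open reading frame
            break
        residues.append(aa)
    return "".join(residues)


# CDS of ClCG05G013225, copied from the record above.
CDS = (
    "ATGGAAGCATCCGGATCACGCGCAAGAGTCGTAAAACATATTTGGACGGATGAGGAGGACAGAATCCTTGTGGAGTGTTTGGTCCAGTTTGTGCAGTCTAGACACTGGCG"
    "AGCTGATAACGGGACTTTTCGACCTGGATTCCTAGCAAACATACTACGGATGGTGCAGCAGAGGATTCTGGGGTGTTCCATACAGGTAAGCCCACATTTGGAGTCAAAGG"
    "TTAGGACGTTGAAGAGACAGTACAACATGATCGTTGAAATGTTGGGCCTAGGATGTAGTGGGTTTGGTTGGAATGCGGAGCGCAAATGTATTAACTATGAGGCGGAGATA"
    "TTTGACGCGTCGAGTCATCCGAGTGCAAAAAGACTGCGCCATAAGTCATTTTCGTTCTATGACGACTTGGCCATTGTATTTAGCAAAGACAGAGCCACAGGGCGTCGTGC"
    "AACCACCACTGCAGAGGTCGGATCTGAACCAGTTGTGGACAAGGAGAACGAAGACATCTTGAATAACCAGTCCCTGGACTTTGAGAACTTATATATTCCTGAGCACCGTT"
    "CACCAGCTCTCCCACATCAGATGACATTCCAACTACCCCCAACGGTAGAGGGTCTGGGAGTAGCATGA"
)

# Protein sequence of ClCG05G013225, copied from the record above.
PROTEIN = (
    "MEASGSRARVVKHIWTDEEDRILVECLVQFVQSRHWRADNGTFRPGFLANILRMVQQRILGCSIQVSPHLESKVRTLKRQYNMIVEMLGLGCSGFGWNAERKCINYEAEI"
    "FDASSHPSAKRLRHKSFSFYDDLAIVFSKDRATGRRATTTAEVGSEPVVDKENEDILNNQSLDFENLYIPEHRSPALPHQMTFQLPPTVEGLGVA"
)

if __name__ == "__main__":
    # Report sequence lengths and whether the translation matches the record.
    print("CDS length (nt):", len(CDS))
    print("Protein length (aa):", len(PROTEIN))
    print("Translation matches protein:", translate(CDS) == PROTEIN)
```

The codon table is built from the compact 64-letter amino-acid string rather than typed out codon by codon, which keeps the sketch short. A library such as Biopython could be swapped in if it is available.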