; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG06G010470 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG06G010470
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationCG_Chr06:22390927..22394611
RNA-Seq ExpressionClCG06G010470
SyntenyClCG06G010470
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
RVW63383.1 hypothetical protein CK203_055887 [Vitis vinifera]4.1e-1234.27Show/hide
Query:  LLWKNQLLNLIMGHGLESFID------GSFPALLVITH----------CLSFLLQLASRGFGLSYKELRKMVTLSNNMLFKLKMLRIKFCAIGKPLSYFY
        L+WKNQLLN+I+ + L  FID       S P   +  H           + F L L  RG     + L+K        + KLK +     AIG+P+S   
Subjt:  LLWKNQLLNLIMGHGLESFID------GSFPALLVITH----------CLSFLLQLASRGFGLSYKELRKMVTLSNNMLFKLKMLRIKFCAIGKPLSYFY

Query:  YLGYILEGLGNEYYPFVITIHNCTDGPSITDVPILLWAYEARLEKQTLVNQLNLIQANVANLSISQNSCRPKWPPYDK
        +L Y+  GL  EY PFV +I N +D P+I  +  LL +Y+ RLE+Q  V+ LN  Q ++A+L+       P+ PP+++
Subjt:  YLGYILEGLGNEYYPFVITIHNCTDGPSITDVPILLWAYEARLEKQTLVNQLNLIQANVANLSISQNSCRPKWPPYDK

TXG68750.1 hypothetical protein EZV62_003685 [Acer yangbiense]6.0e-1151.72Show/hide
Query:  LRKMVTLSNNMLFKLKMLRIKFCAIGKPLSYFYYLGYILEGLGNEYYPFVITIHNCTDGPSITDVPILLWAYEARLEKQTLVNQLNL
        L+K  T  N  LF+ K +  KF AIG+PLSY  +LGY+LEGLG+EY  FV +I N  D PSI DV  LL ++E RL K+TL ++ +L
Subjt:  LRKMVTLSNNMLFKLKMLRIKFCAIGKPLSYFYYLGYILEGLGNEYYPFVITIHNCTDGPSITDVPILLWAYEARLEKQTLVNQLNL

XP_022148871.1 uncharacterized protein LOC111017438 [Momordica charantia]2.2e-1336.36Show/hide
Query:  LFKLKMLRIKFCAIGKPLSYFYYLGYILEGLGNEYYPFVITIHNCTDGPSITDVPILLWAYEARLEKQTLVNQLNLIQANVANLSISQNSCRPKWPPYDK
        L K+K +  K  +IG+P+S   ++ YI+EGLG EY  FV +I N +D  ++ DV  LL AY+ RLEKQ  V+QLN++QANVANL +++ S   +      
Subjt:  LFKLKMLRIKFCAIGKPLSYFYYLGYILEGLGNEYYPFVITIHNCTDGPSITDVPILLWAYEARLEKQTLVNQLNLIQANVANLSISQNSCRPKWPPYDK

Query:  SFTPTTLPMLGFVPFFIVVLSGPDILDNARPLIPLSNFQTIVLNNLINLQTVVLNICSKLGHIALVCYSRNNPLYQ
          +P         PF      G     N     P    Q    N  +  Q     IC KLGH    CY R N  Y+
Subjt:  SFTPTTLPMLGFVPFFIVVLSGPDILDNARPLIPLSNFQTIVLNNLINLQTVVLNICSKLGHIALVCYSRNNPLYQ

XP_022155181.1 uncharacterized protein LOC111022315 [Momordica charantia]2.0e-2733.54Show/hide
Query:  PCSNELFDTTCLLWKNQLLNLIMGHGLESFIDGSF---------------PAL----------------------------LVITHCLSFLL------QL
        P + +L D   LLWKNQLLN ++ +GL  ++DG+                PA                             L  TH +   L      + 
Subjt:  PCSNELFDTTCLLWKNQLLNLIMGHGLESFIDGSF---------------PAL----------------------------LVITHCLSFLL------QL

Query:  ASRGFGL--SYKELRKMVTLSNNMLFKLKMLRIKFCAIGKPLSYFYYLGYILEGLGNEYYPFVITIHNCTDGPSITDVPILLWAYEARLEKQTLVNQLNL
         +R  GL    + LRK  +  +  L K+K +  KF A+G+PLSY  +L ++L+GLG+EY  FV +IHN  D PS+ DV  LL AYEARL+KQ  V+QLN+
Subjt:  ASRGFGL--SYKELRKMVTLSNNMLFKLKMLRIKFCAIGKPLSYFYYLGYILEGLGNEYYPFVITIHNCTDGPSITDVPILLWAYEARLEKQTLVNQLNL

Query:  IQANVANLSISQNSCRPKWPP-------YDKSFTPTTLPMLGFVPFFIVVLSGPDILDNARPLIPLSNFQTIVLNNLINLQTVVLNICSKLGHIALVCYS
         QAN+ NLS+  NS RP  PP       Y  SF  + +           +L  P  +    P    S  Q                IC KLGH A VCY 
Subjt:  IQANVANLSISQNSCRPKWPP-------YDKSFTPTTLPMLGFVPFFIVVLSGPDILDNARPLIPLSNFQTIVLNNLINLQTVVLNICSKLGHIALVCYS

Query:  RNNPLYQASSPTLCAHFIQ
        R N  Y  +SP    H +Q
Subjt:  RNNPLYQASSPTLCAHFIQ

XP_038887133.1 uncharacterized protein LOC120077323 [Benincasa hispida]1.4e-2045.86Show/hide
Query:  GFGLSYKELRKMVTLSNNMLFKLKMLRIKFCAIGKPLSYFYYLGYILEGLGNEYYPFVITIHNCTDGPSITDVPILLWAYEARLEKQTLVNQLNLIQANV
        GF    ++++K     +  L ++K +   F AIG+PLSY  +L YILEGLG+EY PFV +IHN T+ PSI DV  LL  Y++RLEKQT  + L LIQANV
Subjt:  GFGLSYKELRKMVTLSNNMLFKLKMLRIKFCAIGKPLSYFYYLGYILEGLGNEYYPFVITIHNCTDGPSITDVPILLWAYEARLEKQTLVNQLNLIQANV

Query:  ANLSISQNSCRPKWPPYDKSFTPTTLPMLGFVP
        A+LSI+  +  P+W  +++S   ++ P +G  P
Subjt:  ANLSISQNSCRPKWPPYDKSFTPTTLPMLGFVP

TrEMBL top hitse value%identityAlignment
A0A438FTV3 Uncharacterized protein2.0e-1234.27Show/hide
Query:  LLWKNQLLNLIMGHGLESFID------GSFPALLVITH----------CLSFLLQLASRGFGLSYKELRKMVTLSNNMLFKLKMLRIKFCAIGKPLSYFY
        L+WKNQLLN+I+ + L  FID       S P   +  H           + F L L  RG     + L+K        + KLK +     AIG+P+S   
Subjt:  LLWKNQLLNLIMGHGLESFID------GSFPALLVITH----------CLSFLLQLASRGFGLSYKELRKMVTLSNNMLFKLKMLRIKFCAIGKPLSYFY

Query:  YLGYILEGLGNEYYPFVITIHNCTDGPSITDVPILLWAYEARLEKQTLVNQLNLIQANVANLSISQNSCRPKWPPYDK
        +L Y+  GL  EY PFV +I N +D P+I  +  LL +Y+ RLE+Q  V+ LN  Q ++A+L+       P+ PP+++
Subjt:  YLGYILEGLGNEYYPFVITIHNCTDGPSITDVPILLWAYEARLEKQTLVNQLNLIQANVANLSISQNSCRPKWPPYDK

A0A5C7IHH0 Uncharacterized protein2.9e-1151.72Show/hide
Query:  LRKMVTLSNNMLFKLKMLRIKFCAIGKPLSYFYYLGYILEGLGNEYYPFVITIHNCTDGPSITDVPILLWAYEARLEKQTLVNQLNL
        L+K  T  N  LF+ K +  KF AIG+PLSY  +LGY+LEGLG+EY  FV +I N  D PSI DV  LL ++E RL K+TL ++ +L
Subjt:  LRKMVTLSNNMLFKLKMLRIKFCAIGKPLSYFYYLGYILEGLGNEYYPFVITIHNCTDGPSITDVPILLWAYEARLEKQTLVNQLNL

A0A6J1D6N7 uncharacterized protein LOC1110174381.1e-1336.36Show/hide
Query:  LFKLKMLRIKFCAIGKPLSYFYYLGYILEGLGNEYYPFVITIHNCTDGPSITDVPILLWAYEARLEKQTLVNQLNLIQANVANLSISQNSCRPKWPPYDK
        L K+K +  K  +IG+P+S   ++ YI+EGLG EY  FV +I N +D  ++ DV  LL AY+ RLEKQ  V+QLN++QANVANL +++ S   +      
Subjt:  LFKLKMLRIKFCAIGKPLSYFYYLGYILEGLGNEYYPFVITIHNCTDGPSITDVPILLWAYEARLEKQTLVNQLNLIQANVANLSISQNSCRPKWPPYDK

Query:  SFTPTTLPMLGFVPFFIVVLSGPDILDNARPLIPLSNFQTIVLNNLINLQTVVLNICSKLGHIALVCYSRNNPLYQ
          +P         PF      G     N     P    Q    N  +  Q     IC KLGH    CY R N  Y+
Subjt:  SFTPTTLPMLGFVPFFIVVLSGPDILDNARPLIPLSNFQTIVLNNLINLQTVVLNICSKLGHIALVCYSRNNPLYQ

A0A6J1DQX7 uncharacterized protein LOC1110223159.9e-2833.54Show/hide
Query:  PCSNELFDTTCLLWKNQLLNLIMGHGLESFIDGSF---------------PAL----------------------------LVITHCLSFLL------QL
        P + +L D   LLWKNQLLN ++ +GL  ++DG+                PA                             L  TH +   L      + 
Subjt:  PCSNELFDTTCLLWKNQLLNLIMGHGLESFIDGSF---------------PAL----------------------------LVITHCLSFLL------QL

Query:  ASRGFGL--SYKELRKMVTLSNNMLFKLKMLRIKFCAIGKPLSYFYYLGYILEGLGNEYYPFVITIHNCTDGPSITDVPILLWAYEARLEKQTLVNQLNL
         +R  GL    + LRK  +  +  L K+K +  KF A+G+PLSY  +L ++L+GLG+EY  FV +IHN  D PS+ DV  LL AYEARL+KQ  V+QLN+
Subjt:  ASRGFGL--SYKELRKMVTLSNNMLFKLKMLRIKFCAIGKPLSYFYYLGYILEGLGNEYYPFVITIHNCTDGPSITDVPILLWAYEARLEKQTLVNQLNL

Query:  IQANVANLSISQNSCRPKWPP-------YDKSFTPTTLPMLGFVPFFIVVLSGPDILDNARPLIPLSNFQTIVLNNLINLQTVVLNICSKLGHIALVCYS
         QAN+ NLS+  NS RP  PP       Y  SF  + +           +L  P  +    P    S  Q                IC KLGH A VCY 
Subjt:  IQANVANLSISQNSCRPKWPP-------YDKSFTPTTLPMLGFVPFFIVVLSGPDILDNARPLIPLSNFQTIVLNNLINLQTVVLNICSKLGHIALVCYS

Query:  RNNPLYQASSPTLCAHFIQ
        R N  Y  +SP    H +Q
Subjt:  RNNPLYQASSPTLCAHFIQ

A0A803NL56 Uncharacterized protein4.2e-1029.29Show/hide
Query:  SNELFDTTCLLWKNQLLNLIMGHGLESFIDGS------FP----------------------ALLVITHCLSFLLQLA----------------------
        S +L DT  L+W+ Q+ N+I+ +GLE +IDG+      FP                      + L  +   S L Q+                       
Subjt:  SNELFDTTCLLWKNQLLNLIMGHGLESFIDGS------FP----------------------ALLVITHCLSFLLQLA----------------------

Query:  -SRGFGLSYKELRKMVTLSNNMLFKLKMLRIKFCAIGKPLSYFYYLGYILEGLGNEYYPFVITIHNCTDGPSITDVPILLWAYEARLEKQTLVNQLNLIQ
         S  F ++ + L+K    ++  L KLK L     ++G P+S   +L Y+L GLG EY  FV  I      P+I +V  LL +YEARLE+Q      + +Q
Subjt:  -SRGFGLSYKELRKMVTLSNNMLFKLKMLRIKFCAIGKPLSYFYYLGYILEGLGNEYYPFVITIHNCTDGPSITDVPILLWAYEARLEKQTLVNQLNLIQ

Query:  ANVANLSI---------SQNSCRPKWPPYDKSFTPTTLP
        AN ANLS           Q S +P++P + +   P T+P
Subjt:  ANVANLSI---------SQNSCRPKWPPYDKSFTPTTLP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAACAAAGAACAATTCCGGTTCCAATTTGGAATTTTTCTTCATGGAGAAAGAGCTTTATGACAAGGAATTCCTCGTGTCTAAAGAGCTATTTTCTCCTTGTTCAAA
TGAGCTCTTTGACACTACTTGTTTACTCTGGAAGAATCAACTTCTTAATCTAATTATGGGTCATGGTCTTGAAAGCTTCATTGATGGTAGCTTTCCTGCCCTCCTCGTCA
TTACACATTGTTTGAGTTTTCTTCTACAACTTGCATCACGGGGCTTTGGTCTCAGCTACAAAGAGTTAAGAAAGATGGTCACTCTATCTAACAATATGTTGTTCAAATTA
AAGATGTTGCGGATAAAGTTCTGTGCTATTGGCAAGCCTTTATCTTATTTTTATTATCTGGGTTATATCCTTGAAGGTCTAGGTAATGAGTATTATCCATTTGTCATTAC
TATTCATAATTGCACTGATGGACCCTCTATTACAGATGTTCCCATCCTTCTTTGGGCGTATGAGGCTCGCTTGGAGAAACAAACTTTGGTTAATCAACTCAATCTTATTC
AGGCTAATGTTGCCAATTTGTCCATATCACAAAACTCTTGTCGACCAAAATGGCCTCCATACGATAAGTCTTTTACTCCTACCACTCTGCCAATGCTTGGGTTTGTGCCA
TTTTTTATTGTTGTTCTTTCTGGTCCTGATATTCTTGATAATGCTCGCCCTCTTATCCCTTTGTCCAACTTCCAAACAATTGTCCTCAATAACCTTATAAACCTCCAAAC
AGTTGTCCTCAATATTTGTAGTAAGCTTGGGCACATAGCCCTTGTTTGCTACAGTCGAAACAATCCTCTTTATCAAGCATCCTCACCCACTCTTTGTGCCCATTTTATCC
AA
mRNA sequenceShow/hide mRNA sequence
ATGGGAACAAAGAACAATTCCGGTTCCAATTTGGAATTTTTCTTCATGGAGAAAGAGCTTTATGACAAGGAATTCCTCGTGTCTAAAGAGCTATTTTCTCCTTGTTCAAA
TGAGCTCTTTGACACTACTTGTTTACTCTGGAAGAATCAACTTCTTAATCTAATTATGGGTCATGGTCTTGAAAGCTTCATTGATGGTAGCTTTCCTGCCCTCCTCGTCA
TTACACATTGTTTGAGTTTTCTTCTACAACTTGCATCACGGGGCTTTGGTCTCAGCTACAAAGAGTTAAGAAAGATGGTCACTCTATCTAACAATATGTTGTTCAAATTA
AAGATGTTGCGGATAAAGTTCTGTGCTATTGGCAAGCCTTTATCTTATTTTTATTATCTGGGTTATATCCTTGAAGGTCTAGGTAATGAGTATTATCCATTTGTCATTAC
TATTCATAATTGCACTGATGGACCCTCTATTACAGATGTTCCCATCCTTCTTTGGGCGTATGAGGCTCGCTTGGAGAAACAAACTTTGGTTAATCAACTCAATCTTATTC
AGGCTAATGTTGCCAATTTGTCCATATCACAAAACTCTTGTCGACCAAAATGGCCTCCATACGATAAGTCTTTTACTCCTACCACTCTGCCAATGCTTGGGTTTGTGCCA
TTTTTTATTGTTGTTCTTTCTGGTCCTGATATTCTTGATAATGCTCGCCCTCTTATCCCTTTGTCCAACTTCCAAACAATTGTCCTCAATAACCTTATAAACCTCCAAAC
AGTTGTCCTCAATATTTGTAGTAAGCTTGGGCACATAGCCCTTGTTTGCTACAGTCGAAACAATCCTCTTTATCAAGCATCCTCACCCACTCTTTGTGCCCATTTTATCC
AA
Protein sequenceShow/hide protein sequence
MGTKNNSGSNLEFFFMEKELYDKEFLVSKELFSPCSNELFDTTCLLWKNQLLNLIMGHGLESFIDGSFPALLVITHCLSFLLQLASRGFGLSYKELRKMVTLSNNMLFKL
KMLRIKFCAIGKPLSYFYYLGYILEGLGNEYYPFVITIHNCTDGPSITDVPILLWAYEARLEKQTLVNQLNLIQANVANLSISQNSCRPKWPPYDKSFTPTTLPMLGFVP
FFIVVLSGPDILDNARPLIPLSNFQTIVLNNLINLQTVVLNICSKLGHIALVCYSRNNPLYQASSPTLCAHFIQ