CuGenDBv2

ClCG09G012880 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG09G012880
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: CG_Chr09:15805786..15806871
RNA-Seq Expression: ClCG09G012880
Synteny: ClCG09G012880
Gene Ontology terms: NA
InterPro domains: NA
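The genome location field above uses a `seqid:start..end` notation. As a minimal sketch of how such a string could be split into its parts, the following assumes the coordinates are 1-based and inclusive (an assumption, not stated on this page); the names `Location` and `parse_location` are hypothetical helpers introduced only for illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; coordinates are ASSUMED 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive interval, so both end points count.
        return self.end - self.start + 1


# Matches strings such as "CG_Chr09:15805786..15806871".
_LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string into a Location."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start is greater than end in: {text!r}")
    return Location(match["seqid"], start, end)


loc = parse_location("CG_Chr09:15805786..15806871")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the interval for this gene spans 1086 positions.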


Homology
GenBank top hits (hit, e-value, % identity, alignment)
KAA0036141.1 retrotransposon protein, putative, Ty1-copia subclass [Cucumis melo var. makuwa] (e-value: 9.1e-20; % identity: 37.44)
Query:  MTLKVGVQLMGFENAQSLWEAIHAPLSVQSRAEEDCLRHVFQQT-------------FSQVLLRLDEEYNPV----LVSELLSFEKQLEHQNTTKSTVSF
        MT +V  QLMGF  A+ LWEAI     VQSR EED LRH FQ T                 +  L +E   +    + SELL FEK+LEHQN+ K   S 
Subjt:  MTLKVGVQLMGFENAQSLWEAIHAPLSVQSRAEEDCLRHVFQQT-------------FSQVLLRLDEEYNPV----LVSELLSFEKQLEHQNTTKSTVSF

Query:  GHNVTVNLVFNRNPSLSKSPSPAIHNLMVDETEARVIAGETDRHVKCAPNIDTLSFGQNPATFITTHNPNPFVATLETIGDFNWYAGSGATNHVTVDFNH
        GH  T        PS S                                        QN   F+TT+N N FV T ET+ D NWY  +GATNHVT D+++
Subjt:  GHNVTVNLVFNRNPSLSKSPSPAIHNLMVDETEARVIAGETDRHVKCAPNIDTLSFGQNPATFITTHNPNPFVATLETIGDFNWYAGSGATNHVTVDFNH

Query:  IANPTDYTGNEQVTVGNGE
        ++NP  Y+G E V VGN +
Subjt:  IANPTDYTGNEQVTVGNGE

TYK05754.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] (e-value: 1.0e-23; % identity: 32.42)
Query:  VGVQLMGFENAQSLWEAIHAPLSVQSRAEEDCLRHVFQQT---------------------------------FSQVLLRLDEEYNPVLV----------
        + +QLMGF NA+ LWEA      VQSRAEED LR +FQ T                                  SQ LL LDE YNPV+           
Subjt:  VGVQLMGFENAQSLWEAIHAPLSVQSRAEEDCLRHVFQQT---------------------------------FSQVLLRLDEEYNPVLV----------

Query:  ----SELLSFEKQLEHQNTTKSTVSFGHNVTVNLVFNRNPS-LSKSPSPAIHNLMVDETEAR------------------------------VIAGETDR
            SELL+FEK+LEHQ+T K+T +   NV VN+  NRN S   K  +   H    + ++ +                              ++      
Subjt:  ----SELLSFEKQLEHQNTTKSTVSFGHNVTVNLVFNRNPS-LSKSPSPAIHNLMVDETEAR------------------------------VIAGETDR

Query:  HVKCAPNI-------DTLSFGQNPATFITTHNPNPFVATLETIGDFNWYAGSGATNHVTVDFNHIANPTDYTGNEQVTVGNGESLCISTIGQA
            +P +          S   N    +T  + N F AT +T+ + NWY  SGATNH+TV++++++NP++Y+G E++ VGNG+SL IS IG A
Subjt:  HVKCAPNI-------DTLSFGQNPATFITTHNPNPFVATLETIGDFNWYAGSGATNHVTVDFNHIANPTDYTGNEQVTVGNGESLCISTIGQA

XP_016902197.1 PREDICTED: uncharacterized protein LOC107991581 isoform X1 [Cucumis melo] (e-value: 3.0e-23; % identity: 36.15)
Query:  MTLKVGVQLMGFENAQSLWEAIHAPLSVQSRAEEDCLRHVFQQTFSQVLLRLDEEYNPVLV--------------SELLSFEKQLEHQNTTKS-----TV
        MT  V +QLMGF N + LW+A      VQSRAEED LR + Q T       LDE YN V+V              S+LL FEK+L+HQNT K      T 
Subjt:  MTLKVGVQLMGFENAQSLWEAIHAPLSVQSRAEEDCLRHVFQQTFSQVLLRLDEEYNPVLV--------------SELLSFEKQLEHQNTTKS-----TV

Query:  SFGHNVTVNLVFN--RNPS------------------LSKSPSPAI-----------HNLMVDETEARVIAGETDRHVKCAPNIDTLSFGQNPATFITTH
        S   N+      N  RN S                  L+  P+  +           +N    E  + ++    +       +    S   NPA F++T 
Subjt:  SFGHNVTVNLVFN--RNPS------------------LSKSPSPAI-----------HNLMVDETEARVIAGETDRHVKCAPNIDTLSFGQNPATFITTH

Query:  NPNPFVATLETIGDFNWYAGSGATNHVTVDFNHIANPTDYTGNEQVTVGNGESLCISTIG
        N  PF AT +T+ D NWY  SGATNHVT + +++ NPT+Y+G E+VTVGNG  L IS +G
Subjt:  NPNPFVATLETIGDFNWYAGSGATNHVTVDFNHIANPTDYTGNEQVTVGNGESLCISTIG

XP_016902203.1 PREDICTED: uncharacterized protein LOC107991581 isoform X3 [Cucumis melo] (e-value: 1.1e-17; % identity: 34.3)
Query:  MTLKVGVQLMGFENAQSLWEAIHAPLSVQSRAEEDCLRHVFQQTFSQVLLRLDEEYNPVLV--------------SELLSFEKQLEHQNTTKS-----TV
        MT  V +QLMGF N + LW+A      VQSRAEED LR + Q T       LDE YN V+V              S+LL FEK+L+HQNT K      T 
Subjt:  MTLKVGVQLMGFENAQSLWEAIHAPLSVQSRAEEDCLRHVFQQTFSQVLLRLDEEYNPVLV--------------SELLSFEKQLEHQNTTKS-----TV

Query:  SFGHNVTVNLVFN--RNPS------------------LSKSPSPAI-----------HNLMVDETEARVIAGETDRHVKCAPNIDTLSFGQNPATFITTH
        S   N+      N  RN S                  L+  P+  +           +N    E  + ++    +       +    S   NPA F++T 
Subjt:  SFGHNVTVNLVFN--RNPS------------------LSKSPSPAI-----------HNLMVDETEARVIAGETDRHVKCAPNIDTLSFGQNPATFITTH

Query:  NPNPFVATLETIGDFNWYAGSGATNHVTVDFNHIANPTDYTG
        N  PF AT +T+ D NWY  SGATNHVT + +++ NPT+Y+G
Subjt:  NPNPFVATLETIGDFNWYAGSGATNHVTVDFNHIANPTDYTG

XP_022151683.1 uncharacterized protein LOC111019598 [Momordica charantia] (e-value: 3.1e-20; % identity: 32.96)
Query:  MTLKVGVQLMGFENAQSLWEAIHAPLSVQSRAEEDCLRHVFQQT---------------------------------FSQVLLRLDEEYNPVLV------
        M   V +Q+MGF  ++ LW A+     VQSRAE D L+ VFQQT                                  SQVL  LDEEYNP++V      
Subjt:  MTLKVGVQLMGFENAQSLWEAIHAPLSVQSRAEEDCLRHVFQQT---------------------------------FSQVLLRLDEEYNPVLV------

Query:  --------SELLSFEKQLEHQNTTKSTVSFGHNVT--VNLV----FNRNPSLSKSPSPAIHNLMVDETEARVIAGETDRHVKCAP----NIDTLSFGQNP
                +ELL++EK+LE+QN+ KS +      T  VN V    F  N   +   +    N        R   G+ +R     P    N    + G N 
Subjt:  --------SELLSFEKQLEHQNTTKSTVSFGHNVT--VNLV----FNRNPSLSKSPSPAIHNLMVDETEARVIAGETDRHVKCAP----NIDTLSFGQNP

Query:  ATFITTHNPNPFVATLETIGDFNWYAGSGATNHVTVDFNHIANPTDYTGNEQVTVGNGESLCISTIGQAD
              H+ +  V T ET+ D +WYA SGAT+HVT + N++    DY+G E V V NG  L IS IG  +
Subjt:  ATFITTHNPNPFVATLETIGDFNWYAGSGATNHVTVDFNHIANPTDYTGNEQVTVGNGESLCISTIGQAD

TrEMBL top hits (hit, e-value, % identity, alignment)
A0A1S4E1U6 uncharacterized protein LOC107991581 isoform X1 (e-value: 1.5e-23; % identity: 36.15)
Query:  MTLKVGVQLMGFENAQSLWEAIHAPLSVQSRAEEDCLRHVFQQTFSQVLLRLDEEYNPVLV--------------SELLSFEKQLEHQNTTKS-----TV
        MT  V +QLMGF N + LW+A      VQSRAEED LR + Q T       LDE YN V+V              S+LL FEK+L+HQNT K      T 
Subjt:  MTLKVGVQLMGFENAQSLWEAIHAPLSVQSRAEEDCLRHVFQQTFSQVLLRLDEEYNPVLV--------------SELLSFEKQLEHQNTTKS-----TV

Query:  SFGHNVTVNLVFN--RNPS------------------LSKSPSPAI-----------HNLMVDETEARVIAGETDRHVKCAPNIDTLSFGQNPATFITTH
        S   N+      N  RN S                  L+  P+  +           +N    E  + ++    +       +    S   NPA F++T 
Subjt:  SFGHNVTVNLVFN--RNPS------------------LSKSPSPAI-----------HNLMVDETEARVIAGETDRHVKCAPNIDTLSFGQNPATFITTH

Query:  NPNPFVATLETIGDFNWYAGSGATNHVTVDFNHIANPTDYTGNEQVTVGNGESLCISTIG
        N  PF AT +T+ D NWY  SGATNHVT + +++ NPT+Y+G E+VTVGNG  L IS +G
Subjt:  NPNPFVATLETIGDFNWYAGSGATNHVTVDFNHIANPTDYTGNEQVTVGNGESLCISTIG

A0A1S4E1V2 uncharacterized protein LOC107991581 isoform X3 (e-value: 5.4e-18; % identity: 34.3)
Query:  MTLKVGVQLMGFENAQSLWEAIHAPLSVQSRAEEDCLRHVFQQTFSQVLLRLDEEYNPVLV--------------SELLSFEKQLEHQNTTKS-----TV
        MT  V +QLMGF N + LW+A      VQSRAEED LR + Q T       LDE YN V+V              S+LL FEK+L+HQNT K      T 
Subjt:  MTLKVGVQLMGFENAQSLWEAIHAPLSVQSRAEEDCLRHVFQQTFSQVLLRLDEEYNPVLV--------------SELLSFEKQLEHQNTTKS-----TV

Query:  SFGHNVTVNLVFN--RNPS------------------LSKSPSPAI-----------HNLMVDETEARVIAGETDRHVKCAPNIDTLSFGQNPATFITTH
        S   N+      N  RN S                  L+  P+  +           +N    E  + ++    +       +    S   NPA F++T 
Subjt:  SFGHNVTVNLVFN--RNPS------------------LSKSPSPAI-----------HNLMVDETEARVIAGETDRHVKCAPNIDTLSFGQNPATFITTH

Query:  NPNPFVATLETIGDFNWYAGSGATNHVTVDFNHIANPTDYTG
        N  PF AT +T+ D NWY  SGATNHVT + +++ NPT+Y+G
Subjt:  NPNPFVATLETIGDFNWYAGSGATNHVTVDFNHIANPTDYTG

A0A5D3C373 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e-value: 5.0e-24; % identity: 32.42)
Query:  VGVQLMGFENAQSLWEAIHAPLSVQSRAEEDCLRHVFQQT---------------------------------FSQVLLRLDEEYNPVLV----------
        + +QLMGF NA+ LWEA      VQSRAEED LR +FQ T                                  SQ LL LDE YNPV+           
Subjt:  VGVQLMGFENAQSLWEAIHAPLSVQSRAEEDCLRHVFQQT---------------------------------FSQVLLRLDEEYNPVLV----------

Query:  ----SELLSFEKQLEHQNTTKSTVSFGHNVTVNLVFNRNPS-LSKSPSPAIHNLMVDETEAR------------------------------VIAGETDR
            SELL+FEK+LEHQ+T K+T +   NV VN+  NRN S   K  +   H    + ++ +                              ++      
Subjt:  ----SELLSFEKQLEHQNTTKSTVSFGHNVTVNLVFNRNPS-LSKSPSPAIHNLMVDETEAR------------------------------VIAGETDR

Query:  HVKCAPNI-------DTLSFGQNPATFITTHNPNPFVATLETIGDFNWYAGSGATNHVTVDFNHIANPTDYTGNEQVTVGNGESLCISTIGQA
            +P +          S   N    +T  + N F AT +T+ + NWY  SGATNH+TV++++++NP++Y+G E++ VGNG+SL IS IG A
Subjt:  HVKCAPNI-------DTLSFGQNPATFITTHNPNPFVATLETIGDFNWYAGSGATNHVTVDFNHIANPTDYTGNEQVTVGNGESLCISTIGQA

A0A5D3CPY2 Retrotransposon protein, putative, Ty1-copia subclass (e-value: 4.4e-20; % identity: 37.44)
Query:  MTLKVGVQLMGFENAQSLWEAIHAPLSVQSRAEEDCLRHVFQQT-------------FSQVLLRLDEEYNPV----LVSELLSFEKQLEHQNTTKSTVSF
        MT +V  QLMGF  A+ LWEAI     VQSR EED LRH FQ T                 +  L +E   +    + SELL FEK+LEHQN+ K   S 
Subjt:  MTLKVGVQLMGFENAQSLWEAIHAPLSVQSRAEEDCLRHVFQQT-------------FSQVLLRLDEEYNPV----LVSELLSFEKQLEHQNTTKSTVSF

Query:  GHNVTVNLVFNRNPSLSKSPSPAIHNLMVDETEARVIAGETDRHVKCAPNIDTLSFGQNPATFITTHNPNPFVATLETIGDFNWYAGSGATNHVTVDFNH
        GH  T        PS S                                        QN   F+TT+N N FV T ET+ D NWY  +GATNHVT D+++
Subjt:  GHNVTVNLVFNRNPSLSKSPSPAIHNLMVDETEARVIAGETDRHVKCAPNIDTLSFGQNPATFITTHNPNPFVATLETIGDFNWYAGSGATNHVTVDFNH

Query:  IANPTDYTGNEQVTVGNGE
        ++NP  Y+G E V VGN +
Subjt:  IANPTDYTGNEQVTVGNGE

A0A6J1DCW4 uncharacterized protein LOC111019598 (e-value: 1.5e-20; % identity: 32.96)
Query:  MTLKVGVQLMGFENAQSLWEAIHAPLSVQSRAEEDCLRHVFQQT---------------------------------FSQVLLRLDEEYNPVLV------
        M   V +Q+MGF  ++ LW A+     VQSRAE D L+ VFQQT                                  SQVL  LDEEYNP++V      
Subjt:  MTLKVGVQLMGFENAQSLWEAIHAPLSVQSRAEEDCLRHVFQQT---------------------------------FSQVLLRLDEEYNPVLV------

Query:  --------SELLSFEKQLEHQNTTKSTVSFGHNVT--VNLV----FNRNPSLSKSPSPAIHNLMVDETEARVIAGETDRHVKCAP----NIDTLSFGQNP
                +ELL++EK+LE+QN+ KS +      T  VN V    F  N   +   +    N        R   G+ +R     P    N    + G N 
Subjt:  --------SELLSFEKQLEHQNTTKSTVSFGHNVT--VNLV----FNRNPSLSKSPSPAIHNLMVDETEARVIAGETDRHVKCAP----NIDTLSFGQNP

Query:  ATFITTHNPNPFVATLETIGDFNWYAGSGATNHVTVDFNHIANPTDYTGNEQVTVGNGESLCISTIGQAD
              H+ +  V T ET+ D +WYA SGAT+HVT + N++    DY+G E V V NG  L IS IG  +
Subjt:  ATFITTHNPNPFVATLETIGDFNWYAGSGATNHVTVDFNHIANPTDYTGNEQVTVGNGESLCISTIGQAD

SwissProt top hits (hit, e-value, % identity, alignment)
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e-value: 2.5e-04; % identity: 33.33)
Query:  QNPATFITTHNPNPFVATLETIGDFNWYAGSGATNHVTVDFNHIANPTDYTGNEQVTVGNGESLCISTIGQADKQNSS
        Q P +  T   P   +A        NW   SGAT+H+T DFN+++    YTG + V V +G ++ IS  G       S
Subjt:  QNPATFITTHNPNPFVATLETIGDFNWYAGSGATNHVTVDFNHIANPTDYTGNEQVTVGNGESLCISTIGQADKQNSS

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e-value: 3.3e-04; % identity: 26.92)
Query:  HQNTTKSTVSFGHNVTVNLVFNRNPSLSKSPSPAIHNLMVDETEARVIAGETD------RHVKCAPNI----DTLSFGQNPATFITTHNPNPFVATLETI
        H+NT  +T    +N   N  +N N + S S  P+      D  + +   G            K  P +     T +  Q+ + F T   P   +A     
Subjt:  HQNTTKSTVSFGHNVTVNLVFNRNPSLSKSPSPAIHNLMVDETEARVIAGETD------RHVKCAPNI----DTLSFGQNPATFITTHNPNPFVATLETI

Query:  GDFNWYAGSGATNHVTVDFNHIANPTDYTGNEQVTVGNGESLCISTIGQADKQNSS
           NW   SGAT+H+T DFN+++    YTG + V + +G ++ I+  G A    SS
Subjt:  GDFNWYAGSGATNHVTVDFNHIANPTDYTGNEQVTVGNGESLCISTIGQADKQNSS

Arabidopsis top hits (hit, e-value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGACCTTAAAGGTTGGTGTACAACTAATGGGCTTTGAGAATGCCCAAAGCCTATGGGAAGCCATCCATGCTCCACTTAGTGTACAATCAAGAGCTGAGGAAGATTGCTT
ACGACATGTCTTCCAACAAACTTTTTCCCAAGTGTTGCTTAGACTGGATGAAGAATACAATCCAGTGTTGGTTTCAGAACTACTCTCATTTGAGAAGCAATTGGAACACC
AAAACACAACAAAATCGACAGTATCATTTGGCCATAATGTTACTGTTAATCTGGTGTTTAATAGAAATCCATCTTTATCTAAATCACCTAGTCCGGCAATTCACAATTTA
ATGGTGGATGAAACAGAGGCAAGGGTCATAGCAGGGGAAACCGACCGACATGTTAAGTGTGCACCAAATATTGACACACTGTCCTTTGGACAAAATCCTGCTACCTTTAT
AACCACTCATAACCCAAATCCATTTGTAGCCACACTTGAAACCATAGGAGACTTCAATTGGTATGCTGGTAGTGGTGCCACAAACCATGTCACTGTGGATTTCAATCACA
TTGCAAATCCCACAGACTACACAGGTAATGAGCAAGTCACAGTAGGTAATGGTGAAAGTTTGTGTATCTCTACTATTGGACAAGCAGACAAGCAAAATTCTAGTGAGAGG
AACACTTAA
mRNA sequence
ATGACCTTAAAGGTTGGTGTACAACTAATGGGCTTTGAGAATGCCCAAAGCCTATGGGAAGCCATCCATGCTCCACTTAGTGTACAATCAAGAGCTGAGGAAGATTGCTT
ACGACATGTCTTCCAACAAACTTTTTCCCAAGTGTTGCTTAGACTGGATGAAGAATACAATCCAGTGTTGGTTTCAGAACTACTCTCATTTGAGAAGCAATTGGAACACC
AAAACACAACAAAATCGACAGTATCATTTGGCCATAATGTTACTGTTAATCTGGTGTTTAATAGAAATCCATCTTTATCTAAATCACCTAGTCCGGCAATTCACAATTTA
ATGGTGGATGAAACAGAGGCAAGGGTCATAGCAGGGGAAACCGACCGACATGTTAAGTGTGCACCAAATATTGACACACTGTCCTTTGGACAAAATCCTGCTACCTTTAT
AACCACTCATAACCCAAATCCATTTGTAGCCACACTTGAAACCATAGGAGACTTCAATTGGTATGCTGGTAGTGGTGCCACAAACCATGTCACTGTGGATTTCAATCACA
TTGCAAATCCCACAGACTACACAGGTAATGAGCAAGTCACAGTAGGTAATGGTGAAAGTTTGTGTATCTCTACTATTGGACAAGCAGACAAGCAAAATTCTAGTGAGAGG
AACACTTAA
Protein sequence
MTLKVGVQLMGFENAQSLWEAIHAPLSVQSRAEEDCLRHVFQQTFSQVLLRLDEEYNPVLVSELLSFEKQLEHQNTTKSTVSFGHNVTVNLVFNRNPSLSKSPSPAIHNL
MVDETEARVIAGETDRHVKCAPNIDTLSFGQNPATFITTHNPNPFVATLETIGDFNWYAGSGATNHVTVDFNHIANPTDYTGNEQVTVGNGESLCISTIGQADKQNSSER
NT
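
The protein sequence listed for this gene can be checked against its CDS by translating the CDS codon by codon. The sketch below is one way to do that, assuming the standard genetic code (NCBI translation table 1) applies; `translate` and `CODON_TABLE` are hypothetical names introduced for illustration, and only the first and last codons of the CDS are used as sample input.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order for each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon."""
    # Remove line breaks and other whitespace, and normalise case.
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First seven codons and last three codons of the CDS listed on this page.
print(translate("ATGACCTTAAAGGTTGGTGTA"))
print(translate("AACACTTAA"))
```

Passing the full CDS, with its line breaks, to `translate` should give a string that can be compared directly with the protein sequence above.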