; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG08G004460 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG08G004460
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationCG_Chr08:13969911..13973962
RNA-Seq ExpressionClCG08G004460
SyntenyClCG08G004460
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK05754.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]4.9e-1131.03Show/hide
Query:  VAAQVMGFDEAKLLWDAIQEYFGIQSRSS--------------------------------------------ISHIFAGLDEEYNPIVCVIQVKGNMS-
        +A Q+MGF  AK LW+A Q+ FG+QSR+                                             IS    GLDE YNP++ VIQ K  +S 
Subjt:  VAAQVMGFDEAKLLWDAIQEYFGIQSRSS--------------------------------------------ISHIFAGLDEEYNPIVCVIQVKGNMS-

Query:  -------------------------LNQLFVNYAATDKKGNFNPLSNRWGYNNNNNSNKGGGGNRNRGRNRYSPYPQSNKPTCQICRKYGHATDVCYQSY
                                 + Q  VN A      +F   SN   + NN N+++G  G  N GR R     + NKPTCQ+C KYGH+  VCY  +
Subjt:  -------------------------LNQLFVNYAATDKKGNFNPLSNRWGYNNNNNSNKGGGGNRNRGRNRYSPYPQSNKPTCQICRKYGHATDVCYQSY

Query:  NKE
        NKE
Subjt:  NKE

XP_016902197.1 PREDICTED: uncharacterized protein LOC107991581 isoform X1 [Cucumis melo]3.1e-1336.14Show/hide
Query:  GWLYNSMTLEVAAQVMGFDEAKLLWDAIQEYFGIQSRSS-------ISHIFAGLDEEYNPIVCVIQVKGNMS---------LNQLFVNYAATDKKGNFN-
        GWLYNSMT +VA Q+MGF   + LWDA Q++FG+QSR+        +     GLDE YN ++ VIQ K ++S         + +  + +  T KK   N 
Subjt:  GWLYNSMTLEVAAQVMGFDEAKLLWDAIQEYFGIQSRSS-------ISHIFAGLDEEYNPIVCVIQVKGNMS---------LNQLFVNYAATDKKGNFN-

Query:  ------PLSNRWGYNNNNNSNKGGGGNRNRGRNRYSPYPQSNKPTCQICRKYGHATDVCYQSYNKE
               ++ R+  N   N +       NR          +N PTCQ+C KYGH+  VCY  +NKE
Subjt:  ------PLSNRWGYNNNNNSNKGGGGNRNRGRNRYSPYPQSNKPTCQICRKYGHATDVCYQSYNKE

XP_016902203.1 PREDICTED: uncharacterized protein LOC107991581 isoform X3 [Cucumis melo]3.1e-1336.14Show/hide
Query:  GWLYNSMTLEVAAQVMGFDEAKLLWDAIQEYFGIQSRSS-------ISHIFAGLDEEYNPIVCVIQVKGNMS---------LNQLFVNYAATDKKGNFN-
        GWLYNSMT +VA Q+MGF   + LWDA Q++FG+QSR+        +     GLDE YN ++ VIQ K ++S         + +  + +  T KK   N 
Subjt:  GWLYNSMTLEVAAQVMGFDEAKLLWDAIQEYFGIQSRSS-------ISHIFAGLDEEYNPIVCVIQVKGNMS---------LNQLFVNYAATDKKGNFN-

Query:  ------PLSNRWGYNNNNNSNKGGGGNRNRGRNRYSPYPQSNKPTCQICRKYGHATDVCYQSYNKE
               ++ R+  N   N +       NR          +N PTCQ+C KYGH+  VCY  +NKE
Subjt:  ------PLSNRWGYNNNNNSNKGGGGNRNRGRNRYSPYPQSNKPTCQICRKYGHATDVCYQSYNKE

XP_038875135.1 uncharacterized protein LOC120067668 [Benincasa hispida]8.6e-1639.86Show/hide
Query:  GWLYNSMTLEVAAQVMGFDEAKLLWDAIQEYFGIQSRSSISHIFAGLDEEYNPIVCVIQVKGNMSLNQLFVNYAATDKKGNFNPLSNRWGYNNNNNSNKG
        GWLYNSM  EVA Q+MGF+ AK LWDAI   FG+QSR+       G  +  + I                  + A+     F  ++++  Y+NN      
Subjt:  GWLYNSMTLEVAAQVMGFDEAKLLWDAIQEYFGIQSRSSISHIFAGLDEEYNPIVCVIQVKGNMSLNQLFVNYAATDKKGNFNPLSNRWGYNNNNNSNKG

Query:  GGGNRNRGRNRYSPYPQSNKPTCQICRKYGHATDVCYQSYNKE
        G G R +GR R       N PTCQ+C KYGH+TDVCYQ +N+E
Subjt:  GGGNRNRGRNRYSPYPQSNKPTCQICRKYGHATDVCYQSYNKE

XP_038902487.1 uncharacterized protein LOC120089143 [Benincasa hispida]2.6e-1231.82Show/hide
Query:  GVLSLAPPP--TKVVRTFPVLSLLQLSLPKSRVLKIQSSLRVARFGWLYNSMTLEVAAQVMGFDEAKLLWDAIQEYFGIQSRSS----------------
        G+ SL P    T    + PVL +      +SR +  Q  L     GWLYN MT EVA QVMG++  K LW AIQE FG+QSR+                 
Subjt:  GVLSLAPPP--TKVVRTFPVLSLLQLSLPKSRVLKIQSSLRVARFGWLYNSMTLEVAAQVMGFDEAKLLWDAIQEYFGIQSRSS----------------

Query:  ----------------------------ISHIFAGLDEEYNPIVCVIQVKGNMSLNQLFVNYAATDKKGNFNPLSNRWGYNNNNNSNKGGGGNRNRGRNR
                                    +S +  GLDEE+NP V  IQ +  +S   +     A +K+               NN+N+ GG  RNRGR R
Subjt:  ----------------------------ISHIFAGLDEEYNPIVCVIQVKGNMSLNQLFVNYAATDKKGNFNPLSNRWGYNNNNNSNKGGGGNRNRGRNR

Query:  YSPYPQSNKPTCQICRKYGH
        ++     N+PTCQ+C +Y +
Subjt:  YSPYPQSNKPTCQICRKYGH

TrEMBL top hitse value%identityAlignment
A0A1S4E1U6 uncharacterized protein LOC107991581 isoform X11.5e-1336.14Show/hide
Query:  GWLYNSMTLEVAAQVMGFDEAKLLWDAIQEYFGIQSRSS-------ISHIFAGLDEEYNPIVCVIQVKGNMS---------LNQLFVNYAATDKKGNFN-
        GWLYNSMT +VA Q+MGF   + LWDA Q++FG+QSR+        +     GLDE YN ++ VIQ K ++S         + +  + +  T KK   N 
Subjt:  GWLYNSMTLEVAAQVMGFDEAKLLWDAIQEYFGIQSRSS-------ISHIFAGLDEEYNPIVCVIQVKGNMS---------LNQLFVNYAATDKKGNFN-

Query:  ------PLSNRWGYNNNNNSNKGGGGNRNRGRNRYSPYPQSNKPTCQICRKYGHATDVCYQSYNKE
               ++ R+  N   N +       NR          +N PTCQ+C KYGH+  VCY  +NKE
Subjt:  ------PLSNRWGYNNNNNSNKGGGGNRNRGRNRYSPYPQSNKPTCQICRKYGHATDVCYQSYNKE

A0A1S4E1V2 uncharacterized protein LOC107991581 isoform X31.5e-1336.14Show/hide
Query:  GWLYNSMTLEVAAQVMGFDEAKLLWDAIQEYFGIQSRSS-------ISHIFAGLDEEYNPIVCVIQVKGNMS---------LNQLFVNYAATDKKGNFN-
        GWLYNSMT +VA Q+MGF   + LWDA Q++FG+QSR+        +     GLDE YN ++ VIQ K ++S         + +  + +  T KK   N 
Subjt:  GWLYNSMTLEVAAQVMGFDEAKLLWDAIQEYFGIQSRSS-------ISHIFAGLDEEYNPIVCVIQVKGNMS---------LNQLFVNYAATDKKGNFN-

Query:  ------PLSNRWGYNNNNNSNKGGGGNRNRGRNRYSPYPQSNKPTCQICRKYGHATDVCYQSYNKE
               ++ R+  N   N +       NR          +N PTCQ+C KYGH+  VCY  +NKE
Subjt:  ------PLSNRWGYNNNNNSNKGGGGNRNRGRNRYSPYPQSNKPTCQICRKYGHATDVCYQSYNKE

A0A5A7SIT7 Uncharacterized protein5.3e-1131.4Show/hide
Query:  GWLYNSMTLEVAAQVMGFDEAKLLWDAIQEYFGIQSRSS--------------------------------------------ISHIFAGLDEEYNPIVC
        GWLYNSMT +VA Q+MGF   + LWDA Q++FG+QSR+                                             IS +  GLDE YN ++ 
Subjt:  GWLYNSMTLEVAAQVMGFDEAKLLWDAIQEYFGIQSRSS--------------------------------------------ISHIFAGLDEEYNPIVC

Query:  VIQVKGNMS---------LNQLFVNYAATD----KKGNFNP-----LSNRWGYN--NNNNSNKGGGGNRNRGRNRYSPYPQSNKPTCQICRKYGHATDVC
        VIQ K ++S         + +  + +  T     KKGN        ++ R+  N   N+++ K  G NR     +      +N PTCQ+C KYGH+  VC
Subjt:  VIQVKGNMS---------LNQLFVNYAATD----KKGNFNP-----LSNRWGYN--NNNNSNKGGGGNRNRGRNRYSPYPQSNKPTCQICRKYGHATDVC

Query:  YQSYNKE
        Y  +NKE
Subjt:  YQSYNKE

A0A5D3C373 Retrovirus-related Pol polyprotein from transposon TNT 1-942.4e-1131.03Show/hide
Query:  VAAQVMGFDEAKLLWDAIQEYFGIQSRSS--------------------------------------------ISHIFAGLDEEYNPIVCVIQVKGNMS-
        +A Q+MGF  AK LW+A Q+ FG+QSR+                                             IS    GLDE YNP++ VIQ K  +S 
Subjt:  VAAQVMGFDEAKLLWDAIQEYFGIQSRSS--------------------------------------------ISHIFAGLDEEYNPIVCVIQVKGNMS-

Query:  -------------------------LNQLFVNYAATDKKGNFNPLSNRWGYNNNNNSNKGGGGNRNRGRNRYSPYPQSNKPTCQICRKYGHATDVCYQSY
                                 + Q  VN A      +F   SN   + NN N+++G  G  N GR R     + NKPTCQ+C KYGH+  VCY  +
Subjt:  -------------------------LNQLFVNYAATDKKGNFNPLSNRWGYNNNNNSNKGGGGNRNRGRNRYSPYPQSNKPTCQICRKYGHATDVCYQSY

Query:  NKE
        NKE
Subjt:  NKE

A0A803PM38 Uncharacterized protein9.9e-1028.57Show/hide
Query:  GWLYNSMTLEVAAQVMGFDEAKLLWDAIQEYFGIQSRSS--------------------------------------------ISHIFAGLDEEYNPIVC
        GWLY SMT  +A +VMG D +  LW A++E FG  S++                                             +S++ +GLD EY P+V 
Subjt:  GWLYNSMTLEVAAQVMGFDEAKLLWDAIQEYFGIQSRSS--------------------------------------------ISHIFAGLDEEYNPIVC

Query:  VIQVKGNMSLNQLF--------------------------VNYAATDKKGNFNPLSNRWGYNNNN---NSNKGGGGNRNRGRNRYSPYPQSNKPTCQICR
        +I+ +G+ +  QL                           +N +A+      +P +NR  +NNNN   +SN  G  NR+RGR   +  P   +PTCQ+C 
Subjt:  VIQVKGNMSLNQLF--------------------------VNYAATDKKGNFNPLSNRWGYNNNN---NSNKGGGGNRNRGRNRYSPYPQSNKPTCQICR

Query:  KYGHATDVCY
        KYGH+   CY
Subjt:  KYGHATDVCY

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTCCAAAATATATTGTGTTTTATAAAAACTTTGAAAAGGCACCCGAGACTCAGGGACTTGGAGAAGGCCTAAGGGGGCTCGTAGAAGGCAAATGGGTAACC
TGGATATGGAAATGGGACCCTAATGCTCAGGGATTTGGAGAAGGCATTGTGGGCGGCTTGGATTATAATACCTATGTGCACATAGAGGAATATACGACGCTGATG
TGTAGCATGACAGTTCCCGTTGAGGAGCGAAGTCGAGGTTTGTCAAGAGTTAATGGTATTACTGTCATAGGAAGGGTAGATATGAAAAGTGAGGTAAATGTGATA
GTGGGATTTGAGACAGGGAAGTCGATGACTACTGTCCTCACACTCCGTCTCGGGCTATGGGAGGATAGTTCGGGGTGGGGTGTGACATCTAGTACTACCAAATCT
GGATGTTGGGGTGCGGCAATGTGGTTGTGGCTATGCGTTGCAAGGAGGAGTAGTCGTGAACAGACATGCGTTGAAGTCGCACCGCGTCGAAGACAGAGTGGTCTT
GACGCGACGAAAATAGTGGTGTATGACGGGGATTTGTTGAAGGCATTGGAGGAGAAGTTTAACATTAGCGCCTCGCATCGAGGTGAGAATCCTGCGCTAGCCTGC
GGTCGGTTCCATATTTCAGACAAACTTCAGGAGAAGATTGAGGAAGGTGTTCTTGTGGATTTTGGGCGAAATTCTACTCTCGCACAGACTGAACAGTGGTTGATA
TTTGGAGTAGAACGTATAGAGGAAGCCGAGTTTAGAGCTGAAATTTGGAGGTCAATTGAAGTAAGAACAAGAGGAAAGGAAGAATCACAGTGCAGGGTGCCGCTA
AAACCAAGTATCGGTAAGGTAACGTTGGTGTCGCTCAATCTAGTATCGCTAGTAACAGACTCGAGGAGGCTCAGTGCTCAAGGACATGAGAAGGCACGTGAGTTT
CCCCGGGACCCAGTATTGAAGGACATGGAGAAGGGTGATTTGGCTGGTGCGTCGAGTTTGCTCCGCATTGAGCCAAGGATGATTTCGATGTTGGGTGTGTTGAGC
CTTGCTCCACCCCCAACTAAGGTAGTGAGGACATTCCCGGTGCTTAGCCTACTTCAGCTCTCCCTGCCTAAGTCTCGAGTTCTCAAAATTCAGTCAAGTCTGAGA
GTTGCACGTTTCGGTTGGCTGTATAATTCCATGACGCTCGAAGTAGCAGCTCAAGTCATGGGCTTTGATGAAGCAAAGCTTTTATGGGATGCCATTCAAGAATAT
TTTGGCATTCAATCAAGGAGTTCCATCTCACATATTTTCGCGGGACTTGATGAGGAGTATAATCCTATTGTCTGTGTTATTCAGGTTAAAGGAAACATGTCATTG
AACCAGTTGTTTGTCAACTATGCTGCAACAGATAAGAAGGGAAATTTTAATCCATTATCCAACCGATGGGGATACAACAACAACAACAATTCAAATAAAGGTGGA
GGCGGAAACAGAAATAGAGGAAGAAATCGTTATTCTCCTTACCCTCAATCAAATAAACCAACATGCCAAATATGCAGAAAGTATGGACATGCTACTGATGTCTGT
TATCAGAGTTATAACAAAGAAGATGATAACTGA
mRNA sequenceShow/hide mRNA sequence
ATGTTTCCAAAATATATTGTGTTTTATAAAAACTTTGAAAAGGCACCCGAGACTCAGGGACTTGGAGAAGGCCTAAGGGGGCTCGTAGAAGGCAAATGGGTAACC
TGGATATGGAAATGGGACCCTAATGCTCAGGGATTTGGAGAAGGCATTGTGGGCGGCTTGGATTATAATACCTATGTGCACATAGAGGAATATACGACGCTGATG
TGTAGCATGACAGTTCCCGTTGAGGAGCGAAGTCGAGGTTTGTCAAGAGTTAATGGTATTACTGTCATAGGAAGGGTAGATATGAAAAGTGAGGTAAATGTGATA
GTGGGATTTGAGACAGGGAAGTCGATGACTACTGTCCTCACACTCCGTCTCGGGCTATGGGAGGATAGTTCGGGGTGGGGTGTGACATCTAGTACTACCAAATCT
GGATGTTGGGGTGCGGCAATGTGGTTGTGGCTATGCGTTGCAAGGAGGAGTAGTCGTGAACAGACATGCGTTGAAGTCGCACCGCGTCGAAGACAGAGTGGTCTT
GACGCGACGAAAATAGTGGTGTATGACGGGGATTTGTTGAAGGCATTGGAGGAGAAGTTTAACATTAGCGCCTCGCATCGAGGTGAGAATCCTGCGCTAGCCTGC
GGTCGGTTCCATATTTCAGACAAACTTCAGGAGAAGATTGAGGAAGGTGTTCTTGTGGATTTTGGGCGAAATTCTACTCTCGCACAGACTGAACAGTGGTTGATA
TTTGGAGTAGAACGTATAGAGGAAGCCGAGTTTAGAGCTGAAATTTGGAGGTCAATTGAAGTAAGAACAAGAGGAAAGGAAGAATCACAGTGCAGGGTGCCGCTA
AAACCAAGTATCGGTAAGGTAACGTTGGTGTCGCTCAATCTAGTATCGCTAGTAACAGACTCGAGGAGGCTCAGTGCTCAAGGACATGAGAAGGCACGTGAGTTT
CCCCGGGACCCAGTATTGAAGGACATGGAGAAGGGTGATTTGGCTGGTGCGTCGAGTTTGCTCCGCATTGAGCCAAGGATGATTTCGATGTTGGGTGTGTTGAGC
CTTGCTCCACCCCCAACTAAGGTAGTGAGGACATTCCCGGTGCTTAGCCTACTTCAGCTCTCCCTGCCTAAGTCTCGAGTTCTCAAAATTCAGTCAAGTCTGAGA
GTTGCACGTTTCGGTTGGCTGTATAATTCCATGACGCTCGAAGTAGCAGCTCAAGTCATGGGCTTTGATGAAGCAAAGCTTTTATGGGATGCCATTCAAGAATAT
TTTGGCATTCAATCAAGGAGTTCCATCTCACATATTTTCGCGGGACTTGATGAGGAGTATAATCCTATTGTCTGTGTTATTCAGGTTAAAGGAAACATGTCATTG
AACCAGTTGTTTGTCAACTATGCTGCAACAGATAAGAAGGGAAATTTTAATCCATTATCCAACCGATGGGGATACAACAACAACAACAATTCAAATAAAGGTGGA
GGCGGAAACAGAAATAGAGGAAGAAATCGTTATTCTCCTTACCCTCAATCAAATAAACCAACATGCCAAATATGCAGAAAGTATGGACATGCTACTGATGTCTGT
TATCAGAGTTATAACAAAGAAGATGATAACTGA
Protein sequenceShow/hide protein sequence
MFPKYIVFYKNFEKAPETQGLGEGLRGLVEGKWVTWIWKWDPNAQGFGEGIVGGLDYNTYVHIEEYTTLMCSMTVPVEERSRGLSRVNGITVIGRVDMKSEVNVI
VGFETGKSMTTVLTLRLGLWEDSSGWGVTSSTTKSGCWGAAMWLWLCVARRSSREQTCVEVAPRRRQSGLDATKIVVYDGDLLKALEEKFNISASHRGENPALAC
GRFHISDKLQEKIEEGVLVDFGRNSTLAQTEQWLIFGVERIEEAEFRAEIWRSIEVRTRGKEESQCRVPLKPSIGKVTLVSLNLVSLVTDSRRLSAQGHEKAREF
PRDPVLKDMEKGDLAGASSLLRIEPRMISMLGVLSLAPPPTKVVRTFPVLSLLQLSLPKSRVLKIQSSLRVARFGWLYNSMTLEVAAQVMGFDEAKLLWDAIQEY
FGIQSRSSISHIFAGLDEEYNPIVCVIQVKGNMSLNQLFVNYAATDKKGNFNPLSNRWGYNNNNNSNKGGGGNRNRGRNRYSPYPQSNKPTCQICRKYGHATDVC
YQSYNKEDDN