; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmUC01G007460 (gene) of Watermelon (USVL531) v1 genome

Gene IDCmUC01G007460
OrganismCitrullus mucosospermus (Watermelon (USVL531) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationCmU531Chr01:8043073..8051642
RNA-Seq ExpressionCmUC01G007460
SyntenyCmUC01G007460
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG8362995.1 hypothetical protein BUALT_BualtUnG0016000 [Buddleja alternifolia]2.9e-0745.45Show/hide
Query:  YDSTRVLTVSENAVRDEWMLDYGCSNHMIHHGRYFQHFEKLEGGKA-LENNQHCVVKDIGYVKLKL-DDLEKFMSGV
        Y+S  VLTVS N   +EW+LD GC+ HM  +  +F  F KLEGG   L NN+ C V+ IG V++K+ + +EK +  V
Subjt:  YDSTRVLTVSENAVRDEWMLDYGCSNHMIHHGRYFQHFEKLEGGKA-LENNQHCVVKDIGYVKLKL-DDLEKFMSGV

KAG8367168.1 hypothetical protein BUALT_Bualt16G0044500 [Buddleja alternifolia]8.3e-0744.16Show/hide
Query:  YDSTRVLTVSENAVRDEWMLDYGCSNHMIHHGRYFQHFEKLEGGKA-LENNQHCVVKDIGYVKLKL-DDLEKFMSGV
        Y+S  VLTVS N   +EW+L+ GC+ HM  +  +F  F KLEGG   L NN+ C V+ IG V++K+ + +EK +  V
Subjt:  YDSTRVLTVSENAVRDEWMLDYGCSNHMIHHGRYFQHFEKLEGGKA-LENNQHCVVKDIGYVKLKL-DDLEKFMSGV

KAG8378472.1 hypothetical protein BUALT_Bualt08G0140700 [Buddleja alternifolia]2.9e-0745.45Show/hide
Query:  YDSTRVLTVSENAVRDEWMLDYGCSNHMIHHGRYFQHFEKLEGGKA-LENNQHCVVKDIGYVKLKL-DDLEKFMSGV
        Y+S  VLTVS N   +EW+LD GC+ HM  +  +F  F KLEGG   L NN+ C V+ IG V++K+ + +EK +  V
Subjt:  YDSTRVLTVSENAVRDEWMLDYGCSNHMIHHGRYFQHFEKLEGGKA-LENNQHCVVKDIGYVKLKL-DDLEKFMSGV

KHN13665.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Glycine soja]1.6e-0537.35Show/hide
Query:  RYESKGRLQMCMHGEYDSTRVLTVSENAVRDEWMLDYGCSNHMIHHGRYFQHFEKLEGGKAL-ENNQHCVVKDIGYVKLKLDD
        +Y  +  + +   G Y+S  VL++S     +EW+LD GCS HM  +  +F  +++++GGK L  NN  C V  IG +KLK+ D
Subjt:  RYESKGRLQMCMHGEYDSTRVLTVSENAVRDEWMLDYGCSNHMIHHGRYFQHFEKLEGGKAL-ENNQHCVVKDIGYVKLKLDD

RVW35472.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]2.1e-0532.52Show/hide
Query:  VRGTEKTYGLKCKDFEIKYEKKDFEVLVVRRRYESKGRLQMCMHGEYDSTRVLTVSENAVRDEWMLDYGCSNHMIHHGRYFQHFEKLEGGKA-LENNQHC
        +RG E  +   C D      KK            ++G   + + G YDS  VL V+E     EW+LD  CS HM     +F+ F++++GG   L NN+HC
Subjt:  VRGTEKTYGLKCKDFEIKYEKKDFEVLVVRRRYESKGRLQMCMHGEYDSTRVLTVSENAVRDEWMLDYGCSNHMIHHGRYFQHFEKLEGGKA-LENNQHC

Query:  VVKDIGYVKLK-LDDLEKFMSGV
         +  IG V++K  D +E+ +  V
Subjt:  VVKDIGYVKLK-LDDLEKFMSGV

TrEMBL top hitse value%identityAlignment
A0A438DJ20 Retrovirus-related Pol polyprotein from transposon TNT 1-949.9e-0632.52Show/hide
Query:  VRGTEKTYGLKCKDFEIKYEKKDFEVLVVRRRYESKGRLQMCMHGEYDSTRVLTVSENAVRDEWMLDYGCSNHMIHHGRYFQHFEKLEGGKA-LENNQHC
        +RG E  +   C D      KK            ++G   + + G YDS  VL V+E     EW+LD  CS HM     +F+ F++++GG   L NN+HC
Subjt:  VRGTEKTYGLKCKDFEIKYEKKDFEVLVVRRRYESKGRLQMCMHGEYDSTRVLTVSENAVRDEWMLDYGCSNHMIHHGRYFQHFEKLEGGKA-LENNQHC

Query:  VVKDIGYVKLK-LDDLEKFMSGV
         +  IG V++K  D +E+ +  V
Subjt:  VVKDIGYVKLK-LDDLEKFMSGV

A0A438ITF4 Retrovirus-related Pol polyprotein from transposon TNT 1-941.3e-0529.75Show/hide
Query:  RGTEKTYGLKC----KDFEIKYEKKDFEVLVVRRRYESKGRLQMCMHGEYDSTRVLTVSENAVRDEWMLDYGCSNHMIHHGRYFQHFEKLEGGKA-LENN
        R   KT   KC    K+   K +  D     V++   ++G   + + G YDS  VL V+E     EW+LD GCS HM     +F+ F++ +GG   L NN
Subjt:  RGTEKTYGLKC----KDFEIKYEKKDFEVLVVRRRYESKGRLQMCMHGEYDSTRVLTVSENAVRDEWMLDYGCSNHMIHHGRYFQHFEKLEGGKA-LENN

Query:  QHCVVKDIGYVKLK--------------LDDLEKFMSGVGILQTTYSDASFKSSPQEI
        +HC +   G V++K              + +L++ +  +G+L    S  +FKS P  +
Subjt:  QHCVVKDIGYVKLK--------------LDDLEKFMSGVGILQTTYSDASFKSSPQEI

A0A5D3CFX6 Retrovirus-related Pol polyprotein from transposon TNT 1-941.3e-0541.18Show/hide
Query:  YDSTRVLTVSENAVRDEWMLDYGCSNHMIHHGRYFQHFEKLEGGKA-LENNQHCVVKDIGYVKLKLDD
        Y+S  VL VS   ++D W++D GC+ HM HH  +  +F+K +GGK  L +N  C VK  G V++   D
Subjt:  YDSTRVLTVSENAVRDEWMLDYGCSNHMIHHGRYFQHFEKLEGGKA-LENNQHCVVKDIGYVKLKLDD

A0A6J1JRG8 DNA-directed RNA polymerase subunit1.3e-0596.43Show/hide
Query:  MEEAPCCSSILDGEIVGIRFSLANGQEI
        MEEAPCCSSILDGEIVGIRF+LANGQEI
Subjt:  MEEAPCCSSILDGEIVGIRFSLANGQEI

A0A803P381 Uncharacterized protein1.3e-0536.27Show/hide
Query:  KDFEIKYEKKDFEVLVVRRRYESKGRLQMCMHGEYDSTRVLTVSENAVRDEWMLDYGCSNHMIHHGRYFQHFEKLEGGKA-LENNQHCVVKDIGYVKLKL
        K+   + E +  +  + R + + +G +     G Y+S  VL VS     D W+LD GCS HM      F  F+KL+GG   L +N+ C V  IG V+LKL
Subjt:  KDFEIKYEKKDFEVLVVRRRYESKGRLQMCMHGEYDSTRVLTVSENAVRDEWMLDYGCSNHMIHHGRYFQHFEKLEGGKA-LENNQHCVVKDIGYVKLKL

Query:  DD
         D
Subjt:  DD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATTTAGAGTTAAACTATGCTCAAATCAGTGAGTTAGTGAGAGGGACAGAAAAGACTTATGGATTAAAATGTAAAGATTTTGAAATCAAATATGAGAAGAAA
GATTTTGAAGTTTTAGTGGTTAGAAGGAGATATGAATCTAAAGGAAGATTACAAATGTGCATGCATGGGGAGTATGATTCAACAAGGGTTTTAACTGTGTCTGAA
AATGCAGTTAGGGATGAGTGGATGCTTGATTATGGATGTTCCAATCATATGATTCATCATGGACGTTACTTTCAACATTTTGAGAAACTTGAAGGTGGGAAGGCT
TTGGAGAATAATCAACATTGTGTTGTAAAAGACATAGGATATGTGAAGTTGAAACTCGATGATTTGGAGAAGTTCATGTCTGGTGTTGGGATCTTACAAACAACT
TATAGTGATGCCTCCTTCAAATCTTCTCCGCAAGAGATAGACAAAGACAGACAGAGAAGAGTTGTTAGACCACTAACAAAATGTGATGAAGGTTTTGAACTCCTT
CGAGTTCTTCCATTCAGGAAAAAAAAAAAAAAAAAGAAAAGAAAAGGAAAAAGAAAGAAAGAAAGAAAGAAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAA
GAAGAAGACCGCTGTAACCATTCAGATCTAACATTGTATTTTTTTTTGTTTTGTATTACTTTGCAGCATTATATAAAAGTTACTGTTAAAGGAATGGAGGAGGCA
CCCTGTTGCTCATCTATTTTAGATGGGGAAATAGTGGGAATAAGATTTTCATTGGCCAATGGTCAAGAAATTGTAAGTAGAAGCTGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAATTTAGAGTTAAACTATGCTCAAATCAGTGAGTTAGTGAGAGGGACAGAAAAGACTTATGGATTAAAATGTAAAGATTTTGAAATCAAATATGAGAAGAAA
GATTTTGAAGTTTTAGTGGTTAGAAGGAGATATGAATCTAAAGGAAGATTACAAATGTGCATGCATGGGGAGTATGATTCAACAAGGGTTTTAACTGTGTCTGAA
AATGCAGTTAGGGATGAGTGGATGCTTGATTATGGATGTTCCAATCATATGATTCATCATGGACGTTACTTTCAACATTTTGAGAAACTTGAAGGTGGGAAGGCT
TTGGAGAATAATCAACATTGTGTTGTAAAAGACATAGGATATGTGAAGTTGAAACTCGATGATTTGGAGAAGTTCATGTCTGGTGTTGGGATCTTACAAACAACT
TATAGTGATGCCTCCTTCAAATCTTCTCCGCAAGAGATAGACAAAGACAGACAGAGAAGAGTTGTTAGACCACTAACAAAATGTGATGAAGGTTTTGAACTCCTT
CGAGTTCTTCCATTCAGGAAAAAAAAAAAAAAAAAGAAAAGAAAAGGAAAAAGAAAGAAAGAAAGAAAGAAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAA
GAAGAAGACCGCTGTAACCATTCAGATCTAACATTGTATTTTTTTTTGTTTTGTATTACTTTGCAGCATTATATAAAAGTTACTGTTAAAGGAATGGAGGAGGCA
CCCTGTTGCTCATCTATTTTAGATGGGGAAATAGTGGGAATAAGATTTTCATTGGCCAATGGTCAAGAAATTGTAAGTAGAAGCTGTTAA
Protein sequenceShow/hide protein sequence
MNLELNYAQISELVRGTEKTYGLKCKDFEIKYEKKDFEVLVVRRRYESKGRLQMCMHGEYDSTRVLTVSENAVRDEWMLDYGCSNHMIHHGRYFQHFEKLEGGKA
LENNQHCVVKDIGYVKLKLDDLEKFMSGVGILQTTYSDASFKSSPQEIDKDRQRRVVRPLTKCDEGFELLRVLPFRKKKKKKKRKGKRKKERKKEEEEEEEEEEE
EEDRCNHSDLTLYFFLFCITLQHYIKVTVKGMEEAPCCSSILDGEIVGIRFSLANGQEIVSRSC