; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc08g0229121 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc08g0229121
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionReverse transcriptase
Genome locationCMiso1.1chr08:23859496..23859969
RNA-Seq ExpressionCmc08g0229121
SyntenyCmc08g0229121
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0040693.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]6.3e-6884.08Show/hide
Query:  MSFGLTNALVVSMDLMNKVFKDFLDTFVIVFIDDILVYSKTEVGHEEHLHQVLETLRANKLYAKFSKCEFWLKKVFFLSHVVSSEGVSVDRTKIEAVTSW
        MSFGLTNA  V MDLMN+VFKDFLD+FVIVFIDDI+VYSKTEV HEEHLHQVLETLRANKLYAKFSKCEFWL+KV FL HVVSSEGVSVD TKIEAVT+W
Subjt:  MSFGLTNALVVSMDLMNKVFKDFLDTFVIVFIDDILVYSKTEVGHEEHLHQVLETLRANKLYAKFSKCEFWLKKVFFLSHVVSSEGVSVDRTKIEAVTSW

Query:  PRPFTVSKVRSFLDLIGYYKRFMEDFSRIASPLNQLTRTGTHFVWSPTCESSFQDLK
        PRP TVS++RSFL L GYY+RF+EDFSRIASPL QLTR GT FVWSP CE SFQ+LK
Subjt:  PRPFTVSKVRSFLDLIGYYKRFMEDFSRIASPLNQLTRTGTHFVWSPTCESSFQDLK

KAA0051719.1 pol protein [Cucumis melo var. makuwa]8.3e-6884.08Show/hide
Query:  MSFGLTNALVVSMDLMNKVFKDFLDTFVIVFIDDILVYSKTEVGHEEHLHQVLETLRANKLYAKFSKCEFWLKKVFFLSHVVSSEGVSVDRTKIEAVTSW
        MSFGLTNA  V MDLMN+VFKDFLD+FVIVFIDDIL+YSKTE  HEEHLHQVLETLRANKLYAKFSKCEFWL+KV FL HVVSSEGVSVD  KIEAVT+W
Subjt:  MSFGLTNALVVSMDLMNKVFKDFLDTFVIVFIDDILVYSKTEVGHEEHLHQVLETLRANKLYAKFSKCEFWLKKVFFLSHVVSSEGVSVDRTKIEAVTSW

Query:  PRPFTVSKVRSFLDLIGYYKRFMEDFSRIASPLNQLTRTGTHFVWSPTCESSFQDLK
        PRP TVS++RSFL L GYY+RF+EDFSRIASPL QLTR GT FVWSPTCESSFQ LK
Subjt:  PRPFTVSKVRSFLDLIGYYKRFMEDFSRIASPLNQLTRTGTHFVWSPTCESSFQDLK

TYK01306.1 pol protein [Cucumis melo var. makuwa]8.3e-6884.08Show/hide
Query:  MSFGLTNALVVSMDLMNKVFKDFLDTFVIVFIDDILVYSKTEVGHEEHLHQVLETLRANKLYAKFSKCEFWLKKVFFLSHVVSSEGVSVDRTKIEAVTSW
        MSFGLTNA  V MDLMN+VFKDFLD+FVIVFIDDIL+YSKTE  HEEHLHQVLETLRANKLYAKFSKCEFWL+KV FL HVVSSEGVSVD  KIEAVT+W
Subjt:  MSFGLTNALVVSMDLMNKVFKDFLDTFVIVFIDDILVYSKTEVGHEEHLHQVLETLRANKLYAKFSKCEFWLKKVFFLSHVVSSEGVSVDRTKIEAVTSW

Query:  PRPFTVSKVRSFLDLIGYYKRFMEDFSRIASPLNQLTRTGTHFVWSPTCESSFQDLK
        PRP TVS++RSFL L GYY+RF+EDFSRIASPL QLTR GT FVWSPTCESSFQ LK
Subjt:  PRPFTVSKVRSFLDLIGYYKRFMEDFSRIASPLNQLTRTGTHFVWSPTCESSFQDLK

TYK05193.1 pol protein [Cucumis melo var. makuwa]3.7e-6884.08Show/hide
Query:  MSFGLTNALVVSMDLMNKVFKDFLDTFVIVFIDDILVYSKTEVGHEEHLHQVLETLRANKLYAKFSKCEFWLKKVFFLSHVVSSEGVSVDRTKIEAVTSW
        MSFGLTNA  V MDLMN+VFKDFLD+FVIVFIDDIL+YSKTE  HEEHLHQVLETLRANKLYAKFSKCEFWL+KV FL HVVSSEGVSVD TKIEAVT+W
Subjt:  MSFGLTNALVVSMDLMNKVFKDFLDTFVIVFIDDILVYSKTEVGHEEHLHQVLETLRANKLYAKFSKCEFWLKKVFFLSHVVSSEGVSVDRTKIEAVTSW

Query:  PRPFTVSKVRSFLDLIGYYKRFMEDFSRIASPLNQLTRTGTHFVWSPTCESSFQDLK
        PRP TVS++RSFL L GYY+RF+EDFSRIASPL QLTR GT FVWSP CESSFQ+LK
Subjt:  PRPFTVSKVRSFLDLIGYYKRFMEDFSRIASPLNQLTRTGTHFVWSPTCESSFQDLK

TYK18480.1 pol protein [Cucumis melo var. makuwa]3.7e-6884.08Show/hide
Query:  MSFGLTNALVVSMDLMNKVFKDFLDTFVIVFIDDILVYSKTEVGHEEHLHQVLETLRANKLYAKFSKCEFWLKKVFFLSHVVSSEGVSVDRTKIEAVTSW
        MSFGLTNA  V MDLMN+VFKDFLD+FVIVFIDDIL+YSKTE  HEEHLHQVLETLRANKLYAKFSKCEFWL+KV FL HVVSSEGVSVD TKIEAVT+W
Subjt:  MSFGLTNALVVSMDLMNKVFKDFLDTFVIVFIDDILVYSKTEVGHEEHLHQVLETLRANKLYAKFSKCEFWLKKVFFLSHVVSSEGVSVDRTKIEAVTSW

Query:  PRPFTVSKVRSFLDLIGYYKRFMEDFSRIASPLNQLTRTGTHFVWSPTCESSFQDLK
        PRP TVS++RSFL L GYY+RF+EDFSRIASPL QLTR GT FVWSP CESSFQ+LK
Subjt:  PRPFTVSKVRSFLDLIGYYKRFMEDFSRIASPLNQLTRTGTHFVWSPTCESSFQDLK

TrEMBL top hitse value%identityAlignment
A0A5A7TC72 Ty3-gypsy retrotransposon protein3.1e-6884.08Show/hide
Query:  MSFGLTNALVVSMDLMNKVFKDFLDTFVIVFIDDILVYSKTEVGHEEHLHQVLETLRANKLYAKFSKCEFWLKKVFFLSHVVSSEGVSVDRTKIEAVTSW
        MSFGLTNA  V MDLMN+VFKDFLD+FVIVFIDDI+VYSKTEV HEEHLHQVLETLRANKLYAKFSKCEFWL+KV FL HVVSSEGVSVD TKIEAVT+W
Subjt:  MSFGLTNALVVSMDLMNKVFKDFLDTFVIVFIDDILVYSKTEVGHEEHLHQVLETLRANKLYAKFSKCEFWLKKVFFLSHVVSSEGVSVDRTKIEAVTSW

Query:  PRPFTVSKVRSFLDLIGYYKRFMEDFSRIASPLNQLTRTGTHFVWSPTCESSFQDLK
        PRP TVS++RSFL L GYY+RF+EDFSRIASPL QLTR GT FVWSP CE SFQ+LK
Subjt:  PRPFTVSKVRSFLDLIGYYKRFMEDFSRIASPLNQLTRTGTHFVWSPTCESSFQDLK

A0A5A7UE01 Reverse transcriptase4.0e-6884.08Show/hide
Query:  MSFGLTNALVVSMDLMNKVFKDFLDTFVIVFIDDILVYSKTEVGHEEHLHQVLETLRANKLYAKFSKCEFWLKKVFFLSHVVSSEGVSVDRTKIEAVTSW
        MSFGLTNA  V MDLMN+VFKDFLD+FVIVFIDDIL+YSKTE  HEEHLHQVLETLRANKLYAKFSKCEFWL+KV FL HVVSSEGVSVD  KIEAVT+W
Subjt:  MSFGLTNALVVSMDLMNKVFKDFLDTFVIVFIDDILVYSKTEVGHEEHLHQVLETLRANKLYAKFSKCEFWLKKVFFLSHVVSSEGVSVDRTKIEAVTSW

Query:  PRPFTVSKVRSFLDLIGYYKRFMEDFSRIASPLNQLTRTGTHFVWSPTCESSFQDLK
        PRP TVS++RSFL L GYY+RF+EDFSRIASPL QLTR GT FVWSPTCESSFQ LK
Subjt:  PRPFTVSKVRSFLDLIGYYKRFMEDFSRIASPLNQLTRTGTHFVWSPTCESSFQDLK

A0A5D3BSV9 Reverse transcriptase4.0e-6884.08Show/hide
Query:  MSFGLTNALVVSMDLMNKVFKDFLDTFVIVFIDDILVYSKTEVGHEEHLHQVLETLRANKLYAKFSKCEFWLKKVFFLSHVVSSEGVSVDRTKIEAVTSW
        MSFGLTNA  V MDLMN+VFKDFLD+FVIVFIDDIL+YSKTE  HEEHLHQVLETLRANKLYAKFSKCEFWL+KV FL HVVSSEGVSVD  KIEAVT+W
Subjt:  MSFGLTNALVVSMDLMNKVFKDFLDTFVIVFIDDILVYSKTEVGHEEHLHQVLETLRANKLYAKFSKCEFWLKKVFFLSHVVSSEGVSVDRTKIEAVTSW

Query:  PRPFTVSKVRSFLDLIGYYKRFMEDFSRIASPLNQLTRTGTHFVWSPTCESSFQDLK
        PRP TVS++RSFL L GYY+RF+EDFSRIASPL QLTR GT FVWSPTCESSFQ LK
Subjt:  PRPFTVSKVRSFLDLIGYYKRFMEDFSRIASPLNQLTRTGTHFVWSPTCESSFQDLK

A0A5D3BZN1 Reverse transcriptase1.8e-6884.08Show/hide
Query:  MSFGLTNALVVSMDLMNKVFKDFLDTFVIVFIDDILVYSKTEVGHEEHLHQVLETLRANKLYAKFSKCEFWLKKVFFLSHVVSSEGVSVDRTKIEAVTSW
        MSFGLTNA  V MDLMN+VFKDFLD+FVIVFIDDIL+YSKTE  HEEHLHQVLETLRANKLYAKFSKCEFWL+KV FL HVVSSEGVSVD TKIEAVT+W
Subjt:  MSFGLTNALVVSMDLMNKVFKDFLDTFVIVFIDDILVYSKTEVGHEEHLHQVLETLRANKLYAKFSKCEFWLKKVFFLSHVVSSEGVSVDRTKIEAVTSW

Query:  PRPFTVSKVRSFLDLIGYYKRFMEDFSRIASPLNQLTRTGTHFVWSPTCESSFQDLK
        PRP TVS++RSFL L GYY+RF+EDFSRIASPL QLTR GT FVWSP CESSFQ+LK
Subjt:  PRPFTVSKVRSFLDLIGYYKRFMEDFSRIASPLNQLTRTGTHFVWSPTCESSFQDLK

A0A5D3D4M7 Pol protein1.8e-6884.08Show/hide
Query:  MSFGLTNALVVSMDLMNKVFKDFLDTFVIVFIDDILVYSKTEVGHEEHLHQVLETLRANKLYAKFSKCEFWLKKVFFLSHVVSSEGVSVDRTKIEAVTSW
        MSFGLTNA  V MDLMN+VFKDFLD+FVIVFIDDIL+YSKTE  HEEHLHQVLETLRANKLYAKFSKCEFWL+KV FL HVVSSEGVSVD TKIEAVT+W
Subjt:  MSFGLTNALVVSMDLMNKVFKDFLDTFVIVFIDDILVYSKTEVGHEEHLHQVLETLRANKLYAKFSKCEFWLKKVFFLSHVVSSEGVSVDRTKIEAVTSW

Query:  PRPFTVSKVRSFLDLIGYYKRFMEDFSRIASPLNQLTRTGTHFVWSPTCESSFQDLK
        PRP TVS++RSFL L GYY+RF+EDFSRIASPL QLTR GT FVWSP CESSFQ+LK
Subjt:  PRPFTVSKVRSFLDLIGYYKRFMEDFSRIASPLNQLTRTGTHFVWSPTCESSFQDLK

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.61.1e-1932.91Show/hide
Query:  MSFGLTNALVVSMDLMNKVFKDFLDTFVIVFIDDILVYSKTEVGHEEHLHQVLETLRANKLYAKFSKCEFWLKKVFFLSHVVSSEGVSVDRTKIEAVTSW
        M FGL NA       MN + +  L+   +V++DDI+V+S +   H + L  V E L    L  +  KCEF  ++  FL HV++ +G+  +  KIEA+  +
Subjt:  MSFGLTNALVVSMDLMNKVFKDFLDTFVIVFIDDILVYSKTEVGHEEHLHQVLETLRANKLYAKFSKCEFWLKKVFFLSHVVSSEGVSVDRTKIEAVTSW

Query:  PRPFTVSKVRSFLDLIGYYKRFMEDFSRIASPLNQLTRTGTHF-VWSPTCESSFQDLK
        P P    ++++FL L GYY++F+ +F+ IA P+ +  +        +P  +S+F+ LK
Subjt:  PRPFTVSKVRSFLDLIGYYKRFMEDFSRIASPLNQLTRTGTHF-VWSPTCESSFQDLK

P0CT34 Transposon Tf2-1 polyprotein4.8e-1828.03Show/hide
Query:  MSFGLTNALVVSMDLMNKVFKDFLDTFVIVFIDDILVYSKTEVGHEEHLHQVLETLRANKLYAKFSKCEFWLKKVFFLSHVVSSEGVSVDRTKIEAVTSW
        M +G++ A       +N +  +  ++ V+ ++DDIL++SK+E  H +H+  VL+ L+   L    +KCEF   +V F+ + +S +G +  +  I+ V  W
Subjt:  MSFGLTNALVVSMDLMNKVFKDFLDTFVIVFIDDILVYSKTEVGHEEHLHQVLETLRANKLYAKFSKCEFWLKKVFFLSHVVSSEGVSVDRTKIEAVTSW

Query:  PRPFTVSKVRSFLDLIGYYKRFMEDFSRIASPLNQLTRTGTHFVWSPTCESSFQDLK
         +P    ++R FL  + Y ++F+   S++  PLN L +    + W+PT   + +++K
Subjt:  PRPFTVSKVRSFLDLIGYYKRFMEDFSRIASPLNQLTRTGTHFVWSPTCESSFQDLK

P0CT41 Transposon Tf2-12 polyprotein4.8e-1828.03Show/hide
Query:  MSFGLTNALVVSMDLMNKVFKDFLDTFVIVFIDDILVYSKTEVGHEEHLHQVLETLRANKLYAKFSKCEFWLKKVFFLSHVVSSEGVSVDRTKIEAVTSW
        M +G++ A       +N +  +  ++ V+ ++DDIL++SK+E  H +H+  VL+ L+   L    +KCEF   +V F+ + +S +G +  +  I+ V  W
Subjt:  MSFGLTNALVVSMDLMNKVFKDFLDTFVIVFIDDILVYSKTEVGHEEHLHQVLETLRANKLYAKFSKCEFWLKKVFFLSHVVSSEGVSVDRTKIEAVTSW

Query:  PRPFTVSKVRSFLDLIGYYKRFMEDFSRIASPLNQLTRTGTHFVWSPTCESSFQDLK
         +P    ++R FL  + Y ++F+   S++  PLN L +    + W+PT   + +++K
Subjt:  PRPFTVSKVRSFLDLIGYYKRFMEDFSRIASPLNQLTRTGTHFVWSPTCESSFQDLK

P20825 Retrovirus-related Pol polyprotein from transposon 2973.7e-1831.91Show/hide
Query:  MSFGLTNALVVSMDLMNKVFKDFLDTFVIVFIDDILVYSKTEVGHEEHLHQVLETLRANKLYAKFSKCEFWLKKVFFLSHVVSSEGVSVDRTKIEAVTSW
        M FGL NA       MN + +  L+   +V++DDI+++S +   H   +  V   L    L  +  KCEF  K+  FL H+V+ +G+  +  K++A+ S+
Subjt:  MSFGLTNALVVSMDLMNKVFKDFLDTFVIVFIDDILVYSKTEVGHEEHLHQVLETLRANKLYAKFSKCEFWLKKVFFLSHVVSSEGVSVDRTKIEAVTSW

Query:  PRPFTVSKVRSFLDLIGYYKRFMEDFSRIASPLNQLTRTGT
        P P    ++R+FL L GYY++F+ +++ IA P+    +  T
Subjt:  PRPFTVSKVRSFLDLIGYYKRFMEDFSRIASPLNQLTRTGT

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus4.8e-1831.16Show/hide
Query:  MSFGLTNALVVSMDLMNKVFKDFLDTFVIVFIDDILVYSKTEVGHEEHLHQVLETLRANKLYAKFSKCEFWLKKVFFLSHVVSSEGVSVDRTKIEAVTSW
        + FGL NA  +   +++ + ++ +     V+IDDI+V+S+    H ++L  VL +L    L     K  F   +V FL ++V+++G+  D  K+ A++  
Subjt:  MSFGLTNALVVSMDLMNKVFKDFLDTFVIVFIDDILVYSKTEVGHEEHLHQVLETLRANKLYAKFSKCEFWLKKVFFLSHVVSSEGVSVDRTKIEAVTSW

Query:  PRPFTVSKVRSFLDLIGYYKRFMEDFSRIASPLNQLTR
        P P +V +++ FL +  YY++F++D++++A PL  LTR
Subjt:  PRPFTVSKVRSFLDLIGYYKRFMEDFSRIASPLNQLTR

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein5.4e-1735.71Show/hide
Query:  HLHQVLETLRANKLYAKFSKCEFWLKKVFFLS--HVVSSEGVSVDRTKIEAVTSWPRPFTVSKVRSFLDLIGYYKRFMEDFSRIASPLNQLTRTGTHFVW
        HL  VL+    ++ YA   KC F   ++ +L   H++S EGVS D  K+EA+  WP P   +++R FL L GYY+RF++++ +I  PL +L +  +   W
Subjt:  HLHQVLETLRANKLYAKFSKCEFWLKKVFFLS--HVVSSEGVSVDRTKIEAVTSWPRPFTVSKVRSFLDLIGYYKRFMEDFSRIASPLNQLTRTGTHFVW

Query:  SPTCESSFQDLK
        +     +F+ LK
Subjt:  SPTCESSFQDLK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTTTTGGCTTGACAAATGCTCTTGTTGTATCTATGGATCTGATGAACAAGGTGTTTAAGGATTTCTTAGACACTTTTGTTATTGTTTTTATTGATGATATC
TTAGTTTACTCTAAGACAGAAGTTGGGCATGAAGAGCATTTGCATCAGGTTTTAGAGACTCTTCGAGCCAATAAGCTATATGCTAAGTTTTCCAAGTGTGAGTTT
TGGCTGAAAAAGGTGTTTTTTCTTAGCCATGTAGTATCAAGTGAGGGAGTTTCTGTAGACCGAACAAAGATTGAAGCTGTTACCAGTTGGCCTCGACCTTTTACA
GTTAGTAAAGTTCGTAGTTTTTTGGATTTAATAGGTTATTACAAAAGGTTCATGGAAGATTTCTCGCGTATAGCTAGTCCTTTGAATCAGTTGACCAGGACAGGG
ACTCATTTTGTTTGGAGCCCAACATGTGAGAGTAGTTTCCAAGATCTTAAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCTTTTGGCTTGACAAATGCTCTTGTTGTATCTATGGATCTGATGAACAAGGTGTTTAAGGATTTCTTAGACACTTTTGTTATTGTTTTTATTGATGATATC
TTAGTTTACTCTAAGACAGAAGTTGGGCATGAAGAGCATTTGCATCAGGTTTTAGAGACTCTTCGAGCCAATAAGCTATATGCTAAGTTTTCCAAGTGTGAGTTT
TGGCTGAAAAAGGTGTTTTTTCTTAGCCATGTAGTATCAAGTGAGGGAGTTTCTGTAGACCGAACAAAGATTGAAGCTGTTACCAGTTGGCCTCGACCTTTTACA
GTTAGTAAAGTTCGTAGTTTTTTGGATTTAATAGGTTATTACAAAAGGTTCATGGAAGATTTCTCGCGTATAGCTAGTCCTTTGAATCAGTTGACCAGGACAGGG
ACTCATTTTGTTTGGAGCCCAACATGTGAGAGTAGTTTCCAAGATCTTAAGTAG
Protein sequenceShow/hide protein sequence
MSFGLTNALVVSMDLMNKVFKDFLDTFVIVFIDDILVYSKTEVGHEEHLHQVLETLRANKLYAKFSKCEFWLKKVFFLSHVVSSEGVSVDRTKIEAVTSWPRPFT
VSKVRSFLDLIGYYKRFMEDFSRIASPLNQLTRTGTHFVWSPTCESSFQDLK