; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc03g0068781 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc03g0068781
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionReverse transcriptase
Genome locationCMiso1.1chr03:13081057..13081920
RNA-Seq ExpressionCmc03g0068781
SyntenyCmc03g0068781
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0016020 - membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0058812.1 pol protein [Cucumis melo var. makuwa]1.1e-13785.92Show/hide
Query:  MHLSIDYRELSKVTVKNRYPLPRIDNLFDQLQGDTVFSKIDLCSRYHQLRIRDSDIPKTAFHSKYGHYKFIVMSIGLTNALAVFMDLMNRVFKDFFDTFV
        M L IDYREL+KVT+KNRYPLPRID+LFDQLQG TVFSKIDL S YHQLRIRD DIPKTAFHS+YGHY+F+VMS GLTNA AVFMDLMNRVFKDF D+FV
Subjt:  MHLSIDYRELSKVTVKNRYPLPRIDNLFDQLQGDTVFSKIDLCSRYHQLRIRDSDIPKTAFHSKYGHYKFIVMSIGLTNALAVFMDLMNRVFKDFFDTFV

Query:  IDFIDDILVYSKTEIEHKDHLHQVLGTLRANKLYAKFSKCELWQKKVSFLGHVVSSEGVSVDPTKIEVVTSWPRSSTVSEIRSFLGLVGYYRRFVEDFSR
        I FIDDIL+YSKTE EH++HLHQVL TLRANKLYAKFSKCE W +KV+FLGHVVSSEGVSVDP KIE VT+WPR STVSEIRSFLGL GYYRRFVEDFSR
Subjt:  IDFIDDILVYSKTEIEHKDHLHQVLGTLRANKLYAKFSKCELWQKKVSFLGHVVSSEGVSVDPTKIEVVTSWPRSSTVSEIRSFLGLVGYYRRFVEDFSR

Query:  IASPLSQLTRKGTPFVWNPACESSFQELKQKLVSAPVLIVSDGSGSFMIYSDASKKGLGCVLMQQGKVVTYASRQLKSHEQNYP
        IASPL+QLTRKGTPFVW+PACESSFQELKQKLV+APVL V DGSG+F+IYSDASKKGLGCVLMQQGKVV YASRQLK HEQNYP
Subjt:  IASPLSQLTRKGTPFVWNPACESSFQELKQKLVSAPVLIVSDGSGSFMIYSDASKKGLGCVLMQQGKVVTYASRQLKSHEQNYP

KAA0063187.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]2.3e-14898.52Show/hide
Query:  MHLSIDYRELSKVTVKNRYPLPRIDNLFDQLQGDTVFSKIDLCSRYHQLRIRDSDIPKTAFHSKYGHYKFIVMSIGLTNALAVFMDLMNRVFKDFFDTFV
        MHLSIDYRELSKVTVKNRYPLPRIDNLFDQLQGDTVFSKIDLCSRYHQLRIRDSDIPKTAFHSKYGHYKFIVMSIGLTNALAVFMDLMNRVFKDFFDTFV
Subjt:  MHLSIDYRELSKVTVKNRYPLPRIDNLFDQLQGDTVFSKIDLCSRYHQLRIRDSDIPKTAFHSKYGHYKFIVMSIGLTNALAVFMDLMNRVFKDFFDTFV

Query:  IDFIDDILVYSKTEIEHKDHLHQVLGTLRANKLYAKFSKCELWQKKVSFLGHVVSSEGVSVDPTKIEVVTSWPRSSTVSEIRSFLGLVGYYRRFVEDFSR
        IDFIDDILVYSKTEIEHKDHLHQVLGTLRANKLYAKFSKCELWQKKVSFLGHVVSSEGVSVDPTKIEVVTSWPRSSTVSEIRSFLGLVGYYRRFVEDFSR
Subjt:  IDFIDDILVYSKTEIEHKDHLHQVLGTLRANKLYAKFSKCELWQKKVSFLGHVVSSEGVSVDPTKIEVVTSWPRSSTVSEIRSFLGLVGYYRRFVEDFSR

Query:  IASPLSQLTRKGTPFVWNPACESSFQELKQKLVSAPVLIVSDGSGSFMIYSDASKKGLGCVLMQQGKVVT
        IASPLSQLTRKGTPFVWNPACESSFQELKQKLVSAPVLIVSDGSGSFMIYSDASKKGLGCVLMQQ    T
Subjt:  IASPLSQLTRKGTPFVWNPACESSFQELKQKLVSAPVLIVSDGSGSFMIYSDASKKGLGCVLMQQGKVVT

KAA0063946.1 pol protein [Cucumis melo var. makuwa]1.4e-13786.62Show/hide
Query:  MHLSIDYRELSKVTVKNRYPLPRIDNLFDQLQGDTVFSKIDLCSRYHQLRIRDSDIPKTAFHSKYGHYKFIVMSIGLTNALAVFMDLMNRVFKDFFDTFV
        M L IDYREL+KVTVKNRYPLPRID+LFDQLQG TVFSKIDL S YHQLRIRDSDIPKTAF S+YGHY+F+VMS GLTNA AVFMDLMNRVFKDF D+FV
Subjt:  MHLSIDYRELSKVTVKNRYPLPRIDNLFDQLQGDTVFSKIDLCSRYHQLRIRDSDIPKTAFHSKYGHYKFIVMSIGLTNALAVFMDLMNRVFKDFFDTFV

Query:  IDFIDDILVYSKTEIEHKDHLHQVLGTLRANKLYAKFSKCELWQKKVSFLGHVVSSEGVSVDPTKIEVVTSWPRSSTVSEIRSFLGLVGYYRRFVEDFSR
        I FIDDIL+YSKTE EH++HLHQVL TLRANKLYAKFSKCE W +KV+FLGHVVSSEGVSVDP KIE VT+WPR STVSEIRSFLGL GYYRRFVEDFSR
Subjt:  IDFIDDILVYSKTEIEHKDHLHQVLGTLRANKLYAKFSKCELWQKKVSFLGHVVSSEGVSVDPTKIEVVTSWPRSSTVSEIRSFLGLVGYYRRFVEDFSR

Query:  IASPLSQLTRKGTPFVWNPACESSFQELKQKLVSAPVLIVSDGSGSFMIYSDASKKGLGCVLMQQGKVVTYASRQLKSHEQNYP
         ASPL+QLTRKGTPFVW+PACESSFQELKQKLV+APVL V DGSGSF+IYSDASKKGLGCVLMQQGKVV YASRQLKSHEQNYP
Subjt:  IASPLSQLTRKGTPFVWNPACESSFQELKQKLVSAPVLIVSDGSGSFMIYSDASKKGLGCVLMQQGKVVTYASRQLKSHEQNYP

TYK05193.1 pol protein [Cucumis melo var. makuwa]3.7e-13886.62Show/hide
Query:  MHLSIDYRELSKVTVKNRYPLPRIDNLFDQLQGDTVFSKIDLCSRYHQLRIRDSDIPKTAFHSKYGHYKFIVMSIGLTNALAVFMDLMNRVFKDFFDTFV
        M L IDYREL+KVTVKNRYPLPRID+LFDQLQG TVFSKIDL S YHQLRIRD DIPKTAF S+YGHY+F+VMS GLTNA AVFMDLMNRVFKDF D+FV
Subjt:  MHLSIDYRELSKVTVKNRYPLPRIDNLFDQLQGDTVFSKIDLCSRYHQLRIRDSDIPKTAFHSKYGHYKFIVMSIGLTNALAVFMDLMNRVFKDFFDTFV

Query:  IDFIDDILVYSKTEIEHKDHLHQVLGTLRANKLYAKFSKCELWQKKVSFLGHVVSSEGVSVDPTKIEVVTSWPRSSTVSEIRSFLGLVGYYRRFVEDFSR
        I FIDDIL+YSKTE EH++HLHQVL TLRANKLYAKFSKCE W +KV+FLGHVVSSEGVSVDPTKIE VT+WPR STVSEIRSFLGL GYYRRFVEDFSR
Subjt:  IDFIDDILVYSKTEIEHKDHLHQVLGTLRANKLYAKFSKCELWQKKVSFLGHVVSSEGVSVDPTKIEVVTSWPRSSTVSEIRSFLGLVGYYRRFVEDFSR

Query:  IASPLSQLTRKGTPFVWNPACESSFQELKQKLVSAPVLIVSDGSGSFMIYSDASKKGLGCVLMQQGKVVTYASRQLKSHEQNYP
        IASPL+QLTRKGTPFVW+PACESSFQELKQKLV+APVL V DGSG+F+IYSDASKKGLGCVLMQQGKVV YASRQLKSHEQNYP
Subjt:  IASPLSQLTRKGTPFVWNPACESSFQELKQKLVSAPVLIVSDGSGSFMIYSDASKKGLGCVLMQQGKVVTYASRQLKSHEQNYP

TYK27670.1 pol protein [Cucumis melo var. makuwa]3.1e-161100Show/hide
Query:  MHLSIDYRELSKVTVKNRYPLPRIDNLFDQLQGDTVFSKIDLCSRYHQLRIRDSDIPKTAFHSKYGHYKFIVMSIGLTNALAVFMDLMNRVFKDFFDTFV
        MHLSIDYRELSKVTVKNRYPLPRIDNLFDQLQGDTVFSKIDLCSRYHQLRIRDSDIPKTAFHSKYGHYKFIVMSIGLTNALAVFMDLMNRVFKDFFDTFV
Subjt:  MHLSIDYRELSKVTVKNRYPLPRIDNLFDQLQGDTVFSKIDLCSRYHQLRIRDSDIPKTAFHSKYGHYKFIVMSIGLTNALAVFMDLMNRVFKDFFDTFV

Query:  IDFIDDILVYSKTEIEHKDHLHQVLGTLRANKLYAKFSKCELWQKKVSFLGHVVSSEGVSVDPTKIEVVTSWPRSSTVSEIRSFLGLVGYYRRFVEDFSR
        IDFIDDILVYSKTEIEHKDHLHQVLGTLRANKLYAKFSKCELWQKKVSFLGHVVSSEGVSVDPTKIEVVTSWPRSSTVSEIRSFLGLVGYYRRFVEDFSR
Subjt:  IDFIDDILVYSKTEIEHKDHLHQVLGTLRANKLYAKFSKCELWQKKVSFLGHVVSSEGVSVDPTKIEVVTSWPRSSTVSEIRSFLGLVGYYRRFVEDFSR

Query:  IASPLSQLTRKGTPFVWNPACESSFQELKQKLVSAPVLIVSDGSGSFMIYSDASKKGLGCVLMQQGKVVTYASRQLKSHEQNYPILS
        IASPLSQLTRKGTPFVWNPACESSFQELKQKLVSAPVLIVSDGSGSFMIYSDASKKGLGCVLMQQGKVVTYASRQLKSHEQNYPILS
Subjt:  IASPLSQLTRKGTPFVWNPACESSFQELKQKLVSAPVLIVSDGSGSFMIYSDASKKGLGCVLMQQGKVVTYASRQLKSHEQNYPILS

TrEMBL top hitse value%identityAlignment
A0A5A7USG7 Reverse transcriptase5.1e-13885.92Show/hide
Query:  MHLSIDYRELSKVTVKNRYPLPRIDNLFDQLQGDTVFSKIDLCSRYHQLRIRDSDIPKTAFHSKYGHYKFIVMSIGLTNALAVFMDLMNRVFKDFFDTFV
        M L IDYREL+KVT+KNRYPLPRID+LFDQLQG TVFSKIDL S YHQLRIRD DIPKTAFHS+YGHY+F+VMS GLTNA AVFMDLMNRVFKDF D+FV
Subjt:  MHLSIDYRELSKVTVKNRYPLPRIDNLFDQLQGDTVFSKIDLCSRYHQLRIRDSDIPKTAFHSKYGHYKFIVMSIGLTNALAVFMDLMNRVFKDFFDTFV

Query:  IDFIDDILVYSKTEIEHKDHLHQVLGTLRANKLYAKFSKCELWQKKVSFLGHVVSSEGVSVDPTKIEVVTSWPRSSTVSEIRSFLGLVGYYRRFVEDFSR
        I FIDDIL+YSKTE EH++HLHQVL TLRANKLYAKFSKCE W +KV+FLGHVVSSEGVSVDP KIE VT+WPR STVSEIRSFLGL GYYRRFVEDFSR
Subjt:  IDFIDDILVYSKTEIEHKDHLHQVLGTLRANKLYAKFSKCELWQKKVSFLGHVVSSEGVSVDPTKIEVVTSWPRSSTVSEIRSFLGLVGYYRRFVEDFSR

Query:  IASPLSQLTRKGTPFVWNPACESSFQELKQKLVSAPVLIVSDGSGSFMIYSDASKKGLGCVLMQQGKVVTYASRQLKSHEQNYP
        IASPL+QLTRKGTPFVW+PACESSFQELKQKLV+APVL V DGSG+F+IYSDASKKGLGCVLMQQGKVV YASRQLK HEQNYP
Subjt:  IASPLSQLTRKGTPFVWNPACESSFQELKQKLVSAPVLIVSDGSGSFMIYSDASKKGLGCVLMQQGKVVTYASRQLKSHEQNYP

A0A5A7V868 Ty3-gypsy retrotransposon protein1.1e-14898.52Show/hide
Query:  MHLSIDYRELSKVTVKNRYPLPRIDNLFDQLQGDTVFSKIDLCSRYHQLRIRDSDIPKTAFHSKYGHYKFIVMSIGLTNALAVFMDLMNRVFKDFFDTFV
        MHLSIDYRELSKVTVKNRYPLPRIDNLFDQLQGDTVFSKIDLCSRYHQLRIRDSDIPKTAFHSKYGHYKFIVMSIGLTNALAVFMDLMNRVFKDFFDTFV
Subjt:  MHLSIDYRELSKVTVKNRYPLPRIDNLFDQLQGDTVFSKIDLCSRYHQLRIRDSDIPKTAFHSKYGHYKFIVMSIGLTNALAVFMDLMNRVFKDFFDTFV

Query:  IDFIDDILVYSKTEIEHKDHLHQVLGTLRANKLYAKFSKCELWQKKVSFLGHVVSSEGVSVDPTKIEVVTSWPRSSTVSEIRSFLGLVGYYRRFVEDFSR
        IDFIDDILVYSKTEIEHKDHLHQVLGTLRANKLYAKFSKCELWQKKVSFLGHVVSSEGVSVDPTKIEVVTSWPRSSTVSEIRSFLGLVGYYRRFVEDFSR
Subjt:  IDFIDDILVYSKTEIEHKDHLHQVLGTLRANKLYAKFSKCELWQKKVSFLGHVVSSEGVSVDPTKIEVVTSWPRSSTVSEIRSFLGLVGYYRRFVEDFSR

Query:  IASPLSQLTRKGTPFVWNPACESSFQELKQKLVSAPVLIVSDGSGSFMIYSDASKKGLGCVLMQQGKVVT
        IASPLSQLTRKGTPFVWNPACESSFQELKQKLVSAPVLIVSDGSGSFMIYSDASKKGLGCVLMQQ    T
Subjt:  IASPLSQLTRKGTPFVWNPACESSFQELKQKLVSAPVLIVSDGSGSFMIYSDASKKGLGCVLMQQGKVVT

A0A5A7VBY3 Reverse transcriptase6.7e-13886.62Show/hide
Query:  MHLSIDYRELSKVTVKNRYPLPRIDNLFDQLQGDTVFSKIDLCSRYHQLRIRDSDIPKTAFHSKYGHYKFIVMSIGLTNALAVFMDLMNRVFKDFFDTFV
        M L IDYREL+KVTVKNRYPLPRID+LFDQLQG TVFSKIDL S YHQLRIRDSDIPKTAF S+YGHY+F+VMS GLTNA AVFMDLMNRVFKDF D+FV
Subjt:  MHLSIDYRELSKVTVKNRYPLPRIDNLFDQLQGDTVFSKIDLCSRYHQLRIRDSDIPKTAFHSKYGHYKFIVMSIGLTNALAVFMDLMNRVFKDFFDTFV

Query:  IDFIDDILVYSKTEIEHKDHLHQVLGTLRANKLYAKFSKCELWQKKVSFLGHVVSSEGVSVDPTKIEVVTSWPRSSTVSEIRSFLGLVGYYRRFVEDFSR
        I FIDDIL+YSKTE EH++HLHQVL TLRANKLYAKFSKCE W +KV+FLGHVVSSEGVSVDP KIE VT+WPR STVSEIRSFLGL GYYRRFVEDFSR
Subjt:  IDFIDDILVYSKTEIEHKDHLHQVLGTLRANKLYAKFSKCELWQKKVSFLGHVVSSEGVSVDPTKIEVVTSWPRSSTVSEIRSFLGLVGYYRRFVEDFSR

Query:  IASPLSQLTRKGTPFVWNPACESSFQELKQKLVSAPVLIVSDGSGSFMIYSDASKKGLGCVLMQQGKVVTYASRQLKSHEQNYP
         ASPL+QLTRKGTPFVW+PACESSFQELKQKLV+APVL V DGSGSF+IYSDASKKGLGCVLMQQGKVV YASRQLKSHEQNYP
Subjt:  IASPLSQLTRKGTPFVWNPACESSFQELKQKLVSAPVLIVSDGSGSFMIYSDASKKGLGCVLMQQGKVVTYASRQLKSHEQNYP

A0A5D3BZN1 Reverse transcriptase1.8e-13886.62Show/hide
Query:  MHLSIDYRELSKVTVKNRYPLPRIDNLFDQLQGDTVFSKIDLCSRYHQLRIRDSDIPKTAFHSKYGHYKFIVMSIGLTNALAVFMDLMNRVFKDFFDTFV
        M L IDYREL+KVTVKNRYPLPRID+LFDQLQG TVFSKIDL S YHQLRIRD DIPKTAF S+YGHY+F+VMS GLTNA AVFMDLMNRVFKDF D+FV
Subjt:  MHLSIDYRELSKVTVKNRYPLPRIDNLFDQLQGDTVFSKIDLCSRYHQLRIRDSDIPKTAFHSKYGHYKFIVMSIGLTNALAVFMDLMNRVFKDFFDTFV

Query:  IDFIDDILVYSKTEIEHKDHLHQVLGTLRANKLYAKFSKCELWQKKVSFLGHVVSSEGVSVDPTKIEVVTSWPRSSTVSEIRSFLGLVGYYRRFVEDFSR
        I FIDDIL+YSKTE EH++HLHQVL TLRANKLYAKFSKCE W +KV+FLGHVVSSEGVSVDPTKIE VT+WPR STVSEIRSFLGL GYYRRFVEDFSR
Subjt:  IDFIDDILVYSKTEIEHKDHLHQVLGTLRANKLYAKFSKCELWQKKVSFLGHVVSSEGVSVDPTKIEVVTSWPRSSTVSEIRSFLGLVGYYRRFVEDFSR

Query:  IASPLSQLTRKGTPFVWNPACESSFQELKQKLVSAPVLIVSDGSGSFMIYSDASKKGLGCVLMQQGKVVTYASRQLKSHEQNYP
        IASPL+QLTRKGTPFVW+PACESSFQELKQKLV+APVL V DGSG+F+IYSDASKKGLGCVLMQQGKVV YASRQLKSHEQNYP
Subjt:  IASPLSQLTRKGTPFVWNPACESSFQELKQKLVSAPVLIVSDGSGSFMIYSDASKKGLGCVLMQQGKVVTYASRQLKSHEQNYP

A0A5D3DWP7 Pol protein1.5e-161100Show/hide
Query:  MHLSIDYRELSKVTVKNRYPLPRIDNLFDQLQGDTVFSKIDLCSRYHQLRIRDSDIPKTAFHSKYGHYKFIVMSIGLTNALAVFMDLMNRVFKDFFDTFV
        MHLSIDYRELSKVTVKNRYPLPRIDNLFDQLQGDTVFSKIDLCSRYHQLRIRDSDIPKTAFHSKYGHYKFIVMSIGLTNALAVFMDLMNRVFKDFFDTFV
Subjt:  MHLSIDYRELSKVTVKNRYPLPRIDNLFDQLQGDTVFSKIDLCSRYHQLRIRDSDIPKTAFHSKYGHYKFIVMSIGLTNALAVFMDLMNRVFKDFFDTFV

Query:  IDFIDDILVYSKTEIEHKDHLHQVLGTLRANKLYAKFSKCELWQKKVSFLGHVVSSEGVSVDPTKIEVVTSWPRSSTVSEIRSFLGLVGYYRRFVEDFSR
        IDFIDDILVYSKTEIEHKDHLHQVLGTLRANKLYAKFSKCELWQKKVSFLGHVVSSEGVSVDPTKIEVVTSWPRSSTVSEIRSFLGLVGYYRRFVEDFSR
Subjt:  IDFIDDILVYSKTEIEHKDHLHQVLGTLRANKLYAKFSKCELWQKKVSFLGHVVSSEGVSVDPTKIEVVTSWPRSSTVSEIRSFLGLVGYYRRFVEDFSR

Query:  IASPLSQLTRKGTPFVWNPACESSFQELKQKLVSAPVLIVSDGSGSFMIYSDASKKGLGCVLMQQGKVVTYASRQLKSHEQNYPILS
        IASPLSQLTRKGTPFVWNPACESSFQELKQKLVSAPVLIVSDGSGSFMIYSDASKKGLGCVLMQQGKVVTYASRQLKSHEQNYPILS
Subjt:  IASPLSQLTRKGTPFVWNPACESSFQELKQKLVSAPVLIVSDGSGSFMIYSDASKKGLGCVLMQQGKVVTYASRQLKSHEQNYPILS

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.62.9e-5337.14Show/hide
Query:  IDYRELSKVTVKNRYPLPRIDNLFDQLQGDTVFSKIDLCSRYHQLRIRDSDIPKTAFHSKYGHYKFIVMSIGLTNALAVFMDLMNRVFKDFFDTFVIDFI
        IDYR+L+++TV +R+P+P +D +  +L     F+ IDL   +HQ+ +    + KTAF +K+GHY+++ M  GL NA A F   MN + +   +   + ++
Subjt:  IDYRELSKVTVKNRYPLPRIDNLFDQLQGDTVFSKIDLCSRYHQLRIRDSDIPKTAFHSKYGHYKFIVMSIGLTNALAVFMDLMNRVFKDFFDTFVIDFI

Query:  DDILVYSKTEIEHKDHLHQVLGTLRANKLYAKFSKCELWQKKVSFLGHVVSSEGVSVDPTKIEVVTSWPRSSTVSEIRSFLGLVGYYRRFVEDFSRIASP
        DDI+V+S +  EH   L  V   L    L  +  KCE  +++ +FLGHV++ +G+  +P KIE +  +P  +   EI++FLGL GYYR+F+ +F+ IA P
Subjt:  DDILVYSKTEIEHKDHLHQVLGTLRANKLYAKFSKCELWQKKVSFLGHVVSSEGVSVDPTKIEVVTSWPRSSTVSEIRSFLGLVGYYRRFVEDFSRIASP

Query:  LSQLTRKGTPF-VWNPACESSFQELKQKLVSAPVLIVSDGSGSFMIYSDASKKGLGCVLMQQGKVVTYASRQLKSHEQNY
        +++  +K       NP  +S+F++LK  +   P+L V D +  F + +DAS   LG VL Q G  ++Y SR L  HE NY
Subjt:  LSQLTRKGTPF-VWNPACESSFQELKQKLVSAPVLIVSDGSGSFMIYSDASKKGLGCVLMQQGKVVTYASRQLKSHEQNY

P0CT34 Transposon Tf2-1 polyprotein5.3e-4734.48Show/hide
Query:  MHLSIDYRELSKVTVKNRYPLPRIDNLFDQLQGDTVFSKIDLCSRYHQLRIRDSDIPKTAFHSKYGHYKFIVMSIGLTNALAVFMDLMNRVFKDFFDTFV
        + + +DY+ L+K    N YPLP I+ L  ++QG T+F+K+DL S YH +R+R  D  K AF    G ++++VM  G++ A A F   +N +  +  ++ V
Subjt:  MHLSIDYRELSKVTVKNRYPLPRIDNLFDQLQGDTVFSKIDLCSRYHQLRIRDSDIPKTAFHSKYGHYKFIVMSIGLTNALAVFMDLMNRVFKDFFDTFV

Query:  IDFIDDILVYSKTEIEHKDHLHQVLGTLRANKLYAKFSKCELWQKKVSFLGHVVSSEGVSVDPTKIEVVTSWPRSSTVSEIRSFLGLVGYYRRFVEDFSR
        + ++DDIL++SK+E EH  H+  VL  L+   L    +KCE  Q +V F+G+ +S +G +     I+ V  W +     E+R FLG V Y R+F+   S+
Subjt:  IDFIDDILVYSKTEIEHKDHLHQVLGTLRANKLYAKFSKCELWQKKVSFLGHVVSSEGVSVDPTKIEVVTSWPRSSTVSEIRSFLGLVGYYRRFVEDFSR

Query:  IASPLSQLTRKGTPFVWNPACESSFQELKQKLVSAPVLIVSDGSGSFMIYSDASKKGLGCVLMQQGK-----VVTYASRQLKSHEQNYPI
        +  PL+ L +K   + W P    + + +KQ LVS PVL   D S   ++ +DAS   +G VL Q+        V Y S ++   + NY +
Subjt:  IASPLSQLTRKGTPFVWNPACESSFQELKQKLVSAPVLIVSDGSGSFMIYSDASKKGLGCVLMQQGK-----VVTYASRQLKSHEQNYPI

P0CT35 Transposon Tf2-2 polyprotein5.3e-4734.48Show/hide
Query:  MHLSIDYRELSKVTVKNRYPLPRIDNLFDQLQGDTVFSKIDLCSRYHQLRIRDSDIPKTAFHSKYGHYKFIVMSIGLTNALAVFMDLMNRVFKDFFDTFV
        + + +DY+ L+K    N YPLP I+ L  ++QG T+F+K+DL S YH +R+R  D  K AF    G ++++VM  G++ A A F   +N +  +  ++ V
Subjt:  MHLSIDYRELSKVTVKNRYPLPRIDNLFDQLQGDTVFSKIDLCSRYHQLRIRDSDIPKTAFHSKYGHYKFIVMSIGLTNALAVFMDLMNRVFKDFFDTFV

Query:  IDFIDDILVYSKTEIEHKDHLHQVLGTLRANKLYAKFSKCELWQKKVSFLGHVVSSEGVSVDPTKIEVVTSWPRSSTVSEIRSFLGLVGYYRRFVEDFSR
        + ++DDIL++SK+E EH  H+  VL  L+   L    +KCE  Q +V F+G+ +S +G +     I+ V  W +     E+R FLG V Y R+F+   S+
Subjt:  IDFIDDILVYSKTEIEHKDHLHQVLGTLRANKLYAKFSKCELWQKKVSFLGHVVSSEGVSVDPTKIEVVTSWPRSSTVSEIRSFLGLVGYYRRFVEDFSR

Query:  IASPLSQLTRKGTPFVWNPACESSFQELKQKLVSAPVLIVSDGSGSFMIYSDASKKGLGCVLMQQGK-----VVTYASRQLKSHEQNYPI
        +  PL+ L +K   + W P    + + +KQ LVS PVL   D S   ++ +DAS   +G VL Q+        V Y S ++   + NY +
Subjt:  IASPLSQLTRKGTPFVWNPACESSFQELKQKLVSAPVLIVSDGSGSFMIYSDASKKGLGCVLMQQGK-----VVTYASRQLKSHEQNYPI

P0CT41 Transposon Tf2-12 polyprotein5.3e-4734.48Show/hide
Query:  MHLSIDYRELSKVTVKNRYPLPRIDNLFDQLQGDTVFSKIDLCSRYHQLRIRDSDIPKTAFHSKYGHYKFIVMSIGLTNALAVFMDLMNRVFKDFFDTFV
        + + +DY+ L+K    N YPLP I+ L  ++QG T+F+K+DL S YH +R+R  D  K AF    G ++++VM  G++ A A F   +N +  +  ++ V
Subjt:  MHLSIDYRELSKVTVKNRYPLPRIDNLFDQLQGDTVFSKIDLCSRYHQLRIRDSDIPKTAFHSKYGHYKFIVMSIGLTNALAVFMDLMNRVFKDFFDTFV

Query:  IDFIDDILVYSKTEIEHKDHLHQVLGTLRANKLYAKFSKCELWQKKVSFLGHVVSSEGVSVDPTKIEVVTSWPRSSTVSEIRSFLGLVGYYRRFVEDFSR
        + ++DDIL++SK+E EH  H+  VL  L+   L    +KCE  Q +V F+G+ +S +G +     I+ V  W +     E+R FLG V Y R+F+   S+
Subjt:  IDFIDDILVYSKTEIEHKDHLHQVLGTLRANKLYAKFSKCELWQKKVSFLGHVVSSEGVSVDPTKIEVVTSWPRSSTVSEIRSFLGLVGYYRRFVEDFSR

Query:  IASPLSQLTRKGTPFVWNPACESSFQELKQKLVSAPVLIVSDGSGSFMIYSDASKKGLGCVLMQQGK-----VVTYASRQLKSHEQNYPI
        +  PL+ L +K   + W P    + + +KQ LVS PVL   D S   ++ +DAS   +G VL Q+        V Y S ++   + NY +
Subjt:  IASPLSQLTRKGTPFVWNPACESSFQELKQKLVSAPVLIVSDGSGSFMIYSDASKKGLGCVLMQQGK-----VVTYASRQLKSHEQNYPI

P20825 Retrovirus-related Pol polyprotein from transposon 2971.9e-5235.36Show/hide
Query:  IDYRELSKVTVKNRYPLPRIDNLFDQLQGDTVFSKIDLCSRYHQLRIRDSDIPKTAFHSKYGHYKFIVMSIGLTNALAVFMDLMNRVFKDFFDTFVIDFI
        IDYR+L+++T+ +RYP+P +D +  +L     F+ IDL   +HQ+ + +  I KTAF +K GHY+++ M  GL NA A F   MN + +   +   + ++
Subjt:  IDYRELSKVTVKNRYPLPRIDNLFDQLQGDTVFSKIDLCSRYHQLRIRDSDIPKTAFHSKYGHYKFIVMSIGLTNALAVFMDLMNRVFKDFFDTFVIDFI

Query:  DDILVYSKTEIEHKDHLHQVLGTLRANKLYAKFSKCELWQKKVSFLGHVVSSEGVSVDPTKIEVVTSWPRSSTVSEIRSFLGLVGYYRRFVEDFSRIASP
        DDI+++S +  EH + +  V   L    L  +  KCE  +K+ +FLGH+V+ +G+  +P K++ + S+P  +   EIR+FLGL GYYR+F+ +++ IA P
Subjt:  DDILVYSKTEIEHKDHLHQVLGTLRANKLYAKFSKCELWQKKVSFLGHVVSSEGVSVDPTKIEVVTSWPRSSTVSEIRSFLGLVGYYRRFVEDFSRIASP

Query:  LSQLTRKGTPF-VWNPACESSFQELKQKLVSAPVLIVSDGSGSFMIYSDASKKGLGCVLMQQGKVVTYASRQLKSHEQNY
        ++   +K T           +F++LK  ++  P+L + D    F++ +DAS   LG VL Q G  +++ SR L  HE NY
Subjt:  LSQLTRKGTPF-VWNPACESSFQELKQKLVSAPVLIVSDGSGSFMIYSDASKKGLGCVLMQQGKVVTYASRQLKSHEQNY

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein7.1e-2338.89Show/hide
Query:  DHLHQVLGTLRANKLYAKFSKCELWQKKVSFLG--HVVSSEGVSVDPTKIEVVTSWPRSSTVSEIRSFLGLVGYYRRFVEDFSRIASPLSQLTRKGTPFV
        +HL  VL     ++ YA   KC   Q ++++LG  H++S EGVS DP K+E +  WP     +E+R FLGL GYYRRFV+++ +I  PL++L +K +   
Subjt:  DHLHQVLGTLRANKLYAKFSKCELWQKKVSFLG--HVVSSEGVSVDPTKIEVVTSWPRSSTVSEIRSFLGLVGYYRRFVEDFSRIASPLSQLTRKGTPFV

Query:  WNPACESSFQELKQKLVSAPVLIVSD
        W      +F+ LK  + + PVL + D
Subjt:  WNPACESSFQELKQKLVSAPVLIVSD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCACCTTTCCATTGATTACAGAGAGCTAAGTAAGGTGACAGTAAAAAACCGTTATCCTTTGCCCAGGATTGACAACTTGTTTGACCAGCTGCAAGGAGACACT
GTCTTCTCTAAGATCGATTTATGTTCGAGATACCACCAGTTGAGGATCAGAGATAGTGATATTCCTAAGACTGCGTTCCATTCCAAATACGGGCATTACAAGTTC
ATTGTGATGTCCATTGGTTTGACTAATGCTCTTGCAGTATTTATGGACTTGATGAACAGAGTGTTTAAGGACTTCTTCGACACATTTGTTATAGACTTTATTGAT
GATATCTTGGTTTACTCCAAGACAGAGATCGAGCATAAGGATCACTTGCATCAAGTTTTGGGGACTCTTCGAGCTAATAAGCTGTATGCCAAGTTTTCCAAGTGT
GAGTTATGGCAGAAGAAGGTATCTTTTCTTGGACATGTGGTGTCCAGTGAGGGAGTTTCTGTAGACCCAACAAAGATTGAAGTTGTTACTAGTTGGCCTCGATCG
TCTACAGTCAGTGAGATCCGTAGTTTTTTGGGTCTAGTAGGTTATTATAGGAGGTTCGTGGAAGACTTTTCTCGTATAGCTAGTCCTTTGAGTCAGTTGACCAGG
AAGGGGACTCCATTTGTTTGGAACCCAGCTTGTGAATCTAGTTTCCAGGAGCTCAAGCAGAAGCTTGTGTCTGCACCAGTCTTAATAGTATCAGATGGATCTGGA
AGTTTCATGATCTACAGTGATGCCTCAAAGAAAGGACTGGGTTGTGTTCTGATGCAGCAAGGTAAGGTAGTTACTTATGCCTCCCGTCAGTTGAAGAGTCATGAG
CAGAACTACCCTATCCTGTCCTAG
mRNA sequenceShow/hide mRNA sequence
ATGCACCTTTCCATTGATTACAGAGAGCTAAGTAAGGTGACAGTAAAAAACCGTTATCCTTTGCCCAGGATTGACAACTTGTTTGACCAGCTGCAAGGAGACACT
GTCTTCTCTAAGATCGATTTATGTTCGAGATACCACCAGTTGAGGATCAGAGATAGTGATATTCCTAAGACTGCGTTCCATTCCAAATACGGGCATTACAAGTTC
ATTGTGATGTCCATTGGTTTGACTAATGCTCTTGCAGTATTTATGGACTTGATGAACAGAGTGTTTAAGGACTTCTTCGACACATTTGTTATAGACTTTATTGAT
GATATCTTGGTTTACTCCAAGACAGAGATCGAGCATAAGGATCACTTGCATCAAGTTTTGGGGACTCTTCGAGCTAATAAGCTGTATGCCAAGTTTTCCAAGTGT
GAGTTATGGCAGAAGAAGGTATCTTTTCTTGGACATGTGGTGTCCAGTGAGGGAGTTTCTGTAGACCCAACAAAGATTGAAGTTGTTACTAGTTGGCCTCGATCG
TCTACAGTCAGTGAGATCCGTAGTTTTTTGGGTCTAGTAGGTTATTATAGGAGGTTCGTGGAAGACTTTTCTCGTATAGCTAGTCCTTTGAGTCAGTTGACCAGG
AAGGGGACTCCATTTGTTTGGAACCCAGCTTGTGAATCTAGTTTCCAGGAGCTCAAGCAGAAGCTTGTGTCTGCACCAGTCTTAATAGTATCAGATGGATCTGGA
AGTTTCATGATCTACAGTGATGCCTCAAAGAAAGGACTGGGTTGTGTTCTGATGCAGCAAGGTAAGGTAGTTACTTATGCCTCCCGTCAGTTGAAGAGTCATGAG
CAGAACTACCCTATCCTGTCCTAG
Protein sequenceShow/hide protein sequence
MHLSIDYRELSKVTVKNRYPLPRIDNLFDQLQGDTVFSKIDLCSRYHQLRIRDSDIPKTAFHSKYGHYKFIVMSIGLTNALAVFMDLMNRVFKDFFDTFVIDFID
DILVYSKTEIEHKDHLHQVLGTLRANKLYAKFSKCELWQKKVSFLGHVVSSEGVSVDPTKIEVVTSWPRSSTVSEIRSFLGLVGYYRRFVEDFSRIASPLSQLTR
KGTPFVWNPACESSFQELKQKLVSAPVLIVSDGSGSFMIYSDASKKGLGCVLMQQGKVVTYASRQLKSHEQNYPILS