; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc01g0027701 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc01g0027701
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionReverse transcriptase
Genome locationCMiso1.1chr01:28882244..28883251
RNA-Seq ExpressionCmc01g0027701
SyntenyCmc01g0027701
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008194 - UDP-glycosyltransferase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0026063.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]3.2e-17092.24Show/hide
Query:  MKASKLLSQGTWGILASVVDIREPEVSLSSEPVVREYPDVFPDELPGLLPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS
        MKASKLLSQGTWGILASVVDIREPEVSLSSEPVVREYPDVFPDELPGL PPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS
Subjt:  MKASKLLSQGTWGILASVVDIREPEVSLSSEPVVREYPDVFPDELPGLLPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS

Query:  PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFM
        PWGAPVLFVKKKDGSMRLCIDYRELNKV VKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFM
Subjt:  PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFM

Query:  DLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEGVSVDPAKIEAVTNCLDRPQLVKFEVFW
        DLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEGVSVDPAKIEA                 
Subjt:  DLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEGVSVDPAKIEAVTNCLDRPQLVKFEVFW

Query:  AWQVLQEVRGRLLTYSQPVDPVDQEGNPFYLEPSM
              EVRGRLLTYSQPVDPVDQEGNPF LEPSM
Subjt:  AWQVLQEVRGRLLTYSQPVDPVDQEGNPFYLEPSM

KAA0043063.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]9.4e-17091.94Show/hide
Query:  MKASKLLSQGTWGILASVVDIREPEVSLSSEPVVREYPDVFPDELPGLLPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS
        MKASKLLSQGTWGILASVVDIREPEVSLSSEPVVREYPDVFPDELPGL PPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS
Subjt:  MKASKLLSQGTWGILASVVDIREPEVSLSSEPVVREYPDVFPDELPGLLPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS

Query:  PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFM
        PW APVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFM
Subjt:  PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFM

Query:  DLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEGVSVDPAKIEAVTNCLDRPQLVKFEVFW
        DLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEGVSVDPAKIEA                 
Subjt:  DLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEGVSVDPAKIEAVTNCLDRPQLVKFEVFW

Query:  AWQVLQEVRGRLLTYSQPVDPVDQEGNPFYLEPSM
              EVRGRLLTYSQPVDPVDQEGNPF +EPSM
Subjt:  AWQVLQEVRGRLLTYSQPVDPVDQEGNPFYLEPSM

KAA0053301.1 pol protein [Cucumis melo var. makuwa]4.0e-16088.13Show/hide
Query:  MKASKLLSQGTWGILASVVDIREPEVSLSSEPVVREYPDVFPDELPGLLPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS
        MKASKLLSQGTWGILASVVDIREPEVSLSSEPVVREYPDVFPDELPGL PPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS
Subjt:  MKASKLLSQGTWGILASVVDIREPEVSLSSEPVVREYPDVFPDELPGLLPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS

Query:  PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFM
        PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFM
Subjt:  PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFM

Query:  DLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEGVSVDPAKIEAVTNCLDRPQLVKFEVFW
        DLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEGVSVDPAKIEAVTN   RP  V  E+  
Subjt:  DLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEGVSVDPAKIEAVTNCLDRPQLVKFEVFW

Query:  AWQVLQEVRGRLLTYSQPVDPVDQ---EGNPFYLEPS
           +    R  +  +S+   P+ Q   +G PF   P+
Subjt:  AWQVLQEVRGRLLTYSQPVDPVDQ---EGNPFYLEPS

KAA0061224.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]6.5e-16387.76Show/hide
Query:  MKASKLLSQGTWGILASVVDIREPEVSLSSEPVVREYPDVFPDELPGLLPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS
        MKASKLLSQGTWGILASVVD RE EVSLSSEPVVREYPDVFPDELPGL PPRE++FAIELE GTA ISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS
Subjt:  MKASKLLSQGTWGILASVVDIREPEVSLSSEPVVREYPDVFPDELPGLLPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS

Query:  PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFM
        PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDL SGYHQLRIRD DIPKTAFRSRYGHYEF+VMSF LTNAPAVFM
Subjt:  PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFM

Query:  DLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEGVSVDPAKIEAVTNCLDRPQLVKFEVFW
        DLMNRVFKDFLDSFVI+FIDDILIYSKTEAEHEEHL QVLETLRAN+LYAKFSKCE WL+KV+FLGHVVSSEGVSVDP KIEAVTN   RP  V     +
Subjt:  DLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEGVSVDPAKIEAVTNCLDRPQLVKFEVFW

Query:  AWQVLQEVRGRLLTYSQPVDPVDQEGNPFYLEPSM
            LQEV GRLL Y QP+DPVDQ+GNPF L+PS+
Subjt:  AWQVLQEVRGRLLTYSQPVDPVDQEGNPFYLEPSM

TYK20443.1 pol protein [Cucumis melo var. makuwa]5.2e-16087.83Show/hide
Query:  MKASKLLSQGTWGILASVVDIREPEVSLSSEPVVREYPDVFPDELPGLLPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS
        MKASKLLSQGTWGILASVVD+REPEVSLSSEPVVREYPDVFPDELPGL PPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS
Subjt:  MKASKLLSQGTWGILASVVDIREPEVSLSSEPVVREYPDVFPDELPGLLPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS

Query:  PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFM
        PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFM
Subjt:  PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFM

Query:  DLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEGVSVDPAKIEAVTNCLDRPQLVKFEVFW
        DLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEGVSVDPAKIEAVTN   RP  V  E+  
Subjt:  DLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEGVSVDPAKIEAVTNCLDRPQLVKFEVFW

Query:  AWQVLQEVRGRLLTYSQPVDPVDQ---EGNPFYLEPS
           +    R  +  +S+   P+ Q   +G PF   P+
Subjt:  AWQVLQEVRGRLLTYSQPVDPVDQ---EGNPFYLEPS

TrEMBL top hitse value%identityAlignment
A0A5A7SN45 Reverse transcriptase1.6e-17092.24Show/hide
Query:  MKASKLLSQGTWGILASVVDIREPEVSLSSEPVVREYPDVFPDELPGLLPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS
        MKASKLLSQGTWGILASVVDIREPEVSLSSEPVVREYPDVFPDELPGL PPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS
Subjt:  MKASKLLSQGTWGILASVVDIREPEVSLSSEPVVREYPDVFPDELPGLLPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS

Query:  PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFM
        PWGAPVLFVKKKDGSMRLCIDYRELNKV VKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFM
Subjt:  PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFM

Query:  DLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEGVSVDPAKIEAVTNCLDRPQLVKFEVFW
        DLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEGVSVDPAKIEA                 
Subjt:  DLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEGVSVDPAKIEAVTNCLDRPQLVKFEVFW

Query:  AWQVLQEVRGRLLTYSQPVDPVDQEGNPFYLEPSM
              EVRGRLLTYSQPVDPVDQEGNPF LEPSM
Subjt:  AWQVLQEVRGRLLTYSQPVDPVDQEGNPFYLEPSM

A0A5A7TI92 Reverse transcriptase4.6e-17091.94Show/hide
Query:  MKASKLLSQGTWGILASVVDIREPEVSLSSEPVVREYPDVFPDELPGLLPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS
        MKASKLLSQGTWGILASVVDIREPEVSLSSEPVVREYPDVFPDELPGL PPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS
Subjt:  MKASKLLSQGTWGILASVVDIREPEVSLSSEPVVREYPDVFPDELPGLLPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS

Query:  PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFM
        PW APVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFM
Subjt:  PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFM

Query:  DLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEGVSVDPAKIEAVTNCLDRPQLVKFEVFW
        DLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEGVSVDPAKIEA                 
Subjt:  DLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEGVSVDPAKIEAVTNCLDRPQLVKFEVFW

Query:  AWQVLQEVRGRLLTYSQPVDPVDQEGNPFYLEPSM
              EVRGRLLTYSQPVDPVDQEGNPF +EPSM
Subjt:  AWQVLQEVRGRLLTYSQPVDPVDQEGNPFYLEPSM

A0A5A7UIB4 Pol protein1.9e-16088.13Show/hide
Query:  MKASKLLSQGTWGILASVVDIREPEVSLSSEPVVREYPDVFPDELPGLLPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS
        MKASKLLSQGTWGILASVVDIREPEVSLSSEPVVREYPDVFPDELPGL PPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS
Subjt:  MKASKLLSQGTWGILASVVDIREPEVSLSSEPVVREYPDVFPDELPGLLPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS

Query:  PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFM
        PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFM
Subjt:  PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFM

Query:  DLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEGVSVDPAKIEAVTNCLDRPQLVKFEVFW
        DLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEGVSVDPAKIEAVTN   RP  V  E+  
Subjt:  DLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEGVSVDPAKIEAVTNCLDRPQLVKFEVFW

Query:  AWQVLQEVRGRLLTYSQPVDPVDQ---EGNPFYLEPS
           +    R  +  +S+   P+ Q   +G PF   P+
Subjt:  AWQVLQEVRGRLLTYSQPVDPVDQ---EGNPFYLEPS

A0A5D3BTN0 Reverse transcriptase2.5e-16087.83Show/hide
Query:  MKASKLLSQGTWGILASVVDIREPEVSLSSEPVVREYPDVFPDELPGLLPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS
        MKASKLLSQGTWGILASVVD+REPEVSLSSEPVVREYPDVFPDELPGL PPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS
Subjt:  MKASKLLSQGTWGILASVVDIREPEVSLSSEPVVREYPDVFPDELPGLLPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS

Query:  PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFM
        PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFM
Subjt:  PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFM

Query:  DLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEGVSVDPAKIEAVTNCLDRPQLVKFEVFW
        DLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEGVSVDPAKIEAVTN   RP  V  E+  
Subjt:  DLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEGVSVDPAKIEAVTNCLDRPQLVKFEVFW

Query:  AWQVLQEVRGRLLTYSQPVDPVDQ---EGNPFYLEPS
           +    R  +  +S+   P+ Q   +G PF   P+
Subjt:  AWQVLQEVRGRLLTYSQPVDPVDQ---EGNPFYLEPS

A0A5D3C4W8 Ty3-gypsy retrotransposon protein3.2e-16387.76Show/hide
Query:  MKASKLLSQGTWGILASVVDIREPEVSLSSEPVVREYPDVFPDELPGLLPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS
        MKASKLLSQGTWGILASVVD RE EVSLSSEPVVREYPDVFPDELPGL PPRE++FAIELE GTA ISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS
Subjt:  MKASKLLSQGTWGILASVVDIREPEVSLSSEPVVREYPDVFPDELPGLLPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS

Query:  PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFM
        PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDL SGYHQLRIRD DIPKTAFRSRYGHYEF+VMSF LTNAPAVFM
Subjt:  PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFM

Query:  DLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEGVSVDPAKIEAVTNCLDRPQLVKFEVFW
        DLMNRVFKDFLDSFVI+FIDDILIYSKTEAEHEEHL QVLETLRAN+LYAKFSKCE WL+KV+FLGHVVSSEGVSVDP KIEAVTN   RP  V     +
Subjt:  DLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEGVSVDPAKIEAVTNCLDRPQLVKFEVFW

Query:  AWQVLQEVRGRLLTYSQPVDPVDQEGNPFYLEPSM
            LQEV GRLL Y QP+DPVDQ+GNPF L+PS+
Subjt:  AWQVLQEVRGRLLTYSQPVDPVDQEGNPFYLEPSM

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein9.8e-4534.96Show/hide
Query:  IREPEVSLSSEPVVREYPDVFPDELPGLLPP--REVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRL
        ++EPE+      + +E+ D+  +     LP   + ++F +EL      +    Y + P +++ +  ++ + L  G IR S +    PV+FV KK+G++R+
Subjt:  IREPEVSLSSEPVVREYPDVFPDELPGLLPP--REVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRL

Query:  CIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVF
         +DY+ LNK    N YPLP I+ L  ++QG+T+F+K+DL+S YH +R+R GD  K AFR   G +E++VM +G++ APA F   +N +  +  +S V+ +
Subjt:  CIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVF

Query:  IDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEGVSVDPAKIEAV
        +DDILI+SK+E+EH +H+  VL+ L+   L    +KCEF   +V F+G+ +S +G +     I+ V
Subjt:  IDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEGVSVDPAKIEAV

P0CT35 Transposon Tf2-2 polyprotein9.8e-4534.96Show/hide
Query:  IREPEVSLSSEPVVREYPDVFPDELPGLLPP--REVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRL
        ++EPE+      + +E+ D+  +     LP   + ++F +EL      +    Y + P +++ +  ++ + L  G IR S +    PV+FV KK+G++R+
Subjt:  IREPEVSLSSEPVVREYPDVFPDELPGLLPP--REVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRL

Query:  CIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVF
         +DY+ LNK    N YPLP I+ L  ++QG+T+F+K+DL+S YH +R+R GD  K AFR   G +E++VM +G++ APA F   +N +  +  +S V+ +
Subjt:  CIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVF

Query:  IDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEGVSVDPAKIEAV
        +DDILI+SK+E+EH +H+  VL+ L+   L    +KCEF   +V F+G+ +S +G +     I+ V
Subjt:  IDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEGVSVDPAKIEAV

P0CT41 Transposon Tf2-12 polyprotein9.8e-4534.96Show/hide
Query:  IREPEVSLSSEPVVREYPDVFPDELPGLLPP--REVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRL
        ++EPE+      + +E+ D+  +     LP   + ++F +EL      +    Y + P +++ +  ++ + L  G IR S +    PV+FV KK+G++R+
Subjt:  IREPEVSLSSEPVVREYPDVFPDELPGLLPP--REVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRL

Query:  CIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVF
         +DY+ LNK    N YPLP I+ L  ++QG+T+F+K+DL+S YH +R+R GD  K AFR   G +E++VM +G++ APA F   +N +  +  +S V+ +
Subjt:  CIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVF

Query:  IDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEGVSVDPAKIEAV
        +DDILI+SK+E+EH +H+  VL+ L+   L    +KCEF   +V F+G+ +S +G +     I+ V
Subjt:  IDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEGVSVDPAKIEAV

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein5.6e-4836.77Show/hide
Query:  KASKLLSQGTWGILASVVDIREPEVSLSSEP---------VVREYPDVFPDELPGLLPPREVDF-------AIELEPGTAPISRAPYRMAPAELKELKVQ
        +AS L   G +  + S +   EP  +  S           + ++Y ++  ++    LPPR  D         IE++PG       PY +     +E+   
Subjt:  KASKLLSQGTWGILASVVDIREPEVSLSSEP---------VVREYPDVFPDELPGLLPPREVDF-------AIELEPGTAPISRAPYRMAPAELKELKVQ

Query:  LQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEF
        +Q+LLD  FI PS SP  +PV+ V KKDG+ RLC+DYR LNK T+ + +PLPRID+L  ++  A +F+ +DL SGYHQ+ +   D  KTAF +  G YE+
Subjt:  LQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEF

Query:  VVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEGVSVDPAKIEAVT
         VM FGL NAP+ F   M   F+D    FV V++DDILI+S++  EH +HL  VLE L+   L  K  KC+F   +  FLG+ +  + ++    K  A+ 
Subjt:  VVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEGVSVDPAKIEAVT

Query:  NCLDRPQLVK
        +    P+ VK
Subjt:  NCLDRPQLVK

Q99315 Transposon Ty3-G Gag-Pol polyprotein5.6e-4836.77Show/hide
Query:  KASKLLSQGTWGILASVVDIREPEVSLSSEP---------VVREYPDVFPDELPGLLPPREVDF-------AIELEPGTAPISRAPYRMAPAELKELKVQ
        +AS L   G +  + S +   EP  +  S           + ++Y ++  ++    LPPR  D         IE++PG       PY +     +E+   
Subjt:  KASKLLSQGTWGILASVVDIREPEVSLSSEP---------VVREYPDVFPDELPGLLPPREVDF-------AIELEPGTAPISRAPYRMAPAELKELKVQ

Query:  LQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEF
        +Q+LLD  FI PS SP  +PV+ V KKDG+ RLC+DYR LNK T+ + +PLPRID+L  ++  A +F+ +DL SGYHQ+ +   D  KTAF +  G YE+
Subjt:  LQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEF

Query:  VVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEGVSVDPAKIEAVT
         VM FGL NAP+ F   M   F+D    FV V++DDILI+S++  EH +HL  VLE L+   L  K  KC+F   +  FLG+ +  + ++    K  A+ 
Subjt:  VVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEGVSVDPAKIEAVT

Query:  NCLDRPQLVK
        +    P+ VK
Subjt:  NCLDRPQLVK

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein1.2e-0544.23Show/hide
Query:  HLHQVLETLRANKLYAKFSKCEFWLRKVTFLG--HVVSSEGVSVDPAKIEAV
        HL  VL+    ++ YA   KC F   ++ +LG  H++S EGVS DPAK+EA+
Subjt:  HLHQVLETLRANKLYAKFSKCEFWLRKVTFLG--HVVSSEGVSVDPAKIEAV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGGCTAGTAAACTACTCAGCCAGGGTACTTGGGGCATCTTGGCAAGCGTAGTGGATATTAGAGAGCCAGAAGTTTCCCTATCTTCCGAACCAGTGGTAAGG
GAGTACCCTGACGTTTTCCCCGACGAACTCCCAGGACTTCTGCCTCCCAGGGAGGTAGACTTTGCCATCGAGTTAGAGCCGGGCACTGCCCCTATCTCGAGGGCC
CCTTACAGAATGGCCCCAGCCGAGCTAAAAGAGTTGAAGGTCCAGTTACAGGAGTTACTGGACAAAGGTTTCATCCGGCCCAGTGTGTCACCTTGGGGAGCCCCA
GTGTTGTTCGTGAAGAAGAAGGATGGGTCGATGCGCCTTTGTATTGACTACCGAGAGCTGAACAAGGTGACAGTTAAAAACCGCTACCCCTTGCCCAGGATTGAT
GACTTGTTCGATCAGTTGCAGGGAGCCACTGTCTTTTCCAAGATCGACCTGCGGTCAGGCTATCACCAGTTGAGGATTAGGGACGGTGACATTCCCAAGACGGCC
TTTCGTTCGAGGTACGGACATTACGAGTTCGTTGTGATGTCTTTCGGCTTGACTAACGCTCCTGCAGTGTTCATGGATTTGATGAACAGGGTGTTTAAGGACTTT
CTAGACTCGTTCGTCATAGTCTTCATTGATGACATCTTGATTTACTCTAAAACTGAGGCTGAGCACGAGGAGCACTTGCACCAGGTTTTGGAGACTCTTCGAGCC
AACAAGTTGTATGCCAAGTTCTCCAAGTGTGAATTCTGGTTAAGGAAGGTGACGTTCCTTGGCCACGTGGTTTCCAGTGAAGGAGTTTCTGTGGATCCAGCAAAG
ATTGAAGCTGTGACCAACTGCCTCGACCGTCCACAGTTAGTGAAATTCGAAGTTTTCTGGGCTTGGCAGGTTCTACAGGAGGTTCGTGGAAGACTTCTCACGTAT
AGCCAGCCCGTTGACCCAGTTGACCAGGAAGGGAACCCCTTTTATCTGGAGCCCAGCATGTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGGCTAGTAAACTACTCAGCCAGGGTACTTGGGGCATCTTGGCAAGCGTAGTGGATATTAGAGAGCCAGAAGTTTCCCTATCTTCCGAACCAGTGGTAAGG
GAGTACCCTGACGTTTTCCCCGACGAACTCCCAGGACTTCTGCCTCCCAGGGAGGTAGACTTTGCCATCGAGTTAGAGCCGGGCACTGCCCCTATCTCGAGGGCC
CCTTACAGAATGGCCCCAGCCGAGCTAAAAGAGTTGAAGGTCCAGTTACAGGAGTTACTGGACAAAGGTTTCATCCGGCCCAGTGTGTCACCTTGGGGAGCCCCA
GTGTTGTTCGTGAAGAAGAAGGATGGGTCGATGCGCCTTTGTATTGACTACCGAGAGCTGAACAAGGTGACAGTTAAAAACCGCTACCCCTTGCCCAGGATTGAT
GACTTGTTCGATCAGTTGCAGGGAGCCACTGTCTTTTCCAAGATCGACCTGCGGTCAGGCTATCACCAGTTGAGGATTAGGGACGGTGACATTCCCAAGACGGCC
TTTCGTTCGAGGTACGGACATTACGAGTTCGTTGTGATGTCTTTCGGCTTGACTAACGCTCCTGCAGTGTTCATGGATTTGATGAACAGGGTGTTTAAGGACTTT
CTAGACTCGTTCGTCATAGTCTTCATTGATGACATCTTGATTTACTCTAAAACTGAGGCTGAGCACGAGGAGCACTTGCACCAGGTTTTGGAGACTCTTCGAGCC
AACAAGTTGTATGCCAAGTTCTCCAAGTGTGAATTCTGGTTAAGGAAGGTGACGTTCCTTGGCCACGTGGTTTCCAGTGAAGGAGTTTCTGTGGATCCAGCAAAG
ATTGAAGCTGTGACCAACTGCCTCGACCGTCCACAGTTAGTGAAATTCGAAGTTTTCTGGGCTTGGCAGGTTCTACAGGAGGTTCGTGGAAGACTTCTCACGTAT
AGCCAGCCCGTTGACCCAGTTGACCAGGAAGGGAACCCCTTTTATCTGGAGCCCAGCATGTGA
Protein sequenceShow/hide protein sequence
MKASKLLSQGTWGILASVVDIREPEVSLSSEPVVREYPDVFPDELPGLLPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAP
VLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDF
LDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEGVSVDPAKIEAVTNCLDRPQLVKFEVFWAWQVLQEVRGRLLTY
SQPVDPVDQEGNPFYLEPSM