CuGenDBv2

Cmc07g0193721 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc07g0193721
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Reverse transcriptase
Genome location: CMiso1.1chr07:15687749..15688774
RNA-Seq Expression: Cmc07g0193721
Synteny: Cmc07g0193721
Gene Ontology terms:
GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0016020 - membrane (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domains:
IPR000477 - Reverse transcriptase domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAA0047194.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]; e value 9.5e-164; % identity 87.35
Query:  MMDTREVDVSLSSEPVVRDYPDVFLEELSELPPHREIEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMR
        ++DTREVDVSLSSEPVVRDYPDVF EEL  LPPHRE+EFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMR
Subjt:  MMDTREVDVSLSSEPVVRDYPDVFLEELSELPPHREIEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMR

Query:  LCIDYRELNKVTVNNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFMIV
        LCIDYRELNKVTV NRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTF+IV
Subjt:  LCIDYRELNKVTVNNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFMIV

Query:  FIDDILIYSKTEAEHEEHLCMVLHTLRDNKLYAKFSK------------------------SKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFFRIA
        FIDDILIYSK EAEHEEHL MVL TLRDNKLYAKF K                        +KIEAVT W RPSTVSEVRSFLGLAGYYRRFVENF RIA
Subjt:  FIDDILIYSKTEAEHEEHLCMVLHTLRDNKLYAKFSK------------------------SKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFFRIA

Query:  TPLTQLTRKGAPFVWSKPCEDSFQNLKQKLATAPVLTVPH
        TPLTQLTRKGAPFVWSK CEDSFQNLKQKL TAPVLTVP+
Subjt:  TPLTQLTRKGAPFVWSKPCEDSFQNLKQKLATAPVLTVPH

KAA0047433.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]; e value 3.3e-164; % identity 87.91
Query:  MMDTREVDVSLSSEPVVRDYPDVFLEELSELPPHREIEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMR
        ++DTREVDVSLSSEPVVRDYPDVF EEL  LPPHRE+EFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMR
Subjt:  MMDTREVDVSLSSEPVVRDYPDVFLEELSELPPHREIEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMR

Query:  LCIDYRELNKVTVNNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFMIV
        LCIDYRELNKVTV NRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTF+IV
Subjt:  LCIDYRELNKVTVNNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFMIV

Query:  FIDDILIYSKTEAEHEEHLCMVLHTLRDNKLYAKFSK------------------------SKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFFRIA
        FIDDILIYSKTEAEHEEHL +VL TLRDNKLYAKFSK                        +KIEAVT W RPSTVSEVRSFLGLAGYYRRFVENF RIA
Subjt:  FIDDILIYSKTEAEHEEHLCMVLHTLRDNKLYAKFSK------------------------SKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFFRIA

Query:  TPLTQLTRKGAPFVWSKPCEDSFQNLKQKLATAPVLTVP
        TPLTQLTRKGAPFVWSK CEDSFQNLKQKL TAPVLTVP
Subjt:  TPLTQLTRKGAPFVWSKPCEDSFQNLKQKLATAPVLTVP

KAA0051051.1 reverse transcriptase [Cucumis melo var. makuwa]; e value 9.5e-164; % identity 87.61
Query:  MMDTREVDVSLSSEPVVRDYPDVFLEELSELPPHREIEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMR
        ++DTREVDVSLSSEPVVRDYPDVF EEL  LPPHRE+EFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMR
Subjt:  MMDTREVDVSLSSEPVVRDYPDVFLEELSELPPHREIEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMR

Query:  LCIDYRELNKVTVNNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFMIV
        LCIDYRELNKVTV NRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIK+GDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTF+IV
Subjt:  LCIDYRELNKVTVNNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFMIV

Query:  FIDDILIYSKTEAEHEEHLCMVLHTLRDNKLYAKFSK------------------------SKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFFRIA
        FIDDILIYSKTEAEHEEHL +VL TLRDNKLYAKFSK                        +KIEAVT W RPSTVSEVRSFLGLAGYYRRFVENF RIA
Subjt:  FIDDILIYSKTEAEHEEHLCMVLHTLRDNKLYAKFSK------------------------SKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFFRIA

Query:  TPLTQLTRKGAPFVWSKPCEDSFQNLKQKLATAPVLTVP
        TPLTQLTRKGAPFVWSK CEDSFQNLKQKL TAPVLTVP
Subjt:  TPLTQLTRKGAPFVWSKPCEDSFQNLKQKLATAPVLTVP

KAA0053368.1 pol protein [Cucumis melo var. makuwa]; e value 1.6e-163; % identity 87.32
Query:  MMDTREVDVSLSSEPVVRDYPDVFLEELSELPPHREIEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMR
        ++DTREVDVSLSSEPVVRDYPDVF EEL  LPPHRE+EFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMR
Subjt:  MMDTREVDVSLSSEPVVRDYPDVFLEELSELPPHREIEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMR

Query:  LCIDYRELNKVTVNNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFMIV
        LCIDYRELNKVTV NRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFM+LMNRVFREFLDTF+IV
Subjt:  LCIDYRELNKVTVNNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFMIV

Query:  FIDDILIYSKTEAEHEEHLCMVLHTLRDNKLYAKFSK------------------------SKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFFRIA
        FIDDILIYSKTEAEHEEHL +VL TLRDNKLYAKFSK                        +KIEAVT W RPSTVSEVRSFLGLAGYYRRFVENF RIA
Subjt:  FIDDILIYSKTEAEHEEHLCMVLHTLRDNKLYAKFSK------------------------SKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFFRIA

Query:  TPLTQLTRKGAPFVWSKPCEDSFQNLKQKLATAPVLTVP
        TPLTQLTRKGAPFVWSK CEDSFQNLKQKL TAP+LTVP
Subjt:  TPLTQLTRKGAPFVWSKPCEDSFQNLKQKLATAPVLTVP

KAA0059062.1 pol protein [Cucumis melo var. makuwa]; e value 6.6e-165; % identity 93.33
Query:  MMDTREVDVSLSSEPVVRDYPDVFLEELSELPPHREIEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMR
        ++D RE DVSLSSEPVVRDYPDVF EEL  LPPHRE+EFAIELE GTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMR
Subjt:  MMDTREVDVSLSSEPVVRDYPDVFLEELSELPPHREIEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMR

Query:  LCIDYRELNKVTVNNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFMIV
        LCIDYRELNKVTV NRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKD DVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTF+IV
Subjt:  LCIDYRELNKVTVNNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFMIV

Query:  FIDDILIYSKTEAEHEEHLCMVLHTLRDNKLYAKFSKSKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFFRIATPLTQLTRKGAPFVWSKPCEDSFQ
        FIDDILIYSKTEAEHE+HL MVL TLRDNKLYAKFSKSKIEAVT W  PSTVSEVRSFLGLAGYYRRFVENF RIATPLTQLTRKGAPFVWSK CEDSFQ
Subjt:  FIDDILIYSKTEAEHEEHLCMVLHTLRDNKLYAKFSKSKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFFRIATPLTQLTRKGAPFVWSKPCEDSFQ

Query:  NLKQKLATAPVLTVP
        NLKQKL TAPVLTVP
Subjt:  NLKQKLATAPVLTVP

TrEMBL top hits (e value, % identity, alignment)
A0A5A7TV57 Ty3-gypsy retrotransposon protein; e value 1.6e-164; % identity 87.91
Query:  MMDTREVDVSLSSEPVVRDYPDVFLEELSELPPHREIEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMR
        ++DTREVDVSLSSEPVVRDYPDVF EEL  LPPHRE+EFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMR
Subjt:  MMDTREVDVSLSSEPVVRDYPDVFLEELSELPPHREIEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMR

Query:  LCIDYRELNKVTVNNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFMIV
        LCIDYRELNKVTV NRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTF+IV
Subjt:  LCIDYRELNKVTVNNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFMIV

Query:  FIDDILIYSKTEAEHEEHLCMVLHTLRDNKLYAKFSK------------------------SKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFFRIA
        FIDDILIYSKTEAEHEEHL +VL TLRDNKLYAKFSK                        +KIEAVT W RPSTVSEVRSFLGLAGYYRRFVENF RIA
Subjt:  FIDDILIYSKTEAEHEEHLCMVLHTLRDNKLYAKFSK------------------------SKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFFRIA

Query:  TPLTQLTRKGAPFVWSKPCEDSFQNLKQKLATAPVLTVP
        TPLTQLTRKGAPFVWSK CEDSFQNLKQKL TAPVLTVP
Subjt:  TPLTQLTRKGAPFVWSKPCEDSFQNLKQKLATAPVLTVP

A0A5A7TW65 DNA/RNA polymerases superfamily protein; e value 4.6e-164; % identity 87.35
Query:  MMDTREVDVSLSSEPVVRDYPDVFLEELSELPPHREIEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMR
        ++DTREVDVSLSSEPVVRDYPDVF EEL  LPPHRE+EFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMR
Subjt:  MMDTREVDVSLSSEPVVRDYPDVFLEELSELPPHREIEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMR

Query:  LCIDYRELNKVTVNNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFMIV
        LCIDYRELNKVTV NRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTF+IV
Subjt:  LCIDYRELNKVTVNNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFMIV

Query:  FIDDILIYSKTEAEHEEHLCMVLHTLRDNKLYAKFSK------------------------SKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFFRIA
        FIDDILIYSK EAEHEEHL MVL TLRDNKLYAKF K                        +KIEAVT W RPSTVSEVRSFLGLAGYYRRFVENF RIA
Subjt:  FIDDILIYSKTEAEHEEHLCMVLHTLRDNKLYAKFSK------------------------SKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFFRIA

Query:  TPLTQLTRKGAPFVWSKPCEDSFQNLKQKLATAPVLTVPH
        TPLTQLTRKGAPFVWSK CEDSFQNLKQKL TAPVLTVP+
Subjt:  TPLTQLTRKGAPFVWSKPCEDSFQNLKQKLATAPVLTVPH

A0A5A7UC07 Reverse transcriptase; e value 4.6e-164; % identity 87.61
Query:  MMDTREVDVSLSSEPVVRDYPDVFLEELSELPPHREIEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMR
        ++DTREVDVSLSSEPVVRDYPDVF EEL  LPPHRE+EFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMR
Subjt:  MMDTREVDVSLSSEPVVRDYPDVFLEELSELPPHREIEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMR

Query:  LCIDYRELNKVTVNNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFMIV
        LCIDYRELNKVTV NRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIK+GDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTF+IV
Subjt:  LCIDYRELNKVTVNNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFMIV

Query:  FIDDILIYSKTEAEHEEHLCMVLHTLRDNKLYAKFSK------------------------SKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFFRIA
        FIDDILIYSKTEAEHEEHL +VL TLRDNKLYAKFSK                        +KIEAVT W RPSTVSEVRSFLGLAGYYRRFVENF RIA
Subjt:  FIDDILIYSKTEAEHEEHLCMVLHTLRDNKLYAKFSK------------------------SKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFFRIA

Query:  TPLTQLTRKGAPFVWSKPCEDSFQNLKQKLATAPVLTVP
        TPLTQLTRKGAPFVWSK CEDSFQNLKQKL TAPVLTVP
Subjt:  TPLTQLTRKGAPFVWSKPCEDSFQNLKQKLATAPVLTVP

A0A5A7UE75 Reverse transcriptase; e value 7.9e-164; % identity 87.32
Query:  MMDTREVDVSLSSEPVVRDYPDVFLEELSELPPHREIEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMR
        ++DTREVDVSLSSEPVVRDYPDVF EEL  LPPHRE+EFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMR
Subjt:  MMDTREVDVSLSSEPVVRDYPDVFLEELSELPPHREIEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMR

Query:  LCIDYRELNKVTVNNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFMIV
        LCIDYRELNKVTV NRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFM+LMNRVFREFLDTF+IV
Subjt:  LCIDYRELNKVTVNNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFMIV

Query:  FIDDILIYSKTEAEHEEHLCMVLHTLRDNKLYAKFSK------------------------SKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFFRIA
        FIDDILIYSKTEAEHEEHL +VL TLRDNKLYAKFSK                        +KIEAVT W RPSTVSEVRSFLGLAGYYRRFVENF RIA
Subjt:  FIDDILIYSKTEAEHEEHLCMVLHTLRDNKLYAKFSK------------------------SKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFFRIA

Query:  TPLTQLTRKGAPFVWSKPCEDSFQNLKQKLATAPVLTVP
        TPLTQLTRKGAPFVWSK CEDSFQNLKQKL TAP+LTVP
Subjt:  TPLTQLTRKGAPFVWSKPCEDSFQNLKQKLATAPVLTVP

A0A5A7UT37 Reverse transcriptase; e value 3.2e-165; % identity 93.33
Query:  MMDTREVDVSLSSEPVVRDYPDVFLEELSELPPHREIEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMR
        ++D RE DVSLSSEPVVRDYPDVF EEL  LPPHRE+EFAIELE GTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMR
Subjt:  MMDTREVDVSLSSEPVVRDYPDVFLEELSELPPHREIEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMR

Query:  LCIDYRELNKVTVNNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFMIV
        LCIDYRELNKVTV NRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKD DVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTF+IV
Subjt:  LCIDYRELNKVTVNNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFMIV

Query:  FIDDILIYSKTEAEHEEHLCMVLHTLRDNKLYAKFSKSKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFFRIATPLTQLTRKGAPFVWSKPCEDSFQ
        FIDDILIYSKTEAEHE+HL MVL TLRDNKLYAKFSKSKIEAVT W  PSTVSEVRSFLGLAGYYRRFVENF RIATPLTQLTRKGAPFVWSK CEDSFQ
Subjt:  FIDDILIYSKTEAEHEEHLCMVLHTLRDNKLYAKFSKSKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFFRIATPLTQLTRKGAPFVWSKPCEDSFQ

Query:  NLKQKLATAPVLTVP
        NLKQKL TAPVLTVP
Subjt:  NLKQKLATAPVLTVP

SwissProt top hits (e value, % identity, alignment)
P0CT34 Transposon Tf2-1 polyprotein; e value 1.9e-50; % identity 32.82
Query:  VVRDYPDVFLEELSE-LP-PHREIEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTV
        + +++ D+  E  +E LP P + +EF +EL      +    Y + P +++ +  ++ + L  G IR S +    PV+FV KK+G++R+ +DY+ LNK   
Subjt:  VVRDYPDVFLEELSE-LP-PHREIEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTV

Query:  NNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFMIVFIDDILIYSKTEA
         N YPLP I+ L  ++QG+T+F+K+DL+S YH +R++ GD  K AFR   G +E++VM +G++ APA F   +N +  E  ++ ++ ++DDILI+SK+E+
Subjt:  NNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFMIVFIDDILIYSKTEA

Query:  EHEEHLCMVLHTLRD-----NKLYAKFSKSK-------------------IEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFFRIATPLTQLTRKGAPF
        EH +H+  VL  L++     N+   +F +S+                   I+ V  W +P    E+R FLG   Y R+F+    ++  PL  L +K   +
Subjt:  EHEEHLCMVLHTLRD-----NKLYAKFSKSK-------------------IEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFFRIATPLTQLTRKGAPF

Query:  VWSKPCEDSFQNLKQKLATAPVL
         W+     + +N+KQ L + PVL
Subjt:  VWSKPCEDSFQNLKQKLATAPVL

P0CT35 Transposon Tf2-2 polyprotein; e value 1.9e-50; % identity 32.82
Query:  VVRDYPDVFLEELSE-LP-PHREIEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTV
        + +++ D+  E  +E LP P + +EF +EL      +    Y + P +++ +  ++ + L  G IR S +    PV+FV KK+G++R+ +DY+ LNK   
Subjt:  VVRDYPDVFLEELSE-LP-PHREIEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTV

Query:  NNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFMIVFIDDILIYSKTEA
         N YPLP I+ L  ++QG+T+F+K+DL+S YH +R++ GD  K AFR   G +E++VM +G++ APA F   +N +  E  ++ ++ ++DDILI+SK+E+
Subjt:  NNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFMIVFIDDILIYSKTEA

Query:  EHEEHLCMVLHTLRD-----NKLYAKFSKSK-------------------IEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFFRIATPLTQLTRKGAPF
        EH +H+  VL  L++     N+   +F +S+                   I+ V  W +P    E+R FLG   Y R+F+    ++  PL  L +K   +
Subjt:  EHEEHLCMVLHTLRD-----NKLYAKFSKSK-------------------IEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFFRIATPLTQLTRKGAPF

Query:  VWSKPCEDSFQNLKQKLATAPVL
         W+     + +N+KQ L + PVL
Subjt:  VWSKPCEDSFQNLKQKLATAPVL

P0CT41 Transposon Tf2-12 polyprotein; e value 1.9e-50; % identity 32.82
Query:  VVRDYPDVFLEELSE-LP-PHREIEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTV
        + +++ D+  E  +E LP P + +EF +EL      +    Y + P +++ +  ++ + L  G IR S +    PV+FV KK+G++R+ +DY+ LNK   
Subjt:  VVRDYPDVFLEELSE-LP-PHREIEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTV

Query:  NNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFMIVFIDDILIYSKTEA
         N YPLP I+ L  ++QG+T+F+K+DL+S YH +R++ GD  K AFR   G +E++VM +G++ APA F   +N +  E  ++ ++ ++DDILI+SK+E+
Subjt:  NNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFMIVFIDDILIYSKTEA

Query:  EHEEHLCMVLHTLRD-----NKLYAKFSKSK-------------------IEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFFRIATPLTQLTRKGAPF
        EH +H+  VL  L++     N+   +F +S+                   I+ V  W +P    E+R FLG   Y R+F+    ++  PL  L +K   +
Subjt:  EHEEHLCMVLHTLRD-----NKLYAKFSKSK-------------------IEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFFRIATPLTQLTRKGAPF

Query:  VWSKPCEDSFQNLKQKLATAPVL
         W+     + +N+KQ L + PVL
Subjt:  VWSKPCEDSFQNLKQKLATAPVL

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein; e value 1.2e-52; % identity 36.94
Query:  SELPP------HREIEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVNNRYPLPRI
        ++LPP      +  ++  IE++PG       PY +     +E+   +Q+LLD  FI PS SP  +PV+ V KKDG+ RLC+DYR LNK T+++ +PLPRI
Subjt:  SELPP------HREIEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVNNRYPLPRI

Query:  DDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFMIVFIDDILIYSKTEAEHEEHLCMV
        D+L  ++  A +F+ +DL SGYHQ+ ++  D  KTAF +  G YE+ VM FGL NAP+ F   M   FR+    F+ V++DDILI+S++  EH +HL  V
Subjt:  DDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFMIVFIDDILIYSKTEAEHEEHLCMV

Query:  LHTLRDNKLYAKFSKSKI------------------------EAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFFRIATPLTQLTRKGAPFVWSKPCEDS
        L  L++  L  K  K K                          A+  +P P TV + + FLG+  YYRRF+ N  +IA P+       +   W++  + +
Subjt:  LHTLRDNKLYAKFSKSKI------------------------EAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFFRIATPLTQLTRKGAPFVWSKPCEDS

Query:  FQNLKQKLATAPVL
         + LK  L  +PVL
Subjt:  FQNLKQKLATAPVL

Q99315 Transposon Ty3-G Gag-Pol polyprotein; e value 1.6e-52; % identity 36.94
Query:  SELPP------HREIEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVNNRYPLPRI
        ++LPP      +  ++  IE++PG       PY +     +E+   +Q+LLD  FI PS SP  +PV+ V KKDG+ RLC+DYR LNK T+++ +PLPRI
Subjt:  SELPP------HREIEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVNNRYPLPRI

Query:  DDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFMIVFIDDILIYSKTEAEHEEHLCMV
        D+L  ++  A +F+ +DL SGYHQ+ ++  D  KTAF +  G YE+ VM FGL NAP+ F   M   FR+    F+ V++DDILI+S++  EH +HL  V
Subjt:  DDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFMIVFIDDILIYSKTEAEHEEHLCMV

Query:  LHTLRDNKLYAKFSKSKI------------------------EAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFFRIATPLTQLTRKGAPFVWSKPCEDS
        L  L++  L  K  K K                          A+  +P P TV + + FLG+  YYRRF+ N  +IA P+       +   W++  + +
Subjt:  LHTLRDNKLYAKFSKSKI------------------------EAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFFRIATPLTQLTRKGAPFVWSKPCEDS

Query:  FQNLKQKLATAPVL
           LK  L  +PVL
Subjt:  FQNLKQKLATAPVL

Arabidopsis top hits (e value, % identity, alignment)
ATMG00860.1 DNA/RNA polymerases superfamily protein; e value 6.0e-15; % identity 44.87
Query:  SKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFFRIATPLTQLTRKGAPFVWSKPCEDSFQNLKQKLATAPVLTVP
        +K+EA+  WP P   +E+R FLGL GYYRRFV+N+ +I  PLT+L +K +   W++    +F+ LK  + T PVL +P
Subjt:  SKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFFRIATPLTQLTRKGAPFVWSKPCEDSFQNLKQKLATAPVLTVP


Sequences
CDS sequence
ATGATGGATACTAGAGAGGTTGATGTATCCCTGTCATCAGAACCAGTGGTAAGGGACTATCCGGATGTCTTTCTTGAAGAACTTTCAGAGTTACCTCCTCACAGA
GAGATTGAGTTTGCCATAGAGCTGGAGCCGGGCACGGTTCCTATATCCAGAGCCCCATACAGAATGGCCCCAGCGGAATTGAAAGAACTGAAAGTGCAGTTACAG
GAGTTGCTTGATAAGGGCTTCATTCGACCGAGTGTGTCACCTTGGGGTGCGCCAGTTTTATTTGTTAAGAAGAAGGATGGATCGATGCGCCTATGCATTGACTAT
AGGGAGTTGAATAAGGTAACCGTTAATAACAGATATCCCTTGCCCAGGATCGATGACCTGTTTGACCAGTTACAGGGAGCTACAGTGTTCTCTAAGATTGATCTT
CGGTCGGGATATCATCAGCTGAGGATTAAGGATGGTGATGTACCGAAGACGGCCTTTCGTTCCAGATACGGACACTATGAGTTTATTGTGATGTCTTTTGGTTTG
ACGAATGCTCCGGCGGTGTTTATGGACTTAATGAACAGAGTGTTTAGGGAGTTCCTAGACACTTTTATGATCGTGTTTATTGATGATATCTTGATATATTCTAAG
ACAGAGGCCGAGCATGAGGAGCATTTATGTATGGTTCTGCACACCCTTCGGGATAATAAATTGTATGCAAAGTTCTCGAAATCTAAGATAGAGGCAGTCACCAGT
TGGCCCCGACCTTCCACAGTTAGTGAGGTTCGTAGCTTTCTAGGTTTAGCAGGTTATTATCGACGGTTTGTGGAGAACTTTTTCCGTATAGCTACTCCTCTTACT
CAGTTGACCAGGAAGGGAGCTCCTTTTGTTTGGAGCAAGCCATGTGAGGATAGTTTCCAAAACCTTAAACAGAAGCTAGCTACTGCACCGGTTCTTACTGTACCT
CATAGTTAA
mRNA sequence
ATGATGGATACTAGAGAGGTTGATGTATCCCTGTCATCAGAACCAGTGGTAAGGGACTATCCGGATGTCTTTCTTGAAGAACTTTCAGAGTTACCTCCTCACAGA
GAGATTGAGTTTGCCATAGAGCTGGAGCCGGGCACGGTTCCTATATCCAGAGCCCCATACAGAATGGCCCCAGCGGAATTGAAAGAACTGAAAGTGCAGTTACAG
GAGTTGCTTGATAAGGGCTTCATTCGACCGAGTGTGTCACCTTGGGGTGCGCCAGTTTTATTTGTTAAGAAGAAGGATGGATCGATGCGCCTATGCATTGACTAT
AGGGAGTTGAATAAGGTAACCGTTAATAACAGATATCCCTTGCCCAGGATCGATGACCTGTTTGACCAGTTACAGGGAGCTACAGTGTTCTCTAAGATTGATCTT
CGGTCGGGATATCATCAGCTGAGGATTAAGGATGGTGATGTACCGAAGACGGCCTTTCGTTCCAGATACGGACACTATGAGTTTATTGTGATGTCTTTTGGTTTG
ACGAATGCTCCGGCGGTGTTTATGGACTTAATGAACAGAGTGTTTAGGGAGTTCCTAGACACTTTTATGATCGTGTTTATTGATGATATCTTGATATATTCTAAG
ACAGAGGCCGAGCATGAGGAGCATTTATGTATGGTTCTGCACACCCTTCGGGATAATAAATTGTATGCAAAGTTCTCGAAATCTAAGATAGAGGCAGTCACCAGT
TGGCCCCGACCTTCCACAGTTAGTGAGGTTCGTAGCTTTCTAGGTTTAGCAGGTTATTATCGACGGTTTGTGGAGAACTTTTTCCGTATAGCTACTCCTCTTACT
CAGTTGACCAGGAAGGGAGCTCCTTTTGTTTGGAGCAAGCCATGTGAGGATAGTTTCCAAAACCTTAAACAGAAGCTAGCTACTGCACCGGTTCTTACTGTACCT
CATAGTTAA
Protein sequence
MMDTREVDVSLSSEPVVRDYPDVFLEELSELPPHREIEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDY
RELNKVTVNNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFMIVFIDDILIYSK
TEAEHEEHLCMVLHTLRDNKLYAKFSKSKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFFRIATPLTQLTRKGAPFVWSKPCEDSFQNLKQKLATAPVLTVP
HS