CuGenDBv2

Cmc04g0096901 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc04g0096901
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Reverse transcriptase
Genome location: CMiso1.1chr04:11519194..11520243
RNA-Seq Expression: Cmc04g0096901
Synteny: Cmc04g0096901
Gene Ontology terms:
GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR000477 - Reverse transcriptase domain
IPR021109 - Aspartic peptidase domain superfamily
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily
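The genome location field above packs the sequence name and coordinates into one string. A minimal Python sketch of how it could be split and the span length computed is shown here. The helper names `parse_location` and `span_length` are hypothetical, and the sketch assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match["seqid"], int(match["start"]), int(match["end"])


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("CMiso1.1chr04:11519194..11520243")
print(seqid, span_length(start, end))  # prints: CMiso1.1chr04 1050
```

Under that assumption the span is 1050 bp, which is consistent with a 349-residue protein plus a stop codon.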


Homology
GenBank top hits    e value    % identity    Alignment
KAA0025998.1 pol protein [Cucumis melo var. makuwa]    1.6e-175    89.68
Query:  MLDVTLLVLDMQDFDVILGMNWLSANHASIDCFRKEVVFNPPPGTSFKFKGAGIVCIPKVISAMKASKLLSQGSWSILASVVDTREPEVSLSSEPVVREY
        MLDVTLLVLDMQDFDVILGM+WLSANHA+IDCF KEVVFNPP G SFKF+GAG+VCIPKVISAMKASKLLSQG+W ILASVVD REPEVSLSSEPVVREY
Subjt:  MLDVTLLVLDMQDFDVILGMNWLSANHASIDCFRKEVVFNPPPGTSFKFKGAGIVCIPKVISAMKASKLLSQGSWSILASVVDTREPEVSLSSEPVVREY

Query:  PDVFPDELPGLPPPREIVFAIELELDTASISRAPYRMAPAELKELKVQLQKLLDKGFIRPNVSPWGAPVLFVKKKDGSMRLCIDYRKLNKMIVKNCYPLP
        PDVFPDELPGLPPPRE+ FAIELE  TA ISRAPYRMAPAELKELKVQLQ+LLDKGFIRP+VSPWGAPVLFVKKKDGSMRLCIDYR+LNK+ VKN YPLP
Subjt:  PDVFPDELPGLPPPREIVFAIELELDTASISRAPYRMAPAELKELKVQLQKLLDKGFIRPNVSPWGAPVLFVKKKDGSMRLCIDYRKLNKMIVKNCYPLP

Query:  NIDDLFDQLQGVTVFSKIDLRSDYHQLRIRDSDIPKTAFRSRYEHYEFIVMSFGLTNASAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLD
         IDDLFDQLQG TVFSKIDLRS YHQLRIRD DIPKTAFRSRY HYEF+VMSFGLTNA AVFMDLMNRVFKDFLD+FVIVFIDDILIYSKTEAEHEEHL 
Subjt:  NIDDLFDQLQGVTVFSKIDLRSDYHQLRIRDSDIPKTAFRSRYEHYEFIVMSFGLTNASAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLD

Query:  QVLETLRANKLHAKFSKCEFWLKKVIFLGHVVSSEGVSVDPTKIEALTS
        QVLETLRANKL+AKFSKCEFWL+KV FLGHVVSSEGVSVDP KIEA+T+
Subjt:  QVLETLRANKLHAKFSKCEFWLKKVIFLGHVVSSEGVSVDPTKIEALTS

KAA0033486.1 reverse transcriptase [Cucumis melo var. makuwa]    5.4e-176    89.4
Query:  MLDVTLLVLDMQDFDVILGMNWLSANHASIDCFRKEVVFNPPPGTSFKFKGAGIVCIPKVISAMKASKLLSQGSWSILASVVDTREPEVSLSSEPVVREY
        MLDVTLLVLDMQDFD+ILGM+WLSANHA+IDCF KEVVFNPP G SFKF+GAG+VCIPKVISAMKASKLLSQG+W ILASVVD REPEVSLSSEPVVREY
Subjt:  MLDVTLLVLDMQDFDVILGMNWLSANHASIDCFRKEVVFNPPPGTSFKFKGAGIVCIPKVISAMKASKLLSQGSWSILASVVDTREPEVSLSSEPVVREY

Query:  PDVFPDELPGLPPPREIVFAIELELDTASISRAPYRMAPAELKELKVQLQKLLDKGFIRPNVSPWGAPVLFVKKKDGSMRLCIDYRKLNKMIVKNCYPLP
        PDVFPDELPGLPPPRE+ FAIELE  TA ISRAPY+MAPAELKELKVQLQ+LLDKGFIRP+VSPWGAPVLFVKKKDGSMRLCIDYR+LNK+ VKNCYPLP
Subjt:  PDVFPDELPGLPPPREIVFAIELELDTASISRAPYRMAPAELKELKVQLQKLLDKGFIRPNVSPWGAPVLFVKKKDGSMRLCIDYRKLNKMIVKNCYPLP

Query:  NIDDLFDQLQGVTVFSKIDLRSDYHQLRIRDSDIPKTAFRSRYEHYEFIVMSFGLTNASAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLD
         IDDLFDQLQG TVFSKIDLRS YHQLRIRDSDIPKTAFRSRY HYEF+VMSFGLTNA AVFMDLMNRVFKDFLD+FVIVFIDDILIYSKTEAEHEEHL 
Subjt:  NIDDLFDQLQGVTVFSKIDLRSDYHQLRIRDSDIPKTAFRSRYEHYEFIVMSFGLTNASAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLD

Query:  QVLETLRANKLHAKFSKCEFWLKKVIFLGHVVSSEGVSVDPTKIEALTS
        QVLETLRANKL+AKFSKCEFWL+KV FLGHVVS EGVSVDP KIEA+T+
Subjt:  QVLETLRANKLHAKFSKCEFWLKKVIFLGHVVSSEGVSVDPTKIEALTS

KAA0040693.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]    5.4e-176    89.11
Query:  MLDVTLLVLDMQDFDVILGMNWLSANHASIDCFRKEVVFNPPPGTSFKFKGAGIVCIPKVISAMKASKLLSQGSWSILASVVDTREPEVSLSSEPVVREY
        MLDVTLLVLDMQDFDVILGM+WLSANHA+IDCF KEVVFNPP G  FKF+GAG+VCIPKVISAMKASKLLSQG+W ILASVVD REPEVSLSSEPVVREY
Subjt:  MLDVTLLVLDMQDFDVILGMNWLSANHASIDCFRKEVVFNPPPGTSFKFKGAGIVCIPKVISAMKASKLLSQGSWSILASVVDTREPEVSLSSEPVVREY

Query:  PDVFPDELPGLPPPREIVFAIELELDTASISRAPYRMAPAELKELKVQLQKLLDKGFIRPNVSPWGAPVLFVKKKDGSMRLCIDYRKLNKMIVKNCYPLP
        PDVFPDELPGLPPPRE+ FAIELE  TA ISRAPYRMAPAELKELKVQLQ+LLDKGFIRP+VSPWGAPVLFVKKKDGSMRLCIDYR+LNK+ VKNCYPLP
Subjt:  PDVFPDELPGLPPPREIVFAIELELDTASISRAPYRMAPAELKELKVQLQKLLDKGFIRPNVSPWGAPVLFVKKKDGSMRLCIDYRKLNKMIVKNCYPLP

Query:  NIDDLFDQLQGVTVFSKIDLRSDYHQLRIRDSDIPKTAFRSRYEHYEFIVMSFGLTNASAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLD
         IDDLFDQLQG TVFSKIDLRS YHQLRIRD DIPKTAFRSRY HYEF+VMSFGLTNA AVFMDLMNRVFKDFLD+FVIVFIDDI++YSKTE EHEEHL 
Subjt:  NIDDLFDQLQGVTVFSKIDLRSDYHQLRIRDSDIPKTAFRSRYEHYEFIVMSFGLTNASAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLD

Query:  QVLETLRANKLHAKFSKCEFWLKKVIFLGHVVSSEGVSVDPTKIEALTS
        QVLETLRANKL+AKFSKCEFWL+KV FLGHVVSSEGVSVDPTKIEA+T+
Subjt:  QVLETLRANKLHAKFSKCEFWLKKVIFLGHVVSSEGVSVDPTKIEALTS

KAA0060745.1 pol protein [Cucumis melo var. makuwa]    2.0e-175    89.68
Query:  MLDVTLLVLDMQDFDVILGMNWLSANHASIDCFRKEVVFNPPPGTSFKFKGAGIVCIPKVISAMKASKLLSQGSWSILASVVDTREPEVSLSSEPVVREY
        MLDVTLLVLDMQDFDVILGM+WLSANHA+IDCF KEVVFNPP G SFKF+GAG+VCIPKVISAMKASKLLSQG+W ILASVVD REPEVSLSSEPVVREY
Subjt:  MLDVTLLVLDMQDFDVILGMNWLSANHASIDCFRKEVVFNPPPGTSFKFKGAGIVCIPKVISAMKASKLLSQGSWSILASVVDTREPEVSLSSEPVVREY

Query:  PDVFPDELPGLPPPREIVFAIELELDTASISRAPYRMAPAELKELKVQLQKLLDKGFIRPNVSPWGAPVLFVKKKDGSMRLCIDYRKLNKMIVKNCYPLP
        PDVFPDELPGLPPPRE+ FAIELE  TA ISRAPYRMAPAELKELKVQLQ+LLDKGFIRP+VSPWGAPVLFVKKKDGSMRLCIDYR+LNK+ VKN YPLP
Subjt:  PDVFPDELPGLPPPREIVFAIELELDTASISRAPYRMAPAELKELKVQLQKLLDKGFIRPNVSPWGAPVLFVKKKDGSMRLCIDYRKLNKMIVKNCYPLP

Query:  NIDDLFDQLQGVTVFSKIDLRSDYHQLRIRDSDIPKTAFRSRYEHYEFIVMSFGLTNASAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLD
         IDDLFDQLQG TVFSKIDLRS YHQLRIRD DIPKTAFRSRY HYEF+VMSFGLTNA AVFMDLMNRVFKDFLD+FVIVFIDDILIYSKTEAEHEEHL 
Subjt:  NIDDLFDQLQGVTVFSKIDLRSDYHQLRIRDSDIPKTAFRSRYEHYEFIVMSFGLTNASAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLD

Query:  QVLETLRANKLHAKFSKCEFWLKKVIFLGHVVSSEGVSVDPTKIEALTS
        QVLETLRANKL+AKFSKCEFWL+KV FLGHVVSSEGVSVDP KIEA+T+
Subjt:  QVLETLRANKLHAKFSKCEFWLKKVIFLGHVVSSEGVSVDPTKIEALTS

KAA0066258.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]    8.8e-179    96.11
Query:  MLDVTLLVLDMQDFDVILGMNWLSANHASIDCFRKEVVFNPPPGTSFKFKGAGIVCIPKVISAMKASKLLSQGSWSILASVVDTREPEVSLSSEPVVREY
        MLDVTLLVLDMQDFDVILGMNWLSANHASIDCFRKEVVFNPP GTSFKFKGAGIVCIPKVISAMKASKLLSQGSWSILASVVDTREPEVSLSSEPVVREY
Subjt:  MLDVTLLVLDMQDFDVILGMNWLSANHASIDCFRKEVVFNPPPGTSFKFKGAGIVCIPKVISAMKASKLLSQGSWSILASVVDTREPEVSLSSEPVVREY

Query:  PDVFPDELPGLPPPREIVFAIELELDTASISRAPYRMAPAELKELKVQLQKLLDKGFIRPNVSPWGAPVLFVKKKDGSMRLCIDYRKLNKMIVKNCYPLP
        PDVFPDELPG+PPPREIVFAIELE DTA ISRA YRMAPAELKELKVQLQ+LLDKGFIRPNVSPWGAPVLFVKKKDGSMRLCI Y+KLNKM VKN YPLP
Subjt:  PDVFPDELPGLPPPREIVFAIELELDTASISRAPYRMAPAELKELKVQLQKLLDKGFIRPNVSPWGAPVLFVKKKDGSMRLCIDYRKLNKMIVKNCYPLP

Query:  NIDDLFDQLQGVTVFSKIDLRSDYHQLRIRDSDIPKTAFRSRYEHYEFIVMSFGLTNASAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLD
         IDDLFDQLQG TVFSKIDLRSDYHQLRIRDSDIPKT FRSRYEHYEFIVMSFGLTNASAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLD
Subjt:  NIDDLFDQLQGVTVFSKIDLRSDYHQLRIRDSDIPKTAFRSRYEHYEFIVMSFGLTNASAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLD

Query:  QVLETLRANKLHAKFSKCEFWLKKVIFLGHVVSS
        QVLETLRANKLHAKFSKCEFWLKKVIFLGHVVSS
Subjt:  QVLETLRANKLHAKFSKCEFWLKKVIFLGHVVSS

TrEMBL top hits    e value    % identity    Alignment
A0A5A7SIJ5 Reverse transcriptase    7.6e-176    89.68
Query:  MLDVTLLVLDMQDFDVILGMNWLSANHASIDCFRKEVVFNPPPGTSFKFKGAGIVCIPKVISAMKASKLLSQGSWSILASVVDTREPEVSLSSEPVVREY
        MLDVTLLVLDMQDFDVILGM+WLSANHA+IDCF KEVVFNPP G SFKF+GAG+VCIPKVISAMKASKLLSQG+W ILASVVD REPEVSLSSEPVVREY
Subjt:  MLDVTLLVLDMQDFDVILGMNWLSANHASIDCFRKEVVFNPPPGTSFKFKGAGIVCIPKVISAMKASKLLSQGSWSILASVVDTREPEVSLSSEPVVREY

Query:  PDVFPDELPGLPPPREIVFAIELELDTASISRAPYRMAPAELKELKVQLQKLLDKGFIRPNVSPWGAPVLFVKKKDGSMRLCIDYRKLNKMIVKNCYPLP
        PDVFPDELPGLPPPRE+ FAIELE  TA ISRAPYRMAPAELKELKVQLQ+LLDKGFIRP+VSPWGAPVLFVKKKDGSMRLCIDYR+LNK+ VKN YPLP
Subjt:  PDVFPDELPGLPPPREIVFAIELELDTASISRAPYRMAPAELKELKVQLQKLLDKGFIRPNVSPWGAPVLFVKKKDGSMRLCIDYRKLNKMIVKNCYPLP

Query:  NIDDLFDQLQGVTVFSKIDLRSDYHQLRIRDSDIPKTAFRSRYEHYEFIVMSFGLTNASAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLD
         IDDLFDQLQG TVFSKIDLRS YHQLRIRD DIPKTAFRSRY HYEF+VMSFGLTNA AVFMDLMNRVFKDFLD+FVIVFIDDILIYSKTEAEHEEHL 
Subjt:  NIDDLFDQLQGVTVFSKIDLRSDYHQLRIRDSDIPKTAFRSRYEHYEFIVMSFGLTNASAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLD

Query:  QVLETLRANKLHAKFSKCEFWLKKVIFLGHVVSSEGVSVDPTKIEALTS
        QVLETLRANKL+AKFSKCEFWL+KV FLGHVVSSEGVSVDP KIEA+T+
Subjt:  QVLETLRANKLHAKFSKCEFWLKKVIFLGHVVSSEGVSVDPTKIEALTS

A0A5A7SU25 Reverse transcriptase    2.6e-176    89.4
Query:  MLDVTLLVLDMQDFDVILGMNWLSANHASIDCFRKEVVFNPPPGTSFKFKGAGIVCIPKVISAMKASKLLSQGSWSILASVVDTREPEVSLSSEPVVREY
        MLDVTLLVLDMQDFD+ILGM+WLSANHA+IDCF KEVVFNPP G SFKF+GAG+VCIPKVISAMKASKLLSQG+W ILASVVD REPEVSLSSEPVVREY
Subjt:  MLDVTLLVLDMQDFDVILGMNWLSANHASIDCFRKEVVFNPPPGTSFKFKGAGIVCIPKVISAMKASKLLSQGSWSILASVVDTREPEVSLSSEPVVREY

Query:  PDVFPDELPGLPPPREIVFAIELELDTASISRAPYRMAPAELKELKVQLQKLLDKGFIRPNVSPWGAPVLFVKKKDGSMRLCIDYRKLNKMIVKNCYPLP
        PDVFPDELPGLPPPRE+ FAIELE  TA ISRAPY+MAPAELKELKVQLQ+LLDKGFIRP+VSPWGAPVLFVKKKDGSMRLCIDYR+LNK+ VKNCYPLP
Subjt:  PDVFPDELPGLPPPREIVFAIELELDTASISRAPYRMAPAELKELKVQLQKLLDKGFIRPNVSPWGAPVLFVKKKDGSMRLCIDYRKLNKMIVKNCYPLP

Query:  NIDDLFDQLQGVTVFSKIDLRSDYHQLRIRDSDIPKTAFRSRYEHYEFIVMSFGLTNASAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLD
         IDDLFDQLQG TVFSKIDLRS YHQLRIRDSDIPKTAFRSRY HYEF+VMSFGLTNA AVFMDLMNRVFKDFLD+FVIVFIDDILIYSKTEAEHEEHL 
Subjt:  NIDDLFDQLQGVTVFSKIDLRSDYHQLRIRDSDIPKTAFRSRYEHYEFIVMSFGLTNASAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLD

Query:  QVLETLRANKLHAKFSKCEFWLKKVIFLGHVVSSEGVSVDPTKIEALTS
        QVLETLRANKL+AKFSKCEFWL+KV FLGHVVS EGVSVDP KIEA+T+
Subjt:  QVLETLRANKLHAKFSKCEFWLKKVIFLGHVVSSEGVSVDPTKIEALTS

A0A5A7TC72 Ty3-gypsy retrotransposon protein    2.6e-176    89.11
Query:  MLDVTLLVLDMQDFDVILGMNWLSANHASIDCFRKEVVFNPPPGTSFKFKGAGIVCIPKVISAMKASKLLSQGSWSILASVVDTREPEVSLSSEPVVREY
        MLDVTLLVLDMQDFDVILGM+WLSANHA+IDCF KEVVFNPP G  FKF+GAG+VCIPKVISAMKASKLLSQG+W ILASVVD REPEVSLSSEPVVREY
Subjt:  MLDVTLLVLDMQDFDVILGMNWLSANHASIDCFRKEVVFNPPPGTSFKFKGAGIVCIPKVISAMKASKLLSQGSWSILASVVDTREPEVSLSSEPVVREY

Query:  PDVFPDELPGLPPPREIVFAIELELDTASISRAPYRMAPAELKELKVQLQKLLDKGFIRPNVSPWGAPVLFVKKKDGSMRLCIDYRKLNKMIVKNCYPLP
        PDVFPDELPGLPPPRE+ FAIELE  TA ISRAPYRMAPAELKELKVQLQ+LLDKGFIRP+VSPWGAPVLFVKKKDGSMRLCIDYR+LNK+ VKNCYPLP
Subjt:  PDVFPDELPGLPPPREIVFAIELELDTASISRAPYRMAPAELKELKVQLQKLLDKGFIRPNVSPWGAPVLFVKKKDGSMRLCIDYRKLNKMIVKNCYPLP

Query:  NIDDLFDQLQGVTVFSKIDLRSDYHQLRIRDSDIPKTAFRSRYEHYEFIVMSFGLTNASAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLD
         IDDLFDQLQG TVFSKIDLRS YHQLRIRD DIPKTAFRSRY HYEF+VMSFGLTNA AVFMDLMNRVFKDFLD+FVIVFIDDI++YSKTE EHEEHL 
Subjt:  NIDDLFDQLQGVTVFSKIDLRSDYHQLRIRDSDIPKTAFRSRYEHYEFIVMSFGLTNASAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLD

Query:  QVLETLRANKLHAKFSKCEFWLKKVIFLGHVVSSEGVSVDPTKIEALTS
        QVLETLRANKL+AKFSKCEFWL+KV FLGHVVSSEGVSVDPTKIEA+T+
Subjt:  QVLETLRANKLHAKFSKCEFWLKKVIFLGHVVSSEGVSVDPTKIEALTS

A0A5A7V4E4 Reverse transcriptase    9.9e-176    89.68
Query:  MLDVTLLVLDMQDFDVILGMNWLSANHASIDCFRKEVVFNPPPGTSFKFKGAGIVCIPKVISAMKASKLLSQGSWSILASVVDTREPEVSLSSEPVVREY
        MLDVTLLVLDMQDFDVILGM+WLSANHA+IDCF KEVVFNPP G SFKF+GAG+VCIPKVISAMKASKLLSQG+W ILASVVD REPEVSLSSEPVVREY
Subjt:  MLDVTLLVLDMQDFDVILGMNWLSANHASIDCFRKEVVFNPPPGTSFKFKGAGIVCIPKVISAMKASKLLSQGSWSILASVVDTREPEVSLSSEPVVREY

Query:  PDVFPDELPGLPPPREIVFAIELELDTASISRAPYRMAPAELKELKVQLQKLLDKGFIRPNVSPWGAPVLFVKKKDGSMRLCIDYRKLNKMIVKNCYPLP
        PDVFPDELPGLPPPRE+ FAIELE  TA ISRAPYRMAPAELKELKVQLQ+LLDKGFIRP+VSPWGAPVLFVKKKDGSMRLCIDYR+LNK+ VKN YPLP
Subjt:  PDVFPDELPGLPPPREIVFAIELELDTASISRAPYRMAPAELKELKVQLQKLLDKGFIRPNVSPWGAPVLFVKKKDGSMRLCIDYRKLNKMIVKNCYPLP

Query:  NIDDLFDQLQGVTVFSKIDLRSDYHQLRIRDSDIPKTAFRSRYEHYEFIVMSFGLTNASAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLD
         IDDLFDQLQG TVFSKIDLRS YHQLRIRD DIPKTAFRSRY HYEF+VMSFGLTNA AVFMDLMNRVFKDFLD+FVIVFIDDILIYSKTEAEHEEHL 
Subjt:  NIDDLFDQLQGVTVFSKIDLRSDYHQLRIRDSDIPKTAFRSRYEHYEFIVMSFGLTNASAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLD

Query:  QVLETLRANKLHAKFSKCEFWLKKVIFLGHVVSSEGVSVDPTKIEALTS
        QVLETLRANKL+AKFSKCEFWL+KV FLGHVVSSEGVSVDP KIEA+T+
Subjt:  QVLETLRANKLHAKFSKCEFWLKKVIFLGHVVSSEGVSVDPTKIEALTS

A0A5A7VEX4 Reverse transcriptase    4.3e-179    96.11
Query:  MLDVTLLVLDMQDFDVILGMNWLSANHASIDCFRKEVVFNPPPGTSFKFKGAGIVCIPKVISAMKASKLLSQGSWSILASVVDTREPEVSLSSEPVVREY
        MLDVTLLVLDMQDFDVILGMNWLSANHASIDCFRKEVVFNPP GTSFKFKGAGIVCIPKVISAMKASKLLSQGSWSILASVVDTREPEVSLSSEPVVREY
Subjt:  MLDVTLLVLDMQDFDVILGMNWLSANHASIDCFRKEVVFNPPPGTSFKFKGAGIVCIPKVISAMKASKLLSQGSWSILASVVDTREPEVSLSSEPVVREY

Query:  PDVFPDELPGLPPPREIVFAIELELDTASISRAPYRMAPAELKELKVQLQKLLDKGFIRPNVSPWGAPVLFVKKKDGSMRLCIDYRKLNKMIVKNCYPLP
        PDVFPDELPG+PPPREIVFAIELE DTA ISRA YRMAPAELKELKVQLQ+LLDKGFIRPNVSPWGAPVLFVKKKDGSMRLCI Y+KLNKM VKN YPLP
Subjt:  PDVFPDELPGLPPPREIVFAIELELDTASISRAPYRMAPAELKELKVQLQKLLDKGFIRPNVSPWGAPVLFVKKKDGSMRLCIDYRKLNKMIVKNCYPLP

Query:  NIDDLFDQLQGVTVFSKIDLRSDYHQLRIRDSDIPKTAFRSRYEHYEFIVMSFGLTNASAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLD
         IDDLFDQLQG TVFSKIDLRSDYHQLRIRDSDIPKT FRSRYEHYEFIVMSFGLTNASAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLD
Subjt:  NIDDLFDQLQGVTVFSKIDLRSDYHQLRIRDSDIPKTAFRSRYEHYEFIVMSFGLTNASAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLD

Query:  QVLETLRANKLHAKFSKCEFWLKKVIFLGHVVSS
        QVLETLRANKLHAKFSKCEFWLKKVIFLGHVVSS
Subjt:  QVLETLRANKLHAKFSKCEFWLKKVIFLGHVVSS

SwissProt top hits    e value    % identity    Alignment
P0CT34 Transposon Tf2-1 polyprotein    1.3e-39    34.25
Query:  REPEVSLSSEPVVREYPDVFPD-ELPGLPPP-REIVFAIELELDTASISRAPYRMAPAELKELKVQLQKLLDKGFIRPNVSPWGAPVLFVKKKDGSMRLC
        +EPE+      + +E+ D+  +     LP P + + F +EL  +   +    Y + P +++ +  ++ + L  G IR + +    PV+FV KK+G++R+ 
Subjt:  REPEVSLSSEPVVREYPDVFPD-ELPGLPPP-REIVFAIELELDTASISRAPYRMAPAELKELKVQLQKLLDKGFIRPNVSPWGAPVLFVKKKDGSMRLC

Query:  IDYRKLNKMIVKNCYPLPNIDDLFDQLQGVTVFSKIDLRSDYHQLRIRDSDIPKTAFRSRYEHYEFIVMSFGLTNASAVFMDLMNRVFKDFLDTFVIVFI
        +DY+ LNK +  N YPLP I+ L  ++QG T+F+K+DL+S YH +R+R  D  K AFR     +E++VM +G++ A A F   +N +  +  ++ V+ ++
Subjt:  IDYRKLNKMIVKNCYPLPNIDDLFDQLQGVTVFSKIDLRSDYHQLRIRDSDIPKTAFRSRYEHYEFIVMSFGLTNASAVFMDLMNRVFKDFLDTFVIVFI

Query:  DDILIYSKTEAEHEEHLDQVLETLRANKLHAKFSKCEFWLKKVIFLGHVVSSEG
        DDILI+SK+E+EH +H+  VL+ L+   L    +KCEF   +V F+G+ +S +G
Subjt:  DDILIYSKTEAEHEEHLDQVLETLRANKLHAKFSKCEFWLKKVIFLGHVVSSEG

P0CT35 Transposon Tf2-2 polyprotein    1.3e-39    34.25
Query:  REPEVSLSSEPVVREYPDVFPD-ELPGLPPP-REIVFAIELELDTASISRAPYRMAPAELKELKVQLQKLLDKGFIRPNVSPWGAPVLFVKKKDGSMRLC
        +EPE+      + +E+ D+  +     LP P + + F +EL  +   +    Y + P +++ +  ++ + L  G IR + +    PV+FV KK+G++R+ 
Subjt:  REPEVSLSSEPVVREYPDVFPD-ELPGLPPP-REIVFAIELELDTASISRAPYRMAPAELKELKVQLQKLLDKGFIRPNVSPWGAPVLFVKKKDGSMRLC

Query:  IDYRKLNKMIVKNCYPLPNIDDLFDQLQGVTVFSKIDLRSDYHQLRIRDSDIPKTAFRSRYEHYEFIVMSFGLTNASAVFMDLMNRVFKDFLDTFVIVFI
        +DY+ LNK +  N YPLP I+ L  ++QG T+F+K+DL+S YH +R+R  D  K AFR     +E++VM +G++ A A F   +N +  +  ++ V+ ++
Subjt:  IDYRKLNKMIVKNCYPLPNIDDLFDQLQGVTVFSKIDLRSDYHQLRIRDSDIPKTAFRSRYEHYEFIVMSFGLTNASAVFMDLMNRVFKDFLDTFVIVFI

Query:  DDILIYSKTEAEHEEHLDQVLETLRANKLHAKFSKCEFWLKKVIFLGHVVSSEG
        DDILI+SK+E+EH +H+  VL+ L+   L    +KCEF   +V F+G+ +S +G
Subjt:  DDILIYSKTEAEHEEHLDQVLETLRANKLHAKFSKCEFWLKKVIFLGHVVSSEG

P0CT41 Transposon Tf2-12 polyprotein    1.3e-39    34.25
Query:  REPEVSLSSEPVVREYPDVFPD-ELPGLPPP-REIVFAIELELDTASISRAPYRMAPAELKELKVQLQKLLDKGFIRPNVSPWGAPVLFVKKKDGSMRLC
        +EPE+      + +E+ D+  +     LP P + + F +EL  +   +    Y + P +++ +  ++ + L  G IR + +    PV+FV KK+G++R+ 
Subjt:  REPEVSLSSEPVVREYPDVFPD-ELPGLPPP-REIVFAIELELDTASISRAPYRMAPAELKELKVQLQKLLDKGFIRPNVSPWGAPVLFVKKKDGSMRLC

Query:  IDYRKLNKMIVKNCYPLPNIDDLFDQLQGVTVFSKIDLRSDYHQLRIRDSDIPKTAFRSRYEHYEFIVMSFGLTNASAVFMDLMNRVFKDFLDTFVIVFI
        +DY+ LNK +  N YPLP I+ L  ++QG T+F+K+DL+S YH +R+R  D  K AFR     +E++VM +G++ A A F   +N +  +  ++ V+ ++
Subjt:  IDYRKLNKMIVKNCYPLPNIDDLFDQLQGVTVFSKIDLRSDYHQLRIRDSDIPKTAFRSRYEHYEFIVMSFGLTNASAVFMDLMNRVFKDFLDTFVIVFI

Query:  DDILIYSKTEAEHEEHLDQVLETLRANKLHAKFSKCEFWLKKVIFLGHVVSSEG
        DDILI+SK+E+EH +H+  VL+ L+   L    +KCEF   +V F+G+ +S +G
Subjt:  DDILIYSKTEAEHEEHLDQVLETLRANKLHAKFSKCEFWLKKVIFLGHVVSSEG

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein    3.1e-41    35.76
Query:  KASKLLSQGSWSILASVVDTREPEVSLSSEP---------VVREYPDVFPDELPGLPPPREI-----VFAIELELDTASISRAPYRMAPAELKELKVQLQ
        +AS L   G +S + S + + EP  +  S           + ++Y ++  ++LP  P P +I        IE++         PY +     +E+   +Q
Subjt:  KASKLLSQGSWSILASVVDTREPEVSLSSEP---------VVREYPDVFPDELPGLPPPREI-----VFAIELELDTASISRAPYRMAPAELKELKVQLQ

Query:  KLLDKGFIRPNVSPWGAPVLFVKKKDGSMRLCIDYRKLNKMIVKNCYPLPNIDDLFDQLQGVTVFSKIDLRSDYHQLRIRDSDIPKTAFRSRYEHYEFIV
        KLLD  FI P+ SP  +PV+ V KKDG+ RLC+DYR LNK  + + +PLP ID+L  ++    +F+ +DL S YHQ+ +   D  KTAF +    YE+ V
Subjt:  KLLDKGFIRPNVSPWGAPVLFVKKKDGSMRLCIDYRKLNKMIVKNCYPLPNIDDLFDQLQGVTVFSKIDLRSDYHQLRIRDSDIPKTAFRSRYEHYEFIV

Query:  MSFGLTNASAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLDQVLETLRANKLHAKFSKCEFWLKKVIFLGHVVSSEGVS
        M FGL NA + F   M   F+D    FV V++DDILI+S++  EH +HLD VLE L+   L  K  KC+F  ++  FLG+ +  + ++
Subjt:  MSFGLTNASAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLDQVLETLRANKLHAKFSKCEFWLKKVIFLGHVVSSEGVS

Q99315 Transposon Ty3-G Gag-Pol polyprotein    3.1e-41    35.76
Query:  KASKLLSQGSWSILASVVDTREPEVSLSSEP---------VVREYPDVFPDELPGLPPPREI-----VFAIELELDTASISRAPYRMAPAELKELKVQLQ
        +AS L   G +S + S + + EP  +  S           + ++Y ++  ++LP  P P +I        IE++         PY +     +E+   +Q
Subjt:  KASKLLSQGSWSILASVVDTREPEVSLSSEP---------VVREYPDVFPDELPGLPPPREI-----VFAIELELDTASISRAPYRMAPAELKELKVQLQ

Query:  KLLDKGFIRPNVSPWGAPVLFVKKKDGSMRLCIDYRKLNKMIVKNCYPLPNIDDLFDQLQGVTVFSKIDLRSDYHQLRIRDSDIPKTAFRSRYEHYEFIV
        KLLD  FI P+ SP  +PV+ V KKDG+ RLC+DYR LNK  + + +PLP ID+L  ++    +F+ +DL S YHQ+ +   D  KTAF +    YE+ V
Subjt:  KLLDKGFIRPNVSPWGAPVLFVKKKDGSMRLCIDYRKLNKMIVKNCYPLPNIDDLFDQLQGVTVFSKIDLRSDYHQLRIRDSDIPKTAFRSRYEHYEFIV

Query:  MSFGLTNASAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLDQVLETLRANKLHAKFSKCEFWLKKVIFLGHVVSSEGVS
        M FGL NA + F   M   F+D    FV V++DDILI+S++  EH +HLD VLE L+   L  K  KC+F  ++  FLG+ +  + ++
Subjt:  MSFGLTNASAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLDQVLETLRANKLHAKFSKCEFWLKKVIFLGHVVSSEGVS

Arabidopsis top hits    e value    % identity    Alignment
ATMG00860.1 DNA/RNA polymerases superfamily protein    8.1e-05    40.38
Query:  HLDQVLETLRANKLHAKFSKCEFWLKKVIFLG--HVVSSEGVSVDPTKIEAL
        HL  VL+    ++ +A   KC F   ++ +LG  H++S EGVS DP K+EA+
Subjt:  HLDQVLETLRANKLHAKFSKCEFWLKKVIFLG--HVVSSEGVSVDPTKIEAL
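The % identity column can be cross-checked against the middle line of each alignment. The Python sketch below assumes the usual BLAST-style convention: a letter on the middle line marks an identical residue, `+` marks a conservative substitution, and identity is the number of identical positions divided by the alignment length, gap columns included. The function name `percent_identity` is hypothetical, and the page does not state how its own figures were computed.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent identity from a BLAST-style match line.

    ASSUMPTION: letters on the match line mark identical residues, and the
    alignment length equals the length of the aligned query string
    (gap characters included).
    """
    identical = sum(ch.isalpha() for ch in midline)
    return 100.0 * identical / len(query)


# Query and match line copied from the ATMG00860.1 alignment above.
query = "HLDQVLETLRANKLHAKFSKCEFWLKKVIFLG--HVVSSEGVSVDPTKIEAL"
midline = "HL  VL+    ++ +A   KC F   ++ +LG  H++S EGVS DP K+EA+"

print(round(percent_identity(query, midline), 2))  # prints: 40.38
```

For this hit the match line has 21 identical positions over 52 aligned columns, which reproduces the listed 40.38.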


Sequences
CDS sequence
ATGTTAGACGTGACCCTGCTAGTGTTAGACATGCAGGATTTCGATGTAATACTAGGCATGAATTGGTTGTCCGCTAACCATGCAAGTATAGACTGTTTCCGTAAGGAAGT
CGTCTTTAACCCTCCCCCTGGGACTAGTTTCAAATTTAAAGGGGCAGGAATCGTATGTATACCTAAGGTCATCTCAGCCATGAAGGCTAGTAAACTACTCAGCCAGGGTT
CCTGGAGCATCTTGGCAAGCGTAGTAGATACCAGAGAACCAGAAGTTTCCCTATCCTCCGAACCAGTGGTAAGGGAGTACCCCGATGTTTTCCCTGATGAGCTTCCAGGA
CTTCCACCTCCCAGGGAGATAGTCTTCGCAATTGAGTTAGAGCTAGACACTGCTTCTATCTCGAGGGCCCCTTACAGAATGGCTCCAGCTGAGCTAAAGGAGCTGAAGGT
GCAGTTGCAGAAGTTACTGGACAAGGGTTTTATTCGACCCAATGTGTCACCTTGGGGAGCACCAGTGTTGTTTGTGAAGAAGAAGGATGGGTCGATGCGCCTTTGCATTG
ACTACAGAAAGCTGAACAAGATGATAGTTAAGAATTGCTATCCCTTGCCCAATATTGATGATTTGTTCGATCAGTTGCAAGGAGTCACCGTCTTTTCTAAGATTGACCTG
CGCTCAGACTACCACCAATTGAGGATCAGAGATAGTGATATTCCTAAGACCGCTTTTCGTTCAAGATACGAACATTACGAGTTCATTGTGATGTCTTTTGGGTTGACTAA
TGCTTCTGCGGTATTCATGGACTTGATGAATAGGGTGTTTAAGGATTTCTTAGACACGTTTGTCATAGTTTTCATTGATGACATTTTGATTTACTCCAAGACTGAGGCTG
AGCATGAGGAGCATTTGGACCAGGTTTTGGAGACTCTTCGAGCTAATAAGTTGCACGCCAAGTTCTCCAAGTGTGAGTTCTGGCTGAAGAAGGTGATTTTTCTCGGCCAC
GTAGTTTCCAGTGAAGGAGTTTCTGTGGACCCAACAAAGATCGAAGCACTTACCAGTTGA
mRNA sequence
ATGTTAGACGTGACCCTGCTAGTGTTAGACATGCAGGATTTCGATGTAATACTAGGCATGAATTGGTTGTCCGCTAACCATGCAAGTATAGACTGTTTCCGTAAGGAAGT
CGTCTTTAACCCTCCCCCTGGGACTAGTTTCAAATTTAAAGGGGCAGGAATCGTATGTATACCTAAGGTCATCTCAGCCATGAAGGCTAGTAAACTACTCAGCCAGGGTT
CCTGGAGCATCTTGGCAAGCGTAGTAGATACCAGAGAACCAGAAGTTTCCCTATCCTCCGAACCAGTGGTAAGGGAGTACCCCGATGTTTTCCCTGATGAGCTTCCAGGA
CTTCCACCTCCCAGGGAGATAGTCTTCGCAATTGAGTTAGAGCTAGACACTGCTTCTATCTCGAGGGCCCCTTACAGAATGGCTCCAGCTGAGCTAAAGGAGCTGAAGGT
GCAGTTGCAGAAGTTACTGGACAAGGGTTTTATTCGACCCAATGTGTCACCTTGGGGAGCACCAGTGTTGTTTGTGAAGAAGAAGGATGGGTCGATGCGCCTTTGCATTG
ACTACAGAAAGCTGAACAAGATGATAGTTAAGAATTGCTATCCCTTGCCCAATATTGATGATTTGTTCGATCAGTTGCAAGGAGTCACCGTCTTTTCTAAGATTGACCTG
CGCTCAGACTACCACCAATTGAGGATCAGAGATAGTGATATTCCTAAGACCGCTTTTCGTTCAAGATACGAACATTACGAGTTCATTGTGATGTCTTTTGGGTTGACTAA
TGCTTCTGCGGTATTCATGGACTTGATGAATAGGGTGTTTAAGGATTTCTTAGACACGTTTGTCATAGTTTTCATTGATGACATTTTGATTTACTCCAAGACTGAGGCTG
AGCATGAGGAGCATTTGGACCAGGTTTTGGAGACTCTTCGAGCTAATAAGTTGCACGCCAAGTTCTCCAAGTGTGAGTTCTGGCTGAAGAAGGTGATTTTTCTCGGCCAC
GTAGTTTCCAGTGAAGGAGTTTCTGTGGACCCAACAAAGATCGAAGCACTTACCAGTTGA
Protein sequence
MLDVTLLVLDMQDFDVILGMNWLSANHASIDCFRKEVVFNPPPGTSFKFKGAGIVCIPKVISAMKASKLLSQGSWSILASVVDTREPEVSLSSEPVVREYPDVFPDELPG
LPPPREIVFAIELELDTASISRAPYRMAPAELKELKVQLQKLLDKGFIRPNVSPWGAPVLFVKKKDGSMRLCIDYRKLNKMIVKNCYPLPNIDDLFDQLQGVTVFSKIDL
RSDYHQLRIRDSDIPKTAFRSRYEHYEFIVMSFGLTNASAVFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHEEHLDQVLETLRANKLHAKFSKCEFWLKKVIFLGH
VVSSEGVSVDPTKIEALTS
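The relationship between the CDS and protein sequences above can be checked by translating the CDS. The Python sketch below assumes the standard genetic code (NCBI translation table 1). The page does not say which table was used, and the names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First eight and last five codons of the CDS listed above.
print(translate("ATGTTAGACGTGACCCTGCTAGTG"))  # prints: MLDVTLLV
print(translate("GCACTTACCAGTTGA"))           # prints: ALTS*
```

Both fragments agree with the start (MLDVTLLV) and end (ALTS) of the listed protein sequence, with the final TGA read as a stop codon.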