; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc04g0101421 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc04g0101421
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionReverse transcriptase
Genome locationCMiso1.1chr04:17722306..17723175
RNA-Seq ExpressionCmc04g0101421
SyntenyCmc04g0101421
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0033091.1 putative Gag-pol protein [Cucumis melo var. makuwa]3.9e-13286.13Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSHWGAPMLFIKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPM
        MAPAELKELKVQLQELLDKGFIRPSVS WGAP+LF+KKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGAT+FSKIDLRSGYHQLRIRDS IP 
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSHWGAPMLFIKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPM

Query:  TAFRSRYEHYEFIVMSFGLTNASAAFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHKEYLHQVLEALRANKLYSKFSNCEFWRKKVTFLDHVVSSEG
        TAFRSRYEHYEFIVMSFGLTNA A FMDLMNRVFKDFLD+FVIVFIDDILIYSKTEAEH+E+LHQVLE LRAN+LY KFS CEFW KKV+FL HVVSSEG
Subjt:  TAFRSRYEHYEFIVMSFGLTNASAAFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHKEYLHQVLEALRANKLYSKFSNCEFWRKKVTFLDHVVSSEG

Query:  VSVDPTKIEAVTSWPRLSTVSDVRSFLGLVGYYRRFVEDFSRISSPLTQLTRKGTPFVWSQLVRVASRRLSRSL
        VSVDP KIEA+T+WPR STVS++RSFLGL GYYRRFVEDFSRI+SPLTQLTRKGTPFVWS     + + L + L
Subjt:  VSVDPTKIEAVTSWPRLSTVSDVRSFLGLVGYYRRFVEDFSRISSPLTQLTRKGTPFVWSQLVRVASRRLSRSL

KAA0058450.1 pol protein [Cucumis melo var. makuwa]3.1e-161100Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSHWGAPMLFIKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPM
        MAPAELKELKVQLQELLDKGFIRPSVSHWGAPMLFIKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPM
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSHWGAPMLFIKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPM

Query:  TAFRSRYEHYEFIVMSFGLTNASAAFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHKEYLHQVLEALRANKLYSKFSNCEFWRKKVTFLDHVVSSEG
        TAFRSRYEHYEFIVMSFGLTNASAAFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHKEYLHQVLEALRANKLYSKFSNCEFWRKKVTFLDHVVSSEG
Subjt:  TAFRSRYEHYEFIVMSFGLTNASAAFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHKEYLHQVLEALRANKLYSKFSNCEFWRKKVTFLDHVVSSEG

Query:  VSVDPTKIEAVTSWPRLSTVSDVRSFLGLVGYYRRFVEDFSRISSPLTQLTRKGTPFVWSQLVRVASRRLSRSLCLHQSLQYQMDLGVL
        VSVDPTKIEAVTSWPRLSTVSDVRSFLGLVGYYRRFVEDFSRISSPLTQLTRKGTPFVWSQLVRVASRRLSRSLCLHQSLQYQMDLGVL
Subjt:  VSVDPTKIEAVTSWPRLSTVSDVRSFLGLVGYYRRFVEDFSRISSPLTQLTRKGTPFVWSQLVRVASRRLSRSLCLHQSLQYQMDLGVL

KAA0060745.1 pol protein [Cucumis melo var. makuwa]3.0e-13285.51Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSHWGAPMLFIKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPM
        MAPAELKELKVQLQELLDKGFIRPSVS WGAP+LF+KKKDGSMRLCIDYRELNKVTVKNRYPLP+IDDLFDQLQGATVFSKIDLRSGYHQLRIRD DIP 
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSHWGAPMLFIKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPM

Query:  TAFRSRYEHYEFIVMSFGLTNASAAFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHKEYLHQVLEALRANKLYSKFSNCEFWRKKVTFLDHVVSSEG
        TAFRSRY HYEF+VMSFGLTNA A FMDLMNRVFKDFLD+FVIVFIDDILIYSKTEAEH+E+LHQVLE LRANKLY+KFS CEFW +KVTFL HVVSSEG
Subjt:  TAFRSRYEHYEFIVMSFGLTNASAAFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHKEYLHQVLEALRANKLYSKFSNCEFWRKKVTFLDHVVSSEG

Query:  VSVDPTKIEAVTSWPRLSTVSDVRSFLGLVGYYRRFVEDFSRISSPLTQLTRKGTPFVWSQLVRVASRRLSRSLCL
        VSVDP KIEAVT+WPR STVS++RSFLGL GYYRRFVEDFSRI+SPLTQLTRKGTPFVWS     + + L + L +
Subjt:  VSVDPTKIEAVTSWPRLSTVSDVRSFLGLVGYYRRFVEDFSRISSPLTQLTRKGTPFVWSQLVRVASRRLSRSLCL

KAA0063793.1 pol protein [Cucumis melo var. makuwa]2.3e-13286.5Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSHWGAPMLFIKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPM
        MAPAELKELKVQLQELLDKGFIRPSVS WGAP+LF+KKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRD DIP 
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSHWGAPMLFIKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPM

Query:  TAFRSRYEHYEFIVMSFGLTNASAAFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHKEYLHQVLEALRANKLYSKFSNCEFWRKKVTFLDHVVSSEG
        TAFRSRY HYEF+VMSFGLTNA A FMDLMNRVFKDFLD+FVIVFIDDILIYSKTEAEH+E+LHQVLE LRANKLY+KFS CEFW +KVTFL HVVSSEG
Subjt:  TAFRSRYEHYEFIVMSFGLTNASAAFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHKEYLHQVLEALRANKLYSKFSNCEFWRKKVTFLDHVVSSEG

Query:  VSVDPTKIEAVTSWPRLSTVSDVRSFLGLVGYYRRFVEDFSRISSPLTQLTRKGTPFVWSQLVRVASRRLSRSL
        VSVDP KIEAVT+WPR STVS++RSFLGL GYYRRFVEDFSRI+SPLTQLTRKGTPFVWS     + + L + L
Subjt:  VSVDPTKIEAVTSWPRLSTVSDVRSFLGLVGYYRRFVEDFSRISSPLTQLTRKGTPFVWSQLVRVASRRLSRSL

TYK18480.1 pol protein [Cucumis melo var. makuwa]1.8e-13287.23Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSHWGAPMLFIKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPM
        MAPAELKELKVQLQELLDKGFIRPSVS WGAP+LF+KKKDGSMRLCIDYRELNKVTVKNRYPL RIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIP 
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSHWGAPMLFIKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPM

Query:  TAFRSRYEHYEFIVMSFGLTNASAAFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHKEYLHQVLEALRANKLYSKFSNCEFWRKKVTFLDHVVSSEG
        TAFRSRY HYEFIVMSFGLTNA A FMDLMNRVFKDFLD+FVIVFIDDILIYSKTEAEH+E+LHQVLE LRANKLY+KFS CEFW +KVTFL HVVSSEG
Subjt:  TAFRSRYEHYEFIVMSFGLTNASAAFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHKEYLHQVLEALRANKLYSKFSNCEFWRKKVTFLDHVVSSEG

Query:  VSVDPTKIEAVTSWPRLSTVSDVRSFLGLVGYYRRFVEDFSRISSPLTQLTRKGTPFVWSQLVRVASRRLSRSL
        VSVDPTKIEAVT+WPR STVS++RSFLGL GYYRRFVEDFSRI+SPLTQLTRKGTPFVWS     + + L + L
Subjt:  VSVDPTKIEAVTSWPRLSTVSDVRSFLGLVGYYRRFVEDFSRISSPLTQLTRKGTPFVWSQLVRVASRRLSRSL

TrEMBL top hitse value%identityAlignment
A0A5A7SUC9 Putative Gag-pol protein1.9e-13286.13Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSHWGAPMLFIKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPM
        MAPAELKELKVQLQELLDKGFIRPSVS WGAP+LF+KKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGAT+FSKIDLRSGYHQLRIRDS IP 
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSHWGAPMLFIKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPM

Query:  TAFRSRYEHYEFIVMSFGLTNASAAFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHKEYLHQVLEALRANKLYSKFSNCEFWRKKVTFLDHVVSSEG
        TAFRSRYEHYEFIVMSFGLTNA A FMDLMNRVFKDFLD+FVIVFIDDILIYSKTEAEH+E+LHQVLE LRAN+LY KFS CEFW KKV+FL HVVSSEG
Subjt:  TAFRSRYEHYEFIVMSFGLTNASAAFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHKEYLHQVLEALRANKLYSKFSNCEFWRKKVTFLDHVVSSEG

Query:  VSVDPTKIEAVTSWPRLSTVSDVRSFLGLVGYYRRFVEDFSRISSPLTQLTRKGTPFVWSQLVRVASRRLSRSL
        VSVDP KIEA+T+WPR STVS++RSFLGL GYYRRFVEDFSRI+SPLTQLTRKGTPFVWS     + + L + L
Subjt:  VSVDPTKIEAVTSWPRLSTVSDVRSFLGLVGYYRRFVEDFSRISSPLTQLTRKGTPFVWSQLVRVASRRLSRSL

A0A5A7V4E4 Reverse transcriptase1.5e-13285.51Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSHWGAPMLFIKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPM
        MAPAELKELKVQLQELLDKGFIRPSVS WGAP+LF+KKKDGSMRLCIDYRELNKVTVKNRYPLP+IDDLFDQLQGATVFSKIDLRSGYHQLRIRD DIP 
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSHWGAPMLFIKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPM

Query:  TAFRSRYEHYEFIVMSFGLTNASAAFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHKEYLHQVLEALRANKLYSKFSNCEFWRKKVTFLDHVVSSEG
        TAFRSRY HYEF+VMSFGLTNA A FMDLMNRVFKDFLD+FVIVFIDDILIYSKTEAEH+E+LHQVLE LRANKLY+KFS CEFW +KVTFL HVVSSEG
Subjt:  TAFRSRYEHYEFIVMSFGLTNASAAFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHKEYLHQVLEALRANKLYSKFSNCEFWRKKVTFLDHVVSSEG

Query:  VSVDPTKIEAVTSWPRLSTVSDVRSFLGLVGYYRRFVEDFSRISSPLTQLTRKGTPFVWSQLVRVASRRLSRSLCL
        VSVDP KIEAVT+WPR STVS++RSFLGL GYYRRFVEDFSRI+SPLTQLTRKGTPFVWS     + + L + L +
Subjt:  VSVDPTKIEAVTSWPRLSTVSDVRSFLGLVGYYRRFVEDFSRISSPLTQLTRKGTPFVWSQLVRVASRRLSRSLCL

A0A5A7V6R2 Reverse transcriptase1.1e-13286.5Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSHWGAPMLFIKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPM
        MAPAELKELKVQLQELLDKGFIRPSVS WGAP+LF+KKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRD DIP 
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSHWGAPMLFIKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPM

Query:  TAFRSRYEHYEFIVMSFGLTNASAAFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHKEYLHQVLEALRANKLYSKFSNCEFWRKKVTFLDHVVSSEG
        TAFRSRY HYEF+VMSFGLTNA A FMDLMNRVFKDFLD+FVIVFIDDILIYSKTEAEH+E+LHQVLE LRANKLY+KFS CEFW +KVTFL HVVSSEG
Subjt:  TAFRSRYEHYEFIVMSFGLTNASAAFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHKEYLHQVLEALRANKLYSKFSNCEFWRKKVTFLDHVVSSEG

Query:  VSVDPTKIEAVTSWPRLSTVSDVRSFLGLVGYYRRFVEDFSRISSPLTQLTRKGTPFVWSQLVRVASRRLSRSL
        VSVDP KIEAVT+WPR STVS++RSFLGL GYYRRFVEDFSRI+SPLTQLTRKGTPFVWS     + + L + L
Subjt:  VSVDPTKIEAVTSWPRLSTVSDVRSFLGLVGYYRRFVEDFSRISSPLTQLTRKGTPFVWSQLVRVASRRLSRSL

A0A5D3BA18 Pol protein1.5e-161100Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSHWGAPMLFIKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPM
        MAPAELKELKVQLQELLDKGFIRPSVSHWGAPMLFIKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPM
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSHWGAPMLFIKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPM

Query:  TAFRSRYEHYEFIVMSFGLTNASAAFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHKEYLHQVLEALRANKLYSKFSNCEFWRKKVTFLDHVVSSEG
        TAFRSRYEHYEFIVMSFGLTNASAAFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHKEYLHQVLEALRANKLYSKFSNCEFWRKKVTFLDHVVSSEG
Subjt:  TAFRSRYEHYEFIVMSFGLTNASAAFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHKEYLHQVLEALRANKLYSKFSNCEFWRKKVTFLDHVVSSEG

Query:  VSVDPTKIEAVTSWPRLSTVSDVRSFLGLVGYYRRFVEDFSRISSPLTQLTRKGTPFVWSQLVRVASRRLSRSLCLHQSLQYQMDLGVL
        VSVDPTKIEAVTSWPRLSTVSDVRSFLGLVGYYRRFVEDFSRISSPLTQLTRKGTPFVWSQLVRVASRRLSRSLCLHQSLQYQMDLGVL
Subjt:  VSVDPTKIEAVTSWPRLSTVSDVRSFLGLVGYYRRFVEDFSRISSPLTQLTRKGTPFVWSQLVRVASRRLSRSLCLHQSLQYQMDLGVL

A0A5D3D4M7 Pol protein8.6e-13387.23Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSHWGAPMLFIKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPM
        MAPAELKELKVQLQELLDKGFIRPSVS WGAP+LF+KKKDGSMRLCIDYRELNKVTVKNRYPL RIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIP 
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSHWGAPMLFIKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPM

Query:  TAFRSRYEHYEFIVMSFGLTNASAAFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHKEYLHQVLEALRANKLYSKFSNCEFWRKKVTFLDHVVSSEG
        TAFRSRY HYEFIVMSFGLTNA A FMDLMNRVFKDFLD+FVIVFIDDILIYSKTEAEH+E+LHQVLE LRANKLY+KFS CEFW +KVTFL HVVSSEG
Subjt:  TAFRSRYEHYEFIVMSFGLTNASAAFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHKEYLHQVLEALRANKLYSKFSNCEFWRKKVTFLDHVVSSEG

Query:  VSVDPTKIEAVTSWPRLSTVSDVRSFLGLVGYYRRFVEDFSRISSPLTQLTRKGTPFVWSQLVRVASRRLSRSL
        VSVDPTKIEAVT+WPR STVS++RSFLGL GYYRRFVEDFSRI+SPLTQLTRKGTPFVWS     + + L + L
Subjt:  VSVDPTKIEAVTSWPRLSTVSDVRSFLGLVGYYRRFVEDFSRISSPLTQLTRKGTPFVWSQLVRVASRRLSRSL

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.68.7e-5036.9Show/hide
Query:  KELKVQLQELLDKGFIRPSVSHWGAPMLFI-KKKDGS----MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPMT
        +E++ Q+Q++L++G IR S S + +P+  + KK+D S     R+ IDYR+LN++TV +R+P+P +D++  +L     F+ IDL  G+HQ+ +    +  T
Subjt:  KELKVQLQELLDKGFIRPSVSHWGAPMLFI-KKKDGS----MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPMT

Query:  AFRSRYEHYEFIVMSFGLTNASAAFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHKEYLHQVLEALRANKLYSKFSNCEFWRKKVTFLDHVVSSEGV
        AF +++ HYE++ M FGL NA A F   MN + +  L+   +V++DDI+++S +  EH + L  V E L    L  +   CEF +++ TFL HV++ +G+
Subjt:  AFRSRYEHYEFIVMSFGLTNASAAFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHKEYLHQVLEALRANKLYSKFSNCEFWRKKVTFLDHVVSSEGV

Query:  SVDPTKIEAVTSWPRLSTVSDVRSFLGLVGYYRRFVEDFSRISSPLTQLTRK
          +P KIEA+  +P  +   ++++FLGL GYYR+F+ +F+ I+ P+T+  +K
Subjt:  SVDPTKIEAVTSWPRLSTVSDVRSFLGLVGYYRRFVEDFSRISSPLTQLTRK

P0CT41 Transposon Tf2-12 polyprotein2.6e-4632.48Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSHWGAPMLFIKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPM
        + P +++ +  ++ + L  G IR S +    P++F+ KK+G++R+ +DY+ LNK    N YPLP I+ L  ++QG+T+F+K+DL+S YH +R+R  D   
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSHWGAPMLFIKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPM

Query:  TAFRSRYEHYEFIVMSFGLTNASAAFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHKEYLHQVLEALRANKLYSKFSNCEFWRKKVTFLDHVVSSEG
         AFR     +E++VM +G++ A A F   +N +  +  ++ V+ ++DDILI+SK+E+EH +++  VL+ L+   L    + CEF + +V F+ + +S +G
Subjt:  TAFRSRYEHYEFIVMSFGLTNASAAFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHKEYLHQVLEALRANKLYSKFSNCEFWRKKVTFLDHVVSSEG

Query:  VSVDPTKIEAVTSWPRLSTVSDVRSFLGLVGYYRRFVEDFSRISSPLTQLTRKGTPFVWSQLVRVASRRLSRSL
         +     I+ V  W +     ++R FLG V Y R+F+   S+++ PL  L +K   + W+     A   + + L
Subjt:  VSVDPTKIEAVTSWPRLSTVSDVRSFLGLVGYYRRFVEDFSRISSPLTQLTRKGTPFVWSQLVRVASRRLSRSL

P20825 Retrovirus-related Pol polyprotein from transposon 2972.5e-4936.36Show/hide
Query:  ELKVQLQELLDKGFIRPSVSHWGAPMLFIKKKD-----GSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPMTA
        E++ Q+QE+L++G IR S S + +P   + KK         R+ IDYR+LN++T+ +RYP+P +D++  +L     F+ IDL  G+HQ+ + +  I  TA
Subjt:  ELKVQLQELLDKGFIRPSVSHWGAPMLFIKKKD-----GSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPMTA

Query:  FRSRYEHYEFIVMSFGLTNASAAFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHKEYLHQVLEALRANKLYSKFSNCEFWRKKVTFLDHVVSSEGVS
        F ++  HYE++ M FGL NA A F   MN + +  L+   +V++DDI+I+S +  EH   +  V   L    L  +   CEF +K+  FL H+V+ +G+ 
Subjt:  FRSRYEHYEFIVMSFGLTNASAAFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHKEYLHQVLEALRANKLYSKFSNCEFWRKKVTFLDHVVSSEGVS

Query:  VDPTKIEAVTSWPRLSTVSDVRSFLGLVGYYRRFVEDFSRISSPLTQLTRKGT
         +P K++A+ S+P  +   ++R+FLGL GYYR+F+ +++ I+ P+T   +K T
Subjt:  VDPTKIEAVTSWPRLSTVSDVRSFLGLVGYYRRFVEDFSRISSPLTQLTRKGT

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein1.1e-4737.55Show/hide
Query:  KELKVQLQELLDKGFIRPSVSHWGAPMLFIKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPMTAFRSR
        +E+   +Q+LLD  FI PS S   +P++ + KKDG+ RLC+DYR LNK T+ + +PLPRID+L  ++  A +F+ +DL SGYHQ+ +   D   TAF + 
Subjt:  KELKVQLQELLDKGFIRPSVSHWGAPMLFIKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPMTAFRSR

Query:  YEHYEFIVMSFGLTNASAAFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHKEYLHQVLEALRANKLYSKFSNCEFWRKKVTFLDHVVSSEGVSVDPT
           YE+ VM FGL NA + F   M   F+D    FV V++DDILI+S++  EH ++L  VLE L+   L  K   C+F  ++  FL + +  + ++    
Subjt:  YEHYEFIVMSFGLTNASAAFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHKEYLHQVLEALRANKLYSKFSNCEFWRKKVTFLDHVVSSEGVSVDPT

Query:  KIEAVTSWPRLSTVSDVRSFLGLVGYYRRFVEDFSRISSPLTQLTRKGTPFVWSQLVRVASRRLSRSLC
        K  A+  +P   TV   + FLG++ YYRRF+ + S+I+ P+       +   W++    A  +L  +LC
Subjt:  KIEAVTSWPRLSTVSDVRSFLGLVGYYRRFVEDFSRISSPLTQLTRKGTPFVWSQLVRVASRRLSRSLC

Q99315 Transposon Ty3-G Gag-Pol polyprotein2.4e-4737.55Show/hide
Query:  KELKVQLQELLDKGFIRPSVSHWGAPMLFIKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPMTAFRSR
        +E+   +Q+LLD  FI PS S   +P++ + KKDG+ RLC+DYR LNK T+ + +PLPRID+L  ++  A +F+ +DL SGYHQ+ +   D   TAF + 
Subjt:  KELKVQLQELLDKGFIRPSVSHWGAPMLFIKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPMTAFRSR

Query:  YEHYEFIVMSFGLTNASAAFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHKEYLHQVLEALRANKLYSKFSNCEFWRKKVTFLDHVVSSEGVSVDPT
           YE+ VM FGL NA + F   M   F+D    FV V++DDILI+S++  EH ++L  VLE L+   L  K   C+F  ++  FL + +  + ++    
Subjt:  YEHYEFIVMSFGLTNASAAFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHKEYLHQVLEALRANKLYSKFSNCEFWRKKVTFLDHVVSSEGVSVDPT

Query:  KIEAVTSWPRLSTVSDVRSFLGLVGYYRRFVEDFSRISSPLTQLTRKGTPFVWSQLVRVASRRLSRSLC
        K  A+  +P   TV   + FLG++ YYRRF+ + S+I+ P+       +   W++    A  +L  +LC
Subjt:  KIEAVTSWPRLSTVSDVRSFLGLVGYYRRFVEDFSRISSPLTQLTRKGTPFVWSQLVRVASRRLSRSLC

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein2.4e-1836.94Show/hide
Query:  YLHQVLEALRANKLYSKFSNCEFWRKKVTFLD--HVVSSEGVSVDPTKIEAVTSWPRLSTVSDVRSFLGLVGYYRRFVEDFSRISSPLTQLTRKGTPFVW
        +L  VL+    ++ Y+    C F + ++ +L   H++S EGVS DP K+EA+  WP     +++R FLGL GYYRRFV+++ +I  PLT+L +K +   W
Subjt:  YLHQVLEALRANKLYSKFSNCEFWRKKVTFLD--HVVSSEGVSVDPTKIEAVTSWPRLSTVSDVRSFLGLVGYYRRFVEDFSRISSPLTQLTRKGTPFVW

Query:  SQLVRVASRRL
        +++  +A + L
Subjt:  SQLVRVASRRL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTCCAGCTGAGTTAAAGGAGCTGAAGGTGCAGTTGCAAGAGTTGCTGGACAAGGGTTTTATTCGACCCAGTGTGTCACATTGGGGAGCACCAATGTTGTTCATAAA
GAAGAAGGATGGGTCAATGCGCCTTTGCATTGACTACAGAGAGCTAAACAAGGTGACAGTCAAGAACCGGTACCCCTTGCCCAGGATTGATGATTTGTTTGATCAGTTGC
AAGGAGCCACCGTCTTTTCTAAGATCGACCTACGATCAGGCTACCACCAGCTAAGGATCAGGGATAGTGATATTCCTATGACCGCTTTCCGTTCTAGATACGAGCATTAC
GAGTTCATTGTGATGTCCTTTGGTTTGACTAATGCTTCTGCGGCATTCATGGACTTGATGAACAGGGTGTTTAAGGACTTCTTAGACACGTTTGTCATAGTTTTTATTGA
CGACATTTTGATTTACTCCAAGACTGAGGCCGAGCATAAAGAGTATTTGCACCAGGTTTTGGAGGCTCTTCGAGCTAATAAGCTGTACTCCAAGTTCTCCAACTGTGAGT
TCTGGCGGAAGAAGGTGACTTTCCTTGACCATGTGGTTTCCAGTGAGGGAGTTTCTGTGGACCCAACAAAGATCGAAGCGGTTACCAGTTGGCCTCGACTGTCTACGGTT
AGCGACGTTCGTAGTTTCCTGGGTTTAGTAGGTTACTATAGGAGGTTCGTGGAAGATTTCTCTCGTATATCCAGTCCCTTGACTCAGTTGACCAGGAAGGGGACTCCTTT
TGTTTGGAGCCAGCTTGTGAGAGTAGCTTCTAGGAGATTAAGTAGAAGCTTGTGTCTGCACCAGTCCTTACAGTACCAGATGGATCTGGGAGTTTTGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTCCAGCTGAGTTAAAGGAGCTGAAGGTGCAGTTGCAAGAGTTGCTGGACAAGGGTTTTATTCGACCCAGTGTGTCACATTGGGGAGCACCAATGTTGTTCATAAA
GAAGAAGGATGGGTCAATGCGCCTTTGCATTGACTACAGAGAGCTAAACAAGGTGACAGTCAAGAACCGGTACCCCTTGCCCAGGATTGATGATTTGTTTGATCAGTTGC
AAGGAGCCACCGTCTTTTCTAAGATCGACCTACGATCAGGCTACCACCAGCTAAGGATCAGGGATAGTGATATTCCTATGACCGCTTTCCGTTCTAGATACGAGCATTAC
GAGTTCATTGTGATGTCCTTTGGTTTGACTAATGCTTCTGCGGCATTCATGGACTTGATGAACAGGGTGTTTAAGGACTTCTTAGACACGTTTGTCATAGTTTTTATTGA
CGACATTTTGATTTACTCCAAGACTGAGGCCGAGCATAAAGAGTATTTGCACCAGGTTTTGGAGGCTCTTCGAGCTAATAAGCTGTACTCCAAGTTCTCCAACTGTGAGT
TCTGGCGGAAGAAGGTGACTTTCCTTGACCATGTGGTTTCCAGTGAGGGAGTTTCTGTGGACCCAACAAAGATCGAAGCGGTTACCAGTTGGCCTCGACTGTCTACGGTT
AGCGACGTTCGTAGTTTCCTGGGTTTAGTAGGTTACTATAGGAGGTTCGTGGAAGATTTCTCTCGTATATCCAGTCCCTTGACTCAGTTGACCAGGAAGGGGACTCCTTT
TGTTTGGAGCCAGCTTGTGAGAGTAGCTTCTAGGAGATTAAGTAGAAGCTTGTGTCTGCACCAGTCCTTACAGTACCAGATGGATCTGGGAGTTTTGTGA
Protein sequenceShow/hide protein sequence
MAPAELKELKVQLQELLDKGFIRPSVSHWGAPMLFIKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPMTAFRSRYEHY
EFIVMSFGLTNASAAFMDLMNRVFKDFLDTFVIVFIDDILIYSKTEAEHKEYLHQVLEALRANKLYSKFSNCEFWRKKVTFLDHVVSSEGVSVDPTKIEAVTSWPRLSTV
SDVRSFLGLVGYYRRFVEDFSRISSPLTQLTRKGTPFVWSQLVRVASRRLSRSLCLHQSLQYQMDLGVL