; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc02g0052391 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc02g0052391
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionTransposon Ty3-G Gag-Pol polyprotein
Genome locationCMiso1.1chr02:19736341..19737066
RNA-Seq ExpressionCmc02g0052391
SyntenyCmc02g0052391
Gene Ontology termsGO:0006468 - protein phosphorylation (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004672 - protein kinase activity (molecular function)
GO:0005524 - ATP binding (molecular function)
GO:0008445 - D-aspartate oxidase activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KYP35881.1 Transposon Ty3-G Gag-Pol polyprotein, partial [Cajanus cajan]8.2e-9969.29Show/hide
Query:  MIERLTGRSYFCFLDGFFGFFQIPIACEDQKKTTFTCDYGTYAFRCMPFGLCNAPGIFQRCMMSIFFDFIEKCIEVFMDDFTVYGDSFDACLASLESILN
        M+ERL G+S++ FLDGF G+FQI IA EDQ+KT FTC +GT+A+R MPFGLCNAPG FQRCM+SIF DF+E CIE+FMDDFTVYG SFDACL SL+  LN
Subjt:  MIERLTGRSYFCFLDGFFGFFQIPIACEDQKKTTFTCDYGTYAFRCMPFGLCNAPGIFQRCMMSIFFDFIEKCIEVFMDDFTVYGDSFDACLASLESILN

Query:  RCIETNLVLNYEKCHFMVSHGIVLEHLVSSEGIEVDKAKIDIIQKFPYPTCLNDVRSFLGSAGFCRWFIKDFSKIALPLTNLLQKDVPFLIDDNCKKTFD
        RCIETNLVLN+EKCHFMV  GIVL H++SS+GIEVD AK+ +I + PYP+C+ +VRSFLG AGF R F+K+FSK ALPL+NLLQKDV F+ DD CK+ FD
Subjt:  RCIETNLVLNYEKCHFMVSHGIVLEHLVSSEGIEVDKAKIDIIQKFPYPTCLNDVRSFLGSAGFCRWFIKDFSKIALPLTNLLQKDVPFLIDDNCKKTFD

Query:  DLKGRLVSTPILQSPNWNLPFEIMCDASNLALGVVLGQRID
         LK  L +TPI+Q+P+W +PFE+MCDASN ALG VL QR+D
Subjt:  DLKGRLVSTPILQSPNWNLPFEIMCDASNLALGVVLGQRID

KYP55026.1 Retrovirus-related Pol polyprotein from transposon 17.6 [Cajanus cajan]4.3e-10070.12Show/hide
Query:  MIERLTGRSYFCFLDGFFGFFQIPIACEDQKKTTFTCDYGTYAFRCMPFGLCNAPGIFQRCMMSIFFDFIEKCIEVFMDDFTVYGDSFDACLASLESILN
        M+ERL G+S++CFLDGF G+FQI IA EDQ+KTTFTC +GT+A+R MPFGLCNAPG FQRCM+SIF DF+E CI+VFMDDFTVYG SFDACL SL+ +LN
Subjt:  MIERLTGRSYFCFLDGFFGFFQIPIACEDQKKTTFTCDYGTYAFRCMPFGLCNAPGIFQRCMMSIFFDFIEKCIEVFMDDFTVYGDSFDACLASLESILN

Query:  RCIETNLVLNYEKCHFMVSHGIVLEHLVSSEGIEVDKAKIDIIQKFPYPTCLNDVRSFLGSAGFCRWFIKDFSKIALPLTNLLQKDVPFLIDDNCKKTFD
        RCIETNLVLN+EKCHFMV  GIVL H++SS GI+VD AKI +I + PYP+C+ +VRSFLG AGF R FIKDFSK ALPL++LLQKD+ F  DD CK+ FD
Subjt:  RCIETNLVLNYEKCHFMVSHGIVLEHLVSSEGIEVDKAKIDIIQKFPYPTCLNDVRSFLGSAGFCRWFIKDFSKIALPLTNLLQKDVPFLIDDNCKKTFD

Query:  DLKGRLVSTPILQSPNWNLPFEIMCDASNLALGVVLGQRID
         LK  L +TPI+Q+P+W +PFE+MCDASN ALG VL QR+D
Subjt:  DLKGRLVSTPILQSPNWNLPFEIMCDASNLALGVVLGQRID

RZB41284.1 Transposon Ty3-G Gag-Pol polyprotein [Glycine soja]2.1e-9969.71Show/hide
Query:  MIERLTGRSYFCFLDGFFGFFQIPIACEDQKKTTFTCDYGTYAFRCMPFGLCNAPGIFQRCMMSIFFDFIEKCIEVFMDDFTVYGDSFDACLASLESILN
        M+ERL G+S++CFLDGF G+ QI IA EDQ+KTTFTC +GT+A+R MPFGLCNAPG FQRCM+SIF DF+E CIEVFMDDFTVYG SFD CL SLE +LN
Subjt:  MIERLTGRSYFCFLDGFFGFFQIPIACEDQKKTTFTCDYGTYAFRCMPFGLCNAPGIFQRCMMSIFFDFIEKCIEVFMDDFTVYGDSFDACLASLESILN

Query:  RCIETNLVLNYEKCHFMVSHGIVLEHLVSSEGIEVDKAKIDIIQKFPYPTCLNDVRSFLGSAGFCRWFIKDFSKIALPLTNLLQKDVPFLIDDNCKKTFD
        RCIETNLVLN+EKCHFMV  GIVL H++S++GIEVD AKI +I + PYP+C+ +VRSFLG AGF R FI+DFSK+ALPL+NLLQK+V F  +D CK+ FD
Subjt:  RCIETNLVLNYEKCHFMVSHGIVLEHLVSSEGIEVDKAKIDIIQKFPYPTCLNDVRSFLGSAGFCRWFIKDFSKIALPLTNLLQKDVPFLIDDNCKKTFD

Query:  DLKGRLVSTPILQSPNWNLPFEIMCDASNLALGVVLGQRID
         LK  L +TPI+Q+P+W  PFE+MCDASN ALG VL Q+ID
Subjt:  DLKGRLVSTPILQSPNWNLPFEIMCDASNLALGVVLGQRID

RZC25552.1 Transposon Ty3-G Gag-Pol polyprotein [Glycine soja]2.1e-9969.71Show/hide
Query:  MIERLTGRSYFCFLDGFFGFFQIPIACEDQKKTTFTCDYGTYAFRCMPFGLCNAPGIFQRCMMSIFFDFIEKCIEVFMDDFTVYGDSFDACLASLESILN
        M+ERL G+S++CFLDGF G+ QI IA EDQ+KTTFTC +GT+A+R MPFGLCNAPG FQRCM+SIF DF+E CIEVFMDDFTVYG SFD CL SLE +LN
Subjt:  MIERLTGRSYFCFLDGFFGFFQIPIACEDQKKTTFTCDYGTYAFRCMPFGLCNAPGIFQRCMMSIFFDFIEKCIEVFMDDFTVYGDSFDACLASLESILN

Query:  RCIETNLVLNYEKCHFMVSHGIVLEHLVSSEGIEVDKAKIDIIQKFPYPTCLNDVRSFLGSAGFCRWFIKDFSKIALPLTNLLQKDVPFLIDDNCKKTFD
        RCIETNLVLN+EKCHFMV  GIVL H++S++GIEVD AKI +I + PYP+C+ +VRSFLG AGF R FI+DFSK+ALPL+NLLQK+V F  +D CK+ FD
Subjt:  RCIETNLVLNYEKCHFMVSHGIVLEHLVSSEGIEVDKAKIDIIQKFPYPTCLNDVRSFLGSAGFCRWFIKDFSKIALPLTNLLQKDVPFLIDDNCKKTFD

Query:  DLKGRLVSTPILQSPNWNLPFEIMCDASNLALGVVLGQRID
         LK  L +TPI+Q+P+W  PFE+MCDASN ALG VL Q+ID
Subjt:  DLKGRLVSTPILQSPNWNLPFEIMCDASNLALGVVLGQRID

XP_021603241.1 uncharacterized protein LOC110608326 [Manihot esculenta]9.6e-10069.71Show/hide
Query:  MIERLTGRSYFCFLDGFFGFFQIPIACEDQKKTTFTCDYGTYAFRCMPFGLCNAPGIFQRCMMSIFFDFIEKCIEVFMDDFTVYGDSFDACLASLESILN
        M+ERL G+S+FC LDGF GF+QIP+A EDQ+KTTFTC +GT+A+R MPFGLCNAP  FQRCM+SIF +F+EK IEVFMDDFTVYG+SFD CL +L  IL 
Subjt:  MIERLTGRSYFCFLDGFFGFFQIPIACEDQKKTTFTCDYGTYAFRCMPFGLCNAPGIFQRCMMSIFFDFIEKCIEVFMDDFTVYGDSFDACLASLESILN

Query:  RCIETNLVLNYEKCHFMVSHGIVLEHLVSSEGIEVDKAKIDIIQKFPYPTCLNDVRSFLGSAGFCRWFIKDFSKIALPLTNLLQKDVPFLIDDNCKKTFD
        RC+E+NLVLNYEKCHFMV  G++L H VS+ GIEVDKAK+D+I+  PYPTC+ D+RSFLG AGF R FIKDFS+IA PL  LLQKDVPF  D+ C++ F+
Subjt:  RCIETNLVLNYEKCHFMVSHGIVLEHLVSSEGIEVDKAKIDIIQKFPYPTCLNDVRSFLGSAGFCRWFIKDFSKIALPLTNLLQKDVPFLIDDNCKKTFD

Query:  DLKGRLVSTPILQSPNWNLPFEIMCDASNLALGVVLGQRID
         LK +LV+ PI+Q+PNW+ PFEIMCDASN ALG VLGQRID
Subjt:  DLKGRLVSTPILQSPNWNLPFEIMCDASNLALGVVLGQRID

TrEMBL top hitse value%identityAlignment
A0A151QZW2 Transposon Ty3-G Gag-Pol polyprotein (Fragment)4.0e-9969.29Show/hide
Query:  MIERLTGRSYFCFLDGFFGFFQIPIACEDQKKTTFTCDYGTYAFRCMPFGLCNAPGIFQRCMMSIFFDFIEKCIEVFMDDFTVYGDSFDACLASLESILN
        M+ERL G+S++ FLDGF G+FQI IA EDQ+KT FTC +GT+A+R MPFGLCNAPG FQRCM+SIF DF+E CIE+FMDDFTVYG SFDACL SL+  LN
Subjt:  MIERLTGRSYFCFLDGFFGFFQIPIACEDQKKTTFTCDYGTYAFRCMPFGLCNAPGIFQRCMMSIFFDFIEKCIEVFMDDFTVYGDSFDACLASLESILN

Query:  RCIETNLVLNYEKCHFMVSHGIVLEHLVSSEGIEVDKAKIDIIQKFPYPTCLNDVRSFLGSAGFCRWFIKDFSKIALPLTNLLQKDVPFLIDDNCKKTFD
        RCIETNLVLN+EKCHFMV  GIVL H++SS+GIEVD AK+ +I + PYP+C+ +VRSFLG AGF R F+K+FSK ALPL+NLLQKDV F+ DD CK+ FD
Subjt:  RCIETNLVLNYEKCHFMVSHGIVLEHLVSSEGIEVDKAKIDIIQKFPYPTCLNDVRSFLGSAGFCRWFIKDFSKIALPLTNLLQKDVPFLIDDNCKKTFD

Query:  DLKGRLVSTPILQSPNWNLPFEIMCDASNLALGVVLGQRID
         LK  L +TPI+Q+P+W +PFE+MCDASN ALG VL QR+D
Subjt:  DLKGRLVSTPILQSPNWNLPFEIMCDASNLALGVVLGQRID

A0A151SJM9 Retrovirus-related Pol polyprotein from transposon 17.62.1e-10070.12Show/hide
Query:  MIERLTGRSYFCFLDGFFGFFQIPIACEDQKKTTFTCDYGTYAFRCMPFGLCNAPGIFQRCMMSIFFDFIEKCIEVFMDDFTVYGDSFDACLASLESILN
        M+ERL G+S++CFLDGF G+FQI IA EDQ+KTTFTC +GT+A+R MPFGLCNAPG FQRCM+SIF DF+E CI+VFMDDFTVYG SFDACL SL+ +LN
Subjt:  MIERLTGRSYFCFLDGFFGFFQIPIACEDQKKTTFTCDYGTYAFRCMPFGLCNAPGIFQRCMMSIFFDFIEKCIEVFMDDFTVYGDSFDACLASLESILN

Query:  RCIETNLVLNYEKCHFMVSHGIVLEHLVSSEGIEVDKAKIDIIQKFPYPTCLNDVRSFLGSAGFCRWFIKDFSKIALPLTNLLQKDVPFLIDDNCKKTFD
        RCIETNLVLN+EKCHFMV  GIVL H++SS GI+VD AKI +I + PYP+C+ +VRSFLG AGF R FIKDFSK ALPL++LLQKD+ F  DD CK+ FD
Subjt:  RCIETNLVLNYEKCHFMVSHGIVLEHLVSSEGIEVDKAKIDIIQKFPYPTCLNDVRSFLGSAGFCRWFIKDFSKIALPLTNLLQKDVPFLIDDNCKKTFD

Query:  DLKGRLVSTPILQSPNWNLPFEIMCDASNLALGVVLGQRID
         LK  L +TPI+Q+P+W +PFE+MCDASN ALG VL QR+D
Subjt:  DLKGRLVSTPILQSPNWNLPFEIMCDASNLALGVVLGQRID

A0A445EXV2 Retrovirus-related Pol polyprotein from transposon 17.65.2e-9969.29Show/hide
Query:  MIERLTGRSYFCFLDGFFGFFQIPIACEDQKKTTFTCDYGTYAFRCMPFGLCNAPGIFQRCMMSIFFDFIEKCIEVFMDDFTVYGDSFDACLASLESILN
        M+ERL G+S++CFLDGF G+ QI IA EDQ+KTTFTC +GT+A+R MPFGLCNAPG  QRCM+SIF DF+E CIEVFMDDFTVYG SFD CL SLE +LN
Subjt:  MIERLTGRSYFCFLDGFFGFFQIPIACEDQKKTTFTCDYGTYAFRCMPFGLCNAPGIFQRCMMSIFFDFIEKCIEVFMDDFTVYGDSFDACLASLESILN

Query:  RCIETNLVLNYEKCHFMVSHGIVLEHLVSSEGIEVDKAKIDIIQKFPYPTCLNDVRSFLGSAGFCRWFIKDFSKIALPLTNLLQKDVPFLIDDNCKKTFD
        RCIETNLVLN+EKCHFMV  GIVL H++S++GIEVD AKI +I + PYP+C+ +VRSFLG AGF R FI+DFSK+ALPL+NLLQK+V F  +D CK+ FD
Subjt:  RCIETNLVLNYEKCHFMVSHGIVLEHLVSSEGIEVDKAKIDIIQKFPYPTCLNDVRSFLGSAGFCRWFIKDFSKIALPLTNLLQKDVPFLIDDNCKKTFD

Query:  DLKGRLVSTPILQSPNWNLPFEIMCDASNLALGVVLGQRID
         LK  L +TPI+Q+P+W  PFE+MCDASN ALG VL Q+ID
Subjt:  DLKGRLVSTPILQSPNWNLPFEIMCDASNLALGVVLGQRID

A0A445EY74 Reverse transcriptase1.0e-9969.71Show/hide
Query:  MIERLTGRSYFCFLDGFFGFFQIPIACEDQKKTTFTCDYGTYAFRCMPFGLCNAPGIFQRCMMSIFFDFIEKCIEVFMDDFTVYGDSFDACLASLESILN
        M+ERL G+S++CFLDGF G+ QI IA EDQ+KTTFTC +GT+A+R MPFGLCNAPG FQRCM+SIF DF+E CIEVFMDDFTVYG SFD CL SLE +LN
Subjt:  MIERLTGRSYFCFLDGFFGFFQIPIACEDQKKTTFTCDYGTYAFRCMPFGLCNAPGIFQRCMMSIFFDFIEKCIEVFMDDFTVYGDSFDACLASLESILN

Query:  RCIETNLVLNYEKCHFMVSHGIVLEHLVSSEGIEVDKAKIDIIQKFPYPTCLNDVRSFLGSAGFCRWFIKDFSKIALPLTNLLQKDVPFLIDDNCKKTFD
        RCIETNLVLN+EKCHFMV  GIVL H++S++GIEVD AKI +I + PYP+C+ +VRSFLG AGF R FI+DFSK+ALPL+NLLQK+V F  +D CK+ FD
Subjt:  RCIETNLVLNYEKCHFMVSHGIVLEHLVSSEGIEVDKAKIDIIQKFPYPTCLNDVRSFLGSAGFCRWFIKDFSKIALPLTNLLQKDVPFLIDDNCKKTFD

Query:  DLKGRLVSTPILQSPNWNLPFEIMCDASNLALGVVLGQRID
         LK  L +TPI+Q+P+W  PFE+MCDASN ALG VL Q+ID
Subjt:  DLKGRLVSTPILQSPNWNLPFEIMCDASNLALGVVLGQRID

A0A445LR48 Transposon Ty3-G Gag-Pol polyprotein1.0e-9969.71Show/hide
Query:  MIERLTGRSYFCFLDGFFGFFQIPIACEDQKKTTFTCDYGTYAFRCMPFGLCNAPGIFQRCMMSIFFDFIEKCIEVFMDDFTVYGDSFDACLASLESILN
        M+ERL G+S++CFLDGF G+ QI IA EDQ+KTTFTC +GT+A+R MPFGLCNAPG FQRCM+SIF DF+E CIEVFMDDFTVYG SFD CL SLE +LN
Subjt:  MIERLTGRSYFCFLDGFFGFFQIPIACEDQKKTTFTCDYGTYAFRCMPFGLCNAPGIFQRCMMSIFFDFIEKCIEVFMDDFTVYGDSFDACLASLESILN

Query:  RCIETNLVLNYEKCHFMVSHGIVLEHLVSSEGIEVDKAKIDIIQKFPYPTCLNDVRSFLGSAGFCRWFIKDFSKIALPLTNLLQKDVPFLIDDNCKKTFD
        RCIETNLVLN+EKCHFMV  GIVL H++S++GIEVD AKI +I + PYP+C+ +VRSFLG AGF R FI+DFSK+ALPL+NLLQK+V F  +D CK+ FD
Subjt:  RCIETNLVLNYEKCHFMVSHGIVLEHLVSSEGIEVDKAKIDIIQKFPYPTCLNDVRSFLGSAGFCRWFIKDFSKIALPLTNLLQKDVPFLIDDNCKKTFD

Query:  DLKGRLVSTPILQSPNWNLPFEIMCDASNLALGVVLGQRID
         LK  L +TPI+Q+P+W  PFE+MCDASN ALG VL Q+ID
Subjt:  DLKGRLVSTPILQSPNWNLPFEIMCDASNLALGVVLGQRID

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.62.9e-3837.23Show/hide
Query:  SYFCFLDGFFGFFQIPIACEDQKKTTFTCDYGTYAFRCMPFGLCNAPGIFQRCMMSIFFDFIEKCIEVFMDDFTVYGDSFDACLASLESILNRCIETNLV
        +YF  +D   GF QI +  E   KT F+  +G Y +  MPFGL NAP  FQRCM  I    + K   V++DD  V+  S D  L SL  +  +  + NL 
Subjt:  SYFCFLDGFFGFFQIPIACEDQKKTTFTCDYGTYAFRCMPFGLCNAPGIFQRCMMSIFFDFIEKCIEVFMDDFTVYGDSFDACLASLESILNRCIETNLV

Query:  LNYEKCHFMVSHGIVLEHLVSSEGIEVDKAKIDIIQKFPYPTCLNDVRSFLGSAGFCRWFIKDFSKIALPLTNLLQKDVPF-LIDDNCKKTFDDLKGRLV
        L  +KC F+      L H+++ +GI+ +  KI+ IQK+P PT   ++++FLG  G+ R FI +F+ IA P+T  L+K++     +      F  LK  + 
Subjt:  LNYEKCHFMVSHGIVLEHLVSSEGIEVDKAKIDIIQKFPYPTCLNDVRSFLGSAGFCRWFIKDFSKIALPLTNLLQKDVPF-LIDDNCKKTFDDLKGRLV

Query:  STPILQSPNWNLPFEIMCDASNLALGVVLGQ
          PIL+ P++   F +  DAS++ALG VL Q
Subjt:  STPILQSPNWNLPFEIMCDASNLALGVVLGQ

P10394 Retrovirus-related Pol polyprotein from transposon 4126.0e-3637.66Show/hide
Query:  MIERLTGRSYFCFLDGFFGFFQIPIACEDQKKTTFTCDYGTYAFRCMPFGLCNAPGIFQRCMMSIFFDFIEKC-IEVFMDDFTVYGDSFDACLASLESIL
        ++++L    YF  LD   GF QI +    +  T+F+   G+Y F  +PFGL  AP  FQR MM+I F  IE     ++MDD  V G S    L +L  + 
Subjt:  MIERLTGRSYFCFLDGFFGFFQIPIACEDQKKTTFTCDYGTYAFRCMPFGLCNAPGIFQRCMMSIFFDFIEKC-IEVFMDDFTVYGDSFDACLASLESIL

Query:  NRCIETNLVLNYEKCHFMVSHGIVLEHLVSSEGIEVDKAKIDIIQKFPYPTCLNDVRSFLGSAGFCRWFIKDFSKIALPLTNLLQKDVPFLIDDNCKKTF
         +C E NL L+ EKC F +     L H  + +GI  D  K D+IQ +P P   +  R F+    + R FIK+F+  +  +T L +K+VPF   D C+K F
Subjt:  NRCIETNLVLNYEKCHFMVSHGIVLEHLVSSEGIEVDKAKIDIIQKFPYPTCLNDVRSFLGSAGFCRWFIKDFSKIALPLTNLLQKDVPFLIDDNCKKTF

Query:  DDLKGRLVSTPILQSPNWNLPFEIMCDASNLALGVVLGQ
          LK +L++  +LQ P+++  F I  DAS  A G VL Q
Subjt:  DDLKGRLVSTPILQSPNWNLPFEIMCDASNLALGVVLGQ

P10401 Retrovirus-related Pol polyprotein from transposon gypsy3.3e-3433.2Show/hide
Query:  MIERLTGRSYFCFLDGFFGFFQIPIACEDQKKTTFTCDYGTYAFRCMPFGLCNAPGIFQRCMMSIFFDFIEKCIEVFMDDFTVYGDSFDACLASLESILN
        ++  L    +F  LD   G+ QI +A  D++KT+F+ + G Y F  +PFGL NA  IFQR +  +  + I K   V++DD  ++ ++    +  ++++L 
Subjt:  MIERLTGRSYFCFLDGFFGFFQIPIACEDQKKTTFTCDYGTYAFRCMPFGLCNAPGIFQRCMMSIFFDFIEKCIEVFMDDFTVYGDSFDACLASLESILN

Query:  RCIETNLVLNYEKCHFMVSHGIVLEHLVSSEGIEVDKAKIDIIQKFPYPTCLNDVRSFLGSAGFCRWFIKDFSKIALPLTNLLQ-----------KDVPF
          I+ N+ ++ EK  F       L  +VS +G + D  K+  IQ++P P C+  VRSFLG A + R FIKDF+ IA P+T++L+           K +P 
Subjt:  RCIETNLVLNYEKCHFMVSHGIVLEHLVSSEGIEVDKAKIDIIQKFPYPTCLNDVRSFLGSAGFCRWFIKDFSKIALPLTNLLQ-----------KDVPF

Query:  LIDDNCKKTFDDLKGRLVSTP-ILQSPNWNLPFEIMCDASNLALGVVLGQ
          ++  +  F  L+  L S   IL+ P++  PF++  DAS   +G VL Q
Subjt:  LIDDNCKKTFDDLKGRLVSTP-ILQSPNWNLPFEIMCDASNLALGVVLGQ

P20825 Retrovirus-related Pol polyprotein from transposon 2974.1e-3736.91Show/hide
Query:  YFCFLDGFFGFFQIPIACEDQKKTTFTCDYGTYAFRCMPFGLCNAPGIFQRCMMSIFFDFIEKCIEVFMDDFTVYGDSFDACLASLESILNRCIETNLVL
        YF  +D   GF QI +  E   KT F+   G Y +  MPFGL NAP  FQRCM +I    + K   V++DD  ++  S    L S++ +  +  + NL L
Subjt:  YFCFLDGFFGFFQIPIACEDQKKTTFTCDYGTYAFRCMPFGLCNAPGIFQRCMMSIFFDFIEKCIEVFMDDFTVYGDSFDACLASLESILNRCIETNLVL

Query:  NYEKCHFMVSHGIVLEHLVSSEGIEVDKAKIDIIQKFPYPTCLNDVRSFLGSAGFCRWFIKDFSKIALPLTNLLQKDVPFLIDDNCK----KTFDDLKGR
          +KC F+      L H+V+ +GI+ +  K+  I  +P PT   ++R+FLG  G+ R FI +++ IA P+T+ L+K       D  K    + F+ LK  
Subjt:  NYEKCHFMVSHGIVLEHLVSSEGIEVDKAKIDIIQKFPYPTCLNDVRSFLGSAGFCRWFIKDFSKIALPLTNLLQKDVPFLIDDNCK----KTFDDLKGR

Query:  LVSTPILQSPNWNLPFEIMCDASNLALGVVLGQ
        ++  PILQ P++   F +  DASNLALG VL Q
Subjt:  LVSTPILQSPNWNLPFEIMCDASNLALGVVLGQ

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus1.5e-3937.14Show/hide
Query:  LTGRSYFCFLDGFFGFFQIPIACEDQKKTTFTCDYGTYAFRCMPFGLCNAPGIFQRCMMSIFFDFIEKCIEVFMDDFTVYGDSFDACLASLESILNRCIE
        L    YF  LD   GF QI +   D  KT F+   G Y F  +PFGL NAP IFQR +  I  + I K   V++DD  V+ + +D    +L  +L    +
Subjt:  LTGRSYFCFLDGFFGFFQIPIACEDQKKTTFTCDYGTYAFRCMPFGLCNAPGIFQRCMMSIFFDFIEKCIEVFMDDFTVYGDSFDACLASLESILNRCIE

Query:  TNLVLNYEKCHFMVSHGIVLEHLVSSEGIEVDKAKIDIIQKFPYPTCLNDVRSFLGSAGFCRWFIKDFSKIALPLTNLLQ-----------KDVPFLIDD
         NL +N EK HF+ +    L ++V+++GI+ D  K+  I + P PT + +++ FLG   + R FI+D++K+A PLTNL +             VP  +D+
Subjt:  TNLVLNYEKCHFMVSHGIVLEHLVSSEGIEVDKAKIDIIQKFPYPTCLNDVRSFLGSAGFCRWFIKDFSKIALPLTNLLQ-----------KDVPFLIDD

Query:  NCKKTFDDLKGRLVSTPILQSPNWNLPFEIMCDASNLALGVVLGQ
           ++F+DLK  L S+ IL  P +  PF +  DASN A+G VL Q
Subjt:  NCKKTFDDLKGRLVSTPILQSPNWNLPFEIMCDASNLALGVVLGQ

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein6.6e-1435.09Show/hide
Query:  NYEKCHFMVSHGIVL--EHLVSSEGIEVDKAKIDIIQKFPYPTCLNDVRSFLGSAGFCRWFIKDFSKIALPLTNLLQKDVPFLIDDNCKKTFDDLKGRLV
        N +KC F       L   H++S EG+  D AK++ +  +P P    ++R FLG  G+ R F+K++ KI  PLT LL+K+      +     F  LKG + 
Subjt:  NYEKCHFMVSHGIVL--EHLVSSEGIEVDKAKIDIIQKFPYPTCLNDVRSFLGSAGFCRWFIKDFSKIALPLTNLLQKDVPFLIDDNCKKTFDDLKGRLV

Query:  STPILQSPNWNLPF
        + P+L  P+  LPF
Subjt:  STPILQSPNWNLPF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATAGAACGCTTAACGGGTAGATCATATTTTTGTTTCTTAGATGGATTTTTCGGTTTTTTCCAAATTCCTATAGCATGTGAAGACCAAAAGAAAACAACTTTTACTTG
TGATTATGGAACTTATGCATTTAGGTGTATGCCATTTGGTCTTTGTAATGCACCAGGCATATTCCAACGTTGCATGATGAGCATATTTTTTGACTTTATTGAAAAATGCA
TTGAAGTTTTCATGGATGATTTCACAGTTTATGGTGATAGTTTTGATGCATGTTTAGCCAGTCTAGAATCAATTCTAAATAGATGCATAGAAACTAACCTTGTCTTGAAT
TATGAAAAGTGTCATTTCATGGTGTCTCATGGTATAGTTTTAGAACATTTAGTGTCTTCAGAAGGAATAGAGGTAGATAAAGCAAAAATTGACATTATACAAAAATTTCC
ATATCCAACATGTTTAAATGATGTTAGATCCTTCCTTGGTAGTGCTGGCTTTTGTAGATGGTTTATAAAAGATTTTTCTAAAATTGCTTTGCCTCTAACTAATCTCTTGC
AAAAAGATGTACCATTTCTGATTGATGACAATTGTAAGAAGACATTTGATGATCTCAAAGGAAGGTTAGTCTCTACCCCTATCCTTCAATCTCCTAATTGGAATTTACCT
TTCGAAATAATGTGTGATGCAAGCAACTTAGCATTAGGAGTTGTTTTAGGACAAAGGATAGATTAA
mRNA sequenceShow/hide mRNA sequence
ATGATAGAACGCTTAACGGGTAGATCATATTTTTGTTTCTTAGATGGATTTTTCGGTTTTTTCCAAATTCCTATAGCATGTGAAGACCAAAAGAAAACAACTTTTACTTG
TGATTATGGAACTTATGCATTTAGGTGTATGCCATTTGGTCTTTGTAATGCACCAGGCATATTCCAACGTTGCATGATGAGCATATTTTTTGACTTTATTGAAAAATGCA
TTGAAGTTTTCATGGATGATTTCACAGTTTATGGTGATAGTTTTGATGCATGTTTAGCCAGTCTAGAATCAATTCTAAATAGATGCATAGAAACTAACCTTGTCTTGAAT
TATGAAAAGTGTCATTTCATGGTGTCTCATGGTATAGTTTTAGAACATTTAGTGTCTTCAGAAGGAATAGAGGTAGATAAAGCAAAAATTGACATTATACAAAAATTTCC
ATATCCAACATGTTTAAATGATGTTAGATCCTTCCTTGGTAGTGCTGGCTTTTGTAGATGGTTTATAAAAGATTTTTCTAAAATTGCTTTGCCTCTAACTAATCTCTTGC
AAAAAGATGTACCATTTCTGATTGATGACAATTGTAAGAAGACATTTGATGATCTCAAAGGAAGGTTAGTCTCTACCCCTATCCTTCAATCTCCTAATTGGAATTTACCT
TTCGAAATAATGTGTGATGCAAGCAACTTAGCATTAGGAGTTGTTTTAGGACAAAGGATAGATTAA
Protein sequenceShow/hide protein sequence
MIERLTGRSYFCFLDGFFGFFQIPIACEDQKKTTFTCDYGTYAFRCMPFGLCNAPGIFQRCMMSIFFDFIEKCIEVFMDDFTVYGDSFDACLASLESILNRCIETNLVLN
YEKCHFMVSHGIVLEHLVSSEGIEVDKAKIDIIQKFPYPTCLNDVRSFLGSAGFCRWFIKDFSKIALPLTNLLQKDVPFLIDDNCKKTFDDLKGRLVSTPILQSPNWNLP
FEIMCDASNLALGVVLGQRID