CuGenDBv2

Cmc01g0010021 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc01g0010021
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Gag-pol polyprotein
Genome location: CMiso1.1chr01:4648999..4649922
RNA-Seq Expression: Cmc01g0010021
Synteny: Cmc01g0010021
Gene Ontology terms:
  GO:0006508 - proteolysis (biological process)
  GO:0015074 - DNA integration (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008233 - peptidase activity (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
  IPR001584 - Integrase, catalytic core
  IPR012337 - Ribonuclease H-like superfamily
  IPR025724 - GAG-pre-integrase domain
  IPR036397 - Ribonuclease H superfamily
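The genome location string above can be split into a sequence name and a coordinate pair. The following is a minimal sketch, assuming the `name:start..end` layout seen in this record and 1-based inclusive coordinates (an assumption, not something the record states); `parse_location` is a hypothetical helper name.

```python
import re

def parse_location(location: str):
    """Split 'name:start..end' into (name, start, end, span).

    Assumes 1-based inclusive coordinates, so span = end - start + 1.
    """
    match = re.fullmatch(r"(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    start = int(match.group("start"))
    end = int(match.group("end"))
    return match.group("seqid"), start, end, end - start + 1

seqid, start, end, span = parse_location("CMiso1.1chr01:4648999..4649922")
print(seqid, start, end, span)
```

Under the inclusive-coordinate assumption, the span can be compared with the length of the CDS sequence listed further down the record.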


Homology
GenBank top hits (e value, % identity, alignment)
KAA0036855.1 gag-pol polyprotein [Cucumis melo var. makuwa] (e value: 1.6e-131; % identity: 71.8)
Query:  SGTYQLTRSNQTWLWHKKLGHVSMRGLEKIIKSKAIVGIPNLDVKGNFFCGDCQFGKQTKSTHKSMKECYTNRVLELLYMDLMGSMQTESLGGKRYVLVV
        S    LT+ +QTWLWH+KLGH+S+R L+K+I+++A+VGIP+LD+ G FFCG+CQ GKQTK++H+ +KECYT  VLELL+++LMG MQTESLGGK+YVLVV
Subjt:  SGTYQLTRSNQTWLWHKKLGHVSMRGLEKIIKSKAIVGIPNLDVKGNFFCGDCQFGKQTKSTHKSMKECYTNRVLELLYMDLMGSMQTESLGGKRYVLVV

Query:  VDDYSRYAWVCFLKGKTDNIEICKNLCLKLQREKGKKIIRIRSDHGKEFDNESFSNFYLLEGIHHEFSAPITPQQNGVVEIKNRTLQEITRVMIHAKNLP
        VDDYS++ WV FLK K D +++C +LCL LQREKG+KIIRIRSDHGKEFDNE  +N    EGIHHEF+APITPQQNGVVE KNR LQE+ RVMIHAKN P
Subjt:  VDDYSRYAWVCFLKGKTDNIEICKNLCLKLQREKGKKIIRIRSDHGKEFDNESFSNFYLLEGIHHEFSAPITPQQNGVVEIKNRTLQEITRVMIHAKNLP

Query:  LCFWAEAVNTVCHIHNRVNIRSGTTVTLYELWKDRKPNVKYFHVFGSTCYILADREYHQKWDAKSEQAIFLGYSQNSRAYRVFNNRSGSVMETINVVIND
        L FWAE VNT CHIH RV  R GTT+TLYELWK RKPNVKYFH+FGSTCYILADREYH+KWD KS Q IFL YSQNSRAYRVFN +SG+VME INVV+ND
Subjt:  LCFWAEAVNTVCHIHNRVNIRSGTTVTLYELWKDRKPNVKYFHVFGSTCYILADREYHQKWDAKSEQAIFLGYSQNSRAYRVFNNRSGSVMETINVVIND

Query:  LDSAI
         +S +
Subjt:  LDSAI

KAA0042995.1 gag-pol polyprotein [Cucumis melo var. makuwa] (e value: 5.8e-134; % identity: 77.52)
Query:  MSGTYQLTRSNQTWLWHKKLGHVSMRGLEKIIKSKAIVGIPNLDVKGNFFCGDCQFGKQTKSTHKSMKECYTNRVLELLYMDLMGSMQTESLGGKRYVLV
        MS T QL RS+QTWLWH+KLGHVSMRGLEK+IK+KA+VGIPNLDV GNFFC DCQ GKQT+STHKS+KECYTNRVLELL+MDLMG MQT+SLG       
Subjt:  MSGTYQLTRSNQTWLWHKKLGHVSMRGLEKIIKSKAIVGIPNLDVKGNFFCGDCQFGKQTKSTHKSMKECYTNRVLELLYMDLMGSMQTESLGGKRYVLV

Query:  VVDDYSRYAWVCFLKGKTDNIEICKNLCLKLQREKGKKIIRIRSDHGKEFDNESFSNFYLLEGIHHEFSAPITPQQNGVVEIKNRTLQEITRVMIHAKNL
                       GKTD +EICKNLCLKLQRE+ KKI RIRSDHGKEFDNE F++F LLEG HHEFSAPITPQQNGVVE KN+TLQE+ RVMIHAKNL
Subjt:  VVDDYSRYAWVCFLKGKTDNIEICKNLCLKLQREKGKKIIRIRSDHGKEFDNESFSNFYLLEGIHHEFSAPITPQQNGVVEIKNRTLQEITRVMIHAKNL

Query:  PLCFWAEAVNTVCHIHNRVNIRSGTTVTLYELWKDRKPNVKYFHVFGSTCYILADREYHQKWDAKSEQAIFLGYSQNSRAYRVFNNRSGSVMETINVVIN
        PLCF+AEAVNT CHIHNRV IR+GTT+TLYE WK+RK NVKYFHVFGSTCYILADREYH+KWDA+SEQ IFL YSQ SRAYRV+NNRS SVMETIN  IN
Subjt:  PLCFWAEAVNTVCHIHNRVNIRSGTTVTLYELWKDRKPNVKYFHVFGSTCYILADREYHQKWDAKSEQAIFLGYSQNSRAYRVFNNRSGSVMETINVVIN

Query:  DLDSAIK
        DLDSAIK
Subjt:  DLDSAIK

KAA0054354.1 gag-pol polyprotein [Cucumis melo var. makuwa] (e value: 5.8e-142; % identity: 82.06)
Query:  LTRSNQTWLWHKKLGHVSMRGLEKIIKSKAIVGIPNLDVKGNFFCGDCQFGKQTKSTHKSMKECYTNRVLELLYMDLMGSMQTESLGGKRYVLVVVDDYS
        + RS+QTW+WHKKLGHVSMRGLEKIIK+KA+VGIP+L+V GNFFCG         STHKS K CYTNRVLELL+MDLM  M+TES GGKRYVLVVV DYS
Subjt:  LTRSNQTWLWHKKLGHVSMRGLEKIIKSKAIVGIPNLDVKGNFFCGDCQFGKQTKSTHKSMKECYTNRVLELLYMDLMGSMQTESLGGKRYVLVVVDDYS

Query:  RYAWVCFLKGKTDNIEICKNLCLKLQREKGKKIIRIRSDHGKEFDNESFSNFYLLEGIHHEFSAPITPQQNGVVEIKNRTLQEITRVMIHAKNLPLCFWA
        RY WVCFL+GKTD +EICKNLCLKLQREKGKKI RIRSDHGKEFDNE F++F LLEGI HEFSAPITPQQNGVVE KNRTLQE+  VMIHAKNLP+CFWA
Subjt:  RYAWVCFLKGKTDNIEICKNLCLKLQREKGKKIIRIRSDHGKEFDNESFSNFYLLEGIHHEFSAPITPQQNGVVEIKNRTLQEITRVMIHAKNLPLCFWA

Query:  EAVNTVCHIHNRVNIRSGTTVTLYELWKDRKPNVKYFHVFGSTCYILADREYHQKWDAKSEQAIFLGYSQNSRAYRVFNNRSGSVMETINVVINDLDSAI
        EAVN  CHIHNRV IR+G TVTLYELWK+RKPNVKYFHVFGSTCYILADREY QKWDA+SEQ IFLGYSQNS AYRVFNNRSG+V+ETINVVINDLDSA+
Subjt:  EAVNTVCHIHNRVNIRSGTTVTLYELWKDRKPNVKYFHVFGSTCYILADREYHQKWDAKSEQAIFLGYSQNSRAYRVFNNRSGSVMETINVVINDLDSAI

Query:  K
        K
Subjt:  K

PNX99503.1 gag-protease polyprotein, partial [Trifolium pratense] (e value: 1.4e-108; % identity: 62.46)
Query:  TYQLTRSNQTWLWHKKLGHVSMRGLEKIIKSKAIVGIPNLDVKGNFFCGDCQFGKQTKSTHKSMKECYTNRVLELLYMDLMGSMQTESLGGKRYVLVVVD
        T  +T+ ++  LWH++LGH+++R ++K I  +AI G+PNL ++    CGDCQ GKQTK  H  ++   T RVLELL+MDLMG MQTESLGGKRY  VVVD
Subjt:  TYQLTRSNQTWLWHKKLGHVSMRGLEKIIKSKAIVGIPNLDVKGNFFCGDCQFGKQTKSTHKSMKECYTNRVLELLYMDLMGSMQTESLGGKRYVLVVVD

Query:  DYSRYAWVCFLKGKTDNIEICKNLCLKLQREKGKKIIRIRSDHGKEFDNESFSNFYLLEGIHHEFSAPITPQQNGVVEIKNRTLQEITRVMIHAKNLPLC
        DYSRY W+ F++ K++  ++ K+LC++LQREK   ++RIRSDHGKEF+N SFS+F   EGI HEFS+PITPQQNGVVE KNRT+QE  RVM+HAK L   
Subjt:  DYSRYAWVCFLKGKTDNIEICKNLCLKLQREKGKKIIRIRSDHGKEFDNESFSNFYLLEGIHHEFSAPITPQQNGVVEIKNRTLQEITRVMIHAKNLPLC

Query:  FWAEAVNTVCHIHNRVNIRSGTTVTLYELWKDRKPNVKYFHVFGSTCYILADREYHQKWDAKSEQAIFLGYSQNSRAYRVFNNRSGSVMETINVVINDLD
        FWAEA+NT C+IHNRV +RSGTT TLYELWK+RKP VK+FHVFGS CYIL+DRE   K D KS++ IFLGYS NSRAYRV+N R+ ++ME+INVVI+D  
Subjt:  FWAEAVNTVCHIHNRVNIRSGTTVTLYELWKDRKPNVKYFHVFGSTCYILADREYHQKWDAKSEQAIFLGYSQNSRAYRVFNNRSGSVMETINVVINDLD

Query:  S
        S
Subjt:  S

TYK26041.1 gag/pol polyprotein [Cucumis melo var. makuwa] (e value: 1.1e-108; % identity: 63.93)
Query:  SGTYQLTRSNQTWLWHKKLGHVSMRGLEKIIKSKAIVGIPNLDVKGNFFCGDCQFGKQTKSTHKSMKECYTNRVLELLYMDLMGSMQTESLGGKRYVLVV
        S    LT+ +QTWLWH+KLGH+S+R L+K+I++KA+VGIP+LD+ G FFCGDC+ GKQTK +H+ +KECYT RVLELL++DL+G M+TESLG K+YVLVV
Subjt:  SGTYQLTRSNQTWLWHKKLGHVSMRGLEKIIKSKAIVGIPNLDVKGNFFCGDCQFGKQTKSTHKSMKECYTNRVLELLYMDLMGSMQTESLGGKRYVLVV

Query:  VDDYSRYAWVCFLKGKTDNIEICKNLCLKLQREKGKKIIRIRSDHGKEFDNESFSNFYLLEGIHHEFSAPITPQQNGVVEIKNRTLQEITRVMIHAKNLP
        VDDYSR+  V FLKGK+D +++C +L L LQREKG+KIIRIRSDHGKEFDNE  +NF   EGIHHEF APITPQQNGVVE KNRT               
Subjt:  VDDYSRYAWVCFLKGKTDNIEICKNLCLKLQREKGKKIIRIRSDHGKEFDNESFSNFYLLEGIHHEFSAPITPQQNGVVEIKNRTLQEITRVMIHAKNLP

Query:  LCFWAEAVNTVCHIHNRVNIRSGTTVTLYELWKDRKPNVKYFHVFGSTCYILADREYHQKWDAKSEQAIFLGYSQNSRAYRVFNNRSGSVMETINVVIND
                         V  RSGTT+ LYELWK RKPNVKYFH+FGSTCYILADR YH+KWD KS+Q IFLGYSQNSR YRVFN +S +VMETINVV+ND
Subjt:  LCFWAEAVNTVCHIHNRVNIRSGTTVTLYELWKDRKPNVKYFHVFGSTCYILADREYHQKWDAKSEQAIFLGYSQNSRAYRVFNNRSGSVMETINVVIND

Query:  LDSAI
          S +
Subjt:  LDSAI

TrEMBL top hits (e value, % identity, alignment)
A0A2K3N8X7 Gag-protease polyprotein (Fragment) (e value: 7.0e-109; % identity: 62.46)
Query:  TYQLTRSNQTWLWHKKLGHVSMRGLEKIIKSKAIVGIPNLDVKGNFFCGDCQFGKQTKSTHKSMKECYTNRVLELLYMDLMGSMQTESLGGKRYVLVVVD
        T  +T+ ++  LWH++LGH+++R ++K I  +AI G+PNL ++    CGDCQ GKQTK  H  ++   T RVLELL+MDLMG MQTESLGGKRY  VVVD
Subjt:  TYQLTRSNQTWLWHKKLGHVSMRGLEKIIKSKAIVGIPNLDVKGNFFCGDCQFGKQTKSTHKSMKECYTNRVLELLYMDLMGSMQTESLGGKRYVLVVVD

Query:  DYSRYAWVCFLKGKTDNIEICKNLCLKLQREKGKKIIRIRSDHGKEFDNESFSNFYLLEGIHHEFSAPITPQQNGVVEIKNRTLQEITRVMIHAKNLPLC
        DYSRY W+ F++ K++  ++ K+LC++LQREK   ++RIRSDHGKEF+N SFS+F   EGI HEFS+PITPQQNGVVE KNRT+QE  RVM+HAK L   
Subjt:  DYSRYAWVCFLKGKTDNIEICKNLCLKLQREKGKKIIRIRSDHGKEFDNESFSNFYLLEGIHHEFSAPITPQQNGVVEIKNRTLQEITRVMIHAKNLPLC

Query:  FWAEAVNTVCHIHNRVNIRSGTTVTLYELWKDRKPNVKYFHVFGSTCYILADREYHQKWDAKSEQAIFLGYSQNSRAYRVFNNRSGSVMETINVVINDLD
        FWAEA+NT C+IHNRV +RSGTT TLYELWK+RKP VK+FHVFGS CYIL+DRE   K D KS++ IFLGYS NSRAYRV+N R+ ++ME+INVVI+D  
Subjt:  FWAEAVNTVCHIHNRVNIRSGTTVTLYELWKDRKPNVKYFHVFGSTCYILADREYHQKWDAKSEQAIFLGYSQNSRAYRVFNNRSGSVMETINVVINDLD

Query:  S
        S
Subjt:  S

A0A5A7TNK7 Gag-pol polyprotein (e value: 2.8e-134; % identity: 77.52)
Query:  MSGTYQLTRSNQTWLWHKKLGHVSMRGLEKIIKSKAIVGIPNLDVKGNFFCGDCQFGKQTKSTHKSMKECYTNRVLELLYMDLMGSMQTESLGGKRYVLV
        MS T QL RS+QTWLWH+KLGHVSMRGLEK+IK+KA+VGIPNLDV GNFFC DCQ GKQT+STHKS+KECYTNRVLELL+MDLMG MQT+SLG       
Subjt:  MSGTYQLTRSNQTWLWHKKLGHVSMRGLEKIIKSKAIVGIPNLDVKGNFFCGDCQFGKQTKSTHKSMKECYTNRVLELLYMDLMGSMQTESLGGKRYVLV

Query:  VVDDYSRYAWVCFLKGKTDNIEICKNLCLKLQREKGKKIIRIRSDHGKEFDNESFSNFYLLEGIHHEFSAPITPQQNGVVEIKNRTLQEITRVMIHAKNL
                       GKTD +EICKNLCLKLQRE+ KKI RIRSDHGKEFDNE F++F LLEG HHEFSAPITPQQNGVVE KN+TLQE+ RVMIHAKNL
Subjt:  VVDDYSRYAWVCFLKGKTDNIEICKNLCLKLQREKGKKIIRIRSDHGKEFDNESFSNFYLLEGIHHEFSAPITPQQNGVVEIKNRTLQEITRVMIHAKNL

Query:  PLCFWAEAVNTVCHIHNRVNIRSGTTVTLYELWKDRKPNVKYFHVFGSTCYILADREYHQKWDAKSEQAIFLGYSQNSRAYRVFNNRSGSVMETINVVIN
        PLCF+AEAVNT CHIHNRV IR+GTT+TLYE WK+RK NVKYFHVFGSTCYILADREYH+KWDA+SEQ IFL YSQ SRAYRV+NNRS SVMETIN  IN
Subjt:  PLCFWAEAVNTVCHIHNRVNIRSGTTVTLYELWKDRKPNVKYFHVFGSTCYILADREYHQKWDAKSEQAIFLGYSQNSRAYRVFNNRSGSVMETINVVIN

Query:  DLDSAIK
        DLDSAIK
Subjt:  DLDSAIK

A0A5D3BA69 Gag-pol polyprotein (e value: 7.7e-132; % identity: 71.8)
Query:  SGTYQLTRSNQTWLWHKKLGHVSMRGLEKIIKSKAIVGIPNLDVKGNFFCGDCQFGKQTKSTHKSMKECYTNRVLELLYMDLMGSMQTESLGGKRYVLVV
        S    LT+ +QTWLWH+KLGH+S+R L+K+I+++A+VGIP+LD+ G FFCG+CQ GKQTK++H+ +KECYT  VLELL+++LMG MQTESLGGK+YVLVV
Subjt:  SGTYQLTRSNQTWLWHKKLGHVSMRGLEKIIKSKAIVGIPNLDVKGNFFCGDCQFGKQTKSTHKSMKECYTNRVLELLYMDLMGSMQTESLGGKRYVLVV

Query:  VDDYSRYAWVCFLKGKTDNIEICKNLCLKLQREKGKKIIRIRSDHGKEFDNESFSNFYLLEGIHHEFSAPITPQQNGVVEIKNRTLQEITRVMIHAKNLP
        VDDYS++ WV FLK K D +++C +LCL LQREKG+KIIRIRSDHGKEFDNE  +N    EGIHHEF+APITPQQNGVVE KNR LQE+ RVMIHAKN P
Subjt:  VDDYSRYAWVCFLKGKTDNIEICKNLCLKLQREKGKKIIRIRSDHGKEFDNESFSNFYLLEGIHHEFSAPITPQQNGVVEIKNRTLQEITRVMIHAKNLP

Query:  LCFWAEAVNTVCHIHNRVNIRSGTTVTLYELWKDRKPNVKYFHVFGSTCYILADREYHQKWDAKSEQAIFLGYSQNSRAYRVFNNRSGSVMETINVVIND
        L FWAE VNT CHIH RV  R GTT+TLYELWK RKPNVKYFH+FGSTCYILADREYH+KWD KS Q IFL YSQNSRAYRVFN +SG+VME INVV+ND
Subjt:  LCFWAEAVNTVCHIHNRVNIRSGTTVTLYELWKDRKPNVKYFHVFGSTCYILADREYHQKWDAKSEQAIFLGYSQNSRAYRVFNNRSGSVMETINVVIND

Query:  LDSAI
         +S +
Subjt:  LDSAI

A0A5D3CM30 Gag-pol polyprotein (e value: 2.8e-142; % identity: 82.06)
Query:  LTRSNQTWLWHKKLGHVSMRGLEKIIKSKAIVGIPNLDVKGNFFCGDCQFGKQTKSTHKSMKECYTNRVLELLYMDLMGSMQTESLGGKRYVLVVVDDYS
        + RS+QTW+WHKKLGHVSMRGLEKIIK+KA+VGIP+L+V GNFFCG         STHKS K CYTNRVLELL+MDLM  M+TES GGKRYVLVVV DYS
Subjt:  LTRSNQTWLWHKKLGHVSMRGLEKIIKSKAIVGIPNLDVKGNFFCGDCQFGKQTKSTHKSMKECYTNRVLELLYMDLMGSMQTESLGGKRYVLVVVDDYS

Query:  RYAWVCFLKGKTDNIEICKNLCLKLQREKGKKIIRIRSDHGKEFDNESFSNFYLLEGIHHEFSAPITPQQNGVVEIKNRTLQEITRVMIHAKNLPLCFWA
        RY WVCFL+GKTD +EICKNLCLKLQREKGKKI RIRSDHGKEFDNE F++F LLEGI HEFSAPITPQQNGVVE KNRTLQE+  VMIHAKNLP+CFWA
Subjt:  RYAWVCFLKGKTDNIEICKNLCLKLQREKGKKIIRIRSDHGKEFDNESFSNFYLLEGIHHEFSAPITPQQNGVVEIKNRTLQEITRVMIHAKNLPLCFWA

Query:  EAVNTVCHIHNRVNIRSGTTVTLYELWKDRKPNVKYFHVFGSTCYILADREYHQKWDAKSEQAIFLGYSQNSRAYRVFNNRSGSVMETINVVINDLDSAI
        EAVN  CHIHNRV IR+G TVTLYELWK+RKPNVKYFHVFGSTCYILADREY QKWDA+SEQ IFLGYSQNS AYRVFNNRSG+V+ETINVVINDLDSA+
Subjt:  EAVNTVCHIHNRVNIRSGTTVTLYELWKDRKPNVKYFHVFGSTCYILADREYHQKWDAKSEQAIFLGYSQNSRAYRVFNNRSGSVMETINVVINDLDSAI

Query:  K
        K
Subjt:  K

A0A5D3DQT9 Gag/pol polyprotein (e value: 5.4e-109; % identity: 63.93)
Query:  SGTYQLTRSNQTWLWHKKLGHVSMRGLEKIIKSKAIVGIPNLDVKGNFFCGDCQFGKQTKSTHKSMKECYTNRVLELLYMDLMGSMQTESLGGKRYVLVV
        S    LT+ +QTWLWH+KLGH+S+R L+K+I++KA+VGIP+LD+ G FFCGDC+ GKQTK +H+ +KECYT RVLELL++DL+G M+TESLG K+YVLVV
Subjt:  SGTYQLTRSNQTWLWHKKLGHVSMRGLEKIIKSKAIVGIPNLDVKGNFFCGDCQFGKQTKSTHKSMKECYTNRVLELLYMDLMGSMQTESLGGKRYVLVV

Query:  VDDYSRYAWVCFLKGKTDNIEICKNLCLKLQREKGKKIIRIRSDHGKEFDNESFSNFYLLEGIHHEFSAPITPQQNGVVEIKNRTLQEITRVMIHAKNLP
        VDDYSR+  V FLKGK+D +++C +L L LQREKG+KIIRIRSDHGKEFDNE  +NF   EGIHHEF APITPQQNGVVE KNRT               
Subjt:  VDDYSRYAWVCFLKGKTDNIEICKNLCLKLQREKGKKIIRIRSDHGKEFDNESFSNFYLLEGIHHEFSAPITPQQNGVVEIKNRTLQEITRVMIHAKNLP

Query:  LCFWAEAVNTVCHIHNRVNIRSGTTVTLYELWKDRKPNVKYFHVFGSTCYILADREYHQKWDAKSEQAIFLGYSQNSRAYRVFNNRSGSVMETINVVIND
                         V  RSGTT+ LYELWK RKPNVKYFH+FGSTCYILADR YH+KWD KS+Q IFLGYSQNSR YRVFN +S +VMETINVV+ND
Subjt:  LCFWAEAVNTVCHIHNRVNIRSGTTVTLYELWKDRKPNVKYFHVFGSTCYILADREYHQKWDAKSEQAIFLGYSQNSRAYRVFNNRSGSVMETINVVIND

Query:  LDSAI
          S +
Subjt:  LDSAI

SwissProt top hits (e value, % identity, alignment)
A0A0B7P3V8 Transposon Ty4-P Gag-Pol polyprotein (e value: 1.2e-17; % identity: 26.85)
Query:  HKKLGHVSMRGLEKIIK------SKAIVGIPNLDVKGNFFCGDCQFGKQTKSTH--KSMKECYTNRVLELLY-MDLMGSMQTESLGGKRYVLVVVDDYSR
        HK++GH  ++ +E  IK      S  ++  PN      F+C  C+  K TK  H   SM    T+      + MD+ G + + +   KRY+L++VD+ +R
Subjt:  HKKLGHVSMRGLEKIIK------SKAIVGIPNLDVKGNFFCGDCQFGKQTKSTH--KSMKECYTNRVLELLY-MDLMGSMQTESLGGKRYVLVVVDDYSR

Query:  YAWVC--FLK-GKTDNIEICKNLCLKLQREKGKKIIRIRSDHGKEFDNESFSNFYLLEGIHHEFSAPITPQQNGVVEIKNRTLQEITRVMIHAKNLPLCF
        Y      F K  +T   +I KN+   ++ +  +K+  I SD G EF N+    +++ +GIHH  ++      NG  E   RT+      ++   NL + F
Subjt:  YAWVC--FLK-GKTDNIEICKNLCLKLQREKGKKIIRIRSDHGKEFDNESFSNFYLLEGIHHEFSAPITPQQNGVVEIKNRTLQEITRVMIHAKNLPLCF

Query:  WAEAVNTVCHIHNRVNIRSGTTVTLYELWKDRKP---NVKYFHVFGSTCYILADREYHQKWDAKSEQAIFLGYSQNSRAYRVFNNRSGSVMETINVVI
        W  AV +  +I N +  +S   + L  +   R+P    +  F  FG    I      H+K       +I L    NS  Y+ F      ++ + N  I
Subjt:  WAEAVNTVCHIHNRVNIRSGTTVTLYELWKDRKP---NVKYFHVFGSTCYILADREYHQKWDAKSEQAIFLGYSQNSRAYRVFNNRSGSVMETINVVI

P04146 Copia protein (e value: 1.0e-32; % identity: 30.4)
Query:  NQTWLWHKKLGHVSMRGLEKIIKSKAIVG---IPNLDVKGNFFCGDCQFGKQTKSTHKSMKE-CYTNRVLELLYMDLMGSMQTESLGGKRYVLVVVDDYS
        N   LWH++ GH+S   L +I +         + NL++     C  C  GKQ +   K +K+  +  R L +++ D+ G +   +L  K Y ++ VD ++
Subjt:  NQTWLWHKKLGHVSMRGLEKIIKSKAIVG---IPNLDVKGNFFCGDCQFGKQTKSTHKSMKE-CYTNRVLELLYMDLMGSMQTESLGGKRYVLVVVDDYS

Query:  RYAWVCFLKGKTDNIEICKNLCLKLQREKGKKIIRIRSDHGKEFDNESFSNFYLLEGIHHEFSAPITPQQNGVVEIKNRTLQEITRVMIHAKNLPLCFWA
         Y     +K K+D   + ++   K +     K++ +  D+G+E+ +     F + +GI +  + P TPQ NGV E   RT+ E  R M+    L   FW 
Subjt:  RYAWVCFLKGKTDNIEICKNLCLKLQREKGKKIIRIRSDHGKEFDNESFSNFYLLEGIHHEFSAPITPQQNGVVEIKNRTLQEITRVMIHAKNLPLCFWA

Query:  EAVNTVCHIHNRVNIRS--GTTVTLYELWKDRKPNVKYFHVFGSTCYILADREYHQKWDAKSEQAIFLGYSQN
        EAV T  ++ NR+  R+   ++ T YE+W ++KP +K+  VFG+T Y+   +    K+D KS ++IF+GY  N
Subjt:  EAVNTVCHIHNRVNIRS--GTTVTLYELWKDRKPNVKYFHVFGSTCYILADREYHQKWDAKSEQAIFLGYSQN

P0C2J7 Transposon Ty4-H Gag-Pol polyprotein (e value: 1.2e-17; % identity: 26.85)
Query:  HKKLGHVSMRGLEKIIK------SKAIVGIPNLDVKGNFFCGDCQFGKQTKSTH--KSMKECYTNRVLELLY-MDLMGSMQTESLGGKRYVLVVVDDYSR
        HK++GH  ++ +E  IK      S  ++  PN      F+C  C+  K TK  H   SM    T+      + MD+ G + + +   KRY+L++VD+ +R
Subjt:  HKKLGHVSMRGLEKIIK------SKAIVGIPNLDVKGNFFCGDCQFGKQTKSTH--KSMKECYTNRVLELLY-MDLMGSMQTESLGGKRYVLVVVDDYSR

Query:  YAWVC--FLK-GKTDNIEICKNLCLKLQREKGKKIIRIRSDHGKEFDNESFSNFYLLEGIHHEFSAPITPQQNGVVEIKNRTLQEITRVMIHAKNLPLCF
        Y      F K  +T   +I KN+   ++ +  +K+  I SD G EF N+    +++ +GIHH  ++      NG  E   RT+      ++   NL + F
Subjt:  YAWVC--FLK-GKTDNIEICKNLCLKLQREKGKKIIRIRSDHGKEFDNESFSNFYLLEGIHHEFSAPITPQQNGVVEIKNRTLQEITRVMIHAKNLPLCF

Query:  WAEAVNTVCHIHNRVNIRSGTTVTLYELWKDRKP---NVKYFHVFGSTCYILADREYHQKWDAKSEQAIFLGYSQNSRAYRVFNNRSGSVMETINVVI
        W  AV +  +I N +  +S   + L  +   R+P    +  F  FG    I      H+K       +I L    NS  Y+ F      ++ + N  I
Subjt:  WAEAVNTVCHIHNRVNIRSGTTVTLYELWKDRKP---NVKYFHVFGSTCYILADREYHQKWDAKSEQAIFLGYSQNSRAYRVFNNRSGSVMETINVVI

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 2.4e-45; % identity: 34.71)
Query:  LWHKKLGHVSMRGLEKIIKSKAIVGIPNLDVKGNFFCGDCQFGKQTKSTHKSMKECYTNRVLELLYMDLMGSMQTESLGGKRYVLVVVDDYSRYAWVCFL
        LWHK++GH+S +GL+ + K   I       VK    C  C FGKQ + + ++  E   N +L+L+Y D+ G M+ ES+GG +Y +  +DD SR  WV  L
Subjt:  LWHKKLGHVSMRGLEKIIKSKAIVGIPNLDVKGNFFCGDCQFGKQTKSTHKSMKECYTNRVLELLYMDLMGSMQTESLGGKRYVLVVVDDYSRYAWVCFL

Query:  KGKTDNIEICKNLCLKLQREKGKKIIRIRSDHGKEFDNESFSNFYLLEGIHHEFSAPITPQQNGVVEIKNRTLQEITRVMIHAKNLPLCFWAEAVNTVCH
        K K    ++ +     ++RE G+K+ R+RSD+G E+ +  F  +    GI HE + P TPQ NGV E  NRT+ E  R M+    LP  FW EAV T C+
Subjt:  KGKTDNIEICKNLCLKLQREKGKKIIRIRSDHGKEFDNESFSNFYLLEGIHHEFSAPITPQQNGVVEIKNRTLQEITRVMIHAKNLPLCFWAEAVNTVCH

Query:  IHNRVNIRSGTTVTLYEL----WKDRKPNVKYFHVFGSTCYILADREYHQKWDAKSEQAIFLGYSQNSRAYRVFNNRSGSVMETINVVIND
        + N    RS +    +E+    W +++ +  +  VFG   +    +E   K D KS   IF+GY      YR+++     V+ + +VV  +
Subjt:  IHNRVNIRSGTTVTLYEL----WKDRKPNVKYFHVFGSTCYILADREYHQKWDAKSEQAIFLGYSQNSRAYRVFNNRSGSVMETINVVIND

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 2.4e-21; % identity: 29.85)
Query:  WHKKLGHVSMRGLEKIIKSKAIVGIPNLDVKGNFF-CGDCQFGKQTKSTHKSMKECYTNRVLELLYMDLMGSMQTESLGGKRYVLVVVDDYSRYAWVCFL
        WH +LGH +   L  +I +     +  L+    F  C DC   K  K    S     + R LE +Y D+  S    S    RY ++ VD ++RY W+  L
Subjt:  WHKKLGHVSMRGLEKIIKSKAIVGIPNLDVKGNFF-CGDCQFGKQTKSTHKSMKECYTNRVLELLYMDLMGSMQTESLGGKRYVLVVVDDYSRYAWVCFL

Query:  KGKTDNIEICKNLCLKLQREKGKKIIRIRSDHGKEFDNESFSNFYLLEGIHHEFSAPITPQQNGVVEIKNRTLQEITRVMIHAKNLPLCFWAEAVNTVCH
        K K+   E        L+     +I    SD+G EF   +   ++   GI H  S P TP+ NG+ E K+R + E    ++   ++P  +W  A     +
Subjt:  KGKTDNIEICKNLCLKLQREKGKKIIRIRSDHGKEFDNESFSNFYLLEGIHHEFSAPITPQQNGVVEIKNRTLQEITRVMIHAKNLPLCFWAEAVNTVCH

Query:  IHNRVNIRSGTTVTLYELWKDRKPNVKYFHVFGSTCYILADREYHQ-KWDAKSEQAIFLGYSQNSRAY
        + NR+        + ++      PN     VFG  CY    R Y+Q K D KS Q +FLGYS    AY
Subjt:  IHNRVNIRSGTTVTLYELWKDRKPNVKYFHVFGSTCYILADREYHQ-KWDAKSEQAIFLGYSQNSRAY

Arabidopsis top hits (e value, % identity, alignment)
ATMG00300.1 Gag-Pol-related retrotransposon family protein (e value: 3.2e-05; % identity: 32.91)
Query:  TRSNQTWLWHKKLGHVSMRGLEKIIKSKAIVGIPNLDVKGNFFCGDCQFGKQTKSTHKSMKECYTNRVLELLYMDLMGS
        T  ++T LWH +L H+S RG+E ++K      + +  V    FC DC +GK T   + S  +  T   L+ ++ DL G+
Subjt:  TRSNQTWLWHKKLGHVSMRGLEKIIKSKAIVGIPNLDVKGNFFCGDCQFGKQTKSTHKSMKECYTNRVLELLYMDLMGS
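The % identity values in the hit tables can be related to the alignment blocks. The sketch below assumes that identity is the number of letters on the middle line of an alignment (identical residues; `+` marks a conservative substitution and a blank marks a mismatch) divided by the aligned length. This is an assumption about how the figure was derived, not a documented rule of the site, and `percent_identity` is a hypothetical helper. It uses the ATMG00300.1 alignment shown above.

```python
def percent_identity(query: str, midline: str) -> float:
    """Identical positions (letters on the midline) over aligned length, in percent."""
    aligned_length = len(query.strip())
    # Only letters count as identities; '+' and spaces are not identical residues.
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / aligned_length

# Query and middle line of the ATMG00300.1 alignment block.
query = "TRSNQTWLWHKKLGHVSMRGLEKIIKSKAIVGIPNLDVKGNFFCGDCQFGKQTKSTHKSMKECYTNRVLELLYMDLMGS"
midline = "T  ++T LWH +L H+S RG+E ++K      + +  V    FC DC +GK T   + S  +  T   L+ ++ DL G+"

print(round(percent_identity(query, midline), 2))  # 32.91
```

For this hit the middle line carries 26 letters over 79 aligned positions, which reproduces the listed 32.91. Alignments that contain gaps may need the gap columns counted in the aligned length as well.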


Sequences
CDS sequence
ATGTCAGGCACCTACCAGTTGACAAGATCAAATCAAACATGGCTATGGCATAAAAAGCTGGGGCATGTCAGTATGAGAGGCTTGGAAAAGATTATAAAAAGTAAAGCTAT
TGTAGGAATTCCTAATCTAGACGTAAAGGGAAACTTCTTCTGTGGAGATTGCCAATTTGGTAAGCAGACAAAGTCTACTCATAAAAGTATGAAAGAATGTTATACTAACA
GAGTCTTGGAATTGCTATATATGGATCTTATGGGTTCAATGCAAACAGAAAGTCTGGGAGGAAAGAGGTATGTGCTGGTAGTAGTTGATGACTACTCAAGGTATGCTTGG
GTCTGCTTTCTCAAAGGCAAAACAGATAATATTGAAATATGTAAAAATCTGTGTTTGAAGCTACAACGTGAAAAAGGAAAGAAAATAATCAGGATCCGAAGTGATCATGG
TAAAGAATTTGATAATGAAAGCTTTAGCAACTTTTATCTACTAGAAGGAATACACCATGAGTTCTCTGCACCTATAACTCCTCAACAGAATGGTGTAGTAGAAATAAAGA
ACAGGACGTTACAAGAAATAACACGTGTTATGATACATGCCAAAAATTTACCTTTATGTTTTTGGGCAGAAGCTGTAAATACTGTCTGTCACATTCATAACAGGGTAAAT
ATTAGATCCGGAACGACTGTTACACTTTATGAACTTTGGAAAGATAGAAAGCCAAATGTTAAATACTTTCATGTGTTTGGAAGTACATGTTATATCTTAGCTGACAGGGA
ATACCACCAGAAATGGGATGCTAAGTCAGAACAAGCAATCTTTCTCGGGTACTCTCAGAACAGCCGTGCCTATAGAGTCTTCAATAACAGATCTGGGAGTGTTATGGAAA
CAATAAATGTAGTTATAAATGACCTCGATTCAGCTATCAAATAG
mRNA sequence
ATGTCAGGCACCTACCAGTTGACAAGATCAAATCAAACATGGCTATGGCATAAAAAGCTGGGGCATGTCAGTATGAGAGGCTTGGAAAAGATTATAAAAAGTAAAGCTAT
TGTAGGAATTCCTAATCTAGACGTAAAGGGAAACTTCTTCTGTGGAGATTGCCAATTTGGTAAGCAGACAAAGTCTACTCATAAAAGTATGAAAGAATGTTATACTAACA
GAGTCTTGGAATTGCTATATATGGATCTTATGGGTTCAATGCAAACAGAAAGTCTGGGAGGAAAGAGGTATGTGCTGGTAGTAGTTGATGACTACTCAAGGTATGCTTGG
GTCTGCTTTCTCAAAGGCAAAACAGATAATATTGAAATATGTAAAAATCTGTGTTTGAAGCTACAACGTGAAAAAGGAAAGAAAATAATCAGGATCCGAAGTGATCATGG
TAAAGAATTTGATAATGAAAGCTTTAGCAACTTTTATCTACTAGAAGGAATACACCATGAGTTCTCTGCACCTATAACTCCTCAACAGAATGGTGTAGTAGAAATAAAGA
ACAGGACGTTACAAGAAATAACACGTGTTATGATACATGCCAAAAATTTACCTTTATGTTTTTGGGCAGAAGCTGTAAATACTGTCTGTCACATTCATAACAGGGTAAAT
ATTAGATCCGGAACGACTGTTACACTTTATGAACTTTGGAAAGATAGAAAGCCAAATGTTAAATACTTTCATGTGTTTGGAAGTACATGTTATATCTTAGCTGACAGGGA
ATACCACCAGAAATGGGATGCTAAGTCAGAACAAGCAATCTTTCTCGGGTACTCTCAGAACAGCCGTGCCTATAGAGTCTTCAATAACAGATCTGGGAGTGTTATGGAAA
CAATAAATGTAGTTATAAATGACCTCGATTCAGCTATCAAATAG
Protein sequence
MSGTYQLTRSNQTWLWHKKLGHVSMRGLEKIIKSKAIVGIPNLDVKGNFFCGDCQFGKQTKSTHKSMKECYTNRVLELLYMDLMGSMQTESLGGKRYVLVVVDDYSRYAW
VCFLKGKTDNIEICKNLCLKLQREKGKKIIRIRSDHGKEFDNESFSNFYLLEGIHHEFSAPITPQQNGVVEIKNRTLQEITRVMIHAKNLPLCFWAEAVNTVCHIHNRVN
IRSGTTVTLYELWKDRKPNVKYFHVFGSTCYILADREYHQKWDAKSEQAIFLGYSQNSRAYRVFNNRSGSVMETINVVINDLDSAIK
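The CDS and protein sequences above can be checked against each other by translating the CDS. This is a minimal sketch, assuming the standard genetic code and a CDS that starts in frame at its first base; `translate` is a hypothetical helper written without external libraries. Only the first 30 and last 12 nucleotides of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "").replace(" ", "")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)

print(translate("ATGTCAGGCACCTACCAGTTGACAAGATCA"))  # MSGTYQLTRS
print(translate("GCTATCAAATAG"))                    # AIK
```

The two fragments translate to the first ten and last three residues of the protein sequence listed above. Passing the full CDS to the same function would allow a complete comparison.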