; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc04g0103061 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc04g0103061
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag-pol polyprotein
Genome locationCMiso1.1chr04:20053608..20054636
RNA-Seq ExpressionCmc04g0103061
SyntenyCmc04g0103061
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0036855.1 gag-pol polyprotein [Cucumis melo var. makuwa]9.1e-13671.03Show/hide
Query:  MTENRSYFTNLNDCVTGHVTFGDGAKGKIIAKGNIDKDDLPRLNDVRYVDGLKANLISISQLYDQCYKVIFDDTGCVVMNKENQICMSGKRQADNCYHWN
        MT NRS+FT L +C  GH TFGDGAKGKIIAKGNIDK +LP LN+VRYVDGLKANLISISQL DQ Y V F++TG VV NK NQ+ MSG+R+A+NCY+W+
Subjt:  MTENRSYFTNLNDCVTGHVTFGDGAKGKIIAKGNIDKDDLPRLNDVRYVDGLKANLISISQLYDQCYKVIFDDTGCVVMNKENQICMSGKRQADNCYHWN

Query:  SNMSDTCQLIRSDQAWLWHRKLGHVSMRGLEKVIKNKAVVGIPDLDVNGNFFCGDCQIGKQTRSTQKSLKECYTNRFLELLHMDLMGQMQTESLGGKRYV
        SN S+ C L + DQ WLWHRKLGH+S+R L+KVI+N+AVVGIP LD+NG FFCG+CQ+GKQT+++ + LKECYT   LELLH++LMG MQTESLGGK+YV
Subjt:  SNMSDTCQLIRSDQAWLWHRKLGHVSMRGLEKVIKNKAVVGIPDLDVNGNFFCGDCQIGKQTRSTQKSLKECYTNRFLELLHMDLMGQMQTESLGGKRYV

Query:  LVVVDDYSRYTWDCFLKGKTYTIEICKNLCLKLQREKGKKTTRIQSDHGKEFDNESFNNFCLLEGIHHEFSAPITPKQNGLVERKNRTLQEMTHVMIHAK
        LVVVDDYS++TW  FLK K  T+++C +LCL LQREKG+K  RI+SDHGKEFDNE  NN C  EGIHHEF+APITP+QNG+VERKNR LQEM  VMIHAK
Subjt:  LVVVDDYSRYTWDCFLKGKTYTIEICKNLCLKLQREKGKKTTRIQSDHGKEFDNESFNNFCLLEGIHHEFSAPITPKQNGLVERKNRTLQEMTHVMIHAK

Query:  NLPLCFWAEAVNTDCHIHNRV
        N PL FWAE VNT CHIH RV
Subjt:  NLPLCFWAEAVNTDCHIHNRV

KAA0042995.1 gag-pol polyprotein [Cucumis melo var. makuwa]6.9e-15278.76Show/hide
Query:  MTENRSYFTNLNDCVTGHVTFGDGAKGKIIAKGNIDKDDLPRLNDVRYVDGLKANLISISQLYDQCYKVIFDDTGCVVMNKENQICMSGKRQADNCYHWN
        M  NRSYFTNLNDCV  HVTFGDGAKGKIIAKGNIDKDDL RLNDVRYVDGLKANLI+ISQL DQ YKV FDD GCVV+NKENQICMSGKRQADNCYHWN
Subjt:  MTENRSYFTNLNDCVTGHVTFGDGAKGKIIAKGNIDKDDLPRLNDVRYVDGLKANLISISQLYDQCYKVIFDDTGCVVMNKENQICMSGKRQADNCYHWN

Query:  SNMSDTCQLIRSDQAWLWHRKLGHVSMRGLEKVIKNKAVVGIPDLDVNGNFFCGDCQIGKQTRSTQKSLKECYTNRFLELLHMDLMGQMQTESLGGKRYV
        SNMSDTCQLIRSDQ WLWHRKLGHVSMRGLEKVIKNKAVVGIP+LDVNGNFFC DCQIGKQTRST KSLKECYTNR LELLHMDLMG MQT+SLG     
Subjt:  SNMSDTCQLIRSDQAWLWHRKLGHVSMRGLEKVIKNKAVVGIPDLDVNGNFFCGDCQIGKQTRSTQKSLKECYTNRFLELLHMDLMGQMQTESLGGKRYV

Query:  LVVVDDYSRYTWDCFLKGKTYTIEICKNLCLKLQREKGKKTTRIQSDHGKEFDNESFNNFCLLEGIHHEFSAPITPKQNGLVERKNRTLQEMTHVMIHAK
                         GKT T+EICKNLCLKLQRE+ KK TRI+SDHGKEFDNE FN+FCLLEG HHEFSAPITP+QNG+VERKN+TLQEM  VMIHAK
Subjt:  LVVVDDYSRYTWDCFLKGKTYTIEICKNLCLKLQREKGKKTTRIQSDHGKEFDNESFNNFCLLEGIHHEFSAPITPKQNGLVERKNRTLQEMTHVMIHAK

Query:  NLPLCFWAEAVNTDCHIHNRVVLLELERLLLFMNFGKRE
        NLPLCF+AEAVNT CHIHNRV +     + L+ ++ +R+
Subjt:  NLPLCFWAEAVNTDCHIHNRVVLLELERLLLFMNFGKRE

KAA0048228.1 gag-pol polyprotein [Cucumis melo var. makuwa]7.2e-14988.93Show/hide
Query:  MTENRSYFTNLNDCVTGHVTFGDGAKGKIIAKGNIDKDDLPRLNDVRYVDGLKANLISISQLYDQCYKVIFDDTGCVVMNKENQICMSGKRQADNCYHWN
        MT N+SYFTNLNDCVTGHVTFGDGAK +IIAKGNIDKDDLPRLNDVRYVDGLK NLISISQL DQ YKV FDD GCVVMNKENQICMSGKRQADNCYHWN
Subjt:  MTENRSYFTNLNDCVTGHVTFGDGAKGKIIAKGNIDKDDLPRLNDVRYVDGLKANLISISQLYDQCYKVIFDDTGCVVMNKENQICMSGKRQADNCYHWN

Query:  SNMSDTCQLIRSDQAWLWHRKLGHVSMRGLEKVIKNKAVVGIPDLDVNGNFFCGDCQIGKQTRSTQKSLKECYTNRFLELLHMDLMGQMQTESLGGKRYV
        SNMSDTCQLIR DQ WLWHRKL HVSMRGLEKVIKNKAVVGIPDLDVNGNFFCGDCQIGKQTRST KSLKECYTNR L+LLHMDLMG MQTESLGGKRYV
Subjt:  SNMSDTCQLIRSDQAWLWHRKLGHVSMRGLEKVIKNKAVVGIPDLDVNGNFFCGDCQIGKQTRSTQKSLKECYTNRFLELLHMDLMGQMQTESLGGKRYV

Query:  LVVVDDYSRYTWDCFLKGKTYTIEICKNLCLKLQREKGKKTTRIQSDHGKEFDNESFNNFCLLEGIHHEFSAPITPKQNGLVERKNRTL
        LVVVDDYSRYTW CFLKGK  T+EICKNL LKLQREK KK TRIQSDHGKEFDNE FN+FCLLEGI+HEFSA ITP+QNG++E KNR L
Subjt:  LVVVDDYSRYTWDCFLKGKTYTIEICKNLCLKLQREKGKKTTRIQSDHGKEFDNESFNNFCLLEGIHHEFSAPITPKQNGLVERKNRTL

KAA0053561.1 gag-pol polyprotein [Cucumis melo var. makuwa]1.3e-12689.52Show/hide
Query:  MTENRSYFTNLNDCVTGHVTFGDGAKGKIIAKGNIDKDDLPRLNDVRYVDGLKANLISISQLYDQCYKVIFDDTGCVVMNKENQICMSGKRQADNCYHWN
        MT NRSYFTNLNDCVT HVT GDGAKGKIIAKGNIDK DLPRLNDVRYVDGLKANLISISQL DQ YKV FDD G VVMNKENQICMSGKRQADNCYHWN
Subjt:  MTENRSYFTNLNDCVTGHVTFGDGAKGKIIAKGNIDKDDLPRLNDVRYVDGLKANLISISQLYDQCYKVIFDDTGCVVMNKENQICMSGKRQADNCYHWN

Query:  SNMSDTCQLIRSDQAWLWHRKLGHVSMRGLEKVIKNKAVVGIPDLDVNGNFFCGDCQIGKQTRSTQKSLKECYTNRFLELLHMDLMGQMQTESLGGKRYV
        SNMSDTCQLIRSDQ WLWHRKLGHVSMRGLEK+IKNKAVVGIPDLDVNGNFFCGDCQIGKQTR T K+LKECYTNR LELLHMDLMG M  ESL GKRYV
Subjt:  SNMSDTCQLIRSDQAWLWHRKLGHVSMRGLEKVIKNKAVVGIPDLDVNGNFFCGDCQIGKQTRSTQKSLKECYTNRFLELLHMDLMGQMQTESLGGKRYV

Query:  LVVVDDYSRYTWDCFLKGKTYTIEICKNLCLKLQREKGKKTTRIQSDH
        LVVVDDYSRYTW CFLK KT T+E CKNLCLKLQREKGKK TRI+SDH
Subjt:  LVVVDDYSRYTWDCFLKGKTYTIEICKNLCLKLQREKGKKTTRIQSDH

TYK19154.1 gag-pol polyprotein [Cucumis melo var. makuwa]1.3e-12689.52Show/hide
Query:  MTENRSYFTNLNDCVTGHVTFGDGAKGKIIAKGNIDKDDLPRLNDVRYVDGLKANLISISQLYDQCYKVIFDDTGCVVMNKENQICMSGKRQADNCYHWN
        MT NRSYFTNLNDCVT HVT GDGAKGKIIAKGNIDK DLPRLNDVRYVDGLKANLISISQL DQ YKV FDD G VVMNKENQICMSGKRQADNCYHWN
Subjt:  MTENRSYFTNLNDCVTGHVTFGDGAKGKIIAKGNIDKDDLPRLNDVRYVDGLKANLISISQLYDQCYKVIFDDTGCVVMNKENQICMSGKRQADNCYHWN

Query:  SNMSDTCQLIRSDQAWLWHRKLGHVSMRGLEKVIKNKAVVGIPDLDVNGNFFCGDCQIGKQTRSTQKSLKECYTNRFLELLHMDLMGQMQTESLGGKRYV
        SNMSDTCQLIRSDQ WLWHRKLGHVSMRGLEK+IKNKAVVGIPDLDVNGNFFCGDCQIGKQTR T K+LKECYTNR LELLHMDLMG M  ESL GKRYV
Subjt:  SNMSDTCQLIRSDQAWLWHRKLGHVSMRGLEKVIKNKAVVGIPDLDVNGNFFCGDCQIGKQTRSTQKSLKECYTNRFLELLHMDLMGQMQTESLGGKRYV

Query:  LVVVDDYSRYTWDCFLKGKTYTIEICKNLCLKLQREKGKKTTRIQSDH
        LVVVDDYSRYTW CFLK KT T+E CKNLCLKLQREKGKK TRI+SDH
Subjt:  LVVVDDYSRYTWDCFLKGKTYTIEICKNLCLKLQREKGKKTTRIQSDH

TrEMBL top hitse value%identityAlignment
A0A5A7TNK7 Gag-pol polyprotein3.3e-15278.76Show/hide
Query:  MTENRSYFTNLNDCVTGHVTFGDGAKGKIIAKGNIDKDDLPRLNDVRYVDGLKANLISISQLYDQCYKVIFDDTGCVVMNKENQICMSGKRQADNCYHWN
        M  NRSYFTNLNDCV  HVTFGDGAKGKIIAKGNIDKDDL RLNDVRYVDGLKANLI+ISQL DQ YKV FDD GCVV+NKENQICMSGKRQADNCYHWN
Subjt:  MTENRSYFTNLNDCVTGHVTFGDGAKGKIIAKGNIDKDDLPRLNDVRYVDGLKANLISISQLYDQCYKVIFDDTGCVVMNKENQICMSGKRQADNCYHWN

Query:  SNMSDTCQLIRSDQAWLWHRKLGHVSMRGLEKVIKNKAVVGIPDLDVNGNFFCGDCQIGKQTRSTQKSLKECYTNRFLELLHMDLMGQMQTESLGGKRYV
        SNMSDTCQLIRSDQ WLWHRKLGHVSMRGLEKVIKNKAVVGIP+LDVNGNFFC DCQIGKQTRST KSLKECYTNR LELLHMDLMG MQT+SLG     
Subjt:  SNMSDTCQLIRSDQAWLWHRKLGHVSMRGLEKVIKNKAVVGIPDLDVNGNFFCGDCQIGKQTRSTQKSLKECYTNRFLELLHMDLMGQMQTESLGGKRYV

Query:  LVVVDDYSRYTWDCFLKGKTYTIEICKNLCLKLQREKGKKTTRIQSDHGKEFDNESFNNFCLLEGIHHEFSAPITPKQNGLVERKNRTLQEMTHVMIHAK
                         GKT T+EICKNLCLKLQRE+ KK TRI+SDHGKEFDNE FN+FCLLEG HHEFSAPITP+QNG+VERKN+TLQEM  VMIHAK
Subjt:  LVVVDDYSRYTWDCFLKGKTYTIEICKNLCLKLQREKGKKTTRIQSDHGKEFDNESFNNFCLLEGIHHEFSAPITPKQNGLVERKNRTLQEMTHVMIHAK

Query:  NLPLCFWAEAVNTDCHIHNRVVLLELERLLLFMNFGKRE
        NLPLCF+AEAVNT CHIHNRV +     + L+ ++ +R+
Subjt:  NLPLCFWAEAVNTDCHIHNRVVLLELERLLLFMNFGKRE

A0A5A7UJA4 Gag-pol polyprotein6.4e-12789.52Show/hide
Query:  MTENRSYFTNLNDCVTGHVTFGDGAKGKIIAKGNIDKDDLPRLNDVRYVDGLKANLISISQLYDQCYKVIFDDTGCVVMNKENQICMSGKRQADNCYHWN
        MT NRSYFTNLNDCVT HVT GDGAKGKIIAKGNIDK DLPRLNDVRYVDGLKANLISISQL DQ YKV FDD G VVMNKENQICMSGKRQADNCYHWN
Subjt:  MTENRSYFTNLNDCVTGHVTFGDGAKGKIIAKGNIDKDDLPRLNDVRYVDGLKANLISISQLYDQCYKVIFDDTGCVVMNKENQICMSGKRQADNCYHWN

Query:  SNMSDTCQLIRSDQAWLWHRKLGHVSMRGLEKVIKNKAVVGIPDLDVNGNFFCGDCQIGKQTRSTQKSLKECYTNRFLELLHMDLMGQMQTESLGGKRYV
        SNMSDTCQLIRSDQ WLWHRKLGHVSMRGLEK+IKNKAVVGIPDLDVNGNFFCGDCQIGKQTR T K+LKECYTNR LELLHMDLMG M  ESL GKRYV
Subjt:  SNMSDTCQLIRSDQAWLWHRKLGHVSMRGLEKVIKNKAVVGIPDLDVNGNFFCGDCQIGKQTRSTQKSLKECYTNRFLELLHMDLMGQMQTESLGGKRYV

Query:  LVVVDDYSRYTWDCFLKGKTYTIEICKNLCLKLQREKGKKTTRIQSDH
        LVVVDDYSRYTW CFLK KT T+E CKNLCLKLQREKGKK TRI+SDH
Subjt:  LVVVDDYSRYTWDCFLKGKTYTIEICKNLCLKLQREKGKKTTRIQSDH

A0A5D3BA69 Gag-pol polyprotein4.4e-13671.03Show/hide
Query:  MTENRSYFTNLNDCVTGHVTFGDGAKGKIIAKGNIDKDDLPRLNDVRYVDGLKANLISISQLYDQCYKVIFDDTGCVVMNKENQICMSGKRQADNCYHWN
        MT NRS+FT L +C  GH TFGDGAKGKIIAKGNIDK +LP LN+VRYVDGLKANLISISQL DQ Y V F++TG VV NK NQ+ MSG+R+A+NCY+W+
Subjt:  MTENRSYFTNLNDCVTGHVTFGDGAKGKIIAKGNIDKDDLPRLNDVRYVDGLKANLISISQLYDQCYKVIFDDTGCVVMNKENQICMSGKRQADNCYHWN

Query:  SNMSDTCQLIRSDQAWLWHRKLGHVSMRGLEKVIKNKAVVGIPDLDVNGNFFCGDCQIGKQTRSTQKSLKECYTNRFLELLHMDLMGQMQTESLGGKRYV
        SN S+ C L + DQ WLWHRKLGH+S+R L+KVI+N+AVVGIP LD+NG FFCG+CQ+GKQT+++ + LKECYT   LELLH++LMG MQTESLGGK+YV
Subjt:  SNMSDTCQLIRSDQAWLWHRKLGHVSMRGLEKVIKNKAVVGIPDLDVNGNFFCGDCQIGKQTRSTQKSLKECYTNRFLELLHMDLMGQMQTESLGGKRYV

Query:  LVVVDDYSRYTWDCFLKGKTYTIEICKNLCLKLQREKGKKTTRIQSDHGKEFDNESFNNFCLLEGIHHEFSAPITPKQNGLVERKNRTLQEMTHVMIHAK
        LVVVDDYS++TW  FLK K  T+++C +LCL LQREKG+K  RI+SDHGKEFDNE  NN C  EGIHHEF+APITP+QNG+VERKNR LQEM  VMIHAK
Subjt:  LVVVDDYSRYTWDCFLKGKTYTIEICKNLCLKLQREKGKKTTRIQSDHGKEFDNESFNNFCLLEGIHHEFSAPITPKQNGLVERKNRTLQEMTHVMIHAK

Query:  NLPLCFWAEAVNTDCHIHNRV
        N PL FWAE VNT CHIH RV
Subjt:  NLPLCFWAEAVNTDCHIHNRV

A0A5D3C7T5 Gag-pol polyprotein3.5e-14988.93Show/hide
Query:  MTENRSYFTNLNDCVTGHVTFGDGAKGKIIAKGNIDKDDLPRLNDVRYVDGLKANLISISQLYDQCYKVIFDDTGCVVMNKENQICMSGKRQADNCYHWN
        MT N+SYFTNLNDCVTGHVTFGDGAK +IIAKGNIDKDDLPRLNDVRYVDGLK NLISISQL DQ YKV FDD GCVVMNKENQICMSGKRQADNCYHWN
Subjt:  MTENRSYFTNLNDCVTGHVTFGDGAKGKIIAKGNIDKDDLPRLNDVRYVDGLKANLISISQLYDQCYKVIFDDTGCVVMNKENQICMSGKRQADNCYHWN

Query:  SNMSDTCQLIRSDQAWLWHRKLGHVSMRGLEKVIKNKAVVGIPDLDVNGNFFCGDCQIGKQTRSTQKSLKECYTNRFLELLHMDLMGQMQTESLGGKRYV
        SNMSDTCQLIR DQ WLWHRKL HVSMRGLEKVIKNKAVVGIPDLDVNGNFFCGDCQIGKQTRST KSLKECYTNR L+LLHMDLMG MQTESLGGKRYV
Subjt:  SNMSDTCQLIRSDQAWLWHRKLGHVSMRGLEKVIKNKAVVGIPDLDVNGNFFCGDCQIGKQTRSTQKSLKECYTNRFLELLHMDLMGQMQTESLGGKRYV

Query:  LVVVDDYSRYTWDCFLKGKTYTIEICKNLCLKLQREKGKKTTRIQSDHGKEFDNESFNNFCLLEGIHHEFSAPITPKQNGLVERKNRTL
        LVVVDDYSRYTW CFLKGK  T+EICKNL LKLQREK KK TRIQSDHGKEFDNE FN+FCLLEGI+HEFSA ITP+QNG++E KNR L
Subjt:  LVVVDDYSRYTWDCFLKGKTYTIEICKNLCLKLQREKGKKTTRIQSDHGKEFDNESFNNFCLLEGIHHEFSAPITPKQNGLVERKNRTL

A0A5D3D6G0 Gag-pol polyprotein6.4e-12789.52Show/hide
Query:  MTENRSYFTNLNDCVTGHVTFGDGAKGKIIAKGNIDKDDLPRLNDVRYVDGLKANLISISQLYDQCYKVIFDDTGCVVMNKENQICMSGKRQADNCYHWN
        MT NRSYFTNLNDCVT HVT GDGAKGKIIAKGNIDK DLPRLNDVRYVDGLKANLISISQL DQ YKV FDD G VVMNKENQICMSGKRQADNCYHWN
Subjt:  MTENRSYFTNLNDCVTGHVTFGDGAKGKIIAKGNIDKDDLPRLNDVRYVDGLKANLISISQLYDQCYKVIFDDTGCVVMNKENQICMSGKRQADNCYHWN

Query:  SNMSDTCQLIRSDQAWLWHRKLGHVSMRGLEKVIKNKAVVGIPDLDVNGNFFCGDCQIGKQTRSTQKSLKECYTNRFLELLHMDLMGQMQTESLGGKRYV
        SNMSDTCQLIRSDQ WLWHRKLGHVSMRGLEK+IKNKAVVGIPDLDVNGNFFCGDCQIGKQTR T K+LKECYTNR LELLHMDLMG M  ESL GKRYV
Subjt:  SNMSDTCQLIRSDQAWLWHRKLGHVSMRGLEKVIKNKAVVGIPDLDVNGNFFCGDCQIGKQTRSTQKSLKECYTNRFLELLHMDLMGQMQTESLGGKRYV

Query:  LVVVDDYSRYTWDCFLKGKTYTIEICKNLCLKLQREKGKKTTRIQSDH
        LVVVDDYSRYTW CFLK KT T+E CKNLCLKLQREKGKK TRI+SDH
Subjt:  LVVVDDYSRYTWDCFLKGKTYTIEICKNLCLKLQREKGKKTTRIQSDH

SwissProt top hitse value%identityAlignment
A0A0B7P3V8 Transposon Ty4-P Gag-Pol polyprotein9.8e-1628.85Show/hide
Query:  HRKLGHVSMRGLEKVIKNKAVVGIPDLDVNGN-FFCGDCQIGKQTRS---TQKSLKECYTNRFLELLHMDLMGQMQTESLGGKRYVLVVVDDYSRY--TW
        H+++GH  ++ +E  IK+       DL    N F+C  C+I K T+    T         +       MD+ G + + +   KRY+L++VD+ +RY  T 
Subjt:  HRKLGHVSMRGLEKVIKNKAVVGIPDLDVNGN-FFCGDCQIGKQTRS---TQKSLKECYTNRFLELLHMDLMGQMQTESLGGKRYVLVVVDDYSRY--TW

Query:  DCFLK-GKTYTIEICKNLCLKLQREKGKKTTRIQSDHGKEFDNESFNNFCLLEGIHHEFSAPITPKQNGLVERKNRTLQEMTHVMIHAKNLPLCFWAEAV
          F K  +T   +I KN+   ++ +  +K   I SD G EF N+    + + +GIHH  ++      NG  ER  RT+      ++   NL + FW  AV
Subjt:  DCFLK-GKTYTIEICKNLCLKLQREKGKKTTRIQSDHGKEFDNESFNNFCLLEGIHHEFSAPITPKQNGLVERKNRTLQEMTHVMIHAKNLPLCFWAEAV

Query:  NTDCHIHN
         +  +I N
Subjt:  NTDCHIHN

P0C2J7 Transposon Ty4-H Gag-Pol polyprotein9.8e-1628.85Show/hide
Query:  HRKLGHVSMRGLEKVIKNKAVVGIPDLDVNGN-FFCGDCQIGKQTRS---TQKSLKECYTNRFLELLHMDLMGQMQTESLGGKRYVLVVVDDYSRY--TW
        H+++GH  ++ +E  IK+       DL    N F+C  C+I K T+    T         +       MD+ G + + +   KRY+L++VD+ +RY  T 
Subjt:  HRKLGHVSMRGLEKVIKNKAVVGIPDLDVNGN-FFCGDCQIGKQTRS---TQKSLKECYTNRFLELLHMDLMGQMQTESLGGKRYVLVVVDDYSRY--TW

Query:  DCFLK-GKTYTIEICKNLCLKLQREKGKKTTRIQSDHGKEFDNESFNNFCLLEGIHHEFSAPITPKQNGLVERKNRTLQEMTHVMIHAKNLPLCFWAEAV
          F K  +T   +I KN+   ++ +  +K   I SD G EF N+    + + +GIHH  ++      NG  ER  RT+      ++   NL + FW  AV
Subjt:  DCFLK-GKTYTIEICKNLCLKLQREKGKKTTRIQSDHGKEFDNESFNNFCLLEGIHHEFSAPITPKQNGLVERKNRTLQEMTHVMIHAKNLPLCFWAEAV

Query:  NTDCHIHN
         +  +I N
Subjt:  NTDCHIHN

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.7e-3632.86Show/hide
Query:  LNDVRYVDGLKANLISISQLYDQCYKVIFDDTGCVVMNKENQICMSGKRQADNCYHWNSNMSDTCQLIRSDQ--AWLWHRKLGHVSMRGLEKVIKNKAVV
        L DVR+V  L+ NLIS   L    Y+  F +     + K + +   G  +    Y  N+ +         D+    LWH+++GH+S +GL+ + K   + 
Subjt:  LNDVRYVDGLKANLISISQLYDQCYKVIFDDTGCVVMNKENQICMSGKRQADNCYHWNSNMSDTCQLIRSDQ--AWLWHRKLGHVSMRGLEKVIKNKAVV

Query:  GIPDLDVNGNFFCGDCQIGKQTRSTQKSLKECYTNRFLELLHMDLMGQMQTESLGGKRYVLVVVDDYSRYTWDCFLKGKTYTIEICKNLCLKLQREKGKK
              V     C  C  GKQ R + ++  E   N  L+L++ D+ G M+ ES+GG +Y +  +DD SR  W   LK K    ++ +     ++RE G+K
Subjt:  GIPDLDVNGNFFCGDCQIGKQTRSTQKSLKECYTNRFLELLHMDLMGQMQTESLGGKRYVLVVVDDYSRYTWDCFLKGKTYTIEICKNLCLKLQREKGKK

Query:  TTRIQSDHGKEFDNESFNNFCLLEGIHHEFSAPITPKQNGLVERKNRTLQEMTHVMIHAKNLPLCFWAEAVNTDCHIHNR
          R++SD+G E+ +  F  +C   GI HE + P TP+ NG+ ER NRT+ E    M+    LP  FW EAV T C++ NR
Subjt:  TTRIQSDHGKEFDNESFNNFCLLEGIHHEFSAPITPKQNGLVERKNRTLQEMTHVMIHAKNLPLCFWAEAVNTDCHIHNR

P47024 Transposon Ty4-J Gag-Pol polyprotein1.7e-1528.37Show/hide
Query:  HRKLGHVSMRGLEKVIKNKAVVGIPDLDVNGN-FFCGDCQIGKQTRS---TQKSLKECYTNRFLELLHMDLMGQMQTESLGGKRYVLVVVDDYSRY--TW
        H+++GH  ++ +E  IK+       DL    N F+C  C+I K T+    T         +       MD+ G + + +   KRY+L++VD+ +RY  T 
Subjt:  HRKLGHVSMRGLEKVIKNKAVVGIPDLDVNGN-FFCGDCQIGKQTRS---TQKSLKECYTNRFLELLHMDLMGQMQTESLGGKRYVLVVVDDYSRY--TW

Query:  DCFLK-GKTYTIEICKNLCLKLQREKGKKTTRIQSDHGKEFDNESFNNFCLLEGIHHEFSAPITPKQNGLVERKNRTLQEMTHVMIHAKNLPLCFWAEAV
          F K  +T   ++ KN+   ++ +  +K   I SD G EF N+    + + +GIHH  ++      NG  ER  RT+      ++   NL + FW  AV
Subjt:  DCFLK-GKTYTIEICKNLCLKLQREKGKKTTRIQSDHGKEFDNESFNNFCLLEGIHHEFSAPITPKQNGLVERKNRTLQEMTHVMIHAKNLPLCFWAEAV

Query:  NTDCHIHN
         +  +I N
Subjt:  NTDCHIHN

Q87040 Pro-Pol polyprotein2.0e-0527.46Show/hide
Query:  CGDCQIGKQTRSTQ-KSLKECYTNRFLELLHMDLMGQMQTESLGGKRYVLVVVDDYSRYTWDCFLKGKTYTIEI-CKNLCLKLQREKGKKTTRIQSDHGK
        C  C I   +  T    L+     +  +   +D +G +      G  YVLV+VD  + +TW    K  + +  +   N+   +   K      I SD G 
Subjt:  CGDCQIGKQTRSTQ-KSLKECYTNRFLELLHMDLMGQMQTESLGGKRYVLVVVDDYSRYTWDCFLKGKTYTIEI-CKNLCLKLQREKGKKTTRIQSDHGK

Query:  EFDNESFNNFCLLEGIHHEFSAPITPKQNGLVERKNRTLQEM
         F + +F  +    GIH EFS P  P+ +G VERKN  ++ +
Subjt:  EFDNESFNNFCLLEGIHHEFSAPITPKQNGLVERKNRTLQEM

Arabidopsis top hitse value%identityAlignment
ATMG00300.1 Gag-Pol-related retrotransposon family protein2.1e-0533.33Show/hide
Query:  SNMSDTCQLIRSDQAWLWHRKLGHVSMRGLEKVIKNKAVVGIPDLDVNGNFFCGDCQIGKQTRSTQKSLKECYTNRFLELLHMDLMG
        SN+++T +    D+  LWH +L H+S RG+E ++K      +    V+   FC DC  GK T     S  +  T   L+ +H DL G
Subjt:  SNMSDTCQLIRSDQAWLWHRKLGHVSMRGLEKVIKNKAVVGIPDLDVNGNFFCGDCQIGKQTRSTQKSLKECYTNRFLELLHMDLMG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTGAAAACAGATCCTACTTTACGAACTTAAACGATTGTGTCACCGGACATGTTACCTTTGGTGATGGTGCAAAAGGAAAAATTATAGCTAAAGGTAACATAGACAA
GGATGATTTACCACGACTGAACGATGTTAGGTATGTGGATGGACTAAAAGCAAACTTGATCAGTATAAGTCAACTGTATGATCAATGTTATAAAGTTATTTTTGATGATA
CTGGTTGTGTTGTCATGAATAAAGAAAATCAAATTTGCATGAGTGGTAAACGACAAGCTGATAATTGCTACCATTGGAATTCAAATATGTCTGACACTTGTCAGTTGATA
AGATCCGATCAAGCATGGCTGTGGCACAGAAAGCTAGGGCATGTCAGCATGAGGGGATTGGAAAAAGTTATTAAAAATAAAGCAGTTGTGGGAATTCCTGATTTAGACGT
AAATGGAAACTTCTTCTGTGGAGACTGTCAAATTGGCAAGCAGACAAGGTCTACTCAAAAAAGTTTGAAAGAATGTTATACCAATAGATTCTTGGAACTGTTACATATGG
ATCTCATGGGTCAAATGCAAACAGAAAGTCTTGGAGGTAAGAGGTATGTGCTGGTTGTAGTTGATGATTACTCAAGATACACTTGGGATTGTTTTCTCAAAGGAAAAACA
TATACAATTGAAATATGCAAAAATCTGTGTTTGAAGCTACAACGTGAAAAAGGGAAGAAGACAACAAGAATCCAAAGTGATCATGGTAAAGAATTCGATAATGAAAGCTT
TAACAATTTTTGTCTGTTAGAAGGAATACACCATGAGTTTTCTGCACCGATAACTCCTAAACAAAATGGTTTAGTAGAAAGAAAGAATAGGACATTACAAGAAATGACTC
ATGTTATGATACATGCCAAAAATTTACCTTTATGTTTTTGGGCAGAAGCTGTAAATACTGACTGTCACATTCATAACAGAGTAGTACTATTAGAACTGGAACGACTGTTA
CTCTTTATGAACTTTGGAAAAAGAGAAAACCAAATGTGA
mRNA sequenceShow/hide mRNA sequence
ATGACTGAAAACAGATCCTACTTTACGAACTTAAACGATTGTGTCACCGGACATGTTACCTTTGGTGATGGTGCAAAAGGAAAAATTATAGCTAAAGGTAACATAGACAA
GGATGATTTACCACGACTGAACGATGTTAGGTATGTGGATGGACTAAAAGCAAACTTGATCAGTATAAGTCAACTGTATGATCAATGTTATAAAGTTATTTTTGATGATA
CTGGTTGTGTTGTCATGAATAAAGAAAATCAAATTTGCATGAGTGGTAAACGACAAGCTGATAATTGCTACCATTGGAATTCAAATATGTCTGACACTTGTCAGTTGATA
AGATCCGATCAAGCATGGCTGTGGCACAGAAAGCTAGGGCATGTCAGCATGAGGGGATTGGAAAAAGTTATTAAAAATAAAGCAGTTGTGGGAATTCCTGATTTAGACGT
AAATGGAAACTTCTTCTGTGGAGACTGTCAAATTGGCAAGCAGACAAGGTCTACTCAAAAAAGTTTGAAAGAATGTTATACCAATAGATTCTTGGAACTGTTACATATGG
ATCTCATGGGTCAAATGCAAACAGAAAGTCTTGGAGGTAAGAGGTATGTGCTGGTTGTAGTTGATGATTACTCAAGATACACTTGGGATTGTTTTCTCAAAGGAAAAACA
TATACAATTGAAATATGCAAAAATCTGTGTTTGAAGCTACAACGTGAAAAAGGGAAGAAGACAACAAGAATCCAAAGTGATCATGGTAAAGAATTCGATAATGAAAGCTT
TAACAATTTTTGTCTGTTAGAAGGAATACACCATGAGTTTTCTGCACCGATAACTCCTAAACAAAATGGTTTAGTAGAAAGAAAGAATAGGACATTACAAGAAATGACTC
ATGTTATGATACATGCCAAAAATTTACCTTTATGTTTTTGGGCAGAAGCTGTAAATACTGACTGTCACATTCATAACAGAGTAGTACTATTAGAACTGGAACGACTGTTA
CTCTTTATGAACTTTGGAAAAAGAGAAAACCAAATGTGA
Protein sequenceShow/hide protein sequence
MTENRSYFTNLNDCVTGHVTFGDGAKGKIIAKGNIDKDDLPRLNDVRYVDGLKANLISISQLYDQCYKVIFDDTGCVVMNKENQICMSGKRQADNCYHWNSNMSDTCQLI
RSDQAWLWHRKLGHVSMRGLEKVIKNKAVVGIPDLDVNGNFFCGDCQIGKQTRSTQKSLKECYTNRFLELLHMDLMGQMQTESLGGKRYVLVVVDDYSRYTWDCFLKGKT
YTIEICKNLCLKLQREKGKKTTRIQSDHGKEFDNESFNNFCLLEGIHHEFSAPITPKQNGLVERKNRTLQEMTHVMIHAKNLPLCFWAEAVNTDCHIHNRVVLLELERLL
LFMNFGKRENQM