CuGenDBv2

Cmc07g0185401 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc07g0185401
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Gag-pol polyprotein
Genome location: CMiso1.1chr07:3484168..3485244
RNA-Seq Expression: Cmc07g0185401
Synteny: Cmc07g0185401
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
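
The genome location above follows a `seqid:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state this), the string can be split and the span computed as follows. The function name `parse_location` is hypothetical and not part of any CuGenDB tooling.

```python
import re

def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(.+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

seqid, start, end = parse_location("CMiso1.1chr07:3484168..3485244")

# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(seqid, start, end, span)  # CMiso1.1chr07 3484168 3485244 1077
```

The resulting span can be checked against the length of the CDS sequence listed further down in this record.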


Homology
GenBank top hits (e value, % identity, alignment)
KAA0025735.1 gag-pol polyprotein [Cucumis melo var. makuwa] (e value: 3.6e-143; % identity: 88.04)
Query:  MTGNQSFFIELEECASGHVTFEDGAKARIIAKGNIDKSNLPCLNDVRYMDGLKANLISISQLCDQGYSVNFNNTGCVVTDKNNQVFMSGRRQADNCYHWN
        MTGN+SFF ELEEC S HVTFEDGAK RIIAKGNI+KSNLPCLN+VRYMDGLKANLISISQ+CDQGYSVNFNNTGCVVTDKNNQVFMSGRRQ DNCYHW+
Subjt:  MTGNQSFFIELEECASGHVTFEDGAKARIIAKGNIDKSNLPCLNDVRYMDGLKANLISISQLCDQGYSVNFNNTGCVVTDKNNQVFMSGRRQADNCYHWN

Query:  SNSSNICHLTKVYQSWLWHKKLGHVSLRSLDKVIRNEAVVGIPCFDINGKFFCGDCQVGKKTKTSHRSLKECYTIRVLELLHLDLMGPMQTESLGGKKYV
        SNSSNICHLTK  Q+WLWH+KLGH+S+RSLDKVIRNEAVV IP  DINGKFFCGDCQVGKKTK SH+SLKECYTIRV ELLHLDLMG MQTESLGGKKYV
Subjt:  SNSSNICHLTKVYQSWLWHKKLGHVSLRSLDKVIRNEAVVGIPCFDINGKFFCGDCQVGKKTKTSHRSLKECYTIRVLELLHLDLMGPMQTESLGGKKYV

Query:  LFVVDDYSRFTWVQFLKGKLDTVKICISLCLNLQQEKGQKIIRIGSDHGKEFDNEDLNNFCQLEGIHHEFAASITP
        L VVDDYSRFTWV FLKGK DTVK+CISLCLNLQ+EKG+KIIRI SDHGKEFDNEDLNNFCQ +GIHHEFAA ITP
Subjt:  LFVVDDYSRFTWVQFLKGKLDTVKICISLCLNLQQEKGQKIIRIGSDHGKEFDNEDLNNFCQLEGIHHEFAASITP

KAA0036855.1 gag-pol polyprotein [Cucumis melo var. makuwa] (e value: 7.4e-173; % identity: 84.06)
Query:  MTGNQSFFIELEECASGHVTFEDGAKARIIAKGNIDKSNLPCLNDVRYMDGLKANLISISQLCDQGYSVNFNNTGCVVTDKNNQVFMSGRRQADNCYHWN
        MTGN+SFF ELEECA GH TF DGAK +IIAKGNIDKSNLP LN+VRY+DGLKANLISISQLCDQGYSVNFNNTG VVT+KNNQVFMSGRR+A+NCY+W+
Subjt:  MTGNQSFFIELEECASGHVTFEDGAKARIIAKGNIDKSNLPCLNDVRYMDGLKANLISISQLCDQGYSVNFNNTGCVVTDKNNQVFMSGRRQADNCYHWN

Query:  SNSSNICHLTKVYQSWLWHKKLGHVSLRSLDKVIRNEAVVGIPCFDINGKFFCGDCQVGKKTKTSHRSLKECYTIRVLELLHLDLMGPMQTESLGGKKYV
        SN SNICHLTK+ Q+WLWH+KLGH+SLRSLDKVIRNEAVVGIP  DINGKFFCG+CQVGK+TKTSHR LKECYTI VLELLHL+LMGPMQTESLGGKKYV
Subjt:  SNSSNICHLTKVYQSWLWHKKLGHVSLRSLDKVIRNEAVVGIPCFDINGKFFCGDCQVGKKTKTSHRSLKECYTIRVLELLHLDLMGPMQTESLGGKKYV

Query:  LFVVDDYSRFTWVQFLKGKLDTVKICISLCLNLQQEKGQKIIRIGSDHGKEFDNEDLNNFCQLEGIHHEFAASITPQQNGVVEQKNRTLQDMAQVMIHAK
        L VVDDYS+FTWV+FLK K DTVK+CISLCLNLQ+EKGQKIIRI SDHGKEFDNEDLNN CQ EGIHHEFAA ITPQQNGVVE+KNR LQ+MA+VMIHAK
Subjt:  LFVVDDYSRFTWVQFLKGKLDTVKICISLCLNLQQEKGQKIIRIGSDHGKEFDNEDLNNFCQLEGIHHEFAASITPQQNGVVEQKNRTLQDMAQVMIHAK

Query:  NLPLIFWTEAVNTACHIHNRVITRSSMSVTLYELWKGRKPNVKYF
        N PL FW E VNTACHIH RV TR   ++TLYELWKGRKPNVKYF
Subjt:  NLPLIFWTEAVNTACHIHNRVITRSSMSVTLYELWKGRKPNVKYF

KAA0059174.1 F5J5.1 [Cucumis melo var. makuwa] (e value: 6.8e-142; % identity: 82.31)
Query:  MTGNQSFFIELEECASGHVTFEDGAKARIIAKGNIDKSNLPCLNDVRYMDGLKANLISISQLCDQGYSVNFNNTGCVVTDKNNQVFMSGRRQADNCYHWN
        MT N+SFF ELEECASGHV F+DGAK +IIAKGNIDKSNLPCLN VRY+DGLK NLIS SQLCDQGYSVNFNNTGCVVT+KNNQVF+SG R+ADNCYHW+
Subjt:  MTGNQSFFIELEECASGHVTFEDGAKARIIAKGNIDKSNLPCLNDVRYMDGLKANLISISQLCDQGYSVNFNNTGCVVTDKNNQVFMSGRRQADNCYHWN

Query:  SNSSNICHLTKVYQSWLWHKKLGHVSLRSLDKVIRNEAVVGIPCFDINGKFFCGDCQVGKKTKTSHRSLKECYTIRVLELLHLDLMGPMQTESLGGKKYV
        SN SNICHLTK+ Q+WLWH+KLGH+SLRSLDKVIRNEA++GIP  DINGKFFCGDCQVGK+TKTSHR L ECYTI  LELLHLDL+  MQ ESLGGKKYV
Subjt:  SNSSNICHLTKVYQSWLWHKKLGHVSLRSLDKVIRNEAVVGIPCFDINGKFFCGDCQVGKKTKTSHRSLKECYTIRVLELLHLDLMGPMQTESLGGKKYV

Query:  LFVVDDYSRFTWVQFLKGKLDTVKICISLCLNLQQEKGQKIIRIGSDHGKEFDNEDLNNFCQLEGIHHEFAASITPQQNGVVEQKNRTLQDMAQ
          VVDDYSRFTWV+FLK K D VK+CISLCLNLQ+EKGQKIIRI SDHGK+FDNE+LNNFCQ EGIHHEFAA ITPQQN VVE+KNRTLQ+MA+
Subjt:  LFVVDDYSRFTWVQFLKGKLDTVKICISLCLNLQQEKGQKIIRIGSDHGKEFDNEDLNNFCQLEGIHHEFAASITPQQNGVVEQKNRTLQDMAQ

KAA0066731.1 gag-pol polyprotein [Cucumis melo var. makuwa] (e value: 1.0e-145; % identity: 86.58)
Query:  MFKTMTGNQSFFIELEECASGHVTFEDGAKARIIAKGNIDKSNLPCLNDVRYMDGLKANLISISQLCDQGYSVNFNNTGCVVTDKNNQVFMSGRRQADNC
        MFKTMTGNQSFFIELEECASGHVTFEDGAKARIIAKGNIDKSNLP LNDVRYMDGLKANLISISQLCDQGYSVNFNNTGCVVTDKNNQVFMSGRRQADNC
Subjt:  MFKTMTGNQSFFIELEECASGHVTFEDGAKARIIAKGNIDKSNLPCLNDVRYMDGLKANLISISQLCDQGYSVNFNNTGCVVTDKNNQVFMSGRRQADNC

Query:  YHWNSNSSNICHLTKVYQSWLWHKKLGHVSLRSLDKVIRNEAVVGIPCFDINGKFFCGDCQVGKKTKTSHRSLKECYTIRVLELLHLDLMGPMQTESLGG
        YHWNSNSSNICHLTKVY++WLWHKKLGHVSLRSLDKVIRNEA+VGIP FDINGK FCGDCQVGKKTKTSHRSLKECYTIRVLELLHLD MGPMQTESLGG
Subjt:  YHWNSNSSNICHLTKVYQSWLWHKKLGHVSLRSLDKVIRNEAVVGIPCFDINGKFFCGDCQVGKKTKTSHRSLKECYTIRVLELLHLDLMGPMQTESLGG

Query:  KKYVLFVVDDYSRFTWVQFLKGKLDTVKICISLCLNLQQEKGQKIIRIGSDHGKEFDNEDLNNFCQLEGIHHEFAASITPQQNGVVEQKNRTLQDMAQ
        K                              +LCLNLQQEKGQKIIRIGSDHGKEFDNEDLNNFCQ EGIHHEFAASITPQQNGVVE+KNRTLQDMAQ
Subjt:  KKYVLFVVDDYSRFTWVQFLKGKLDTVKICISLCLNLQQEKGQKIIRIGSDHGKEFDNEDLNNFCQLEGIHHEFAASITPQQNGVVEQKNRTLQDMAQ

TYK26041.1 gag/pol polyprotein [Cucumis melo var. makuwa] (e value: 2.6e-149; % identity: 75.65)
Query:  MTGNQSFFIELEECASGHVTFEDGAKARIIAKGNIDKSNLPCLNDVRYMDGLKANLISISQLCDQGYSVNFNNTGCVVTDKNNQVFMSGRRQADNCYHWN
        MTGN+SFF ELEECASGHVTF DGAK +IIAKGN+DKSNLP +N+VRY+DGLK NLIS+SQLCDQGYSVNFNNT CV TDKNNQVF+SGRR+A+NC HW+
Subjt:  MTGNQSFFIELEECASGHVTFEDGAKARIIAKGNIDKSNLPCLNDVRYMDGLKANLISISQLCDQGYSVNFNNTGCVVTDKNNQVFMSGRRQADNCYHWN

Query:  SNSSNICHLTKVYQSWLWHKKLGHVSLRSLDKVIRNEAVVGIPCFDINGKFFCGDCQVGKKTKTSHRSLKECYTIRVLELLHLDLMGPMQTESLGGKKYV
        SN SNICHLTKV Q+WLWH+KLGH+SLRSLDKVIRN+AVVGIP  DINGKFFCGDC+VGK+TK SHR LKECYTIRVLELLHLDL+GPM+TESLG KKYV
Subjt:  SNSSNICHLTKVYQSWLWHKKLGHVSLRSLDKVIRNEAVVGIPCFDINGKFFCGDCQVGKKTKTSHRSLKECYTIRVLELLHLDLMGPMQTESLGGKKYV

Query:  LFVVDDYSRFTWVQFLKGKLDTVKICISLCLNLQQEKGQKIIRIGSDHGKEFDNEDLNNFCQLEGIHHEFAASITPQQNGVVEQKNRTLQDMAQVMIHAK
        L VVDDYSRFT V+FLKGK DTVK+CISL LNLQ+EKGQKIIRI SDHGKEFDNEDLNNFCQ EGIHHEF A ITPQQNGVVE+KNRT            
Subjt:  LFVVDDYSRFTWVQFLKGKLDTVKICISLCLNLQQEKGQKIIRIGSDHGKEFDNEDLNNFCQLEGIHHEFAASITPQQNGVVEQKNRTLQDMAQVMIHAK

Query:  NLPLIFWTEAVNTACHIHNRVITRSSMSVTLYELWKGRKPNVKYF
                            V TRS  ++ LYELWKGRKPNVKYF
Subjt:  NLPLIFWTEAVNTACHIHNRVITRSSMSVTLYELWKGRKPNVKYF

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SMR2 Gag-pol polyprotein (e value: 1.7e-143; % identity: 88.04)
Query:  MTGNQSFFIELEECASGHVTFEDGAKARIIAKGNIDKSNLPCLNDVRYMDGLKANLISISQLCDQGYSVNFNNTGCVVTDKNNQVFMSGRRQADNCYHWN
        MTGN+SFF ELEEC S HVTFEDGAK RIIAKGNI+KSNLPCLN+VRYMDGLKANLISISQ+CDQGYSVNFNNTGCVVTDKNNQVFMSGRRQ DNCYHW+
Subjt:  MTGNQSFFIELEECASGHVTFEDGAKARIIAKGNIDKSNLPCLNDVRYMDGLKANLISISQLCDQGYSVNFNNTGCVVTDKNNQVFMSGRRQADNCYHWN

Query:  SNSSNICHLTKVYQSWLWHKKLGHVSLRSLDKVIRNEAVVGIPCFDINGKFFCGDCQVGKKTKTSHRSLKECYTIRVLELLHLDLMGPMQTESLGGKKYV
        SNSSNICHLTK  Q+WLWH+KLGH+S+RSLDKVIRNEAVV IP  DINGKFFCGDCQVGKKTK SH+SLKECYTIRV ELLHLDLMG MQTESLGGKKYV
Subjt:  SNSSNICHLTKVYQSWLWHKKLGHVSLRSLDKVIRNEAVVGIPCFDINGKFFCGDCQVGKKTKTSHRSLKECYTIRVLELLHLDLMGPMQTESLGGKKYV

Query:  LFVVDDYSRFTWVQFLKGKLDTVKICISLCLNLQQEKGQKIIRIGSDHGKEFDNEDLNNFCQLEGIHHEFAASITP
        L VVDDYSRFTWV FLKGK DTVK+CISLCLNLQ+EKG+KIIRI SDHGKEFDNEDLNNFCQ +GIHHEFAA ITP
Subjt:  LFVVDDYSRFTWVQFLKGKLDTVKICISLCLNLQQEKGQKIIRIGSDHGKEFDNEDLNNFCQLEGIHHEFAASITP

A0A5A7UVR7 F5J5.1 (e value: 3.3e-142; % identity: 82.31)
Query:  MTGNQSFFIELEECASGHVTFEDGAKARIIAKGNIDKSNLPCLNDVRYMDGLKANLISISQLCDQGYSVNFNNTGCVVTDKNNQVFMSGRRQADNCYHWN
        MT N+SFF ELEECASGHV F+DGAK +IIAKGNIDKSNLPCLN VRY+DGLK NLIS SQLCDQGYSVNFNNTGCVVT+KNNQVF+SG R+ADNCYHW+
Subjt:  MTGNQSFFIELEECASGHVTFEDGAKARIIAKGNIDKSNLPCLNDVRYMDGLKANLISISQLCDQGYSVNFNNTGCVVTDKNNQVFMSGRRQADNCYHWN

Query:  SNSSNICHLTKVYQSWLWHKKLGHVSLRSLDKVIRNEAVVGIPCFDINGKFFCGDCQVGKKTKTSHRSLKECYTIRVLELLHLDLMGPMQTESLGGKKYV
        SN SNICHLTK+ Q+WLWH+KLGH+SLRSLDKVIRNEA++GIP  DINGKFFCGDCQVGK+TKTSHR L ECYTI  LELLHLDL+  MQ ESLGGKKYV
Subjt:  SNSSNICHLTKVYQSWLWHKKLGHVSLRSLDKVIRNEAVVGIPCFDINGKFFCGDCQVGKKTKTSHRSLKECYTIRVLELLHLDLMGPMQTESLGGKKYV

Query:  LFVVDDYSRFTWVQFLKGKLDTVKICISLCLNLQQEKGQKIIRIGSDHGKEFDNEDLNNFCQLEGIHHEFAASITPQQNGVVEQKNRTLQDMAQ
          VVDDYSRFTWV+FLK K D VK+CISLCLNLQ+EKGQKIIRI SDHGK+FDNE+LNNFCQ EGIHHEFAA ITPQQN VVE+KNRTLQ+MA+
Subjt:  LFVVDDYSRFTWVQFLKGKLDTVKICISLCLNLQQEKGQKIIRIGSDHGKEFDNEDLNNFCQLEGIHHEFAASITPQQNGVVEQKNRTLQDMAQ

A0A5A7VM58 Gag-pol polyprotein (e value: 4.9e-146; % identity: 86.58)
Query:  MFKTMTGNQSFFIELEECASGHVTFEDGAKARIIAKGNIDKSNLPCLNDVRYMDGLKANLISISQLCDQGYSVNFNNTGCVVTDKNNQVFMSGRRQADNC
        MFKTMTGNQSFFIELEECASGHVTFEDGAKARIIAKGNIDKSNLP LNDVRYMDGLKANLISISQLCDQGYSVNFNNTGCVVTDKNNQVFMSGRRQADNC
Subjt:  MFKTMTGNQSFFIELEECASGHVTFEDGAKARIIAKGNIDKSNLPCLNDVRYMDGLKANLISISQLCDQGYSVNFNNTGCVVTDKNNQVFMSGRRQADNC

Query:  YHWNSNSSNICHLTKVYQSWLWHKKLGHVSLRSLDKVIRNEAVVGIPCFDINGKFFCGDCQVGKKTKTSHRSLKECYTIRVLELLHLDLMGPMQTESLGG
        YHWNSNSSNICHLTKVY++WLWHKKLGHVSLRSLDKVIRNEA+VGIP FDINGK FCGDCQVGKKTKTSHRSLKECYTIRVLELLHLD MGPMQTESLGG
Subjt:  YHWNSNSSNICHLTKVYQSWLWHKKLGHVSLRSLDKVIRNEAVVGIPCFDINGKFFCGDCQVGKKTKTSHRSLKECYTIRVLELLHLDLMGPMQTESLGG

Query:  KKYVLFVVDDYSRFTWVQFLKGKLDTVKICISLCLNLQQEKGQKIIRIGSDHGKEFDNEDLNNFCQLEGIHHEFAASITPQQNGVVEQKNRTLQDMAQ
        K                              +LCLNLQQEKGQKIIRIGSDHGKEFDNEDLNNFCQ EGIHHEFAASITPQQNGVVE+KNRTLQDMAQ
Subjt:  KKYVLFVVDDYSRFTWVQFLKGKLDTVKICISLCLNLQQEKGQKIIRIGSDHGKEFDNEDLNNFCQLEGIHHEFAASITPQQNGVVEQKNRTLQDMAQ

A0A5D3BA69 Gag-pol polyprotein (e value: 3.6e-173; % identity: 84.06)
Query:  MTGNQSFFIELEECASGHVTFEDGAKARIIAKGNIDKSNLPCLNDVRYMDGLKANLISISQLCDQGYSVNFNNTGCVVTDKNNQVFMSGRRQADNCYHWN
        MTGN+SFF ELEECA GH TF DGAK +IIAKGNIDKSNLP LN+VRY+DGLKANLISISQLCDQGYSVNFNNTG VVT+KNNQVFMSGRR+A+NCY+W+
Subjt:  MTGNQSFFIELEECASGHVTFEDGAKARIIAKGNIDKSNLPCLNDVRYMDGLKANLISISQLCDQGYSVNFNNTGCVVTDKNNQVFMSGRRQADNCYHWN

Query:  SNSSNICHLTKVYQSWLWHKKLGHVSLRSLDKVIRNEAVVGIPCFDINGKFFCGDCQVGKKTKTSHRSLKECYTIRVLELLHLDLMGPMQTESLGGKKYV
        SN SNICHLTK+ Q+WLWH+KLGH+SLRSLDKVIRNEAVVGIP  DINGKFFCG+CQVGK+TKTSHR LKECYTI VLELLHL+LMGPMQTESLGGKKYV
Subjt:  SNSSNICHLTKVYQSWLWHKKLGHVSLRSLDKVIRNEAVVGIPCFDINGKFFCGDCQVGKKTKTSHRSLKECYTIRVLELLHLDLMGPMQTESLGGKKYV

Query:  LFVVDDYSRFTWVQFLKGKLDTVKICISLCLNLQQEKGQKIIRIGSDHGKEFDNEDLNNFCQLEGIHHEFAASITPQQNGVVEQKNRTLQDMAQVMIHAK
        L VVDDYS+FTWV+FLK K DTVK+CISLCLNLQ+EKGQKIIRI SDHGKEFDNEDLNN CQ EGIHHEFAA ITPQQNGVVE+KNR LQ+MA+VMIHAK
Subjt:  LFVVDDYSRFTWVQFLKGKLDTVKICISLCLNLQQEKGQKIIRIGSDHGKEFDNEDLNNFCQLEGIHHEFAASITPQQNGVVEQKNRTLQDMAQVMIHAK

Query:  NLPLIFWTEAVNTACHIHNRVITRSSMSVTLYELWKGRKPNVKYF
        N PL FW E VNTACHIH RV TR   ++TLYELWKGRKPNVKYF
Subjt:  NLPLIFWTEAVNTACHIHNRVITRSSMSVTLYELWKGRKPNVKYF

A0A5D3DQT9 Gag/pol polyprotein (e value: 1.2e-149; % identity: 75.65)
Query:  MTGNQSFFIELEECASGHVTFEDGAKARIIAKGNIDKSNLPCLNDVRYMDGLKANLISISQLCDQGYSVNFNNTGCVVTDKNNQVFMSGRRQADNCYHWN
        MTGN+SFF ELEECASGHVTF DGAK +IIAKGN+DKSNLP +N+VRY+DGLK NLIS+SQLCDQGYSVNFNNT CV TDKNNQVF+SGRR+A+NC HW+
Subjt:  MTGNQSFFIELEECASGHVTFEDGAKARIIAKGNIDKSNLPCLNDVRYMDGLKANLISISQLCDQGYSVNFNNTGCVVTDKNNQVFMSGRRQADNCYHWN

Query:  SNSSNICHLTKVYQSWLWHKKLGHVSLRSLDKVIRNEAVVGIPCFDINGKFFCGDCQVGKKTKTSHRSLKECYTIRVLELLHLDLMGPMQTESLGGKKYV
        SN SNICHLTKV Q+WLWH+KLGH+SLRSLDKVIRN+AVVGIP  DINGKFFCGDC+VGK+TK SHR LKECYTIRVLELLHLDL+GPM+TESLG KKYV
Subjt:  SNSSNICHLTKVYQSWLWHKKLGHVSLRSLDKVIRNEAVVGIPCFDINGKFFCGDCQVGKKTKTSHRSLKECYTIRVLELLHLDLMGPMQTESLGGKKYV

Query:  LFVVDDYSRFTWVQFLKGKLDTVKICISLCLNLQQEKGQKIIRIGSDHGKEFDNEDLNNFCQLEGIHHEFAASITPQQNGVVEQKNRTLQDMAQVMIHAK
        L VVDDYSRFT V+FLKGK DTVK+CISL LNLQ+EKGQKIIRI SDHGKEFDNEDLNNFCQ EGIHHEF A ITPQQNGVVE+KNRT            
Subjt:  LFVVDDYSRFTWVQFLKGKLDTVKICISLCLNLQQEKGQKIIRIGSDHGKEFDNEDLNNFCQLEGIHHEFAASITPQQNGVVEQKNRTLQDMAQVMIHAK

Query:  NLPLIFWTEAVNTACHIHNRVITRSSMSVTLYELWKGRKPNVKYF
                            V TRS  ++ LYELWKGRKPNVKYF
Subjt:  NLPLIFWTEAVNTACHIHNRVITRSSMSVTLYELWKGRKPNVKYF

SwissProt top hits (e value, % identity, alignment)
A0A0B7P3V8 Transposon Ty4-P Gag-Pol polyprotein (e value: 7.8e-16; % identity: 26.36)
Query:  HKKLGHVSLRSLDKVIR-NEAVVGIPCFDINGKFFCGDCQVGKKTKTSH--RSLKECYTIRVL-ELLHLDLMGPMQTESLGGKKYVLFVVDDYSRFTWVQ
        HK++GH  ++ ++  I+ N     +       +F+C  C++ K TK +H   S+    T         +D+ GP+ + +   K+Y+L +VD+ +R+    
Subjt:  HKKLGHVSLRSLDKVIR-NEAVVGIPCFDINGKFFCGDCQVGKKTKTSH--RSLKECYTIRVL-ELLHLDLMGPMQTESLGGKKYVLFVVDDYSRFTWVQ

Query:  FLKGKLDTVKICISLCLNLQQEKGQ---KIIRIGSDHGKEFDNEDLNNFCQLEGIHHEFAASITPQQNGVVEQKNRTLQDMAQVMIHAKNLPLIFWTEAV
            K +   I   +  N+Q  + Q   K+  I SD G EF N+ +  +   +GIHH   ++     NG  E+  RT+   A  ++   NL + FW  AV
Subjt:  FLKGKLDTVKICISLCLNLQQEKGQ---KIIRIGSDHGKEFDNEDLNNFCQLEGIHHEFAASITPQQNGVVEQKNRTLQDMAQVMIHAKNLPLIFWTEAV

Query:  NTACHIHNRVITRSSMSVTL
         +A +I N +  +S+  + L
Subjt:  NTACHIHNRVITRSSMSVTL

P04146 Copia protein (e value: 6.2e-29; % identity: 27.1)
Query:  LNDVRYMDGLKANLISISQLCDQGYSVNFNNTGCVVTDKNNQVFMSGRRQADN--CYHWNSNSSNICHLTKVYQSWLWHKKLGHVSLRSLDKVIRNEAVV
        L DV +      NL+S+ +L + G S+ F+ +G V   KN  + +      +N    ++ + S N  H        LWH++ GH+S   L ++ R     
Subjt:  LNDVRYMDGLKANLISISQLCDQGYSVNFNNTGCVVTDKNNQVFMSGRRQADN--CYHWNSNSSNICHLTKVYQSWLWHKKLGHVSLRSLDKVIRNEAVV

Query:  G---IPCFDINGKFFCGDCQVGKKTKTSHRSLKE-CYTIRVLELLHLDLMGPMQTESLGGKKYVLFVVDDYSRFTWVQFLKGKLDTVKICISLCLNLQQE
            +   +++ +  C  C  GK+ +   + LK+  +  R L ++H D+ GP+   +L  K Y +  VD ++ +     +K K D   +        +  
Subjt:  G---IPCFDINGKFFCGDCQVGKKTKTSHRSLKE-CYTIRVLELLHLDLMGPMQTESLGGKKYVLFVVDDYSRFTWVQFLKGKLDTVKICISLCLNLQQE

Query:  KGQKIIRIGSDHGKEFDNEDLNNFCQLEGIHHEFAASITPQQNGVVEQKNRTLQDMAQVMIHAKNLPLIFWTEAVNTACHIHNRVITRS--SMSVTLYEL
           K++ +  D+G+E+ + ++  FC  +GI +      TPQ NGV E+  RT+ + A+ M+    L   FW EAV TA ++ NR+ +R+    S T YE+
Subjt:  KGQKIIRIGSDHGKEFDNEDLNNFCQLEGIHHEFAASITPQQNGVVEQKNRTLQDMAQVMIHAKNLPLIFWTEAVNTACHIHNRVITRS--SMSVTLYEL

Query:  WKGRKPNVKY
        W  +KP +K+
Subjt:  WKGRKPNVKY

P0C2J7 Transposon Ty4-H Gag-Pol polyprotein (e value: 7.8e-16; % identity: 26.36)
Query:  HKKLGHVSLRSLDKVIR-NEAVVGIPCFDINGKFFCGDCQVGKKTKTSH--RSLKECYTIRVL-ELLHLDLMGPMQTESLGGKKYVLFVVDDYSRFTWVQ
        HK++GH  ++ ++  I+ N     +       +F+C  C++ K TK +H   S+    T         +D+ GP+ + +   K+Y+L +VD+ +R+    
Subjt:  HKKLGHVSLRSLDKVIR-NEAVVGIPCFDINGKFFCGDCQVGKKTKTSH--RSLKECYTIRVL-ELLHLDLMGPMQTESLGGKKYVLFVVDDYSRFTWVQ

Query:  FLKGKLDTVKICISLCLNLQQEKGQ---KIIRIGSDHGKEFDNEDLNNFCQLEGIHHEFAASITPQQNGVVEQKNRTLQDMAQVMIHAKNLPLIFWTEAV
            K +   I   +  N+Q  + Q   K+  I SD G EF N+ +  +   +GIHH   ++     NG  E+  RT+   A  ++   NL + FW  AV
Subjt:  FLKGKLDTVKICISLCLNLQQEKGQ---KIIRIGSDHGKEFDNEDLNNFCQLEGIHHEFAASITPQQNGVVEQKNRTLQDMAQVMIHAKNLPLIFWTEAV

Query:  NTACHIHNRVITRSSMSVTL
         +A +I N +  +S+  + L
Subjt:  NTACHIHNRVITRSSMSVTL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 5.2e-36; % identity: 31.94)
Query:  GHVTFEDGAKARIIAKGNI-DKSNLPC---LNDVRYMDGLKANLISISQLCDQGYSVNFNNTGCVVTDKNNQVFMSGRRQADNCYHWNSN--SSNICHLT
        G V   + + ++I   G+I  K+N+ C   L DVR++  L+ NLIS   L   GY   F N    +T K + V   G  +    Y  N+      +    
Subjt:  GHVTFEDGAKARIIAKGNI-DKSNLPC---LNDVRYMDGLKANLISISQLCDQGYSVNFNNTGCVVTDKNNQVFMSGRRQADNCYHWNSN--SSNICHLT

Query:  KVYQSWLWHKKLGHVSLRSLDKVIRNEAVVGIPCFDINGKFFCGDCQVGKKTKTSHRSLKECYTIRVLELLHLDLMGPMQTESLGGKKYVLFVVDDYSRF
              LWHK++GH+S + L  + +   +       +     C  C  GK+ + S ++  E   + +L+L++ D+ GPM+ ES+GG KY +  +DD SR 
Subjt:  KVYQSWLWHKKLGHVSLRSLDKVIRNEAVVGIPCFDINGKFFCGDCQVGKKTKTSHRSLKECYTIRVLELLHLDLMGPMQTESLGGKKYVLFVVDDYSRF

Query:  TWVQFLKGKLDTVKICISLCLNLQQEKGQKIIRIGSDHGKEFDNEDLNNFCQLEGIHHEFAASITPQQNGVVEQKNRTLQDMAQVMIHAKNLPLIFWTEA
         WV  LK K    ++       +++E G+K+ R+ SD+G E+ + +   +C   GI HE     TPQ NGV E+ NRT+ +  + M+    LP  FW EA
Subjt:  TWVQFLKGKLDTVKICISLCLNLQQEKGQKIIRIGSDHGKEFDNEDLNNFCQLEGIHHEFAASITPQQNGVVEQKNRTLQDMAQVMIHAKNLPLIFWTEA

Query:  VNTACHIHNR
        V TAC++ NR
Subjt:  VNTACHIHNR

P47024 Transposon Ty4-J Gag-Pol polyprotein (e value: 1.0e-15; % identity: 26.36)
Query:  HKKLGHVSLRSLDKVIR-NEAVVGIPCFDINGKFFCGDCQVGKKTKTSH--RSLKECYTIRVL-ELLHLDLMGPMQTESLGGKKYVLFVVDDYSRFTWVQ
        HK++GH  ++ ++  I+ N     +       +F+C  C++ K TK +H   S+    T         +D+ GP+ + +   K+Y+L +VD+ +R+    
Subjt:  HKKLGHVSLRSLDKVIR-NEAVVGIPCFDINGKFFCGDCQVGKKTKTSH--RSLKECYTIRVL-ELLHLDLMGPMQTESLGGKKYVLFVVDDYSRFTWVQ

Query:  FLKGKLDTVKICISLCLNLQQEKGQ---KIIRIGSDHGKEFDNEDLNNFCQLEGIHHEFAASITPQQNGVVEQKNRTLQDMAQVMIHAKNLPLIFWTEAV
            K +   I   +  N+Q  + Q   K+  I SD G EF N+ +  +   +GIHH   ++     NG  E+  RT+   A  ++   NL + FW  AV
Subjt:  FLKGKLDTVKICISLCLNLQQEKGQ---KIIRIGSDHGKEFDNEDLNNFCQLEGIHHEFAASITPQQNGVVEQKNRTLQDMAQVMIHAKNLPLIFWTEAV

Query:  NTACHIHNRVITRSSMSVTL
         +A +I N +  +S+  + L
Subjt:  NTACHIHNRVITRSSMSVTL

Arabidopsis top hits (e value, % identity, alignment)
No hits found
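
The alignment blocks above appear to follow the usual BLAST-style layout, in which the middle row repeats a residue where query and subject are identical, shows `+` for a conservative substitution, and leaves a space for a mismatch or gap. As a rough sketch under that assumption, one row can be scored as shown below. The helper `score_midline` is hypothetical, and the percentages reported per hit cover the whole alignment, so a single row will generally not reproduce them.

```python
def score_midline(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, aligned_length) for one alignment row."""
    if len(query) != len(midline):
        raise ValueError("query and midline rows must have the same length")
    # A letter in the midline marks an identical residue pair.
    identities = sum(1 for ch in midline if ch.isalpha())
    # '+' marks a conservative substitution; positives include identities.
    positives = identities + midline.count("+")
    return identities, positives, len(query)

# Last row of the P04146 (Copia protein) alignment in this record.
query = "WKGRKPNVKY"
midline = "W  +KP +K+"
ident, pos, length = score_midline(query, midline)
print(f"identity {100 * ident / length:.1f}%, positives {100 * pos / length:.1f}%")
# identity 40.0%, positives 70.0%
```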

Sequences
CDS sequence
ATGTTCAAGACAATGACTGGCAATCAATCATTCTTTATTGAATTAGAGGAATGTGCCTCAGGTCATGTTACTTTTGAAGATGGAGCCAAAGCAAGAATTATTGCAAAAGG
AAACATTGATAAAAGTAATCTACCCTGTCTTAATGATGTTAGATATATGGATGGATTGAAGGCAAACTTGATTAGTATAAGTCAACTATGTGACCAAGGATACAGTGTAA
ATTTTAACAACACTGGTTGTGTTGTTACAGACAAAAATAATCAAGTGTTTATGAGTGGCAGACGACAAGCAGATAACTGTTATCATTGGAACTCCAATAGCTCAAACATA
TGTCACTTAACTAAAGTATATCAAAGCTGGTTGTGGCATAAAAAGTTGGGGCATGTTAGCTTGAGAAGCTTAGATAAAGTTATCAGAAACGAAGCTGTTGTAGGCATTCC
TTGTTTTGATATCAATGGAAAATTCTTTTGTGGTGACTGTCAAGTTGGAAAGAAAACTAAAACTTCTCACAGAAGTCTAAAGGAATGTTATACAATTAGAGTTCTTGAAC
TTCTACATCTTGATCTTATGGGTCCCATGCAAACTGAAAGTTTAGGTGGAAAGAAGTATGTTTTGTTTGTTGTGGATGATTATTCCAGATTTACTTGGGTTCAGTTCTTA
AAAGGAAAATTAGATACTGTTAAAATATGTATCAGTCTATGCTTGAATTTACAACAAGAAAAGGGGCAAAAGATAATCAGGATCGGTAGTGATCATGGGAAGGAGTTTGA
TAATGAAGATCTTAATAATTTCTGCCAGTTGGAAGGAATACATCATGAATTTGCTGCTTCCATAACTCCTCAGCAAAATGGAGTAGTGGAACAGAAGAACAGAACTTTAC
AAGATATGGCTCAAGTTATGATACATGCCAAAAATTTACCTTTGATTTTTTGGACCGAAGCTGTTAACACAGCATGTCATATTCACAACAGGGTTATTACTCGATCTAGT
ATGTCAGTCACATTATATGAACTATGGAAGGGAAGAAAGCCAAATGTTAAGTATTTTATATTTTTGGAAGTACTTGTTATATTCTAA
mRNA sequence
ATGTTCAAGACAATGACTGGCAATCAATCATTCTTTATTGAATTAGAGGAATGTGCCTCAGGTCATGTTACTTTTGAAGATGGAGCCAAAGCAAGAATTATTGCAAAAGG
AAACATTGATAAAAGTAATCTACCCTGTCTTAATGATGTTAGATATATGGATGGATTGAAGGCAAACTTGATTAGTATAAGTCAACTATGTGACCAAGGATACAGTGTAA
ATTTTAACAACACTGGTTGTGTTGTTACAGACAAAAATAATCAAGTGTTTATGAGTGGCAGACGACAAGCAGATAACTGTTATCATTGGAACTCCAATAGCTCAAACATA
TGTCACTTAACTAAAGTATATCAAAGCTGGTTGTGGCATAAAAAGTTGGGGCATGTTAGCTTGAGAAGCTTAGATAAAGTTATCAGAAACGAAGCTGTTGTAGGCATTCC
TTGTTTTGATATCAATGGAAAATTCTTTTGTGGTGACTGTCAAGTTGGAAAGAAAACTAAAACTTCTCACAGAAGTCTAAAGGAATGTTATACAATTAGAGTTCTTGAAC
TTCTACATCTTGATCTTATGGGTCCCATGCAAACTGAAAGTTTAGGTGGAAAGAAGTATGTTTTGTTTGTTGTGGATGATTATTCCAGATTTACTTGGGTTCAGTTCTTA
AAAGGAAAATTAGATACTGTTAAAATATGTATCAGTCTATGCTTGAATTTACAACAAGAAAAGGGGCAAAAGATAATCAGGATCGGTAGTGATCATGGGAAGGAGTTTGA
TAATGAAGATCTTAATAATTTCTGCCAGTTGGAAGGAATACATCATGAATTTGCTGCTTCCATAACTCCTCAGCAAAATGGAGTAGTGGAACAGAAGAACAGAACTTTAC
AAGATATGGCTCAAGTTATGATACATGCCAAAAATTTACCTTTGATTTTTTGGACCGAAGCTGTTAACACAGCATGTCATATTCACAACAGGGTTATTACTCGATCTAGT
ATGTCAGTCACATTATATGAACTATGGAAGGGAAGAAAGCCAAATGTTAAGTATTTTATATTTTTGGAAGTACTTGTTATATTCTAA
Protein sequence
MFKTMTGNQSFFIELEECASGHVTFEDGAKARIIAKGNIDKSNLPCLNDVRYMDGLKANLISISQLCDQGYSVNFNNTGCVVTDKNNQVFMSGRRQADNCYHWNSNSSNI
CHLTKVYQSWLWHKKLGHVSLRSLDKVIRNEAVVGIPCFDINGKFFCGDCQVGKKTKTSHRSLKECYTIRVLELLHLDLMGPMQTESLGGKKYVLFVVDDYSRFTWVQFL
KGKLDTVKICISLCLNLQQEKGQKIIRIGSDHGKEFDNEDLNNFCQLEGIHHEFAASITPQQNGVVEQKNRTLQDMAQVMIHAKNLPLIFWTEAVNTACHIHNRVITRSS
MSVTLYELWKGRKPNVKYFIFLEVLVIF
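
The protein sequence can be related back to the CDS by codon-wise translation. The following is a minimal sketch that assumes the standard genetic code applies to this nuclear gene. The `translate` helper and the table construction are illustrative and not taken from CuGenDB. Only the first 30 and last 12 nucleotides of the CDS are used, to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First 30 nt and last 12 nt of the CDS listed in this record.
print(translate("ATGTTCAAGACAATGACTGGCAATCAATCA"))  # MFKTMTGNQS
print(translate("GTTATATTCTAA"))                    # VIF*
```

Both fragments agree with the start (`MFKTMTGNQS`) and end (`VIF`) of the protein sequence above, with the final `TAA` codon read as the stop.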