; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc04g0101851 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc04g0101851
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationCMiso1.1chr04:18378902..18379708
RNA-Seq ExpressionCmc04g0101851
SyntenyCmc04g0101851
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0016020 - membrane (cellular component)
GO:0000166 - nucleotide binding (molecular function)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0040427.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]1.1e-11581.13Show/hide
Query:  MGNDGSTNAVGIGGVHLKNINGSRLILKNVKHIPDIRMNLISTSKLDNKGFCNTFDNGIWKLTEGFMVIAKGQKISLLYYVDAKIIDSDINIVNGEANVE
        MGNDG TN VGIG VHLKN NGSRLILKNVKHIPDI MNLIST KLD++GFCNTFDNGIWKLT+G MVIA GQK S LYY+DAKIID DIN VN EANVE
Subjt:  MGNDGSTNAVGIGGVHLKNINGSRLILKNVKHIPDIRMNLISTSKLDNKGFCNTFDNGIWKLTEGFMVIAKGQKISLLYYVDAKIIDSDINIVNGEANVE

Query:  LWQMRLSHMSEKGLKILIKKNHLLDLKSAPLKWFAHYLTGKQTSVTFKSSQHSRKPNVLELVHFNVCSPMKTKSLWGASYFVTFTNDHSRKIWVYTLKTK
        LW  RLSHMSEKGLKIL KKNHL DLKS PLK   H L GKQT VTFKSSQHSRK NVLELVH NVC  MKTKSL GA YFVTFT+DHSRKIWVYTLKTK
Subjt:  LWQMRLSHMSEKGLKILIKKNHLLDLKSAPLKWFAHYLTGKQTSVTFKSSQHSRKPNVLELVHFNVCSPMKTKSLWGASYFVTFTNDHSRKIWVYTLKTK

Query:  YQVLQVFKQFHASIEREIGEKLNCIRTDNGGKYYGPFDEYCRNHGIQHQKTPLKTLQLNWIAERL
            QVFKQFHAS+ERE GEKL CIRTDNGG+Y GPFDEYCRNHGI+HQKTP K+ QLN IA+RL
Subjt:  YQVLQVFKQFHASIEREIGEKLNCIRTDNGGKYYGPFDEYCRNHGIQHQKTPLKTLQLNWIAERL

KAA0047570.1 putative retrotransposon [Cucumis melo var. makuwa]4.8e-11678.99Show/hide
Query:  MGNDGSTNAVGIGGVHLKNINGSRLILKNVKHIPDIRMNLISTSKLDNKGFCNTFDNGIWKLTEGFMVIAKGQKISLLYYVDAKIIDSDINIVNGEANVE
        MGNDGS N VGIG VHL N NGSRLILKNVKHI DIRMNLIST KLD++GFCNTFDNGIWKLT+G +VIA+G K S LYY+DAKIIDSDIN VN E N+E
Subjt:  MGNDGSTNAVGIGGVHLKNINGSRLILKNVKHIPDIRMNLISTSKLDNKGFCNTFDNGIWKLTEGFMVIAKGQKISLLYYVDAKIIDSDINIVNGEANVE

Query:  LWQMRLSHMSEKGLKILIKK-----------NHLLDLKSAPLKWFAHYLTGKQTSVTFKSSQHSRKPNVLELVHFNVCSPMKTKSLWGASYFVTFTNDHS
        LW  RLSHMSEKGLKIL KK           NHL DLKS PLK   H L GKQT VTFKSSQHSRKPNVLELVH NVC PMKTKSL GA YFVTFT+DHS
Subjt:  LWQMRLSHMSEKGLKILIKK-----------NHLLDLKSAPLKWFAHYLTGKQTSVTFKSSQHSRKPNVLELVHFNVCSPMKTKSLWGASYFVTFTNDHS

Query:  RKIWVYTLKTKYQVLQVFKQFHASIEREIGEKLNCIRTDNGGKYYGPFDEYCRNHGIQHQKTPLKTLQLNWIAERL
        RKIWVYTLKTK QVLQVFKQFHAS+ERE GEKL CIRTDNGG+Y GPFDEYCRNHGI+HQKTP KT QLN IAERL
Subjt:  RKIWVYTLKTKYQVLQVFKQFHASIEREIGEKLNCIRTDNGGKYYGPFDEYCRNHGIQHQKTPLKTLQLNWIAERL

KAA0065636.1 putative retrotransposon [Cucumis melo var. makuwa]2.3e-15098.13Show/hide
Query:  MGNDGSTNAVGIGGVHLKNINGSRLILKNVKHIPDIRMNLISTSKLDNKGFCNTFDNGIWKLTEGFMVIAKGQKISLLYYVDAKIIDSDINIVNGEANVE
        MGNDGSTNAVGIGGVHLKNINGSRLILKNVKHIPDIRMNLISTSKLDN+GFCNTFDNGIWKLTEGFMVIAKGQKISLLYYVDAKIIDSDIN VNGEANVE
Subjt:  MGNDGSTNAVGIGGVHLKNINGSRLILKNVKHIPDIRMNLISTSKLDNKGFCNTFDNGIWKLTEGFMVIAKGQKISLLYYVDAKIIDSDINIVNGEANVE

Query:  LWQMRLSHMSEKGLKILIKKNHLLDLKSAPLKWFAHYLTGKQTSVTFKSSQHSRKPNVLELVHFNVCSPMKTKSLWGASYFVTFTNDHSRKIWVYTLKTK
        LWQMRLSHMSEKGLKILIKKNHL DLKSAPLKWFAHYL GKQTSVTFKSSQHSRKPNVLELVHFNVCSPMKTKSLWGASYFVTFTNDHSRKIWVYTLKTK
Subjt:  LWQMRLSHMSEKGLKILIKKNHLLDLKSAPLKWFAHYLTGKQTSVTFKSSQHSRKPNVLELVHFNVCSPMKTKSLWGASYFVTFTNDHSRKIWVYTLKTK

Query:  YQVLQVFKQFHASIEREIGEKLNCIRTDNGGKYYGPFDEYCRNHGIQHQKTPLKTLQLNWIAERLKEH
        YQVLQVFKQFHASIEREIGEKLNCIRTDNGGKYYGPFDEYCRNHGIQHQKTPLKTLQLN IAERLKEH
Subjt:  YQVLQVFKQFHASIEREIGEKLNCIRTDNGGKYYGPFDEYCRNHGIQHQKTPLKTLQLNWIAERLKEH

RVW24012.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]2.5e-9666.79Show/hide
Query:  MGNDGSTNAVGIGGVHLKNINGSRLILKNVKHIPDIRMNLISTSKLDNKGFCNTFDNGIWKLTEGFMVIAKGQKISLLYYVDAKIIDSDINIVNGEANVE
        MGNDGS  A+G+G V L+  NG+ L LKNVKHI DIRMNLIST KLD++GFCNTF +  WKLT G MVIAKG K S LY + A++IDS IN V+ ++  E
Subjt:  MGNDGSTNAVGIGGVHLKNINGSRLILKNVKHIPDIRMNLISTSKLDNKGFCNTFDNGIWKLTEGFMVIAKGQKISLLYYVDAKIIDSDINIVNGEANVE

Query:  LWQMRLSHMSEKGLKILIKKNHLLDLKSAPLKWFAHYLTGKQTSVTFKSSQHSRKPNVLELVHFNVCSPMKTKSLWGASYFVTFTNDHSRKIWVYTLKTK
        LW  RL HMSEKGL IL KKN L  +K   LK  AHYL GKQT V FK+ +H+RKP +L+LV+ +VC PMKTK+L G+ YFVTF +DHSRKIWVYTLKTK
Subjt:  LWQMRLSHMSEKGLKILIKKNHLLDLKSAPLKWFAHYLTGKQTSVTFKSSQHSRKPNVLELVHFNVCSPMKTKSLWGASYFVTFTNDHSRKIWVYTLKTK

Query:  YQVLQVFKQFHASIEREIGEKLNCIRTDNGGKYYGPFDEYCRNHGIQHQKTPLKTLQLNWIAERL
         QVL VFKQFHA +ER+ GEKL CIRTDNGG+Y GPFDEYCR HGI+HQKTP KT QLN +AER+
Subjt:  YQVLQVFKQFHASIEREIGEKLNCIRTDNGGKYYGPFDEYCRNHGIQHQKTPLKTLQLNWIAERL

TYJ98688.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]1.1e-11581.13Show/hide
Query:  MGNDGSTNAVGIGGVHLKNINGSRLILKNVKHIPDIRMNLISTSKLDNKGFCNTFDNGIWKLTEGFMVIAKGQKISLLYYVDAKIIDSDINIVNGEANVE
        MGNDG TN VGIG VHLKN NGSRLILKNVKHIPDI MNLIST KLD++GFCNTFDNGIWKLT+G MVIA GQK S LYY+DAKIID DIN VN EANVE
Subjt:  MGNDGSTNAVGIGGVHLKNINGSRLILKNVKHIPDIRMNLISTSKLDNKGFCNTFDNGIWKLTEGFMVIAKGQKISLLYYVDAKIIDSDINIVNGEANVE

Query:  LWQMRLSHMSEKGLKILIKKNHLLDLKSAPLKWFAHYLTGKQTSVTFKSSQHSRKPNVLELVHFNVCSPMKTKSLWGASYFVTFTNDHSRKIWVYTLKTK
        LW  RLSHMSEKGLKIL KKNHL DLKS PLK   H L GKQT VTFKSSQHSRK NVLELVH NVC  MKTKSL GA YFVTFT+DHSRKIWVYTLKTK
Subjt:  LWQMRLSHMSEKGLKILIKKNHLLDLKSAPLKWFAHYLTGKQTSVTFKSSQHSRKPNVLELVHFNVCSPMKTKSLWGASYFVTFTNDHSRKIWVYTLKTK

Query:  YQVLQVFKQFHASIEREIGEKLNCIRTDNGGKYYGPFDEYCRNHGIQHQKTPLKTLQLNWIAERL
            QVFKQFHAS+ERE GEKL CIRTDNGG+Y GPFDEYCRNHGI+HQKTP K+ QLN IA+RL
Subjt:  YQVLQVFKQFHASIEREIGEKLNCIRTDNGGKYYGPFDEYCRNHGIQHQKTPLKTLQLNWIAERL

TrEMBL top hitse value%identityAlignment
A0A438CLC2 Retrovirus-related Pol polyprotein from transposon TNT 1-941.2e-9666.79Show/hide
Query:  MGNDGSTNAVGIGGVHLKNINGSRLILKNVKHIPDIRMNLISTSKLDNKGFCNTFDNGIWKLTEGFMVIAKGQKISLLYYVDAKIIDSDINIVNGEANVE
        MGNDGS  A+G+G V L+  NG+ L LKNVKHI DIRMNLIST KLD++GFCNTF +  WKLT G MVIAKG K S LY + A++IDS IN V+ ++  E
Subjt:  MGNDGSTNAVGIGGVHLKNINGSRLILKNVKHIPDIRMNLISTSKLDNKGFCNTFDNGIWKLTEGFMVIAKGQKISLLYYVDAKIIDSDINIVNGEANVE

Query:  LWQMRLSHMSEKGLKILIKKNHLLDLKSAPLKWFAHYLTGKQTSVTFKSSQHSRKPNVLELVHFNVCSPMKTKSLWGASYFVTFTNDHSRKIWVYTLKTK
        LW  RL HMSEKGL IL KKN L  +K   LK  AHYL GKQT V FK+ +H+RKP +L+LV+ +VC PMKTK+L G+ YFVTF +DHSRKIWVYTLKTK
Subjt:  LWQMRLSHMSEKGLKILIKKNHLLDLKSAPLKWFAHYLTGKQTSVTFKSSQHSRKPNVLELVHFNVCSPMKTKSLWGASYFVTFTNDHSRKIWVYTLKTK

Query:  YQVLQVFKQFHASIEREIGEKLNCIRTDNGGKYYGPFDEYCRNHGIQHQKTPLKTLQLNWIAERL
         QVL VFKQFHA +ER+ GEKL CIRTDNGG+Y GPFDEYCR HGI+HQKTP KT QLN +AER+
Subjt:  YQVLQVFKQFHASIEREIGEKLNCIRTDNGGKYYGPFDEYCRNHGIQHQKTPLKTLQLNWIAERL

A0A5A7TFU1 Retrovirus-related Pol polyprotein from transposon TNT 1-945.2e-11681.13Show/hide
Query:  MGNDGSTNAVGIGGVHLKNINGSRLILKNVKHIPDIRMNLISTSKLDNKGFCNTFDNGIWKLTEGFMVIAKGQKISLLYYVDAKIIDSDINIVNGEANVE
        MGNDG TN VGIG VHLKN NGSRLILKNVKHIPDI MNLIST KLD++GFCNTFDNGIWKLT+G MVIA GQK S LYY+DAKIID DIN VN EANVE
Subjt:  MGNDGSTNAVGIGGVHLKNINGSRLILKNVKHIPDIRMNLISTSKLDNKGFCNTFDNGIWKLTEGFMVIAKGQKISLLYYVDAKIIDSDINIVNGEANVE

Query:  LWQMRLSHMSEKGLKILIKKNHLLDLKSAPLKWFAHYLTGKQTSVTFKSSQHSRKPNVLELVHFNVCSPMKTKSLWGASYFVTFTNDHSRKIWVYTLKTK
        LW  RLSHMSEKGLKIL KKNHL DLKS PLK   H L GKQT VTFKSSQHSRK NVLELVH NVC  MKTKSL GA YFVTFT+DHSRKIWVYTLKTK
Subjt:  LWQMRLSHMSEKGLKILIKKNHLLDLKSAPLKWFAHYLTGKQTSVTFKSSQHSRKPNVLELVHFNVCSPMKTKSLWGASYFVTFTNDHSRKIWVYTLKTK

Query:  YQVLQVFKQFHASIEREIGEKLNCIRTDNGGKYYGPFDEYCRNHGIQHQKTPLKTLQLNWIAERL
            QVFKQFHAS+ERE GEKL CIRTDNGG+Y GPFDEYCRNHGI+HQKTP K+ QLN IA+RL
Subjt:  YQVLQVFKQFHASIEREIGEKLNCIRTDNGGKYYGPFDEYCRNHGIQHQKTPLKTLQLNWIAERL

A0A5D3BKF7 Retrovirus-related Pol polyprotein from transposon TNT 1-945.2e-11681.13Show/hide
Query:  MGNDGSTNAVGIGGVHLKNINGSRLILKNVKHIPDIRMNLISTSKLDNKGFCNTFDNGIWKLTEGFMVIAKGQKISLLYYVDAKIIDSDINIVNGEANVE
        MGNDG TN VGIG VHLKN NGSRLILKNVKHIPDI MNLIST KLD++GFCNTFDNGIWKLT+G MVIA GQK S LYY+DAKIID DIN VN EANVE
Subjt:  MGNDGSTNAVGIGGVHLKNINGSRLILKNVKHIPDIRMNLISTSKLDNKGFCNTFDNGIWKLTEGFMVIAKGQKISLLYYVDAKIIDSDINIVNGEANVE

Query:  LWQMRLSHMSEKGLKILIKKNHLLDLKSAPLKWFAHYLTGKQTSVTFKSSQHSRKPNVLELVHFNVCSPMKTKSLWGASYFVTFTNDHSRKIWVYTLKTK
        LW  RLSHMSEKGLKIL KKNHL DLKS PLK   H L GKQT VTFKSSQHSRK NVLELVH NVC  MKTKSL GA YFVTFT+DHSRKIWVYTLKTK
Subjt:  LWQMRLSHMSEKGLKILIKKNHLLDLKSAPLKWFAHYLTGKQTSVTFKSSQHSRKPNVLELVHFNVCSPMKTKSLWGASYFVTFTNDHSRKIWVYTLKTK

Query:  YQVLQVFKQFHASIEREIGEKLNCIRTDNGGKYYGPFDEYCRNHGIQHQKTPLKTLQLNWIAERL
            QVFKQFHAS+ERE GEKL CIRTDNGG+Y GPFDEYCRNHGI+HQKTP K+ QLN IA+RL
Subjt:  YQVLQVFKQFHASIEREIGEKLNCIRTDNGGKYYGPFDEYCRNHGIQHQKTPLKTLQLNWIAERL

A0A5D3C706 Putative retrotransposon1.1e-15098.13Show/hide
Query:  MGNDGSTNAVGIGGVHLKNINGSRLILKNVKHIPDIRMNLISTSKLDNKGFCNTFDNGIWKLTEGFMVIAKGQKISLLYYVDAKIIDSDINIVNGEANVE
        MGNDGSTNAVGIGGVHLKNINGSRLILKNVKHIPDIRMNLISTSKLDN+GFCNTFDNGIWKLTEGFMVIAKGQKISLLYYVDAKIIDSDIN VNGEANVE
Subjt:  MGNDGSTNAVGIGGVHLKNINGSRLILKNVKHIPDIRMNLISTSKLDNKGFCNTFDNGIWKLTEGFMVIAKGQKISLLYYVDAKIIDSDINIVNGEANVE

Query:  LWQMRLSHMSEKGLKILIKKNHLLDLKSAPLKWFAHYLTGKQTSVTFKSSQHSRKPNVLELVHFNVCSPMKTKSLWGASYFVTFTNDHSRKIWVYTLKTK
        LWQMRLSHMSEKGLKILIKKNHL DLKSAPLKWFAHYL GKQTSVTFKSSQHSRKPNVLELVHFNVCSPMKTKSLWGASYFVTFTNDHSRKIWVYTLKTK
Subjt:  LWQMRLSHMSEKGLKILIKKNHLLDLKSAPLKWFAHYLTGKQTSVTFKSSQHSRKPNVLELVHFNVCSPMKTKSLWGASYFVTFTNDHSRKIWVYTLKTK

Query:  YQVLQVFKQFHASIEREIGEKLNCIRTDNGGKYYGPFDEYCRNHGIQHQKTPLKTLQLNWIAERLKEH
        YQVLQVFKQFHASIEREIGEKLNCIRTDNGGKYYGPFDEYCRNHGIQHQKTPLKTLQLN IAERLKEH
Subjt:  YQVLQVFKQFHASIEREIGEKLNCIRTDNGGKYYGPFDEYCRNHGIQHQKTPLKTLQLNWIAERLKEH

A0A5D3CVK2 Putative retrotransposon2.3e-11678.99Show/hide
Query:  MGNDGSTNAVGIGGVHLKNINGSRLILKNVKHIPDIRMNLISTSKLDNKGFCNTFDNGIWKLTEGFMVIAKGQKISLLYYVDAKIIDSDINIVNGEANVE
        MGNDGS N VGIG VHL N NGSRLILKNVKHI DIRMNLIST KLD++GFCNTFDNGIWKLT+G +VIA+G K S LYY+DAKIIDSDIN VN E N+E
Subjt:  MGNDGSTNAVGIGGVHLKNINGSRLILKNVKHIPDIRMNLISTSKLDNKGFCNTFDNGIWKLTEGFMVIAKGQKISLLYYVDAKIIDSDINIVNGEANVE

Query:  LWQMRLSHMSEKGLKILIKK-----------NHLLDLKSAPLKWFAHYLTGKQTSVTFKSSQHSRKPNVLELVHFNVCSPMKTKSLWGASYFVTFTNDHS
        LW  RLSHMSEKGLKIL KK           NHL DLKS PLK   H L GKQT VTFKSSQHSRKPNVLELVH NVC PMKTKSL GA YFVTFT+DHS
Subjt:  LWQMRLSHMSEKGLKILIKK-----------NHLLDLKSAPLKWFAHYLTGKQTSVTFKSSQHSRKPNVLELVHFNVCSPMKTKSLWGASYFVTFTNDHS

Query:  RKIWVYTLKTKYQVLQVFKQFHASIEREIGEKLNCIRTDNGGKYYGPFDEYCRNHGIQHQKTPLKTLQLNWIAERL
        RKIWVYTLKTK QVLQVFKQFHAS+ERE GEKL CIRTDNGG+Y GPFDEYCRNHGI+HQKTP KT QLN IAERL
Subjt:  RKIWVYTLKTKYQVLQVFKQFHASIEREIGEKLNCIRTDNGGKYYGPFDEYCRNHGIQHQKTPLKTLQLNWIAERL

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.0e-1526.85Show/hide
Query:  NGSRLILKNVKHIPDIRMNLISTSKLDNKGFCNTFD-NGIWKLTEGFMVIAKGQKISLLYYVDAKIIDSDINIVNG--EANVELWQMRLSHMSEKGLKIL
        N   + L++V    +   NL+S  +L   G    FD +G+     G MV+     ++     +  +I+     +N   + N  LW  R  H+S+  L  +
Subjt:  NGSRLILKNVKHIPDIRMNLISTSKLDNKGFCNTFD-NGIWKLTEGFMVIAKGQKISLLYYVDAKIIDSDINIVNG--EANVELWQMRLSHMSEKGLKIL

Query:  IKKN-----HLLDLKSAPLKWFAHYLTGKQTSVTF---KSSQHSRKPNVLELVHFNVCSPMKTKSLWGASYFVTFTNDHSRKIWVYTLKTKYQVLQVFKQ
         +KN      LL+      +     L GKQ  + F   K   H ++P  L +VH +VC P+   +L   +YFV F +  +     Y +K K  V  +F+ 
Subjt:  IKKN-----HLLDLKSAPLKWFAHYLTGKQTSVTF---KSSQHSRKPNVLELVHFNVCSPMKTKSLWGASYFVTFTNDHSRKIWVYTLKTKYQVLQVFKQ

Query:  FHASIEREIGEKLNCIRTDNGGKYY-GPFDEYCRNHGIQHQKTPLKTLQLNWIAERL
        F A  E     K+  +  DNG +Y      ++C   GI +  T   T QLN ++ER+
Subjt:  FHASIEREIGEKLNCIRTDNGGKYY-GPFDEYCRNHGIQHQKTPLKTLQLNWIAERL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-946.8e-6548.5Show/hide
Query:  MGNDGSTNAVGIGGVHLKNINGSRLILKNVKHIPDIRMNLISTSKLDNKGFCNTFDNGIWKLTEGFMVIAKGQKISLLYYVDAKIIDSDINIVNGEANVE
        MGN   +   GIG + +K   G  L+LK+V+H+PD+RMNLIS   LD  G+ + F N  W+LT+G +VIAKG     LY  +A+I   ++N    E +V+
Subjt:  MGNDGSTNAVGIGGVHLKNINGSRLILKNVKHIPDIRMNLISTSKLDNKGFCNTFDNGIWKLTEGFMVIAKGQKISLLYYVDAKIIDSDINIVNGEANVE

Query:  LWQMRLSHMSEKGLKILIKKNHLLDLKSAPLKWFAHYLTGKQTSVTFKSSQHSRKPNVLELVHFNVCSPMKTKSLWGASYFVTFTNDHSRKIWVYTLKTK
        LW  R+ HMSEKGL+IL KK+ +   K   +K   + L GKQ  V+F++S   RK N+L+LV+ +VC PM+ +S+ G  YFVTF +D SRK+WVY LKTK
Subjt:  LWQMRLSHMSEKGLKILIKKNHLLDLKSAPLKWFAHYLTGKQTSVTFKSSQHSRKPNVLELVHFNVCSPMKTKSLWGASYFVTFTNDHSRKIWVYTLKTK

Query:  YQVLQVFKQFHASIEREIGEKLNCIRTDNGGKYYG-PFDEYCRNHGIQHQKTPLKTLQLNWIAERL
         QV QVF++FHA +ERE G KL  +R+DNGG+Y    F+EYC +HGI+H+KT   T Q N +AER+
Subjt:  YQVLQVFKQFHASIEREIGEKLNCIRTDNGGKYYG-PFDEYCRNHGIQHQKTPLKTLQLNWIAERL

P25384 Transposon Ty2-C Gag-Pol polyprotein2.7e-1324.55Show/hide
Query:  IGGVHLKNINGSRLILKNVKHIPDIRMNLISTSKLDNKGFCNTFDNGIWKLTEGFM---VIAKGQ--KISLLYYVDAKIIDSDINIVNGEANVE-----L
        IG +H    NG++  +K + H P+I  +L+S S+L N+     F     + ++G +   ++  G    +S  Y + + I    IN VN   +V      L
Subjt:  IGGVHLKNINGSRLILKNVKHIPDIRMNLISTSKLDNKGFCNTFDNGIWKLTEGFM---VIAKGQ--KISLLYYVDAKIIDSDINIVNGEANVE-----L

Query:  WQMRLSHMSEKGLKILIKKNHLLDLKSAPLKW-------FAHYLTGKQTS---VTFKSSQHSRKPNVLELVHFNVCSPMKTKSLWGASYFVTFTNDHSRK
            L H + + ++  +KKN +  LK + ++W           L GK T    V     ++       + +H ++  P+        SYF++FT++ +R 
Subjt:  WQMRLSHMSEKGLKILIKKNHLLDLKSAPLKW-------FAHYLTGKQTS---VTFKSSQHSRKPNVLELVHFNVCSPMKTKSLWGASYFVTFTNDHSRK

Query:  IWVYTL--KTKYQVLQVFKQFHASIEREIGEKLNCIRTDNGGKYYG-PFDEYCRNHGIQHQKTPLKTLQLNWIAERL
         WVY L  + +  +L VF    A I+ +   ++  I+ D G +Y      ++  N GI    T     + + +AERL
Subjt:  IWVYTL--KTKYQVLQVFKQFHASIEREIGEKLNCIRTDNGGKYYG-PFDEYCRNHGIQHQKTPLKTLQLNWIAERL

Q12491 Transposon Ty2-B Gag-Pol polyprotein2.7e-1324.55Show/hide
Query:  IGGVHLKNINGSRLILKNVKHIPDIRMNLISTSKLDNKGFCNTFDNGIWKLTEGFM---VIAKGQ--KISLLYYVDAKIIDSDINIVNGEANVE-----L
        IG +H    NG++  +K + H P+I  +L+S S+L N+     F     + ++G +   ++  G    +S  Y + + I    IN VN   +V      L
Subjt:  IGGVHLKNINGSRLILKNVKHIPDIRMNLISTSKLDNKGFCNTFDNGIWKLTEGFM---VIAKGQ--KISLLYYVDAKIIDSDINIVNGEANVE-----L

Query:  WQMRLSHMSEKGLKILIKKNHLLDLKSAPLKW-------FAHYLTGKQTS---VTFKSSQHSRKPNVLELVHFNVCSPMKTKSLWGASYFVTFTNDHSRK
            L H + + ++  +KKN +  LK + ++W           L GK T    V     ++       + +H ++  P+        SYF++FT++ +R 
Subjt:  WQMRLSHMSEKGLKILIKKNHLLDLKSAPLKW-------FAHYLTGKQTS---VTFKSSQHSRKPNVLELVHFNVCSPMKTKSLWGASYFVTFTNDHSRK

Query:  IWVYTL--KTKYQVLQVFKQFHASIEREIGEKLNCIRTDNGGKYYG-PFDEYCRNHGIQHQKTPLKTLQLNWIAERL
         WVY L  + +  +L VF    A I+ +   ++  I+ D G +Y      ++  N GI    T     + + +AERL
Subjt:  IWVYTL--KTKYQVLQVFKQFHASIEREIGEKLNCIRTDNGGKYYG-PFDEYCRNHGIQHQKTPLKTLQLNWIAERL

Q12501 Transposon Ty2-OR2 Gag-Pol polyprotein2.1e-1324.55Show/hide
Query:  IGGVHLKNINGSRLILKNVKHIPDIRMNLISTSKLDNKGFCNTFDNGIWKLTEGFM---VIAKGQ--KISLLYYVDAKIIDSDINIVNGEANVE-----L
        IG +H    NG++  +K + H P+I  +L+S S+L N+     F     + ++G +   ++  G    +S  Y + + I    IN VN   +V      L
Subjt:  IGGVHLKNINGSRLILKNVKHIPDIRMNLISTSKLDNKGFCNTFDNGIWKLTEGFM---VIAKGQ--KISLLYYVDAKIIDSDINIVNGEANVE-----L

Query:  WQMRLSHMSEKGLKILIKKNHLLDLKSAPLKW-------FAHYLTGKQTS---VTFKSSQHSRKPNVLELVHFNVCSPMKTKSLWGASYFVTFTNDHSRK
            L H + + ++  +KKN +  LK + ++W           L GK T    V     ++       + +H ++  P+        SYF++FT++ +R 
Subjt:  WQMRLSHMSEKGLKILIKKNHLLDLKSAPLKW-------FAHYLTGKQTS---VTFKSSQHSRKPNVLELVHFNVCSPMKTKSLWGASYFVTFTNDHSRK

Query:  IWVYTL--KTKYQVLQVFKQFHASIEREIGEKLNCIRTDNGGKYYG-PFDEYCRNHGIQHQKTPLKTLQLNWIAERL
         WVY L  + +  +L VF    A I+ +   ++  I+ D G +Y      ++  N GI    T     + + +AERL
Subjt:  IWVYTL--KTKYQVLQVFKQFHASIEREIGEKLNCIRTDNGGKYYG-PFDEYCRNHGIQHQKTPLKTLQLNWIAERL

Arabidopsis top hitse value%identityAlignment
ATMG00300.1 Gag-Pol-related retrotransposon family protein2.6e-1131.97Show/hide
Query:  GIWKLTEGFMVIAKGQKISLLYYVDAKIIDSDINIV-NGEANVELWQMRLSHMSEKGLKILIKKNHLLDLKSAPLKWFAHYLTGKQTSVTFKSSQHSRKP
        G+ K+ +G   I KG +   LY +   +   + N+    +    LW  RL+HMS++G+++L+KK  L   K + LK+    + GK   V F + QH+ K 
Subjt:  GIWKLTEGFMVIAKGQKISLLYYVDAKIIDSDINIV-NGEANVELWQMRLSHMSEKGLKILIKKNHLLDLKSAPLKWFAHYLTGKQTSVTFKSSQHSRKP

Query:  NVLELVHFNVCSPMKTKSLWGA
        N L+ VH           LWGA
Subjt:  NVLELVHFNVCSPMKTKSLWGA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCAATGACGGATCAACAAATGCAGTTGGTATCGGAGGTGTACACTTGAAGAATATAAATGGCTCTAGGCTGATTTTGAAAAATGTGAAACATATTCCTGATATTCG
TATGAACTTGATTTCTACAAGTAAGCTTGATAACAAAGGTTTTTGCAATACCTTCGATAATGGAATATGGAAGCTTACTGAAGGTTTTATGGTTATAGCAAAGGGACAAA
AAATTTCTTTATTGTACTACGTGGATGCTAAAATCATAGATTCTGATATAAATATAGTAAATGGTGAAGCGAATGTTGAGCTTTGGCAGATGAGACTTAGCCATATGAGT
GAGAAGGGTTTAAAGATTTTAATTAAGAAAAACCATCTTCTTGATTTAAAGAGTGCACCTCTAAAATGGTTTGCTCATTATTTGACAGGAAAGCAAACATCGGTTACCTT
TAAATCATCTCAACATTCAAGAAAGCCAAATGTACTAGAGTTGGTACATTTTAATGTGTGTAGTCCCATGAAAACAAAATCGCTTTGGGGTGCATCGTATTTTGTGACAT
TTACTAATGATCATTCAAGGAAAATATGGGTTTACACCTTGAAGACTAAATATCAAGTGTTGCAAGTGTTTAAACAGTTCCATGCCTCTATTGAAAGAGAAATTGGTGAA
AAGCTTAACTGCATTAGAACTGATAATGGAGGTAAGTATTATGGACCTTTTGATGAATATTGCAGAAATCATGGCATTCAACATCAAAAGACACCTCTTAAGACCCTACA
GTTAAATTGGATAGCTGAAAGATTGAAAGAACATTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGCAATGACGGATCAACAAATGCAGTTGGTATCGGAGGTGTACACTTGAAGAATATAAATGGCTCTAGGCTGATTTTGAAAAATGTGAAACATATTCCTGATATTCG
TATGAACTTGATTTCTACAAGTAAGCTTGATAACAAAGGTTTTTGCAATACCTTCGATAATGGAATATGGAAGCTTACTGAAGGTTTTATGGTTATAGCAAAGGGACAAA
AAATTTCTTTATTGTACTACGTGGATGCTAAAATCATAGATTCTGATATAAATATAGTAAATGGTGAAGCGAATGTTGAGCTTTGGCAGATGAGACTTAGCCATATGAGT
GAGAAGGGTTTAAAGATTTTAATTAAGAAAAACCATCTTCTTGATTTAAAGAGTGCACCTCTAAAATGGTTTGCTCATTATTTGACAGGAAAGCAAACATCGGTTACCTT
TAAATCATCTCAACATTCAAGAAAGCCAAATGTACTAGAGTTGGTACATTTTAATGTGTGTAGTCCCATGAAAACAAAATCGCTTTGGGGTGCATCGTATTTTGTGACAT
TTACTAATGATCATTCAAGGAAAATATGGGTTTACACCTTGAAGACTAAATATCAAGTGTTGCAAGTGTTTAAACAGTTCCATGCCTCTATTGAAAGAGAAATTGGTGAA
AAGCTTAACTGCATTAGAACTGATAATGGAGGTAAGTATTATGGACCTTTTGATGAATATTGCAGAAATCATGGCATTCAACATCAAAAGACACCTCTTAAGACCCTACA
GTTAAATTGGATAGCTGAAAGATTGAAAGAACATTAG
Protein sequenceShow/hide protein sequence
MGNDGSTNAVGIGGVHLKNINGSRLILKNVKHIPDIRMNLISTSKLDNKGFCNTFDNGIWKLTEGFMVIAKGQKISLLYYVDAKIIDSDINIVNGEANVELWQMRLSHMS
EKGLKILIKKNHLLDLKSAPLKWFAHYLTGKQTSVTFKSSQHSRKPNVLELVHFNVCSPMKTKSLWGASYFVTFTNDHSRKIWVYTLKTKYQVLQVFKQFHASIEREIGE
KLNCIRTDNGGKYYGPFDEYCRNHGIQHQKTPLKTLQLNWIAERLKEH