; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc01g0023711 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc01g0023711
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag-pol polyprotein
Genome locationCMiso1.1chr01:23659244..23660245
RNA-Seq ExpressionCmc01g0023711
SyntenyCmc01g0023711
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0036855.1 gag-pol polyprotein [Cucumis melo var. makuwa]1.4e-16587.99Show/hide
Query:  MGPMQTESLRGKKYVLVVVDDYFIFTWVRFLKEKSDTIKLCISLCVNLQREKGQKIIRVRSDHGKEFDNEDLNNFCQTKGIHHEFAAPITSQQNGVVERK
        MGPMQTESL GKKYVLVVVDDY  FTWVRFLKEK DT+KLCISLC+NLQREKGQKIIR+RSDHGKEFDNEDLNN CQT+GIHHEFAAPIT QQNGVVERK
Subjt:  MGPMQTESLRGKKYVLVVVDDYFIFTWVRFLKEKSDTIKLCISLCVNLQREKGQKIIRVRSDHGKEFDNEDLNNFCQTKGIHHEFAAPITSQQNGVVERK

Query:  NRTLQEMARVMIHANNLPLNFLAEAVNTVCHIHNRVTTRSGTTVTLYELWKGRKPNVKYFHIFGSTCYILADREYHRKWDVKSDQGIFLGYSHNNRAYRV
        NR LQEMARVMIHA N PLNF AE VNT CHIH RVTTR GTT+TLYELWKGRKPNVKYFHIFGSTCYILADREYHRKWDVKS+QGIFL YS N+RAYRV
Subjt:  NRTLQEMARVMIHANNLPLNFLAEAVNTVCHIHNRVTTRSGTTVTLYELWKGRKPNVKYFHIFGSTCYILADREYHRKWDVKSDQGIFLGYSHNNRAYRV

Query:  FNIKSGTVMETINVVVNDFESNVNQFNIEDDETHVTPEVTSTPLDEMPKGDSQPDSAKTNSNITDEVINNETVLVPSAHVKKNHLSSSIIGDPSAGITTK
        FNIKSGTVME INVVVNDFESNVNQFN EDDET+VTP+VT TPLDEMPKGD QP+SAK NSNITDEVINNETV+ PSAHVKKNH SSSIIGD SAGITT+
Subjt:  FNIKSGTVMETINVVVNDFESNVNQFNIEDDETHVTPEVTSTPLDEMPKGDSQPDSAKTNSNITDEVINNETVLVPSAHVKKNHLSSSIIGDPSAGITTK

Query:  WKEKVDYTKMIADLCYVSAIEPTSVENALKDEY
         KEKVDY KMIADLCY SAIEP SVENALKDEY
Subjt:  WKEKVDYTKMIADLCYVSAIEPTSVENALKDEY

KAA0042206.1 gag-pol polyprotein [Cucumis melo var. makuwa]7.2e-14682.28Show/hide
Query:  MGPMQTESLRGKKYVLVVVDDYFIFTWVRFLKEKSDTIKLCISLCVNLQREKGQKIIRVRSDHGKEFDNEDLNNFCQTKGIHHEFAAPITSQQNGVVERK
        MGPMQTESLRGKKYVLVVVDDYFIFTWVRFLKEKSDTIKLCISLCVNLQREKGQKII+VRSDHGKEFDNEDLNNFCQTKGIHHEF APITSQQNGVVERK
Subjt:  MGPMQTESLRGKKYVLVVVDDYFIFTWVRFLKEKSDTIKLCISLCVNLQREKGQKIIRVRSDHGKEFDNEDLNNFCQTKGIHHEFAAPITSQQNGVVERK

Query:  NRTLQEMARVMIHANNLPLNFLAEAVNTVCHIHNRVTTRSGTTVTLYELWKGRKPNVKYFHIFGSTCYILADREYHRKWDVKSDQGIFLGYSHNNRAYRV
        NRTLQEMARVMIHANNLPLNFLAEAVNTVCHI  +                         H    TCYILADREYHRKWDVKSDQGIFLGYSHN+RAYRV
Subjt:  NRTLQEMARVMIHANNLPLNFLAEAVNTVCHIHNRVTTRSGTTVTLYELWKGRKPNVKYFHIFGSTCYILADREYHRKWDVKSDQGIFLGYSHNNRAYRV

Query:  FNIKSGTVMETINVVVNDFESNVNQFNIEDDETHVTPEVTSTPLDEMPKGDSQPDSAKTNSNITDEVINNETVLVPSAHVKKNHLSSSIIGDPSAGITTK
        FNIKSGTVME INVVVNDFESNVNQFNIEDDETHVTPEVTSTPLDEMPKGDSQPDSAKTNSNITDEVINNETVLVPSAHVKKNHLSSSI+GDPSAGITTK
Subjt:  FNIKSGTVMETINVVVNDFESNVNQFNIEDDETHVTPEVTSTPLDEMPKGDSQPDSAKTNSNITDEVINNETVLVPSAHVKKNHLSSSIIGDPSAGITTK

Query:  WKEKVDYTKMIADLCYVSAIEPTSVENALKDEY
        WKEK                      NALKDEY
Subjt:  WKEKVDYTKMIADLCYVSAIEPTSVENALKDEY

KAA0045248.1 gag-pol polyprotein [Cucumis melo var. makuwa]4.1e-14178.72Show/hide
Query:  QTESLRGKKYVLVVVDDYFIFTWVRFLKEKSDTIKLCISLCVNLQREKGQKIIRVRSDHGKEFDNEDLNNFCQTKGIHHEFAAPITSQQNGVVERKNRTL
        +T   R KKYVLVVVDDY  FTWVRFLK KSDT+KLCI++C+NLQREKGQKII++RSDHGKEFDNEDLNNFCQT+GIHHEF APIT QQNGVV+RKNR  
Subjt:  QTESLRGKKYVLVVVDDYFIFTWVRFLKEKSDTIKLCISLCVNLQREKGQKIIRVRSDHGKEFDNEDLNNFCQTKGIHHEFAAPITSQQNGVVERKNRTL

Query:  QEMARVMIHANNLPLNFLAEAVNTVCHIHNRVTTRSGTTVTLYELWKGRKPNVKYFHIFGSTCYILADREYHRKWDVKSDQGIFLGYSHNNRAYRVFNIK
                                       VTT+S  T TLYELWKG+KPNVKYFHIFGSTCYILADREYHRKWDVKSDQ IFLGYS N+RAYRVFNIK
Subjt:  QEMARVMIHANNLPLNFLAEAVNTVCHIHNRVTTRSGTTVTLYELWKGRKPNVKYFHIFGSTCYILADREYHRKWDVKSDQGIFLGYSHNNRAYRVFNIK

Query:  SGTVMETINVVVNDFESNVNQFNIEDDETHVTPEVTSTPLDEMPKGDSQPDSAKTNSNITDEVINNETVLVPSAHVKKNHLSSSIIGDPSAGITTKWKEK
        S TVME INVVVNDFESNVNQFNIEDDETHVTP+VT TPLDEMPKGD QPDSAKTNSNITDEVINNETVL+PSAHVKKNH SSSIIGDPS GITT+ KEK
Subjt:  SGTVMETINVVVNDFESNVNQFNIEDDETHVTPEVTSTPLDEMPKGDSQPDSAKTNSNITDEVINNETVLVPSAHVKKNHLSSSIIGDPSAGITTKWKEK

Query:  VDYTKMIADLCYVSAIEPTSVENALKDEY
        VDY KMIADLCYVSAIEPTSVENALKDEY
Subjt:  VDYTKMIADLCYVSAIEPTSVENALKDEY

KAA0060126.1 gag-pol polyprotein [Cucumis melo var. makuwa]2.3e-16088.24Show/hide
Query:  GKKYVLVVVDDYFIFTWVRFLKEKSDTIKLCISLCVNLQREKGQKIIRVRSDHGKEFDNEDLNNFCQTKGIHHEFAAPITSQQNGVVERKNRTLQEMARV
        G+KYVLVVVDDY  FTWVRFLK K DT KLCI+LC+NLQREK QKIIR+R +HG EF+NEDLNNFCQT+GIHHEFAAPIT QQNGVVERKNRTLQEMARV
Subjt:  GKKYVLVVVDDYFIFTWVRFLKEKSDTIKLCISLCVNLQREKGQKIIRVRSDHGKEFDNEDLNNFCQTKGIHHEFAAPITSQQNGVVERKNRTLQEMARV

Query:  MIHANNLPLNFLAEAVNTVCHIHNRVTTRSGTTVTLYELWKGRKPNVKYFHIFGSTCYILADREYHRKWDVKSDQGIFLGYSHNNRAYRVFNIKSGTVME
        MIHA NLPLNF AEAVNT CHIH+RVTTRSGTTVTLYELWKGRKPN+KYFHIFGS CYILADR+YHRKWDVKSDQ IFLGYS N+RAYRVFNIKSGTVME
Subjt:  MIHANNLPLNFLAEAVNTVCHIHNRVTTRSGTTVTLYELWKGRKPNVKYFHIFGSTCYILADREYHRKWDVKSDQGIFLGYSHNNRAYRVFNIKSGTVME

Query:  TINVVVNDFESNVNQFNIEDDETHVTPEVTSTPLDEMPKGDSQPDSAKTNSNITDEVINNETVLVPSAHVKKNHLSSSIIGDPSAGITTKWKEKVDYTKM
        TINVVVNDFESN+NQFNIE+DETHVTPEVTSTPLDEM KG+SQ DSAKT S+ITDEVINNETVLVPSAHVKKNH  SS+IGDPSAGITT+ KEKVDYTKM
Subjt:  TINVVVNDFESNVNQFNIEDDETHVTPEVTSTPLDEMPKGDSQPDSAKTNSNITDEVINNETVLVPSAHVKKNHLSSSIIGDPSAGITTKWKEKVDYTKM

Query:  IADLCYVSAIEPTSVENALKDEY
        IADLCYVSAIEPTSVENALKDEY
Subjt:  IADLCYVSAIEPTSVENALKDEY

TYK21443.1 gag-pol polyprotein [Cucumis melo var. makuwa]8.8e-13681.55Show/hide
Query:  LCVNLQREKGQKIIRVRSDHGKEFDNEDLNNFCQTKGIHHEFAAPITSQQNGVVERKNRTLQEMARVMIHANNLPLNFLAEAVNTVCHIHNRVTTRSGTT
        LC+NLQREKGQKIIR+RSDHGKEFDNEDLNNFCQT GIHH+F  PIT QQNGVVE +N TLQEMARVMIHA NLPLNF AEAVNT CHIHNRVTTRSG T
Subjt:  LCVNLQREKGQKIIRVRSDHGKEFDNEDLNNFCQTKGIHHEFAAPITSQQNGVVERKNRTLQEMARVMIHANNLPLNFLAEAVNTVCHIHNRVTTRSGTT

Query:  VTLYELWKGRKPNVKYFHIFGSTCYILADREYHRKWDVKSDQGIFLGYSHNNRAYRVFNIKSGTVMETINVVVNDFESNVNQFNIEDDETHVTPEVTSTP
        VTLYEL KGRKPNVKYFHIFGSTCYILADREYHRKWD KS QGIFLGYS N+RAYRVFNIKSGTVMETINVVVNDFESNVNQFNIEDDETHVTPEVTSTP
Subjt:  VTLYELWKGRKPNVKYFHIFGSTCYILADREYHRKWDVKSDQGIFLGYSHNNRAYRVFNIKSGTVMETINVVVNDFESNVNQFNIEDDETHVTPEVTSTP

Query:  LDEMPKGDSQPDSAKTNSNITDEVINNETVLVPSAHVKKNHLSSSIIGDPSAGITTKWKEKVDYTKMIA-------------------DLCYVSAIEPTS
        LDEMPKG+SQ  SAKT+S+ITDEVINNET+LVPSAHVKKNH SSSII DPSAGITT+ KE ++    I+                   DLCYVSAIEPTS
Subjt:  LDEMPKGDSQPDSAKTNSNITDEVINNETVLVPSAHVKKNHLSSSIIGDPSAGITTKWKEKVDYTKMIA-------------------DLCYVSAIEPTS

Query:  VENALKDEY
        VEN+LKDEY
Subjt:  VENALKDEY

TrEMBL top hitse value%identityAlignment
A0A5A7TQU8 Gag-pol polyprotein2.0e-14178.72Show/hide
Query:  QTESLRGKKYVLVVVDDYFIFTWVRFLKEKSDTIKLCISLCVNLQREKGQKIIRVRSDHGKEFDNEDLNNFCQTKGIHHEFAAPITSQQNGVVERKNRTL
        +T   R KKYVLVVVDDY  FTWVRFLK KSDT+KLCI++C+NLQREKGQKII++RSDHGKEFDNEDLNNFCQT+GIHHEF APIT QQNGVV+RKNR  
Subjt:  QTESLRGKKYVLVVVDDYFIFTWVRFLKEKSDTIKLCISLCVNLQREKGQKIIRVRSDHGKEFDNEDLNNFCQTKGIHHEFAAPITSQQNGVVERKNRTL

Query:  QEMARVMIHANNLPLNFLAEAVNTVCHIHNRVTTRSGTTVTLYELWKGRKPNVKYFHIFGSTCYILADREYHRKWDVKSDQGIFLGYSHNNRAYRVFNIK
                                       VTT+S  T TLYELWKG+KPNVKYFHIFGSTCYILADREYHRKWDVKSDQ IFLGYS N+RAYRVFNIK
Subjt:  QEMARVMIHANNLPLNFLAEAVNTVCHIHNRVTTRSGTTVTLYELWKGRKPNVKYFHIFGSTCYILADREYHRKWDVKSDQGIFLGYSHNNRAYRVFNIK

Query:  SGTVMETINVVVNDFESNVNQFNIEDDETHVTPEVTSTPLDEMPKGDSQPDSAKTNSNITDEVINNETVLVPSAHVKKNHLSSSIIGDPSAGITTKWKEK
        S TVME INVVVNDFESNVNQFNIEDDETHVTP+VT TPLDEMPKGD QPDSAKTNSNITDEVINNETVL+PSAHVKKNH SSSIIGDPS GITT+ KEK
Subjt:  SGTVMETINVVVNDFESNVNQFNIEDDETHVTPEVTSTPLDEMPKGDSQPDSAKTNSNITDEVINNETVLVPSAHVKKNHLSSSIIGDPSAGITTKWKEK

Query:  VDYTKMIADLCYVSAIEPTSVENALKDEY
        VDY KMIADLCYVSAIEPTSVENALKDEY
Subjt:  VDYTKMIADLCYVSAIEPTSVENALKDEY

A0A5A7V0X1 Gag-pol polyprotein1.1e-16088.24Show/hide
Query:  GKKYVLVVVDDYFIFTWVRFLKEKSDTIKLCISLCVNLQREKGQKIIRVRSDHGKEFDNEDLNNFCQTKGIHHEFAAPITSQQNGVVERKNRTLQEMARV
        G+KYVLVVVDDY  FTWVRFLK K DT KLCI+LC+NLQREK QKIIR+R +HG EF+NEDLNNFCQT+GIHHEFAAPIT QQNGVVERKNRTLQEMARV
Subjt:  GKKYVLVVVDDYFIFTWVRFLKEKSDTIKLCISLCVNLQREKGQKIIRVRSDHGKEFDNEDLNNFCQTKGIHHEFAAPITSQQNGVVERKNRTLQEMARV

Query:  MIHANNLPLNFLAEAVNTVCHIHNRVTTRSGTTVTLYELWKGRKPNVKYFHIFGSTCYILADREYHRKWDVKSDQGIFLGYSHNNRAYRVFNIKSGTVME
        MIHA NLPLNF AEAVNT CHIH+RVTTRSGTTVTLYELWKGRKPN+KYFHIFGS CYILADR+YHRKWDVKSDQ IFLGYS N+RAYRVFNIKSGTVME
Subjt:  MIHANNLPLNFLAEAVNTVCHIHNRVTTRSGTTVTLYELWKGRKPNVKYFHIFGSTCYILADREYHRKWDVKSDQGIFLGYSHNNRAYRVFNIKSGTVME

Query:  TINVVVNDFESNVNQFNIEDDETHVTPEVTSTPLDEMPKGDSQPDSAKTNSNITDEVINNETVLVPSAHVKKNHLSSSIIGDPSAGITTKWKEKVDYTKM
        TINVVVNDFESN+NQFNIE+DETHVTPEVTSTPLDEM KG+SQ DSAKT S+ITDEVINNETVLVPSAHVKKNH  SS+IGDPSAGITT+ KEKVDYTKM
Subjt:  TINVVVNDFESNVNQFNIEDDETHVTPEVTSTPLDEMPKGDSQPDSAKTNSNITDEVINNETVLVPSAHVKKNHLSSSIIGDPSAGITTKWKEKVDYTKM

Query:  IADLCYVSAIEPTSVENALKDEY
        IADLCYVSAIEPTSVENALKDEY
Subjt:  IADLCYVSAIEPTSVENALKDEY

A0A5D3BA69 Gag-pol polyprotein6.8e-16687.99Show/hide
Query:  MGPMQTESLRGKKYVLVVVDDYFIFTWVRFLKEKSDTIKLCISLCVNLQREKGQKIIRVRSDHGKEFDNEDLNNFCQTKGIHHEFAAPITSQQNGVVERK
        MGPMQTESL GKKYVLVVVDDY  FTWVRFLKEK DT+KLCISLC+NLQREKGQKIIR+RSDHGKEFDNEDLNN CQT+GIHHEFAAPIT QQNGVVERK
Subjt:  MGPMQTESLRGKKYVLVVVDDYFIFTWVRFLKEKSDTIKLCISLCVNLQREKGQKIIRVRSDHGKEFDNEDLNNFCQTKGIHHEFAAPITSQQNGVVERK

Query:  NRTLQEMARVMIHANNLPLNFLAEAVNTVCHIHNRVTTRSGTTVTLYELWKGRKPNVKYFHIFGSTCYILADREYHRKWDVKSDQGIFLGYSHNNRAYRV
        NR LQEMARVMIHA N PLNF AE VNT CHIH RVTTR GTT+TLYELWKGRKPNVKYFHIFGSTCYILADREYHRKWDVKS+QGIFL YS N+RAYRV
Subjt:  NRTLQEMARVMIHANNLPLNFLAEAVNTVCHIHNRVTTRSGTTVTLYELWKGRKPNVKYFHIFGSTCYILADREYHRKWDVKSDQGIFLGYSHNNRAYRV

Query:  FNIKSGTVMETINVVVNDFESNVNQFNIEDDETHVTPEVTSTPLDEMPKGDSQPDSAKTNSNITDEVINNETVLVPSAHVKKNHLSSSIIGDPSAGITTK
        FNIKSGTVME INVVVNDFESNVNQFN EDDET+VTP+VT TPLDEMPKGD QP+SAK NSNITDEVINNETV+ PSAHVKKNH SSSIIGD SAGITT+
Subjt:  FNIKSGTVMETINVVVNDFESNVNQFNIEDDETHVTPEVTSTPLDEMPKGDSQPDSAKTNSNITDEVINNETVLVPSAHVKKNHLSSSIIGDPSAGITTK

Query:  WKEKVDYTKMIADLCYVSAIEPTSVENALKDEY
         KEKVDY KMIADLCY SAIEP SVENALKDEY
Subjt:  WKEKVDYTKMIADLCYVSAIEPTSVENALKDEY

A0A5D3DCZ8 Gag-pol polyprotein4.3e-13681.55Show/hide
Query:  LCVNLQREKGQKIIRVRSDHGKEFDNEDLNNFCQTKGIHHEFAAPITSQQNGVVERKNRTLQEMARVMIHANNLPLNFLAEAVNTVCHIHNRVTTRSGTT
        LC+NLQREKGQKIIR+RSDHGKEFDNEDLNNFCQT GIHH+F  PIT QQNGVVE +N TLQEMARVMIHA NLPLNF AEAVNT CHIHNRVTTRSG T
Subjt:  LCVNLQREKGQKIIRVRSDHGKEFDNEDLNNFCQTKGIHHEFAAPITSQQNGVVERKNRTLQEMARVMIHANNLPLNFLAEAVNTVCHIHNRVTTRSGTT

Query:  VTLYELWKGRKPNVKYFHIFGSTCYILADREYHRKWDVKSDQGIFLGYSHNNRAYRVFNIKSGTVMETINVVVNDFESNVNQFNIEDDETHVTPEVTSTP
        VTLYEL KGRKPNVKYFHIFGSTCYILADREYHRKWD KS QGIFLGYS N+RAYRVFNIKSGTVMETINVVVNDFESNVNQFNIEDDETHVTPEVTSTP
Subjt:  VTLYELWKGRKPNVKYFHIFGSTCYILADREYHRKWDVKSDQGIFLGYSHNNRAYRVFNIKSGTVMETINVVVNDFESNVNQFNIEDDETHVTPEVTSTP

Query:  LDEMPKGDSQPDSAKTNSNITDEVINNETVLVPSAHVKKNHLSSSIIGDPSAGITTKWKEKVDYTKMIA-------------------DLCYVSAIEPTS
        LDEMPKG+SQ  SAKT+S+ITDEVINNET+LVPSAHVKKNH SSSII DPSAGITT+ KE ++    I+                   DLCYVSAIEPTS
Subjt:  LDEMPKGDSQPDSAKTNSNITDEVINNETVLVPSAHVKKNHLSSSIIGDPSAGITTKWKEKVDYTKMIA-------------------DLCYVSAIEPTS

Query:  VENALKDEY
        VEN+LKDEY
Subjt:  VENALKDEY

A0A5D3DSN1 Gag-pol polyprotein3.5e-14682.28Show/hide
Query:  MGPMQTESLRGKKYVLVVVDDYFIFTWVRFLKEKSDTIKLCISLCVNLQREKGQKIIRVRSDHGKEFDNEDLNNFCQTKGIHHEFAAPITSQQNGVVERK
        MGPMQTESLRGKKYVLVVVDDYFIFTWVRFLKEKSDTIKLCISLCVNLQREKGQKII+VRSDHGKEFDNEDLNNFCQTKGIHHEF APITSQQNGVVERK
Subjt:  MGPMQTESLRGKKYVLVVVDDYFIFTWVRFLKEKSDTIKLCISLCVNLQREKGQKIIRVRSDHGKEFDNEDLNNFCQTKGIHHEFAAPITSQQNGVVERK

Query:  NRTLQEMARVMIHANNLPLNFLAEAVNTVCHIHNRVTTRSGTTVTLYELWKGRKPNVKYFHIFGSTCYILADREYHRKWDVKSDQGIFLGYSHNNRAYRV
        NRTLQEMARVMIHANNLPLNFLAEAVNTVCHI  +                         H    TCYILADREYHRKWDVKSDQGIFLGYSHN+RAYRV
Subjt:  NRTLQEMARVMIHANNLPLNFLAEAVNTVCHIHNRVTTRSGTTVTLYELWKGRKPNVKYFHIFGSTCYILADREYHRKWDVKSDQGIFLGYSHNNRAYRV

Query:  FNIKSGTVMETINVVVNDFESNVNQFNIEDDETHVTPEVTSTPLDEMPKGDSQPDSAKTNSNITDEVINNETVLVPSAHVKKNHLSSSIIGDPSAGITTK
        FNIKSGTVME INVVVNDFESNVNQFNIEDDETHVTPEVTSTPLDEMPKGDSQPDSAKTNSNITDEVINNETVLVPSAHVKKNHLSSSI+GDPSAGITTK
Subjt:  FNIKSGTVMETINVVVNDFESNVNQFNIEDDETHVTPEVTSTPLDEMPKGDSQPDSAKTNSNITDEVINNETVLVPSAHVKKNHLSSSIIGDPSAGITTK

Query:  WKEKVDYTKMIADLCYVSAIEPTSVENALKDEY
        WKEK                      NALKDEY
Subjt:  WKEKVDYTKMIADLCYVSAIEPTSVENALKDEY

SwissProt top hitse value%identityAlignment
P04146 Copia protein8.6e-2530.14Show/hide
Query:  GPMQTESLRGKKYVLVVVDDYFIFTWVRFLKEKSDTIKLCISLCVNLQREKGQKIIRVRSDHGKEFDNEDLNNFCQTKGIHHEFAAPITSQQNGVVERKN
        GP+   +L  K Y ++ VD +  +     +K KSD   +        +     K++ +  D+G+E+ + ++  FC  KGI +    P T Q NGV ER  
Subjt:  GPMQTESLRGKKYVLVVVDDYFIFTWVRFLKEKSDTIKLCISLCVNLQREKGQKIIRVRSDHGKEFDNEDLNNFCQTKGIHHEFAAPITSQQNGVVERKN

Query:  RTLQEMARVMIHANNLPLNFLAEAVNTVCHIHNRVTTRS--GTTVTLYELWKGRKPNVKYFHIFGSTCYILADREYHRKWDVKSDQGIFLGYSHNNRAYR
        RT+ E AR M+    L  +F  EAV T  ++ NR+ +R+   ++ T YE+W  +KP +K+  +FG+T Y+   +    K+D KS + IF+GY  N   ++
Subjt:  RTLQEMARVMIHANNLPLNFLAEAVNTVCHIHNRVTTRS--GTTVTLYELWKGRKPNVKYFHIFGSTCYILADREYHRKWDVKSDQGIFLGYSHNNRAYR

Query:  VFNIKSGTVMETINVVVND
        +++  +   +   +VVV++
Subjt:  VFNIKSGTVMETINVVVND

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-946.8e-3030.83Show/hide
Query:  GPMQTESLRGKKYVLVVVDDYFIFTWVRFLKEKSDTIKLCISLCVNLQREKGQKIIRVRSDHGKEFDNEDLNNFCQTKGIHHEFAAPITSQQNGVVERKN
        GPM+ ES+ G KY +  +DD     WV  LK K    ++       ++RE G+K+ R+RSD+G E+ + +   +C + GI HE   P T Q NGV ER N
Subjt:  GPMQTESLRGKKYVLVVVDDYFIFTWVRFLKEKSDTIKLCISLCVNLQREKGQKIIRVRSDHGKEFDNEDLNNFCQTKGIHHEFAAPITSQQNGVVERKN

Query:  RTLQEMARVMIHANNLPLNFLAEAVNTVCHIHNRVTTRSGTTVTLYELWKGRKPNVKYFHIFGSTCYILADREYHRKWDVKSDQGIFLGYSHNNRAYRVF
        RT+ E  R M+    LP +F  EAV T C++ NR  +          +W  ++ +  +  +FG   +    +E   K D KS   IF+GY      YR++
Subjt:  RTLQEMARVMIHANNLPLNFLAEAVNTVCHIHNRVTTRSGTTVTLYELWKGRKPNVKYFHIFGSTCYILADREYHRKWDVKSDQGIFLGYSHNNRAYRVF

Query:  NIKSGTVMETINVVVNDFESNVNQFNIEDDETHVTPEVTSTPLDEMPKGDSQPDSAKTNSNITDEV
        +     V+ + +VV  + E        E  +  + P   +     +P   + P SA++    TDEV
Subjt:  NIKSGTVMETINVVVNDFESNVNQFNIEDDETHVTPEVTSTPLDEMPKGDSQPDSAKTNSNITDEV

P25384 Transposon Ty2-C Gag-Pol polyprotein6.2e-0721.28Show/hide
Query:  GPMQTESLRGKKYVLVVVDDYFIFTWVRFLKEKSD--TIKLCISLCVNLQREKGQKIIRVRSDHGKEFDNEDLNNFCQTKGIHHEFAAPITSQQNGVVER
        GP+         Y +   D+   F WV  L ++ +   + +  S+   ++ +   +++ ++ D G E+ N+ L+ F   +GI   +     S+ +GV ER
Subjt:  GPMQTESLRGKKYVLVVVDDYFIFTWVRFLKEKSD--TIKLCISLCVNLQREKGQKIIRVRSDHGKEFDNEDLNNFCQTKGIHHEFAAPITSQQNGVVER

Query:  KNRTLQEMARVMIHANNLPLNFLAEAVNTVCHIHNR-VTTRSGTTVTLYELWKGRKPNVKYFHIFGSTCYILADREYHRKWDVKSDQGIFLGYSHNNRAY
         NRTL    R ++H + LP +    AV     I N  V+ ++  +   +    G   ++     FG    I+ +     K   +   G  L  S N+  Y
Subjt:  KNRTLQEMARVMIHANNLPLNFLAEAVNTVCHIHNR-VTTRSGTTVTLYELWKGRKPNVKYFHIFGSTCYILADREYHRKWDVKSDQGIFLGYSHNNRAY

Query:  RVFNIKSGTVMETIN-VVVNDFESNVNQFNIE----DDETHVTPEVTSTPLDEMPKGDSQPDSAKTNSNITDEVINNETVLV
         ++       ++T N V++ D +S ++QFN +    DD+ +       + +++     S   + +++ +   E+  N   LV
Subjt:  RVFNIKSGTVMETIN-VVVNDFESNVNQFNIE----DDETHVTPEVTSTPLDEMPKGDSQPDSAKTNSNITDEVINNETVLV

Q07791 Transposon Ty2-DR3 Gag-Pol polyprotein6.2e-0721.28Show/hide
Query:  GPMQTESLRGKKYVLVVVDDYFIFTWVRFLKEKSD--TIKLCISLCVNLQREKGQKIIRVRSDHGKEFDNEDLNNFCQTKGIHHEFAAPITSQQNGVVER
        GP+         Y +   D+   F WV  L ++ +   + +  S+   ++ +   +++ ++ D G E+ N+ L+ F   +GI   +     S+ +GV ER
Subjt:  GPMQTESLRGKKYVLVVVDDYFIFTWVRFLKEKSD--TIKLCISLCVNLQREKGQKIIRVRSDHGKEFDNEDLNNFCQTKGIHHEFAAPITSQQNGVVER

Query:  KNRTLQEMARVMIHANNLPLNFLAEAVNTVCHIHNR-VTTRSGTTVTLYELWKGRKPNVKYFHIFGSTCYILADREYHRKWDVKSDQGIFLGYSHNNRAY
         NRTL    R ++H + LP +    AV     I N  V+ ++  +   +    G   ++     FG    I+ +     K   +   G  L  S N+  Y
Subjt:  KNRTLQEMARVMIHANNLPLNFLAEAVNTVCHIHNR-VTTRSGTTVTLYELWKGRKPNVKYFHIFGSTCYILADREYHRKWDVKSDQGIFLGYSHNNRAY

Query:  RVFNIKSGTVMETIN-VVVNDFESNVNQFNIE----DDETHVTPEVTSTPLDEMPKGDSQPDSAKTNSNITDEVINNETVLV
         ++       ++T N V++ D +S ++QFN +    DD+ +       + +++     S   + +++ +   E+  N   LV
Subjt:  RVFNIKSGTVMETIN-VVVNDFESNVNQFNIE----DDETHVTPEVTSTPLDEMPKGDSQPDSAKTNSNITDEVINNETVLV

Q12491 Transposon Ty2-B Gag-Pol polyprotein6.2e-0721.28Show/hide
Query:  GPMQTESLRGKKYVLVVVDDYFIFTWVRFLKEKSD--TIKLCISLCVNLQREKGQKIIRVRSDHGKEFDNEDLNNFCQTKGIHHEFAAPITSQQNGVVER
        GP+         Y +   D+   F WV  L ++ +   + +  S+   ++ +   +++ ++ D G E+ N+ L+ F   +GI   +     S+ +GV ER
Subjt:  GPMQTESLRGKKYVLVVVDDYFIFTWVRFLKEKSD--TIKLCISLCVNLQREKGQKIIRVRSDHGKEFDNEDLNNFCQTKGIHHEFAAPITSQQNGVVER

Query:  KNRTLQEMARVMIHANNLPLNFLAEAVNTVCHIHNR-VTTRSGTTVTLYELWKGRKPNVKYFHIFGSTCYILADREYHRKWDVKSDQGIFLGYSHNNRAY
         NRTL    R ++H + LP +    AV     I N  V+ ++  +   +    G   ++     FG    I+ +     K   +   G  L  S N+  Y
Subjt:  KNRTLQEMARVMIHANNLPLNFLAEAVNTVCHIHNR-VTTRSGTTVTLYELWKGRKPNVKYFHIFGSTCYILADREYHRKWDVKSDQGIFLGYSHNNRAY

Query:  RVFNIKSGTVMETIN-VVVNDFESNVNQFNIE----DDETHVTPEVTSTPLDEMPKGDSQPDSAKTNSNITDEVINNETVLV
         ++       ++T N V++ D +S ++QFN +    DD+ +       + +++     S   + +++ +   E+  N   LV
Subjt:  RVFNIKSGTVMETIN-VVVNDFESNVNQFNIE----DDETHVTPEVTSTPLDEMPKGDSQPDSAKTNSNITDEVINNETVLV

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTCCCATGCAAACTGAAAGTTTGAGGGGAAAGAAGTATGTGTTAGTTGTTGTGGATGACTACTTCATATTCACCTGGGTTCGGTTCTTAAAAGAAAAATCAGATAC
TATTAAACTATGTATCAGTCTATGTGTGAACTTGCAACGTGAGAAGGGGCAAAAGATAATCAGGGTCCGTAGTGATCATGGGAAGGAATTTGATAATGAAGATCTGAATA
ACTTCTGTCAGACTAAAGGAATTCATCATGAATTTGCAGCTCCCATAACTTCTCAACAAAATGGAGTAGTTGAACGGAAGAACAGAACGTTACAAGAAATGGCTCGAGTT
ATGATACATGCCAACAATTTGCCTTTGAATTTTTTGGCAGAAGCTGTAAACACAGTATGTCATATTCACAATAGGGTCACTACACGATCTGGTACGACAGTTACATTGTA
TGAATTATGGAAGGGACGGAAACCAAATGTTAAGTATTTTCATATTTTTGGAAGTACTTGTTACATTTTGGCCGATAGAGAGTATCATCGTAAGTGGGATGTGAAATCTG
ATCAAGGGATCTTTCTTGGTTATTCTCATAATAATCGAGCGTACAGAGTCTTTAATATTAAATCCGGAACAGTCATGGAAACAATCAATGTTGTGGTTAATGATTTTGAG
TCTAATGTCAATCAGTTTAATATTGAGGATGATGAGACCCATGTTACACCTGAAGTTACTTCTACTCCTCTTGATGAAATGCCTAAAGGTGATTCGCAGCCAGACAGTGC
TAAGACCAATTCAAACATAACTGATGAGGTCATAAACAATGAAACTGTGCTTGTCCCCTCTGCACATGTGAAAAAGAATCATCTATCAAGTTCCATAATAGGCGATCCGT
CAGCTGGAATTACTACCAAATGGAAAGAAAAGGTAGATTACACGAAAATGATTGCTGATTTATGTTATGTATCAGCAATAGAACCCACATCTGTTGAGAATGCTCTCAAG
GATGAATACTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGTCCCATGCAAACTGAAAGTTTGAGGGGAAAGAAGTATGTGTTAGTTGTTGTGGATGACTACTTCATATTCACCTGGGTTCGGTTCTTAAAAGAAAAATCAGATAC
TATTAAACTATGTATCAGTCTATGTGTGAACTTGCAACGTGAGAAGGGGCAAAAGATAATCAGGGTCCGTAGTGATCATGGGAAGGAATTTGATAATGAAGATCTGAATA
ACTTCTGTCAGACTAAAGGAATTCATCATGAATTTGCAGCTCCCATAACTTCTCAACAAAATGGAGTAGTTGAACGGAAGAACAGAACGTTACAAGAAATGGCTCGAGTT
ATGATACATGCCAACAATTTGCCTTTGAATTTTTTGGCAGAAGCTGTAAACACAGTATGTCATATTCACAATAGGGTCACTACACGATCTGGTACGACAGTTACATTGTA
TGAATTATGGAAGGGACGGAAACCAAATGTTAAGTATTTTCATATTTTTGGAAGTACTTGTTACATTTTGGCCGATAGAGAGTATCATCGTAAGTGGGATGTGAAATCTG
ATCAAGGGATCTTTCTTGGTTATTCTCATAATAATCGAGCGTACAGAGTCTTTAATATTAAATCCGGAACAGTCATGGAAACAATCAATGTTGTGGTTAATGATTTTGAG
TCTAATGTCAATCAGTTTAATATTGAGGATGATGAGACCCATGTTACACCTGAAGTTACTTCTACTCCTCTTGATGAAATGCCTAAAGGTGATTCGCAGCCAGACAGTGC
TAAGACCAATTCAAACATAACTGATGAGGTCATAAACAATGAAACTGTGCTTGTCCCCTCTGCACATGTGAAAAAGAATCATCTATCAAGTTCCATAATAGGCGATCCGT
CAGCTGGAATTACTACCAAATGGAAAGAAAAGGTAGATTACACGAAAATGATTGCTGATTTATGTTATGTATCAGCAATAGAACCCACATCTGTTGAGAATGCTCTCAAG
GATGAATACTAG
Protein sequenceShow/hide protein sequence
MGPMQTESLRGKKYVLVVVDDYFIFTWVRFLKEKSDTIKLCISLCVNLQREKGQKIIRVRSDHGKEFDNEDLNNFCQTKGIHHEFAAPITSQQNGVVERKNRTLQEMARV
MIHANNLPLNFLAEAVNTVCHIHNRVTTRSGTTVTLYELWKGRKPNVKYFHIFGSTCYILADREYHRKWDVKSDQGIFLGYSHNNRAYRVFNIKSGTVMETINVVVNDFE
SNVNQFNIEDDETHVTPEVTSTPLDEMPKGDSQPDSAKTNSNITDEVINNETVLVPSAHVKKNHLSSSIIGDPSAGITTKWKEKVDYTKMIADLCYVSAIEPTSVENALK
DEY