CuGenDBv2

Cmc08g0230381 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc08g0230381
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Gag-pol polyprotein
Genome location: CMiso1.1chr08:25914041..25915012
RNA-Seq Expression: Cmc08g0230381
Synteny: Cmc08g0230381
Gene Ontology terms:
  GO:0015074 - DNA integration (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
  IPR001584 - Integrase, catalytic core
  IPR012337 - Ribonuclease H-like superfamily
  IPR025724 - GAG-pre-integrase domain
  IPR036397 - Ribonuclease H superfamily
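The genome location above can be read as a sequence name plus a coordinate range. The following is a minimal sketch of how such a string might be parsed; the helper name `parse_location` is hypothetical, and the 1-based inclusive interpretation of the coordinates is an assumption, not something this page states.

```python
import re


def parse_location(loc: str) -> tuple[str, int, int]:
    """Split 'seqid:start..end' into (seqid, start, end).

    Assumption: both coordinates are 1-based and inclusive.
    """
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", loc)
    if match is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


seqid, start, end = parse_location("CMiso1.1chr08:25914041..25915012")

# Under the inclusive-coordinate assumption the span covers end - start + 1 bases.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the span is a multiple of three, which is what one would expect if the locus consists of a single uninterrupted coding region.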


Homology
GenBank top hits (e value, % identity, alignment)
KAA0025735.1 gag-pol polyprotein [Cucumis melo var. makuwa] (e value: 3.2e-183; % identity: 100)
Query:  MIWRVKTSEKCKVAFTTIQTPVDAWYFDSGCSRHMTGNRSFFTELEECTSVHVTFEDGAKGRIIAKGNINKSNLPCLNEVRYMDGLKANLISISQICDQG
        MIWRVKTSEKCKVAFTTIQTPVDAWYFDSGCSRHMTGNRSFFTELEECTSVHVTFEDGAKGRIIAKGNINKSNLPCLNEVRYMDGLKANLISISQICDQG
Subjt:  MIWRVKTSEKCKVAFTTIQTPVDAWYFDSGCSRHMTGNRSFFTELEECTSVHVTFEDGAKGRIIAKGNINKSNLPCLNEVRYMDGLKANLISISQICDQG

Query:  YSVNFNNTGCVVTDKNNQVFMSGRRQTDNCYHWSSNSSNICHLTKTDQTWLWHRKLGHISMRSLDKVIRNEAVVDIPSLDINGKFFCGDCQVGKKTKISH
        YSVNFNNTGCVVTDKNNQVFMSGRRQTDNCYHWSSNSSNICHLTKTDQTWLWHRKLGHISMRSLDKVIRNEAVVDIPSLDINGKFFCGDCQVGKKTKISH
Subjt:  YSVNFNNTGCVVTDKNNQVFMSGRRQTDNCYHWSSNSSNICHLTKTDQTWLWHRKLGHISMRSLDKVIRNEAVVDIPSLDINGKFFCGDCQVGKKTKISH

Query:  KSLKECYTIRVFELLHLDLMGLMQTESLGGKKYVLVVVDDYSRFTWVLFLKGKSDTVKLCISLCLNLQREKGKKIIRISSDHGKEFDNEDLNNFCQSKGI
        KSLKECYTIRVFELLHLDLMGLMQTESLGGKKYVLVVVDDYSRFTWVLFLKGKSDTVKLCISLCLNLQREKGKKIIRISSDHGKEFDNEDLNNFCQSKGI
Subjt:  KSLKECYTIRVFELLHLDLMGLMQTESLGGKKYVLVVVDDYSRFTWVLFLKGKSDTVKLCISLCLNLQREKGKKIIRISSDHGKEFDNEDLNNFCQSKGI

Query:  HHEFAAPITP
        HHEFAAPITP
Subjt:  HHEFAAPITP

KAA0036855.1 gag-pol polyprotein [Cucumis melo var. makuwa] (e value: 5.2e-149; % identity: 79.39)
Query:  KWNKSIRGTHMIWRVK---------TSEKCKVAFT-TIQTPVDAWYFDSGCSRHMTGNRSFFTELEECTSVHVTFEDGAKGRIIAKGNINKSNLPCLNEV
        +++ S RG  +   VK         T   CK   T T+Q  VDAWYFDSGCSRHMTGNRSFFTELEEC   H TF DGAKG+IIAKGNI+KSNLP LNEV
Subjt:  KWNKSIRGTHMIWRVK---------TSEKCKVAFT-TIQTPVDAWYFDSGCSRHMTGNRSFFTELEECTSVHVTFEDGAKGRIIAKGNINKSNLPCLNEV

Query:  RYMDGLKANLISISQICDQGYSVNFNNTGCVVTDKNNQVFMSGRRQTDNCYHWSSNSSNICHLTKTDQTWLWHRKLGHISMRSLDKVIRNEAVVDIPSLD
        RY+DGLKANLISISQ+CDQGYSVNFNNTG VVT+KNNQVFMSGRR+ +NCY+WSSN SNICHLTK DQTWLWHRKLGHIS+RSLDKVIRNEAVV IPSLD
Subjt:  RYMDGLKANLISISQICDQGYSVNFNNTGCVVTDKNNQVFMSGRRQTDNCYHWSSNSSNICHLTKTDQTWLWHRKLGHISMRSLDKVIRNEAVVDIPSLD

Query:  INGKFFCGDCQVGKKTKISHKSLKECYTIRVFELLHLDLMGLMQTESLGGKKYVLVVVDDYSRFTWVLFLKGKSDTVKLCISLCLNLQREKGKKIIRISS
        INGKFFCG+CQVGK+TK SH+ LKECYTI V ELLHL+LMG MQTESLGGKKYVLVVVDDYS+FTWV FLK K DTVKLCISLCLNLQREKG+KIIRI S
Subjt:  INGKFFCGDCQVGKKTKISHKSLKECYTIRVFELLHLDLMGLMQTESLGGKKYVLVVVDDYSRFTWVLFLKGKSDTVKLCISLCLNLQREKGKKIIRISS

Query:  DHGKEFDNEDLNNFCQSKGIHHEFAAPITP
        DHGKEFDNEDLNN CQ++GIHHEFAAPITP
Subjt:  DHGKEFDNEDLNNFCQSKGIHHEFAAPITP

KAA0059174.1 F5J5.1 [Cucumis melo var. makuwa] (e value: 3.2e-159; % identity: 83.44)
Query:  KWNKSIRGTHMIWRVKTSEKCKVAFTTIQTPVDAWYFDSGCSRHMTGNRSFFTELEECTSVHVTFEDGAKGRIIAKGNINKSNLPCLNEVRYMDGLKANL
        K N ++RGTHMIWRVKTSEKC VAFT +QT VDAWYFDSGCSRHMT NRSFFTELEEC S HV F+DGAKG+IIAKGNI+KSNLPCLN+VRY+DGLK NL
Subjt:  KWNKSIRGTHMIWRVKTSEKCKVAFTTIQTPVDAWYFDSGCSRHMTGNRSFFTELEECTSVHVTFEDGAKGRIIAKGNINKSNLPCLNEVRYMDGLKANL

Query:  ISISQICDQGYSVNFNNTGCVVTDKNNQVFMSGRRQTDNCYHWSSNSSNICHLTKTDQTWLWHRKLGHISMRSLDKVIRNEAVVDIPSLDINGKFFCGDC
        IS SQ+CDQGYSVNFNNTGCVVT+KNNQVF+SG R+ DNCYHWSSN SNICHLTK  QTWLWHRKLGHIS+RSLDKVIRNEA++ IPSLDINGKFFCGDC
Subjt:  ISISQICDQGYSVNFNNTGCVVTDKNNQVFMSGRRQTDNCYHWSSNSSNICHLTKTDQTWLWHRKLGHISMRSLDKVIRNEAVVDIPSLDINGKFFCGDC

Query:  QVGKKTKISHKSLKECYTIRVFELLHLDLMGLMQTESLGGKKYVLVVVDDYSRFTWVLFLKGKSDTVKLCISLCLNLQREKGKKIIRISSDHGKEFDNED
        QVGK+TK SH+ L ECYTI   ELLHLDL+ LMQ ESLGGKKYV VVVDDYSRFTWV FLK KSD VKLCISLCLNLQREKG+KIIRI SDHGK+FDNE+
Subjt:  QVGKKTKISHKSLKECYTIRVFELLHLDLMGLMQTESLGGKKYVLVVVDDYSRFTWVLFLKGKSDTVKLCISLCLNLQREKGKKIIRISSDHGKEFDNED

Query:  LNNFCQSKGIHHEFAAPITP
        LNNFCQ++GIHHEFAAPITP
Subjt:  LNNFCQSKGIHHEFAAPITP

TYK19345.1 F5J5.1 [Cucumis melo var. makuwa] (e value: 4.7e-158; % identity: 83.12)
Query:  KWNKSIRGTHMIWRVKTSEKCKVAFTTIQTPVDAWYFDSGCSRHMTGNRSFFTELEECTSVHVTFEDGAKGRIIAKGNINKSNLPCLNEVRYMDGLKANL
        K N ++RGTHMIWRVKTSEKC VAFT +QT VDAWYFDSGCSRHMT NRSFFTELEEC S HV F+DGAKG+IIAKGNI+KSNLP LN+VRY+DGLK NL
Subjt:  KWNKSIRGTHMIWRVKTSEKCKVAFTTIQTPVDAWYFDSGCSRHMTGNRSFFTELEECTSVHVTFEDGAKGRIIAKGNINKSNLPCLNEVRYMDGLKANL

Query:  ISISQICDQGYSVNFNNTGCVVTDKNNQVFMSGRRQTDNCYHWSSNSSNICHLTKTDQTWLWHRKLGHISMRSLDKVIRNEAVVDIPSLDINGKFFCGDC
        IS SQ+CDQGYSVNFNNTGCVVT+KNNQVF+SG R+ DNCYHWSSN SNICHLTK  QTWLWHRKLGHIS+RSLDKVIRNEA++ IPSLDINGKFFCGDC
Subjt:  ISISQICDQGYSVNFNNTGCVVTDKNNQVFMSGRRQTDNCYHWSSNSSNICHLTKTDQTWLWHRKLGHISMRSLDKVIRNEAVVDIPSLDINGKFFCGDC

Query:  QVGKKTKISHKSLKECYTIRVFELLHLDLMGLMQTESLGGKKYVLVVVDDYSRFTWVLFLKGKSDTVKLCISLCLNLQREKGKKIIRISSDHGKEFDNED
        QVGK+TK SH+ L ECYTI   ELLHLDL+ LMQ ESLGGKKYV VVVDDYSRFTWV FLK KSD VKLCISLCLNLQREKG+KIIRI SDHGK+FDNE+
Subjt:  QVGKKTKISHKSLKECYTIRVFELLHLDLMGLMQTESLGGKKYVLVVVDDYSRFTWVLFLKGKSDTVKLCISLCLNLQREKGKKIIRISSDHGKEFDNED

Query:  LNNFCQSKGIHHEFAAPITP
        LNNFCQ++GIHHEFAAPITP
Subjt:  LNNFCQSKGIHHEFAAPITP

TYK26041.1 gag/pol polyprotein [Cucumis melo var. makuwa] (e value: 6.1e-158; % identity: 84.98)
Query:  GTHMIWRVKTSEKCKVAFTTIQTPVDAWYFDSGCSRHMTGNRSFFTELEECTSVHVTFEDGAKGRIIAKGNINKSNLPCLNEVRYMDGLKANLISISQIC
        GTH+IWRVKTS+KC VAF T+QT VDAWYFDSGCSRHMTGNRSFFTELEEC S HVTF DGAKG+IIAKGN++KSNLP +NEVRY+DGLK NLIS+SQ+C
Subjt:  GTHMIWRVKTSEKCKVAFTTIQTPVDAWYFDSGCSRHMTGNRSFFTELEECTSVHVTFEDGAKGRIIAKGNINKSNLPCLNEVRYMDGLKANLISISQIC

Query:  DQGYSVNFNNTGCVVTDKNNQVFMSGRRQTDNCYHWSSNSSNICHLTKTDQTWLWHRKLGHISMRSLDKVIRNEAVVDIPSLDINGKFFCGDCQVGKKTK
        DQGYSVNFNNT CV TDKNNQVF+SGRR+ +NC HWSSN SNICHLTK DQTWLWHRKLGHIS+RSLDKVIRN+AVV IPSLDINGKFFCGDC+VGK+TK
Subjt:  DQGYSVNFNNTGCVVTDKNNQVFMSGRRQTDNCYHWSSNSSNICHLTKTDQTWLWHRKLGHISMRSLDKVIRNEAVVDIPSLDINGKFFCGDCQVGKKTK

Query:  ISHKSLKECYTIRVFELLHLDLMGLMQTESLGGKKYVLVVVDDYSRFTWVLFLKGKSDTVKLCISLCLNLQREKGKKIIRISSDHGKEFDNEDLNNFCQS
        ISH+ LKECYTIRV ELLHLDL+G M+TESLG KKYVLVVVDDYSRFT V FLKGKSDTVKLCISL LNLQREKG+KIIRI SDHGKEFDNEDLNNFCQ+
Subjt:  ISHKSLKECYTIRVFELLHLDLMGLMQTESLGGKKYVLVVVDDYSRFTWVLFLKGKSDTVKLCISLCLNLQREKGKKIIRISSDHGKEFDNEDLNNFCQS

Query:  KGIHHEFAAPITP
        +GIHHEF APITP
Subjt:  KGIHHEFAAPITP

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SMR2 Gag-pol polyprotein (e value: 1.6e-183; % identity: 100)
Query:  MIWRVKTSEKCKVAFTTIQTPVDAWYFDSGCSRHMTGNRSFFTELEECTSVHVTFEDGAKGRIIAKGNINKSNLPCLNEVRYMDGLKANLISISQICDQG
        MIWRVKTSEKCKVAFTTIQTPVDAWYFDSGCSRHMTGNRSFFTELEECTSVHVTFEDGAKGRIIAKGNINKSNLPCLNEVRYMDGLKANLISISQICDQG
Subjt:  MIWRVKTSEKCKVAFTTIQTPVDAWYFDSGCSRHMTGNRSFFTELEECTSVHVTFEDGAKGRIIAKGNINKSNLPCLNEVRYMDGLKANLISISQICDQG

Query:  YSVNFNNTGCVVTDKNNQVFMSGRRQTDNCYHWSSNSSNICHLTKTDQTWLWHRKLGHISMRSLDKVIRNEAVVDIPSLDINGKFFCGDCQVGKKTKISH
        YSVNFNNTGCVVTDKNNQVFMSGRRQTDNCYHWSSNSSNICHLTKTDQTWLWHRKLGHISMRSLDKVIRNEAVVDIPSLDINGKFFCGDCQVGKKTKISH
Subjt:  YSVNFNNTGCVVTDKNNQVFMSGRRQTDNCYHWSSNSSNICHLTKTDQTWLWHRKLGHISMRSLDKVIRNEAVVDIPSLDINGKFFCGDCQVGKKTKISH

Query:  KSLKECYTIRVFELLHLDLMGLMQTESLGGKKYVLVVVDDYSRFTWVLFLKGKSDTVKLCISLCLNLQREKGKKIIRISSDHGKEFDNEDLNNFCQSKGI
        KSLKECYTIRVFELLHLDLMGLMQTESLGGKKYVLVVVDDYSRFTWVLFLKGKSDTVKLCISLCLNLQREKGKKIIRISSDHGKEFDNEDLNNFCQSKGI
Subjt:  KSLKECYTIRVFELLHLDLMGLMQTESLGGKKYVLVVVDDYSRFTWVLFLKGKSDTVKLCISLCLNLQREKGKKIIRISSDHGKEFDNEDLNNFCQSKGI

Query:  HHEFAAPITP
        HHEFAAPITP
Subjt:  HHEFAAPITP

A0A5A7UVR7 F5J5.1 (e value: 1.6e-159; % identity: 83.44)
Query:  KWNKSIRGTHMIWRVKTSEKCKVAFTTIQTPVDAWYFDSGCSRHMTGNRSFFTELEECTSVHVTFEDGAKGRIIAKGNINKSNLPCLNEVRYMDGLKANL
        K N ++RGTHMIWRVKTSEKC VAFT +QT VDAWYFDSGCSRHMT NRSFFTELEEC S HV F+DGAKG+IIAKGNI+KSNLPCLN+VRY+DGLK NL
Subjt:  KWNKSIRGTHMIWRVKTSEKCKVAFTTIQTPVDAWYFDSGCSRHMTGNRSFFTELEECTSVHVTFEDGAKGRIIAKGNINKSNLPCLNEVRYMDGLKANL

Query:  ISISQICDQGYSVNFNNTGCVVTDKNNQVFMSGRRQTDNCYHWSSNSSNICHLTKTDQTWLWHRKLGHISMRSLDKVIRNEAVVDIPSLDINGKFFCGDC
        IS SQ+CDQGYSVNFNNTGCVVT+KNNQVF+SG R+ DNCYHWSSN SNICHLTK  QTWLWHRKLGHIS+RSLDKVIRNEA++ IPSLDINGKFFCGDC
Subjt:  ISISQICDQGYSVNFNNTGCVVTDKNNQVFMSGRRQTDNCYHWSSNSSNICHLTKTDQTWLWHRKLGHISMRSLDKVIRNEAVVDIPSLDINGKFFCGDC

Query:  QVGKKTKISHKSLKECYTIRVFELLHLDLMGLMQTESLGGKKYVLVVVDDYSRFTWVLFLKGKSDTVKLCISLCLNLQREKGKKIIRISSDHGKEFDNED
        QVGK+TK SH+ L ECYTI   ELLHLDL+ LMQ ESLGGKKYV VVVDDYSRFTWV FLK KSD VKLCISLCLNLQREKG+KIIRI SDHGK+FDNE+
Subjt:  QVGKKTKISHKSLKECYTIRVFELLHLDLMGLMQTESLGGKKYVLVVVDDYSRFTWVLFLKGKSDTVKLCISLCLNLQREKGKKIIRISSDHGKEFDNED

Query:  LNNFCQSKGIHHEFAAPITP
        LNNFCQ++GIHHEFAAPITP
Subjt:  LNNFCQSKGIHHEFAAPITP

A0A5D3BA69 Gag-pol polyprotein (e value: 2.5e-149; % identity: 79.39)
Query:  KWNKSIRGTHMIWRVK---------TSEKCKVAFT-TIQTPVDAWYFDSGCSRHMTGNRSFFTELEECTSVHVTFEDGAKGRIIAKGNINKSNLPCLNEV
        +++ S RG  +   VK         T   CK   T T+Q  VDAWYFDSGCSRHMTGNRSFFTELEEC   H TF DGAKG+IIAKGNI+KSNLP LNEV
Subjt:  KWNKSIRGTHMIWRVK---------TSEKCKVAFT-TIQTPVDAWYFDSGCSRHMTGNRSFFTELEECTSVHVTFEDGAKGRIIAKGNINKSNLPCLNEV

Query:  RYMDGLKANLISISQICDQGYSVNFNNTGCVVTDKNNQVFMSGRRQTDNCYHWSSNSSNICHLTKTDQTWLWHRKLGHISMRSLDKVIRNEAVVDIPSLD
        RY+DGLKANLISISQ+CDQGYSVNFNNTG VVT+KNNQVFMSGRR+ +NCY+WSSN SNICHLTK DQTWLWHRKLGHIS+RSLDKVIRNEAVV IPSLD
Subjt:  RYMDGLKANLISISQICDQGYSVNFNNTGCVVTDKNNQVFMSGRRQTDNCYHWSSNSSNICHLTKTDQTWLWHRKLGHISMRSLDKVIRNEAVVDIPSLD

Query:  INGKFFCGDCQVGKKTKISHKSLKECYTIRVFELLHLDLMGLMQTESLGGKKYVLVVVDDYSRFTWVLFLKGKSDTVKLCISLCLNLQREKGKKIIRISS
        INGKFFCG+CQVGK+TK SH+ LKECYTI V ELLHL+LMG MQTESLGGKKYVLVVVDDYS+FTWV FLK K DTVKLCISLCLNLQREKG+KIIRI S
Subjt:  INGKFFCGDCQVGKKTKISHKSLKECYTIRVFELLHLDLMGLMQTESLGGKKYVLVVVDDYSRFTWVLFLKGKSDTVKLCISLCLNLQREKGKKIIRISS

Query:  DHGKEFDNEDLNNFCQSKGIHHEFAAPITP
        DHGKEFDNEDLNN CQ++GIHHEFAAPITP
Subjt:  DHGKEFDNEDLNNFCQSKGIHHEFAAPITP

A0A5D3D755 F5J5.1 (e value: 2.3e-158; % identity: 83.12)
Query:  KWNKSIRGTHMIWRVKTSEKCKVAFTTIQTPVDAWYFDSGCSRHMTGNRSFFTELEECTSVHVTFEDGAKGRIIAKGNINKSNLPCLNEVRYMDGLKANL
        K N ++RGTHMIWRVKTSEKC VAFT +QT VDAWYFDSGCSRHMT NRSFFTELEEC S HV F+DGAKG+IIAKGNI+KSNLP LN+VRY+DGLK NL
Subjt:  KWNKSIRGTHMIWRVKTSEKCKVAFTTIQTPVDAWYFDSGCSRHMTGNRSFFTELEECTSVHVTFEDGAKGRIIAKGNINKSNLPCLNEVRYMDGLKANL

Query:  ISISQICDQGYSVNFNNTGCVVTDKNNQVFMSGRRQTDNCYHWSSNSSNICHLTKTDQTWLWHRKLGHISMRSLDKVIRNEAVVDIPSLDINGKFFCGDC
        IS SQ+CDQGYSVNFNNTGCVVT+KNNQVF+SG R+ DNCYHWSSN SNICHLTK  QTWLWHRKLGHIS+RSLDKVIRNEA++ IPSLDINGKFFCGDC
Subjt:  ISISQICDQGYSVNFNNTGCVVTDKNNQVFMSGRRQTDNCYHWSSNSSNICHLTKTDQTWLWHRKLGHISMRSLDKVIRNEAVVDIPSLDINGKFFCGDC

Query:  QVGKKTKISHKSLKECYTIRVFELLHLDLMGLMQTESLGGKKYVLVVVDDYSRFTWVLFLKGKSDTVKLCISLCLNLQREKGKKIIRISSDHGKEFDNED
        QVGK+TK SH+ L ECYTI   ELLHLDL+ LMQ ESLGGKKYV VVVDDYSRFTWV FLK KSD VKLCISLCLNLQREKG+KIIRI SDHGK+FDNE+
Subjt:  QVGKKTKISHKSLKECYTIRVFELLHLDLMGLMQTESLGGKKYVLVVVDDYSRFTWVLFLKGKSDTVKLCISLCLNLQREKGKKIIRISSDHGKEFDNED

Query:  LNNFCQSKGIHHEFAAPITP
        LNNFCQ++GIHHEFAAPITP
Subjt:  LNNFCQSKGIHHEFAAPITP

A0A5D3DQT9 Gag/pol polyprotein (e value: 3.0e-158; % identity: 84.98)
Query:  GTHMIWRVKTSEKCKVAFTTIQTPVDAWYFDSGCSRHMTGNRSFFTELEECTSVHVTFEDGAKGRIIAKGNINKSNLPCLNEVRYMDGLKANLISISQIC
        GTH+IWRVKTS+KC VAF T+QT VDAWYFDSGCSRHMTGNRSFFTELEEC S HVTF DGAKG+IIAKGN++KSNLP +NEVRY+DGLK NLIS+SQ+C
Subjt:  GTHMIWRVKTSEKCKVAFTTIQTPVDAWYFDSGCSRHMTGNRSFFTELEECTSVHVTFEDGAKGRIIAKGNINKSNLPCLNEVRYMDGLKANLISISQIC

Query:  DQGYSVNFNNTGCVVTDKNNQVFMSGRRQTDNCYHWSSNSSNICHLTKTDQTWLWHRKLGHISMRSLDKVIRNEAVVDIPSLDINGKFFCGDCQVGKKTK
        DQGYSVNFNNT CV TDKNNQVF+SGRR+ +NC HWSSN SNICHLTK DQTWLWHRKLGHIS+RSLDKVIRN+AVV IPSLDINGKFFCGDC+VGK+TK
Subjt:  DQGYSVNFNNTGCVVTDKNNQVFMSGRRQTDNCYHWSSNSSNICHLTKTDQTWLWHRKLGHISMRSLDKVIRNEAVVDIPSLDINGKFFCGDCQVGKKTK

Query:  ISHKSLKECYTIRVFELLHLDLMGLMQTESLGGKKYVLVVVDDYSRFTWVLFLKGKSDTVKLCISLCLNLQREKGKKIIRISSDHGKEFDNEDLNNFCQS
        ISH+ LKECYTIRV ELLHLDL+G M+TESLG KKYVLVVVDDYSRFT V FLKGKSDTVKLCISL LNLQREKG+KIIRI SDHGKEFDNEDLNNFCQ+
Subjt:  ISHKSLKECYTIRVFELLHLDLMGLMQTESLGGKKYVLVVVDDYSRFTWVLFLKGKSDTVKLCISLCLNLQREKGKKIIRISSDHGKEFDNEDLNNFCQS

Query:  KGIHHEFAAPITP
        +GIHHEF APITP
Subjt:  KGIHHEFAAPITP

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein (e value: 2.0e-18; % identity: 23.73)
Query:  DSGCSRHMTGNRSFFTE-------LEECTSVHVTFEDGAKGRIIAKGNINKSNLPCLNEVRYMDGLKANLISISQICDQGYSVNFNNTGCVVTDKNNQVF
        DSG S H+  + S +T+       L+   +    F    K  I+   N ++     L +V +      NL+S+ ++ + G S+ F+ +G  ++     V 
Subjt:  DSGCSRHMTGNRSFFTE-------LEECTSVHVTFEDGAKGRIIAKGNINKSNLPCLNEVRYMDGLKANLISISQICDQGYSVNFNNTGCVVTDKNNQVF

Query:  M-SGRRQTDNCYHWSSNSSNICHLTKTDQTWLWHRKLGHISMRSLDKVIRNEAVVD---IPSLDINGKFFCGDCQVGKKTKISHKSLKE-CYTIRVFELL
          SG        ++ + S N  H    +   LWH + GHIS   L ++ R     D   + +L+++ +  C  C  GK+ ++  K LK+  +  R   ++
Subjt:  M-SGRRQTDNCYHWSSNSSNICHLTKTDQTWLWHRKLGHISMRSLDKVIRNEAVVD---IPSLDINGKFFCGDCQVGKKTKISHKSLKE-CYTIRVFELL

Query:  HLDLMGLMQTESLGGKKYVLVVVDDYSRFTWVLFLKGKSDTVKLCISLCLNLQREKGKKIIRISSDHGKEFDNEDLNNFCQSKGIHHEFAAPITP
        H D+ G +   +L  K Y ++ VD ++ +     +K KSD   +        +     K++ +  D+G+E+ + ++  FC  KGI +    P TP
Subjt:  HLDLMGLMQTESLGGKKYVLVVVDDYSRFTWVLFLKGKSDTVKLCISLCLNLQREKGKKIIRISSDHGKEFDNEDLNNFCQSKGIHHEFAAPITP

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 7.5e-26; % identity: 27.57)
Query:  PVDAWYFDSGCSRHMTGNRSFFTELEECTSVHVTFEDGAKGRIIAKGNI-NKSNLPC---LNEVRYMDGLKANLISISQICDQGYSVNFNNTGCVVTDKN
        P   W  D+  S H T  R  F          V   + +  +I   G+I  K+N+ C   L +VR++  L+ NLIS   +   GY   F N    +T K 
Subjt:  PVDAWYFDSGCSRHMTGNRSFFTELEECTSVHVTFEDGAKGRIIAKGNI-NKSNLPC---LNEVRYMDGLKANLISISQICDQGYSVNFNNTGCVVTDKN

Query:  NQVFMSG-------RRQTDNCYHWSSNSSNICHLTKTDQTWLWHRKLGHISMRSLDKVIRNEAVVDIPSLDINGKFFCGDCQVGKKTKISHKSLKECYTI
        + V   G       R   + C    + + +   +       LWH+++GH+S + L  + +   +       +     C  C  GK+ ++S ++  E   +
Subjt:  NQVFMSG-------RRQTDNCYHWSSNSSNICHLTKTDQTWLWHRKLGHISMRSLDKVIRNEAVVDIPSLDINGKFFCGDCQVGKKTKISHKSLKECYTI

Query:  RVFELLHLDLMGLMQTESLGGKKYVLVVVDDYSRFTWVLFLKGKSDTVKLCISLCLNLQREKGKKIIRISSDHGKEFDNEDLNNFCQSKGIHHEFAAPIT
         + +L++ D+ G M+ ES+GG KY +  +DD SR  WV  LK K    ++       ++RE G+K+ R+ SD+G E+ + +   +C S GI HE   P T
Subjt:  RVFELLHLDLMGLMQTESLGGKKYVLVVVDDYSRFTWVLFLKGKSDTVKLCISLCLNLQREKGKKIIRISSDHGKEFDNEDLNNFCQSKGIHHEFAAPIT

Query:  P
        P
Subjt:  P

P25384 Transposon Ty2-C Gag-Pol polyprotein (e value: 1.2e-12; % identity: 28.3)
Query:  LWHRKLGHISMRSLDKVIRNEAVVDIPSLDIN----GKFFCGDCQVGKKTKISH---KSLKECYTIRVFELLHLDLMGLMQTESLGGKKYVLVVVDDYSR
        L HR LGH + RS+ K ++  AV  +   DI       + C DC +GK TK  H     LK   +   F+ LH D+ G +         Y +   D+ +R
Subjt:  LWHRKLGHISMRSLDKVIRNEAVVDIPSLDIN----GKFFCGDCQVGKKTKISH---KSLKECYTIRVFELLHLDLMGLMQTESLGGKKYVLVVVDDYSR

Query:  FTWV--LFLKGKSDTVKLCISLCLNLQREKGKKIIRISSDHGKEFDNEDLNNFCQSKGI
        F WV  L  + +   + +  S+   ++ +   +++ I  D G E+ N+ L+ F  ++GI
Subjt:  FTWV--LFLKGKSDTVKLCISLCLNLQREKGKKIIRISSDHGKEFDNEDLNNFCQSKGI

Q12472 Transposon Ty2-DR1 Gag-Pol polyprotein (e value: 1.2e-12; % identity: 28.3)
Query:  LWHRKLGHISMRSLDKVIRNEAVVDIPSLDIN----GKFFCGDCQVGKKTKISH---KSLKECYTIRVFELLHLDLMGLMQTESLGGKKYVLVVVDDYSR
        L HR LGH + RS+ K ++  AV  +   DI       + C DC +GK TK  H     LK   +   F+ LH D+ G +         Y +   D+ +R
Subjt:  LWHRKLGHISMRSLDKVIRNEAVVDIPSLDIN----GKFFCGDCQVGKKTKISH---KSLKECYTIRVFELLHLDLMGLMQTESLGGKKYVLVVVDDYSR

Query:  FTWV--LFLKGKSDTVKLCISLCLNLQREKGKKIIRISSDHGKEFDNEDLNNFCQSKGI
        F WV  L  + +   + +  S+   ++ +   +++ I  D G E+ N+ L+ F  ++GI
Subjt:  FTWV--LFLKGKSDTVKLCISLCLNLQREKGKKIIRISSDHGKEFDNEDLNNFCQSKGI

Q12491 Transposon Ty2-B Gag-Pol polyprotein (e value: 1.2e-12; % identity: 28.3)
Query:  LWHRKLGHISMRSLDKVIRNEAVVDIPSLDIN----GKFFCGDCQVGKKTKISH---KSLKECYTIRVFELLHLDLMGLMQTESLGGKKYVLVVVDDYSR
        L HR LGH + RS+ K ++  AV  +   DI       + C DC +GK TK  H     LK   +   F+ LH D+ G +         Y +   D+ +R
Subjt:  LWHRKLGHISMRSLDKVIRNEAVVDIPSLDIN----GKFFCGDCQVGKKTKISH---KSLKECYTIRVFELLHLDLMGLMQTESLGGKKYVLVVVDDYSR

Query:  FTWV--LFLKGKSDTVKLCISLCLNLQREKGKKIIRISSDHGKEFDNEDLNNFCQSKGI
        F WV  L  + +   + +  S+   ++ +   +++ I  D G E+ N+ L+ F  ++GI
Subjt:  FTWV--LFLKGKSDTVKLCISLCLNLQREKGKKIIRISSDHGKEFDNEDLNNFCQSKGI

Arabidopsis top hits (e value, % identity, alignment)
AT3G20980.1 Gag-Pol-related retrotransposon family protein (e value: 3.4e-05; % identity: 30.69)
Query:  WYFDSGCSRHMTGNRSFFTELEECTSVHVTFEDGAKGR-----IIAKGNI----NKSNLPCLNEVRYMDGLKANLISISQICDQGYSVNF-NNTGCVVTD
        W   S  S HMT +  FFT L+      V F  G K       +   G++    N+ N   +  V Y+ G++ N +S+SQ+   G+ V+    TGC V D
Subjt:  WYFDSGCSRHMTGNRSFFTELEECTSVHVTFEDGAKGR-----IIAKGNI----NKSNLPCLNEVRYMDGLKANLISISQICDQGYSVNF-NNTGCVVTD

Query:  K
        +
Subjt:  K
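The % identity values listed with each hit can be related to the alignment blocks. In the blocks on this page the Subjt rows repeat the Query text, so the sketch below works from the unlabeled middle row alone. It assumes the usual BLAST-style convention in which a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap, and that identity is taken over the full alignment length. The function name `percent_identity` and the sample string are hypothetical.

```python
def percent_identity(midline: str) -> float:
    """Return the share of alignment columns that are exact matches.

    Assumption: `midline` is the middle row of one alignment with its
    leading indent removed and the same length as the aligned query row,
    where letters mark identities and '+' or ' ' mark non-identities.
    """
    if not midline:
        raise ValueError("empty alignment")
    identical = sum(ch.isalpha() for ch in midline)
    return round(100.0 * identical / len(midline), 2)


# Hypothetical six-column alignment: four identities, one '+', one blank.
example = percent_identity("MI+R K")
print(example)
```

For a hit whose alignment spans several blocks, the middle rows would be concatenated, with trailing spaces preserved, before calling the function.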


Sequences
CDS sequence
ATGTTTGCCAAATGGAACAAAAGTATAAGGGGAACTCACATGATTTGGAGAGTGAAGACTTCTGAGAAGTGCAAGGTTGCATTTACAACAATTCAAACCCCTGTTGATGC
TTGGTACTTTGACAGTGGATGCTCAAGACATATGACTGGCAATCGATCATTCTTTACTGAATTAGAGGAATGCACCTCAGTTCATGTTACTTTTGAAGATGGAGCCAAAG
GAAGAATTATTGCAAAAGGAAACATTAATAAAAGTAATCTACCCTGTCTTAATGAAGTTAGATACATGGATGGACTGAAGGCAAACTTGATTAGTATAAGTCAAATATGT
GACCAAGGATACAGTGTAAACTTTAACAACACTGGTTGTGTAGTTACAGACAAAAATAATCAAGTGTTTATGAGTGGCAGACGACAAACAGATAACTGTTATCATTGGAG
CTCAAATAGCTCAAACATATGTCACTTAACTAAAACTGATCAAACCTGGTTGTGGCATAGGAAATTGGGACACATTAGCATGAGAAGCTTAGATAAAGTTATCAGAAACG
AGGCTGTTGTAGACATTCCTTCTTTAGATATCAATGGAAAATTCTTTTGTGGTGATTGTCAAGTTGGAAAGAAAACTAAAATTTCTCACAAAAGTTTAAAGGAATGTTAT
ACAATTAGAGTCTTTGAACTTCTACATCTTGATCTTATGGGTCTCATGCAAACTGAAAGTTTGGGTGGAAAGAAGTATGTGTTGGTTGTTGTGGATGACTACTCCAGATT
TACTTGGGTTCTGTTCTTAAAAGGAAAATCAGATACTGTTAAATTATGTATCAGTCTATGTTTGAATTTGCAACGTGAAAAAGGGAAAAAGATAATCAGGATCAGTAGTG
ATCATGGGAAGGAGTTTGATAATGAAGATCTTAATAATTTCTGTCAATCGAAAGGAATCCATCATGAATTTGCTGCTCCCATAACACCTTAG
mRNA sequence
ATGTTTGCCAAATGGAACAAAAGTATAAGGGGAACTCACATGATTTGGAGAGTGAAGACTTCTGAGAAGTGCAAGGTTGCATTTACAACAATTCAAACCCCTGTTGATGC
TTGGTACTTTGACAGTGGATGCTCAAGACATATGACTGGCAATCGATCATTCTTTACTGAATTAGAGGAATGCACCTCAGTTCATGTTACTTTTGAAGATGGAGCCAAAG
GAAGAATTATTGCAAAAGGAAACATTAATAAAAGTAATCTACCCTGTCTTAATGAAGTTAGATACATGGATGGACTGAAGGCAAACTTGATTAGTATAAGTCAAATATGT
GACCAAGGATACAGTGTAAACTTTAACAACACTGGTTGTGTAGTTACAGACAAAAATAATCAAGTGTTTATGAGTGGCAGACGACAAACAGATAACTGTTATCATTGGAG
CTCAAATAGCTCAAACATATGTCACTTAACTAAAACTGATCAAACCTGGTTGTGGCATAGGAAATTGGGACACATTAGCATGAGAAGCTTAGATAAAGTTATCAGAAACG
AGGCTGTTGTAGACATTCCTTCTTTAGATATCAATGGAAAATTCTTTTGTGGTGATTGTCAAGTTGGAAAGAAAACTAAAATTTCTCACAAAAGTTTAAAGGAATGTTAT
ACAATTAGAGTCTTTGAACTTCTACATCTTGATCTTATGGGTCTCATGCAAACTGAAAGTTTGGGTGGAAAGAAGTATGTGTTGGTTGTTGTGGATGACTACTCCAGATT
TACTTGGGTTCTGTTCTTAAAAGGAAAATCAGATACTGTTAAATTATGTATCAGTCTATGTTTGAATTTGCAACGTGAAAAAGGGAAAAAGATAATCAGGATCAGTAGTG
ATCATGGGAAGGAGTTTGATAATGAAGATCTTAATAATTTCTGTCAATCGAAAGGAATCCATCATGAATTTGCTGCTCCCATAACACCTTAG
Protein sequence
MFAKWNKSIRGTHMIWRVKTSEKCKVAFTTIQTPVDAWYFDSGCSRHMTGNRSFFTELEECTSVHVTFEDGAKGRIIAKGNINKSNLPCLNEVRYMDGLKANLISISQIC
DQGYSVNFNNTGCVVTDKNNQVFMSGRRQTDNCYHWSSNSSNICHLTKTDQTWLWHRKLGHISMRSLDKVIRNEAVVDIPSLDINGKFFCGDCQVGKKTKISHKSLKECY
TIRVFELLHLDLMGLMQTESLGGKKYVLVVVDDYSRFTWVLFLKGKSDTVKLCISLCLNLQREKGKKIIRISSDHGKEFDNEDLNNFCQSKGIHHEFAAPITP
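The relationship between the CDS and protein sequences above can be checked by translating codons. This is a minimal sketch assuming the standard genetic code, which this page does not state explicitly; `CODON_TABLE` and `translate` are illustrative names, not part of any database tooling.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    cds = cds.upper()
    residues = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First six codons and last four codons of the CDS sequence listed above.
head = translate("ATGTTTGCCAAATGGAAC")
tail = translate("ATAACACCTTAG")
print(head, tail)
```

The translated start and end agree with the first residues (MFAKWN) and last residues (ITP) of the protein sequence listed above. Passing the full CDS, joined into one string without line breaks, to `translate` would allow the whole protein to be compared the same way.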