CuGenDBv2

Cmc11g0302511 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc11g0302511
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Gag/pol polyprotein
Genome location: CMiso1.1chr11:23467216..23468181
RNA-Seq Expression: Cmc11g0302511
Synteny: Cmc11g0302511
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
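
The genome location in this record is written as a sequence name followed by `start..end`. The following is a minimal sketch of reading that string and computing the span length. It assumes the coordinates are 1-based and inclusive, which the record itself does not state; the function name `span_length` is hypothetical and not part of any CuGenDB tooling.

```python
import re


def span_length(location: str) -> int:
    """Return the length of a 'seqid:start..end' span.

    Assumption: coordinates are 1-based and inclusive, so the
    length is end - start + 1.
    """
    match = re.fullmatch(r"(.+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    start, end = int(match.group(2)), int(match.group(3))
    return abs(end - start) + 1


print(span_length("CMiso1.1chr11:23467216..23468181"))  # 966 under the inclusive assumption
```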


Homology
GenBank top hits | e value | % identity | Alignment
KAA0025735.1 gag-pol polyprotein [Cucumis melo var. makuwa] | 4.0e-133 | 70.59
Query:  VKTSENCKIAFTTVQTTNDVWYFDSGCSRHMTGKRSFLTELNECVVGHVTFGDEAKGRIVVKGNINNNNLPCLNDVRYVDGLKANLISVSQLCDQGYSVN
        VKTSE CK+AFTT+QT  D WYFDSGCSRHMTG RSF TEL EC   HVTF D AKGRI+ KGNIN +NLPCLN+VRY+DGLKANLIS+SQ+CDQGYSVN
Subjt:  VKTSENCKIAFTTVQTTNDVWYFDSGCSRHMTGKRSFLTELNECVVGHVTFGDEAKGRIVVKGNINNNNLPCLNDVRYVDGLKANLISVSQLCDQGYSVN

Query:  FSKNSCVVLNEDNQILMNGYRQANNCYHWISNNSDVCHSTKEDQTWLWHRKLGHINLKSIDRAVKNEAVIGVPKIDVNSKFVCGDCQIGKQTKASHKSLK
        F+   CVV +++NQ+ M+G RQ +NCYHW SN+S++CH TK DQTWLWHRKLGHI+++S+D+ ++NEAV+ +P +D+N KF CGDCQ+GK+TK SHKSLK
Subjt:  FSKNSCVVLNEDNQILMNGYRQANNCYHWISNNSDVCHSTKEDQTWLWHRKLGHINLKSIDRAVKNEAVIGVPKIDVNSKFVCGDCQIGKQTKASHKSLK

Query:  ECSTNRVLELLHMDLMGPMQSESLGGKKYVLVAVNDFSRFTWVRFLRGKSDIPKVCMSLCLKLQREKGVNIIRIKSDHGKELKNVDLNSFCDAEEIHHEY
        EC T RV ELLH+DLMG MQ+ESLGGKKYVLV V+D+SRFTWV FL+GKSD  K+C+SLCL LQREKG  IIRI SDHGKE  N DLN+FC ++ IHHE+
Subjt:  ECSTNRVLELLHMDLMGPMQSESLGGKKYVLVAVNDFSRFTWVRFLRGKSDIPKVCMSLCLKLQREKGVNIIRIKSDHGKELKNVDLNSFCDAEEIHHEY

Query:  SAPITP
        +APITP
Subjt:  SAPITP

KAA0036855.1 gag-pol polyprotein [Cucumis melo var. makuwa] | 2.2e-131 | 68.55
Query:  TSENC-KIAFTTVQTTNDVWYFDSGCSRHMTGKRSFLTELNECVVGHVTFGDEAKGRIVVKGNINNNNLPCLNDVRYVDGLKANLISVSQLCDQGYSVNF
        T  +C K++  TVQ   D WYFDSGCSRHMTG RSF TEL EC +GH TFGD AKG+I+ KGNI+ +NLP LN+VRYVDGLKANLIS+SQLCDQGYSVNF
Subjt:  TSENC-KIAFTTVQTTNDVWYFDSGCSRHMTGKRSFLTELNECVVGHVTFGDEAKGRIVVKGNINNNNLPCLNDVRYVDGLKANLISVSQLCDQGYSVNF

Query:  SKNSCVVLNEDNQILMNGYRQANNCYHWISNNSDVCHSTKEDQTWLWHRKLGHINLKSIDRAVKNEAVIGVPKIDVNSKFVCGDCQIGKQTKASHKSLKE
        +    VV N++NQ+ M+G R+ANNCY+W SN S++CH TK DQTWLWHRKLGHI+L+S+D+ ++NEAV+G+P +D+N KF CG+CQ+GKQTK SH+ LKE
Subjt:  SKNSCVVLNEDNQILMNGYRQANNCYHWISNNSDVCHSTKEDQTWLWHRKLGHINLKSIDRAVKNEAVIGVPKIDVNSKFVCGDCQIGKQTKASHKSLKE

Query:  CSTNRVLELLHMDLMGPMQSESLGGKKYVLVAVNDFSRFTWVRFLRGKSDIPKVCMSLCLKLQREKGVNIIRIKSDHGKELKNVDLNSFCDAEEIHHEYS
        C T  VLELLH++LMGPMQ+ESLGGKKYVLV V+D+S+FTWVRFL+ K D  K+C+SLCL LQREKG  IIRI+SDHGKE  N DLN+ C  E IHHE++
Subjt:  CSTNRVLELLHMDLMGPMQSESLGGKKYVLVAVNDFSRFTWVRFLRGKSDIPKVCMSLCLKLQREKGVNIIRIKSDHGKELKNVDLNSFCDAEEIHHEYS

Query:  APITPQHNGVVERKNRTL
        APITPQ NGVVERKNR L
Subjt:  APITPQHNGVVERKNRTL

KAA0038999.1 envelope-like protein [Cucumis melo var. makuwa] | 2.2e-136 | 81.31
Query:  MTGKRSFLTELNECVVGHVTFGDEAKGRIVVKGNINNNNLPCLNDVRYVDGLKANLISVSQLCDQGYSVNFSKNSCVVLNEDNQILMNGYRQANNCYHWI
        MTG + F +EL EC  GHVTFGD A+GRI+ KGNI  + LPCLNDV+YVD LKANLISVSQLCDQGYS+NFSK+SCVV+N+DNQIL+ G RQA+NCYHWI
Subjt:  MTGKRSFLTELNECVVGHVTFGDEAKGRIVVKGNINNNNLPCLNDVRYVDGLKANLISVSQLCDQGYSVNFSKNSCVVLNEDNQILMNGYRQANNCYHWI

Query:  SNNSDVCHSTKEDQTWLWHRKLGHINLKSIDRAVKNEAVIGVPKIDVNSKFVCGDCQIGKQTKASHKSLKECSTNRVLELLHMDLMGPMQSESLGGKKYV
        SNNS+VCHSTKEDQTWLWHRKLGHINLKSIDR V+N+AVI VP IDVNSKFVCGDC I KQTKASHKSLK CSTNRVLELLHMDLMGPMQ++SLGGKKYV
Subjt:  SNNSDVCHSTKEDQTWLWHRKLGHINLKSIDRAVKNEAVIGVPKIDVNSKFVCGDCQIGKQTKASHKSLKECSTNRVLELLHMDLMGPMQSESLGGKKYV

Query:  LVAVNDFSRFTWVRFLRGKSDIPKVCMSLCLKLQREKGVNIIRIKSDHGKELKNVDLNSFCDAEEIHHEYSAPITPQHNGVVERKNRTL
        LVAV+DFSRFTWVRFL+GKSD PKVC+SLCL LQREK V I+RI+SDHGKE KN DLN+F D E IHHEYS PITPQ NGVVERKNRTL
Subjt:  LVAVNDFSRFTWVRFLRGKSDIPKVCMSLCLKLQREKGVNIIRIKSDHGKELKNVDLNSFCDAEEIHHEYSAPITPQHNGVVERKNRTL

KAA0059174.1 F5J5.1 [Cucumis melo var. makuwa] | 5.7e-132 | 68.65
Query:  VKTSENCKIAFTTVQTTNDVWYFDSGCSRHMTGKRSFLTELNECVVGHVTFGDEAKGRIVVKGNINNNNLPCLNDVRYVDGLKANLISVSQLCDQGYSVN
        VKTSE C +AFT VQT  D WYFDSGCSRHMT  RSF TEL EC  GHV F D AKG+I+ KGNI+ +NLPCLN VRYVDGLK NLIS SQLCDQGYSVN
Subjt:  VKTSENCKIAFTTVQTTNDVWYFDSGCSRHMTGKRSFLTELNECVVGHVTFGDEAKGRIVVKGNINNNNLPCLNDVRYVDGLKANLISVSQLCDQGYSVN

Query:  FSKNSCVVLNEDNQILMNGYRQANNCYHWISNNSDVCHSTKEDQTWLWHRKLGHINLKSIDRAVKNEAVIGVPKIDVNSKFVCGDCQIGKQTKASHKSLK
        F+   CVV N++NQ+ ++G R+A+NCYHW SN S++CH TK  QTWLWHRKLGHI+L+S+D+ ++NEA+IG+P +D+N KF CGDCQ+GKQTK SH+ L 
Subjt:  FSKNSCVVLNEDNQILMNGYRQANNCYHWISNNSDVCHSTKEDQTWLWHRKLGHINLKSIDRAVKNEAVIGVPKIDVNSKFVCGDCQIGKQTKASHKSLK

Query:  ECSTNRVLELLHMDLMGPMQSESLGGKKYVLVAVNDFSRFTWVRFLRGKSDIPKVCMSLCLKLQREKGVNIIRIKSDHGKELKNVDLNSFCDAEEIHHEY
        EC T  +LELLH+DL+  MQ+ESLGGKKYV V V+D+SRFTWVRFL+ KSDI K+C+SLCL LQREKG  IIRI+SDHGK+  N +LN+FC  E IHHE+
Subjt:  ECSTNRVLELLHMDLMGPMQSESLGGKKYVLVAVNDFSRFTWVRFLRGKSDIPKVCMSLCLKLQREKGVNIIRIKSDHGKELKNVDLNSFCDAEEIHHEY

Query:  SAPITPQHNGVVERKNRTL
        +APITPQ N VVERKNRTL
Subjt:  SAPITPQHNGVVERKNRTL

TYK26041.1 gag/pol polyprotein [Cucumis melo var. makuwa] | 8.0e-134 | 69.28
Query:  VKTSENCKIAFTTVQTTNDVWYFDSGCSRHMTGKRSFLTELNECVVGHVTFGDEAKGRIVVKGNINNNNLPCLNDVRYVDGLKANLISVSQLCDQGYSVN
        VKTS+ C +AF TVQT  D WYFDSGCSRHMTG RSF TEL EC  GHVTFGD AKG+I+ KGN++ +NLP +N+VRYVDGLK NLISVSQLCDQGYSVN
Subjt:  VKTSENCKIAFTTVQTTNDVWYFDSGCSRHMTGKRSFLTELNECVVGHVTFGDEAKGRIVVKGNINNNNLPCLNDVRYVDGLKANLISVSQLCDQGYSVN

Query:  FSKNSCVVLNEDNQILMNGYRQANNCYHWISNNSDVCHSTKEDQTWLWHRKLGHINLKSIDRAVKNEAVIGVPKIDVNSKFVCGDCQIGKQTKASHKSLK
        F+  SCV  +++NQ+ ++G R+ANNC HW SN S++CH TK DQTWLWHRKLGHI+L+S+D+ ++N+AV+G+P +D+N KF CGDC++GKQTK SH+ LK
Subjt:  FSKNSCVVLNEDNQILMNGYRQANNCYHWISNNSDVCHSTKEDQTWLWHRKLGHINLKSIDRAVKNEAVIGVPKIDVNSKFVCGDCQIGKQTKASHKSLK

Query:  ECSTNRVLELLHMDLMGPMQSESLGGKKYVLVAVNDFSRFTWVRFLRGKSDIPKVCMSLCLKLQREKGVNIIRIKSDHGKELKNVDLNSFCDAEEIHHEY
        EC T RVLELLH+DL+GPM++ESLG KKYVLV V+D+SRFT VRFL+GKSD  K+C+SL L LQREKG  IIRI+SDHGKE  N DLN+FC  E IHHE+
Subjt:  ECSTNRVLELLHMDLMGPMQSESLGGKKYVLVAVNDFSRFTWVRFLRGKSDIPKVCMSLCLKLQREKGVNIIRIKSDHGKELKNVDLNSFCDAEEIHHEY

Query:  SAPITPQHNGVVERKNRTL
         APITPQ NGVVERKNRT+
Subjt:  SAPITPQHNGVVERKNRTL

TrEMBL top hits | e value | % identity | Alignment
A0A5A7SMR2 Gag-pol polyprotein | 1.9e-133 | 70.59
Query:  VKTSENCKIAFTTVQTTNDVWYFDSGCSRHMTGKRSFLTELNECVVGHVTFGDEAKGRIVVKGNINNNNLPCLNDVRYVDGLKANLISVSQLCDQGYSVN
        VKTSE CK+AFTT+QT  D WYFDSGCSRHMTG RSF TEL EC   HVTF D AKGRI+ KGNIN +NLPCLN+VRY+DGLKANLIS+SQ+CDQGYSVN
Subjt:  VKTSENCKIAFTTVQTTNDVWYFDSGCSRHMTGKRSFLTELNECVVGHVTFGDEAKGRIVVKGNINNNNLPCLNDVRYVDGLKANLISVSQLCDQGYSVN

Query:  FSKNSCVVLNEDNQILMNGYRQANNCYHWISNNSDVCHSTKEDQTWLWHRKLGHINLKSIDRAVKNEAVIGVPKIDVNSKFVCGDCQIGKQTKASHKSLK
        F+   CVV +++NQ+ M+G RQ +NCYHW SN+S++CH TK DQTWLWHRKLGHI+++S+D+ ++NEAV+ +P +D+N KF CGDCQ+GK+TK SHKSLK
Subjt:  FSKNSCVVLNEDNQILMNGYRQANNCYHWISNNSDVCHSTKEDQTWLWHRKLGHINLKSIDRAVKNEAVIGVPKIDVNSKFVCGDCQIGKQTKASHKSLK

Query:  ECSTNRVLELLHMDLMGPMQSESLGGKKYVLVAVNDFSRFTWVRFLRGKSDIPKVCMSLCLKLQREKGVNIIRIKSDHGKELKNVDLNSFCDAEEIHHEY
        EC T RV ELLH+DLMG MQ+ESLGGKKYVLV V+D+SRFTWV FL+GKSD  K+C+SLCL LQREKG  IIRI SDHGKE  N DLN+FC ++ IHHE+
Subjt:  ECSTNRVLELLHMDLMGPMQSESLGGKKYVLVAVNDFSRFTWVRFLRGKSDIPKVCMSLCLKLQREKGVNIIRIKSDHGKELKNVDLNSFCDAEEIHHEY

Query:  SAPITP
        +APITP
Subjt:  SAPITP

A0A5A7TCJ0 Envelope-like protein | 1.1e-136 | 81.31
Query:  MTGKRSFLTELNECVVGHVTFGDEAKGRIVVKGNINNNNLPCLNDVRYVDGLKANLISVSQLCDQGYSVNFSKNSCVVLNEDNQILMNGYRQANNCYHWI
        MTG + F +EL EC  GHVTFGD A+GRI+ KGNI  + LPCLNDV+YVD LKANLISVSQLCDQGYS+NFSK+SCVV+N+DNQIL+ G RQA+NCYHWI
Subjt:  MTGKRSFLTELNECVVGHVTFGDEAKGRIVVKGNINNNNLPCLNDVRYVDGLKANLISVSQLCDQGYSVNFSKNSCVVLNEDNQILMNGYRQANNCYHWI

Query:  SNNSDVCHSTKEDQTWLWHRKLGHINLKSIDRAVKNEAVIGVPKIDVNSKFVCGDCQIGKQTKASHKSLKECSTNRVLELLHMDLMGPMQSESLGGKKYV
        SNNS+VCHSTKEDQTWLWHRKLGHINLKSIDR V+N+AVI VP IDVNSKFVCGDC I KQTKASHKSLK CSTNRVLELLHMDLMGPMQ++SLGGKKYV
Subjt:  SNNSDVCHSTKEDQTWLWHRKLGHINLKSIDRAVKNEAVIGVPKIDVNSKFVCGDCQIGKQTKASHKSLKECSTNRVLELLHMDLMGPMQSESLGGKKYV

Query:  LVAVNDFSRFTWVRFLRGKSDIPKVCMSLCLKLQREKGVNIIRIKSDHGKELKNVDLNSFCDAEEIHHEYSAPITPQHNGVVERKNRTL
        LVAV+DFSRFTWVRFL+GKSD PKVC+SLCL LQREK V I+RI+SDHGKE KN DLN+F D E IHHEYS PITPQ NGVVERKNRTL
Subjt:  LVAVNDFSRFTWVRFLRGKSDIPKVCMSLCLKLQREKGVNIIRIKSDHGKELKNVDLNSFCDAEEIHHEYSAPITPQHNGVVERKNRTL

A0A5A7UVR7 F5J5.1 | 2.8e-132 | 68.65
Query:  VKTSENCKIAFTTVQTTNDVWYFDSGCSRHMTGKRSFLTELNECVVGHVTFGDEAKGRIVVKGNINNNNLPCLNDVRYVDGLKANLISVSQLCDQGYSVN
        VKTSE C +AFT VQT  D WYFDSGCSRHMT  RSF TEL EC  GHV F D AKG+I+ KGNI+ +NLPCLN VRYVDGLK NLIS SQLCDQGYSVN
Subjt:  VKTSENCKIAFTTVQTTNDVWYFDSGCSRHMTGKRSFLTELNECVVGHVTFGDEAKGRIVVKGNINNNNLPCLNDVRYVDGLKANLISVSQLCDQGYSVN

Query:  FSKNSCVVLNEDNQILMNGYRQANNCYHWISNNSDVCHSTKEDQTWLWHRKLGHINLKSIDRAVKNEAVIGVPKIDVNSKFVCGDCQIGKQTKASHKSLK
        F+   CVV N++NQ+ ++G R+A+NCYHW SN S++CH TK  QTWLWHRKLGHI+L+S+D+ ++NEA+IG+P +D+N KF CGDCQ+GKQTK SH+ L 
Subjt:  FSKNSCVVLNEDNQILMNGYRQANNCYHWISNNSDVCHSTKEDQTWLWHRKLGHINLKSIDRAVKNEAVIGVPKIDVNSKFVCGDCQIGKQTKASHKSLK

Query:  ECSTNRVLELLHMDLMGPMQSESLGGKKYVLVAVNDFSRFTWVRFLRGKSDIPKVCMSLCLKLQREKGVNIIRIKSDHGKELKNVDLNSFCDAEEIHHEY
        EC T  +LELLH+DL+  MQ+ESLGGKKYV V V+D+SRFTWVRFL+ KSDI K+C+SLCL LQREKG  IIRI+SDHGK+  N +LN+FC  E IHHE+
Subjt:  ECSTNRVLELLHMDLMGPMQSESLGGKKYVLVAVNDFSRFTWVRFLRGKSDIPKVCMSLCLKLQREKGVNIIRIKSDHGKELKNVDLNSFCDAEEIHHEY

Query:  SAPITPQHNGVVERKNRTL
        +APITPQ N VVERKNRTL
Subjt:  SAPITPQHNGVVERKNRTL

A0A5D3BA69 Gag-pol polyprotein | 1.1e-131 | 68.55
Query:  TSENC-KIAFTTVQTTNDVWYFDSGCSRHMTGKRSFLTELNECVVGHVTFGDEAKGRIVVKGNINNNNLPCLNDVRYVDGLKANLISVSQLCDQGYSVNF
        T  +C K++  TVQ   D WYFDSGCSRHMTG RSF TEL EC +GH TFGD AKG+I+ KGNI+ +NLP LN+VRYVDGLKANLIS+SQLCDQGYSVNF
Subjt:  TSENC-KIAFTTVQTTNDVWYFDSGCSRHMTGKRSFLTELNECVVGHVTFGDEAKGRIVVKGNINNNNLPCLNDVRYVDGLKANLISVSQLCDQGYSVNF

Query:  SKNSCVVLNEDNQILMNGYRQANNCYHWISNNSDVCHSTKEDQTWLWHRKLGHINLKSIDRAVKNEAVIGVPKIDVNSKFVCGDCQIGKQTKASHKSLKE
        +    VV N++NQ+ M+G R+ANNCY+W SN S++CH TK DQTWLWHRKLGHI+L+S+D+ ++NEAV+G+P +D+N KF CG+CQ+GKQTK SH+ LKE
Subjt:  SKNSCVVLNEDNQILMNGYRQANNCYHWISNNSDVCHSTKEDQTWLWHRKLGHINLKSIDRAVKNEAVIGVPKIDVNSKFVCGDCQIGKQTKASHKSLKE

Query:  CSTNRVLELLHMDLMGPMQSESLGGKKYVLVAVNDFSRFTWVRFLRGKSDIPKVCMSLCLKLQREKGVNIIRIKSDHGKELKNVDLNSFCDAEEIHHEYS
        C T  VLELLH++LMGPMQ+ESLGGKKYVLV V+D+S+FTWVRFL+ K D  K+C+SLCL LQREKG  IIRI+SDHGKE  N DLN+ C  E IHHE++
Subjt:  CSTNRVLELLHMDLMGPMQSESLGGKKYVLVAVNDFSRFTWVRFLRGKSDIPKVCMSLCLKLQREKGVNIIRIKSDHGKELKNVDLNSFCDAEEIHHEYS

Query:  APITPQHNGVVERKNRTL
        APITPQ NGVVERKNR L
Subjt:  APITPQHNGVVERKNRTL

A0A5D3DQT9 Gag/pol polyprotein | 3.9e-134 | 69.28
Query:  VKTSENCKIAFTTVQTTNDVWYFDSGCSRHMTGKRSFLTELNECVVGHVTFGDEAKGRIVVKGNINNNNLPCLNDVRYVDGLKANLISVSQLCDQGYSVN
        VKTS+ C +AF TVQT  D WYFDSGCSRHMTG RSF TEL EC  GHVTFGD AKG+I+ KGN++ +NLP +N+VRYVDGLK NLISVSQLCDQGYSVN
Subjt:  VKTSENCKIAFTTVQTTNDVWYFDSGCSRHMTGKRSFLTELNECVVGHVTFGDEAKGRIVVKGNINNNNLPCLNDVRYVDGLKANLISVSQLCDQGYSVN

Query:  FSKNSCVVLNEDNQILMNGYRQANNCYHWISNNSDVCHSTKEDQTWLWHRKLGHINLKSIDRAVKNEAVIGVPKIDVNSKFVCGDCQIGKQTKASHKSLK
        F+  SCV  +++NQ+ ++G R+ANNC HW SN S++CH TK DQTWLWHRKLGHI+L+S+D+ ++N+AV+G+P +D+N KF CGDC++GKQTK SH+ LK
Subjt:  FSKNSCVVLNEDNQILMNGYRQANNCYHWISNNSDVCHSTKEDQTWLWHRKLGHINLKSIDRAVKNEAVIGVPKIDVNSKFVCGDCQIGKQTKASHKSLK

Query:  ECSTNRVLELLHMDLMGPMQSESLGGKKYVLVAVNDFSRFTWVRFLRGKSDIPKVCMSLCLKLQREKGVNIIRIKSDHGKELKNVDLNSFCDAEEIHHEY
        EC T RVLELLH+DL+GPM++ESLG KKYVLV V+D+SRFT VRFL+GKSD  K+C+SL L LQREKG  IIRI+SDHGKE  N DLN+FC  E IHHE+
Subjt:  ECSTNRVLELLHMDLMGPMQSESLGGKKYVLVAVNDFSRFTWVRFLRGKSDIPKVCMSLCLKLQREKGVNIIRIKSDHGKELKNVDLNSFCDAEEIHHEY

Query:  SAPITPQHNGVVERKNRTL
         APITPQ NGVVERKNRT+
Subjt:  SAPITPQHNGVVERKNRTL

SwissProt top hits | e value | % identity | Alignment
P04146 Copia protein | 1.9e-21 | 25.45
Query:  VKTSENCKIAFTTVQTTNDV------WYFDSGCSRHMTGKRSFLTELNECVVGHVTFGDEAKGRIVV---KG--NINNNNLPCLNDVRYVDGLKANLISV
        V+T+ +  IAF   +  N        +  DSG S H+    S  T+  E VV  +      +G  +    +G   + N++   L DV +      NL+SV
Subjt:  VKTSENCKIAFTTVQTTNDV------WYFDSGCSRHMTGKRSFLTELNECVVGHVTFGDEAKGRIVV---KG--NINNNNLPCLNDVRYVDGLKANLISV

Query:  SQLCDQGYSVNFSKNSCVVLNEDNQILMNGYRQANNCYHWISNNSDVCHSTKEDQTWLWHRKLGHIN---LKSIDRAVKNEAVIGVPKIDVNSKFVCGDC
         +L + G S+ F K S V ++++  +++      NN    I+  +   ++  ++   LWH + GHI+   L  I R         +  ++++ + +C  C
Subjt:  SQLCDQGYSVNFSKNSCVVLNEDNQILMNGYRQANNCYHWISNNSDVCHSTKEDQTWLWHRKLGHIN---LKSIDRAVKNEAVIGVPKIDVNSKFVCGDC

Query:  QIGKQTKASHKSLKE-CSTNRVLELLHMDLMGPMQSESLGGKKYVLVAVNDFSRFTWVRFLRGKSDIPKVCMSLCLKLQREKGVNIIRIKSDHGKELKNV
          GKQ +   K LK+     R L ++H D+ GP+   +L  K Y ++ V+ F+ +     ++ KSD+  +      K +    + ++ +  D+G+E  + 
Subjt:  QIGKQTKASHKSLKE-CSTNRVLELLHMDLMGPMQSESLGGKKYVLVAVNDFSRFTWVRFLRGKSDIPKVCMSLCLKLQREKGVNIIRIKSDHGKELKNV

Query:  DLNSFCDAEEIHHEYSAPITPQHNGVVERKNRTL
        ++  FC  + I +  + P TPQ NGV ER  RT+
Subjt:  DLNSFCDAEEIHHEYSAPITPQHNGVVERKNRTL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 3.3e-34 | 30.84
Query:  WYFDSGCSRHMTGKRSFLTELNECVVGHVTFGDEAKGRIVVKGNI-NNNNLPC---LNDVRYVDGLKANLISVSQLCDQGYSVNFSKNSCVVLNEDNQIL
        W  D+  S H T  R           G V  G+ +  +I   G+I    N+ C   L DVR+V  L+ NLIS   L   GY   F+ N    L + + ++
Subjt:  WYFDSGCSRHMTGKRSFLTELNECVVGHVTFGDEAKGRIVVKGNI-NNNNLPC---LNDVRYVDGLKANLISVSQLCDQGYSVNFSKNSCVVLNEDNQIL

Query:  MNGYRQANNCYHWISNNSDVCH-----STKEDQTWLWHRKLGHINLKSIDRAVKNEAVIGVPKIDVNSKFVCGDCQIGKQTKASHKSLKECSTNRVLELL
          G  +          N+++C      +  E    LWH+++GH++ K +    K   +       V     C  C  GKQ + S ++  E   N +L+L+
Subjt:  MNGYRQANNCYHWISNNSDVCH-----STKEDQTWLWHRKLGHINLKSIDRAVKNEAVIGVPKIDVNSKFVCGDCQIGKQTKASHKSLKECSTNRVLELL

Query:  HMDLMGPMQSESLGGKKYVLVAVNDFSRFTWVRFLRGKSDIPKVCMSLCLKLQREKGVNIIRIKSDHGKELKNVDLNSFCDAEEIHHEYSAPITPQHNGV
        + D+ GPM+ ES+GG KY +  ++D SR  WV  L+ K  + +V       ++RE G  + R++SD+G E  + +   +C +  I HE + P TPQHNGV
Subjt:  HMDLMGPMQSESLGGKKYVLVAVNDFSRFTWVRFLRGKSDIPKVCMSLCLKLQREKGVNIIRIKSDHGKELKNVDLNSFCDAEEIHHEYSAPITPQHNGV

Query:  VERKNRTL
         ER NRT+
Subjt:  VERKNRTL

P25384 Transposon Ty2-C Gag-Pol polyprotein | 7.5e-18 | 27.95
Query:  NLISVSQLCDQGYSVNFSKNSC----------VVLNEDNQILMNGYRQANNCYHWISNNSDVCHSTKEDQTWLWHRKLGHINLKSIDRAVKNEAVIGVPK
        +L+S+S+L +Q  +  F++N+           +V + D   L   Y   ++      NN +   S  +    L HR LGH N +SI +++K  AV  + +
Subjt:  NLISVSQLCDQGYSVNFSKNSC----------VVLNEDNQILMNGYRQANNCYHWISNNSDVCHSTKEDQTWLWHRKLGHINLKSIDRAVKNEAVIGVPK

Query:  IDVN----SKFVCGDCQIGKQTKASH---KSLKECSTNRVLELLHMDLMGPMQSESLGGKKYVLVAVNDFSRFTWVRFL--RGKSDIPKVCMSLCLKLQR
         D+     S + C DC IGK TK  H     LK   +    + LH D+ GP+         Y +   ++ +RF WV  L  R +  I  V  S+   ++ 
Subjt:  IDVN----SKFVCGDCQIGKQTKASH---KSLKECSTNRVLELLHMDLMGPMQSESLGGKKYVLVAVNDFSRFTWVRFL--RGKSDIPKVCMSLCLKLQR

Query:  EKGVNIIRIKSDHGKELKNVDLNSFCDAEEIHHEYSAPITPQHNGVVERKNRTL
        +    ++ I+ D G E  N  L+ F     I   Y+     + +GV ER NRTL
Subjt:  EKGVNIIRIKSDHGKELKNVDLNSFCDAEEIHHEYSAPITPQHNGVVERKNRTL

Q12472 Transposon Ty2-DR1 Gag-Pol polyprotein | 7.5e-18 | 27.95
Query:  NLISVSQLCDQGYSVNFSKNSC----------VVLNEDNQILMNGYRQANNCYHWISNNSDVCHSTKEDQTWLWHRKLGHINLKSIDRAVKNEAVIGVPK
        +L+S+S+L +Q  +  F++N+           +V + D   L   Y   ++      NN +   S  +    L HR LGH N +SI +++K  AV  + +
Subjt:  NLISVSQLCDQGYSVNFSKNSC----------VVLNEDNQILMNGYRQANNCYHWISNNSDVCHSTKEDQTWLWHRKLGHINLKSIDRAVKNEAVIGVPK

Query:  IDVN----SKFVCGDCQIGKQTKASH---KSLKECSTNRVLELLHMDLMGPMQSESLGGKKYVLVAVNDFSRFTWVRFL--RGKSDIPKVCMSLCLKLQR
         D+     S + C DC IGK TK  H     LK   +    + LH D+ GP+         Y +   ++ +RF WV  L  R +  I  V  S+   ++ 
Subjt:  IDVN----SKFVCGDCQIGKQTKASH---KSLKECSTNRVLELLHMDLMGPMQSESLGGKKYVLVAVNDFSRFTWVRFL--RGKSDIPKVCMSLCLKLQR

Query:  EKGVNIIRIKSDHGKELKNVDLNSFCDAEEIHHEYSAPITPQHNGVVERKNRTL
        +    ++ I+ D G E  N  L+ F     I   Y+     + +GV ER NRTL
Subjt:  EKGVNIIRIKSDHGKELKNVDLNSFCDAEEIHHEYSAPITPQHNGVVERKNRTL

Q12491 Transposon Ty2-B Gag-Pol polyprotein | 7.5e-18 | 27.95
Query:  NLISVSQLCDQGYSVNFSKNSC----------VVLNEDNQILMNGYRQANNCYHWISNNSDVCHSTKEDQTWLWHRKLGHINLKSIDRAVKNEAVIGVPK
        +L+S+S+L +Q  +  F++N+           +V + D   L   Y   ++      NN +   S  +    L HR LGH N +SI +++K  AV  + +
Subjt:  NLISVSQLCDQGYSVNFSKNSC----------VVLNEDNQILMNGYRQANNCYHWISNNSDVCHSTKEDQTWLWHRKLGHINLKSIDRAVKNEAVIGVPK

Query:  IDVN----SKFVCGDCQIGKQTKASH---KSLKECSTNRVLELLHMDLMGPMQSESLGGKKYVLVAVNDFSRFTWVRFL--RGKSDIPKVCMSLCLKLQR
         D+     S + C DC IGK TK  H     LK   +    + LH D+ GP+         Y +   ++ +RF WV  L  R +  I  V  S+   ++ 
Subjt:  IDVN----SKFVCGDCQIGKQTKASH---KSLKECSTNRVLELLHMDLMGPMQSESLGGKKYVLVAVNDFSRFTWVRFL--RGKSDIPKVCMSLCLKLQR

Query:  EKGVNIIRIKSDHGKELKNVDLNSFCDAEEIHHEYSAPITPQHNGVVERKNRTL
        +    ++ I+ D G E  N  L+ F     I   Y+     + +GV ER NRTL
Subjt:  EKGVNIIRIKSDHGKELKNVDLNSFCDAEEIHHEYSAPITPQHNGVVERKNRTL

Arabidopsis top hits | e value | % identity | Alignment
No hits found

Sequences
CDS sequence
ATGGTAGTGAAGACATCTGAGAATTGCAAGATTGCATTTACAACAGTTCAAACCACAAATGATGTTTGGTACTTTGACAGTGGGTGTTCAAGACATATGACAGGTAAACG
ATCATTCCTGACTGAATTAAACGAATGTGTTGTAGGTCATGTTACTTTTGGAGATGAAGCCAAAGGAAGAATTGTTGTAAAAGGAAATATTAATAACAATAATCTACCAT
GCCTAAATGATGTAAGATATGTAGATGGGCTAAAAGCAAATCTGATTAGTGTGAGTCAGTTGTGTGACCAAGGCTATAGTGTAAATTTCAGTAAAAACAGCTGTGTGGTT
TTGAATGAAGACAATCAAATTCTTATGAATGGTTATAGGCAAGCGAATAATTGTTATCACTGGATTTCAAATAATTCAGATGTGTGTCATTCGACTAAAGAAGATCAAAC
CTGGCTTTGGCACAGGAAACTTGGACACATTAATTTAAAAAGCATAGACAGAGCCGTAAAAAATGAGGCTGTGATAGGAGTTCCAAAGATTGATGTGAATAGCAAATTTG
TGTGTGGAGACTGTCAAATTGGGAAACAAACTAAAGCATCTCACAAAAGTCTAAAAGAATGTTCCACAAATAGAGTTCTTGAACTTCTACATATGGATCTTATGGGTCCA
ATGCAGAGTGAAAGTCTTGGTGGAAAGAAATATGTACTTGTAGCTGTAAATGACTTTTCTAGATTCACTTGGGTTCGATTTTTGAGAGGCAAGTCTGACATTCCTAAAGT
TTGCATGAGCCTATGTTTGAAGTTGCAGCGAGAAAAAGGGGTAAATATTATTAGAATTAAAAGTGATCATGGCAAAGAATTAAAAAATGTAGATCTCAATAGTTTCTGTG
ATGCTGAAGAAATACACCATGAATACTCAGCTCCTATAACTCCTCAACATAATGGTGTGGTCGAAAGGAAAAACAGAACTTTGTAG
mRNA sequence
ATGGTAGTGAAGACATCTGAGAATTGCAAGATTGCATTTACAACAGTTCAAACCACAAATGATGTTTGGTACTTTGACAGTGGGTGTTCAAGACATATGACAGGTAAACG
ATCATTCCTGACTGAATTAAACGAATGTGTTGTAGGTCATGTTACTTTTGGAGATGAAGCCAAAGGAAGAATTGTTGTAAAAGGAAATATTAATAACAATAATCTACCAT
GCCTAAATGATGTAAGATATGTAGATGGGCTAAAAGCAAATCTGATTAGTGTGAGTCAGTTGTGTGACCAAGGCTATAGTGTAAATTTCAGTAAAAACAGCTGTGTGGTT
TTGAATGAAGACAATCAAATTCTTATGAATGGTTATAGGCAAGCGAATAATTGTTATCACTGGATTTCAAATAATTCAGATGTGTGTCATTCGACTAAAGAAGATCAAAC
CTGGCTTTGGCACAGGAAACTTGGACACATTAATTTAAAAAGCATAGACAGAGCCGTAAAAAATGAGGCTGTGATAGGAGTTCCAAAGATTGATGTGAATAGCAAATTTG
TGTGTGGAGACTGTCAAATTGGGAAACAAACTAAAGCATCTCACAAAAGTCTAAAAGAATGTTCCACAAATAGAGTTCTTGAACTTCTACATATGGATCTTATGGGTCCA
ATGCAGAGTGAAAGTCTTGGTGGAAAGAAATATGTACTTGTAGCTGTAAATGACTTTTCTAGATTCACTTGGGTTCGATTTTTGAGAGGCAAGTCTGACATTCCTAAAGT
TTGCATGAGCCTATGTTTGAAGTTGCAGCGAGAAAAAGGGGTAAATATTATTAGAATTAAAAGTGATCATGGCAAAGAATTAAAAAATGTAGATCTCAATAGTTTCTGTG
ATGCTGAAGAAATACACCATGAATACTCAGCTCCTATAACTCCTCAACATAATGGTGTGGTCGAAAGGAAAAACAGAACTTTGTAG
Protein sequence
MVVKTSENCKIAFTTVQTTNDVWYFDSGCSRHMTGKRSFLTELNECVVGHVTFGDEAKGRIVVKGNINNNNLPCLNDVRYVDGLKANLISVSQLCDQGYSVNFSKNSCVV
LNEDNQILMNGYRQANNCYHWISNNSDVCHSTKEDQTWLWHRKLGHINLKSIDRAVKNEAVIGVPKIDVNSKFVCGDCQIGKQTKASHKSLKECSTNRVLELLHMDLMGP
MQSESLGGKKYVLVAVNDFSRFTWVRFLRGKSDIPKVCMSLCLKLQREKGVNIIRIKSDHGKELKNVDLNSFCDAEEIHHEYSAPITPQHNGVVERKNRTL
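
The CDS and protein sequences above can be cross-checked by translating the CDS codon by codon. The following is a minimal sketch that assumes the standard genetic code (the record does not name a translation table). The names `CODON_TABLE` and `translate` are hypothetical helpers written for this illustration, and the inputs shown are only the first 30 and last 12 nucleotides of the CDS.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# at each position (assumption: the standard code applies here).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Codons containing characters other than A, C, G, T become 'X'.
    """
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt of the CDS above; compare with the start of the protein sequence.
print(translate("ATGGTAGTGAAGACATCTGAGAATTGCAAG"))  # MVVKTSENCK
# Last 12 nt of the CDS above, ending in the TAG stop codon.
print(translate("AGAACTTTGTAG"))  # RTL
```

Pasting the full CDS (with or without line breaks) into `translate` should give a string that can be compared directly with the protein sequence listed in this record.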