; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc03g0070841 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc03g0070841
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag/pol protein
Genome locationCMiso1.1chr03:16813228..16814061
RNA-Seq ExpressionCmc03g0070841
SyntenyCmc03g0070841
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]7.4e-12080.14Show/hide
Query:  MVSAKALEDLKLFFNDRYILLKNVLYAPQMKRNLIFISCLIEHMYKISFEVNEAFILRKGIHICSAILENNLYKSRPTLANFVLNTEMFRTAETQNKRQK
        +VSA+A+ DL LFF DRY++LK+VLY P MKRNLI I+C++EH+Y ISFEVNE FIL KGI ICSAI ENNLYK RPT AN VLNTEMFRT ETQNK+QK
Subjt:  MVSAKALEDLKLFFNDRYILLKNVLYAPQMKRNLIFISCLIEHMYKISFEVNEAFILRKGIHICSAILENNLYKSRPTLANFVLNTEMFRTAETQNKRQK

Query:  VSSNAFLWHLRLGHINLNRIGRLVKSGLLNQLEDNSLPTCDSYLEGKMTKISFTGKGLRAKTPLEIVHSDLWGPMNVKARGGYEYFISFIDDYSRYGHVY
        VSSNA+LWHLRLGHINLNRI RLVKSG+LNQLEDNSLP C+S LEGKMTK SFTGKGLRAK PLE+VHSDL GPMNVKARGGYEYFISFIDD+SRYGHVY
Subjt:  VSSNAFLWHLRLGHINLNRIGRLVKSGLLNQLEDNSLPTCDSYLEGKMTKISFTGKGLRAKTPLEIVHSDLWGPMNVKARGGYEYFISFIDDYSRYGHVY

Query:  LGHVYLIQNKSDSFEMFKEYKAEVENESGKTIKTFRSDRGGEYMDFRFHDYLIEYGIQSQLSIPNTPQQNGVSESRN
        L H     +KS+SFE FKEYKAEVENE GKTIKT RSDRGGEYMD +F DYLIE+GIQSQLS P+TPQQNGVSE RN
Subjt:  LGHVYLIQNKSDSFEMFKEYKAEVENESGKTIKTFRSDRGGEYMDFRFHDYLIEYGIQSQLSIPNTPQQNGVSESRN

KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]1.1e-10267.74Show/hide
Query:  MVSAKALEDLKLFFNDRYILLKNVLYAPQMKRNLIFISCLIEHMYKISFEVNEAFILRKGIHICSAILENNLYKSRPTLANFVLNTEMFRTAETQNKRQK
        ++SA+A+ D KLFF ++++ L+N+   P++KRNL+ +SCLIEHMY I+F +NEAFI + G+HICSA LENNLY  RP  A  VLN EMFRTA TQNKRQ+
Subjt:  MVSAKALEDLKLFFNDRYILLKNVLYAPQMKRNLIFISCLIEHMYKISFEVNEAFILRKGIHICSAILENNLYKSRPTLANFVLNTEMFRTAETQNKRQK

Query:  VS--SNAFLWHLRLGHINLNRIGRLVKSGLLNQLEDNSLPTCDSYLEGKMTKISFTGKGLRAKTPLEIVHSDLWGPMNVKARGGYEYFISFIDDYSRYGH
        +S  +N +LWHLRLGHINL+RIGRLVK+GLLN+L+D SLP C+S LEGKMTK  FTGKG RAK PLE++HSDL GPMNVKARGG+EYFISFIDDYSRY  
Subjt:  VS--SNAFLWHLRLGHINLNRIGRLVKSGLLNQLEDNSLPTCDSYLEGKMTKISFTGKGLRAKTPLEIVHSDLWGPMNVKARGGYEYFISFIDDYSRYGH

Query:  VYLGHVYLIQNKSDSFEMFKEYKAEVENESGKTIKTFRSDRGGEYMDFRFHDYLIEYGIQSQLSIPNTPQQNGVSESRN
           G++YL+++KS++ E FKEYK EVEN   K IK  RSDRGGEYMD RF DY+IE+GIQSQLS P TPQQNGVSE RN
Subjt:  VYLGHVYLIQNKSDSFEMFKEYKAEVENESGKTIKTFRSDRGGEYMDFRFHDYLIEYGIQSQLSIPNTPQQNGVSESRN

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa]1.2e-10167.38Show/hide
Query:  MVSAKALEDLKLFFNDRYILLKNVLYAPQMKRNLIFISCLIEHMYKISFEVNEAFILRKGIHICSAILENNLYKSRPTLANFVLNTEMFRTAETQNKRQK
        ++SA+A+ D KLFF ++++ L+N+   P++KRNL+ +SCLIEHMY I+F +NEAFI + G+HICSA LENNLY  RP  A  VLN EMFRTA TQNKRQ+
Subjt:  MVSAKALEDLKLFFNDRYILLKNVLYAPQMKRNLIFISCLIEHMYKISFEVNEAFILRKGIHICSAILENNLYKSRPTLANFVLNTEMFRTAETQNKRQK

Query:  VS--SNAFLWHLRLGHINLNRIGRLVKSGLLNQLEDNSLPTCDSYLEGKMTKISFTGKGLRAKTPLEIVHSDLWGPMNVKARGGYEYFISFIDDYSRYGH
        +S  +N +LWHLRLGHINL+RIGRLVK GLLN+L+D SLP C+S LEGKMTK  FTGKG RAK PLE++HSDL GPMNVKARG +EYFISFIDDYSRY  
Subjt:  VS--SNAFLWHLRLGHINLNRIGRLVKSGLLNQLEDNSLPTCDSYLEGKMTKISFTGKGLRAKTPLEIVHSDLWGPMNVKARGGYEYFISFIDDYSRYGH

Query:  VYLGHVYLIQNKSDSFEMFKEYKAEVENESGKTIKTFRSDRGGEYMDFRFHDYLIEYGIQSQLSIPNTPQQNGVSESRN
           G++YL+++KS++ E FKEYK EVEN   K IK FRSDRGGEYMD  F DY+IE+GIQSQLS P TPQQNGVSE RN
Subjt:  VYLGHVYLIQNKSDSFEMFKEYKAEVENESGKTIKTFRSDRGGEYMDFRFHDYLIEYGIQSQLSIPNTPQQNGVSESRN

KAA0060534.1 gag/pol protein [Cucumis melo var. makuwa]4.5e-10167.03Show/hide
Query:  MVSAKALEDLKLFFNDRYILLKNVLYAPQMKRNLIFISCLIEHMYKISFEVNEAFILRKGIHICSAILENNLYKSRPTLANFVLNTEMFRTAETQNKRQK
        ++SA+A+ D KLFF ++++ L+N+   P++KRNL+ +SCLIEHMY ISF +NEAFI + G+HICS  LE+NLY  +P     VLN EMFRTA TQNKRQ+
Subjt:  MVSAKALEDLKLFFNDRYILLKNVLYAPQMKRNLIFISCLIEHMYKISFEVNEAFILRKGIHICSAILENNLYKSRPTLANFVLNTEMFRTAETQNKRQK

Query:  VSS--NAFLWHLRLGHINLNRIGRLVKSGLLNQLEDNSLPTCDSYLEGKMTKISFTGKGLRAKTPLEIVHSDLWGPMNVKARGGYEYFISFIDDYSRYGH
        +SS  N +LWHLRLGHINL+RIGRLVK+GLLN+LED+SLP C+S LEGKMTK  FTGKG RAK PLE++HSDL GPMNVKA GG+EYFISFIDDYS Y  
Subjt:  VSS--NAFLWHLRLGHINLNRIGRLVKSGLLNQLEDNSLPTCDSYLEGKMTKISFTGKGLRAKTPLEIVHSDLWGPMNVKARGGYEYFISFIDDYSRYGH

Query:  VYLGHVYLIQNKSDSFEMFKEYKAEVENESGKTIKTFRSDRGGEYMDFRFHDYLIEYGIQSQLSIPNTPQQNGVSESRN
           G++YLI++KS++ E FKEYK EVEN   K IK  RSDRGGEYMD RF DY+IE+GIQSQLS P TPQQNGVSE RN
Subjt:  VYLGHVYLIQNKSDSFEMFKEYKAEVENESGKTIKTFRSDRGGEYMDFRFHDYLIEYGIQSQLSIPNTPQQNGVSESRN

KAA0067938.1 gag/pol protein [Cucumis melo var. makuwa]9.7e-9665.23Show/hide
Query:  MVSAKALEDLKLFFNDRYILLKNVLYAPQMKRNLIFISCLIEHMYKISFEVNEAFILRKGIHICSAILENNLYKSRPTLANFVLNTEMFRTAETQNKRQK
        ++SA+A+ D+KLFF  +++ L+N+   P++KRNL+F+SCLIEHMY I+F +NEAFI + G     A LE+NLY  RP  A  VLN EMFRTA TQNKRQ+
Subjt:  MVSAKALEDLKLFFNDRYILLKNVLYAPQMKRNLIFISCLIEHMYKISFEVNEAFILRKGIHICSAILENNLYKSRPTLANFVLNTEMFRTAETQNKRQK

Query:  VS--SNAFLWHLRLGHINLNRIGRLVKSGLLNQLEDNSLPTCDSYLEGKMTKISFTGKGLRAKTPLEIVHSDLWGPMNVKARGGYEYFISFIDDYSRYGH
        +S  +N +LWHLRL HINL+RIGRLVK+GLLN+L+D+SLP C+S LEGKMTK  FTGK  RAK PLE++HSDL GPMNVKARGG+EYFISFIDDYSRY  
Subjt:  VS--SNAFLWHLRLGHINLNRIGRLVKSGLLNQLEDNSLPTCDSYLEGKMTKISFTGKGLRAKTPLEIVHSDLWGPMNVKARGGYEYFISFIDDYSRYGH

Query:  VYLGHVYLIQNKSDSFEMFKEYKAEVENESGKTIKTFRSDRGGEYMDFRFHDYLIEYGIQSQLSIPNTPQQNGVSESRN
           G++YL+++K ++ E FKEYK EVEN   K IK  RSDRGGEYMD RF DY+IE+GIQSQLS P TPQQNGVSE RN
Subjt:  VYLGHVYLIQNKSDSFEMFKEYKAEVENESGKTIKTFRSDRGGEYMDFRFHDYLIEYGIQSQLSIPNTPQQNGVSESRN

TrEMBL top hitse value%identityAlignment
A0A5A7T2V9 Gag/pol protein5.7e-10267.38Show/hide
Query:  MVSAKALEDLKLFFNDRYILLKNVLYAPQMKRNLIFISCLIEHMYKISFEVNEAFILRKGIHICSAILENNLYKSRPTLANFVLNTEMFRTAETQNKRQK
        ++SA+A+ D KLFF ++++ L+N+   P++KRNL+ +SCLIEHMY I+F +NEAFI + G+HICSA LENNLY  RP  A  VLN EMFRTA TQNKRQ+
Subjt:  MVSAKALEDLKLFFNDRYILLKNVLYAPQMKRNLIFISCLIEHMYKISFEVNEAFILRKGIHICSAILENNLYKSRPTLANFVLNTEMFRTAETQNKRQK

Query:  VS--SNAFLWHLRLGHINLNRIGRLVKSGLLNQLEDNSLPTCDSYLEGKMTKISFTGKGLRAKTPLEIVHSDLWGPMNVKARGGYEYFISFIDDYSRYGH
        +S  +N +LWHLRLGHINL+RIGRLVK GLLN+L+D SLP C+S LEGKMTK  FTGKG RAK PLE++HSDL GPMNVKARG +EYFISFIDDYSRY  
Subjt:  VS--SNAFLWHLRLGHINLNRIGRLVKSGLLNQLEDNSLPTCDSYLEGKMTKISFTGKGLRAKTPLEIVHSDLWGPMNVKARGGYEYFISFIDDYSRYGH

Query:  VYLGHVYLIQNKSDSFEMFKEYKAEVENESGKTIKTFRSDRGGEYMDFRFHDYLIEYGIQSQLSIPNTPQQNGVSESRN
           G++YL+++KS++ E FKEYK EVEN   K IK FRSDRGGEYMD  F DY+IE+GIQSQLS P TPQQNGVSE RN
Subjt:  VYLGHVYLIQNKSDSFEMFKEYKAEVENESGKTIKTFRSDRGGEYMDFRFHDYLIEYGIQSQLSIPNTPQQNGVSESRN

A0A5A7TZD0 Gag/pol protein5.2e-10367.74Show/hide
Query:  MVSAKALEDLKLFFNDRYILLKNVLYAPQMKRNLIFISCLIEHMYKISFEVNEAFILRKGIHICSAILENNLYKSRPTLANFVLNTEMFRTAETQNKRQK
        ++SA+A+ D KLFF ++++ L+N+   P++KRNL+ +SCLIEHMY I+F +NEAFI + G+HICSA LENNLY  RP  A  VLN EMFRTA TQNKRQ+
Subjt:  MVSAKALEDLKLFFNDRYILLKNVLYAPQMKRNLIFISCLIEHMYKISFEVNEAFILRKGIHICSAILENNLYKSRPTLANFVLNTEMFRTAETQNKRQK

Query:  VS--SNAFLWHLRLGHINLNRIGRLVKSGLLNQLEDNSLPTCDSYLEGKMTKISFTGKGLRAKTPLEIVHSDLWGPMNVKARGGYEYFISFIDDYSRYGH
        +S  +N +LWHLRLGHINL+RIGRLVK+GLLN+L+D SLP C+S LEGKMTK  FTGKG RAK PLE++HSDL GPMNVKARGG+EYFISFIDDYSRY  
Subjt:  VS--SNAFLWHLRLGHINLNRIGRLVKSGLLNQLEDNSLPTCDSYLEGKMTKISFTGKGLRAKTPLEIVHSDLWGPMNVKARGGYEYFISFIDDYSRYGH

Query:  VYLGHVYLIQNKSDSFEMFKEYKAEVENESGKTIKTFRSDRGGEYMDFRFHDYLIEYGIQSQLSIPNTPQQNGVSESRN
           G++YL+++KS++ E FKEYK EVEN   K IK  RSDRGGEYMD RF DY+IE+GIQSQLS P TPQQNGVSE RN
Subjt:  VYLGHVYLIQNKSDSFEMFKEYKAEVENESGKTIKTFRSDRGGEYMDFRFHDYLIEYGIQSQLSIPNTPQQNGVSESRN

A0A5A7VJG3 Gag/pol protein4.7e-9665.23Show/hide
Query:  MVSAKALEDLKLFFNDRYILLKNVLYAPQMKRNLIFISCLIEHMYKISFEVNEAFILRKGIHICSAILENNLYKSRPTLANFVLNTEMFRTAETQNKRQK
        ++SA+A+ D+KLFF  +++ L+N+   P++KRNL+F+SCLIEHMY I+F +NEAFI + G     A LE+NLY  RP  A  VLN EMFRTA TQNKRQ+
Subjt:  MVSAKALEDLKLFFNDRYILLKNVLYAPQMKRNLIFISCLIEHMYKISFEVNEAFILRKGIHICSAILENNLYKSRPTLANFVLNTEMFRTAETQNKRQK

Query:  VS--SNAFLWHLRLGHINLNRIGRLVKSGLLNQLEDNSLPTCDSYLEGKMTKISFTGKGLRAKTPLEIVHSDLWGPMNVKARGGYEYFISFIDDYSRYGH
        +S  +N +LWHLRL HINL+RIGRLVK+GLLN+L+D+SLP C+S LEGKMTK  FTGK  RAK PLE++HSDL GPMNVKARGG+EYFISFIDDYSRY  
Subjt:  VS--SNAFLWHLRLGHINLNRIGRLVKSGLLNQLEDNSLPTCDSYLEGKMTKISFTGKGLRAKTPLEIVHSDLWGPMNVKARGGYEYFISFIDDYSRYGH

Query:  VYLGHVYLIQNKSDSFEMFKEYKAEVENESGKTIKTFRSDRGGEYMDFRFHDYLIEYGIQSQLSIPNTPQQNGVSESRN
           G++YL+++K ++ E FKEYK EVEN   K IK  RSDRGGEYMD RF DY+IE+GIQSQLS P TPQQNGVSE RN
Subjt:  VYLGHVYLIQNKSDSFEMFKEYKAEVENESGKTIKTFRSDRGGEYMDFRFHDYLIEYGIQSQLSIPNTPQQNGVSESRN

A0A5D3BNE1 Gag/pol protein2.2e-10167.03Show/hide
Query:  MVSAKALEDLKLFFNDRYILLKNVLYAPQMKRNLIFISCLIEHMYKISFEVNEAFILRKGIHICSAILENNLYKSRPTLANFVLNTEMFRTAETQNKRQK
        ++SA+A+ D KLFF ++++ L+N+   P++KRNL+ +SCLIEHMY ISF +NEAFI + G+HICS  LE+NLY  +P     VLN EMFRTA TQNKRQ+
Subjt:  MVSAKALEDLKLFFNDRYILLKNVLYAPQMKRNLIFISCLIEHMYKISFEVNEAFILRKGIHICSAILENNLYKSRPTLANFVLNTEMFRTAETQNKRQK

Query:  VSS--NAFLWHLRLGHINLNRIGRLVKSGLLNQLEDNSLPTCDSYLEGKMTKISFTGKGLRAKTPLEIVHSDLWGPMNVKARGGYEYFISFIDDYSRYGH
        +SS  N +LWHLRLGHINL+RIGRLVK+GLLN+LED+SLP C+S LEGKMTK  FTGKG RAK PLE++HSDL GPMNVKA GG+EYFISFIDDYS Y  
Subjt:  VSS--NAFLWHLRLGHINLNRIGRLVKSGLLNQLEDNSLPTCDSYLEGKMTKISFTGKGLRAKTPLEIVHSDLWGPMNVKARGGYEYFISFIDDYSRYGH

Query:  VYLGHVYLIQNKSDSFEMFKEYKAEVENESGKTIKTFRSDRGGEYMDFRFHDYLIEYGIQSQLSIPNTPQQNGVSESRN
           G++YLI++KS++ E FKEYK EVEN   K IK  RSDRGGEYMD RF DY+IE+GIQSQLS P TPQQNGVSE RN
Subjt:  VYLGHVYLIQNKSDSFEMFKEYKAEVENESGKTIKTFRSDRGGEYMDFRFHDYLIEYGIQSQLSIPNTPQQNGVSESRN

E2GK51 Gag/pol protein (Fragment)3.6e-12080.14Show/hide
Query:  MVSAKALEDLKLFFNDRYILLKNVLYAPQMKRNLIFISCLIEHMYKISFEVNEAFILRKGIHICSAILENNLYKSRPTLANFVLNTEMFRTAETQNKRQK
        +VSA+A+ DL LFF DRY++LK+VLY P MKRNLI I+C++EH+Y ISFEVNE FIL KGI ICSAI ENNLYK RPT AN VLNTEMFRT ETQNK+QK
Subjt:  MVSAKALEDLKLFFNDRYILLKNVLYAPQMKRNLIFISCLIEHMYKISFEVNEAFILRKGIHICSAILENNLYKSRPTLANFVLNTEMFRTAETQNKRQK

Query:  VSSNAFLWHLRLGHINLNRIGRLVKSGLLNQLEDNSLPTCDSYLEGKMTKISFTGKGLRAKTPLEIVHSDLWGPMNVKARGGYEYFISFIDDYSRYGHVY
        VSSNA+LWHLRLGHINLNRI RLVKSG+LNQLEDNSLP C+S LEGKMTK SFTGKGLRAK PLE+VHSDL GPMNVKARGGYEYFISFIDD+SRYGHVY
Subjt:  VSSNAFLWHLRLGHINLNRIGRLVKSGLLNQLEDNSLPTCDSYLEGKMTKISFTGKGLRAKTPLEIVHSDLWGPMNVKARGGYEYFISFIDDYSRYGHVY

Query:  LGHVYLIQNKSDSFEMFKEYKAEVENESGKTIKTFRSDRGGEYMDFRFHDYLIEYGIQSQLSIPNTPQQNGVSESRN
        L H     +KS+SFE FKEYKAEVENE GKTIKT RSDRGGEYMD +F DYLIE+GIQSQLS P+TPQQNGVSE RN
Subjt:  LGHVYLIQNKSDSFEMFKEYKAEVENESGKTIKTFRSDRGGEYMDFRFHDYLIEYGIQSQLSIPNTPQQNGVSESRN

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.3e-2330.37Show/hide
Query:  NDRYILLKNVLYAPQMKRNLIFISCLIEHMYKISFEVNEAFILRKGIHICSAILENNLYKSRPTLANFVLNTEMFRTAETQNKRQKVSSNAFLWHLRLGH
        ND  I L++VL+  +   NL+ +  L E    I F+ +   I + G+     +++N+   +   + NF          +  +   K  +N  LWH R GH
Subjt:  NDRYILLKNVLYAPQMKRNLIFISCLIEHMYKISFEVNEAFILRKGIHICSAILENNLYKSRPTLANFVLNTEMFRTAETQNKRQKVSSNAFLWHLRLGH

Query:  IN------LNRIGRLVKSGLLNQLEDNSLPTCDSYLEGKMTKISFTGKGLRAKT----PLEIVHSDLWGPMNVKARGGYEYFISFIDDYSRYGHVYLGHV
        I+      + R        LLN LE  S   C+  L GK  ++ F  K L+ KT    PL +VHSD+ GP+         YF+ F+D ++ Y        
Subjt:  IN------LNRIGRLVKSGLLNQLEDNSLPTCDSYLEGKMTKISFTGKGLRAKT----PLEIVHSDLWGPMNVKARGGYEYFISFIDDYSRYGHVYLGHV

Query:  YLIQNKSDSFEMFKEYKAEVENESGKTIKTFRSDRGGEYMDFRFHDYLIEYGIQSQLSIPNTPQQNGVSE
        YLI+ KSD F MF+++ A+ E      +     D G EY+      + ++ GI   L++P+TPQ NGVSE
Subjt:  YLIQNKSDSFEMFKEYKAEVENESGKTIKTFRSDRGGEYMDFRFHDYLIEYGIQSQLSIPNTPQQNGVSE

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-946.4e-3432.69Show/hide
Query:  ILLKNVLYAPQMKRNLIFISCLIEHMYKISFEVNEAFILRKG-IHICSAILENNLYKSRPTLANFVLNTEMFRTAETQNKRQKVSSNAFLWHLRLGHINL
        ++LK+V + P ++ NLI    L    Y+ S+  N+ + L KG + I   +    LY++   +    LN            + ++S +  LWH R+GH++ 
Subjt:  ILLKNVLYAPQMKRNLIFISCLIEHMYKISFEVNEAFILRKG-IHICSAILENNLYKSRPTLANFVLNTEMFRTAETQNKRQKVSSNAFLWHLRLGHINL

Query:  NRIGRLVKSGLLNQLEDNSLPTCDSYLEGKMTKISFTGKGLRAKTPLEIVHSDLWGPMNVKARGGYEYFISFIDDYSRYGHVYLGHVYLIQNKSDSFEMF
          +  L K  L++  +  ++  CD  L GK  ++SF     R    L++V+SD+ GPM +++ GG +YF++FIDD SR        VY+++ K   F++F
Subjt:  NRIGRLVKSGLLNQLEDNSLPTCDSYLEGKMTKISFTGKGLRAKTPLEIVHSDLWGPMNVKARGGYEYFISFIDDYSRYGHVYLGHVYLIQNKSDSFEMF

Query:  KEYKAEVENESGKTIKTFRSDRGGEYMDFRFHDYLIEYGIQSQLSIPNTPQQNGVSESRN
        +++ A VE E+G+ +K  RSD GGEY    F +Y   +GI+ + ++P TPQ NGV+E  N
Subjt:  KEYKAEVENESGKTIKTFRSDRGGEYMDFRFHDYLIEYGIQSQLSIPNTPQQNGVSESRN

P25384 Transposon Ty2-C Gag-Pol polyprotein1.5e-1126.5Show/hide
Query:  TAETQNKRQKVSSNAF-LWHLRLGHINLNRIGRLVKSGLLNQLEDNSLP-------TCDSYLEGKMTKISFTGKGLRAK-----TPLEIVHSDLWGPMNV
        T    NK + V+   + L H  LGH N   I + +K   +  L+++ +         C   L GK TK     KG R K      P + +H+D++GP++ 
Subjt:  TAETQNKRQKVSSNAF-LWHLRLGHINLNRIGRLVKSGLLNQLEDNSLP-------TCDSYLEGKMTKISFTGKGLRAK-----TPLEIVHSDLWGPMNV

Query:  KARGGYEYFISFIDDYSRYGHVYLGHVYLIQNKSDSFEMFKEYKAEVENESGKTIKTFRSDRGGEYMDFRFHDYLIEYGIQSQLSIPNTPQQNGVSESRN
          +    YFISF D+ +R+  VY  H    + +     +F    A ++N+    +   + DRG EY +   H +    GI +  +     + +GV+E  N
Subjt:  KARGGYEYFISFIDDYSRYGHVYLGHVYLIQNKSDSFEMFKEYKAEVENESGKTIKTFRSDRGGEYMDFRFHDYLIEYGIQSQLSIPNTPQQNGVSESRN

Q12491 Transposon Ty2-B Gag-Pol polyprotein1.5e-1126.5Show/hide
Query:  TAETQNKRQKVSSNAF-LWHLRLGHINLNRIGRLVKSGLLNQLEDNSLP-------TCDSYLEGKMTKISFTGKGLRAK-----TPLEIVHSDLWGPMNV
        T    NK + V+   + L H  LGH N   I + +K   +  L+++ +         C   L GK TK     KG R K      P + +H+D++GP++ 
Subjt:  TAETQNKRQKVSSNAF-LWHLRLGHINLNRIGRLVKSGLLNQLEDNSLP-------TCDSYLEGKMTKISFTGKGLRAK-----TPLEIVHSDLWGPMNV

Query:  KARGGYEYFISFIDDYSRYGHVYLGHVYLIQNKSDSFEMFKEYKAEVENESGKTIKTFRSDRGGEYMDFRFHDYLIEYGIQSQLSIPNTPQQNGVSESRN
          +    YFISF D+ +R+  VY  H    + +     +F    A ++N+    +   + DRG EY +   H +    GI +  +     + +GV+E  N
Subjt:  KARGGYEYFISFIDDYSRYGHVYLGHVYLIQNKSDSFEMFKEYKAEVENESGKTIKTFRSDRGGEYMDFRFHDYLIEYGIQSQLSIPNTPQQNGVSESRN

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE17.6e-1926.49Show/hide
Query:  RYILLKNVLYAPQMKRNLIFISCLIE------HMYKISFEVNEAFILRKGIHICSAILENNLYKSRPTLANFVLNTEMFRTAETQNKRQKVSSNAFLWHL
        R + L N+LY P + +NLI +  L          +  SF+V +   L  G+ +     ++ LY+        + +++      + + +   SS    WH 
Subjt:  RYILLKNVLYAPQMKRNLIFISCLIE------HMYKISFEVNEAFILRKGIHICSAILENNLYKSRPTLANFVLNTEMFRTAETQNKRQKVSSNAFLWHL

Query:  RLGHINLNRIGRLVKSGLLNQLE-DNSLPTCDSYLEGKMTKISFTGKGLRAKTPLEIVHSDLWGPMNVKARGGYEYFISFIDDYSRYGHVYLGHVYLIQN
        RLGH   + +  ++ +  L+ L   +   +C   L  K  K+ F+   + +  PLE ++SD+W    + +   Y Y++ F+D ++RY       +Y ++ 
Subjt:  RLGHINLNRIGRLVKSGLLNQLE-DNSLPTCDSYLEGKMTKISFTGKGLRAKTPLEIVHSDLWGPMNVKARGGYEYFISFIDDYSRYGHVYLGHVYLIQN

Query:  KSDSFEMFKEYKAEVENESGKTIKTFRSDRGGEYMDFRFHDYLIEYGIQSQLSIPNTPQQNGVSESRN
        KS   E F  +K  +EN     I TF SD GGE++     +Y  ++GI    S P+TP+ NG+SE ++
Subjt:  KSDSFEMFKEYKAEVENESGKTIKTFRSDRGGEYMDFRFHDYLIEYGIQSQLSIPNTPQQNGVSESRN

Arabidopsis top hitse value%identityAlignment
ATMG00300.1 Gag-Pol-related retrotransposon family protein2.3e-1032.46Show/hide
Query:  CSAILENNLYKSRPTLANFVLNTEMFRTAETQNKRQKVSSNAFLWHLRLGHINLNRIGRLVKSGLLNQLEDNSLPTCDSYLEGKMTKISFTGKGLRAKTP
        C  IL+ N + S   L   V   E        N  +       LWH RL H++   +  LVK G L+  + +SL  C+  + GK  +++F+      K P
Subjt:  CSAILENNLYKSRPTLANFVLNTEMFRTAETQNKRQKVSSNAFLWHLRLGHINLNRIGRLVKSGLLNQLEDNSLPTCDSYLEGKMTKISFTGKGLRAKTP

Query:  LEIVHSDLWGPMNV
        L+ VHSDLWG  +V
Subjt:  LEIVHSDLWGPMNV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTATCAGCTAAAGCATTAGAAGATTTAAAGTTGTTTTTTAATGATAGATATATTCTACTCAAGAATGTCTTGTATGCACCTCAAATGAAGAGGAATTTGATATTTAT
CTCTTGTTTGATAGAACATATGTATAAAATATCTTTTGAAGTTAATGAAGCATTCATTTTAAGAAAAGGTATTCACATTTGTTCTGCTATACTTGAAAACAACTTATATA
AGTCAAGACCAACACTTGCAAATTTTGTCTTAAATACTGAGATGTTTAGAACAGCTGAAACTCAGAATAAAAGACAAAAAGTTTCTTCTAATGCCTTCTTATGGCACTTA
AGACTTGGTCACATAAATCTCAATAGGATTGGAAGATTGGTTAAGAGTGGACTTCTAAACCAGTTAGAAGATAACTCTTTACCTACATGTGATTCCTATCTTGAAGGAAA
GATGACCAAAATATCTTTTACTGGAAAAGGTCTTAGAGCTAAAACCCCTTTAGAGATTGTACATTCGGACCTTTGGGGACCAATGAATGTCAAGGCTCGAGGAGGATACG
AATATTTTATCAGCTTTATTGATGATTATTCTAGGTATGGTCATGTTTACCTTGGTCATGTTTACCTAATTCAGAACAAGTCTGATTCTTTTGAAATGTTCAAAGAATAT
AAGGCTGAAGTTGAAAATGAATCAGGTAAAACAATAAAGACATTTCGATCAGATCGAGGTGGAGAGTATATGGATTTTCGATTTCATGACTATTTGATAGAATATGGAAT
CCAATCACAACTCTCTATACCTAATACGCCTCAGCAGAACGGTGTATCAGAAAGCAGAAATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTATCAGCTAAAGCATTAGAAGATTTAAAGTTGTTTTTTAATGATAGATATATTCTACTCAAGAATGTCTTGTATGCACCTCAAATGAAGAGGAATTTGATATTTAT
CTCTTGTTTGATAGAACATATGTATAAAATATCTTTTGAAGTTAATGAAGCATTCATTTTAAGAAAAGGTATTCACATTTGTTCTGCTATACTTGAAAACAACTTATATA
AGTCAAGACCAACACTTGCAAATTTTGTCTTAAATACTGAGATGTTTAGAACAGCTGAAACTCAGAATAAAAGACAAAAAGTTTCTTCTAATGCCTTCTTATGGCACTTA
AGACTTGGTCACATAAATCTCAATAGGATTGGAAGATTGGTTAAGAGTGGACTTCTAAACCAGTTAGAAGATAACTCTTTACCTACATGTGATTCCTATCTTGAAGGAAA
GATGACCAAAATATCTTTTACTGGAAAAGGTCTTAGAGCTAAAACCCCTTTAGAGATTGTACATTCGGACCTTTGGGGACCAATGAATGTCAAGGCTCGAGGAGGATACG
AATATTTTATCAGCTTTATTGATGATTATTCTAGGTATGGTCATGTTTACCTTGGTCATGTTTACCTAATTCAGAACAAGTCTGATTCTTTTGAAATGTTCAAAGAATAT
AAGGCTGAAGTTGAAAATGAATCAGGTAAAACAATAAAGACATTTCGATCAGATCGAGGTGGAGAGTATATGGATTTTCGATTTCATGACTATTTGATAGAATATGGAAT
CCAATCACAACTCTCTATACCTAATACGCCTCAGCAGAACGGTGTATCAGAAAGCAGAAATTGA
Protein sequenceShow/hide protein sequence
MVSAKALEDLKLFFNDRYILLKNVLYAPQMKRNLIFISCLIEHMYKISFEVNEAFILRKGIHICSAILENNLYKSRPTLANFVLNTEMFRTAETQNKRQKVSSNAFLWHL
RLGHINLNRIGRLVKSGLLNQLEDNSLPTCDSYLEGKMTKISFTGKGLRAKTPLEIVHSDLWGPMNVKARGGYEYFISFIDDYSRYGHVYLGHVYLIQNKSDSFEMFKEY
KAEVENESGKTIKTFRSDRGGEYMDFRFHDYLIEYGIQSQLSIPNTPQQNGVSESRN