; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0013649 (gene) of Snake gourd v1 genome

Gene IDTan0013649
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
Genome locationLG05:33260062..33261467
RNA-Seq ExpressionTan0013649
SyntenyTan0013649
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]1.5e-14862.5Show/hide
Query:  MNKIEYNLTTLLNELQTFESLMKSKGKEKEANV-VTSKKFLRGSSSGTKSGPSFSKNKSIQKKKKKDKGKGQLPH-AKAKATE---KCFHCGAFGTGRGT
        +NKIE+NLTTLLNELQ F++L  SKGKE EANV VT +KF+RGSSS  K GPS        K + K KGKG+ P+ +K K      KCFHC   G  +  
Subjt:  MNKIEYNLTTLLNELQTFESLMKSKGKEKEANV-VTSKKFLRGSSSGTKSGPSFSKNKSIQKKKKKDKGKGQLPH-AKAKATE---KCFHCGAFGTGRGT

Query:  ARNTLQKRKLRRKT-----------------------------------KETSSWQQLADGEITLRVGTGEVVSAKAVGAVKLLFRDRFVLLENVLLVPG
            L ++K  + T                                   +ETSSW++L +GEITL+VGTGEVVSA+AVG + L F+DR+++L++VL VP 
Subjt:  ARNTLQKRKLRRKT-----------------------------------KETSSWQQLADGEITLRVGTGEVVSAKAVGAVKLLFRDRFVLLENVLLVPG

Query:  IKRNLVSISCLLEHMYKVSFNHNEAFISKRGVRICSAKLEKNLYVLRPTEVKTILNTEMFKTADTQNKRQKLSPSTYLWHLRLGHINLNKIERLIKSGLL
        +KRNL+SI+C+LEH+Y +SF  NE FI  +G++ICSA  E NLY LRPT    +LNTEMF+T +TQNK+QK+S + YLWHLRLGHINLN+IERL+KSG+L
Subjt:  IKRNLVSISCLLEHMYKVSFNHNEAFISKRGVRICSAKLEKNLYVLRPTEVKTILNTEMFKTADTQNKRQKLSPSTYLWHLRLGHINLNKIERLIKSGLL

Query:  SQLEENSLPPCESCLEGKMTKRPFSEKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSETLEKFKKYKAEVENTLGKTIKT
        +QLE+NSLPPCESCLEGKMTKR F+ KG RAK PLEL+HSDLCGPMNVKARGGYEYFISFIDD+SRYG++YL+HHKSE+ EKFK+YKAEVEN +GKTIKT
Subjt:  SQLEENSLPPCESCLEGKMTKRPFSEKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSETLEKFKKYKAEVENTLGKTIKT

Query:  LRSDRGGEYMDLRFQDYMIEHGIVSQLSAPGTPQQNGVSERRNRTLLDMVRSLMSY
        LRSDRGGEYMD +FQDY+IE GI SQLSAP TPQQNGVSERRNRTLLDMVRS+MSY
Subjt:  LRSDRGGEYMDLRFQDYMIEHGIVSQLSAPGTPQQNGVSERRNRTLLDMVRSLMSY

KAA0035879.1 gag/pol protein [Cucumis melo var. makuwa]4.5e-14864.57Show/hide
Query:  VMNKIEYNLTTLLNELQTFESLMKSKGKEKEANVVTS-KKFLRGSSSGTKSGPSFSKNKSIQKKKKKDKGKGQLPHAKAKATEK-------CFHCGAFGT
        VMNKI Y LTTLLNELQTFESLMK KG++ EANV TS +KF RGS+SGTKS PS S NK  +KKK     K  L  A AK T+K       CFHC   G 
Subjt:  VMNKIEYNLTTLLNELQTFESLMKSKGKEKEANVVTS-KKFLRGSSSGTKSGPSFSKNKSIQKKKKKDKGKGQLPHAKAKATEK-------CFHCGAFGT

Query:  GRGTARNTLQKRKLRRKTK------ET---------------------------SSWQQLADGEITLRVGTGEVVSAKAVGAVKLLFRDRFVLLENVLLV
         +      L ++K  ++ K      ET                           SSW+QL  GE+T+RVGTG VVSA AVG ++L  +  F+LLENV +V
Subjt:  GRGTARNTLQKRKLRRKTK------ET---------------------------SSWQQLADGEITLRVGTGEVVSAKAVGAVKLLFRDRFVLLENVLLV

Query:  PGIKRNLVSISCLLEHMYKVSFNHNEAFISKRGVRICSAKLEKNLYVLRPTEVKTILNTEMFKTADTQNKRQKLSP--STYLWHLRLGHINLNKIERLIK
        P +KRNL+S+ CLLE  Y ++FN N+ FI K GV ICSAKLE NLYVLR    K +LNTEMFKTA TQNKR K+SP  + +LWHLRLGHINLN+IERL+K
Subjt:  PGIKRNLVSISCLLEHMYKVSFNHNEAFISKRGVRICSAKLEKNLYVLRPTEVKTILNTEMFKTADTQNKRQKLSP--STYLWHLRLGHINLNKIERLIK

Query:  SGLLSQLEENSLPPCESCLEGKMTKRPFSEKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSETLEKFKKYKAEVENTLGK
        +GLLS+LEENSLP CESCLEGKMTKRPF+ KG+RAKEPLEL+HSDLCGPMNVKARGG+EYFI+F DDYSRYGY+YLM HKSE LEKFK+YKAEVEN L K
Subjt:  SGLLSQLEENSLPPCESCLEGKMTKRPFSEKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSETLEKFKKYKAEVENTLGK

Query:  TIKTLRSDRGGEYMDLRFQDYMIEHGIVSQLSAPGTPQQNGVSERRNRTLLDMVRSLMSY
        TIKT RSDRGGEYMDL+FQ+Y++E GIVSQLSAPGTPQQNGVSERRNRTLLDMVRS+MSY
Subjt:  TIKTLRSDRGGEYMDLRFQDYMIEHGIVSQLSAPGTPQQNGVSERRNRTLLDMVRSLMSY

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa]4.5e-14864.57Show/hide
Query:  VMNKIEYNLTTLLNELQTFESLMKSKGKEKEANVVTS-KKFLRGSSSGTKSGPSFSKNKSIQKKKKKDKGKGQLPHAKAKATEK-------CFHCGAFGT
        VMNKI Y LTTLLNELQTFESLMK KG++ EANV TS +KF RGS+SGTKS PS S NK  +KKK     K  L  A AK T+K       CFHC   G 
Subjt:  VMNKIEYNLTTLLNELQTFESLMKSKGKEKEANVVTS-KKFLRGSSSGTKSGPSFSKNKSIQKKKKKDKGKGQLPHAKAKATEK-------CFHCGAFGT

Query:  GRGTARNTLQKRKLRRKTK------ET---------------------------SSWQQLADGEITLRVGTGEVVSAKAVGAVKLLFRDRFVLLENVLLV
         +      L ++K  ++ K      ET                           SSW+QL  GE+T+RVGTG VVSA AVG ++L  +  F+LLENV +V
Subjt:  GRGTARNTLQKRKLRRKTK------ET---------------------------SSWQQLADGEITLRVGTGEVVSAKAVGAVKLLFRDRFVLLENVLLV

Query:  PGIKRNLVSISCLLEHMYKVSFNHNEAFISKRGVRICSAKLEKNLYVLRPTEVKTILNTEMFKTADTQNKRQKLSP--STYLWHLRLGHINLNKIERLIK
        P +KRNL+S+ CLLE  Y ++FN N+ FI K GV ICSAKLE NLYVLR    K +LNTEMFKTA TQNKR K+SP  + +LWHLRLGHINLN+IERL+K
Subjt:  PGIKRNLVSISCLLEHMYKVSFNHNEAFISKRGVRICSAKLEKNLYVLRPTEVKTILNTEMFKTADTQNKRQKLSP--STYLWHLRLGHINLNKIERLIK

Query:  SGLLSQLEENSLPPCESCLEGKMTKRPFSEKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSETLEKFKKYKAEVENTLGK
        +GLLS+LEENSLP CESCLEGKMTKRPF+ KG+RAKEPLEL+HSDLCGPMNVKARGG+EYFI+F DDYSRYGY+YLM HKSE LEKFK+YKAEVEN L K
Subjt:  SGLLSQLEENSLPPCESCLEGKMTKRPFSEKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSETLEKFKKYKAEVENTLGK

Query:  TIKTLRSDRGGEYMDLRFQDYMIEHGIVSQLSAPGTPQQNGVSERRNRTLLDMVRSLMSY
        TIKT RSDRGGEYMDL+FQ+Y++E GIVSQLSAPGTPQQNGVSERRNRTLLDMVRS+MSY
Subjt:  TIKTLRSDRGGEYMDLRFQDYMIEHGIVSQLSAPGTPQQNGVSERRNRTLLDMVRSLMSY

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]4.5e-14864.57Show/hide
Query:  VMNKIEYNLTTLLNELQTFESLMKSKGKEKEANVVTS-KKFLRGSSSGTKSGPSFSKNKSIQKKKKKDKGKGQLPHAKAKATEK-------CFHCGAFGT
        VMNKI Y LTTLLNELQTFESLMK KG++ EANV TS +KF RGS+SGTKS PS S NK  +KKK     K  L  A AK T+K       CFHC   G 
Subjt:  VMNKIEYNLTTLLNELQTFESLMKSKGKEKEANVVTS-KKFLRGSSSGTKSGPSFSKNKSIQKKKKKDKGKGQLPHAKAKATEK-------CFHCGAFGT

Query:  GRGTARNTLQKRKLRRKTK------ET---------------------------SSWQQLADGEITLRVGTGEVVSAKAVGAVKLLFRDRFVLLENVLLV
         +      L ++K  ++ K      ET                           SSW+QL  GE+T+RVGTG VVSA AVG ++L  +  F+LLENV +V
Subjt:  GRGTARNTLQKRKLRRKTK------ET---------------------------SSWQQLADGEITLRVGTGEVVSAKAVGAVKLLFRDRFVLLENVLLV

Query:  PGIKRNLVSISCLLEHMYKVSFNHNEAFISKRGVRICSAKLEKNLYVLRPTEVKTILNTEMFKTADTQNKRQKLSP--STYLWHLRLGHINLNKIERLIK
        P +KRNL+S+ CLLE  Y ++FN N+ FI K GV ICSAKLE NLYVLR    K +LNTEMFKTA TQNKR K+SP  + +LWHLRLGHINLN+IERL+K
Subjt:  PGIKRNLVSISCLLEHMYKVSFNHNEAFISKRGVRICSAKLEKNLYVLRPTEVKTILNTEMFKTADTQNKRQKLSP--STYLWHLRLGHINLNKIERLIK

Query:  SGLLSQLEENSLPPCESCLEGKMTKRPFSEKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSETLEKFKKYKAEVENTLGK
        +GLLS+LEENSLP CESCLEGKMTKRPF+ KG+RAKEPLEL+HSDLCGPMNVKARGG+EYFI+F DDYSRYGY+YLM HKSE LEKFK+YKAEVEN L K
Subjt:  SGLLSQLEENSLPPCESCLEGKMTKRPFSEKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSETLEKFKKYKAEVENTLGK

Query:  TIKTLRSDRGGEYMDLRFQDYMIEHGIVSQLSAPGTPQQNGVSERRNRTLLDMVRSLMSY
        TIKT RSDRGGEYMDL+FQ+Y++E GIVSQLSAPGTPQQNGVSERRNRTLLDMVRS+MSY
Subjt:  TIKTLRSDRGGEYMDLRFQDYMIEHGIVSQLSAPGTPQQNGVSERRNRTLLDMVRSLMSY

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]4.5e-14864.57Show/hide
Query:  VMNKIEYNLTTLLNELQTFESLMKSKGKEKEANVVTS-KKFLRGSSSGTKSGPSFSKNKSIQKKKKKDKGKGQLPHAKAKATEK-------CFHCGAFGT
        VMNKI Y LTTLLNELQTFESLMK KG++ EANV TS +KF RGS+SGTKS PS S NK  +KKK     K  L  A AK T+K       CFHC   G 
Subjt:  VMNKIEYNLTTLLNELQTFESLMKSKGKEKEANVVTS-KKFLRGSSSGTKSGPSFSKNKSIQKKKKKDKGKGQLPHAKAKATEK-------CFHCGAFGT

Query:  GRGTARNTLQKRKLRRKTK------ET---------------------------SSWQQLADGEITLRVGTGEVVSAKAVGAVKLLFRDRFVLLENVLLV
         +      L ++K  ++ K      ET                           SSW+QL  GE+T+RVGTG VVSA AVG ++L  +  F+LLENV +V
Subjt:  GRGTARNTLQKRKLRRKTK------ET---------------------------SSWQQLADGEITLRVGTGEVVSAKAVGAVKLLFRDRFVLLENVLLV

Query:  PGIKRNLVSISCLLEHMYKVSFNHNEAFISKRGVRICSAKLEKNLYVLRPTEVKTILNTEMFKTADTQNKRQKLSP--STYLWHLRLGHINLNKIERLIK
        P +KRNL+S+ CLLE  Y ++FN N+ FI K GV ICSAKLE NLYVLR    K +LNTEMFKTA TQNKR K+SP  + +LWHLRLGHINLN+IERL+K
Subjt:  PGIKRNLVSISCLLEHMYKVSFNHNEAFISKRGVRICSAKLEKNLYVLRPTEVKTILNTEMFKTADTQNKRQKLSP--STYLWHLRLGHINLNKIERLIK

Query:  SGLLSQLEENSLPPCESCLEGKMTKRPFSEKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSETLEKFKKYKAEVENTLGK
        +GLLS+LEENSLP CESCLEGKMTKRPF+ KG+RAKEPLEL+HSDLCGPMNVKARGG+EYFI+F DDYSRYGY+YLM HKSE LEKFK+YKAEVEN L K
Subjt:  SGLLSQLEENSLPPCESCLEGKMTKRPFSEKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSETLEKFKKYKAEVENTLGK

Query:  TIKTLRSDRGGEYMDLRFQDYMIEHGIVSQLSAPGTPQQNGVSERRNRTLLDMVRSLMSY
        TIKT RSDRGGEYMDL+FQ+Y++E GIVSQLSAPGTPQQNGVSERRNRTLLDMVRS+MSY
Subjt:  TIKTLRSDRGGEYMDLRFQDYMIEHGIVSQLSAPGTPQQNGVSERRNRTLLDMVRSLMSY

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein2.2e-14864.57Show/hide
Query:  VMNKIEYNLTTLLNELQTFESLMKSKGKEKEANVVTS-KKFLRGSSSGTKSGPSFSKNKSIQKKKKKDKGKGQLPHAKAKATEK-------CFHCGAFGT
        VMNKI Y LTTLLNELQTFESLMK KG++ EANV TS +KF RGS+SGTKS PS S NK  +KKK     K  L  A AK T+K       CFHC   G 
Subjt:  VMNKIEYNLTTLLNELQTFESLMKSKGKEKEANVVTS-KKFLRGSSSGTKSGPSFSKNKSIQKKKKKDKGKGQLPHAKAKATEK-------CFHCGAFGT

Query:  GRGTARNTLQKRKLRRKTK------ET---------------------------SSWQQLADGEITLRVGTGEVVSAKAVGAVKLLFRDRFVLLENVLLV
         +      L ++K  ++ K      ET                           SSW+QL  GE+T+RVGTG VVSA AVG ++L  +  F+LLENV +V
Subjt:  GRGTARNTLQKRKLRRKTK------ET---------------------------SSWQQLADGEITLRVGTGEVVSAKAVGAVKLLFRDRFVLLENVLLV

Query:  PGIKRNLVSISCLLEHMYKVSFNHNEAFISKRGVRICSAKLEKNLYVLRPTEVKTILNTEMFKTADTQNKRQKLSP--STYLWHLRLGHINLNKIERLIK
        P +KRNL+S+ CLLE  Y ++FN N+ FI K GV ICSAKLE NLYVLR    K +LNTEMFKTA TQNKR K+SP  + +LWHLRLGHINLN+IERL+K
Subjt:  PGIKRNLVSISCLLEHMYKVSFNHNEAFISKRGVRICSAKLEKNLYVLRPTEVKTILNTEMFKTADTQNKRQKLSP--STYLWHLRLGHINLNKIERLIK

Query:  SGLLSQLEENSLPPCESCLEGKMTKRPFSEKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSETLEKFKKYKAEVENTLGK
        +GLLS+LEENSLP CESCLEGKMTKRPF+ KG+RAKEPLEL+HSDLCGPMNVKARGG+EYFI+F DDYSRYGY+YLM HKSE LEKFK+YKAEVEN L K
Subjt:  SGLLSQLEENSLPPCESCLEGKMTKRPFSEKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSETLEKFKKYKAEVENTLGK

Query:  TIKTLRSDRGGEYMDLRFQDYMIEHGIVSQLSAPGTPQQNGVSERRNRTLLDMVRSLMSY
        TIKT RSDRGGEYMDL+FQ+Y++E GIVSQLSAPGTPQQNGVSERRNRTLLDMVRS+MSY
Subjt:  TIKTLRSDRGGEYMDLRFQDYMIEHGIVSQLSAPGTPQQNGVSERRNRTLLDMVRSLMSY

A0A5A7V4M1 Gag/pol protein2.2e-14864.57Show/hide
Query:  VMNKIEYNLTTLLNELQTFESLMKSKGKEKEANVVTS-KKFLRGSSSGTKSGPSFSKNKSIQKKKKKDKGKGQLPHAKAKATEK-------CFHCGAFGT
        VMNKI Y LTTLLNELQTFESLMK KG++ EANV TS +KF RGS+SGTKS PS S NK  +KKK     K  L  A AK T+K       CFHC   G 
Subjt:  VMNKIEYNLTTLLNELQTFESLMKSKGKEKEANVVTS-KKFLRGSSSGTKSGPSFSKNKSIQKKKKKDKGKGQLPHAKAKATEK-------CFHCGAFGT

Query:  GRGTARNTLQKRKLRRKTK------ET---------------------------SSWQQLADGEITLRVGTGEVVSAKAVGAVKLLFRDRFVLLENVLLV
         +      L ++K  ++ K      ET                           SSW+QL  GE+T+RVGTG VVSA AVG ++L  +  F+LLENV +V
Subjt:  GRGTARNTLQKRKLRRKTK------ET---------------------------SSWQQLADGEITLRVGTGEVVSAKAVGAVKLLFRDRFVLLENVLLV

Query:  PGIKRNLVSISCLLEHMYKVSFNHNEAFISKRGVRICSAKLEKNLYVLRPTEVKTILNTEMFKTADTQNKRQKLSP--STYLWHLRLGHINLNKIERLIK
        P +KRNL+S+ CLLE  Y ++FN N+ FI K GV ICSAKLE NLYVLR    K +LNTEMFKTA TQNKR K+SP  + +LWHLRLGHINLN+IERL+K
Subjt:  PGIKRNLVSISCLLEHMYKVSFNHNEAFISKRGVRICSAKLEKNLYVLRPTEVKTILNTEMFKTADTQNKRQKLSP--STYLWHLRLGHINLNKIERLIK

Query:  SGLLSQLEENSLPPCESCLEGKMTKRPFSEKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSETLEKFKKYKAEVENTLGK
        +GLLS+LEENSLP CESCLEGKMTKRPF+ KG+RAKEPLEL+HSDLCGPMNVKARGG+EYFI+F DDYSRYGY+YLM HKSE LEKFK+YKAEVEN L K
Subjt:  SGLLSQLEENSLPPCESCLEGKMTKRPFSEKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSETLEKFKKYKAEVENTLGK

Query:  TIKTLRSDRGGEYMDLRFQDYMIEHGIVSQLSAPGTPQQNGVSERRNRTLLDMVRSLMSY
        TIKT RSDRGGEYMDL+FQ+Y++E GIVSQLSAPGTPQQNGVSERRNRTLLDMVRS+MSY
Subjt:  TIKTLRSDRGGEYMDLRFQDYMIEHGIVSQLSAPGTPQQNGVSERRNRTLLDMVRSLMSY

A0A5D3CPJ6 Gag/pol protein2.2e-14864.57Show/hide
Query:  VMNKIEYNLTTLLNELQTFESLMKSKGKEKEANVVTS-KKFLRGSSSGTKSGPSFSKNKSIQKKKKKDKGKGQLPHAKAKATEK-------CFHCGAFGT
        VMNKI Y LTTLLNELQTFESLMK KG++ EANV TS +KF RGS+SGTKS PS S NK  +KKK     K  L  A AK T+K       CFHC   G 
Subjt:  VMNKIEYNLTTLLNELQTFESLMKSKGKEKEANVVTS-KKFLRGSSSGTKSGPSFSKNKSIQKKKKKDKGKGQLPHAKAKATEK-------CFHCGAFGT

Query:  GRGTARNTLQKRKLRRKTK------ET---------------------------SSWQQLADGEITLRVGTGEVVSAKAVGAVKLLFRDRFVLLENVLLV
         +      L ++K  ++ K      ET                           SSW+QL  GE+T+RVGTG VVSA AVG ++L  +  F+LLENV +V
Subjt:  GRGTARNTLQKRKLRRKTK------ET---------------------------SSWQQLADGEITLRVGTGEVVSAKAVGAVKLLFRDRFVLLENVLLV

Query:  PGIKRNLVSISCLLEHMYKVSFNHNEAFISKRGVRICSAKLEKNLYVLRPTEVKTILNTEMFKTADTQNKRQKLSP--STYLWHLRLGHINLNKIERLIK
        P +KRNL+S+ CLLE  Y ++FN N+ FI K GV ICSAKLE NLYVLR    K +LNTEMFKTA TQNKR K+SP  + +LWHLRLGHINLN+IERL+K
Subjt:  PGIKRNLVSISCLLEHMYKVSFNHNEAFISKRGVRICSAKLEKNLYVLRPTEVKTILNTEMFKTADTQNKRQKLSP--STYLWHLRLGHINLNKIERLIK

Query:  SGLLSQLEENSLPPCESCLEGKMTKRPFSEKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSETLEKFKKYKAEVENTLGK
        +GLLS+LEENSLP CESCLEGKMTKRPF+ KG+RAKEPLEL+HSDLCGPMNVKARGG+EYFI+F DDYSRYGY+YLM HKSE LEKFK+YKAEVEN L K
Subjt:  SGLLSQLEENSLPPCESCLEGKMTKRPFSEKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSETLEKFKKYKAEVENTLGK

Query:  TIKTLRSDRGGEYMDLRFQDYMIEHGIVSQLSAPGTPQQNGVSERRNRTLLDMVRSLMSY
        TIKT RSDRGGEYMDL+FQ+Y++E GIVSQLSAPGTPQQNGVSERRNRTLLDMVRS+MSY
Subjt:  TIKTLRSDRGGEYMDLRFQDYMIEHGIVSQLSAPGTPQQNGVSERRNRTLLDMVRSLMSY

A0A5D3DS88 Gag/pol protein2.2e-14864.57Show/hide
Query:  VMNKIEYNLTTLLNELQTFESLMKSKGKEKEANVVTS-KKFLRGSSSGTKSGPSFSKNKSIQKKKKKDKGKGQLPHAKAKATEK-------CFHCGAFGT
        VMNKI Y LTTLLNELQTFESLMK KG++ EANV TS +KF RGS+SGTKS PS S NK  +KKK     K  L  A AK T+K       CFHC   G 
Subjt:  VMNKIEYNLTTLLNELQTFESLMKSKGKEKEANVVTS-KKFLRGSSSGTKSGPSFSKNKSIQKKKKKDKGKGQLPHAKAKATEK-------CFHCGAFGT

Query:  GRGTARNTLQKRKLRRKTK------ET---------------------------SSWQQLADGEITLRVGTGEVVSAKAVGAVKLLFRDRFVLLENVLLV
         +      L ++K  ++ K      ET                           SSW+QL  GE+T+RVGTG VVSA AVG ++L  +  F+LLENV +V
Subjt:  GRGTARNTLQKRKLRRKTK------ET---------------------------SSWQQLADGEITLRVGTGEVVSAKAVGAVKLLFRDRFVLLENVLLV

Query:  PGIKRNLVSISCLLEHMYKVSFNHNEAFISKRGVRICSAKLEKNLYVLRPTEVKTILNTEMFKTADTQNKRQKLSP--STYLWHLRLGHINLNKIERLIK
        P +KRNL+S+ CLLE  Y ++FN N+ FI K GV ICSAKLE NLYVLR    K +LNTEMFKTA TQNKR K+SP  + +LWHLRLGHINLN+IERL+K
Subjt:  PGIKRNLVSISCLLEHMYKVSFNHNEAFISKRGVRICSAKLEKNLYVLRPTEVKTILNTEMFKTADTQNKRQKLSP--STYLWHLRLGHINLNKIERLIK

Query:  SGLLSQLEENSLPPCESCLEGKMTKRPFSEKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSETLEKFKKYKAEVENTLGK
        +GLLS+LEENSLP CESCLEGKMTKRPF+ KG+RAKEPLEL+HSDLCGPMNVKARGG+EYFI+F DDYSRYGY+YLM HKSE LEKFK+YKAEVEN L K
Subjt:  SGLLSQLEENSLPPCESCLEGKMTKRPFSEKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSETLEKFKKYKAEVENTLGK

Query:  TIKTLRSDRGGEYMDLRFQDYMIEHGIVSQLSAPGTPQQNGVSERRNRTLLDMVRSLMSY
        TIKT RSDRGGEYMDL+FQ+Y++E GIVSQLSAPGTPQQNGVSERRNRTLLDMVRS+MSY
Subjt:  TIKTLRSDRGGEYMDLRFQDYMIEHGIVSQLSAPGTPQQNGVSERRNRTLLDMVRSLMSY

E2GK51 Gag/pol protein (Fragment)7.5e-14962.5Show/hide
Query:  MNKIEYNLTTLLNELQTFESLMKSKGKEKEANV-VTSKKFLRGSSSGTKSGPSFSKNKSIQKKKKKDKGKGQLPH-AKAKATE---KCFHCGAFGTGRGT
        +NKIE+NLTTLLNELQ F++L  SKGKE EANV VT +KF+RGSSS  K GPS        K + K KGKG+ P+ +K K      KCFHC   G  +  
Subjt:  MNKIEYNLTTLLNELQTFESLMKSKGKEKEANV-VTSKKFLRGSSSGTKSGPSFSKNKSIQKKKKKDKGKGQLPH-AKAKATE---KCFHCGAFGTGRGT

Query:  ARNTLQKRKLRRKT-----------------------------------KETSSWQQLADGEITLRVGTGEVVSAKAVGAVKLLFRDRFVLLENVLLVPG
            L ++K  + T                                   +ETSSW++L +GEITL+VGTGEVVSA+AVG + L F+DR+++L++VL VP 
Subjt:  ARNTLQKRKLRRKT-----------------------------------KETSSWQQLADGEITLRVGTGEVVSAKAVGAVKLLFRDRFVLLENVLLVPG

Query:  IKRNLVSISCLLEHMYKVSFNHNEAFISKRGVRICSAKLEKNLYVLRPTEVKTILNTEMFKTADTQNKRQKLSPSTYLWHLRLGHINLNKIERLIKSGLL
        +KRNL+SI+C+LEH+Y +SF  NE FI  +G++ICSA  E NLY LRPT    +LNTEMF+T +TQNK+QK+S + YLWHLRLGHINLN+IERL+KSG+L
Subjt:  IKRNLVSISCLLEHMYKVSFNHNEAFISKRGVRICSAKLEKNLYVLRPTEVKTILNTEMFKTADTQNKRQKLSPSTYLWHLRLGHINLNKIERLIKSGLL

Query:  SQLEENSLPPCESCLEGKMTKRPFSEKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSETLEKFKKYKAEVENTLGKTIKT
        +QLE+NSLPPCESCLEGKMTKR F+ KG RAK PLEL+HSDLCGPMNVKARGGYEYFISFIDD+SRYG++YL+HHKSE+ EKFK+YKAEVEN +GKTIKT
Subjt:  SQLEENSLPPCESCLEGKMTKRPFSEKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSETLEKFKKYKAEVENTLGKTIKT

Query:  LRSDRGGEYMDLRFQDYMIEHGIVSQLSAPGTPQQNGVSERRNRTLLDMVRSLMSY
        LRSDRGGEYMD +FQDY+IE GI SQLSAP TPQQNGVSERRNRTLLDMVRS+MSY
Subjt:  LRSDRGGEYMDLRFQDYMIEHGIVSQLSAPGTPQQNGVSERRNRTLLDMVRSLMSY

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.3e-2830.38Show/hide
Query:  GEVVSAKAVGAVKLLFRDRFVLLENVLLVPGIKRNLVSISCLLEHMYKVSFNHNEAFISKRGVRICSAKLEKNLYVLRPTEVKTILNTEMFKTADTQNKR
        GE + A   G V+L   D  + LE+VL       NL+S+  L E    + F+ +   ISK G+ +      KN  +L    V   +N + +         
Subjt:  GEVVSAKAVGAVKLLFRDRFVLLENVLLVPGIKRNLVSISCLLEHMYKVSFNHNEAFISKRGVRICSAKLEKNLYVLRPTEVKTILNTEMFKTADTQNKR

Query:  QKLSPSTYLWHLRLGHINLNKIERLIKSGLLSQLE-----ENSLPPCESCLEGKMTKRPFSEKGYRA--KEPLELIHSDLCGPMNVKARGGYEYFISFID
         K   +  LWH R GHI+  K+  + +  + S        E S   CE CL GK  + PF +   +   K PL ++HSD+CGP+         YF+ F+D
Subjt:  QKLSPSTYLWHLRLGHINLNKIERLIKSGLLSQLE-----ENSLPPCESCLEGKMTKRPFSEKGYRA--KEPLELIHSDLCGPMNVKARGGYEYFISFID

Query:  DYSRYGYLYLMHHKSETLEKFKKYKAEVENTLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIVSQLSAPGTPQQNGVSERRNRTLLDMVRSLMS
         ++ Y   YL+ +KS+    F+ + A+ E      +  L  D G EY+    + + ++ GI   L+ P TPQ NGVSER  RT+ +  R+++S
Subjt:  DYSRYGYLYLMHHKSETLEKFKKYKAEVENTLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIVSQLSAPGTPQQNGVSERRNRTLLDMVRSLMS

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.2e-4034.34Show/hide
Query:  VLLENVLLVPGIKRNLVSISCLLEHMYKVSFNHNEAFISKRGVRICSAKLEKNLYVLRPTEVKTILNTEMFKTADTQNKRQKLSPSTYLWHLRLGHINLN
        ++L++V  VP ++ NL+S   L    Y+  F + +  ++K  + I        LY       +  LN    +             S  LWH R+GH++  
Subjt:  VLLENVLLVPGIKRNLVSISCLLEHMYKVSFNHNEAFISKRGVRICSAKLEKNLYVLRPTEVKTILNTEMFKTADTQNKRQKLSPSTYLWHLRLGHINLN

Query:  KIERLIKSGLLSQLEENSLPPCESCLEGKMTKRPFSEKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSETLEKFKKYKAE
         ++ L K  L+S  +  ++ PC+ CL GK  +  F     R    L+L++SD+CGPM +++ GG +YF++FIDD SR  ++Y++  K +  + F+K+ A 
Subjt:  KIERLIKSGLLSQLEENSLPPCESCLEGKMTKRPFSEKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSETLEKFKKYKAE

Query:  VENTLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIVSQLSAPGTPQQNGVSERRNRTLLDMVRSLM
        VE   G+ +K LRSD GGEY    F++Y   HGI  + + PGTPQ NGV+ER NRT+++ VRS++
Subjt:  VENTLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIVSQLSAPGTPQQNGVSERRNRTLLDMVRSLM

Q07791 Transposon Ty2-DR3 Gag-Pol polyprotein2.2e-2026.69Show/hide
Query:  DGEITLRVGTGEVVSAKAVGAVKLLFRDRFVLLENVLLVPGIKRNLVSISCLLEHMYKVSFNHNEAFISKRGVRICSAKLEKNLYVLRPTEVKTILNTEM
        + EI +     + +   A+G +   F++        L  P I  +L+S+S L        F  N    S  G  +       + Y L     K ++ + +
Subjt:  DGEITLRVGTGEVVSAKAVGAVKLLFRDRFVLLENVLLVPGIKRNLVSISCLLEHMYKVSFNHNEAFISKRGVRICSAKLEKNLYVLRPTEVKTILNTEM

Query:  FK-TADTQNKRQKLSPSTY-LWHLRLGHINLNKIERLIKSGLLSQLEENSLP-------PCESCLEGKMTKRPFSEKGYRAK-----EPLELIHSDLCGP
         K T +  NK + ++   Y L H  LGH N   I++ +K   ++ L+E+ +         C  CL GK TK     KG R K     EP + +H+D+ GP
Subjt:  FK-TADTQNKRQKLSPSTY-LWHLRLGHINLNKIERLIKSGLLSQLEENSLP-------PCESCLEGKMTKRPFSEKGYRAK-----EPLELIHSDLCGP

Query:  MNVKARGGYEYFISFIDDYSRYGYLYLMHHKSE--TLEKFKKYKAEVENTLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIVSQLSAPGTPQQNGVSERRN
        ++   +    YFISF D+ +R+ ++Y +H + E   L  F    A ++N     +  ++ DRG EY +     +    GI +  +     + +GV+ER N
Subjt:  MNVKARGGYEYFISFIDDYSRYGYLYLMHHKSE--TLEKFKKYKAEVENTLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIVSQLSAPGTPQQNGVSERRN

Query:  RTLLDMVRSLM
        RTLL+  R+L+
Subjt:  RTLLDMVRSLM

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE14.1e-2729.62Show/hide
Query:  SWQQLADGEITLRVGTGEVVSAKAVGAVKLLFRDRFVLLENVLLVPGIKRNLVSISCLLE------HMYKVSFNHNEAFISKRGVRICSAKLEKNLYVLR
        S  Q   G   + V  G  +     G+  L  + R + L N+L VP I +NL+S+  L          +  SF   +      GV +   K +  LY   
Subjt:  SWQQLADGEITLRVGTGEVVSAKAVGAVKLLFRDRFVLLENVLLVPGIKRNLVSISCLLE------HMYKVSFNHNEAFISKRGVRICSAKLEKNLYVLR

Query:  PTEVKTILNTEMFKTADTQNKRQKLSPSTYL----WHLRLGHINLNKIERLIKSGLLSQLE-ENSLPPCESCLEGKMTKRPFSEKGYRAKEPLELIHSDL
                    +  A +Q      SPS+      WH RLGH   + +  +I +  LS L   +    C  CL  K  K PFS+    +  PLE I+SD+
Subjt:  PTEVKTILNTEMFKTADTQNKRQKLSPSTYL----WHLRLGHINLNKIERLIKSGLLSQLE-ENSLPPCESCLEGKMTKRPFSEKGYRAKEPLELIHSDL

Query:  CGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSETLEKFKKYKAEVENTLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIVSQLSAPGTPQQNGVSERR
             + +   Y Y++ F+D ++RY +LY +  KS+  E F  +K  +EN     I T  SD GGE++ L   +Y  +HGI    S P TP+ NG+SER+
Subjt:  CGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSETLEKFKKYKAEVENTLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIVSQLSAPGTPQQNGVSERR

Query:  NRTLLDMVRSLMSY
        +R +++   +L+S+
Subjt:  NRTLLDMVRSLMSY

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE26.3e-2830Show/hide
Query:  SWQQLADGEITLRVGTGEVVSAKAVGAVKLLFRDRFVLLENVLLVPGIKRNLVSISCLLE------HMYKVSFNHNEAFISKRGVRICSAKLEKNLYVLR
        S+ Q   G   + +  G  +     G+  L    R + L  VL VP I +NL+S+  L          +  SF   +      GV +   K +  LY   
Subjt:  SWQQLADGEITLRVGTGEVVSAKAVGAVKLLFRDRFVLLENVLLVPGIKRNLVSISCLLE------HMYKVSFNHNEAFISKRGVRICSAKLEKNLYVLR

Query:  PTEVKTILNTEMFKTADTQNKRQKLSPSTYLWHLRLGHINLNKIERLIKSGLLSQLE-ENSLPPCESCLEGKMTKRPFSEKGYRAKEPLELIHSDLCGPM
           + +     MF +  ++            WH RLGH +L  +  +I +  L  L   + L  C  C   K  K PFS     + +PLE I+SD+    
Subjt:  PTEVKTILNTEMFKTADTQNKRQKLSPSTYLWHLRLGHINLNKIERLIKSGLLSQLE-ENSLPPCESCLEGKMTKRPFSEKGYRAKEPLELIHSDLCGPM

Query:  NVKARGGYEYFISFIDDYSRYGYLYLMHHKSETLEKFKKYKAEVENTLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIVSQLSAPGTPQQNGVSERRNRTL
         + +   Y Y++ F+D ++RY +LY +  KS+  + F  +K+ VEN     I TL SD GGE++ LR  DY+ +HGI    S P TP+ NG+SER++R +
Subjt:  NVKARGGYEYFISFIDDYSRYGYLYLMHHKSETLEKFKKYKAEVENTLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIVSQLSAPGTPQQNGVSERRNRTL

Query:  LDMVRSLMSY
        ++M  +L+S+
Subjt:  LDMVRSLMSY

Arabidopsis top hitse value%identityAlignment
ATMG00300.1 Gag-Pol-related retrotransposon family protein4.7e-1037.8Show/hide
Query:  NKRQKLSPSTYLWHLRLGHINLNKIERLIKSGLLSQLEENSLPPCESCLEGKMTKRPFSEKGYRAKEPLELIHSDLCGPMNV
        N  +     T LWH RL H++   +E L+K G L   + +SL  CE C+ GK  +  FS   +  K PL+ +HSDL G  +V
Subjt:  NKRQKLSPSTYLWHLRLGHINLNKIERLIKSGLLSQLEENSLPPCESCLEGKMTKRPFSEKGYRAKEPLELIHSDLCGPMNV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAATCTCTTCTAAGAGTTTCTGCCATTCCGCAGAAATGCACGGTAATGAATAAAATAGAGTATAACCTGACTACTCTCCTCAACGAGCTACAGACTTTTGAGTCCCT
CATGAAATCAAAAGGAAAAGAGAAGGAGGCAAATGTTGTCACTTCAAAGAAGTTCCTAAGAGGATCATCCTCTGGGACCAAGTCTGGTCCTTCTTTTTCTAAGAATAAGA
GTATTCAGAAGAAGAAGAAGAAGGACAAAGGGAAGGGACAGCTCCCACACGCAAAGGCCAAAGCCACGGAAAAATGTTTCCACTGTGGTGCATTTGGCACTGGAAGAGGA
ACTGCCCGAAATACCTTGCAGAAAAGAAAGCTGAGAAGGAAAACCAAGGAAACTAGTTCCTGGCAGCAGCTTGCAGATGGGGAGATAACTCTCAGGGTTGGAACGGGAGA
GGTTGTCTCAGCCAAAGCGGTGGGAGCAGTGAAGCTGTTGTTTAGAGATAGATTCGTTTTATTAGAAAATGTACTTTTGGTTCCTGGAATCAAAAGAAATCTTGTATCTA
TCTCTTGTTTGCTTGAACATATGTATAAAGTTTCTTTTAATCATAATGAAGCGTTCATTAGCAAAAGAGGTGTACGAATATGTTCTGCTAAACTTGAAAAAAACTTATAC
GTGTTAAGACCAACTGAAGTAAAAACTATTTTGAACACTGAAATGTTTAAAACAGCTGATACTCAAAATAAAAGACAGAAACTTTCTCCTAGTACCTATCTTTGGCACTT
GAGACTAGGCCACATTAATCTCAATAAGATTGAGAGATTGATCAAGAGTGGTCTCCTAAGTCAGTTAGAGGAAAACTCTTTACCGCCATGTGAGTCCTGTCTCGAAGGAA
AAATGACTAAAAGACCTTTTTCTGAAAAAGGTTATAGAGCCAAAGAGCCCTTGGAACTCATCCATTCTGATCTATGTGGTCCTATGAATGTCAAGGCACGAGGAGGGTAT
GAATACTTCATCAGTTTTATTGATGATTATTCTAGGTATGGCTATCTATACCTAATGCATCATAAGTCCGAAACTCTTGAAAAGTTCAAGAAGTATAAGGCAGAGGTTGA
GAACACATTAGGTAAAACAATTAAAACACTTCGATCAGATCGAGGTGGAGAGTATATGGATTTGAGATTCCAAGACTATATGATTGAACATGGAATTGTATCCCAACTCT
CAGCGCCTGGTACACCTCAGCAGAATGGTGTATCTGAGAGGAGAAATAGAACCTTGTTAGACATGGTTCGATCTTTGATGAGCTATCTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAATCTCTTCTAAGAGTTTCTGCCATTCCGCAGAAATGCACGGTAATGAATAAAATAGAGTATAACCTGACTACTCTCCTCAACGAGCTACAGACTTTTGAGTCCCT
CATGAAATCAAAAGGAAAAGAGAAGGAGGCAAATGTTGTCACTTCAAAGAAGTTCCTAAGAGGATCATCCTCTGGGACCAAGTCTGGTCCTTCTTTTTCTAAGAATAAGA
GTATTCAGAAGAAGAAGAAGAAGGACAAAGGGAAGGGACAGCTCCCACACGCAAAGGCCAAAGCCACGGAAAAATGTTTCCACTGTGGTGCATTTGGCACTGGAAGAGGA
ACTGCCCGAAATACCTTGCAGAAAAGAAAGCTGAGAAGGAAAACCAAGGAAACTAGTTCCTGGCAGCAGCTTGCAGATGGGGAGATAACTCTCAGGGTTGGAACGGGAGA
GGTTGTCTCAGCCAAAGCGGTGGGAGCAGTGAAGCTGTTGTTTAGAGATAGATTCGTTTTATTAGAAAATGTACTTTTGGTTCCTGGAATCAAAAGAAATCTTGTATCTA
TCTCTTGTTTGCTTGAACATATGTATAAAGTTTCTTTTAATCATAATGAAGCGTTCATTAGCAAAAGAGGTGTACGAATATGTTCTGCTAAACTTGAAAAAAACTTATAC
GTGTTAAGACCAACTGAAGTAAAAACTATTTTGAACACTGAAATGTTTAAAACAGCTGATACTCAAAATAAAAGACAGAAACTTTCTCCTAGTACCTATCTTTGGCACTT
GAGACTAGGCCACATTAATCTCAATAAGATTGAGAGATTGATCAAGAGTGGTCTCCTAAGTCAGTTAGAGGAAAACTCTTTACCGCCATGTGAGTCCTGTCTCGAAGGAA
AAATGACTAAAAGACCTTTTTCTGAAAAAGGTTATAGAGCCAAAGAGCCCTTGGAACTCATCCATTCTGATCTATGTGGTCCTATGAATGTCAAGGCACGAGGAGGGTAT
GAATACTTCATCAGTTTTATTGATGATTATTCTAGGTATGGCTATCTATACCTAATGCATCATAAGTCCGAAACTCTTGAAAAGTTCAAGAAGTATAAGGCAGAGGTTGA
GAACACATTAGGTAAAACAATTAAAACACTTCGATCAGATCGAGGTGGAGAGTATATGGATTTGAGATTCCAAGACTATATGATTGAACATGGAATTGTATCCCAACTCT
CAGCGCCTGGTACACCTCAGCAGAATGGTGTATCTGAGAGGAGAAATAGAACCTTGTTAGACATGGTTCGATCTTTGATGAGCTATCTTTGA
Protein sequenceShow/hide protein sequence
MESLLRVSAIPQKCTVMNKIEYNLTTLLNELQTFESLMKSKGKEKEANVVTSKKFLRGSSSGTKSGPSFSKNKSIQKKKKKDKGKGQLPHAKAKATEKCFHCGAFGTGRG
TARNTLQKRKLRRKTKETSSWQQLADGEITLRVGTGEVVSAKAVGAVKLLFRDRFVLLENVLLVPGIKRNLVSISCLLEHMYKVSFNHNEAFISKRGVRICSAKLEKNLY
VLRPTEVKTILNTEMFKTADTQNKRQKLSPSTYLWHLRLGHINLNKIERLIKSGLLSQLEENSLPPCESCLEGKMTKRPFSEKGYRAKEPLELIHSDLCGPMNVKARGGY
EYFISFIDDYSRYGYLYLMHHKSETLEKFKKYKAEVENTLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIVSQLSAPGTPQQNGVSERRNRTLLDMVRSLMSYL