CuGenDBv2

Tan0020156 (gene) of Snake gourd v1 genome

Gene ID: Tan0020156
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Gag/pol protein
Genome location: LG03:58365109..58367769
RNA-Seq Expression: Tan0020156
Synteny: Tan0020156
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR001584 - Integrase, catalytic core
IPR001878 - Zinc finger, CCHC-type
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR036875 - Zinc finger, CCHC-type superfamily
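
The genome location field packs the linkage group and the start and end coordinates into a single string. A minimal sketch of how such a string could be split, assuming the `seqid:start..end` layout shown above and that both coordinates are inclusive (the function name `parse_location` is hypothetical and not part of CuGenDB):

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


seqid, start, end = parse_location("LG03:58365109..58367769")
# Span length, assuming both coordinates are inclusive.
span = end - start + 1
print(seqid, start, end, span)  # LG03 58365109 58367769 2661
```

Under the inclusive-coordinate assumption, the gene model spans 2,661 bp on LG03.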


Homology
GenBank top hits (e value, %identity, alignment)
KAA0035879.1 gag/pol protein [Cucumis melo var. makuwa]; e value: 6.8e-181; %identity: 60.33
Query:  ECPRIPARNVPLS-HWGVGHWIKVNDKAKIYILASVSEVLAKKHELMVSAREIMSSLQEMFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNV
        ECP++PA N   +       W K N+KA+ YILAS+SEVLAKKHE M++AREIM SLQEMFGQ S QI+H++LKY+YN+RM EG+SVREHVL++MVHFNV
Subjt:  ECPRIPARNVPLS-HWGVGHWIKVNDKAKIYILASVSEVLAKKHELMVSAREIMSSLQEMFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNV

Query:  AGMNRAVIDEQSQVSFILESLSKSFLQFRSNAVMNKIEYNLTTLLNELQTFQSLMKNKGQADGEANLFAHSRRFQKGSSSGTKPCNSSSGLKKTQKKKIG
        A MN AVIDE SQVSFILESL +SFLQFRSNAVMNKI Y LTTLLNELQTF+SLMK KGQ  GEAN+   +R+F +GS+SGTK   SSSG KK +KKK G
Subjt:  AGMNRAVIDEQSQVSFILESLSKSFLQFRSNAVMNKIEYNLTTLLNELQTFQSLMKNKGQADGEANLFAHSRRFQKGSSSGTKPCNSSSGLKKTQKKKIG

Query:  RKGKAP-ATYKGKGKTKLADKGKCFNCNMDGHWKRNCPKYLVELKE-KKGKLDLLFLETCLVENDDLTWILDSGATNHVCSSFQETSSFKELEEGEMTLR
        +  KA  A  K   KTK A KG CF+CN +GHWKRNCPKYL E K+ K+GK DLL LETCLVENDD  WI+DSGATNHVCSSFQ  SS+++LE GEMT+R
Subjt:  RKGKAP-ATYKGKGKTKLADKGKCFNCNMDGHWKRNCPKYLVELKE-KKGKLDLLFLETCLVENDDLTWILDSGATNHVCSSFQETSSFKELEEGEMTLR

Query:  VGTGDVVSARAVGDAKLFFGDRFLSLINLYIVPKLKRNLISISCLIELGYCISFSINEAFILKEGVNICSAKLKNGL-----------------------
        VGTG VVSA AVG  +L     FL L N+Y+VP LKRNLIS+ CL+E  Y ++F++N+ FI K GV ICSAKL+N L                       
Subjt:  VGTGDVVSARAVGDAKLFFGDRFLSLINLYIVPKLKRNLISISCLIELGYCISFSINEAFILKEGVNICSAKLKNGL-----------------------

Query:  ------------------LGHINLNRIERLSKNRLLNKLEDDSLPPCESCLE---------------------------------------------DDY
                          LGHINLNRIERL KN LL++LE++SLP CESCLE                                             DDY
Subjt:  ------------------LGHINLNRIERLSKNRLLNKLEDDSLPPCESCLE---------------------------------------------DDY

Query:  SRYGYLYLMHHKSEALEKFKEYKAEVENALGKSIKTLRSDRGGEYMDLRFQDYMIEHEIKFQLSAPNTPQQNGVSERRNRTLLDIVRSMMSYAQLSASFW
        SRYGY+YLM HKSEALEKFKEYKAEVENAL K+IKT RSDRGGEYMDL+FQ+Y++E  I  QLSAP TPQQNGVSERRNRTLLD+VRSMMSYA L  SFW
Subjt:  SRYGYLYLMHHKSEALEKFKEYKAEVENALGKSIKTLRSDRGGEYMDLRFQDYMIEHEIKFQLSAPNTPQQNGVSERRNRTLLDIVRSMMSYAQLSASFW

Query:  GYAMR
        GYA++
Subjt:  GYAMR

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa]; e value: 1.2e-180; %identity: 59.77
Query:  ECPRIPARNVPLS-HWGVGHWIKVNDKAKIYILASVSEVLAKKHELMVSAREIMSSLQEMFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNV
        ECP++PA N   +       W K N+KA+ YILAS+SEVLAKKHE M++AREIM SLQEMFGQ S QI+H++LKY+YN+RM EG+SVREHVL++MVHFNV
Subjt:  ECPRIPARNVPLS-HWGVGHWIKVNDKAKIYILASVSEVLAKKHELMVSAREIMSSLQEMFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNV

Query:  AGMNRAVIDEQSQVSFILESLSKSFLQFRSNAVMNKIEYNLTTLLNELQTFQSLMKNKGQADGEANLFAHSRRFQKGSSSGTKPCNSSSGLKKTQKKKIG
        A MN AVIDE SQVSFILESL +SFLQFRSNAVMNKI Y LTTLLNELQTF+SLMK KGQ  GEAN+   +R+F +GS+SGTK   SSSG KK +KKK G
Subjt:  AGMNRAVIDEQSQVSFILESLSKSFLQFRSNAVMNKIEYNLTTLLNELQTFQSLMKNKGQADGEANLFAHSRRFQKGSSSGTKPCNSSSGLKKTQKKKIG

Query:  RKGKAPATYKGKGKTKLADKGKCFNCNMDGHWKRNCPKYLVELKE-KKGKLDLLFLETCLVENDDLTWILDSGATNHVCSSFQETSSFKELEEGEMTLRV
        +  KA        K   A KG CF+CN +GHWKRNCPKYL E K+ K+GK DLL LETCLVENDD  WI+DSGATNHVCSSFQ  SS+++LE GEMT+RV
Subjt:  RKGKAPATYKGKGKTKLADKGKCFNCNMDGHWKRNCPKYLVELKE-KKGKLDLLFLETCLVENDDLTWILDSGATNHVCSSFQETSSFKELEEGEMTLRV

Query:  GTGDVVSARAVGDAKLFFGDRFLSLINLYIVPKLKRNLISISCLIELGYCISFSINEAFILKEGVNICSAKLKNGL------------------------
        GTG VVSA AVG  +L     FL L N+Y+VP LKRNLIS+ CL+E  Y ++F++N+ FI K GV ICSAKL+N L                        
Subjt:  GTGDVVSARAVGDAKLFFGDRFLSLINLYIVPKLKRNLISISCLIELGYCISFSINEAFILKEGVNICSAKLKNGL------------------------

Query:  -----------------LGHINLNRIERLSKNRLLNKLEDDSLPPCESCLE---------------------------------------------DDYS
                         LGHINLNRIERL KN LL++LE++SLP CESCLE                                             DDYS
Subjt:  -----------------LGHINLNRIERLSKNRLLNKLEDDSLPPCESCLE---------------------------------------------DDYS

Query:  RYGYLYLMHHKSEALEKFKEYKAEVENALGKSIKTLRSDRGGEYMDLRFQDYMIEHEIKFQLSAPNTPQQNGVSERRNRTLLDIVRSMMSYAQLSASFWG
        RYGY+YLM HKSEALEKFKEYKAEVENAL K+IKT RSDRGGEYMDL+FQ+Y++E  I  QLSAP TPQQNGVSERRNRTLLD+VRSMMSYA L  SFWG
Subjt:  RYGYLYLMHHKSEALEKFKEYKAEVENALGKSIKTLRSDRGGEYMDLRFQDYMIEHEIKFQLSAPNTPQQNGVSERRNRTLLDIVRSMMSYAQLSASFWG

Query:  YAMR
        YA++
Subjt:  YAMR

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]; e value: 1.2e-180; %identity: 59.77
Query:  ECPRIPARNVPLS-HWGVGHWIKVNDKAKIYILASVSEVLAKKHELMVSAREIMSSLQEMFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNV
        ECP++PA N   +       W K N+KA+ YILAS+SEVLAKKHE M++AREIM SLQEMFGQ S QI+H++LKY+YN+RM EG+SVREHVL++MVHFNV
Subjt:  ECPRIPARNVPLS-HWGVGHWIKVNDKAKIYILASVSEVLAKKHELMVSAREIMSSLQEMFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNV

Query:  AGMNRAVIDEQSQVSFILESLSKSFLQFRSNAVMNKIEYNLTTLLNELQTFQSLMKNKGQADGEANLFAHSRRFQKGSSSGTKPCNSSSGLKKTQKKKIG
        A MN AVIDE SQVSFILESL +SFLQFRSNAVMNKI Y LTTLLNELQTF+SLMK KGQ  GEAN+   +R+F +GS+SGTK   SSSG KK +KKK G
Subjt:  AGMNRAVIDEQSQVSFILESLSKSFLQFRSNAVMNKIEYNLTTLLNELQTFQSLMKNKGQADGEANLFAHSRRFQKGSSSGTKPCNSSSGLKKTQKKKIG

Query:  RKGKAPATYKGKGKTKLADKGKCFNCNMDGHWKRNCPKYLVELKE-KKGKLDLLFLETCLVENDDLTWILDSGATNHVCSSFQETSSFKELEEGEMTLRV
        +  KA        K   A KG CF+CN +GHWKRNCPKYL E K+ K+GK DLL LETCLVENDD  WI+DSGATNHVCSSFQ  SS+++LE GEMT+RV
Subjt:  RKGKAPATYKGKGKTKLADKGKCFNCNMDGHWKRNCPKYLVELKE-KKGKLDLLFLETCLVENDDLTWILDSGATNHVCSSFQETSSFKELEEGEMTLRV

Query:  GTGDVVSARAVGDAKLFFGDRFLSLINLYIVPKLKRNLISISCLIELGYCISFSINEAFILKEGVNICSAKLKNGL------------------------
        GTG VVSA AVG  +L     FL L N+Y+VP LKRNLIS+ CL+E  Y ++F++N+ FI K GV ICSAKL+N L                        
Subjt:  GTGDVVSARAVGDAKLFFGDRFLSLINLYIVPKLKRNLISISCLIELGYCISFSINEAFILKEGVNICSAKLKNGL------------------------

Query:  -----------------LGHINLNRIERLSKNRLLNKLEDDSLPPCESCLE---------------------------------------------DDYS
                         LGHINLNRIERL KN LL++LE++SLP CESCLE                                             DDYS
Subjt:  -----------------LGHINLNRIERLSKNRLLNKLEDDSLPPCESCLE---------------------------------------------DDYS

Query:  RYGYLYLMHHKSEALEKFKEYKAEVENALGKSIKTLRSDRGGEYMDLRFQDYMIEHEIKFQLSAPNTPQQNGVSERRNRTLLDIVRSMMSYAQLSASFWG
        RYGY+YLM HKSEALEKFKEYKAEVENAL K+IKT RSDRGGEYMDL+FQ+Y++E  I  QLSAP TPQQNGVSERRNRTLLD+VRSMMSYA L  SFWG
Subjt:  RYGYLYLMHHKSEALEKFKEYKAEVENALGKSIKTLRSDRGGEYMDLRFQDYMIEHEIKFQLSAPNTPQQNGVSERRNRTLLDIVRSMMSYAQLSASFWG

Query:  YAMR
        YA++
Subjt:  YAMR

KAA0062993.1 gag/pol protein [Cucumis melo var. makuwa]; e value: 6.8e-181; %identity: 59.67
Query:  RECPRIPARNVPLS-HWGVGHWIKVNDKAKIYILASVSEVLAKKHELMVSAREIMSSLQEMFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFN
        +ECP++PA N   +       W K N+KA+ YILAS+SEVLAKKHE M++AREIM SLQEMFGQ S QI+H++LKY+YN+RM EG+SVREHVL++MVHFN
Subjt:  RECPRIPARNVPLS-HWGVGHWIKVNDKAKIYILASVSEVLAKKHELMVSAREIMSSLQEMFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFN

Query:  VAGMNRAVIDEQSQVSFILESLSKSFLQFRSNAVMNKIEYNLTTLLNELQTFQSLMKNKGQADGEANLFAHSRRFQKGSSSGTKPCNSSSGLKKTQKKKI
        VA MN AVIDE SQVSFILESL +SFLQFRSNAVMNKI Y LTTLLNELQTF+SLMK KGQ  GEAN+   +R+F +GS+SGTK   SSSG KK +KKK 
Subjt:  VAGMNRAVIDEQSQVSFILESLSKSFLQFRSNAVMNKIEYNLTTLLNELQTFQSLMKNKGQADGEANLFAHSRRFQKGSSSGTKPCNSSSGLKKTQKKKI

Query:  GRKGKAPATYKGKGKTKLADKGKCFNCNMDGHWKRNCPKYLVELKE-KKGKLDLLFLETCLVENDDLTWILDSGATNHVCSSFQETSSFKELEEGEMTLR
        G+  KA        K   A KG CF+CN +GHWKRNCPKYL E K+ K+GK DLL LETCLVENDD  WI+DSGATNHVCSSFQ  SS+++LE GEMT+R
Subjt:  GRKGKAPATYKGKGKTKLADKGKCFNCNMDGHWKRNCPKYLVELKE-KKGKLDLLFLETCLVENDDLTWILDSGATNHVCSSFQETSSFKELEEGEMTLR

Query:  VGTGDVVSARAVGDAKLFFGDRFLSLINLYIVPKLKRNLISISCLIELGYCISFSINEAFILKEGVNICSAKLKNGL-----------------------
        VGTG VVSA AVG  +L     FL L N+Y+VP LKRNLIS+ CL+E  Y ++F++N+ FI K GV ICSAKL+N L                       
Subjt:  VGTGDVVSARAVGDAKLFFGDRFLSLINLYIVPKLKRNLISISCLIELGYCISFSINEAFILKEGVNICSAKLKNGL-----------------------

Query:  ------------------LGHINLNRIERLSKNRLLNKLEDDSLPPCESCLE---------------------------------------------DDY
                          LGHINLNRIERL KN LL++LE++SLP CESCLE                                             DDY
Subjt:  ------------------LGHINLNRIERLSKNRLLNKLEDDSLPPCESCLE---------------------------------------------DDY

Query:  SRYGYLYLMHHKSEALEKFKEYKAEVENALGKSIKTLRSDRGGEYMDLRFQDYMIEHEIKFQLSAPNTPQQNGVSERRNRTLLDIVRSMMSYAQLSASFW
        SRYGY+YLM HKSEALEKFKEYKAEVENAL K+IKT RSDRGGEYMDL+FQ+Y++E  I  QLSAP TPQQNGVSERRNRTLLD+VRSMMSYA L  SFW
Subjt:  SRYGYLYLMHHKSEALEKFKEYKAEVENALGKSIKTLRSDRGGEYMDLRFQDYMIEHEIKFQLSAPNTPQQNGVSERRNRTLLDIVRSMMSYAQLSASFW

Query:  GYAMR
        GYA++
Subjt:  GYAMR

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]; e value: 6.8e-181; %identity: 60.33
Query:  ECPRIPARNVPLS-HWGVGHWIKVNDKAKIYILASVSEVLAKKHELMVSAREIMSSLQEMFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNV
        ECP++PA N   +       W K N+KA+ YILAS+SEVLAKKHE M++AREIM SLQEMFGQ S QI+H++LKY+YN+RM EG+SVREHVL++MVHFNV
Subjt:  ECPRIPARNVPLS-HWGVGHWIKVNDKAKIYILASVSEVLAKKHELMVSAREIMSSLQEMFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNV

Query:  AGMNRAVIDEQSQVSFILESLSKSFLQFRSNAVMNKIEYNLTTLLNELQTFQSLMKNKGQADGEANLFAHSRRFQKGSSSGTKPCNSSSGLKKTQKKKIG
        A MN AVIDE SQVSFILESL +SFLQFRSNAVMNKI Y LTTLLNELQTF+SLMK KGQ  GEAN+   +R+F +GS+SGTK   SSSG KK +KKK G
Subjt:  AGMNRAVIDEQSQVSFILESLSKSFLQFRSNAVMNKIEYNLTTLLNELQTFQSLMKNKGQADGEANLFAHSRRFQKGSSSGTKPCNSSSGLKKTQKKKIG

Query:  RKGKAP-ATYKGKGKTKLADKGKCFNCNMDGHWKRNCPKYLVELKE-KKGKLDLLFLETCLVENDDLTWILDSGATNHVCSSFQETSSFKELEEGEMTLR
        +  KA  A  K   KTK A KG CF+CN +GHWKRNCPKYL E K+ K+GK DLL LETCLVENDD  WI+DSGATNHVCSSFQ  SS+++LE GEMT+R
Subjt:  RKGKAP-ATYKGKGKTKLADKGKCFNCNMDGHWKRNCPKYLVELKE-KKGKLDLLFLETCLVENDDLTWILDSGATNHVCSSFQETSSFKELEEGEMTLR

Query:  VGTGDVVSARAVGDAKLFFGDRFLSLINLYIVPKLKRNLISISCLIELGYCISFSINEAFILKEGVNICSAKLKNGL-----------------------
        VGTG VVSA AVG  +L     FL L N+Y+VP LKRNLIS+ CL+E  Y ++F++N+ FI K GV ICSAKL+N L                       
Subjt:  VGTGDVVSARAVGDAKLFFGDRFLSLINLYIVPKLKRNLISISCLIELGYCISFSINEAFILKEGVNICSAKLKNGL-----------------------

Query:  ------------------LGHINLNRIERLSKNRLLNKLEDDSLPPCESCLE---------------------------------------------DDY
                          LGHINLNRIERL KN LL++LE++SLP CESCLE                                             DDY
Subjt:  ------------------LGHINLNRIERLSKNRLLNKLEDDSLPPCESCLE---------------------------------------------DDY

Query:  SRYGYLYLMHHKSEALEKFKEYKAEVENALGKSIKTLRSDRGGEYMDLRFQDYMIEHEIKFQLSAPNTPQQNGVSERRNRTLLDIVRSMMSYAQLSASFW
        SRYGY+YLM HKSEALEKFKEYKAEVENAL K+IKT RSDRGGEYMDL+FQ+Y++E  I  QLSAP TPQQNGVSERRNRTLLD+VRSMMSYA L  SFW
Subjt:  SRYGYLYLMHHKSEALEKFKEYKAEVENALGKSIKTLRSDRGGEYMDLRFQDYMIEHEIKFQLSAPNTPQQNGVSERRNRTLLDIVRSMMSYAQLSASFW

Query:  GYAMR
        GYA++
Subjt:  GYAMR

TrEMBL top hits (e value, %identity, alignment)
A0A5A7SMH8 Gag/pol protein; e value: 5.6e-181; %identity: 59.77
Query:  ECPRIPARNVPLS-HWGVGHWIKVNDKAKIYILASVSEVLAKKHELMVSAREIMSSLQEMFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNV
        ECP++PA N   +       W K N+KA+ YILAS+SEVLAKKHE M++AREIM SLQEMFGQ S QI+H++LKY+YN+RM EG+SVREHVL++MVHFNV
Subjt:  ECPRIPARNVPLS-HWGVGHWIKVNDKAKIYILASVSEVLAKKHELMVSAREIMSSLQEMFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNV

Query:  AGMNRAVIDEQSQVSFILESLSKSFLQFRSNAVMNKIEYNLTTLLNELQTFQSLMKNKGQADGEANLFAHSRRFQKGSSSGTKPCNSSSGLKKTQKKKIG
        A MN AVIDE SQVSFILESL +SFLQFRSNAVMNKI Y LTTLLNELQTF+SLMK KGQ  GEAN+   +R+F +GS+SGTK   SSSG KK +KKK G
Subjt:  AGMNRAVIDEQSQVSFILESLSKSFLQFRSNAVMNKIEYNLTTLLNELQTFQSLMKNKGQADGEANLFAHSRRFQKGSSSGTKPCNSSSGLKKTQKKKIG

Query:  RKGKAPATYKGKGKTKLADKGKCFNCNMDGHWKRNCPKYLVELKE-KKGKLDLLFLETCLVENDDLTWILDSGATNHVCSSFQETSSFKELEEGEMTLRV
        +  KA        K   A KG CF+CN +GHWKRNCPKYL E K+ K+GK DLL LETCLVENDD  WI+DSGATNHVCSSFQ  SS+++LE GEMT+RV
Subjt:  RKGKAPATYKGKGKTKLADKGKCFNCNMDGHWKRNCPKYLVELKE-KKGKLDLLFLETCLVENDDLTWILDSGATNHVCSSFQETSSFKELEEGEMTLRV

Query:  GTGDVVSARAVGDAKLFFGDRFLSLINLYIVPKLKRNLISISCLIELGYCISFSINEAFILKEGVNICSAKLKNGL------------------------
        GTG VVSA AVG  +L     FL L N+Y+VP LKRNLIS+ CL+E  Y ++F++N+ FI K GV ICSAKL+N L                        
Subjt:  GTGDVVSARAVGDAKLFFGDRFLSLINLYIVPKLKRNLISISCLIELGYCISFSINEAFILKEGVNICSAKLKNGL------------------------

Query:  -----------------LGHINLNRIERLSKNRLLNKLEDDSLPPCESCLE---------------------------------------------DDYS
                         LGHINLNRIERL KN LL++LE++SLP CESCLE                                             DDYS
Subjt:  -----------------LGHINLNRIERLSKNRLLNKLEDDSLPPCESCLE---------------------------------------------DDYS

Query:  RYGYLYLMHHKSEALEKFKEYKAEVENALGKSIKTLRSDRGGEYMDLRFQDYMIEHEIKFQLSAPNTPQQNGVSERRNRTLLDIVRSMMSYAQLSASFWG
        RYGY+YLM HKSEALEKFKEYKAEVENAL K+IKT RSDRGGEYMDL+FQ+Y++E  I  QLSAP TPQQNGVSERRNRTLLD+VRSMMSYA L  SFWG
Subjt:  RYGYLYLMHHKSEALEKFKEYKAEVENALGKSIKTLRSDRGGEYMDLRFQDYMIEHEIKFQLSAPNTPQQNGVSERRNRTLLDIVRSMMSYAQLSASFWG

Query:  YAMR
        YA++
Subjt:  YAMR

A0A5A7TWB9 Gag/pol protein; e value: 5.6e-181; %identity: 59.77
Query:  ECPRIPARNVPLS-HWGVGHWIKVNDKAKIYILASVSEVLAKKHELMVSAREIMSSLQEMFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNV
        ECP++PA N   +       W K N+KA+ YILAS+SEVLAKKHE M++AREIM SLQEMFGQ S QI+H++LKY+YN+RM EG+SVREHVL++MVHFNV
Subjt:  ECPRIPARNVPLS-HWGVGHWIKVNDKAKIYILASVSEVLAKKHELMVSAREIMSSLQEMFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNV

Query:  AGMNRAVIDEQSQVSFILESLSKSFLQFRSNAVMNKIEYNLTTLLNELQTFQSLMKNKGQADGEANLFAHSRRFQKGSSSGTKPCNSSSGLKKTQKKKIG
        A MN AVIDE SQVSFILESL +SFLQFRSNAVMNKI Y LTTLLNELQTF+SLMK KGQ  GEAN+   +R+F +GS+SGTK   SSSG KK +KKK G
Subjt:  AGMNRAVIDEQSQVSFILESLSKSFLQFRSNAVMNKIEYNLTTLLNELQTFQSLMKNKGQADGEANLFAHSRRFQKGSSSGTKPCNSSSGLKKTQKKKIG

Query:  RKGKAPATYKGKGKTKLADKGKCFNCNMDGHWKRNCPKYLVELKE-KKGKLDLLFLETCLVENDDLTWILDSGATNHVCSSFQETSSFKELEEGEMTLRV
        +  KA        K   A KG CF+CN +GHWKRNCPKYL E K+ K+GK DLL LETCLVENDD  WI+DSGATNHVCSSFQ  SS+++LE GEMT+RV
Subjt:  RKGKAPATYKGKGKTKLADKGKCFNCNMDGHWKRNCPKYLVELKE-KKGKLDLLFLETCLVENDDLTWILDSGATNHVCSSFQETSSFKELEEGEMTLRV

Query:  GTGDVVSARAVGDAKLFFGDRFLSLINLYIVPKLKRNLISISCLIELGYCISFSINEAFILKEGVNICSAKLKNGL------------------------
        GTG VVSA AVG  +L     FL L N+Y+VP LKRNLIS+ CL+E  Y ++F++N+ FI K GV ICSAKL+N L                        
Subjt:  GTGDVVSARAVGDAKLFFGDRFLSLINLYIVPKLKRNLISISCLIELGYCISFSINEAFILKEGVNICSAKLKNGL------------------------

Query:  -----------------LGHINLNRIERLSKNRLLNKLEDDSLPPCESCLE---------------------------------------------DDYS
                         LGHINLNRIERL KN LL++LE++SLP CESCLE                                             DDYS
Subjt:  -----------------LGHINLNRIERLSKNRLLNKLEDDSLPPCESCLE---------------------------------------------DDYS

Query:  RYGYLYLMHHKSEALEKFKEYKAEVENALGKSIKTLRSDRGGEYMDLRFQDYMIEHEIKFQLSAPNTPQQNGVSERRNRTLLDIVRSMMSYAQLSASFWG
        RYGY+YLM HKSEALEKFKEYKAEVENAL K+IKT RSDRGGEYMDL+FQ+Y++E  I  QLSAP TPQQNGVSERRNRTLLD+VRSMMSYA L  SFWG
Subjt:  RYGYLYLMHHKSEALEKFKEYKAEVENALGKSIKTLRSDRGGEYMDLRFQDYMIEHEIKFQLSAPNTPQQNGVSERRNRTLLDIVRSMMSYAQLSASFWG

Query:  YAMR
        YA++
Subjt:  YAMR

A0A5A7V4M1 Gag/pol protein; e value: 3.3e-181; %identity: 59.67
Query:  RECPRIPARNVPLS-HWGVGHWIKVNDKAKIYILASVSEVLAKKHELMVSAREIMSSLQEMFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFN
        +ECP++PA N   +       W K N+KA+ YILAS+SEVLAKKHE M++AREIM SLQEMFGQ S QI+H++LKY+YN+RM EG+SVREHVL++MVHFN
Subjt:  RECPRIPARNVPLS-HWGVGHWIKVNDKAKIYILASVSEVLAKKHELMVSAREIMSSLQEMFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFN

Query:  VAGMNRAVIDEQSQVSFILESLSKSFLQFRSNAVMNKIEYNLTTLLNELQTFQSLMKNKGQADGEANLFAHSRRFQKGSSSGTKPCNSSSGLKKTQKKKI
        VA MN AVIDE SQVSFILESL +SFLQFRSNAVMNKI Y LTTLLNELQTF+SLMK KGQ  GEAN+   +R+F +GS+SGTK   SSSG KK +KKK 
Subjt:  VAGMNRAVIDEQSQVSFILESLSKSFLQFRSNAVMNKIEYNLTTLLNELQTFQSLMKNKGQADGEANLFAHSRRFQKGSSSGTKPCNSSSGLKKTQKKKI

Query:  GRKGKAPATYKGKGKTKLADKGKCFNCNMDGHWKRNCPKYLVELKE-KKGKLDLLFLETCLVENDDLTWILDSGATNHVCSSFQETSSFKELEEGEMTLR
        G+  KA        K   A KG CF+CN +GHWKRNCPKYL E K+ K+GK DLL LETCLVENDD  WI+DSGATNHVCSSFQ  SS+++LE GEMT+R
Subjt:  GRKGKAPATYKGKGKTKLADKGKCFNCNMDGHWKRNCPKYLVELKE-KKGKLDLLFLETCLVENDDLTWILDSGATNHVCSSFQETSSFKELEEGEMTLR

Query:  VGTGDVVSARAVGDAKLFFGDRFLSLINLYIVPKLKRNLISISCLIELGYCISFSINEAFILKEGVNICSAKLKNGL-----------------------
        VGTG VVSA AVG  +L     FL L N+Y+VP LKRNLIS+ CL+E  Y ++F++N+ FI K GV ICSAKL+N L                       
Subjt:  VGTGDVVSARAVGDAKLFFGDRFLSLINLYIVPKLKRNLISISCLIELGYCISFSINEAFILKEGVNICSAKLKNGL-----------------------

Query:  ------------------LGHINLNRIERLSKNRLLNKLEDDSLPPCESCLE---------------------------------------------DDY
                          LGHINLNRIERL KN LL++LE++SLP CESCLE                                             DDY
Subjt:  ------------------LGHINLNRIERLSKNRLLNKLEDDSLPPCESCLE---------------------------------------------DDY

Query:  SRYGYLYLMHHKSEALEKFKEYKAEVENALGKSIKTLRSDRGGEYMDLRFQDYMIEHEIKFQLSAPNTPQQNGVSERRNRTLLDIVRSMMSYAQLSASFW
        SRYGY+YLM HKSEALEKFKEYKAEVENAL K+IKT RSDRGGEYMDL+FQ+Y++E  I  QLSAP TPQQNGVSERRNRTLLD+VRSMMSYA L  SFW
Subjt:  SRYGYLYLMHHKSEALEKFKEYKAEVENALGKSIKTLRSDRGGEYMDLRFQDYMIEHEIKFQLSAPNTPQQNGVSERRNRTLLDIVRSMMSYAQLSASFW

Query:  GYAMR
        GYA++
Subjt:  GYAMR

A0A5D3CPJ6 Gag/pol protein; e value: 3.3e-181; %identity: 60.33
Query:  ECPRIPARNVPLS-HWGVGHWIKVNDKAKIYILASVSEVLAKKHELMVSAREIMSSLQEMFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNV
        ECP++PA N   +       W K N+KA+ YILAS+SEVLAKKHE M++AREIM SLQEMFGQ S QI+H++LKY+YN+RM EG+SVREHVL++MVHFNV
Subjt:  ECPRIPARNVPLS-HWGVGHWIKVNDKAKIYILASVSEVLAKKHELMVSAREIMSSLQEMFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNV

Query:  AGMNRAVIDEQSQVSFILESLSKSFLQFRSNAVMNKIEYNLTTLLNELQTFQSLMKNKGQADGEANLFAHSRRFQKGSSSGTKPCNSSSGLKKTQKKKIG
        A MN AVIDE SQVSFILESL +SFLQFRSNAVMNKI Y LTTLLNELQTF+SLMK KGQ  GEAN+   +R+F +GS+SGTK   SSSG KK +KKK G
Subjt:  AGMNRAVIDEQSQVSFILESLSKSFLQFRSNAVMNKIEYNLTTLLNELQTFQSLMKNKGQADGEANLFAHSRRFQKGSSSGTKPCNSSSGLKKTQKKKIG

Query:  RKGKAP-ATYKGKGKTKLADKGKCFNCNMDGHWKRNCPKYLVELKE-KKGKLDLLFLETCLVENDDLTWILDSGATNHVCSSFQETSSFKELEEGEMTLR
        +  KA  A  K   KTK A KG CF+CN +GHWKRNCPKYL E K+ K+GK DLL LETCLVENDD  WI+DSGATNHVCSSFQ  SS+++LE GEMT+R
Subjt:  RKGKAP-ATYKGKGKTKLADKGKCFNCNMDGHWKRNCPKYLVELKE-KKGKLDLLFLETCLVENDDLTWILDSGATNHVCSSFQETSSFKELEEGEMTLR

Query:  VGTGDVVSARAVGDAKLFFGDRFLSLINLYIVPKLKRNLISISCLIELGYCISFSINEAFILKEGVNICSAKLKNGL-----------------------
        VGTG VVSA AVG  +L     FL L N+Y+VP LKRNLIS+ CL+E  Y ++F++N+ FI K GV ICSAKL+N L                       
Subjt:  VGTGDVVSARAVGDAKLFFGDRFLSLINLYIVPKLKRNLISISCLIELGYCISFSINEAFILKEGVNICSAKLKNGL-----------------------

Query:  ------------------LGHINLNRIERLSKNRLLNKLEDDSLPPCESCLE---------------------------------------------DDY
                          LGHINLNRIERL KN LL++LE++SLP CESCLE                                             DDY
Subjt:  ------------------LGHINLNRIERLSKNRLLNKLEDDSLPPCESCLE---------------------------------------------DDY

Query:  SRYGYLYLMHHKSEALEKFKEYKAEVENALGKSIKTLRSDRGGEYMDLRFQDYMIEHEIKFQLSAPNTPQQNGVSERRNRTLLDIVRSMMSYAQLSASFW
        SRYGY+YLM HKSEALEKFKEYKAEVENAL K+IKT RSDRGGEYMDL+FQ+Y++E  I  QLSAP TPQQNGVSERRNRTLLD+VRSMMSYA L  SFW
Subjt:  SRYGYLYLMHHKSEALEKFKEYKAEVENALGKSIKTLRSDRGGEYMDLRFQDYMIEHEIKFQLSAPNTPQQNGVSERRNRTLLDIVRSMMSYAQLSASFW

Query:  GYAMR
        GYA++
Subjt:  GYAMR

A0A5D3CSZ6 Gag/pol protein; e value: 3.3e-181; %identity: 60.33
Query:  ECPRIPARNVPLS-HWGVGHWIKVNDKAKIYILASVSEVLAKKHELMVSAREIMSSLQEMFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNV
        ECP++PA N   +       W K N+KA+ YILAS+SEVLAKKHE M++AREIM SLQEMFGQ S QI+H++LKY+YN+RM EG+SVREHVL++MVHFNV
Subjt:  ECPRIPARNVPLS-HWGVGHWIKVNDKAKIYILASVSEVLAKKHELMVSAREIMSSLQEMFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNV

Query:  AGMNRAVIDEQSQVSFILESLSKSFLQFRSNAVMNKIEYNLTTLLNELQTFQSLMKNKGQADGEANLFAHSRRFQKGSSSGTKPCNSSSGLKKTQKKKIG
        A MN AVIDE SQVSFILESL +SFLQFRSNAVMNKI Y LTTLLNELQTF+SLMK KGQ  GEAN+   +R+F +GS+SGTK   SSSG KK +KKK G
Subjt:  AGMNRAVIDEQSQVSFILESLSKSFLQFRSNAVMNKIEYNLTTLLNELQTFQSLMKNKGQADGEANLFAHSRRFQKGSSSGTKPCNSSSGLKKTQKKKIG

Query:  RKGKAP-ATYKGKGKTKLADKGKCFNCNMDGHWKRNCPKYLVELKE-KKGKLDLLFLETCLVENDDLTWILDSGATNHVCSSFQETSSFKELEEGEMTLR
        +  KA  A  K   KTK A KG CF+CN +GHWKRNCPKYL E K+ K+GK DLL LETCLVENDD  WI+DSGATNHVCSSFQ  SS+++LE GEMT+R
Subjt:  RKGKAP-ATYKGKGKTKLADKGKCFNCNMDGHWKRNCPKYLVELKE-KKGKLDLLFLETCLVENDDLTWILDSGATNHVCSSFQETSSFKELEEGEMTLR

Query:  VGTGDVVSARAVGDAKLFFGDRFLSLINLYIVPKLKRNLISISCLIELGYCISFSINEAFILKEGVNICSAKLKNGL-----------------------
        VGTG VVSA AVG  +L     FL L N+Y+VP LKRNLIS+ CL+E  Y ++F++N+ FI K GV ICSAKL+N L                       
Subjt:  VGTGDVVSARAVGDAKLFFGDRFLSLINLYIVPKLKRNLISISCLIELGYCISFSINEAFILKEGVNICSAKLKNGL-----------------------

Query:  ------------------LGHINLNRIERLSKNRLLNKLEDDSLPPCESCLE---------------------------------------------DDY
                          LGHINLNRIERL KN LL++LE++SLP CESCLE                                             DDY
Subjt:  ------------------LGHINLNRIERLSKNRLLNKLEDDSLPPCESCLE---------------------------------------------DDY

Query:  SRYGYLYLMHHKSEALEKFKEYKAEVENALGKSIKTLRSDRGGEYMDLRFQDYMIEHEIKFQLSAPNTPQQNGVSERRNRTLLDIVRSMMSYAQLSASFW
        SRYGY+YLM HKSEALEKFKEYKAEVENAL K+IKT RSDRGGEYMDL+FQ+Y++E  I  QLSAP TPQQNGVSERRNRTLLD+VRSMMSYA L  SFW
Subjt:  SRYGYLYLMHHKSEALEKFKEYKAEVENALGKSIKTLRSDRGGEYMDLRFQDYMIEHEIKFQLSAPNTPQQNGVSERRNRTLLDIVRSMMSYAQLSASFW

Query:  GYAMR
        GYA++
Subjt:  GYAMR

SwissProt top hits (e value, %identity, alignment)
P04146 Copia protein; e value: 1.8e-14; %identity: 32.65
Query:  DDYSRYGYLYLMHHKSEALEKFKEYKAEVENALGKSIKTLRSDRGGEYMDLRFQDYMIEHEIKFQLSAPNTPQQNGVSERRNRTLLDIVRSMMSYAQLSA
        D ++ Y   YL+ +KS+    F+++ A+ E      +  L  D G EY+    + + ++  I + L+ P+TPQ NGVSER  RT+ +  R+M+S A+L  
Subjt:  DDYSRYGYLYLMHHKSEALEKFKEYKAEVENALGKSIKTLRSDRGGEYMDLRFQDYMIEHEIKFQLSAPNTPQQNGVSERRNRTLLDIVRSMMSYAQLSA

Query:  SFWGYAMRDCDSNLERV---------------FHQK--YLKHLLNFG
        SFWG A+      + R+               +H K  YLKHL  FG
Subjt:  SFWGYAMRDCDSNLERV---------------FHQK--YLKHLLNFG

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e value: 8.1e-28; %identity: 23.99
Query:  WIKVNDKAKIYILASVSEVLAKKHELMVSAREIMSSLQEMFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAGMNRAV-IDEQSQVSFILE
        W  ++++A   I   +S+ +        +AR I + L+ ++   +   +    K +Y   M EG++   H L++         N  V I+E+ +   +L 
Subjt:  WIKVNDKAKIYILASVSEVLAKKHELMVSAREIMSSLQEMFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAGMNRAV-IDEQSQVSFILE

Query:  SLSKSFLQFRSNAVMNKIEYNLTTLLNELQTFQSLMKNKGQADGEANLF-AHSRRFQKGSSSGTKPCNSSSGLKKTQKKKIGRKGKAPATYKGKGKTKLA
        SL  S+    +  +  K    L  + + L      M+ K +  G+A +     R +Q+ S++                   GR G        +GK+K  
Subjt:  SLSKSFLQFRSNAVMNKIEYNLTTLLNELQTFQSLMKNKGQADGEANLF-AHSRRFQKGSSSGTKPCNSSSGLKKTQKKKIGRKGKAPATYKGKGKTKLA

Query:  DKGK---CFNCNMDGHWKRNCPKYLVELKEKKGKLD--------------LLFL---ETCL-VENDDLTWILDSGATNHVCSSFQETSSFKELEEGEM-T
         K +   C+NCN  GH+KR+CP       E  G+ +              +LF+   E C+ +   +  W++D+ A++H   +      F     G+  T
Subjt:  DKGK---CFNCNMDGHWKRNCPKYLVELKEKKGKLD--------------LLFL---ETCL-VENDDLTWILDSGATNHVCSSFQETSSFKELEEGEM-T

Query:  LRVGTGDVVSARAVGD--AKLFFGDRFLSLINLYIVPKLKRNLISISCLIELGYCISFSINEAFILKEG-----------------VNICSAKLK-----
        +++G         +GD   K   G   L L ++  VP L+ NLIS   L   GY  S+  N+ + L +G                   IC  +L      
Subjt:  LRVGTGDVVSARAVGD--AKLFFGDRFLSLINLYIVPKLKRNLISISCLIELGYCISFSINEAFILKEG-----------------VNICSAKLK-----

Query:  ------NGLLGHINLNRIERLSKNRLLNKLEDDSLPPCESCL---------------------------------------------EDDYSRYGYLYLM
              +  +GH++   ++ L+K  L++  +  ++ PC+ CL                                              DD SR  ++Y++
Subjt:  ------NGLLGHINLNRIERLSKNRLLNKLEDDSLPPCESCL---------------------------------------------EDDYSRYGYLYLM

Query:  HHKSEALEKFKEYKAEVENALGKSIKTLRSDRGGEYMDLRFQDYMIEHEIKFQLSAPNTPQQNGVSERRNRTLLDIVRSMMSYAQLSASFWGYAMR
          K +  + F+++ A VE   G+ +K LRSD GGEY    F++Y   H I+ + + P TPQ NGV+ER NRT+++ VRSM+  A+L  SFWG A++
Subjt:  HHKSEALEKFKEYKAEVENALGKSIKTLRSDRGGEYMDLRFQDYMIEHEIKFQLSAPNTPQQNGVSERRNRTLLDIVRSMMSYAQLSASFWGYAMR

Q07163 Transposon TyH3 Gag-Pol polyprotein; e value: 2.2e-04; %identity: 25.86
Query:  DSLPPCESCLEDDYSRYGYLYLMHHKSE--ALEKFKEYKAEVENALGKSIKTLRSDRGGEYMDLRFQDYMIEHEIKFQLSAPNTPQQNGVSERRNRTLLD
        +S P       D+ +++ ++Y +H + E   L+ F    A ++N    S+  ++ DRG EY +     ++ ++ I    +     + +GV+ER NRTLLD
Subjt:  DSLPPCESCLEDDYSRYGYLYLMHHKSE--ALEKFKEYKAEVENALGKSIKTLRSDRGGEYMDLRFQDYMIEHEIKFQLSAPNTPQQNGVSERRNRTLLD

Query:  IVRSMMSYAQLSASFW
          R+ +  + L    W
Subjt:  IVRSMMSYAQLSASFW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1; e value: 5.5e-16; %identity: 20.9
Query:  ILASVSEVLAKKHELMVSAREIMSSLQEMFGQPS-GQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAGMNRAVIDEQSQVSFILESLSKSFLQFRS
        +L ++S  +        +A +I  +L++++  PS G +    L+       K   ++ +++  L+  F+   +    +D   QV  +LE+L + +     
Subjt:  ILASVSEVLAKKHELMVSAREIMSSLQEMFGQPS-GQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAGMNRAVIDEQSQVSFILESLSKSFLQFRS

Query:  NAVMNKIEYNLTTLLNELQTFQSLMKNKGQADG---EANLFAHSRRFQKGSSSGTKPCNSSSGLKKTQKKKIGRKGKAPATYKGKGKTKLADKGKCFNCN
                  LT +   L   +S +     A      AN  +H       +++     N           K  +  ++   +           GKC  C 
Subjt:  NAVMNKIEYNLTTLLNELQTFQSLMKNKGQADG---EANLFAHSRRFQKGSSSGTKPCNSSSGLKKTQKKKIGRKGKAPATYKGKGKTKLADKGKCFNCN

Query:  MDGHWKRNCPK---YLVELKEKKGKLDL--------LFLETCLVENDDLTWILDSGATNHVCSSFQETSSFKELEEGEMTLRVGTGDVVSARAVGDAKLF
        + GH  + C +   +L  +  ++             L L +    N+   W+LDSGAT+H+ S F   S  +    G+  + V  G  +     G   L 
Subjt:  MDGHWKRNCPK---YLVELKEKKGKLDL--------LFLETCLVENDDLTWILDSGATNHVCSSFQETSSFKELEEGEMTLRVGTGDVVSARAVGDAKLF

Query:  FGDRFLSLINLYIVPKLKRNLISISCLIELG------YCISFSINEAFILKEGVNICSAKLKNGL-----------------------------LGHINL
           R L+L N+  VP + +NLIS+  L          +  SF + +   L  GV +   K K+ L                             LGH   
Subjt:  FGDRFLSLINLYIVPKLKRNLISISCLIELG------YCISFSINEAFILKEGVNICSAKLKNGL-----------------------------LGHINL

Query:  NRIERLSKNRLLNKLE-DDSLPPCESCL--------------------------------------------EDDYSRYGYLYLMHHKSEALEKFKEYKA
        + +  +  N  L+ L        C  CL                                             D ++RY +LY +  KS+  E F  +K 
Subjt:  NRIERLSKNRLLNKLE-DDSLPPCESCL--------------------------------------------EDDYSRYGYLYLMHHKSEALEKFKEYKA

Query:  EVENALGKSIKTLRSDRGGEYMDLRFQDYMIEHEIKFQLSAPNTPQQNGVSERRNRTLLDIVRSMMSYAQLSASFWGYA
         +EN     I T  SD GGE++ L   +Y  +H I    S P+TP+ NG+SER++R +++   +++S+A +  ++W YA
Subjt:  EVENALGKSIKTLRSDRGGEYMDLRFQDYMIEHEIKFQLSAPNTPQQNGVSERRNRTLLDIVRSMMSYAQLSASFWGYA

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2; e value: 1.3e-17; %identity: 22.65
Query:  LDLMVHFNVAGMNRAVIDEQSQVSFILESLSKSFLQFRSNAVMNKIEYNLTTLLNELQTFQSLMKNKGQADG---EANLFAH----SRRFQKGSSSGTKP
        L  +  F+   +    +D   QV  +LE+L   +              +LT +   L   +S +     A+     AN+  H    + R Q         
Subjt:  LDLMVHFNVAGMNRAVIDEQSQVSFILESLSKSFLQFRSNAVMNKIEYNLTTLLNELQTFQSLMKNKGQADG---EANLFAH----SRRFQKGSSSGTKP

Query:  CNSSSGLKKTQKKKIGRK--GKAPATYKGKGKTKLADKGKCFNCNMDGHWKRNCPKY-----LVELKEKKGKLDLLFLETCLVENDDL---TWILDSGAT
         N+++     Q    G +   + P  Y           G+C  C++ GH  + CP+          ++             L  N       W+LDSGAT
Subjt:  CNSSSGLKKTQKKKIGRK--GKAPATYKGKGKTKLADKGKCFNCNMDGHWKRNCPKY-----LVELKEKKGKLDLLFLETCLVENDDL---TWILDSGAT

Query:  NHVCSSFQETSSFKELEEGEMTLRVGTGDVVSARAVGDAKLFFGDRFLSLINLYIVPKLKRNLISISCLIELG------YCISFSINEAFILKEGVNICS
        +H+ S F    SF +   G   + +  G  +     G A L    R L L  +  VP + +NLIS+  L          +  SF + +   L  GV +  
Subjt:  NHVCSSFQETSSFKELEEGEMTLRVGTGDVVSARAVGDAKLFFGDRFLSLINLYIVPKLKRNLISISCLIELG------YCISFSINEAFILKEGVNICS

Query:  AKLKNGL-----------------------------LGHINLNRIERLSKNRLLNKLE-DDSLPPCESC-------------------------------
         K K+ L                             LGH +L  +  +  N  L  L     L  C  C                               
Subjt:  AKLKNGL-----------------------------LGHINLNRIERLSKNRLLNKLE-DDSLPPCESC-------------------------------

Query:  -------------LEDDYSRYGYLYLMHHKSEALEKFKEYKAEVENALGKSIKTLRSDRGGEYMDLRFQDYMIEHEIKFQLSAPNTPQQNGVSERRNRTL
                       D ++RY +LY +  KS+  + F  +K+ VEN     I TL SD GGE++ LR  DY+ +H I    S P+TP+ NG+SER++R +
Subjt:  -------------LEDDYSRYGYLYLMHHKSEALEKFKEYKAEVENALGKSIKTLRSDRGGEYMDLRFQDYMIEHEIKFQLSAPNTPQQNGVSERRNRTL

Query:  LDIVRSMMSYAQLSASFWGYA
        +++  +++S+A +  ++W YA
Subjt:  LDIVRSMMSYAQLSASFWGYA

Arabidopsis top hits (e value, %identity, alignment)
No hits found
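
The %identity values reported for the hits above are conventionally the share of alignment columns in which query and subject carry the same residue. A hedged sketch of that calculation, assuming the BLAST-style convention that gap columns count toward the alignment length (the helper `percent_identity` and the toy sequences are illustrative and not taken from this page):

```python
def percent_identity(query: str, subject: str) -> float:
    """Percent of aligned columns with identical residues (gap columns count in the length)."""
    if len(query) != len(subject):
        raise ValueError("aligned sequences must have equal length")
    if not query:
        raise ValueError("alignment is empty")
    # A column is identical only if both residues match and neither is a gap.
    identical = sum(1 for q, s in zip(query, subject) if q == s and q != "-")
    return 100.0 * identical / len(query)


# Toy alignment: 8 columns, 6 identical, 1 substitution, 1 gap.
print(percent_identity("MKV-LSAR", "MKVALTAR"))  # 75.0
```

Whether this site uses exactly this denominator is an assumption; some tools divide by the shorter sequence length instead.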

Sequences
CDS sequence
ATGCTCGATTCTGCTGGCGTCAATAGGGGCGTCCCCACTACGACTAAGCGAGAATGCCCTCGGATCCCTGCTCGTAACGTTCCTCTATCTCATTGGGGCGTCGGCCATTG
GATCAAGGTCAATGATAAGGCCAAGATCTACATTTTGGCTAGTGTTTCTGAAGTTCTGGCCAAAAAGCACGAGCTCATGGTCTCAGCTCGTGAGATCATGAGTTCACTGC
AGGAAATGTTTGGACAACCGTCTGGACAGATTCGGCACGAATCCCTCAAATACGTTTATAACTCCCGTATGAAGGAGGGGTCATCGGTGAGAGAACACGTTCTTGATCTG
ATGGTCCACTTCAACGTGGCTGGGATGAACAGAGCGGTCATTGACGAGCAAAGTCAGGTATCGTTCATCCTGGAATCTCTTTCGAAGAGTTTCCTGCAATTCCGCAGCAA
TGCGGTGATGAACAAGATAGAGTATAACCTGACTACTCTCCTTAATGAACTACAGACTTTCCAGTCTCTTATGAAGAATAAGGGACAGGCTGATGGAGAGGCAAATCTGT
TTGCTCATTCCAGAAGGTTCCAGAAGGGTTCATCCTCTGGGACTAAGCCCTGTAACTCTTCTTCTGGGCTTAAGAAGACCCAAAAGAAGAAAATAGGAAGGAAAGGGAAG
GCACCTGCTACTTACAAAGGCAAGGGAAAGACCAAGCTTGCAGATAAAGGAAAGTGTTTCAACTGCAACATGGATGGGCACTGGAAGAGAAACTGCCCAAAATACCTTGT
TGAGCTCAAAGAGAAGAAAGGTAAATTAGATTTACTTTTTCTTGAAACTTGTTTAGTGGAAAATGATGATTTAACCTGGATACTTGATTCAGGAGCCACTAATCACGTTT
GCTCTTCATTTCAGGAAACTAGTTCCTTCAAGGAGCTCGAAGAGGGTGAGATGACGCTCAGGGTCGGAACGGGAGACGTCGTCTCAGCTCGTGCAGTGGGAGATGCCAAG
TTATTTTTCGGAGATAGATTTCTTTCTTTAATAAATCTGTATATAGTTCCTAAGCTTAAAAGGAACTTAATTTCTATCTCTTGTTTAATAGAACTTGGTTATTGTATTTC
TTTTTCAATCAATGAAGCGTTCATTTTGAAAGAGGGTGTCAATATTTGTTCTGCTAAATTGAAAAATGGCTTACTTGGTCATATTAATCTCAACCGGATTGAGAGACTTT
CTAAGAATAGACTTCTAAACAAGTTAGAAGATGATTCTTTACCGCCTTGCGAGTCTTGCTTGGAAGATGATTATTCGAGGTATGGTTATCTATACCTAATGCATCACAAG
TCTGAAGCTCTTGAAAAGTTCAAAGAGTATAAGGCTGAAGTAGAGAATGCATTAGGAAAAAGCATTAAAACACTTCGATCCGATCGAGGTGGAGAGTATATGGATTTGAG
ATTCCAGGACTATATGATAGAACATGAAATTAAATTTCAACTCTCAGCACCTAATACACCACAGCAAAACGGTGTGTCAGAAAGGAGAAATAGAACCTTGTTAGACATAG
TTCGTTCTATGATGAGTTATGCTCAATTGTCTGCCTCGTTTTGGGGATATGCGATGAGAGACTGCGATTCAAATCTTGAACGAGTGTTCCATCAAAAGTATTTGAAACAC
CTTTTGAACTTTGGAAGGGCGTAA
mRNA sequence
ATGCTCGATTCTGCTGGCGTCAATAGGGGCGTCCCCACTACGACTAAGCGAGAATGCCCTCGGATCCCTGCTCGTAACGTTCCTCTATCTCATTGGGGCGTCGGCCATTG
GATCAAGGTCAATGATAAGGCCAAGATCTACATTTTGGCTAGTGTTTCTGAAGTTCTGGCCAAAAAGCACGAGCTCATGGTCTCAGCTCGTGAGATCATGAGTTCACTGC
AGGAAATGTTTGGACAACCGTCTGGACAGATTCGGCACGAATCCCTCAAATACGTTTATAACTCCCGTATGAAGGAGGGGTCATCGGTGAGAGAACACGTTCTTGATCTG
ATGGTCCACTTCAACGTGGCTGGGATGAACAGAGCGGTCATTGACGAGCAAAGTCAGGTATCGTTCATCCTGGAATCTCTTTCGAAGAGTTTCCTGCAATTCCGCAGCAA
TGCGGTGATGAACAAGATAGAGTATAACCTGACTACTCTCCTTAATGAACTACAGACTTTCCAGTCTCTTATGAAGAATAAGGGACAGGCTGATGGAGAGGCAAATCTGT
TTGCTCATTCCAGAAGGTTCCAGAAGGGTTCATCCTCTGGGACTAAGCCCTGTAACTCTTCTTCTGGGCTTAAGAAGACCCAAAAGAAGAAAATAGGAAGGAAAGGGAAG
GCACCTGCTACTTACAAAGGCAAGGGAAAGACCAAGCTTGCAGATAAAGGAAAGTGTTTCAACTGCAACATGGATGGGCACTGGAAGAGAAACTGCCCAAAATACCTTGT
TGAGCTCAAAGAGAAGAAAGGTAAATTAGATTTACTTTTTCTTGAAACTTGTTTAGTGGAAAATGATGATTTAACCTGGATACTTGATTCAGGAGCCACTAATCACGTTT
GCTCTTCATTTCAGGAAACTAGTTCCTTCAAGGAGCTCGAAGAGGGTGAGATGACGCTCAGGGTCGGAACGGGAGACGTCGTCTCAGCTCGTGCAGTGGGAGATGCCAAG
TTATTTTTCGGAGATAGATTTCTTTCTTTAATAAATCTGTATATAGTTCCTAAGCTTAAAAGGAACTTAATTTCTATCTCTTGTTTAATAGAACTTGGTTATTGTATTTC
TTTTTCAATCAATGAAGCGTTCATTTTGAAAGAGGGTGTCAATATTTGTTCTGCTAAATTGAAAAATGGCTTACTTGGTCATATTAATCTCAACCGGATTGAGAGACTTT
CTAAGAATAGACTTCTAAACAAGTTAGAAGATGATTCTTTACCGCCTTGCGAGTCTTGCTTGGAAGATGATTATTCGAGGTATGGTTATCTATACCTAATGCATCACAAG
TCTGAAGCTCTTGAAAAGTTCAAAGAGTATAAGGCTGAAGTAGAGAATGCATTAGGAAAAAGCATTAAAACACTTCGATCCGATCGAGGTGGAGAGTATATGGATTTGAG
ATTCCAGGACTATATGATAGAACATGAAATTAAATTTCAACTCTCAGCACCTAATACACCACAGCAAAACGGTGTGTCAGAAAGGAGAAATAGAACCTTGTTAGACATAG
TTCGTTCTATGATGAGTTATGCTCAATTGTCTGCCTCGTTTTGGGGATATGCGATGAGAGACTGCGATTCAAATCTTGAACGAGTGTTCCATCAAAAGTATTTGAAACAC
CTTTTGAACTTTGGAAGGGCGTAA
Protein sequence
MLDSAGVNRGVPTTTKRECPRIPARNVPLSHWGVGHWIKVNDKAKIYILASVSEVLAKKHELMVSAREIMSSLQEMFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDL
MVHFNVAGMNRAVIDEQSQVSFILESLSKSFLQFRSNAVMNKIEYNLTTLLNELQTFQSLMKNKGQADGEANLFAHSRRFQKGSSSGTKPCNSSSGLKKTQKKKIGRKGK
APATYKGKGKTKLADKGKCFNCNMDGHWKRNCPKYLVELKEKKGKLDLLFLETCLVENDDLTWILDSGATNHVCSSFQETSSFKELEEGEMTLRVGTGDVVSARAVGDAK
LFFGDRFLSLINLYIVPKLKRNLISISCLIELGYCISFSINEAFILKEGVNICSAKLKNGLLGHINLNRIERLSKNRLLNKLEDDSLPPCESCLEDDYSRYGYLYLMHHK
SEALEKFKEYKAEVENALGKSIKTLRSDRGGEYMDLRFQDYMIEHEIKFQLSAPNTPQQNGVSERRNRTLLDIVRSMMSYAQLSASFWGYAMRDCDSNLERVFHQKYLKH
LLNFGRA
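
The protein sequence above should correspond to a codon-by-codon translation of the CDS. A minimal sketch of that check, assuming the standard nuclear genetic code and an uninterrupted reading frame starting at the first base (the names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any CuGenDB tooling):

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order for each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS (line breaks allowed) and drop one trailing stop codon."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3:
        raise ValueError("CDS length is not a multiple of three")
    # Raises KeyError on ambiguous bases such as N; acceptable for a sketch.
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First ten codons of the CDS listed above.
print(translate("ATGCTCGATTCTGCTGGCGTCAATAGGGGC"))  # MLDSAGVNRG
# Last four codons of the CDS, including the TAA stop.
print(translate("GGAAGGGCGTAA"))  # GRA
```

Passing the full CDS text to `translate` and comparing the result with the protein lines joined together would confirm whether the two records agree end to end.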