; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0018327 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0018327
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionGag/pol protein
Genome locationchr5:23205290..23217695
RNA-Seq ExpressionLag0018327
SyntenyLag0018327
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR005162 - Retrotransposon gag domain
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035879.1 gag/pol protein [Cucumis melo var. makuwa]2.2e-17553.63Show/hide
Query:  VKDAYERWTKANEKVKVYIFVSLSEVLAKRYENVETAREIMNSLQEMFGLLSYQLHHDALKNVFNAKMLEGQSVREHVLDMINQFNIAEANGGVVCERSQ
        V++ YERW KANEK + YI  SLSEVLAK++E++ TAREIM+SLQEMFG  SYQ+ HDALK ++NA+M EG SVREHVL+M+  FN+AE NG V+ E SQ
Subjt:  VKDAYERWTKANEKVKVYIFVSLSEVLAKRYENVETAREIMNSLQEMFGLLSYQLHHDALKNVFNAKMLEGQSVREHVLDMINQFNIAEANGGVVCERSQ

Query:  VAFILHSLPASYLSFRTNASMNKIQFNLTTLLSELQIYESLQKSKGKNVVKGEANVAHSKKKFLKGSSSGTKYVPQASSSKQIQKRKGDKGKAPAQAVQG
        V+FIL SLP S+L FR+NA MNKI + LTTLL+ELQ +ESL K KG+   KGEANVA S +KF +GS+SGTK +P +S +K+ +K+KG +G   A     
Subjt:  VAFILHSLPASYLSFRTNASMNKIQFNLTTLLSELQIYESLQKSKGKNVVKGEANVAHSKKKFLKGSSSGTKYVPQASSSKQIQKRKGDKGKAPAQAVQG

Query:  KGKAKVVADKGRCFHCNADGHWKRNCPRYLAEKRREKEGKFDLLVLETCFVEHDEFAWILDSGAANHVCSSLQG-NNFQQLVEGEMTLKVGTGDVVSARA
        K   K  A KG CFHCN +GHWKRNCP+YLAEK++ K+GK+DLLVLETC VE+D+ AWI+DSGA NHVCSS QG ++++QL  GEMT++VGTG VVSA A
Subjt:  KGKAKVVADKGRCFHCNADGHWKRNCPRYLAEKRREKEGKFDLLVLETCFVEHDEFAWILDSGAANHVCSSLQG-NNFQQLVEGEMTLKVGTGDVVSARA

Query:  VGVAKLFFRNRFLILENLYLVRRIKRNLISVSTLLEQCSSVSFLRNEALISKNGIYICSAKRENNLFVLRPNHAKAILSHEMFKTAETQNKRQKVSPLSN
        VG  +L  +  FL+LEN+Y+V  +KRNLISV  LLEQ  S++F  N+  I KNG+ ICSAK ENNL+VLR   +KA+L+ EMFKTA TQNKR K+SP   
Subjt:  VGVAKLFFRNRFLILENLYLVRRIKRNLISVSTLLEQCSSVSFLRNEALISKNGIYICSAKRENNLFVLRPNHAKAILSHEMFKTAETQNKRQKVSPLSN

Query:  NTYLWHLRLGHINIDRIDRLVKNGLLTDIEDTSLPPCESCLE---------------------------------------------DDYSRYGYLYLMG
        N +LWHLRLGHIN++RI+RLVKNGLL+++E+ SLP CESCLE                                             DDYSRYGY+YLM 
Subjt:  NTYLWHLRLGHINIDRIDRLVKNGLLTDIEDTSLPPCESCLE---------------------------------------------DDYSRYGYLYLMG

Query:  HKSKALEKFKEFKA------------------------------------------------------------------------SSFWGYAVETAVHI
        HKS+ALEKFKE+KA                                                                        +SFWGYAV+TAV+I
Subjt:  HKSKALEKFKEFKA------------------------------------------------------------------------SSFWGYAVETAVHI

Query:  LNSVPSKSVSETPFELWRGRKPSLHYFRIWGCPAHVLVTNPKKLEPR
        LN VPSKSVSETP +LW GRK SL +FRIWGCPAHVL  NPKKLEPR
Subjt:  LNSVPSKSVSETPFELWRGRKPSLHYFRIWGCPAHVLVTNPKKLEPR

KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa]3.4e-17654.17Show/hide
Query:  VKDAYERWTKANEKVKVYIFVSLSEVLAKRYENVETAREIMNSLQEMFGLLSYQLHHDALKNVFNAKMLEGQSVREHVLDMINQFNIAEANGGVVCERSQ
        V++ YERW KANEK + YI  SLSEVLAK++E++ TAREIM+SLQEMFG  SYQ+ HDALK ++NA+M EG SVREHVL+M+  FN+AE NG V+ E SQ
Subjt:  VKDAYERWTKANEKVKVYIFVSLSEVLAKRYENVETAREIMNSLQEMFGLLSYQLHHDALKNVFNAKMLEGQSVREHVLDMINQFNIAEANGGVVCERSQ

Query:  VAFILHSLPASYLSFRTNASMNKIQFNLTTLLSELQIYESLQKSKGKNVVKGEANVAHSKKKFLKGSSSGTKYVPQASSSKQIQKRKGDKG-KAPAQAVQ
        V+FIL SLP S+L FR+NA MNKI + LTTLL+ELQ +ESL K KG+   KGEANVA S +KF +GS+SGTK +P +S +K+ +K+KG +G KA   A +
Subjt:  VAFILHSLPASYLSFRTNASMNKIQFNLTTLLSELQIYESLQKSKGKNVVKGEANVAHSKKKFLKGSSSGTKYVPQASSSKQIQKRKGDKG-KAPAQAVQ

Query:  GKGKAKVVADKGRCFHCNADGHWKRNCPRYLAEKRREKEGKFDLLVLETCFVEHDEFAWILDSGAANHVCSSLQG-NNFQQLVEGEMTLKVGTGDVVSAR
           KAK  A KG CFHCN +GHWKRNCP+YLAEK++ K+GK+DLLVLETC VE+D+ AWI+DSGA NHVCSS QG +++QQL  GEMT++VGTG VVSA 
Subjt:  GKGKAKVVADKGRCFHCNADGHWKRNCPRYLAEKRREKEGKFDLLVLETCFVEHDEFAWILDSGAANHVCSSLQG-NNFQQLVEGEMTLKVGTGDVVSAR

Query:  AVGVAKLFFRNRFLILENLYLVRRIKRNLISVSTLLEQCSSVSFLRNEALISKNGIYICSAKRENNLFVLRPNHAKAILSHEMFKTAETQNKRQKVSPLS
        AVG  +L+ +  FL+LEN+Y+V  +KRNLISV  LLEQ  S++F  N+  I KNG+ ICSAK ENNL+VLR   +KA+L+ EMFKTA TQNKR K+SP  
Subjt:  AVGVAKLFFRNRFLILENLYLVRRIKRNLISVSTLLEQCSSVSFLRNEALISKNGIYICSAKRENNLFVLRPNHAKAILSHEMFKTAETQNKRQKVSPLS

Query:  NNTYLWHLRLGHINIDRIDRLVKNGLLTDIEDTSLPPCESCLE---------------------------------------------DDYSRYGYLYLM
         N +LWHLRLGHIN++RI+RLVKNGLL+++E+ SLP CESCLE                                             DDYSRYGY+YLM
Subjt:  NNTYLWHLRLGHINIDRIDRLVKNGLLTDIEDTSLPPCESCLE---------------------------------------------DDYSRYGYLYLM

Query:  GHKSKALEKFKEFKA------------------------------------------------------------------------SSFWGYAVETAVH
         HKS+ALEKFKE+KA                                                                        +SFWGYAV+TAV+
Subjt:  GHKSKALEKFKEFKA------------------------------------------------------------------------SSFWGYAVETAVH

Query:  ILNSVPSKSVSETPFELWRGRKPSLHYFRIWGCPAHVLVTNPKKLEPR
        ILN VPSKSVSETP +LW GRK SL +FRIWGCPAHVL  NPKKLEPR
Subjt:  ILNSVPSKSVSETPFELWRGRKPSLHYFRIWGCPAHVLVTNPKKLEPR

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa]4.5e-17652.26Show/hide
Query:  NWSSWDRFGSVQPI---------FCPVRVLQGRFGSAVKDAYERWTKANEKVKVYIFVSLSEVLAKRYENVETAREIMNSLQEMFGLLSYQLHHDALKNV
        N++SW    +V  I          CP +V        V++ YERW KANEK + YI  SLSEVLAK++E++ TAREIM+SLQEMFG  SYQ+ HDALK +
Subjt:  NWSSWDRFGSVQPI---------FCPVRVLQGRFGSAVKDAYERWTKANEKVKVYIFVSLSEVLAKRYENVETAREIMNSLQEMFGLLSYQLHHDALKNV

Query:  FNAKMLEGQSVREHVLDMINQFNIAEANGGVVCERSQVAFILHSLPASYLSFRTNASMNKIQFNLTTLLSELQIYESLQKSKGKNVVKGEANVAHSKKKF
        +NA+M EG SVREHVL+M+  FN+AE NG V+ E SQV+FIL SLP S+L FR+NA MNKI + LTTLL+ELQ +ESL K KG+   KGEANVA S +KF
Subjt:  FNAKMLEGQSVREHVLDMINQFNIAEANGGVVCERSQVAFILHSLPASYLSFRTNASMNKIQFNLTTLLSELQIYESLQKSKGKNVVKGEANVAHSKKKF

Query:  LKGSSSGTKYVPQASSSKQIQKRKGDKG-KAPAQAVQGKGKAKVVADKGRCFHCNADGHWKRNCPRYLAEKRREKEGKFDLLVLETCFVEHDEFAWILDS
         +GS+SGTK +P +S +K+ +K+KG +G KA   A +   KAK  A KG CFHCN +GHWKRNCP+YLAEK++ K+GK+DLLVLETC VE+D+ AWI+DS
Subjt:  LKGSSSGTKYVPQASSSKQIQKRKGDKG-KAPAQAVQGKGKAKVVADKGRCFHCNADGHWKRNCPRYLAEKRREKEGKFDLLVLETCFVEHDEFAWILDS

Query:  GAANHVCSSLQG-NNFQQLVEGEMTLKVGTGDVVSARAVGVAKLFFRNRFLILENLYLVRRIKRNLISVSTLLEQCSSVSFLRNEALISKNGIYICSAKR
        GA NHVCSS QG ++++QL  GEMT++VGTG VVSA AVG  +L  +  FL+LEN+Y+V  +KRNLISV  LLEQ  S++F  N+  I KNG+ ICSAK 
Subjt:  GAANHVCSSLQG-NNFQQLVEGEMTLKVGTGDVVSARAVGVAKLFFRNRFLILENLYLVRRIKRNLISVSTLLEQCSSVSFLRNEALISKNGIYICSAKR

Query:  ENNLFVLRPNHAKAILSHEMFKTAETQNKRQKVSPLSNNTYLWHLRLGHINIDRIDRLVKNGLLTDIEDTSLPPCESCLE--------------------
        ENNL+VLR   +KA+L+ EMFKTA TQNKR K+SP   N +LWHLRLGHIN++RI+RLVKNGLL+++E+ SLP CESCLE                    
Subjt:  ENNLFVLRPNHAKAILSHEMFKTAETQNKRQKVSPLSNNTYLWHLRLGHINIDRIDRLVKNGLLTDIEDTSLPPCESCLE--------------------

Query:  -------------------------DDYSRYGYLYLMGHKSKALEKFKEFKA------------------------------------------------
                                 DDYSRYGY+YLM HKS+ALEKFKE+KA                                                
Subjt:  -------------------------DDYSRYGYLYLMGHKSKALEKFKEFKA------------------------------------------------

Query:  ------------------------SSFWGYAVETAVHILNSVPSKSVSETPFELWRGRKPSLHYFRIWGCPAHVLVTNPKKLEPR
                                +SFWGYAV+TAV+ILN VPSKSVSETP +LW GRK SL +FRIWGCPAHVL  NPKKLEPR
Subjt:  ------------------------SSFWGYAVETAVHILNSVPSKSVSETPFELWRGRKPSLHYFRIWGCPAHVLVTNPKKLEPR

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]2.9e-17554.01Show/hide
Query:  VKDAYERWTKANEKVKVYIFVSLSEVLAKRYENVETAREIMNSLQEMFGLLSYQLHHDALKNVFNAKMLEGQSVREHVLDMINQFNIAEANGGVVCERSQ
        V++ YERW KANEK + YI  SLSEVLAK++E++ TAREIM+SLQEMFG  SYQ+ HDALK ++NA+M EG SVREHVL+M+  FN+AE NG V+ E SQ
Subjt:  VKDAYERWTKANEKVKVYIFVSLSEVLAKRYENVETAREIMNSLQEMFGLLSYQLHHDALKNVFNAKMLEGQSVREHVLDMINQFNIAEANGGVVCERSQ

Query:  VAFILHSLPASYLSFRTNASMNKIQFNLTTLLSELQIYESLQKSKGKNVVKGEANVAHSKKKFLKGSSSGTKYVPQASSSKQIQKRKGDKG-KAPAQAVQ
        V+FIL SLP S+L FR+NA MNKI + LTTLL+ELQ +ESL K KG+   KGEANVA S +KF +GS+SGTK +P +S +K+ +K+KG +G KA   A +
Subjt:  VAFILHSLPASYLSFRTNASMNKIQFNLTTLLSELQIYESLQKSKGKNVVKGEANVAHSKKKFLKGSSSGTKYVPQASSSKQIQKRKGDKG-KAPAQAVQ

Query:  GKGKAKVVADKGRCFHCNADGHWKRNCPRYLAEKRREKEGKFDLLVLETCFVEHDEFAWILDSGAANHVCSSLQG-NNFQQLVEGEMTLKVGTGDVVSAR
           KAK  A KG CFHCN +GHWKRNCP+YLAEK++ K+GK+DLLVLETC VE+D+ AWI+DSGA NHVCSS QG ++++QL  GEMT++VGTG VVSA 
Subjt:  GKGKAKVVADKGRCFHCNADGHWKRNCPRYLAEKRREKEGKFDLLVLETCFVEHDEFAWILDSGAANHVCSSLQG-NNFQQLVEGEMTLKVGTGDVVSAR

Query:  AVGVAKLFFRNRFLILENLYLVRRIKRNLISVSTLLEQCSSVSFLRNEALISKNGIYICSAKRENNLFVLRPNHAKAILSHEMFKTAETQNKRQKVSPLS
        AVG  +L  +  FL+LEN+Y+V  +KRNLISV  LLEQ  S++F  N+  I KNG+ ICSAK ENNL+VLR   +KA+L+ EMFKTA TQNKR K+SP  
Subjt:  AVGVAKLFFRNRFLILENLYLVRRIKRNLISVSTLLEQCSSVSFLRNEALISKNGIYICSAKRENNLFVLRPNHAKAILSHEMFKTAETQNKRQKVSPLS

Query:  NNTYLWHLRLGHINIDRIDRLVKNGLLTDIEDTSLPPCESCLE---------------------------------------------DDYSRYGYLYLM
         N +LWHLRLGHIN++RI+RLVKNGLL+++E+ SLP CESCLE                                             DDYSRYGY+YLM
Subjt:  NNTYLWHLRLGHINIDRIDRLVKNGLLTDIEDTSLPPCESCLE---------------------------------------------DDYSRYGYLYLM

Query:  GHKSKALEKFKEFKA------------------------------------------------------------------------SSFWGYAVETAVH
         HKS+ALEKFKE+KA                                                                        +SFWGYAV+TAV+
Subjt:  GHKSKALEKFKEFKA------------------------------------------------------------------------SSFWGYAVETAVH

Query:  ILNSVPSKSVSETPFELWRGRKPSLHYFRIWGCPAHVLVTNPKKLEPR
        ILN VPSKSVSETP +LW GRK SL +FRIWGCPAHVL  NPKKLEPR
Subjt:  ILNSVPSKSVSETPFELWRGRKPSLHYFRIWGCPAHVLVTNPKKLEPR

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]2.2e-17553.63Show/hide
Query:  VKDAYERWTKANEKVKVYIFVSLSEVLAKRYENVETAREIMNSLQEMFGLLSYQLHHDALKNVFNAKMLEGQSVREHVLDMINQFNIAEANGGVVCERSQ
        V++ YERW KANEK + YI  SLSEVLAK++E++ TAREIM+SLQEMFG  SYQ+ HDALK ++NA+M EG SVREHVL+M+  FN+AE NG V+ E SQ
Subjt:  VKDAYERWTKANEKVKVYIFVSLSEVLAKRYENVETAREIMNSLQEMFGLLSYQLHHDALKNVFNAKMLEGQSVREHVLDMINQFNIAEANGGVVCERSQ

Query:  VAFILHSLPASYLSFRTNASMNKIQFNLTTLLSELQIYESLQKSKGKNVVKGEANVAHSKKKFLKGSSSGTKYVPQASSSKQIQKRKGDKGKAPAQAVQG
        V+FIL SLP S+L FR+NA MNKI + LTTLL+ELQ +ESL K KG+   KGEANVA S +KF +GS+SGTK +P +S +K+ +K+KG +G   A     
Subjt:  VAFILHSLPASYLSFRTNASMNKIQFNLTTLLSELQIYESLQKSKGKNVVKGEANVAHSKKKFLKGSSSGTKYVPQASSSKQIQKRKGDKGKAPAQAVQG

Query:  KGKAKVVADKGRCFHCNADGHWKRNCPRYLAEKRREKEGKFDLLVLETCFVEHDEFAWILDSGAANHVCSSLQG-NNFQQLVEGEMTLKVGTGDVVSARA
        K   K  A KG CFHCN +GHWKRNCP+YLAEK++ K+GK+DLLVLETC VE+D+ AWI+DSGA NHVCSS QG ++++QL  GEMT++VGTG VVSA A
Subjt:  KGKAKVVADKGRCFHCNADGHWKRNCPRYLAEKRREKEGKFDLLVLETCFVEHDEFAWILDSGAANHVCSSLQG-NNFQQLVEGEMTLKVGTGDVVSARA

Query:  VGVAKLFFRNRFLILENLYLVRRIKRNLISVSTLLEQCSSVSFLRNEALISKNGIYICSAKRENNLFVLRPNHAKAILSHEMFKTAETQNKRQKVSPLSN
        VG  +L  +  FL+LEN+Y+V  +KRNLISV  LLEQ  S++F  N+  I KNG+ ICSAK ENNL+VLR   +KA+L+ EMFKTA TQNKR K+SP   
Subjt:  VGVAKLFFRNRFLILENLYLVRRIKRNLISVSTLLEQCSSVSFLRNEALISKNGIYICSAKRENNLFVLRPNHAKAILSHEMFKTAETQNKRQKVSPLSN

Query:  NTYLWHLRLGHINIDRIDRLVKNGLLTDIEDTSLPPCESCLE---------------------------------------------DDYSRYGYLYLMG
        N +LWHLRLGHIN++RI+RLVKNGLL+++E+ SLP CESCLE                                             DDYSRYGY+YLM 
Subjt:  NTYLWHLRLGHINIDRIDRLVKNGLLTDIEDTSLPPCESCLE---------------------------------------------DDYSRYGYLYLMG

Query:  HKSKALEKFKEFKA------------------------------------------------------------------------SSFWGYAVETAVHI
        HKS+ALEKFKE+KA                                                                        +SFWGYAV+TAV+I
Subjt:  HKSKALEKFKEFKA------------------------------------------------------------------------SSFWGYAVETAVHI

Query:  LNSVPSKSVSETPFELWRGRKPSLHYFRIWGCPAHVLVTNPKKLEPR
        LN VPSKSVSETP +LW GRK SL +FRIWGCPAHVL  NPKKLEPR
Subjt:  LNSVPSKSVSETPFELWRGRKPSLHYFRIWGCPAHVLVTNPKKLEPR

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein1.4e-17554.01Show/hide
Query:  VKDAYERWTKANEKVKVYIFVSLSEVLAKRYENVETAREIMNSLQEMFGLLSYQLHHDALKNVFNAKMLEGQSVREHVLDMINQFNIAEANGGVVCERSQ
        V++ YERW KANEK + YI  SLSEVLAK++E++ TAREIM+SLQEMFG  SYQ+ HDALK ++NA+M EG SVREHVL+M+  FN+AE NG V+ E SQ
Subjt:  VKDAYERWTKANEKVKVYIFVSLSEVLAKRYENVETAREIMNSLQEMFGLLSYQLHHDALKNVFNAKMLEGQSVREHVLDMINQFNIAEANGGVVCERSQ

Query:  VAFILHSLPASYLSFRTNASMNKIQFNLTTLLSELQIYESLQKSKGKNVVKGEANVAHSKKKFLKGSSSGTKYVPQASSSKQIQKRKGDKG-KAPAQAVQ
        V+FIL SLP S+L FR+NA MNKI + LTTLL+ELQ +ESL K KG+   KGEANVA S +KF +GS+SGTK +P +S +K+ +K+KG +G KA   A +
Subjt:  VAFILHSLPASYLSFRTNASMNKIQFNLTTLLSELQIYESLQKSKGKNVVKGEANVAHSKKKFLKGSSSGTKYVPQASSSKQIQKRKGDKG-KAPAQAVQ

Query:  GKGKAKVVADKGRCFHCNADGHWKRNCPRYLAEKRREKEGKFDLLVLETCFVEHDEFAWILDSGAANHVCSSLQG-NNFQQLVEGEMTLKVGTGDVVSAR
           KAK  A KG CFHCN +GHWKRNCP+YLAEK++ K+GK+DLLVLETC VE+D+ AWI+DSGA NHVCSS QG ++++QL  GEMT++VGTG VVSA 
Subjt:  GKGKAKVVADKGRCFHCNADGHWKRNCPRYLAEKRREKEGKFDLLVLETCFVEHDEFAWILDSGAANHVCSSLQG-NNFQQLVEGEMTLKVGTGDVVSAR

Query:  AVGVAKLFFRNRFLILENLYLVRRIKRNLISVSTLLEQCSSVSFLRNEALISKNGIYICSAKRENNLFVLRPNHAKAILSHEMFKTAETQNKRQKVSPLS
        AVG  +L  +  FL+LEN+Y+V  +KRNLISV  LLEQ  S++F  N+  I KNG+ ICSAK ENNL+VLR   +KA+L+ EMFKTA TQNKR K+SP  
Subjt:  AVGVAKLFFRNRFLILENLYLVRRIKRNLISVSTLLEQCSSVSFLRNEALISKNGIYICSAKRENNLFVLRPNHAKAILSHEMFKTAETQNKRQKVSPLS

Query:  NNTYLWHLRLGHINIDRIDRLVKNGLLTDIEDTSLPPCESCLE---------------------------------------------DDYSRYGYLYLM
         N +LWHLRLGHIN++RI+RLVKNGLL+++E+ SLP CESCLE                                             DDYSRYGY+YLM
Subjt:  NNTYLWHLRLGHINIDRIDRLVKNGLLTDIEDTSLPPCESCLE---------------------------------------------DDYSRYGYLYLM

Query:  GHKSKALEKFKEFKA------------------------------------------------------------------------SSFWGYAVETAVH
         HKS+ALEKFKE+KA                                                                        +SFWGYAV+TAV+
Subjt:  GHKSKALEKFKEFKA------------------------------------------------------------------------SSFWGYAVETAVH

Query:  ILNSVPSKSVSETPFELWRGRKPSLHYFRIWGCPAHVLVTNPKKLEPR
        ILN VPSKSVSETP +LW GRK SL +FRIWGCPAHVL  NPKKLEPR
Subjt:  ILNSVPSKSVSETPFELWRGRKPSLHYFRIWGCPAHVLVTNPKKLEPR

A0A5A7TU93 Gag/pol protein1.7e-17654.17Show/hide
Query:  VKDAYERWTKANEKVKVYIFVSLSEVLAKRYENVETAREIMNSLQEMFGLLSYQLHHDALKNVFNAKMLEGQSVREHVLDMINQFNIAEANGGVVCERSQ
        V++ YERW KANEK + YI  SLSEVLAK++E++ TAREIM+SLQEMFG  SYQ+ HDALK ++NA+M EG SVREHVL+M+  FN+AE NG V+ E SQ
Subjt:  VKDAYERWTKANEKVKVYIFVSLSEVLAKRYENVETAREIMNSLQEMFGLLSYQLHHDALKNVFNAKMLEGQSVREHVLDMINQFNIAEANGGVVCERSQ

Query:  VAFILHSLPASYLSFRTNASMNKIQFNLTTLLSELQIYESLQKSKGKNVVKGEANVAHSKKKFLKGSSSGTKYVPQASSSKQIQKRKGDKG-KAPAQAVQ
        V+FIL SLP S+L FR+NA MNKI + LTTLL+ELQ +ESL K KG+   KGEANVA S +KF +GS+SGTK +P +S +K+ +K+KG +G KA   A +
Subjt:  VAFILHSLPASYLSFRTNASMNKIQFNLTTLLSELQIYESLQKSKGKNVVKGEANVAHSKKKFLKGSSSGTKYVPQASSSKQIQKRKGDKG-KAPAQAVQ

Query:  GKGKAKVVADKGRCFHCNADGHWKRNCPRYLAEKRREKEGKFDLLVLETCFVEHDEFAWILDSGAANHVCSSLQG-NNFQQLVEGEMTLKVGTGDVVSAR
           KAK  A KG CFHCN +GHWKRNCP+YLAEK++ K+GK+DLLVLETC VE+D+ AWI+DSGA NHVCSS QG +++QQL  GEMT++VGTG VVSA 
Subjt:  GKGKAKVVADKGRCFHCNADGHWKRNCPRYLAEKRREKEGKFDLLVLETCFVEHDEFAWILDSGAANHVCSSLQG-NNFQQLVEGEMTLKVGTGDVVSAR

Query:  AVGVAKLFFRNRFLILENLYLVRRIKRNLISVSTLLEQCSSVSFLRNEALISKNGIYICSAKRENNLFVLRPNHAKAILSHEMFKTAETQNKRQKVSPLS
        AVG  +L+ +  FL+LEN+Y+V  +KRNLISV  LLEQ  S++F  N+  I KNG+ ICSAK ENNL+VLR   +KA+L+ EMFKTA TQNKR K+SP  
Subjt:  AVGVAKLFFRNRFLILENLYLVRRIKRNLISVSTLLEQCSSVSFLRNEALISKNGIYICSAKRENNLFVLRPNHAKAILSHEMFKTAETQNKRQKVSPLS

Query:  NNTYLWHLRLGHINIDRIDRLVKNGLLTDIEDTSLPPCESCLE---------------------------------------------DDYSRYGYLYLM
         N +LWHLRLGHIN++RI+RLVKNGLL+++E+ SLP CESCLE                                             DDYSRYGY+YLM
Subjt:  NNTYLWHLRLGHINIDRIDRLVKNGLLTDIEDTSLPPCESCLE---------------------------------------------DDYSRYGYLYLM

Query:  GHKSKALEKFKEFKA------------------------------------------------------------------------SSFWGYAVETAVH
         HKS+ALEKFKE+KA                                                                        +SFWGYAV+TAV+
Subjt:  GHKSKALEKFKEFKA------------------------------------------------------------------------SSFWGYAVETAVH

Query:  ILNSVPSKSVSETPFELWRGRKPSLHYFRIWGCPAHVLVTNPKKLEPR
        ILN VPSKSVSETP +LW GRK SL +FRIWGCPAHVL  NPKKLEPR
Subjt:  ILNSVPSKSVSETPFELWRGRKPSLHYFRIWGCPAHVLVTNPKKLEPR

A0A5A7TZD7 Gag/pol protein2.2e-17652.26Show/hide
Query:  NWSSWDRFGSVQPI---------FCPVRVLQGRFGSAVKDAYERWTKANEKVKVYIFVSLSEVLAKRYENVETAREIMNSLQEMFGLLSYQLHHDALKNV
        N++SW    +V  I          CP +V        V++ YERW KANEK + YI  SLSEVLAK++E++ TAREIM+SLQEMFG  SYQ+ HDALK +
Subjt:  NWSSWDRFGSVQPI---------FCPVRVLQGRFGSAVKDAYERWTKANEKVKVYIFVSLSEVLAKRYENVETAREIMNSLQEMFGLLSYQLHHDALKNV

Query:  FNAKMLEGQSVREHVLDMINQFNIAEANGGVVCERSQVAFILHSLPASYLSFRTNASMNKIQFNLTTLLSELQIYESLQKSKGKNVVKGEANVAHSKKKF
        +NA+M EG SVREHVL+M+  FN+AE NG V+ E SQV+FIL SLP S+L FR+NA MNKI + LTTLL+ELQ +ESL K KG+   KGEANVA S +KF
Subjt:  FNAKMLEGQSVREHVLDMINQFNIAEANGGVVCERSQVAFILHSLPASYLSFRTNASMNKIQFNLTTLLSELQIYESLQKSKGKNVVKGEANVAHSKKKF

Query:  LKGSSSGTKYVPQASSSKQIQKRKGDKG-KAPAQAVQGKGKAKVVADKGRCFHCNADGHWKRNCPRYLAEKRREKEGKFDLLVLETCFVEHDEFAWILDS
         +GS+SGTK +P +S +K+ +K+KG +G KA   A +   KAK  A KG CFHCN +GHWKRNCP+YLAEK++ K+GK+DLLVLETC VE+D+ AWI+DS
Subjt:  LKGSSSGTKYVPQASSSKQIQKRKGDKG-KAPAQAVQGKGKAKVVADKGRCFHCNADGHWKRNCPRYLAEKRREKEGKFDLLVLETCFVEHDEFAWILDS

Query:  GAANHVCSSLQG-NNFQQLVEGEMTLKVGTGDVVSARAVGVAKLFFRNRFLILENLYLVRRIKRNLISVSTLLEQCSSVSFLRNEALISKNGIYICSAKR
        GA NHVCSS QG ++++QL  GEMT++VGTG VVSA AVG  +L  +  FL+LEN+Y+V  +KRNLISV  LLEQ  S++F  N+  I KNG+ ICSAK 
Subjt:  GAANHVCSSLQG-NNFQQLVEGEMTLKVGTGDVVSARAVGVAKLFFRNRFLILENLYLVRRIKRNLISVSTLLEQCSSVSFLRNEALISKNGIYICSAKR

Query:  ENNLFVLRPNHAKAILSHEMFKTAETQNKRQKVSPLSNNTYLWHLRLGHINIDRIDRLVKNGLLTDIEDTSLPPCESCLE--------------------
        ENNL+VLR   +KA+L+ EMFKTA TQNKR K+SP   N +LWHLRLGHIN++RI+RLVKNGLL+++E+ SLP CESCLE                    
Subjt:  ENNLFVLRPNHAKAILSHEMFKTAETQNKRQKVSPLSNNTYLWHLRLGHINIDRIDRLVKNGLLTDIEDTSLPPCESCLE--------------------

Query:  -------------------------DDYSRYGYLYLMGHKSKALEKFKEFKA------------------------------------------------
                                 DDYSRYGY+YLM HKS+ALEKFKE+KA                                                
Subjt:  -------------------------DDYSRYGYLYLMGHKSKALEKFKEFKA------------------------------------------------

Query:  ------------------------SSFWGYAVETAVHILNSVPSKSVSETPFELWRGRKPSLHYFRIWGCPAHVLVTNPKKLEPR
                                +SFWGYAV+TAV+ILN VPSKSVSETP +LW GRK SL +FRIWGCPAHVL  NPKKLEPR
Subjt:  ------------------------SSFWGYAVETAVHILNSVPSKSVSETPFELWRGRKPSLHYFRIWGCPAHVLVTNPKKLEPR

A0A5D3CPJ6 Gag/pol protein1.1e-17553.63Show/hide
Query:  VKDAYERWTKANEKVKVYIFVSLSEVLAKRYENVETAREIMNSLQEMFGLLSYQLHHDALKNVFNAKMLEGQSVREHVLDMINQFNIAEANGGVVCERSQ
        V++ YERW KANEK + YI  SLSEVLAK++E++ TAREIM+SLQEMFG  SYQ+ HDALK ++NA+M EG SVREHVL+M+  FN+AE NG V+ E SQ
Subjt:  VKDAYERWTKANEKVKVYIFVSLSEVLAKRYENVETAREIMNSLQEMFGLLSYQLHHDALKNVFNAKMLEGQSVREHVLDMINQFNIAEANGGVVCERSQ

Query:  VAFILHSLPASYLSFRTNASMNKIQFNLTTLLSELQIYESLQKSKGKNVVKGEANVAHSKKKFLKGSSSGTKYVPQASSSKQIQKRKGDKGKAPAQAVQG
        V+FIL SLP S+L FR+NA MNKI + LTTLL+ELQ +ESL K KG+   KGEANVA S +KF +GS+SGTK +P +S +K+ +K+KG +G   A     
Subjt:  VAFILHSLPASYLSFRTNASMNKIQFNLTTLLSELQIYESLQKSKGKNVVKGEANVAHSKKKFLKGSSSGTKYVPQASSSKQIQKRKGDKGKAPAQAVQG

Query:  KGKAKVVADKGRCFHCNADGHWKRNCPRYLAEKRREKEGKFDLLVLETCFVEHDEFAWILDSGAANHVCSSLQG-NNFQQLVEGEMTLKVGTGDVVSARA
        K   K  A KG CFHCN +GHWKRNCP+YLAEK++ K+GK+DLLVLETC VE+D+ AWI+DSGA NHVCSS QG ++++QL  GEMT++VGTG VVSA A
Subjt:  KGKAKVVADKGRCFHCNADGHWKRNCPRYLAEKRREKEGKFDLLVLETCFVEHDEFAWILDSGAANHVCSSLQG-NNFQQLVEGEMTLKVGTGDVVSARA

Query:  VGVAKLFFRNRFLILENLYLVRRIKRNLISVSTLLEQCSSVSFLRNEALISKNGIYICSAKRENNLFVLRPNHAKAILSHEMFKTAETQNKRQKVSPLSN
        VG  +L  +  FL+LEN+Y+V  +KRNLISV  LLEQ  S++F  N+  I KNG+ ICSAK ENNL+VLR   +KA+L+ EMFKTA TQNKR K+SP   
Subjt:  VGVAKLFFRNRFLILENLYLVRRIKRNLISVSTLLEQCSSVSFLRNEALISKNGIYICSAKRENNLFVLRPNHAKAILSHEMFKTAETQNKRQKVSPLSN

Query:  NTYLWHLRLGHINIDRIDRLVKNGLLTDIEDTSLPPCESCLE---------------------------------------------DDYSRYGYLYLMG
        N +LWHLRLGHIN++RI+RLVKNGLL+++E+ SLP CESCLE                                             DDYSRYGY+YLM 
Subjt:  NTYLWHLRLGHINIDRIDRLVKNGLLTDIEDTSLPPCESCLE---------------------------------------------DDYSRYGYLYLMG

Query:  HKSKALEKFKEFKA------------------------------------------------------------------------SSFWGYAVETAVHI
        HKS+ALEKFKE+KA                                                                        +SFWGYAV+TAV+I
Subjt:  HKSKALEKFKEFKA------------------------------------------------------------------------SSFWGYAVETAVHI

Query:  LNSVPSKSVSETPFELWRGRKPSLHYFRIWGCPAHVLVTNPKKLEPR
        LN VPSKSVSETP +LW GRK SL +FRIWGCPAHVL  NPKKLEPR
Subjt:  LNSVPSKSVSETPFELWRGRKPSLHYFRIWGCPAHVLVTNPKKLEPR

A0A5D3CSZ6 Gag/pol protein1.1e-17553.63Show/hide
Query:  VKDAYERWTKANEKVKVYIFVSLSEVLAKRYENVETAREIMNSLQEMFGLLSYQLHHDALKNVFNAKMLEGQSVREHVLDMINQFNIAEANGGVVCERSQ
        V++ YERW KANEK + YI  SLSEVLAK++E++ TAREIM+SLQEMFG  SYQ+ HDALK ++NA+M EG SVREHVL+M+  FN+AE NG V+ E SQ
Subjt:  VKDAYERWTKANEKVKVYIFVSLSEVLAKRYENVETAREIMNSLQEMFGLLSYQLHHDALKNVFNAKMLEGQSVREHVLDMINQFNIAEANGGVVCERSQ

Query:  VAFILHSLPASYLSFRTNASMNKIQFNLTTLLSELQIYESLQKSKGKNVVKGEANVAHSKKKFLKGSSSGTKYVPQASSSKQIQKRKGDKGKAPAQAVQG
        V+FIL SLP S+L FR+NA MNKI + LTTLL+ELQ +ESL K KG+   KGEANVA S +KF +GS+SGTK +P +S +K+ +K+KG +G   A     
Subjt:  VAFILHSLPASYLSFRTNASMNKIQFNLTTLLSELQIYESLQKSKGKNVVKGEANVAHSKKKFLKGSSSGTKYVPQASSSKQIQKRKGDKGKAPAQAVQG

Query:  KGKAKVVADKGRCFHCNADGHWKRNCPRYLAEKRREKEGKFDLLVLETCFVEHDEFAWILDSGAANHVCSSLQG-NNFQQLVEGEMTLKVGTGDVVSARA
        K   K  A KG CFHCN +GHWKRNCP+YLAEK++ K+GK+DLLVLETC VE+D+ AWI+DSGA NHVCSS QG ++++QL  GEMT++VGTG VVSA A
Subjt:  KGKAKVVADKGRCFHCNADGHWKRNCPRYLAEKRREKEGKFDLLVLETCFVEHDEFAWILDSGAANHVCSSLQG-NNFQQLVEGEMTLKVGTGDVVSARA

Query:  VGVAKLFFRNRFLILENLYLVRRIKRNLISVSTLLEQCSSVSFLRNEALISKNGIYICSAKRENNLFVLRPNHAKAILSHEMFKTAETQNKRQKVSPLSN
        VG  +L  +  FL+LEN+Y+V  +KRNLISV  LLEQ  S++F  N+  I KNG+ ICSAK ENNL+VLR   +KA+L+ EMFKTA TQNKR K+SP   
Subjt:  VGVAKLFFRNRFLILENLYLVRRIKRNLISVSTLLEQCSSVSFLRNEALISKNGIYICSAKRENNLFVLRPNHAKAILSHEMFKTAETQNKRQKVSPLSN

Query:  NTYLWHLRLGHINIDRIDRLVKNGLLTDIEDTSLPPCESCLE---------------------------------------------DDYSRYGYLYLMG
        N +LWHLRLGHIN++RI+RLVKNGLL+++E+ SLP CESCLE                                             DDYSRYGY+YLM 
Subjt:  NTYLWHLRLGHINIDRIDRLVKNGLLTDIEDTSLPPCESCLE---------------------------------------------DDYSRYGYLYLMG

Query:  HKSKALEKFKEFKA------------------------------------------------------------------------SSFWGYAVETAVHI
        HKS+ALEKFKE+KA                                                                        +SFWGYAV+TAV+I
Subjt:  HKSKALEKFKEFKA------------------------------------------------------------------------SSFWGYAVETAVHI

Query:  LNSVPSKSVSETPFELWRGRKPSLHYFRIWGCPAHVLVTNPKKLEPR
        LN VPSKSVSETP +LW GRK SL +FRIWGCPAHVL  NPKKLEPR
Subjt:  LNSVPSKSVSETPFELWRGRKPSLHYFRIWGCPAHVLVTNPKKLEPR

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.9e-1621.01Show/hide
Query:  VRVLQGRFGSAVKDAYERWTKANEKVKVYIFVSLSEVLAKRYENVETAREIMNSLQEMFGLLSYQLHHDALKNVFNAKMLEGQSVREHVLDMINQFNIAE
        ++V+ G   + V D+   W KA    K  I   LS+       +  TAR+I+ +L  ++   S        K + + K+    S+  H        +   
Subjt:  VRVLQGRFGSAVKDAYERWTKANEKVKVYIFVSLSEVLAKRYENVETAREIMNSLQEMFGLLSYQLHHDALKNVFNAKMLEGQSVREHVLDMINQFNIAE

Query:  ANGGVVCERSQVAFILHSLPASYLSFRTNASMNKIQFNLTTLLSELQIYESLQKSKGKNVVKGEANVAHSKKKFLKGSSSGTKYVPQASSSKQIQKRKGD
        A G  + E  +++ +L +LP+ Y    T A     + NLT    + ++ +   K      +K + N    K       ++   Y         + K +  
Subjt:  ANGGVVCERSQVAFILHSLPASYLSFRTNASMNKIQFNLTTLLSELQIYESLQKSKGKNVVKGEANVAHSKKKFLKGSSSGTKYVPQASSSKQIQKRKGD

Query:  KGKAPAQAVQGKGKAKVVADKGRCFHCNADGHWKRNC---PRYLAEKRREKEGKFD--------LLVLETCFVE-HDEFAWILDSGAANHVCSSLQGNNF
        K   P +  +G  K KV     +C HC  +GH K++C    R L  K +E E +           +V E       D   ++LDSGA++H+ +    + +
Subjt:  KGKAPAQAVQGKGKAKVVADKGRCFHCNADGHWKRNC---PRYLAEKRREKEGKFD--------LLVLETCFVE-HDEFAWILDSGAANHVCSSLQGNNF

Query:  QQLVEGEMTLKVGT---GDVVSARAVGVAKLFFRNRFLILENLYLVRRIKRNLISVSTLLEQCSSVSFLRNEALISKNGIYIC-SAKRENNLFVLRPNHA
           VE    LK+     G+ + A   G+ +L   +  + LE++   +    NL+SV  L E   S+ F ++   ISKNG+ +  ++   NN+ V+     
Subjt:  QQLVEGEMTLKVGT---GDVVSARAVGVAKLFFRNRFLILENLYLVRRIKRNLISVSTLLEQCSSVSFLRNEALISKNGIYIC-SAKRENNLFVLRPNHA

Query:  KAILSHEMFKTAETQNKRQKVSPLSNNTYLWHLRLGHINIDRIDRLVKNGLLTDIE-----DTSLPPCESCLE---------------------------
                   A + N + K     NN  LWH R GHI+  ++  + +  + +D       + S   CE CL                            
Subjt:  KAILSHEMFKTAETQNKRQKVSPLSNNTYLWHLRLGHINIDRIDRLVKNGLLTDIE-----DTSLPPCESCLE---------------------------

Query:  --------------------DDYSRYGYLYLMGHKSKALEKFKEFKA-----------------------------------------------------
                            D ++ Y   YL+ +KS     F++F A                                                     
Subjt:  --------------------DDYSRYGYLYLMGHKSKALEKFKEFKA-----------------------------------------------------

Query:  -------------------SSFWGYAVETAVHILNSVPSKSV---SETPFELWRGRKPSLHYFRIWGCPAHVLVTN
                            SFWG AV TA +++N +PS+++   S+TP+E+W  +KP L + R++G   +V + N
Subjt:  -------------------SSFWGYAVETAVHILNSVPSKSV---SETPFELWRGRKPSLHYFRIWGCPAHVLVTN

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-946.6e-1322.71Show/hide
Query:  ERWTKANEKVKVYIFVSLSEVLAKRYENVETAREIMNSLQEMFGLLSYQLHHDALKNVFNAKMLEGQSVREHVLDMINQFNIAEANGGV-VCERSQVAFI
        E W   +E+    I + LS+ +     + +TAR I   L+ ++   +        K ++   M EG +   H L++ N      AN GV + E  +   +
Subjt:  ERWTKANEKVKVYIFVSLSEVLAKRYENVETAREIMNSLQEMFGLLSYQLHHDALKNVFNAKMLEGQSVREHVLDMINQFNIAEANGGV-VCERSQVAFI

Query:  LHSLPASYLSFRTNASMNKIQFNLTTLLSELQIYESLQKSKGKNVVKGEANVAHSKKKFLKGSSSGTKYVPQASSSKQIQKRKGDKGKAPAQAVQGKGKA
        L+SLP+SY +  T     K    L  + S L + E ++K K +N  +G+A +   + +  + SS+                R G +GK+     + + K+
Subjt:  LHSLPASYLSFRTNASMNKIQFNLTTLLSELQIYESLQKSKGKNVVKGEANVAHSKKKFLKGSSSGTKYVPQASSSKQIQKRKGDKGKAPAQAVQGKGKA

Query:  KVVADKGRCFHCNADGHWKRNCPRYLAEKRREKEGKFD-------------LLVL----ETCFVEHDEFAWILDSGAANHVCSSLQGNNFQQLVEGEM-T
        +V      C++CN  GH+KR+CP     K      K D             +L +    E   +   E  W++D+ A++H  ++   + F + V G+  T
Subjt:  KVVADKGRCFHCNADGHWKRNCPRYLAEKRREKEGKFD-------------LLVL----ETCFVEHDEFAWILDSGAANHVCSSLQGNNFQQLVEGEM-T

Query:  LKVGTGDVVSARAVGVAKLFFRNRF---LILENLYLVRRIKRNLISVSTLLEQCSSVSFLRNEALISKNGIYICSAKRENNLFVLRPNHAKAILSHEMFK
        +K+  G+   ++  G+  +  +      L+L+++  V  ++ NLIS   L        F   +  ++K  + I                AK +    +++
Subjt:  LKVGTGDVVSARAVGVAKLFFRNRF---LILENLYLVRRIKRNLISVSTLLEQCSSVSFLRNEALISKNGIYICSAKRENNLFVLRPNHAKAILSHEMFK

Query:  T-AETQNKRQKVSPLSNNTYLWHLRLGHINIDRIDRLVKNGLLTDIEDTSLPPCESCL
        T AE        +    +  LWH R+GH++   +  L K  L++  + T++ PC+ CL
Subjt:  T-AETQNKRQKVSPLSNNTYLWHLRLGHINIDRIDRLVKNGLLTDIEDTSLPPCESCL

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGAAGAAGAAGAGAGCAGCGTAGAAACCAGAATCAAATGAATAACGTGTCGCGCCTTCCGCAGGTTCCTGAAGATCCTGCAGACCCCCAGAATCGCTTGCTGCAGCA
AAATCCGCCGCTGGAGCAAAATGAGCAGCAAAATAATCAGGCTGAGAATCCTATCTTGGTAGTGAACGATAAGACCAGAGCCATTCGAGCGTATGTTGTTCCAATGTGTG
ATGAATTAAATCCAGGGATTGCACGTCCTCAAATCCAAGCGGCAAATTTTGAAATGAAACCGATAATGTTTCAGATGTTGCAAACCATGGGTCAATTCCATGGTTTGTCA
TTTGAAGACCCTCGTTTACATCTTAAGTCTTTTTTAGGAGTTAGTGATTCTTTTGTAATTCAAGGAGTGCCTAGAGATGCTCTTAGATTAACTTTGTTCTCGTATTGTCT
TAGAGATGGAGCAAAGACATGGTTAAATTCTTTTGCTCCAGGATCGATTAGGACATGGGATAAGTTAGCTGGAAGATTTTTTAGAAAATATTTCCCACCTAATAGAAATG
CTAAATTAAGGAGTGAAATAGTAGGGTTTAGGCAACTTGAAGATGAGACTTTTAGTGAGGCTTGGGAGAGGTTTAAGGAGCTTTTACGAAAGTGCCCCCACCATGGTTTA
CCTCATTGTATTCAAATGGAGACATTTTACAATGGGTTAAATGGAGCAACCCAGGGTATGGTTGATGCTTCGGCTGGAGGGGCCCTTTTGGCAAAAACGTTTAATGAGGC
CCATGAAATTTTAGAGAGAATATCTACTAATAGTTGTCAGTGGTCAGATGTTAGAGGCACAAATAAAAAGGTTAAGAGTGTATTAGAGGTTGATGGTGTGTCCACCATTA
GGGTTGATCTTGCAATGATTGCTAACGATCTTAAGAATGTGACAGTGATTAGTCATCAGCAGCGACAAGCTGTGGAGTCTGCTGCATTGGTGAACCAAGTCACAGAGGAA
GCATGTGTCTATTGTGGTGAAGATCACAATTACGAGTTTTGCCCCAGCAAGCCAGCTTCTGTGTTTTTTGTAGGTAATCAAAGGAATAACCCTTATTCTAACTTTTATAA
TCCAGGTTGGCGCAACCACCCACACTTTACATGGGGAGGACAAGGAAGTAATATGCAAGCACAACAAAAGGTGAACCAGTCGGGATTTGCTAAAGCGCAGGTAATGCCCC
AAAAAAGTAAGCAGGCTATGCCCCAGCAAAATACAGGAAGTTCTCTTGAGGCAATGATGAAAGAATTTATGGCTCGTACAGATGCCGCAATTCAAAGTAATCAAGCTTCA
ATGAGAGCCCTGGAATTGCAAGTGGGTCAGCTAGCTAATGAGCTAAAGGCAATGCCTCAAGGGAAACTTCCCTCAGATACTGAACACCCTAGAAGGGAAGGTAAGGAGCA
CGTAAAGACAGTGACTCTTAGGAGTGACTTTGAGACTAGTCAGGGTGCTGGAGGCAGCAATAAAAATGCTGGAGCATCTGGTTCTTTTCCGGATGAGGAACCACATTATG
TGTCGCCCCCACCTTATGTACCACATCTACCTTTTCCACAAAGGCAAAAGCCTAAGAATCAGGTCTTGAAAATGGGCCCTACCCTCTTTATGGCACGAGAGGGATTTCTG
TATGCTGGTTGGACCACAAACAGGTTGTTCATTAGAGGAGCACTGGTACTTAAGGACCAAGAGGTAGCCAGGGAAAATGTTCCAGAGAAGAAATCCTTAGATTATTGGGG
TTCCCACATAACCACTCAACACATATTCCTTGGAGGTACCGGTGCAACCTTGACGGTAGTGTTCGTGATTTTGCAGCAAGATCAAGATCAAAACTGGGTGTTGGTTGCTG
TATTTTTGTTATCAAAGCTTAGAATGAAGATTTGTGAAGAACACGATAAAATTCGAGGCCAATTGGGAGAAAATCAGTCGTTATTCTGCCCAGAAAACTGCTCCAGACGC
CGTCTCGACGCTGTCACGATTTCGTGCGGCAGTTCGTTTTTACGTGGAGACGTGGGAGAAACCCTAACTGCTGGTTCGGTTCGCGTGAATTGGTCGAGCTGGGACCGGTT
TGGTTCGGTTCAACCCATTTTTTGTCCGGTTCGGGTTCTTCAAGGTCGGTTCGGGTCGGCAGTTAAGGACGCCTACGAACGCTGGACTAAGGCCAATGAAAAGGTCAAAG
TCTATATTTTTGTCAGCTTATCTGAAGTATTGGCCAAGCGTTACGAGAACGTGGAAACTGCCAGGGAGATTATGAATTCCCTGCAGGAGATGTTTGGACTTCTGTCCTAC
CAGCTCCATCATGACGCCTTGAAGAACGTCTTCAATGCCAAGATGCTGGAAGGTCAATCTGTTCGGGAACATGTCCTGGACATGATTAACCAGTTCAACATAGCTGAGGC
AAATGGCGGGGTTGTCTGCGAGCGCAGTCAGGTTGCGTTCATCCTTCACTCGCTTCCGGCGAGCTATCTGTCATTCAGGACGAACGCGAGCATGAATAAGATTCAGTTCA
ACCTGACTACCCTCCTCTCGGAGTTACAGATTTATGAGTCCCTGCAAAAGAGCAAGGGCAAGAATGTGGTTAAAGGAGAGGCCAACGTGGCCCATTCCAAGAAGAAGTTC
CTGAAGGGTTCATCCTCAGGGACTAAATATGTACCCCAGGCTTCTTCATCGAAGCAGATCCAAAAGAGGAAGGGAGACAAGGGGAAGGCTCCTGCACAGGCTGTGCAAGG
GAAGGGGAAGGCCAAGGTCGTGGCCGACAAAGGCAGGTGCTTCCACTGTAACGCAGATGGTCACTGGAAGCGCAACTGTCCCCGCTACCTTGCTGAGAAGAGAAGAGAAA
AAGAAGGTAAATTTGATTTACTTGTGTTAGAGACTTGTTTCGTTGAACATGATGAGTTTGCCTGGATACTGGATTCGGGAGCCGCTAATCATGTTTGCTCTTCTCTTCAG
GGAAATAATTTCCAGCAGCTGGTAGAGGGTGAAATGACGCTCAAGGTTGGAACAGGAGACGTCGTTTCAGCTCGTGCAGTGGGAGTTGCAAAATTATTCTTTAGAAATAG
GTTTTTAATTTTAGAGAACTTGTACTTGGTTCGTAGAATTAAAAGAAACCTTATTTCTGTTTCAACTTTACTAGAACAATGTTCTTCAGTTTCCTTCTTGCGTAATGAAG
CTTTAATCTCAAAGAATGGAATTTACATTTGTTCTGCTAAGCGTGAGAATAATTTATTTGTGTTAAGACCAAATCATGCTAAGGCTATTTTAAGTCATGAAATGTTTAAA
ACGGCTGAAACGCAAAATAAAAGGCAAAAGGTTTCTCCCTTAAGTAACAACACGTATCTTTGGCACCTTCGTCTTGGTCATATTAACATCGATCGGATTGATCGTTTGGT
TAAAAATGGACTTCTAACTGACATAGAAGATACATCTTTACCACCCTGTGAATCGTGTCTCGAGGATGACTATTCAAGATACGGTTATTTGTACCTAATGGGCCATAAGT
CTAAGGCTCTTGAAAAGTTCAAAGAGTTTAAGGCGAGCTCGTTTTGGGGGTATGCAGTAGAGACTGCAGTACATATTCTTAACAGTGTTCCATCAAAAAGTGTTTCTGAA
ACTCCTTTTGAGTTATGGAGGGGGCGTAAACCTAGTTTGCATTACTTTCGAATCTGGGGTTGTCCTGCACATGTGTTAGTGACAAATCCTAAGAAGTTGGAACCACGTCC
ATCAGGTCCCACTGGTAGCTCTATAAGGGCGTTGAGCAAGATCAAGATCAATAACTGGGTGTTGGTTGCTGTATTTTTCTGTTTATCAAAGGTTTTGGCGAAGCGGTTCA
AGAATTTCTTCAAGGCTGGTGTTACGGACACTCGTGAAGGACTAACTAGTCGATATTGGTCTAAGTTCCGTGGACACAGAAAATATGTCTGCAGTGAGAAGAGTGCAACT
ATTCTGCCTCCAGACGCAACAGCGTCGGGACGCTCCTTCAACAGCATCTCGACGCTGTCTCGATACCGAAGGCGCTCGGTAAGGAAAGCACACAGCGTCGGGACGCTACC
TCAACAGCGTCTCGACGCTGTCTCGATTACGCAGGCGCCAGATATGGAAAACGCACAGCGTCGGGACGCTAATCTACACAGCGTCTCGACGCTGTCACGATGGAGGCGCG
CACTAGGGTTTCAACAGGCGCGGTTCAAGCAAGGACGGTCCGGTTCGGACCGGTTCGGCTGGGCTGGGACCTATTTGGTCCGGTTCAGCCAATTTTTGGCATGGTTCGTG
ATTTTGGAGGCCGGTTCGAGGTGGTTCGGGGCGGTTCGTGCTGTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGAAGAAGAAGAGAGCAGCGTAGAAACCAGAATCAAATGAATAACGTGTCGCGCCTTCCGCAGGTTCCTGAAGATCCTGCAGACCCCCAGAATCGCTTGCTGCAGCA
AAATCCGCCGCTGGAGCAAAATGAGCAGCAAAATAATCAGGCTGAGAATCCTATCTTGGTAGTGAACGATAAGACCAGAGCCATTCGAGCGTATGTTGTTCCAATGTGTG
ATGAATTAAATCCAGGGATTGCACGTCCTCAAATCCAAGCGGCAAATTTTGAAATGAAACCGATAATGTTTCAGATGTTGCAAACCATGGGTCAATTCCATGGTTTGTCA
TTTGAAGACCCTCGTTTACATCTTAAGTCTTTTTTAGGAGTTAGTGATTCTTTTGTAATTCAAGGAGTGCCTAGAGATGCTCTTAGATTAACTTTGTTCTCGTATTGTCT
TAGAGATGGAGCAAAGACATGGTTAAATTCTTTTGCTCCAGGATCGATTAGGACATGGGATAAGTTAGCTGGAAGATTTTTTAGAAAATATTTCCCACCTAATAGAAATG
CTAAATTAAGGAGTGAAATAGTAGGGTTTAGGCAACTTGAAGATGAGACTTTTAGTGAGGCTTGGGAGAGGTTTAAGGAGCTTTTACGAAAGTGCCCCCACCATGGTTTA
CCTCATTGTATTCAAATGGAGACATTTTACAATGGGTTAAATGGAGCAACCCAGGGTATGGTTGATGCTTCGGCTGGAGGGGCCCTTTTGGCAAAAACGTTTAATGAGGC
CCATGAAATTTTAGAGAGAATATCTACTAATAGTTGTCAGTGGTCAGATGTTAGAGGCACAAATAAAAAGGTTAAGAGTGTATTAGAGGTTGATGGTGTGTCCACCATTA
GGGTTGATCTTGCAATGATTGCTAACGATCTTAAGAATGTGACAGTGATTAGTCATCAGCAGCGACAAGCTGTGGAGTCTGCTGCATTGGTGAACCAAGTCACAGAGGAA
GCATGTGTCTATTGTGGTGAAGATCACAATTACGAGTTTTGCCCCAGCAAGCCAGCTTCTGTGTTTTTTGTAGGTAATCAAAGGAATAACCCTTATTCTAACTTTTATAA
TCCAGGTTGGCGCAACCACCCACACTTTACATGGGGAGGACAAGGAAGTAATATGCAAGCACAACAAAAGGTGAACCAGTCGGGATTTGCTAAAGCGCAGGTAATGCCCC
AAAAAAGTAAGCAGGCTATGCCCCAGCAAAATACAGGAAGTTCTCTTGAGGCAATGATGAAAGAATTTATGGCTCGTACAGATGCCGCAATTCAAAGTAATCAAGCTTCA
ATGAGAGCCCTGGAATTGCAAGTGGGTCAGCTAGCTAATGAGCTAAAGGCAATGCCTCAAGGGAAACTTCCCTCAGATACTGAACACCCTAGAAGGGAAGGTAAGGAGCA
CGTAAAGACAGTGACTCTTAGGAGTGACTTTGAGACTAGTCAGGGTGCTGGAGGCAGCAATAAAAATGCTGGAGCATCTGGTTCTTTTCCGGATGAGGAACCACATTATG
TGTCGCCCCCACCTTATGTACCACATCTACCTTTTCCACAAAGGCAAAAGCCTAAGAATCAGGTCTTGAAAATGGGCCCTACCCTCTTTATGGCACGAGAGGGATTTCTG
TATGCTGGTTGGACCACAAACAGGTTGTTCATTAGAGGAGCACTGGTACTTAAGGACCAAGAGGTAGCCAGGGAAAATGTTCCAGAGAAGAAATCCTTAGATTATTGGGG
TTCCCACATAACCACTCAACACATATTCCTTGGAGGTACCGGTGCAACCTTGACGGTAGTGTTCGTGATTTTGCAGCAAGATCAAGATCAAAACTGGGTGTTGGTTGCTG
TATTTTTGTTATCAAAGCTTAGAATGAAGATTTGTGAAGAACACGATAAAATTCGAGGCCAATTGGGAGAAAATCAGTCGTTATTCTGCCCAGAAAACTGCTCCAGACGC
CGTCTCGACGCTGTCACGATTTCGTGCGGCAGTTCGTTTTTACGTGGAGACGTGGGAGAAACCCTAACTGCTGGTTCGGTTCGCGTGAATTGGTCGAGCTGGGACCGGTT
TGGTTCGGTTCAACCCATTTTTTGTCCGGTTCGGGTTCTTCAAGGTCGGTTCGGGTCGGCAGTTAAGGACGCCTACGAACGCTGGACTAAGGCCAATGAAAAGGTCAAAG
TCTATATTTTTGTCAGCTTATCTGAAGTATTGGCCAAGCGTTACGAGAACGTGGAAACTGCCAGGGAGATTATGAATTCCCTGCAGGAGATGTTTGGACTTCTGTCCTAC
CAGCTCCATCATGACGCCTTGAAGAACGTCTTCAATGCCAAGATGCTGGAAGGTCAATCTGTTCGGGAACATGTCCTGGACATGATTAACCAGTTCAACATAGCTGAGGC
AAATGGCGGGGTTGTCTGCGAGCGCAGTCAGGTTGCGTTCATCCTTCACTCGCTTCCGGCGAGCTATCTGTCATTCAGGACGAACGCGAGCATGAATAAGATTCAGTTCA
ACCTGACTACCCTCCTCTCGGAGTTACAGATTTATGAGTCCCTGCAAAAGAGCAAGGGCAAGAATGTGGTTAAAGGAGAGGCCAACGTGGCCCATTCCAAGAAGAAGTTC
CTGAAGGGTTCATCCTCAGGGACTAAATATGTACCCCAGGCTTCTTCATCGAAGCAGATCCAAAAGAGGAAGGGAGACAAGGGGAAGGCTCCTGCACAGGCTGTGCAAGG
GAAGGGGAAGGCCAAGGTCGTGGCCGACAAAGGCAGGTGCTTCCACTGTAACGCAGATGGTCACTGGAAGCGCAACTGTCCCCGCTACCTTGCTGAGAAGAGAAGAGAAA
AAGAAGGTAAATTTGATTTACTTGTGTTAGAGACTTGTTTCGTTGAACATGATGAGTTTGCCTGGATACTGGATTCGGGAGCCGCTAATCATGTTTGCTCTTCTCTTCAG
GGAAATAATTTCCAGCAGCTGGTAGAGGGTGAAATGACGCTCAAGGTTGGAACAGGAGACGTCGTTTCAGCTCGTGCAGTGGGAGTTGCAAAATTATTCTTTAGAAATAG
GTTTTTAATTTTAGAGAACTTGTACTTGGTTCGTAGAATTAAAAGAAACCTTATTTCTGTTTCAACTTTACTAGAACAATGTTCTTCAGTTTCCTTCTTGCGTAATGAAG
CTTTAATCTCAAAGAATGGAATTTACATTTGTTCTGCTAAGCGTGAGAATAATTTATTTGTGTTAAGACCAAATCATGCTAAGGCTATTTTAAGTCATGAAATGTTTAAA
ACGGCTGAAACGCAAAATAAAAGGCAAAAGGTTTCTCCCTTAAGTAACAACACGTATCTTTGGCACCTTCGTCTTGGTCATATTAACATCGATCGGATTGATCGTTTGGT
TAAAAATGGACTTCTAACTGACATAGAAGATACATCTTTACCACCCTGTGAATCGTGTCTCGAGGATGACTATTCAAGATACGGTTATTTGTACCTAATGGGCCATAAGT
CTAAGGCTCTTGAAAAGTTCAAAGAGTTTAAGGCGAGCTCGTTTTGGGGGTATGCAGTAGAGACTGCAGTACATATTCTTAACAGTGTTCCATCAAAAAGTGTTTCTGAA
ACTCCTTTTGAGTTATGGAGGGGGCGTAAACCTAGTTTGCATTACTTTCGAATCTGGGGTTGTCCTGCACATGTGTTAGTGACAAATCCTAAGAAGTTGGAACCACGTCC
ATCAGGTCCCACTGGTAGCTCTATAAGGGCGTTGAGCAAGATCAAGATCAATAACTGGGTGTTGGTTGCTGTATTTTTCTGTTTATCAAAGGTTTTGGCGAAGCGGTTCA
AGAATTTCTTCAAGGCTGGTGTTACGGACACTCGTGAAGGACTAACTAGTCGATATTGGTCTAAGTTCCGTGGACACAGAAAATATGTCTGCAGTGAGAAGAGTGCAACT
ATTCTGCCTCCAGACGCAACAGCGTCGGGACGCTCCTTCAACAGCATCTCGACGCTGTCTCGATACCGAAGGCGCTCGGTAAGGAAAGCACACAGCGTCGGGACGCTACC
TCAACAGCGTCTCGACGCTGTCTCGATTACGCAGGCGCCAGATATGGAAAACGCACAGCGTCGGGACGCTAATCTACACAGCGTCTCGACGCTGTCACGATGGAGGCGCG
CACTAGGGTTTCAACAGGCGCGGTTCAAGCAAGGACGGTCCGGTTCGGACCGGTTCGGCTGGGCTGGGACCTATTTGGTCCGGTTCAGCCAATTTTTGGCATGGTTCGTG
ATTTTGGAGGCCGGTTCGAGGTGGTTCGGGGCGGTTCGTGCTGTCTAG
Protein sequenceShow/hide protein sequence
MRRRREQRRNQNQMNNVSRLPQVPEDPADPQNRLLQQNPPLEQNEQQNNQAENPILVVNDKTRAIRAYVVPMCDELNPGIARPQIQAANFEMKPIMFQMLQTMGQFHGLS
FEDPRLHLKSFLGVSDSFVIQGVPRDALRLTLFSYCLRDGAKTWLNSFAPGSIRTWDKLAGRFFRKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGL
PHCIQMETFYNGLNGATQGMVDASAGGALLAKTFNEAHEILERISTNSCQWSDVRGTNKKVKSVLEVDGVSTIRVDLAMIANDLKNVTVISHQQRQAVESAALVNQVTEE
ACVYCGEDHNYEFCPSKPASVFFVGNQRNNPYSNFYNPGWRNHPHFTWGGQGSNMQAQQKVNQSGFAKAQVMPQKSKQAMPQQNTGSSLEAMMKEFMARTDAAIQSNQAS
MRALELQVGQLANELKAMPQGKLPSDTEHPRREGKEHVKTVTLRSDFETSQGAGGSNKNAGASGSFPDEEPHYVSPPPYVPHLPFPQRQKPKNQVLKMGPTLFMAREGFL
YAGWTTNRLFIRGALVLKDQEVARENVPEKKSLDYWGSHITTQHIFLGGTGATLTVVFVILQQDQDQNWVLVAVFLLSKLRMKICEEHDKIRGQLGENQSLFCPENCSRR
RLDAVTISCGSSFLRGDVGETLTAGSVRVNWSSWDRFGSVQPIFCPVRVLQGRFGSAVKDAYERWTKANEKVKVYIFVSLSEVLAKRYENVETAREIMNSLQEMFGLLSY
QLHHDALKNVFNAKMLEGQSVREHVLDMINQFNIAEANGGVVCERSQVAFILHSLPASYLSFRTNASMNKIQFNLTTLLSELQIYESLQKSKGKNVVKGEANVAHSKKKF
LKGSSSGTKYVPQASSSKQIQKRKGDKGKAPAQAVQGKGKAKVVADKGRCFHCNADGHWKRNCPRYLAEKRREKEGKFDLLVLETCFVEHDEFAWILDSGAANHVCSSLQ
GNNFQQLVEGEMTLKVGTGDVVSARAVGVAKLFFRNRFLILENLYLVRRIKRNLISVSTLLEQCSSVSFLRNEALISKNGIYICSAKRENNLFVLRPNHAKAILSHEMFK
TAETQNKRQKVSPLSNNTYLWHLRLGHINIDRIDRLVKNGLLTDIEDTSLPPCESCLEDDYSRYGYLYLMGHKSKALEKFKEFKASSFWGYAVETAVHILNSVPSKSVSE
TPFELWRGRKPSLHYFRIWGCPAHVLVTNPKKLEPRPSGPTGSSIRALSKIKINNWVLVAVFFCLSKVLAKRFKNFFKAGVTDTREGLTSRYWSKFRGHRKYVCSEKSAT
ILPPDATASGRSFNSISTLSRYRRRSVRKAHSVGTLPQQRLDAVSITQAPDMENAQRRDANLHSVSTLSRWRRALGFQQARFKQGRSGSDRFGWAGTYLVRFSQFLAWFV
ILEAGSRWFGAVRAV