CuGenDBv2

Lag0035988 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0035988
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Gag/pol protein
Genome location: chr3:36043653..36053621
RNA-Seq Expression: Lag0035988
Synteny: Lag0035988
Gene Ontology terms:
    GO:0015074 - DNA integration (biological process)
    GO:0003676 - nucleic acid binding (molecular function)
    GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
    IPR001584 - Integrase, catalytic core
    IPR001878 - Zinc finger, CCHC-type
    IPR012337 - Ribonuclease H-like superfamily
    IPR025724 - GAG-pre-integrase domain
    IPR036397 - Ribonuclease H superfamily
    IPR036875 - Zinc finger, CCHC-type superfamily
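
The genome location above uses a `chromosome:start..end` notation. As a minimal sketch, the following Python splits such a string into its parts and computes the length of the genomic span. The function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes 1-based inclusive coordinates, which this record does not state explicitly.

```python
import re


def parse_location(location):
    """Split 'chrom:start..end' into (chrom, start, end) with integer coordinates."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(location):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    _, start, end = parse_location(location)
    return end - start + 1


if __name__ == "__main__":
    loc = "chr3:36043653..36053621"  # location string taken from this record
    print(parse_location(loc))
    print(span_length(loc))
```

Under the inclusive-coordinate assumption the locus spans 9,969 bp, so the gene model presumably contains intronic or other non-coding sequence beyond the coding sequence reported for this gene.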


Homology
GenBank top hits | e value | % identity (alignment follows each hit)
KAA0031826.1 gag/pol protein [Cucumis melo var. makuwa] | 1.1e-149 | 50.84
Query:  NDVSAW------ILDSEDLRFVLTEECPPVPPRTATQTVKDAHERWTKANEKVKVYILVGLSEVLAKRYENVETAREIMNSLQEMFGLPSYQLHHDALKN
        N+ ++W      +L  +DLRFVL EECP VP   AT+TV++ +ERW KANEK + YIL  LSEVLAK++E++ TAREIM+SLQEMFG  SYQ+ HDALK 
Subjt:  NDVSAW------ILDSEDLRFVLTEECPPVPPRTATQTVKDAHERWTKANEKVKVYILVGLSEVLAKRYENVETAREIMNSLQEMFGLPSYQLHHDALKN

Query:  VFNAKMQEGQSVREHVHDMINQFNIVEANGGLVCERSQVAFILHSLPASYMPFRTNASMNKIQFNLTSLLSELQIYESMQKSKSKNVVKGEANVAHSKKK
        ++NA+M EG SVREHV +M+  FN+ E NG ++ E SQV+FIL SLP S++ FR+NA MNKI + LT+LL+ELQ +ES+ K K +   KGEANVA S +K
Subjt:  VFNAKMQEGQSVREHVHDMINQFNIVEANGGLVCERSQVAFILHSLPASYMPFRTNASMNKIQFNLTSLLSELQIYESMQKSKSKNVVKGEANVAHSKKK

Query:  FLKGSSSGTKSVLQASSSKQIQKRKGDKG-KAPAQAVQGKGKAKVVTNKGRCFHCNVDGHWKRNCHRYLAEKKREKE-----------------------
        F +GS+SGTKS+  +S +K+ +K+KG +G KA   A +   KAK    KG CFHCN +GHWKRNC +YLAEKK+ K+                       
Subjt:  FLKGSSSGTKSVLQASSSKQIQKRKGDKG-KAPAQAVQGKGKAKVVTNKGRCFHCNVDGHWKRNCHRYLAEKKREKE-----------------------

Query:  --------------------------------------------------------------------------------------------------AK
                                                                                                          AK
Subjt:  --------------------------------------------------------------------------------------------------AK

Query:  GENNLFVLRPTDAKAILSHEMFKTAETQNKRQKISPLSNNTYWHLRLGHINIDRIDRLVKDGLLTDLEDTSLPPCELCLEGKMTKRPFTGKGDRAEKPLE
         ENNL+VLR   +KA+L+ EMFKTA TQNKR KISP  N   WHLRLGHIN++RI+RLVK+GLL++LE+ SLP CE CLEGKMTKRPFTGKG RA++PLE
Subjt:  GENNLFVLRPTDAKAILSHEMFKTAETQNKRQKISPLSNNTYWHLRLGHINIDRIDRLVKDGLLTDLEDTSLPPCELCLEGKMTKRPFTGKGDRAEKPLE

Query:  LIHLDLCGAMNVKARGGYEYFISFIDDYSRYGYLYLMGHMSEALEKFKEFKTEVENPLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSPPGTPQ
        L+H DLCG MNVKARGG+EYFI+F DDYSRYGY+YLM H SEALEKFKE+K EVEN L KTIKT RSDRGGEY+D +FQ+Y++E GI SQLS PGTPQ
Subjt:  LIHLDLCGAMNVKARGGYEYFISFIDDYSRYGYLYLMGHMSEALEKFKEFKTEVENPLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSPPGTPQ

KAA0047792.1 gag/pol protein [Cucumis melo var. makuwa] | 1.1e-149 | 50.84
Query:  NDVSAW------ILDSEDLRFVLTEECPPVPPRTATQTVKDAHERWTKANEKVKVYILVGLSEVLAKRYENVETAREIMNSLQEMFGLPSYQLHHDALKN
        N+ ++W      +L  +DLRFVL EECP VP   AT+TV++ +ERW KANEK + YIL  LSEVLAK++E++ TAREIM+SLQEMFG  SYQ+ HDALK 
Subjt:  NDVSAW------ILDSEDLRFVLTEECPPVPPRTATQTVKDAHERWTKANEKVKVYILVGLSEVLAKRYENVETAREIMNSLQEMFGLPSYQLHHDALKN

Query:  VFNAKMQEGQSVREHVHDMINQFNIVEANGGLVCERSQVAFILHSLPASYMPFRTNASMNKIQFNLTSLLSELQIYESMQKSKSKNVVKGEANVAHSKKK
        ++NA+M EG SVREHV +M+  FN+ E NG ++ E SQV+FIL SLP S++ FR+NA MNKI + LT+LL+ELQ +ES+ K K +   KGEANVA S +K
Subjt:  VFNAKMQEGQSVREHVHDMINQFNIVEANGGLVCERSQVAFILHSLPASYMPFRTNASMNKIQFNLTSLLSELQIYESMQKSKSKNVVKGEANVAHSKKK

Query:  FLKGSSSGTKSVLQASSSKQIQKRKGDKG-KAPAQAVQGKGKAKVVTNKGRCFHCNVDGHWKRNCHRYLAEKKREKE-----------------------
        F +GS+SGTKS+  +S +K+ +K+KG +G KA   A +   KAK    KG CFHCN +GHWKRNC +YLAEKK+ K+                       
Subjt:  FLKGSSSGTKSVLQASSSKQIQKRKGDKG-KAPAQAVQGKGKAKVVTNKGRCFHCNVDGHWKRNCHRYLAEKKREKE-----------------------

Query:  --------------------------------------------------------------------------------------------------AK
                                                                                                          AK
Subjt:  --------------------------------------------------------------------------------------------------AK

Query:  GENNLFVLRPTDAKAILSHEMFKTAETQNKRQKISPLSNNTYWHLRLGHINIDRIDRLVKDGLLTDLEDTSLPPCELCLEGKMTKRPFTGKGDRAEKPLE
         ENNL+VLR   +KA+L+ EMFKTA TQNKR KISP  N   WHLRLGHIN++RI+RLVK+GLL++LE+ SLP CE CLEGKMTKRPFTGKG RA++PLE
Subjt:  GENNLFVLRPTDAKAILSHEMFKTAETQNKRQKISPLSNNTYWHLRLGHINIDRIDRLVKDGLLTDLEDTSLPPCELCLEGKMTKRPFTGKGDRAEKPLE

Query:  LIHLDLCGAMNVKARGGYEYFISFIDDYSRYGYLYLMGHMSEALEKFKEFKTEVENPLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSPPGTPQ
        L+H DLCG MNVKARGG+EYFI+F DDYSRYGY+YLM H SEALEKFKE+K EVEN L KTIKT RSDRGGEY+D +FQ+Y++E GI SQLS PGTPQ
Subjt:  LIHLDLCGAMNVKARGGYEYFISFIDDYSRYGYLYLMGHMSEALEKFKEFKTEVENPLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSPPGTPQ

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa] | 8.7e-150 | 50.92
Query:  NDVSAW-----ILDSEDLRFVLTEECPPVPPRTATQTVKDAHERWTKANEKVKVYILVGLSEVLAKRYENVETAREIMNSLQEMFGLPSYQLHHDALKNV
        N+ ++W     +L  +DLRFVL EECP VP   AT+TV++ +ERW KANEK + YIL  LSEVLAK++E++ TAREIM+SLQEMFG  SYQ+ HDALK +
Subjt:  NDVSAW-----ILDSEDLRFVLTEECPPVPPRTATQTVKDAHERWTKANEKVKVYILVGLSEVLAKRYENVETAREIMNSLQEMFGLPSYQLHHDALKNV

Query:  FNAKMQEGQSVREHVHDMINQFNIVEANGGLVCERSQVAFILHSLPASYMPFRTNASMNKIQFNLTSLLSELQIYESMQKSKSKNVVKGEANVAHSKKKF
        +NA+M EG SVREHV +M+  FN+ E NG ++ E SQV+FIL SLP S++ FR+NA MNKI + LT+LL+ELQ +ES+ K K +   KGEANVA S +KF
Subjt:  FNAKMQEGQSVREHVHDMINQFNIVEANGGLVCERSQVAFILHSLPASYMPFRTNASMNKIQFNLTSLLSELQIYESMQKSKSKNVVKGEANVAHSKKKF

Query:  LKGSSSGTKSVLQASSSKQIQKRKGDKG-KAPAQAVQGKGKAKVVTNKGRCFHCNVDGHWKRNCHRYLAEKKREKE------------------------
         +GS+SGTKS+  +S +K+ +K+KG +G KA   A +   KAK    KG CFHCN +GHWKRNC +YLAEKK+ K+                        
Subjt:  LKGSSSGTKSVLQASSSKQIQKRKGDKG-KAPAQAVQGKGKAKVVTNKGRCFHCNVDGHWKRNCHRYLAEKKREKE------------------------

Query:  -------------------------------------------------------------------------------------------------AKG
                                                                                                         AK 
Subjt:  -------------------------------------------------------------------------------------------------AKG

Query:  ENNLFVLRPTDAKAILSHEMFKTAETQNKRQKISPLSNNTYWHLRLGHINIDRIDRLVKDGLLTDLEDTSLPPCELCLEGKMTKRPFTGKGDRAEKPLEL
        ENNL+VLR   +KA+L+ EMFKTA TQNKR KISP  N   WHLRLGHIN++RI+RLVK+GLL++LE+ SLP CE CLEGKMTKRPFTGKG RA++PLEL
Subjt:  ENNLFVLRPTDAKAILSHEMFKTAETQNKRQKISPLSNNTYWHLRLGHINIDRIDRLVKDGLLTDLEDTSLPPCELCLEGKMTKRPFTGKGDRAEKPLEL

Query:  IHLDLCGAMNVKARGGYEYFISFIDDYSRYGYLYLMGHMSEALEKFKEFKTEVENPLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSPPGTPQ
        +H DLCG MNVKARGG+EYFI+F DDYSRYGY+YLM H SEALEKFKE+K EVEN L KTIKT RSDRGGEY+D +FQ+Y++E GI SQLS PGTPQ
Subjt:  IHLDLCGAMNVKARGGYEYFISFIDDYSRYGYLYLMGHMSEALEKFKEFKTEVENPLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSPPGTPQ

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa] | 1.1e-149 | 50.84
Query:  NDVSAW------ILDSEDLRFVLTEECPPVPPRTATQTVKDAHERWTKANEKVKVYILVGLSEVLAKRYENVETAREIMNSLQEMFGLPSYQLHHDALKN
        N+ ++W      +L  +DLRFVL EECP VP   AT+TV++ +ERW KANEK + YIL  LSEVLAK++E++ TAREIM+SLQEMFG  SYQ+ HDALK 
Subjt:  NDVSAW------ILDSEDLRFVLTEECPPVPPRTATQTVKDAHERWTKANEKVKVYILVGLSEVLAKRYENVETAREIMNSLQEMFGLPSYQLHHDALKN

Query:  VFNAKMQEGQSVREHVHDMINQFNIVEANGGLVCERSQVAFILHSLPASYMPFRTNASMNKIQFNLTSLLSELQIYESMQKSKSKNVVKGEANVAHSKKK
        ++NA+M EG SVREHV +M+  FN+ E NG ++ E SQV+FIL SLP S++ FR+NA MNKI + LT+LL+ELQ +ES+ K K +   KGEANVA S +K
Subjt:  VFNAKMQEGQSVREHVHDMINQFNIVEANGGLVCERSQVAFILHSLPASYMPFRTNASMNKIQFNLTSLLSELQIYESMQKSKSKNVVKGEANVAHSKKK

Query:  FLKGSSSGTKSVLQASSSKQIQKRKGDKG-KAPAQAVQGKGKAKVVTNKGRCFHCNVDGHWKRNCHRYLAEKKREKE-----------------------
        F +GS+SGTKS+  +S +K+ +K+KG +G KA   A +   KAK    KG CFHCN +GHWKRNC +YLAEKK+ K+                       
Subjt:  FLKGSSSGTKSVLQASSSKQIQKRKGDKG-KAPAQAVQGKGKAKVVTNKGRCFHCNVDGHWKRNCHRYLAEKKREKE-----------------------

Query:  --------------------------------------------------------------------------------------------------AK
                                                                                                          AK
Subjt:  --------------------------------------------------------------------------------------------------AK

Query:  GENNLFVLRPTDAKAILSHEMFKTAETQNKRQKISPLSNNTYWHLRLGHINIDRIDRLVKDGLLTDLEDTSLPPCELCLEGKMTKRPFTGKGDRAEKPLE
         ENNL+VLR   +KA+L+ EMFKTA TQNKR KISP  N   WHLRLGHIN++RI+RLVK+GLL++LE+ SLP CE CLEGKMTKRPFTGKG RA++PLE
Subjt:  GENNLFVLRPTDAKAILSHEMFKTAETQNKRQKISPLSNNTYWHLRLGHINIDRIDRLVKDGLLTDLEDTSLPPCELCLEGKMTKRPFTGKGDRAEKPLE

Query:  LIHLDLCGAMNVKARGGYEYFISFIDDYSRYGYLYLMGHMSEALEKFKEFKTEVENPLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSPPGTPQ
        L+H DLCG MNVKARGG+EYFI+F DDYSRYGY+YLM H SEALEKFKE+K EVEN L KTIKT RSDRGGEY+D +FQ+Y++E GI SQLS PGTPQ
Subjt:  LIHLDLCGAMNVKARGGYEYFISFIDDYSRYGYLYLMGHMSEALEKFKEFKTEVENPLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSPPGTPQ

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa] | 1.9e-149 | 50.42
Query:  NDVSAW------ILDSEDLRFVLTEECPPVPPRTATQTVKDAHERWTKANEKVKVYILVGLSEVLAKRYENVETAREIMNSLQEMFGLPSYQLHHDALKN
        N+ ++W      +L  +DLRFVL EECP VP   AT+TV++ +ERW KANEK + YIL  LSEVLAK++E++ TAREIM+SLQEMFG  SYQ+ HDALK 
Subjt:  NDVSAW------ILDSEDLRFVLTEECPPVPPRTATQTVKDAHERWTKANEKVKVYILVGLSEVLAKRYENVETAREIMNSLQEMFGLPSYQLHHDALKN

Query:  VFNAKMQEGQSVREHVHDMINQFNIVEANGGLVCERSQVAFILHSLPASYMPFRTNASMNKIQFNLTSLLSELQIYESMQKSKSKNVVKGEANVAHSKKK
        ++NA+M EG SVREHV +M+  FN+ E NG ++ E SQV+FIL SLP S++ FR+NA MNKI + LT+LL+ELQ +ES+ K K +   KGEANVA S +K
Subjt:  VFNAKMQEGQSVREHVHDMINQFNIVEANGGLVCERSQVAFILHSLPASYMPFRTNASMNKIQFNLTSLLSELQIYESMQKSKSKNVVKGEANVAHSKKK

Query:  FLKGSSSGTKSVLQASSSKQIQKRKGDKGKAPAQAVQGKGKAKVVTNKGRCFHCNVDGHWKRNCHRYLAEKKREKE------------------------
        F +GS+SGTKS+  +S +K+ +K+KG +G   A     K   K    KG CFHCN +GHWKRNC +YLAEKK+ K+                        
Subjt:  FLKGSSSGTKSVLQASSSKQIQKRKGDKGKAPAQAVQGKGKAKVVTNKGRCFHCNVDGHWKRNCHRYLAEKKREKE------------------------

Query:  -------------------------------------------------------------------------------------------------AKG
                                                                                                         AK 
Subjt:  -------------------------------------------------------------------------------------------------AKG

Query:  ENNLFVLRPTDAKAILSHEMFKTAETQNKRQKISPLSNNTYWHLRLGHINIDRIDRLVKDGLLTDLEDTSLPPCELCLEGKMTKRPFTGKGDRAEKPLEL
        ENNL+VLR   +KA+L+ EMFKTA TQNKR KISP  N   WHLRLGHIN++RI+RLVK+GLL++LE+ SLP CE CLEGKMTKRPFTGKG RA++PLEL
Subjt:  ENNLFVLRPTDAKAILSHEMFKTAETQNKRQKISPLSNNTYWHLRLGHINIDRIDRLVKDGLLTDLEDTSLPPCELCLEGKMTKRPFTGKGDRAEKPLEL

Query:  IHLDLCGAMNVKARGGYEYFISFIDDYSRYGYLYLMGHMSEALEKFKEFKTEVENPLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSPPGTPQ
        +H DLCG MNVKARGG+EYFI+F DDYSRYGY+YLM H SEALEKFKE+K EVEN L KTIKT RSDRGGEY+D +FQ+Y++E GI SQLS PGTPQ
Subjt:  IHLDLCGAMNVKARGGYEYFISFIDDYSRYGYLYLMGHMSEALEKFKEFKTEVENPLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSPPGTPQ

TrEMBL top hits | e value | % identity (alignment follows each hit)
A0A5A7SMH8 Gag/pol protein | 5.5e-150 | 50.84
Query:  NDVSAW------ILDSEDLRFVLTEECPPVPPRTATQTVKDAHERWTKANEKVKVYILVGLSEVLAKRYENVETAREIMNSLQEMFGLPSYQLHHDALKN
        N+ ++W      +L  +DLRFVL EECP VP   AT+TV++ +ERW KANEK + YIL  LSEVLAK++E++ TAREIM+SLQEMFG  SYQ+ HDALK 
Subjt:  NDVSAW------ILDSEDLRFVLTEECPPVPPRTATQTVKDAHERWTKANEKVKVYILVGLSEVLAKRYENVETAREIMNSLQEMFGLPSYQLHHDALKN

Query:  VFNAKMQEGQSVREHVHDMINQFNIVEANGGLVCERSQVAFILHSLPASYMPFRTNASMNKIQFNLTSLLSELQIYESMQKSKSKNVVKGEANVAHSKKK
        ++NA+M EG SVREHV +M+  FN+ E NG ++ E SQV+FIL SLP S++ FR+NA MNKI + LT+LL+ELQ +ES+ K K +   KGEANVA S +K
Subjt:  VFNAKMQEGQSVREHVHDMINQFNIVEANGGLVCERSQVAFILHSLPASYMPFRTNASMNKIQFNLTSLLSELQIYESMQKSKSKNVVKGEANVAHSKKK

Query:  FLKGSSSGTKSVLQASSSKQIQKRKGDKG-KAPAQAVQGKGKAKVVTNKGRCFHCNVDGHWKRNCHRYLAEKKREKE-----------------------
        F +GS+SGTKS+  +S +K+ +K+KG +G KA   A +   KAK    KG CFHCN +GHWKRNC +YLAEKK+ K+                       
Subjt:  FLKGSSSGTKSVLQASSSKQIQKRKGDKG-KAPAQAVQGKGKAKVVTNKGRCFHCNVDGHWKRNCHRYLAEKKREKE-----------------------

Query:  --------------------------------------------------------------------------------------------------AK
                                                                                                          AK
Subjt:  --------------------------------------------------------------------------------------------------AK

Query:  GENNLFVLRPTDAKAILSHEMFKTAETQNKRQKISPLSNNTYWHLRLGHINIDRIDRLVKDGLLTDLEDTSLPPCELCLEGKMTKRPFTGKGDRAEKPLE
         ENNL+VLR   +KA+L+ EMFKTA TQNKR KISP  N   WHLRLGHIN++RI+RLVK+GLL++LE+ SLP CE CLEGKMTKRPFTGKG RA++PLE
Subjt:  GENNLFVLRPTDAKAILSHEMFKTAETQNKRQKISPLSNNTYWHLRLGHINIDRIDRLVKDGLLTDLEDTSLPPCELCLEGKMTKRPFTGKGDRAEKPLE

Query:  LIHLDLCGAMNVKARGGYEYFISFIDDYSRYGYLYLMGHMSEALEKFKEFKTEVENPLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSPPGTPQ
        L+H DLCG MNVKARGG+EYFI+F DDYSRYGY+YLM H SEALEKFKE+K EVEN L KTIKT RSDRGGEY+D +FQ+Y++E GI SQLS PGTPQ
Subjt:  LIHLDLCGAMNVKARGGYEYFISFIDDYSRYGYLYLMGHMSEALEKFKEFKTEVENPLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSPPGTPQ

A0A5A7TWB9 Gag/pol protein | 5.5e-150 | 50.84
Query:  NDVSAW------ILDSEDLRFVLTEECPPVPPRTATQTVKDAHERWTKANEKVKVYILVGLSEVLAKRYENVETAREIMNSLQEMFGLPSYQLHHDALKN
        N+ ++W      +L  +DLRFVL EECP VP   AT+TV++ +ERW KANEK + YIL  LSEVLAK++E++ TAREIM+SLQEMFG  SYQ+ HDALK 
Subjt:  NDVSAW------ILDSEDLRFVLTEECPPVPPRTATQTVKDAHERWTKANEKVKVYILVGLSEVLAKRYENVETAREIMNSLQEMFGLPSYQLHHDALKN

Query:  VFNAKMQEGQSVREHVHDMINQFNIVEANGGLVCERSQVAFILHSLPASYMPFRTNASMNKIQFNLTSLLSELQIYESMQKSKSKNVVKGEANVAHSKKK
        ++NA+M EG SVREHV +M+  FN+ E NG ++ E SQV+FIL SLP S++ FR+NA MNKI + LT+LL+ELQ +ES+ K K +   KGEANVA S +K
Subjt:  VFNAKMQEGQSVREHVHDMINQFNIVEANGGLVCERSQVAFILHSLPASYMPFRTNASMNKIQFNLTSLLSELQIYESMQKSKSKNVVKGEANVAHSKKK

Query:  FLKGSSSGTKSVLQASSSKQIQKRKGDKG-KAPAQAVQGKGKAKVVTNKGRCFHCNVDGHWKRNCHRYLAEKKREKE-----------------------
        F +GS+SGTKS+  +S +K+ +K+KG +G KA   A +   KAK    KG CFHCN +GHWKRNC +YLAEKK+ K+                       
Subjt:  FLKGSSSGTKSVLQASSSKQIQKRKGDKG-KAPAQAVQGKGKAKVVTNKGRCFHCNVDGHWKRNCHRYLAEKKREKE-----------------------

Query:  --------------------------------------------------------------------------------------------------AK
                                                                                                          AK
Subjt:  --------------------------------------------------------------------------------------------------AK

Query:  GENNLFVLRPTDAKAILSHEMFKTAETQNKRQKISPLSNNTYWHLRLGHINIDRIDRLVKDGLLTDLEDTSLPPCELCLEGKMTKRPFTGKGDRAEKPLE
         ENNL+VLR   +KA+L+ EMFKTA TQNKR KISP  N   WHLRLGHIN++RI+RLVK+GLL++LE+ SLP CE CLEGKMTKRPFTGKG RA++PLE
Subjt:  GENNLFVLRPTDAKAILSHEMFKTAETQNKRQKISPLSNNTYWHLRLGHINIDRIDRLVKDGLLTDLEDTSLPPCELCLEGKMTKRPFTGKGDRAEKPLE

Query:  LIHLDLCGAMNVKARGGYEYFISFIDDYSRYGYLYLMGHMSEALEKFKEFKTEVENPLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSPPGTPQ
        L+H DLCG MNVKARGG+EYFI+F DDYSRYGY+YLM H SEALEKFKE+K EVEN L KTIKT RSDRGGEY+D +FQ+Y++E GI SQLS PGTPQ
Subjt:  LIHLDLCGAMNVKARGGYEYFISFIDDYSRYGYLYLMGHMSEALEKFKEFKTEVENPLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSPPGTPQ

A0A5A7TZD7 Gag/pol protein | 4.2e-150 | 50.92
Query:  NDVSAW-----ILDSEDLRFVLTEECPPVPPRTATQTVKDAHERWTKANEKVKVYILVGLSEVLAKRYENVETAREIMNSLQEMFGLPSYQLHHDALKNV
        N+ ++W     +L  +DLRFVL EECP VP   AT+TV++ +ERW KANEK + YIL  LSEVLAK++E++ TAREIM+SLQEMFG  SYQ+ HDALK +
Subjt:  NDVSAW-----ILDSEDLRFVLTEECPPVPPRTATQTVKDAHERWTKANEKVKVYILVGLSEVLAKRYENVETAREIMNSLQEMFGLPSYQLHHDALKNV

Query:  FNAKMQEGQSVREHVHDMINQFNIVEANGGLVCERSQVAFILHSLPASYMPFRTNASMNKIQFNLTSLLSELQIYESMQKSKSKNVVKGEANVAHSKKKF
        +NA+M EG SVREHV +M+  FN+ E NG ++ E SQV+FIL SLP S++ FR+NA MNKI + LT+LL+ELQ +ES+ K K +   KGEANVA S +KF
Subjt:  FNAKMQEGQSVREHVHDMINQFNIVEANGGLVCERSQVAFILHSLPASYMPFRTNASMNKIQFNLTSLLSELQIYESMQKSKSKNVVKGEANVAHSKKKF

Query:  LKGSSSGTKSVLQASSSKQIQKRKGDKG-KAPAQAVQGKGKAKVVTNKGRCFHCNVDGHWKRNCHRYLAEKKREKE------------------------
         +GS+SGTKS+  +S +K+ +K+KG +G KA   A +   KAK    KG CFHCN +GHWKRNC +YLAEKK+ K+                        
Subjt:  LKGSSSGTKSVLQASSSKQIQKRKGDKG-KAPAQAVQGKGKAKVVTNKGRCFHCNVDGHWKRNCHRYLAEKKREKE------------------------

Query:  -------------------------------------------------------------------------------------------------AKG
                                                                                                         AK 
Subjt:  -------------------------------------------------------------------------------------------------AKG

Query:  ENNLFVLRPTDAKAILSHEMFKTAETQNKRQKISPLSNNTYWHLRLGHINIDRIDRLVKDGLLTDLEDTSLPPCELCLEGKMTKRPFTGKGDRAEKPLEL
        ENNL+VLR   +KA+L+ EMFKTA TQNKR KISP  N   WHLRLGHIN++RI+RLVK+GLL++LE+ SLP CE CLEGKMTKRPFTGKG RA++PLEL
Subjt:  ENNLFVLRPTDAKAILSHEMFKTAETQNKRQKISPLSNNTYWHLRLGHINIDRIDRLVKDGLLTDLEDTSLPPCELCLEGKMTKRPFTGKGDRAEKPLEL

Query:  IHLDLCGAMNVKARGGYEYFISFIDDYSRYGYLYLMGHMSEALEKFKEFKTEVENPLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSPPGTPQ
        +H DLCG MNVKARGG+EYFI+F DDYSRYGY+YLM H SEALEKFKE+K EVEN L KTIKT RSDRGGEY+D +FQ+Y++E GI SQLS PGTPQ
Subjt:  IHLDLCGAMNVKARGGYEYFISFIDDYSRYGYLYLMGHMSEALEKFKEFKTEVENPLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSPPGTPQ

A0A5A7UGV2 Gag/pol protein | 5.5e-150 | 50.84
Query:  NDVSAW------ILDSEDLRFVLTEECPPVPPRTATQTVKDAHERWTKANEKVKVYILVGLSEVLAKRYENVETAREIMNSLQEMFGLPSYQLHHDALKN
        N+ ++W      +L  +DLRFVL EECP VP   AT+TV++ +ERW KANEK + YIL  LSEVLAK++E++ TAREIM+SLQEMFG  SYQ+ HDALK 
Subjt:  NDVSAW------ILDSEDLRFVLTEECPPVPPRTATQTVKDAHERWTKANEKVKVYILVGLSEVLAKRYENVETAREIMNSLQEMFGLPSYQLHHDALKN

Query:  VFNAKMQEGQSVREHVHDMINQFNIVEANGGLVCERSQVAFILHSLPASYMPFRTNASMNKIQFNLTSLLSELQIYESMQKSKSKNVVKGEANVAHSKKK
        ++NA+M EG SVREHV +M+  FN+ E NG ++ E SQV+FIL SLP S++ FR+NA MNKI + LT+LL+ELQ +ES+ K K +   KGEANVA S +K
Subjt:  VFNAKMQEGQSVREHVHDMINQFNIVEANGGLVCERSQVAFILHSLPASYMPFRTNASMNKIQFNLTSLLSELQIYESMQKSKSKNVVKGEANVAHSKKK

Query:  FLKGSSSGTKSVLQASSSKQIQKRKGDKG-KAPAQAVQGKGKAKVVTNKGRCFHCNVDGHWKRNCHRYLAEKKREKE-----------------------
        F +GS+SGTKS+  +S +K+ +K+KG +G KA   A +   KAK    KG CFHCN +GHWKRNC +YLAEKK+ K+                       
Subjt:  FLKGSSSGTKSVLQASSSKQIQKRKGDKG-KAPAQAVQGKGKAKVVTNKGRCFHCNVDGHWKRNCHRYLAEKKREKE-----------------------

Query:  --------------------------------------------------------------------------------------------------AK
                                                                                                          AK
Subjt:  --------------------------------------------------------------------------------------------------AK

Query:  GENNLFVLRPTDAKAILSHEMFKTAETQNKRQKISPLSNNTYWHLRLGHINIDRIDRLVKDGLLTDLEDTSLPPCELCLEGKMTKRPFTGKGDRAEKPLE
         ENNL+VLR   +KA+L+ EMFKTA TQNKR KISP  N   WHLRLGHIN++RI+RLVK+GLL++LE+ SLP CE CLEGKMTKRPFTGKG RA++PLE
Subjt:  GENNLFVLRPTDAKAILSHEMFKTAETQNKRQKISPLSNNTYWHLRLGHINIDRIDRLVKDGLLTDLEDTSLPPCELCLEGKMTKRPFTGKGDRAEKPLE

Query:  LIHLDLCGAMNVKARGGYEYFISFIDDYSRYGYLYLMGHMSEALEKFKEFKTEVENPLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSPPGTPQ
        L+H DLCG MNVKARGG+EYFI+F DDYSRYGY+YLM H SEALEKFKE+K EVEN L KTIKT RSDRGGEY+D +FQ+Y++E GI SQLS PGTPQ
Subjt:  LIHLDLCGAMNVKARGGYEYFISFIDDYSRYGYLYLMGHMSEALEKFKEFKTEVENPLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSPPGTPQ

A0A5D3CPJ6 Gag/pol protein | 9.3e-150 | 50.42
Query:  NDVSAW------ILDSEDLRFVLTEECPPVPPRTATQTVKDAHERWTKANEKVKVYILVGLSEVLAKRYENVETAREIMNSLQEMFGLPSYQLHHDALKN
        N+ ++W      +L  +DLRFVL EECP VP   AT+TV++ +ERW KANEK + YIL  LSEVLAK++E++ TAREIM+SLQEMFG  SYQ+ HDALK 
Subjt:  NDVSAW------ILDSEDLRFVLTEECPPVPPRTATQTVKDAHERWTKANEKVKVYILVGLSEVLAKRYENVETAREIMNSLQEMFGLPSYQLHHDALKN

Query:  VFNAKMQEGQSVREHVHDMINQFNIVEANGGLVCERSQVAFILHSLPASYMPFRTNASMNKIQFNLTSLLSELQIYESMQKSKSKNVVKGEANVAHSKKK
        ++NA+M EG SVREHV +M+  FN+ E NG ++ E SQV+FIL SLP S++ FR+NA MNKI + LT+LL+ELQ +ES+ K K +   KGEANVA S +K
Subjt:  VFNAKMQEGQSVREHVHDMINQFNIVEANGGLVCERSQVAFILHSLPASYMPFRTNASMNKIQFNLTSLLSELQIYESMQKSKSKNVVKGEANVAHSKKK

Query:  FLKGSSSGTKSVLQASSSKQIQKRKGDKGKAPAQAVQGKGKAKVVTNKGRCFHCNVDGHWKRNCHRYLAEKKREKE------------------------
        F +GS+SGTKS+  +S +K+ +K+KG +G   A     K   K    KG CFHCN +GHWKRNC +YLAEKK+ K+                        
Subjt:  FLKGSSSGTKSVLQASSSKQIQKRKGDKGKAPAQAVQGKGKAKVVTNKGRCFHCNVDGHWKRNCHRYLAEKKREKE------------------------

Query:  -------------------------------------------------------------------------------------------------AKG
                                                                                                         AK 
Subjt:  -------------------------------------------------------------------------------------------------AKG

Query:  ENNLFVLRPTDAKAILSHEMFKTAETQNKRQKISPLSNNTYWHLRLGHINIDRIDRLVKDGLLTDLEDTSLPPCELCLEGKMTKRPFTGKGDRAEKPLEL
        ENNL+VLR   +KA+L+ EMFKTA TQNKR KISP  N   WHLRLGHIN++RI+RLVK+GLL++LE+ SLP CE CLEGKMTKRPFTGKG RA++PLEL
Subjt:  ENNLFVLRPTDAKAILSHEMFKTAETQNKRQKISPLSNNTYWHLRLGHINIDRIDRLVKDGLLTDLEDTSLPPCELCLEGKMTKRPFTGKGDRAEKPLEL

Query:  IHLDLCGAMNVKARGGYEYFISFIDDYSRYGYLYLMGHMSEALEKFKEFKTEVENPLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSPPGTPQ
        +H DLCG MNVKARGG+EYFI+F DDYSRYGY+YLM H SEALEKFKE+K EVEN L KTIKT RSDRGGEY+D +FQ+Y++E GI SQLS PGTPQ
Subjt:  IHLDLCGAMNVKARGGYEYFISFIDDYSRYGYLYLMGHMSEALEKFKEFKTEVENPLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSPPGTPQ

SwissProt top hits | e value | % identity (alignment follows each hit)
P04146 Copia protein | 3.0e-20 | 33.17
Query:  MKDLGEAQYVLGIQILRDRKNRLLALSQTSYIDKILNRYSMQNSKRGLLPFRHGIHLSKEQSPQTPQEVEDMKQVPYASADGSIMYAMLCTHLDICYAIG
        M DL E ++ +GI+I  + +   + LSQ++Y+ KIL++++M+N      P    I+     S       ++    P  S  G +MY MLCT  D+  A+ 
Subjt:  MKDLGEAQYVLGIQILRDRKNRLLALSQTSYIDKILNRYSMQNSKRGLLPFRHGIHLSKEQSPQTPQEVEDMKQVPYASADGSIMYAMLCTHLDICYAIG

Query:  IVSRYKSNPSQAHWNVVKNILKYLRRTRDYSFVYGFKNLIF----TGYTDSGFQTDIDSRRSTSGSVFTL-NGGAVVWRNIRQGCIADSTMEAEYVVACE
        I+SRY S  +   W  +K +L+YL+ T D   ++  KNL F     GY DS +      R+ST+G +F + +   + W   RQ  +A S+ EAEY+   E
Subjt:  IVSRYKSNPSQAHWNVVKNILKYLRRTRDYSFVYGFKNLIF----TGYTDSGFQTDIDSRRSTSGSVFTL-NGGAVVWRNIRQGCIADSTMEAEYVVACE

Query:  ASKEA
        A +EA
Subjt:  ASKEA

P0CV72 Secreted RxLR effector protein 161 | 5.8e-24 | 43.94
Query:  MKQVPYASADGSIMYAMLCTHLDICYAIGIVSRYKSNPSQAHWNVVKNILKYLRRTRDYSFVY---GFKNLIFTGYTDSGFQTDIDSRRSTSGSVFTLNG
        MK VPY SA G+IMY M+ T  D+  A+G++S++ S+P   HW  +K +L+YL+ T+ Y   +   G   L+  GY+D+ +  D++SRRSTSG +F LNG
Subjt:  MKQVPYASADGSIMYAMLCTHLDICYAIGIVSRYKSNPSQAHWNVVKNILKYLRRTRDYSFVY---GFKNLIFTGYTDSGFQTDIDSRRSTSGSVFTLNG

Query:  GAVVWRNIRQGCIADSTMEAEYVVACEASKEA
        G V WR+ +Q  +A S+ E EY+   EA++EA
Subjt:  GAVVWRNIRQGCIADSTMEAEYVVACEASKEA

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 2.4e-46 | 46.41
Query:  MKDLGEAQYVLGIQILRDRKNRLLALSQTSYIDKILNRYSMQNSKRGLLPFRHGIHLSKEQSPQTPQEVEDMKQVPYASADGSIMYAMLCTHLDICYAIG
        MKDLG AQ +LG++I+R+R +R L LSQ  YI+++L R++M+N+K    P    + LSK+  P T +E  +M +VPY+SA GS+MYAM+CT  DI +A+G
Subjt:  MKDLGEAQYVLGIQILRDRKNRLLALSQTSYIDKILNRYSMQNSKRGLLPFRHGIHLSKEQSPQTPQEVEDMKQVPYASADGSIMYAMLCTHLDICYAIG

Query:  IVSRYKSNPSQAHWNVVKNILKYLRRTRDYSFVYGFKNLIFTGYTDSGFQTDIDSRRSTSGSVFTLNGGAVVWRNIRQGCIADSTMEAEYVVACEASKEA
        +VSR+  NP + HW  VK IL+YLR T      +G  + I  GYTD+    DID+R+S++G +FT +GGA+ W++  Q C+A ST EAEY+ A E  KE 
Subjt:  IVSRYKSNPSQAHWNVVKNILKYLRRTRDYSFVYGFKNLIFTGYTDSGFQTDIDSRRSTSGSVFTLNGGAVVWRNIRQGCIADSTMEAEYVVACEASKEA

Query:  EKVERIEEE
          ++R  +E
Subjt:  EKVERIEEE

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | 1.8e-17 | 32.93
Query:  SPLSNNTY--WHLRLGHINIDRIDRLVKDGLLTDLEDT-SLPPCELCLEGKMTKRPFTGKGDRAEKPLELIHLDLCGAMNVKARGGYEYFISFIDDYSRY
        SP S  T+  WH RLGH     ++ ++ +  L+ L  +     C  CL  K  K PF+     + +PLE I+ D+  +  + +   Y Y++ F+D ++RY
Subjt:  SPLSNNTY--WHLRLGHINIDRIDRLVKDGLLTDLEDT-SLPPCELCLEGKMTKRPFTGKGDRAEKPLELIHLDLCGAMNVKARGGYEYFISFIDDYSRY

Query:  GYLYLMGHMSEALEKFKEFKTEVENPLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSPPGTPQ
         +LY +   S+  E F  FK  +EN     I T  SD GGE++     +Y  +HGI    SPP TP+
Subjt:  GYLYLMGHMSEALEKFKEFKTEVENPLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSPPGTPQ

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | 3.3e-19 | 34.46
Query:  AETQNKRQKISPLSNNTY--WHLRLGHINIDRIDRLVKDGLLTDLEDT-SLPPCELCLEGKMTKRPFTGKGDRAEKPLELIHLDLCGAMNVKARGGYEYF
        A +Q      SP S  T+  WH RLGH ++  ++ ++ +  L  L  +  L  C  C   K  K PF+     + KPLE I+ D+  +  + +   Y Y+
Subjt:  AETQNKRQKISPLSNNTY--WHLRLGHINIDRIDRLVKDGLLTDLEDT-SLPPCELCLEGKMTKRPFTGKGDRAEKPLELIHLDLCGAMNVKARGGYEYF

Query:  ISFIDDYSRYGYLYLMGHMSEALEKFKEFKTEVENPLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSPPGTPQ
        + F+D ++RY +LY +   S+  + F  FK+ VEN     I TL SD GGE++  R  DY+ +HGI    SPP TP+
Subjt:  ISFIDDYSRYGYLYLMGHMSEALEKFKEFKTEVENPLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSPPGTPQ

Arabidopsis top hits | e value | % identity (alignment follows each hit)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 | 2.4e-12 | 27
Query:  MKDLGEAQYVLGIQILRDRKNRLLALSQTSYIDKILNRYSMQNSKRGLLPFRHGIHLSKEQSPQTPQEVEDMKQVPYASADGSIMYAMLCTHLDICYAIG
        ++DLG  +Y LG++I R      + + Q  Y   +L+   +   K   +P    +  S         +  D K   Y    G +MY  + T LDI +A+ 
Subjt:  MKDLGEAQYVLGIQILRDRKNRLLALSQTSYIDKILNRYSMQNSKRGLLPFRHGIHLSKEQSPQTPQEVEDMKQVPYASADGSIMYAMLCTHLDICYAIG

Query:  IVSRYKSNPSQAHWNVVKNILKYLRRTRDYSFVYGFK-NLIFTGYTDSGFQTDIDSRRSTSGSVFTLNGGAVVWRNIRQGCIADSTMEAEYVVACEASKE
         +S++   P  AH   V  IL Y++ T      Y  +  +    ++D+ FQ+  D+RRST+G    L    + W++ +Q  ++ S+ EAEY     A+ E
Subjt:  IVSRYKSNPSQAHWNVVKNILKYLRRTRDYSFVYGFK-NLIFTGYTDSGFQTDIDSRRSTSGSVFTLNGGAVVWRNIRQGCIADSTMEAEYVVACEASKE

ATMG00300.1 Gag-Pol-related retrotransposon family protein | 1.9e-06 | 37.14
Query:  WHLRLGHINIDRIDRLVKDGLLTDLEDTSLPPCELCLEGKMTKRPFTGKGDRAEKPLELIHLDLCGAMNV
        WH RL H++   ++ LVK G L   + +SL  CE C+ GK  +  F+      + PL+ +H DL GA +V
Subjt:  WHLRLGHINIDRIDRLVKDGLLTDLEDTSLPPCELCLEGKMTKRPFTGKGDRAEKPLELIHLDLCGAMNV

ATMG00810.1 DNA/RNA polymerases superfamily protein | 8.1e-13 | 30.89
Query:  MKDLGEAQYVLGIQILRDRKNRLLALSQTSYIDKILNRYSMQNSKRGLLPFRHGIHLSKEQSPQTPQEVEDMKQVPYASADGSIMYAMLCTHLDICYAIG
        MKDLG   Y LGIQI        L LSQT Y ++ILN   M + K    P    + L    S  T +  +      + S  G++ Y  L T  DI YA+ 
Subjt:  MKDLGEAQYVLGIQILRDRKNRLLALSQTSYIDKILNRYSMQNSKRGLLPFRHGIHLSKEQSPQTPQEVEDMKQVPYASADGSIMYAMLCTHLDICYAIG

Query:  IVSRYKSNPSQAHWNVVKNILKYLRRTRDYS-FVYGFKNLIFTGYTDSGFQTDIDSRRSTSGSVFTLNGGAVVWRNIRQGCIADSTMEAEY
        IV +    P+ A ++++K +L+Y++ T  +  +++    L    + DS +     +RRST+G    L    + W   RQ  ++ S+ E EY
Subjt:  IVSRYKSNPSQAHWNVVKNILKYLRRTRDYS-FVYGFKNLIFTGYTDSGFQTDIDSRRSTSGSVFTLNGGAVVWRNIRQGCIADSTMEAEY


Sequences
CDS sequence
ATGAAAGATTTGGGAGAAGCTCAATACGTTCTAGGGATCCAAATACTAAGAGATCGGAAGAACAGACTACTAGCACTGTCTCAGACATCTTATATCGATAAAATTCTGAA
TCGATATTCGATGCAGAATTCCAAAAGGGGTTTGTTACCCTTTAGACATGGAATTCACCTATCAAAGGAACAGAGTCCACAAACACCTCAAGAAGTTGAGGATATGAAAC
AAGTTCCGTACGCATCGGCAGATGGAAGTATTATGTATGCAATGTTATGTACACATCTTGACATATGCTATGCGATAGGTATTGTCAGCAGATATAAGTCCAATCCTAGT
CAAGCACATTGGAATGTCGTTAAGAATATCCTCAAATATTTGAGGAGAACTAGAGACTATTCGTTTGTGTATGGATTTAAGAATTTGATCTTTACTGGATACACTGACTC
AGGTTTTCAAACCGACATAGACTCAAGGAGATCTACGTCGGGATCTGTGTTCACTCTGAATGGAGGAGCAGTAGTGTGGAGAAATATTAGACAGGGATGTATCGCCGACT
CCACAATGGAAGCTGAATACGTTGTAGCCTGTGAAGCATCAAAGGAAGCAGAGAAGGTTGAAAGAATTGAGGAAGAAAGGCTTGAGGATAGCAAGGAAGATCCTTTGGTG
GTTCGTGACTCATTGGTGAAGAAAAACAAGAAGTTAATCCAACCAATTTCTCCATTGAAACAAGCAATCGCCCCCCCTTTGCGTTTGCGTTTCTTCTCCAGCCTTCTTCC
TCGCGCGCAGCCACCAATCTCCCTCTCTTGCCCGTGTGTTGCGCCGCCGCTAGCAGCAGGTTTGCGTCCCTCTTCGTCTCTTCCTCCCTCGTGTGAGCAGCCGCGTTTCC
CATCTCTCTCTCGTTCATCCTCGTGTGAAACTGCCGGCGTCTTTTCCTGCTCCCTCCCGCATGTGCGTTCGCCGATAGCAAGACTCGCGAAGCCTTCGGTTTCTCCCCTC
TCATGCGCAGCCGTCCCGCCTCCCTCATTCACGAATCTCCATCTCCTTCGGCATCGGTTCGTCTTCCTTGCAGATCTCCCTCGGCTTCACGTGCGTCGGCCAGCACCCAC
GTCTCCGGTTTCGTCTCCTTCCTCGCGTGTAAGGCGCGTTCGGGCTGATTTAAGGATCGTCGCGGTGTGCTTAGGCTCTTTCGGAGCTCCGTTGGCCTGTTTAGGAGCGC
CTTATAGGGATCGGTGTGATGAGTCTAGAAAGCATCGGTGCACCTTGGGTACAAATGGCCAAGGGGCGATGCACAGTTCGAGGCCTTGGGTACAAATGGTCAAGGGTCGA
ATGCCGAGCTCCGTAGAGAGCATTGTGGCCCTGGGTACAAATGGTCAGGGGACAGTGCGGCTCGAAGGATCAGTCTTGGAGAGCCATGTAGGAGGGACTACTTACCAGTA
CCATGGTTGTACTGACCCCCTCCCTTCTCTTCCCCCCGACTTTCAGATTATGCAGTATGTTGGTGTAGTTGGAATTGTTTTAGAAGTGTTATTCTCGTTCTGGCAGGCCC
TAATATCTGATTCATCCCGAATTGTTCGTAGTGAATATGAATATAACTGCATAGCCAATACCAGAGACAGTAAATATGATTTACTTGTCATTGAAATGTGCATAGTGGAA
AATGATGTTTCAGCCTGGATCCTTGATTCAGAAGATCTTAGGTTCGTCTTAACGGAGGAATGTCCTCCTGTTCCCCCTCGCACTGCCACTCAGACAGTAAAGGATGCCCA
CGAACGCTGGACGAAGGCCAATGAAAAAGTCAAAGTCTATATATTGGTCGGCTTATCTGAAGTCTTGGCCAAGCGTTATGAGAACGTGGAAACTGCCAGGGAGATTATGA
ATTCCCTGCAGGAGATGTTTGGACTCCCATCCTATCAGCTCCACCACGATGCCTTGAAGAACGTCTTCAACGCCAAGATGCAAGAAGGTCAGTCTGTCCGGGAGCATGTC
CATGATATGATTAACCAGTTTAATATTGTTGAGGCAAATGGCGGGTTAGTCTGCGAGCGCAGTCAGGTTGCGTTCATCCTTCACTCGCTTCCTGCAAGCTATATGCCATT
CAGGACGAATGCGAGTATGAACAAAATTCAGTTCAACCTGACTTCCCTCCTCTCTGAGTTACAGATTTACGAGTCCATGCAAAAGAGCAAGAGCAAAAACGTGGTGAAAG
GAGAGGCCAATGTGGCCCATTCCAAGAAAAAGTTCTTGAAGGGTTCATCCTCAGGGACTAAATCTGTACTTCAAGCGTCTTCATCGAAGCAAATACAGAAGAGGAAGGGA
GACAAGGGGAAGGCTCCTGCACAAGCTGTGCAAGGTAAGGGCAAGGCCAAGGTCGTGACCAACAAAGGCAGATGCTTCCACTGCAATGTGGATGGTCACTGGAAGCGCAA
CTGTCACCGTTACCTCGCTGAGAAGAAGAGAGAGAAAGAAGCTAAGGGTGAAAATAATTTATTTGTGTTAAGACCAACCGACGCTAAGGCTATTTTAAGTCATGAAATGT
TTAAAACGGCTGAAACTCAAAATAAAAGGCAAAAGATTTCTCCTTTAAGTAACAATACGTATTGGCATCTTCGTCTCGGTCATATTAACATCGATCGGATCGATCGTTTG
GTTAAAGATGGACTTCTAACTGATCTAGAAGATACATCTTTGCCACCCTGTGAATTATGCCTAGAGGGTAAAATGACCAAGAGGCCTTTTACTGGAAAAGGTGATAGGGC
CGAAAAACCACTTGAACTGATACACTTGGATCTTTGTGGAGCAATGAATGTAAAGGCTCGAGGTGGTTATGAATATTTCATCTCTTTCATAGATGATTATTCTCGATACG
GTTACTTATACCTAATGGGCCATATGTCTGAAGCTCTTGAAAAGTTTAAGGAGTTTAAGACTGAGGTAGAAAACCCATTAGGTAAAACAATTAAAACACTTCGATCAGAT
CGAGGTGGAGAGTATCTGGATCAGAGATTCCAGGACTATATGATAGAACATGGAATCCAATCCCAACTCTCACCACCTGGTACACCTCAGTAG
mRNA sequence
ATGAAAGATTTGGGAGAAGCTCAATACGTTCTAGGGATCCAAATACTAAGAGATCGGAAGAACAGACTACTAGCACTGTCTCAGACATCTTATATCGATAAAATTCTGAA
TCGATATTCGATGCAGAATTCCAAAAGGGGTTTGTTACCCTTTAGACATGGAATTCACCTATCAAAGGAACAGAGTCCACAAACACCTCAAGAAGTTGAGGATATGAAAC
AAGTTCCGTACGCATCGGCAGATGGAAGTATTATGTATGCAATGTTATGTACACATCTTGACATATGCTATGCGATAGGTATTGTCAGCAGATATAAGTCCAATCCTAGT
CAAGCACATTGGAATGTCGTTAAGAATATCCTCAAATATTTGAGGAGAACTAGAGACTATTCGTTTGTGTATGGATTTAAGAATTTGATCTTTACTGGATACACTGACTC
AGGTTTTCAAACCGACATAGACTCAAGGAGATCTACGTCGGGATCTGTGTTCACTCTGAATGGAGGAGCAGTAGTGTGGAGAAATATTAGACAGGGATGTATCGCCGACT
CCACAATGGAAGCTGAATACGTTGTAGCCTGTGAAGCATCAAAGGAAGCAGAGAAGGTTGAAAGAATTGAGGAAGAAAGGCTTGAGGATAGCAAGGAAGATCCTTTGGTG
GTTCGTGACTCATTGGTGAAGAAAAACAAGAAGTTAATCCAACCAATTTCTCCATTGAAACAAGCAATCGCCCCCCCTTTGCGTTTGCGTTTCTTCTCCAGCCTTCTTCC
TCGCGCGCAGCCACCAATCTCCCTCTCTTGCCCGTGTGTTGCGCCGCCGCTAGCAGCAGGTTTGCGTCCCTCTTCGTCTCTTCCTCCCTCGTGTGAGCAGCCGCGTTTCC
CATCTCTCTCTCGTTCATCCTCGTGTGAAACTGCCGGCGTCTTTTCCTGCTCCCTCCCGCATGTGCGTTCGCCGATAGCAAGACTCGCGAAGCCTTCGGTTTCTCCCCTC
TCATGCGCAGCCGTCCCGCCTCCCTCATTCACGAATCTCCATCTCCTTCGGCATCGGTTCGTCTTCCTTGCAGATCTCCCTCGGCTTCACGTGCGTCGGCCAGCACCCAC
GTCTCCGGTTTCGTCTCCTTCCTCGCGTGTAAGGCGCGTTCGGGCTGATTTAAGGATCGTCGCGGTGTGCTTAGGCTCTTTCGGAGCTCCGTTGGCCTGTTTAGGAGCGC
CTTATAGGGATCGGTGTGATGAGTCTAGAAAGCATCGGTGCACCTTGGGTACAAATGGCCAAGGGGCGATGCACAGTTCGAGGCCTTGGGTACAAATGGTCAAGGGTCGA
ATGCCGAGCTCCGTAGAGAGCATTGTGGCCCTGGGTACAAATGGTCAGGGGACAGTGCGGCTCGAAGGATCAGTCTTGGAGAGCCATGTAGGAGGGACTACTTACCAGTA
CCATGGTTGTACTGACCCCCTCCCTTCTCTTCCCCCCGACTTTCAGATTATGCAGTATGTTGGTGTAGTTGGAATTGTTTTAGAAGTGTTATTCTCGTTCTGGCAGGCCC
TAATATCTGATTCATCCCGAATTGTTCGTAGTGAATATGAATATAACTGCATAGCCAATACCAGAGACAGTAAATATGATTTACTTGTCATTGAAATGTGCATAGTGGAA
AATGATGTTTCAGCCTGGATCCTTGATTCAGAAGATCTTAGGTTCGTCTTAACGGAGGAATGTCCTCCTGTTCCCCCTCGCACTGCCACTCAGACAGTAAAGGATGCCCA
CGAACGCTGGACGAAGGCCAATGAAAAAGTCAAAGTCTATATATTGGTCGGCTTATCTGAAGTCTTGGCCAAGCGTTATGAGAACGTGGAAACTGCCAGGGAGATTATGA
ATTCCCTGCAGGAGATGTTTGGACTCCCATCCTATCAGCTCCACCACGATGCCTTGAAGAACGTCTTCAACGCCAAGATGCAAGAAGGTCAGTCTGTCCGGGAGCATGTC
CATGATATGATTAACCAGTTTAATATTGTTGAGGCAAATGGCGGGTTAGTCTGCGAGCGCAGTCAGGTTGCGTTCATCCTTCACTCGCTTCCTGCAAGCTATATGCCATT
CAGGACGAATGCGAGTATGAACAAAATTCAGTTCAACCTGACTTCCCTCCTCTCTGAGTTACAGATTTACGAGTCCATGCAAAAGAGCAAGAGCAAAAACGTGGTGAAAG
GAGAGGCCAATGTGGCCCATTCCAAGAAAAAGTTCTTGAAGGGTTCATCCTCAGGGACTAAATCTGTACTTCAAGCGTCTTCATCGAAGCAAATACAGAAGAGGAAGGGA
GACAAGGGGAAGGCTCCTGCACAAGCTGTGCAAGGTAAGGGCAAGGCCAAGGTCGTGACCAACAAAGGCAGATGCTTCCACTGCAATGTGGATGGTCACTGGAAGCGCAA
CTGTCACCGTTACCTCGCTGAGAAGAAGAGAGAGAAAGAAGCTAAGGGTGAAAATAATTTATTTGTGTTAAGACCAACCGACGCTAAGGCTATTTTAAGTCATGAAATGT
TTAAAACGGCTGAAACTCAAAATAAAAGGCAAAAGATTTCTCCTTTAAGTAACAATACGTATTGGCATCTTCGTCTCGGTCATATTAACATCGATCGGATCGATCGTTTG
GTTAAAGATGGACTTCTAACTGATCTAGAAGATACATCTTTGCCACCCTGTGAATTATGCCTAGAGGGTAAAATGACCAAGAGGCCTTTTACTGGAAAAGGTGATAGGGC
CGAAAAACCACTTGAACTGATACACTTGGATCTTTGTGGAGCAATGAATGTAAAGGCTCGAGGTGGTTATGAATATTTCATCTCTTTCATAGATGATTATTCTCGATACG
GTTACTTATACCTAATGGGCCATATGTCTGAAGCTCTTGAAAAGTTTAAGGAGTTTAAGACTGAGGTAGAAAACCCATTAGGTAAAACAATTAAAACACTTCGATCAGAT
CGAGGTGGAGAGTATCTGGATCAGAGATTCCAGGACTATATGATAGAACATGGAATCCAATCCCAACTCTCACCACCTGGTACACCTCAGTAG
Protein sequence
MKDLGEAQYVLGIQILRDRKNRLLALSQTSYIDKILNRYSMQNSKRGLLPFRHGIHLSKEQSPQTPQEVEDMKQVPYASADGSIMYAMLCTHLDICYAIGIVSRYKSNPS
QAHWNVVKNILKYLRRTRDYSFVYGFKNLIFTGYTDSGFQTDIDSRRSTSGSVFTLNGGAVVWRNIRQGCIADSTMEAEYVVACEASKEAEKVERIEEERLEDSKEDPLV
VRDSLVKKNKKLIQPISPLKQAIAPPLRLRFFSSLLPRAQPPISLSCPCVAPPLAAGLRPSSSLPPSCEQPRFPSLSRSSSCETAGVFSCSLPHVRSPIARLAKPSVSPL
SCAAVPPPSFTNLHLLRHRFVFLADLPRLHVRRPAPTSPVSSPSSRVRRVRADLRIVAVCLGSFGAPLACLGAPYRDRCDESRKHRCTLGTNGQGAMHSSRPWVQMVKGR
MPSSVESIVALGTNGQGTVRLEGSVLESHVGGTTYQYHGCTDPLPSLPPDFQIMQYVGVVGIVLEVLFSFWQALISDSSRIVRSEYEYNCIANTRDSKYDLLVIEMCIVE
NDVSAWILDSEDLRFVLTEECPPVPPRTATQTVKDAHERWTKANEKVKVYILVGLSEVLAKRYENVETAREIMNSLQEMFGLPSYQLHHDALKNVFNAKMQEGQSVREHV
HDMINQFNIVEANGGLVCERSQVAFILHSLPASYMPFRTNASMNKIQFNLTSLLSELQIYESMQKSKSKNVVKGEANVAHSKKKFLKGSSSGTKSVLQASSSKQIQKRKG
DKGKAPAQAVQGKGKAKVVTNKGRCFHCNVDGHWKRNCHRYLAEKKREKEAKGENNLFVLRPTDAKAILSHEMFKTAETQNKRQKISPLSNNTYWHLRLGHINIDRIDRL
VKDGLLTDLEDTSLPPCELCLEGKMTKRPFTGKGDRAEKPLELIHLDLCGAMNVKARGGYEYFISFIDDYSRYGYLYLMGHMSEALEKFKEFKTEVENPLGKTIKTLRSD
RGGEYLDQRFQDYMIEHGIQSQLSPPGTPQ