CuGenDBv2

Clc08G05310 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc08G05310
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: glutamic acid-rich protein-like isoform X2
Genome location: ClcChr08:16447759..16458320
RNA-Seq Expression: Clc08G05310
Synteny: Clc08G05310
Gene Ontology terms:
    GO:0005634 - nucleus (cellular component)
    GO:0016874 - ligase activity (molecular function)
InterPro domains:
    IPR019098 - Histone chaperone domain CHZ
    IPR037647 - HIRA-interacting protein 3
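
The genome location listed above follows a `chromosome:start..end` pattern. The following is a minimal Python sketch for splitting such a string into fields. It assumes the coordinates are 1-based and inclusive, which the page does not state, and `parse_location` is a hypothetical helper name, not part of any database API.

```python
import re

def parse_location(loc):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))

chrom, start, end = parse_location("ClcChr08:16447759..16458320")

# Span length under the ASSUMED 1-based, inclusive coordinate convention.
span = end - start + 1
print(chrom, start, end, span)
```

If the coordinates were instead 0-based and half-open, the span would be `end - start`.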


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAG7016498.1 hypothetical protein SDJN02_21607 [Cucurbita argyrosperma subsp. argyrosperma]; e value: 2.4e-201; % identity: 83.43
Query:  MAEELQDKDAPNEEAMDVAVDIERKIHNAMRSRISHFKEQADSLTFEGVRRLLEKDLCLEKYALDVHKRYVKQCLVKCLESAEEDNASKDSEEIGRKSAS
        MAEELQD DAPNEEAMDV V IE KI NAM SR+SHFKEQADSLTFEGVRRLLEKDLC+E YALDVHKRY+KQCLVKCLE  EEDNASK SEE G KS S
Subjt:  MAEELQDKDAPNEEAMDVAVDIERKIHNAMRSRISHFKEQADSLTFEGVRRLLEKDLCLEKYALDVHKRYVKQCLVKCLESAEEDNASKDSEEIGRKSAS

Query:  KGEAAESLEGHQSEKDVNEPCVE-YENMEDSPVMGLLTGRKTKNVESDGIKGTKDKDDKDIPSESTITEAIRKRTSYLKANSEKVTMAGVRRLLEDDLKL
        +GEAAESLEGHQS+K   EPC+E  E MEDSPVMGLL G KTKNVESD IKG KDKDDKDIP+ESTI +AIRKRT YLKANSEKVTMAGVRRLLEDDLKL
Subjt:  KGEAAESLEGHQSEKDVNEPCVE-YENMEDSPVMGLLTGRKTKNVESDGIKGTKDKDDKDIPSESTITEAIRKRTSYLKANSEKVTMAGVRRLLEDDLKL

Query:  TTKALDSCKKFISQQVEKILTSCEAAEQVSNEKKGSCLKTPKKVSKESSHSTE-GSSSEEENDEVKPGKKNATKGRIPNSNETKKRKRSTMETVSAKKQS
        T  ALD CKKFISQQVE+IL SCEAAEQVSNEKKGS LKTPKKVSKESSHSTE GSSSEEE+DEVKP KKN TKGRI NSNETKKRKRST E VSAKKQ 
Subjt:  TTKALDSCKKFISQQVEKILTSCEAAEQVSNEKKGSCLKTPKKVSKESSHSTE-GSSSEEENDEVKPGKKNATKGRIPNSNETKKRKRSTMETVSAKKQS

Query:  KHVQQTSEEDSD-GGGENGSEDGQSDSSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPAIYKKVKQAPESKRESQLIKELEGILSKEGLSANPSEK
        KHVQ TSEEDSD  GGEN SEDG S+SSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPP+IYKKVKQAPESKRESQLIKELEGILS+EGLSANP+EK
Subjt:  KHVQQTSEEDSD-GGGENGSEDGQSDSSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPAIYKKVKQAPESKRESQLIKELEGILSKEGLSANPSEK

Query:  EIKEVKKKKERAKELEGIDLSNIVSSSRRRSMTSYVAPPPKPKIPVKTDGDDVHDTEEEEEEEEEEEEEDEEEEEEDDDDDDEEEDGEEEDNGDVDESQG
        EIK+VKKKKERAKELEGIDLSNIVSSSRRRS +SYVAPPPKPKIPVKT+GDDV DT        ++E++D++++++DDDD+D EE+ +EEDNGDVDESQG
Subjt:  EIKEVKKKKERAKELEGIDLSNIVSSSRRRSMTSYVAPPPKPKIPVKTDGDDVHDTEEEEEEEEEEEEEDEEEEEEDDDDDDEEEDGEEEDNGDVDESQG

Query:  -EEFNEDDNEDSD
         EEFNEDDNEDSD
Subjt:  -EEFNEDDNEDSD

XP_022939456.1 DNA ligase 1-like isoform X1 [Cucurbita moschata]; e value: 3.7e-202; % identity: 83.63
Query:  MAEELQDKDAPNEEAMDVAVDIERKIHNAMRSRISHFKEQADSLTFEGVRRLLEKDLCLEKYALDVHKRYVKQCLVKCLESAEEDNASKDSEEIGRKSAS
        MAEELQD DAPNEEAMDV V IE KI NAM SR+SHFKEQADSLTFEGVRRLLEKDLC+E YALDVHKRY+KQCLVKCLE  EEDNASK SEE G KS S
Subjt:  MAEELQDKDAPNEEAMDVAVDIERKIHNAMRSRISHFKEQADSLTFEGVRRLLEKDLCLEKYALDVHKRYVKQCLVKCLESAEEDNASKDSEEIGRKSAS

Query:  KGEAAESLEGHQSEKDVNEPCVE-YENMEDSPVMGLLTGRKTKNVESDGIKGTKDKDDKDIPSESTITEAIRKRTSYLKANSEKVTMAGVRRLLEDDLKL
        +GEAAESLEGHQS+K   EPC+E  E MEDSPVMGLL G KTKNVESD IKG KDKDDKDIP+ESTI +AIRKRT YLKANSEKVTMAGVRRLLEDDLKL
Subjt:  KGEAAESLEGHQSEKDVNEPCVE-YENMEDSPVMGLLTGRKTKNVESDGIKGTKDKDDKDIPSESTITEAIRKRTSYLKANSEKVTMAGVRRLLEDDLKL

Query:  TTKALDSCKKFISQQVEKILTSCEAAEQVSNEKKGSCLKTPKKVSKESSHSTE-GSSSEEENDEVKPGKKNATKGRIPNSNETKKRKRSTMETVSAKKQS
        T  ALD CKKFISQQVE+IL SCEAAE+VSNEKKGS LKTPKKVSKESSHSTE GSSSEEE+DEVKP KKN TKGRI NSNETKKRKRST E VSAKKQ 
Subjt:  TTKALDSCKKFISQQVEKILTSCEAAEQVSNEKKGSCLKTPKKVSKESSHSTE-GSSSEEENDEVKPGKKNATKGRIPNSNETKKRKRSTMETVSAKKQS

Query:  KHVQQTSEEDSD-GGGENGSEDGQSDSSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPAIYKKVKQAPESKRESQLIKELEGILSKEGLSANPSEK
        KHVQ TSEEDSD  GGEN SEDG S+SSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPP+IYKKVKQAPESKRESQLIKELEGILS+EGLSANP+EK
Subjt:  KHVQQTSEEDSD-GGGENGSEDGQSDSSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPAIYKKVKQAPESKRESQLIKELEGILSKEGLSANPSEK

Query:  EIKEVKKKKERAKELEGIDLSNIVSSSRRRSMTSYVAPPPKPKIPVKTDGDDVHDTEEEEEEEEEEEEEDEEEEEEDDDDDDEEEDGEEEDNGDVDESQG
        EIK+VKKKKERAKELEGIDLSNIVSSSRRRS +SYVAPPPKPKIPVKT+GDDV DT++E           EEE+++DDDD+D EE+ +EEDNGDVDESQG
Subjt:  EIKEVKKKKERAKELEGIDLSNIVSSSRRRSMTSYVAPPPKPKIPVKTDGDDVHDTEEEEEEEEEEEEEDEEEEEEDDDDDDEEEDGEEEDNGDVDESQG

Query:  -EEFNEDDNEDSD
         EEFNEDDNEDSD
Subjt:  -EEFNEDDNEDSD

XP_023551365.1 glutamic acid-rich protein-like isoform X1 [Cucurbita pepo subsp. pepo]; e value: 1.3e-202; % identity: 83.82
Query:  MAEELQDKDAPNEEAMDVAVDIERKIHNAMRSRISHFKEQADSLTFEGVRRLLEKDLCLEKYALDVHKRYVKQCLVKCLESAEEDNASKDSEEIGRKSAS
        MAEELQD DAPNEEAMDV V IE KI NAM SR+SHFKEQADSLTFEGVRRLLEKDLC+E YALDVHKRY+KQCLVKCLE  EEDN SK SEE G KS S
Subjt:  MAEELQDKDAPNEEAMDVAVDIERKIHNAMRSRISHFKEQADSLTFEGVRRLLEKDLCLEKYALDVHKRYVKQCLVKCLESAEEDNASKDSEEIGRKSAS

Query:  KGEAAESLEGHQSEKDVNEPCVE-YENMEDSPVMGLLTGRKTKNVESDGIKGTKDKDDKDIPSESTITEAIRKRTSYLKANSEKVTMAGVRRLLEDDLKL
        +GEAAESLEGHQS+K   EPC+E  E MEDSPVMGLL G KTKNVESD +KG KDKDDKDIP+E+TI +AIRKRT YLKANSEKVTMAGVRRLLEDDLKL
Subjt:  KGEAAESLEGHQSEKDVNEPCVE-YENMEDSPVMGLLTGRKTKNVESDGIKGTKDKDDKDIPSESTITEAIRKRTSYLKANSEKVTMAGVRRLLEDDLKL

Query:  TTKALDSCKKFISQQVEKILTSCEAAEQVSNEKKGSCLKTPKKVSKESSHSTE-GSSSEEENDEVKPGKKNATKGRIPNSNETKKRKRSTMETVSAKKQS
        T  ALD CKKFISQQVE+IL SCEAAEQVSNEKKGS LKTPKKVSKESSHSTE GSSSEEE+DEVKP KKN TKGRI NSNETKKRKRST E VSAKKQ 
Subjt:  TTKALDSCKKFISQQVEKILTSCEAAEQVSNEKKGSCLKTPKKVSKESSHSTE-GSSSEEENDEVKPGKKNATKGRIPNSNETKKRKRSTMETVSAKKQS

Query:  KHVQQTSEEDSD-GGGENGSEDGQSDSSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPAIYKKVKQAPESKRESQLIKELEGILSKEGLSANPSEK
        KHVQ TSEEDSD  GGEN SEDG S+SSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPP+IYKKVKQAPESKRESQLIKELEGILS+EGLSANP+EK
Subjt:  KHVQQTSEEDSD-GGGENGSEDGQSDSSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPAIYKKVKQAPESKRESQLIKELEGILSKEGLSANPSEK

Query:  EIKEVKKKKERAKELEGIDLSNIVSSSRRRSMTSYVAPPPKPKIPVKTDGDDVHDTEEEEEEEEEEEEEDEEEEEEDDDDDDEEEDGEEEDNGDVDESQG
        EIK+VKKKKERAKELEGIDLSNIVSSSRRRS +SYVAPPPKPKIPVKT+GDDV DT+E            EEEEEEDDDD+D EE+ +EEDNGDVDESQG
Subjt:  EIKEVKKKKERAKELEGIDLSNIVSSSRRRSMTSYVAPPPKPKIPVKTDGDDVHDTEEEEEEEEEEEEEDEEEEEEDDDDDDEEEDGEEEDNGDVDESQG

Query:  -EEFNEDDNEDSD
         EEFNEDDNEDSD
Subjt:  -EEFNEDDNEDSD

XP_023551366.1 glutamic acid-rich protein-like isoform X2 [Cucurbita pepo subsp. pepo]; e value: 3.9e-199; % identity: 82.81
Query:  MAEELQDKDAPNEEAMDVAVDIERKIHNAMRSRISHFKEQADSLTFEGVRRLLEKDLCLEKYALDVHKRYVKQCLVKCLESAEEDNASKDSEEIGRKSAS
        MAEELQD DAPNEEAMDV V IE KI NAM SR+SHFKEQADSLTFEGVRRLLEKDLC+E YALDVHKRY+KQCLVKCLE  EEDN SK SEE G KS S
Subjt:  MAEELQDKDAPNEEAMDVAVDIERKIHNAMRSRISHFKEQADSLTFEGVRRLLEKDLCLEKYALDVHKRYVKQCLVKCLESAEEDNASKDSEEIGRKSAS

Query:  KGEAAESLEGHQSEKDVNEPCVE-YENMEDSPVMGLLTGRKTKNVESDGIKGTKDKDDKDIPSESTITEAIRKRTSYLKANSEKVTMAGVRRLLEDDLKL
        +GEAAESLEGHQS+K   EPC+E  E MEDSPVMGLL G KTKNVESD +KG KDKDDKDIP+E+TI +AIRKRT YLKANSEKVTMAGVRRLLEDDLKL
Subjt:  KGEAAESLEGHQSEKDVNEPCVE-YENMEDSPVMGLLTGRKTKNVESDGIKGTKDKDDKDIPSESTITEAIRKRTSYLKANSEKVTMAGVRRLLEDDLKL

Query:  TTKALDSCKKFISQQVEKILTSCEAAEQVSNEKKGSCLKTPKKVSKESSHSTE-GSSSEEENDEVKPGKKNATKGRIPNSNETKKRKRSTMETVSAKKQS
        T  ALD CKKFISQQVE+IL SCEAAEQVSNEKKGS LKTPKKVSKESSHSTE GSSSEEE+DEVKP KKN TKGRI NSNETKKRKRST E VSAKKQ 
Subjt:  TTKALDSCKKFISQQVEKILTSCEAAEQVSNEKKGSCLKTPKKVSKESSHSTE-GSSSEEENDEVKPGKKNATKGRIPNSNETKKRKRSTMETVSAKKQS

Query:  KHVQQTSEEDSD-GGGENGSEDGQSDSSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPAIYKKVKQAPESKRESQLIKELEGILSKEGLSANPSEK
        KHVQ TSEEDSD  GGEN SEDG S+SSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPP+IYKKVKQAPESKRESQLIKELEGILS+EGLSANP+EK
Subjt:  KHVQQTSEEDSD-GGGENGSEDGQSDSSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPAIYKKVKQAPESKRESQLIKELEGILSKEGLSANPSEK

Query:  EIKEVKKKKERAKELEGIDLSNIVSSSRRRSMTSYVAPPPKPKIPVKTDGDDVHDTEEEEEEEEEEEEEDEEEEEEDDDDDDEEEDGEEEDNGDVDESQG
        EIK+VKKKKERAKELEGIDLSNIVSSSRRRS +SYVAPPPKPKIPVKT+GDDV DT+E            EEEEEEDDDD+D EE+ +EEDNGDVDESQ 
Subjt:  EIKEVKKKKERAKELEGIDLSNIVSSSRRRSMTSYVAPPPKPKIPVKTDGDDVHDTEEEEEEEEEEEEEDEEEEEEDDDDDDEEEDGEEEDNGDVDESQG

Query:  EEFNEDDNEDSD
             DDNEDSD
Subjt:  EEFNEDDNEDSD

XP_038884709.1 glutamic acid-rich protein isoform X1 [Benincasa hispida]; e value: 1.6e-200; % identity: 83.27
Query:  MAEELQDKDAPNEEAMDVAVDIERKIHNAMRSRISHFKEQADSLTFEGVRRLLEKDLCLEKYALDVHKRYVKQCLVKCLESAEEDNASKDSEEIGRKSAS
        MAEELQDKDA N++AMDVAVDIE KI+NAMRSR+S+FKE+ADSLTFEGVRRLLEKDLC+E Y LDVHKR VKQCLVKC E+  EDN SK SEE GRKS +
Subjt:  MAEELQDKDAPNEEAMDVAVDIERKIHNAMRSRISHFKEQADSLTFEGVRRLLEKDLCLEKYALDVHKRYVKQCLVKCLESAEEDNASKDSEEIGRKSAS

Query:  KGEAAESLEGHQSEKDVNEPCVE-YENMEDSPVMGLLTGRKTKNVESDGIKGTKDKDDKDIPSESTITEAIRKRTSYLKANSEKVTMAGVRRLLEDDLKL
        K EAAE LEGHQS+K V EPC E  E MEDSPVMGLL  R TKNVESDGIKG KDKDDKDIPSES I +AIRKRTSYLKANSEKVTMAGVRRLLEDDLKL
Subjt:  KGEAAESLEGHQSEKDVNEPCVE-YENMEDSPVMGLLTGRKTKNVESDGIKGTKDKDDKDIPSESTITEAIRKRTSYLKANSEKVTMAGVRRLLEDDLKL

Query:  TTKALDSCKKFISQQVEKILTSCEAAEQVSNEKKGSCLKTPKKVSKESSHSTEGSS----SEEENDEVKPGKKNATKGRIPNSNETKKRKRSTMETVSAK
        T  ALDSCKKFISQQVE+ILTSCEAAEQVSNEK+   LKTPKKVSKESSHSTEGSS    SEEENDEVKPGKKNATKGRIPNSNETKKRKRST ET+SAK
Subjt:  TTKALDSCKKFISQQVEKILTSCEAAEQVSNEKKGSCLKTPKKVSKESSHSTEGSS----SEEENDEVKPGKKNATKGRIPNSNETKKRKRSTMETVSAK

Query:  KQSKHVQQTSEEDSDGGGENGSEDGQSDSSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPAIYKKVKQAPESKRESQLIKELEGILSKEGLSANPS
        KQSKHVQ TSEED+D GGEN SEDGQS+SS+E+PVKKEVSTPVYGK VEHLKSVIKSCGMSVPP+IYKKVKQAPESKRESQLIKELEGILS+EGLSANP+
Subjt:  KQSKHVQQTSEEDSDGGGENGSEDGQSDSSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPAIYKKVKQAPESKRESQLIKELEGILSKEGLSANPS

Query:  EKEIKEVKKKKERAKELEGIDLSNIVSSSRRRSMTSYVAPPPKPKIPVKTDGDDVHDTEEEEEEEEEEEEEDEEEEEEDDDDDDEEEDGEEEDNGDVDES
        EKEIKEVKKKKERAKELEGIDLSNIVSSSRRRS TSY  PPPKPKIPVKTDGD         ++ ++ EEE+EE+E+E+DDDD+EEEDGEEEDNG+VD S
Subjt:  EKEIKEVKKKKERAKELEGIDLSNIVSSSRRRSMTSYVAPPPKPKIPVKTDGDDVHDTEEEEEEEEEEEEEDEEEEEEDDDDDDEEEDGEEEDNGDVDES

Query:  QGEEFNEDDNEDSD
        QGEEFNEDDNEDSD
Subjt:  QGEEFNEDDNEDSD
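
In the alignment blocks above, the unlabeled middle line follows the usual BLAST pairwise convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below estimates the identity fraction of one aligned segment from that middle line. `midline_identity` is a hypothetical helper, and the example covers only the first 20 columns of the Benincasa hispida alignment, so it will not reproduce the whole-alignment % identity reported for that hit.

```python
def midline_identity(query, midline):
    """Fraction of aligned columns whose midline character is a letter."""
    if len(query) != len(midline):
        raise ValueError("query and midline must cover the same columns")
    # Letters in the midline mark identical residues; '+' and ' ' do not count.
    identical = sum(ch.isalpha() for ch in midline)
    return identical / len(query)

# First 20 columns of the XP_038884709.1 alignment shown above.
query_seg = "MAEELQDKDAPNEEAMDVAV"
mid_seg   = "MAEELQDKDA N++AMDVAV"
frac = midline_identity(query_seg, mid_seg)
print(f"{frac:.2%}")
```

Applying the same count across every block of a hit, with gap columns included in the denominator, is how an overall identity percentage is normally derived.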

TrEMBL top hits (columns: e value, % identity, alignment)
A0A0A0LIS6 CHZ domain-containing protein; e value: 2.7e-198; % identity: 81.84
Query:  MAEELQDKDAPNEEAMDVAVDIERKIHNAMRSRISHFKEQADSLTFEGVRRLLEKDLCLEKYALDVHKRYVKQCLVKCLESAEEDNASKDSEEIGRKSAS
        MAEELQ  D P EE MDVAV IE KIHNAMRSRISHFKEQADSLTFEGVRRLLEKDLC+E Y LDVHKRYVKQCLVKCLE+  EDN SKDSE  GRKS +
Subjt:  MAEELQDKDAPNEEAMDVAVDIERKIHNAMRSRISHFKEQADSLTFEGVRRLLEKDLCLEKYALDVHKRYVKQCLVKCLESAEEDNASKDSEEIGRKSAS

Query:  KGEAAESLEGHQSEKDVNEPCVE-YENMEDSPVMGLLTGRKTKNVESDGIKGTKDKDDKDIPSESTITEAIRKRTSYLKANSEKVTMAGVRRLLEDDLKL
        K EA ES EGHQS+K   EPC+E  E MEDSPVMGLLTGR TKNVESDGIKG K KDDKD+PSESTI +AIRKRTSYLKANSEKVTMAGVRRLLEDDLKL
Subjt:  KGEAAESLEGHQSEKDVNEPCVE-YENMEDSPVMGLLTGRKTKNVESDGIKGTKDKDDKDIPSESTITEAIRKRTSYLKANSEKVTMAGVRRLLEDDLKL

Query:  TTKALDSCKKFISQQVEKILTSCEAAEQVSNEKKGSCLKTPKKVSKESSHSTEGSSSEEENDEVKPGKKNATKGRIPNSNETKKRKRSTMETVSAKKQSK
        T   LDSCKKFISQQVE+ILTSCEAAEQVSN      LK+PKK+SKESS+STEGSSSEEENDEV PGK NATKGRIP+SNETKKRKRST +TVSA+KQSK
Subjt:  TTKALDSCKKFISQQVEKILTSCEAAEQVSNEKKGSCLKTPKKVSKESSHSTEGSSSEEENDEVKPGKKNATKGRIPNSNETKKRKRSTMETVSAKKQSK

Query:  HVQQTSEEDSDGGGENGSEDGQSDSSNEKPVKKEV--STPVYGKRVEHLKSVIKSCGMSVPPAIYKKVKQAPESKRESQLIKELEGILSKEGLSANPSEK
        HVQ TS+EDSD GG N SEDG+S SSNEKPVKKEV  STPVYGKRVEHLKSVIKSCGMSVPP+IYKKVKQAPESKRESQLIKELEGILS+EGLSAN +EK
Subjt:  HVQQTSEEDSDGGGENGSEDGQSDSSNEKPVKKEV--STPVYGKRVEHLKSVIKSCGMSVPPAIYKKVKQAPESKRESQLIKELEGILSKEGLSANPSEK

Query:  EIKEVKKKKERAKELEGIDLSNIVSSSRRRSMTSYVAPPPKPKIPVKTDGDDVHDTEEEEEEEEEEEEEDEEEEEEDDDDDDEEEDGEEEDNGDVDESQG
        EIKEVKKKKERAKELEGIDLSNIVSSSRRRS TSYVAPPPKPKIPVKTDGDD                   +EEE+D+++D+EEEDGEEEDNGDVDESQG
Subjt:  EIKEVKKKKERAKELEGIDLSNIVSSSRRRSMTSYVAPPPKPKIPVKTDGDDVHDTEEEEEEEEEEEEEDEEEEEEDDDDDDEEEDGEEEDNGDVDESQG

Query:  EEFNEDDNEDSD
        EEFNEDDNEDSD
Subjt:  EEFNEDDNEDSD

A0A6J1FFY5 DNA ligase 1-like isoform X1; e value: 1.8e-202; % identity: 83.63
Query:  MAEELQDKDAPNEEAMDVAVDIERKIHNAMRSRISHFKEQADSLTFEGVRRLLEKDLCLEKYALDVHKRYVKQCLVKCLESAEEDNASKDSEEIGRKSAS
        MAEELQD DAPNEEAMDV V IE KI NAM SR+SHFKEQADSLTFEGVRRLLEKDLC+E YALDVHKRY+KQCLVKCLE  EEDNASK SEE G KS S
Subjt:  MAEELQDKDAPNEEAMDVAVDIERKIHNAMRSRISHFKEQADSLTFEGVRRLLEKDLCLEKYALDVHKRYVKQCLVKCLESAEEDNASKDSEEIGRKSAS

Query:  KGEAAESLEGHQSEKDVNEPCVE-YENMEDSPVMGLLTGRKTKNVESDGIKGTKDKDDKDIPSESTITEAIRKRTSYLKANSEKVTMAGVRRLLEDDLKL
        +GEAAESLEGHQS+K   EPC+E  E MEDSPVMGLL G KTKNVESD IKG KDKDDKDIP+ESTI +AIRKRT YLKANSEKVTMAGVRRLLEDDLKL
Subjt:  KGEAAESLEGHQSEKDVNEPCVE-YENMEDSPVMGLLTGRKTKNVESDGIKGTKDKDDKDIPSESTITEAIRKRTSYLKANSEKVTMAGVRRLLEDDLKL

Query:  TTKALDSCKKFISQQVEKILTSCEAAEQVSNEKKGSCLKTPKKVSKESSHSTE-GSSSEEENDEVKPGKKNATKGRIPNSNETKKRKRSTMETVSAKKQS
        T  ALD CKKFISQQVE+IL SCEAAE+VSNEKKGS LKTPKKVSKESSHSTE GSSSEEE+DEVKP KKN TKGRI NSNETKKRKRST E VSAKKQ 
Subjt:  TTKALDSCKKFISQQVEKILTSCEAAEQVSNEKKGSCLKTPKKVSKESSHSTE-GSSSEEENDEVKPGKKNATKGRIPNSNETKKRKRSTMETVSAKKQS

Query:  KHVQQTSEEDSD-GGGENGSEDGQSDSSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPAIYKKVKQAPESKRESQLIKELEGILSKEGLSANPSEK
        KHVQ TSEEDSD  GGEN SEDG S+SSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPP+IYKKVKQAPESKRESQLIKELEGILS+EGLSANP+EK
Subjt:  KHVQQTSEEDSD-GGGENGSEDGQSDSSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPAIYKKVKQAPESKRESQLIKELEGILSKEGLSANPSEK

Query:  EIKEVKKKKERAKELEGIDLSNIVSSSRRRSMTSYVAPPPKPKIPVKTDGDDVHDTEEEEEEEEEEEEEDEEEEEEDDDDDDEEEDGEEEDNGDVDESQG
        EIK+VKKKKERAKELEGIDLSNIVSSSRRRS +SYVAPPPKPKIPVKT+GDDV DT++E           EEE+++DDDD+D EE+ +EEDNGDVDESQG
Subjt:  EIKEVKKKKERAKELEGIDLSNIVSSSRRRSMTSYVAPPPKPKIPVKTDGDDVHDTEEEEEEEEEEEEEDEEEEEEDDDDDDEEEDGEEEDNGDVDESQG

Query:  -EEFNEDDNEDSD
         EEFNEDDNEDSD
Subjt:  -EEFNEDDNEDSD

A0A6J1FGV2 glutamic acid-rich protein-like isoform X2; e value: 5.4e-199; % identity: 82.62
Query:  MAEELQDKDAPNEEAMDVAVDIERKIHNAMRSRISHFKEQADSLTFEGVRRLLEKDLCLEKYALDVHKRYVKQCLVKCLESAEEDNASKDSEEIGRKSAS
        MAEELQD DAPNEEAMDV V IE KI NAM SR+SHFKEQADSLTFEGVRRLLEKDLC+E YALDVHKRY+KQCLVKCLE  EEDNASK SEE G KS S
Subjt:  MAEELQDKDAPNEEAMDVAVDIERKIHNAMRSRISHFKEQADSLTFEGVRRLLEKDLCLEKYALDVHKRYVKQCLVKCLESAEEDNASKDSEEIGRKSAS

Query:  KGEAAESLEGHQSEKDVNEPCVE-YENMEDSPVMGLLTGRKTKNVESDGIKGTKDKDDKDIPSESTITEAIRKRTSYLKANSEKVTMAGVRRLLEDDLKL
        +GEAAESLEGHQS+K   EPC+E  E MEDSPVMGLL G KTKNVESD IKG KDKDDKDIP+ESTI +AIRKRT YLKANSEKVTMAGVRRLLEDDLKL
Subjt:  KGEAAESLEGHQSEKDVNEPCVE-YENMEDSPVMGLLTGRKTKNVESDGIKGTKDKDDKDIPSESTITEAIRKRTSYLKANSEKVTMAGVRRLLEDDLKL

Query:  TTKALDSCKKFISQQVEKILTSCEAAEQVSNEKKGSCLKTPKKVSKESSHSTE-GSSSEEENDEVKPGKKNATKGRIPNSNETKKRKRSTMETVSAKKQS
        T  ALD CKKFISQQVE+IL SCEAAE+VSNEKKGS LKTPKKVSKESSHSTE GSSSEEE+DEVKP KKN TKGRI NSNETKKRKRST E VSAKKQ 
Subjt:  TTKALDSCKKFISQQVEKILTSCEAAEQVSNEKKGSCLKTPKKVSKESSHSTE-GSSSEEENDEVKPGKKNATKGRIPNSNETKKRKRSTMETVSAKKQS

Query:  KHVQQTSEEDSD-GGGENGSEDGQSDSSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPAIYKKVKQAPESKRESQLIKELEGILSKEGLSANPSEK
        KHVQ TSEEDSD  GGEN SEDG S+SSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPP+IYKKVKQAPESKRESQLIKELEGILS+EGLSANP+EK
Subjt:  KHVQQTSEEDSD-GGGENGSEDGQSDSSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPAIYKKVKQAPESKRESQLIKELEGILSKEGLSANPSEK

Query:  EIKEVKKKKERAKELEGIDLSNIVSSSRRRSMTSYVAPPPKPKIPVKTDGDDVHDTEEEEEEEEEEEEEDEEEEEEDDDDDDEEEDGEEEDNGDVDESQG
        EIK+VKKKKERAKELEGIDLSNIVSSSRRRS +SYVAPPPKPKIPVKT+GDDV DT++E           EEE+++DDDD+D EE+ +EEDNGDVDESQ 
Subjt:  EIKEVKKKKERAKELEGIDLSNIVSSSRRRSMTSYVAPPPKPKIPVKTDGDDVHDTEEEEEEEEEEEEEDEEEEEEDDDDDDEEEDGEEEDNGDVDESQG

Query:  EEFNEDDNEDSD
             DDNEDSD
Subjt:  EEFNEDDNEDSD

A0A6J1JTY1 glutamic acid-rich protein-like isoform X1; e value: 1.2e-198; % identity: 82.65
Query:  MAEELQDKDAPNEEAMDVAVDIERKIHNAMRSRISHFKEQADSLTFEGVRRLLEKDLCLEKYALDVHKRYVKQCLVKCLESAEEDNASKDSEEIGRKSAS
        MAEELQD DAPNEEAMDV V IE KI NAM SR+SHFKEQADSLTFEGVRRLLE DLC+E YALDVHKRY+KQCLVKCLE  EEDNASK SEE G KS S
Subjt:  MAEELQDKDAPNEEAMDVAVDIERKIHNAMRSRISHFKEQADSLTFEGVRRLLEKDLCLEKYALDVHKRYVKQCLVKCLESAEEDNASKDSEEIGRKSAS

Query:  KGEAAESLEGHQSEKDVNEPCVE-YENMEDSPVMGLLTGRKTKNVESDGIKGTKDKDDKDIPSESTITEAIRKRTSYLKANSEKVTMAGVRRLLEDDLKL
        +GEAAESLEGHQS+K   EPC+E  E MEDSPVMGLL G KTKN ESD +KG KDKDDKDIP+ESTI +AIRKRT YLKANSEKVTMAGVRRLLEDDLKL
Subjt:  KGEAAESLEGHQSEKDVNEPCVE-YENMEDSPVMGLLTGRKTKNVESDGIKGTKDKDDKDIPSESTITEAIRKRTSYLKANSEKVTMAGVRRLLEDDLKL

Query:  TTKALDSCKKFISQQVEKILTSCEAAEQVSNEKKGSCLKTPKKVSKESSHSTE-GSSSEEENDEVKPGKKNATKGRIPNSNETKKRKRSTMETVSAKKQS
        T  ALD CKKFISQQVE+IL SCEAAEQVSNEKKGS LKTPKKVSKESSHSTE GSSSEEE+DEVKP KKN TKG I NSNE KKRKRST E VSAKKQ 
Subjt:  TTKALDSCKKFISQQVEKILTSCEAAEQVSNEKKGSCLKTPKKVSKESSHSTE-GSSSEEENDEVKPGKKNATKGRIPNSNETKKRKRSTMETVSAKKQS

Query:  KHVQQTSEEDSD-GGGENGSEDGQSDSSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPAIYKKVKQAPESKRESQLIKELEGILSKEGLSANPSEK
        KHV  T EEDSD  GGEN SEDG S+SSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPP+IYKKVKQAPESKRESQLIKELEGILS+EGLS NP+EK
Subjt:  KHVQQTSEEDSD-GGGENGSEDGQSDSSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPAIYKKVKQAPESKRESQLIKELEGILSKEGLSANPSEK

Query:  EIKEVKKKKERAKELEGIDLSNIVSSSRRRSMTSYVAPPPKPKIPVKTDGDDVHDTEEEEEEEEEEEEEDEEEEEEDDDDDDEEEDGEEEDNGDVDESQG
        EIK+VKKKKERAKELEGIDLSNIVSSSRRRS +SY APPPKPKIPVKT+GDDV DT            +DEEEEE+DDDDDD EE+ +EEDNGDVDESQG
Subjt:  EIKEVKKKKERAKELEGIDLSNIVSSSRRRSMTSYVAPPPKPKIPVKTDGDDVHDTEEEEEEEEEEEEEDEEEEEEDDDDDDEEEDGEEEDNGDVDESQG

Query:  -EEFNEDDNEDSD
         EEFNEDDNEDSD
Subjt:  -EEFNEDDNEDSD

A0A6J1K3E3 glutamic acid-rich protein-like isoform X2; e value: 3.6e-195; % identity: 81.64
Query:  MAEELQDKDAPNEEAMDVAVDIERKIHNAMRSRISHFKEQADSLTFEGVRRLLEKDLCLEKYALDVHKRYVKQCLVKCLESAEEDNASKDSEEIGRKSAS
        MAEELQD DAPNEEAMDV V IE KI NAM SR+SHFKEQADSLTFEGVRRLLE DLC+E YALDVHKRY+KQCLVKCLE  EEDNASK SEE G KS S
Subjt:  MAEELQDKDAPNEEAMDVAVDIERKIHNAMRSRISHFKEQADSLTFEGVRRLLEKDLCLEKYALDVHKRYVKQCLVKCLESAEEDNASKDSEEIGRKSAS

Query:  KGEAAESLEGHQSEKDVNEPCVE-YENMEDSPVMGLLTGRKTKNVESDGIKGTKDKDDKDIPSESTITEAIRKRTSYLKANSEKVTMAGVRRLLEDDLKL
        +GEAAESLEGHQS+K   EPC+E  E MEDSPVMGLL G KTKN ESD +KG KDKDDKDIP+ESTI +AIRKRT YLKANSEKVTMAGVRRLLEDDLKL
Subjt:  KGEAAESLEGHQSEKDVNEPCVE-YENMEDSPVMGLLTGRKTKNVESDGIKGTKDKDDKDIPSESTITEAIRKRTSYLKANSEKVTMAGVRRLLEDDLKL

Query:  TTKALDSCKKFISQQVEKILTSCEAAEQVSNEKKGSCLKTPKKVSKESSHSTE-GSSSEEENDEVKPGKKNATKGRIPNSNETKKRKRSTMETVSAKKQS
        T  ALD CKKFISQQVE+IL SCEAAEQVSNEKKGS LKTPKKVSKESSHSTE GSSSEEE+DEVKP KKN TKG I NSNE KKRKRST E VSAKKQ 
Subjt:  TTKALDSCKKFISQQVEKILTSCEAAEQVSNEKKGSCLKTPKKVSKESSHSTE-GSSSEEENDEVKPGKKNATKGRIPNSNETKKRKRSTMETVSAKKQS

Query:  KHVQQTSEEDSD-GGGENGSEDGQSDSSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPAIYKKVKQAPESKRESQLIKELEGILSKEGLSANPSEK
        KHV  T EEDSD  GGEN SEDG S+SSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPP+IYKKVKQAPESKRESQLIKELEGILS+EGLS NP+EK
Subjt:  KHVQQTSEEDSD-GGGENGSEDGQSDSSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPAIYKKVKQAPESKRESQLIKELEGILSKEGLSANPSEK

Query:  EIKEVKKKKERAKELEGIDLSNIVSSSRRRSMTSYVAPPPKPKIPVKTDGDDVHDTEEEEEEEEEEEEEDEEEEEEDDDDDDEEEDGEEEDNGDVDESQG
        EIK+VKKKKERAKELEGIDLSNIVSSSRRRS +SY APPPKPKIPVKT+GDDV DT            +DEEEEE+DDDDDD EE+ +EEDNGDVDESQ 
Subjt:  EIKEVKKKKERAKELEGIDLSNIVSSSRRRSMTSYVAPPPKPKIPVKTDGDDVHDTEEEEEEEEEEEEEDEEEEEEDDDDDDEEEDGEEEDNGDVDESQG

Query:  EEFNEDDNEDSD
             DDNEDSD
Subjt:  EEFNEDDNEDSD

SwissProt top hits (columns: e value, % identity, alignment)
No hits found

Arabidopsis top hits (columns: e value, % identity, alignment)
AT1G44780.1 CONTAINS InterPro DOMAIN/s: Histone chaperone domain CHZ (InterPro:IPR019098); e value: 4.0e-69; % identity: 44.47
Query:  AVDIERKIHNAMRSRISHFKEQADSLTFEGVRRLLEKDLCLEKYALDVHKRYVKQCLVKCLESAEEDNASKDSEEIGRK--SASKGEAAESLEGHQSEKD
        A +IE KI  A+RSR+++ + +AD  T   VRR+LE+D+ LEK  LDV+K +VK+ LVKCLE A  ++ S++S+E  R+       E AE  E H+   D
Subjt:  AVDIERKIHNAMRSRISHFKEQADSLTFEGVRRLLEKDLCLEKYALDVHKRYVKQCLVKCLESAEEDNASKDSEEIGRK--SASKGEAAESLEGHQSEKD

Query:  VNEPCVEYENMEDSPVMGLLTGRKTKNVESDGIKGTKDKDDKDIPSESTITEAIRKRTSYLKANSEKVTMAGVRRLLEDDLKLTTKALDSCKKFISQQVE
          E     EN          + R+ K+V+  G K T  +D         I  A+RKR SY+KANSE +TMA +RRLLE+DLKL  ++LD  KKFI+++++
Subjt:  VNEPCVEYENMEDSPVMGLLTGRKTKNVESDGIKGTKDKDDKDIPSESTITEAIRKRTSYLKANSEKVTMAGVRRLLEDDLKLTTKALDSCKKFISQQVE

Query:  KIL-----TSCEAAEQVSNEKKGSCLKTPKKVSKESSHSTEGSSSEEENDEVKPGKKNATKGRIPNSNETKKRKRSTMETVSAKKQSKHVQQTSEEDSDG
        ++L       C     V N KK     TP K+     +S   +    +N+EV   K  A K ++       KRK    + VS +K++KH +  SE DSD 
Subjt:  KIL-----TSCEAAEQVSNEKKGSCLKTPKKVSKESSHSTEGSSSEEENDEVKPGKKNATKGRIPNSNETKKRKRSTMETVSAKKQSKHVQQTSEEDSDG

Query:  GGENGSEDGQSDSSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPAIYKKVKQAPESKRESQLIKELEGILSKEGLSANPSEKEIKEVKKKKERAKE
        G          DS       KE +T VYGKRVEHLKSVIKSCGMSVPP IYKK KQAP+ KRE+ LI+ELE IL+KEGLS++PS  EIKEVKK+K  ++E
Subjt:  GGENGSEDGQSDSSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPAIYKKVKQAPESKRESQLIKELEGILSKEGLSANPSEKEIKEVKKKKERAKE

Query:  LEGIDLSNIVSSSRRRSMTSYVAPPPKPKIPV--KTDGDDVHDTEEEEEEEEEEEEEDEEEEEEDDDDDDEEEDGEEED
        LEGID +NIV +SRRRS TS+ APPPKPK+    +++ D+  D+E EEE  E+ E   + EE E++ + +E++ GEE D
Subjt:  LEGIDLSNIVSSSRRRSMTSYVAPPPKPKIPV--KTDGDDVHDTEEEEEEEEEEEEEDEEEEEEDDDDDDEEEDGEEED

AT1G44780.2 INVOLVED IN: biological_process unknown; e value: 4.0e-69; % identity: 44.58
Query:  AVDIERKIHNAMRSRISHFKEQADSLTFEGVRRLLEKDLCLEKYALDVHKRYVKQCLVKCLESAEEDNASKDSEEIGRK--SASKGEAAESLEGHQSEKD
        A +IE KI  A+RSR+++ + +AD  T   VRR+LE+D+ LEK  LDV+K +VK+ LVKCLE A  ++ S++S+E  R+       E AE  E H+   D
Subjt:  AVDIERKIHNAMRSRISHFKEQADSLTFEGVRRLLEKDLCLEKYALDVHKRYVKQCLVKCLESAEEDNASKDSEEIGRK--SASKGEAAESLEGHQSEKD

Query:  VNEPCVEYENMEDSPVMGLLTGRKTKNVESDGIKGTKDKDDKDIPSESTITEAIRKRTSYLKANSEKVTMAGVRRLLEDDLKLTTKALDSCKKFISQQVE
          E     EN          + R+ K+V+  G K T  +D         I  A+RKR SY+KANSE +TMA +RRLLE+DLKL  ++LD  KKFI+++++
Subjt:  VNEPCVEYENMEDSPVMGLLTGRKTKNVESDGIKGTKDKDDKDIPSESTITEAIRKRTSYLKANSEKVTMAGVRRLLEDDLKLTTKALDSCKKFISQQVE

Query:  KIL-----TSCEAAEQVSNEKKGSCLKTPKKVSKESSHSTEGSSSEEENDEVKPGKKNATKGRIPNSNETKKRKRSTMETVSAKKQSKHVQQTSEEDSDG
        ++L       C     V N KK     TP K+     +S   +    +N+EV   K  A K ++       KRK    + VS +K++KH +  SE DSD 
Subjt:  KIL-----TSCEAAEQVSNEKKGSCLKTPKKVSKESSHSTEGSSSEEENDEVKPGKKNATKGRIPNSNETKKRKRSTMETVSAKKQSKHVQQTSEEDSDG

Query:  GGENGSEDGQSDSSNEKPVK-KEVSTPVYGKRVEHLKSVIKSCGMSVPPAIYKKVKQAPESKRESQLIKELEGILSKEGLSANPSEKEIKEVKKKKERAK
        G             +EK +K KE +T VYGKRVEHLKSVIKSCGMSVPP IYKK KQAP+ KRE+ LI+ELE IL+KEGLS++PS  EIKEVKK+K  ++
Subjt:  GGENGSEDGQSDSSNEKPVK-KEVSTPVYGKRVEHLKSVIKSCGMSVPPAIYKKVKQAPESKRESQLIKELEGILSKEGLSANPSEKEIKEVKKKKERAK

Query:  ELEGIDLSNIVSSSRRRSMTSYVAPPPKPKIPV--KTDGDDVHDTEEEEEEEEEEEEEDEEEEEEDDDDDDEEEDGEEED
        ELEGID +NIV +SRRRS TS+ APPPKPK+    +++ D+  D+E EEE  E+ E   + EE E++ + +E++ GEE D
Subjt:  ELEGIDLSNIVSSSRRRSMTSYVAPPPKPKIPV--KTDGDDVHDTEEEEEEEEEEEEEDEEEEEEDDDDDDEEEDGEEED

AT4G08310.1 FUNCTIONS IN: molecular_function unknown; e value: 1.8e-82; % identity: 49.7
Query:  DIERKIHNAMRSRISHFKEQADSLTFEGVRRLLEKDLCLEKYALDVHKRYVKQCLVKCLESAEEDNASKDSEEIGRKS--ASKGEAAESLEGHQSEKDVN
        DIE +I  AM+SR+++ +++AD+ TFEGVRRLLE+DL LEK+ALDVHK +VKQ LV+CL  AE D  S++S E  +K       EAAE  + H ++KD  
Subjt:  DIERKIHNAMRSRISHFKEQADSLTFEGVRRLLEKDLCLEKYALDVHKRYVKQCLVKCLESAEEDNASKDSEEIGRKS--ASKGEAAESLEGHQSEKDVN

Query:  EPCV-EYENMEDSPVMGLLTGRKTKNVESDGIKGTKDKDDKDIPSESTITEAIRKRTSYLKANSEKVTMAGVRRLLEDDLKLTTKALDSCKKFISQQVEK
        E    + E  +DSPVMGLLT    +N      + TKD+D + +  +S I +A+RKR+SY+KANSEK+TM  +RRLLE DLKL   +LD  KKFI+ ++++
Subjt:  EPCV-EYENMEDSPVMGLLTGRKTKNVESDGIKGTKDKDDKDIPSESTITEAIRKRTSYLKANSEKVTMAGVRRLLEDDLKLTTKALDSCKKFISQQVEK

Query:  ILTSCEAAEQVSNEKKGSCLKTPKKVSKESSHSTE----GSSSEEENDEVKPGKKNATKGRIPNSNETKKRKRSTMETVSAKKQSKHVQQTSEEDSDGGG
        IL + EA +  +  ++    K  K    ++S S E        EEE+ EV   KK A K ++  S  T KRKR   +  SAKK     Q  S+ DSD   
Subjt:  ILTSCEAAEQVSNEKKGSCLKTPKKVSKESSHSTE----GSSSEEENDEVKPGKKNATKGRIPNSNETKKRKRSTMETVSAKKQSKHVQQTSEEDSDGGG

Query:  ENGSEDGQSDSSNEKPVKK-EVSTPVYGKRVEHLKSVIKSCGMSVPPAIYKKVKQAPESKRESQLIKELEGILSKEGLSANPSEKEIKEVKKKKERAKEL
              G+   S+EK VKK E  T  YGKRVEHLKS+IKSCGMS+ P++Y+K KQAPE KRE  LIKEL+ +L+KEGLSANPSEKEIKEVKK+KER KEL
Subjt:  ENGSEDGQSDSSNEKPVKK-EVSTPVYGKRVEHLKSVIKSCGMSVPPAIYKKVKQAPESKRESQLIKELEGILSKEGLSANPSEKEIKEVKKKKERAKEL

Query:  EGIDLSNIVSSSRRRSMTSYVAPPPKPKIPVKTDGDDVHDTEEEEEEEEEEEEEDEEEEEEDDDDDDEEEDGEEEDNGDVDESQGEEFNEDDNED
        EGID SNIVSSSRRRS  S+V PPPK   P+K +  +  D+E+ E EE+E+EE   EEEEE      EE++G  ED G+  +++GE   ED  E+
Subjt:  EGIDLSNIVSSSRRRSMTSYVAPPPKPKIPVKTDGDDVHDTEEEEEEEEEEEEEDEEEEEEDDDDDDEEEDGEEEDNGDVDESQGEEFNEDDNED


Sequences
CDS sequence
ATGGCGGAGGAATTACAGGACAAAGATGCTCCGAACGAAGAAGCCATGGATGTAGCTGTTGATATAGAGAGGAAGATTCATAACGCTATGCGCTCTCGCATCTCTCACTT
CAAGGAACAAGCCGACTCTTTAACTTTTGAGGGGGTTAGGAGATTGCTTGAAAAGGACTTGTGTTTGGAGAAGTATGCATTAGATGTGCATAAAAGATATGTCAAGCAGT
GTTTGGTGAAGTGCTTAGAAAGTGCTGAGGAAGACAATGCCTCCAAGGATTCTGAGGAGATTGGGAGAAAAAGTGCAAGTAAAGGAGAAGCGGCTGAGTCACTTGAAGGG
CATCAGTCCGAGAAGGATGTAAACGAACCTTGCGTGGAATATGAGAACATGGAAGACTCTCCAGTTATGGGCCTTCTCACAGGACGTAAAACAAAAAATGTTGAATCTGA
TGGAATCAAAGGAACCAAAGACAAAGATGACAAAGATATTCCTAGTGAGAGTACAATTACGGAAGCTATTAGAAAAAGAACTTCTTATCTTAAAGCTAATTCCGAGAAAG
TTACTATGGCTGGAGTTCGCCGCCTTCTGGAGGATGACCTTAAACTTACTACAAAAGCTCTCGACAGTTGCAAGAAGTTTATAAGCCAACAAGTAGAGAAGATATTGACT
TCTTGTGAAGCTGCTGAACAAGTTTCTAATGAAAAGAAAGGTTCTTGTTTGAAAACTCCGAAAAAGGTAAGCAAAGAAAGCTCTCATTCTACGGAAGGGAGCAGTAGTGA
GGAGGAAAACGATGAAGTAAAGCCTGGAAAGAAAAATGCAACTAAAGGAAGAATACCGAACTCTAACGAAACAAAAAAGCGGAAAAGGTCTACAATGGAGACTGTCTCTG
CCAAGAAGCAAAGCAAGCATGTCCAGCAGACATCAGAGGAGGATAGTGATGGAGGTGGTGAAAATGGCTCTGAAGATGGCCAGTCTGACTCATCCAATGAAAAACCTGTG
AAGAAGGAAGTTTCAACTCCTGTCTATGGTAAGCGTGTGGAGCACTTGAAATCGGTTATCAAATCGTGTGGGATGAGTGTTCCTCCAGCGATTTATAAGAAAGTCAAGCA
GGCACCTGAAAGCAAACGTGAATCACAACTTATCAAGGAGTTGGAGGGGATACTATCCAAAGAAGGATTGTCTGCTAATCCCTCTGAAAAAGAAATTAAGGAAGTTAAAA
AGAAGAAGGAAAGGGCCAAAGAACTTGAAGGCATTGACTTAAGCAATATCGTTTCAAGTTCACGTAGAAGATCCATGACCAGCTATGTAGCACCACCTCCAAAACCGAAA
ATACCAGTTAAAACTGATGGTGATGATGTTCATGATACTGAGGAGGAGGAGGAGGAGGAGGAGGAGGAGGAGGAGGAGGATGAAGAAGAAGAAGAAGAAGATGACGACGA
TGACGACGAAGAGGAGGATGGCGAGGAAGAGGATAATGGCGATGTTGATGAAAGCCAAGGCGAAGAATTCAATGAGGATGACAATGAAGACAGTGATTGA
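
The CDS above should translate into the protein sequence listed further down the page. The following is a minimal Python sketch of that check using the standard genetic code; it assumes the standard nuclear code applies, and `translate` is a hypothetical helper written for this illustration. Only the first 30 and last 12 nucleotides of the CDS are translated here; the same function accepts the full sequence once its lines are concatenated.

```python
from itertools import product

# Standard genetic code; codons enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa
               for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

first = translate("ATGGCGGAGGAATTACAGGACAAAGATGCT")  # first 30 nt of the CDS
last = translate("GACAGTGATTGA")                     # last 12 nt of the CDS
print(first, last)
```

The two fragments correspond to the start (`MAEELQDKDA`) and end (`DSD`, followed by the stop codon) of the protein sequence on this page.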
mRNA sequence
AGACTCGGATAGCTTCTGAAGGAGGTCGAGATTTTGTTCAAGCAAATTCCTCAATCTCCGATTCTCCGATCGCAGAGCATCGAGTTCTTCTTCCTCTGTAGCAGTGCGTT
CGGTGGCATCTTTGCTGTGTTTATGGTGATTGCAACACTCAATCGCCGACCAGGCCACGTCGGCGACCTCCATGACCGTCTTGGCCACCTCCAAAGCGCCGTGGCTGCCT
TTCATCTGTACAAACCTTCAGTTGCGATCAAACCTGCTTCAATAATGGCGTCGTTTTGTTGCGGGAGCGTTTTCCTTCTACGGAACTCGGAAGGATAATGATCACCAGAT
TTGGACCAAGCCATTTTCAAAGGCTTCGCATTTCATACAAAGGAACGGCGGAGATCGAGATCAAAATTTCGCGAGATCTTCATTGTGTACGAGCACCCACACAATAACAA
AAATCTCTCCCCCCTCTTTTTTTCTGTCAAGCAGAGTCATTCTTGGAAAGTCCCAACACACATTCAGTCCAGAGCCCATAATTTCTTCGTTCGCACAAAAAATCGGTGCG
AGTCGTGGAAGAGCAAAATGGCGGAGGAATTACAGGACAAAGATGCTCCGAACGAAGAAGCCATGGATGTAGCTGTTGATATAGAGAGGAAGATTCATAACGCTATGCGC
TCTCGCATCTCTCACTTCAAGGAACAAGCCGACTCTTTAACTTTTGAGGGGGTTAGGAGATTGCTTGAAAAGGACTTGTGTTTGGAGAAGTATGCATTAGATGTGCATAA
AAGATATGTCAAGCAGTGTTTGGTGAAGTGCTTAGAAAGTGCTGAGGAAGACAATGCCTCCAAGGATTCTGAGGAGATTGGGAGAAAAAGTGCAAGTAAAGGAGAAGCGG
CTGAGTCACTTGAAGGGCATCAGTCCGAGAAGGATGTAAACGAACCTTGCGTGGAATATGAGAACATGGAAGACTCTCCAGTTATGGGCCTTCTCACAGGACGTAAAACA
AAAAATGTTGAATCTGATGGAATCAAAGGAACCAAAGACAAAGATGACAAAGATATTCCTAGTGAGAGTACAATTACGGAAGCTATTAGAAAAAGAACTTCTTATCTTAA
AGCTAATTCCGAGAAAGTTACTATGGCTGGAGTTCGCCGCCTTCTGGAGGATGACCTTAAACTTACTACAAAAGCTCTCGACAGTTGCAAGAAGTTTATAAGCCAACAAG
TAGAGAAGATATTGACTTCTTGTGAAGCTGCTGAACAAGTTTCTAATGAAAAGAAAGGTTCTTGTTTGAAAACTCCGAAAAAGGTAAGCAAAGAAAGCTCTCATTCTACG
GAAGGGAGCAGTAGTGAGGAGGAAAACGATGAAGTAAAGCCTGGAAAGAAAAATGCAACTAAAGGAAGAATACCGAACTCTAACGAAACAAAAAAGCGGAAAAGGTCTAC
AATGGAGACTGTCTCTGCCAAGAAGCAAAGCAAGCATGTCCAGCAGACATCAGAGGAGGATAGTGATGGAGGTGGTGAAAATGGCTCTGAAGATGGCCAGTCTGACTCAT
CCAATGAAAAACCTGTGAAGAAGGAAGTTTCAACTCCTGTCTATGGTAAGCGTGTGGAGCACTTGAAATCGGTTATCAAATCGTGTGGGATGAGTGTTCCTCCAGCGATT
TATAAGAAAGTCAAGCAGGCACCTGAAAGCAAACGTGAATCACAACTTATCAAGGAGTTGGAGGGGATACTATCCAAAGAAGGATTGTCTGCTAATCCCTCTGAAAAAGA
AATTAAGGAAGTTAAAAAGAAGAAGGAAAGGGCCAAAGAACTTGAAGGCATTGACTTAAGCAATATCGTTTCAAGTTCACGTAGAAGATCCATGACCAGCTATGTAGCAC
CACCTCCAAAACCGAAAATACCAGTTAAAACTGATGGTGATGATGTTCATGATACTGAGGAGGAGGAGGAGGAGGAGGAGGAGGAGGAGGAGGAGGATGAAGAAGAAGAA
GAAGAAGATGACGACGATGACGACGAAGAGGAGGATGGCGAGGAAGAGGATAATGGCGATGTTGATGAAAGCCAAGGCGAAGAATTCAATGAGGATGACAATGAAGACAG
TGATTGAAACCGGAAAGAGCATCCAAGATTCTCGCATCTGATCGTCGATCAACGTAGAAATGATCCAGTGTAATCTTAATTCTCTATGTAGCTCCCTTTTCTCTTTAGTT
TTATTGGAGAGCTTGTAGTCAGCGATATGAGAGCTATATCTAGGAGATTGGACGTGGTATTATTGTACAATATTTTTTACTCTTCATGATAAAGACGAAATAAAAGAACA
CTTATATAATTTTGATTT
Protein sequence
MAEELQDKDAPNEEAMDVAVDIERKIHNAMRSRISHFKEQADSLTFEGVRRLLEKDLCLEKYALDVHKRYVKQCLVKCLESAEEDNASKDSEEIGRKSASKGEAAESLEG
HQSEKDVNEPCVEYENMEDSPVMGLLTGRKTKNVESDGIKGTKDKDDKDIPSESTITEAIRKRTSYLKANSEKVTMAGVRRLLEDDLKLTTKALDSCKKFISQQVEKILT
SCEAAEQVSNEKKGSCLKTPKKVSKESSHSTEGSSSEEENDEVKPGKKNATKGRIPNSNETKKRKRSTMETVSAKKQSKHVQQTSEEDSDGGGENGSEDGQSDSSNEKPV
KKEVSTPVYGKRVEHLKSVIKSCGMSVPPAIYKKVKQAPESKRESQLIKELEGILSKEGLSANPSEKEIKEVKKKKERAKELEGIDLSNIVSSSRRRSMTSYVAPPPKPK
IPVKTDGDDVHDTEEEEEEEEEEEEEDEEEEEEDDDDDDEEEDGEEEDNGDVDESQGEEFNEDDNEDSD
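
The description field labels this gene product "glutamic acid-rich". One simple way to look at that is to compute the fraction of a sequence made up of a given residue. The sketch below is illustrative only: `residue_fraction` is a hypothetical helper, and the example uses just the final line of the protein sequence above rather than the full protein.

```python
from collections import Counter

def residue_fraction(seq, residue):
    """Fraction of positions in seq occupied by the given one-letter residue."""
    seq = "".join(seq.split()).upper()  # tolerate wrapped sequence lines
    if not seq:
        raise ValueError("empty sequence")
    return Counter(seq)[residue.upper()] / len(seq)

# C-terminal stretch copied from the last line of the protein sequence.
tail = ("IPVKTDGDDVHDTEEEEEEEEEEEEEDEEEEEEDDDDDDEEEDGEEEDNGDVDESQGEEFNEDDNEDSD")
glu = residue_fraction(tail, "E")
asp = residue_fraction(tail, "D")
print(f"E: {glu:.1%}  D: {asp:.1%}")
```

Running the same function over the full concatenated protein gives the whole-protein composition.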