CuGenDBv2

Lsi08G004240 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi08G004240
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: glutamic acid-rich protein-like isoform X2
Genome location: chr08:11760297..11769625
RNA-Seq Expression: Lsi08G004240
Synteny: Lsi08G004240
Gene Ontology terms: GO:0005634 - nucleus (cellular component)
GO:0016874 - ligase activity (molecular function)
InterPro domains: IPR019098 - Histone chaperone domain CHZ
IPR037647 - HIRA-interacting protein 3
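
The genome location above is written as `chromosome:start..end`. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state the convention), the span of the gene on the chromosome can be computed as follows. The function name `parse_location` is hypothetical and is not part of CuGenDB.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chr:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("chr08:11760297..11769625")
# Assumption: 1-based inclusive coordinates, hence the +1.
span = end - start + 1
print(chrom, span)  # chr08 9329
```

The span covers the whole gene model on the chromosome, so it is expected to be longer than the mRNA and CDS sequences listed further down.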


Homology
GenBank top hits (e value, % identity, alignment)
KAG7016498.1 hypothetical protein SDJN02_21607 [Cucurbita argyrosperma subsp. argyrosperma] | e value: 2.4e-188 | % identity: 80
Query:  MAEELQDNDASNEEAMDVAVDIETKIQNAMRSRVSHFKEQADPNPVVPDNPLSSADDRHNNPPLDWDERFSLSVEGVLEDKIFFGRERNKERRQCLEGAE
        MAEELQDNDA NEEAMDV V IETKIQNAM SRVSHFKEQAD         L   D       LD  +R+   ++  L               +CLEG E
Subjt:  MAEELQDNDASNEEAMDVAVDIETKIQNAMRSRVSHFKEQADPNPVVPDNPLSSADDRHNNPPLDWDERFSLSVEGVLEDKIFFGRERNKERRQCLEGAE

Query:  EENASKDSEETGRKSVSKEEAADSLEGHQSKKGVKEPCLEDEEKMEDSPVMGLLTGRKTKNVESDGIKGIKEKDDKDIPSESTIMKAIRKRISYLKANSE
        E+NASK SEETG KSVS+ EAA+SLEGHQSKKG KEPCLEDEEKMEDSPVMGLL G KTKNVESD IKGIK+KDDKDIP+ESTI KAIRKR  YLKANSE
Subjt:  EENASKDSEETGRKSVSKEEAADSLEGHQSKKGVKEPCLEDEEKMEDSPVMGLLTGRKTKNVESDGIKGIKEKDDKDIPSESTIMKAIRKRISYLKANSE

Query:  KVTMAGVRRLLEDDLKLTKNALDSCKKFISQQIEEILTSCEAAGQISNEKKGSRLKTPKKVSKESSHSTE-GSSSEEENDEVKPGKKNATKGRIPNSNET
        KVTMAGVRRLLEDDLKLTK ALD CKKFISQQ+EEIL SCEAA Q+SNEKKGSRLKTPKKVSKESSHSTE GSSSEEE+DEVKP KKN TKGRI NSNET
Subjt:  KVTMAGVRRLLEDDLKLTKNALDSCKKFISQQIEEILTSCEAAGQISNEKKGSRLKTPKKVSKESSHSTE-GSSSEEENDEVKPGKKNATKGRIPNSNET

Query:  KKRKRSTKETVSAKKQSKHVQQTSEEDSD-EGGENGSEDGQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQLIKE
        KKRKRSTKE VSAKKQ KHVQ TSEEDSD EGGEN SEDG SESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQLIKE
Subjt:  KKRKRSTKETVSAKKQSKHVQQTSEEDSD-EGGENGSEDGQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQLIKE

Query:  LEGILSREGLSANPTEKEIKEVKKKKERAKELEGIDLSNIVSSSRRRSTTSYVAPPPKPKIPVKTDGDDVDDTEEEEENDEEEEDDDEEE--EEDGEEED
        LEGILSREGLSANPTEKEIK+VKKKKERAKELEGIDLSNIVSSSRRRST+SYVAPPPKPKIPVKT+GDDVDDT++E+++D++++DDD++E  EE+ +EED
Subjt:  LEGILSREGLSANPTEKEIKEVKKKKERAKELEGIDLSNIVSSSRRRSTTSYVAPPPKPKIPVKTDGDDVDDTEEEEENDEEEEDDDEEE--EEDGEEED

Query:  NGDVDESQG-EEFNE
        NGDVDESQG EEFNE
Subjt:  NGDVDESQG-EEFNE

XP_022939456.1 DNA ligase 1-like isoform X1 [Cucurbita moschata] | e value: 1.8e-188 | % identity: 80.51
Query:  MAEELQDNDASNEEAMDVAVDIETKIQNAMRSRVSHFKEQADPNPVVPDNPLSSADDRHNNPPLDWDERFSLSVEGVLEDKIFFGRERNKERRQCLEGAE
        MAEELQDNDA NEEAMDV V IETKIQNAM SRVSHFKEQAD         L   D       LD  +R+   ++  L               +CLEG E
Subjt:  MAEELQDNDASNEEAMDVAVDIETKIQNAMRSRVSHFKEQADPNPVVPDNPLSSADDRHNNPPLDWDERFSLSVEGVLEDKIFFGRERNKERRQCLEGAE

Query:  EENASKDSEETGRKSVSKEEAADSLEGHQSKKGVKEPCLEDEEKMEDSPVMGLLTGRKTKNVESDGIKGIKEKDDKDIPSESTIMKAIRKRISYLKANSE
        E+NASK SEETG KSVS+ EAA+SLEGHQSKKG KEPCLEDEEKMEDSPVMGLL G KTKNVESD IKGIK+KDDKDIP+ESTI KAIRKR  YLKANSE
Subjt:  EENASKDSEETGRKSVSKEEAADSLEGHQSKKGVKEPCLEDEEKMEDSPVMGLLTGRKTKNVESDGIKGIKEKDDKDIPSESTIMKAIRKRISYLKANSE

Query:  KVTMAGVRRLLEDDLKLTKNALDSCKKFISQQIEEILTSCEAAGQISNEKKGSRLKTPKKVSKESSHSTE-GSSSEEENDEVKPGKKNATKGRIPNSNET
        KVTMAGVRRLLEDDLKLTK ALD CKKFISQQ+EEIL SCEAA ++SNEKKGSRLKTPKKVSKESSHSTE GSSSEEE+DEVKP KKN TKGRI NSNET
Subjt:  KVTMAGVRRLLEDDLKLTKNALDSCKKFISQQIEEILTSCEAAGQISNEKKGSRLKTPKKVSKESSHSTE-GSSSEEENDEVKPGKKNATKGRIPNSNET

Query:  KKRKRSTKETVSAKKQSKHVQQTSEEDSD-EGGENGSEDGQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQLIKE
        KKRKRSTKE VSAKKQ KHVQ TSEEDSD EGGEN SEDG SESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQLIKE
Subjt:  KKRKRSTKETVSAKKQSKHVQQTSEEDSD-EGGENGSEDGQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQLIKE

Query:  LEGILSREGLSANPTEKEIKEVKKKKERAKELEGIDLSNIVSSSRRRSTTSYVAPPPKPKIPVKTDGDDVDDTEEEEENDEEEEDDDEEEEEDGEEEDNG
        LEGILSREGLSANPTEKEIK+VKKKKERAKELEGIDLSNIVSSSRRRST+SYVAPPPKPKIPVKT+GDDVDDT++EEE D++++DD++ EEED +EEDNG
Subjt:  LEGILSREGLSANPTEKEIKEVKKKKERAKELEGIDLSNIVSSSRRRSTTSYVAPPPKPKIPVKTDGDDVDDTEEEEENDEEEEDDDEEEEEDGEEEDNG

Query:  DVDESQG-EEFNE
        DVDESQG EEFNE
Subjt:  DVDESQG-EEFNE

XP_023551365.1 glutamic acid-rich protein-like isoform X1 [Cucurbita pepo subsp. pepo] | e value: 1.1e-188 | % identity: 80.7
Query:  MAEELQDNDASNEEAMDVAVDIETKIQNAMRSRVSHFKEQADPNPVVPDNPLSSADDRHNNPPLDWDERFSLSVEGVLEDKIFFGRERNKERRQCLEGAE
        MAEELQDNDA NEEAMDV V IETKIQNAM SRVSHFKEQAD         L   D       LD  +R+   ++  L               +CLEG E
Subjt:  MAEELQDNDASNEEAMDVAVDIETKIQNAMRSRVSHFKEQADPNPVVPDNPLSSADDRHNNPPLDWDERFSLSVEGVLEDKIFFGRERNKERRQCLEGAE

Query:  EENASKDSEETGRKSVSKEEAADSLEGHQSKKGVKEPCLEDEEKMEDSPVMGLLTGRKTKNVESDGIKGIKEKDDKDIPSESTIMKAIRKRISYLKANSE
        E+N SK SEETG KSVS+ EAA+SLEGHQSKKG KEPCLEDEEKMEDSPVMGLL G KTKNVESD +KGIK+KDDKDIP+E+TI KAIRKR  YLKANSE
Subjt:  EENASKDSEETGRKSVSKEEAADSLEGHQSKKGVKEPCLEDEEKMEDSPVMGLLTGRKTKNVESDGIKGIKEKDDKDIPSESTIMKAIRKRISYLKANSE

Query:  KVTMAGVRRLLEDDLKLTKNALDSCKKFISQQIEEILTSCEAAGQISNEKKGSRLKTPKKVSKESSHSTE-GSSSEEENDEVKPGKKNATKGRIPNSNET
        KVTMAGVRRLLEDDLKLTK ALD CKKFISQQ+EEIL SCEAA Q+SNEKKGSRLKTPKKVSKESSHSTE GSSSEEE+DEVKP KKN TKGRI NSNET
Subjt:  KVTMAGVRRLLEDDLKLTKNALDSCKKFISQQIEEILTSCEAAGQISNEKKGSRLKTPKKVSKESSHSTE-GSSSEEENDEVKPGKKNATKGRIPNSNET

Query:  KKRKRSTKETVSAKKQSKHVQQTSEEDSD-EGGENGSEDGQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQLIKE
        KKRKRSTKE VSAKKQ KHVQ TSEEDSD EGGEN SEDG SESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQLIKE
Subjt:  KKRKRSTKETVSAKKQSKHVQQTSEEDSD-EGGENGSEDGQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQLIKE

Query:  LEGILSREGLSANPTEKEIKEVKKKKERAKELEGIDLSNIVSSSRRRSTTSYVAPPPKPKIPVKTDGDDVDDTEEEEENDEEEEDDDEEEEEDGEEEDNG
        LEGILSREGLSANPTEKEIK+VKKKKERAKELEGIDLSNIVSSSRRRST+SYVAPPPKPKIPVKT+GDDVDDT+EEEE  EEE+DDDE+ EE+ +EEDNG
Subjt:  LEGILSREGLSANPTEKEIKEVKKKKERAKELEGIDLSNIVSSSRRRSTTSYVAPPPKPKIPVKTDGDDVDDTEEEEENDEEEEDDDEEEEEDGEEEDNG

Query:  DVDESQG-EEFNE
        DVDESQG EEFNE
Subjt:  DVDESQG-EEFNE

XP_038884709.1 glutamic acid-rich protein isoform X1 [Benincasa hispida] | e value: 6.1e-192 | % identity: 81.01
Query:  MAEELQDNDASNEEAMDVAVDIETKIQNAMRSRVSHFKEQADPNPVVPDNPLSSADDRHNNPPLDWDERFSLSVEGVLEDKIFFGRERNKERRQCLEGAE
        MAEELQD DASN++AMDVAVDIETKI NAMRSRVS+FKE+AD         L   D           E ++L V   L  +            +C E   
Subjt:  MAEELQDNDASNEEAMDVAVDIETKIQNAMRSRVSHFKEQADPNPVVPDNPLSSADDRHNNPPLDWDERFSLSVEGVLEDKIFFGRERNKERRQCLEGAE

Query:  EENASKDSEETGRKSVSKEEAADSLEGHQSKKGVKEPCLEDEEKMEDSPVMGLLTGRKTKNVESDGIKGIKEKDDKDIPSESTIMKAIRKRISYLKANSE
        E+N SK SEETGRKSV+KEEAA+ LEGHQSKKGVKEPC EDEEKMEDSPVMGLL  R TKNVESDGIKGIK+KDDKDIPSES I KAIRKR SYLKANSE
Subjt:  EENASKDSEETGRKSVSKEEAADSLEGHQSKKGVKEPCLEDEEKMEDSPVMGLLTGRKTKNVESDGIKGIKEKDDKDIPSESTIMKAIRKRISYLKANSE

Query:  KVTMAGVRRLLEDDLKLTKNALDSCKKFISQQIEEILTSCEAAGQISNEKKGSRLKTPKKVSKESSHSTEGSS----SEEENDEVKPGKKNATKGRIPNS
        KVTMAGVRRLLEDDLKLTKNALDSCKKFISQQ+EEILTSCEAA Q+SNEK   RLKTPKKVSKESSHSTEGSS    SEEENDEVKPGKKNATKGRIPNS
Subjt:  KVTMAGVRRLLEDDLKLTKNALDSCKKFISQQIEEILTSCEAAGQISNEKKGSRLKTPKKVSKESSHSTEGSS----SEEENDEVKPGKKNATKGRIPNS

Query:  NETKKRKRSTKETVSAKKQSKHVQQTSEEDSDEGGENGSEDGQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQLI
        NETKKRKRSTKET+SAKKQSKHVQ TSEED+DEGGEN SEDGQSESS+E+PVKKEVSTPVYGK VEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQLI
Subjt:  NETKKRKRSTKETVSAKKQSKHVQQTSEEDSDEGGENGSEDGQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQLI

Query:  KELEGILSREGLSANPTEKEIKEVKKKKERAKELEGIDLSNIVSSSRRRSTTSYVAPPPKPKIPVKT--DGDDVDDTEEEEENDEEEEDDDEEEEEDGEE
        KELEGILSREGLSANPTEKEIKEVKKKKERAKELEGIDLSNIVSSSRRRS TSY  PPPKPKIPVKT  DGDD DDTEEEEE DE+EEDDD+EEEEDGEE
Subjt:  KELEGILSREGLSANPTEKEIKEVKKKKERAKELEGIDLSNIVSSSRRRSTTSYVAPPPKPKIPVKT--DGDDVDDTEEEEENDEEEEDDDEEEEEDGEE

Query:  EDNGDVDESQGEEFNE
        EDNG+VD SQGEEFNE
Subjt:  EDNGDVDESQGEEFNE

XP_038884710.1 glutamic acid-rich protein isoform X2 [Benincasa hispida] | e value: 1.1e-188 | % identity: 80.47
Query:  MAEELQDNDASNEEAMDVAVDIETKIQNAMRSRVSHFKEQADPNPVVPDNPLSSADDRHNNPPLDWDERFSLSVEGVLEDKIFFGRERNKERRQCLEGAE
        MAEELQD DASN++AMDVAVDIETKI NAMRSRVS+FKE+AD         L   D           E ++L V   L  +            +C E   
Subjt:  MAEELQDNDASNEEAMDVAVDIETKIQNAMRSRVSHFKEQADPNPVVPDNPLSSADDRHNNPPLDWDERFSLSVEGVLEDKIFFGRERNKERRQCLEGAE

Query:  EENASKDSEETGRKSVSKEEAADSLEGHQSKKGVKEPCLEDEEKMEDSPVMGLLTGRKTKNVESDGIKGIKEKDDKDIPSESTIMKAIRKRISYLKANSE
        E+N SK SEETGRKSV+KEEAA+ LEGHQSKKGVKEPC EDEEKMEDSPVMGLL  R TKNVESDGIKGIK+KDDKDIPSES I KAIRKR SYLKANSE
Subjt:  EENASKDSEETGRKSVSKEEAADSLEGHQSKKGVKEPCLEDEEKMEDSPVMGLLTGRKTKNVESDGIKGIKEKDDKDIPSESTIMKAIRKRISYLKANSE

Query:  KVTMAGVRRLLEDDLKLTKNALDSCKKFISQQIEEILTSCEAAGQISNEKKGSRLKTPKKVSKESSHSTEGSS----SEEENDEVKPGKKNATKGRIPNS
        KVTMAGVRRLLEDDLKLTKNALDSCKKFISQQ+EEILTSCEAA Q+SNEK   RLKTPKKVSKESSHSTEGSS    SEEENDEVKPGKKNATKGRIPNS
Subjt:  KVTMAGVRRLLEDDLKLTKNALDSCKKFISQQIEEILTSCEAAGQISNEKKGSRLKTPKKVSKESSHSTEGSS----SEEENDEVKPGKKNATKGRIPNS

Query:  NETKKRKRSTKETVSAKKQSKHVQQTSEEDSDEGGENGSEDGQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQLI
        NETKKRKRSTKET+SAKKQSKHVQ TSEED+DEGGEN SEDGQSESS+E+PVKKEVSTPVYGK VEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQLI
Subjt:  NETKKRKRSTKETVSAKKQSKHVQQTSEEDSDEGGENGSEDGQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQLI

Query:  KELEGILSREGLSANPTEKEIKEVKKKKERAKELEGIDLSNIVSSSRRRSTTSYVAPPPKPKIPVKT--DGDDVDDTEEEEENDEEEEDDDEEEEEDGEE
        KELEGILSREGLSANPTEKEIKEVKKKKERAKELEGIDLSNIVSSSRRRS TSY  PPPKPKIPVKT  DGDD DDTEEEEE DE+EEDDD+EEEEDGEE
Subjt:  KELEGILSREGLSANPTEKEIKEVKKKKERAKELEGIDLSNIVSSSRRRSTTSYVAPPPKPKIPVKT--DGDDVDDTEEEEENDEEEEDDDEEEEEDGEE

Query:  EDNGDVDESQGE
        EDNG+VD SQ +
Subjt:  EDNGDVDESQGE

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LIS6 CHZ domain-containing protein | e value: 7.7e-185 | % identity: 79.3
Query:  MAEELQDNDASNEEAMDVAVDIETKIQNAMRSRVSHFKEQADPNPVVPDNPLSSADDRHNNPPLDWDERFSLSVEGVLEDKIFFGRERNKERRQCLEGAE
        MAEELQ ND   EE MDVAV IETKI NAMRSR+SHFKEQAD         L   D       LD  +R+   V+  L               +CLE   
Subjt:  MAEELQDNDASNEEAMDVAVDIETKIQNAMRSRVSHFKEQADPNPVVPDNPLSSADDRHNNPPLDWDERFSLSVEGVLEDKIFFGRERNKERRQCLEGAE

Query:  EENASKDSEETGRKSVSKEEAADSLEGHQSKKGVKEPCLEDEEKMEDSPVMGLLTGRKTKNVESDGIKGIKEKDDKDIPSESTIMKAIRKRISYLKANSE
        E+N SKDSE TGRKSV+KEEA +S EGHQSKKG KEPCLEDEEKMEDSPVMGLLTGR TKNVESDGIKGIK KDDKD+PSESTIMKAIRKR SYLKANSE
Subjt:  EENASKDSEETGRKSVSKEEAADSLEGHQSKKGVKEPCLEDEEKMEDSPVMGLLTGRKTKNVESDGIKGIKEKDDKDIPSESTIMKAIRKRISYLKANSE

Query:  KVTMAGVRRLLEDDLKLTKNALDSCKKFISQQIEEILTSCEAAGQISNEKKGSRLKTPKKVSKESSHSTEGSSSEEENDEVKPGKKNATKGRIPNSNETK
        KVTMAGVRRLLEDDLKLTKN LDSCKKFISQQ+EEILTSCEAA Q+SN      LK+PKK+SKESS+STEGSSSEEENDEV PGK NATKGRIP+SNETK
Subjt:  KVTMAGVRRLLEDDLKLTKNALDSCKKFISQQIEEILTSCEAAGQISNEKKGSRLKTPKKVSKESSHSTEGSSSEEENDEVKPGKKNATKGRIPNSNETK

Query:  KRKRSTKETVSAKKQSKHVQQTSEEDSDEGGENGSEDGQSESSNEKPVKKEV--STPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQLIKE
        KRKRSTK+TVSA+KQSKHVQ TS+EDSDEGG N SEDG+S SSNEKPVKKEV  STPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQLIKE
Subjt:  KRKRSTKETVSAKKQSKHVQQTSEEDSDEGGENGSEDGQSESSNEKPVKKEV--STPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQLIKE

Query:  LEGILSREGLSANPTEKEIKEVKKKKERAKELEGIDLSNIVSSSRRRSTTSYVAPPPKPKIPVKTDGDDVDDTEEEEENDEEEEDDDEEEEEDGEEEDNG
        LEGILSREGLSAN TEKEIKEVKKKKERAKELEGIDLSNIVSSSRRRSTTSYVAPPPKPKIPVKTDGDD D    EEE+DEEE    +EEEEDGEEEDNG
Subjt:  LEGILSREGLSANPTEKEIKEVKKKKERAKELEGIDLSNIVSSSRRRSTTSYVAPPPKPKIPVKTDGDDVDDTEEEEENDEEEEDDDEEEEEDGEEEDNG

Query:  DVDESQGEEFNE
        DVDESQGEEFNE
Subjt:  DVDESQGEEFNE

A0A6J1FFY5 DNA ligase 1-like isoform X1 | e value: 8.8e-189 | % identity: 80.51
Query:  MAEELQDNDASNEEAMDVAVDIETKIQNAMRSRVSHFKEQADPNPVVPDNPLSSADDRHNNPPLDWDERFSLSVEGVLEDKIFFGRERNKERRQCLEGAE
        MAEELQDNDA NEEAMDV V IETKIQNAM SRVSHFKEQAD         L   D       LD  +R+   ++  L               +CLEG E
Subjt:  MAEELQDNDASNEEAMDVAVDIETKIQNAMRSRVSHFKEQADPNPVVPDNPLSSADDRHNNPPLDWDERFSLSVEGVLEDKIFFGRERNKERRQCLEGAE

Query:  EENASKDSEETGRKSVSKEEAADSLEGHQSKKGVKEPCLEDEEKMEDSPVMGLLTGRKTKNVESDGIKGIKEKDDKDIPSESTIMKAIRKRISYLKANSE
        E+NASK SEETG KSVS+ EAA+SLEGHQSKKG KEPCLEDEEKMEDSPVMGLL G KTKNVESD IKGIK+KDDKDIP+ESTI KAIRKR  YLKANSE
Subjt:  EENASKDSEETGRKSVSKEEAADSLEGHQSKKGVKEPCLEDEEKMEDSPVMGLLTGRKTKNVESDGIKGIKEKDDKDIPSESTIMKAIRKRISYLKANSE

Query:  KVTMAGVRRLLEDDLKLTKNALDSCKKFISQQIEEILTSCEAAGQISNEKKGSRLKTPKKVSKESSHSTE-GSSSEEENDEVKPGKKNATKGRIPNSNET
        KVTMAGVRRLLEDDLKLTK ALD CKKFISQQ+EEIL SCEAA ++SNEKKGSRLKTPKKVSKESSHSTE GSSSEEE+DEVKP KKN TKGRI NSNET
Subjt:  KVTMAGVRRLLEDDLKLTKNALDSCKKFISQQIEEILTSCEAAGQISNEKKGSRLKTPKKVSKESSHSTE-GSSSEEENDEVKPGKKNATKGRIPNSNET

Query:  KKRKRSTKETVSAKKQSKHVQQTSEEDSD-EGGENGSEDGQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQLIKE
        KKRKRSTKE VSAKKQ KHVQ TSEEDSD EGGEN SEDG SESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQLIKE
Subjt:  KKRKRSTKETVSAKKQSKHVQQTSEEDSD-EGGENGSEDGQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQLIKE

Query:  LEGILSREGLSANPTEKEIKEVKKKKERAKELEGIDLSNIVSSSRRRSTTSYVAPPPKPKIPVKTDGDDVDDTEEEEENDEEEEDDDEEEEEDGEEEDNG
        LEGILSREGLSANPTEKEIK+VKKKKERAKELEGIDLSNIVSSSRRRST+SYVAPPPKPKIPVKT+GDDVDDT++EEE D++++DD++ EEED +EEDNG
Subjt:  LEGILSREGLSANPTEKEIKEVKKKKERAKELEGIDLSNIVSSSRRRSTTSYVAPPPKPKIPVKTDGDDVDDTEEEEENDEEEEDDDEEEEEDGEEEDNG

Query:  DVDESQG-EEFNE
        DVDESQG EEFNE
Subjt:  DVDESQG-EEFNE

A0A6J1FGV2 glutamic acid-rich protein-like isoform X2 | e value: 6.3e-187 | % identity: 80.12
Query:  MAEELQDNDASNEEAMDVAVDIETKIQNAMRSRVSHFKEQADPNPVVPDNPLSSADDRHNNPPLDWDERFSLSVEGVLEDKIFFGRERNKERRQCLEGAE
        MAEELQDNDA NEEAMDV V IETKIQNAM SRVSHFKEQAD         L   D       LD  +R+   ++  L               +CLEG E
Subjt:  MAEELQDNDASNEEAMDVAVDIETKIQNAMRSRVSHFKEQADPNPVVPDNPLSSADDRHNNPPLDWDERFSLSVEGVLEDKIFFGRERNKERRQCLEGAE

Query:  EENASKDSEETGRKSVSKEEAADSLEGHQSKKGVKEPCLEDEEKMEDSPVMGLLTGRKTKNVESDGIKGIKEKDDKDIPSESTIMKAIRKRISYLKANSE
        E+NASK SEETG KSVS+ EAA+SLEGHQSKKG KEPCLEDEEKMEDSPVMGLL G KTKNVESD IKGIK+KDDKDIP+ESTI KAIRKR  YLKANSE
Subjt:  EENASKDSEETGRKSVSKEEAADSLEGHQSKKGVKEPCLEDEEKMEDSPVMGLLTGRKTKNVESDGIKGIKEKDDKDIPSESTIMKAIRKRISYLKANSE

Query:  KVTMAGVRRLLEDDLKLTKNALDSCKKFISQQIEEILTSCEAAGQISNEKKGSRLKTPKKVSKESSHSTE-GSSSEEENDEVKPGKKNATKGRIPNSNET
        KVTMAGVRRLLEDDLKLTK ALD CKKFISQQ+EEIL SCEAA ++SNEKKGSRLKTPKKVSKESSHSTE GSSSEEE+DEVKP KKN TKGRI NSNET
Subjt:  KVTMAGVRRLLEDDLKLTKNALDSCKKFISQQIEEILTSCEAAGQISNEKKGSRLKTPKKVSKESSHSTE-GSSSEEENDEVKPGKKNATKGRIPNSNET

Query:  KKRKRSTKETVSAKKQSKHVQQTSEEDSD-EGGENGSEDGQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQLIKE
        KKRKRSTKE VSAKKQ KHVQ TSEEDSD EGGEN SEDG SESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQLIKE
Subjt:  KKRKRSTKETVSAKKQSKHVQQTSEEDSD-EGGENGSEDGQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQLIKE

Query:  LEGILSREGLSANPTEKEIKEVKKKKERAKELEGIDLSNIVSSSRRRSTTSYVAPPPKPKIPVKTDGDDVDDTEEEEENDEEEEDDDEEEEEDGEEEDNG
        LEGILSREGLSANPTEKEIK+VKKKKERAKELEGIDLSNIVSSSRRRST+SYVAPPPKPKIPVKT+GDDVDDT++EEE D++++DD++ EEED +EEDNG
Subjt:  LEGILSREGLSANPTEKEIKEVKKKKERAKELEGIDLSNIVSSSRRRSTTSYVAPPPKPKIPVKTDGDDVDDTEEEEENDEEEEDDDEEEEEDGEEEDNG

Query:  DVDESQGE
        DVDESQ +
Subjt:  DVDESQGE

A0A6J1JTY1 glutamic acid-rich protein-like isoform X1 | e value: 1.1e-183 | % identity: 78.95
Query:  MAEELQDNDASNEEAMDVAVDIETKIQNAMRSRVSHFKEQADPNPVVPDNPLSSADDRHNNPPLDWDERFSLSVEGVLEDKIFFGRERNKERRQCLEGAE
        MAEELQD DA NEEAMDV V IETKIQNAM SRVSHFKEQAD         L   D       LD  +R+   ++  L               +CLEG E
Subjt:  MAEELQDNDASNEEAMDVAVDIETKIQNAMRSRVSHFKEQADPNPVVPDNPLSSADDRHNNPPLDWDERFSLSVEGVLEDKIFFGRERNKERRQCLEGAE

Query:  EENASKDSEETGRKSVSKEEAADSLEGHQSKKGVKEPCLEDEEKMEDSPVMGLLTGRKTKNVESDGIKGIKEKDDKDIPSESTIMKAIRKRISYLKANSE
        E+NASK SEETG KSVS+ EAA+SLEGHQSKKG KEPCLEDEEKMEDSPVMGLL G KTKN ESD +KGIK+KDDKDIP+ESTI KAIRKR  YLKANSE
Subjt:  EENASKDSEETGRKSVSKEEAADSLEGHQSKKGVKEPCLEDEEKMEDSPVMGLLTGRKTKNVESDGIKGIKEKDDKDIPSESTIMKAIRKRISYLKANSE

Query:  KVTMAGVRRLLEDDLKLTKNALDSCKKFISQQIEEILTSCEAAGQISNEKKGSRLKTPKKVSKESSHSTE-GSSSEEENDEVKPGKKNATKGRIPNSNET
        KVTMAGVRRLLEDDLKLTK ALD CKKFISQQ+EEIL SCEAA Q+SNEKKGSRLKTPKKVSKESSHSTE GSSSEEE+DEVKP KKN TKG I NSNE 
Subjt:  KVTMAGVRRLLEDDLKLTKNALDSCKKFISQQIEEILTSCEAAGQISNEKKGSRLKTPKKVSKESSHSTE-GSSSEEENDEVKPGKKNATKGRIPNSNET

Query:  KKRKRSTKETVSAKKQSKHVQQTSEEDSDE-GGENGSEDGQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQLIKE
        KKRKRSTKE VSAKKQ KHV  T EEDSDE GGEN SEDG SESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQLIKE
Subjt:  KKRKRSTKETVSAKKQSKHVQQTSEEDSDE-GGENGSEDGQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQLIKE

Query:  LEGILSREGLSANPTEKEIKEVKKKKERAKELEGIDLSNIVSSSRRRSTTSYVAPPPKPKIPVKTDGDDVDDTEEEEENDEEEEDDDEEEEEDGEEEDNG
        LEGILSREGLS NPTEKEIK+VKKKKERAKELEGIDLSNIVSSSRRRST+SY APPPKPKIPVKT+GDDVDDT++EEE +++++DDD EEE+D  EEDNG
Subjt:  LEGILSREGLSANPTEKEIKEVKKKKERAKELEGIDLSNIVSSSRRRSTTSYVAPPPKPKIPVKTDGDDVDDTEEEEENDEEEEDDDEEEEEDGEEEDNG

Query:  DVDESQG-EEFNE
        DVDESQG EEFNE
Subjt:  DVDESQG-EEFNE

A0A6J1K3E3 glutamic acid-rich protein-like isoform X2 | e value: 8.0e-182 | % identity: 78.54
Query:  MAEELQDNDASNEEAMDVAVDIETKIQNAMRSRVSHFKEQADPNPVVPDNPLSSADDRHNNPPLDWDERFSLSVEGVLEDKIFFGRERNKERRQCLEGAE
        MAEELQD DA NEEAMDV V IETKIQNAM SRVSHFKEQAD         L   D       LD  +R+   ++  L               +CLEG E
Subjt:  MAEELQDNDASNEEAMDVAVDIETKIQNAMRSRVSHFKEQADPNPVVPDNPLSSADDRHNNPPLDWDERFSLSVEGVLEDKIFFGRERNKERRQCLEGAE

Query:  EENASKDSEETGRKSVSKEEAADSLEGHQSKKGVKEPCLEDEEKMEDSPVMGLLTGRKTKNVESDGIKGIKEKDDKDIPSESTIMKAIRKRISYLKANSE
        E+NASK SEETG KSVS+ EAA+SLEGHQSKKG KEPCLEDEEKMEDSPVMGLL G KTKN ESD +KGIK+KDDKDIP+ESTI KAIRKR  YLKANSE
Subjt:  EENASKDSEETGRKSVSKEEAADSLEGHQSKKGVKEPCLEDEEKMEDSPVMGLLTGRKTKNVESDGIKGIKEKDDKDIPSESTIMKAIRKRISYLKANSE

Query:  KVTMAGVRRLLEDDLKLTKNALDSCKKFISQQIEEILTSCEAAGQISNEKKGSRLKTPKKVSKESSHSTE-GSSSEEENDEVKPGKKNATKGRIPNSNET
        KVTMAGVRRLLEDDLKLTK ALD CKKFISQQ+EEIL SCEAA Q+SNEKKGSRLKTPKKVSKESSHSTE GSSSEEE+DEVKP KKN TKG I NSNE 
Subjt:  KVTMAGVRRLLEDDLKLTKNALDSCKKFISQQIEEILTSCEAAGQISNEKKGSRLKTPKKVSKESSHSTE-GSSSEEENDEVKPGKKNATKGRIPNSNET

Query:  KKRKRSTKETVSAKKQSKHVQQTSEEDSDE-GGENGSEDGQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQLIKE
        KKRKRSTKE VSAKKQ KHV  T EEDSDE GGEN SEDG SESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQLIKE
Subjt:  KKRKRSTKETVSAKKQSKHVQQTSEEDSDE-GGENGSEDGQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQLIKE

Query:  LEGILSREGLSANPTEKEIKEVKKKKERAKELEGIDLSNIVSSSRRRSTTSYVAPPPKPKIPVKTDGDDVDDTEEEEENDEEEEDDDEEEEEDGEEEDNG
        LEGILSREGLS NPTEKEIK+VKKKKERAKELEGIDLSNIVSSSRRRST+SY APPPKPKIPVKT+GDDVDDT++EEE +++++DDD EEE+D  EEDNG
Subjt:  LEGILSREGLSANPTEKEIKEVKKKKERAKELEGIDLSNIVSSSRRRSTTSYVAPPPKPKIPVKTDGDDVDDTEEEEENDEEEEDDDEEEEEDGEEEDNG

Query:  DVDESQGE
        DVDESQ +
Subjt:  DVDESQGE

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G44780.1 CONTAINS InterPro DOMAIN/s: Histone chaperone domain CHZ (InterPro:IPR019098) | e value: 1.2e-57 | % identity: 40.51
Query:  AEELQDNDASNEEAMDVAVDIETKIQNAMRSRVSHFKEQADPNPVVPDNPLSSADDRHNNPPLDWDERFSLSVEGVLEDKIFFGRERNKERRQCLEGAEE
        AE   ++  SN + +D A +IE KI  A+RSRV++ + +AD   +V    +   D       LD  + F      V E  +           +CLE A  
Subjt:  AEELQDNDASNEEAMDVAVDIETKIQNAMRSRVSHFKEQADPNPVVPDNPLSSADDRHNNPPLDWDERFSLSVEGVLEDKIFFGRERNKERRQCLEGAEE

Query:  ENASKDSEETGRK--SVSKEEAADSLEGHQSKKGVKEPCLEDEEKMEDSPVMGLLTGRKTKNVESDGIKGIKEKDDKDIPSESTIMKAIRKRISYLKANS
         + S++S+ET R+   +  +E A+  E H+      E   E+  K E   V G               KG KE   +D      I +A+RKR SY+KANS
Subjt:  ENASKDSEETGRK--SVSKEEAADSLEGHQSKKGVKEPCLEDEEKMEDSPVMGLLTGRKTKNVESDGIKGIKEKDDKDIPSESTIMKAIRKRISYLKANS

Query:  EKVTMAGVRRLLEDDLKLTKNALDSCKKFISQQIEEIL-----TSCEAAGQISNEKKGSRLKTPKKVSKESSHSTEGSSSEEENDEVKPGKKNATKGRIP
        E +TMA +RRLLE+DLKL K +LD  KKFI+++++E+L       C     + N KK  +  TP K+     +S   +    +N+EV   K  A K ++ 
Subjt:  EKVTMAGVRRLLEDDLKLTKNALDSCKKFISQQIEEIL-----TSCEAAGQISNEKKGSRLKTPKKVSKESSHSTEGSSSEEENDEVKPGKKNATKGRIP

Query:  NSNETKKRKRSTKETVSAKKQSKHVQQTSEEDSDEGGENGSEDGQSESSNEKPVK--KEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRE
              KRK    + VS +K++KH +  SE DSD G             +EK +K  KE +T VYGKRVEHLKSVIKSCGMSVPP+IYKK KQAP+ KRE
Subjt:  NSNETKKRKRSTKETVSAKKQSKHVQQTSEEDSDEGGENGSEDGQSESSNEKPVK--KEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRE

Query:  SQLIKELEGILSREGLSANPTEKEIKEVKKKKERAKELEGIDLSNIVSSSRRRSTTSYVAPPPKPKIPV--KTDGDDVDDTEEEEENDEEEE---DDDEE
        + LI+ELE IL++EGLS++P+  EIKEVKK+K  ++ELEGID +NIV +SRRRS+TS+ APPPKPK+    +++ D+ +D+E EEE++E+ E     +E 
Subjt:  SQLIKELEGILSREGLSANPTEKEIKEVKKKKERAKELEGIDLSNIVSSSRRRSTTSYVAPPPKPKIPV--KTDGDDVDDTEEEEENDEEEE---DDDEE

Query:  EEEDGEEEDNG
        EEE   EED+G
Subjt:  EEEDGEEEDNG

AT1G44780.2 INVOLVED IN: biological_process unknown | e value: 9.4e-58 | % identity: 40.59
Query:  AEELQDNDASNEEAMDVAVDIETKIQNAMRSRVSHFKEQADPNPVVPDNPLSSADDRHNNPPLDWDERFSLSVEGVLEDKIFFGRERNKERRQCLEGAEE
        AE   ++  SN + +D A +IE KI  A+RSRV++ + +AD   +V    +   D       LD  + F      V E  +           +CLE A  
Subjt:  AEELQDNDASNEEAMDVAVDIETKIQNAMRSRVSHFKEQADPNPVVPDNPLSSADDRHNNPPLDWDERFSLSVEGVLEDKIFFGRERNKERRQCLEGAEE

Query:  ENASKDSEETGRK--SVSKEEAADSLEGHQSKKGVKEPCLEDEEKMEDSPVMGLLTGRKTKNVESDGIKGIKEKDDKDIPSESTIMKAIRKRISYLKANS
         + S++S+ET R+   +  +E A+  E H+      E   E+  K E   V G               KG KE   +D      I +A+RKR SY+KANS
Subjt:  ENASKDSEETGRK--SVSKEEAADSLEGHQSKKGVKEPCLEDEEKMEDSPVMGLLTGRKTKNVESDGIKGIKEKDDKDIPSESTIMKAIRKRISYLKANS

Query:  EKVTMAGVRRLLEDDLKLTKNALDSCKKFISQQIEEIL-----TSCEAAGQISNEKKGSRLKTPKKVSKESSHSTEGSSSEEENDEVKPGKKNATKGRIP
        E +TMA +RRLLE+DLKL K +LD  KKFI+++++E+L       C     + N KK  +  TP K+     +S   +    +N+EV   K  A K ++ 
Subjt:  EKVTMAGVRRLLEDDLKLTKNALDSCKKFISQQIEEIL-----TSCEAAGQISNEKKGSRLKTPKKVSKESSHSTEGSSSEEENDEVKPGKKNATKGRIP

Query:  NSNETKKRKRSTKETVSAKKQSKHVQQTSEEDSDEGGENGSEDGQSESSNEKPVK-KEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRES
              KRK    + VS +K++KH +  SE DSD G             +EK +K KE +T VYGKRVEHLKSVIKSCGMSVPP+IYKK KQAP+ KRE+
Subjt:  NSNETKKRKRSTKETVSAKKQSKHVQQTSEEDSDEGGENGSEDGQSESSNEKPVK-KEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRES

Query:  QLIKELEGILSREGLSANPTEKEIKEVKKKKERAKELEGIDLSNIVSSSRRRSTTSYVAPPPKPKIPV--KTDGDDVDDTEEEEENDEEEE---DDDEEE
         LI+ELE IL++EGLS++P+  EIKEVKK+K  ++ELEGID +NIV +SRRRS+TS+ APPPKPK+    +++ D+ +D+E EEE++E+ E     +E E
Subjt:  QLIKELEGILSREGLSANPTEKEIKEVKKKKERAKELEGIDLSNIVSSSRRRSTTSYVAPPPKPKIPV--KTDGDDVDDTEEEEENDEEEE---DDDEEE

Query:  EEDGEEEDNG
        EE   EED+G
Subjt:  EEDGEEEDNG

AT4G08310.1 FUNCTIONS IN: molecular_function unknown | e value: 1.5e-71 | % identity: 43.92
Query:  LQDNDASNEEAMDVA----------------VDIETKIQNAMRSRVSHFKEQADPNPVVPDNPLSSADDRHNNPPLDWDERFSLSVEGVLEDKIFFGRER
        + D D++   AM+++                 DIE++I  AM+SRV++ +++AD         L   D +     LD  + F                  
Subjt:  LQDNDASNEEAMDVA----------------VDIETKIQNAMRSRVSHFKEQADPNPVVPDNPLSSADDRHNNPPLDWDERFSLSVEGVLEDKIFFGRER

Query:  NKERRQCLEGAEEENASKDSEETGRKS--VSKEEAADSLEGHQSKKGVKEPCLEDEEKMEDSPVMGLLTGRKTKNVESDGIKGIKEKDDKDIPSESTIMK
         +   QCL GAE +  S++S ET +K      +EAA+  + H +KK  KE    D+EK +DSPVMGLLT   T    ++  K     +DK++  +S I K
Subjt:  NKERRQCLEGAEEENASKDSEETGRKS--VSKEEAADSLEGHQSKKGVKEPCLEDEEKMEDSPVMGLLTGRKTKNVESDGIKGIKEKDDKDIPSESTIMK

Query:  AIRKRISYLKANSEKVTMAGVRRLLEDDLKLTKNALDSCKKFISQQIEEILTSCEAAGQISNEKKGSRLKTPKKVSKESSHSTE----GSSSEEENDEVK
        A+RKR SY+KANSEK+TM  +RRLLE DLKL K +LD  KKFI+ +++EIL + EA    +  ++    K  K    ++S S E        EEE+ EV 
Subjt:  AIRKRISYLKANSEKVTMAGVRRLLEDDLKLTKNALDSCKKFISQQIEEILTSCEAAGQISNEKKGSRLKTPKKVSKESSHSTE----GSSSEEENDEVK

Query:  PGKKNATKGRIPNSNETKKRKRSTKETVSAKKQSKHVQQTSEEDSDEGGENGSEDGQSESSNEKPVKK-EVSTPVYGKRVEHLKSVIKSCGMSVPPSIYK
          KK A K ++  S  T KRKR  ++  SAKK     Q  S+ DSD         G+   S+EK VKK E  T  YGKRVEHLKS+IKSCGMS+ PS+Y+
Subjt:  PGKKNATKGRIPNSNETKKRKRSTKETVSAKKQSKHVQQTSEEDSDEGGENGSEDGQSESSNEKPVKK-EVSTPVYGKRVEHLKSVIKSCGMSVPPSIYK

Query:  KVKQAPESKRESQLIKELEGILSREGLSANPTEKEIKEVKKKKERAKELEGIDLSNIVSSSRRRSTTSYVAPPPKPKIPVKTDGDDVDDTEEEEENDEEE
        K KQAPE KRE  LIKEL+ +L++EGLSANP+EKEIKEVKK+KER KELEGID SNIVSSSRRRS+ S+V PPPKP    +++ DD +D+E EE+ DEE 
Subjt:  KVKQAPESKRESQLIKELEGILSREGLSANPTEKEIKEVKKKKERAKELEGIDLSNIVSSSRRRSTTSYVAPPPKPKIPVKTDGDDVDDTEEEEENDEEE

Query:  EDDDEEEEED-GEEEDNGDVDESQGE
          ++EEEEED G  ED G+  +++GE
Subjt:  EDDDEEEEED-GEEEDNGDVDESQGE
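
In the alignments above, the unlabeled middle line appears to follow the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. A minimal sketch, assuming that convention, counts identities and positives for a single alignment block. `midline_stats` is a hypothetical helper, and the example strings are copied from the last block of the XP_038884710.1 alignment.

```python
def midline_stats(midline: str, width: int) -> tuple[int, int]:
    """Return (identities, positives) for one alignment block.

    `width` is the number of aligned columns (the length of the Query
    string); it is needed because trailing spaces in the middle line may
    have been trimmed when the page was saved as text.
    """
    padded = midline.ljust(width)[:width]
    identities = sum(ch.isalpha() for ch in padded)
    positives = identities + padded.count("+")
    return identities, positives


query = "EDNGDVDESQGE"
midline = "EDNG+VD SQ +"
identities, positives = midline_stats(midline, len(query))
print(identities, positives, len(query))  # 8 10 12
```

The % identity reported for each hit is computed over the whole alignment, so a single block will not reproduce that figure on its own.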


Sequences
CDS sequence
ATGGCGGAGGAATTACAGGACAACGATGCTTCAAACGAAGAAGCCATGGATGTAGCTGTTGATATAGAGACGAAAATTCAGAACGCTATGCGCTCTCGCGTTTCTCACTT
CAAGGAACAAGCCGACCCCAACCCTGTGGTGCCTGACAACCCCCTCTCCTCCGCCGACGACCGCCACAACAACCCTCCTCTAGACTGGGATGAACGGTTCTCTCTATCCG
TGGAAGGAGTATTGGAAGACAAGATTTTTTTTGGAAGAGAAAGAAATAAGGAGCGACGGCAATGCTTAGAAGGTGCCGAGGAAGAAAATGCCTCCAAAGATTCTGAGGAG
ACTGGGAGAAAAAGTGTAAGTAAAGAAGAAGCGGCTGACTCACTTGAAGGGCATCAGTCCAAGAAGGGTGTAAAGGAACCTTGCTTGGAAGATGAGGAGAAAATGGAAGA
CTCTCCAGTTATGGGCCTTCTCACAGGACGTAAAACAAAAAATGTTGAATCTGATGGAATCAAAGGAATCAAAGAAAAAGATGACAAAGATATTCCTAGTGAGAGTACAA
TTATGAAAGCTATTAGAAAAAGAATTTCTTATCTTAAAGCTAATTCCGAGAAAGTTACTATGGCTGGAGTTCGCCGCCTTCTGGAGGATGACCTTAAACTTACTAAAAAT
GCTCTCGACAGTTGCAAGAAGTTTATAAGCCAACAAATAGAGGAGATATTGACTTCTTGTGAAGCTGCTGGACAAATTTCTAATGAAAAGAAAGGTTCTCGTTTGAAAAC
TCCGAAAAAGGTAAGCAAAGAAAGCTCTCATTCTACTGAAGGGAGCAGTAGTGAGGAGGAAAACGATGAAGTAAAGCCTGGAAAGAAAAATGCAACTAAAGGAAGAATAC
CGAACTCTAATGAAACAAAAAAGCGGAAAAGGTCTACAAAGGAGACTGTCTCTGCCAAGAAGCAAAGCAAGCATGTCCAGCAGACATCAGAGGAGGATAGCGATGAAGGG
GGTGAAAATGGCTCTGAAGATGGCCAGTCTGAATCATCCAATGAAAAACCTGTCAAGAAGGAAGTTTCAACTCCCGTCTATGGCAAGCGTGTGGAGCACTTGAAATCGGT
TATCAAATCGTGTGGGATGAGTGTTCCTCCATCGATTTATAAGAAAGTCAAGCAGGCACCTGAAAGCAAACGTGAATCACAACTTATAAAGGAGTTGGAGGGGATACTAT
CCAGAGAGGGATTGTCTGCTAATCCCACTGAAAAAGAAATTAAGGAAGTCAAAAAGAAGAAGGAAAGGGCCAAAGAACTTGAAGGCATCGACTTAAGTAATATTGTCTCA
AGTTCACGTAGAAGATCCACGACCAGTTATGTAGCACCACCTCCAAAACCGAAAATACCAGTTAAAACTGATGGTGATGATGTTGATGATACTGAAGAGGAGGAGGAGAA
CGATGAAGAAGAAGAAGACGACGATGAAGAAGAAGAGGAGGATGGCGAGGAAGAGGATAATGGTGATGTTGATGAAAGCCAAGGTGAAGAATTCAATGAGGGTAACAAAC
CATCAACTTTTTAA
mRNA sequence
ATTTTTCCTGTCAAGCAGAGTCATTCTTGGAAAGTCCCAACACACCTTCAGTCCAGAGCCCATAATTTCTTCCTTCGCACAAAAATCGGTGCGAGTCGAAGAACAGCAAA
ATGGCGGAGGAATTACAGGACAACGATGCTTCAAACGAAGAAGCCATGGATGTAGCTGTTGATATAGAGACGAAAATTCAGAACGCTATGCGCTCTCGCGTTTCTCACTT
CAAGGAACAAGCCGACCCCAACCCTGTGGTGCCTGACAACCCCCTCTCCTCCGCCGACGACCGCCACAACAACCCTCCTCTAGACTGGGATGAACGGTTCTCTCTATCCG
TGGAAGGAGTATTGGAAGACAAGATTTTTTTTGGAAGAGAAAGAAATAAGGAGCGACGGCAATGCTTAGAAGGTGCCGAGGAAGAAAATGCCTCCAAAGATTCTGAGGAG
ACTGGGAGAAAAAGTGTAAGTAAAGAAGAAGCGGCTGACTCACTTGAAGGGCATCAGTCCAAGAAGGGTGTAAAGGAACCTTGCTTGGAAGATGAGGAGAAAATGGAAGA
CTCTCCAGTTATGGGCCTTCTCACAGGACGTAAAACAAAAAATGTTGAATCTGATGGAATCAAAGGAATCAAAGAAAAAGATGACAAAGATATTCCTAGTGAGAGTACAA
TTATGAAAGCTATTAGAAAAAGAATTTCTTATCTTAAAGCTAATTCCGAGAAAGTTACTATGGCTGGAGTTCGCCGCCTTCTGGAGGATGACCTTAAACTTACTAAAAAT
GCTCTCGACAGTTGCAAGAAGTTTATAAGCCAACAAATAGAGGAGATATTGACTTCTTGTGAAGCTGCTGGACAAATTTCTAATGAAAAGAAAGGTTCTCGTTTGAAAAC
TCCGAAAAAGGTAAGCAAAGAAAGCTCTCATTCTACTGAAGGGAGCAGTAGTGAGGAGGAAAACGATGAAGTAAAGCCTGGAAAGAAAAATGCAACTAAAGGAAGAATAC
CGAACTCTAATGAAACAAAAAAGCGGAAAAGGTCTACAAAGGAGACTGTCTCTGCCAAGAAGCAAAGCAAGCATGTCCAGCAGACATCAGAGGAGGATAGCGATGAAGGG
GGTGAAAATGGCTCTGAAGATGGCCAGTCTGAATCATCCAATGAAAAACCTGTCAAGAAGGAAGTTTCAACTCCCGTCTATGGCAAGCGTGTGGAGCACTTGAAATCGGT
TATCAAATCGTGTGGGATGAGTGTTCCTCCATCGATTTATAAGAAAGTCAAGCAGGCACCTGAAAGCAAACGTGAATCACAACTTATAAAGGAGTTGGAGGGGATACTAT
CCAGAGAGGGATTGTCTGCTAATCCCACTGAAAAAGAAATTAAGGAAGTCAAAAAGAAGAAGGAAAGGGCCAAAGAACTTGAAGGCATCGACTTAAGTAATATTGTCTCA
AGTTCACGTAGAAGATCCACGACCAGTTATGTAGCACCACCTCCAAAACCGAAAATACCAGTTAAAACTGATGGTGATGATGTTGATGATACTGAAGAGGAGGAGGAGAA
CGATGAAGAAGAAGAAGACGACGATGAAGAAGAAGAGGAGGATGGCGAGGAAGAGGATAATGGTGATGTTGATGAAAGCCAAGGTGAAGAATTCAATGAGGGTAACAAAC
CATCAACTTTTTAA
Protein sequence
MAEELQDNDASNEEAMDVAVDIETKIQNAMRSRVSHFKEQADPNPVVPDNPLSSADDRHNNPPLDWDERFSLSVEGVLEDKIFFGRERNKERRQCLEGAEEENASKDSEE
TGRKSVSKEEAADSLEGHQSKKGVKEPCLEDEEKMEDSPVMGLLTGRKTKNVESDGIKGIKEKDDKDIPSESTIMKAIRKRISYLKANSEKVTMAGVRRLLEDDLKLTKN
ALDSCKKFISQQIEEILTSCEAAGQISNEKKGSRLKTPKKVSKESSHSTEGSSSEEENDEVKPGKKNATKGRIPNSNETKKRKRSTKETVSAKKQSKHVQQTSEEDSDEG
GENGSEDGQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLSANPTEKEIKEVKKKKERAKELEGIDLSNIVS
SSRRRSTTSYVAPPPKPKIPVKTDGDDVDDTEEEEENDEEEEDDDEEEEEDGEEEDNGDVDESQGEEFNEGNKPSTF
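
The protein sequence above corresponds to the CDS read codon by codon. As a minimal sketch, assuming the standard genetic code, the snippet below translates the first 30 and the last 24 nucleotides of the CDS and reproduces the start and end of the protein. `translate` is a hypothetical helper, not a CuGenDB function.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGCGGAGGAATTACAGGACAACGATGCT"))  # MAEELQDNDA
print(translate("GGTAACAAACCATCAACTTTTTAA"))        # GNKPSTF
```

Joining the CDS lines into one string and passing it to `translate` should give the full protein, provided the line breaks are removed first.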