; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0029243 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0029243
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptionglutamic acid-rich protein-like isoform X2
Genome locationchr8:36845736..36853130
RNA-Seq ExpressionLag0029243
SyntenyLag0029243
Gene Ontology termsGO:0005634 - nucleus (cellular component)
GO:0016874 - ligase activity (molecular function)
InterPro domainsIPR019098 - Histone chaperone domain CHZ
IPR037647 - HIRA-interacting protein 3


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7016498.1 hypothetical protein SDJN02_21607 [Cucurbita argyrosperma subsp. argyrosperma]3.3e-21286.36Show/hide
Query:  MAEELQDNDAPNEEAMDVAGDIETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLCLETYALDVHKRYIKQCLVKCLEGVEEDNASKDSEETGGKGVT
        MAEELQDNDAPNEEAMDV   IETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLC+ETYALDVHKRYIKQCLVKCLEGVEEDNASK SEETGGK V+
Subjt:  MAEELQDNDAPNEEAMDVAGDIETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLCLETYALDVHKRYIKQCLVKCLEGVEEDNASKDSEETGGKGVT

Query:  REQAAESLEGHRSKKGVKEPSLEDEVKMEDSPVMGLLTGRKATNVESEGIKGIKDKDDKDIPSEITIKKAIRKRIPYLKANAEKVTMAGVRRLLEDDLKL
        R +AAESLEGH+SKKG KEP LEDE KMEDSPVMGLL G K  NVES+ IKGIKDKDDKDIP+E TIKKAIRKR PYLKAN+EKVTMAGVRRLLEDDLKL
Subjt:  REQAAESLEGHRSKKGVKEPSLEDEVKMEDSPVMGLLTGRKATNVESEGIKGIKDKDDKDIPSEITIKKAIRKRIPYLKANAEKVTMAGVRRLLEDDLKL

Query:  TKFALDSFKKSISQQVEDILNSCEAAEQVSNEKKSSRLKTPKKVMNEGSHSTE-GSSSEEENDEVKPGKKTATKGRIANSNEMKKRKRSTKETVSAKKQS
        TK+ALD  KK ISQQVE+ILNSCEAAEQVSNEKK SRLKTPKKV  E SHSTE GSSSEEE+DEVKP KK  TKGRI+NSNE KKRKRSTKE VSAKKQ 
Subjt:  TKFALDSFKKSISQQVEDILNSCEAAEQVSNEKKSSRLKTPKKVMNEGSHSTE-GSSSEEENDEVKPGKKTATKGRIANSNEMKKRKRSTKETVSAKKQS

Query:  KHAQHSSEDDGDEEGGENVSEDGQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLSANPTEK
        KH QH+SE+D DEEGGENVSEDG SESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLSANPTEK
Subjt:  KHAQHSSEDDGDEEGGENVSEDGQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLSANPTEK

Query:  EIKEVKKKKERAKELEGIDLSNIVSSSRRRSTSSYLPPPPPKPKIPVETDGDDGDDTDEEEEEDDDDEEEDEEED----DDEEDNGDVDESPG-EEFNEE
        EIK+VKKKKERAKELEGIDLSNIVSSSRRRSTSSY+  PPPKPKIPV+T+GDD DDTD+E+++DDDD+++D++ED    DDEEDNGDVDES G EEFNE+
Subjt:  EIKEVKKKKERAKELEGIDLSNIVSSSRRRSTSSYLPPPPPKPKIPVETDGDDGDDTDEEEEEDDDDEEEDEEED----DDEEDNGDVDESPG-EEFNEE

Query:  DNEDSD
        DNEDSD
Subjt:  DNEDSD

XP_022939456.1 DNA ligase 1-like isoform X1 [Cucurbita moschata]5.0e-21387.48Show/hide
Query:  MAEELQDNDAPNEEAMDVAGDIETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLCLETYALDVHKRYIKQCLVKCLEGVEEDNASKDSEETGGKGVT
        MAEELQDNDAPNEEAMDV   IETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLC+ETYALDVHKRYIKQCLVKCLEGVEEDNASK SEETGGK V+
Subjt:  MAEELQDNDAPNEEAMDVAGDIETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLCLETYALDVHKRYIKQCLVKCLEGVEEDNASKDSEETGGKGVT

Query:  REQAAESLEGHRSKKGVKEPSLEDEVKMEDSPVMGLLTGRKATNVESEGIKGIKDKDDKDIPSEITIKKAIRKRIPYLKANAEKVTMAGVRRLLEDDLKL
        R +AAESLEGH+SKKG KEP LEDE KMEDSPVMGLL G K  NVES+ IKGIKDKDDKDIP+E TIKKAIRKR PYLKAN+EKVTMAGVRRLLEDDLKL
Subjt:  REQAAESLEGHRSKKGVKEPSLEDEVKMEDSPVMGLLTGRKATNVESEGIKGIKDKDDKDIPSEITIKKAIRKRIPYLKANAEKVTMAGVRRLLEDDLKL

Query:  TKFALDSFKKSISQQVEDILNSCEAAEQVSNEKKSSRLKTPKKVMNEGSHSTE-GSSSEEENDEVKPGKKTATKGRIANSNEMKKRKRSTKETVSAKKQS
        TK+ALD  KK ISQQVE+ILNSCEAAE+VSNEKK SRLKTPKKV  E SHSTE GSSSEEE+DEVKP KK  TKGRI+NSNE KKRKRSTKE VSAKKQ 
Subjt:  TKFALDSFKKSISQQVEDILNSCEAAEQVSNEKKSSRLKTPKKVMNEGSHSTE-GSSSEEENDEVKPGKKTATKGRIANSNEMKKRKRSTKETVSAKKQS

Query:  KHAQHSSEDDGDEEGGENVSEDGQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLSANPTEK
        KH QH+SE+D DEEGGENVSEDG SESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLSANPTEK
Subjt:  KHAQHSSEDDGDEEGGENVSEDGQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLSANPTEK

Query:  EIKEVKKKKERAKELEGIDLSNIVSSSRRRSTSSYLPPPPPKPKIPVETDGDDGDDT-DEEEEEDDDDEEEDEEEDDDEEDNGDVDESPG-EEFNEEDNE
        EIK+VKKKKERAKELEGIDLSNIVSSSRRRSTSSY+  PPPKPKIPV+T+GDD DDT DEEEE+DDDD++ED EE+DDEEDNGDVDES G EEFNE+DNE
Subjt:  EIKEVKKKKERAKELEGIDLSNIVSSSRRRSTSSYLPPPPPKPKIPVETDGDDGDDT-DEEEEEDDDDEEEDEEEDDDEEDNGDVDESPG-EEFNEEDNE

Query:  DSD
        DSD
Subjt:  DSD

XP_022939457.1 glutamic acid-rich protein-like isoform X2 [Cucurbita moschata]1.2e-20986.45Show/hide
Query:  MAEELQDNDAPNEEAMDVAGDIETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLCLETYALDVHKRYIKQCLVKCLEGVEEDNASKDSEETGGKGVT
        MAEELQDNDAPNEEAMDV   IETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLC+ETYALDVHKRYIKQCLVKCLEGVEEDNASK SEETGGK V+
Subjt:  MAEELQDNDAPNEEAMDVAGDIETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLCLETYALDVHKRYIKQCLVKCLEGVEEDNASKDSEETGGKGVT

Query:  REQAAESLEGHRSKKGVKEPSLEDEVKMEDSPVMGLLTGRKATNVESEGIKGIKDKDDKDIPSEITIKKAIRKRIPYLKANAEKVTMAGVRRLLEDDLKL
        R +AAESLEGH+SKKG KEP LEDE KMEDSPVMGLL G K  NVES+ IKGIKDKDDKDIP+E TIKKAIRKR PYLKAN+EKVTMAGVRRLLEDDLKL
Subjt:  REQAAESLEGHRSKKGVKEPSLEDEVKMEDSPVMGLLTGRKATNVESEGIKGIKDKDDKDIPSEITIKKAIRKRIPYLKANAEKVTMAGVRRLLEDDLKL

Query:  TKFALDSFKKSISQQVEDILNSCEAAEQVSNEKKSSRLKTPKKVMNEGSHSTE-GSSSEEENDEVKPGKKTATKGRIANSNEMKKRKRSTKETVSAKKQS
        TK+ALD  KK ISQQVE+ILNSCEAAE+VSNEKK SRLKTPKKV  E SHSTE GSSSEEE+DEVKP KK  TKGRI+NSNE KKRKRSTKE VSAKKQ 
Subjt:  TKFALDSFKKSISQQVEDILNSCEAAEQVSNEKKSSRLKTPKKVMNEGSHSTE-GSSSEEENDEVKPGKKTATKGRIANSNEMKKRKRSTKETVSAKKQS

Query:  KHAQHSSEDDGDEEGGENVSEDGQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLSANPTEK
        KH QH+SE+D DEEGGENVSEDG SESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLSANPTEK
Subjt:  KHAQHSSEDDGDEEGGENVSEDGQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLSANPTEK

Query:  EIKEVKKKKERAKELEGIDLSNIVSSSRRRSTSSYLPPPPPKPKIPVETDGDDGDDT-DEEEEEDDDDEEEDEEEDDDEEDNGDVDESPGEEFNEEDNED
        EIK+VKKKKERAKELEGIDLSNIVSSSRRRSTSSY+  PPPKPKIPV+T+GDD DDT DEEEE+DDDD++ED EE+DDEEDNGDVDES      ++DNED
Subjt:  EIKEVKKKKERAKELEGIDLSNIVSSSRRRSTSSYLPPPPPKPKIPVETDGDDGDDT-DEEEEEDDDDEEEDEEEDDDEEDNGDVDESPGEEFNEEDNED

Query:  SD
        SD
Subjt:  SD

XP_023551365.1 glutamic acid-rich protein-like isoform X1 [Cucurbita pepo subsp. pepo]1.0e-21387.25Show/hide
Query:  MAEELQDNDAPNEEAMDVAGDIETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLCLETYALDVHKRYIKQCLVKCLEGVEEDNASKDSEETGGKGVT
        MAEELQDNDAPNEEAMDV   IETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLC+ETYALDVHKRYIKQCLVKCLEGVEEDN SK SEETGGK V+
Subjt:  MAEELQDNDAPNEEAMDVAGDIETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLCLETYALDVHKRYIKQCLVKCLEGVEEDNASKDSEETGGKGVT

Query:  REQAAESLEGHRSKKGVKEPSLEDEVKMEDSPVMGLLTGRKATNVESEGIKGIKDKDDKDIPSEITIKKAIRKRIPYLKANAEKVTMAGVRRLLEDDLKL
        R +AAESLEGH+SKKG KEP LEDE KMEDSPVMGLL G K  NVES+ +KGIKDKDDKDIP+E TIKKAIRKR PYLKAN+EKVTMAGVRRLLEDDLKL
Subjt:  REQAAESLEGHRSKKGVKEPSLEDEVKMEDSPVMGLLTGRKATNVESEGIKGIKDKDDKDIPSEITIKKAIRKRIPYLKANAEKVTMAGVRRLLEDDLKL

Query:  TKFALDSFKKSISQQVEDILNSCEAAEQVSNEKKSSRLKTPKKVMNEGSHSTE-GSSSEEENDEVKPGKKTATKGRIANSNEMKKRKRSTKETVSAKKQS
        TK+ALD  KK ISQQVE+ILNSCEAAEQVSNEKK SRLKTPKKV  E SHSTE GSSSEEE+DEVKP KK  TKGRI+NSNE KKRKRSTKE VSAKKQ 
Subjt:  TKFALDSFKKSISQQVEDILNSCEAAEQVSNEKKSSRLKTPKKVMNEGSHSTE-GSSSEEENDEVKPGKKTATKGRIANSNEMKKRKRSTKETVSAKKQS

Query:  KHAQHSSEDDGDEEGGENVSEDGQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLSANPTEK
        KH QH+SE+D DEEGGENVSEDG SESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLSANPTEK
Subjt:  KHAQHSSEDDGDEEGGENVSEDGQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLSANPTEK

Query:  EIKEVKKKKERAKELEGIDLSNIVSSSRRRSTSSYLPPPPPKPKIPVETDGDDGDDTDEEEEEDDDDEEEDEEEDDDEEDNGDVDESPG-EEFNEEDNED
        EIK+VKKKKERAKELEGIDLSNIVSSSRRRSTSSY+  PPPKPKIPV+T+GDD DDTDEEEEE++DD++ED EE+DDEEDNGDVDES G EEFNE+DNED
Subjt:  EIKEVKKKKERAKELEGIDLSNIVSSSRRRSTSSYLPPPPPKPKIPVETDGDDGDDTDEEEEEDDDDEEEDEEEDDDEEDNGDVDESPG-EEFNEEDNED

Query:  SD
        SD
Subjt:  SD

XP_023551366.1 glutamic acid-rich protein-like isoform X2 [Cucurbita pepo subsp. pepo]2.3e-21086.23Show/hide
Query:  MAEELQDNDAPNEEAMDVAGDIETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLCLETYALDVHKRYIKQCLVKCLEGVEEDNASKDSEETGGKGVT
        MAEELQDNDAPNEEAMDV   IETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLC+ETYALDVHKRYIKQCLVKCLEGVEEDN SK SEETGGK V+
Subjt:  MAEELQDNDAPNEEAMDVAGDIETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLCLETYALDVHKRYIKQCLVKCLEGVEEDNASKDSEETGGKGVT

Query:  REQAAESLEGHRSKKGVKEPSLEDEVKMEDSPVMGLLTGRKATNVESEGIKGIKDKDDKDIPSEITIKKAIRKRIPYLKANAEKVTMAGVRRLLEDDLKL
        R +AAESLEGH+SKKG KEP LEDE KMEDSPVMGLL G K  NVES+ +KGIKDKDDKDIP+E TIKKAIRKR PYLKAN+EKVTMAGVRRLLEDDLKL
Subjt:  REQAAESLEGHRSKKGVKEPSLEDEVKMEDSPVMGLLTGRKATNVESEGIKGIKDKDDKDIPSEITIKKAIRKRIPYLKANAEKVTMAGVRRLLEDDLKL

Query:  TKFALDSFKKSISQQVEDILNSCEAAEQVSNEKKSSRLKTPKKVMNEGSHSTE-GSSSEEENDEVKPGKKTATKGRIANSNEMKKRKRSTKETVSAKKQS
        TK+ALD  KK ISQQVE+ILNSCEAAEQVSNEKK SRLKTPKKV  E SHSTE GSSSEEE+DEVKP KK  TKGRI+NSNE KKRKRSTKE VSAKKQ 
Subjt:  TKFALDSFKKSISQQVEDILNSCEAAEQVSNEKKSSRLKTPKKVMNEGSHSTE-GSSSEEENDEVKPGKKTATKGRIANSNEMKKRKRSTKETVSAKKQS

Query:  KHAQHSSEDDGDEEGGENVSEDGQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLSANPTEK
        KH QH+SE+D DEEGGENVSEDG SESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLSANPTEK
Subjt:  KHAQHSSEDDGDEEGGENVSEDGQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLSANPTEK

Query:  EIKEVKKKKERAKELEGIDLSNIVSSSRRRSTSSYLPPPPPKPKIPVETDGDDGDDTDEEEEEDDDDEEEDEEEDDDEEDNGDVDESPGEEFNEEDNEDS
        EIK+VKKKKERAKELEGIDLSNIVSSSRRRSTSSY+  PPPKPKIPV+T+GDD DDTDEEEEE++DD++ED EE+DDEEDNGDVDES      ++DNEDS
Subjt:  EIKEVKKKKERAKELEGIDLSNIVSSSRRRSTSSYLPPPPPKPKIPVETDGDDGDDTDEEEEEDDDDEEEDEEEDDDEEDNGDVDESPGEEFNEEDNEDS

Query:  D
        D
Subjt:  D

TrEMBL top hitse value%identityAlignment
A0A6J1CJ49 DNA ligase 16.0e-20484.98Show/hide
Query:  MAEELQDND-APNEEAMDVAGDIETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLCLETYALDVHKRYIKQCLVKCLEGVEEDNASKDSEETGGKGV
        MAEELQDND APNE+AMD   DIE KIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLCLETYALDVHKRYIKQCLVKCLEGVEEDNASKDSEETGGK V
Subjt:  MAEELQDND-APNEEAMDVAGDIETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLCLETYALDVHKRYIKQCLVKCLEGVEEDNASKDSEETGGKGV

Query:  TREQAAESLEGHRSKKGVKEPSLEDEVKMEDSPVMGLLTGRKATNVESEGIKGIKDKDDKDIPSEITIKKAIRKRIPYLKANAEKVTMAGVRRLLEDDLK
        +RE+AA+SLEG +SKKGVKEP LEDE KMEDSPVMGLLTG+K TNV+ E   GIKD+DDKDIPSE  IKKAIRKR  YLKAN+EKVTMAGVRRLLE+DLK
Subjt:  TREQAAESLEGHRSKKGVKEPSLEDEVKMEDSPVMGLLTGRKATNVESEGIKGIKDKDDKDIPSEITIKKAIRKRIPYLKANAEKVTMAGVRRLLEDDLK

Query:  LTKFALDSFKKSISQQVEDILNSCEAAEQVSNEKKSSRLKTPKKVMNEGSHSTEGSSSEEENDEVKPGKKTATKGRIANSNEMKKRKRSTKETVSAKKQS
        LTK+ALD  KK ISQQVE+ILNSCEAAEQV NEKKSS+LKTPKKV  E SHSTEGSSSEEENDEVKP KK ATKGRI NSNE KKRKRS KETVSAKKQS
Subjt:  LTKFALDSFKKSISQQVEDILNSCEAAEQVSNEKKSSRLKTPKKVMNEGSHSTEGSSSEEENDEVKPGKKTATKGRIANSNEMKKRKRSTKETVSAKKQS

Query:  KHAQHSSEDDGDEEGGENVSEDGQSESSNEKPVK---KEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLSANP
        K+ Q +SE+D DEEGGENVSEDG+SESS+EKPVK   KEVST VYGKRVEHLKSVIKSCGMSVPP+IYKKVKQAPESKRESQLIKELEGILSREGLSANP
Subjt:  KHAQHSSEDDGDEEGGENVSEDGQSESSNEKPVK---KEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLSANP

Query:  TEKEIKEVKKKKERAKELEGIDLSNIVSSSRRRSTSSYL--PPPPPKPKIPVETDGDDGDDTDEEEEEDDDDEEEDEEEDDDEEDNGDVDESPGEEFNEE
        TEKEIK+V+KKKERAKELEGIDLSNIVSSSRRRST+SY   PPPPPKPKIPVETDG D DDTDEEE++++DDEEED   D DEEDNG  DES GEEFNE+
Subjt:  TEKEIKEVKKKKERAKELEGIDLSNIVSSSRRRSTSSYL--PPPPPKPKIPVETDGDDGDDTDEEEEEDDDDEEEDEEEDDDEEDNGDVDESPGEEFNEE

Query:  DNEDSD
        DNEDSD
Subjt:  DNEDSD

A0A6J1FFY5 DNA ligase 1-like isoform X12.4e-21387.48Show/hide
Query:  MAEELQDNDAPNEEAMDVAGDIETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLCLETYALDVHKRYIKQCLVKCLEGVEEDNASKDSEETGGKGVT
        MAEELQDNDAPNEEAMDV   IETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLC+ETYALDVHKRYIKQCLVKCLEGVEEDNASK SEETGGK V+
Subjt:  MAEELQDNDAPNEEAMDVAGDIETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLCLETYALDVHKRYIKQCLVKCLEGVEEDNASKDSEETGGKGVT

Query:  REQAAESLEGHRSKKGVKEPSLEDEVKMEDSPVMGLLTGRKATNVESEGIKGIKDKDDKDIPSEITIKKAIRKRIPYLKANAEKVTMAGVRRLLEDDLKL
        R +AAESLEGH+SKKG KEP LEDE KMEDSPVMGLL G K  NVES+ IKGIKDKDDKDIP+E TIKKAIRKR PYLKAN+EKVTMAGVRRLLEDDLKL
Subjt:  REQAAESLEGHRSKKGVKEPSLEDEVKMEDSPVMGLLTGRKATNVESEGIKGIKDKDDKDIPSEITIKKAIRKRIPYLKANAEKVTMAGVRRLLEDDLKL

Query:  TKFALDSFKKSISQQVEDILNSCEAAEQVSNEKKSSRLKTPKKVMNEGSHSTE-GSSSEEENDEVKPGKKTATKGRIANSNEMKKRKRSTKETVSAKKQS
        TK+ALD  KK ISQQVE+ILNSCEAAE+VSNEKK SRLKTPKKV  E SHSTE GSSSEEE+DEVKP KK  TKGRI+NSNE KKRKRSTKE VSAKKQ 
Subjt:  TKFALDSFKKSISQQVEDILNSCEAAEQVSNEKKSSRLKTPKKVMNEGSHSTE-GSSSEEENDEVKPGKKTATKGRIANSNEMKKRKRSTKETVSAKKQS

Query:  KHAQHSSEDDGDEEGGENVSEDGQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLSANPTEK
        KH QH+SE+D DEEGGENVSEDG SESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLSANPTEK
Subjt:  KHAQHSSEDDGDEEGGENVSEDGQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLSANPTEK

Query:  EIKEVKKKKERAKELEGIDLSNIVSSSRRRSTSSYLPPPPPKPKIPVETDGDDGDDT-DEEEEEDDDDEEEDEEEDDDEEDNGDVDESPG-EEFNEEDNE
        EIK+VKKKKERAKELEGIDLSNIVSSSRRRSTSSY+  PPPKPKIPV+T+GDD DDT DEEEE+DDDD++ED EE+DDEEDNGDVDES G EEFNE+DNE
Subjt:  EIKEVKKKKERAKELEGIDLSNIVSSSRRRSTSSYLPPPPPKPKIPVETDGDDGDDT-DEEEEEDDDDEEEDEEEDDDEEDNGDVDESPG-EEFNEEDNE

Query:  DSD
        DSD
Subjt:  DSD

A0A6J1FGV2 glutamic acid-rich protein-like isoform X25.6e-21086.45Show/hide
Query:  MAEELQDNDAPNEEAMDVAGDIETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLCLETYALDVHKRYIKQCLVKCLEGVEEDNASKDSEETGGKGVT
        MAEELQDNDAPNEEAMDV   IETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLC+ETYALDVHKRYIKQCLVKCLEGVEEDNASK SEETGGK V+
Subjt:  MAEELQDNDAPNEEAMDVAGDIETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLCLETYALDVHKRYIKQCLVKCLEGVEEDNASKDSEETGGKGVT

Query:  REQAAESLEGHRSKKGVKEPSLEDEVKMEDSPVMGLLTGRKATNVESEGIKGIKDKDDKDIPSEITIKKAIRKRIPYLKANAEKVTMAGVRRLLEDDLKL
        R +AAESLEGH+SKKG KEP LEDE KMEDSPVMGLL G K  NVES+ IKGIKDKDDKDIP+E TIKKAIRKR PYLKAN+EKVTMAGVRRLLEDDLKL
Subjt:  REQAAESLEGHRSKKGVKEPSLEDEVKMEDSPVMGLLTGRKATNVESEGIKGIKDKDDKDIPSEITIKKAIRKRIPYLKANAEKVTMAGVRRLLEDDLKL

Query:  TKFALDSFKKSISQQVEDILNSCEAAEQVSNEKKSSRLKTPKKVMNEGSHSTE-GSSSEEENDEVKPGKKTATKGRIANSNEMKKRKRSTKETVSAKKQS
        TK+ALD  KK ISQQVE+ILNSCEAAE+VSNEKK SRLKTPKKV  E SHSTE GSSSEEE+DEVKP KK  TKGRI+NSNE KKRKRSTKE VSAKKQ 
Subjt:  TKFALDSFKKSISQQVEDILNSCEAAEQVSNEKKSSRLKTPKKVMNEGSHSTE-GSSSEEENDEVKPGKKTATKGRIANSNEMKKRKRSTKETVSAKKQS

Query:  KHAQHSSEDDGDEEGGENVSEDGQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLSANPTEK
        KH QH+SE+D DEEGGENVSEDG SESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLSANPTEK
Subjt:  KHAQHSSEDDGDEEGGENVSEDGQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLSANPTEK

Query:  EIKEVKKKKERAKELEGIDLSNIVSSSRRRSTSSYLPPPPPKPKIPVETDGDDGDDT-DEEEEEDDDDEEEDEEEDDDEEDNGDVDESPGEEFNEEDNED
        EIK+VKKKKERAKELEGIDLSNIVSSSRRRSTSSY+  PPPKPKIPV+T+GDD DDT DEEEE+DDDD++ED EE+DDEEDNGDVDES      ++DNED
Subjt:  EIKEVKKKKERAKELEGIDLSNIVSSSRRRSTSSYLPPPPPKPKIPVETDGDDGDDT-DEEEEEDDDDEEEDEEEDDDEEDNGDVDESPGEEFNEEDNED

Query:  SD
        SD
Subjt:  SD

A0A6J1JTY1 glutamic acid-rich protein-like isoform X12.1e-20985.86Show/hide
Query:  MAEELQDNDAPNEEAMDVAGDIETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLCLETYALDVHKRYIKQCLVKCLEGVEEDNASKDSEETGGKGVT
        MAEELQD DAPNEEAMDV   IETKIQNAMLSRVSHFKEQADSLTFEGVRRLLE DLC+ETYALDVHKRYIKQCLVKCLEGVEEDNASK SEETGGK V+
Subjt:  MAEELQDNDAPNEEAMDVAGDIETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLCLETYALDVHKRYIKQCLVKCLEGVEEDNASKDSEETGGKGVT

Query:  REQAAESLEGHRSKKGVKEPSLEDEVKMEDSPVMGLLTGRKATNVESEGIKGIKDKDDKDIPSEITIKKAIRKRIPYLKANAEKVTMAGVRRLLEDDLKL
        R +AAESLEGH+SKKG KEP LEDE KMEDSPVMGLL G K  N ES+ +KGIKDKDDKDIP+E TIKKAIRKR PYLKAN+EKVTMAGVRRLLEDDLKL
Subjt:  REQAAESLEGHRSKKGVKEPSLEDEVKMEDSPVMGLLTGRKATNVESEGIKGIKDKDDKDIPSEITIKKAIRKRIPYLKANAEKVTMAGVRRLLEDDLKL

Query:  TKFALDSFKKSISQQVEDILNSCEAAEQVSNEKKSSRLKTPKKVMNEGSHSTE-GSSSEEENDEVKPGKKTATKGRIANSNEMKKRKRSTKETVSAKKQS
        TK+ALD  KK ISQQVE+ILNSCEAAEQVSNEKK SRLKTPKKV  E SHSTE GSSSEEE+DEVKP KK  TKG I+NSNEMKKRKRSTKE VSAKKQ 
Subjt:  TKFALDSFKKSISQQVEDILNSCEAAEQVSNEKKSSRLKTPKKVMNEGSHSTE-GSSSEEENDEVKPGKKTATKGRIANSNEMKKRKRSTKETVSAKKQS

Query:  KHAQHSSEDDGDEEGGENVSEDGQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLSANPTEK
        KH  H+ E+D DE+GGENVSEDG SESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLS NPTEK
Subjt:  KHAQHSSEDDGDEEGGENVSEDGQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLSANPTEK

Query:  EIKEVKKKKERAKELEGIDLSNIVSSSRRRSTSSYLPPPPPKPKIPVETDGDDGDDTDEEEEEDDDDEEEDEEEDDDEEDNGDVDESPG-EEFNEEDNED
        EIK+VKKKKERAKELEGIDLSNIVSSSRRRSTSSY   PPPKPKIPV+T+GDD DDTD+EEEE+DDD+++D EE+DDEEDNGDVDES G EEFNE+DNED
Subjt:  EIKEVKKKKERAKELEGIDLSNIVSSSRRRSTSSYLPPPPPKPKIPVETDGDDGDDTDEEEEEDDDDEEEDEEEDDDEEDNGDVDESPG-EEFNEEDNED

Query:  SD
        SD
Subjt:  SD

A0A6J1K3E3 glutamic acid-rich protein-like isoform X24.9e-20684.83Show/hide
Query:  MAEELQDNDAPNEEAMDVAGDIETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLCLETYALDVHKRYIKQCLVKCLEGVEEDNASKDSEETGGKGVT
        MAEELQD DAPNEEAMDV   IETKIQNAMLSRVSHFKEQADSLTFEGVRRLLE DLC+ETYALDVHKRYIKQCLVKCLEGVEEDNASK SEETGGK V+
Subjt:  MAEELQDNDAPNEEAMDVAGDIETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLCLETYALDVHKRYIKQCLVKCLEGVEEDNASKDSEETGGKGVT

Query:  REQAAESLEGHRSKKGVKEPSLEDEVKMEDSPVMGLLTGRKATNVESEGIKGIKDKDDKDIPSEITIKKAIRKRIPYLKANAEKVTMAGVRRLLEDDLKL
        R +AAESLEGH+SKKG KEP LEDE KMEDSPVMGLL G K  N ES+ +KGIKDKDDKDIP+E TIKKAIRKR PYLKAN+EKVTMAGVRRLLEDDLKL
Subjt:  REQAAESLEGHRSKKGVKEPSLEDEVKMEDSPVMGLLTGRKATNVESEGIKGIKDKDDKDIPSEITIKKAIRKRIPYLKANAEKVTMAGVRRLLEDDLKL

Query:  TKFALDSFKKSISQQVEDILNSCEAAEQVSNEKKSSRLKTPKKVMNEGSHSTE-GSSSEEENDEVKPGKKTATKGRIANSNEMKKRKRSTKETVSAKKQS
        TK+ALD  KK ISQQVE+ILNSCEAAEQVSNEKK SRLKTPKKV  E SHSTE GSSSEEE+DEVKP KK  TKG I+NSNEMKKRKRSTKE VSAKKQ 
Subjt:  TKFALDSFKKSISQQVEDILNSCEAAEQVSNEKKSSRLKTPKKVMNEGSHSTE-GSSSEEENDEVKPGKKTATKGRIANSNEMKKRKRSTKETVSAKKQS

Query:  KHAQHSSEDDGDEEGGENVSEDGQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLSANPTEK
        KH  H+ E+D DE+GGENVSEDG SESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLS NPTEK
Subjt:  KHAQHSSEDDGDEEGGENVSEDGQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLSANPTEK

Query:  EIKEVKKKKERAKELEGIDLSNIVSSSRRRSTSSYLPPPPPKPKIPVETDGDDGDDTDEEEEEDDDDEEEDEEEDDDEEDNGDVDESPGEEFNEEDNEDS
        EIK+VKKKKERAKELEGIDLSNIVSSSRRRSTSSY   PPPKPKIPV+T+GDD DDTD+EEEE+DDD+++D EE+DDEEDNGDVDES      ++DNEDS
Subjt:  EIKEVKKKKERAKELEGIDLSNIVSSSRRRSTSSYLPPPPPKPKIPVETDGDDGDDTDEEEEEDDDDEEEDEEEDDDEEDNGDVDESPGEEFNEEDNEDS

Query:  D
        D
Subjt:  D

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G44780.1 CONTAINS InterPro DOMAIN/s: Histone chaperone domain CHZ (InterPro:IPR019098)1.1e-6742.38Show/hide
Query:  AGDIETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLCLETYALDVHKRYIKQCLVKCLEGVEEDNASKDSEETGGKG--VTREQAAESLEGHRSKKG
        A +IE KI  A+ SRV++ + +AD  T   VRR+LE+D+ LE   LDV+K ++K+ LVKCLE    ++ S++S+ET  +   +  ++ AE  E H     
Subjt:  AGDIETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLCLETYALDVHKRYIKQCLVKCLEGVEEDNASKDSEETGGKG--VTREQAAESLEGHRSKKG

Query:  VKEPSLEDEVKMEDSPVMGLLTGRKATNVESEGIKGIKDKDDKDIPSEITIKKAIRKRIPYLKANAEKVTMAGVRRLLEDDLKLTKFALDSFKKSISQQV
          E   E+  K E   V G               KG K+   +D      IK+A+RKR  Y+KAN+E +TMA +RRLLE+DLKL K +LD FKK I++++
Subjt:  VKEPSLEDEVKMEDSPVMGLLTGRKATNVESEGIKGIKDKDDKDIPSEITIKKAIRKRIPYLKANAEKVTMAGVRRLLEDDLKLTKFALDSFKKSISQQV

Query:  EDILN-----SCEAAEQVSNEKKSSRLKTPKKVMNEGSHSTEGSSSEEENDEVKPGKKTATKGRIANSNEMKKRKRSTKETVSAKKQSKHAQHSSEDDGD
        +++L       C     V N KK  +  TP K+++   +S   +    +N+EV   K  A K +++    M KRK    + VS +K++KH +  SE+D  
Subjt:  EDILN-----SCEAAEQVSNEKKSSRLKTPKKVMNEGSHSTEGSSSEEENDEVKPGKKTATKGRIANSNEMKKRKRSTKETVSAKKQSKHAQHSSEDDGD

Query:  EEGGENVSEDGQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLSANPTEKEIKEVKKKKERA
               S+ G SE S ++   KE +T VYGKRVEHLKSVIKSCGMSVPP+IYKK KQAP+ KRE+ LI+ELE IL++EGLS++P+  EIKEVKK+K  +
Subjt:  EEGGENVSEDGQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLSANPTEKEIKEVKKKKERA

Query:  KELEGIDLSNIVSSSRRRSTSSYLPPPPPKPKIPVETDGDDGDDTDEEEEEDDDDEEEDEEEDDDEEDNGDVDESPGEE
        +ELEGID +NIV +SRRRS++S+ PPP PK     E++ D+ +D++ EEE ++  E   + E+ +EE N + D+  GEE
Subjt:  KELEGIDLSNIVSSSRRRSTSSYLPPPPPKPKIPVETDGDDGDDTDEEEEEDDDDEEEDEEEDDDEEDNGDVDESPGEE

AT1G44780.2 INVOLVED IN: biological_process unknown1.4e-6742.38Show/hide
Query:  AGDIETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLCLETYALDVHKRYIKQCLVKCLEGVEEDNASKDSEETGGKG--VTREQAAESLEGHRSKKG
        A +IE KI  A+ SRV++ + +AD  T   VRR+LE+D+ LE   LDV+K ++K+ LVKCLE    ++ S++S+ET  +   +  ++ AE  E H     
Subjt:  AGDIETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLCLETYALDVHKRYIKQCLVKCLEGVEEDNASKDSEETGGKG--VTREQAAESLEGHRSKKG

Query:  VKEPSLEDEVKMEDSPVMGLLTGRKATNVESEGIKGIKDKDDKDIPSEITIKKAIRKRIPYLKANAEKVTMAGVRRLLEDDLKLTKFALDSFKKSISQQV
          E   E+  K E   V G               KG K+   +D      IK+A+RKR  Y+KAN+E +TMA +RRLLE+DLKL K +LD FKK I++++
Subjt:  VKEPSLEDEVKMEDSPVMGLLTGRKATNVESEGIKGIKDKDDKDIPSEITIKKAIRKRIPYLKANAEKVTMAGVRRLLEDDLKLTKFALDSFKKSISQQV

Query:  EDILN-----SCEAAEQVSNEKKSSRLKTPKKVMNEGSHSTEGSSSEEENDEVKPGKKTATKGRIANSNEMKKRKRSTKETVSAKKQSKHAQHSSEDDGD
        +++L       C     V N KK  +  TP K+++   +S   +    +N+EV   K  A K +++    M KRK    + VS +K++KH +  SE+D  
Subjt:  EDILN-----SCEAAEQVSNEKKSSRLKTPKKVMNEGSHSTEGSSSEEENDEVKPGKKTATKGRIANSNEMKKRKRSTKETVSAKKQSKHAQHSSEDDGD

Query:  EEGGENVSEDGQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLSANPTEKEIKEVKKKKERA
               S+ G SE S +    KE +T VYGKRVEHLKSVIKSCGMSVPP+IYKK KQAP+ KRE+ LI+ELE IL++EGLS++P+  EIKEVKK+K  +
Subjt:  EEGGENVSEDGQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLSANPTEKEIKEVKKKKERA

Query:  KELEGIDLSNIVSSSRRRSTSSYLPPPPPKPKIPVETDGDDGDDTDEEEEEDDDDEEEDEEEDDDEEDNGDVDESPGEE
        +ELEGID +NIV +SRRRS++S+ PPP PK     E++ D+ +D++ EEE ++  E   + E+ +EE N + D+  GEE
Subjt:  KELEGIDLSNIVSSSRRRSTSSYLPPPPPKPKIPVETDGDDGDDTDEEEEEDDDDEEEDEEEDDDEEDNGDVDESPGEE

AT4G08310.1 FUNCTIONS IN: molecular_function unknown1.0e-8646.54Show/hide
Query:  LQDNDAPNEEAMDVAG----------------DIETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLCLETYALDVHKRYIKQCLVKCLEGVEEDNAS
        + D D+    AM+++G                DIE++I  AM SRV++ +++AD+ TFEGVRRLLE+DL LE +ALDVHK ++KQ LV+CL G E D  S
Subjt:  LQDNDAPNEEAMDVAG----------------DIETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLCLETYALDVHKRYIKQCLVKCLEGVEEDNAS

Query:  KDSEETGGKG--VTREQAAESLEGHRSKKGVKEPSLEDEVKMEDSPVMGLLTGRKATNVESEGIKGIKDKDDKDIPSEITIKKAIRKRIPYLKANAEKVT
        ++S ET  K      ++AAE  + H +KK  KE    D+ K +DSPVMGLLT     N      +  KD+D + + S+  IKKA+RKR  Y+KAN+EK+T
Subjt:  KDSEETGGKG--VTREQAAESLEGHRSKKGVKEPSLEDEVKMEDSPVMGLLTGRKATNVESEGIKGIKDKDDKDIPSEITIKKAIRKRIPYLKANAEKVT

Query:  MAGVRRLLEDDLKLTKFALDSFKKSISQQVEDILNSCEAAEQVSNEKKSSRLKTPKKVMNEGSHSTE----GSSSEEENDEVKPGKKTATKGRIANSNEM
        M  +RRLLE DLKL K++LD +KK I+ ++++IL + EA +  +  ++    K  K    + S S E        EEE+ EV   KK A K +++ S   
Subjt:  MAGVRRLLEDDLKLTKFALDSFKKSISQQVEDILNSCEAAEQVSNEKKSSRLKTPKKVMNEGSHSTE----GSSSEEENDEVKPGKKTATKGRIANSNEM

Query:  KKRKRSTKETVSAKKQSKHAQHSSEDDGDEEGGENVSEDGQSESSNEKPVKK-EVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQLIK
         KRKR  ++  SAKK     Q  S+ D D          G+   S+EK VKK E  T  YGKRVEHLKS+IKSCGMS+ PS+Y+K KQAPE KRE  LIK
Subjt:  KKRKRSTKETVSAKKQSKHAQHSSEDDGDEEGGENVSEDGQSESSNEKPVKK-EVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQLIK

Query:  ELEGILSREGLSANPTEKEIKEVKKKKERAKELEGIDLSNIVSSSRRRSTSSYLPPPPPKPKIPVETDGDDGDDTDEEEEEDDD---DEEEDEEEDDDEE
        EL+ +L++EGLSANP+EKEIKEVKK+KER KELEGID SNIVSSSRRRS++S++  PPPKP    E++ DD +D++ EE+ED++   +EEE+EE++   E
Subjt:  ELEGILSREGLSANPTEKEIKEVKKKKERAKELEGIDLSNIVSSSRRRSTSSYLPPPPPKPKIPVETDGDDGDDTDEEEEEDDD---DEEEDEEEDDDEE

Query:  DNGDVDESPGEEFNEEDNED
        D G+  ++ GE   E+  E+
Subjt:  DNGDVDESPGEEFNEEDNED


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACGTGGCTTCAAAGAGTCAACGGTTCACACAGGCTTCTAATTTCAAACAAACGGACGGCGGAGATCGAGATCAAGATTTCAGCCCATAATTTCTTCCTCGGCACAGA
GAATCGGTGCGAGTCGCAGAAGAGCAAAATGGCGGAGGAATTACAGGACAACGATGCTCCTAACGAAGAGGCCATGGATGTCGCCGGTGATATAGAGACGAAGATTCAAA
ACGCTATGCTCTCTCGCGTCTCTCACTTCAAGGAACAAGCCGACTCTTTAACTTTTGAGGGGGTTAGAAGATTGTTAGAAAAGGACTTGTGTTTGGAGACGTATGCTTTA
GATGTGCATAAAAGATATATCAAGCAGTGTTTGGTGAAGTGCTTAGAAGGCGTTGAGGAAGACAATGCTTCAAAAGATTCTGAGGAGACTGGGGGAAAAGGTGTAACTAG
AGAACAAGCGGCTGAGTCTCTTGAAGGGCATCGGTCCAAAAAAGGTGTAAAAGAACCTTCCTTGGAAGATGAGGTTAAAATGGAAGACTCTCCAGTCATGGGCCTTCTCA
CCGGACGTAAAGCAACAAATGTTGAATCTGAGGGAATCAAAGGAATCAAAGACAAAGATGACAAAGATATTCCTAGTGAGATTACAATTAAGAAAGCTATTAGAAAAAGA
ATTCCTTATCTTAAAGCTAATGCAGAGAAAGTTACTATGGCTGGAGTTCGTCGCCTTCTGGAGGATGACCTTAAACTTACTAAATTTGCTCTCGATAGTTTCAAGAAGTC
TATAAGCCAGCAAGTGGAGGATATATTAAATTCTTGTGAAGCTGCTGAACAAGTTTCTAATGAAAAGAAAAGTTCTCGTTTGAAAACTCCGAAAAAGGTAATGAATGAAG
GCTCTCATTCTACTGAAGGGAGCAGTAGTGAGGAGGAAAACGATGAAGTAAAACCTGGAAAGAAAACTGCAACTAAAGGAAGAATAGCGAACTCTAACGAAATGAAAAAG
CGAAAAAGATCTACAAAGGAGACTGTCTCTGCCAAGAAGCAGAGCAAGCATGCCCAGCATTCATCAGAGGACGATGGTGATGAAGAAGGTGGTGAAAATGTCTCTGAAGA
TGGCCAGTCTGAATCATCCAATGAAAAACCTGTTAAGAAGGAAGTTTCAACTCCTGTCTATGGCAAGCGTGTGGAGCACTTGAAATCGGTTATCAAATCATGTGGGATGA
GTGTTCCTCCATCAATTTATAAGAAAGTTAAGCAGGCACCTGAAAGCAAACGTGAATCACAACTTATAAAGGAGTTGGAGGGAATACTATCCAGAGAAGGATTGTCTGCT
AATCCCACTGAAAAAGAAATTAAAGAAGTCAAAAAGAAGAAGGAAAGGGCCAAAGAACTTGAAGGCATTGACTTGAGTAATATTGTCTCAAGTTCACGTAGGAGATCCAC
AAGCAGTTATCTACCACCACCACCTCCAAAACCGAAAATACCAGTTGAAACTGATGGTGATGATGGTGATGATACTGATGAGGAGGAGGAGGAGGACGACGACGACGAAG
AAGAGGATGAAGAGGAGGATGACGATGAAGAGGATAATGGTGATGTTGATGAAAGCCCGGGTGAAGAATTCAATGAAGAGGACAATGAAGACAGTGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGACGTGGCTTCAAAGAGTCAACGGTTCACACAGGCTTCTAATTTCAAACAAACGGACGGCGGAGATCGAGATCAAGATTTCAGCCCATAATTTCTTCCTCGGCACAGA
GAATCGGTGCGAGTCGCAGAAGAGCAAAATGGCGGAGGAATTACAGGACAACGATGCTCCTAACGAAGAGGCCATGGATGTCGCCGGTGATATAGAGACGAAGATTCAAA
ACGCTATGCTCTCTCGCGTCTCTCACTTCAAGGAACAAGCCGACTCTTTAACTTTTGAGGGGGTTAGAAGATTGTTAGAAAAGGACTTGTGTTTGGAGACGTATGCTTTA
GATGTGCATAAAAGATATATCAAGCAGTGTTTGGTGAAGTGCTTAGAAGGCGTTGAGGAAGACAATGCTTCAAAAGATTCTGAGGAGACTGGGGGAAAAGGTGTAACTAG
AGAACAAGCGGCTGAGTCTCTTGAAGGGCATCGGTCCAAAAAAGGTGTAAAAGAACCTTCCTTGGAAGATGAGGTTAAAATGGAAGACTCTCCAGTCATGGGCCTTCTCA
CCGGACGTAAAGCAACAAATGTTGAATCTGAGGGAATCAAAGGAATCAAAGACAAAGATGACAAAGATATTCCTAGTGAGATTACAATTAAGAAAGCTATTAGAAAAAGA
ATTCCTTATCTTAAAGCTAATGCAGAGAAAGTTACTATGGCTGGAGTTCGTCGCCTTCTGGAGGATGACCTTAAACTTACTAAATTTGCTCTCGATAGTTTCAAGAAGTC
TATAAGCCAGCAAGTGGAGGATATATTAAATTCTTGTGAAGCTGCTGAACAAGTTTCTAATGAAAAGAAAAGTTCTCGTTTGAAAACTCCGAAAAAGGTAATGAATGAAG
GCTCTCATTCTACTGAAGGGAGCAGTAGTGAGGAGGAAAACGATGAAGTAAAACCTGGAAAGAAAACTGCAACTAAAGGAAGAATAGCGAACTCTAACGAAATGAAAAAG
CGAAAAAGATCTACAAAGGAGACTGTCTCTGCCAAGAAGCAGAGCAAGCATGCCCAGCATTCATCAGAGGACGATGGTGATGAAGAAGGTGGTGAAAATGTCTCTGAAGA
TGGCCAGTCTGAATCATCCAATGAAAAACCTGTTAAGAAGGAAGTTTCAACTCCTGTCTATGGCAAGCGTGTGGAGCACTTGAAATCGGTTATCAAATCATGTGGGATGA
GTGTTCCTCCATCAATTTATAAGAAAGTTAAGCAGGCACCTGAAAGCAAACGTGAATCACAACTTATAAAGGAGTTGGAGGGAATACTATCCAGAGAAGGATTGTCTGCT
AATCCCACTGAAAAAGAAATTAAAGAAGTCAAAAAGAAGAAGGAAAGGGCCAAAGAACTTGAAGGCATTGACTTGAGTAATATTGTCTCAAGTTCACGTAGGAGATCCAC
AAGCAGTTATCTACCACCACCACCTCCAAAACCGAAAATACCAGTTGAAACTGATGGTGATGATGGTGATGATACTGATGAGGAGGAGGAGGAGGACGACGACGACGAAG
AAGAGGATGAAGAGGAGGATGACGATGAAGAGGATAATGGTGATGTTGATGAAAGCCCGGGTGAAGAATTCAATGAAGAGGACAATGAAGACAGTGATTGA
Protein sequenceShow/hide protein sequence
MTWLQRVNGSHRLLISNKRTAEIEIKISAHNFFLGTENRCESQKSKMAEELQDNDAPNEEAMDVAGDIETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLCLETYAL
DVHKRYIKQCLVKCLEGVEEDNASKDSEETGGKGVTREQAAESLEGHRSKKGVKEPSLEDEVKMEDSPVMGLLTGRKATNVESEGIKGIKDKDDKDIPSEITIKKAIRKR
IPYLKANAEKVTMAGVRRLLEDDLKLTKFALDSFKKSISQQVEDILNSCEAAEQVSNEKKSSRLKTPKKVMNEGSHSTEGSSSEEENDEVKPGKKTATKGRIANSNEMKK
RKRSTKETVSAKKQSKHAQHSSEDDGDEEGGENVSEDGQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLSA
NPTEKEIKEVKKKKERAKELEGIDLSNIVSSSRRRSTSSYLPPPPPKPKIPVETDGDDGDDTDEEEEEDDDDEEEDEEEDDDEEDNGDVDESPGEEFNEEDNEDSD