; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0017322 (gene) of Snake gourd v1 genome

Gene IDTan0017322
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionglutamic acid-rich protein-like isoform X2
Genome locationLG11:10994486..11000415
RNA-Seq ExpressionTan0017322
SyntenyTan0017322
Gene Ontology termsGO:0005634 - nucleus (cellular component)
GO:0016874 - ligase activity (molecular function)
InterPro domainsIPR019098 - Histone chaperone domain CHZ
IPR037647 - HIRA-interacting protein 3


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7016498.1 hypothetical protein SDJN02_21607 [Cucurbita argyrosperma subsp. argyrosperma]1.1e-21187.13Show/hide
Query:  MAEELQDNDAPNKDAMDVAGDIEAKIENAMRSRVSHFKEQADSLTFEGVRRLLEKDLCLKTYALDVHKRYVKECLVKCLEAVEEDNTSKDSEETGGKSVS
        MAEELQDNDAPN++AMDV   IE KI+NAM SRVSHFKEQADSLTFEGVRRLLEKDLC++TYALDVHKRY+K+CLVKCLE VEEDN SK SEETGGKSVS
Subjt:  MAEELQDNDAPNKDAMDVAGDIEAKIENAMRSRVSHFKEQADSLTFEGVRRLLEKDLCLKTYALDVHKRYVKECLVKCLEAVEEDNTSKDSEETGGKSVS

Query:  REEAAESLEGHQSKKGVKEPSLEDEEKMEDSPVMGLLTGRKTTNVESDGIKGNKGKDDKNILSESTIKKAIRKRTPYLKANSEKVTMAGVRRLLEDDLKL
        R EAAESLEGHQSKKG KEP LEDEEKMEDSPVMGLL G KT NVESD IKG K KDDK+I +ESTIKKAIRKRTPYLKANSEKVTMAGVRRLLEDDLKL
Subjt:  REEAAESLEGHQSKKGVKEPSLEDEEKMEDSPVMGLLTGRKTTNVESDGIKGNKGKDDKNILSESTIKKAIRKRTPYLKANSEKVTMAGVRRLLEDDLKL

Query:  TKHALDSWKKFISQQVEEILDSCEAAQQVSNEKKGSRLKIPKKVSKESSHSTE-GSSSEEENDEVKPGKKNATKGRIPNSKEMKKRKRSTKETVSAKKQS
        TK+ALD  KKFISQQVEEIL+SCEAA+QVSNEKKGSRLK PKKVSKESSHSTE GSSSEEE+DEVKP KKN TKGRI NS E KKRKRSTKE VSAKKQ 
Subjt:  TKHALDSWKKFISQQVEEILDSCEAAQQVSNEKKGSRLKIPKKVSKESSHSTE-GSSSEEENDEVKPGKKNATKGRIPNSKEMKKRKRSTKETVSAKKQS

Query:  KHVQHTSEDDSDEEGGGNVSEDAQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQGPESKRESQLIKELEGILSREGLSANPSEK
        KHVQHTSE+DSDEEGG NVSED  SESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQ PESKRESQLIKELEGILSREGLSANP+EK
Subjt:  KHVQHTSEDDSDEEGGGNVSEDAQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQGPESKRESQLIKELEGILSREGLSANPSEK

Query:  EIKEVKKKKERAKELEGIDLSNIVSSSRRRSTSSYAPPPPKPKIPVKTDGDDGDDTDEEEEEDDDDDEDDEE---EEDDDEEDNGDVDESQG-EEFNEDD
        EIK+VKKKKERAKELEGIDLSNIVSSSRRRSTSSY  PPPKPKIPVKT+GDD DDTD+E+++DDDDD+DD++   EE+DDEEDNGDVDESQG EEFNEDD
Subjt:  EIKEVKKKKERAKELEGIDLSNIVSSSRRRSTSSYAPPPPKPKIPVKTDGDDGDDTDEEEEEDDDDDEDDEE---EEDDDEEDNGDVDESQG-EEFNEDD

Query:  NEDSD
        NEDSD
Subjt:  NEDSD

XP_022939456.1 DNA ligase 1-like isoform X1 [Cucurbita moschata]4.6e-21387.85Show/hide
Query:  MAEELQDNDAPNKDAMDVAGDIEAKIENAMRSRVSHFKEQADSLTFEGVRRLLEKDLCLKTYALDVHKRYVKECLVKCLEAVEEDNTSKDSEETGGKSVS
        MAEELQDNDAPN++AMDV   IE KI+NAM SRVSHFKEQADSLTFEGVRRLLEKDLC++TYALDVHKRY+K+CLVKCLE VEEDN SK SEETGGKSVS
Subjt:  MAEELQDNDAPNKDAMDVAGDIEAKIENAMRSRVSHFKEQADSLTFEGVRRLLEKDLCLKTYALDVHKRYVKECLVKCLEAVEEDNTSKDSEETGGKSVS

Query:  REEAAESLEGHQSKKGVKEPSLEDEEKMEDSPVMGLLTGRKTTNVESDGIKGNKGKDDKNILSESTIKKAIRKRTPYLKANSEKVTMAGVRRLLEDDLKL
        R EAAESLEGHQSKKG KEP LEDEEKMEDSPVMGLL G KT NVESD IKG K KDDK+I +ESTIKKAIRKRTPYLKANSEKVTMAGVRRLLEDDLKL
Subjt:  REEAAESLEGHQSKKGVKEPSLEDEEKMEDSPVMGLLTGRKTTNVESDGIKGNKGKDDKNILSESTIKKAIRKRTPYLKANSEKVTMAGVRRLLEDDLKL

Query:  TKHALDSWKKFISQQVEEILDSCEAAQQVSNEKKGSRLKIPKKVSKESSHSTE-GSSSEEENDEVKPGKKNATKGRIPNSKEMKKRKRSTKETVSAKKQS
        TK+ALD  KKFISQQVEEIL+SCEAA++VSNEKKGSRLK PKKVSKESSHSTE GSSSEEE+DEVKP KKN TKGRI NS E KKRKRSTKE VSAKKQ 
Subjt:  TKHALDSWKKFISQQVEEILDSCEAAQQVSNEKKGSRLKIPKKVSKESSHSTE-GSSSEEENDEVKPGKKNATKGRIPNSKEMKKRKRSTKETVSAKKQS

Query:  KHVQHTSEDDSDEEGGGNVSEDAQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQGPESKRESQLIKELEGILSREGLSANPSEK
        KHVQHTSE+DSDEEGG NVSED  SESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQ PESKRESQLIKELEGILSREGLSANP+EK
Subjt:  KHVQHTSEDDSDEEGGGNVSEDAQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQGPESKRESQLIKELEGILSREGLSANPSEK

Query:  EIKEVKKKKERAKELEGIDLSNIVSSSRRRSTSSYAPPPPKPKIPVKTDGDDGDDTDEEEEEDDDDDEDDEEEEDDDEEDNGDVDESQG-EEFNEDDNED
        EIK+VKKKKERAKELEGIDLSNIVSSSRRRSTSSY  PPPKPKIPVKT+GDD DDTD+EEEEDDDDD+D++ EE+DDEEDNGDVDESQG EEFNEDDNED
Subjt:  EIKEVKKKKERAKELEGIDLSNIVSSSRRRSTSSYAPPPPKPKIPVKTDGDDGDDTDEEEEEDDDDDEDDEEEEDDDEEDNGDVDESQG-EEFNEDDNED

Query:  SD
        SD
Subjt:  SD

XP_022939457.1 glutamic acid-rich protein-like isoform X2 [Cucurbita moschata]1.4e-20986.83Show/hide
Query:  MAEELQDNDAPNKDAMDVAGDIEAKIENAMRSRVSHFKEQADSLTFEGVRRLLEKDLCLKTYALDVHKRYVKECLVKCLEAVEEDNTSKDSEETGGKSVS
        MAEELQDNDAPN++AMDV   IE KI+NAM SRVSHFKEQADSLTFEGVRRLLEKDLC++TYALDVHKRY+K+CLVKCLE VEEDN SK SEETGGKSVS
Subjt:  MAEELQDNDAPNKDAMDVAGDIEAKIENAMRSRVSHFKEQADSLTFEGVRRLLEKDLCLKTYALDVHKRYVKECLVKCLEAVEEDNTSKDSEETGGKSVS

Query:  REEAAESLEGHQSKKGVKEPSLEDEEKMEDSPVMGLLTGRKTTNVESDGIKGNKGKDDKNILSESTIKKAIRKRTPYLKANSEKVTMAGVRRLLEDDLKL
        R EAAESLEGHQSKKG KEP LEDEEKMEDSPVMGLL G KT NVESD IKG K KDDK+I +ESTIKKAIRKRTPYLKANSEKVTMAGVRRLLEDDLKL
Subjt:  REEAAESLEGHQSKKGVKEPSLEDEEKMEDSPVMGLLTGRKTTNVESDGIKGNKGKDDKNILSESTIKKAIRKRTPYLKANSEKVTMAGVRRLLEDDLKL

Query:  TKHALDSWKKFISQQVEEILDSCEAAQQVSNEKKGSRLKIPKKVSKESSHSTE-GSSSEEENDEVKPGKKNATKGRIPNSKEMKKRKRSTKETVSAKKQS
        TK+ALD  KKFISQQVEEIL+SCEAA++VSNEKKGSRLK PKKVSKESSHSTE GSSSEEE+DEVKP KKN TKGRI NS E KKRKRSTKE VSAKKQ 
Subjt:  TKHALDSWKKFISQQVEEILDSCEAAQQVSNEKKGSRLKIPKKVSKESSHSTE-GSSSEEENDEVKPGKKNATKGRIPNSKEMKKRKRSTKETVSAKKQS

Query:  KHVQHTSEDDSDEEGGGNVSEDAQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQGPESKRESQLIKELEGILSREGLSANPSEK
        KHVQHTSE+DSDEEGG NVSED  SESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQ PESKRESQLIKELEGILSREGLSANP+EK
Subjt:  KHVQHTSEDDSDEEGGGNVSEDAQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQGPESKRESQLIKELEGILSREGLSANPSEK

Query:  EIKEVKKKKERAKELEGIDLSNIVSSSRRRSTSSYAPPPPKPKIPVKTDGDDGDDTDEEEEEDDDDDEDDEEEEDDDEEDNGDVDESQGEEFNEDDNEDS
        EIK+VKKKKERAKELEGIDLSNIVSSSRRRSTSSY  PPPKPKIPVKT+GDD DDTD+EEEEDDDDD+D++ EE+DDEEDNGDVDESQ      DDNEDS
Subjt:  EIKEVKKKKERAKELEGIDLSNIVSSSRRRSTSSYAPPPPKPKIPVKTDGDDGDDTDEEEEEDDDDDEDDEEEEDDDEEDNGDVDESQGEEFNEDDNEDS

Query:  D
        D
Subjt:  D

XP_022993822.1 glutamic acid-rich protein-like isoform X1 [Cucurbita maxima]1.2e-20886.85Show/hide
Query:  MAEELQDNDAPNKDAMDVAGDIEAKIENAMRSRVSHFKEQADSLTFEGVRRLLEKDLCLKTYALDVHKRYVKECLVKCLEAVEEDNTSKDSEETGGKSVS
        MAEELQD DAPN++AMDV   IE KI+NAM SRVSHFKEQADSLTFEGVRRLLE DLC++TYALDVHKRY+K+CLVKCLE VEEDN SK SEETGGKSVS
Subjt:  MAEELQDNDAPNKDAMDVAGDIEAKIENAMRSRVSHFKEQADSLTFEGVRRLLEKDLCLKTYALDVHKRYVKECLVKCLEAVEEDNTSKDSEETGGKSVS

Query:  REEAAESLEGHQSKKGVKEPSLEDEEKMEDSPVMGLLTGRKTTNVESDGIKGNKGKDDKNILSESTIKKAIRKRTPYLKANSEKVTMAGVRRLLEDDLKL
        R EAAESLEGHQSKKG KEP LEDEEKMEDSPVMGLL G KT N ESD +KG K KDDK+I +ESTIKKAIRKRTPYLKANSEKVTMAGVRRLLEDDLKL
Subjt:  REEAAESLEGHQSKKGVKEPSLEDEEKMEDSPVMGLLTGRKTTNVESDGIKGNKGKDDKNILSESTIKKAIRKRTPYLKANSEKVTMAGVRRLLEDDLKL

Query:  TKHALDSWKKFISQQVEEILDSCEAAQQVSNEKKGSRLKIPKKVSKESSHSTE-GSSSEEENDEVKPGKKNATKGRIPNSKEMKKRKRSTKETVSAKKQS
        TK+ALD  KKFISQQVEEIL+SCEAA+QVSNEKKGSRLK PKKVSKESSHSTE GSSSEEE+DEVKP KKN TKG I NS EMKKRKRSTKE VSAKKQ 
Subjt:  TKHALDSWKKFISQQVEEILDSCEAAQQVSNEKKGSRLKIPKKVSKESSHSTE-GSSSEEENDEVKPGKKNATKGRIPNSKEMKKRKRSTKETVSAKKQS

Query:  KHVQHTSEDDSDEEGGGNVSEDAQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQGPESKRESQLIKELEGILSREGLSANPSEK
        KHV HT E+DSDE+GG NVSED  SESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQ PESKRESQLIKELEGILSREGLS NP+EK
Subjt:  KHVQHTSEDDSDEEGGGNVSEDAQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQGPESKRESQLIKELEGILSREGLSANPSEK

Query:  EIKEVKKKKERAKELEGIDLSNIVSSSRRRSTSSYAPPPPKPKIPVKTDGDDGDDTDEEEEEDDDDDEDDEEEEDDDEEDNGDVDESQG-EEFNEDDNED
        EIK+VKKKKERAKELEGIDLSNIVSSSRRRSTSSYA PPPKPKIPVKT+GDD DDTD+EEEE+DDDD+DD EEE DDEEDNGDVDESQG EEFNEDDNED
Subjt:  EIKEVKKKKERAKELEGIDLSNIVSSSRRRSTSSYAPPPPKPKIPVKTDGDDGDDTDEEEEEDDDDDEDDEEEEDDDEEDNGDVDESQG-EEFNEDDNED

Query:  SD
        SD
Subjt:  SD

XP_023551365.1 glutamic acid-rich protein-like isoform X1 [Cucurbita pepo subsp. pepo]1.5e-21187.65Show/hide
Query:  MAEELQDNDAPNKDAMDVAGDIEAKIENAMRSRVSHFKEQADSLTFEGVRRLLEKDLCLKTYALDVHKRYVKECLVKCLEAVEEDNTSKDSEETGGKSVS
        MAEELQDNDAPN++AMDV   IE KI+NAM SRVSHFKEQADSLTFEGVRRLLEKDLC++TYALDVHKRY+K+CLVKCLE VEEDN SK SEETGGKSVS
Subjt:  MAEELQDNDAPNKDAMDVAGDIEAKIENAMRSRVSHFKEQADSLTFEGVRRLLEKDLCLKTYALDVHKRYVKECLVKCLEAVEEDNTSKDSEETGGKSVS

Query:  REEAAESLEGHQSKKGVKEPSLEDEEKMEDSPVMGLLTGRKTTNVESDGIKGNKGKDDKNILSESTIKKAIRKRTPYLKANSEKVTMAGVRRLLEDDLKL
        R EAAESLEGHQSKKG KEP LEDEEKMEDSPVMGLL G KT NVESD +KG K KDDK+I +E+TIKKAIRKRTPYLKANSEKVTMAGVRRLLEDDLKL
Subjt:  REEAAESLEGHQSKKGVKEPSLEDEEKMEDSPVMGLLTGRKTTNVESDGIKGNKGKDDKNILSESTIKKAIRKRTPYLKANSEKVTMAGVRRLLEDDLKL

Query:  TKHALDSWKKFISQQVEEILDSCEAAQQVSNEKKGSRLKIPKKVSKESSHSTE-GSSSEEENDEVKPGKKNATKGRIPNSKEMKKRKRSTKETVSAKKQS
        TK+ALD  KKFISQQVEEIL+SCEAA+QVSNEKKGSRLK PKKVSKESSHSTE GSSSEEE+DEVKP KKN TKGRI NS E KKRKRSTKE VSAKKQ 
Subjt:  TKHALDSWKKFISQQVEEILDSCEAAQQVSNEKKGSRLKIPKKVSKESSHSTE-GSSSEEENDEVKPGKKNATKGRIPNSKEMKKRKRSTKETVSAKKQS

Query:  KHVQHTSEDDSDEEGGGNVSEDAQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQGPESKRESQLIKELEGILSREGLSANPSEK
        KHVQHTSE+DSDEEGG NVSED  SESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQ PESKRESQLIKELEGILSREGLSANP+EK
Subjt:  KHVQHTSEDDSDEEGGGNVSEDAQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQGPESKRESQLIKELEGILSREGLSANPSEK

Query:  EIKEVKKKKERAKELEGIDLSNIVSSSRRRSTSSYAPPPPKPKIPVKTDGDDGDDTDEEEEEDDDDDEDDEEEEDDDEEDNGDVDESQG-EEFNEDDNED
        EIK+VKKKKERAKELEGIDLSNIVSSSRRRSTSSY  PPPKPKIPVKT+GDD DDTDEEEEE++DDD++D EEE DDEEDNGDVDESQG EEFNEDDNED
Subjt:  EIKEVKKKKERAKELEGIDLSNIVSSSRRRSTSSYAPPPPKPKIPVKTDGDDGDDTDEEEEEDDDDDEDDEEEEDDDEEDNGDVDESQG-EEFNEDDNED

Query:  SD
        SD
Subjt:  SD

TrEMBL top hitse value%identityAlignment
A0A6J1CJ49 DNA ligase 13.0e-20285.21Show/hide
Query:  MAEELQDND-APNKDAMDVAGDIEAKIENAMRSRVSHFKEQADSLTFEGVRRLLEKDLCLKTYALDVHKRYVKECLVKCLEAVEEDNTSKDSEETGGKSV
        MAEELQDND APN+DAMD   DIEAKI+NAM SRVSHFKEQADSLTFEGVRRLLEKDLCL+TYALDVHKRY+K+CLVKCLE VEEDN SKDSEETGGKSV
Subjt:  MAEELQDND-APNKDAMDVAGDIEAKIENAMRSRVSHFKEQADSLTFEGVRRLLEKDLCLKTYALDVHKRYVKECLVKCLEAVEEDNTSKDSEETGGKSV

Query:  SREEAAESLEGHQSKKGVKEPSLEDEEKMEDSPVMGLLTGRKTTNVESDGIKGNKGKDDKNILSESTIKKAIRKRTPYLKANSEKVTMAGVRRLLEDDLK
        SREEAA+SLEG QSKKGVKEP LEDEEKMEDSPVMGLLTG+KTTNV+ +GI   K +DDK+I SES IKKAIRKRT YLKANSEKVTMAGVRRLLE+DLK
Subjt:  SREEAAESLEGHQSKKGVKEPSLEDEEKMEDSPVMGLLTGRKTTNVESDGIKGNKGKDDKNILSESTIKKAIRKRTPYLKANSEKVTMAGVRRLLEDDLK

Query:  LTKHALDSWKKFISQQVEEILDSCEAAQQVSNEKKGSRLKIPKKVSKESSHSTEGSSSEEENDEVKPGKKNATKGRIPNSKEMKKRKRSTKETVSAKKQS
        LTK+ALD  KK ISQQVEEIL+SCEAA+QV NEKK S+LK PKKVSKESSHSTEGSSSEEENDEVKP KKNATKGRIPNS E KKRKRS KETVSAKKQS
Subjt:  LTKHALDSWKKFISQQVEEILDSCEAAQQVSNEKKGSRLKIPKKVSKESSHSTEGSSSEEENDEVKPGKKNATKGRIPNSKEMKKRKRSTKETVSAKKQS

Query:  KHVQHTSEDDSDEEGGGNVSEDAQSESSNEKPVK---KEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQGPESKRESQLIKELEGILSREGLSANP
        K+VQ TSE+DSDEEGG NVSED +SESS+EKPVK   KEVST VYGKRVEHLKSVIKSCGMSVPP+IYKKVKQ PESKRESQLIKELEGILSREGLSANP
Subjt:  KHVQHTSEDDSDEEGGGNVSEDAQSESSNEKPVK---KEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQGPESKRESQLIKELEGILSREGLSANP

Query:  SEKEIKEVKKKKERAKELEGIDLSNIVSSSRRRSTSSYA---PPPPKPKIPVKTDGDDGDDTDEEEEEDDDDDEDDEEEEDDDEEDNGDVDESQGEEFNE
        +EKEIK+V+KKKERAKELEGIDLSNIVSSSRRRST+SYA   PPPPKPKIPV+TDG D DDTDEEE    DD+EDDEEE+D DEEDNG  DESQGEEFNE
Subjt:  SEKEIKEVKKKKERAKELEGIDLSNIVSSSRRRSTSSYA---PPPPKPKIPVKTDGDDGDDTDEEEEEDDDDDEDDEEEEDDDEEDNGDVDESQGEEFNE

Query:  DDNEDSD
        DDNEDSD
Subjt:  DDNEDSD

A0A6J1FFY5 DNA ligase 1-like isoform X12.2e-21387.85Show/hide
Query:  MAEELQDNDAPNKDAMDVAGDIEAKIENAMRSRVSHFKEQADSLTFEGVRRLLEKDLCLKTYALDVHKRYVKECLVKCLEAVEEDNTSKDSEETGGKSVS
        MAEELQDNDAPN++AMDV   IE KI+NAM SRVSHFKEQADSLTFEGVRRLLEKDLC++TYALDVHKRY+K+CLVKCLE VEEDN SK SEETGGKSVS
Subjt:  MAEELQDNDAPNKDAMDVAGDIEAKIENAMRSRVSHFKEQADSLTFEGVRRLLEKDLCLKTYALDVHKRYVKECLVKCLEAVEEDNTSKDSEETGGKSVS

Query:  REEAAESLEGHQSKKGVKEPSLEDEEKMEDSPVMGLLTGRKTTNVESDGIKGNKGKDDKNILSESTIKKAIRKRTPYLKANSEKVTMAGVRRLLEDDLKL
        R EAAESLEGHQSKKG KEP LEDEEKMEDSPVMGLL G KT NVESD IKG K KDDK+I +ESTIKKAIRKRTPYLKANSEKVTMAGVRRLLEDDLKL
Subjt:  REEAAESLEGHQSKKGVKEPSLEDEEKMEDSPVMGLLTGRKTTNVESDGIKGNKGKDDKNILSESTIKKAIRKRTPYLKANSEKVTMAGVRRLLEDDLKL

Query:  TKHALDSWKKFISQQVEEILDSCEAAQQVSNEKKGSRLKIPKKVSKESSHSTE-GSSSEEENDEVKPGKKNATKGRIPNSKEMKKRKRSTKETVSAKKQS
        TK+ALD  KKFISQQVEEIL+SCEAA++VSNEKKGSRLK PKKVSKESSHSTE GSSSEEE+DEVKP KKN TKGRI NS E KKRKRSTKE VSAKKQ 
Subjt:  TKHALDSWKKFISQQVEEILDSCEAAQQVSNEKKGSRLKIPKKVSKESSHSTE-GSSSEEENDEVKPGKKNATKGRIPNSKEMKKRKRSTKETVSAKKQS

Query:  KHVQHTSEDDSDEEGGGNVSEDAQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQGPESKRESQLIKELEGILSREGLSANPSEK
        KHVQHTSE+DSDEEGG NVSED  SESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQ PESKRESQLIKELEGILSREGLSANP+EK
Subjt:  KHVQHTSEDDSDEEGGGNVSEDAQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQGPESKRESQLIKELEGILSREGLSANPSEK

Query:  EIKEVKKKKERAKELEGIDLSNIVSSSRRRSTSSYAPPPPKPKIPVKTDGDDGDDTDEEEEEDDDDDEDDEEEEDDDEEDNGDVDESQG-EEFNEDDNED
        EIK+VKKKKERAKELEGIDLSNIVSSSRRRSTSSY  PPPKPKIPVKT+GDD DDTD+EEEEDDDDD+D++ EE+DDEEDNGDVDESQG EEFNEDDNED
Subjt:  EIKEVKKKKERAKELEGIDLSNIVSSSRRRSTSSYAPPPPKPKIPVKTDGDDGDDTDEEEEEDDDDDEDDEEEEDDDEEDNGDVDESQG-EEFNEDDNED

Query:  SD
        SD
Subjt:  SD

A0A6J1FGV2 glutamic acid-rich protein-like isoform X26.7e-21086.83Show/hide
Query:  MAEELQDNDAPNKDAMDVAGDIEAKIENAMRSRVSHFKEQADSLTFEGVRRLLEKDLCLKTYALDVHKRYVKECLVKCLEAVEEDNTSKDSEETGGKSVS
        MAEELQDNDAPN++AMDV   IE KI+NAM SRVSHFKEQADSLTFEGVRRLLEKDLC++TYALDVHKRY+K+CLVKCLE VEEDN SK SEETGGKSVS
Subjt:  MAEELQDNDAPNKDAMDVAGDIEAKIENAMRSRVSHFKEQADSLTFEGVRRLLEKDLCLKTYALDVHKRYVKECLVKCLEAVEEDNTSKDSEETGGKSVS

Query:  REEAAESLEGHQSKKGVKEPSLEDEEKMEDSPVMGLLTGRKTTNVESDGIKGNKGKDDKNILSESTIKKAIRKRTPYLKANSEKVTMAGVRRLLEDDLKL
        R EAAESLEGHQSKKG KEP LEDEEKMEDSPVMGLL G KT NVESD IKG K KDDK+I +ESTIKKAIRKRTPYLKANSEKVTMAGVRRLLEDDLKL
Subjt:  REEAAESLEGHQSKKGVKEPSLEDEEKMEDSPVMGLLTGRKTTNVESDGIKGNKGKDDKNILSESTIKKAIRKRTPYLKANSEKVTMAGVRRLLEDDLKL

Query:  TKHALDSWKKFISQQVEEILDSCEAAQQVSNEKKGSRLKIPKKVSKESSHSTE-GSSSEEENDEVKPGKKNATKGRIPNSKEMKKRKRSTKETVSAKKQS
        TK+ALD  KKFISQQVEEIL+SCEAA++VSNEKKGSRLK PKKVSKESSHSTE GSSSEEE+DEVKP KKN TKGRI NS E KKRKRSTKE VSAKKQ 
Subjt:  TKHALDSWKKFISQQVEEILDSCEAAQQVSNEKKGSRLKIPKKVSKESSHSTE-GSSSEEENDEVKPGKKNATKGRIPNSKEMKKRKRSTKETVSAKKQS

Query:  KHVQHTSEDDSDEEGGGNVSEDAQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQGPESKRESQLIKELEGILSREGLSANPSEK
        KHVQHTSE+DSDEEGG NVSED  SESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQ PESKRESQLIKELEGILSREGLSANP+EK
Subjt:  KHVQHTSEDDSDEEGGGNVSEDAQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQGPESKRESQLIKELEGILSREGLSANPSEK

Query:  EIKEVKKKKERAKELEGIDLSNIVSSSRRRSTSSYAPPPPKPKIPVKTDGDDGDDTDEEEEEDDDDDEDDEEEEDDDEEDNGDVDESQGEEFNEDDNEDS
        EIK+VKKKKERAKELEGIDLSNIVSSSRRRSTSSY  PPPKPKIPVKT+GDD DDTD+EEEEDDDDD+D++ EE+DDEEDNGDVDESQ      DDNEDS
Subjt:  EIKEVKKKKERAKELEGIDLSNIVSSSRRRSTSSYAPPPPKPKIPVKTDGDDGDDTDEEEEEDDDDDEDDEEEEDDDEEDNGDVDESQGEEFNEDDNEDS

Query:  D
        D
Subjt:  D

A0A6J1JTY1 glutamic acid-rich protein-like isoform X15.7e-20986.85Show/hide
Query:  MAEELQDNDAPNKDAMDVAGDIEAKIENAMRSRVSHFKEQADSLTFEGVRRLLEKDLCLKTYALDVHKRYVKECLVKCLEAVEEDNTSKDSEETGGKSVS
        MAEELQD DAPN++AMDV   IE KI+NAM SRVSHFKEQADSLTFEGVRRLLE DLC++TYALDVHKRY+K+CLVKCLE VEEDN SK SEETGGKSVS
Subjt:  MAEELQDNDAPNKDAMDVAGDIEAKIENAMRSRVSHFKEQADSLTFEGVRRLLEKDLCLKTYALDVHKRYVKECLVKCLEAVEEDNTSKDSEETGGKSVS

Query:  REEAAESLEGHQSKKGVKEPSLEDEEKMEDSPVMGLLTGRKTTNVESDGIKGNKGKDDKNILSESTIKKAIRKRTPYLKANSEKVTMAGVRRLLEDDLKL
        R EAAESLEGHQSKKG KEP LEDEEKMEDSPVMGLL G KT N ESD +KG K KDDK+I +ESTIKKAIRKRTPYLKANSEKVTMAGVRRLLEDDLKL
Subjt:  REEAAESLEGHQSKKGVKEPSLEDEEKMEDSPVMGLLTGRKTTNVESDGIKGNKGKDDKNILSESTIKKAIRKRTPYLKANSEKVTMAGVRRLLEDDLKL

Query:  TKHALDSWKKFISQQVEEILDSCEAAQQVSNEKKGSRLKIPKKVSKESSHSTE-GSSSEEENDEVKPGKKNATKGRIPNSKEMKKRKRSTKETVSAKKQS
        TK+ALD  KKFISQQVEEIL+SCEAA+QVSNEKKGSRLK PKKVSKESSHSTE GSSSEEE+DEVKP KKN TKG I NS EMKKRKRSTKE VSAKKQ 
Subjt:  TKHALDSWKKFISQQVEEILDSCEAAQQVSNEKKGSRLKIPKKVSKESSHSTE-GSSSEEENDEVKPGKKNATKGRIPNSKEMKKRKRSTKETVSAKKQS

Query:  KHVQHTSEDDSDEEGGGNVSEDAQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQGPESKRESQLIKELEGILSREGLSANPSEK
        KHV HT E+DSDE+GG NVSED  SESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQ PESKRESQLIKELEGILSREGLS NP+EK
Subjt:  KHVQHTSEDDSDEEGGGNVSEDAQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQGPESKRESQLIKELEGILSREGLSANPSEK

Query:  EIKEVKKKKERAKELEGIDLSNIVSSSRRRSTSSYAPPPPKPKIPVKTDGDDGDDTDEEEEEDDDDDEDDEEEEDDDEEDNGDVDESQG-EEFNEDDNED
        EIK+VKKKKERAKELEGIDLSNIVSSSRRRSTSSYA PPPKPKIPVKT+GDD DDTD+EEEE+DDDD+DD EEE DDEEDNGDVDESQG EEFNEDDNED
Subjt:  EIKEVKKKKERAKELEGIDLSNIVSSSRRRSTSSYAPPPPKPKIPVKTDGDDGDDTDEEEEEDDDDDEDDEEEEDDDEEDNGDVDESQG-EEFNEDDNED

Query:  SD
        SD
Subjt:  SD

A0A6J1K3E3 glutamic acid-rich protein-like isoform X21.7e-20585.83Show/hide
Query:  MAEELQDNDAPNKDAMDVAGDIEAKIENAMRSRVSHFKEQADSLTFEGVRRLLEKDLCLKTYALDVHKRYVKECLVKCLEAVEEDNTSKDSEETGGKSVS
        MAEELQD DAPN++AMDV   IE KI+NAM SRVSHFKEQADSLTFEGVRRLLE DLC++TYALDVHKRY+K+CLVKCLE VEEDN SK SEETGGKSVS
Subjt:  MAEELQDNDAPNKDAMDVAGDIEAKIENAMRSRVSHFKEQADSLTFEGVRRLLEKDLCLKTYALDVHKRYVKECLVKCLEAVEEDNTSKDSEETGGKSVS

Query:  REEAAESLEGHQSKKGVKEPSLEDEEKMEDSPVMGLLTGRKTTNVESDGIKGNKGKDDKNILSESTIKKAIRKRTPYLKANSEKVTMAGVRRLLEDDLKL
        R EAAESLEGHQSKKG KEP LEDEEKMEDSPVMGLL G KT N ESD +KG K KDDK+I +ESTIKKAIRKRTPYLKANSEKVTMAGVRRLLEDDLKL
Subjt:  REEAAESLEGHQSKKGVKEPSLEDEEKMEDSPVMGLLTGRKTTNVESDGIKGNKGKDDKNILSESTIKKAIRKRTPYLKANSEKVTMAGVRRLLEDDLKL

Query:  TKHALDSWKKFISQQVEEILDSCEAAQQVSNEKKGSRLKIPKKVSKESSHSTE-GSSSEEENDEVKPGKKNATKGRIPNSKEMKKRKRSTKETVSAKKQS
        TK+ALD  KKFISQQVEEIL+SCEAA+QVSNEKKGSRLK PKKVSKESSHSTE GSSSEEE+DEVKP KKN TKG I NS EMKKRKRSTKE VSAKKQ 
Subjt:  TKHALDSWKKFISQQVEEILDSCEAAQQVSNEKKGSRLKIPKKVSKESSHSTE-GSSSEEENDEVKPGKKNATKGRIPNSKEMKKRKRSTKETVSAKKQS

Query:  KHVQHTSEDDSDEEGGGNVSEDAQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQGPESKRESQLIKELEGILSREGLSANPSEK
        KHV HT E+DSDE+GG NVSED  SESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQ PESKRESQLIKELEGILSREGLS NP+EK
Subjt:  KHVQHTSEDDSDEEGGGNVSEDAQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQGPESKRESQLIKELEGILSREGLSANPSEK

Query:  EIKEVKKKKERAKELEGIDLSNIVSSSRRRSTSSYAPPPPKPKIPVKTDGDDGDDTDEEEEEDDDDDEDDEEEEDDDEEDNGDVDESQGEEFNEDDNEDS
        EIK+VKKKKERAKELEGIDLSNIVSSSRRRSTSSYA PPPKPKIPVKT+GDD DDTD+EEEE+DDDD+DD EEE DDEEDNGDVDESQ      DDNEDS
Subjt:  EIKEVKKKKERAKELEGIDLSNIVSSSRRRSTSSYAPPPPKPKIPVKTDGDDGDDTDEEEEEDDDDDEDDEEEEDDDEEDNGDVDESQGEEFNEDDNEDS

Query:  D
        D
Subjt:  D

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G44780.1 CONTAINS InterPro DOMAIN/s: Histone chaperone domain CHZ (InterPro:IPR019098)5.7e-6843.81Show/hide
Query:  AGDIEAKIENAMRSRVSHFKEQADSLTFEGVRRLLEKDLCLKTYALDVHKRYVKECLVKCLEAVEEDNTSKDSEETGGK--SVSREEAAESLEGHQSKKG
        A +IE KI  A+RSRV++ + +AD  T   VRR+LE+D+ L+   LDV+K +VKE LVKCLE    ++TS++S+ET  +   +  +E AE  E H+    
Subjt:  AGDIEAKIENAMRSRVSHFKEQADSLTFEGVRRLLEKDLCLKTYALDVHKRYVKECLVKCLEAVEEDNTSKDSEETGGK--SVSREEAAESLEGHQSKKG

Query:  VKEPSLEDEEKMEDSPVMGLLTGRKTTNVESDGIKGNKGKDDKNILSESTIKKAIRKRTPYLKANSEKVTMAGVRRLLEDDLKLTKHALDSWKKFISQQV
          E   E+  K E   V                    KGK +K  L +  IK+A+RKR  Y+KANSE +TMA +RRLLE+DLKL K +LD +KKFI++++
Subjt:  VKEPSLEDEEKMEDSPVMGLLTGRKTTNVESDGIKGNKGKDDKNILSESTIKKAIRKRTPYLKANSEKVTMAGVRRLLEDDLKLTKHALDSWKKFISQQV

Query:  EEILD-----SCEAAQQVSNEKKGSRLKIPKKVSKE--SSHSTEGSSSEEENDEVKPGKKNATKGRIPNSKEMKKRKRSTKETVSAKKQSKHVQHTSEDD
        +E+L       C     V N KK  +    K VS E  S   TEG+    +N+EV   K  A K ++   + M KRK    + VS +K++KH +  SE+D
Subjt:  EEILD-----SCEAAQQVSNEKKGSRLKIPKKVSKE--SSHSTEGSSSEEENDEVKPGKKNATKGRIPNSKEMKKRKRSTKETVSAKKQSKHVQHTSEDD

Query:  SDEEGGGNVSEDAQSESSNEKPVK--KEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQGPESKRESQLIKELEGILSREGLSANPSEKEIKEVKKK
        SD                +EK +K  KE +T VYGKRVEHLKSVIKSCGMSVPP+IYKK KQ P+ KRE+ LI+ELE IL++EGLS++PS  EIKEVKK+
Subjt:  SDEEGGGNVSEDAQSESSNEKPVK--KEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQGPESKRESQLIKELEGILSREGLSANPSEKEIKEVKKK

Query:  KERAKELEGIDLSNIVSSSRRRSTSSYAPPPPKPKIPVKTDGDDGDDTDEEEEEDDDDDEDDEEEEDDDEEDNGDVDESQGEEFNEDDNEDSD
        K  ++ELEGID +NIV +SRRRS++S+A PPPKPK+           T E E E D+ ++ + EEE +++ + G    SQ EE  E+ N + D
Subjt:  KERAKELEGIDLSNIVSSSRRRSTSSYAPPPPKPKIPVKTDGDDGDDTDEEEEEDDDDDEDDEEEEDDDEEDNGDVDESQGEEFNEDDNEDSD

AT1G44780.2 INVOLVED IN: biological_process unknown4.3e-6843.9Show/hide
Query:  AGDIEAKIENAMRSRVSHFKEQADSLTFEGVRRLLEKDLCLKTYALDVHKRYVKECLVKCLEAVEEDNTSKDSEETGGK--SVSREEAAESLEGHQSKKG
        A +IE KI  A+RSRV++ + +AD  T   VRR+LE+D+ L+   LDV+K +VKE LVKCLE    ++TS++S+ET  +   +  +E AE  E H+    
Subjt:  AGDIEAKIENAMRSRVSHFKEQADSLTFEGVRRLLEKDLCLKTYALDVHKRYVKECLVKCLEAVEEDNTSKDSEETGGK--SVSREEAAESLEGHQSKKG

Query:  VKEPSLEDEEKMEDSPVMGLLTGRKTTNVESDGIKGNKGKDDKNILSESTIKKAIRKRTPYLKANSEKVTMAGVRRLLEDDLKLTKHALDSWKKFISQQV
          E   E+  K E   V                    KGK +K  L +  IK+A+RKR  Y+KANSE +TMA +RRLLE+DLKL K +LD +KKFI++++
Subjt:  VKEPSLEDEEKMEDSPVMGLLTGRKTTNVESDGIKGNKGKDDKNILSESTIKKAIRKRTPYLKANSEKVTMAGVRRLLEDDLKLTKHALDSWKKFISQQV

Query:  EEILD-----SCEAAQQVSNEKKGSRLKIPKKVSKE--SSHSTEGSSSEEENDEVKPGKKNATKGRIPNSKEMKKRKRSTKETVSAKKQSKHVQHTSEDD
        +E+L       C     V N KK  +    K VS E  S   TEG+    +N+EV   K  A K ++   + M KRK    + VS +K++KH +  SE+D
Subjt:  EEILD-----SCEAAQQVSNEKKGSRLKIPKKVSKE--SSHSTEGSSSEEENDEVKPGKKNATKGRIPNSKEMKKRKRSTKETVSAKKQSKHVQHTSEDD

Query:  SDEEGGGNVSEDAQSESSNEKPVK-KEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQGPESKRESQLIKELEGILSREGLSANPSEKEIKEVKKKK
        SD                +EK +K KE +T VYGKRVEHLKSVIKSCGMSVPP+IYKK KQ P+ KRE+ LI+ELE IL++EGLS++PS  EIKEVKK+K
Subjt:  SDEEGGGNVSEDAQSESSNEKPVK-KEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQGPESKRESQLIKELEGILSREGLSANPSEKEIKEVKKKK

Query:  ERAKELEGIDLSNIVSSSRRRSTSSYAPPPPKPKIPVKTDGDDGDDTDEEEEEDDDDDEDDEEEEDDDEEDNGDVDESQGEEFNEDDNEDSD
          ++ELEGID +NIV +SRRRS++S+A PPPKPK+           T E E E D+ ++ + EEE +++ + G    SQ EE  E+ N + D
Subjt:  ERAKELEGIDLSNIVSSSRRRSTSSYAPPPPKPKIPVKTDGDDGDDTDEEEEEDDDDDEDDEEEEDDDEEDNGDVDESQGEEFNEDDNEDSD

AT4G08310.1 FUNCTIONS IN: molecular_function unknown9.3e-8747.59Show/hide
Query:  LQDNDAPNKDAMDVAG----------------DIEAKIENAMRSRVSHFKEQADSLTFEGVRRLLEKDLCLKTYALDVHKRYVKECLVKCLEAVEEDNTS
        + D D+    AM+++G                DIE++I  AM+SRV++ +++AD+ TFEGVRRLLE+DL L+ +ALDVHK +VK+ LV+CL   E D TS
Subjt:  LQDNDAPNKDAMDVAG----------------DIEAKIENAMRSRVSHFKEQADSLTFEGVRRLLEKDLCLKTYALDVHKRYVKECLVKCLEAVEEDNTS

Query:  KDSEETGGKS--VSREEAAESLEGHQSKKGVKEPSLEDEEKMEDSPVMGLLTGRKTTNVESDGIKGNKGKDDKNILSESTIKKAIRKRTPYLKANSEKVT
        ++S ET  K      +EAAE  + H +KK  KE    D+EK +DSPVMGLLT   T+   ++  K     +DK +L +S IKKA+RKR+ Y+KANSEK+T
Subjt:  KDSEETGGKS--VSREEAAESLEGHQSKKGVKEPSLEDEEKMEDSPVMGLLTGRKTTNVESDGIKGNKGKDDKNILSESTIKKAIRKRTPYLKANSEKVT

Query:  MAGVRRLLEDDLKLTKHALDSWKKFISQQVEEILDSCEAAQQVSNEKKGSRLKIPKKVSKESSHSTE----GSSSEEENDEVKPGKKNATKGRIPNSKEM
        M  +RRLLE DLKL K++LD +KKFI+ +++EIL + EA Q  +  ++    K  K    ++S S E        EEE+ EV   KK A K ++  S+  
Subjt:  MAGVRRLLEDDLKLTKHALDSWKKFISQQVEEILDSCEAAQQVSNEKKGSRLKIPKKVSKESSHSTE----GSSSEEENDEVKPGKKNATKGRIPNSKEM

Query:  KKRKRSTKETVSAKKQSKHVQHTSEDDSDEEGGGNVSEDAQSESSNEKPVKK-EVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQGPESKRESQLIK
         KRKR  ++  SAKK       T + DS  +     S+  +   S+EK VKK E  T  YGKRVEHLKS+IKSCGMS+ PS+Y+K KQ PE KRE  LIK
Subjt:  KKRKRSTKETVSAKKQSKHVQHTSEDDSDEEGGGNVSEDAQSESSNEKPVKK-EVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQGPESKRESQLIK

Query:  ELEGILSREGLSANPSEKEIKEVKKKKERAKELEGIDLSNIVSSSRRRSTSSYAPPPPKPKIPVKTDGDDGDDTDEEEEEDDDDDEDDEEEEDDD--EED
        EL+ +L++EGLSANPSEKEIKEVKK+KER KELEGID SNIVSSSRRRS++S+  PPPKP    +++ DD +D++ EE+ED++   ++EEEE+D+   ED
Subjt:  ELEGILSREGLSANPSEKEIKEVKKKKERAKELEGIDLSNIVSSSRRRSTSSYAPPPPKPKIPVKTDGDDGDDTDEEEEEDDDDDEDDEEEEDDD--EED

Query:  NGDVDESQGEEFNEDDNED
         G+  +++GE   ED  E+
Subjt:  NGDVDESQGEEFNEDDNED


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGAGGAATTACAGGACAACGATGCTCCGAACAAGGACGCCATGGATGTAGCCGGTGATATAGAGGCCAAGATTGAGAACGCTATGCGCTCCCGCGTCTCTCACTT
CAAGGAACAAGCCGACTCTTTAACTTTTGAGGGGGTTAGAAGACTGTTAGAAAAGGACTTGTGTTTGAAGACATATGCTTTAGATGTGCATAAAAGATATGTCAAGGAGT
GTTTGGTGAAGTGCTTAGAAGCTGTTGAGGAAGACAATACCTCAAAGGATTCTGAGGAGACTGGGGGGAAAAGTGTAAGTAGAGAAGAAGCGGCTGAGTCACTTGAAGGG
CATCAGTCCAAGAAAGGTGTAAAGGAACCTAGCTTAGAAGATGAGGAGAAAATGGAAGACTCTCCAGTTATGGGCCTTCTCACAGGACGTAAAACAACAAATGTTGAATC
TGACGGAATCAAAGGAAACAAAGGCAAAGATGACAAAAATATTCTTAGTGAGAGTACAATTAAGAAAGCTATTAGAAAAAGAACTCCTTATCTTAAAGCTAATTCGGAGA
AAGTTACTATGGCTGGAGTTCGCCGCCTTCTGGAGGATGACCTTAAACTTACAAAACATGCTCTCGATAGTTGGAAGAAGTTTATAAGCCAGCAAGTAGAGGAGATATTG
GATTCTTGTGAAGCTGCTCAACAAGTTTCTAATGAAAAGAAAGGTTCTCGTTTGAAAATTCCAAAAAAGGTAAGCAAAGAAAGCTCTCATTCTACAGAAGGGAGCAGTAG
TGAGGAGGAAAACGATGAAGTTAAACCTGGAAAGAAAAATGCAACTAAAGGAAGAATACCGAACTCTAAAGAAATGAAAAAGCGGAAAAGATCTACAAAGGAGACTGTCT
CTGCCAAGAAGCAAAGCAAGCATGTCCAGCATACATCAGAGGACGATAGTGATGAAGAAGGTGGTGGAAATGTCTCTGAAGATGCCCAGTCTGAATCATCCAATGAAAAA
CCTGTTAAGAAGGAAGTTTCAACTCCTGTCTATGGAAAGCGTGTGGAGCACTTGAAATCGGTTATCAAATCGTGTGGGATGAGTGTTCCTCCATCGATTTATAAGAAAGT
CAAGCAGGGACCTGAAAGCAAACGTGAATCACAACTTATAAAGGAGTTGGAGGGGATACTATCCAGAGAAGGATTGTCTGCTAATCCCTCTGAAAAAGAAATTAAGGAAG
TCAAAAAGAAGAAGGAAAGGGCCAAAGAACTTGAAGGCATCGACTTGAGTAATATCGTCTCAAGTTCACGTAGAAGATCCACGTCCAGTTATGCACCACCACCTCCGAAA
CCGAAAATACCAGTTAAAACAGATGGAGATGATGGTGATGATACTGATGAGGAGGAGGAGGAGGACGACGATGATGACGAGGATGACGAAGAGGAGGAGGATGACGATGA
AGAGGATAACGGTGATGTTGATGAAAGCCAGGGTGAAGAATTCAATGAGGATGACAATGAAGACAGTGATTGA
mRNA sequenceShow/hide mRNA sequence
CTTGGAAAGTCCTAACACACTTTCAGAGCCCATAATTTCTTCCTTCGCACAGAGAATCGCTGCGAGTCGCAGAAGAGCAAAATGGCGGAGGAATTACAGGACAACGATGC
TCCGAACAAGGACGCCATGGATGTAGCCGGTGATATAGAGGCCAAGATTGAGAACGCTATGCGCTCCCGCGTCTCTCACTTCAAGGAACAAGCCGACTCTTTAACTTTTG
AGGGGGTTAGAAGACTGTTAGAAAAGGACTTGTGTTTGAAGACATATGCTTTAGATGTGCATAAAAGATATGTCAAGGAGTGTTTGGTGAAGTGCTTAGAAGCTGTTGAG
GAAGACAATACCTCAAAGGATTCTGAGGAGACTGGGGGGAAAAGTGTAAGTAGAGAAGAAGCGGCTGAGTCACTTGAAGGGCATCAGTCCAAGAAAGGTGTAAAGGAACC
TAGCTTAGAAGATGAGGAGAAAATGGAAGACTCTCCAGTTATGGGCCTTCTCACAGGACGTAAAACAACAAATGTTGAATCTGACGGAATCAAAGGAAACAAAGGCAAAG
ATGACAAAAATATTCTTAGTGAGAGTACAATTAAGAAAGCTATTAGAAAAAGAACTCCTTATCTTAAAGCTAATTCGGAGAAAGTTACTATGGCTGGAGTTCGCCGCCTT
CTGGAGGATGACCTTAAACTTACAAAACATGCTCTCGATAGTTGGAAGAAGTTTATAAGCCAGCAAGTAGAGGAGATATTGGATTCTTGTGAAGCTGCTCAACAAGTTTC
TAATGAAAAGAAAGGTTCTCGTTTGAAAATTCCAAAAAAGGTAAGCAAAGAAAGCTCTCATTCTACAGAAGGGAGCAGTAGTGAGGAGGAAAACGATGAAGTTAAACCTG
GAAAGAAAAATGCAACTAAAGGAAGAATACCGAACTCTAAAGAAATGAAAAAGCGGAAAAGATCTACAAAGGAGACTGTCTCTGCCAAGAAGCAAAGCAAGCATGTCCAG
CATACATCAGAGGACGATAGTGATGAAGAAGGTGGTGGAAATGTCTCTGAAGATGCCCAGTCTGAATCATCCAATGAAAAACCTGTTAAGAAGGAAGTTTCAACTCCTGT
CTATGGAAAGCGTGTGGAGCACTTGAAATCGGTTATCAAATCGTGTGGGATGAGTGTTCCTCCATCGATTTATAAGAAAGTCAAGCAGGGACCTGAAAGCAAACGTGAAT
CACAACTTATAAAGGAGTTGGAGGGGATACTATCCAGAGAAGGATTGTCTGCTAATCCCTCTGAAAAAGAAATTAAGGAAGTCAAAAAGAAGAAGGAAAGGGCCAAAGAA
CTTGAAGGCATCGACTTGAGTAATATCGTCTCAAGTTCACGTAGAAGATCCACGTCCAGTTATGCACCACCACCTCCGAAACCGAAAATACCAGTTAAAACAGATGGAGA
TGATGGTGATGATACTGATGAGGAGGAGGAGGAGGACGACGATGATGACGAGGATGACGAAGAGGAGGAGGATGACGATGAAGAGGATAACGGTGATGTTGATGAAAGCC
AGGGTGAAGAATTCAATGAGGATGACAATGAAGACAGTGATTGAAATCGGAGATAGCATTCAAGATTCTGGCGCCAAGTCGTCGATCAACTTAGAAACGAACCATAGTGT
AACCTATTATTTTGCTATGTAGTTTTATTGGAGAGTTTGTGGTCAGCGATTTTAGAGCTATATGCCTATATCTAGGAGATTGGACGTAATATTGTACAATATTTTTTACT
CTTCATGATATAGACGAAATAAAAGAAAACTTATATAATTTTGATT
Protein sequenceShow/hide protein sequence
MAEELQDNDAPNKDAMDVAGDIEAKIENAMRSRVSHFKEQADSLTFEGVRRLLEKDLCLKTYALDVHKRYVKECLVKCLEAVEEDNTSKDSEETGGKSVSREEAAESLEG
HQSKKGVKEPSLEDEEKMEDSPVMGLLTGRKTTNVESDGIKGNKGKDDKNILSESTIKKAIRKRTPYLKANSEKVTMAGVRRLLEDDLKLTKHALDSWKKFISQQVEEIL
DSCEAAQQVSNEKKGSRLKIPKKVSKESSHSTEGSSSEEENDEVKPGKKNATKGRIPNSKEMKKRKRSTKETVSAKKQSKHVQHTSEDDSDEEGGGNVSEDAQSESSNEK
PVKKEVSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQGPESKRESQLIKELEGILSREGLSANPSEKEIKEVKKKKERAKELEGIDLSNIVSSSRRRSTSSYAPPPPK
PKIPVKTDGDDGDDTDEEEEEDDDDDEDDEEEEDDDEEDNGDVDESQGEEFNEDDNEDSD