CuGenDBv2

Sgr000540 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr000540
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: glutamic acid-rich protein-like isoform X2
Genome location: tig00000289:41943..48858
RNA-Seq Expression: Sgr000540
Synteny: Sgr000540
Gene Ontology terms:
    GO:0005634 - nucleus (cellular component)
    GO:0016874 - ligase activity (molecular function)
InterPro domains:
    IPR019098 - Histone chaperone domain CHZ
    IPR037647 - HIRA-interacting protein 3
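
The genome location field above uses a `contig:start..end` notation. A minimal sketch of how such a string could be split into its parts follows; the function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int, int]:
    """Split a 'contig:start..end' string into (contig, start, end, span).

    Assumption: coordinates are 1-based and inclusive, so the span
    is end - start + 1. The source record does not confirm this.
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    contig = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return contig, start, end, end - start + 1


if __name__ == "__main__":
    print(parse_location("tig00000289:41943..48858"))
```

Under the inclusive-coordinate assumption, the gene spans 6916 bases on contig tig00000289.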


Homology
GenBank top hits (e value, % identity, alignment)
KAG7016498.1 hypothetical protein SDJN02_21607 [Cucurbita argyrosperma subsp. argyrosperma] (e value: 1.7e-212; identity: 86.55%)
Query:  MAEELQDNDTPNEEVMDTAVDIETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLCLETYALDVHKRYIKQCLVKCLEGVEEDIASKDSEETGGKSVS
        MAEELQDND PNEE MD  V IETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLC+ETYALDVHKRYIKQCLVKCLEGVEED ASK SEETGGKSVS
Subjt:  MAEELQDNDTPNEEVMDTAVDIETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLCLETYALDVHKRYIKQCLVKCLEGVEEDIASKDSEETGGKSVS

Query:  RNEAAEALEGHQSKKGVKEPCLEDEEKMEDSPVMGLLTGHKTTNVKSE---GIKDKDDKDIPSESTIKKAIRKRTPYLKAHSEKVTMAGVRRLLEDDLKL
        R EAAE+LEGHQSKKG KEPCLEDEEKMEDSPVMGLL GHKT NV+S+   GIKDKDDKDIP+ESTIKKAIRKRTPYLKA+SEKVTMAGVRRLLEDDLKL
Subjt:  RNEAAEALEGHQSKKGVKEPCLEDEEKMEDSPVMGLLTGHKTTNVKSE---GIKDKDDKDIPSESTIKKAIRKRTPYLKAHSEKVTMAGVRRLLEDDLKL

Query:  TKYALDSCKKFISHQVEEILNSCEAAEQVPKEKKSSHLKTSKKVSKESSHSTE-GSSSEEENDEVKPGKNIATKGRIQNSNEMKKRKRSAKETVSAKKQS
        TKYALD CKKFIS QVEEILNSCEAAEQV  EKK S LKT KKVSKESSHSTE GSSSEEE+DEVKP K   TKGRI NSNE KKRKRS KE VSAKKQ 
Subjt:  TKYALDSCKKFISHQVEEILNSCEAAEQVPKEKKSSHLKTSKKVSKESSHSTE-GSSSEEENDEVKPGKNIATKGRIQNSNEMKKRKRSAKETVSAKKQS

Query:  KHVQHTSEEDGDEEGGENVSEDGQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKLCGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLSANPTEK
        KHVQHTSEED DEEGGENVSEDG SESSNEKPVKKEVSTPVYGKRVEHLKSVIK CGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLSANPTEK
Subjt:  KHVQHTSEEDGDEEGGENVSEDGQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKLCGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLSANPTEK

Query:  EIKEVKKKKERAKELEGIDVSNIVSSSRRRSSTSYVAPPPKPEVPVETDGDDGDDTDEEEEDEEDDE--------EEDNDEEDNGDVDESQG-EEFNE
        EIK+VKKKKERAKELEGID+SNIVSSSRRRS++SYVAPPPKP++PV+T+GDD DDTD+E++D++DD+        EE++DEEDNGDVDESQG EEFNE
Subjt:  EIKEVKKKKERAKELEGIDVSNIVSSSRRRSSTSYVAPPPKPEVPVETDGDDGDDTDEEEEDEEDDE--------EEDNDEEDNGDVDESQG-EEFNE

XP_022939456.1 DNA ligase 1-like isoform X1 [Cucurbita moschata] (e value: 1.0e-212; identity: 87.07%)
Query:  MAEELQDNDTPNEEVMDTAVDIETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLCLETYALDVHKRYIKQCLVKCLEGVEEDIASKDSEETGGKSVS
        MAEELQDND PNEE MD  V IETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLC+ETYALDVHKRYIKQCLVKCLEGVEED ASK SEETGGKSVS
Subjt:  MAEELQDNDTPNEEVMDTAVDIETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLCLETYALDVHKRYIKQCLVKCLEGVEEDIASKDSEETGGKSVS

Query:  RNEAAEALEGHQSKKGVKEPCLEDEEKMEDSPVMGLLTGHKTTNVKSE---GIKDKDDKDIPSESTIKKAIRKRTPYLKAHSEKVTMAGVRRLLEDDLKL
        R EAAE+LEGHQSKKG KEPCLEDEEKMEDSPVMGLL GHKT NV+S+   GIKDKDDKDIP+ESTIKKAIRKRTPYLKA+SEKVTMAGVRRLLEDDLKL
Subjt:  RNEAAEALEGHQSKKGVKEPCLEDEEKMEDSPVMGLLTGHKTTNVKSE---GIKDKDDKDIPSESTIKKAIRKRTPYLKAHSEKVTMAGVRRLLEDDLKL

Query:  TKYALDSCKKFISHQVEEILNSCEAAEQVPKEKKSSHLKTSKKVSKESSHSTE-GSSSEEENDEVKPGKNIATKGRIQNSNEMKKRKRSAKETVSAKKQS
        TKYALD CKKFIS QVEEILNSCEAAE+V  EKK S LKT KKVSKESSHSTE GSSSEEE+DEVKP K   TKGRI NSNE KKRKRS KE VSAKKQ 
Subjt:  TKYALDSCKKFISHQVEEILNSCEAAEQVPKEKKSSHLKTSKKVSKESSHSTE-GSSSEEENDEVKPGKNIATKGRIQNSNEMKKRKRSAKETVSAKKQS

Query:  KHVQHTSEEDGDEEGGENVSEDGQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKLCGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLSANPTEK
        KHVQHTSEED DEEGGENVSEDG SESSNEKPVKKEVSTPVYGKRVEHLKSVIK CGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLSANPTEK
Subjt:  KHVQHTSEEDGDEEGGENVSEDGQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKLCGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLSANPTEK

Query:  EIKEVKKKKERAKELEGIDVSNIVSSSRRRSSTSYVAPPPKPEVPVETDGDDGDDTDEEEEDEEDDE-----EEDNDEEDNGDVDESQG-EEFNE
        EIK+VKKKKERAKELEGID+SNIVSSSRRRS++SYVAPPPKP++PV+T+GDD DDTD+EEE+++DD+     EE++DEEDNGDVDESQG EEFNE
Subjt:  EIKEVKKKKERAKELEGIDVSNIVSSSRRRSSTSYVAPPPKPEVPVETDGDDGDDTDEEEEDEEDDE-----EEDNDEEDNGDVDESQG-EEFNE

XP_022939457.1 glutamic acid-rich protein-like isoform X2 [Cucurbita moschata] (e value: 5.5e-211; identity: 86.73%)
Query:  MAEELQDNDTPNEEVMDTAVDIETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLCLETYALDVHKRYIKQCLVKCLEGVEEDIASKDSEETGGKSVS
        MAEELQDND PNEE MD  V IETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLC+ETYALDVHKRYIKQCLVKCLEGVEED ASK SEETGGKSVS
Subjt:  MAEELQDNDTPNEEVMDTAVDIETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLCLETYALDVHKRYIKQCLVKCLEGVEEDIASKDSEETGGKSVS

Query:  RNEAAEALEGHQSKKGVKEPCLEDEEKMEDSPVMGLLTGHKTTNVKSE---GIKDKDDKDIPSESTIKKAIRKRTPYLKAHSEKVTMAGVRRLLEDDLKL
        R EAAE+LEGHQSKKG KEPCLEDEEKMEDSPVMGLL GHKT NV+S+   GIKDKDDKDIP+ESTIKKAIRKRTPYLKA+SEKVTMAGVRRLLEDDLKL
Subjt:  RNEAAEALEGHQSKKGVKEPCLEDEEKMEDSPVMGLLTGHKTTNVKSE---GIKDKDDKDIPSESTIKKAIRKRTPYLKAHSEKVTMAGVRRLLEDDLKL

Query:  TKYALDSCKKFISHQVEEILNSCEAAEQVPKEKKSSHLKTSKKVSKESSHSTE-GSSSEEENDEVKPGKNIATKGRIQNSNEMKKRKRSAKETVSAKKQS
        TKYALD CKKFIS QVEEILNSCEAAE+V  EKK S LKT KKVSKESSHSTE GSSSEEE+DEVKP K   TKGRI NSNE KKRKRS KE VSAKKQ 
Subjt:  TKYALDSCKKFISHQVEEILNSCEAAEQVPKEKKSSHLKTSKKVSKESSHSTE-GSSSEEENDEVKPGKNIATKGRIQNSNEMKKRKRSAKETVSAKKQS

Query:  KHVQHTSEEDGDEEGGENVSEDGQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKLCGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLSANPTEK
        KHVQHTSEED DEEGGENVSEDG SESSNEKPVKKEVSTPVYGKRVEHLKSVIK CGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLSANPTEK
Subjt:  KHVQHTSEEDGDEEGGENVSEDGQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKLCGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLSANPTEK

Query:  EIKEVKKKKERAKELEGIDVSNIVSSSRRRSSTSYVAPPPKPEVPVETDGDDGDDTDEEEEDEEDDE-----EEDNDEEDNGDVDESQGE
        EIK+VKKKKERAKELEGID+SNIVSSSRRRS++SYVAPPPKP++PV+T+GDD DDTD+EEE+++DD+     EE++DEEDNGDVDESQ +
Subjt:  EIKEVKKKKERAKELEGIDVSNIVSSSRRRSSTSYVAPPPKPEVPVETDGDDGDDTDEEEEDEEDDE-----EEDNDEEDNGDVDESQGE

XP_023551365.1 glutamic acid-rich protein-like isoform X1 [Cucurbita pepo subsp. pepo] (e value: 1.5e-213; identity: 87.65%)
Query:  MAEELQDNDTPNEEVMDTAVDIETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLCLETYALDVHKRYIKQCLVKCLEGVEEDIASKDSEETGGKSVS
        MAEELQDND PNEE MD  V IETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLC+ETYALDVHKRYIKQCLVKCLEGVEED  SK SEETGGKSVS
Subjt:  MAEELQDNDTPNEEVMDTAVDIETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLCLETYALDVHKRYIKQCLVKCLEGVEEDIASKDSEETGGKSVS

Query:  RNEAAEALEGHQSKKGVKEPCLEDEEKMEDSPVMGLLTGHKTTNVKSE---GIKDKDDKDIPSESTIKKAIRKRTPYLKAHSEKVTMAGVRRLLEDDLKL
        R EAAE+LEGHQSKKG KEPCLEDEEKMEDSPVMGLL GHKT NV+S+   GIKDKDDKDIP+E+TIKKAIRKRTPYLKA+SEKVTMAGVRRLLEDDLKL
Subjt:  RNEAAEALEGHQSKKGVKEPCLEDEEKMEDSPVMGLLTGHKTTNVKSE---GIKDKDDKDIPSESTIKKAIRKRTPYLKAHSEKVTMAGVRRLLEDDLKL

Query:  TKYALDSCKKFISHQVEEILNSCEAAEQVPKEKKSSHLKTSKKVSKESSHSTE-GSSSEEENDEVKPGKNIATKGRIQNSNEMKKRKRSAKETVSAKKQS
        TKYALD CKKFIS QVEEILNSCEAAEQV  EKK S LKT KKVSKESSHSTE GSSSEEE+DEVKP K   TKGRI NSNE KKRKRS KE VSAKKQ 
Subjt:  TKYALDSCKKFISHQVEEILNSCEAAEQVPKEKKSSHLKTSKKVSKESSHSTE-GSSSEEENDEVKPGKNIATKGRIQNSNEMKKRKRSAKETVSAKKQS

Query:  KHVQHTSEEDGDEEGGENVSEDGQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKLCGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLSANPTEK
        KHVQHTSEED DEEGGENVSEDG SESSNEKPVKKEVSTPVYGKRVEHLKSVIK CGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLSANPTEK
Subjt:  KHVQHTSEEDGDEEGGENVSEDGQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKLCGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLSANPTEK

Query:  EIKEVKKKKERAKELEGIDVSNIVSSSRRRSSTSYVAPPPKPEVPVETDGDDGDDTDEEEEDEEDDEEED----NDEEDNGDVDESQG-EEFNE
        EIK+VKKKKERAKELEGID+SNIVSSSRRRS++SYVAPPPKP++PV+T+GDD DDTDEEEE+EEDD++ED    +DEEDNGDVDESQG EEFNE
Subjt:  EIKEVKKKKERAKELEGIDVSNIVSSSRRRSSTSYVAPPPKPEVPVETDGDDGDDTDEEEEDEEDDEEED----NDEEDNGDVDESQG-EEFNE

XP_023551366.1 glutamic acid-rich protein-like isoform X2 [Cucurbita pepo subsp. pepo] (e value: 8.4e-212; identity: 87.32%)
Query:  MAEELQDNDTPNEEVMDTAVDIETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLCLETYALDVHKRYIKQCLVKCLEGVEEDIASKDSEETGGKSVS
        MAEELQDND PNEE MD  V IETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLC+ETYALDVHKRYIKQCLVKCLEGVEED  SK SEETGGKSVS
Subjt:  MAEELQDNDTPNEEVMDTAVDIETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLCLETYALDVHKRYIKQCLVKCLEGVEEDIASKDSEETGGKSVS

Query:  RNEAAEALEGHQSKKGVKEPCLEDEEKMEDSPVMGLLTGHKTTNVKSE---GIKDKDDKDIPSESTIKKAIRKRTPYLKAHSEKVTMAGVRRLLEDDLKL
        R EAAE+LEGHQSKKG KEPCLEDEEKMEDSPVMGLL GHKT NV+S+   GIKDKDDKDIP+E+TIKKAIRKRTPYLKA+SEKVTMAGVRRLLEDDLKL
Subjt:  RNEAAEALEGHQSKKGVKEPCLEDEEKMEDSPVMGLLTGHKTTNVKSE---GIKDKDDKDIPSESTIKKAIRKRTPYLKAHSEKVTMAGVRRLLEDDLKL

Query:  TKYALDSCKKFISHQVEEILNSCEAAEQVPKEKKSSHLKTSKKVSKESSHSTE-GSSSEEENDEVKPGKNIATKGRIQNSNEMKKRKRSAKETVSAKKQS
        TKYALD CKKFIS QVEEILNSCEAAEQV  EKK S LKT KKVSKESSHSTE GSSSEEE+DEVKP K   TKGRI NSNE KKRKRS KE VSAKKQ 
Subjt:  TKYALDSCKKFISHQVEEILNSCEAAEQVPKEKKSSHLKTSKKVSKESSHSTE-GSSSEEENDEVKPGKNIATKGRIQNSNEMKKRKRSAKETVSAKKQS

Query:  KHVQHTSEEDGDEEGGENVSEDGQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKLCGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLSANPTEK
        KHVQHTSEED DEEGGENVSEDG SESSNEKPVKKEVSTPVYGKRVEHLKSVIK CGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLSANPTEK
Subjt:  KHVQHTSEEDGDEEGGENVSEDGQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKLCGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLSANPTEK

Query:  EIKEVKKKKERAKELEGIDVSNIVSSSRRRSSTSYVAPPPKPEVPVETDGDDGDDTDEEEEDEEDDEEED----NDEEDNGDVDESQGE
        EIK+VKKKKERAKELEGID+SNIVSSSRRRS++SYVAPPPKP++PV+T+GDD DDTDEEEE+EEDD++ED    +DEEDNGDVDESQ +
Subjt:  EIKEVKKKKERAKELEGIDVSNIVSSSRRRSSTSYVAPPPKPEVPVETDGDDGDDTDEEEEDEEDDEEED----NDEEDNGDVDESQGE
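
The % identity column can be related to the alignment blocks themselves. In BLAST-style output the middle line carries a letter where query and subject match, `+` for a conservative substitution, and a space otherwise. The sketch below counts matching letters over the aligned length (gaps included). It is an illustration only: the helper name `percent_identity` is hypothetical, it is not necessarily how the database computed the listed values, and whitespace lost during page extraction could shift the result.

```python
def percent_identity(query_row: str, midline_row: str) -> float:
    """Percent identity for one aligned row (or several rows concatenated).

    query_row   -- the aligned query string, gaps ('-') included
    midline_row -- the match line: letters mark identical residues,
                   '+' marks conservative substitutions, ' ' mismatches
    Assumption: identity = identical positions / alignment length.
    """
    if not query_row:
        raise ValueError("empty alignment row")
    # Only letters on the match line count as identities; '+' does not.
    identical = sum(1 for ch in midline_row if ch.isalpha())
    return 100.0 * identical / len(query_row)


if __name__ == "__main__":
    # Toy example, not taken from the record above.
    print(round(percent_identity("MAEELQ", "MAE+LQ"), 2))
```

For a full hit, the query rows and match rows of all blocks would be concatenated before calling the function.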

TrEMBL top hits (e value, % identity, alignment)
A0A6J1CJ49 DNA ligase 1 (e value: 5.0e-210; identity: 88.03%)
Query:  MAEELQDND-TPNEEVMDTAVDIETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLCLETYALDVHKRYIKQCLVKCLEGVEEDIASKDSEETGGKSV
        MAEELQDND  PNE+ MDT VDIE KIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLCLETYALDVHKRYIKQCLVKCLEGVEED ASKDSEETGGKSV
Subjt:  MAEELQDND-TPNEEVMDTAVDIETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLCLETYALDVHKRYIKQCLVKCLEGVEEDIASKDSEETGGKSV

Query:  SRNEAAEALEGHQSKKGVKEPCLEDEEKMEDSPVMGLLTGHKTTNVKSEGIKDKDDKDIPSESTIKKAIRKRTPYLKAHSEKVTMAGVRRLLEDDLKLTK
        SR EAA++LEG QSKKGVKEPCLEDEEKMEDSPVMGLLTG KTTNVK EGIKD+DDKDIPSES IKKAIRKRT YLKA+SEKVTMAGVRRLLE+DLKLTK
Subjt:  SRNEAAEALEGHQSKKGVKEPCLEDEEKMEDSPVMGLLTGHKTTNVKSEGIKDKDDKDIPSESTIKKAIRKRTPYLKAHSEKVTMAGVRRLLEDDLKLTK

Query:  YALDSCKKFISHQVEEILNSCEAAEQVPKEKKSSHLKTSKKVSKESSHSTEGSSSEEENDEVKPGKNIATKGRIQNSNEMKKRKRSAKETVSAKKQSKHV
        YALD CKK IS QVEEILNSCEAAEQVP EKKSS LKT KKVSKESSHSTEGSSSEEENDEVKP K  ATKGRI NSNE KKRKRSAKETVSAKKQSK+V
Subjt:  YALDSCKKFISHQVEEILNSCEAAEQVPKEKKSSHLKTSKKVSKESSHSTEGSSSEEENDEVKPGKNIATKGRIQNSNEMKKRKRSAKETVSAKKQSKHV

Query:  QHTSEEDGDEEGGENVSEDGQSESSNEKPVK---KEVSTPVYGKRVEHLKSVIKLCGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLSANPTEK
        Q TSE D DEEGGENVSEDG+SESS+EKPVK   KEVST VYGKRVEHLKSVIK CGMSVPP+IYKKVKQAPESKRESQLIKELEGILSREGLSANPTEK
Subjt:  QHTSEEDGDEEGGENVSEDGQSESSNEKPVK---KEVSTPVYGKRVEHLKSVIKLCGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLSANPTEK

Query:  EIKEVKKKKERAKELEGIDVSNIVSSSRRRSSTSYVA---PPPKPEVPVETDGDDGDDTDEEEEDEEDDEEE-DNDEEDNGDVDESQGEEFNE
        EIK+V+KKKERAKELEGID+SNIVSSSRRRS+ SY A   PPPKP++PVETDG D DDTDEEE+DEEDDEEE D+DEEDNG  DESQGEEFNE
Subjt:  EIKEVKKKKERAKELEGIDVSNIVSSSRRRSSTSYVA---PPPKPEVPVETDGDDGDDTDEEEEDEEDDEEE-DNDEEDNGDVDESQGEEFNE

A0A6J1FFY5 DNA ligase 1-like isoform X1 (e value: 4.8e-213; identity: 87.07%)
Query:  MAEELQDNDTPNEEVMDTAVDIETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLCLETYALDVHKRYIKQCLVKCLEGVEEDIASKDSEETGGKSVS
        MAEELQDND PNEE MD  V IETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLC+ETYALDVHKRYIKQCLVKCLEGVEED ASK SEETGGKSVS
Subjt:  MAEELQDNDTPNEEVMDTAVDIETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLCLETYALDVHKRYIKQCLVKCLEGVEEDIASKDSEETGGKSVS

Query:  RNEAAEALEGHQSKKGVKEPCLEDEEKMEDSPVMGLLTGHKTTNVKSE---GIKDKDDKDIPSESTIKKAIRKRTPYLKAHSEKVTMAGVRRLLEDDLKL
        R EAAE+LEGHQSKKG KEPCLEDEEKMEDSPVMGLL GHKT NV+S+   GIKDKDDKDIP+ESTIKKAIRKRTPYLKA+SEKVTMAGVRRLLEDDLKL
Subjt:  RNEAAEALEGHQSKKGVKEPCLEDEEKMEDSPVMGLLTGHKTTNVKSE---GIKDKDDKDIPSESTIKKAIRKRTPYLKAHSEKVTMAGVRRLLEDDLKL

Query:  TKYALDSCKKFISHQVEEILNSCEAAEQVPKEKKSSHLKTSKKVSKESSHSTE-GSSSEEENDEVKPGKNIATKGRIQNSNEMKKRKRSAKETVSAKKQS
        TKYALD CKKFIS QVEEILNSCEAAE+V  EKK S LKT KKVSKESSHSTE GSSSEEE+DEVKP K   TKGRI NSNE KKRKRS KE VSAKKQ 
Subjt:  TKYALDSCKKFISHQVEEILNSCEAAEQVPKEKKSSHLKTSKKVSKESSHSTE-GSSSEEENDEVKPGKNIATKGRIQNSNEMKKRKRSAKETVSAKKQS

Query:  KHVQHTSEEDGDEEGGENVSEDGQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKLCGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLSANPTEK
        KHVQHTSEED DEEGGENVSEDG SESSNEKPVKKEVSTPVYGKRVEHLKSVIK CGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLSANPTEK
Subjt:  KHVQHTSEEDGDEEGGENVSEDGQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKLCGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLSANPTEK

Query:  EIKEVKKKKERAKELEGIDVSNIVSSSRRRSSTSYVAPPPKPEVPVETDGDDGDDTDEEEEDEEDDE-----EEDNDEEDNGDVDESQG-EEFNE
        EIK+VKKKKERAKELEGID+SNIVSSSRRRS++SYVAPPPKP++PV+T+GDD DDTD+EEE+++DD+     EE++DEEDNGDVDESQG EEFNE
Subjt:  EIKEVKKKKERAKELEGIDVSNIVSSSRRRSSTSYVAPPPKPEVPVETDGDDGDDTDEEEEDEEDDE-----EEDNDEEDNGDVDESQG-EEFNE

A0A6J1FGV2 glutamic acid-rich protein-like isoform X2 (e value: 2.7e-211; identity: 86.73%)
Query:  MAEELQDNDTPNEEVMDTAVDIETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLCLETYALDVHKRYIKQCLVKCLEGVEEDIASKDSEETGGKSVS
        MAEELQDND PNEE MD  V IETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLC+ETYALDVHKRYIKQCLVKCLEGVEED ASK SEETGGKSVS
Subjt:  MAEELQDNDTPNEEVMDTAVDIETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLCLETYALDVHKRYIKQCLVKCLEGVEEDIASKDSEETGGKSVS

Query:  RNEAAEALEGHQSKKGVKEPCLEDEEKMEDSPVMGLLTGHKTTNVKSE---GIKDKDDKDIPSESTIKKAIRKRTPYLKAHSEKVTMAGVRRLLEDDLKL
        R EAAE+LEGHQSKKG KEPCLEDEEKMEDSPVMGLL GHKT NV+S+   GIKDKDDKDIP+ESTIKKAIRKRTPYLKA+SEKVTMAGVRRLLEDDLKL
Subjt:  RNEAAEALEGHQSKKGVKEPCLEDEEKMEDSPVMGLLTGHKTTNVKSE---GIKDKDDKDIPSESTIKKAIRKRTPYLKAHSEKVTMAGVRRLLEDDLKL

Query:  TKYALDSCKKFISHQVEEILNSCEAAEQVPKEKKSSHLKTSKKVSKESSHSTE-GSSSEEENDEVKPGKNIATKGRIQNSNEMKKRKRSAKETVSAKKQS
        TKYALD CKKFIS QVEEILNSCEAAE+V  EKK S LKT KKVSKESSHSTE GSSSEEE+DEVKP K   TKGRI NSNE KKRKRS KE VSAKKQ 
Subjt:  TKYALDSCKKFISHQVEEILNSCEAAEQVPKEKKSSHLKTSKKVSKESSHSTE-GSSSEEENDEVKPGKNIATKGRIQNSNEMKKRKRSAKETVSAKKQS

Query:  KHVQHTSEEDGDEEGGENVSEDGQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKLCGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLSANPTEK
        KHVQHTSEED DEEGGENVSEDG SESSNEKPVKKEVSTPVYGKRVEHLKSVIK CGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLSANPTEK
Subjt:  KHVQHTSEEDGDEEGGENVSEDGQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKLCGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLSANPTEK

Query:  EIKEVKKKKERAKELEGIDVSNIVSSSRRRSSTSYVAPPPKPEVPVETDGDDGDDTDEEEEDEEDDE-----EEDNDEEDNGDVDESQGE
        EIK+VKKKKERAKELEGID+SNIVSSSRRRS++SYVAPPPKP++PV+T+GDD DDTD+EEE+++DD+     EE++DEEDNGDVDESQ +
Subjt:  EIKEVKKKKERAKELEGIDVSNIVSSSRRRSSTSYVAPPPKPEVPVETDGDDGDDTDEEEEDEEDDE-----EEDNDEEDNGDVDESQGE

A0A6J1JTY1 glutamic acid-rich protein-like isoform X1 (e value: 4.2e-209; identity: 86.03%)
Query:  MAEELQDNDTPNEEVMDTAVDIETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLCLETYALDVHKRYIKQCLVKCLEGVEEDIASKDSEETGGKSVS
        MAEELQD D PNEE MD  V IETKIQNAMLSRVSHFKEQADSLTFEGVRRLLE DLC+ETYALDVHKRYIKQCLVKCLEGVEED ASK SEETGGKSVS
Subjt:  MAEELQDNDTPNEEVMDTAVDIETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLCLETYALDVHKRYIKQCLVKCLEGVEEDIASKDSEETGGKSVS

Query:  RNEAAEALEGHQSKKGVKEPCLEDEEKMEDSPVMGLLTGHKTTNVKSE---GIKDKDDKDIPSESTIKKAIRKRTPYLKAHSEKVTMAGVRRLLEDDLKL
        R EAAE+LEGHQSKKG KEPCLEDEEKMEDSPVMGLL GHKT N +S+   GIKDKDDKDIP+ESTIKKAIRKRTPYLKA+SEKVTMAGVRRLLEDDLKL
Subjt:  RNEAAEALEGHQSKKGVKEPCLEDEEKMEDSPVMGLLTGHKTTNVKSE---GIKDKDDKDIPSESTIKKAIRKRTPYLKAHSEKVTMAGVRRLLEDDLKL

Query:  TKYALDSCKKFISHQVEEILNSCEAAEQVPKEKKSSHLKTSKKVSKESSHSTE-GSSSEEENDEVKPGKNIATKGRIQNSNEMKKRKRSAKETVSAKKQS
        TKYALD CKKFIS QVEEILNSCEAAEQV  EKK S LKT KKVSKESSHSTE GSSSEEE+DEVKP K   TKG I NSNEMKKRKRS KE VSAKKQ 
Subjt:  TKYALDSCKKFISHQVEEILNSCEAAEQVPKEKKSSHLKTSKKVSKESSHSTE-GSSSEEENDEVKPGKNIATKGRIQNSNEMKKRKRSAKETVSAKKQS

Query:  KHVQHTSEEDGDEEGGENVSEDGQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKLCGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLSANPTEK
        KHV HT EED DE+GGENVSEDG SESSNEKPVKKEVSTPVYGKRVEHLKSVIK CGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLS NPTEK
Subjt:  KHVQHTSEEDGDEEGGENVSEDGQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKLCGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLSANPTEK

Query:  EIKEVKKKKERAKELEGIDVSNIVSSSRRRSSTSYVAPPPKPEVPVETDGDDGDDTDEEEEDEEDDE----EEDNDEEDNGDVDESQG-EEFNE
        EIK+VKKKKERAKELEGID+SNIVSSSRRRS++SY APPPKP++PV+T+GDD DDTD+EEE+E+DD+    EE++DEEDNGDVDESQG EEFNE
Subjt:  EIKEVKKKKERAKELEGIDVSNIVSSSRRRSSTSYVAPPPKPEVPVETDGDDGDDTDEEEEDEEDDE----EEDNDEEDNGDVDESQG-EEFNE

A0A6J1K3E3 glutamic acid-rich protein-like isoform X2 (e value: 2.3e-207; identity: 85.69%)
Query:  MAEELQDNDTPNEEVMDTAVDIETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLCLETYALDVHKRYIKQCLVKCLEGVEEDIASKDSEETGGKSVS
        MAEELQD D PNEE MD  V IETKIQNAMLSRVSHFKEQADSLTFEGVRRLLE DLC+ETYALDVHKRYIKQCLVKCLEGVEED ASK SEETGGKSVS
Subjt:  MAEELQDNDTPNEEVMDTAVDIETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLCLETYALDVHKRYIKQCLVKCLEGVEEDIASKDSEETGGKSVS

Query:  RNEAAEALEGHQSKKGVKEPCLEDEEKMEDSPVMGLLTGHKTTNVKSE---GIKDKDDKDIPSESTIKKAIRKRTPYLKAHSEKVTMAGVRRLLEDDLKL
        R EAAE+LEGHQSKKG KEPCLEDEEKMEDSPVMGLL GHKT N +S+   GIKDKDDKDIP+ESTIKKAIRKRTPYLKA+SEKVTMAGVRRLLEDDLKL
Subjt:  RNEAAEALEGHQSKKGVKEPCLEDEEKMEDSPVMGLLTGHKTTNVKSE---GIKDKDDKDIPSESTIKKAIRKRTPYLKAHSEKVTMAGVRRLLEDDLKL

Query:  TKYALDSCKKFISHQVEEILNSCEAAEQVPKEKKSSHLKTSKKVSKESSHSTE-GSSSEEENDEVKPGKNIATKGRIQNSNEMKKRKRSAKETVSAKKQS
        TKYALD CKKFIS QVEEILNSCEAAEQV  EKK S LKT KKVSKESSHSTE GSSSEEE+DEVKP K   TKG I NSNEMKKRKRS KE VSAKKQ 
Subjt:  TKYALDSCKKFISHQVEEILNSCEAAEQVPKEKKSSHLKTSKKVSKESSHSTE-GSSSEEENDEVKPGKNIATKGRIQNSNEMKKRKRSAKETVSAKKQS

Query:  KHVQHTSEEDGDEEGGENVSEDGQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKLCGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLSANPTEK
        KHV HT EED DE+GGENVSEDG SESSNEKPVKKEVSTPVYGKRVEHLKSVIK CGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLS NPTEK
Subjt:  KHVQHTSEEDGDEEGGENVSEDGQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKLCGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLSANPTEK

Query:  EIKEVKKKKERAKELEGIDVSNIVSSSRRRSSTSYVAPPPKPEVPVETDGDDGDDTDEEEEDEEDDE----EEDNDEEDNGDVDESQGE
        EIK+VKKKKERAKELEGID+SNIVSSSRRRS++SY APPPKP++PV+T+GDD DDTD+EEE+E+DD+    EE++DEEDNGDVDESQ +
Subjt:  EIKEVKKKKERAKELEGIDVSNIVSSSRRRSSTSYVAPPPKPEVPVETDGDDGDDTDEEEEDEEDDE----EEDNDEEDNGDVDESQGE

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G44780.1 CONTAINS InterPro DOMAIN/s: Histone chaperone domain CHZ (InterPro:IPR019098) (e value: 3.4e-70; identity: 44.21%)
Query:  AVDIETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLCLETYALDVHKRYIKQCLVKCLEGVEEDIASKDSEETGGK--SVSRNEAAEALEGHQSKKG
        A +IE KI  A+ SRV++ + +AD  T   VRR+LE+D+ LE   LDV+K ++K+ LVKCLE    +  S++S+ET  +   +   E AE  E H+    
Subjt:  AVDIETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLCLETYALDVHKRYIKQCLVKCLEGVEEDIASKDSEETGGK--SVSRNEAAEALEGHQSKKG

Query:  VKEPCLEDEEKMEDSPVMGLLTGHKTTNVKSEGIKDKDDKDIPSESTIKKAIRKRTPYLKAHSEKVTMAGVRRLLEDDLKLTKYALDSCKKFISHQVEEI
          E   E+  K E              +VK +G K+   +D      IK+A+RKR  Y+KA+SE +TMA +RRLLE+DLKL K +LD  KKFI+ +++E+
Subjt:  VKEPCLEDEEKMEDSPVMGLLTGHKTTNVKSEGIKDKDDKDIPSESTIKKAIRKRTPYLKAHSEKVTMAGVRRLLEDDLKLTKYALDSCKKFISHQVEEI

Query:  LN-----SCEAAEQVPKEKKSSHLKTSKKVSKE--SSHSTEGSSSEEENDEVKPGKNIATKGRIQNSNEMKKRKRSAKETVSAKKQSKHVQHTSEEDGDE
        L       C     V   KK      SK VS E  S   TEG+    +N+EV   K +A K ++     M KRK    + VS +K++KH +  SE D   
Subjt:  LN-----SCEAAEQVPKEKKSSHLKTSKKVSKE--SSHSTEGSSSEEENDEVKPGKNIATKGRIQNSNEMKKRKRSAKETVSAKKQSKHVQHTSEEDGDE

Query:  EGGENVSEDGQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKLCGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLSANPTEKEIKEVKKKKERAK
              S+ G SE S ++   KE +T VYGKRVEHLKSVIK CGMSVPP+IYKK KQAP+ KRE+ LI+ELE IL++EGLS++P+  EIKEVKK+K  ++
Subjt:  EGGENVSEDGQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKLCGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLSANPTEKEIKEVKKKKERAK

Query:  ELEGIDVSNIVSSSRRRSSTSYVAPPPKPEVPVETDGDDGDDTDEEEEDE-----------EDDEEEDNDEEDNG
        ELEGID +NIV +SRRRSSTS+ APPPKP+V  E++ +  +  D E E+E           E+ EEE N EED+G
Subjt:  ELEGIDVSNIVSSSRRRSSTSYVAPPPKPEVPVETDGDDGDDTDEEEEDE-----------EDDEEEDNDEEDNG

AT1G44780.2 INVOLVED IN: biological_process unknown (e value: 4.5e-70; identity: 44.21%)
Query:  AVDIETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLCLETYALDVHKRYIKQCLVKCLEGVEEDIASKDSEETGGK--SVSRNEAAEALEGHQSKKG
        A +IE KI  A+ SRV++ + +AD  T   VRR+LE+D+ LE   LDV+K ++K+ LVKCLE    +  S++S+ET  +   +   E AE  E H+    
Subjt:  AVDIETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLCLETYALDVHKRYIKQCLVKCLEGVEEDIASKDSEETGGK--SVSRNEAAEALEGHQSKKG

Query:  VKEPCLEDEEKMEDSPVMGLLTGHKTTNVKSEGIKDKDDKDIPSESTIKKAIRKRTPYLKAHSEKVTMAGVRRLLEDDLKLTKYALDSCKKFISHQVEEI
          E   E+  K E              +VK +G K+   +D      IK+A+RKR  Y+KA+SE +TMA +RRLLE+DLKL K +LD  KKFI+ +++E+
Subjt:  VKEPCLEDEEKMEDSPVMGLLTGHKTTNVKSEGIKDKDDKDIPSESTIKKAIRKRTPYLKAHSEKVTMAGVRRLLEDDLKLTKYALDSCKKFISHQVEEI

Query:  LN-----SCEAAEQVPKEKKSSHLKTSKKVSKE--SSHSTEGSSSEEENDEVKPGKNIATKGRIQNSNEMKKRKRSAKETVSAKKQSKHVQHTSEEDGDE
        L       C     V   KK      SK VS E  S   TEG+    +N+EV   K +A K ++     M KRK    + VS +K++KH +  SE D   
Subjt:  LN-----SCEAAEQVPKEKKSSHLKTSKKVSKE--SSHSTEGSSSEEENDEVKPGKNIATKGRIQNSNEMKKRKRSAKETVSAKKQSKHVQHTSEEDGDE

Query:  EGGENVSEDGQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKLCGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLSANPTEKEIKEVKKKKERAK
              S+ G SE S +    KE +T VYGKRVEHLKSVIK CGMSVPP+IYKK KQAP+ KRE+ LI+ELE IL++EGLS++P+  EIKEVKK+K  ++
Subjt:  EGGENVSEDGQSESSNEKPVKKEVSTPVYGKRVEHLKSVIKLCGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLSANPTEKEIKEVKKKKERAK

Query:  ELEGIDVSNIVSSSRRRSSTSYVAPPPKPEVPVETDGDDGDDTDEEEEDE-----------EDDEEEDNDEEDNG
        ELEGID +NIV +SRRRSSTS+ APPPKP+V  E++ +  +  D E E+E           E+ EEE N EED+G
Subjt:  ELEGIDVSNIVSSSRRRSSTSYVAPPPKPEVPVETDGDDGDDTDEEEEDE-----------EDDEEEDNDEEDNG

AT4G08310.1 FUNCTIONS IN: molecular_function unknown (e value: 1.4e-92; identity: 51.16%)
Query:  DIETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLCLETYALDVHKRYIKQCLVKCLEGVEEDIASKDSEETGGKS--VSRNEAAEALEGHQSKKGVK
        DIE++I  AM SRV++ +++AD+ TFEGVRRLLE+DL LE +ALDVHK ++KQ LV+CL G E D  S++S ET  K       EAAE  + H +KK  K
Subjt:  DIETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLCLETYALDVHKRYIKQCLVKCLEGVEEDIASKDSEETGGKS--VSRNEAAEALEGHQSKKGVK

Query:  EPCLEDEEKMEDSPVMGLLTGHKTTNVKSEGIKDKDDKDIPSESTIKKAIRKRTPYLKAHSEKVTMAGVRRLLEDDLKLTKYALDSCKKFISHQVEEILN
        E    D+EK +DSPVMGLLT   T+   +E  KD+D + +  +S IKKA+RKR+ Y+KA+SEK+TM  +RRLLE DLKL KY+LD  KKFI+ +++EIL 
Subjt:  EPCLEDEEKMEDSPVMGLLTGHKTTNVKSEGIKDKDDKDIPSESTIKKAIRKRTPYLKAHSEKVTMAGVRRLLEDDLKLTKYALDSCKKFISHQVEEILN

Query:  SCEAAEQVPKEKKSSHLKTSKKVSKESSHSTE----GSSSEEENDEVKPGKNIATKGRIQNSNEMKKRKRSAKETVSAKKQSKHVQHTSEEDGDEEGGEN
        + EA +   K ++    K  K    ++S S E        EEE+ EV   K +A K ++  S    KRKR  ++  SAKK       T + D   +    
Subjt:  SCEAAEQVPKEKKSSHLKTSKKVSKESSHSTE----GSSSEEENDEVKPGKNIATKGRIQNSNEMKKRKRSAKETVSAKKQSKHVQHTSEEDGDEEGGEN

Query:  VSEDGQSESSNEKPVKK-EVSTPVYGKRVEHLKSVIKLCGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLSANPTEKEIKEVKKKKERAKELEG
         S+ G+   S+EK VKK E  T  YGKRVEHLKS+IK CGMS+ PS+Y+K KQAPE KRE  LIKEL+ +L++EGLSANP+EKEIKEVKK+KER KELEG
Subjt:  VSEDGQSESSNEKPVKK-EVSTPVYGKRVEHLKSVIKLCGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLSANPTEKEIKEVKKKKERAKELEG

Query:  IDVSNIVSSSRRRSSTSYVAPPPKPEVPVETDGDDGDDTD-EEEEDEE---DDEEEDNDE---EDNGDVDESQGE
        ID SNIVSSSRRRSS S+V PPPKP    E++ DD +D++ EE+EDEE   ++EEE+ DE   ED G+  +++GE
Subjt:  IDVSNIVSSSRRRSSTSYVAPPPKPEVPVETDGDDGDDTD-EEEEDEE---DDEEEDNDE---EDNGDVDESQGE


Sequences
CDS sequence
ATGGCGGAGGAGTTGCAGGACAACGACACTCCTAACGAAGAGGTGATGGATACAGCGGTTGATATAGAGACGAAGATTCAGAACGCTATGCTCTCCCGCGTCTCTCACTT
CAAGGAACAGGCCGACTCTTTAACTTTTGAGGGGGTTAGAAGATTGTTAGAAAAGGACTTGTGTTTGGAGACATATGCTTTAGATGTGCATAAAAGATATATCAAGCAGT
GTTTGGTGAAGTGCTTAGAAGGTGTTGAGGAAGACATTGCCTCAAAGGATTCTGAGGAGACTGGGGGGAAAAGTGTAAGTAGAAATGAAGCGGCTGAGGCACTTGAAGGG
CATCAGTCCAAGAAAGGTGTCAAGGAACCTTGCTTGGAAGATGAGGAGAAAATGGAAGACTCTCCAGTTATGGGCCTCCTCACAGGACATAAAACAACAAATGTCAAATC
TGAGGGAATCAAAGACAAAGATGACAAAGATATTCCTAGTGAGAGTACAATTAAGAAGGCTATTAGAAAAAGAACTCCTTATCTTAAAGCTCATTCGGAGAAAGTTACTA
TGGCTGGAGTTCGCCGCCTTCTGGAGGATGACCTTAAACTTACTAAATATGCTCTCGATAGTTGTAAGAAGTTTATAAGCCACCAAGTAGAGGAGATATTAAATTCTTGT
GAAGCTGCTGAACAAGTTCCTAAGGAAAAGAAAAGTTCTCATTTGAAAACTTCAAAAAAGGTAAGCAAAGAAAGTTCTCATTCTACAGAAGGCAGTAGTAGCGAGGAGGA
AAATGATGAAGTAAAACCTGGAAAGAATATTGCTACTAAAGGAAGAATACAGAACTCTAATGAAATGAAAAAGCGGAAAAGATCTGCGAAGGAGACTGTCTCTGCCAAGA
AGCAAAGCAAGCATGTCCAGCATACATCAGAGGAAGATGGTGATGAAGAAGGTGGTGAAAATGTCTCTGAAGATGGCCAGTCTGAATCATCCAATGAAAAACCTGTTAAG
AAGGAAGTTTCAACTCCTGTCTATGGCAAGCGTGTGGAGCACCTGAAATCGGTTATCAAATTGTGTGGGATGAGTGTTCCTCCATCGATTTATAAGAAAGTCAAGCAGGC
ACCTGAAAGCAAACGTGAATCCCAACTTATAAAGGAGTTGGAGGGGATACTATCCAGAGAAGGATTGTCTGCTAATCCCACTGAAAAAGAAATTAAGGAAGTCAAAAAGA
AGAAGGAAAGAGCCAAAGAACTTGAAGGCATTGACGTGAGTAATATTGTCTCAAGTTCACGTAGAAGATCCTCAACTAGTTATGTAGCACCACCTCCAAAACCAGAAGTA
CCAGTTGAAACTGATGGTGATGATGGTGATGATACTGATGAGGAGGAAGAAGATGAAGAGGATGATGAAGAGGAGGATAACGACGAAGAGGATAATGGTGATGTTGATGA
AAGCCAGGGTGAAGAATTTAACGAGG
mRNA sequence
ATGGCGGAGGAGTTGCAGGACAACGACACTCCTAACGAAGAGGTGATGGATACAGCGGTTGATATAGAGACGAAGATTCAGAACGCTATGCTCTCCCGCGTCTCTCACTT
CAAGGAACAGGCCGACTCTTTAACTTTTGAGGGGGTTAGAAGATTGTTAGAAAAGGACTTGTGTTTGGAGACATATGCTTTAGATGTGCATAAAAGATATATCAAGCAGT
GTTTGGTGAAGTGCTTAGAAGGTGTTGAGGAAGACATTGCCTCAAAGGATTCTGAGGAGACTGGGGGGAAAAGTGTAAGTAGAAATGAAGCGGCTGAGGCACTTGAAGGG
CATCAGTCCAAGAAAGGTGTCAAGGAACCTTGCTTGGAAGATGAGGAGAAAATGGAAGACTCTCCAGTTATGGGCCTCCTCACAGGACATAAAACAACAAATGTCAAATC
TGAGGGAATCAAAGACAAAGATGACAAAGATATTCCTAGTGAGAGTACAATTAAGAAGGCTATTAGAAAAAGAACTCCTTATCTTAAAGCTCATTCGGAGAAAGTTACTA
TGGCTGGAGTTCGCCGCCTTCTGGAGGATGACCTTAAACTTACTAAATATGCTCTCGATAGTTGTAAGAAGTTTATAAGCCACCAAGTAGAGGAGATATTAAATTCTTGT
GAAGCTGCTGAACAAGTTCCTAAGGAAAAGAAAAGTTCTCATTTGAAAACTTCAAAAAAGGTAAGCAAAGAAAGTTCTCATTCTACAGAAGGCAGTAGTAGCGAGGAGGA
AAATGATGAAGTAAAACCTGGAAAGAATATTGCTACTAAAGGAAGAATACAGAACTCTAATGAAATGAAAAAGCGGAAAAGATCTGCGAAGGAGACTGTCTCTGCCAAGA
AGCAAAGCAAGCATGTCCAGCATACATCAGAGGAAGATGGTGATGAAGAAGGTGGTGAAAATGTCTCTGAAGATGGCCAGTCTGAATCATCCAATGAAAAACCTGTTAAG
AAGGAAGTTTCAACTCCTGTCTATGGCAAGCGTGTGGAGCACCTGAAATCGGTTATCAAATTGTGTGGGATGAGTGTTCCTCCATCGATTTATAAGAAAGTCAAGCAGGC
ACCTGAAAGCAAACGTGAATCCCAACTTATAAAGGAGTTGGAGGGGATACTATCCAGAGAAGGATTGTCTGCTAATCCCACTGAAAAAGAAATTAAGGAAGTCAAAAAGA
AGAAGGAAAGAGCCAAAGAACTTGAAGGCATTGACGTGAGTAATATTGTCTCAAGTTCACGTAGAAGATCCTCAACTAGTTATGTAGCACCACCTCCAAAACCAGAAGTA
CCAGTTGAAACTGATGGTGATGATGGTGATGATACTGATGAGGAGGAAGAAGATGAAGAGGATGATGAAGAGGAGGATAACGACGAAGAGGATAATGGTGATGTTGATGA
AAGCCAGGGTGAAGAATTTAACGAGG
Protein sequence
MAEELQDNDTPNEEVMDTAVDIETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLCLETYALDVHKRYIKQCLVKCLEGVEEDIASKDSEETGGKSVSRNEAAEALEG
HQSKKGVKEPCLEDEEKMEDSPVMGLLTGHKTTNVKSEGIKDKDDKDIPSESTIKKAIRKRTPYLKAHSEKVTMAGVRRLLEDDLKLTKYALDSCKKFISHQVEEILNSC
EAAEQVPKEKKSSHLKTSKKVSKESSHSTEGSSSEEENDEVKPGKNIATKGRIQNSNEMKKRKRSAKETVSAKKQSKHVQHTSEEDGDEEGGENVSEDGQSESSNEKPVK
KEVSTPVYGKRVEHLKSVIKLCGMSVPPSIYKKVKQAPESKRESQLIKELEGILSREGLSANPTEKEIKEVKKKKERAKELEGIDVSNIVSSSRRRSSTSYVAPPPKPEV
PVETDGDDGDDTDEEEEDEEDDEEEDNDEEDNGDVDESQGEEFNEX
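
The protein sequence corresponds to a translation of the CDS, and the CDS as listed ends in a lone `G` after the last complete codon, which lines up with the trailing `X` in the protein. A minimal sketch of such a translation is given below, assuming the standard genetic code (NCBI translation table 1); the function name `translate` and the choice to emit `X` for any incomplete or unrecognised codon are assumptions made for illustration, not a description of the database's own pipeline.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered
# TTT, TTC, TTA, TTG, TCT, ... with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 1.

    Whitespace and line breaks are ignored. Any codon that is incomplete
    or contains a non-ACGT symbol is reported as 'X' (an assumption).
    """
    cds = "".join(cds.split()).upper()
    residues = []
    for i in range(0, len(cds), 3):
        codon = cds[i:i + 3]
        residues.append(CODON_TABLE.get(codon, "X"))
    return "".join(residues)


if __name__ == "__main__":
    # First nine codons of the CDS listed above.
    print(translate("ATGGCGGAGGAGTTGCAGGACAACGAC"))
```

Pasting the full CDS into `translate` and comparing the result with the listed protein is one way to check that the two sequence blocks are consistent.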