; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0007929 (gene) of Snake gourd v1 genome

Gene IDTan0007929
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionSAGA-associated factor 11
Genome locationLG08:75125997..75128908
RNA-Seq ExpressionTan0007929
SyntenyTan0007929
Gene Ontology termsGO:0006325 - chromatin organization (biological process)
GO:0016310 - phosphorylation (biological process)
GO:0035616 - histone H2B conserved C-terminal lysine deubiquitination (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0031969 - chloroplast membrane (cellular component)
GO:0070461 - SAGA-type complex (cellular component)
GO:0071819 - DUBm complex (cellular component)
GO:0016301 - kinase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR013246 - SAGA complex, Sgf11 subunit


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0060025.1 farnesol kinase [Cucumis melo var. makuwa]3.9e-8294.77Show/hide
Query:  MSMSNEDNASSHTQLSSNFFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARLRVADSSNSSEANGKYVVDIFGQNHPPVANEIFDCMNCG
        MSM NEDNASS TQLSSN FGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQAR+RVADSSNSSEANGKYVVDIFGQ HP VANEIFDCMNCG
Subjt:  MSMSNEDNASSHTQLSSNFFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARLRVADSSNSSEANGKYVVDIFGQNHPPVANEIFDCMNCG

Query:  RSIVAGRFAPHLEKCMGRGRKARLKVTRSSTAAQSRYSRGSPVSAYSPYPNSTSTNRLPNGTSSLAGEEYSN
        RSI+AGRFAPHLEKCMGRGRKAR KVTRSSTAAQSRYSRG+PVSAYSPYPNSTSTNRLPNGTSSLAGEEYSN
Subjt:  RSIVAGRFAPHLEKCMGRGRKARLKVTRSSTAAQSRYSRGSPVSAYSPYPNSTSTNRLPNGTSSLAGEEYSN

XP_004153719.1 SAGA-associated factor 11 [Cucumis sativus]1.7e-8594.38Show/hide
Query:  MSMSNEDNASSHTQLSSNFFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARLRVADSSNSSEANGKYVVDIFGQNHPPVANEIFDCMNCG
        MSM NEDNASS TQLSSN FGDLLDSVIVD+ASECHRIARLGLDRNLEEEEEELRLSAQAR+RVADSSNSSEANGKYVVDIFGQ HP VANEIFDCMNCG
Subjt:  MSMSNEDNASSHTQLSSNFFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARLRVADSSNSSEANGKYVVDIFGQNHPPVANEIFDCMNCG

Query:  RSIVAGRFAPHLEKCMGRGRKARLKVTRSSTAAQSRYSRGSPVSAYSPYPNSTSTNRLPNGTSSLAGEEYSNGTSEDP
        RSI+AGRFAPHLEKCMGRGRKAR KVTRSSTAAQSRYSRG+PVSAYSPYPNSTSTNRLPNGTSSLAGEEYSNGTSEDP
Subjt:  RSIVAGRFAPHLEKCMGRGRKARLKVTRSSTAAQSRYSRGSPVSAYSPYPNSTSTNRLPNGTSSLAGEEYSNGTSEDP

XP_008457340.1 PREDICTED: ataxin-7-like protein 3 [Cucumis melo]1.3e-8594.94Show/hide
Query:  MSMSNEDNASSHTQLSSNFFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARLRVADSSNSSEANGKYVVDIFGQNHPPVANEIFDCMNCG
        MSM NEDNASS TQLSSN FGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQAR+RVADSSNSSEANGKYVVDIFGQ HP VANEIFDCMNCG
Subjt:  MSMSNEDNASSHTQLSSNFFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARLRVADSSNSSEANGKYVVDIFGQNHPPVANEIFDCMNCG

Query:  RSIVAGRFAPHLEKCMGRGRKARLKVTRSSTAAQSRYSRGSPVSAYSPYPNSTSTNRLPNGTSSLAGEEYSNGTSEDP
        RSI+AGRFAPHLEKCMGRGRKAR KVTRSSTAAQSRYSRG+PVSAYSPYPNSTSTNRLPNGTSSLAGEEYSNGTSEDP
Subjt:  RSIVAGRFAPHLEKCMGRGRKARLKVTRSSTAAQSRYSRGSPVSAYSPYPNSTSTNRLPNGTSSLAGEEYSNGTSEDP

XP_022150963.1 ataxin-7-like protein 3 [Momordica charantia]1.9e-8189.89Show/hide
Query:  MSMSNEDNASSHTQLSSNFFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARLRVADSSNSSEANGKYVVDIFGQNHPPVANEIFDCMNCG
        MSM NE++ASSHTQLSSNFFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQAR+RVADSSNSSEANGKY+VDIFGQ HP VANEIFDCMNCG
Subjt:  MSMSNEDNASSHTQLSSNFFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARLRVADSSNSSEANGKYVVDIFGQNHPPVANEIFDCMNCG

Query:  RSIVAGRFAPHLEKCMGRGRKARLKVTRSSTAAQSRYSRGSPVSAYSPYPNSTSTNRLPNGTSSLAGEEYSNGTSEDP
        RSI+AGRFAPHLEKCMGRGRKARLKVTRSSTAAQ RYSRGSPV AYSPY +S   NRLPNG S +AGEEYSNGTSEDP
Subjt:  RSIVAGRFAPHLEKCMGRGRKARLKVTRSSTAAQSRYSRGSPVSAYSPYPNSTSTNRLPNGTSSLAGEEYSNGTSEDP

XP_038893583.1 SAGA-associated factor 11 [Benincasa hispida]1.0e-8292.7Show/hide
Query:  MSMSNEDNASSHTQLSSNFFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARLRVADSSNSSEANGKYVVDIFGQNHPPVANEIFDCMNCG
        MSM NEDNASS TQLSSN FGDLLDSVIVD+ASECHRIARLGLDRNLEEEEEELRLSAQAR+RVADSSNSSEANGKYVVDIFGQ HP VANEIFDCMNCG
Subjt:  MSMSNEDNASSHTQLSSNFFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARLRVADSSNSSEANGKYVVDIFGQNHPPVANEIFDCMNCG

Query:  RSIVAGRFAPHLEKCMGRGRKARLKVTRSSTAAQSRYSRGSPVSAYSPYPNSTSTNRLPNGTSSLAGEEYSNGTSEDP
        RSI+AGRFAPHLEKCMGRGRKAR KVTRSSTAAQSRYSRG+PVSAYSPY NSTSTNRL NGTSSLAGEEYSNG SEDP
Subjt:  RSIVAGRFAPHLEKCMGRGRKARLKVTRSSTAAQSRYSRGSPVSAYSPYPNSTSTNRLPNGTSSLAGEEYSNGTSEDP

TrEMBL top hitse value%identityAlignment
A0A0A0LYA8 SAGA-associated factor 118.2e-8694.38Show/hide
Query:  MSMSNEDNASSHTQLSSNFFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARLRVADSSNSSEANGKYVVDIFGQNHPPVANEIFDCMNCG
        MSM NEDNASS TQLSSN FGDLLDSVIVD+ASECHRIARLGLDRNLEEEEEELRLSAQAR+RVADSSNSSEANGKYVVDIFGQ HP VANEIFDCMNCG
Subjt:  MSMSNEDNASSHTQLSSNFFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARLRVADSSNSSEANGKYVVDIFGQNHPPVANEIFDCMNCG

Query:  RSIVAGRFAPHLEKCMGRGRKARLKVTRSSTAAQSRYSRGSPVSAYSPYPNSTSTNRLPNGTSSLAGEEYSNGTSEDP
        RSI+AGRFAPHLEKCMGRGRKAR KVTRSSTAAQSRYSRG+PVSAYSPYPNSTSTNRLPNGTSSLAGEEYSNGTSEDP
Subjt:  RSIVAGRFAPHLEKCMGRGRKARLKVTRSSTAAQSRYSRGSPVSAYSPYPNSTSTNRLPNGTSSLAGEEYSNGTSEDP

A0A1S3C599 SAGA-associated factor 116.3e-8694.94Show/hide
Query:  MSMSNEDNASSHTQLSSNFFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARLRVADSSNSSEANGKYVVDIFGQNHPPVANEIFDCMNCG
        MSM NEDNASS TQLSSN FGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQAR+RVADSSNSSEANGKYVVDIFGQ HP VANEIFDCMNCG
Subjt:  MSMSNEDNASSHTQLSSNFFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARLRVADSSNSSEANGKYVVDIFGQNHPPVANEIFDCMNCG

Query:  RSIVAGRFAPHLEKCMGRGRKARLKVTRSSTAAQSRYSRGSPVSAYSPYPNSTSTNRLPNGTSSLAGEEYSNGTSEDP
        RSI+AGRFAPHLEKCMGRGRKAR KVTRSSTAAQSRYSRG+PVSAYSPYPNSTSTNRLPNGTSSLAGEEYSNGTSEDP
Subjt:  RSIVAGRFAPHLEKCMGRGRKARLKVTRSSTAAQSRYSRGSPVSAYSPYPNSTSTNRLPNGTSSLAGEEYSNGTSEDP

A0A5D3BBT0 SAGA-associated factor 111.9e-8294.77Show/hide
Query:  MSMSNEDNASSHTQLSSNFFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARLRVADSSNSSEANGKYVVDIFGQNHPPVANEIFDCMNCG
        MSM NEDNASS TQLSSN FGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQAR+RVADSSNSSEANGKYVVDIFGQ HP VANEIFDCMNCG
Subjt:  MSMSNEDNASSHTQLSSNFFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARLRVADSSNSSEANGKYVVDIFGQNHPPVANEIFDCMNCG

Query:  RSIVAGRFAPHLEKCMGRGRKARLKVTRSSTAAQSRYSRGSPVSAYSPYPNSTSTNRLPNGTSSLAGEEYSN
        RSI+AGRFAPHLEKCMGRGRKAR KVTRSSTAAQSRYSRG+PVSAYSPYPNSTSTNRLPNGTSSLAGEEYSN
Subjt:  RSIVAGRFAPHLEKCMGRGRKARLKVTRSSTAAQSRYSRGSPVSAYSPYPNSTSTNRLPNGTSSLAGEEYSN

A0A6J1DBL8 SAGA-associated factor 119.4e-8289.89Show/hide
Query:  MSMSNEDNASSHTQLSSNFFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARLRVADSSNSSEANGKYVVDIFGQNHPPVANEIFDCMNCG
        MSM NE++ASSHTQLSSNFFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQAR+RVADSSNSSEANGKY+VDIFGQ HP VANEIFDCMNCG
Subjt:  MSMSNEDNASSHTQLSSNFFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARLRVADSSNSSEANGKYVVDIFGQNHPPVANEIFDCMNCG

Query:  RSIVAGRFAPHLEKCMGRGRKARLKVTRSSTAAQSRYSRGSPVSAYSPYPNSTSTNRLPNGTSSLAGEEYSNGTSEDP
        RSI+AGRFAPHLEKCMGRGRKARLKVTRSSTAAQ RYSRGSPV AYSPY +S   NRLPNG S +AGEEYSNGTSEDP
Subjt:  RSIVAGRFAPHLEKCMGRGRKARLKVTRSSTAAQSRYSRGSPVSAYSPYPNSTSTNRLPNGTSSLAGEEYSNGTSEDP

A0A6J1I624 SAGA-associated factor 111.2e-8192.13Show/hide
Query:  MSMSNEDNASSHTQLSSNFFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARLRVADSSNSSEANGKYVVDIFGQNHPPVANEIFDCMNCG
        MSM +ED+ASSHTQLS N FGDLLDSVI DVASECHRIARLGLDRNLEEEEEELRLSAQAR RVADS NSSEANGKYVVDIFGQ HP VANEIFDCMNCG
Subjt:  MSMSNEDNASSHTQLSSNFFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARLRVADSSNSSEANGKYVVDIFGQNHPPVANEIFDCMNCG

Query:  RSIVAGRFAPHLEKCMGRGRKARLKVTRSSTAAQSRYSRGSPVSAYSPYPNSTSTNRLPNGTSSLAGEEYSNGTSEDP
        RSI+AGRFAPHLEKCMGRGRKAR KVTRSSTAAQSRYSRG+PVS YSPYPNSTSTNRLPNGTSSLAGEEYSNGTSEDP
Subjt:  RSIVAGRFAPHLEKCMGRGRKARLKVTRSSTAAQSRYSRGSPVSAYSPYPNSTSTNRLPNGTSSLAGEEYSNGTSEDP

SwissProt top hitse value%identityAlignment
A1L209 Ataxin-7-like protein 31.7e-0831.65Show/hide
Query:  MSMSNEDNASSHTQLSSNFFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARLRVADSSNSSEANGKYVVDIFGQNHPPVANEIFDCMNCG
        MS+S  DN      L+ + + DL++   + +  E HR  + G     E ++E ++        + D            VDIFGQ +    N+   C NC 
Subjt:  MSMSNEDNASSHTQLSSNFFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARLRVADSSNSSEANGKYVVDIFGQNHPPVANEIFDCMNCG

Query:  RSIVAGRFAPHLEKCMGRGRK----ARLKVTRSSTAAQS
        RSI A RFAPHLEKC+G GR     A  ++  S+  ++S
Subjt:  RSIVAGRFAPHLEKCMGRGRK----ARLKVTRSSTAAQS

A2AWT3 Ataxin-7-like protein 31.6e-0632.79Show/hide
Query:  MSMSNEDNASSHTQLSSNFFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARLRVADSSNSSEANGKYVVDIFGQNHPPVANEIFDCMNCG
        MS+S  DN S    ++   + DL++   +    E HR  + G    L++ +              DS    E   +  +DIFGQ      ++   C NC 
Subjt:  MSMSNEDNASSHTQLSSNFFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARLRVADSSNSSEANGKYVVDIFGQNHPPVANEIFDCMNCG

Query:  RSIVAGRFAPHLEKCMGRGRKA
        RSI A RFAPHLEKC+G GR +
Subjt:  RSIVAGRFAPHLEKCMGRGRKA

B1PM81 SAGA-associated factor 11 homolog1.6e-0630.77Show/hide
Query:  QLSSNFFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARLRVADSSNSSEANGKYVVDIFGQNHPPVANEIFDCM--NCGRSIVAGRFAPH
        + ++  +  LLD  +V V  E H + + G   NL   +      ++   R+ D  N          DIFG +    A +  DC   NC R + A RFAPH
Subjt:  QLSSNFFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARLRVADSSNSSEANGKYVVDIFGQNHPPVANEIFDCM--NCGRSIVAGRFAPH

Query:  LEKCMGRGRKARLKVTRSSTAAQSRYSRGSPVSAYSPYPNSTS
        LEKCMG GR +    +R     +S  +  S  S+Y    N+ S
Subjt:  LEKCMGRGRKARLKVTRSSTAAQSRYSRGSPVSAYSPYPNSTS

Q14CW9 Ataxin-7-like protein 31.6e-0632.79Show/hide
Query:  MSMSNEDNASSHTQLSSNFFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARLRVADSSNSSEANGKYVVDIFGQNHPPVANEIFDCMNCG
        MS+S  DN S    ++   + DL++   +    E HR  + G    L++ +              DS    E   +  +DIFGQ      ++   C NC 
Subjt:  MSMSNEDNASSHTQLSSNFFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARLRVADSSNSSEANGKYVVDIFGQNHPPVANEIFDCMNCG

Query:  RSIVAGRFAPHLEKCMGRGRKA
        RSI A RFAPHLEKC+G GR +
Subjt:  RSIVAGRFAPHLEKCMGRGRKA

Q94BV2 SAGA-associated factor 118.8e-6170.41Show/hide
Query:  EDNASSHTQLSSNFFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARLRVADSSNSSEANGKYVVDIFGQNHPPVANEIFDCMNCGRSIVA
        EDN SSH QLSS  F DL+DSVI DVASECHR+ARLGLDR+L+  EEELRLS +AR ++AD SN+ E N KYVVDIFGQ HPPVA+E+F+CMNCGR IVA
Subjt:  EDNASSHTQLSSNFFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARLRVADSSNSSEANGKYVVDIFGQNHPPVANEIFDCMNCGRSIVA

Query:  GRFAPHLEKCMGRGRKARLKVTRSSTAAQSRYSRGSPVSAYSPYPNSTSTNRLPNGTSSLAGEEYSNGT
        GRFAPHLEKCMG+GRKAR K TRS+TAAQ+R +R SP   YSPYPNS S N+L +G+  +AGE+ SN T
Subjt:  GRFAPHLEKCMGRGRKARLKVTRSSTAAQSRYSRGSPVSAYSPYPNSTSTNRLPNGTSSLAGEEYSNGT

Arabidopsis top hitse value%identityAlignment
AT5G58575.1 CONTAINS InterPro DOMAIN/s: Sgf11, transcriptional regulation (InterPro:IPR013246); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink).6.3e-6270.41Show/hide
Query:  EDNASSHTQLSSNFFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARLRVADSSNSSEANGKYVVDIFGQNHPPVANEIFDCMNCGRSIVA
        EDN SSH QLSS  F DL+DSVI DVASECHR+ARLGLDR+L+  EEELRLS +AR ++AD SN+ E N KYVVDIFGQ HPPVA+E+F+CMNCGR IVA
Subjt:  EDNASSHTQLSSNFFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARLRVADSSNSSEANGKYVVDIFGQNHPPVANEIFDCMNCGRSIVA

Query:  GRFAPHLEKCMGRGRKARLKVTRSSTAAQSRYSRGSPVSAYSPYPNSTSTNRLPNGTSSLAGEEYSNGT
        GRFAPHLEKCMG+GRKAR K TRS+TAAQ+R +R SP   YSPYPNS S N+L +G+  +AGE+ SN T
Subjt:  GRFAPHLEKCMGRGRKARLKVTRSSTAAQSRYSRGSPVSAYSPYPNSTSTNRLPNGTSSLAGEEYSNGT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCAATGTCTAATGAGGACAATGCATCTTCGCATACTCAGCTTTCATCTAATTTCTTTGGGGATCTCCTGGATTCCGTGATTGTTGATGTTGCATCAGAATGCCATCG
AATAGCAAGGTTAGGTCTTGATCGTAACTTAGAAGAGGAAGAAGAAGAATTAAGACTTTCAGCACAGGCACGACTAAGAGTAGCTGATTCTAGCAATAGTAGTGAAGCAA
ACGGCAAATATGTAGTTGATATTTTTGGACAAAATCATCCTCCTGTTGCAAACGAAATTTTTGATTGCATGAATTGTGGTCGATCAATCGTGGCTGGGAGATTTGCCCCT
CATTTAGAGAAATGCATGGGAAGGGGTAGAAAGGCTCGTCTCAAAGTAACAAGAAGTAGTACAGCTGCACAGAGCCGGTATTCACGAGGCAGTCCTGTTTCTGCATATTC
CCCTTATCCTAATTCCACCAGCACAAATCGCTTACCTAATGGAACATCTAGTCTTGCAGGGGAGGAGTACTCAAACGGTACATCTGAAGATCCATGA
mRNA sequenceShow/hide mRNA sequence
CTTCTTTTTTCTTCCTAGTTCTTCCCTCATTTTGCAGTCTTGTCGGATCCCAGTAGCTTGGATTTTAGACTCTATCCATCTTTACTGCGAATTCGATTTCATATCAGACT
GTATTCAAACTCGTTCTGCTTCTAGATCCATGTCAATGTCTAATGAGGACAATGCATCTTCGCATACTCAGCTTTCATCTAATTTCTTTGGGGATCTCCTGGATTCCGTG
ATTGTTGATGTTGCATCAGAATGCCATCGAATAGCAAGGTTAGGTCTTGATCGTAACTTAGAAGAGGAAGAAGAAGAATTAAGACTTTCAGCACAGGCACGACTAAGAGT
AGCTGATTCTAGCAATAGTAGTGAAGCAAACGGCAAATATGTAGTTGATATTTTTGGACAAAATCATCCTCCTGTTGCAAACGAAATTTTTGATTGCATGAATTGTGGTC
GATCAATCGTGGCTGGGAGATTTGCCCCTCATTTAGAGAAATGCATGGGAAGGGGTAGAAAGGCTCGTCTCAAAGTAACAAGAAGTAGTACAGCTGCACAGAGCCGGTAT
TCACGAGGCAGTCCTGTTTCTGCATATTCCCCTTATCCTAATTCCACCAGCACAAATCGCTTACCTAATGGAACATCTAGTCTTGCAGGGGAGGAGTACTCAAACGGTAC
ATCTGAAGATCCATGAACAACAAAGCAAGGGCCGAGTTATTTAAATTTAGGACAACAAAAATTTAGGACAACAAACTGTAATCTAAATGTGCCCTCAGGAATTTAATGAT
ATCTTCTGTCATGTTCAGTTTTATTTTACATGACTTCTTTGTATGGGTTATCATTACATGGGAAAATATATATCATGTCTGGAATCCGGATGTATTATGAGTTCATTACT
TTCGACTGTTCCAAATTTTGATTCTGAAGATCTTGTACAGAAGTTTGAAGAAAGGAAA
Protein sequenceShow/hide protein sequence
MSMSNEDNASSHTQLSSNFFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARLRVADSSNSSEANGKYVVDIFGQNHPPVANEIFDCMNCGRSIVAGRFAP
HLEKCMGRGRKARLKVTRSSTAAQSRYSRGSPVSAYSPYPNSTSTNRLPNGTSSLAGEEYSNGTSEDP