; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC03g0064 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC03g0064
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionSAGA-associated factor 11
Genome locationMC03:939992..947939
RNA-Seq ExpressionMC03g0064
SyntenyMC03g0064
Gene Ontology termsGO:0006325 - chromatin organization (biological process)
GO:0016310 - phosphorylation (biological process)
GO:0035616 - histone H2B conserved C-terminal lysine deubiquitination (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0031969 - chloroplast membrane (cellular component)
GO:0070461 - SAGA-type complex (cellular component)
GO:0071819 - DUBm complex (cellular component)
GO:0016301 - kinase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR013246 - SAGA complex, Sgf11 subunit


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7019118.1 Farnesol kinase, chloroplastic [Cucurbita argyrosperma subsp. argyrosperma]4.33e-10387.3Show/hide
Query:  AYFGTRPASRSMSMPNEEHASSHTQLSSNFFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYIVDIFGQTHPSV
        AY  TRPASRSMSMP+E+ ASSHTQLS N FGDLLDSVI DVASECHRIARLGLDRNLEEEEEELRLSAQAR RVADS NSSEANGKY+VDIFGQTHPSV
Subjt:  AYFGTRPASRSMSMPNEEHASSHTQLSSNFFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYIVDIFGQTHPSV

Query:  ANEIFDCMNCGRSIMAGRFAPHLEKCMGRGRKARLKVTRSSTAAQGRYSRGSPVPAYSPYTSSPGANRLPNGMSGVAGEEYSNGTSEDP
        ANEIFDCMNCGRSIMAGRFAPHLEKCMGRGRKAR KVTRSSTAAQ RYSRG+PV +YSPY +S   NRLPNG S +AGEEYSNGTSEDP
Subjt:  ANEIFDCMNCGRSIMAGRFAPHLEKCMGRGRKARLKVTRSSTAAQGRYSRGSPVPAYSPYTSSPGANRLPNGMSGVAGEEYSNGTSEDP

XP_004153719.1 SAGA-associated factor 11 [Cucumis sativus]5.30e-10689.89Show/hide
Query:  MSMPNEEHASSHTQLSSNFFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYIVDIFGQTHPSVANEIFDCMNCG
        MSMPNE++ASS TQLSSN FGDLLDSVIVD+ASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKY+VDIFGQTHPSVANEIFDCMNCG
Subjt:  MSMPNEEHASSHTQLSSNFFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYIVDIFGQTHPSVANEIFDCMNCG

Query:  RSIMAGRFAPHLEKCMGRGRKARLKVTRSSTAAQGRYSRGSPVPAYSPYTSSPGANRLPNGMSGVAGEEYSNGTSEDP
        RSIMAGRFAPHLEKCMGRGRKAR KVTRSSTAAQ RYSRG+PV AYSPY +S   NRLPNG S +AGEEYSNGTSEDP
Subjt:  RSIMAGRFAPHLEKCMGRGRKARLKVTRSSTAAQGRYSRGSPVPAYSPYTSSPGANRLPNGMSGVAGEEYSNGTSEDP

XP_008457340.1 PREDICTED: ataxin-7-like protein 3 [Cucumis melo]3.73e-10690.45Show/hide
Query:  MSMPNEEHASSHTQLSSNFFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYIVDIFGQTHPSVANEIFDCMNCG
        MSMPNE++ASS TQLSSN FGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKY+VDIFGQTHPSVANEIFDCMNCG
Subjt:  MSMPNEEHASSHTQLSSNFFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYIVDIFGQTHPSVANEIFDCMNCG

Query:  RSIMAGRFAPHLEKCMGRGRKARLKVTRSSTAAQGRYSRGSPVPAYSPYTSSPGANRLPNGMSGVAGEEYSNGTSEDP
        RSIMAGRFAPHLEKCMGRGRKAR KVTRSSTAAQ RYSRG+PV AYSPY +S   NRLPNG S +AGEEYSNGTSEDP
Subjt:  RSIMAGRFAPHLEKCMGRGRKARLKVTRSSTAAQGRYSRGSPVPAYSPYTSSPGANRLPNGMSGVAGEEYSNGTSEDP

XP_022150963.1 ataxin-7-like protein 3 [Momordica charantia]5.26e-120100Show/hide
Query:  MSMPNEEHASSHTQLSSNFFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYIVDIFGQTHPSVANEIFDCMNCG
        MSMPNEEHASSHTQLSSNFFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYIVDIFGQTHPSVANEIFDCMNCG
Subjt:  MSMPNEEHASSHTQLSSNFFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYIVDIFGQTHPSVANEIFDCMNCG

Query:  RSIMAGRFAPHLEKCMGRGRKARLKVTRSSTAAQGRYSRGSPVPAYSPYTSSPGANRLPNGMSGVAGEEYSNGTSEDP
        RSIMAGRFAPHLEKCMGRGRKARLKVTRSSTAAQGRYSRGSPVPAYSPYTSSPGANRLPNGMSGVAGEEYSNGTSEDP
Subjt:  RSIMAGRFAPHLEKCMGRGRKARLKVTRSSTAAQGRYSRGSPVPAYSPYTSSPGANRLPNGMSGVAGEEYSNGTSEDP

XP_038893583.1 SAGA-associated factor 11 [Benincasa hispida]7.14e-10488.76Show/hide
Query:  MSMPNEEHASSHTQLSSNFFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYIVDIFGQTHPSVANEIFDCMNCG
        MSMPNE++ASS TQLSSN FGDLLDSVIVD+ASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKY+VDIFGQTHPSVANEIFDCMNCG
Subjt:  MSMPNEEHASSHTQLSSNFFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYIVDIFGQTHPSVANEIFDCMNCG

Query:  RSIMAGRFAPHLEKCMGRGRKARLKVTRSSTAAQGRYSRGSPVPAYSPYTSSPGANRLPNGMSGVAGEEYSNGTSEDP
        RSIMAGRFAPHLEKCMGRGRKAR KVTRSSTAAQ RYSRG+PV AYSPY++S   NRL NG S +AGEEYSNG SEDP
Subjt:  RSIMAGRFAPHLEKCMGRGRKARLKVTRSSTAAQGRYSRGSPVPAYSPYTSSPGANRLPNGMSGVAGEEYSNGTSEDP

TrEMBL top hitse value%identityAlignment
A0A0A0LYA8 SAGA-associated factor 112.56e-10689.89Show/hide
Query:  MSMPNEEHASSHTQLSSNFFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYIVDIFGQTHPSVANEIFDCMNCG
        MSMPNE++ASS TQLSSN FGDLLDSVIVD+ASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKY+VDIFGQTHPSVANEIFDCMNCG
Subjt:  MSMPNEEHASSHTQLSSNFFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYIVDIFGQTHPSVANEIFDCMNCG

Query:  RSIMAGRFAPHLEKCMGRGRKARLKVTRSSTAAQGRYSRGSPVPAYSPYTSSPGANRLPNGMSGVAGEEYSNGTSEDP
        RSIMAGRFAPHLEKCMGRGRKAR KVTRSSTAAQ RYSRG+PV AYSPY +S   NRLPNG S +AGEEYSNGTSEDP
Subjt:  RSIMAGRFAPHLEKCMGRGRKARLKVTRSSTAAQGRYSRGSPVPAYSPYTSSPGANRLPNGMSGVAGEEYSNGTSEDP

A0A1S3C599 SAGA-associated factor 111.81e-10690.45Show/hide
Query:  MSMPNEEHASSHTQLSSNFFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYIVDIFGQTHPSVANEIFDCMNCG
        MSMPNE++ASS TQLSSN FGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKY+VDIFGQTHPSVANEIFDCMNCG
Subjt:  MSMPNEEHASSHTQLSSNFFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYIVDIFGQTHPSVANEIFDCMNCG

Query:  RSIMAGRFAPHLEKCMGRGRKARLKVTRSSTAAQGRYSRGSPVPAYSPYTSSPGANRLPNGMSGVAGEEYSNGTSEDP
        RSIMAGRFAPHLEKCMGRGRKAR KVTRSSTAAQ RYSRG+PV AYSPY +S   NRLPNG S +AGEEYSNGTSEDP
Subjt:  RSIMAGRFAPHLEKCMGRGRKARLKVTRSSTAAQGRYSRGSPVPAYSPYTSSPGANRLPNGMSGVAGEEYSNGTSEDP

A0A5D3BBT0 SAGA-associated factor 111.03e-10188.52Show/hide
Query:  AYFGTRPASRSMSMPNEEHASSHTQLSSNFFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYIVDIFGQTHPSV
        AY  T  ASRSMSMPNE++ASS TQLSSN FGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKY+VDIFGQTHPSV
Subjt:  AYFGTRPASRSMSMPNEEHASSHTQLSSNFFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYIVDIFGQTHPSV

Query:  ANEIFDCMNCGRSIMAGRFAPHLEKCMGRGRKARLKVTRSSTAAQGRYSRGSPVPAYSPYTSSPGANRLPNGMSGVAGEEYSN
        ANEIFDCMNCGRSIMAGRFAPHLEKCMGRGRKAR KVTRSSTAAQ RYSRG+PV AYSPY +S   NRLPNG S +AGEEYSN
Subjt:  ANEIFDCMNCGRSIMAGRFAPHLEKCMGRGRKARLKVTRSSTAAQGRYSRGSPVPAYSPYTSSPGANRLPNGMSGVAGEEYSN

A0A6J1DBL8 SAGA-associated factor 112.55e-120100Show/hide
Query:  MSMPNEEHASSHTQLSSNFFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYIVDIFGQTHPSVANEIFDCMNCG
        MSMPNEEHASSHTQLSSNFFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYIVDIFGQTHPSVANEIFDCMNCG
Subjt:  MSMPNEEHASSHTQLSSNFFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYIVDIFGQTHPSVANEIFDCMNCG

Query:  RSIMAGRFAPHLEKCMGRGRKARLKVTRSSTAAQGRYSRGSPVPAYSPYTSSPGANRLPNGMSGVAGEEYSNGTSEDP
        RSIMAGRFAPHLEKCMGRGRKARLKVTRSSTAAQGRYSRGSPVPAYSPYTSSPGANRLPNGMSGVAGEEYSNGTSEDP
Subjt:  RSIMAGRFAPHLEKCMGRGRKARLKVTRSSTAAQGRYSRGSPVPAYSPYTSSPGANRLPNGMSGVAGEEYSNGTSEDP

A0A6J1I624 SAGA-associated factor 111.83e-10187.64Show/hide
Query:  MSMPNEEHASSHTQLSSNFFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYIVDIFGQTHPSVANEIFDCMNCG
        MSMP+E+ ASSHTQLS N FGDLLDSVI DVASECHRIARLGLDRNLEEEEEELRLSAQAR RVADS NSSEANGKY+VDIFGQTHPSVANEIFDCMNCG
Subjt:  MSMPNEEHASSHTQLSSNFFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYIVDIFGQTHPSVANEIFDCMNCG

Query:  RSIMAGRFAPHLEKCMGRGRKARLKVTRSSTAAQGRYSRGSPVPAYSPYTSSPGANRLPNGMSGVAGEEYSNGTSEDP
        RSIMAGRFAPHLEKCMGRGRKAR KVTRSSTAAQ RYSRG+PV +YSPY +S   NRLPNG S +AGEEYSNGTSEDP
Subjt:  RSIMAGRFAPHLEKCMGRGRKARLKVTRSSTAAQGRYSRGSPVPAYSPYTSSPGANRLPNGMSGVAGEEYSNGTSEDP

SwissProt top hitse value%identityAlignment
A1L209 Ataxin-7-like protein 36.0e-0833.33Show/hide
Query:  LSSNFFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYIVDIFGQTHPSVANEIFDCMNCGRSIMAGRFAPHLEK
        L+ + + DL++   + +  E HR  + G     E ++E ++        + D            VDIFGQ +    N+   C NC RSI A RFAPHLEK
Subjt:  LSSNFFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYIVDIFGQTHPSVANEIFDCMNCGRSIMAGRFAPHLEK

Query:  CMGRGRKA
        C+G GR +
Subjt:  CMGRGRKA

B4KY72 SAGA-associated factor 11 homolog1.1e-0632.56Show/hide
Query:  FGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYIVDIFGQTHPSVANEIFDCM--NCGRSIMAGRFAPHLEKCMG
        F  LLD V+  +  E H + + G    L+   EE   +A++  R+ +  N         +DIFG    S A +  DC   +C R + A RFAPHLEKCMG
Subjt:  FGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYIVDIFGQTHPSVANEIFDCM--NCGRSIMAGRFAPHLEKCMG

Query:  RGRKARLKVTRSSTAAQGRYSRGSPVPAY
         GR +    +R     +G  +  S    Y
Subjt:  RGRKARLKVTRSSTAAQGRYSRGSPVPAY

B4LDA6 SAGA-associated factor 11 homolog1.1e-0632.56Show/hide
Query:  FGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYIVDIFGQTHPSVANEIFDCM--NCGRSIMAGRFAPHLEKCMG
        F  LLD V+  +  E H + + G    L+   EE   +A++  R+ +  N         +DIFG    S A +  DC   NC R + A RFAPHLEKCMG
Subjt:  FGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYIVDIFGQTHPSVANEIFDCM--NCGRSIMAGRFAPHLEKCMG

Query:  RGRKARLKVTRSSTAAQGRYSRGSPVPAY
         GR +    +R     +G  +  +    Y
Subjt:  RGRKARLKVTRSSTAAQGRYSRGSPVPAY

Q7PXG4 SAGA-associated factor 11 homolog5.6e-0628Show/hide
Query:  RPASRSMSMPNEEHASSHTQLSSNFFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYIVDIFGQTHPSVANEIF
        R   R MS P      +  + ++  +  L+D  I+ +A E H   + G    +E E E+ +        + D  ++         D+FG ++   A +  
Subjt:  RPASRSMSMPNEEHASSHTQLSSNFFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYIVDIFGQTHPSVANEIF

Query:  DCMNCGRSIMAGRFAPHLEKCMGRGRK----ARLKVTRSSTAAQGRYSRG
         C NC R + A RFAPHLEKCMG GR     A  ++  +     G Y  G
Subjt:  DCMNCGRSIMAGRFAPHLEKCMGRGRK----ARLKVTRSSTAAQGRYSRG

Q94BV2 SAGA-associated factor 115.7e-5968.05Show/hide
Query:  EEHASSHTQLSSNFFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYIVDIFGQTHPSVANEIFDCMNCGRSIMA
        E++ SSH QLSS  F DL+DSVI DVASECHR+ARLGLDR+L+  EEELRLS +AR ++AD SN+ E N KY+VDIFGQTHP VA+E+F+CMNCGR I+A
Subjt:  EEHASSHTQLSSNFFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYIVDIFGQTHPSVANEIFDCMNCGRSIMA

Query:  GRFAPHLEKCMGRGRKARLKVTRSSTAAQGRYSRGSPVPAYSPYTSSPGANRLPNGMSGVAGEEYSNGT
        GRFAPHLEKCMG+GRKAR K TRS+TAAQ R +R SP P YSPY +S   N+L +G  GVAGE+ SN T
Subjt:  GRFAPHLEKCMGRGRKARLKVTRSSTAAQGRYSRGSPVPAYSPYTSSPGANRLPNGMSGVAGEEYSNGT

Arabidopsis top hitse value%identityAlignment
AT5G58575.1 CONTAINS InterPro DOMAIN/s: Sgf11, transcriptional regulation (InterPro:IPR013246); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink).4.1e-6068.05Show/hide
Query:  EEHASSHTQLSSNFFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYIVDIFGQTHPSVANEIFDCMNCGRSIMA
        E++ SSH QLSS  F DL+DSVI DVASECHR+ARLGLDR+L+  EEELRLS +AR ++AD SN+ E N KY+VDIFGQTHP VA+E+F+CMNCGR I+A
Subjt:  EEHASSHTQLSSNFFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYIVDIFGQTHPSVANEIFDCMNCGRSIMA

Query:  GRFAPHLEKCMGRGRKARLKVTRSSTAAQGRYSRGSPVPAYSPYTSSPGANRLPNGMSGVAGEEYSNGT
        GRFAPHLEKCMG+GRKAR K TRS+TAAQ R +R SP P YSPY +S   N+L +G  GVAGE+ SN T
Subjt:  GRFAPHLEKCMGRGRKARLKVTRSSTAAQGRYSRGSPVPAYSPYTSSPGANRLPNGMSGVAGEEYSNGT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
TGGCTTTCACCCAGGAGACCCGGGTTCGATTCCCGGCAACGGAAACTTTTTTTTTTTTTTTCAATTTTTTTCGATTTTTCACACTTTTTACTGTTCAACATTTTCAGTGC
CCTCTTCTTCTCCTCGATGAACCTATTTTTTTTTTCTTCATTCCTTCTCTCCCCCCTCTTTTTGCAGTCTTCACAAAGCGCGGTTGGATCCAAGTTGTGTGGATTCGTAG
TCTCTATCGATTTTTGCTACGAATTCAATTTCATTCCAGCCTATTTTGGAACTCGTCCTGCTTCTAGATCCATGTCAATGCCTAATGAGGAACATGCGTCTTCGCATACC
CAGCTTTCATCTAATTTCTTTGGGGATCTCCTGGATTCCGTGATTGTTGATGTTGCATCGGAGTGTCATCGAATAGCAAGGTTAGGTCTTGATCGAAACTTAGAAGAGGA
AGAAGAAGAATTAAGACTTTCAGCACAGGCACGAGTAAGAGTAGCTGATTCTAGCAATAGTAGTGAAGCAAACGGCAAATATATAGTTGATATTTTTGGACAAACTCACC
CTTCTGTTGCGAATGAAATATTTGATTGTATGAATTGTGGTCGATCAATCATGGCTGGGAGATTTGCCCCTCATTTAGAGAAGTGCATGGGAAGGGGTAGAAAGGCTCGT
CTCAAAGTAACAAGAAGTAGTACAGCTGCACAGGGCCGGTATTCTCGAGGCAGTCCTGTTCCTGCATATTCCCCTTACACTAGTTCCCCTGGCGCAAACCGGTTACCTAA
TGGAATGTCCGGTGTTGCAGGCGAGGAGTACTCAAACGGTACATCCGAAGACCCATGA
mRNA sequenceShow/hide mRNA sequence
CTGGCTTTCACCCAGGAGACCCGGGTTCGATTCCCGGCAACGGAAACTTTTTTTTTTTTTTTCAATTTTTTTCGATTTTTCACACTTTTTACTGTTCAACATTTTCAGTG
CCCTCTTCTTCTCCTCGATGAACCTATTTTTTTTTTCTTCATTCCTTCTCTCCCCCCTCTTTTTGCAGTCTTCACAAAGCGCGGTTGGATCCAAGTTGTGTGGATTCGTA
GTCTCTATCGATTTTTGCTACGAATTCAATTTCATTCCAGCCTATTTTGGAACTCGTCCTGCTTCTAGATCCATGTCAATGCCTAATGAGGAACATGCGTCTTCGCATAC
CCAGCTTTCATCTAATTTCTTTGGGGATCTCCTGGATTCCGTGATTGTTGATGTTGCATCGGAGTGTCATCGAATAGCAAGGTTAGGTCTTGATCGAAACTTAGAAGAGG
AAGAAGAAGAATTAAGACTTTCAGCACAGGCACGAGTAAGAGTAGCTGATTCTAGCAATAGTAGTGAAGCAAACGGCAAATATATAGTTGATATTTTTGGACAAACTCAC
CCTTCTGTTGCGAATGAAATATTTGATTGTATGAATTGTGGTCGATCAATCATGGCTGGGAGATTTGCCCCTCATTTAGAGAAGTGCATGGGAAGGGGTAGAAAGGCTCG
TCTCAAAGTAACAAGAAGTAGTACAGCTGCACAGGGCCGGTATTCTCGAGGCAGTCCTGTTCCTGCATATTCCCCTTACACTAGTTCCCCTGGCGCAAACCGGTTACCTA
ATGGAATGTCCGGTGTTGCAGGCGAGGAGTACTCAAACGGTACATCCGAAGACCCATGAACAACAAAGCAATGACCGAATTATTTCATTTTAGGAAAACATACTGTAATC
TAAATTTGCTTGCAGGAATCTGATGATATCTCCTGTCATGTTGAGTTTTATATTACATGACTTCTTTGTATGGGTTATCATTACATGGGAAGAATATGTCTGGAATCTGG
ATGTATTATGAGTTCATTACTTTTCTACTGTTCTAAATTTTGATGCTGAAGATCTTGTACAGAAGAAGTGCAAAAGGAAGTTAAAAATGCTAAATCTAGTTGCATATCTT
CAATCTTGAAATGTAATTAGAAAGTCTGAGTTATTCTTCATGGCAACATACCTAAAGGATTCGTATTATATGATTTTTCTTCAATATTTCAAATATTGAAAGATTAAGTA
GTTCTTATGGATCAGTTAGAAGTCGCTCTATCACCAAATATTTCAAATATTGTAATTTGTTTCCATAAAACTCCGAGTCTTTGTCATATTAGATTCTCTGGAAGGGGCCA
CAAAGATTATAAAGGTAGTTTAAACTCTTGCTGCGTTTATTGTCTGACAGTTCCCAAGATATTCAGCTCTTTTGCTAGAAATTCTTTTAACAATTTTATGAAGGCAAATT
GCAGGAAGGTGGTCCAAAAGGTTGCCCAATCTATGCTCAACCACTGAAAAATAGAATGTTAGTTAAGTCTGCATCAGTGGTCTCCACGCTCCACTTCCATTTGCTCAAAG
ATTTCAAGTCTGATTACAGGGACCATAGCGCAGTGCTTGGAAGCTTATGAGACCTGAATGGCTGATTTCTTCTATGTGGAAATTGAACAAGAATTGTGCTCAGATACGTC
TCTCTCACACAAACCATTCTTATTTCTGGCATGTAATTGTTTTTAATTTTCGACTGCCAAACTTTTCAATTCTTAGGCTAATAGCAAAGAAAGTGTTTGATAATGGCTTC
CATAAACTTACCCACTTGTGTTTAGATGTATGAGAGAAATTGCAAAATACTTTTGATTATTCATCCAAGTGTCATAAAAGTGGCGGCTCCCCCAAACATATTTTCAAAAA
CGGGAGCTTAAATACAAGTTCTAATGATTAATTTCTTTAATTGATATAAACCACAAATTCACCACTAATGAAGTGCAGGTGCTCCACTTAGCAAAAGTAGTTTCTTTGGG
AGATTTGGAGGAAAAAAGGGTCCTCTATCTGCGCTTCTCTCTCTCTCGAATTAGCAGGATTAAACACATCATCAAACACCAAAGGGCATGACAGCAATTTCCAATGTGGA
ATCTCTTCTCTCAGTGCGTCTTCTCCTCCAATCTCTGTAACTATCACTTTACAGTCCCTTGCCTCCGCCGCCATGTTGTGCACGTGCTGGCACAGTGCCCTCACCAGCTT
CCCTGATGTCGATGCTGCCCCCTCCCGGTGAACCCCATACATGAAGTAGAATCCGAACGGCTCGTAGAAGTCCGGTATTGAGGGAAGTTTCAGACATGGGAAGATCTTGT
CCATCACCTTCGAGCTCTCTGTGTATATCAAGCATGACAATGGTGCCTTTCCCAATCGCAGTTTGAACACCTGGATTGACAAAAGATTAAAGTTATCATACCCTACTAAC
TTAAGCTTATGTGATTAATTGTTAAGTGGTGGTTTAATCTACCTCCCCACTATTCCATACACTCAGCATTGCCCAGCTCTTTGGAATTGCCATTTCTGATCCGCTGCCAG
TTGTTTCGAATTTATCGTCTTTGTAATAAGCCACCCATGTTCCAAGGCTGAGCTTGTGTTTGAGCACGCGGTCGATGTCATGGGGAAAGAACTCGGTGGAGGCCATGTAT
TTTCCGTAGAGAAACTCGGCCACATCGACTTTCAGGCGGGAAATTTGGATGTTGGAAGGGAGGTTATAAGGGCGGTAATGTTTCACCGGGTTGACGAGGATCGCCGGAAC
TCTAAAATTGGTGTACCCGAGCTTGTTGATGAAGAGCTTGACGGAGGCTTCATTGTCTTTCTCTGTCGCCATATAGGTGTAGTCCACGTCGTTGGCAGCAAACCACTCCT
CCAATCGTCGGACAAGGCTGCAGCCAATCCCTCGGCGGCGGAACATCGGAACAACGCGAAGGCCTAAAACATACCCAACCTTGGCCCGGCCCTTGGGCGCTTGGTGAACC
GTAACGATCTTTATCGAGCCCTGAATTACTCCAACTATTTGGTTATCCACCTCTGCCACCTAATGAAAACGAACTCAAAAGAGTTAAGTTGATAGGTTAGGGTATATTTA
ACACGTAACGGATGGCAAGAGGAAGAGGAATACCAGCATCTTGTATAAGGGACTATTTCTGATCCTACAAATGGGGTCACCCATGGTGTCTGTGAAGAGAAAAACTCGTT
CAGATGGGCCTACCTCGCATCTTCTCTCTAGATCTTCCACTCTAGCTCTATCTGCAGATTGCCCATCGTAGCTTCTTATTATCAAAATCTCTTCTCCGTATCCCATTTCT
TTAAAAGAAAAATCTCCCCACTTTTCGTGGGAAACAAACAGAAAAAACAGAAAAGAAGAAAAAAAAAAAAACAGCAGTTCTCTAGCTATTACATATATATACACACACTG
CAAGTTCAATCAACCCTTTTAAAGGTATATTGCTAGCAGCTAGCTACTTAGCTTTCTCTTTTGTGGAGGGGGAAAAAAACTTGGGGGGCTTGGAAATTTTGGAGTCTGAT
GATTGGCATTTTAT
Protein sequenceShow/hide protein sequence
WLSPRRPGFDSRQRKLFFFFSIFFDFSHFLLFNIFSALFFSSMNLFFFSSFLLSPLFLQSSQSAVGSKLCGFVVSIDFCYEFNFIPAYFGTRPASRSMSMPNEEHASSHT
QLSSNFFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYIVDIFGQTHPSVANEIFDCMNCGRSIMAGRFAPHLEKCMGRGRKAR
LKVTRSSTAAQGRYSRGSPVPAYSPYTSSPGANRLPNGMSGVAGEEYSNGTSEDP