; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS019633 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS019633
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionKAT8 regulatory NSL complex subunit 1 like
Genome locationscaffold729:1138165..1139651
RNA-Seq ExpressionMS019633
SyntenyMS019633
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6580620.1 hypothetical protein SDJN03_20622, partial [Cucurbita argyrosperma subsp. sororia]1.0e-11288.75Show/hide
Query:  TMAEASAALCFSSFSSPHCISRSLDLSPSFLSSPRFSFSVSHHRPSRLLRFSVRSSGSGSFMGDDSSGLFPWADGGSEIHWVPEERVTLFTPDGLVQIGG
        +MA+ S +LCFSS SSP CISRSL      LSSPR  FS+SHHRPSRLLRFSV+SS SGSF+GDDS GLFPW DG +EIHWVPEERVTLFTPDGLVQIGG
Subjt:  TMAEASAALCFSSFSSPHCISRSLDLSPSFLSSPRFSFSVSHHRPSRLLRFSVRSSGSGSFMGDDSSGLFPWADGGSEIHWVPEERVTLFTPDGLVQIGG

Query:  SIVPRRISSSDKKQGKSKTYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGLQEKL
        SIVPRRIS SDKKQGKSK YQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGLQEKL
Subjt:  SIVPRRISSSDKKQGKSKTYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGLQEKL

Query:  TMTVAVPLLWGVPPASETLHSAVQSGGGIVEKVYWQWNFL
        TMTVAVPLLWGVPPASETLH AVQSGGGIVEKVYWQW+FL
Subjt:  TMTVAVPLLWGVPPASETLHSAVQSGGGIVEKVYWQWNFL

XP_022144581.1 uncharacterized protein LOC111014232 isoform X1 [Momordica charantia]8.2e-131100Show/hide
Query:  MAEASAALCFSSFSSPHCISRSLDLSPSFLSSPRFSFSVSHHRPSRLLRFSVRSSGSGSFMGDDSSGLFPWADGGSEIHWVPEERVTLFTPDGLVQIGGS
        MAEASAALCFSSFSSPHCISRSLDLSPSFLSSPRFSFSVSHHRPSRLLRFSVRSSGSGSFMGDDSSGLFPWADGGSEIHWVPEERVTLFTPDGLVQIGGS
Subjt:  MAEASAALCFSSFSSPHCISRSLDLSPSFLSSPRFSFSVSHHRPSRLLRFSVRSSGSGSFMGDDSSGLFPWADGGSEIHWVPEERVTLFTPDGLVQIGGS

Query:  IVPRRISSSDKKQGKSKTYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGLQEKLT
        IVPRRISSSDKKQGKSKTYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGLQEKLT
Subjt:  IVPRRISSSDKKQGKSKTYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGLQEKLT

Query:  MTVAVPLLWGVPPASETLHSAVQSGGGIVEKVYWQWNFL
        MTVAVPLLWGVPPASETLHSAVQSGGGIVEKVYWQWNFL
Subjt:  MTVAVPLLWGVPPASETLHSAVQSGGGIVEKVYWQWNFL

XP_022934142.1 uncharacterized protein LOC111441404 isoform X1 [Cucurbita moschata]3.4e-11389.17Show/hide
Query:  TMAEASAALCFSSFSSPHCISRSLDLSPSFLSSPRFSFSVSHHRPSRLLRFSVRSSGSGSFMGDDSSGLFPWADGGSEIHWVPEERVTLFTPDGLVQIGG
        +MA+ S +LCFSS SSP CISRSL      LSSPR  FS+SHHRPSRLLRFSV+SS SGSFMGDDS GLFPW DG +EIHWVPEERVTLFTPDGLVQIGG
Subjt:  TMAEASAALCFSSFSSPHCISRSLDLSPSFLSSPRFSFSVSHHRPSRLLRFSVRSSGSGSFMGDDSSGLFPWADGGSEIHWVPEERVTLFTPDGLVQIGG

Query:  SIVPRRISSSDKKQGKSKTYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGLQEKL
        SIVPRRIS SDKKQGKSK YQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGLQEKL
Subjt:  SIVPRRISSSDKKQGKSKTYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGLQEKL

Query:  TMTVAVPLLWGVPPASETLHSAVQSGGGIVEKVYWQWNFL
        TMTVAVPLLWGVPPASETLH AVQSGGGIVEKVYWQW+FL
Subjt:  TMTVAVPLLWGVPPASETLHSAVQSGGGIVEKVYWQWNFL

XP_022983093.1 uncharacterized protein LOC111481743 [Cucurbita maxima]3.4e-11388.75Show/hide
Query:  TMAEASAALCFSSFSSPHCISRSLDLSPSFLSSPRFSFSVSHHRPSRLLRFSVRSSGSGSFMGDDSSGLFPWADGGSEIHWVPEERVTLFTPDGLVQIGG
        +M + S +LCFSS SSP CISRSL      LSSPR  FS+SHHRPSRLLRFS++SS SGSFMGDDS GLFPW DG +EIHWVPEERVTLFTPDGLVQIGG
Subjt:  TMAEASAALCFSSFSSPHCISRSLDLSPSFLSSPRFSFSVSHHRPSRLLRFSVRSSGSGSFMGDDSSGLFPWADGGSEIHWVPEERVTLFTPDGLVQIGG

Query:  SIVPRRISSSDKKQGKSKTYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGLQEKL
        SIVPRRISSSDKKQGKSK YQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGLQEKL
Subjt:  SIVPRRISSSDKKQGKSKTYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGLQEKL

Query:  TMTVAVPLLWGVPPASETLHSAVQSGGGIVEKVYWQWNFL
        TMTVAVPLLWGVPPASETLH AVQSGGGIVEKVYWQW+FL
Subjt:  TMTVAVPLLWGVPPASETLHSAVQSGGGIVEKVYWQWNFL

XP_023526196.1 uncharacterized protein LOC111789748 [Cucurbita pepo subsp. pepo]1.0e-11288.75Show/hide
Query:  TMAEASAALCFSSFSSPHCISRSLDLSPSFLSSPRFSFSVSHHRPSRLLRFSVRSSGSGSFMGDDSSGLFPWADGGSEIHWVPEERVTLFTPDGLVQIGG
        +MA+ S + CFSS SSP CISRSL      LSSPR  FS+SHHRPSRLLRFSV+SS SGSF+GDDS GLFPW DG +EIHWVPEERVTLFTPDGLVQIGG
Subjt:  TMAEASAALCFSSFSSPHCISRSLDLSPSFLSSPRFSFSVSHHRPSRLLRFSVRSSGSGSFMGDDSSGLFPWADGGSEIHWVPEERVTLFTPDGLVQIGG

Query:  SIVPRRISSSDKKQGKSKTYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGLQEKL
        SIVPRRISSSDKKQGKSK YQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGLQEKL
Subjt:  SIVPRRISSSDKKQGKSKTYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGLQEKL

Query:  TMTVAVPLLWGVPPASETLHSAVQSGGGIVEKVYWQWNFL
        TMTVAVPLLWGVPPASETLH AVQSGGGIVEKVYWQW+FL
Subjt:  TMTVAVPLLWGVPPASETLHSAVQSGGGIVEKVYWQWNFL

TrEMBL top hitse value%identityAlignment
A0A1S3B735 uncharacterized protein LOC1034865016.8e-10785.66Show/hide
Query:  TMAEASAALCFSSFSSPHCISRSLDLSPSFLSSPRF---SFSVSHHRPSRLLRFSVRSSGSGSFMGD-DSSGLFPWADGGSEIHWVPEERVTLFTPDGLV
        +MA+ S +  FSSFSS      SL LSPSFL  P      F +SHHRPS LLRFS++SS SG FMGD DS GLFPWADG SEIHWVPEERVTLFTPDGLV
Subjt:  TMAEASAALCFSSFSSPHCISRSLDLSPSFLSSPRF---SFSVSHHRPSRLLRFSVRSSGSGSFMGD-DSSGLFPWADGGSEIHWVPEERVTLFTPDGLV

Query:  QIGGSIVPRRISSSDKKQGKSKTYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGL
        QIGGSIVPRRISSSDKKQGKSKT QRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGL
Subjt:  QIGGSIVPRRISSSDKKQGKSKTYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGL

Query:  QEKLTMTVAVPLLWGVPPASETLHSAVQSGGGIVEKVYWQWNFL
        QEKLTMTVAVPLLWGVPPASETLH AVQSGGGIVEKVYWQW+FL
Subjt:  QEKLTMTVAVPLLWGVPPASETLHSAVQSGGGIVEKVYWQWNFL

A0A5D3DPB2 Uncharacterized protein6.8e-10785.66Show/hide
Query:  TMAEASAALCFSSFSSPHCISRSLDLSPSFLSSPRF---SFSVSHHRPSRLLRFSVRSSGSGSFMGD-DSSGLFPWADGGSEIHWVPEERVTLFTPDGLV
        +MA+ S +  FSSFSS      SL LSPSFL  P      F +SHHRPS LLRFS++SS SG FMGD DS GLFPWADG SEIHWVPEERVTLFTPDGLV
Subjt:  TMAEASAALCFSSFSSPHCISRSLDLSPSFLSSPRF---SFSVSHHRPSRLLRFSVRSSGSGSFMGD-DSSGLFPWADGGSEIHWVPEERVTLFTPDGLV

Query:  QIGGSIVPRRISSSDKKQGKSKTYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGL
        QIGGSIVPRRISSSDKKQGKSKT QRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGL
Subjt:  QIGGSIVPRRISSSDKKQGKSKTYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGL

Query:  QEKLTMTVAVPLLWGVPPASETLHSAVQSGGGIVEKVYWQWNFL
        QEKLTMTVAVPLLWGVPPASETLH AVQSGGGIVEKVYWQW+FL
Subjt:  QEKLTMTVAVPLLWGVPPASETLHSAVQSGGGIVEKVYWQWNFL

A0A6J1CTU7 uncharacterized protein LOC111014232 isoform X13.9e-131100Show/hide
Query:  MAEASAALCFSSFSSPHCISRSLDLSPSFLSSPRFSFSVSHHRPSRLLRFSVRSSGSGSFMGDDSSGLFPWADGGSEIHWVPEERVTLFTPDGLVQIGGS
        MAEASAALCFSSFSSPHCISRSLDLSPSFLSSPRFSFSVSHHRPSRLLRFSVRSSGSGSFMGDDSSGLFPWADGGSEIHWVPEERVTLFTPDGLVQIGGS
Subjt:  MAEASAALCFSSFSSPHCISRSLDLSPSFLSSPRFSFSVSHHRPSRLLRFSVRSSGSGSFMGDDSSGLFPWADGGSEIHWVPEERVTLFTPDGLVQIGGS

Query:  IVPRRISSSDKKQGKSKTYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGLQEKLT
        IVPRRISSSDKKQGKSKTYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGLQEKLT
Subjt:  IVPRRISSSDKKQGKSKTYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGLQEKLT

Query:  MTVAVPLLWGVPPASETLHSAVQSGGGIVEKVYWQWNFL
        MTVAVPLLWGVPPASETLHSAVQSGGGIVEKVYWQWNFL
Subjt:  MTVAVPLLWGVPPASETLHSAVQSGGGIVEKVYWQWNFL

A0A6J1F1V4 uncharacterized protein LOC111441404 isoform X11.7e-11389.17Show/hide
Query:  TMAEASAALCFSSFSSPHCISRSLDLSPSFLSSPRFSFSVSHHRPSRLLRFSVRSSGSGSFMGDDSSGLFPWADGGSEIHWVPEERVTLFTPDGLVQIGG
        +MA+ S +LCFSS SSP CISRSL      LSSPR  FS+SHHRPSRLLRFSV+SS SGSFMGDDS GLFPW DG +EIHWVPEERVTLFTPDGLVQIGG
Subjt:  TMAEASAALCFSSFSSPHCISRSLDLSPSFLSSPRFSFSVSHHRPSRLLRFSVRSSGSGSFMGDDSSGLFPWADGGSEIHWVPEERVTLFTPDGLVQIGG

Query:  SIVPRRISSSDKKQGKSKTYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGLQEKL
        SIVPRRIS SDKKQGKSK YQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGLQEKL
Subjt:  SIVPRRISSSDKKQGKSKTYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGLQEKL

Query:  TMTVAVPLLWGVPPASETLHSAVQSGGGIVEKVYWQWNFL
        TMTVAVPLLWGVPPASETLH AVQSGGGIVEKVYWQW+FL
Subjt:  TMTVAVPLLWGVPPASETLHSAVQSGGGIVEKVYWQWNFL

A0A6J1J6S7 uncharacterized protein LOC1114817431.7e-11388.75Show/hide
Query:  TMAEASAALCFSSFSSPHCISRSLDLSPSFLSSPRFSFSVSHHRPSRLLRFSVRSSGSGSFMGDDSSGLFPWADGGSEIHWVPEERVTLFTPDGLVQIGG
        +M + S +LCFSS SSP CISRSL      LSSPR  FS+SHHRPSRLLRFS++SS SGSFMGDDS GLFPW DG +EIHWVPEERVTLFTPDGLVQIGG
Subjt:  TMAEASAALCFSSFSSPHCISRSLDLSPSFLSSPRFSFSVSHHRPSRLLRFSVRSSGSGSFMGDDSSGLFPWADGGSEIHWVPEERVTLFTPDGLVQIGG

Query:  SIVPRRISSSDKKQGKSKTYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGLQEKL
        SIVPRRISSSDKKQGKSK YQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGLQEKL
Subjt:  SIVPRRISSSDKKQGKSKTYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGLQEKL

Query:  TMTVAVPLLWGVPPASETLHSAVQSGGGIVEKVYWQWNFL
        TMTVAVPLLWGVPPASETLH AVQSGGGIVEKVYWQW+FL
Subjt:  TMTVAVPLLWGVPPASETLHSAVQSGGGIVEKVYWQWNFL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G36895.1 unknown protein8.8e-8366.53Show/hide
Query:  MAEASAALCFSSFSSPHCISRSLDLSPSFLSSPRFSFSVSHHRPSRLLRFSVRSSGSGSFMGDDSSGLFPWADGGSEIHWVPEERVTLFTPDGLVQIGGS
        MAE S  L FS+FSS   IS      P   S+ RFS  +S  RPS   RF+V++S  G+F  DD+   FPW+D  +EI WVPEER+TLFT DGLVQIGG+
Subjt:  MAEASAALCFSSFSSPHCISRSLDLSPSFLSSPRFSFSVSHHRPSRLLRFSVRSSGSGSFMGDDSSGLFPWADGGSEIHWVPEERVTLFTPDGLVQIGGS

Query:  IVPRRISSSDKKQGKSKTYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGLQEKLT
        +VPRRI SS+KK G+S++ ++ Q+F ES YMDP Q +CLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVED VLE GGE+VA E  S  GLQEKLT
Subjt:  IVPRRISSSDKKQGKSKTYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGLQEKLT

Query:  MTVAVPLLWGVPPASETLHSAVQSGGGIVEKVYWQWNFL
        MTVAVP LWGVPPA+E LH AV++GGGIV+KVYWQW+FL
Subjt:  MTVAVPLLWGVPPASETLHSAVQSGGGIVEKVYWQWNFL

AT2G36895.2 unknown protein8.2e-8166.11Show/hide
Query:  MAEASAALCFSSFSSPHCISRSLDLSPSFLSSPRFSFSVSHHRPSRLLRFSVRSSGSGSFMGDDSSGLFPWADGGSEIHWVPEERVTLFTPDGLVQIGGS
        MAE S  L FS+FSS   IS      P   S+ RFS  +S  RPS   RF+V++S  G+F  DD+   FPW+D  +EI WVPEER+TLFT DGLVQIGG+
Subjt:  MAEASAALCFSSFSSPHCISRSLDLSPSFLSSPRFSFSVSHHRPSRLLRFSVRSSGSGSFMGDDSSGLFPWADGGSEIHWVPEERVTLFTPDGLVQIGGS

Query:  IVPRRISSSDKKQGKSKTYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGLQEKLT
        +VPRRI SS+ K G+S++ ++ Q+F ES YMDP Q +CLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVED VLE GGE+VA E  S  GLQEKLT
Subjt:  IVPRRISSSDKKQGKSKTYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGLQEKLT

Query:  MTVAVPLLWGVPPASETLHSAVQSGGGIVEKVYWQWNFL
        MTVAVP LWGVPPA+E LH AV++GGGIV+KVYWQW+FL
Subjt:  MTVAVPLLWGVPPASETLHSAVQSGGGIVEKVYWQWNFL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
TGGTCCCTAACAATGGCGGAGGCATCTGCTGCCCTCTGTTTCTCCTCCTTCTCCTCGCCCCACTGCATTTCCCGCTCCCTCGACCTCTCCCCCTCTTTCCTCTCCTCTCC
TAGATTTTCCTTTTCGGTTTCTCATCACCGTCCCTCTCGTCTTCTCCGTTTCTCCGTCAGATCCTCCGGCTCCGGAAGCTTCATGGGGGACGATTCTTCCGGATTGTTTC
CCTGGGCCGACGGCGGCAGTGAAATCCATTGGGTTCCTGAGGAGAGAGTCACATTGTTCACTCCAGATGGGCTTGTTCAGATTGGAGGTTCAATCGTCCCTCGACGCATC
TCTTCTTCAGATAAAAAACAAGGGAAATCAAAGACTTACCAGAGATTCCAACGGTTCCAAGAGAGTGATTATATGGATCCCAAACAGAGCATATGTCTTGGTGCATTATT
TGATATTGCAGCTACCAACGGACTTGACATGGGAAGAAGACTTTGTATCTTTGGTTTTTGCCGTTCAGTGGAGATGCTCAGTGATGTTGTGGAAGACATTGTTTTGGAGC
AAGGTGGAGAGGTTGTAGCAGCAGAGAAGGCAAGTAAAGGGGGCTTGCAGGAAAAGCTAACCATGACAGTAGCTGTTCCACTACTTTGGGGGGTTCCTCCTGCTTCTGAA
ACTCTTCACTCAGCTGTTCAGAGTGGTGGAGGGATTGTGGAGAAGGTCTATTGGCAATGGAATTTTTTG
mRNA sequenceShow/hide mRNA sequence
TGGTCCCTAACAATGGCGGAGGCATCTGCTGCCCTCTGTTTCTCCTCCTTCTCCTCGCCCCACTGCATTTCCCGCTCCCTCGACCTCTCCCCCTCTTTCCTCTCCTCTCC
TAGATTTTCCTTTTCGGTTTCTCATCACCGTCCCTCTCGTCTTCTCCGTTTCTCCGTCAGATCCTCCGGCTCCGGAAGCTTCATGGGGGACGATTCTTCCGGATTGTTTC
CCTGGGCCGACGGCGGCAGTGAAATCCATTGGGTTCCTGAGGAGAGAGTCACATTGTTCACTCCAGATGGGCTTGTTCAGATTGGAGGTTCAATCGTCCCTCGACGCATC
TCTTCTTCAGATAAAAAACAAGGGAAATCAAAGACTTACCAGAGATTCCAACGGTTCCAAGAGAGTGATTATATGGATCCCAAACAGAGCATATGTCTTGGTGCATTATT
TGATATTGCAGCTACCAACGGACTTGACATGGGAAGAAGACTTTGTATCTTTGGTTTTTGCCGTTCAGTGGAGATGCTCAGTGATGTTGTGGAAGACATTGTTTTGGAGC
AAGGTGGAGAGGTTGTAGCAGCAGAGAAGGCAAGTAAAGGGGGCTTGCAGGAAAAGCTAACCATGACAGTAGCTGTTCCACTACTTTGGGGGGTTCCTCCTGCTTCTGAA
ACTCTTCACTCAGCTGTTCAGAGTGGTGGAGGGATTGTGGAGAAGGTCTATTGGCAATGGAATTTTTTG
Protein sequenceShow/hide protein sequence
WSLTMAEASAALCFSSFSSPHCISRSLDLSPSFLSSPRFSFSVSHHRPSRLLRFSVRSSGSGSFMGDDSSGLFPWADGGSEIHWVPEERVTLFTPDGLVQIGGSIVPRRI
SSSDKKQGKSKTYQRFQRFQESDYMDPKQSICLGALFDIAATNGLDMGRRLCIFGFCRSVEMLSDVVEDIVLEQGGEVVAAEKASKGGLQEKLTMTVAVPLLWGVPPASE
TLHSAVQSGGGIVEKVYWQWNFL