; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS017660 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS017660
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionZinc finger matrin-type protein 1, putative isoform 1
Genome locationscaffold373:1255893..1258004
RNA-Seq ExpressionMS017660
SyntenyMS017660
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022131237.1 uncharacterized protein LOC111004499 [Momordica charantia]5.2e-13697.69Show/hide
Query:  MISLAYASLSSSPSNLSSLKLRLPRPPSTFSTSLSNLKSFNPCDKAASDQKRIGNGVCRADLGNDGPFAVAIGACILSSFVFPVAGGGSDDESDAVIDST
        MISLAYASLSSSPSNLSSLKLRLPRPPSTFSTSLSNLKS NPCDKAASDQKRIGNGVCRADLGNDGPFAVAIGACILSSFVFPVAGGGSDDESDAVIDST
Subjt:  MISLAYASLSSSPSNLSSLKLRLPRPPSTFSTSLSNLKSFNPCDKAASDQKRIGNGVCRADLGNDGPFAVAIGACILSSFVFPVAGGGSDDESDAVIDST

Query:  DTRFAVMGIISFIPYFNWLSWVFAWLDSGRRLYAVYALVYLVPYLRSNLSLSPEESWLPIASILLCIIHIQLEVSIRNGDIQPFQIFGKTSKKISSTTRG
        DTRFAVMGIISFIPYFNWLSWVFAWLDSGRRLYAVYALVYLVPYLRSNLSLSPEESWLPIASILLCIIHIQLEVSIRNGDIQPFQIFGKTSKKISSTTRG
Subjt:  DTRFAVMGIISFIPYFNWLSWVFAWLDSGRRLYAVYALVYLVPYLRSNLSLSPEESWLPIASILLCIIHIQLEVSIRNGDIQPFQIFGKTSKKISSTTRG

Query:  RDHFKGSQGPPE-----EDMKLPSIQEQLRDEIRRWGDSKETLDHEQSNGEWDDEQRRKH
        RDHFKGSQGPPE     EDMKLPSIQEQLRDEIRRWGDSKETLDHEQSNGEWDDEQRRKH
Subjt:  RDHFKGSQGPPE-----EDMKLPSIQEQLRDEIRRWGDSKETLDHEQSNGEWDDEQRRKH

XP_022958725.1 uncharacterized protein LOC111459865 isoform X2 [Cucurbita moschata]1.1e-10175.77Show/hide
Query:  MISLAYASLSSSPSNLSSLK-LRLPRPPSTFSTSLSNLKSFNPCDKAASDQKRIGNGVCRADLGNDGPFAVAIGACILSSFVFPVAGGGSDDESDAVIDS
        MI+LA A LSSSPSN SSL  LRL +PP TFSTSLSNLK  NP  K+AS+Q+   NG+CRA+LGND PFA+AIGACILSS V P AGGGSDD+SDAV+DS
Subjt:  MISLAYASLSSSPSNLSSLK-LRLPRPPSTFSTSLSNLKSFNPCDKAASDQKRIGNGVCRADLGNDGPFAVAIGACILSSFVFPVAGGGSDDESDAVIDS

Query:  TDTRFAVMGIISFIPYFNWLSWVFAWLDSGRRLYAVYALVYLVPYLRSNLSLSPEESWLPIASILLCIIHIQLEVSIRNGDIQPFQIFGKTSKKISSTTR
        TD R AVMGIISFIPYFNWLSWVFAWLDSG+R YAVYA+VYL PYLRSNLSLSP+ESWLPI SIL+CI HIQ+E SI+NGDIQPFQIFGKTS +IS T  
Subjt:  TDTRFAVMGIISFIPYFNWLSWVFAWLDSGRRLYAVYALVYLVPYLRSNLSLSPEESWLPIASILLCIIHIQLEVSIRNGDIQPFQIFGKTSKKISSTTR

Query:  GRDHFKGSQGPPEE-----DMKLPSIQEQLRDEIRRWGDSKETLDHEQSNGEWDDEQRRK
         R H KGSQGP ++     DMKLPS +EQLRDEI+ WGD KETLDHEQSN EWDDEQRRK
Subjt:  GRDHFKGSQGPPEE-----DMKLPSIQEQLRDEIRRWGDSKETLDHEQSNGEWDDEQRRK

XP_023534481.1 uncharacterized protein LOC111796027 isoform X1 [Cucurbita pepo subsp. pepo]1.9e-10176.25Show/hide
Query:  MISLAYASLSSSPSNLSSLK-LRLPRPPSTFSTSLSNLKSFNPCDKAASDQKR-IGNGVCRADLGNDGPFAVAIGACILSSFVFPVAGGGSDDESDAVID
        MI+LA A LSSSPSN SSL  LRL +PP TFSTSLSNLK  NP  K+AS+QKR   NG+CRA+LGND PFA+AIGACIL+S V P AGGGSDD+SDAV+D
Subjt:  MISLAYASLSSSPSNLSSLK-LRLPRPPSTFSTSLSNLKSFNPCDKAASDQKR-IGNGVCRADLGNDGPFAVAIGACILSSFVFPVAGGGSDDESDAVID

Query:  STDTRFAVMGIISFIPYFNWLSWVFAWLDSGRRLYAVYALVYLVPYLRSNLSLSPEESWLPIASILLCIIHIQLEVSIRNGDIQPFQIFGKTSKKISSTT
        STD R AVMGIISFIPYFNWLSWVFAWLDSG+R YAVYA+VYL PYLRSNLSLSP+ESWLPI SIL+CI HIQ+E SI+NGDIQPFQIFGK S +IS T 
Subjt:  STDTRFAVMGIISFIPYFNWLSWVFAWLDSGRRLYAVYALVYLVPYLRSNLSLSPEESWLPIASILLCIIHIQLEVSIRNGDIQPFQIFGKTSKKISSTT

Query:  RGRDHFKGSQGPPEE-----DMKLPSIQEQLRDEIRRWGDSKETLDHEQSNGEWDDEQRRK
         GR H KGSQGP ++     DMKLPS +EQLRDEIR WGD KETLDHEQSN EWDDEQRRK
Subjt:  RGRDHFKGSQGPPEE-----DMKLPSIQEQLRDEIRRWGDSKETLDHEQSNGEWDDEQRRK

XP_023534483.1 uncharacterized protein LOC111796027 isoform X2 [Cucurbita pepo subsp. pepo]8.3e-10275.77Show/hide
Query:  MISLAYASLSSSPSNLSSLK-LRLPRPPSTFSTSLSNLKSFNPCDKAASDQKRIGNGVCRADLGNDGPFAVAIGACILSSFVFPVAGGGSDDESDAVIDS
        MI+LA A LSSSPSN SSL  LRL +PP TFSTSLSNLK  NP  K+AS+Q+   NG+CRA+LGND PFA+AIGACIL+S V P AGGGSDD+SDAV+DS
Subjt:  MISLAYASLSSSPSNLSSLK-LRLPRPPSTFSTSLSNLKSFNPCDKAASDQKRIGNGVCRADLGNDGPFAVAIGACILSSFVFPVAGGGSDDESDAVIDS

Query:  TDTRFAVMGIISFIPYFNWLSWVFAWLDSGRRLYAVYALVYLVPYLRSNLSLSPEESWLPIASILLCIIHIQLEVSIRNGDIQPFQIFGKTSKKISSTTR
        TD R AVMGIISFIPYFNWLSWVFAWLDSG+R YAVYA+VYL PYLRSNLSLSP+ESWLPI SIL+CI HIQ+E SI+NGDIQPFQIFGK S +IS T  
Subjt:  TDTRFAVMGIISFIPYFNWLSWVFAWLDSGRRLYAVYALVYLVPYLRSNLSLSPEESWLPIASILLCIIHIQLEVSIRNGDIQPFQIFGKTSKKISSTTR

Query:  GRDHFKGSQGPPEE-----DMKLPSIQEQLRDEIRRWGDSKETLDHEQSNGEWDDEQRRK
        GR H KGSQGP ++     DMKLPS +EQLRDEIR WGD KETLDHEQSN EWDDEQRRK
Subjt:  GRDHFKGSQGPPEE-----DMKLPSIQEQLRDEIRRWGDSKETLDHEQSNGEWDDEQRRK

XP_038875289.1 uncharacterized protein LOC120067780 [Benincasa hispida]7.3e-10679.69Show/hide
Query:  MISLAYASLSSSPSNLSSLK-LRLPRPPSTFSTSLSNLKSFNPCDKAASDQKRIGNGVCRADLGNDGPFAVAIGACILSSFVFPVAGGGSDDESDAVIDS
        MI+LA A LSSS SNLSSLK LRL +P STFS SLSNLK  NP  K  S+Q RIGNG+CRA+LGND PFA+AIGAC LSS V PVA G SDDESDA+IDS
Subjt:  MISLAYASLSSSPSNLSSLK-LRLPRPPSTFSTSLSNLKSFNPCDKAASDQKRIGNGVCRADLGNDGPFAVAIGACILSSFVFPVAGGGSDDESDAVIDS

Query:  TDTRFAVMGIISFIPYFNWLSWVFAWLDSGRRLYAVYALVYLVPYLRSNLSLSPEESWLPIASILLCIIHIQLEVSIRNGDIQPFQIFGKTSKKISSTTR
        TDTR AVM IISFIPYFNWLSWVFAWLDSGRRLYAVYA+VYL PYLRSNLSLSPEESWLPI SILLCIIHIQLEVSI NGDIQP QIFGK SK ISST +
Subjt:  TDTRFAVMGIISFIPYFNWLSWVFAWLDSGRRLYAVYALVYLVPYLRSNLSLSPEESWLPIASILLCIIHIQLEVSIRNGDIQPFQIFGKTSKKISSTTR

Query:  GRDHFKGSQGP-----PEEDMKLPSIQEQLRDEIRRWGDSKETLDHEQSNGEWDDEQRRKH
        GRDHFKGSQGP      +ED KLPS +EQ +D+IRRWGDSKE LD+EQSNGEWDDEQRRKH
Subjt:  GRDHFKGSQGP-----PEEDMKLPSIQEQLRDEIRRWGDSKETLDHEQSNGEWDDEQRRKH

TrEMBL top hitse value%identityAlignment
A0A6J1BQE6 uncharacterized protein LOC1110044992.5e-13697.69Show/hide
Query:  MISLAYASLSSSPSNLSSLKLRLPRPPSTFSTSLSNLKSFNPCDKAASDQKRIGNGVCRADLGNDGPFAVAIGACILSSFVFPVAGGGSDDESDAVIDST
        MISLAYASLSSSPSNLSSLKLRLPRPPSTFSTSLSNLKS NPCDKAASDQKRIGNGVCRADLGNDGPFAVAIGACILSSFVFPVAGGGSDDESDAVIDST
Subjt:  MISLAYASLSSSPSNLSSLKLRLPRPPSTFSTSLSNLKSFNPCDKAASDQKRIGNGVCRADLGNDGPFAVAIGACILSSFVFPVAGGGSDDESDAVIDST

Query:  DTRFAVMGIISFIPYFNWLSWVFAWLDSGRRLYAVYALVYLVPYLRSNLSLSPEESWLPIASILLCIIHIQLEVSIRNGDIQPFQIFGKTSKKISSTTRG
        DTRFAVMGIISFIPYFNWLSWVFAWLDSGRRLYAVYALVYLVPYLRSNLSLSPEESWLPIASILLCIIHIQLEVSIRNGDIQPFQIFGKTSKKISSTTRG
Subjt:  DTRFAVMGIISFIPYFNWLSWVFAWLDSGRRLYAVYALVYLVPYLRSNLSLSPEESWLPIASILLCIIHIQLEVSIRNGDIQPFQIFGKTSKKISSTTRG

Query:  RDHFKGSQGPPE-----EDMKLPSIQEQLRDEIRRWGDSKETLDHEQSNGEWDDEQRRKH
        RDHFKGSQGPPE     EDMKLPSIQEQLRDEIRRWGDSKETLDHEQSNGEWDDEQRRKH
Subjt:  RDHFKGSQGPPE-----EDMKLPSIQEQLRDEIRRWGDSKETLDHEQSNGEWDDEQRRKH

A0A6J1H4A9 uncharacterized protein LOC111459865 isoform X25.3e-10275.77Show/hide
Query:  MISLAYASLSSSPSNLSSLK-LRLPRPPSTFSTSLSNLKSFNPCDKAASDQKRIGNGVCRADLGNDGPFAVAIGACILSSFVFPVAGGGSDDESDAVIDS
        MI+LA A LSSSPSN SSL  LRL +PP TFSTSLSNLK  NP  K+AS+Q+   NG+CRA+LGND PFA+AIGACILSS V P AGGGSDD+SDAV+DS
Subjt:  MISLAYASLSSSPSNLSSLK-LRLPRPPSTFSTSLSNLKSFNPCDKAASDQKRIGNGVCRADLGNDGPFAVAIGACILSSFVFPVAGGGSDDESDAVIDS

Query:  TDTRFAVMGIISFIPYFNWLSWVFAWLDSGRRLYAVYALVYLVPYLRSNLSLSPEESWLPIASILLCIIHIQLEVSIRNGDIQPFQIFGKTSKKISSTTR
        TD R AVMGIISFIPYFNWLSWVFAWLDSG+R YAVYA+VYL PYLRSNLSLSP+ESWLPI SIL+CI HIQ+E SI+NGDIQPFQIFGKTS +IS T  
Subjt:  TDTRFAVMGIISFIPYFNWLSWVFAWLDSGRRLYAVYALVYLVPYLRSNLSLSPEESWLPIASILLCIIHIQLEVSIRNGDIQPFQIFGKTSKKISSTTR

Query:  GRDHFKGSQGPPEE-----DMKLPSIQEQLRDEIRRWGDSKETLDHEQSNGEWDDEQRRK
         R H KGSQGP ++     DMKLPS +EQLRDEI+ WGD KETLDHEQSN EWDDEQRRK
Subjt:  GRDHFKGSQGPPEE-----DMKLPSIQEQLRDEIRRWGDSKETLDHEQSNGEWDDEQRRK

A0A6J1H5Y1 uncharacterized protein LOC111459865 isoform X12.6e-10175.86Show/hide
Query:  MISLAYASLSSSPSNLSSLK-LRLPRPPSTFSTSLSNLKSFNPCDKAASDQKR-IGNGVCRADLGNDGPFAVAIGACILSSFVFPVAGGGSDDESDAVID
        MI+LA A LSSSPSN SSL  LRL +PP TFSTSLSNLK  NP  K+AS+Q+R   NG+CRA+LGND PFA+AIGACILSS V P AGGGSDD+SDAV+D
Subjt:  MISLAYASLSSSPSNLSSLK-LRLPRPPSTFSTSLSNLKSFNPCDKAASDQKR-IGNGVCRADLGNDGPFAVAIGACILSSFVFPVAGGGSDDESDAVID

Query:  STDTRFAVMGIISFIPYFNWLSWVFAWLDSGRRLYAVYALVYLVPYLRSNLSLSPEESWLPIASILLCIIHIQLEVSIRNGDIQPFQIFGKTSKKISSTT
        STD R AVMGIISFIPYFNWLSWVFAWLDSG+R YAVYA+VYL PYLRSNLSLSP+ESWLPI SIL+CI HIQ+E SI+NGDIQPFQIFGKTS +IS T 
Subjt:  STDTRFAVMGIISFIPYFNWLSWVFAWLDSGRRLYAVYALVYLVPYLRSNLSLSPEESWLPIASILLCIIHIQLEVSIRNGDIQPFQIFGKTSKKISSTT

Query:  RGRDHFKGSQGPPEE-----DMKLPSIQEQLRDEIRRWGDSKETLDHEQSNGEWDDEQRRK
          R H KGSQGP ++     DMKLPS +EQLRDEI+ WGD KETLDHEQSN EWDDEQRRK
Subjt:  RGRDHFKGSQGPPEE-----DMKLPSIQEQLRDEIRRWGDSKETLDHEQSNGEWDDEQRRK

A0A6J1K887 uncharacterized protein LOC111491538 isoform X23.4e-10175Show/hide
Query:  MISLAYASLSSSPSNLSSLK-LRLPRPPSTFSTSLSNLKSFNPCDKAASDQKRIGNGVCRADLGNDGPFAVAIGACILSSFVFPVAGGGSDDESDAVIDS
        M++LA A LSSSPSN SSL  LRL +PP TFSTSLSNLK  NP  K+AS+Q+   NG+CRA+LGND PFA+AIGACILSS V P AGGGSDD+SDAV+DS
Subjt:  MISLAYASLSSSPSNLSSLK-LRLPRPPSTFSTSLSNLKSFNPCDKAASDQKRIGNGVCRADLGNDGPFAVAIGACILSSFVFPVAGGGSDDESDAVIDS

Query:  TDTRFAVMGIISFIPYFNWLSWVFAWLDSGRRLYAVYALVYLVPYLRSNLSLSPEESWLPIASILLCIIHIQLEVSIRNGDIQPFQIFGKTSKKISSTTR
        TD R AVMGIISFIPYFNWLSWVFAWLDSG+R YAVYA+VYL PYLRSNLSLSP+ESWLPI SIL+CI HIQ+E SI+NGDIQPFQIFGK S +IS T  
Subjt:  TDTRFAVMGIISFIPYFNWLSWVFAWLDSGRRLYAVYALVYLVPYLRSNLSLSPEESWLPIASILLCIIHIQLEVSIRNGDIQPFQIFGKTSKKISSTTR

Query:  GRDHFKGSQGPPEE-----DMKLPSIQEQLRDEIRRWGDSKETLDHEQSNGEWDDEQRRK
        GR H KG +GP ++     DMKLPS +EQLRDEIR WGD KETLDHEQSN EWDDEQRRK
Subjt:  GRDHFKGSQGPPEE-----DMKLPSIQEQLRDEIRRWGDSKETLDHEQSNGEWDDEQRRK

A0A6J1KAA9 uncharacterized protein LOC111491538 isoform X11.7e-10075.1Show/hide
Query:  MISLAYASLSSSPSNLSSLK-LRLPRPPSTFSTSLSNLKSFNPCDKAASDQKR-IGNGVCRADLGNDGPFAVAIGACILSSFVFPVAGGGSDDESDAVID
        M++LA A LSSSPSN SSL  LRL +PP TFSTSLSNLK  NP  K+AS+Q+R   NG+CRA+LGND PFA+AIGACILSS V P AGGGSDD+SDAV+D
Subjt:  MISLAYASLSSSPSNLSSLK-LRLPRPPSTFSTSLSNLKSFNPCDKAASDQKR-IGNGVCRADLGNDGPFAVAIGACILSSFVFPVAGGGSDDESDAVID

Query:  STDTRFAVMGIISFIPYFNWLSWVFAWLDSGRRLYAVYALVYLVPYLRSNLSLSPEESWLPIASILLCIIHIQLEVSIRNGDIQPFQIFGKTSKKISSTT
        STD R AVMGIISFIPYFNWLSWVFAWLDSG+R YAVYA+VYL PYLRSNLSLSP+ESWLPI SIL+CI HIQ+E SI+NGDIQPFQIFGK S +IS T 
Subjt:  STDTRFAVMGIISFIPYFNWLSWVFAWLDSGRRLYAVYALVYLVPYLRSNLSLSPEESWLPIASILLCIIHIQLEVSIRNGDIQPFQIFGKTSKKISSTT

Query:  RGRDHFKGSQGPPEE-----DMKLPSIQEQLRDEIRRWGDSKETLDHEQSNGEWDDEQRRK
         GR H KG +GP ++     DMKLPS +EQLRDEIR WGD KETLDHEQSN EWDDEQRRK
Subjt:  RGRDHFKGSQGPPEE-----DMKLPSIQEQLRDEIRRWGDSKETLDHEQSNGEWDDEQRRK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G41960.1 unknown protein5.1e-4952.56Show/hide
Query:  LAYASLSSSPSNLSSLKLRLPRPPSTFSTSLSNLKSFNPCDKAASDQKRIGNGVCRADLGNDGPFAVAIGACILSSFVFPVAGGGSDDESD---AVIDST
        L    LSSS S  +  +L      S+ S+S S L    P        ++I   +CRA+   D P   AIGACILSSFVFPVA   +D+E +   + I ST
Subjt:  LAYASLSSSPSNLSSLKLRLPRPPSTFSTSLSNLKSFNPCDKAASDQKRIGNGVCRADLGNDGPFAVAIGACILSSFVFPVAGGGSDDESD---AVIDST

Query:  DTRFAVMGIISFIPYFNWLSWVFAWLDSGRRLYAVYALVYLVPYLRSNLSLSPEESWLPIASILLCIIHIQLEVSIRNGDIQPFQIFGKTSKKISSTTRG
        D R A MGIISFIPYFNWLSWVFAWLD+G+  YAVYALVYLVPYL SNLS+SPEESWLPI SI+L IIH+QLE SI NGD++    F  TS    S+ + 
Subjt:  DTRFAVMGIISFIPYFNWLSWVFAWLDSGRRLYAVYALVYLVPYLRSNLSLSPEESWLPIASILLCIIHIQLEVSIRNGDIQPFQIFGKTSKKISSTTRG

Query:  ---RDHFKGSQGPPE
           + HFKG     E
Subjt:  ---RDHFKGSQGPPE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTTCTCTAGCTTATGCTTCTCTATCATCCTCCCCTTCCAATTTGTCTTCTCTGAAGCTTCGTCTCCCCAGACCGCCTTCCACCTTCTCAACATCCCTCTCCAATCT
CAAATCCTTCAATCCTTGCGACAAAGCAGCTTCCGACCAGAAGAGGATTGGGAATGGGGTTTGTAGGGCGGACTTGGGGAACGACGGGCCTTTTGCCGTTGCGATCGGGG
CCTGCATTCTCAGTTCGTTTGTTTTTCCGGTAGCTGGCGGTGGTTCCGATGATGAGAGCGATGCCGTCATTGATTCCACCGATACCAGGTTCGCTGTCATGGGTATCATT
AGCTTTATCCCCTACTTCAACTGGCTGAGTTGGGTTTTTGCGTGGCTTGATTCTGGGAGAAGGCTTTATGCTGTGTATGCACTCGTGTATTTGGTCCCTTATCTAAGGTC
AAATTTATCTCTATCCCCTGAAGAGAGTTGGCTTCCTATTGCCAGTATACTTCTCTGCATTATTCACATTCAGCTTGAAGTGAGCATTAGAAATGGAGATATCCAACCCT
TCCAAATATTTGGAAAAACTTCCAAGAAAATTTCTTCAACTACAAGAGGGAGAGACCATTTCAAGGGGTCCCAAGGACCACCCGAAGAGGACATGAAGCTGCCATCAATT
CAAGAACAATTAAGAGATGAGATTAGAAGATGGGGAGATTCTAAAGAGACATTAGATCATGAACAATCAAATGGAGAATGGGATGATGAACAGAGGAGAAAACAT
mRNA sequenceShow/hide mRNA sequence
ATGATTTCTCTAGCTTATGCTTCTCTATCATCCTCCCCTTCCAATTTGTCTTCTCTGAAGCTTCGTCTCCCCAGACCGCCTTCCACCTTCTCAACATCCCTCTCCAATCT
CAAATCCTTCAATCCTTGCGACAAAGCAGCTTCCGACCAGAAGAGGATTGGGAATGGGGTTTGTAGGGCGGACTTGGGGAACGACGGGCCTTTTGCCGTTGCGATCGGGG
CCTGCATTCTCAGTTCGTTTGTTTTTCCGGTAGCTGGCGGTGGTTCCGATGATGAGAGCGATGCCGTCATTGATTCCACCGATACCAGGTTCGCTGTCATGGGTATCATT
AGCTTTATCCCCTACTTCAACTGGCTGAGTTGGGTTTTTGCGTGGCTTGATTCTGGGAGAAGGCTTTATGCTGTGTATGCACTCGTGTATTTGGTCCCTTATCTAAGGTC
AAATTTATCTCTATCCCCTGAAGAGAGTTGGCTTCCTATTGCCAGTATACTTCTCTGCATTATTCACATTCAGCTTGAAGTGAGCATTAGAAATGGAGATATCCAACCCT
TCCAAATATTTGGAAAAACTTCCAAGAAAATTTCTTCAACTACAAGAGGGAGAGACCATTTCAAGGGGTCCCAAGGACCACCCGAAGAGGACATGAAGCTGCCATCAATT
CAAGAACAATTAAGAGATGAGATTAGAAGATGGGGAGATTCTAAAGAGACATTAGATCATGAACAATCAAATGGAGAATGGGATGATGAACAGAGGAGAAAACAT
Protein sequenceShow/hide protein sequence
MISLAYASLSSSPSNLSSLKLRLPRPPSTFSTSLSNLKSFNPCDKAASDQKRIGNGVCRADLGNDGPFAVAIGACILSSFVFPVAGGGSDDESDAVIDSTDTRFAVMGII
SFIPYFNWLSWVFAWLDSGRRLYAVYALVYLVPYLRSNLSLSPEESWLPIASILLCIIHIQLEVSIRNGDIQPFQIFGKTSKKISSTTRGRDHFKGSQGPPEEDMKLPSI
QEQLRDEIRRWGDSKETLDHEQSNGEWDDEQRRKH