CuGenDBv2

MC08g0401 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC08g0401
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: DUF4050 domain-containing protein
Genome location: MC08:3256986..3260578
RNA-Seq Expression: MC08g0401
Synteny: MC08g0401
Gene Ontology terms: GO:0016020 - membrane (cellular component)
InterPro domains: IPR025124 - Domain of unknown function DUF4050
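
The Genome location field can be split into its parts programmatically. The following is a minimal sketch, not part of the database itself: the function name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end, span).

    Assumption: coordinates are 1-based and inclusive, so the span is
    end - start + 1. The record itself does not state the convention.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    seqid = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    return seqid, start, end, end - start + 1

# Under the inclusive assumption, this locus spans 3593 bp on MC08.
print(parse_location("MC08:3256986..3260578"))
```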


Homology
GenBank top hits (columns: hit, e-value, % identity, alignment)
KAA0064925.1 uncharacterized protein E6C27_scaffold82G002430 [Cucumis melo var. makuwa]  e-value: 4.10e-122  identity: 86.73%
Query:  MYSRCCLLSRLEGCSSKKPCCSFLQFSGEYLRALILLMVDNIKLLFNRRCCSGCCTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTV
        MYSRCCLL+RLEGCSSK PCCSFLQFSGEY+RALILLMVD IKLLF++R   GCC+A ALGNAMDGPSKGLRV+D+EAKKQCLPENFPSSST EMDNSTV
Subjt:  MYSRCCLLSRLEGCSSKKPCCSFLQFSGEYLRALILLMVDNIKLLFNRRCCSGCCTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTV

Query:  WSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWTGNKLSESQKQVREPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD
        WSQRS+ASA+SHDS SNIGSS DFVNSGLLLWNETRKQW GNK+S+SQKQV+EPKISWNATY+SLL TNKPFPEA+PL EMIEFLVDVWEQEGLYD
Subjt:  WSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWTGNKLSESQKQVREPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD

XP_004138726.1 uncharacterized protein LOC101216869 [Cucumis sativus]  e-value: 1.09e-121  identity: 85.71%
Query:  MYSRCCLLSRLEGCSSKKPCCSFLQFSGEYLRALILLMVDNIKLLFNRRCCSGCCTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTV
        MYSRCCLL+RLEGCSSK PCCSFLQFSGEY+RALILLMVD IKLLF++R   GCCTA ALGNAMDGPSKGLRV+++EAKKQCLPENFPSSST EMDNSTV
Subjt:  MYSRCCLLSRLEGCSSKKPCCSFLQFSGEYLRALILLMVDNIKLLFNRRCCSGCCTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTV

Query:  WSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWTGNKLSESQKQVREPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD
        WSQRS+AS ++HDSHSNIGSS DFVNSGLLLWNETRKQW GNK+S SQKQV+EPKISWNATY++LL TNKPFPEA+PL EMIEFLVDVWEQEGLYD
Subjt:  WSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWTGNKLSESQKQVREPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD

XP_008445211.1 PREDICTED: uncharacterized protein LOC103488310 isoform X1 [Cucumis melo]  e-value: 1.33e-122  identity: 86.73%
Query:  MYSRCCLLSRLEGCSSKKPCCSFLQFSGEYLRALILLMVDNIKLLFNRRCCSGCCTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTV
        MYSRCCLL+RLEGCSSK PCCSFLQFSGEY+RALILLMVD IKLLF++R   GCC+A ALGNAMDGPSKGLRV+D+EAKKQCLPENFPSSST EMDNSTV
Subjt:  MYSRCCLLSRLEGCSSKKPCCSFLQFSGEYLRALILLMVDNIKLLFNRRCCSGCCTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTV

Query:  WSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWTGNKLSESQKQVREPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD
        WSQRS+ASA+SHDS SNIGSS DFVNSGLLLWNETRKQW GNK+S+SQKQV+EPKISWNATY+SLL TNKPFPEA+PL EMIEFLVDVWEQEGLYD
Subjt:  WSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWTGNKLSESQKQVREPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD

XP_022951409.1 uncharacterized protein LOC111454240 isoform X1 [Cucurbita moschata]  e-value: 2.91e-121  identity: 86.73%
Query:  MYSRCCLLSRLEGCSSKKPCCSFLQFSGEYLRALILLMVDNIKLLFNRRCCSGCCTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTV
        MYSRCCLLSRLEGCSSK PCCSFLQFSG+YLRALI+L+VDN+KLLF+RR C G CT PALG+AMDGPS GLRVEDQEAKKQCLPENF SSST EMDNSTV
Subjt:  MYSRCCLLSRLEGCSSKKPCCSFLQFSGEYLRALILLMVDNIKLLFNRRCCSGCCTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTV

Query:  WSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWTGNKLSESQKQVREPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD
        WSQRS+ASA+SHDSH+N+GSS +FVNSGLLLWNETRKQW GNK SESQK+VREPKISWNATY+SLL TNKPFPEA+PLAEMIEFLVDVWEQEGLYD
Subjt:  WSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWTGNKLSESQKQVREPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD

XP_038885342.1 uncharacterized protein LOC120075759 isoform X1 [Benincasa hispida]  e-value: 1.96e-124  identity: 88.27%
Query:  MYSRCCLLSRLEGCSSKKPCCSFLQFSGEYLRALILLMVDNIKLLFNRRCCSGCCTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTV
        MYSRCCLL RLEGCSSKKPCCSFLQFSGEYLRALILLMVDNIKLLF+RR C GCCTA AL NAMDGPSKGLRV+DQEAKKQCLPEN PSSST EMDNSTV
Subjt:  MYSRCCLLSRLEGCSSKKPCCSFLQFSGEYLRALILLMVDNIKLLFNRRCCSGCCTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTV

Query:  WSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWTGNKLSESQKQVREPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD
        WSQRS+ASA SHDSHSNIGSS DFVNSGLLLWNETRKQW GNK+SE QKQV+EPKISW+ATY+SLL TNKPFPE VPL EMI+FLVDVWEQ+GLYD
Subjt:  WSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWTGNKLSESQKQVREPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD

TrEMBL top hits (columns: hit, e-value, % identity, alignment)
A0A0A0LPL3 Uncharacterized protein  e-value: 5.27e-122  identity: 85.71%
Query:  MYSRCCLLSRLEGCSSKKPCCSFLQFSGEYLRALILLMVDNIKLLFNRRCCSGCCTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTV
        MYSRCCLL+RLEGCSSK PCCSFLQFSGEY+RALILLMVD IKLLF++R   GCCTA ALGNAMDGPSKGLRV+++EAKKQCLPENFPSSST EMDNSTV
Subjt:  MYSRCCLLSRLEGCSSKKPCCSFLQFSGEYLRALILLMVDNIKLLFNRRCCSGCCTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTV

Query:  WSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWTGNKLSESQKQVREPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD
        WSQRS+AS ++HDSHSNIGSS DFVNSGLLLWNETRKQW GNK+S SQKQV+EPKISWNATY++LL TNKPFPEA+PL EMIEFLVDVWEQEGLYD
Subjt:  WSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWTGNKLSESQKQVREPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD

A0A1S3BC47 uncharacterized protein LOC103488310 isoform X1  e-value: 6.42e-123  identity: 86.73%
Query:  MYSRCCLLSRLEGCSSKKPCCSFLQFSGEYLRALILLMVDNIKLLFNRRCCSGCCTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTV
        MYSRCCLL+RLEGCSSK PCCSFLQFSGEY+RALILLMVD IKLLF++R   GCC+A ALGNAMDGPSKGLRV+D+EAKKQCLPENFPSSST EMDNSTV
Subjt:  MYSRCCLLSRLEGCSSKKPCCSFLQFSGEYLRALILLMVDNIKLLFNRRCCSGCCTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTV

Query:  WSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWTGNKLSESQKQVREPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD
        WSQRS+ASA+SHDS SNIGSS DFVNSGLLLWNETRKQW GNK+S+SQKQV+EPKISWNATY+SLL TNKPFPEA+PL EMIEFLVDVWEQEGLYD
Subjt:  WSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWTGNKLSESQKQVREPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD

A0A5A7VGA9 Uncharacterized protein  e-value: 1.99e-122  identity: 86.73%
Query:  MYSRCCLLSRLEGCSSKKPCCSFLQFSGEYLRALILLMVDNIKLLFNRRCCSGCCTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTV
        MYSRCCLL+RLEGCSSK PCCSFLQFSGEY+RALILLMVD IKLLF++R   GCC+A ALGNAMDGPSKGLRV+D+EAKKQCLPENFPSSST EMDNSTV
Subjt:  MYSRCCLLSRLEGCSSKKPCCSFLQFSGEYLRALILLMVDNIKLLFNRRCCSGCCTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTV

Query:  WSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWTGNKLSESQKQVREPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD
        WSQRS+ASA+SHDS SNIGSS DFVNSGLLLWNETRKQW GNK+S+SQKQV+EPKISWNATY+SLL TNKPFPEA+PL EMIEFLVDVWEQEGLYD
Subjt:  WSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWTGNKLSESQKQVREPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD

A0A6J1GIP5 uncharacterized protein LOC111454240 isoform X1  e-value: 1.41e-121  identity: 86.73%
Query:  MYSRCCLLSRLEGCSSKKPCCSFLQFSGEYLRALILLMVDNIKLLFNRRCCSGCCTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTV
        MYSRCCLLSRLEGCSSK PCCSFLQFSG+YLRALI+L+VDN+KLLF+RR C G CT PALG+AMDGPS GLRVEDQEAKKQCLPENF SSST EMDNSTV
Subjt:  MYSRCCLLSRLEGCSSKKPCCSFLQFSGEYLRALILLMVDNIKLLFNRRCCSGCCTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTV

Query:  WSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWTGNKLSESQKQVREPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD
        WSQRS+ASA+SHDSH+N+GSS +FVNSGLLLWNETRKQW GNK SESQK+VREPKISWNATY+SLL TNKPFPEA+PLAEMIEFLVDVWEQEGLYD
Subjt:  WSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWTGNKLSESQKQVREPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD

A0A6J1KQM2 uncharacterized protein LOC111496323 isoform X1  e-value: 1.65e-120  identity: 85.71%
Query:  MYSRCCLLSRLEGCSSKKPCCSFLQFSGEYLRALILLMVDNIKLLFNRRCCSGCCTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTV
        MYSRCCLLSRLEGCSSK PCCSFLQFSG+YLRALI+L+VDN+KLLF+RR C G CT PALG+AMDGPS GLRV+DQEAKKQCLP+NF SSST EMDNSTV
Subjt:  MYSRCCLLSRLEGCSSKKPCCSFLQFSGEYLRALILLMVDNIKLLFNRRCCSGCCTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTV

Query:  WSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWTGNKLSESQKQVREPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD
        WSQRS+ASA+SHDSH+N+GSS +FVNSGLLLWNETRKQW GNK SESQK+VREPKISWNATY+SLL TNKPFPEA+PLAEMIEFLVDVWEQEGLYD
Subjt:  WSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRKQWTGNKLSESQKQVREPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD

SwissProt top hits (columns: hit, e-value, % identity, alignment)
No hits found
Arabidopsis top hits (columns: hit, e-value, % identity, alignment)
AT1G15350.1 unknown protein  e-value: 4.7e-26  identity: 45.22%
Query:  CSGC-----CTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASA-KSHDSHS---NIGSSRDFVNSGLLLWNETRKQWTG
        C GC      TA +L    D PS  +    +  KK  + E+F S+ST +MDN T  SQ S++S+ ++ DS S   N  +  ++VN GLLLWN+TR++W G
Subjt:  CSGC-----CTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASA-KSHDSHS---NIGSSRDFVNSGLLLWNETRKQWTG

Query:  -NKLSESQKQVREPKISWN-ATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD
         +K +      +  K++WN ATY+SLLG+NK FP+ +PL EM++FLVD+WEQEGLYD
Subjt:  -NKLSESQKQVREPKISWN-ATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD

AT1G15350.2 unknown protein  e-value: 4.7e-26  identity: 45.22%
Query:  CSGC-----CTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASA-KSHDSHS---NIGSSRDFVNSGLLLWNETRKQWTG
        C GC      TA +L    D PS  +    +  KK  + E+F S+ST +MDN T  SQ S++S+ ++ DS S   N  +  ++VN GLLLWN+TR++W G
Subjt:  CSGC-----CTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASA-KSHDSHS---NIGSSRDFVNSGLLLWNETRKQWTG

Query:  -NKLSESQKQVREPKISWN-ATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD
         +K +      +  K++WN ATY+SLLG+NK FP+ +PL EM++FLVD+WEQEGLYD
Subjt:  -NKLSESQKQVREPKISWN-ATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD

AT4G32342.1 unknown protein  e-value: 3.1e-30  identity: 48.1%
Query:  NIKLLFNRRCCSGCCTAP-ALGNAMDGPSKGLRVEDQEAKK-QCLPENFPSSSTYEMD-NSTVWSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRK
        N K L N   C GCC     L   +D PSKGL+++ +  KK     ++F S+ST +MD N T+ SQ   +S    D   +  +S +FVN GL+LWN TR+
Subjt:  NIKLLFNRRCCSGCCTAP-ALGNAMDGPSKGLRVEDQEAKK-QCLPENFPSSSTYEMD-NSTVWSQRSIASAKSHDSHSNIGSSRDFVNSGLLLWNETRK

Query:  QWTGNKLSESQKQVREPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLY
        QW    L+  Q  V EP ISWN+TY+SLL TNK FP+ +PL EM+ FLVDVWE+EGLY
Subjt:  QWTGNKLSESQKQVREPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLY

AT5G25360.1 unknown protein  e-value: 1.1e-40  identity: 56.38%
Query:  CSGCCTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASAKSHDSHSNIGSS---RDFVNSGLLLWNETRKQWTGNKLSES
        C GCC  P L  A+D PSKGLR++ +  KK  + E+F S+ST EMDNST+ SQRS++S    ++ S   S+    +FVN GL LWN+TR+QW  N  S+ 
Subjt:  CSGCCTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASAKSHDSHSNIGSS---RDFVNSGLLLWNETRKQWTGNKLSES

Query:  QKQVREPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD
        + +VREP ISWNATYESLLG NK F   +PL EM++FLVDVWEQEGLYD
Subjt:  QKQVREPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD

AT5G25360.2 unknown protein  e-value: 1.1e-40  identity: 56.38%
Query:  CSGCCTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASAKSHDSHSNIGSS---RDFVNSGLLLWNETRKQWTGNKLSES
        C GCC  P L  A+D PSKGLR++ +  KK  + E+F S+ST EMDNST+ SQRS++S    ++ S   S+    +FVN GL LWN+TR+QW  N  S+ 
Subjt:  CSGCCTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASAKSHDSHSNIGSS---RDFVNSGLLLWNETRKQWTGNKLSES

Query:  QKQVREPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD
        + +VREP ISWNATYESLLG NK F   +PL EM++FLVDVWEQEGLYD
Subjt:  QKQVREPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD
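
The % identity values reported for the hits above can be cross-checked against the alignment midlines (the unlabeled row between Query and Subjt). The sketch below assumes the usual BLAST convention that a letter in the midline marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. The function name `midline_identity` is hypothetical, and how this page derives its reported percentages is not documented here.

```python
def midline_identity(blocks):
    """Count identities over (query_row, midline_row) alignment blocks.

    Each row is the residue string only, without the 'Query:' label or
    leading padding. Returns (identical, aligned_columns, percent).
    """
    identical = 0
    columns = 0
    for query_row, midline_row in blocks:
        width = len(query_row)
        columns += width
        # A letter in the midline marks an identical pair; '+' and ' ' do not.
        identical += sum(ch.isalpha() for ch in midline_row[:width])
    if columns == 0:
        raise ValueError("no aligned columns")
    return identical, columns, 100.0 * identical / columns

# First ten columns of the KAA0064925.1 alignment: 9 of 10 identical.
print(midline_identity([("MYSRCCLLSR", "MYSRCCLL+R")]))
```

Passing every block of one hit gives a figure that can be compared with the percentage listed for that hit.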


Sequences
CDS sequence
ATGTATTCTAGGTGTTGTCTCCTCAGCCGTTTAGAGGGTTGCTCTAGCAAGAAACCATGTTGCTCGTTTTTACAGTTTTCTGGAGAATATCTGCGCGCTCTTATACTTTT
GATGGTGGATAATATCAAGCTTCTTTTCAATAGAAGATGCTGTAGTGGATGCTGCACTGCACCTGCACTAGGTAATGCAATGGATGGACCATCTAAAGGTCTGAGAGTTG
AAGACCAAGAAGCGAAAAAACAATGCTTACCGGAAAATTTCCCAAGTTCTAGCACATATGAAATGGACAACAGTACAGTTTGGTCCCAGAGAAGCATTGCATCAGCCAAG
TCTCATGATTCCCACAGCAATATTGGGAGCAGTAGAGACTTTGTAAATTCTGGTCTGCTTCTTTGGAACGAGACCAGGAAGCAATGGACTGGAAATAAATTGTCCGAGAG
CCAAAAGCAAGTTCGAGAACCGAAAATAAGTTGGAATGCTACTTATGAGAGCTTGTTAGGAACGAACAAGCCGTTCCCCGAGGCCGTGCCTCTTGCTGAGATGATAGAGT
TTCTTGTTGATGTCTGGGAGCAGGAGGGTCTGTATGACTGA
mRNA sequence
ATGTATTCTAGGTGTTGTCTCCTCAGCCGTTTAGAGGGTTGCTCTAGCAAGAAACCATGTTGCTCGTTTTTACAGTTTTCTGGAGAATATCTGCGCGCTCTTATACTTTT
GATGGTGGATAATATCAAGCTTCTTTTCAATAGAAGATGCTGTAGTGGATGCTGCACTGCACCTGCACTAGGTAATGCAATGGATGGACCATCTAAAGGTCTGAGAGTTG
AAGACCAAGAAGCGAAAAAACAATGCTTACCGGAAAATTTCCCAAGTTCTAGCACATATGAAATGGACAACAGTACAGTTTGGTCCCAGAGAAGCATTGCATCAGCCAAG
TCTCATGATTCCCACAGCAATATTGGGAGCAGTAGAGACTTTGTAAATTCTGGTCTGCTTCTTTGGAACGAGACCAGGAAGCAATGGACTGGAAATAAATTGTCCGAGAG
CCAAAAGCAAGTTCGAGAACCGAAAATAAGTTGGAATGCTACTTATGAGAGCTTGTTAGGAACGAACAAGCCGTTCCCCGAGGCCGTGCCTCTTGCTGAGATGATAGAGT
TTCTTGTTGATGTCTGGGAGCAGGAGGGTCTGTATGACTGAGCCTGCAATTCAAGGGGAGTCTTCATATTGTTTAAACTTATGCAGCTTGAATTTTTCGCTGCTGAATCT
ATATGTTCGATTATGCTCGATCGCGATGCTTATAGTATACAAATAATAGCGTTTATGTTGAATTTATGCTATAATCAAGAATGGTATCATGGTTATAAAATTTTTAGTTT
TAGATTACTTTGTTGTATGGTCTTTATGTTGAATTTCTGCCACACTTGAGCTGTCTGGGGTTCGCATCTTTGCTGGTTGGGGAAGGTGGAAGTGAATACAGTTTGGGCTT
CTGGGAATTCCTAAGCCCATGAAGATGAAGATAAGTTGTTGGCAGAATCTTATTAAAATATTTAGGTTATGTAAATTTAATAAATTGCACAAATTAATAGGCCAAGTAAT
TAATAGGCCAGAAATGCTCCAAAATAGAGTCTCAAACTCAACATTGAACTACCCAAATAAAGGAGGAAGAGAGAGAGAAAAAAATGTTGGCCACAAAAATGAGCATCTAC
AAATAGTGTAATGTATGGGGCATATAAAAGAAAATGGAATATTAAAAAAAAGAAAAAAAAAGGAAGGAGGTTGAAGTTGAGGATAAATGAAATATCCAAATGGCAAAAGA
TGAGAGGGTGGGAGAATGATTAAATGAGGCATTGAATGAGTAAGAAATGTGGGTGAAAATAAAATGATTAGGGTTTGACAGCAAATGTGTGTGTGTTGGGGTTTTGGTGA
GACAAAGAAAATGGCTGTGGGGTCACATTTATTGACATTTTAGATGATGTGACCACCTTAAAACCCCATCCCCCACTTCTTCATTCAATTCAACACTCATTTCAAAAAAA
AATATATATAATTCTCCTAAAAATAAATAAATAAATATTCGAAATTCAATAATAAGATACTGATTATTTTTAAGTCGAAAATTTCTTCCATTTGTTGTACTAAATAAAAA
TGGGGTTG
Protein sequence
MYSRCCLLSRLEGCSSKKPCCSFLQFSGEYLRALILLMVDNIKLLFNRRCCSGCCTAPALGNAMDGPSKGLRVEDQEAKKQCLPENFPSSSTYEMDNSTVWSQRSIASAK
SHDSHSNIGSSRDFVNSGLLLWNETRKQWTGNKLSESQKQVREPKISWNATYESLLGTNKPFPEAVPLAEMIEFLVDVWEQEGLYD
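
The relationship between the CDS and protein sequences above can be checked by translating the CDS. This is a minimal sketch assuming the standard nuclear genetic code (NCBI translation table 1); the function name `translate` is hypothetical, and ambiguous bases such as N are not handled.

```python
from itertools import product

# Standard genetic code, codons ordered with T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # KeyError on non-ACGT bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First nine codons and last five codons of the CDS listed above.
print(translate("ATGTATTCTAGGTGTTGTCTCCTCAGC"))  # MYSRCCLLS
print(translate("GGTCTGTATGACTGA"))              # GLYD
```

Applied to the full CDS with its line breaks removed, the function is expected to reproduce the protein sequence listed above.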