CuGenDBv2

MS017766 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS017766
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: Sec-independent protein translocase protein TatB-like
Genome location: scaffold373:2341839..2343439
RNA-Seq Expression: MS017766
Synteny: MS017766
Gene Ontology terms: NA
InterPro domains: NA
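
The genome location string follows a `scaffold:start..end` pattern. A minimal Python sketch for splitting it into its parts is given here. The function name `parse_location` is hypothetical, and treating the coordinates as 1-based and inclusive is an assumption that the record itself does not state.

```python
import re

def parse_location(location):
    """Split a 'scaffold:start..end' string into (scaffold, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    scaffold, start, end = match.groups()
    return scaffold, int(start), int(end)

scaffold, start, end = parse_location("scaffold373:2341839..2343439")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(scaffold, start, end, span)
```

Under that assumption the gene region spans 1601 bases on scaffold373.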


Homology
GenBank top hits (e value, % identity, alignment)
XP_004152379.1 uncharacterized protein LOC101219447 [Cucumis sativus] (e value: 2.4e-95; identity: 82.01%)
Query:  MLGISYGELFLLLGATAAFIGPKDLPVIARMAGRMAGRAIGYVQLARGQFDSVMQQTQARQVHKELQDTMAQLDAIRHEIRSISILNPGPLTRRLVDDPE
        MLGISYGE+ LL+GATAA IGPKDLP+I+RMAGRMAGRAIGYVQLARGQFDS+M+QT AR+VHKELQDT+AQLDAIRHEIRSISILNPGPLT+RLVD+PE
Subjt:  MLGISYGELFLLLGATAAFIGPKDLPVIARMAGRMAGRAIGYVQLARGQFDSVMQQTQARQVHKELQDTMAQLDAIRHEIRSISILNPGPLTRRLVDDPE

Query:  LRTADSGVTSDLAKEKHTVEIT-------PAASSLKVAPSQISNEHSRATTFARLAESPTIKNGSTGSFPIATDGDKLNDELGLPTVLPISAENSGLLPK
        L+ ADSGVTS+ A+EK TVE T       PAAS LKVA SQISNEHSRATTFARLAESP IKNGS+ S PI TD +KLNDE GLP VLP+SAEN+GLLPK
Subjt:  LRTADSGVTSDLAKEKHTVEIT-------PAASSLKVAPSQISNEHSRATTFARLAESPTIKNGSTGSFPIATDGDKLNDELGLPTVLPISAENSGLLPK

Query:  RPDELKGSDIMLEAVLEAEVAHNAKEFF-SHQSQMKQEQ
        RP+E KGSDIMLEAVLEAEVAHNAKEFF SHQSQMKQEQ
Subjt:  RPDELKGSDIMLEAVLEAEVAHNAKEFF-SHQSQMKQEQ

XP_022155834.1 uncharacterized protein LOC111022860 isoform X1 [Momordica charantia] (e value: 2.5e-113; identity: 97.37%)
Query:  MLGISYGELFLLLGATAAFIGPKDLPVIARMAGRMAGRAIGYVQLARGQFDSVMQQTQARQVHKELQDTMAQLDAIRHEIRSISILNPGPLTRRLVDDPE
        MLGISYGELFLLLGATAAFIGPKDLPVIARMAGRMAGRAIGYVQLARGQFDSVMQQTQARQVHKELQDTMAQLDAIRHEIRSISILNPGPLTRRLVDDPE
Subjt:  MLGISYGELFLLLGATAAFIGPKDLPVIARMAGRMAGRAIGYVQLARGQFDSVMQQTQARQVHKELQDTMAQLDAIRHEIRSISILNPGPLTRRLVDDPE

Query:  LRTADSGVTSDLAKEKHTVEITPAASSLKVAPSQISNEHSRATTFARLAESPTIKNGSTGSFPIATDGDKLNDELGLPTVLPISAENSGLLPKRPDELKG
        LRTADSGVTSDLAKEKHTVEITPAASSLKVAPSQISNEHSRATTFARLAESPTIKNGSTGSFPIATDGDKL DELGLPTVLPISAENSGLLPKRPDELKG
Subjt:  LRTADSGVTSDLAKEKHTVEITPAASSLKVAPSQISNEHSRATTFARLAESPTIKNGSTGSFPIATDGDKLNDELGLPTVLPISAENSGLLPKRPDELKG

Query:  SDIMLEAVLEAEVAHNAKEFFSHQSQMK
        SDIMLEAVLEAEVAHNAKEFFS  ++ K
Subjt:  SDIMLEAVLEAEVAHNAKEFFSHQSQMK

XP_022155851.1 uncharacterized protein LOC111022860 isoform X2 [Momordica charantia] (e value: 2.1e-96; identity: 96.97%)
Query:  MAGRMAGRAIGYVQLARGQFDSVMQQTQARQVHKELQDTMAQLDAIRHEIRSISILNPGPLTRRLVDDPELRTADSGVTSDLAKEKHTVEITPAASSLKV
        MAGRMAGRAIGYVQLARGQFDSVMQQTQARQVHKELQDTMAQLDAIRHEIRSISILNPGPLTRRLVDDPELRTADSGVTSDLAKEKHTVEITPAASSLKV
Subjt:  MAGRMAGRAIGYVQLARGQFDSVMQQTQARQVHKELQDTMAQLDAIRHEIRSISILNPGPLTRRLVDDPELRTADSGVTSDLAKEKHTVEITPAASSLKV

Query:  APSQISNEHSRATTFARLAESPTIKNGSTGSFPIATDGDKLNDELGLPTVLPISAENSGLLPKRPDELKGSDIMLEAVLEAEVAHNAKEFFSHQSQMK
        APSQISNEHSRATTFARLAESPTIKNGSTGSFPIATDGDKL DELGLPTVLPISAENSGLLPKRPDELKGSDIMLEAVLEAEVAHNAKEFFS  ++ K
Subjt:  APSQISNEHSRATTFARLAESPTIKNGSTGSFPIATDGDKLNDELGLPTVLPISAENSGLLPKRPDELKGSDIMLEAVLEAEVAHNAKEFFSHQSQMK

XP_022922242.1 uncharacterized protein LOC111430283 [Cucurbita moschata] (e value: 4.0e-95; identity: 81.59%)
Query:  MLGISYGELFLLLGATAAFIGPKDLPVIARMAGRMAGRAIGYVQLARGQFDSVMQQTQARQVHKELQDTMAQLDAIRHEIRSISILNPGPLTRRLVDDPE
        MLGISYGEL LL+GAT AFIGPKDLP IARMAGR AG+AIGYVQLARGQFDSVMQQTQARQVHKELQDTMAQLDAIRHEIRSIS LNPG LT+RLVD+PE
Subjt:  MLGISYGELFLLLGATAAFIGPKDLPVIARMAGRMAGRAIGYVQLARGQFDSVMQQTQARQVHKELQDTMAQLDAIRHEIRSISILNPGPLTRRLVDDPE

Query:  LRTADSGVTSDLAKEKHTVEI-------TPAASSLKVAPSQISNEHSRATTFARLAESPTIKNGSTGSFPIATDGDKLNDELGLPTVLPISAENSGLLPK
        L+ ADSGVTSD AKEK +VEI       TPAA+ LKVA SQIS EHSRATTFA+LAESPTI+NGS+ SFP+ATD +K NDELG+P+VLP+SAEN+G+LPK
Subjt:  LRTADSGVTSDLAKEKHTVEI-------TPAASSLKVAPSQISNEHSRATTFARLAESPTIKNGSTGSFPIATDGDKLNDELGLPTVLPISAENSGLLPK

Query:  RPDELKGSDIMLEAVLEAEVAHNAKEFFS-HQSQMKQEQ
        RP+ELKGSDIMLEAVLEAEVA++AKEFFS HQ QMKQEQ
Subjt:  RPDELKGSDIMLEAVLEAEVAHNAKEFFS-HQSQMKQEQ

XP_023551431.1 uncharacterized protein LOC111809244 [Cucurbita pepo subsp. pepo] (e value: 1.5e-94; identity: 81.17%)
Query:  MLGISYGELFLLLGATAAFIGPKDLPVIARMAGRMAGRAIGYVQLARGQFDSVMQQTQARQVHKELQDTMAQLDAIRHEIRSISILNPGPLTRRLVDDPE
        MLGISYGEL LL+GAT AFIGPKDLP IARMAGR AG+AIGYVQLARGQFDSVMQQTQARQVHKELQDTMAQLDAIRHEIRSIS LNPG LT+RLVD+PE
Subjt:  MLGISYGELFLLLGATAAFIGPKDLPVIARMAGRMAGRAIGYVQLARGQFDSVMQQTQARQVHKELQDTMAQLDAIRHEIRSISILNPGPLTRRLVDDPE

Query:  LRTADSGVTSDLAKEKHTVEI-------TPAASSLKVAPSQISNEHSRATTFARLAESPTIKNGSTGSFPIATDGDKLNDELGLPTVLPISAENSGLLPK
        L+ ADSGVTSD AKEK +VEI       TPAA+ LKVA SQIS EHSRATTFA+LAESPTI+NGS+ SFP ATD +  NDELG+P+VLP+SAEN+G+LPK
Subjt:  LRTADSGVTSDLAKEKHTVEI-------TPAASSLKVAPSQISNEHSRATTFARLAESPTIKNGSTGSFPIATDGDKLNDELGLPTVLPISAENSGLLPK

Query:  RPDELKGSDIMLEAVLEAEVAHNAKEFFS-HQSQMKQEQ
        RP+ELKGSDIMLEAV+EAEVAH+AKEFFS HQ QMKQEQ
Subjt:  RPDELKGSDIMLEAVLEAEVAHNAKEFFS-HQSQMKQEQ

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KN15 Uncharacterized protein (e value: 1.1e-95; identity: 82.01%)
Query:  MLGISYGELFLLLGATAAFIGPKDLPVIARMAGRMAGRAIGYVQLARGQFDSVMQQTQARQVHKELQDTMAQLDAIRHEIRSISILNPGPLTRRLVDDPE
        MLGISYGE+ LL+GATAA IGPKDLP+I+RMAGRMAGRAIGYVQLARGQFDS+M+QT AR+VHKELQDT+AQLDAIRHEIRSISILNPGPLT+RLVD+PE
Subjt:  MLGISYGELFLLLGATAAFIGPKDLPVIARMAGRMAGRAIGYVQLARGQFDSVMQQTQARQVHKELQDTMAQLDAIRHEIRSISILNPGPLTRRLVDDPE

Query:  LRTADSGVTSDLAKEKHTVEIT-------PAASSLKVAPSQISNEHSRATTFARLAESPTIKNGSTGSFPIATDGDKLNDELGLPTVLPISAENSGLLPK
        L+ ADSGVTS+ A+EK TVE T       PAAS LKVA SQISNEHSRATTFARLAESP IKNGS+ S PI TD +KLNDE GLP VLP+SAEN+GLLPK
Subjt:  LRTADSGVTSDLAKEKHTVEIT-------PAASSLKVAPSQISNEHSRATTFARLAESPTIKNGSTGSFPIATDGDKLNDELGLPTVLPISAENSGLLPK

Query:  RPDELKGSDIMLEAVLEAEVAHNAKEFF-SHQSQMKQEQ
        RP+E KGSDIMLEAVLEAEVAHNAKEFF SHQSQMKQEQ
Subjt:  RPDELKGSDIMLEAVLEAEVAHNAKEFF-SHQSQMKQEQ

A0A1S3ATI2 sec-independent protein translocase protein TatB (e value: 1.7e-94; identity: 82.01%)
Query:  MLGISYGELFLLLGATAAFIGPKDLPVIARMAGRMAGRAIGYVQLARGQFDSVMQQTQARQVHKELQDTMAQLDAIRHEIRSISILNPGPLTRRLVDDPE
        MLGISYGEL LL+GATAA IGPKDLP+I+RMAGRMAGRAIGYVQLARGQFDSVM+QT ARQVHKELQDT+AQLDAIRHEIRSISILNPGPLT+RLVD+PE
Subjt:  MLGISYGELFLLLGATAAFIGPKDLPVIARMAGRMAGRAIGYVQLARGQFDSVMQQTQARQVHKELQDTMAQLDAIRHEIRSISILNPGPLTRRLVDDPE

Query:  LRTADSGVTSDLAKEKHTVEITP-------AASSLKVAPSQISNEHSRATTFARLAESPTIKNGSTGSFPIATDGDKLNDELGLPTVLPISAENSGLLPK
        L+ A SGVTS+ A+EK TVE TP        AS LKVA SQISNEHSRATTFARLAESP IKNGS+ S PI TD +KLNDE GLP VLP+SAEN+GLLPK
Subjt:  LRTADSGVTSDLAKEKHTVEITP-------AASSLKVAPSQISNEHSRATTFARLAESPTIKNGSTGSFPIATDGDKLNDELGLPTVLPISAENSGLLPK

Query:  RPDELKGSDIMLEAVLEAEVAHNAKEFF-SHQSQMKQEQ
        RP+E KGSDIMLEAVLEAEVAHNAKEFF S QSQMKQEQ
Subjt:  RPDELKGSDIMLEAVLEAEVAHNAKEFF-SHQSQMKQEQ

A0A6J1DRF4 uncharacterized protein LOC111022860 isoform X1 (e value: 1.2e-113; identity: 97.37%)
Query:  MLGISYGELFLLLGATAAFIGPKDLPVIARMAGRMAGRAIGYVQLARGQFDSVMQQTQARQVHKELQDTMAQLDAIRHEIRSISILNPGPLTRRLVDDPE
        MLGISYGELFLLLGATAAFIGPKDLPVIARMAGRMAGRAIGYVQLARGQFDSVMQQTQARQVHKELQDTMAQLDAIRHEIRSISILNPGPLTRRLVDDPE
Subjt:  MLGISYGELFLLLGATAAFIGPKDLPVIARMAGRMAGRAIGYVQLARGQFDSVMQQTQARQVHKELQDTMAQLDAIRHEIRSISILNPGPLTRRLVDDPE

Query:  LRTADSGVTSDLAKEKHTVEITPAASSLKVAPSQISNEHSRATTFARLAESPTIKNGSTGSFPIATDGDKLNDELGLPTVLPISAENSGLLPKRPDELKG
        LRTADSGVTSDLAKEKHTVEITPAASSLKVAPSQISNEHSRATTFARLAESPTIKNGSTGSFPIATDGDKL DELGLPTVLPISAENSGLLPKRPDELKG
Subjt:  LRTADSGVTSDLAKEKHTVEITPAASSLKVAPSQISNEHSRATTFARLAESPTIKNGSTGSFPIATDGDKLNDELGLPTVLPISAENSGLLPKRPDELKG

Query:  SDIMLEAVLEAEVAHNAKEFFSHQSQMK
        SDIMLEAVLEAEVAHNAKEFFS  ++ K
Subjt:  SDIMLEAVLEAEVAHNAKEFFSHQSQMK

A0A6J1DSY9 uncharacterized protein LOC111022860 isoform X2 (e value: 1.0e-96; identity: 96.97%)
Query:  MAGRMAGRAIGYVQLARGQFDSVMQQTQARQVHKELQDTMAQLDAIRHEIRSISILNPGPLTRRLVDDPELRTADSGVTSDLAKEKHTVEITPAASSLKV
        MAGRMAGRAIGYVQLARGQFDSVMQQTQARQVHKELQDTMAQLDAIRHEIRSISILNPGPLTRRLVDDPELRTADSGVTSDLAKEKHTVEITPAASSLKV
Subjt:  MAGRMAGRAIGYVQLARGQFDSVMQQTQARQVHKELQDTMAQLDAIRHEIRSISILNPGPLTRRLVDDPELRTADSGVTSDLAKEKHTVEITPAASSLKV

Query:  APSQISNEHSRATTFARLAESPTIKNGSTGSFPIATDGDKLNDELGLPTVLPISAENSGLLPKRPDELKGSDIMLEAVLEAEVAHNAKEFFSHQSQMK
        APSQISNEHSRATTFARLAESPTIKNGSTGSFPIATDGDKL DELGLPTVLPISAENSGLLPKRPDELKGSDIMLEAVLEAEVAHNAKEFFS  ++ K
Subjt:  APSQISNEHSRATTFARLAESPTIKNGSTGSFPIATDGDKLNDELGLPTVLPISAENSGLLPKRPDELKGSDIMLEAVLEAEVAHNAKEFFSHQSQMK

A0A6J1E2P5 uncharacterized protein LOC111430283 (e value: 2.0e-95; identity: 81.59%)
Query:  MLGISYGELFLLLGATAAFIGPKDLPVIARMAGRMAGRAIGYVQLARGQFDSVMQQTQARQVHKELQDTMAQLDAIRHEIRSISILNPGPLTRRLVDDPE
        MLGISYGEL LL+GAT AFIGPKDLP IARMAGR AG+AIGYVQLARGQFDSVMQQTQARQVHKELQDTMAQLDAIRHEIRSIS LNPG LT+RLVD+PE
Subjt:  MLGISYGELFLLLGATAAFIGPKDLPVIARMAGRMAGRAIGYVQLARGQFDSVMQQTQARQVHKELQDTMAQLDAIRHEIRSISILNPGPLTRRLVDDPE

Query:  LRTADSGVTSDLAKEKHTVEI-------TPAASSLKVAPSQISNEHSRATTFARLAESPTIKNGSTGSFPIATDGDKLNDELGLPTVLPISAENSGLLPK
        L+ ADSGVTSD AKEK +VEI       TPAA+ LKVA SQIS EHSRATTFA+LAESPTI+NGS+ SFP+ATD +K NDELG+P+VLP+SAEN+G+LPK
Subjt:  LRTADSGVTSDLAKEKHTVEI-------TPAASSLKVAPSQISNEHSRATTFARLAESPTIKNGSTGSFPIATDGDKLNDELGLPTVLPISAENSGLLPK

Query:  RPDELKGSDIMLEAVLEAEVAHNAKEFFS-HQSQMKQEQ
        RP+ELKGSDIMLEAVLEAEVA++AKEFFS HQ QMKQEQ
Subjt:  RPDELKGSDIMLEAVLEAEVAHNAKEFFS-HQSQMKQEQ

SwissProt top hits (e value, % identity, alignment)
A4G9I1 Sec-independent protein translocase protein TatB (e value: 8.1e-06; identity: 33.78%)
Query:  MLGISYGELFLLLGATAAFIGPKDLPVIARMAGRMAGRAIGYVQLARGQFDSVMQQTQARQVHKELQDTMAQLD
        M+ I++ +L ++  A   FIGP+ LP +ARMAG + GRA  Y+   + +    M+  + R++HK++QD    ++
Subjt:  MLGISYGELFLLLGATAAFIGPKDLPVIARMAGRMAGRAIGYVQLARGQFDSVMQQTQARQVHKELQDTMAQLD

Q0C0V9 Sec-independent protein translocase protein TatB (e value: 7.6e-04; identity: 34.85%)
Query:  GISYGELFLLLGATAAFIGPKDLPVIARMAGRMAGRAIGYVQLARGQFDSVMQQTQARQVHKELQD
        GI + EL L+  A    IGPKDLP++ R  G++ G+     +  +  FD + +Q++  ++ KE+QD
Subjt:  GISYGELFLLLGATAAFIGPKDLPVIARMAGRMAGRAIGYVQLARGQFDSVMQQTQARQVHKELQD

Arabidopsis top hits (e value, % identity, alignment)
AT5G43680.1 unknown protein (e value: 7.7e-44; identity: 48.46%)
Query:  MLGISYGELFLLLGATAAFIGPKDLPVIARMAGRMAGRAIGYVQLARGQFDSVMQQTQARQVHKELQDTMAQLDAIRHEIRSISILNPGPLTRRL----V
        MLG+SYGEL L++GATAA +GPKDLP+IAR  GR+ GRAIGY+ +ARG  D VM+Q Q +++ KE+QD  AQ+DAI H  R  S+ +  PLTRR+     
Subjt:  MLGISYGELFLLLGATAAFIGPKDLPVIARMAGRMAGRAIGYVQLARGQFDSVMQQTQARQVHKELQDTMAQLDAIRHEIRSISILNPGPLTRRL----V

Query:  DDPELRTADSGVTSDLAKEKH-TVEITPAASSLKVAPSQISNEHSRATTFARLAESPTIKNGSTGSFPIATDGDKLNDELGLPTVLPISAENSGLLPKRP
         +P   T +  VTS   +EK   V+    A     + S   N H++AT+FARL+E+    NG T S         LN +     VLP+SAE + LLP+R 
Subjt:  DDPELRTADSGVTSDLAKEKH-TVEITPAASSLKVAPSQISNEHSRATTFARLAESPTIKNGSTGSFPIATDGDKLNDELGLPTVLPISAENSGLLPKRP

Query:  DELKGSDIMLEAVLEAEVAHNAKEFFS
        +  +GSD+MLEAVLEAEVAH AK FF+
Subjt:  DELKGSDIMLEAVLEAEVAHNAKEFFS

AT5G43680.2 unknown protein (e value: 7.7e-44; identity: 48.46%)
Query:  MLGISYGELFLLLGATAAFIGPKDLPVIARMAGRMAGRAIGYVQLARGQFDSVMQQTQARQVHKELQDTMAQLDAIRHEIRSISILNPGPLTRRL----V
        MLG+SYGEL L++GATAA +GPKDLP+IAR  GR+ GRAIGY+ +ARG  D VM+Q Q +++ KE+QD  AQ+DAI H  R  S+ +  PLTRR+     
Subjt:  MLGISYGELFLLLGATAAFIGPKDLPVIARMAGRMAGRAIGYVQLARGQFDSVMQQTQARQVHKELQDTMAQLDAIRHEIRSISILNPGPLTRRL----V

Query:  DDPELRTADSGVTSDLAKEKH-TVEITPAASSLKVAPSQISNEHSRATTFARLAESPTIKNGSTGSFPIATDGDKLNDELGLPTVLPISAENSGLLPKRP
         +P   T +  VTS   +EK   V+    A     + S   N H++AT+FARL+E+    NG T S         LN +     VLP+SAE + LLP+R 
Subjt:  DDPELRTADSGVTSDLAKEKH-TVEITPAASSLKVAPSQISNEHSRATTFARLAESPTIKNGSTGSFPIATDGDKLNDELGLPTVLPISAENSGLLPKRP

Query:  DELKGSDIMLEAVLEAEVAHNAKEFFS
        +  +GSD+MLEAVLEAEVAH AK FF+
Subjt:  DELKGSDIMLEAVLEAEVAHNAKEFFS


Sequences
CDS sequence
ATGCTGGGCATTTCATATGGAGAACTCTTCCTCCTTCTTGGAGCTACTGCTGCCTTCATTGGGCCAAAGGATCTCCCAGTGATAGCAAGAATGGCAGGGAGGATGGCTGG
AAGAGCAATTGGATATGTTCAGTTAGCTCGAGGTCAGTTTGACTCTGTCATGCAACAAACTCAAGCTCGCCAGGTTCACAAGGAATTGCAAGACACTATGGCTCAGCTTG
ATGCTATTCGTCATGAAATCCGAAGCATATCGATTCTGAATCCTGGGCCATTGACTCGGAGGCTTGTGGATGATCCCGAGCTCAGAACAGCTGATAGTGGCGTAACTAGT
GATTTAGCAAAAGAAAAACATACTGTGGAGATCACACCAGCAGCTAGCAGTCTGAAGGTAGCACCTTCACAAATATCAAATGAGCACAGCCGAGCAACTACATTCGCCAG
ATTGGCTGAATCACCAACCATAAAGAATGGTTCCACTGGCTCATTTCCAATTGCTACAGACGGAGATAAGCTTAATGATGAGTTAGGACTTCCTACAGTTTTACCCATAT
CCGCGGAAAATAGTGGGCTGTTACCTAAACGCCCGGATGAGTTAAAAGGATCTGACATAATGTTAGAAGCAGTACTGGAAGCAGAAGTGGCTCACAATGCAAAAGAATTT
TTTTCTCACCAAAGCCAAATGAAACAAGAACAAGGA
mRNA sequence
ATGCTGGGCATTTCATATGGAGAACTCTTCCTCCTTCTTGGAGCTACTGCTGCCTTCATTGGGCCAAAGGATCTCCCAGTGATAGCAAGAATGGCAGGGAGGATGGCTGG
AAGAGCAATTGGATATGTTCAGTTAGCTCGAGGTCAGTTTGACTCTGTCATGCAACAAACTCAAGCTCGCCAGGTTCACAAGGAATTGCAAGACACTATGGCTCAGCTTG
ATGCTATTCGTCATGAAATCCGAAGCATATCGATTCTGAATCCTGGGCCATTGACTCGGAGGCTTGTGGATGATCCCGAGCTCAGAACAGCTGATAGTGGCGTAACTAGT
GATTTAGCAAAAGAAAAACATACTGTGGAGATCACACCAGCAGCTAGCAGTCTGAAGGTAGCACCTTCACAAATATCAAATGAGCACAGCCGAGCAACTACATTCGCCAG
ATTGGCTGAATCACCAACCATAAAGAATGGTTCCACTGGCTCATTTCCAATTGCTACAGACGGAGATAAGCTTAATGATGAGTTAGGACTTCCTACAGTTTTACCCATAT
CCGCGGAAAATAGTGGGCTGTTACCTAAACGCCCGGATGAGTTAAAAGGATCTGACATAATGTTAGAAGCAGTACTGGAAGCAGAAGTGGCTCACAATGCAAAAGAATTT
TTTTCTCACCAAAGCCAAATGAAACAAGAACAAGGA
Protein sequence
MLGISYGELFLLLGATAAFIGPKDLPVIARMAGRMAGRAIGYVQLARGQFDSVMQQTQARQVHKELQDTMAQLDAIRHEIRSISILNPGPLTRRLVDDPELRTADSGVTS
DLAKEKHTVEITPAASSLKVAPSQISNEHSRATTFARLAESPTIKNGSTGSFPIATDGDKLNDELGLPTVLPISAENSGLLPKRPDELKGSDIMLEAVLEAEVAHNAKEF
FSHQSQMKQEQG
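
The protein sequence above is consistent with a codon-by-codon translation of the CDS. A minimal Python sketch of that translation follows, assuming the standard genetic code and a reading frame that starts at the first base. The function name `translate` is hypothetical, and this is not necessarily how the database derived the protein.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, amino acids listed in TCAG codon order ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence into one-letter amino acid codes."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped sequence text
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First 30 bases of the CDS sequence listed above.
print(translate("ATGCTGGGCATTTCATATGGAGAACTCTTC"))  # MLGISYGELF
```

Passing the full CDS (with or without its line breaks) to `translate` allows a direct comparison against the listed protein sequence.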