CuGenDBv2

MC03g0858 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC03g0858
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: Unknown protein
Genome location: MC03:15308286..15312543
RNA-Seq Expression: MC03g0858
Synteny: MC03g0858
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
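
The genome location above is written as chromosome:start..end. A minimal Python sketch of reading that form and computing the span is shown here. It assumes the coordinates are 1-based and inclusive, which the page does not state, and the helper names parse_location and span_length are hypothetical, not part of CuGenDB.

```python
import re

# Pattern for locations written as "<chromosome>:<start>..<end>".
LOCATION_RE = re.compile(r"^(?P<chrom>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text):
    """Split a location string into (chromosome, start, end)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return match["chrom"], int(match["start"]), int(match["end"])


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("MC03:15308286..15312543")
print(chrom, start, end, span_length(start, end))
```

If the coordinates turn out to be 0-based or half-open, only span_length needs to change.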


Homology
GenBank top hits (e value, % identity, alignment)
XP_008441474.1 PREDICTED: uncharacterized protein LOC103485579 [Cucumis melo] (e value: 1.57e-110; % identity: 83.9)
Query:  MEIPKLNSASLREVNQLEETKNSQKISVCSQPNGVHYATNSDSFVIDMNGFSNGGGTKETSPNPRITLQKNLSRKGFQRGGDRMITSNSAPTDRDSSSPP
        MEIPKLN +S REVN LEET+NSQKISVCSQPNGVHY+TNSDSFVIDMNGFS+ GGTKE++ NPRITLQ+N SRKG QRGGD+MI +NSAP DRDSSSP 
Subjt:  MEIPKLNSASLREVNQLEETKNSQKISVCSQPNGVHYATNSDSFVIDMNGFSNGGGTKETSPNPRITLQKNLSRKGFQRGGDRMITSNSAPTDRDSSSPP

Query:  VAVGATMAEKA-GAAVVV---SQQDHLGLPQVHHQITITTAGNTAAPVERSILRRNSFRRAPSSWFLDPKKVLLLFATVSCIGSMVLIYFTLAIGKPNAE
        VAVGATMAEKA G+AV V   SQQDHLG+PQVHHQITITT G+TAAPVER ILRRNSFRR PS W LDPKKVLLLFATVSCIGSMVLIYFTLAIGKPNAE
Subjt:  VAVGATMAEKA-GAAVVV---SQQDHLGLPQVHHQITITTAGNTAAPVERSILRRNSFRRAPSSWFLDPKKVLLLFATVSCIGSMVLIYFTLAIGKPNAE

Query:  ERGFD
        +RGFD
Subjt:  ERGFD

XP_022140038.1 uncharacterized protein LOC111010789 [Momordica charantia] (e value: 1.61e-137; % identity: 100)
Query:  MEIPKLNSASLREVNQLEETKNSQKISVCSQPNGVHYATNSDSFVIDMNGFSNGGGTKETSPNPRITLQKNLSRKGFQRGGDRMITSNSAPTDRDSSSPP
        MEIPKLNSASLREVNQLEETKNSQKISVCSQPNGVHYATNSDSFVIDMNGFSNGGGTKETSPNPRITLQKNLSRKGFQRGGDRMITSNSAPTDRDSSSPP
Subjt:  MEIPKLNSASLREVNQLEETKNSQKISVCSQPNGVHYATNSDSFVIDMNGFSNGGGTKETSPNPRITLQKNLSRKGFQRGGDRMITSNSAPTDRDSSSPP

Query:  VAVGATMAEKAGAAVVVSQQDHLGLPQVHHQITITTAGNTAAPVERSILRRNSFRRAPSSWFLDPKKVLLLFATVSCIGSMVLIYFTLAIGKPNAEERGF
        VAVGATMAEKAGAAVVVSQQDHLGLPQVHHQITITTAGNTAAPVERSILRRNSFRRAPSSWFLDPKKVLLLFATVSCIGSMVLIYFTLAIGKPNAEERGF
Subjt:  VAVGATMAEKAGAAVVVSQQDHLGLPQVHHQITITTAGNTAAPVERSILRRNSFRRAPSSWFLDPKKVLLLFATVSCIGSMVLIYFTLAIGKPNAEERGF

Query:  D
        D
Subjt:  D

XP_022933196.1 uncharacterized protein LOC111440049 [Cucurbita moschata] (e value: 1.40e-111; % identity: 83.5)
Query:  MEIPKLNSASLREVNQLEETKNSQKISVCSQPNGVHYATNSDSFVIDMNGFSNGGGTKETSPNPRITLQKNLSRKGFQRGGDRMITSNSAPTDRDSSSPP
        MEIPKLNSA LREV++LEETKNSQKISVC+QPNGVHY+ NSDSFVIDMNGFSNGG TKE + NPRITLQ+NLSRKG QRGGD+MI SN+AP DRDSSSP 
Subjt:  MEIPKLNSASLREVNQLEETKNSQKISVCSQPNGVHYATNSDSFVIDMNGFSNGGGTKETSPNPRITLQKNLSRKGFQRGGDRMITSNSAPTDRDSSSPP

Query:  VAVGATMAEKAGAAVVV-----SQQDHLGLPQVHHQITITTAGNTAAPVERSILRRNSFRRAPSSWFLDPKKVLLLFATVSCIGSMVLIYFTLAIGKPNA
        VAVGATMAEKAGAAV V     SQQDHLG+ QVHHQITITTA   AAPVER  LRRNSFRR  SSWFLDPKKVLLLFATVSCIGSM+LIYFTLAIGKP+A
Subjt:  VAVGATMAEKAGAAVVV-----SQQDHLGLPQVHHQITITTAGNTAAPVERSILRRNSFRRAPSSWFLDPKKVLLLFATVSCIGSMVLIYFTLAIGKPNA

Query:  EERGFD
        EERGFD
Subjt:  EERGFD

XP_038895557.1 uncharacterized protein LOC120083769 isoform X1 [Benincasa hispida] (e value: 1.11e-113; % identity: 82.27)
Query:  FLFSCSSLSLFPEKVMEIPKLNSASLREVNQLEETKNSQKISVCSQPNGVHYATNSDSFVIDMNGFSNGGGTKETSPNPRITLQKNLSRKGFQRGGDRMI
        F F  S      +K+MEIPKLN +SLREVNQLEET+ SQKISVCSQPNGVHY+TNSDSFVIDMNGFSNGG TKE + NPRITLQ+NLSR G QRGGD+MI
Subjt:  FLFSCSSLSLFPEKVMEIPKLNSASLREVNQLEETKNSQKISVCSQPNGVHYATNSDSFVIDMNGFSNGGGTKETSPNPRITLQKNLSRKGFQRGGDRMI

Query:  TSNSAPTDRDSSSPPVAVGATMAEKA-GAAVVV---SQQDHLGLPQVHHQITITTAGNTAAPVERSILRRNSFRRAPSSWFLDPKKVLLLFATVSCIGSM
         SNSAP DRDSSSP VAVGATMAEKA GAAV +   SQQDHLG+PQVHHQITITT GNTAAPVER+ILRRNSFRR PSSW LDPKKVLLLFATVSCIGSM
Subjt:  TSNSAPTDRDSSSPPVAVGATMAEKA-GAAVVV---SQQDHLGLPQVHHQITITTAGNTAAPVERSILRRNSFRRAPSSWFLDPKKVLLLFATVSCIGSM

Query:  VLIYFTLAIGKPNAEERGFD
        +LIYFTLAIGKPNAEERGFD
Subjt:  VLIYFTLAIGKPNAEERGFD

XP_038895558.1 uncharacterized protein LOC120083769 isoform X2 [Benincasa hispida] (e value: 4.57e-112; % identity: 86.34)
Query:  MEIPKLNSASLREVNQLEETKNSQKISVCSQPNGVHYATNSDSFVIDMNGFSNGGGTKETSPNPRITLQKNLSRKGFQRGGDRMITSNSAPTDRDSSSPP
        MEIPKLN +SLREVNQLEET+ SQKISVCSQPNGVHY+TNSDSFVIDMNGFSNGG TKE + NPRITLQ+NLSR G QRGGD+MI SNSAP DRDSSSP 
Subjt:  MEIPKLNSASLREVNQLEETKNSQKISVCSQPNGVHYATNSDSFVIDMNGFSNGGGTKETSPNPRITLQKNLSRKGFQRGGDRMITSNSAPTDRDSSSPP

Query:  VAVGATMAEKA-GAAVVV---SQQDHLGLPQVHHQITITTAGNTAAPVERSILRRNSFRRAPSSWFLDPKKVLLLFATVSCIGSMVLIYFTLAIGKPNAE
        VAVGATMAEKA GAAV +   SQQDHLG+PQVHHQITITT GNTAAPVER+ILRRNSFRR PSSW LDPKKVLLLFATVSCIGSM+LIYFTLAIGKPNAE
Subjt:  VAVGATMAEKA-GAAVVV---SQQDHLGLPQVHHQITITTAGNTAAPVERSILRRNSFRRAPSSWFLDPKKVLLLFATVSCIGSMVLIYFTLAIGKPNAE

Query:  ERGFD
        ERGFD
Subjt:  ERGFD

TrEMBL top hits (e value, % identity, alignment)
A0A1S3B3I3 uncharacterized protein LOC103485579 (e value: 7.60e-111; % identity: 83.9)
Query:  MEIPKLNSASLREVNQLEETKNSQKISVCSQPNGVHYATNSDSFVIDMNGFSNGGGTKETSPNPRITLQKNLSRKGFQRGGDRMITSNSAPTDRDSSSPP
        MEIPKLN +S REVN LEET+NSQKISVCSQPNGVHY+TNSDSFVIDMNGFS+ GGTKE++ NPRITLQ+N SRKG QRGGD+MI +NSAP DRDSSSP 
Subjt:  MEIPKLNSASLREVNQLEETKNSQKISVCSQPNGVHYATNSDSFVIDMNGFSNGGGTKETSPNPRITLQKNLSRKGFQRGGDRMITSNSAPTDRDSSSPP

Query:  VAVGATMAEKA-GAAVVV---SQQDHLGLPQVHHQITITTAGNTAAPVERSILRRNSFRRAPSSWFLDPKKVLLLFATVSCIGSMVLIYFTLAIGKPNAE
        VAVGATMAEKA G+AV V   SQQDHLG+PQVHHQITITT G+TAAPVER ILRRNSFRR PS W LDPKKVLLLFATVSCIGSMVLIYFTLAIGKPNAE
Subjt:  VAVGATMAEKA-GAAVVV---SQQDHLGLPQVHHQITITTAGNTAAPVERSILRRNSFRRAPSSWFLDPKKVLLLFATVSCIGSMVLIYFTLAIGKPNAE

Query:  ERGFD
        +RGFD
Subjt:  ERGFD

A0A5A7UNT8 Uncharacterized protein (e value: 7.60e-111; % identity: 83.9)
Query:  MEIPKLNSASLREVNQLEETKNSQKISVCSQPNGVHYATNSDSFVIDMNGFSNGGGTKETSPNPRITLQKNLSRKGFQRGGDRMITSNSAPTDRDSSSPP
        MEIPKLN +S REVN LEET+NSQKISVCSQPNGVHY+TNSDSFVIDMNGFS+ GGTKE++ NPRITLQ+N SRKG QRGGD+MI +NSAP DRDSSSP 
Subjt:  MEIPKLNSASLREVNQLEETKNSQKISVCSQPNGVHYATNSDSFVIDMNGFSNGGGTKETSPNPRITLQKNLSRKGFQRGGDRMITSNSAPTDRDSSSPP

Query:  VAVGATMAEKA-GAAVVV---SQQDHLGLPQVHHQITITTAGNTAAPVERSILRRNSFRRAPSSWFLDPKKVLLLFATVSCIGSMVLIYFTLAIGKPNAE
        VAVGATMAEKA G+AV V   SQQDHLG+PQVHHQITITT G+TAAPVER ILRRNSFRR PS W LDPKKVLLLFATVSCIGSMVLIYFTLAIGKPNAE
Subjt:  VAVGATMAEKA-GAAVVV---SQQDHLGLPQVHHQITITTAGNTAAPVERSILRRNSFRRAPSSWFLDPKKVLLLFATVSCIGSMVLIYFTLAIGKPNAE

Query:  ERGFD
        +RGFD
Subjt:  ERGFD

A0A6J1CH33 uncharacterized protein LOC111010789 (e value: 7.78e-138; % identity: 100)
Query:  MEIPKLNSASLREVNQLEETKNSQKISVCSQPNGVHYATNSDSFVIDMNGFSNGGGTKETSPNPRITLQKNLSRKGFQRGGDRMITSNSAPTDRDSSSPP
        MEIPKLNSASLREVNQLEETKNSQKISVCSQPNGVHYATNSDSFVIDMNGFSNGGGTKETSPNPRITLQKNLSRKGFQRGGDRMITSNSAPTDRDSSSPP
Subjt:  MEIPKLNSASLREVNQLEETKNSQKISVCSQPNGVHYATNSDSFVIDMNGFSNGGGTKETSPNPRITLQKNLSRKGFQRGGDRMITSNSAPTDRDSSSPP

Query:  VAVGATMAEKAGAAVVVSQQDHLGLPQVHHQITITTAGNTAAPVERSILRRNSFRRAPSSWFLDPKKVLLLFATVSCIGSMVLIYFTLAIGKPNAEERGF
        VAVGATMAEKAGAAVVVSQQDHLGLPQVHHQITITTAGNTAAPVERSILRRNSFRRAPSSWFLDPKKVLLLFATVSCIGSMVLIYFTLAIGKPNAEERGF
Subjt:  VAVGATMAEKAGAAVVVSQQDHLGLPQVHHQITITTAGNTAAPVERSILRRNSFRRAPSSWFLDPKKVLLLFATVSCIGSMVLIYFTLAIGKPNAEERGF

Query:  D
        D
Subjt:  D

A0A6J1EZ37 uncharacterized protein LOC111440049 (e value: 6.78e-112; % identity: 83.5)
Query:  MEIPKLNSASLREVNQLEETKNSQKISVCSQPNGVHYATNSDSFVIDMNGFSNGGGTKETSPNPRITLQKNLSRKGFQRGGDRMITSNSAPTDRDSSSPP
        MEIPKLNSA LREV++LEETKNSQKISVC+QPNGVHY+ NSDSFVIDMNGFSNGG TKE + NPRITLQ+NLSRKG QRGGD+MI SN+AP DRDSSSP 
Subjt:  MEIPKLNSASLREVNQLEETKNSQKISVCSQPNGVHYATNSDSFVIDMNGFSNGGGTKETSPNPRITLQKNLSRKGFQRGGDRMITSNSAPTDRDSSSPP

Query:  VAVGATMAEKAGAAVVV-----SQQDHLGLPQVHHQITITTAGNTAAPVERSILRRNSFRRAPSSWFLDPKKVLLLFATVSCIGSMVLIYFTLAIGKPNA
        VAVGATMAEKAGAAV V     SQQDHLG+ QVHHQITITTA   AAPVER  LRRNSFRR  SSWFLDPKKVLLLFATVSCIGSM+LIYFTLAIGKP+A
Subjt:  VAVGATMAEKAGAAVVV-----SQQDHLGLPQVHHQITITTAGNTAAPVERSILRRNSFRRAPSSWFLDPKKVLLLFATVSCIGSMVLIYFTLAIGKPNA

Query:  EERGFD
        EERGFD
Subjt:  EERGFD

A0A6J1KI90 uncharacterized protein LOC111495477 isoform X1 (e value: 2.17e-110; % identity: 83.5)
Query:  MEIPKLNSASLREVNQLEETKNSQKISVCSQPNGVHYATNSDSFVIDMNGFSNGGGTKETSPNPRITLQKNLSRKGFQRGGDRMITSNSAPTDRDSSSPP
        MEIPKLNSA LREV++LEETKNSQKISVC+QPNGVHY+ NSDSFVIDMNGFSNGG TKE + NPRITLQ+NLSRKG QRGGD+MI SN+AP DRDSSSP 
Subjt:  MEIPKLNSASLREVNQLEETKNSQKISVCSQPNGVHYATNSDSFVIDMNGFSNGGGTKETSPNPRITLQKNLSRKGFQRGGDRMITSNSAPTDRDSSSPP

Query:  VAVGATMAEKAGAAVVV-----SQQDHLGLPQVHHQITITTAGNTAAPVERSILRRNSFRRAPSSWFLDPKKVLLLFATVSCIGSMVLIYFTLAIGKPNA
        VAVGATMAEKAGA V V     SQQDHLG+ QVHHQITITTA NTAAPVER  LRRNSFRR  SSWFLDPKKVLLLFATVSCIGSM+LIYFTLAIGKP+ 
Subjt:  VAVGATMAEKAGAAVVV-----SQQDHLGLPQVHHQITITTAGNTAAPVERSILRRNSFRRAPSSWFLDPKKVLLLFATVSCIGSMVLIYFTLAIGKPNA

Query:  EERGFD
        EERGFD
Subjt:  EERGFD

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT4G21215.1 unknown protein (e value: 2.6e-17; % identity: 37.37)
Query:  NGVHYATNSDSFV-IDMNGFSNGGGTKETSPNPRITLQKNLSRKGFQR------------GGDRMITSNSAPTDRDSSSPPVAVGATMAEKAGAAVVVSQ
        +G+    N D FV ID+  FSN      +S +PRIT  +N+SRKG  R            G D+ I+   +P  R SS+P  A      E AG A   + 
Subjt:  NGVHYATNSDSFV-IDMNGFSNGGGTKETSPNPRITLQKNLSRKGFQR------------GGDRMITSNSAPTDRDSSSPPVAVGATMAEKAGAAVVVSQ

Query:  QDHLGLPQVHHQITITTAGNTAAPV------ER--SILRRNSFRRAPSSWFLDPKKVLLLFATVSCIGSMVLIYFTLAIGKPNAEERGFD
                  HQIT+TTA   A  +      ER     R++SF+R+ +SW LDPKK++L FAT+S +GS++LI FTL+I K N  +   D
Subjt:  QDHLGLPQVHHQITITTAGNTAAPV------ER--SILRRNSFRRAPSSWFLDPKKVLLLFATVSCIGSMVLIYFTLAIGKPNAEERGFD

AT4G21215.2 unknown protein (e value: 5.5e-20; % identity: 38.42)
Query:  NGVHYATNSDSFV-IDMNGFSNGGGTKETSPNPRITLQKNLSRKGFQR------------GGDRMITSNSAPTDRDSSSPPVAVGATMAEKAGAAVVVSQ
        +G+    N D FV ID+  FSN      +S +PRITLQ+N+SRKG  R            G D+ I+   +P  R SS+P  A      E AG A   + 
Subjt:  NGVHYATNSDSFV-IDMNGFSNGGGTKETSPNPRITLQKNLSRKGFQR------------GGDRMITSNSAPTDRDSSSPPVAVGATMAEKAGAAVVVSQ

Query:  QDHLGLPQVHHQITITTAGNTAAPV------ER--SILRRNSFRRAPSSWFLDPKKVLLLFATVSCIGSMVLIYFTLAIGKPNAEERGFD
                  HQIT+TTA   A  +      ER     R++SF+R+ +SW LDPKK++L FAT+S +GS++LI FTL+I K N  +   D
Subjt:  QDHLGLPQVHHQITITTAGNTAAPV------ER--SILRRNSFRRAPSSWFLDPKKVLLLFATVSCIGSMVLIYFTLAIGKPNAEERGFD
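
In the alignment blocks above, the unlabeled middle row carries the match information. The Python sketch below counts matches in one block. It assumes the usual BLAST midline convention (a letter for an identical residue, + for a conservative substitution, a space for a mismatch or gap). The function name midline_stats is hypothetical, and the counts for a single block are not expected to reproduce the % identity reported for the whole hit.

```python
def midline_stats(query_row, midline_row):
    """Return (identities, conservative, aligned_columns) for one block.

    Assumes BLAST-style midlines: a letter marks an identical residue,
    '+' marks a conservative substitution, and a space marks a mismatch
    or a gap.
    """
    # Pad or trim the midline to the query width, because trailing
    # spaces are easily lost when alignment text is copied.
    width = len(query_row)
    midline = midline_row.ljust(width)[:width]
    identities = sum(1 for ch in midline if ch.isalpha())
    conservative = sum(1 for ch in midline if ch == "+")
    return identities, conservative, width


# Final block of the first GenBank hit: query "ERGFD", midline "+RGFD".
print(midline_stats("ERGFD", "+RGFD"))
```

Summing the three values over every block of one hit gives totals from which an identity fraction can be computed, with gap columns included in the denominator.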


Sequences
CDS sequence
TTTCTTTCTTCAACGGCACACCCACACATCATCAGATTCCTACTTTCCCTTTCGCCATTTTTGCTTCAACTCTATCACAATTTTCAGAACAAACAGTTCGCTGTTTCTGT
TTCTGTTTCTGTTTCAGCCATACCCAATTGCCCAATTCGGTGCGGAATTGTTGACGAAGAACACAACCTGCTTCCTTCTTCTTCATTTTTGTTTTCTTGTTCATCTCTCT
CCCTCTTTCCCGAGAAAGTGATGGAAATTCCCAAATTGAATTCGGCTTCTTTACGGGAAGTTAATCAATTGGAAGAAACGAAGAATTCCCAGAAAATTTCAGTTTGCAGT
CAGCCAAACGGTGTGCATTACGCAACAAACTCAGATAGCTTCGTCATCGACATGAACGGCTTCTCCAATGGCGGAGGAACCAAAGAAACCTCCCCAAATCCCAGAATTAC
ATTGCAGAAAAATTTATCCCGAAAAGGGTTTCAGCGTGGCGGCGACAGGATGATTACCAGCAATTCCGCTCCTACGGATAGAGATTCATCATCTCCACCGGTTGCAGTTG
GAGCTACCATGGCGGAAAAAGCAGGGGCGGCGGTGGTGGTGTCGCAGCAAGACCATTTGGGGCTTCCCCAAGTTCATCATCAGATCACTATAACCACCGCCGGCAACACG
GCGGCGCCGGTTGAACGATCCATTCTTAGAAGAAACAGTTTCAGGAGGGCTCCCTCTTCTTGGTTTTTGGACCCCAAGAAAGTTCTTCTCCTCTTTGCCACTGTGTCATG
TATAGGAAGTATGGTGCTCATATACTTCACACTGGCCATTGGAAAACCCAACGCGGAGGAACGCGGGTTTGATTAA
mRNA sequence
GTTTCTTTCTTCAACGGCACACCCACACATCATCAGATTCCTACTTTCCCTTTCGCCATTTTTGCTTCAACTCTATCACAATTTTCAGAACAAACAGTTCGCTGTTTCTG
TTTCTGTTTCTGTTTCAGCCATACCCAATTGCCCAATTCGGTGCGGAATTGTTGACGAAGAACACAACCTGCTTCCTTCTTCTTCATTTTTGTTTTCTTGTTCATCTCTC
TCCCTCTTTCCCGAGAAAGTGATGGAAATTCCCAAATTGAATTCGGCTTCTTTACGGGAAGTTAATCAATTGGAAGAAACGAAGAATTCCCAGAAAATTTCAGTTTGCAG
TCAGCCAAACGGTGTGCATTACGCAACAAACTCAGATAGCTTCGTCATCGACATGAACGGCTTCTCCAATGGCGGAGGAACCAAAGAAACCTCCCCAAATCCCAGAATTA
CATTGCAGAAAAATTTATCCCGAAAAGGGTTTCAGCGTGGCGGCGACAGGATGATTACCAGCAATTCCGCTCCTACGGATAGAGATTCATCATCTCCACCGGTTGCAGTT
GGAGCTACCATGGCGGAAAAAGCAGGGGCGGCGGTGGTGGTGTCGCAGCAAGACCATTTGGGGCTTCCCCAAGTTCATCATCAGATCACTATAACCACCGCCGGCAACAC
GGCGGCGCCGGTTGAACGATCCATTCTTAGAAGAAACAGTTTCAGGAGGGCTCCCTCTTCTTGGTTTTTGGACCCCAAGAAAGTTCTTCTCCTCTTTGCCACTGTGTCAT
GTATAGGAAGTATGGTGCTCATATACTTCACACTGGCCATTGGAAAACCCAACGCGGAGGAACGCGGGTTTGATTAACGGAAGGCAGAATAACGAACGCAAGGACCACAA
TAACAAAACAACACAGATGGGAAGATGATGTGAGAGATGTTTGTACAAAGAGATAGAATGAAGAGAGACAAAAGGGCAGCAAAAGCAAGAGGGCACGGCAGGCTTCCATG
GCCTCTAATTCTAGCAATCGGATGCGCCACATTTTCATTAAGCCTAGGAATGGAAGAAAGTGACACTGCAGCAATGCGACAAAGAAATGGAATCTTCCCTTTTTCTTTTT
GTTTTCTTTTGTTTCTAAAATTGACAGATGAAAGAGATCGGATCAGAAACGTGGTGCGCTTTTGATTCCAATCTGTGTTAGGGATTTCTCCATCTGCATAGATTAGTTGA
AGTGGAAGGTGATGAGATCACTCTTCAACACTTTTACAACCTCCTTTTTCCTCCAAATATTCCTTGTTTATACAAGTCTGCAACCCTTTTTTGGCTGCTCCAAGTTCGAC
CAAATTTGCTTTGTATTCTGTGGCTTCCATGTGAGCAAGAGCAGGCCACGCACTGCCACAAACAGAAAGTGTTTGAATGTACTCTATGCATGCGCACTTGAAGATTTTCT
GACAGTGGTTTTTAACTTTTTATTGTGCCAAACTCTGAATGTATATATTAACTTTCCTTTTTCAGGATGGTTGAAGGCAAAAGGGGGGGTAGTAGAGAAGGGTATATATT
ATAGCACTGCCATTTAATCGACTGTTTCAAGTAAATCACGTTGGATGAACTATGAAAATGGAAAGTGATTGCAGACCATGTTTGGGACAATAAGGAAACAAATTTAGGTA
TTAGAAGTTCTATAGTCATGAAAGCAAAAGAAATCAACAAGAAAGGCAAATTCTCTGTGCTGAGGCTTCTTATAGGAACTTTAAAGGTAAAATAGATAATGATAGCATCC
AAATATGTAACTTTCTCGCATGCTCTTTATCAAGTAACTGACTTAGGTCTATAATTAAAGAACTTGTCTAATCCAGCGGACACAGAATGGATATAAGAACTTTTACTTGA
CATTGTTGGGCCCAAGTAGTAATTGGGTGTAGGTTTTGCCCTTATTTACACAGGATAATGATCATCAGCTACACTATAGTATGAACAAATTTCAGGCAAAATTCCATCAA
TGAGAAGTCACCTGTGTACTGCCGTGGTCTATGTCACAGCTAACAAAGATTGAGCGACATCTTGTTCCACATTGATACTGGTATCATGCCTTCATGCTTAGCTTTTATGT
CTCTAGTAGCGCTTTGATATTCTCGGCATCGAACACTTGTGTTTTCTATCGTGGTAAAGCCTCAATATGACAGCAGCAGATTCTTCAATGGCCTTCCCTGTCACCTCTGC
AAGACAAATAGATAAGATATACAGTCCAATATGCAGTAAAAATAAATATCAGAACATTCTGTCTTTTTGATAAGTTGAATACCTATTACAGGCCATGTAGGGTTTTGAGC
AAAAATCTTTCCAGCAAACTCCAGCTCCTCTCTAACATAATCCATGTCTGAATAGGTACTTCTCATTTCGTCACTAAATCCCAAACTCTTGGCTCTTGCTCTTCTAATTG
TTTGCAACACAATTGGATTTATAGTCAGACCAAACACCTTTTCCGGGTCTGCCTCGAACAGGGTCTTTGGCAGTCCTATCCCCATTACGATCGGCACGTTAGCCACTTTA
TAACCCTTTTGTGCAAGATAAATGGACAACGGTGTCTTCCCCGTTCGAGATACACCAGCAAGAACAATGTCAGCTTTTTTCAAGTTTTGGGGCAATGCACCATCGTCTTG
TTTGATCGTGAACTCAATTGCTTCTATCCGACGAAAGTACTCCTCAGAAAGCGGGAGATTACTGCCAAAAGCCCCACGAGGAAGACCAGACGGTGAAACACCTAGATGAG
AAGCAACAGCTTCTGTAAGTGGCCCCAAAATGTCATTGGACTGAATGCCCCATAACCTGCAAGCTTGCTTGGCAGATTCAGCCATGAATGGATCGGCCAAAGTATAAACT
AGCATAGCACCCTCTTTTGCTGCTTGTTTTACTATCTCTAGTAATCGCTCTGCATCATCAATCTGCCACATGTACGTTTTGTTTGTAACAGCAAAAGATCAGTACAAATT
CAGAAGGTTTCAC
Protein sequence
FLSSTAHPHIIRFLLSLSPFLLQLYHNFQNKQFAVSVSVSVSAIPNCPIRCGIVDEEHNLLPSSSFLFSCSSLSLFPEKVMEIPKLNSASLREVNQLEETKNSQKISVCS
QPNGVHYATNSDSFVIDMNGFSNGGGTKETSPNPRITLQKNLSRKGFQRGGDRMITSNSAPTDRDSSSPPVAVGATMAEKAGAAVVVSQQDHLGLPQVHHQITITTAGNT
AAPVERSILRRNSFRRAPSSWFLDPKKVLLLFATVSCIGSMVLIYFTLAIGKPNAEERGFD
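
The protein sequence listed above begins FLSSTAHPH, whereas most query rows in the homology alignments begin at MEIPKLN. As a rough way to relate the two, the Python sketch below translates a nucleotide string with the standard genetic code and reports where the first methionine falls. It assumes the standard code applies and that translation starts in frame at the first base of the input. The names translate and first_methionine are hypothetical.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(nucleotides):
    """Translate in frame from the first base; unknown codons become 'X'."""
    seq = nucleotides.upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3)
    )


def first_methionine(protein):
    """0-based index of the first 'M', or -1 if there is none."""
    return protein.find("M")


# In-frame fragment copied from the CDS listing around the first ATG.
fragment = "GAGAAAGTGATGGAAATTCCCAAATTGAAT"
peptide = translate(fragment)
print(peptide, first_methionine(peptide))
```

Running the same two functions on the full CDS listing would show how many leading residues precede the first methionine in the listed protein.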