; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC07g0541 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC07g0541
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionTransmembrane protein
Genome locationMC07:13483501..13486181
RNA-Seq ExpressionMC07g0541
SyntenyMC07g0541
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7026129.1 hypothetical protein SDJN02_12628 [Cucurbita argyrosperma subsp. argyrosperma]4.50e-11668.47Show/hide
Query:  LLPHPFS-ALPFHHPWRL-----SSRSRFSSAT-RYRPWDSNAE------------DEEP----FDPGIRFRTTRNKRRRWWSDDPAPEFEEESSGILDE
        LL HP S +  F HPWR      SSR RFSSAT  YRPWDSNAE            DEE     FDPGIRF T+R  RRRWWSD+PAPEF+EE SG+LD+
Subjt:  LLPHPFS-ALPFHHPWRL-----SSRSRFSSAT-RYRPWDSNAE------------DEEP----FDPGIRFRTTRNKRRRWWSDDPAPEFEEESSGILDE

Query:  VIDSVWIFKVFKSYGWTLPPIIISLLLNSGPKAFLMALALPLAQSIISLALEKLWGAAERRPKRSARSRTRKRSFYSTETK-VQEEEE-----------K
        VIDSVWIFKVFKSYGW LPPIIISLLLNSGPKAFLMALA PL QSIISLALEKLWGA ERRPKR  R++TRKR FYSTE + V+EEEE           K
Subjt:  VIDSVWIFKVFKSYGWTLPPIIISLLLNSGPKAFLMALALPLAQSIISLALEKLWGAAERRPKRSARSRTRKRSFYSTETK-VQEEEE-----------K

Query:  GKMGYGYQSWEA-------RKEGRNGSSYGGWEDLDGVGSEREPKSGVRPKNQSSTTS-MEKGKLSWREKKSDTPLLLRLLIAVFPFLGSWTRML
        GKMGYGYQSWE        RKE R+G+++GGWEDLDGV      +SGVR K +SS++S ME GKLSWREKKSDTPLLLRLLIAVFPFLGSWTRML
Subjt:  GKMGYGYQSWEA-------RKEGRNGSSYGGWEDLDGVGSEREPKSGVRPKNQSSTTS-MEKGKLSWREKKSDTPLLLRLLIAVFPFLGSWTRML

XP_004147177.1 uncharacterized protein LOC101211925 [Cucumis sativus]4.02e-11768.24Show/hide
Query:  LLPHPFSALPFHHPWRL--SSRSRFSSAT-RYRPWDSNAE--------------------DEEP-FDPGIRFRTTRNKRRRWWSDDPAPEFEEESSGILD
        L PHP    PF  PWR   SS  RF+S    YRP D NAE                    DE+P FDPGIRFR     RRRWWSDDPAPEFE++ SGILD
Subjt:  LLPHPFSALPFHHPWRL--SSRSRFSSAT-RYRPWDSNAE--------------------DEEP-FDPGIRFRTTRNKRRRWWSDDPAPEFEEESSGILD

Query:  EVIDSVWIFKVFKSYGWTLPPIIISLLLNSGPKAFLMALALPLAQSIISLALEKLWGAAERRPKRSARSRTRKRSFYSTET-KVQEEEE-----------
        EVIDSVWIFKVFKSYGWTLPPIIISLLLNSGPKAFLMALALPL QSII+LALEKLWG  ER+PKR  RS+TRKR FYST T +VQEEE+           
Subjt:  EVIDSVWIFKVFKSYGWTLPPIIISLLLNSGPKAFLMALALPLAQSIISLALEKLWGAAERRPKRSARSRTRKRSFYSTET-KVQEEEE-----------

Query:  -KGKMGYGYQSWE-------ARKEGRNGSSYGGWEDLDGVGSEREPKSGVRPKNQSSTTSMEKGKLSWREKKSDTPLLLRLLIAVFPFLGSWTRML
          GKMGYGYQSWE        R EGRNG+S+GGWEDLDGVG+ER+PK GVR K QSSTT MEKGKL+WREKKSDTPLLLRLLIAVFPFLGSWT+ML
Subjt:  -KGKMGYGYQSWE-------ARKEGRNGSSYGGWEDLDGVGSEREPKSGVRPKNQSSTTSMEKGKLSWREKKSDTPLLLRLLIAVFPFLGSWTRML

XP_022138787.1 uncharacterized protein LOC111009866 [Momordica charantia]1.05e-18399.62Show/hide
Query:  MKALHLQLQLLPHPFSALPFHHPWRLSSRSRFSSATRYRPWDSNAEDEEPFDPGIRFRTTRNKRRRWWSDDPAPEFEEESSGILDEVIDSVWIFKVFKSY
        MKALHLQLQLLPHPFSALPFHHPWRLSSRSRFSSATRYR WDSNAEDEEPFDPGIRFRTTRNKRRRWWSDDPAPEFEEESSGILDEVIDSVWIFKVFKSY
Subjt:  MKALHLQLQLLPHPFSALPFHHPWRLSSRSRFSSATRYRPWDSNAEDEEPFDPGIRFRTTRNKRRRWWSDDPAPEFEEESSGILDEVIDSVWIFKVFKSY

Query:  GWTLPPIIISLLLNSGPKAFLMALALPLAQSIISLALEKLWGAAERRPKRSARSRTRKRSFYSTETKVQEEEEKGKMGYGYQSWEARKEGRNGSSYGGWE
        GWTLPPIIISLLLNSGPKAFLMALALPLAQSIISLALEKLWGAAERRPKRSARSRTRKRSFYSTETKVQEEEEKGKMGYGYQSWEARKEGRNGSSYGGWE
Subjt:  GWTLPPIIISLLLNSGPKAFLMALALPLAQSIISLALEKLWGAAERRPKRSARSRTRKRSFYSTETKVQEEEEKGKMGYGYQSWEARKEGRNGSSYGGWE

Query:  DLDGVGSEREPKSGVRPKNQSSTTSMEKGKLSWREKKSDTPLLLRLLIAVFPFLGSWTRML
        DLDGVGSEREPKSGVRPKNQSSTTSMEKGKLSWREKKSDTPLLLRLLIAVFPFLGSWTRML
Subjt:  DLDGVGSEREPKSGVRPKNQSSTTSMEKGKLSWREKKSDTPLLLRLLIAVFPFLGSWTRML

XP_023000532.1 uncharacterized protein LOC111494773 [Cucurbita maxima]1.74e-11769.49Show/hide
Query:  LLPHPFSALPFHHPWRL-----SSRSRFSSAT-RYRPWDSNAE------------DEEP----FDPGIRFRTTRNKRRRWWSDDPAPEFEEESSGILDEV
        LL HP S   F HPWR      SSR RFSSAT  YRPWDSNAE            DEE     FDPGIRF T+R  RRRWWSD+PAPEF+EESSGILD+V
Subjt:  LLPHPFSALPFHHPWRL-----SSRSRFSSAT-RYRPWDSNAE------------DEEP----FDPGIRFRTTRNKRRRWWSDDPAPEFEEESSGILDEV

Query:  IDSVWIFKVFKSYGWTLPPIIISLLLNSGPKAFLMALALPLAQSIISLALEKLWGAAERRPKRSARSRTRKRSFYSTETK-VQEEEEK------------
        IDSVWIFKVFKSYGW LPPIIISLLLNSGPKAFLMALALPL QSIISLALEKLWGA ERRPKR  R++TRKR FYSTE + V+EEEEK            
Subjt:  IDSVWIFKVFKSYGWTLPPIIISLLLNSGPKAFLMALALPLAQSIISLALEKLWGAAERRPKRSARSRTRKRSFYSTETK-VQEEEEK------------

Query:  GKMGYGYQSWEA-------RKEGRNGSSYGGWEDLDGVGSEREPKSGVRPKNQSSTTS-MEKGKLSWREKKSDTPLLLRLLIAVFPFLGSWTRML
        GKMGYGYQSWE        RKE R+G+++GGWEDLDGV      +SGVR K +SS++S ME+GKLSWREKKSDTPLLLRLLIAVFPFLGSWTRML
Subjt:  GKMGYGYQSWEA-------RKEGRNGSSYGGWEDLDGVGSEREPKSGVRPKNQSSTTS-MEKGKLSWREKKSDTPLLLRLLIAVFPFLGSWTRML

XP_038907133.1 uncharacterized protein LOC120092944 [Benincasa hispida]1.00e-11969Show/hide
Query:  LLPHPFSAL-PFH-HPWR------LSSRSRFSSATRYRPWDSNAE-----------------------DEEPFDPGIRFRTT-RNKRRRWWSDDPAPEFE
        L PHP S L P H  PWR       SS  RFS    YRP DSNAE                       D++ FDPGIRFRTT R  RRRWWSD+PAP+FE
Subjt:  LLPHPFSAL-PFH-HPWR------LSSRSRFSSATRYRPWDSNAE-----------------------DEEPFDPGIRFRTT-RNKRRRWWSDDPAPEFE

Query:  EESSGILDEVIDSVWIFKVFKSYGWTLPPIIISLLLNSGPKAFLMALALPLAQSIISLALEKLWGAAERRPKRSARSRTRKRSFYSTET-KVQEEEEK--
        ++ SGILD+VIDSVWIFKVFKSYGWTLPPII SLLLNSGPKAFLMALALPL QSIISLALEKLWG  ER+PKR  RS+TRKR FYST T +VQEEEE+  
Subjt:  EESSGILDEVIDSVWIFKVFKSYGWTLPPIIISLLLNSGPKAFLMALALPLAQSIISLALEKLWGAAERRPKRSARSRTRKRSFYSTET-KVQEEEEK--

Query:  ------GKMGYGYQSWEA-------RKEGRNGSSYGGWEDLDGVGSEREPKSGVRPKNQSSTTSMEKGKLSWREKKSDTPLLLRLLIAVFPFLGSWTRML
              GKMGYGYQSWE        RKEGRNG+S+GGWEDLDGVGSER+ K GVR K QSST SMEKGKLSWREKKSDTPLLLRLLIAVFPFLGSWTRML
Subjt:  ------GKMGYGYQSWEA-------RKEGRNGSSYGGWEDLDGVGSEREPKSGVRPKNQSSTTSMEKGKLSWREKKSDTPLLLRLLIAVFPFLGSWTRML

TrEMBL top hitse value%identityAlignment
A0A0A0LKI3 Uncharacterized protein1.94e-11768.24Show/hide
Query:  LLPHPFSALPFHHPWRL--SSRSRFSSAT-RYRPWDSNAE--------------------DEEP-FDPGIRFRTTRNKRRRWWSDDPAPEFEEESSGILD
        L PHP    PF  PWR   SS  RF+S    YRP D NAE                    DE+P FDPGIRFR     RRRWWSDDPAPEFE++ SGILD
Subjt:  LLPHPFSALPFHHPWRL--SSRSRFSSAT-RYRPWDSNAE--------------------DEEP-FDPGIRFRTTRNKRRRWWSDDPAPEFEEESSGILD

Query:  EVIDSVWIFKVFKSYGWTLPPIIISLLLNSGPKAFLMALALPLAQSIISLALEKLWGAAERRPKRSARSRTRKRSFYSTET-KVQEEEE-----------
        EVIDSVWIFKVFKSYGWTLPPIIISLLLNSGPKAFLMALALPL QSII+LALEKLWG  ER+PKR  RS+TRKR FYST T +VQEEE+           
Subjt:  EVIDSVWIFKVFKSYGWTLPPIIISLLLNSGPKAFLMALALPLAQSIISLALEKLWGAAERRPKRSARSRTRKRSFYSTET-KVQEEEE-----------

Query:  -KGKMGYGYQSWE-------ARKEGRNGSSYGGWEDLDGVGSEREPKSGVRPKNQSSTTSMEKGKLSWREKKSDTPLLLRLLIAVFPFLGSWTRML
          GKMGYGYQSWE        R EGRNG+S+GGWEDLDGVG+ER+PK GVR K QSSTT MEKGKL+WREKKSDTPLLLRLLIAVFPFLGSWT+ML
Subjt:  -KGKMGYGYQSWE-------ARKEGRNGSSYGGWEDLDGVGSEREPKSGVRPKNQSSTTSMEKGKLSWREKKSDTPLLLRLLIAVFPFLGSWTRML

A0A5A7SNV9 Uncharacterized protein1.84e-11466Show/hide
Query:  LLPHPFSALPFHHP--WRL----SSRSRFSSAT-RYRPWDSNAE---------------------DEEPFDPGIRFRTTRNKRRRWWSDDPAPEFEEESS
        L PHP    PF  P  WR     SS  RF+S    YRP D NAE                     +++ FDPGIRFR     RRRWWSDDPAP+FE++ S
Subjt:  LLPHPFSALPFHHP--WRL----SSRSRFSSAT-RYRPWDSNAE---------------------DEEPFDPGIRFRTTRNKRRRWWSDDPAPEFEEESS

Query:  GILDEVIDSVWIFKVFKSYGWTLPPIIISLLLNSGPKAFLMALALPLAQSIISLALEKLWGAAERRPKRSARSRTRKRSFYSTET-KVQEEEE-------
        GILDEVIDSVWIFKVFKSYGWTLPPIIISLLLNSGPKAFLMALALPL QSII+LALEKLWG  ER+PKR  RS+TRKR FYST T +VQEEE+       
Subjt:  GILDEVIDSVWIFKVFKSYGWTLPPIIISLLLNSGPKAFLMALALPLAQSIISLALEKLWGAAERRPKRSARSRTRKRSFYSTET-KVQEEEE-------

Query:  -----KGKMGYGYQSWEA-------RKEGRNGSSYGGWEDLDGVGSEREPKSGVRPKNQSSTTSMEKGKLSWREKKSDTPLLLRLLIAVFPFLGSWTRML
              GKMGYGYQSWE        R  GRNG+S+GGWEDLDGVG+ER+PK GVR K QSSTT MEKGKLSWREKKSDTPLLLRLLIAVFPFLGSWT+ML
Subjt:  -----KGKMGYGYQSWEA-------RKEGRNGSSYGGWEDLDGVGSEREPKSGVRPKNQSSTTSMEKGKLSWREKKSDTPLLLRLLIAVFPFLGSWTRML

A0A6J1CC49 uncharacterized protein LOC1110098665.06e-18499.62Show/hide
Query:  MKALHLQLQLLPHPFSALPFHHPWRLSSRSRFSSATRYRPWDSNAEDEEPFDPGIRFRTTRNKRRRWWSDDPAPEFEEESSGILDEVIDSVWIFKVFKSY
        MKALHLQLQLLPHPFSALPFHHPWRLSSRSRFSSATRYR WDSNAEDEEPFDPGIRFRTTRNKRRRWWSDDPAPEFEEESSGILDEVIDSVWIFKVFKSY
Subjt:  MKALHLQLQLLPHPFSALPFHHPWRLSSRSRFSSATRYRPWDSNAEDEEPFDPGIRFRTTRNKRRRWWSDDPAPEFEEESSGILDEVIDSVWIFKVFKSY

Query:  GWTLPPIIISLLLNSGPKAFLMALALPLAQSIISLALEKLWGAAERRPKRSARSRTRKRSFYSTETKVQEEEEKGKMGYGYQSWEARKEGRNGSSYGGWE
        GWTLPPIIISLLLNSGPKAFLMALALPLAQSIISLALEKLWGAAERRPKRSARSRTRKRSFYSTETKVQEEEEKGKMGYGYQSWEARKEGRNGSSYGGWE
Subjt:  GWTLPPIIISLLLNSGPKAFLMALALPLAQSIISLALEKLWGAAERRPKRSARSRTRKRSFYSTETKVQEEEEKGKMGYGYQSWEARKEGRNGSSYGGWE

Query:  DLDGVGSEREPKSGVRPKNQSSTTSMEKGKLSWREKKSDTPLLLRLLIAVFPFLGSWTRML
        DLDGVGSEREPKSGVRPKNQSSTTSMEKGKLSWREKKSDTPLLLRLLIAVFPFLGSWTRML
Subjt:  DLDGVGSEREPKSGVRPKNQSSTTSMEKGKLSWREKKSDTPLLLRLLIAVFPFLGSWTRML

A0A6J1HKD0 uncharacterized protein LOC1114643441.25e-11567.91Show/hide
Query:  LLPHPFSALPFHHPWRL-----SSRSRFSSAT-RYRPWDSNAE------------DEEP----FDPGIRFRTTRNKRRRWWSDDPAPEFEEESSGILDEV
        LL HP S   F HPWR      SSR RFSSAT  YRPWDSNAE            DEE     FDPGIRF T+R  RRRWWSD+PAPEF+EE SG+LD+V
Subjt:  LLPHPFSALPFHHPWRL-----SSRSRFSSAT-RYRPWDSNAE------------DEEP----FDPGIRFRTTRNKRRRWWSDDPAPEFEEESSGILDEV

Query:  IDSVWIFKVFKSYGWTLPPIIISLLLNSGPKAFLMALALPLAQSIISLALEKLWGAAERRPKRSARSRTRKRSFYSTETK--VQEEEE------------
        IDSVWIFKVFKSYGW LPPIIISLLLNSGPKAFLMALALPL QSIISLALEKLWGA ER+PKR  R++TRKR FYSTE +  V+EEEE            
Subjt:  IDSVWIFKVFKSYGWTLPPIIISLLLNSGPKAFLMALALPLAQSIISLALEKLWGAAERRPKRSARSRTRKRSFYSTETK--VQEEEE------------

Query:  KGKMGYGYQSWEA-------RKEGRNGSSYGGWEDLDGVGSEREPKSGVRPKNQSSTTS-MEKGKLSWREKKSDTPLLLRLLIAVFPFLGSWTRML
        KGKMGYGYQSWE        RKE R+G+++GGWEDLDGV      +SGVR K +SS++S ME GKLSWREKKSDTPLLLRLLI+VFPFLGSWTRML
Subjt:  KGKMGYGYQSWEA-------RKEGRNGSSYGGWEDLDGVGSEREPKSGVRPKNQSSTTS-MEKGKLSWREKKSDTPLLLRLLIAVFPFLGSWTRML

A0A6J1KDX0 uncharacterized protein LOC1114947738.41e-11869.49Show/hide
Query:  LLPHPFSALPFHHPWRL-----SSRSRFSSAT-RYRPWDSNAE------------DEEP----FDPGIRFRTTRNKRRRWWSDDPAPEFEEESSGILDEV
        LL HP S   F HPWR      SSR RFSSAT  YRPWDSNAE            DEE     FDPGIRF T+R  RRRWWSD+PAPEF+EESSGILD+V
Subjt:  LLPHPFSALPFHHPWRL-----SSRSRFSSAT-RYRPWDSNAE------------DEEP----FDPGIRFRTTRNKRRRWWSDDPAPEFEEESSGILDEV

Query:  IDSVWIFKVFKSYGWTLPPIIISLLLNSGPKAFLMALALPLAQSIISLALEKLWGAAERRPKRSARSRTRKRSFYSTETK-VQEEEEK------------
        IDSVWIFKVFKSYGW LPPIIISLLLNSGPKAFLMALALPL QSIISLALEKLWGA ERRPKR  R++TRKR FYSTE + V+EEEEK            
Subjt:  IDSVWIFKVFKSYGWTLPPIIISLLLNSGPKAFLMALALPLAQSIISLALEKLWGAAERRPKRSARSRTRKRSFYSTETK-VQEEEEK------------

Query:  GKMGYGYQSWEA-------RKEGRNGSSYGGWEDLDGVGSEREPKSGVRPKNQSSTTS-MEKGKLSWREKKSDTPLLLRLLIAVFPFLGSWTRML
        GKMGYGYQSWE        RKE R+G+++GGWEDLDGV      +SGVR K +SS++S ME+GKLSWREKKSDTPLLLRLLIAVFPFLGSWTRML
Subjt:  GKMGYGYQSWEA-------RKEGRNGSSYGGWEDLDGVGSEREPKSGVRPKNQSSTTS-MEKGKLSWREKKSDTPLLLRLLIAVFPFLGSWTRML

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G33250.1 unknown protein4.2e-2233.89Show/hide
Query:  DPGIRFRTTR-----NKRRRWWS---DDPAPEFEEESSG-------------ILDEVIDSVWIFKVFKSYGWTLPPIIISLLLNSGPKAFLMALALPLAQ
        D G  +R T       KRR  W    +D   + ++E  G             IL+E++D+VWI K FKSYG+ LP II+SL  ++GPKAFL++LA+ +  
Subjt:  DPGIRFRTTR-----NKRRRWWS---DDPAPEFEEESSG-------------ILDEVIDSVWIFKVFKSYGWTLPPIIISLLLNSGPKAFLMALALPLAQ

Query:  SIISLALEKLWGAAERRPKRSARSRTRKRSFYSTETKVQEEEEKGKMGYGYQSWEARKEGRN--------GSSYGGWEDLDGVGSEREPKSGVRPKNQSS
        S++  A +KL G  +RR    A        F   E + + E    ++ Y   +      GR          S +GGW++LDG+G+        RP ++  
Subjt:  SIISLALEKLWGAAERRPKRSARSRTRKRSFYSTETKVQEEEEKGKMGYGYQSWEARKEGRN--------GSSYGGWEDLDGVGSEREPKSGVRPKNQSS

Query:  TTSMEKGKLSWREKKSDTPLLLRLLIAVFPFLGSWTRML
           + K K   REK ++ PLLLRLL+++FPFL ++T ML
Subjt:  TTSMEKGKLSWREKKSDTPLLLRLLIAVFPFLGSWTRML

AT3G04310.1 unknown protein1.1e-2536.33Show/hide
Query:  SRSRFS-SATRYRPWDSNAEDEEPFDPGIRFRTTRNKRRRWWSDDPAPEFEEESSGILDEVIDSVWIFKVFKSYGWTLPPIIISLLLNSGPKAFLMALAL
        SR  F   A+R   W+     +E F     F   R K+R WW DD   + ++    + +E  D   +F+VF++  W L PI ISLLL +   A +MALA+
Subjt:  SRSRFS-SATRYRPWDSNAEDEEPFDPGIRFRTTRNKRRRWWSDDPAPEFEEESSGILDEVIDSVWIFKVFKSYGWTLPPIIISLLLNSGPKAFLMALAL

Query:  PLAQSIISLALEKLWGAAERRPKRSARSRTRKRSFYSTETKVQEEEEKGKM-----GYGYQSWEARKEGRN--GSSYGGWEDLD---GVGSEREPKSGVR
        PL QS++SL + K+W     R  + +R  T  RS   +  + ++  + G M       GY+SW    +  N  G+ YGGW+DLD    + ++      VR
Subjt:  PLAQSIISLALEKLWGAAERRPKRSARSRTRKRSFYSTETKVQEEEEKGKM-----GYGYQSWEARKEGRN--GSSYGGWEDLD---GVGSEREPKSGVR

Query:  PKNQSSTTSMEKGKLSWREKKSDTPLLLRLLIAVFPFLGSWTRML
        PK Q       K    WR K  + PLLLR+LIA FPFLGSWT++L
Subjt:  PKNQSSTTSMEKGKLSWREKKSDTPLLLRLLIAVFPFLGSWTRML


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGGCCCTGCACCTGCAGCTGCAGCTGCTCCCTCATCCCTTCTCCGCCCTTCCCTTTCACCATCCATGGCGTCTCTCTTCTCGCTCCCGTTTCTCCTCTGCAACCCG
CTACCGTCCCTGGGATTCTAATGCCGAAGACGAAGAACCCTTCGACCCCGGCATTCGCTTCCGAACCACCCGGAACAAGCGGCGGCGATGGTGGTCCGACGACCCGGCGC
CGGAGTTCGAGGAAGAATCTTCTGGGATCTTGGATGAAGTAATCGACAGTGTCTGGATTTTCAAGGTGTTCAAATCCTACGGTTGGACTCTTCCACCCATAATCATCTCA
CTGCTGCTGAACTCAGGGCCTAAAGCTTTTCTAATGGCGCTGGCCCTTCCACTTGCCCAATCAATCATATCTCTGGCGTTGGAGAAGCTGTGGGGAGCAGCAGAGAGAAG
GCCAAAGCGCAGCGCCAGAAGCAGGACCAGGAAGAGGTCATTTTACAGCACTGAGACTAAGGTTCAAGAAGAAGAAGAAAAAGGAAAGATGGGGTATGGATATCAGTCAT
GGGAAGCTAGAAAGGAGGGCAGAAATGGGAGCAGTTATGGGGGATGGGAGGATTTGGATGGAGTTGGGTCTGAGAGAGAGCCAAAATCTGGGGTGAGACCCAAGAATCAG
AGCAGCACCACATCAATGGAGAAGGGTAAGTTGAGTTGGAGAGAGAAGAAAAGTGATACTCCCTTGCTGTTAAGATTGTTGATTGCTGTTTTTCCATTTTTGGGTTCATG
GACTAGGATGCTTTAG
mRNA sequenceShow/hide mRNA sequence
CTTTCAAATTTTAACACAAAGTTAACAATTTAATAAAATCATAGTTATATTTTTTAAAAAAAAAATATTGAACACATTCCGTTTGAATGGAATAGAAAAAGAAAATTATA
TTAAAAATAATATTATAATGGATAAAAGAGTCCAATGAAGATTTTAAAATTACAGTAACTAAAATAAATCAAATCAAAGTTGAGATTACTTTGATCAAAATAAACTAAAT
CTAAAACTGAAAATATAAATATCAAAATAGTATTTTAATCTAAAATTTAATATAATTTATTAGATTTAATCCTGAAATTTTTTTTTTTTGGCTGATGTAGAACACTATAC
GACACCGTGTAGGCGAAAATGAAGGCGCTTTAACAGAGAAGGAGGGAGACAAAAGGATAACGTGGCGAAATGAAGGCCCTGCACCTGCAGCTGCAGCTGCTCCCTCATCC
CTTCTCCGCCCTTCCCTTTCACCATCCATGGCGTCTCTCTTCTCGCTCCCGTTTCTCCTCTGCAACCCGCTACCGTCCCTGGGATTCTAATGCCGAAGACGAAGAACCCT
TCGACCCCGGCATTCGCTTCCGAACCACCCGGAACAAGCGGCGGCGATGGTGGTCCGACGACCCGGCGCCGGAGTTCGAGGAAGAATCTTCTGGGATCTTGGATGAAGTA
ATCGACAGTGTCTGGATTTTCAAGGTGTTCAAATCCTACGGTTGGACTCTTCCACCCATAATCATCTCACTGCTGCTGAACTCAGGGCCTAAAGCTTTTCTAATGGCGCT
GGCCCTTCCACTTGCCCAATCAATCATATCTCTGGCGTTGGAGAAGCTGTGGGGAGCAGCAGAGAGAAGGCCAAAGCGCAGCGCCAGAAGCAGGACCAGGAAGAGGTCAT
TTTACAGCACTGAGACTAAGGTTCAAGAAGAAGAAGAAAAAGGAAAGATGGGGTATGGATATCAGTCATGGGAAGCTAGAAAGGAGGGCAGAAATGGGAGCAGTTATGGG
GGATGGGAGGATTTGGATGGAGTTGGGTCTGAGAGAGAGCCAAAATCTGGGGTGAGACCCAAGAATCAGAGCAGCACCACATCAATGGAGAAGGGTAAGTTGAGTTGGAG
AGAGAAGAAAAGTGATACTCCCTTGCTGTTAAGATTGTTGATTGCTGTTTTTCCATTTTTGGGTTCATGGACTAGGATGCTTTAGTTGAAGGCTCTTCTAGACTTCTATA
CTTGGGTTAGAGTTTACTCATTGTTTTGGTTACATTAAACCTCAAGAAGTTACTCATCTTGTTGAGAACAGGATAAAAAATTCACAATAAACCTAATGTAATAGTGTGAG
TTTCTATACTTCAGTATAAGTTAACCTGTAAGTCACTGCAATATTATTTCTTCTGCATAGCTACATGCTCCATGCAAATGAACTTAATATTACAGGATAACAGTCTAATC
TTACATATTAAGAATCAAAAGCTTGGTAGGATAAGTTTGTTCTTAATTGATCTAACTCTTGAAGTGTTCGGAGCACAACTTGACTTATTTCACGGGACAACGGGACAACT
ACCTGATCCTACCTCACCGTCTAACTAAATCTTTAATTCTTATCCATAAGAAACTTTTGAGTTATGGAAGAAGCTAAAGATTGAATGGAGAATTTGATTAAAATATAAAT
CAAACATGCTGCTTGTATTTAACTTGTAAATAATTGCTATTACATAAACCAATAGGATTATAAAATGTAAAGAATGTTGATTAGTCAAGATAACTGACCATCCGGGTTAA
ATAATGCCAATCTAATTCAAAGAAAAATCTCCATCATCTTGAAAATAATTGTACCGTGTTGACATTGCAAGCAGAAGCCGGGTCCCTGAGGCAGCTGCAGAAACCATCTT
CGTCCTTGGGGAATGAAGCTGGTAGAGGACGATCTGGAATCACTCCAACCTGTAGAATATGAATATCCAGGTGAATTAGAACTTCGACTTTCTGAGAAACGTCTGTGTGG
GTAAGAAATAGAGAGACCTTGTCAATGTCGATGTGAGCTGGTGTCTCGTAACGAGCTACGGTAACAGCCAAGCCAGAACCATCGGATAGTTTAAAGACCGACTGGATTTT
ACTGCAAGGAGGGATGTGAATTCTTGTTAGGAATTTGAACATTATGTCATGTTTTGTGTTACAGACTCGCAGAAAAATTCTTACCCTTTACCATAAGTTGGTTCTCCAAA
CAACATAGCACGTTTATTGTCCTTTAGTGCTCCGGCGAGTATTTCACTCGCACTAGCAGTTCCCTTATTCACCTGATATAAGCGTGTTGAAGAAATAATCACAACAAAGT
TCACAAAAGGCGAACAAAATTTCCGTACACTCGATCACAAATACTCAGATTTAAGACTAAAAATGCTGCAAACTAACTAGCAAAGAATGATGAAGGGGAAAATCAAATTT
GCGGATAGGAAATATTAGGGGATTTGTTATACATGAAGAAAATGCATGGAGTCCATCTAAGGAAGTAAGGTTTAAAT
Protein sequenceShow/hide protein sequence
MKALHLQLQLLPHPFSALPFHHPWRLSSRSRFSSATRYRPWDSNAEDEEPFDPGIRFRTTRNKRRRWWSDDPAPEFEEESSGILDEVIDSVWIFKVFKSYGWTLPPIIIS
LLLNSGPKAFLMALALPLAQSIISLALEKLWGAAERRPKRSARSRTRKRSFYSTETKVQEEEEKGKMGYGYQSWEARKEGRNGSSYGGWEDLDGVGSEREPKSGVRPKNQ
SSTTSMEKGKLSWREKKSDTPLLLRLLIAVFPFLGSWTRML