CuGenDBv2

MS008506 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS008506
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: Histone acetyltransferase
Genome location: scaffold4:1879452..1881217
RNA-Seq Expression: MS008506
Synteny: MS008506
Gene Ontology terms: NA
InterPro domains: NA
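
The genome location is given in a `scaffold:start..end` form. The following is a minimal sketch of parsing it and computing the locus span. It assumes the coordinates are 1-based and inclusive, which the record does not state, and the function name `parse_location` is hypothetical.

```python
import re

def parse_location(location):
    """Split 'scaffold:start..end' into (scaffold, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    scaffold, start, end = match.groups()
    return scaffold, int(start), int(end)

scaffold, start, end = parse_location("scaffold4:1879452..1881217")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(scaffold, span)
```

Under that assumption the locus spans 1766 bp, which is longer than the CDS listed under Sequences and would be consistent with a gene model that contains introns.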


Homology
GenBank top hits (columns: e-value, % identity, alignment)
KAA0038814.1 uncharacterized protein E6C27_scaffold92G003710 [Cucumis melo var. makuwa] (e-value 3.8e-95, identity 64.14%)
Query:  MPPRAGCRPFESVRRTWHTETHQPIRGSLIQHIFRVVNEVHSSATKKNKEWQETLPIVILKAEEILYSKPDSVDEYMDVTTLWIRINEAINTIIRVDEDD
        MPPR GCRPFE VRR WH+E HQPIRGSLIQ IFRVV+EVH  ATKKNKEWQE LPIV+LKAEEILYSK DS  EYMDVTTLW RINEAINTIIR+D+D 
Subjt:  MPPRAGCRPFESVRRTWHTETHQPIRGSLIQHIFRVVNEVHSSATKKNKEWQETLPIVILKAEEILYSKPDSVDEYMDVTTLWIRINEAINTIIRVDEDD

Query:  ESGELLHPFIEAALYLGCTPRRSSRSNRSSNLRCYLTSCTPQVLDTPPCTLCNTNTIRPISFDSQQCLSCLNLSKQMRNASNCR-RNENHVGPQNKVNTN
        E+GE LHP IEAALYLGCTPRRSS+SNR SNLR YL SCTPQ+LDT P     TNT RP    SQ    C NLSKQ R     R  N+ HVG     + N
Subjt:  ESGELLHPFIEAALYLGCTPRRSSRSNRSSNLRCYLTSCTPQVLDTPPCTLCNTNTIRPISFDSQQCLSCLNLSKQMRNASNCR-RNENHVGPQNKVNTN

Query:  PKTSPVYKNIWSSIRSQQLLTEVVSGW-MFCLCPLYHGSNQHNPDIEMLPKPLDRNLNFSNPGAVKYFHSCD-------------------QQKTTCDLL
        P TS VYKNI +SI +QQ LTE V+GW MF LCPLY G NQH  DIE+ P  ++ N +F  P   KYFHS D                   QQKTTCDL 
Subjt:  PKTSPVYKNIWSSIRSQQLLTEVVSGW-MFCLCPLYHGSNQHNPDIEMLPKPLDRNLNFSNPGAVKYFHSCD-------------------QQKTTCDLL

Query:  LRLG
        LRLG
Subjt:  LRLG

XP_008466415.1 PREDICTED: uncharacterized protein LOC103503829 [Cucumis melo] (e-value 1.9e-94, identity 63.82%)
Query:  MPPRAGCRPFESVRRTWHTETHQPIRGSLIQHIFRVVNEVHSSATKKNKEWQETLPIVILKAEEILYSKPDSVDEYMDVTTLWIRINEAINTIIRVDEDD
        MPPR GCRPFE VRR WH+E HQPIRGSLIQ IFRVV+EVH  ATKKNKEWQE LPIV+LKAEEILYSK DS  EYMDVTTLW RINEAINTIIR+D+D 
Subjt:  MPPRAGCRPFESVRRTWHTETHQPIRGSLIQHIFRVVNEVHSSATKKNKEWQETLPIVILKAEEILYSKPDSVDEYMDVTTLWIRINEAINTIIRVDEDD

Query:  ESGELLHPFIEAALYLGCTPRRSSRSNRSSNLRCYLTSCTPQVLDTPPCTLCNTNTIRPISFDSQQCLSCLNLSKQMRNASNCR-RNENHVGPQNKVNTN
        E+GE LHP IEAALYLGCTPRRSS+SNR S LR YL SCTPQ+LDT P     TNT RP    SQ    C NLSKQ R     R  N+ HVG     + N
Subjt:  ESGELLHPFIEAALYLGCTPRRSSRSNRSSNLRCYLTSCTPQVLDTPPCTLCNTNTIRPISFDSQQCLSCLNLSKQMRNASNCR-RNENHVGPQNKVNTN

Query:  PKTSPVYKNIWSSIRSQQLLTEVVSGW-MFCLCPLYHGSNQHNPDIEMLPKPLDRNLNFSNPGAVKYFHSCD-------------------QQKTTCDLL
        P TS VYKNI +SI +QQ LTE V+GW MF LCPLY G NQH  DIE+ P  ++ N +F  P   KYFHS D                   QQKTTCDL 
Subjt:  PKTSPVYKNIWSSIRSQQLLTEVVSGW-MFCLCPLYHGSNQHNPDIEMLPKPLDRNLNFSNPGAVKYFHSCD-------------------QQKTTCDLL

Query:  LRLG
        LRLG
Subjt:  LRLG

XP_011652481.1 uncharacterized protein LOC101219225 [Cucumis sativus] (e-value 1.2e-96, identity 64.8%)
Query:  MPPRAGCRPFESVRRTWHTETHQPIRGSLIQHIFRVVNEVHSSATKKNKEWQETLPIVILKAEEILYSKPDSVDEYMDVTTLWIRINEAINTIIRVDEDD
        MPPR GCRPFE VRR WH+E HQPIRGSLIQ IFRVV+EVH  ATKKNKEWQE LPIV+LKAEEILYSK DS  EYMDVTTLW RINEAINTIIR+D+D 
Subjt:  MPPRAGCRPFESVRRTWHTETHQPIRGSLIQHIFRVVNEVHSSATKKNKEWQETLPIVILKAEEILYSKPDSVDEYMDVTTLWIRINEAINTIIRVDEDD

Query:  ESGELLHPFIEAALYLGCTPRRSSRSNRSSNLRCYLTSCTPQVLDTPPCTLCNTNTIRPISFDSQQCLSCLNLSKQMRNA-SNCRRNENHVGPQNKVNTN
        E+GE LHP IEAALYLGCTPRRSS+SNR SNLR YL SCTPQVLDT P     TN IRP    SQ    C NLSKQ RN   +   N+ HVG    V + 
Subjt:  ESGELLHPFIEAALYLGCTPRRSSRSNRSSNLRCYLTSCTPQVLDTPPCTLCNTNTIRPISFDSQQCLSCLNLSKQMRNA-SNCRRNENHVGPQNKVNTN

Query:  PKTSPVYKNIWSSIRSQQLLTEVVSGW-MFCLCPLYHGSNQHNPDIEMLPKPLDRNLNFSNPGAVKYFHSCD-------------------QQKTTCDLL
          TS VYKNI  SIR++Q LTE V+GW MF LCPLY G NQH  DIE+ P P+  N +FS P   KYFHS D                   QQKTTCDL 
Subjt:  PKTSPVYKNIWSSIRSQQLLTEVVSGW-MFCLCPLYHGSNQHNPDIEMLPKPLDRNLNFSNPGAVKYFHSCD-------------------QQKTTCDLL

Query:  LRLG
        LRLG
Subjt:  LRLG

XP_023524265.1 uncharacterized protein LOC111788219 [Cucurbita pepo subsp. pepo] (e-value 1.9e-91, identity 62.46%)
Query:  MPPRAGCRPFESVRRTWHTETHQPIRGSLIQHIFRVVNEVHSSATKKNKEWQETLPIVILKAEEILYSKPDSVDEYMDVTTLWIRINEAINTIIRVDEDD
        MPPR G RPFE VRR WH E HQPIRGSLIQ IFRVVN+VH  ATKKNKEWQE LPIV+LKAEEILYSK DS  EYMD+TTLW RINEAINTIIR+D D 
Subjt:  MPPRAGCRPFESVRRTWHTETHQPIRGSLIQHIFRVVNEVHSSATKKNKEWQETLPIVILKAEEILYSKPDSVDEYMDVTTLWIRINEAINTIIRVDEDD

Query:  ESGELLHPFIEAALYLGCTPRRSSRSNRSSNLRCYLTSCTPQVLDTPPCTLCNTNTIRPISFDSQQCLSCLNLSKQMRNASNCRRNENHVGPQNKVNTNP
        E+GE LHP IEAALYLGCTPRRSSRSNR SNLR YLTSC+PQVL+T    +   NTIRP    +  C SC     +        +++ HVG    V+TN 
Subjt:  ESGELLHPFIEAALYLGCTPRRSSRSNRSSNLRCYLTSCTPQVLDTPPCTLCNTNTIRPISFDSQQCLSCLNLSKQMRNASNCRRNENHVGPQNKVNTNP

Query:  KTSPVYKNIWSSIRSQQLLTEVVSGW-MFCLCPLYHGSNQHNPDIEMLPKPLDRNLNFSNPGAVKYFHSC-----------------DQQKTTCDLLLRL
        K+S VYKNIW SI SQQ LTE VSGW MF LCPLY+GSNQH P I + P PL  + +FS P   KYFHS                  +QQKT C+L LRL
Subjt:  KTSPVYKNIWSSIRSQQLLTEVVSGW-MFCLCPLYHGSNQHNPDIEMLPKPLDRNLNFSNPGAVKYFHSC-----------------DQQKTTCDLLLRL

Query:  G
        G
Subjt:  G

XP_038899374.1 uncharacterized protein LOC120086684 [Benincasa hispida] (e-value 9.6e-99, identity 66.34%)
Query:  MPPRAGCRPFESVRRTWHTETHQPIRGSLIQHIFRVVNEVHSSATKKNKEWQETLPIVILKAEEILYSKPDSVDEYMDVTTLWIRINEAINTIIRVDEDD
        MPPR GCRPFE VRR WH+E HQPIRGSLIQ IFRVV+EVH  ATKKNKEWQE LPIV+LKAEEILYSK DS  EYMD+TTLW RINEAIN IIR+D+D 
Subjt:  MPPRAGCRPFESVRRTWHTETHQPIRGSLIQHIFRVVNEVHSSATKKNKEWQETLPIVILKAEEILYSKPDSVDEYMDVTTLWIRINEAINTIIRVDEDD

Query:  ESGELLHPFIEAALYLGCTPRRSSRSNRSSNLRCYLTSCTPQVLDTPPCTLCNTNTIRPISFDSQQCLSCLNLSKQMRNASNCR-RNENHVGPQNKVNTN
        E+GE LHP IEAALYLGCTPRRSS+SNR SNLR YL SCTPQVLDT P     TN+IRP    SQ   SC NLSKQ RN S  + +N+NHVG     +TN
Subjt:  ESGELLHPFIEAALYLGCTPRRSSRSNRSSNLRCYLTSCTPQVLDTPPCTLCNTNTIRPISFDSQQCLSCLNLSKQMRNASNCR-RNENHVGPQNKVNTN

Query:  PKTSPVYKNIWSSIRSQQLLTEVVSGW-MFCLCPLYHGSNQHNPDIEMLPKPLDRNLNFSNPGAVKYFHSCD------------------QQKTTCDLLL
        P TS VYKNIW  I  Q LLTE V+GW MF LCPLYHGSNQ    IE+ P PL  N  FS P   KYFHS D                  QQKTTCDL L
Subjt:  PKTSPVYKNIWSSIRSQQLLTEVVSGW-MFCLCPLYHGSNQHNPDIEMLPKPLDRNLNFSNPGAVKYFHSCD------------------QQKTTCDLLL

Query:  RLG
        RLG
Subjt:  RLG

TrEMBL top hits (columns: e-value, % identity, alignment)
A0A0A0LHA1 Uncharacterized protein (e-value 5.7e-97, identity 64.8%)
Query:  MPPRAGCRPFESVRRTWHTETHQPIRGSLIQHIFRVVNEVHSSATKKNKEWQETLPIVILKAEEILYSKPDSVDEYMDVTTLWIRINEAINTIIRVDEDD
        MPPR GCRPFE VRR WH+E HQPIRGSLIQ IFRVV+EVH  ATKKNKEWQE LPIV+LKAEEILYSK DS  EYMDVTTLW RINEAINTIIR+D+D 
Subjt:  MPPRAGCRPFESVRRTWHTETHQPIRGSLIQHIFRVVNEVHSSATKKNKEWQETLPIVILKAEEILYSKPDSVDEYMDVTTLWIRINEAINTIIRVDEDD

Query:  ESGELLHPFIEAALYLGCTPRRSSRSNRSSNLRCYLTSCTPQVLDTPPCTLCNTNTIRPISFDSQQCLSCLNLSKQMRNA-SNCRRNENHVGPQNKVNTN
        E+GE LHP IEAALYLGCTPRRSS+SNR SNLR YL SCTPQVLDT P     TN IRP    SQ    C NLSKQ RN   +   N+ HVG    V + 
Subjt:  ESGELLHPFIEAALYLGCTPRRSSRSNRSSNLRCYLTSCTPQVLDTPPCTLCNTNTIRPISFDSQQCLSCLNLSKQMRNA-SNCRRNENHVGPQNKVNTN

Query:  PKTSPVYKNIWSSIRSQQLLTEVVSGW-MFCLCPLYHGSNQHNPDIEMLPKPLDRNLNFSNPGAVKYFHSCD-------------------QQKTTCDLL
          TS VYKNI  SIR++Q LTE V+GW MF LCPLY G NQH  DIE+ P P+  N +FS P   KYFHS D                   QQKTTCDL 
Subjt:  PKTSPVYKNIWSSIRSQQLLTEVVSGW-MFCLCPLYHGSNQHNPDIEMLPKPLDRNLNFSNPGAVKYFHSCD-------------------QQKTTCDLL

Query:  LRLG
        LRLG
Subjt:  LRLG

A0A1S3CR62 uncharacterized protein LOC103503829 (e-value 9.1e-95, identity 63.82%)
Query:  MPPRAGCRPFESVRRTWHTETHQPIRGSLIQHIFRVVNEVHSSATKKNKEWQETLPIVILKAEEILYSKPDSVDEYMDVTTLWIRINEAINTIIRVDEDD
        MPPR GCRPFE VRR WH+E HQPIRGSLIQ IFRVV+EVH  ATKKNKEWQE LPIV+LKAEEILYSK DS  EYMDVTTLW RINEAINTIIR+D+D 
Subjt:  MPPRAGCRPFESVRRTWHTETHQPIRGSLIQHIFRVVNEVHSSATKKNKEWQETLPIVILKAEEILYSKPDSVDEYMDVTTLWIRINEAINTIIRVDEDD

Query:  ESGELLHPFIEAALYLGCTPRRSSRSNRSSNLRCYLTSCTPQVLDTPPCTLCNTNTIRPISFDSQQCLSCLNLSKQMRNASNCR-RNENHVGPQNKVNTN
        E+GE LHP IEAALYLGCTPRRSS+SNR S LR YL SCTPQ+LDT P     TNT RP    SQ    C NLSKQ R     R  N+ HVG     + N
Subjt:  ESGELLHPFIEAALYLGCTPRRSSRSNRSSNLRCYLTSCTPQVLDTPPCTLCNTNTIRPISFDSQQCLSCLNLSKQMRNASNCR-RNENHVGPQNKVNTN

Query:  PKTSPVYKNIWSSIRSQQLLTEVVSGW-MFCLCPLYHGSNQHNPDIEMLPKPLDRNLNFSNPGAVKYFHSCD-------------------QQKTTCDLL
        P TS VYKNI +SI +QQ LTE V+GW MF LCPLY G NQH  DIE+ P  ++ N +F  P   KYFHS D                   QQKTTCDL 
Subjt:  PKTSPVYKNIWSSIRSQQLLTEVVSGW-MFCLCPLYHGSNQHNPDIEMLPKPLDRNLNFSNPGAVKYFHSCD-------------------QQKTTCDLL

Query:  LRLG
        LRLG
Subjt:  LRLG

A0A5A7T5Q7 Uncharacterized protein (e-value 1.8e-95, identity 64.14%)
Query:  MPPRAGCRPFESVRRTWHTETHQPIRGSLIQHIFRVVNEVHSSATKKNKEWQETLPIVILKAEEILYSKPDSVDEYMDVTTLWIRINEAINTIIRVDEDD
        MPPR GCRPFE VRR WH+E HQPIRGSLIQ IFRVV+EVH  ATKKNKEWQE LPIV+LKAEEILYSK DS  EYMDVTTLW RINEAINTIIR+D+D 
Subjt:  MPPRAGCRPFESVRRTWHTETHQPIRGSLIQHIFRVVNEVHSSATKKNKEWQETLPIVILKAEEILYSKPDSVDEYMDVTTLWIRINEAINTIIRVDEDD

Query:  ESGELLHPFIEAALYLGCTPRRSSRSNRSSNLRCYLTSCTPQVLDTPPCTLCNTNTIRPISFDSQQCLSCLNLSKQMRNASNCR-RNENHVGPQNKVNTN
        E+GE LHP IEAALYLGCTPRRSS+SNR SNLR YL SCTPQ+LDT P     TNT RP    SQ    C NLSKQ R     R  N+ HVG     + N
Subjt:  ESGELLHPFIEAALYLGCTPRRSSRSNRSSNLRCYLTSCTPQVLDTPPCTLCNTNTIRPISFDSQQCLSCLNLSKQMRNASNCR-RNENHVGPQNKVNTN

Query:  PKTSPVYKNIWSSIRSQQLLTEVVSGW-MFCLCPLYHGSNQHNPDIEMLPKPLDRNLNFSNPGAVKYFHSCD-------------------QQKTTCDLL
        P TS VYKNI +SI +QQ LTE V+GW MF LCPLY G NQH  DIE+ P  ++ N +F  P   KYFHS D                   QQKTTCDL 
Subjt:  PKTSPVYKNIWSSIRSQQLLTEVVSGW-MFCLCPLYHGSNQHNPDIEMLPKPLDRNLNFSNPGAVKYFHSCD-------------------QQKTTCDLL

Query:  LRLG
        LRLG
Subjt:  LRLG

A0A5D3E5M9 Uncharacterized protein (e-value 9.1e-95, identity 63.82%)
Query:  MPPRAGCRPFESVRRTWHTETHQPIRGSLIQHIFRVVNEVHSSATKKNKEWQETLPIVILKAEEILYSKPDSVDEYMDVTTLWIRINEAINTIIRVDEDD
        MPPR GCRPFE VRR WH+E HQPIRGSLIQ IFRVV+EVH  ATKKNKEWQE LPIV+LKAEEILYSK DS  EYMDVTTLW RINEAINTIIR+D+D 
Subjt:  MPPRAGCRPFESVRRTWHTETHQPIRGSLIQHIFRVVNEVHSSATKKNKEWQETLPIVILKAEEILYSKPDSVDEYMDVTTLWIRINEAINTIIRVDEDD

Query:  ESGELLHPFIEAALYLGCTPRRSSRSNRSSNLRCYLTSCTPQVLDTPPCTLCNTNTIRPISFDSQQCLSCLNLSKQMRNASNCR-RNENHVGPQNKVNTN
        E+GE LHP IEAALYLGCTPRRSS+SNR S LR YL SCTPQ+LDT P     TNT RP    SQ    C NLSKQ R     R  N+ HVG     + N
Subjt:  ESGELLHPFIEAALYLGCTPRRSSRSNRSSNLRCYLTSCTPQVLDTPPCTLCNTNTIRPISFDSQQCLSCLNLSKQMRNASNCR-RNENHVGPQNKVNTN

Query:  PKTSPVYKNIWSSIRSQQLLTEVVSGW-MFCLCPLYHGSNQHNPDIEMLPKPLDRNLNFSNPGAVKYFHSCD-------------------QQKTTCDLL
        P TS VYKNI +SI +QQ LTE V+GW MF LCPLY G NQH  DIE+ P  ++ N +F  P   KYFHS D                   QQKTTCDL 
Subjt:  PKTSPVYKNIWSSIRSQQLLTEVVSGW-MFCLCPLYHGSNQHNPDIEMLPKPLDRNLNFSNPGAVKYFHSCD-------------------QQKTTCDLL

Query:  LRLG
        LRLG
Subjt:  LRLG

A0A6J1FPE4 uncharacterized protein LOC111445784 (e-value 1.2e-91, identity 62.13%)
Query:  MPPRAGCRPFESVRRTWHTETHQPIRGSLIQHIFRVVNEVHSSATKKNKEWQETLPIVILKAEEILYSKPDSVDEYMDVTTLWIRINEAINTIIRVDEDD
        MPPR G RPFE VRR WH E HQPIRGSLIQ IFRVV++VH  ATKKNKEWQE LPIV+LKAEEILYSK DS  EYMD+TTLW RINEAINTIIR+D D 
Subjt:  MPPRAGCRPFESVRRTWHTETHQPIRGSLIQHIFRVVNEVHSSATKKNKEWQETLPIVILKAEEILYSKPDSVDEYMDVTTLWIRINEAINTIIRVDEDD

Query:  ESGELLHPFIEAALYLGCTPRRSSRSNRSSNLRCYLTSCTPQVLDTPPCTLCNTNTIRPISFDSQQCLSCLNLSKQMRNASNCRRNENHVGPQNKVNTNP
        E+GE LHP IEAALYLGCTPRRSSRSNR SNLR YLTSC+PQVL+T    L   NTIRP    S   +       Q+           HVG    V+TN 
Subjt:  ESGELLHPFIEAALYLGCTPRRSSRSNRSSNLRCYLTSCTPQVLDTPPCTLCNTNTIRPISFDSQQCLSCLNLSKQMRNASNCRRNENHVGPQNKVNTNP

Query:  KTSPVYKNIWSSIRSQQLLTEVVSGW-MFCLCPLYHGSNQHNPDIEMLPKPLDRNLNFSNPGAVKYFHSC-----------------DQQKTTCDLLLRL
        K+S VYKNIW SI SQQ LTE VSGW MF LCPLY+GSNQH P I + P PL  + +FS P   KYFHS                  +QQKT C+L LRL
Subjt:  KTSPVYKNIWSSIRSQQLLTEVVSGW-MFCLCPLYHGSNQHNPDIEMLPKPLDRNLNFSNPGAVKYFHSC-----------------DQQKTTCDLLLRL

Query:  G
        G
Subjt:  G

SwissProt top hits (columns: e-value, % identity, alignment)
No hits found

Arabidopsis top hits (columns: e-value, % identity, alignment)
AT3G24150.1 unknown protein (e-value 5.5e-44, identity 61.19%)
Query:  PRAGCRPFESVRRTWHTETHQPIRGSLIQHIFRVVNEVHSSATKKNKEWQETLPIVILKAEEILYSKPDSVDEYMDVTTLWIRINEAINTIIRVDEDDES
        PR G RP+E V+R WH++ HQPIRGS+I+ IFR+  E HS+AT+KNKEWQE LP+V+LKAEEI+YSK +S +EY D  T+W R+N+AI+TIIR DE  E+
Subjt:  PRAGCRPFESVRRTWHTETHQPIRGSLIQHIFRVVNEVHSSATKKNKEWQETLPIVILKAEEILYSKPDSVDEYMDVTTLWIRINEAINTIIRVDEDDES

Query:  GELLHPFIEAALYLGCTPRRSSRSNRSSNLRCYL
        G LL P +EAAL LGC   R+SRS R S+ R YL
Subjt:  GELLHPFIEAALYLGCTPRRSSRSNRSSNLRCYL

AT4G32295.1 unknown protein (e-value 1.8e-47, identity 63.7%)
Query:  PRAGCRPFESVRRTWHTETHQPIRGSLIQHIFRVVNEVHSSATKKNKEWQETLPIVILKAEEILYSKPDSVDEYMDVTTLWIRINEAINTIIRVDEDDES
        PR G RP++ +RR WH++ HQP+RG LIQ IFR+V E+HS +T+KN EWQE LP+V+L+AEEI+YSK +S  EYMD+ TL  R N+AINTIIR+DE  E+
Subjt:  PRAGCRPFESVRRTWHTETHQPIRGSLIQHIFRVVNEVHSSATKKNKEWQETLPIVILKAEEILYSKPDSVDEYMDVTTLWIRINEAINTIIRVDEDDES

Query:  GELLHPFIEAALYLGCTPRRSSRSNRSSNLRCYLT
        GE L P IEAAL+LGCTPRR+SRS R+ N RCYL+
Subjt:  GELLHPFIEAALYLGCTPRRSSRSNRSSNLRCYLT

AT4G32295.2 unknown protein (e-value 4.2e-20, identity 65.28%)
Query:  LYSKPDSVDEYMDVTTLWIRINEAINTIIRVDEDDESGELLHPFIEAALYLGCTPRRSSRSNRSSNLRCYLT
        +YSK +S  EYMD+ TL  R N+AINTIIR+DE  E+GE L P IEAAL+LGCTPRR+SRS R+ N RCYL+
Subjt:  LYSKPDSVDEYMDVTTLWIRINEAINTIIRVDEDDESGELLHPFIEAALYLGCTPRRSSRSNRSSNLRCYLT
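
The % identity values reported for each hit can be cross-checked against the alignment midline. The sketch below uses the common BLAST-style convention (identical positions divided by alignment length) and applies it to the AT4G32295.2 alignment above. The function name `percent_identity` is hypothetical, and it assumes the query row spans the full alignment length, gap characters included.

```python
def percent_identity(query, midline):
    """Identical positions divided by alignment length, as a percentage."""
    # In a BLAST-style midline a letter marks an identical residue, '+' marks
    # a conservative substitution, and a space marks a mismatch or gap.
    identities = sum(1 for ch in midline if ch.isalpha())
    # Assumption: the query row (with any '-' gaps) is as long as the alignment.
    return 100.0 * identities / len(query)

# Query row and midline copied from the AT4G32295.2 alignment.
query = "LYSKPDSVDEYMDVTTLWIRINEAINTIIRVDEDDESGELLHPFIEAALYLGCTPRRSSRSNRSSNLRCYLT"
midline = "+YSK +S  EYMD+ TL  R N+AINTIIR+DE  E+GE L P IEAAL+LGCTPRR+SRS R+ N RCYL+"

print(round(percent_identity(query, midline), 2))
```

For this hit the midline contains 47 identical residues over 72 aligned positions, which rounds to the reported 65.28.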


Sequences
CDS sequence
ATGCCACCAAGAGCAGGGTGTAGGCCATTTGAGAGTGTGAGGAGGACTTGGCACACTGAAACCCACCAACCCATTAGAGGTTCTCTCATTCAACACATTTTTAGGGTTGT
GAATGAAGTTCATAGCTCTGCTACTAAGAAGAATAAGGAGTGGCAAGAGACGCTCCCCATTGTTATTCTCAAAGCTGAGGAAATCTTGTACTCTAAGCCTGATTCTGTGG
ATGAATACATGGATGTTACTACACTATGGATCCGAATAAACGAAGCGATTAATACGATAATTCGGGTTGATGAGGACGATGAAAGTGGGGAGCTTCTTCATCCTTTTATT
GAAGCTGCTCTCTATTTGGGTTGCACGCCGAGAAGATCGTCGAGGAGTAACCGAAGTAGTAATTTGAGGTGCTATCTTACTTCTTGCACTCCACAGGTCTTAGATACACC
TCCTTGTACACTTTGCAACACCAACACAATTAGACCAATTAGTTTCGATTCACAACAATGTCTGTCTTGTCTGAATTTGTCAAAGCAGATGAGAAATGCATCGAATTGTA
GACGAAACGAAAACCATGTTGGTCCTCAAAACAAAGTTAACACTAATCCGAAAACTTCACCTGTATATAAGAACATTTGGTCATCCATCCGTAGCCAGCAGTTGCTCACA
GAGGTTGTTTCAGGATGGATGTTTTGTTTGTGTCCCTTGTACCATGGAAGCAATCAACACAACCCCGATATCGAAATGCTACCCAAGCCCCTCGACAGAAATTTGAATTT
CTCGAATCCAGGTGCGGTTAAGTACTTCCATTCCTGCGATCAACAGAAGACGACATGCGATCTATTGTTGCGATTGGGA
mRNA sequence
ATGCCACCAAGAGCAGGGTGTAGGCCATTTGAGAGTGTGAGGAGGACTTGGCACACTGAAACCCACCAACCCATTAGAGGTTCTCTCATTCAACACATTTTTAGGGTTGT
GAATGAAGTTCATAGCTCTGCTACTAAGAAGAATAAGGAGTGGCAAGAGACGCTCCCCATTGTTATTCTCAAAGCTGAGGAAATCTTGTACTCTAAGCCTGATTCTGTGG
ATGAATACATGGATGTTACTACACTATGGATCCGAATAAACGAAGCGATTAATACGATAATTCGGGTTGATGAGGACGATGAAAGTGGGGAGCTTCTTCATCCTTTTATT
GAAGCTGCTCTCTATTTGGGTTGCACGCCGAGAAGATCGTCGAGGAGTAACCGAAGTAGTAATTTGAGGTGCTATCTTACTTCTTGCACTCCACAGGTCTTAGATACACC
TCCTTGTACACTTTGCAACACCAACACAATTAGACCAATTAGTTTCGATTCACAACAATGTCTGTCTTGTCTGAATTTGTCAAAGCAGATGAGAAATGCATCGAATTGTA
GACGAAACGAAAACCATGTTGGTCCTCAAAACAAAGTTAACACTAATCCGAAAACTTCACCTGTATATAAGAACATTTGGTCATCCATCCGTAGCCAGCAGTTGCTCACA
GAGGTTGTTTCAGGATGGATGTTTTGTTTGTGTCCCTTGTACCATGGAAGCAATCAACACAACCCCGATATCGAAATGCTACCCAAGCCCCTCGACAGAAATTTGAATTT
CTCGAATCCAGGTGCGGTTAAGTACTTCCATTCCTGCGATCAACAGAAGACGACATGCGATCTATTGTTGCGATTGGGA
Protein sequence
MPPRAGCRPFESVRRTWHTETHQPIRGSLIQHIFRVVNEVHSSATKKNKEWQETLPIVILKAEEILYSKPDSVDEYMDVTTLWIRINEAINTIIRVDEDDESGELLHPFI
EAALYLGCTPRRSSRSNRSSNLRCYLTSCTPQVLDTPPCTLCNTNTIRPISFDSQQCLSCLNLSKQMRNASNCRRNENHVGPQNKVNTNPKTSPVYKNIWSSIRSQQLLT
EVVSGWMFCLCPLYHGSNQHNPDIEMLPKPLDRNLNFSNPGAVKYFHSCDQQKTTCDLLLRLG
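
The protein sequence can be checked against the CDS by translating it codon by codon. The following is a minimal sketch that assumes the standard genetic code (NCBI translation table 1); the record does not say which table was used, and the names `CODON_TABLE` and `translate` are hypothetical. Only the first 27 and the last 12 nucleotides of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (third base varying fastest). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate an in-frame DNA coding sequence into one-letter amino acids."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First 27 nt and last 12 nt of the CDS listed above.
print(translate("ATGCCACCAAGAGCAGGGTGTAGGCCA"))
print(translate("TTGCGATTGGGA"))
```

The two fragments translate to `MPPRAGCRP` and `LRLG`, matching the start and end of the protein sequence. The listed CDS ends with GGA (glycine) rather than a stop codon, so a full translation has no trailing `*`.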