; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS005765 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS005765
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionMyb-like protein X
Genome locationscaffold254:1442811..1444953
RNA-Seq ExpressionMS005765
SyntenyMS005765
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0049003.1 myb-like protein X [Cucumis melo var. makuwa]8.8e-7757.47Show/hide
Query:  AATLSFPNFPFPQTDHVLGSKSTRMLKDFLQES--SGIASSKSRTASFKALA----IRAVKRI---SISIP-ILPRSLSRRLLGKTERDEREIGGDFVVK
        A +LSFP FP  +         TRMLKDFL E+  +GIASSK +  SFKALA    + AVKRI   S+  P I PRSLSRRLL KTERDERE GGDFVVK
Subjt:  AATLSFPNFPFPQTDHVLGSKSTRMLKDFLQES--SGIASSKSRTASFKALA----IRAVKRI---SISIP-ILPRSLSRRLLGKTERDEREIGGDFVVK

Query:  IKDIIRWRSFRDLVDETTVAAEAPPLDFADSPD--CYTAAATTTTTTTAT--SSKSSSWCESDFTAEDSPSPSWRGCSDDGEVGKKYFLCAGEDLEEDDK
        IKDIIRW+SFRDL+DETT A  APPLDFA+SPD   YTAAATTTTTTT T  SSKSSSWCESDFTAED PSPSWR  SDDG +GK YF C GED  E   
Subjt:  IKDIIRWRSFRDLVDETTVAAEAPPLDFADSPD--CYTAAATTTTTTTAT--SSKSSSWCESDFTAEDSPSPSWRGCSDDGEVGKKYFLCAGEDLEEDDK

Query:  KVSIIEYFAPVEALSR----EEQNLLKERARRVLEHVEAAISFSEICSTAERRPY-GLLVEFFRRELTESGESEVDEEDR-------------KWRCWLR
          +  +    + ALSR    EEQ +L E  RR+LE V+  IS SE C  AE     GLL E FRR+L     S  D++DR              W     
Subjt:  KVSIIEYFAPVEALSR----EEQNLLKERARRVLEHVEAAISFSEICSTAERRPY-GLLVEFFRRELTESGESEVDEEDR-------------KWRCWLR

Query:  KGNEGYLREMEREGKWSAFGNEEKVELGLQIERGILGSLVYELLFDIF
        KG E Y+REMEREGKW  FG +EK++LGL+IE  +LG LV E+L DIF
Subjt:  KGNEGYLREMEREGKWSAFGNEEKVELGLQIERGILGSLVYELLFDIF

XP_004133889.1 uncharacterized protein LOC101208043 [Cucumis sativus]2.8e-7555.97Show/hide
Query:  AATLSFPNFPFPQTDHVLGSKSTRMLKDFLQES--SGIASSKSRTASFKALA----IRAVKRISI----SIPILPRSLSRRLLGKTERDEREIGGDFVVK
        A TLSFP FP  +         TRMLKDFL E+  +G+AS K +  SFKALA    + AVKRIS+    S  I PRSLSRRLL KTERDERE GGDFVVK
Subjt:  AATLSFPNFPFPQTDHVLGSKSTRMLKDFLQES--SGIASSKSRTASFKALA----IRAVKRISI----SIPILPRSLSRRLLGKTERDEREIGGDFVVK

Query:  IKDIIRWRSFRDLVDETTVAAEAPPLDFADSPD--CYTAAATTTTTTTAT----SSKSSSWCESDFTAEDSPSPSWRGCSDDGEVGKKYFLCAGEDLEED
        IKDIIRW+SFRDL+DETT A  APPLDFA+SPD   YTAAATTTTTTT T    SSKSSSWCESDFTAED  SPSWR  SDDG +GK YF C GED   +
Subjt:  IKDIIRWRSFRDLVDETTVAAEAPPLDFADSPD--CYTAAATTTTTTTAT----SSKSSSWCESDFTAEDSPSPSWRGCSDDGEVGKKYFLCAGEDLEED

Query:  DKKVSIIEYFAPVEAL----SREEQNLLKERARRVLEHVEAAISFSEICSTAERRPYGLLV-EFFRREL--TESGESEVDEEDRKWRC-----------W
        +   +  +    V AL      EEQ +L E  RR+LE V+ AIS S+ C   ER     L+ E FRREL   +  +  V  +DR+ R            W
Subjt:  DKKVSIIEYFAPVEAL----SREEQNLLKERARRVLEHVEAAISFSEICSTAERRPYGLLV-EFFRREL--TESGESEVDEEDRKWRC-----------W

Query:  L--RKGNEGYLREMEREGKWSAFGNEEKVELGLQIERGILGSLVYELLFDIF
            KG E Y+REMEREGKW  FG +EK+ELGL+IE  ILG LV E+L DIF
Subjt:  L--RKGNEGYLREMEREGKWSAFGNEEKVELGLQIERGILGSLVYELLFDIF

XP_008438108.1 PREDICTED: uncharacterized protein LOC103483313 [Cucumis melo]5.2e-7757.47Show/hide
Query:  AATLSFPNFPFPQTDHVLGSKSTRMLKDFLQES--SGIASSKSRTASFKALA----IRAVKRISI----SIPILPRSLSRRLLGKTERDEREIGGDFVVK
        A +LSFP FP  +         TRMLKDFL E+  +GIASSK +  SFKALA    + AVKRIS     S  I PRSLSRRLL KTERDERE GGDFVVK
Subjt:  AATLSFPNFPFPQTDHVLGSKSTRMLKDFLQES--SGIASSKSRTASFKALA----IRAVKRISI----SIPILPRSLSRRLLGKTERDEREIGGDFVVK

Query:  IKDIIRWRSFRDLVDETTVAAEAPPLDFADSPD--CYTAAATTTTTTTAT--SSKSSSWCESDFTAEDSPSPSWRGCSDDGEVGKKYFLCAGEDLEEDDK
        IKDIIRW+SFRDL+DETT A  APPLDFA+SPD   YTAAATTTTTTT T  SSKSSSWCESDFTAED PSPSWR  SDDG +GK YF C GED  E   
Subjt:  IKDIIRWRSFRDLVDETTVAAEAPPLDFADSPD--CYTAAATTTTTTTAT--SSKSSSWCESDFTAEDSPSPSWRGCSDDGEVGKKYFLCAGEDLEEDDK

Query:  KVSIIEYFAPVEALSR----EEQNLLKERARRVLEHVEAAISFSEICSTAERRPY-GLLVEFFRRELTESGESEVDEEDR-------------KWRCWLR
          +  +    + ALSR    EEQ +L E  RR+LE V+  IS SE C  AE     GLL E FRR+L     S  D++DR              W     
Subjt:  KVSIIEYFAPVEALSR----EEQNLLKERARRVLEHVEAAISFSEICSTAERRPY-GLLVEFFRRELTESGESEVDEEDR-------------KWRCWLR

Query:  KGNEGYLREMEREGKWSAFGNEEKVELGLQIERGILGSLVYELLFDIF
        KG E Y+REMEREGKW  FG +EK++LGL+IE  +LG LV E+L DIF
Subjt:  KGNEGYLREMEREGKWSAFGNEEKVELGLQIERGILGSLVYELLFDIF

XP_022147284.1 uncharacterized protein LOC111016277 [Momordica charantia]6.2e-15593.71Show/hide
Query:  MAATLSFPNFPFPQTDHVLGSKSTRMLKDFLQESSGIASSKSRTASFKALAIRAVKRISISIPILPRSLSRRLLGKTERDEREIGGDFVVKIKDIIRWRS
        MAATLS PNFPFPQTDHVLGSKSTRMLKDFLQESSGIASSKSRTASFKALAIRAVKRISISIPILPRSLSRRLLGKTERDEREIGGDFVVKIKDIIRWRS
Subjt:  MAATLSFPNFPFPQTDHVLGSKSTRMLKDFLQESSGIASSKSRTASFKALAIRAVKRISISIPILPRSLSRRLLGKTERDEREIGGDFVVKIKDIIRWRS

Query:  FRDLVDETTVAAEAPPLDFADSPDCYTAAATTTTTTTATSSKSSSWCESDFTAEDSPSPSWRGCSDDGEVGKKYFLCAGEDLEEDDKKVSIIEYFAPVEA
        FRDLVDETTVAAEAPPLDFADSPDC TAAATTTTTTTATSSKSSSWCESDFTAEDSPSPSWRGCSDDGEVGKKYFLCAGEDLEEDDKK         VEA
Subjt:  FRDLVDETTVAAEAPPLDFADSPDCYTAAATTTTTTTATSSKSSSWCESDFTAEDSPSPSWRGCSDDGEVGKKYFLCAGEDLEEDDKKVSIIEYFAPVEA

Query:  LSR-EEQNLLKERARRVLEHVEAAISFSEICSTAERRPYGLLVEFFRRELTESGESEVDEEDRKWRCWLRKGNEGYLREMEREGKWSAFGNEEKVELGLQ
        LSR EE+++LKERA RVLEHVEAAISFSEICSTAE RP GLLV+FFRRELTESGESEVDEED KWRCWLRKGNEGYLREMEREGKWSAFGNEEKVELGLQ
Subjt:  LSR-EEQNLLKERARRVLEHVEAAISFSEICSTAERRPYGLLVEFFRRELTESGESEVDEEDRKWRCWLRKGNEGYLREMEREGKWSAFGNEEKVELGLQ

Query:  IERGILGSLVYELLFDIF
        IERGILGSLVYELLFDIF
Subjt:  IERGILGSLVYELLFDIF

XP_038880771.1 uncharacterized protein LOC120072365 [Benincasa hispida]1.1e-7456.79Show/hide
Query:  MAATLSFPNFPFPQTDHVLGSKSTRMLKDFLQE--SSGIASSKSRTASFKALAIR----AVKRISI----SIPILPRSLSRRLLGKTERDEREIGGDFVV
        MAATLSFP FP  +        STRMLKDFLQE  ++GI SSK + ASFKALAI     AVKRIS     S  I PRSLSRRLL KTER+EREIGGDFVV
Subjt:  MAATLSFPNFPFPQTDHVLGSKSTRMLKDFLQE--SSGIASSKSRTASFKALAIR----AVKRISI----SIPILPRSLSRRLLGKTERDEREIGGDFVV

Query:  KIKDIIRWRSFRDLVDETTVAAEAPPLDFADSPDCYTAAATTTTTTTATSSKSSSWCESDFTAEDSPSPSWRGCSD---DGEVGKKYFLCAGED------
        KIKDIIRW+SFRDLVDET   A AP LDFADSPD YTAAATTTTTTT T S SSSWCESDF AED PSPSWR  S+   DG VGK +F C GED      
Subjt:  KIKDIIRWRSFRDLVDETTVAAEAPPLDFADSPDCYTAAATTTTTTTATSSKSSSWCESDFTAEDSPSPSWRGCSD---DGEVGKKYFLCAGED------

Query:  -LEEDDKKVSIIEYFAPVEALSR----EEQNLLKERARRVLEHVEAAISFSEICSTAERRPYG---LLVEFFRRELT-----------------ESGESE
           E+DKKV+I        ALSR    +EQN+L +  + VLE VE AIS  +  +    + YG   LL+EF RREL                  + G+ E
Subjt:  -LEEDDKKVSIIEYFAPVEALSR----EEQNLLKERARRVLEHVEAAISFSEICSTAERRPYG---LLVEFFRRELT-----------------ESGESE

Query:  VDEEDRKWRCWL--RKGNEGYLREMEREGKWSAFGNEEKVELGLQIERGILGSLVYELLFD
         D +D     W    KG E Y+REMEREGKW  FG EEK+ELGLQ E  ILG LV E+L D
Subjt:  VDEEDRKWRCWL--RKGNEGYLREMEREGKWSAFGNEEKVELGLQIERGILGSLVYELLFD

TrEMBL top hitse value%identityAlignment
A0A0A0L6C4 Uncharacterized protein1.4e-7555.97Show/hide
Query:  AATLSFPNFPFPQTDHVLGSKSTRMLKDFLQES--SGIASSKSRTASFKALA----IRAVKRISI----SIPILPRSLSRRLLGKTERDEREIGGDFVVK
        A TLSFP FP  +         TRMLKDFL E+  +G+AS K +  SFKALA    + AVKRIS+    S  I PRSLSRRLL KTERDERE GGDFVVK
Subjt:  AATLSFPNFPFPQTDHVLGSKSTRMLKDFLQES--SGIASSKSRTASFKALA----IRAVKRISI----SIPILPRSLSRRLLGKTERDEREIGGDFVVK

Query:  IKDIIRWRSFRDLVDETTVAAEAPPLDFADSPD--CYTAAATTTTTTTAT----SSKSSSWCESDFTAEDSPSPSWRGCSDDGEVGKKYFLCAGEDLEED
        IKDIIRW+SFRDL+DETT A  APPLDFA+SPD   YTAAATTTTTTT T    SSKSSSWCESDFTAED  SPSWR  SDDG +GK YF C GED   +
Subjt:  IKDIIRWRSFRDLVDETTVAAEAPPLDFADSPD--CYTAAATTTTTTTAT----SSKSSSWCESDFTAEDSPSPSWRGCSDDGEVGKKYFLCAGEDLEED

Query:  DKKVSIIEYFAPVEAL----SREEQNLLKERARRVLEHVEAAISFSEICSTAERRPYGLLV-EFFRREL--TESGESEVDEEDRKWRC-----------W
        +   +  +    V AL      EEQ +L E  RR+LE V+ AIS S+ C   ER     L+ E FRREL   +  +  V  +DR+ R            W
Subjt:  DKKVSIIEYFAPVEAL----SREEQNLLKERARRVLEHVEAAISFSEICSTAERRPYGLLV-EFFRREL--TESGESEVDEEDRKWRC-----------W

Query:  L--RKGNEGYLREMEREGKWSAFGNEEKVELGLQIERGILGSLVYELLFDIF
            KG E Y+REMEREGKW  FG +EK+ELGL+IE  ILG LV E+L DIF
Subjt:  L--RKGNEGYLREMEREGKWSAFGNEEKVELGLQIERGILGSLVYELLFDIF

A0A1S3AVN8 uncharacterized protein LOC1034833132.5e-7757.47Show/hide
Query:  AATLSFPNFPFPQTDHVLGSKSTRMLKDFLQES--SGIASSKSRTASFKALA----IRAVKRISI----SIPILPRSLSRRLLGKTERDEREIGGDFVVK
        A +LSFP FP  +         TRMLKDFL E+  +GIASSK +  SFKALA    + AVKRIS     S  I PRSLSRRLL KTERDERE GGDFVVK
Subjt:  AATLSFPNFPFPQTDHVLGSKSTRMLKDFLQES--SGIASSKSRTASFKALA----IRAVKRISI----SIPILPRSLSRRLLGKTERDEREIGGDFVVK

Query:  IKDIIRWRSFRDLVDETTVAAEAPPLDFADSPD--CYTAAATTTTTTTAT--SSKSSSWCESDFTAEDSPSPSWRGCSDDGEVGKKYFLCAGEDLEEDDK
        IKDIIRW+SFRDL+DETT A  APPLDFA+SPD   YTAAATTTTTTT T  SSKSSSWCESDFTAED PSPSWR  SDDG +GK YF C GED  E   
Subjt:  IKDIIRWRSFRDLVDETTVAAEAPPLDFADSPD--CYTAAATTTTTTTAT--SSKSSSWCESDFTAEDSPSPSWRGCSDDGEVGKKYFLCAGEDLEEDDK

Query:  KVSIIEYFAPVEALSR----EEQNLLKERARRVLEHVEAAISFSEICSTAERRPY-GLLVEFFRRELTESGESEVDEEDR-------------KWRCWLR
          +  +    + ALSR    EEQ +L E  RR+LE V+  IS SE C  AE     GLL E FRR+L     S  D++DR              W     
Subjt:  KVSIIEYFAPVEALSR----EEQNLLKERARRVLEHVEAAISFSEICSTAERRPY-GLLVEFFRRELTESGESEVDEEDR-------------KWRCWLR

Query:  KGNEGYLREMEREGKWSAFGNEEKVELGLQIERGILGSLVYELLFDIF
        KG E Y+REMEREGKW  FG +EK++LGL+IE  +LG LV E+L DIF
Subjt:  KGNEGYLREMEREGKWSAFGNEEKVELGLQIERGILGSLVYELLFDIF

A0A5A7TZF7 Myb-like protein X4.3e-7757.47Show/hide
Query:  AATLSFPNFPFPQTDHVLGSKSTRMLKDFLQES--SGIASSKSRTASFKALA----IRAVKRI---SISIP-ILPRSLSRRLLGKTERDEREIGGDFVVK
        A +LSFP FP  +         TRMLKDFL E+  +GIASSK +  SFKALA    + AVKRI   S+  P I PRSLSRRLL KTERDERE GGDFVVK
Subjt:  AATLSFPNFPFPQTDHVLGSKSTRMLKDFLQES--SGIASSKSRTASFKALA----IRAVKRI---SISIP-ILPRSLSRRLLGKTERDEREIGGDFVVK

Query:  IKDIIRWRSFRDLVDETTVAAEAPPLDFADSPD--CYTAAATTTTTTTAT--SSKSSSWCESDFTAEDSPSPSWRGCSDDGEVGKKYFLCAGEDLEEDDK
        IKDIIRW+SFRDL+DETT A  APPLDFA+SPD   YTAAATTTTTTT T  SSKSSSWCESDFTAED PSPSWR  SDDG +GK YF C GED  E   
Subjt:  IKDIIRWRSFRDLVDETTVAAEAPPLDFADSPD--CYTAAATTTTTTTAT--SSKSSSWCESDFTAEDSPSPSWRGCSDDGEVGKKYFLCAGEDLEEDDK

Query:  KVSIIEYFAPVEALSR----EEQNLLKERARRVLEHVEAAISFSEICSTAERRPY-GLLVEFFRRELTESGESEVDEEDR-------------KWRCWLR
          +  +    + ALSR    EEQ +L E  RR+LE V+  IS SE C  AE     GLL E FRR+L     S  D++DR              W     
Subjt:  KVSIIEYFAPVEALSR----EEQNLLKERARRVLEHVEAAISFSEICSTAERRPY-GLLVEFFRRELTESGESEVDEEDR-------------KWRCWLR

Query:  KGNEGYLREMEREGKWSAFGNEEKVELGLQIERGILGSLVYELLFDIF
        KG E Y+REMEREGKW  FG +EK++LGL+IE  +LG LV E+L DIF
Subjt:  KGNEGYLREMEREGKWSAFGNEEKVELGLQIERGILGSLVYELLFDIF

A0A6J1D0J8 uncharacterized protein LOC1110162773.0e-15593.71Show/hide
Query:  MAATLSFPNFPFPQTDHVLGSKSTRMLKDFLQESSGIASSKSRTASFKALAIRAVKRISISIPILPRSLSRRLLGKTERDEREIGGDFVVKIKDIIRWRS
        MAATLS PNFPFPQTDHVLGSKSTRMLKDFLQESSGIASSKSRTASFKALAIRAVKRISISIPILPRSLSRRLLGKTERDEREIGGDFVVKIKDIIRWRS
Subjt:  MAATLSFPNFPFPQTDHVLGSKSTRMLKDFLQESSGIASSKSRTASFKALAIRAVKRISISIPILPRSLSRRLLGKTERDEREIGGDFVVKIKDIIRWRS

Query:  FRDLVDETTVAAEAPPLDFADSPDCYTAAATTTTTTTATSSKSSSWCESDFTAEDSPSPSWRGCSDDGEVGKKYFLCAGEDLEEDDKKVSIIEYFAPVEA
        FRDLVDETTVAAEAPPLDFADSPDC TAAATTTTTTTATSSKSSSWCESDFTAEDSPSPSWRGCSDDGEVGKKYFLCAGEDLEEDDKK         VEA
Subjt:  FRDLVDETTVAAEAPPLDFADSPDCYTAAATTTTTTTATSSKSSSWCESDFTAEDSPSPSWRGCSDDGEVGKKYFLCAGEDLEEDDKKVSIIEYFAPVEA

Query:  LSR-EEQNLLKERARRVLEHVEAAISFSEICSTAERRPYGLLVEFFRRELTESGESEVDEEDRKWRCWLRKGNEGYLREMEREGKWSAFGNEEKVELGLQ
        LSR EE+++LKERA RVLEHVEAAISFSEICSTAE RP GLLV+FFRRELTESGESEVDEED KWRCWLRKGNEGYLREMEREGKWSAFGNEEKVELGLQ
Subjt:  LSR-EEQNLLKERARRVLEHVEAAISFSEICSTAERRPYGLLVEFFRRELTESGESEVDEEDRKWRCWLRKGNEGYLREMEREGKWSAFGNEEKVELGLQ

Query:  IERGILGSLVYELLFDIF
        IERGILGSLVYELLFDIF
Subjt:  IERGILGSLVYELLFDIF

A0A6J1H9R7 uncharacterized protein LOC111461748 isoform X21.9e-7254.55Show/hide
Query:  AATLSFPNFPFPQTDHVLGSKSTRMLKDFLQESSGIASSKSRTASFKALAIRAVKRISISI----PILPRSLSRRLLGKTERDEREIGGDFVVKIKDIIR
        AA LSF  F      H+  + STRMLKDFLQES+G  + +S+TASF       VKRIS        ILPRSLSRRL G  ERDERE GGDFVVK+KDIIR
Subjt:  AATLSFPNFPFPQTDHVLGSKSTRMLKDFLQESSGIASSKSRTASFKALAIRAVKRISISI----PILPRSLSRRLLGKTERDEREIGGDFVVKIKDIIR

Query:  WRSFRDLVDETTVAAEAPPLDFADSPDCYTAAATTTTTTTATSSKSSSWCESDFTAEDSPSPSWRGCSDDGEVGKKYFLCAGEDLEEDDKKVSIIEYFAP
        WRSFRDLVDET   A APPLDFADSPD YTAAATTTTTTTAT+S SSSWCESDFTAED PSPSW+GCSDD E GK YF C GEDL E    V+  E    
Subjt:  WRSFRDLVDETTVAAEAPPLDFADSPDCYTAAATTTTTTTATSSKSSSWCESDFTAEDSPSPSWRGCSDDGEVGKKYFLCAGEDLEEDDKKVSIIEYFAP

Query:  VEALSREEQ--NLLKERARRVLEHVEAAISFSEICSTAE----RRPY----------------GLLV----EFFRRELT----ESGESEVDEEDRKWRCW
        V  LSR+ +  N  ++ ARR+LE ++  IS S      E     RPY                G       E FRRE +       + E DE +     W
Subjt:  VEALSREEQ--NLLKERARRVLEHVEAAISFSEICSTAE----RRPY----------------GLLV----EFFRRELT----ESGESEVDEEDRKWRCW

Query:  L--RKGNEGYLREMEREGKWSAFGNEEKVELGLQIERGILGSLVYELLFDIF
        +  +KG E   REMEREGKW  FGNEE+ ELGL+IE GI G LV E++ DIF
Subjt:  L--RKGNEGYLREMEREGKWSAFGNEEKVELGLQIERGILGSLVYELLFDIF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G00770.1 unknown protein4.2e-0833.33Show/hide
Query:  TRMLKDFLQESSGIASS-------------------KSRTASFKALAIRAVKRI------SISIPILPRSLSRRLLGKTERDEREIGGDFVVKIKDIIRW
        +RMLKD L E S   SS                   K + ++     I A+K +      S    ILPRSLSRRL  K + + +      V+++KDI+RW
Subjt:  TRMLKDFLQESSGIASS-------------------KSRTASFKALAIRAVKRI------SISIPILPRSLSRRLLGKTERDEREIGGDFVVKIKDIIRW

Query:  RSFRDLVDETTVAAEAPPLDFADSPDCYTAAATTTTTTTATSS--KSSSWCESDFTAEDSPSPSW----RGCSDDGEVGKKYFLCAGED
         S +DL ++ +             P  YT   TTTTT ++T+S    SSW + DFT+E  PS SW      C +   V K    C GED
Subjt:  RSFRDLVDETTVAAEAPPLDFADSPDCYTAAATTTTTTTATSS--KSSSWCESDFTAEDSPSPSW----RGCSDDGEVGKKYFLCAGED


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGCGACTCTGTCATTCCCTAATTTTCCTTTCCCCCAAACGGATCACGTACTCGGATCGAAATCGACGAGAATGCTCAAAGATTTTCTCCAAGAAAGTAGCGGAAT
CGCTTCTTCGAAATCCAGAACGGCGTCGTTTAAAGCTCTGGCGATTCGCGCCGTGAAGAGGATCTCGATCTCGATTCCGATTTTGCCGAGAAGTCTCTCGCGGCGGCTGC
TGGGGAAGACAGAGAGAGATGAGAGAGAAATCGGAGGCGATTTCGTTGTTAAAATCAAGGACATCATACGCTGGAGATCGTTCCGGGATTTGGTCGACGAGACGACGGTG
GCGGCGGAGGCTCCGCCGCTTGATTTCGCCGATTCGCCGGATTGTTATACGGCCGCCGCCACGACTACCACGACCACGACGGCGACTAGCAGTAAGAGCTCCAGCTGGTG
CGAGAGCGATTTCACGGCGGAGGATTCGCCGTCGCCGTCGTGGAGGGGTTGCTCCGACGACGGCGAAGTAGGGAAAAAATATTTCCTATGTGCTGGTGAAGATTTGGAGG
AAGACGACAAAAAGGTGAGCATAATTGAGTACTTCGCACCCGTAGAAGCATTATCAAGAGAAGAACAAAATTTATTGAAAGAGAGGGCACGGCGGGTGTTAGAGCACGTG
GAAGCCGCGATTTCCTTCTCGGAAATCTGCAGCACGGCGGAGCGCCGCCCGTATGGGCTGTTGGTGGAATTTTTCCGGCGAGAACTGACGGAATCTGGAGAAAGTGAAGT
GGACGAAGAGGATCGTAAATGGCGGTGTTGGTTGCGGAAGGGAAACGAAGGTTATTTGAGAGAAATGGAGAGAGAGGGAAAATGGAGTGCGTTTGGTAATGAGGAGAAAG
TGGAATTAGGGCTTCAAATTGAAAGGGGGATTTTGGGCTCTTTGGTTTATGAGCTTCTATTTGATATTTTC
mRNA sequenceShow/hide mRNA sequence
ATGGCGGCGACTCTGTCATTCCCTAATTTTCCTTTCCCCCAAACGGATCACGTACTCGGATCGAAATCGACGAGAATGCTCAAAGATTTTCTCCAAGAAAGTAGCGGAAT
CGCTTCTTCGAAATCCAGAACGGCGTCGTTTAAAGCTCTGGCGATTCGCGCCGTGAAGAGGATCTCGATCTCGATTCCGATTTTGCCGAGAAGTCTCTCGCGGCGGCTGC
TGGGGAAGACAGAGAGAGATGAGAGAGAAATCGGAGGCGATTTCGTTGTTAAAATCAAGGACATCATACGCTGGAGATCGTTCCGGGATTTGGTCGACGAGACGACGGTG
GCGGCGGAGGCTCCGCCGCTTGATTTCGCCGATTCGCCGGATTGTTATACGGCCGCCGCCACGACTACCACGACCACGACGGCGACTAGCAGTAAGAGCTCCAGCTGGTG
CGAGAGCGATTTCACGGCGGAGGATTCGCCGTCGCCGTCGTGGAGGGGTTGCTCCGACGACGGCGAAGTAGGGAAAAAATATTTCCTATGTGCTGGTGAAGATTTGGAGG
AAGACGACAAAAAGGTGAGCATAATTGAGTACTTCGCACCCGTAGAAGCATTATCAAGAGAAGAACAAAATTTATTGAAAGAGAGGGCACGGCGGGTGTTAGAGCACGTG
GAAGCCGCGATTTCCTTCTCGGAAATCTGCAGCACGGCGGAGCGCCGCCCGTATGGGCTGTTGGTGGAATTTTTCCGGCGAGAACTGACGGAATCTGGAGAAAGTGAAGT
GGACGAAGAGGATCGTAAATGGCGGTGTTGGTTGCGGAAGGGAAACGAAGGTTATTTGAGAGAAATGGAGAGAGAGGGAAAATGGAGTGCGTTTGGTAATGAGGAGAAAG
TGGAATTAGGGCTTCAAATTGAAAGGGGGATTTTGGGCTCTTTGGTTTATGAGCTTCTATTTGATATTTTC
Protein sequenceShow/hide protein sequence
MAATLSFPNFPFPQTDHVLGSKSTRMLKDFLQESSGIASSKSRTASFKALAIRAVKRISISIPILPRSLSRRLLGKTERDEREIGGDFVVKIKDIIRWRSFRDLVDETTV
AAEAPPLDFADSPDCYTAAATTTTTTTATSSKSSSWCESDFTAEDSPSPSWRGCSDDGEVGKKYFLCAGEDLEEDDKKVSIIEYFAPVEALSREEQNLLKERARRVLEHV
EAAISFSEICSTAERRPYGLLVEFFRRELTESGESEVDEEDRKWRCWLRKGNEGYLREMEREGKWSAFGNEEKVELGLQIERGILGSLVYELLFDIF