CuGenDBv2

Moc04g23530 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g23530
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: GDSL esterase/lipase At5g03980-like
Genome location: chr4:16995779..17006595
RNA-Seq Expression: Moc04g23530
Synteny: Moc04g23530
Gene Ontology terms:
GO:0003676 - nucleic acid binding (molecular function)
GO:0016788 - hydrolase activity, acting on ester bonds (molecular function)
InterPro domains:
IPR001087 - GDSL lipase/esterase
IPR005162 - Retrotransposon gag domain
IPR036514 - SGNH hydrolase superfamily


Homology
GenBank top hits (e value, % identity, alignment)
XP_022150751.1 acetylajmalan esterase-like [Momordica charantia]; e value 3.5e-67; identity 88.72%
Query:  TNDTTAYDELHCLKGLKGLASYHNDQIKQAIEVLKRENPHTIIVYGDYYNALLWILRHAFVLGFDEASLQKSCCGIGGNYNFKLIKICGLLGVPVCPNPR
        TNDTTAYDELHCLKGL  LASYHND+IKQ IEVL+ ENPHT+IVYGDYYNALLWILRHAFVLGFDEASLQKSCCGIGGNYNF L+KICGL GVPVCPNP 
Subjt:  TNDTTAYDELHCLKGLKGLASYHNDQIKQAIEVLKRENPHTIIVYGDYYNALLWILRHAFVLGFDEASLQKSCCGIGGNYNFKLIKICGLLGVPVCPNPR

Query:  EHISWDGIHLTQKAYKFMAYWLIHDIFPKLYCI
        EHISWDGIHLT+KAYKFMAYW+I+DIFPKL+CI
Subjt:  EHISWDGIHLTQKAYKFMAYWLIHDIFPKLYCI

XP_022156343.1 GDSL esterase/lipase At5g03980-like [Momordica charantia]; e value 2.0e-70; identity 93.28%
Query:  TNDTTAYDELHCLKGLKGLASYHNDQIKQAIEVLKRENPHTIIVYGDYYNALLWILRHAFVLGFDEASLQKSCCGIGGNYNFKLIKICGLLGVPVCPNPR
        TNDTTAYDELHCLKGL  LASYHNDQIKQAIEVLKRENPHT+IVYGDYYNALLWILRHAFVLGFDEASLQKSCCGIGGNYNF LIKICGL  VP+CPNP 
Subjt:  TNDTTAYDELHCLKGLKGLASYHNDQIKQAIEVLKRENPHTIIVYGDYYNALLWILRHAFVLGFDEASLQKSCCGIGGNYNFKLIKICGLLGVPVCPNPR

Query:  EHISWDGIHLTQKAYKFMAYWLIHDIFPKLYCIV
        EHISWDGIHLTQKAYKFMAYWLIHDIFPKL+CIV
Subjt:  EHISWDGIHLTQKAYKFMAYWLIHDIFPKLYCIV

XP_022156428.1 acetylajmalan esterase-like [Momordica charantia]; e value 1.2e-70; identity 93.98%
Query:  TNDTTAYDELHCLKGLKGLASYHNDQIKQAIEVLKRENPHTIIVYGDYYNALLWILRHAFVLGFDEASLQKSCCGIGGNYNFKLIKICGLLGVPVCPNPR
        TNDTTAYDELHCLKGL  LASYHNDQIKQAIEVLKRENPHT+IVYGDYYNALLWILRHAFVLGFDEASLQKSCCGIGGNYNF L+KICGL GVPVCPNP 
Subjt:  TNDTTAYDELHCLKGLKGLASYHNDQIKQAIEVLKRENPHTIIVYGDYYNALLWILRHAFVLGFDEASLQKSCCGIGGNYNFKLIKICGLLGVPVCPNPR

Query:  EHISWDGIHLTQKAYKFMAYWLIHDIFPKLYCI
        EHISWDGIHLTQKAYKFMAYWLIHDIFPKL+CI
Subjt:  EHISWDGIHLTQKAYKFMAYWLIHDIFPKLYCI

XP_022158415.1 GDSL esterase/lipase At5g03980-like [Momordica charantia]; e value 3.2e-68; identity 88.72%
Query:  TNDTTAYDELHCLKGLKGLASYHNDQIKQAIEVLKRENPHTIIVYGDYYNALLWILRHAFVLGFDEASLQKSCCGIGGNYNFKLIKICGLLGVPVCPNPR
        TNDT AYDELHCLKG+  LASYHNDQIK AIEVL+RENPHT+IVYGDYYNALLWILRHAFVLGFDEASLQKSCCGIGGNYNF L+KICGL GVP+CPNP 
Subjt:  TNDTTAYDELHCLKGLKGLASYHNDQIKQAIEVLKRENPHTIIVYGDYYNALLWILRHAFVLGFDEASLQKSCCGIGGNYNFKLIKICGLLGVPVCPNPR

Query:  EHISWDGIHLTQKAYKFMAYWLIHDIFPKLYCI
        EHISWDGIHLT+KAYKFMAYW+IHDIFPKL+CI
Subjt:  EHISWDGIHLTQKAYKFMAYWLIHDIFPKLYCI

XP_022158836.1 uncharacterized protein LOC111025302 [Momordica charantia]; e value 3.3e-94; identity 71.98%
Query:  MNRNAQDPPPPQDPPMNGDMAVEGAANRAGEVLNLILLADNRDVAMRNYVTHAFHNLNSGISNSLPQAAQSELKLVMFHMLHTMGQFGRLTNKDPYSHLK
        MNRNAQDPPPPQ+PP+NGDMA E AANR GE+ NLILLADNRDVAMRNYVTHAFHNLNSGI+N LPQAAQ ELK VMF +L TMGQFG LTN+DPYSHLK
Subjt:  MNRNAQDPPPPQDPPMNGDMAVEGAANRAGEVLNLILLADNRDVAMRNYVTHAFHNLNSGISNSLPQAAQSELKLVMFHMLHTMGQFGRLTNKDPYSHLK

Query:  SFIEIANAFQLPGVSDDTLRLKMFYFSLRNSARTWLNALEPNSITTWAELTEKFLAKYHTLTRNGNLQEDF-----------------------------
        SFIEIANAFQLPG S+D LRLKMF FSLR+ ARTW+NALEPNSI TWAELT+KFLAKYHTLT+N +L+ED                              
Subjt:  SFIEIANAFQLPGVSDDTLRLKMFYFSLRNSARTWLNALEPNSITTWAELTEKFLAKYHTLTRNGNLQEDF-----------------------------

Query:  ------IEQFYRGLDRPSRMMLNTAANGSLLEKSVNEIVDILNKMTNINDQNELGRS
              IEQFYRGLDR S+MMLNT ANGSLLEKSVNEIVD+LNKMT+INDQ E+GRS
Subjt:  ------IEQFYRGLDRPSRMMLNTAANGSLLEKSVNEIVDILNKMTNINDQNELGRS

TrEMBL top hits (e value, % identity, alignment)
A0A6J1DCG0 acetylajmalan esterase-like; e value 1.7e-67; identity 88.72%
Query:  TNDTTAYDELHCLKGLKGLASYHNDQIKQAIEVLKRENPHTIIVYGDYYNALLWILRHAFVLGFDEASLQKSCCGIGGNYNFKLIKICGLLGVPVCPNPR
        TNDTTAYDELHCLKGL  LASYHND+IKQ IEVL+ ENPHT+IVYGDYYNALLWILRHAFVLGFDEASLQKSCCGIGGNYNF L+KICGL GVPVCPNP 
Subjt:  TNDTTAYDELHCLKGLKGLASYHNDQIKQAIEVLKRENPHTIIVYGDYYNALLWILRHAFVLGFDEASLQKSCCGIGGNYNFKLIKICGLLGVPVCPNPR

Query:  EHISWDGIHLTQKAYKFMAYWLIHDIFPKLYCI
        EHISWDGIHLT+KAYKFMAYW+I+DIFPKL+CI
Subjt:  EHISWDGIHLTQKAYKFMAYWLIHDIFPKLYCI

A0A6J1DQD3 GDSL esterase/lipase At5g03980-like; e value 9.6e-71; identity 93.28%
Query:  TNDTTAYDELHCLKGLKGLASYHNDQIKQAIEVLKRENPHTIIVYGDYYNALLWILRHAFVLGFDEASLQKSCCGIGGNYNFKLIKICGLLGVPVCPNPR
        TNDTTAYDELHCLKGL  LASYHNDQIKQAIEVLKRENPHT+IVYGDYYNALLWILRHAFVLGFDEASLQKSCCGIGGNYNF LIKICGL  VP+CPNP 
Subjt:  TNDTTAYDELHCLKGLKGLASYHNDQIKQAIEVLKRENPHTIIVYGDYYNALLWILRHAFVLGFDEASLQKSCCGIGGNYNFKLIKICGLLGVPVCPNPR

Query:  EHISWDGIHLTQKAYKFMAYWLIHDIFPKLYCIV
        EHISWDGIHLTQKAYKFMAYWLIHDIFPKL+CIV
Subjt:  EHISWDGIHLTQKAYKFMAYWLIHDIFPKLYCIV

A0A6J1DQL3 acetylajmalan esterase-like; e value 5.6e-71; identity 93.98%
Query:  TNDTTAYDELHCLKGLKGLASYHNDQIKQAIEVLKRENPHTIIVYGDYYNALLWILRHAFVLGFDEASLQKSCCGIGGNYNFKLIKICGLLGVPVCPNPR
        TNDTTAYDELHCLKGL  LASYHNDQIKQAIEVLKRENPHT+IVYGDYYNALLWILRHAFVLGFDEASLQKSCCGIGGNYNF L+KICGL GVPVCPNP 
Subjt:  TNDTTAYDELHCLKGLKGLASYHNDQIKQAIEVLKRENPHTIIVYGDYYNALLWILRHAFVLGFDEASLQKSCCGIGGNYNFKLIKICGLLGVPVCPNPR

Query:  EHISWDGIHLTQKAYKFMAYWLIHDIFPKLYCI
        EHISWDGIHLTQKAYKFMAYWLIHDIFPKL+CI
Subjt:  EHISWDGIHLTQKAYKFMAYWLIHDIFPKLYCI

A0A6J1DVS3 GDSL esterase/lipase At5g03980-like; e value 1.5e-68; identity 88.72%
Query:  TNDTTAYDELHCLKGLKGLASYHNDQIKQAIEVLKRENPHTIIVYGDYYNALLWILRHAFVLGFDEASLQKSCCGIGGNYNFKLIKICGLLGVPVCPNPR
        TNDT AYDELHCLKG+  LASYHNDQIK AIEVL+RENPHT+IVYGDYYNALLWILRHAFVLGFDEASLQKSCCGIGGNYNF L+KICGL GVP+CPNP 
Subjt:  TNDTTAYDELHCLKGLKGLASYHNDQIKQAIEVLKRENPHTIIVYGDYYNALLWILRHAFVLGFDEASLQKSCCGIGGNYNFKLIKICGLLGVPVCPNPR

Query:  EHISWDGIHLTQKAYKFMAYWLIHDIFPKLYCI
        EHISWDGIHLT+KAYKFMAYW+IHDIFPKL+CI
Subjt:  EHISWDGIHLTQKAYKFMAYWLIHDIFPKLYCI

A0A6J1E251 uncharacterized protein LOC111025302; e value 1.6e-94; identity 71.98%
Query:  MNRNAQDPPPPQDPPMNGDMAVEGAANRAGEVLNLILLADNRDVAMRNYVTHAFHNLNSGISNSLPQAAQSELKLVMFHMLHTMGQFGRLTNKDPYSHLK
        MNRNAQDPPPPQ+PP+NGDMA E AANR GE+ NLILLADNRDVAMRNYVTHAFHNLNSGI+N LPQAAQ ELK VMF +L TMGQFG LTN+DPYSHLK
Subjt:  MNRNAQDPPPPQDPPMNGDMAVEGAANRAGEVLNLILLADNRDVAMRNYVTHAFHNLNSGISNSLPQAAQSELKLVMFHMLHTMGQFGRLTNKDPYSHLK

Query:  SFIEIANAFQLPGVSDDTLRLKMFYFSLRNSARTWLNALEPNSITTWAELTEKFLAKYHTLTRNGNLQEDF-----------------------------
        SFIEIANAFQLPG S+D LRLKMF FSLR+ ARTW+NALEPNSI TWAELT+KFLAKYHTLT+N +L+ED                              
Subjt:  SFIEIANAFQLPGVSDDTLRLKMFYFSLRNSARTWLNALEPNSITTWAELTEKFLAKYHTLTRNGNLQEDF-----------------------------

Query:  ------IEQFYRGLDRPSRMMLNTAANGSLLEKSVNEIVDILNKMTNINDQNELGRS
              IEQFYRGLDR S+MMLNT ANGSLLEKSVNEIVD+LNKMT+INDQ E+GRS
Subjt:  ------IEQFYRGLDRPSRMMLNTAANGSLLEKSVNEIVDILNKMTNINDQNELGRS

SwissProt top hits (e value, % identity, alignment)
Q3MKY2 Acetylajmalan esterase; e value 1.3e-27; identity 45.6%
Query:  DELHCLKGLKGLASYHNDQIKQAIEVLKRENPHTIIVYGDYYNALLWILRHAFVLGFDEASLQKSCCGIGGNYNFKLIKICGLLGVPVCPNPREHISWDG
        D+L CL  L  L+ Y N   ++A+  L  E P  +I+Y DYYNA  ++ R+   LG +  SL K CCGIGG YN+   + CG  GVPVCPNP ++I WDG
Subjt:  DELHCLKGLKGLASYHNDQIKQAIEVLKRENPHTIIVYGDYYNALLWILRHAFVLGFDEASLQKSCCGIGGNYNFKLIKICGLLGVPVCPNPREHISWDG

Query:  IHLTQKAYKFMAYWLIHDIFPKLYC
         H TQ AY+ +A ++I  I   L C
Subjt:  IHLTQKAYKFMAYWLIHDIFPKLYC

Q94F40 GDSL esterase/lipase At1g28600; e value 3.2e-23; identity 40.83%
Query:  TNDTTAYD-ELHCLKGLKGLASYHNDQIKQAIEVLKRENPHTIIVYGDYYNALLWILRHAFVLGFDEASLQKSCCGIGGNYNFKLIKICGLLGVPVCPNP
        T++   YD    CLK L     YH++++K  +  L++  PH  I+Y DYYN+LL I +     GF E     +CCGIGG YNF   + CG +GV  C +P
Subjt:  TNDTTAYD-ELHCLKGLKGLASYHNDQIKQAIEVLKRENPHTIIVYGDYYNALLWILRHAFVLGFDEASLQKSCCGIGGNYNFKLIKICGLLGVPVCPNP

Query:  REHISWDGIHLTQKAYKFMA
         +++ WDG+H+T+ AYK++A
Subjt:  REHISWDGIHLTQKAYKFMA

Q9C857 GDSL esterase/lipase At1g31550; e value 2.9e-24; identity 41.94%
Query:  TNDTTAYDEL-HCLKGLKGLASYHNDQIKQAIEVLKRENPHTIIVYGDYYNALLWILRHAFVLGFDEASLQKSCCGIGGNYNFKLIKICGLLGVPVCPNP
        T++   YD L  CLK L     YH++Q+++ +  L++ NPH  I+Y DYYNA L + R     GF    L  +CCG+GG YNF L + CG +GV  C +P
Subjt:  TNDTTAYDEL-HCLKGLKGLASYHNDQIKQAIEVLKRENPHTIIVYGDYYNALLWILRHAFVLGFDEASLQKSCCGIGGNYNFKLIKICGLLGVPVCPNP

Query:  REHISWDGIHLTQKAYKFMAYWLI
         ++++WDG+H+T+ A+K MA  L+
Subjt:  REHISWDGIHLTQKAYKFMAYWLI

Q9LZB2 GDSL esterase/lipase At5g03980; e value 7.9e-30; identity 48.46%
Query:  DTTAYDELHCLKGLKGLASYHNDQIKQAIEVLKRENPHTIIVYGDYYNALLWILRHAFVLGFDEASLQKSCCGIGGNYNFKLIKICGLLGVPVCPNPREH
        DT  YD+  CL  L   A  HN+Q+++AI  L++E P   IVYGDYYNA  ++LR      FD++   KSCCG GG YN+   +  G +GVPVC NP + 
Subjt:  DTTAYDELHCLKGLKGLASYHNDQIKQAIEVLKRENPHTIIVYGDYYNALLWILRHAFVLGFDEASLQKSCCGIGGNYNFKLIKICGLLGVPVCPNPREH

Query:  ISWDGIHLTQKAYKFMAYWLIHDIFPKLYC
        ISWDG+HLTQKAY+FM+ +L + I  ++ C
Subjt:  ISWDGIHLTQKAYKFMAYWLIHDIFPKLYC

Q9SHP6 GDSL esterase/lipase At1g28610; e value 4.2e-23; identity 40.32%
Query:  TNDTTAYDEL-HCLKGLKGLASYHNDQIKQAIEVLKRENPHTIIVYGDYYNALLWILRHAFVLGFDEASLQKSCCGIGGNYNFKLIKICGLLGVPVCPNP
        T++   YD L  CL  L   + Y+N++++  +  L +  PH  I+YGDY+NALL + +     GF +  L  +CCG+GG YNF L K CG +GV  C +P
Subjt:  TNDTTAYDEL-HCLKGLKGLASYHNDQIKQAIEVLKRENPHTIIVYGDYYNALLWILRHAFVLGFDEASLQKSCCGIGGNYNFKLIKICGLLGVPVCPNP

Query:  REHISWDGIHLTQKAYKFMAYWLI
         ++++WDG+H+T+ AYK++A  L+
Subjt:  REHISWDGIHLTQKAYKFMAYWLI

Arabidopsis top hits (e value, % identity, alignment)
AT1G28590.1 GDSL-like Lipase/Acylhydrolase superfamily protein; e value 1.5e-23; identity 38.71%
Query:  TNDTTAYDEL-HCLKGLKGLASYHNDQIKQAIEVLKRENPHTIIVYGDYYNALLWILRHAFVLGFDEASLQKSCCGIGGNYNFKLIKICGLLGVPVCPNP
        T++   YD L  CLK L   + Y+N Q+++ +  L++  PH  I+Y DYYNALL + +     GF    L  +CCG+GG+YNF   + CG +GV  C +P
Subjt:  TNDTTAYDEL-HCLKGLKGLASYHNDQIKQAIEVLKRENPHTIIVYGDYYNALLWILRHAFVLGFDEASLQKSCCGIGGNYNFKLIKICGLLGVPVCPNP

Query:  REHISWDGIHLTQKAYKFMAYWLI
         +++++DGIH+T+ AY+ ++  L+
Subjt:  REHISWDGIHLTQKAYKFMAYWLI

AT1G28600.1 GDSL-like Lipase/Acylhydrolase superfamily protein; e value 2.3e-24; identity 40.83%
Query:  TNDTTAYD-ELHCLKGLKGLASYHNDQIKQAIEVLKRENPHTIIVYGDYYNALLWILRHAFVLGFDEASLQKSCCGIGGNYNFKLIKICGLLGVPVCPNP
        T++   YD    CLK L     YH++++K  +  L++  PH  I+Y DYYN+LL I +     GF E     +CCGIGG YNF   + CG +GV  C +P
Subjt:  TNDTTAYD-ELHCLKGLKGLASYHNDQIKQAIEVLKRENPHTIIVYGDYYNALLWILRHAFVLGFDEASLQKSCCGIGGNYNFKLIKICGLLGVPVCPNP

Query:  REHISWDGIHLTQKAYKFMA
         +++ WDG+H+T+ AYK++A
Subjt:  REHISWDGIHLTQKAYKFMA

AT1G28610.2 GDSL-like Lipase/Acylhydrolase superfamily protein; e value 3.0e-24; identity 40.32%
Query:  TNDTTAYDEL-HCLKGLKGLASYHNDQIKQAIEVLKRENPHTIIVYGDYYNALLWILRHAFVLGFDEASLQKSCCGIGGNYNFKLIKICGLLGVPVCPNP
        T++   YD L  CL  L   + Y+N++++  +  L +  PH  I+YGDY+NALL + +     GF +  L  +CCG+GG YNF L K CG +GV  C +P
Subjt:  TNDTTAYDEL-HCLKGLKGLASYHNDQIKQAIEVLKRENPHTIIVYGDYYNALLWILRHAFVLGFDEASLQKSCCGIGGNYNFKLIKICGLLGVPVCPNP

Query:  REHISWDGIHLTQKAYKFMAYWLI
         ++++WDG+H+T+ AYK++A  L+
Subjt:  REHISWDGIHLTQKAYKFMAYWLI

AT1G31550.2 GDSL-like Lipase/Acylhydrolase superfamily protein; e value 2.1e-25; identity 41.94%
Query:  TNDTTAYDEL-HCLKGLKGLASYHNDQIKQAIEVLKRENPHTIIVYGDYYNALLWILRHAFVLGFDEASLQKSCCGIGGNYNFKLIKICGLLGVPVCPNP
        T++   YD L  CLK L     YH++Q+++ +  L++ NPH  I+Y DYYNA L + R     GF    L  +CCG+GG YNF L + CG +GV  C +P
Subjt:  TNDTTAYDEL-HCLKGLKGLASYHNDQIKQAIEVLKRENPHTIIVYGDYYNALLWILRHAFVLGFDEASLQKSCCGIGGNYNFKLIKICGLLGVPVCPNP

Query:  REHISWDGIHLTQKAYKFMAYWLI
         ++++WDG+H+T+ A+K MA  L+
Subjt:  REHISWDGIHLTQKAYKFMAYWLI

AT5G03980.1 SGNH hydrolase-type esterase superfamily protein; e value 5.6e-31; identity 48.46%
Query:  DTTAYDELHCLKGLKGLASYHNDQIKQAIEVLKRENPHTIIVYGDYYNALLWILRHAFVLGFDEASLQKSCCGIGGNYNFKLIKICGLLGVPVCPNPREH
        DT  YD+  CL  L   A  HN+Q+++AI  L++E P   IVYGDYYNA  ++LR      FD++   KSCCG GG YN+   +  G +GVPVC NP + 
Subjt:  DTTAYDELHCLKGLKGLASYHNDQIKQAIEVLKRENPHTIIVYGDYYNALLWILRHAFVLGFDEASLQKSCCGIGGNYNFKLIKICGLLGVPVCPNPREH

Query:  ISWDGIHLTQKAYKFMAYWLIHDIFPKLYC
        ISWDG+HLTQKAY+FM+ +L + I  ++ C
Subjt:  ISWDGIHLTQKAYKFMAYWLIHDIFPKLYC


Sequences
CDS sequence
ATGAATCGAAATGCACAAGATCCTCCACCTCCACAAGATCCACCTATGAACGGAGATATGGCAGTTGAAGGAGCAGCAAACCGAGCAGGAGAAGTTCTTAATCTA
ATCCTGTTAGCTGATAACCGAGATGTAGCCATGCGAAACTATGTCACTCATGCGTTCCACAACTTAAATTCAGGGATAAGTAATTCTTTACCCCAAGCCGCGCAG
TCCGAGCTTAAGCTAGTCATGTTCCACATGTTGCACACGATGGGCCAGTTTGGAAGATTGACTAACAAAGATCCCTATTCCCATCTCAAATCTTTTATTGAAATA
GCTAATGCATTTCAACTTCCTGGTGTTTCTGACGATACACTAAGACTAAAAATGTTTTATTTTTCTCTCAGGAACAGTGCAAGGACTTGGCTAAATGCACTAGAA
CCAAATTCTATCACCACATGGGCTGAACTGACGGAGAAATTCTTGGCAAAGTACCATACTTTGACAAGGAACGGAAACCTTCAAGAAGATTTTATTGAGCAATTT
TATAGAGGATTGGATCGTCCATCGAGAATGATGTTAAACACTGCAGCCAATGGCTCTTTGTTAGAGAAGTCAGTTAATGAGATCGTTGATATTTTGAACAAGATG
ACGAACATCAATGATCAAAATGAATTAGGAAGGTCAAGGTTCCCAGTGATTACGCTTGTTTACATGCGTAAAAAGAAGGACCATTTGGGAGTGCAAATTAATCAA
AAGAAGCAAAAAGACGGAAAAACCTTCATTGGAGGCGCCAGGCGCCTGGTGTGCGCTTGCATCTATAGAATTGTTACCACGCTTAGTGCGTGCGTTCAGCATAAC
CTCAGTCACCCAGCGGAACTTCAATCCATATTCAAACACCTACAACCCTGCACTATCCAAACTTCTCTTGGAACAAAGGAAGAGCATGTAGCAGAGCGGAAGCAG
CTGCTCAATAATTCAAACAAAACTACACTCTTCCTGGTTTTCCGACACAACCTATTTTGCCGCCACAACAGTATAATCAACAAAGAGTGCAAAATAACACTCAGC
AAAGTGGGGCAGTTAGCGAAGGACCTAAACTCTAGATCACAAGGTAAATTGCCTGGACACACAGAGAATCCGAAGCGAGATCGTGAAGGTAAAGAGCAATGTAAG
GCAATCATCACAAGAAAAGGACTAAGTTATGACGGACCCTCACTTCCAGATGGAGGAACCGATGTGGCTACACCTATTTCCACATCAGTCTCCACTCCACAACCA
GAAGAGAAAGCAGAACCCGTAAGTTCAAAGGAGAAAGGTAAGAAGACCGACAAAGGAGTAGTGCCTTGCACTAGTCCGCACATAAATGCTTTAGAACAGATGCCT
AATTACACCAAGTTTTTAAAAGATATCATTTCTAGGCATAAGAAATTAGGTGAGCATGAGACGGTAGCCCTAACAAAATGCAGTAGTGATACTCTAGAGAATCTC
TTGCTTGTTAAGTGTAGGACCCAGGATTGCTCCCACCAGCACGATCCCAAGACCCAAGAGGATAGCGAGGAAGACATAGTGGTGGTGTTCGAGGGAAACTCGTTG
AAGAAACGTTCTTCAAAGTCCGGGTCGCTCGAAAGTCGTTCGTGGCGTCGGATTAGGCAAAAAGTTGCAGAAAACAGCGAAGAAGACGAAGCAAACTGCGCAGAC
AGCGCCATGGCGCTGCGGGGACAGCACACAGCGCCACGGCGCTGCACTGTAGCGCTGCGGCGCTGCTGCTGCAGTTTTTTTCTGCAGCAGCGCTGTGGCGCTGCC
CTTAGGCGTAGAGGCGCTATCCCGGGTGTTCTTCGGCGCGTTTCCGTGGCTCCGATTCGCGGTAATGCAAATTCTGAAAAAGTGACTTTAGAACCTCATGTTGCT
AGGGTTAGTGAAGGAGGCCAGAGTGAGAAGAAATTAGAAGAATTTAGCAAGGCCTACCTTAGAAAAAATCAATTCATGGGTGATAAAGGTTCTGATTTAGATGAG
AGAATTGCTAGGCTTAACGAGAGAGTTGAGATTAAGGTCAAGAATAGGTTAGTGGTGACCGAGCACGACACAAAGTCATTGGAGCACTCAGATTCAACCATGGTC
GAAATACATTGCCAAATTGCGCCTGGCGCAATTTTGGAGGATACTCCACCGGCCACTCTACAAGGAGAAGCCAACGTCAAAAATTGCACCAATGATACAACCGCT
TATGATGAACTTCATTGTTTGAAGGGTTTGAAGGGTTTGGCAAGTTATCACAATGATCAAATCAAGCAAGCTATTGAAGTACTGAAAAGGGAGAATCCACATACT
ATTATCGTATATGGCGACTACTATAATGCATTGCTTTGGATTCTTCGCCATGCTTTTGTGCTCGGATTTGATGAAGCTTCTTTGCAAAAATCATGTTGTGGGATT
GGAGGCAACTACAACTTTAAACTCATAAAGATTTGCGGACTTCTTGGTGTACCAGTTTGCCCAAACCCTCGTGAACATATAAGTTGGGATGGAATCCATTTGACG
CAAAAAGCTTACAAATTCATGGCATATTGGCTCATCCACGACATCTTTCCAAAATTGTACTGCATTGTTTAA
mRNA sequence
ATGAATCGAAATGCACAAGATCCTCCACCTCCACAAGATCCACCTATGAACGGAGATATGGCAGTTGAAGGAGCAGCAAACCGAGCAGGAGAAGTTCTTAATCTA
ATCCTGTTAGCTGATAACCGAGATGTAGCCATGCGAAACTATGTCACTCATGCGTTCCACAACTTAAATTCAGGGATAAGTAATTCTTTACCCCAAGCCGCGCAG
TCCGAGCTTAAGCTAGTCATGTTCCACATGTTGCACACGATGGGCCAGTTTGGAAGATTGACTAACAAAGATCCCTATTCCCATCTCAAATCTTTTATTGAAATA
GCTAATGCATTTCAACTTCCTGGTGTTTCTGACGATACACTAAGACTAAAAATGTTTTATTTTTCTCTCAGGAACAGTGCAAGGACTTGGCTAAATGCACTAGAA
CCAAATTCTATCACCACATGGGCTGAACTGACGGAGAAATTCTTGGCAAAGTACCATACTTTGACAAGGAACGGAAACCTTCAAGAAGATTTTATTGAGCAATTT
TATAGAGGATTGGATCGTCCATCGAGAATGATGTTAAACACTGCAGCCAATGGCTCTTTGTTAGAGAAGTCAGTTAATGAGATCGTTGATATTTTGAACAAGATG
ACGAACATCAATGATCAAAATGAATTAGGAAGGTCAAGGTTCCCAGTGATTACGCTTGTTTACATGCGTAAAAAGAAGGACCATTTGGGAGTGCAAATTAATCAA
AAGAAGCAAAAAGACGGAAAAACCTTCATTGGAGGCGCCAGGCGCCTGGTGTGCGCTTGCATCTATAGAATTGTTACCACGCTTAGTGCGTGCGTTCAGCATAAC
CTCAGTCACCCAGCGGAACTTCAATCCATATTCAAACACCTACAACCCTGCACTATCCAAACTTCTCTTGGAACAAAGGAAGAGCATGTAGCAGAGCGGAAGCAG
CTGCTCAATAATTCAAACAAAACTACACTCTTCCTGGTTTTCCGACACAACCTATTTTGCCGCCACAACAGTATAATCAACAAAGAGTGCAAAATAACACTCAGC
AAAGTGGGGCAGTTAGCGAAGGACCTAAACTCTAGATCACAAGGTAAATTGCCTGGACACACAGAGAATCCGAAGCGAGATCGTGAAGGTAAAGAGCAATGTAAG
GCAATCATCACAAGAAAAGGACTAAGTTATGACGGACCCTCACTTCCAGATGGAGGAACCGATGTGGCTACACCTATTTCCACATCAGTCTCCACTCCACAACCA
GAAGAGAAAGCAGAACCCGTAAGTTCAAAGGAGAAAGGTAAGAAGACCGACAAAGGAGTAGTGCCTTGCACTAGTCCGCACATAAATGCTTTAGAACAGATGCCT
AATTACACCAAGTTTTTAAAAGATATCATTTCTAGGCATAAGAAATTAGGTGAGCATGAGACGGTAGCCCTAACAAAATGCAGTAGTGATACTCTAGAGAATCTC
TTGCTTGTTAAGTGTAGGACCCAGGATTGCTCCCACCAGCACGATCCCAAGACCCAAGAGGATAGCGAGGAAGACATAGTGGTGGTGTTCGAGGGAAACTCGTTG
AAGAAACGTTCTTCAAAGTCCGGGTCGCTCGAAAGTCGTTCGTGGCGTCGGATTAGGCAAAAAGTTGCAGAAAACAGCGAAGAAGACGAAGCAAACTGCGCAGAC
AGCGCCATGGCGCTGCGGGGACAGCACACAGCGCCACGGCGCTGCACTGTAGCGCTGCGGCGCTGCTGCTGCAGTTTTTTTCTGCAGCAGCGCTGTGGCGCTGCC
CTTAGGCGTAGAGGCGCTATCCCGGGTGTTCTTCGGCGCGTTTCCGTGGCTCCGATTCGCGGTAATGCAAATTCTGAAAAAGTGACTTTAGAACCTCATGTTGCT
AGGGTTAGTGAAGGAGGCCAGAGTGAGAAGAAATTAGAAGAATTTAGCAAGGCCTACCTTAGAAAAAATCAATTCATGGGTGATAAAGGTTCTGATTTAGATGAG
AGAATTGCTAGGCTTAACGAGAGAGTTGAGATTAAGGTCAAGAATAGGTTAGTGGTGACCGAGCACGACACAAAGTCATTGGAGCACTCAGATTCAACCATGGTC
GAAATACATTGCCAAATTGCGCCTGGCGCAATTTTGGAGGATACTCCACCGGCCACTCTACAAGGAGAAGCCAACGTCAAAAATTGCACCAATGATACAACCGCT
TATGATGAACTTCATTGTTTGAAGGGTTTGAAGGGTTTGGCAAGTTATCACAATGATCAAATCAAGCAAGCTATTGAAGTACTGAAAAGGGAGAATCCACATACT
ATTATCGTATATGGCGACTACTATAATGCATTGCTTTGGATTCTTCGCCATGCTTTTGTGCTCGGATTTGATGAAGCTTCTTTGCAAAAATCATGTTGTGGGATT
GGAGGCAACTACAACTTTAAACTCATAAAGATTTGCGGACTTCTTGGTGTACCAGTTTGCCCAAACCCTCGTGAACATATAAGTTGGGATGGAATCCATTTGACG
CAAAAAGCTTACAAATTCATGGCATATTGGCTCATCCACGACATCTTTCCAAAATTGTACTGCATTGTTTAA
Protein sequence
MNRNAQDPPPPQDPPMNGDMAVEGAANRAGEVLNLILLADNRDVAMRNYVTHAFHNLNSGISNSLPQAAQSELKLVMFHMLHTMGQFGRLTNKDPYSHLKSFIEI
ANAFQLPGVSDDTLRLKMFYFSLRNSARTWLNALEPNSITTWAELTEKFLAKYHTLTRNGNLQEDFIEQFYRGLDRPSRMMLNTAANGSLLEKSVNEIVDILNKM
TNINDQNELGRSRFPVITLVYMRKKKDHLGVQINQKKQKDGKTFIGGARRLVCACIYRIVTTLSACVQHNLSHPAELQSIFKHLQPCTIQTSLGTKEEHVAERKQ
LLNNSNKTTLFLVFRHNLFCRHNSIINKECKITLSKVGQLAKDLNSRSQGKLPGHTENPKRDREGKEQCKAIITRKGLSYDGPSLPDGGTDVATPISTSVSTPQP
EEKAEPVSSKEKGKKTDKGVVPCTSPHINALEQMPNYTKFLKDIISRHKKLGEHETVALTKCSSDTLENLLLVKCRTQDCSHQHDPKTQEDSEEDIVVVFEGNSL
KKRSSKSGSLESRSWRRIRQKVAENSEEDEANCADSAMALRGQHTAPRRCTVALRRCCCSFFLQQRCGAALRRRGAIPGVLRRVSVAPIRGNANSEKVTLEPHVA
RVSEGGQSEKKLEEFSKAYLRKNQFMGDKGSDLDERIARLNERVEIKVKNRLVVTEHDTKSLEHSDSTMVEIHCQIAPGAILEDTPPATLQGEANVKNCTNDTTA
YDELHCLKGLKGLASYHNDQIKQAIEVLKRENPHTIIVYGDYYNALLWILRHAFVLGFDEASLQKSCCGIGGNYNFKLIKICGLLGVPVCPNPREHISWDGIHLT
QKAYKFMAYWLIHDIFPKLYCIV