; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC09g1508 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC09g1508
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionAdenine nucleotide alpha hydrolases-like superfamily protein
Genome locationMC09:20922843..20927399
RNA-Seq ExpressionMC09g1508
SyntenyMC09g1508
Gene Ontology termsNA
InterPro domainsIPR006015 - Universal stress protein A family
IPR006016 - UspA
IPR014729 - Rossmann-like alpha/beta/alpha sandwich fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7013016.1 Universal stress protein A-like protein, partial [Cucurbita argyrosperma subsp. argyrosperma]6.99e-9182.08Show/hide
Query:  MAEA------VEKRVMVAIDESECSYYALIWVLENLQQSLAESPLFVFTALPPPTIYTFGAGAAASLGLARTYCHVTSNTELANSIQENDKKVRCALLEK
        MAEA      VEKRVMVA+DESECSYYALIWVLENL+QS+A+SPLFVFTALPPPT YT GAGA  SLGLAR+Y  V SNTELA+++QENDKKVRC +LEK
Subjt:  MAEA------VEKRVMVAIDESECSYYALIWVLENLQQSLAESPLFVFTALPPPTIYTFGAGAAASLGLARTYCHVTSNTELANSIQENDKKVRCALLEK

Query:  AKDICAERGVAAISITEVGNPGTTICDAVEKLNINMLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVKKP
        AKDICAERGVAAISITEVG PG  IC+ VEKLNIN+LVLGD GLGRIKRALIGSVSNYCVQNAKC VLVVKKP
Subjt:  AKDICAERGVAAISITEVGNPGTTICDAVEKLNINMLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVKKP

XP_022150587.1 uncharacterized protein LOC111018690 isoform X1 [Momordica charantia]4.18e-114100Show/hide
Query:  MAEAVEKRVMVAIDESECSYYALIWVLENLQQSLAESPLFVFTALPPPTIYTFGAGAAASLGLARTYCHVTSNTELANSIQENDKKVRCALLEKAKDICA
        MAEAVEKRVMVAIDESECSYYALIWVLENLQQSLAESPLFVFTALPPPTIYTFGAGAAASLGLARTYCHVTSNTELANSIQENDKKVRCALLEKAKDICA
Subjt:  MAEAVEKRVMVAIDESECSYYALIWVLENLQQSLAESPLFVFTALPPPTIYTFGAGAAASLGLARTYCHVTSNTELANSIQENDKKVRCALLEKAKDICA

Query:  ERGVAAISITEVGNPGTTICDAVEKLNINMLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVKKP
        ERGVAAISITEVGNPGTTICDAVEKLNINMLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVKKP
Subjt:  ERGVAAISITEVGNPGTTICDAVEKLNINMLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVKKP

XP_022150590.1 uncharacterized protein LOC111018690 isoform X2 [Momordica charantia]2.93e-95100Show/hide
Query:  MAEAVEKRVMVAIDESECSYYALIWVLENLQQSLAESPLFVFTALPPPTIYTFGAGAAASLGLARTYCHVTSNTELANSIQENDKKVRCALLEKAKDICA
        MAEAVEKRVMVAIDESECSYYALIWVLENLQQSLAESPLFVFTALPPPTIYTFGAGAAASLGLARTYCHVTSNTELANSIQENDKKVRCALLEKAKDICA
Subjt:  MAEAVEKRVMVAIDESECSYYALIWVLENLQQSLAESPLFVFTALPPPTIYTFGAGAAASLGLARTYCHVTSNTELANSIQENDKKVRCALLEKAKDICA

Query:  ERGVAAISITEVGNPGTTICDAVEKLNINMLVLGDRGLGRIKR
        ERGVAAISITEVGNPGTTICDAVEKLNINMLVLGDRGLGRIKR
Subjt:  ERGVAAISITEVGNPGTTICDAVEKLNINMLVLGDRGLGRIKR

XP_022968136.1 uncharacterized protein LOC111467462 [Cucurbita maxima]1.06e-9282.66Show/hide
Query:  MAEA------VEKRVMVAIDESECSYYALIWVLENLQQSLAESPLFVFTALPPPTIYTFGAGAAASLGLARTYCHVTSNTELANSIQENDKKVRCALLEK
        MAEA      VEKRVMVA+DESECSYYALIWVLENL+QS+A+SPLFVFTALPPPT YT GAGA  SLGLAR+Y  V SNTELA+++QENDKKVRC +LEK
Subjt:  MAEA------VEKRVMVAIDESECSYYALIWVLENLQQSLAESPLFVFTALPPPTIYTFGAGAAASLGLARTYCHVTSNTELANSIQENDKKVRCALLEK

Query:  AKDICAERGVAAISITEVGNPGTTICDAVEKLNINMLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVKKP
        AKDICAERGVAAISITEVG PG  ICD VEKLNIN+LVLGD G+GRIKRALIGSVSNYCVQNAKCPVLVVKKP
Subjt:  AKDICAERGVAAISITEVGNPGTTICDAVEKLNINMLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVKKP

XP_023541640.1 universal stress protein A-like protein [Cucurbita pepo subsp. pepo]4.30e-9280.92Show/hide
Query:  MAEA------VEKRVMVAIDESECSYYALIWVLENLQQSLAESPLFVFTALPPPTIYTFGAGAAASLGLARTYCHVTSNTELANSIQENDKKVRCALLEK
        MAEA      VEKRVMVA+DESECSYYALIWVLENL+QS+ ++PLF+FTALPPPT YT GAGA  SLGLAR+Y  V SNTEL++++QENDKKVRC +LEK
Subjt:  MAEA------VEKRVMVAIDESECSYYALIWVLENLQQSLAESPLFVFTALPPPTIYTFGAGAAASLGLARTYCHVTSNTELANSIQENDKKVRCALLEK

Query:  AKDICAERGVAAISITEVGNPGTTICDAVEKLNINMLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVKKP
        AKDICAERGVAAISITEVG+PG TICD VEKLNIN+LVLGD G+GRIKRALIGSVSNYCVQNAKCPVLVVKKP
Subjt:  AKDICAERGVAAISITEVGNPGTTICDAVEKLNINMLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVKKP

TrEMBL top hitse value%identityAlignment
A0A5A7SXH5 Universal stress protein YxiE1.66e-8179.14Show/hide
Query:  EKRVMVAIDESECSYYALIWVLENLQQSLAESPLFVFTALPPP-TIYTFGAGAAASLGLARTYCHVTSNTELANSIQENDKKVRCALLEKAKDICAERGV
        EKRVMVAIDESE SYYALIWVLENL++S+A SPLF+FTALPPP T YT G        LAR+Y  + SNTE  ++IQENDKK+RC LLEKAKDICA RGV
Subjt:  EKRVMVAIDESECSYYALIWVLENLQQSLAESPLFVFTALPPP-TIYTFGAGAAASLGLARTYCHVTSNTELANSIQENDKKVRCALLEKAKDICAERGV

Query:  AAISITEVGNPGTTICDAVEKLNINMLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVKKP
        AAISITE G+PGTTICD VEKLNI++LVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVKKP
Subjt:  AAISITEVGNPGTTICDAVEKLNINMLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVKKP

A0A6J1D8W2 uncharacterized protein LOC111018690 isoform X21.42e-95100Show/hide
Query:  MAEAVEKRVMVAIDESECSYYALIWVLENLQQSLAESPLFVFTALPPPTIYTFGAGAAASLGLARTYCHVTSNTELANSIQENDKKVRCALLEKAKDICA
        MAEAVEKRVMVAIDESECSYYALIWVLENLQQSLAESPLFVFTALPPPTIYTFGAGAAASLGLARTYCHVTSNTELANSIQENDKKVRCALLEKAKDICA
Subjt:  MAEAVEKRVMVAIDESECSYYALIWVLENLQQSLAESPLFVFTALPPPTIYTFGAGAAASLGLARTYCHVTSNTELANSIQENDKKVRCALLEKAKDICA

Query:  ERGVAAISITEVGNPGTTICDAVEKLNINMLVLGDRGLGRIKR
        ERGVAAISITEVGNPGTTICDAVEKLNINMLVLGDRGLGRIKR
Subjt:  ERGVAAISITEVGNPGTTICDAVEKLNINMLVLGDRGLGRIKR

A0A6J1D9U3 uncharacterized protein LOC111018690 isoform X12.02e-114100Show/hide
Query:  MAEAVEKRVMVAIDESECSYYALIWVLENLQQSLAESPLFVFTALPPPTIYTFGAGAAASLGLARTYCHVTSNTELANSIQENDKKVRCALLEKAKDICA
        MAEAVEKRVMVAIDESECSYYALIWVLENLQQSLAESPLFVFTALPPPTIYTFGAGAAASLGLARTYCHVTSNTELANSIQENDKKVRCALLEKAKDICA
Subjt:  MAEAVEKRVMVAIDESECSYYALIWVLENLQQSLAESPLFVFTALPPPTIYTFGAGAAASLGLARTYCHVTSNTELANSIQENDKKVRCALLEKAKDICA

Query:  ERGVAAISITEVGNPGTTICDAVEKLNINMLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVKKP
        ERGVAAISITEVGNPGTTICDAVEKLNINMLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVKKP
Subjt:  ERGVAAISITEVGNPGTTICDAVEKLNINMLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVKKP

A0A6J1G084 uncharacterized protein LOC1114495323.45e-9181.5Show/hide
Query:  MAEA------VEKRVMVAIDESECSYYALIWVLENLQQSLAESPLFVFTALPPPTIYTFGAGAAASLGLARTYCHVTSNTELANSIQENDKKVRCALLEK
        MAEA      VEKRVMVA+DESECSYYALIWVLENL+QS+A+SPLF+FTALPP T YT GAGA  SLGLAR+   V SNTELA+++QENDKKVRC +LEK
Subjt:  MAEA------VEKRVMVAIDESECSYYALIWVLENLQQSLAESPLFVFTALPPPTIYTFGAGAAASLGLARTYCHVTSNTELANSIQENDKKVRCALLEK

Query:  AKDICAERGVAAISITEVGNPGTTICDAVEKLNINMLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVKKP
        AKDICAERGVAAISITEVG+PG TICD VEKLNIN+LVLGD G+GRIKRALIGSVSNYCVQNAKCPVLVVKKP
Subjt:  AKDICAERGVAAISITEVGNPGTTICDAVEKLNINMLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVKKP

A0A6J1HU14 uncharacterized protein LOC1114674625.12e-9382.66Show/hide
Query:  MAEA------VEKRVMVAIDESECSYYALIWVLENLQQSLAESPLFVFTALPPPTIYTFGAGAAASLGLARTYCHVTSNTELANSIQENDKKVRCALLEK
        MAEA      VEKRVMVA+DESECSYYALIWVLENL+QS+A+SPLFVFTALPPPT YT GAGA  SLGLAR+Y  V SNTELA+++QENDKKVRC +LEK
Subjt:  MAEA------VEKRVMVAIDESECSYYALIWVLENLQQSLAESPLFVFTALPPPTIYTFGAGAAASLGLARTYCHVTSNTELANSIQENDKKVRCALLEK

Query:  AKDICAERGVAAISITEVGNPGTTICDAVEKLNINMLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVKKP
        AKDICAERGVAAISITEVG PG  ICD VEKLNIN+LVLGD G+GRIKRALIGSVSNYCVQNAKCPVLVVKKP
Subjt:  AKDICAERGVAAISITEVGNPGTTICDAVEKLNINMLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVKKP

SwissProt top hitse value%identityAlignment
P42297 Universal stress protein YxiE2.1e-0828.48Show/hide
Query:  RVMVAIDESECSYYALIWVLENLQQSLAESPLFVFTALPPPTIYTFGAGAAASLGLARTYCHVTSNTELANSIQENDKKVRCALLEKAKDICAERGVAAI
        +++VAID S+ S  AL   +   ++  AE            +I   G  A  +        +V  +    + I+   KK    +LE AK+  AE+GV A 
Subjt:  RVMVAIDESECSYYALIWVLENLQQSLAESPLFVFTALPPPTIYTFGAGAAASLGLARTYCHVTSNTELANSIQENDKKVRCALLEKAKDICAERGVAAI

Query:  SITEVGNPGTTICDAVEKLNINMLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVK
        +I   G P   I +  ++  ++++V+G RG+  +K  ++GSVS+   Q + CPVL+V+
Subjt:  SITEVGNPGTTICDAVEKLNINMLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVK

P72817 Universal stress protein Sll16549.0e-0736.49Show/hide
Query:  LLEKAKDICAERGVAAISITEVGNPGTTICDAVEKLNINMLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVV
        LLE A+ + +++G+A  +I   G    TICD  +++N +++V+G RGLG     +  SV+   +  + CPVLVV
Subjt:  LLEKAKDICAERGVAAISITEVGNPGTTICDAVEKLNINMLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVV

P74148 Universal stress protein Sll13881.5e-0629.34Show/hide
Query:  RVMVAIDESECSYYALIWVLENLQQSLA-----ESPLFVFTALPPP----TIYTFGAGAAASLGLARTYCHVTSNTELANSIQENDKKVRCALLEKAKDI
        +++VA+D SE +        E LQQ++A      S L VF  +P      +IY    G AA +G ++          +   ++E   + R   L+     
Subjt:  RVMVAIDESECSYYALIWVLENLQQSLA-----ESPLFVFTALPPP----TIYTFGAGAAASLGLARTYCHVTSNTELANSIQENDKKVRCALLEKAKDI

Query:  CAERGVAAISITEVGNPGTTICDAVEKLNINMLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVK
          E GVA     +VG PG  I D  +  + +++VLG RGL  +    +GSVS+Y + + +C VL+V+
Subjt:  CAERGVAAISITEVGNPGTTICDAVEKLNINMLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVK

Q57951 Universal stress protein MJ05314.0e-0739.47Show/hide
Query:  LEKAKDICAERGVAAISITEVGNPGTTICDAVEKLNINMLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVKKP
        L+K K +  E GV   +    G P   I +  EK   +++V+G  G   ++R L+GSV+   ++NA CPVLVVKKP
Subjt:  LEKAKDICAERGVAAISITEVGNPGTTICDAVEKLNINMLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVKKP

Q8LGG8 Universal stress protein A-like protein1.0e-1032.99Show/hide
Query:  VTSNTELANSIQENDKKVRCALLEKAKDICAERGVAAISITEVGNPGTTICDAVEKLNINMLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVKK
        + ++ E    +++++K     LLE   + C E GV   +  + G+P   IC  V+++  + LV+G RGLGR ++  +G+VS +CV++A+CPV+ +K+
Subjt:  VTSNTELANSIQENDKKVRCALLEKAKDICAERGVAAISITEVGNPGTTICDAVEKLNINMLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVKK

Arabidopsis top hitse value%identityAlignment
AT1G09740.1 Adenine nucleotide alpha hydrolases-like superfamily protein4.6e-2237.74Show/hide
Query:  VMVAIDESECSYYALIWVLENLQ--QSLAESPLFVFTALPPPTIYTFGAGAAASLGLARTYCHVTSNTELANSIQENDKKVRCALLEKAKDICAERGVAA
        V+VA+D SE S  AL W L+NL+   S ++S   V    P P++    +      G   +   V + T    +I+++ K++   +LE A  ICAE+ V  
Subjt:  VMVAIDESECSYYALIWVLENLQ--QSLAESPLFVFTALPPPTIYTFGAGAAASLGLARTYCHVTSNTELANSIQENDKKVRCALLEKAKDICAERGVAA

Query:  ISITEVGNPGTTICDAVEKLNINMLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVK
         +   +G+P   IC+AVE L+ ++LV+G R  GRIKR  +GSVSNYC  +A CPV+++K
Subjt:  ISITEVGNPGTTICDAVEKLNINMLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVK

AT1G68300.1 Adenine nucleotide alpha hydrolases-like superfamily protein3.7e-3246.99Show/hide
Query:  EAVEKRVMVAIDESECSYYALIWVLENLQQSLAESPLFVFTALPP---PTIYTFGAGAAASLGLARTYCHVTSNTELANSIQENDKKVRCALLEKAKDIC
        ++V K+VMVAIDESECS  AL W L  L+ SLA+S + +FTA P      +Y    GAA                EL NS+QE+ K      L++   IC
Subjt:  EAVEKRVMVAIDESECSYYALIWVLENLQQSLAESPLFVFTALPP---PTIYTFGAGAAASLGLARTYCHVTSNTELANSIQENDKKVRCALLEKAKDIC

Query:  AERGVAAISITEVGNPGTTICDAVEKLNINMLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVK
        AE GV    + E GNP   IC+A EKL ++MLV+G  G G ++R  +GSVSNYCV NAKCPVLVV+
Subjt:  AERGVAAISITEVGNPGTTICDAVEKLNINMLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVK

AT3G11930.1 Adenine nucleotide alpha hydrolases-like superfamily protein9.8e-2536.14Show/hide
Query:  KRVMVAIDESECSYYALIWVLEN-----LQQSLAESPLFVFTALPPPTIYTFGAGAAASLGLARTYCHVTSNTELANSIQENDKKVRCALLEKAKDICAE
        KR++VAIDES+ S+YAL WV+++     L  + AE+   + T +   + +   A   A  G A  Y    +++ +  S+++  ++   ALL +A  +C  
Subjt:  KRVMVAIDESECSYYALIWVLEN-----LQQSLAESPLFVFTALPPPTIYTFGAGAAASLGLARTYCHVTSNTELANSIQENDKKVRCALLEKAKDICAE

Query:  RGVAAISITEVGNPGTTICDAVEKLNINMLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVKKP
        + +   ++   G     IC+AVEK+++++LV+G RGLG+IKRA +GSVS+YC  +A CP+L+VK P
Subjt:  RGVAAISITEVGNPGTTICDAVEKLNINMLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVKKP

AT3G11930.2 Adenine nucleotide alpha hydrolases-like superfamily protein2.9e-2436.14Show/hide
Query:  KRVMVAIDESECSYYALIWVLEN-----LQQSLAESPLFVFTALPPPTIYTFGAGAAASLGLARTYCHVTSNTELANSIQENDKKVRCALLEKAKDICAE
        KR++VAIDES+ S+YAL WV+++     L  + AE+   + T +   + +   A   A  G A     V +++ +  S+++  ++   ALL +A  +C  
Subjt:  KRVMVAIDESECSYYALIWVLEN-----LQQSLAESPLFVFTALPPPTIYTFGAGAAASLGLARTYCHVTSNTELANSIQENDKKVRCALLEKAKDICAE

Query:  RGVAAISITEVGNPGTTICDAVEKLNINMLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVKKP
        + +   ++   G     IC+AVEK+++++LV+G RGLG+IKRA +GSVS+YC  +A CP+L+VK P
Subjt:  RGVAAISITEVGNPGTTICDAVEKLNINMLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVKKP

AT3G11930.4 Adenine nucleotide alpha hydrolases-like superfamily protein9.8e-2536.31Show/hide
Query:  KRVMVAIDESECSYYALIWVLENLQQSL-------AESPLFVFTALPPPTIYTFGAGAAASLGLARTYCHVTSNTELANSIQENDKKVRCALLEKAKDIC
        KR++VAIDES+ S+YAL WV+++    L       AES +     +  P    F   AA   G       V +++ +  S+++  ++   ALL +A  +C
Subjt:  KRVMVAIDESECSYYALIWVLENLQQSL-------AESPLFVFTALPPPTIYTFGAGAAASLGLARTYCHVTSNTELANSIQENDKKVRCALLEKAKDIC

Query:  AERGVAAISITEVGNPGTTICDAVEKLNINMLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVKKP
          + +   ++   G     IC+AVEK+++++LV+G RGLG+IKRA +GSVS+YC  +A CP+L+VK P
Subjt:  AERGVAAISITEVGNPGTTICDAVEKLNINMLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVKKP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGAAGCAGTCGAGAAGAGGGTGATGGTGGCCATAGACGAGAGCGAGTGTAGCTACTATGCCCTCATCTGGGTGCTCGAAAATCTTCAACAATCCCTAGCCGAGTC
CCCCCTTTTCGTCTTCACGGCTCTTCCTCCGCCCACGATTTACACCTTCGGCGCTGGTGCTGCTGCATCGCTTGGCCTTGCGCGCACGTATTGTCACGTTACATCCAATA
CGGAGTTGGCTAATTCGATCCAAGAGAATGACAAGAAAGTCAGATGCGCCCTCCTTGAGAAAGCAAAGGATATATGTGCTGAAAGAGGGGTGGCTGCTATATCAATCACA
GAGGTTGGGAATCCTGGAACAACCATATGTGATGCAGTTGAAAAGCTCAATATAAATATGCTTGTCTTAGGTGATCGTGGCCTTGGGAGAATTAAGAGAGCTCTTATAGG
GAGTGTGAGCAACTACTGTGTTCAAAATGCCAAGTGTCCTGTCCTTGTTGTGAAGAAACCATAG
mRNA sequenceShow/hide mRNA sequence
AAAGACAGGAGTGTAATGTAATTTGATATAATTGGTGATATCAATAATGAATTTTATAAGAAATAGTTTAGAGATATGTGAGAAAAAGCACACAGTGTCCCACCCGAATG
GTGCAGGAAAGAAAAATGTAAATAGGGAGAAAATAGAAATTTCTCTGGAAAATTTTGATTCAGAGTGAGTATAGCTCAACCAATAACGAAAGCCAACCAATCTTCCGCTT
CCGCCTTGAAATTAAAGAAGAAAAAAGATTGCAGAACATAAATGTGATAATGTGCTGGAGTCACAATGATGAATGAAGAAATTGGGCGGAGATTATCCTGATTCTCTATC
CTCGCTCAGAATTGGGCGCCACCCAAGAGAACCCACCATCAATTTACACTGTAGTACGCGCTGGAGCTTAAATTCCTTCTCGCCCAATTCTAATGGCGTAATTCACTTTC
TGGGCTTCCTTTCACAGTTCCTCCTCTAATTCCCCCTTCCAGAATGGCTGAAGCAGTCGAGAAGAGGGTGATGGTGGCCATAGACGAGAGCGAGTGTAGCTACTATGCCC
TCATCTGGGTGCTCGAAAATCTTCAACAATCCCTAGCCGAGTCCCCCCTTTTCGTCTTCACGGCTCTTCCTCCGCCCACGATTTACACCTTCGGCGCTGGTGCTGCTGCA
TCGCTTGGCCTTGCGCGCACGTATTGTCACGTTACATCCAATACGGAGTTGGCTAATTCGATCCAAGAGAATGACAAGAAAGTCAGATGCGCCCTCCTTGAGAAAGCAAA
GGATATATGTGCTGAAAGAGGGGTGGCTGCTATATCAATCACAGAGGTTGGGAATCCTGGAACAACCATATGTGATGCAGTTGAAAAGCTCAATATAAATATGCTTGTCT
TAGGTGATCGTGGCCTTGGGAGAATTAAGAGAGCTCTTATAGGGAGTGTGAGCAACTACTGTGTTCAAAATGCCAAGTGTCCTGTCCTTGTTGTGAAGAAACCATAGTAA
TTAAAGACGCAGGTTCACTATCGGCGCTCGGGAGAAGAAATATTTATCTGCAAGCAAGTTTTTAAGTTAAAATCTCTTCTGCCCAATGAACAGCTCAATCAAGCTTCGTA
AGCAGTCTACCTTCTAGTATAGAACCTTCACTTGCACCTTTGTATGCCCAGTCACAGGCTGCTTGGATACTTTCTGATGCATATCTGCACATTCGAGCGAAAGTTGAGGC
ATATTATTGAAATTATATGCTCTAAAAAATGAAGGATCAATGGACTGTTTCTTGTGGCACATACATTTCTGTGCATGCTACCTCATTAGAACTACACTTCTCCCACTCTT
CAACGTGATTTCTCCATCCCTTCTGTTGATCAGAAACAGCAAAAGAGAGAGGAAAAAAAAGCATTAGAATCTCTAAAACGATTAGGATCGAGCTTGTTTTTCAAGTTGAG
TTTTGACTTAAAAGTATATGACCGTGATATTCATTTGGATTGCATCAACCAGGCCATCCACACTGAAGTTGTAAAACTTTTCTTCTGCAGTCTCGATAATGCTGTCATCC
CAGATCTGCAAAAATCACAACCAACTCTGTATTTTCATGGATTAAAGACTAGAAATGTCAAGGGCACTCTGTTGAAGTTGGGTTATTATAATTACATGGTGAAGATTTTG
CTTCCTGGTGTACCAGTGAACATCGATCGTGTTTCCTCCCCTGTCTGTCGTAAATCCCACATGCAGAGGCTGCATTTGAGGTAAAATCATCTCAGTAAATAATTGCATAT
GTCCATGTAAGCTTCATATCGTACCTGATGGATGTCTCCCATGAAATGTGAAAGAAAAAGAAGTGCTTGAGTCAAGTTATCTGGCGATGGGAAGTTTTAGCGTATTAGCA
GATTTATTGGTAAATTGATGAAGAGAAGTCGATAAGATTTTTGTATTCTTGAAAATTACATTTAGACTGAGATGGTTGAGCTTGGTACTTCAGAAGCTGGGAAGTATAGT
TGTTAATGGCGCCTGCAACACACCGTCCTATCACTCCTGCTTGGTCTTTGCAGTCCCCTTGAAAGATTGAAATGACAAAAATTTAGAACAAGGAGAGAAGAAATCAGACT
AGAGAGGAATGGAAACAAAGGAAGTTACTCAACTGTCATACTGGTAAGAACAGAGGGAATCTGGAGTATCAATGAAATGAAGAGCAGAGGACCAGTGATAATAAAACTTG
ACTCTGTCAGCCCAACTGCAAACGCTAGCCAAATCGCCGCTAGCGGAGTCTGGCAACAGTTGCTGCACTGCATCTGCTGCTGCTCTGCTCAACCGTGACTGTTCCAACAA
AACAAAGACCAAAGAAGTTCCTTCATCAAAGATGCAAATTTGGAAAATGGGGTTTCAAAGACATATAAAGAGAAGAACCTGAGCGATTTTGCAGACGGCAGCATGGCCGT
CGACGCCCCAACCAGAGATAACAGGAAAAATGAACCCCAACAACAGCAAGGCCGCCACTAGAACTTTACAGGTCTCCATTTTCTTATTTTGTTTGTTTGTTTGTTGTTCT
GGTTCGTTTTTTGTGGTTGCTTCAGCAGCATTAATAGAGCAAAAGAGTGTGAGCCTGAACTGTTGAAGATTTGGAACTGAACAAATGTGGGATATGCTAATTGCTGGATT
CCAAATTTGTGGAAGAAATTATGGTAATGTATTAAATAATGGAACTGTGACTCTCTCCTCCCCTGGAAAGGAAGGGAGCCTGTGCTGTGCTGGTTTCACATGAATCTGTT
CTATAAGAATGTAATTCCAGTTGCTGTCTGTATAGTTATTTTATTATGTTAGAAGGAATAAAGTTTTTTATTATTAAAAAAAAAAGCAAATAGAAAGAGATTCAACTCGT
TTCCTA
Protein sequenceShow/hide protein sequence
MAEAVEKRVMVAIDESECSYYALIWVLENLQQSLAESPLFVFTALPPPTIYTFGAGAAASLGLARTYCHVTSNTELANSIQENDKKVRCALLEKAKDICAERGVAAISIT
EVGNPGTTICDAVEKLNINMLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVKKP