CuGenDBv2

HG10015531 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10015531
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Adenine nucleotide alpha hydrolases-like superfamily protein
Genome location: Chr02:27441427..27442698
RNA-Seq Expression: HG10015531
Synteny: HG10015531
Gene Ontology terms: NA
InterPro domains:
IPR006015 - Universal stress protein A family
IPR006016 - UspA
IPR014729 - Rossmann-like alpha/beta/alpha sandwich fold
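
The genome location string above can be split into its parts when the record is processed programmatically. The following is a minimal sketch, not part of the database itself: the function name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which the record does not state.

```python
import re


def parse_location(location):
    """Split a 'Chrom:start..end' string into (chrom, start, end, span).

    Assumption: coordinates are 1-based and inclusive, so the span
    is end - start + 1. The record itself does not state this.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end, end - start + 1


chrom, start, end, span = parse_location("Chr02:27441427..27442698")
print(chrom, start, end, span)
```

Under the inclusive-coordinate assumption, the span is the genomic extent of the gene model and can be compared with the length of the CDS listed under Sequences.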


Homology
GenBank top hits (e value, % identity, alignment)
KAG6573952.1 Acyl-acyl carrier protein thioesterase ATL3, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia] | e value: 3.6e-77 | % identity: 88.89
Query:  MAEAAKSNPAVEKRVMVAIDESECSYYALIWVLQNLKESIADSPLFLFTALPPPSSYTSGAGASLGLARSY--VASNRELVYTLQENDKKVRCGLLEKAK
        MAEA KSNP VEKRVMVA+DESECSYYALIWVL+NLK+SIADSPLF+FTALPPP+SYTSGAGASLGLARSY  VASN EL +TLQENDKKVRCG+LEKAK
Subjt:  MAEAAKSNPAVEKRVMVAIDESECSYYALIWVLQNLKESIADSPLFLFTALPPPSSYTSGAGASLGLARSY--VASNRELVYTLQENDKKVRCGLLEKAK

Query:  DICAERGVAAISITEDGDPGTTICDTVEKLNINLLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVKKP
        DICAERGVAAISITE G+PG  IC+TVEKLNINLLVLGD GLGRIKRALIGSVSNYCVQNAKCPVLVVKKP
Subjt:  DICAERGVAAISITEDGDPGTTICDTVEKLNINLLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVKKP
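
The identity value listed for KAG6573952.1 (88.89) can be cross-checked against the alignment's middle line. The sketch below assumes the usual BLAST-style convention, in which a letter on the middle line marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The function name `percent_identity` is hypothetical, and the two strings are the middle lines of the alignment above.

```python
def percent_identity(midlines):
    """Percent identity from BLAST-style middle-line segments.

    Assumption: letters are identities; '+' and spaces are not.
    Every character counts as one alignment column.
    """
    columns = sum(len(segment) for segment in midlines)
    identical = sum(ch.isalpha() for segment in midlines for ch in segment)
    return 100.0 * identical / columns


# Middle lines of the KAG6573952.1 alignment, one string per block.
midline_segments = [
    "MAEA KSNP VEKRVMVA+DESECSYYALIWVL+NLK+SIADSPLF+FTALPPP+SYTSGAGASLGLARSY  VASN EL +TLQENDKKVRCG+LEKAK",
    "DICAERGVAAISITE G+PG  IC+TVEKLNINLLVLGD GLGRIKRALIGSVSNYCVQNAKCPVLVVKKP",
]

print(round(percent_identity(midline_segments), 2))
```

For these two segments the count is 152 identical columns out of 171, which agrees with the listed value after rounding. Whitespace in the middle line is significant, so the strings must be copied without collapsing runs of spaces.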

KAG7013016.1 Universal stress protein A-like protein, partial [Cucurbita argyrosperma subsp. argyrosperma] | e value: 3.1e-76 | % identity: 88.3
Query:  MAEAAKSNPAVEKRVMVAIDESECSYYALIWVLQNLKESIADSPLFLFTALPPPSSYTSGAGASLGLARSY--VASNRELVYTLQENDKKVRCGLLEKAK
        MAEA KSNP VEKRVMVA+DESECSYYALIWVL+NLK+SIADSPLF+FTALPPP+SYTSGAGASLGLARSY  VASN EL +TLQENDKKVRCG+LEKAK
Subjt:  MAEAAKSNPAVEKRVMVAIDESECSYYALIWVLQNLKESIADSPLFLFTALPPPSSYTSGAGASLGLARSY--VASNRELVYTLQENDKKVRCGLLEKAK

Query:  DICAERGVAAISITEDGDPGTTICDTVEKLNINLLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVKKP
        DICAERGVAAISITE G+PG  IC+TVEKLNINLLVLGD GLGRIKRALIGSVSNYCVQNAKC VLVVKKP
Subjt:  DICAERGVAAISITEDGDPGTTICDTVEKLNINLLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVKKP

XP_022945228.1 uncharacterized protein LOC111449532 [Cucurbita moschata] | e value: 3.1e-76 | % identity: 88.3
Query:  MAEAAKSNPAVEKRVMVAIDESECSYYALIWVLQNLKESIADSPLFLFTALPPPSSYTSGAGASLGLARSY--VASNRELVYTLQENDKKVRCGLLEKAK
        MAEA +SNP VEKRVMVA+DESECSYYALIWVL+NLK+SIADSPLF+FTALPP +SYTSGAGASLGLARS   VASN EL +TLQENDKKVRCG+LEKAK
Subjt:  MAEAAKSNPAVEKRVMVAIDESECSYYALIWVLQNLKESIADSPLFLFTALPPPSSYTSGAGASLGLARSY--VASNRELVYTLQENDKKVRCGLLEKAK

Query:  DICAERGVAAISITEDGDPGTTICDTVEKLNINLLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVKKP
        DICAERGVAAISITE GDPG TICDTVEKLNINLLVLGD G+GRIKRALIGSVSNYCVQNAKCPVLVVKKP
Subjt:  DICAERGVAAISITEDGDPGTTICDTVEKLNINLLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVKKP

XP_022968136.1 uncharacterized protein LOC111467462 [Cucurbita maxima] | e value: 2.1e-77 | % identity: 88.89
Query:  MAEAAKSNPAVEKRVMVAIDESECSYYALIWVLQNLKESIADSPLFLFTALPPPSSYTSGAGASLGLARSY--VASNRELVYTLQENDKKVRCGLLEKAK
        MAEA KSNP VEKRVMVA+DESECSYYALIWVL+NLK+SIADSPLF+FTALPPP+SYTSGAGASLGLARSY  VASN EL +TLQENDKKVRCG+LEKAK
Subjt:  MAEAAKSNPAVEKRVMVAIDESECSYYALIWVLQNLKESIADSPLFLFTALPPPSSYTSGAGASLGLARSY--VASNRELVYTLQENDKKVRCGLLEKAK

Query:  DICAERGVAAISITEDGDPGTTICDTVEKLNINLLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVKKP
        DICAERGVAAISITE G+PG  ICDTVEKLNINLLVLGD G+GRIKRALIGSVSNYCVQNAKCPVLVVKKP
Subjt:  DICAERGVAAISITEDGDPGTTICDTVEKLNINLLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVKKP

XP_023541640.1 universal stress protein A-like protein [Cucurbita pepo subsp. pepo] | e value: 1.2e-77 | % identity: 88.89
Query:  MAEAAKSNPAVEKRVMVAIDESECSYYALIWVLQNLKESIADSPLFLFTALPPPSSYTSGAGASLGLARSY--VASNRELVYTLQENDKKVRCGLLEKAK
        MAEA KSNP VEKRVMVA+DESECSYYALIWVL+NLK+SI D+PLF+FTALPPP+SYTSGAGASLGLARSY  VASN EL +TLQENDKKVRCG+LEKAK
Subjt:  MAEAAKSNPAVEKRVMVAIDESECSYYALIWVLQNLKESIADSPLFLFTALPPPSSYTSGAGASLGLARSY--VASNRELVYTLQENDKKVRCGLLEKAK

Query:  DICAERGVAAISITEDGDPGTTICDTVEKLNINLLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVKKP
        DICAERGVAAISITE GDPG TICDTVEKLNINLLVLGD G+GRIKRALIGSVSNYCVQNAKCPVLVVKKP
Subjt:  DICAERGVAAISITEDGDPGTTICDTVEKLNINLLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVKKP

TrEMBL top hits (e value, % identity, alignment)
A0A1S3BEZ1 universal stress protein YxiE | e value: 8.1e-67 | % identity: 85.8
Query:  VEKRVMVAIDESECSYYALIWVLQNLKESIADSPLFLFTALPPPS-SYTSGAGASLGLARSY--VASNRELVYTLQENDKKVRCGLLEKAKDICAERGVA
        +EKRVMVAIDESE SYYALIWVL+NLKESIA SPLFLFTALPPPS +YTS      GLARSY  + SN E V+T+QENDKK+RCGLLEKAKDICA RGVA
Subjt:  VEKRVMVAIDESECSYYALIWVLQNLKESIADSPLFLFTALPPPS-SYTSGAGASLGLARSY--VASNRELVYTLQENDKKVRCGLLEKAKDICAERGVA

Query:  AISITEDGDPGTTICDTVEKLNINLLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVKKP
        AISITEDGDPGTTICDTVEKLNI+LLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVKKP
Subjt:  AISITEDGDPGTTICDTVEKLNINLLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVKKP

A0A5A7SXH5 Universal stress protein YxiE | e value: 8.1e-67 | % identity: 85.8
Query:  VEKRVMVAIDESECSYYALIWVLQNLKESIADSPLFLFTALPPPS-SYTSGAGASLGLARSY--VASNRELVYTLQENDKKVRCGLLEKAKDICAERGVA
        +EKRVMVAIDESE SYYALIWVL+NLKESIA SPLFLFTALPPPS +YTS      GLARSY  + SN E V+T+QENDKK+RCGLLEKAKDICA RGVA
Subjt:  VEKRVMVAIDESECSYYALIWVLQNLKESIADSPLFLFTALPPPS-SYTSGAGASLGLARSY--VASNRELVYTLQENDKKVRCGLLEKAKDICAERGVA

Query:  AISITEDGDPGTTICDTVEKLNINLLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVKKP
        AISITEDGDPGTTICDTVEKLNI+LLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVKKP
Subjt:  AISITEDGDPGTTICDTVEKLNINLLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVKKP

A0A6J1D9U3 uncharacterized protein LOC111018690 isoform X1 | e value: 5.1e-69 | % identity: 84.76
Query:  AVEKRVMVAIDESECSYYALIWVLQNLKESIADSPLFLFTALPPPSSYTSGAG--ASLGLARSY--VASNRELVYTLQENDKKVRCGLLEKAKDICAERG
        AVEKRVMVAIDESECSYYALIWVL+NL++S+A+SPLF+FTALPPP+ YT GAG  ASLGLAR+Y  V SN EL  ++QENDKKVRC LLEKAKDICAERG
Subjt:  AVEKRVMVAIDESECSYYALIWVLQNLKESIADSPLFLFTALPPPSSYTSGAG--ASLGLARSY--VASNRELVYTLQENDKKVRCGLLEKAKDICAERG

Query:  VAAISITEDGDPGTTICDTVEKLNINLLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVKKP
        VAAISITE G+PGTTICD VEKLNIN+LVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVKKP
Subjt:  VAAISITEDGDPGTTICDTVEKLNINLLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVKKP

A0A6J1G084 uncharacterized protein LOC111449532 | e value: 1.5e-76 | % identity: 88.3
Query:  MAEAAKSNPAVEKRVMVAIDESECSYYALIWVLQNLKESIADSPLFLFTALPPPSSYTSGAGASLGLARSY--VASNRELVYTLQENDKKVRCGLLEKAK
        MAEA +SNP VEKRVMVA+DESECSYYALIWVL+NLK+SIADSPLF+FTALPP +SYTSGAGASLGLARS   VASN EL +TLQENDKKVRCG+LEKAK
Subjt:  MAEAAKSNPAVEKRVMVAIDESECSYYALIWVLQNLKESIADSPLFLFTALPPPSSYTSGAGASLGLARSY--VASNRELVYTLQENDKKVRCGLLEKAK

Query:  DICAERGVAAISITEDGDPGTTICDTVEKLNINLLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVKKP
        DICAERGVAAISITE GDPG TICDTVEKLNINLLVLGD G+GRIKRALIGSVSNYCVQNAKCPVLVVKKP
Subjt:  DICAERGVAAISITEDGDPGTTICDTVEKLNINLLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVKKP

A0A6J1HU14 uncharacterized protein LOC111467462 | e value: 1.0e-77 | % identity: 88.89
Query:  MAEAAKSNPAVEKRVMVAIDESECSYYALIWVLQNLKESIADSPLFLFTALPPPSSYTSGAGASLGLARSY--VASNRELVYTLQENDKKVRCGLLEKAK
        MAEA KSNP VEKRVMVA+DESECSYYALIWVL+NLK+SIADSPLF+FTALPPP+SYTSGAGASLGLARSY  VASN EL +TLQENDKKVRCG+LEKAK
Subjt:  MAEAAKSNPAVEKRVMVAIDESECSYYALIWVLQNLKESIADSPLFLFTALPPPSSYTSGAGASLGLARSY--VASNRELVYTLQENDKKVRCGLLEKAK

Query:  DICAERGVAAISITEDGDPGTTICDTVEKLNINLLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVKKP
        DICAERGVAAISITE G+PG  ICDTVEKLNINLLVLGD G+GRIKRALIGSVSNYCVQNAKCPVLVVKKP
Subjt:  DICAERGVAAISITEDGDPGTTICDTVEKLNINLLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVKKP

SwissProt top hits (e value, % identity, alignment)
P42297 Universal stress protein YxiE | e value: 1.2e-09 | % identity: 30.13
Query:  RVMVAIDESECSYYALIWVLQNLKESIADSPLFLF--TALPPPSSYTSGAGASLGLARSYVASNRELVYTLQENDKKVRCGLLEKAKDICAERGVAAISI
        +++VAID S+ S  AL   +   KE  A+  +      A+   SS T             V      +  ++   KK    +LE AK+  AE+GV A +I
Subjt:  RVMVAIDESECSYYALIWVLQNLKESIADSPLFLF--TALPPPSSYTSGAGASLGLARSYVASNRELVYTLQENDKKVRCGLLEKAKDICAERGVAAISI

Query:  TEDGDPGTTICDTVEKLNINLLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVK
          +G+P   I +  ++  ++L+V+G RG+  +K  ++GSVS+   Q + CPVL+V+
Subjt:  TEDGDPGTTICDTVEKLNINLLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVK

P72817 Universal stress protein Sll1654 | e value: 1.8e-07 | % identity: 37.84
Query:  LLEKAKDICAERGVAAISITEDGDPGTTICDTVEKLNINLLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVV
        LLE A+ + +++G+A  +I  +G    TICD  +++N +L+V+G RGLG     +  SV+   +  + CPVLVV
Subjt:  LLEKAKDICAERGVAAISITEDGDPGTTICDTVEKLNINLLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVV

Q57951 Universal stress protein MJ0531 | e value: 4.8e-08 | % identity: 40.79
Query:  LEKAKDICAERGVAAISITEDGDPGTTICDTVEKLNINLLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVKKP
        L+K K +  E GV   +   +G P   I +  EK   +L+V+G  G   ++R L+GSV+   ++NA CPVLVVKKP
Subjt:  LEKAKDICAERGVAAISITEDGDPGTTICDTVEKLNINLLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVKKP

Q8L4N1 Universal stress protein PHOS34 | e value: 5.3e-07 | % identity: 27.43
Query:  KRVMVAIDESECSYYALIWVLQN---------LKESIADSPLF------LFTALPPPSSYTSGAGASLGLARSYVASNRELVYTLQENDKKVRCGLLEKA
        +++ VA+D SE S +A+ W + +         +      S LF      L    PPP S  +  GA    ++             ++ D      + + A
Subjt:  KRVMVAIDESECSYYALIWVLQN---------LKESIADSPLF------LFTALPPPSSYTSGAGASLGLARSYVASNRELVYTLQENDKKVRCGLLEKA

Query:  KDICAERGVAAISITEDGDPGTTICDTVEKLNINLLVLGDRGLGRIKR---ALIGSVSNYCVQNAKCPVLVVKKP
        K +        I I +D D    +C   E+LN++ +++G RG G  KR     +GSVS+YCV +  CPV+VV+ P
Subjt:  KDICAERGVAAISITEDGDPGTTICDTVEKLNINLLVLGDRGLGRIKR---ALIGSVSNYCVQNAKCPVLVVKKP

Q8LGG8 Universal stress protein A-like protein | e value: 2.1e-11 | % identity: 36.78
Query:  LQENDKKVRCGLLEKAKDICAERGVAAISITEDGDPGTTICDTVEKLNINLLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVKK
        +++++K     LLE   + C E GV   +  + GDP   IC  V+++  + LV+G RGLGR ++  +G+VS +CV++A+CPV+ +K+
Subjt:  LQENDKKVRCGLLEKAKDICAERGVAAISITEDGDPGTTICDTVEKLNINLLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVKK

Arabidopsis top hits (e value, % identity, alignment)
AT1G09740.1 Adenine nucleotide alpha hydrolases-like superfamily protein | e value: 2.4e-23 | % identity: 41.14
Query:  VMVAIDESECSYYALIWVLQNLK--ESIADSPLFLFTALPPPSSYTSGAGASLGLARSYVASNREL---VYTLQENDKKVRCGLLEKAKDICAERGVAAI
        V+VA+D SE S  AL W L NLK   S +DS   +    P PS     AG S G       S  E+      ++++ K++   +LE A  ICAE+ V   
Subjt:  VMVAIDESECSYYALIWVLQNLK--ESIADSPLFLFTALPPPSSYTSGAGASLGLARSYVASNREL---VYTLQENDKKVRCGLLEKAKDICAERGVAAI

Query:  SITEDGDPGTTICDTVEKLNINLLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVK
        +    GDP   IC+ VE L+ +LLV+G R  GRIKR  +GSVSNYC  +A CPV+++K
Subjt:  SITEDGDPGTTICDTVEKLNINLLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVK

AT1G68300.1 Adenine nucleotide alpha hydrolases-like superfamily protein | e value: 9.0e-34 | % identity: 49.1
Query:  MAEAAKSNPAVEKRVMVAIDESECSYYALIWVLQNLKESIADSPLFLFTALPPPSSYTSGAGASLGLARSYVASNRELVYTLQENDKKVRCGLLEKAKDI
        MAE  KS   V K+VMVAIDESECS  AL W L  LK+S+ADS + LFTA P           S   A SY A+  EL+ +LQE+ K      L++   I
Subjt:  MAEAAKSNPAVEKRVMVAIDESECSYYALIWVLQNLKESIADSPLFLFTALPPPSSYTSGAGASLGLARSYVASNRELVYTLQENDKKVRCGLLEKAKDI

Query:  CAERGVAAISITEDGDPGTTICDTVEKLNINLLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVK
        CAE GV    + E G+P   IC+  EKL +++LV+G  G G ++R  +GSVSNYCV NAKCPVLVV+
Subjt:  CAERGVAAISITEDGDPGTTICDTVEKLNINLLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVK

AT3G11930.1 Adenine nucleotide alpha hydrolases-like superfamily protein | e value: 3.4e-25 | % identity: 33.53
Query:  AKSNPAVEKRVMVAIDESECSYYALIWVLQN-----LKESIADSPLFLFTALPPPSSYTSGAGASLGLARSYVASNRELVYTLQENDKKVRCGLLEKAKD
        A++     KR++VAIDES+ S+YAL WV+ +     L  + A++   + T +   S +   A    G   + V ++  ++ ++++  ++    LL +A  
Subjt:  AKSNPAVEKRVMVAIDESECSYYALIWVLQN-----LKESIADSPLFLFTALPPPSSYTSGAGASLGLARSYVASNRELVYTLQENDKKVRCGLLEKAKD

Query:  ICAERGVAAISITEDGDPGTTICDTVEKLNINLLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVKKP
        +C  + +   ++  +G+    IC+ VEK++++LLV+G RGLG+IKRA +GSVS+YC  +A CP+L+VK P
Subjt:  ICAERGVAAISITEDGDPGTTICDTVEKLNINLLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVKKP

AT3G11930.2 Adenine nucleotide alpha hydrolases-like superfamily protein | e value: 1.3e-24 | % identity: 33.72
Query:  AKSNPAVEKRVMVAIDESECSYYALIWVLQNLKESI-------ADSPLFLFTALPPPSSYTSGAGASLGLARSYVASNRELVYTLQENDKKVRCGLLEKA
        A++     KR++VAIDES+ S+YAL WV+ +    +       A+S +     +  P ++ +   A  G A +  AS+  ++ ++++  ++    LL +A
Subjt:  AKSNPAVEKRVMVAIDESECSYYALIWVLQNLKESI-------ADSPLFLFTALPPPSSYTSGAGASLGLARSYVASNRELVYTLQENDKKVRCGLLEKA

Query:  KDICAERGVAAISITEDGDPGTTICDTVEKLNINLLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVKKP
          +C  + +   ++  +G+    IC+ VEK++++LLV+G RGLG+IKRA +GSVS+YC  +A CP+L+VK P
Subjt:  KDICAERGVAAISITEDGDPGTTICDTVEKLNINLLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVKKP

AT3G11930.4 Adenine nucleotide alpha hydrolases-like superfamily protein | e value: 2.6e-25 | % identity: 33.14
Query:  AKSNPAVEKRVMVAIDESECSYYALIWVLQNLKESI-------ADSPLFLFTALPPPSSYTSGAGASLGLARSYVASNRELVYTLQENDKKVRCGLLEKA
        A++     KR++VAIDES+ S+YAL WV+ +    +       A+S +     +  P ++ +   A  G A + V ++  ++ ++++  ++    LL +A
Subjt:  AKSNPAVEKRVMVAIDESECSYYALIWVLQNLKESI-------ADSPLFLFTALPPPSSYTSGAGASLGLARSYVASNRELVYTLQENDKKVRCGLLEKA

Query:  KDICAERGVAAISITEDGDPGTTICDTVEKLNINLLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVKKP
          +C  + +   ++  +G+    IC+ VEK++++LLV+G RGLG+IKRA +GSVS+YC  +A CP+L+VK P
Subjt:  KDICAERGVAAISITEDGDPGTTICDTVEKLNINLLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVKKP


Sequences
CDS sequence
ATGGCTGAAGCAGCAAAATCAAACCCAGCAGTGGAGAAGAGGGTGATGGTGGCCATAGATGAGAGTGAGTGTAGCTACTATGCCCTAATCTGGGTGCTCCAAAATCTTAA
AGAATCCATAGCCGATTCCCCACTTTTCCTCTTCACGGCTCTACCTCCGCCCTCCAGTTACACCTCCGGTGCTGGCGCATCTCTTGGCCTCGCACGCTCCTATGTTGCAT
CCAATAGGGAGTTGGTTTATACTCTTCAAGAGAATGATAAGAAAGTTAGATGCGGTCTCCTTGAGAAAGCAAAGGATATATGTGCTGAAAGAGGGGTGGCTGCTATATCC
ATCACAGAAGATGGAGATCCTGGAACAACCATATGTGATACGGTTGAAAAGCTCAATATAAATTTGCTTGTTTTAGGTGATCGTGGCCTTGGGAGAATTAAGAGAGCTCT
TATAGGGAGTGTGAGCAACTATTGCGTTCAAAATGCCAAATGCCCTGTCCTTGTTGTGAAGAAACCGTAG
mRNA sequence
ATGGCTGAAGCAGCAAAATCAAACCCAGCAGTGGAGAAGAGGGTGATGGTGGCCATAGATGAGAGTGAGTGTAGCTACTATGCCCTAATCTGGGTGCTCCAAAATCTTAA
AGAATCCATAGCCGATTCCCCACTTTTCCTCTTCACGGCTCTACCTCCGCCCTCCAGTTACACCTCCGGTGCTGGCGCATCTCTTGGCCTCGCACGCTCCTATGTTGCAT
CCAATAGGGAGTTGGTTTATACTCTTCAAGAGAATGATAAGAAAGTTAGATGCGGTCTCCTTGAGAAAGCAAAGGATATATGTGCTGAAAGAGGGGTGGCTGCTATATCC
ATCACAGAAGATGGAGATCCTGGAACAACCATATGTGATACGGTTGAAAAGCTCAATATAAATTTGCTTGTTTTAGGTGATCGTGGCCTTGGGAGAATTAAGAGAGCTCT
TATAGGGAGTGTGAGCAACTATTGCGTTCAAAATGCCAAATGCCCTGTCCTTGTTGTGAAGAAACCGTAG
Protein sequence
MAEAAKSNPAVEKRVMVAIDESECSYYALIWVLQNLKESIADSPLFLFTALPPPSSYTSGAGASLGLARSYVASNRELVYTLQENDKKVRCGLLEKAKDICAERGVAAIS
ITEDGDPGTTICDTVEKLNINLLVLGDRGLGRIKRALIGSVSNYCVQNAKCPVLVVKKP
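
The relationship between the CDS and protein sequences above can be checked by translating the CDS. The sketch below is illustrative only and assumes the standard genetic code (NCBI translation table 1), which the record does not state explicitly. The function name `translate` is hypothetical, and only the first 42 and last 12 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[i:i + 3]]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


cds_start = "ATGGCTGAAGCAGCAAAATCAAACCCAGCAGTGGAGAAGAGG"  # first 42 nt of the CDS
cds_end = "AAGAAACCGTAG"  # last 12 nt of the CDS, in frame

print(translate(cds_start))
print(translate(cds_end))
```

Passing the full CDS to `translate` in the same way allows the result to be compared with the listed protein sequence from start to stop codon.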