; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh18G010200 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh18G010200
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionAdenine nucleotide alpha hydrolases-like superfamily protein
Genome locationCmo_Chr18:11027572..11032458
RNA-Seq ExpressionCmoCh18G010200
SyntenyCmoCh18G010200
Gene Ontology termsNA
InterPro domainsIPR006015 - Universal stress protein A family
IPR006016 - UspA
IPR014729 - Rossmann-like alpha/beta/alpha sandwich fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6573952.1 Acyl-acyl carrier protein thioesterase ATL3, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]1.3e-8595.32Show/hide
Query:  MAEAKRSNPDVEKRVMVAVDESECSYYALIWVLENLKQSIADSPLFIFTALPPRTSYTSGAGASLGLARSCFPVASNTELAHTLQENDKKVRCGILEKAK
        MAEAK+SNPDVEKRVMVAVDESECSYYALIWVLENLKQSIADSPLF+FTALPP TSYTSGAGASLGLARS FPVASNTELAHTLQENDKKVRCGILEKAK
Subjt:  MAEAKRSNPDVEKRVMVAVDESECSYYALIWVLENLKQSIADSPLFIFTALPPRTSYTSGAGASLGLARSCFPVASNTELAHTLQENDKKVRCGILEKAK

Query:  DICAERGVAAISITEVGDPGRTICDTVEKLNINLLVLGDHGIGRIKRALIGSVSNYCVQNAKCPVLVVKKP
        DICAERGVAAISITEVG+PGR IC+TVEKLNINLLVLGDHG+GRIKRALIGSVSNYCVQNAKCPVLVVKKP
Subjt:  DICAERGVAAISITEVGDPGRTICDTVEKLNINLLVLGDHGIGRIKRALIGSVSNYCVQNAKCPVLVVKKP

KAG7013016.1 Universal stress protein A-like protein, partial [Cucurbita argyrosperma subsp. argyrosperma]1.1e-8494.74Show/hide
Query:  MAEAKRSNPDVEKRVMVAVDESECSYYALIWVLENLKQSIADSPLFIFTALPPRTSYTSGAGASLGLARSCFPVASNTELAHTLQENDKKVRCGILEKAK
        MAEAK+SNPDVEKRVMVAVDESECSYYALIWVLENLKQSIADSPLF+FTALPP TSYTSGAGASLGLARS FPVASNTELAHTLQENDKKVRCGILEKAK
Subjt:  MAEAKRSNPDVEKRVMVAVDESECSYYALIWVLENLKQSIADSPLFIFTALPPRTSYTSGAGASLGLARSCFPVASNTELAHTLQENDKKVRCGILEKAK

Query:  DICAERGVAAISITEVGDPGRTICDTVEKLNINLLVLGDHGIGRIKRALIGSVSNYCVQNAKCPVLVVKKP
        DICAERGVAAISITEVG+PGR IC+TVEKLNINLLVLGDHG+GRIKRALIGSVSNYCVQNAKC VLVVKKP
Subjt:  DICAERGVAAISITEVGDPGRTICDTVEKLNINLLVLGDHGIGRIKRALIGSVSNYCVQNAKCPVLVVKKP

XP_022945228.1 uncharacterized protein LOC111449532 [Cucurbita moschata]4.9e-90100Show/hide
Query:  MAEAKRSNPDVEKRVMVAVDESECSYYALIWVLENLKQSIADSPLFIFTALPPRTSYTSGAGASLGLARSCFPVASNTELAHTLQENDKKVRCGILEKAK
        MAEAKRSNPDVEKRVMVAVDESECSYYALIWVLENLKQSIADSPLFIFTALPPRTSYTSGAGASLGLARSCFPVASNTELAHTLQENDKKVRCGILEKAK
Subjt:  MAEAKRSNPDVEKRVMVAVDESECSYYALIWVLENLKQSIADSPLFIFTALPPRTSYTSGAGASLGLARSCFPVASNTELAHTLQENDKKVRCGILEKAK

Query:  DICAERGVAAISITEVGDPGRTICDTVEKLNINLLVLGDHGIGRIKRALIGSVSNYCVQNAKCPVLVVKKP
        DICAERGVAAISITEVGDPGRTICDTVEKLNINLLVLGDHGIGRIKRALIGSVSNYCVQNAKCPVLVVKKP
Subjt:  DICAERGVAAISITEVGDPGRTICDTVEKLNINLLVLGDHGIGRIKRALIGSVSNYCVQNAKCPVLVVKKP

XP_022968136.1 uncharacterized protein LOC111467462 [Cucurbita maxima]2.5e-8696.49Show/hide
Query:  MAEAKRSNPDVEKRVMVAVDESECSYYALIWVLENLKQSIADSPLFIFTALPPRTSYTSGAGASLGLARSCFPVASNTELAHTLQENDKKVRCGILEKAK
        MAEAK+SNPDVEKRVMVAVDESECSYYALIWVLENLKQSIADSPLF+FTALPP TSYTSGAGASLGLARS FPVASNTELAHTLQENDKKVRCGILEKAK
Subjt:  MAEAKRSNPDVEKRVMVAVDESECSYYALIWVLENLKQSIADSPLFIFTALPPRTSYTSGAGASLGLARSCFPVASNTELAHTLQENDKKVRCGILEKAK

Query:  DICAERGVAAISITEVGDPGRTICDTVEKLNINLLVLGDHGIGRIKRALIGSVSNYCVQNAKCPVLVVKKP
        DICAERGVAAISITEVG+PGR ICDTVEKLNINLLVLGDHGIGRIKRALIGSVSNYCVQNAKCPVLVVKKP
Subjt:  DICAERGVAAISITEVGDPGRTICDTVEKLNINLLVLGDHGIGRIKRALIGSVSNYCVQNAKCPVLVVKKP

XP_023541640.1 universal stress protein A-like protein [Cucurbita pepo subsp. pepo]1.9e-8696.49Show/hide
Query:  MAEAKRSNPDVEKRVMVAVDESECSYYALIWVLENLKQSIADSPLFIFTALPPRTSYTSGAGASLGLARSCFPVASNTELAHTLQENDKKVRCGILEKAK
        MAEAK+SNPDVEKRVMVAVDESECSYYALIWVLENLKQSI D+PLFIFTALPP TSYTSGAGASLGLARS FPVASNTEL+HTLQENDKKVRCGILEKAK
Subjt:  MAEAKRSNPDVEKRVMVAVDESECSYYALIWVLENLKQSIADSPLFIFTALPPRTSYTSGAGASLGLARSCFPVASNTELAHTLQENDKKVRCGILEKAK

Query:  DICAERGVAAISITEVGDPGRTICDTVEKLNINLLVLGDHGIGRIKRALIGSVSNYCVQNAKCPVLVVKKP
        DICAERGVAAISITEVGDPGRTICDTVEKLNINLLVLGDHGIGRIKRALIGSVSNYCVQNAKCPVLVVKKP
Subjt:  DICAERGVAAISITEVGDPGRTICDTVEKLNINLLVLGDHGIGRIKRALIGSVSNYCVQNAKCPVLVVKKP

TrEMBL top hitse value%identityAlignment
A0A1S3BEZ1 universal stress protein YxiE5.3e-6682.1Show/hide
Query:  VEKRVMVAVDESECSYYALIWVLENLKQSIADSPLFIFTAL-PPRTSYTSGAGASLGLARSCFPVASNTELAHTLQENDKKVRCGILEKAKDICAERGVA
        +EKRVMVA+DESE SYYALIWVLENLK+SIA SPLF+FTAL PP T+YTS      GLARS FP+ SNTE  HT+QENDKK+RCG+LEKAKDICA RGVA
Subjt:  VEKRVMVAVDESECSYYALIWVLENLKQSIADSPLFIFTAL-PPRTSYTSGAGASLGLARSCFPVASNTELAHTLQENDKKVRCGILEKAKDICAERGVA

Query:  AISITEVGDPGRTICDTVEKLNINLLVLGDHGIGRIKRALIGSVSNYCVQNAKCPVLVVKKP
        AISITE GDPG TICDTVEKLNI+LLVLGD G+GRIKRALIGSVSNYCVQNAKCPVLVVKKP
Subjt:  AISITEVGDPGRTICDTVEKLNINLLVLGDHGIGRIKRALIGSVSNYCVQNAKCPVLVVKKP

A0A5A7SXH5 Universal stress protein YxiE5.3e-6682.1Show/hide
Query:  VEKRVMVAVDESECSYYALIWVLENLKQSIADSPLFIFTAL-PPRTSYTSGAGASLGLARSCFPVASNTELAHTLQENDKKVRCGILEKAKDICAERGVA
        +EKRVMVA+DESE SYYALIWVLENLK+SIA SPLF+FTAL PP T+YTS      GLARS FP+ SNTE  HT+QENDKK+RCG+LEKAKDICA RGVA
Subjt:  VEKRVMVAVDESECSYYALIWVLENLKQSIADSPLFIFTAL-PPRTSYTSGAGASLGLARSCFPVASNTELAHTLQENDKKVRCGILEKAKDICAERGVA

Query:  AISITEVGDPGRTICDTVEKLNINLLVLGDHGIGRIKRALIGSVSNYCVQNAKCPVLVVKKP
        AISITE GDPG TICDTVEKLNI+LLVLGD G+GRIKRALIGSVSNYCVQNAKCPVLVVKKP
Subjt:  AISITEVGDPGRTICDTVEKLNINLLVLGDHGIGRIKRALIGSVSNYCVQNAKCPVLVVKKP

A0A6J1D9U3 uncharacterized protein LOC111018690 isoform X18.0e-7081.5Show/hide
Query:  MAEAKRSNPDVEKRVMVAVDESECSYYALIWVLENLKQSIADSPLFIFTALPPRTSYTSGAG--ASLGLARSCFPVASNTELAHTLQENDKKVRCGILEK
        MAEA      VEKRVMVA+DESECSYYALIWVLENL+QS+A+SPLF+FTALPP T YT GAG  ASLGLAR+   V SNTELA+++QENDKKVRC +LEK
Subjt:  MAEAKRSNPDVEKRVMVAVDESECSYYALIWVLENLKQSIADSPLFIFTALPPRTSYTSGAG--ASLGLARSCFPVASNTELAHTLQENDKKVRCGILEK

Query:  AKDICAERGVAAISITEVGDPGRTICDTVEKLNINLLVLGDHGIGRIKRALIGSVSNYCVQNAKCPVLVVKKP
        AKDICAERGVAAISITEVG+PG TICD VEKLNIN+LVLGD G+GRIKRALIGSVSNYCVQNAKCPVLVVKKP
Subjt:  AKDICAERGVAAISITEVGDPGRTICDTVEKLNINLLVLGDHGIGRIKRALIGSVSNYCVQNAKCPVLVVKKP

A0A6J1G084 uncharacterized protein LOC1114495322.4e-90100Show/hide
Query:  MAEAKRSNPDVEKRVMVAVDESECSYYALIWVLENLKQSIADSPLFIFTALPPRTSYTSGAGASLGLARSCFPVASNTELAHTLQENDKKVRCGILEKAK
        MAEAKRSNPDVEKRVMVAVDESECSYYALIWVLENLKQSIADSPLFIFTALPPRTSYTSGAGASLGLARSCFPVASNTELAHTLQENDKKVRCGILEKAK
Subjt:  MAEAKRSNPDVEKRVMVAVDESECSYYALIWVLENLKQSIADSPLFIFTALPPRTSYTSGAGASLGLARSCFPVASNTELAHTLQENDKKVRCGILEKAK

Query:  DICAERGVAAISITEVGDPGRTICDTVEKLNINLLVLGDHGIGRIKRALIGSVSNYCVQNAKCPVLVVKKP
        DICAERGVAAISITEVGDPGRTICDTVEKLNINLLVLGDHGIGRIKRALIGSVSNYCVQNAKCPVLVVKKP
Subjt:  DICAERGVAAISITEVGDPGRTICDTVEKLNINLLVLGDHGIGRIKRALIGSVSNYCVQNAKCPVLVVKKP

A0A6J1HU14 uncharacterized protein LOC1114674621.2e-8696.49Show/hide
Query:  MAEAKRSNPDVEKRVMVAVDESECSYYALIWVLENLKQSIADSPLFIFTALPPRTSYTSGAGASLGLARSCFPVASNTELAHTLQENDKKVRCGILEKAK
        MAEAK+SNPDVEKRVMVAVDESECSYYALIWVLENLKQSIADSPLF+FTALPP TSYTSGAGASLGLARS FPVASNTELAHTLQENDKKVRCGILEKAK
Subjt:  MAEAKRSNPDVEKRVMVAVDESECSYYALIWVLENLKQSIADSPLFIFTALPPRTSYTSGAGASLGLARSCFPVASNTELAHTLQENDKKVRCGILEKAK

Query:  DICAERGVAAISITEVGDPGRTICDTVEKLNINLLVLGDHGIGRIKRALIGSVSNYCVQNAKCPVLVVKKP
        DICAERGVAAISITEVG+PGR ICDTVEKLNINLLVLGDHGIGRIKRALIGSVSNYCVQNAKCPVLVVKKP
Subjt:  DICAERGVAAISITEVGDPGRTICDTVEKLNINLLVLGDHGIGRIKRALIGSVSNYCVQNAKCPVLVVKKP

SwissProt top hitse value%identityAlignment
P42297 Universal stress protein YxiE5.8e-0929.94Show/hide
Query:  RVMVAVDESECSYYALIWVLENLKQSIAD-SPLFIFTALPPRTSYTSGAGASLGLARSCFPVASNTELAHTLQENDKKVRCGILEKAKDICAERGVAAIS
        +++VA+D S+ S  AL   +   K+  A+ S L +       TS  +G             V         ++   KK    ILE AK+  AE+GV A +
Subjt:  RVMVAVDESECSYYALIWVLENLKQSIAD-SPLFIFTALPPRTSYTSGAGASLGLARSCFPVASNTELAHTLQENDKKVRCGILEKAKDICAERGVAAIS

Query:  ITEVGDPGRTICDTVEKLNINLLVLGDHGIGRIKRALIGSVSNYCVQNAKCPVLVVK
        I   G+P   I +  ++  ++L+V+G  GI  +K  ++GSVS+   Q + CPVL+V+
Subjt:  ITEVGDPGRTICDTVEKLNINLLVLGDHGIGRIKRALIGSVSNYCVQNAKCPVLVVK

P72817 Universal stress protein Sll16546.0e-0633.78Show/hide
Query:  ILEKAKDICAERGVAAISITEVGDPGRTICDTVEKLNINLLVLGDHGIGRIKRALIGSVSNYCVQNAKCPVLVV
        +LE A+ + +++G+A  +I   G    TICD  +++N +L+V+G  G+G     +  SV+   +  + CPVLVV
Subjt:  ILEKAKDICAERGVAAISITEVGDPGRTICDTVEKLNINLLVLGDHGIGRIKRALIGSVSNYCVQNAKCPVLVV

Q57951 Universal stress protein MJ05311.9e-0740.79Show/hide
Query:  LEKAKDICAERGVAAISITEVGDPGRTICDTVEKLNINLLVLGDHGIGRIKRALIGSVSNYCVQNAKCPVLVVKKP
        L+K K +  E GV   +    G P   I +  EK   +L+V+G  G   ++R L+GSV+   ++NA CPVLVVKKP
Subjt:  LEKAKDICAERGVAAISITEVGDPGRTICDTVEKLNINLLVLGDHGIGRIKRALIGSVSNYCVQNAKCPVLVVKKP

Q8L4N1 Universal stress protein PHOS349.2e-0726.67Show/hide
Query:  KRVMVAVDESECSYYALIWVLENLKQSIADSPLFIFTALPPRTSYTSGAGASLGLARSCFPVASNTELAH---TLQENDKKVRCGILEKAKDICAERGVA
        +++ VAVD SE S +A+ W +++    I      +   + P +         L L     P A+    A    + ++ D      + + AK +       
Subjt:  KRVMVAVDESECSYYALIWVLENLKQSIADSPLFIFTALPPRTSYTSGAGASLGLARSCFPVASNTELAH---TLQENDKKVRCGILEKAKDICAERGVA

Query:  AISITEVGDPGRTICDTVEKLNINLLVLGDHGIGRIKR---ALIGSVSNYCVQNAKCPVLVVKKP
         I I +  D    +C   E+LN++ +++G  G G  KR     +GSVS+YCV +  CPV+VV+ P
Subjt:  AISITEVGDPGRTICDTVEKLNINLLVLGDHGIGRIKR---ALIGSVSNYCVQNAKCPVLVVKKP

Q8LGG8 Universal stress protein A-like protein1.4e-1033.33Show/hide
Query:  LQENDKKVRCGILEKAKDICAERGVAAISITEVGDPGRTICDTVEKLNINLLVLGDHGIGRIKRALIGSVSNYCVQNAKCPVLVVKK
        +++++K     +LE   + C E GV   +  + GDP   IC  V+++  + LV+G  G+GR ++  +G+VS +CV++A+CPV+ +K+
Subjt:  LQENDKKVRCGILEKAKDICAERGVAAISITEVGDPGRTICDTVEKLNINLLVLGDHGIGRIKRALIGSVSNYCVQNAKCPVLVVKK

Arabidopsis top hitse value%identityAlignment
AT1G09740.1 Adenine nucleotide alpha hydrolases-like superfamily protein5.0e-2439.88Show/hide
Query:  VMVAVDESECSYYALIWVLENLKQSIADS-PLFIFTALPPRTSYTSGA-------GASLGLARSCFPVASNTELAHTLQENDKKVRCGILEKAKDICAER
        V+VAVD SE S  AL W L+NLK S + S   F+   + P  S  +G        G   GL    F  A        ++++ K++   ILE A  ICAE+
Subjt:  VMVAVDESECSYYALIWVLENLKQSIADS-PLFIFTALPPRTSYTSGA-------GASLGLARSCFPVASNTELAHTLQENDKKVRCGILEKAKDICAER

Query:  GVAAISITEVGDPGRTICDTVEKLNINLLVLGDHGIGRIKRALIGSVSNYCVQNAKCPVLVVK
         V   +   +GDP   IC+ VE L+ +LLV+G    GRIKR  +GSVSNYC  +A CPV+++K
Subjt:  GVAAISITEVGDPGRTICDTVEKLNINLLVLGDHGIGRIKRALIGSVSNYCVQNAKCPVLVVK

AT1G68300.1 Adenine nucleotide alpha hydrolases-like superfamily protein3.4e-3346.51Show/hide
Query:  MAEAKRSNPDVEKRVMVAVDESECSYYALIWVLENLKQSIADSPLFIFTALPP---RTSYTSGAGASLGLARSCFPVASNTELAHTLQENDKKVRCGILE
        MAE ++S   V K+VMVA+DESECS  AL W L  LK S+ADS + +FTA P       Y S  GA+        P+    EL ++LQE+ K      L+
Subjt:  MAEAKRSNPDVEKRVMVAVDESECSYYALIWVLENLKQSIADSPLFIFTALPP---RTSYTSGAGASLGLARSCFPVASNTELAHTLQENDKKVRCGILE

Query:  KAKDICAERGVAAISITEVGDPGRTICDTVEKLNINLLVLGDHGIGRIKRALIGSVSNYCVQNAKCPVLVVK
        +   ICAE GV    + E G+P   IC+  EKL +++LV+G HG G ++R  +GSVSNYCV NAKCPVLVV+
Subjt:  KAKDICAERGVAAISITEVGDPGRTICDTVEKLNINLLVLGDHGIGRIKRALIGSVSNYCVQNAKCPVLVVK

AT3G11930.2 Adenine nucleotide alpha hydrolases-like superfamily protein1.2e-2231.71Show/hide
Query:  KRVMVAVDESECSYYALIWVLEN-----LKQSIADSPLFIFTALPPRTSYTSGAGASLGLARSCFPVASNTELAHTLQENDKKVRCGILEKAKDICAERG
        KR++VA+DES+ S+YAL WV+++     L  + A++   + T +  ++ +   A    G   +    AS++ +  ++++  ++    +L +A  +C  + 
Subjt:  KRVMVAVDESECSYYALIWVLEN-----LKQSIADSPLFIFTALPPRTSYTSGAGASLGLARSCFPVASNTELAHTLQENDKKVRCGILEKAKDICAERG

Query:  VAAISITEVGDPGRTICDTVEKLNINLLVLGDHGIGRIKRALIGSVSNYCVQNAKCPVLVVKKP
        +   ++   G+    IC+ VEK++++LLV+G  G+G+IKRA +GSVS+YC  +A CP+L+VK P
Subjt:  VAAISITEVGDPGRTICDTVEKLNINLLVLGDHGIGRIKRALIGSVSNYCVQNAKCPVLVVKKP

AT3G11930.4 Adenine nucleotide alpha hydrolases-like superfamily protein1.1e-2331.1Show/hide
Query:  KRVMVAVDESECSYYALIWVLEN-----LKQSIADSPLFIFTALPPRTSYTSGAGASLGLARSCFPVASNTELAHTLQENDKKVRCGILEKAKDICAERG
        KR++VA+DES+ S+YAL WV+++     L  + A++   + T +  ++ +   A    G   +   V +++ +  ++++  ++    +L +A  +C  + 
Subjt:  KRVMVAVDESECSYYALIWVLEN-----LKQSIADSPLFIFTALPPRTSYTSGAGASLGLARSCFPVASNTELAHTLQENDKKVRCGILEKAKDICAERG

Query:  VAAISITEVGDPGRTICDTVEKLNINLLVLGDHGIGRIKRALIGSVSNYCVQNAKCPVLVVKKP
        +   ++   G+    IC+ VEK++++LLV+G  G+G+IKRA +GSVS+YC  +A CP+L+VK P
Subjt:  VAAISITEVGDPGRTICDTVEKLNINLLVLGDHGIGRIKRALIGSVSNYCVQNAKCPVLVVKKP

AT3G25930.1 Adenine nucleotide alpha hydrolases-like superfamily protein5.0e-2440.62Show/hide
Query:  KRVMVAVDESECSYYALIWVLENLKQSIADSPLFIFTALPPRTSYT--SGAGASLGLARSCFPVASNTELAHTLQENDKKVRCGILEKAKDICAERGVAA
        K VM+ +DES  SY  LIW LEN K +I  S ++IF A  P+ S+T  +   +S+G A+  +P + N+EL    QE + K+  GILEKAK IC   G+ A
Subjt:  KRVMVAVDESECSYYALIWVLENLKQSIADSPLFIFTALPPRTSYT--SGAGASLGLARSCFPVASNTELAHTLQENDKKVRCGILEKAKDICAERGVAA

Query:  ISITEVGDPGRTICDTVEKLNINLLVLGDHGIGRIKRALIGSVSNYCVQNAKCPVLVVKK
         + T+ GDP   I   +++ NINL+V  D     +K+         C QN  C +LVVKK
Subjt:  ISITEVGDPGRTICDTVEKLNINLLVLGDHGIGRIKRALIGSVSNYCVQNAKCPVLVVKK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGAAGCAAAGAGATCAAACCCTGATGTGGAGAAGAGGGTGATGGTGGCCGTAGACGAGAGTGAGTGTAGCTACTATGCCCTAATCTGGGTACTGGAAAATCTTAA
ACAATCCATTGCCGACTCGCCTCTTTTCATCTTCACGGCTCTACCTCCGCGCACCAGTTATACCTCCGGTGCCGGTGCCTCTCTCGGCCTCGCGCGATCGTGTTTCCCTG
TCGCATCCAATACCGAGTTGGCTCATACGCTTCAAGAGAATGATAAGAAAGTTAGATGCGGCATCCTAGAGAAAGCAAAAGATATCTGTGCTGAAAGAGGGGTGGCTGCG
ATATCCATCACAGAGGTTGGGGATCCTGGAAGGACGATATGTGATACGGTTGAAAAGCTCAATATAAATTTGCTTGTTTTGGGTGATCATGGCATTGGGAGAATTAAGAG
AGCTCTGATCGGGAGTGTGAGCAACTACTGTGTTCAAAATGCCAAGTGCCCTGTCCTTGTCGTGAAGAAACCATAG
mRNA sequenceShow/hide mRNA sequence
TCCTTGATCCTTGCTCTTTGCTCTTGGGTCAGAGTTGGGCACAATCCGAGGGAACCCACCATCAACTTTCTCAATTGAAGCTGAATTTTACGGTAGGACGCTCAATTCAC
TTTCTGGGGTTTCACAGTTCTTCCTCTTCCACCCTTTGCAGAAATGGCTGAAGCAAAGAGATCAAACCCTGATGTGGAGAAGAGGGTGATGGTGGCCGTAGACGAGAGTG
AGTGTAGCTACTATGCCCTAATCTGGGTACTGGAAAATCTTAAACAATCCATTGCCGACTCGCCTCTTTTCATCTTCACGGCTCTACCTCCGCGCACCAGTTATACCTCC
GGTGCCGGTGCCTCTCTCGGCCTCGCGCGATCGTGTTTCCCTGTCGCATCCAATACCGAGTTGGCTCATACGCTTCAAGAGAATGATAAGAAAGTTAGATGCGGCATCCT
AGAGAAAGCAAAAGATATCTGTGCTGAAAGAGGGGTGGCTGCGATATCCATCACAGAGGTTGGGGATCCTGGAAGGACGATATGTGATACGGTTGAAAAGCTCAATATAA
ATTTGCTTGTTTTGGGTGATCATGGCATTGGGAGAATTAAGAGAGCTCTGATCGGGAGTGTGAGCAACTACTGTGTTCAAAATGCCAAGTGCCCTGTCCTTGTCGTGAAG
AAACCATAGGAATATGAGGCCTTTCCTCAGAGTATAAACTTGAAGAAAATCTGTCAACCAAAGATGCTATTCAAAACAGCTGCTAATCTGACTCCGCCTTGTGCCAACCG
CAGGCCATCCACACTCGAGTCGTAGAACTTTTCTTCTGCTGTCTCGATTATGTTGCTATCCCATATCTATTTTGCTTTCTGGTGTACCAGTGAACTTCAATCGTGTTTCC
TCCCTTGTCTTCAGCAAACCCAACATGAAGAGGCTGCATTTTATGTCAGAAGCTGGGAAGTATAGTTGTTAATGGCCCCTGCAACGCACCGACCTTCCACTCCATTTTCG
TCTTTGCAGTCCCCTTTCGACCCAAACGACAAAAGTTCAGAACCAGAAAGGCGAGAGAAATTAGAACAACAGAGAGGGGATGGAAACAGAGTAACTCAACTGGCATATTG
GTAAGTACAAAGGGATTCCGGAGTATTAACATAATGAAGAGCAGATGACCATCGATATCGAAACTTGATTCGATCAGCCCAAATACAAACGTTCGCCAATTCACCATTTG
CATACTCAGGCAACAGTCGCTGCACTGCATCAGCTGCTGCTCTGCTCAACCGCGACTGTCGCAAGCAAAAACAAACATCAAATTCTTCATCAAAGAGGTAAAACATGGGG
GAATTGGTTTCAATGAACGTGAAGATCAAGAACCTGAGCGATCTTGCAGATGGTAAAATGGCCATCGAGGCCCCAACCAAAGCTAACAGGAAAGATGAACACCAAGAACA
CTAGAAGACCCGCCATGAAAGCTCTGCAGGGCTTCATTTTTAGCTCCTGTTCTTGTTCTTCTGGTGATTTGATTCATTTTTGTATTTGAATCTGTGCTTAAATAGAGTGA
TGAATGCAAGAAATCTCAGAAAGTATGAACAGTTTCGAAAGGCATTCAATTTCGTTGTGCTCATATGAGATATGCCAAGTGCTCTGTCACGCTGAAGTAGTTAAGGTAAA
GTTTTAAATTATGGAACCGTTTCCCTTTTGCTGT
Protein sequenceShow/hide protein sequence
MAEAKRSNPDVEKRVMVAVDESECSYYALIWVLENLKQSIADSPLFIFTALPPRTSYTSGAGASLGLARSCFPVASNTELAHTLQENDKKVRCGILEKAKDICAERGVAA
ISITEVGDPGRTICDTVEKLNINLLVLGDHGIGRIKRALIGSVSNYCVQNAKCPVLVVKKP