CuGenDBv2

CmoCh05G002500 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh05G002500
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: HAD-superfamily hydrolase isoform 1
Genome location: Cmo_Chr05:1071079..1076563
RNA-Seq Expression: CmoCh05G002500
Synteny: CmoCh05G002500
Gene Ontology terms: GO:0016020 - membrane (cellular component)
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
KAG6598444.1 hypothetical protein SDJN03_08222, partial [Cucurbita argyrosperma subsp. sororia]; e value: 3.1e-119; % identity: 99.13
Query:  MTTIGFRLISVPSAVSPLSLLSHEPRDDQSQALRHNLPVTTSRRGLTMLSFLSAMPSLFLPAPATAFDIGISGPKDWLREQKKKASRFLLAPIEASRDSL
        MTTIGFRLISVPSAVSPLSLLSHEPRDDQSQALRHNLPVTTSRRGLTMLSFLSAMPSLFLPAPATAFDIGISGPKDWLREQKKKA RFLLAPIEASRDSL
Subjt:  MTTIGFRLISVPSAVSPLSLLSHEPRDDQSQALRHNLPVTTSRRGLTMLSFLSAMPSLFLPAPATAFDIGISGPKDWLREQKKKASRFLLAPIEASRDSL

Query:  HAVYLMLSNGSDYSNKDMEDVQRMLKSAARDCVLKDRNSFVQFQASTGVEVCTFQLIVKNAASLLGNEDPIKLEAESRLKDLVSSFTSLNSLAYETDIQV
        HAVYLMLSN SDYSNKDMEDVQRMLKSAARDCVLKDRNSFVQFQASTGVEVCTFQLIVKNAASLLGNEDPIKLEAESRLKDLVSSFTSLNSLAYETDIQV
Subjt:  HAVYLMLSNGSDYSNKDMEDVQRMLKSAARDCVLKDRNSFVQFQASTGVEVCTFQLIVKNAASLLGNEDPIKLEAESRLKDLVSSFTSLNSLAYETDIQV

Query:  NSNRQKVLDAVNDTIDSLDKFEKGIKDCLE
        NSNRQKVLDAVNDTIDSLDKFEKGIKDCLE
Subjt:  NSNRQKVLDAVNDTIDSLDKFEKGIKDCLE

KAG7029386.1 hypothetical protein SDJN02_07725 [Cucurbita argyrosperma subsp. argyrosperma]; e value: 3.1e-119; % identity: 99.13
Query:  MTTIGFRLISVPSAVSPLSLLSHEPRDDQSQALRHNLPVTTSRRGLTMLSFLSAMPSLFLPAPATAFDIGISGPKDWLREQKKKASRFLLAPIEASRDSL
        M TIGFRLISVPSAVSPLSLLSHEPRDDQSQALRHNLPVTTSRRGLTMLSFLSAMPSLFLPAPATAFDIGISGPKDWLREQKKKASRFLLAPIEASRDSL
Subjt:  MTTIGFRLISVPSAVSPLSLLSHEPRDDQSQALRHNLPVTTSRRGLTMLSFLSAMPSLFLPAPATAFDIGISGPKDWLREQKKKASRFLLAPIEASRDSL

Query:  HAVYLMLSNGSDYSNKDMEDVQRMLKSAARDCVLKDRNSFVQFQASTGVEVCTFQLIVKNAASLLGNEDPIKLEAESRLKDLVSSFTSLNSLAYETDIQV
        HAVYLMLSN SDYSNKDMEDVQRMLKSAARDCVLKDRNSFVQFQASTGVEVCTFQLIVKNAASLLGNEDPIKLEAESRLKDLVSSFTSLNSLAYETDIQV
Subjt:  HAVYLMLSNGSDYSNKDMEDVQRMLKSAARDCVLKDRNSFVQFQASTGVEVCTFQLIVKNAASLLGNEDPIKLEAESRLKDLVSSFTSLNSLAYETDIQV

Query:  NSNRQKVLDAVNDTIDSLDKFEKGIKDCLE
        NSNRQKVLDAVNDTIDSLDKFEKGIKDCLE
Subjt:  NSNRQKVLDAVNDTIDSLDKFEKGIKDCLE

XP_022961866.1 uncharacterized protein LOC111462505 isoform X1 [Cucurbita moschata]; e value: 2.5e-121; % identity: 100
Query:  MTTIGFRLISVPSAVSPLSLLSHEPRDDQSQALRHNLPVTTSRRGLTMLSFLSAMPSLFLPAPATAFDIGISGPKDWLREQKKKASRFLLAPIEASRDSL
        MTTIGFRLISVPSAVSPLSLLSHEPRDDQSQALRHNLPVTTSRRGLTMLSFLSAMPSLFLPAPATAFDIGISGPKDWLREQKKKASRFLLAPIEASRDSL
Subjt:  MTTIGFRLISVPSAVSPLSLLSHEPRDDQSQALRHNLPVTTSRRGLTMLSFLSAMPSLFLPAPATAFDIGISGPKDWLREQKKKASRFLLAPIEASRDSL

Query:  HAVYLMLSNGSDYSNKDMEDVQRMLKSAARDCVLKDRNSFVQFQASTGVEVCTFQLIVKNAASLLGNEDPIKLEAESRLKDLVSSFTSLNSLAYETDIQV
        HAVYLMLSNGSDYSNKDMEDVQRMLKSAARDCVLKDRNSFVQFQASTGVEVCTFQLIVKNAASLLGNEDPIKLEAESRLKDLVSSFTSLNSLAYETDIQV
Subjt:  HAVYLMLSNGSDYSNKDMEDVQRMLKSAARDCVLKDRNSFVQFQASTGVEVCTFQLIVKNAASLLGNEDPIKLEAESRLKDLVSSFTSLNSLAYETDIQV

Query:  NSNRQKVLDAVNDTIDSLDKFEKGIKDCLET
        NSNRQKVLDAVNDTIDSLDKFEKGIKDCLET
Subjt:  NSNRQKVLDAVNDTIDSLDKFEKGIKDCLET

XP_022997500.1 uncharacterized protein LOC111492399 isoform X1 [Cucurbita maxima]; e value: 4.0e-119; % identity: 98.7
Query:  MTTIGFRLISVPSAVSPLSLLSHEPRDDQSQALRHNLPVTTSRRGLTMLSFLSAMPSLFLPAPATAFDIGISGPKDWLREQKKKASRFLLAPIEASRDSL
        MTTIGFRLISVPSAVSPLSLLSH+PRDDQSQALRHNLP+TTSRRGLTMLSFLSAMPSLFLPAPATAFDIGISGPKDWLREQKKKASRFLLAPIEASRDSL
Subjt:  MTTIGFRLISVPSAVSPLSLLSHEPRDDQSQALRHNLPVTTSRRGLTMLSFLSAMPSLFLPAPATAFDIGISGPKDWLREQKKKASRFLLAPIEASRDSL

Query:  HAVYLMLSNGSDYSNKDMEDVQRMLKSAARDCVLKDRNSFVQFQASTGVEVCTFQLIVKNAASLLGNEDPIKLEAESRLKDLVSSFTSLNSLAYETDIQV
        HAVYLMLSN SDYSNKDMEDVQRMLKSAARDCVLKDRNSFVQFQASTGVEVCTFQLIVKNAASLLGNEDPIKLEAESRLKDLVSSFTSLNSLAYETDIQV
Subjt:  HAVYLMLSNGSDYSNKDMEDVQRMLKSAARDCVLKDRNSFVQFQASTGVEVCTFQLIVKNAASLLGNEDPIKLEAESRLKDLVSSFTSLNSLAYETDIQV

Query:  NSNRQKVLDAVNDTIDSLDKFEKGIKDCLE
        NSNRQKVLDAVNDTIDSLDKFEKGIKDCLE
Subjt:  NSNRQKVLDAVNDTIDSLDKFEKGIKDCLE

XP_023545902.1 uncharacterized protein LOC111805195 isoform X1 [Cucurbita pepo subsp. pepo]; e value: 6.4e-117; % identity: 96.96
Query:  MTTIGFRLISVPSAVSPLSLLSHEPRDDQSQALRHNLPVTTSRRGLTMLSFLSAMPSLFLPAPATAFDIGISGPKDWLREQKKKASRFLLAPIEASRDSL
        MTTIGFRLISVPSAVSPLSLLSH P +DQSQALRHNLP+TTSRRGLTMLSFLSAMPSLFLPAPATAFDIGISGPKDWLREQK+KASRFLLAPIEASR SL
Subjt:  MTTIGFRLISVPSAVSPLSLLSHEPRDDQSQALRHNLPVTTSRRGLTMLSFLSAMPSLFLPAPATAFDIGISGPKDWLREQKKKASRFLLAPIEASRDSL

Query:  HAVYLMLSNGSDYSNKDMEDVQRMLKSAARDCVLKDRNSFVQFQASTGVEVCTFQLIVKNAASLLGNEDPIKLEAESRLKDLVSSFTSLNSLAYETDIQV
        HAVYLMLSN SDYSNKDMEDVQRMLKSAARDCVLKDRNSFVQFQASTGVEVCTFQLIVKNAASLLGNEDPIKLEAESRLKDLVSSFTSLNSLAYETDIQV
Subjt:  HAVYLMLSNGSDYSNKDMEDVQRMLKSAARDCVLKDRNSFVQFQASTGVEVCTFQLIVKNAASLLGNEDPIKLEAESRLKDLVSSFTSLNSLAYETDIQV

Query:  NSNRQKVLDAVNDTIDSLDKFEKGIKDCLE
        NSNRQKVLDAVNDTIDSLDKFEKGIKDCLE
Subjt:  NSNRQKVLDAVNDTIDSLDKFEKGIKDCLE

TrEMBL top hits (e value, % identity, alignment)
A0A5A7VAP0 Chloroplast thylakoid membrane; e value: 5.6e-103; % identity: 85.65
Query:  MTTIGFRLISVPSAVSPLSLLSHEPRDDQSQALRHNLPVTTSRRGLTMLSFLSAMPSLFLPAPATAFDIGISGPKDWLREQKKKASRFLLAPIEASRDSL
        MTTIGFRL SVP+AVSP S   +  +  +SQ LRH LP+TTSRRGLTMLSF+SAMPSLFLPAPA+AFDIGISGPKDWL+EQKKK+S+FLLAPIEASRDSL
Subjt:  MTTIGFRLISVPSAVSPLSLLSHEPRDDQSQALRHNLPVTTSRRGLTMLSFLSAMPSLFLPAPATAFDIGISGPKDWLREQKKKASRFLLAPIEASRDSL

Query:  HAVYLMLSNGSDYSNKDMEDVQRMLKSAARDCVLKDRNSFVQFQASTGVEVCTFQLIVKNAASLLGNEDPIKLEAESRLKDLVSSFTSLNSLAYETDIQV
          VYL+LSN SDYSNKDMEDVQR+LKSAARDCVLKDR+SFVQFQASTGVEVCTFQLIVKNAASLLGN+DPIKLEAE+ LKDLVSSFTSLNSLAYETDIQV
Subjt:  HAVYLMLSNGSDYSNKDMEDVQRMLKSAARDCVLKDRNSFVQFQASTGVEVCTFQLIVKNAASLLGNEDPIKLEAESRLKDLVSSFTSLNSLAYETDIQV

Query:  NSNRQKVLDAVNDTIDSLDKFEKGIKDCLE
        NSNRQKVLDA+NDT+ SLDKFEKGIKDCLE
Subjt:  NSNRQKVLDAVNDTIDSLDKFEKGIKDCLE

A0A6J1HBA0 uncharacterized protein LOC111462505 isoform X2; e value: 3.5e-105; % identity: 100
Query:  MTTIGFRLISVPSAVSPLSLLSHEPRDDQSQALRHNLPVTTSRRGLTMLSFLSAMPSLFLPAPATAFDIGISGPKDWLREQKKKASRFLLAPIEASRDSL
        MTTIGFRLISVPSAVSPLSLLSHEPRDDQSQALRHNLPVTTSRRGLTMLSFLSAMPSLFLPAPATAFDIGISGPKDWLREQKKKASRFLLAPIEASRDSL
Subjt:  MTTIGFRLISVPSAVSPLSLLSHEPRDDQSQALRHNLPVTTSRRGLTMLSFLSAMPSLFLPAPATAFDIGISGPKDWLREQKKKASRFLLAPIEASRDSL

Query:  HAVYLMLSNGSDYSNKDMEDVQRMLKSAARDCVLKDRNSFVQFQASTGVEVCTFQLIVKNAASLLGNEDPIKLEAESRLKDLVSSFTSLNSLAYETDIQV
        HAVYLMLSNGSDYSNKDMEDVQRMLKSAARDCVLKDRNSFVQFQASTGVEVCTFQLIVKNAASLLGNEDPIKLEAESRLKDLVSSFTSLNSLAYETDIQV
Subjt:  HAVYLMLSNGSDYSNKDMEDVQRMLKSAARDCVLKDRNSFVQFQASTGVEVCTFQLIVKNAASLLGNEDPIKLEAESRLKDLVSSFTSLNSLAYETDIQV

Query:  NSN
        NSN
Subjt:  NSN

A0A6J1HBI9 uncharacterized protein LOC111462505 isoform X1; e value: 1.2e-121; % identity: 100
Query:  MTTIGFRLISVPSAVSPLSLLSHEPRDDQSQALRHNLPVTTSRRGLTMLSFLSAMPSLFLPAPATAFDIGISGPKDWLREQKKKASRFLLAPIEASRDSL
        MTTIGFRLISVPSAVSPLSLLSHEPRDDQSQALRHNLPVTTSRRGLTMLSFLSAMPSLFLPAPATAFDIGISGPKDWLREQKKKASRFLLAPIEASRDSL
Subjt:  MTTIGFRLISVPSAVSPLSLLSHEPRDDQSQALRHNLPVTTSRRGLTMLSFLSAMPSLFLPAPATAFDIGISGPKDWLREQKKKASRFLLAPIEASRDSL

Query:  HAVYLMLSNGSDYSNKDMEDVQRMLKSAARDCVLKDRNSFVQFQASTGVEVCTFQLIVKNAASLLGNEDPIKLEAESRLKDLVSSFTSLNSLAYETDIQV
        HAVYLMLSNGSDYSNKDMEDVQRMLKSAARDCVLKDRNSFVQFQASTGVEVCTFQLIVKNAASLLGNEDPIKLEAESRLKDLVSSFTSLNSLAYETDIQV
Subjt:  HAVYLMLSNGSDYSNKDMEDVQRMLKSAARDCVLKDRNSFVQFQASTGVEVCTFQLIVKNAASLLGNEDPIKLEAESRLKDLVSSFTSLNSLAYETDIQV

Query:  NSNRQKVLDAVNDTIDSLDKFEKGIKDCLET
        NSNRQKVLDAVNDTIDSLDKFEKGIKDCLET
Subjt:  NSNRQKVLDAVNDTIDSLDKFEKGIKDCLET

A0A6J1K7N2 uncharacterized protein LOC111492399 isoform X2; e value: 1.5e-103; % identity: 98.52
Query:  MTTIGFRLISVPSAVSPLSLLSHEPRDDQSQALRHNLPVTTSRRGLTMLSFLSAMPSLFLPAPATAFDIGISGPKDWLREQKKKASRFLLAPIEASRDSL
        MTTIGFRLISVPSAVSPLSLLSH+PRDDQSQALRHNLP+TTSRRGLTMLSFLSAMPSLFLPAPATAFDIGISGPKDWLREQKKKASRFLLAPIEASRDSL
Subjt:  MTTIGFRLISVPSAVSPLSLLSHEPRDDQSQALRHNLPVTTSRRGLTMLSFLSAMPSLFLPAPATAFDIGISGPKDWLREQKKKASRFLLAPIEASRDSL

Query:  HAVYLMLSNGSDYSNKDMEDVQRMLKSAARDCVLKDRNSFVQFQASTGVEVCTFQLIVKNAASLLGNEDPIKLEAESRLKDLVSSFTSLNSLAYETDIQV
        HAVYLMLSN SDYSNKDMEDVQRMLKSAARDCVLKDRNSFVQFQASTGVEVCTFQLIVKNAASLLGNEDPIKLEAESRLKDLVSSFTSLNSLAYETDIQV
Subjt:  HAVYLMLSNGSDYSNKDMEDVQRMLKSAARDCVLKDRNSFVQFQASTGVEVCTFQLIVKNAASLLGNEDPIKLEAESRLKDLVSSFTSLNSLAYETDIQV

Query:  NSN
        NSN
Subjt:  NSN

A0A6J1K9U3 uncharacterized protein LOC111492399 isoform X1; e value: 1.9e-119; % identity: 98.7
Query:  MTTIGFRLISVPSAVSPLSLLSHEPRDDQSQALRHNLPVTTSRRGLTMLSFLSAMPSLFLPAPATAFDIGISGPKDWLREQKKKASRFLLAPIEASRDSL
        MTTIGFRLISVPSAVSPLSLLSH+PRDDQSQALRHNLP+TTSRRGLTMLSFLSAMPSLFLPAPATAFDIGISGPKDWLREQKKKASRFLLAPIEASRDSL
Subjt:  MTTIGFRLISVPSAVSPLSLLSHEPRDDQSQALRHNLPVTTSRRGLTMLSFLSAMPSLFLPAPATAFDIGISGPKDWLREQKKKASRFLLAPIEASRDSL

Query:  HAVYLMLSNGSDYSNKDMEDVQRMLKSAARDCVLKDRNSFVQFQASTGVEVCTFQLIVKNAASLLGNEDPIKLEAESRLKDLVSSFTSLNSLAYETDIQV
        HAVYLMLSN SDYSNKDMEDVQRMLKSAARDCVLKDRNSFVQFQASTGVEVCTFQLIVKNAASLLGNEDPIKLEAESRLKDLVSSFTSLNSLAYETDIQV
Subjt:  HAVYLMLSNGSDYSNKDMEDVQRMLKSAARDCVLKDRNSFVQFQASTGVEVCTFQLIVKNAASLLGNEDPIKLEAESRLKDLVSSFTSLNSLAYETDIQV

Query:  NSNRQKVLDAVNDTIDSLDKFEKGIKDCLE
        NSNRQKVLDAVNDTIDSLDKFEKGIKDCLE
Subjt:  NSNRQKVLDAVNDTIDSLDKFEKGIKDCLE

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT5G42765.1 INVOLVED IN: biological_process unknown; LOCATED IN: thylakoid, chloroplast thylakoid membrane, chloroplast; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Twin-arginine translocation pathway, signal sequence (InterPro:IPR006311); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). e value: 1.8e-56; % identity: 57.79
Query:  NLPVTTSRRGLTMLSFLSAMPSLFLPA----PATAFDIGISGPKDWLREQKKKASRFLLAPIEASRDSLHAVYLMLSNGSDYSNKDMEDVQRMLKSAARD
        +LP   SRR  T+L  LS+  SL L      PA AF +GISGPK+WL++QKKK+SRFLLAPI+A+R +L + YL L++ SDY+ KD+E++Q + KS+ARD
Subjt:  NLPVTTSRRGLTMLSFLSAMPSLFLPA----PATAFDIGISGPKDWLREQKKKASRFLLAPIEASRDSLHAVYLMLSNGSDYSNKDMEDVQRMLKSAARD

Query:  CVLKDRNSFVQFQASTGVEVCTFQLIVKNAASLLGNEDPIKLEAESRLKDLVSSFTSLNSLAYETDIQVNSNRQKVLDAVNDTIDSLDKFEKGIKDCLE
        CV K+R+S V FQ+ +GVEVCTF+L+VKNAASLL ++DP+KLEAE+ L DLV SF SL  L    D+ + S+R+KV D V +TI  LDKFEKG+KDCLE
Subjt:  CVLKDRNSFVQFQASTGVEVCTFQLIVKNAASLLGNEDPIKLEAESRLKDLVSSFTSLNSLAYETDIQVNSNRQKVLDAVNDTIDSLDKFEKGIKDCLE
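
The % identity values listed for each hit can be cross-checked against the alignment midlines (the unlabeled line between Query and Subjt). The following is a minimal sketch, assuming the usual BLAST pairwise convention in which a letter on the midline marks an identical residue, `+` marks a conservative substitution, and a blank marks a mismatch or gap. The function name `percent_identity` is hypothetical and not part of CuGenDB.

```python
def percent_identity(midline: str, aligned_length: int) -> float:
    """Percent identity from a BLAST-style midline.

    Assumed convention: letters mark identical residues, '+' marks a
    conservative substitution, and a blank marks a mismatch or gap.
    """
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    # Only alphabetic midline characters count as identities.
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / aligned_length, 2)


# Toy midline (not taken from this record): 6 identities over 8 columns.
print(percent_identity("MTT+GF L", 8))
```

Applied to the first GenBank hit, whose three midline blocks span 230 columns and contain two blanks, the same calculation gives 228/230, or 99.13, which agrees with the listed value.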


Sequences
CDS sequence
ATGACAACGATCGGATTCCGCCTTATCTCGGTTCCATCCGCCGTCTCACCACTCTCTCTACTTTCTCATGAACCGAGGGATGATCAAAGCCAAGCGCTACGCCATAACCT
TCCGGTTACAACATCTCGCCGAGGTCTCACTATGCTCTCATTTCTCTCTGCGATGCCGTCTCTGTTTCTACCTGCTCCCGCCACTGCTTTCGATATCGGCATTTCAGGAC
CAAAGGATTGGCTGAGGGAGCAGAAGAAGAAGGCCTCCAGGTTCCTCTTGGCTCCAATTGAGGCCTCCAGAGACAGCCTTCACGCAGTTTACCTTATGCTTTCCAATGGT
TCCGATTACTCAAACAAGGATATGGAGGACGTTCAAAGAATGCTGAAGTCTGCTGCAAGGGATTGCGTTTTGAAGGACAGGAATTCGTTCGTTCAATTCCAAGCGAGCAC
TGGAGTCGAGGTCTGTACATTTCAATTGATTGTGAAAAATGCCGCCTCCTTGCTTGGAAACGAAGATCCTATAAAACTAGAAGCTGAAAGTCGATTAAAAGATCTAGTAA
GCTCGTTCACTTCTCTTAATAGTCTTGCATATGAGACTGATATTCAAGTCAATTCCAACAGACAAAAAGTACTGGATGCAGTAAATGATACCATAGATTCCCTCGACAAA
TTCGAGAAGGGTATTAAGGATTGTCTTGAAACTTAA
mRNA sequence
GGATATTTTTCAAGGAAAAAAAAAATTGGTTCAGTTCAATAATCATAATTCATTACTCGGAAGCATCGCCGTTCTCCGACCATGACAACGATCGGATTCCGCCTTATCTC
GGTTCCATCCGCCGTCTCACCACTCTCTCTACTTTCTCATGAACCGAGGGATGATCAAAGCCAAGCGCTACGCCATAACCTTCCGGTTACAACATCTCGCCGAGGTCTCA
CTATGCTCTCATTTCTCTCTGCGATGCCGTCTCTGTTTCTACCTGCTCCCGCCACTGCTTTCGATATCGGCATTTCAGGACCAAAGGATTGGCTGAGGGAGCAGAAGAAG
AAGGCCTCCAGGTTCCTCTTGGCTCCAATTGAGGCCTCCAGAGACAGCCTTCACGCAGTTTACCTTATGCTTTCCAATGGTTCCGATTACTCAAACAAGGATATGGAGGA
CGTTCAAAGAATGCTGAAGTCTGCTGCAAGGGATTGCGTTTTGAAGGACAGGAATTCGTTCGTTCAATTCCAAGCGAGCACTGGAGTCGAGGTCTGTACATTTCAATTGA
TTGTGAAAAATGCCGCCTCCTTGCTTGGAAACGAAGATCCTATAAAACTAGAAGCTGAAAGTCGATTAAAAGATCTAGTAAGCTCGTTCACTTCTCTTAATAGTCTTGCA
TATGAGACTGATATTCAAGTCAATTCCAACAGACAAAAAGTACTGGATGCAGTAAATGATACCATAGATTCCCTCGACAAATTCGAGAAGGGTATTAAGGATTGTCTTGA
AACTTAAGTCCCGTGATTTTGTTTTGTAATTAGATTTTCTTATGTTAAGTCCAAACAAACAACAATTAAATGAAGCCAAAAGCAAAGTTTTTAAGCTCATTCTCTAGCTT
CTTTAGTGCTAAATCTAGTGCTTGAAAGTTGAATGCATATGTACCCACCCTATCATATATGTGCTTATGTGTGCGTGTATGTGCATGTTCATATTTAGTTTGGTGGAATT
ATTATATATCTTGAAAATCTGGGAGTGTAATACGCTATCAAGATATTTATAAGTAAATCCATTCGTTTTGACAGTCATATGTTATCATTATTATACTAACAAAGACAGAC
ATTTCATGTGCTCTCAGAACACCATTTCTCTCTGTTAGCAAGTGATAGTAGCTAACTTTCATCTATTGGGCAGTCCTTGCGATTCTTTTGATGCTTTCAACGTTATCATT
TCTTCCTACAGAACCATGGTATATACAAAATTAATATCCAACTCTTAGAGTTTATAACCATTGAAAATCAGGGATAAAATAACTATTTGCCGATGATTTTGCCATTTGGT
GATGTAAAAATTTTCACTTGTGGCAAATGATTGAGTTGTTACAAGAGATTCCAGTTAATTATTATAAGTTCAGTTTCCTAGGCTCTTTTCATTGGCATAGTAGACTATTT
CGGGAAAGATGTTTGCATAGGTGATTTCTTTTCACATGCATAAGTCTGAACGTTATAGTGGAATGCTTGTAATCTACTTCTTTCAGAATGCACAATTTAGGTTCAAGTTT
GTTCTAATTGCATAGTTTACACATGACATACTCGAATTCCACATGTATAGCTACACCTCTGATATGGTATGGTGTAGAAACCTTGGGCAGTATAATACGACATGCTCTCC
TTGGAATAGTTGTAATAGGTAGCAGGTAGATATGACGTCTGACATTCATCCAACCTACTCACTCAGAGATTTTCTTACTTGTGGGTTGTCTTTATGGGACGGTTGTAGCT
GAGGTGAGCCAGAAGCCTGTACCAAGTACAAGCGGCAGCAAGGTTTTCCATCATCAGGGTGAGTCTGCAATGACCTAGGTAGACTGCGGATTTTATTGAAGAGATTTAAT
GGATCGTGCATAAGCCCAAAGTGTGCATCCATATCTCCAATCTGTGATAGTGTAGTGTTGGAAAATCCAAGGGACGGAAGTAGATGATCAGGCCAATCGCAGTAGAAATG
GAAGACAGAATTTGAAAGAGTGGTTGATGGCTTGTTCATAAAATCTGCTAAGAGGACTGTGTGGGTTATGGTGCATTGGTCAGCTATTGTCTCCAACACTTGCAGCGCCT
GGGAATGGCCTTTAGAAGAGTGGCTTTCGCTTGCAACAACTCAGAGAAATCAACTTCAAAAACGTCACTTTCCTTCAAGCAACTTAATCTATGCAGCTGCAAATAATTCT
CTTTGACTAAGCATACAGTTCCCATTTTGGTAGTTTTGATGGCCTCATTTTTAGTCACTAAATAGGTTGCCGTGGTTATCTATCATTAACTAGCGTTTTAGCCTGCCCTC
TCCCCCACTGCAACCCCACCAAAGACGAAAGAGGAAAAAAGAGATAGAGAACGACAAGAAAAATAAAAAGGACTACGCACTTTTCTTAGATATCATTGTAAAAACAGGGT
CGTTATTTGTGCTGTTGGCTATGGCTATCTTCTACTGCTATGGTTGGGACCCTCTTAATGCAGTATGAAGTTTAACTAATTGCAGCTTCTGGGAGTTGAACCAACCTGCA
CCAAGGAGAACGACCTGTGAATCTCTGCCACTGATGGATGTCAGAGCACCTTCAATTTTTGAATCAAACCAGAGAGTGCGAACAGCAAGAATCACTCCAGAAATCTCTCT
TGCATTGTTGCTTCGATCTTTCTTTATTTTCTCATAAAGGTTTCTTAGGCATGCTTCTCCAGCCAGTAACTCTGCCAGTGGATCATGGATCATGTGCTTCCAAAGCGCTC
TTCCTGCTGCCATTTGACATGCCGAGCGTTGGAGGTAATCCCATTCAGCCTCAATTCTGGCATGGATTTCACGTACACTGTCTACCAATAACAAATCTGGAAGCTTTAGC
TCAGACCATCCTGGGTTGTCTGGCCAACCATTCTCTTTATCAGTCATGCTTCGATTATAAAACGGGCTGTTAGAATTGATAAGAAATAGGGAAGTAGATAATGCGTTCTG
TTTTTGTAGGAACAAGGTGTGCGCAAGGAAAAAGAGCAGGATTTTATGGAATGGGATAAGGAAAGAGAGCCACAACAACCAATGCTTAAAAAAAAAGCTCAATTTAATGA
TAAGGGAGAGTGAGTATATAAGTTCCACGAGTGCGTTAAATCTCGTATTTTTGCTCTCTCTCTCTCTCTATTGTGGATTCTTTTGTGCTGTCTTTCATCAGAAGAGAAGA
GGAAGAGGGCTCCACTGAAAAAGACGCGCGATCACATTTGAGTGGCTCATAGCTCGGATATTGAAATTGCCATACTACTTCAGTAATACCAGGTACCTTTGTTTTTATTC
ATTTACATGGCAAATGTTTTTCTGATTCTGATTCTTCTGCACGTGGCCGTTAATTCTCCATTCCTTTAAGGCCTGTTTGGCTCTCAGAGAGAGATATTTTTCCGTAGTTC
TGGCTGAAGGCAAGGTATTGATTGGTTAGTTATTGGGCCATCCTATTTGGGGCGAAGGGATATTGGATATCGCTTAAGATCTTACCTCTCTAGTTTGTATTCGAAGTTGG
ATAATTAAAACTTTTCAAGTGGAGTATACAAATTTGAATCTCCAGATGGAGAGGAAGCCAGGTGTTTTTTTTTTCTTGAAGTACTTAGTTGGATTAGTTTAGGTTAAATA
TTGATTTAGATATTTAAATGGTTTTGTTGTTTGTTTTGATCCAACCGATTATTTACCTACATGTGTTTAAGTGTATGTAGTTATTTATTTATTTAAGATTGATTGACAAC
TTTATTTCACTTGTTAATGTTTAAAAGTATTCTTTAACTTGTAGTTAAATTTTTAAGAGAG
Protein sequence
MTTIGFRLISVPSAVSPLSLLSHEPRDDQSQALRHNLPVTTSRRGLTMLSFLSAMPSLFLPAPATAFDIGISGPKDWLREQKKKASRFLLAPIEASRDSLHAVYLMLSNG
SDYSNKDMEDVQRMLKSAARDCVLKDRNSFVQFQASTGVEVCTFQLIVKNAASLLGNEDPIKLEAESRLKDLVSSFTSLNSLAYETDIQVNSNRQKVLDAVNDTIDSLDK
FEKGIKDCLET
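
The protein sequence above corresponds to the CDS read in frame with the standard genetic code. The following is a minimal sketch of that translation, assuming the standard nuclear code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are hypothetical helpers introduced for illustration, not part of any CuGenDB tooling.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G at each position; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 18 and last 9 nucleotides of the CDS listed in this record.
print(translate("ATGACAACGATCGGATTC"))  # MTTIGF
print(translate("GAAACTTAA"))           # ET
```

Passing the full CDS (with its line breaks) to `translate` should reproduce the protein sequence shown above, from the initial MTTIGF through the terminal ET.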