; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc01G10140 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc01G10140
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description14 kDa zinc-binding protein-like
Genome locationClcChr01:12709247..12714085
RNA-Seq ExpressionClc01G10140
SyntenyClc01G10140
Gene Ontology termsGO:0005737 - cytoplasm (cellular component)
GO:0047627 - adenylylsulfatase activity (molecular function)
InterPro domainsIPR001310 - Histidine triad (HIT) protein
IPR011146 - HIT-like domain
IPR019808 - Histidine triad, conserved site
IPR036265 - HIT-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK12293.1 14 kDa zinc-binding protein isoform X2 [Cucumis melo var. makuwa]3.1e-9094.54Show/hide
Query:  MAAGTAFLRQCIATTTRTLVTAKLKHSSNHFTTFLLPLSTRSRRLTIRAGITNDEEVAAKAAASDADSGAPTIFDKIIAKEIPSNIVYEDDKVLAFRDIN
        MAAGT+FLRQC+ATTTRTLVTAKLKHSS HF+TFLLPL T SRRLTIRAGITN+EEVAAKAAASDADSGAPTIFDKII+KEIPSNIVYEDDKVLAFRDIN
Subjt:  MAAGTAFLRQCIATTTRTLVTAKLKHSSNHFTTFLLPLSTRSRRLTIRAGITNDEEVAAKAAASDADSGAPTIFDKIIAKEIPSNIVYEDDKVLAFRDIN

Query:  PQAPVHVLIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEKEGIVEGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG
        PQAPVHVLIIPKLRDGLTELGKAEARHGEILG+LLYAAKIVAEKEGI+EGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG
Subjt:  PQAPVHVLIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEKEGIVEGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG

XP_004145437.1 14 kDa zinc-binding protein isoform X2 [Cucumis sativus]3.1e-9095.63Show/hide
Query:  MAAGTAFLRQCIATTTRTLVTAKLKHSSNHFTTFLLPLSTRSRRLTIRAGITNDEEVAAKAAASDADSGAPTIFDKIIAKEIPSNIVYEDDKVLAFRDIN
        MAAGT+FLRQCIATTTRTLVTAK KHSS HF+TFLLPL T SRRLTIRAGITN+EEVAAKAAASDADSGAPTIFDKII+KEIPSNIVYEDDKVLAFRDIN
Subjt:  MAAGTAFLRQCIATTTRTLVTAKLKHSSNHFTTFLLPLSTRSRRLTIRAGITNDEEVAAKAAASDADSGAPTIFDKIIAKEIPSNIVYEDDKVLAFRDIN

Query:  PQAPVHVLIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEKEGIVEGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG
        PQAPVHVLIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEKEGIVEGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG
Subjt:  PQAPVHVLIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEKEGIVEGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG

XP_008459016.1 PREDICTED: 14 kDa zinc-binding protein isoform X1 [Cucumis melo]5.8e-8994.57Show/hide
Query:  MAAGTAFL-RQCIATTTRTLVTAKLKHSSNHFTTFLLPLSTRSRRLTIRAGITNDEEVAAKAAASDADSGAPTIFDKIIAKEIPSNIVYEDDKVLAFRDI
        MAAGT+FL RQC+ATTTRTLVTAKLKHSS HF+TFLLPL T SRRLTIRAGITN+EEVAAKAAASDADSGAPTIFDKII+KEIPSNIVYEDDKVLAFRDI
Subjt:  MAAGTAFL-RQCIATTTRTLVTAKLKHSSNHFTTFLLPLSTRSRRLTIRAGITNDEEVAAKAAASDADSGAPTIFDKIIAKEIPSNIVYEDDKVLAFRDI

Query:  NPQAPVHVLIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEKEGIVEGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG
        NPQAPVHVLIIPKLRDGLTELGKAEARHGEILG+LLYAAKIVAEKEGIVEGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG
Subjt:  NPQAPVHVLIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEKEGIVEGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG

XP_008459017.1 PREDICTED: 14 kDa zinc-binding protein isoform X2 [Cucumis melo]2.4e-9095.08Show/hide
Query:  MAAGTAFLRQCIATTTRTLVTAKLKHSSNHFTTFLLPLSTRSRRLTIRAGITNDEEVAAKAAASDADSGAPTIFDKIIAKEIPSNIVYEDDKVLAFRDIN
        MAAGT+FLRQC+ATTTRTLVTAKLKHSS HF+TFLLPL T SRRLTIRAGITN+EEVAAKAAASDADSGAPTIFDKII+KEIPSNIVYEDDKVLAFRDIN
Subjt:  MAAGTAFLRQCIATTTRTLVTAKLKHSSNHFTTFLLPLSTRSRRLTIRAGITNDEEVAAKAAASDADSGAPTIFDKIIAKEIPSNIVYEDDKVLAFRDIN

Query:  PQAPVHVLIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEKEGIVEGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG
        PQAPVHVLIIPKLRDGLTELGKAEARHGEILG+LLYAAKIVAEKEGIVEGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG
Subjt:  PQAPVHVLIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEKEGIVEGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG

XP_011660307.1 14 kDa zinc-binding protein isoform X1 [Cucumis sativus]7.6e-8995.11Show/hide
Query:  MAAGTAFL-RQCIATTTRTLVTAKLKHSSNHFTTFLLPLSTRSRRLTIRAGITNDEEVAAKAAASDADSGAPTIFDKIIAKEIPSNIVYEDDKVLAFRDI
        MAAGT+FL RQCIATTTRTLVTAK KHSS HF+TFLLPL T SRRLTIRAGITN+EEVAAKAAASDADSGAPTIFDKII+KEIPSNIVYEDDKVLAFRDI
Subjt:  MAAGTAFL-RQCIATTTRTLVTAKLKHSSNHFTTFLLPLSTRSRRLTIRAGITNDEEVAAKAAASDADSGAPTIFDKIIAKEIPSNIVYEDDKVLAFRDI

Query:  NPQAPVHVLIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEKEGIVEGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG
        NPQAPVHVLIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEKEGIVEGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG
Subjt:  NPQAPVHVLIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEKEGIVEGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG

TrEMBL top hitse value%identityAlignment
A0A0A0M163 HIT domain-containing protein1.5e-9095.63Show/hide
Query:  MAAGTAFLRQCIATTTRTLVTAKLKHSSNHFTTFLLPLSTRSRRLTIRAGITNDEEVAAKAAASDADSGAPTIFDKIIAKEIPSNIVYEDDKVLAFRDIN
        MAAGT+FLRQCIATTTRTLVTAK KHSS HF+TFLLPL T SRRLTIRAGITN+EEVAAKAAASDADSGAPTIFDKII+KEIPSNIVYEDDKVLAFRDIN
Subjt:  MAAGTAFLRQCIATTTRTLVTAKLKHSSNHFTTFLLPLSTRSRRLTIRAGITNDEEVAAKAAASDADSGAPTIFDKIIAKEIPSNIVYEDDKVLAFRDIN

Query:  PQAPVHVLIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEKEGIVEGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG
        PQAPVHVLIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEKEGIVEGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG
Subjt:  PQAPVHVLIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEKEGIVEGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG

A0A1S3C9B7 14 kDa zinc-binding protein isoform X21.1e-9095.08Show/hide
Query:  MAAGTAFLRQCIATTTRTLVTAKLKHSSNHFTTFLLPLSTRSRRLTIRAGITNDEEVAAKAAASDADSGAPTIFDKIIAKEIPSNIVYEDDKVLAFRDIN
        MAAGT+FLRQC+ATTTRTLVTAKLKHSS HF+TFLLPL T SRRLTIRAGITN+EEVAAKAAASDADSGAPTIFDKII+KEIPSNIVYEDDKVLAFRDIN
Subjt:  MAAGTAFLRQCIATTTRTLVTAKLKHSSNHFTTFLLPLSTRSRRLTIRAGITNDEEVAAKAAASDADSGAPTIFDKIIAKEIPSNIVYEDDKVLAFRDIN

Query:  PQAPVHVLIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEKEGIVEGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG
        PQAPVHVLIIPKLRDGLTELGKAEARHGEILG+LLYAAKIVAEKEGIVEGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG
Subjt:  PQAPVHVLIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEKEGIVEGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG

A0A1S4E2B9 14 kDa zinc-binding protein isoform X12.8e-8994.57Show/hide
Query:  MAAGTAFL-RQCIATTTRTLVTAKLKHSSNHFTTFLLPLSTRSRRLTIRAGITNDEEVAAKAAASDADSGAPTIFDKIIAKEIPSNIVYEDDKVLAFRDI
        MAAGT+FL RQC+ATTTRTLVTAKLKHSS HF+TFLLPL T SRRLTIRAGITN+EEVAAKAAASDADSGAPTIFDKII+KEIPSNIVYEDDKVLAFRDI
Subjt:  MAAGTAFL-RQCIATTTRTLVTAKLKHSSNHFTTFLLPLSTRSRRLTIRAGITNDEEVAAKAAASDADSGAPTIFDKIIAKEIPSNIVYEDDKVLAFRDI

Query:  NPQAPVHVLIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEKEGIVEGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG
        NPQAPVHVLIIPKLRDGLTELGKAEARHGEILG+LLYAAKIVAEKEGIVEGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG
Subjt:  NPQAPVHVLIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEKEGIVEGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG

A0A5A7TIN3 14 kDa zinc-binding protein isoform X13.7e-8994.02Show/hide
Query:  MAAGTAFL-RQCIATTTRTLVTAKLKHSSNHFTTFLLPLSTRSRRLTIRAGITNDEEVAAKAAASDADSGAPTIFDKIIAKEIPSNIVYEDDKVLAFRDI
        MAAGT+FL RQC+ATTTRTLVTAKLKHSS HF+TFLLPL T SRRLTIRAGITN+EEVAAKAAASDADSGAPTIFDKII+KEIPSNIVYEDDKVLAFRDI
Subjt:  MAAGTAFL-RQCIATTTRTLVTAKLKHSSNHFTTFLLPLSTRSRRLTIRAGITNDEEVAAKAAASDADSGAPTIFDKIIAKEIPSNIVYEDDKVLAFRDI

Query:  NPQAPVHVLIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEKEGIVEGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG
        NPQAPVHVLIIPKLRDGLTELGKAEARHGEILG+LLYAAKIVAEKEGI+EGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG
Subjt:  NPQAPVHVLIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEKEGIVEGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG

A0A5D3CK60 14 kDa zinc-binding protein isoform X21.5e-9094.54Show/hide
Query:  MAAGTAFLRQCIATTTRTLVTAKLKHSSNHFTTFLLPLSTRSRRLTIRAGITNDEEVAAKAAASDADSGAPTIFDKIIAKEIPSNIVYEDDKVLAFRDIN
        MAAGT+FLRQC+ATTTRTLVTAKLKHSS HF+TFLLPL T SRRLTIRAGITN+EEVAAKAAASDADSGAPTIFDKII+KEIPSNIVYEDDKVLAFRDIN
Subjt:  MAAGTAFLRQCIATTTRTLVTAKLKHSSNHFTTFLLPLSTRSRRLTIRAGITNDEEVAAKAAASDADSGAPTIFDKIIAKEIPSNIVYEDDKVLAFRDIN

Query:  PQAPVHVLIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEKEGIVEGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG
        PQAPVHVLIIPKLRDGLTELGKAEARHGEILG+LLYAAKIVAEKEGI+EGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG
Subjt:  PQAPVHVLIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEKEGIVEGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG

SwissProt top hitse value%identityAlignment
P42855 14 kDa zinc-binding protein (Fragment)2.7e-4472.32Show/hide
Query:  TIFDKIIAKEIPSNIVYEDDKVLAFRDINPQAPVHVLIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEKEGIVEGFRVVINSGASACQSVYHLHLH
        TIF KII+KEIPS +VYEDDKVLAFRDI PQ PVH+L+IPK+RDGLT L KAE RH +ILG+LLY AK+VA++EG+ EGFR+VIN G   CQSVYH+H+H
Subjt:  TIFDKIIAKEIPSNIVYEDDKVLAFRDINPQAPVHVLIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEKEGIVEGFRVVINSGASACQSVYHLHLH

Query:  VLGGRQMKWPPG
        ++GGRQM WPPG
Subjt:  VLGGRQMKWPPG

P42856 14 kDa zinc-binding protein1.7e-4670.97Show/hide
Query:  KAAASDADSGAPTIFDKIIAKEIPSNIVYEDDKVLAFRDINPQAPVHVLIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEKEGIVEGFRVVINSGA
        K AA      +PTIFDKII KEIPS +VYED+KVLAFRDINPQAP H+LIIPK++DGLT L KAE RH EILG LLY AK+VA++EG+ +G+RVVIN G 
Subjt:  KAAASDADSGAPTIFDKIIAKEIPSNIVYEDDKVLAFRDINPQAPVHVLIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEKEGIVEGFRVVINSGA

Query:  SACQSVYHLHLHVLGGRQMKWPPG
        S CQSVYH+H+H+LGGRQM WPPG
Subjt:  SACQSVYHLHLHVLGGRQMKWPPG

Q8GUN2 Adenylylsulfatase HINT14.4e-4765.19Show/hide
Query:  AGITNDEEVAAKAAASDADSGAPTIFDKIIAKEIPSNIVYEDDKVLAFRDINPQAPVHVLIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEKEGIV
        A + +++E A  A  SD    +PTIFDKII+KEIPS +V+EDDKVLAFRDI PQ PVH+L+IPK+RDGLT L KAE RH +ILG+LLY AK+VA++EG+ 
Subjt:  AGITNDEEVAAKAAASDADSGAPTIFDKIIAKEIPSNIVYEDDKVLAFRDINPQAPVHVLIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEKEGIV

Query:  EGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG
        EGFR+VIN G   CQSVYH+H+H++GGRQM WPPG
Subjt:  EGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG

Q8SQ21 Histidine triad nucleotide-binding protein 2, mitochondrial4.7e-3351.11Show/hide
Query:  AGITNDEEVAAKAAASDADSGAPTIFDKIIAKEIPSNIVYEDDKVLAFRDINPQAPVHVLIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEKEGIV
        AG+T+ +EV AKA  +     APTIF +I+ + +P++I+YED + LAFRD+ PQAPVH L+IPK    +  + +AE    ++LG LL  AK  A+ EG+ 
Subjt:  AGITNDEEVAAKAAASDADSGAPTIFDKIIAKEIPSNIVYEDDKVLAFRDINPQAPVHVLIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEKEGIV

Query:  EGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG
        +G+R+VIN G    QSVYHLH+HVLGGRQ++WPPG
Subjt:  EGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG

Q9BX68 Histidine triad nucleotide-binding protein 2, mitochondrial8.0e-3351.11Show/hide
Query:  AGITNDEEVAAKAAASDADSGAPTIFDKIIAKEIPSNIVYEDDKVLAFRDINPQAPVHVLIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEKEGIV
        AG+T+  EV AKA  +     APTIF +I+ K +P++I+YED + L FRD+ PQAPVH L+IPK    +  + +AE    ++LG LL  AK  A+ EG+ 
Subjt:  AGITNDEEVAAKAAASDADSGAPTIFDKIIAKEIPSNIVYEDDKVLAFRDINPQAPVHVLIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEKEGIV

Query:  EGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG
        +G+R+VIN G    QSVYHLH+HVLGGRQ++WPPG
Subjt:  EGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG

Arabidopsis top hitse value%identityAlignment
AT1G31160.1 HISTIDINE TRIAD NUCLEOTIDE-BINDING 22.2e-6280.56Show/hide
Query:  TRSRRLTIRAGITNDEEVAAKAAASDADSGAPTIFDKIIAKEIPSNIVYEDDKVLAFRDINPQAPVHVLIIPKLRDGLTELGKAEARHGEILGQLLYAAK
        TRS+RL       ++EE AAKAAAS AD+GAPTIFDKIIAKEIPS+IVYED+ VLAFRDINPQAPVHVL+IPKLRDGLT LGKAE RH E+LGQLL+A+K
Subjt:  TRSRRLTIRAGITNDEEVAAKAAASDADSGAPTIFDKIIAKEIPSNIVYEDDKVLAFRDINPQAPVHVLIIPKLRDGLTELGKAEARHGEILGQLLYAAK

Query:  IVAEKEGIVEGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG
        IVAEKEGI++GFRVVIN+G  ACQSVYHLHLHVLGGRQMKWPPG
Subjt:  IVAEKEGIVEGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG

AT3G56490.1 HIS triad family protein 33.1e-4865.19Show/hide
Query:  AGITNDEEVAAKAAASDADSGAPTIFDKIIAKEIPSNIVYEDDKVLAFRDINPQAPVHVLIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEKEGIV
        A + +++E A  A  SD    +PTIFDKII+KEIPS +V+EDDKVLAFRDI PQ PVH+L+IPK+RDGLT L KAE RH +ILG+LLY AK+VA++EG+ 
Subjt:  AGITNDEEVAAKAAASDADSGAPTIFDKIIAKEIPSNIVYEDDKVLAFRDINPQAPVHVLIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEKEGIV

Query:  EGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG
        EGFR+VIN G   CQSVYH+H+H++GGRQM WPPG
Subjt:  EGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG

AT4G16566.1 histidine triad nucleotide-binding 44.7e-0427.72Show/hide
Query:  IFDKIIAKEIPSNIVYEDDKVLAFRDINPQAPVHVLIIPKLRDGLTELGKAEARHGE--ILGQLLYAAKIVAEKEGIVEGFRVVINSGASACQSVYHLHL
        IF +I+     + +++ D+KV+AF+DI P A  H L+IPK  + +  +   + R  +  ++  +L   + + +K+      R   +       SV HLHL
Subjt:  IFDKIIAKEIPSNIVYEDDKVLAFRDINPQAPVHVLIIPKLRDGLTELGKAEARHGE--ILGQLLYAAKIVAEKEGIVEGFRVVINSGASACQSVYHLHL

Query:  H
        H
Subjt:  H


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGCTGGGACAGCTTTCTTGAGGCAATGTATAGCTACGACTACAAGAACATTGGTGACTGCGAAATTGAAACATTCCAGCAACCATTTCACGACTTTTCTCCTCCC
ACTTAGCACTCGCTCTCGAAGATTGACCATTCGTGCTGGTATCACAAACGATGAAGAAGTTGCTGCTAAAGCAGCTGCATCTGATGCAGACAGTGGAGCTCCAACCATAT
TTGACAAAATAATAGCCAAGGAAATTCCTTCGAACATTGTTTATGAGGATGATAAAGTCCTTGCATTCCGAGACATAAATCCACAGGCTCCTGTTCATGTATTAATCATT
CCAAAGCTCAGAGATGGATTGACAGAGCTGGGAAAGGCTGAAGCAAGACATGGAGAGATACTGGGTCAGCTGCTGTATGCAGCCAAAATAGTGGCTGAAAAAGAGGGTAT
TGTTGAAGGATTTCGAGTAGTCATCAACAGTGGGGCAAGTGCTTGTCAATCTGTGTATCATCTTCACTTGCATGTCCTGGGAGGGAGACAGATGAAATGGCCTCCAGGTT
GA
mRNA sequenceShow/hide mRNA sequence
TGAGAATGAAAAGATTAGCCATCATCGACAGTACTTCCTCTCTCTCTCTCTCTCTCTCTGAGAAGAAGAAGATGTATATGGATCGATATTAACAGATCCAGAACCATGGG
ATGAGAGAAATGGGCTAACAATAAATCCACTTTTGTCTTGAAGACCGACTTTTGGCCCAACCCATTTAAGTTAAAGCTTGTGTATTTTTAGACAAAATAAGAAAATATGT
GCAAAAACCATCCTCAAAACCCATCAACCGTTTGTTAAACACTCCTTCAAATTTTGGCCGCTCTGGTTGATTAAGAAAGCAAATGGCTGCTGGGACAGCTTTCTTGAGGC
AATGTATAGCTACGACTACAAGAACATTGGTGACTGCGAAATTGAAACATTCCAGCAACCATTTCACGACTTTTCTCCTCCCACTTAGCACTCGCTCTCGAAGATTGACC
ATTCGTGCTGGTATCACAAACGATGAAGAAGTTGCTGCTAAAGCAGCTGCATCTGATGCAGACAGTGGAGCTCCAACCATATTTGACAAAATAATAGCCAAGGAAATTCC
TTCGAACATTGTTTATGAGGATGATAAAGTCCTTGCATTCCGAGACATAAATCCACAGGCTCCTGTTCATGTATTAATCATTCCAAAGCTCAGAGATGGATTGACAGAGC
TGGGAAAGGCTGAAGCAAGACATGGAGAGATACTGGGTCAGCTGCTGTATGCAGCCAAAATAGTGGCTGAAAAAGAGGGTATTGTTGAAGGATTTCGAGTAGTCATCAAC
AGTGGGGCAAGTGCTTGTCAATCTGTGTATCATCTTCACTTGCATGTCCTGGGAGGGAGACAGATGAAATGGCCTCCAGGTTGAAGGCAACTACATTTTTTGTAATATAT
CATAGCTAAACTGGTTTCGATATATTTATTGTATCTCCATTCAGTGAGTGAGAACTGAGAAGCTTAAACATAAGAAAATAAAAGGACAACAATGACTTCTGTAGATGTGA
CTCAAGTTGGATGGTTGTATCAGCTTATATTTATAACAGGCAAGTAGTGTTTCAAATTT
Protein sequenceShow/hide protein sequence
MAAGTAFLRQCIATTTRTLVTAKLKHSSNHFTTFLLPLSTRSRRLTIRAGITNDEEVAAKAAASDADSGAPTIFDKIIAKEIPSNIVYEDDKVLAFRDINPQAPVHVLII
PKLRDGLTELGKAEARHGEILGQLLYAAKIVAEKEGIVEGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG