; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G13182 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G13182
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
DescriptionHistidine triad nucleotide-binding protein 2
Genome locationctg1838:1796262..1800372
RNA-Seq ExpressionCucsat.G13182
SyntenyCucsat.G13182
Gene Ontology termsGO:0006790 - sulfur compound metabolic process (biological process)
GO:0009150 - purine ribonucleotide metabolic process (biological process)
GO:0005737 - cytoplasm (cellular component)
GO:0005777 - peroxisome (cellular component)
GO:0047627 - adenylylsulfatase activity (molecular function)
InterPro domainsIPR001310 - Histidine triad (HIT) protein
IPR011146 - HIT-like domain
IPR019808 - Histidine triad, conserved site
IPR036265 - HIT-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7017601.1 Adenylylsulfatase HINT1 [Cucurbita argyrosperma subsp. argyrosperma]3.17e-8974.62Show/hide
Query:  MVNEDIKLRTRLSVLSSHFSHSLSISMASSEQEAALAAVPSDSPTIAWSVCSKQKANGSMSTVRFKDVSRNQILNSYKNLDELALIGFDKIINKEIPSTV
        MVNEDIK  TRLSVLSSH S SLSISMASSE+EAALAAVPSDSPTI                                         FDKIINKEIPST+
Subjt:  MVNEDIKLRTRLSVLSSHFSHSLSISMASSEQEAALAAVPSDSPTIAWSVCSKQKANGSMSTVRFKDVSRNQILNSYKNLDELALIGFDKIINKEIPSTV

Query:  VFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLLGGRQMNWPPG
        VFEDDKVLAFRDI+PQAPTHILIIPKVKDGLSGLSKAEERHTEILG LLY AKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLLGGRQMNWPPG
Subjt:  VFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLLGGRQMNWPPG

XP_004136603.2 14 kDa zinc-binding protein [Cucumis sativus]3.31e-10280Show/hide
Query:  MKKEIRAVMVNEDIKLRTRLSVLSSHFSHSLSISMASSEQEAALAAVPSDSPTIAWSVCSKQKANGSMSTVRFKDVSRNQILNSYKNLDELALIGFDKII
        MKKEIRAVMVNEDIKLRTRLSVLSSHFSHSLSISMASSEQEAALAAVPSDSPTI                                         FDKII
Subjt:  MKKEIRAVMVNEDIKLRTRLSVLSSHFSHSLSISMASSEQEAALAAVPSDSPTIAWSVCSKQKANGSMSTVRFKDVSRNQILNSYKNLDELALIGFDKII

Query:  NKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLLGGRQM
        NKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLLGGRQM
Subjt:  NKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLLGGRQM

Query:  NWPPG
        NWPPG
Subjt:  NWPPG

XP_008443161.1 PREDICTED: 14 kDa zinc-binding protein [Cucumis melo]1.80e-9578.17Show/hide
Query:  MVNEDIKLRTRLSVLSSHFSHSLSISMASSEQEAALAAVPSDSPTIAWSVCSKQKANGSMSTVRFKDVSRNQILNSYKNLDELALIGFDKIINKEIPSTV
        MVNEDIKLRTRLSVLSSH SHSLSISMASSEQEAALAAVPSDSPTI                                         FDKIINKEIPSTV
Subjt:  MVNEDIKLRTRLSVLSSHFSHSLSISMASSEQEAALAAVPSDSPTIAWSVCSKQKANGSMSTVRFKDVSRNQILNSYKNLDELALIGFDKIINKEIPSTV

Query:  VFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLLGGRQMNWPPG
        VFEDDKVLAFRDIAPQAPTHILIIP+VKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLLGGRQMNWPPG
Subjt:  VFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLLGGRQMNWPPG

XP_022971276.1 14 kDa zinc-binding protein-like isoform X2 [Cucurbita maxima]6.84e-9173.89Show/hide
Query:  KEIRAVMVNEDIKLRTRLSVLSSHFSHSLSISMASSEQEAALAAVPSDSPTIAWSVCSKQKANGSMSTVRFKDVSRNQILNSYKNLDELALIGFDKIINK
        ++IRAVMVN DIK  TRLSVLSSH + SLSISMASSE+EAALAAVPSDSPTI                                         FDKIINK
Subjt:  KEIRAVMVNEDIKLRTRLSVLSSHFSHSLSISMASSEQEAALAAVPSDSPTIAWSVCSKQKANGSMSTVRFKDVSRNQILNSYKNLDELALIGFDKIINK

Query:  EIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLLGGRQMNW
        EIPSTVVFEDDKVLAFRDI+PQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIA+QEGLDDGFR+VINDGPSGCQSVYHLHVHLLGGRQMNW
Subjt:  EIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLLGGRQMNW

Query:  PPG
        PPG
Subjt:  PPG

XP_038905126.1 14 kDa zinc-binding protein [Benincasa hispida]8.54e-9477.16Show/hide
Query:  MVNEDIKLRTRLSVLSSHFSHSLSISMASSEQEAALAAVPSDSPTIAWSVCSKQKANGSMSTVRFKDVSRNQILNSYKNLDELALIGFDKIINKEIPSTV
        M+NEDIK RTRLSVLSSH SHSLSISMASSEQEAALAAVPSDSPTI                                         FDKIINKEIPS V
Subjt:  MVNEDIKLRTRLSVLSSHFSHSLSISMASSEQEAALAAVPSDSPTIAWSVCSKQKANGSMSTVRFKDVSRNQILNSYKNLDELALIGFDKIINKEIPSTV

Query:  VFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLLGGRQMNWPPG
        VFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLLGGRQMNWPPG
Subjt:  VFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLLGGRQMNWPPG

TrEMBL top hitse value%identityAlignment
A0A0A0LC24 HIT domain-containing protein3.71e-9779.19Show/hide
Query:  MVNEDIKLRTRLSVLSSHFSHSLSISMASSEQEAALAAVPSDSPTIAWSVCSKQKANGSMSTVRFKDVSRNQILNSYKNLDELALIGFDKIINKEIPSTV
        MVNEDIKLRTRLSVLSSHFSHSLSISMASSEQEAALAAVPSDSPTI                                         FDKIINKEIPSTV
Subjt:  MVNEDIKLRTRLSVLSSHFSHSLSISMASSEQEAALAAVPSDSPTIAWSVCSKQKANGSMSTVRFKDVSRNQILNSYKNLDELALIGFDKIINKEIPSTV

Query:  VFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLLGGRQMNWPPG
        VFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLLGGRQMNWPPG
Subjt:  VFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLLGGRQMNWPPG

A0A1S3B859 14 kDa zinc-binding protein8.73e-9678.17Show/hide
Query:  MVNEDIKLRTRLSVLSSHFSHSLSISMASSEQEAALAAVPSDSPTIAWSVCSKQKANGSMSTVRFKDVSRNQILNSYKNLDELALIGFDKIINKEIPSTV
        MVNEDIKLRTRLSVLSSH SHSLSISMASSEQEAALAAVPSDSPTI                                         FDKIINKEIPSTV
Subjt:  MVNEDIKLRTRLSVLSSHFSHSLSISMASSEQEAALAAVPSDSPTIAWSVCSKQKANGSMSTVRFKDVSRNQILNSYKNLDELALIGFDKIINKEIPSTV

Query:  VFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLLGGRQMNWPPG
        VFEDDKVLAFRDIAPQAPTHILIIP+VKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLLGGRQMNWPPG
Subjt:  VFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLLGGRQMNWPPG

A0A6J1F2T2 14 kDa zinc-binding protein-like1.03e-8773.6Show/hide
Query:  MVNEDIKLRTRLSVLSSHFSHSLSISMASSEQEAALAAVPSDSPTIAWSVCSKQKANGSMSTVRFKDVSRNQILNSYKNLDELALIGFDKIINKEIPSTV
        MVNE IK  TRL+VLSSH S SLSISMASSE+EAALAAVPSDSPTI                                         FDKIINKEIPST+
Subjt:  MVNEDIKLRTRLSVLSSHFSHSLSISMASSEQEAALAAVPSDSPTIAWSVCSKQKANGSMSTVRFKDVSRNQILNSYKNLDELALIGFDKIINKEIPSTV

Query:  VFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLLGGRQMNWPPG
        VFEDDKVLAFRDI+PQAPTHILIIPKVKDGLSGLSKAEERHTEILG LLY AKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLLGGRQMNWPPG
Subjt:  VFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLLGGRQMNWPPG

A0A6J1GGA0 14 kDa zinc-binding protein-like3.60e-8873.6Show/hide
Query:  MVNEDIKLRTRLSVLSSHFSHSLSISMASSEQEAALAAVPSDSPTIAWSVCSKQKANGSMSTVRFKDVSRNQILNSYKNLDELALIGFDKIINKEIPSTV
        MVN DIK  TRLSVLSSH + SLSISMASSE+EAALAAVPSDSPTI                                         FDKIINKEIPSTV
Subjt:  MVNEDIKLRTRLSVLSSHFSHSLSISMASSEQEAALAAVPSDSPTIAWSVCSKQKANGSMSTVRFKDVSRNQILNSYKNLDELALIGFDKIINKEIPSTV

Query:  VFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLLGGRQMNWPPG
        VFEDDKVLAFRDI+PQAPTHILIIPKVKDGLSGLSKAEERH EILGHLLYTAKLIA+QEGLDDGFR+VINDGPSGCQSVYHLHVHLLGGRQMNWPPG
Subjt:  VFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLLGGRQMNWPPG

A0A6J1I2W3 14 kDa zinc-binding protein-like isoform X23.31e-9173.89Show/hide
Query:  KEIRAVMVNEDIKLRTRLSVLSSHFSHSLSISMASSEQEAALAAVPSDSPTIAWSVCSKQKANGSMSTVRFKDVSRNQILNSYKNLDELALIGFDKIINK
        ++IRAVMVN DIK  TRLSVLSSH + SLSISMASSE+EAALAAVPSDSPTI                                         FDKIINK
Subjt:  KEIRAVMVNEDIKLRTRLSVLSSHFSHSLSISMASSEQEAALAAVPSDSPTIAWSVCSKQKANGSMSTVRFKDVSRNQILNSYKNLDELALIGFDKIINK

Query:  EIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLLGGRQMNW
        EIPSTVVFEDDKVLAFRDI+PQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIA+QEGLDDGFR+VINDGPSGCQSVYHLHVHLLGGRQMNW
Subjt:  EIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLLGGRQMNW

Query:  PPG
        PPG
Subjt:  PPG

SwissProt top hitse value%identityAlignment
P32084 Uncharacterized HIT-like protein Synpcc7942_13907.3e-3560.91Show/hide
Query:  FDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLL
        F KII +EIP+ +V+EDD  LAFRD+APQAP HIL+IP  K  ++ L +A   H  +LGHLL T K IA QEGL +G+R VIN GP+G Q+VYHLH+HLL
Subjt:  FDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLL

Query:  GGRQMNWPPG
        GGR + WPPG
Subjt:  GGRQMNWPPG

P42855 14 kDa zinc-binding protein (Fragment)1.1e-5182.73Show/hide
Query:  FDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLL
        F KII+KEIPSTVV+EDDKVLAFRDI PQ P HIL+IPKV+DGL+GL KAEERH +ILG LLYTAKL+AKQEGLD+GFR+VINDGP GCQSVYH+HVHL+
Subjt:  FDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLL

Query:  GGRQMNWPPG
        GGRQMNWPPG
Subjt:  GGRQMNWPPG

P42856 14 kDa zinc-binding protein5.4e-5482.5Show/hide
Query:  KNLDELALIGFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQ
        + LD+   I FDKII KEIPSTVV+ED+KVLAFRDI PQAPTHILIIPKVKDGL+GL+KAEERH EILG+LLY AK++AKQEGL+DG+RVVINDGPSGCQ
Subjt:  KNLDELALIGFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQ

Query:  SVYHLHVHLLGGRQMNWPPG
        SVYH+HVHLLGGRQMNWPPG
Subjt:  SVYHLHVHLLGGRQMNWPPG

Q8GUN2 Adenylylsulfatase HINT11.8e-5762.03Show/hide
Query:  RLSVLSSHFSHSLSISMASSEQEAALAAVPSDSPTIAWSVCSKQKANGSMSTVRFKDVSRNQILNSYKNLDELALIGFDKIINKEIPSTVVFEDDKVLAF
        R+S+LSSHFS + ++   +SE+EAALAA PSDSPTI                                         FDKII+KEIPSTVVFEDDKVLAF
Subjt:  RLSVLSSHFSHSLSISMASSEQEAALAAVPSDSPTIAWSVCSKQKANGSMSTVRFKDVSRNQILNSYKNLDELALIGFDKIINKEIPSTVVFEDDKVLAF

Query:  RDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLLGGRQMNWPPG
        RDI PQ P HIL+IPKV+DGL+GLSKAEERH +ILG LLYTAKL+AKQEGL +GFR+VINDGP GCQSVYH+HVHL+GGRQMNWPPG
Subjt:  RDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLLGGRQMNWPPG

Q8SQ21 Histidine triad nucleotide-binding protein 2, mitochondrial1.8e-3356.36Show/hide
Query:  FDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLL
        F +I+++ +P+ +++ED + LAFRD+APQAP H L+IP  K  +  +S+AEE   ++LGHLL  AK  AK EGL DG+R+VINDG  G QSVYHLH+H+L
Subjt:  FDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLL

Query:  GGRQMNWPPG
        GGRQ+ WPPG
Subjt:  GGRQMNWPPG

Arabidopsis top hitse value%identityAlignment
AT1G31160.1 HISTIDINE TRIAD NUCLEOTIDE-BINDING 28.9e-4468.18Show/hide
Query:  FDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLL
        FDKII KEIPS +V+ED+ VLAFRDI PQAP H+L+IPK++DGL+ L KAE RH E+LG LL+ +K++A++EG+ DGFRVVIN+G   CQSVYHLH+H+L
Subjt:  FDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLL

Query:  GGRQMNWPPG
        GGRQM WPPG
Subjt:  GGRQMNWPPG

AT3G56490.1 HIS triad family protein 31.3e-5862.03Show/hide
Query:  RLSVLSSHFSHSLSISMASSEQEAALAAVPSDSPTIAWSVCSKQKANGSMSTVRFKDVSRNQILNSYKNLDELALIGFDKIINKEIPSTVVFEDDKVLAF
        R+S+LSSHFS + ++   +SE+EAALAA PSDSPTI                                         FDKII+KEIPSTVVFEDDKVLAF
Subjt:  RLSVLSSHFSHSLSISMASSEQEAALAAVPSDSPTIAWSVCSKQKANGSMSTVRFKDVSRNQILNSYKNLDELALIGFDKIINKEIPSTVVFEDDKVLAF

Query:  RDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLLGGRQMNWPPG
        RDI PQ P HIL+IPKV+DGL+GLSKAEERH +ILG LLYTAKL+AKQEGL +GFR+VINDGP GCQSVYH+HVHL+GGRQMNWPPG
Subjt:  RDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLLGGRQMNWPPG

AT4G16566.1 histidine triad nucleotide-binding 47.4e-0628.28Show/hide
Query:  FDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVK-DGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVH
        F +I+     + ++  D+KV+AF+DI P A  H L+IPK     ++ L + +E ++ ++ H+L   + + +++      R   +  P    SV HLH+H
Subjt:  FDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVK-DGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAAAAGAGATTAGGGCTGTAATGGTCAACGAAGATATAAAACTCAGGACTCGACTCTCAGTTCTGAGCTCTCACTTTTCCCACTCTCTTTCTATTTCCATGGCTTC
TTCCGAGCAGGAAGCGGCTCTTGCAGCCGTTCCCTCCGATTCCCCCACCATTGCTTGGAGCGTATGCTCAAAGCAGAAGGCTAATGGTTCTATGAGTACGGTGAGATTTA
AAGATGTTTCCAGAAATCAGATTTTGAACTCATACAAGAATCTTGACGAGCTAGCTTTAATTGGATTTGACAAAATCATTAACAAGGAAATTCCATCTACGGTGGTTTTT
GAGGATGACAAGGTCCTTGCTTTCAGGGACATAGCACCCCAAGCTCCTACACATATTTTGATCATTCCAAAAGTTAAGGATGGATTATCTGGATTATCTAAGGCTGAGGA
GAGGCACACGGAGATTCTCGGCCACCTTCTTTACACTGCCAAGCTCATTGCCAAACAAGAAGGACTGGACGATGGCTTTAGAGTCGTAATTAACGACGGACCAAGTGGAT
GTCAATCGGTTTATCATCTTCATGTTCACCTTTTGGGGGGACGACAAATGAATTGGCCCCCAGGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAAAAAAGAGATTAGGGCTGTAATGGTCAACGAAGATATAAAACTCAGGACTCGACTCTCAGTTCTGAGCTCTCACTTTTCCCACTCTCTTTCTATTTCCATGGCTTC
TTCCGAGCAGGAAGCGGCTCTTGCAGCCGTTCCCTCCGATTCCCCCACCATTGCTTGGAGCGTATGCTCAAAGCAGAAGGCTAATGGTTCTATGAGTACGGTGAGATTTA
AAGATGTTTCCAGAAATCAGATTTTGAACTCATACAAGAATCTTGACGAGCTAGCTTTAATTGGATTTGACAAAATCATTAACAAGGAAATTCCATCTACGGTGGTTTTT
GAGGATGACAAGGTCCTTGCTTTCAGGGACATAGCACCCCAAGCTCCTACACATATTTTGATCATTCCAAAAGTTAAGGATGGATTATCTGGATTATCTAAGGCTGAGGA
GAGGCACACGGAGATTCTCGGCCACCTTCTTTACACTGCCAAGCTCATTGCCAAACAAGAAGGACTGGACGATGGCTTTAGAGTCGTAATTAACGACGGACCAAGTGGAT
GTCAATCGGTTTATCATCTTCATGTTCACCTTTTGGGGGGACGACAAATGAATTGGCCCCCAGGTTAA
Protein sequenceShow/hide protein sequence
MKKEIRAVMVNEDIKLRTRLSVLSSHFSHSLSISMASSEQEAALAAVPSDSPTIAWSVCSKQKANGSMSTVRFKDVSRNQILNSYKNLDELALIGFDKIINKEIPSTVVF
EDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLLGGRQMNWPPG