; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmaCh02G009380 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene IDCmaCh02G009380
OrganismCucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
DescriptionZinc finger matrin-type protein 1, putative isoform 1
Genome locationCma_Chr02:5573502..5576018
RNA-Seq ExpressionCmaCh02G009380
SyntenyCmaCh02G009380
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6605602.1 hypothetical protein SDJN03_02919, partial [Cucurbita argyrosperma subsp. sororia]4.8e-13796.56Show/hide
Query:  MVTLASAYLSSSPSNFSSLNLLRLSKPPFTFSTSLSNLKPLNPSHKSASNQRRTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD
        M+TLASAYLSSSPSNFSSLNLLRL+KPPFTFSTSLSNLKPLNPSHKSASNQRRTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD
Subjt:  MVTLASAYLSSSPSNFSSLNLLRLSKPPFTFSTSLSNLKPLNPSHKSASNQRRTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD

Query:  STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKASNQISPTE
        STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGK SNQIS T+
Subjt:  STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKASNQISPTE

Query:  IGRGHLKGFKGPTKKSGKKRDMKLPSAEEQLRDEIRGWGDYKETLDHEQSNEEWDDEQRRKR
        IGRGHLKG +GPTKKSGKKRDMKLPSAEEQLRDEI+GWGDYKETLDHEQ NEEWDDEQRRKR
Subjt:  IGRGHLKGFKGPTKKSGKKRDMKLPSAEEQLRDEIRGWGDYKETLDHEQSNEEWDDEQRRKR

XP_022958724.1 uncharacterized protein LOC111459865 isoform X1 [Cucurbita moschata]4.8e-13796.56Show/hide
Query:  MVTLASAYLSSSPSNFSSLNLLRLSKPPFTFSTSLSNLKPLNPSHKSASNQRRTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD
        M+TLASAYLSSSPSNFSSLNLLRL+KPPFTFSTSLSNLKPLNPSHKSASNQRRTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD
Subjt:  MVTLASAYLSSSPSNFSSLNLLRLSKPPFTFSTSLSNLKPLNPSHKSASNQRRTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD

Query:  STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKASNQISPTE
        STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGK SNQIS T+
Subjt:  STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKASNQISPTE

Query:  IGRGHLKGFKGPTKKSGKKRDMKLPSAEEQLRDEIRGWGDYKETLDHEQSNEEWDDEQRRKR
        I RGHLKG +GPTKKSGKKRDMKLPSAEEQLRDEI+GWGDYKETLDHEQSNEEWDDEQRRKR
Subjt:  IGRGHLKGFKGPTKKSGKKRDMKLPSAEEQLRDEIRGWGDYKETLDHEQSNEEWDDEQRRKR

XP_022996258.1 uncharacterized protein LOC111491538 isoform X1 [Cucurbita maxima]6.5e-142100Show/hide
Query:  MVTLASAYLSSSPSNFSSLNLLRLSKPPFTFSTSLSNLKPLNPSHKSASNQRRTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD
        MVTLASAYLSSSPSNFSSLNLLRLSKPPFTFSTSLSNLKPLNPSHKSASNQRRTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD
Subjt:  MVTLASAYLSSSPSNFSSLNLLRLSKPPFTFSTSLSNLKPLNPSHKSASNQRRTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD

Query:  STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKASNQISPTE
        STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKASNQISPTE
Subjt:  STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKASNQISPTE

Query:  IGRGHLKGFKGPTKKSGKKRDMKLPSAEEQLRDEIRGWGDYKETLDHEQSNEEWDDEQRRKR
        IGRGHLKGFKGPTKKSGKKRDMKLPSAEEQLRDEIRGWGDYKETLDHEQSNEEWDDEQRRKR
Subjt:  IGRGHLKGFKGPTKKSGKKRDMKLPSAEEQLRDEIRGWGDYKETLDHEQSNEEWDDEQRRKR

XP_022996259.1 uncharacterized protein LOC111491538 isoform X2 [Cucurbita maxima]4.6e-14099.62Show/hide
Query:  MVTLASAYLSSSPSNFSSLNLLRLSKPPFTFSTSLSNLKPLNPSHKSASNQRRTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD
        MVTLASAYLSSSPSNFSSLNLLRLSKPPFTFSTSLSNLKPLNPSHKSASNQ RTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD
Subjt:  MVTLASAYLSSSPSNFSSLNLLRLSKPPFTFSTSLSNLKPLNPSHKSASNQRRTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD

Query:  STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKASNQISPTE
        STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKASNQISPTE
Subjt:  STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKASNQISPTE

Query:  IGRGHLKGFKGPTKKSGKKRDMKLPSAEEQLRDEIRGWGDYKETLDHEQSNEEWDDEQRRKR
        IGRGHLKGFKGPTKKSGKKRDMKLPSAEEQLRDEIRGWGDYKETLDHEQSNEEWDDEQRRKR
Subjt:  IGRGHLKGFKGPTKKSGKKRDMKLPSAEEQLRDEIRGWGDYKETLDHEQSNEEWDDEQRRKR

XP_023534481.1 uncharacterized protein LOC111796027 isoform X1 [Cucurbita pepo subsp. pepo]7.4e-13896.95Show/hide
Query:  MVTLASAYLSSSPSNFSSLNLLRLSKPPFTFSTSLSNLKPLNPSHKSASNQRRTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD
        M+TLASAYLSSSPSNFSSLNLLRL+KPPFTFSTSLSNLKPLNPSHKSASNQ+RTTRNGICRAELGNDAPFAIAIGACIL+SLVLPPAGGGSDDDSDAVMD
Subjt:  MVTLASAYLSSSPSNFSSLNLLRLSKPPFTFSTSLSNLKPLNPSHKSASNQRRTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD

Query:  STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKASNQISPTE
        STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKASNQIS T+
Subjt:  STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKASNQISPTE

Query:  IGRGHLKGFKGPTKKSGKKRDMKLPSAEEQLRDEIRGWGDYKETLDHEQSNEEWDDEQRRKR
        IGRGHLKG +GPTKKSGKKRDMKLPSAEEQLRDEIRGWGDYKETLDHEQSNEEWDDEQRRKR
Subjt:  IGRGHLKGFKGPTKKSGKKRDMKLPSAEEQLRDEIRGWGDYKETLDHEQSNEEWDDEQRRKR

TrEMBL top hitse value%identityAlignment
A0A6J1BQE6 uncharacterized protein LOC1110044992.0e-10476.63Show/hide
Query:  MVTLASAYLSSSPSNFSSLNLLRLSKPPFTFSTSLSNLKPLNPSHKSASNQRRTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD
        M++LA A LSSSPSN SSL  LRL +PP TFSTSLSNLK LNP  K+AS+Q+R   NG+CRA+LGND PFA+AIGACILSS V P AGGGSDD+SDAV+D
Subjt:  MVTLASAYLSSSPSNFSSLNLLRLSKPPFTFSTSLSNLKPLNPSHKSASNQRRTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD

Query:  STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKASNQISPTE
        STD R AVMGIISFIPYFNWLSWVFAWLDSG+R YAVYA+VYL PYLRSNLSLSP+ESWLPI SIL+CI HIQ+E SI+NGDIQPFQIFGK S +IS T 
Subjt:  STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKASNQISPTE

Query:  IGRGHLKGFKGPTKKSGKKRDMKLPSAEEQLRDEIRGWGDYKETLDHEQSNEEWDDEQRRK
         GR H KG +GP ++SG+K DMKLPS +EQLRDEIR WGD KETLDHEQSN EWDDEQRRK
Subjt:  IGRGHLKGFKGPTKKSGKKRDMKLPSAEEQLRDEIRGWGDYKETLDHEQSNEEWDDEQRRK

A0A6J1H4A9 uncharacterized protein LOC111459865 isoform X21.7e-13596.18Show/hide
Query:  MVTLASAYLSSSPSNFSSLNLLRLSKPPFTFSTSLSNLKPLNPSHKSASNQRRTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD
        M+TLASAYLSSSPSNFSSLNLLRL+KPPFTFSTSLSNLKPLNPSHKSASNQ RTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD
Subjt:  MVTLASAYLSSSPSNFSSLNLLRLSKPPFTFSTSLSNLKPLNPSHKSASNQRRTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD

Query:  STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKASNQISPTE
        STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGK SNQIS T+
Subjt:  STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKASNQISPTE

Query:  IGRGHLKGFKGPTKKSGKKRDMKLPSAEEQLRDEIRGWGDYKETLDHEQSNEEWDDEQRRKR
        I RGHLKG +GPTKKSGKKRDMKLPSAEEQLRDEI+GWGDYKETLDHEQSNEEWDDEQRRKR
Subjt:  IGRGHLKGFKGPTKKSGKKRDMKLPSAEEQLRDEIRGWGDYKETLDHEQSNEEWDDEQRRKR

A0A6J1H5Y1 uncharacterized protein LOC111459865 isoform X12.3e-13796.56Show/hide
Query:  MVTLASAYLSSSPSNFSSLNLLRLSKPPFTFSTSLSNLKPLNPSHKSASNQRRTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD
        M+TLASAYLSSSPSNFSSLNLLRL+KPPFTFSTSLSNLKPLNPSHKSASNQRRTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD
Subjt:  MVTLASAYLSSSPSNFSSLNLLRLSKPPFTFSTSLSNLKPLNPSHKSASNQRRTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD

Query:  STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKASNQISPTE
        STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGK SNQIS T+
Subjt:  STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKASNQISPTE

Query:  IGRGHLKGFKGPTKKSGKKRDMKLPSAEEQLRDEIRGWGDYKETLDHEQSNEEWDDEQRRKR
        I RGHLKG +GPTKKSGKKRDMKLPSAEEQLRDEI+GWGDYKETLDHEQSNEEWDDEQRRKR
Subjt:  IGRGHLKGFKGPTKKSGKKRDMKLPSAEEQLRDEIRGWGDYKETLDHEQSNEEWDDEQRRKR

A0A6J1K887 uncharacterized protein LOC111491538 isoform X22.3e-14099.62Show/hide
Query:  MVTLASAYLSSSPSNFSSLNLLRLSKPPFTFSTSLSNLKPLNPSHKSASNQRRTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD
        MVTLASAYLSSSPSNFSSLNLLRLSKPPFTFSTSLSNLKPLNPSHKSASNQ RTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD
Subjt:  MVTLASAYLSSSPSNFSSLNLLRLSKPPFTFSTSLSNLKPLNPSHKSASNQRRTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD

Query:  STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKASNQISPTE
        STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKASNQISPTE
Subjt:  STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKASNQISPTE

Query:  IGRGHLKGFKGPTKKSGKKRDMKLPSAEEQLRDEIRGWGDYKETLDHEQSNEEWDDEQRRKR
        IGRGHLKGFKGPTKKSGKKRDMKLPSAEEQLRDEIRGWGDYKETLDHEQSNEEWDDEQRRKR
Subjt:  IGRGHLKGFKGPTKKSGKKRDMKLPSAEEQLRDEIRGWGDYKETLDHEQSNEEWDDEQRRKR

A0A6J1KAA9 uncharacterized protein LOC111491538 isoform X13.1e-142100Show/hide
Query:  MVTLASAYLSSSPSNFSSLNLLRLSKPPFTFSTSLSNLKPLNPSHKSASNQRRTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD
        MVTLASAYLSSSPSNFSSLNLLRLSKPPFTFSTSLSNLKPLNPSHKSASNQRRTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD
Subjt:  MVTLASAYLSSSPSNFSSLNLLRLSKPPFTFSTSLSNLKPLNPSHKSASNQRRTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD

Query:  STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKASNQISPTE
        STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKASNQISPTE
Subjt:  STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKASNQISPTE

Query:  IGRGHLKGFKGPTKKSGKKRDMKLPSAEEQLRDEIRGWGDYKETLDHEQSNEEWDDEQRRKR
        IGRGHLKGFKGPTKKSGKKRDMKLPSAEEQLRDEIRGWGDYKETLDHEQSNEEWDDEQRRKR
Subjt:  IGRGHLKGFKGPTKKSGKKRDMKLPSAEEQLRDEIRGWGDYKETLDHEQSNEEWDDEQRRKR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G41960.1 unknown protein1.6e-4550.48Show/hide
Query:  LSSSPSNFSSLNLLRLSKPPFTFSTSLSNLKPLNPSHKSASNQR--RTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSD---AVMDSTD
        LSSS S ++   LL         S+S S+  PL  ++   + ++  R     ICRAE   DAP   AIGACILSS V P A   +D++ +   + + STD
Subjt:  LSSSPSNFSSLNLLRLSKPPFTFSTSLSNLKPLNPSHKSASNQR--RTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSD---AVMDSTD

Query:  ARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKASNQISPTEIGR
         RLA MGIISFIPYFNWLSWVFAWLD+GK RYAVYA+VYL PYL SNLS+SP+ESWLPI SI++ I H+Q+EASI NGD++    F   S+    ++   
Subjt:  ARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKASNQISPTEIGR

Query:  GHLKGFKG
           K FKG
Subjt:  GHLKGFKG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTCACTCTGGCTTCTGCTTATTTATCATCATCCCCTTCCAATTTCTCTTCTCTCAACCTTCTTCGCCTCTCCAAACCCCCTTTCACCTTCTCAACTTCACTCTCCAA
TCTCAAACCCTTAAATCCCTCCCACAAATCAGCTTCTAATCAGAGGAGGACGACCCGAAATGGGATTTGTCGGGCGGAATTAGGGAACGACGCGCCTTTCGCTATTGCGA
TCGGGGCTTGCATTCTCAGTTCTCTTGTTCTTCCACCAGCTGGCGGTGGTTCCGATGATGATAGCGATGCCGTTATGGATTCCACCGATGCTAGGCTCGCTGTCATGGGC
ATCATTAGCTTTATCCCCTACTTCAACTGGCTGAGTTGGGTTTTTGCGTGGCTTGATTCTGGGAAAAGACGTTATGCTGTGTATGCTATCGTGTATTTGGCTCCTTATCT
AAGGTCAAATTTATCGTTGTCACCCGATGAGAGTTGGCTTCCTATTGTCAGTATACTTATCTGCATAGCTCACATTCAGGTTGAAGCAAGCATTAAAAATGGAGACATTC
AACCCTTCCAAATATTCGGTAAGGCGTCCAATCAAATTTCTCCGACGGAGATAGGGAGAGGCCATTTGAAGGGGTTCAAAGGACCAACCAAAAAGAGTGGCAAGAAAAGG
GATATGAAGCTTCCATCTGCTGAAGAACAATTGAGAGATGAGATTAGAGGATGGGGAGATTATAAAGAGACATTAGATCATGAACAATCCAATGAAGAATGGGATGACGA
ACAGAGGAGAAAACGTTAG
mRNA sequenceShow/hide mRNA sequence
CGAAAGAGTTGGCACGGTTGCCAAGAAGTGAAGAAGAATTCCATAGGGGAGTGGCTCTTTCGTTTTCCCGTCGAGAAATTCTGATTCTCACTTTCTTCCGACTATCATTT
CACTGATTCACCACAATTACAGCGCCAATGGTCACTCTGGCTTCTGCTTATTTATCATCATCCCCTTCCAATTTCTCTTCTCTCAACCTTCTTCGCCTCTCCAAACCCCC
TTTCACCTTCTCAACTTCACTCTCCAATCTCAAACCCTTAAATCCCTCCCACAAATCAGCTTCTAATCAGAGGAGGACGACCCGAAATGGGATTTGTCGGGCGGAATTAG
GGAACGACGCGCCTTTCGCTATTGCGATCGGGGCTTGCATTCTCAGTTCTCTTGTTCTTCCACCAGCTGGCGGTGGTTCCGATGATGATAGCGATGCCGTTATGGATTCC
ACCGATGCTAGGCTCGCTGTCATGGGCATCATTAGCTTTATCCCCTACTTCAACTGGCTGAGTTGGGTTTTTGCGTGGCTTGATTCTGGGAAAAGACGTTATGCTGTGTA
TGCTATCGTGTATTTGGCTCCTTATCTAAGGTCAAATTTATCGTTGTCACCCGATGAGAGTTGGCTTCCTATTGTCAGTATACTTATCTGCATAGCTCACATTCAGGTTG
AAGCAAGCATTAAAAATGGAGACATTCAACCCTTCCAAATATTCGGTAAGGCGTCCAATCAAATTTCTCCGACGGAGATAGGGAGAGGCCATTTGAAGGGGTTCAAAGGA
CCAACCAAAAAGAGTGGCAAGAAAAGGGATATGAAGCTTCCATCTGCTGAAGAACAATTGAGAGATGAGATTAGAGGATGGGGAGATTATAAAGAGACATTAGATCATGA
ACAATCCAATGAAGAATGGGATGACGAACAGAGGAGAAAACGTTAGGACATAGTTCTTTCTTGAAGTTGAATTATACAGAATGATAGTTCTAAACATGTTTATTACATAC
AGAATGAAGTCACTTGCAACCATTTCCATGGAGCTTACATATTCTTACAACTGCTGTAATTGTTCTGCAAAACTTAGCCTTCTGTTCTTATTCTCACTTAGTTTACTCTT
CTTTTGCCAGATAACAAGCTTGATATTGAAACTTCTGTAGCAAACATCACAAGGAACTATAAATACAGGATTGAGTATGGTAGTTTAATATTCGTATCAGAAACGTCAAA
TTTCTGCAATGTCATATAAGGAGTATAGATTAGTGGTAGATGCTGTCGATTTAGGTATAGCTTAAATGGTTAGGTCATCGTCTCTCGTTGGTTTTAGATTCAAATTCTCT
CGCGTTTTATGTCTTTTTTTTAAGGTTAGATAGTTTAAAATGGATAGACAATGACTTGAAGTAAATTTATG
Protein sequenceShow/hide protein sequence
MVTLASAYLSSSPSNFSSLNLLRLSKPPFTFSTSLSNLKPLNPSHKSASNQRRTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMDSTDARLAVMG
IISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKASNQISPTEIGRGHLKGFKGPTKKSGKKR
DMKLPSAEEQLRDEIRGWGDYKETLDHEQSNEEWDDEQRRKR