; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg08536 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg08536
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionZinc finger matrin-type protein 1, putative isoform 1
Genome locationCarg_Chr02:5102512..5104908
RNA-Seq ExpressionCarg08536
SyntenyCarg08536
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6605602.1 hypothetical protein SDJN03_02919, partial [Cucurbita argyrosperma subsp. sororia]2.7e-14099.24Show/hide
Query:  MITLASAYLSSSPSNFSSLNLLRLTKPPFTFSTSLSNLKPLNPSHKSASNQRTTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD
        MITLASAYLSSSPSNFSSLNLLRLTKPPFTFSTSLSNLKPLNPSHKSASNQR TTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD
Subjt:  MITLASAYLSSSPSNFSSLNLLRLTKPPFTFSTSLSNLKPLNPSHKSASNQRTTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD

Query:  STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKTSNQISRTK
        STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKTSNQISRTK
Subjt:  STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKTSNQISRTK

Query:  IGRGHLKGSQGPTKKSGKKRDMKLPSAEEQLRGEIKGWGDYKETLDHEQYNEEWDDEQRRKR
        IGRGHLKGSQGPTKKSGKKRDMKLPSAEEQLR EIKGWGDYKETLDHEQYNEEWDDEQRRKR
Subjt:  IGRGHLKGSQGPTKKSGKKRDMKLPSAEEQLRGEIKGWGDYKETLDHEQYNEEWDDEQRRKR

KAG7035513.1 hypothetical protein SDJN02_02309 [Cucurbita argyrosperma subsp. argyrosperma]8.5e-142100Show/hide
Query:  MITLASAYLSSSPSNFSSLNLLRLTKPPFTFSTSLSNLKPLNPSHKSASNQRTTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD
        MITLASAYLSSSPSNFSSLNLLRLTKPPFTFSTSLSNLKPLNPSHKSASNQRTTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD
Subjt:  MITLASAYLSSSPSNFSSLNLLRLTKPPFTFSTSLSNLKPLNPSHKSASNQRTTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD

Query:  STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKTSNQISRTK
        STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKTSNQISRTK
Subjt:  STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKTSNQISRTK

Query:  IGRGHLKGSQGPTKKSGKKRDMKLPSAEEQLRGEIKGWGDYKETLDHEQYNEEWDDEQRRKR
        IGRGHLKGSQGPTKKSGKKRDMKLPSAEEQLRGEIKGWGDYKETLDHEQYNEEWDDEQRRKR
Subjt:  IGRGHLKGSQGPTKKSGKKRDMKLPSAEEQLRGEIKGWGDYKETLDHEQYNEEWDDEQRRKR

XP_022958724.1 uncharacterized protein LOC111459865 isoform X1 [Cucurbita moschata]1.5e-13898.47Show/hide
Query:  MITLASAYLSSSPSNFSSLNLLRLTKPPFTFSTSLSNLKPLNPSHKSASNQRTTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD
        MITLASAYLSSSPSNFSSLNLLRLTKPPFTFSTSLSNLKPLNPSHKSASNQR TTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD
Subjt:  MITLASAYLSSSPSNFSSLNLLRLTKPPFTFSTSLSNLKPLNPSHKSASNQRTTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD

Query:  STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKTSNQISRTK
        STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKTSNQISRTK
Subjt:  STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKTSNQISRTK

Query:  IGRGHLKGSQGPTKKSGKKRDMKLPSAEEQLRGEIKGWGDYKETLDHEQYNEEWDDEQRRKR
        I RGHLKGSQGPTKKSGKKRDMKLPSAEEQLR EIKGWGDYKETLDHEQ NEEWDDEQRRKR
Subjt:  IGRGHLKGSQGPTKKSGKKRDMKLPSAEEQLRGEIKGWGDYKETLDHEQYNEEWDDEQRRKR

XP_022958725.1 uncharacterized protein LOC111459865 isoform X2 [Cucurbita moschata]2.2e-13798.47Show/hide
Query:  MITLASAYLSSSPSNFSSLNLLRLTKPPFTFSTSLSNLKPLNPSHKSASNQRTTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD
        MITLASAYLSSSPSNFSSLNLLRLTKPPFTFSTSLSNLKPLNPSHKSASNQR TTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD
Subjt:  MITLASAYLSSSPSNFSSLNLLRLTKPPFTFSTSLSNLKPLNPSHKSASNQRTTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD

Query:  STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKTSNQISRTK
        STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKTSNQISRTK
Subjt:  STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKTSNQISRTK

Query:  IGRGHLKGSQGPTKKSGKKRDMKLPSAEEQLRGEIKGWGDYKETLDHEQYNEEWDDEQRRKR
        I RGHLKGSQGPTKKSGKKRDMKLPSAEEQLR EIKGWGDYKETLDHEQ NEEWDDEQRRKR
Subjt:  IGRGHLKGSQGPTKKSGKKRDMKLPSAEEQLRGEIKGWGDYKETLDHEQYNEEWDDEQRRKR

XP_023534481.1 uncharacterized protein LOC111796027 isoform X1 [Cucurbita pepo subsp. pepo]1.3e-13797.33Show/hide
Query:  MITLASAYLSSSPSNFSSLNLLRLTKPPFTFSTSLSNLKPLNPSHKSASNQRTTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD
        MITLASAYLSSSPSNFSSLNLLRLTKPPFTFSTSLSNLKPLNPSHKSASNQ+ TTRNGICRAELGNDAPFAIAIGACIL+SLVLPPAGGGSDDDSDAVMD
Subjt:  MITLASAYLSSSPSNFSSLNLLRLTKPPFTFSTSLSNLKPLNPSHKSASNQRTTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD

Query:  STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKTSNQISRTK
        STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGK SNQISRTK
Subjt:  STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKTSNQISRTK

Query:  IGRGHLKGSQGPTKKSGKKRDMKLPSAEEQLRGEIKGWGDYKETLDHEQYNEEWDDEQRRKR
        IGRGHLKGSQGPTKKSGKKRDMKLPSAEEQLR EI+GWGDYKETLDHEQ NEEWDDEQRRKR
Subjt:  IGRGHLKGSQGPTKKSGKKRDMKLPSAEEQLRGEIKGWGDYKETLDHEQYNEEWDDEQRRKR

TrEMBL top hitse value%identityAlignment
A0A6J1BQE6 uncharacterized protein LOC1110044991.7e-10376.63Show/hide
Query:  MITLASAYLSSSPSNFSSLNLLRLTKPPFTFSTSLSNLKPLNPSHKSASNQRTTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD
        MI+LA A LSSSPSN SSL  LRL +PP TFSTSLSNLK LNP  K+AS+Q+    NG+CRA+LGND PFA+AIGACILSS V P AGGGSDD+SDAV+D
Subjt:  MITLASAYLSSSPSNFSSLNLLRLTKPPFTFSTSLSNLKPLNPSHKSASNQRTTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD

Query:  STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKTSNQISRTK
        STD R AVMGIISFIPYFNWLSWVFAWLDSG+R YAVYA+VYL PYLRSNLSLSP+ESWLPI SIL+CI HIQ+E SI+NGDIQPFQIFGKTS +IS T 
Subjt:  STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKTSNQISRTK

Query:  IGRGHLKGSQGPTKKSGKKRDMKLPSAEEQLRGEIKGWGDYKETLDHEQYNEEWDDEQRRK
         GR H KGSQGP ++SG+K DMKLPS +EQLR EI+ WGD KETLDHEQ N EWDDEQRRK
Subjt:  IGRGHLKGSQGPTKKSGKKRDMKLPSAEEQLRGEIKGWGDYKETLDHEQYNEEWDDEQRRK

A0A6J1H4A9 uncharacterized protein LOC111459865 isoform X21.0e-13798.47Show/hide
Query:  MITLASAYLSSSPSNFSSLNLLRLTKPPFTFSTSLSNLKPLNPSHKSASNQRTTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD
        MITLASAYLSSSPSNFSSLNLLRLTKPPFTFSTSLSNLKPLNPSHKSASNQR TTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD
Subjt:  MITLASAYLSSSPSNFSSLNLLRLTKPPFTFSTSLSNLKPLNPSHKSASNQRTTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD

Query:  STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKTSNQISRTK
        STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKTSNQISRTK
Subjt:  STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKTSNQISRTK

Query:  IGRGHLKGSQGPTKKSGKKRDMKLPSAEEQLRGEIKGWGDYKETLDHEQYNEEWDDEQRRKR
        I RGHLKGSQGPTKKSGKKRDMKLPSAEEQLR EIKGWGDYKETLDHEQ NEEWDDEQRRKR
Subjt:  IGRGHLKGSQGPTKKSGKKRDMKLPSAEEQLRGEIKGWGDYKETLDHEQYNEEWDDEQRRKR

A0A6J1H5Y1 uncharacterized protein LOC111459865 isoform X17.2e-13998.47Show/hide
Query:  MITLASAYLSSSPSNFSSLNLLRLTKPPFTFSTSLSNLKPLNPSHKSASNQRTTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD
        MITLASAYLSSSPSNFSSLNLLRLTKPPFTFSTSLSNLKPLNPSHKSASNQR TTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD
Subjt:  MITLASAYLSSSPSNFSSLNLLRLTKPPFTFSTSLSNLKPLNPSHKSASNQRTTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD

Query:  STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKTSNQISRTK
        STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKTSNQISRTK
Subjt:  STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKTSNQISRTK

Query:  IGRGHLKGSQGPTKKSGKKRDMKLPSAEEQLRGEIKGWGDYKETLDHEQYNEEWDDEQRRKR
        I RGHLKGSQGPTKKSGKKRDMKLPSAEEQLR EIKGWGDYKETLDHEQ NEEWDDEQRRKR
Subjt:  IGRGHLKGSQGPTKKSGKKRDMKLPSAEEQLRGEIKGWGDYKETLDHEQYNEEWDDEQRRKR

A0A6J1K887 uncharacterized protein LOC111491538 isoform X21.8e-13495.8Show/hide
Query:  MITLASAYLSSSPSNFSSLNLLRLTKPPFTFSTSLSNLKPLNPSHKSASNQRTTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD
        M+TLASAYLSSSPSNFSSLNLLRL+KPPFTFSTSLSNLKPLNPSHKSASNQR TTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD
Subjt:  MITLASAYLSSSPSNFSSLNLLRLTKPPFTFSTSLSNLKPLNPSHKSASNQRTTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD

Query:  STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKTSNQISRTK
        STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGK SNQIS T+
Subjt:  STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKTSNQISRTK

Query:  IGRGHLKGSQGPTKKSGKKRDMKLPSAEEQLRGEIKGWGDYKETLDHEQYNEEWDDEQRRKR
        IGRGHLKG +GPTKKSGKKRDMKLPSAEEQLR EI+GWGDYKETLDHEQ NEEWDDEQRRKR
Subjt:  IGRGHLKGSQGPTKKSGKKRDMKLPSAEEQLRGEIKGWGDYKETLDHEQYNEEWDDEQRRKR

A0A6J1KAA9 uncharacterized protein LOC111491538 isoform X11.3e-13595.8Show/hide
Query:  MITLASAYLSSSPSNFSSLNLLRLTKPPFTFSTSLSNLKPLNPSHKSASNQRTTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD
        M+TLASAYLSSSPSNFSSLNLLRL+KPPFTFSTSLSNLKPLNPSHKSASNQR TTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD
Subjt:  MITLASAYLSSSPSNFSSLNLLRLTKPPFTFSTSLSNLKPLNPSHKSASNQRTTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD

Query:  STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKTSNQISRTK
        STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGK SNQIS T+
Subjt:  STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKTSNQISRTK

Query:  IGRGHLKGSQGPTKKSGKKRDMKLPSAEEQLRGEIKGWGDYKETLDHEQYNEEWDDEQRRKR
        IGRGHLKG +GPTKKSGKKRDMKLPSAEEQLR EI+GWGDYKETLDHEQ NEEWDDEQRRKR
Subjt:  IGRGHLKGSQGPTKKSGKKRDMKLPSAEEQLRGEIKGWGDYKETLDHEQYNEEWDDEQRRKR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G41960.1 unknown protein2.7e-4550.96Show/hide
Query:  LSSSPSNFSSLNLLRLTKPPFTFSTSLSNLKPLNPSHKSASNQRTTTR--NGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSD---AVMDSTD
        LSSS S ++   LL         S+S S+  PL  ++   + ++   +    ICRAE   DAP   AIGACILSS V P A   +D++ +   + + STD
Subjt:  LSSSPSNFSSLNLLRLTKPPFTFSTSLSNLKPLNPSHKSASNQRTTTR--NGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSD---AVMDSTD

Query:  ARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKTSNQ--ISRTKI
         RLA MGIISFIPYFNWLSWVFAWLD+GK RYAVYA+VYL PYL SNLS+SP+ESWLPI SI++ I H+Q+EASI NGD++    F  TS+    S+ +I
Subjt:  ARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKTSNQ--ISRTKI

Query:  G-RGHLKG
          + H KG
Subjt:  G-RGHLKG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATCACTCTGGCTTCTGCTTATTTATCATCATCCCCTTCCAATTTCTCTTCTCTCAACCTTCTTCGCCTCACCAAACCCCCTTTCACCTTCTCAACTTCACTCTCCAA
TCTCAAACCCTTAAATCCCTCCCACAAATCAGCTTCTAATCAGAGGACGACGACCCGAAATGGGATTTGTCGGGCGGAATTGGGAAACGATGCGCCTTTCGCTATTGCGA
TCGGGGCTTGCATTCTTAGTTCTCTTGTTCTTCCACCAGCTGGCGGTGGTTCCGATGATGATAGCGATGCCGTTATGGATTCCACTGATGCCAGGCTCGCTGTCATGGGC
ATCATTAGCTTTATCCCCTACTTCAACTGGCTGAGTTGGGTTTTTGCGTGGCTTGATTCTGGGAAAAGACGTTATGCTGTGTATGCTATCGTGTATTTGGCTCCTTATCT
AAGGTCAAATTTATCGTTGTCACCCGATGAGAGTTGGCTTCCTATTGTCAGTATACTTATCTGCATAGCTCACATTCAGGTTGAAGCAAGCATTAAAAATGGAGATATTC
AACCCTTCCAAATATTCGGTAAGACGTCCAATCAAATTTCTCGGACGAAGATAGGGAGAGGCCATTTGAAGGGGTCCCAAGGACCAACTAAAAAGAGTGGCAAGAAAAGG
GATATGAAGCTTCCATCTGCTGAAGAACAATTGAGAGGTGAGATTAAAGGATGGGGAGATTATAAAGAGACATTAGATCATGAACAATACAATGAAGAATGGGATGACGA
ACAGAGGAGAAAACGTTAG
mRNA sequenceShow/hide mRNA sequence
TGCCAAGAAGTGAAGAAGAATTCCATAGGGGAGTGGCTCTTCGTTTTCCCGTTCAGAATTTTGGTTCCACAATTCTGATTCTCACTTTCTTCCGATCATCATTTCACTGA
TTCACCACAATTACAGCGCCAATGATCACTCTGGCTTCTGCTTATTTATCATCATCCCCTTCCAATTTCTCTTCTCTCAACCTTCTTCGCCTCACCAAACCCCCTTTCAC
CTTCTCAACTTCACTCTCCAATCTCAAACCCTTAAATCCCTCCCACAAATCAGCTTCTAATCAGAGGACGACGACCCGAAATGGGATTTGTCGGGCGGAATTGGGAAACG
ATGCGCCTTTCGCTATTGCGATCGGGGCTTGCATTCTTAGTTCTCTTGTTCTTCCACCAGCTGGCGGTGGTTCCGATGATGATAGCGATGCCGTTATGGATTCCACTGAT
GCCAGGCTCGCTGTCATGGGCATCATTAGCTTTATCCCCTACTTCAACTGGCTGAGTTGGGTTTTTGCGTGGCTTGATTCTGGGAAAAGACGTTATGCTGTGTATGCTAT
CGTGTATTTGGCTCCTTATCTAAGGTCAAATTTATCGTTGTCACCCGATGAGAGTTGGCTTCCTATTGTCAGTATACTTATCTGCATAGCTCACATTCAGGTTGAAGCAA
GCATTAAAAATGGAGATATTCAACCCTTCCAAATATTCGGTAAGACGTCCAATCAAATTTCTCGGACGAAGATAGGGAGAGGCCATTTGAAGGGGTCCCAAGGACCAACT
AAAAAGAGTGGCAAGAAAAGGGATATGAAGCTTCCATCTGCTGAAGAACAATTGAGAGGTGAGATTAAAGGATGGGGAGATTATAAAGAGACATTAGATCATGAACAATA
CAATGAAGAATGGGATGACGAACAGAGGAGAAAACGTTAGGACATAGTTCTTTCTTGAAGTTGAATTATACAGCATGATAGTTCTAAACATGTTTATTACATACAGAATG
AAGTCACTTGCAACCATTTCCATGGAGCTTACATGTTCTTACAACTGCTGCAATTGTTCTGCAAAACTTAGCCTTCTGTTCTTATTCTCACTTAGTTTACTCTTCTTTTG
CCGTTCGATATTGAAATTTCTGTAGCAAACATCACAAGGAACTATAAATACAGGATTGAGTATGGTAGTTTAATATTTGTATCAGAAACGTCAAATTTCTGCAATGTCAT
ACAAGGAGTATAGATTCGTGGTAGATGCTGTCGATTTAGGTATAGCCTAAATGGTTAGGTCATCGTTTTTCGTTGGTTTTAGATTC
Protein sequenceShow/hide protein sequence
MITLASAYLSSSPSNFSSLNLLRLTKPPFTFSTSLSNLKPLNPSHKSASNQRTTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMDSTDARLAVMG
IISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKTSNQISRTKIGRGHLKGSQGPTKKSGKKR
DMKLPSAEEQLRGEIKGWGDYKETLDHEQYNEEWDDEQRRKR