; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh02G009480 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh02G009480
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionZinc finger matrin-type protein 1, putative isoform 1
Genome locationCmo_Chr02:5794207..5796808
RNA-Seq ExpressionCmoCh02G009480
SyntenyCmoCh02G009480
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6605602.1 hypothetical protein SDJN03_02919, partial [Cucurbita argyrosperma subsp. sororia]6.1e-14099.24Show/hide
Query:  MITLASAYLSSSPSNFSSLNLLRLTKPPFTFSTSLSNLKPLNPSHKSASNQRRTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD
        MITLASAYLSSSPSNFSSLNLLRLTKPPFTFSTSLSNLKPLNPSHKSASNQRRTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD
Subjt:  MITLASAYLSSSPSNFSSLNLLRLTKPPFTFSTSLSNLKPLNPSHKSASNQRRTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD

Query:  STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKTSNQISRTK
        STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKTSNQISRTK
Subjt:  STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKTSNQISRTK

Query:  IARGHLKGSQGPTKKSGKKRDMKLPSAEEQLRDEIKGWGDYKETLDHEQSNEEWDDEQRRKR
        I RGHLKGSQGPTKKSGKKRDMKLPSAEEQLRDEIKGWGDYKETLDHEQ NEEWDDEQRRKR
Subjt:  IARGHLKGSQGPTKKSGKKRDMKLPSAEEQLRDEIKGWGDYKETLDHEQSNEEWDDEQRRKR

KAG7035513.1 hypothetical protein SDJN02_02309 [Cucurbita argyrosperma subsp. argyrosperma]2.0e-13898.47Show/hide
Query:  MITLASAYLSSSPSNFSSLNLLRLTKPPFTFSTSLSNLKPLNPSHKSASNQRRTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD
        MITLASAYLSSSPSNFSSLNLLRLTKPPFTFSTSLSNLKPLNPSHKSASNQR TTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD
Subjt:  MITLASAYLSSSPSNFSSLNLLRLTKPPFTFSTSLSNLKPLNPSHKSASNQRRTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD

Query:  STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKTSNQISRTK
        STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKTSNQISRTK
Subjt:  STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKTSNQISRTK

Query:  IARGHLKGSQGPTKKSGKKRDMKLPSAEEQLRDEIKGWGDYKETLDHEQSNEEWDDEQRRKR
        I RGHLKGSQGPTKKSGKKRDMKLPSAEEQLR EIKGWGDYKETLDHEQ NEEWDDEQRRKR
Subjt:  IARGHLKGSQGPTKKSGKKRDMKLPSAEEQLRDEIKGWGDYKETLDHEQSNEEWDDEQRRKR

XP_022958724.1 uncharacterized protein LOC111459865 isoform X1 [Cucurbita moschata]4.2e-141100Show/hide
Query:  MITLASAYLSSSPSNFSSLNLLRLTKPPFTFSTSLSNLKPLNPSHKSASNQRRTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD
        MITLASAYLSSSPSNFSSLNLLRLTKPPFTFSTSLSNLKPLNPSHKSASNQRRTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD
Subjt:  MITLASAYLSSSPSNFSSLNLLRLTKPPFTFSTSLSNLKPLNPSHKSASNQRRTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD

Query:  STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKTSNQISRTK
        STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKTSNQISRTK
Subjt:  STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKTSNQISRTK

Query:  IARGHLKGSQGPTKKSGKKRDMKLPSAEEQLRDEIKGWGDYKETLDHEQSNEEWDDEQRRKR
        IARGHLKGSQGPTKKSGKKRDMKLPSAEEQLRDEIKGWGDYKETLDHEQSNEEWDDEQRRKR
Subjt:  IARGHLKGSQGPTKKSGKKRDMKLPSAEEQLRDEIKGWGDYKETLDHEQSNEEWDDEQRRKR

XP_022958725.1 uncharacterized protein LOC111459865 isoform X2 [Cucurbita moschata]3.9e-13999.62Show/hide
Query:  MITLASAYLSSSPSNFSSLNLLRLTKPPFTFSTSLSNLKPLNPSHKSASNQRRTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD
        MITLASAYLSSSPSNFSSLNLLRLTKPPFTFSTSLSNLKPLNPSHKSASNQ RTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD
Subjt:  MITLASAYLSSSPSNFSSLNLLRLTKPPFTFSTSLSNLKPLNPSHKSASNQRRTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD

Query:  STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKTSNQISRTK
        STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKTSNQISRTK
Subjt:  STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKTSNQISRTK

Query:  IARGHLKGSQGPTKKSGKKRDMKLPSAEEQLRDEIKGWGDYKETLDHEQSNEEWDDEQRRKR
        IARGHLKGSQGPTKKSGKKRDMKLPSAEEQLRDEIKGWGDYKETLDHEQSNEEWDDEQRRKR
Subjt:  IARGHLKGSQGPTKKSGKKRDMKLPSAEEQLRDEIKGWGDYKETLDHEQSNEEWDDEQRRKR

XP_023534481.1 uncharacterized protein LOC111796027 isoform X1 [Cucurbita pepo subsp. pepo]5.1e-13998.09Show/hide
Query:  MITLASAYLSSSPSNFSSLNLLRLTKPPFTFSTSLSNLKPLNPSHKSASNQRRTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD
        MITLASAYLSSSPSNFSSLNLLRLTKPPFTFSTSLSNLKPLNPSHKSASNQ+RTTRNGICRAELGNDAPFAIAIGACIL+SLVLPPAGGGSDDDSDAVMD
Subjt:  MITLASAYLSSSPSNFSSLNLLRLTKPPFTFSTSLSNLKPLNPSHKSASNQRRTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD

Query:  STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKTSNQISRTK
        STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGK SNQISRTK
Subjt:  STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKTSNQISRTK

Query:  IARGHLKGSQGPTKKSGKKRDMKLPSAEEQLRDEIKGWGDYKETLDHEQSNEEWDDEQRRKR
        I RGHLKGSQGPTKKSGKKRDMKLPSAEEQLRDEI+GWGDYKETLDHEQSNEEWDDEQRRKR
Subjt:  IARGHLKGSQGPTKKSGKKRDMKLPSAEEQLRDEIKGWGDYKETLDHEQSNEEWDDEQRRKR

TrEMBL top hitse value%identityAlignment
A0A6J1BQE6 uncharacterized protein LOC1110044998.9e-10577.39Show/hide
Query:  MITLASAYLSSSPSNFSSLNLLRLTKPPFTFSTSLSNLKPLNPSHKSASNQRRTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD
        MI+LA A LSSSPSN SSL  LRL +PP TFSTSLSNLK LNP  K+AS+Q+R   NG+CRA+LGND PFA+AIGACILSS V P AGGGSDD+SDAV+D
Subjt:  MITLASAYLSSSPSNFSSLNLLRLTKPPFTFSTSLSNLKPLNPSHKSASNQRRTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD

Query:  STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKTSNQISRTK
        STD R AVMGIISFIPYFNWLSWVFAWLDSG+R YAVYA+VYL PYLRSNLSLSP+ESWLPI SIL+CI HIQ+E SI+NGDIQPFQIFGKTS +IS T 
Subjt:  STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKTSNQISRTK

Query:  IARGHLKGSQGPTKKSGKKRDMKLPSAEEQLRDEIKGWGDYKETLDHEQSNEEWDDEQRRK
          R H KGSQGP ++SG+K DMKLPS +EQLRDEI+ WGD KETLDHEQSN EWDDEQRRK
Subjt:  IARGHLKGSQGPTKKSGKKRDMKLPSAEEQLRDEIKGWGDYKETLDHEQSNEEWDDEQRRK

A0A6J1H4A9 uncharacterized protein LOC111459865 isoform X21.9e-13999.62Show/hide
Query:  MITLASAYLSSSPSNFSSLNLLRLTKPPFTFSTSLSNLKPLNPSHKSASNQRRTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD
        MITLASAYLSSSPSNFSSLNLLRLTKPPFTFSTSLSNLKPLNPSHKSASNQ RTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD
Subjt:  MITLASAYLSSSPSNFSSLNLLRLTKPPFTFSTSLSNLKPLNPSHKSASNQRRTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD

Query:  STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKTSNQISRTK
        STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKTSNQISRTK
Subjt:  STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKTSNQISRTK

Query:  IARGHLKGSQGPTKKSGKKRDMKLPSAEEQLRDEIKGWGDYKETLDHEQSNEEWDDEQRRKR
        IARGHLKGSQGPTKKSGKKRDMKLPSAEEQLRDEIKGWGDYKETLDHEQSNEEWDDEQRRKR
Subjt:  IARGHLKGSQGPTKKSGKKRDMKLPSAEEQLRDEIKGWGDYKETLDHEQSNEEWDDEQRRKR

A0A6J1H5Y1 uncharacterized protein LOC111459865 isoform X12.0e-141100Show/hide
Query:  MITLASAYLSSSPSNFSSLNLLRLTKPPFTFSTSLSNLKPLNPSHKSASNQRRTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD
        MITLASAYLSSSPSNFSSLNLLRLTKPPFTFSTSLSNLKPLNPSHKSASNQRRTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD
Subjt:  MITLASAYLSSSPSNFSSLNLLRLTKPPFTFSTSLSNLKPLNPSHKSASNQRRTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD

Query:  STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKTSNQISRTK
        STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKTSNQISRTK
Subjt:  STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKTSNQISRTK

Query:  IARGHLKGSQGPTKKSGKKRDMKLPSAEEQLRDEIKGWGDYKETLDHEQSNEEWDDEQRRKR
        IARGHLKGSQGPTKKSGKKRDMKLPSAEEQLRDEIKGWGDYKETLDHEQSNEEWDDEQRRKR
Subjt:  IARGHLKGSQGPTKKSGKKRDMKLPSAEEQLRDEIKGWGDYKETLDHEQSNEEWDDEQRRKR

A0A6J1K887 uncharacterized protein LOC111491538 isoform X24.9e-13596.18Show/hide
Query:  MITLASAYLSSSPSNFSSLNLLRLTKPPFTFSTSLSNLKPLNPSHKSASNQRRTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD
        M+TLASAYLSSSPSNFSSLNLLRL+KPPFTFSTSLSNLKPLNPSHKSASNQ RTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD
Subjt:  MITLASAYLSSSPSNFSSLNLLRLTKPPFTFSTSLSNLKPLNPSHKSASNQRRTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD

Query:  STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKTSNQISRTK
        STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGK SNQIS T+
Subjt:  STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKTSNQISRTK

Query:  IARGHLKGSQGPTKKSGKKRDMKLPSAEEQLRDEIKGWGDYKETLDHEQSNEEWDDEQRRKR
        I RGHLKG +GPTKKSGKKRDMKLPSAEEQLRDEI+GWGDYKETLDHEQSNEEWDDEQRRKR
Subjt:  IARGHLKGSQGPTKKSGKKRDMKLPSAEEQLRDEIKGWGDYKETLDHEQSNEEWDDEQRRKR

A0A6J1KAA9 uncharacterized protein LOC111491538 isoform X15.2e-13796.56Show/hide
Query:  MITLASAYLSSSPSNFSSLNLLRLTKPPFTFSTSLSNLKPLNPSHKSASNQRRTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD
        M+TLASAYLSSSPSNFSSLNLLRL+KPPFTFSTSLSNLKPLNPSHKSASNQRRTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD
Subjt:  MITLASAYLSSSPSNFSSLNLLRLTKPPFTFSTSLSNLKPLNPSHKSASNQRRTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMD

Query:  STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKTSNQISRTK
        STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGK SNQIS T+
Subjt:  STDARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKTSNQISRTK

Query:  IARGHLKGSQGPTKKSGKKRDMKLPSAEEQLRDEIKGWGDYKETLDHEQSNEEWDDEQRRKR
        I RGHLKG +GPTKKSGKKRDMKLPSAEEQLRDEI+GWGDYKETLDHEQSNEEWDDEQRRKR
Subjt:  IARGHLKGSQGPTKKSGKKRDMKLPSAEEQLRDEIKGWGDYKETLDHEQSNEEWDDEQRRKR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G41960.1 unknown protein9.3e-4651.96Show/hide
Query:  LSSSPSNFSSLNLLRLTKPPFTFSTSLSNLKPLNPSHKSASNQR--RTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSD---AVMDSTD
        LSSS S ++   LL         S+S S+  PL  ++   + ++  R     ICRAE   DAP   AIGACILSS V P A   +D++ +   + + STD
Subjt:  LSSSPSNFSSLNLLRLTKPPFTFSTSLSNLKPLNPSHKSASNQR--RTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSD---AVMDSTD

Query:  ARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKTSNQISRTKIAR
         RLA MGIISFIPYFNWLSWVFAWLD+GK RYAVYA+VYL PYL SNLS+SP+ESWLPI SI++ I H+Q+EASI NGD++    F  TS+    +K  R
Subjt:  ARLAVMGIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKTSNQISRTKIAR

Query:  GHLK
         H K
Subjt:  GHLK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATCACTCTGGCTTCTGCTTATTTATCATCATCCCCTTCCAATTTCTCTTCTCTCAACCTTCTTCGCCTCACCAAACCCCCTTTCACCTTCTCAACTTCACTCTCCAA
TCTCAAACCCTTAAATCCCTCCCACAAATCAGCTTCTAATCAGAGGAGGACGACCCGAAATGGGATTTGTCGGGCGGAATTGGGAAACGACGCGCCTTTCGCTATTGCGA
TCGGGGCTTGCATTCTTAGTTCTCTTGTTCTTCCACCAGCTGGCGGTGGTTCCGATGATGATAGCGATGCCGTTATGGATTCCACTGATGCTAGGCTCGCTGTCATGGGC
ATCATTAGCTTTATCCCCTACTTCAACTGGCTGAGTTGGGTTTTTGCGTGGCTTGATTCTGGGAAAAGACGTTATGCTGTGTATGCTATCGTGTATTTGGCTCCTTATCT
AAGGTCAAATTTATCGTTGTCACCCGATGAGAGTTGGCTTCCTATTGTCAGTATACTTATCTGCATAGCTCACATTCAGGTTGAAGCAAGCATTAAAAATGGAGATATTC
AACCCTTCCAAATATTCGGTAAGACGTCCAATCAAATTTCTCGGACGAAGATAGCGAGAGGCCATTTGAAGGGGTCCCAAGGACCAACTAAAAAGAGTGGCAAGAAAAGG
GATATGAAGCTTCCATCTGCTGAAGAACAATTGAGAGATGAGATTAAAGGATGGGGAGATTATAAAGAGACATTAGATCATGAACAATCCAATGAAGAATGGGATGACGA
ACAGAGGAGAAAACGTTAG
mRNA sequenceShow/hide mRNA sequence
AAGAAAAAACAAAATCCGTACAGCAATGCCGAAAGAGGTTGGCACGGTTGCCAAGAAGTGAAGAAGAATTCCATAGGGGAGTGGCTCTTCGTTTTCCCGTTCAGAATTTT
GGTTCCACAATTCTGATTCTCACTCCCTTCCGATTATCATTTCACTGATTCACCACAATTACAGCGCCAATGATCACTCTGGCTTCTGCTTATTTATCATCATCCCCTTC
CAATTTCTCTTCTCTCAACCTTCTTCGCCTCACCAAACCCCCTTTCACCTTCTCAACTTCACTCTCCAATCTCAAACCCTTAAATCCCTCCCACAAATCAGCTTCTAATC
AGAGGAGGACGACCCGAAATGGGATTTGTCGGGCGGAATTGGGAAACGACGCGCCTTTCGCTATTGCGATCGGGGCTTGCATTCTTAGTTCTCTTGTTCTTCCACCAGCT
GGCGGTGGTTCCGATGATGATAGCGATGCCGTTATGGATTCCACTGATGCTAGGCTCGCTGTCATGGGCATCATTAGCTTTATCCCCTACTTCAACTGGCTGAGTTGGGT
TTTTGCGTGGCTTGATTCTGGGAAAAGACGTTATGCTGTGTATGCTATCGTGTATTTGGCTCCTTATCTAAGGTCAAATTTATCGTTGTCACCCGATGAGAGTTGGCTTC
CTATTGTCAGTATACTTATCTGCATAGCTCACATTCAGGTTGAAGCAAGCATTAAAAATGGAGATATTCAACCCTTCCAAATATTCGGTAAGACGTCCAATCAAATTTCT
CGGACGAAGATAGCGAGAGGCCATTTGAAGGGGTCCCAAGGACCAACTAAAAAGAGTGGCAAGAAAAGGGATATGAAGCTTCCATCTGCTGAAGAACAATTGAGAGATGA
GATTAAAGGATGGGGAGATTATAAAGAGACATTAGATCATGAACAATCCAATGAAGAATGGGATGACGAACAGAGGAGAAAACGTTAGGACATAGTTCTTTCTCGAAGTT
GAATTATACAGAATGATAGTTCTAAACATGTTTATTACATACAGAATGAAGTCACTTGCAACCATTTCCATGGAGCTTACATATTCTTACAACTGCTGCAATTGTTCTGC
AAAACTTAGCCTTCTGTTCTTATTCTCACTTAGTTTACTCTTCTTTTGCCAGATAACAAGTTTGATATTGAAACTTCTGTAGCAAACATCACAAGGAACTATAAATACAG
GATTGAGTATGGTAGTTTAATATTCGTATCAGAAAGGTCAAATTTCTGCAATGTCATATAAGGAGTATAGATTCGTGGTAGATGCTGTCGATTTAGGTATAGCTTAAATG
GTTAGGTCATTGTTTCTCGTTGGTTTTAGATTCAAATTCTCTCGCGTTTTATGCCGTTTTTAAAAGGTTAGATAGTTTAAAATGGATGATATTAAATTTGTCTTCATAAT
AGGCGTTATTTTTTTTTTAAAAGGTATATTGATGTTATTTTATAAGAAAACAGCATATAAATGATATACAGGAGCAAACGAG
Protein sequenceShow/hide protein sequence
MITLASAYLSSSPSNFSSLNLLRLTKPPFTFSTSLSNLKPLNPSHKSASNQRRTTRNGICRAELGNDAPFAIAIGACILSSLVLPPAGGGSDDDSDAVMDSTDARLAVMG
IISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPDESWLPIVSILICIAHIQVEASIKNGDIQPFQIFGKTSNQISRTKIARGHLKGSQGPTKKSGKKR
DMKLPSAEEQLRDEIKGWGDYKETLDHEQSNEEWDDEQRRKR