; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0000230 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0000230
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionZinc finger matrin-type protein 1, putative isoform 1
Genome locationchr09:23191272..23193658
RNA-Seq ExpressionPay0000230
SyntenyPay0000230
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0042655.1 Zinc finger matrin-type protein 1, putative isoform 1 [Cucumis melo var. makuwa]1.3e-13499.61Show/hide
Query:  MITLASTHYSSSSFKDLRLFKPSSTFSPSLSNLKPLNPFLKPPSNQTRFGNGICRAELGNDAPFAIAIGACFLTSLVVPAADRASDDESDAVIDSTDTRL
        MITLAST+YSSSSFKDLRLFKPSSTFSPSLSNLKPLNPFLKPPSNQTRFGNGICRAELGNDAPFAIAIGACFLTSLVVPAADRASDDESDAVIDSTDTRL
Subjt:  MITLASTHYSSSSFKDLRLFKPSSTFSPSLSNLKPLNPFLKPPSNQTRFGNGICRAELGNDAPFAIAIGACFLTSLVVPAADRASDDESDAVIDSTDTRL

Query:  AVMGIISFIPYFNWLSWVFAWLDSGRRLYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCIIHIQLEASITNGDIQPLQIFGKASKQISSTKKGRDQF
        AVMGIISFIPYFNWLSWVFAWLDSGRRLYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCIIHIQLEASITNGDIQPLQIFGKASKQISSTKKGRDQF
Subjt:  AVMGIISFIPYFNWLSWVFAWLDSGRRLYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCIIHIQLEASITNGDIQPLQIFGKASKQISSTKKGRDQF

Query:  KGSQGSYKESEKKEDRKLPSAEERFRDKISRLGDSKETDHEQINEEWDDDQRRKH
        KGSQGSYKESEKKEDRKLPSAEERFRDKISRLGDSKETDHEQINEEWDDDQRRKH
Subjt:  KGSQGSYKESEKKEDRKLPSAEERFRDKISRLGDSKETDHEQINEEWDDDQRRKH

XP_004143845.1 uncharacterized protein LOC101203490 [Cucumis sativus]1.8e-12594.53Show/hide
Query:  MITLASTHYSSSSFKDLRLFKPSSTFSPSLSNLKPLNPFLKPPSNQTRFGNGICRAELGNDAPFAIAIGACFLTSLVVPAADRASDDESDAVIDSTDTRL
        MITLAST+YS SSFK+LRLFKPSSTFSPSLSNLKPLNPFLKP SNQTRFGNGICRAELGNDAPFAIAIGAC LTSLVVPAAD ASDDESDAVIDSTDTRL
Subjt:  MITLASTHYSSSSFKDLRLFKPSSTFSPSLSNLKPLNPFLKPPSNQTRFGNGICRAELGNDAPFAIAIGACFLTSLVVPAADRASDDESDAVIDSTDTRL

Query:  AVMGIISFIPYFNWLSWVFAWLDSGRRLYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCIIHIQLEASITNGDIQPLQIFGKASKQISSTKKGRDQF
        AVMGIISFIPYFNWLSWVFAWLDSGRR YAVYAIVYLAPYLRSNLSLSPEESWLPI+SILLCIIHIQLEASITNGDIQPLQIFGKASK+ISSTKKGRDQF
Subjt:  AVMGIISFIPYFNWLSWVFAWLDSGRRLYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCIIHIQLEASITNGDIQPLQIFGKASKQISSTKKGRDQF

Query:  KGSQGSYKESEKKEDRKLPSAEERFRDKISRLGDSKET-DHEQINEEWDDDQRRKH
        KGSQGSYKESEKK DRKLPSAEERFRDKISRLGD+KET DHEQ N EWDDDQRRKH
Subjt:  KGSQGSYKESEKKEDRKLPSAEERFRDKISRLGDSKET-DHEQINEEWDDDQRRKH

XP_008437440.1 PREDICTED: uncharacterized protein LOC103482855 isoform X1 [Cucumis melo]2.6e-135100Show/hide
Query:  MITLASTHYSSSSFKDLRLFKPSSTFSPSLSNLKPLNPFLKPPSNQTRFGNGICRAELGNDAPFAIAIGACFLTSLVVPAADRASDDESDAVIDSTDTRL
        MITLASTHYSSSSFKDLRLFKPSSTFSPSLSNLKPLNPFLKPPSNQTRFGNGICRAELGNDAPFAIAIGACFLTSLVVPAADRASDDESDAVIDSTDTRL
Subjt:  MITLASTHYSSSSFKDLRLFKPSSTFSPSLSNLKPLNPFLKPPSNQTRFGNGICRAELGNDAPFAIAIGACFLTSLVVPAADRASDDESDAVIDSTDTRL

Query:  AVMGIISFIPYFNWLSWVFAWLDSGRRLYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCIIHIQLEASITNGDIQPLQIFGKASKQISSTKKGRDQF
        AVMGIISFIPYFNWLSWVFAWLDSGRRLYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCIIHIQLEASITNGDIQPLQIFGKASKQISSTKKGRDQF
Subjt:  AVMGIISFIPYFNWLSWVFAWLDSGRRLYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCIIHIQLEASITNGDIQPLQIFGKASKQISSTKKGRDQF

Query:  KGSQGSYKESEKKEDRKLPSAEERFRDKISRLGDSKETDHEQINEEWDDDQRRKH
        KGSQGSYKESEKKEDRKLPSAEERFRDKISRLGDSKETDHEQINEEWDDDQRRKH
Subjt:  KGSQGSYKESEKKEDRKLPSAEERFRDKISRLGDSKETDHEQINEEWDDDQRRKH

XP_008437441.1 PREDICTED: uncharacterized protein LOC103482855 isoform X2 [Cucumis melo]1.2e-10886.8Show/hide
Query:  MITLASTHYSSSSFKDLRLFKPSSTFSPSLSNLKPLNPFLKPPSNQTRFGNGICRAELGNDAPFAIAIGACFLTSLVVPAADRASDDESDAVIDSTDTRL
        MITLASTHYSSSSFKDLRLFKPSSTFSPSLSNLKPLNPFLKPPSNQTRFGNGICRAELGNDAPFAIAIGACFLTSLVVPAADRASDDESDAVIDSTDTRL
Subjt:  MITLASTHYSSSSFKDLRLFKPSSTFSPSLSNLKPLNPFLKPPSNQTRFGNGICRAELGNDAPFAIAIGACFLTSLVVPAADRASDDESDAVIDSTDTRL

Query:  AVMGIISFIPYFNWLSWVFAWLDSGRRLYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCIIHIQLEASITNGDIQPLQIFGKASKQISSTKKGRDQF
        AVMGIISFIPYFNWLSWVFAWLDSGRRLYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCIIHIQLEASITNGDIQPLQIFGKASKQISSTKKGRDQF
Subjt:  AVMGIISFIPYFNWLSWVFAWLDSGRRLYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCIIHIQLEASITNGDIQPLQIFGKASKQISSTKKGRDQF

Query:  KGSQGSYKESEKKEDRKLPSAEERFRDKISRLGDSKE--TDHEQINEEWD
        KGSQGSYKE        +     R R K  R G  ++   D E  + +W+
Subjt:  KGSQGSYKESEKKEDRKLPSAEERFRDKISRLGDSKE--TDHEQINEEWD

XP_038875289.1 uncharacterized protein LOC120067780 [Benincasa hispida]1.0e-11587.36Show/hide
Query:  MITLASTHYSS-----SSFKDLRLFKPSSTFSPSLSNLKPLNPFLKPPSNQTRFGNGICRAELGNDAPFAIAIGACFLTSLVVPAADRASDDESDAVIDS
        MITLAS + SS     SS K+LRLFKPSSTFSPSLSNLKPLNPFLKPPSNQ+R GNGICRAELGNDAPFAIAIGACFL+SLV+P AD ASDDESDA+IDS
Subjt:  MITLASTHYSS-----SSFKDLRLFKPSSTFSPSLSNLKPLNPFLKPPSNQTRFGNGICRAELGNDAPFAIAIGACFLTSLVVPAADRASDDESDAVIDS

Query:  TDTRLAVMGIISFIPYFNWLSWVFAWLDSGRRLYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCIIHIQLEASITNGDIQPLQIFGKASKQISSTKK
        TDTRLAVM IISFIPYFNWLSWVFAWLDSGRRLYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCIIHIQLE SITNGDIQPLQIFGKASK ISSTKK
Subjt:  TDTRLAVMGIISFIPYFNWLSWVFAWLDSGRRLYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCIIHIQLEASITNGDIQPLQIFGKASKQISSTKK

Query:  GRDQFKGSQGSYKESEKKEDRKLPSAEERFRDKISRLGDSKE-TDHEQINEEWDDDQRRKH
        GRD FKGSQG YKES KKEDRKLPSAEE+F+DKI R GDSKE  D+EQ N EWDD+QRRKH
Subjt:  GRDQFKGSQGSYKESEKKEDRKLPSAEERFRDKISRLGDSKE-TDHEQINEEWDDDQRRKH

TrEMBL top hitse value%identityAlignment
A0A0A0KK98 Uncharacterized protein8.9e-12694.53Show/hide
Query:  MITLASTHYSSSSFKDLRLFKPSSTFSPSLSNLKPLNPFLKPPSNQTRFGNGICRAELGNDAPFAIAIGACFLTSLVVPAADRASDDESDAVIDSTDTRL
        MITLAST+YS SSFK+LRLFKPSSTFSPSLSNLKPLNPFLKP SNQTRFGNGICRAELGNDAPFAIAIGAC LTSLVVPAAD ASDDESDAVIDSTDTRL
Subjt:  MITLASTHYSSSSFKDLRLFKPSSTFSPSLSNLKPLNPFLKPPSNQTRFGNGICRAELGNDAPFAIAIGACFLTSLVVPAADRASDDESDAVIDSTDTRL

Query:  AVMGIISFIPYFNWLSWVFAWLDSGRRLYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCIIHIQLEASITNGDIQPLQIFGKASKQISSTKKGRDQF
        AVMGIISFIPYFNWLSWVFAWLDSGRR YAVYAIVYLAPYLRSNLSLSPEESWLPI+SILLCIIHIQLEASITNGDIQPLQIFGKASK+ISSTKKGRDQF
Subjt:  AVMGIISFIPYFNWLSWVFAWLDSGRRLYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCIIHIQLEASITNGDIQPLQIFGKASKQISSTKKGRDQF

Query:  KGSQGSYKESEKKEDRKLPSAEERFRDKISRLGDSKET-DHEQINEEWDDDQRRKH
        KGSQGSYKESEKK DRKLPSAEERFRDKISRLGD+KET DHEQ N EWDDDQRRKH
Subjt:  KGSQGSYKESEKKEDRKLPSAEERFRDKISRLGDSKET-DHEQINEEWDDDQRRKH

A0A1S3AU10 uncharacterized protein LOC103482855 isoform X25.8e-10986.8Show/hide
Query:  MITLASTHYSSSSFKDLRLFKPSSTFSPSLSNLKPLNPFLKPPSNQTRFGNGICRAELGNDAPFAIAIGACFLTSLVVPAADRASDDESDAVIDSTDTRL
        MITLASTHYSSSSFKDLRLFKPSSTFSPSLSNLKPLNPFLKPPSNQTRFGNGICRAELGNDAPFAIAIGACFLTSLVVPAADRASDDESDAVIDSTDTRL
Subjt:  MITLASTHYSSSSFKDLRLFKPSSTFSPSLSNLKPLNPFLKPPSNQTRFGNGICRAELGNDAPFAIAIGACFLTSLVVPAADRASDDESDAVIDSTDTRL

Query:  AVMGIISFIPYFNWLSWVFAWLDSGRRLYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCIIHIQLEASITNGDIQPLQIFGKASKQISSTKKGRDQF
        AVMGIISFIPYFNWLSWVFAWLDSGRRLYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCIIHIQLEASITNGDIQPLQIFGKASKQISSTKKGRDQF
Subjt:  AVMGIISFIPYFNWLSWVFAWLDSGRRLYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCIIHIQLEASITNGDIQPLQIFGKASKQISSTKKGRDQF

Query:  KGSQGSYKESEKKEDRKLPSAEERFRDKISRLGDSKE--TDHEQINEEWD
        KGSQGSYKE        +     R R K  R G  ++   D E  + +W+
Subjt:  KGSQGSYKESEKKEDRKLPSAEERFRDKISRLGDSKE--TDHEQINEEWD

A0A1S3AU53 uncharacterized protein LOC103482855 isoform X11.2e-135100Show/hide
Query:  MITLASTHYSSSSFKDLRLFKPSSTFSPSLSNLKPLNPFLKPPSNQTRFGNGICRAELGNDAPFAIAIGACFLTSLVVPAADRASDDESDAVIDSTDTRL
        MITLASTHYSSSSFKDLRLFKPSSTFSPSLSNLKPLNPFLKPPSNQTRFGNGICRAELGNDAPFAIAIGACFLTSLVVPAADRASDDESDAVIDSTDTRL
Subjt:  MITLASTHYSSSSFKDLRLFKPSSTFSPSLSNLKPLNPFLKPPSNQTRFGNGICRAELGNDAPFAIAIGACFLTSLVVPAADRASDDESDAVIDSTDTRL

Query:  AVMGIISFIPYFNWLSWVFAWLDSGRRLYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCIIHIQLEASITNGDIQPLQIFGKASKQISSTKKGRDQF
        AVMGIISFIPYFNWLSWVFAWLDSGRRLYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCIIHIQLEASITNGDIQPLQIFGKASKQISSTKKGRDQF
Subjt:  AVMGIISFIPYFNWLSWVFAWLDSGRRLYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCIIHIQLEASITNGDIQPLQIFGKASKQISSTKKGRDQF

Query:  KGSQGSYKESEKKEDRKLPSAEERFRDKISRLGDSKETDHEQINEEWDDDQRRKH
        KGSQGSYKESEKKEDRKLPSAEERFRDKISRLGDSKETDHEQINEEWDDDQRRKH
Subjt:  KGSQGSYKESEKKEDRKLPSAEERFRDKISRLGDSKETDHEQINEEWDDDQRRKH

A0A5D3C6U3 Zinc finger matrin-type protein 1, putative isoform 16.2e-13599.61Show/hide
Query:  MITLASTHYSSSSFKDLRLFKPSSTFSPSLSNLKPLNPFLKPPSNQTRFGNGICRAELGNDAPFAIAIGACFLTSLVVPAADRASDDESDAVIDSTDTRL
        MITLAST+YSSSSFKDLRLFKPSSTFSPSLSNLKPLNPFLKPPSNQTRFGNGICRAELGNDAPFAIAIGACFLTSLVVPAADRASDDESDAVIDSTDTRL
Subjt:  MITLASTHYSSSSFKDLRLFKPSSTFSPSLSNLKPLNPFLKPPSNQTRFGNGICRAELGNDAPFAIAIGACFLTSLVVPAADRASDDESDAVIDSTDTRL

Query:  AVMGIISFIPYFNWLSWVFAWLDSGRRLYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCIIHIQLEASITNGDIQPLQIFGKASKQISSTKKGRDQF
        AVMGIISFIPYFNWLSWVFAWLDSGRRLYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCIIHIQLEASITNGDIQPLQIFGKASKQISSTKKGRDQF
Subjt:  AVMGIISFIPYFNWLSWVFAWLDSGRRLYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCIIHIQLEASITNGDIQPLQIFGKASKQISSTKKGRDQF

Query:  KGSQGSYKESEKKEDRKLPSAEERFRDKISRLGDSKETDHEQINEEWDDDQRRKH
        KGSQGSYKESEKKEDRKLPSAEERFRDKISRLGDSKETDHEQINEEWDDDQRRKH
Subjt:  KGSQGSYKESEKKEDRKLPSAEERFRDKISRLGDSKETDHEQINEEWDDDQRRKH

A0A6J1BQE6 uncharacterized protein LOC1110044994.8e-9575Show/hide
Query:  MITLASTHYSSS----SFKDLRLFKPSSTFSPSLSNLKPLNPFLKPPSNQTRFGNGICRAELGNDAPFAIAIGACFLTSLVVPAADRASDDESDAVIDST
        MI+LA    SSS    S   LRL +P STFS SLSNLK LNP  K  S+Q R GNG+CRA+LGND PFA+AIGAC L+S V P A   SDDESDAVIDST
Subjt:  MITLASTHYSSS----SFKDLRLFKPSSTFSPSLSNLKPLNPFLKPPSNQTRFGNGICRAELGNDAPFAIAIGACFLTSLVVPAADRASDDESDAVIDST

Query:  DTRLAVMGIISFIPYFNWLSWVFAWLDSGRRLYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCIIHIQLEASITNGDIQPLQIFGKASKQISSTKKG
        DTR AVMGIISFIPYFNWLSWVFAWLDSGRRLYAVYA+VYL PYLRSNLSLSPEESWLPI SILLCIIHIQLE SI NGDIQP QIFGK SK+ISST +G
Subjt:  DTRLAVMGIISFIPYFNWLSWVFAWLDSGRRLYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCIIHIQLEASITNGDIQPLQIFGKASKQISSTKKG

Query:  RDQFKGSQGSYKESEKKEDRKLPSAEERFRDKISRLGDSKET-DHEQINEEWDDDQRRKH
        RD FKGSQG  +ES +KED KLPS +E+ RD+I R GDSKET DHEQ N EWDD+QRRKH
Subjt:  RDQFKGSQGSYKESEKKEDRKLPSAEERFRDKISRLGDSKET-DHEQINEEWDDDQRRKH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G41960.1 unknown protein4.3e-4854.77Show/hide
Query:  SSSSFKDLRLFKPSSTFSPSLSNLKPLNPFLKPPSNQTRFGNGICRAELGNDAPFAIAIGACFLTSLVVPAADRASDDESD---AVIDSTDTRLAVMGII
        S+S +   RL   SS+   S S L    P   P     +    ICRAE   DAP   AIGAC L+S V P A R +D+E +   + I STD RLA MGII
Subjt:  SSSSFKDLRLFKPSSTFSPSLSNLKPLNPFLKPPSNQTRFGNGICRAELGNDAPFAIAIGACFLTSLVVPAADRASDDESD---AVIDSTDTRLAVMGII

Query:  SFIPYFNWLSWVFAWLDSGRRLYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCIIHIQLEASITNGDIQPLQIFGKASKQISSTKKG---RDQFKG
        SFIPYFNWLSWVFAWLD+G+  YAVYA+VYL PYL SNLS+SPEESWLPI SI+L IIH+QLEASI NGD++ L  F   S    S+KK    +  FKG
Subjt:  SFIPYFNWLSWVFAWLDSGRRLYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCIIHIQLEASITNGDIQPLQIFGKASKQISSTKKG---RDQFKG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATCACTCTGGCTTCTACTCATTATTCATCATCTTCCTTCAAGGATCTTCGTCTCTTCAAACCCTCTTCCACATTCTCCCCATCACTCTCCAATCTCAAACCCTTAAA
TCCTTTCCTCAAACCACCTTCCAATCAGACCAGGTTCGGTAATGGGATTTGTAGGGCAGAATTGGGTAACGATGCACCTTTCGCCATTGCAATCGGTGCCTGTTTTCTCA
CTTCTCTTGTTGTTCCGGCAGCTGATCGTGCTTCCGATGATGAGAGCGATGCCGTCATTGATTCCACCGATACTAGGCTTGCTGTCATGGGCATCATTAGCTTTATTCCT
TACTTCAATTGGCTGAGTTGGGTTTTCGCTTGGCTTGATTCTGGAAGAAGACTTTATGCTGTGTATGCAATCGTGTATTTGGCTCCTTACTTAAGGTCAAACTTATCGTT
GTCACCCGAAGAGAGCTGGCTTCCTATTGTCAGTATACTTCTCTGCATTATTCACATCCAGCTTGAAGCGAGCATTACAAATGGAGATATTCAACCTTTGCAAATATTTG
GGAAAGCTTCAAAGCAAATTTCTTCAACCAAGAAAGGGAGAGACCAATTCAAGGGATCCCAAGGATCATACAAAGAGAGCGAAAAGAAAGAGGACAGGAAGCTGCCGTCA
GCAGAAGAACGATTTCGAGATAAGATCAGTAGATTGGGAGATTCTAAAGAGACAGATCATGAACAAATAAATGAAGAATGGGATGATGATCAAAGGAGAAAGCATTAG
mRNA sequenceShow/hide mRNA sequence
ATCCACTTTCTATAAATACCAAATTGGAGTTTAAATTAGATAGAAAAGAAGAAAAGCCAAGCCAACAATGGCGAAAAAGGAAATTGGCACACATGCCAAGAACAGTTCTA
CTGGTGTTCTTCGTTCTCAAGTTATGATTTTTTGTTCAAAAACTCTCTAAATCTCTCCTTTTTCCCTTTCTCTTCTGACTGATTCATATCAATTTTAACGCCAATGATCA
CTCTGGCTTCTACTCATTATTCATCATCTTCCTTCAAGGATCTTCGTCTCTTCAAACCCTCTTCCACATTCTCCCCATCACTCTCCAATCTCAAACCCTTAAATCCTTTC
CTCAAACCACCTTCCAATCAGACCAGGTTCGGTAATGGGATTTGTAGGGCAGAATTGGGTAACGATGCACCTTTCGCCATTGCAATCGGTGCCTGTTTTCTCACTTCTCT
TGTTGTTCCGGCAGCTGATCGTGCTTCCGATGATGAGAGCGATGCCGTCATTGATTCCACCGATACTAGGCTTGCTGTCATGGGCATCATTAGCTTTATTCCTTACTTCA
ATTGGCTGAGTTGGGTTTTCGCTTGGCTTGATTCTGGAAGAAGACTTTATGCTGTGTATGCAATCGTGTATTTGGCTCCTTACTTAAGGTCAAACTTATCGTTGTCACCC
GAAGAGAGCTGGCTTCCTATTGTCAGTATACTTCTCTGCATTATTCACATCCAGCTTGAAGCGAGCATTACAAATGGAGATATTCAACCTTTGCAAATATTTGGGAAAGC
TTCAAAGCAAATTTCTTCAACCAAGAAAGGGAGAGACCAATTCAAGGGATCCCAAGGATCATACAAAGAGAGCGAAAAGAAAGAGGACAGGAAGCTGCCGTCAGCAGAAG
AACGATTTCGAGATAAGATCAGTAGATTGGGAGATTCTAAAGAGACAGATCATGAACAAATAAATGAAGAATGGGATGATGATCAAAGGAGAAAGCATTAGGTTATATGT
TCTAACTTTACTCTGCTTGTGGTATACTAAAATTGGAGGATACAGAAAATCAAAGTTGTAGCATTAACATTACTTTTCCCTTCAAATCTCGAGTCAGTTGCTACCATTTT
CATGGGACCTTGAGTATTCTTAGAAATGCTGCAGTTGTATTGCTAAGCTGAGGTTGCTGAATGCTATTATTTTACTTCTCTTTTATCAGAAACATAAGTTTATTATTGAA
ATGAGAGAAGCGTTATGCATAGGCCTTAACTTACCCCACAAGTAGGACTAGTGTTGTGCATCAACCTCTGTCATTGTTTCTATTGGTGGGTTTTTTCATTTAATTTATCA
ATGAAATGTCACTGTTTC
Protein sequenceShow/hide protein sequence
MITLASTHYSSSSFKDLRLFKPSSTFSPSLSNLKPLNPFLKPPSNQTRFGNGICRAELGNDAPFAIAIGACFLTSLVVPAADRASDDESDAVIDSTDTRLAVMGIISFIP
YFNWLSWVFAWLDSGRRLYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCIIHIQLEASITNGDIQPLQIFGKASKQISSTKKGRDQFKGSQGSYKESEKKEDRKLPS
AEERFRDKISRLGDSKETDHEQINEEWDDDQRRKH