CuGenDBv2

Spg029236 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg029236
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Zinc finger matrin-type protein 1, putative isoform 1
Genome location: scaffold12:35128333..35130602
RNA-Seq Expression: Spg029236
Synteny: Spg029236
Gene Ontology terms:
  GO:0016021 - integral component of membrane (cellular component)
  GO:0000166 - nucleotide binding (molecular function)
InterPro domains: NA


Homology
GenBank top hits (e value, %identity, alignment)
XP_022131237.1 uncharacterized protein LOC111004499 [Momordica charantia], e value 1.1e-112, identity 81.15%
Query:  MITLAYAYLSSSPSNLSSLKLRLFKPSSAFSTPLSNLKPLNPCHKSASNKRRIANGICRAELGNDAPFAFAIGACILSSLVFPAAGGGSDDEGDAVIDST
        MI+LAYA LSSSPSNLSSLKLRL +P S FST LSNLK LNPC K+AS+++RI NG+CRA+LGND PFA AIGACILSS VFP AGGGSDDE DAVIDST
Subjt:  MITLAYAYLSSSPSNLSSLKLRLFKPSSAFSTPLSNLKPLNPCHKSASNKRRIANGICRAELGNDAPFAFAIGACILSSLVFPAAGGGSDDEGDAVIDST

Query:  DTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCILHIQVEVSITNGDIQPFQIFGRASNQISSTKKG
        DTR AVM IISFIPYFNWLSWVFAWLDSG+R YAVYA+VYL PYLRSNLSLSPEESWLPI SILLCI+HIQ+EVSI NGDIQPFQIFG+ S +ISST +G
Subjt:  DTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCILHIQVEVSITNGDIQPFQIFGRASNQISSTKKG

Query:  RDYFKGSQEPTKKSGKKEDRKLPTAEEQLRDEIRRWGNSKETLDHEQSDGEWDDEQRRKH
        RD+FKGSQ P ++SG+KED KLP+ +EQLRDEIRRWG+SKETLDHEQS+GEWDDEQRRKH
Subjt:  RDYFKGSQEPTKKSGKKEDRKLPTAEEQLRDEIRRWGNSKETLDHEQSDGEWDDEQRRKH

XP_022958725.1 uncharacterized protein LOC111459865 isoform X2 [Cucurbita moschata], e value 8.5e-110, identity 82.69%
Query:  MITLAYAYLSSSPSNLSSLK-LRLFKPSSAFSTPLSNLKPLNPCHKSASNKRRIANGICRAELGNDAPFAFAIGACILSSLVFPAAGGGSDDEGDAVIDS
        MITLA AYLSSSPSN SSL  LRL KP   FST LSNLKPLNP HKSASN+R   NGICRAELGNDAPFA AIGACILSSLV P AGGGSDD+ DAV+DS
Subjt:  MITLAYAYLSSSPSNLSSLK-LRLFKPSSAFSTPLSNLKPLNPCHKSASNKRRIANGICRAELGNDAPFAFAIGACILSSLVFPAAGGGSDDEGDAVIDS

Query:  TDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCILHIQVEVSITNGDIQPFQIFGRASNQISSTKK
        TD RLAVM IISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSP+ESWLPIVSIL+CI HIQVE SI NGDIQPFQIFG+ SNQIS TK 
Subjt:  TDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCILHIQVEVSITNGDIQPFQIFGRASNQISSTKK

Query:  GRDYFKGSQEPTKKSGKKEDRKLPTAEEQLRDEIRRWGNSKETLDHEQSDGEWDDEQRRK
         R + KGSQ PTKKSGKK D KLP+AEEQLRDEI+ WG+ KETLDHEQS+ EWDDEQRRK
Subjt:  GRDYFKGSQEPTKKSGKKEDRKLPTAEEQLRDEIRRWGNSKETLDHEQSDGEWDDEQRRK

XP_023534481.1 uncharacterized protein LOC111796027 isoform X1 [Cucurbita pepo subsp. pepo], e value 5.0e-110, identity 83.52%
Query:  MITLAYAYLSSSPSNLSSLK-LRLFKPSSAFSTPLSNLKPLNPCHKSASN-KRRIANGICRAELGNDAPFAFAIGACILSSLVFPAAGGGSDDEGDAVID
        MITLA AYLSSSPSN SSL  LRL KP   FST LSNLKPLNP HKSASN KR   NGICRAELGNDAPFA AIGACIL+SLV P AGGGSDD+ DAV+D
Subjt:  MITLAYAYLSSSPSNLSSLK-LRLFKPSSAFSTPLSNLKPLNPCHKSASN-KRRIANGICRAELGNDAPFAFAIGACILSSLVFPAAGGGSDDEGDAVID

Query:  STDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCILHIQVEVSITNGDIQPFQIFGRASNQISSTK
        STD RLAVM IISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSP+ESWLPIVSIL+CI HIQVE SI NGDIQPFQIFG+ASNQIS TK
Subjt:  STDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCILHIQVEVSITNGDIQPFQIFGRASNQISSTK

Query:  KGRDYFKGSQEPTKKSGKKEDRKLPTAEEQLRDEIRRWGNSKETLDHEQSDGEWDDEQRRK
         GR + KGSQ PTKKSGKK D KLP+AEEQLRDEIR WG+ KETLDHEQS+ EWDDEQRRK
Subjt:  KGRDYFKGSQEPTKKSGKKEDRKLPTAEEQLRDEIRRWGNSKETLDHEQSDGEWDDEQRRK

XP_023534483.1 uncharacterized protein LOC111796027 isoform X2 [Cucurbita pepo subsp. pepo], e value 5.9e-111, identity 83.46%
Query:  MITLAYAYLSSSPSNLSSLK-LRLFKPSSAFSTPLSNLKPLNPCHKSASNKRRIANGICRAELGNDAPFAFAIGACILSSLVFPAAGGGSDDEGDAVIDS
        MITLA AYLSSSPSN SSL  LRL KP   FST LSNLKPLNP HKSASN+R   NGICRAELGNDAPFA AIGACIL+SLV P AGGGSDD+ DAV+DS
Subjt:  MITLAYAYLSSSPSNLSSLK-LRLFKPSSAFSTPLSNLKPLNPCHKSASNKRRIANGICRAELGNDAPFAFAIGACILSSLVFPAAGGGSDDEGDAVIDS

Query:  TDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCILHIQVEVSITNGDIQPFQIFGRASNQISSTKK
        TD RLAVM IISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSP+ESWLPIVSIL+CI HIQVE SI NGDIQPFQIFG+ASNQIS TK 
Subjt:  TDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCILHIQVEVSITNGDIQPFQIFGRASNQISSTKK

Query:  GRDYFKGSQEPTKKSGKKEDRKLPTAEEQLRDEIRRWGNSKETLDHEQSDGEWDDEQRRK
        GR + KGSQ PTKKSGKK D KLP+AEEQLRDEIR WG+ KETLDHEQS+ EWDDEQRRK
Subjt:  GRDYFKGSQEPTKKSGKKEDRKLPTAEEQLRDEIRRWGNSKETLDHEQSDGEWDDEQRRK

XP_038875289.1 uncharacterized protein LOC120067780 [Benincasa hispida], e value 1.4e-112, identity 84.29%
Query:  MITLAYAYLSSSPSNLSSLK-LRLFKPSSAFSTPLSNLKPLNPCHKSASNKRRIANGICRAELGNDAPFAFAIGACILSSLVFPAAGGGSDDEGDAVIDS
        MITLA AYLSSS SNLSSLK LRLFKPSS FS  LSNLKPLNP  K  SN+ RI NGICRAELGNDAPFA AIGAC LSSLV P A G SDDE DA+IDS
Subjt:  MITLAYAYLSSSPSNLSSLK-LRLFKPSSAFSTPLSNLKPLNPCHKSASNKRRIANGICRAELGNDAPFAFAIGACILSSLVFPAAGGGSDDEGDAVIDS

Query:  TDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCILHIQVEVSITNGDIQPFQIFGRASNQISSTKK
        TDTRLAVMSIISFIPYFNWLSWVFAWLDSG+R YAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCI+HIQ+EVSITNGDIQP QIFG+AS  ISSTKK
Subjt:  TDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCILHIQVEVSITNGDIQPFQIFGRASNQISSTKK

Query:  GRDYFKGSQEPTKKSGKKEDRKLPTAEEQLRDEIRRWGNSKETLDHEQSDGEWDDEQRRKH
        GRD+FKGSQ P K+SGKKEDRKLP+AEEQ +D+IRRWG+SKE LD+EQS+GEWDDEQRRKH
Subjt:  GRDYFKGSQEPTKKSGKKEDRKLPTAEEQLRDEIRRWGNSKETLDHEQSDGEWDDEQRRKH

TrEMBL top hits (e value, %identity, alignment)
A0A6J1BQE6 uncharacterized protein LOC111004499, e value 5.2e-113, identity 81.15%
Query:  MITLAYAYLSSSPSNLSSLKLRLFKPSSAFSTPLSNLKPLNPCHKSASNKRRIANGICRAELGNDAPFAFAIGACILSSLVFPAAGGGSDDEGDAVIDST
        MI+LAYA LSSSPSNLSSLKLRL +P S FST LSNLK LNPC K+AS+++RI NG+CRA+LGND PFA AIGACILSS VFP AGGGSDDE DAVIDST
Subjt:  MITLAYAYLSSSPSNLSSLKLRLFKPSSAFSTPLSNLKPLNPCHKSASNKRRIANGICRAELGNDAPFAFAIGACILSSLVFPAAGGGSDDEGDAVIDST

Query:  DTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCILHIQVEVSITNGDIQPFQIFGRASNQISSTKKG
        DTR AVM IISFIPYFNWLSWVFAWLDSG+R YAVYA+VYL PYLRSNLSLSPEESWLPI SILLCI+HIQ+EVSI NGDIQPFQIFG+ S +ISST +G
Subjt:  DTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCILHIQVEVSITNGDIQPFQIFGRASNQISSTKKG

Query:  RDYFKGSQEPTKKSGKKEDRKLPTAEEQLRDEIRRWGNSKETLDHEQSDGEWDDEQRRKH
        RD+FKGSQ P ++SG+KED KLP+ +EQLRDEIRRWG+SKETLDHEQS+GEWDDEQRRKH
Subjt:  RDYFKGSQEPTKKSGKKEDRKLPTAEEQLRDEIRRWGNSKETLDHEQSDGEWDDEQRRKH

A0A6J1H4A9 uncharacterized protein LOC111459865 isoform X2, e value 4.1e-110, identity 82.69%
Query:  MITLAYAYLSSSPSNLSSLK-LRLFKPSSAFSTPLSNLKPLNPCHKSASNKRRIANGICRAELGNDAPFAFAIGACILSSLVFPAAGGGSDDEGDAVIDS
        MITLA AYLSSSPSN SSL  LRL KP   FST LSNLKPLNP HKSASN+R   NGICRAELGNDAPFA AIGACILSSLV P AGGGSDD+ DAV+DS
Subjt:  MITLAYAYLSSSPSNLSSLK-LRLFKPSSAFSTPLSNLKPLNPCHKSASNKRRIANGICRAELGNDAPFAFAIGACILSSLVFPAAGGGSDDEGDAVIDS

Query:  TDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCILHIQVEVSITNGDIQPFQIFGRASNQISSTKK
        TD RLAVM IISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSP+ESWLPIVSIL+CI HIQVE SI NGDIQPFQIFG+ SNQIS TK 
Subjt:  TDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCILHIQVEVSITNGDIQPFQIFGRASNQISSTKK

Query:  GRDYFKGSQEPTKKSGKKEDRKLPTAEEQLRDEIRRWGNSKETLDHEQSDGEWDDEQRRK
         R + KGSQ PTKKSGKK D KLP+AEEQLRDEI+ WG+ KETLDHEQS+ EWDDEQRRK
Subjt:  GRDYFKGSQEPTKKSGKKEDRKLPTAEEQLRDEIRRWGNSKETLDHEQSDGEWDDEQRRK

A0A6J1H5Y1 uncharacterized protein LOC111459865 isoform X1, e value 1.6e-109, identity 82.76%
Query:  MITLAYAYLSSSPSNLSSLK-LRLFKPSSAFSTPLSNLKPLNPCHKSASNKRRIA-NGICRAELGNDAPFAFAIGACILSSLVFPAAGGGSDDEGDAVID
        MITLA AYLSSSPSN SSL  LRL KP   FST LSNLKPLNP HKSASN+RR   NGICRAELGNDAPFA AIGACILSSLV P AGGGSDD+ DAV+D
Subjt:  MITLAYAYLSSSPSNLSSLK-LRLFKPSSAFSTPLSNLKPLNPCHKSASNKRRIA-NGICRAELGNDAPFAFAIGACILSSLVFPAAGGGSDDEGDAVID

Query:  STDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCILHIQVEVSITNGDIQPFQIFGRASNQISSTK
        STD RLAVM IISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSP+ESWLPIVSIL+CI HIQVE SI NGDIQPFQIFG+ SNQIS TK
Subjt:  STDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCILHIQVEVSITNGDIQPFQIFGRASNQISSTK

Query:  KGRDYFKGSQEPTKKSGKKEDRKLPTAEEQLRDEIRRWGNSKETLDHEQSDGEWDDEQRRK
          R + KGSQ PTKKSGKK D KLP+AEEQLRDEI+ WG+ KETLDHEQS+ EWDDEQRRK
Subjt:  KGRDYFKGSQEPTKKSGKKEDRKLPTAEEQLRDEIRRWGNSKETLDHEQSDGEWDDEQRRK

A0A6J1K887 uncharacterized protein LOC111491538 isoform X2, e value 7.0e-110, identity 82.31%
Query:  MITLAYAYLSSSPSNLSSLK-LRLFKPSSAFSTPLSNLKPLNPCHKSASNKRRIANGICRAELGNDAPFAFAIGACILSSLVFPAAGGGSDDEGDAVIDS
        M+TLA AYLSSSPSN SSL  LRL KP   FST LSNLKPLNP HKSASN+R   NGICRAELGNDAPFA AIGACILSSLV P AGGGSDD+ DAV+DS
Subjt:  MITLAYAYLSSSPSNLSSLK-LRLFKPSSAFSTPLSNLKPLNPCHKSASNKRRIANGICRAELGNDAPFAFAIGACILSSLVFPAAGGGSDDEGDAVIDS

Query:  TDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCILHIQVEVSITNGDIQPFQIFGRASNQISSTKK
        TD RLAVM IISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSP+ESWLPIVSIL+CI HIQVE SI NGDIQPFQIFG+ASNQIS T+ 
Subjt:  TDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCILHIQVEVSITNGDIQPFQIFGRASNQISSTKK

Query:  GRDYFKGSQEPTKKSGKKEDRKLPTAEEQLRDEIRRWGNSKETLDHEQSDGEWDDEQRRK
        GR + KG + PTKKSGKK D KLP+AEEQLRDEIR WG+ KETLDHEQS+ EWDDEQRRK
Subjt:  GRDYFKGSQEPTKKSGKKEDRKLPTAEEQLRDEIRRWGNSKETLDHEQSDGEWDDEQRRK

A0A6J1KAA9 uncharacterized protein LOC111491538 isoform X1, e value 2.7e-109, identity 82.38%
Query:  MITLAYAYLSSSPSNLSSLK-LRLFKPSSAFSTPLSNLKPLNPCHKSASNKRRIA-NGICRAELGNDAPFAFAIGACILSSLVFPAAGGGSDDEGDAVID
        M+TLA AYLSSSPSN SSL  LRL KP   FST LSNLKPLNP HKSASN+RR   NGICRAELGNDAPFA AIGACILSSLV P AGGGSDD+ DAV+D
Subjt:  MITLAYAYLSSSPSNLSSLK-LRLFKPSSAFSTPLSNLKPLNPCHKSASNKRRIA-NGICRAELGNDAPFAFAIGACILSSLVFPAAGGGSDDEGDAVID

Query:  STDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCILHIQVEVSITNGDIQPFQIFGRASNQISSTK
        STD RLAVM IISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSP+ESWLPIVSIL+CI HIQVE SI NGDIQPFQIFG+ASNQIS T+
Subjt:  STDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCILHIQVEVSITNGDIQPFQIFGRASNQISSTK

Query:  KGRDYFKGSQEPTKKSGKKEDRKLPTAEEQLRDEIRRWGNSKETLDHEQSDGEWDDEQRRK
         GR + KG + PTKKSGKK D KLP+AEEQLRDEIR WG+ KETLDHEQS+ EWDDEQRRK
Subjt:  KGRDYFKGSQEPTKKSGKKEDRKLPTAEEQLRDEIRRWGNSKETLDHEQSDGEWDDEQRRK

SwissProt top hits (e value, %identity, alignment)
No hits found
Arabidopsis top hits (e value, %identity, alignment)
AT5G41960.1 unknown protein, e value 1.8e-49, identity 55.88%
Query:  LSSSPSNLSSLKLRLFKPSSAFSTPLSNLKPLNPCHKSASNKRRIANGICRAELGNDAPFAFAIGACILSSLVFPAAGGGSDDEGD---AVIDSTDTRLA
        LSSS S  +  K RL   SS+  +PL    PL    K     R+I   ICRAE   DAP   AIGACILSS VFP A   +D+E +   + I STD RLA
Subjt:  LSSSPSNLSSLKLRLFKPSSAFSTPLSNLKPLNPCHKSASNKRRIANGICRAELGNDAPFAFAIGACILSSLVFPAAGGGSDDEGD---AVIDSTDTRLA

Query:  VMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCILHIQVEVSITNGDIQPFQIFGRASNQISSTKKG---RD
         M IISFIPYFNWLSWVFAWLD+GK RYAVYA+VYL PYL SNLS+SPEESWLPI SI+L I+H+Q+E SI NGD++    F   S+   S+KK    + 
Subjt:  VMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCILHIQVEVSITNGDIQPFQIFGRASNQISSTKKG---RD

Query:  YFKG
        +FKG
Subjt:  YFKG
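
The alignment blocks above follow the usual BLAST-style layout: in the unlabeled middle row, a letter marks an identical residue, "+" marks a conservative substitution, and a blank marks a mismatch or gap. As a rough illustration only, the Python sketch below counts identities and positives for a single block from its Query row and middle row. The function name `identity_from_midline` is hypothetical, and the %identity values listed for each hit come from the database's own search over the full alignment, so a per-block count is not expected to reproduce them.

```python
def identity_from_midline(query, midline):
    """Count identities and positives in one BLAST-style alignment block.

    query   -- the residues of the Query row (label and spacing removed)
    midline -- the unlabeled middle row, aligned column by column with query
    Returns (identical, positives, block_length).
    """
    # Trailing blanks of the middle row are often stripped; pad them back.
    midline = midline.ljust(len(query))
    # A letter in the middle row marks an identical residue.
    identical = sum(1 for ch in midline if ch.isalpha())
    # "+" marks a conservative substitution; positives include identities.
    positives = identical + midline.count("+")
    return identical, positives, len(query)


# Last block of the Arabidopsis hit: Query "YFKG", middle row "+FKG".
ident, pos, length = identity_from_midline("YFKG", "+FKG")
print(f"{ident}/{length} identical, {pos}/{length} positive")
```

Summing the three counts over every block of a hit gives the totals for the portion of the alignment shown on the page.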


Sequences
CDS sequence
ATGATCACGCTGGCTTATGCTTATTTATCATCATCCCCTTCCAATCTCTCTTCTCTCAAGCTTCGTCTCTTCAAACCCTCTTCCGCCTTCTCGACACCACTCTCCAATCT
CAAACCCTTAAATCCTTGCCACAAATCCGCTTCCAATAAGAGGAGGATCGCAAATGGGATTTGTAGGGCGGAATTGGGGAACGACGCGCCTTTCGCCTTTGCGATCGGGG
CCTGCATTCTCAGCTCTCTCGTTTTTCCGGCAGCCGGCGGTGGTTCCGATGATGAGGGCGATGCCGTTATTGATTCCACCGATACCAGGCTCGCTGTCATGAGCATCATT
AGCTTTATCCCCTACTTCAACTGGCTGAGTTGGGTTTTTGCGTGGCTTGATTCTGGGAAAAGACGTTATGCTGTGTATGCAATCGTGTATTTGGCTCCTTATCTAAGGTC
AAATTTATCGTTGTCGCCCGAAGAGAGCTGGCTTCCTATTGTCAGTATACTTCTCTGCATTCTTCACATTCAGGTCGAAGTGAGCATTACAAATGGAGATATTCAACCGT
TCCAAATATTTGGCAGAGCTTCAAATCAAATTTCTTCAACAAAGAAAGGGAGAGACTATTTCAAGGGCTCCCAAGAACCAACCAAAAAGAGTGGAAAAAAGGAGGACAGG
AAACTGCCAACTGCTGAAGAACAATTGAGAGACGAGATTAGAAGGTGGGGCAATTCTAAAGAGACATTAGACCATGAACAATCAGATGGAGAATGGGATGACGAACAGAG
GAGAAAACATTAG
mRNA sequence
ATGATCACGCTGGCTTATGCTTATTTATCATCATCCCCTTCCAATCTCTCTTCTCTCAAGCTTCGTCTCTTCAAACCCTCTTCCGCCTTCTCGACACCACTCTCCAATCT
CAAACCCTTAAATCCTTGCCACAAATCCGCTTCCAATAAGAGGAGGATCGCAAATGGGATTTGTAGGGCGGAATTGGGGAACGACGCGCCTTTCGCCTTTGCGATCGGGG
CCTGCATTCTCAGCTCTCTCGTTTTTCCGGCAGCCGGCGGTGGTTCCGATGATGAGGGCGATGCCGTTATTGATTCCACCGATACCAGGCTCGCTGTCATGAGCATCATT
AGCTTTATCCCCTACTTCAACTGGCTGAGTTGGGTTTTTGCGTGGCTTGATTCTGGGAAAAGACGTTATGCTGTGTATGCAATCGTGTATTTGGCTCCTTATCTAAGGTC
AAATTTATCGTTGTCGCCCGAAGAGAGCTGGCTTCCTATTGTCAGTATACTTCTCTGCATTCTTCACATTCAGGTCGAAGTGAGCATTACAAATGGAGATATTCAACCGT
TCCAAATATTTGGCAGAGCTTCAAATCAAATTTCTTCAACAAAGAAAGGGAGAGACTATTTCAAGGGCTCCCAAGAACCAACCAAAAAGAGTGGAAAAAAGGAGGACAGG
AAACTGCCAACTGCTGAAGAACAATTGAGAGACGAGATTAGAAGGTGGGGCAATTCTAAAGAGACATTAGACCATGAACAATCAGATGGAGAATGGGATGACGAACAGAG
GAGAAAACATTAG
Protein sequence
MITLAYAYLSSSPSNLSSLKLRLFKPSSAFSTPLSNLKPLNPCHKSASNKRRIANGICRAELGNDAPFAFAIGACILSSLVFPAAGGGSDDEGDAVIDSTDTRLAVMSII
SFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCILHIQVEVSITNGDIQPFQIFGRASNQISSTKKGRDYFKGSQEPTKKSGKKEDR
KLPTAEEQLRDEIRRWGNSKETLDHEQSDGEWDDEQRRKH
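
The CDS and protein sequences above can be cross-checked by translating the CDS with the standard genetic code. The Python sketch below is one minimal way to do that, assuming the standard nuclear code applies to this gene; the names `CODON_TABLE` and `translate` are illustrative and are not part of CuGenDB. The example translates only the first 42 and last 21 bases of the CDS as listed above.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 42 and last 21 bases of the CDS sequence listed above.
cds_start = "ATGATCACGCTGGCTTATGCTTATTTATCATCATCCCCTTCC"
cds_end = "GAACAGAGGAGAAAACATTAG"
print(translate(cds_start))  # MITLAYAYLSSSPS
print(translate(cds_end))    # EQRRKH
```

Both fragments agree with the start (MITLAYAYLSSSPS...) and end (...EQRRKH) of the protein sequence. Passing the full CDS, with its line breaks, to `translate` would allow the same comparison over the whole record.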