; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0032574 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0032574
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionZinc finger matrin-type protein 1, putative isoform 1
Genome locationchr11:34824980..34830502
RNA-Seq ExpressionLag0032574
SyntenyLag0032574
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022131237.1 uncharacterized protein LOC111004499 [Momordica charantia]2.6e-11381.54Show/hide
Query:  MITLAYAYLSSSPSNLFSLKLRLFKPSSAFSTPLSNLKPLNPCHKSASNKRRIANGICRAELGNDAPFALAIGACILSSLVFPAAGGGSDDEGDAVIDST
        MI+LAYA LSSSPSNL SLKLRL +P S FST LSNLK LNPC K+AS+++RI NG+CRA+LGND PFA+AIGACILSS VFP AGGGSDDE DAVIDST
Subjt:  MITLAYAYLSSSPSNLFSLKLRLFKPSSAFSTPLSNLKPLNPCHKSASNKRRIANGICRAELGNDAPFALAIGACILSSLVFPAAGGGSDDEGDAVIDST

Query:  DTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCILHIQVEVSITNGDIQPFQIFGKASNQISSTKKG
        DTR AVM IISFIPYFNWLSWVFAWLDSG+R YAVYA+VYL PYLRSNLSLSPEESWLPI SILLCI+HIQ+EVSI NGDIQPFQIFGK S +ISST +G
Subjt:  DTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCILHIQVEVSITNGDIQPFQIFGKASNQISSTKKG

Query:  RDYFKGSQEPTKKSGKKEDRKLPNAEEQLRDELRRWGDSKETLDHEQSNGEWDDEQRRKH
        RD+FKGSQ P ++SG+KED KLP+ +EQLRDE+RRWGDSKETLDHEQSNGEWDDEQRRKH
Subjt:  RDYFKGSQEPTKKSGKKEDRKLPNAEEQLRDELRRWGDSKETLDHEQSNGEWDDEQRRKH

XP_022958725.1 uncharacterized protein LOC111459865 isoform X2 [Cucurbita moschata]2.0e-11083.08Show/hide
Query:  MITLAYAYLSSSPSNLFSLK-LRLFKPSSAFSTPLSNLKPLNPCHKSASNKRRIANGICRAELGNDAPFALAIGACILSSLVFPAAGGGSDDEGDAVIDS
        MITLA AYLSSSPSN  SL  LRL KP   FST LSNLKPLNP HKSASN+R   NGICRAELGNDAPFA+AIGACILSSLV P AGGGSDD+ DAV+DS
Subjt:  MITLAYAYLSSSPSNLFSLK-LRLFKPSSAFSTPLSNLKPLNPCHKSASNKRRIANGICRAELGNDAPFALAIGACILSSLVFPAAGGGSDDEGDAVIDS

Query:  TDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCILHIQVEVSITNGDIQPFQIFGKASNQISSTKK
        TD RLAVM IISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSP+ESWLPIVSIL+CI HIQVE SI NGDIQPFQIFGK SNQIS TK 
Subjt:  TDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCILHIQVEVSITNGDIQPFQIFGKASNQISSTKK

Query:  GRDYFKGSQEPTKKSGKKEDRKLPNAEEQLRDELRRWGDSKETLDHEQSNGEWDDEQRRK
         R + KGSQ PTKKSGKK D KLP+AEEQLRDE++ WGD KETLDHEQSN EWDDEQRRK
Subjt:  GRDYFKGSQEPTKKSGKKEDRKLPNAEEQLRDELRRWGDSKETLDHEQSNGEWDDEQRRK

XP_023534481.1 uncharacterized protein LOC111796027 isoform X1 [Cucurbita pepo subsp. pepo]1.2e-11083.91Show/hide
Query:  MITLAYAYLSSSPSNLFSLK-LRLFKPSSAFSTPLSNLKPLNPCHKSASN-KRRIANGICRAELGNDAPFALAIGACILSSLVFPAAGGGSDDEGDAVID
        MITLA AYLSSSPSN  SL  LRL KP   FST LSNLKPLNP HKSASN KR   NGICRAELGNDAPFA+AIGACIL+SLV P AGGGSDD+ DAV+D
Subjt:  MITLAYAYLSSSPSNLFSLK-LRLFKPSSAFSTPLSNLKPLNPCHKSASN-KRRIANGICRAELGNDAPFALAIGACILSSLVFPAAGGGSDDEGDAVID

Query:  STDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCILHIQVEVSITNGDIQPFQIFGKASNQISSTK
        STD RLAVM IISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSP+ESWLPIVSIL+CI HIQVE SI NGDIQPFQIFGKASNQIS TK
Subjt:  STDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCILHIQVEVSITNGDIQPFQIFGKASNQISSTK

Query:  KGRDYFKGSQEPTKKSGKKEDRKLPNAEEQLRDELRRWGDSKETLDHEQSNGEWDDEQRRK
         GR + KGSQ PTKKSGKK D KLP+AEEQLRDE+R WGD KETLDHEQSN EWDDEQRRK
Subjt:  KGRDYFKGSQEPTKKSGKKEDRKLPNAEEQLRDELRRWGDSKETLDHEQSNGEWDDEQRRK

XP_023534483.1 uncharacterized protein LOC111796027 isoform X2 [Cucurbita pepo subsp. pepo]1.4e-11183.85Show/hide
Query:  MITLAYAYLSSSPSNLFSLK-LRLFKPSSAFSTPLSNLKPLNPCHKSASNKRRIANGICRAELGNDAPFALAIGACILSSLVFPAAGGGSDDEGDAVIDS
        MITLA AYLSSSPSN  SL  LRL KP   FST LSNLKPLNP HKSASN+R   NGICRAELGNDAPFA+AIGACIL+SLV P AGGGSDD+ DAV+DS
Subjt:  MITLAYAYLSSSPSNLFSLK-LRLFKPSSAFSTPLSNLKPLNPCHKSASNKRRIANGICRAELGNDAPFALAIGACILSSLVFPAAGGGSDDEGDAVIDS

Query:  TDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCILHIQVEVSITNGDIQPFQIFGKASNQISSTKK
        TD RLAVM IISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSP+ESWLPIVSIL+CI HIQVE SI NGDIQPFQIFGKASNQIS TK 
Subjt:  TDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCILHIQVEVSITNGDIQPFQIFGKASNQISSTKK

Query:  GRDYFKGSQEPTKKSGKKEDRKLPNAEEQLRDELRRWGDSKETLDHEQSNGEWDDEQRRK
        GR + KGSQ PTKKSGKK D KLP+AEEQLRDE+R WGD KETLDHEQSN EWDDEQRRK
Subjt:  GRDYFKGSQEPTKKSGKKEDRKLPNAEEQLRDELRRWGDSKETLDHEQSNGEWDDEQRRK

XP_038875289.1 uncharacterized protein LOC120067780 [Benincasa hispida]3.4e-11384.67Show/hide
Query:  MITLAYAYLSSSPSNLFSLK-LRLFKPSSAFSTPLSNLKPLNPCHKSASNKRRIANGICRAELGNDAPFALAIGACILSSLVFPAAGGGSDDEGDAVIDS
        MITLA AYLSSS SNL SLK LRLFKPSS FS  LSNLKPLNP  K  SN+ RI NGICRAELGNDAPFA+AIGAC LSSLV P A G SDDE DA+IDS
Subjt:  MITLAYAYLSSSPSNLFSLK-LRLFKPSSAFSTPLSNLKPLNPCHKSASNKRRIANGICRAELGNDAPFALAIGACILSSLVFPAAGGGSDDEGDAVIDS

Query:  TDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCILHIQVEVSITNGDIQPFQIFGKASNQISSTKK
        TDTRLAVMSIISFIPYFNWLSWVFAWLDSG+R YAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCI+HIQ+EVSITNGDIQP QIFGKAS  ISSTKK
Subjt:  TDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCILHIQVEVSITNGDIQPFQIFGKASNQISSTKK

Query:  GRDYFKGSQEPTKKSGKKEDRKLPNAEEQLRDELRRWGDSKETLDHEQSNGEWDDEQRRKH
        GRD+FKGSQ P K+SGKKEDRKLP+AEEQ +D++RRWGDSKE LD+EQSNGEWDDEQRRKH
Subjt:  GRDYFKGSQEPTKKSGKKEDRKLPNAEEQLRDELRRWGDSKETLDHEQSNGEWDDEQRRKH

TrEMBL top hitse value%identityAlignment
A0A6J1BQE6 uncharacterized protein LOC1110044991.3e-11381.54Show/hide
Query:  MITLAYAYLSSSPSNLFSLKLRLFKPSSAFSTPLSNLKPLNPCHKSASNKRRIANGICRAELGNDAPFALAIGACILSSLVFPAAGGGSDDEGDAVIDST
        MI+LAYA LSSSPSNL SLKLRL +P S FST LSNLK LNPC K+AS+++RI NG+CRA+LGND PFA+AIGACILSS VFP AGGGSDDE DAVIDST
Subjt:  MITLAYAYLSSSPSNLFSLKLRLFKPSSAFSTPLSNLKPLNPCHKSASNKRRIANGICRAELGNDAPFALAIGACILSSLVFPAAGGGSDDEGDAVIDST

Query:  DTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCILHIQVEVSITNGDIQPFQIFGKASNQISSTKKG
        DTR AVM IISFIPYFNWLSWVFAWLDSG+R YAVYA+VYL PYLRSNLSLSPEESWLPI SILLCI+HIQ+EVSI NGDIQPFQIFGK S +ISST +G
Subjt:  DTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCILHIQVEVSITNGDIQPFQIFGKASNQISSTKKG

Query:  RDYFKGSQEPTKKSGKKEDRKLPNAEEQLRDELRRWGDSKETLDHEQSNGEWDDEQRRKH
        RD+FKGSQ P ++SG+KED KLP+ +EQLRDE+RRWGDSKETLDHEQSNGEWDDEQRRKH
Subjt:  RDYFKGSQEPTKKSGKKEDRKLPNAEEQLRDELRRWGDSKETLDHEQSNGEWDDEQRRKH

A0A6J1H4A9 uncharacterized protein LOC111459865 isoform X29.9e-11183.08Show/hide
Query:  MITLAYAYLSSSPSNLFSLK-LRLFKPSSAFSTPLSNLKPLNPCHKSASNKRRIANGICRAELGNDAPFALAIGACILSSLVFPAAGGGSDDEGDAVIDS
        MITLA AYLSSSPSN  SL  LRL KP   FST LSNLKPLNP HKSASN+R   NGICRAELGNDAPFA+AIGACILSSLV P AGGGSDD+ DAV+DS
Subjt:  MITLAYAYLSSSPSNLFSLK-LRLFKPSSAFSTPLSNLKPLNPCHKSASNKRRIANGICRAELGNDAPFALAIGACILSSLVFPAAGGGSDDEGDAVIDS

Query:  TDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCILHIQVEVSITNGDIQPFQIFGKASNQISSTKK
        TD RLAVM IISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSP+ESWLPIVSIL+CI HIQVE SI NGDIQPFQIFGK SNQIS TK 
Subjt:  TDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCILHIQVEVSITNGDIQPFQIFGKASNQISSTKK

Query:  GRDYFKGSQEPTKKSGKKEDRKLPNAEEQLRDELRRWGDSKETLDHEQSNGEWDDEQRRK
         R + KGSQ PTKKSGKK D KLP+AEEQLRDE++ WGD KETLDHEQSN EWDDEQRRK
Subjt:  GRDYFKGSQEPTKKSGKKEDRKLPNAEEQLRDELRRWGDSKETLDHEQSNGEWDDEQRRK

A0A6J1H5Y1 uncharacterized protein LOC111459865 isoform X13.8e-11083.14Show/hide
Query:  MITLAYAYLSSSPSNLFSLK-LRLFKPSSAFSTPLSNLKPLNPCHKSASNKRRIA-NGICRAELGNDAPFALAIGACILSSLVFPAAGGGSDDEGDAVID
        MITLA AYLSSSPSN  SL  LRL KP   FST LSNLKPLNP HKSASN+RR   NGICRAELGNDAPFA+AIGACILSSLV P AGGGSDD+ DAV+D
Subjt:  MITLAYAYLSSSPSNLFSLK-LRLFKPSSAFSTPLSNLKPLNPCHKSASNKRRIA-NGICRAELGNDAPFALAIGACILSSLVFPAAGGGSDDEGDAVID

Query:  STDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCILHIQVEVSITNGDIQPFQIFGKASNQISSTK
        STD RLAVM IISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSP+ESWLPIVSIL+CI HIQVE SI NGDIQPFQIFGK SNQIS TK
Subjt:  STDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCILHIQVEVSITNGDIQPFQIFGKASNQISSTK

Query:  KGRDYFKGSQEPTKKSGKKEDRKLPNAEEQLRDELRRWGDSKETLDHEQSNGEWDDEQRRK
          R + KGSQ PTKKSGKK D KLP+AEEQLRDE++ WGD KETLDHEQSN EWDDEQRRK
Subjt:  KGRDYFKGSQEPTKKSGKKEDRKLPNAEEQLRDELRRWGDSKETLDHEQSNGEWDDEQRRK

A0A6J1K887 uncharacterized protein LOC111491538 isoform X21.7e-11082.69Show/hide
Query:  MITLAYAYLSSSPSNLFSLK-LRLFKPSSAFSTPLSNLKPLNPCHKSASNKRRIANGICRAELGNDAPFALAIGACILSSLVFPAAGGGSDDEGDAVIDS
        M+TLA AYLSSSPSN  SL  LRL KP   FST LSNLKPLNP HKSASN+R   NGICRAELGNDAPFA+AIGACILSSLV P AGGGSDD+ DAV+DS
Subjt:  MITLAYAYLSSSPSNLFSLK-LRLFKPSSAFSTPLSNLKPLNPCHKSASNKRRIANGICRAELGNDAPFALAIGACILSSLVFPAAGGGSDDEGDAVIDS

Query:  TDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCILHIQVEVSITNGDIQPFQIFGKASNQISSTKK
        TD RLAVM IISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSP+ESWLPIVSIL+CI HIQVE SI NGDIQPFQIFGKASNQIS T+ 
Subjt:  TDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCILHIQVEVSITNGDIQPFQIFGKASNQISSTKK

Query:  GRDYFKGSQEPTKKSGKKEDRKLPNAEEQLRDELRRWGDSKETLDHEQSNGEWDDEQRRK
        GR + KG + PTKKSGKK D KLP+AEEQLRDE+R WGD KETLDHEQSN EWDDEQRRK
Subjt:  GRDYFKGSQEPTKKSGKKEDRKLPNAEEQLRDELRRWGDSKETLDHEQSNGEWDDEQRRK

A0A6J1KAA9 uncharacterized protein LOC111491538 isoform X16.4e-11082.76Show/hide
Query:  MITLAYAYLSSSPSNLFSLK-LRLFKPSSAFSTPLSNLKPLNPCHKSASNKRRIA-NGICRAELGNDAPFALAIGACILSSLVFPAAGGGSDDEGDAVID
        M+TLA AYLSSSPSN  SL  LRL KP   FST LSNLKPLNP HKSASN+RR   NGICRAELGNDAPFA+AIGACILSSLV P AGGGSDD+ DAV+D
Subjt:  MITLAYAYLSSSPSNLFSLK-LRLFKPSSAFSTPLSNLKPLNPCHKSASNKRRIA-NGICRAELGNDAPFALAIGACILSSLVFPAAGGGSDDEGDAVID

Query:  STDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCILHIQVEVSITNGDIQPFQIFGKASNQISSTK
        STD RLAVM IISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSP+ESWLPIVSIL+CI HIQVE SI NGDIQPFQIFGKASNQIS T+
Subjt:  STDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCILHIQVEVSITNGDIQPFQIFGKASNQISSTK

Query:  KGRDYFKGSQEPTKKSGKKEDRKLPNAEEQLRDELRRWGDSKETLDHEQSNGEWDDEQRRK
         GR + KG + PTKKSGKK D KLP+AEEQLRDE+R WGD KETLDHEQSN EWDDEQRRK
Subjt:  KGRDYFKGSQEPTKKSGKKEDRKLPNAEEQLRDELRRWGDSKETLDHEQSNGEWDDEQRRK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G41960.1 unknown protein7.4e-5055.45Show/hide
Query:  SSPSNLFSLKLRLFKPSSAFSTPLSNLKPLNPCHKSASNKRRIANGICRAELGNDAPFALAIGACILSSLVFPAAGGGSDDEGD---AVIDSTDTRLAVM
        SS ++L++ K RL   SS+  +PL    PL    K     R+I   ICRAE   DAP   AIGACILSS VFP A   +D+E +   + I STD RLA M
Subjt:  SSPSNLFSLKLRLFKPSSAFSTPLSNLKPLNPCHKSASNKRRIANGICRAELGNDAPFALAIGACILSSLVFPAAGGGSDDEGD---AVIDSTDTRLAVM

Query:  SIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCILHIQVEVSITNGDIQPFQIFGKASNQISSTKKG---RDYF
         IISFIPYFNWLSWVFAWLD+GK RYAVYA+VYL PYL SNLS+SPEESWLPI SI+L I+H+Q+E SI NGD++    F   S+   S+KK    + +F
Subjt:  SIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCILHIQVEVSITNGDIQPFQIFGKASNQISSTKKG---RDYF

Query:  KG
        KG
Subjt:  KG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCAGCGTGGGTCTCAGCTTCAATCGTTGTAGGTCTGTTTTTCGCGTCGATTTGGGTCTCGATTTGAGACTGTCTCTTCTTCGACGTGGGGGCCTCGGTTCAGAGGT
CGCTGGAGTGCGTCGTTGGTTTCATAGACTTCCTCTTGCCAGCGTGGAAACTGACCCAGAGGAAGACCAGACCAAAGGGCCTGGCCAAGTCGGCCCGCGCCGCCTCCGTT
TGGTCCCTGTTGCCTCTAAGCGTCCTGATTCTGGCCTGGCCAAGTCGGCCCGCGCCGCCTCCGTTTGGTCCCTGTTGCCTCTAAGCGTCCTGATTCGATCTGAAGGGATC
CCGAACTCTATTTTCTACTCTTGCTCTCTTGCTACCCTCCTTCCGTTTTCTGACTTAAGCATCGGAGGCAGTGTGACAAGCACCACACCAGTGTGCAGAATTCGAACGCC
AATGATCACGCTGGCTTATGCTTATTTATCATCATCCCCTTCCAATCTCTTTTCTCTCAAGCTTCGTCTCTTCAAACCCTCTTCCGCCTTCTCGACACCACTCTCCAATC
TCAAACCCTTAAATCCTTGCCACAAATCCGCTTCCAATAAGAGGAGGATCGCAAATGGGATTTGCAGGGCGGAATTGGGGAACGACGCGCCTTTCGCCCTTGCGATCGGG
GCCTGCATTCTCAGTTCTCTCGTTTTTCCGGCAGCCGGCGGTGGTTCCGATGATGAGGGCGATGCCGTTATTGATTCCACCGATACCAGGCTCGCTGTCATGAGCATCAT
TAGCTTTATCCCCTACTTCAACTGGCTGAGTTGGGTTTTTGCGTGGCTTGATTCTGGGAAAAGACGTTATGCTGTGTATGCAATCGTGTATTTGGCTCCTTATCTAAGGT
CGAATTTATCATTGTCGCCCGAAGAGAGCTGGCTTCCTATTGTCAGTATACTTCTCTGCATTCTTCACATTCAGGTTGAAGTGAGCATTACAAATGGAGATATTCAACCC
TTCCAAATATTTGGCAAAGCTTCAAATCAAATTTCTTCAACAAAGAAAGGGAGAGACTATTTCAAGGGCTCCCAAGAACCAACCAAAAAGAGTGGAAAAAAAGAGGACAG
GAAACTGCCAAATGCTGAAGAACAATTGAGAGATGAGCTTAGAAGGTGGGGAGATTCTAAAGAGACATTAGACCATGAACAATCAAATGGAGAATGGGATGACGAACAGA
GGAGAAAACATTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGCAGCGTGGGTCTCAGCTTCAATCGTTGTAGGTCTGTTTTTCGCGTCGATTTGGGTCTCGATTTGAGACTGTCTCTTCTTCGACGTGGGGGCCTCGGTTCAGAGGT
CGCTGGAGTGCGTCGTTGGTTTCATAGACTTCCTCTTGCCAGCGTGGAAACTGACCCAGAGGAAGACCAGACCAAAGGGCCTGGCCAAGTCGGCCCGCGCCGCCTCCGTT
TGGTCCCTGTTGCCTCTAAGCGTCCTGATTCTGGCCTGGCCAAGTCGGCCCGCGCCGCCTCCGTTTGGTCCCTGTTGCCTCTAAGCGTCCTGATTCGATCTGAAGGGATC
CCGAACTCTATTTTCTACTCTTGCTCTCTTGCTACCCTCCTTCCGTTTTCTGACTTAAGCATCGGAGGCAGTGTGACAAGCACCACACCAGTGTGCAGAATTCGAACGCC
AATGATCACGCTGGCTTATGCTTATTTATCATCATCCCCTTCCAATCTCTTTTCTCTCAAGCTTCGTCTCTTCAAACCCTCTTCCGCCTTCTCGACACCACTCTCCAATC
TCAAACCCTTAAATCCTTGCCACAAATCCGCTTCCAATAAGAGGAGGATCGCAAATGGGATTTGCAGGGCGGAATTGGGGAACGACGCGCCTTTCGCCCTTGCGATCGGG
GCCTGCATTCTCAGTTCTCTCGTTTTTCCGGCAGCCGGCGGTGGTTCCGATGATGAGGGCGATGCCGTTATTGATTCCACCGATACCAGGCTCGCTGTCATGAGCATCAT
TAGCTTTATCCCCTACTTCAACTGGCTGAGTTGGGTTTTTGCGTGGCTTGATTCTGGGAAAAGACGTTATGCTGTGTATGCAATCGTGTATTTGGCTCCTTATCTAAGGT
CGAATTTATCATTGTCGCCCGAAGAGAGCTGGCTTCCTATTGTCAGTATACTTCTCTGCATTCTTCACATTCAGGTTGAAGTGAGCATTACAAATGGAGATATTCAACCC
TTCCAAATATTTGGCAAAGCTTCAAATCAAATTTCTTCAACAAAGAAAGGGAGAGACTATTTCAAGGGCTCCCAAGAACCAACCAAAAAGAGTGGAAAAAAAGAGGACAG
GAAACTGCCAAATGCTGAAGAACAATTGAGAGATGAGCTTAGAAGGTGGGGAGATTCTAAAGAGACATTAGACCATGAACAATCAAATGGAGAATGGGATGACGAACAGA
GGAGAAAACATTAG
Protein sequenceShow/hide protein sequence
MSSVGLSFNRCRSVFRVDLGLDLRLSLLRRGGLGSEVAGVRRWFHRLPLASVETDPEEDQTKGPGQVGPRRLRLVPVASKRPDSGLAKSARAASVWSLLPLSVLIRSEGI
PNSIFYSCSLATLLPFSDLSIGGSVTSTTPVCRIRTPMITLAYAYLSSSPSNLFSLKLRLFKPSSAFSTPLSNLKPLNPCHKSASNKRRIANGICRAELGNDAPFALAIG
ACILSSLVFPAAGGGSDDEGDAVIDSTDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCILHIQVEVSITNGDIQP
FQIFGKASNQISSTKKGRDYFKGSQEPTKKSGKKEDRKLPNAEEQLRDELRRWGDSKETLDHEQSNGEWDDEQRRKH