; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0027229 (gene) of Chayote v1 genome

Gene IDSed0027229
OrganismSechium edule (Chayote v1)
DescriptionZinc finger matrin-type protein 1, putative isoform 1
Genome locationLG01:67080602..67082942
RNA-Seq ExpressionSed0027229
SyntenySed0027229
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6605602.1 hypothetical protein SDJN03_02919, partial [Cucurbita argyrosperma subsp. sororia]1.0e-10278.33Show/hide
Query:  MITLAHTYLSSAPSNLSSLK-LRLTKPPSTFSTSLSDLKPLNPSNKSGSNQRRRRIGYGICRAELGTDAPFAIAVGACILSSLVFPADGGGSDDEGDDAV
        MITLA  YLSS+PSN SSL  LRLTKPP TFSTSLS+LKPLNPS+KS SNQRR     GICRAELG DAPFAIA+GACILSSLV P  GGGSDD+  DAV
Subjt:  MITLAHTYLSSAPSNLSSLK-LRLTKPPSTFSTSLSDLKPLNPSNKSGSNQRRRRIGYGICRAELGTDAPFAIAVGACILSSLVFPADGGGSDDEGDDAV

Query:  IDSTDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWIPIISILLCIAHIQIEVSITNGDIQPFPIFGRASNQISS
        +DSTD RLAVM IISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSP+ESW+PI+SIL+CIAHIQ+E SI NGDIQPF IFG+ SNQIS 
Subjt:  IDSTDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWIPIISILLCIAHIQIEVSITNGDIQPFPIFGRASNQISS

Query:  MKERRDHFRDSQGPSKMSEKKRDTKLPSGEEQVRDEIRRWEDSTETLDHEQTNGEWDDEQRRK
         K  R H + SQGP+K S KKRD KLPS EEQ+RDEI+ W D  ETLDHEQ N EWDDEQRRK
Subjt:  MKERRDHFRDSQGPSKMSEKKRDTKLPSGEEQVRDEIRRWEDSTETLDHEQTNGEWDDEQRRK

XP_022958724.1 uncharacterized protein LOC111459865 isoform X1 [Cucurbita moschata]3.5e-10378.33Show/hide
Query:  MITLAHTYLSSAPSNLSSLK-LRLTKPPSTFSTSLSDLKPLNPSNKSGSNQRRRRIGYGICRAELGTDAPFAIAVGACILSSLVFPADGGGSDDEGDDAV
        MITLA  YLSS+PSN SSL  LRLTKPP TFSTSLS+LKPLNPS+KS SNQRR     GICRAELG DAPFAIA+GACILSSLV P  GGGSDD+  DAV
Subjt:  MITLAHTYLSSAPSNLSSLK-LRLTKPPSTFSTSLSDLKPLNPSNKSGSNQRRRRIGYGICRAELGTDAPFAIAVGACILSSLVFPADGGGSDDEGDDAV

Query:  IDSTDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWIPIISILLCIAHIQIEVSITNGDIQPFPIFGRASNQISS
        +DSTD RLAVM IISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSP+ESW+PI+SIL+CIAHIQ+E SI NGDIQPF IFG+ SNQIS 
Subjt:  IDSTDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWIPIISILLCIAHIQIEVSITNGDIQPFPIFGRASNQISS

Query:  MKERRDHFRDSQGPSKMSEKKRDTKLPSGEEQVRDEIRRWEDSTETLDHEQTNGEWDDEQRRK
         K  R H + SQGP+K S KKRD KLPS EEQ+RDEI+ W D  ETLDHEQ+N EWDDEQRRK
Subjt:  MKERRDHFRDSQGPSKMSEKKRDTKLPSGEEQVRDEIRRWEDSTETLDHEQTNGEWDDEQRRK

XP_022958725.1 uncharacterized protein LOC111459865 isoform X2 [Cucurbita moschata]2.7e-10378.33Show/hide
Query:  MITLAHTYLSSAPSNLSSLK-LRLTKPPSTFSTSLSDLKPLNPSNKSGSNQRRRRIGYGICRAELGTDAPFAIAVGACILSSLVFPADGGGSDDEGDDAV
        MITLA  YLSS+PSN SSL  LRLTKPP TFSTSLS+LKPLNPS+KS SNQR  R   GICRAELG DAPFAIA+GACILSSLV P  GGGSDD+  DAV
Subjt:  MITLAHTYLSSAPSNLSSLK-LRLTKPPSTFSTSLSDLKPLNPSNKSGSNQRRRRIGYGICRAELGTDAPFAIAVGACILSSLVFPADGGGSDDEGDDAV

Query:  IDSTDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWIPIISILLCIAHIQIEVSITNGDIQPFPIFGRASNQISS
        +DSTD RLAVM IISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSP+ESW+PI+SIL+CIAHIQ+E SI NGDIQPF IFG+ SNQIS 
Subjt:  IDSTDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWIPIISILLCIAHIQIEVSITNGDIQPFPIFGRASNQISS

Query:  MKERRDHFRDSQGPSKMSEKKRDTKLPSGEEQVRDEIRRWEDSTETLDHEQTNGEWDDEQRRK
         K  R H + SQGP+K S KKRD KLPS EEQ+RDEI+ W D  ETLDHEQ+N EWDDEQRRK
Subjt:  MKERRDHFRDSQGPSKMSEKKRDTKLPSGEEQVRDEIRRWEDSTETLDHEQTNGEWDDEQRRK

XP_023534481.1 uncharacterized protein LOC111796027 isoform X1 [Cucurbita pepo subsp. pepo]3.5e-10378.33Show/hide
Query:  MITLAHTYLSSAPSNLSSLK-LRLTKPPSTFSTSLSDLKPLNPSNKSGSNQRRRRIGYGICRAELGTDAPFAIAVGACILSSLVFPADGGGSDDEGDDAV
        MITLA  YLSS+PSN SSL  LRLTKPP TFSTSLS+LKPLNPS+KS SNQ+R     GICRAELG DAPFAIA+GACIL+SLV P  GGGSDD+  DAV
Subjt:  MITLAHTYLSSAPSNLSSLK-LRLTKPPSTFSTSLSDLKPLNPSNKSGSNQRRRRIGYGICRAELGTDAPFAIAVGACILSSLVFPADGGGSDDEGDDAV

Query:  IDSTDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWIPIISILLCIAHIQIEVSITNGDIQPFPIFGRASNQISS
        +DSTD RLAVM IISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSP+ESW+PI+SIL+CIAHIQ+E SI NGDIQPF IFG+ASNQIS 
Subjt:  IDSTDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWIPIISILLCIAHIQIEVSITNGDIQPFPIFGRASNQISS

Query:  MKERRDHFRDSQGPSKMSEKKRDTKLPSGEEQVRDEIRRWEDSTETLDHEQTNGEWDDEQRRK
         K  R H + SQGP+K S KKRD KLPS EEQ+RDEIR W D  ETLDHEQ+N EWDDEQRRK
Subjt:  MKERRDHFRDSQGPSKMSEKKRDTKLPSGEEQVRDEIRRWEDSTETLDHEQTNGEWDDEQRRK

XP_023534483.1 uncharacterized protein LOC111796027 isoform X2 [Cucurbita pepo subsp. pepo]1.2e-10378.71Show/hide
Query:  MITLAHTYLSSAPSNLSSLK-LRLTKPPSTFSTSLSDLKPLNPSNKSGSNQRRRRIGYGICRAELGTDAPFAIAVGACILSSLVFPADGGGSDDEGDDAV
        MITLA  YLSS+PSN SSL  LRLTKPP TFSTSLS+LKPLNPS+KS SNQR  R   GICRAELG DAPFAIA+GACIL+SLV P  GGGSDD+  DAV
Subjt:  MITLAHTYLSSAPSNLSSLK-LRLTKPPSTFSTSLSDLKPLNPSNKSGSNQRRRRIGYGICRAELGTDAPFAIAVGACILSSLVFPADGGGSDDEGDDAV

Query:  IDSTDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWIPIISILLCIAHIQIEVSITNGDIQPFPIFGRASNQISS
        +DSTD RLAVM IISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSP+ESW+PI+SIL+CIAHIQ+E SI NGDIQPF IFG+ASNQIS 
Subjt:  IDSTDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWIPIISILLCIAHIQIEVSITNGDIQPFPIFGRASNQISS

Query:  MKERRDHFRDSQGPSKMSEKKRDTKLPSGEEQVRDEIRRWEDSTETLDHEQTNGEWDDEQRRK
         K  R H + SQGP+K S KKRD KLPS EEQ+RDEIR W D  ETLDHEQ+N EWDDEQRRK
Subjt:  MKERRDHFRDSQGPSKMSEKKRDTKLPSGEEQVRDEIRRWEDSTETLDHEQTNGEWDDEQRRK

TrEMBL top hitse value%identityAlignment
A0A6J1BQE6 uncharacterized protein LOC1110044998.4e-10375.67Show/hide
Query:  MITLAHTYLSSAPSNLSSLKLRLTKPPSTFSTSLSDLKPLNPSNKSGSNQRRRRIGYGICRAELGTDAPFAIAVGACILSSLVFPADGGGSDDEGDDAVI
        MI+LA+  LSS+PSNLSSLKLRL +PPSTFSTSLS+LK LNP +K+ S+Q  +RIG G+CRA+LG D PFA+A+GACILSS VFP  GGGSDDE  DAVI
Subjt:  MITLAHTYLSSAPSNLSSLKLRLTKPPSTFSTSLSDLKPLNPSNKSGSNQRRRRIGYGICRAELGTDAPFAIAVGACILSSLVFPADGGGSDDEGDDAVI

Query:  DSTDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWIPIISILLCIAHIQIEVSITNGDIQPFPIFGRASNQISSM
        DSTDTR AVM IISFIPYFNWLSWVFAWLDSG+R YAVYA+VYL PYLRSNLSLSPEESW+PI SILLCI HIQ+EVSI NGDIQPF IFG+ S +ISS 
Subjt:  DSTDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWIPIISILLCIAHIQIEVSITNGDIQPFPIFGRASNQISSM

Query:  KERRDHFRDSQGPSKMSEKKRDTKLPSGEEQVRDEIRRWEDSTETLDHEQTNGEWDDEQRRKH
           RDHF+ SQGP + S +K D KLPS +EQ+RDEIRRW DS ETLDHEQ+NGEWDDEQRRKH
Subjt:  KERRDHFRDSQGPSKMSEKKRDTKLPSGEEQVRDEIRRWEDSTETLDHEQTNGEWDDEQRRKH

A0A6J1H4A9 uncharacterized protein LOC111459865 isoform X21.3e-10378.33Show/hide
Query:  MITLAHTYLSSAPSNLSSLK-LRLTKPPSTFSTSLSDLKPLNPSNKSGSNQRRRRIGYGICRAELGTDAPFAIAVGACILSSLVFPADGGGSDDEGDDAV
        MITLA  YLSS+PSN SSL  LRLTKPP TFSTSLS+LKPLNPS+KS SNQR  R   GICRAELG DAPFAIA+GACILSSLV P  GGGSDD+  DAV
Subjt:  MITLAHTYLSSAPSNLSSLK-LRLTKPPSTFSTSLSDLKPLNPSNKSGSNQRRRRIGYGICRAELGTDAPFAIAVGACILSSLVFPADGGGSDDEGDDAV

Query:  IDSTDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWIPIISILLCIAHIQIEVSITNGDIQPFPIFGRASNQISS
        +DSTD RLAVM IISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSP+ESW+PI+SIL+CIAHIQ+E SI NGDIQPF IFG+ SNQIS 
Subjt:  IDSTDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWIPIISILLCIAHIQIEVSITNGDIQPFPIFGRASNQISS

Query:  MKERRDHFRDSQGPSKMSEKKRDTKLPSGEEQVRDEIRRWEDSTETLDHEQTNGEWDDEQRRK
         K  R H + SQGP+K S KKRD KLPS EEQ+RDEI+ W D  ETLDHEQ+N EWDDEQRRK
Subjt:  MKERRDHFRDSQGPSKMSEKKRDTKLPSGEEQVRDEIRRWEDSTETLDHEQTNGEWDDEQRRK

A0A6J1H5Y1 uncharacterized protein LOC111459865 isoform X11.7e-10378.33Show/hide
Query:  MITLAHTYLSSAPSNLSSLK-LRLTKPPSTFSTSLSDLKPLNPSNKSGSNQRRRRIGYGICRAELGTDAPFAIAVGACILSSLVFPADGGGSDDEGDDAV
        MITLA  YLSS+PSN SSL  LRLTKPP TFSTSLS+LKPLNPS+KS SNQRR     GICRAELG DAPFAIA+GACILSSLV P  GGGSDD+  DAV
Subjt:  MITLAHTYLSSAPSNLSSLK-LRLTKPPSTFSTSLSDLKPLNPSNKSGSNQRRRRIGYGICRAELGTDAPFAIAVGACILSSLVFPADGGGSDDEGDDAV

Query:  IDSTDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWIPIISILLCIAHIQIEVSITNGDIQPFPIFGRASNQISS
        +DSTD RLAVM IISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSP+ESW+PI+SIL+CIAHIQ+E SI NGDIQPF IFG+ SNQIS 
Subjt:  IDSTDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWIPIISILLCIAHIQIEVSITNGDIQPFPIFGRASNQISS

Query:  MKERRDHFRDSQGPSKMSEKKRDTKLPSGEEQVRDEIRRWEDSTETLDHEQTNGEWDDEQRRK
         K  R H + SQGP+K S KKRD KLPS EEQ+RDEI+ W D  ETLDHEQ+N EWDDEQRRK
Subjt:  MKERRDHFRDSQGPSKMSEKKRDTKLPSGEEQVRDEIRRWEDSTETLDHEQTNGEWDDEQRRK

A0A6J1K887 uncharacterized protein LOC111491538 isoform X24.2e-10277.19Show/hide
Query:  MITLAHTYLSSAPSNLSSLK-LRLTKPPSTFSTSLSDLKPLNPSNKSGSNQRRRRIGYGICRAELGTDAPFAIAVGACILSSLVFPADGGGSDDEGDDAV
        M+TLA  YLSS+PSN SSL  LRL+KPP TFSTSLS+LKPLNPS+KS SNQR  R   GICRAELG DAPFAIA+GACILSSLV P  GGGSDD+  DAV
Subjt:  MITLAHTYLSSAPSNLSSLK-LRLTKPPSTFSTSLSDLKPLNPSNKSGSNQRRRRIGYGICRAELGTDAPFAIAVGACILSSLVFPADGGGSDDEGDDAV

Query:  IDSTDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWIPIISILLCIAHIQIEVSITNGDIQPFPIFGRASNQISS
        +DSTD RLAVM IISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSP+ESW+PI+SIL+CIAHIQ+E SI NGDIQPF IFG+ASNQIS 
Subjt:  IDSTDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWIPIISILLCIAHIQIEVSITNGDIQPFPIFGRASNQISS

Query:  MKERRDHFRDSQGPSKMSEKKRDTKLPSGEEQVRDEIRRWEDSTETLDHEQTNGEWDDEQRRK
         +  R H +  +GP+K S KKRD KLPS EEQ+RDEIR W D  ETLDHEQ+N EWDDEQRRK
Subjt:  MKERRDHFRDSQGPSKMSEKKRDTKLPSGEEQVRDEIRRWEDSTETLDHEQTNGEWDDEQRRK

A0A6J1KAA9 uncharacterized protein LOC111491538 isoform X15.4e-10277.19Show/hide
Query:  MITLAHTYLSSAPSNLSSLK-LRLTKPPSTFSTSLSDLKPLNPSNKSGSNQRRRRIGYGICRAELGTDAPFAIAVGACILSSLVFPADGGGSDDEGDDAV
        M+TLA  YLSS+PSN SSL  LRL+KPP TFSTSLS+LKPLNPS+KS SNQRR     GICRAELG DAPFAIA+GACILSSLV P  GGGSDD+  DAV
Subjt:  MITLAHTYLSSAPSNLSSLK-LRLTKPPSTFSTSLSDLKPLNPSNKSGSNQRRRRIGYGICRAELGTDAPFAIAVGACILSSLVFPADGGGSDDEGDDAV

Query:  IDSTDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWIPIISILLCIAHIQIEVSITNGDIQPFPIFGRASNQISS
        +DSTD RLAVM IISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSP+ESW+PI+SIL+CIAHIQ+E SI NGDIQPF IFG+ASNQIS 
Subjt:  IDSTDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWIPIISILLCIAHIQIEVSITNGDIQPFPIFGRASNQISS

Query:  MKERRDHFRDSQGPSKMSEKKRDTKLPSGEEQVRDEIRRWEDSTETLDHEQTNGEWDDEQRRK
         +  R H +  +GP+K S KKRD KLPS EEQ+RDEIR W D  ETLDHEQ+N EWDDEQRRK
Subjt:  MKERRDHFRDSQGPSKMSEKKRDTKLPSGEEQVRDEIRRWEDSTETLDHEQTNGEWDDEQRRK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G41960.1 unknown protein8.4e-4759.01Show/hide
Query:  RRIGYGICRAELGTDAPFAIAVGACILSSLVFPADGGGSDDEGDD--AVIDSTDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRS
        R+I   ICRAE   DAP   A+GACILSS VFP     +D+E ++  + I STD RLA M IISFIPYFNWLSWVFAWLD+GK RYAVYA+VYL PYL S
Subjt:  RRIGYGICRAELGTDAPFAIAVGACILSSLVFPADGGGSDDEGDD--AVIDSTDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRS

Query:  NLSLSPEESWIPIISILLCIAHIQIEVSITNGDIQPFPIFGRASNQISSMKER---RDHFR
        NLS+SPEESW+PI SI+L I H+Q+E SI NGD++    F   S+   S K+R   + HF+
Subjt:  NLSLSPEESWIPIISILLCIAHIQIEVSITNGDIQPFPIFGRASNQISSMKER---RDHFR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATCACTTTAGCTCATACCTATTTGTCATCGGCCCCTTCCAATCTCTCTTCTCTGAAGCTCCGCCTCACCAAACCCCCTTCCACTTTCTCAACTTCACTGTCCGATCT
CAAACCCTTAAATCCCTCCAACAAATCCGGTTCCAATCAGAGGAGGAGGAGGATCGGATATGGGATTTGCAGGGCGGAGTTAGGGACCGACGCGCCGTTCGCCATCGCCG
TCGGTGCCTGCATTCTCAGTTCTCTTGTTTTTCCGGCGGACGGCGGTGGTTCCGATGATGAGGGCGATGATGCCGTCATTGATTCCACTGATACTAGACTCGCTGTCATG
AGTATTATTAGCTTTATCCCTTACTTCAACTGGCTGAGTTGGGTTTTTGCTTGGCTTGATTCTGGGAAAAGACGTTATGCTGTTTATGCAATTGTGTATTTGGCTCCTTA
TTTGAGGTCGAATTTATCGTTGTCGCCCGAAGAAAGTTGGATTCCGATCATCAGTATACTTCTCTGCATTGCTCACATTCAGATTGAAGTGAGCATTACAAATGGAGATA
TTCAACCCTTCCCAATATTTGGAAGAGCTTCAAATCAAATTTCTTCAATGAAAGAAAGGAGAGACCATTTCAGGGATTCCCAAGGACCATCTAAAATGAGCGAAAAGAAG
AGGGACACGAAGCTGCCATCTGGTGAAGAACAAGTGAGAGATGAGATTAGAAGATGGGAAGATTCTACAGAGACATTAGATCATGAACAAACAAATGGAGAATGGGATGA
TGAACAGAGAAGAAAACATTAG
mRNA sequenceShow/hide mRNA sequence
CTGAAAATAATTTTCCTTTAAAAAAGAAGTTTAAAAAATCCGTGCAGAAATGTCGAAAGAGGATCGGCACAGTTGCCAAGAACAATTCCACAGCGGAGTCTTCATCTTCC
CGTACTGAATTTCCGTTCTGCATTTCCGATTCTCATCTCGCCGGAATCTCAACGCCGATGATCACTTTAGCTCATACCTATTTGTCATCGGCCCCTTCCAATCTCTCTTC
TCTGAAGCTCCGCCTCACCAAACCCCCTTCCACTTTCTCAACTTCACTGTCCGATCTCAAACCCTTAAATCCCTCCAACAAATCCGGTTCCAATCAGAGGAGGAGGAGGA
TCGGATATGGGATTTGCAGGGCGGAGTTAGGGACCGACGCGCCGTTCGCCATCGCCGTCGGTGCCTGCATTCTCAGTTCTCTTGTTTTTCCGGCGGACGGCGGTGGTTCC
GATGATGAGGGCGATGATGCCGTCATTGATTCCACTGATACTAGACTCGCTGTCATGAGTATTATTAGCTTTATCCCTTACTTCAACTGGCTGAGTTGGGTTTTTGCTTG
GCTTGATTCTGGGAAAAGACGTTATGCTGTTTATGCAATTGTGTATTTGGCTCCTTATTTGAGGTCGAATTTATCGTTGTCGCCCGAAGAAAGTTGGATTCCGATCATCA
GTATACTTCTCTGCATTGCTCACATTCAGATTGAAGTGAGCATTACAAATGGAGATATTCAACCCTTCCCAATATTTGGAAGAGCTTCAAATCAAATTTCTTCAATGAAA
GAAAGGAGAGACCATTTCAGGGATTCCCAAGGACCATCTAAAATGAGCGAAAAGAAGAGGGACACGAAGCTGCCATCTGGTGAAGAACAAGTGAGAGATGAGATTAGAAG
ATGGGAAGATTCTACAGAGACATTAGATCATGAACAAACAAATGGAGAATGGGATGATGAACAGAGAAGAAAACATTAGGTTCTATATAATTGAAGTTGAAGGATAGAGA
ATGTTAGTTTTAAAGTTGAATTATGTTTAATACAGCATTAAAATTAGGTTTACCTTACCTTCTTATATAGTGGGTTACAACCATTTTCAGAAGCTTATGCTGCAGTTGTA
TTGCTAAGCCTAGTTTTCTGTCTGGTTTAAATTTCTATTCATCTTTTGCAAGAAACATAGGTTTGTTATTGAAACTTTTGTCACAAACCTCAAAAGAAAACTGATAGTCA
TTTCAATTGATATTGATAGTCATTAGTTGGATTTGTTCAACAATGT
Protein sequenceShow/hide protein sequence
MITLAHTYLSSAPSNLSSLKLRLTKPPSTFSTSLSDLKPLNPSNKSGSNQRRRRIGYGICRAELGTDAPFAIAVGACILSSLVFPADGGGSDDEGDDAVIDSTDTRLAVM
SIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWIPIISILLCIAHIQIEVSITNGDIQPFPIFGRASNQISSMKERRDHFRDSQGPSKMSEKK
RDTKLPSGEEQVRDEIRRWEDSTETLDHEQTNGEWDDEQRRKH