; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0000452 (gene) of Snake gourd v1 genome

Gene IDTan0000452
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionZinc finger matrin-type protein 1, putative isoform 1
Genome locationLG07:66249721..66251867
RNA-Seq ExpressionTan0000452
SyntenyTan0000452
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022131237.1 uncharacterized protein LOC111004499 [Momordica charantia]3.8e-11079.92Show/hide
Query:  MITLGHAYLSPSPSNLTSLKLRLLKPPSIFSPSLSNLKPLNPCNKSTSSQRRIGNGICRAELGNDAPLAIAIGACILSSLVFPAAGGGSDDEGDAVIDST
        MI+L +A LS SPSNL+SLKLRL +PPS FS SLSNLK LNPC+K+ S Q+RIGNG+CRA+LGND P A+AIGACILSS VFP AGGGSDDE DAVIDST
Subjt:  MITLGHAYLSPSPSNLTSLKLRLLKPPSIFSPSLSNLKPLNPCNKSTSSQRRIGNGICRAELGNDAPLAIAIGACILSSLVFPAAGGGSDDEGDAVIDST

Query:  DTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCIAHIQVEVSITNGDIQPFQIFGRASNQISSMKEG
        DTR AVM IISFIPYFNWLSWVFAWLDSG+R YAVYA+VYL PYLRSNLSLSPEESWLPI SILLCI HIQ+EVSI NGDIQPFQIFG+ S +ISS   G
Subjt:  DTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCIAHIQVEVSITNGDIQPFQIFGRASNQISSMKEG

Query:  RDRFKGSQGQSEKSEKKRDMKLPSAEEQLKDEIRRWGDSTETLDHEQSNGEWDDEQRRK
        RD FKGSQG  E+S +K DMKLPS +EQL+DEIRRWGDS ETLDHEQSNGEWDDEQRRK
Subjt:  RDRFKGSQGQSEKSEKKRDMKLPSAEEQLKDEIRRWGDSTETLDHEQSNGEWDDEQRRK

XP_022958725.1 uncharacterized protein LOC111459865 isoform X2 [Cucurbita moschata]2.3e-10781.15Show/hide
Query:  MITLGHAYLSPSPSNLTSLK-LRLLKPPSIFSPSLSNLKPLNPCNKSTSSQRRIGNGICRAELGNDAPLAIAIGACILSSLVFPAAGGGSDDEGDAVIDS
        MITL  AYLS SPSN +SL  LRL KPP  FS SLSNLKPLNP +KS S+QR   NGICRAELGNDAP AIAIGACILSSLV P AGGGSDD+ DAV+DS
Subjt:  MITLGHAYLSPSPSNLTSLK-LRLLKPPSIFSPSLSNLKPLNPCNKSTSSQRRIGNGICRAELGNDAPLAIAIGACILSSLVFPAAGGGSDDEGDAVIDS

Query:  TDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCIAHIQVEVSITNGDIQPFQIFGRASNQISSMKE
        TD RLAVM IISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSP+ESWLPIVSIL+CIAHIQVE SI NGDIQPFQIFG+ SNQIS  K 
Subjt:  TDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCIAHIQVEVSITNGDIQPFQIFGRASNQISSMKE

Query:  GRDRFKGSQGQSEKSEKKRDMKLPSAEEQLKDEIRRWGDSTETLDHEQSNGEWDDEQRRK
         R   KGSQG ++KS KKRDMKLPSAEEQL+DEI+ WGD  ETLDHEQSN EWDDEQRRK
Subjt:  GRDRFKGSQGQSEKSEKKRDMKLPSAEEQLKDEIRRWGDSTETLDHEQSNGEWDDEQRRK

XP_023534481.1 uncharacterized protein LOC111796027 isoform X1 [Cucurbita pepo subsp. pepo]1.8e-10781.61Show/hide
Query:  MITLGHAYLSPSPSNLTSLK-LRLLKPPSIFSPSLSNLKPLNPCNKSTSSQRR-IGNGICRAELGNDAPLAIAIGACILSSLVFPAAGGGSDDEGDAVID
        MITL  AYLS SPSN +SL  LRL KPP  FS SLSNLKPLNP +KS S+Q+R   NGICRAELGNDAP AIAIGACIL+SLV P AGGGSDD+ DAV+D
Subjt:  MITLGHAYLSPSPSNLTSLK-LRLLKPPSIFSPSLSNLKPLNPCNKSTSSQRR-IGNGICRAELGNDAPLAIAIGACILSSLVFPAAGGGSDDEGDAVID

Query:  STDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCIAHIQVEVSITNGDIQPFQIFGRASNQISSMK
        STD RLAVM IISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSP+ESWLPIVSIL+CIAHIQVE SI NGDIQPFQIFG+ASNQIS  K
Subjt:  STDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCIAHIQVEVSITNGDIQPFQIFGRASNQISSMK

Query:  EGRDRFKGSQGQSEKSEKKRDMKLPSAEEQLKDEIRRWGDSTETLDHEQSNGEWDDEQRRK
         GR   KGSQG ++KS KKRDMKLPSAEEQL+DEIR WGD  ETLDHEQSN EWDDEQRRK
Subjt:  EGRDRFKGSQGQSEKSEKKRDMKLPSAEEQLKDEIRRWGDSTETLDHEQSNGEWDDEQRRK

XP_023534483.1 uncharacterized protein LOC111796027 isoform X2 [Cucurbita pepo subsp. pepo]1.6e-10881.92Show/hide
Query:  MITLGHAYLSPSPSNLTSLK-LRLLKPPSIFSPSLSNLKPLNPCNKSTSSQRRIGNGICRAELGNDAPLAIAIGACILSSLVFPAAGGGSDDEGDAVIDS
        MITL  AYLS SPSN +SL  LRL KPP  FS SLSNLKPLNP +KS S+QR   NGICRAELGNDAP AIAIGACIL+SLV P AGGGSDD+ DAV+DS
Subjt:  MITLGHAYLSPSPSNLTSLK-LRLLKPPSIFSPSLSNLKPLNPCNKSTSSQRRIGNGICRAELGNDAPLAIAIGACILSSLVFPAAGGGSDDEGDAVIDS

Query:  TDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCIAHIQVEVSITNGDIQPFQIFGRASNQISSMKE
        TD RLAVM IISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSP+ESWLPIVSIL+CIAHIQVE SI NGDIQPFQIFG+ASNQIS  K 
Subjt:  TDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCIAHIQVEVSITNGDIQPFQIFGRASNQISSMKE

Query:  GRDRFKGSQGQSEKSEKKRDMKLPSAEEQLKDEIRRWGDSTETLDHEQSNGEWDDEQRRK
        GR   KGSQG ++KS KKRDMKLPSAEEQL+DEIR WGD  ETLDHEQSN EWDDEQRRK
Subjt:  GRDRFKGSQGQSEKSEKKRDMKLPSAEEQLKDEIRRWGDSTETLDHEQSNGEWDDEQRRK

XP_038875289.1 uncharacterized protein LOC120067780 [Benincasa hispida]1.4e-10781.92Show/hide
Query:  MITLGHAYLSPSPSNLTSLK-LRLLKPPSIFSPSLSNLKPLNPCNKSTSSQRRIGNGICRAELGNDAPLAIAIGACILSSLVFPAAGGGSDDEGDAVIDS
        MITL  AYLS S SNL+SLK LRL KP S FSPSLSNLKPLNP  K  S+Q RIGNGICRAELGNDAP AIAIGAC LSSLV P A G SDDE DA+IDS
Subjt:  MITLGHAYLSPSPSNLTSLK-LRLLKPPSIFSPSLSNLKPLNPCNKSTSSQRRIGNGICRAELGNDAPLAIAIGACILSSLVFPAAGGGSDDEGDAVIDS

Query:  TDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCIAHIQVEVSITNGDIQPFQIFGRASNQISSMKE
        TDTRLAVMSIISFIPYFNWLSWVFAWLDSG+R YAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCI HIQ+EVSITNGDIQP QIFG+AS  ISS K+
Subjt:  TDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCIAHIQVEVSITNGDIQPFQIFGRASNQISSMKE

Query:  GRDRFKGSQGQSEKSEKKRDMKLPSAEEQLKDEIRRWGDSTETLDHEQSNGEWDDEQRRK
        GRD FKGSQG  ++S KK D KLPSAEEQ +D+IRRWGDS E LD+EQSNGEWDDEQRRK
Subjt:  GRDRFKGSQGQSEKSEKKRDMKLPSAEEQLKDEIRRWGDSTETLDHEQSNGEWDDEQRRK

TrEMBL top hitse value%identityAlignment
A0A6J1BQE6 uncharacterized protein LOC1110044991.8e-11079.92Show/hide
Query:  MITLGHAYLSPSPSNLTSLKLRLLKPPSIFSPSLSNLKPLNPCNKSTSSQRRIGNGICRAELGNDAPLAIAIGACILSSLVFPAAGGGSDDEGDAVIDST
        MI+L +A LS SPSNL+SLKLRL +PPS FS SLSNLK LNPC+K+ S Q+RIGNG+CRA+LGND P A+AIGACILSS VFP AGGGSDDE DAVIDST
Subjt:  MITLGHAYLSPSPSNLTSLKLRLLKPPSIFSPSLSNLKPLNPCNKSTSSQRRIGNGICRAELGNDAPLAIAIGACILSSLVFPAAGGGSDDEGDAVIDST

Query:  DTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCIAHIQVEVSITNGDIQPFQIFGRASNQISSMKEG
        DTR AVM IISFIPYFNWLSWVFAWLDSG+R YAVYA+VYL PYLRSNLSLSPEESWLPI SILLCI HIQ+EVSI NGDIQPFQIFG+ S +ISS   G
Subjt:  DTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCIAHIQVEVSITNGDIQPFQIFGRASNQISSMKEG

Query:  RDRFKGSQGQSEKSEKKRDMKLPSAEEQLKDEIRRWGDSTETLDHEQSNGEWDDEQRRK
        RD FKGSQG  E+S +K DMKLPS +EQL+DEIRRWGDS ETLDHEQSNGEWDDEQRRK
Subjt:  RDRFKGSQGQSEKSEKKRDMKLPSAEEQLKDEIRRWGDSTETLDHEQSNGEWDDEQRRK

A0A6J1H4A9 uncharacterized protein LOC111459865 isoform X21.1e-10781.15Show/hide
Query:  MITLGHAYLSPSPSNLTSLK-LRLLKPPSIFSPSLSNLKPLNPCNKSTSSQRRIGNGICRAELGNDAPLAIAIGACILSSLVFPAAGGGSDDEGDAVIDS
        MITL  AYLS SPSN +SL  LRL KPP  FS SLSNLKPLNP +KS S+QR   NGICRAELGNDAP AIAIGACILSSLV P AGGGSDD+ DAV+DS
Subjt:  MITLGHAYLSPSPSNLTSLK-LRLLKPPSIFSPSLSNLKPLNPCNKSTSSQRRIGNGICRAELGNDAPLAIAIGACILSSLVFPAAGGGSDDEGDAVIDS

Query:  TDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCIAHIQVEVSITNGDIQPFQIFGRASNQISSMKE
        TD RLAVM IISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSP+ESWLPIVSIL+CIAHIQVE SI NGDIQPFQIFG+ SNQIS  K 
Subjt:  TDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCIAHIQVEVSITNGDIQPFQIFGRASNQISSMKE

Query:  GRDRFKGSQGQSEKSEKKRDMKLPSAEEQLKDEIRRWGDSTETLDHEQSNGEWDDEQRRK
         R   KGSQG ++KS KKRDMKLPSAEEQL+DEI+ WGD  ETLDHEQSN EWDDEQRRK
Subjt:  GRDRFKGSQGQSEKSEKKRDMKLPSAEEQLKDEIRRWGDSTETLDHEQSNGEWDDEQRRK

A0A6J1H5Y1 uncharacterized protein LOC111459865 isoform X15.6e-10781.23Show/hide
Query:  MITLGHAYLSPSPSNLTSLK-LRLLKPPSIFSPSLSNLKPLNPCNKSTSSQRR-IGNGICRAELGNDAPLAIAIGACILSSLVFPAAGGGSDDEGDAVID
        MITL  AYLS SPSN +SL  LRL KPP  FS SLSNLKPLNP +KS S+QRR   NGICRAELGNDAP AIAIGACILSSLV P AGGGSDD+ DAV+D
Subjt:  MITLGHAYLSPSPSNLTSLK-LRLLKPPSIFSPSLSNLKPLNPCNKSTSSQRR-IGNGICRAELGNDAPLAIAIGACILSSLVFPAAGGGSDDEGDAVID

Query:  STDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCIAHIQVEVSITNGDIQPFQIFGRASNQISSMK
        STD RLAVM IISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSP+ESWLPIVSIL+CIAHIQVE SI NGDIQPFQIFG+ SNQIS  K
Subjt:  STDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCIAHIQVEVSITNGDIQPFQIFGRASNQISSMK

Query:  EGRDRFKGSQGQSEKSEKKRDMKLPSAEEQLKDEIRRWGDSTETLDHEQSNGEWDDEQRRK
          R   KGSQG ++KS KKRDMKLPSAEEQL+DEI+ WGD  ETLDHEQSN EWDDEQRRK
Subjt:  EGRDRFKGSQGQSEKSEKKRDMKLPSAEEQLKDEIRRWGDSTETLDHEQSNGEWDDEQRRK

A0A6J1K887 uncharacterized protein LOC111491538 isoform X22.5e-10780.77Show/hide
Query:  MITLGHAYLSPSPSNLTSLK-LRLLKPPSIFSPSLSNLKPLNPCNKSTSSQRRIGNGICRAELGNDAPLAIAIGACILSSLVFPAAGGGSDDEGDAVIDS
        M+TL  AYLS SPSN +SL  LRL KPP  FS SLSNLKPLNP +KS S+QR   NGICRAELGNDAP AIAIGACILSSLV P AGGGSDD+ DAV+DS
Subjt:  MITLGHAYLSPSPSNLTSLK-LRLLKPPSIFSPSLSNLKPLNPCNKSTSSQRRIGNGICRAELGNDAPLAIAIGACILSSLVFPAAGGGSDDEGDAVIDS

Query:  TDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCIAHIQVEVSITNGDIQPFQIFGRASNQISSMKE
        TD RLAVM IISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSP+ESWLPIVSIL+CIAHIQVE SI NGDIQPFQIFG+ASNQIS  + 
Subjt:  TDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCIAHIQVEVSITNGDIQPFQIFGRASNQISSMKE

Query:  GRDRFKGSQGQSEKSEKKRDMKLPSAEEQLKDEIRRWGDSTETLDHEQSNGEWDDEQRRK
        GR   KG +G ++KS KKRDMKLPSAEEQL+DEIR WGD  ETLDHEQSN EWDDEQRRK
Subjt:  GRDRFKGSQGQSEKSEKKRDMKLPSAEEQLKDEIRRWGDSTETLDHEQSNGEWDDEQRRK

A0A6J1KAA9 uncharacterized protein LOC111491538 isoform X11.2e-10680.84Show/hide
Query:  MITLGHAYLSPSPSNLTSLK-LRLLKPPSIFSPSLSNLKPLNPCNKSTSSQRR-IGNGICRAELGNDAPLAIAIGACILSSLVFPAAGGGSDDEGDAVID
        M+TL  AYLS SPSN +SL  LRL KPP  FS SLSNLKPLNP +KS S+QRR   NGICRAELGNDAP AIAIGACILSSLV P AGGGSDD+ DAV+D
Subjt:  MITLGHAYLSPSPSNLTSLK-LRLLKPPSIFSPSLSNLKPLNPCNKSTSSQRR-IGNGICRAELGNDAPLAIAIGACILSSLVFPAAGGGSDDEGDAVID

Query:  STDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCIAHIQVEVSITNGDIQPFQIFGRASNQISSMK
        STD RLAVM IISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSP+ESWLPIVSIL+CIAHIQVE SI NGDIQPFQIFG+ASNQIS  +
Subjt:  STDTRLAVMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCIAHIQVEVSITNGDIQPFQIFGRASNQISSMK

Query:  EGRDRFKGSQGQSEKSEKKRDMKLPSAEEQLKDEIRRWGDSTETLDHEQSNGEWDDEQRRK
         GR   KG +G ++KS KKRDMKLPSAEEQL+DEIR WGD  ETLDHEQSN EWDDEQRRK
Subjt:  EGRDRFKGSQGQSEKSEKKRDMKLPSAEEQLKDEIRRWGDSTETLDHEQSNGEWDDEQRRK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G41960.1 unknown protein2.2e-4754.76Show/hide
Query:  LSPSPSNLTSLKLRLLKPPSIFSPSLSNLKPLNPCNKSTSSQRRIGNGICRAELGNDAPLAIAIGACILSSLVFPAAGGGSDDEGD---AVIDSTDTRLA
        LS S S  T  K RLL   S  S S S L    P        R+I   ICRAE   DAPL  AIGACILSS VFP A   +D+E +   + I STD RLA
Subjt:  LSPSPSNLTSLKLRLLKPPSIFSPSLSNLKPLNPCNKSTSSQRRIGNGICRAELGNDAPLAIAIGACILSSLVFPAAGGGSDDEGD---AVIDSTDTRLA

Query:  VMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCIAHIQVEVSITNGDIQPFQIFGRASNQISSMKEG---RD
         M IISFIPYFNWLSWVFAWLD+GK RYAVYA+VYL PYL SNLS+SPEESWLPI SI+L I H+Q+E SI NGD++    F   S+   S K+    + 
Subjt:  VMSIISFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCIAHIQVEVSITNGDIQPFQIFGRASNQISSMKEG---RD

Query:  RFKGSQGQSE
         FKG     E
Subjt:  RFKGSQGQSE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATCACTCTGGGTCATGCTTATTTATCACCATCCCCTTCCAATCTCACTTCTCTGAAGCTTCGTCTTTTGAAACCCCCTTCCATCTTCTCACCATCGCTCTCCAATCT
TAAACCCTTAAATCCCTGCAACAAATCAACTTCCAGTCAGAGAAGGATCGGAAATGGGATTTGTAGGGCGGAATTGGGGAACGACGCGCCTTTAGCCATTGCGATCGGAG
CCTGCATTCTCAGTTCTCTTGTTTTTCCGGCAGCCGGCGGTGGTTCCGATGATGAGGGAGATGCCGTTATTGATTCCACTGATACCAGGCTCGCTGTCATGAGCATCATT
AGTTTCATCCCCTACTTCAACTGGCTGAGTTGGGTTTTTGCGTGGCTTGATTCTGGGAAAAGACGTTATGCTGTGTATGCAATCGTGTATTTGGCTCCTTATCTAAGGTC
GAATTTATCGTTGTCGCCCGAAGAGAGCTGGCTTCCTATTGTCAGTATACTTCTCTGCATTGCTCACATTCAGGTTGAAGTGAGCATTACTAATGGAGATATTCAACCCT
TTCAAATATTTGGGAGAGCTTCAAATCAAATTTCTTCAATGAAGGAAGGGAGAGACCGTTTCAAGGGGTCCCAAGGACAATCCGAAAAGAGCGAAAAGAAAAGGGACATG
AAGCTGCCATCTGCTGAAGAACAATTGAAAGATGAGATTAGAAGATGGGGAGATTCTACAGAGACATTAGATCATGAACAATCCAATGGAGAATGGGATGATGAACAGAG
GAGAAAAGATTAG
mRNA sequenceShow/hide mRNA sequence
GGTATTTTGTATAATTTGAAAGAGAGAGACTGGCACACTTGCCAAGAACAATTCCATTGGGGGTCTGCGTCTTCCCGGAATGAATTTTCATTCTACATTTCTGATTCTCA
CTTTCTTCCGATTCTCATCTCACTGATTCACCGGAATTTCAACGCCAATCAAAGCAATGATCACTCTGGGTCATGCTTATTTATCACCATCCCCTTCCAATCTCACTTCT
CTGAAGCTTCGTCTTTTGAAACCCCCTTCCATCTTCTCACCATCGCTCTCCAATCTTAAACCCTTAAATCCCTGCAACAAATCAACTTCCAGTCAGAGAAGGATCGGAAA
TGGGATTTGTAGGGCGGAATTGGGGAACGACGCGCCTTTAGCCATTGCGATCGGAGCCTGCATTCTCAGTTCTCTTGTTTTTCCGGCAGCCGGCGGTGGTTCCGATGATG
AGGGAGATGCCGTTATTGATTCCACTGATACCAGGCTCGCTGTCATGAGCATCATTAGTTTCATCCCCTACTTCAACTGGCTGAGTTGGGTTTTTGCGTGGCTTGATTCT
GGGAAAAGACGTTATGCTGTGTATGCAATCGTGTATTTGGCTCCTTATCTAAGGTCGAATTTATCGTTGTCGCCCGAAGAGAGCTGGCTTCCTATTGTCAGTATACTTCT
CTGCATTGCTCACATTCAGGTTGAAGTGAGCATTACTAATGGAGATATTCAACCCTTTCAAATATTTGGGAGAGCTTCAAATCAAATTTCTTCAATGAAGGAAGGGAGAG
ACCGTTTCAAGGGGTCCCAAGGACAATCCGAAAAGAGCGAAAAGAAAAGGGACATGAAGCTGCCATCTGCTGAAGAACAATTGAAAGATGAGATTAGAAGATGGGGAGAT
TCTACAGAGACATTAGATCATGAACAATCCAATGGAGAATGGGATGATGAACAGAGGAGAAAAGATTAGGTTCTATGTGCTAACTTTACTCTGCTTGGTGCATATACAAA
TTAAAGTTGAGGGGTACAGAATGTTAGTTTTAAATTTATATTATGTTTAGTACAACATTAAAATTAGATATACCTTCTACTTCTAATATAGTCAGTTGCAAGCAAATTCA
TGGAGCTTACATATTCTTAGAACTGCTGCAGTCACATCGGTA
Protein sequenceShow/hide protein sequence
MITLGHAYLSPSPSNLTSLKLRLLKPPSIFSPSLSNLKPLNPCNKSTSSQRRIGNGICRAELGNDAPLAIAIGACILSSLVFPAAGGGSDDEGDAVIDSTDTRLAVMSII
SFIPYFNWLSWVFAWLDSGKRRYAVYAIVYLAPYLRSNLSLSPEESWLPIVSILLCIAHIQVEVSITNGDIQPFQIFGRASNQISSMKEGRDRFKGSQGQSEKSEKKRDM
KLPSAEEQLKDEIRRWGDSTETLDHEQSNGEWDDEQRRKD