; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0007291 (gene) of Snake gourd v1 genome

Gene IDTan0007291
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionProtein of unknown function (DUF1685)
Genome locationLG10:15293023..15295232
RNA-Seq ExpressionTan0007291
SyntenyTan0007291
Gene Ontology termsNA
InterPro domainsIPR012881 - Protein of unknown function DUF1685


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6597653.1 hypothetical protein SDJN03_10833, partial [Cucurbita argyrosperma subsp. sororia]9.5e-8591.01Show/hide
Query:  MGSDFFSESESWVSSVNGVREDPDEETSIEEGEEIGSDSDLDELIQMGMENLKKKKNMKKKNQVLLEGFVEDEDDLMRTKSLTDEDLDELKGCVDLGFGF
        MGS FFSESE WVSSVNGVREDPDE+TSIEE EEIGSDSDLDE +QMGMENL KK   KKKN VLLEGFVEDED+LMRTKSLTDEDLDELKGCVDLGFGF
Subjt:  MGSDFFSESESWVSSVNGVREDPDEETSIEEGEEIGSDSDLDELIQMGMENLKKKKNMKKKNQVLLEGFVEDEDDLMRTKSLTDEDLDELKGCVDLGFGF

Query:  SYDEIPELCNTLPALELCYSMSQKYMDDHQKSPESLPAVDSCSSVSSPIANWKISSPGDHPEDVKARLKFWAQAVACT
        SYDEIPELCNTLPALELCYSMSQKYMD+HQKSPES  A+DSCSSVSSPIANWKISSPGDHPEDVKARLKFWAQAVACT
Subjt:  SYDEIPELCNTLPALELCYSMSQKYMDDHQKSPESLPAVDSCSSVSSPIANWKISSPGDHPEDVKARLKFWAQAVACT

KAG7029094.1 hypothetical protein SDJN02_10278, partial [Cucurbita argyrosperma subsp. argyrosperma]8.6e-8689.25Show/hide
Query:  MGSDFFSESESWVSSVNGVREDPDEETSIEEGEEIGSDSDLDELIQMGMENLKKKKNMKKKNQVLLEGFVEDEDDLMRTKSLTDEDLDELKGCVDLGFGF
        MGS FFSESE WVSSVNGVREDPDE+TSIEEGEEIGSDSDLDE +QMGMENL KK   KKKN VLLEGFVEDED+LMRTKSLTDEDLDELKGCVDLGFGF
Subjt:  MGSDFFSESESWVSSVNGVREDPDEETSIEEGEEIGSDSDLDELIQMGMENLKKKKNMKKKNQVLLEGFVEDEDDLMRTKSLTDEDLDELKGCVDLGFGF

Query:  SYDEIPELCNTLPALELCYSMSQKYMDDHQKSPESLPAVDSCSSVSSPIANWKISSPGDHPEDVKARLKFWAQAVACTPCGERLCS
        SYDEIPELCNTLPALELCYSMSQKYMD+HQKSPES  A+DSCSSVSSPIANWKISSPGDHPEDVKARLKFWAQAVACT    RLC+
Subjt:  SYDEIPELCNTLPALELCYSMSQKYMDDHQKSPESLPAVDSCSSVSSPIANWKISSPGDHPEDVKARLKFWAQAVACTPCGERLCS

XP_008453998.1 PREDICTED: uncharacterized protein LOC103494552 [Cucumis melo]4.4e-8285.26Show/hide
Query:  MGSDFFSESESWVSSVNGVREDPDEETSIEEGEEIGSDSDLDELIQMGMENLKKKKNMKKKNQVLLEGFVEDEDDLMRTKSLTDEDLDELKGCVDLGFGF
        MGSDFFS+SE+WVSSVNG+REDPD+ETS+EEGE IGSDSD DE +QMG    KK+  MKK+NQVLLEGFVEDEDDLMRTKSLTDEDLDELKGCVDLGFGF
Subjt:  MGSDFFSESESWVSSVNGVREDPDEETSIEEGEEIGSDSDLDELIQMGMENLKKKKNMKKKNQVLLEGFVEDEDDLMRTKSLTDEDLDELKGCVDLGFGF

Query:  SYDEIPELCNTLPALELCYSMSQKYMDDHQKSPESLPA----VDSCSSVSSPIANWKISSPGDHPEDVKARLKFWAQAVACTPCGERLCS
        SYDEIPELCNTLPALELCYSMSQKYMDDHQKSPES PA     DSCSSVSSPIANWKISSPGDHPEDVKARLKFWAQAVACT    RLC+
Subjt:  SYDEIPELCNTLPALELCYSMSQKYMDDHQKSPESLPA----VDSCSSVSSPIANWKISSPGDHPEDVKARLKFWAQAVACTPCGERLCS

XP_022932682.1 uncharacterized protein LOC111439155 [Cucurbita moschata]1.0e-8388.24Show/hide
Query:  MGSDFFSESESWVSSVNGVREDPDEETSIEEGEEIGSDSDLDELIQMGMENL-KKKKNMKKKNQVLLEGFVEDEDDLMRTKSLTDEDLDELKGCVDLGFG
        MGS FFSESE WVSSVN VREDPDE++SIEEGEEIGSDSDLDE +QMGMENL KK K  KKKN VLLEGFVEDED+LMRTKSLTDEDLDELKGCVDLGFG
Subjt:  MGSDFFSESESWVSSVNGVREDPDEETSIEEGEEIGSDSDLDELIQMGMENL-KKKKNMKKKNQVLLEGFVEDEDDLMRTKSLTDEDLDELKGCVDLGFG

Query:  FSYDEIPELCNTLPALELCYSMSQKYMDDHQKSPESLPAVDSCSSVSSPIANWKISSPGDHPEDVKARLKFWAQAVACTPCGERLCS
        FSYDEIPELCNTLPALELCYSMSQKYMD+HQKSPES  A+DSCSSVSSPIANWKISSPGDHPEDVKARLKFWAQAVACT    RLC+
Subjt:  FSYDEIPELCNTLPALELCYSMSQKYMDDHQKSPESLPAVDSCSSVSSPIANWKISSPGDHPEDVKARLKFWAQAVACTPCGERLCS

XP_022972236.1 uncharacterized protein LOC111470827 [Cucurbita maxima]3.7e-8186.56Show/hide
Query:  MGSDFFSESESWVSSVNGVREDPDEETSIEEGEEIGSDSDLDELIQMGMENLKKKKNMKKKNQVLLEGFVEDEDDLMRTKSLTDEDLDELKGCVDLGFGF
        MGS FFSESE W   VNGVRED +E+TSIEEGEEIGSDSDLDE +QMGMENL KK   KKKN VLLEGFVEDED+LMRTKSLTDEDLDELKGCVDLGFGF
Subjt:  MGSDFFSESESWVSSVNGVREDPDEETSIEEGEEIGSDSDLDELIQMGMENLKKKKNMKKKNQVLLEGFVEDEDDLMRTKSLTDEDLDELKGCVDLGFGF

Query:  SYDEIPELCNTLPALELCYSMSQKYMDDHQKSPESLPAVDSCSSVSSPIANWKISSPGDHPEDVKARLKFWAQAVACTPCGERLCS
        SYDEIPELCNTLPALELCYSMSQKYMD+HQKSPES  A+DSCSSVSSPIANWKISSPGDHPEDVKARLKFWAQAVACT    RLC+
Subjt:  SYDEIPELCNTLPALELCYSMSQKYMDDHQKSPESLPAVDSCSSVSSPIANWKISSPGDHPEDVKARLKFWAQAVACTPCGERLCS

TrEMBL top hitse value%identityAlignment
A0A0A0KXD6 Uncharacterized protein5.8e-8083.16Show/hide
Query:  MGSDFFSESESWVSSVNGVREDPDEETSIEEGEEIGSDSDLDELIQMGMENLKKKKNMKKKNQVLLEGFVEDEDDLMRTKSLTDEDLDELKGCVDLGFGF
        MGSDFF+ES++WVSSVNG+REDPD+ETSI+ GE IG+DSD DE  QMG+    KK+ MKK++QVLLEGFVEDEDDLMRTKSLTDEDLDELKGCVDLGFGF
Subjt:  MGSDFFSESESWVSSVNGVREDPDEETSIEEGEEIGSDSDLDELIQMGMENLKKKKNMKKKNQVLLEGFVEDEDDLMRTKSLTDEDLDELKGCVDLGFGF

Query:  SYDEIPELCNTLPALELCYSMSQKYMDDHQKSPESLPA----VDSCSSVSSPIANWKISSPGDHPEDVKARLKFWAQAVACTPCGERLCS
        SYDEIPELCNTLPALELCYSMSQKYMDDHQKSPES PA     DSCSSVSSPIANWKISSPGDHPEDVKARLKFWAQAVACT    RLC+
Subjt:  SYDEIPELCNTLPALELCYSMSQKYMDDHQKSPESLPA----VDSCSSVSSPIANWKISSPGDHPEDVKARLKFWAQAVACTPCGERLCS

A0A1S3BXQ6 uncharacterized protein LOC1034945522.1e-8285.26Show/hide
Query:  MGSDFFSESESWVSSVNGVREDPDEETSIEEGEEIGSDSDLDELIQMGMENLKKKKNMKKKNQVLLEGFVEDEDDLMRTKSLTDEDLDELKGCVDLGFGF
        MGSDFFS+SE+WVSSVNG+REDPD+ETS+EEGE IGSDSD DE +QMG    KK+  MKK+NQVLLEGFVEDEDDLMRTKSLTDEDLDELKGCVDLGFGF
Subjt:  MGSDFFSESESWVSSVNGVREDPDEETSIEEGEEIGSDSDLDELIQMGMENLKKKKNMKKKNQVLLEGFVEDEDDLMRTKSLTDEDLDELKGCVDLGFGF

Query:  SYDEIPELCNTLPALELCYSMSQKYMDDHQKSPESLPA----VDSCSSVSSPIANWKISSPGDHPEDVKARLKFWAQAVACTPCGERLCS
        SYDEIPELCNTLPALELCYSMSQKYMDDHQKSPES PA     DSCSSVSSPIANWKISSPGDHPEDVKARLKFWAQAVACT    RLC+
Subjt:  SYDEIPELCNTLPALELCYSMSQKYMDDHQKSPESLPA----VDSCSSVSSPIANWKISSPGDHPEDVKARLKFWAQAVACTPCGERLCS

A0A5A7TTN2 Uncharacterized protein2.1e-8285.26Show/hide
Query:  MGSDFFSESESWVSSVNGVREDPDEETSIEEGEEIGSDSDLDELIQMGMENLKKKKNMKKKNQVLLEGFVEDEDDLMRTKSLTDEDLDELKGCVDLGFGF
        MGSDFFS+SE+WVSSVNG+REDPD+ETS+EEGE IGSDSD DE +QMG    KK+  MKK+NQVLLEGFVEDEDDLMRTKSLTDEDLDELKGCVDLGFGF
Subjt:  MGSDFFSESESWVSSVNGVREDPDEETSIEEGEEIGSDSDLDELIQMGMENLKKKKNMKKKNQVLLEGFVEDEDDLMRTKSLTDEDLDELKGCVDLGFGF

Query:  SYDEIPELCNTLPALELCYSMSQKYMDDHQKSPESLPA----VDSCSSVSSPIANWKISSPGDHPEDVKARLKFWAQAVACTPCGERLCS
        SYDEIPELCNTLPALELCYSMSQKYMDDHQKSPES PA     DSCSSVSSPIANWKISSPGDHPEDVKARLKFWAQAVACT    RLC+
Subjt:  SYDEIPELCNTLPALELCYSMSQKYMDDHQKSPESLPA----VDSCSSVSSPIANWKISSPGDHPEDVKARLKFWAQAVACTPCGERLCS

A0A6J1EXG0 uncharacterized protein LOC1114391555.1e-8488.24Show/hide
Query:  MGSDFFSESESWVSSVNGVREDPDEETSIEEGEEIGSDSDLDELIQMGMENL-KKKKNMKKKNQVLLEGFVEDEDDLMRTKSLTDEDLDELKGCVDLGFG
        MGS FFSESE WVSSVN VREDPDE++SIEEGEEIGSDSDLDE +QMGMENL KK K  KKKN VLLEGFVEDED+LMRTKSLTDEDLDELKGCVDLGFG
Subjt:  MGSDFFSESESWVSSVNGVREDPDEETSIEEGEEIGSDSDLDELIQMGMENL-KKKKNMKKKNQVLLEGFVEDEDDLMRTKSLTDEDLDELKGCVDLGFG

Query:  FSYDEIPELCNTLPALELCYSMSQKYMDDHQKSPESLPAVDSCSSVSSPIANWKISSPGDHPEDVKARLKFWAQAVACTPCGERLCS
        FSYDEIPELCNTLPALELCYSMSQKYMD+HQKSPES  A+DSCSSVSSPIANWKISSPGDHPEDVKARLKFWAQAVACT    RLC+
Subjt:  FSYDEIPELCNTLPALELCYSMSQKYMDDHQKSPESLPAVDSCSSVSSPIANWKISSPGDHPEDVKARLKFWAQAVACTPCGERLCS

A0A6J1I5G0 uncharacterized protein LOC1114708271.8e-8186.56Show/hide
Query:  MGSDFFSESESWVSSVNGVREDPDEETSIEEGEEIGSDSDLDELIQMGMENLKKKKNMKKKNQVLLEGFVEDEDDLMRTKSLTDEDLDELKGCVDLGFGF
        MGS FFSESE W   VNGVRED +E+TSIEEGEEIGSDSDLDE +QMGMENL KK   KKKN VLLEGFVEDED+LMRTKSLTDEDLDELKGCVDLGFGF
Subjt:  MGSDFFSESESWVSSVNGVREDPDEETSIEEGEEIGSDSDLDELIQMGMENLKKKKNMKKKNQVLLEGFVEDEDDLMRTKSLTDEDLDELKGCVDLGFGF

Query:  SYDEIPELCNTLPALELCYSMSQKYMDDHQKSPESLPAVDSCSSVSSPIANWKISSPGDHPEDVKARLKFWAQAVACTPCGERLCS
        SYDEIPELCNTLPALELCYSMSQKYMD+HQKSPES  A+DSCSSVSSPIANWKISSPGDHPEDVKARLKFWAQAVACT    RLC+
Subjt:  SYDEIPELCNTLPALELCYSMSQKYMDDHQKSPESLPAVDSCSSVSSPIANWKISSPGDHPEDVKARLKFWAQAVACTPCGERLCS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G05870.1 Protein of unknown function (DUF1685)1.2e-4555.5Show/hide
Query:  MGSDFFSESESWVSSVNGVREDPDEETSIEEGEEIGSDSDLDELIQMGMENLKKKKNMKKKNQVLLEGFVE---------DEDDLMRTKSLTDEDLDELK
        MG      S+ W +S N   E  DE    E   EI S         +  E  +KK   +KK+QVLLEG+VE          +DDL R+KSLTD+DL++L+
Subjt:  MGSDFFSESESWVSSVNGVREDPDEETSIEEGEEIGSDSDLDELIQMGMENLKKKKNMKKKNQVLLEGFVE---------DEDDLMRTKSLTDEDLDELK

Query:  GCVDLGFGFSYDEIPELCNTLPALELCYSMSQKYMDDHQ-KSPESLPAVDSCSS----VSSPIANWKISSPGDHPEDVKARLKFWAQAVACTPCGERLCS
        GC+DLGFGFSYDEIPELCNTLPALELCYSMSQK++DD Q KSPE+  +V+ C S     ++PIANWKISSPGD+P+DVKARLK+WAQAVACT    +LCS
Subjt:  GCVDLGFGFSYDEIPELCNTLPALELCYSMSQKYMDDHQ-KSPESLPAVDSCSS----VSSPIANWKISSPGDHPEDVKARLKFWAQAVACTPCGERLCS

AT1G05870.2 Protein of unknown function (DUF1685)1.2e-4555.5Show/hide
Query:  MGSDFFSESESWVSSVNGVREDPDEETSIEEGEEIGSDSDLDELIQMGMENLKKKKNMKKKNQVLLEGFVE---------DEDDLMRTKSLTDEDLDELK
        MG      S+ W +S N   E  DE    E   EI S         +  E  +KK   +KK+QVLLEG+VE          +DDL R+KSLTD+DL++L+
Subjt:  MGSDFFSESESWVSSVNGVREDPDEETSIEEGEEIGSDSDLDELIQMGMENLKKKKNMKKKNQVLLEGFVE---------DEDDLMRTKSLTDEDLDELK

Query:  GCVDLGFGFSYDEIPELCNTLPALELCYSMSQKYMDDHQ-KSPESLPAVDSCSS----VSSPIANWKISSPGDHPEDVKARLKFWAQAVACTPCGERLCS
        GC+DLGFGFSYDEIPELCNTLPALELCYSMSQK++DD Q KSPE+  +V+ C S     ++PIANWKISSPGD+P+DVKARLK+WAQAVACT    +LCS
Subjt:  GCVDLGFGFSYDEIPELCNTLPALELCYSMSQKYMDDHQ-KSPESLPAVDSCSS----VSSPIANWKISSPGDHPEDVKARLKFWAQAVACTPCGERLCS

AT1G05870.3 Protein of unknown function (DUF1685)1.2e-4555.5Show/hide
Query:  MGSDFFSESESWVSSVNGVREDPDEETSIEEGEEIGSDSDLDELIQMGMENLKKKKNMKKKNQVLLEGFVE---------DEDDLMRTKSLTDEDLDELK
        MG      S+ W +S N   E  DE    E   EI S         +  E  +KK   +KK+QVLLEG+VE          +DDL R+KSLTD+DL++L+
Subjt:  MGSDFFSESESWVSSVNGVREDPDEETSIEEGEEIGSDSDLDELIQMGMENLKKKKNMKKKNQVLLEGFVE---------DEDDLMRTKSLTDEDLDELK

Query:  GCVDLGFGFSYDEIPELCNTLPALELCYSMSQKYMDDHQ-KSPESLPAVDSCSS----VSSPIANWKISSPGDHPEDVKARLKFWAQAVACTPCGERLCS
        GC+DLGFGFSYDEIPELCNTLPALELCYSMSQK++DD Q KSPE+  +V+ C S     ++PIANWKISSPGD+P+DVKARLK+WAQAVACT    +LCS
Subjt:  GCVDLGFGFSYDEIPELCNTLPALELCYSMSQKYMDDHQ-KSPESLPAVDSCSS----VSSPIANWKISSPGDHPEDVKARLKFWAQAVACTPCGERLCS

AT2G31560.1 Protein of unknown function (DUF1685)2.9e-4768.61Show/hide
Query:  KKKNMKKKNQVLLEGF-VEDEDDLMRTKSLTDEDLDELKGCVDLGFGFSYDEIPELCNTLPALELCYSMSQKYMDDHQKSPESLPAVDSCS---SVSSPI
        +KK  KKK+QVLLEG+ ++D+DDL R KSLTD+DL+ELKGC+DLGFGFSYDEIPELCNTLPALELCYSMSQK++DD Q++       D  S   + ++PI
Subjt:  KKKNMKKKNQVLLEGF-VEDEDDLMRTKSLTDEDLDELKGCVDLGFGFSYDEIPELCNTLPALELCYSMSQKYMDDHQKSPESLPAVDSCS---SVSSPI

Query:  ANWKISSPGDHPEDVKARLKFWAQAVACTPCGERLCS
        ANWKISSPGD P+DVKARLK+WAQ VACT    RLCS
Subjt:  ANWKISSPGDHPEDVKARLKFWAQAVACTPCGERLCS

AT2G31560.2 Protein of unknown function (DUF1685)2.9e-4768.61Show/hide
Query:  KKKNMKKKNQVLLEGF-VEDEDDLMRTKSLTDEDLDELKGCVDLGFGFSYDEIPELCNTLPALELCYSMSQKYMDDHQKSPESLPAVDSCS---SVSSPI
        +KK  KKK+QVLLEG+ ++D+DDL R KSLTD+DL+ELKGC+DLGFGFSYDEIPELCNTLPALELCYSMSQK++DD Q++       D  S   + ++PI
Subjt:  KKKNMKKKNQVLLEGF-VEDEDDLMRTKSLTDEDLDELKGCVDLGFGFSYDEIPELCNTLPALELCYSMSQKYMDDHQKSPESLPAVDSCS---SVSSPI

Query:  ANWKISSPGDHPEDVKARLKFWAQAVACTPCGERLCS
        ANWKISSPGD P+DVKARLK+WAQ VACT    RLCS
Subjt:  ANWKISSPGDHPEDVKARLKFWAQAVACTPCGERLCS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAAGTGATTTCTTTAGCGAATCTGAATCGTGGGTTTCTTCTGTGAATGGAGTTCGAGAGGACCCAGATGAGGAAACCTCCATTGAAGAAGGTGAAGAAATTGGGTC
GGATTCGGATTTGGACGAGTTGATTCAAATGGGGATGGAAAATTTGAAGAAAAAGAAGAACATGAAGAAGAAGAACCAGGTTTTGCTTGAGGGATTTGTGGAGGATGAGG
ACGATTTGATGAGGACGAAGAGCTTGACGGATGAGGATCTGGATGAGCTTAAGGGCTGTGTGGATCTAGGGTTTGGTTTCAGTTACGATGAAATTCCGGAGCTCTGCAAC
ACTCTCCCGGCGTTGGAGCTCTGTTATTCCATGAGCCAAAAGTATATGGACGACCACCAGAAGTCGCCGGAGAGCTTGCCGGCGGTGGATTCGTGTTCGTCGGTGTCGAG
TCCGATTGCCAATTGGAAGATCTCCAGTCCTGGTGATCATCCAGAAGATGTTAAGGCAAGGCTCAAATTTTGGGCTCAAGCAGTGGCATGTACTCCTTGTGGTGAACGGC
TTTGCTCGCCTCTGAGAAAAGATGGAACGAATGAACGAACAGATCGTTTGACGAGTCTGACTTATGAAGATGAAGAAGAAGAAGAACGTGTCAGTGGTTTTGACTTTGAG
AGCCTAGCTTCCATAACACCCAACACCCAAGTCTCTTTCACTGACCTTTTTGCATATGCTGAGGTAACTAACAGTATTAACCCATCATCAAGTGGTTCTTCCTTCTCTTT
TAGACCCCATTTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGAAGTGATTTCTTTAGCGAATCTGAATCGTGGGTTTCTTCTGTGAATGGAGTTCGAGAGGACCCAGATGAGGAAACCTCCATTGAAGAAGGTGAAGAAATTGGGTC
GGATTCGGATTTGGACGAGTTGATTCAAATGGGGATGGAAAATTTGAAGAAAAAGAAGAACATGAAGAAGAAGAACCAGGTTTTGCTTGAGGGATTTGTGGAGGATGAGG
ACGATTTGATGAGGACGAAGAGCTTGACGGATGAGGATCTGGATGAGCTTAAGGGCTGTGTGGATCTAGGGTTTGGTTTCAGTTACGATGAAATTCCGGAGCTCTGCAAC
ACTCTCCCGGCGTTGGAGCTCTGTTATTCCATGAGCCAAAAGTATATGGACGACCACCAGAAGTCGCCGGAGAGCTTGCCGGCGGTGGATTCGTGTTCGTCGGTGTCGAG
TCCGATTGCCAATTGGAAGATCTCCAGTCCTGGTGATCATCCAGAAGATGTTAAGGCAAGGCTCAAATTTTGGGCTCAAGCAGTGGCATGTACTCCTTGTGGTGAACGGC
TTTGCTCGCCTCTGAGAAAAGATGGAACGAATGAACGAACAGATCGTTTGACGAGTCTGACTTATGAAGATGAAGAAGAAGAAGAACGTGTCAGTGGTTTTGACTTTGAG
AGCCTAGCTTCCATAACACCCAACACCCAAGTCTCTTTCACTGACCTTTTTGCATATGCTGAGGTAACTAACAGTATTAACCCATCATCAAGTGGTTCTTCCTTCTCTTT
TAGACCCCATTTCTGAAGGTAACTGACACTGACTTCACTAACTATCAGGTTTTTTTCAATGGAAAGCAAGGGAGAAAGTTCATTCTTTGTTGTTTCAATGTCTGAGTAGA
GATGTTTTTGGTTTCGGATATTGGGTTGTAGATGGTAATCTCTTCTAGTTTATGCTATCCAACGACCCGAAATTAGAGTTCGAGTCGAGGTTGGGAGAGTGTTATGTTCT
CGATGTCTTCTCTACTTATGATTTCTAATCTCCCTTGATGGTTTGGATGCCAAAGAAAAGAAAATGTTTGGTTGATGTTGTATGAGCATTAAA
Protein sequenceShow/hide protein sequence
MGSDFFSESESWVSSVNGVREDPDEETSIEEGEEIGSDSDLDELIQMGMENLKKKKNMKKKNQVLLEGFVEDEDDLMRTKSLTDEDLDELKGCVDLGFGFSYDEIPELCN
TLPALELCYSMSQKYMDDHQKSPESLPAVDSCSSVSSPIANWKISSPGDHPEDVKARLKFWAQAVACTPCGERLCSPLRKDGTNERTDRLTSLTYEDEEEEERVSGFDFE
SLASITPNTQVSFTDLFAYAEVTNSINPSSSGSSFSFRPHF