CuGenDBv2

Lsi05G003960 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi05G003960
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: transmembrane protein 33 homolog
Genome location: chr05:4988953..4994513
RNA-Seq Expression: Lsi05G003960
Synteny: Lsi05G003960
Gene Ontology terms:
  GO:0016021 - integral component of membrane (cellular component)
  GO:0016787 - hydrolase activity (molecular function)
InterPro domains: IPR005344 - TMEM33/Pom33 family
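
The genome location above is given as a chrom:start..end string. A minimal Python sketch for splitting it into its parts follows; the helper name parse_location is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("chr05:4988953..4994513")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
```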


Homology
GenBank top hits (e value, % identity, alignment)
XP_022143362.1 uncharacterized protein LOC111013250 isoform X1 [Momordica charantia] (e value: 7.1e-119; % identity: 82.08)
Query:  MGEISEETQRLKRIAAAAYDYDNDSRWNDYWSNILIPPNLVSRSDVIDHFKRKFYQRYIDPDLVVETMSTSGSLFHPASQSAPSPPTNDQTQAQSSGSTT
        M E SEETQRLKRIAAAAYDYDNDSRW DYWSNILIPPNLVSRSDVIDHFKRKFYQR+IDPDLVV+T+S S      A QSAPSPPTNDQTQAQSSGSTT
Subjt:  MGEISEETQRLKRIAAAAYDYDNDSRWNDYWSNILIPPNLVSRSDVIDHFKRKFYQRYIDPDLVVETMSTSGSLFHPASQSAPSPPTNDQTQAQSSGSTT

Query:  RNSATSATTDSISNSSRWDRHMVLFSLNAWVLIVALFAMIPMVPRNLSHRAYRLSIMGTACSSLFSLFIKCGMPRAFDMQGLEIYFQSVVATKAFVYFIY
         NS TS TTDS   S     HM+LFS+NAWVLIVALFAM+PMVP+NLSHRA+RLS MGT  SSLFSL I CG P+A DMQ LE+YFQSVVATKAFVYFIY
Subjt:  RNSATSATTDSISNSSRWDRHMVLFSLNAWVLIVALFAMIPMVPRNLSHRAYRLSIMGTACSSLFSLFIKCGMPRAFDMQGLEIYFQSVVATKAFVYFIY

Query:  SVTFVASNLCLKFALIPIICGAIEQIAKFLGRNFPQSFFYRKWLERPCAWVESHTTTLSLLSSNVEIALGFLLIISLFT
         +TFVASNLCLKFALIPIICG IEQIAKFL R+FP+S FYRK LERPCAWVES+TTTLSLLSSNVEIALGFLLIISLF+
Subjt:  SVTFVASNLCLKFALIPIICGAIEQIAKFLGRNFPQSFFYRKWLERPCAWVESHTTTLSLLSSNVEIALGFLLIISLFT

XP_022143363.1 uncharacterized protein LOC111013250 isoform X2 [Momordica charantia] (e value: 7.1e-119; % identity: 82.08)
Query:  MGEISEETQRLKRIAAAAYDYDNDSRWNDYWSNILIPPNLVSRSDVIDHFKRKFYQRYIDPDLVVETMSTSGSLFHPASQSAPSPPTNDQTQAQSSGSTT
        M E SEETQRLKRIAAAAYDYDNDSRW DYWSNILIPPNLVSRSDVIDHFKRKFYQR+IDPDLVV+T+S S      A QSAPSPPTNDQTQAQSSGSTT
Subjt:  MGEISEETQRLKRIAAAAYDYDNDSRWNDYWSNILIPPNLVSRSDVIDHFKRKFYQRYIDPDLVVETMSTSGSLFHPASQSAPSPPTNDQTQAQSSGSTT

Query:  RNSATSATTDSISNSSRWDRHMVLFSLNAWVLIVALFAMIPMVPRNLSHRAYRLSIMGTACSSLFSLFIKCGMPRAFDMQGLEIYFQSVVATKAFVYFIY
         NS TS TTDS   S     HM+LFS+NAWVLIVALFAM+PMVP+NLSHRA+RLS MGT  SSLFSL I CG P+A DMQ LE+YFQSVVATKAFVYFIY
Subjt:  RNSATSATTDSISNSSRWDRHMVLFSLNAWVLIVALFAMIPMVPRNLSHRAYRLSIMGTACSSLFSLFIKCGMPRAFDMQGLEIYFQSVVATKAFVYFIY

Query:  SVTFVASNLCLKFALIPIICGAIEQIAKFLGRNFPQSFFYRKWLERPCAWVESHTTTLSLLSSNVEIALGFLLIISLFT
         +TFVASNLCLKFALIPIICG IEQIAKFL R+FP+S FYRK LERPCAWVES+TTTLSLLSSNVEIALGFLLIISLF+
Subjt:  SVTFVASNLCLKFALIPIICGAIEQIAKFLGRNFPQSFFYRKWLERPCAWVESHTTTLSLLSSNVEIALGFLLIISLFT

XP_022998724.1 transmembrane protein 33 homolog [Cucurbita maxima] (e value: 1.1e-119; % identity: 81)
Query:  MGEISEETQRLKRIAAAAYDYDNDSRWNDYWSNILIPPNLVSRSDVIDHFKRKFYQRYIDPDLVVETMSTSGSLFHPASQSAPSPPTNDQTQAQSSGSTT
        MGEISEETQRLKRIAAA YDYDND RW DYWSNILIPPNLVSRSDV DHFKRKFYQRYIDP+LVVETMS+S S F P  QSAPSPPTNDQTQ+QSSGSTT
Subjt:  MGEISEETQRLKRIAAAAYDYDNDSRWNDYWSNILIPPNLVSRSDVIDHFKRKFYQRYIDPDLVVETMSTSGSLFHPASQSAPSPPTNDQTQAQSSGSTT

Query:  RNSATSATTDSISNSSRWDRHMVLFSLNAWVLIVALFAMIPMVPRNLSHRAYRLSIMGTACSSLFSLFIKCGMPRAFDMQGLEIYFQSVVATKAFVYFIY
        RNS TSATTDS     RWDRH VLF +NAWV ++  FAMIPMVPRN+SHRAY++S+MGTACSSLF+LFI CG PR+FDMQ LE+YFQ VVATKA VY IY
Subjt:  RNSATSATTDSISNSSRWDRHMVLFSLNAWVLIVALFAMIPMVPRNLSHRAYRLSIMGTACSSLFSLFIKCGMPRAFDMQGLEIYFQSVVATKAFVYFIY

Query:  SVTFVASNLCLKFALIPIICGAIEQIAKFLGRNFPQSFFYRKWLERPCAWVESHTTTLSLLSSNVEIALGFLLIISLFT
        S+TFVASNL LK ALIPIIC A+EQIAKF+  NFP+S FYRK LERPCAW ES+TTTL LLSSNVEIALGFLLIISLFT
Subjt:  SVTFVASNLCLKFALIPIICGAIEQIAKFLGRNFPQSFFYRKWLERPCAWVESHTTTLSLLSSNVEIALGFLLIISLFT

XP_023525188.1 uncharacterized protein LOC111788863 [Cucurbita pepo subsp. pepo] (e value: 4.3e-116; % identity: 78.85)
Query:  MGEISEETQRLKRIAAAAYDYDNDSRWNDYWSNILIPPNLVSRSDVIDHFKRKFYQRYIDPDLVVETMSTSGSLFHPASQSAPSPPTNDQTQAQSSGSTT
        MGEISEETQRLKRIAAA YDYDND RW DYWSNILIPPNLVSRSDV DHFKRKFYQRYIDPDLVVETMS+  S F P  QSA SPPTNDQTQ+QSSGSTT
Subjt:  MGEISEETQRLKRIAAAAYDYDNDSRWNDYWSNILIPPNLVSRSDVIDHFKRKFYQRYIDPDLVVETMSTSGSLFHPASQSAPSPPTNDQTQAQSSGSTT

Query:  RNSATSATTDSISNSSRWDRHMVLFSLNAWVLIVALFAMIPMVPRNLSHRAYRLSIMGTACSSLFSLFIKCGMPRAFDMQGLEIYFQSVVATKAFVYFIY
        RNS TSATT S     RWDRH VLF +NAWV +++  A+IPMVPRN+SHRAY++S+MGTACSSLF+LFI CG PR+FDMQ LE+YF+ VVATKA VY IY
Subjt:  RNSATSATTDSISNSSRWDRHMVLFSLNAWVLIVALFAMIPMVPRNLSHRAYRLSIMGTACSSLFSLFIKCGMPRAFDMQGLEIYFQSVVATKAFVYFIY

Query:  SVTFVASNLCLKFALIPIICGAIEQIAKFLGRNFPQSFFYRKWLERPCAWVESHTTTLSLLSSNVEIALGFLLIISLFT
        S+TFVASNL LK ALIPIIC A+EQIAK++  NFP+S FYRK LERPCAWVES+T TL LLSSNVEIALGFLLIISLFT
Subjt:  SVTFVASNLCLKFALIPIICGAIEQIAKFLGRNFPQSFFYRKWLERPCAWVESHTTTLSLLSSNVEIALGFLLIISLFT

XP_038874907.1 uncharacterized protein LOC120067410 [Benincasa hispida] (e value: 3.0e-133; % identity: 89.96)
Query:  MGEISEETQRLKRIAAAAYDYDNDSRWNDYWSNILIPPNLVSRSDVIDHFKRKFYQRYIDPDLVVETMSTSGSLFHPASQSAPSPPTNDQTQAQSSGSTT
        MGE SEETQR KRIAAAAYDYDNDSRW DYWSN LIPP++VSRSDVIDHFKRKFYQRYIDPDLVVETMS S S F PASQSAPSPPTNDQTQ QSS    
Subjt:  MGEISEETQRLKRIAAAAYDYDNDSRWNDYWSNILIPPNLVSRSDVIDHFKRKFYQRYIDPDLVVETMSTSGSLFHPASQSAPSPPTNDQTQAQSSGSTT

Query:  RNSATSATTDSISNSSRWDRHMVLFSLNAWVLIVALFAMIPMVPRNLSHRAYRLSIMGTACSSLFSLFIKCGMPRAFDMQGLEIYFQSVVATKAFVYFIY
        RNS   ATTDSISNSSRW+RHMVLFSLNAWVLIVALFAMIPMVPRNLSHRAYRLSIMGTACSSLFSLFIKCGMPRAFDMQGLE+YFQ+VVATK FVYFIY
Subjt:  RNSATSATTDSISNSSRWDRHMVLFSLNAWVLIVALFAMIPMVPRNLSHRAYRLSIMGTACSSLFSLFIKCGMPRAFDMQGLEIYFQSVVATKAFVYFIY

Query:  SVTFVASNLCLKFALIPIICGAIEQIAKFLGRNFPQSFFYRKWLERPCAWVESHTTTLSLLSSNVEIALGFLLIISLFT
        SVTFVASNLCLKFALIPIIC A EQIAKFL R FPQSFFYRKWLERPCAWVES+TTTLSLLSS+VEIALGFLLIISLFT
Subjt:  SVTFVASNLCLKFALIPIICGAIEQIAKFLGRNFPQSFFYRKWLERPCAWVESHTTTLSLLSSNVEIALGFLLIISLFT

TrEMBL top hits (e value, % identity, alignment)
A0A6J1CQJ0 uncharacterized protein LOC111013250 isoform X2 (e value: 3.4e-119; % identity: 82.08)
Query:  MGEISEETQRLKRIAAAAYDYDNDSRWNDYWSNILIPPNLVSRSDVIDHFKRKFYQRYIDPDLVVETMSTSGSLFHPASQSAPSPPTNDQTQAQSSGSTT
        M E SEETQRLKRIAAAAYDYDNDSRW DYWSNILIPPNLVSRSDVIDHFKRKFYQR+IDPDLVV+T+S S      A QSAPSPPTNDQTQAQSSGSTT
Subjt:  MGEISEETQRLKRIAAAAYDYDNDSRWNDYWSNILIPPNLVSRSDVIDHFKRKFYQRYIDPDLVVETMSTSGSLFHPASQSAPSPPTNDQTQAQSSGSTT

Query:  RNSATSATTDSISNSSRWDRHMVLFSLNAWVLIVALFAMIPMVPRNLSHRAYRLSIMGTACSSLFSLFIKCGMPRAFDMQGLEIYFQSVVATKAFVYFIY
         NS TS TTDS   S     HM+LFS+NAWVLIVALFAM+PMVP+NLSHRA+RLS MGT  SSLFSL I CG P+A DMQ LE+YFQSVVATKAFVYFIY
Subjt:  RNSATSATTDSISNSSRWDRHMVLFSLNAWVLIVALFAMIPMVPRNLSHRAYRLSIMGTACSSLFSLFIKCGMPRAFDMQGLEIYFQSVVATKAFVYFIY

Query:  SVTFVASNLCLKFALIPIICGAIEQIAKFLGRNFPQSFFYRKWLERPCAWVESHTTTLSLLSSNVEIALGFLLIISLFT
         +TFVASNLCLKFALIPIICG IEQIAKFL R+FP+S FYRK LERPCAWVES+TTTLSLLSSNVEIALGFLLIISLF+
Subjt:  SVTFVASNLCLKFALIPIICGAIEQIAKFLGRNFPQSFFYRKWLERPCAWVESHTTTLSLLSSNVEIALGFLLIISLFT

A0A6J1CQK8 uncharacterized protein LOC111013250 isoform X1 (e value: 3.4e-119; % identity: 82.08)
Query:  MGEISEETQRLKRIAAAAYDYDNDSRWNDYWSNILIPPNLVSRSDVIDHFKRKFYQRYIDPDLVVETMSTSGSLFHPASQSAPSPPTNDQTQAQSSGSTT
        M E SEETQRLKRIAAAAYDYDNDSRW DYWSNILIPPNLVSRSDVIDHFKRKFYQR+IDPDLVV+T+S S      A QSAPSPPTNDQTQAQSSGSTT
Subjt:  MGEISEETQRLKRIAAAAYDYDNDSRWNDYWSNILIPPNLVSRSDVIDHFKRKFYQRYIDPDLVVETMSTSGSLFHPASQSAPSPPTNDQTQAQSSGSTT

Query:  RNSATSATTDSISNSSRWDRHMVLFSLNAWVLIVALFAMIPMVPRNLSHRAYRLSIMGTACSSLFSLFIKCGMPRAFDMQGLEIYFQSVVATKAFVYFIY
         NS TS TTDS   S     HM+LFS+NAWVLIVALFAM+PMVP+NLSHRA+RLS MGT  SSLFSL I CG P+A DMQ LE+YFQSVVATKAFVYFIY
Subjt:  RNSATSATTDSISNSSRWDRHMVLFSLNAWVLIVALFAMIPMVPRNLSHRAYRLSIMGTACSSLFSLFIKCGMPRAFDMQGLEIYFQSVVATKAFVYFIY

Query:  SVTFVASNLCLKFALIPIICGAIEQIAKFLGRNFPQSFFYRKWLERPCAWVESHTTTLSLLSSNVEIALGFLLIISLFT
         +TFVASNLCLKFALIPIICG IEQIAKFL R+FP+S FYRK LERPCAWVES+TTTLSLLSSNVEIALGFLLIISLF+
Subjt:  SVTFVASNLCLKFALIPIICGAIEQIAKFLGRNFPQSFFYRKWLERPCAWVESHTTTLSLLSSNVEIALGFLLIISLFT

A0A6J1GAW8 uncharacterized protein LOC111452483 (e value: 2.7e-116; % identity: 78.14)
Query:  MGEISEETQRLKRIAAAAYDYDNDSRWNDYWSNILIPPNLVSRSDVIDHFKRKFYQRYIDPDLVVETMSTSGSLFHPASQSAPSPPTNDQTQAQSSGSTT
        MGEISEETQRLKRIAAA YDYDND RW DYWSNILIPPNLVSRSDV DHFKRKFYQRYIDPDLVVETMS+  S F P  QSAPSPPTNDQTQ+QSSGSTT
Subjt:  MGEISEETQRLKRIAAAAYDYDNDSRWNDYWSNILIPPNLVSRSDVIDHFKRKFYQRYIDPDLVVETMSTSGSLFHPASQSAPSPPTNDQTQAQSSGSTT

Query:  RNSATSATTDSISNSSRWDRHMVLFSLNAWVLIVALFAMIPMVPRNLSHRAYRLSIMGTACSSLFSLFIKCGMPRAFDMQGLEIYFQSVVATKAFVYFIY
        RN  TSATT S     RWDRH VLF +NAWV +++  A++PMVPRN+SHRAY++S+MGTACSSLF+LFI CG+PR+FDMQ LE+YF+ VVATKA VY IY
Subjt:  RNSATSATTDSISNSSRWDRHMVLFSLNAWVLIVALFAMIPMVPRNLSHRAYRLSIMGTACSSLFSLFIKCGMPRAFDMQGLEIYFQSVVATKAFVYFIY

Query:  SVTFVASNLCLKFALIPIICGAIEQIAKFLGRNFPQSFFYRKWLERPCAWVESHTTTLSLLSSNVEIALGFLLIISLFT
        S+TFVASNL LK ALIPIIC A+EQI+K++  NFP+S FYRK LERPCAWVES+T TL LLSSNVEIALGFLLIISLFT
Subjt:  SVTFVASNLCLKFALIPIICGAIEQIAKFLGRNFPQSFFYRKWLERPCAWVESHTTTLSLLSSNVEIALGFLLIISLFT

A0A6J1KF32 transmembrane protein 33 homolog (e value: 5.3e-120; % identity: 81)
Query:  MGEISEETQRLKRIAAAAYDYDNDSRWNDYWSNILIPPNLVSRSDVIDHFKRKFYQRYIDPDLVVETMSTSGSLFHPASQSAPSPPTNDQTQAQSSGSTT
        MGEISEETQRLKRIAAA YDYDND RW DYWSNILIPPNLVSRSDV DHFKRKFYQRYIDP+LVVETMS+S S F P  QSAPSPPTNDQTQ+QSSGSTT
Subjt:  MGEISEETQRLKRIAAAAYDYDNDSRWNDYWSNILIPPNLVSRSDVIDHFKRKFYQRYIDPDLVVETMSTSGSLFHPASQSAPSPPTNDQTQAQSSGSTT

Query:  RNSATSATTDSISNSSRWDRHMVLFSLNAWVLIVALFAMIPMVPRNLSHRAYRLSIMGTACSSLFSLFIKCGMPRAFDMQGLEIYFQSVVATKAFVYFIY
        RNS TSATTDS     RWDRH VLF +NAWV ++  FAMIPMVPRN+SHRAY++S+MGTACSSLF+LFI CG PR+FDMQ LE+YFQ VVATKA VY IY
Subjt:  RNSATSATTDSISNSSRWDRHMVLFSLNAWVLIVALFAMIPMVPRNLSHRAYRLSIMGTACSSLFSLFIKCGMPRAFDMQGLEIYFQSVVATKAFVYFIY

Query:  SVTFVASNLCLKFALIPIICGAIEQIAKFLGRNFPQSFFYRKWLERPCAWVESHTTTLSLLSSNVEIALGFLLIISLFT
        S+TFVASNL LK ALIPIIC A+EQIAKF+  NFP+S FYRK LERPCAW ES+TTTL LLSSNVEIALGFLLIISLFT
Subjt:  SVTFVASNLCLKFALIPIICGAIEQIAKFLGRNFPQSFFYRKWLERPCAWVESHTTTLSLLSSNVEIALGFLLIISLFT

W9RU36 Uncharacterized protein (e value: 6.3e-97; % identity: 64.77)
Query:  MGEISEETQRLKRIAAAAYDYDNDSRWNDYWSNILIPPNLVSRSDVIDHFKRKFYQRYIDPDLVVETMST--SGSLFHPASQSAPSPPTNDQTQAQSSGS
        MGE  E+ Q+LKRIAAAAYDYDND RW+DYWSN+L+PP+L SRSDV DHFKRKFYQRYIDPDLVVE+MST  S     PAS S  +PP+NDQ +++ +GS
Subjt:  MGEISEETQRLKRIAAAAYDYDNDSRWNDYWSNILIPPNLVSRSDVIDHFKRKFYQRYIDPDLVVETMST--SGSLFHPASQSAPSPPTNDQTQAQSSGS

Query:  TTRNSATSATTDSISNSSRWDRHMVLFSLNAWVLIVALFAMIPMVPRNLSHRAYRLSIMGTACSSLFSLFIKCGMPRAFDMQGLEIYFQSVVATKAFVYF
        T R S TSA   S   S RWDR  + FS+NAWV IVA+ A+ P+VP+ LSHRAYRLS MGTACSSL+S++   G PRA+++Q L++YFQS++ATK F+YF
Subjt:  TTRNSATSATTDSISNSSRWDRHMVLFSLNAWVLIVALFAMIPMVPRNLSHRAYRLSIMGTACSSLFSLFIKCGMPRAFDMQGLEIYFQSVVATKAFVYF

Query:  IYSVTFVASNLCLKFALIPIICGAIEQIAKFLGRNFPQSFFYRKWLERPCAWVESHTTTLSLLSSNVEIALGFLLIISLFT
        IY +TF  S+LCLKFA IPI+C A+E +AKFL RNF +S  YRK+LE PC WVES+TTTLS+LSS  EI +GFLLIIS+F+
Subjt:  IYSVTFVASNLCLKFALIPIICGAIEQIAKFLGRNFPQSFFYRKWLERPCAWVESHTTTLSLLSSNVEIALGFLLIISLFT

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT3G02420.1 unknown protein (e value: 1.6e-97; % identity: 62.11)
Query:  MGEISEETQRLKRIAAAAYDYDNDSRWNDYWSNILIPPNLVSRSDVIDHFKRKFYQRYIDPDLVVETMSTSGSLFHPA------SQSAPSPPTNDQTQAQ
        M E  E++QRLK+IAAAA+DY+ND+RW DYWSNILIPP++ SR +V+DHFKRKFYQRYIDPDLVVE MSTS S    A      + S  S   N+Q +++
Subjt:  MGEISEETQRLKRIAAAAYDYDNDSRWNDYWSNILIPPNLVSRSDVIDHFKRKFYQRYIDPDLVVETMSTSGSLFHPA------SQSAPSPPTNDQTQAQ

Query:  SSGSTTRNSATSATTDSISNSSRWDRHMVLFSLNAWVLIVALFAMIPMVPRNLSHRAYRLSIMGTACSSLFSLFIKCGMPRAFDMQGLEIYFQSVVATKA
        +SGS  R S  SATT +  +S RWD   + FS+NAWV ++A+ A++P++P+NLS+RAYRLS MGTACSSL+SL+   G PRA++MQGL++YFQS+VA K 
Subjt:  SSGSTTRNSATSATTDSISNSSRWDRHMVLFSLNAWVLIVALFAMIPMVPRNLSHRAYRLSIMGTACSSLFSLFIKCGMPRAFDMQGLEIYFQSVVATKA

Query:  FVYFIYSVTFVASNLCLKFALIPIICGAIEQIAKFLGRNFPQSFFYRKWLERPCAWVESHTTTLSLLSSNVEIALGFLLIISLFT
        F+YFIY +TFV S+LCLKFALIPI+C A+EQ+AKFL RNF +S  YRK+LE PC WVES+TTTL++LSS  EIA+GFLLIISL +
Subjt:  FVYFIYSVTFVASNLCLKFALIPIICGAIEQIAKFLGRNFPQSFFYRKWLERPCAWVESHTTTLSLLSSNVEIALGFLLIISLFT


Sequences
CDS sequence
ATGGGGGAAATCAGCGAAGAAACTCAGAGGCTTAAGCGAATTGCGGCGGCTGCATACGACTACGACAATGACTCCAGATGGAACGACTACTGGTCCAACATTCTCATTCC
TCCTAACTTGGTTTCTCGTTCCGACGTCATCGATCACTTCAAACGCAAATTCTATCAACGTTACATCGATCCTGATCTTGTAGTTGAGACAATGTCTACCAGCGGTTCAT
TATTTCATCCAGCATCGCAATCAGCACCTTCACCTCCTACAAATGATCAAACTCAGGCACAGAGCTCAGGGTCAACTACAAGAAATTCAGCAACATCGGCAACTACGGAC
TCAATTTCAAATTCTTCACGCTGGGATCGACATATGGTTCTTTTTTCACTCAATGCCTGGGTGCTTATTGTGGCATTGTTTGCAATGATACCAATGGTACCCAGAAACCT
TTCACATAGGGCTTATCGTCTTTCTATTATGGGCACTGCATGCTCCTCTCTGTTCTCCTTGTTCATCAAATGTGGGATGCCCAGGGCATTTGATATGCAGGGTTTGGAAA
TTTATTTCCAGTCTGTCGTTGCAACAAAAGCTTTTGTCTACTTCATTTACTCTGTTACCTTTGTAGCTTCAAATCTCTGCCTTAAATTTGCTTTAATTCCGATAATATGT
GGAGCCATTGAGCAGATTGCCAAGTTCCTTGGGCGTAATTTCCCTCAATCTTTCTTCTACAGGAAATGGTTGGAGCGGCCTTGTGCTTGGGTGGAATCACATACAACCAC
TCTTTCTCTTCTGTCTTCAAATGTTGAGATTGCACTGGGTTTCCTTCTGATCATCTCCTTGTTTACGTCGATGGCAGCGAAACTTCGTGCAAACATTCATGTACTGGCAG
CTACTGAAGCTGATGTATCACTTCCCCATGACTGCTGGGTTTCATCAGAGTGCCTGGGCTAA
mRNA sequence
ATAGAAGGAAGAGGAAGGATCGGCGGGTAAGAGAATGAGATGATCATCAACGTTAAAAGTGGAAGAAACGAGTCGTCCTTCGTCTTCTGCTAGCACAGTCATACCAGAGA
GAAAAATGGGGGAAATCAGCGAAGAAACTCAGAGGCTTAAGCGAATTGCGGCGGCTGCATACGACTACGACAATGACTCCAGATGGAACGACTACTGGTCCAACATTCTC
ATTCCTCCTAACTTGGTTTCTCGTTCCGACGTCATCGATCACTTCAAACGCAAATTCTATCAACGTTACATCGATCCTGATCTTGTAGTTGAGACAATGTCTACCAGCGG
TTCATTATTTCATCCAGCATCGCAATCAGCACCTTCACCTCCTACAAATGATCAAACTCAGGCACAGAGCTCAGGGTCAACTACAAGAAATTCAGCAACATCGGCAACTA
CGGACTCAATTTCAAATTCTTCACGCTGGGATCGACATATGGTTCTTTTTTCACTCAATGCCTGGGTGCTTATTGTGGCATTGTTTGCAATGATACCAATGGTACCCAGA
AACCTTTCACATAGGGCTTATCGTCTTTCTATTATGGGCACTGCATGCTCCTCTCTGTTCTCCTTGTTCATCAAATGTGGGATGCCCAGGGCATTTGATATGCAGGGTTT
GGAAATTTATTTCCAGTCTGTCGTTGCAACAAAAGCTTTTGTCTACTTCATTTACTCTGTTACCTTTGTAGCTTCAAATCTCTGCCTTAAATTTGCTTTAATTCCGATAA
TATGTGGAGCCATTGAGCAGATTGCCAAGTTCCTTGGGCGTAATTTCCCTCAATCTTTCTTCTACAGGAAATGGTTGGAGCGGCCTTGTGCTTGGGTGGAATCACATACA
ACCACTCTTTCTCTTCTGTCTTCAAATGTTGAGATTGCACTGGGTTTCCTTCTGATCATCTCCTTGTTTACGTCGATGGCAGCGAAACTTCGTGCAAACATTCATGTACT
GGCAGCTACTGAAGCTGATGTATCACTTCCCCATGACTGCTGGGTTTCATCAGAGTGCCTGGGCTAAGGTTGGGAGAAAAGTTAATCCATTTGTCAAGAGGTTTCTCCCA
TTTCTCAAACCTTTACTTTCTGCAGCTCAAAGTTGGTGGACCAGGTAGAGAAAAACTGAGTAATGAAGTGTTGCTGCTATTAGCTTGCTTGCTTTTAGTCTAATGAATGA
ATACTCATTTTCTTAGTTTGAAGATAAAAGGCTTCTGTCATATATGGTTGACAAAATCAGCCTCCATTCAAGCAAATGAACCCTGTTAATTGTGGTTTCTGGATGGAGAT
TTTAAATGGTCATTTCTTATAGGTTAATCTTCTCCTTTTCCTTGCTTTCCAATGACCATATACTGTAAATTTTCTGTTTAGTGAATTGAATTGAACCTTTTCTTTTCTCA
TTTGCAA
Protein sequence
MGEISEETQRLKRIAAAAYDYDNDSRWNDYWSNILIPPNLVSRSDVIDHFKRKFYQRYIDPDLVVETMSTSGSLFHPASQSAPSPPTNDQTQAQSSGSTTRNSATSATTD
SISNSSRWDRHMVLFSLNAWVLIVALFAMIPMVPRNLSHRAYRLSIMGTACSSLFSLFIKCGMPRAFDMQGLEIYFQSVVATKAFVYFIYSVTFVASNLCLKFALIPIIC
GAIEQIAKFLGRNFPQSFFYRKWLERPCAWVESHTTTLSLLSSNVEIALGFLLIISLFTSMAAKLRANIHVLAATEADVSLPHDCWVSSECLG
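
The CDS and protein sequences listed for this gene can be checked against each other with a quick structural test. The Python sketch below is only illustrative: the function name cds_consistent_with_protein is hypothetical, it assumes the standard genetic code's stop codons, and it assumes the protein string carries no trailing stop symbol. It checks lengths and the start and stop codons; it does not translate the CDS.

```python
def cds_consistent_with_protein(cds: str, protein: str) -> bool:
    """Return True if a CDS has the basic shape expected for a given protein.

    Whitespace and line breaks are ignored, so sequences can be pasted as
    wrapped blocks.
    """
    cds = "".join(cds.split()).upper()
    protein = "".join(protein.split()).upper()
    stop_codons = {"TAA", "TAG", "TGA"}  # standard genetic code
    return (
        len(cds) % 3 == 0                        # whole number of codons
        and cds.startswith("ATG")                # begins with a start codon
        and cds[-3:] in stop_codons              # ends with a stop codon
        and len(cds) == 3 * (len(protein) + 1)   # one codon per residue, plus the stop
    )
```

To use it, paste the CDS and protein blocks above as the two string arguments; a False result would suggest the sequences were truncated or copied incompletely.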