CuGenDBv2

Tan0009691 (gene) of Snake gourd v1 genome

Gene ID: Tan0009691
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Unknown protein
Genome location: LG03:79892362..79894997
RNA-Seq Expression: Tan0009691
Synteny: Tan0009691
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
                     GO:0005515 - protein binding (molecular function)
InterPro domains: IPR011990 - Tetratricopeptide-like helical domain superfamily
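
The genome location string above can be split into its parts programmatically. The following is a minimal sketch, not part of the database record: the helper name `parse_location` is hypothetical, and the span length assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


seqid, start, end = parse_location("LG03:79892362..79894997")
# Assumption: 1-based inclusive coordinates, so length = end - start + 1.
span = end - start + 1
print(seqid, start, end, span)  # LG03 79892362 79894997 2636
```

Under that assumption the gene spans 2636 bp on LG03, which is longer than the mRNA listed further down and would be consistent with the presence of intronic sequence.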


Homology
GenBank top hits (e value, % identity, alignment)
KAG7026464.1 hypothetical protein SDJN02_10464, partial [Cucurbita argyrosperma subsp. argyrosperma]; e value: 1.2e-100; % identity: 72.03
Query:  MPDFSLEFARDIFLIVRNFLKSFKIQTKYGTTAGAAASSAIISGIGLVLIYVYTQRKKEKNDQRVYTRSMSMGALHGGKLAMKRLLQYHRMRATQKHQYD
        M  FSL FARD+F +++NFLKSFKIQTKYGTTAGA ASS IISGIGL+LIY YTQRKKEK  QRV+TRSMS+GALHGG++AMKR+LQYH++RA Q+ Q+D
Subjt:  MPDFSLEFARDIFLIVRNFLKSFKIQTKYGTTAGAAASSAIISGIGLVLIYVYTQRKKEKNDQRVYTRSMSMGALHGGKLAMKRLLQYHRMRATQKHQYD

Query:  YLEKLEKMINTDSPDFPRLQNILAKLEMRGQEDKAIEILKKAAKQAKEKSLPQHEYEYQMLLVEALIYKGNIAEAEMVSCLNNDDTSDVRRSLYKAIIQV
        YLEKLE  +NTD+PDFP +QN++AK+EM GQEDKAIEILKKA K+AKEKSL  HEYEYQMLLVEALIYKG+I EA    CLN+D+ SDVRR LYK II++
Subjt:  YLEKLEKMINTDSPDFPRLQNILAKLEMRGQEDKAIEILKKAAKQAKEKSLPQHEYEYQMLLVEALIYKGNIAEAEMVSCLNNDDTSDVRRSLYKAIIQV

Query:  LLNNQEKAEEEWQEFKKMRSQFLLPPDVKDTHFYKLVTDFQKFKQVVDLLKKDIYEKKKAK
        LLNN ++AEEEW++F++MR QF LPPD+KD+ FYKLV  F+ FK+VVDLLK+DI +KKK K
Subjt:  LLNNQEKAEEEWQEFKKMRSQFLLPPDVKDTHFYKLVTDFQKFKQVVDLLKKDIYEKKKAK

XP_022146501.1 uncharacterized protein LOC111015701 [Momordica charantia]; e value: 1.2e-103; % identity: 77.39
Query:  MPDFSLEFARDIFLIVRNFLKSFKIQTKYGTTAGAAASSAIISGIGLVLIYVYTQRKKEKNDQRVYTRSMSMGALHGGKLAMKRLLQYHRMRATQKHQYD
        MP FSL+ ARD  LI+RNFLKSFKIQTKYGT+A AAASS IISGIGLVLIYVYTQRK+EKND+RV+ RSMS+GALHGGKLAMKRLLQY +MRAT+K+Q  
Subjt:  MPDFSLEFARDIFLIVRNFLKSFKIQTKYGTTAGAAASSAIISGIGLVLIYVYTQRKKEKNDQRVYTRSMSMGALHGGKLAMKRLLQYHRMRATQKHQYD

Query:  YLEKLEKMINTDSPDFPRLQNILAKLEMRGQEDKAIEILKKAAKQAKEKSLPQHEYEYQMLLVEALIYKGNIAEAEMVSCLNNDDTSDVRRSLYKAIIQV
         LEKLEKMI   +PDF +LQ+I+AKLEMRGQEDKAIEILKKAAK+AKE SL  +EYEYQ+LLVE LIYKGNI EAE  SCLN ++TSDVRRSLYKAIIQV
Subjt:  YLEKLEKMINTDSPDFPRLQNILAKLEMRGQEDKAIEILKKAAKQAKEKSLPQHEYEYQMLLVEALIYKGNIAEAEMVSCLNNDDTSDVRRSLYKAIIQV

Query:  LLNNQEKAEEEWQEFKKMRSQFLLPPDVKDTHFYKLVTDFQKFKQVVDLLKKDIYEKKKAK
        LLNN +KA+E+W+EFK+MRS+FLLPPDVKD+ FYKLVT+F+ FKQVVDLL KDI E+ K K
Subjt:  LLNNQEKAEEEWQEFKKMRSQFLLPPDVKDTHFYKLVTDFQKFKQVVDLLKKDIYEKKKAK

XP_022926668.1 uncharacterized protein LOC111433729 [Cucurbita moschata]; e value: 1.1e-101; % identity: 72.8
Query:  MPDFSLEFARDIFLIVRNFLKSFKIQTKYGTTAGAAASSAIISGIGLVLIYVYTQRKKEKNDQRVYTRSMSMGALHGGKLAMKRLLQYHRMRATQKHQYD
        M  FSL FARD+F++++NFLKSFKIQTKYGTTAGA ASS IISGIGL+LIY YTQRKKEK  QRV+TRSMS+GALHGG++AMKR+LQYH++RA Q+ Q+D
Subjt:  MPDFSLEFARDIFLIVRNFLKSFKIQTKYGTTAGAAASSAIISGIGLVLIYVYTQRKKEKNDQRVYTRSMSMGALHGGKLAMKRLLQYHRMRATQKHQYD

Query:  YLEKLEKMINTDSPDFPRLQNILAKLEMRGQEDKAIEILKKAAKQAKEKSLPQHEYEYQMLLVEALIYKGNIAEAEMVSCLNNDDTSDVRRSLYKAIIQV
        YLEKLE M NTD+PDFP +Q++LAK+EM GQEDKAIEILKKA K+AKEKSL  HEYEYQMLLVEALIYKG+I EA    CLN+D+ SDVRR LYK II++
Subjt:  YLEKLEKMINTDSPDFPRLQNILAKLEMRGQEDKAIEILKKAAKQAKEKSLPQHEYEYQMLLVEALIYKGNIAEAEMVSCLNNDDTSDVRRSLYKAIIQV

Query:  LLNNQEKAEEEWQEFKKMRSQFLLPPDVKDTHFYKLVTDFQKFKQVVDLLKKDIYEKKKAK
        LLNN +KAEEEW++F++MR QF LPPD+KD+ FYKLV  F+ FK+VVDLLK+DI +KKK K
Subjt:  LLNNQEKAEEEWQEFKKMRSQFLLPPDVKDTHFYKLVTDFQKFKQVVDLLKKDIYEKKKAK

XP_023003990.1 uncharacterized protein LOC111497439 [Cucurbita maxima]; e value: 1.2e-100; % identity: 72.03
Query:  MPDFSLEFARDIFLIVRNFLKSFKIQTKYGTTAGAAASSAIISGIGLVLIYVYTQRKKEKNDQRVYTRSMSMGALHGGKLAMKRLLQYHRMRATQKHQYD
        M DFSL FARD+FL+++NFLKSFKIQTKYGTTAGA ASS IISGIGL+LIY YTQRKKEK  QRV+TRSMS+GALHGG++AMKR+LQY +MRA Q+ QYD
Subjt:  MPDFSLEFARDIFLIVRNFLKSFKIQTKYGTTAGAAASSAIISGIGLVLIYVYTQRKKEKNDQRVYTRSMSMGALHGGKLAMKRLLQYHRMRATQKHQYD

Query:  YLEKLEKMINTDSPDFPRLQNILAKLEMRGQEDKAIEILKKAAKQAKEKSLPQHEYEYQMLLVEALIYKGNIAEAEMVSCLNNDDTSDVRRSLYKAIIQV
        YLEKLE   +TD+PDFP +Q +L K+EMRGQEDKAIEILKKA K+AKE+SL  HEYEYQMLLVEALIYKG+I EA    CLN+D+ SDVRR LYK II++
Subjt:  YLEKLEKMINTDSPDFPRLQNILAKLEMRGQEDKAIEILKKAAKQAKEKSLPQHEYEYQMLLVEALIYKGNIAEAEMVSCLNNDDTSDVRRSLYKAIIQV

Query:  LLNNQEKAEEEWQEFKKMRSQFLLPPDVKDTHFYKLVTDFQKFKQVVDLLKKDIYEKKKAK
        LLNN +KAEEEW++F++MR  F LPPD++D+ FYKLV  F+ FK+VVDLLK+DI +KKK K
Subjt:  LLNNQEKAEEEWQEFKKMRSQFLLPPDVKDTHFYKLVTDFQKFKQVVDLLKKDIYEKKKAK

XP_023518384.1 uncharacterized protein LOC111781887 [Cucurbita pepo subsp. pepo]; e value: 1.6e-100; % identity: 72.03
Query:  MPDFSLEFARDIFLIVRNFLKSFKIQTKYGTTAGAAASSAIISGIGLVLIYVYTQRKKEKNDQRVYTRSMSMGALHGGKLAMKRLLQYHRMRATQKHQYD
        M DFSL FARD+ ++++NFLKSFKIQTKYGTTAGA ASS IISGIGL+LIY YTQRKKEK  QRV+TRSMS+GALHGG++AMKR+LQYH+MRA Q+ Q+ 
Subjt:  MPDFSLEFARDIFLIVRNFLKSFKIQTKYGTTAGAAASSAIISGIGLVLIYVYTQRKKEKNDQRVYTRSMSMGALHGGKLAMKRLLQYHRMRATQKHQYD

Query:  YLEKLEKMINTDSPDFPRLQNILAKLEMRGQEDKAIEILKKAAKQAKEKSLPQHEYEYQMLLVEALIYKGNIAEAEMVSCLNNDDTSDVRRSLYKAIIQV
        YLE LE M NTD+PDFP +QN+LAK+EM GQEDKAIEILKKA K+A EKSL  HEYEYQMLLVEALIYKG+I EA    CLN+D+ SDVRR LYK II++
Subjt:  YLEKLEKMINTDSPDFPRLQNILAKLEMRGQEDKAIEILKKAAKQAKEKSLPQHEYEYQMLLVEALIYKGNIAEAEMVSCLNNDDTSDVRRSLYKAIIQV

Query:  LLNNQEKAEEEWQEFKKMRSQFLLPPDVKDTHFYKLVTDFQKFKQVVDLLKKDIYEKKKAK
        LLNN ++AEEEW++F++MR QF LPPD+KD+ FYKLV  F+ FK+VVDLLK+DI +KKK K
Subjt:  LLNNQEKAEEEWQEFKKMRSQFLLPPDVKDTHFYKLVTDFQKFKQVVDLLKKDIYEKKKAK

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KLZ3 Uncharacterized protein; e value: 2.1e-77; % identity: 56.93
Query:  MPDFSLEFARDIFLIVRNFLKSFKIQTKYGTTAGAAASSAIISGIGLVLIYVYTQRKKEKNDQRVYTRSMSMGALHGGKLAMKRLLQYHRMRATQKHQYD
        M  FS +   D F  + NF + F +QTKYG  AGA AS+ I+SG+GLVL+Y  T+  K+KN QRV+TRS+S+GALHGGK+AMKRLLQ+ +MRA  +++  
Subjt:  MPDFSLEFARDIFLIVRNFLKSFKIQTKYGTTAGAAASSAIISGIGLVLIYVYTQRKKEKNDQRVYTRSMSMGALHGGKLAMKRLLQYHRMRATQKHQYD

Query:  YLEKLEKMINTDS------PDFPRLQNILAKLEMRGQEDKAIEILKKAAKQAKEKSLPQHEYEYQMLLVEALIYKGNIAEAEMVSCLNNDDTSDVRRSLY
        +++KL+  I  D+      P+F ++QNI+ KLEM GQEDKAIE LK AA++AK+KSLP +E+EYQMLLVE  IYKG++ +AE + CL ND TSDVRR LY
Subjt:  YLEKLEKMINTDS------PDFPRLQNILAKLEMRGQEDKAIEILKKAAKQAKEKSLPQHEYEYQMLLVEALIYKGNIAEAEMVSCLNNDDTSDVRRSLY

Query:  KAIIQVLLNNQEKAEEEWQEFKKMRSQFLLPPDVKDTHFYKLVTDFQKFKQVVDLLKKDIYEKKKAK
        KAII+VL N  ++A +EW+EF++MRS FLLPPDVKD+HFY L+ DF  FK+VV +L++DI++K +AK
Subjt:  KAIIQVLLNNQEKAEEEWQEFKKMRSQFLLPPDVKDTHFYKLVTDFQKFKQVVDLLKKDIYEKKKAK

A0A5D3CMZ4 Uncharacterized protein; e value: 1.1e-46; % identity: 60.23
Query:  MRATQKHQYDYLEKLEKMINTDSPDFPRLQNILAKLEMRGQEDKAIEILKKAAKQAKEKSLPQHEYEYQMLLVEALIYKGNIAEAEMVSCLNNDDTSDVR
        MRA  + +  YL+KL+  I +D PDF +LQNI+AKLEM GQEDK IE LK AA++A EKS P +EYEYQMLLVE  IYKG  A+AE + CLNN+D SDVR
Subjt:  MRATQKHQYDYLEKLEKMINTDSPDFPRLQNILAKLEMRGQEDKAIEILKKAAKQAKEKSLPQHEYEYQMLLVEALIYKGNIAEAEMVSCLNNDDTSDVR

Query:  RSLYKAIIQVLLNNQEKAEEEWQEFKKMRSQFLLPPDVKDTHFYKLVTDFQKFKQVVDLLKKDIYEKKKAK
        R L+KAII+VLLN  ++A +EW+EF+K+RS +LLPPDVKD+ FY L+ DF  F++VV +L++DI++K +AK
Subjt:  RSLYKAIIQVLLNNQEKAEEEWQEFKKMRSQFLLPPDVKDTHFYKLVTDFQKFKQVVDLLKKDIYEKKKAK

A0A6J1CZJ5 uncharacterized protein LOC111015701; e value: 5.8e-104; % identity: 77.39
Query:  MPDFSLEFARDIFLIVRNFLKSFKIQTKYGTTAGAAASSAIISGIGLVLIYVYTQRKKEKNDQRVYTRSMSMGALHGGKLAMKRLLQYHRMRATQKHQYD
        MP FSL+ ARD  LI+RNFLKSFKIQTKYGT+A AAASS IISGIGLVLIYVYTQRK+EKND+RV+ RSMS+GALHGGKLAMKRLLQY +MRAT+K+Q  
Subjt:  MPDFSLEFARDIFLIVRNFLKSFKIQTKYGTTAGAAASSAIISGIGLVLIYVYTQRKKEKNDQRVYTRSMSMGALHGGKLAMKRLLQYHRMRATQKHQYD

Query:  YLEKLEKMINTDSPDFPRLQNILAKLEMRGQEDKAIEILKKAAKQAKEKSLPQHEYEYQMLLVEALIYKGNIAEAEMVSCLNNDDTSDVRRSLYKAIIQV
         LEKLEKMI   +PDF +LQ+I+AKLEMRGQEDKAIEILKKAAK+AKE SL  +EYEYQ+LLVE LIYKGNI EAE  SCLN ++TSDVRRSLYKAIIQV
Subjt:  YLEKLEKMINTDSPDFPRLQNILAKLEMRGQEDKAIEILKKAAKQAKEKSLPQHEYEYQMLLVEALIYKGNIAEAEMVSCLNNDDTSDVRRSLYKAIIQV

Query:  LLNNQEKAEEEWQEFKKMRSQFLLPPDVKDTHFYKLVTDFQKFKQVVDLLKKDIYEKKKAK
        LLNN +KA+E+W+EFK+MRS+FLLPPDVKD+ FYKLVT+F+ FKQVVDLL KDI E+ K K
Subjt:  LLNNQEKAEEEWQEFKKMRSQFLLPPDVKDTHFYKLVTDFQKFKQVVDLLKKDIYEKKKAK

A0A6J1EIU2 uncharacterized protein LOC111433729; e value: 5.4e-102; % identity: 72.8
Query:  MPDFSLEFARDIFLIVRNFLKSFKIQTKYGTTAGAAASSAIISGIGLVLIYVYTQRKKEKNDQRVYTRSMSMGALHGGKLAMKRLLQYHRMRATQKHQYD
        M  FSL FARD+F++++NFLKSFKIQTKYGTTAGA ASS IISGIGL+LIY YTQRKKEK  QRV+TRSMS+GALHGG++AMKR+LQYH++RA Q+ Q+D
Subjt:  MPDFSLEFARDIFLIVRNFLKSFKIQTKYGTTAGAAASSAIISGIGLVLIYVYTQRKKEKNDQRVYTRSMSMGALHGGKLAMKRLLQYHRMRATQKHQYD

Query:  YLEKLEKMINTDSPDFPRLQNILAKLEMRGQEDKAIEILKKAAKQAKEKSLPQHEYEYQMLLVEALIYKGNIAEAEMVSCLNNDDTSDVRRSLYKAIIQV
        YLEKLE M NTD+PDFP +Q++LAK+EM GQEDKAIEILKKA K+AKEKSL  HEYEYQMLLVEALIYKG+I EA    CLN+D+ SDVRR LYK II++
Subjt:  YLEKLEKMINTDSPDFPRLQNILAKLEMRGQEDKAIEILKKAAKQAKEKSLPQHEYEYQMLLVEALIYKGNIAEAEMVSCLNNDDTSDVRRSLYKAIIQV

Query:  LLNNQEKAEEEWQEFKKMRSQFLLPPDVKDTHFYKLVTDFQKFKQVVDLLKKDIYEKKKAK
        LLNN +KAEEEW++F++MR QF LPPD+KD+ FYKLV  F+ FK+VVDLLK+DI +KKK K
Subjt:  LLNNQEKAEEEWQEFKKMRSQFLLPPDVKDTHFYKLVTDFQKFKQVVDLLKKDIYEKKKAK

A0A6J1KTB7 uncharacterized protein LOC111497439; e value: 6.0e-101; % identity: 72.03
Query:  MPDFSLEFARDIFLIVRNFLKSFKIQTKYGTTAGAAASSAIISGIGLVLIYVYTQRKKEKNDQRVYTRSMSMGALHGGKLAMKRLLQYHRMRATQKHQYD
        M DFSL FARD+FL+++NFLKSFKIQTKYGTTAGA ASS IISGIGL+LIY YTQRKKEK  QRV+TRSMS+GALHGG++AMKR+LQY +MRA Q+ QYD
Subjt:  MPDFSLEFARDIFLIVRNFLKSFKIQTKYGTTAGAAASSAIISGIGLVLIYVYTQRKKEKNDQRVYTRSMSMGALHGGKLAMKRLLQYHRMRATQKHQYD

Query:  YLEKLEKMINTDSPDFPRLQNILAKLEMRGQEDKAIEILKKAAKQAKEKSLPQHEYEYQMLLVEALIYKGNIAEAEMVSCLNNDDTSDVRRSLYKAIIQV
        YLEKLE   +TD+PDFP +Q +L K+EMRGQEDKAIEILKKA K+AKE+SL  HEYEYQMLLVEALIYKG+I EA    CLN+D+ SDVRR LYK II++
Subjt:  YLEKLEKMINTDSPDFPRLQNILAKLEMRGQEDKAIEILKKAAKQAKEKSLPQHEYEYQMLLVEALIYKGNIAEAEMVSCLNNDDTSDVRRSLYKAIIQV

Query:  LLNNQEKAEEEWQEFKKMRSQFLLPPDVKDTHFYKLVTDFQKFKQVVDLLKKDIYEKKKAK
        LLNN +KAEEEW++F++MR  F LPPD++D+ FYKLV  F+ FK+VVDLLK+DI +KKK K
Subjt:  LLNNQEKAEEEWQEFKKMRSQFLLPPDVKDTHFYKLVTDFQKFKQVVDLLKKDIYEKKKAK

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT2G34530.1 unknown protein; e value: 3.3e-27; % identity: 35.82
Query:  DQRVYTRSMSMGALHGGKLAMKRLLQYHRMRATQKHQYDYLEKLEKMINTDSPDFPRLQNILAKLEMRGQEDKAIEILKKAAKQAKEKSLPQHEYEYQML
        D+R  ++S+SMGA+ GGKLA++RLL  H  R       +   + E +++ + PDF  LQ  + K+EM G+E K  E+LKKA ++A+++      YE +ML
Subjt:  DQRVYTRSMSMGALHGGKLAMKRLLQYHRMRATQKHQYDYLEKLEKMINTDSPDFPRLQNILAKLEMRGQEDKAIEILKKAAKQAKEKSLPQHEYEYQML

Query:  LVEALIYKGNIAEAEMVSCLNNDDTSDVRRSLYKAIIQVLLNN-QEKAEEEWQEFKKMRSQFLLPPDVKDTHFYKLVTDFQKFKQVVDLLKKDIYEKKKA
        LVE LIY GN+ EA    CL ++  +D RR LY+ II  L  +  ++ EE +  F++++     P   ++    ++   F +FK+V++ LK +I +  K 
Subjt:  LVEALIYKGNIAEAEMVSCLNNDDTSDVRRSLYKAIIQVLLNN-QEKAEEEWQEFKKMRSQFLLPPDVKDTHFYKLVTDFQKFKQVVDLLKKDIYEKKKA

Query:  K
        K
Subjt:  K

AT2G34530.2 unknown protein; e value: 2.2e-23; % identity: 42.22
Query:  DQRVYTRSMSMGALHGGKLAMKRLLQYHRMRATQKHQYDYLEKLEKMINTDSPDFPRLQNILAKLEMRGQEDKAIEILKKAAKQAKEKSLPQHEYEYQML
        D+R  ++S+SMGA+ GGKLA++RLL  H  R       +   + E +++ + PDF  LQ  + K+EM G+E K  E+LKKA ++A+++      YE +ML
Subjt:  DQRVYTRSMSMGALHGGKLAMKRLLQYHRMRATQKHQYDYLEKLEKMINTDSPDFPRLQNILAKLEMRGQEDKAIEILKKAAKQAKEKSLPQHEYEYQML

Query:  LVEALIYKGNIAEAEMVSCLNNDDTSDVRRSLYKA
        LVE LIY GN+ EA    CL ++  +D RR LY+A
Subjt:  LVEALIYKGNIAEAEMVSCLNNDDTSDVRRSLYKA

AT2G34540.2 unknown protein; e value: 9.0e-09; % identity: 31.72
Query:  KLAMKRLLQYHRMRATQK----HQYDYLEKLEKMINTDSPDFPRLQNILAKLEMRGQEDKAIEILKKAAKQAKEKSLPQHEYEYQMLLVEALIYKGNIAE
        K A++ L +   M A+ K     +   L KL  + + D  D  +++ +    E  G+ ++A+++L+ A    + ++ P+  +  QM LVE LI      E
Subjt:  KLAMKRLLQYHRMRATQK----HQYDYLEKLEKMINTDSPDFPRLQNILAKLEMRGQEDKAIEILKKAAKQAKEKSLPQHEYEYQMLLVEALIYKGNIAE

Query:  AEMVSCLNNDDT--SDVRRSLYKAIIQVLLNNQEKAEEEWQEFKK
        A   SCLN+++   SDVR  LYKAII  +L+   +A++ W+EF+K
Subjt:  AEMVSCLNNDDT--SDVRRSLYKAIIQVLLNNQEKAEEEWQEFKK
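
The % identity values in the hit lists above relate to the unlabeled middle line of each alignment block. In BLAST-style output, a letter on that line marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below counts these for the first 30 columns of the KAG7026464.1 alignment only; the function name `midline_stats` is hypothetical, and the calculation assumes identity is reported as identical columns divided by alignment length.

```python
def midline_stats(midline: str) -> tuple[int, int, int]:
    """Return (identical, positives, length) for a BLAST-style midline.

    Letters count as identities, '+' as conservative substitutions.
    The midline must keep its full width, including trailing spaces.
    """
    identical = sum(ch.isalpha() for ch in midline)
    positives = identical + midline.count("+")
    return identical, positives, len(midline)


# First 30 midline columns of the KAG7026464.1 alignment (illustration only).
midline = "M  FSL FARD+F +++NFLKSFKIQTKYG"
identical, positives, length = midline_stats(midline)
print(f"{identical}/{length} identical, {positives}/{length} positives")
# 22/30 identical, 26/30 positives
```

Applied to a whole alignment, the same ratio would be expected to match the tabulated figure; for example, 72.03% over the 261-residue query corresponds to 188 identical columns. Midlines copied from a web page may have lost trailing spaces, so the length is safer to take from the Query line.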


Sequences
CDS sequence
ATGCCAGATTTCTCATTAGAATTTGCAAGAGACATTTTTCTGATAGTGAGAAACTTCCTCAAGAGTTTTAAGATCCAAACCAAATATGGAACAACAGCAGGGGCTGCAGC
ATCGAGTGCCATCATCTCGGGTATCGGTCTCGTGTTGATTTATGTCTATACTCAGAGGAAAAAGGAGAAAAATGACCAGCGTGTTTATACGAGATCAATGTCCATGGGAG
CTCTACATGGTGGCAAACTAGCCATGAAAAGATTGCTTCAATACCATAGAATGCGAGCAACTCAAAAACATCAATATGATTATCTCGAGAAGTTGGAGAAAATGATCAAC
ACAGACTCTCCTGATTTCCCAAGGCTTCAGAACATTCTGGCAAAGCTGGAAATGAGAGGACAAGAAGACAAAGCTATTGAAATATTAAAAAAAGCAGCAAAACAAGCTAA
GGAGAAATCACTTCCACAACATGAATATGAATATCAGATGCTTCTTGTGGAAGCGCTCATTTACAAGGGAAATATTGCGGAGGCAGAAATGGTTTCATGCTTGAATAATG
ATGACACTTCAGATGTTCGACGCTCATTATATAAGGCAATAATTCAAGTGCTGCTGAATAATCAAGAAAAGGCAGAAGAAGAATGGCAAGAGTTTAAGAAAATGAGAAGC
CAATTCCTCTTACCTCCTGACGTTAAAGACACTCATTTTTACAAGCTCGTGACCGATTTCCAGAAGTTTAAACAAGTCGTCGACCTGCTCAAAAAAGACATTTATGAGAA
GAAGAAAGCAAAATAA
mRNA sequence
TATCAATATTTCAAAGAGAAATAGGAAACTTGAATCATATCAAAGGCAAATATTCTACCATTTGACATCTCTATCAAACAGAGAGTAAACCAAATCGGATAGAAAACGAT
GATCCTGGTAAAATTGTCCCTACATTCTAGCTTCCTTACCCAGGCAGAAGCAAAGAATCATTCTTTCCTCTACACAAGTTTCATGTTTTCAAGGTACAAAATCCATTTTA
AAATTACTTGCTGTTTGCCCTCTTTTAGCACCGCGAACTCGACATCTCTGATCTTGGAACATCCTTTGAAGTTTTGATTCCTGTTGACACAAATCCAAAATGCCAGATTT
CTCATTAGAATTTGCAAGAGACATTTTTCTGATAGTGAGAAACTTCCTCAAGAGTTTTAAGATCCAAACCAAATATGGAACAACAGCAGGGGCTGCAGCATCGAGTGCCA
TCATCTCGGGTATCGGTCTCGTGTTGATTTATGTCTATACTCAGAGGAAAAAGGAGAAAAATGACCAGCGTGTTTATACGAGATCAATGTCCATGGGAGCTCTACATGGT
GGCAAACTAGCCATGAAAAGATTGCTTCAATACCATAGAATGCGAGCAACTCAAAAACATCAATATGATTATCTCGAGAAGTTGGAGAAAATGATCAACACAGACTCTCC
TGATTTCCCAAGGCTTCAGAACATTCTGGCAAAGCTGGAAATGAGAGGACAAGAAGACAAAGCTATTGAAATATTAAAAAAAGCAGCAAAACAAGCTAAGGAGAAATCAC
TTCCACAACATGAATATGAATATCAGATGCTTCTTGTGGAAGCGCTCATTTACAAGGGAAATATTGCGGAGGCAGAAATGGTTTCATGCTTGAATAATGATGACACTTCA
GATGTTCGACGCTCATTATATAAGGCAATAATTCAAGTGCTGCTGAATAATCAAGAAAAGGCAGAAGAAGAATGGCAAGAGTTTAAGAAAATGAGAAGCCAATTCCTCTT
ACCTCCTGACGTTAAAGACACTCATTTTTACAAGCTCGTGACCGATTTCCAGAAGTTTAAACAAGTCGTCGACCTGCTCAAAAAAGACATTTATGAGAAGAAGAAAGCAA
AATAATGAAAGCACAAAAAATGGGCGTCGTCTAAAGCTAAAGCAAACGAGCCCTCTAATTGTTACTGTTTTCTTTCTTGGAAGTTGAAACTACATTTTACTGTAATTATG
TGTGCTGTGGCTGTGGGTAGAACATGAATAATTGAGGAACTCCAAATAAAATAAAATAAAAATGGAGGGGAAATTTGAATACCGCCATTTTCCGTTGGACTTGGAATACA
TACAATGAAGTAATAAAGTAATGGAATAGAATTAATGTGTATACATTGTAGGCTAGG
Protein sequence
MPDFSLEFARDIFLIVRNFLKSFKIQTKYGTTAGAAASSAIISGIGLVLIYVYTQRKKEKNDQRVYTRSMSMGALHGGKLAMKRLLQYHRMRATQKHQYDYLEKLEKMIN
TDSPDFPRLQNILAKLEMRGQEDKAIEILKKAAKQAKEKSLPQHEYEYQMLLVEALIYKGNIAEAEMVSCLNNDDTSDVRRSLYKAIIQVLLNNQEKAEEEWQEFKKMRS
QFLLPPDVKDTHFYKLVTDFQKFKQVVDLLKKDIYEKKKAK
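
The protein sequence above can be checked against the CDS by translating it codon by codon. The following is a minimal sketch under the assumption that the standard genetic code applies; the `translate` helper is hypothetical and is shown on two short in-frame fragments of the CDS (its first 30 and last 12 nucleotides) rather than the full sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing partial codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGCCAGATTTCTCATTAGAATTTGCAAGA"))  # MPDFSLEFAR
print(translate("AAAGCAAAATAA"))  # KAK
```

The two outputs match the first ten and last three residues of the protein sequence listed in this record. Passing the full CDS with line breaks removed would be expected to give the complete 261-residue protein.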