; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0000461 (gene) of Snake gourd v1 genome

Gene IDTan0000461
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionOrotidine 5'-phosphate decarboxylase
Genome locationLG03:77135813..77137706
RNA-Seq ExpressionTan0000461
SyntenyTan0000461
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004134770.1 uncharacterized protein LOC101221037 isoform X1 [Cucumis sativus]1.8e-13590.88Show/hide
Query:  MLTVYAPGANKKSLACLDSIPLPEASHANSLTCSSSVSLCPLR-SSCNSILHNPSSFKRCSQLSTHYVLNRFAFRRWLCRSDSSPSPSDDEYRSSRNIAI
        MLTVY+PGANKK LACLDSIPL EASHANSL  S+SVSLCPL+ SS N+ILHNPSSFKRC+QL+T  VLNR +FRR LCRSDSSP PSDDEYRSSRNIAI
Subjt:  MLTVYAPGANKKSLACLDSIPLPEASHANSLTCSSSVSLCPLR-SSCNSILHNPSSFKRCSQLSTHYVLNRFAFRRWLCRSDSSPSPSDDEYRSSRNIAI

Query:  SLFRRYRNFVDRGGGENLKDFISAGVNAYALGCTDEGLRKELIDMKESGLEIEVMQSYGGSTGLKSKIISGEIEECILWLSIIFITILCTPQPTVVRWSS
        SLFRRYRNFVDRGGG+NLKDFISAGVNAYALGCTDEGLRKELIDMKE+G EIEVMQSYGGSTGLKSKIISGEIEECILWLSIIFITILCTPQPTVVRWSS
Subjt:  SLFRRYRNFVDRGGGENLKDFISAGVNAYALGCTDEGLRKELIDMKESGLEIEVMQSYGGSTGLKSKIISGEIEECILWLSIIFITILCTPQPTVVRWSS

Query:  TPPVSDEVRLQWKGFCALIANAYYVRGMAWLPVNTLQLEQMVVVGRAEEPSVVASRMRLVFSTLEVVSPQWPKV
        TPPVSDE+RLQWKGFCALIANAYYVRGMAWLPV TLQLEQM VVGRAEEPSVVASRMR+VFSTLEVVSPQWPKV
Subjt:  TPPVSDEVRLQWKGFCALIANAYYVRGMAWLPVNTLQLEQMVVVGRAEEPSVVASRMRLVFSTLEVVSPQWPKV

XP_008440016.1 PREDICTED: uncharacterized protein LOC103484620 isoform X1 [Cucumis melo]6.4e-14188.81Show/hide
Query:  ILLLSWKLYDGSVMLTVYAPGANKKSLACLDSIPLPEASHANSLTCSSSVSLCPLRSSCNSILHNPSSFKRCSQLSTHYVLNRFAFRRWLCRSDSSPSPS
        I + + K YDGSVMLTVY PGAN+K LACLD IPL EAS+ANSL  S+SVS+CPL+SSCN+I+HNPSSFKRC+QL+TH VLNR +FRR LCRSDSSP PS
Subjt:  ILLLSWKLYDGSVMLTVYAPGANKKSLACLDSIPLPEASHANSLTCSSSVSLCPLRSSCNSILHNPSSFKRCSQLSTHYVLNRFAFRRWLCRSDSSPSPS

Query:  DDEYRSSRNIAISLFRRYRNFVDRGGGENLKDFISAGVNAYALGCTDEGLRKELIDMKESGLEIEVMQSYGGSTGLKSKIISGEIEECILWLSIIFITIL
        DDEYRSSRNIAISLFRRYRNFVDRGGG+NLKDFISAGVNAYALGCTDEGLRKELIDMKE+GLEIEVMQSYGGSTGLKSKIISGEIEECILWLSIIFITIL
Subjt:  DDEYRSSRNIAISLFRRYRNFVDRGGGENLKDFISAGVNAYALGCTDEGLRKELIDMKESGLEIEVMQSYGGSTGLKSKIISGEIEECILWLSIIFITIL

Query:  CTPQPTVVRWSSTPPVSDEVRLQWKGFCALIANAYYVRGMAWLPVNTLQLEQMVVVGRAEEPSVVASRMRLVFSTLEVVSPQWPKV
        CTPQPTVVRWSSTPPVSDE+RLQWKGFCALIANAYYVRGMAWLPV TLQLEQM VVGRAEEPSVVASRMR+VFSTLEVVSPQWPKV
Subjt:  CTPQPTVVRWSSTPPVSDEVRLQWKGFCALIANAYYVRGMAWLPVNTLQLEQMVVVGRAEEPSVVASRMRLVFSTLEVVSPQWPKV

XP_008440017.1 PREDICTED: uncharacterized protein LOC103484620 isoform X2 [Cucumis melo]4.3e-13790.48Show/hide
Query:  MLTVYAPGANKKSLACLDSIPLPEASHANSLTCSSSVSLCPLRSSCNSILHNPSSFKRCSQLSTHYVLNRFAFRRWLCRSDSSPSPSDDEYRSSRNIAIS
        MLTVY PGAN+K LACLD IPL EAS+ANSL  S+SVS+CPL+SSCN+I+HNPSSFKRC+QL+TH VLNR +FRR LCRSDSSP PSDDEYRSSRNIAIS
Subjt:  MLTVYAPGANKKSLACLDSIPLPEASHANSLTCSSSVSLCPLRSSCNSILHNPSSFKRCSQLSTHYVLNRFAFRRWLCRSDSSPSPSDDEYRSSRNIAIS

Query:  LFRRYRNFVDRGGGENLKDFISAGVNAYALGCTDEGLRKELIDMKESGLEIEVMQSYGGSTGLKSKIISGEIEECILWLSIIFITILCTPQPTVVRWSST
        LFRRYRNFVDRGGG+NLKDFISAGVNAYALGCTDEGLRKELIDMKE+GLEIEVMQSYGGSTGLKSKIISGEIEECILWLSIIFITILCTPQPTVVRWSST
Subjt:  LFRRYRNFVDRGGGENLKDFISAGVNAYALGCTDEGLRKELIDMKESGLEIEVMQSYGGSTGLKSKIISGEIEECILWLSIIFITILCTPQPTVVRWSST

Query:  PPVSDEVRLQWKGFCALIANAYYVRGMAWLPVNTLQLEQMVVVGRAEEPSVVASRMRLVFSTLEVVSPQWPKV
        PPVSDE+RLQWKGFCALIANAYYVRGMAWLPV TLQLEQM VVGRAEEPSVVASRMR+VFSTLEVVSPQWPKV
Subjt:  PPVSDEVRLQWKGFCALIANAYYVRGMAWLPVNTLQLEQMVVVGRAEEPSVVASRMRLVFSTLEVVSPQWPKV

XP_038883021.1 uncharacterized protein LOC120074097 isoform X1 [Benincasa hispida]6.6e-14691.84Show/hide
Query:  LSWKLYDGSVMLTVYAPGANKKSLACLDSIPLPEASHANSLTCSSSVSLCPLRSSCNSILHNPSSFKRCSQLSTHYVLNRFAFRRWLCRSDSSPSPSDDE
        L WK YDGSVMLTV++PGANKK LACLDSIPLPEASHANSL CS+SVSLCPL+SSCN+ILHNPSSFKRC+QL+TH VLNRFAFRRWLCRSDSS  PSDDE
Subjt:  LSWKLYDGSVMLTVYAPGANKKSLACLDSIPLPEASHANSLTCSSSVSLCPLRSSCNSILHNPSSFKRCSQLSTHYVLNRFAFRRWLCRSDSSPSPSDDE

Query:  YRSSRNIAISLFRRYRNFVDRGGGENLKDFISAGVNAYALGCTDEGLRKELIDMKESGLEIEVMQSYGGSTGLKSKIISGEIEECILWLSIIFITILCTP
        +RSSRNIAISLFRRYRNFVDRGGG+NLKDFISAGVNAYALGCTDEGLRKELIDMKESGLEIEVMQS+GGSTGLKSKIIS EIEECILWLSIIFITILCTP
Subjt:  YRSSRNIAISLFRRYRNFVDRGGGENLKDFISAGVNAYALGCTDEGLRKELIDMKESGLEIEVMQSYGGSTGLKSKIISGEIEECILWLSIIFITILCTP

Query:  QPTVVRWSSTPPVSDEVRLQWKGFCALIANAYYVRGMAWLPVNTLQLEQMVVVGRAEEPSVVASRMRLVFSTLEVVSPQWPK
        QPTVVRWSSTPPVSDEVRLQWKGFCALIANAYYVRGMAWLPV TLQLEQ+ VVGRAEEPS+VASRMR+VFSTLEVVSPQWPK
Subjt:  QPTVVRWSSTPPVSDEVRLQWKGFCALIANAYYVRGMAWLPVNTLQLEQMVVVGRAEEPSVVASRMRLVFSTLEVVSPQWPK

XP_038883022.1 uncharacterized protein LOC120074097 isoform X2 [Benincasa hispida]1.1e-14092.28Show/hide
Query:  MLTVYAPGANKKSLACLDSIPLPEASHANSLTCSSSVSLCPLRSSCNSILHNPSSFKRCSQLSTHYVLNRFAFRRWLCRSDSSPSPSDDEYRSSRNIAIS
        MLTV++PGANKK LACLDSIPLPEASHANSL CS+SVSLCPL+SSCN+ILHNPSSFKRC+QL+TH VLNRFAFRRWLCRSDSS  PSDDE+RSSRNIAIS
Subjt:  MLTVYAPGANKKSLACLDSIPLPEASHANSLTCSSSVSLCPLRSSCNSILHNPSSFKRCSQLSTHYVLNRFAFRRWLCRSDSSPSPSDDEYRSSRNIAIS

Query:  LFRRYRNFVDRGGGENLKDFISAGVNAYALGCTDEGLRKELIDMKESGLEIEVMQSYGGSTGLKSKIISGEIEECILWLSIIFITILCTPQPTVVRWSST
        LFRRYRNFVDRGGG+NLKDFISAGVNAYALGCTDEGLRKELIDMKESGLEIEVMQS+GGSTGLKSKIIS EIEECILWLSIIFITILCTPQPTVVRWSST
Subjt:  LFRRYRNFVDRGGGENLKDFISAGVNAYALGCTDEGLRKELIDMKESGLEIEVMQSYGGSTGLKSKIISGEIEECILWLSIIFITILCTPQPTVVRWSST

Query:  PPVSDEVRLQWKGFCALIANAYYVRGMAWLPVNTLQLEQMVVVGRAEEPSVVASRMRLVFSTLEVVSPQWPK
        PPVSDEVRLQWKGFCALIANAYYVRGMAWLPV TLQLEQ+ VVGRAEEPS+VASRMR+VFSTLEVVSPQWPK
Subjt:  PPVSDEVRLQWKGFCALIANAYYVRGMAWLPVNTLQLEQMVVVGRAEEPSVVASRMRLVFSTLEVVSPQWPK

TrEMBL top hitse value%identityAlignment
A0A0A0KHT1 Uncharacterized protein8.7e-13690.88Show/hide
Query:  MLTVYAPGANKKSLACLDSIPLPEASHANSLTCSSSVSLCPLR-SSCNSILHNPSSFKRCSQLSTHYVLNRFAFRRWLCRSDSSPSPSDDEYRSSRNIAI
        MLTVY+PGANKK LACLDSIPL EASHANSL  S+SVSLCPL+ SS N+ILHNPSSFKRC+QL+T  VLNR +FRR LCRSDSSP PSDDEYRSSRNIAI
Subjt:  MLTVYAPGANKKSLACLDSIPLPEASHANSLTCSSSVSLCPLR-SSCNSILHNPSSFKRCSQLSTHYVLNRFAFRRWLCRSDSSPSPSDDEYRSSRNIAI

Query:  SLFRRYRNFVDRGGGENLKDFISAGVNAYALGCTDEGLRKELIDMKESGLEIEVMQSYGGSTGLKSKIISGEIEECILWLSIIFITILCTPQPTVVRWSS
        SLFRRYRNFVDRGGG+NLKDFISAGVNAYALGCTDEGLRKELIDMKE+G EIEVMQSYGGSTGLKSKIISGEIEECILWLSIIFITILCTPQPTVVRWSS
Subjt:  SLFRRYRNFVDRGGGENLKDFISAGVNAYALGCTDEGLRKELIDMKESGLEIEVMQSYGGSTGLKSKIISGEIEECILWLSIIFITILCTPQPTVVRWSS

Query:  TPPVSDEVRLQWKGFCALIANAYYVRGMAWLPVNTLQLEQMVVVGRAEEPSVVASRMRLVFSTLEVVSPQWPKV
        TPPVSDE+RLQWKGFCALIANAYYVRGMAWLPV TLQLEQM VVGRAEEPSVVASRMR+VFSTLEVVSPQWPKV
Subjt:  TPPVSDEVRLQWKGFCALIANAYYVRGMAWLPVNTLQLEQMVVVGRAEEPSVVASRMRLVFSTLEVVSPQWPKV

A0A1S3B037 uncharacterized protein LOC103484620 isoform X22.1e-13790.48Show/hide
Query:  MLTVYAPGANKKSLACLDSIPLPEASHANSLTCSSSVSLCPLRSSCNSILHNPSSFKRCSQLSTHYVLNRFAFRRWLCRSDSSPSPSDDEYRSSRNIAIS
        MLTVY PGAN+K LACLD IPL EAS+ANSL  S+SVS+CPL+SSCN+I+HNPSSFKRC+QL+TH VLNR +FRR LCRSDSSP PSDDEYRSSRNIAIS
Subjt:  MLTVYAPGANKKSLACLDSIPLPEASHANSLTCSSSVSLCPLRSSCNSILHNPSSFKRCSQLSTHYVLNRFAFRRWLCRSDSSPSPSDDEYRSSRNIAIS

Query:  LFRRYRNFVDRGGGENLKDFISAGVNAYALGCTDEGLRKELIDMKESGLEIEVMQSYGGSTGLKSKIISGEIEECILWLSIIFITILCTPQPTVVRWSST
        LFRRYRNFVDRGGG+NLKDFISAGVNAYALGCTDEGLRKELIDMKE+GLEIEVMQSYGGSTGLKSKIISGEIEECILWLSIIFITILCTPQPTVVRWSST
Subjt:  LFRRYRNFVDRGGGENLKDFISAGVNAYALGCTDEGLRKELIDMKESGLEIEVMQSYGGSTGLKSKIISGEIEECILWLSIIFITILCTPQPTVVRWSST

Query:  PPVSDEVRLQWKGFCALIANAYYVRGMAWLPVNTLQLEQMVVVGRAEEPSVVASRMRLVFSTLEVVSPQWPKV
        PPVSDE+RLQWKGFCALIANAYYVRGMAWLPV TLQLEQM VVGRAEEPSVVASRMR+VFSTLEVVSPQWPKV
Subjt:  PPVSDEVRLQWKGFCALIANAYYVRGMAWLPVNTLQLEQMVVVGRAEEPSVVASRMRLVFSTLEVVSPQWPKV

A0A1S3B053 uncharacterized protein LOC103484620 isoform X13.1e-14188.81Show/hide
Query:  ILLLSWKLYDGSVMLTVYAPGANKKSLACLDSIPLPEASHANSLTCSSSVSLCPLRSSCNSILHNPSSFKRCSQLSTHYVLNRFAFRRWLCRSDSSPSPS
        I + + K YDGSVMLTVY PGAN+K LACLD IPL EAS+ANSL  S+SVS+CPL+SSCN+I+HNPSSFKRC+QL+TH VLNR +FRR LCRSDSSP PS
Subjt:  ILLLSWKLYDGSVMLTVYAPGANKKSLACLDSIPLPEASHANSLTCSSSVSLCPLRSSCNSILHNPSSFKRCSQLSTHYVLNRFAFRRWLCRSDSSPSPS

Query:  DDEYRSSRNIAISLFRRYRNFVDRGGGENLKDFISAGVNAYALGCTDEGLRKELIDMKESGLEIEVMQSYGGSTGLKSKIISGEIEECILWLSIIFITIL
        DDEYRSSRNIAISLFRRYRNFVDRGGG+NLKDFISAGVNAYALGCTDEGLRKELIDMKE+GLEIEVMQSYGGSTGLKSKIISGEIEECILWLSIIFITIL
Subjt:  DDEYRSSRNIAISLFRRYRNFVDRGGGENLKDFISAGVNAYALGCTDEGLRKELIDMKESGLEIEVMQSYGGSTGLKSKIISGEIEECILWLSIIFITIL

Query:  CTPQPTVVRWSSTPPVSDEVRLQWKGFCALIANAYYVRGMAWLPVNTLQLEQMVVVGRAEEPSVVASRMRLVFSTLEVVSPQWPKV
        CTPQPTVVRWSSTPPVSDE+RLQWKGFCALIANAYYVRGMAWLPV TLQLEQM VVGRAEEPSVVASRMR+VFSTLEVVSPQWPKV
Subjt:  CTPQPTVVRWSSTPPVSDEVRLQWKGFCALIANAYYVRGMAWLPVNTLQLEQMVVVGRAEEPSVVASRMRLVFSTLEVVSPQWPKV

A0A6J1CK48 uncharacterized protein LOC111012348 isoform X18.7e-13689.96Show/hide
Query:  MLTVYAPGANKKSLACLDSIPLPEAS-----HANSLTCSSSVSLCPLRSSCNS-ILHNPSSFKRCSQLSTHYVLNRFAFRRWLCRSDSSPSPSDDEYRSS
        MLTVY+PGANK  LAC DSIP PE S     HANSLTCS+SVS C LRSSCNS +LHNPSSFKRCSQL+ HYVLNRFAFRRWLC SDSS  PSDDEYRSS
Subjt:  MLTVYAPGANKKSLACLDSIPLPEAS-----HANSLTCSSSVSLCPLRSSCNS-ILHNPSSFKRCSQLSTHYVLNRFAFRRWLCRSDSSPSPSDDEYRSS

Query:  RNIAISLFRRYRNFVDRGGGENLKDFISAGVNAYALGCTDEGLRKELIDMKESGLEIEVMQSYGGSTGLKSKIISGEIEECILWLSIIFITILCTPQPTV
        RNIAISLFRRYRN+VDRGGG+NLKDFISAGVNAYALGCTDEGLRKELIDMKESGLEIEVMQSYGGSTGLKSKIIS EIEECILWLSIIF+TILCTPQPTV
Subjt:  RNIAISLFRRYRNFVDRGGGENLKDFISAGVNAYALGCTDEGLRKELIDMKESGLEIEVMQSYGGSTGLKSKIISGEIEECILWLSIIFITILCTPQPTV

Query:  VRWSSTPPVSDEVRLQWKGFCALIANAYYVRGMAWLPVNTLQLEQMVVVGRAEEPSVVASRMRLVFSTLEVVSPQWPKV
        VRWSSTP VSDEVRLQWKGFCALIANAYYVRGMAWLPV TLQLEQM VVGRAEEPSVVASRMRLVFSTLEVVSPQWPKV
Subjt:  VRWSSTPPVSDEVRLQWKGFCALIANAYYVRGMAWLPVNTLQLEQMVVVGRAEEPSVVASRMRLVFSTLEVVSPQWPKV

A0A6J1KLB5 uncharacterized protein LOC1114967724.1e-13390.11Show/hide
Query:  MLTVYAPGANKKSLACLDSIPLPEASHANSLTCSSSVSLCPLRSSCNSILHNPSSFKRCSQLSTHYVLNRFAFRRWLCRSDSSPSPSDDEYRSSRNIAIS
        MLT+YAPGANKK LACLDSIPL E SHANSL+ SSSVSLC LRSSC+S LHNPSSFKRC QL T  VL+R A RRWLC SDSSPSPSDDEYRSSRNIAIS
Subjt:  MLTVYAPGANKKSLACLDSIPLPEASHANSLTCSSSVSLCPLRSSCNSILHNPSSFKRCSQLSTHYVLNRFAFRRWLCRSDSSPSPSDDEYRSSRNIAIS

Query:  LFRRYRNFVDRGGGENLKDFISAGVNAYALGCTDEGLRKELIDMKESGLEIEVMQSYGGSTGLKSKIISGEIEECILWLSIIFITILCTPQPTVVRWSST
        LFRRYRNFVDRGG +NLKDFISAGVNAYALGCTDEGLRKELIDMK+SGL+IEVMQS GG TGLKSKIIS EIEECILWLSI+FITILCTPQPTVVRWSST
Subjt:  LFRRYRNFVDRGGGENLKDFISAGVNAYALGCTDEGLRKELIDMKESGLEIEVMQSYGGSTGLKSKIISGEIEECILWLSIIFITILCTPQPTVVRWSST

Query:  PPVSDEVRLQWKGFCALIANAYYVRGMAWLPVNTLQLEQMVVVGRAEEPSVVASRMRLVFSTLEVVSPQWPKV
        PPVSDEVRLQWKGFCALIANAYYVRGMAWLPVNTLQLEQM VVGRAEEPSVVASRM LVFSTLEVVSPQWPKV
Subjt:  PPVSDEVRLQWKGFCALIANAYYVRGMAWLPVNTLQLEQMVVVGRAEEPSVVASRMRLVFSTLEVVSPQWPKV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G62250.1 unknown protein1.1e-8570.98Show/hide
Query:  PSSFKRC--SQLSTHYVLNRFAFRRWL-CRSDSSPSPSDDEYRSSRNIAISLFRRYRNFVDRGGGENLKDFISAGVNAYALGCTDEGLRKELIDMKESGL
        P S   C  S  ST Y +     RRW+ C S ++   SDDEYRSS NIAISL RRYR  + RG GE LK+FISAGVNAYALGCTDE LRKEL+ MK+SGL
Subjt:  PSSFKRC--SQLSTHYVLNRFAFRRWL-CRSDSSPSPSDDEYRSSRNIAISLFRRYRNFVDRGGGENLKDFISAGVNAYALGCTDEGLRKELIDMKESGL

Query:  EIEVMQSYGGSTGLKSKIISGEIEECILWLSIIFITILCTPQPTVVRWSSTPPVSDEVRLQWKGFCALIANAYYVRGMAWLPVNTLQLEQMVVVGRAEEP
        EIE M++YGGST  KSKI   E++ECILWL I+FITILCTPQPTV+RWSSTP VSDE+  +W+GFCA+IANAYY+RGMAWLPV TLQLEQM V G++EEP
Subjt:  EIEVMQSYGGSTGLKSKIISGEIEECILWLSIIFITILCTPQPTVVRWSSTPPVSDEVRLQWKGFCALIANAYYVRGMAWLPVNTLQLEQMVVVGRAEEP

Query:  SVVASRMRLVFSTLEVVSPQWPKV
        SVVASRMRLVFSTLEVVSPQWP+V
Subjt:  SVVASRMRLVFSTLEVVSPQWPKV

AT1G62250.2 unknown protein1.1e-6367.22Show/hide
Query:  PSSFKRC--SQLSTHYVLNRFAFRRWL-CRSDSSPSPSDDEYRSSRNIAISLFRRYRNFVDRGGGENLKDFISAGVNAYALGCTDEGLRKELIDMKESGL
        P S   C  S  ST Y +     RRW+ C S ++   SDDEYRSS NIAISL RRYR  + RG GE LK+FISAGVNAYALGCTDE LRKEL+ MK+SGL
Subjt:  PSSFKRC--SQLSTHYVLNRFAFRRWL-CRSDSSPSPSDDEYRSSRNIAISLFRRYRNFVDRGGGENLKDFISAGVNAYALGCTDEGLRKELIDMKESGL

Query:  EIEVMQSYGGSTGLKSKIISGEIEECILWLSIIFITILCTPQPTVVRWSSTPPVSDEVRLQWKGFCALIANAYYVRGMAW
        EIE M++YGGST  KSKI   E++ECILWL I+FITILCTPQPTV+RWSSTP VSDE+  +W+GFCA+IANAYY+RGMAW
Subjt:  EIEVMQSYGGSTGLKSKIISGEIEECILWLSIIFITILCTPQPTVVRWSSTPPVSDEVRLQWKGFCALIANAYYVRGMAW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGCTAGAAATGAAGCTTGAAAAGAGAGAAAAGTTAGAAAGGATGAGCTTCGTTATTTTATTGCTTTCTTGGAAATTGTATGATGGTTCAGTCATGTTGACGGTATA
TGCTCCGGGGGCTAACAAGAAATCTCTGGCATGTCTAGACTCTATTCCACTGCCTGAGGCTTCTCATGCTAATTCCCTTACTTGTAGTTCCTCAGTTTCTTTGTGCCCTC
TAAGGAGTTCTTGCAACAGCATTCTACACAATCCCTCATCTTTTAAAAGGTGTAGTCAATTGAGTACTCATTATGTTTTGAACCGATTCGCTTTCAGAAGATGGCTTTGC
CGCTCGGACAGTTCCCCATCTCCTTCAGACGATGAATATCGCTCTTCACGCAACATAGCTATCAGTTTGTTCAGGCGATATCGGAATTTTGTTGATCGAGGAGGAGGCGA
AAACCTAAAGGATTTCATTAGTGCTGGGGTGAATGCTTATGCACTTGGTTGTACTGATGAAGGGTTAAGAAAGGAACTTATTGATATGAAGGAATCTGGTCTTGAAATTG
AAGTAATGCAGAGTTATGGTGGAAGCACTGGTTTGAAATCCAAAATTATCTCGGGGGAGATTGAAGAGTGCATTCTGTGGTTGAGCATTATATTCATCACCATCTTGTGT
ACACCACAACCAACTGTAGTTAGATGGTCATCAACACCTCCAGTGTCAGATGAAGTAAGGCTTCAGTGGAAAGGCTTTTGTGCTCTCATTGCCAATGCATATTATGTCCG
GGGCATGGCATGGCTGCCTGTAAACACTCTACAACTTGAGCAAATGGTAGTGGTGGGACGAGCAGAGGAACCTTCAGTTGTTGCTAGCCGAATGCGTTTAGTGTTCAGCA
CACTTGAGGTAGTTAGTCCACAATGGCCGAAAGTATAG
mRNA sequenceShow/hide mRNA sequence
ATGTCGCTAGAAATGAAGCTTGAAAAGAGAGAAAAGTTAGAAAGGATGAGCTTCGTTATTTTATTGCTTTCTTGGAAATTGTATGATGGTTCAGTCATGTTGACGGTATA
TGCTCCGGGGGCTAACAAGAAATCTCTGGCATGTCTAGACTCTATTCCACTGCCTGAGGCTTCTCATGCTAATTCCCTTACTTGTAGTTCCTCAGTTTCTTTGTGCCCTC
TAAGGAGTTCTTGCAACAGCATTCTACACAATCCCTCATCTTTTAAAAGGTGTAGTCAATTGAGTACTCATTATGTTTTGAACCGATTCGCTTTCAGAAGATGGCTTTGC
CGCTCGGACAGTTCCCCATCTCCTTCAGACGATGAATATCGCTCTTCACGCAACATAGCTATCAGTTTGTTCAGGCGATATCGGAATTTTGTTGATCGAGGAGGAGGCGA
AAACCTAAAGGATTTCATTAGTGCTGGGGTGAATGCTTATGCACTTGGTTGTACTGATGAAGGGTTAAGAAAGGAACTTATTGATATGAAGGAATCTGGTCTTGAAATTG
AAGTAATGCAGAGTTATGGTGGAAGCACTGGTTTGAAATCCAAAATTATCTCGGGGGAGATTGAAGAGTGCATTCTGTGGTTGAGCATTATATTCATCACCATCTTGTGT
ACACCACAACCAACTGTAGTTAGATGGTCATCAACACCTCCAGTGTCAGATGAAGTAAGGCTTCAGTGGAAAGGCTTTTGTGCTCTCATTGCCAATGCATATTATGTCCG
GGGCATGGCATGGCTGCCTGTAAACACTCTACAACTTGAGCAAATGGTAGTGGTGGGACGAGCAGAGGAACCTTCAGTTGTTGCTAGCCGAATGCGTTTAGTGTTCAGCA
CACTTGAGGTAGTTAGTCCACAATGGCCGAAAGTATAG
Protein sequenceShow/hide protein sequence
MSLEMKLEKREKLERMSFVILLLSWKLYDGSVMLTVYAPGANKKSLACLDSIPLPEASHANSLTCSSSVSLCPLRSSCNSILHNPSSFKRCSQLSTHYVLNRFAFRRWLC
RSDSSPSPSDDEYRSSRNIAISLFRRYRNFVDRGGGENLKDFISAGVNAYALGCTDEGLRKELIDMKESGLEIEVMQSYGGSTGLKSKIISGEIEECILWLSIIFITILC
TPQPTVVRWSSTPPVSDEVRLQWKGFCALIANAYYVRGMAWLPVNTLQLEQMVVVGRAEEPSVVASRMRLVFSTLEVVSPQWPKV