; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0017922 (gene) of Snake gourd v1 genome

Gene IDTan0017922
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionBasic secretory protein
Genome locationLG02:957548..959358
RNA-Seq ExpressionTan0017922
SyntenyTan0017922
Gene Ontology termsNA
InterPro domainsIPR007541 - Uncharacterised protein family, basic secretory protein


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6572454.1 hypothetical protein SDJN03_29182, partial [Cucurbita argyrosperma subsp. sororia]4.8e-12279.44Show/hide
Query:  MEDLQSLSLPLLSETSSGTVAAVSAAARPTSADKSPFLSNQAIAVRLLLVAFIGITSLWANHEASKGFDITILNNAKGSPAGQRFDLFYVSNDEATRLIL
        MED QSLSLPLL  T        SAAA PTS ++SPFLSN A+AVRLLLVAFIGITSLWANHEASKGFDITILNNAKGSPAGQRFDLFYVSNDEATRL+L
Subjt:  MEDLQSLSLPLLSETSSGTVAAVSAAARPTSADKSPFLSNQAIAVRLLLVAFIGITSLWANHEASKGFDITILNNAKGSPAGQRFDLFYVSNDEATRLIL

Query:  NASSFIENLIYPSQAFPKKPVKSVHLTLSRRDLSSNVAVEKLDDAGVDFVVHLSPSIFNERNANRAMSAAVFRGMSRVWLWDGEAHAPPSLLAGMVEHIT
        NAS+F+ENLIYPSQ FPKK VKSVHLTLS RDL SNVAVE L D GVDFVV+LSPSIFN+RN N AMSAA+ RGMSRVWLW+GE +APPSLLAGMVEHIT
Subjt:  NASSFIENLIYPSQAFPKKPVKSVHLTLSRRDLSSNVAVEKLDDAGVDFVVHLSPSIFNERNANRAMSAAVFRGMSRVWLWDGEAHAPPSLLAGMVEHIT

Query:  AVAGFAEEKYSGAVVTTLTTCDPTWWKDKNPMEVARFLDHHESEQKGFIQRLNQGLKSRWHDRTVEDALGMPAQRPSGSFNSSGIRV
        A AGF E+KYSG VVTTL  CDP WWKDK+P EVA FLD+ E +QKGFIQRLNQGLKSRWHDRTVEDA+G+P +R    FNSSGI+V
Subjt:  AVAGFAEEKYSGAVVTTLTTCDPTWWKDKNPMEVARFLDHHESEQKGFIQRLNQGLKSRWHDRTVEDALGMPAQRPSGSFNSSGIRV

KAG7012046.1 hypothetical protein SDJN02_26954, partial [Cucurbita argyrosperma subsp. argyrosperma]4.1e-12179.09Show/hide
Query:  MEDLQSLSLPLLSETSSGTVAAVSAAARPTSADKSPFLSNQAIAVRLLLVAFIGITSLWANHEASKGFDITILNNAKGSPAGQRFDLFYVSNDEATRLIL
        MED QSLSLPLL  T        SAAA PTS ++SPFLSN A+AVRLLLVAFIGITSLWANHEASKGFDITILNNAKGSPAGQRFDLFYVSNDEATRL+L
Subjt:  MEDLQSLSLPLLSETSSGTVAAVSAAARPTSADKSPFLSNQAIAVRLLLVAFIGITSLWANHEASKGFDITILNNAKGSPAGQRFDLFYVSNDEATRLIL

Query:  NASSFIENLIYPSQAFPKKPVKSVHLTLSRRDLSSNVAVEKLDDAGVDFVVHLSPSIFNERNANRAMSAAVFRGMSRVWLWDGEAHAPPSLLAGMVEHIT
        NAS+F+ENLIYPSQ FPKK VKSVHLTLS RDL SNVAVE L D GVDFVV+LSPSIFN+RN N AMSAA+ RGMSRVWLW+GE +APPSLLAGMVEHIT
Subjt:  NASSFIENLIYPSQAFPKKPVKSVHLTLSRRDLSSNVAVEKLDDAGVDFVVHLSPSIFNERNANRAMSAAVFRGMSRVWLWDGEAHAPPSLLAGMVEHIT

Query:  AVAGFAEEKYSGAVVTTLTTCDPTWWKDKNPMEVARFLDHHESEQKGFIQRLNQGLKSRWHDRTVEDALGMPAQRPSGSFNSSGIRV
        A AGF E+KYSG VVTTL  CDP WWKDK+P EVA FLD+ E +Q GFIQRLNQGLKSRW DRTVEDA+G+P +R   SFNSSGI+V
Subjt:  AVAGFAEEKYSGAVVTTLTTCDPTWWKDKNPMEVARFLDHHESEQKGFIQRLNQGLKSRWHDRTVEDALGMPAQRPSGSFNSSGIRV

XP_022952992.1 uncharacterized protein LOC111455507 [Cucurbita moschata]5.9e-12078.4Show/hide
Query:  MEDLQSLSLPLLSETSSGTVAAVSAAARPTSADKSPFLSNQAIAVRLLLVAFIGITSLWANHEASKGFDITILNNAKGSPAGQRFDLFYVSNDEATRLIL
        MED QSLSLPLL  T        SAA  PTS ++SP LSN A+AVRLLLVAFIGITSLWANHEASKGFDITILNNAKGSPAGQRFDLFYVSNDEATRL+L
Subjt:  MEDLQSLSLPLLSETSSGTVAAVSAAARPTSADKSPFLSNQAIAVRLLLVAFIGITSLWANHEASKGFDITILNNAKGSPAGQRFDLFYVSNDEATRLIL

Query:  NASSFIENLIYPSQAFPKKPVKSVHLTLSRRDLSSNVAVEKLDDAGVDFVVHLSPSIFNERNANRAMSAAVFRGMSRVWLWDGEAHAPPSLLAGMVEHIT
        NAS+F+ENLIYPSQ FPKK VKSVHLTLS RDL SNVAVE L D GVDFVV+LSPSIFN+RN N AMSAA+ RGMSRVWLW+GE +APPSLLAGMVEHIT
Subjt:  NASSFIENLIYPSQAFPKKPVKSVHLTLSRRDLSSNVAVEKLDDAGVDFVVHLSPSIFNERNANRAMSAAVFRGMSRVWLWDGEAHAPPSLLAGMVEHIT

Query:  AVAGFAEEKYSGAVVTTLTTCDPTWWKDKNPMEVARFLDHHESEQKGFIQRLNQGLKSRWHDRTVEDALGMPAQRPSGSFNSSGIRV
        A AGF E+KYSG VVTTL  CDP WWKDK+P EVA FL++ E +QKGFIQRLNQGLKSRW DRTVEDA+G+P +R   SFNSSGI+V
Subjt:  AVAGFAEEKYSGAVVTTLTTCDPTWWKDKNPMEVARFLDHHESEQKGFIQRLNQGLKSRWHDRTVEDALGMPAQRPSGSFNSSGIRV

XP_022989196.1 uncharacterized protein LOC111486342 [Cucurbita maxima]2.0e-12080.14Show/hide
Query:  MEDLQSLSLPLLSETSSGTVAAVSAAARPTSADKSPFLSNQAIAVRLLLVAFIGITSLWANHEASKGFDITILNNAKGSPAGQRFDLFYVSNDEATRLIL
        MED QSLSLPLLS            AA PTSA++SPFLSNQAIA+RLLLVAF+GITSL ANHEASKGFDITILNNAK SPAGQRF LFYVSNDEATRLIL
Subjt:  MEDLQSLSLPLLSETSSGTVAAVSAAARPTSADKSPFLSNQAIAVRLLLVAFIGITSLWANHEASKGFDITILNNAKGSPAGQRFDLFYVSNDEATRLIL

Query:  NASSFIENLIYPSQAFPKKPVKSVHLTLSRRDLSSNVAVEKLDDAGVDFVVHLSPSIFNERNANRAMSAAVFRGMSRVWLWDGEAHAPPSLLAGMVEHIT
        NASSFIENLIYPSQAFPKKPVKSVHLTLSR DLSS+ AVEKL D G DFV+HLSPSIFNE+NANRAMS AVFRGMSRVWLWDGEAHAPPSLLAGMVEHI 
Subjt:  NASSFIENLIYPSQAFPKKPVKSVHLTLSRRDLSSNVAVEKLDDAGVDFVVHLSPSIFNERNANRAMSAAVFRGMSRVWLWDGEAHAPPSLLAGMVEHIT

Query:  AVAGFAEEKYSGAVVTTLT-TCDPTWWKDKNPMEVARFLDHHESEQKGFIQRLNQGLKSRWHDRTVEDALGMPAQRPSGSFN
        A AG   EKYSG V +TLT  CDPTWWKDK+P E+A FLDH E E++GF+QRLNQGLK+RWHDRTVEDA+G+P Q P GS N
Subjt:  AVAGFAEEKYSGAVVTTLT-TCDPTWWKDKNPMEVARFLDHHESEQKGFIQRLNQGLKSRWHDRTVEDALGMPAQRPSGSFN

XP_023553976.1 uncharacterized protein LOC111811392 [Cucurbita pepo subsp. pepo]2.6e-12079.09Show/hide
Query:  MEDLQSLSLPLLSETSSGTVAAVSAAARPTSADKSPFLSNQAIAVRLLLVAFIGITSLWANHEASKGFDITILNNAKGSPAGQRFDLFYVSNDEATRLIL
        MED QSLSLPLL  T        SAAA PTS ++SPFLSN A+AVRLLLVAFIGITSLWANHEASKGFDITILNNAKGSPAGQRFDLFYVSNDEATRLIL
Subjt:  MEDLQSLSLPLLSETSSGTVAAVSAAARPTSADKSPFLSNQAIAVRLLLVAFIGITSLWANHEASKGFDITILNNAKGSPAGQRFDLFYVSNDEATRLIL

Query:  NASSFIENLIYPSQAFPKKPVKSVHLTLSRRDLSSNVAVEKLDDAGVDFVVHLSPSIFNERNANRAMSAAVFRGMSRVWLWDGEAHAPPSLLAGMVEHIT
        NAS+F+ENLIYPSQ FPKK VKSVHLTLS RDL SNVAVE L D GVDFVV+LSPSIFN+RN N AMSAA+ RG+SRVWLW+GE +APPSLLAGMVEHIT
Subjt:  NASSFIENLIYPSQAFPKKPVKSVHLTLSRRDLSSNVAVEKLDDAGVDFVVHLSPSIFNERNANRAMSAAVFRGMSRVWLWDGEAHAPPSLLAGMVEHIT

Query:  AVAGFAEEKYSGAVVTTLTTCDPTWWKDKNPMEVARFLDHHESEQKGFIQRLNQGLKSRWHDRTVEDALGMPAQRPSGSFNSSGIRV
        A AGF E+KYSG VVTTL  CDP WWKDK+P EVA FL++ E +QKGFIQRLNQ LKSRW DRTVE+A+GMP +R   SFNSSGIRV
Subjt:  AVAGFAEEKYSGAVVTTLTTCDPTWWKDKNPMEVARFLDHHESEQKGFIQRLNQGLKSRWHDRTVEDALGMPAQRPSGSFNSSGIRV

TrEMBL top hitse value%identityAlignment
A0A6J1D3U3 uncharacterized protein LOC1110167763.1e-11172.32Show/hide
Query:  MEDLQSLSLPLLSETSSGTVAAVSAAARPTSADKSP-FLSNQAIAVRLLLVAFIGITSLWANHEASKGFDITILNNAKGSPAGQRFDLFYVSNDEATRLI
        MED +SLSLPLL  T +G        + PT  ++S  F SN  IAVRLLL+AFIG+TSLWANHEASKGF+IT++N AK SPAGQRFDLFYVSNDEATR++
Subjt:  MEDLQSLSLPLLSETSSGTVAAVSAAARPTSADKSP-FLSNQAIAVRLLLVAFIGITSLWANHEASKGFDITILNNAKGSPAGQRFDLFYVSNDEATRLI

Query:  LNASSFIENLIYPSQ-AFPKKPVKSVHLTLSRRDLSSNVAVEKLDDAGVDFVVHLSPSIFNERNANRAMSAAVFRGMSRVWLWDGEAHAPPSLLAGMVEH
        LNAS+F+ENLIYPSQ AFPKK VK V LTL+ RDLS NVAV K DD GVDF + LSPSIF+E N N AMSAAV RGMSRVWLWDG++HAPPSLLAGMVEH
Subjt:  LNASSFIENLIYPSQ-AFPKKPVKSVHLTLSRRDLSSNVAVEKLDDAGVDFVVHLSPSIFNERNANRAMSAAVFRGMSRVWLWDGEAHAPPSLLAGMVEH

Query:  ITAVAGFAEEKYSGAVVTTLTTCDPTWWKDKNPMEVARFLDHHESEQKGFIQRLNQGLKSRWHDRTVEDALGMPAQRPSGSFNSSGIRV
        I A AGF ++KYSG V++T T CDP WWKDKNPMEVA FL +HE+++ GFIQRLNQGLKSRW DRTV+DALGMP QRP GSFN SGI V
Subjt:  ITAVAGFAEEKYSGAVVTTLTTCDPTWWKDKNPMEVARFLDHHESEQKGFIQRLNQGLKSRWHDRTVEDALGMPAQRPSGSFNSSGIRV

A0A6J1EXI2 uncharacterized protein LOC1114370811.5e-11678.01Show/hide
Query:  MEDLQSLSLPLLSETSSGTVAAVSAAARPTSADKSPFLSNQAIAVRLLLVAFIGITSLWANHEASKGFDITILNNAKGSPAGQRFDLFYVSNDEATRLIL
        MED QSLSLPLLS            AA PT A++SPFL NQAIA+RLLLVAF+GITSL ANHEASKGF+ITILNNAK SPAGQRF LFYVSNDEATRLIL
Subjt:  MEDLQSLSLPLLSETSSGTVAAVSAAARPTSADKSPFLSNQAIAVRLLLVAFIGITSLWANHEASKGFDITILNNAKGSPAGQRFDLFYVSNDEATRLIL

Query:  NASSFIENLIYPSQAFPKKPVKSVHLTLSRRDLSSNVAVEKLDDAGVDFVVHLSPSIFNERNANRAMSAAVFRGMSRVWLWDGEAHAPPSLLAGMVEHIT
        NASSFIENLIYPS AFPKKPVKSVHLTLSR DLSS+ AVEKL D G DFV+HLSPSI NE++ANRAMS AVFRGMSRVWLWDGEA APP+LLAGMVEHI 
Subjt:  NASSFIENLIYPSQAFPKKPVKSVHLTLSRRDLSSNVAVEKLDDAGVDFVVHLSPSIFNERNANRAMSAAVFRGMSRVWLWDGEAHAPPSLLAGMVEHIT

Query:  AVAGFAEEKYSGAVVTTLT-TCDPTWWKDKNPMEVARFLDHHESEQKGFIQRLNQGLKSRWHDRTVEDALGMPAQRPSGSFN
        A AGF  EKYSG VV+ LT  CDPTWWKDK+P E+A FLDH E E++GFIQRLNQGLK RWHDRTVEDA+G+P Q P GS N
Subjt:  AVAGFAEEKYSGAVVTTLT-TCDPTWWKDKNPMEVARFLDHHESEQKGFIQRLNQGLKSRWHDRTVEDALGMPAQRPSGSFN

A0A6J1GLY7 uncharacterized protein LOC1114555072.8e-12078.4Show/hide
Query:  MEDLQSLSLPLLSETSSGTVAAVSAAARPTSADKSPFLSNQAIAVRLLLVAFIGITSLWANHEASKGFDITILNNAKGSPAGQRFDLFYVSNDEATRLIL
        MED QSLSLPLL  T        SAA  PTS ++SP LSN A+AVRLLLVAFIGITSLWANHEASKGFDITILNNAKGSPAGQRFDLFYVSNDEATRL+L
Subjt:  MEDLQSLSLPLLSETSSGTVAAVSAAARPTSADKSPFLSNQAIAVRLLLVAFIGITSLWANHEASKGFDITILNNAKGSPAGQRFDLFYVSNDEATRLIL

Query:  NASSFIENLIYPSQAFPKKPVKSVHLTLSRRDLSSNVAVEKLDDAGVDFVVHLSPSIFNERNANRAMSAAVFRGMSRVWLWDGEAHAPPSLLAGMVEHIT
        NAS+F+ENLIYPSQ FPKK VKSVHLTLS RDL SNVAVE L D GVDFVV+LSPSIFN+RN N AMSAA+ RGMSRVWLW+GE +APPSLLAGMVEHIT
Subjt:  NASSFIENLIYPSQAFPKKPVKSVHLTLSRRDLSSNVAVEKLDDAGVDFVVHLSPSIFNERNANRAMSAAVFRGMSRVWLWDGEAHAPPSLLAGMVEHIT

Query:  AVAGFAEEKYSGAVVTTLTTCDPTWWKDKNPMEVARFLDHHESEQKGFIQRLNQGLKSRWHDRTVEDALGMPAQRPSGSFNSSGIRV
        A AGF E+KYSG VVTTL  CDP WWKDK+P EVA FL++ E +QKGFIQRLNQGLKSRW DRTVEDA+G+P +R   SFNSSGI+V
Subjt:  AVAGFAEEKYSGAVVTTLTTCDPTWWKDKNPMEVARFLDHHESEQKGFIQRLNQGLKSRWHDRTVEDALGMPAQRPSGSFNSSGIRV

A0A6J1I2J5 uncharacterized protein LOC1114684368.5e-11776.66Show/hide
Query:  MEDLQSLSLPLLSETSSGTVAAVSAAARPTSADKSPFLSNQAIAVRLLLVAFIGITSLWANHEASKGFDITILNNAKGSPAGQRFDLFYVSNDEATRLIL
        MED QSLSLPLL  T        SAAA PTS ++SPFLSN A+AVRLLLVAFIGITSLWANHEASKGF +TILNNAKGSPAGQRFDLFYVSNDEATRL+L
Subjt:  MEDLQSLSLPLLSETSSGTVAAVSAAARPTSADKSPFLSNQAIAVRLLLVAFIGITSLWANHEASKGFDITILNNAKGSPAGQRFDLFYVSNDEATRLIL

Query:  NASSFIENLIYPSQAFPKKPVKSVHLTLSRRDLSSNVAVEKLDDAGVDFVVHLSPSIFNERNANRAMSAAVFRGMSRVWLWDGEAHAPPSLLAGMVEHIT
        NAS+F+ENLIYPSQ FPKK VKSVHLTLS RDL SNVAVE L D GVDFVV+LSPSIFN+RN N AMSAA+ RGMS VWLW+GE HAPPSLLAGMVEHIT
Subjt:  NASSFIENLIYPSQAFPKKPVKSVHLTLSRRDLSSNVAVEKLDDAGVDFVVHLSPSIFNERNANRAMSAAVFRGMSRVWLWDGEAHAPPSLLAGMVEHIT

Query:  AVAGFAEEKYSGAVVTTLTTCDPTWWKDKNPMEVARFLDHHESEQKGFIQRLNQGLKSRWHDRTVEDALGMPAQRPSGSFNSSGIRV
        A AGF E+K  G VV+T+  CDP WWKDK P EVA FL + E +QKGFIQRLNQGL+SRW DRTVEDA+GM  +R   SFNSSGI+V
Subjt:  AVAGFAEEKYSGAVVTTLTTCDPTWWKDKNPMEVARFLDHHESEQKGFIQRLNQGLKSRWHDRTVEDALGMPAQRPSGSFNSSGIRV

A0A6J1JJD5 uncharacterized protein LOC1114863429.7e-12180.14Show/hide
Query:  MEDLQSLSLPLLSETSSGTVAAVSAAARPTSADKSPFLSNQAIAVRLLLVAFIGITSLWANHEASKGFDITILNNAKGSPAGQRFDLFYVSNDEATRLIL
        MED QSLSLPLLS            AA PTSA++SPFLSNQAIA+RLLLVAF+GITSL ANHEASKGFDITILNNAK SPAGQRF LFYVSNDEATRLIL
Subjt:  MEDLQSLSLPLLSETSSGTVAAVSAAARPTSADKSPFLSNQAIAVRLLLVAFIGITSLWANHEASKGFDITILNNAKGSPAGQRFDLFYVSNDEATRLIL

Query:  NASSFIENLIYPSQAFPKKPVKSVHLTLSRRDLSSNVAVEKLDDAGVDFVVHLSPSIFNERNANRAMSAAVFRGMSRVWLWDGEAHAPPSLLAGMVEHIT
        NASSFIENLIYPSQAFPKKPVKSVHLTLSR DLSS+ AVEKL D G DFV+HLSPSIFNE+NANRAMS AVFRGMSRVWLWDGEAHAPPSLLAGMVEHI 
Subjt:  NASSFIENLIYPSQAFPKKPVKSVHLTLSRRDLSSNVAVEKLDDAGVDFVVHLSPSIFNERNANRAMSAAVFRGMSRVWLWDGEAHAPPSLLAGMVEHIT

Query:  AVAGFAEEKYSGAVVTTLT-TCDPTWWKDKNPMEVARFLDHHESEQKGFIQRLNQGLKSRWHDRTVEDALGMPAQRPSGSFN
        A AG   EKYSG V +TLT  CDPTWWKDK+P E+A FLDH E E++GF+QRLNQGLK+RWHDRTVEDA+G+P Q P GS N
Subjt:  AVAGFAEEKYSGAVVTTLT-TCDPTWWKDKNPMEVARFLDHHESEQKGFIQRLNQGLKSRWHDRTVEDALGMPAQRPSGSFN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G42900.1 Plant basic secretory protein (BSP) family protein1.1e-3939.57Show/hide
Query:  SNQAIAVRLLLVAFIGITSLWANHEASKGFDITILNNAKGSPAGQRFDLFYVSNDEATRLILNASSFIENLIYPSQAFPKKPVKSV-HLTLSRRDLSSNV
        S+  I +RL  +  +G  SLWANHEASKGF I+I+N+AK SP+G+RF LF+ S+D A R++L+AS F+E  +Y  +  P +  K V H+T+     SS+ 
Subjt:  SNQAIAVRLLLVAFIGITSLWANHEASKGFDITILNNAKGSPAGQRFDLFYVSNDEATRLILNASSFIENLIYPSQAFPKKPVKSV-HLTLSRRDLSSNV

Query:  AVEKLDDAGV---DFVVHLSPSIFNERNANRAMSAAVFRGMSRVWLWDGEAHAPPSLLAGMVEHITAVAGFAE--EKYSGAVVTTLTTCDPTWWKDK-NP
               +G    ++V+ LSPS+   +  + A+ +A+ R M R+WLW  E+ A P L+AGMVE++   +      EK+ G             WKDK   
Subjt:  AVEKLDDAGV---DFVVHLSPSIFNERNANRAMSAAVFRGMSRVWLWDGEAHAPPSLLAGMVEHITAVAGFAE--EKYSGAVVTTLTTCDPTWWKDK-NP

Query:  MEVARFLDHHESEQKGFIQRLNQGLKSRWHDRTVE
        + V   LD+ E   +GFI+RLN G++ RW DRTV+
Subjt:  MEVARFLDHHESEQKGFIQRLNQGLKSRWHDRTVE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGACCTCCAATCCCTCTCTCTCCCTCTCCTTTCCGAAACCAGCTCCGGCACCGTCGCCGCCGTCTCCGCCGCTGCCCGTCCCACTTCTGCAGACAAATCCCCCTT
CCTCTCAAATCAGGCCATCGCCGTCCGCCTCCTTCTGGTGGCCTTCATCGGAATCACTTCACTGTGGGCCAATCACGAAGCCTCAAAAGGCTTCGATATCACCATTCTTA
ACAACGCCAAAGGCTCTCCCGCCGGTCAACGCTTCGATCTCTTCTACGTCTCTAATGACGAAGCCACGCGCCTCATCCTCAACGCGAGTAGCTTCATCGAGAATCTGATC
TACCCTTCTCAAGCCTTTCCAAAGAAACCAGTCAAGAGCGTGCATCTCACGCTCTCCCGCCGCGATCTCTCCTCTAATGTCGCCGTCGAGAAGCTCGACGACGCCGGAGT
TGACTTCGTCGTTCATTTGAGCCCTTCGATTTTCAATGAGAGAAACGCGAATCGCGCCATGTCAGCGGCGGTTTTTAGAGGGATGTCCCGCGTGTGGCTGTGGGATGGAG
AGGCCCACGCGCCGCCCTCACTCCTAGCCGGAATGGTCGAGCATATAACCGCGGTGGCCGGATTTGCCGAGGAGAAATATTCCGGTGCGGTCGTTACTACGTTGACGACA
TGTGATCCCACGTGGTGGAAGGATAAGAATCCCATGGAAGTCGCCCGGTTTCTCGACCATCACGAAAGCGAACAGAAGGGTTTCATCCAACGGTTAAATCAAGGGTTGAA
GTCCAGGTGGCATGATCGGACGGTGGAGGATGCGCTTGGTATGCCAGCTCAGCGTCCATCTGGTTCGTTCAATTCTTCCGGAATTAGGGTTTAG
mRNA sequenceShow/hide mRNA sequence
CTAAATCATAATTATACAGTTACGTATTTGGGTTTTTTTTTTCAAAAAATAATAATGGAATATGTGAGTATAAAATGATAAATATGAATAAAACCAAAAGAAAAAGAAAA
GGTGAAAAGAAATGCAACCAAATTATACATGAATTTAAGACACAAAGTTAAGACCCGCGAATCCGTCCATTTTTTATCAATCCTATAATTATCTTAAATGTTCAATCCCC
TTCAACCATGGAAGACCTCCAATCCCTCTCTCTCCCTCTCCTTTCCGAAACCAGCTCCGGCACCGTCGCCGCCGTCTCCGCCGCTGCCCGTCCCACTTCTGCAGACAAAT
CCCCCTTCCTCTCAAATCAGGCCATCGCCGTCCGCCTCCTTCTGGTGGCCTTCATCGGAATCACTTCACTGTGGGCCAATCACGAAGCCTCAAAAGGCTTCGATATCACC
ATTCTTAACAACGCCAAAGGCTCTCCCGCCGGTCAACGCTTCGATCTCTTCTACGTCTCTAATGACGAAGCCACGCGCCTCATCCTCAACGCGAGTAGCTTCATCGAGAA
TCTGATCTACCCTTCTCAAGCCTTTCCAAAGAAACCAGTCAAGAGCGTGCATCTCACGCTCTCCCGCCGCGATCTCTCCTCTAATGTCGCCGTCGAGAAGCTCGACGACG
CCGGAGTTGACTTCGTCGTTCATTTGAGCCCTTCGATTTTCAATGAGAGAAACGCGAATCGCGCCATGTCAGCGGCGGTTTTTAGAGGGATGTCCCGCGTGTGGCTGTGG
GATGGAGAGGCCCACGCGCCGCCCTCACTCCTAGCCGGAATGGTCGAGCATATAACCGCGGTGGCCGGATTTGCCGAGGAGAAATATTCCGGTGCGGTCGTTACTACGTT
GACGACATGTGATCCCACGTGGTGGAAGGATAAGAATCCCATGGAAGTCGCCCGGTTTCTCGACCATCACGAAAGCGAACAGAAGGGTTTCATCCAACGGTTAAATCAAG
GGTTGAAGTCCAGGTGGCATGATCGGACGGTGGAGGATGCGCTTGGTATGCCAGCTCAGCGTCCATCTGGTTCGTTCAATTCTTCCGGAATTAGGGTTTAGGCCAATTGG
GCCCGTCCACATGGGTCAATCATCCCAATCCAGCGGGGCGGTCCCCACCACTGGGACCATTCTTGAATTGCCACGTCGGATCGGTTTGTCTTGGAAATTCTCGTGCCTTT
TTTTTTTCCGGCATTTCTTTTTACAATTTTGTTTATTGTTTATATCCGATTTTTAAATTTTTTTAAGCTCTGAATTGTACAACTTGACGTGTTAAATACTCGTGATGCTA
CAGAAAATTGTTGGTGGTCTGTTAAATAGTTTTTTGCTTCATTGCTTTTTTATTCATTTGTATTTTCTGGAAAATTAATGAATATTTTGATTTTTGAAAATATTATCTTC
TTAAGATGTTGCCAAAAAGGAATAATTTTTTAGCCCAACCTTAATCAATTTGTATGAAAATAATAATGTCGACAAAATAAATATTGGCCCACCATAGTTGTCACTTTCAT
CTAAGAAAATTTGTGAAAAAAAATTAAAGGATTAGTCTCCTGTGAATTACCATATTCAATTATGTTATTCTCCTTTTCATTTTATCATTATTGTCCACTTGTAATTTTTT
TAATACCATTTTTTTTTTTTGTTAACTCCCAAACTCTTTTCATTTTTCTTTTCATTTTATCTGATGTGAGAGAGAGTTTTCTTTTTTATTAATTTTGAAAATATTCTTTT
ACCGTTCTAAATGTATTATGGAATTTTAACTCGAGATATGGCTAATGGGCA
Protein sequenceShow/hide protein sequence
MEDLQSLSLPLLSETSSGTVAAVSAAARPTSADKSPFLSNQAIAVRLLLVAFIGITSLWANHEASKGFDITILNNAKGSPAGQRFDLFYVSNDEATRLILNASSFIENLI
YPSQAFPKKPVKSVHLTLSRRDLSSNVAVEKLDDAGVDFVVHLSPSIFNERNANRAMSAAVFRGMSRVWLWDGEAHAPPSLLAGMVEHITAVAGFAEEKYSGAVVTTLTT
CDPTWWKDKNPMEVARFLDHHESEQKGFIQRLNQGLKSRWHDRTVEDALGMPAQRPSGSFNSSGIRV