; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg007731 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg007731
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionBasic secretory protein
Genome locationscaffold4:9224520..9230114
RNA-Seq ExpressionSpg007731
SyntenySpg007731
Gene Ontology termsNA
InterPro domainsIPR007541 - Uncharacterised protein family, basic secretory protein


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6572454.1 hypothetical protein SDJN03_29182, partial [Cucurbita argyrosperma subsp. sororia]1.4e-12181.23Show/hide
Query:  MEDRQSLSLPLLSVTSSDTAAASASASPTSANQSPFLSNQAVAVRLLLFAFIGITSLWANHEASKGFDVTILNNAKGSPAGQRFDLFYVSNDEATRLLLN
        MEDRQSLSLPLL  T       SA+A+PTS NQSPFLSN AVAVRLLL AFIGITSLWANHEASKGFD+TILNNAKGSPAGQRFDLFYVSNDEATRL+LN
Subjt:  MEDRQSLSLPLLSVTSSDTAAASASASPTSANQSPFLSNQAVAVRLLLFAFIGITSLWANHEASKGFDVTILNNAKGSPAGQRFDLFYVSNDEATRLLLN

Query:  ASTFIENLIYPSEAFPKKTVKSVHLTLSRRDLSSSVAVEQLDDGGVDFAVHLSPSIFNERNVNRAMSEAVFRGMSRVWLWDGEAHAPPSLLAGMVEHIIA
        AS F+ENLIYPS+ FPKK VKSVHLTLS RDL S+VAVE L DGGVDF V+LSPSIFN+RNVN AMS A+ RGMSRVWLW+GE +APPSLLAGMVEHI A
Subjt:  ASTFIENLIYPSEAFPKKTVKSVHLTLSRRDLSSSVAVEQLDDGGVDFAVHLSPSIFNERNVNRAMSEAVFRGMSRVWLWDGEAHAPPSLLAGMVEHIIA

Query:  AAGFVEMEYSGGVVTTSTACAPMWWKDKDPTEVALFLDHHERQQKGFIQRLNQGLKSRWHDRTVEDALGMPAQRPCE
        +AGFVE +YSGGVVTT  AC PMWWKDKDPTEVA+FLD+ ERQQKGFIQRLNQGLKSRWHDRTVEDA+G+P +R C+
Subjt:  AAGFVEMEYSGGVVTTSTACAPMWWKDKDPTEVALFLDHHERQQKGFIQRLNQGLKSRWHDRTVEDALGMPAQRPCE

KAG7012046.1 hypothetical protein SDJN02_26954, partial [Cucurbita argyrosperma subsp. argyrosperma]6.0e-12080.51Show/hide
Query:  MEDRQSLSLPLLSVTSSDTAAASASASPTSANQSPFLSNQAVAVRLLLFAFIGITSLWANHEASKGFDVTILNNAKGSPAGQRFDLFYVSNDEATRLLLN
        MEDRQSLSLPLL  T       SA+A+PTS NQSPFLSN AVAVRLLL AFIGITSLWANHEASKGFD+TILNNAKGSPAGQRFDLFYVSNDEATRL+LN
Subjt:  MEDRQSLSLPLLSVTSSDTAAASASASPTSANQSPFLSNQAVAVRLLLFAFIGITSLWANHEASKGFDVTILNNAKGSPAGQRFDLFYVSNDEATRLLLN

Query:  ASTFIENLIYPSEAFPKKTVKSVHLTLSRRDLSSSVAVEQLDDGGVDFAVHLSPSIFNERNVNRAMSEAVFRGMSRVWLWDGEAHAPPSLLAGMVEHIIA
        AS F+ENLIYPS+ FPKK VKSVHLTLS RDL S+VAVE L DGGVDF V+LSPSIFN+RNVN AMS A+ RGMSRVWLW+GE +APPSLLAGMVEHI A
Subjt:  ASTFIENLIYPSEAFPKKTVKSVHLTLSRRDLSSSVAVEQLDDGGVDFAVHLSPSIFNERNVNRAMSEAVFRGMSRVWLWDGEAHAPPSLLAGMVEHIIA

Query:  AAGFVEMEYSGGVVTTSTACAPMWWKDKDPTEVALFLDHHERQQKGFIQRLNQGLKSRWHDRTVEDALGMPAQRPCE
        +AGFVE +YSGGVVTT  AC PMWWKDKDPTEVA+FLD+ ERQQ GFIQRLNQGLKSRW DRTVEDA+G+P +R C+
Subjt:  AAGFVEMEYSGGVVTTSTACAPMWWKDKDPTEVALFLDHHERQQKGFIQRLNQGLKSRWHDRTVEDALGMPAQRPCE

XP_022952992.1 uncharacterized protein LOC111455507 [Cucurbita moschata]8.7e-11979.78Show/hide
Query:  MEDRQSLSLPLLSVTSSDTAAASASASPTSANQSPFLSNQAVAVRLLLFAFIGITSLWANHEASKGFDVTILNNAKGSPAGQRFDLFYVSNDEATRLLLN
        MEDRQSLSLPLL  T       SA+ +PTS NQSP LSN AVAVRLLL AFIGITSLWANHEASKGFD+TILNNAKGSPAGQRFDLFYVSNDEATRL+LN
Subjt:  MEDRQSLSLPLLSVTSSDTAAASASASPTSANQSPFLSNQAVAVRLLLFAFIGITSLWANHEASKGFDVTILNNAKGSPAGQRFDLFYVSNDEATRLLLN

Query:  ASTFIENLIYPSEAFPKKTVKSVHLTLSRRDLSSSVAVEQLDDGGVDFAVHLSPSIFNERNVNRAMSEAVFRGMSRVWLWDGEAHAPPSLLAGMVEHIIA
        AS F+ENLIYPS+ FPKK VKSVHLTLS RDL S+VAVE L DGGVDF V+LSPSIFN+RNVN AMS A+ RGMSRVWLW+GE +APPSLLAGMVEHI A
Subjt:  ASTFIENLIYPSEAFPKKTVKSVHLTLSRRDLSSSVAVEQLDDGGVDFAVHLSPSIFNERNVNRAMSEAVFRGMSRVWLWDGEAHAPPSLLAGMVEHIIA

Query:  AAGFVEMEYSGGVVTTSTACAPMWWKDKDPTEVALFLDHHERQQKGFIQRLNQGLKSRWHDRTVEDALGMPAQRPCE
        +AGFVE +YSGGVVTT  AC PMWWKDKDPTEVA+FL++ ERQQKGFIQRLNQGLKSRW DRTVEDA+G+P +R C+
Subjt:  AAGFVEMEYSGGVVTTSTACAPMWWKDKDPTEVALFLDHHERQQKGFIQRLNQGLKSRWHDRTVEDALGMPAQRPCE

XP_022989196.1 uncharacterized protein LOC111486342 [Cucurbita maxima]2.5e-11879.06Show/hide
Query:  MEDRQSLSLPLLSVTSSDTAAASASASPTSANQSPFLSNQAVAVRLLLFAFIGITSLWANHEASKGFDVTILNNAKGSPAGQRFDLFYVSNDEATRLLLN
        MEDRQSLSLPLLS           +A PTSANQSPFLSNQA+A+RLLL AF+GITSL ANHEASKGFD+TILNNAK SPAGQRF LFYVSNDEATRL+LN
Subjt:  MEDRQSLSLPLLSVTSSDTAAASASASPTSANQSPFLSNQAVAVRLLLFAFIGITSLWANHEASKGFDVTILNNAKGSPAGQRFDLFYVSNDEATRLLLN

Query:  ASTFIENLIYPSEAFPKKTVKSVHLTLSRRDLSSSVAVEQLDDGGVDFAVHLSPSIFNERNVNRAMSEAVFRGMSRVWLWDGEAHAPPSLLAGMVEHIIA
        AS+FIENLIYPS+AFPKK VKSVHLTLSR DLSSS AVE+L DGG DF +HLSPSIFNE+N NRAMS AVFRGMSRVWLWDGEAHAPPSLLAGMVEHIIA
Subjt:  ASTFIENLIYPSEAFPKKTVKSVHLTLSRRDLSSSVAVEQLDDGGVDFAVHLSPSIFNERNVNRAMSEAVFRGMSRVWLWDGEAHAPPSLLAGMVEHIIA

Query:  AAGFVEMEYSGGVVTTST-ACAPMWWKDKDPTEVALFLDHHERQQKGFIQRLNQGLKSRWHDRTVEDALGMPAQRPC
        AAG V  +YSGGV +T T AC P WWKDKDPTE+A+FLDH E++++GF+QRLNQGLK+RWHDRTVEDA+G+P Q PC
Subjt:  AAGFVEMEYSGGVVTTST-ACAPMWWKDKDPTEVALFLDHHERQQKGFIQRLNQGLKSRWHDRTVEDALGMPAQRPC

XP_023553976.1 uncharacterized protein LOC111811392 [Cucurbita pepo subsp. pepo]8.7e-11979.78Show/hide
Query:  MEDRQSLSLPLLSVTSSDTAAASASASPTSANQSPFLSNQAVAVRLLLFAFIGITSLWANHEASKGFDVTILNNAKGSPAGQRFDLFYVSNDEATRLLLN
        MEDRQSLSLPLL  T       SA+A+PTS NQSPFLSN AVAVRLLL AFIGITSLWANHEASKGFD+TILNNAKGSPAGQRFDLFYVSNDEATRL+LN
Subjt:  MEDRQSLSLPLLSVTSSDTAAASASASPTSANQSPFLSNQAVAVRLLLFAFIGITSLWANHEASKGFDVTILNNAKGSPAGQRFDLFYVSNDEATRLLLN

Query:  ASTFIENLIYPSEAFPKKTVKSVHLTLSRRDLSSSVAVEQLDDGGVDFAVHLSPSIFNERNVNRAMSEAVFRGMSRVWLWDGEAHAPPSLLAGMVEHIIA
        AS F+ENLIYPS+ FPKK VKSVHLTLS RDL S+VAVE L DGGVDF V+LSPSIFN+RNVN AMS A+ RG+SRVWLW+GE +APPSLLAGMVEHI A
Subjt:  ASTFIENLIYPSEAFPKKTVKSVHLTLSRRDLSSSVAVEQLDDGGVDFAVHLSPSIFNERNVNRAMSEAVFRGMSRVWLWDGEAHAPPSLLAGMVEHIIA

Query:  AAGFVEMEYSGGVVTTSTACAPMWWKDKDPTEVALFLDHHERQQKGFIQRLNQGLKSRWHDRTVEDALGMPAQRPCE
        +AGFVE +YSGGVVTT  AC PMWWKDKDPTEVA+FL++ ERQQKGFIQRLNQ LKSRW DRTVE+A+GMP +R C+
Subjt:  AAGFVEMEYSGGVVTTSTACAPMWWKDKDPTEVALFLDHHERQQKGFIQRLNQGLKSRWHDRTVEDALGMPAQRPCE

TrEMBL top hitse value%identityAlignment
A0A5A7V0J1 Peptidase3.6e-11075Show/hide
Query:  MEDRQSLSLPLLSVTSSDTAAASASASPTSANQSPFLSNQAVAVRLLLFAFIGITSLWANHEASKGFDVTILNNAKGSPAGQRFDLFYVSNDEATRLLLN
        ME+R+SLS PLLS TS+       SA+PT+   SP LSN AV +RLLL A +G+TSLWANHEASKGFD+TILNNAKGS AGQRFDLFYVSNDEATRLLLN
Subjt:  MEDRQSLSLPLLSVTSSDTAAASASASPTSANQSPFLSNQAVAVRLLLFAFIGITSLWANHEASKGFDVTILNNAKGSPAGQRFDLFYVSNDEATRLLLN

Query:  ASTFIENLIYPSEAFPKKTVKSVHLTLSRRDLSSSVAVEQLDDGGVDFAVHLSPSIFNERNVNRAMSEAVFRGMSRVWLWDGEAHAPPSLLAGMVEHIIA
        AS FI+NLIYPS+  PKK +KSVHLTLS RDLSS+VAVEQL DGGVDFAVHLSPSIFN+ N+N AMS A+ RGMSRVWLW+GE HAPPSLL GMVEHI+A
Subjt:  ASTFIENLIYPSEAFPKKTVKSVHLTLSRRDLSSSVAVEQLDDGGVDFAVHLSPSIFNERNVNRAMSEAVFRGMSRVWLWDGEAHAPPSLLAGMVEHIIA

Query:  AAGFVEMEYSGGVVTTSTACAPMWWKDKDPTEVALFLDHHERQQKGFIQRLNQGLKSRWHDRTVEDALGMPAQRPC
        AAGFVE +YSGG V T  AC PMWWKDKDP E+A FLD+HERQ +GFIQRLNQ L+SRW DRTV++ LG+P QRPC
Subjt:  AAGFVEMEYSGGVVTTSTACAPMWWKDKDPTEVALFLDHHERQQKGFIQRLNQGLKSRWHDRTVEDALGMPAQRPC

A0A6J1EXI2 uncharacterized protein LOC1114370818.2e-11577.26Show/hide
Query:  MEDRQSLSLPLLSVTSSDTAAASASASPTSANQSPFLSNQAVAVRLLLFAFIGITSLWANHEASKGFDVTILNNAKGSPAGQRFDLFYVSNDEATRLLLN
        MEDRQSLSLPLLS           +A PT ANQSPFL NQA+A+RLLL AF+GITSL ANHEASKGF++TILNNAK SPAGQRF LFYVSNDEATRL+LN
Subjt:  MEDRQSLSLPLLSVTSSDTAAASASASPTSANQSPFLSNQAVAVRLLLFAFIGITSLWANHEASKGFDVTILNNAKGSPAGQRFDLFYVSNDEATRLLLN

Query:  ASTFIENLIYPSEAFPKKTVKSVHLTLSRRDLSSSVAVEQLDDGGVDFAVHLSPSIFNERNVNRAMSEAVFRGMSRVWLWDGEAHAPPSLLAGMVEHIIA
        AS+FIENLIYPS AFPKK VKSVHLTLSR DLSSS AVE+L DGG DF +HLSPSI NE++ NRAMS AVFRGMSRVWLWDGEA APP+LLAGMVEHIIA
Subjt:  ASTFIENLIYPSEAFPKKTVKSVHLTLSRRDLSSSVAVEQLDDGGVDFAVHLSPSIFNERNVNRAMSEAVFRGMSRVWLWDGEAHAPPSLLAGMVEHIIA

Query:  AAGFVEMEYSGGVVTTST-ACAPMWWKDKDPTEVALFLDHHERQQKGFIQRLNQGLKSRWHDRTVEDALGMPAQRPC
        AAGFV  +YSGGVV+  T AC P WWKDKDPTE+A+FLDH E++++GFIQRLNQGLK RWHDRTVEDA+G+P Q PC
Subjt:  AAGFVEMEYSGGVVTTST-ACAPMWWKDKDPTEVALFLDHHERQQKGFIQRLNQGLKSRWHDRTVEDALGMPAQRPC

A0A6J1GLY7 uncharacterized protein LOC1114555074.2e-11979.78Show/hide
Query:  MEDRQSLSLPLLSVTSSDTAAASASASPTSANQSPFLSNQAVAVRLLLFAFIGITSLWANHEASKGFDVTILNNAKGSPAGQRFDLFYVSNDEATRLLLN
        MEDRQSLSLPLL  T       SA+ +PTS NQSP LSN AVAVRLLL AFIGITSLWANHEASKGFD+TILNNAKGSPAGQRFDLFYVSNDEATRL+LN
Subjt:  MEDRQSLSLPLLSVTSSDTAAASASASPTSANQSPFLSNQAVAVRLLLFAFIGITSLWANHEASKGFDVTILNNAKGSPAGQRFDLFYVSNDEATRLLLN

Query:  ASTFIENLIYPSEAFPKKTVKSVHLTLSRRDLSSSVAVEQLDDGGVDFAVHLSPSIFNERNVNRAMSEAVFRGMSRVWLWDGEAHAPPSLLAGMVEHIIA
        AS F+ENLIYPS+ FPKK VKSVHLTLS RDL S+VAVE L DGGVDF V+LSPSIFN+RNVN AMS A+ RGMSRVWLW+GE +APPSLLAGMVEHI A
Subjt:  ASTFIENLIYPSEAFPKKTVKSVHLTLSRRDLSSSVAVEQLDDGGVDFAVHLSPSIFNERNVNRAMSEAVFRGMSRVWLWDGEAHAPPSLLAGMVEHIIA

Query:  AAGFVEMEYSGGVVTTSTACAPMWWKDKDPTEVALFLDHHERQQKGFIQRLNQGLKSRWHDRTVEDALGMPAQRPCE
        +AGFVE +YSGGVVTT  AC PMWWKDKDPTEVA+FL++ ERQQKGFIQRLNQGLKSRW DRTVEDA+G+P +R C+
Subjt:  AAGFVEMEYSGGVVTTSTACAPMWWKDKDPTEVALFLDHHERQQKGFIQRLNQGLKSRWHDRTVEDALGMPAQRPCE

A0A6J1I2J5 uncharacterized protein LOC1114684367.4e-11678.7Show/hide
Query:  MEDRQSLSLPLLSVTSSDTAAASASASPTSANQSPFLSNQAVAVRLLLFAFIGITSLWANHEASKGFDVTILNNAKGSPAGQRFDLFYVSNDEATRLLLN
        MEDRQSLSLPLL  T       SA+A+PTS NQSPFLSN AVAVRLLL AFIGITSLWANHEASKGF VTILNNAKGSPAGQRFDLFYVSNDEATRL+LN
Subjt:  MEDRQSLSLPLLSVTSSDTAAASASASPTSANQSPFLSNQAVAVRLLLFAFIGITSLWANHEASKGFDVTILNNAKGSPAGQRFDLFYVSNDEATRLLLN

Query:  ASTFIENLIYPSEAFPKKTVKSVHLTLSRRDLSSSVAVEQLDDGGVDFAVHLSPSIFNERNVNRAMSEAVFRGMSRVWLWDGEAHAPPSLLAGMVEHIIA
        AS F+ENLIYPS+ FPKK VKSVHLTLS RDL S+VAVE L DGGVDF V+LSPSIFN+RNVN AMS A+ RGMS VWLW+GE HAPPSLLAGMVEHI A
Subjt:  ASTFIENLIYPSEAFPKKTVKSVHLTLSRRDLSSSVAVEQLDDGGVDFAVHLSPSIFNERNVNRAMSEAVFRGMSRVWLWDGEAHAPPSLLAGMVEHIIA

Query:  AAGFVEMEYSGGVVTTSTACAPMWWKDKDPTEVALFLDHHERQQKGFIQRLNQGLKSRWHDRTVEDALGMPAQRPCE
        +AGFVE +  GGVV+T  AC PMWWKDK+PTEVA+FL + ERQQKGFIQRLNQGL+SRW DRTVEDA+GM  +R C+
Subjt:  AAGFVEMEYSGGVVTTSTACAPMWWKDKDPTEVALFLDHHERQQKGFIQRLNQGLKSRWHDRTVEDALGMPAQRPCE

A0A6J1JJD5 uncharacterized protein LOC1114863421.2e-11879.06Show/hide
Query:  MEDRQSLSLPLLSVTSSDTAAASASASPTSANQSPFLSNQAVAVRLLLFAFIGITSLWANHEASKGFDVTILNNAKGSPAGQRFDLFYVSNDEATRLLLN
        MEDRQSLSLPLLS           +A PTSANQSPFLSNQA+A+RLLL AF+GITSL ANHEASKGFD+TILNNAK SPAGQRF LFYVSNDEATRL+LN
Subjt:  MEDRQSLSLPLLSVTSSDTAAASASASPTSANQSPFLSNQAVAVRLLLFAFIGITSLWANHEASKGFDVTILNNAKGSPAGQRFDLFYVSNDEATRLLLN

Query:  ASTFIENLIYPSEAFPKKTVKSVHLTLSRRDLSSSVAVEQLDDGGVDFAVHLSPSIFNERNVNRAMSEAVFRGMSRVWLWDGEAHAPPSLLAGMVEHIIA
        AS+FIENLIYPS+AFPKK VKSVHLTLSR DLSSS AVE+L DGG DF +HLSPSIFNE+N NRAMS AVFRGMSRVWLWDGEAHAPPSLLAGMVEHIIA
Subjt:  ASTFIENLIYPSEAFPKKTVKSVHLTLSRRDLSSSVAVEQLDDGGVDFAVHLSPSIFNERNVNRAMSEAVFRGMSRVWLWDGEAHAPPSLLAGMVEHIIA

Query:  AAGFVEMEYSGGVVTTST-ACAPMWWKDKDPTEVALFLDHHERQQKGFIQRLNQGLKSRWHDRTVEDALGMPAQRPC
        AAG V  +YSGGV +T T AC P WWKDKDPTE+A+FLDH E++++GF+QRLNQGLK+RWHDRTVEDA+G+P Q PC
Subjt:  AAGFVEMEYSGGVVTTST-ACAPMWWKDKDPTEVALFLDHHERQQKGFIQRLNQGLKSRWHDRTVEDALGMPAQRPC

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G42900.1 Plant basic secretory protein (BSP) family protein5.6e-3939.74Show/hide
Query:  SNQAVAVRLLLFAFIGITSLWANHEASKGFDVTILNNAKGSPAGQRFDLFYVSNDEATRLLLNASTFIENLIYPSEAFPKKTVKSV-HLTLSRRDLSSSV
        S+  + +RL     +G  SLWANHEASKGF ++I+N+AK SP+G+RF LF+ S+D A R+LL+AS F+E  +Y  E  P +  K V H+T+     SS  
Subjt:  SNQAVAVRLLLFAFIGITSLWANHEASKGFDVTILNNAKGSPAGQRFDLFYVSNDEATRLLLNASTFIENLIYPSEAFPKKTVKSV-HLTLSRRDLSSSV

Query:  AVEQLDDGGV---DFAVHLSPSIFNERNVNRAMSEAVFRGMSRVWLWDGEAHAPPSLLAGMVEHI-IAAAGFVEMEYSGGVVTTSTACAPMWWKDKDPT-
                G    ++ + LSPS+   +  + A+  A+ R M R+WLW  E+ A P L+AGMVE++ + +      E  GG            WKDK+ + 
Subjt:  AVEQLDDGGV---DFAVHLSPSIFNERNVNRAMSEAVFRGMSRVWLWDGEAHAPPSLLAGMVEHI-IAAAGFVEMEYSGGVVTTSTACAPMWWKDKDPT-

Query:  EVALFLDHHERQQKGFIQRLNQGLKSRWHDRTVE
         V   LD+ ER+ +GFI+RLN G++ RW DRTV+
Subjt:  EVALFLDHHERQQKGFIQRLNQGLKSRWHDRTVE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGATCGCCAATCCCTCTCCCTCCCTCTCCTCTCGGTAACCAGCTCCGACACCGCCGCCGCCTCCGCCTCCGCCAGTCCCACTTCTGCAAACCAATCCCCCTTCCT
CTCAAATCAGGCCGTCGCCGTCCGCCTCCTTCTGTTCGCCTTCATCGGAATCACTTCACTATGGGCCAATCACGAAGCCTCAAAAGGCTTCGATGTCACCATTCTCAACA
ACGCCAAAGGCTCCCCGGCCGGTCAACGCTTCGATCTCTTCTACGTCTCAAATGACGAGGCCACGCGCCTGCTCCTCAACGCGAGCACCTTCATCGAGAATCTGATCTAC
CCTTCCGAAGCCTTCCCAAAGAAAACGGTCAAGAGCGTGCATCTCACGCTCTCCCGCCGCGATCTGTCCTCTAGTGTCGCCGTCGAGCAGCTCGACGACGGCGGAGTTGA
CTTCGCCGTTCATTTGAGTCCCTCGATTTTCAATGAGAGAAACGTGAATCGCGCCATGTCAGAGGCGGTTTTCAGAGGGATGTCACGCGTGTGGCTGTGGGATGGAGAGG
CCCACGCGCCGCCCTCGCTCCTCGCCGGAATGGTCGAGCACATAATCGCGGCGGCCGGATTTGTCGAGATGGAATATTCCGGTGGGGTCGTTACCACGTCGACGGCATGT
GCTCCCATGTGGTGGAAGGACAAGGATCCCACGGAGGTCGCACTCTTTCTCGACCATCACGAAAGACAGCAGAAAGGTTTCATCCAACGGCTGAATCAAGGTTTGAAGTC
CAGGTGGCATGATCGGACGGTGGAGGATGCGCTGGGTATGCCAGCTCAGCGTCCATGTGAATACGGTATAATTTCATATGGGTTAAGTTTTAAAATTAGTGGCTGTTATC
GTAATTCTAGACCACTTCGAGATGTAAGGATCCAACAAGGACAACGCAACACCAACTCAAGGACACGACCAGGAAGCGAACCCCAGATGAGAGGTAGGCCAAAAGGAATT
GGGCCAAGGCCTAATAACTTGGCCTGGTCGACCTCGACTCGCTTTGGTAGGTTGGGTCTTCTCCCTTTCGCTCGGTCCCTAGTGTTTTGGTCCAGTCTGGGGATAGAGAA
GAAAAGCCAGAATCCAAACACAGAACCTTCAGAAAATAACCGCAATCCGCAGAGGTCACGAGAAGAGAATCACTCGCAGAGATCACAAGAAGAAGGTCGAGGTCGGCCTC
AGTCTCCTCGGCCTCAGCACCTTCAACCTTCATCCCGAGAGAGACAGGTTGATGTCAAAATTGCTGCCCTCGACGATAAAGTAAGTGCGACGGATCACAATTTATCTAGG
ATACTTCATATCCTGGATAAATCTGGTCCTAACACTAAAATCCATGATGAGAGGTTGGTTAGGGATCCGAGGAAAGGGAAGGAACCAGTAAAGTACACTACAGAGTCAGA
AACAAGGTCGAAGGGGAAAAGGACTGATAGCGTTACCAGCAAGGTCAGGGGGCTGAAGTCTACGAATCGCACAGTATTGAGAAGCCTAGAGTCAAGCACGTACAAGGGAC
ACGACCACACCGTTTCGACGCCGAGCATCAGCCGTAGAACAGGTGCAAACCTGAGGAACCTGATTGAAGAAATACGCAGAATGGCCAAAGCTGTCGAGGCCGAGGCTAAG
AATGATGACCCCTCTTGGAAGACCGAGCTTTTGAAAACCTTGAAGGAGTTTGTGAATTTCCAAGGGGACCTGCACAAATCAAAAAGTTCGGAGGACCAGAATTTGGAAGC
CCTGATCGATCAGGTTGATCTACCCTTCACTGATGAAGTTATGAAAGTTGAGGTGTCCCAGAAGTTCAAGGTACCCACATTCAAACAATATGATGGTAAGAAGGACCCTA
TACAGCACATAGATGCCTACAGGAACTGGATGGACTTCCATGGCGTTTTAGACGCAATTAGGTGTCGCGTTTTCTCTTTCACTCTGACAGGATCAGCCAAGCAATGGTTT
CAGAGTGAGGAAGCAGCTCTAGTAGCTATAACAGCAGGGCTAGAAGATGAGAGGTTGCTTAATTCGATAGTTAAGAACCAACCTCGAACCTATGCTGAATTTGTTTCCAG
GGCACAGAAAAAGAAGGACAAAAAACCATGGACGGACGATGGTGGCCGAGGCCGAGCACATCCCTTTGGTAAGTTTGAGAAATACACGCCAACTGCTGTCCCGCAAGAGC
AAGTTTTGATGGAGATCCGAAACACGAGACTTCTTAAATTCCCGGGGAGGATGAAGTCAAACCCCGATAAAAGAGACAAGAGCCAGTATTGTGTTTTCCACCGAGACCAT
GGGCATTCAACCAGAAATTGTATTCAGTTGAAGGATGAAATTGAGGCACTAATCCAGAATGGATATCTGAAGGAGTTCGTCGTCGAAACCGGGGCCGAGGTCTACCTCGC
CTCATCCAGCTTTGAGATTATATTTGAGATTAATGTGAGTCTGGACCACCGAATCTACCTCGCCTCATCTGGCTTGAGACCATGCACTTCATCTGAGCCAAGCTCACCTG
GCCGGGGCCGAGGCCGAGCACATCTTGCCGAGGCCGACCCCAAACTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGATCGCCAATCCCTCTCCCTCCCTCTCCTCTCGGTAACCAGCTCCGACACCGCCGCCGCCTCCGCCTCCGCCAGTCCCACTTCTGCAAACCAATCCCCCTTCCT
CTCAAATCAGGCCGTCGCCGTCCGCCTCCTTCTGTTCGCCTTCATCGGAATCACTTCACTATGGGCCAATCACGAAGCCTCAAAAGGCTTCGATGTCACCATTCTCAACA
ACGCCAAAGGCTCCCCGGCCGGTCAACGCTTCGATCTCTTCTACGTCTCAAATGACGAGGCCACGCGCCTGCTCCTCAACGCGAGCACCTTCATCGAGAATCTGATCTAC
CCTTCCGAAGCCTTCCCAAAGAAAACGGTCAAGAGCGTGCATCTCACGCTCTCCCGCCGCGATCTGTCCTCTAGTGTCGCCGTCGAGCAGCTCGACGACGGCGGAGTTGA
CTTCGCCGTTCATTTGAGTCCCTCGATTTTCAATGAGAGAAACGTGAATCGCGCCATGTCAGAGGCGGTTTTCAGAGGGATGTCACGCGTGTGGCTGTGGGATGGAGAGG
CCCACGCGCCGCCCTCGCTCCTCGCCGGAATGGTCGAGCACATAATCGCGGCGGCCGGATTTGTCGAGATGGAATATTCCGGTGGGGTCGTTACCACGTCGACGGCATGT
GCTCCCATGTGGTGGAAGGACAAGGATCCCACGGAGGTCGCACTCTTTCTCGACCATCACGAAAGACAGCAGAAAGGTTTCATCCAACGGCTGAATCAAGGTTTGAAGTC
CAGGTGGCATGATCGGACGGTGGAGGATGCGCTGGGTATGCCAGCTCAGCGTCCATGTGAATACGGTATAATTTCATATGGGTTAAGTTTTAAAATTAGTGGCTGTTATC
GTAATTCTAGACCACTTCGAGATGTAAGGATCCAACAAGGACAACGCAACACCAACTCAAGGACACGACCAGGAAGCGAACCCCAGATGAGAGGTAGGCCAAAAGGAATT
GGGCCAAGGCCTAATAACTTGGCCTGGTCGACCTCGACTCGCTTTGGTAGGTTGGGTCTTCTCCCTTTCGCTCGGTCCCTAGTGTTTTGGTCCAGTCTGGGGATAGAGAA
GAAAAGCCAGAATCCAAACACAGAACCTTCAGAAAATAACCGCAATCCGCAGAGGTCACGAGAAGAGAATCACTCGCAGAGATCACAAGAAGAAGGTCGAGGTCGGCCTC
AGTCTCCTCGGCCTCAGCACCTTCAACCTTCATCCCGAGAGAGACAGGTTGATGTCAAAATTGCTGCCCTCGACGATAAAGTAAGTGCGACGGATCACAATTTATCTAGG
ATACTTCATATCCTGGATAAATCTGGTCCTAACACTAAAATCCATGATGAGAGGTTGGTTAGGGATCCGAGGAAAGGGAAGGAACCAGTAAAGTACACTACAGAGTCAGA
AACAAGGTCGAAGGGGAAAAGGACTGATAGCGTTACCAGCAAGGTCAGGGGGCTGAAGTCTACGAATCGCACAGTATTGAGAAGCCTAGAGTCAAGCACGTACAAGGGAC
ACGACCACACCGTTTCGACGCCGAGCATCAGCCGTAGAACAGGTGCAAACCTGAGGAACCTGATTGAAGAAATACGCAGAATGGCCAAAGCTGTCGAGGCCGAGGCTAAG
AATGATGACCCCTCTTGGAAGACCGAGCTTTTGAAAACCTTGAAGGAGTTTGTGAATTTCCAAGGGGACCTGCACAAATCAAAAAGTTCGGAGGACCAGAATTTGGAAGC
CCTGATCGATCAGGTTGATCTACCCTTCACTGATGAAGTTATGAAAGTTGAGGTGTCCCAGAAGTTCAAGGTACCCACATTCAAACAATATGATGGTAAGAAGGACCCTA
TACAGCACATAGATGCCTACAGGAACTGGATGGACTTCCATGGCGTTTTAGACGCAATTAGGTGTCGCGTTTTCTCTTTCACTCTGACAGGATCAGCCAAGCAATGGTTT
CAGAGTGAGGAAGCAGCTCTAGTAGCTATAACAGCAGGGCTAGAAGATGAGAGGTTGCTTAATTCGATAGTTAAGAACCAACCTCGAACCTATGCTGAATTTGTTTCCAG
GGCACAGAAAAAGAAGGACAAAAAACCATGGACGGACGATGGTGGCCGAGGCCGAGCACATCCCTTTGGTAAGTTTGAGAAATACACGCCAACTGCTGTCCCGCAAGAGC
AAGTTTTGATGGAGATCCGAAACACGAGACTTCTTAAATTCCCGGGGAGGATGAAGTCAAACCCCGATAAAAGAGACAAGAGCCAGTATTGTGTTTTCCACCGAGACCAT
GGGCATTCAACCAGAAATTGTATTCAGTTGAAGGATGAAATTGAGGCACTAATCCAGAATGGATATCTGAAGGAGTTCGTCGTCGAAACCGGGGCCGAGGTCTACCTCGC
CTCATCCAGCTTTGAGATTATATTTGAGATTAATGTGAGTCTGGACCACCGAATCTACCTCGCCTCATCTGGCTTGAGACCATGCACTTCATCTGAGCCAAGCTCACCTG
GCCGGGGCCGAGGCCGAGCACATCTTGCCGAGGCCGACCCCAAACTTTAA
Protein sequenceShow/hide protein sequence
MEDRQSLSLPLLSVTSSDTAAASASASPTSANQSPFLSNQAVAVRLLLFAFIGITSLWANHEASKGFDVTILNNAKGSPAGQRFDLFYVSNDEATRLLLNASTFIENLIY
PSEAFPKKTVKSVHLTLSRRDLSSSVAVEQLDDGGVDFAVHLSPSIFNERNVNRAMSEAVFRGMSRVWLWDGEAHAPPSLLAGMVEHIIAAAGFVEMEYSGGVVTTSTAC
APMWWKDKDPTEVALFLDHHERQQKGFIQRLNQGLKSRWHDRTVEDALGMPAQRPCEYGIISYGLSFKISGCYRNSRPLRDVRIQQGQRNTNSRTRPGSEPQMRGRPKGI
GPRPNNLAWSTSTRFGRLGLLPFARSLVFWSSLGIEKKSQNPNTEPSENNRNPQRSREENHSQRSQEEGRGRPQSPRPQHLQPSSRERQVDVKIAALDDKVSATDHNLSR
ILHILDKSGPNTKIHDERLVRDPRKGKEPVKYTTESETRSKGKRTDSVTSKVRGLKSTNRTVLRSLESSTYKGHDHTVSTPSISRRTGANLRNLIEEIRRMAKAVEAEAK
NDDPSWKTELLKTLKEFVNFQGDLHKSKSSEDQNLEALIDQVDLPFTDEVMKVEVSQKFKVPTFKQYDGKKDPIQHIDAYRNWMDFHGVLDAIRCRVFSFTLTGSAKQWF
QSEEAALVAITAGLEDERLLNSIVKNQPRTYAEFVSRAQKKKDKKPWTDDGGRGRAHPFGKFEKYTPTAVPQEQVLMEIRNTRLLKFPGRMKSNPDKRDKSQYCVFHRDH
GHSTRNCIQLKDEIEALIQNGYLKEFVVETGAEVYLASSSFEIIFEINVSLDHRIYLASSGLRPCTSSEPSSPGRGRGRAHLAEADPKL