; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg021642 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg021642
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
Description(1->3)-beta-glucan endohydrolase
Genome locationscaffold2:16660836..16682106
RNA-Seq ExpressionSpg021642
SyntenySpg021642
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6590116.1 hypothetical protein SDJN03_15539, partial [Cucurbita argyrosperma subsp. sororia]3.7e-9888.84Show/hide
Query:  TLFLAAEISEDEAEDRLQDAIQEKFAVFGDNAGGLWWEGHEEGEEKDHQAIDILLAEIDIYELFAFKHCKGRKVKLALCEELDERMRDLKNELQSFDGEE
        T+    EISEDEAEDRLQDAIQEKFAVFGD               KDHQAIDILLAEIDIYELFAFKHCKGRKVKLALCEELDERMRDLKNELQSFDGEE
Subjt:  TLFLAAEISEDEAEDRLQDAIQEKFAVFGDNAGGLWWEGHEEGEEKDHQAIDILLAEIDIYELFAFKHCKGRKVKLALCEELDERMRDLKNELQSFDGEE

Query:  YDESHKRKAIDALKRMENWNLFSDTYEEFQNYTVARDTFLAHLGATLWGSMRHIISPSLSDGSFHYFEKISFQLFFITQEKVRQIKKLPVDLKALMDGLS
        YDE+HKRKAIDALKRMENWNLFSDTYEEFQNYTVARDTFLAHLGATLWGSMRHIISPSLSDGSFHYFEKISFQLFFITQEKVR IK+LPVDLKALMDGLS
Subjt:  YDESHKRKAIDALKRMENWNLFSDTYEEFQNYTVARDTFLAHLGATLWGSMRHIISPSLSDGSFHYFEKISFQLFFITQEKVRQIKKLPVDLKALMDGLS

Query:  SLLLPSQKALFSQTI
        SLLLPSQKALFSQT+
Subjt:  SLLLPSQKALFSQTI

XP_022960981.1 uncharacterized protein LOC111461618 [Cucurbita moschata]3.7e-9888.84Show/hide
Query:  TLFLAAEISEDEAEDRLQDAIQEKFAVFGDNAGGLWWEGHEEGEEKDHQAIDILLAEIDIYELFAFKHCKGRKVKLALCEELDERMRDLKNELQSFDGEE
        T+    EISEDEAEDRLQDAIQEKFAVFGD               KDHQAIDILLAEIDIYELFAFKHCKGRKVKLALCEELDERMRDLKNELQSFDGEE
Subjt:  TLFLAAEISEDEAEDRLQDAIQEKFAVFGDNAGGLWWEGHEEGEEKDHQAIDILLAEIDIYELFAFKHCKGRKVKLALCEELDERMRDLKNELQSFDGEE

Query:  YDESHKRKAIDALKRMENWNLFSDTYEEFQNYTVARDTFLAHLGATLWGSMRHIISPSLSDGSFHYFEKISFQLFFITQEKVRQIKKLPVDLKALMDGLS
        YDE+HKRKAIDALKRMENWNLFSDTYEEFQNYTVARDTFLAHLGATLWGSMRHIISPSLSDGSFHYFEKISFQLFFITQEKVR IK+LPVDLKALMDGLS
Subjt:  YDESHKRKAIDALKRMENWNLFSDTYEEFQNYTVARDTFLAHLGATLWGSMRHIISPSLSDGSFHYFEKISFQLFFITQEKVRQIKKLPVDLKALMDGLS

Query:  SLLLPSQKALFSQTI
        SLLLPSQKALFSQT+
Subjt:  SLLLPSQKALFSQTI

XP_022987363.1 uncharacterized protein LOC111484942 [Cucurbita maxima]3.7e-9888.84Show/hide
Query:  TLFLAAEISEDEAEDRLQDAIQEKFAVFGDNAGGLWWEGHEEGEEKDHQAIDILLAEIDIYELFAFKHCKGRKVKLALCEELDERMRDLKNELQSFDGEE
        T+    EISEDEAEDRLQDAIQEKFAVFGD               KDHQAIDILLAEIDIYELFAFKHCKGRKVKLALCEELDERMRDLKNELQSFDGEE
Subjt:  TLFLAAEISEDEAEDRLQDAIQEKFAVFGDNAGGLWWEGHEEGEEKDHQAIDILLAEIDIYELFAFKHCKGRKVKLALCEELDERMRDLKNELQSFDGEE

Query:  YDESHKRKAIDALKRMENWNLFSDTYEEFQNYTVARDTFLAHLGATLWGSMRHIISPSLSDGSFHYFEKISFQLFFITQEKVRQIKKLPVDLKALMDGLS
        YDE+HKRKAIDALKRMENWNLFSDTYEEFQNYTVARDTFLAHLGATLWGSMRHIISPSLSDGSFHYFEKISFQLFFITQEKVR IK+LPVDLKALMDGLS
Subjt:  YDESHKRKAIDALKRMENWNLFSDTYEEFQNYTVARDTFLAHLGATLWGSMRHIISPSLSDGSFHYFEKISFQLFFITQEKVRQIKKLPVDLKALMDGLS

Query:  SLLLPSQKALFSQTI
        SLLLPSQKALFSQT+
Subjt:  SLLLPSQKALFSQTI

XP_023516259.1 uncharacterized protein LOC111780168 [Cucurbita pepo subsp. pepo]3.2e-9788.37Show/hide
Query:  TLFLAAEISEDEAEDRLQDAIQEKFAVFGDNAGGLWWEGHEEGEEKDHQAIDILLAEIDIYELFAFKHCKGRKVKLALCEELDERMRDLKNELQSFDGEE
        T+    EISEDEAEDRLQDAIQEKFAVFGD               KDHQAIDILLAEIDIYELFAFKHCKGRKVKLALCEELDERMRDLKNELQSFDGEE
Subjt:  TLFLAAEISEDEAEDRLQDAIQEKFAVFGDNAGGLWWEGHEEGEEKDHQAIDILLAEIDIYELFAFKHCKGRKVKLALCEELDERMRDLKNELQSFDGEE

Query:  YDESHKRKAIDALKRMENWNLFSDTYEEFQNYTVARDTFLAHLGATLWGSMRHIISPSLSDGSFHYFEKISFQLFFITQEKVRQIKKLPVDLKALMDGLS
        YDE+HKRKAIDALKRMENWNLFSDTYEEFQNYTVARDTFLAHLGATLWGSMRHIISPSLSDGSFHY EKISFQLFFITQEKVR IK+LPVDLKALMDGLS
Subjt:  YDESHKRKAIDALKRMENWNLFSDTYEEFQNYTVARDTFLAHLGATLWGSMRHIISPSLSDGSFHYFEKISFQLFFITQEKVRQIKKLPVDLKALMDGLS

Query:  SLLLPSQKALFSQTI
        SLLLPSQKALFSQT+
Subjt:  SLLLPSQKALFSQTI

XP_038880657.1 uncharacterized protein LOC120072284 isoform X2 [Benincasa hispida]9.2e-9787.44Show/hide
Query:  TLFLAAEISEDEAEDRLQDAIQEKFAVFGDNAGGLWWEGHEEGEEKDHQAIDILLAEIDIYELFAFKHCKGRKVKLALCEELDERMRDLKNELQSFDGEE
        T+    EISEDEAEDRLQDAIQEKFAVFGD               KDHQAIDILLAEIDIYELFAFKHCKGRKVKLALCEELDERMRDLKNELQSFDGEE
Subjt:  TLFLAAEISEDEAEDRLQDAIQEKFAVFGDNAGGLWWEGHEEGEEKDHQAIDILLAEIDIYELFAFKHCKGRKVKLALCEELDERMRDLKNELQSFDGEE

Query:  YDESHKRKAIDALKRMENWNLFSDTYEEFQNYTVARDTFLAHLGATLWGSMRHIISPSLSDGSFHYFEKISFQLFFITQEKVRQIKKLPVDLKALMDGLS
        YDE HKRKAIDALKRMENWNLFSDTYEEFQNYTVARDTFLAHLGATLWGSMRHIISPSLSDG FHYFEKISFQLFFITQEK R IK+LPVDLKA+MDGLS
Subjt:  YDESHKRKAIDALKRMENWNLFSDTYEEFQNYTVARDTFLAHLGATLWGSMRHIISPSLSDGSFHYFEKISFQLFFITQEKVRQIKKLPVDLKALMDGLS

Query:  SLLLPSQKALFSQTI
        SLLLPSQKALFSQT+
Subjt:  SLLLPSQKALFSQTI

TrEMBL top hitse value%identityAlignment
A0A1S3B823 uncharacterized protein LOC1034871974.9e-9686.05Show/hide
Query:  TLFLAAEISEDEAEDRLQDAIQEKFAVFGDNAGGLWWEGHEEGEEKDHQAIDILLAEIDIYELFAFKHCKGRKVKLALCEELDERMRDLKNELQSFDGEE
        T+    EISEDEAEDRLQ+AIQEKFAVFGD               KDHQAIDILLAEIDIYELFAFKHCKGRKVKLALCEELDERMRDLKNELQSFDGEE
Subjt:  TLFLAAEISEDEAEDRLQDAIQEKFAVFGDNAGGLWWEGHEEGEEKDHQAIDILLAEIDIYELFAFKHCKGRKVKLALCEELDERMRDLKNELQSFDGEE

Query:  YDESHKRKAIDALKRMENWNLFSDTYEEFQNYTVARDTFLAHLGATLWGSMRHIISPSLSDGSFHYFEKISFQLFFITQEKVRQIKKLPVDLKALMDGLS
        YDE HKRKAIDALKRMENWNLFSDTYEEFQNYTVARDTFLAHLGATLWGSMRHIISPSLSDG+FHYFEKISFQLFFITQEK R IK+LP+DLKA+MDGLS
Subjt:  YDESHKRKAIDALKRMENWNLFSDTYEEFQNYTVARDTFLAHLGATLWGSMRHIISPSLSDGSFHYFEKISFQLFFITQEKVRQIKKLPVDLKALMDGLS

Query:  SLLLPSQKALFSQTI
        SLLLPSQK LFSQT+
Subjt:  SLLLPSQKALFSQTI

A0A5D3CU98 Uncharacterized protein6.4e-9686.32Show/hide
Query:  LAAEISEDEAEDRLQDAIQEKFAVFGDNAGGLWWEGHEEGEEKDHQAIDILLAEIDIYELFAFKHCKGRKVKLALCEELDERMRDLKNELQSFDGEEYDE
        ++ +ISEDEAEDRLQ+AIQEKFAVFGD               KDHQAIDILLAEIDIYELFAFKHCKGRKVKLALCEELDERMRDLKNELQSFDGEEYDE
Subjt:  LAAEISEDEAEDRLQDAIQEKFAVFGDNAGGLWWEGHEEGEEKDHQAIDILLAEIDIYELFAFKHCKGRKVKLALCEELDERMRDLKNELQSFDGEEYDE

Query:  SHKRKAIDALKRMENWNLFSDTYEEFQNYTVARDTFLAHLGATLWGSMRHIISPSLSDGSFHYFEKISFQLFFITQEKVRQIKKLPVDLKALMDGLSSLL
         HKRKAIDALKRMENWNLFSDTYEEFQNYTVARDTFLAHLGATLWGSMRHIISPSLSDG+FHYFEKISFQLFFITQEK R IK+LP+DLKA+MDGLSSLL
Subjt:  SHKRKAIDALKRMENWNLFSDTYEEFQNYTVARDTFLAHLGATLWGSMRHIISPSLSDGSFHYFEKISFQLFFITQEKVRQIKKLPVDLKALMDGLSSLL

Query:  LPSQKALFSQTI
        LPSQK LFSQT+
Subjt:  LPSQKALFSQTI

A0A6J1DS42 uncharacterized protein LOC1110238802.9e-9686.51Show/hide
Query:  TLFLAAEISEDEAEDRLQDAIQEKFAVFGDNAGGLWWEGHEEGEEKDHQAIDILLAEIDIYELFAFKHCKGRKVKLALCEELDERMRDLKNELQSFDGEE
        T+    EISEDEAEDRLQDAIQEKF+VFGD               KDHQAIDILLAEIDIYELFAFK+CKGRKVKLALCEELDERMRDLKNELQSF+GEE
Subjt:  TLFLAAEISEDEAEDRLQDAIQEKFAVFGDNAGGLWWEGHEEGEEKDHQAIDILLAEIDIYELFAFKHCKGRKVKLALCEELDERMRDLKNELQSFDGEE

Query:  YDESHKRKAIDALKRMENWNLFSDTYEEFQNYTVARDTFLAHLGATLWGSMRHIISPSLSDGSFHYFEKISFQLFFITQEKVRQIKKLPVDLKALMDGLS
        YDE+HKRKAIDALKRMENWNLFSDTYEEFQNY+VARDTFLAHLG+TLWGSMRHIISPSLSDGSFHYFEK+SFQLFFITQEKVRQIK LPVDLKALMDGLS
Subjt:  YDESHKRKAIDALKRMENWNLFSDTYEEFQNYTVARDTFLAHLGATLWGSMRHIISPSLSDGSFHYFEKISFQLFFITQEKVRQIKKLPVDLKALMDGLS

Query:  SLLLPSQKALFSQTI
        SLLLPSQKALFSQT+
Subjt:  SLLLPSQKALFSQTI

A0A6J1H937 uncharacterized protein LOC1114616181.8e-9888.84Show/hide
Query:  TLFLAAEISEDEAEDRLQDAIQEKFAVFGDNAGGLWWEGHEEGEEKDHQAIDILLAEIDIYELFAFKHCKGRKVKLALCEELDERMRDLKNELQSFDGEE
        T+    EISEDEAEDRLQDAIQEKFAVFGD               KDHQAIDILLAEIDIYELFAFKHCKGRKVKLALCEELDERMRDLKNELQSFDGEE
Subjt:  TLFLAAEISEDEAEDRLQDAIQEKFAVFGDNAGGLWWEGHEEGEEKDHQAIDILLAEIDIYELFAFKHCKGRKVKLALCEELDERMRDLKNELQSFDGEE

Query:  YDESHKRKAIDALKRMENWNLFSDTYEEFQNYTVARDTFLAHLGATLWGSMRHIISPSLSDGSFHYFEKISFQLFFITQEKVRQIKKLPVDLKALMDGLS
        YDE+HKRKAIDALKRMENWNLFSDTYEEFQNYTVARDTFLAHLGATLWGSMRHIISPSLSDGSFHYFEKISFQLFFITQEKVR IK+LPVDLKALMDGLS
Subjt:  YDESHKRKAIDALKRMENWNLFSDTYEEFQNYTVARDTFLAHLGATLWGSMRHIISPSLSDGSFHYFEKISFQLFFITQEKVRQIKKLPVDLKALMDGLS

Query:  SLLLPSQKALFSQTI
        SLLLPSQKALFSQT+
Subjt:  SLLLPSQKALFSQTI

A0A6J1JJ89 uncharacterized protein LOC1114849421.8e-9888.84Show/hide
Query:  TLFLAAEISEDEAEDRLQDAIQEKFAVFGDNAGGLWWEGHEEGEEKDHQAIDILLAEIDIYELFAFKHCKGRKVKLALCEELDERMRDLKNELQSFDGEE
        T+    EISEDEAEDRLQDAIQEKFAVFGD               KDHQAIDILLAEIDIYELFAFKHCKGRKVKLALCEELDERMRDLKNELQSFDGEE
Subjt:  TLFLAAEISEDEAEDRLQDAIQEKFAVFGDNAGGLWWEGHEEGEEKDHQAIDILLAEIDIYELFAFKHCKGRKVKLALCEELDERMRDLKNELQSFDGEE

Query:  YDESHKRKAIDALKRMENWNLFSDTYEEFQNYTVARDTFLAHLGATLWGSMRHIISPSLSDGSFHYFEKISFQLFFITQEKVRQIKKLPVDLKALMDGLS
        YDE+HKRKAIDALKRMENWNLFSDTYEEFQNYTVARDTFLAHLGATLWGSMRHIISPSLSDGSFHYFEKISFQLFFITQEKVR IK+LPVDLKALMDGLS
Subjt:  YDESHKRKAIDALKRMENWNLFSDTYEEFQNYTVARDTFLAHLGATLWGSMRHIISPSLSDGSFHYFEKISFQLFFITQEKVRQIKKLPVDLKALMDGLS

Query:  SLLLPSQKALFSQTI
        SLLLPSQKALFSQT+
Subjt:  SLLLPSQKALFSQTI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G58100.1 unknown protein3.0e-9380.75Show/hide
Query:  TLFLAAEISEDEAEDRLQDAIQEKFAVFGDNAGGLWWEGHEEGEEKDHQAIDILLAEIDIYELFAFKHCKGRKVKLALCEELDERMRDLKNELQSFDGEE
        T+   AEISEDEAED+LQ AIQ+KF+VFG+N               DHQA+DILLAEID+YELFAFKHCKGRKVKLALCEELDERMRDLK ELQSFDGEE
Subjt:  TLFLAAEISEDEAEDRLQDAIQEKFAVFGDNAGGLWWEGHEEGEEKDHQAIDILLAEIDIYELFAFKHCKGRKVKLALCEELDERMRDLKNELQSFDGEE

Query:  YDESHKRKAIDALKRMENWNLFSDTYEEFQNYTVARDTFLAHLGATLWGSMRHIISPSLSDGSFHYFEKISFQLFFITQEKVRQIKKLPVDLKALMDGLS
        YDE+HKRKA+DAL+RME+WNLFSD  EEFQNYTVARDTFLAHLGATLWGSMRHIISPS++DG+FH++EKISFQL FITQEKVRQIK+LPVDLKALMDGLS
Subjt:  YDESHKRKAIDALKRMENWNLFSDTYEEFQNYTVARDTFLAHLGATLWGSMRHIISPSLSDGSFHYFEKISFQLFFITQEKVRQIKKLPVDLKALMDGLS

Query:  SLLLPSQKALFSQ
        SLLLPSQK LFSQ
Subjt:  SLLLPSQKALFSQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGAGGGTGTTGATAGATAGGAAAACATTTACCCTTCGTAAGAGTGGATGTGGGAAGAAAGTGTCAGTTGAGGAACGTAGGGGTGAGAGGGCGAGGACTAGGAGTGT
GGAGTTGGATGTAGGGACTTTTGTGTGGGTTCAGCACTACTTAGTGACAATTTCTAAGGTTGGTCGACTAGTGGGATTTTGGAGAAGAAAAAGGCTAGAAGCAGCGATTA
TTTTCTTTCGGGTGTTGTCGAGTGTGAAAGGTAAATATGGCCTTTTATCGTTGGAACCCTTTAAAGGGGGATGGATTATTGTGTTGGACATTCCTCCTTTCCTGAGAACT
CGAAGTTTGATTCTCATGATAGCGGAGTTATGTGAAGGGGTGGTGGAAGACGAAGAAATCATCAGAGAAGAGGTGTCGTTGGAGGAAGTTAGCTTCAAGGATAAAAGGAA
TGAGAATGGTTTAGAAGTCGGAAAAAAGGACGAATGTGGTACTTCATCTGCCACAATTGGCTGGCAAGTCAAAGTTTGCAGAAGACGCAAGTTGCAGGCCCAAGAGCATG
GAATAGAGGGTTTGGGCTCAAAGATGGAGATGGAAAGAAGAAAAGGGCTTGGGCCATCAGAGCCATTGAAGGGTGAGGAGACGTATGCGTCCGATCCTTCAATTGTGAGC
TATTTAGATGATGAGTCCTTCTTGTCCACTCCATTCGCGAATCAGATGAAGGTAGGCAACTTGGAAACTTCTGATGTCCTGTCTCTTCTTTCCTTGATAGAAGAGGTTAC
CTTTAGACCGGATAGGAGGGATAGTCGTTTTTGGAAGCCTAACCCCTCTAAGGGTTTCTCTTGCCGCTCGTTCTTTCATTGTCTTCTGGACCCTCCTCCTCTTGAGAAGT
CAGTTTATTCTTTGGTATGGAAAGTGAAAATTCCAAAGAAAGTGCAGTTTTTTACTTGGCAGGATGGTAGAGGAAGACTTCGAATTTTGTGGCATTGTGATTTTGTGTGT
TCGGTGTGGAACACTTTCTTTGAGATATTTGGGCTTCAATTTGCTAGACACAGGGATTTCAGGGAGATGATTGATGAGTTCCTTCCCCATCCTCCTTTTCCGGACCAAGG
AAACTTGCAAGTTGGGATTTGTGCTATTATTTGGGGGTTGTGGGGTGAGAGAAATAACAAAATTTTTAGAGGGAAAGAGAGGAACGTAGAGGATGTTTGGGCCCTTATTA
CATATTCTGTTTCGTTGTGGGCGTCGGTGACGCGTTTGGGGAAGGAAGGAAGAGGTGCCTATCTTGTCAGTTGGGAGGTGTTGGGGAAACCGATCCCTCATGGGGTATGG
GGATTGGGAATCTTAGGGGAGATGATCAAGGAGTTCCTCCTCAATTCGATGTTCCTTGACAAAGGAAGATTTTTGTGGACATCTGTTTTTACTTTATTTCTGGCAGCCGA
AATATCAGAAGATGAAGCTGAAGATCGCCTGCAAGATGCTATTCAGGAGAAATTTGCTGTTTTTGGTGATAATGCTGGAGGATTATGGTGGGAAGGGCATGAGGAGGGTG
AAGAGAAAGACCATCAAGCCATTGATATTCTTTTAGCAGAGATTGACATATATGAGCTTTTTGCTTTCAAACATTGCAAGGGAAGGAAAGTTAAACTTGCTCTTTGTGAA
GAACTTGATGAAAGGATGCGAGACTTGAAAAATGAGCTTCAGTCGTTTGACGGTGAAGAATATGATGAAAGTCATAAGAGGAAGGCCATAGATGCATTAAAACGAATGGA
GAATTGGAATTTATTTAGTGATACGTATGAGGAGTTCCAAAACTATACTGTAGCACGTGATACTTTTCTGGCTCACCTAGGTGCTACTCTCTGGGGGTCAATGAGACATA
TTATATCACCTTCACTTTCTGATGGGTCATTCCATTATTTTGAGAAAATATCATTTCAATTGTTTTTCATCACACAGGAGAAAGTTAGACAAATTAAAAAATTGCCCGTG
GATCTTAAAGCTCTAATGGATGGGCTCTCGTCTTTGTTGTTACCTTCACAGAAAGCACTATTTAGTCAGACCATTAATGTGGAGGGTCTTCGTAATACTGTACTGACTGG
TTTCCCTGTTAAAAATGCCAAAGCCTCGCTGTGGATGCATATCTCGAGGGCTTTGATAGATGTTTGTCGCAATCCATTGTGCGATGAAGTGGATTGTGTCCTTGAGTATG
TCAGTAGAGGCAAGTTGGCTAGAGTTGTGAACGAGGGAAAATACATGTACGAGTGGGGTGTATGGCTGAGTGAAGCTCAATTACAAGTAAAGTTAAGTTCAGTTGAGGAT
GAAAAAACCTTGCCTAAGCTAAGTTGCTTTGGATTAATGCGGTCAAAGCTTATCTTTCGGAATTATGGTTGGAATAGAATCATAGAATTTTTTGGACCAAGTATGTTCCT
TGGATTGATCAATTTGAAGCTTCTCGCCTTAAGGCCTAGGTGTTCTTTGTCCAAATTGTTCATGAGATTCTCGCTTCAGGATATTTGTCTAAATTGGAATGCTTTTATAT
ATCCTTTATAG
mRNA sequenceShow/hide mRNA sequence
ATGAGGAGGGTGTTGATAGATAGGAAAACATTTACCCTTCGTAAGAGTGGATGTGGGAAGAAAGTGTCAGTTGAGGAACGTAGGGGTGAGAGGGCGAGGACTAGGAGTGT
GGAGTTGGATGTAGGGACTTTTGTGTGGGTTCAGCACTACTTAGTGACAATTTCTAAGGTTGGTCGACTAGTGGGATTTTGGAGAAGAAAAAGGCTAGAAGCAGCGATTA
TTTTCTTTCGGGTGTTGTCGAGTGTGAAAGGTAAATATGGCCTTTTATCGTTGGAACCCTTTAAAGGGGGATGGATTATTGTGTTGGACATTCCTCCTTTCCTGAGAACT
CGAAGTTTGATTCTCATGATAGCGGAGTTATGTGAAGGGGTGGTGGAAGACGAAGAAATCATCAGAGAAGAGGTGTCGTTGGAGGAAGTTAGCTTCAAGGATAAAAGGAA
TGAGAATGGTTTAGAAGTCGGAAAAAAGGACGAATGTGGTACTTCATCTGCCACAATTGGCTGGCAAGTCAAAGTTTGCAGAAGACGCAAGTTGCAGGCCCAAGAGCATG
GAATAGAGGGTTTGGGCTCAAAGATGGAGATGGAAAGAAGAAAAGGGCTTGGGCCATCAGAGCCATTGAAGGGTGAGGAGACGTATGCGTCCGATCCTTCAATTGTGAGC
TATTTAGATGATGAGTCCTTCTTGTCCACTCCATTCGCGAATCAGATGAAGGTAGGCAACTTGGAAACTTCTGATGTCCTGTCTCTTCTTTCCTTGATAGAAGAGGTTAC
CTTTAGACCGGATAGGAGGGATAGTCGTTTTTGGAAGCCTAACCCCTCTAAGGGTTTCTCTTGCCGCTCGTTCTTTCATTGTCTTCTGGACCCTCCTCCTCTTGAGAAGT
CAGTTTATTCTTTGGTATGGAAAGTGAAAATTCCAAAGAAAGTGCAGTTTTTTACTTGGCAGGATGGTAGAGGAAGACTTCGAATTTTGTGGCATTGTGATTTTGTGTGT
TCGGTGTGGAACACTTTCTTTGAGATATTTGGGCTTCAATTTGCTAGACACAGGGATTTCAGGGAGATGATTGATGAGTTCCTTCCCCATCCTCCTTTTCCGGACCAAGG
AAACTTGCAAGTTGGGATTTGTGCTATTATTTGGGGGTTGTGGGGTGAGAGAAATAACAAAATTTTTAGAGGGAAAGAGAGGAACGTAGAGGATGTTTGGGCCCTTATTA
CATATTCTGTTTCGTTGTGGGCGTCGGTGACGCGTTTGGGGAAGGAAGGAAGAGGTGCCTATCTTGTCAGTTGGGAGGTGTTGGGGAAACCGATCCCTCATGGGGTATGG
GGATTGGGAATCTTAGGGGAGATGATCAAGGAGTTCCTCCTCAATTCGATGTTCCTTGACAAAGGAAGATTTTTGTGGACATCTGTTTTTACTTTATTTCTGGCAGCCGA
AATATCAGAAGATGAAGCTGAAGATCGCCTGCAAGATGCTATTCAGGAGAAATTTGCTGTTTTTGGTGATAATGCTGGAGGATTATGGTGGGAAGGGCATGAGGAGGGTG
AAGAGAAAGACCATCAAGCCATTGATATTCTTTTAGCAGAGATTGACATATATGAGCTTTTTGCTTTCAAACATTGCAAGGGAAGGAAAGTTAAACTTGCTCTTTGTGAA
GAACTTGATGAAAGGATGCGAGACTTGAAAAATGAGCTTCAGTCGTTTGACGGTGAAGAATATGATGAAAGTCATAAGAGGAAGGCCATAGATGCATTAAAACGAATGGA
GAATTGGAATTTATTTAGTGATACGTATGAGGAGTTCCAAAACTATACTGTAGCACGTGATACTTTTCTGGCTCACCTAGGTGCTACTCTCTGGGGGTCAATGAGACATA
TTATATCACCTTCACTTTCTGATGGGTCATTCCATTATTTTGAGAAAATATCATTTCAATTGTTTTTCATCACACAGGAGAAAGTTAGACAAATTAAAAAATTGCCCGTG
GATCTTAAAGCTCTAATGGATGGGCTCTCGTCTTTGTTGTTACCTTCACAGAAAGCACTATTTAGTCAGACCATTAATGTGGAGGGTCTTCGTAATACTGTACTGACTGG
TTTCCCTGTTAAAAATGCCAAAGCCTCGCTGTGGATGCATATCTCGAGGGCTTTGATAGATGTTTGTCGCAATCCATTGTGCGATGAAGTGGATTGTGTCCTTGAGTATG
TCAGTAGAGGCAAGTTGGCTAGAGTTGTGAACGAGGGAAAATACATGTACGAGTGGGGTGTATGGCTGAGTGAAGCTCAATTACAAGTAAAGTTAAGTTCAGTTGAGGAT
GAAAAAACCTTGCCTAAGCTAAGTTGCTTTGGATTAATGCGGTCAAAGCTTATCTTTCGGAATTATGGTTGGAATAGAATCATAGAATTTTTTGGACCAAGTATGTTCCT
TGGATTGATCAATTTGAAGCTTCTCGCCTTAAGGCCTAGGTGTTCTTTGTCCAAATTGTTCATGAGATTCTCGCTTCAGGATATTTGTCTAAATTGGAATGCTTTTATAT
ATCCTTTATAG
Protein sequenceShow/hide protein sequence
MRRVLIDRKTFTLRKSGCGKKVSVEERRGERARTRSVELDVGTFVWVQHYLVTISKVGRLVGFWRRKRLEAAIIFFRVLSSVKGKYGLLSLEPFKGGWIIVLDIPPFLRT
RSLILMIAELCEGVVEDEEIIREEVSLEEVSFKDKRNENGLEVGKKDECGTSSATIGWQVKVCRRRKLQAQEHGIEGLGSKMEMERRKGLGPSEPLKGEETYASDPSIVS
YLDDESFLSTPFANQMKVGNLETSDVLSLLSLIEEVTFRPDRRDSRFWKPNPSKGFSCRSFFHCLLDPPPLEKSVYSLVWKVKIPKKVQFFTWQDGRGRLRILWHCDFVC
SVWNTFFEIFGLQFARHRDFREMIDEFLPHPPFPDQGNLQVGICAIIWGLWGERNNKIFRGKERNVEDVWALITYSVSLWASVTRLGKEGRGAYLVSWEVLGKPIPHGVW
GLGILGEMIKEFLLNSMFLDKGRFLWTSVFTLFLAAEISEDEAEDRLQDAIQEKFAVFGDNAGGLWWEGHEEGEEKDHQAIDILLAEIDIYELFAFKHCKGRKVKLALCE
ELDERMRDLKNELQSFDGEEYDESHKRKAIDALKRMENWNLFSDTYEEFQNYTVARDTFLAHLGATLWGSMRHIISPSLSDGSFHYFEKISFQLFFITQEKVRQIKKLPV
DLKALMDGLSSLLLPSQKALFSQTINVEGLRNTVLTGFPVKNAKASLWMHISRALIDVCRNPLCDEVDCVLEYVSRGKLARVVNEGKYMYEWGVWLSEAQLQVKLSSVED
EKTLPKLSCFGLMRSKLIFRNYGWNRIIEFFGPSMFLGLINLKLLALRPRCSLSKLFMRFSLQDICLNWNAFIYPL