; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg036635 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg036635
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionCHANNEL_COLICIN domain-containing protein
Genome locationscaffold5:44249243..44251802
RNA-Seq ExpressionSpg036635
SyntenySpg036635
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6573128.1 hypothetical protein SDJN03_27015, partial [Cucurbita argyrosperma subsp. sororia]9.6e-13090.91Show/hide
Query:  MSNPAEPTPEGEEPKPSDLNPSDQSDHSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNHASLPGNLKSMPRSDLCQRLISIQNLMRLSTQGKEEIQGD
        MSNPAE   EG EPKPSDLNPSDQSDHSQEWE+MARAWL SFPEAKAGSMEEVEAWIDSNHASLPGNLKSMPRSDLCQRLISIQNLMRLS QGKEEIQGD
Subjt:  MSNPAEPTPEGEEPKPSDLNPSDQSDHSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNHASLPGNLKSMPRSDLCQRLISIQNLMRLSTQGKEEIQGD

Query:  Q------GDQGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKVSQLPEKSPQ
        Q       DQGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENP+IRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKK SQLPEKSPQ
Subjt:  Q------GDQGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKVSQLPEKSPQ

Query:  KSPLKVHKDVVMKPALPPRDSFCDLPKDSDIYLAKRKEAFRKYEILVELEKLLAPNFSKSQGVK
        KSPLKVHKDVVMKPALPPRDSF DLPKDSD+YLAKR+EAFRKYEILVELEKLL+ N  KS+GVK
Subjt:  KSPLKVHKDVVMKPALPPRDSFCDLPKDSDIYLAKRKEAFRKYEILVELEKLLAPNFSKSQGVK

XP_022137036.1 uncharacterized protein LOC111008597 isoform X1 [Momordica charantia]4.9e-12690.7Show/hide
Query:  MSNPAEPTPEGEEPKPSDLNPSDQSDHSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNHASLPGNLKSMPRSDLCQRLISIQNLMRLSTQGKEEIQGD
        MSNPAE T +GE+PKPSDLNPSDQSDHSQEWE MARAWLCSFPEA+AGSMEEVEAWIDSNHASLPGNLKSMPRSDLCQRLISIQNLMRLS QGKEEI   
Subjt:  MSNPAEPTPEGEEPKPSDLNPSDQSDHSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNHASLPGNLKSMPRSDLCQRLISIQNLMRLSTQGKEEIQGD

Query:  QGDQGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKVSQLPEKSPQKSPLKV
        QGDQGDQGDLPHARFQRTDQWIPVYSWLESLQ +EVVKSK+ISDWLTENP+IRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKK SQ+P+    KSPLKV
Subjt:  QGDQGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKVSQLPEKSPQKSPLKV

Query:  HKDVVMKPALPPRDSFCDLPKDSDIYLAKRKEAFRKYEILVELEKLLAPNFSKSQGVK
        HKDV+MKPALPPRDSF DLPKDSD+YLAKRKEAFRKYEILVELEKLLAPNFSKSQGVK
Subjt:  HKDVVMKPALPPRDSFCDLPKDSDIYLAKRKEAFRKYEILVELEKLLAPNFSKSQGVK

XP_022955309.1 uncharacterized protein LOC111457310 [Cucurbita moschata]2.8e-12990.53Show/hide
Query:  MSNPAEPTPEGEEPKPSDLNPSDQSDHSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNHASLPGNLKSMPRSDLCQRLISIQNLMRLSTQGKEEIQGD
        MSNPAE   EG EPKPSDLNPSDQSDHSQEWE+MARAWL SFPEAKAGSMEEVEAWIDSNHASLPGNLKSMPRSDLCQRLISIQNLMRLS QGKEEIQGD
Subjt:  MSNPAEPTPEGEEPKPSDLNPSDQSDHSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNHASLPGNLKSMPRSDLCQRLISIQNLMRLSTQGKEEIQGD

Query:  Q------GDQGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKVSQLPEKSPQ
        Q       DQG+QGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENP+IRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKK SQLPEKSPQ
Subjt:  Q------GDQGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKVSQLPEKSPQ

Query:  KSPLKVHKDVVMKPALPPRDSFCDLPKDSDIYLAKRKEAFRKYEILVELEKLLAPNFSKSQGVK
        KSPLKVHKDVVMKPALPPRDSF DLPKDSD+YLAKR+EAFRKYEILVELEKLL+ N  KS+GVK
Subjt:  KSPLKVHKDVVMKPALPPRDSFCDLPKDSDIYLAKRKEAFRKYEILVELEKLLAPNFSKSQGVK

XP_022994449.1 uncharacterized protein LOC111490167 [Cucurbita maxima]1.6e-12990.91Show/hide
Query:  MSNPAEPTPEGEEPKPSDLNPSDQSDHSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNHASLPGNLKSMPRSDLCQRLISIQNLMRLSTQGKEEIQGD
        MSNPAE   EG EPKPSDLNPSDQSDHSQEWE+MARAWL SFPEAKAGSMEEVEAWIDSNHASLPGNLKSMPRSDLCQRLISIQNLMRLS QGKEEIQGD
Subjt:  MSNPAEPTPEGEEPKPSDLNPSDQSDHSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNHASLPGNLKSMPRSDLCQRLISIQNLMRLSTQGKEEIQGD

Query:  Q------GDQGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKVSQLPEKSPQ
        Q       DQGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENP+IRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKK SQLPEKSP 
Subjt:  Q------GDQGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKVSQLPEKSPQ

Query:  KSPLKVHKDVVMKPALPPRDSFCDLPKDSDIYLAKRKEAFRKYEILVELEKLLAPNFSKSQGVK
        KSPLKVHKDVVMKPALPPRDSF DLPKDSD+YLAKRKEAFRKYEILVELEKLL+ N  KS+GVK
Subjt:  KSPLKVHKDVVMKPALPPRDSFCDLPKDSDIYLAKRKEAFRKYEILVELEKLLAPNFSKSQGVK

XP_023542742.1 uncharacterized protein LOC111802563 [Cucurbita pepo subsp. pepo]4.3e-13091.29Show/hide
Query:  MSNPAEPTPEGEEPKPSDLNPSDQSDHSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNHASLPGNLKSMPRSDLCQRLISIQNLMRLSTQGKEEIQGD
        MSNPAE   EG EPKPSDLNPSDQSDHSQEWE+MARAWL SFPEAKAGSMEEVEAWIDSNHASLPGNLKSMPRSDLCQRLISIQNLMRLS QGKEEIQGD
Subjt:  MSNPAEPTPEGEEPKPSDLNPSDQSDHSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNHASLPGNLKSMPRSDLCQRLISIQNLMRLSTQGKEEIQGD

Query:  Q------GDQGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKVSQLPEKSPQ
        Q       DQGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENP+IRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKK SQLPEKSPQ
Subjt:  Q------GDQGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKVSQLPEKSPQ

Query:  KSPLKVHKDVVMKPALPPRDSFCDLPKDSDIYLAKRKEAFRKYEILVELEKLLAPNFSKSQGVK
        KSPLKVHKDVVMKPALPPRDSF DLPKDSD+YLAKRKEAFRKYEILVELEKLL+ N  KS+GVK
Subjt:  KSPLKVHKDVVMKPALPPRDSFCDLPKDSDIYLAKRKEAFRKYEILVELEKLLAPNFSKSQGVK

TrEMBL top hitse value%identityAlignment
A0A1S4E072 uncharacterized protein LOC1079913135.0e-12489.06Show/hide
Query:  MSNPAEPTPEGEE------PKPSDLNPSDQSDHSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNHASLPGNLKSMPRSDLCQRLISIQNLMRLSTQGK
        MSNP E T EGEE      PKPSD+NPSD  D SQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNHASLPGNLKSMPRSDLCQRLISIQNLMRLSTQGK
Subjt:  MSNPAEPTPEGEE------PKPSDLNPSDQSDHSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNHASLPGNLKSMPRSDLCQRLISIQNLMRLSTQGK

Query:  EEI--QGDQGDQGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKVSQLPEKS
        E+I  Q D+ DQGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKK SQLP+  
Subjt:  EEI--QGDQGDQGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKVSQLPEKS

Query:  PQKSPLKVHKDVVMKPALPPRDSFCDLPKDSDIYLAKRKEAFRKYEILVELEKLLAPNFSKSQGV
          KSPLKVHKDV MKPALPPRDSF DLPKDSDIYLAKRKEAFRKYEILVELEKLLA  FSKSQGV
Subjt:  PQKSPLKVHKDVVMKPALPPRDSFCDLPKDSDIYLAKRKEAFRKYEILVELEKLLAPNFSKSQGV

A0A5D3BJS0 Uncharacterized protein5.0e-12489.06Show/hide
Query:  MSNPAEPTPEGEE------PKPSDLNPSDQSDHSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNHASLPGNLKSMPRSDLCQRLISIQNLMRLSTQGK
        MSNP E T EGEE      PKPSD+NPSD  D SQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNHASLPGNLKSMPRSDLCQRLISIQNLMRLSTQGK
Subjt:  MSNPAEPTPEGEE------PKPSDLNPSDQSDHSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNHASLPGNLKSMPRSDLCQRLISIQNLMRLSTQGK

Query:  EEI--QGDQGDQGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKVSQLPEKS
        E+I  Q D+ DQGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKK SQLP+  
Subjt:  EEI--QGDQGDQGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKVSQLPEKS

Query:  PQKSPLKVHKDVVMKPALPPRDSFCDLPKDSDIYLAKRKEAFRKYEILVELEKLLAPNFSKSQGV
          KSPLKVHKDV MKPALPPRDSF DLPKDSDIYLAKRKEAFRKYEILVELEKLLA  FSKSQGV
Subjt:  PQKSPLKVHKDVVMKPALPPRDSFCDLPKDSDIYLAKRKEAFRKYEILVELEKLLAPNFSKSQGV

A0A6J1C586 uncharacterized protein LOC111008597 isoform X12.4e-12690.7Show/hide
Query:  MSNPAEPTPEGEEPKPSDLNPSDQSDHSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNHASLPGNLKSMPRSDLCQRLISIQNLMRLSTQGKEEIQGD
        MSNPAE T +GE+PKPSDLNPSDQSDHSQEWE MARAWLCSFPEA+AGSMEEVEAWIDSNHASLPGNLKSMPRSDLCQRLISIQNLMRLS QGKEEI   
Subjt:  MSNPAEPTPEGEEPKPSDLNPSDQSDHSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNHASLPGNLKSMPRSDLCQRLISIQNLMRLSTQGKEEIQGD

Query:  QGDQGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKVSQLPEKSPQKSPLKV
        QGDQGDQGDLPHARFQRTDQWIPVYSWLESLQ +EVVKSK+ISDWLTENP+IRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKK SQ+P+    KSPLKV
Subjt:  QGDQGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKVSQLPEKSPQKSPLKV

Query:  HKDVVMKPALPPRDSFCDLPKDSDIYLAKRKEAFRKYEILVELEKLLAPNFSKSQGVK
        HKDV+MKPALPPRDSF DLPKDSD+YLAKRKEAFRKYEILVELEKLLAPNFSKSQGVK
Subjt:  HKDVVMKPALPPRDSFCDLPKDSDIYLAKRKEAFRKYEILVELEKLLAPNFSKSQGVK

A0A6J1GVV9 uncharacterized protein LOC1114573101.3e-12990.53Show/hide
Query:  MSNPAEPTPEGEEPKPSDLNPSDQSDHSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNHASLPGNLKSMPRSDLCQRLISIQNLMRLSTQGKEEIQGD
        MSNPAE   EG EPKPSDLNPSDQSDHSQEWE+MARAWL SFPEAKAGSMEEVEAWIDSNHASLPGNLKSMPRSDLCQRLISIQNLMRLS QGKEEIQGD
Subjt:  MSNPAEPTPEGEEPKPSDLNPSDQSDHSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNHASLPGNLKSMPRSDLCQRLISIQNLMRLSTQGKEEIQGD

Query:  Q------GDQGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKVSQLPEKSPQ
        Q       DQG+QGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENP+IRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKK SQLPEKSPQ
Subjt:  Q------GDQGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKVSQLPEKSPQ

Query:  KSPLKVHKDVVMKPALPPRDSFCDLPKDSDIYLAKRKEAFRKYEILVELEKLLAPNFSKSQGVK
        KSPLKVHKDVVMKPALPPRDSF DLPKDSD+YLAKR+EAFRKYEILVELEKLL+ N  KS+GVK
Subjt:  KSPLKVHKDVVMKPALPPRDSFCDLPKDSDIYLAKRKEAFRKYEILVELEKLLAPNFSKSQGVK

A0A6J1K2W0 uncharacterized protein LOC1114901677.9e-13090.91Show/hide
Query:  MSNPAEPTPEGEEPKPSDLNPSDQSDHSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNHASLPGNLKSMPRSDLCQRLISIQNLMRLSTQGKEEIQGD
        MSNPAE   EG EPKPSDLNPSDQSDHSQEWE+MARAWL SFPEAKAGSMEEVEAWIDSNHASLPGNLKSMPRSDLCQRLISIQNLMRLS QGKEEIQGD
Subjt:  MSNPAEPTPEGEEPKPSDLNPSDQSDHSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNHASLPGNLKSMPRSDLCQRLISIQNLMRLSTQGKEEIQGD

Query:  Q------GDQGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKVSQLPEKSPQ
        Q       DQGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENP+IRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKK SQLPEKSP 
Subjt:  Q------GDQGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKVSQLPEKSPQ

Query:  KSPLKVHKDVVMKPALPPRDSFCDLPKDSDIYLAKRKEAFRKYEILVELEKLLAPNFSKSQGVK
        KSPLKVHKDVVMKPALPPRDSF DLPKDSD+YLAKRKEAFRKYEILVELEKLL+ N  KS+GVK
Subjt:  KSPLKVHKDVVMKPALPPRDSFCDLPKDSDIYLAKRKEAFRKYEILVELEKLLAPNFSKSQGVK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G50410.1 unknown protein2.6e-6151.33Show/hide
Query:  MSNPAEPTPEGEEPKPSDL-------NPSDQSDHSQEWEIMARAWLCSFPEAKA-GSMEEVEAWIDSNHASLPGNLKSMPRSDLCQRLISIQNLMRLSTQ
        MSN ++ T E E+ +   L       N + +++ SQEWE MARAW+ +FP+AKA  S  EVE WI +N  SLP +L+ MPRS++  RL+SIQ+ MR +  
Subjt:  MSNPAEPTPEGEEPKPSDL-------NPSDQSDHSQEWEIMARAWLCSFPEAKA-GSMEEVEAWIDSNHASLPGNLKSMPRSDLCQRLISIQNLMRLSTQ

Query:  GKEEIQGDQGDQGDQGDLPH-ARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKVSQLPEK
               DQ +Q  + D  H ARFQRTDQW+PVYSWLESL   E+VKSK+IS+WL  NP ++ +L SRHSRYHL HY+KKCHLKILKRKEKK + +L   
Subjt:  GKEEIQGDQGDQGDQGDLPH-ARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKVSQLPEK

Query:  SPQKSPLKVHKDVVMK-PALPPRDSFCDLPKDSDIYLAKRKEAFRKYEILVELEKLLAPNFSK
          + + ++VHK+   K  A    D   ++PKDSD+Y  K+KEA R++EILVELEK LAP+F+K
Subjt:  SPQKSPLKVHKDVVMK-PALPPRDSFCDLPKDSDIYLAKRKEAFRKYEILVELEKLLAPNFSK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGAATCCGGCGGAACCAACCCCAGAGGGAGAAGAGCCTAAGCCATCTGATCTAAACCCTAGCGACCAAAGCGATCACTCTCAAGAATGGGAAATCATGGCGCGGGC
TTGGCTCTGCTCCTTCCCCGAGGCCAAAGCTGGGTCCATGGAAGAGGTTGAAGCTTGGATTGACTCCAACCATGCCTCTTTACCTGGAAACCTCAAATCAATGCCCCGCT
CCGACCTTTGCCAGAGGCTGATTTCTATCCAAAATTTAATGAGACTGTCCACTCAGGGAAAGGAAGAGATTCAGGGTGATCAAGGCGATCAGGGTGATCAGGGCGACCTT
CCACATGCTCGATTTCAACGCACTGACCAGTGGATACCAGTTTATTCTTGGTTAGAGTCTCTACAGCAAGATGAGGTTGTCAAGTCAAAGGAAATATCTGATTGGTTAAC
TGAAAATCCCACCATCAGAGATCAGTTGTGTTCAAGACATTCTCGTTATCATTTAATGCACTACATCAAAAAGTGTCATTTGAAGATATTGAAAAGAAAAGAAAAGAAAA
AGGTTTCTCAGCTGCCTGAAAAGTCTCCTCAAAAGTCTCCTCTTAAAGTTCACAAGGATGTTGTGATGAAACCAGCGTTGCCTCCACGTGATTCATTTTGCGATCTACCA
AAAGACAGTGACATATACTTGGCAAAACGAAAGGAAGCCTTTCGAAAATATGAAATTTTAGTGGAGTTGGAGAAGTTGCTTGCCCCCAACTTTTCAAAGTCTCAAGGAGT
CAAATAA
mRNA sequenceShow/hide mRNA sequence
ATGTCGAATCCGGCGGAACCAACCCCAGAGGGAGAAGAGCCTAAGCCATCTGATCTAAACCCTAGCGACCAAAGCGATCACTCTCAAGAATGGGAAATCATGGCGCGGGC
TTGGCTCTGCTCCTTCCCCGAGGCCAAAGCTGGGTCCATGGAAGAGGTTGAAGCTTGGATTGACTCCAACCATGCCTCTTTACCTGGAAACCTCAAATCAATGCCCCGCT
CCGACCTTTGCCAGAGGCTGATTTCTATCCAAAATTTAATGAGACTGTCCACTCAGGGAAAGGAAGAGATTCAGGGTGATCAAGGCGATCAGGGTGATCAGGGCGACCTT
CCACATGCTCGATTTCAACGCACTGACCAGTGGATACCAGTTTATTCTTGGTTAGAGTCTCTACAGCAAGATGAGGTTGTCAAGTCAAAGGAAATATCTGATTGGTTAAC
TGAAAATCCCACCATCAGAGATCAGTTGTGTTCAAGACATTCTCGTTATCATTTAATGCACTACATCAAAAAGTGTCATTTGAAGATATTGAAAAGAAAAGAAAAGAAAA
AGGTTTCTCAGCTGCCTGAAAAGTCTCCTCAAAAGTCTCCTCTTAAAGTTCACAAGGATGTTGTGATGAAACCAGCGTTGCCTCCACGTGATTCATTTTGCGATCTACCA
AAAGACAGTGACATATACTTGGCAAAACGAAAGGAAGCCTTTCGAAAATATGAAATTTTAGTGGAGTTGGAGAAGTTGCTTGCCCCCAACTTTTCAAAGTCTCAAGGAGT
CAAATAA
Protein sequenceShow/hide protein sequence
MSNPAEPTPEGEEPKPSDLNPSDQSDHSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNHASLPGNLKSMPRSDLCQRLISIQNLMRLSTQGKEEIQGDQGDQGDQGDL
PHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKVSQLPEKSPQKSPLKVHKDVVMKPALPPRDSFCDLP
KDSDIYLAKRKEAFRKYEILVELEKLLAPNFSKSQGVK