; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr029454 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr029454
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionCHANNEL_COLICIN domain-containing protein
Genome locationtig00153349:872301..874302
RNA-Seq ExpressionSgr029454
SyntenySgr029454
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6573128.1 hypothetical protein SDJN03_27015, partial [Cucurbita argyrosperma subsp. sororia]1.9e-12287.55Show/hide
Query:  MSNPAEHTQEGEEPKPSDLNPSDQSDHSQEWETMARAWLCSFPEAKAGSTEEVEAWIDANHASLPGNLKSMPRSDLCQRLISIQNLMRLSNQGKEEI---
        MSNPAE  QEG EPKPSDLNPSDQSDHSQEWE MARAWL SFPEAKAGS EEVEAWID+NHASLPGNLKSMPRSDLCQRLISIQNLMRLS QGKEEI   
Subjt:  MSNPAEHTQEGEEPKPSDLNPSDQSDHSQEWETMARAWLCSFPEAKAGSTEEVEAWIDANHASLPGNLKSMPRSDLCQRLISIQNLMRLSNQGKEEI---

Query:  ---------QGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKDISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGGSQLPD---
                 QGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSK+ISDWLTENP+IRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKK GSQLP+   
Subjt:  ---------QGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKDISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGGSQLPD---

Query:  -KSPLKVHKDVVMKPALPPRDSFSDLPKDSDVYLAKRREAFRKYEILVELEKLLAPNFSKSEGIK
         KSPLKVHKDVVMKPALPPRDSFSDLPKDSDVYLAKRREAFRKYEILVELEKLL+ N  KSEG+K
Subjt:  -KSPLKVHKDVVMKPALPPRDSFSDLPKDSDVYLAKRREAFRKYEILVELEKLLAPNFSKSEGIK

XP_022137036.1 uncharacterized protein LOC111008597 isoform X1 [Momordica charantia]5.1e-12892.86Show/hide
Query:  MSNPAEHTQEGEEPKPSDLNPSDQSDHSQEWETMARAWLCSFPEAKAGSTEEVEAWIDANHASLPGNLKSMPRSDLCQRLISIQNLMRLSNQGKEEI---
        MSNPAE TQ GE+PKPSDLNPSDQSDHSQEWETMARAWLCSFPEA+AGS EEVEAWID+NHASLPGNLKSMPRSDLCQRLISIQNLMRLSNQGKEEI   
Subjt:  MSNPAEHTQEGEEPKPSDLNPSDQSDHSQEWETMARAWLCSFPEAKAGSTEEVEAWIDANHASLPGNLKSMPRSDLCQRLISIQNLMRLSNQGKEEI---

Query:  QGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKDISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGGSQLPDKSPLKVHKDVVM
        QGDQGDLPHARFQRTDQWIPVYSWLESLQ +EVVKSKDISDWLTENP+IRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKK GSQ+PDKSPLKVHKDV+M
Subjt:  QGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKDISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGGSQLPDKSPLKVHKDVVM

Query:  KPALPPRDSFSDLPKDSDVYLAKRREAFRKYEILVELEKLLAPNFSKSEGIK
        KPALPPRDSFSDLPKDSDVYLAKR+EAFRKYEILVELEKLLAPNFSKS+G+K
Subjt:  KPALPPRDSFSDLPKDSDVYLAKRREAFRKYEILVELEKLLAPNFSKSEGIK

XP_022955309.1 uncharacterized protein LOC111457310 [Cucurbita moschata]1.9e-12287.55Show/hide
Query:  MSNPAEHTQEGEEPKPSDLNPSDQSDHSQEWETMARAWLCSFPEAKAGSTEEVEAWIDANHASLPGNLKSMPRSDLCQRLISIQNLMRLSNQGKEEIQGD
        MSNPAE  QEG EPKPSDLNPSDQSDHSQEWE MARAWL SFPEAKAGS EEVEAWID+NHASLPGNLKSMPRSDLCQRLISIQNLMRLS QGKEEIQGD
Subjt:  MSNPAEHTQEGEEPKPSDLNPSDQSDHSQEWETMARAWLCSFPEAKAGSTEEVEAWIDANHASLPGNLKSMPRSDLCQRLISIQNLMRLSNQGKEEIQGD

Query:  ------------QGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKDISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGGSQLPD---
                    QGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSK+ISDWLTENP+IRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKK GSQLP+   
Subjt:  ------------QGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKDISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGGSQLPD---

Query:  -KSPLKVHKDVVMKPALPPRDSFSDLPKDSDVYLAKRREAFRKYEILVELEKLLAPNFSKSEGIK
         KSPLKVHKDVVMKPALPPRDSFSDLPKDSDVYLAKRREAFRKYEILVELEKLL+ N  KSEG+K
Subjt:  -KSPLKVHKDVVMKPALPPRDSFSDLPKDSDVYLAKRREAFRKYEILVELEKLLAPNFSKSEGIK

XP_022994449.1 uncharacterized protein LOC111490167 [Cucurbita maxima]4.2e-12287.17Show/hide
Query:  MSNPAEHTQEGEEPKPSDLNPSDQSDHSQEWETMARAWLCSFPEAKAGSTEEVEAWIDANHASLPGNLKSMPRSDLCQRLISIQNLMRLSNQGKEEI---
        MSNPAE  QEG EPKPSDLNPSDQSDHSQEWE MARAWL SFPEAKAGS EEVEAWID+NHASLPGNLKSMPRSDLCQRLISIQNLMRLS QGKEEI   
Subjt:  MSNPAEHTQEGEEPKPSDLNPSDQSDHSQEWETMARAWLCSFPEAKAGSTEEVEAWIDANHASLPGNLKSMPRSDLCQRLISIQNLMRLSNQGKEEI---

Query:  ---------QGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKDISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGGSQLPD---
                 QGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSK+ISDWLTENP+IRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKK GSQLP+   
Subjt:  ---------QGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKDISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGGSQLPD---

Query:  -KSPLKVHKDVVMKPALPPRDSFSDLPKDSDVYLAKRREAFRKYEILVELEKLLAPNFSKSEGIK
         KSPLKVHKDVVMKPALPPRDSFSDLPKDSDVYLAKR+EAFRKYEILVELEKLL+ N  KSEG+K
Subjt:  -KSPLKVHKDVVMKPALPPRDSFSDLPKDSDVYLAKRREAFRKYEILVELEKLLAPNFSKSEGIK

XP_023542742.1 uncharacterized protein LOC111802563 [Cucurbita pepo subsp. pepo]4.2e-12287.17Show/hide
Query:  MSNPAEHTQEGEEPKPSDLNPSDQSDHSQEWETMARAWLCSFPEAKAGSTEEVEAWIDANHASLPGNLKSMPRSDLCQRLISIQNLMRLSNQGKEEI---
        MSNPAE  QEG EPKPSDLNPSDQSDHSQEWE MARAWL SFPEAKAGS EEVEAWID+NHASLPGNLKSMPRSDLCQRLISIQNLMRLS QGKEEI   
Subjt:  MSNPAEHTQEGEEPKPSDLNPSDQSDHSQEWETMARAWLCSFPEAKAGSTEEVEAWIDANHASLPGNLKSMPRSDLCQRLISIQNLMRLSNQGKEEI---

Query:  ---------QGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKDISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGGSQLPD---
                 QGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSK+ISDWLTENP+IRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKK GSQLP+   
Subjt:  ---------QGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKDISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGGSQLPD---

Query:  -KSPLKVHKDVVMKPALPPRDSFSDLPKDSDVYLAKRREAFRKYEILVELEKLLAPNFSKSEGIK
         KSPLKVHKDVVMKPALPPRDSFSDLPKDSDVYLAKR+EAFRKYEILVELEKLL+ N  KSEG+K
Subjt:  -KSPLKVHKDVVMKPALPPRDSFSDLPKDSDVYLAKRREAFRKYEILVELEKLLAPNFSKSEGIK

TrEMBL top hitse value%identityAlignment
A0A1S4E072 uncharacterized protein LOC1079913134.5e-12286.64Show/hide
Query:  MSNPAEHTQEGEE------PKPSDLNPSDQSDHSQEWETMARAWLCSFPEAKAGSTEEVEAWIDANHASLPGNLKSMPRSDLCQRLISIQNLMRLSNQGK
        MSNP E T+EGEE      PKPSD+NPSD  D SQEWE MARAWLCSFPEAKAGS EEVEAWID+NHASLPGNLKSMPRSDLCQRLISIQNLMRLS QGK
Subjt:  MSNPAEHTQEGEE------PKPSDLNPSDQSDHSQEWETMARAWLCSFPEAKAGSTEEVEAWIDANHASLPGNLKSMPRSDLCQRLISIQNLMRLSNQGK

Query:  EEI--------QGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKDISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGGSQLPDK
        E+I        QGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSK+ISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKK GSQLPDK
Subjt:  EEI--------QGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKDISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGGSQLPDK

Query:  SPLKVHKDVVMKPALPPRDSFSDLPKDSDVYLAKRREAFRKYEILVELEKLLAPNFSKSEGI
        SPLKVHKDV MKPALPPRDSFSDLPKDSD+YLAKR+EAFRKYEILVELEKLLA  FSKS+G+
Subjt:  SPLKVHKDVVMKPALPPRDSFSDLPKDSDVYLAKRREAFRKYEILVELEKLLAPNFSKSEGI

A0A5D3BJS0 Uncharacterized protein4.5e-12286.64Show/hide
Query:  MSNPAEHTQEGEE------PKPSDLNPSDQSDHSQEWETMARAWLCSFPEAKAGSTEEVEAWIDANHASLPGNLKSMPRSDLCQRLISIQNLMRLSNQGK
        MSNP E T+EGEE      PKPSD+NPSD  D SQEWE MARAWLCSFPEAKAGS EEVEAWID+NHASLPGNLKSMPRSDLCQRLISIQNLMRLS QGK
Subjt:  MSNPAEHTQEGEE------PKPSDLNPSDQSDHSQEWETMARAWLCSFPEAKAGSTEEVEAWIDANHASLPGNLKSMPRSDLCQRLISIQNLMRLSNQGK

Query:  EEI--------QGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKDISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGGSQLPDK
        E+I        QGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSK+ISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKK GSQLPDK
Subjt:  EEI--------QGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKDISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGGSQLPDK

Query:  SPLKVHKDVVMKPALPPRDSFSDLPKDSDVYLAKRREAFRKYEILVELEKLLAPNFSKSEGI
        SPLKVHKDV MKPALPPRDSFSDLPKDSD+YLAKR+EAFRKYEILVELEKLLA  FSKS+G+
Subjt:  SPLKVHKDVVMKPALPPRDSFSDLPKDSDVYLAKRREAFRKYEILVELEKLLAPNFSKSEGI

A0A6J1C586 uncharacterized protein LOC111008597 isoform X12.5e-12892.86Show/hide
Query:  MSNPAEHTQEGEEPKPSDLNPSDQSDHSQEWETMARAWLCSFPEAKAGSTEEVEAWIDANHASLPGNLKSMPRSDLCQRLISIQNLMRLSNQGKEEI---
        MSNPAE TQ GE+PKPSDLNPSDQSDHSQEWETMARAWLCSFPEA+AGS EEVEAWID+NHASLPGNLKSMPRSDLCQRLISIQNLMRLSNQGKEEI   
Subjt:  MSNPAEHTQEGEEPKPSDLNPSDQSDHSQEWETMARAWLCSFPEAKAGSTEEVEAWIDANHASLPGNLKSMPRSDLCQRLISIQNLMRLSNQGKEEI---

Query:  QGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKDISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGGSQLPDKSPLKVHKDVVM
        QGDQGDLPHARFQRTDQWIPVYSWLESLQ +EVVKSKDISDWLTENP+IRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKK GSQ+PDKSPLKVHKDV+M
Subjt:  QGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKDISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGGSQLPDKSPLKVHKDVVM

Query:  KPALPPRDSFSDLPKDSDVYLAKRREAFRKYEILVELEKLLAPNFSKSEGIK
        KPALPPRDSFSDLPKDSDVYLAKR+EAFRKYEILVELEKLLAPNFSKS+G+K
Subjt:  KPALPPRDSFSDLPKDSDVYLAKRREAFRKYEILVELEKLLAPNFSKSEGIK

A0A6J1GVV9 uncharacterized protein LOC1114573109.0e-12387.55Show/hide
Query:  MSNPAEHTQEGEEPKPSDLNPSDQSDHSQEWETMARAWLCSFPEAKAGSTEEVEAWIDANHASLPGNLKSMPRSDLCQRLISIQNLMRLSNQGKEEIQGD
        MSNPAE  QEG EPKPSDLNPSDQSDHSQEWE MARAWL SFPEAKAGS EEVEAWID+NHASLPGNLKSMPRSDLCQRLISIQNLMRLS QGKEEIQGD
Subjt:  MSNPAEHTQEGEEPKPSDLNPSDQSDHSQEWETMARAWLCSFPEAKAGSTEEVEAWIDANHASLPGNLKSMPRSDLCQRLISIQNLMRLSNQGKEEIQGD

Query:  ------------QGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKDISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGGSQLPD---
                    QGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSK+ISDWLTENP+IRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKK GSQLP+   
Subjt:  ------------QGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKDISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGGSQLPD---

Query:  -KSPLKVHKDVVMKPALPPRDSFSDLPKDSDVYLAKRREAFRKYEILVELEKLLAPNFSKSEGIK
         KSPLKVHKDVVMKPALPPRDSFSDLPKDSDVYLAKRREAFRKYEILVELEKLL+ N  KSEG+K
Subjt:  -KSPLKVHKDVVMKPALPPRDSFSDLPKDSDVYLAKRREAFRKYEILVELEKLLAPNFSKSEGIK

A0A6J1K2W0 uncharacterized protein LOC1114901672.0e-12287.17Show/hide
Query:  MSNPAEHTQEGEEPKPSDLNPSDQSDHSQEWETMARAWLCSFPEAKAGSTEEVEAWIDANHASLPGNLKSMPRSDLCQRLISIQNLMRLSNQGKEEI---
        MSNPAE  QEG EPKPSDLNPSDQSDHSQEWE MARAWL SFPEAKAGS EEVEAWID+NHASLPGNLKSMPRSDLCQRLISIQNLMRLS QGKEEI   
Subjt:  MSNPAEHTQEGEEPKPSDLNPSDQSDHSQEWETMARAWLCSFPEAKAGSTEEVEAWIDANHASLPGNLKSMPRSDLCQRLISIQNLMRLSNQGKEEI---

Query:  ---------QGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKDISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGGSQLPD---
                 QGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSK+ISDWLTENP+IRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKK GSQLP+   
Subjt:  ---------QGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKDISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGGSQLPD---

Query:  -KSPLKVHKDVVMKPALPPRDSFSDLPKDSDVYLAKRREAFRKYEILVELEKLLAPNFSKSEGIK
         KSPLKVHKDVVMKPALPPRDSFSDLPKDSDVYLAKR+EAFRKYEILVELEKLL+ N  KSEG+K
Subjt:  -KSPLKVHKDVVMKPALPPRDSFSDLPKDSDVYLAKRREAFRKYEILVELEKLLAPNFSKSEGIK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G50410.1 unknown protein1.4e-6453.54Show/hide
Query:  MSNPAEHTQEGEEPKPSDL-------NPSDQSDHSQEWETMARAWLCSFPEAKA-GSTEEVEAWIDANHASLPGNLKSMPRSDLCQRLISIQNLMRLSNQ
        MSN ++ T E E+ +   L       N + +++ SQEWETMARAW+ +FP+AKA  S  EVE WI  N  SLP +L+ MPRS++  RL+SIQ+ MR +  
Subjt:  MSNPAEHTQEGEEPKPSDL-------NPSDQSDHSQEWETMARAWLCSFPEAKA-GSTEEVEAWIDANHASLPGNLKSMPRSDLCQRLISIQNLMRLSNQ

Query:  GKEEIQGDQGDLPH-ARFQRTDQWIPVYSWLESLQQDEVVKSKDISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGGSQLPDKSPLKV
          +  Q  + D  H ARFQRTDQW+PVYSWLESL   E+VKSKDIS+WL  NP ++ +L SRHSRYHL HY+KKCHLKILKRKE KKG  +L   + ++V
Subjt:  GKEEIQGDQGDLPH-ARFQRTDQWIPVYSWLESLQQDEVVKSKDISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGGSQLPDKSPLKV

Query:  HKDVVMK-PALPPRDSFSDLPKDSDVYLAKRREAFRKYEILVELEKLLAPNFSK
        HK+   K  A    D  S++PKDSD+Y  K++EA R++EILVELEK LAP+F+K
Subjt:  HKDVVMK-PALPPRDSFSDLPKDSDVYLAKRREAFRKYEILVELEKLLAPNFSK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGAATCCAGCAGAACACACCCAAGAAGGAGAGGAGCCTAAGCCATCTGATCTAAACCCTAGCGATCAAAGCGATCACTCCCAAGAATGGGAAACCATGGCGCGGGC
GTGGCTTTGCTCCTTCCCGGAGGCCAAAGCTGGGTCCACGGAAGAGGTTGAAGCTTGGATTGACGCCAACCATGCCTCTTTACCTGGAAACCTCAAATCAATGCCCCGCT
CCGACCTTTGCCAGAGGCTGATTTCCATCCAAAATTTAATGAGGCTTTCCAATCAGGGAAAAGAAGAGATTCAGGGCGATCAAGGCGATCTTCCACATGCTCGATTTCAA
CGTACTGACCAATGGATACCAGTTTATTCTTGGTTAGAGTCTCTACAACAAGATGAGGTTGTCAAGTCAAAGGACATATCTGATTGGTTAACTGAAAATCCCACCATCAG
AGATCAGTTGTGTTCAAGACATTCTCGCTATCATTTAATGCACTACATCAAAAAGTGTCATTTGAAGATATTGAAGAGAAAGGAAAAGAAAAAGGGGGGTTCTCAGCTGC
CTGACAAATCTCCTCTAAAAGTTCACAAGGATGTTGTGATGAAACCAGCACTTCCTCCACGTGATTCTTTTAGCGATCTACCAAAAGACAGTGACGTATATTTGGCAAAA
CGAAGGGAAGCCTTTCGAAAATATGAAATTTTAGTGGAGTTGGAGAAGCTGCTTGCTCCCAACTTTTCTAAGTCCGAAGGGATCAAATAA
mRNA sequenceShow/hide mRNA sequence
ATGTCGAATCCAGCAGAACACACCCAAGAAGGAGAGGAGCCTAAGCCATCTGATCTAAACCCTAGCGATCAAAGCGATCACTCCCAAGAATGGGAAACCATGGCGCGGGC
GTGGCTTTGCTCCTTCCCGGAGGCCAAAGCTGGGTCCACGGAAGAGGTTGAAGCTTGGATTGACGCCAACCATGCCTCTTTACCTGGAAACCTCAAATCAATGCCCCGCT
CCGACCTTTGCCAGAGGCTGATTTCCATCCAAAATTTAATGAGGCTTTCCAATCAGGGAAAAGAAGAGATTCAGGGCGATCAAGGCGATCTTCCACATGCTCGATTTCAA
CGTACTGACCAATGGATACCAGTTTATTCTTGGTTAGAGTCTCTACAACAAGATGAGGTTGTCAAGTCAAAGGACATATCTGATTGGTTAACTGAAAATCCCACCATCAG
AGATCAGTTGTGTTCAAGACATTCTCGCTATCATTTAATGCACTACATCAAAAAGTGTCATTTGAAGATATTGAAGAGAAAGGAAAAGAAAAAGGGGGGTTCTCAGCTGC
CTGACAAATCTCCTCTAAAAGTTCACAAGGATGTTGTGATGAAACCAGCACTTCCTCCACGTGATTCTTTTAGCGATCTACCAAAAGACAGTGACGTATATTTGGCAAAA
CGAAGGGAAGCCTTTCGAAAATATGAAATTTTAGTGGAGTTGGAGAAGCTGCTTGCTCCCAACTTTTCTAAGTCCGAAGGGATCAAATAA
Protein sequenceShow/hide protein sequence
MSNPAEHTQEGEEPKPSDLNPSDQSDHSQEWETMARAWLCSFPEAKAGSTEEVEAWIDANHASLPGNLKSMPRSDLCQRLISIQNLMRLSNQGKEEIQGDQGDLPHARFQ
RTDQWIPVYSWLESLQQDEVVKSKDISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGGSQLPDKSPLKVHKDVVMKPALPPRDSFSDLPKDSDVYLAK
RREAFRKYEILVELEKLLAPNFSKSEGIK