CuGenDBv2

MS019931 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS019931
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: CHANNEL_COLICIN domain-containing protein
Genome location: scaffold22:616610..618386
RNA-Seq Expression: MS019931
Synteny: MS019931
Gene Ontology terms: NA
InterPro domains: NA
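
The genome location field above follows a scaffold:start..end pattern. As a minimal sketch, the snippet below splits such a string into its parts and computes the span length. The helper name parse_location and the Location type are hypothetical, and the span calculation assumes 1-based inclusive coordinates, which this page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed genome location (hypothetical helper type)."""
    scaffold: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumes 1-based, inclusive coordinates (not confirmed by the page).
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a string of the form 'scaffold:start..end'."""
    match = re.fullmatch(r"\s*([^:\s]+):(\d+)\.\.(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    scaffold, start, end = match.groups()
    return Location(scaffold, int(start), int(end))


loc = parse_location("scaffold22:616610..618386")
print(loc.scaffold, loc.start, loc.end, loc.span)
```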


Homology
GenBank top hits (e-value, % identity, alignment)
KAG6573128.1 hypothetical protein SDJN03_27015, partial [Cucurbita argyrosperma subsp. sororia] (e-value: 1.4e-122; identity: 87.12%)
Query:  MSNPAEETQ-GEQPKPSDLNPSDQSDHSQEWETMARAWLCSFPEARAGSMEEVEAWIDSNHASLPGNLKSMPRSDLCQRLISIQNLMRLSNQGKEEIQG-
        MSNPAE+ Q G +PKPSDLNPSDQSDHSQEWE MARAWL SFPEA+AGSMEEVEAWIDSNHASLPGNLKSMPRSDLCQRLISIQNLMRLS QGKEEIQG 
Subjt:  MSNPAEETQ-GEQPKPSDLNPSDQSDHSQEWETMARAWLCSFPEARAGSMEEVEAWIDSNHASLPGNLKSMPRSDLCQRLISIQNLMRLSNQGKEEIQG-

Query:  --------DQGDQGDLPHARFQRTDQWIPVYSWLESLQHEEVVKSKDISDWLTENPSIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQMPD----
                DQGDQGDLPHARFQRTDQWIPVYSWLESLQ +EVVKSK+ISDWLTENPSIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQ+P+    
Subjt:  --------DQGDQGDLPHARFQRTDQWIPVYSWLESLQHEEVVKSKDISDWLTENPSIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQMPD----

Query:  KSPLKVHKDVLMKPALPPRDSFSDLPKDSDVYLAKRKEAFRKYEILVELEKLLAPNFSKSQGVK
        KSPLKVHKDV+MKPALPPRDSFSDLPKDSDVYLAKR+EAFRKYEILVELEKLL+ N  KS+GVK
Subjt:  KSPLKVHKDVLMKPALPPRDSFSDLPKDSDVYLAKRKEAFRKYEILVELEKLLAPNFSKSQGVK

XP_016901626.1 PREDICTED: uncharacterized protein LOC107991313 [Cucumis melo] (e-value: 8.4e-123; identity: 86.59%)
Query:  MSNPAEET-------QGEQPKPSDLNPSDQSDHSQEWETMARAWLCSFPEARAGSMEEVEAWIDSNHASLPGNLKSMPRSDLCQRLISIQNLMRLSNQGK
        MSNP E T       +GE PKPSD+NPSD  D SQEWE MARAWLCSFPEA+AGSMEEVEAWIDSNHASLPGNLKSMPRSDLCQRLISIQNLMRLS QGK
Subjt:  MSNPAEET-------QGEQPKPSDLNPSDQSDHSQEWETMARAWLCSFPEARAGSMEEVEAWIDSNHASLPGNLKSMPRSDLCQRLISIQNLMRLSNQGK

Query:  EEI-----QGDQGDQGDLPHARFQRTDQWIPVYSWLESLQHEEVVKSKDISDWLTENPSIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQMPDKS
        E+I     + DQGDQGDLPHARFQRTDQWIPVYSWLESLQ +EVVKSK+ISDWLTENP+IRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQ+PDKS
Subjt:  EEI-----QGDQGDQGDLPHARFQRTDQWIPVYSWLESLQHEEVVKSKDISDWLTENPSIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQMPDKS

Query:  PLKVHKDVLMKPALPPRDSFSDLPKDSDVYLAKRKEAFRKYEILVELEKLLAPNFSKSQGV
        PLKVHKDV MKPALPPRDSFSDLPKDSD+YLAKRKEAFRKYEILVELEKLLA  FSKSQGV
Subjt:  PLKVHKDVLMKPALPPRDSFSDLPKDSDVYLAKRKEAFRKYEILVELEKLLAPNFSKSQGV

XP_022137036.1 uncharacterized protein LOC111008597 isoform X1 [Momordica charantia] (e-value: 2.9e-139; identity: 100%)
Query:  MSNPAEETQGEQPKPSDLNPSDQSDHSQEWETMARAWLCSFPEARAGSMEEVEAWIDSNHASLPGNLKSMPRSDLCQRLISIQNLMRLSNQGKEEIQGDQ
        MSNPAEETQGEQPKPSDLNPSDQSDHSQEWETMARAWLCSFPEARAGSMEEVEAWIDSNHASLPGNLKSMPRSDLCQRLISIQNLMRLSNQGKEEIQGDQ
Subjt:  MSNPAEETQGEQPKPSDLNPSDQSDHSQEWETMARAWLCSFPEARAGSMEEVEAWIDSNHASLPGNLKSMPRSDLCQRLISIQNLMRLSNQGKEEIQGDQ

Query:  GDQGDLPHARFQRTDQWIPVYSWLESLQHEEVVKSKDISDWLTENPSIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQMPDKSPLKVHKDVLMKP
        GDQGDLPHARFQRTDQWIPVYSWLESLQHEEVVKSKDISDWLTENPSIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQMPDKSPLKVHKDVLMKP
Subjt:  GDQGDLPHARFQRTDQWIPVYSWLESLQHEEVVKSKDISDWLTENPSIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQMPDKSPLKVHKDVLMKP

Query:  ALPPRDSFSDLPKDSDVYLAKRKEAFRKYEILVELEKLLAPNFSKSQGVK
        ALPPRDSFSDLPKDSDVYLAKRKEAFRKYEILVELEKLLAPNFSKSQGVK
Subjt:  ALPPRDSFSDLPKDSDVYLAKRKEAFRKYEILVELEKLLAPNFSKSQGVK

XP_022994449.1 uncharacterized protein LOC111490167 [Cucurbita maxima] (e-value: 6.4e-123; identity: 87.5%)
Query:  MSNPAEETQ-GEQPKPSDLNPSDQSDHSQEWETMARAWLCSFPEARAGSMEEVEAWIDSNHASLPGNLKSMPRSDLCQRLISIQNLMRLSNQGKEEIQG-
        MSNPAE+ Q G +PKPSDLNPSDQSDHSQEWE MARAWL SFPEA+AGSMEEVEAWIDSNHASLPGNLKSMPRSDLCQRLISIQNLMRLS QGKEEIQG 
Subjt:  MSNPAEETQ-GEQPKPSDLNPSDQSDHSQEWETMARAWLCSFPEARAGSMEEVEAWIDSNHASLPGNLKSMPRSDLCQRLISIQNLMRLSNQGKEEIQG-

Query:  --------DQGDQGDLPHARFQRTDQWIPVYSWLESLQHEEVVKSKDISDWLTENPSIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQMPD----
                DQGDQGDLPHARFQRTDQWIPVYSWLESLQ +EVVKSK+ISDWLTENPSIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQ+P+    
Subjt:  --------DQGDQGDLPHARFQRTDQWIPVYSWLESLQHEEVVKSKDISDWLTENPSIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQMPD----

Query:  KSPLKVHKDVLMKPALPPRDSFSDLPKDSDVYLAKRKEAFRKYEILVELEKLLAPNFSKSQGVK
        KSPLKVHKDV+MKPALPPRDSFSDLPKDSDVYLAKRKEAFRKYEILVELEKLL+ N  KS+GVK
Subjt:  KSPLKVHKDVLMKPALPPRDSFSDLPKDSDVYLAKRKEAFRKYEILVELEKLLAPNFSKSQGVK

XP_023542742.1 uncharacterized protein LOC111802563 [Cucurbita pepo subsp. pepo] (e-value: 6.4e-123; identity: 87.5%)
Query:  MSNPAEETQ-GEQPKPSDLNPSDQSDHSQEWETMARAWLCSFPEARAGSMEEVEAWIDSNHASLPGNLKSMPRSDLCQRLISIQNLMRLSNQGKEEIQG-
        MSNPAE+ Q G +PKPSDLNPSDQSDHSQEWE MARAWL SFPEA+AGSMEEVEAWIDSNHASLPGNLKSMPRSDLCQRLISIQNLMRLS QGKEEIQG 
Subjt:  MSNPAEETQ-GEQPKPSDLNPSDQSDHSQEWETMARAWLCSFPEARAGSMEEVEAWIDSNHASLPGNLKSMPRSDLCQRLISIQNLMRLSNQGKEEIQG-

Query:  --------DQGDQGDLPHARFQRTDQWIPVYSWLESLQHEEVVKSKDISDWLTENPSIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQMPD----
                DQGDQGDLPHARFQRTDQWIPVYSWLESLQ +EVVKSK+ISDWLTENPSIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQ+P+    
Subjt:  --------DQGDQGDLPHARFQRTDQWIPVYSWLESLQHEEVVKSKDISDWLTENPSIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQMPD----

Query:  KSPLKVHKDVLMKPALPPRDSFSDLPKDSDVYLAKRKEAFRKYEILVELEKLLAPNFSKSQGVK
        KSPLKVHKDV+MKPALPPRDSFSDLPKDSDVYLAKRKEAFRKYEILVELEKLL+ N  KS+GVK
Subjt:  KSPLKVHKDVLMKPALPPRDSFSDLPKDSDVYLAKRKEAFRKYEILVELEKLLAPNFSKSQGVK

TrEMBL top hits (e-value, % identity, alignment)
A0A1S4E072 uncharacterized protein LOC107991313 (e-value: 4.1e-123; identity: 86.59%)
Query:  MSNPAEET-------QGEQPKPSDLNPSDQSDHSQEWETMARAWLCSFPEARAGSMEEVEAWIDSNHASLPGNLKSMPRSDLCQRLISIQNLMRLSNQGK
        MSNP E T       +GE PKPSD+NPSD  D SQEWE MARAWLCSFPEA+AGSMEEVEAWIDSNHASLPGNLKSMPRSDLCQRLISIQNLMRLS QGK
Subjt:  MSNPAEET-------QGEQPKPSDLNPSDQSDHSQEWETMARAWLCSFPEARAGSMEEVEAWIDSNHASLPGNLKSMPRSDLCQRLISIQNLMRLSNQGK

Query:  EEI-----QGDQGDQGDLPHARFQRTDQWIPVYSWLESLQHEEVVKSKDISDWLTENPSIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQMPDKS
        E+I     + DQGDQGDLPHARFQRTDQWIPVYSWLESLQ +EVVKSK+ISDWLTENP+IRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQ+PDKS
Subjt:  EEI-----QGDQGDQGDLPHARFQRTDQWIPVYSWLESLQHEEVVKSKDISDWLTENPSIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQMPDKS

Query:  PLKVHKDVLMKPALPPRDSFSDLPKDSDVYLAKRKEAFRKYEILVELEKLLAPNFSKSQGV
        PLKVHKDV MKPALPPRDSFSDLPKDSD+YLAKRKEAFRKYEILVELEKLLA  FSKSQGV
Subjt:  PLKVHKDVLMKPALPPRDSFSDLPKDSDVYLAKRKEAFRKYEILVELEKLLAPNFSKSQGV

A0A5D3BJS0 Uncharacterized protein (e-value: 4.1e-123; identity: 86.59%)
Query:  MSNPAEET-------QGEQPKPSDLNPSDQSDHSQEWETMARAWLCSFPEARAGSMEEVEAWIDSNHASLPGNLKSMPRSDLCQRLISIQNLMRLSNQGK
        MSNP E T       +GE PKPSD+NPSD  D SQEWE MARAWLCSFPEA+AGSMEEVEAWIDSNHASLPGNLKSMPRSDLCQRLISIQNLMRLS QGK
Subjt:  MSNPAEET-------QGEQPKPSDLNPSDQSDHSQEWETMARAWLCSFPEARAGSMEEVEAWIDSNHASLPGNLKSMPRSDLCQRLISIQNLMRLSNQGK

Query:  EEI-----QGDQGDQGDLPHARFQRTDQWIPVYSWLESLQHEEVVKSKDISDWLTENPSIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQMPDKS
        E+I     + DQGDQGDLPHARFQRTDQWIPVYSWLESLQ +EVVKSK+ISDWLTENP+IRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQ+PDKS
Subjt:  EEI-----QGDQGDQGDLPHARFQRTDQWIPVYSWLESLQHEEVVKSKDISDWLTENPSIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQMPDKS

Query:  PLKVHKDVLMKPALPPRDSFSDLPKDSDVYLAKRKEAFRKYEILVELEKLLAPNFSKSQGV
        PLKVHKDV MKPALPPRDSFSDLPKDSD+YLAKRKEAFRKYEILVELEKLLA  FSKSQGV
Subjt:  PLKVHKDVLMKPALPPRDSFSDLPKDSDVYLAKRKEAFRKYEILVELEKLLAPNFSKSQGV

A0A6J1C586 uncharacterized protein LOC111008597 isoform X1 (e-value: 1.4e-139; identity: 100%)
Query:  MSNPAEETQGEQPKPSDLNPSDQSDHSQEWETMARAWLCSFPEARAGSMEEVEAWIDSNHASLPGNLKSMPRSDLCQRLISIQNLMRLSNQGKEEIQGDQ
        MSNPAEETQGEQPKPSDLNPSDQSDHSQEWETMARAWLCSFPEARAGSMEEVEAWIDSNHASLPGNLKSMPRSDLCQRLISIQNLMRLSNQGKEEIQGDQ
Subjt:  MSNPAEETQGEQPKPSDLNPSDQSDHSQEWETMARAWLCSFPEARAGSMEEVEAWIDSNHASLPGNLKSMPRSDLCQRLISIQNLMRLSNQGKEEIQGDQ

Query:  GDQGDLPHARFQRTDQWIPVYSWLESLQHEEVVKSKDISDWLTENPSIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQMPDKSPLKVHKDVLMKP
        GDQGDLPHARFQRTDQWIPVYSWLESLQHEEVVKSKDISDWLTENPSIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQMPDKSPLKVHKDVLMKP
Subjt:  GDQGDLPHARFQRTDQWIPVYSWLESLQHEEVVKSKDISDWLTENPSIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQMPDKSPLKVHKDVLMKP

Query:  ALPPRDSFSDLPKDSDVYLAKRKEAFRKYEILVELEKLLAPNFSKSQGVK
        ALPPRDSFSDLPKDSDVYLAKRKEAFRKYEILVELEKLLAPNFSKSQGVK
Subjt:  ALPPRDSFSDLPKDSDVYLAKRKEAFRKYEILVELEKLLAPNFSKSQGVK

A0A6J1GVV9 uncharacterized protein LOC111457310 (e-value: 2.0e-122; identity: 86.74%)
Query:  MSNPAEETQ-GEQPKPSDLNPSDQSDHSQEWETMARAWLCSFPEARAGSMEEVEAWIDSNHASLPGNLKSMPRSDLCQRLISIQNLMRLSNQGKEEIQG-
        MSNPAE+ Q G +PKPSDLNPSDQSDHSQEWE MARAWL SFPEA+AGSMEEVEAWIDSNHASLPGNLKSMPRSDLCQRLISIQNLMRLS QGKEEIQG 
Subjt:  MSNPAEETQ-GEQPKPSDLNPSDQSDHSQEWETMARAWLCSFPEARAGSMEEVEAWIDSNHASLPGNLKSMPRSDLCQRLISIQNLMRLSNQGKEEIQG-

Query:  --------DQGDQGDLPHARFQRTDQWIPVYSWLESLQHEEVVKSKDISDWLTENPSIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQMPD----
                DQG+QGDLPHARFQRTDQWIPVYSWLESLQ +EVVKSK+ISDWLTENPSIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQ+P+    
Subjt:  --------DQGDQGDLPHARFQRTDQWIPVYSWLESLQHEEVVKSKDISDWLTENPSIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQMPD----

Query:  KSPLKVHKDVLMKPALPPRDSFSDLPKDSDVYLAKRKEAFRKYEILVELEKLLAPNFSKSQGVK
        KSPLKVHKDV+MKPALPPRDSFSDLPKDSDVYLAKR+EAFRKYEILVELEKLL+ N  KS+GVK
Subjt:  KSPLKVHKDVLMKPALPPRDSFSDLPKDSDVYLAKRKEAFRKYEILVELEKLLAPNFSKSQGVK

A0A6J1K2W0 uncharacterized protein LOC111490167 (e-value: 3.1e-123; identity: 87.5%)
Query:  MSNPAEETQ-GEQPKPSDLNPSDQSDHSQEWETMARAWLCSFPEARAGSMEEVEAWIDSNHASLPGNLKSMPRSDLCQRLISIQNLMRLSNQGKEEIQG-
        MSNPAE+ Q G +PKPSDLNPSDQSDHSQEWE MARAWL SFPEA+AGSMEEVEAWIDSNHASLPGNLKSMPRSDLCQRLISIQNLMRLS QGKEEIQG 
Subjt:  MSNPAEETQ-GEQPKPSDLNPSDQSDHSQEWETMARAWLCSFPEARAGSMEEVEAWIDSNHASLPGNLKSMPRSDLCQRLISIQNLMRLSNQGKEEIQG-

Query:  --------DQGDQGDLPHARFQRTDQWIPVYSWLESLQHEEVVKSKDISDWLTENPSIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQMPD----
                DQGDQGDLPHARFQRTDQWIPVYSWLESLQ +EVVKSK+ISDWLTENPSIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQ+P+    
Subjt:  --------DQGDQGDLPHARFQRTDQWIPVYSWLESLQHEEVVKSKDISDWLTENPSIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQMPD----

Query:  KSPLKVHKDVLMKPALPPRDSFSDLPKDSDVYLAKRKEAFRKYEILVELEKLLAPNFSKSQGVK
        KSPLKVHKDV+MKPALPPRDSFSDLPKDSDVYLAKRKEAFRKYEILVELEKLL+ N  KS+GVK
Subjt:  KSPLKVHKDVLMKPALPPRDSFSDLPKDSDVYLAKRKEAFRKYEILVELEKLLAPNFSKSQGVK

SwissProt top hits (e-value, % identity, alignment)
No hits found
Arabidopsis top hits (e-value, % identity, alignment)
AT5G50410.1 unknown protein (e-value: 1.0e-62; identity: 53.44%)
Query:  EETQGEQPKPSD--LNPSDQSDHSQEWETMARAWLCSFPEARA-GSMEEVEAWIDSNHASLPGNLKSMPRSDLCQRLISIQNLMR---LSNQGKEEIQGD
        E+ Q +Q +  D   N + +++ SQEWETMARAW+ +FP+A+A  S  EVE WI +N  SLP +L+ MPRS++  RL+SIQ+ MR    S+Q ++ ++ D
Subjt:  EETQGEQPKPSD--LNPSDQSDHSQEWETMARAWLCSFPEARA-GSMEEVEAWIDSNHASLPGNLKSMPRSDLCQRLISIQNLMR---LSNQGKEEIQGD

Query:  QGDQGDLPHARFQRTDQWIPVYSWLESLQHEEVVKSKDISDWLTENPSIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQMPDKSPLKVHKDVLMK
        Q        ARFQRTDQW+PVYSWLESL + E+VKSKDIS+WL  NP ++ +L SRHSRYHL HY+KKCHLKILKRKEKK   ++   + ++VHK+   K
Subjt:  QGDQGDLPHARFQRTDQWIPVYSWLESLQHEEVVKSKDISDWLTENPSIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQMPDKSPLKVHKDVLMK

Query:  -PALPPRDSFSDLPKDSDVYLAKRKEAFRKYEILVELEKLLAPNFSK
          A    D  S++PKDSD+Y  K+KEA R++EILVELEK LAP+F+K
Subjt:  -PALPPRDSFSDLPKDSDVYLAKRKEAFRKYEILVELEKLLAPNFSK


Sequences
CDS sequence
ATGTCGAATCCAGCGGAAGAAACCCAAGGAGAGCAGCCTAAGCCATCTGATCTGAACCCTAGCGACCAAAGCGATCACTCCCAAGAATGGGAAACCATGGCGCGCGCTTG
GCTTTGCTCCTTCCCCGAGGCCAGAGCTGGCTCCATGGAAGAGGTTGAAGCTTGGATTGACTCCAACCATGCCTCTTTACCTGGAAACCTCAAATCAATGCCCCGCTCCG
ACCTTTGCCAGAGGCTGATTTCCATCCAAAATTTAATGCGACTTTCCAATCAGGGAAAGGAAGAGATTCAGGGTGACCAGGGTGATCAGGGCGATCTTCCACATGCTCGA
TTTCAACGCACCGACCAGTGGATACCAGTTTATTCTTGGCTAGAGTCTCTACAACATGAAGAGGTTGTCAAGTCAAAGGACATATCTGATTGGTTAACTGAAAATCCCTC
CATCAGAGATCAGTTGTGTTCAAGACATTCTCGCTATCATTTAATGCACTACATCAAAAAGTGTCACTTGAAGATACTGAAAAGAAAGGAAAAGAAAAAGGGTTCTCAGA
TGCCTGACAAATCTCCTCTAAAAGTTCACAAGGATGTTCTGATGAAACCAGCATTGCCTCCGCGTGATTCATTTAGCGATCTACCGAAAGACAGCGATGTATATTTGGCA
AAACGAAAGGAAGCCTTCCGAAAATATGAAATTTTAGTGGAGTTGGAGAAGTTGCTTGCCCCCAACTTTTCCAAGTCTCAAGGAGTCAAA
mRNA sequence
ATGTCGAATCCAGCGGAAGAAACCCAAGGAGAGCAGCCTAAGCCATCTGATCTGAACCCTAGCGACCAAAGCGATCACTCCCAAGAATGGGAAACCATGGCGCGCGCTTG
GCTTTGCTCCTTCCCCGAGGCCAGAGCTGGCTCCATGGAAGAGGTTGAAGCTTGGATTGACTCCAACCATGCCTCTTTACCTGGAAACCTCAAATCAATGCCCCGCTCCG
ACCTTTGCCAGAGGCTGATTTCCATCCAAAATTTAATGCGACTTTCCAATCAGGGAAAGGAAGAGATTCAGGGTGACCAGGGTGATCAGGGCGATCTTCCACATGCTCGA
TTTCAACGCACCGACCAGTGGATACCAGTTTATTCTTGGCTAGAGTCTCTACAACATGAAGAGGTTGTCAAGTCAAAGGACATATCTGATTGGTTAACTGAAAATCCCTC
CATCAGAGATCAGTTGTGTTCAAGACATTCTCGCTATCATTTAATGCACTACATCAAAAAGTGTCACTTGAAGATACTGAAAAGAAAGGAAAAGAAAAAGGGTTCTCAGA
TGCCTGACAAATCTCCTCTAAAAGTTCACAAGGATGTTCTGATGAAACCAGCATTGCCTCCGCGTGATTCATTTAGCGATCTACCGAAAGACAGCGATGTATATTTGGCA
AAACGAAAGGAAGCCTTCCGAAAATATGAAATTTTAGTGGAGTTGGAGAAGTTGCTTGCCCCCAACTTTTCCAAGTCTCAAGGAGTCAAA
Protein sequence
MSNPAEETQGEQPKPSDLNPSDQSDHSQEWETMARAWLCSFPEARAGSMEEVEAWIDSNHASLPGNLKSMPRSDLCQRLISIQNLMRLSNQGKEEIQGDQGDQGDLPHAR
FQRTDQWIPVYSWLESLQHEEVVKSKDISDWLTENPSIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQMPDKSPLKVHKDVLMKPALPPRDSFSDLPKDSDVYLA
KRKEAFRKYEILVELEKLLAPNFSKSQGVK
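
The protein sequence listed above can be cross-checked against the CDS by translating it codon by codon. The sketch below assumes the standard genetic code (the page does not say which translation table was used) and uses a hypothetical helper named translate; it is shown on the first 30 nucleotides of the CDS only.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First ten codons of the CDS sequence listed on this page.
print(translate("ATGTCGAATCCAGCGGAAGAAACCCAAGGA"))
```

Passing the full CDS (with its line breaks left in) to the same function should give a string that can be compared directly with the protein sequence field.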