; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0003194 (gene) of Snake gourd v1 genome

Gene IDTan0003194
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionCHANNEL_COLICIN domain-containing protein
Genome locationLG05:81684773..81687445
RNA-Seq ExpressionTan0003194
SyntenyTan0003194
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6573128.1 hypothetical protein SDJN03_27015, partial [Cucurbita argyrosperma subsp. sororia]2.0e-13090.53Show/hide
Query:  MSNPAEQPHEGEEPKPSDLNPSDQNDHSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNYGSLPGNLKSMPRSDLCQRLISIQNLMRLSTQGKEEIQG-
        MSNPAEQ  EG EPKPSDLNPSDQ+DHSQEWE+MARAWL SFPEAKAGSMEEVEAWIDSN+ SLPGNLKSMPRSDLCQRLISIQNLMRLS QGKEEIQG 
Subjt:  MSNPAEQPHEGEEPKPSDLNPSDQNDHSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNYGSLPGNLKSMPRSDLCQRLISIQNLMRLSTQGKEEIQG-

Query:  --DHGDQGDEGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPSIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQLPEKSPQ
          D  D+ D+GDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPSIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQLPEKSPQ
Subjt:  --DHGDQGDEGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPSIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQLPEKSPQ

Query:  KSPLKVHKDVLMKPALPPRDSFSDLPKDSDVYLAKRKEAFRKYEILVELEKLFAPNFSKSQGVK
        KSPLKVHKDV+MKPALPPRDSFSDLPKDSDVYLAKR+EAFRKYEILVELEKL + N  KS+GVK
Subjt:  KSPLKVHKDVLMKPALPPRDSFSDLPKDSDVYLAKRKEAFRKYEILVELEKLFAPNFSKSQGVK

XP_022137036.1 uncharacterized protein LOC111008597 isoform X1 [Momordica charantia]1.1e-12589.27Show/hide
Query:  MSNPAEQPHEGEEPKPSDLNPSDQNDHSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNYGSLPGNLKSMPRSDLCQRLISIQNLMRLSTQGKEEIQGD
        MSNPAE+  +GE+PKPSDLNPSDQ+DHSQEWE MARAWLCSFPEA+AGSMEEVEAWIDSN+ SLPGNLKSMPRSDLCQRLISIQNLMRLS QGKEEI   
Subjt:  MSNPAEQPHEGEEPKPSDLNPSDQNDHSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNYGSLPGNLKSMPRSDLCQRLISIQNLMRLSTQGKEEIQGD

Query:  HGDQGDEGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPSIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQLPEKSPQKSP
           QGD+GDQGDLPHARFQRTDQWIPVYSWLESLQ +EVVKSK+ISDWLTENPSIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQ+P+    KSP
Subjt:  HGDQGDEGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPSIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQLPEKSPQKSP

Query:  LKVHKDVLMKPALPPRDSFSDLPKDSDVYLAKRKEAFRKYEILVELEKLFAPNFSKSQGVK
        LKVHKDVLMKPALPPRDSFSDLPKDSDVYLAKRKEAFRKYEILVELEKL APNFSKSQGVK
Subjt:  LKVHKDVLMKPALPPRDSFSDLPKDSDVYLAKRKEAFRKYEILVELEKLFAPNFSKSQGVK

XP_022955309.1 uncharacterized protein LOC111457310 [Cucurbita moschata]5.7e-13090.15Show/hide
Query:  MSNPAEQPHEGEEPKPSDLNPSDQNDHSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNYGSLPGNLKSMPRSDLCQRLISIQNLMRLSTQGKEEIQG-
        MSNPAEQ  EG EPKPSDLNPSDQ+DHSQEWE+MARAWL SFPEAKAGSMEEVEAWIDSN+ SLPGNLKSMPRSDLCQRLISIQNLMRLS QGKEEIQG 
Subjt:  MSNPAEQPHEGEEPKPSDLNPSDQNDHSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNYGSLPGNLKSMPRSDLCQRLISIQNLMRLSTQGKEEIQG-

Query:  --DHGDQGDEGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPSIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQLPEKSPQ
          D  D+ D+G+QGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPSIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQLPEKSPQ
Subjt:  --DHGDQGDEGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPSIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQLPEKSPQ

Query:  KSPLKVHKDVLMKPALPPRDSFSDLPKDSDVYLAKRKEAFRKYEILVELEKLFAPNFSKSQGVK
        KSPLKVHKDV+MKPALPPRDSFSDLPKDSDVYLAKR+EAFRKYEILVELEKL + N  KS+GVK
Subjt:  KSPLKVHKDVLMKPALPPRDSFSDLPKDSDVYLAKRKEAFRKYEILVELEKLFAPNFSKSQGVK

XP_022994449.1 uncharacterized protein LOC111490167 [Cucurbita maxima]3.3e-13090.53Show/hide
Query:  MSNPAEQPHEGEEPKPSDLNPSDQNDHSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNYGSLPGNLKSMPRSDLCQRLISIQNLMRLSTQGKEEIQG-
        MSNPAEQ  EG EPKPSDLNPSDQ+DHSQEWE+MARAWL SFPEAKAGSMEEVEAWIDSN+ SLPGNLKSMPRSDLCQRLISIQNLMRLS QGKEEIQG 
Subjt:  MSNPAEQPHEGEEPKPSDLNPSDQNDHSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNYGSLPGNLKSMPRSDLCQRLISIQNLMRLSTQGKEEIQG-

Query:  --DHGDQGDEGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPSIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQLPEKSPQ
          D  D+ D+GDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPSIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQLPEKSP 
Subjt:  --DHGDQGDEGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPSIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQLPEKSPQ

Query:  KSPLKVHKDVLMKPALPPRDSFSDLPKDSDVYLAKRKEAFRKYEILVELEKLFAPNFSKSQGVK
        KSPLKVHKDV+MKPALPPRDSFSDLPKDSDVYLAKRKEAFRKYEILVELEKL + N  KS+GVK
Subjt:  KSPLKVHKDVLMKPALPPRDSFSDLPKDSDVYLAKRKEAFRKYEILVELEKLFAPNFSKSQGVK

XP_023542742.1 uncharacterized protein LOC111802563 [Cucurbita pepo subsp. pepo]8.8e-13190.91Show/hide
Query:  MSNPAEQPHEGEEPKPSDLNPSDQNDHSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNYGSLPGNLKSMPRSDLCQRLISIQNLMRLSTQGKEEIQG-
        MSNPAEQ  EG EPKPSDLNPSDQ+DHSQEWE+MARAWL SFPEAKAGSMEEVEAWIDSN+ SLPGNLKSMPRSDLCQRLISIQNLMRLS QGKEEIQG 
Subjt:  MSNPAEQPHEGEEPKPSDLNPSDQNDHSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNYGSLPGNLKSMPRSDLCQRLISIQNLMRLSTQGKEEIQG-

Query:  --DHGDQGDEGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPSIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQLPEKSPQ
          D  D+ D+GDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPSIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQLPEKSPQ
Subjt:  --DHGDQGDEGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPSIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQLPEKSPQ

Query:  KSPLKVHKDVLMKPALPPRDSFSDLPKDSDVYLAKRKEAFRKYEILVELEKLFAPNFSKSQGVK
        KSPLKVHKDV+MKPALPPRDSFSDLPKDSDVYLAKRKEAFRKYEILVELEKL + N  KS+GVK
Subjt:  KSPLKVHKDVLMKPALPPRDSFSDLPKDSDVYLAKRKEAFRKYEILVELEKLFAPNFSKSQGVK

TrEMBL top hitse value%identityAlignment
A0A1S4E072 uncharacterized protein LOC1079913137.2e-12386.47Show/hide
Query:  MSNPAEQPHEGEE------PKPSDLNPSDQNDHSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNYGSLPGNLKSMPRSDLCQRLISIQNLMRLSTQGK
        MSNP E+  EGEE      PKPSD+NPSD  D SQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSN+ SLPGNLKSMPRSDLCQRLISIQNLMRLSTQGK
Subjt:  MSNPAEQPHEGEE------PKPSDLNPSDQNDHSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNYGSLPGNLKSMPRSDLCQRLISIQNLMRLSTQGK

Query:  EEIQGDHGDQGDEGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPSIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQLPEK
        E+I     D+ D+GDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENP+IRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQLP+ 
Subjt:  EEIQGDHGDQGDEGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPSIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQLPEK

Query:  SPQKSPLKVHKDVLMKPALPPRDSFSDLPKDSDVYLAKRKEAFRKYEILVELEKLFAPNFSKSQGV
           KSPLKVHKDV MKPALPPRDSFSDLPKDSD+YLAKRKEAFRKYEILVELEKL A  FSKSQGV
Subjt:  SPQKSPLKVHKDVLMKPALPPRDSFSDLPKDSDVYLAKRKEAFRKYEILVELEKLFAPNFSKSQGV

A0A5D3BJS0 Uncharacterized protein7.2e-12386.47Show/hide
Query:  MSNPAEQPHEGEE------PKPSDLNPSDQNDHSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNYGSLPGNLKSMPRSDLCQRLISIQNLMRLSTQGK
        MSNP E+  EGEE      PKPSD+NPSD  D SQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSN+ SLPGNLKSMPRSDLCQRLISIQNLMRLSTQGK
Subjt:  MSNPAEQPHEGEE------PKPSDLNPSDQNDHSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNYGSLPGNLKSMPRSDLCQRLISIQNLMRLSTQGK

Query:  EEIQGDHGDQGDEGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPSIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQLPEK
        E+I     D+ D+GDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENP+IRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQLP+ 
Subjt:  EEIQGDHGDQGDEGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPSIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQLPEK

Query:  SPQKSPLKVHKDVLMKPALPPRDSFSDLPKDSDVYLAKRKEAFRKYEILVELEKLFAPNFSKSQGV
           KSPLKVHKDV MKPALPPRDSFSDLPKDSD+YLAKRKEAFRKYEILVELEKL A  FSKSQGV
Subjt:  SPQKSPLKVHKDVLMKPALPPRDSFSDLPKDSDVYLAKRKEAFRKYEILVELEKLFAPNFSKSQGV

A0A6J1C586 uncharacterized protein LOC111008597 isoform X15.4e-12689.27Show/hide
Query:  MSNPAEQPHEGEEPKPSDLNPSDQNDHSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNYGSLPGNLKSMPRSDLCQRLISIQNLMRLSTQGKEEIQGD
        MSNPAE+  +GE+PKPSDLNPSDQ+DHSQEWE MARAWLCSFPEA+AGSMEEVEAWIDSN+ SLPGNLKSMPRSDLCQRLISIQNLMRLS QGKEEI   
Subjt:  MSNPAEQPHEGEEPKPSDLNPSDQNDHSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNYGSLPGNLKSMPRSDLCQRLISIQNLMRLSTQGKEEIQGD

Query:  HGDQGDEGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPSIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQLPEKSPQKSP
           QGD+GDQGDLPHARFQRTDQWIPVYSWLESLQ +EVVKSK+ISDWLTENPSIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQ+P+    KSP
Subjt:  HGDQGDEGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPSIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQLPEKSPQKSP

Query:  LKVHKDVLMKPALPPRDSFSDLPKDSDVYLAKRKEAFRKYEILVELEKLFAPNFSKSQGVK
        LKVHKDVLMKPALPPRDSFSDLPKDSDVYLAKRKEAFRKYEILVELEKL APNFSKSQGVK
Subjt:  LKVHKDVLMKPALPPRDSFSDLPKDSDVYLAKRKEAFRKYEILVELEKLFAPNFSKSQGVK

A0A6J1GVV9 uncharacterized protein LOC1114573102.7e-13090.15Show/hide
Query:  MSNPAEQPHEGEEPKPSDLNPSDQNDHSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNYGSLPGNLKSMPRSDLCQRLISIQNLMRLSTQGKEEIQG-
        MSNPAEQ  EG EPKPSDLNPSDQ+DHSQEWE+MARAWL SFPEAKAGSMEEVEAWIDSN+ SLPGNLKSMPRSDLCQRLISIQNLMRLS QGKEEIQG 
Subjt:  MSNPAEQPHEGEEPKPSDLNPSDQNDHSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNYGSLPGNLKSMPRSDLCQRLISIQNLMRLSTQGKEEIQG-

Query:  --DHGDQGDEGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPSIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQLPEKSPQ
          D  D+ D+G+QGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPSIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQLPEKSPQ
Subjt:  --DHGDQGDEGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPSIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQLPEKSPQ

Query:  KSPLKVHKDVLMKPALPPRDSFSDLPKDSDVYLAKRKEAFRKYEILVELEKLFAPNFSKSQGVK
        KSPLKVHKDV+MKPALPPRDSFSDLPKDSDVYLAKR+EAFRKYEILVELEKL + N  KS+GVK
Subjt:  KSPLKVHKDVLMKPALPPRDSFSDLPKDSDVYLAKRKEAFRKYEILVELEKLFAPNFSKSQGVK

A0A6J1K2W0 uncharacterized protein LOC1114901671.6e-13090.53Show/hide
Query:  MSNPAEQPHEGEEPKPSDLNPSDQNDHSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNYGSLPGNLKSMPRSDLCQRLISIQNLMRLSTQGKEEIQG-
        MSNPAEQ  EG EPKPSDLNPSDQ+DHSQEWE+MARAWL SFPEAKAGSMEEVEAWIDSN+ SLPGNLKSMPRSDLCQRLISIQNLMRLS QGKEEIQG 
Subjt:  MSNPAEQPHEGEEPKPSDLNPSDQNDHSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNYGSLPGNLKSMPRSDLCQRLISIQNLMRLSTQGKEEIQG-

Query:  --DHGDQGDEGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPSIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQLPEKSPQ
          D  D+ D+GDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPSIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQLPEKSP 
Subjt:  --DHGDQGDEGDQGDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPSIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQLPEKSPQ

Query:  KSPLKVHKDVLMKPALPPRDSFSDLPKDSDVYLAKRKEAFRKYEILVELEKLFAPNFSKSQGVK
        KSPLKVHKDV+MKPALPPRDSFSDLPKDSDVYLAKRKEAFRKYEILVELEKL + N  KS+GVK
Subjt:  KSPLKVHKDVLMKPALPPRDSFSDLPKDSDVYLAKRKEAFRKYEILVELEKLFAPNFSKSQGVK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G50410.1 unknown protein3.9e-6050Show/hide
Query:  MSNPAEQPHEGEEPKPSDL-------NPSDQNDHSQEWEIMARAWLCSFPEAKA-GSMEEVEAWIDSNYGSLPGNLKSMPRSDLCQRLISIQNLMRLSTQ
        MSN +++  E E+ +   L       N + + + SQEWE MARAW+ +FP+AKA  S  EVE WI +N  SLP +L+ MPRS++  RL+SIQ+ MR +  
Subjt:  MSNPAEQPHEGEEPKPSDL-------NPSDQNDHSQEWEIMARAWLCSFPEAKA-GSMEEVEAWIDSNYGSLPGNLKSMPRSDLCQRLISIQNLMRLSTQ

Query:  GKEEIQGDHGDQGDEGDQGDLPH-ARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPSIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQL
                  DQ ++  + D  H ARFQRTDQW+PVYSWLESL   E+VKSK+IS+WL  NP ++ +L SRHSRYHL HY+KKCHLKILKRKEKK   +L
Subjt:  GKEEIQGDHGDQGDEGDQGDLPH-ARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPSIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQL

Query:  PEKSPQKSPLKVHKDVLMK-PALPPRDSFSDLPKDSDVYLAKRKEAFRKYEILVELEKLFAPNFSK
             + + ++VHK+   K  A    D  S++PKDSD+Y  K+KEA R++EILVELEK  AP+F+K
Subjt:  PEKSPQKSPLKVHKDVLMK-PALPPRDSFSDLPKDSDVYLAKRKEAFRKYEILVELEKLFAPNFSK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGAATCCAGCGGAACAACCCCATGAAGGAGAAGAGCCTAAGCCATCTGATCTAAACCCTAGCGACCAGAACGATCACTCTCAAGAATGGGAAATCATGGCACGAGC
GTGGCTTTGCTCCTTCCCCGAGGCCAAAGCTGGGTCCATGGAAGAGGTTGAAGCTTGGATTGACTCCAACTATGGCTCTTTACCTGGAAACCTCAAATCAATGCCCCGCT
CCGACCTTTGCCAGAGGCTGATTTCTATCCAGAATTTAATGAGACTTTCCACTCAGGGAAAGGAAGAGATTCAGGGTGATCATGGTGATCAAGGTGATGAAGGCGATCAG
GGCGACCTTCCTCACGCTCGATTTCAACGCACTGACCAGTGGATACCAGTTTATTCTTGGTTAGAGTCTCTACAACAAGATGAGGTTGTCAAGTCAAAGGAAATATCCGA
TTGGTTAACTGAAAATCCCTCCATCAGAGATCAGTTGTGTTCAAGACATTCTCGCTATCATTTAATGCACTACATCAAGAAGTGTCATTTGAAGATATTGAAAAGAAAGG
AAAAGAAAAAGGGTTCTCAGTTGCCTGAAAAGTCTCCTCAAAAGTCTCCTCTAAAAGTGCACAAGGATGTTTTGATGAAACCAGCATTGCCTCCACGCGATTCATTTAGC
GATCTACCAAAAGACAGTGATGTATATTTAGCAAAACGAAAGGAAGCATTTCGAAAATATGAAATTTTAGTGGAGTTGGAGAAGTTGTTTGCCCCCAACTTTTCTAAGTC
TCAAGGAGTGAAATGA
mRNA sequenceShow/hide mRNA sequence
CGTTATTCATGTCAATGTCATACTTATACAAACAATTATAAATTTATACCTGATATTTTGAAGGCAAACTCATAGAAATGGGGAAATGAAGGAGAAAAAAAACGAAAAGA
AAAGTAGCGATAGCGATGTTGTTTGGGGCTTCACCAATTGATGATCAGGCGTTCTTCTTCTACTTCTAGATTTCAAATGTTGCAAAATTCTTCTTTAATTGGATTTTTGA
GGAATGCGATACAGGAAATCGCAATCAGAGCAGTTTAATCAGAGACAAGTGCGTGAAAAATGTCGAATCCAGCGGAACAACCCCATGAAGGAGAAGAGCCTAAGCCATCT
GATCTAAACCCTAGCGACCAGAACGATCACTCTCAAGAATGGGAAATCATGGCACGAGCGTGGCTTTGCTCCTTCCCCGAGGCCAAAGCTGGGTCCATGGAAGAGGTTGA
AGCTTGGATTGACTCCAACTATGGCTCTTTACCTGGAAACCTCAAATCAATGCCCCGCTCCGACCTTTGCCAGAGGCTGATTTCTATCCAGAATTTAATGAGACTTTCCA
CTCAGGGAAAGGAAGAGATTCAGGGTGATCATGGTGATCAAGGTGATGAAGGCGATCAGGGCGACCTTCCTCACGCTCGATTTCAACGCACTGACCAGTGGATACCAGTT
TATTCTTGGTTAGAGTCTCTACAACAAGATGAGGTTGTCAAGTCAAAGGAAATATCCGATTGGTTAACTGAAAATCCCTCCATCAGAGATCAGTTGTGTTCAAGACATTC
TCGCTATCATTTAATGCACTACATCAAGAAGTGTCATTTGAAGATATTGAAAAGAAAGGAAAAGAAAAAGGGTTCTCAGTTGCCTGAAAAGTCTCCTCAAAAGTCTCCTC
TAAAAGTGCACAAGGATGTTTTGATGAAACCAGCATTGCCTCCACGCGATTCATTTAGCGATCTACCAAAAGACAGTGATGTATATTTAGCAAAACGAAAGGAAGCATTT
CGAAAATATGAAATTTTAGTGGAGTTGGAGAAGTTGTTTGCCCCCAACTTTTCTAAGTCTCAAGGAGTGAAATGATTGGAGAAACTTTGGGGACTGCCATCACTTTGGGT
ATGGGGATATGGGACCAAGTTTTAAGCCATTGCCAGGTTTTTACCTTGTTCACTAGTGAATCTTGTGGCTCCATTTCCCTGCTTTCCCACTCTGTAATAGAGTAGCGGAT
CTGACAAATCTTTTAGAAATTTAATTCATATCTTATAAAATGTTGATATTTGGTCACGTATGGGCATGCCGACGAATTTGAGATGATGCTGGAG
Protein sequenceShow/hide protein sequence
MSNPAEQPHEGEEPKPSDLNPSDQNDHSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNYGSLPGNLKSMPRSDLCQRLISIQNLMRLSTQGKEEIQGDHGDQGDEGDQ
GDLPHARFQRTDQWIPVYSWLESLQQDEVVKSKEISDWLTENPSIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQLPEKSPQKSPLKVHKDVLMKPALPPRDSFS
DLPKDSDVYLAKRKEAFRKYEILVELEKLFAPNFSKSQGVK