CuGenDBv2

Lsi02G005010 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi02G005010
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: CHANNEL_COLICIN domain-containing protein
Genome location: chr02:4252025..4254761
RNA-Seq Expression: Lsi02G005010
Synteny: Lsi02G005010
Gene Ontology terms: NA
InterPro domains: NA
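
The genome location above uses a chromosome:start..end notation. As a minimal sketch, the span of the gene on the chromosome can be computed as shown here. The helper name `parse_location` is hypothetical (it is not a CuGenDB API), and the code assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

chrom, start, end = parse_location("chr02:4252025..4254761")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene spans 2737 bp of chr02. The span covers the whole gene model, so it is expected to be longer than the mRNA and CDS sequences listed further down.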


Homology
GenBank top hits (columns: hit, e value, % identity, alignment)
XP_004152526.1 uncharacterized protein LOC101216205 [Cucumis sativus] | e value: 2.7e-124 | % identity: 89.58
Query:  MSNPVEQT----QQAEGPKPSDVNPSDQGDQSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNHGSLPGNLKSMPRSDLCQRLISIQNLMRLSTQATGK
        MSNPVEQT    ++ E PKPSDVNP+D GDQSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNH SLPGNLKSMPRSDLCQRLISIQNLMRLSTQ  GK
Subjt:  MSNPVEQT----QQAEGPKPSDVNPSDQGDQSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNHGSLPGNLKSMPRSDLCQRLISIQNLMRLSTQATGK

Query:  EEIQGDRGDQGDQGDLPHARFQRTDQWLPVYSWLESLHHDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQLPDKSPL
        E+I   + D+GDQGDLPHARFQRTDQW+PVYSWLESL  DEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQLPDKSPL
Subjt:  EEIQGDRGDQGDQGDLPHARFQRTDQWLPVYSWLESLHHDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQLPDKSPL

Query:  KVHKDVSMKPALPPRDSFSDLPKDSDIYLAKRKEAFRKYEILVELEKLLAPNFSKSQGV
        KVHKDV MKPALPPRDSFSDLPKDSDIYL KRKEAFRKYEILVELEKLLA  FSKSQGV
Subjt:  KVHKDVSMKPALPPRDSFSDLPKDSDIYLAKRKEAFRKYEILVELEKLLAPNFSKSQGV

XP_016901626.1 PREDICTED: uncharacterized protein LOC107991313 [Cucumis melo] | e value: 8.3e-126 | % identity: 89.73
Query:  MSNPVEQTQ------QAEGPKPSDVNPSDQGDQSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNHGSLPGNLKSMPRSDLCQRLISIQNLMRLSTQAT
        MSNPVE+T+      + E PKPSDVNPSD GDQSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNH SLPGNLKSMPRSDLCQRLISIQNLMRLSTQ  
Subjt:  MSNPVEQTQ------QAEGPKPSDVNPSDQGDQSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNHGSLPGNLKSMPRSDLCQRLISIQNLMRLSTQAT

Query:  GKEEI--QGDRGDQGDQGDLPHARFQRTDQWLPVYSWLESLHHDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQLPD
        GKE+I  Q D  DQGDQGDLPHARFQRTDQW+PVYSWLESL  DEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQLPD
Subjt:  GKEEI--QGDRGDQGDQGDLPHARFQRTDQWLPVYSWLESLHHDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQLPD

Query:  KSPLKVHKDVSMKPALPPRDSFSDLPKDSDIYLAKRKEAFRKYEILVELEKLLAPNFSKSQGV
        KSPLKVHKDV MKPALPPRDSFSDLPKDSDIYLAKRKEAFRKYEILVELEKLLA  FSKSQGV
Subjt:  KSPLKVHKDVSMKPALPPRDSFSDLPKDSDIYLAKRKEAFRKYEILVELEKLLAPNFSKSQGV

XP_022137036.1 uncharacterized protein LOC111008597 isoform X1 [Momordica charantia] | e value: 8.3e-126 | % identity: 90.23
Query:  MSNPVEQTQQAEGPKPSDVNPSDQGDQSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNHGSLPGNLKSMPRSDLCQRLISIQNLMRLSTQATGKEEIQ
        MSNP E+T Q E PKPSD+NPSDQ D SQEWE MARAWLCSFPEA+AGSMEEVEAWIDSNH SLPGNLKSMPRSDLCQRLISIQNLMRLS Q  GKEEIQ
Subjt:  MSNPVEQTQQAEGPKPSDVNPSDQGDQSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNHGSLPGNLKSMPRSDLCQRLISIQNLMRLSTQATGKEEIQ

Query:  GDRGDQGDQGDLPHARFQRTDQWLPVYSWLESLHHDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQLPDKSPLKVHK
           GDQGDQGDLPHARFQRTDQW+PVYSWLESL H+EVVKSK+ISDWLTENP+IRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQ+PDKSPLKVHK
Subjt:  GDRGDQGDQGDLPHARFQRTDQWLPVYSWLESLHHDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQLPDKSPLKVHK

Query:  DVSMKPALPPRDSFSDLPKDSDIYLAKRKEAFRKYEILVELEKLLAPNFSKSQGVK
        DV MKPALPPRDSFSDLPKDSD+YLAKRKEAFRKYEILVELEKLLAPNFSKSQGVK
Subjt:  DVSMKPALPPRDSFSDLPKDSDIYLAKRKEAFRKYEILVELEKLLAPNFSKSQGVK

XP_023542742.1 uncharacterized protein LOC111802563 [Cucurbita pepo subsp. pepo] | e value: 1.2e-121 | % identity: 85.71
Query:  MSNPVEQTQQAEGPKPSDVNPSDQGDQSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNHGSLPGNLKSMPRSDLCQRLISIQNLMRLSTQATGKEEIQ
        MSNP EQ Q+   PKPSD+NPSDQ D SQEWE+MARAWL SFPEAKAGSMEEVEAWIDSNH SLPGNLKSMPRSDLCQRLISIQNLMRLS Q  GKEEIQ
Subjt:  MSNPVEQTQQAEGPKPSDVNPSDQGDQSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNHGSLPGNLKSMPRSDLCQRLISIQNLMRLSTQATGKEEIQ

Query:  G------DRGDQGDQGDLPHARFQRTDQWLPVYSWLESLHHDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQLPD--
        G      DR DQGDQGDLPHARFQRTDQW+PVYSWLESL  DEVVKSKEISDWLTENP+IRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQLP+  
Subjt:  G------DRGDQGDQGDLPHARFQRTDQWLPVYSWLESLHHDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQLPD--

Query:  --KSPLKVHKDVSMKPALPPRDSFSDLPKDSDIYLAKRKEAFRKYEILVELEKLLAPNFSKSQGVK
          KSPLKVHKDV MKPALPPRDSFSDLPKDSD+YLAKRKEAFRKYEILVELEKLL+ N  KS+GVK
Subjt:  --KSPLKVHKDVSMKPALPPRDSFSDLPKDSDIYLAKRKEAFRKYEILVELEKLLAPNFSKSQGVK

XP_038893664.1 uncharacterized protein LOC120082533 [Benincasa hispida] | e value: 1.6e-129 | % identity: 91.86
Query:  MSNPVEQTQQAEGPKPSDVNPSDQGDQSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNHGSLPGNLKSMPRSDLCQRLISIQNLMRLSTQATGKEEI-
        MSNPVEQTQ+ E PKPS+VNPSDQGDQSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNH SLPGNLKSMPRSDLCQRLISIQNLMRLS+QATGKEEI 
Subjt:  MSNPVEQTQQAEGPKPSDVNPSDQGDQSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNHGSLPGNLKSMPRSDLCQRLISIQNLMRLSTQATGKEEI-

Query:  -QGDRGDQGDQGDLPHARFQRTDQWLPVYSWLESLHHDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQLPDKSPLKV
         Q D  DQG QGDLPHARFQRT+QW+PVYSWLESLH DEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQLPDKSPLKV
Subjt:  -QGDRGDQGDQGDLPHARFQRTDQWLPVYSWLESLHHDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQLPDKSPLKV

Query:  HKDVSMKPALPPRDSFSDLPKDSDIYLAKRKEAFRKYEILVELEKLLAPNFSKSQGVK
        HKDV MKPALPPRDSFSDLPKDS IYLAKRKEA+RKYEIL+ELEKLLA  FSKSQGVK
Subjt:  HKDVSMKPALPPRDSFSDLPKDSDIYLAKRKEAFRKYEILVELEKLLAPNFSKSQGVK

TrEMBL top hits (columns: hit, e value, % identity, alignment)
A0A0A0LUD5 Uncharacterized protein | e value: 1.3e-124 | % identity: 89.58
Query:  MSNPVEQT----QQAEGPKPSDVNPSDQGDQSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNHGSLPGNLKSMPRSDLCQRLISIQNLMRLSTQATGK
        MSNPVEQT    ++ E PKPSDVNP+D GDQSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNH SLPGNLKSMPRSDLCQRLISIQNLMRLSTQ  GK
Subjt:  MSNPVEQT----QQAEGPKPSDVNPSDQGDQSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNHGSLPGNLKSMPRSDLCQRLISIQNLMRLSTQATGK

Query:  EEIQGDRGDQGDQGDLPHARFQRTDQWLPVYSWLESLHHDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQLPDKSPL
        E+I   + D+GDQGDLPHARFQRTDQW+PVYSWLESL  DEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQLPDKSPL
Subjt:  EEIQGDRGDQGDQGDLPHARFQRTDQWLPVYSWLESLHHDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQLPDKSPL

Query:  KVHKDVSMKPALPPRDSFSDLPKDSDIYLAKRKEAFRKYEILVELEKLLAPNFSKSQGV
        KVHKDV MKPALPPRDSFSDLPKDSDIYL KRKEAFRKYEILVELEKLLA  FSKSQGV
Subjt:  KVHKDVSMKPALPPRDSFSDLPKDSDIYLAKRKEAFRKYEILVELEKLLAPNFSKSQGV

A0A1S4E072 uncharacterized protein LOC107991313 | e value: 4.0e-126 | % identity: 89.73
Query:  MSNPVEQTQ------QAEGPKPSDVNPSDQGDQSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNHGSLPGNLKSMPRSDLCQRLISIQNLMRLSTQAT
        MSNPVE+T+      + E PKPSDVNPSD GDQSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNH SLPGNLKSMPRSDLCQRLISIQNLMRLSTQ  
Subjt:  MSNPVEQTQ------QAEGPKPSDVNPSDQGDQSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNHGSLPGNLKSMPRSDLCQRLISIQNLMRLSTQAT

Query:  GKEEI--QGDRGDQGDQGDLPHARFQRTDQWLPVYSWLESLHHDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQLPD
        GKE+I  Q D  DQGDQGDLPHARFQRTDQW+PVYSWLESL  DEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQLPD
Subjt:  GKEEI--QGDRGDQGDQGDLPHARFQRTDQWLPVYSWLESLHHDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQLPD

Query:  KSPLKVHKDVSMKPALPPRDSFSDLPKDSDIYLAKRKEAFRKYEILVELEKLLAPNFSKSQGV
        KSPLKVHKDV MKPALPPRDSFSDLPKDSDIYLAKRKEAFRKYEILVELEKLLA  FSKSQGV
Subjt:  KSPLKVHKDVSMKPALPPRDSFSDLPKDSDIYLAKRKEAFRKYEILVELEKLLAPNFSKSQGV

A0A5D3BJS0 Uncharacterized protein | e value: 4.0e-126 | % identity: 89.73
Query:  MSNPVEQTQ------QAEGPKPSDVNPSDQGDQSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNHGSLPGNLKSMPRSDLCQRLISIQNLMRLSTQAT
        MSNPVE+T+      + E PKPSDVNPSD GDQSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNH SLPGNLKSMPRSDLCQRLISIQNLMRLSTQ  
Subjt:  MSNPVEQTQ------QAEGPKPSDVNPSDQGDQSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNHGSLPGNLKSMPRSDLCQRLISIQNLMRLSTQAT

Query:  GKEEI--QGDRGDQGDQGDLPHARFQRTDQWLPVYSWLESLHHDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQLPD
        GKE+I  Q D  DQGDQGDLPHARFQRTDQW+PVYSWLESL  DEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQLPD
Subjt:  GKEEI--QGDRGDQGDQGDLPHARFQRTDQWLPVYSWLESLHHDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQLPD

Query:  KSPLKVHKDVSMKPALPPRDSFSDLPKDSDIYLAKRKEAFRKYEILVELEKLLAPNFSKSQGV
        KSPLKVHKDV MKPALPPRDSFSDLPKDSDIYLAKRKEAFRKYEILVELEKLLA  FSKSQGV
Subjt:  KSPLKVHKDVSMKPALPPRDSFSDLPKDSDIYLAKRKEAFRKYEILVELEKLLAPNFSKSQGV

A0A6J1C586 uncharacterized protein LOC111008597 isoform X1 | e value: 4.0e-126 | % identity: 90.23
Query:  MSNPVEQTQQAEGPKPSDVNPSDQGDQSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNHGSLPGNLKSMPRSDLCQRLISIQNLMRLSTQATGKEEIQ
        MSNP E+T Q E PKPSD+NPSDQ D SQEWE MARAWLCSFPEA+AGSMEEVEAWIDSNH SLPGNLKSMPRSDLCQRLISIQNLMRLS Q  GKEEIQ
Subjt:  MSNPVEQTQQAEGPKPSDVNPSDQGDQSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNHGSLPGNLKSMPRSDLCQRLISIQNLMRLSTQATGKEEIQ

Query:  GDRGDQGDQGDLPHARFQRTDQWLPVYSWLESLHHDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQLPDKSPLKVHK
           GDQGDQGDLPHARFQRTDQW+PVYSWLESL H+EVVKSK+ISDWLTENP+IRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQ+PDKSPLKVHK
Subjt:  GDRGDQGDQGDLPHARFQRTDQWLPVYSWLESLHHDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQLPDKSPLKVHK

Query:  DVSMKPALPPRDSFSDLPKDSDIYLAKRKEAFRKYEILVELEKLLAPNFSKSQGVK
        DV MKPALPPRDSFSDLPKDSD+YLAKRKEAFRKYEILVELEKLLAPNFSKSQGVK
Subjt:  DVSMKPALPPRDSFSDLPKDSDIYLAKRKEAFRKYEILVELEKLLAPNFSKSQGVK

A0A6J1K2W0 uncharacterized protein LOC111490167 | e value: 6.0e-122 | % identity: 85.71
Query:  MSNPVEQTQQAEGPKPSDVNPSDQGDQSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNHGSLPGNLKSMPRSDLCQRLISIQNLMRLSTQATGKEEIQ
        MSNP EQ Q+   PKPSD+NPSDQ D SQEWE+MARAWL SFPEAKAGSMEEVEAWIDSNH SLPGNLKSMPRSDLCQRLISIQNLMRLS Q  GKEEIQ
Subjt:  MSNPVEQTQQAEGPKPSDVNPSDQGDQSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNHGSLPGNLKSMPRSDLCQRLISIQNLMRLSTQATGKEEIQ

Query:  G------DRGDQGDQGDLPHARFQRTDQWLPVYSWLESLHHDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQLPD--
        G      DR DQGDQGDLPHARFQRTDQW+PVYSWLESL  DEVVKSKEISDWLTENP+IRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQLP+  
Subjt:  G------DRGDQGDQGDLPHARFQRTDQWLPVYSWLESLHHDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQLPD--

Query:  --KSPLKVHKDVSMKPALPPRDSFSDLPKDSDIYLAKRKEAFRKYEILVELEKLLAPNFSKSQGVK
          KSPLKVHKDV MKPALPPRDSFSDLPKDSD+YLAKRKEAFRKYEILVELEKLL+ N  KS+GVK
Subjt:  --KSPLKVHKDVSMKPALPPRDSFSDLPKDSDIYLAKRKEAFRKYEILVELEKLLAPNFSKSQGVK

SwissProt top hits (columns: hit, e value, % identity, alignment)
No hits found
Arabidopsis top hits (columns: hit, e value, % identity, alignment)
AT5G50410.1 unknown protein | e value: 6.3e-63 | % identity: 53.6
Query:  VEQTQQAEGPKPSDVNPSDQGDQSQEWEIMARAWLCSFPEAKA-GSMEEVEAWIDSNHGSLPGNLKSMPRSDLCQRLISIQNLMRLSTQATGKEEIQGDR
        V+Q Q     K S+ N   + + SQEWE MARAW+ +FP+AKA  S  EVE WI +N  SLP +L+ MPRS++  RL+SIQ+ MR +  +        D+
Subjt:  VEQTQQAEGPKPSDVNPSDQGDQSQEWEIMARAWLCSFPEAKA-GSMEEVEAWIDSNHGSLPGNLKSMPRSDLCQRLISIQNLMRLSTQATGKEEIQGDR

Query:  GDQGDQGDLPH-ARFQRTDQWLPVYSWLESLHHDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQLPDKSPLKVHKDV
         +Q  + D  H ARFQRTDQWLPVYSWLESL + E+VKSK+IS+WL  NP ++ +L SRHSRYHL HY+KKCHLKILKRKEKK   +L   + ++VHK+ 
Subjt:  GDQGDQGDLPH-ARFQRTDQWLPVYSWLESLHHDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQLPDKSPLKVHKDV

Query:  SMK-PALPPRDSFSDLPKDSDIYLAKRKEAFRKYEILVELEKLLAPNFSK
          K  A    D  S++PKDSD+Y  K+KEA R++EILVELEK LAP+F+K
Subjt:  SMK-PALPPRDSFSDLPKDSDIYLAKRKEAFRKYEILVELEKLLAPNFSK


Sequences
CDS sequence
ATGTCGAATCCAGTGGAACAAACCCAGCAAGCAGAAGGCCCTAAGCCATCTGATGTAAACCCTAGCGACCAAGGCGATCAATCCCAAGAATGGGAAATCATGGCGCGAGC
TTGGCTTTGCTCCTTCCCCGAGGCCAAAGCTGGGTCCATGGAAGAGGTTGAAGCTTGGATTGACTCCAATCATGGCTCTTTACCAGGAAACCTCAAATCAATGCCCCGCT
CCGACCTTTGCCAGAGGCTGATTTCTATCCAAAATTTAATGAGACTTTCCACTCAGGCTACGGGAAAGGAAGAGATTCAGGGTGATCGAGGCGATCAGGGGGATCAGGGA
GATCTTCCACATGCTCGATTTCAACGCACCGACCAGTGGTTACCAGTTTATTCTTGGTTAGAGTCTCTACATCACGATGAGGTTGTCAAGTCTAAGGAAATATCTGATTG
GTTAACTGAAAATCCCACCATCAGAGATCAGCTGTGTTCAAGACATTCTCGTTATCATTTGATGCACTATATCAAAAAGTGCCATTTGAAGATATTGAAAAGAAAGGAAA
AGAAAAAGGGTTCTCAGCTGCCTGACAAATCTCCTTTAAAAGTTCACAAGGATGTTTCTATGAAACCAGCACTGCCTCCACGTGATTCGTTTAGCGATCTACCGAAAGAC
AGTGACATATATTTGGCAAAACGAAAGGAAGCCTTCCGAAAATATGAAATTTTAGTGGAGTTGGAGAAGTTGCTTGCCCCCAACTTTTCCAAGTCTCAAGGGGTCAAATA
A
mRNA sequence
AGAAGGTGTAACAGATACATTTTATTTTTCACACATTATCGATCAATTTAAACTCTAGAAAAAAACTCTATGGGGTATTTTTTTGAAAAAACAAAAATTTAATGTTGGAC
CAAAAATAAAAGTTTGAAACAAAATCTTGAGGTTATTTTATATGGTTTAATCGATATTTTGAAGGCGAAGTCATAGAAATGGGGAAATGAAGGAGAAAGAAGAAGAAAAC
GGATTTGGGAACTAACAAATCGACAAGCAGAATGCGACACAGGAAATCGCAATCAGAACAGCTTCTTAATCTGCAACAAGTGCGAGTAAAATGTCGAATCCAGTGGAACA
AACCCAGCAAGCAGAAGGCCCTAAGCCATCTGATGTAAACCCTAGCGACCAAGGCGATCAATCCCAAGAATGGGAAATCATGGCGCGAGCTTGGCTTTGCTCCTTCCCCG
AGGCCAAAGCTGGGTCCATGGAAGAGGTTGAAGCTTGGATTGACTCCAATCATGGCTCTTTACCAGGAAACCTCAAATCAATGCCCCGCTCCGACCTTTGCCAGAGGCTG
ATTTCTATCCAAAATTTAATGAGACTTTCCACTCAGGCTACGGGAAAGGAAGAGATTCAGGGTGATCGAGGCGATCAGGGGGATCAGGGAGATCTTCCACATGCTCGATT
TCAACGCACCGACCAGTGGTTACCAGTTTATTCTTGGTTAGAGTCTCTACATCACGATGAGGTTGTCAAGTCTAAGGAAATATCTGATTGGTTAACTGAAAATCCCACCA
TCAGAGATCAGCTGTGTTCAAGACATTCTCGTTATCATTTGATGCACTATATCAAAAAGTGCCATTTGAAGATATTGAAAAGAAAGGAAAAGAAAAAGGGTTCTCAGCTG
CCTGACAAATCTCCTTTAAAAGTTCACAAGGATGTTTCTATGAAACCAGCACTGCCTCCACGTGATTCGTTTAGCGATCTACCGAAAGACAGTGACATATATTTGGCAAA
ACGAAAGGAAGCCTTCCGAAAATATGAAATTTTAGTGGAGTTGGAGAAGTTGCTTGCCCCCAACTTTTCCAAGTCTCAAGGGGTCAAATAATTGGAGAAACTTTGGGGAC
TGCCATCACTCTTGAGTATGGGATATGGGTCCAATTCCAAGTTTGTAGTCATTGCCAGGTTTTTACCTTGTTCATTGCTGAATATTGTGGCCCCATTGCCCTGCTTTCTC
ACACCAGAATAGAGTACCAGATCTGACAAATCTATTAGAAATTTAATTTACCTTATATATTATATAGCTCCATTCCTTATAAAATATTGATATCCCTTCGATCATGTATG
GGCGTGTCTGTGGGCTCTGGTCGTCGTGACGAATTTTAGATGATAATGTGGAGAATATTTTTCTTTATGATATTTGATATTTCACTTCAACATTCATTTAATTGATAAGG
TTTTACATTTTGTAGAACATTGTAGGAGAATGTAGTGGTAAAATTAAAACCAATGCAGCATAAAATATATCATTATTTCATAAAAGAATTGACGT
Protein sequence
MSNPVEQTQQAEGPKPSDVNPSDQGDQSQEWEIMARAWLCSFPEAKAGSMEEVEAWIDSNHGSLPGNLKSMPRSDLCQRLISIQNLMRLSTQATGKEEIQGDRGDQGDQG
DLPHARFQRTDQWLPVYSWLESLHHDEVVKSKEISDWLTENPTIRDQLCSRHSRYHLMHYIKKCHLKILKRKEKKKGSQLPDKSPLKVHKDVSMKPALPPRDSFSDLPKD
SDIYLAKRKEAFRKYEILVELEKLLAPNFSKSQGVK
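
The CDS and protein sequences above are related by translation. The following is a minimal sketch of that step, assuming the standard genetic code. The names `CODON_TABLE` and `translate` are hypothetical helpers written for this illustration, not part of any CuGenDB tool.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS string into protein, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First ten codons of the CDS sequence listed above.
print(translate("ATGTCGAATCCAGTGGAACAAACCCAGCAA"))
```

The first ten codons give MSNPVEQTQQ, which matches the start of the protein sequence. Passing the full CDS, with its line breaks, to `translate` allows the same check over the whole sequence.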