; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg006769 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg006769
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionDUF4477 domain-containing protein
Genome locationscaffold1:54851660..54860174
RNA-Seq ExpressionSpg006769
SyntenySpg006769
Gene Ontology termsGO:0016020 - membrane (cellular component)
InterPro domainsIPR027951 - Domain of unknown function DUF4477


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6577429.1 hypothetical protein SDJN03_25003, partial [Cucurbita argyrosperma subsp. sororia]1.6e-8971.17Show/hide
Query:  MASFEAENLEEKLASMIDQLYLESGILQKMIYKNKNQHRRCSYFRYLLQVKRDLRLLQAAKLEELVSCCFQVIHGNKPKQKIHLLESLKRRKCEVGKYNF
        MAS  AEN +EKLASM+DQLYLE GIL KMIYKNKNQHRR SYFRYLLQV+RDLRLLQAAKLEELV+ CFQVI G KPKQKIH LESLKR KCEVGKYNF
Subjt:  MASFEAENLEEKLASMIDQLYLESGILQKMIYKNKNQHRRCSYFRYLLQVKRDLRLLQAAKLEELVSCCFQVIHGNKPKQKIHLLESLKRRKCEVGKYNF

Query:  MERLLGATRLLSEMVEPIFKAATEISILLARTFFTGFCFVILALLARIRVLVQQVIACRLWVKILLDVVSVLNMVSSMSKKKRVVRISQEGIQ-------
        ME+LLGA RLLSEMVEPIFKAATEISILLARTFFTGFCF+ILALLARIRVLVQQ         ILLDVVS+ NMV+S+SKKK VV I+QEGIQ       
Subjt:  MERLLGATRLLSEMVEPIFKAATEISILLARTFFTGFCFVILALLARIRVLVQQVIACRLWVKILLDVVSVLNMVSSMSKKKRVVRISQEGIQ-------

Query:  ---------------------KKQEVGTKNQEEHLVPNVSVATSAVRYQSIESFLGDDESAIKQEEANQGN-QGPDPMKMT
                              KQEV T NQEEH+ PNVS   SAVRYQS+ESFLGDDE A KQ EANQ N +  D MKM+
Subjt:  ---------------------KKQEVGTKNQEEHLVPNVSVATSAVRYQSIESFLGDDESAIKQEEANQGN-QGPDPMKMT

KAG7015502.1 hypothetical protein SDJN02_23138 [Cucurbita argyrosperma subsp. argyrosperma]3.3e-9071.79Show/hide
Query:  MASFEAENLEEKLASMIDQLYLESGILQKMIYKNKNQHRRCSYFRYLLQVKRDLRLLQAAKLEELVSCCFQVIHGNKPKQKIHLLESLKRRKCEVGKYNF
        MAS  AEN +EKLASM+DQLYLE GIL KMIYKNKNQHRR SYFRYLLQV+RDLRLLQAAKLEELV+ CFQVI G KPKQKIH LESLKRRKCEVGKYNF
Subjt:  MASFEAENLEEKLASMIDQLYLESGILQKMIYKNKNQHRRCSYFRYLLQVKRDLRLLQAAKLEELVSCCFQVIHGNKPKQKIHLLESLKRRKCEVGKYNF

Query:  MERLLGATRLLSEMVEPIFKAATEISILLARTFFTGFCFVILALLARIRVLVQQVIACRLWVKILLDVVSVLNMVSSMSKKKRVVRISQEGIQ-------
        ME+LLGA RLLSEMVEPIFKAATEISILLARTFFTGFCF+ILALLARIRVLVQQ         ILLDVVS+ NMV+S+SKKK VV I+QEGIQ       
Subjt:  MERLLGATRLLSEMVEPIFKAATEISILLARTFFTGFCFVILALLARIRVLVQQVIACRLWVKILLDVVSVLNMVSSMSKKKRVVRISQEGIQ-------

Query:  ---------------------KKQEVGTKNQEEHLVPNVSVATSAVRYQSIESFLGDDESAIKQEEANQGN-QGPDPMKM
                              KQEV T NQEEH+ PNVS   SAVRYQS+ESFLGDDE A KQ EANQ N +  D MKM
Subjt:  ---------------------KKQEVGTKNQEEHLVPNVSVATSAVRYQSIESFLGDDESAIKQEEANQGN-QGPDPMKM

XP_022151030.1 uncharacterized protein LOC111019049 [Momordica charantia]6.2e-8970.71Show/hide
Query:  MASFEAENLEEKLASMIDQLYLESGILQKMIYKNKNQHRRCSYFRYLLQVKRDLRLLQAAKLEELVSCCFQVIHGNKPKQKIHLLESLKRRKCEVGKYNF
        MA+ +AENLEEKLAS++ QL+LESGIL KMIYKNKNQHRR SYFRYLLQV+RDLRLLQA KLEELVS CFQVI G KPKQKIHLLESLKRRKCEVGKYNF
Subjt:  MASFEAENLEEKLASMIDQLYLESGILQKMIYKNKNQHRRCSYFRYLLQVKRDLRLLQAAKLEELVSCCFQVIHGNKPKQKIHLLESLKRRKCEVGKYNF

Query:  MERLLGATRLLSEMVEPIFKAATEISILLARTFFTGFCFVILALLARIRVLVQQVIACRLWVKILLDVVSVLNMVSSMSKKKRVVRISQEGI--------
        MERLLGA RLLSEMVEPIFKAATEISILLAR FFTGFCF+ILALLARIRVLVQQ         ILL+VVSV NMVSS+S+KK +VRI+QEGI        
Subjt:  MERLLGATRLLSEMVEPIFKAATEISILLARTFFTGFCFVILALLARIRVLVQQVIACRLWVKILLDVVSVLNMVSSMSKKKRVVRISQEGI--------

Query:  --------------------QKKQEVGTKNQEEHLVPNVSVATSAVRYQSIESFLGDDESAIKQEEANQGNQGPDPMKMT
                            + KQ++ T+N EEH  P+VS ATSAVRYQSIESFL DDES IKQ +ANQ  +G D MKM+
Subjt:  --------------------QKKQEVGTKNQEEHLVPNVSVATSAVRYQSIESFLGDDESAIKQEEANQGNQGPDPMKMT

XP_022985227.1 uncharacterized protein LOC111483286 [Cucurbita maxima]7.3e-9071.17Show/hide
Query:  MASFEAENLEEKLASMIDQLYLESGILQKMIYKNKNQHRRCSYFRYLLQVKRDLRLLQAAKLEELVSCCFQVIHGNKPKQKIHLLESLKRRKCEVGKYNF
        MAS EAEN +EKLASM+DQL LESGIL KMIYKNKNQHRR  YFRYLLQV+RDLRLLQAAKL+EL+S CFQVI G KPKQKIH LESLKRRKCEVGKYNF
Subjt:  MASFEAENLEEKLASMIDQLYLESGILQKMIYKNKNQHRRCSYFRYLLQVKRDLRLLQAAKLEELVSCCFQVIHGNKPKQKIHLLESLKRRKCEVGKYNF

Query:  MERLLGATRLLSEMVEPIFKAATEISILLARTFFTGFCFVILALLARIRVLVQQVIACRLWVKILLDVVSVLNMVSSMSKKKRVVRISQEGIQ-------
        ME+LLGA+RLLSEMVEPIFKAATEISILLARTFFTGFCF+ILALLARI VLVQQ         ILLDVVS+ NMV+S+SKKK VV I+QEGIQ       
Subjt:  MERLLGATRLLSEMVEPIFKAATEISILLARTFFTGFCFVILALLARIRVLVQQVIACRLWVKILLDVVSVLNMVSSMSKKKRVVRISQEGIQ-------

Query:  ---------------------KKQEVGTKNQEEHLVPNVSVATSAVRYQSIESFLGDDESAIKQEEANQGNQ-GPDPMKMT
                              KQEV T NQEEH+ PNVS A S VRYQS+ESFLGDDE A KQ EANQ N+ G D MKM+
Subjt:  ---------------------KKQEVGTKNQEEHLVPNVSVATSAVRYQSIESFLGDDESAIKQEEANQGNQ-GPDPMKMT

XP_023520444.1 uncharacterized protein LOC111783829 [Cucurbita pepo subsp. pepo]5.6e-9071.17Show/hide
Query:  MASFEAENLEEKLASMIDQLYLESGILQKMIYKNKNQHRRCSYFRYLLQVKRDLRLLQAAKLEELVSCCFQVIHGNKPKQKIHLLESLKRRKCEVGKYNF
        MAS  AEN +EKLASM+DQLYLE GIL KMIYKNKNQHRR SYFRYLLQV+RDLRLLQAAKLEELV+ CFQVI G KPKQKIH LESLKRRKCEVGKYNF
Subjt:  MASFEAENLEEKLASMIDQLYLESGILQKMIYKNKNQHRRCSYFRYLLQVKRDLRLLQAAKLEELVSCCFQVIHGNKPKQKIHLLESLKRRKCEVGKYNF

Query:  MERLLGATRLLSEMVEPIFKAATEISILLARTFFTGFCFVILALLARIRVLVQQVIACRLWVKILLDVVSVLNMVSSMSKKKRVVRISQEGIQ-------
        ME+LLGA RLLSEMVEPIFKAATEISILLARTFFTGFCF+ILALLARIRVLVQQ         ILLDVVS+ NMV+S+SKKK VV I+QEGIQ       
Subjt:  MERLLGATRLLSEMVEPIFKAATEISILLARTFFTGFCFVILALLARIRVLVQQVIACRLWVKILLDVVSVLNMVSSMSKKKRVVRISQEGIQ-------

Query:  ---------------------KKQEVGTKNQEEHLVPNVSVATSAVRYQSIESFLGDDESAIKQEEANQGN-QGPDPMKMT
                              KQEV T NQEEH+ PNVS A S VRYQS++SFLGDDE A KQ EANQ N +  D MKM+
Subjt:  ---------------------KKQEVGTKNQEEHLVPNVSVATSAVRYQSIESFLGDDESAIKQEEANQGN-QGPDPMKMT

TrEMBL top hitse value%identityAlignment
A0A6J1DA39 uncharacterized protein LOC1110190493.0e-8970.71Show/hide
Query:  MASFEAENLEEKLASMIDQLYLESGILQKMIYKNKNQHRRCSYFRYLLQVKRDLRLLQAAKLEELVSCCFQVIHGNKPKQKIHLLESLKRRKCEVGKYNF
        MA+ +AENLEEKLAS++ QL+LESGIL KMIYKNKNQHRR SYFRYLLQV+RDLRLLQA KLEELVS CFQVI G KPKQKIHLLESLKRRKCEVGKYNF
Subjt:  MASFEAENLEEKLASMIDQLYLESGILQKMIYKNKNQHRRCSYFRYLLQVKRDLRLLQAAKLEELVSCCFQVIHGNKPKQKIHLLESLKRRKCEVGKYNF

Query:  MERLLGATRLLSEMVEPIFKAATEISILLARTFFTGFCFVILALLARIRVLVQQVIACRLWVKILLDVVSVLNMVSSMSKKKRVVRISQEGI--------
        MERLLGA RLLSEMVEPIFKAATEISILLAR FFTGFCF+ILALLARIRVLVQQ         ILL+VVSV NMVSS+S+KK +VRI+QEGI        
Subjt:  MERLLGATRLLSEMVEPIFKAATEISILLARTFFTGFCFVILALLARIRVLVQQVIACRLWVKILLDVVSVLNMVSSMSKKKRVVRISQEGI--------

Query:  --------------------QKKQEVGTKNQEEHLVPNVSVATSAVRYQSIESFLGDDESAIKQEEANQGNQGPDPMKMT
                            + KQ++ T+N EEH  P+VS ATSAVRYQSIESFL DDES IKQ +ANQ  +G D MKM+
Subjt:  --------------------QKKQEVGTKNQEEHLVPNVSVATSAVRYQSIESFLGDDESAIKQEEANQGNQGPDPMKMT

A0A6J1DSS0 uncharacterized protein LOC111023972 isoform X29.9e-8568.21Show/hide
Query:  MASFEAENLEEKLASMIDQLYLESGILQKMIYKNKNQHRRCSYFRYLLQVKRDLRLLQAAKLEELVSCCFQVIHGNKPKQKIHLLESLKRRKCEVGKYNF
        MAS EAEN EEKL S++ QL+LESGIL KMIYKNKNQHRR SYFRYLLQV RDLRLLQA KLE+LVS CFQVI G KPKQKIHLLESLKRRKCEVGKYNF
Subjt:  MASFEAENLEEKLASMIDQLYLESGILQKMIYKNKNQHRRCSYFRYLLQVKRDLRLLQAAKLEELVSCCFQVIHGNKPKQKIHLLESLKRRKCEVGKYNF

Query:  MERLLGATRLLSEMVEPIFKAATEISILLARTFFTGFCFVILALLARIRVLVQQVIACRLWVKILLDVVSVLNMVSSMSKKKRVVRISQEGIQ-------
        MERLLGA RLLSEMVEPIFKAATEIS LLAR FFTGFCF+ILALLARIRVLVQQ         IL+DVVSV NMVSS+S+KK  V I+QEGIQ       
Subjt:  MERLLGATRLLSEMVEPIFKAATEISILLARTFFTGFCFVILALLARIRVLVQQVIACRLWVKILLDVVSVLNMVSSMSKKKRVVRISQEGIQ-------

Query:  ---------------------KKQEVGTKNQEEHLVPNVSVATSAVRYQSIESFLGDDESAIKQEEANQGNQGPDPMKMT
                              KQ   ++N +E+L P+VS++TSA++Y SIESFL DDESAIKQ E NQ  +G D MKM+
Subjt:  ---------------------KKQEVGTKNQEEHLVPNVSVATSAVRYQSIESFLGDDESAIKQEEANQGNQGPDPMKMT

A0A6J1DTY3 uncharacterized protein LOC111023972 isoform X15.8e-7756.85Show/hide
Query:  MASFEAENLEEKLASMIDQLYLESGILQKMIYKNKNQHRRCSYFRYLLQVKRDLRLLQAAKLEELVSCCFQVIHGNKPKQKIHLLESLKRRKCEVGKYNF
        MAS EAEN EEKL S++ QL+LESGIL KMIYKNKNQHRR SYFRYLLQV RDLRLLQA KLE+LVS CFQVI G KPKQKIHLLESLKRRKCEVGKYNF
Subjt:  MASFEAENLEEKLASMIDQLYLESGILQKMIYKNKNQHRRCSYFRYLLQVKRDLRLLQAAKLEELVSCCFQVIHGNKPKQKIHLLESLKRRKCEVGKYNF

Query:  MERLLGATRLLSEMVEPIFKAAT--------------------------------------------------------EISILLARTFFTGFCFVILAL
        MERLLGA RLLSEMVEPIFKAAT                                                        EIS LLAR FFTGFCF+ILAL
Subjt:  MERLLGATRLLSEMVEPIFKAAT--------------------------------------------------------EISILLARTFFTGFCFVILAL

Query:  LARIRVLVQQVIACRLWVKILLDVVSVLNMVSSMSKKKRVVRISQEGIQ----------------------------KKQEVGTKNQEEHLVPNVSVATS
        LARIRVLVQQ         IL+DVVSV NMVSS+S+KK  V I+QEGIQ                             KQ   ++N +E+L P+VS++TS
Subjt:  LARIRVLVQQVIACRLWVKILLDVVSVLNMVSSMSKKKRVVRISQEGIQ----------------------------KKQEVGTKNQEEHLVPNVSVATS

Query:  AVRYQSIESFLGDDESAIKQEEANQGNQGPDPMKMT
        A++Y SIESFL DDESAIKQ E NQ  +G D MKM+
Subjt:  AVRYQSIESFLGDDESAIKQEEANQGNQGPDPMKMT

A0A6J1F0Y0 uncharacterized protein LOC1114384935.1e-8970.46Show/hide
Query:  MASFEAENLEEKLASMIDQLYLESGILQKMIYKNKNQHRRCSYFRYLLQVKRDLRLLQAAKLEELVSCCFQVIHGNKPKQKIHLLESLKRRKCEVGKYNF
        MAS  AEN +EKLASM+DQLYLE GIL KMIYKNKNQHRR SYF+YLLQV+RDLRLLQAAKLEELV+ CFQVI G KPKQKIH LESLKRRKCEVGKYNF
Subjt:  MASFEAENLEEKLASMIDQLYLESGILQKMIYKNKNQHRRCSYFRYLLQVKRDLRLLQAAKLEELVSCCFQVIHGNKPKQKIHLLESLKRRKCEVGKYNF

Query:  MERLLGATRLLSEMVEPIFKAATEISILLARTFFTGFCFVILALLARIRVLVQQVIACRLWVKILLDVVSVLNMVSSMSKKKRVVRISQEGIQ-------
        ME+LLGA RLLSEMVEPIFKAATEISILLARTFFTGFCF+ILALLARIRVLVQQ         ILLDVVS+ NMV+S+SKKK VV I+QEGIQ       
Subjt:  MERLLGATRLLSEMVEPIFKAATEISILLARTFFTGFCFVILALLARIRVLVQQVIACRLWVKILLDVVSVLNMVSSMSKKKRVVRISQEGIQ-------

Query:  ---------------------KKQEVGTKNQEEHLVPNVSVATSAVRYQSIESFLGDDESAIKQEEANQGN-QGPDPMKMT
                              KQEV T NQEEH+ PNVS A S V YQS+ESFLGD+E A KQ EANQ N +  D MKM+
Subjt:  ---------------------KKQEVGTKNQEEHLVPNVSVATSAVRYQSIESFLGDDESAIKQEEANQGN-QGPDPMKMT

A0A6J1J4A8 uncharacterized protein LOC1114832863.5e-9071.17Show/hide
Query:  MASFEAENLEEKLASMIDQLYLESGILQKMIYKNKNQHRRCSYFRYLLQVKRDLRLLQAAKLEELVSCCFQVIHGNKPKQKIHLLESLKRRKCEVGKYNF
        MAS EAEN +EKLASM+DQL LESGIL KMIYKNKNQHRR  YFRYLLQV+RDLRLLQAAKL+EL+S CFQVI G KPKQKIH LESLKRRKCEVGKYNF
Subjt:  MASFEAENLEEKLASMIDQLYLESGILQKMIYKNKNQHRRCSYFRYLLQVKRDLRLLQAAKLEELVSCCFQVIHGNKPKQKIHLLESLKRRKCEVGKYNF

Query:  MERLLGATRLLSEMVEPIFKAATEISILLARTFFTGFCFVILALLARIRVLVQQVIACRLWVKILLDVVSVLNMVSSMSKKKRVVRISQEGIQ-------
        ME+LLGA+RLLSEMVEPIFKAATEISILLARTFFTGFCF+ILALLARI VLVQQ         ILLDVVS+ NMV+S+SKKK VV I+QEGIQ       
Subjt:  MERLLGATRLLSEMVEPIFKAATEISILLARTFFTGFCFVILALLARIRVLVQQVIACRLWVKILLDVVSVLNMVSSMSKKKRVVRISQEGIQ-------

Query:  ---------------------KKQEVGTKNQEEHLVPNVSVATSAVRYQSIESFLGDDESAIKQEEANQGNQ-GPDPMKMT
                              KQEV T NQEEH+ PNVS A S VRYQS+ESFLGDDE A KQ EANQ N+ G D MKM+
Subjt:  ---------------------KKQEVGTKNQEEHLVPNVSVATSAVRYQSIESFLGDDESAIKQEEANQGNQ-GPDPMKMT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G50910.1 unknown protein1.2e-5049.38Show/hide
Query:  MASFEAENLEEKLASMIDQLYLESGILQKMIYKNKNQHRRCSYFRYLLQVKRDLRLLQAAKLEELVSCCFQVIHGNKPKQKIHLLESLKRRKCEVGKYNF
        M   + + LEEKL S + QL LE  + ++M+YKNKNQHRRCSYF+YLL+V+R+LRLL+ A +E ++  CF VI G   KQKIH+LESLK +K + GK N 
Subjt:  MASFEAENLEEKLASMIDQLYLESGILQKMIYKNKNQHRRCSYFRYLLQVKRDLRLLQAAKLEELVSCCFQVIHGNKPKQKIHLLESLKRRKCEVGKYNF

Query:  MERLLGATRLLSEMVEPIFKAATEISILLARTFFTGFCFVILALLARIRVLVQQVIACRLWVKILLDVVSVLNMVSSMSKKKRVVRISQEGIQKKQEVGT
        +ERLLGA RLLS+M EPI KAA+ IS LLAR+FF GF    LALLAR+RVLVQQ         ILLD VSV N V+S S KK+ V+I+Q+G++  +E   
Subjt:  MERLLGATRLLSEMVEPIFKAATEISILLARTFFTGFCFVILALLARIRVLVQQVIACRLWVKILLDVVSVLNMVSSMSKKKRVVRISQEGIQKKQEVGT

Query:  KNQEEHLVPNVSVATSAVRYQSIESFLGDDESAIKQEEANQ
        K  EE  V  +       +Y  +E+    + S   ++  ++
Subjt:  KNQEEHLVPNVSVATSAVRYQSIESFLGDDESAIKQEEANQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCTTTTGAAGCTGAGAACCTTGAGGAGAAGTTGGCTTCCATGATTGACCAGCTCTATTTAGAAAGTGGCATTCTACAAAAAATGATTTACAAGAACAAGAACCA
GCACCGTCGGTGTTCCTATTTTCGATACCTTTTGCAGGTAAAGAGGGATTTAAGACTTCTACAGGCTGCCAAGTTGGAGGAGTTGGTAAGTTGCTGTTTTCAAGTTATCC
ATGGAAATAAACCTAAGCAAAAGATTCATCTTTTAGAAAGTTTGAAACGGAGAAAATGTGAAGTTGGGAAATATAATTTCATGGAACGGCTTCTGGGAGCTACACGCCTA
CTGTCAGAGATGGTGGAGCCAATTTTTAAGGCAGCAACTGAGATATCTATCTTGCTTGCTCGAACATTTTTCACAGGGTTTTGCTTTGTAATTTTGGCATTACTGGCACG
TATTCGGGTGCTAGTTCAACAAGTAATTGCCTGCAGACTATGGGTCAAGATATTACTGGATGTTGTTTCAGTATTGAACATGGTTTCGTCTATGTCCAAAAAGAAGCGTG
TAGTTAGAATAAGCCAGGAAGGAATTCAGAAAAAACAAGAAGTTGGGACTAAGAATCAGGAAGAACATCTTGTGCCAAATGTTTCTGTGGCCACGTCAGCTGTGCGGTAT
CAGAGCATTGAATCTTTTCTTGGAGATGATGAATCTGCTATCAAGCAGGAAGAGGCAAATCAAGGCAACCAAGGACCTGATCCGATGAAAATGACTTGCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCTTTTGAAGCTGAGAACCTTGAGGAGAAGTTGGCTTCCATGATTGACCAGCTCTATTTAGAAAGTGGCATTCTACAAAAAATGATTTACAAGAACAAGAACCA
GCACCGTCGGTGTTCCTATTTTCGATACCTTTTGCAGGTAAAGAGGGATTTAAGACTTCTACAGGCTGCCAAGTTGGAGGAGTTGGTAAGTTGCTGTTTTCAAGTTATCC
ATGGAAATAAACCTAAGCAAAAGATTCATCTTTTAGAAAGTTTGAAACGGAGAAAATGTGAAGTTGGGAAATATAATTTCATGGAACGGCTTCTGGGAGCTACACGCCTA
CTGTCAGAGATGGTGGAGCCAATTTTTAAGGCAGCAACTGAGATATCTATCTTGCTTGCTCGAACATTTTTCACAGGGTTTTGCTTTGTAATTTTGGCATTACTGGCACG
TATTCGGGTGCTAGTTCAACAAGTAATTGCCTGCAGACTATGGGTCAAGATATTACTGGATGTTGTTTCAGTATTGAACATGGTTTCGTCTATGTCCAAAAAGAAGCGTG
TAGTTAGAATAAGCCAGGAAGGAATTCAGAAAAAACAAGAAGTTGGGACTAAGAATCAGGAAGAACATCTTGTGCCAAATGTTTCTGTGGCCACGTCAGCTGTGCGGTAT
CAGAGCATTGAATCTTTTCTTGGAGATGATGAATCTGCTATCAAGCAGGAAGAGGCAAATCAAGGCAACCAAGGACCTGATCCGATGAAAATGACTTGCTAG
Protein sequenceShow/hide protein sequence
MASFEAENLEEKLASMIDQLYLESGILQKMIYKNKNQHRRCSYFRYLLQVKRDLRLLQAAKLEELVSCCFQVIHGNKPKQKIHLLESLKRRKCEVGKYNFMERLLGATRL
LSEMVEPIFKAATEISILLARTFFTGFCFVILALLARIRVLVQQVIACRLWVKILLDVVSVLNMVSSMSKKKRVVRISQEGIQKKQEVGTKNQEEHLVPNVSVATSAVRY
QSIESFLGDDESAIKQEEANQGNQGPDPMKMTC