; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg007970 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg007970
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionProtein of unknown function (DUF604)
Genome locationscaffold4:7027589..7028330
RNA-Seq ExpressionSpg007970
SyntenySpg007970
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0008375 - acetylglucosaminyltransferase activity (molecular function)
InterPro domainsIPR006740 - Protein of unknown function DUF604


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK07087.1 uncharacterized protein E5676_scaffold13G004050 [Cucumis melo var. makuwa]1.0e-10590.43Show/hide
Query:  MRGNLFGFLSAHALSPIVSIHHLDATDPIFPNMNNTQALHHLFEAVNVDPGRIFQQTVCYDRSHSLTISVSWGFAIQVFEGNHLLPDLLTLQRTFTSWRR
        MRGNLFG LSAHALSPI+S+HHLDA DPIFPNMNNTQALHHLFEAVNVDPGRIFQQ VCYDRSHSLTISVSWGFAIQVFEGN LLPDLL+LQRTF SWRR
Subjt:  MRGNLFGFLSAHALSPIVSIHHLDATDPIFPNMNNTQALHHLFEAVNVDPGRIFQQTVCYDRSHSLTISVSWGFAIQVFEGNHLLPDLLTLQRTFTSWRR

Query:  AAIIDANKYLFNMRNYPKDPCKRNIFYMQNLKSSKNNVLTNYTRKSVTDCPASGAIKNLRQVRVFSPKLELDVEEMKAPRRQCCDIISASKESMLLEIRQ
        AA IDAN+YLFN+R YPKDPCKRNIFYMQNL++SKNN LTNYTRK VTDCPASGAIKNL Q+RVFS KLELDVEEMKAPRRQCCDIIS+SKESMLLEIRQ
Subjt:  AAIIDANKYLFNMRNYPKDPCKRNIFYMQNLKSSKNNVLTNYTRKSVTDCPASGAIKNLRQVRVFSPKLELDVEEMKAPRRQCCDIISASKESMLLEIRQ

Query:  CGVEELISM
        CGVEELISM
Subjt:  CGVEELISM

XP_008455407.1 PREDICTED: uncharacterized protein LOC103495576 [Cucumis melo]7.1e-10790.05Show/hide
Query:  MRGNLFGFLSAHALSPIVSIHHLDATDPIFPNMNNTQALHHLFEAVNVDPGRIFQQTVCYDRSHSLTISVSWGFAIQVFEGNHLLPDLLTLQRTFTSWRR
        MRGNLFG LSAHALSPI+++HHLDA DPIFPNMNNTQALHHLFEAVNVDPGRIFQQ VCYDRSHSLTISVSWGFAIQVFEGN LLPDLL+LQRTF SWRR
Subjt:  MRGNLFGFLSAHALSPIVSIHHLDATDPIFPNMNNTQALHHLFEAVNVDPGRIFQQTVCYDRSHSLTISVSWGFAIQVFEGNHLLPDLLTLQRTFTSWRR

Query:  AAIIDANKYLFNMRNYPKDPCKRNIFYMQNLKSSKNNVLTNYTRKSVTDCPASGAIKNLRQVRVFSPKLELDVEEMKAPRRQCCDIISASKESMLLEIRQ
        AA IDAN+YLFN+R YPKDPCKRNIFYMQNL++SKNN LTNYTRK VTDCPASGAIKNL Q+RVFS KLELDVEEMKAPRRQCCDIIS+SKESMLLEIRQ
Subjt:  AAIIDANKYLFNMRNYPKDPCKRNIFYMQNLKSSKNNVLTNYTRKSVTDCPASGAIKNLRQVRVFSPKLELDVEEMKAPRRQCCDIISASKESMLLEIRQ

Query:  CGVEELISMYF
        CGVEELISMYF
Subjt:  CGVEELISMYF

XP_022969069.1 uncharacterized protein LOC111468179 [Cucurbita maxima]1.2e-10689.05Show/hide
Query:  MRGNLFGFLSAHALSPIVSIHHLDATDPIFPNMNNTQALHHLFEAVNVDPGRIFQQTVCYDRSHSLTISVSWGFAIQVFEGNHLLPDLLTLQRTFTSWRR
        MRGNLFG LSAHALSPIVS+HHLDATDPIFPNMNNTQA+HHLFEAVNVDPGRIFQQ VCYDRSHSLTISVSWGFAIQVFEGN LLPDLL+LQRTF  WRR
Subjt:  MRGNLFGFLSAHALSPIVSIHHLDATDPIFPNMNNTQALHHLFEAVNVDPGRIFQQTVCYDRSHSLTISVSWGFAIQVFEGNHLLPDLLTLQRTFTSWRR

Query:  AAIIDANKYLFNMRNYPKDPCKRNIFYMQNLKSSKNNVLTNYTRKSVTDCPASGAIKNLRQVRVFSPKLELDVEEMKAPRRQCCDIISASKESMLLEIRQ
        AA +DAN+Y+FNMR YPKDPCKRNIFYMQNL+SSKNN LTNYTRK VTDCPASGA+KNL+Q+RVFS KLELDVEEMKAPRRQCCD+IS+SKESMLLE+RQ
Subjt:  AAIIDANKYLFNMRNYPKDPCKRNIFYMQNLKSSKNNVLTNYTRKSVTDCPASGAIKNLRQVRVFSPKLELDVEEMKAPRRQCCDIISASKESMLLEIRQ

Query:  CGVEELISMY
        CGVEELISMY
Subjt:  CGVEELISMY

XP_031745247.1 uncharacterized protein LOC101222721 [Cucumis sativus]2.1e-10690.05Show/hide
Query:  MRGNLFGFLSAHALSPIVSIHHLDATDPIFPNMNNTQALHHLFEAVNVDPGRIFQQTVCYDRSHSLTISVSWGFAIQVFEGNHLLPDLLTLQRTFTSWRR
        MRGNL G LSAHALSPIVS+HHLDA DPIFPNMNNTQAL+HLFEAVNVDPGR+FQQ VCYDRSHSLTISVSWGFAIQVFEGN LLPDLL+LQRTFTSWRR
Subjt:  MRGNLFGFLSAHALSPIVSIHHLDATDPIFPNMNNTQALHHLFEAVNVDPGRIFQQTVCYDRSHSLTISVSWGFAIQVFEGNHLLPDLLTLQRTFTSWRR

Query:  AAIIDANKYLFNMRNYPKDPCKRNIFYMQNLKSSKNNVLTNYTRKSVTDCPASGAIKNLRQVRVFSPKLELDVEEMKAPRRQCCDIISASKESMLLEIRQ
        AA IDAN+YLFNMR YPKDPCKRNIFYMQNL+ SKNN LTNYTRK VTDCPASGAIKNL Q+RVFS KLELDVEEMKAPRRQCCDIIS+SKESMLLEIRQ
Subjt:  AAIIDANKYLFNMRNYPKDPCKRNIFYMQNLKSSKNNVLTNYTRKSVTDCPASGAIKNLRQVRVFSPKLELDVEEMKAPRRQCCDIISASKESMLLEIRQ

Query:  CGVEELISMYF
        CGVEELI+MYF
Subjt:  CGVEELISMYF

XP_038886838.1 uncharacterized protein LOC120077061 [Benincasa hispida]2.1e-10690.05Show/hide
Query:  MRGNLFGFLSAHALSPIVSIHHLDATDPIFPNMNNTQALHHLFEAVNVDPGRIFQQTVCYDRSHSLTISVSWGFAIQVFEGNHLLPDLLTLQRTFTSWRR
        MRGNLFG LSAHALSPIVS+HHLDATDPIFPNMNNTQALHHLFEAVNVDPGRIFQQ VCYDRSHSLTISVSWGFAIQVFEGN LLPDLL+LQRTF SWRR
Subjt:  MRGNLFGFLSAHALSPIVSIHHLDATDPIFPNMNNTQALHHLFEAVNVDPGRIFQQTVCYDRSHSLTISVSWGFAIQVFEGNHLLPDLLTLQRTFTSWRR

Query:  AAIIDANKYLFNMRNYPKDPCKRNIFYMQNLKSSKNNVLTNYTRKSVTDCPASGAIKNLRQVRVFSPKLELDVEEMKAPRRQCCDIISASKESMLLEIRQ
        AA ID N+Y+FNMR+YPKDPCKRNIFYMQNL+SSKNN LTNYTRK VTDCP S AIKNLRQ+RVFS KLEL+VEEMKAPRRQCCDI+SASKESMLLEIRQ
Subjt:  AAIIDANKYLFNMRNYPKDPCKRNIFYMQNLKSSKNNVLTNYTRKSVTDCPASGAIKNLRQVRVFSPKLELDVEEMKAPRRQCCDIISASKESMLLEIRQ

Query:  CGVEELISMYF
        C VEELISMYF
Subjt:  CGVEELISMYF

TrEMBL top hitse value%identityAlignment
A0A0A0K5K9 Uncharacterized protein1.0e-10690.05Show/hide
Query:  MRGNLFGFLSAHALSPIVSIHHLDATDPIFPNMNNTQALHHLFEAVNVDPGRIFQQTVCYDRSHSLTISVSWGFAIQVFEGNHLLPDLLTLQRTFTSWRR
        MRGNL G LSAHALSPIVS+HHLDA DPIFPNMNNTQAL+HLFEAVNVDPGR+FQQ VCYDRSHSLTISVSWGFAIQVFEGN LLPDLL+LQRTFTSWRR
Subjt:  MRGNLFGFLSAHALSPIVSIHHLDATDPIFPNMNNTQALHHLFEAVNVDPGRIFQQTVCYDRSHSLTISVSWGFAIQVFEGNHLLPDLLTLQRTFTSWRR

Query:  AAIIDANKYLFNMRNYPKDPCKRNIFYMQNLKSSKNNVLTNYTRKSVTDCPASGAIKNLRQVRVFSPKLELDVEEMKAPRRQCCDIISASKESMLLEIRQ
        AA IDAN+YLFNMR YPKDPCKRNIFYMQNL+ SKNN LTNYTRK VTDCPASGAIKNL Q+RVFS KLELDVEEMKAPRRQCCDIIS+SKESMLLEIRQ
Subjt:  AAIIDANKYLFNMRNYPKDPCKRNIFYMQNLKSSKNNVLTNYTRKSVTDCPASGAIKNLRQVRVFSPKLELDVEEMKAPRRQCCDIISASKESMLLEIRQ

Query:  CGVEELISMYF
        CGVEELI+MYF
Subjt:  CGVEELISMYF

A0A1S3C0Z6 uncharacterized protein LOC1034955763.5e-10790.05Show/hide
Query:  MRGNLFGFLSAHALSPIVSIHHLDATDPIFPNMNNTQALHHLFEAVNVDPGRIFQQTVCYDRSHSLTISVSWGFAIQVFEGNHLLPDLLTLQRTFTSWRR
        MRGNLFG LSAHALSPI+++HHLDA DPIFPNMNNTQALHHLFEAVNVDPGRIFQQ VCYDRSHSLTISVSWGFAIQVFEGN LLPDLL+LQRTF SWRR
Subjt:  MRGNLFGFLSAHALSPIVSIHHLDATDPIFPNMNNTQALHHLFEAVNVDPGRIFQQTVCYDRSHSLTISVSWGFAIQVFEGNHLLPDLLTLQRTFTSWRR

Query:  AAIIDANKYLFNMRNYPKDPCKRNIFYMQNLKSSKNNVLTNYTRKSVTDCPASGAIKNLRQVRVFSPKLELDVEEMKAPRRQCCDIISASKESMLLEIRQ
        AA IDAN+YLFN+R YPKDPCKRNIFYMQNL++SKNN LTNYTRK VTDCPASGAIKNL Q+RVFS KLELDVEEMKAPRRQCCDIIS+SKESMLLEIRQ
Subjt:  AAIIDANKYLFNMRNYPKDPCKRNIFYMQNLKSSKNNVLTNYTRKSVTDCPASGAIKNLRQVRVFSPKLELDVEEMKAPRRQCCDIISASKESMLLEIRQ

Query:  CGVEELISMYF
        CGVEELISMYF
Subjt:  CGVEELISMYF

A0A5A7TF18 Uncharacterized protein8.5e-10689.52Show/hide
Query:  MRGNLFGFLSAHALSPIVSIHHLDATDPIFPNMNNTQALHHLFEAVNVDPGRIFQQTVCYDRSHSLTISVSWGFAIQVFEGNHLLPDLLTLQRTFTSWRR
        MRGNLFG LSAHALSPI+S+HHLDA DPIFPNMNNTQALHHLFEAVNVDPGRIFQQ VCYDRSHSLTISVSWGFAIQVFEGN LLPDLL+LQRTF SWRR
Subjt:  MRGNLFGFLSAHALSPIVSIHHLDATDPIFPNMNNTQALHHLFEAVNVDPGRIFQQTVCYDRSHSLTISVSWGFAIQVFEGNHLLPDLLTLQRTFTSWRR

Query:  AAIIDANKYLFNMRNYPKDPCKRNIFYMQNLKSSKNNVLTNYTRKSVTDCPASGAIKNLRQVRVFSPKLELDVEEMKAPRRQCCDIISASKESMLLEIRQ
        AA IDAN+YLFN+R YPKDPCKRNIFYMQNL++SKNN LTNYTRK VTDCPASGAIKNL Q+RVFS KLELDVEEMKAPRRQCCDIIS+SKESMLLEIRQ
Subjt:  AAIIDANKYLFNMRNYPKDPCKRNIFYMQNLKSSKNNVLTNYTRKSVTDCPASGAIKNLRQVRVFSPKLELDVEEMKAPRRQCCDIISASKESMLLEIRQ

Query:  CGVEELISMY
        CGVEELIS++
Subjt:  CGVEELISMY

A0A5D3C5I9 Uncharacterized protein5.0e-10690.43Show/hide
Query:  MRGNLFGFLSAHALSPIVSIHHLDATDPIFPNMNNTQALHHLFEAVNVDPGRIFQQTVCYDRSHSLTISVSWGFAIQVFEGNHLLPDLLTLQRTFTSWRR
        MRGNLFG LSAHALSPI+S+HHLDA DPIFPNMNNTQALHHLFEAVNVDPGRIFQQ VCYDRSHSLTISVSWGFAIQVFEGN LLPDLL+LQRTF SWRR
Subjt:  MRGNLFGFLSAHALSPIVSIHHLDATDPIFPNMNNTQALHHLFEAVNVDPGRIFQQTVCYDRSHSLTISVSWGFAIQVFEGNHLLPDLLTLQRTFTSWRR

Query:  AAIIDANKYLFNMRNYPKDPCKRNIFYMQNLKSSKNNVLTNYTRKSVTDCPASGAIKNLRQVRVFSPKLELDVEEMKAPRRQCCDIISASKESMLLEIRQ
        AA IDAN+YLFN+R YPKDPCKRNIFYMQNL++SKNN LTNYTRK VTDCPASGAIKNL Q+RVFS KLELDVEEMKAPRRQCCDIIS+SKESMLLEIRQ
Subjt:  AAIIDANKYLFNMRNYPKDPCKRNIFYMQNLKSSKNNVLTNYTRKSVTDCPASGAIKNLRQVRVFSPKLELDVEEMKAPRRQCCDIISASKESMLLEIRQ

Query:  CGVEELISM
        CGVEELISM
Subjt:  CGVEELISM

A0A6J1I1H7 uncharacterized protein LOC1114681795.9e-10789.05Show/hide
Query:  MRGNLFGFLSAHALSPIVSIHHLDATDPIFPNMNNTQALHHLFEAVNVDPGRIFQQTVCYDRSHSLTISVSWGFAIQVFEGNHLLPDLLTLQRTFTSWRR
        MRGNLFG LSAHALSPIVS+HHLDATDPIFPNMNNTQA+HHLFEAVNVDPGRIFQQ VCYDRSHSLTISVSWGFAIQVFEGN LLPDLL+LQRTF  WRR
Subjt:  MRGNLFGFLSAHALSPIVSIHHLDATDPIFPNMNNTQALHHLFEAVNVDPGRIFQQTVCYDRSHSLTISVSWGFAIQVFEGNHLLPDLLTLQRTFTSWRR

Query:  AAIIDANKYLFNMRNYPKDPCKRNIFYMQNLKSSKNNVLTNYTRKSVTDCPASGAIKNLRQVRVFSPKLELDVEEMKAPRRQCCDIISASKESMLLEIRQ
        AA +DAN+Y+FNMR YPKDPCKRNIFYMQNL+SSKNN LTNYTRK VTDCPASGA+KNL+Q+RVFS KLELDVEEMKAPRRQCCD+IS+SKESMLLE+RQ
Subjt:  AAIIDANKYLFNMRNYPKDPCKRNIFYMQNLKSSKNNVLTNYTRKSVTDCPASGAIKNLRQVRVFSPKLELDVEEMKAPRRQCCDIISASKESMLLEIRQ

Query:  CGVEELISMY
        CGVEELISMY
Subjt:  CGVEELISMY

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G05280.1 Protein of unknown function (DUF604)4.5e-3541.44Show/hide
Query:  MRGNLFGFLSAHALSPIVSIHHLDATDPIFPNMNNTQALHHLFEAVNVDPGRIFQQTVCYDRSHSLTISVSWGFAIQVFEGNHL-LPDLLTLQRTFTSWR
        +RGN  G L++H+  P+VS+HH+   DPIFPN     A+ HLF AV +DP RIFQ +VCYDR +S TISVSWG+ +Q+ +G HL L D+L  Q TF  W+
Subjt:  MRGNLFGFLSAHALSPIVSIHHLDATDPIFPNMNNTQALHHLFEAVNVDPGRIFQQTVCYDRSHSLTISVSWGFAIQVFEGNHL-LPDLLTLQRTFTSWR

Query:  RAAIIDANKYLFNMRNYPKDPCKRNI-FYMQNLKSSKNN-VLTNYTRKSVTDC---PASGAIKNLRQVRVFSPKLELDVEE
        ++  + A+ Y FN R   +DPC+R + FYMQ++ SS ++  + +  +++  +C   P +   + + ++RVFS +L+ ++ +
Subjt:  RAAIIDANKYLFNMRNYPKDPCKRNI-FYMQNLKSSKNN-VLTNYTRKSVTDC---PASGAIKNLRQVRVFSPKLELDVEE

AT3G11420.1 Protein of unknown function (DUF604)2.3e-3134.12Show/hide
Query:  MRGNLFGFLSAHALSPIVSIHHLDATDPIFPNMNNTQALHHLFEAVNVDPGRIFQQTVCYDRSHSLTISVSWGFAIQVFEGNHLLPDLLTLQRTFTSWRR
        +RG+ +GFL+AH L+P+VS+HHL   DP+FPN N  ++L  L +   +DP RI QQ  C+DR    +IS+SWG+ IQ++       +L T  +TF +WR 
Subjt:  MRGNLFGFLSAHALSPIVSIHHLDATDPIFPNMNNTQALHHLFEAVNVDPGRIFQQTVCYDRSHSLTISVSWGFAIQVFEGNHLLPDLLTLQRTFTSWRR

Query:  AAIIDANKYLFNMRNYPKDPCKRNIFYM----QNLKSSKNNVLTNYTRKSVTDCPASGAIKNLRQVRVFSPKLELDVEEM-KAPRRQCCDIISA-----S
        ++      ++FN R    DPC+R + Y     ++++ S      +   K+   C      +  +  R+    ++ D E   KAPRRQCC+++        
Subjt:  AAIIDANKYLFNMRNYPKDPCKRNIFYM----QNLKSSKNNVLTNYTRKSVTDCPASGAIKNLRQVRVFSPKLELDVEEM-KAPRRQCCDIISA-----S

Query:  KESMLLEIRQC
        ++ MLL IR+C
Subjt:  KESMLLEIRQC

AT4G11350.1 Protein of unknown function (DUF604)8.0e-3235.18Show/hide
Query:  MRGNLFGFLSAHALSPIVSIHHLDATDPIFPNMNNTQALHHLFEAVNVDPGRIFQQTVCYDRSHSLTISVSWGFAIQVFEGNHLLPDLLTLQRTFTSWRR
        + GNLFG L+AH ++P VS+HHLD  +PIFPNM   +A+  L   + +D   + QQ++CYD+  S TISVSWGFA+QVF G+    ++    RTF +W +
Subjt:  MRGNLFGFLSAHALSPIVSIHHLDATDPIFPNMNNTQALHHLFEAVNVDPGRIFQQTVCYDRSHSLTISVSWGFAIQVFEGNHLLPDLLTLQRTFTSWRR

Query:  AAIIDANKYLFNMRNYPKDPCKRN-IFYMQNLK--SSKNNVLTNYTRKSVTDCPASGAIKNLRQVRVFSPKLELDVEEM-KAPRRQCCDIISASKESML
         A  D   Y FN R   ++ C++  +F+M + K     N  ++ YTR  V        + N  ++       + D     ++PRR CC ++   + + L
Subjt:  AAIIDANKYLFNMRNYPKDPCKRN-IFYMQNLK--SSKNNVLTNYTRKSVTDCPASGAIKNLRQVRVFSPKLELDVEEM-KAPRRQCCDIISASKESML

AT4G15240.1 Protein of unknown function (DUF604)1.4e-6051.66Show/hide
Query:  MRGNLFGFLSAHALSPIVSIHHLDATDPIFPNMNNTQALHHLFEAVNVDPGRIFQQTVCYDRSHSLTISVSWGFAIQVFEGNHLLPDLLTLQRTFTSWRR
        +RGN+FG L AH LSP+VS+HHLDA DP FP  N T+++ HL  A + D GRI QQ+VCYD  +++T+SV WG+A+QV+EGN LLPDLLTLQ+TF++WRR
Subjt:  MRGNLFGFLSAHALSPIVSIHHLDATDPIFPNMNNTQALHHLFEAVNVDPGRIFQQTVCYDRSHSLTISVSWGFAIQVFEGNHLLPDLLTLQRTFTSWRR

Query:  AAIIDANKYLFNMRNYPKDPCKRN-IFYMQNLKS-SKNNVLTNYTRKSVTDCPASGAIKNLRQVRVFSPKLELDVEEMKAPRRQCCDIISASKESMLLEI
         + + +N Y+F+ R YP+DPC R  +F++ ++ S       +NY    V  C  + A++ L ++RV SPKLE +VE+M  PRRQCCDI S   +SM++ I
Subjt:  AAIIDANKYLFNMRNYPKDPCKRN-IFYMQNLKS-SKNNVLTNYTRKSVTDCPASGAIKNLRQVRVFSPKLELDVEEMKAPRRQCCDIISASKESMLLEI

Query:  RQCGVEELISM
        RQC  +ELI+M
Subjt:  RQCGVEELISM

AT4G23490.1 Protein of unknown function (DUF604)4.7e-3236.36Show/hide
Query:  GNLFGFLSAHALSPIVSIHHLDATDPIFPNMNNTQALHHLFEAVNVDPGRIFQQTVCYDRSHSLTISVSWGFAIQVFEGNHLLPDLLTLQRTFTSWRRAA
        GNLFG L+AH ++P VS+HHLD  +PIFPNM   +AL  + E + +D   + QQ++CYD+  S TISVSWG+A+Q+F G     ++    RTF +W + A
Subjt:  GNLFGFLSAHALSPIVSIHHLDATDPIFPNMNNTQALHHLFEAVNVDPGRIFQQTVCYDRSHSLTISVSWGFAIQVFEGNHLLPDLLTLQRTFTSWRRAA

Query:  IIDANKYLFNMRNYPKDPCKRN-IFYMQNLKSSK--NNVLTNYTRKSVTDCPASGAIKNLRQVR--VFSPKLELDVEEMKAPRRQCCDIISASKESML
          D   Y FN R   ++PC++  +FYM + K  +  N  ++ YT   V+       + N  ++   V   K +  + E ++PRR CC ++   + + L
Subjt:  IIDANKYLFNMRNYPKDPCKRN-IFYMQNLKSSK--NNVLTNYTRKSVTDCPASGAIKNLRQVR--VFSPKLELDVEEMKAPRRQCCDIISASKESML


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGGGGAAATCTGTTTGGATTTCTATCTGCACATGCATTGTCACCGATTGTTTCAATTCATCATTTGGATGCCACGGACCCGATATTCCCAAACATGAATAATACTCA
GGCCCTGCACCATCTCTTTGAAGCAGTGAATGTGGATCCCGGCAGGATTTTCCAACAAACTGTGTGCTATGACCGTTCTCATTCATTGACTATTTCGGTGTCGTGGGGTT
TTGCTATTCAGGTCTTTGAAGGCAATCATCTTCTCCCAGATCTCCTTACGCTCCAAAGAACTTTTACGTCATGGAGAAGGGCTGCCATCATTGATGCGAATAAATACCTG
TTCAACATGAGAAACTACCCAAAAGATCCTTGTAAAAGAAATATCTTCTATATGCAGAATTTGAAATCTAGTAAGAATAATGTCTTGACTAACTATACTAGGAAGTCGGT
CACTGATTGTCCAGCTTCTGGTGCAATTAAGAATTTGAGACAGGTTCGAGTGTTCTCCCCGAAGTTGGAACTCGATGTGGAAGAGATGAAGGCTCCACGTCGTCAGTGCT
GCGATATCATTTCCGCTTCCAAGGAATCGATGCTCCTTGAAATTCGACAATGTGGAGTTGAAGAATTAATATCTATGTACTTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGCGGGGAAATCTGTTTGGATTTCTATCTGCACATGCATTGTCACCGATTGTTTCAATTCATCATTTGGATGCCACGGACCCGATATTCCCAAACATGAATAATACTCA
GGCCCTGCACCATCTCTTTGAAGCAGTGAATGTGGATCCCGGCAGGATTTTCCAACAAACTGTGTGCTATGACCGTTCTCATTCATTGACTATTTCGGTGTCGTGGGGTT
TTGCTATTCAGGTCTTTGAAGGCAATCATCTTCTCCCAGATCTCCTTACGCTCCAAAGAACTTTTACGTCATGGAGAAGGGCTGCCATCATTGATGCGAATAAATACCTG
TTCAACATGAGAAACTACCCAAAAGATCCTTGTAAAAGAAATATCTTCTATATGCAGAATTTGAAATCTAGTAAGAATAATGTCTTGACTAACTATACTAGGAAGTCGGT
CACTGATTGTCCAGCTTCTGGTGCAATTAAGAATTTGAGACAGGTTCGAGTGTTCTCCCCGAAGTTGGAACTCGATGTGGAAGAGATGAAGGCTCCACGTCGTCAGTGCT
GCGATATCATTTCCGCTTCCAAGGAATCGATGCTCCTTGAAATTCGACAATGTGGAGTTGAAGAATTAATATCTATGTACTTCTAG
Protein sequenceShow/hide protein sequence
MRGNLFGFLSAHALSPIVSIHHLDATDPIFPNMNNTQALHHLFEAVNVDPGRIFQQTVCYDRSHSLTISVSWGFAIQVFEGNHLLPDLLTLQRTFTSWRRAAIIDANKYL
FNMRNYPKDPCKRNIFYMQNLKSSKNNVLTNYTRKSVTDCPASGAIKNLRQVRVFSPKLELDVEEMKAPRRQCCDIISASKESMLLEIRQCGVEELISMYF