CuGenDBv2

Spg034039 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg034039
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: DUF4050 domain-containing protein
Genome location: scaffold13:36873854..36882443
RNA-Seq Expression: Spg034039
Synteny: Spg034039
Gene Ontology terms: NA
InterPro domains: IPR025124 - Domain of unknown function DUF4050
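The genome location field above uses a `scaffold:start..end` notation. As a minimal sketch of how such a string could be split into its parts, the Python below parses it and reports the span. The names `GenomeLocation` and `parse_location` are hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which this page does not state.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """Hypothetical container for a 'scaffold:start..end' location."""
    scaffold: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Parse a location string such as 'scaffold13:36873854..36882443'."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    scaffold, start, end = match.groups()
    return GenomeLocation(scaffold, int(start), int(end))


loc = parse_location("scaffold13:36873854..36882443")
print(loc.scaffold, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, the gene spans 8590 bp on scaffold13.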


Homology
GenBank top hits (e value, % identity, alignment)
KAA0064925.1 uncharacterized protein E6C27_scaffold82G002430 [Cucumis melo var. makuwa] | e value: 2.6e-91 | % identity: 77.09
Query:  MYSRCCLLSRLEGCSSKKPCCSFLQFSGEYLRALILLMVDNLKLLFHRRSCHGCCTAPALGDAMDGPSKGLRVEDQEAKKQCLPENFPSSSTCEMDNSTV
        MYSRCCLL+RLEGCSSK PCCSFLQFSGEY+RALILLMVD +KLLFH+R   GCC+A ALG+AMDGPSKGLRV+D+EAKKQCLPENFPSSSTCEMDNSTV
Subjt:  MYSRCCLLSRLEGCSSKKPCCSFLQFSGEYLRALILLMVDNLKLLFHRRSCHGCCTAPALGDAMDGPSKGLRVEDQEAKKQCLPENFPSSSTCEMDNSTV

Query:  WSQRSMASAQSESHDSHSNIGSSTDFVNSGLLLWNETRKQWIGNKMSEGQKQVREPKISWNATYDSLLTTNKPFPETIPLAVSCQQVSLVIYCIFLSNQT
        WSQRSMASAQ  SHDS SNIGSSTDFVNSGLLLWNETRKQW+GNKMS+ QKQV+EPKISWNATYDSLLTTNKPFPE IPL                    
Subjt:  WSQRSMASAQSESHDSHSNIGSSTDFVNSGLLLWNETRKQWIGNKMSEGQKQVREPKISWNATYDSLLTTNKPFPETIPLAVSCQQVSLVIYCIFLSNQT

Query:  MIVHLLMKCQEMIEFLVDVWEQEGLYD
                  EMIEFLVDVWEQEGLYD
Subjt:  MIVHLLMKCQEMIEFLVDVWEQEGLYD

KAG7020386.1 hypothetical protein SDJN02_17070 [Cucurbita argyrosperma subsp. argyrosperma] | e value: 4.1e-92 | % identity: 78.85
Query:  MYSRCCLLSRLEGCSSKKPCCSFLQFSGEYLRALILLMVDNLKLLFHRRSCHGCCTAPALGDAMDGPSKGLRVEDQEAKKQCLPENFPSSSTCEMDNSTV
        MYSRCCLLSRLEGCSS KPCCSFLQFSG+YLRALI+L+VDNLKLLFHRRSC G CT PALGDAMDGPS GLRVEDQEAKKQCLPENF SSSTCEMDNSTV
Subjt:  MYSRCCLLSRLEGCSSKKPCCSFLQFSGEYLRALILLMVDNLKLLFHRRSCHGCCTAPALGDAMDGPSKGLRVEDQEAKKQCLPENFPSSSTCEMDNSTV

Query:  WSQRSMASAQSESHDSHSNIGSSTDFVNSGLLLWNETRKQWIGNKMSEGQKQVREPKISWNATYDSLLTTNKPFPETIPLAVSCQQVSLVIYCIFLSNQT
        WSQRSMASAQ  SHDSH+N+GSST+FVNSGLLLWNETRKQW+GNK SE QK+VREPKISWNATYDSLLTTNKPFPE IPLA                   
Subjt:  WSQRSMASAQSESHDSHSNIGSSTDFVNSGLLLWNETRKQWIGNKMSEGQKQVREPKISWNATYDSLLTTNKPFPETIPLAVSCQQVSLVIYCIFLSNQT

Query:  MIVHLLMKCQEMIEFLVDVWEQEGLYD
                  EMIEFLVDVWEQEGLYD
Subjt:  MIVHLLMKCQEMIEFLVDVWEQEGLYD

XP_008445211.1 PREDICTED: uncharacterized protein LOC103488310 isoform X1 [Cucumis melo] | e value: 2.6e-91 | % identity: 77.09
Query:  MYSRCCLLSRLEGCSSKKPCCSFLQFSGEYLRALILLMVDNLKLLFHRRSCHGCCTAPALGDAMDGPSKGLRVEDQEAKKQCLPENFPSSSTCEMDNSTV
        MYSRCCLL+RLEGCSSK PCCSFLQFSGEY+RALILLMVD +KLLFH+R   GCC+A ALG+AMDGPSKGLRV+D+EAKKQCLPENFPSSSTCEMDNSTV
Subjt:  MYSRCCLLSRLEGCSSKKPCCSFLQFSGEYLRALILLMVDNLKLLFHRRSCHGCCTAPALGDAMDGPSKGLRVEDQEAKKQCLPENFPSSSTCEMDNSTV

Query:  WSQRSMASAQSESHDSHSNIGSSTDFVNSGLLLWNETRKQWIGNKMSEGQKQVREPKISWNATYDSLLTTNKPFPETIPLAVSCQQVSLVIYCIFLSNQT
        WSQRSMASAQ  SHDS SNIGSSTDFVNSGLLLWNETRKQW+GNKMS+ QKQV+EPKISWNATYDSLLTTNKPFPE IPL                    
Subjt:  WSQRSMASAQSESHDSHSNIGSSTDFVNSGLLLWNETRKQWIGNKMSEGQKQVREPKISWNATYDSLLTTNKPFPETIPLAVSCQQVSLVIYCIFLSNQT

Query:  MIVHLLMKCQEMIEFLVDVWEQEGLYD
                  EMIEFLVDVWEQEGLYD
Subjt:  MIVHLLMKCQEMIEFLVDVWEQEGLYD

XP_022951409.1 uncharacterized protein LOC111454240 isoform X1 [Cucurbita moschata] | e value: 4.1e-92 | % identity: 78.85
Query:  MYSRCCLLSRLEGCSSKKPCCSFLQFSGEYLRALILLMVDNLKLLFHRRSCHGCCTAPALGDAMDGPSKGLRVEDQEAKKQCLPENFPSSSTCEMDNSTV
        MYSRCCLLSRLEGCSS KPCCSFLQFSG+YLRALI+L+VDNLKLLFHRRSC G CT PALGDAMDGPS GLRVEDQEAKKQCLPENF SSSTCEMDNSTV
Subjt:  MYSRCCLLSRLEGCSSKKPCCSFLQFSGEYLRALILLMVDNLKLLFHRRSCHGCCTAPALGDAMDGPSKGLRVEDQEAKKQCLPENFPSSSTCEMDNSTV

Query:  WSQRSMASAQSESHDSHSNIGSSTDFVNSGLLLWNETRKQWIGNKMSEGQKQVREPKISWNATYDSLLTTNKPFPETIPLAVSCQQVSLVIYCIFLSNQT
        WSQRSMASAQ  SHDSH+N+GSST+FVNSGLLLWNETRKQW+GNK SE QK+VREPKISWNATYDSLLTTNKPFPE IPLA                   
Subjt:  WSQRSMASAQSESHDSHSNIGSSTDFVNSGLLLWNETRKQWIGNKMSEGQKQVREPKISWNATYDSLLTTNKPFPETIPLAVSCQQVSLVIYCIFLSNQT

Query:  MIVHLLMKCQEMIEFLVDVWEQEGLYD
                  EMIEFLVDVWEQEGLYD
Subjt:  MIVHLLMKCQEMIEFLVDVWEQEGLYD

XP_038885342.1 uncharacterized protein LOC120075759 isoform X1 [Benincasa hispida] | e value: 7.4e-94 | % identity: 78.41
Query:  MYSRCCLLSRLEGCSSKKPCCSFLQFSGEYLRALILLMVDNLKLLFHRRSCHGCCTAPALGDAMDGPSKGLRVEDQEAKKQCLPENFPSSSTCEMDNSTV
        MYSRCCLL RLEGCSSKKPCCSFLQFSGEYLRALILLMVDN+KLLFHRRSCHGCCTA AL +AMDGPSKGLRV+DQEAKKQCLPEN PSSSTCEMDNSTV
Subjt:  MYSRCCLLSRLEGCSSKKPCCSFLQFSGEYLRALILLMVDNLKLLFHRRSCHGCCTAPALGDAMDGPSKGLRVEDQEAKKQCLPENFPSSSTCEMDNSTV

Query:  WSQRSMASAQSESHDSHSNIGSSTDFVNSGLLLWNETRKQWIGNKMSEGQKQVREPKISWNATYDSLLTTNKPFPETIPLAVSCQQVSLVIYCIFLSNQT
        WSQRSMASA   SHDSHSNIGSSTDFVNSGLLLWNETRKQW+GNKMSE QKQV+EPKISW+ATYDSLL TNKPFPE +PL                    
Subjt:  WSQRSMASAQSESHDSHSNIGSSTDFVNSGLLLWNETRKQWIGNKMSEGQKQVREPKISWNATYDSLLTTNKPFPETIPLAVSCQQVSLVIYCIFLSNQT

Query:  MIVHLLMKCQEMIEFLVDVWEQEGLYD
                  EMI+FLVDVWEQ+GLYD
Subjt:  MIVHLLMKCQEMIEFLVDVWEQEGLYD

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LPL3 Uncharacterized protein | e value: 8.3e-91 | % identity: 76.21
Query:  MYSRCCLLSRLEGCSSKKPCCSFLQFSGEYLRALILLMVDNLKLLFHRRSCHGCCTAPALGDAMDGPSKGLRVEDQEAKKQCLPENFPSSSTCEMDNSTV
        MYSRCCLL+RLEGCSSK PCCSFLQFSGEY+RALILLMVD +KLLFH+R   GCCTA ALG+AMDGPSKGLRV+++EAKKQCLPENFPSSSTCEMDNSTV
Subjt:  MYSRCCLLSRLEGCSSKKPCCSFLQFSGEYLRALILLMVDNLKLLFHRRSCHGCCTAPALGDAMDGPSKGLRVEDQEAKKQCLPENFPSSSTCEMDNSTV

Query:  WSQRSMASAQSESHDSHSNIGSSTDFVNSGLLLWNETRKQWIGNKMSEGQKQVREPKISWNATYDSLLTTNKPFPETIPLAVSCQQVSLVIYCIFLSNQT
        WSQRSMAS Q  +HDSHSNIGSSTDFVNSGLLLWNETRKQW+GNKMS  QKQV+EPKISWNATYD+LLTTNKPFPE IPL                    
Subjt:  WSQRSMASAQSESHDSHSNIGSSTDFVNSGLLLWNETRKQWIGNKMSEGQKQVREPKISWNATYDSLLTTNKPFPETIPLAVSCQQVSLVIYCIFLSNQT

Query:  MIVHLLMKCQEMIEFLVDVWEQEGLYD
                  EMIEFLVDVWEQEGLYD
Subjt:  MIVHLLMKCQEMIEFLVDVWEQEGLYD

A0A1S3BC47 uncharacterized protein LOC103488310 isoform X1 | e value: 1.3e-91 | % identity: 77.09
Query:  MYSRCCLLSRLEGCSSKKPCCSFLQFSGEYLRALILLMVDNLKLLFHRRSCHGCCTAPALGDAMDGPSKGLRVEDQEAKKQCLPENFPSSSTCEMDNSTV
        MYSRCCLL+RLEGCSSK PCCSFLQFSGEY+RALILLMVD +KLLFH+R   GCC+A ALG+AMDGPSKGLRV+D+EAKKQCLPENFPSSSTCEMDNSTV
Subjt:  MYSRCCLLSRLEGCSSKKPCCSFLQFSGEYLRALILLMVDNLKLLFHRRSCHGCCTAPALGDAMDGPSKGLRVEDQEAKKQCLPENFPSSSTCEMDNSTV

Query:  WSQRSMASAQSESHDSHSNIGSSTDFVNSGLLLWNETRKQWIGNKMSEGQKQVREPKISWNATYDSLLTTNKPFPETIPLAVSCQQVSLVIYCIFLSNQT
        WSQRSMASAQ  SHDS SNIGSSTDFVNSGLLLWNETRKQW+GNKMS+ QKQV+EPKISWNATYDSLLTTNKPFPE IPL                    
Subjt:  WSQRSMASAQSESHDSHSNIGSSTDFVNSGLLLWNETRKQWIGNKMSEGQKQVREPKISWNATYDSLLTTNKPFPETIPLAVSCQQVSLVIYCIFLSNQT

Query:  MIVHLLMKCQEMIEFLVDVWEQEGLYD
                  EMIEFLVDVWEQEGLYD
Subjt:  MIVHLLMKCQEMIEFLVDVWEQEGLYD

A0A5A7VGA9 Uncharacterized protein | e value: 1.3e-91 | % identity: 77.09
Query:  MYSRCCLLSRLEGCSSKKPCCSFLQFSGEYLRALILLMVDNLKLLFHRRSCHGCCTAPALGDAMDGPSKGLRVEDQEAKKQCLPENFPSSSTCEMDNSTV
        MYSRCCLL+RLEGCSSK PCCSFLQFSGEY+RALILLMVD +KLLFH+R   GCC+A ALG+AMDGPSKGLRV+D+EAKKQCLPENFPSSSTCEMDNSTV
Subjt:  MYSRCCLLSRLEGCSSKKPCCSFLQFSGEYLRALILLMVDNLKLLFHRRSCHGCCTAPALGDAMDGPSKGLRVEDQEAKKQCLPENFPSSSTCEMDNSTV

Query:  WSQRSMASAQSESHDSHSNIGSSTDFVNSGLLLWNETRKQWIGNKMSEGQKQVREPKISWNATYDSLLTTNKPFPETIPLAVSCQQVSLVIYCIFLSNQT
        WSQRSMASAQ  SHDS SNIGSSTDFVNSGLLLWNETRKQW+GNKMS+ QKQV+EPKISWNATYDSLLTTNKPFPE IPL                    
Subjt:  WSQRSMASAQSESHDSHSNIGSSTDFVNSGLLLWNETRKQWIGNKMSEGQKQVREPKISWNATYDSLLTTNKPFPETIPLAVSCQQVSLVIYCIFLSNQT

Query:  MIVHLLMKCQEMIEFLVDVWEQEGLYD
                  EMIEFLVDVWEQEGLYD
Subjt:  MIVHLLMKCQEMIEFLVDVWEQEGLYD

A0A6J1GIP5 uncharacterized protein LOC111454240 isoform X1 | e value: 2.0e-92 | % identity: 78.85
Query:  MYSRCCLLSRLEGCSSKKPCCSFLQFSGEYLRALILLMVDNLKLLFHRRSCHGCCTAPALGDAMDGPSKGLRVEDQEAKKQCLPENFPSSSTCEMDNSTV
        MYSRCCLLSRLEGCSS KPCCSFLQFSG+YLRALI+L+VDNLKLLFHRRSC G CT PALGDAMDGPS GLRVEDQEAKKQCLPENF SSSTCEMDNSTV
Subjt:  MYSRCCLLSRLEGCSSKKPCCSFLQFSGEYLRALILLMVDNLKLLFHRRSCHGCCTAPALGDAMDGPSKGLRVEDQEAKKQCLPENFPSSSTCEMDNSTV

Query:  WSQRSMASAQSESHDSHSNIGSSTDFVNSGLLLWNETRKQWIGNKMSEGQKQVREPKISWNATYDSLLTTNKPFPETIPLAVSCQQVSLVIYCIFLSNQT
        WSQRSMASAQ  SHDSH+N+GSST+FVNSGLLLWNETRKQW+GNK SE QK+VREPKISWNATYDSLLTTNKPFPE IPLA                   
Subjt:  WSQRSMASAQSESHDSHSNIGSSTDFVNSGLLLWNETRKQWIGNKMSEGQKQVREPKISWNATYDSLLTTNKPFPETIPLAVSCQQVSLVIYCIFLSNQT

Query:  MIVHLLMKCQEMIEFLVDVWEQEGLYD
                  EMIEFLVDVWEQEGLYD
Subjt:  MIVHLLMKCQEMIEFLVDVWEQEGLYD

A0A6J1KQM2 uncharacterized protein LOC111496323 isoform X1 | e value: 1.3e-91 | % identity: 77.97
Query:  MYSRCCLLSRLEGCSSKKPCCSFLQFSGEYLRALILLMVDNLKLLFHRRSCHGCCTAPALGDAMDGPSKGLRVEDQEAKKQCLPENFPSSSTCEMDNSTV
        MYSRCCLLSRLEGCSS KPCCSFLQFSG+YLRALI+L+VDNLKLLFHRRSC G CT PALGDAMDGPS GLRV+DQEAKKQCLP+NF SSSTCEMDNSTV
Subjt:  MYSRCCLLSRLEGCSSKKPCCSFLQFSGEYLRALILLMVDNLKLLFHRRSCHGCCTAPALGDAMDGPSKGLRVEDQEAKKQCLPENFPSSSTCEMDNSTV

Query:  WSQRSMASAQSESHDSHSNIGSSTDFVNSGLLLWNETRKQWIGNKMSEGQKQVREPKISWNATYDSLLTTNKPFPETIPLAVSCQQVSLVIYCIFLSNQT
        WSQRSMASAQ  SHDSH+N+GSST+FVNSGLLLWNETRKQW+GNK SE QK+VREPKISWNATYDSLLTTNKPFPE IPLA                   
Subjt:  WSQRSMASAQSESHDSHSNIGSSTDFVNSGLLLWNETRKQWIGNKMSEGQKQVREPKISWNATYDSLLTTNKPFPETIPLAVSCQQVSLVIYCIFLSNQT

Query:  MIVHLLMKCQEMIEFLVDVWEQEGLYD
                  EMIEFLVDVWEQEGLYD
Subjt:  MIVHLLMKCQEMIEFLVDVWEQEGLYD

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G15350.1 unknown protein | e value: 4.0e-21 | % identity: 37.63
Query:  CHGC-----CTAPALGDAMDGPSKGLRVEDQEAKKQCLPENFPSSSTCEMDNSTVWSQRSMASAQS--ESHDSHSNIGSSTDFVNSGLLLWNETRKQWIG
        C GC      TA +L    D PS  +    +  KK  + E+F S+ST +MDN T  SQ S++S+    +S  +  N  +  ++VN GLLLWN+TR++W+G
Subjt:  CHGC-----CTAPALGDAMDGPSKGLRVEDQEAKKQCLPENFPSSSTCEMDNSTVWSQRSMASAQS--ESHDSHSNIGSSTDFVNSGLLLWNETRKQWIG

Query:  -NKMSEGQKQVREPKISWN-ATYDSLLTTNKPFPETIPLAVSCQQVSLVIYCIFLSNQTMIVHLLMKCQEMIEFLVDVWEQEGLYD
         +K +      +  K++WN ATYDSLL +NK FP+ IPL                              EM++FLVD+WEQEGLYD
Subjt:  -NKMSEGQKQVREPKISWN-ATYDSLLTTNKPFPETIPLAVSCQQVSLVIYCIFLSNQTMIVHLLMKCQEMIEFLVDVWEQEGLYD

AT1G15350.2 unknown protein | e value: 4.0e-21 | % identity: 37.63
Query:  CHGC-----CTAPALGDAMDGPSKGLRVEDQEAKKQCLPENFPSSSTCEMDNSTVWSQRSMASAQS--ESHDSHSNIGSSTDFVNSGLLLWNETRKQWIG
        C GC      TA +L    D PS  +    +  KK  + E+F S+ST +MDN T  SQ S++S+    +S  +  N  +  ++VN GLLLWN+TR++W+G
Subjt:  CHGC-----CTAPALGDAMDGPSKGLRVEDQEAKKQCLPENFPSSSTCEMDNSTVWSQRSMASAQS--ESHDSHSNIGSSTDFVNSGLLLWNETRKQWIG

Query:  -NKMSEGQKQVREPKISWN-ATYDSLLTTNKPFPETIPLAVSCQQVSLVIYCIFLSNQTMIVHLLMKCQEMIEFLVDVWEQEGLYD
         +K +      +  K++WN ATYDSLL +NK FP+ IPL                              EM++FLVD+WEQEGLYD
Subjt:  -NKMSEGQKQVREPKISWN-ATYDSLLTTNKPFPETIPLAVSCQQVSLVIYCIFLSNQTMIVHLLMKCQEMIEFLVDVWEQEGLYD

AT4G32342.1 unknown protein | e value: 1.7e-27 | % identity: 41.27
Query:  NLKLLFHRRSCHGCCTAP-ALGDAMDGPSKGLRVEDQEAKK-QCLPENFPSSSTCEMD-NSTVWSQRSMASAQSESHDSHSNIGSSTDFVNSGLLLWNET
        N K L +  +C GCC     L   +D PSKGL+++ +  KK     ++F S+STC+MD N T+ SQ S     +   D   +  +ST+FVN GL+LWN T
Subjt:  NLKLLFHRRSCHGCCTAP-ALGDAMDGPSKGLRVEDQEAKK-QCLPENFPSSSTCEMD-NSTVWSQRSMASAQSESHDSHSNIGSSTDFVNSGLLLWNET

Query:  RKQWIGNKMSEGQKQVREPKISWNATYDSLLTTNKPFPETIPLAVSCQQVSLVIYCIFLSNQTMIVHLLMKCQEMIEFLVDVWEQEGLY
        R+QW    ++  Q  V EP ISWN+TYDSLL+TNK FP+ IPL                             +EM+ FLVDVWE+EGLY
Subjt:  RKQWIGNKMSEGQKQVREPKISWNATYDSLLTTNKPFPETIPLAVSCQQVSLVIYCIFLSNQTMIVHLLMKCQEMIEFLVDVWEQEGLY

AT5G25360.1 unknown protein | e value: 2.3e-37 | % identity: 48.31
Query:  CHGCCTAPALGDAMDGPSKGLRVEDQEAKKQCLPENFPSSSTCEMDNSTVWSQRSMASAQSESHDSHS-NIGSSTDFVNSGLLLWNETRKQWIGNKMSEG
        C GCC  P L  A+D PSKGLR++ +  KK  + E+F S+STCEMDNST+ SQRSM+S    ++ S S +  + T+FVN GL LWN+TR+QW+ N  S+ 
Subjt:  CHGCCTAPALGDAMDGPSKGLRVEDQEAKKQCLPENFPSSSTCEMDNSTVWSQRSMASAQSESHDSHS-NIGSSTDFVNSGLLLWNETRKQWIGNKMSEG

Query:  QKQVREPKISWNATYDSLLTTNKPFPETIPLAVSCQQVSLVIYCIFLSNQTMIVHLLMKCQEMIEFLVDVWEQEGLYD
        + +VREP ISWNATY+SLL  NK F   IPL                              EM++FLVDVWEQEGLYD
Subjt:  QKQVREPKISWNATYDSLLTTNKPFPETIPLAVSCQQVSLVIYCIFLSNQTMIVHLLMKCQEMIEFLVDVWEQEGLYD

AT5G25360.2 unknown protein | e value: 2.3e-37 | % identity: 48.31
Query:  CHGCCTAPALGDAMDGPSKGLRVEDQEAKKQCLPENFPSSSTCEMDNSTVWSQRSMASAQSESHDSHS-NIGSSTDFVNSGLLLWNETRKQWIGNKMSEG
        C GCC  P L  A+D PSKGLR++ +  KK  + E+F S+STCEMDNST+ SQRSM+S    ++ S S +  + T+FVN GL LWN+TR+QW+ N  S+ 
Subjt:  CHGCCTAPALGDAMDGPSKGLRVEDQEAKKQCLPENFPSSSTCEMDNSTVWSQRSMASAQSESHDSHS-NIGSSTDFVNSGLLLWNETRKQWIGNKMSEG

Query:  QKQVREPKISWNATYDSLLTTNKPFPETIPLAVSCQQVSLVIYCIFLSNQTMIVHLLMKCQEMIEFLVDVWEQEGLYD
        + +VREP ISWNATY+SLL  NK F   IPL                              EM++FLVDVWEQEGLYD
Subjt:  QKQVREPKISWNATYDSLLTTNKPFPETIPLAVSCQQVSLVIYCIFLSNQTMIVHLLMKCQEMIEFLVDVWEQEGLYD


Sequences
CDS sequence
ATGTATTCTAGGTGTTGTCTCCTCAGCCGTTTAGAGGGTTGCTCTAGCAAGAAACCATGTTGCTCGTTTTTACAGTTTTCTGGAGAATATCTGCGCGCTCTTATACTTTT
GATGGTGGATAATCTCAAGCTTCTTTTCCATAGAAGAAGCTGTCATGGATGCTGCACTGCACCTGCACTAGGTGATGCAATGGATGGGCCATCTAAAGGTCTGAGAGTTG
AAGACCAAGAAGCAAAGAAACAATGCTTACCGGAAAATTTCCCGAGCTCTAGCACATGTGAAATGGACAACAGTACAGTTTGGTCCCAGAGAAGCATGGCATCAGCCCAG
TCAGAGTCACATGATTCACACAGCAATATTGGGAGCAGTACAGACTTTGTAAATTCTGGATTACTTCTTTGGAATGAGACCAGGAAGCAATGGATTGGAAACAAAATGTC
CGAGGGCCAAAAGCAAGTTCGAGAACCCAAAATAAGTTGGAATGCTACTTACGATAGCTTATTAACGACGAACAAGCCGTTCCCCGAGACCATTCCTCTTGCTGTAAGTT
GTCAGCAGGTTTCTCTTGTTATCTACTGTATTTTCCTGTCCAACCAGACAATGATTGTTCATTTGCTTATGAAATGCCAGGAGATGATAGAGTTTCTTGTTGATGTCTGG
GAGCAGGAGGGTCTATATGACTGA
mRNA sequence
ATGTATTCTAGGTGTTGTCTCCTCAGCCGTTTAGAGGGTTGCTCTAGCAAGAAACCATGTTGCTCGTTTTTACAGTTTTCTGGAGAATATCTGCGCGCTCTTATACTTTT
GATGGTGGATAATCTCAAGCTTCTTTTCCATAGAAGAAGCTGTCATGGATGCTGCACTGCACCTGCACTAGGTGATGCAATGGATGGGCCATCTAAAGGTCTGAGAGTTG
AAGACCAAGAAGCAAAGAAACAATGCTTACCGGAAAATTTCCCGAGCTCTAGCACATGTGAAATGGACAACAGTACAGTTTGGTCCCAGAGAAGCATGGCATCAGCCCAG
TCAGAGTCACATGATTCACACAGCAATATTGGGAGCAGTACAGACTTTGTAAATTCTGGATTACTTCTTTGGAATGAGACCAGGAAGCAATGGATTGGAAACAAAATGTC
CGAGGGCCAAAAGCAAGTTCGAGAACCCAAAATAAGTTGGAATGCTACTTACGATAGCTTATTAACGACGAACAAGCCGTTCCCCGAGACCATTCCTCTTGCTGTAAGTT
GTCAGCAGGTTTCTCTTGTTATCTACTGTATTTTCCTGTCCAACCAGACAATGATTGTTCATTTGCTTATGAAATGCCAGGAGATGATAGAGTTTCTTGTTGATGTCTGG
GAGCAGGAGGGTCTATATGACTGA
Protein sequence
MYSRCCLLSRLEGCSSKKPCCSFLQFSGEYLRALILLMVDNLKLLFHRRSCHGCCTAPALGDAMDGPSKGLRVEDQEAKKQCLPENFPSSSTCEMDNSTVWSQRSMASAQ
SESHDSHSNIGSSTDFVNSGLLLWNETRKQWIGNKMSEGQKQVREPKISWNATYDSLLTTNKPFPETIPLAVSCQQVSLVIYCIFLSNQTMIVHLLMKCQEMIEFLVDVW
EQEGLYD
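The CDS and protein sequences above can be cross-checked by translation. The Python below is a minimal sketch, assuming the standard nuclear genetic code (the page does not name a translation table); the helper `translate` is hypothetical and is applied only to the first 12 and last 24 nucleotides of the CDS as listed above.

```python
from itertools import product

# Standard genetic code, built from the conventional TCAG codon ordering.
# Assumption: the CDS uses the standard nuclear code (translation table 1).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence codon by codon; '*' marks a stop."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First four codons and final eight codons of the CDS shown above.
print(translate("ATGTATTCTAGG"))
print(translate("GAGCAGGAGGGTCTATATGACTGA"))
```

The two fragments translate to `MYSR` and `EQEGLYD*`, matching the start and end of the listed protein sequence, with the terminal `TGA` read as a stop codon.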