CuGenDBv2

CSPI02G23520 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI02G23520
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: DUF4050 domain-containing protein
Genome location: Chr2:20270154..20273958
RNA-Seq Expression: CSPI02G23520
Synteny: CSPI02G23520
Gene Ontology terms: NA
InterPro domains: NA
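The genome location string can be split into its parts programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for genome browsers, but not stated on this page); the function name `parse_location` is hypothetical.

```python
import re


def parse_location(loc: str):
    """Split a 'Chr:start..end' string into (chrom, start, end, span).

    Assumption: coordinates are 1-based and inclusive, so the span
    is end - start + 1.
    """
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    chrom = m.group(1)
    start, end = int(m.group(2)), int(m.group(3))
    return chrom, start, end, end - start + 1


chrom, start, end, span = parse_location("Chr2:20270154..20273958")
print(chrom, start, end, span)
```

Under that assumption, the gene region on Chr2 spans 3805 bp.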


Homology
GenBank top hits (columns: hit, e value, % identity, alignment)
KAA0064925.1 uncharacterized protein E6C27_scaffold82G002430 [Cucumis melo var. makuwa] (e value: 4.8e-105; identity: 95.92%)
Query:  MYSRCCLLNRLEGCSSKNPCCSFLQFSGEYMRALILLMVDKIKLLFHKRRRDGCCTASALGNAMDGPSKGLRVKNKEAKKQCLPENFPSSSTCEMDNSTV
        MYSRCCLLNRLEGCSSKNPCCSFLQFSGEYMRALILLMVDKIKLLFHKRRR+GCC+ASALGNAMDGPSKGLRVK+KEAKKQCLPENFPSSSTCEMDNSTV
Subjt:  MYSRCCLLNRLEGCSSKNPCCSFLQFSGEYMRALILLMVDKIKLLFHKRRRDGCCTASALGNAMDGPSKGLRVKNKEAKKQCLPENFPSSSTCEMDNSTV

Query:  WSQRSMASGQAHDSHSNIGSSTDFVNSGLLLWNETRKQWVGNKMSGSQKQVQEPKISWNATYDNLLTTNKPFPEAIPLTEMIEFLVDVWEQEGLYD
        WSQRSMAS Q+HDS SNIGSSTDFVNSGLLLWNETRKQWVGNKMS SQKQVQEPKISWNATYD+LLTTNKPFPEAIPLTEMIEFLVDVWEQEGLYD
Subjt:  WSQRSMASGQAHDSHSNIGSSTDFVNSGLLLWNETRKQWVGNKMSGSQKQVQEPKISWNATYDNLLTTNKPFPEAIPLTEMIEFLVDVWEQEGLYD

XP_004138726.1 uncharacterized protein LOC101216869 [Cucumis sativus] (e value: 4.9e-110; identity: 100%)
Query:  MYSRCCLLNRLEGCSSKNPCCSFLQFSGEYMRALILLMVDKIKLLFHKRRRDGCCTASALGNAMDGPSKGLRVKNKEAKKQCLPENFPSSSTCEMDNSTV
        MYSRCCLLNRLEGCSSKNPCCSFLQFSGEYMRALILLMVDKIKLLFHKRRRDGCCTASALGNAMDGPSKGLRVKNKEAKKQCLPENFPSSSTCEMDNSTV
Subjt:  MYSRCCLLNRLEGCSSKNPCCSFLQFSGEYMRALILLMVDKIKLLFHKRRRDGCCTASALGNAMDGPSKGLRVKNKEAKKQCLPENFPSSSTCEMDNSTV

Query:  WSQRSMASGQAHDSHSNIGSSTDFVNSGLLLWNETRKQWVGNKMSGSQKQVQEPKISWNATYDNLLTTNKPFPEAIPLTEMIEFLVDVWEQEGLYD
        WSQRSMASGQAHDSHSNIGSSTDFVNSGLLLWNETRKQWVGNKMSGSQKQVQEPKISWNATYDNLLTTNKPFPEAIPLTEMIEFLVDVWEQEGLYD
Subjt:  WSQRSMASGQAHDSHSNIGSSTDFVNSGLLLWNETRKQWVGNKMSGSQKQVQEPKISWNATYDNLLTTNKPFPEAIPLTEMIEFLVDVWEQEGLYD

XP_008445211.1 PREDICTED: uncharacterized protein LOC103488310 isoform X1 [Cucumis melo] (e value: 4.8e-105; identity: 95.92%)
Query:  MYSRCCLLNRLEGCSSKNPCCSFLQFSGEYMRALILLMVDKIKLLFHKRRRDGCCTASALGNAMDGPSKGLRVKNKEAKKQCLPENFPSSSTCEMDNSTV
        MYSRCCLLNRLEGCSSKNPCCSFLQFSGEYMRALILLMVDKIKLLFHKRRR+GCC+ASALGNAMDGPSKGLRVK+KEAKKQCLPENFPSSSTCEMDNSTV
Subjt:  MYSRCCLLNRLEGCSSKNPCCSFLQFSGEYMRALILLMVDKIKLLFHKRRRDGCCTASALGNAMDGPSKGLRVKNKEAKKQCLPENFPSSSTCEMDNSTV

Query:  WSQRSMASGQAHDSHSNIGSSTDFVNSGLLLWNETRKQWVGNKMSGSQKQVQEPKISWNATYDNLLTTNKPFPEAIPLTEMIEFLVDVWEQEGLYD
        WSQRSMAS Q+HDS SNIGSSTDFVNSGLLLWNETRKQWVGNKMS SQKQVQEPKISWNATYD+LLTTNKPFPEAIPLTEMIEFLVDVWEQEGLYD
Subjt:  WSQRSMASGQAHDSHSNIGSSTDFVNSGLLLWNETRKQWVGNKMSGSQKQVQEPKISWNATYDNLLTTNKPFPEAIPLTEMIEFLVDVWEQEGLYD

XP_022951409.1 uncharacterized protein LOC111454240 isoform X1 [Cucurbita moschata] (e value: 3.3e-90; identity: 83.67%)
Query:  MYSRCCLLNRLEGCSSKNPCCSFLQFSGEYMRALILLMVDKIKLLFHKRRRDGCCTASALGNAMDGPSKGLRVKNKEAKKQCLPENFPSSSTCEMDNSTV
        MYSRCCLL+RLEGCSSK PCCSFLQFSG+Y+RALI+L+VD +KLLFH+R   G CT  ALG+AMDGPS GLRV+++EAKKQCLPENF SSSTCEMDNSTV
Subjt:  MYSRCCLLNRLEGCSSKNPCCSFLQFSGEYMRALILLMVDKIKLLFHKRRRDGCCTASALGNAMDGPSKGLRVKNKEAKKQCLPENFPSSSTCEMDNSTV

Query:  WSQRSMASGQAHDSHSNIGSSTDFVNSGLLLWNETRKQWVGNKMSGSQKQVQEPKISWNATYDNLLTTNKPFPEAIPLTEMIEFLVDVWEQEGLYD
        WSQRSMAS Q+HDSH+N+GSST+FVNSGLLLWNETRKQWVGNK S SQK+V+EPKISWNATYD+LLTTNKPFPEAIPL EMIEFLVDVWEQEGLYD
Subjt:  WSQRSMASGQAHDSHSNIGSSTDFVNSGLLLWNETRKQWVGNKMSGSQKQVQEPKISWNATYDNLLTTNKPFPEAIPLTEMIEFLVDVWEQEGLYD

XP_038885342.1 uncharacterized protein LOC120075759 isoform X1 [Benincasa hispida] (e value: 4.0e-96; identity: 87.76%)
Query:  MYSRCCLLNRLEGCSSKNPCCSFLQFSGEYMRALILLMVDKIKLLFHKRRRDGCCTASALGNAMDGPSKGLRVKNKEAKKQCLPENFPSSSTCEMDNSTV
        MYSRCCLL RLEGCSSK PCCSFLQFSGEY+RALILLMVD IKLLFH+R   GCCTASAL NAMDGPSKGLRVK++EAKKQCLPEN PSSSTCEMDNSTV
Subjt:  MYSRCCLLNRLEGCSSKNPCCSFLQFSGEYMRALILLMVDKIKLLFHKRRRDGCCTASALGNAMDGPSKGLRVKNKEAKKQCLPENFPSSSTCEMDNSTV

Query:  WSQRSMASGQAHDSHSNIGSSTDFVNSGLLLWNETRKQWVGNKMSGSQKQVQEPKISWNATYDNLLTTNKPFPEAIPLTEMIEFLVDVWEQEGLYD
        WSQRSMAS  +HDSHSNIGSSTDFVNSGLLLWNETRKQWVGNKMS  QKQVQEPKISW+ATYD+LL TNKPFPE +PLTEMI+FLVDVWEQ+GLYD
Subjt:  WSQRSMASGQAHDSHSNIGSSTDFVNSGLLLWNETRKQWVGNKMSGSQKQVQEPKISWNATYDNLLTTNKPFPEAIPLTEMIEFLVDVWEQEGLYD
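The % identity values listed for each hit can be cross-checked against the alignment midline (the unlabeled line between Query and Subjt). The sketch below assumes the usual BLAST-style convention: a letter on the midline marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap, with identity taken over all aligned columns. The function name `percent_identity` is hypothetical, and this convention is an assumption about how the page computed its figures.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent of aligned columns whose midline character is a letter.

    Assumption: letters on the midline denote identical residues;
    '+' and spaces denote non-identical columns. The midline is padded
    to the query width in case trailing spaces were stripped.
    """
    width = len(query)
    if width == 0:
        raise ValueError("empty alignment")
    padded = midline.ljust(width)[:width]
    matches = sum(1 for ch in padded if ch.isalpha())
    return 100.0 * matches / width


# Toy segment in the same style as the alignments above.
print(percent_identity("MASGQAHDSH", "MAS Q+HDS "))
```

For an alignment printed in several blocks, concatenate the Query lines and the midlines of all blocks before calling the function. For example, a midline with 188 letters over 196 aligned columns gives 95.92%.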

TrEMBL top hits (columns: hit, e value, % identity, alignment)
A0A0A0LPL3 Uncharacterized protein (e value: 2.4e-110; identity: 100%)
Query:  MYSRCCLLNRLEGCSSKNPCCSFLQFSGEYMRALILLMVDKIKLLFHKRRRDGCCTASALGNAMDGPSKGLRVKNKEAKKQCLPENFPSSSTCEMDNSTV
        MYSRCCLLNRLEGCSSKNPCCSFLQFSGEYMRALILLMVDKIKLLFHKRRRDGCCTASALGNAMDGPSKGLRVKNKEAKKQCLPENFPSSSTCEMDNSTV
Subjt:  MYSRCCLLNRLEGCSSKNPCCSFLQFSGEYMRALILLMVDKIKLLFHKRRRDGCCTASALGNAMDGPSKGLRVKNKEAKKQCLPENFPSSSTCEMDNSTV

Query:  WSQRSMASGQAHDSHSNIGSSTDFVNSGLLLWNETRKQWVGNKMSGSQKQVQEPKISWNATYDNLLTTNKPFPEAIPLTEMIEFLVDVWEQEGLYD
        WSQRSMASGQAHDSHSNIGSSTDFVNSGLLLWNETRKQWVGNKMSGSQKQVQEPKISWNATYDNLLTTNKPFPEAIPLTEMIEFLVDVWEQEGLYD
Subjt:  WSQRSMASGQAHDSHSNIGSSTDFVNSGLLLWNETRKQWVGNKMSGSQKQVQEPKISWNATYDNLLTTNKPFPEAIPLTEMIEFLVDVWEQEGLYD

A0A1S3BC47 uncharacterized protein LOC103488310 isoform X1 (e value: 2.3e-105; identity: 95.92%)
Query:  MYSRCCLLNRLEGCSSKNPCCSFLQFSGEYMRALILLMVDKIKLLFHKRRRDGCCTASALGNAMDGPSKGLRVKNKEAKKQCLPENFPSSSTCEMDNSTV
        MYSRCCLLNRLEGCSSKNPCCSFLQFSGEYMRALILLMVDKIKLLFHKRRR+GCC+ASALGNAMDGPSKGLRVK+KEAKKQCLPENFPSSSTCEMDNSTV
Subjt:  MYSRCCLLNRLEGCSSKNPCCSFLQFSGEYMRALILLMVDKIKLLFHKRRRDGCCTASALGNAMDGPSKGLRVKNKEAKKQCLPENFPSSSTCEMDNSTV

Query:  WSQRSMASGQAHDSHSNIGSSTDFVNSGLLLWNETRKQWVGNKMSGSQKQVQEPKISWNATYDNLLTTNKPFPEAIPLTEMIEFLVDVWEQEGLYD
        WSQRSMAS Q+HDS SNIGSSTDFVNSGLLLWNETRKQWVGNKMS SQKQVQEPKISWNATYD+LLTTNKPFPEAIPLTEMIEFLVDVWEQEGLYD
Subjt:  WSQRSMASGQAHDSHSNIGSSTDFVNSGLLLWNETRKQWVGNKMSGSQKQVQEPKISWNATYDNLLTTNKPFPEAIPLTEMIEFLVDVWEQEGLYD

A0A5A7VGA9 Uncharacterized protein (e value: 2.3e-105; identity: 95.92%)
Query:  MYSRCCLLNRLEGCSSKNPCCSFLQFSGEYMRALILLMVDKIKLLFHKRRRDGCCTASALGNAMDGPSKGLRVKNKEAKKQCLPENFPSSSTCEMDNSTV
        MYSRCCLLNRLEGCSSKNPCCSFLQFSGEYMRALILLMVDKIKLLFHKRRR+GCC+ASALGNAMDGPSKGLRVK+KEAKKQCLPENFPSSSTCEMDNSTV
Subjt:  MYSRCCLLNRLEGCSSKNPCCSFLQFSGEYMRALILLMVDKIKLLFHKRRRDGCCTASALGNAMDGPSKGLRVKNKEAKKQCLPENFPSSSTCEMDNSTV

Query:  WSQRSMASGQAHDSHSNIGSSTDFVNSGLLLWNETRKQWVGNKMSGSQKQVQEPKISWNATYDNLLTTNKPFPEAIPLTEMIEFLVDVWEQEGLYD
        WSQRSMAS Q+HDS SNIGSSTDFVNSGLLLWNETRKQWVGNKMS SQKQVQEPKISWNATYD+LLTTNKPFPEAIPLTEMIEFLVDVWEQEGLYD
Subjt:  WSQRSMASGQAHDSHSNIGSSTDFVNSGLLLWNETRKQWVGNKMSGSQKQVQEPKISWNATYDNLLTTNKPFPEAIPLTEMIEFLVDVWEQEGLYD

A0A6J1GIP5 uncharacterized protein LOC111454240 isoform X1 (e value: 1.6e-90; identity: 83.67%)
Query:  MYSRCCLLNRLEGCSSKNPCCSFLQFSGEYMRALILLMVDKIKLLFHKRRRDGCCTASALGNAMDGPSKGLRVKNKEAKKQCLPENFPSSSTCEMDNSTV
        MYSRCCLL+RLEGCSSK PCCSFLQFSG+Y+RALI+L+VD +KLLFH+R   G CT  ALG+AMDGPS GLRV+++EAKKQCLPENF SSSTCEMDNSTV
Subjt:  MYSRCCLLNRLEGCSSKNPCCSFLQFSGEYMRALILLMVDKIKLLFHKRRRDGCCTASALGNAMDGPSKGLRVKNKEAKKQCLPENFPSSSTCEMDNSTV

Query:  WSQRSMASGQAHDSHSNIGSSTDFVNSGLLLWNETRKQWVGNKMSGSQKQVQEPKISWNATYDNLLTTNKPFPEAIPLTEMIEFLVDVWEQEGLYD
        WSQRSMAS Q+HDSH+N+GSST+FVNSGLLLWNETRKQWVGNK S SQK+V+EPKISWNATYD+LLTTNKPFPEAIPL EMIEFLVDVWEQEGLYD
Subjt:  WSQRSMASGQAHDSHSNIGSSTDFVNSGLLLWNETRKQWVGNKMSGSQKQVQEPKISWNATYDNLLTTNKPFPEAIPLTEMIEFLVDVWEQEGLYD

A0A6J1KQM2 uncharacterized protein LOC111496323 isoform X1 (e value: 8.0e-90; identity: 83.16%)
Query:  MYSRCCLLNRLEGCSSKNPCCSFLQFSGEYMRALILLMVDKIKLLFHKRRRDGCCTASALGNAMDGPSKGLRVKNKEAKKQCLPENFPSSSTCEMDNSTV
        MYSRCCLL+RLEGCSSK PCCSFLQFSG+Y+RALI+L+VD +KLLFH+R   G CT  ALG+AMDGPS GLRV ++EAKKQCLP+NF SSSTCEMDNSTV
Subjt:  MYSRCCLLNRLEGCSSKNPCCSFLQFSGEYMRALILLMVDKIKLLFHKRRRDGCCTASALGNAMDGPSKGLRVKNKEAKKQCLPENFPSSSTCEMDNSTV

Query:  WSQRSMASGQAHDSHSNIGSSTDFVNSGLLLWNETRKQWVGNKMSGSQKQVQEPKISWNATYDNLLTTNKPFPEAIPLTEMIEFLVDVWEQEGLYD
        WSQRSMAS Q+HDSH+N+GSST+FVNSGLLLWNETRKQWVGNK S SQK+V+EPKISWNATYD+LLTTNKPFPEAIPL EMIEFLVDVWEQEGLYD
Subjt:  WSQRSMASGQAHDSHSNIGSSTDFVNSGLLLWNETRKQWVGNKMSGSQKQVQEPKISWNATYDNLLTTNKPFPEAIPLTEMIEFLVDVWEQEGLYD

SwissProt top hits (columns: hit, e value, % identity, alignment)
No hits found
Arabidopsis top hits (columns: hit, e value, % identity, alignment)
AT1G15350.1 unknown protein (e value: 1.1e-27; identity: 48.28%)
Query:  SALGNAMDGPSKGLRVKNKEAKKQCLPENFPSSSTCEMDNSTVWSQRSM-ASGQAHDSHS---NIGSSTDFVNSGLLLWNETRKQWVG-NKMSGSQKQVQ
        S   +  D PS  +    +  KK  + E+F S+ST +MDN T  SQ S+ +S Q  DS S   N  +  ++VN GLLLWN+TR++WVG +K +      Q
Subjt:  SALGNAMDGPSKGLRVKNKEAKKQCLPENFPSSSTCEMDNSTVWSQRSM-ASGQAHDSHS---NIGSSTDFVNSGLLLWNETRKQWVG-NKMSGSQKQVQ

Query:  EPKISWN-ATYDNLLTTNKPFPEAIPLTEMIEFLVDVWEQEGLYD
          K++WN ATYD+LL +NK FP+ IPLTEM++FLVD+WEQEGLYD
Subjt:  EPKISWN-ATYDNLLTTNKPFPEAIPLTEMIEFLVDVWEQEGLYD

AT1G15350.2 unknown protein (e value: 1.1e-27; identity: 48.28%)
Query:  SALGNAMDGPSKGLRVKNKEAKKQCLPENFPSSSTCEMDNSTVWSQRSM-ASGQAHDSHS---NIGSSTDFVNSGLLLWNETRKQWVG-NKMSGSQKQVQ
        S   +  D PS  +    +  KK  + E+F S+ST +MDN T  SQ S+ +S Q  DS S   N  +  ++VN GLLLWN+TR++WVG +K +      Q
Subjt:  SALGNAMDGPSKGLRVKNKEAKKQCLPENFPSSSTCEMDNSTVWSQRSM-ASGQAHDSHS---NIGSSTDFVNSGLLLWNETRKQWVG-NKMSGSQKQVQ

Query:  EPKISWN-ATYDNLLTTNKPFPEAIPLTEMIEFLVDVWEQEGLYD
          K++WN ATYD+LL +NK FP+ IPLTEM++FLVD+WEQEGLYD
Subjt:  EPKISWN-ATYDNLLTTNKPFPEAIPLTEMIEFLVDVWEQEGLYD

AT4G32342.1 unknown protein (e value: 4.9e-31; identity: 50.68%)
Query:  GCCTAS-ALGNAMDGPSKGLRVKNKEAKK-QCLPENFPSSSTCEMD-NSTVWSQRSMASGQAHDSHSNIGSSTDFVNSGLLLWNETRKQWVGNKMSGSQK
        GCC     L   +D PSKGL+++ K  KK     ++F S+STC+MD N T+ SQ   +S    D   +  +ST+FVN GL+LWN TR+QW    ++  Q 
Subjt:  GCCTAS-ALGNAMDGPSKGLRVKNKEAKK-QCLPENFPSSSTCEMD-NSTVWSQRSMASGQAHDSHSNIGSSTDFVNSGLLLWNETRKQWVGNKMSGSQK

Query:  QVQEPKISWNATYDNLLTTNKPFPEAIPLTEMIEFLVDVWEQEGLY
         V EP ISWN+TYD+LL+TNK FP+ IPL EM+ FLVDVWE+EGLY
Subjt:  QVQEPKISWNATYDNLLTTNKPFPEAIPLTEMIEFLVDVWEQEGLY

AT5G25360.1 unknown protein (e value: 2.8e-39; identity: 55.78%)
Query:  GCCTASALGNAMDGPSKGLRVKNKEAKKQCLPENFPSSSTCEMDNSTVWSQRSMASGQAHDSHSNIGSS---TDFVNSGLLLWNETRKQWVGNKMSGSQK
        GCC    L  A+D PSKGLR++ +  KK  + E+F S+STCEMDNST+ SQRSM+S    ++ S   S+   T+FVN GL LWN+TR+QW+ N  S  + 
Subjt:  GCCTASALGNAMDGPSKGLRVKNKEAKKQCLPENFPSSSTCEMDNSTVWSQRSMASGQAHDSHSNIGSS---TDFVNSGLLLWNETRKQWVGNKMSGSQK

Query:  QVQEPKISWNATYDNLLTTNKPFPEAIPLTEMIEFLVDVWEQEGLYD
        +V+EP ISWNATY++LL  NK F   IPL EM++FLVDVWEQEGLYD
Subjt:  QVQEPKISWNATYDNLLTTNKPFPEAIPLTEMIEFLVDVWEQEGLYD

AT5G25360.2 unknown protein (e value: 2.8e-39; identity: 55.78%)
Query:  GCCTASALGNAMDGPSKGLRVKNKEAKKQCLPENFPSSSTCEMDNSTVWSQRSMASGQAHDSHSNIGSS---TDFVNSGLLLWNETRKQWVGNKMSGSQK
        GCC    L  A+D PSKGLR++ +  KK  + E+F S+STCEMDNST+ SQRSM+S    ++ S   S+   T+FVN GL LWN+TR+QW+ N  S  + 
Subjt:  GCCTASALGNAMDGPSKGLRVKNKEAKKQCLPENFPSSSTCEMDNSTVWSQRSMASGQAHDSHSNIGSS---TDFVNSGLLLWNETRKQWVGNKMSGSQK

Query:  QVQEPKISWNATYDNLLTTNKPFPEAIPLTEMIEFLVDVWEQEGLYD
        +V+EP ISWNATY++LL  NK F   IPL EM++FLVDVWEQEGLYD
Subjt:  QVQEPKISWNATYDNLLTTNKPFPEAIPLTEMIEFLVDVWEQEGLYD


Sequences
CDS sequence
ATGTATTCTAGGTGTTGTCTCCTCAACCGCTTAGAAGGTTGCTCTAGCAAGAATCCATGTTGTTCTTTCTTACAGTTTTCTGGAGAATATATGCGCGCTCTTATTCTTTT
GATGGTGGATAAAATCAAGCTTCTTTTCCATAAAAGAAGGCGTGATGGATGCTGCACTGCATCTGCACTAGGTAATGCAATGGATGGGCCGTCTAAAGGTCTGAGAGTTA
AGAACAAAGAAGCAAAGAAACAATGCTTACCAGAAAATTTCCCAAGCTCTAGCACATGTGAAATGGACAACAGTACAGTTTGGTCCCAGAGAAGCATGGCATCAGGCCAA
GCACATGATTCTCACAGCAATATTGGGAGCAGTACAGACTTTGTAAATTCTGGACTACTTCTTTGGAATGAGACCAGGAAGCAATGGGTTGGAAATAAAATGTCCGGGAG
CCAAAAGCAAGTTCAGGAACCTAAAATAAGCTGGAATGCTACTTACGACAACTTACTAACAACGAACAAGCCGTTCCCTGAGGCCATACCTCTTACTGAGATGATAGAGT
TTCTTGTTGATGTCTGGGAGCAGGAGGGTCTATATGACTGA
mRNA sequence
GTTTTTTTTGCTTTTCCCTCTTACCATTGCTGAAGAACATACCAGATTAGAAAATGGACAATGAATTGACTCTGTTTTCCACTTGGTAAGGAGCCATCCTTATTCAATGG
GTTATTTATTTCGACCCTTCTAAAATAATCTTCGACTGTGGACATTATTCGAAGGCTTCTCCCTTTCACGGAAAACACAATCAAGAACTTTAAGCCTCTCCCCACCAGGA
TTTAGTGCTCTTCTTCGGACAATTTTGCTTGGAGACGCGTTAATGCATTATCCAGGAATATTTGGCTTGTGCTTTAACAGATTATAACAGATGAGTGACAAATTCAAGAG
ACGACAAGCGATCACTCGGAGTAGCAACGATGGACAACCATTAATTTTCAATCTTTTCTTCTGTCTTTCTGGGGGTGGTGTTTTTTCTTTTCTTTTCTGGCTTTGGATCT
TGTAGATCACATTAAAAGGAGGATTTATTGGTCATTTTGTGGTTCTTCCCTTAGCTTGCAGGGTAAACAAATAGAAATGGAAGGATTAGCTTTTCGTGTGTAGTTTAAGA
GTATGTTTACTTTGGTTTGGAATCTTTAAATTTTCTTTTTGCTTCTTCATCAAGTAAGGTTGAGAGTTAGTGAACAGGGAACTTCTCCTGCCTTTTTACCCATCATTCCT
TTCTTCAAGTTCTGGGAAATTGTTTACGCAGGGGAAACTATGTATTCTAGGTGTTGTCTCCTCAACCGCTTAGAAGGTTGCTCTAGCAAGAATCCATGTTGTTCTTTCTT
ACAGTTTTCTGGAGAATATATGCGCGCTCTTATTCTTTTGATGGTGGATAAAATCAAGCTTCTTTTCCATAAAAGAAGGCGTGATGGATGCTGCACTGCATCTGCACTAG
GTAATGCAATGGATGGGCCGTCTAAAGGTCTGAGAGTTAAGAACAAAGAAGCAAAGAAACAATGCTTACCAGAAAATTTCCCAAGCTCTAGCACATGTGAAATGGACAAC
AGTACAGTTTGGTCCCAGAGAAGCATGGCATCAGGCCAAGCACATGATTCTCACAGCAATATTGGGAGCAGTACAGACTTTGTAAATTCTGGACTACTTCTTTGGAATGA
GACCAGGAAGCAATGGGTTGGAAATAAAATGTCCGGGAGCCAAAAGCAAGTTCAGGAACCTAAAATAAGCTGGAATGCTACTTACGACAACTTACTAACAACGAACAAGC
CGTTCCCTGAGGCCATACCTCTTACTGAGATGATAGAGTTTCTTGTTGATGTCTGGGAGCAGGAGGGTCTATATGACTGAGCCTACAATTCAAGGGAGTCTTCATATTGT
TTAAATTTGTGCAGCTTGAATTTTTTTTGCTGCCGTTCAATTACGAGCAATCGTTCGATGCTTATAGTATAAATATTACTGTTTATGTTGAATTCAAGCTACCCATTTTG
GATCTTTTTCAAGAATGGAATCATTTTGAAAACAGTCTTTTACGTTGAAGTTCTGCTAGATTCGATCTGTCTGGGGCTGGCATCTACTGGCTTTGGCATAATGAACACAG
TTTGGGCTCTAAAATTCTTAAGCCCGTGAAGATAGATAAGTCTTGTGCCCAATCATAAAAAAAATTTCAAGGGTATAAAGTGTATAAATACCATTTGTGATGGACTTTTC
TTTTTCTTTTTAAGTATATGATCTGCTTAGTAATTTGATGGATGTGC
Protein sequence
MYSRCCLLNRLEGCSSKNPCCSFLQFSGEYMRALILLMVDKIKLLFHKRRRDGCCTASALGNAMDGPSKGLRVKNKEAKKQCLPENFPSSSTCEMDNSTVWSQRSMASGQ
AHDSHSNIGSSTDFVNSGLLLWNETRKQWVGNKMSGSQKQVQEPKISWNATYDNLLTTNKPFPEAIPLTEMIEFLVDVWEQEGLYD
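
The relationship between the CDS and protein sequences above can be checked by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1) and that the CDS starts in frame at its first base; the names `CODON_TABLE` and `translate` are hypothetical and not part of CuGenDB.

```python
from itertools import product

# Standard genetic code (NCBI table 1), listed in TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip(("".join(p) for p in product(BASES, repeat=3)), AMINO_ACIDS)
)


def translate(cds: str) -> str:
    """Translate a CDS to protein, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so a sequence
    wrapped over several lines can be passed in directly.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First six codons and last four codons of the CDS listed above.
print(translate("ATGTATTCTAGGTGTTGT"))
print(translate("CTATATGACTGA"))
```

The CDS above is 591 nt long (196 codons plus a TGA stop), so passing the full wrapped CDS to `translate` is expected to reproduce the 196-residue protein sequence.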