; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0001129 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0001129
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionTransmembrane protein
Genome locationchr08:21106518..21108994
RNA-Seq ExpressionPI0001129
SyntenyPI0001129
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0036314.1 uncharacterized protein E6C27_scaffold18G001130 [Cucumis melo var. makuwa]2.9e-11086.47Show/hide
Query:  MSLPFQSLSLTSPSSSTLCFSTFFSRNPCVSFRFP--------------LAFRSPFNFGSINAHQFCPRVSTSGGVGRRP----GGDGDFDIDSLLSAAE
        MSLPFQSLSLTSPSSSTLCFST FSRNP VS  FP                FRSPFNFGSI+AHQFCPRVSTSGGVGR P    GG GDFDIDSLLSAAE
Subjt:  MSLPFQSLSLTSPSSSTLCFSTFFSRNPCVSFRFP--------------LAFRSPFNFGSINAHQFCPRVSTSGGVGRRP----GGDGDFDIDSLLSAAE

Query:  LFCLVASLIGSVGFALNCAKTRSKSVFLAVFGDGVFVGAILFLVAGVAIGAWIRRRQWNRIFRETAKGVLEVNLMEKTNKLEEDLRSSATLIRVLSRQLE
        LFCLVASLIGSVGFALNCAKTRSKSVFLAVFGDGV VGAILFLVAGVAIGAWIRRRQWNR+FRET KGVLEVNLMEKTNKLEEDLRSSATLIRVLSRQLE
Subjt:  LFCLVASLIGSVGFALNCAKTRSKSVFLAVFGDGVFVGAILFLVAGVAIGAWIRRRQWNRIFRETAKGVLEVNLMEKTNKLEEDLRSSATLIRVLSRQLE

Query:  KLGIRFRVTRKALKKPVEETAALAQKTSEATRALAVRGDILEKELAEIQKVLLAMQECSTSKLILI
        KLGIRFRVTRKALKKPVEETAALAQKTSEATRALAVRGDILEKELAEIQKVLLAMQE    +L LI
Subjt:  KLGIRFRVTRKALKKPVEETAALAQKTSEATRALAVRGDILEKELAEIQKVLLAMQECSTSKLILI

KAE8647553.1 hypothetical protein Csa_003483 [Cucumis sativus]6.5e-11085.98Show/hide
Query:  MSLPFQSLSLT--SPSSSTLCFSTFFSRNPCVSFRFP--------------LAFRSPFNFGSINAHQFCPRVSTSGGVGRRPGGDGDFDIDSLLSAAELF
        MSLPFQSLSLT  SPSSST CFSTF SRNPCVS  FP                FRSPFNFGSINAH FCPRVSTSGGVGRRPGG  DFDIDSLLSA E F
Subjt:  MSLPFQSLSLT--SPSSSTLCFSTFFSRNPCVSFRFP--------------LAFRSPFNFGSINAHQFCPRVSTSGGVGRRPGGDGDFDIDSLLSAAELF

Query:  CLVASLIGSVGFALNCAKTRSKSVFLAVFGDGVFVGAILFLVAGVAIGAWIRRRQWNRIFRETAKGVLEVNLMEKTNKLEEDLRSSATLIRVLSRQLEKL
        CLVASLIGSVGFALNCAKTRSKS+FLAVFGDGV VG ILFLVAGVAIGAWIRRRQWNR+FRETAKGVLEVNLMEKTNKLEEDLRSSATLIRVLSRQLEKL
Subjt:  CLVASLIGSVGFALNCAKTRSKSVFLAVFGDGVFVGAILFLVAGVAIGAWIRRRQWNRIFRETAKGVLEVNLMEKTNKLEEDLRSSATLIRVLSRQLEKL

Query:  GIRFRVTRKALKKPVEETAALAQKTSEATRALAVRGDILEKELAEIQKVLLAMQECSTSKLILI
        GIRFRVTRKALKKPVEETAALAQKTSEATRALAVRGDILEKELAEIQKVLLAMQE    +L LI
Subjt:  GIRFRVTRKALKKPVEETAALAQKTSEATRALAVRGDILEKELAEIQKVLLAMQECSTSKLILI

XP_004143460.1 uncharacterized protein LOC101207421 [Cucumis sativus]2.9e-11885.26Show/hide
Query:  MSPFPCLNTNPRFFNCLHSKTMSLPFQSLSLT--SPSSSTLCFSTFFSRNPCVSFRFP--------------LAFRSPFNFGSINAHQFCPRVSTSGGVG
        MSPFPCLNTNPRF   L S TMSLPFQSLSLT  SPSSST CFSTF SRNPCVS  FP                FRSPFNFGSINAH FCPRVSTSGGVG
Subjt:  MSPFPCLNTNPRFFNCLHSKTMSLPFQSLSLT--SPSSSTLCFSTFFSRNPCVSFRFP--------------LAFRSPFNFGSINAHQFCPRVSTSGGVG

Query:  RRPGGDGDFDIDSLLSAAELFCLVASLIGSVGFALNCAKTRSKSVFLAVFGDGVFVGAILFLVAGVAIGAWIRRRQWNRIFRETAKGVLEVNLMEKTNKL
        RRPGG  DFDIDSLLSA E FCLVASLIGSVGFALNCAKTRSKS+FLAVFGDGV VG ILFLVAGVAIGAWIRRRQWNR+FRETAKGVLEVNLMEKTNKL
Subjt:  RRPGGDGDFDIDSLLSAAELFCLVASLIGSVGFALNCAKTRSKSVFLAVFGDGVFVGAILFLVAGVAIGAWIRRRQWNRIFRETAKGVLEVNLMEKTNKL

Query:  EEDLRSSATLIRVLSRQLEKLGIRFRVTRKALKKPVEETAALAQKTSEATRALAVRGDILEKELAEIQKVLLAMQECSTSKLILI
        EEDLRSSATLIRVLSRQLEKLGIRFRVTRKALKKPVEETAALAQKTSEATRALAVRGDILEKELAEIQKVLLAMQE    +L LI
Subjt:  EEDLRSSATLIRVLSRQLEKLGIRFRVTRKALKKPVEETAALAQKTSEATRALAVRGDILEKELAEIQKVLLAMQECSTSKLILI

XP_008440583.1 PREDICTED: uncharacterized protein LOC103484959 isoform X1 [Cucumis melo]3.0e-12386.76Show/hide
Query:  MSPFPCLNTNPRFFNCLHSKTMSLPFQSLSLTSPSSSTLCFSTFFSRNPCVSFRFP--------------LAFRSPFNFGSINAHQFCPRVSTSGGVGRR
        MSPFPCL+TNPRF NCLHSKTMSLPFQSLSLTSPSSSTLCFST FSRNP VS  FP                FRSPFNFGSI+AHQFCPRVSTSGGVGR 
Subjt:  MSPFPCLNTNPRFFNCLHSKTMSLPFQSLSLTSPSSSTLCFSTFFSRNPCVSFRFP--------------LAFRSPFNFGSINAHQFCPRVSTSGGVGRR

Query:  P----GGDGDFDIDSLLSAAELFCLVASLIGSVGFALNCAKTRSKSVFLAVFGDGVFVGAILFLVAGVAIGAWIRRRQWNRIFRETAKGVLEVNLMEKTN
        P    GG GDFDIDSLLSAAELFCLVASLIGSVGFALNCAKTRSKSVFLAVFGDGV VGAILFLVAGVAIGAWIRRRQWNR+FRET KGVLEVNLMEKTN
Subjt:  P----GGDGDFDIDSLLSAAELFCLVASLIGSVGFALNCAKTRSKSVFLAVFGDGVFVGAILFLVAGVAIGAWIRRRQWNRIFRETAKGVLEVNLMEKTN

Query:  KLEEDLRSSATLIRVLSRQLEKLGIRFRVTRKALKKPVEETAALAQKTSEATRALAVRGDILEKELAEIQKVLLAMQECSTSKLILI
        KLEEDLRSSATLIRVLSRQLEKLGIRFRVTRKALKKPVEETAALAQKTSEATRALAVRGDILEKELAEIQKVLLAMQE    +L LI
Subjt:  KLEEDLRSSATLIRVLSRQLEKLGIRFRVTRKALKKPVEETAALAQKTSEATRALAVRGDILEKELAEIQKVLLAMQECSTSKLILI

XP_038881992.1 uncharacterized protein LOC120073309 [Benincasa hispida]9.4e-10985.34Show/hide
Query:  MSLPFQSLSLTSPS----SSTLCFSTFFSRNPCVSFRFPLA--------------FRSPFNFGSINAHQFCPRVSTSGGVGRRPGGDGDFDIDSLLSAAE
        MSL FQSLSLTSPS     STLCFSTFFSRNPC+S RF  +              FRSPFNFGSINAHQFCPRVSTSGGVGR+ GGDGDFDIDSLLSAAE
Subjt:  MSLPFQSLSLTSPS----SSTLCFSTFFSRNPCVSFRFPLA--------------FRSPFNFGSINAHQFCPRVSTSGGVGRRPGGDGDFDIDSLLSAAE

Query:  LFCLVASLIGSVGFALNCAKTRSKSVFLAVFGDGVFVGAILFLVAGVAIGAWIRRRQWNRIFRETAKGVLEVNLMEKTNKLEEDLRSSATLIRVLSRQLE
        LFCLV SLIGSVGFALN AK RSKSVFLAVFGDG+FVGAILFLVAGVAIGAWIRRRQWNRIFRETAKGVL VNLMEKTN+LEEDLRSSATLIRVLSRQLE
Subjt:  LFCLVASLIGSVGFALNCAKTRSKSVFLAVFGDGVFVGAILFLVAGVAIGAWIRRRQWNRIFRETAKGVLEVNLMEKTNKLEEDLRSSATLIRVLSRQLE

Query:  KLGIRFRVTRKALKKPVEETAALAQKTSEATRALAVRGDILEKELAEIQKVLLAMQECSTSKLILI
        KLGIRFRVTRKALKKPVEETAALAQKTSEATRALAVRGDILEKELAEIQKVLLAMQE    +L LI
Subjt:  KLGIRFRVTRKALKKPVEETAALAQKTSEATRALAVRGDILEKELAEIQKVLLAMQECSTSKLILI

TrEMBL top hitse value%identityAlignment
A0A0A0KK16 Uncharacterized protein1.4e-14585.29Show/hide
Query:  MKNKEHTLIKVKGGGGPKQRAKKAQYFGRSYPPLFHSRVKSNLSSLSFSILKDGPMSPFPCLNTNPRFFNCLHSKTMSLPFQSLSLT--SPSSSTLCFST
        MKN+E TLIKVKGG GPK+RAKKAQYFGR YP L HSRVKSNLSSL FSILKDGPMSPFPCLNTNPRF   L S TMSLPFQSLSLT  SPSSST CFST
Subjt:  MKNKEHTLIKVKGGGGPKQRAKKAQYFGRSYPPLFHSRVKSNLSSLSFSILKDGPMSPFPCLNTNPRFFNCLHSKTMSLPFQSLSLT--SPSSSTLCFST

Query:  FFSRNPCVSFRFP--------------LAFRSPFNFGSINAHQFCPRVSTSGGVGRRPGGDGDFDIDSLLSAAELFCLVASLIGSVGFALNCAKTRSKSV
        F SRNPCVS  FP                FRSPFNFGSINAH FCPRVSTSGGVGRRPGG  DFDIDSLLSA E FCLVASLIGSVGFALNCAKTRSKS+
Subjt:  FFSRNPCVSFRFP--------------LAFRSPFNFGSINAHQFCPRVSTSGGVGRRPGGDGDFDIDSLLSAAELFCLVASLIGSVGFALNCAKTRSKSV

Query:  FLAVFGDGVFVGAILFLVAGVAIGAWIRRRQWNRIFRETAKGVLEVNLMEKTNKLEEDLRSSATLIRVLSRQLEKLGIRFRVTRKALKKPVEETAALAQK
        FLAVFGDGV VG ILFLVAGVAIGAWIRRRQWNR+FRETAKGVLEVNLMEKTNKLEEDLRSSATLIRVLSRQLEKLGIRFRVTRKALKKPVEETAALAQK
Subjt:  FLAVFGDGVFVGAILFLVAGVAIGAWIRRRQWNRIFRETAKGVLEVNLMEKTNKLEEDLRSSATLIRVLSRQLEKLGIRFRVTRKALKKPVEETAALAQK

Query:  TSEATRALAVRGDILEKELAEIQKVLLAMQECSTSKLILI
        TSEATRALAVRGDILEKELAEIQKVLLAMQE    +L LI
Subjt:  TSEATRALAVRGDILEKELAEIQKVLLAMQECSTSKLILI

A0A1S3B274 uncharacterized protein LOC103484959 isoform X11.5e-12386.76Show/hide
Query:  MSPFPCLNTNPRFFNCLHSKTMSLPFQSLSLTSPSSSTLCFSTFFSRNPCVSFRFP--------------LAFRSPFNFGSINAHQFCPRVSTSGGVGRR
        MSPFPCL+TNPRF NCLHSKTMSLPFQSLSLTSPSSSTLCFST FSRNP VS  FP                FRSPFNFGSI+AHQFCPRVSTSGGVGR 
Subjt:  MSPFPCLNTNPRFFNCLHSKTMSLPFQSLSLTSPSSSTLCFSTFFSRNPCVSFRFP--------------LAFRSPFNFGSINAHQFCPRVSTSGGVGRR

Query:  P----GGDGDFDIDSLLSAAELFCLVASLIGSVGFALNCAKTRSKSVFLAVFGDGVFVGAILFLVAGVAIGAWIRRRQWNRIFRETAKGVLEVNLMEKTN
        P    GG GDFDIDSLLSAAELFCLVASLIGSVGFALNCAKTRSKSVFLAVFGDGV VGAILFLVAGVAIGAWIRRRQWNR+FRET KGVLEVNLMEKTN
Subjt:  P----GGDGDFDIDSLLSAAELFCLVASLIGSVGFALNCAKTRSKSVFLAVFGDGVFVGAILFLVAGVAIGAWIRRRQWNRIFRETAKGVLEVNLMEKTN

Query:  KLEEDLRSSATLIRVLSRQLEKLGIRFRVTRKALKKPVEETAALAQKTSEATRALAVRGDILEKELAEIQKVLLAMQECSTSKLILI
        KLEEDLRSSATLIRVLSRQLEKLGIRFRVTRKALKKPVEETAALAQKTSEATRALAVRGDILEKELAEIQKVLLAMQE    +L LI
Subjt:  KLEEDLRSSATLIRVLSRQLEKLGIRFRVTRKALKKPVEETAALAQKTSEATRALAVRGDILEKELAEIQKVLLAMQECSTSKLILI

A0A1S4DT19 uncharacterized protein LOC103484959 isoform X27.5e-10481.37Show/hide
Query:  MSPFPCLNTNPRFFNCLHSKTMSLPFQSLSLTSPSSSTLCFSTFFSRNPCVSFRFP--------------LAFRSPFNFGSINAHQFCPRVSTSGGVGRR
        MSPFPCL+TNPRF NCLHSKTMSLPFQSLSLTSPSSSTLCFST FSRNP VS  FP                FRSPFNFGSI+AHQFCPRVSTSGGVGR 
Subjt:  MSPFPCLNTNPRFFNCLHSKTMSLPFQSLSLTSPSSSTLCFSTFFSRNPCVSFRFP--------------LAFRSPFNFGSINAHQFCPRVSTSGGVGRR

Query:  P----GGDGDFDIDSLLSAAELFCLVASLIGSVGFALNCAKTRSKSVFLAVFGDGVFVGAILFLVAGVAIGAWIRRRQWNRIFRETAKGVLEVNLMEKTN
        P    GG GDFDIDSLLSAAELFCLVASLIGSVGFALNCAKTRSKSVFLAVFGDGV VGAILFLVAGVAIGAWIRRRQWNR+FRET KGVLEVNLMEKTN
Subjt:  P----GGDGDFDIDSLLSAAELFCLVASLIGSVGFALNCAKTRSKSVFLAVFGDGVFVGAILFLVAGVAIGAWIRRRQWNRIFRETAKGVLEVNLMEKTN

Query:  KLEEDLRSSATLIRVLSRQLEKLGIRFRVTRKALKKPVEETAALAQKTSEATRALAVRGDILE
        KLEEDLRSSATLIRVLSRQLEKLGIRFRVTRKALKKPVEE     QK  E   A+   G + E
Subjt:  KLEEDLRSSATLIRVLSRQLEKLGIRFRVTRKALKKPVEETAALAQKTSEATRALAVRGDILE

A0A5A7SYF0 Uncharacterized protein1.4e-11086.47Show/hide
Query:  MSLPFQSLSLTSPSSSTLCFSTFFSRNPCVSFRFP--------------LAFRSPFNFGSINAHQFCPRVSTSGGVGRRP----GGDGDFDIDSLLSAAE
        MSLPFQSLSLTSPSSSTLCFST FSRNP VS  FP                FRSPFNFGSI+AHQFCPRVSTSGGVGR P    GG GDFDIDSLLSAAE
Subjt:  MSLPFQSLSLTSPSSSTLCFSTFFSRNPCVSFRFP--------------LAFRSPFNFGSINAHQFCPRVSTSGGVGRRP----GGDGDFDIDSLLSAAE

Query:  LFCLVASLIGSVGFALNCAKTRSKSVFLAVFGDGVFVGAILFLVAGVAIGAWIRRRQWNRIFRETAKGVLEVNLMEKTNKLEEDLRSSATLIRVLSRQLE
        LFCLVASLIGSVGFALNCAKTRSKSVFLAVFGDGV VGAILFLVAGVAIGAWIRRRQWNR+FRET KGVLEVNLMEKTNKLEEDLRSSATLIRVLSRQLE
Subjt:  LFCLVASLIGSVGFALNCAKTRSKSVFLAVFGDGVFVGAILFLVAGVAIGAWIRRRQWNRIFRETAKGVLEVNLMEKTNKLEEDLRSSATLIRVLSRQLE

Query:  KLGIRFRVTRKALKKPVEETAALAQKTSEATRALAVRGDILEKELAEIQKVLLAMQECSTSKLILI
        KLGIRFRVTRKALKKPVEETAALAQKTSEATRALAVRGDILEKELAEIQKVLLAMQE    +L LI
Subjt:  KLGIRFRVTRKALKKPVEETAALAQKTSEATRALAVRGDILEKELAEIQKVLLAMQECSTSKLILI

A0A6J1BTL1 uncharacterized protein LOC111005633 isoform X12.4e-9475.28Show/hide
Query:  MSLPFQSLSLTSPS----SSTLCFSTFFSRNPCVSFRF-PLAF--------------RSPFNFGSINAHQFCPRVSTSGGVGRRPGGDGDFDIDSLLSAA
        MS+ FQ LSL+SPS     ST  FS+FFSRNPC S RF P  F              RSPFNF SIN HQFCPRVSTSGGVGRR   DGDF++DS LSAA
Subjt:  MSLPFQSLSLTSPS----SSTLCFSTFFSRNPCVSFRF-PLAF--------------RSPFNFGSINAHQFCPRVSTSGGVGRRPGGDGDFDIDSLLSAA

Query:  ELFCLVASLIGSVGFALNCAKTRSKSVFLAVFGDGVFVGAILFLVAGVAIGAWIRRRQWNRIFRETAKGVLEVNLMEKTNKLEEDLRSSATLIRVLSRQL
        ELFCLV+SL+ SVG ALN  K RSKS+FLAVFGDG+FVGA LFLVAGVAIGAWIRRRQWNRI+R TAK  LE++L+E+TNKLEEDL+SSATLIRVLSRQL
Subjt:  ELFCLVASLIGSVGFALNCAKTRSKSVFLAVFGDGVFVGAILFLVAGVAIGAWIRRRQWNRIFRETAKGVLEVNLMEKTNKLEEDLRSSATLIRVLSRQL

Query:  EKLGIRFRVTRKALKKPVEETAALAQKTSEATRALAVRGDILEKELAEIQKVLLAMQECSTSKLILI
        EKLGIRFRVTRKALKKP+EETA LAQKTSEATRALAVRGDILEKELAEIQKVLLAMQE    +L LI
Subjt:  EKLGIRFRVTRKALKKPVEETAALAQKTSEATRALAVRGDILEKELAEIQKVLLAMQECSTSKLILI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G65250.1 unknown protein1.7e-3651.28Show/hide
Query:  STSGGVGRRPGGDGDFDIDSLLSAAELFCLVASLIGSVGFALNCAKTRSKSVFLAVFGDGVFVGAILFLVAGVAIGAWIRRRQWNRIFRETAKGVLE---
        S+S  +      DG FD+ S +S AE  C+++S + SV  A+N        V +   G  V     + LV  VA G+W+RRRQW RI     KG  E   
Subjt:  STSGGVGRRPGGDGDFDIDSLLSAAELFCLVASLIGSVGFALNCAKTRSKSVFLAVFGDGVFVGAILFLVAGVAIGAWIRRRQWNRIFRETAKGVLE---

Query:  VNLMEKTNKLEEDLRSSATLIRVLSRQLEKLGIRFRVTRKALKKPVEETAALAQKTSEATRALAVRGDILEKELAEIQKVLLAMQECSTSKLILI
         NL+ +  KLE+DL+SS +++RVLSR LEKLGIRFRVTRKALK+P+ ETAALAQK SEATR L  + +ILEKEL EIQKVLLAMQE    +L LI
Subjt:  VNLMEKTNKLEEDLRSSATLIRVLSRQLEKLGIRFRVTRKALKKPVEETAALAQKTSEATRALAVRGDILEKELAEIQKVLLAMQECSTSKLILI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGAATAAAGAACACACATTGATAAAGGTCAAAGGAGGCGGGGGCCCAAAGCAGCGGGCCAAGAAGGCCCAATACTTTGGACGCAGTTATCCGCCTCTTTTCCATTC
CCGTGTGAAGTCAAATCTGTCTTCACTTTCATTTTCTATTCTGAAGGATGGACCTATGTCCCCCTTCCCATGTCTGAATACAAACCCTAGATTCTTCAACTGTCTACACT
CTAAAACGATGTCGCTTCCTTTCCAATCCCTTTCACTCACTTCACCTTCTTCTTCAACCTTGTGCTTTTCCACCTTCTTTTCTAGAAATCCGTGCGTGTCTTTTCGATTC
CCCCTAGCCTTTCGAAGCCCTTTTAATTTTGGTTCCATCAATGCCCATCAGTTCTGTCCTCGAGTTTCTACTTCTGGAGGAGTAGGTCGGAGACCCGGTGGTGATGGTGA
TTTCGATATCGATTCTTTACTTTCAGCTGCCGAGTTGTTTTGCCTTGTTGCGTCGTTGATCGGTTCTGTTGGTTTTGCTTTGAATTGCGCGAAAACCAGGTCTAAGAGCG
TTTTCTTGGCGGTGTTTGGTGATGGGGTTTTCGTTGGCGCGATTTTATTTCTGGTGGCTGGGGTTGCGATTGGTGCTTGGATTCGTAGGCGGCAGTGGAATCGAATATTT
CGAGAGACAGCGAAGGGCGTGTTAGAGGTGAATTTAATGGAAAAGACTAACAAACTTGAGGAGGATTTGAGGAGCTCGGCAACGCTAATTCGAGTTTTGTCGAGGCAGCT
GGAGAAGTTAGGGATTAGGTTTAGAGTTACTCGAAAGGCTCTGAAGAAGCCCGTTGAGGAGACTGCAGCTTTAGCTCAAAAGACTTCTGAGGCCACTCGAGCATTAGCAG
TTCGGGGAGATATTTTGGAGAAGGAGCTTGCTGAAATCCAGAAGGTTTTACTAGCTATGCAGGAGTGCTCTACATCAAAGCTGATTCTGATTTCTTGTTTGAGTGGGGAT
TGTAGGAACAACAACAAAAGCAACTTGAGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGAATAAAGAACACACATTGATAAAGGTCAAAGGAGGCGGGGGCCCAAAGCAGCGGGCCAAGAAGGCCCAATACTTTGGACGCAGTTATCCGCCTCTTTTCCATTC
CCGTGTGAAGTCAAATCTGTCTTCACTTTCATTTTCTATTCTGAAGGATGGACCTATGTCCCCCTTCCCATGTCTGAATACAAACCCTAGATTCTTCAACTGTCTACACT
CTAAAACGATGTCGCTTCCTTTCCAATCCCTTTCACTCACTTCACCTTCTTCTTCAACCTTGTGCTTTTCCACCTTCTTTTCTAGAAATCCGTGCGTGTCTTTTCGATTC
CCCCTAGCCTTTCGAAGCCCTTTTAATTTTGGTTCCATCAATGCCCATCAGTTCTGTCCTCGAGTTTCTACTTCTGGAGGAGTAGGTCGGAGACCCGGTGGTGATGGTGA
TTTCGATATCGATTCTTTACTTTCAGCTGCCGAGTTGTTTTGCCTTGTTGCGTCGTTGATCGGTTCTGTTGGTTTTGCTTTGAATTGCGCGAAAACCAGGTCTAAGAGCG
TTTTCTTGGCGGTGTTTGGTGATGGGGTTTTCGTTGGCGCGATTTTATTTCTGGTGGCTGGGGTTGCGATTGGTGCTTGGATTCGTAGGCGGCAGTGGAATCGAATATTT
CGAGAGACAGCGAAGGGCGTGTTAGAGGTGAATTTAATGGAAAAGACTAACAAACTTGAGGAGGATTTGAGGAGCTCGGCAACGCTAATTCGAGTTTTGTCGAGGCAGCT
GGAGAAGTTAGGGATTAGGTTTAGAGTTACTCGAAAGGCTCTGAAGAAGCCCGTTGAGGAGACTGCAGCTTTAGCTCAAAAGACTTCTGAGGCCACTCGAGCATTAGCAG
TTCGGGGAGATATTTTGGAGAAGGAGCTTGCTGAAATCCAGAAGGTTTTACTAGCTATGCAGGAGTGCTCTACATCAAAGCTGATTCTGATTTCTTGTTTGAGTGGGGAT
TGTAGGAACAACAACAAAAGCAACTTGAGTTGATTCTAGCAATAGGAAAGTCAGGAAAGATGTGGGAAAGCAGACAGGAGCATGGTGGAGGACAAAGTCATATTGGGAGG
CATGATCTGATTGATGAACGCTTAAATCGAAAGGAAGTCCAGGACGTTTGAGCTGTTTGAAGAAGCGAATGGAGATAGCTTAGGATTCTTTTGTTACAAATTAGGATGGT
AGTTGTTGCATAACGACAGAATAGAAGAATGGGATGTAATTTTGTGGCAGTTTAAACTTTGGCATTTATCATGCTTGATACTTCTTGTCAGAAAAGATTTTGGGTTTCCA
TAATGTGCAATTTAATGGGAAATGAACGTTGCACCGAGAGAAAGTGTAAAGCGGCAGGGGAATTGTAATGGGAGAAATGTATTTGTTATGGTCATTGGGTTAATGTAATA
AACATTCTGCTTAACCATTTTAATTTTGATTTTCGTTGTATTA
Protein sequenceShow/hide protein sequence
MKNKEHTLIKVKGGGGPKQRAKKAQYFGRSYPPLFHSRVKSNLSSLSFSILKDGPMSPFPCLNTNPRFFNCLHSKTMSLPFQSLSLTSPSSSTLCFSTFFSRNPCVSFRF
PLAFRSPFNFGSINAHQFCPRVSTSGGVGRRPGGDGDFDIDSLLSAAELFCLVASLIGSVGFALNCAKTRSKSVFLAVFGDGVFVGAILFLVAGVAIGAWIRRRQWNRIF
RETAKGVLEVNLMEKTNKLEEDLRSSATLIRVLSRQLEKLGIRFRVTRKALKKPVEETAALAQKTSEATRALAVRGDILEKELAEIQKVLLAMQECSTSKLILISCLSGD
CRNNNKSNLS