; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg022274 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg022274
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionProtein of unknown function (DUF1639)
Genome locationscaffold2:10490159..10492821
RNA-Seq ExpressionSpg022274
SyntenySpg022274
Gene Ontology termsNA
InterPro domainsIPR012438 - Protein of unknown function DUF1639


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004144072.1 uncharacterized protein LOC101218228 isoform X4 [Cucumis sativus]4.9e-10392.79Show/hide
Query:  MENEGIAQKSNKSGETDFLLQWGNRKRMRYMKVKEPQRVTSKPDCLGKKKVSSNGDRRVVAAEKASSTQQTHRLNKNIVSPTNNQRTSVTSPEKEDRYYT
        ME E I QKSNKSGETDFLLQWGNRKRMRYMKVKEPQR+TSKPDCLGKKKVSS GDRRVV AEKASSTQQTHRLNK+I SPT+NQRT++TSPEKEDRYYT
Subjt:  MENEGIAQKSNKSGETDFLLQWGNRKRMRYMKVKEPQRVTSKPDCLGKKKVSSNGDRRVVAAEKASSTQQTHRLNKNIVSPTNNQRTSVTSPEKEDRYYT

Query:  TRGSMGVDEKASMDHPTGNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKKRPRGLKAM
        TRGSMGVD+K SMDHP GNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKKRPRGLKAM
Subjt:  TRGSMGVDEKASMDHPTGNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKKRPRGLKAM

Query:  RSMDSDSE
        RSM+SDSE
Subjt:  RSMDSDSE

XP_008451016.1 PREDICTED: uncharacterized protein LOC103492424 isoform X1 [Cucumis melo]2.0e-10491.59Show/hide
Query:  KLLDIDMENEGIAQKSNKSGETDFLLQWGNRKRMRYMKVKEPQRVTSKPDCLGKKKVSSNGDRRVVAAEKASSTQQTHRLNKNIVSPTNNQRTSVTSPEK
        K  + DME E I QKSNKSGETDFLLQWGNRKRMRYMKVKEPQR+TSKPDCLGKKKVSS GDRRVV AEKASSTQQTHRLNK+I SPT+NQRT++TSPEK
Subjt:  KLLDIDMENEGIAQKSNKSGETDFLLQWGNRKRMRYMKVKEPQRVTSKPDCLGKKKVSSNGDRRVVAAEKASSTQQTHRLNKNIVSPTNNQRTSVTSPEK

Query:  EDRYYTTRGSMGVDEKASMDHPTGNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKKRP
        EDRYYTTRGSMGVDEK S+DHPTGNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKKRP
Subjt:  EDRYYTTRGSMGVDEKASMDHPTGNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKKRP

Query:  RGLKAMRSMDSDSE
        RGLKAMRSM+SDSE
Subjt:  RGLKAMRSMDSDSE

XP_008451017.1 PREDICTED: uncharacterized protein LOC103492424 isoform X2 [Cucumis melo]1.3e-10393.27Show/hide
Query:  MENEGIAQKSNKSGETDFLLQWGNRKRMRYMKVKEPQRVTSKPDCLGKKKVSSNGDRRVVAAEKASSTQQTHRLNKNIVSPTNNQRTSVTSPEKEDRYYT
        ME E I QKSNKSGETDFLLQWGNRKRMRYMKVKEPQR+TSKPDCLGKKKVSS GDRRVV AEKASSTQQTHRLNK+I SPT+NQRT++TSPEKEDRYYT
Subjt:  MENEGIAQKSNKSGETDFLLQWGNRKRMRYMKVKEPQRVTSKPDCLGKKKVSSNGDRRVVAAEKASSTQQTHRLNKNIVSPTNNQRTSVTSPEKEDRYYT

Query:  TRGSMGVDEKASMDHPTGNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKKRPRGLKAM
        TRGSMGVDEK S+DHPTGNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKKRPRGLKAM
Subjt:  TRGSMGVDEKASMDHPTGNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKKRPRGLKAM

Query:  RSMDSDSE
        RSM+SDSE
Subjt:  RSMDSDSE

XP_011660058.1 uncharacterized protein LOC101218228 isoform X3 [Cucumis sativus]5.8e-10491.12Show/hide
Query:  KLLDIDMENEGIAQKSNKSGETDFLLQWGNRKRMRYMKVKEPQRVTSKPDCLGKKKVSSNGDRRVVAAEKASSTQQTHRLNKNIVSPTNNQRTSVTSPEK
        K  + DME E I QKSNKSGETDFLLQWGNRKRMRYMKVKEPQR+TSKPDCLGKKKVSS GDRRVV AEKASSTQQTHRLNK+I SPT+NQRT++TSPEK
Subjt:  KLLDIDMENEGIAQKSNKSGETDFLLQWGNRKRMRYMKVKEPQRVTSKPDCLGKKKVSSNGDRRVVAAEKASSTQQTHRLNKNIVSPTNNQRTSVTSPEK

Query:  EDRYYTTRGSMGVDEKASMDHPTGNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKKRP
        EDRYYTTRGSMGVD+K SMDHP GNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKKRP
Subjt:  EDRYYTTRGSMGVDEKASMDHPTGNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKKRP

Query:  RGLKAMRSMDSDSE
        RGLKAMRSM+SDSE
Subjt:  RGLKAMRSMDSDSE

XP_038879793.1 uncharacterized protein LOC120071537 [Benincasa hispida]2.6e-10493.75Show/hide
Query:  MENEGIAQKSNKSGETDFLLQWGNRKRMRYMKVKEPQRVTSKPDCLGKKKVSSNGDRRVVAAEKASSTQQTHRLNKNIVSPTNNQRTSVTSPEKEDRYYT
        ME E I QKSNKSGETDFLLQWGNRKRMRYMKVKEPQR+T+KPDCLGKKKVSS GDRRVV AEKASST QTHRLNK+IV+PTNNQRT++TSPEKEDRYYT
Subjt:  MENEGIAQKSNKSGETDFLLQWGNRKRMRYMKVKEPQRVTSKPDCLGKKKVSSNGDRRVVAAEKASSTQQTHRLNKNIVSPTNNQRTSVTSPEKEDRYYT

Query:  TRGSMGVDEKASMDHPTGNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKKRPRGLKAM
        TRGSMGVDEKASMDHPTGNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKKRPRGLKAM
Subjt:  TRGSMGVDEKASMDHPTGNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKKRPRGLKAM

Query:  RSMDSDSE
        RSMDS+SE
Subjt:  RSMDSDSE

TrEMBL top hitse value%identityAlignment
A0A0A0LZ81 Uncharacterized protein2.4e-10392.79Show/hide
Query:  MENEGIAQKSNKSGETDFLLQWGNRKRMRYMKVKEPQRVTSKPDCLGKKKVSSNGDRRVVAAEKASSTQQTHRLNKNIVSPTNNQRTSVTSPEKEDRYYT
        ME E I QKSNKSGETDFLLQWGNRKRMRYMKVKEPQR+TSKPDCLGKKKVSS GDRRVV AEKASSTQQTHRLNK+I SPT+NQRT++TSPEKEDRYYT
Subjt:  MENEGIAQKSNKSGETDFLLQWGNRKRMRYMKVKEPQRVTSKPDCLGKKKVSSNGDRRVVAAEKASSTQQTHRLNKNIVSPTNNQRTSVTSPEKEDRYYT

Query:  TRGSMGVDEKASMDHPTGNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKKRPRGLKAM
        TRGSMGVD+K SMDHP GNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKKRPRGLKAM
Subjt:  TRGSMGVDEKASMDHPTGNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKKRPRGLKAM

Query:  RSMDSDSE
        RSM+SDSE
Subjt:  RSMDSDSE

A0A1S3BPZ2 uncharacterized protein LOC103492424 isoform X19.6e-10591.59Show/hide
Query:  KLLDIDMENEGIAQKSNKSGETDFLLQWGNRKRMRYMKVKEPQRVTSKPDCLGKKKVSSNGDRRVVAAEKASSTQQTHRLNKNIVSPTNNQRTSVTSPEK
        K  + DME E I QKSNKSGETDFLLQWGNRKRMRYMKVKEPQR+TSKPDCLGKKKVSS GDRRVV AEKASSTQQTHRLNK+I SPT+NQRT++TSPEK
Subjt:  KLLDIDMENEGIAQKSNKSGETDFLLQWGNRKRMRYMKVKEPQRVTSKPDCLGKKKVSSNGDRRVVAAEKASSTQQTHRLNKNIVSPTNNQRTSVTSPEK

Query:  EDRYYTTRGSMGVDEKASMDHPTGNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKKRP
        EDRYYTTRGSMGVDEK S+DHPTGNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKKRP
Subjt:  EDRYYTTRGSMGVDEKASMDHPTGNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKKRP

Query:  RGLKAMRSMDSDSE
        RGLKAMRSM+SDSE
Subjt:  RGLKAMRSMDSDSE

A0A1S3BQL5 uncharacterized protein LOC103492424 isoform X26.2e-10493.27Show/hide
Query:  MENEGIAQKSNKSGETDFLLQWGNRKRMRYMKVKEPQRVTSKPDCLGKKKVSSNGDRRVVAAEKASSTQQTHRLNKNIVSPTNNQRTSVTSPEKEDRYYT
        ME E I QKSNKSGETDFLLQWGNRKRMRYMKVKEPQR+TSKPDCLGKKKVSS GDRRVV AEKASSTQQTHRLNK+I SPT+NQRT++TSPEKEDRYYT
Subjt:  MENEGIAQKSNKSGETDFLLQWGNRKRMRYMKVKEPQRVTSKPDCLGKKKVSSNGDRRVVAAEKASSTQQTHRLNKNIVSPTNNQRTSVTSPEKEDRYYT

Query:  TRGSMGVDEKASMDHPTGNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKKRPRGLKAM
        TRGSMGVDEK S+DHPTGNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKKRPRGLKAM
Subjt:  TRGSMGVDEKASMDHPTGNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKKRPRGLKAM

Query:  RSMDSDSE
        RSM+SDSE
Subjt:  RSMDSDSE

A0A6J1D1C5 uncharacterized protein LOC1110164521.2e-10292.38Show/hide
Query:  MENEGIAQKSNKSGETDFLLQWGNRKRMRYMKVKEPQRVTSKPDCLGKKKVSS--NGDRRVVAAEKASSTQQTHRLNKNIVSPTNNQRTSVTSPEKEDRY
        M+NEGIAQKSNKSGETDF+LQWGNRKRMRYMKVKEP+RVTSKPDCLGKKKVSS  NGDRRVV+AEKASST Q +RLNKN+V PTNNQRTS TSPEKEDRY
Subjt:  MENEGIAQKSNKSGETDFLLQWGNRKRMRYMKVKEPQRVTSKPDCLGKKKVSS--NGDRRVVAAEKASSTQQTHRLNKNIVSPTNNQRTSVTSPEKEDRY

Query:  YTTRGSMGVDEKASMDHPTGNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKKRPRGLK
        YTTRGSMGVDEKAS+D PTGNDRKGFVWPRL++SLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKKRPRGLK
Subjt:  YTTRGSMGVDEKASMDHPTGNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKKRPRGLK

Query:  AMRSMDSDSE
        AMRSMDSDSE
Subjt:  AMRSMDSDSE

A0A6J1H9M1 uncharacterized protein LOC1114613605.8e-10293.3Show/hide
Query:  MENEGIAQKSNKSGETDFLLQWGNRKRMRYMKVKEPQRVTSKPDCLGKKKVSSNGDRRVVAA-EKASSTQQTHRLNKNIVSPTNNQRTSVTSPEKEDRYY
        MENE IAQKSNKSGETDF+LQWGNRKRMRYMKVKEPQR++SK DCLGKKKVSS GDRRV+AA EK SS  QTHRLNKNIVSPTNNQRTSVTSPEKEDRYY
Subjt:  MENEGIAQKSNKSGETDFLLQWGNRKRMRYMKVKEPQRVTSKPDCLGKKKVSSNGDRRVVAA-EKASSTQQTHRLNKNIVSPTNNQRTSVTSPEKEDRYY

Query:  TTRGSMGVDEKASMDHPTGNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKKRPRGLKA
        TTRGSMGVDEKAS++H TGNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKKRPRGLKA
Subjt:  TTRGSMGVDEKASMDHPTGNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKKRPRGLKA

Query:  MRSMDSDSE
        MRSMDSDSE
Subjt:  MRSMDSDSE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G55340.1 Protein of unknown function (DUF1639)4.9e-6161.08Show/hide
Query:  QKSNKSGETDFLLQWGNRKRMRYMKVKEPQRVTS--KPDCLGKKKVSSNGDRRVVAAEKASSTQQTHRLNKNIVSPTNNQRTSVTSPEKEDRYYTTRGSM
        Q+ + + E DF+LQWG RKR+R MKVK+ Q + +    DCL K+K+ S    R V++E+ S ++  +R NK   S  N +R+ + SPEKEDRYYTTRGSM
Subjt:  QKSNKSGETDFLLQWGNRKRMRYMKVKEPQRVTS--KPDCLGKKKVSSNGDRRVVAAEKASSTQQTHRLNKNIVSPTNNQRTSVTSPEKEDRYYTTRGSM

Query:  GVDEKASMDHPTGNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKKRPRGLKAMRSMDS
        G+DE   +      + K  VWP+LYI+LS+KEKEEDF+AMKGCKLPQRPKKRAKL+QK+L+LV PG+WLSDLC+ERYEVREKKTSKKRPRGLKAM SM+S
Subjt:  GVDEKASMDHPTGNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKKRPRGLKAMRSMDS

Query:  DSE
        DSE
Subjt:  DSE

AT1G55340.2 Protein of unknown function (DUF1639)3.5e-5959.61Show/hide
Query:  QKSNKSGETDFLLQWGNRKRMRYMKVKEPQRVTS--KPDCLGKKKVSSNGDRRVVAAEKASSTQQTHRLNKNIVSPTNNQRTSVTSPEKEDRYYTTRGSM
        Q+ + + E DF+LQWG RKR+R MKVK+ Q + +    DCL K+K+ S    R V++E+ S ++  +R NK+ ++          SPEKEDRYYTTRGSM
Subjt:  QKSNKSGETDFLLQWGNRKRMRYMKVKEPQRVTS--KPDCLGKKKVSSNGDRRVVAAEKASSTQQTHRLNKNIVSPTNNQRTSVTSPEKEDRYYTTRGSM

Query:  GVDEKASMDHPTGNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKKRPRGLKAMRSMDS
        G+DE   +      + K  VWP+LYI+LS+KEKEEDF+AMKGCKLPQRPKKRAKL+QK+L+LV PG+WLSDLC+ERYEVREKKTSKKRPRGLKAM SM+S
Subjt:  GVDEKASMDHPTGNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKKRPRGLKAMRSMDS

Query:  DSE
        DSE
Subjt:  DSE

AT3G03880.1 Protein of unknown function (DUF1639)7.1e-3644.33Show/hide
Query:  NKSGETDFLLQWGNRKRMRYMKVKEPQRVTSKPDCLGKKKVSSNGDRRVVAAEKASSTQQTHRLNKNIVSPTNNQRTSVTSPEKEDRYYTTRG---SMGV
        + S E +  LQWGN+KR+R ++ K  +   SK           N  R +        + +  R ++  +  +        SPEKE+RYYTTRG   ++G 
Subjt:  NKSGETDFLLQWGNRKRMRYMKVKEPQRVTSKPDCLGKKKVSSNGDRRVVAAEKASSTQQTHRLNKNIVSPTNNQRTSVTSPEKEDRYYTTRG---SMGV

Query:  D-EKASMDHPTGNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKK-RPRGLKAMRSMDS
        D    +++    N+++  +WP+L+I+LS+KEKEEDFMAMKGCK   RPKKRAKL+Q+SL+LV PG+WL+DLC +RY+VR KK+SKK R RGLKAM +M++
Subjt:  D-EKASMDHPTGNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKK-RPRGLKAMRSMDS

Query:  DSE
        DS+
Subjt:  DSE

AT4G20300.1 Protein of unknown function (DUF1639)3.2e-2067.16Show/hide
Query:  WPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKK
        WPR+YI+LS KEKEEDF+ MKG KLP RP+KRAK + K+L    PG WLSDL + RYEVREKK  KK
Subjt:  WPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKK

AT4G20300.2 Protein of unknown function (DUF1639)1.8e-2333.72Show/hide
Query:  DFLLQWGNRKRMRYMK--VKEPQRVTSKPD----CLGKKKVSSNGDRR----------------------------VVAAE-----------KASSTQQT
        D LLQWG RKR R  +  ++    +T+  D      G+ K+ SN  +R                            V+  E            A+ +   
Subjt:  DFLLQWGNRKRMRYMK--VKEPQRVTSKPD----CLGKKKVSSNGDRR----------------------------VVAAE-----------KASSTQQT

Query:  HRLNKNIVSPTNNQRTSVTSPEKEDRYYTTRGSM--GVD------------EKASMDHP----TGNDRKGF-VWPRLYISLSSKEKEEDFMAMKGCKLPQ
        + +N  ++S +   + S  SP++ ++  + R     G D            E  +  H      G   K    WPR+YI+LS KEKEEDF+ MKG KLP 
Subjt:  HRLNKNIVSPTNNQRTSVTSPEKEDRYYTTRGSM--GVD------------EKASMDHP----TGNDRKGF-VWPRLYISLSSKEKEEDFMAMKGCKLPQ

Query:  RPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKK--RPRGLKAMRSMDSDSE
        RP+KRAK + K+L    PG WLSDL + RYEVREKK  KK  + RGLK M +MD+DSE
Subjt:  RPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKK--RPRGLKAMRSMDSDSE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGCTTTTGGACATAGATATGGAGAACGAAGGAATAGCTCAGAAAAGCAATAAATCTGGTGAAACGGATTTTCTGTTGCAGTGGGGGAATAGGAAGAGGATGAGATA
TATGAAGGTCAAGGAACCACAAAGAGTTACTAGTAAACCCGATTGTTTAGGAAAGAAGAAAGTTTCTTCTAATGGCGATCGCCGTGTTGTTGCAGCTGAGAAAGCTTCAT
CTACTCAGCAAACCCATCGCCTTAACAAGAACATTGTTTCACCAACTAACAATCAAAGAACATCAGTAACATCACCCGAAAAGGAAGATCGATACTACACAACAAGAGGT
TCAATGGGGGTGGACGAAAAAGCCTCAATGGATCACCCCACAGGGAACGACCGAAAAGGGTTCGTCTGGCCAAGGCTCTATATATCTCTATCTAGCAAGGAGAAGGAAGA
AGATTTCATGGCCATGAAAGGGTGCAAACTTCCTCAAAGGCCTAAAAAGAGAGCCAAGTTGCTACAGAAAAGCTTAGTCCTGGTGATGCCAGGCTCATGGCTATCAGACT
TGTGCCAAGAGAGGTATGAAGTGAGGGAGAAGAAGACTTCAAAGAAGAGACCTAGAGGATTGAAGGCCATGAGAAGTATGGACAGTGACTCAGAATGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGCTTTTGGACATAGATATGGAGAACGAAGGAATAGCTCAGAAAAGCAATAAATCTGGTGAAACGGATTTTCTGTTGCAGTGGGGGAATAGGAAGAGGATGAGATA
TATGAAGGTCAAGGAACCACAAAGAGTTACTAGTAAACCCGATTGTTTAGGAAAGAAGAAAGTTTCTTCTAATGGCGATCGCCGTGTTGTTGCAGCTGAGAAAGCTTCAT
CTACTCAGCAAACCCATCGCCTTAACAAGAACATTGTTTCACCAACTAACAATCAAAGAACATCAGTAACATCACCCGAAAAGGAAGATCGATACTACACAACAAGAGGT
TCAATGGGGGTGGACGAAAAAGCCTCAATGGATCACCCCACAGGGAACGACCGAAAAGGGTTCGTCTGGCCAAGGCTCTATATATCTCTATCTAGCAAGGAGAAGGAAGA
AGATTTCATGGCCATGAAAGGGTGCAAACTTCCTCAAAGGCCTAAAAAGAGAGCCAAGTTGCTACAGAAAAGCTTAGTCCTGGTGATGCCAGGCTCATGGCTATCAGACT
TGTGCCAAGAGAGGTATGAAGTGAGGGAGAAGAAGACTTCAAAGAAGAGACCTAGAGGATTGAAGGCCATGAGAAGTATGGACAGTGACTCAGAATGA
Protein sequenceShow/hide protein sequence
MKLLDIDMENEGIAQKSNKSGETDFLLQWGNRKRMRYMKVKEPQRVTSKPDCLGKKKVSSNGDRRVVAAEKASSTQQTHRLNKNIVSPTNNQRTSVTSPEKEDRYYTTRG
SMGVDEKASMDHPTGNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKKRPRGLKAMRSMDSDSE