; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC05G092060 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC05G092060
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
Description50S ribosomal protein L33-like
Genome locationCicolChr05:9778202..9784568
RNA-Seq ExpressionCcUC05G092060
SyntenyCcUC05G092060
Gene Ontology termsGO:0006412 - translation (biological process)
GO:0005840 - ribosome (cellular component)
GO:0009536 - plastid (cellular component)
GO:0003735 - structural constituent of ribosome (molecular function)
InterPro domainsIPR001705 - Ribosomal protein L33
IPR011332 - Zinc-binding ribosomal protein
IPR018264 - Ribosomal protein L33, conserved site
IPR038584 - Ribosomal protein L33 superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0058825.1 50S ribosomal protein L33-like [Cucumis melo var. makuwa]1.3e-4382.64Show/hide
Query:  PSQCQRRNLTFGGLSKFSGSLSQRAWERSIRYGSVICMAKRYAPETTKRKRLSRKRGGDPDKKKKTRRKGGKKDFKIVRLSSTARTGFFYAKKKSRKMAD
        PS     +LT   +    G L QRA ERSIRYGSVICMAKRYAP+TTKRKRLSRKRGGDP KKKKTRRKGG+KDFKIVRLSSTARTGFFYAKKKSRKMAD
Subjt:  PSQCQRRNLTFGGLSKFSGSLSQRAWERSIRYGSVICMAKRYAPETTKRKRLSRKRGGDPDKKKKTRRKGGKKDFKIVRLSSTARTGFFYAKKKSRKMAD

Query:  KIELQKYDPIANRHVLFTEVK
        KIE+QKYDPIANRHVLFTEVK
Subjt:  KIELQKYDPIANRHVLFTEVK

KAE8650541.1 hypothetical protein Csa_009952 [Cucumis sativus]3.8e-4383.47Show/hide
Query:  PSQCQRRNLTFGGLSKFSGSLSQRAWERSIRYGSVICMAKRYAPETTKRKRLSRKRGGDPDKKKKTRRKGGKKDFKIVRLSSTARTGFFYAKKKSRKMAD
        PS     +LT   L    G LSQRA ERSIRY SVICMAKRYAPETTKRKRLSRKRGGDP KKKKTRRKGG KDFKIVRLSSTARTGFFYAKKKSRK+AD
Subjt:  PSQCQRRNLTFGGLSKFSGSLSQRAWERSIRYGSVICMAKRYAPETTKRKRLSRKRGGDPDKKKKTRRKGGKKDFKIVRLSSTARTGFFYAKKKSRKMAD

Query:  KIELQKYDPIANRHVLFTEVK
        KIE+QKYDPIANRHVLFTEVK
Subjt:  KIELQKYDPIANRHVLFTEVK

XP_008456021.1 PREDICTED: uncharacterized protein LOC103496073 [Cucumis melo]1.3e-4382.64Show/hide
Query:  PSQCQRRNLTFGGLSKFSGSLSQRAWERSIRYGSVICMAKRYAPETTKRKRLSRKRGGDPDKKKKTRRKGGKKDFKIVRLSSTARTGFFYAKKKSRKMAD
        PS     +LT   +    G L QRA ERSIRYGSVICMAKRYAP+TTKRKRLSRKRGGDP KKKKTRRKGG+KDFKIVRLSSTARTGFFYAKKKSRKMAD
Subjt:  PSQCQRRNLTFGGLSKFSGSLSQRAWERSIRYGSVICMAKRYAPETTKRKRLSRKRGGDPDKKKKTRRKGGKKDFKIVRLSSTARTGFFYAKKKSRKMAD

Query:  KIELQKYDPIANRHVLFTEVK
        KIE+QKYDPIANRHVLFTEVK
Subjt:  KIELQKYDPIANRHVLFTEVK

XP_022142796.1 uncharacterized protein LOC111012832 [Momordica charantia]9.2e-4282.76Show/hide
Query:  RRNLTFGGLSKFSGSLSQRAWERSIRYGSVICMAKRYAPETTKRKRLSRKRGGDPDKKKKTRRKGGKKDFKIVRLSSTARTGFFYAKKKSRKMADKIELQ
        RRNL FGGL KFSGSLS  A ERSIR GSVICMAKRYAP TTKRKRLSRKRGGDP KKKKTRRKG K+DFKIVRLSS A TGFFY K+KS+ MADKI+LQ
Subjt:  RRNLTFGGLSKFSGSLSQRAWERSIRYGSVICMAKRYAPETTKRKRLSRKRGGDPDKKKKTRRKGGKKDFKIVRLSSTARTGFFYAKKKSRKMADKIELQ

Query:  KYDPIANRHVLFTEVK
        KYDP+  RHVLFTEVK
Subjt:  KYDPIANRHVLFTEVK

XP_038880588.1 uncharacterized protein LOC120072240 [Benincasa hispida]8.4e-4368.83Show/hide
Query:  LRHFQLSLLLCCP-----LAERPSQCQRRNLTFGGLS-----------KFSGSLSQRAWERSIRYGSVICMAKRYAPETTKRKRLSRKRGGDPDKKKKTR
        ++ F L L + C      L++  S      +TF  LS              GSLSQRA ERSIRY SVICMAKRYAP+TTKRKRLSRKRGGDP KKKKTR
Subjt:  LRHFQLSLLLCCP-----LAERPSQCQRRNLTFGGLS-----------KFSGSLSQRAWERSIRYGSVICMAKRYAPETTKRKRLSRKRGGDPDKKKKTR

Query:  RKGGKKDFKIVRLSSTARTGFFYAKKKSRKMADKIELQKYDPIANRHVLFTEVK
        RKGGKKDFKIVRLSSTARTGFFYAKKKSRKMADK+EL+KYDPIANRHVLF EVK
Subjt:  RKGGKKDFKIVRLSSTARTGFFYAKKKSRKMADKIELQKYDPIANRHVLFTEVK

TrEMBL top hitse value%identityAlignment
A0A0A0LAF8 Uncharacterized protein1.3e-4194.06Show/hide
Query:  LSQRAWERSIRYGSVICMAKRYAPETTKRKRLSRKRGGDPDKKKKTRRKGGKKDFKIVRLSSTARTGFFYAKKKSRKMADKIELQKYDPIANRHVLFTEV
        LSQRA ERSIRY SVICMAKRYAPETTKRKRLSRKRGGDP KKKKTRRKGG KDFKIVRLSSTARTGFFYAKKKSRK+ADKIE+QKYDPIANRHVLFTEV
Subjt:  LSQRAWERSIRYGSVICMAKRYAPETTKRKRLSRKRGGDPDKKKKTRRKGGKKDFKIVRLSSTARTGFFYAKKKSRKMADKIELQKYDPIANRHVLFTEV

Query:  K
        K
Subjt:  K

A0A1S3C284 uncharacterized protein LOC1034960736.2e-4482.64Show/hide
Query:  PSQCQRRNLTFGGLSKFSGSLSQRAWERSIRYGSVICMAKRYAPETTKRKRLSRKRGGDPDKKKKTRRKGGKKDFKIVRLSSTARTGFFYAKKKSRKMAD
        PS     +LT   +    G L QRA ERSIRYGSVICMAKRYAP+TTKRKRLSRKRGGDP KKKKTRRKGG+KDFKIVRLSSTARTGFFYAKKKSRKMAD
Subjt:  PSQCQRRNLTFGGLSKFSGSLSQRAWERSIRYGSVICMAKRYAPETTKRKRLSRKRGGDPDKKKKTRRKGGKKDFKIVRLSSTARTGFFYAKKKSRKMAD

Query:  KIELQKYDPIANRHVLFTEVK
        KIE+QKYDPIANRHVLFTEVK
Subjt:  KIELQKYDPIANRHVLFTEVK

A0A5A7UZ21 50S ribosomal protein L33-like6.2e-4482.64Show/hide
Query:  PSQCQRRNLTFGGLSKFSGSLSQRAWERSIRYGSVICMAKRYAPETTKRKRLSRKRGGDPDKKKKTRRKGGKKDFKIVRLSSTARTGFFYAKKKSRKMAD
        PS     +LT   +    G L QRA ERSIRYGSVICMAKRYAP+TTKRKRLSRKRGGDP KKKKTRRKGG+KDFKIVRLSSTARTGFFYAKKKSRKMAD
Subjt:  PSQCQRRNLTFGGLSKFSGSLSQRAWERSIRYGSVICMAKRYAPETTKRKRLSRKRGGDPDKKKKTRRKGGKKDFKIVRLSSTARTGFFYAKKKSRKMAD

Query:  KIELQKYDPIANRHVLFTEVK
        KIE+QKYDPIANRHVLFTEVK
Subjt:  KIELQKYDPIANRHVLFTEVK

A0A6J1CN96 uncharacterized protein LOC1110128324.5e-4282.76Show/hide
Query:  RRNLTFGGLSKFSGSLSQRAWERSIRYGSVICMAKRYAPETTKRKRLSRKRGGDPDKKKKTRRKGGKKDFKIVRLSSTARTGFFYAKKKSRKMADKIELQ
        RRNL FGGL KFSGSLS  A ERSIR GSVICMAKRYAP TTKRKRLSRKRGGDP KKKKTRRKG K+DFKIVRLSS A TGFFY K+KS+ MADKI+LQ
Subjt:  RRNLTFGGLSKFSGSLSQRAWERSIRYGSVICMAKRYAPETTKRKRLSRKRGGDPDKKKKTRRKGGKKDFKIVRLSSTARTGFFYAKKKSRKMADKIELQ

Query:  KYDPIANRHVLFTEVK
        KYDP+  RHVLFTEVK
Subjt:  KYDPIANRHVLFTEVK

A0A6J1E4T6 uncharacterized protein LOC111430611 isoform X21.6e-3987.25Show/hide
Query:  SLSQRAWERSIRYGSVICMAKRYAPETTKRKRLSRKRGGDPDKKKKTRRKGGKKDFKIVRLSSTARTGFFYAKKKSRKMADKIELQKYDPIANRHVLFTE
        SLSQ A ERSIR+GSVICMA+RYAP  TKRKRLSRKRGGDP+KKKKTRRKGG+KDFKI+RLSSTA TGFFYAKKKSRKMADKIELQK+DP+A RHVLFTE
Subjt:  SLSQRAWERSIRYGSVICMAKRYAPETTKRKRLSRKRGGDPDKKKKTRRKGGKKDFKIVRLSSTARTGFFYAKKKSRKMADKIELQKYDPIANRHVLFTE

Query:  VK
        VK
Subjt:  VK

SwissProt top hitse value%identityAlignment
B6IN38 50S ribosomal protein L333.4e-0750.98Show/hide
Query:  KKDFKIVRLSSTARTGFFYAKKKS-RKMADKIELQKYDPIANRHVLFTEVK
        K++  +++L S+A TGFFY KKK+ RK  +K+E +KYDP+A +HV+F E K
Subjt:  KKDFKIVRLSSTARTGFFYAKKKS-RKMADKIELQKYDPIANRHVLFTEVK

Q2W1A2 50S ribosomal protein L331.7e-0654.35Show/hide
Query:  IVRLSSTARTGFFY-AKKKSRKMADKIELQKYDPIANRHVLFTEVK
        +++L STA TGFFY AKK  RK  +K+E +KYDP+  +HV F E K
Subjt:  IVRLSSTARTGFFY-AKKKSRKMADKIELQKYDPIANRHVLFTEVK

Q3YSN4 50S ribosomal protein L337.7e-0750Show/hide
Query:  IVRLSSTARTGFFYAKKKS-RKMADKIELQKYDPIANRHVLFTEVK
        +V+L+S+A+TGFFY KK++ +K+ +K+  +KYDP+  +HVLF+E K
Subjt:  IVRLSSTARTGFFYAKKKS-RKMADKIELQKYDPIANRHVLFTEVK

Q5FFJ4 50S ribosomal protein L335.9e-0752.17Show/hide
Query:  IVRLSSTARTGFFYAKKKS-RKMADKIELQKYDPIANRHVLFTEVK
        +V+L+S+A TG+FY KK++ +K+ +K+  +KYDP+A +HVLFTE K
Subjt:  IVRLSSTARTGFFYAKKKS-RKMADKIELQKYDPIANRHVLFTEVK

Q5HBV6 50S ribosomal protein L335.9e-0752.17Show/hide
Query:  IVRLSSTARTGFFYAKKKS-RKMADKIELQKYDPIANRHVLFTEVK
        +V+L+S+A TG+FY KK++ +K+ +K+  +KYDP+A +HVLFTE K
Subjt:  IVRLSSTARTGFFYAKKKS-RKMADKIELQKYDPIANRHVLFTEVK

Arabidopsis top hitse value%identityAlignment
AT3G06320.1 Ribosomal protein L33 family protein7.6e-1060.78Show/hide
Query:  KKDFKIVRLSSTARTGFFYAKKKSRK-MADKIELQKYDPIANRHVLFTEVK
        KK F  +RL S A TGFFY K+KS K + +K+E +KYDP  NRHVLFTE K
Subjt:  KKDFKIVRLSSTARTGFFYAKKKSRK-MADKIELQKYDPIANRHVLFTEVK

AT5G18790.1 Ribosomal protein L33 family protein7.6e-1060.78Show/hide
Query:  KKDFKIVRLSSTARTGFFYAKKKSRK-MADKIELQKYDPIANRHVLFTEVK
        KK F  +RL S A TGFFY K+KS K + +K+E +KYDP  NRHVLFTE K
Subjt:  KKDFKIVRLSSTARTGFFYAKKKSRK-MADKIELQKYDPIANRHVLFTEVK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGAGTATTAAAAAGGAAGAAGGGTTCATTTTCATGGAAGATGATGGCGACAGCAATTCAAAGGTTTTGTTTGGATTTGGCGCTTTCTTGTGGAAAAAGGAACCGAT
TTCTCAGTGGTTCTGCGTCTTCAGCTCCTTCAAACTTCGTCACTTTCAACTCTCTCTCCTTCTTTGCTGCCCTCTCGCTGAGCGTCCTTCCCAATGTCAAAGGAGAAATT
TAACATTTGGGGGCCTGTCGAAATTTTCAGGATCGTTGTCACAGAGGGCTTGGGAGAGATCAATTCGTTATGGATCCGTAATTTGTATGGCCAAAAGATATGCTCCTGAA
ACTACAAAGAGAAAGAGGTTAAGTAGGAAGAGGGGCGGTGATCCAGACAAGAAAAAGAAGACAAGGAGAAAAGGAGGAAAGAAAGATTTCAAAATAGTCAGGCTCTCCTC
AACTGCTAGGACTGGCTTTTTCTATGCCAAGAAGAAAAGCAGGAAGATGGCTGATAAGATTGAGCTCCAGAAATATGACCCTATTGCAAATCGCCATGTTCTGTTCACTG
AAGTCAAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGAGTATTAAAAAGGAAGAAGGGTTCATTTTCATGGAAGATGATGGCGACAGCAATTCAAAGGTTTTGTTTGGATTTGGCGCTTTCTTGTGGAAAAAGGAACCGAT
TTCTCAGTGGTTCTGCGTCTTCAGCTCCTTCAAACTTCGTCACTTTCAACTCTCTCTCCTTCTTTGCTGCCCTCTCGCTGAGCGTCCTTCCCAATGTCAAAGGAGAAATT
TAACATTTGGGGGCCTGTCGAAATTTTCAGGATCGTTGTCACAGAGGGCTTGGGAGAGATCAATTCGTTATGGATCCGTAATTTGTATGGCCAAAAGATATGCTCCTGAA
ACTACAAAGAGAAAGAGGTTAAGTAGGAAGAGGGGCGGTGATCCAGACAAGAAAAAGAAGACAAGGAGAAAAGGAGGAAAGAAAGATTTCAAAATAGTCAGGCTCTCCTC
AACTGCTAGGACTGGCTTTTTCTATGCCAAGAAGAAAAGCAGGAAGATGGCTGATAAGATTGAGCTCCAGAAATATGACCCTATTGCAAATCGCCATGTTCTGTTCACTG
AAGTCAAGTGAGTCAGGTTGTTCCCCTGTGATGGGTATAAATAAAAGATAGGTAGATAACTGGATGGATAGATAAATAGGTAAAACTTTGTTATTATTCAGTATGGATGC
TTACTGATTGCATGAACCTCACATTTTGAATCGAAAGCTGAAAGTTAGGGAAGATATTATTGGATGCTGAGTACATTGCCATATTTTGGAACATATGATCGAAGAAAATA
TTAAAGAAACTTTTGGAGGCCGTATTAAAAAAGAATCAATCTTTGGTAGTGGTCAGTCCACTCTTGGATTTTGTTGGGGGTCATAAAAGGTCGAGTATTTTGAGTTATTG
CTACTTTGATTATAGTAATCTTTTGTGCTAGTTGAAATGTGAATTGTTGACAATTGAACTTGGTCTGAAAGATCCATTGCTGTCAAAATGAAATCTATGAGTCAAAAAAC
ATCTATTTCTAAAATTGACCGAAATGAACTGGTTTAGTAGGCACCTTCTGAACATGTCTATTCTAAATAGTGAAACATGTTAAAGAAAAAACTTCGCTTTGGACTCGGTT
ACAACTGTAAGTTAAAAAAAAAAAGGTCACTAAGTTATGAACAGCCTCCATCTATTCTACCTCCATCATTTGATGGGGTTGCGTTGTGCTTCATAGACTTCAATGGTTTT
GCTATACACAATGTAGTTTGATATGAGGGATGAAATGCAAACCAAGCATTTAGTGTAGAAAAACCTACAACGTAACACCATCTAGCCATGAAGAAACTTCTTCAATCGAA
GGTCTCTTCCTTGGATCTTGGTCTATGCATTTACAAGTAATACCAAGAACCTCTAAAATCTGTTTTTTGCTATTAGTGTCCCAAATTGCAGGATCAATAATCTCCTCTTC
TCTCTTCTCTGATTTTTTTTGAATCACCCAAGACACCAGATCCCTGCAGGCTTTACCTTTGCATACCTCTACAGGCCTCCGACCAGTAAGAAGCTCAAGAAGAACAACAC
CAAAACTATAAACATCACCCCTGCAAGTTGCAGTTAATGTCTGGCTGTATTCAGGAGGAATATAACCCAAAGTCCCAACCAAGTCCGTGGTAACATGAGTGTCATATGGC
CGGAGTAATCTTGAAAGACCAAAATCAGCTAAATGAGCCTCAAATCTGTCATCTAGAAGTATGTTGCTTGATTTTACATCTCGGTGGATTATGTTGGGCTGACATTCCTT
ATGCAATAACCTTGAAGAGAAACAAGATTCTTATGCTGTGCTCGAGAAAGAGCTTCTACTTCGGCTTGGAATTCCCGTTCCATCTGACCACAATCTCCTGTAAGCCTCTT
GACTGCAGCTTTTGAACCATTGGGAAGGCTGGCTTTGTAAACCAGGCCAAATCCACCACAGCCTATTATGTTCGCTTGGTTGAAATTGCAAGTAGCTTTTAACAACTCTG
CAACTGTAAGATCCTTGCATTCTGAATTCTGAAAGAGCACCAACTTTGACGATCCAAGAGCTCCAGATAACCTGTCAGGCCTGTCAAATTCCTCATCAAACCTGTTATTT
CGTCGATCTCCAACATCTTTCCTGGACATTTTGAGCAAAACCACTGTAAGGAGCAGGAGGACTGCTGCAGCTGCACCGACTGTAAGGCAAAGAATGATGCTTCTATTAAC
TTTTCTCTTTGAGTAATTGTTGGTTTCGGGTTTTGTTTCTAACCCATCAACAGAATGACAAGGGTTGTCAATTTCCCCACAAAGTCCTGTATTACCATCAAAGCAGGAGC
TGGGAAAGCTTAGGAATTGGCCTCCATTGGGAATTGGTCCCTCCAAGTGATTATTTGCCACGCTAAACTTTGATAAGAAGGTGAGCTTGTTGAGTGATGGTGGAATTTGT
CCATAAAGATCATTATTCGACAAATCTAATGTTTCCAAGTTCTCCATCTCTGATATGGTGCTCGGTATGGACCCAGTAATATTATTCCTACTCAAATCCAAGACATGCAG
CCATTTTAATCTGCCAATTTCCGGAAAAATAGTTCCATTAATTCTGTTGTAGCTCAAGTAAATCGATGGAGGAAAGCTTGACGCTTGGTTGTATTGCAAACCAGTAGCAC
TTTGATTTCGTTTGACAAAAAGGGGGATTCCAGCGGATGATGTCGAACCCGACAAGCTGCCGTTTCTGGAAATTAGTGCTTTCATCTGTGTCAAGCTTTTTGGGATTTCT
CCTGTCAGAGAATTGTTGGATAAGTCCAAATAAAACAAATTTTCCAGCTGACCAATCCAAGCAGGGATACTTCCATTTAAGTGATTCCAAGACAAGTCGAGGATGCTTAA
TTTCTTGCAACCTATTAACCAACCTGGAATTTGTCCTCTCAGACCACAGTTACCAAATGCCAAAAGCATCAAGTTGTTGAAAACAGTTTCACTCTGTGGAAGTACCTCAT
TACGAAAGTTCTTTGTAAGAATAAGAACAGTCAGATTTCTGCAATTTTGTAAAATAGATAGTGCACCACTCAAGTCTATGATACTGTTGTTTGACAAAGATAGAAAGGAC
AGAGAGGAAAGCTTTGCATAATTTCGAGGAATTTGACCAGTTAACTTGTTTTTGGCAAGGCTAAGAGTTTTCAGTTCATGGCAATCGGATAGAGAATTTGGTAGGGGCCC
AGAAAAGTGATTCGAGGCTAGATCAAGCATTTGAAGATTTGGTAGCCTAGAAAAATTGAGATCTATAGTACCAGTTAAAGAATTATTTCTAAGATCAAACACTCTGAGCT
TTGAGCACAATGACAGAGATGAAGGCAATAATCCAGAGAACAAGTTGGAGTGTGCAACTAACTCCTGTAGTTCTGAAAAATTACCAAACACATTTGGAAGTTCACCAGAA
AATTTGTTTCCAAATACTATAAAGGATTTAAGCCTAGAAAGCTTACTCAGTTCCATGCTTAATTGGCCAAAGAAACTGTTCCCAGGGATTGAGAAATACTCCATGGATGA
TAATGAATACAACGAATCTGGCAGATGGCCAGTAAGGAAGTTACTGTCTGCACGAAAATGTTTGAGAGATTTGCTGCAATTGTCCAAACCTTGAAGATTTCCAGATATTC
GGTTCAGTGAAATATCAACAACCTGAATCATGATGGAGGAATTGCAAATCTGTGACGCTAACCGGCCCGTGAAAGAATTGTTACTTATGTTGAAGGCAACAAGATTTTGA
AAACCCACAAGCTGGGGAAAATCCCCAATGAATGAATTGCTTGAAATATTCAGAACGTGAACAGATTTCAATCCTGAAGTTGCATTGGTGACTGGTCCTGACAGTTTATT
GTAACTCAAATCCAAAACCTGGAGTTGCTTCAAGCTTGAGAGTTCTGTTGGCAATACACCTTCAAGCTGATTGTATGATAGATTCAGCCAAATTAACTGATCTAAACCAC
CAAGTGATTGTGAAATCTTGCCTTTCAGATTCATATTAGGCAGTTCTAACCTGGTAACTCTGTTAGTGCGCGAGCTGTTGCCATCATATCCACAATCCACACCATCCCAA
TTGCAGCAGTTGGATTCATTTAACCATACAGAGAGAACAGAACTATTTGCAAGGCTATTCACAAAGCCCCTCAATGCCAGTAAATCTTTTGAATCACAGACTTTATTAGT
CTGTTTCAGAGCCAGCGAGAAAGAAAGCAAGCAACTCAATAATATCCATTTGAGGAAAGACGGTTTGACAAGATTAACCACCATCACCTCTGAGACCAGTAGAGATCTCA
TGTTGAAATTAAAAAGCAGAGCCACTGTAAAACTCAGTCAAAACTCCATCTGGTCATACTCAGCTTTTCAGAAATGAAAGGAATCCAAATAAAGAAATCTTGCAAACTCT
AGTTTGAGGAGGAGGAGAATCCCAAGTTGGGCAACTTGCAAAAGAAGCAAAAAGAGGAAAGGGAGAAGATAAATGAAGAAAGAAAAAGACCGTTCACCTGGCAGCAACAC
CAGAAAGCCACAGAAACAGAGAGGTATTCAAAAATGAAGAAACATGGAGAAAGCCAAGGCGATCTCAGATCCCGAAGACAAGAGTAGGAGAAGAAGCTTGAAAAGGAAGA
AAGAAGCAAATAAAGAAATGATTTCCCAGTTTCCCAGTAGCAACTCCAGTGGAGATCATGACTTACAAATGAAGAGCCCAAAAGAGGGGAAAAAGATTTCTAGAGTGGGT
GTTGAACAGAGGAGAGGCAGGCAGATCAGAGGGGGAGAGGAGTGTACTTTACAATATATATAATTGGAGAGGAGGCAAGTAAGTAAGGGATTGGGTGAGTATTTGTTTTT
CCATATAAAGACAGAGAGAGCTGACTTGAGGAAAGTTTGTGTGACTTTTTGGAGCACCTACGACGTTGATCTGCTTCTTGTTTCTCTCTCTCTAGAAGAAATTGCAGAGA
AAAAGAGGAAAGTTGAGAGTGGAGTTGGAAAAGTTGGGAGCGCAGCTCAAGGACCGATGTGAGTTTTCTATTCTTCACGCCTACTCCATCTTTCATGCACGTGTTCACAC
CAATTATTTCTCTTTTAATCTTTTTGCTTTCTAAAATAAAAAAGAATACTAAAATACCCCAAATACTTTTTAGTTTTTATTTTTAATGAACTTTAAATTTAGTCTCTTGC
AGCTCATTTTTGT
Protein sequenceShow/hide protein sequence
MKSIKKEEGFIFMEDDGDSNSKVLFGFGAFLWKKEPISQWFCVFSSFKLRHFQLSLLLCCPLAERPSQCQRRNLTFGGLSKFSGSLSQRAWERSIRYGSVICMAKRYAPE
TTKRKRLSRKRGGDPDKKKKTRRKGGKKDFKIVRLSSTARTGFFYAKKKSRKMADKIELQKYDPIANRHVLFTEVK