; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC09g1664 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC09g1664
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description50S ribosomal protein L18
Genome locationMC09:22097300..22104040
RNA-Seq ExpressionMC09g1664
SyntenyMC09g1664
Gene Ontology termsGO:0006412 - translation (biological process)
GO:0005737 - cytoplasm (cellular component)
GO:0005840 - ribosome (cellular component)
GO:0003735 - structural constituent of ribosome (molecular function)
GO:0008097 - 5S rRNA binding (molecular function)
InterPro domainsIPR005484 - Ribosomal protein L18


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022151144.1 uncharacterized protein LOC111019142 [Momordica charantia]2.77e-128100Show/hide
Query:  MPLSLLNTLDFQKVYGGMPLVNLPGLLSKSSTPFGFQSKPFNFPSLKSKNCDLKPLIIQARGHSKPESAKIRNRRMQKKYNGTPKRPRLSVFCSDKQLYA
        MPLSLLNTLDFQKVYGGMPLVNLPGLLSKSSTPFGFQSKPFNFPSLKSKNCDLKPLIIQARGHSKPESAKIRNRRMQKKYNGTPKRPRLSVFCSDKQLYA
Subjt:  MPLSLLNTLDFQKVYGGMPLVNLPGLLSKSSTPFGFQSKPFNFPSLKSKNCDLKPLIIQARGHSKPESAKIRNRRMQKKYNGTPKRPRLSVFCSDKQLYA

Query:  MLVDDQNKKCLFYGSTLQKSMRPNPSCTTIEAAQYVGEELVKTCEDLNIHEISSYDRNGFARGEKMQAFEIAISHYGFLQR
        MLVDDQNKKCLFYGSTLQKSMRPNPSCTTIEAAQYVGEELVKTCEDLNIHEISSYDRNGFARGEKMQAFEIAISHYGFLQR
Subjt:  MLVDDQNKKCLFYGSTLQKSMRPNPSCTTIEAAQYVGEELVKTCEDLNIHEISSYDRNGFARGEKMQAFEIAISHYGFLQR

XP_022987454.1 uncharacterized protein LOC111484997 [Cucurbita maxima]2.05e-10583.43Show/hide
Query:  MPLSLLNTLDFQKVYGGMPLVNLPGLLSKSSTPFGFQSKPFNFPSLKSKNCDLKPLIIQARGHSKPESAKIRNRRMQKKYNGTPKRPRLSVFCSDKQLYA
        MP   LNT++F+KV GGM  +N+PGLLSK+STPF FQSKPFNFPSLK ++CD+KPL+IQARG+SK ESAK+RNRR+QKKYNG+  +PRLSVFCSDKQLYA
Subjt:  MPLSLLNTLDFQKVYGGMPLVNLPGLLSKSSTPFGFQSKPFNFPSLKSKNCDLKPLIIQARGHSKPESAKIRNRRMQKKYNGTPKRPRLSVFCSDKQLYA

Query:  MLVDDQNKKCLFYGSTLQKSMRPNPSCTTIEAAQYVGEELVKTCEDLNIHEISSYDRNGFARGEKMQAFEIAISHYGFLQR
        MLVDDQNKKCLFYGSTLQKSMRP+PSCTTIEAAQ+VGEELVK CEDLNIHEISSYDRNGFARGEKMQAFEIAIS YGFLQR
Subjt:  MLVDDQNKKCLFYGSTLQKSMRPNPSCTTIEAAQYVGEELVKTCEDLNIHEISSYDRNGFARGEKMQAFEIAISHYGFLQR

XP_023547360.1 uncharacterized protein LOC111806273 [Cucurbita pepo subsp. pepo]8.36e-10583.43Show/hide
Query:  MPLSLLNTLDFQKVYGGMPLVNLPGLLSKSSTPFGFQSKPFNFPSLKSKNCDLKPLIIQARGHSKPESAKIRNRRMQKKYNGTPKRPRLSVFCSDKQLYA
        MP   LNT++F+KV GGM  +N+PGLLSK+STPF FQSKPFNFPSLK ++CD+KPL+IQARG+SK ESAK+RNRR QKKYNG+  +PRLSVFCSDKQLYA
Subjt:  MPLSLLNTLDFQKVYGGMPLVNLPGLLSKSSTPFGFQSKPFNFPSLKSKNCDLKPLIIQARGHSKPESAKIRNRRMQKKYNGTPKRPRLSVFCSDKQLYA

Query:  MLVDDQNKKCLFYGSTLQKSMRPNPSCTTIEAAQYVGEELVKTCEDLNIHEISSYDRNGFARGEKMQAFEIAISHYGFLQR
        MLVDDQNKKCLFYGSTLQKSMRP+PSCTTIEAAQ+VGEELVK CEDLNIHEISSYDRNGFARGEKMQAFEIAIS YGFLQR
Subjt:  MLVDDQNKKCLFYGSTLQKSMRPNPSCTTIEAAQYVGEELVKTCEDLNIHEISSYDRNGFARGEKMQAFEIAISHYGFLQR

XP_038891854.1 50S ribosomal protein L18 isoform X1 [Benincasa hispida]1.83e-10786.19Show/hide
Query:  MPLSLLNTLDFQKVYGGMPLVNLPGLLSKSSTPFGFQSKPFNFPSLKSKNCDLKPLIIQARGHSKPESAKIRNRRMQKKYNGTPKRPRLSVFCSDKQLYA
        MP  LLNTL+ +KVYGGM  +N PGLLSK+STPFG QSKPFNFPSLK++NCD+KPL IQARG+SK ESAKI NRR+QKKYNGTP RPRLSVFCSDKQLYA
Subjt:  MPLSLLNTLDFQKVYGGMPLVNLPGLLSKSSTPFGFQSKPFNFPSLKSKNCDLKPLIIQARGHSKPESAKIRNRRMQKKYNGTPKRPRLSVFCSDKQLYA

Query:  MLVDDQNKKCLFYGSTLQKSMRPNPSCTTIEAAQYVGEELVKTCEDLNIHEISSYDRNGFARGEKMQAFEIAISHYGFLQR
        MLVDDQNKKCLFYGSTLQKSMRP+PSCTTIEAAQ+VGEELVK CEDLNIHEISSYDRNGFARGEKMQAFEIAIS YG LQR
Subjt:  MLVDDQNKKCLFYGSTLQKSMRPNPSCTTIEAAQYVGEELVKTCEDLNIHEISSYDRNGFARGEKMQAFEIAISHYGFLQR

XP_038891856.1 50S ribosomal protein L18 isoform X2 [Benincasa hispida]3.71e-10886.19Show/hide
Query:  MPLSLLNTLDFQKVYGGMPLVNLPGLLSKSSTPFGFQSKPFNFPSLKSKNCDLKPLIIQARGHSKPESAKIRNRRMQKKYNGTPKRPRLSVFCSDKQLYA
        MP  LLNTL+ +KVYGGM  +N PGLLSK+STPFG QSKPFNFPSLK++NCD+KPL IQARG+SK ESAKI NRR+QKKYNGTP RPRLSVFCSDKQLYA
Subjt:  MPLSLLNTLDFQKVYGGMPLVNLPGLLSKSSTPFGFQSKPFNFPSLKSKNCDLKPLIIQARGHSKPESAKIRNRRMQKKYNGTPKRPRLSVFCSDKQLYA

Query:  MLVDDQNKKCLFYGSTLQKSMRPNPSCTTIEAAQYVGEELVKTCEDLNIHEISSYDRNGFARGEKMQAFEIAISHYGFLQR
        MLVDDQNKKCLFYGSTLQKSMRP+PSCTTIEAAQ+VGEELVK CEDLNIHEISSYDRNGFARGEKMQAFEIAIS YG LQR
Subjt:  MLVDDQNKKCLFYGSTLQKSMRPNPSCTTIEAAQYVGEELVKTCEDLNIHEISSYDRNGFARGEKMQAFEIAISHYGFLQR

TrEMBL top hitse value%identityAlignment
A0A1S3BEE0 50S ribosomal protein L187.16e-9981.92Show/hide
Query:  LLNTLDFQKVYGGMPLVNLPGLLSKSSTPFGFQSKPFNFPSLKSKNCDLKPLIIQARGHSKPESAKIRNRRMQKKYNGTPKRPRLSVFCSDKQLYAMLVD
        LLNT +F+KV+GG+   N+ GLL K+S PFG QSK F+F SLK+K+CD+KPL+IQARG+SK ESAK+RNRR+QKKYNGTP RPRLSVFCSDKQLYAMLVD
Subjt:  LLNTLDFQKVYGGMPLVNLPGLLSKSSTPFGFQSKPFNFPSLKSKNCDLKPLIIQARGHSKPESAKIRNRRMQKKYNGTPKRPRLSVFCSDKQLYAMLVD

Query:  DQNKKCLFYGSTLQKSMRPNPSCTTIEAAQYVGEELVKTCEDLNIHEISSYDRNGFARGEKMQAFEIAISHYGFLQR
        DQNKKCLFYGSTLQKSMRP  SCTTIEAAQ+VGEELVK C DLNIHEISSYDRNGFARGEKMQAFEIAIS YGFLQR
Subjt:  DQNKKCLFYGSTLQKSMRPNPSCTTIEAAQYVGEELVKTCEDLNIHEISSYDRNGFARGEKMQAFEIAISHYGFLQR

A0A5D3D244 50S ribosomal protein L187.16e-9981.92Show/hide
Query:  LLNTLDFQKVYGGMPLVNLPGLLSKSSTPFGFQSKPFNFPSLKSKNCDLKPLIIQARGHSKPESAKIRNRRMQKKYNGTPKRPRLSVFCSDKQLYAMLVD
        LLNT +F+KV+GG+   N+ GLL K+S PFG QSK F+F SLK+K+CD+KPL+IQARG+SK ESAK+RNRR+QKKYNGTP RPRLSVFCSDKQLYAMLVD
Subjt:  LLNTLDFQKVYGGMPLVNLPGLLSKSSTPFGFQSKPFNFPSLKSKNCDLKPLIIQARGHSKPESAKIRNRRMQKKYNGTPKRPRLSVFCSDKQLYAMLVD

Query:  DQNKKCLFYGSTLQKSMRPNPSCTTIEAAQYVGEELVKTCEDLNIHEISSYDRNGFARGEKMQAFEIAISHYGFLQR
        DQNKKCLFYGSTLQKSMRP  SCTTIEAAQ+VGEELVK C DLNIHEISSYDRNGFARGEKMQAFEIAIS YGFLQR
Subjt:  DQNKKCLFYGSTLQKSMRPNPSCTTIEAAQYVGEELVKTCEDLNIHEISSYDRNGFARGEKMQAFEIAISHYGFLQR

A0A6J1DC55 uncharacterized protein LOC1110191421.34e-128100Show/hide
Query:  MPLSLLNTLDFQKVYGGMPLVNLPGLLSKSSTPFGFQSKPFNFPSLKSKNCDLKPLIIQARGHSKPESAKIRNRRMQKKYNGTPKRPRLSVFCSDKQLYA
        MPLSLLNTLDFQKVYGGMPLVNLPGLLSKSSTPFGFQSKPFNFPSLKSKNCDLKPLIIQARGHSKPESAKIRNRRMQKKYNGTPKRPRLSVFCSDKQLYA
Subjt:  MPLSLLNTLDFQKVYGGMPLVNLPGLLSKSSTPFGFQSKPFNFPSLKSKNCDLKPLIIQARGHSKPESAKIRNRRMQKKYNGTPKRPRLSVFCSDKQLYA

Query:  MLVDDQNKKCLFYGSTLQKSMRPNPSCTTIEAAQYVGEELVKTCEDLNIHEISSYDRNGFARGEKMQAFEIAISHYGFLQR
        MLVDDQNKKCLFYGSTLQKSMRPNPSCTTIEAAQYVGEELVKTCEDLNIHEISSYDRNGFARGEKMQAFEIAISHYGFLQR
Subjt:  MLVDDQNKKCLFYGSTLQKSMRPNPSCTTIEAAQYVGEELVKTCEDLNIHEISSYDRNGFARGEKMQAFEIAISHYGFLQR

A0A6J1GYP6 uncharacterized protein LOC1114586892.34e-10482.87Show/hide
Query:  MPLSLLNTLDFQKVYGGMPLVNLPGLLSKSSTPFGFQSKPFNFPSLKSKNCDLKPLIIQARGHSKPESAKIRNRRMQKKYNGTPKRPRLSVFCSDKQLYA
        MP   LNT++F+KV GGM  +N+PGLLSK+ TPF FQSKPFNFPSLK ++CD+KPL+IQARG+SK ESAK+RNRR QKKYNG+  +PRLSVFCSDKQLYA
Subjt:  MPLSLLNTLDFQKVYGGMPLVNLPGLLSKSSTPFGFQSKPFNFPSLKSKNCDLKPLIIQARGHSKPESAKIRNRRMQKKYNGTPKRPRLSVFCSDKQLYA

Query:  MLVDDQNKKCLFYGSTLQKSMRPNPSCTTIEAAQYVGEELVKTCEDLNIHEISSYDRNGFARGEKMQAFEIAISHYGFLQR
        MLVDDQNKKCLFYGSTLQKSMRP+PSCTTIEAAQ+VGEELVK CEDLNIHEISSYDRNGFARGEKMQAFEIAIS YGFLQR
Subjt:  MLVDDQNKKCLFYGSTLQKSMRPNPSCTTIEAAQYVGEELVKTCEDLNIHEISSYDRNGFARGEKMQAFEIAISHYGFLQR

A0A6J1JIY0 uncharacterized protein LOC1114849979.94e-10683.43Show/hide
Query:  MPLSLLNTLDFQKVYGGMPLVNLPGLLSKSSTPFGFQSKPFNFPSLKSKNCDLKPLIIQARGHSKPESAKIRNRRMQKKYNGTPKRPRLSVFCSDKQLYA
        MP   LNT++F+KV GGM  +N+PGLLSK+STPF FQSKPFNFPSLK ++CD+KPL+IQARG+SK ESAK+RNRR+QKKYNG+  +PRLSVFCSDKQLYA
Subjt:  MPLSLLNTLDFQKVYGGMPLVNLPGLLSKSSTPFGFQSKPFNFPSLKSKNCDLKPLIIQARGHSKPESAKIRNRRMQKKYNGTPKRPRLSVFCSDKQLYA

Query:  MLVDDQNKKCLFYGSTLQKSMRPNPSCTTIEAAQYVGEELVKTCEDLNIHEISSYDRNGFARGEKMQAFEIAISHYGFLQR
        MLVDDQNKKCLFYGSTLQKSMRP+PSCTTIEAAQ+VGEELVK CEDLNIHEISSYDRNGFARGEKMQAFEIAIS YGFLQR
Subjt:  MLVDDQNKKCLFYGSTLQKSMRPNPSCTTIEAAQYVGEELVKTCEDLNIHEISSYDRNGFARGEKMQAFEIAISHYGFLQR

SwissProt top hitse value%identityAlignment
A9FGF8 50S ribosomal protein L187.7e-1237.38Show/hide
Query:  RNRRMQKKYNGTPKRPRLSVFCSDKQLYAMLVDDQNKKCLFYGSTLQKSMRPN-PSCTTIEAAQYVGEELVKTCEDLNIHEISSYDRNGFARGEKMQAFE
        R  R++KK  GTP+RPRLSVF S K +YA ++DD + K L + STL K ++ +      +EAA+ VG  + + C++  I ++  +DRNG+    ++ A  
Subjt:  RNRRMQKKYNGTPKRPRLSVFCSDKQLYAMLVDDQNKKCLFYGSTLQKSMRPN-PSCTTIEAAQYVGEELVKTCEDLNIHEISSYDRNGFARGEKMQAFE

Query:  IAISHYG
         A    G
Subjt:  IAISHYG

B8E1E9 50S ribosomal protein L183.8e-1138.68Show/hide
Query:  SKPESAKIRNRRMQKKYNGTPKRPRLSVFCSDKQLYAMLVDDQNKKCLFYGSTLQKSMRPN-PSCTTIEAAQYVGEELVKTCEDLNIHEISSYDRNGFAR
        S+ E  +IR+ R++KK  GTP+RPRL+V+ S + +YA ++DD     L   S+L+K +R    S   IEAA+ VGE + K   +  I  +  +DR GF  
Subjt:  SKPESAKIRNRRMQKKYNGTPKRPRLSVFCSDKQLYAMLVDDQNKKCLFYGSTLQKSMRPN-PSCTTIEAAQYVGEELVKTCEDLNIHEISSYDRNGFAR

Query:  GEKMQA
          K++A
Subjt:  GEKMQA

P82195 50S ribosomal protein L18, chloroplastic8.8e-1633.76Show/hide
Query:  LLSKSSTPFGFQSKPFNFPSLKSKNCDLKPLIIQARGHSKPESAKIRNRRMQKKYNGTPKRPRLSVFCSDKQLYAMLVDDQNKKCLFYGSTLQKSMRPN-
        L S SS+     S P  F + + +   L    IQA+ H++ E    R+ R++KK  GTP+RPRL VF S+K LY  ++DD     L   ST+QK++  N 
Subjt:  LLSKSSTPFGFQSKPFNFPSLKSKNCDLKPLIIQARGHSKPESAKIRNRRMQKKYNGTPKRPRLSVFCSDKQLYAMLVDDQNKKCLFYGSTLQKSMRPN-

Query:  --PSCTTIEAAQYVGEELVKTCEDLNIHEISSYDRNGFARGEKMQAFEIAISHYGFL
           +  T+E AQ +GE + K+C +  I ++ ++DR G+    +++A   A   +G +
Subjt:  --PSCTTIEAAQYVGEELVKTCEDLNIHEISSYDRNGFARGEKMQAFEIAISHYGFL

Q8SAY0 50S ribosomal protein L18, chloroplastic1.0e-1137.29Show/hide
Query:  SKPESAKI-RNRRMQKKYNGTPKRPRLSVFCSDKQLYAMLVDDQNKKCLFYGSTLQKSMRPN---PSCTTIEAAQYVGEELVKTCEDLNIHEISSYDRNG
        S P++ +I R+ R++KK +GT +RPRLSVF S+K LYA ++DD     L   ST+ KS+  +    +  T+E AQ +GE + K+C +  I ++  +DR G
Subjt:  SKPESAKI-RNRRMQKKYNGTPKRPRLSVFCSDKQLYAMLVDDQNKKCLFYGSTLQKSMRPN---PSCTTIEAAQYVGEELVKTCEDLNIHEISSYDRNG

Query:  FARGEKMQAFEIAISHYG
        F    +++A   A    G
Subjt:  FARGEKMQAFEIAISHYG

Q9SX68 50S ribosomal protein L18, chloroplastic1.1e-1333.8Show/hide
Query:  KPFNFPSLKSKNCDLKPLIIQARGHSKPESAKIRNRRMQKKYNGTPKRPRLSVFCSDKQLYAMLVDDQNKKCLFYGSTLQKSMRPN---PSCTTIEAAQY
        KP++  SL+S++     ++++A+  +  E    R+ R++KK NGT +RPRL VF S+K LY  ++DD     L   ST QK +       S  TIE A+ 
Subjt:  KPFNFPSLKSKNCDLKPLIIQARGHSKPESAKIRNRRMQKKYNGTPKRPRLSVFCSDKQLYAMLVDDQNKKCLFYGSTLQKSMRPN---PSCTTIEAAQY

Query:  VGEELVKTCEDLNIHEISSYDRNGFARGEKMQAFEIAISHYG
        VGE + K+C +  I ++ ++DR G+    +++A   A   +G
Subjt:  VGEELVKTCEDLNIHEISSYDRNGFARGEKMQAFEIAISHYG

Arabidopsis top hitse value%identityAlignment
AT1G14205.1 Ribosomal L18p/L5e family protein4.4e-4760.25Show/hide
Query:  LVNLPGLLSKSSTPFGFQSKPF-NFPSLKSKNCDLKPLIIQARGHSKPESAKIRNRRMQKKYNGTPKRPRLSVFCSDKQLYAMLVDDQNKKCLFYGSTLQ
        L +L  +  + S   G + KP+  FP   +    +KP +I+ARG+++ ES K RNRR +KK+ GT  +PRLSVFCSDKQLYAMLVDD NKKCLFY STLQ
Subjt:  LVNLPGLLSKSSTPFGFQSKPF-NFPSLKSKNCDLKPLIIQARGHSKPESAKIRNRRMQKKYNGTPKRPRLSVFCSDKQLYAMLVDDQNKKCLFYGSTLQ

Query:  KSMRPNPSCTTIEAAQYVGEELVKTCEDLNIHEISSYDRNGFARGEKMQAFEIAISHYGFL
        KS+R +P CT IEAA+ VGEEL+K   DL I+EISSYDRNG  RGE+MQAFEIAI+ +GFL
Subjt:  KSMRPNPSCTTIEAAQYVGEELVKTCEDLNIHEISSYDRNGFARGEKMQAFEIAISHYGFL

AT1G48350.1 Ribosomal L18p/L5e family protein7.6e-1533.8Show/hide
Query:  KPFNFPSLKSKNCDLKPLIIQARGHSKPESAKIRNRRMQKKYNGTPKRPRLSVFCSDKQLYAMLVDDQNKKCLFYGSTLQKSMRPN---PSCTTIEAAQY
        KP++  SL+S++     ++++A+  +  E    R+ R++KK NGT +RPRL VF S+K LY  ++DD     L   ST QK +       S  TIE A+ 
Subjt:  KPFNFPSLKSKNCDLKPLIIQARGHSKPESAKIRNRRMQKKYNGTPKRPRLSVFCSDKQLYAMLVDDQNKKCLFYGSTLQKSMRPN---PSCTTIEAAQY

Query:  VGEELVKTCEDLNIHEISSYDRNGFARGEKMQAFEIAISHYG
        VGE + K+C +  I ++ ++DR G+    +++A   A   +G
Subjt:  VGEELVKTCEDLNIHEISSYDRNGFARGEKMQAFEIAISHYG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCCCTTTCACTTCTCAATACTTTAGATTTTCAAAAAGTATACGGTGGCATGCCTCTTGTAAATCTTCCCGGTCTGCTATCGAAGTCTTCTACGCCCTTTGGGTTTCA
GTCAAAGCCATTTAATTTCCCCTCCTTGAAAAGTAAAAATTGCGATTTGAAGCCGTTGATTATCCAAGCAAGAGGACACTCTAAACCAGAAAGTGCAAAAATTAGAAATA
GGAGGATGCAGAAAAAGTATAATGGCACTCCTAAAAGACCAAGGCTTTCAGTATTCTGTTCTGATAAGCAGTTATATGCAATGCTGGTAGACGACCAGAACAAGAAGTGC
CTCTTTTATGGGAGCACCCTGCAGAAATCTATGCGTCCAAATCCCTCGTGCACCACCATTGAAGCTGCTCAGTACGTTGGTGAAGAACTTGTGAAGACATGTGAGGATTT
GAACATCCACGAAATATCATCTTATGACCGAAACGGGTTTGCTCGTGGAGAGAAGATGCAAGCCTTTGAGATTGCAATTTCACACTATGGGTTCTTACAGCGATAG
mRNA sequenceShow/hide mRNA sequence
TCGATGTTTATATAAAATTGAGAATTCTACATTTCCATCCACATAGACATGTAGCGTAAAATATTGTGCTCGATCACTCTATTGAAAGTGTTCCATAGCCAATGTAACGA
ATCCCGCTCAAGCGGATCCAATAGTGTGAGCCGTAAAGCATAGATCGTGAGAAATAAAAATAGTAATAAACGGAAAGTTGAAATTTGAAGAAAAAGAATGGGGTTGAGCT
ATCCCGTCCATATACACGTGGAAGAAGACGGTAGGCAACATATTTTGGTTTCTCCTGCAACCACAGAGACGCCAATCGGGCTTTTACAGTACAGAATCAAACAGGCTTCA
ACAACGAAACCACATTCATTCCGCCGGAACTCTGGCTCGGCAGGTGTCGCGTGCAAGGTGTTCGATAAAATGCCTGAATGAAGAGGACCGCGTCGAAATGCCCCTTTCAC
TTCTCAATACTTTAGATTTTCAAAAAGTATACGGTGGCATGCCTCTTGTAAATCTTCCCGGTCTGCTATCGAAGTCTTCTACGCCCTTTGGGTTTCAGTCAAAGCCATTT
AATTTCCCCTCCTTGAAAAGTAAAAATTGCGATTTGAAGCCGTTGATTATCCAAGCAAGAGGACACTCTAAACCAGAAAGTGCAAAAATTAGAAATAGGAGGATGCAGAA
AAAGTATAATGGCACTCCTAAAAGACCAAGGCTTTCAGTATTCTGTTCTGATAAGCAGTTATATGCAATGCTGGTAGACGACCAGAACAAGAAGTGCCTCTTTTATGGGA
GCACCCTGCAGAAATCTATGCGTCCAAATCCCTCGTGCACCACCATTGAAGCTGCTCAGTACGTTGGTGAAGAACTTGTGAAGACATGTGAGGATTTGAACATCCACGAA
ATATCATCTTATGACCGAAACGGGTTTGCTCGTGGAGAGAAGATGCAAGCCTTTGAGATTGCAATTTCACACTATGGGTTCTTACAGCGATAGATTTGCCAGGAATTAAG
GCAAGTTGTAATCGTAAATGCCTTGTTCACACTGTTAATTTATCTAAACACATTCGTTTTCATTTCTATGATAAGTTGTCCAGAAAAGGATGAAGTTATGATAACAAATG
ATGCTGTGTACTTGTTTGTGCCCCGCAGCTGTGTGAGAGAACCGTTGTTAAAACAATAGAAATCCGATTCAAACAATTTCAACAGCATGATTTGATAAAATTCAATTGAT
TCACCATTCTATATACCACTTTACAGGCTACAGCTCTCTTTCTATCATGTCTCTCTCGGTCACTTGAAATATACAAAATTTCAAGCAACTCTTGAAAATCCATATACTTC
TCTACAGTAACACCTAATTATACAGCAGTCGTTAAGAGGGTTTAAAAATTCTCATGTTACCTCTTCTGTTTCAGGGGTATCAACCCTGGTGTGATCTTTCATCTAACTCT
ACTTCGATGGTGTCAGAATAGTTACTCCATTTCCAGGCTGCTTAGAACATAACTCCTCTTAGTTATCAGAACCACAGCATCTAATACCATTTCCTTCCTGAAGTTTCTGC
CACAAACAGAGCATCTCTCGAGGTTCCTGGTAGTGAGCAACCACATAATCATCCTTACAACCTTCAATATGGATCCTATCGTCCTTCTCGTACCGAATCTGAAGGCCCTC
CCTCTTCATATTTTCAATCCAGATTCCCATGGCTACATCTTCTAATTTGAACATCTGAAGATATAATAGCCTGGTTTTAATTCATCGTTTATGGAAAAGTACCTATTTCA
AGCAAAGAAATAACGGATACCTTTAAGCGGCCCTCTTTATATTTCTTGGAGATCGTCTTTGCTATGTCACTAGACACCACATAACCAGGACCATGTGCCCATGTTGGATA
ATTCTCTTCAGGCCATTCCTAAATTGGAGCAACCAATGAAGATAAGATGGTAGAAAGAAAACTAATTTTCTTTTATTAAATTAATCCAAGTACGCTACATTGGCAAATGG
AACCATATTCTGAGCGAGCTTGTTCAAAAGTGTGAATAATATTCGACAGACTCTTTTATAATATAAATATCAGCCTAATTCAAAGCATACCAACTCAAACATGGTGGAAT
AAACCAGGGTTTTTAGTAAAGCAAGGGATAGATTAAGACCCATAAAAGTGGGAAAAAGGTGAGTGAATTTTCAGCCTTGAGTTATCCACGCAATGTTAGATCCTATGCCC
ATATAATAAAAAACCAGTGTTGAAAATTACCTCCATACTAATATACCACTTGCTCTCAGGGTTTCTGTGAGGTTGAGAATCTGAGTTGATGAGTCCATAAAGCAATCCAC
TTTGTGCATTGATCCTTTTTAAAGAAGCTAAAACTTCATCCACTCGAACAAAGGCGTCATCATCGGTCTTCATGATGTACTTTGCTGAAGCTATCTCTGCCTGCCAAAGT
GGTGAGATTATTAACATGAATCATAATGATGAGTTCACATCATTTAGTGAAAGAACGAAACTTTACCCCAAAGATACAGATACCCAACGTTTTCCAGGTTATAAGGCTGT
AGTAGTCAACAAAAGGCATCATCTGTATGTCACCATATGTTCGGGCTTCATCCCACAGTTCCTCATTGACTATTTGGTTCTTATGCTGCAAATGATGTAAAGTGATTGTA
AACCACTACGGGTTGTACTCTTCATAGTATGTGAATTTTAAGAGACGGATGATGAAAGAAACAAAAACATTTGTATACTTTTAATATGAACAGCAGATGAATCATTCCAG
TTCAAGTATCCAACCGATGAAATTTTGAACTAGTGTAGCAGATCTTATCAAGCACCTCACTCAATTTCACTTGAAGGCATTATGAGCAGACTTAGAAGTGGCCTCAAGAT
AAACTTTGGAAGTAAACTACGTGGTTCATTCAACAAACAGCTAGACTTTTTCATACTCAATGAAACCGCCAGCTTCAATATGCATGAAAAGTTTTTAGCCTAATTTTTTT
TGAGGTTCTTTTCAATCCAAGAAGAGTACATGTGAAGCTCACCAACCCAACAAAAAAGCGAACTGCAACAGATCCCGACTGTACTGCAAGATACTGCATCCACGTTCTTC
GAACAGCCATTCGATATTTAAAATTGTTTGCCGTTGAGAAAACACCGATGAACAGATCTAATTGTTTTTCTGTAGAGAGGGGAGTTGATTTTAATGCTTCTATATTAACT
ATATGATCTGAATCCTCAGATGTGGGCAAACCACTTGCCAGGACAGAAATTAATTTCAAGTCTCCAGAAATCTTTACTTCACTGACCAGCCATGGCTCCAAAGTCTGAGT
AAGAAGCAAGTCAAAACAGCATAAGATTGAGTGATCAGAGAACAATGTAAAGGTAATTGATATAAGACTGGGTGAGATTACTTCACGGTAAGCAAAAGAAGTAACGTGCT
TTCCATCAACTGTCATCTGGATTCCATCCACTCCCACTCTAAGTGTAGCAGCAAATGGATGACCGAGTTTGAAAGGGAAGTACTTTATTGTTTTAGCTCCTTGCACCACA
GATTTTGATTTGTTGAAATTCTTGTTCAACTCAGGAAGTCGATTTTCGACATTGCCCACAATCTTGTTGCATTTTTCCAATTCATCCACTAGCATAAAAAATGAACAACG
TTAACTTCAAAGTGAATGGAAATAGTGCTTGATTGGGGGAGGGAACCTAATACATTGTTCAGTTCAATGTTATGAAGTTTATGGATGCAATGTTCTCAATTTCCCTAGCC
ATTGATGAGTCAAATATGGCCTGCTTCGTAACAAAGTATAAAGGTAAGGTTTTTTATCAGCAAACCTGAATTGTTTATAAAAACGACACATTTCAATCGGAATATACAAA
CAGCTACATATTTTAAATGCAGAAATCTGACTAGAACAGTTATTTTACGTTAGTGGTGTCTTGAATGAAATAATTCAAAGGAAACATGAATAGACATCTATACCTTTTCC
ATTCTCGTCAGAACCAGGGGGCGGGCATCGATCCTCTTCTCCCCAATCACGAGATACTGTCCAGGTGTTCTGGACAATTACAGGATCCTCAGTTAATTTATCCCCCAAAA
GCCTAACATTATAATGCAAAATGATGGGTGGCTCAGGCTCCCCAGGCAGTGGTTCCCCAGTCAAGTCAATTCGAAAGTTACCAAGAAGACCATCTGGAATGCCAATGACT
GTGATGGAAGAACCCTGAGTAAGGCCACAAGGCATTCTTAGCTTATGACCATTGCTGTCAACCTTTGTTGCATTCATTTTGGTCAGAAAATGGGGGCATTGCTTCTCTTT
CGCACGTGCACTTTCATTCGTATAACCAAGCCTTTCCTTTTCAATCGAAGTTTTTAGAAAATCCCATACGCCTCCAGCTTCCTTAATGGCCTCAACTGCATTAGGCAAAC
CTCGGGTACAATTAATTATGCTTCTTAAATGGTTCCAAGTCTGCAGAGATTGATTCTCTCCGTCAGTAAAATTCCCCTGAGTGAAGAGACTGGATACTATAGAGTCGGCA
GAAATCACTTGGTATGCAATTTCAGGATTTTGAACTGCAGGTGGAACTGCAGGATTGAGCCACTGAAGAGGATCAATAGAATTGTACAAGGGGTTTGCATAATAATTTTC
CCGAGACGAACTTCTCGTGATAACATAAATCATCAACAGCAATGTAAACAAGGACGCAACCAGAACCCGGCCATAGCTTTTCTTCATTCTGGCGTTCAAATGATACCCTA
ATAGGCAATAGCTCTAAGCCCAAGTTTGACCATCATTAAGCATCACGAAACTGAAGTTCCAATGGCTGGAAACCTTGTTTCTTGAGACGAACTCTGAAAGCGCATTGTAT
GAAAGATTCGATCATGAAGAACATATATATCTAGAAACAAATGGTAAATGAGTTTTGACAATATGAATATATACATATAAAAAAAATCATCTTCAACTTTCCGGTTAAAA
TTAATATATATACAATACCGAGGAACGGCCAGGACTTCAACAAGGGATAAAACTCACTAATTTTTTCTTCTCTTTGAATAAGATAGTCACGCCCAGCCCAAAATCAAAAG
GCAAATAACAAGGAGAAAGTTCCTATTTAGAAGTTGATTCAAACTTTTGACATATATTAATTGTAAAATAATGAATCAATAAATGCACAAGAAAGTAGAACTACTTATTG
CAGATATTTAATAGGTTCCTGCAGTTAATGGCAATATCAATAATCCTAACTTACATCCCGCGTCAATCGCAAAAAATATTTTCTTAAAAGAATTATTAAACAAAAAACCA
GTGACGGTACCAGAAAGGAAATGGAGGAGTAAAGAAATCGAAAACTAAACAGAAACAAACAACAAACGATTCGTCCGCACTCCACTATGCACAGTTTTTCCAAACTATGA
ACGATTTCAAAAAGTAGCAAAACACAGAACAGAAAGAAGTGGAAAAGTAGACTTCGGTGTTCTTTAAGAAGTAAATCATCAGAACCTCGCAAATAGACTTCCGGCGACCG
GAATCACGGCGAGATCCAAAGACATTGTGTAGCCGTTCCGCAGAACAAGAACCAGTACTCCGACCTCAACAGTCCGTCGGAGAAGAACATGCTGAGGCAAGGCGAAGAAA
AAAGAGAGACCAAATTTAGTTTGAGAATGTGAACAGAAACTCCCGCCGGAAAAAATGCCGAGAAAATAAAGCCGACGCGGCAAGAAATTTAGGAAGTTAAAGGCGGTGGA
GGAAGAAGAAAGAAGAGAAGAAGAAGACGAAGAAGGAGAGAGAAGGGGAAGTTGCAGTGAGTAATAGATCGCTGTCGTTTCAAGCTTGGAGTCGACGTCAAAGCATAGAA
AAATTAAAATTAAAAAAAAAAAAGTAAAAAAGAAAAAACAACGTTGAA
Protein sequenceShow/hide protein sequence
MPLSLLNTLDFQKVYGGMPLVNLPGLLSKSSTPFGFQSKPFNFPSLKSKNCDLKPLIIQARGHSKPESAKIRNRRMQKKYNGTPKRPRLSVFCSDKQLYAMLVDDQNKKC
LFYGSTLQKSMRPNPSCTTIEAAQYVGEELVKTCEDLNIHEISSYDRNGFARGEKMQAFEIAISHYGFLQR