; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr012583 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr012583
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description50S ribosomal protein L33-like
Genome locationtig00153447:102987..107919
RNA-Seq ExpressionSgr012583
SyntenySgr012583
Gene Ontology termsGO:0006412 - translation (biological process)
GO:0005840 - ribosome (cellular component)
GO:0009536 - plastid (cellular component)
GO:0003735 - structural constituent of ribosome (molecular function)
InterPro domainsIPR001705 - Ribosomal protein L33
IPR011332 - Zinc-binding ribosomal protein
IPR018264 - Ribosomal protein L33, conserved site
IPR038584 - Ribosomal protein L33 superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6577004.1 hypothetical protein SDJN03_24578, partial [Cucurbita argyrosperma subsp. sororia]3.3e-5156.09Show/hide
Query:  MATTIQRLCLDLALSCGKRNPFLCNSASSTPSNVVTFKSLAFSASLSLTLVSNIKGNSSQFHPKSSTTNPFIILVNSASEYVPDSSSLYRFFPPILMKII
        MA TI+RLCLD ALSCG+RN FLCNSASSTP + V+FK L+FSA+LSL++V N+K                                             
Subjt:  MATTIQRLCLDLALSCGKRNPFLCNSASSTPSNVVTFKSLAFSASLSLTLVSNIKGNSSQFHPKSSTTNPFIILVNSASEYVPDSSSLYRFFPPILMKII

Query:  IKTQEKTGMKGIIIESICWEWEKFVWWGSLSHSARERSIRHCSVICMAKRYAPNTTKRMRLSRKRGGDPSKKKKSRRKGGKRDFKIVRLSSAAGTGFFYA
                                    SLS SARERSI + SVICMAKRYAPNTTKR RLSRKRGGDP KKKK+RRKGG++DFKI+RLSS AGTGFFYA
Subjt:  IKTQEKTGMKGIIIESICWEWEKFVWWGSLSHSARERSIRHCSVICMAKRYAPNTTKRMRLSRKRGGDPSKKKKSRRKGGKRDFKIVRLSSAAGTGFFYA

Query:  KKKSRKMADKIELQKYDPLAKRHVLFTEVK
        KKKSRKMADKIELQK+DPLAKRHVLFTEVK
Subjt:  KKKSRKMADKIELQKYDPLAKRHVLFTEVK

KAG7015026.1 rpmG, partial [Cucurbita argyrosperma subsp. argyrosperma]5.1e-5256.52Show/hide
Query:  MATTIQRLCLDLALSCGKRNPFLCNSASSTPSNVVTFKSLAFSASLSLTLVSNIKGNSSQFHPKSSTTNPFIILVNSASEYVPDSSSLYRFFPPILMKII
        MA TI+RLCLD ALSCG+RN FLCNSASSTP + V+FK L+FSA+LSL++V N+K                                             
Subjt:  MATTIQRLCLDLALSCGKRNPFLCNSASSTPSNVVTFKSLAFSASLSLTLVSNIKGNSSQFHPKSSTTNPFIILVNSASEYVPDSSSLYRFFPPILMKII

Query:  IKTQEKTGMKGIIIESICWEWEKFVWWGSLSHSARERSIRHCSVICMAKRYAPNTTKRMRLSRKRGGDPSKKKKSRRKGGKRDFKIVRLSSAAGTGFFYA
                                    SLS SARERSIR+ SVICMAKRYAPNTTKR RLSRKRGGDP KKKK+RRKGG++DFKI+RLSS AGTGFFYA
Subjt:  IKTQEKTGMKGIIIESICWEWEKFVWWGSLSHSARERSIRHCSVICMAKRYAPNTTKRMRLSRKRGGDPSKKKKSRRKGGKRDFKIVRLSSAAGTGFFYA

Query:  KKKSRKMADKIELQKYDPLAKRHVLFTEVK
        KKKSRKMADKIELQK+DPLAKRHVLFTEVK
Subjt:  KKKSRKMADKIELQKYDPLAKRHVLFTEVK

XP_022922689.1 uncharacterized protein LOC111430611 isoform X1 [Cucurbita moschata]2.8e-5054.74Show/hide
Query:  MATTIQRLCLDLALSCGKRNPFLCNSASSTPSNVVTFKSLAFSASLSLTLVSNIKGNSSQFHPKSSTTNPFIILVNSASEYVPDSSSLYRFFPPILMKII
        MA TI+R CLD ALSCG+RN FLC+SASSTP + V+FK L+FSA+LSL++V N+K                                             
Subjt:  MATTIQRLCLDLALSCGKRNPFLCNSASSTPSNVVTFKSLAFSASLSLTLVSNIKGNSSQFHPKSSTTNPFIILVNSASEYVPDSSSLYRFFPPILMKII

Query:  IKTQEKTGMKGIIIESICWEWEKFVWWGSLSHSARERSIRHCSVICMAKRYAPNTTKRMRLSRKRGGDPSKKKKSRRKGGKRDFKIVRLSSAAGTGFFYA
                                    SLS SARERSIRH SVICMA+RYAPN TKR RLSRKRGGDP KKKK+RRKGG++DFKI+RLSS AGTGFFYA
Subjt:  IKTQEKTGMKGIIIESICWEWEKFVWWGSLSHSARERSIRHCSVICMAKRYAPNTTKRMRLSRKRGGDPSKKKKSRRKGGKRDFKIVRLSSAAGTGFFYA

Query:  KKKSRKMADKIELQKYDPLAKRHVLFTEVKSL
        KKKSRKMADKIELQK+DPLAKRHVLFTEV  L
Subjt:  KKKSRKMADKIELQKYDPLAKRHVLFTEVKSL

XP_022922690.1 uncharacterized protein LOC111430611 isoform X2 [Cucurbita moschata]1.6e-5054.98Show/hide
Query:  MATTIQRLCLDLALSCGKRNPFLCNSASSTPSNVVTFKSLAFSASLSLTLVSNIKGNSSQFHPKSSTTNPFIILVNSASEYVPDSSSLYRFFPPILMKII
        MA TI+R CLD ALSCG+RN FLC+SASSTP + V+FK L+FSA+LSL++V N+K                                             
Subjt:  MATTIQRLCLDLALSCGKRNPFLCNSASSTPSNVVTFKSLAFSASLSLTLVSNIKGNSSQFHPKSSTTNPFIILVNSASEYVPDSSSLYRFFPPILMKII

Query:  IKTQEKTGMKGIIIESICWEWEKFVWWGSLSHSARERSIRHCSVICMAKRYAPNTTKRMRLSRKRGGDPSKKKKSRRKGGKRDFKIVRLSSAAGTGFFYA
                                    SLS SARERSIRH SVICMA+RYAPN TKR RLSRKRGGDP KKKK+RRKGG++DFKI+RLSS AGTGFFYA
Subjt:  IKTQEKTGMKGIIIESICWEWEKFVWWGSLSHSARERSIRHCSVICMAKRYAPNTTKRMRLSRKRGGDPSKKKKSRRKGGKRDFKIVRLSSAAGTGFFYA

Query:  KKKSRKMADKIELQKYDPLAKRHVLFTEVKS
        KKKSRKMADKIELQK+DPLAKRHVLFTEVK+
Subjt:  KKKSRKMADKIELQKYDPLAKRHVLFTEVKS

XP_023552499.1 uncharacterized protein LOC111810144 [Cucurbita pepo subsp. pepo]3.3e-5155.41Show/hide
Query:  MATTIQRLCLDLALSCGKRNPFLCNSASSTPSNVVTFKSLAFSASLSLTLVSNIKGNSSQFHPKSSTTNPFIILVNSASEYVPDSSSLYRFFPPILMKII
        MA TI+RLC D ALSCG+RN FLC+SASSTP + V+FK L+FSA+LSL++V N+K                                             
Subjt:  MATTIQRLCLDLALSCGKRNPFLCNSASSTPSNVVTFKSLAFSASLSLTLVSNIKGNSSQFHPKSSTTNPFIILVNSASEYVPDSSSLYRFFPPILMKII

Query:  IKTQEKTGMKGIIIESICWEWEKFVWWGSLSHSARERSIRHCSVICMAKRYAPNTTKRMRLSRKRGGDPSKKKKSRRKGGKRDFKIVRLSSAAGTGFFYA
                                    SLS SARERSIRH SV CMAKRYAPNTTKR RLSRKRGGDP KKKK+RRKGG++DFKI+RLSS AGTGFFYA
Subjt:  IKTQEKTGMKGIIIESICWEWEKFVWWGSLSHSARERSIRHCSVICMAKRYAPNTTKRMRLSRKRGGDPSKKKKSRRKGGKRDFKIVRLSSAAGTGFFYA

Query:  KKKSRKMADKIELQKYDPLAKRHVLFTEVKS
        KKKSRKMADKIELQK+DPLAKRHVLFTEVK+
Subjt:  KKKSRKMADKIELQKYDPLAKRHVLFTEVKS

TrEMBL top hitse value%identityAlignment
A0A5A7UZ21 50S ribosomal protein L33-like5.7e-4953.91Show/hide
Query:  MATTIQRLCLDLALSCGKRNPFLCNSASSTPSNVVTFKSLAFSASLSLTLVSNIKGNSSQFHPKSSTTNPFIILVNSASEYVPDSSSLYRFFPPILMKII
        MATTIQR CLDL LSCGKRN FL NS+SS PSN +TF S +F A+LSLTL                              +VP+                
Subjt:  MATTIQRLCLDLALSCGKRNPFLCNSASSTPSNVVTFKSLAFSASLSLTLVSNIKGNSSQFHPKSSTTNPFIILVNSASEYVPDSSSLYRFFPPILMKII

Query:  IKTQEKTGMKGIIIESICWEWEKFVWWGSLSHSARERSIRHCSVICMAKRYAPNTTKRMRLSRKRGGDPSKKKKSRRKGGKRDFKIVRLSSAAGTGFFYA
                +KG+                 L   A ERSIR+ SVICMAKRYAP+TTKR RLSRKRGGDP+KKKK+RRKGG++DFKIVRLSS A TGFFYA
Subjt:  IKTQEKTGMKGIIIESICWEWEKFVWWGSLSHSARERSIRHCSVICMAKRYAPNTTKRMRLSRKRGGDPSKKKKSRRKGGKRDFKIVRLSSAAGTGFFYA

Query:  KKKSRKMADKIELQKYDPLAKRHVLFTEVKSLQAFPLHTLLAS
        KKKSRKMADKIE+QKYDP+A RHVLFTEVK L  F +  LLAS
Subjt:  KKKSRKMADKIELQKYDPLAKRHVLFTEVKSLQAFPLHTLLAS

A0A6J1E445 uncharacterized protein LOC111430611 isoform X11.3e-5054.74Show/hide
Query:  MATTIQRLCLDLALSCGKRNPFLCNSASSTPSNVVTFKSLAFSASLSLTLVSNIKGNSSQFHPKSSTTNPFIILVNSASEYVPDSSSLYRFFPPILMKII
        MA TI+R CLD ALSCG+RN FLC+SASSTP + V+FK L+FSA+LSL++V N+K                                             
Subjt:  MATTIQRLCLDLALSCGKRNPFLCNSASSTPSNVVTFKSLAFSASLSLTLVSNIKGNSSQFHPKSSTTNPFIILVNSASEYVPDSSSLYRFFPPILMKII

Query:  IKTQEKTGMKGIIIESICWEWEKFVWWGSLSHSARERSIRHCSVICMAKRYAPNTTKRMRLSRKRGGDPSKKKKSRRKGGKRDFKIVRLSSAAGTGFFYA
                                    SLS SARERSIRH SVICMA+RYAPN TKR RLSRKRGGDP KKKK+RRKGG++DFKI+RLSS AGTGFFYA
Subjt:  IKTQEKTGMKGIIIESICWEWEKFVWWGSLSHSARERSIRHCSVICMAKRYAPNTTKRMRLSRKRGGDPSKKKKSRRKGGKRDFKIVRLSSAAGTGFFYA

Query:  KKKSRKMADKIELQKYDPLAKRHVLFTEVKSL
        KKKSRKMADKIELQK+DPLAKRHVLFTEV  L
Subjt:  KKKSRKMADKIELQKYDPLAKRHVLFTEVKSL

A0A6J1E4T6 uncharacterized protein LOC111430611 isoform X27.9e-5154.98Show/hide
Query:  MATTIQRLCLDLALSCGKRNPFLCNSASSTPSNVVTFKSLAFSASLSLTLVSNIKGNSSQFHPKSSTTNPFIILVNSASEYVPDSSSLYRFFPPILMKII
        MA TI+R CLD ALSCG+RN FLC+SASSTP + V+FK L+FSA+LSL++V N+K                                             
Subjt:  MATTIQRLCLDLALSCGKRNPFLCNSASSTPSNVVTFKSLAFSASLSLTLVSNIKGNSSQFHPKSSTTNPFIILVNSASEYVPDSSSLYRFFPPILMKII

Query:  IKTQEKTGMKGIIIESICWEWEKFVWWGSLSHSARERSIRHCSVICMAKRYAPNTTKRMRLSRKRGGDPSKKKKSRRKGGKRDFKIVRLSSAAGTGFFYA
                                    SLS SARERSIRH SVICMA+RYAPN TKR RLSRKRGGDP KKKK+RRKGG++DFKI+RLSS AGTGFFYA
Subjt:  IKTQEKTGMKGIIIESICWEWEKFVWWGSLSHSARERSIRHCSVICMAKRYAPNTTKRMRLSRKRGGDPSKKKKSRRKGGKRDFKIVRLSSAAGTGFFYA

Query:  KKKSRKMADKIELQKYDPLAKRHVLFTEVKS
        KKKSRKMADKIELQK+DPLAKRHVLFTEVK+
Subjt:  KKKSRKMADKIELQKYDPLAKRHVLFTEVKS

A0A6J1J250 uncharacterized protein LOC111482725 isoform X21.1e-4954.11Show/hide
Query:  MATTIQRLCLDLALSCGKRNPFLCNSASSTPSNVVTFKSLAFSASLSLTLVSNIKGNSSQFHPKSSTTNPFIILVNSASEYVPDSSSLYRFFPPILMKII
        MA TI+RLCL  AL CG+RN FLC+SASSTP + V+FK L+FSA+LSL +V N+K                                             
Subjt:  MATTIQRLCLDLALSCGKRNPFLCNSASSTPSNVVTFKSLAFSASLSLTLVSNIKGNSSQFHPKSSTTNPFIILVNSASEYVPDSSSLYRFFPPILMKII

Query:  IKTQEKTGMKGIIIESICWEWEKFVWWGSLSHSARERSIRHCSVICMAKRYAPNTTKRMRLSRKRGGDPSKKKKSRRKGGKRDFKIVRLSSAAGTGFFYA
                                    SLS SARERSIRH SVICMA+RYAPNTTKR RLSRKRGGDP KK+K+RRKGG++DFKI+RLSS AGTGFFYA
Subjt:  IKTQEKTGMKGIIIESICWEWEKFVWWGSLSHSARERSIRHCSVICMAKRYAPNTTKRMRLSRKRGGDPSKKKKSRRKGGKRDFKIVRLSSAAGTGFFYA

Query:  KKKSRKMADKIELQKYDPLAKRHVLFTEVKS
        KKKSRKMADKIELQK+DPLAKRHVLFTE K+
Subjt:  KKKSRKMADKIELQKYDPLAKRHVLFTEVKS

A0A6J1J566 uncharacterized protein LOC111482725 isoform X11.5e-4954.35Show/hide
Query:  MATTIQRLCLDLALSCGKRNPFLCNSASSTPSNVVTFKSLAFSASLSLTLVSNIKGNSSQFHPKSSTTNPFIILVNSASEYVPDSSSLYRFFPPILMKII
        MA TI+RLCL  AL CG+RN FLC+SASSTP + V+FK L+FSA+LSL +V N+K                                             
Subjt:  MATTIQRLCLDLALSCGKRNPFLCNSASSTPSNVVTFKSLAFSASLSLTLVSNIKGNSSQFHPKSSTTNPFIILVNSASEYVPDSSSLYRFFPPILMKII

Query:  IKTQEKTGMKGIIIESICWEWEKFVWWGSLSHSARERSIRHCSVICMAKRYAPNTTKRMRLSRKRGGDPSKKKKSRRKGGKRDFKIVRLSSAAGTGFFYA
                                    SLS SARERSIRH SVICMA+RYAPNTTKR RLSRKRGGDP KK+K+RRKGG++DFKI+RLSS AGTGFFYA
Subjt:  IKTQEKTGMKGIIIESICWEWEKFVWWGSLSHSARERSIRHCSVICMAKRYAPNTTKRMRLSRKRGGDPSKKKKSRRKGGKRDFKIVRLSSAAGTGFFYA

Query:  KKKSRKMADKIELQKYDPLAKRHVLFTEVK
        KKKSRKMADKIELQK+DPLAKRHVLFTE K
Subjt:  KKKSRKMADKIELQKYDPLAKRHVLFTEVK

SwissProt top hitse value%identityAlignment
B6IN38 50S ribosomal protein L331.2e-0650.98Show/hide
Query:  KRDFKIVRLSSAAGTGFFYAKKKS-RKMADKIELQKYDPLAKRHVLFTEVK
        K++  +++L S+A TGFFY KKK+ RK  +K+E +KYDP+A++HV+F E K
Subjt:  KRDFKIVRLSSAAGTGFFYAKKKS-RKMADKIELQKYDPLAKRHVLFTEVK

Q2W1A2 50S ribosomal protein L333.3e-0654.35Show/hide
Query:  IVRLSSAAGTGFFY-AKKKSRKMADKIELQKYDPLAKRHVLFTEVK
        +++L S AGTGFFY AKK  RK  +K+E +KYDP+ ++HV F E K
Subjt:  IVRLSSAAGTGFFY-AKKKSRKMADKIELQKYDPLAKRHVLFTEVK

Q5FFJ4 50S ribosomal protein L333.0e-0754.35Show/hide
Query:  IVRLSSAAGTGFFYAKKKS-RKMADKIELQKYDPLAKRHVLFTEVK
        +V+L+S+AGTG+FY KK++ +K+ +K+  +KYDP+A++HVLFTE K
Subjt:  IVRLSSAAGTGFFYAKKKS-RKMADKIELQKYDPLAKRHVLFTEVK

Q5HBV6 50S ribosomal protein L333.0e-0754.35Show/hide
Query:  IVRLSSAAGTGFFYAKKKS-RKMADKIELQKYDPLAKRHVLFTEVK
        +V+L+S+AGTG+FY KK++ +K+ +K+  +KYDP+A++HVLFTE K
Subjt:  IVRLSSAAGTGFFYAKKKS-RKMADKIELQKYDPLAKRHVLFTEVK

Q9RSS4 50S ribosomal protein L335.7e-0654.35Show/hide
Query:  IVRLSSAAGTGFFYAKKKSRKMAD-KIELQKYDPLAKRHVLFTEVK
        IV++ S+AGTGF+Y   K+R+    K+EL+KYDP+AK+HV+F E K
Subjt:  IVRLSSAAGTGFFYAKKKSRKMAD-KIELQKYDPLAKRHVLFTEVK

Arabidopsis top hitse value%identityAlignment
AT3G06320.1 Ribosomal protein L33 family protein1.1e-0960.78Show/hide
Query:  KRDFKIVRLSSAAGTGFFYAKKKSRK-MADKIELQKYDPLAKRHVLFTEVK
        K+ F  +RL SAAGTGFFY K+KS K + +K+E +KYDP   RHVLFTE K
Subjt:  KRDFKIVRLSSAAGTGFFYAKKKSRK-MADKIELQKYDPLAKRHVLFTEVK

AT5G18790.1 Ribosomal protein L33 family protein1.1e-0960.78Show/hide
Query:  KRDFKIVRLSSAAGTGFFYAKKKSRK-MADKIELQKYDPLAKRHVLFTEVK
        K+ F  +RL SAAGTGFFY K+KS K + +K+E +KYDP   RHVLFTE K
Subjt:  KRDFKIVRLSSAAGTGFFYAKKKSRK-MADKIELQKYDPLAKRHVLFTEVK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGACAACAATTCAAAGGCTTTGTTTGGACTTGGCGCTTTCTTGTGGGAAAAGGAACCCATTTCTCTGTAATTCCGCGTCTTCCACCCCTTCGAACGTTGTCACTTT
CAAGTCTCTTGCCTTCTCTGCATCCCTCTCGTTGACCCTCGTCTCCAACATCAAAGGTAATAGCTCACAATTTCACCCGAAATCATCAACAACAAATCCATTCATCATAC
TCGTCAATTCTGCGAGCGAATATGTTCCTGATTCGAGTTCCCTTTACCGTTTTTTTCCTCCTATTTTGATGAAAATCATAATCAAGACACAAGAAAAAACAGGAATGAAG
GGTATCATAATCGAGAGTATCTGCTGGGAGTGGGAGAAATTTGTTTGGTGGGGATCGTTGTCACATAGCGCTCGGGAGAGATCAATCCGCCATTGTTCCGTGATTTGCAT
GGCCAAAAGATACGCTCCCAACACTACAAAGAGAATGAGGTTGAGTAGGAAGAGGGGTGGTGATCCAAGCAAGAAAAAGAAGTCAAGGAGGAAAGGGGGAAAGAGAGATT
TCAAAATAGTTCGACTCTCCTCAGCTGCTGGGACTGGTTTTTTCTATGCAAAGAAGAAGAGCAGGAAGATGGCTGATAAGATTGAGCTCCAGAAATATGACCCTCTTGCA
AAACGTCATGTTCTGTTCACTGAAGTCAAATCCCTGCAGGCTTTTCCTTTGCATACTCTACTGGCCTCCGACCAGTTAGAAGCTCAAGAAGAACAACGCCAAAACTGTAA
ACATCACCCCTGCAGGTTGCTGTTAATGTCTGGCTGTATTCAGGAGGAATATAGCCCAAATAACCTTGAAGAGAAACAAGGTTCTTATGCTGTGCTCGAGAAAGAGCTTC
GACTTCAGCTTGGAATTCCCGTTCCATCTGACCACAATCTCCTCACTAACTTTGACGATCCAAGAGCTCCAGATAACCTGTCAGGCCTGTTAAATTCCTCGTCAAAATTG
TTGTTTCGATCTCCAACATCTTTCCTGGACATTTTGAGTAGAACCACTGCAAGGAGCAGGAGGATTGCTGCAGCTGCACCAACGAATGACAAGGATTGTCAATTTCCCCG
CAAAGTCCTGCATTACCCTCAAAGCTTGAGCTGGGAAAGCTTAAGAATTGTCCTCCATTTGGAATTGGCCCCTCCAAAGAATTGTTGGACAAATCCAAATAAAACAAATT
CTCCAGCTGACCAATCCAAGCGGGGATACTGCCATTTAAGTGATTCCAAGACAAATCAAGGATACTTAATTTCTTGCAACCTAGTAACCAACCAGGAATTGGTCCTCTCA
GACCACAGTTACCAAATGCCAAAAGCATTAAGTTGTCGAAACCAGTTACACTCTGTGGAATTTCCTCATTGCGAAAGTTCTTTGATAGAAATGACAGAGAGGAAAGCTTT
GCGTAATCTGGAGGAATTTGACCGATCTGGTAATCCAGAAAAATTGAGATGGATAGTACCCATTAAAGAATTGTTTCTAAGATCAAACACTCTGAGCTTTGAGCACAATG
ACAGAGATGAAGGCAATATTCCAGAGAACAAGTTAGAATGTGCAACTAACTCCTGTAGTTCTGAAAAATTACTAAACACATTTGGAAGTTCACCAGAAAACTGGTTTCCA
AATATTATAAAGGATTTGAGCCTAGAAAGCTTACTTAGTTCCATGCTTAACTGGCCCAAGAAGCTGTTTGCAGGGATTGAGAAGTACTCCAAAGATGATAATGAATACAA
AGAATCTGGAAGATTGCCGGTAACTTATTGTAGCTCAAATCCAAAACCTGGAGCTGCTTCAAGCTTGAAAATTCTGTTGGCAGTCCACCTTCAAGCTGATT
mRNA sequenceShow/hide mRNA sequence
ATGGCGACAACAATTCAAAGGCTTTGTTTGGACTTGGCGCTTTCTTGTGGGAAAAGGAACCCATTTCTCTGTAATTCCGCGTCTTCCACCCCTTCGAACGTTGTCACTTT
CAAGTCTCTTGCCTTCTCTGCATCCCTCTCGTTGACCCTCGTCTCCAACATCAAAGGTAATAGCTCACAATTTCACCCGAAATCATCAACAACAAATCCATTCATCATAC
TCGTCAATTCTGCGAGCGAATATGTTCCTGATTCGAGTTCCCTTTACCGTTTTTTTCCTCCTATTTTGATGAAAATCATAATCAAGACACAAGAAAAAACAGGAATGAAG
GGTATCATAATCGAGAGTATCTGCTGGGAGTGGGAGAAATTTGTTTGGTGGGGATCGTTGTCACATAGCGCTCGGGAGAGATCAATCCGCCATTGTTCCGTGATTTGCAT
GGCCAAAAGATACGCTCCCAACACTACAAAGAGAATGAGGTTGAGTAGGAAGAGGGGTGGTGATCCAAGCAAGAAAAAGAAGTCAAGGAGGAAAGGGGGAAAGAGAGATT
TCAAAATAGTTCGACTCTCCTCAGCTGCTGGGACTGGTTTTTTCTATGCAAAGAAGAAGAGCAGGAAGATGGCTGATAAGATTGAGCTCCAGAAATATGACCCTCTTGCA
AAACGTCATGTTCTGTTCACTGAAGTCAAATCCCTGCAGGCTTTTCCTTTGCATACTCTACTGGCCTCCGACCAGTTAGAAGCTCAAGAAGAACAACGCCAAAACTGTAA
ACATCACCCCTGCAGGTTGCTGTTAATGTCTGGCTGTATTCAGGAGGAATATAGCCCAAATAACCTTGAAGAGAAACAAGGTTCTTATGCTGTGCTCGAGAAAGAGCTTC
GACTTCAGCTTGGAATTCCCGTTCCATCTGACCACAATCTCCTCACTAACTTTGACGATCCAAGAGCTCCAGATAACCTGTCAGGCCTGTTAAATTCCTCGTCAAAATTG
TTGTTTCGATCTCCAACATCTTTCCTGGACATTTTGAGTAGAACCACTGCAAGGAGCAGGAGGATTGCTGCAGCTGCACCAACGAATGACAAGGATTGTCAATTTCCCCG
CAAAGTCCTGCATTACCCTCAAAGCTTGAGCTGGGAAAGCTTAAGAATTGTCCTCCATTTGGAATTGGCCCCTCCAAAGAATTGTTGGACAAATCCAAATAAAACAAATT
CTCCAGCTGACCAATCCAAGCGGGGATACTGCCATTTAAGTGATTCCAAGACAAATCAAGGATACTTAATTTCTTGCAACCTAGTAACCAACCAGGAATTGGTCCTCTCA
GACCACAGTTACCAAATGCCAAAAGCATTAAGTTGTCGAAACCAGTTACACTCTGTGGAATTTCCTCATTGCGAAAGTTCTTTGATAGAAATGACAGAGAGGAAAGCTTT
GCGTAATCTGGAGGAATTTGACCGATCTGGTAATCCAGAAAAATTGAGATGGATAGTACCCATTAAAGAATTGTTTCTAAGATCAAACACTCTGAGCTTTGAGCACAATG
ACAGAGATGAAGGCAATATTCCAGAGAACAAGTTAGAATGTGCAACTAACTCCTGTAGTTCTGAAAAATTACTAAACACATTTGGAAGTTCACCAGAAAACTGGTTTCCA
AATATTATAAAGGATTTGAGCCTAGAAAGCTTACTTAGTTCCATGCTTAACTGGCCCAAGAAGCTGTTTGCAGGGATTGAGAAGTACTCCAAAGATGATAATGAATACAA
AGAATCTGGAAGATTGCCGGTAACTTATTGTAGCTCAAATCCAAAACCTGGAGCTGCTTCAAGCTTGAAAATTCTGTTGGCAGTCCACCTTCAAGCTGATT
Protein sequenceShow/hide protein sequence
MATTIQRLCLDLALSCGKRNPFLCNSASSTPSNVVTFKSLAFSASLSLTLVSNIKGNSSQFHPKSSTTNPFIILVNSASEYVPDSSSLYRFFPPILMKIIIKTQEKTGMK
GIIIESICWEWEKFVWWGSLSHSARERSIRHCSVICMAKRYAPNTTKRMRLSRKRGGDPSKKKKSRRKGGKRDFKIVRLSSAAGTGFFYAKKKSRKMADKIELQKYDPLA
KRHVLFTEVKSLQAFPLHTLLASDQLEAQEEQRQNCKHHPCRLLLMSGCIQEEYSPNNLEEKQGSYAVLEKELRLQLGIPVPSDHNLLTNFDDPRAPDNLSGLLNSSSKL
LFRSPTSFLDILSRTTARSRRIAAAAPTNDKDCQFPRKVLHYPQSLSWESLRIVLHLELAPPKNCWTNPNKTNSPADQSKRGYCHLSDSKTNQGYLISCNLVTNQELVLS
DHSYQMPKALSCRNQLHSVEFPHCESSLIEMTERKALRNLEEFDRSGNPEKLRWIVPIKELFLRSNTLSFEHNDRDEGNIPENKLECATNSCSSEKLLNTFGSSPENWFP
NIIKDLSLESLLSSMLNWPKKLFAGIEKYSKDDNEYKESGRLPVTYCSSNPKPGAASSLKILLAVHLQADX