CuGenDBv2

MC11g0855 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC11g0855
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: 30S ribosomal protein S20
Genome location: MC11:7274003..7278857
RNA-Seq Expression: MC11g0855
Synteny: MC11g0855
Gene Ontology terms:
GO:0006412 - translation (biological process)
GO:0015935 - small ribosomal subunit (cellular component)
GO:0003735 - structural constituent of ribosome (molecular function)
GO:0070181 - small ribosomal subunit rRNA binding (molecular function)
InterPro domains:
IPR002583 - Ribosomal protein S20
IPR036510 - Ribosomal protein S20 superfamily
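
The genome location string can be split into a chromosome name and coordinates programmatically. The following is a minimal sketch, assuming the `chrom:start..end` layout shown in this record and 1-based inclusive coordinates (an assumption; the page does not state its coordinate convention). `parse_location` is a hypothetical helper name, not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("MC11:7274003..7278857")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene region spans 4855 bp on MC11.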


Homology
GenBank top hits (e value, % identity, alignment)
KAG6607892.1 30S ribosomal protein S20, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia] (e value: 1.39e-93; % identity: 85.87)
Query:  MAAAAITCFSISSKFRNLSLNASSSSFPASSSSSTLKSLTFSSNLSFCAFSNGCLSMSEAQRPLRYSVVCEAAPKKKVDSAAKRARQAEKRRIYNKARKS
        MAAA+ +CF++ SKF NLSLNASSS  P SSSS  LKSL+FSSN+S  AFSNGCLSMS AQRPLRYSVVCEAAP+KK DSAAKRARQAEKRR YNKARKS
Subjt:  MAAAAITCFSISSKFRNLSLNASSSSFPASSSSSTLKSLTFSSNLSFCAFSNGCLSMSEAQRPLRYSVVCEAAPKKKVDSAAKRARQAEKRRIYNKARKS

Query:  EIKTRMKKVLEALDDLKKKPEAQSEEVLSIEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARRKKAVEIHHGWYAPASPAAA
        EI+TRMKK +EALDDLKKKPEAQSEEVL IEKLIAEAYSVIDKAVKVGTLHRNTAA RKSRLARRKKAVEIHHGWY PASPAAA
Subjt:  EIKTRMKKVLEALDDLKKKPEAQSEEVLSIEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARRKKAVEIHHGWYAPASPAAA

XP_022139608.1 30S ribosomal protein S20, chloroplastic [Momordica charantia] (e value: 7.18e-112; % identity: 100)
Query:  MAAAAITCFSISSKFRNLSLNASSSSFPASSSSSTLKSLTFSSNLSFCAFSNGCLSMSEAQRPLRYSVVCEAAPKKKVDSAAKRARQAEKRRIYNKARKS
        MAAAAITCFSISSKFRNLSLNASSSSFPASSSSSTLKSLTFSSNLSFCAFSNGCLSMSEAQRPLRYSVVCEAAPKKKVDSAAKRARQAEKRRIYNKARKS
Subjt:  MAAAAITCFSISSKFRNLSLNASSSSFPASSSSSTLKSLTFSSNLSFCAFSNGCLSMSEAQRPLRYSVVCEAAPKKKVDSAAKRARQAEKRRIYNKARKS

Query:  EIKTRMKKVLEALDDLKKKPEAQSEEVLSIEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARRKKAVEIHHGWYAPASPAAA
        EIKTRMKKVLEALDDLKKKPEAQSEEVLSIEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARRKKAVEIHHGWYAPASPAAA
Subjt:  EIKTRMKKVLEALDDLKKKPEAQSEEVLSIEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARRKKAVEIHHGWYAPASPAAA

XP_022976587.1 30S ribosomal protein S20, chloroplastic-like [Cucurbita maxima] (e value: 9.75e-94; % identity: 86.89)
Query:  AAAAITCFSISSKFRNLSLNASSSSFPASSSSSTLKSLTFSSNLSFCAFSNGCLSMSEAQRPLRYSVVCEAAPKKKVDSAAKRARQAEKRRIYNKARKSE
        A A I+CF++ SKFRNLSLNASSSS P S SSSTL+SL+FSSN+S  AFSNGCLS+S+AQRP RYSVVCEAAPKK  DSAAKRARQAEKRR+YNKARKSE
Subjt:  AAAAITCFSISSKFRNLSLNASSSSFPASSSSSTLKSLTFSSNLSFCAFSNGCLSMSEAQRPLRYSVVCEAAPKKKVDSAAKRARQAEKRRIYNKARKSE

Query:  IKTRMKKVLEALDDLKKKPEAQSEEVLSIEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARRKKAVEIHHGWYAPASPAAA
        IKTRMKKV+EALDDLKKKPEAQSEEVL IEKLIAEA+SVIDKAVKVGTLHRNTAARRKSRLARRKKAVEIHHGWY PASPAAA
Subjt:  IKTRMKKVLEALDDLKKKPEAQSEEVLSIEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARRKKAVEIHHGWYAPASPAAA

XP_023534903.1 30S ribosomal protein S20, chloroplastic-like [Cucurbita pepo subsp. pepo] (e value: 1.39e-93; % identity: 86.89)
Query:  AAAAITCFSISSKFRNLSLNASSSSFPASSSSSTLKSLTFSSNLSFCAFSNGCLSMSEAQRPLRYSVVCEAAPKKKVDSAAKRARQAEKRRIYNKARKSE
        A A I+CFS+ SKFRNLSLNASSSS P S SSSTL+SL+FSSN+S  AFSNGCLS+S+AQRP RYSVVCEAAPKK  DSAAKRARQAEKRR+YNKARKSE
Subjt:  AAAAITCFSISSKFRNLSLNASSSSFPASSSSSTLKSLTFSSNLSFCAFSNGCLSMSEAQRPLRYSVVCEAAPKKKVDSAAKRARQAEKRRIYNKARKSE

Query:  IKTRMKKVLEALDDLKKKPEAQSEEVLSIEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARRKKAVEIHHGWYAPASPAAA
        IKTRMKKV+EALDDLKKKPE QSEEVL IEKLIAEA+SVIDKAVKVGTLHRNTAARRKSRLARRKKAVEIHHGWY PASPAAA
Subjt:  IKTRMKKVLEALDDLKKKPEAQSEEVLSIEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARRKKAVEIHHGWYAPASPAAA

XP_038899721.1 30S ribosomal protein S20, chloroplastic [Benincasa hispida] (e value: 2.79e-93; % identity: 85.87)
Query:  MAAAAITCFSISSKFRNLSLNASSSSFPASSSSSTLKSLTFSSNLSFCAFSNGCLSMSEAQRPLRYSVVCEAAPKKKVDSAAKRARQAEKRRIYNKARKS
        MAA A++CFS+ SKFRNLSLNASSS  PAS SS TL+SL+FSSN S  AFSNGCL+MS+AQRP RYSVVCEAAPK K DSAAKR RQAEKRRIYNKARKS
Subjt:  MAAAAITCFSISSKFRNLSLNASSSSFPASSSSSTLKSLTFSSNLSFCAFSNGCLSMSEAQRPLRYSVVCEAAPKKKVDSAAKRARQAEKRRIYNKARKS

Query:  EIKTRMKKVLEALDDLKKKPEAQSEEVLSIEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARRKKAVEIHHGWYAPASPAAA
        EIKTR+KKVLEALD LKKKPEAQSEEVL IEKLIAEAYSVIDKAV+VGTLHRNTAARRKSRLARRKKAVEIHHGWY P SPAAA
Subjt:  EIKTRMKKVLEALDDLKKKPEAQSEEVLSIEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARRKKAVEIHHGWYAPASPAAA

TrEMBL top hits (e value, % identity, alignment)
A0A6J1CG14 30S ribosomal protein S20, chloroplastic (e value: 3.48e-112; % identity: 100)
Query:  MAAAAITCFSISSKFRNLSLNASSSSFPASSSSSTLKSLTFSSNLSFCAFSNGCLSMSEAQRPLRYSVVCEAAPKKKVDSAAKRARQAEKRRIYNKARKS
        MAAAAITCFSISSKFRNLSLNASSSSFPASSSSSTLKSLTFSSNLSFCAFSNGCLSMSEAQRPLRYSVVCEAAPKKKVDSAAKRARQAEKRRIYNKARKS
Subjt:  MAAAAITCFSISSKFRNLSLNASSSSFPASSSSSTLKSLTFSSNLSFCAFSNGCLSMSEAQRPLRYSVVCEAAPKKKVDSAAKRARQAEKRRIYNKARKS

Query:  EIKTRMKKVLEALDDLKKKPEAQSEEVLSIEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARRKKAVEIHHGWYAPASPAAA
        EIKTRMKKVLEALDDLKKKPEAQSEEVLSIEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARRKKAVEIHHGWYAPASPAAA
Subjt:  EIKTRMKKVLEALDDLKKKPEAQSEEVLSIEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARRKKAVEIHHGWYAPASPAAA

A0A6J1F6F1 30S ribosomal protein S20, chloroplastic-like (e value: 1.57e-92; % identity: 86.34)
Query:  AAAAITCFSISSKFRNLSLNASSSSFPASSSSSTLKSLTFSSNLSFCAFSNGCLSMSEAQRPLRYSVVCEAAPKKKVDSAAKRARQAEKRRIYNKARKSE
        A A I+CF++ SKFRNLSLNASSSS   S SSSTL+SL+FSSN+S  AFSNGCLS+S+AQRP RYSVVCEAAPKK  DSAAKRARQAEKRR+YNKARKSE
Subjt:  AAAAITCFSISSKFRNLSLNASSSSFPASSSSSTLKSLTFSSNLSFCAFSNGCLSMSEAQRPLRYSVVCEAAPKKKVDSAAKRARQAEKRRIYNKARKSE

Query:  IKTRMKKVLEALDDLKKKPEAQSEEVLSIEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARRKKAVEIHHGWYAPASPAAA
        IKTRMKKV+EALDDLKKKPEAQSEEVL IEKLIAEA+SVIDKAVKVGTLHRNTAARRKSRLARRKKAVEIHHGWY PASPAAA
Subjt:  IKTRMKKVLEALDDLKKKPEAQSEEVLSIEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARRKKAVEIHHGWYAPASPAAA

A0A6J1FKZ1 30S ribosomal protein S20, chloroplastic (e value: 3.87e-93; % identity: 85.33)
Query:  MAAAAITCFSISSKFRNLSLNASSSSFPASSSSSTLKSLTFSSNLSFCAFSNGCLSMSEAQRPLRYSVVCEAAPKKKVDSAAKRARQAEKRRIYNKARKS
        MAAA+ +CF++ SKF NLSLNAS S  P SSSS  LKSL+FSSN+S  AFSNGCLSMS AQRPLRYSVVCEAAP+KK DSAAKRARQAEKRR YNKARKS
Subjt:  MAAAAITCFSISSKFRNLSLNASSSSFPASSSSSTLKSLTFSSNLSFCAFSNGCLSMSEAQRPLRYSVVCEAAPKKKVDSAAKRARQAEKRRIYNKARKS

Query:  EIKTRMKKVLEALDDLKKKPEAQSEEVLSIEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARRKKAVEIHHGWYAPASPAAA
        EI+TRMKK +EALDDLKKKPEAQSEEVL IEKLIAEAYSVIDKAVKVGTLHRNTAA RKSRLARRKKAVEIHHGWY PASPAAA
Subjt:  EIKTRMKKVLEALDDLKKKPEAQSEEVLSIEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARRKKAVEIHHGWYAPASPAAA

A0A6J1IG57 30S ribosomal protein S20, chloroplastic-like (e value: 4.72e-94; % identity: 86.89)
Query:  AAAAITCFSISSKFRNLSLNASSSSFPASSSSSTLKSLTFSSNLSFCAFSNGCLSMSEAQRPLRYSVVCEAAPKKKVDSAAKRARQAEKRRIYNKARKSE
        A A I+CF++ SKFRNLSLNASSSS P S SSSTL+SL+FSSN+S  AFSNGCLS+S+AQRP RYSVVCEAAPKK  DSAAKRARQAEKRR+YNKARKSE
Subjt:  AAAAITCFSISSKFRNLSLNASSSSFPASSSSSTLKSLTFSSNLSFCAFSNGCLSMSEAQRPLRYSVVCEAAPKKKVDSAAKRARQAEKRRIYNKARKSE

Query:  IKTRMKKVLEALDDLKKKPEAQSEEVLSIEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARRKKAVEIHHGWYAPASPAAA
        IKTRMKKV+EALDDLKKKPEAQSEEVL IEKLIAEA+SVIDKAVKVGTLHRNTAARRKSRLARRKKAVEIHHGWY PASPAAA
Subjt:  IKTRMKKVLEALDDLKKKPEAQSEEVLSIEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARRKKAVEIHHGWYAPASPAAA

A0A6J1IVC5 30S ribosomal protein S20, chloroplastic-like (e value: 2.24e-92; % identity: 84.24)
Query:  MAAAAITCFSISSKFRNLSLNASSSSFPASSSSSTLKSLTFSSNLSFCAFSNGCLSMSEAQRPLRYSVVCEAAPKKKVDSAAKRARQAEKRRIYNKARKS
        MAAA+ +CF++ SKF NLSLNASSS  P SSSS  L+SL+FSSN+S  AFS+GCLSMS AQRPLRYSVVCEAAP+KK DSAAKRARQAEKRR YNKARKS
Subjt:  MAAAAITCFSISSKFRNLSLNASSSSFPASSSSSTLKSLTFSSNLSFCAFSNGCLSMSEAQRPLRYSVVCEAAPKKKVDSAAKRARQAEKRRIYNKARKS

Query:  EIKTRMKKVLEALDDLKKKPEAQSEEVLSIEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARRKKAVEIHHGWYAPASPAAA
        EI+TRMKK +EALDDLKKKPEAQSEEVL +EKLIAEAYSVIDKAVKVGTLHRNTAA RKSRLARRKKAVEIHHGWY PASPAAA
Subjt:  EIKTRMKKVLEALDDLKKKPEAQSEEVLSIEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARRKKAVEIHHGWYAPASPAAA

SwissProt top hits (e value, % identity, alignment)
B0C2C8 30S ribosomal protein S20 (e value: 5.6e-10; % identity: 48.35)
Query:  VDSAAKRARQAEKRRIYNKARKSEIKTRMKKVLEALDDLKKKPEAQSEEVLSIEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARRKKA
        + SA KR   AE+ R+ NKA KS IKT  K+   A+DD K  P    +++ +I+  ++  YS IDKAVKVG  HRNT AR+K+ LAR  KA
Subjt:  VDSAAKRARQAEKRRIYNKARKSEIKTRMKKVLEALDDLKKKPEAQSEEVLSIEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARRKKA

B1XHW6 30S ribosomal protein S20 (e value: 2.1e-09; % identity: 44.44)
Query:  VDSAAKRARQAEKRRIYNKARKSEIKTRMKKVLEALDDLKKKPEAQSEEVLSIEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARRKK
        + SA KR + AE+ R+ NK+ KS +KT  KK L+A++     P A ++     +K ++  YS IDKAVK G  H NTAAR+K+RLA+  K
Subjt:  VDSAAKRARQAEKRRIYNKARKSEIKTRMKKVLEALDDLKKKPEAQSEEVLSIEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARRKK

B7KG02 30S ribosomal protein S20 (e value: 9.6e-10; % identity: 44.57)
Query:  VDSAAKRARQAEKRRIYNKARKSEIKTRMKKVLEALDDLKKKPEAQSEEVLSIEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARRKKAV
        + SA KR + AE+ R+ NK+ KS +KT MKK   A++D K  P  ++++  ++++ ++ AYS IDKAVK   LHRN  AR+KS+LA   K V
Subjt:  VDSAAKRARQAEKRRIYNKARKSEIKTRMKKVLEALDDLKKKPEAQSEEVLSIEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARRKKAV

P82130 30S ribosomal protein S20, chloroplastic (e value: 6.4e-46; % identity: 65.36)
Query:  MAAAAITCFSISSKFRNLSLNASSSSFPASSSSSTLKSLTFSSNLSFCAFSNGCLSMSEAQRPLRYSVVCEAAPKKKVDSAAKRARQAEKRRIYNKARKS
        MA  +  C SISSK  NLS   SS+ FP    +S+LK LTFS+NLS   FS GC S+   QR   +SVVCE A  KK DSAAKR RQAE RR+ NKARKS
Subjt:  MAAAAITCFSISSKFRNLSLNASSSSFPASSSSSTLKSLTFSSNLSFCAFSNGCLSMSEAQRPLRYSVVCEAAPKKKVDSAAKRARQAEKRRIYNKARKS

Query:  EIKTRMKKVLEALDDLKKKPEAQSEEVLSIEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARRKKAVEIHHGWYAPA
        E+KTRM+KV EALD LKKK  A +EE++ I+ LIAEAYS IDKAV  GTLHRNTAARRKSRLAR KK VEIHHGWY P+
Subjt:  EIKTRMKKVLEALDDLKKKPEAQSEEVLSIEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARRKKAVEIHHGWYAPA

Q9ASV6 30S ribosomal protein S20, chloroplastic (e value: 1.4e-45; % identity: 61.41)
Query:  TCFSISSKFRNLSLNASSSSFPASSSSSTL-KSLTFSSNLSFCAFSNGCLSMSE----AQRPLRYSVVCE-AAPKKKVDSAAKRARQAEKRRIYNKARKS
        +C ++ S+F+ LSL   S S P+SS S+    S T SS+LSF    + C++ S      Q+P+R  +VCE AAP KK DSAAKRARQAEKRR+YNK++KS
Subjt:  TCFSISSKFRNLSLNASSSSFPASSSSSTL-KSLTFSSNLSFCAFSNGCLSMSE----AQRPLRYSVVCE-AAPKKKVDSAAKRARQAEKRRIYNKARKS

Query:  EIKTRMKKVLEALDDLKKKPEAQSEEVLSIEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARRKKAVEIHHGWYAPASPAAA
        E +TRMKKVLEAL+ LKKK +AQ++E++++EKLI EAYS IDKAVKV  LH+NT ARRKSRLARRKKAVEIHHGWY P + AAA
Subjt:  EIKTRMKKVLEALDDLKKKPEAQSEEVLSIEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARRKKAVEIHHGWYAPASPAAA

Arabidopsis top hits (e value, % identity, alignment)
AT3G15190.1 chloroplast 30S ribosomal protein S20, putative (e value: 1.0e-46; % identity: 61.41)
Query:  TCFSISSKFRNLSLNASSSSFPASSSSSTL-KSLTFSSNLSFCAFSNGCLSMSE----AQRPLRYSVVCE-AAPKKKVDSAAKRARQAEKRRIYNKARKS
        +C ++ S+F+ LSL   S S P+SS S+    S T SS+LSF    + C++ S      Q+P+R  +VCE AAP KK DSAAKRARQAEKRR+YNK++KS
Subjt:  TCFSISSKFRNLSLNASSSSFPASSSSSTL-KSLTFSSNLSFCAFSNGCLSMSE----AQRPLRYSVVCE-AAPKKKVDSAAKRARQAEKRRIYNKARKS

Query:  EIKTRMKKVLEALDDLKKKPEAQSEEVLSIEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARRKKAVEIHHGWYAPASPAAA
        E +TRMKKVLEAL+ LKKK +AQ++E++++EKLI EAYS IDKAVKV  LH+NT ARRKSRLARRKKAVEIHHGWY P + AAA
Subjt:  EIKTRMKKVLEALDDLKKKPEAQSEEVLSIEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARRKKAVEIHHGWYAPASPAAA
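
The % identity values listed for each hit can be related to the alignment midline (the unlabeled line between Query and Subjt). The sketch below assumes a BLAST-style midline, where a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap; that convention is an assumption about how this page renders alignments. `percent_identity` is a hypothetical helper, and the query and midline strings are copied from the B0C2C8 SwissProt hit above.

```python
def percent_identity(midline: str, aligned_length: int) -> float:
    """Return identical positions / alignment length, as a percentage.

    Assumes a BLAST-style midline: letters are identities, '+' marks a
    conservative substitution, and spaces mark mismatches or gaps.
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / aligned_length


# Query and midline copied from the B0C2C8 hit.
query = (
    "VDSAAKRARQAEKRRIYNKARKSEIKTRMKKVLEALDDLKKKPEAQSEEVLSIEKLIAEAYSVIDKAVKVG"
    "TLHRNTAARRKSRLARRKKA"
)
midline = (
    "+ SA KR   AE+ R+ NKA KS IKT  K+   A+DD K  P    +++ +I+  ++  YS IDKAVKVG"
    "  HRNT AR+K+ LAR  KA"
)

# This alignment contains no gaps, so the query length is the alignment length.
identity = round(percent_identity(midline, len(query)), 2)
print(identity)
```

For this hit the midline has 44 identical positions over 91 aligned residues, which gives 48.35 and agrees with the listed value.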


Sequences
CDS sequence
ATGGCGGCAGCTGCAATTACTTGCTTCTCTATCTCTTCTAAATTCAGAAATCTTTCTCTCAATGCTTCTTCCTCCTCTTTCCCCGCTTCCTCTTCCTCTTCAACCCTCAA
ATCTCTCACTTTCTCCTCCAATCTTTCATTCTGTGCCTTTTCCAATGGGTGTCTGTCGATGAGTGAAGCTCAGAGGCCACTTCGATACTCTGTTGTGTGCGAGGCGGCTC
CTAAGAAAAAGGTTGATTCTGCTGCAAAGAGGGCTCGCCAGGCTGAGAAAAGGCGTATTTACAACAAAGCCCGGAAGTCTGAAATCAAAACCAGGATGAAGAAGGTTTTG
GAAGCTCTAGATGATCTCAAGAAGAAACCTGAAGCACAGTCAGAGGAAGTCCTTTCAATTGAGAAGCTCATTGCAGAGGCATACTCAGTGATCGACAAAGCCGTGAAAGT
GGGAACATTGCACAGAAACACTGCTGCACGTCGGAAATCTCGACTTGCCAGAAGAAAGAAAGCCGTAGAAATCCACCATGGCTGGTACGCCCCTGCTTCACCTGCAGCTG
CCTGA
mRNA sequence
ATCCGAATAACATCCTATCGGGATTAAATTAAAGCGGAACCCGGATGGGGGTTATATAAAACTCGAATCCAAACAGCCAAAATAACAGAGGATAAAACGATACTCCAAAA
AACTCTGCTTCCACTTCCATTCCAGTGGATAGCGCCAGCAAAGGTGCAAGAAGAGAACCACCATTCTTCAACCTCTGCAGCTTTCACAAAAATGGCGGCAGCTGCAATTA
CTTGCTTCTCTATCTCTTCTAAATTCAGAAATCTTTCTCTCAATGCTTCTTCCTCCTCTTTCCCCGCTTCCTCTTCCTCTTCAACCCTCAAATCTCTCACTTTCTCCTCC
AATCTTTCATTCTGTGCCTTTTCCAATGGGTGTCTGTCGATGAGTGAAGCTCAGAGGCCACTTCGATACTCTGTTGTGTGCGAGGCGGCTCCTAAGAAAAAGGTTGATTC
TGCTGCAAAGAGGGCTCGCCAGGCTGAGAAAAGGCGTATTTACAACAAAGCCCGGAAGTCTGAAATCAAAACCAGGATGAAGAAGGTTTTGGAAGCTCTAGATGATCTCA
AGAAGAAACCTGAAGCACAGTCAGAGGAAGTCCTTTCAATTGAGAAGCTCATTGCAGAGGCATACTCAGTGATCGACAAAGCCGTGAAAGTGGGAACATTGCACAGAAAC
ACTGCTGCACGTCGGAAATCTCGACTTGCCAGAAGAAAGAAAGCCGTAGAAATCCACCATGGCTGGTACGCCCCTGCTTCACCTGCAGCTGCCTGATGTCGACGGGGTTG
AGGAATGAATGGGAAGATCATTCTATTCTTTTTTGACGTAGATAATGGCAACTTTTTTCATTTTTCCATCATCTGTTTCTTTTATTTTTGTATTTAGTTCGGACCTTTTA
TAATTGAAGTTTTAATTCTATTGCATTTGCTCTTTGAAATGACATACTTCTAATTCTTGAAAATACAATAAGCATTGAACTGTATAACTCATTTACAAATAGTTCTCATC
TCCAAATTTATAGGTCTGAAATAGATCTTATGACAATTCTAAAACGTCCCATCAGGTTGGAGTTCATTTTGAATCTCTAGAAGAAATTACTCAGAGCTAAATAATACAGA
AATTACCCAAAAAAAAAAAGAGGAACAGAGCTGATCCCCTCTAGGTTTACCAAGTTTGTAATCTTGTGAAAGAAGATGGTTTAATTTTCACTCTCCTTTGTTATCTTCCT
TTGGCTTTCCTTTCTTGCAGTTTGATTTCATGCCATTCACCAACTTTTCAGTCCTTGGCTCAAGCATCATTCCCTTCAATGTCATCTCTCTGAAATAGCGCAACCCGTCC
TCTTTTCGTCCCTTCTCGTACAACCCATGAATCATTATAGTGTAAGAGCGACGGTCCGGTCCCAACCCCATCTGTTCCATTTCATTCCAAGTAGATTTAACCCTTTCCTG
AATGTCCCAGTCCATGTACAATCTCAAAACCAGGTTGTATGTGTCAATGGTCATCTTGCAGCCACTTCTTTCCATCCGCTGCAAGAGTACAGTAACTTCCTCCGGTTCTC
TCGACGACCCAACTAAATAGCTGAAAGTCACTGAATTCGGCCAACAGCTCTCTTTTCCCTTCTCCATTTCATCCAAAAGCTCATTAACCTTCTCCATCCTCCTAATCTTA
CACAAGTGTTTTATGAGAGTGTTGTAAGTCGCTACATTTGGAACACAACCTCTCTCATTCATTTCGCTGAAGATCTCCAGAGCTTCAGGAATCCTTTTCTTGAAACAAAG
TGCATCAATGATGCAGTTGCAGATCACCACATCTGGTTTCAAACCCCTTCCCCACATGGCTCTAAACAACTTCAGAGCTGTCCCCAACTTTCCCTTCTTTGTCAATGAGT
TTATCAAAGTCCCATAAGTGTATATATCAGGCTCGCATTTTGATTCGATTATCTCGCGCCAAAACCGCTTGGCCTCATGAACATTCCCCAGTATACACCAACCATTGAGG
ACGATGTTCCTCGTCTTTATATCTGGATGAAACTCGTGCTTCTTGGAGTGCAGCAATGTCTCTGCAACTTCTACATGCTTGTATCTGCATAACCACATTAAAAGTGACTG
AAACGCAACCAAATTCATCTCCAGCCCGAGTTCTTGCCGTTTGTGGAAGATCCCAATTGCTTCCTCCACTTTATGTGCTGCAGCATATCTATTAAGAAGAACTGAATAAG
TTCCCTCGTTTATGAGTTCTTTTCTTTGTGACATTTCTACAAACACTTCGTCTACTTCGTCGAATCGTCGGAATTTTCCAAGAATGTCGAGAATCTCGTTGTAAATAACA
GAACCAGGGGAGTATTCATCTTCTCCACCTCCTTTCGAAACCCAATTGAAGAAAATGAAGGCGGGTTTCCAATCCGATCGATGCCTCCCCAGCACATTGAGAACAAAATC
ATCGGTTAACACAAGCCCGCACAGATTAAGAGCCCTCTCGATCTCCTCCACTGGTTTATCTCTTCTAAATTTGAGGACGTTTTGGACATAAACAGCAGATTGGTCATCTC
GAAGAACATCGATCGATTCATCTGAATCAGCAACCGAGTTCGAAACCCTAGTTCTTCCATCCACTTGTGGGTGATGAACTCCATGGTTTCTGGGCCGAGGTCCGAGATTT
GGGTCCTCGCGACCACGGAAATGATGAATCTGGTCGGAGACGCCGGAGAAGCGCCGGTGTGAACCGGGAGTGTGAGGAAGAAGATATTTTATCAGGTTTGATCGGAACCC
TAATTCGAATATTGGCTGGAAATTTTGCCGCGTAATCTTCCTCCACAATGGGAATTTGAACCTCATCTCGAATCCTTCGATTACAAATCGTTCTGCTTTTGACACTCATT
ATGATCCATAATCCATTCCTTTAAAAATAGAAGAGTATTTCTCTTTCTACGTTTCTCTCACGAATCTGTCAAAACTTTAGTAGTCAGTCCCTAAGAACTGAAGTTTGACT
CATTATATTGGTCTCAAACGTTTGAA
Protein sequence
MAAAAITCFSISSKFRNLSLNASSSSFPASSSSSTLKSLTFSSNLSFCAFSNGCLSMSEAQRPLRYSVVCEAAPKKKVDSAAKRARQAEKRRIYNKARKSEIKTRMKKVL
EALDDLKKKPEAQSEEVLSIEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARRKKAVEIHHGWYAPASPAAA
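
The CDS and protein sequences above can be checked against each other by translating the CDS. This is a minimal sketch using the standard genetic code and no external libraries; `translate` is a hypothetical helper name, and the sequences are the CDS and protein lines of this record joined into single strings.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


CDS = (
    "ATGGCGGCAGCTGCAATTACTTGCTTCTCTATCTCTTCTAAATTCAGAAATCTTTCTCTCAATGCTTCTTCCTCCTCTTTCCCCGCTTCCTCTTCCTCTTCAACCCTCAA"
    "ATCTCTCACTTTCTCCTCCAATCTTTCATTCTGTGCCTTTTCCAATGGGTGTCTGTCGATGAGTGAAGCTCAGAGGCCACTTCGATACTCTGTTGTGTGCGAGGCGGCTC"
    "CTAAGAAAAAGGTTGATTCTGCTGCAAAGAGGGCTCGCCAGGCTGAGAAAAGGCGTATTTACAACAAAGCCCGGAAGTCTGAAATCAAAACCAGGATGAAGAAGGTTTTG"
    "GAAGCTCTAGATGATCTCAAGAAGAAACCTGAAGCACAGTCAGAGGAAGTCCTTTCAATTGAGAAGCTCATTGCAGAGGCATACTCAGTGATCGACAAAGCCGTGAAAGT"
    "GGGAACATTGCACAGAAACACTGCTGCACGTCGGAAATCTCGACTTGCCAGAAGAAAGAAAGCCGTAGAAATCCACCATGGCTGGTACGCCCCTGCTTCACCTGCAGCTG"
    "CCTGA"
)
PROTEIN = (
    "MAAAAITCFSISSKFRNLSLNASSSSFPASSSSSTLKSLTFSSNLSFCAFSNGCLSMSEAQRPLRYSVVCEAAPKKKVDSAAKRARQAEKRRIYNKARKSEIKTRMKKVL"
    "EALDDLKKKPEAQSEEVLSIEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARRKKAVEIHHGWYAPASPAAA"
)

# True when the translated CDS reproduces the listed protein sequence.
print(translate(CDS) == PROTEIN)
```

The same helper could be pointed at the mRNA sequence after locating the ATG start of the CDS within it, which would also give the lengths of the untranslated regions.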