; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0021483 (gene) of Snake gourd v1 genome

Gene IDTan0021483
OrganismTrichosanthes anguina (Snake gourd v1)
Description30S ribosomal protein S20
Genome locationLG01:108634114..108637182
RNA-Seq ExpressionTan0021483
SyntenyTan0021483
Gene Ontology termsGO:0006412 - translation (biological process)
GO:0015935 - small ribosomal subunit (cellular component)
GO:0003735 - structural constituent of ribosome (molecular function)
GO:0070181 - small ribosomal subunit rRNA binding (molecular function)
InterPro domainsIPR002583 - Ribosomal protein S20
IPR036510 - Ribosomal protein S20 superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6607892.1 30S ribosomal protein S20, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]4.6e-7892.39Show/hide
Query:  MAAASISCFALPSKFRNLSLNASSSCLPACPSSSTLKSLSFSSSISVSAFSNGCLSMSKAQRPLRYSVVCEAAPQKKADSAAKRARQAEKRRIYNKARKS
        MAAAS SCFALPSKF NLSLNASSSCLP   SS  LKSLSFSS+ISVSAFSNGCLSMS AQRPLRYSVVCEAAPQKKADSAAKRARQAEKRR YNKARKS
Subjt:  MAAASISCFALPSKFRNLSLNASSSCLPACPSSSTLKSLSFSSSISVSAFSNGCLSMSKAQRPLRYSVVCEAAPQKKADSAAKRARQAEKRRIYNKARKS

Query:  EIKTRIKKALEALDDLKKKPEAQSEEVLPIEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARRKKAVEIHHGWYTPASPAAA
        EI+TR+KKA+EALDDLKKKPEAQSEEVLPIEKLIAEAYSVIDKAVKVGTLHRNTAA RKSRLARRKKAVEIHHGWYTPASPAAA
Subjt:  EIKTRIKKALEALDDLKKKPEAQSEEVLPIEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARRKKAVEIHHGWYTPASPAAA

XP_022941386.1 30S ribosomal protein S20, chloroplastic [Cucurbita moschata]1.8e-7791.85Show/hide
Query:  MAAASISCFALPSKFRNLSLNASSSCLPACPSSSTLKSLSFSSSISVSAFSNGCLSMSKAQRPLRYSVVCEAAPQKKADSAAKRARQAEKRRIYNKARKS
        MAAAS SCFALPSKF NLSLNAS SCLP   SS  LKSLSFSS+ISVSAFSNGCLSMS AQRPLRYSVVCEAAPQKKADSAAKRARQAEKRR YNKARKS
Subjt:  MAAASISCFALPSKFRNLSLNASSSCLPACPSSSTLKSLSFSSSISVSAFSNGCLSMSKAQRPLRYSVVCEAAPQKKADSAAKRARQAEKRRIYNKARKS

Query:  EIKTRIKKALEALDDLKKKPEAQSEEVLPIEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARRKKAVEIHHGWYTPASPAAA
        EI+TR+KKA+EALDDLKKKPEAQSEEVLPIEKLIAEAYSVIDKAVKVGTLHRNTAA RKSRLARRKKAVEIHHGWYTPASPAAA
Subjt:  EIKTRIKKALEALDDLKKKPEAQSEEVLPIEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARRKKAVEIHHGWYTPASPAAA

XP_022981947.1 30S ribosomal protein S20, chloroplastic-like [Cucurbita maxima]6.7e-7790.22Show/hide
Query:  MAAASISCFALPSKFRNLSLNASSSCLPACPSSSTLKSLSFSSSISVSAFSNGCLSMSKAQRPLRYSVVCEAAPQKKADSAAKRARQAEKRRIYNKARKS
        MAAAS SCFALPSKF NLSLNASSSCLP   SS  L+SLSFSS++SVSAFS+GCLSMS AQRPLRYSVVCEAAPQKKADSAAKRARQAEKRR YNKARKS
Subjt:  MAAASISCFALPSKFRNLSLNASSSCLPACPSSSTLKSLSFSSSISVSAFSNGCLSMSKAQRPLRYSVVCEAAPQKKADSAAKRARQAEKRRIYNKARKS

Query:  EIKTRIKKALEALDDLKKKPEAQSEEVLPIEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARRKKAVEIHHGWYTPASPAAA
        EI+TR+KKA+EALDDLKKKPEAQSEEVLP+EKLIAEAYSVIDKAVKVGTLHRNTAA RKSRLARRKKAVEIHHGWYTPASPAAA
Subjt:  EIKTRIKKALEALDDLKKKPEAQSEEVLPIEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARRKKAVEIHHGWYTPASPAAA

XP_023525527.1 30S ribosomal protein S20, chloroplastic [Cucurbita pepo subsp. pepo]2.0e-7690.76Show/hide
Query:  MAAASISCFALPSKFRNLSLNASSSCLPACPSSSTLKSLSFSSSISVSAFSNGCLSMSKAQRPLRYSVVCEAAPQKKADSAAKRARQAEKRRIYNKARKS
        MAAAS SCFALPSKF NLSLNASSSCLP   SS  LK LSFSS+ISVSAFS+GCLSMS AQRPLRYSVVCEAAPQKKADSAAKRARQAEKRR YNKARKS
Subjt:  MAAASISCFALPSKFRNLSLNASSSCLPACPSSSTLKSLSFSSSISVSAFSNGCLSMSKAQRPLRYSVVCEAAPQKKADSAAKRARQAEKRRIYNKARKS

Query:  EIKTRIKKALEALDDLKKKPEAQSEEVLPIEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARRKKAVEIHHGWYTPASPAAA
        EI+TR+KKA+EALDDLKKKPEAQSEEVLPIEKLIAEAYSVIDKAVKVGTLHRNTAA RKSRLARRKKAVEIHHGWYTP SPAAA
Subjt:  EIKTRIKKALEALDDLKKKPEAQSEEVLPIEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARRKKAVEIHHGWYTPASPAAA

XP_038899721.1 30S ribosomal protein S20, chloroplastic [Benincasa hispida]6.7e-7789.67Show/hide
Query:  MAAASISCFALPSKFRNLSLNASSSCLPACPSSSTLKSLSFSSSISVSAFSNGCLSMSKAQRPLRYSVVCEAAPQKKADSAAKRARQAEKRRIYNKARKS
        MAA ++SCF+LPSKFRNLSLNASSSC+PA PSS TL+SLSFSS+ SVSAFSNGCL+MSKAQRP RYSVVCEAAP+ KADSAAKR RQAEKRRIYNKARKS
Subjt:  MAAASISCFALPSKFRNLSLNASSSCLPACPSSSTLKSLSFSSSISVSAFSNGCLSMSKAQRPLRYSVVCEAAPQKKADSAAKRARQAEKRRIYNKARKS

Query:  EIKTRIKKALEALDDLKKKPEAQSEEVLPIEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARRKKAVEIHHGWYTPASPAAA
        EIKTRIKK LEALD LKKKPEAQSEEVLPIEKLIAEAYSVIDKAV+VGTLHRNTAARRKSRLARRKKAVEIHHGWYTP SPAAA
Subjt:  EIKTRIKKALEALDDLKKKPEAQSEEVLPIEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARRKKAVEIHHGWYTPASPAAA

TrEMBL top hitse value%identityAlignment
A0A6J1CG14 30S ribosomal protein S20, chloroplastic5.2e-7588.59Show/hide
Query:  MAAASISCFALPSKFRNLSLNASSSCLPACPSSSTLKSLSFSSSISVSAFSNGCLSMSKAQRPLRYSVVCEAAPQKKADSAAKRARQAEKRRIYNKARKS
        MAAA+I+CF++ SKFRNLSLNASSS  PA  SSSTLKSL+FSS++S  AFSNGCLSMS+AQRPLRYSVVCEAAP+KK DSAAKRARQAEKRRIYNKARKS
Subjt:  MAAASISCFALPSKFRNLSLNASSSCLPACPSSSTLKSLSFSSSISVSAFSNGCLSMSKAQRPLRYSVVCEAAPQKKADSAAKRARQAEKRRIYNKARKS

Query:  EIKTRIKKALEALDDLKKKPEAQSEEVLPIEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARRKKAVEIHHGWYTPASPAAA
        EIKTR+KK LEALDDLKKKPEAQSEEVL IEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARRKKAVEIHHGWY PASPAAA
Subjt:  EIKTRIKKALEALDDLKKKPEAQSEEVLPIEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARRKKAVEIHHGWYTPASPAAA

A0A6J1F6F1 30S ribosomal protein S20, chloroplastic-like1.8e-7590.16Show/hide
Query:  AAASISCFALPSKFRNLSLNASSSCLPACPSSSTLKSLSFSSSISVSAFSNGCLSMSKAQRPLRYSVVCEAAPQKKADSAAKRARQAEKRRIYNKARKSE
        A A ISCFALPSKFRNLSLNASSS +   PSSSTL+SLSFSS++SVSAFSNGCLS+SKAQRP RYSVVCEAAP KKADSAAKRARQAEKRR+YNKARKSE
Subjt:  AAASISCFALPSKFRNLSLNASSSCLPACPSSSTLKSLSFSSSISVSAFSNGCLSMSKAQRPLRYSVVCEAAPQKKADSAAKRARQAEKRRIYNKARKSE

Query:  IKTRIKKALEALDDLKKKPEAQSEEVLPIEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARRKKAVEIHHGWYTPASPAAA
        IKTR+KK +EALDDLKKKPEAQSEEVLPIEKLIAEA+SVIDKAVKVGTLHRNTAARRKSRLARRKKAVEIHHGWYTPASPAAA
Subjt:  IKTRIKKALEALDDLKKKPEAQSEEVLPIEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARRKKAVEIHHGWYTPASPAAA

A0A6J1FKZ1 30S ribosomal protein S20, chloroplastic8.5e-7891.85Show/hide
Query:  MAAASISCFALPSKFRNLSLNASSSCLPACPSSSTLKSLSFSSSISVSAFSNGCLSMSKAQRPLRYSVVCEAAPQKKADSAAKRARQAEKRRIYNKARKS
        MAAAS SCFALPSKF NLSLNAS SCLP   SS  LKSLSFSS+ISVSAFSNGCLSMS AQRPLRYSVVCEAAPQKKADSAAKRARQAEKRR YNKARKS
Subjt:  MAAASISCFALPSKFRNLSLNASSSCLPACPSSSTLKSLSFSSSISVSAFSNGCLSMSKAQRPLRYSVVCEAAPQKKADSAAKRARQAEKRRIYNKARKS

Query:  EIKTRIKKALEALDDLKKKPEAQSEEVLPIEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARRKKAVEIHHGWYTPASPAAA
        EI+TR+KKA+EALDDLKKKPEAQSEEVLPIEKLIAEAYSVIDKAVKVGTLHRNTAA RKSRLARRKKAVEIHHGWYTPASPAAA
Subjt:  EIKTRIKKALEALDDLKKKPEAQSEEVLPIEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARRKKAVEIHHGWYTPASPAAA

A0A6J1IG57 30S ribosomal protein S20, chloroplastic-like1.2e-7690.71Show/hide
Query:  AAASISCFALPSKFRNLSLNASSSCLPACPSSSTLKSLSFSSSISVSAFSNGCLSMSKAQRPLRYSVVCEAAPQKKADSAAKRARQAEKRRIYNKARKSE
        A A ISCFALPSKFRNLSLNASSS +P  PSSSTL+SLSFSS++SVSAFSNGCLS+SKAQRP RYSVVCEAAP KKADSAAKRARQAEKRR+YNKARKSE
Subjt:  AAASISCFALPSKFRNLSLNASSSCLPACPSSSTLKSLSFSSSISVSAFSNGCLSMSKAQRPLRYSVVCEAAPQKKADSAAKRARQAEKRRIYNKARKSE

Query:  IKTRIKKALEALDDLKKKPEAQSEEVLPIEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARRKKAVEIHHGWYTPASPAAA
        IKTR+KK +EALDDLKKKPEAQSEEVLPIEKLIAEA+SVIDKAVKVGTLHRNTAARRKSRLARRKKAVEIHHGWYTPASPAAA
Subjt:  IKTRIKKALEALDDLKKKPEAQSEEVLPIEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARRKKAVEIHHGWYTPASPAAA

A0A6J1IVC5 30S ribosomal protein S20, chloroplastic-like3.2e-7790.22Show/hide
Query:  MAAASISCFALPSKFRNLSLNASSSCLPACPSSSTLKSLSFSSSISVSAFSNGCLSMSKAQRPLRYSVVCEAAPQKKADSAAKRARQAEKRRIYNKARKS
        MAAAS SCFALPSKF NLSLNASSSCLP   SS  L+SLSFSS++SVSAFS+GCLSMS AQRPLRYSVVCEAAPQKKADSAAKRARQAEKRR YNKARKS
Subjt:  MAAASISCFALPSKFRNLSLNASSSCLPACPSSSTLKSLSFSSSISVSAFSNGCLSMSKAQRPLRYSVVCEAAPQKKADSAAKRARQAEKRRIYNKARKS

Query:  EIKTRIKKALEALDDLKKKPEAQSEEVLPIEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARRKKAVEIHHGWYTPASPAAA
        EI+TR+KKA+EALDDLKKKPEAQSEEVLP+EKLIAEAYSVIDKAVKVGTLHRNTAA RKSRLARRKKAVEIHHGWYTPASPAAA
Subjt:  EIKTRIKKALEALDDLKKKPEAQSEEVLPIEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARRKKAVEIHHGWYTPASPAAA

SwissProt top hitse value%identityAlignment
B0C2C8 30S ribosomal protein S202.1e-0949.44Show/hide
Query:  SAAKRARQAEKRRIYNKARKSEIKTRIKKALEALDDLKKKPEAQSEEVLPIEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARRKKA
        SA KR   AE+ R+ NKA KS IKT  K+   A+DD K  P    +++  I+  ++  YS IDKAVKVG  HRNT AR+K+ LAR  KA
Subjt:  SAAKRARQAEKRRIYNKARKSEIKTRIKKALEALDDLKKKPEAQSEEVLPIEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARRKKA

P62661 30S ribosomal protein S202.8e-0951.11Show/hide
Query:  KKADSAAKRARQAEKRRIYNKARKSEIKTRIKKALEALDDLKKKPEAQSEEVLPIEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARR
        K+  SA KR RQ+ KRR+ NKA+KS IKT  KKA++         E ++EE L   K++ +A S+IDKA K  TLH+N AARRKSRL R+
Subjt:  KKADSAAKRARQAEKRRIYNKARKSEIKTRIKKALEALDDLKKKPEAQSEEVLPIEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARR

P80380 30S ribosomal protein S202.1e-0951.11Show/hide
Query:  KKADSAAKRARQAEKRRIYNKARKSEIKTRIKKALEALDDLKKKPEAQSEEVLPIEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARR
        K+  SA KR RQ+ KRR+ NKA+KS IKT  KKA++         E ++EE L   K++ +A S+IDKA K  TLH+N AARRKSRL R+
Subjt:  KKADSAAKRARQAEKRRIYNKARKSEIKTRIKKALEALDDLKKKPEAQSEEVLPIEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARR

P82130 30S ribosomal protein S20, chloroplastic1.2e-4462.57Show/hide
Query:  MAAASISCFALPSKFRNLSLNASSSCLPACPSSSTLKSLSFSSSISVSAFSNGCLSMSKAQRPLRYSVVCEAAPQKKADSAAKRARQAEKRRIYNKARKS
        MA  S  C ++ SK  NLS   SS+  P    +S+LK L+FS+++S   FS GC S+   QR   +SVVCE A  KKADSAAKR RQAE RR+ NKARKS
Subjt:  MAAASISCFALPSKFRNLSLNASSSCLPACPSSSTLKSLSFSSSISVSAFSNGCLSMSKAQRPLRYSVVCEAAPQKKADSAAKRARQAEKRRIYNKARKS

Query:  EIKTRIKKALEALDDLKKKPEAQSEEVLPIEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARRKKAVEIHHGWYTPA
        E+KTR++K  EALD LKKK  A +EE++PI+ LIAEAYS IDKAV  GTLHRNTAARRKSRLAR KK VEIHHGWYTP+
Subjt:  EIKTRIKKALEALDDLKKKPEAQSEEVLPIEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARRKKAVEIHHGWYTPA

Q9ASV6 30S ribosomal protein S20, chloroplastic3.5e-4462.03Show/hide
Query:  SCFALPSKFRNLSLNASSSCLPACP------SSSTL-KSLSFSSSIS-VSAFSNGCLSMSKAQRPLRYSVVCE-AAPQKKADSAAKRARQAEKRRIYNKA
        SC  L S+F+ LSL   S   P+        +S+TL  SLSFS S+S   AFS G L +   Q+P+R  +VCE AAP KKADSAAKRARQAEKRR+YNK+
Subjt:  SCFALPSKFRNLSLNASSSCLPACP------SSSTL-KSLSFSSSIS-VSAFSNGCLSMSKAQRPLRYSVVCE-AAPQKKADSAAKRARQAEKRRIYNKA

Query:  RKSEIKTRIKKALEALDDLKKKPEAQSEEVLPIEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARRKKAVEIHHGWYTPASPAAA
        +KSE +TR+KK LEAL+ LKKK +AQ++E++ +EKLI EAYS IDKAVKV  LH+NT ARRKSRLARRKKAVEIHHGWY P + AAA
Subjt:  RKSEIKTRIKKALEALDDLKKKPEAQSEEVLPIEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARRKKAVEIHHGWYTPASPAAA

Arabidopsis top hitse value%identityAlignment
AT3G15190.1 chloroplast 30S ribosomal protein S20, putative2.5e-4562.03Show/hide
Query:  SCFALPSKFRNLSLNASSSCLPACP------SSSTL-KSLSFSSSIS-VSAFSNGCLSMSKAQRPLRYSVVCE-AAPQKKADSAAKRARQAEKRRIYNKA
        SC  L S+F+ LSL   S   P+        +S+TL  SLSFS S+S   AFS G L +   Q+P+R  +VCE AAP KKADSAAKRARQAEKRR+YNK+
Subjt:  SCFALPSKFRNLSLNASSSCLPACP------SSSTL-KSLSFSSSIS-VSAFSNGCLSMSKAQRPLRYSVVCE-AAPQKKADSAAKRARQAEKRRIYNKA

Query:  RKSEIKTRIKKALEALDDLKKKPEAQSEEVLPIEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARRKKAVEIHHGWYTPASPAAA
        +KSE +TR+KK LEAL+ LKKK +AQ++E++ +EKLI EAYS IDKAVKV  LH+NT ARRKSRLARRKKAVEIHHGWY P + AAA
Subjt:  RKSEIKTRIKKALEALDDLKKKPEAQSEEVLPIEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARRKKAVEIHHGWYTPASPAAA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGCAGCTTCAATTAGTTGCTTCGCTCTTCCTTCTAAGTTCAGAAATCTTTCGCTTAATGCTTCGTCCTCCTGTTTGCCCGCTTGCCCATCTTCTTCAACCCTCAA
ATCTCTCAGTTTCTCCTCCAGCATTTCAGTTTCTGCCTTCTCCAATGGGTGCCTGTCGATGAGTAAAGCTCAGAGGCCACTTCGTTACTCTGTGGTTTGCGAGGCGGCTC
CTCAGAAAAAGGCTGATTCTGCTGCAAAGAGAGCTCGGCAGGCAGAGAAAAGGCGCATTTACAATAAAGCCCGGAAGTCTGAAATCAAAACCAGGATCAAGAAGGCTTTG
GAAGCTTTAGATGATCTGAAGAAGAAACCTGAAGCACAATCAGAGGAAGTCCTTCCAATCGAGAAGCTCATTGCAGAAGCATACTCGGTGATCGACAAAGCCGTGAAAGT
GGGAACATTGCATCGAAACACTGCAGCACGTCGAAAATCTCGGCTAGCCAGACGAAAGAAAGCTGTAGAAATCCACCATGGCTGGTACACCCCTGCTTCACCAGCAGCAG
CCTGA
mRNA sequenceShow/hide mRNA sequence
GTGGGGTTGGAATTAAAACAGAACCCGGATGAGAATATACTAAACTTGAATCCAAACAGCCAAAAATAACAGAGGATAAACCGATACTGCAAAAACTCTTGCTTTCAGTG
GATAAGCGCCATCATTCTTCAACCTTTGGAGCTCTGAGAATTGCCCAGAAAAATGGCGGCAGCTTCAATTAGTTGCTTCGCTCTTCCTTCTAAGTTCAGAAATCTTTCGC
TTAATGCTTCGTCCTCCTGTTTGCCCGCTTGCCCATCTTCTTCAACCCTCAAATCTCTCAGTTTCTCCTCCAGCATTTCAGTTTCTGCCTTCTCCAATGGGTGCCTGTCG
ATGAGTAAAGCTCAGAGGCCACTTCGTTACTCTGTGGTTTGCGAGGCGGCTCCTCAGAAAAAGGCTGATTCTGCTGCAAAGAGAGCTCGGCAGGCAGAGAAAAGGCGCAT
TTACAATAAAGCCCGGAAGTCTGAAATCAAAACCAGGATCAAGAAGGCTTTGGAAGCTTTAGATGATCTGAAGAAGAAACCTGAAGCACAATCAGAGGAAGTCCTTCCAA
TCGAGAAGCTCATTGCAGAAGCATACTCGGTGATCGACAAAGCCGTGAAAGTGGGAACATTGCATCGAAACACTGCAGCACGTCGAAAATCTCGGCTAGCCAGACGAAAG
AAAGCTGTAGAAATCCACCATGGCTGGTACACCCCTGCTTCACCAGCAGCAGCCTGATTGCTACCCTTTTTTCTCATTGTTAATGGGATTGAGGAATGAATGGGAAGATC
ATTCTATTCTTTTTTTGCCGTAGATAATGACAACTTTTTACTTCATCTGTTTCTTTCATTTTTGTATTTAGTTTGAACCTTCATCTATGTGATATGTCTACAGTTGAAGT
TTTACTTCTATTTCATCTGCCTTGGTTGATGGTTCCAATTTTTTGAATACC
Protein sequenceShow/hide protein sequence
MAAASISCFALPSKFRNLSLNASSSCLPACPSSSTLKSLSFSSSISVSAFSNGCLSMSKAQRPLRYSVVCEAAPQKKADSAAKRARQAEKRRIYNKARKSEIKTRIKKAL
EALDDLKKKPEAQSEEVLPIEKLIAEAYSVIDKAVKVGTLHRNTAARRKSRLARRKKAVEIHHGWYTPASPAAA