; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr017757 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr017757
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionU11/U12 small nuclear ribonucleoprotein 25 kDa protein
Genome locationtig00153055:583735..587784
RNA-Seq ExpressionSgr017757
SyntenySgr017757
Gene Ontology termsGO:0000398 - mRNA splicing, via spliceosome (biological process)
GO:0006749 - glutathione metabolic process (biological process)
GO:0005689 - U12-type spliceosomal complex (cellular component)
GO:0005737 - cytoplasm (cellular component)
GO:0004364 - glutathione transferase activity (molecular function)
GO:0043295 - glutathione binding (molecular function)
InterPro domainsIPR029071 - Ubiquitin-like domain superfamily
IPR039690 - U11/U12 small nuclear ribonucleoprotein 25kDa protein
IPR040610 - SNRNP25, ubiquitin-like domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6591246.1 U11/U12 small nuclear ribonucleoprotein 25 kDa protein, partial [Cucurbita argyrosperma subsp. sororia]3.2e-4469.54Show/hide
Query:  MKEDGDDKMKSSSKDEE------------MSLN------------------PTLSDVDTLISLELGSAMRISVLKLDGTAFDVAIMNSATLKDLKLAIKK
        MKEDGD+ +KSSSK+EE            + LN                  PTL DVDTLISLELGSAMRISVLKLDGT  DVAIMNSA+LKDLKLAIKK
Subjt:  MKEDGDDKMKSSSKDEE------------MSLN------------------PTLSDVDTLISLELGSAMRISVLKLDGTAFDVAIMNSATLKDLKLAIKK

Query:  KINDIEQSKMGHRHISWKHVWANFCLAHSNEKLLDDGSALQDFGIRNNSQV
        K+N++EQSKMGHRHISWKHVWANFCLAH NEK+LDD SALQDFGIRNNSQV
Subjt:  KINDIEQSKMGHRHISWKHVWANFCLAHSNEKLLDDGSALQDFGIRNNSQV

XP_004145525.1 U11/U12 small nuclear ribonucleoprotein 25 kDa protein [Cucumis sativus]9.4e-4468.87Show/hide
Query:  MKEDGDDKMKSSSKDEEMSL------------------------------NPTLSDVDTLISLELGSAMRISVLKLDGTAFDVAIMNSATLKDLKLAIKK
        MKE+GD  MKS  KDEE ++                              NPTLS VDTLISLELGSAMRISVLKLDGTA DV IMNSATLKDLKLAIKK
Subjt:  MKEDGDDKMKSSSKDEEMSL------------------------------NPTLSDVDTLISLELGSAMRISVLKLDGTAFDVAIMNSATLKDLKLAIKK

Query:  KINDIEQSKMGHRHISWKHVWANFCLAHSNEKLLDDGSALQDFGIRNNSQV
        K+N++EQSKMGHRHISWKHVWANFCLAH NEKLLDD S LQDFGIRNNSQV
Subjt:  KINDIEQSKMGHRHISWKHVWANFCLAHSNEKLLDDGSALQDFGIRNNSQV

XP_022140942.1 U11/U12 small nuclear ribonucleoprotein 25 kDa protein [Momordica charantia]8.5e-4569.54Show/hide
Query:  MKEDGDDKMKSSSKDEE------------------------------MSLNPTLSDVDTLISLELGSAMRISVLKLDGTAFDVAIMNSATLKDLKLAIKK
        MKE  DDK+KSS  DEE                              +  NPTLSDVDTLISLELGSAMRISVLKLDG A DVAIMNSATLKDLKL IKK
Subjt:  MKEDGDDKMKSSSKDEE------------------------------MSLNPTLSDVDTLISLELGSAMRISVLKLDGTAFDVAIMNSATLKDLKLAIKK

Query:  KINDIEQSKMGHRHISWKHVWANFCLAHSNEKLLDDGSALQDFGIRNNSQV
        K+ND+EQSKMGHRHISWKHVW NFCLAH NEKLLD+GSALQDFGIRNNSQV
Subjt:  KINDIEQSKMGHRHISWKHVWANFCLAHSNEKLLDDGSALQDFGIRNNSQV

XP_022975470.1 U11/U12 small nuclear ribonucleoprotein 25 kDa protein isoform X1 [Cucurbita maxima]1.2e-4369.33Show/hide
Query:  MKEDGDDKMKSSSKDEE------------MSLN------------------PTLSDVDTLISLELGSAMRISVLKLDGTAFDVAIMNSATLKDLKLAIKK
        MKEDGD+ +KSSSK+EE            + LN                  PTL DVDTLISLELGSAMRISVLKLDGT  DVAIMNSA+LKDLKLAIKK
Subjt:  MKEDGDDKMKSSSKDEE------------MSLN------------------PTLSDVDTLISLELGSAMRISVLKLDGTAFDVAIMNSATLKDLKLAIKK

Query:  KINDIEQSKMGHRHISWKHVWANFCLAHSNEKLLDDGSALQDFGIRNNSQ
        K+N++EQSKMGHRHISWKHVWANFCLAH NEK+LDD SALQDFGIRNNSQ
Subjt:  KINDIEQSKMGHRHISWKHVWANFCLAHSNEKLLDDGSALQDFGIRNNSQ

XP_022975482.1 U11/U12 small nuclear ribonucleoprotein 25 kDa protein isoform X3 [Cucurbita maxima]3.2e-4469.54Show/hide
Query:  MKEDGDDKMKSSSKDEE------------MSLN------------------PTLSDVDTLISLELGSAMRISVLKLDGTAFDVAIMNSATLKDLKLAIKK
        MKEDGD+ +KSSSK+EE            + LN                  PTL DVDTLISLELGSAMRISVLKLDGT  DVAIMNSA+LKDLKLAIKK
Subjt:  MKEDGDDKMKSSSKDEE------------MSLN------------------PTLSDVDTLISLELGSAMRISVLKLDGTAFDVAIMNSATLKDLKLAIKK

Query:  KINDIEQSKMGHRHISWKHVWANFCLAHSNEKLLDDGSALQDFGIRNNSQV
        K+N++EQSKMGHRHISWKHVWANFCLAH NEK+LDD SALQDFGIRNNSQV
Subjt:  KINDIEQSKMGHRHISWKHVWANFCLAHSNEKLLDDGSALQDFGIRNNSQV

TrEMBL top hitse value%identityAlignment
A0A0A0L4F3 Ubiquitin_4 domain-containing protein4.6e-4468.87Show/hide
Query:  MKEDGDDKMKSSSKDEEMSL------------------------------NPTLSDVDTLISLELGSAMRISVLKLDGTAFDVAIMNSATLKDLKLAIKK
        MKE+GD  MKS  KDEE ++                              NPTLS VDTLISLELGSAMRISVLKLDGTA DV IMNSATLKDLKLAIKK
Subjt:  MKEDGDDKMKSSSKDEEMSL------------------------------NPTLSDVDTLISLELGSAMRISVLKLDGTAFDVAIMNSATLKDLKLAIKK

Query:  KINDIEQSKMGHRHISWKHVWANFCLAHSNEKLLDDGSALQDFGIRNNSQV
        K+N++EQSKMGHRHISWKHVWANFCLAH NEKLLDD S LQDFGIRNNSQV
Subjt:  KINDIEQSKMGHRHISWKHVWANFCLAHSNEKLLDDGSALQDFGIRNNSQV

A0A6J1CH74 U11/U12 small nuclear ribonucleoprotein 25 kDa protein4.1e-4569.54Show/hide
Query:  MKEDGDDKMKSSSKDEE------------------------------MSLNPTLSDVDTLISLELGSAMRISVLKLDGTAFDVAIMNSATLKDLKLAIKK
        MKE  DDK+KSS  DEE                              +  NPTLSDVDTLISLELGSAMRISVLKLDG A DVAIMNSATLKDLKL IKK
Subjt:  MKEDGDDKMKSSSKDEE------------------------------MSLNPTLSDVDTLISLELGSAMRISVLKLDGTAFDVAIMNSATLKDLKLAIKK

Query:  KINDIEQSKMGHRHISWKHVWANFCLAHSNEKLLDDGSALQDFGIRNNSQV
        K+ND+EQSKMGHRHISWKHVW NFCLAH NEKLLD+GSALQDFGIRNNSQV
Subjt:  KINDIEQSKMGHRHISWKHVWANFCLAHSNEKLLDDGSALQDFGIRNNSQV

A0A6J1IGT3 U11/U12 small nuclear ribonucleoprotein 25 kDa protein isoform X16.0e-4469.33Show/hide
Query:  MKEDGDDKMKSSSKDEE------------MSLN------------------PTLSDVDTLISLELGSAMRISVLKLDGTAFDVAIMNSATLKDLKLAIKK
        MKEDGD+ +KSSSK+EE            + LN                  PTL DVDTLISLELGSAMRISVLKLDGT  DVAIMNSA+LKDLKLAIKK
Subjt:  MKEDGDDKMKSSSKDEE------------MSLN------------------PTLSDVDTLISLELGSAMRISVLKLDGTAFDVAIMNSATLKDLKLAIKK

Query:  KINDIEQSKMGHRHISWKHVWANFCLAHSNEKLLDDGSALQDFGIRNNSQ
        K+N++EQSKMGHRHISWKHVWANFCLAH NEK+LDD SALQDFGIRNNSQ
Subjt:  KINDIEQSKMGHRHISWKHVWANFCLAHSNEKLLDDGSALQDFGIRNNSQ

A0A6J1IJC5 U11/U12 small nuclear ribonucleoprotein 25 kDa protein isoform X31.6e-4469.54Show/hide
Query:  MKEDGDDKMKSSSKDEE------------MSLN------------------PTLSDVDTLISLELGSAMRISVLKLDGTAFDVAIMNSATLKDLKLAIKK
        MKEDGD+ +KSSSK+EE            + LN                  PTL DVDTLISLELGSAMRISVLKLDGT  DVAIMNSA+LKDLKLAIKK
Subjt:  MKEDGDDKMKSSSKDEE------------MSLN------------------PTLSDVDTLISLELGSAMRISVLKLDGTAFDVAIMNSATLKDLKLAIKK

Query:  KINDIEQSKMGHRHISWKHVWANFCLAHSNEKLLDDGSALQDFGIRNNSQV
        K+N++EQSKMGHRHISWKHVWANFCLAH NEK+LDD SALQDFGIRNNSQV
Subjt:  KINDIEQSKMGHRHISWKHVWANFCLAHSNEKLLDDGSALQDFGIRNNSQV

A0A6J1IKL9 U11/U12 small nuclear ribonucleoprotein 25 kDa protein isoform X26.0e-4469.33Show/hide
Query:  MKEDGDDKMKSSSKDEE------------MSLN------------------PTLSDVDTLISLELGSAMRISVLKLDGTAFDVAIMNSATLKDLKLAIKK
        MKEDGD+ +KSSSK+EE            + LN                  PTL DVDTLISLELGSAMRISVLKLDGT  DVAIMNSA+LKDLKLAIKK
Subjt:  MKEDGDDKMKSSSKDEE------------MSLN------------------PTLSDVDTLISLELGSAMRISVLKLDGTAFDVAIMNSATLKDLKLAIKK

Query:  KINDIEQSKMGHRHISWKHVWANFCLAHSNEKLLDDGSALQDFGIRNNSQ
        K+N++EQSKMGHRHISWKHVWANFCLAH NEK+LDD SALQDFGIRNNSQ
Subjt:  KINDIEQSKMGHRHISWKHVWANFCLAHSNEKLLDDGSALQDFGIRNNSQ

SwissProt top hitse value%identityAlignment
Q3ZBQ4 U11/U12 small nuclear ribonucleoprotein 25 kDa protein1.9e-1538.1Show/hide
Query:  EMSLNPTLSDVDTLISLELGSAMRISVLKLDGTAFDVAIMNSATLKDLKLAIKKKINDIEQSKMGHRHISWKHVWANFCLAHSNEKLLDDGSALQDFGIR
        ++ +  TL +V++ I+LE G AM + V K+DG    V ++ +AT+ DLK AI++ +   ++ + G +HISW +VW  + L  + EKL +D   L+D+GIR
Subjt:  EMSLNPTLSDVDTLISLELGSAMRISVLKLDGTAFDVAIMNSATLKDLKLAIKKKINDIEQSKMGHRHISWKHVWANFCLAHSNEKLLDDGSALQDFGIR

Query:  NNSQV
        N  +V
Subjt:  NNSQV

Q84WS8 U11/U12 small nuclear ribonucleoprotein 25 kDa protein1.9e-3977.23Show/hide
Query:  NPTLSDVDTLISLELGSAMRISVLKLDGTAFDVAIMNSATLKDLKLAIKKKINDIEQSKMGHRHISWKHVWANFCLAHSNEKLLDDGSALQDFGIRNNSQ
        NPTLSDV TL+SLE GSAMR+SV+KLDG++ DVA+MNSATLKDLKL IKKK+N++EQ+ MGHRHISWKHVW+NFCL+ +NEKLLDD + LQD GIRNNSQ
Subjt:  NPTLSDVDTLISLELGSAMRISVLKLDGTAFDVAIMNSATLKDLKLAIKKKINDIEQSKMGHRHISWKHVWANFCLAHSNEKLLDDGSALQDFGIRNNSQ

Query:  V
        V
Subjt:  V

Q8VIK1 U11/U12 small nuclear ribonucleoprotein 25 kDa protein1.9e-1538.1Show/hide
Query:  EMSLNPTLSDVDTLISLELGSAMRISVLKLDGTAFDVAIMNSATLKDLKLAIKKKINDIEQSKMGHRHISWKHVWANFCLAHSNEKLLDDGSALQDFGIR
        ++ +  TL +V++ I+LE G AM + V K+DG    V ++ +AT+ DLK AI++ +   ++ + G +HISW +VW  + L  + EKL +D   L+D+GIR
Subjt:  EMSLNPTLSDVDTLISLELGSAMRISVLKLDGTAFDVAIMNSATLKDLKLAIKKKINDIEQSKMGHRHISWKHVWANFCLAHSNEKLLDDGSALQDFGIR

Query:  NNSQV
        N  +V
Subjt:  NNSQV

Q9BV90 U11/U12 small nuclear ribonucleoprotein 25 kDa protein8.6e-1639.05Show/hide
Query:  EMSLNPTLSDVDTLISLELGSAMRISVLKLDGTAFDVAIMNSATLKDLKLAIKKKINDIEQSKMGHRHISWKHVWANFCLAHSNEKLLDDGSALQDFGIR
        ++ +  TL +V++ I+LE G AM + V K+DG    V ++ SAT+ DLK AI++ +   ++ + G +HISW +VW  + L  + EKL +D   L+D+GIR
Subjt:  EMSLNPTLSDVDTLISLELGSAMRISVLKLDGTAFDVAIMNSATLKDLKLAIKKKINDIEQSKMGHRHISWKHVWANFCLAHSNEKLLDDGSALQDFGIR

Query:  NNSQV
        N  +V
Subjt:  NNSQV

Arabidopsis top hitse value%identityAlignment
AT1G80060.1 Ubiquitin-like superfamily protein5.6e-1032.94Show/hide
Query:  MRISVLKLDGTAFDVAIMNSATLKDLKLAIKK--KINDIEQSKMGHRHISWKHVWANFCLAHSNEKLLDDGSALQDFGIRNNSQV
        +++SV+KL+G+ FDV +    ++ +LK A+++   I+ +E    GH  ISW HVW +FCL + +++L++D ++++  G+ +  Q+
Subjt:  MRISVLKLDGTAFDVAIMNSATLKDLKLAIKK--KINDIEQSKMGHRHISWKHVWANFCLAHSNEKLLDDGSALQDFGIRNNSQV

AT3G07860.1 Ubiquitin-like superfamily protein1.4e-4077.23Show/hide
Query:  NPTLSDVDTLISLELGSAMRISVLKLDGTAFDVAIMNSATLKDLKLAIKKKINDIEQSKMGHRHISWKHVWANFCLAHSNEKLLDDGSALQDFGIRNNSQ
        NPTLSDV TL+SLE GSAMR+SV+KLDG++ DVA+MNSATLKDLKL IKKK+N++EQ+ MGHRHISWKHVW+NFCL+ +NEKLLDD + LQD GIRNNSQ
Subjt:  NPTLSDVDTLISLELGSAMRISVLKLDGTAFDVAIMNSATLKDLKLAIKKKINDIEQSKMGHRHISWKHVWANFCLAHSNEKLLDDGSALQDFGIRNNSQ

Query:  V
        V
Subjt:  V

AT4G32270.1 Ubiquitin-like superfamily protein2.0e-1233.7Show/hide
Query:  MRISVLKLDGTAFDVAIMNSATLKDLKLAIKKKINDIEQSKMGHRHISWKHVWANFCLAHSNEKLLDDGSALQDFGIRNNSQV--VPRLSNH
        ++++VLKLDG++F + ++ +AT+ +LK+A++   + +  S  G   ISW HVW  FCL++ +++L+++   L +FGI++  Q+  +  +SN+
Subjt:  MRISVLKLDGTAFDVAIMNSATLKDLKLAIKKKINDIEQSKMGHRHISWKHVWANFCLAHSNEKLLDDGSALQDFGIRNNSQV--VPRLSNH

AT4G32270.2 Ubiquitin-like superfamily protein1.1e-1033.7Show/hide
Query:  MRISVLKLDGTAFDVAIMNSATLKDLKLAIKKKINDIEQSKMGHRHISWKHVWANFCLAHSNEKLLDDGSALQDFGIRNNSQV--VPRLSNH
        ++++VLKLDG++F   ++ +AT+ +LK+A++   + +  S  G   ISW HVW  FCL++ +++L+++   L +FGI++  Q+  +  +SN+
Subjt:  MRISVLKLDGTAFDVAIMNSATLKDLKLAIKKKINDIEQSKMGHRHISWKHVWANFCLAHSNEKLLDDGSALQDFGIRNNSQV--VPRLSNH

AT5G25340.1 Ubiquitin-like superfamily protein5.8e-1541.76Show/hide
Query:  MRISVLKLDGTAFDVAIMNSATLKDLKLAIKKKINDIEQSKMGHRHISWKHVWANFCLAHSNEKLLDDGSALQDFGIRNNSQVVPRLSNHL
        +R+SVLKLDG++FDV ++ SAT+ DLK+AI+   + +   K G   ISW HVW +FCL    +KL+ D   + ++G+++  +V  R  NH+
Subjt:  MRISVLKLDGTAFDVAIMNSATLKDLKLAIKKKINDIEQSKMGHRHISWKHVWANFCLAHSNEKLLDDGSALQDFGIRNNSQVVPRLSNHL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGGAGGACGGAGATGATAAAATGAAATCCAGCAGTAAAGACGAAGAAATGTCGTTGAATCCCACCTTGTCGGACGTCGATACCCTCATCAGCCTCGAATTGGGCAG
CGCCATGCGCATCTCCGTCCTCAAGCTCGACGGCACCGCCTTCGATGTGGCCATCATGAATTCCGCGACATTGAAGGACTTGAAGCTTGCTATCAAGAAGAAAATAAACG
ACATAGAGCAATCCAAGATGGGCCATCGCCACATTTCATGGAAGCATGTGTGGGCAAATTTTTGCCTAGCGCACAGCAACGAGAAGCTTCTGGATGATGGCTCCGCGCTT
CAGGATTTTGGAATCCGCAACAATTCACAGGTTGTTCCAAGACTCTCCAATCATCTTGGTGAACTCCCTCTCTCTGTTTGGGTACAGAGATTTCAGCTTGTAATGCTTCT
CTGCAAAGAAGAAATTGTACCCACTTCTGTTTGGTTTCGGGTGGTTCGGGTCTCCCTTTCTCCGGCTTCTCCGGTGCCGCCGACCGGAGTGTCGGTGTCTACCAGTGTAC
GGTACAATGGCGTTGATAGATTGGGGCCGTGGATCGGACGGGGCTGGCTGGTCTGGATGGCGGACCAGGCGAAAACGACGTTGTTTTGGGCGTATATTCCACCAGAGCCA
ATTCGTTTTCACTCGAAGGGCTACCGAAAGAAAATGGCGCTGCAGAAGAAGAAGACAAAGCAGAAATCAGATTTACAGCAAAATATTCTGAAAATGGAAAAAGAAGAAAA
AAAAAAAAACCGGAAGGAGAAACAAAAAGCCACCACCTTGTGGGGAACAGATGGGGCCCTGCCGGCTGAAGAAGTAAACTTGCTCGTAATGAACTTGAAGACACTGCCCA
CTTCCCTCCATTTCTTCTCTGCAACAACCTACGCCGCCATGAAAGCAGTTTACAAACAGTTCACTTCTCAAACAAAAGCTTCGAAAAAGAACGAAGCTGAGAAATCAGAG
AAAGGAAAGTGTGGTGTCTTTTTGTTGTGTAAAACTTTTAGAGGAGCCGCCATTAATATCATCGAGCTTTTTGGATCGAAGGCGTCTCAAAGTGTCCCAGAAAACAATGG
GGTCCGAAATAACTTCGTCATGGGTGGCAAGCGGCGGAGGATAATGCTTATCGAGGCCGCCGTTCCAGCTCTTGCTCCTCGCCGGCGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGGAGGACGGAGATGATAAAATGAAATCCAGCAGTAAAGACGAAGAAATGTCGTTGAATCCCACCTTGTCGGACGTCGATACCCTCATCAGCCTCGAATTGGGCAG
CGCCATGCGCATCTCCGTCCTCAAGCTCGACGGCACCGCCTTCGATGTGGCCATCATGAATTCCGCGACATTGAAGGACTTGAAGCTTGCTATCAAGAAGAAAATAAACG
ACATAGAGCAATCCAAGATGGGCCATCGCCACATTTCATGGAAGCATGTGTGGGCAAATTTTTGCCTAGCGCACAGCAACGAGAAGCTTCTGGATGATGGCTCCGCGCTT
CAGGATTTTGGAATCCGCAACAATTCACAGGTTGTTCCAAGACTCTCCAATCATCTTGGTGAACTCCCTCTCTCTGTTTGGGTACAGAGATTTCAGCTTGTAATGCTTCT
CTGCAAAGAAGAAATTGTACCCACTTCTGTTTGGTTTCGGGTGGTTCGGGTCTCCCTTTCTCCGGCTTCTCCGGTGCCGCCGACCGGAGTGTCGGTGTCTACCAGTGTAC
GGTACAATGGCGTTGATAGATTGGGGCCGTGGATCGGACGGGGCTGGCTGGTCTGGATGGCGGACCAGGCGAAAACGACGTTGTTTTGGGCGTATATTCCACCAGAGCCA
ATTCGTTTTCACTCGAAGGGCTACCGAAAGAAAATGGCGCTGCAGAAGAAGAAGACAAAGCAGAAATCAGATTTACAGCAAAATATTCTGAAAATGGAAAAAGAAGAAAA
AAAAAAAAACCGGAAGGAGAAACAAAAAGCCACCACCTTGTGGGGAACAGATGGGGCCCTGCCGGCTGAAGAAGTAAACTTGCTCGTAATGAACTTGAAGACACTGCCCA
CTTCCCTCCATTTCTTCTCTGCAACAACCTACGCCGCCATGAAAGCAGTTTACAAACAGTTCACTTCTCAAACAAAAGCTTCGAAAAAGAACGAAGCTGAGAAATCAGAG
AAAGGAAAGTGTGGTGTCTTTTTGTTGTGTAAAACTTTTAGAGGAGCCGCCATTAATATCATCGAGCTTTTTGGATCGAAGGCGTCTCAAAGTGTCCCAGAAAACAATGG
GGTCCGAAATAACTTCGTCATGGGTGGCAAGCGGCGGAGGATAATGCTTATCGAGGCCGCCGTTCCAGCTCTTGCTCCTCGCCGGCGGTGA
Protein sequenceShow/hide protein sequence
MKEDGDDKMKSSSKDEEMSLNPTLSDVDTLISLELGSAMRISVLKLDGTAFDVAIMNSATLKDLKLAIKKKINDIEQSKMGHRHISWKHVWANFCLAHSNEKLLDDGSAL
QDFGIRNNSQVVPRLSNHLGELPLSVWVQRFQLVMLLCKEEIVPTSVWFRVVRVSLSPASPVPPTGVSVSTSVRYNGVDRLGPWIGRGWLVWMADQAKTTLFWAYIPPEP
IRFHSKGYRKKMALQKKKTKQKSDLQQNILKMEKEEKKKNRKEKQKATTLWGTDGALPAEEVNLLVMNLKTLPTSLHFFSATTYAAMKAVYKQFTSQTKASKKNEAEKSE
KGKCGVFLLCKTFRGAAINIIELFGSKASQSVPENNGVRNNFVMGGKRRRIMLIEAAVPALAPRRR