CuGenDBv2

MS001303 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS001303
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: UPF0481 protein At3g47200-like
Genome location: scaffold36:3152979..3153503
RNA-Seq Expression: MS001303
Synteny: MS001303
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR004158 - Protein of unknown function DUF247, plant
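
The genome location field can be split into scaffold, start, end, and span with a few lines of code. This is a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption, not stated on this page); `parse_location` is a hypothetical helper name.

```python
import re

def parse_location(loc):
    """Split 'seqid:start..end' into its parts and the span length.

    Assumes 1-based, inclusive coordinates (not confirmed by the record).
    """
    m = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    seqid, start, end = m.group(1), int(m.group(2)), int(m.group(3))
    return seqid, start, end, end - start + 1

print(parse_location("scaffold36:3152979..3153503"))
# ('scaffold36', 3152979, 3153503, 525)
```

Under that assumption the span is 525 bp, which is a multiple of three.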


Homology
GenBank top hits (e value, % identity, alignment)
KAA0064930.1 UPF0481 protein [Cucumis melo var. makuwa] (e value: 1.3e-50; % identity: 60)
Query:  PPTATELYDYGISFEKKSHYSQKM-FDERTGILRVPHIIINETFESTMRNIIAYEYTIRKSPGVSNFLMFMRFLLNSDNDVNLLIKEGIIHNHLESAKEV
        P TATEL+D GISF  +      + F ER G+L++P III+++FE   RN+IAYEY   KS  VSNF MFM FL+N++ DV+LL+ +GII NHL S +E+
Subjt:  PPTATELYDYGISFEKKSHYSQKM-FDERTGILRVPHIIINETFESTMRNIIAYEYTIRKSPGVSNFLMFMRFLLNSDNDVNLLIKEGIIHNHLESAKEV

Query:  TKLFQDLCKNVVAERNLYNYECQKMRKYCKHRRHRWMASLKHDYFNTPWALISFIAAVVLLLLTLMQAVVAVLSM
          LF DLCKN++ ERNLY+ EC+KM++YCKHRRHRWM SLK DYF TPWA ISF+AAV+LLLLTL+Q VVA +++
Subjt:  TKLFQDLCKNVVAERNLYNYECQKMRKYCKHRRHRWMASLKHDYFNTPWALISFIAAVVLLLLTLMQAVVAVLSM

XP_004138858.1 UPF0481 protein At3g47200 [Cucumis sativus] (e value: 4.0e-47; % identity: 57.71)
Query:  PPTATELYDYGISFE-KKSHYSQKMFDERTGILRVPHIIINETFESTMRNIIAYEYTIRKSPGVSNFLMFMRFLLNSDNDVNLLIKEGIIHNHLESAKEV
        P TATEL++ GISF  +K       F ER G+L++P III+++FE   RN+IAYEY   KS   SNF MFM FL+N++ DV+LL+ +GII N L S KE+
Subjt:  PPTATELYDYGISFE-KKSHYSQKMFDERTGILRVPHIIINETFESTMRNIIAYEYTIRKSPGVSNFLMFMRFLLNSDNDVNLLIKEGIIHNHLESAKEV

Query:  TKLFQDLCKNVVAERNLYNYECQKMRKYCKHRRHRWMASLKHDYFNTPWALISFIAAVVLLLLTLMQAVVAVLSM
          LF DLCKN++ ERN Y+  C +M++YCKHRRHRWM SLK DYF TPWA ISF+AAV+LLLLTL+Q VVA +++
Subjt:  TKLFQDLCKNVVAERNLYNYECQKMRKYCKHRRHRWMASLKHDYFNTPWALISFIAAVVLLLLTLMQAVVAVLSM

XP_008445209.1 PREDICTED: UPF0481 protein At3g47200-like [Cucumis melo] (e value: 2.3e-50; % identity: 60)
Query:  PPTATELYDYGISFEKKSHYSQKM-FDERTGILRVPHIIINETFESTMRNIIAYEYTIRKSPGVSNFLMFMRFLLNSDNDVNLLIKEGIIHNHLESAKEV
        P TATEL+D GISF  +      + F ER G+L++P III+++FE   RN+IAYEY   KS  VSNF MFM FL+N++ DV+LL+ +GII NHL S +E+
Subjt:  PPTATELYDYGISFEKKSHYSQKM-FDERTGILRVPHIIINETFESTMRNIIAYEYTIRKSPGVSNFLMFMRFLLNSDNDVNLLIKEGIIHNHLESAKEV

Query:  TKLFQDLCKNVVAERNLYNYECQKMRKYCKHRRHRWMASLKHDYFNTPWALISFIAAVVLLLLTLMQAVVAVLSM
          LF DLCKN++ ERNLY+ EC+KM++YCKHRRHRWM SLK DYF TPWA ISF+AAV+LLLLTL+Q VVA +++
Subjt:  TKLFQDLCKNVVAERNLYNYECQKMRKYCKHRRHRWMASLKHDYFNTPWALISFIAAVVLLLLTLMQAVVAVLSM

XP_022131636.1 UPF0481 protein At3g47200-like [Momordica charantia] (e value: 2.0e-91; % identity: 99.43)
Query:  PPTATELYDYGISFEKKSHYSQKMFDERTGILRVPHIIINETFESTMRNIIAYEYTIRKSPGVSNFLMFMRFLLNSDNDVNLLIKEGIIHNHLESAKEVT
        PPTATELYDYGISFEKKSHYSQKMFDERTGILRVPHIIINETFESTMRNIIAYEYTIRKSPGVSNFLMFMRFLLNSDNDVNLLIKEGIIHNHLESAKEVT
Subjt:  PPTATELYDYGISFEKKSHYSQKMFDERTGILRVPHIIINETFESTMRNIIAYEYTIRKSPGVSNFLMFMRFLLNSDNDVNLLIKEGIIHNHLESAKEVT

Query:  KLFQDLCKNVVAERNLYNYECQKMRKYCKHRRHRWMASLKHDYFNTPWALISFIAAVVLLLLTLMQAVVAVLSMP
        KLFQDLCKNVVAERNLYNYECQKMRKYCKHRRHRWMASLKHDYFNTPWALISFIAAVVLLLLTLM+AVVAVLSMP
Subjt:  KLFQDLCKNVVAERNLYNYECQKMRKYCKHRRHRWMASLKHDYFNTPWALISFIAAVVLLLLTLMQAVVAVLSMP

XP_022132033.1 UPF0481 protein At3g47200-like [Momordica charantia] (e value: 4.6e-75; % identity: 85.14)
Query:  PPTATELYDYGISFEKKSHYSQKMFDERTGILRVPHIIINETFESTMRNIIAYEYTIRKSPGVSNFLMFMRFLLNSDNDVNLLIKEGIIHNHLESAKEVT
        PPTATELYDYGISFEKKSHYSQKMFDER GILR+PHIIINETFES +RNIIA+E+  RK   VSNF +FMRFLLNSDNDV LLIKEGIIHNHLES K VT
Subjt:  PPTATELYDYGISFEKKSHYSQKMFDERTGILRVPHIIINETFESTMRNIIAYEYTIRKSPGVSNFLMFMRFLLNSDNDVNLLIKEGIIHNHLESAKEVT

Query:  KLFQDLCKNVVAERNLYNYECQKMRKYCKHRRHRWMASLKHDYFNTPWALISFIAAVVLLLLTLMQAVVAVLSMP
        KLF DLC+NVV E NLYNYEC++MR+Y KHRRHRWMASLK DYFNTPWALISFIAAVVLLLLTLMQ VVAVLSMP
Subjt:  KLFQDLCKNVVAERNLYNYECQKMRKYCKHRRHRWMASLKHDYFNTPWALISFIAAVVLLLLTLMQAVVAVLSMP

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LPK8 Uncharacterized protein (e value: 2.0e-47; % identity: 57.71)
Query:  PPTATELYDYGISFE-KKSHYSQKMFDERTGILRVPHIIINETFESTMRNIIAYEYTIRKSPGVSNFLMFMRFLLNSDNDVNLLIKEGIIHNHLESAKEV
        P TATEL++ GISF  +K       F ER G+L++P III+++FE   RN+IAYEY   KS   SNF MFM FL+N++ DV+LL+ +GII N L S KE+
Subjt:  PPTATELYDYGISFE-KKSHYSQKMFDERTGILRVPHIIINETFESTMRNIIAYEYTIRKSPGVSNFLMFMRFLLNSDNDVNLLIKEGIIHNHLESAKEV

Query:  TKLFQDLCKNVVAERNLYNYECQKMRKYCKHRRHRWMASLKHDYFNTPWALISFIAAVVLLLLTLMQAVVAVLSM
          LF DLCKN++ ERN Y+  C +M++YCKHRRHRWM SLK DYF TPWA ISF+AAV+LLLLTL+Q VVA +++
Subjt:  TKLFQDLCKNVVAERNLYNYECQKMRKYCKHRRHRWMASLKHDYFNTPWALISFIAAVVLLLLTLMQAVVAVLSM

A0A1S3BD00 UPF0481 protein At3g47200-like (e value: 1.1e-50; % identity: 60)
Query:  PPTATELYDYGISFEKKSHYSQKM-FDERTGILRVPHIIINETFESTMRNIIAYEYTIRKSPGVSNFLMFMRFLLNSDNDVNLLIKEGIIHNHLESAKEV
        P TATEL+D GISF  +      + F ER G+L++P III+++FE   RN+IAYEY   KS  VSNF MFM FL+N++ DV+LL+ +GII NHL S +E+
Subjt:  PPTATELYDYGISFEKKSHYSQKM-FDERTGILRVPHIIINETFESTMRNIIAYEYTIRKSPGVSNFLMFMRFLLNSDNDVNLLIKEGIIHNHLESAKEV

Query:  TKLFQDLCKNVVAERNLYNYECQKMRKYCKHRRHRWMASLKHDYFNTPWALISFIAAVVLLLLTLMQAVVAVLSM
          LF DLCKN++ ERNLY+ EC+KM++YCKHRRHRWM SLK DYF TPWA ISF+AAV+LLLLTL+Q VVA +++
Subjt:  TKLFQDLCKNVVAERNLYNYECQKMRKYCKHRRHRWMASLKHDYFNTPWALISFIAAVVLLLLTLMQAVVAVLSM

A0A5A7VCL1 UPF0481 protein (e value: 6.5e-51; % identity: 60)
Query:  PPTATELYDYGISFEKKSHYSQKM-FDERTGILRVPHIIINETFESTMRNIIAYEYTIRKSPGVSNFLMFMRFLLNSDNDVNLLIKEGIIHNHLESAKEV
        P TATEL+D GISF  +      + F ER G+L++P III+++FE   RN+IAYEY   KS  VSNF MFM FL+N++ DV+LL+ +GII NHL S +E+
Subjt:  PPTATELYDYGISFEKKSHYSQKM-FDERTGILRVPHIIINETFESTMRNIIAYEYTIRKSPGVSNFLMFMRFLLNSDNDVNLLIKEGIIHNHLESAKEV

Query:  TKLFQDLCKNVVAERNLYNYECQKMRKYCKHRRHRWMASLKHDYFNTPWALISFIAAVVLLLLTLMQAVVAVLSM
          LF DLCKN++ ERNLY+ EC+KM++YCKHRRHRWM SLK DYF TPWA ISF+AAV+LLLLTL+Q VVA +++
Subjt:  TKLFQDLCKNVVAERNLYNYECQKMRKYCKHRRHRWMASLKHDYFNTPWALISFIAAVVLLLLTLMQAVVAVLSM

A0A6J1BQ21 UPF0481 protein At3g47200-like (e value: 9.9e-92; % identity: 99.43)
Query:  PPTATELYDYGISFEKKSHYSQKMFDERTGILRVPHIIINETFESTMRNIIAYEYTIRKSPGVSNFLMFMRFLLNSDNDVNLLIKEGIIHNHLESAKEVT
        PPTATELYDYGISFEKKSHYSQKMFDERTGILRVPHIIINETFESTMRNIIAYEYTIRKSPGVSNFLMFMRFLLNSDNDVNLLIKEGIIHNHLESAKEVT
Subjt:  PPTATELYDYGISFEKKSHYSQKMFDERTGILRVPHIIINETFESTMRNIIAYEYTIRKSPGVSNFLMFMRFLLNSDNDVNLLIKEGIIHNHLESAKEVT

Query:  KLFQDLCKNVVAERNLYNYECQKMRKYCKHRRHRWMASLKHDYFNTPWALISFIAAVVLLLLTLMQAVVAVLSMP
        KLFQDLCKNVVAERNLYNYECQKMRKYCKHRRHRWMASLKHDYFNTPWALISFIAAVVLLLLTLM+AVVAVLSMP
Subjt:  KLFQDLCKNVVAERNLYNYECQKMRKYCKHRRHRWMASLKHDYFNTPWALISFIAAVVLLLLTLMQAVVAVLSMP

A0A6J1BR42 UPF0481 protein At3g47200-like (e value: 2.2e-75; % identity: 85.14)
Query:  PPTATELYDYGISFEKKSHYSQKMFDERTGILRVPHIIINETFESTMRNIIAYEYTIRKSPGVSNFLMFMRFLLNSDNDVNLLIKEGIIHNHLESAKEVT
        PPTATELYDYGISFEKKSHYSQKMFDER GILR+PHIIINETFES +RNIIA+E+  RK   VSNF +FMRFLLNSDNDV LLIKEGIIHNHLES K VT
Subjt:  PPTATELYDYGISFEKKSHYSQKMFDERTGILRVPHIIINETFESTMRNIIAYEYTIRKSPGVSNFLMFMRFLLNSDNDVNLLIKEGIIHNHLESAKEVT

Query:  KLFQDLCKNVVAERNLYNYECQKMRKYCKHRRHRWMASLKHDYFNTPWALISFIAAVVLLLLTLMQAVVAVLSMP
        KLF DLC+NVV E NLYNYEC++MR+Y KHRRHRWMASLK DYFNTPWALISFIAAVVLLLLTLMQ VVAVLSMP
Subjt:  KLFQDLCKNVVAERNLYNYECQKMRKYCKHRRHRWMASLKHDYFNTPWALISFIAAVVLLLLTLMQAVVAVLSMP

SwissProt top hits (e value, % identity, alignment)
P0C897 Putative UPF0481 protein At3g02645 (e value: 1.6e-09; % identity: 26.14)
Query:  PTATELYDYGISFEKKSH--YSQKMFDERTGILRVPHIIINETFESTMRNIIAYEYTIRKSPGV-SNFLMFMRFLLNSDNDVNLLIKEGIIHNHLESAKE
        P+ ++L+  G+ F+  +H   S   FD  +G   +P I ++   E+ +RN++AYE T    P V + +   +  +++S+ DV LL ++G++ + L+S +E
Subjt:  PTATELYDYGISFEKKSH--YSQKMFDERTGILRVPHIIINETFESTMRNIIAYEYTIRKSPGV-SNFLMFMRFLLNSDNDVNLLIKEGIIHNHLESAKE

Query:  VTKLFQDLCKNV-VAERNLYNYECQKMRKYCKHRRHRWMASLKHDYFNTPWALISFIAAVVLLLLTLMQAVVAVLS
          +++  + K+V + +    +   + + +Y   R    +  L   Y    W +++F+AAV+LL+L  +Q    V S
Subjt:  VTKLFQDLCKNV-VAERNLYNYECQKMRKYCKHRRHRWMASLKHDYFNTPWALISFIAAVVLLLLTLMQAVVAVLS

Q9SD53 UPF0481 protein At3g47200 (e value: 2.4e-10; % identity: 28.81)
Query:  TATELYDYGISFEKKSHYSQKMFDER--TGILRVPHIIINETFESTMRNIIAYE-YTIRKSPGVSNFLMFMRFLLNSDNDVNLLIKEG-IIHNHLESAKE
        +A  L   GI F  +      + + R     L++P +  +    S   N +A+E +    S  ++ +++FM  LLN++ DV  L  +  II NH  S  E
Subjt:  TATELYDYGISFEKKSHYSQKMFDER--TGILRVPHIIINETFESTMRNIIAYE-YTIRKSPGVSNFLMFMRFLLNSDNDVNLLIKEG-IIHNHLESAKE

Query:  VTKLFQDLCKNVVAE--RNLYNYECQKMRKYCKHRRHRWMASLKHDYFNTPWALISFIAAVVLLLLTLMQAVVAVLS
        V++ F+ + K+VV E   +  N   + + +Y K   +   A  +H +F +PW  +S  A + ++LLT++Q+ VA+LS
Subjt:  VTKLFQDLCKNVVAE--RNLYNYECQKMRKYCKHRRHRWMASLKHDYFNTPWALISFIAAVVLLLLTLMQAVVAVLS

Arabidopsis top hits (e value, % identity, alignment)
AT3G50120.1 Plant protein of unknown function (DUF247) (e value: 3.1e-21; % identity: 37.06)
Query:  TELYDYGISFEKKSHYSQKMFDERTGILRVPHIIINETFESTMRNIIAYEYT-IRKSPGVSNFLMFMRFLLNSDNDVNLLIKEGIIHNHLESAKEVTKLF
        TEL + GI F ++          + G L +P ++I++  +S   N+IA+E   I  S  ++++++FM  L++S  DV+ L   GII + L S  EV  LF
Subjt:  TELYDYGISFEKKSHYSQKMFDERTGILRVPHIIINETFESTMRNIIAYEYT-IRKSPGVSNFLMFMRFLLNSDNDVNLLIKEGIIHNHLESAKEVTKLF

Query:  QDLCKNVV--AERNLYNYECQKMRKYCKHRRHRWMASLKHDYFNTPWALISFIAAVVLLLLTLMQAVVAV
          LC+ VV   E +  +    ++ +Y  H+ + W A+LKH YFN PWA++SF AAV+LL+LT  Q+  AV
Subjt:  QDLCKNVV--AERNLYNYECQKMRKYCKHRRHRWMASLKHDYFNTPWALISFIAAVVLLLLTLMQAVVAV

AT3G50150.1 Plant protein of unknown function (DUF247) (e value: 2.2e-19; % identity: 35.67)
Query:  TELYDYGISFEKKSHYSQKMFDERTGILRVPHIIINETFESTMRNIIAYEYT-IRKSPGVSNFLMFMRFLLNSDNDVNLLIKEGIIHNHLESAKEVTKLF
        TEL   G++F +K        + + G L++P ++I++  +S   N+IA+E    + S  ++++++FM  L+NS  DV+ L  +GII + L S  EV  LF
Subjt:  TELYDYGISFEKKSHYSQKMFDERTGILRVPHIIINETFESTMRNIIAYEYT-IRKSPGVSNFLMFMRFLLNSDNDVNLLIKEGIIHNHLESAKEVTKLF

Query:  QDLCKNVVAERNLYNYECQKMRKYCKHRRHRW---MASLKHDYFNTPWALISFIAAVVLLLLTLMQAVVAV
          LCK V+ +     Y  Q  R+  ++   +W    A+L+  YFN PWA  SF AAV+LL LT  Q+  AV
Subjt:  QDLCKNVVAERNLYNYECQKMRKYCKHRRHRW---MASLKHDYFNTPWALISFIAAVVLLLLTLMQAVVAV

AT3G50160.1 Plant protein of unknown function (DUF247) (e value: 6.5e-19; % identity: 35.47)
Query:  TELYDYGISFEKKSHYSQKMFDERTGILRVPHIIINETFESTMRNIIAYEYT-IRKSPGVSNFLMFMRFLLNSDNDVNLLIKEGIIHNHLESAKEVTKLF
        TEL + G+ F +K        + + G L++P ++I++  +S   N+IA+E   I+ S  ++++++FM  L+NS  DV+ L   GII N L S  EV+ LF
Subjt:  TELYDYGISFEKKSHYSQKMFDERTGILRVPHIIINETFESTMRNIIAYEYT-IRKSPGVSNFLMFMRFLLNSDNDVNLLIKEGIIHNHLESAKEVTKLF

Query:  QDLCKNVVAERN--LYNYECQKMRKYCKHRRHRWMASLKHDYFNTPWALISFIAAVVLLLLTLMQAVVAVLS
          L K V+ + N    +    ++  Y + + +   A+L+H YFN PWA  SFIAAV LL+ T  Q+  AV +
Subjt:  QDLCKNVVAERN--LYNYECQKMRKYCKHRRHRWMASLKHDYFNTPWALISFIAAVVLLLLTLMQAVVAVLS

AT3G50170.1 Plant protein of unknown function (DUF247) (e value: 1.9e-18; % identity: 36.47)
Query:  TELYDYGISFEKKSHYSQKMFDERTGILRVPHIIINETFESTMRNIIAYEYT-IRKSPGVSNFLMFMRFLLNSDNDVNLLIKEGIIHNHLESAKEVTKLF
        TEL + G+ F K+        + + G L +P ++I++  +S   N+IA+E   I  S  ++++++FM  L+NS  DV+ L   GII + L S  EV  LF
Subjt:  TELYDYGISFEKKSHYSQKMFDERTGILRVPHIIINETFESTMRNIIAYEYT-IRKSPGVSNFLMFMRFLLNSDNDVNLLIKEGIIHNHLESAKEVTKLF

Query:  QDLCKNVVAERNLYNYE--CQKMRKYCKHRRHRWMASLKHDYFNTPWALISFIAAVVLLLLTLMQAVVAV
          LC+ VV +    +       + +Y   + +   A+L H YFN PWA  SF AAV+LLLLTL Q+  AV
Subjt:  QDLCKNVVAERNLYNYE--CQKMRKYCKHRRHRWMASLKHDYFNTPWALISFIAAVVLLLLTLMQAVVAV

AT4G31980.1 unknown protein (e value: 9.6e-23; % identity: 34.66)
Query:  PTATELYDYGISFEKKSHYSQKMFD--ERTGILRVPHIIINETFESTMRNIIAYEYTIRKSPGVSNFLMFMRFLLNSDNDVNLLIKEGIIHNHLESAKEV
        P ATEL+  G+ F K +  S  + D     G+L++P I++++  ES  +NII +E     +    +++M +   + S  D +LLI  GII N+L ++ +V
Subjt:  PTATELYDYGISFEKKSHYSQKMFD--ERTGILRVPHIIINETFESTMRNIIAYEYTIRKSPGVSNFLMFMRFLLNSDNDVNLLIKEGIIHNHLESAKEV

Query:  TKLFQDLCKNVVAERNLY-NYECQKMRKYCKHRRHRWMASLKHDYFNTPWALISFIAAVVLLLLTLMQAVVAVLSM
        + LF  + K V+ +R  Y +   + ++ YC    +RW A L+ DYF+ PWA+ S  AA++LLLLT +Q+V ++L++
Subjt:  TKLFQDLCKNVVAERNLY-NYECQKMRKYCKHRRHRWMASLKHDYFNTPWALISFIAAVVLLLLTLMQAVVAVLSM


Sequences
CDS sequence
CCACCCACTGCCACTGAGCTTTACGATTACGGCATTTCTTTCGAGAAGAAATCACATTATTCTCAAAAGATGTTTGATGAACGTACCGGCATTCTCAGAGTGCCTCACAT
CATAATAAATGAGACTTTCGAAAGCACGATGAGAAACATCATAGCTTACGAGTATACAATTCGCAAGAGTCCAGGCGTAAGCAACTTCTTGATGTTCATGCGTTTCTTGT
TGAACTCCGACAACGATGTAAATTTGCTCATAAAGGAGGGGATTATCCACAACCATTTGGAAAGCGCAAAGGAAGTTACTAAGTTGTTCCAGGACCTTTGTAAGAACGTT
GTGGCCGAAAGAAATTTGTACAACTATGAATGTCAGAAAATGAGAAAATACTGCAAGCACCGCCGCCATCGATGGATGGCTTCGTTGAAACACGACTATTTTAACACGCC
GTGGGCTTTGATCTCCTTCATCGCTGCCGTCGTCCTGCTTTTACTCACTCTCATGCAAGCGGTGGTAGCTGTACTCTCCATGCCT
mRNA sequence
CCACCCACTGCCACTGAGCTTTACGATTACGGCATTTCTTTCGAGAAGAAATCACATTATTCTCAAAAGATGTTTGATGAACGTACCGGCATTCTCAGAGTGCCTCACAT
CATAATAAATGAGACTTTCGAAAGCACGATGAGAAACATCATAGCTTACGAGTATACAATTCGCAAGAGTCCAGGCGTAAGCAACTTCTTGATGTTCATGCGTTTCTTGT
TGAACTCCGACAACGATGTAAATTTGCTCATAAAGGAGGGGATTATCCACAACCATTTGGAAAGCGCAAAGGAAGTTACTAAGTTGTTCCAGGACCTTTGTAAGAACGTT
GTGGCCGAAAGAAATTTGTACAACTATGAATGTCAGAAAATGAGAAAATACTGCAAGCACCGCCGCCATCGATGGATGGCTTCGTTGAAACACGACTATTTTAACACGCC
GTGGGCTTTGATCTCCTTCATCGCTGCCGTCGTCCTGCTTTTACTCACTCTCATGCAAGCGGTGGTAGCTGTACTCTCCATGCCT
Protein sequence
PPTATELYDYGISFEKKSHYSQKMFDERTGILRVPHIIINETFESTMRNIIAYEYTIRKSPGVSNFLMFMRFLLNSDNDVNLLIKEGIIHNHLESAKEVTKLFQDLCKNV
VAERNLYNYECQKMRKYCKHRRHRWMASLKHDYFNTPWALISFIAAVVLLLLTLMQAVVAVLSMP
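
The relationship between the CDS and protein sequences above can be checked by translating the CDS codon by codon. The following is a minimal sketch, assuming the standard nuclear genetic code and that the CDS starts in frame at its first base; `translate` is a hypothetical helper, not part of any CuGenDB tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate a DNA coding sequence in frame 1.

    Whitespace and line breaks are ignored, unknown codons become 'X',
    and a trailing partial codon is dropped.
    """
    seq = "".join(cds.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3)
    )

# First 30 bases of the CDS sequence listed above.
print(translate("CCACCCACTGCCACTGAGCTTTACGATTAC"))
# PPTATELYDY
```

The ten residues produced from the first 30 bases match the start of the protein sequence listed above; pasting the full CDS (line breaks included) into `translate` allows the same comparison over the whole record.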