; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS010735 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS010735
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionUnknown protein
Genome locationscaffold35:1648131..1648772
RNA-Seq ExpressionMS010735
SyntenyMS010735
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK13010.1 uncharacterized protein E5676_scaffold255G006090 [Cucumis melo var. makuwa]3.9e-6870.56Show/hide
Query:  MKKLCRKSTVHPSPPIISDFLSFLPAAILALTLALSADDKEVLAYLISCSSSGSNATASNFSGSRKASRKHGGGKGGVDHAPLFDCDCFMCYRRYWARWD
        MKKLCRKSTVHPSPPIISDFLSFLPAAI ALT+ALSADDKEVLAYLISCS+  S A+ SN SGSRK  RK    K G+DHAPLFDCDCFMCYRRYWARWD
Subjt:  MKKLCRKSTVHPSPPIISDFLSFLPAAILALTLALSADDKEVLAYLISCSSSGSNATASNFSGSRKASRKHGGGKGGVDHAPLFDCDCFMCYRRYWARWD

Query:  ASPNRQLIHEIIDAYEEGLAKTKGTGSGATPRNCKRERRKRNTESAPGESNRSEVKGEGSDSRPLETGRDANGGERNRKKDDGQEEEEEEEEESRGSVRR
        +SPNRQLIHEIIDAYE+GL K+K T S  T RNCK+ERRK+N ES  GES+     G+G  +  L       G +RN K+   +EEE+EE  E RGSVRR
Subjt:  ASPNRQLIHEIIDAYEEGLAKTKGTGSGATPRNCKRERRKRNTESAPGESNRSEVKGEGSDSRPLETGRDANGGERNRKKDDGQEEEEEEEEESRGSVRR

Query:  FVSFVGEKIWSAWG
        FVSFVGEKIW AWG
Subjt:  FVSFVGEKIWSAWG

XP_004134788.1 uncharacterized protein LOC101204826 [Cucumis sativus]7.8e-6971.5Show/hide
Query:  MKKLCRKSTVHPSPPIISDFLSFLPAAILALTLALSADDKEVLAYLISCSSSGSNATASNFSGSRKASRKHGGGKGGVDHAPLFDCDCFMCYRRYWARWD
        MKKLCRKSTVHPSPPIISDFLSFLPAAI ALTLALSADDKEVLAYLISCS+  S A+ SN SG RK  RK    K GVDHAPLFDCDCFMCYRRYWARWD
Subjt:  MKKLCRKSTVHPSPPIISDFLSFLPAAILALTLALSADDKEVLAYLISCSSSGSNATASNFSGSRKASRKHGGGKGGVDHAPLFDCDCFMCYRRYWARWD

Query:  ASPNRQLIHEIIDAYEEGLAKTKGTGSGATPRNCKRERRKRNTESAPGESNRSEVKGEGSDSRPLETGRDANGGERNRKKDDGQEEEEEEEEESRGSVRR
        +SPNRQLIHEIIDAYE+GL K+K T S  T RNCK+ERRK+N ES  GES+     G+G  +  L       G +RN K+   +EEEEE E E RGSVRR
Subjt:  ASPNRQLIHEIIDAYEEGLAKTKGTGSGATPRNCKRERRKRNTESAPGESNRSEVKGEGSDSRPLETGRDANGGERNRKKDDGQEEEEEEEEESRGSVRR

Query:  FVSFVGEKIWSAWG
        FVSFVGEKIW AWG
Subjt:  FVSFVGEKIWSAWG

XP_008440055.1 PREDICTED: uncharacterized protein LOC103484646 [Cucumis melo]2.3e-6871.03Show/hide
Query:  MKKLCRKSTVHPSPPIISDFLSFLPAAILALTLALSADDKEVLAYLISCSSSGSNATASNFSGSRKASRKHGGGKGGVDHAPLFDCDCFMCYRRYWARWD
        MKKLCRKSTVHPSPPIISDFLSFLPAAI ALT+ALSADDKEVLAYLISCS+  S A+ SN SGSRK  RK    K G+DHAPLFDCDCFMCYRRYWARWD
Subjt:  MKKLCRKSTVHPSPPIISDFLSFLPAAILALTLALSADDKEVLAYLISCSSSGSNATASNFSGSRKASRKHGGGKGGVDHAPLFDCDCFMCYRRYWARWD

Query:  ASPNRQLIHEIIDAYEEGLAKTKGTGSGATPRNCKRERRKRNTESAPGESNRSEVKGEGSDSRPLETGRDANGGERNRKKDDGQEEEEEEEEESRGSVRR
        +SPNRQLIHEIIDAYE+GL K+K T S  T RNCK+ERRK+N ES  GES+     G+G  +  L       G +RN K+   +EEEEEE  E RGSVRR
Subjt:  ASPNRQLIHEIIDAYEEGLAKTKGTGSGATPRNCKRERRKRNTESAPGESNRSEVKGEGSDSRPLETGRDANGGERNRKKDDGQEEEEEEEEESRGSVRR

Query:  FVSFVGEKIWSAWG
        FVSFVGEKIW AWG
Subjt:  FVSFVGEKIWSAWG

XP_022132898.1 uncharacterized protein LOC111005626 [Momordica charantia]1.5e-11299.53Show/hide
Query:  MKKLCRKSTVHPSPPIISDFLSFLPAAILALTLALSADDKEVLAYLISCSSSGSNATASNFSGSRKASRKHGGGKGGVDHAPLFDCDCFMCYRRYWARWD
        MKKLCRKSTVHPSPPIISDFLSFLPAAILALTLALSADDKEVLAYLISCSSSGSNATASNFSG+RKASRKHGGGKGGVDHAPLFDCDCFMCYRRYWARWD
Subjt:  MKKLCRKSTVHPSPPIISDFLSFLPAAILALTLALSADDKEVLAYLISCSSSGSNATASNFSGSRKASRKHGGGKGGVDHAPLFDCDCFMCYRRYWARWD

Query:  ASPNRQLIHEIIDAYEEGLAKTKGTGSGATPRNCKRERRKRNTESAPGESNRSEVKGEGSDSRPLETGRDANGGERNRKKDDGQEEEEEEEEESRGSVRR
        ASPNRQLIHEIIDAYEEGLAKTKGTGSGATPRNCKRERRKRNTESAPGESNRSEVKGEGSDSRPLETGRDANGGERNRKKDDGQEEEEEEEEESRGSVRR
Subjt:  ASPNRQLIHEIIDAYEEGLAKTKGTGSGATPRNCKRERRKRNTESAPGESNRSEVKGEGSDSRPLETGRDANGGERNRKKDDGQEEEEEEEEESRGSVRR

Query:  FVSFVGEKIWSAWG
        FVSFVGEKIWSAWG
Subjt:  FVSFVGEKIWSAWG

XP_038882712.1 uncharacterized protein LOC120073876 [Benincasa hispida]9.2e-7071.76Show/hide
Query:  MKKLCRKSTVHPSPPIISDFLSFLPAAILALTLALSADDKEVLAYLISCSSSGSNATASNFSGSRKASRKHGGGKGGVDHAPLFDCDCFMCYRRYWARWD
        MKKLCRKSTVHPSPPIISDFLSFLPAAI ALT+ALSADDKEVLAYLISCS+  + A+ SN SGSRK +RK   GK GVDHAP+FDCDCFMCYRRYWARWD
Subjt:  MKKLCRKSTVHPSPPIISDFLSFLPAAILALTLALSADDKEVLAYLISCSSSGSNATASNFSGSRKASRKHGGGKGGVDHAPLFDCDCFMCYRRYWARWD

Query:  ASPNRQLIHEIIDAYEEGLAKTKGTGSGATPRNCKRERRKRNTESAPGESNRSEVKGEG--SDSRPLETGRDANGGERNRKKDDGQEEEEEEEEESRGSV
        +SPNRQLIHEIIDAYE+GL K+K T S  T RNCK+ERRK+N ESA GES+  + K     SDS   +TGR     +RN K    +EEEE+EE   RGSV
Subjt:  ASPNRQLIHEIIDAYEEGLAKTKGTGSGATPRNCKRERRKRNTESAPGESNRSEVKGEG--SDSRPLETGRDANGGERNRKKDDGQEEEEEEEEESRGSV

Query:  RRFVSFVGEKIWSAWG
        RRFVSFVGEKIW AWG
Subjt:  RRFVSFVGEKIWSAWG

TrEMBL top hitse value%identityAlignment
A0A0A0KMY4 Uncharacterized protein3.8e-6971.5Show/hide
Query:  MKKLCRKSTVHPSPPIISDFLSFLPAAILALTLALSADDKEVLAYLISCSSSGSNATASNFSGSRKASRKHGGGKGGVDHAPLFDCDCFMCYRRYWARWD
        MKKLCRKSTVHPSPPIISDFLSFLPAAI ALTLALSADDKEVLAYLISCS+  S A+ SN SG RK  RK    K GVDHAPLFDCDCFMCYRRYWARWD
Subjt:  MKKLCRKSTVHPSPPIISDFLSFLPAAILALTLALSADDKEVLAYLISCSSSGSNATASNFSGSRKASRKHGGGKGGVDHAPLFDCDCFMCYRRYWARWD

Query:  ASPNRQLIHEIIDAYEEGLAKTKGTGSGATPRNCKRERRKRNTESAPGESNRSEVKGEGSDSRPLETGRDANGGERNRKKDDGQEEEEEEEEESRGSVRR
        +SPNRQLIHEIIDAYE+GL K+K T S  T RNCK+ERRK+N ES  GES+     G+G  +  L       G +RN K+   +EEEEE E E RGSVRR
Subjt:  ASPNRQLIHEIIDAYEEGLAKTKGTGSGATPRNCKRERRKRNTESAPGESNRSEVKGEGSDSRPLETGRDANGGERNRKKDDGQEEEEEEEEESRGSVRR

Query:  FVSFVGEKIWSAWG
        FVSFVGEKIW AWG
Subjt:  FVSFVGEKIWSAWG

A0A1S3B0U5 uncharacterized protein LOC1034846461.1e-6871.03Show/hide
Query:  MKKLCRKSTVHPSPPIISDFLSFLPAAILALTLALSADDKEVLAYLISCSSSGSNATASNFSGSRKASRKHGGGKGGVDHAPLFDCDCFMCYRRYWARWD
        MKKLCRKSTVHPSPPIISDFLSFLPAAI ALT+ALSADDKEVLAYLISCS+  S A+ SN SGSRK  RK    K G+DHAPLFDCDCFMCYRRYWARWD
Subjt:  MKKLCRKSTVHPSPPIISDFLSFLPAAILALTLALSADDKEVLAYLISCSSSGSNATASNFSGSRKASRKHGGGKGGVDHAPLFDCDCFMCYRRYWARWD

Query:  ASPNRQLIHEIIDAYEEGLAKTKGTGSGATPRNCKRERRKRNTESAPGESNRSEVKGEGSDSRPLETGRDANGGERNRKKDDGQEEEEEEEEESRGSVRR
        +SPNRQLIHEIIDAYE+GL K+K T S  T RNCK+ERRK+N ES  GES+     G+G  +  L       G +RN K+   +EEEEEE  E RGSVRR
Subjt:  ASPNRQLIHEIIDAYEEGLAKTKGTGSGATPRNCKRERRKRNTESAPGESNRSEVKGEGSDSRPLETGRDANGGERNRKKDDGQEEEEEEEEESRGSVRR

Query:  FVSFVGEKIWSAWG
        FVSFVGEKIW AWG
Subjt:  FVSFVGEKIWSAWG

A0A5D3CNJ0 Uncharacterized protein1.9e-6870.56Show/hide
Query:  MKKLCRKSTVHPSPPIISDFLSFLPAAILALTLALSADDKEVLAYLISCSSSGSNATASNFSGSRKASRKHGGGKGGVDHAPLFDCDCFMCYRRYWARWD
        MKKLCRKSTVHPSPPIISDFLSFLPAAI ALT+ALSADDKEVLAYLISCS+  S A+ SN SGSRK  RK    K G+DHAPLFDCDCFMCYRRYWARWD
Subjt:  MKKLCRKSTVHPSPPIISDFLSFLPAAILALTLALSADDKEVLAYLISCSSSGSNATASNFSGSRKASRKHGGGKGGVDHAPLFDCDCFMCYRRYWARWD

Query:  ASPNRQLIHEIIDAYEEGLAKTKGTGSGATPRNCKRERRKRNTESAPGESNRSEVKGEGSDSRPLETGRDANGGERNRKKDDGQEEEEEEEEESRGSVRR
        +SPNRQLIHEIIDAYE+GL K+K T S  T RNCK+ERRK+N ES  GES+     G+G  +  L       G +RN K+   +EEE+EE  E RGSVRR
Subjt:  ASPNRQLIHEIIDAYEEGLAKTKGTGSGATPRNCKRERRKRNTESAPGESNRSEVKGEGSDSRPLETGRDANGGERNRKKDDGQEEEEEEEEESRGSVRR

Query:  FVSFVGEKIWSAWG
        FVSFVGEKIW AWG
Subjt:  FVSFVGEKIWSAWG

A0A6J1BXK4 uncharacterized protein LOC1110056267.3e-11399.53Show/hide
Query:  MKKLCRKSTVHPSPPIISDFLSFLPAAILALTLALSADDKEVLAYLISCSSSGSNATASNFSGSRKASRKHGGGKGGVDHAPLFDCDCFMCYRRYWARWD
        MKKLCRKSTVHPSPPIISDFLSFLPAAILALTLALSADDKEVLAYLISCSSSGSNATASNFSG+RKASRKHGGGKGGVDHAPLFDCDCFMCYRRYWARWD
Subjt:  MKKLCRKSTVHPSPPIISDFLSFLPAAILALTLALSADDKEVLAYLISCSSSGSNATASNFSGSRKASRKHGGGKGGVDHAPLFDCDCFMCYRRYWARWD

Query:  ASPNRQLIHEIIDAYEEGLAKTKGTGSGATPRNCKRERRKRNTESAPGESNRSEVKGEGSDSRPLETGRDANGGERNRKKDDGQEEEEEEEEESRGSVRR
        ASPNRQLIHEIIDAYEEGLAKTKGTGSGATPRNCKRERRKRNTESAPGESNRSEVKGEGSDSRPLETGRDANGGERNRKKDDGQEEEEEEEEESRGSVRR
Subjt:  ASPNRQLIHEIIDAYEEGLAKTKGTGSGATPRNCKRERRKRNTESAPGESNRSEVKGEGSDSRPLETGRDANGGERNRKKDDGQEEEEEEEEESRGSVRR

Query:  FVSFVGEKIWSAWG
        FVSFVGEKIWSAWG
Subjt:  FVSFVGEKIWSAWG

A0A6J1IPN3 uncharacterized protein LOC1114788014.8e-6466.51Show/hide
Query:  MKKLCRKSTVHPSPPIISDFLSFLPAAILALTLALSADDKEVLAYLISCSSSGSNATASNFSGSRKASRKHGGGKGGVDHAPLFDCDCFMCYRRYWARWD
        MKKLCRK+TVHPSPPIISDFLSFLPA I  LT+ALSADDKEVLAYLISCS+  ++A+ SN S +RK+ RK   GK GVDHAPLFDCDCFMCYRRYWARWD
Subjt:  MKKLCRKSTVHPSPPIISDFLSFLPAAILALTLALSADDKEVLAYLISCSSSGSNATASNFSGSRKASRKHGGGKGGVDHAPLFDCDCFMCYRRYWARWD

Query:  ASPNRQLIHEIIDAYEEGLAKTKGTGSGATPRNCKRERRKRNTESAPGESNRSEVK-GEGSDSRPLETGRDANGGERNRKKDDGQEEEEEEEEESRGSVR
        +SPNRQLIHEII+AYE+GLAK KGT S    RN K+ERRK+N ES   ES+  + K  E S+S   E+ RD NG              ++ E E RGSV 
Subjt:  ASPNRQLIHEIIDAYEEGLAKTKGTGSGATPRNCKRERRKRNTESAPGESNRSEVK-GEGSDSRPLETGRDANGGERNRKKDDGQEEEEEEEEESRGSVR

Query:  RFVSFVGEKIWSAWG
        RFVSFVGEKIWSAWG
Subjt:  RFVSFVGEKIWSAWG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G12020.1 unknown protein1.1e-3141.56Show/hide
Query:  MKKLCRKSTVHPSPPIISD---FLSFLPAAILALTLALSADDKEVLAYLISCSSSGSNATASNFSGSRKASRKHGGGKGGVDHAPLFDCDCFMCYRRYWA
        MKKL RK TVHPSPP I      L+ LP AI +L   LS +D+EVLAYLIS +S       ++     KA +K        +H+PLF CDCF CY  YW 
Subjt:  MKKLCRKSTVHPSPPIISD---FLSFLPAAILALTLALSADDKEVLAYLISCSSSGSNATASNFSGSRKASRKHGGGKGGVDHAPLFDCDCFMCYRRYWA

Query:  RWDASPNRQLIHEIIDAYEEGLAKTKGTGSGATPRNCKRERRKRNTE---SAPGESNRSEVK---GEG--------SDSRPLETGRDANGGERNRKKDDG
        RWD+SP+RQLIHEIIDA+E+ L K K      T +  +R+R  +++    S+   ++ SE+    GE         S S   + G   +GG    +    
Subjt:  RWDASPNRQLIHEIIDAYEEGLAKTKGTGSGATPRNCKRERRKRNTE---SAPGESNRSEVK---GEG--------SDSRPLETGRDANGGERNRKKDDG

Query:  QEEEEEEEEESRGSVRRFVSFVGEKIWSAWG
         +  E+ EEE +G+VRRFVSF+GEK++  WG
Subjt:  QEEEEEEEEESRGSVRRFVSFVGEKIWSAWG

AT1G24270.1 unknown protein1.1e-2040.65Show/hide
Query:  KLCRKSTVHPSPPIIS-------DFLS---FLPAAILALTLALSADDKEVLAYLISCSSSGSNATASNFSGSRKASRKHGGGKGGVDHAPLFDCDCFMCY
        K+ +K  VHPSPP+ S       D LS    L +AIL L   LSA+D EVLAYLI+ S + +N        S K  R H         APL DC CF CY
Subjt:  KLCRKSTVHPSPPIIS-------DFLS---FLPAAILALTLALSADDKEVLAYLISCSSSGSNATASNFSGSRKASRKHGGGKGGVDHAPLFDCDCFMCY

Query:  RRYWARWDASPNRQLIHEIIDAYEEGLAKTKGTGSGATPRNCKRERRKRNTESAP
          YW++WD+S NR+LI++II+A+E+ L + + + S  + +N KR ++   +E  P
Subjt:  RRYWARWDASPNRQLIHEIIDAYEEGLAKTKGTGSGATPRNCKRERRKRNTESAP

AT1G62422.1 unknown protein8.1e-3243.58Show/hide
Query:  MKKLCRKSTVHPSPP--IISD--FLSFLPAAILALTLALSADDKEVLAYLISCSSSGSNATASNFSGSRKASRKHGGGKGGVDHAPLFDCDCFMCYRRYW
        MKKLCRK TVHPSPP  I +D  FLS LP AIL+L  ALS +D+EVLAYLI  S+SG +   S    +++ +           H+PLF CDCF CY  YW
Subjt:  MKKLCRKSTVHPSPP--IISD--FLSFLPAAILALTLALSADDKEVLAYLISCSSSGSNATASNFSGSRKASRKHGGGKGGVDHAPLFDCDCFMCYRRYW

Query:  ARWDASPNRQLIHEIIDAYEEGLAKTKGTGSGATPRNCKRERRKRNTESAPGESNRSEVKGEGSDSRPLETGRDANGGERNRKKDDGQEEEEEEEEESRG
         RWD SP RQLIHEIIDAYE+ L   K           K++RRKR+ ++    S R    G    S    +  +  GG+  +  + G EE E+E    +G
Subjt:  ARWDASPNRQLIHEIIDAYEEGLAKTKGTGSGATPRNCKRERRKRNTESAPGESNRSEVKGEGSDSRPLETGRDANGGERNRKKDDGQEEEEEEEEESRG

Query:  SVRRFVSFVGEKIWSAWG
        SV + +SF+G++    WG
Subjt:  SVRRFVSFVGEKIWSAWG

AT5G13090.1 unknown protein1.3e-1832.24Show/hide
Query:  KLCRKSTVHPSPP--------------------IISDFLSFLPAAILALTLALSADDKEVLAYLISCSSSGSNATASNFSGSRKASRKHGGGKGGVDHAP
        K+ +K  V+PSPP                         L  LPA IL L   LS++++EVLAYLI   + G+  +    S S+  ++K           P
Subjt:  KLCRKSTVHPSPP--------------------IISDFLSFLPAAILALTLALSADDKEVLAYLISCSSSGSNATASNFSGSRKASRKHGGGKGGVDHAP

Query:  LFDCDCFMCYRRYWARWDASPNRQLIHEIIDAYEEGLAKTKGTGSGATPRNCKRERRKRNTESAPGESNRSEVKGEGSDSRPL
        +FDC+CF CY  YW RWD+SPNR+LIHEII+A+E    +        + R  K+E+  R    +  +           DS+P+
Subjt:  LFDCDCFMCYRRYWARWDASPNRQLIHEIIDAYEEGLAKTKGTGSGATPRNCKRERRKRNTESAPGESNRSEVKGEGSDSRPL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGAAGCTCTGCCGCAAAAGCACCGTCCATCCCTCGCCGCCGATCATCTCCGACTTCCTCTCGTTTCTCCCCGCCGCCATACTCGCCCTAACCCTGGCGCTCTCCGC
CGACGACAAAGAGGTCCTCGCCTACCTCATCTCCTGTTCCAGTTCCGGTTCCAACGCCACCGCCTCCAACTTCTCCGGCTCCCGCAAGGCCAGCCGCAAACACGGCGGCG
GGAAGGGCGGTGTCGACCACGCTCCGCTCTTCGACTGCGATTGTTTCATGTGCTACCGGCGGTATTGGGCCAGATGGGACGCGTCGCCCAATCGCCAACTTATTCACGAA
ATAATCGACGCATACGAGGAGGGATTGGCCAAAACCAAAGGCACTGGATCCGGAGCCACGCCGAGGAACTGTAAGAGAGAGAGAAGGAAGAGGAATACCGAGTCGGCACC
CGGTGAGTCGAATCGGTCCGAGGTGAAGGGCGAGGGGTCCGATTCTCGTCCGTTGGAGACCGGCCGCGACGCTAATGGCGGCGAGAGAAACCGCAAAAAAGATGACGGAC
AAGAAGAAGAAGAAGAAGAAGAAGAAGAATCAAGAGGATCGGTGAGAAGGTTTGTGAGTTTTGTAGGGGAGAAAATTTGGAGTGCTTGGGGG
mRNA sequenceShow/hide mRNA sequence
ATGAAGAAGCTCTGCCGCAAAAGCACCGTCCATCCCTCGCCGCCGATCATCTCCGACTTCCTCTCGTTTCTCCCCGCCGCCATACTCGCCCTAACCCTGGCGCTCTCCGC
CGACGACAAAGAGGTCCTCGCCTACCTCATCTCCTGTTCCAGTTCCGGTTCCAACGCCACCGCCTCCAACTTCTCCGGCTCCCGCAAGGCCAGCCGCAAACACGGCGGCG
GGAAGGGCGGTGTCGACCACGCTCCGCTCTTCGACTGCGATTGTTTCATGTGCTACCGGCGGTATTGGGCCAGATGGGACGCGTCGCCCAATCGCCAACTTATTCACGAA
ATAATCGACGCATACGAGGAGGGATTGGCCAAAACCAAAGGCACTGGATCCGGAGCCACGCCGAGGAACTGTAAGAGAGAGAGAAGGAAGAGGAATACCGAGTCGGCACC
CGGTGAGTCGAATCGGTCCGAGGTGAAGGGCGAGGGGTCCGATTCTCGTCCGTTGGAGACCGGCCGCGACGCTAATGGCGGCGAGAGAAACCGCAAAAAAGATGACGGAC
AAGAAGAAGAAGAAGAAGAAGAAGAAGAATCAAGAGGATCGGTGAGAAGGTTTGTGAGTTTTGTAGGGGAGAAAATTTGGAGTGCTTGGGGG
Protein sequenceShow/hide protein sequence
MKKLCRKSTVHPSPPIISDFLSFLPAAILALTLALSADDKEVLAYLISCSSSGSNATASNFSGSRKASRKHGGGKGGVDHAPLFDCDCFMCYRRYWARWDASPNRQLIHE
IIDAYEEGLAKTKGTGSGATPRNCKRERRKRNTESAPGESNRSEVKGEGSDSRPLETGRDANGGERNRKKDDGQEEEEEEEEESRGSVRRFVSFVGEKIWSAWG