CuGenDBv2

Spg037722 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg037722
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Unknown protein
Genome location: scaffold1:33925723..33926829
RNA-Seq Expression: Spg037722
Synteny: Spg037722
Gene Ontology terms: NA
InterPro domains: NA
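
The genome location above is written as `scaffold:start..end`. A minimal sketch of splitting such a string into its parts follows. The function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes 1-based inclusive coordinates, which this page does not state.

```python
import re

# Pattern for strings such as "scaffold1:33925723..33926829".
LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(location: str) -> tuple[str, int, int]:
    """Split 'seqid:start..end' into (seqid, start, end)."""
    match = LOCATION_RE.match(location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match["seqid"], int(match["start"]), int(match["end"])


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return abs(end - start) + 1


seqid, start, end = parse_location("scaffold1:33925723..33926829")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the span is 1107 bp; with 0-based half-open coordinates it would be one base shorter.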


Homology
GenBank top hits
KAE8695166.1 hypothetical protein F3Y22_tig00110733pilonHSYRG00282 [Hibiscus syriacus] | e value: 2.4e-25 | % identity: 30.72
Query:  SPRKFISVAAASMFEELKNRELMLERGF------NSNLESLPHMLAVTIFFQNWQKRCSKLEPAVANIVKEFYANFQDNGIWMTKVWGQLVMWSPTAINE
        SP  F+  AA   ++ +KNR +  E GF      N+NL S   +L V +    WQK      P  A IVKEFY+N  +       V G  + ++PTAIN 
Subjt:  SPRKFISVAAASMFEELKNRELMLERGF------NSNLESLPHMLAVTIFFQNWQKRCSKLEPAVANIVKEFYANFQDNGIWMTKVWGQLVMWSPTAINE

Query:  YYDVLDFPFAIYNSMEIAPSNDQFQAALTYCALEGACWKMSKNGNRSLLSAYVTPEANVWLWFVHNILLQTTHDTTVSKERMLLIFYIMSQINIDVGRII
        Y+  L        S      ++ +Q  L    L G  W   +   +++    + P   +W  F+ + L+ T+H+TTVS +RMLL+  I++   ID+G+II
Subjt:  YYDVLDFPFAIYNSMEIAPSNDQFQAALTYCALEGACWKMSKNGNRSLLSAYVTPEANVWLWFVHNILLQTTHDTTVSKERMLLIFYIMSQINIDVGRII

Query:  DREIASCARRKSGRLFFPNLITALCLKANVKILKDDDIMMDKGIIDSTTINCLMG--DQKGRKNSTIASGIKEL---------LNQQRELLHQ-VQYQGA
              C +R++  L FPNLITALC K  V+    D+I+     ++   I  L+G  + KG+K+    S +            L Q  +  HQ V     
Subjt:  DREIASCARRKSGRLFFPNLITALCLKANVKILKDDDIMMDKGIIDSTTINCLMG--DQKGRKNSTIASGIKEL---------LNQQRELLHQ-VQYQGA

Query:  QQRLYWEYALKRDEMVEKA
        +  +Y+ YA +RD  +  A
Subjt:  QQRLYWEYALKRDEMVEKA

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii] | e value: 7.2e-30 | % identity: 34.56
Query:  GMEPIAAERESPRKFISVAAASMFE-ELKNRELMLERGF---NSNLESLPHMLAVTIFFQNWQKRCSKLEPAVANIVKEFYANFQDNGIWMTKVWGQLVM
        G++ +A +     KF + AA + +E  ++NR L  E+GF   NS        +A  I   NW++ C+  E  +  +V+EFYAN  D       V G  V 
Subjt:  GMEPIAAERESPRKFISVAAASMFE-ELKNRELMLERGF---NSNLESLPHMLAVTIFFQNWQKRCSKLEPAVANIVKEFYANFQDNGIWMTKVWGQLVM

Query:  WSPTAINEYYDVLDFPFAIYNSMEIAPSNDQFQAALTYCALEGACWKMSKNGNRSLLSAYVTPEANVWLWFVHNILLQTTHDTTVSKERMLLIFYIMSQI
        WS  AIN  + + D P   ++      +       L   A+ GA W +S  G  + + + +TP A VW  F+ + LL TTH  TVSK+RMLL+  ++   
Subjt:  WSPTAINEYYDVLDFPFAIYNSMEIAPSNDQFQAALTYCALEGACWKMSKNGNRSLLSAYVTPEANVWLWFVHNILLQTTHDTTVSKERMLLIFYIMSQI

Query:  NIDVGRIIDREIASCARRKSGRLFFPNLITALCLKANVKILKDDDIMMDKGIIDSTTINCLMGDQKGRKNST
        +I+VGR+I  EI +CA RK+G LFFP+LIT LC  A    L +++ + + G ID+  +  +   Q+G   ST
Subjt:  NIDVGRIIDREIASCARRKSGRLFFPNLITALCLKANVKILKDDDIMMDKGIIDSTTINCLMGDQKGRKNST

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii] | e value: 4.1e-33 | % identity: 31.2
Query:  GMEPIAAERESPRKFISVAAASMFE-ELKNRELMLERGF---NSNLESLPHMLAVTIFFQNWQKRCSKLEPAVANIVKEFYANFQDNGIWMTKVWGQLVM
        G++ +A +     KF + AAA+ +E  ++NR L  E+GF   NS        +A  I   NW++ C+  E  +  +V+EFYAN  D       V G  V 
Subjt:  GMEPIAAERESPRKFISVAAASMFE-ELKNRELMLERGF---NSNLESLPHMLAVTIFFQNWQKRCSKLEPAVANIVKEFYANFQDNGIWMTKVWGQLVM

Query:  WSPTAINEYYDVLDFPFAIYNSMEIAPSNDQFQAALTYCALEGACWKMSKNGNRSLLSAYVTPEANVWLWFVHNILLQTTHDTTVSKERMLLIFYIMSQI
        WS  AIN  + + D P   ++      +       L   A  GA W +S  G  + + + +TP A VW  F+ + LL TTH  TVSK+RMLL+  ++   
Subjt:  WSPTAINEYYDVLDFPFAIYNSMEIAPSNDQFQAALTYCALEGACWKMSKNGNRSLLSAYVTPEANVWLWFVHNILLQTTHDTTVSKERMLLIFYIMSQI

Query:  NIDVGRIIDREIASCARRKSGRLFFPNLITALCLKANVKILKDDDIMMDKGIIDSTTINCLMGD-----------------QKGRKNSTIASGIKELLN-
        +I+VGR+I  EI +CA RK+G LFFP+LIT LC  A    L +++ + + G ID+  +  +  +                    R N  I   +K L   
Subjt:  NIDVGRIIDREIASCARRKSGRLFFPNLITALCLKANVKILKDDDIMMDKGIIDSTTINCLMGD-----------------QKGRKNSTIASGIKELLN-

Query:  ------QQRELLHQVQYQGAQQRLYWEYALKRDEMVEKAFNSD
              QQ  ++  +Q+   QQ+ +W Y+ +RD  ++KA  ++
Subjt:  ------QQRELLHQVQYQGAQQRLYWEYALKRDEMVEKAFNSD

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii] | e value: 6.7e-28 | % identity: 33.86
Query:  KNKGMEPIAAERESPRKFISVAAASMFEE-LKNRELMLERGF---NSNLESLPHMLAVTIFFQNWQKRCSKLEPAVANIVKEFYANFQDNGIWMTKVWGQ
        +  G+E +A       KF S AA   +EE ++NR L +E+ F   NS     P  +A  I   NWQ  C+  E  +  +V+EFY N  +       + G 
Subjt:  KNKGMEPIAAERESPRKFISVAAASMFEE-LKNRELMLERGF---NSNLESLPHMLAVTIFFQNWQKRCSKLEPAVANIVKEFYANFQDNGIWMTKVWGQ

Query:  LVMWSPTAINEYYDVLDFPFAIYNSMEIAPSNDQFQAALTYCALEGACWKMSKNGNRSLLSAYVTPEANVWLWFVHNILLQTTHDTTVSKERMLLIFYIM
         V  S  AIN  + + D P   ++      +  +    L   A+ GA W +S  G  + L + + P A VW  F+ + LL TTH  TVSKE + L++ ++
Subjt:  LVMWSPTAINEYYDVLDFPFAIYNSMEIAPSNDQFQAALTYCALEGACWKMSKNGNRSLLSAYVTPEANVWLWFVHNILLQTTHDTTVSKERMLLIFYIM

Query:  SQINIDVGRIIDREIASCARRKSGRLFFPNLITALCLKANVKILKDDDIMMDKG
        +  +I+VGR+I REI +CA RKSG LFFP+LIT++C       L +++ + + G
Subjt:  SQINIDVGRIIDREIASCARRKSGRLFFPNLITALCLKANVKILKDDDIMMDKG

TYH88163.1 hypothetical protein ES332_D01G168900v1 [Gossypium tomentosum] | e value: 1.6e-24 | % identity: 31.6
Query:  ISPPRRRAPKNKGMEPIAAERESPRKFISVAAASMFEELKNRELMLERGF---NSNLESLPHMLAVTIFFQNWQKRCSKLEPAVANIVKEFYANFQDNGI
        +S  R R+ K     PI  + E   +F S+         K++ +M E+GF   +++L   P  +   I    W++ C+    +   +V+EFYA+      
Subjt:  ISPPRRRAPKNKGMEPIAAERESPRKFISVAAASMFEELKNRELMLERGF---NSNLESLPHMLAVTIFFQNWQKRCSKLEPAVANIVKEFYANFQDNGI

Query:  WMTKVWGQLVMWSPTAINEYYDVLDFPFAIYNSMEIAPSNDQFQAALTYCALEGACWKMSKNGNRSLLSAYVTPEANVWLWFVHNILLQTTHDTTVSKER
            V  + V  +  +IN+ +++ D     Y  M    + D  Q  L      G+ W + K G+ S    Y+ P ANVW +FV    +  +H  T+S ER
Subjt:  WMTKVWGQLVMWSPTAINEYYDVLDFPFAIYNSMEIAPSNDQFQAALTYCALEGACWKMSKNGNRSLLSAYVTPEANVWLWFVHNILLQTTHDTTVSKER

Query:  MLLIFYIMSQINIDVGRIIDREIASCARRKSGRLFFPNLITALCLKANVK
        MLL++ I+++ +I+VG+II +EI +CA++K+G ++FP+LIT+LCLKA VK
Subjt:  MLLIFYIMSQINIDVGRIIDREIASCARRKSGRLFFPNLITALCLKANVK

TrEMBL top hits
A0A2P5AGA5 Uncharacterized protein (Fragment) | e value: 3.5e-30 | % identity: 34.56
Query:  GMEPIAAERESPRKFISVAAASMFE-ELKNRELMLERGF---NSNLESLPHMLAVTIFFQNWQKRCSKLEPAVANIVKEFYANFQDNGIWMTKVWGQLVM
        G++ +A +     KF + AA + +E  ++NR L  E+GF   NS        +A  I   NW++ C+  E  +  +V+EFYAN  D       V G  V 
Subjt:  GMEPIAAERESPRKFISVAAASMFE-ELKNRELMLERGF---NSNLESLPHMLAVTIFFQNWQKRCSKLEPAVANIVKEFYANFQDNGIWMTKVWGQLVM

Query:  WSPTAINEYYDVLDFPFAIYNSMEIAPSNDQFQAALTYCALEGACWKMSKNGNRSLLSAYVTPEANVWLWFVHNILLQTTHDTTVSKERMLLIFYIMSQI
        WS  AIN  + + D P   ++      +       L   A+ GA W +S  G  + + + +TP A VW  F+ + LL TTH  TVSK+RMLL+  ++   
Subjt:  WSPTAINEYYDVLDFPFAIYNSMEIAPSNDQFQAALTYCALEGACWKMSKNGNRSLLSAYVTPEANVWLWFVHNILLQTTHDTTVSKERMLLIFYIMSQI

Query:  NIDVGRIIDREIASCARRKSGRLFFPNLITALCLKANVKILKDDDIMMDKGIIDSTTINCLMGDQKGRKNST
        +I+VGR+I  EI +CA RK+G LFFP+LIT LC  A    L +++ + + G ID+  +  +   Q+G   ST
Subjt:  NIDVGRIIDREIASCARRKSGRLFFPNLITALCLKANVKILKDDDIMMDKGIIDSTTINCLMGDQKGRKNST

A0A2P5BCG4 Uncharacterized protein (Fragment) | e value: 2.0e-33 | % identity: 31.2
Query:  GMEPIAAERESPRKFISVAAASMFE-ELKNRELMLERGF---NSNLESLPHMLAVTIFFQNWQKRCSKLEPAVANIVKEFYANFQDNGIWMTKVWGQLVM
        G++ +A +     KF + AAA+ +E  ++NR L  E+GF   NS        +A  I   NW++ C+  E  +  +V+EFYAN  D       V G  V 
Subjt:  GMEPIAAERESPRKFISVAAASMFE-ELKNRELMLERGF---NSNLESLPHMLAVTIFFQNWQKRCSKLEPAVANIVKEFYANFQDNGIWMTKVWGQLVM

Query:  WSPTAINEYYDVLDFPFAIYNSMEIAPSNDQFQAALTYCALEGACWKMSKNGNRSLLSAYVTPEANVWLWFVHNILLQTTHDTTVSKERMLLIFYIMSQI
        WS  AIN  + + D P   ++      +       L   A  GA W +S  G  + + + +TP A VW  F+ + LL TTH  TVSK+RMLL+  ++   
Subjt:  WSPTAINEYYDVLDFPFAIYNSMEIAPSNDQFQAALTYCALEGACWKMSKNGNRSLLSAYVTPEANVWLWFVHNILLQTTHDTTVSKERMLLIFYIMSQI

Query:  NIDVGRIIDREIASCARRKSGRLFFPNLITALCLKANVKILKDDDIMMDKGIIDSTTINCLMGD-----------------QKGRKNSTIASGIKELLN-
        +I+VGR+I  EI +CA RK+G LFFP+LIT LC  A    L +++ + + G ID+  +  +  +                    R N  I   +K L   
Subjt:  NIDVGRIIDREIASCARRKSGRLFFPNLITALCLKANVKILKDDDIMMDKGIIDSTTINCLMGD-----------------QKGRKNSTIASGIKELLN-

Query:  ------QQRELLHQVQYQGAQQRLYWEYALKRDEMVEKAFNSD
              QQ  ++  +Q+   QQ+ +W Y+ +RD  ++KA  ++
Subjt:  ------QQRELLHQVQYQGAQQRLYWEYALKRDEMVEKAFNSD

A0A2P5DAQ2 Uncharacterized protein | e value: 3.3e-28 | % identity: 33.86
Query:  KNKGMEPIAAERESPRKFISVAAASMFEE-LKNRELMLERGF---NSNLESLPHMLAVTIFFQNWQKRCSKLEPAVANIVKEFYANFQDNGIWMTKVWGQ
        +  G+E +A       KF S AA   +EE ++NR L +E+ F   NS     P  +A  I   NWQ  C+  E  +  +V+EFY N  +       + G 
Subjt:  KNKGMEPIAAERESPRKFISVAAASMFEE-LKNRELMLERGF---NSNLESLPHMLAVTIFFQNWQKRCSKLEPAVANIVKEFYANFQDNGIWMTKVWGQ

Query:  LVMWSPTAINEYYDVLDFPFAIYNSMEIAPSNDQFQAALTYCALEGACWKMSKNGNRSLLSAYVTPEANVWLWFVHNILLQTTHDTTVSKERMLLIFYIM
         V  S  AIN  + + D P   ++      +  +    L   A+ GA W +S  G  + L + + P A VW  F+ + LL TTH  TVSKE + L++ ++
Subjt:  LVMWSPTAINEYYDVLDFPFAIYNSMEIAPSNDQFQAALTYCALEGACWKMSKNGNRSLLSAYVTPEANVWLWFVHNILLQTTHDTTVSKERMLLIFYIM

Query:  SQINIDVGRIIDREIASCARRKSGRLFFPNLITALCLKANVKILKDDDIMMDKG
        +  +I+VGR+I REI +CA RKSG LFFP+LIT++C       L +++ + + G
Subjt:  SQINIDVGRIIDREIASCARRKSGRLFFPNLITALCLKANVKILKDDDIMMDKG

A0A5D2MA47 Uncharacterized protein | e value: 7.5e-25 | % identity: 31.6
Query:  ISPPRRRAPKNKGMEPIAAERESPRKFISVAAASMFEELKNRELMLERGF---NSNLESLPHMLAVTIFFQNWQKRCSKLEPAVANIVKEFYANFQDNGI
        +S  R R+ K     PI  + E   +F S+         K++ +M E+GF   +++L   P  +   I    W++ C+    +   +V+EFYA+      
Subjt:  ISPPRRRAPKNKGMEPIAAERESPRKFISVAAASMFEELKNRELMLERGF---NSNLESLPHMLAVTIFFQNWQKRCSKLEPAVANIVKEFYANFQDNGI

Query:  WMTKVWGQLVMWSPTAINEYYDVLDFPFAIYNSMEIAPSNDQFQAALTYCALEGACWKMSKNGNRSLLSAYVTPEANVWLWFVHNILLQTTHDTTVSKER
            V  + V  +  +IN+ +++ D     Y  M    + D  Q  L      G+ W + K G+ S    Y+ P ANVW +FV    +  +H  T+S ER
Subjt:  WMTKVWGQLVMWSPTAINEYYDVLDFPFAIYNSMEIAPSNDQFQAALTYCALEGACWKMSKNGNRSLLSAYVTPEANVWLWFVHNILLQTTHDTTVSKER

Query:  MLLIFYIMSQINIDVGRIIDREIASCARRKSGRLFFPNLITALCLKANVK
        MLL++ I+++ +I+VG+II +EI +CA++K+G ++FP+LIT+LCLKA VK
Subjt:  MLLIFYIMSQINIDVGRIIDREIASCARRKSGRLFFPNLITALCLKANVK

A0A6A2ZUE4 Uncharacterized protein | e value: 1.2e-25 | % identity: 30.72
Query:  SPRKFISVAAASMFEELKNRELMLERGF------NSNLESLPHMLAVTIFFQNWQKRCSKLEPAVANIVKEFYANFQDNGIWMTKVWGQLVMWSPTAINE
        SP  F+  AA   ++ +KNR +  E GF      N+NL S   +L V +    WQK      P  A IVKEFY+N  +       V G  + ++PTAIN 
Subjt:  SPRKFISVAAASMFEELKNRELMLERGF------NSNLESLPHMLAVTIFFQNWQKRCSKLEPAVANIVKEFYANFQDNGIWMTKVWGQLVMWSPTAINE

Query:  YYDVLDFPFAIYNSMEIAPSNDQFQAALTYCALEGACWKMSKNGNRSLLSAYVTPEANVWLWFVHNILLQTTHDTTVSKERMLLIFYIMSQINIDVGRII
        Y+  L        S      ++ +Q  L    L G  W   +   +++    + P   +W  F+ + L+ T+H+TTVS +RMLL+  I++   ID+G+II
Subjt:  YYDVLDFPFAIYNSMEIAPSNDQFQAALTYCALEGACWKMSKNGNRSLLSAYVTPEANVWLWFVHNILLQTTHDTTVSKERMLLIFYIMSQINIDVGRII

Query:  DREIASCARRKSGRLFFPNLITALCLKANVKILKDDDIMMDKGIIDSTTINCLMG--DQKGRKNSTIASGIKEL---------LNQQRELLHQ-VQYQGA
              C +R++  L FPNLITALC K  V+    D+I+     ++   I  L+G  + KG+K+    S +            L Q  +  HQ V     
Subjt:  DREIASCARRKSGRLFFPNLITALCLKANVKILKDDDIMMDKGIIDSTTINCLMG--DQKGRKNSTIASGIKEL---------LNQQRELLHQ-VQYQGA

Query:  QQRLYWEYALKRDEMVEKA
        +  +Y+ YA +RD  +  A
Subjt:  QQRLYWEYALKRDEMVEKA

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGGGGCCAAGGTCGAGAGCCAAGAAGATCTTCAGGTCGAAAGTAGCTGGGGAAGAAGGCTCTTCAGCCCGAAAGAAAGACAAGCAGCCCGTTCAAGATGACTTTAGCAT
GATTCAGCCCATTTCTCCTCCAAGAAGGAGAGCTCCGAAAAACAAAGGAATGGAGCCCATAGCAGCCGAGAGGGAAAGTCCAAGGAAGTTTATCAGTGTGGCAGCGGCCT
CAATGTTTGAGGAGCTAAAGAATAGGGAATTAATGTTAGAGAGGGGGTTTAACTCCAACTTGGAAAGCTTACCCCACATGTTGGCCGTAACAATTTTCTTCCAAAATTGG
CAAAAACGCTGTAGCAAACTGGAGCCAGCCGTGGCCAATATTGTGAAAGAATTTTATGCGAATTTTCAAGATAACGGGATATGGATGACCAAAGTTTGGGGACAGCTTGT
AATGTGGAGTCCTACCGCCATCAACGAGTATTATGACGTGCTTGACTTTCCCTTCGCCATATACAACTCCATGGAAATTGCACCCTCCAACGATCAATTTCAGGCAGCCT
TGACTTACTGTGCTTTGGAGGGGGCGTGTTGGAAAATGTCAAAGAATGGAAACCGATCCTTGTTGTCTGCATATGTCACACCCGAAGCTAACGTGTGGTTGTGGTTTGTT
CACAACATATTGCTTCAAACAACACACGACACAACCGTATCTAAGGAGAGGATGCTCCTTATTTTCTACATAATGAGCCAAATCAACATTGATGTGGGGAGAATTATTGA
TCGAGAGATCGCATCGTGTGCTCGTAGGAAGTCAGGGAGGTTGTTTTTCCCAAACCTCATCACAGCTCTCTGCTTAAAGGCCAATGTGAAAATTTTGAAAGATGATGACA
TCATGATGGATAAGGGAATTATCGATTCGACAACTATCAACTGCCTCATGGGGGACCAAAAGGGGAGAAAGAATTCAACCATTGCAAGCGGCATAAAGGAGCTACTCAAT
CAGCAAAGGGAGTTATTGCACCAAGTGCAATATCAAGGAGCGCAACAACGTTTATATTGGGAGTACGCTCTCAAGAGGGACGAGATGGTAGAAAAAGCATTTAATTCGGA
TCCATAA
mRNA sequence
ATGGGGCCAAGGTCGAGAGCCAAGAAGATCTTCAGGTCGAAAGTAGCTGGGGAAGAAGGCTCTTCAGCCCGAAAGAAAGACAAGCAGCCCGTTCAAGATGACTTTAGCAT
GATTCAGCCCATTTCTCCTCCAAGAAGGAGAGCTCCGAAAAACAAAGGAATGGAGCCCATAGCAGCCGAGAGGGAAAGTCCAAGGAAGTTTATCAGTGTGGCAGCGGCCT
CAATGTTTGAGGAGCTAAAGAATAGGGAATTAATGTTAGAGAGGGGGTTTAACTCCAACTTGGAAAGCTTACCCCACATGTTGGCCGTAACAATTTTCTTCCAAAATTGG
CAAAAACGCTGTAGCAAACTGGAGCCAGCCGTGGCCAATATTGTGAAAGAATTTTATGCGAATTTTCAAGATAACGGGATATGGATGACCAAAGTTTGGGGACAGCTTGT
AATGTGGAGTCCTACCGCCATCAACGAGTATTATGACGTGCTTGACTTTCCCTTCGCCATATACAACTCCATGGAAATTGCACCCTCCAACGATCAATTTCAGGCAGCCT
TGACTTACTGTGCTTTGGAGGGGGCGTGTTGGAAAATGTCAAAGAATGGAAACCGATCCTTGTTGTCTGCATATGTCACACCCGAAGCTAACGTGTGGTTGTGGTTTGTT
CACAACATATTGCTTCAAACAACACACGACACAACCGTATCTAAGGAGAGGATGCTCCTTATTTTCTACATAATGAGCCAAATCAACATTGATGTGGGGAGAATTATTGA
TCGAGAGATCGCATCGTGTGCTCGTAGGAAGTCAGGGAGGTTGTTTTTCCCAAACCTCATCACAGCTCTCTGCTTAAAGGCCAATGTGAAAATTTTGAAAGATGATGACA
TCATGATGGATAAGGGAATTATCGATTCGACAACTATCAACTGCCTCATGGGGGACCAAAAGGGGAGAAAGAATTCAACCATTGCAAGCGGCATAAAGGAGCTACTCAAT
CAGCAAAGGGAGTTATTGCACCAAGTGCAATATCAAGGAGCGCAACAACGTTTATATTGGGAGTACGCTCTCAAGAGGGACGAGATGGTAGAAAAAGCATTTAATTCGGA
TCCATAA
Protein sequence
MGPRSRAKKIFRSKVAGEEGSSARKKDKQPVQDDFSMIQPISPPRRRAPKNKGMEPIAAERESPRKFISVAAASMFEELKNRELMLERGFNSNLESLPHMLAVTIFFQNW
QKRCSKLEPAVANIVKEFYANFQDNGIWMTKVWGQLVMWSPTAINEYYDVLDFPFAIYNSMEIAPSNDQFQAALTYCALEGACWKMSKNGNRSLLSAYVTPEANVWLWFV
HNILLQTTHDTTVSKERMLLIFYIMSQINIDVGRIIDREIASCARRKSGRLFFPNLITALCLKANVKILKDDDIMMDKGIIDSTTINCLMGDQKGRKNSTIASGIKELLN
QQRELLHQVQYQGAQQRLYWEYALKRDEMVEKAFNSDP
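
The CDS and protein sequences listed above can be checked against each other by translating the CDS. The sketch below is one way to do that, assuming the standard nuclear genetic code (NCBI translation table 1) and a CDS that starts in frame at its first base; the function name `translate` is hypothetical and not part of CuGenDB.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    # Drop line breaks and spaces so a wrapped sequence can be pasted directly.
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        # Codons with ambiguous bases (e.g. N) are reported as X.
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First six codons and last five codons of the CDS listed above.
print(translate("ATGGGGCCAAGGTCGAGA"))   # MGPRSR
print(translate("AATTCGGATCCATAA"))      # NSDP
```

Pasting the full CDS into `translate` and comparing the result with the protein sequence, with line breaks removed, is a quick consistency check on the record.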