CuGenDBv2

Clc10G06845 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc10G06845
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Enzymatic polyprotein
Genome location: ClcChr10:9482824..9483835
RNA-Seq Expression: Clc10G06845
Synteny: Clc10G06845
Gene Ontology terms: GO:0005488 - binding (molecular function)
InterPro domains: NA
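
The genome location field reads as a sequence identifier followed by a start..end range. The following is a minimal Python sketch for parsing it, not part of the database itself. The helper names `parse_location` and `span_length` are hypothetical, and the sketch assumes 1-based, inclusive coordinates, a convention the record does not state.

```python
import re


def parse_location(location):
    """Split a 'SeqID:start..end' string into (seq_id, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seq_id, start, end = match.groups()
    return seq_id, int(start), int(end)


def span_length(start, end):
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


seq_id, start, end = parse_location("ClcChr10:9482824..9483835")
print(seq_id, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the locus spans 1012 bp on ClcChr10.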


Homology
GenBank top hits (e value, % identity, alignment)
KAA0056776.1 Enzymatic polyprotein [Cucumis melo var. makuwa], e value 3.4e-61, identity 42.3%
Query:  MDKQNTVLKTDYLGLPKIDPYKSIIQPNTLPIGVIKNDPEDLLKEINKKLTTFKFDKPKAEASKNIEASRNINMIRK---VEESFLKILPIRTYNTDMKN
        M+     LK     +P+I+P + I QPN+  IG ++ D  D L EIN++L     +K    A +  E S+ INMI+K    + S  KILP+  +  DMKN
Subjt:  MDKQNTVLKTDYLGLPKIDPYKSIIQPNTLPIGVIKNDPEDLLKEINKKLTTFKFDKPKAEASKNIEASRNINMIRK---VEESFLKILPIRTYNTDMKN

Query:  YYPRPSPPDLGWD------------------------------------AASAYNTKKTVLATTYILIFGFTRNSRSWWHNQLTNQDRSAILTATKAIVK
        +YP+PSPPDLGWD                                    AA+AY+TKK+   T  ILI GF  N RSWWHN LT QDR  ILTAT+ +VK
Subjt:  YYPRPSPPDLGWD------------------------------------AASAYNTKKTVLATTYILIFGFTRNSRSWWHNQLTNQDRSAILTATKAIVK

Query:  QEGN-------------------PQAGTGQTVVYSNQATEALLGMRCRKMSDFKSYKDTFLACLYNLTSCEDEIWKRKFVEGLPQYIAEKFYQKMSEHFE
         E                      +   G T ++ N ATEALLG++C KMS +K YKDTF+A LY LT+C  +IWK+KFVEGLP YI++KFYQ M+ +  
Subjt:  QEGN-------------------PQAGTGQTVVYSNQATEALLGMRCRKMSDFKSYKDTFLACLYNLTSCEDEIWKRKFVEGLPQYIAEKFYQKMSEHFE

Query:  GNTINWSTLTYGDITSSIQAICLSLCRDTKH
           I+W+ LTYGDI+S++Q IC++LC + KH
Subjt:  GNTINWSTLTYGDITSSIQAICLSLCRDTKH

KAA0057417.1 Enzymatic polyprotein [Cucumis melo var. makuwa], e value 4.8e-63, identity 43.83%
Query:  LKTDYLGLPKIDPYKSIIQPNTLPIGVIKNDPEDLLKEINKKLTTFKFDKPKAEASKNIEASRNINMIRK---VEESFLKILPIRTYNTDMKNYYPRPSP
        LK     +P+I+P + I QPN+  IG +K D  D L EINK+L     +K    A +  EA + INMI+K    + S  KILP+  +  DMKN+YP+PSP
Subjt:  LKTDYLGLPKIDPYKSIIQPNTLPIGVIKNDPEDLLKEINKKLTTFKFDKPKAEASKNIEASRNINMIRK---VEESFLKILPIRTYNTDMKNYYPRPSP

Query:  PDLGWD------------------------------------AASAYNTKKTVLATTYILIFGFTRNSRSWWHNQLTNQDRSAILTATKAIVKQEGN---
        PDLGWD                                    AA+AY+TKK+   T  ILI GF  N RSWWHN LT QDR  ILTAT+ +VK E +   
Subjt:  PDLGWD------------------------------------AASAYNTKKTVLATTYILIFGFTRNSRSWWHNQLTNQDRSAILTATKAIVKQEGN---

Query:  ----------------PQAGTGQTVVYSNQATEALLGMRCRKMSDFKSYKDTFLACLYNLTSCEDEIWKRKFVEGLPQYIAEKFYQKMSEHFEGNTINWS
                         +   G T ++ N ATEALLG++C KMS +K YKDTF+A LY LT+C  +IWK+KFVEGLP YI++KFYQ M+E+     I+W+
Subjt:  ----------------PQAGTGQTVVYSNQATEALLGMRCRKMSDFKSYKDTFLACLYNLTSCEDEIWKRKFVEGLPQYIAEKFYQKMSEHFEGNTINWS

Query:  TLTYGDITSSIQAICLSLCRDTKH
         LTYGDI+S++Q IC++LC + KH
Subjt:  TLTYGDITSSIQAICLSLCRDTKH

TYJ97599.1 Enzymatic polyprotein [Cucumis melo var. makuwa], e value 2.6e-61, identity 42.3%
Query:  MDKQNTVLKTDYLGLPKIDPYKSIIQPNTLPIGVIKNDPEDLLKEINKKLTTFKFDKPKAEASKNIEASRNINMIRK---VEESFLKILPIRTYNTDMKN
        M+     LK     +P+I+P + I QPN+  IG ++ D  D L EIN++L     +K    A +  E S+ INMI+K    + S  KILP+  +  DMKN
Subjt:  MDKQNTVLKTDYLGLPKIDPYKSIIQPNTLPIGVIKNDPEDLLKEINKKLTTFKFDKPKAEASKNIEASRNINMIRK---VEESFLKILPIRTYNTDMKN

Query:  YYPRPSPPDLGWD------------------------------------AASAYNTKKTVLATTYILIFGFTRNSRSWWHNQLTNQDRSAILTATKAIVK
        +YP+PSPPDLGWD                                    AA+AY+TKK+   T  ILI GF  N RSWWHN LT QDR  ILTAT+ +VK
Subjt:  YYPRPSPPDLGWD------------------------------------AASAYNTKKTVLATTYILIFGFTRNSRSWWHNQLTNQDRSAILTATKAIVK

Query:  QEGN-------------------PQAGTGQTVVYSNQATEALLGMRCRKMSDFKSYKDTFLACLYNLTSCEDEIWKRKFVEGLPQYIAEKFYQKMSEHFE
         E                      +   G T ++ N ATEALLG++C KMS +K YKDTF+A LY LT+C  +IWK+KFVEGLP YI++KFYQ M+ +  
Subjt:  QEGN-------------------PQAGTGQTVVYSNQATEALLGMRCRKMSDFKSYKDTFLACLYNLTSCEDEIWKRKFVEGLPQYIAEKFYQKMSEHFE

Query:  GNTINWSTLTYGDITSSIQAICLSLCRDTKH
           I+W+ LTYGDI+S++Q IC++LC + KH
Subjt:  GNTINWSTLTYGDITSSIQAICLSLCRDTKH

TYJ98087.1 Enzymatic polyprotein [Cucumis melo var. makuwa], e value 2.9e-60, identity 42.59%
Query:  LKTDYLGLPKIDPYKSIIQPNTLPIGVIKNDPEDLLKEINKKLTTFKFDKPKAEASKNIEASRNINMIRK---VEESFLKILPIRTYNTDMKNYYPRPSP
        LK     +P+I+P + I QPN+  IG +K D  D L EINK+L     +K    A++  +  + INMI+K    + S LKILP+  +  DMKN+YP+PSP
Subjt:  LKTDYLGLPKIDPYKSIIQPNTLPIGVIKNDPEDLLKEINKKLTTFKFDKPKAEASKNIEASRNINMIRK---VEESFLKILPIRTYNTDMKNYYPRPSP

Query:  PDLGWD------------------------------------AASAYNTKKTVLATTYILIFGFTRNSRSWWHNQLTNQDRSAILTATKAIVKQEGN---
        PDLGWD                                    AA+AY+TKK+   T  ILI GF  N RSWWHN LT QDR  ILTAT+ +VK E     
Subjt:  PDLGWD------------------------------------AASAYNTKKTVLATTYILIFGFTRNSRSWWHNQLTNQDRSAILTATKAIVKQEGN---

Query:  ----------------PQAGTGQTVVYSNQATEALLGMRCRKMSDFKSYKDTFLACLYNLTSCEDEIWKRKFVEGLPQYIAEKFYQKMSEHFEGNTINWS
                         +   G T ++ N ATEALLG++  KMS +K YKDTF+A LY LT+C  +IWK+KFVEGLP YI++KFYQ M+ +     I+W+
Subjt:  ----------------PQAGTGQTVVYSNQATEALLGMRCRKMSDFKSYKDTFLACLYNLTSCEDEIWKRKFVEGLPQYIAEKFYQKMSEHFEGNTINWS

Query:  TLTYGDITSSIQAICLSLCRDTKH
         LTYGDI+S++Q I ++LC + KH
Subjt:  TLTYGDITSSIQAICLSLCRDTKH

XP_022151716.1 uncharacterized protein LOC111019629 [Momordica charantia], e value 1.2e-61, identity 41.9%
Query:  QNTVLKTDYLGLPKIDPYKSIIQPNTLPIGVIKNDPEDLLKEINKKLTTFKFDKPKAEASKNIEASRNINM---IRKVEESFLKILPIRTYNTDMKNYYP
        +N VL T    +P +DP + I QPN+  IG +K DP DL  +IN++L++   +  K ++S+  E +++IN+   I  + ++    + + T +T++KN+YP
Subjt:  QNTVLKTDYLGLPKIDPYKSIIQPNTLPIGVIKNDPEDLLKEINKKLTTFKFDKPKAEASKNIEASRNINM---IRKVEESFLKILPIRTYNTDMKNYYP

Query:  RPSPPDLGWD------------------------------------AASAYNTKKTVLATTYILIFGFTRNSRSWWHNQLTNQDRSAILTATKAIVKQEG
        RPSPPD+GWD                                    AA+A++TKK VL T  ILI   + N RSWWHNQLT++DR+ IL ATKA+VKQEG
Subjt:  RPSPPDLGWD------------------------------------AASAYNTKKTVLATTYILIFGFTRNSRSWWHNQLTNQDRSAILTATKAIVKQEG

Query:  N------------------PQAGTGQTVVYSNQATEALLGMRCRKMSDFKSYKDTFLACLYNLTSCEDEIWKRKFVEGLPQYIAEKFYQKMSEHFEGNTI
        +                   +   G T VYS+   EALL +RCRKMS++K YKDTFLA LY +T+C  +IWK+KFVEGLP YIA+KFYQ +  +   N I
Subjt:  N------------------PQAGTGQTVVYSNQATEALLGMRCRKMSDFKSYKDTFLACLYNLTSCEDEIWKRKFVEGLPQYIAEKFYQKMSEHFEGNTI

Query:  NWSTLTYGDITSSIQAICLSLCRDTKH
        +W+ LT GDI ++IQ IC++LC + KH
Subjt:  NWSTLTYGDITSSIQAICLSLCRDTKH

TrEMBL top hits (e value, % identity, alignment)
A0A5A7UR29 Enzymatic polyprotein, e value 1.7e-61, identity 42.3%
Query:  MDKQNTVLKTDYLGLPKIDPYKSIIQPNTLPIGVIKNDPEDLLKEINKKLTTFKFDKPKAEASKNIEASRNINMIRK---VEESFLKILPIRTYNTDMKN
        M+     LK     +P+I+P + I QPN+  IG ++ D  D L EIN++L     +K    A +  E S+ INMI+K    + S  KILP+  +  DMKN
Subjt:  MDKQNTVLKTDYLGLPKIDPYKSIIQPNTLPIGVIKNDPEDLLKEINKKLTTFKFDKPKAEASKNIEASRNINMIRK---VEESFLKILPIRTYNTDMKN

Query:  YYPRPSPPDLGWD------------------------------------AASAYNTKKTVLATTYILIFGFTRNSRSWWHNQLTNQDRSAILTATKAIVK
        +YP+PSPPDLGWD                                    AA+AY+TKK+   T  ILI GF  N RSWWHN LT QDR  ILTAT+ +VK
Subjt:  YYPRPSPPDLGWD------------------------------------AASAYNTKKTVLATTYILIFGFTRNSRSWWHNQLTNQDRSAILTATKAIVK

Query:  QEGN-------------------PQAGTGQTVVYSNQATEALLGMRCRKMSDFKSYKDTFLACLYNLTSCEDEIWKRKFVEGLPQYIAEKFYQKMSEHFE
         E                      +   G T ++ N ATEALLG++C KMS +K YKDTF+A LY LT+C  +IWK+KFVEGLP YI++KFYQ M+ +  
Subjt:  QEGN-------------------PQAGTGQTVVYSNQATEALLGMRCRKMSDFKSYKDTFLACLYNLTSCEDEIWKRKFVEGLPQYIAEKFYQKMSEHFE

Query:  GNTINWSTLTYGDITSSIQAICLSLCRDTKH
           I+W+ LTYGDI+S++Q IC++LC + KH
Subjt:  GNTINWSTLTYGDITSSIQAICLSLCRDTKH

A0A5A7URX9 Enzymatic polyprotein, e value 2.3e-63, identity 43.83%
Query:  LKTDYLGLPKIDPYKSIIQPNTLPIGVIKNDPEDLLKEINKKLTTFKFDKPKAEASKNIEASRNINMIRK---VEESFLKILPIRTYNTDMKNYYPRPSP
        LK     +P+I+P + I QPN+  IG +K D  D L EINK+L     +K    A +  EA + INMI+K    + S  KILP+  +  DMKN+YP+PSP
Subjt:  LKTDYLGLPKIDPYKSIIQPNTLPIGVIKNDPEDLLKEINKKLTTFKFDKPKAEASKNIEASRNINMIRK---VEESFLKILPIRTYNTDMKNYYPRPSP

Query:  PDLGWD------------------------------------AASAYNTKKTVLATTYILIFGFTRNSRSWWHNQLTNQDRSAILTATKAIVKQEGN---
        PDLGWD                                    AA+AY+TKK+   T  ILI GF  N RSWWHN LT QDR  ILTAT+ +VK E +   
Subjt:  PDLGWD------------------------------------AASAYNTKKTVLATTYILIFGFTRNSRSWWHNQLTNQDRSAILTATKAIVKQEGN---

Query:  ----------------PQAGTGQTVVYSNQATEALLGMRCRKMSDFKSYKDTFLACLYNLTSCEDEIWKRKFVEGLPQYIAEKFYQKMSEHFEGNTINWS
                         +   G T ++ N ATEALLG++C KMS +K YKDTF+A LY LT+C  +IWK+KFVEGLP YI++KFYQ M+E+     I+W+
Subjt:  ----------------PQAGTGQTVVYSNQATEALLGMRCRKMSDFKSYKDTFLACLYNLTSCEDEIWKRKFVEGLPQYIAEKFYQKMSEHFEGNTINWS

Query:  TLTYGDITSSIQAICLSLCRDTKH
         LTYGDI+S++Q IC++LC + KH
Subjt:  TLTYGDITSSIQAICLSLCRDTKH

A0A5D3BEY3 Enzymatic polyprotein, e value 1.3e-61, identity 42.3%
Query:  MDKQNTVLKTDYLGLPKIDPYKSIIQPNTLPIGVIKNDPEDLLKEINKKLTTFKFDKPKAEASKNIEASRNINMIRK---VEESFLKILPIRTYNTDMKN
        M+     LK     +P+I+P + I QPN+  IG ++ D  D L EIN++L     +K    A +  E S+ INMI+K    + S  KILP+  +  DMKN
Subjt:  MDKQNTVLKTDYLGLPKIDPYKSIIQPNTLPIGVIKNDPEDLLKEINKKLTTFKFDKPKAEASKNIEASRNINMIRK---VEESFLKILPIRTYNTDMKN

Query:  YYPRPSPPDLGWD------------------------------------AASAYNTKKTVLATTYILIFGFTRNSRSWWHNQLTNQDRSAILTATKAIVK
        +YP+PSPPDLGWD                                    AA+AY+TKK+   T  ILI GF  N RSWWHN LT QDR  ILTAT+ +VK
Subjt:  YYPRPSPPDLGWD------------------------------------AASAYNTKKTVLATTYILIFGFTRNSRSWWHNQLTNQDRSAILTATKAIVK

Query:  QEGN-------------------PQAGTGQTVVYSNQATEALLGMRCRKMSDFKSYKDTFLACLYNLTSCEDEIWKRKFVEGLPQYIAEKFYQKMSEHFE
         E                      +   G T ++ N ATEALLG++C KMS +K YKDTF+A LY LT+C  +IWK+KFVEGLP YI++KFYQ M+ +  
Subjt:  QEGN-------------------PQAGTGQTVVYSNQATEALLGMRCRKMSDFKSYKDTFLACLYNLTSCEDEIWKRKFVEGLPQYIAEKFYQKMSEHFE

Query:  GNTINWSTLTYGDITSSIQAICLSLCRDTKH
           I+W+ LTYGDI+S++Q IC++LC + KH
Subjt:  GNTINWSTLTYGDITSSIQAICLSLCRDTKH

A0A5D3BG41 Enzymatic polyprotein, e value 1.4e-60, identity 42.59%
Query:  LKTDYLGLPKIDPYKSIIQPNTLPIGVIKNDPEDLLKEINKKLTTFKFDKPKAEASKNIEASRNINMIRK---VEESFLKILPIRTYNTDMKNYYPRPSP
        LK     +P+I+P + I QPN+  IG +K D  D L EINK+L     +K    A++  +  + INMI+K    + S LKILP+  +  DMKN+YP+PSP
Subjt:  LKTDYLGLPKIDPYKSIIQPNTLPIGVIKNDPEDLLKEINKKLTTFKFDKPKAEASKNIEASRNINMIRK---VEESFLKILPIRTYNTDMKNYYPRPSP

Query:  PDLGWD------------------------------------AASAYNTKKTVLATTYILIFGFTRNSRSWWHNQLTNQDRSAILTATKAIVKQEGN---
        PDLGWD                                    AA+AY+TKK+   T  ILI GF  N RSWWHN LT QDR  ILTAT+ +VK E     
Subjt:  PDLGWD------------------------------------AASAYNTKKTVLATTYILIFGFTRNSRSWWHNQLTNQDRSAILTATKAIVKQEGN---

Query:  ----------------PQAGTGQTVVYSNQATEALLGMRCRKMSDFKSYKDTFLACLYNLTSCEDEIWKRKFVEGLPQYIAEKFYQKMSEHFEGNTINWS
                         +   G T ++ N ATEALLG++  KMS +K YKDTF+A LY LT+C  +IWK+KFVEGLP YI++KFYQ M+ +     I+W+
Subjt:  ----------------PQAGTGQTVVYSNQATEALLGMRCRKMSDFKSYKDTFLACLYNLTSCEDEIWKRKFVEGLPQYIAEKFYQKMSEHFEGNTINWS

Query:  TLTYGDITSSIQAICLSLCRDTKH
         LTYGDI+S++Q I ++LC + KH
Subjt:  TLTYGDITSSIQAICLSLCRDTKH

A0A6J1DFI7 uncharacterized protein LOC111019629, e value 5.7e-62, identity 41.9%
Query:  QNTVLKTDYLGLPKIDPYKSIIQPNTLPIGVIKNDPEDLLKEINKKLTTFKFDKPKAEASKNIEASRNINM---IRKVEESFLKILPIRTYNTDMKNYYP
        +N VL T    +P +DP + I QPN+  IG +K DP DL  +IN++L++   +  K ++S+  E +++IN+   I  + ++    + + T +T++KN+YP
Subjt:  QNTVLKTDYLGLPKIDPYKSIIQPNTLPIGVIKNDPEDLLKEINKKLTTFKFDKPKAEASKNIEASRNINM---IRKVEESFLKILPIRTYNTDMKNYYP

Query:  RPSPPDLGWD------------------------------------AASAYNTKKTVLATTYILIFGFTRNSRSWWHNQLTNQDRSAILTATKAIVKQEG
        RPSPPD+GWD                                    AA+A++TKK VL T  ILI   + N RSWWHNQLT++DR+ IL ATKA+VKQEG
Subjt:  RPSPPDLGWD------------------------------------AASAYNTKKTVLATTYILIFGFTRNSRSWWHNQLTNQDRSAILTATKAIVKQEG

Query:  N------------------PQAGTGQTVVYSNQATEALLGMRCRKMSDFKSYKDTFLACLYNLTSCEDEIWKRKFVEGLPQYIAEKFYQKMSEHFEGNTI
        +                   +   G T VYS+   EALL +RCRKMS++K YKDTFLA LY +T+C  +IWK+KFVEGLP YIA+KFYQ +  +   N I
Subjt:  N------------------PQAGTGQTVVYSNQATEALLGMRCRKMSDFKSYKDTFLACLYNLTSCEDEIWKRKFVEGLPQYIAEKFYQKMSEHFEGNTI

Query:  NWSTLTYGDITSSIQAICLSLCRDTKH
        +W+ LT GDI ++IQ IC++LC + KH
Subjt:  NWSTLTYGDITSSIQAICLSLCRDTKH

SwissProt top hits: no hits found
Arabidopsis top hits: no hits found

Sequences
CDS sequence
ATGGATAAGCAAAATACTGTTCTCAAAACAGACTATCTAGGTCTACCAAAGATTGATCCCTACAAGTCGATCATCCAACCAAATACTCTTCCCATTGGAGTAATCAAGAA
TGATCCAGAAGACTTATTGAAGGAGATCAATAAGAAGCTCACTACCTTCAAATTTGATAAACCAAAAGCTGAAGCCTCTAAGAACATAGAAGCTTCTAGGAATATTAATA
TGATAAGAAAAGTCGAGGAATCTTTTTTAAAAATTCTTCCTATCAGAACCTACAACACTGATATGAAGAATTATTACCCAAGGCCATCTCCTCCAGACCTTGGATGGGAT
GCTGCCTCAGCCTACAACACCAAAAAGACTGTGCTAGCAACAACTTATATTCTAATTTTCGGATTCACTAGAAATTCACGAAGCTGGTGGCATAATCAATTAACGAATCA
AGATAGAAGTGCTATTCTGACAGCAACAAAGGCAATTGTTAAACAAGAAGGCAACCCACAAGCAGGGACAGGCCAAACTGTTGTTTACAGCAATCAGGCAACTGAAGCAC
TCTTGGGAATGAGGTGCCGAAAAATGAGCGACTTCAAATCGTATAAAGACACCTTTTTGGCATGTCTTTATAATCTCACTTCCTGTGAAGATGAAATCTGGAAGCGAAAA
TTCGTGGAAGGATTGCCTCAATATATTGCCGAGAAATTCTATCAGAAAATGTCTGAGCATTTCGAAGGTAATACTATCAACTGGTCCACTTTAACCTACGGCGACATTAC
CAGTTCAATCCAAGCAATATGCTTAAGTCTTTGCAGAGATACTAAGCATAATTGA
mRNA sequence
ATGGATAAGCAAAATACTGTTCTCAAAACAGACTATCTAGGTCTACCAAAGATTGATCCCTACAAGTCGATCATCCAACCAAATACTCTTCCCATTGGAGTAATCAAGAA
TGATCCAGAAGACTTATTGAAGGAGATCAATAAGAAGCTCACTACCTTCAAATTTGATAAACCAAAAGCTGAAGCCTCTAAGAACATAGAAGCTTCTAGGAATATTAATA
TGATAAGAAAAGTCGAGGAATCTTTTTTAAAAATTCTTCCTATCAGAACCTACAACACTGATATGAAGAATTATTACCCAAGGCCATCTCCTCCAGACCTTGGATGGGAT
GCTGCCTCAGCCTACAACACCAAAAAGACTGTGCTAGCAACAACTTATATTCTAATTTTCGGATTCACTAGAAATTCACGAAGCTGGTGGCATAATCAATTAACGAATCA
AGATAGAAGTGCTATTCTGACAGCAACAAAGGCAATTGTTAAACAAGAAGGCAACCCACAAGCAGGGACAGGCCAAACTGTTGTTTACAGCAATCAGGCAACTGAAGCAC
TCTTGGGAATGAGGTGCCGAAAAATGAGCGACTTCAAATCGTATAAAGACACCTTTTTGGCATGTCTTTATAATCTCACTTCCTGTGAAGATGAAATCTGGAAGCGAAAA
TTCGTGGAAGGATTGCCTCAATATATTGCCGAGAAATTCTATCAGAAAATGTCTGAGCATTTCGAAGGTAATACTATCAACTGGTCCACTTTAACCTACGGCGACATTAC
CAGTTCAATCCAAGCAATATGCTTAAGTCTTTGCAGAGATACTAAGCATAATTGA
Protein sequence
MDKQNTVLKTDYLGLPKIDPYKSIIQPNTLPIGVIKNDPEDLLKEINKKLTTFKFDKPKAEASKNIEASRNINMIRKVEESFLKILPIRTYNTDMKNYYPRPSPPDLGWD
AASAYNTKKTVLATTYILIFGFTRNSRSWWHNQLTNQDRSAILTATKAIVKQEGNPQAGTGQTVVYSNQATEALLGMRCRKMSDFKSYKDTFLACLYNLTSCEDEIWKRK
FVEGLPQYIAEKFYQKMSEHFEGNTINWSTLTYGDITSSIQAICLSLCRDTKHN
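
The CDS listed above is 825 nt long and the protein 274 residues, which is consistent with 274 codons followed by one stop codon. The following Python sketch shows one way to check that a CDS translates to the listed protein. It assumes the standard genetic code (NCBI translation table 1), which the record does not state explicitly, and the function name `translate` is a hypothetical helper chosen for illustration.

```python
# Standard genetic code (NCBI table 1), with codons ordered by T, C, A, G
# at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds):
    """Translate a CDS into protein, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nt and last 12 nt of the CDS listed in this record.
print(translate("ATGGATAAGCAAAATACTGTTCTCAAAACA"))  # MDKQNTVLKT
print(translate("AAGCATAATTGA"))                    # KHN
```

Passing the full CDS, with its line breaks, to `translate` and comparing the result with the protein sequence gives a quick consistency check of the record.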