CuGenDBv2

Lag0038537 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0038537
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Unknown protein
Genome location: chr2:19902095..19903641
RNA-Seq Expression: Lag0038537
Synteny: Lag0038537
Gene Ontology terms: NA
InterPro domains: NA
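
The genome location above is written as chromosome:start..end. A minimal Python sketch for splitting such a string into its fields follows. The function name parse_location is hypothetical, and the span length assumes 1-based inclusive coordinates, which this record does not state explicitly.

```python
import re


def parse_location(location: str):
    """Split 'chr:start..end' into (chromosome, start, end, span_length).

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    chromosome = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    if end < start:
        raise ValueError("end coordinate is smaller than start coordinate")
    return chromosome, start, end, end - start + 1


print(parse_location("chr2:19902095..19903641"))
# prints ('chr2', 19902095, 19903641, 1547)
```

Under that assumption the gene spans 1547 bp on chr2.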


Homology
GenBank top hits (e value, % identity, alignment)
EXB53755.1 hypothetical protein L484_022412 [Morus notabilis] (e value: 4.1e-30; % identity: 36.62)
Query:  PQFLRTDIANHDWELFCAKLESVNAQVVREFYANIDKEDGFQVIVRGVKVDWSPSAINALYNLQNFPHAAYNEMVVAPSDEQLSDVVREVGIEGAQWQLS
        P F+   I  H W  FC    +    +VREFYAN+   +   V V+ VKV ++  AIN+++ L+      Y +     +DEQL  V+ EV IEGA WQ+S
Subjt:  PQFLRTDIANHDWELFCAKLESVNAQVVREFYANIDKEDGFQVIVRGVKVDWSPSAINALYNLQNFPHAAYNEMVVAPSDEQLSDVVREVGIEGAQWQLS

Query:  KTQKRTFQSTYFKREANMWMRFIRQRMLPTTHDSTISKERILLAFAILRSLSINVGRIIASEISGC-WKKKVGKLFFPNTITMLCRKAGVPVDEGDVILF
             T      KR A +W  F+  R +P+TH  T++K+R+LL ++IL  +S+N+  I   EI  C   +K G L+FP+ IT L  KA VP  + + I+ 
Subjt:  KTQKRTFQSTYFKREANMWMRFIRQRMLPTTHDSTISKERILLAFAILRSLSINVGRIIASEISGC-WKKKVGKLFFPNTITMLCRKAGVPVDEGDVILF

Query:  DKGIIDTSNLARL
        + G I T +++R+
Subjt:  DKGIIDTSNLARL

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii] (e value: 4.5e-29; % identity: 36.57)
Query:  EREKKEAEEKVREEAE-KKAKEEQLLKRR---AEKGKNIVEASKEHDEIEEQHGDLPQFLRTDIANHDWELFCAKLESVNAQVVREFYANIDKEDGFQVI
        +R  ++A + V+ E E  + + E  ++ R   AEKG  +        +  E  G LP F+   I  H+W+ FCA  E     +VREFYAN+       V 
Subjt:  EREKKEAEEKVREEAE-KKAKEEQLLKRR---AEKGKNIVEASKEHDEIEEQHGDLPQFLRTDIANHDWELFCAKLESVNAQVVREFYANIDKEDGFQVI

Query:  VRGVKVDWSPSAINALYNLQNFPHAAYNEMVVAPSDEQLSDVVREVGIEGAQWQLSKTQKRTFQSTYFKREANMWMRFIRQRMLPTTHDSTISKERILLA
        VRGV+V WS  AINA++ L + P   ++E +   ++  L  V+  V + GA+W +S     T   +     A +W  F++  +LPTTH  T+SK+R+LL 
Subjt:  VRGVKVDWSPSAINALYNLQNFPHAAYNEMVVAPSDEQLSDVVREVGIEGAQWQLSKTQKRTFQSTYFKREANMWMRFIRQRMLPTTHDSTISKERILLA

Query:  FAILRSLSINVGRIIASEISGCWKKKVGKLFFPNTITMLCRKAGVPVDEGDVILFDKGIIDTSNLARL
         ++L   SINVGR+I SEI  C  +K G LFFP+ IT LCR A  P    +  L + G ID   +AR+
Subjt:  FAILRSLSINVGRIIASEISGCWKKKVGKLFFPNTITMLCRKAGVPVDEGDVILFDKGIIDTSNLARL

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii] (e value: 4.6e-34; % identity: 33.33)
Query:  EREKKEAEEKVREEAEKKA-KEEQLLKRR---AEKGKNIVEASKEHDEIEEQHGDLPQFLRTDIANHDWELFCAKLESVNAQVVREFYANIDKEDGFQVI
        +R  ++A + V+ E E  A + E  ++ R   AEKG  +        +  E  G LP F+   I  H+W+ FCA  E     +VREFYAN+   +   V 
Subjt:  EREKKEAEEKVREEAEKKA-KEEQLLKRR---AEKGKNIVEASKEHDEIEEQHGDLPQFLRTDIANHDWELFCAKLESVNAQVVREFYANIDKEDGFQVI

Query:  VRGVKVDWSPSAINALYNLQNFPHAAYNEMVVAPSDEQLSDVVREVGIEGAQWQLSKTQKRTFQSTYFKREANMWMRFIRQRMLPTTHDSTISKERILLA
        VRGV+V WS  AINA++ L + P   ++E +   + + L  V+  V   GA+W +S     T   +     A +W  F++ R+LPTTH  T+SK+R+LL 
Subjt:  VRGVKVDWSPSAINALYNLQNFPHAAYNEMVVAPSDEQLSDVVREVGIEGAQWQLSKTQKRTFQSTYFKREANMWMRFIRQRMLPTTHDSTISKERILLA

Query:  FAILRSLSINVGRIIASEISGCWKKKVGKLFFPNTITMLCRKAGVPVDEGDVILFDKGIIDTSNLARLLR---MQEVRQ---------------GGLIYD
         ++L   SINVGR+I SEI  C  +K G LFFP+ IT LCR A  P    +  L + G ID   +AR+ +    +  +Q               G ++  
Subjt:  FAILRSLSINVGRIIASEISGCWKKKVGKLFFPNTITMLCRKAGVPVDEGDVILFDKGIIDTSNLARLLR---MQEVRQ---------------GGLIYD

Query:  INTILEQLALSASRQ-------EFAERQTLTFWNYVKNRDAGLKRALQEIF
        +  + ++L+    +Q       +   +Q   FW Y K RD  LK+ALQ  F
Subjt:  INTILEQLALSASRQ-------EFAERQTLTFWNYVKNRDAGLKRALQEIF

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii] (e value: 9.4e-27; % identity: 36.29)
Query:  EKVREEAEKKAK-EEQLLKRRAEKGKNIVEASKEHDEIEEQHGDL--PQFLRTDIANHDWELFCAKLESVNAQVVREFYANIDKEDGFQVIVRGVKVDWS
        E+V   A K  K E +  + R E+       S E + + +    L  P F+   I  H+W+LFCA  E     +VREFY N+   D   V +RGV+V  S
Subjt:  EKVREEAEKKAK-EEQLLKRRAEKGKNIVEASKEHDEIEEQHGDL--PQFLRTDIANHDWELFCAKLESVNAQVVREFYANIDKEDGFQVIVRGVKVDWS

Query:  PSAINALYNLQNFPHAAYNEMVVAPSDEQLSDVVREVGIEGAQWQLSKTQKRTFQSTYFKREANMWMRFIRQRMLPTTHDSTISKERILLAFAILRSLSI
          AIN +++L + P   ++E V   +  +L  V+  V I GA+W +S     T   +     A +W  F++ R+LPTTH  T+SKE + L +++L   SI
Subjt:  PSAINALYNLQNFPHAAYNEMVVAPSDEQLSDVVREVGIEGAQWQLSKTQKRTFQSTYFKREANMWMRFIRQRMLPTTHDSTISKERILLAFAILRSLSI

Query:  NVGRIIASEISGCWKKKVGKLFFPNTITMLCRKAGVP
        NVGR+I  EI  C  +K G LFFP+ IT +CR    P
Subjt:  NVGRIIASEISGCWKKKVGKLFFPNTITMLCRKAGVP

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii] (e value: 3.9e-33; % identity: 36.82)
Query:  VVREFYANIDKEDGFQVIVRGVKVDWSPSAINALYNLQNFPHAAYNEMVVAPSDEQLSDVVREVGIEGAQWQLSKTQKRTFQSTYFKREANMWMRFIRQR
        +VREFYAN+   +   + VRGV+V WS  AINA++ L + P   ++E +   ++ +L  V+  V   GA+W +S     T   +     A +W  F++ R
Subjt:  VVREFYANIDKEDGFQVIVRGVKVDWSPSAINALYNLQNFPHAAYNEMVVAPSDEQLSDVVREVGIEGAQWQLSKTQKRTFQSTYFKREANMWMRFIRQR

Query:  MLPTTHDSTISKERILLAFAILRSLSINVGRIIASEISGCWKKKVGKLFFPNTITMLCRKAGVPVDEGDVILFDKGIIDTSNLARLL-------------
        +LPTTH   +SK+R+LL  ++L   SINVGR+I SEI  C  +K G LFFP+ IT LCR A   V+E    L + G ID   +AR+              
Subjt:  MLPTTHDSTISKERILLAFAILRSLSINVGRIIASEISGCWKKKVGKLFFPNTITMLCRKAGVPVDEGDVILFDKGIIDTSNLARLL-------------

Query:  -RMQEVRQGGLIYDINTILEQLALSASRQEFAERQTLTFWNYVKNRDAGLKRALQEIF
         R           D+   L+ L    S+QE   +Q   FW Y K RD  LK+ALQ  F
Subjt:  -RMQEVRQGGLIYDINTILEQLALSASRQEFAERQTLTFWNYVKNRDAGLKRALQEIF

TrEMBL top hits (e value, % identity, alignment)
A0A2P5AGA5 Uncharacterized protein (Fragment) (e value: 2.2e-29; % identity: 36.57)
Query:  EREKKEAEEKVREEAE-KKAKEEQLLKRR---AEKGKNIVEASKEHDEIEEQHGDLPQFLRTDIANHDWELFCAKLESVNAQVVREFYANIDKEDGFQVI
        +R  ++A + V+ E E  + + E  ++ R   AEKG  +        +  E  G LP F+   I  H+W+ FCA  E     +VREFYAN+       V 
Subjt:  EREKKEAEEKVREEAE-KKAKEEQLLKRR---AEKGKNIVEASKEHDEIEEQHGDLPQFLRTDIANHDWELFCAKLESVNAQVVREFYANIDKEDGFQVI

Query:  VRGVKVDWSPSAINALYNLQNFPHAAYNEMVVAPSDEQLSDVVREVGIEGAQWQLSKTQKRTFQSTYFKREANMWMRFIRQRMLPTTHDSTISKERILLA
        VRGV+V WS  AINA++ L + P   ++E +   ++  L  V+  V + GA+W +S     T   +     A +W  F++  +LPTTH  T+SK+R+LL 
Subjt:  VRGVKVDWSPSAINALYNLQNFPHAAYNEMVVAPSDEQLSDVVREVGIEGAQWQLSKTQKRTFQSTYFKREANMWMRFIRQRMLPTTHDSTISKERILLA

Query:  FAILRSLSINVGRIIASEISGCWKKKVGKLFFPNTITMLCRKAGVPVDEGDVILFDKGIIDTSNLARL
         ++L   SINVGR+I SEI  C  +K G LFFP+ IT LCR A  P    +  L + G ID   +AR+
Subjt:  FAILRSLSINVGRIIASEISGCWKKKVGKLFFPNTITMLCRKAGVPVDEGDVILFDKGIIDTSNLARL

A0A2P5BCG4 Uncharacterized protein (Fragment) (e value: 2.2e-34; % identity: 33.33)
Query:  EREKKEAEEKVREEAEKKA-KEEQLLKRR---AEKGKNIVEASKEHDEIEEQHGDLPQFLRTDIANHDWELFCAKLESVNAQVVREFYANIDKEDGFQVI
        +R  ++A + V+ E E  A + E  ++ R   AEKG  +        +  E  G LP F+   I  H+W+ FCA  E     +VREFYAN+   +   V 
Subjt:  EREKKEAEEKVREEAEKKA-KEEQLLKRR---AEKGKNIVEASKEHDEIEEQHGDLPQFLRTDIANHDWELFCAKLESVNAQVVREFYANIDKEDGFQVI

Query:  VRGVKVDWSPSAINALYNLQNFPHAAYNEMVVAPSDEQLSDVVREVGIEGAQWQLSKTQKRTFQSTYFKREANMWMRFIRQRMLPTTHDSTISKERILLA
        VRGV+V WS  AINA++ L + P   ++E +   + + L  V+  V   GA+W +S     T   +     A +W  F++ R+LPTTH  T+SK+R+LL 
Subjt:  VRGVKVDWSPSAINALYNLQNFPHAAYNEMVVAPSDEQLSDVVREVGIEGAQWQLSKTQKRTFQSTYFKREANMWMRFIRQRMLPTTHDSTISKERILLA

Query:  FAILRSLSINVGRIIASEISGCWKKKVGKLFFPNTITMLCRKAGVPVDEGDVILFDKGIIDTSNLARLLR---MQEVRQ---------------GGLIYD
         ++L   SINVGR+I SEI  C  +K G LFFP+ IT LCR A  P    +  L + G ID   +AR+ +    +  +Q               G ++  
Subjt:  FAILRSLSINVGRIIASEISGCWKKKVGKLFFPNTITMLCRKAGVPVDEGDVILFDKGIIDTSNLARLLR---MQEVRQ---------------GGLIYD

Query:  INTILEQLALSASRQ-------EFAERQTLTFWNYVKNRDAGLKRALQEIF
        +  + ++L+    +Q       +   +Q   FW Y K RD  LK+ALQ  F
Subjt:  INTILEQLALSASRQ-------EFAERQTLTFWNYVKNRDAGLKRALQEIF

A0A2P5DAQ2 Uncharacterized protein (e value: 4.5e-27; % identity: 36.29)
Query:  EKVREEAEKKAK-EEQLLKRRAEKGKNIVEASKEHDEIEEQHGDL--PQFLRTDIANHDWELFCAKLESVNAQVVREFYANIDKEDGFQVIVRGVKVDWS
        E+V   A K  K E +  + R E+       S E + + +    L  P F+   I  H+W+LFCA  E     +VREFY N+   D   V +RGV+V  S
Subjt:  EKVREEAEKKAK-EEQLLKRRAEKGKNIVEASKEHDEIEEQHGDL--PQFLRTDIANHDWELFCAKLESVNAQVVREFYANIDKEDGFQVIVRGVKVDWS

Query:  PSAINALYNLQNFPHAAYNEMVVAPSDEQLSDVVREVGIEGAQWQLSKTQKRTFQSTYFKREANMWMRFIRQRMLPTTHDSTISKERILLAFAILRSLSI
          AIN +++L + P   ++E V   +  +L  V+  V I GA+W +S     T   +     A +W  F++ R+LPTTH  T+SKE + L +++L   SI
Subjt:  PSAINALYNLQNFPHAAYNEMVVAPSDEQLSDVVREVGIEGAQWQLSKTQKRTFQSTYFKREANMWMRFIRQRMLPTTHDSTISKERILLAFAILRSLSI

Query:  NVGRIIASEISGCWKKKVGKLFFPNTITMLCRKAGVP
        NVGR+I  EI  C  +K G LFFP+ IT +CR    P
Subjt:  NVGRIIASEISGCWKKKVGKLFFPNTITMLCRKAGVP

A0A2P5DXM3 Uncharacterized protein (e value: 1.9e-33; % identity: 36.82)
Query:  VVREFYANIDKEDGFQVIVRGVKVDWSPSAINALYNLQNFPHAAYNEMVVAPSDEQLSDVVREVGIEGAQWQLSKTQKRTFQSTYFKREANMWMRFIRQR
        +VREFYAN+   +   + VRGV+V WS  AINA++ L + P   ++E +   ++ +L  V+  V   GA+W +S     T   +     A +W  F++ R
Subjt:  VVREFYANIDKEDGFQVIVRGVKVDWSPSAINALYNLQNFPHAAYNEMVVAPSDEQLSDVVREVGIEGAQWQLSKTQKRTFQSTYFKREANMWMRFIRQR

Query:  MLPTTHDSTISKERILLAFAILRSLSINVGRIIASEISGCWKKKVGKLFFPNTITMLCRKAGVPVDEGDVILFDKGIIDTSNLARLL-------------
        +LPTTH   +SK+R+LL  ++L   SINVGR+I SEI  C  +K G LFFP+ IT LCR A   V+E    L + G ID   +AR+              
Subjt:  MLPTTHDSTISKERILLAFAILRSLSINVGRIIASEISGCWKKKVGKLFFPNTITMLCRKAGVPVDEGDVILFDKGIIDTSNLARLL-------------

Query:  -RMQEVRQGGLIYDINTILEQLALSASRQEFAERQTLTFWNYVKNRDAGLKRALQEIF
         R           D+   L+ L    S+QE   +Q   FW Y K RD  LK+ALQ  F
Subjt:  -RMQEVRQGGLIYDINTILEQLALSASRQEFAERQTLTFWNYVKNRDAGLKRALQEIF

W9QTD9 Uncharacterized protein (e value: 2.0e-30; % identity: 36.62)
Query:  PQFLRTDIANHDWELFCAKLESVNAQVVREFYANIDKEDGFQVIVRGVKVDWSPSAINALYNLQNFPHAAYNEMVVAPSDEQLSDVVREVGIEGAQWQLS
        P F+   I  H W  FC    +    +VREFYAN+   +   V V+ VKV ++  AIN+++ L+      Y +     +DEQL  V+ EV IEGA WQ+S
Subjt:  PQFLRTDIANHDWELFCAKLESVNAQVVREFYANIDKEDGFQVIVRGVKVDWSPSAINALYNLQNFPHAAYNEMVVAPSDEQLSDVVREVGIEGAQWQLS

Query:  KTQKRTFQSTYFKREANMWMRFIRQRMLPTTHDSTISKERILLAFAILRSLSINVGRIIASEISGC-WKKKVGKLFFPNTITMLCRKAGVPVDEGDVILF
             T      KR A +W  F+  R +P+TH  T++K+R+LL ++IL  +S+N+  I   EI  C   +K G L+FP+ IT L  KA VP  + + I+ 
Subjt:  KTQKRTFQSTYFKREANMWMRFIRQRMLPTTHDSTISKERILLAFAILRSLSINVGRIIASEISGC-WKKKVGKLFFPNTITMLCRKAGVPVDEGDVILF

Query:  DKGIIDTSNLARL
        + G I T +++R+
Subjt:  DKGIIDTSNLARL

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGCAAAAACGAGAGCTAGAAAAGAGAGGGAGAGTGAGGAGGAAGAGATATTCGTTACCCCCGAAGCTCAGAAAGTAAAAACCAAGAAGAAAAGAACACCGGAGGAGAA
AGAAGCTAAAAGAAGAAGACGACAACAGTGGGCTGAAGAACAAGAAAAGGCAACAGAGGATGAGGCTATTGCAATAGAAGGAGGAGACCCGAAAGAATCTGATAAACAGA
ATCCAGAAGAGGATGAGCAGGGAATGACGGCTACAGAAGAATTCGAGAGGAAATTCAGGAGAAACAACGTGAGGATGTACAGGCAGAGGCTGAGATTGAAAGAGAACCAG
TTCAGGAGGCTCGTATTGAGGACTGATACCCCATTGCCTCCATCGTCAGATTCCGAGAGAGAGAAGGCAGAGCGAGAGGAACGAGAGAAAAAAGAGGCTGAGGAAAAAGT
GCGAGAAGAAGCAGAGAAGAAGGCTAAGGAGGAACAGTTGCTCAAGCGAAGGGCGGAAAAGGGCAAAAACATTGTTGAAGCATCGAAGGAACACGATGAAATAGAAGAAC
AACACGGTGATCTTCCACAGTTTCTGAGGACCGATATTGCAAACCACGACTGGGAGCTGTTTTGTGCGAAGCTGGAGTCTGTAAACGCACAGGTGGTGCGTGAATTCTAT
GCCAACATTGATAAAGAGGATGGTTTCCAGGTAATTGTCCGAGGAGTCAAGGTAGATTGGAGTCCGAGTGCTATCAACGCACTGTATAATCTTCAGAACTTCCCCCATGC
TGCATATAATGAGATGGTTGTGGCGCCATCTGATGAGCAACTAAGTGATGTTGTGCGGGAGGTAGGAATTGAAGGGGCACAGTGGCAGTTATCCAAAACTCAGAAGAGGA
CATTCCAGTCGACTTATTTTAAAAGGGAAGCGAATATGTGGATGAGATTTATTAGACAGAGGATGCTTCCAACAACACATGACTCGACAATCTCCAAGGAACGGATTCTC
CTAGCTTTTGCCATCTTGCGGTCTCTCAGTATTAACGTAGGAAGGATCATTGCGAGTGAAATTTCTGGTTGCTGGAAAAAAAAGGTGGGGAAGCTGTTCTTCCCAAATAC
AATTACAATGCTTTGCAGAAAAGCAGGGGTTCCAGTGGATGAGGGAGATGTGATTCTGTTTGACAAAGGGATCATCGACACGTCCAATTTGGCACGGCTCCTGCGCATGC
AGGAGGTACGTCAAGGTGGGCTTATCTACGACATTAACACGATTCTAGAACAACTGGCACTTTCGGCCAGTAGGCAGGAGTTTGCTGAAAGGCAAACTTTAACCTTCTGG
AACTACGTTAAGAATCGGGATGCCGGCTTAAAGAGGGCGCTGCAAGAAATTTTTCAAAACCATACCTAG
mRNA sequence
ATGGCAAAAACGAGAGCTAGAAAAGAGAGGGAGAGTGAGGAGGAAGAGATATTCGTTACCCCCGAAGCTCAGAAAGTAAAAACCAAGAAGAAAAGAACACCGGAGGAGAA
AGAAGCTAAAAGAAGAAGACGACAACAGTGGGCTGAAGAACAAGAAAAGGCAACAGAGGATGAGGCTATTGCAATAGAAGGAGGAGACCCGAAAGAATCTGATAAACAGA
ATCCAGAAGAGGATGAGCAGGGAATGACGGCTACAGAAGAATTCGAGAGGAAATTCAGGAGAAACAACGTGAGGATGTACAGGCAGAGGCTGAGATTGAAAGAGAACCAG
TTCAGGAGGCTCGTATTGAGGACTGATACCCCATTGCCTCCATCGTCAGATTCCGAGAGAGAGAAGGCAGAGCGAGAGGAACGAGAGAAAAAAGAGGCTGAGGAAAAAGT
GCGAGAAGAAGCAGAGAAGAAGGCTAAGGAGGAACAGTTGCTCAAGCGAAGGGCGGAAAAGGGCAAAAACATTGTTGAAGCATCGAAGGAACACGATGAAATAGAAGAAC
AACACGGTGATCTTCCACAGTTTCTGAGGACCGATATTGCAAACCACGACTGGGAGCTGTTTTGTGCGAAGCTGGAGTCTGTAAACGCACAGGTGGTGCGTGAATTCTAT
GCCAACATTGATAAAGAGGATGGTTTCCAGGTAATTGTCCGAGGAGTCAAGGTAGATTGGAGTCCGAGTGCTATCAACGCACTGTATAATCTTCAGAACTTCCCCCATGC
TGCATATAATGAGATGGTTGTGGCGCCATCTGATGAGCAACTAAGTGATGTTGTGCGGGAGGTAGGAATTGAAGGGGCACAGTGGCAGTTATCCAAAACTCAGAAGAGGA
CATTCCAGTCGACTTATTTTAAAAGGGAAGCGAATATGTGGATGAGATTTATTAGACAGAGGATGCTTCCAACAACACATGACTCGACAATCTCCAAGGAACGGATTCTC
CTAGCTTTTGCCATCTTGCGGTCTCTCAGTATTAACGTAGGAAGGATCATTGCGAGTGAAATTTCTGGTTGCTGGAAAAAAAAGGTGGGGAAGCTGTTCTTCCCAAATAC
AATTACAATGCTTTGCAGAAAAGCAGGGGTTCCAGTGGATGAGGGAGATGTGATTCTGTTTGACAAAGGGATCATCGACACGTCCAATTTGGCACGGCTCCTGCGCATGC
AGGAGGTACGTCAAGGTGGGCTTATCTACGACATTAACACGATTCTAGAACAACTGGCACTTTCGGCCAGTAGGCAGGAGTTTGCTGAAAGGCAAACTTTAACCTTCTGG
AACTACGTTAAGAATCGGGATGCCGGCTTAAAGAGGGCGCTGCAAGAAATTTTTCAAAACCATACCTAG
Protein sequence
MAKTRARKERESEEEEIFVTPEAQKVKTKKKRTPEEKEAKRRRRQQWAEEQEKATEDEAIAIEGGDPKESDKQNPEEDEQGMTATEEFERKFRRNNVRMYRQRLRLKENQ
FRRLVLRTDTPLPPSSDSEREKAEREEREKKEAEEKVREEAEKKAKEEQLLKRRAEKGKNIVEASKEHDEIEEQHGDLPQFLRTDIANHDWELFCAKLESVNAQVVREFY
ANIDKEDGFQVIVRGVKVDWSPSAINALYNLQNFPHAAYNEMVVAPSDEQLSDVVREVGIEGAQWQLSKTQKRTFQSTYFKREANMWMRFIRQRMLPTTHDSTISKERIL
LAFAILRSLSINVGRIIASEISGCWKKKVGKLFFPNTITMLCRKAGVPVDEGDVILFDKGIIDTSNLARLLRMQEVRQGGLIYDINTILEQLALSASRQEFAERQTLTFW
NYVKNRDAGLKRALQEIFQNHT
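
The protein sequence above corresponds to the CDS read codon by codon. A minimal Python sketch of that relationship follows, assuming the standard genetic code applies to this nuclear gene. The names CODON_TABLE and translate are hypothetical, and the two inputs are the first 30 and last 15 nucleotides of the CDS listed above.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order
# for the first, second, and third base.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    first + second + third: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, first in enumerate(BASES)
    for j, second in enumerate(BASES)
    for k, third in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon.

    Whitespace and line breaks are removed first. A codon containing
    anything other than A, C, G, or T raises KeyError.
    """
    sequence = "".join(cds.split()).upper()
    if len(sequence) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for position in range(0, len(sequence), 3):
        residue = CODON_TABLE[sequence[position:position + 3]]
        if residue == "*":  # stop codon ends translation
            break
        residues.append(residue)
    return "".join(residues)


print(translate("ATGGCAAAAACGAGAGCTAGAAAAGAGAGG"))  # prints MAKTRARKER
print(translate("CAAAACCATACCTAG"))                 # prints QNHT
```

Both outputs agree with the start (MAKTRARKER) and the end (QNHT) of the protein sequence listed above.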