; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0021385 (gene) of Chayote v1 genome

Gene IDSed0021385
OrganismSechium edule (Chayote v1)
DescriptionB3 domain-containing protein At2g31720-like
Genome locationLG08:30891254..30891993
RNA-Seq ExpressionSed0021385
SyntenySed0021385
Gene Ontology termsGO:0003677 - DNA binding (molecular function)
InterPro domainsIPR005508 - B3 domain-containing protein At2g31720-like
IPR015300 - DNA-binding pseudobarrel domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6599340.1 B3 domain-containing protein, partial [Cucurbita argyrosperma subsp. sororia]1.1e-3948Show/hide
Query:  YTNCPGPHTHP--TQTQYLAADTTAPPIDG-----HPLSAIPPNNAYDPEPKTPTSEENRNDPDAEIENGDDPAAGILPAEITAVITGRMGGYRVQKVIE
        Y +CPG H +P  + T     D  A  + G      P++ IP      PEP       N +DPD EI+     A   +PAE+   IT RMGGYRVQ +IE
Subjt:  YTNCPGPHTHP--TQTQYLAADTTAPPIDG-----HPLSAIPPNNAYDPEPKTPTSEENRNDPDAEIENGDDPAAGILPAEITAVITGRMGGYRVQKVIE

Query:  KALTATDMKKDQARMLLPRKRAKMEFLTAAEVECLEDRGGG---LNYMDTVIVGPDLVETPIRFKMWLMGTSFSGCLTKEWNSFAKTYGLKVGNKVRVWS
        K LTATDM+ +Q R+ LP+K  ++EF+TA E + LE R  G   LNY+DT IVG DL   PIRFK WLMG ++  CL  +WNSFA+ YGLK G KVRVWS
Subjt:  KALTATDMKKDQARMLLPRKRAKMEFLTAAEVECLEDRGGG---LNYMDTVIVGPDLVETPIRFKMWLMGTSFSGCLTKEWNSFAKTYGLKVGNKVRVWS

Query:  FRVDRRDDQNVSNLNCALRFLISKI
        FRVD RD+ +V  +   LRF + KI
Subjt:  FRVDRRDDQNVSNLNCALRFLISKI

XP_022946870.1 B3 domain-containing protein At2g31720-like [Cucurbita moschata]8.4e-4048Show/hide
Query:  YTNCPGPHTHP--TQTQYLAADTTAPPIDG-----HPLSAIPPNNAYDPEPKTPTSEENRNDPDAEIENGDDPAAGILPAEITAVITGRMGGYRVQKVIE
        Y +CPG H +P  + T     D  A  + G      P++ IP      PEP       N +DPD EI+     A   +PAE+   IT RMGGYRVQ +IE
Subjt:  YTNCPGPHTHP--TQTQYLAADTTAPPIDG-----HPLSAIPPNNAYDPEPKTPTSEENRNDPDAEIENGDDPAAGILPAEITAVITGRMGGYRVQKVIE

Query:  KALTATDMKKDQARMLLPRKRAKMEFLTAAEVECLEDRGGG---LNYMDTVIVGPDLVETPIRFKMWLMGTSFSGCLTKEWNSFAKTYGLKVGNKVRVWS
        K LTATDM+ +Q R+ LP+K  ++EF+TA E + LE R  G   LNY+DT IVG DL   PIRFK WLMG ++  CL  +WNSFA+ YGLK G KVRVWS
Subjt:  KALTATDMKKDQARMLLPRKRAKMEFLTAAEVECLEDRGGG---LNYMDTVIVGPDLVETPIRFKMWLMGTSFSGCLTKEWNSFAKTYGLKVGNKVRVWS

Query:  FRVDRRDDQNVSNLNCALRFLISKI
        FRVD RD+ +V  +   LRF + KI
Subjt:  FRVDRRDDQNVSNLNCALRFLISKI

XP_022990807.1 B3 domain-containing protein At2g32645-like [Cucurbita maxima]2.5e-1539.23Show/hide
Query:  PAAGILPAEITAVITGRMGGYRVQKVIEKALTATDMKKDQARMLLPRKRAKMEFLTAAEVECL-EDRGGGLNYMDTVIVGPDLVETPIRFKMWLMGTSFS
        P    +PA +   I G MG Y++Q  I+K L  TDM K+  R+  P K+ K++F T  E   L +   GG   M+ +IV P L E+ I  K+W +G+  S
Subjt:  PAAGILPAEITAVITGRMGGYRVQKVIEKALTATDMKKDQARMLLPRKRAKMEFLTAAEVECL-EDRGGGLNYMDTVIVGPDLVETPIRFKMWLMGTSFS

Query:  GCLTKEWNSFAKTYGLKVGNKVRVWSFRVD
         CL  +WNS  +  G K G+ V++WSFR D
Subjt:  GCLTKEWNSFAKTYGLKVGNKVRVWSFRVD

XP_022998957.1 B3 domain-containing protein At2g31720-like [Cucurbita maxima]1.1e-3950.75Show/hide
Query:  HPLSAIPPNNAYDPEPKTPTSEENRNDPDAEIE----NGDDPAAGI-------LPAEITAVITGRMGGYRVQKVIEKALTATDMKKDQARMLLPRKRAKM
        +P  +  P N  DPE    +   N N P AEI     N DDP   I       +PAE+   IT RMGGYRVQ +IEK LTATDM+ +Q R+ LP+K  ++
Subjt:  HPLSAIPPNNAYDPEPKTPTSEENRNDPDAEIE----NGDDPAAGI-------LPAEITAVITGRMGGYRVQKVIEKALTATDMKKDQARMLLPRKRAKM

Query:  EFLTAAEVECLEDRGGG---LNYMDTVIVGPDLVETPIRFKMWLMGTSFSGCLTKEWNSFAKTYGLKVGNKVRVWSFRVDRRDDQNVSNLNCALRFLISK
        EF+TA E + LE R  G   LNY+DT IVG DL   PIRFK WLMG ++  CL  +WNSFA+ YGLK G KVRVWSFRVD RD+ +V  +   LRF + K
Subjt:  EFLTAAEVECLEDRGGG---LNYMDTVIVGPDLVETPIRFKMWLMGTSFSGCLTKEWNSFAKTYGLKVGNKVRVWSFRVDRRDDQNVSNLNCALRFLISK

Query:  I
        I
Subjt:  I

XP_023545398.1 B3 domain-containing protein At2g31720-like [Cucurbita pepo subsp. pepo]1.9e-3947.25Show/hide
Query:  YTNCPGPHTHPTQTQYLAADTTAPPIDGHPLSAIPPNNAYDPEPKTPTSEENRNDPDAEIENGDDPAAGILPAEITAVITGRMGGYRVQKVIEKALTATD
        Y +CPG H +P       + T    +D    +     N   P  + P    N +DPD EI+     A   +PAE+   IT RMGGYRVQ +IEK LTATD
Subjt:  YTNCPGPHTHPTQTQYLAADTTAPPIDGHPLSAIPPNNAYDPEPKTPTSEENRNDPDAEIENGDDPAAGILPAEITAVITGRMGGYRVQKVIEKALTATD

Query:  MKKDQARMLLPRKRAKMEFLTAAEVECLEDRGGG---LNYMDTVIVGPDLVETPIRFKMWLMGTSFSGCLTKEWNSFAKTYGLKVGNKVRVWSFRVDRRD
        M+ +Q R+ LP+K  ++EF+TA E + LE R  G   LNY+DT IVG DL   PIRFK WLMG ++  CL  +WNSFA+ YGLK G KVRVWSFRVD RD
Subjt:  MKKDQARMLLPRKRAKMEFLTAAEVECLEDRGGG---LNYMDTVIVGPDLVETPIRFKMWLMGTSFSGCLTKEWNSFAKTYGLKVGNKVRVWSFRVDRRD

Query:  DQNVSNLNCALRFLISKI
        + +V  +   LRF + KI
Subjt:  DQNVSNLNCALRFLISKI

TrEMBL top hitse value%identityAlignment
A0A6J1G587 B3 domain-containing protein At2g31720-like4.1e-4048Show/hide
Query:  YTNCPGPHTHP--TQTQYLAADTTAPPIDG-----HPLSAIPPNNAYDPEPKTPTSEENRNDPDAEIENGDDPAAGILPAEITAVITGRMGGYRVQKVIE
        Y +CPG H +P  + T     D  A  + G      P++ IP      PEP       N +DPD EI+     A   +PAE+   IT RMGGYRVQ +IE
Subjt:  YTNCPGPHTHP--TQTQYLAADTTAPPIDG-----HPLSAIPPNNAYDPEPKTPTSEENRNDPDAEIENGDDPAAGILPAEITAVITGRMGGYRVQKVIE

Query:  KALTATDMKKDQARMLLPRKRAKMEFLTAAEVECLEDRGGG---LNYMDTVIVGPDLVETPIRFKMWLMGTSFSGCLTKEWNSFAKTYGLKVGNKVRVWS
        K LTATDM+ +Q R+ LP+K  ++EF+TA E + LE R  G   LNY+DT IVG DL   PIRFK WLMG ++  CL  +WNSFA+ YGLK G KVRVWS
Subjt:  KALTATDMKKDQARMLLPRKRAKMEFLTAAEVECLEDRGGG---LNYMDTVIVGPDLVETPIRFKMWLMGTSFSGCLTKEWNSFAKTYGLKVGNKVRVWS

Query:  FRVDRRDDQNVSNLNCALRFLISKI
        FRVD RD+ +V  +   LRF + KI
Subjt:  FRVDRRDDQNVSNLNCALRFLISKI

A0A6J1GM07 putative B3 domain-containing protein At3g248501.0e-1440Show/hide
Query:  GRMGGYRVQKVIEKALTATDMKKDQARMLLPRKRAKMEFLTAAEVECL-EDRGGGLNYMDTVIVGPDLVETPIRFKMWLMGTSFSGCLTKEWNSFAKTYG
        G MG Y++Q  I+K L  TDM K+  R+  P K+ K++F T  E   L +   GG   M+ +IV P L E+ I  K+W +G+  S CL  +WN   +  G
Subjt:  GRMGGYRVQKVIEKALTATDMKKDQARMLLPRKRAKMEFLTAAEVECL-EDRGGGLNYMDTVIVGPDLVETPIRFKMWLMGTSFSGCLTKEWNSFAKTYG

Query:  LKVGNKVRVWSFRVD
         K G+ V++WSFR D
Subjt:  LKVGNKVRVWSFRVD

A0A6J1H5Z0 putative B3 domain-containing protein At3g248505.0e-1433.51Show/hide
Query:  THPTQTQYLAADTTAPPIDGHPLSAIPPNNAYDP-EPKTPTSEENRNDPDAEIENGDDPAAGILPAEITAVITGRMGGYRVQKVIEKALTATDMKKDQAR
        TH   + YL   TT    +G P++   P     P   + P +E  R       +    P    +PA +   I G MG Y++Q  I+K L  TDM K+  R
Subjt:  THPTQTQYLAADTTAPPIDGHPLSAIPPNNAYDP-EPKTPTSEENRNDPDAEIENGDDPAAGILPAEITAVITGRMGGYRVQKVIEKALTATDMKKDQAR

Query:  MLLPRKRAKMEFLTAAEVECL-EDRGGGLNYMDTVIVGPDLVETPIRFKMWLMGTSFSGCLTKEWNSFAKTYGLKVGNKVRVWSFRVD
        +  P K+ K++F T  E   L +    G   M+ +IV P L E+ I  K+W +G+  S CL  +WNS  +  G K G+ V++WSFR D
Subjt:  MLLPRKRAKMEFLTAAEVECL-EDRGGGLNYMDTVIVGPDLVETPIRFKMWLMGTSFSGCLTKEWNSFAKTYGLKVGNKVRVWSFRVD

A0A6J1JKC5 B3 domain-containing protein At2g32645-like1.2e-1539.23Show/hide
Query:  PAAGILPAEITAVITGRMGGYRVQKVIEKALTATDMKKDQARMLLPRKRAKMEFLTAAEVECL-EDRGGGLNYMDTVIVGPDLVETPIRFKMWLMGTSFS
        P    +PA +   I G MG Y++Q  I+K L  TDM K+  R+  P K+ K++F T  E   L +   GG   M+ +IV P L E+ I  K+W +G+  S
Subjt:  PAAGILPAEITAVITGRMGGYRVQKVIEKALTATDMKKDQARMLLPRKRAKMEFLTAAEVECL-EDRGGGLNYMDTVIVGPDLVETPIRFKMWLMGTSFS

Query:  GCLTKEWNSFAKTYGLKVGNKVRVWSFRVD
         CL  +WNS  +  G K G+ V++WSFR D
Subjt:  GCLTKEWNSFAKTYGLKVGNKVRVWSFRVD

A0A6J1K9G9 B3 domain-containing protein At2g31720-like5.3e-4050.75Show/hide
Query:  HPLSAIPPNNAYDPEPKTPTSEENRNDPDAEIE----NGDDPAAGI-------LPAEITAVITGRMGGYRVQKVIEKALTATDMKKDQARMLLPRKRAKM
        +P  +  P N  DPE    +   N N P AEI     N DDP   I       +PAE+   IT RMGGYRVQ +IEK LTATDM+ +Q R+ LP+K  ++
Subjt:  HPLSAIPPNNAYDPEPKTPTSEENRNDPDAEIE----NGDDPAAGI-------LPAEITAVITGRMGGYRVQKVIEKALTATDMKKDQARMLLPRKRAKM

Query:  EFLTAAEVECLEDRGGG---LNYMDTVIVGPDLVETPIRFKMWLMGTSFSGCLTKEWNSFAKTYGLKVGNKVRVWSFRVDRRDDQNVSNLNCALRFLISK
        EF+TA E + LE R  G   LNY+DT IVG DL   PIRFK WLMG ++  CL  +WNSFA+ YGLK G KVRVWSFRVD RD+ +V  +   LRF + K
Subjt:  EFLTAAEVECLEDRGGG---LNYMDTVIVGPDLVETPIRFKMWLMGTSFSGCLTKEWNSFAKTYGLKVGNKVRVWSFRVDRRDDQNVSNLNCALRFLISK

Query:  I
        I
Subjt:  I

SwissProt top hitse value%identityAlignment
Q5RM09 B3 domain-containing protein At1g059201.2e-0429.84Show/hide
Query:  ITGRMGGYRVQK-VIEKALTATDMKKDQARMLLPRKRA-KMEFLTAAEVECLED---RGGGLNYMDTVIVGPDLVETPIRFKMWLMGTS-----FSGCLT
        +  RM G    K +IEK L + D+   Q R+ +P     + +FLT  E   +++      G   +   +V     +  + FK W M T      +S  L 
Subjt:  ITGRMGGYRVQK-VIEKALTATDMKKDQARMLLPRKRA-KMEFLTAAEVECLED---RGGGLNYMDTVIVGPDLVETPIRFKMWLMGTS-----FSGCLT

Query:  KEWNSFAKTYGLKVGNKVRVWSFR
         EW++  +T GLK G+K+ +WSFR
Subjt:  KEWNSFAKTYGLKVGNKVRVWSFR

Q9FFX2 B3 domain-containing protein At5g384904.8e-0629.82Show/hide
Query:  VIEKALTATDMKKDQARMLLP-RKRAKMEFLTAAEVECLE-----DRGGGLNY-MDTVIVGPDLVETPIRFKMWLMGT-----SFSGCLTKEWNSFAKTY
        + E+ L  +D+  + +R+L+P +K  + +FLT AE   ++     D     N  + TV+V     +  +RFK+W M       + +  L   WN   K+ 
Subjt:  VIEKALTATDMKKDQARMLLP-RKRAKMEFLTAAEVECLE-----DRGGGLNY-MDTVIVGPDLVETPIRFKMWLMGT-----SFSGCLTKEWNSFAKTY

Query:  GLKVGNKVRVWSFR
         LKVG+K+ +W+FR
Subjt:  GLKVGNKVRVWSFR

Q9SCJ8 Putative B3 domain-containing protein At3g496102.2e-0629.66Show/hide
Query:  VQKVIEKALTATDMKKDQARMLLP-RKRAKMEFLTAAEVECL------EDRGGGLNYMDTVIVGPDLVETPIRFKMWLMGTSFSG------CLTKEWNSF
        ++ + EK LTATD+K  ++R+L+P  K  + +FLT  E   +      E+       + T+IV     E  +RF +W+M    SG       L + WN  
Subjt:  VQKVIEKALTATDMKKDQARMLLP-RKRAKMEFLTAAEVECL------EDRGGGLNYMDTVIVGPDLVETPIRFKMWLMGTSFSG------CLTKEWNSF

Query:  AKTYGLKVGNKVRVWSFR
             LK  + + +W+FR
Subjt:  AKTYGLKVGNKVRVWSFR

Arabidopsis top hitse value%identityAlignment
AT1G05615.1 Domain of unknown function (DUF313)6.1e-0431.65Show/hide
Query:  EFLTAAEVECLEDRGGGLNY--MDTVIVGPDLVETPIRFKMWLMGTSFSGCLTKEWNSFAKTYGLKVGNKVRVWSFRVD
        EFL   E   +E+         +D ++V  D  E  +  + W MGTS    L     +  K   LK GN++R+WSF  D
Subjt:  EFLTAAEVECLEDRGGGLNY--MDTVIVGPDLVETPIRFKMWLMGTSFSGCLTKEWNSFAKTYGLKVGNKVRVWSFRVD

AT1G05920.1 Domain of unknown function (DUF313)8.5e-0629.84Show/hide
Query:  ITGRMGGYRVQK-VIEKALTATDMKKDQARMLLPRKRA-KMEFLTAAEVECLED---RGGGLNYMDTVIVGPDLVETPIRFKMWLMGTS-----FSGCLT
        +  RM G    K +IEK L + D+   Q R+ +P     + +FLT  E   +++      G   +   +V     +  + FK W M T      +S  L 
Subjt:  ITGRMGGYRVQK-VIEKALTATDMKKDQARMLLPRKRA-KMEFLTAAEVECLED---RGGGLNYMDTVIVGPDLVETPIRFKMWLMGTS-----FSGCLT

Query:  KEWNSFAKTYGLKVGNKVRVWSFR
         EW++  +T GLK G+K+ +WSFR
Subjt:  KEWNSFAKTYGLKVGNKVRVWSFR

AT2G27410.1 Domain of unknown function (DUF313)6.1e-0425Show/hide
Query:  PNNAYDPEP----------KTPTSEENRNDPDAEI--ENGDDPAAGILPAEITAVITGRMGGYRVQKVIEKALTATDMKKDQARMLLPRKRAKM-EFLTA
        PN   +P P          K P + E R+    +I     ++P     P  +  V+     GY  + +  + L  TD+KK +AR+ +P K+ K  +FLT 
Subjt:  PNNAYDPEP----------KTPTSEENRNDPDAEI--ENGDDPAAGILPAEITAVITGRMGGYRVQKVIEKALTATDMKKDQARMLLPRKRAKM-EFLTA

Query:  AEVEC-------LEDRGGGLNYMDTVIVGPDLVETPIRFKMWLMGTSFSGCLTKEWNSF---AKTYGLKVGNKVRVWSFR
         E          + D G  +N++D     P+L +  +  + W M  ++     K W +     KT   K  +   +WSFR
Subjt:  AEVEC-------LEDRGGGLNYMDTVIVGPDLVETPIRFKMWLMGTSFSGCLTKEWNSF---AKTYGLKVGNKVRVWSFR

AT3G49610.1 Domain of unknown function (DUF313)1.5e-0729.66Show/hide
Query:  VQKVIEKALTATDMKKDQARMLLP-RKRAKMEFLTAAEVECL------EDRGGGLNYMDTVIVGPDLVETPIRFKMWLMGTSFSG------CLTKEWNSF
        ++ + EK LTATD+K  ++R+L+P  K  + +FLT  E   +      E+       + T+IV     E  +RF +W+M    SG       L + WN  
Subjt:  VQKVIEKALTATDMKKDQARMLLP-RKRAKMEFLTAAEVECL------EDRGGGLNYMDTVIVGPDLVETPIRFKMWLMGTSFSG------CLTKEWNSF

Query:  AKTYGLKVGNKVRVWSFR
             LK  + + +W+FR
Subjt:  AKTYGLKVGNKVRVWSFR

AT5G38490.1 Domain of unknown function (DUF313)3.4e-0729.82Show/hide
Query:  VIEKALTATDMKKDQARMLLP-RKRAKMEFLTAAEVECLE-----DRGGGLNY-MDTVIVGPDLVETPIRFKMWLMGT-----SFSGCLTKEWNSFAKTY
        + E+ L  +D+  + +R+L+P +K  + +FLT AE   ++     D     N  + TV+V     +  +RFK+W M       + +  L   WN   K+ 
Subjt:  VIEKALTATDMKKDQARMLLP-RKRAKMEFLTAAEVECLE-----DRGGGLNY-MDTVIVGPDLVETPIRFKMWLMGT-----SFSGCLTKEWNSFAKTY

Query:  GLKVGNKVRVWSFR
         LKVG+K+ +W+FR
Subjt:  GLKVGNKVRVWSFR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTACACCAACTGCCCCGGCCCCCACACCCACCCAACCCAAACCCAATACCTGGCGGCGGACACGACGGCGCCGCCAATTGACGGCCACCCACTTTCTGCAATTCCGCC
AAATAACGCCTACGACCCAGAACCCAAAACCCCCACTTCGGAGGAAAACAGAAACGACCCAGATGCCGAAATTGAAAACGGCGACGACCCAGCTGCCGGAATACTGCCGG
CGGAAATTACGGCGGTAATAACGGGAAGAATGGGCGGTTACAGGGTGCAGAAGGTAATCGAGAAGGCGTTGACGGCGACGGACATGAAGAAGGATCAGGCGAGGATGTTG
CTGCCGCGGAAGAGGGCGAAAATGGAGTTTCTGACGGCGGCGGAGGTGGAGTGTCTGGAGGATCGCGGCGGCGGGTTGAATTACATGGACACGGTGATTGTTGGGCCGGA
TCTGGTAGAGACCCCAATTCGGTTCAAAATGTGGCTAATGGGGACTTCGTTTTCAGGCTGTTTGACAAAGGAATGGAATTCTTTTGCCAAAACTTATGGACTTAAGGTTG
GGAATAAGGTTAGGGTTTGGAGTTTCAGGGTCGATCGGAGAGATGATCAAAATGTGTCAAATCTCAATTGTGCTTTGCGCTTCTTAATCTCTAAGATAAAAAGAAGAAGA
GAGTTAACTATCATTGGACTAATTTTAAATTTGAATATATTCTTAAAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGTACACCAACTGCCCCGGCCCCCACACCCACCCAACCCAAACCCAATACCTGGCGGCGGACACGACGGCGCCGCCAATTGACGGCCACCCACTTTCTGCAATTCCGCC
AAATAACGCCTACGACCCAGAACCCAAAACCCCCACTTCGGAGGAAAACAGAAACGACCCAGATGCCGAAATTGAAAACGGCGACGACCCAGCTGCCGGAATACTGCCGG
CGGAAATTACGGCGGTAATAACGGGAAGAATGGGCGGTTACAGGGTGCAGAAGGTAATCGAGAAGGCGTTGACGGCGACGGACATGAAGAAGGATCAGGCGAGGATGTTG
CTGCCGCGGAAGAGGGCGAAAATGGAGTTTCTGACGGCGGCGGAGGTGGAGTGTCTGGAGGATCGCGGCGGCGGGTTGAATTACATGGACACGGTGATTGTTGGGCCGGA
TCTGGTAGAGACCCCAATTCGGTTCAAAATGTGGCTAATGGGGACTTCGTTTTCAGGCTGTTTGACAAAGGAATGGAATTCTTTTGCCAAAACTTATGGACTTAAGGTTG
GGAATAAGGTTAGGGTTTGGAGTTTCAGGGTCGATCGGAGAGATGATCAAAATGTGTCAAATCTCAATTGTGCTTTGCGCTTCTTAATCTCTAAGATAAAAAGAAGAAGA
GAGTTAACTATCATTGGACTAATTTTAAATTTGAATATATTCTTAAAGTGA
Protein sequenceShow/hide protein sequence
MYTNCPGPHTHPTQTQYLAADTTAPPIDGHPLSAIPPNNAYDPEPKTPTSEENRNDPDAEIENGDDPAAGILPAEITAVITGRMGGYRVQKVIEKALTATDMKKDQARML
LPRKRAKMEFLTAAEVECLEDRGGGLNYMDTVIVGPDLVETPIRFKMWLMGTSFSGCLTKEWNSFAKTYGLKVGNKVRVWSFRVDRRDDQNVSNLNCALRFLISKIKRRR
ELTIIGLILNLNIFLK