; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC09g0251 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC09g0251
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionAvr9/Cf-9 rapidly elicited protein
Genome locationMC09:2324794..2325417
RNA-Seq ExpressionMC09g0251
SyntenyMC09g0251
Gene Ontology termsNA
InterPro domainsIPR008480 - Protein of unknown function DUF761, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6593486.1 hypothetical protein SDJN03_12962, partial [Cucurbita argyrosperma subsp. sororia]2.78e-9078.26Show/hide
Query:  LDLNLMMKRGKIAGKAISNLMFHHHH----GGASPSAAA----TGDLPFTVGADEYEFSCSNSPAFPGFHVGKHRRRNQHHNSFFACAHAPETLDDDAAA
        LDLNLM KRGK+AGKAISNLMFHHH+      ASPS+++     G LPF +GADEYEFSCSNSPAFP FHVGK RRRNQ+HNSFFACAHAP+TLDDDAAA
Subjt:  LDLNLMMKRGKIAGKAISNLMFHHHH----GGASPSAAA----TGDLPFTVGADEYEFSCSNSPAFPGFHVGKHRRRNQHHNSFFACAHAPETLDDDAAA

Query:  TANAVKAVIEILNNHNGGASSPA-QASPALPGFGRTPRRVRQLRITDSPFPLQDANADPQVDKAADEFISRFYKELRLQTKADD
          NAV AV+EILNNH G +S+P   ASPALPGFG TPRRVRQLRITDSPFPLQDANADP VDKAADEFISRFYKELRLQ  AD+
Subjt:  TANAVKAVIEILNNHNGGASSPA-QASPALPGFGRTPRRVRQLRITDSPFPLQDANADPQVDKAADEFISRFYKELRLQTKADD

XP_022964083.1 uncharacterized protein LOC111464220 [Cucurbita moschata]9.10e-10881.16Show/hide
Query:  MAKKAWNLVRVVFFLLRKGISKSKLILDLNLMMKRGKIAGKAISNLMFHHHH--GGASPSAA---ATGDLPFTVGADEYEFSCSNSPAFPGFHVGKHRRR
        MAKK WNLVRVV+FLLRKGISKSKL+LDLNLM KRGK+AGKAISNLMFHHH+    ASPS++   + G LPF VGADEYEFSCSNSPAFP FHVGK RRR
Subjt:  MAKKAWNLVRVVFFLLRKGISKSKLILDLNLMMKRGKIAGKAISNLMFHHHH--GGASPSAA---ATGDLPFTVGADEYEFSCSNSPAFPGFHVGKHRRR

Query:  NQHHNSFFACAHAPETLDDDAAATANAVKAVIEILNNHNGGASSPA-QASPALPGFGRTPRRVRQLRITDSPFPLQDANADPQVDKAADEFISRFYKELR
        NQ+HNSFFACAHAP+TLDDDAAA  NAV AV+EILNNH G +S+P   ASPALPGFG TPRRVRQLRITDSPFPLQDANADP VDKAADEFISRFYKELR
Subjt:  NQHHNSFFACAHAPETLDDDAAATANAVKAVIEILNNHNGGASSPA-QASPALPGFGRTPRRVRQLRITDSPFPLQDANADPQVDKAADEFISRFYKELR

Query:  LQTKADD
        LQ  AD+
Subjt:  LQTKADD

XP_022991792.1 uncharacterized protein LOC111488328 [Cucurbita maxima]2.80e-8869.67Show/hide
Query:  MENNLPVMAKKAWNLVRVVFFLLRKGISKSKLILDLNLMMKRGKIAGKAISNLMFHHH-HGGASPSAAATGDLPFTVGADEYEFSCSNSPAFPGFHVGKH
        MENNLPV++K+ W LVRV +FLLRKGISKSKLILDLNLMMKRGKIAGKAI+NLMFHHH HGGA+PS++A   LP  VG D+YEF+CS+SPAFP  H    
Subjt:  MENNLPVMAKKAWNLVRVVFFLLRKGISKSKLILDLNLMMKRGKIAGKAISNLMFHHH-HGGASPSAAATGDLPFTVGADEYEFSCSNSPAFPGFHVGKH

Query:  --RRRNQHHNSFFACAHAPETLDDDAAATANAVKAVIEILNNHNGGASSPAQASPALPGFGRTPRRVRQLRITDSPFPLQDANADPQVDKAADEFISRFY
          RRRNQ+H SFFACAHAP+TLDDDAAA    VKA +EI N H+G ASSP+  S +          VRQLRITDSPFPL DANAD  VDKAADE+ISRFY
Subjt:  --RRRNQHHNSFFACAHAPETLDDDAAATANAVKAVIEILNNHNGGASSPAQASPALPGFGRTPRRVRQLRITDSPFPLQDANADPQVDKAADEFISRFY

Query:  KELRLQTKADD
        KELRLQ  AD+
Subjt:  KELRLQTKADD

XP_023000392.1 uncharacterized protein LOC111494650 [Cucurbita maxima]8.43e-11781.69Show/hide
Query:  MENNLPVMAKKAWNLVRVVFFLLRKGISKSKLILDLNLMMKRGKIAGKAISNLMFHHH-HGGASPSAA---ATGDLPFTVGADEYEFSCSNSPAFPGFHV
        MENNLP+MAKK WNLVRV +FLLRKGISKSKL+LDLNLM KRGK+AGKAISNLMFHHH H  ASPS++   + G LPF +GADEYEFSCSNSPAFP FHV
Subjt:  MENNLPVMAKKAWNLVRVVFFLLRKGISKSKLILDLNLMMKRGKIAGKAISNLMFHHH-HGGASPSAA---ATGDLPFTVGADEYEFSCSNSPAFPGFHV

Query:  GKHRRRNQHHNSFFACAHAPETLDDDAAATANAVKAVIEILNNHNGGASSP-AQASPALPGFGRTPRRVRQLRITDSPFPLQDANADPQVDKAADEFISR
        GK RRRNQ+HNSFFACAHAP+TLDDDAAA  NAV AV+EILNNH+G +S+P   ASPALPGFGRTPRRVRQLRITDSPFPLQDANADP VDKAADEFISR
Subjt:  GKHRRRNQHHNSFFACAHAPETLDDDAAATANAVKAVIEILNNHNGGASSP-AQASPALPGFGRTPRRVRQLRITDSPFPLQDANADPQVDKAADEFISR

Query:  FYKELRLQTKADD
        FYKELRLQ  AD+
Subjt:  FYKELRLQTKADD

XP_023514516.1 uncharacterized protein LOC111778773 [Cucurbita pepo subsp. pepo]9.75e-10879.9Show/hide
Query:  MAKKAWNLVRVVFFLLRKGISKSKLILDLNLMMKRGKIAGKAISNLMFHHHH---GGASPSAAA----TGDLPFTVGADEYEFSCSNSPAFPGFHVGKHR
        MAKK WNLVRV +FLLRKGISKSKL+LDLNLM KRGK+AGKAISNLMFHHH+     ASPS+++    TG LPF +GADEYEFSCSNSPAFP FHVGK R
Subjt:  MAKKAWNLVRVVFFLLRKGISKSKLILDLNLMMKRGKIAGKAISNLMFHHHH---GGASPSAAA----TGDLPFTVGADEYEFSCSNSPAFPGFHVGKHR

Query:  RRNQHHNSFFACAHAPETLDDDAAATANAVKAVIEILNNHNGGASSPA-QASPALPGFGRTPRRVRQLRITDSPFPLQDANADPQVDKAADEFISRFYKE
        RRNQ+HN FFACAHAP+TLDDDAAA  NAV AV+EILNNH G +S+P   ASPALPGFGRTPRRVRQLRITDSPFPLQDANADP VDKAADEFISRFYKE
Subjt:  RRNQHHNSFFACAHAPETLDDDAAATANAVKAVIEILNNHNGGASSPA-QASPALPGFGRTPRRVRQLRITDSPFPLQDANADPQVDKAADEFISRFYKE

Query:  LRLQTKADD
        LRLQ  AD+
Subjt:  LRLQTKADD

TrEMBL top hitse value%identityAlignment
A0A0A0K800 Uncharacterized protein1.56e-8068.14Show/hide
Query:  MENNLPVMAKKAWNLVRVVFFLLRKGISKSKL-ILDLNLMMKRGKIAGKAISNLMFHHHHGGASPSAAATGDLPFTVGADEYEFSCSNSPAFPGFHVGKH
        ME+N+PV+AKK WNLVRV +FLLRKGISKSK+ +LDLNLMMKRGKIAGKAISNLMF HH+  A         LPF V AD+YEFSCSN+P++  F  GK 
Subjt:  MENNLPVMAKKAWNLVRVVFFLLRKGISKSKL-ILDLNLMMKRGKIAGKAISNLMFHHHHGGASPSAAATGDLPFTVGADEYEFSCSNSPAFPGFHVGKH

Query:  RRRNQHHNSFFACAHAPETLDDDAAATANAVKAVIEILNNHNGGASSPAQASPALPGFGRTPRRVRQLRITDSPFPLQDANADPQVDKAADEFISRFYKE
        RR N +HNSFFACAHAP+TLDDD   T NA+KAV++ILNN N     P+  SPA       P  VRQLRITDSPFPLQD NADP VDKAADEFISRFYKE
Subjt:  RRRNQHHNSFFACAHAPETLDDDAAATANAVKAVIEILNNHNGGASSPAQASPALPGFGRTPRRVRQLRITDSPFPLQDANADPQVDKAADEFISRFYKE

Query:  LRLQ
        L LQ
Subjt:  LRLQ

A0A6J1GPL9 uncharacterized protein LOC1114563342.20e-8870.28Show/hide
Query:  MENNLPVMAKKAWNLVRVVFFLLRKGISKSKLILDLNLMMKRGKIAGKAISNLMFHHH-HGGASPSAAATGDLPFTVGADEYEFSCSNSPAFPGFHVGKH
        MENNLPV++K+ W LVRV +FLLRKGISKSKLILDLNLMMKRGKIAGKAI+NLMFHHH HGGA+PS++A   LP  VG D+YEF+CS+SPAFP  H    
Subjt:  MENNLPVMAKKAWNLVRVVFFLLRKGISKSKLILDLNLMMKRGKIAGKAISNLMFHHH-HGGASPSAAATGDLPFTVGADEYEFSCSNSPAFPGFHVGKH

Query:  --RRRNQHH-NSFFACAHAPETLDDDAAATANAVKAVIEILNNHNGGASSPAQASPALPGFGRTPRRVRQLRITDSPFPLQDANADPQVDKAADEFISRF
          RRRNQ+H +SFFACAHAP TLDDDAAA  NAVKA +EI N H+G ASSP+  S +          VRQLRITDSPFPL DANAD  VDKAADE+ISRF
Subjt:  --RRRNQHH-NSFFACAHAPETLDDDAAATANAVKAVIEILNNHNGGASSPAQASPALPGFGRTPRRVRQLRITDSPFPLQDANADPQVDKAADEFISRF

Query:  YKELRLQTKADD
        YKELRLQ  AD+
Subjt:  YKELRLQTKADD

A0A6J1HJS6 uncharacterized protein LOC1114642204.40e-10881.16Show/hide
Query:  MAKKAWNLVRVVFFLLRKGISKSKLILDLNLMMKRGKIAGKAISNLMFHHHH--GGASPSAA---ATGDLPFTVGADEYEFSCSNSPAFPGFHVGKHRRR
        MAKK WNLVRVV+FLLRKGISKSKL+LDLNLM KRGK+AGKAISNLMFHHH+    ASPS++   + G LPF VGADEYEFSCSNSPAFP FHVGK RRR
Subjt:  MAKKAWNLVRVVFFLLRKGISKSKLILDLNLMMKRGKIAGKAISNLMFHHHH--GGASPSAA---ATGDLPFTVGADEYEFSCSNSPAFPGFHVGKHRRR

Query:  NQHHNSFFACAHAPETLDDDAAATANAVKAVIEILNNHNGGASSPA-QASPALPGFGRTPRRVRQLRITDSPFPLQDANADPQVDKAADEFISRFYKELR
        NQ+HNSFFACAHAP+TLDDDAAA  NAV AV+EILNNH G +S+P   ASPALPGFG TPRRVRQLRITDSPFPLQDANADP VDKAADEFISRFYKELR
Subjt:  NQHHNSFFACAHAPETLDDDAAATANAVKAVIEILNNHNGGASSPA-QASPALPGFGRTPRRVRQLRITDSPFPLQDANADPQVDKAADEFISRFYKELR

Query:  LQTKADD
        LQ  AD+
Subjt:  LQTKADD

A0A6J1JMV1 uncharacterized protein LOC1114883281.35e-8869.67Show/hide
Query:  MENNLPVMAKKAWNLVRVVFFLLRKGISKSKLILDLNLMMKRGKIAGKAISNLMFHHH-HGGASPSAAATGDLPFTVGADEYEFSCSNSPAFPGFHVGKH
        MENNLPV++K+ W LVRV +FLLRKGISKSKLILDLNLMMKRGKIAGKAI+NLMFHHH HGGA+PS++A   LP  VG D+YEF+CS+SPAFP  H    
Subjt:  MENNLPVMAKKAWNLVRVVFFLLRKGISKSKLILDLNLMMKRGKIAGKAISNLMFHHH-HGGASPSAAATGDLPFTVGADEYEFSCSNSPAFPGFHVGKH

Query:  --RRRNQHHNSFFACAHAPETLDDDAAATANAVKAVIEILNNHNGGASSPAQASPALPGFGRTPRRVRQLRITDSPFPLQDANADPQVDKAADEFISRFY
          RRRNQ+H SFFACAHAP+TLDDDAAA    VKA +EI N H+G ASSP+  S +          VRQLRITDSPFPL DANAD  VDKAADE+ISRFY
Subjt:  --RRRNQHHNSFFACAHAPETLDDDAAATANAVKAVIEILNNHNGGASSPAQASPALPGFGRTPRRVRQLRITDSPFPLQDANADPQVDKAADEFISRFY

Query:  KELRLQTKADD
        KELRLQ  AD+
Subjt:  KELRLQTKADD

A0A6J1KDI8 uncharacterized protein LOC1114946504.08e-11781.69Show/hide
Query:  MENNLPVMAKKAWNLVRVVFFLLRKGISKSKLILDLNLMMKRGKIAGKAISNLMFHHH-HGGASPSAA---ATGDLPFTVGADEYEFSCSNSPAFPGFHV
        MENNLP+MAKK WNLVRV +FLLRKGISKSKL+LDLNLM KRGK+AGKAISNLMFHHH H  ASPS++   + G LPF +GADEYEFSCSNSPAFP FHV
Subjt:  MENNLPVMAKKAWNLVRVVFFLLRKGISKSKLILDLNLMMKRGKIAGKAISNLMFHHH-HGGASPSAA---ATGDLPFTVGADEYEFSCSNSPAFPGFHV

Query:  GKHRRRNQHHNSFFACAHAPETLDDDAAATANAVKAVIEILNNHNGGASSP-AQASPALPGFGRTPRRVRQLRITDSPFPLQDANADPQVDKAADEFISR
        GK RRRNQ+HNSFFACAHAP+TLDDDAAA  NAV AV+EILNNH+G +S+P   ASPALPGFGRTPRRVRQLRITDSPFPLQDANADP VDKAADEFISR
Subjt:  GKHRRRNQHHNSFFACAHAPETLDDDAAATANAVKAVIEILNNHNGGASSP-AQASPALPGFGRTPRRVRQLRITDSPFPLQDANADPQVDKAADEFISR

Query:  FYKELRLQTKADD
        FYKELRLQ  AD+
Subjt:  FYKELRLQTKADD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G52140.1 unknown protein5.1e-3945.87Show/hide
Query:  MENNLPVMAKKAWNLVRVVFFLLRKGISKSKLILDLNLMMKRGKIAGKAISNLMFHHH---HGGASPSAAATGDLPFTVGADEYEFSCSNSP--AFPGFH
        M+ N+P+ +KK WN+VR + +++RKG+SK+KLI D N  +KRGK       NLMFH     H G++ SAA            EYEFSCSN+P  +FP  +
Subjt:  MENNLPVMAKKAWNLVRVVFFLLRKGISKSKLILDLNLMMKRGKIAGKAISNLMFHHH---HGGASPSAAATGDLPFTVGADEYEFSCSNSP--AFPGFH

Query:  VGKHRRRNQHHNSFFACAHAPETLDDDAAATANAVKAVIEILNN-HNGGASSPAQ----ASPALPGFGRTPRRVRQLRITDSPFPLQDANAD---PQVDK
        +   R+++  HN+ F C   P+TLDDD A    A +AV+E+LN     G  +PA      SP  PGFG+TP  VR LR+TDSPFPL   N D     VDK
Subjt:  VGKHRRRNQHHNSFFACAHAPETLDDDAAATANAVKAVIEILNN-HNGGASSPAQ----ASPALPGFGRTPRRVRQLRITDSPFPLQDANAD---PQVDK

Query:  AADEFISRFYKELRLQTK
        AAD+FI +FYK L  Q K
Subjt:  AADEFISRFYKELRLQTK

AT3G16330.1 unknown protein2.4e-3644.29Show/hide
Query:  MENNLPVMAKKAWNLVRVVFFLLRKGISKSKLILDLNLMMKRGKIAGKAISNLMFHHHHGGASPSAAATGDLPFTVGADEYEFSCSNSP--AFPGFHVGK
        ME N+ + +KK  N+VR V ++L KGISK KL+ D N  +KRGK       NLMFH+       + A+          +EYEFSCS++P   FP F++  
Subjt:  MENNLPVMAKKAWNLVRVVFFLLRKGISKSKLILDLNLMMKRGKIAGKAISNLMFHHHHGGASPSAAATGDLPFTVGADEYEFSCSNSP--AFPGFHVGK

Query:  HRRRNQHHNSFFACAHAPETLDDDAAATANAVKAVIEILN---NHNGGASSPA-------QASPALPGFGRTPRRVRQLRITDSPFPLQDAN--ADPQVD
         ++++ HHNS F+C  AP TLDDD + +    +AV+E+LN   +H+ G+++PA         SP LPGFGR+   VR LR+TDSPFPL++    A+  VD
Subjt:  HRRRNQHHNSFFACAHAPETLDDDAAATANAVKAVIEILN---NHNGGASSPA-------QASPALPGFGRTPRRVRQLRITDSPFPLQDAN--ADPQVD

Query:  KAADEFISRFYKELRLQTK
        KAADEFI +FYK L  Q K
Subjt:  KAADEFISRFYKELRLQTK

AT4G29110.1 unknown protein5.7e-2238.46Show/hide
Query:  MENNLPVMAKKAWNLVRVVFFLLRKGISKSKLILDLNLMMKRGKIAGKAISNLMFHHHHGGASPSAAATGDLPFTVGADEYEFSCSNSPAFPGFHVGKHR
        ME N  V AK+ W +VR+VF +L+ G  K+KL+LDLNLM+KRG    KAI+NL           S+  + D+  +    +Y+         P   + K +
Subjt:  MENNLPVMAKKAWNLVRVVFFLLRKGISKSKLILDLNLMMKRGKIAGKAISNLMFHHHHGGASPSAAATGDLPFTVGADEYEFSCSNSPAFPGFHVGKHR

Query:  RRNQHHNSFFACAHAPETLDDDAAATANAVKAVIEIL--NNHNGGASSPAQASPALPGFGRTPRRVRQLRITDSPFPLQD-ANADPQVDKAADEFISRFY
        RR   H  +          D++  A   AVK V E+L  N+    A+  A+ SP +         VRQLR+TDSPFPL D  + D  VDKAA+EFI +FY
Subjt:  RRNQHHNSFFACAHAPETLDDDAAATANAVKAVIEIL--NNHNGGASSPAQASPALPGFGRTPRRVRQLRITDSPFPLQD-ANADPQVDKAADEFISRFY

Query:  KELRLQTK
        K L+LQ K
Subjt:  KELRLQTK

AT4G32860.1 unknown protein3.1e-0428.57Show/hide
Query:  MENNLPVMAKKAWNLVRVVFFLLRK--GISKSKLI--LDLNLMMKRGKIAGKAISN-LMFHHHHGGASPS------AAATGDLPFTVGADEYEFSCSNSP
        ME    V  KK  +L +++ F ++K    S+ KL+  LD +L+ KRGKI  K+++  +   H      PS      ++    +P  +   EYEFSCS++P
Subjt:  MENNLPVMAKKAWNLVRVVFFLLRK--GISKSKLI--LDLNLMMKRGKIAGKAISN-LMFHHHHGGASPS------AAATGDLPFTVGADEYEFSCSNSP

Query:  AFPGF--HVGKHRRRNQHHNSFFACAHAPETLDDDAAATANAVKAVIE-ILNNHNGGASSPAQASPALPGFGRTPRRVRQLRITDSPFPLQDANADPQVD
            +   V K RR N  HN               A    N +  V + I + H   A  P  AS                    S   ++  +    VD
Subjt:  AFPGF--HVGKHRRRNQHHNSFFACAHAPETLDDDAAATANAVKAVIE-ILNNHNGGASSPAQASPALPGFGRTPRRVRQLRITDSPFPLQDANADPQVD

Query:  KAADEFISRFYKELRLQ
        +AA+EFI  FY++LRLQ
Subjt:  KAADEFISRFYKELRLQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAAATAATCTTCCTGTGATGGCCAAAAAAGCCTGGAATTTAGTTCGCGTGGTTTTTTTCTTGCTCAGAAAAGGCATCTCCAAGAGCAAGCTCATCCTCGACCTCAA
TCTCATGATGAAGCGCGGCAAAATCGCCGGAAAAGCCATCAGCAATCTCATGTTCCACCACCACCACGGCGGCGCCTCTCCCTCCGCCGCCGCCACCGGCGACCTCCCCT
TCACCGTCGGCGCCGACGAGTACGAATTCAGCTGTAGCAACAGCCCCGCCTTCCCCGGCTTCCACGTCGGCAAGCACCGCCGACGCAACCAACACCACAACTCCTTCTTC
GCGTGTGCTCACGCACCCGAAACCCTCGACGACGACGCCGCCGCCACCGCGAACGCCGTCAAGGCCGTAATTGAGATCCTGAACAACCACAACGGCGGCGCGTCGTCTCC
GGCGCAGGCCTCCCCGGCCCTCCCCGGCTTCGGCCGGACTCCGAGGAGGGTCCGGCAGCTCAGGATAACGGACTCGCCGTTCCCTCTCCAAGACGCTAACGCCGATCCGC
AAGTCGACAAAGCCGCCGACGAATTCATCTCCAGGTTTTACAAGGAGCTCAGGCTCCAGACCAAGGCCGACGAC
mRNA sequenceShow/hide mRNA sequence
ATGGAAAATAATCTTCCTGTGATGGCCAAAAAAGCCTGGAATTTAGTTCGCGTGGTTTTTTTCTTGCTCAGAAAAGGCATCTCCAAGAGCAAGCTCATCCTCGACCTCAA
TCTCATGATGAAGCGCGGCAAAATCGCCGGAAAAGCCATCAGCAATCTCATGTTCCACCACCACCACGGCGGCGCCTCTCCCTCCGCCGCCGCCACCGGCGACCTCCCCT
TCACCGTCGGCGCCGACGAGTACGAATTCAGCTGTAGCAACAGCCCCGCCTTCCCCGGCTTCCACGTCGGCAAGCACCGCCGACGCAACCAACACCACAACTCCTTCTTC
GCGTGTGCTCACGCACCCGAAACCCTCGACGACGACGCCGCCGCCACCGCGAACGCCGTCAAGGCCGTAATTGAGATCCTGAACAACCACAACGGCGGCGCGTCGTCTCC
GGCGCAGGCCTCCCCGGCCCTCCCCGGCTTCGGCCGGACTCCGAGGAGGGTCCGGCAGCTCAGGATAACGGACTCGCCGTTCCCTCTCCAAGACGCTAACGCCGATCCGC
AAGTCGACAAAGCCGCCGACGAATTCATCTCCAGGTTTTACAAGGAGCTCAGGCTCCAGACCAAGGCCGACGAC
Protein sequenceShow/hide protein sequence
MENNLPVMAKKAWNLVRVVFFLLRKGISKSKLILDLNLMMKRGKIAGKAISNLMFHHHHGGASPSAAATGDLPFTVGADEYEFSCSNSPAFPGFHVGKHRRRNQHHNSFF
ACAHAPETLDDDAAATANAVKAVIEILNNHNGGASSPAQASPALPGFGRTPRRVRQLRITDSPFPLQDANADPQVDKAADEFISRFYKELRLQTKADD