CuGenDBv2

MS009522 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS009522
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: Avr9/Cf-9 rapidly elicited protein
Genome location: scaffold813:1880731..1881357
RNA-Seq Expression: MS009522
Synteny: MS009522
Gene Ontology terms: NA
InterPro domains: IPR008480 - Protein of unknown function DUF761, plant
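
The genome location above follows a `scaffold:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this), the span length could be derived as follows. The helper name `parse_location` is hypothetical and not part of CuGenDB.

```python
import re

def parse_location(location: str) -> tuple[str, int, int]:
    """Split 'scaffold:start..end' into (scaffold, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    scaffold, start, end = match.groups()
    return scaffold, int(start), int(end)

scaffold, start, end = parse_location("scaffold813:1880731..1881357")
# Assumption: coordinates are 1-based and inclusive, so add 1 to the difference.
span = end - start + 1
print(scaffold, span)
```

Under that assumption the span is 627 bp, which is consistent with the 209-residue protein sequence listed on this page (209 × 3 = 627).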


Homology
GenBank top hits (e value, % identity, alignment)
KAG6593486.1 hypothetical protein SDJN03_12962, partial [Cucurbita argyrosperma subsp. sororia] | e value: 7.6e-69 | % identity: 77.84
Query:  ILDLNLMMKRGKIAGKAISNLMFHHHH----GGASPSAA----ATGDLPFTVGADEYEFSCSNSPAFPGFHVGKHRRRNQHHNSFFACAHAPETLDDDAA
        +LDLNLM KRGK+AGKAISNLMFHHH+      ASPS++    + G LPF +GADEYEFSCSNSPAFP FHVGK RRRNQ+HNSFFACAHAP+TLDDDAA
Subjt:  ILDLNLMMKRGKIAGKAISNLMFHHHH----GGASPSAA----ATGDLPFTVGADEYEFSCSNSPAFPGFHVGKHRRRNQHHNSFFACAHAPETLDDDAA

Query:  ATANAVKAVIEILNNHNGGASS-PAQASPALPGFGRTPRRVRQLRITDSPFPLQDANADPQVDKAADEFISRFYKELRLQTKADD
        A  NAV AV+EILNNH G +S+ P  ASPALPGFG TPRRVRQLRITDSPFPLQDANADP VDKAADEFISRFYKELRLQ  AD+
Subjt:  ATANAVKAVIEILNNHNGGASS-PAQASPALPGFGRTPRRVRQLRITDSPFPLQDANADPQVDKAADEFISRFYKELRLQTKADD

XP_022964083.1 uncharacterized protein LOC111464220 [Cucurbita moschata] | e value: 1.6e-82 | % identity: 81.16
Query:  MAKKAWNLVRVVFFLLRKGISKSKLILDLNLMMKRGKIAGKAISNLMFHHHH--GGASPSAA---ATGDLPFTVGADEYEFSCSNSPAFPGFHVGKHRRR
        MAKK WNLVRVV+FLLRKGISKSKL+LDLNLM KRGK+AGKAISNLMFHHH+    ASPS++   + G LPF VGADEYEFSCSNSPAFP FHVGK RRR
Subjt:  MAKKAWNLVRVVFFLLRKGISKSKLILDLNLMMKRGKIAGKAISNLMFHHHH--GGASPSAA---ATGDLPFTVGADEYEFSCSNSPAFPGFHVGKHRRR

Query:  NQHHNSFFACAHAPETLDDDAAATANAVKAVIEILNNHNGGASS-PAQASPALPGFGRTPRRVRQLRITDSPFPLQDANADPQVDKAADEFISRFYKELR
        NQ+HNSFFACAHAP+TLDDDAAA  NAV AV+EILNNH G +S+ P  ASPALPGFG TPRRVRQLRITDSPFPLQDANADP VDKAADEFISRFYKELR
Subjt:  NQHHNSFFACAHAPETLDDDAAATANAVKAVIEILNNHNGGASS-PAQASPALPGFGRTPRRVRQLRITDSPFPLQDANADPQVDKAADEFISRFYKELR

Query:  LQTKADD
        LQ  AD+
Subjt:  LQTKADD

XP_022991792.1 uncharacterized protein LOC111488328 [Cucurbita maxima] | e value: 1.9e-67 | % identity: 69.67
Query:  MENNLPVMAKKAWNLVRVVFFLLRKGISKSKLILDLNLMMKRGKIAGKAISNLMFHHH-HGGASPSAAATGDLPFTVGADEYEFSCSNSPAFPGFH--VG
        MENNLPV++K+ W LVRV +FLLRKGISKSKLILDLNLMMKRGKIAGKAI+NLMFHHH HGGA+PS++A   LP  VG D+YEF+CS+SPAFP  H    
Subjt:  MENNLPVMAKKAWNLVRVVFFLLRKGISKSKLILDLNLMMKRGKIAGKAISNLMFHHH-HGGASPSAAATGDLPFTVGADEYEFSCSNSPAFPGFH--VG

Query:  KHRRRNQHHNSFFACAHAPETLDDDAAATANAVKAVIEILNNHNGGASSPAQASPALPGFGRTPRRVRQLRITDSPFPLQDANADPQVDKAADEFISRFY
          RRRNQ+H SFFACAHAP+TLDDDAA    AVKA +EI N H+ GASSP+  S +          VRQLRITDSPFPL DANAD  VDKAADE+ISRFY
Subjt:  KHRRRNQHHNSFFACAHAPETLDDDAAATANAVKAVIEILNNHNGGASSPAQASPALPGFGRTPRRVRQLRITDSPFPLQDANADPQVDKAADEFISRFY

Query:  KELRLQTKADD
        KELRLQ  AD+
Subjt:  KELRLQTKADD

XP_023000392.1 uncharacterized protein LOC111494650 [Cucurbita maxima] | e value: 1.7e-89 | % identity: 81.69
Query:  MENNLPVMAKKAWNLVRVVFFLLRKGISKSKLILDLNLMMKRGKIAGKAISNLMFHHH-HGGASPSAA---ATGDLPFTVGADEYEFSCSNSPAFPGFHV
        MENNLP+MAKK WNLVRV +FLLRKGISKSKL+LDLNLM KRGK+AGKAISNLMFHHH H  ASPS++   + G LPF +GADEYEFSCSNSPAFP FHV
Subjt:  MENNLPVMAKKAWNLVRVVFFLLRKGISKSKLILDLNLMMKRGKIAGKAISNLMFHHH-HGGASPSAA---ATGDLPFTVGADEYEFSCSNSPAFPGFHV

Query:  GKHRRRNQHHNSFFACAHAPETLDDDAAATANAVKAVIEILNNHNGGASS-PAQASPALPGFGRTPRRVRQLRITDSPFPLQDANADPQVDKAADEFISR
        GK RRRNQ+HNSFFACAHAP+TLDDDAAA  NAV AV+EILNNH+G +S+ P  ASPALPGFGRTPRRVRQLRITDSPFPLQDANADP VDKAADEFISR
Subjt:  GKHRRRNQHHNSFFACAHAPETLDDDAAATANAVKAVIEILNNHNGGASS-PAQASPALPGFGRTPRRVRQLRITDSPFPLQDANADPQVDKAADEFISR

Query:  FYKELRLQTKADD
        FYKELRLQ  AD+
Subjt:  FYKELRLQTKADD

XP_023514516.1 uncharacterized protein LOC111778773 [Cucurbita pepo subsp. pepo] | e value: 1.6e-82 | % identity: 79.9
Query:  MAKKAWNLVRVVFFLLRKGISKSKLILDLNLMMKRGKIAGKAISNLMFHHHH---GGASPSAA----ATGDLPFTVGADEYEFSCSNSPAFPGFHVGKHR
        MAKK WNLVRV +FLLRKGISKSKL+LDLNLM KRGK+AGKAISNLMFHHH+     ASPS++    +TG LPF +GADEYEFSCSNSPAFP FHVGK R
Subjt:  MAKKAWNLVRVVFFLLRKGISKSKLILDLNLMMKRGKIAGKAISNLMFHHHH---GGASPSAA----ATGDLPFTVGADEYEFSCSNSPAFPGFHVGKHR

Query:  RRNQHHNSFFACAHAPETLDDDAAATANAVKAVIEILNNHNGGASS-PAQASPALPGFGRTPRRVRQLRITDSPFPLQDANADPQVDKAADEFISRFYKE
        RRNQ+HN FFACAHAP+TLDDDAAA  NAV AV+EILNNH G +S+ P  ASPALPGFGRTPRRVRQLRITDSPFPLQDANADP VDKAADEFISRFYKE
Subjt:  RRNQHHNSFFACAHAPETLDDDAAATANAVKAVIEILNNHNGGASS-PAQASPALPGFGRTPRRVRQLRITDSPFPLQDANADPQVDKAADEFISRFYKE

Query:  LRLQTKADD
        LRLQ  AD+
Subjt:  LRLQTKADD

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K800 Uncharacterized protein | e value: 7.5e-62 | % identity: 68.14
Query:  MENNLPVMAKKAWNLVRVVFFLLRKGISKSKL-ILDLNLMMKRGKIAGKAISNLMFHHHHGGASPSAAATGDLPFTVGADEYEFSCSNSPAFPGFHVGKH
        ME+N+PV+AKK WNLVRV +FLLRKGISKSK+ +LDLNLMMKRGKIAGKAISNLMF HH+  A         LPF V AD+YEFSCSN+P++  F  GK 
Subjt:  MENNLPVMAKKAWNLVRVVFFLLRKGISKSKL-ILDLNLMMKRGKIAGKAISNLMFHHHHGGASPSAAATGDLPFTVGADEYEFSCSNSPAFPGFHVGKH

Query:  RRRNQHHNSFFACAHAPETLDDDAAATANAVKAVIEILNNHNGGASSPAQASPALPGFGRTPRRVRQLRITDSPFPLQDANADPQVDKAADEFISRFYKE
        RR N +HNSFFACAHAP+TLDDD   T NA+KAV++ILNN N     P  +SPA       P  VRQLRITDSPFPLQD NADP VDKAADEFISRFYKE
Subjt:  RRRNQHHNSFFACAHAPETLDDDAAATANAVKAVIEILNNHNGGASSPAQASPALPGFGRTPRRVRQLRITDSPFPLQDANADPQVDKAADEFISRFYKE

Query:  LRLQ
        L LQ
Subjt:  LRLQ

A0A6J1GPL9 uncharacterized protein LOC111456334 | e value: 1.2e-67 | % identity: 70.28
Query:  MENNLPVMAKKAWNLVRVVFFLLRKGISKSKLILDLNLMMKRGKIAGKAISNLMFHHH-HGGASPSAAATGDLPFTVGADEYEFSCSNSPAFPGFH--VG
        MENNLPV++K+ W LVRV +FLLRKGISKSKLILDLNLMMKRGKIAGKAI+NLMFHHH HGGA+PS++A   LP  VG D+YEF+CS+SPAFP  H    
Subjt:  MENNLPVMAKKAWNLVRVVFFLLRKGISKSKLILDLNLMMKRGKIAGKAISNLMFHHH-HGGASPSAAATGDLPFTVGADEYEFSCSNSPAFPGFH--VG

Query:  KHRRRNQHH-NSFFACAHAPETLDDDAAATANAVKAVIEILNNHNGGASSPAQASPALPGFGRTPRRVRQLRITDSPFPLQDANADPQVDKAADEFISRF
          RRRNQ+H +SFFACAHAP TLDDDAAA  NAVKA +EI N H+ GASSP+  S +          VRQLRITDSPFPL DANAD  VDKAADE+ISRF
Subjt:  KHRRRNQHH-NSFFACAHAPETLDDDAAATANAVKAVIEILNNHNGGASSPAQASPALPGFGRTPRRVRQLRITDSPFPLQDANADPQVDKAADEFISRF

Query:  YKELRLQTKADD
        YKELRLQ  AD+
Subjt:  YKELRLQTKADD

A0A6J1HJS6 uncharacterized protein LOC111464220 | e value: 7.7e-83 | % identity: 81.16
Query:  MAKKAWNLVRVVFFLLRKGISKSKLILDLNLMMKRGKIAGKAISNLMFHHHH--GGASPSAA---ATGDLPFTVGADEYEFSCSNSPAFPGFHVGKHRRR
        MAKK WNLVRVV+FLLRKGISKSKL+LDLNLM KRGK+AGKAISNLMFHHH+    ASPS++   + G LPF VGADEYEFSCSNSPAFP FHVGK RRR
Subjt:  MAKKAWNLVRVVFFLLRKGISKSKLILDLNLMMKRGKIAGKAISNLMFHHHH--GGASPSAA---ATGDLPFTVGADEYEFSCSNSPAFPGFHVGKHRRR

Query:  NQHHNSFFACAHAPETLDDDAAATANAVKAVIEILNNHNGGASS-PAQASPALPGFGRTPRRVRQLRITDSPFPLQDANADPQVDKAADEFISRFYKELR
        NQ+HNSFFACAHAP+TLDDDAAA  NAV AV+EILNNH G +S+ P  ASPALPGFG TPRRVRQLRITDSPFPLQDANADP VDKAADEFISRFYKELR
Subjt:  NQHHNSFFACAHAPETLDDDAAATANAVKAVIEILNNHNGGASS-PAQASPALPGFGRTPRRVRQLRITDSPFPLQDANADPQVDKAADEFISRFYKELR

Query:  LQTKADD
        LQ  AD+
Subjt:  LQTKADD

A0A6J1JMV1 uncharacterized protein LOC111488328 | e value: 9.1e-68 | % identity: 69.67
Query:  MENNLPVMAKKAWNLVRVVFFLLRKGISKSKLILDLNLMMKRGKIAGKAISNLMFHHH-HGGASPSAAATGDLPFTVGADEYEFSCSNSPAFPGFH--VG
        MENNLPV++K+ W LVRV +FLLRKGISKSKLILDLNLMMKRGKIAGKAI+NLMFHHH HGGA+PS++A   LP  VG D+YEF+CS+SPAFP  H    
Subjt:  MENNLPVMAKKAWNLVRVVFFLLRKGISKSKLILDLNLMMKRGKIAGKAISNLMFHHH-HGGASPSAAATGDLPFTVGADEYEFSCSNSPAFPGFH--VG

Query:  KHRRRNQHHNSFFACAHAPETLDDDAAATANAVKAVIEILNNHNGGASSPAQASPALPGFGRTPRRVRQLRITDSPFPLQDANADPQVDKAADEFISRFY
          RRRNQ+H SFFACAHAP+TLDDDAA    AVKA +EI N H+ GASSP+  S +          VRQLRITDSPFPL DANAD  VDKAADE+ISRFY
Subjt:  KHRRRNQHHNSFFACAHAPETLDDDAAATANAVKAVIEILNNHNGGASSPAQASPALPGFGRTPRRVRQLRITDSPFPLQDANADPQVDKAADEFISRFY

Query:  KELRLQTKADD
        KELRLQ  AD+
Subjt:  KELRLQTKADD

A0A6J1KDI8 uncharacterized protein LOC111494650 | e value: 8.5e-90 | % identity: 81.69
Query:  MENNLPVMAKKAWNLVRVVFFLLRKGISKSKLILDLNLMMKRGKIAGKAISNLMFHHH-HGGASPSAA---ATGDLPFTVGADEYEFSCSNSPAFPGFHV
        MENNLP+MAKK WNLVRV +FLLRKGISKSKL+LDLNLM KRGK+AGKAISNLMFHHH H  ASPS++   + G LPF +GADEYEFSCSNSPAFP FHV
Subjt:  MENNLPVMAKKAWNLVRVVFFLLRKGISKSKLILDLNLMMKRGKIAGKAISNLMFHHH-HGGASPSAA---ATGDLPFTVGADEYEFSCSNSPAFPGFHV

Query:  GKHRRRNQHHNSFFACAHAPETLDDDAAATANAVKAVIEILNNHNGGASS-PAQASPALPGFGRTPRRVRQLRITDSPFPLQDANADPQVDKAADEFISR
        GK RRRNQ+HNSFFACAHAP+TLDDDAAA  NAV AV+EILNNH+G +S+ P  ASPALPGFGRTPRRVRQLRITDSPFPLQDANADP VDKAADEFISR
Subjt:  GKHRRRNQHHNSFFACAHAPETLDDDAAATANAVKAVIEILNNHNGGASS-PAQASPALPGFGRTPRRVRQLRITDSPFPLQDANADPQVDKAADEFISR

Query:  FYKELRLQTKADD
        FYKELRLQ  AD+
Subjt:  FYKELRLQTKADD

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G52140.1 unknown protein | e value: 5.1e-39 | % identity: 45.87
Query:  MENNLPVMAKKAWNLVRVVFFLLRKGISKSKLILDLNLMMKRGKIAGKAISNLMFHHH---HGGASPSAAATGDLPFTVGADEYEFSCSNSP--AFPGFH
        M+ N+P+ +KK WN+VR + +++RKG+SK+KLI D N  +KRGK       NLMFH     H G++ SAA            EYEFSCSN+P  +FP  +
Subjt:  MENNLPVMAKKAWNLVRVVFFLLRKGISKSKLILDLNLMMKRGKIAGKAISNLMFHHH---HGGASPSAAATGDLPFTVGADEYEFSCSNSP--AFPGFH

Query:  VGKHRRRNQHHNSFFACAHAPETLDDDAAATANAVKAVIEILNN-HNGGASSPAQ----ASPALPGFGRTPRRVRQLRITDSPFPLQDANAD---PQVDK
        +   R+++  HN+ F C   P+TLDDD A    A +AV+E+LN     G  +PA      SP  PGFG+TP  VR LR+TDSPFPL   N D     VDK
Subjt:  VGKHRRRNQHHNSFFACAHAPETLDDDAAATANAVKAVIEILNN-HNGGASSPAQ----ASPALPGFGRTPRRVRQLRITDSPFPLQDANAD---PQVDK

Query:  AADEFISRFYKELRLQTK
        AAD+FI +FYK L  Q K
Subjt:  AADEFISRFYKELRLQTK

AT3G16330.1 unknown protein | e value: 2.4e-36 | % identity: 44.29
Query:  MENNLPVMAKKAWNLVRVVFFLLRKGISKSKLILDLNLMMKRGKIAGKAISNLMFHHHHGGASPSAAATGDLPFTVGADEYEFSCSNSP--AFPGFHVGK
        ME N+ + +KK  N+VR V ++L KGISK KL+ D N  +KRGK       NLMFH+       + A+          +EYEFSCS++P   FP F++  
Subjt:  MENNLPVMAKKAWNLVRVVFFLLRKGISKSKLILDLNLMMKRGKIAGKAISNLMFHHHHGGASPSAAATGDLPFTVGADEYEFSCSNSP--AFPGFHVGK

Query:  HRRRNQHHNSFFACAHAPETLDDDAAATANAVKAVIEILN---NHNGGASSPA-------QASPALPGFGRTPRRVRQLRITDSPFPLQDAN--ADPQVD
         ++++ HHNS F+C  AP TLDDD + +    +AV+E+LN   +H+ G+++PA         SP LPGFGR+   VR LR+TDSPFPL++    A+  VD
Subjt:  HRRRNQHHNSFFACAHAPETLDDDAAATANAVKAVIEILN---NHNGGASSPA-------QASPALPGFGRTPRRVRQLRITDSPFPLQDAN--ADPQVD

Query:  KAADEFISRFYKELRLQTK
        KAADEFI +FYK L  Q K
Subjt:  KAADEFISRFYKELRLQTK

AT4G29110.1 unknown protein | e value: 4.4e-22 | % identity: 38.46
Query:  MENNLPVMAKKAWNLVRVVFFLLRKGISKSKLILDLNLMMKRGKIAGKAISNLMFHHHHGGASPSAAATGDLPFTVGADEYEFSCSNSPAFPGFHVGKHR
        ME N  V AK+ W +VR+VF +L+ G  K+KL+LDLNLM+KRG    KAI+NL           S+  + D+  +    +Y+         P   + K +
Subjt:  MENNLPVMAKKAWNLVRVVFFLLRKGISKSKLILDLNLMMKRGKIAGKAISNLMFHHHHGGASPSAAATGDLPFTVGADEYEFSCSNSPAFPGFHVGKHR

Query:  RRNQHHNSFFACAHAPETLDDDAAATANAVKAVIEIL--NNHNGGASSPAQASPALPGFGRTPRRVRQLRITDSPFPLQD-ANADPQVDKAADEFISRFY
        RR   H  +          D++  A   AVK V E+L  N+    A+  A+ SP +         VRQLR+TDSPFPL D  + D  VDKAA+EFI +FY
Subjt:  RRNQHHNSFFACAHAPETLDDDAAATANAVKAVIEIL--NNHNGGASSPAQASPALPGFGRTPRRVRQLRITDSPFPLQD-ANADPQVDKAADEFISRFY

Query:  KELRLQTK
        K L+LQ K
Subjt:  KELRLQTK

AT4G32860.1 unknown protein | e value: 3.1e-04 | % identity: 28.57
Query:  MENNLPVMAKKAWNLVRVVFFLLRK--GISKSKLI--LDLNLMMKRGKIAGKAISN-LMFHHHHGGASPS------AAATGDLPFTVGADEYEFSCSNSP
        ME    V  KK  +L +++ F ++K    S+ KL+  LD +L+ KRGKI  K+++  +   H      PS      ++    +P  +   EYEFSCS++P
Subjt:  MENNLPVMAKKAWNLVRVVFFLLRK--GISKSKLI--LDLNLMMKRGKIAGKAISN-LMFHHHHGGASPS------AAATGDLPFTVGADEYEFSCSNSP

Query:  AFPGF--HVGKHRRRNQHHNSFFACAHAPETLDDDAAATANAVKAVIE-ILNNHNGGASSPAQASPALPGFGRTPRRVRQLRITDSPFPLQDANADPQVD
            +   V K RR N  HN               A    N +  V + I + H   A  P  AS                    S   ++  +    VD
Subjt:  AFPGF--HVGKHRRRNQHHNSFFACAHAPETLDDDAAATANAVKAVIE-ILNNHNGGASSPAQASPALPGFGRTPRRVRQLRITDSPFPLQDANADPQVD

Query:  KAADEFISRFYKELRLQ
        +AA+EFI  FY++LRLQ
Subjt:  KAADEFISRFYKELRLQ
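
In the alignments above, the unlabeled middle line of each block appears to follow the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. A minimal sketch, assuming that convention holds here, counts identities and positives for a single block. The function `midline_stats` is a hypothetical helper, not part of CuGenDB or BLAST.

```python
def midline_stats(query: str, midline: str) -> dict[str, float]:
    """Count identities and positives from a BLAST-style midline."""
    # Pad the midline so trailing spaces lost in copy/paste still count as mismatches.
    midline = midline.ljust(len(query))
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    length = len(query)
    return {
        "identities": identities,
        "positives": positives,
        "length": length,
        "percent_identity": round(100 * identities / length, 2),
    }

# Final block of the XP_022964083.1 alignment: query row and its midline.
stats = midline_stats("LQTKADD", "LQ  AD+")
print(stats)
```

The % identity reported for each hit covers the whole alignment, so reproducing it would mean summing identities and lengths over every block of that hit rather than using one block alone.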


Sequences
CDS sequence
ATGGAAAATAATCTTCCTGTGATGGCCAAAAAAGCCTGGAATTTAGTTCGCGTGGTTTTTTTCTTGCTCAGAAAAGGCATCTCCAAGAGCAAGCTCATCCTCGACCTCAA
TCTCATGATGAAGCGCGGCAAAATCGCCGGAAAAGCCATCAGCAATCTCATGTTCCACCACCACCACGGCGGCGCCTCTCCCTCCGCCGCCGCCACCGGCGACCTCCCCT
TCACCGTCGGCGCCGACGAGTACGAATTCAGCTGTAGCAACAGCCCCGCCTTCCCCGGCTTCCACGTCGGCAAGCACCGCCGCCGTAACCAACACCACAACTCCTTCTTC
GCGTGTGCTCACGCACCCGAAACCCTCGACGACGACGCCGCCGCTACCGCGAACGCCGTCAAGGCCGTAATTGAGATCCTGAACAACCACAACGGCGGCGCGTCGTCTCC
GGCGCAGGCCTCCCCGGCCCTCCCCGGCTTCGGCCGGACTCCGAGGAGGGTCCGCCAGCTCAGGATAACGGACTCGCCGTTCCCTCTCCAAGACGCTAACGCCGATCCGC
AAGTCGACAAAGCCGCCGACGAATTCATCTCCAGGTTTTACAAGGAGCTCAGGCTCCAGACCAAGGCCGACGACGGC
mRNA sequence
ATGGAAAATAATCTTCCTGTGATGGCCAAAAAAGCCTGGAATTTAGTTCGCGTGGTTTTTTTCTTGCTCAGAAAAGGCATCTCCAAGAGCAAGCTCATCCTCGACCTCAA
TCTCATGATGAAGCGCGGCAAAATCGCCGGAAAAGCCATCAGCAATCTCATGTTCCACCACCACCACGGCGGCGCCTCTCCCTCCGCCGCCGCCACCGGCGACCTCCCCT
TCACCGTCGGCGCCGACGAGTACGAATTCAGCTGTAGCAACAGCCCCGCCTTCCCCGGCTTCCACGTCGGCAAGCACCGCCGCCGTAACCAACACCACAACTCCTTCTTC
GCGTGTGCTCACGCACCCGAAACCCTCGACGACGACGCCGCCGCTACCGCGAACGCCGTCAAGGCCGTAATTGAGATCCTGAACAACCACAACGGCGGCGCGTCGTCTCC
GGCGCAGGCCTCCCCGGCCCTCCCCGGCTTCGGCCGGACTCCGAGGAGGGTCCGCCAGCTCAGGATAACGGACTCGCCGTTCCCTCTCCAAGACGCTAACGCCGATCCGC
AAGTCGACAAAGCCGCCGACGAATTCATCTCCAGGTTTTACAAGGAGCTCAGGCTCCAGACCAAGGCCGACGACGGC
Protein sequence
MENNLPVMAKKAWNLVRVVFFLLRKGISKSKLILDLNLMMKRGKIAGKAISNLMFHHHHGGASPSAAATGDLPFTVGADEYEFSCSNSPAFPGFHVGKHRRRNQHHNSFF
ACAHAPETLDDDAAATANAVKAVIEILNNHNGGASSPAQASPALPGFGRTPRRVRQLRITDSPFPLQDANADPQVDKAADEFISRFYKELRLQTKADDG
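
The protein sequence can be checked against the CDS by translating codons in frame. The sketch below assumes the standard genetic code (NCBI translation table 1) and a CDS whose length is a multiple of three; `translate` is a hypothetical helper written for illustration. It is applied to the first seven codons of the CDS listed on this page.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First 21 nucleotides of the CDS sequence above.
print(translate("ATGGAAAATAATCTTCCTGTG"))  # MENNLPV
```

The result matches the first seven residues of the protein sequence. Passing the full CDS (with line breaks removed) to the same function would be the natural way to compare the two records end to end.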