; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS023739 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS023739
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionUnknown protein
Genome locationscaffold570:650446..651042
RNA-Seq ExpressionMS023739
SyntenyMS023739
Gene Ontology termsNA
InterPro domainsIPR008480 - Protein of unknown function DUF761, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004150847.2 uncharacterized protein LOC101210555 [Cucumis sativus]9.8e-9088.5Show/hide
Query:  IEANRPAVKKKLWNVVRAIVFMLRKGLSKSKIVLDLHLMIKQSKIAGKAIANLVEFHHGSAFSCQTIDIANSYISTRDYEFSCSNSPA-PAYPFRYFNKL
        +E  RPAVKKKLWNV+RA+VFMLRKGLSKSKI  DLHLM+K+SKIAGKAIANLVEFHHGSAFSCQTIDIANSYISTRDYEFSCSNSPA  AYPFRYFNK 
Subjt:  IEANRPAVKKKLWNVVRAIVFMLRKGLSKSKIVLDLHLMIKQSKIAGKAIANLVEFHHGSAFSCQTIDIANSYISTRDYEFSCSNSPA-PAYPFRYFNKL

Query:  RKHQNHHYFSKSYRYDDFSTVTAVHRVLDILHSDQKSEASPLVPLPGFGKSPRSVRQLRVTDSPFSLKDDGDSQFVDKAAEEFIKKFYTDLRLERSLATF
         K +  HYF KSYRYDDFSTVTAV RVLDILH+DQKSEASPLVPLPGFGKSP  VRQLRVTDSPFSLKDDGDSQFVDKAAEEFIKKFYTDLRLERSLA F
Subjt:  RKHQNHHYFSKSYRYDDFSTVTAVHRVLDILHSDQKSEASPLVPLPGFGKSPRSVRQLRVTDSPFSLKDDGDSQFVDKAAEEFIKKFYTDLRLERSLATF

XP_022154731.1 uncharacterized protein LOC111021911 [Momordica charantia]1.3e-10599.5Show/hide
Query:  IEANRPAVKKKLWNVVRAIVFMLRKGLSKSKIVLDLHLMIKQSKIAGKAIANLVEFHHGSAFSCQTIDIANSYISTRDYEFSCSNSPAPAYPFRYFNKLR
        IEANRPAVKKKLWNVVRAIVFMLRKGLSKSKIVLDLHLMIKQSKI GKAIANLVEFHHGSAFSCQTIDIANSYISTRDYEFSCSNSPAPAYPFRYFNKLR
Subjt:  IEANRPAVKKKLWNVVRAIVFMLRKGLSKSKIVLDLHLMIKQSKIAGKAIANLVEFHHGSAFSCQTIDIANSYISTRDYEFSCSNSPAPAYPFRYFNKLR

Query:  KHQNHHYFSKSYRYDDFSTVTAVHRVLDILHSDQKSEASPLVPLPGFGKSPRSVRQLRVTDSPFSLKDDGDSQFVDKAAEEFIKKFYTDLRLERSLATF
        KHQNHHYFSKSYRYDDFSTVTAVHRVLDILHSDQKSEASPLVPLPGFGKSPRSVRQLRVTDSPFSLKDDGDSQFVDKAAEEFIKKFYTDLRLERSLATF
Subjt:  KHQNHHYFSKSYRYDDFSTVTAVHRVLDILHSDQKSEASPLVPLPGFGKSPRSVRQLRVTDSPFSLKDDGDSQFVDKAAEEFIKKFYTDLRLERSLATF

XP_022998953.1 uncharacterized protein LOC111493479 [Cucurbita maxima]1.4e-9188.44Show/hide
Query:  IEANRPAVKKKLWNVVRAIVFMLRKGLSKSKIVLDLHLMIKQSKIAGKAIANLVEFHHGSAFSCQTIDIANSYISTRDYEFSCSNSPAPAYPFRYFNKLR
        IE  RPAVKKKLWNV+RA+VFMLRKGL+KSKIV DLHLM+K+SK+AGKA+ANLVEFHHGSAFSCQTIDIANSYISTRDYEFSCSNS  PAYPFRYFNK R
Subjt:  IEANRPAVKKKLWNVVRAIVFMLRKGLSKSKIVLDLHLMIKQSKIAGKAIANLVEFHHGSAFSCQTIDIANSYISTRDYEFSCSNSPAPAYPFRYFNKLR

Query:  KHQNHHYFSKSYRYDDFSTVTAVHRVLDILHSDQKSEASPLVPLPGFGKSPRSVRQLRVTDSPFSLKDDGDSQFVDKAAEEFIKKFYTDLRLERSLATF
        KHQN HYF KSYRYDDFSTVTAV RVLDILHSDQKSEASPLVPLPGFGKSP  VRQLRVTDSPFSLKDD DSQ VDKAAEEFIKKFYTDLRLE+SLA +
Subjt:  KHQNHHYFSKSYRYDDFSTVTAVHRVLDILHSDQKSEASPLVPLPGFGKSPRSVRQLRVTDSPFSLKDDGDSQFVDKAAEEFIKKFYTDLRLERSLATF

XP_023545626.1 uncharacterized protein LOC111805003 [Cucurbita pepo subsp. pepo]3.6e-9289.45Show/hide
Query:  IEANRPAVKKKLWNVVRAIVFMLRKGLSKSKIVLDLHLMIKQSKIAGKAIANLVEFHHGSAFSCQTIDIANSYISTRDYEFSCSNSPAPAYPFRYFNKLR
        IE  RPAVKKKLWNV+RA+VFMLRKGLSKSKIV DLHLM+K+SKIAGKA+ANLVEFHHGSAFSCQTIDIANSYISTRDYEFSCSNS  PAYPFRYFNK R
Subjt:  IEANRPAVKKKLWNVVRAIVFMLRKGLSKSKIVLDLHLMIKQSKIAGKAIANLVEFHHGSAFSCQTIDIANSYISTRDYEFSCSNSPAPAYPFRYFNKLR

Query:  KHQNHHYFSKSYRYDDFSTVTAVHRVLDILHSDQKSEASPLVPLPGFGKSPRSVRQLRVTDSPFSLKDDGDSQFVDKAAEEFIKKFYTDLRLERSLATF
        KHQN HYF KSYRYDDFSTVTAV RVLDILHSDQKSEASPLVPLPGFGKSP  VRQLRVTDSPFSLKDD DSQ VDKAAEEFIKKFYTDLRLE+SLA +
Subjt:  KHQNHHYFSKSYRYDDFSTVTAVHRVLDILHSDQKSEASPLVPLPGFGKSPRSVRQLRVTDSPFSLKDDGDSQFVDKAAEEFIKKFYTDLRLERSLATF

XP_038877425.1 uncharacterized protein LOC120069707 [Benincasa hispida]1.7e-8987.94Show/hide
Query:  IEANRPAVKKKLWNVVRAIVFMLRKGLSKSKIVLDLHLMIKQSKIAGKAIANLVEFHHGSAFSCQTIDIANSYISTRDYEFSCSNSPAPAYPFRYFNKLR
        +E  RPAVKKKLWNV+RA+VFMLRKGLSKSKI  DLHLM+K+SKIAGKAIANLVEFHHGSAFSCQTIDIANSYISTRDYEFSCSNSPA AYPFRYFNK  
Subjt:  IEANRPAVKKKLWNVVRAIVFMLRKGLSKSKIVLDLHLMIKQSKIAGKAIANLVEFHHGSAFSCQTIDIANSYISTRDYEFSCSNSPAPAYPFRYFNKLR

Query:  KHQNHHYFSKSYRYDDFSTVTAVHRVLDILHSDQKSEASPLVPLPGFGKSPRSVRQLRVTDSPFSLKDDGDSQFVDKAAEEFIKKFYTDLRLERSLATF
        K +  HYF KSYRYDDFSTVTAV RVLDILH+DQKSEASPLV LPGFGKSP  VRQLRVTDSPFSLKDDGDSQFVDKAAEEFIKKFYTDLRLERS A F
Subjt:  KHQNHHYFSKSYRYDDFSTVTAVHRVLDILHSDQKSEASPLVPLPGFGKSPRSVRQLRVTDSPFSLKDDGDSQFVDKAAEEFIKKFYTDLRLERSLATF

TrEMBL top hitse value%identityAlignment
A0A0A0LHW0 Uncharacterized protein4.7e-9088.5Show/hide
Query:  IEANRPAVKKKLWNVVRAIVFMLRKGLSKSKIVLDLHLMIKQSKIAGKAIANLVEFHHGSAFSCQTIDIANSYISTRDYEFSCSNSPA-PAYPFRYFNKL
        +E  RPAVKKKLWNV+RA+VFMLRKGLSKSKI  DLHLM+K+SKIAGKAIANLVEFHHGSAFSCQTIDIANSYISTRDYEFSCSNSPA  AYPFRYFNK 
Subjt:  IEANRPAVKKKLWNVVRAIVFMLRKGLSKSKIVLDLHLMIKQSKIAGKAIANLVEFHHGSAFSCQTIDIANSYISTRDYEFSCSNSPA-PAYPFRYFNKL

Query:  RKHQNHHYFSKSYRYDDFSTVTAVHRVLDILHSDQKSEASPLVPLPGFGKSPRSVRQLRVTDSPFSLKDDGDSQFVDKAAEEFIKKFYTDLRLERSLATF
         K +  HYF KSYRYDDFSTVTAV RVLDILH+DQKSEASPLVPLPGFGKSP  VRQLRVTDSPFSLKDDGDSQFVDKAAEEFIKKFYTDLRLERSLA F
Subjt:  RKHQNHHYFSKSYRYDDFSTVTAVHRVLDILHSDQKSEASPLVPLPGFGKSPRSVRQLRVTDSPFSLKDDGDSQFVDKAAEEFIKKFYTDLRLERSLATF

A0A1S3C8H8 uncharacterized protein LOC1034980402.3e-8988Show/hide
Query:  IEANRPAVKKKLWNVVRAIVFMLRKGLSKSKIVLDLHLMIKQSKIAGKAIANLVEFHHGSAFSCQTIDIANSYISTRDYEFSCSNSPA-PAYPFRYFNKL
        +E  RPAVKKKLWNV+RA+VFMLRKGLSKSKI  DLHLM+K+SKIAGKAIANLVEFHHGSAFSCQTIDIANSYISTRDYEFSCSNSPA  AYPFRYFNK 
Subjt:  IEANRPAVKKKLWNVVRAIVFMLRKGLSKSKIVLDLHLMIKQSKIAGKAIANLVEFHHGSAFSCQTIDIANSYISTRDYEFSCSNSPA-PAYPFRYFNKL

Query:  RKHQNHHYFSKSYRYDDFSTVTAVHRVLDILHSDQKSEASPLVPLPGFGKSPRSVRQLRVTDSPFSLKDDGDSQFVDKAAEEFIKKFYTDLRLERSLATF
         K +  HYF KSYRYDDFSTVTAV RVLDILH+DQKSEASPLVPLPGFGKSP  VRQLRVTDSPFSLKDDGDSQ VDKAAEEFIKKFYTDLRLERSLA F
Subjt:  RKHQNHHYFSKSYRYDDFSTVTAVHRVLDILHSDQKSEASPLVPLPGFGKSPRSVRQLRVTDSPFSLKDDGDSQFVDKAAEEFIKKFYTDLRLERSLATF

A0A5A7V077 Uncharacterized protein2.3e-8988Show/hide
Query:  IEANRPAVKKKLWNVVRAIVFMLRKGLSKSKIVLDLHLMIKQSKIAGKAIANLVEFHHGSAFSCQTIDIANSYISTRDYEFSCSNSPA-PAYPFRYFNKL
        +E  RPAVKKKLWNV+RA+VFMLRKGLSKSKI  DLHLM+K+SKIAGKAIANLVEFHHGSAFSCQTIDIANSYISTRDYEFSCSNSPA  AYPFRYFNK 
Subjt:  IEANRPAVKKKLWNVVRAIVFMLRKGLSKSKIVLDLHLMIKQSKIAGKAIANLVEFHHGSAFSCQTIDIANSYISTRDYEFSCSNSPA-PAYPFRYFNKL

Query:  RKHQNHHYFSKSYRYDDFSTVTAVHRVLDILHSDQKSEASPLVPLPGFGKSPRSVRQLRVTDSPFSLKDDGDSQFVDKAAEEFIKKFYTDLRLERSLATF
         K +  HYF KSYRYDDFSTVTAV RVLDILH+DQKSEASPLVPLPGFGKSP  VRQLRVTDSPFSLKDDGDSQ VDKAAEEFIKKFYTDLRLERSLA F
Subjt:  RKHQNHHYFSKSYRYDDFSTVTAVHRVLDILHSDQKSEASPLVPLPGFGKSPRSVRQLRVTDSPFSLKDDGDSQFVDKAAEEFIKKFYTDLRLERSLATF

A0A6J1DPK7 uncharacterized protein LOC1110219116.1e-10699.5Show/hide
Query:  IEANRPAVKKKLWNVVRAIVFMLRKGLSKSKIVLDLHLMIKQSKIAGKAIANLVEFHHGSAFSCQTIDIANSYISTRDYEFSCSNSPAPAYPFRYFNKLR
        IEANRPAVKKKLWNVVRAIVFMLRKGLSKSKIVLDLHLMIKQSKI GKAIANLVEFHHGSAFSCQTIDIANSYISTRDYEFSCSNSPAPAYPFRYFNKLR
Subjt:  IEANRPAVKKKLWNVVRAIVFMLRKGLSKSKIVLDLHLMIKQSKIAGKAIANLVEFHHGSAFSCQTIDIANSYISTRDYEFSCSNSPAPAYPFRYFNKLR

Query:  KHQNHHYFSKSYRYDDFSTVTAVHRVLDILHSDQKSEASPLVPLPGFGKSPRSVRQLRVTDSPFSLKDDGDSQFVDKAAEEFIKKFYTDLRLERSLATF
        KHQNHHYFSKSYRYDDFSTVTAVHRVLDILHSDQKSEASPLVPLPGFGKSPRSVRQLRVTDSPFSLKDDGDSQFVDKAAEEFIKKFYTDLRLERSLATF
Subjt:  KHQNHHYFSKSYRYDDFSTVTAVHRVLDILHSDQKSEASPLVPLPGFGKSPRSVRQLRVTDSPFSLKDDGDSQFVDKAAEEFIKKFYTDLRLERSLATF

A0A6J1KI86 uncharacterized protein LOC1114934796.6e-9288.44Show/hide
Query:  IEANRPAVKKKLWNVVRAIVFMLRKGLSKSKIVLDLHLMIKQSKIAGKAIANLVEFHHGSAFSCQTIDIANSYISTRDYEFSCSNSPAPAYPFRYFNKLR
        IE  RPAVKKKLWNV+RA+VFMLRKGL+KSKIV DLHLM+K+SK+AGKA+ANLVEFHHGSAFSCQTIDIANSYISTRDYEFSCSNS  PAYPFRYFNK R
Subjt:  IEANRPAVKKKLWNVVRAIVFMLRKGLSKSKIVLDLHLMIKQSKIAGKAIANLVEFHHGSAFSCQTIDIANSYISTRDYEFSCSNSPAPAYPFRYFNKLR

Query:  KHQNHHYFSKSYRYDDFSTVTAVHRVLDILHSDQKSEASPLVPLPGFGKSPRSVRQLRVTDSPFSLKDDGDSQFVDKAAEEFIKKFYTDLRLERSLATF
        KHQN HYF KSYRYDDFSTVTAV RVLDILHSDQKSEASPLVPLPGFGKSP  VRQLRVTDSPFSLKDD DSQ VDKAAEEFIKKFYTDLRLE+SLA +
Subjt:  KHQNHHYFSKSYRYDDFSTVTAVHRVLDILHSDQKSEASPLVPLPGFGKSPRSVRQLRVTDSPFSLKDDGDSQFVDKAAEEFIKKFYTDLRLERSLATF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G52140.1 unknown protein4.4e-2437.38Show/hide
Query:  VKKKLWNVVRAIVFMLRKGLSKSKIVLDLHLMIKQSKIAGKAIANLVEFHHGSAFSCQTIDIANSYISTRDYEFSCSNSPAPAYPFRYFNKLRKHQNHHY
        + KKLWN+VR +++M+RKG+SK+K++ D +  +K+ K            H GS  S      + +  S ++YEFSCSN+P  ++PF     +RK  +++ 
Subjt:  VKKKLWNVVRAIVFMLRKGLSKSKIVLDLHLMIKQSKIAGKAIANLVEFHHGSAFSCQTIDIANSYISTRDYEFSCSNSPAPAYPFRYFNKLRKHQNHHY

Query:  FS---KSYRYDDFSTVTAVHRVLDILHS-DQKSEASP------LVP-LPGFGKSPRSVRQLRVTDSPFSL-KDDGD--SQFVDKAAEEFIKKFYTDLRLE
        F+        DD   V A   VL++L+   +K   +P      L P  PGFG++P  VR LRVTDSPF L  ++GD  +  VDKAA++FIKKFY +L  +
Subjt:  FS---KSYRYDDFSTVTAVHRVLDILHS-DQKSEASP------LVP-LPGFGKSPRSVRQLRVTDSPFSL-KDDGD--SQFVDKAAEEFIKKFYTDLRLE

Query:  RSLATF
        + +  F
Subjt:  RSLATF

AT3G16330.1 unknown protein1.4e-2237.91Show/hide
Query:  AVKKKLWNVVRAIVFMLRKGLSKSKIVLDLHLMIKQSKIAGKAIANLVEFHH-----GSAFSCQTIDIANSYISTRDYEFSCSNSPAPAYPFRYFNKLRK
        ++ KKL N+VR +++ML KG+SK K++ D +  +K+ K       NL+ FH+     GSA +              +YEFSCS++P   +PF      +K
Subjt:  AVKKKLWNVVRAIVFMLRKGLSKSKIVLDLHLMIKQSKIAGKAIANLVEFHH-----GSAFSCQTIDIANSYISTRDYEFSCSNSPAPAYPFRYFNKLRK

Query:  HQNHHYFSKSYRYDDFSTVTAVHR-VLDILHS----DQKSEA--------SPLVP-LPGFGKSPRSVRQLRVTDSPFSLKDDGD--SQFVDKAAEEFIKK
          ++  FS           T+V R VL++L+S    DQ S          + L P LPGFG+S  SVR LRVTDSPF L+++GD  +  VDKAA+EFIKK
Subjt:  HQNHHYFSKSYRYDDFSTVTAVHR-VLDILHS----DQKSEA--------SPLVP-LPGFGKSPRSVRQLRVTDSPFSLKDDGD--SQFVDKAAEEFIKK

Query:  FYTDLRLERSL
        FY +L  ++ +
Subjt:  FYTDLRLERSL

AT4G29110.1 unknown protein3.0e-2037.75Show/hide
Query:  IEANRPAVKKKLWNVVRAIVFMLRKGLSKSKIVLDLHLMIKQSKIAGKAIANLVEFHHGSAFSCQTIDIANSYISTRDYEFSCSNSPAPAYPFRYFNKLR
        +E N     K+LW VVR +  +L+ G  K+K++LDL+LM+K+     KAI NL       + S  + D+++S    RDY+           PF + +K R
Subjt:  IEANRPAVKKKLWNVVRAIVFMLRKGLSKSKIVLDLHLMIKQSKIAGKAIANLVEFHHGSAFSCQTIDIANSYISTRDYEFSCSNSPAPAYPFRYFNKLR

Query:  KHQNHHYFSKSYRYDDFSTVTAVHRVLDIL-HSDQKSEA------SPLVPLPGFGKSPRSVRQLRVTDSPFSLKDDGD-SQFVDKAAEEFIKKFYTDLRL
        K + H      Y  ++ +   AV +V ++L  +D+K+ A      SPL+  P       +VRQLRVTDSPF L D GD    VDKAAEEFIKKFY +L+L
Subjt:  KHQNHHYFSKSYRYDDFSTVTAVHRVLDIL-HSDQKSEA------SPLVPLPGFGKSPRSVRQLRVTDSPFSLKDDGD-SQFVDKAAEEFIKKFYTDLRL

Query:  ERSL
        ++ +
Subjt:  ERSL

AT4G32860.1 unknown protein7.9e-0528.86Show/hide
Query:  KKLWNVVRAIVFMLRK--GLSKSKIV--LDLHLMIKQSKIAGKAIANLVEFHHGSAFSCQTI---DIANSYIS----TRDYEFSCSNSPAPAYPFRYFNK
        KKL ++ + I+F ++K    S+ K++  LD HL+ K+ KI  K++   V   H S  +C+     D+ +S+IS      +YEFSCS++P         +K
Subjt:  KKLWNVVRAIVFMLRK--GLSKSKIV--LDLHLMIKQSKIAGKAIANLVEFHHGSAFSCQTI---DIANSYIS----TRDYEFSCSNSPAPAYPFRYFNK

Query:  LRKHQNHH---YFSKSYR--YDDFSTVTAVHRVLDILHSDQKSEASPLVPLPGFGKSPRSVRQLRVTDSPFSLKDDGDSQFVDKAAEEFIKKFYTDLRLE
         R+    H     +K  R  Y  ++T+  V   +   H      A+ + P               V  S  ++    +S  VD+AAEEFI+ FY  LRL+
Subjt:  LRKHQNHH---YFSKSYR--YDDFSTVTAVHRVLDILHSDQKSEASPLVPLPGFGKSPRSVRQLRVTDSPFSLKDDGDSQFVDKAAEEFIKKFYTDLRLE

Query:  R
        +
Subjt:  R


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATTGAGGCGAATCGGCCGGCGGTGAAGAAGAAGCTGTGGAACGTGGTGCGAGCGATTGTATTTATGTTGAGGAAAGGGCTGAGTAAAAGCAAGATAGTGTTGGATCTTCA
TTTGATGATCAAACAGAGCAAAATCGCCGGAAAAGCGATAGCGAATCTGGTGGAGTTCCACCACGGCTCCGCCTTCAGTTGCCAGACGATCGACATCGCCAATTCCTACA
TCTCCACTCGCGATTACGAGTTCAGTTGCAGCAACAGTCCGGCGCCGGCGTATCCGTTCCGTTACTTCAATAAGCTTCGAAAACACCAAAACCACCACTATTTCTCCAAA
TCCTACCGCTACGACGATTTCTCCACCGTTACGGCCGTCCACAGAGTTCTGGATATTCTTCACAGCGATCAGAAGTCGGAGGCGTCGCCGTTAGTGCCGTTACCGGGATT
CGGAAAGAGTCCGCGGTCAGTACGGCAGTTGCGCGTTACGGACTCGCCGTTTTCTCTGAAAGACGACGGCGATAGCCAGTTCGTCGACAAGGCGGCGGAGGAATTTATCA
AGAAGTTCTACACGGATCTACGGCTGGAGAGAAGTTTAGCAACTTTC
mRNA sequenceShow/hide mRNA sequence
ATTGAGGCGAATCGGCCGGCGGTGAAGAAGAAGCTGTGGAACGTGGTGCGAGCGATTGTATTTATGTTGAGGAAAGGGCTGAGTAAAAGCAAGATAGTGTTGGATCTTCA
TTTGATGATCAAACAGAGCAAAATCGCCGGAAAAGCGATAGCGAATCTGGTGGAGTTCCACCACGGCTCCGCCTTCAGTTGCCAGACGATCGACATCGCCAATTCCTACA
TCTCCACTCGCGATTACGAGTTCAGTTGCAGCAACAGTCCGGCGCCGGCGTATCCGTTCCGTTACTTCAATAAGCTTCGAAAACACCAAAACCACCACTATTTCTCCAAA
TCCTACCGCTACGACGATTTCTCCACCGTTACGGCCGTCCACAGAGTTCTGGATATTCTTCACAGCGATCAGAAGTCGGAGGCGTCGCCGTTAGTGCCGTTACCGGGATT
CGGAAAGAGTCCGCGGTCAGTACGGCAGTTGCGCGTTACGGACTCGCCGTTTTCTCTGAAAGACGACGGCGATAGCCAGTTCGTCGACAAGGCGGCGGAGGAATTTATCA
AGAAGTTCTACACGGATCTACGGCTGGAGAGAAGTTTAGCAACTTTC
Protein sequenceShow/hide protein sequence
IEANRPAVKKKLWNVVRAIVFMLRKGLSKSKIVLDLHLMIKQSKIAGKAIANLVEFHHGSAFSCQTIDIANSYISTRDYEFSCSNSPAPAYPFRYFNKLRKHQNHHYFSK
SYRYDDFSTVTAVHRVLDILHSDQKSEASPLVPLPGFGKSPRSVRQLRVTDSPFSLKDDGDSQFVDKAAEEFIKKFYTDLRLERSLATF