CuGenDBv2

MS025055 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS025055
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: SASA domain-containing protein
Genome location: scaffold123_1:235424..236214
RNA-Seq Expression: MS025055
Synteny: MS025055
Gene Ontology terms: NA
InterPro domains: IPR005181 - Sialate O-acetylesterase domain
                  IPR036514 - SGNH hydrolase superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAG6578894.1 putative carbohydrate esterase, partial [Cucurbita argyrosperma subsp. sororia]; e value: 8.8e-67; % identity: 57.79
Query:  MAGRGGVSKDPIIGKNIWDGYIPPASRSHESILQLTAHLKWEQAREPLHWDIDYNKT----------NKLLAKAGKSIGVIGLVPCAIGGTHLREWVKG-
        MAGRGGVSKDP   KN+WDGYIPP S+ ++SI + TA + WEQAREPLHWDID  KT          N+LLAKAG SIG IGLVPCAIGG+HLREWVKG 
Subjt:  MAGRGGVSKDPIIGKNIWDGYIPPASRSHESILQLTAHLKWEQAREPLHWDIDYNKT----------NKLLAKAGKSIGVIGLVPCAIGGTHLREWVKG-

Query:  ---------------------------GESDASVEEESKYYEGNLTKFFTDLRKDSYHPTLPIIL----------------------AEEAVTHKLPNVR
                                   GESDA+VEEE+K YE  L+KFFTDLR D  HP LPIIL                      A+EAVT KLPNVR
Subjt:  ---------------------------GESDASVEEESKYYEGNLTKFFTDLRKDSYHPTLPIIL----------------------AEEAVTHKLPNVR

Query:  MVDGMKAVGNIDKGLNEDRGHLNVKSEVKLGKMLAHAFYSNFGH
        MVDG  AVGN D+GLNEDRGHLNV+SEV LGKM AH++YSNF H
Subjt:  MVDGMKAVGNIDKGLNEDRGHLNVKSEVKLGKMLAHAFYSNFGH

KAG7016422.1 putative carbohydrate esterase, partial [Cucurbita argyrosperma subsp. argyrosperma]; e value: 3.9e-67; % identity: 58.2
Query:  MAGRGGVSKDPIIGKNIWDGYIPPASRSHESILQLTAHLKWEQAREPLHWDIDYNKT----------NKLLAKAGKSIGVIGLVPCAIGGTHLREWVKG-
        MAGRGGVSKDP   KN+WDGYIPP S+ ++SI + TA + WEQAREPLHWDID  KT          N+LLAKAG SIG IGLVPCAIGG+HLREWVKG 
Subjt:  MAGRGGVSKDPIIGKNIWDGYIPPASRSHESILQLTAHLKWEQAREPLHWDIDYNKT----------NKLLAKAGKSIGVIGLVPCAIGGTHLREWVKG-

Query:  ---------------------------GESDASVEEESKYYEGNLTKFFTDLRKDSYHPTLPIIL----------------------AEEAVTHKLPNVR
                                   GESDA+VEEE+K YE  L+KFFTDLR D  HP LPIIL                      A+EAVT KLPNVR
Subjt:  ---------------------------GESDASVEEESKYYEGNLTKFFTDLRKDSYHPTLPIIL----------------------AEEAVTHKLPNVR

Query:  MVDGMKAVGNIDKGLNEDRGHLNVKSEVKLGKMLAHAFYSNFGH
        MVDG  AVGN D+GLNEDRGHLNVKSEV LGKM AH++YSNF H
Subjt:  MVDGMKAVGNIDKGLNEDRGHLNVKSEVKLGKMLAHAFYSNFGH

XP_022141681.1 probable carbohydrate esterase At4g34215 [Momordica charantia]; e value: 4.5e-71; % identity: 61.13
Query:  MAGRGGVSKDPIIGKNIWDGYIPPASRSHESILQLTAHLKWEQAREPLHWDIDYNKT----------NKLLAKAGKSIGVIGLVPCAIGGTHLREWVKG-
        MAGRGGVSKDPI  KNIWDGYIPP S+S+ESIL+LTA L+WEQA EPLHWDIDY+KT          N+L  + GKSIGVIGLVPCAIGGTHLREW+KG 
Subjt:  MAGRGGVSKDPIIGKNIWDGYIPPASRSHESILQLTAHLKWEQAREPLHWDIDYNKT----------NKLLAKAGKSIGVIGLVPCAIGGTHLREWVKG-

Query:  ---------------------------GESDASVEEESKYYEGNLTKFFTDLRKDSYHPTLPIIL----------------------AEEAVTHKLPNVR
                                   GESDASVEEESK YE  LTKFFTDLR DS +  LPIIL                      A+E VT KL NVR
Subjt:  ---------------------------GESDASVEEESKYYEGNLTKFFTDLRKDSYHPTLPIIL----------------------AEEAVTHKLPNVR

Query:  MVDGMKAVGNIDKGLNEDRGHLNVKSEVKLGKMLAHAFYSNFGHNLT
        MVDG +AVGN ++GLNED+GHLNVKSEVKLGKMLAHAFYSNF H LT
Subjt:  MVDGMKAVGNIDKGLNEDRGHLNVKSEVKLGKMLAHAFYSNFGHNLT

XP_022939276.1 probable carbohydrate esterase At4g34215 [Cucurbita moschata]; e value: 5.2e-67; % identity: 57.79
Query:  MAGRGGVSKDPIIGKNIWDGYIPPASRSHESILQLTAHLKWEQAREPLHWDIDYNKT----------NKLLAKAGKSIGVIGLVPCAIGGTHLREWVKG-
        MAGRGGVSKDP   KN+WDGYIPP S+ ++SI + TA + WEQAREPLHWDID  KT          N+LLAKAG SIG IGLVPCAIGG+HLREWVKG 
Subjt:  MAGRGGVSKDPIIGKNIWDGYIPPASRSHESILQLTAHLKWEQAREPLHWDIDYNKT----------NKLLAKAGKSIGVIGLVPCAIGGTHLREWVKG-

Query:  ---------------------------GESDASVEEESKYYEGNLTKFFTDLRKDSYHPTLPIIL----------------------AEEAVTHKLPNVR
                                   GESDA+VEEE+K YE  L+KFFTDLR D  HP LPIIL                      A+EAVT KLPN+R
Subjt:  ---------------------------GESDASVEEESKYYEGNLTKFFTDLRKDSYHPTLPIIL----------------------AEEAVTHKLPNVR

Query:  MVDGMKAVGNIDKGLNEDRGHLNVKSEVKLGKMLAHAFYSNFGH
        MVDG  AVGN D+GLNEDRGHLNVKSEV LGKM AH++YSNF H
Subjt:  MVDGMKAVGNIDKGLNEDRGHLNVKSEVKLGKMLAHAFYSNFGH

XP_022993914.1 probable carbohydrate esterase At4g34215 [Cucurbita maxima]; e value: 5.2e-67; % identity: 58.2
Query:  MAGRGGVSKDPIIGKNIWDGYIPPASRSHESILQLTAHLKWEQAREPLHWDIDYNKT----------NKLLAKAGKSIGVIGLVPCAIGGTHLREWVKG-
        MAGRGGVSKDP   KN+WDGYIPP S+ ++SI + TA + WEQAREPLHWDID  KT          N+LLAKAG SIG IGLVPCAIGG+HLREWVKG 
Subjt:  MAGRGGVSKDPIIGKNIWDGYIPPASRSHESILQLTAHLKWEQAREPLHWDIDYNKT----------NKLLAKAGKSIGVIGLVPCAIGGTHLREWVKG-

Query:  ---------------------------GESDASVEEESKYYEGNLTKFFTDLRKDSYHPTLPIIL----------------------AEEAVTHKLPNVR
                                   GESDA+VEEE+K YE  L+KFFTDLR D  HP LPIIL                      A+EAVT KLPNVR
Subjt:  ---------------------------GESDASVEEESKYYEGNLTKFFTDLRKDSYHPTLPIIL----------------------AEEAVTHKLPNVR

Query:  MVDGMKAVGNIDKGLNEDRGHLNVKSEVKLGKMLAHAFYSNFGH
        MVDG  AVGN D+GLNEDRGHLNVKSEV LGKM AH++YSNF H
Subjt:  MVDGMKAVGNIDKGLNEDRGHLNVKSEVKLGKMLAHAFYSNFGH

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K9J9 SASA domain-containing protein; e value: 4.1e-62; % identity: 54.84
Query:  MAGRGGVSKDPIIGKNIWDGYIPPASRSHESILQLTAHLKWEQAREPLHWDIDYNKT----------NKLLAKAGKSIGVIGLVPCAIGGTHLREWVKG-
        MAGRGGVS DP   K +WDGYIP    S++SI +L A + WEQA EPLHWDID  KT          N+LLA  GK IG IGLVPCAIGG+HL+EWVKG 
Subjt:  MAGRGGVSKDPIIGKNIWDGYIPPASRSHESILQLTAHLKWEQAREPLHWDIDYNKT----------NKLLAKAGKSIGVIGLVPCAIGGTHLREWVKG-

Query:  ---------------------------GESDASVEEESKYYEGNLTKFFTDLRKDSYHPTLPIIL----------------------AEEAVTHKLPNVR
                                   GESDA+VEEE+  YE  LTKFF DLR D+ HP LPIIL                      A EAVTH+LPNV 
Subjt:  ---------------------------GESDASVEEESKYYEGNLTKFFTDLRKDSYHPTLPIIL----------------------AEEAVTHKLPNVR

Query:  MVDGMKAVGNIDKGLNEDRGHLNVKSEVKLGKMLAHAFYSNFGHNLTS
        MVDG  AVGN D GLNED+GHLNVKSEVKLGKM AH+FYSNF HN  S
Subjt:  MVDGMKAVGNIDKGLNEDRGHLNVKSEVKLGKMLAHAFYSNFGHNLTS

A0A6J1CJZ1 probable carbohydrate esterase At4g34215; e value: 2.2e-71; % identity: 61.13
Query:  MAGRGGVSKDPIIGKNIWDGYIPPASRSHESILQLTAHLKWEQAREPLHWDIDYNKT----------NKLLAKAGKSIGVIGLVPCAIGGTHLREWVKG-
        MAGRGGVSKDPI  KNIWDGYIPP S+S+ESIL+LTA L+WEQA EPLHWDIDY+KT          N+L  + GKSIGVIGLVPCAIGGTHLREW+KG 
Subjt:  MAGRGGVSKDPIIGKNIWDGYIPPASRSHESILQLTAHLKWEQAREPLHWDIDYNKT----------NKLLAKAGKSIGVIGLVPCAIGGTHLREWVKG-

Query:  ---------------------------GESDASVEEESKYYEGNLTKFFTDLRKDSYHPTLPIIL----------------------AEEAVTHKLPNVR
                                   GESDASVEEESK YE  LTKFFTDLR DS +  LPIIL                      A+E VT KL NVR
Subjt:  ---------------------------GESDASVEEESKYYEGNLTKFFTDLRKDSYHPTLPIIL----------------------AEEAVTHKLPNVR

Query:  MVDGMKAVGNIDKGLNEDRGHLNVKSEVKLGKMLAHAFYSNFGHNLT
        MVDG +AVGN ++GLNED+GHLNVKSEVKLGKMLAHAFYSNF H LT
Subjt:  MVDGMKAVGNIDKGLNEDRGHLNVKSEVKLGKMLAHAFYSNFGHNLT

A0A6J1CKF9 probable carbohydrate esterase At4g34215; e value: 1.6e-61; % identity: 52.82
Query:  MAGRGGVSKDPIIGKNIWDGYIPPASRSHESILQLTAHLKWEQAREPLHWDIDYNKTN----------KLLAKAGKSIGVIGLVPCAIGGTHLREWVKG-
        MAGRGGV KDP   K +WDG +PP  + ++SIL+ +A+  WE+A EPLHWDID NKTN          ++LAKAG   GVIGLVPCAIGGTHLREWVKG 
Subjt:  MAGRGGVSKDPIIGKNIWDGYIPPASRSHESILQLTAHLKWEQAREPLHWDIDYNKTN----------KLLAKAGKSIGVIGLVPCAIGGTHLREWVKG-

Query:  ---------------------------GESDASVEEESKYYEGNLTKFFTDLRKDSYHPTLPIIL----------------------AEEAVTHKLPNVR
                                   GESDA+VEEESK+YE NLTKF+TDLR D+ HP LPIIL                      A+E +T  L NVR
Subjt:  ---------------------------GESDASVEEESKYYEGNLTKFFTDLRKDSYHPTLPIIL----------------------AEEAVTHKLPNVR

Query:  MVDGMKAVGNIDKGLNEDRGHLNVKSEVKLGKMLAHAFYSNFGHNLTS
        +VDG +AVGN D G+N+D GHL+ KSEVKLGKMLA +FYSNFG+ LT+
Subjt:  MVDGMKAVGNIDKGLNEDRGHLNVKSEVKLGKMLAHAFYSNFGHNLTS

A0A6J1FFF9 probable carbohydrate esterase At4g34215; e value: 2.5e-67; % identity: 57.79
Query:  MAGRGGVSKDPIIGKNIWDGYIPPASRSHESILQLTAHLKWEQAREPLHWDIDYNKT----------NKLLAKAGKSIGVIGLVPCAIGGTHLREWVKG-
        MAGRGGVSKDP   KN+WDGYIPP S+ ++SI + TA + WEQAREPLHWDID  KT          N+LLAKAG SIG IGLVPCAIGG+HLREWVKG 
Subjt:  MAGRGGVSKDPIIGKNIWDGYIPPASRSHESILQLTAHLKWEQAREPLHWDIDYNKT----------NKLLAKAGKSIGVIGLVPCAIGGTHLREWVKG-

Query:  ---------------------------GESDASVEEESKYYEGNLTKFFTDLRKDSYHPTLPIIL----------------------AEEAVTHKLPNVR
                                   GESDA+VEEE+K YE  L+KFFTDLR D  HP LPIIL                      A+EAVT KLPN+R
Subjt:  ---------------------------GESDASVEEESKYYEGNLTKFFTDLRKDSYHPTLPIIL----------------------AEEAVTHKLPNVR

Query:  MVDGMKAVGNIDKGLNEDRGHLNVKSEVKLGKMLAHAFYSNFGH
        MVDG  AVGN D+GLNEDRGHLNVKSEV LGKM AH++YSNF H
Subjt:  MVDGMKAVGNIDKGLNEDRGHLNVKSEVKLGKMLAHAFYSNFGH

A0A6J1K1G7 probable carbohydrate esterase At4g34215; e value: 2.5e-67; % identity: 58.2
Query:  MAGRGGVSKDPIIGKNIWDGYIPPASRSHESILQLTAHLKWEQAREPLHWDIDYNKT----------NKLLAKAGKSIGVIGLVPCAIGGTHLREWVKG-
        MAGRGGVSKDP   KN+WDGYIPP S+ ++SI + TA + WEQAREPLHWDID  KT          N+LLAKAG SIG IGLVPCAIGG+HLREWVKG 
Subjt:  MAGRGGVSKDPIIGKNIWDGYIPPASRSHESILQLTAHLKWEQAREPLHWDIDYNKT----------NKLLAKAGKSIGVIGLVPCAIGGTHLREWVKG-

Query:  ---------------------------GESDASVEEESKYYEGNLTKFFTDLRKDSYHPTLPIIL----------------------AEEAVTHKLPNVR
                                   GESDA+VEEE+K YE  L+KFFTDLR D  HP LPIIL                      A+EAVT KLPNVR
Subjt:  ---------------------------GESDASVEEESKYYEGNLTKFFTDLRKDSYHPTLPIIL----------------------AEEAVTHKLPNVR

Query:  MVDGMKAVGNIDKGLNEDRGHLNVKSEVKLGKMLAHAFYSNFGH
        MVDG  AVGN D+GLNEDRGHLNVKSEV LGKM AH++YSNF H
Subjt:  MVDGMKAVGNIDKGLNEDRGHLNVKSEVKLGKMLAHAFYSNFGH

SwissProt top hits (e value, % identity, alignment)
Q8L9J9 Probable carbohydrate esterase At4g34215; e value: 1.1e-24; % identity: 34.47
Query:  MAGRGGVSKDPIIGKNIWDGYIPPASRSHESILQLTAHLKWEQAREPLHWDIDYNK----------TNKLLAKAGKSIGVIGLVPCAIGGTHLREWVKG-
        MAGRGGV KD    + +WD  +PP    + SIL+L+A L+WE+A EPLH DID  K           N +  +      VIGLVPCA GGT ++EW +G 
Subjt:  MAGRGGVSKDPIIGKNIWDGYIPPASRSHESILQLTAHLKWEQAREPLHWDIDYNK----------TNKLLAKAGKSIGVIGLVPCAIGGTHLREWVKG-

Query:  ---------------------------GESDASVEEESKYYEGNLTKFFTDLRKDSYHPTLPII-------------LAEEAVTHKLPNVRMVDGMKAVG
                                   GESD     +++ Y  N+ +   +LR D   P+LPII             + E  +  KL NV  VD      
Subjt:  ---------------------------GESDASVEEESKYYEGNLTKFFTDLRKDSYHPTLPII-------------LAEEAVTHKLPNVRMVDGMKAVG

Query:  NIDKG--LNEDRGHLNVKSEVKLGKMLAHAFYSNF
           KG  L  D  HL  +++V+LG  LA A+ SNF
Subjt:  NIDKG--LNEDRGHLNVKSEVKLGKMLAHAFYSNF

Arabidopsis top hits (e value, % identity, alignment)
AT3G53010.1 Domain of unknown function (DUF303); e value: 1.7e-31; % identity: 36.86
Query:  MAGRGGVSKDPIIGKNIWDGYIPPASRSHESILQLTAHLKWEQAREPLHWDIDYNKTN------KLLAKAGKSIGVIGLVPCAIGGTHLREWVKG-----
        MAGRGGV  D      +WDG IPP  RS+ SIL+LT+ L+W++A+EPLH DID NKTN          +     G +GLVPC+IGGT L +W KG     
Subjt:  MAGRGGVSKDPIIGKNIWDGYIPPASRSHESILQLTAHLKWEQAREPLHWDIDYNKTN------KLLAKAGKSIGVIGLVPCAIGGTHLREWVKG-----

Query:  -------------------------GESDASVEEESKYYEGNLTKFFTDLRKDSYHPTLPII---LAEEA-----VTHKLPNVRMVDGMKAVGNIDKGLN
                                 GESD     ++  Y+  L KFF+DLR D  HP LPII   LA  A        K      ++ +  V      L 
Subjt:  -------------------------GESDASVEEESKYYEGNLTKFFTDLRKDSYHPTLPII---LAEEA-----VTHKLPNVRMVDGMKAVGNIDKGLN

Query:  EDRGHLNVKSEVKLGKMLAHAFYSNFGHNLTSLNLH
         D  HL   S+V+LG M+A +F +       SL +H
Subjt:  EDRGHLNVKSEVKLGKMLAHAFYSNFGHNLTSLNLH

AT4G34215.1 Domain of unknown function (DUF303); e value: 8.0e-26; % identity: 34.47
Query:  MAGRGGVSKDPIIGKNIWDGYIPPASRSHESILQLTAHLKWEQAREPLHWDIDYNK----------TNKLLAKAGKSIGVIGLVPCAIGGTHLREWVKG-
        MAGRGGV KD    + +WD  +PP    + SIL+L+A L+WE+A EPLH DID  K           N +  +      VIGLVPCA GGT ++EW +G 
Subjt:  MAGRGGVSKDPIIGKNIWDGYIPPASRSHESILQLTAHLKWEQAREPLHWDIDYNK----------TNKLLAKAGKSIGVIGLVPCAIGGTHLREWVKG-

Query:  ---------------------------GESDASVEEESKYYEGNLTKFFTDLRKDSYHPTLPII-------------LAEEAVTHKLPNVRMVDGMKAVG
                                   GESD     +++ Y  N+ +   +LR D   P+LPII             + E  +  KL NV  VD      
Subjt:  ---------------------------GESDASVEEESKYYEGNLTKFFTDLRKDSYHPTLPII-------------LAEEAVTHKLPNVRMVDGMKAVG

Query:  NIDKG--LNEDRGHLNVKSEVKLGKMLAHAFYSNF
           KG  L  D  HL  +++V+LG  LA A+ SNF
Subjt:  NIDKG--LNEDRGHLNVKSEVKLGKMLAHAFYSNF

AT4G34215.2 Domain of unknown function (DUF303); e value: 8.0e-26; % identity: 34.47
Query:  MAGRGGVSKDPIIGKNIWDGYIPPASRSHESILQLTAHLKWEQAREPLHWDIDYNK----------TNKLLAKAGKSIGVIGLVPCAIGGTHLREWVKG-
        MAGRGGV KD    + +WD  +PP    + SIL+L+A L+WE+A EPLH DID  K           N +  +      VIGLVPCA GGT ++EW +G 
Subjt:  MAGRGGVSKDPIIGKNIWDGYIPPASRSHESILQLTAHLKWEQAREPLHWDIDYNK----------TNKLLAKAGKSIGVIGLVPCAIGGTHLREWVKG-

Query:  ---------------------------GESDASVEEESKYYEGNLTKFFTDLRKDSYHPTLPII-------------LAEEAVTHKLPNVRMVDGMKAVG
                                   GESD     +++ Y  N+ +   +LR D   P+LPII             + E  +  KL NV  VD      
Subjt:  ---------------------------GESDASVEEESKYYEGNLTKFFTDLRKDSYHPTLPII-------------LAEEAVTHKLPNVRMVDGMKAVG

Query:  NIDKG--LNEDRGHLNVKSEVKLGKMLAHAFYSNF
           KG  L  D  HL  +++V+LG  LA A+ SNF
Subjt:  NIDKG--LNEDRGHLNVKSEVKLGKMLAHAFYSNF


Sequences
CDS sequence
ATGGCTGGCCGAGGTGGTGTCTCGAAAGATCCAATCATTGGCAAAAATATATGGGATGGATATATCCCGCCAGCATCTCGATCCCACGAATCAATCCTTCAATTGACTGC
TCATTTGAAATGGGAACAAGCTCGTGAGCCACTTCATTGGGACATTGATTATAACAAGACCAATAAGCTTTTGGCCAAAGCTGGCAAGAGCATTGGTGTCATCGGGCTCG
TTCCATGCGCCATTGGAGGAACTCACCTGAGAGAATGGGTTAAAGGGGGAGAGTCAGATGCTTCGGTGGAAGAAGAATCTAAGTACTACGAAGGAAACCTCACCAAGTTC
TTCACGGACTTGCGCAAAGACTCGTACCACCCAACACTACCCATTATCCTGGCTGAAGAGGCAGTCACACACAAGCTACCAAATGTAAGAATGGTGGATGGAATGAAAGC
AGTTGGCAACATTGATAAAGGCCTTAATGAAGATAGAGGCCATCTTAATGTCAAATCTGAAGTGAAATTGGGCAAAATGTTGGCTCATGCCTTCTACTCTAACTTTGGCC
ACAATCTCACCAGCTTGAATTTGCATGTGAAAATTACTCCCTAA
mRNA sequence
ATGGCTGGCCGAGGTGGTGTCTCGAAAGATCCAATCATTGGCAAAAATATATGGGATGGATATATCCCGCCAGCATCTCGATCCCACGAATCAATCCTTCAATTGACTGC
TCATTTGAAATGGGAACAAGCTCGTGAGCCACTTCATTGGGACATTGATTATAACAAGACCAATAAGCTTTTGGCCAAAGCTGGCAAGAGCATTGGTGTCATCGGGCTCG
TTCCATGCGCCATTGGAGGAACTCACCTGAGAGAATGGGTTAAAGGGGGAGAGTCAGATGCTTCGGTGGAAGAAGAATCTAAGTACTACGAAGGAAACCTCACCAAGTTC
TTCACGGACTTGCGCAAAGACTCGTACCACCCAACACTACCCATTATCCTGGCTGAAGAGGCAGTCACACACAAGCTACCAAATGTAAGAATGGTGGATGGAATGAAAGC
AGTTGGCAACATTGATAAAGGCCTTAATGAAGATAGAGGCCATCTTAATGTCAAATCTGAAGTGAAATTGGGCAAAATGTTGGCTCATGCCTTCTACTCTAACTTTGGCC
ACAATCTCACCAGCTTGAATTTGCATGTGAAAATTACTCCCTAA
Protein sequence
MAGRGGVSKDPIIGKNIWDGYIPPASRSHESILQLTAHLKWEQAREPLHWDIDYNKTNKLLAKAGKSIGVIGLVPCAIGGTHLREWVKGGESDASVEEESKYYEGNLTKF
FTDLRKDSYHPTLPIILAEEAVTHKLPNVRMVDGMKAVGNIDKGLNEDRGHLNVKSEVKLGKMLAHAFYSNFGHNLTSLNLHVKITP