; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0001931 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0001931
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionNatterin-3 like
Genome locationchr4:37213347..37215026
RNA-Seq ExpressionLag0001931
SyntenyLag0001931
Gene Ontology termsGO:0005576 - extracellular region (cellular component)
InterPro domainsIPR005830 - Aerolysin
IPR008998 - Agglutinin domain
IPR036242 - Agglutinin domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039924.1 uncharacterized protein E6C27_scaffold122G002040 [Cucumis melo var. makuwa]3.3e-22064.72Show/hide
Query:  MAGLIRGFGKV------------STGIVGGLAKGTGKVVESIGEATEDLGDALFGNHQNHPKPVKFEDD------------------------------E
        + G ++G GK+              G +G + +GTGK +E++GEATED G+ +F   +  PK  +  DD                              +
Subjt:  MAGLIRGFGKV------------STGIVGGLAKGTGKVVESIGEATEDLGDALFGNHQNHPKPVKFEDD------------------------------E

Query:  DDEIDEAEEKLMKSEEGAEIDQSDHDDNEDGEEGEANVIPRHFSLKSKHNNKYLRYINEDETSAGLLRYAGKNVVGPYSKFAVQASKTDKGFVHIRCCYN
        DD+IDEAE+KLMKS    +ID S++++ E+ EE    VIP++ SLKS  N KYLRYI+E E + GLLRY+GKN+VGPYSKF+V ASKT  GF HIRCCYN
Subjt:  DDEIDEAEEKLMKSEEGAEIDQSDHDDNEDGEEGEANVIPRHFSLKSKHNNKYLRYINEDETSAGLLRYAGKNVVGPYSKFAVQASKTDKGFVHIRCCYN

Query:  NKYWVRSSEHSNYIAAVADENEDDKSKWSCTLFEPIFVPEQNG-FYFRHVQLNTFVCLAQGDPAPYGDCLAARVEDITTIDENLILSIVVDWDSIFILPK
        NK+WVR SE SNYIAA+A+E EDD SKWSCTLFEPIFVPE+ G +Y RHVQLNTF+C+A+GDP+PY DCL ARVEDIT IDENL+LS V DWDSIFILPK
Subjt:  NKYWVRSSEHSNYIAAVADENEDDKSKWSCTLFEPIFVPEQNG-FYFRHVQLNTFVCLAQGDPAPYGDCLAARVEDITTIDENLILSIVVDWDSIFILPK

Query:  YVAFKSNNDRYLEPSSQYLKFSASDIQDPVVVHEIIPMGDGYVRIKHVESGKYWIRDPNWIHCKSIQVDRDDPNTLFWPVKVDDNFVALRNKGNGRFCKR
        YVAFKSNND+YLEPS +YLKFSAS ++DP VV EII M DGYVRIKHV SGKYWIRDP+WI C SI + RD+PNTLFWPVKVD+N VA RNKGN RFCKR
Subjt:  YVAFKSNNDRYLEPSSQYLKFSASDIQDPVVVHEIIPMGDGYVRIKHVESGKYWIRDPNWIHCKSIQVDRDDPNTLFWPVKVDDNFVALRNKGNGRFCKR

Query:  LSTEGKSNCLNAAVSTITDTARFEVTEVVVARSIDNVEYRLNDARIYGKKIVTVSKGTAINESQVAEKVTLKFRYEKKVERTWSSSVSTTLGLTAKFTSK
        LST+GK+NCLNAAV TIT+TAR EVTE+VVARS+++V+YR+NDAR+YGKKI+TVSKG AIN ++V++K++LKFRYEKKVERTWSSSVS+T G+  KF +K
Subjt:  LSTEGKSNCLNAAVSTITDTARFEVTEVVVARSIDNVEYRLNDARIYGKKIVTVSKGTAINESQVAEKVTLKFRYEKKVERTWSSSVSTTLGLTAKFTSK

Query:  VPTVGKLKFEFSFEATRGSTWEETEKEKSFEEITETITIPAKSKVKFSGMITQARCDVPFSYTRRDTLKDGKQVTNHFEDGIFTGVTTFDYKFETEKL
        +PTVG +KFE S E +  +T EETEKEKSF E  ETITIPA SKVKFS M+TQA CDVPFSYTRRDTLKDG+QVT+  EDG+FTGVTT+DYKFETEK+
Subjt:  VPTVGKLKFEFSFEATRGSTWEETEKEKSFEEITETITIPAKSKVKFSGMITQARCDVPFSYTRRDTLKDGKQVTNHFEDGIFTGVTTFDYKFETEKL

KAE8646727.1 hypothetical protein Csa_005365 [Cucumis sativus]3.7e-21966.9Show/hide
Query:  GIVGGLAKGTGKVVESIGEATEDLGDALFGNHQNHPK--PVKFE--------------------------------DDEDDEIDEAEEKLMKSEEGAEID
        G VG + +GTGK +E++GEATED G+ +F   +N P+  P + E                                D++DD+IDEAE+KLMKS    +ID
Subjt:  GIVGGLAKGTGKVVESIGEATEDLGDALFGNHQNHPK--PVKFE--------------------------------DDEDDEIDEAEEKLMKSEEGAEID

Query:  QSDHDDNEDGEEGEANVIPRHFSLKSKHNNKYLRYINEDETSAGLLRYAGKNVVGPYSKFAVQASKTDKGFVHIRCCYNNKYWVRSSEHSNYIAAVADEN
         S++++ E+ EE    VIP++ SLKS  N KYLRYI+E E + GLLR++GKN+VGPYSKF+V ASKT  GF HIRCCYNNK+WVR SE SNYIAAVA+E 
Subjt:  QSDHDDNEDGEEGEANVIPRHFSLKSKHNNKYLRYINEDETSAGLLRYAGKNVVGPYSKFAVQASKTDKGFVHIRCCYNNKYWVRSSEHSNYIAAVADEN

Query:  EDDKSKWSCTLFEPIFVPEQNG-FYFRHVQLNTFVCLAQGDPAPYGDCLAARVEDITTIDENLILSIVVDWDSIFILPKYVAFKSNNDRYLEPSSQYLKF
        EDD SKWSCTLFEPIFVPE+ G +Y RHVQLNTF+C+A+GDP+PY DCL ARVEDITTIDENL+L  V DWDSIFILPKYVAFKSNNDRYLEPS +YLKF
Subjt:  EDDKSKWSCTLFEPIFVPEQNG-FYFRHVQLNTFVCLAQGDPAPYGDCLAARVEDITTIDENLILSIVVDWDSIFILPKYVAFKSNNDRYLEPSSQYLKF

Query:  SASDIQDPVVVHEIIPMGDGYVRIKHVESGKYWIRDPNWIHCKSIQVDRDDPNTLFWPVKVDDNFVALRNKGNGRFCKRLSTEGKSNCLNAAVSTITDTA
        SAS ++DP VV EII M DGYVRIKHV SGKYWIRDP+WI C SI ++RD+PNTLFWPVKVD+N VA RNKGN RFCKRL+T+GK+NCLNAAV TIT+TA
Subjt:  SASDIQDPVVVHEIIPMGDGYVRIKHVESGKYWIRDPNWIHCKSIQVDRDDPNTLFWPVKVDDNFVALRNKGNGRFCKRLSTEGKSNCLNAAVSTITDTA

Query:  RFEVTEVVVARSIDNVEYRLNDARIYGKKIVTVSKGTAINESQVAEKVTLKFRYEKKVERTWSSSVSTTLGLTAKFTSKVPTVGKLKFEFSFEATRGSTW
        R E TE+VVARS+++VEYR+NDAR+YGKKI+TVSKG AIN ++V +K++LKFRYEKKVERTWSSSVS+T G+  KF +K+PTVG LKFE S E +  +T 
Subjt:  RFEVTEVVVARSIDNVEYRLNDARIYGKKIVTVSKGTAINESQVAEKVTLKFRYEKKVERTWSSSVSTTLGLTAKFTSKVPTVGKLKFEFSFEATRGSTW

Query:  EETEKEKSFEEITETITIPAKSKVKFSGMITQARCDVPFSYTRRDTLKDGKQVTNHFEDGIFTGVTTFDYKFETEKL
        EETEKEKSF E  ETITIPA SKVKFS M+TQA CDVPFSYTRRDTLKDG+QVT+  EDG+FTGVTT+DYKFETEK+
Subjt:  EETEKEKSFEEITETITIPAKSKVKFSGMITQARCDVPFSYTRRDTLKDGKQVTNHFEDGIFTGVTTFDYKFETEKL

XP_004140683.2 uncharacterized protein LOC101212952 [Cucumis sativus]1.3e-21964.78Show/hide
Query:  MAGLIRGFGKV------------STGIVGGLAKGTGKVVESIGEATEDLGDALFGNHQNHPK--PVKFE-------------------------------
        + G ++G GK+              G VG + +GTGK +E++GEATED G+ +F   +N P+  P + E                               
Subjt:  MAGLIRGFGKV------------STGIVGGLAKGTGKVVESIGEATEDLGDALFGNHQNHPK--PVKFE-------------------------------

Query:  -DDEDDEIDEAEEKLMKSEEGAEIDQSDHDDNEDGEEGEANVIPRHFSLKSKHNNKYLRYINEDETSAGLLRYAGKNVVGPYSKFAVQASKTDKGFVHIR
         D++DD+IDEAE+KLMKS    +ID S++++ E+ EE    VIP++ SLKS  N KYLRYI+E E + GLLR++GKN+VGPYSKF+V ASKT  GF HIR
Subjt:  -DDEDDEIDEAEEKLMKSEEGAEIDQSDHDDNEDGEEGEANVIPRHFSLKSKHNNKYLRYINEDETSAGLLRYAGKNVVGPYSKFAVQASKTDKGFVHIR

Query:  CCYNNKYWVRSSEHSNYIAAVADENEDDKSKWSCTLFEPIFVPEQNG-FYFRHVQLNTFVCLAQGDPAPYGDCLAARVEDITTIDENLILSIVVDWDSIF
        CCYNNK+WVR SE SNYIAAVA+E EDD SKWSCTLFEPIFVPE+ G +Y RHVQLNTF+C+A+GDP+PY DCL ARVEDITTIDENL+L  V DWDSIF
Subjt:  CCYNNKYWVRSSEHSNYIAAVADENEDDKSKWSCTLFEPIFVPEQNG-FYFRHVQLNTFVCLAQGDPAPYGDCLAARVEDITTIDENLILSIVVDWDSIF

Query:  ILPKYVAFKSNNDRYLEPSSQYLKFSASDIQDPVVVHEIIPMGDGYVRIKHVESGKYWIRDPNWIHCKSIQVDRDDPNTLFWPVKVDDNFVALRNKGNGR
        ILPKYVAFKSNNDRYLEPS +YLKFSAS ++DP VV EII M DGYVRIKHV SGKYWIRDP+WI C SI ++RD+PNTLFWPVKVD+N VA RNKGN R
Subjt:  ILPKYVAFKSNNDRYLEPSSQYLKFSASDIQDPVVVHEIIPMGDGYVRIKHVESGKYWIRDPNWIHCKSIQVDRDDPNTLFWPVKVDDNFVALRNKGNGR

Query:  FCKRLSTEGKSNCLNAAVSTITDTARFEVTEVVVARSIDNVEYRLNDARIYGKKIVTVSKGTAINESQVAEKVTLKFRYEKKVERTWSSSVSTTLGLTAK
        FCKRL+T+GK+NCLNAAV TIT+TAR E TE+VVARS+++VEYR+NDAR+YGKKI+TVSKG AIN ++V +K++LKFRYEKKVERTWSSSVS+T G+  K
Subjt:  FCKRLSTEGKSNCLNAAVSTITDTARFEVTEVVVARSIDNVEYRLNDARIYGKKIVTVSKGTAINESQVAEKVTLKFRYEKKVERTWSSSVSTTLGLTAK

Query:  FTSKVPTVGKLKFEFSFEATRGSTWEETEKEKSFEEITETITIPAKSKVKFSGMITQARCDVPFSYTRRDTLKDGKQVTNHFEDGIFTGVTTFDYKFETE
        F +K+PTVG LKFE S E +  +T EETEKEKSF E  ETITIPA SKVKFS M+TQA CDVPFSYTRRDTLKDG+QVT+  EDG+FTGVTT+DYKFETE
Subjt:  FTSKVPTVGKLKFEFSFEATRGSTWEETEKEKSFEEITETITIPAKSKVKFSGMITQARCDVPFSYTRRDTLKDGKQVTNHFEDGIFTGVTTFDYKFETE

Query:  KL
        K+
Subjt:  KL

XP_008460195.1 PREDICTED: uncharacterized protein LOC103499080 [Cucumis melo]6.7e-22164.88Show/hide
Query:  MAGLIRGFGKV------------STGIVGGLAKGTGKVVESIGEATEDLGDALFGNHQNHPKPVKFEDD------------------------------E
        + G ++G GK+              G +G + +GTGK +E++GEATED G+ +F   +  PK  +  DD                              +
Subjt:  MAGLIRGFGKV------------STGIVGGLAKGTGKVVESIGEATEDLGDALFGNHQNHPKPVKFEDD------------------------------E

Query:  DDEIDEAEEKLMKSEEGAEIDQSDHDDNEDGEEGEANVIPRHFSLKSKHNNKYLRYINEDETSAGLLRYAGKNVVGPYSKFAVQASKTDKGFVHIRCCYN
        DD+IDEAE+KLMKS    +ID S++++ E+ EE    VIP++ SLKS  N KYLRYI+E E + GLLRY+GKN+VGPYSKF+V ASKT  GF HIRCCYN
Subjt:  DDEIDEAEEKLMKSEEGAEIDQSDHDDNEDGEEGEANVIPRHFSLKSKHNNKYLRYINEDETSAGLLRYAGKNVVGPYSKFAVQASKTDKGFVHIRCCYN

Query:  NKYWVRSSEHSNYIAAVADENEDDKSKWSCTLFEPIFVPEQNGFYF-RHVQLNTFVCLAQGDPAPYGDCLAARVEDITTIDENLILSIVVDWDSIFILPK
        NK+WVR SE SNYIAA+A+E EDD SKWSCTLFEPIFVPE+ GFY+ RHVQLNTF+C+A+GDP+PY DCL ARVEDIT IDENL+LS V DWDSIFILPK
Subjt:  NKYWVRSSEHSNYIAAVADENEDDKSKWSCTLFEPIFVPEQNGFYF-RHVQLNTFVCLAQGDPAPYGDCLAARVEDITTIDENLILSIVVDWDSIFILPK

Query:  YVAFKSNNDRYLEPSSQYLKFSASDIQDPVVVHEIIPMGDGYVRIKHVESGKYWIRDPNWIHCKSIQVDRDDPNTLFWPVKVDDNFVALRNKGNGRFCKR
        YVAFKSNND+YLEPS +YLKFSAS ++DP VV EII M DGYVRIKHV SGKYWIRDP+WI C SI + RD+PNTLFWPVKVD+N VA RNKGN RFCKR
Subjt:  YVAFKSNNDRYLEPSSQYLKFSASDIQDPVVVHEIIPMGDGYVRIKHVESGKYWIRDPNWIHCKSIQVDRDDPNTLFWPVKVDDNFVALRNKGNGRFCKR

Query:  LSTEGKSNCLNAAVSTITDTARFEVTEVVVARSIDNVEYRLNDARIYGKKIVTVSKGTAINESQVAEKVTLKFRYEKKVERTWSSSVSTTLGLTAKFTSK
        LST+GK+NCLNAAV TIT+TAR EVTE+VVARS+++V+YR+NDAR+YGKKI+TVSKG AIN ++V++K++LKFRYEKKVERTWSSSVS+T G+  KF +K
Subjt:  LSTEGKSNCLNAAVSTITDTARFEVTEVVVARSIDNVEYRLNDARIYGKKIVTVSKGTAINESQVAEKVTLKFRYEKKVERTWSSSVSTTLGLTAKFTSK

Query:  VPTVGKLKFEFSFEATRGSTWEETEKEKSFEEITETITIPAKSKVKFSGMITQARCDVPFSYTRRDTLKDGKQVTNHFEDGIFTGVTTFDYKFETEKL
        +PTVG +KFE S E +  +T EETEKEKSF E  ETITIPA SKVKFS M+TQA CDVPFSYTRRDTLKDG+QVT+  EDG+FTGVTT+DYKFETEK+
Subjt:  VPTVGKLKFEFSFEATRGSTWEETEKEKSFEEITETITIPAKSKVKFSGMITQARCDVPFSYTRRDTLKDGKQVTNHFEDGIFTGVTTFDYKFETEKL

XP_022155429.1 uncharacterized protein LOC111022577 [Momordica charantia]2.9e-24875.09Show/hide
Query:  MAGLIRGFGKVSTGIVGGLAKGTGKVVESIGEATEDLGDALFGNHQNHPK--------------------PVKFEDDEDDEIDEAEEKLMKSEEGAEI-D
        MAGL  GFGK  TG+VGGL + TGKVVE++GEA EDLG+AL G  QN+PK                    P++FE+D DD IDEAE+KLM S+EGA + +
Subjt:  MAGLIRGFGKVSTGIVGGLAKGTGKVVESIGEATEDLGDALFGNHQNHPK--------------------PVKFEDDEDDEIDEAEEKLMKSEEGAEI-D

Query:  QSDHDDNEDGEEGEANVIPRHFSLKSKHNNKYLRYINEDETSAGLLRYAGKNVVGPYSKFAVQASKTDKGFVHIRCCYNNKYWVRSSEHSNYIAAVADEN
        +SD +D+ED +  EAN+IP+HFSLKSKHNN+YLRY   DE S GLLRY+GKNVVGPYSKFA+  SKT+KGFVHIRCCYNNK+WVRSSEHSNYIAAVADE 
Subjt:  QSDHDDNEDGEEGEANVIPRHFSLKSKHNNKYLRYINEDETSAGLLRYAGKNVVGPYSKFAVQASKTDKGFVHIRCCYNNKYWVRSSEHSNYIAAVADEN

Query:  EDDKSKWSCTLFEPIFVPEQNGFYFRHVQLNTFVCLAQGDPAPYGDCLAARVEDITTIDENLILSIVVDWDSIFILPKYVAFKSNNDRYLEPSSQYLKFS
        E+DKSKWSCTLFEPIFVP+QNG+YFRHVQLN F+C A        DCLAA+VED+TTID+NL++S  +DWDSIFILPKYVAFKSNN  YLEPS +YLKFS
Subjt:  EDDKSKWSCTLFEPIFVPEQNGFYFRHVQLNTFVCLAQGDPAPYGDCLAARVEDITTIDENLILSIVVDWDSIFILPKYVAFKSNNDRYLEPSSQYLKFS

Query:  ASDIQDPVVVHEIIPMGDGYVRIKHVESGKYWIRDPNWIHCKSIQVDRDDPNTLFWPVKVDDNFVALRNKGNGRFCKRLSTEGKSNCLNAAVSTITDTAR
         S+I+DP VV EII M DGYVRIKHVES KYWIRDPNWIHC+SI   +D+PNTLFWPVKVD+NFVALRNK N RFCKRL+TEGK+NCLNAAV TITDTAR
Subjt:  ASDIQDPVVVHEIIPMGDGYVRIKHVESGKYWIRDPNWIHCKSIQVDRDDPNTLFWPVKVDDNFVALRNKGNGRFCKRLSTEGKSNCLNAAVSTITDTAR

Query:  FEVTEVVVARSIDNVEYRLNDARIYGKKIVTVSKGTAINESQVAEKVTLKFRYEKKVERTWSSSVSTTLGLTAKFTSKVPTVGKLKFEFSFEATRGSTWE
        FEVTEVVVARSID+VEYRLNDAR+YGKKI+TVSKGTAIN+++VAEKVTLKFRYEKKVER+WSSSVSTT+GL AKFT+K+PTVGKLKFE SFE T G TWE
Subjt:  FEVTEVVVARSIDNVEYRLNDARIYGKKIVTVSKGTAINESQVAEKVTLKFRYEKKVERTWSSSVSTTLGLTAKFTSKVPTVGKLKFEFSFEATRGSTWE

Query:  ETEKEKSFEEITETITIPAKSKVKFSGMITQARCDVPFSYTRRDTLKDGKQVTNHFEDGIFTGVTTFDYKFETEKLPL
        ETEKEKSFEE+TET+T+P  SKV+F G+ITQA CDVPFSY RRDTLKDG+QVTN FEDGIFTGVTTFDYKFETEKLPL
Subjt:  ETEKEKSFEEITETITIPAKSKVKFSGMITQARCDVPFSYTRRDTLKDGKQVTNHFEDGIFTGVTTFDYKFETEKLPL

TrEMBL top hitse value%identityAlignment
A0A0A0K983 Uncharacterized protein6.1e-22064.78Show/hide
Query:  MAGLIRGFGKV------------STGIVGGLAKGTGKVVESIGEATEDLGDALFGNHQNHPK--PVKFE-------------------------------
        + G ++G GK+              G VG + +GTGK +E++GEATED G+ +F   +N P+  P + E                               
Subjt:  MAGLIRGFGKV------------STGIVGGLAKGTGKVVESIGEATEDLGDALFGNHQNHPK--PVKFE-------------------------------

Query:  -DDEDDEIDEAEEKLMKSEEGAEIDQSDHDDNEDGEEGEANVIPRHFSLKSKHNNKYLRYINEDETSAGLLRYAGKNVVGPYSKFAVQASKTDKGFVHIR
         D++DD+IDEAE+KLMKS    +ID S++++ E+ EE    VIP++ SLKS  N KYLRYI+E E + GLLR++GKN+VGPYSKF+V ASKT  GF HIR
Subjt:  -DDEDDEIDEAEEKLMKSEEGAEIDQSDHDDNEDGEEGEANVIPRHFSLKSKHNNKYLRYINEDETSAGLLRYAGKNVVGPYSKFAVQASKTDKGFVHIR

Query:  CCYNNKYWVRSSEHSNYIAAVADENEDDKSKWSCTLFEPIFVPEQNG-FYFRHVQLNTFVCLAQGDPAPYGDCLAARVEDITTIDENLILSIVVDWDSIF
        CCYNNK+WVR SE SNYIAAVA+E EDD SKWSCTLFEPIFVPE+ G +Y RHVQLNTF+C+A+GDP+PY DCL ARVEDITTIDENL+L  V DWDSIF
Subjt:  CCYNNKYWVRSSEHSNYIAAVADENEDDKSKWSCTLFEPIFVPEQNG-FYFRHVQLNTFVCLAQGDPAPYGDCLAARVEDITTIDENLILSIVVDWDSIF

Query:  ILPKYVAFKSNNDRYLEPSSQYLKFSASDIQDPVVVHEIIPMGDGYVRIKHVESGKYWIRDPNWIHCKSIQVDRDDPNTLFWPVKVDDNFVALRNKGNGR
        ILPKYVAFKSNNDRYLEPS +YLKFSAS ++DP VV EII M DGYVRIKHV SGKYWIRDP+WI C SI ++RD+PNTLFWPVKVD+N VA RNKGN R
Subjt:  ILPKYVAFKSNNDRYLEPSSQYLKFSASDIQDPVVVHEIIPMGDGYVRIKHVESGKYWIRDPNWIHCKSIQVDRDDPNTLFWPVKVDDNFVALRNKGNGR

Query:  FCKRLSTEGKSNCLNAAVSTITDTARFEVTEVVVARSIDNVEYRLNDARIYGKKIVTVSKGTAINESQVAEKVTLKFRYEKKVERTWSSSVSTTLGLTAK
        FCKRL+T+GK+NCLNAAV TIT+TAR E TE+VVARS+++VEYR+NDAR+YGKKI+TVSKG AIN ++V +K++LKFRYEKKVERTWSSSVS+T G+  K
Subjt:  FCKRLSTEGKSNCLNAAVSTITDTARFEVTEVVVARSIDNVEYRLNDARIYGKKIVTVSKGTAINESQVAEKVTLKFRYEKKVERTWSSSVSTTLGLTAK

Query:  FTSKVPTVGKLKFEFSFEATRGSTWEETEKEKSFEEITETITIPAKSKVKFSGMITQARCDVPFSYTRRDTLKDGKQVTNHFEDGIFTGVTTFDYKFETE
        F +K+PTVG LKFE S E +  +T EETEKEKSF E  ETITIPA SKVKFS M+TQA CDVPFSYTRRDTLKDG+QVT+  EDG+FTGVTT+DYKFETE
Subjt:  FTSKVPTVGKLKFEFSFEATRGSTWEETEKEKSFEEITETITIPAKSKVKFSGMITQARCDVPFSYTRRDTLKDGKQVTNHFEDGIFTGVTTFDYKFETE

Query:  KL
        K+
Subjt:  KL

A0A1S3CBI1 uncharacterized protein LOC1034990803.2e-22164.88Show/hide
Query:  MAGLIRGFGKV------------STGIVGGLAKGTGKVVESIGEATEDLGDALFGNHQNHPKPVKFEDD------------------------------E
        + G ++G GK+              G +G + +GTGK +E++GEATED G+ +F   +  PK  +  DD                              +
Subjt:  MAGLIRGFGKV------------STGIVGGLAKGTGKVVESIGEATEDLGDALFGNHQNHPKPVKFEDD------------------------------E

Query:  DDEIDEAEEKLMKSEEGAEIDQSDHDDNEDGEEGEANVIPRHFSLKSKHNNKYLRYINEDETSAGLLRYAGKNVVGPYSKFAVQASKTDKGFVHIRCCYN
        DD+IDEAE+KLMKS    +ID S++++ E+ EE    VIP++ SLKS  N KYLRYI+E E + GLLRY+GKN+VGPYSKF+V ASKT  GF HIRCCYN
Subjt:  DDEIDEAEEKLMKSEEGAEIDQSDHDDNEDGEEGEANVIPRHFSLKSKHNNKYLRYINEDETSAGLLRYAGKNVVGPYSKFAVQASKTDKGFVHIRCCYN

Query:  NKYWVRSSEHSNYIAAVADENEDDKSKWSCTLFEPIFVPEQNGFYF-RHVQLNTFVCLAQGDPAPYGDCLAARVEDITTIDENLILSIVVDWDSIFILPK
        NK+WVR SE SNYIAA+A+E EDD SKWSCTLFEPIFVPE+ GFY+ RHVQLNTF+C+A+GDP+PY DCL ARVEDIT IDENL+LS V DWDSIFILPK
Subjt:  NKYWVRSSEHSNYIAAVADENEDDKSKWSCTLFEPIFVPEQNGFYF-RHVQLNTFVCLAQGDPAPYGDCLAARVEDITTIDENLILSIVVDWDSIFILPK

Query:  YVAFKSNNDRYLEPSSQYLKFSASDIQDPVVVHEIIPMGDGYVRIKHVESGKYWIRDPNWIHCKSIQVDRDDPNTLFWPVKVDDNFVALRNKGNGRFCKR
        YVAFKSNND+YLEPS +YLKFSAS ++DP VV EII M DGYVRIKHV SGKYWIRDP+WI C SI + RD+PNTLFWPVKVD+N VA RNKGN RFCKR
Subjt:  YVAFKSNNDRYLEPSSQYLKFSASDIQDPVVVHEIIPMGDGYVRIKHVESGKYWIRDPNWIHCKSIQVDRDDPNTLFWPVKVDDNFVALRNKGNGRFCKR

Query:  LSTEGKSNCLNAAVSTITDTARFEVTEVVVARSIDNVEYRLNDARIYGKKIVTVSKGTAINESQVAEKVTLKFRYEKKVERTWSSSVSTTLGLTAKFTSK
        LST+GK+NCLNAAV TIT+TAR EVTE+VVARS+++V+YR+NDAR+YGKKI+TVSKG AIN ++V++K++LKFRYEKKVERTWSSSVS+T G+  KF +K
Subjt:  LSTEGKSNCLNAAVSTITDTARFEVTEVVVARSIDNVEYRLNDARIYGKKIVTVSKGTAINESQVAEKVTLKFRYEKKVERTWSSSVSTTLGLTAKFTSK

Query:  VPTVGKLKFEFSFEATRGSTWEETEKEKSFEEITETITIPAKSKVKFSGMITQARCDVPFSYTRRDTLKDGKQVTNHFEDGIFTGVTTFDYKFETEKL
        +PTVG +KFE S E +  +T EETEKEKSF E  ETITIPA SKVKFS M+TQA CDVPFSYTRRDTLKDG+QVT+  EDG+FTGVTT+DYKFETEK+
Subjt:  VPTVGKLKFEFSFEATRGSTWEETEKEKSFEEITETITIPAKSKVKFSGMITQARCDVPFSYTRRDTLKDGKQVTNHFEDGIFTGVTTFDYKFETEKL

A0A5A7T8Z0 Uncharacterized protein1.6e-22064.72Show/hide
Query:  MAGLIRGFGKV------------STGIVGGLAKGTGKVVESIGEATEDLGDALFGNHQNHPKPVKFEDD------------------------------E
        + G ++G GK+              G +G + +GTGK +E++GEATED G+ +F   +  PK  +  DD                              +
Subjt:  MAGLIRGFGKV------------STGIVGGLAKGTGKVVESIGEATEDLGDALFGNHQNHPKPVKFEDD------------------------------E

Query:  DDEIDEAEEKLMKSEEGAEIDQSDHDDNEDGEEGEANVIPRHFSLKSKHNNKYLRYINEDETSAGLLRYAGKNVVGPYSKFAVQASKTDKGFVHIRCCYN
        DD+IDEAE+KLMKS    +ID S++++ E+ EE    VIP++ SLKS  N KYLRYI+E E + GLLRY+GKN+VGPYSKF+V ASKT  GF HIRCCYN
Subjt:  DDEIDEAEEKLMKSEEGAEIDQSDHDDNEDGEEGEANVIPRHFSLKSKHNNKYLRYINEDETSAGLLRYAGKNVVGPYSKFAVQASKTDKGFVHIRCCYN

Query:  NKYWVRSSEHSNYIAAVADENEDDKSKWSCTLFEPIFVPEQNG-FYFRHVQLNTFVCLAQGDPAPYGDCLAARVEDITTIDENLILSIVVDWDSIFILPK
        NK+WVR SE SNYIAA+A+E EDD SKWSCTLFEPIFVPE+ G +Y RHVQLNTF+C+A+GDP+PY DCL ARVEDIT IDENL+LS V DWDSIFILPK
Subjt:  NKYWVRSSEHSNYIAAVADENEDDKSKWSCTLFEPIFVPEQNG-FYFRHVQLNTFVCLAQGDPAPYGDCLAARVEDITTIDENLILSIVVDWDSIFILPK

Query:  YVAFKSNNDRYLEPSSQYLKFSASDIQDPVVVHEIIPMGDGYVRIKHVESGKYWIRDPNWIHCKSIQVDRDDPNTLFWPVKVDDNFVALRNKGNGRFCKR
        YVAFKSNND+YLEPS +YLKFSAS ++DP VV EII M DGYVRIKHV SGKYWIRDP+WI C SI + RD+PNTLFWPVKVD+N VA RNKGN RFCKR
Subjt:  YVAFKSNNDRYLEPSSQYLKFSASDIQDPVVVHEIIPMGDGYVRIKHVESGKYWIRDPNWIHCKSIQVDRDDPNTLFWPVKVDDNFVALRNKGNGRFCKR

Query:  LSTEGKSNCLNAAVSTITDTARFEVTEVVVARSIDNVEYRLNDARIYGKKIVTVSKGTAINESQVAEKVTLKFRYEKKVERTWSSSVSTTLGLTAKFTSK
        LST+GK+NCLNAAV TIT+TAR EVTE+VVARS+++V+YR+NDAR+YGKKI+TVSKG AIN ++V++K++LKFRYEKKVERTWSSSVS+T G+  KF +K
Subjt:  LSTEGKSNCLNAAVSTITDTARFEVTEVVVARSIDNVEYRLNDARIYGKKIVTVSKGTAINESQVAEKVTLKFRYEKKVERTWSSSVSTTLGLTAKFTSK

Query:  VPTVGKLKFEFSFEATRGSTWEETEKEKSFEEITETITIPAKSKVKFSGMITQARCDVPFSYTRRDTLKDGKQVTNHFEDGIFTGVTTFDYKFETEKL
        +PTVG +KFE S E +  +T EETEKEKSF E  ETITIPA SKVKFS M+TQA CDVPFSYTRRDTLKDG+QVT+  EDG+FTGVTT+DYKFETEK+
Subjt:  VPTVGKLKFEFSFEATRGSTWEETEKEKSFEEITETITIPAKSKVKFSGMITQARCDVPFSYTRRDTLKDGKQVTNHFEDGIFTGVTTFDYKFETEKL

A0A6J1DQ91 uncharacterized protein LOC1110225771.4e-24875.09Show/hide
Query:  MAGLIRGFGKVSTGIVGGLAKGTGKVVESIGEATEDLGDALFGNHQNHPK--------------------PVKFEDDEDDEIDEAEEKLMKSEEGAEI-D
        MAGL  GFGK  TG+VGGL + TGKVVE++GEA EDLG+AL G  QN+PK                    P++FE+D DD IDEAE+KLM S+EGA + +
Subjt:  MAGLIRGFGKVSTGIVGGLAKGTGKVVESIGEATEDLGDALFGNHQNHPK--------------------PVKFEDDEDDEIDEAEEKLMKSEEGAEI-D

Query:  QSDHDDNEDGEEGEANVIPRHFSLKSKHNNKYLRYINEDETSAGLLRYAGKNVVGPYSKFAVQASKTDKGFVHIRCCYNNKYWVRSSEHSNYIAAVADEN
        +SD +D+ED +  EAN+IP+HFSLKSKHNN+YLRY   DE S GLLRY+GKNVVGPYSKFA+  SKT+KGFVHIRCCYNNK+WVRSSEHSNYIAAVADE 
Subjt:  QSDHDDNEDGEEGEANVIPRHFSLKSKHNNKYLRYINEDETSAGLLRYAGKNVVGPYSKFAVQASKTDKGFVHIRCCYNNKYWVRSSEHSNYIAAVADEN

Query:  EDDKSKWSCTLFEPIFVPEQNGFYFRHVQLNTFVCLAQGDPAPYGDCLAARVEDITTIDENLILSIVVDWDSIFILPKYVAFKSNNDRYLEPSSQYLKFS
        E+DKSKWSCTLFEPIFVP+QNG+YFRHVQLN F+C A        DCLAA+VED+TTID+NL++S  +DWDSIFILPKYVAFKSNN  YLEPS +YLKFS
Subjt:  EDDKSKWSCTLFEPIFVPEQNGFYFRHVQLNTFVCLAQGDPAPYGDCLAARVEDITTIDENLILSIVVDWDSIFILPKYVAFKSNNDRYLEPSSQYLKFS

Query:  ASDIQDPVVVHEIIPMGDGYVRIKHVESGKYWIRDPNWIHCKSIQVDRDDPNTLFWPVKVDDNFVALRNKGNGRFCKRLSTEGKSNCLNAAVSTITDTAR
         S+I+DP VV EII M DGYVRIKHVES KYWIRDPNWIHC+SI   +D+PNTLFWPVKVD+NFVALRNK N RFCKRL+TEGK+NCLNAAV TITDTAR
Subjt:  ASDIQDPVVVHEIIPMGDGYVRIKHVESGKYWIRDPNWIHCKSIQVDRDDPNTLFWPVKVDDNFVALRNKGNGRFCKRLSTEGKSNCLNAAVSTITDTAR

Query:  FEVTEVVVARSIDNVEYRLNDARIYGKKIVTVSKGTAINESQVAEKVTLKFRYEKKVERTWSSSVSTTLGLTAKFTSKVPTVGKLKFEFSFEATRGSTWE
        FEVTEVVVARSID+VEYRLNDAR+YGKKI+TVSKGTAIN+++VAEKVTLKFRYEKKVER+WSSSVSTT+GL AKFT+K+PTVGKLKFE SFE T G TWE
Subjt:  FEVTEVVVARSIDNVEYRLNDARIYGKKIVTVSKGTAINESQVAEKVTLKFRYEKKVERTWSSSVSTTLGLTAKFTSKVPTVGKLKFEFSFEATRGSTWE

Query:  ETEKEKSFEEITETITIPAKSKVKFSGMITQARCDVPFSYTRRDTLKDGKQVTNHFEDGIFTGVTTFDYKFETEKLPL
        ETEKEKSFEE+TET+T+P  SKV+F G+ITQA CDVPFSY RRDTLKDG+QVTN FEDGIFTGVTTFDYKFETEKLPL
Subjt:  ETEKEKSFEEITETITIPAKSKVKFSGMITQARCDVPFSYTRRDTLKDGKQVTNHFEDGIFTGVTTFDYKFETEKLPL

A0A6J1JVU2 uncharacterized protein LOC1114883384.3e-21363.82Show/hide
Query:  LIRGFGKVSTGIVGGLAKGTGKVV---------------------------ESIGEATEDLGDALFGNHQNHPKP--VKFEDDE---DDEIDEAEEKLMK
        L+RG GK  T  +GG+ KG GK+V                           E+IGE TED G+ +F  ++N+PK    + +DD+   D++IDEAE+KLM 
Subjt:  LIRGFGKVSTGIVGGLAKGTGKVV---------------------------ESIGEATEDLGDALFGNHQNHPKP--VKFEDDE---DDEIDEAEEKLMK

Query:  SEEGAEIDQSDHDDNEDGEEGEANVIPRHFSLKSKHNNKYLRYINEDETSAGLLRYAGKNVVGPYSKFAVQASKTDKGFVHIRCCYNNKYWVRSSEHSNY
         E  A  D SD DD+   +E  A  IPR+FSLKS  NNKYLRYI+E E S GLLR++GKN+VGPYSKFA++ASKT  G VHIRCCYNNK+WVR SE SNY
Subjt:  SEEGAEIDQSDHDDNEDGEEGEANVIPRHFSLKSKHNNKYLRYINEDETSAGLLRYAGKNVVGPYSKFAVQASKTDKGFVHIRCCYNNKYWVRSSEHSNY

Query:  IAAVADENEDDKSKWSCTLFEPIFVPEQNGFYFRHVQLNTFVCLAQGDPAPYGDCLAARVEDITTIDENLILSIVVDWDSIFILPKYVAFKSNNDRYLEP
        IAA+A+E E+D+SKWSCTLFEPIF+P++   Y RHVQLNTF+C+A+ DP+PY DC+AAR+EDI+TID+NL+L   +DWDSIFILPKYVAFKSNN  YLEP
Subjt:  IAAVADENEDDKSKWSCTLFEPIFVPEQNGFYFRHVQLNTFVCLAQGDPAPYGDCLAARVEDITTIDENLILSIVVDWDSIFILPKYVAFKSNNDRYLEP

Query:  SSQYLKFSASDIQDPVVVHEIIPMGDGYVRIKHVESGKYWIRDPNWIHCKSIQVDRDDPNTLFWPVKVDDNFVALRNKGNGRFCKRLSTEGKSNCLNAAV
        S +YLKFSAS+++D  +V EII   DGYV IKHV SGKYW+RDPNWI C S    +D+PN LFWPVKVD N VALRNKGN  FCKRL+TEGK+NCLNAAV
Subjt:  SSQYLKFSASDIQDPVVVHEIIPMGDGYVRIKHVESGKYWIRDPNWIHCKSIQVDRDDPNTLFWPVKVDDNFVALRNKGNGRFCKRLSTEGKSNCLNAAV

Query:  STITDTARFEVTEVVVARSIDNVEYRLNDARIYGKKIVTVSKGTAINESQVAEKVTLKFRYEKKVERTWSSSVSTTLGLTAKFTSKVPTVGKLKFEFSFE
         TITDTAR EV E+VVARSI++VEYR+NDAR+YGKKI+TVSKG AIN ++VA+KV +KFRYEKKVE +WSSSVS+T G++ K ++K+PTVGKLKFE S E
Subjt:  STITDTARFEVTEVVVARSIDNVEYRLNDARIYGKKIVTVSKGTAINESQVAEKVTLKFRYEKKVERTWSSSVSTTLGLTAKFTSKVPTVGKLKFEFSFE

Query:  ATRGSTWEETEKEKSFEEITETITIPAKSKVKFSGMITQARCDVPFSYTRRDTLKDGKQVTNHFEDGIFTGVTTFDYKFETEKLPL
         ++GS+    E+EKSF E TETITIP  SKVKFS ++TQA CDVPFSYT++DTLKDG+QV++  EDGIF GVTT+DYKFETEKLPL
Subjt:  ATRGSTWEETEKEKSFEEITETITIPAKSKVKFSGMITQARCDVPFSYTRRDTLKDGKQVTNHFEDGIFTGVTTFDYKFETEKLPL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGGTCTTATTAGAGGATTTGGGAAAGTTAGCACTGGCATCGTGGGTGGCCTAGCTAAAGGAACTGGGAAAGTGGTTGAATCCATAGGAGAAGCCACTGAGGATCT
CGGCGATGCATTGTTTGGCAACCACCAAAACCACCCAAAACCTGTCAAGTTTGAAGACGACGAGGATGACGAGATAGACGAAGCAGAGGAGAAGTTGATGAAATCGGAAG
AAGGAGCTGAGATAGATCAGTCCGACCACGACGACAACGAAGATGGAGAAGAAGGAGAAGCAAACGTTATCCCAAGACATTTCTCTCTCAAATCCAAGCACAACAACAAA
TACCTCCGTTACATAAACGAAGATGAGACCTCCGCCGGGCTTCTCCGGTACGCCGGGAAGAACGTCGTCGGTCCGTATTCGAAATTCGCAGTCCAAGCGTCGAAAACCGA
CAAGGGTTTCGTCCACATTCGGTGTTGCTACAACAACAAATACTGGGTTCGCTCGTCGGAGCATTCCAACTACATCGCCGCCGTCGCCGACGAAAACGAAGACGACAAAT
CAAAATGGTCCTGCACCTTGTTCGAACCCATCTTCGTGCCCGAGCAAAATGGATTCTACTTTCGCCATGTTCAGCTCAACACATTTGTCTGTTTGGCTCAAGGGGATCCA
GCACCGTACGGAGATTGTTTAGCTGCAAGAGTTGAAGATATCACCACAATCGATGAGAATCTTATTCTCTCCATCGTTGTGGATTGGGACTCCATATTCATACTCCCCAA
ATACGTAGCTTTCAAGAGCAACAACGATAGATATCTCGAACCATCGAGTCAATACCTAAAATTCTCCGCTTCCGACATCCAAGATCCCGTGGTCGTCCACGAGATCATTC
CCATGGGAGATGGATACGTTCGGATAAAGCACGTCGAGTCAGGAAAGTATTGGATTCGCGACCCGAATTGGATCCATTGCAAATCGATCCAAGTCGACCGAGACGATCCC
AACACATTGTTTTGGCCGGTCAAGGTCGACGACAACTTCGTCGCTCTCCGCAACAAGGGCAACGGTCGTTTCTGCAAGAGGCTCTCGACCGAAGGAAAGTCGAATTGTCT
CAACGCCGCAGTCTCGACGATCACCGACACTGCACGCTTCGAGGTGACGGAAGTTGTCGTGGCTAGGAGCATCGACAACGTCGAGTATCGGCTCAACGATGCAAGGATCT
ATGGGAAGAAGATCGTCACGGTGTCGAAAGGAACCGCCATTAACGAATCTCAAGTTGCAGAGAAAGTAACATTGAAGTTTCGGTATGAGAAGAAGGTGGAAAGAACTTGG
AGTTCTTCTGTGTCGACCACTCTGGGATTGACTGCAAAATTCACGTCGAAGGTTCCAACGGTTGGGAAGCTGAAGTTTGAATTTTCGTTTGAGGCTACGCGAGGAAGCAC
TTGGGAAGAAACCGAGAAGGAGAAATCGTTTGAAGAGATTACAGAAACGATTACAATACCTGCGAAGTCGAAAGTGAAGTTCAGTGGCATGATAACGCAAGCACGTTGCG
ATGTTCCTTTTTCTTACACTCGACGTGATACACTGAAAGATGGGAAGCAAGTTACCAATCACTTTGAAGATGGGATTTTTACTGGCGTTACAACCTTTGACTACAAATTT
GAGACAGAAAAACTACCGTTGGATGCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTGGTCTTATTAGAGGATTTGGGAAAGTTAGCACTGGCATCGTGGGTGGCCTAGCTAAAGGAACTGGGAAAGTGGTTGAATCCATAGGAGAAGCCACTGAGGATCT
CGGCGATGCATTGTTTGGCAACCACCAAAACCACCCAAAACCTGTCAAGTTTGAAGACGACGAGGATGACGAGATAGACGAAGCAGAGGAGAAGTTGATGAAATCGGAAG
AAGGAGCTGAGATAGATCAGTCCGACCACGACGACAACGAAGATGGAGAAGAAGGAGAAGCAAACGTTATCCCAAGACATTTCTCTCTCAAATCCAAGCACAACAACAAA
TACCTCCGTTACATAAACGAAGATGAGACCTCCGCCGGGCTTCTCCGGTACGCCGGGAAGAACGTCGTCGGTCCGTATTCGAAATTCGCAGTCCAAGCGTCGAAAACCGA
CAAGGGTTTCGTCCACATTCGGTGTTGCTACAACAACAAATACTGGGTTCGCTCGTCGGAGCATTCCAACTACATCGCCGCCGTCGCCGACGAAAACGAAGACGACAAAT
CAAAATGGTCCTGCACCTTGTTCGAACCCATCTTCGTGCCCGAGCAAAATGGATTCTACTTTCGCCATGTTCAGCTCAACACATTTGTCTGTTTGGCTCAAGGGGATCCA
GCACCGTACGGAGATTGTTTAGCTGCAAGAGTTGAAGATATCACCACAATCGATGAGAATCTTATTCTCTCCATCGTTGTGGATTGGGACTCCATATTCATACTCCCCAA
ATACGTAGCTTTCAAGAGCAACAACGATAGATATCTCGAACCATCGAGTCAATACCTAAAATTCTCCGCTTCCGACATCCAAGATCCCGTGGTCGTCCACGAGATCATTC
CCATGGGAGATGGATACGTTCGGATAAAGCACGTCGAGTCAGGAAAGTATTGGATTCGCGACCCGAATTGGATCCATTGCAAATCGATCCAAGTCGACCGAGACGATCCC
AACACATTGTTTTGGCCGGTCAAGGTCGACGACAACTTCGTCGCTCTCCGCAACAAGGGCAACGGTCGTTTCTGCAAGAGGCTCTCGACCGAAGGAAAGTCGAATTGTCT
CAACGCCGCAGTCTCGACGATCACCGACACTGCACGCTTCGAGGTGACGGAAGTTGTCGTGGCTAGGAGCATCGACAACGTCGAGTATCGGCTCAACGATGCAAGGATCT
ATGGGAAGAAGATCGTCACGGTGTCGAAAGGAACCGCCATTAACGAATCTCAAGTTGCAGAGAAAGTAACATTGAAGTTTCGGTATGAGAAGAAGGTGGAAAGAACTTGG
AGTTCTTCTGTGTCGACCACTCTGGGATTGACTGCAAAATTCACGTCGAAGGTTCCAACGGTTGGGAAGCTGAAGTTTGAATTTTCGTTTGAGGCTACGCGAGGAAGCAC
TTGGGAAGAAACCGAGAAGGAGAAATCGTTTGAAGAGATTACAGAAACGATTACAATACCTGCGAAGTCGAAAGTGAAGTTCAGTGGCATGATAACGCAAGCACGTTGCG
ATGTTCCTTTTTCTTACACTCGACGTGATACACTGAAAGATGGGAAGCAAGTTACCAATCACTTTGAAGATGGGATTTTTACTGGCGTTACAACCTTTGACTACAAATTT
GAGACAGAAAAACTACCGTTGGATGCTTGA
Protein sequenceShow/hide protein sequence
MAGLIRGFGKVSTGIVGGLAKGTGKVVESIGEATEDLGDALFGNHQNHPKPVKFEDDEDDEIDEAEEKLMKSEEGAEIDQSDHDDNEDGEEGEANVIPRHFSLKSKHNNK
YLRYINEDETSAGLLRYAGKNVVGPYSKFAVQASKTDKGFVHIRCCYNNKYWVRSSEHSNYIAAVADENEDDKSKWSCTLFEPIFVPEQNGFYFRHVQLNTFVCLAQGDP
APYGDCLAARVEDITTIDENLILSIVVDWDSIFILPKYVAFKSNNDRYLEPSSQYLKFSASDIQDPVVVHEIIPMGDGYVRIKHVESGKYWIRDPNWIHCKSIQVDRDDP
NTLFWPVKVDDNFVALRNKGNGRFCKRLSTEGKSNCLNAAVSTITDTARFEVTEVVVARSIDNVEYRLNDARIYGKKIVTVSKGTAINESQVAEKVTLKFRYEKKVERTW
SSSVSTTLGLTAKFTSKVPTVGKLKFEFSFEATRGSTWEETEKEKSFEEITETITIPAKSKVKFSGMITQARCDVPFSYTRRDTLKDGKQVTNHFEDGIFTGVTTFDYKF
ETEKLPLDA