CuGenDBv2

Moc07g01320 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc07g01320
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Agglutinin domain-containing protein
Genome location: chr7:1065197..1068266
RNA-Seq Expression: Moc07g01320
Synteny: Moc07g01320
Gene Ontology terms: NA
InterPro domains:
    IPR004991 - Aerolysin-like toxin
    IPR008998 - Agglutinin domain
    IPR036242 - Agglutinin domain superfamily
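The genome location in this record follows a `chromosome:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this), the string can be split into its parts and the genomic span computed. The function name `parse_location` is hypothetical and used only for illustration.

```python
import re

def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    return chrom, start, end

chrom, start, end = parse_location("chr7:1065197..1068266")

# Assumption: 1-based inclusive coordinates, so both endpoints count.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene spans 3070 bp on chr7, which is longer than the coding sequence listed under Sequences.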


Homology
GenBank top hits (e value, % identity, alignment)
KAA0039763.1 uncharacterized protein E6C27_scaffold122G00260 [Cucumis melo var. makuwa] (e value: 4.6e-144; % identity: 52.63)
Query:  ISGKDIEISGGEDDKSIIPKHFALQNYRPRFPQPKTAPYLRYVQDHEKQVDGFLQFSGKK-LPSPVSKFHSEASESDPRFMHIRCSYNNKYWVRQSPDSN
        I    +++S   DD+SIIPK+FALQNY PR PQP TAP+L+ +        G+L+F+ +  L SP+SKF SE S+SD + +HIRC+ NN+YWVR+S DSN
Subjt:  ISGKDIEISGGEDDKSIIPKHFALQNYRPRFPQPKTAPYLRYVQDHEKQVDGFLQFSGKK-LPSPVSKFHSEASESDPRFMHIRCSYNNKYWVRQSPDSN

Query:  YIVAIGTKQENDQSKWSCTLFEPIYDADHKGYRFRHVQLGYELFRASLFDEFPDGLLAKEKGATIEEWED--NAFNTLIDWDSLVILPKHVTFKGSNGKY
        +IV   TK+E+D+SKWSCTLFEPI DA +K YRFRHVQLGYELFR     +  + L A E G    E ED    F  +IDW+SL + PKHVTFKG NG+Y
Subjt:  YIVAIGTKQENDQSKWSCTLFEPIYDADHKGYRFRHVQLGYELFRASLFDEFPDGLLAKEKGATIEEWED--NAFNTLIDWDSLVILPKHVTFKGSNGKY

Query:  LKYNGHYLQFSGTDVENPSHIHEIFPKNDGTIRIKNVGCQKFWIRDPNWIVVVAEDSSRDDLNSLFQPVKLGNNIVALRSLGNNHFCTSLSIDGKSNCLN
        L++ G YLQ SG D  +PS IHEIFP+ DG + +KNV  ++FWI DPNWIV  A D +RDD N  F PV L +N+VALR LGN  FCT ++ D K NCLN
Subjt:  LKYNGHYLQFSGTDVENPSHIHEIFPKNDGTIRIKNVGCQKFWIRDPNWIVVVAEDSSRDDLNSLFQPVKLGNNIVALRSLGNNHFCTSLSIDGKSNCLN

Query:  ADLENPIVETEMEFAE--AVMSSRIENIEYRMKDAKIYGERVWSMVKGDAINKTRAADTVQFTFSFEDKMKRNWTNALGVKFGVSKQFTAGVPMIGDGSI
        A   + + E + E +E   + S RI++ +Y + D +IYGERVWSM KG AINKT   + ++FTFSFEDK  + WT+    +F V K+F    P I DG +
Subjt:  ADLENPIVETEMEFAE--AVMSSRIENIEYRMKDAKIYGERVWSMVKGDAINKTRAADTVQFTFSFEDKMKRNWTNALGVKFGVSKQFTAGVPMIGDGSI

Query:  TVSVVAGGEYAWGET-QKEKRFMSCSSTITVPPMSKVKMNAIVKRGFCNVPFSYTKIDTLRDGTQISREYDDGVFNGIQSYDFQFRSDKVVLPL
         +     G Y WGET   +K  MSC+STITVPPMSKVK+N +VKRGFC VPFSY + +   +G +  + Y DGVF G  SY FQ R+DK  LP+
Subjt:  TVSVVAGGEYAWGET-QKEKRFMSCSSTITVPPMSKVKMNAIVKRGFCNVPFSYTKIDTLRDGTQISREYDDGVFNGIQSYDFQFRSDKVVLPL

XP_004140504.1 uncharacterized protein LOC101208463 [Cucumis sativus] (e value: 3.9e-143; % identity: 52.67)
Query:  SGGEDDKSIIPKHFALQNYRPRFPQPKTAPYLRYVQDHEKQVDGFLQFSGK-KLPSPVSKFHSEASESDPRFMHIRCSYNNKYWVRQSPDSNYIVAIGTK
        S   DDKSIIPK+FALQNY PR PQP+TAP+L+      +   G+L+F+G+  L SP SKF SE SESDP+ +HIRC+ NNKYWVR+S DSN+IV   TK
Subjt:  SGGEDDKSIIPKHFALQNYRPRFPQPKTAPYLRYVQDHEKQVDGFLQFSGK-KLPSPVSKFHSEASESDPRFMHIRCSYNNKYWVRQSPDSNYIVAIGTK

Query:  QENDQSKWSCTLFEPIYDADHKGYRFRHVQLGYELFRASLFDEFPDGLLAKEKGATIEEWED--NAFNTLIDWDSLVILPKHVTFKGSNGKYLKYNGHYL
        +E+++SK SCTLF+PIYDA HK Y FRHVQLGYELFR     +  + LLA+E G    E ED    F  +IDW+SL + PK VT KG NG+YL+Y G YL
Subjt:  QENDQSKWSCTLFEPIYDADHKGYRFRHVQLGYELFRASLFDEFPDGLLAKEKGATIEEWED--NAFNTLIDWDSLVILPKHVTFKGSNGKYLKYNGHYL

Query:  QFSGTDVENPSHIHEIFPKNDGTIRIKNVGCQKFWIRDPNWIVVVAEDSSRDDLNSLFQPVKLGNNIVALRSLGNNHFCTSLSIDGKSNCLNADLENPIV
        Q +G +  +PS IHEI+P+ DG ++IKN+   +FWI DP+WIV  A D +RDD   LF+PV L +N+V   SLGN   C  +S+D K NCLNA   +P  
Subjt:  QFSGTDVENPSHIHEIFPKNDGTIRIKNVGCQKFWIRDPNWIVVVAEDSSRDDLNSLFQPVKLGNNIVALRSLGNNHFCTSLSIDGKSNCLNADLENPIV

Query:  ETEMEFAE--AVMSSRIENIEYRMKDAKIYGERVWSMVKGDAINKTRAADTVQFTFSFEDKMKRNWTNALGVKFGVSKQFTAGVPMIGDGSITVSVVAGG
        ET+ + +E   +   +I+ ++Y++++ +IYGERVWS+ KG AINKT   D ++FTFSFEDK  + WT+    +F  +K F A  P I DG +      GG
Subjt:  ETEMEFAE--AVMSSRIENIEYRMKDAKIYGERVWSMVKGDAINKTRAADTVQFTFSFEDKMKRNWTNALGVKFGVSKQFTAGVPMIGDGSITVSVVAGG

Query:  EYAWGET-QKEKRFMSCSSTITVPPMSKVKMNAIVKRGFCNVPFSYTKIDTLRDGTQISREYDDGVFNGIQSYDFQFRSDKVVLPL
         Y W ET  K+K  MSC+STITVPP SKVK+N +VKRGFC VPFSYT+I+T  +G   ++ Y+DGVF G+ SY FQ  +DKV LP+
Subjt:  EYAWGET-QKEKRFMSCSSTITVPPMSKVKMNAIVKRGFCNVPFSYTKIDTLRDGTQISREYDDGVFNGIQSYDFQFRSDKVVLPL

XP_022155409.1 uncharacterized protein LOC111022557 [Momordica charantia] (e value: 9.8e-304; % identity: 100)
Query:  MLDSEVEDRLREERLENRYKEISGKDIEISGGEDDKSIIPKHFALQNYRPRFPQPKTAPYLRYVQDHEKQVDGFLQFSGKKLPSPVSKFHSEASESDPRF
        MLDSEVEDRLREERLENRYKEISGKDIEISGGEDDKSIIPKHFALQNYRPRFPQPKTAPYLRYVQDHEKQVDGFLQFSGKKLPSPVSKFHSEASESDPRF
Subjt:  MLDSEVEDRLREERLENRYKEISGKDIEISGGEDDKSIIPKHFALQNYRPRFPQPKTAPYLRYVQDHEKQVDGFLQFSGKKLPSPVSKFHSEASESDPRF

Query:  MHIRCSYNNKYWVRQSPDSNYIVAIGTKQENDQSKWSCTLFEPIYDADHKGYRFRHVQLGYELFRASLFDEFPDGLLAKEKGATIEEWEDNAFNTLIDWD
        MHIRCSYNNKYWVRQSPDSNYIVAIGTKQENDQSKWSCTLFEPIYDADHKGYRFRHVQLGYELFRASLFDEFPDGLLAKEKGATIEEWEDNAFNTLIDWD
Subjt:  MHIRCSYNNKYWVRQSPDSNYIVAIGTKQENDQSKWSCTLFEPIYDADHKGYRFRHVQLGYELFRASLFDEFPDGLLAKEKGATIEEWEDNAFNTLIDWD

Query:  SLVILPKHVTFKGSNGKYLKYNGHYLQFSGTDVENPSHIHEIFPKNDGTIRIKNVGCQKFWIRDPNWIVVVAEDSSRDDLNSLFQPVKLGNNIVALRSLG
        SLVILPKHVTFKGSNGKYLKYNGHYLQFSGTDVENPSHIHEIFPKNDGTIRIKNVGCQKFWIRDPNWIVVVAEDSSRDDLNSLFQPVKLGNNIVALRSLG
Subjt:  SLVILPKHVTFKGSNGKYLKYNGHYLQFSGTDVENPSHIHEIFPKNDGTIRIKNVGCQKFWIRDPNWIVVVAEDSSRDDLNSLFQPVKLGNNIVALRSLG

Query:  NNHFCTSLSIDGKSNCLNADLENPIVETEMEFAEAVMSSRIENIEYRMKDAKIYGERVWSMVKGDAINKTRAADTVQFTFSFEDKMKRNWTNALGVKFGV
        NNHFCTSLSIDGKSNCLNADLENPIVETEMEFAEAVMSSRIENIEYRMKDAKIYGERVWSMVKGDAINKTRAADTVQFTFSFEDKMKRNWTNALGVKFGV
Subjt:  NNHFCTSLSIDGKSNCLNADLENPIVETEMEFAEAVMSSRIENIEYRMKDAKIYGERVWSMVKGDAINKTRAADTVQFTFSFEDKMKRNWTNALGVKFGV

Query:  SKQFTAGVPMIGDGSITVSVVAGGEYAWGETQKEKRFMSCSSTITVPPMSKVKMNAIVKRGFCNVPFSYTKIDTLRDGTQISREYDDGVFNGIQSYDFQF
        SKQFTAGVPMIGDGSITVSVVAGGEYAWGETQKEKRFMSCSSTITVPPMSKVKMNAIVKRGFCNVPFSYTKIDTLRDGTQISREYDDGVFNGIQSYDFQF
Subjt:  SKQFTAGVPMIGDGSITVSVVAGGEYAWGETQKEKRFMSCSSTITVPPMSKVKMNAIVKRGFCNVPFSYTKIDTLRDGTQISREYDDGVFNGIQSYDFQF

Query:  RSDKVVLPL
        RSDKVVLPL
Subjt:  RSDKVVLPL

XP_038906851.1 uncharacterized protein LOC120092742 [Benincasa hispida] (e value: 5.0e-199; % identity: 64.75)
Query:  EVEDRLREERLENRYKEISGKDIEISGGEDDKSIIPKHFALQNYRPRFPQPKTAPYLRYVQDHEKQVDGFLQFSGKKLPSPVSKFHSEASESDPRFMHIR
        E+ + +  +++E+ Y E+ GK++++S GEDDKSIIP+ FALQN  P+ PQPKTAPYLRYV + +K  +G L FSGK + SP SKF SE SE+DP+  HI+
Subjt:  EVEDRLREERLENRYKEISGKDIEISGGEDDKSIIPKHFALQNYRPRFPQPKTAPYLRYVQDHEKQVDGFLQFSGKKLPSPVSKFHSEASESDPRFMHIR

Query:  CSYNNKYWVRQSPDSNYIVAIGTKQENDQSKWSCTLFEPIYDADHKGYRFRHVQLGYELFRASLFDEFPDGLLAKEKGATIEEWEDNAFNTLIDWDSLVI
        C YNNKYWVR+S +S+YI+A  TK+E D+SKW+CTLFEPIYD+D+K +RF HVQ   ELFRA  +D + D LLAKE  AT+   ED  F T+IDW SL I
Subjt:  CSYNNKYWVRQSPDSNYIVAIGTKQENDQSKWSCTLFEPIYDADHKGYRFRHVQLGYELFRASLFDEFPDGLLAKEKGATIEEWEDNAFNTLIDWDSLVI

Query:  LPKHVTFKGSNGKYLKYNGHYLQFSGTDVENPSHIHEIFPKNDGTIRIKNVGCQKFWIRDPNWIVVVAEDSSRDDLNSLFQPVKLGNNIVALRSLGNNHF
         PKHVTFKG NGKYLK+ G++LQFSGTD+E+PS IHEIFP+NDGT+RIKNVG +KFWIRD NWI+  A + S +D N+ FQPVKLG+NIVALR+LGNNHF
Subjt:  LPKHVTFKGSNGKYLKYNGHYLQFSGTDVENPSHIHEIFPKNDGTIRIKNVGCQKFWIRDPNWIVVVAEDSSRDDLNSLFQPVKLGNNIVALRSLGNNHF

Query:  CTSLSIDGKSNCLNADLENPIVETEMEFAEAVMSSRIENIEYRMKDAKIYGERVWSMVKGDAINKTRAADTVQFTFSFEDKMKRNWTNALGVKFGVSKQF
        CTSLS+D K++CLNA+  NP  E  ME +EAV+SS+IENIEYR++DAKIYGERVWSM KGDAINKT+AADT+QFTFSFEDK K+NWTN +  KFGV+++F
Subjt:  CTSLSIDGKSNCLNADLENPIVETEMEFAEAVMSSRIENIEYRMKDAKIYGERVWSMVKGDAINKTRAADTVQFTFSFEDKMKRNWTNALGVKFGVSKQF

Query:  TAGVPMIGDGSITVSVVAGGEYAWGETQKEKRFMSCSSTITVPPMSKVKMNAIVKRGFCNVPFSYTKIDTLRDGTQISREYDDGVFNGIQSYDFQFRSDK
        TAGVP+IGD  + +    GG Y+WGET K+K  M+CSSTITVPPMSKVK++ +VKRGFCNVP+SYT+ DTLRDG Q + EY+DGVF+G+ SY F  R+DK
Subjt:  TAGVPMIGDGSITVSVVAGGEYAWGETQKEKRFMSCSSTITVPPMSKVKMNAIVKRGFCNVPFSYTKIDTLRDGTQISREYDDGVFNGIQSYDFQFRSDK

Query:  VVLPL
        V LPL
Subjt:  VVLPL

XP_038906982.1 uncharacterized protein LOC120092830 [Benincasa hispida] (e value: 3.0e-199; % identity: 65.28)
Query:  VEDRLREERLENRYKEISGKDIEISGGEDDKSIIPKHFALQNYRPRFPQPKTAPYLRYVQDHEKQVDGFLQFSGKKLPSPVSKFHSEASESDPRFMHIRC
        +E+R R  ++E++Y E+ GK++++S GEDDKSIIP+ FALQN  P+ PQPKTAPYLRYV + +K  +G L FSGK + SP SKF SE SE+DP+  HI+C
Subjt:  VEDRLREERLENRYKEISGKDIEISGGEDDKSIIPKHFALQNYRPRFPQPKTAPYLRYVQDHEKQVDGFLQFSGKKLPSPVSKFHSEASESDPRFMHIRC

Query:  SYNNKYWVRQSPDSNYIVAIGTKQENDQSKWSCTLFEPIYDADHKGYRFRHVQLGYELFRASLFDEFPDGLLAKEKGATIEEWEDNAFNTLIDWDSLVIL
         YNNKYWVR+S +S+YI+A  TK+E D+SKW+CTLFEPIYD+D+K +RF HVQ   ELFRA  +D + D LLAKE  AT+   ED  F T+IDW SL I 
Subjt:  SYNNKYWVRQSPDSNYIVAIGTKQENDQSKWSCTLFEPIYDADHKGYRFRHVQLGYELFRASLFDEFPDGLLAKEKGATIEEWEDNAFNTLIDWDSLVIL

Query:  PKHVTFKGSNGKYLKYNGHYLQFSGTDVENPSHIHEIFPKNDGTIRIKNVGCQKFWIRDPNWIVVVAEDSSRDDLNSLFQPVKLGNNIVALRSLGNNHFC
        PKHVTFKG NGKYLK+ G++LQFSGTD+E+PS IHEIFP+NDGT+RIKNVG +KFWIRD NWI+  A + S +D N+ FQPVKLG+NIVALR+LGNNHFC
Subjt:  PKHVTFKGSNGKYLKYNGHYLQFSGTDVENPSHIHEIFPKNDGTIRIKNVGCQKFWIRDPNWIVVVAEDSSRDDLNSLFQPVKLGNNIVALRSLGNNHFC

Query:  TSLSIDGKSNCLNADLENPIVETEMEFAEAVMSSRIENIEYRMKDAKIYGERVWSMVKGDAINKTRAADTVQFTFSFEDKMKRNWTNALGVKFGVSKQFT
        TSLS+D K++CLNA+  NP  E  ME +EAV+SS+IENIEYR++DAKIYGERVWSM KGDAINKT+AADT+QFTFSFEDK K+NWTN +  KFGV+++FT
Subjt:  TSLSIDGKSNCLNADLENPIVETEMEFAEAVMSSRIENIEYRMKDAKIYGERVWSMVKGDAINKTRAADTVQFTFSFEDKMKRNWTNALGVKFGVSKQFT

Query:  AGVPMIGDGSITVSVVAGGEYAWGETQKEKRFMSCSSTITVPPMSKVKMNAIVKRGFCNVPFSYTKIDTLRDGTQISREYDDGVFNGIQSYDFQFRSDKV
        AGVP+IGD  + +    GG Y+WGET K+K  M+CSSTITVPPMSKVK++ +VKRGFCNVP+SYT+ DTLRDG Q + EY+DGVF+G+ SY FQ R+DKV
Subjt:  AGVPMIGDGSITVSVVAGGEYAWGETQKEKRFMSCSSTITVPPMSKVKMNAIVKRGFCNVPFSYTKIDTLRDGTQISREYDDGVFNGIQSYDFQFRSDKV

Query:  VLPL
         LP+
Subjt:  VLPL

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KAP4 Agglutinin domain-containing protein (e value: 4.0e-141; % identity: 52.35)
Query:  ENRYKEISGKDIEISGGEDDKSIIPKHFALQNYRPRFPQPKTAPYLRYVQDHEKQVDGFLQFSGKK-LPSPVSKFHSEASESDPRFMHIRCSYNNKYWVR
        E RYK      ++ S   DDKSI PK+FALQNY PR PQP+TAP+L+Y+       + +L+F+G+  L  P SKF SE S+S+P+ +HIRC+  NKYWVR
Subjt:  ENRYKEISGKDIEISGGEDDKSIIPKHFALQNYRPRFPQPKTAPYLRYVQDHEKQVDGFLQFSGKK-LPSPVSKFHSEASESDPRFMHIRCSYNNKYWVR

Query:  QSPDSNYIVAIGTKQENDQSKWSCTLFEPIYDADHKGYRFRHVQLGYELFRASLFDEFPDGLLAKEKGATIEEWED--NAFNTLIDWDSLVILPKHVTFK
        +S DSN+IV I TK+E++ SK SCTLFEPIYDA +K YRFRHVQLGYELFR     +  D LLA+E G+   E ED    F  +IDW+SL + PKHVTFK
Subjt:  QSPDSNYIVAIGTKQENDQSKWSCTLFEPIYDADHKGYRFRHVQLGYELFRASLFDEFPDGLLAKEKGATIEEWED--NAFNTLIDWDSLVILPKHVTFK

Query:  GSNGKYLKYNGHYLQFSGTDVENPSHIHEIFPKNDGTIRIKNVGCQKFWIRDPNWIVVVAEDSSRDDLNSLFQPVKLGNNIVALRSLGNNHFCTSLSIDG
        G NGKYL++ G YLQ SG +  + S IHEI+P+ DG + IKN+  ++FWI DPNWIV  A D +RDD N LFQPV L NN+VALRSLGN  FC  +S+D 
Subjt:  GSNGKYLKYNGHYLQFSGTDVENPSHIHEIFPKNDGTIRIKNVGCQKFWIRDPNWIVVVAEDSSRDDLNSLFQPVKLGNNIVALRSLGNNHFCTSLSIDG

Query:  KSNCLNADLENPIVETEMEFAEAVMSSRIE---NIEYRMKDAKIYGERVWSMVKGDAINKTRAADTVQFTFSFEDKMKRNWTNALGVKFGVSKQFTAGVP
        + NCLNA   +P  ET+ E +E  +  R +   NI YR+ + +IYGERVWSM KG AINKT   + ++FTFSFED+    WTN    +F  +K F A  P
Subjt:  KSNCLNADLENPIVETEMEFAEAVMSSRIE---NIEYRMKDAKIYGERVWSMVKGDAINKTRAADTVQFTFSFEDKMKRNWTNALGVKFGVSKQFTAGVP

Query:  MIGDGSITVSVVAGGEYAWGET-QKEKRFMSCSSTITVPPMSKVKMNAIVKRGFCNVPFSYTKIDT---------LRDGTQISREYDDGVFNGIQSYDFQ
        +I DG IT+         WGET +K+K  MSC +TITVPPMSKVK+N +VKRGFC VPFSY    T          RDG      + DG F G+ SY FQ
Subjt:  MIGDGSITVSVVAGGEYAWGET-QKEKRFMSCSSTITVPPMSKVKMNAIVKRGFCNVPFSYTKIDT---------LRDGTQISREYDDGVFNGIQSYDFQ

Query:  FRSDKVVLPL
          +D+  LP+
Subjt:  FRSDKVVLPL

A0A0A0KFN1 Agglutinin domain-containing protein (e value: 1.9e-143; % identity: 52.67)
Query:  SGGEDDKSIIPKHFALQNYRPRFPQPKTAPYLRYVQDHEKQVDGFLQFSGK-KLPSPVSKFHSEASESDPRFMHIRCSYNNKYWVRQSPDSNYIVAIGTK
        S   DDKSIIPK+FALQNY PR PQP+TAP+L+      +   G+L+F+G+  L SP SKF SE SESDP+ +HIRC+ NNKYWVR+S DSN+IV   TK
Subjt:  SGGEDDKSIIPKHFALQNYRPRFPQPKTAPYLRYVQDHEKQVDGFLQFSGK-KLPSPVSKFHSEASESDPRFMHIRCSYNNKYWVRQSPDSNYIVAIGTK

Query:  QENDQSKWSCTLFEPIYDADHKGYRFRHVQLGYELFRASLFDEFPDGLLAKEKGATIEEWED--NAFNTLIDWDSLVILPKHVTFKGSNGKYLKYNGHYL
        +E+++SK SCTLF+PIYDA HK Y FRHVQLGYELFR     +  + LLA+E G    E ED    F  +IDW+SL + PK VT KG NG+YL+Y G YL
Subjt:  QENDQSKWSCTLFEPIYDADHKGYRFRHVQLGYELFRASLFDEFPDGLLAKEKGATIEEWED--NAFNTLIDWDSLVILPKHVTFKGSNGKYLKYNGHYL

Query:  QFSGTDVENPSHIHEIFPKNDGTIRIKNVGCQKFWIRDPNWIVVVAEDSSRDDLNSLFQPVKLGNNIVALRSLGNNHFCTSLSIDGKSNCLNADLENPIV
        Q +G +  +PS IHEI+P+ DG ++IKN+   +FWI DP+WIV  A D +RDD   LF+PV L +N+V   SLGN   C  +S+D K NCLNA   +P  
Subjt:  QFSGTDVENPSHIHEIFPKNDGTIRIKNVGCQKFWIRDPNWIVVVAEDSSRDDLNSLFQPVKLGNNIVALRSLGNNHFCTSLSIDGKSNCLNADLENPIV

Query:  ETEMEFAE--AVMSSRIENIEYRMKDAKIYGERVWSMVKGDAINKTRAADTVQFTFSFEDKMKRNWTNALGVKFGVSKQFTAGVPMIGDGSITVSVVAGG
        ET+ + +E   +   +I+ ++Y++++ +IYGERVWS+ KG AINKT   D ++FTFSFEDK  + WT+    +F  +K F A  P I DG +      GG
Subjt:  ETEMEFAE--AVMSSRIENIEYRMKDAKIYGERVWSMVKGDAINKTRAADTVQFTFSFEDKMKRNWTNALGVKFGVSKQFTAGVPMIGDGSITVSVVAGG

Query:  EYAWGET-QKEKRFMSCSSTITVPPMSKVKMNAIVKRGFCNVPFSYTKIDTLRDGTQISREYDDGVFNGIQSYDFQFRSDKVVLPL
         Y W ET  K+K  MSC+STITVPP SKVK+N +VKRGFC VPFSYT+I+T  +G   ++ Y+DGVF G+ SY FQ  +DKV LP+
Subjt:  EYAWGET-QKEKRFMSCSSTITVPPMSKVKMNAIVKRGFCNVPFSYTKIDTLRDGTQISREYDDGVFNGIQSYDFQFRSDKVVLPL

A0A5D3DM66 Agglutinin domain-containing protein (e value: 2.2e-144; % identity: 52.63)
Query:  ISGKDIEISGGEDDKSIIPKHFALQNYRPRFPQPKTAPYLRYVQDHEKQVDGFLQFSGKK-LPSPVSKFHSEASESDPRFMHIRCSYNNKYWVRQSPDSN
        I    +++S   DD+SIIPK+FALQNY PR PQP TAP+L+ +        G+L+F+ +  L SP+SKF SE S+SD + +HIRC+ NN+YWVR+S DSN
Subjt:  ISGKDIEISGGEDDKSIIPKHFALQNYRPRFPQPKTAPYLRYVQDHEKQVDGFLQFSGKK-LPSPVSKFHSEASESDPRFMHIRCSYNNKYWVRQSPDSN

Query:  YIVAIGTKQENDQSKWSCTLFEPIYDADHKGYRFRHVQLGYELFRASLFDEFPDGLLAKEKGATIEEWED--NAFNTLIDWDSLVILPKHVTFKGSNGKY
        +IV   TK+E+D+SKWSCTLFEPI DA +K YRFRHVQLGYELFR     +  + L A E G    E ED    F  +IDW+SL + PKHVTFKG NG+Y
Subjt:  YIVAIGTKQENDQSKWSCTLFEPIYDADHKGYRFRHVQLGYELFRASLFDEFPDGLLAKEKGATIEEWED--NAFNTLIDWDSLVILPKHVTFKGSNGKY

Query:  LKYNGHYLQFSGTDVENPSHIHEIFPKNDGTIRIKNVGCQKFWIRDPNWIVVVAEDSSRDDLNSLFQPVKLGNNIVALRSLGNNHFCTSLSIDGKSNCLN
        L++ G YLQ SG D  +PS IHEIFP+ DG + +KNV  ++FWI DPNWIV  A D +RDD N  F PV L +N+VALR LGN  FCT ++ D K NCLN
Subjt:  LKYNGHYLQFSGTDVENPSHIHEIFPKNDGTIRIKNVGCQKFWIRDPNWIVVVAEDSSRDDLNSLFQPVKLGNNIVALRSLGNNHFCTSLSIDGKSNCLN

Query:  ADLENPIVETEMEFAE--AVMSSRIENIEYRMKDAKIYGERVWSMVKGDAINKTRAADTVQFTFSFEDKMKRNWTNALGVKFGVSKQFTAGVPMIGDGSI
        A   + + E + E +E   + S RI++ +Y + D +IYGERVWSM KG AINKT   + ++FTFSFEDK  + WT+    +F V K+F    P I DG +
Subjt:  ADLENPIVETEMEFAE--AVMSSRIENIEYRMKDAKIYGERVWSMVKGDAINKTRAADTVQFTFSFEDKMKRNWTNALGVKFGVSKQFTAGVPMIGDGSI

Query:  TVSVVAGGEYAWGET-QKEKRFMSCSSTITVPPMSKVKMNAIVKRGFCNVPFSYTKIDTLRDGTQISREYDDGVFNGIQSYDFQFRSDKVVLPL
         +     G Y WGET   +K  MSC+STITVPPMSKVK+N +VKRGFC VPFSY + +   +G +  + Y DGVF G  SY FQ R+DK  LP+
Subjt:  TVSVVAGGEYAWGET-QKEKRFMSCSSTITVPPMSKVKMNAIVKRGFCNVPFSYTKIDTLRDGTQISREYDDGVFNGIQSYDFQFRSDKVVLPL

A0A6J1DQ71 uncharacterized protein LOC111022557 (e value: 4.8e-304; % identity: 100)
Query:  MLDSEVEDRLREERLENRYKEISGKDIEISGGEDDKSIIPKHFALQNYRPRFPQPKTAPYLRYVQDHEKQVDGFLQFSGKKLPSPVSKFHSEASESDPRF
        MLDSEVEDRLREERLENRYKEISGKDIEISGGEDDKSIIPKHFALQNYRPRFPQPKTAPYLRYVQDHEKQVDGFLQFSGKKLPSPVSKFHSEASESDPRF
Subjt:  MLDSEVEDRLREERLENRYKEISGKDIEISGGEDDKSIIPKHFALQNYRPRFPQPKTAPYLRYVQDHEKQVDGFLQFSGKKLPSPVSKFHSEASESDPRF

Query:  MHIRCSYNNKYWVRQSPDSNYIVAIGTKQENDQSKWSCTLFEPIYDADHKGYRFRHVQLGYELFRASLFDEFPDGLLAKEKGATIEEWEDNAFNTLIDWD
        MHIRCSYNNKYWVRQSPDSNYIVAIGTKQENDQSKWSCTLFEPIYDADHKGYRFRHVQLGYELFRASLFDEFPDGLLAKEKGATIEEWEDNAFNTLIDWD
Subjt:  MHIRCSYNNKYWVRQSPDSNYIVAIGTKQENDQSKWSCTLFEPIYDADHKGYRFRHVQLGYELFRASLFDEFPDGLLAKEKGATIEEWEDNAFNTLIDWD

Query:  SLVILPKHVTFKGSNGKYLKYNGHYLQFSGTDVENPSHIHEIFPKNDGTIRIKNVGCQKFWIRDPNWIVVVAEDSSRDDLNSLFQPVKLGNNIVALRSLG
        SLVILPKHVTFKGSNGKYLKYNGHYLQFSGTDVENPSHIHEIFPKNDGTIRIKNVGCQKFWIRDPNWIVVVAEDSSRDDLNSLFQPVKLGNNIVALRSLG
Subjt:  SLVILPKHVTFKGSNGKYLKYNGHYLQFSGTDVENPSHIHEIFPKNDGTIRIKNVGCQKFWIRDPNWIVVVAEDSSRDDLNSLFQPVKLGNNIVALRSLG

Query:  NNHFCTSLSIDGKSNCLNADLENPIVETEMEFAEAVMSSRIENIEYRMKDAKIYGERVWSMVKGDAINKTRAADTVQFTFSFEDKMKRNWTNALGVKFGV
        NNHFCTSLSIDGKSNCLNADLENPIVETEMEFAEAVMSSRIENIEYRMKDAKIYGERVWSMVKGDAINKTRAADTVQFTFSFEDKMKRNWTNALGVKFGV
Subjt:  NNHFCTSLSIDGKSNCLNADLENPIVETEMEFAEAVMSSRIENIEYRMKDAKIYGERVWSMVKGDAINKTRAADTVQFTFSFEDKMKRNWTNALGVKFGV

Query:  SKQFTAGVPMIGDGSITVSVVAGGEYAWGETQKEKRFMSCSSTITVPPMSKVKMNAIVKRGFCNVPFSYTKIDTLRDGTQISREYDDGVFNGIQSYDFQF
        SKQFTAGVPMIGDGSITVSVVAGGEYAWGETQKEKRFMSCSSTITVPPMSKVKMNAIVKRGFCNVPFSYTKIDTLRDGTQISREYDDGVFNGIQSYDFQF
Subjt:  SKQFTAGVPMIGDGSITVSVVAGGEYAWGETQKEKRFMSCSSTITVPPMSKVKMNAIVKRGFCNVPFSYTKIDTLRDGTQISREYDDGVFNGIQSYDFQF

Query:  RSDKVVLPL
        RSDKVVLPL
Subjt:  RSDKVVLPL

A0A6J1DTM1 uncharacterized protein LOC111024291 (e value: 4.8e-139; % identity: 52.6)
Query:  EDRLREERLENRYKEISGKDIEISGGEDDKSIIPKHFALQNYRPRFPQPKTAPYLRYVQDHEKQVDGFLQFSGKKLPSPVSKFHSEASESDPRFMHIRCS
        E  LRE  LEN+YK I+ K  + S        +PK+FALQ + P    PKT  YLR VQDHE    GFL+ SGK + SP SK  SEASES P+ +HIR  
Subjt:  EDRLREERLENRYKEISGKDIEISGGEDDKSIIPKHFALQNYRPRFPQPKTAPYLRYVQDHEKQVDGFLQFSGKKLPSPVSKFHSEASESDPRFMHIRCS

Query:  YNNKYWVRQSPDSNYIVAIGTKQENDQSKWSCTLFEPIY--DADHKGYRFRHVQLGYE-LFRASLFDEFPDGLLAKEKGATIE-----EWEDNAFNTLID
         NNKYWVRQSPDS YIV    ++E D+SKW+CTLF   Y     H+ +   HVQLG   L+R+   ++F + L A++K   ++        +++F+  +D
Subjt:  YNNKYWVRQSPDSNYIVAIGTKQENDQSKWSCTLFEPIY--DADHKGYRFRHVQLGYE-LFRASLFDEFPDGLLAKEKGATIE-----EWEDNAFNTLID

Query:  WDSLVILPKHVTFKGSNGKYLKYNGHYLQFSGTDVENPSHIHEIFPKNDGTIRIKNVGCQKFWIRDPNWIVVVAEDSSRDDLNSLFQPVKLGNNIVALRS
        WDSL I PKHVTFK                   DVE+ S IHEIFP+NDGTIRI+NVG +KFWIRDPNWI+ +AE  S+DD N+LF+ VK+ +NIVAL +
Subjt:  WDSLVILPKHVTFKGSNGKYLKYNGHYLQFSGTDVENPSHIHEIFPKNDGTIRIKNVGCQKFWIRDPNWIVVVAEDSSRDDLNSLFQPVKLGNNIVALRS

Query:  LGNNHFCTSLSIDGKSNCLNADLENPIVETEMEFAEAVMSSRIENIEYRMKDAKIYGERVWSMVKGDAINKTRAADTVQFTFSFEDKMKRNWTNALGVKF
                                       ME  +AV+S +IENIEY + DAKIYGERVWSM KGDA NKT AAD VQFTF+FEDK K +WTN LG +F
Subjt:  LGNNHFCTSLSIDGKSNCLNADLENPIVETEMEFAEAVMSSRIENIEYRMKDAKIYGERVWSMVKGDAINKTRAADTVQFTFSFEDKMKRNWTNALGVKF

Query:  GVSKQFTAGVPMIGDGSITVSVVAGGEYAWGETQKEKRFMSCSSTITVPPMSKVKMNAIVKRGFCNVPFSYTKIDTLRDGTQISREYDDGVFNGIQSYDF
        GVSK F+ G+P IG+G+I+VS   G  Y+WGET K+K  MSC+ST+T+PPMSKVKMN +VKRGFC+VPF YT+IDTLRDG QISREY+DG+F+G  SYDF
Subjt:  GVSKQFTAGVPMIGDGSITVSVVAGGEYAWGETQKEKRFMSCSSTITVPPMSKVKMNAIVKRGFCNVPFSYTKIDTLRDGTQISREYDDGVFNGIQSYDF

SwissProt top hits (e value, % identity, alignment)
Q5CZR5 Aerolysin-like protein (e value: 7.5e-04; % identity: 22.02)
Query:  FAEAVMSSRIENIEYRMKDAKIYGERVWSMVKGDAINKTRAADTVQFTFSFEDKMKRNWTNALGVKFGVSKQFTAGVPMIGDGSITVSVVAGGEYAWGET
        F  AV S+ + N+ Y   +  I       +      NKT      +   S +     +W+         S + +AG+P I + S   S+  G E      
Subjt:  FAEAVMSSRIENIEYRMKDAKIYGERVWSMVKGDAINKTRAADTVQFTFSFEDKMKRNWTNALGVKFGVSKQFTAGVPMIGDGSITVSVVAGGEYAWGET

Query:  QKEKRFMSCSSTITVPPMSKVKMNAIVKRGFCNVPFSYTKIDTLRDGTQISREYDDGVFNGIQSYDFQ
        Q +++  + ++T+ VPP  KV ++  + R   ++P++ T   T ++G+ +  E   G + G+   D +
Subjt:  QKEKRFMSCSSTITVPPMSKVKMNAIVKRGFCNVPFSYTKIDTLRDGTQISREYDDGVFNGIQSYDFQ

Q66S13 Natterin-4 (e value: 5.0e-08; % identity: 23.7)
Query:  VMSSRIENIEYRMKDAKIYGERVWSMVKGDAINKTRAADTVQFTFSFEDKMKRNWTNALGVKFGVSKQFTAGVPMIGDGSITVSVVAGGEYAWGETQKEK
        V+  +I N+ Y MK  +++ ++  ++      N      T Q T     +  ++W  +  +  GVS + +AG+P I D S+ VS     E + G ++ E 
Subjt:  VMSSRIENIEYRMKDAKIYGERVWSMVKGDAINKTRAADTVQFTFSFEDKMKRNWTNALGVKFGVSKQFTAGVPMIGDGSITVSVVAGGEYAWGETQKEK

Query:  RFMSCSSTITVPPMSKVKMNAIVKRGFCNVPFSYTKIDTLRDGTQISREYDDGVFNGIQSYDFQ---FRSDKV
           S S + T+PP S   +         N+PF+        +G +++     G++  +Q  + Q    R DK+
Subjt:  RFMSCSSTITVPPMSKVKMNAIVKRGFCNVPFSYTKIDTLRDGTQISREYDDGVFNGIQSYDFQ---FRSDKV

Q66S17 Natterin-3 (e value: 2.6e-04; % identity: 21.38)
Query:  VMSSRIENIEYRMKDAKIYGERVWSMVKGDAINKTRAADTVQFTFSFEDKMKRNWTNALGVKFGVSKQFTAGVPMIGDGSITVSVVAGGEYAWGETQKEK
        V+   +  + Y++  A        ++ +  A N      T         + +++W     V FGV    TAG+P I   +++VSV      + G T  + 
Subjt:  VMSSRIENIEYRMKDAKIYGERVWSMVKGDAINKTRAADTVQFTFSFEDKMKRNWTNALGVKFGVSKQFTAGVPMIGDGSITVSVVAGGEYAWGETQKEK

Query:  RFMSCSSTITVPPMSKVKMNAIVKRGFCNVPFSYTKIDTLRDGTQISREYDDGVFNGIQ
           + S  +TVPP     +  +  +   ++PF+     T R+G + +     G +  IQ
Subjt:  RFMSCSSTITVPPMSKVKMNAIVKRGFCNVPFSYTKIDTLRDGTQISREYDDGVFNGIQ

Q66S21 Natterin-2 (e value: 1.9e-07; % identity: 26.42)
Query:  IEYRMKDAKIYGERVWSMVKG-------DAINKTRAAD-TVQFTFSFEDKMKRNWTNALGVKFGVSKQFTAGVPMIGDGSITVSVVAGGEYAWGETQKEK
        IE  MKD K   E V +++KG         +N     + T   T + +      W     V FGV+   TAG+P +   S+ +S+ A  ++A G ++ E 
Subjt:  IEYRMKDAKIYGERVWSMVKG-------DAINKTRAAD-TVQFTFSFEDKMKRNWTNALGVKFGVSKQFTAGVPMIGDGSITVSVVAGGEYAWGETQKEK

Query:  RFMSCSSTITVPPMSKVKMNAIVKRGFCNVPFSYTKIDTLRDGTQISREYDDGVFNGIQ
        +    + ++ VPP     ++ + +    ++PF+ T I T R G + ++    GV+  IQ
Subjt:  RFMSCSSTITVPPMSKVKMNAIVKRGFCNVPFSYTKIDTLRDGTQISREYDDGVFNGIQ

Q66S25 Natterin-1 (e value: 3.8e-08; % identity: 25.17)
Query:  VMSSRIENIEYRMKDAKIYGERVWSMVKGDAINKTRAADTVQFTFSFEDKMKRNWTNALGVKFGVSKQFTAGVPMIGDGSITVSVVAGGEYAWGETQKEK
        V+   +++++Y+ +   +   +   M K    NK     T   T S +   +  W     V FGV+   TAG+P +   S+ VS+ A  ++A G ++ E 
Subjt:  VMSSRIENIEYRMKDAKIYGERVWSMVKGDAINKTRAADTVQFTFSFEDKMKRNWTNALGVKFGVSKQFTAGVPMIGDGSITVSVVAGGEYAWGETQKEK

Query:  RFMSCSSTITVPPMSKVKMNAIVKRGFCNVPFSYTKIDTLRDG
        +    + ++ VPP     ++ + +    +VPF+ T I T R G
Subjt:  RFMSCSSTITVPPMSKVKMNAIVKRGFCNVPFSYTKIDTLRDG

Arabidopsis top hits
No hits found
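
The alignments in the Homology section use the BLAST-style pairwise layout, where the line between Query and Subjt shows a letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch or gap. The sketch below counts identities from that middle line. It uses the final alignment block of the XP_038906851.1 hit (`VVLPL` over `V LPL`) as input, and the helper name `midline_stats` is hypothetical. The % identity figures reported on this page cover each full alignment, so a single block is not expected to reproduce them.

```python
def midline_stats(midline):
    """Count identities and positives in a BLAST-style middle line."""
    # Letters mark identical residues; '+' marks conservative substitutions.
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives

# Last alignment block of the XP_038906851.1 hit, without the 'Query:' prefix.
query = "VVLPL"
midline = "V LPL"

identities, positives = midline_stats(midline)

# Aligned columns include gap characters, so the query row length is the denominator.
percent_identity = 100.0 * identities / len(query)
print(identities, positives, percent_identity)
```

Reading identities from the middle line rather than comparing Query against Subjt is a deliberate choice here, because the Subjt rows in this record appear to repeat the query sequence.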

Sequences
CDS sequence
ATGTTGGACTCTGAAGTAGAGGATAGATTGAGGGAAGAACGGCTGGAAAATAGGTACAAAGAAATATCAGGAAAAGACATCGAAATATCAGGTGGTGAAGACGATAAATC
CATAATCCCAAAACATTTTGCTTTGCAAAACTACAGGCCAAGGTTTCCACAGCCCAAAACTGCACCATATCTTCGCTATGTACAAGATCATGAGAAACAAGTAGATGGAT
TCCTCCAATTCTCTGGAAAAAAGCTGCCGAGTCCAGTCTCCAAGTTCCACTCCGAGGCCTCCGAATCCGACCCCCGATTCATGCACATACGATGCAGTTACAACAATAAA
TACTGGGTTCGTCAGTCGCCTGACTCCAACTACATTGTTGCCATCGGCACAAAGCAAGAAAACGACCAGTCCAAATGGTCGTGCACGCTGTTTGAGCCCATCTACGATGC
TGACCACAAAGGCTACCGCTTTCGCCATGTGCAGCTCGGCTACGAGCTGTTTCGAGCCAGCTTGTTCGACGAATTCCCCGATGGCCTCCTGGCGAAGGAGAAGGGTGCAA
CTATTGAAGAATGGGAGGATAATGCATTCAATACACTTATCGATTGGGATTCATTAGTTATACTCCCAAAACATGTGACCTTTAAGGGTAGCAATGGAAAGTACTTGAAA
TACAATGGTCATTATTTGCAATTCTCTGGAACAGATGTTGAAAATCCATCGCATATCCATGAAATCTTCCCAAAGAATGATGGAACTATTCGTATCAAGAATGTGGGTTG
CCAAAAGTTCTGGATTCGTGATCCTAATTGGATAGTCGTGGTAGCAGAAGATAGCAGTAGGGACGATCTCAACTCATTGTTTCAGCCAGTGAAACTTGGCAACAACATTG
TGGCTCTTCGTAGCTTGGGCAACAACCACTTCTGCACAAGCCTCAGCATAGATGGAAAGTCAAATTGCTTGAATGCTGACCTGGAAAATCCAATCGTAGAAACCGAAATG
GAATTCGCAGAGGCTGTAATGTCGAGCAGAATAGAAAACATTGAGTATCGCATGAAGGATGCCAAAATCTACGGCGAGAGGGTGTGGTCGATGGTTAAAGGAGATGCTAT
TAACAAAACAAGAGCCGCGGACACCGTGCAATTCACATTCTCTTTTGAGGATAAAATGAAGAGGAATTGGACCAATGCATTGGGTGTCAAATTTGGAGTCTCAAAACAAT
TCACTGCCGGGGTTCCAATGATTGGAGATGGAAGCATTACTGTTTCTGTTGTGGCTGGTGGAGAGTATGCATGGGGAGAGACACAAAAAGAGAAAAGGTTCATGTCTTGT
AGCAGCACTATAACAGTGCCTCCAATGTCGAAAGTGAAGATGAATGCAATCGTAAAACGGGGCTTTTGCAACGTCCCTTTTTCGTATACTAAGATCGACACTCTCCGAGA
TGGAACACAGATCTCCCGTGAGTATGACGACGGAGTTTTCAATGGCATTCAGTCCTACGACTTCCAATTTAGGTCCGATAAGGTAGTACTGCCTTTGTGA
mRNA sequence
ATGTTGGACTCTGAAGTAGAGGATAGATTGAGGGAAGAACGGCTGGAAAATAGGTACAAAGAAATATCAGGAAAAGACATCGAAATATCAGGTGGTGAAGACGATAAATC
CATAATCCCAAAACATTTTGCTTTGCAAAACTACAGGCCAAGGTTTCCACAGCCCAAAACTGCACCATATCTTCGCTATGTACAAGATCATGAGAAACAAGTAGATGGAT
TCCTCCAATTCTCTGGAAAAAAGCTGCCGAGTCCAGTCTCCAAGTTCCACTCCGAGGCCTCCGAATCCGACCCCCGATTCATGCACATACGATGCAGTTACAACAATAAA
TACTGGGTTCGTCAGTCGCCTGACTCCAACTACATTGTTGCCATCGGCACAAAGCAAGAAAACGACCAGTCCAAATGGTCGTGCACGCTGTTTGAGCCCATCTACGATGC
TGACCACAAAGGCTACCGCTTTCGCCATGTGCAGCTCGGCTACGAGCTGTTTCGAGCCAGCTTGTTCGACGAATTCCCCGATGGCCTCCTGGCGAAGGAGAAGGGTGCAA
CTATTGAAGAATGGGAGGATAATGCATTCAATACACTTATCGATTGGGATTCATTAGTTATACTCCCAAAACATGTGACCTTTAAGGGTAGCAATGGAAAGTACTTGAAA
TACAATGGTCATTATTTGCAATTCTCTGGAACAGATGTTGAAAATCCATCGCATATCCATGAAATCTTCCCAAAGAATGATGGAACTATTCGTATCAAGAATGTGGGTTG
CCAAAAGTTCTGGATTCGTGATCCTAATTGGATAGTCGTGGTAGCAGAAGATAGCAGTAGGGACGATCTCAACTCATTGTTTCAGCCAGTGAAACTTGGCAACAACATTG
TGGCTCTTCGTAGCTTGGGCAACAACCACTTCTGCACAAGCCTCAGCATAGATGGAAAGTCAAATTGCTTGAATGCTGACCTGGAAAATCCAATCGTAGAAACCGAAATG
GAATTCGCAGAGGCTGTAATGTCGAGCAGAATAGAAAACATTGAGTATCGCATGAAGGATGCCAAAATCTACGGCGAGAGGGTGTGGTCGATGGTTAAAGGAGATGCTAT
TAACAAAACAAGAGCCGCGGACACCGTGCAATTCACATTCTCTTTTGAGGATAAAATGAAGAGGAATTGGACCAATGCATTGGGTGTCAAATTTGGAGTCTCAAAACAAT
TCACTGCCGGGGTTCCAATGATTGGAGATGGAAGCATTACTGTTTCTGTTGTGGCTGGTGGAGAGTATGCATGGGGAGAGACACAAAAAGAGAAAAGGTTCATGTCTTGT
AGCAGCACTATAACAGTGCCTCCAATGTCGAAAGTGAAGATGAATGCAATCGTAAAACGGGGCTTTTGCAACGTCCCTTTTTCGTATACTAAGATCGACACTCTCCGAGA
TGGAACACAGATCTCCCGTGAGTATGACGACGGAGTTTTCAATGGCATTCAGTCCTACGACTTCCAATTTAGGTCCGATAAGGTAGTACTGCCTTTGTGA
Protein sequence
MLDSEVEDRLREERLENRYKEISGKDIEISGGEDDKSIIPKHFALQNYRPRFPQPKTAPYLRYVQDHEKQVDGFLQFSGKKLPSPVSKFHSEASESDPRFMHIRCSYNNK
YWVRQSPDSNYIVAIGTKQENDQSKWSCTLFEPIYDADHKGYRFRHVQLGYELFRASLFDEFPDGLLAKEKGATIEEWEDNAFNTLIDWDSLVILPKHVTFKGSNGKYLK
YNGHYLQFSGTDVENPSHIHEIFPKNDGTIRIKNVGCQKFWIRDPNWIVVVAEDSSRDDLNSLFQPVKLGNNIVALRSLGNNHFCTSLSIDGKSNCLNADLENPIVETEM
EFAEAVMSSRIENIEYRMKDAKIYGERVWSMVKGDAINKTRAADTVQFTFSFEDKMKRNWTNALGVKFGVSKQFTAGVPMIGDGSITVSVVAGGEYAWGETQKEKRFMSC
SSTITVPPMSKVKMNAIVKRGFCNVPFSYTKIDTLRDGTQISREYDDGVFNGIQSYDFQFRSDKVVLPL
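
The protein sequence above should correspond to a translation of the CDS. As a minimal sketch, assuming the standard nuclear genetic code applies (the page does not name a translation table), the first 30 and last 24 nucleotides of the CDS can be translated and compared with the two ends of the protein. The `translate` helper is written for illustration and is not part of any database tooling.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds):
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# First 30 and last 24 nucleotides of the CDS listed in this record.
cds_start = "ATGTTGGACTCTGAAGTAGAGGATAGATTG"
cds_end = "GATAAGGTAGTACTGCCTTTGTGA"

print(translate(cds_start))  # compare with the start of the protein sequence
print(translate(cds_end))    # compare with the end of the protein sequence
```

The two fragments translate to `MLDSEVEDRL` and `DKVVLPL`, matching the first and last residues of the protein sequence. The closing `TGA` codon is a stop and contributes no residue.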