; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS002933 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS002933
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionAgglutinin domain-containing protein
Genome locationscaffold595_1:106795..109948
RNA-Seq ExpressionMS002933
SyntenyMS002933
Gene Ontology termsNA
InterPro domainsIPR004991 - Aerolysin-like toxin
IPR008998 - Agglutinin domain
IPR036242 - Agglutinin domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039763.1 uncharacterized protein E6C27_scaffold122G00260 [Cucumis melo var. makuwa]7.5e-14453.48Show/hide
Query:  EISGGEDDKSMIPKHFALQNYRPRFPQPKTAPYLRYVQDHEKQVDGFLQFSGKK-LPSPVSKFHTEASESDPRFMHIRCSYNNKYWVRQSPDSNYIVAIG
        ++S   DD+S+IPK+FALQNY PR PQP TAP+L+ +        G+L+F+ +  L SP+SKF +E S+SD + +HIRC+ NN+YWVR+S DSN+IV   
Subjt:  EISGGEDDKSMIPKHFALQNYRPRFPQPKTAPYLRYVQDHEKQVDGFLQFSGKK-LPSPVSKFHTEASESDPRFMHIRCSYNNKYWVRQSPDSNYIVAIG

Query:  TKQENDQSKWSCTLFEPIYDADHKGYRFRHVQLGYELFRASLFDEFPDGLLAKEKGATIEEWED--NAFNTLIDWDSLVVLPKHVTFKGSNGKYLKYNGH
        TK+E+D+SKWSCTLFEPI DA +K YRFRHVQLGYELFR     +  + L A E G    E ED    F  +IDW+SL V PKHVTFKG NG+YL++ G 
Subjt:  TKQENDQSKWSCTLFEPIYDADHKGYRFRHVQLGYELFRASLFDEFPDGLLAKEKGATIEEWED--NAFNTLIDWDSLVVLPKHVTFKGSNGKYLKYNGH

Query:  YLQFSGTDVENPSHIHEIFPNNDGTIRIKNVGCQKFWIRDPNWIVAVAEDSSRDDPNSLFQPVKLGNNNIVALRSLGNNHFCTSLSIDGKSNCLNANLEN
        YLQ SG D  +PS IHEIFP  DG + +KNV  ++FWI DPNWIVA A D +RDD N  F PV L ++N+VALR LGN  FCT ++ D K NCLNA+  +
Subjt:  YLQFSGTDVENPSHIHEIFPNNDGTIRIKNVGCQKFWIRDPNWIVAVAEDSSRDDPNSLFQPVKLGNNNIVALRSLGNNHFCTSLSIDGKSNCLNANLEN

Query:  PIVETEMEFAE--AVMSSRIENIEYRMKDAKIYGERMWSMAKGDAINKTRVADTVQFTFSFEDKRKRNWTNALGAKFGVSKQFTAGVPMIGDGSITVSVV
         + E + E +E   + S RI++ +Y + D +IYGER+WSMAKG AINKT   + ++FTFSFEDKR + WT+    +F V K+F    P I DG + +   
Subjt:  PIVETEMEFAE--AVMSSRIENIEYRMKDAKIYGERMWSMAKGDAINKTRVADTVQFTFSFEDKRKRNWTNALGAKFGVSKQFTAGVPMIGDGSITVSVV

Query:  AGGEYAWGET-QKEKRFMSCSSTITVPPMSKVKMNAIVKRGFCDVPFSYTKIDTLRDGTQISREYDDGVFTGIQSYDFQFRSDKVVLP
          G Y WGET   +K  MSC+STITVPPMSKVK+N +VKRGFC+VPFSY + +   +G +  + Y DGVFTG  SY FQ R+DK  LP
Subjt:  AGGEYAWGET-QKEKRFMSCSSTITVPPMSKVKMNAIVKRGFCDVPFSYTKIDTLRDGTQISREYDDGVFTGIQSYDFQFRSDKVVLP

XP_004140504.1 uncharacterized protein LOC101208463 [Cucumis sativus]2.0e-14453.29Show/hide
Query:  SGGEDDKSMIPKHFALQNYRPRFPQPKTAPYLRYVQDHEKQVDGFLQFSGK-KLPSPVSKFHTEASESDPRFMHIRCSYNNKYWVRQSPDSNYIVAIGTK
        S   DDKS+IPK+FALQNY PR PQP+TAP+L+      +   G+L+F+G+  L SP SKF +E SESDP+ +HIRC+ NNKYWVR+S DSN+IV   TK
Subjt:  SGGEDDKSMIPKHFALQNYRPRFPQPKTAPYLRYVQDHEKQVDGFLQFSGK-KLPSPVSKFHTEASESDPRFMHIRCSYNNKYWVRQSPDSNYIVAIGTK

Query:  QENDQSKWSCTLFEPIYDADHKGYRFRHVQLGYELFRASLFDEFPDGLLAKEKGATIEEWED--NAFNTLIDWDSLVVLPKHVTFKGSNGKYLKYNGHYL
        +E+++SK SCTLF+PIYDA HK Y FRHVQLGYELFR     +  + LLA+E G    E ED    F  +IDW+SL V PK VT KG NG+YL+Y G YL
Subjt:  QENDQSKWSCTLFEPIYDADHKGYRFRHVQLGYELFRASLFDEFPDGLLAKEKGATIEEWED--NAFNTLIDWDSLVVLPKHVTFKGSNGKYLKYNGHYL

Query:  QFSGTDVENPSHIHEIFPNNDGTIRIKNVGCQKFWIRDPNWIVAVAEDSSRDDPNSLFQPVKLGNNNIVALRSLGNNHFCTSLSIDGKSNCLNANLENPI
        Q +G +  +PS IHEI+P  DG ++IKN+   +FWI DP+WIVA A D +RDDP  LF+PV L ++N+V   SLGN   C  +S+D K NCLNA   +P 
Subjt:  QFSGTDVENPSHIHEIFPNNDGTIRIKNVGCQKFWIRDPNWIVAVAEDSSRDDPNSLFQPVKLGNNNIVALRSLGNNHFCTSLSIDGKSNCLNANLENPI

Query:  VETEMEFAE--AVMSSRIENIEYRMKDAKIYGERMWSMAKGDAINKTRVADTVQFTFSFEDKRKRNWTNALGAKFGVSKQFTAGVPMIGDGSITVSVVAG
         ET+ + +E   +   +I+ ++Y++++ +IYGER+WS+AKG AINKT   D ++FTFSFEDKR + WT+    +F  +K F A  P I DG +      G
Subjt:  VETEMEFAE--AVMSSRIENIEYRMKDAKIYGERMWSMAKGDAINKTRVADTVQFTFSFEDKRKRNWTNALGAKFGVSKQFTAGVPMIGDGSITVSVVAG

Query:  GEYAWGET-QKEKRFMSCSSTITVPPMSKVKMNAIVKRGFCDVPFSYTKIDTLRDGTQISREYDDGVFTGIQSYDFQFRSDKVVLP
        G Y W ET  K+K  MSC+STITVPP SKVK+N +VKRGFC+VPFSYT+I+T  +G   ++ Y+DGVFTG+ SY FQ  +DKV LP
Subjt:  GEYAWGET-QKEKRFMSCSSTITVPPMSKVKMNAIVKRGFCDVPFSYTKIDTLRDGTQISREYDDGVFTGIQSYDFQFRSDKVVLP

XP_022155409.1 uncharacterized protein LOC111022557 [Momordica charantia]4.2e-28096.89Show/hide
Query:  EISGGEDDKSMIPKHFALQNYRPRFPQPKTAPYLRYVQDHEKQVDGFLQFSGKKLPSPVSKFHTEASESDPRFMHIRCSYNNKYWVRQSPDSNYIVAIGT
        EISGGEDDKS+IPKHFALQNYRPRFPQPKTAPYLRYVQDHEKQVDGFLQFSGKKLPSPVSKFH+EASESDPRFMHIRCSYNNKYWVRQSPDSNYIVAIGT
Subjt:  EISGGEDDKSMIPKHFALQNYRPRFPQPKTAPYLRYVQDHEKQVDGFLQFSGKKLPSPVSKFHTEASESDPRFMHIRCSYNNKYWVRQSPDSNYIVAIGT

Query:  KQENDQSKWSCTLFEPIYDADHKGYRFRHVQLGYELFRASLFDEFPDGLLAKEKGATIEEWEDNAFNTLIDWDSLVVLPKHVTFKGSNGKYLKYNGHYLQ
        KQENDQSKWSCTLFEPIYDADHKGYRFRHVQLGYELFRASLFDEFPDGLLAKEKGATIEEWEDNAFNTLIDWDSLV+LPKHVTFKGSNGKYLKYNGHYLQ
Subjt:  KQENDQSKWSCTLFEPIYDADHKGYRFRHVQLGYELFRASLFDEFPDGLLAKEKGATIEEWEDNAFNTLIDWDSLVVLPKHVTFKGSNGKYLKYNGHYLQ

Query:  FSGTDVENPSHIHEIFPNNDGTIRIKNVGCQKFWIRDPNWIVAVAEDSSRDDPNSLFQPVKLGNNNIVALRSLGNNHFCTSLSIDGKSNCLNANLENPIV
        FSGTDVENPSHIHEIFP NDGTIRIKNVGCQKFWIRDPNWIV VAEDSSRDD NSLFQPVKLG NNIVALRSLGNNHFCTSLSIDGKSNCLNA+LENPIV
Subjt:  FSGTDVENPSHIHEIFPNNDGTIRIKNVGCQKFWIRDPNWIVAVAEDSSRDDPNSLFQPVKLGNNNIVALRSLGNNHFCTSLSIDGKSNCLNANLENPIV

Query:  ETEMEFAEAVMSSRIENIEYRMKDAKIYGERMWSMAKGDAINKTRVADTVQFTFSFEDKRKRNWTNALGAKFGVSKQFTAGVPMIGDGSITVSVVAGGEY
        ETEMEFAEAVMSSRIENIEYRMKDAKIYGER+WSM KGDAINKTR ADTVQFTFSFEDK KRNWTNALG KFGVSKQFTAGVPMIGDGSITVSVVAGGEY
Subjt:  ETEMEFAEAVMSSRIENIEYRMKDAKIYGERMWSMAKGDAINKTRVADTVQFTFSFEDKRKRNWTNALGAKFGVSKQFTAGVPMIGDGSITVSVVAGGEY

Query:  AWGETQKEKRFMSCSSTITVPPMSKVKMNAIVKRGFCDVPFSYTKIDTLRDGTQISREYDDGVFTGIQSYDFQFRSDKVVLP
        AWGETQKEKRFMSCSSTITVPPMSKVKMNAIVKRGFC+VPFSYTKIDTLRDGTQISREYDDGVF GIQSYDFQFRSDKVVLP
Subjt:  AWGETQKEKRFMSCSSTITVPPMSKVKMNAIVKRGFCDVPFSYTKIDTLRDGTQISREYDDGVFTGIQSYDFQFRSDKVVLP

XP_038906851.1 uncharacterized protein LOC120092742 [Benincasa hispida]4.6e-19466.53Show/hide
Query:  GEDDKSMIPKHFALQNYRPRFPQPKTAPYLRYVQDHEKQVDGFLQFSGKKLPSPVSKFHTEASESDPRFMHIRCSYNNKYWVRQSPDSNYIVAIGTKQEN
        GEDDKS+IP+ FALQN  P+ PQPKTAPYLRYV + +K  +G L FSGK + SP SKF +E SE+DP+  HI+C YNNKYWVR+S +S+YI+A  TK+E 
Subjt:  GEDDKSMIPKHFALQNYRPRFPQPKTAPYLRYVQDHEKQVDGFLQFSGKKLPSPVSKFHTEASESDPRFMHIRCSYNNKYWVRQSPDSNYIVAIGTKQEN

Query:  DQSKWSCTLFEPIYDADHKGYRFRHVQLGYELFRASLFDEFPDGLLAKEKGATIEEWEDNAFNTLIDWDSLVVLPKHVTFKGSNGKYLKYNGHYLQFSGT
        D+SKW+CTLFEPIYD+D+K +RF HVQ   ELFRA  +D + D LLAKE  AT+   ED  F T+IDW SL + PKHVTFKG NGKYLK+ G++LQFSGT
Subjt:  DQSKWSCTLFEPIYDADHKGYRFRHVQLGYELFRASLFDEFPDGLLAKEKGATIEEWEDNAFNTLIDWDSLVVLPKHVTFKGSNGKYLKYNGHYLQFSGT

Query:  DVENPSHIHEIFPNNDGTIRIKNVGCQKFWIRDPNWIVAVAEDSSRDDPNSLFQPVKLGNNNIVALRSLGNNHFCTSLSIDGKSNCLNANLENPIVETEM
        D+E+PS IHEIFP NDGT+RIKNVG +KFWIRD NWI+A A + S +DPN+ FQPVKLG +NIVALR+LGNNHFCTSLS+D K++CLNAN  NP  E  M
Subjt:  DVENPSHIHEIFPNNDGTIRIKNVGCQKFWIRDPNWIVAVAEDSSRDDPNSLFQPVKLGNNNIVALRSLGNNHFCTSLSIDGKSNCLNANLENPIVETEM

Query:  EFAEAVMSSRIENIEYRMKDAKIYGERMWSMAKGDAINKTRVADTVQFTFSFEDKRKRNWTNALGAKFGVSKQFTAGVPMIGDGSITVSVVAGGEYAWGE
        E +EAV+SS+IENIEYR++DAKIYGER+WSMAKGDAINKT+ ADT+QFTFSFEDKRK+NWTN +  KFGV+++FTAGVP+IGD  + +    GG Y+WGE
Subjt:  EFAEAVMSSRIENIEYRMKDAKIYGERMWSMAKGDAINKTRVADTVQFTFSFEDKRKRNWTNALGAKFGVSKQFTAGVPMIGDGSITVSVVAGGEYAWGE

Query:  TQKEKRFMSCSSTITVPPMSKVKMNAIVKRGFCDVPFSYTKIDTLRDGTQISREYDDGVFTGIQSYDFQFRSDKVVLP
        T K+K  M+CSSTITVPPMSKVK++ +VKRGFC+VP+SYT+ DTLRDG Q + EY+DGVF+G+ SY F  R+DKV LP
Subjt:  TQKEKRFMSCSSTITVPPMSKVKMNAIVKRGFCDVPFSYTKIDTLRDGTQISREYDDGVFTGIQSYDFQFRSDKVVLP

XP_038906982.1 uncharacterized protein LOC120092830 [Benincasa hispida]9.3e-19566.74Show/hide
Query:  GEDDKSMIPKHFALQNYRPRFPQPKTAPYLRYVQDHEKQVDGFLQFSGKKLPSPVSKFHTEASESDPRFMHIRCSYNNKYWVRQSPDSNYIVAIGTKQEN
        GEDDKS+IP+ FALQN  P+ PQPKTAPYLRYV + +K  +G L FSGK + SP SKF +E SE+DP+  HI+C YNNKYWVR+S +S+YI+A  TK+E 
Subjt:  GEDDKSMIPKHFALQNYRPRFPQPKTAPYLRYVQDHEKQVDGFLQFSGKKLPSPVSKFHTEASESDPRFMHIRCSYNNKYWVRQSPDSNYIVAIGTKQEN

Query:  DQSKWSCTLFEPIYDADHKGYRFRHVQLGYELFRASLFDEFPDGLLAKEKGATIEEWEDNAFNTLIDWDSLVVLPKHVTFKGSNGKYLKYNGHYLQFSGT
        D+SKW+CTLFEPIYD+D+K +RF HVQ   ELFRA  +D + D LLAKE  AT+   ED  F T+IDW SL + PKHVTFKG NGKYLK+ G++LQFSGT
Subjt:  DQSKWSCTLFEPIYDADHKGYRFRHVQLGYELFRASLFDEFPDGLLAKEKGATIEEWEDNAFNTLIDWDSLVVLPKHVTFKGSNGKYLKYNGHYLQFSGT

Query:  DVENPSHIHEIFPNNDGTIRIKNVGCQKFWIRDPNWIVAVAEDSSRDDPNSLFQPVKLGNNNIVALRSLGNNHFCTSLSIDGKSNCLNANLENPIVETEM
        D+E+PS IHEIFP NDGT+RIKNVG +KFWIRD NWI+A A + S +DPN+ FQPVKLG +NIVALR+LGNNHFCTSLS+D K++CLNAN  NP  E  M
Subjt:  DVENPSHIHEIFPNNDGTIRIKNVGCQKFWIRDPNWIVAVAEDSSRDDPNSLFQPVKLGNNNIVALRSLGNNHFCTSLSIDGKSNCLNANLENPIVETEM

Query:  EFAEAVMSSRIENIEYRMKDAKIYGERMWSMAKGDAINKTRVADTVQFTFSFEDKRKRNWTNALGAKFGVSKQFTAGVPMIGDGSITVSVVAGGEYAWGE
        E +EAV+SS+IENIEYR++DAKIYGER+WSMAKGDAINKT+ ADT+QFTFSFEDKRK+NWTN +  KFGV+++FTAGVP+IGD  + +    GG Y+WGE
Subjt:  EFAEAVMSSRIENIEYRMKDAKIYGERMWSMAKGDAINKTRVADTVQFTFSFEDKRKRNWTNALGAKFGVSKQFTAGVPMIGDGSITVSVVAGGEYAWGE

Query:  TQKEKRFMSCSSTITVPPMSKVKMNAIVKRGFCDVPFSYTKIDTLRDGTQISREYDDGVFTGIQSYDFQFRSDKVVLP
        T K+K  M+CSSTITVPPMSKVK++ +VKRGFC+VP+SYT+ DTLRDG Q + EY+DGVF+G+ SY FQ R+DKV LP
Subjt:  TQKEKRFMSCSSTITVPPMSKVKMNAIVKRGFCDVPFSYTKIDTLRDGTQISREYDDGVFTGIQSYDFQFRSDKVVLP

TrEMBL top hitse value%identityAlignment
A0A0A0KAP4 Agglutinin domain-containing protein2.6e-14253.86Show/hide
Query:  DDKSMIPKHFALQNYRPRFPQPKTAPYLRYVQDHEKQVDGFLQFSGKK-LPSPVSKFHTEASESDPRFMHIRCSYNNKYWVRQSPDSNYIVAIGTKQEND
        DDKS+ PK+FALQNY PR PQP+TAP+L+Y+       + +L+F+G+  L  P SKF +E S+S+P+ +HIRC+  NKYWVR+S DSN+IV I TK+E++
Subjt:  DDKSMIPKHFALQNYRPRFPQPKTAPYLRYVQDHEKQVDGFLQFSGKK-LPSPVSKFHTEASESDPRFMHIRCSYNNKYWVRQSPDSNYIVAIGTKQEND

Query:  QSKWSCTLFEPIYDADHKGYRFRHVQLGYELFRASLFDEFPDGLLAKEKGATIEEWED--NAFNTLIDWDSLVVLPKHVTFKGSNGKYLKYNGHYLQFSG
         SK SCTLFEPIYDA +K YRFRHVQLGYELFR     +  D LLA+E G+   E ED    F  +IDW+SL V PKHVTFKG NGKYL++ G YLQ SG
Subjt:  QSKWSCTLFEPIYDADHKGYRFRHVQLGYELFRASLFDEFPDGLLAKEKGATIEEWED--NAFNTLIDWDSLVVLPKHVTFKGSNGKYLKYNGHYLQFSG

Query:  TDVENPSHIHEIFPNNDGTIRIKNVGCQKFWIRDPNWIVAVAEDSSRDDPNSLFQPVKLGNNNIVALRSLGNNHFCTSLSIDGKSNCLNANLENPIVETE
         +  + S IHEI+P  DG + IKN+  ++FWI DPNWIVA A D +RDDPN LFQPV L +NN+VALRSLGN  FC  +S+D + NCLNA   +P  ET+
Subjt:  TDVENPSHIHEIFPNNDGTIRIKNVGCQKFWIRDPNWIVAVAEDSSRDDPNSLFQPVKLGNNNIVALRSLGNNHFCTSLSIDGKSNCLNANLENPIVETE

Query:  MEFAEAVMSSRIE---NIEYRMKDAKIYGERMWSMAKGDAINKTRVADTVQFTFSFEDKRKRNWTNALGAKFGVSKQFTAGVPMIGDGSITVSVVAGGEY
         E +E  +  R +   NI YR+ + +IYGER+WSMAKG AINKT   + ++FTFSFED+R   WTN    +F  +K F A  P+I DG IT+        
Subjt:  MEFAEAVMSSRIE---NIEYRMKDAKIYGERMWSMAKGDAINKTRVADTVQFTFSFEDKRKRNWTNALGAKFGVSKQFTAGVPMIGDGSITVSVVAGGEY

Query:  AWGET-QKEKRFMSCSSTITVPPMSKVKMNAIVKRGFCDVPFSYTKIDT---------LRDGTQISREYDDGVFTGIQSYDFQFRSDKVVLP
         WGET +K+K  MSC +TITVPPMSKVK+N +VKRGFC+VPFSY    T          RDG      + DG FTG+ SY FQ  +D+  LP
Subjt:  AWGET-QKEKRFMSCSSTITVPPMSKVKMNAIVKRGFCDVPFSYTKIDT---------LRDGTQISREYDDGVFTGIQSYDFQFRSDKVVLP

A0A0A0KFN1 Agglutinin domain-containing protein9.5e-14553.29Show/hide
Query:  SGGEDDKSMIPKHFALQNYRPRFPQPKTAPYLRYVQDHEKQVDGFLQFSGK-KLPSPVSKFHTEASESDPRFMHIRCSYNNKYWVRQSPDSNYIVAIGTK
        S   DDKS+IPK+FALQNY PR PQP+TAP+L+      +   G+L+F+G+  L SP SKF +E SESDP+ +HIRC+ NNKYWVR+S DSN+IV   TK
Subjt:  SGGEDDKSMIPKHFALQNYRPRFPQPKTAPYLRYVQDHEKQVDGFLQFSGK-KLPSPVSKFHTEASESDPRFMHIRCSYNNKYWVRQSPDSNYIVAIGTK

Query:  QENDQSKWSCTLFEPIYDADHKGYRFRHVQLGYELFRASLFDEFPDGLLAKEKGATIEEWED--NAFNTLIDWDSLVVLPKHVTFKGSNGKYLKYNGHYL
        +E+++SK SCTLF+PIYDA HK Y FRHVQLGYELFR     +  + LLA+E G    E ED    F  +IDW+SL V PK VT KG NG+YL+Y G YL
Subjt:  QENDQSKWSCTLFEPIYDADHKGYRFRHVQLGYELFRASLFDEFPDGLLAKEKGATIEEWED--NAFNTLIDWDSLVVLPKHVTFKGSNGKYLKYNGHYL

Query:  QFSGTDVENPSHIHEIFPNNDGTIRIKNVGCQKFWIRDPNWIVAVAEDSSRDDPNSLFQPVKLGNNNIVALRSLGNNHFCTSLSIDGKSNCLNANLENPI
        Q +G +  +PS IHEI+P  DG ++IKN+   +FWI DP+WIVA A D +RDDP  LF+PV L ++N+V   SLGN   C  +S+D K NCLNA   +P 
Subjt:  QFSGTDVENPSHIHEIFPNNDGTIRIKNVGCQKFWIRDPNWIVAVAEDSSRDDPNSLFQPVKLGNNNIVALRSLGNNHFCTSLSIDGKSNCLNANLENPI

Query:  VETEMEFAE--AVMSSRIENIEYRMKDAKIYGERMWSMAKGDAINKTRVADTVQFTFSFEDKRKRNWTNALGAKFGVSKQFTAGVPMIGDGSITVSVVAG
         ET+ + +E   +   +I+ ++Y++++ +IYGER+WS+AKG AINKT   D ++FTFSFEDKR + WT+    +F  +K F A  P I DG +      G
Subjt:  VETEMEFAE--AVMSSRIENIEYRMKDAKIYGERMWSMAKGDAINKTRVADTVQFTFSFEDKRKRNWTNALGAKFGVSKQFTAGVPMIGDGSITVSVVAG

Query:  GEYAWGET-QKEKRFMSCSSTITVPPMSKVKMNAIVKRGFCDVPFSYTKIDTLRDGTQISREYDDGVFTGIQSYDFQFRSDKVVLP
        G Y W ET  K+K  MSC+STITVPP SKVK+N +VKRGFC+VPFSYT+I+T  +G   ++ Y+DGVFTG+ SY FQ  +DKV LP
Subjt:  GEYAWGET-QKEKRFMSCSSTITVPPMSKVKMNAIVKRGFCDVPFSYTKIDTLRDGTQISREYDDGVFTGIQSYDFQFRSDKVVLP

A0A5D3DM66 Agglutinin domain-containing protein3.6e-14453.48Show/hide
Query:  EISGGEDDKSMIPKHFALQNYRPRFPQPKTAPYLRYVQDHEKQVDGFLQFSGKK-LPSPVSKFHTEASESDPRFMHIRCSYNNKYWVRQSPDSNYIVAIG
        ++S   DD+S+IPK+FALQNY PR PQP TAP+L+ +        G+L+F+ +  L SP+SKF +E S+SD + +HIRC+ NN+YWVR+S DSN+IV   
Subjt:  EISGGEDDKSMIPKHFALQNYRPRFPQPKTAPYLRYVQDHEKQVDGFLQFSGKK-LPSPVSKFHTEASESDPRFMHIRCSYNNKYWVRQSPDSNYIVAIG

Query:  TKQENDQSKWSCTLFEPIYDADHKGYRFRHVQLGYELFRASLFDEFPDGLLAKEKGATIEEWED--NAFNTLIDWDSLVVLPKHVTFKGSNGKYLKYNGH
        TK+E+D+SKWSCTLFEPI DA +K YRFRHVQLGYELFR     +  + L A E G    E ED    F  +IDW+SL V PKHVTFKG NG+YL++ G 
Subjt:  TKQENDQSKWSCTLFEPIYDADHKGYRFRHVQLGYELFRASLFDEFPDGLLAKEKGATIEEWED--NAFNTLIDWDSLVVLPKHVTFKGSNGKYLKYNGH

Query:  YLQFSGTDVENPSHIHEIFPNNDGTIRIKNVGCQKFWIRDPNWIVAVAEDSSRDDPNSLFQPVKLGNNNIVALRSLGNNHFCTSLSIDGKSNCLNANLEN
        YLQ SG D  +PS IHEIFP  DG + +KNV  ++FWI DPNWIVA A D +RDD N  F PV L ++N+VALR LGN  FCT ++ D K NCLNA+  +
Subjt:  YLQFSGTDVENPSHIHEIFPNNDGTIRIKNVGCQKFWIRDPNWIVAVAEDSSRDDPNSLFQPVKLGNNNIVALRSLGNNHFCTSLSIDGKSNCLNANLEN

Query:  PIVETEMEFAE--AVMSSRIENIEYRMKDAKIYGERMWSMAKGDAINKTRVADTVQFTFSFEDKRKRNWTNALGAKFGVSKQFTAGVPMIGDGSITVSVV
         + E + E +E   + S RI++ +Y + D +IYGER+WSMAKG AINKT   + ++FTFSFEDKR + WT+    +F V K+F    P I DG + +   
Subjt:  PIVETEMEFAE--AVMSSRIENIEYRMKDAKIYGERMWSMAKGDAINKTRVADTVQFTFSFEDKRKRNWTNALGAKFGVSKQFTAGVPMIGDGSITVSVV

Query:  AGGEYAWGET-QKEKRFMSCSSTITVPPMSKVKMNAIVKRGFCDVPFSYTKIDTLRDGTQISREYDDGVFTGIQSYDFQFRSDKVVLP
          G Y WGET   +K  MSC+STITVPPMSKVK+N +VKRGFC+VPFSY + +   +G +  + Y DGVFTG  SY FQ R+DK  LP
Subjt:  AGGEYAWGET-QKEKRFMSCSSTITVPPMSKVKMNAIVKRGFCDVPFSYTKIDTLRDGTQISREYDDGVFTGIQSYDFQFRSDKVVLP

A0A6J1DQ71 uncharacterized protein LOC1110225572.0e-28096.89Show/hide
Query:  EISGGEDDKSMIPKHFALQNYRPRFPQPKTAPYLRYVQDHEKQVDGFLQFSGKKLPSPVSKFHTEASESDPRFMHIRCSYNNKYWVRQSPDSNYIVAIGT
        EISGGEDDKS+IPKHFALQNYRPRFPQPKTAPYLRYVQDHEKQVDGFLQFSGKKLPSPVSKFH+EASESDPRFMHIRCSYNNKYWVRQSPDSNYIVAIGT
Subjt:  EISGGEDDKSMIPKHFALQNYRPRFPQPKTAPYLRYVQDHEKQVDGFLQFSGKKLPSPVSKFHTEASESDPRFMHIRCSYNNKYWVRQSPDSNYIVAIGT

Query:  KQENDQSKWSCTLFEPIYDADHKGYRFRHVQLGYELFRASLFDEFPDGLLAKEKGATIEEWEDNAFNTLIDWDSLVVLPKHVTFKGSNGKYLKYNGHYLQ
        KQENDQSKWSCTLFEPIYDADHKGYRFRHVQLGYELFRASLFDEFPDGLLAKEKGATIEEWEDNAFNTLIDWDSLV+LPKHVTFKGSNGKYLKYNGHYLQ
Subjt:  KQENDQSKWSCTLFEPIYDADHKGYRFRHVQLGYELFRASLFDEFPDGLLAKEKGATIEEWEDNAFNTLIDWDSLVVLPKHVTFKGSNGKYLKYNGHYLQ

Query:  FSGTDVENPSHIHEIFPNNDGTIRIKNVGCQKFWIRDPNWIVAVAEDSSRDDPNSLFQPVKLGNNNIVALRSLGNNHFCTSLSIDGKSNCLNANLENPIV
        FSGTDVENPSHIHEIFP NDGTIRIKNVGCQKFWIRDPNWIV VAEDSSRDD NSLFQPVKLG NNIVALRSLGNNHFCTSLSIDGKSNCLNA+LENPIV
Subjt:  FSGTDVENPSHIHEIFPNNDGTIRIKNVGCQKFWIRDPNWIVAVAEDSSRDDPNSLFQPVKLGNNNIVALRSLGNNHFCTSLSIDGKSNCLNANLENPIV

Query:  ETEMEFAEAVMSSRIENIEYRMKDAKIYGERMWSMAKGDAINKTRVADTVQFTFSFEDKRKRNWTNALGAKFGVSKQFTAGVPMIGDGSITVSVVAGGEY
        ETEMEFAEAVMSSRIENIEYRMKDAKIYGER+WSM KGDAINKTR ADTVQFTFSFEDK KRNWTNALG KFGVSKQFTAGVPMIGDGSITVSVVAGGEY
Subjt:  ETEMEFAEAVMSSRIENIEYRMKDAKIYGERMWSMAKGDAINKTRVADTVQFTFSFEDKRKRNWTNALGAKFGVSKQFTAGVPMIGDGSITVSVVAGGEY

Query:  AWGETQKEKRFMSCSSTITVPPMSKVKMNAIVKRGFCDVPFSYTKIDTLRDGTQISREYDDGVFTGIQSYDFQFRSDKVVLP
        AWGETQKEKRFMSCSSTITVPPMSKVKMNAIVKRGFC+VPFSYTKIDTLRDGTQISREYDDGVF GIQSYDFQFRSDKVVLP
Subjt:  AWGETQKEKRFMSCSSTITVPPMSKVKMNAIVKRGFCDVPFSYTKIDTLRDGTQISREYDDGVFTGIQSYDFQFRSDKVVLP

A0A6J1DTM1 uncharacterized protein LOC1110242914.3e-13753.94Show/hide
Query:  IPKHFALQNYRPRFPQPKTAPYLRYVQDHEKQVDGFLQFSGKKLPSPVSKFHTEASESDPRFMHIRCSYNNKYWVRQSPDSNYIVAIGTKQENDQSKWSC
        +PK+FALQ + P    PKT  YLR VQDHE    GFL+ SGK + SP SK  +EASES P+ +HIR   NNKYWVRQSPDS YIV    ++E D+SKW+C
Subjt:  IPKHFALQNYRPRFPQPKTAPYLRYVQDHEKQVDGFLQFSGKKLPSPVSKFHTEASESDPRFMHIRCSYNNKYWVRQSPDSNYIVAIGTKQENDQSKWSC

Query:  TLFEPIY--DADHKGYRFRHVQLGYE-LFRASLFDEFPDGLLAKEKGATIE-----EWEDNAFNTLIDWDSLVVLPKHVTFKGSNGKYLKYNGHYLQFSG
        TLF   Y     H+ +   HVQLG   L+R+   ++F + L A++K   ++        +++F+  +DWDSL + PKHVTFK                  
Subjt:  TLFEPIY--DADHKGYRFRHVQLGYE-LFRASLFDEFPDGLLAKEKGATIE-----EWEDNAFNTLIDWDSLVVLPKHVTFKGSNGKYLKYNGHYLQFSG

Query:  TDVENPSHIHEIFPNNDGTIRIKNVGCQKFWIRDPNWIVAVAEDSSRDDPNSLFQPVKLGNNNIVALRSLGNNHFCTSLSIDGKSNCLNANLENPIVETE
         DVE+ S IHEIFP NDGTIRI+NVG +KFWIRDPNWI+A+AE  S+DDPN+LF+ VK+ ++NIVAL +                               
Subjt:  TDVENPSHIHEIFPNNDGTIRIKNVGCQKFWIRDPNWIVAVAEDSSRDDPNSLFQPVKLGNNNIVALRSLGNNHFCTSLSIDGKSNCLNANLENPIVETE

Query:  MEFAEAVMSSRIENIEYRMKDAKIYGERMWSMAKGDAINKTRVADTVQFTFSFEDKRKRNWTNALGAKFGVSKQFTAGVPMIGDGSITVSVVAGGEYAWG
        ME  +AV+S +IENIEY + DAKIYGER+WSMAKGDA NKT  AD VQFTF+FEDKRK +WTN LGA+FGVSK F+ G+P IG+G+I+VS   G  Y+WG
Subjt:  MEFAEAVMSSRIENIEYRMKDAKIYGERMWSMAKGDAINKTRVADTVQFTFSFEDKRKRNWTNALGAKFGVSKQFTAGVPMIGDGSITVSVVAGGEYAWG

Query:  ETQKEKRFMSCSSTITVPPMSKVKMNAIVKRGFCDVPFSYTKIDTLRDGTQISREYDDGVFTGIQSYDF
        ET K+K  MSC+ST+T+PPMSKVKMN +VKRGFCDVPF YT+IDTLRDG QISREY+DG+F+G  SYDF
Subjt:  ETQKEKRFMSCSSTITVPPMSKVKMNAIVKRGFCDVPFSYTKIDTLRDGTQISREYDDGVFTGIQSYDF

SwissProt top hitse value%identityAlignment
Q5CZR5 Aerolysin-like protein6.4e-0523.21Show/hide
Query:  FAEAVMSSRIENIEYRMKDAKIYGERMWSMAKGDAINKTRVADTVQFTFSFEDKRKRNWTNALGAKFGVSKQFTAGVPMIGDGSITVSVVAGGEYAWGET
        F  AV S+ + N+ Y   +  I       +      NKT V    +   S +  +  +W+         S + +AG+P I + S   S+  G E      
Subjt:  FAEAVMSSRIENIEYRMKDAKIYGERMWSMAKGDAINKTRVADTVQFTFSFEDKRKRNWTNALGAKFGVSKQFTAGVPMIGDGSITVSVVAGGEYAWGET

Query:  QKEKRFMSCSSTITVPPMSKVKMNAIVKRGFCDVPFSYTKIDTLRDGTQISREYDDGVFTGIQSYDFQ
        Q +++  + ++T+ VPP  KV ++  + R   D+P++ T   T ++G+ +  E   G + G+   D +
Subjt:  QKEKRFMSCSSTITVPPMSKVKMNAIVKRGFCDVPFSYTKIDTLRDGTQISREYDDGVFTGIQSYDFQ

Q66S13 Natterin-41.2e-0623.72Show/hide
Query:  VMSSRIENIEYRMKDAKIYGERMWSMAKGDAINKTRVADTVQFTFSFEDKRKRNWTNALGAKFGVSKQFTAGVPMIGDGSITVSVVAGGEYAWGETQKEK
        V+  +I N+ Y MK  +++ ++  ++      N      T Q T     +  ++W  +     GVS + +AG+P I D S+ VS     E + G ++ E 
Subjt:  VMSSRIENIEYRMKDAKIYGERMWSMAKGDAINKTRVADTVQFTFSFEDKRKRNWTNALGAKFGVSKQFTAGVPMIGDGSITVSVVAGGEYAWGETQKEK

Query:  RFMSCSSTITVPPMSKVKMNAIVKRGFCDVPFSYTKIDTLRDGTQISREYDDGVFT
           S S + T+PP S   +         ++PF+           +++R+Y +G  T
Subjt:  RFMSCSSTITVPPMSKVKMNAIVKRGFCDVPFSYTKIDTLRDGTQISREYDDGVFT

Q66S17 Natterin-34.1e-0421.68Show/hide
Query:  VMSSRIENIEYRMKDAKIYGERMWSMAKGDAINKTRVADTVQFTFSFEDKRKRNWTNALGAKFGVSKQFTAGVPMIGDGSITVSVVAGGEYAWGETQKEK
        V+   +  + Y++  A        ++ +  A N      T         + +++W       FGV    TAG+P I   +++VSV      + G T  + 
Subjt:  VMSSRIENIEYRMKDAKIYGERMWSMAKGDAINKTRVADTVQFTFSFEDKRKRNWTNALGAKFGVSKQFTAGVPMIGDGSITVSVVAGGEYAWGETQKEK

Query:  RFMSCSSTITVPPMSKVKMNAIVKRGFCDVPFSYTKIDTLRDG
           + S  +TVPP     +  +  +   D+PF+     T R+G
Subjt:  RFMSCSSTITVPPMSKVKMNAIVKRGFCDVPFSYTKIDTLRDG

Q66S21 Natterin-28.9e-0722.01Show/hide
Query:  VMSSRIENIEYRMKDAKIYGERMWSMAKGDAINKTRVADTVQFTFSFEDKRKRNWTNALGAKFGVSKQFTAGVPMIGDGSITVSVVAGGEYAWGETQKEK
        V+   +++++Y+ +   +   +   M +    N+     T   T + +      W       FGV+   TAG+P +   S+ +S+ A  ++A G ++ E 
Subjt:  VMSSRIENIEYRMKDAKIYGERMWSMAKGDAINKTRVADTVQFTFSFEDKRKRNWTNALGAKFGVSKQFTAGVPMIGDGSITVSVVAGGEYAWGETQKEK

Query:  RFMSCSSTITVPPMSKVKMNAIVKRGFCDVPFSYTKIDTLRDGTQISREYDDGVFTGIQ
        +    + ++ VPP     ++ + +    D+PF+ T I T R G + ++    GV+  IQ
Subjt:  RFMSCSSTITVPPMSKVKMNAIVKRGFCDVPFSYTKIDTLRDGTQISREYDDGVFTGIQ

Q66S25 Natterin-12.8e-0825.17Show/hide
Query:  VMSSRIENIEYRMKDAKIYGERMWSMAKGDAINKTRVADTVQFTFSFEDKRKRNWTNALGAKFGVSKQFTAGVPMIGDGSITVSVVAGGEYAWGETQKEK
        V+   +++++Y+ +   +   +   M K    NK     T   T S +   +  W       FGV+   TAG+P +   S+ VS+ A  ++A G ++ E 
Subjt:  VMSSRIENIEYRMKDAKIYGERMWSMAKGDAINKTRVADTVQFTFSFEDKRKRNWTNALGAKFGVSKQFTAGVPMIGDGSITVSVVAGGEYAWGETQKEK

Query:  RFMSCSSTITVPPMSKVKMNAIVKRGFCDVPFSYTKIDTLRDG
        +    + ++ VPP     ++ + +    DVPF+ T I T R G
Subjt:  RFMSCSSTITVPPMSKVKMNAIVKRGFCDVPFSYTKIDTLRDG

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
GAAATATCAGGTGGTGAAGACGATAAATCCATGATCCCAAAACATTTTGCTTTGCAAAACTACAGGCCAAGGTTTCCACAGCCCAAAACTGCACCATATCTTCGCTATGT
ACAAGATCATGAGAAACAAGTAGATGGATTCCTCCAATTCTCTGGAAAAAAGCTGCCGAGTCCAGTCTCCAAGTTCCACACCGAGGCCTCCGAATCCGACCCCCGATTCA
TGCACATACGATGCAGTTACAACAACAAATACTGGGTTCGTCAGTCGCCTGACTCCAACTACATTGTTGCCATCGGCACAAAGCAAGAAAACGACCAGTCCAAATGGTCG
TGCACGCTGTTTGAGCCCATCTACGATGCTGACCACAAAGGCTACCGCTTTCGCCATGTGCAGCTCGGCTACGAGCTGTTTCGAGCCAGCTTGTTCGACGAATTCCCCGA
TGGCCTCCTGGCGAAGGAGAAAGGTGCAACTATTGAAGAATGGGAGGATAATGCATTCAACACACTTATCGATTGGGATTCATTAGTTGTACTCCCAAAACATGTGACCT
TTAAGGGTAGCAATGGAAAGTACTTGAAATACAACGGTCATTATTTGCAATTCTCTGGAACAGATGTTGAAAATCCATCGCATATCCATGAAATCTTCCCAAATAATGAT
GGAACTATTCGTATCAAGAATGTGGGTTGCCAAAAGTTCTGGATTCGTGATCCTAATTGGATAGTCGCGGTAGCAGAAGATAGCAGTAGGGACGATCCCAACTCATTGTT
TCAGCCAGTGAAACTTGGCAACAACAATATTGTGGCTCTTCGTAGCTTGGGCAACAACCACTTCTGCACAAGCCTCAGCATCGATGGAAAGTCAAATTGCTTGAATGCTA
ACCTAGAAAATCCAATCGTAGAAACTGAAATGGAATTCGCAGAGGCTGTAATGTCGAGCAGAATAGAAAACATTGAGTATCGCATGAAGGATGCCAAAATCTATGGTGAG
AGGATGTGGTCGATGGCTAAAGGAGATGCTATTAACAAAACAAGAGTCGCGGACACCGTGCAATTCACATTCTCTTTTGAGGATAAAAGGAAGAGGAATTGGACCAATGC
ATTGGGTGCCAAATTTGGAGTCTCAAAACAATTCACTGCCGGGGTTCCAATGATTGGAGATGGAAGCATTACTGTTTCTGTTGTGGCTGGTGGAGAGTATGCATGGGGAG
AGACACAAAAAGAGAAAAGGTTCATGTCTTGTAGCAGCACTATAACAGTGCCTCCAATGTCGAAAGTGAAGATGAATGCAATCGTAAAACGGGGCTTTTGCGACGTCCCT
TTTTCATATACTAAGATCGACACTCTCCGAGATGGAACACAGATCTCTCGTGAGTATGACGACGGAGTTTTCACCGGCATTCAGTCCTACGACTTCCAATTTAGGTCCGA
TAAGGTAGTACTGCCT
mRNA sequenceShow/hide mRNA sequence
GAAATATCAGGTGGTGAAGACGATAAATCCATGATCCCAAAACATTTTGCTTTGCAAAACTACAGGCCAAGGTTTCCACAGCCCAAAACTGCACCATATCTTCGCTATGT
ACAAGATCATGAGAAACAAGTAGATGGATTCCTCCAATTCTCTGGAAAAAAGCTGCCGAGTCCAGTCTCCAAGTTCCACACCGAGGCCTCCGAATCCGACCCCCGATTCA
TGCACATACGATGCAGTTACAACAACAAATACTGGGTTCGTCAGTCGCCTGACTCCAACTACATTGTTGCCATCGGCACAAAGCAAGAAAACGACCAGTCCAAATGGTCG
TGCACGCTGTTTGAGCCCATCTACGATGCTGACCACAAAGGCTACCGCTTTCGCCATGTGCAGCTCGGCTACGAGCTGTTTCGAGCCAGCTTGTTCGACGAATTCCCCGA
TGGCCTCCTGGCGAAGGAGAAAGGTGCAACTATTGAAGAATGGGAGGATAATGCATTCAACACACTTATCGATTGGGATTCATTAGTTGTACTCCCAAAACATGTGACCT
TTAAGGGTAGCAATGGAAAGTACTTGAAATACAACGGTCATTATTTGCAATTCTCTGGAACAGATGTTGAAAATCCATCGCATATCCATGAAATCTTCCCAAATAATGAT
GGAACTATTCGTATCAAGAATGTGGGTTGCCAAAAGTTCTGGATTCGTGATCCTAATTGGATAGTCGCGGTAGCAGAAGATAGCAGTAGGGACGATCCCAACTCATTGTT
TCAGCCAGTGAAACTTGGCAACAACAATATTGTGGCTCTTCGTAGCTTGGGCAACAACCACTTCTGCACAAGCCTCAGCATCGATGGAAAGTCAAATTGCTTGAATGCTA
ACCTAGAAAATCCAATCGTAGAAACTGAAATGGAATTCGCAGAGGCTGTAATGTCGAGCAGAATAGAAAACATTGAGTATCGCATGAAGGATGCCAAAATCTATGGTGAG
AGGATGTGGTCGATGGCTAAAGGAGATGCTATTAACAAAACAAGAGTCGCGGACACCGTGCAATTCACATTCTCTTTTGAGGATAAAAGGAAGAGGAATTGGACCAATGC
ATTGGGTGCCAAATTTGGAGTCTCAAAACAATTCACTGCCGGGGTTCCAATGATTGGAGATGGAAGCATTACTGTTTCTGTTGTGGCTGGTGGAGAGTATGCATGGGGAG
AGACACAAAAAGAGAAAAGGTTCATGTCTTGTAGCAGCACTATAACAGTGCCTCCAATGTCGAAAGTGAAGATGAATGCAATCGTAAAACGGGGCTTTTGCGACGTCCCT
TTTTCATATACTAAGATCGACACTCTCCGAGATGGAACACAGATCTCTCGTGAGTATGACGACGGAGTTTTCACCGGCATTCAGTCCTACGACTTCCAATTTAGGTCCGA
TAAGGTAGTACTGCCT
Protein sequenceShow/hide protein sequence
EISGGEDDKSMIPKHFALQNYRPRFPQPKTAPYLRYVQDHEKQVDGFLQFSGKKLPSPVSKFHTEASESDPRFMHIRCSYNNKYWVRQSPDSNYIVAIGTKQENDQSKWS
CTLFEPIYDADHKGYRFRHVQLGYELFRASLFDEFPDGLLAKEKGATIEEWEDNAFNTLIDWDSLVVLPKHVTFKGSNGKYLKYNGHYLQFSGTDVENPSHIHEIFPNND
GTIRIKNVGCQKFWIRDPNWIVAVAEDSSRDDPNSLFQPVKLGNNNIVALRSLGNNHFCTSLSIDGKSNCLNANLENPIVETEMEFAEAVMSSRIENIEYRMKDAKIYGE
RMWSMAKGDAINKTRVADTVQFTFSFEDKRKRNWTNALGAKFGVSKQFTAGVPMIGDGSITVSVVAGGEYAWGETQKEKRFMSCSSTITVPPMSKVKMNAIVKRGFCDVP
FSYTKIDTLRDGTQISREYDDGVFTGIQSYDFQFRSDKVVLP