; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10014983 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10014983
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionDUF4283 domain-containing protein
Genome locationChr02:22643118..22644827
RNA-Seq ExpressionHG10014983
SyntenyHG10014983
Gene Ontology termsNA
InterPro domainsIPR025558 - Domain of unknown function DUF4283
IPR040256 - Uncharacterized protein At4g02000-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0034059.1 hypothetical protein E6C27_scaffold65G00450 [Cucumis melo var. makuwa]1.9e-21569.42Show/hide
Query:  MAPVQSKHSLSGRH----DEPTTTSRKKYKTPIFSSSDFKLHHPTTTAASVCNLTPLQTARITQHFDHSLIAWIVGKKIHPQRLAFRLHRHLHLAGDLDV
        MAPV SKHSLS R     DEPT+TSRKKYK P+ SSS+   H+PT       +LTP QTARI Q F HSLIA + G  +H + LA RL R+L L GDLDV
Subjt:  MAPVQSKHSLSGRH----DEPTTTSRKKYKTPIFSSSDFKLHHPTTTAASVCNLTPLQTARITQHFDHSLIAWIVGKKIHPQRLAFRLHRHLHLAGDLDV

Query:  IELGLGFFVLKFSNSSDYFKALEERPWSISHLCIHVFPWIPNFKPSEALVPFVDIWIRLPELSIEYYDKEVFEKISEAIGGCLVKIDPVTEKREKCMFAR
         EL LGFFVLKFSNSSDY +ALEE PWSISHLCIHV PW+PNFKPSEAL+  VD+WIRLPEL IEYYDKE+ EKI+EAIG CLVKIDPVTE+R+KCMFAR
Subjt:  IELGLGFFVLKFSNSSDYFKALEERPWSISHLCIHVFPWIPNFKPSEALVPFVDIWIRLPELSIEYYDKEVFEKISEAIGGCLVKIDPVTEKREKCMFAR

Query:  ICIRITLCNPLIYSIQLERIQQKIEYEGLDSLCPICGCVYDLKHDCLNQNNPSGSSGFDPHQHRPRPLQAIDPISSSG---SLSHPLIHSLPSSESGLGS
        ICIRITLCNPLIYSIQ  +  QK++YEGLDSLC +CGC+ +LKH CLN NNPSGSSG DPHQ  P PLQAIDP SSSG       PLIHSLPSSES LGS
Subjt:  ICIRITLCNPLIYSIQLERIQQKIEYEGLDSLCPICGCVYDLKHDCLNQNNPSGSSGFDPHQHRPRPLQAIDPISSSG---SLSHPLIHSLPSSESGLGS

Query:  KSQVKDPFIELKLKDCPRLKMGECKPHVHVEAKVVENEKKTLPNFPKESSTTTMKTPESKHTNLIQSVRLASSVVVDQFRAAKASNPTKLAVHKNESQLS
        KSQ K+PF+ELKLKDCP+LKMG          KVVENEKK LPNFP+ESSTTT +TPE        SV LA+ +VVDQFRAAK S+PTKL V  N S  S
Subjt:  KSQVKDPFIELKLKDCPRLKMGECKPHVHVEAKVVENEKKTLPNFPKESSTTTMKTPESKHTNLIQSVRLASSVVVDQFRAAKASNPTKLAVHKNESQLS

Query:  SPVEAGLSFFPGVIQQSTVEKEMVNTPFGGINVVDSFPTVYTIDPTATSLGIDLSEVPTTRGSNQTQYATDFVLNSKGENENEVDSEVVPMPTLCSKKML
        S VEAG++ F   IQQ+  EK+M+NTPFGGI VVDS+PTVYTIDPT  SLGID SEVPT  GSNQ +YA +FVLNS+ EN+NEVDS+   MP LC KKML
Subjt:  SPVEAGLSFFPGVIQQSTVEKEMVNTPFGGINVVDSFPTVYTIDPTATSLGIDLSEVPTTRGSNQTQYATDFVLNSKGENENEVDSEVVPMPTLCSKKML

Query:  CWNFRGSENTKLIRASKDLIRLQEPSIVLIFGSKISSADADEVAQELAFDGSYCRKPNCYNGGVCKIMN
        CWNFRG +  KLI+ASK LIRL EPSIVLIFGSKISSADA+EV +ELAF+GSYCRKP+ YNGGV  I++
Subjt:  CWNFRGSENTKLIRASKDLIRLQEPSIVLIFGSKISSADADEVAQELAFDGSYCRKPNCYNGGVCKIMN

KAA0034060.1 hypothetical protein E6C27_scaffold65G00460 [Cucumis melo var. makuwa]3.5e-18559.48Show/hide
Query:  MAPVQSKHSLSGRH----DEPTTTSRKKYKTPIFSSSDFKLHHPTTTAASVCNLTPLQTARITQHFDHSLIAWIVGKKIHPQRLAFRLHRHLHLAGDLDV
        MAPVQSK SLSG      D+  +T RK+Y  P+ SS + K HHPT       NLTP QTARI Q F H LIA ++GK IH + L FRL  HL L GDL+V
Subjt:  MAPVQSKHSLSGRH----DEPTTTSRKKYKTPIFSSSDFKLHHPTTTAASVCNLTPLQTARITQHFDHSLIAWIVGKKIHPQRLAFRLHRHLHLAGDLDV

Query:  IELGLGFFVLKFSNSSDYFKALEERPWSISHLCIHVFPWIPNFKPSEALVPFVDIWIRLPELSIEYYDKEVFEKISEAIGGCLVKIDPVTEKREKCMFAR
          LGLGFF L FSN SDY +AL+ERPW I  LCIHVFPWIPNFKPS+A + FVD+W+RLPEL +E+Y++E+FE I++AIG  LVKIDPVTE+++K +FAR
Subjt:  IELGLGFFVLKFSNSSDYFKALEERPWSISHLCIHVFPWIPNFKPSEALVPFVDIWIRLPELSIEYYDKEVFEKISEAIGGCLVKIDPVTEKREKCMFAR

Query:  ICIRITLCNPLIYSIQLERIQQKIEYEGLDSLCPICGCVYDLKHDCLNQNN------------------------------------------------P
        ICI ITL NPLI+ I +E  +Q I YEGLDSLC +CGCV DLKH CLNQNN                                                P
Subjt:  ICIRITLCNPLIYSIQLERIQQKIEYEGLDSLCPICGCVYDLKHDCLNQNN------------------------------------------------P

Query:  SGSSGFDPHQHRPRPLQAIDPISSSG---SLSHPLIHSLPSSESGLGSKSQVKDPFIELKLKDCPRLKMGECKPHVHVEAKVVENEKKTLPNFPKESSTT
        SGSSG DPHQ  P P Q ID  SSSG       PLIHSLPS ES L SKSQ KDPF EL LKD P+LKM           KVVENEKKTLPNFP+ESSTT
Subjt:  SGSSGFDPHQHRPRPLQAIDPISSSG---SLSHPLIHSLPSSESGLGSKSQVKDPFIELKLKDCPRLKMGECKPHVHVEAKVVENEKKTLPNFPKESSTT

Query:  TMKTPESKHTNLIQSVRLASSVVVDQFRAAKASNPTKLAVHKNESQLSSPVEAGLSFFPGVIQQSTVEKEMVNTPFGGINVVDSFPTVYTIDPTATSLGI
        TMKT E        SV LA S+V DQFRAAKAS PTKLA+H N S+ SS VEAGL+ F  V QQ T  KEM+NTPFG +N VDS+PTVYTIDPT  SL I
Subjt:  TMKTPESKHTNLIQSVRLASSVVVDQFRAAKASNPTKLAVHKNESQLSSPVEAGLSFFPGVIQQSTVEKEMVNTPFGGINVVDSFPTVYTIDPTATSLGI

Query:  DLSEVPTTRGSNQTQYATDFVLNSKGENENEVDSEVVPMPTLCSKKMLCWNFRGSENTKLIRASKDLIRLQEPSIVLIFGSKISSADADEVAQELAFDGS
        + SEVPTT GSNQTQYA +FVLN   ENENEVDS+   MP+LCSKKMLCWNF G +   L +A KDLI L EPSIVLIFGSKISS+DADEV +EL FDG 
Subjt:  DLSEVPTTRGSNQTQYATDFVLNSKGENENEVDSEVVPMPTLCSKKMLCWNFRGSENTKLIRASKDLIRLQEPSIVLIFGSKISSADADEVAQELAFDGS

Query:  YCRKPNCYNGGVCKIMN
        Y RKP+ YNGGV  +++
Subjt:  YCRKPNCYNGGVCKIMN

KAG7030784.1 hypothetical protein SDJN02_04821, partial [Cucurbita argyrosperma subsp. argyrosperma]2.1e-15355.42Show/hide
Query:  TTTAASVCNLTPLQTARITQHFDHSLIAWIVGKKIHPQRLAFRLHRHLHLAGDLDVIELGLGFFVLKFSNSSDYFKALEERPWSISHLCIHVFPWIPNFK
        +T  A+VCNLTP QTARI Q FD SLI W+VGKKIHP++LA RL R+LHLAGDLDV ELGLGFFVLKFSN+ DY++ALEERPWSI HLCI+VFPWIPNFK
Subjt:  TTTAASVCNLTPLQTARITQHFDHSLIAWIVGKKIHPQRLAFRLHRHLHLAGDLDVIELGLGFFVLKFSNSSDYFKALEERPWSISHLCIHVFPWIPNFK

Query:  PSEALVPFVDIWIRLPELSIEYYDKEVFEKISEAIGGCLVKIDPVTEKREKCMFARICIRITLCNPLIYSIQLERIQQKIEYEGLDSLCPICGCVYDLKH
        PSEA +PFVD+WIRLPELSIEYYDKEV EKI+E IGG LVKIDPVTE REKCM+ARICIR+ L  PL  S Q  +  QKI YEGLD LC +CGCV DLKH
Subjt:  PSEALVPFVDIWIRLPELSIEYYDKEVFEKISEAIGGCLVKIDPVTEKREKCMFARICIRITLCNPLIYSIQLERIQQKIEYEGLDSLCPICGCVYDLKH

Query:  DCLNQNNPSGSSGFDPHQHRPRPLQA--------IDPISSS---------------------------GSLSHPLIHSLPSSESGLGSKSQVKDPFIELK
        DCL  +N S SSGFDPH HR RPLQA         +P SSS                            +L   LI S P+  S  GS+ QV    +EL 
Subjt:  DCLNQNNPSGSSGFDPHQHRPRPLQA--------IDPISSS---------------------------GSLSHPLIHSLPSSESGLGSKSQVKDPFIELK

Query:  LKDCPRLKMGECKPHVHVEAKVVENEKKTLPNFPKESSTTTMKTPESKHTNLIQSVRLASSVVVD-QFRAAKASNPTKLAVHKNES-------------Q
        L + P L + E    V                  KES + TMK P  K TNLIQSV LA  V+ D QFR  K S+PT LAV  NE              Q
Subjt:  LKDCPRLKMGECKPHVHVEAKVVENEKKTLPNFPKESSTTTMKTPESKHTNLIQSVRLASSVVVD-QFRAAKASNPTKLAVHKNES-------------Q

Query:  LSSPVEAGLSFFPGVIQQSTVEKEMVNTPFGGINVVDSFPTVYTIDPTATSLGIDLSEV-PTTRGSNQTQYATDFVLNSKGENENEVDSEVVPMPTLCSK
         SS +EAGL F+   IQQST++K + NTP   I+ VDS PT+YTIDPT TSL I+L E+  TT  SNQ ++A   V  S+          V    + CSK
Subjt:  LSSPVEAGLSFFPGVIQQSTVEKEMVNTPFGGINVVDSFPTVYTIDPTATSLGIDLSEV-PTTRGSNQTQYATDFVLNSKGENENEVDSEVVPMPTLCSK

Query:  KMLCWNFRGSENTKLIRASKDLIRLQEPSIVLIFGSKISSADADEVAQELAFDGSYCRKPNCYNGGVCKIMN
        KMLCWNFR ++N KL+RA KDLI+L +PSIVLIFG+KI   DAD V +ELAFDGSYCRKP+ Y GG   +++
Subjt:  KMLCWNFRGSENTKLIRASKDLIRLQEPSIVLIFGSKISSADADEVAQELAFDGSYCRKPNCYNGGVCKIMN

KGN50456.1 hypothetical protein Csa_000264 [Cucumis sativus]2.6e-16753.54Show/hide
Query:  MAPVQSKHSLSGR----HDEPTTTSRKKYKTPIFSSSDFKLHHPTTTAASVCNLTPLQTARITQHFDHSLIAWIVGKKIHPQRLAFRLHRHLHLAGDLDV
        MAPVQSK SLS      +D+PT+T RKKY  P+ SS + K HHPT       NLTP QTAR    F HSLIA ++GK IH + L FRL RHL L GDL+V
Subjt:  MAPVQSKHSLSGR----HDEPTTTSRKKYKTPIFSSSDFKLHHPTTTAASVCNLTPLQTARITQHFDHSLIAWIVGKKIHPQRLAFRLHRHLHLAGDLDV

Query:  IELGLGFFVLKFSNSSDYFKALEERPWSISHLCIHVFPWIPNFKPSEALVPFVDIWIRLPELSIEYYDKEVFEKISEAIGGCLVKIDPVTEKREKCMFAR
        + LGLGFF L FSN  DY++AL+ERPW I HLCIH  PWIPNFKPS+A + FVD+WIRLPEL +E+Y++E+FE I++AIG  LVKIDPVTE+++KCMFAR
Subjt:  IELGLGFFVLKFSNSSDYFKALEERPWSISHLCIHVFPWIPNFKPSEALVPFVDIWIRLPELSIEYYDKEVFEKISEAIGGCLVKIDPVTEKREKCMFAR

Query:  ICIRITLCNPLIYSIQLERIQQKIEYEGLDSLCPICGCVYDLKHDCLNQNNPSGSSGFDPHQHRPRPLQAIDP----ISSSGSLS---------------
        ICI ITL NPLI+ I +E  +Q I YEGLDSLC +CGCV  LKHDCLNQN PS SSG+DPHQ  P PLQA DP     SSSGS S               
Subjt:  ICIRITLCNPLIYSIQLERIQQKIEYEGLDSLCPICGCVYDLKHDCLNQNNPSGSSGFDPHQHRPRPLQAIDP----ISSSGSLS---------------

Query:  ------------------------------------------------------HPLIHSLPSSESGLGSKSQVKDPFIELKLKDCPRLKMGECKPHVHV
                                                               PLIHSLPS ES   SKSQ KDPF EL LK+  +LKMGE       
Subjt:  ------------------------------------------------------HPLIHSLPSSESGLGSKSQVKDPFIELKLKDCPRLKMGECKPHVHV

Query:  EAKVVENEKKTLPNFPKESSTTTMKTPESKHTNLIQSVRLASSVVVDQFRAAKASNPTKLAVHKNESQLSSPVEAGLSFFPGVIQQSTVEKEMVNTPFGG
           VVENEKKTLPN P+ES                                AKAS PTKLA+H N S+  S VEAGL+ F   +Q+ T  KEM+NTPFG 
Subjt:  EAKVVENEKKTLPNFPKESSTTTMKTPESKHTNLIQSVRLASSVVVDQFRAAKASNPTKLAVHKNESQLSSPVEAGLSFFPGVIQQSTVEKEMVNTPFGG

Query:  INVVDSFPTVYTIDPTATSLGIDLSEVPTTRGSNQTQYATDFVLNSKGENENEVDSEVV-PMPTLCSKKMLCWNFRGSENTKLIRASKDLIRLQEPSIVL
        ++VVDS+PTVYTI+PT  SLGI+ SEVPT  GSNQTQYA  FVLNS  EN+NEVDS+    +P  CSK MLC NF   +    IRA KDLI L +PSIVL
Subjt:  INVVDSFPTVYTIDPTATSLGIDLSEVPTTRGSNQTQYATDFVLNSKGENENEVDSEVV-PMPTLCSKKMLCWNFRGSENTKLIRASKDLIRLQEPSIVL

Query:  IFGSKISSADADEVAQELAFDGSYCRKPNCYNGGV
        IFGSKISS+DADEV +E AF+G YCRKP+  NGGV
Subjt:  IFGSKISSADADEVAQELAFDGSYCRKPNCYNGGV

XP_022941632.1 uncharacterized protein LOC111446932 isoform X2 [Cucurbita moschata]2.7e-15356.83Show/hide
Query:  TTTAASVCNLTPLQTARITQHFDHSLIAWIVGKKIHPQRLAFRLHRHLHLAGDLDVIELGLGFFVLKFSNSSDYFKALEERPWSISHLCIHVFPWIPNFK
        +T  A+VCNLTP QTARI Q FD SLI W+VGKKIHP++LA RL R+LHLAGDLDV ELGLGFFVLKFSN+ DY++ALEERPWSI HLCI+VFPWIPNFK
Subjt:  TTTAASVCNLTPLQTARITQHFDHSLIAWIVGKKIHPQRLAFRLHRHLHLAGDLDVIELGLGFFVLKFSNSSDYFKALEERPWSISHLCIHVFPWIPNFK

Query:  PSEALVPFVDIWIRLPELSIEYYDKEVFEKISEAIGGCLVKIDPVTEKREKCMFARICIRITLCNPLIYSIQLERIQQKIEYEGLDSLCPICGCVYDLKH
        PSEA +PFVD+WIRLPELSIEYYDKEV EKI+E IGG LVKIDPVT  REKCM+ARICIR+ L  PL  S Q  +  QKI YEGLD LC +CGCV DLKH
Subjt:  PSEALVPFVDIWIRLPELSIEYYDKEVFEKISEAIGGCLVKIDPVTEKREKCMFARICIRITLCNPLIYSIQLERIQQKIEYEGLDSLCPICGCVYDLKH

Query:  DCLNQNNPSGSSGFDPHQHRPRPLQA--------IDPISSSGSLSHP-----------LIHSLPSSESGLGSKSQVKDPFIELKLKDCPRLKMGECKPHV
        DCL  +N S SSGFDPH H  RPLQA        ++P SSS    +P           LI S P+  S  GS+ QV    +EL L + P L + E    V
Subjt:  DCLNQNNPSGSSGFDPHQHRPRPLQA--------IDPISSSGSLSHP-----------LIHSLPSSESGLGSKSQVKDPFIELKLKDCPRLKMGECKPHV

Query:  HVEAKVVENEKKTLPNFPKESSTTTMKTPESKHTNLIQSVRLASSVVVD-QFRAAKASNPTKLAVHKNES-------------QLSSPVEAGLSFFPGVI
                          KES + TM  P  K TNLIQSV LA  V+ D QFR  K S+PT LAV  NE              Q SS +EAGL F+   I
Subjt:  HVEAKVVENEKKTLPNFPKESSTTTMKTPESKHTNLIQSVRLASSVVVD-QFRAAKASNPTKLAVHKNES-------------QLSSPVEAGLSFFPGVI

Query:  QQSTVEKEMVNTPFGGINVVDSFPTVYTIDPTATSLGIDLSEV-PTTRGSNQTQYATDFVLNSKGENENEVDSEVVPMPTLCSKKMLCWNFRGSENTKLI
        QQST++K + NTP   I+ VDS PT+YTIDPT TSL I+L E+  TT  SNQ ++A   V  S+          V    + CSKKMLCWNFR ++N KL+
Subjt:  QQSTVEKEMVNTPFGGINVVDSFPTVYTIDPTATSLGIDLSEV-PTTRGSNQTQYATDFVLNSKGENENEVDSEVVPMPTLCSKKMLCWNFRGSENTKLI

Query:  RASKDLIRLQEPSIVLIFGSKISSADADEVAQELAFDGSYCRKPNCYNGGVCKIMN
        RA KDLI+L +PSIVLIFG+KIS ADAD V +ELAFDGSYCRKP+ Y GG   +++
Subjt:  RASKDLIRLQEPSIVLIFGSKISSADADEVAQELAFDGSYCRKPNCYNGGVCKIMN

TrEMBL top hitse value%identityAlignment
A0A0A0KRY0 DUF4283 domain-containing protein1.2e-16753.54Show/hide
Query:  MAPVQSKHSLSGR----HDEPTTTSRKKYKTPIFSSSDFKLHHPTTTAASVCNLTPLQTARITQHFDHSLIAWIVGKKIHPQRLAFRLHRHLHLAGDLDV
        MAPVQSK SLS      +D+PT+T RKKY  P+ SS + K HHPT       NLTP QTAR    F HSLIA ++GK IH + L FRL RHL L GDL+V
Subjt:  MAPVQSKHSLSGR----HDEPTTTSRKKYKTPIFSSSDFKLHHPTTTAASVCNLTPLQTARITQHFDHSLIAWIVGKKIHPQRLAFRLHRHLHLAGDLDV

Query:  IELGLGFFVLKFSNSSDYFKALEERPWSISHLCIHVFPWIPNFKPSEALVPFVDIWIRLPELSIEYYDKEVFEKISEAIGGCLVKIDPVTEKREKCMFAR
        + LGLGFF L FSN  DY++AL+ERPW I HLCIH  PWIPNFKPS+A + FVD+WIRLPEL +E+Y++E+FE I++AIG  LVKIDPVTE+++KCMFAR
Subjt:  IELGLGFFVLKFSNSSDYFKALEERPWSISHLCIHVFPWIPNFKPSEALVPFVDIWIRLPELSIEYYDKEVFEKISEAIGGCLVKIDPVTEKREKCMFAR

Query:  ICIRITLCNPLIYSIQLERIQQKIEYEGLDSLCPICGCVYDLKHDCLNQNNPSGSSGFDPHQHRPRPLQAIDP----ISSSGSLS---------------
        ICI ITL NPLI+ I +E  +Q I YEGLDSLC +CGCV  LKHDCLNQN PS SSG+DPHQ  P PLQA DP     SSSGS S               
Subjt:  ICIRITLCNPLIYSIQLERIQQKIEYEGLDSLCPICGCVYDLKHDCLNQNNPSGSSGFDPHQHRPRPLQAIDP----ISSSGSLS---------------

Query:  ------------------------------------------------------HPLIHSLPSSESGLGSKSQVKDPFIELKLKDCPRLKMGECKPHVHV
                                                               PLIHSLPS ES   SKSQ KDPF EL LK+  +LKMGE       
Subjt:  ------------------------------------------------------HPLIHSLPSSESGLGSKSQVKDPFIELKLKDCPRLKMGECKPHVHV

Query:  EAKVVENEKKTLPNFPKESSTTTMKTPESKHTNLIQSVRLASSVVVDQFRAAKASNPTKLAVHKNESQLSSPVEAGLSFFPGVIQQSTVEKEMVNTPFGG
           VVENEKKTLPN P+ES                                AKAS PTKLA+H N S+  S VEAGL+ F   +Q+ T  KEM+NTPFG 
Subjt:  EAKVVENEKKTLPNFPKESSTTTMKTPESKHTNLIQSVRLASSVVVDQFRAAKASNPTKLAVHKNESQLSSPVEAGLSFFPGVIQQSTVEKEMVNTPFGG

Query:  INVVDSFPTVYTIDPTATSLGIDLSEVPTTRGSNQTQYATDFVLNSKGENENEVDSEVV-PMPTLCSKKMLCWNFRGSENTKLIRASKDLIRLQEPSIVL
        ++VVDS+PTVYTI+PT  SLGI+ SEVPT  GSNQTQYA  FVLNS  EN+NEVDS+    +P  CSK MLC NF   +    IRA KDLI L +PSIVL
Subjt:  INVVDSFPTVYTIDPTATSLGIDLSEVPTTRGSNQTQYATDFVLNSKGENENEVDSEVV-PMPTLCSKKMLCWNFRGSENTKLIRASKDLIRLQEPSIVL

Query:  IFGSKISSADADEVAQELAFDGSYCRKPNCYNGGV
        IFGSKISS+DADEV +E AF+G YCRKP+  NGGV
Subjt:  IFGSKISSADADEVAQELAFDGSYCRKPNCYNGGV

A0A5A7SSW6 DUF4283 domain-containing protein1.7e-18559.48Show/hide
Query:  MAPVQSKHSLSGRH----DEPTTTSRKKYKTPIFSSSDFKLHHPTTTAASVCNLTPLQTARITQHFDHSLIAWIVGKKIHPQRLAFRLHRHLHLAGDLDV
        MAPVQSK SLSG      D+  +T RK+Y  P+ SS + K HHPT       NLTP QTARI Q F H LIA ++GK IH + L FRL  HL L GDL+V
Subjt:  MAPVQSKHSLSGRH----DEPTTTSRKKYKTPIFSSSDFKLHHPTTTAASVCNLTPLQTARITQHFDHSLIAWIVGKKIHPQRLAFRLHRHLHLAGDLDV

Query:  IELGLGFFVLKFSNSSDYFKALEERPWSISHLCIHVFPWIPNFKPSEALVPFVDIWIRLPELSIEYYDKEVFEKISEAIGGCLVKIDPVTEKREKCMFAR
          LGLGFF L FSN SDY +AL+ERPW I  LCIHVFPWIPNFKPS+A + FVD+W+RLPEL +E+Y++E+FE I++AIG  LVKIDPVTE+++K +FAR
Subjt:  IELGLGFFVLKFSNSSDYFKALEERPWSISHLCIHVFPWIPNFKPSEALVPFVDIWIRLPELSIEYYDKEVFEKISEAIGGCLVKIDPVTEKREKCMFAR

Query:  ICIRITLCNPLIYSIQLERIQQKIEYEGLDSLCPICGCVYDLKHDCLNQNN------------------------------------------------P
        ICI ITL NPLI+ I +E  +Q I YEGLDSLC +CGCV DLKH CLNQNN                                                P
Subjt:  ICIRITLCNPLIYSIQLERIQQKIEYEGLDSLCPICGCVYDLKHDCLNQNN------------------------------------------------P

Query:  SGSSGFDPHQHRPRPLQAIDPISSSG---SLSHPLIHSLPSSESGLGSKSQVKDPFIELKLKDCPRLKMGECKPHVHVEAKVVENEKKTLPNFPKESSTT
        SGSSG DPHQ  P P Q ID  SSSG       PLIHSLPS ES L SKSQ KDPF EL LKD P+LKM           KVVENEKKTLPNFP+ESSTT
Subjt:  SGSSGFDPHQHRPRPLQAIDPISSSG---SLSHPLIHSLPSSESGLGSKSQVKDPFIELKLKDCPRLKMGECKPHVHVEAKVVENEKKTLPNFPKESSTT

Query:  TMKTPESKHTNLIQSVRLASSVVVDQFRAAKASNPTKLAVHKNESQLSSPVEAGLSFFPGVIQQSTVEKEMVNTPFGGINVVDSFPTVYTIDPTATSLGI
        TMKT E        SV LA S+V DQFRAAKAS PTKLA+H N S+ SS VEAGL+ F  V QQ T  KEM+NTPFG +N VDS+PTVYTIDPT  SL I
Subjt:  TMKTPESKHTNLIQSVRLASSVVVDQFRAAKASNPTKLAVHKNESQLSSPVEAGLSFFPGVIQQSTVEKEMVNTPFGGINVVDSFPTVYTIDPTATSLGI

Query:  DLSEVPTTRGSNQTQYATDFVLNSKGENENEVDSEVVPMPTLCSKKMLCWNFRGSENTKLIRASKDLIRLQEPSIVLIFGSKISSADADEVAQELAFDGS
        + SEVPTT GSNQTQYA +FVLN   ENENEVDS+   MP+LCSKKMLCWNF G +   L +A KDLI L EPSIVLIFGSKISS+DADEV +EL FDG 
Subjt:  DLSEVPTTRGSNQTQYATDFVLNSKGENENEVDSEVVPMPTLCSKKMLCWNFRGSENTKLIRASKDLIRLQEPSIVLIFGSKISSADADEVAQELAFDGS

Query:  YCRKPNCYNGGVCKIMN
        Y RKP+ YNGGV  +++
Subjt:  YCRKPNCYNGGVCKIMN

A0A5A7SY10 DUF4283 domain-containing protein9.3e-21669.42Show/hide
Query:  MAPVQSKHSLSGRH----DEPTTTSRKKYKTPIFSSSDFKLHHPTTTAASVCNLTPLQTARITQHFDHSLIAWIVGKKIHPQRLAFRLHRHLHLAGDLDV
        MAPV SKHSLS R     DEPT+TSRKKYK P+ SSS+   H+PT       +LTP QTARI Q F HSLIA + G  +H + LA RL R+L L GDLDV
Subjt:  MAPVQSKHSLSGRH----DEPTTTSRKKYKTPIFSSSDFKLHHPTTTAASVCNLTPLQTARITQHFDHSLIAWIVGKKIHPQRLAFRLHRHLHLAGDLDV

Query:  IELGLGFFVLKFSNSSDYFKALEERPWSISHLCIHVFPWIPNFKPSEALVPFVDIWIRLPELSIEYYDKEVFEKISEAIGGCLVKIDPVTEKREKCMFAR
         EL LGFFVLKFSNSSDY +ALEE PWSISHLCIHV PW+PNFKPSEAL+  VD+WIRLPEL IEYYDKE+ EKI+EAIG CLVKIDPVTE+R+KCMFAR
Subjt:  IELGLGFFVLKFSNSSDYFKALEERPWSISHLCIHVFPWIPNFKPSEALVPFVDIWIRLPELSIEYYDKEVFEKISEAIGGCLVKIDPVTEKREKCMFAR

Query:  ICIRITLCNPLIYSIQLERIQQKIEYEGLDSLCPICGCVYDLKHDCLNQNNPSGSSGFDPHQHRPRPLQAIDPISSSG---SLSHPLIHSLPSSESGLGS
        ICIRITLCNPLIYSIQ  +  QK++YEGLDSLC +CGC+ +LKH CLN NNPSGSSG DPHQ  P PLQAIDP SSSG       PLIHSLPSSES LGS
Subjt:  ICIRITLCNPLIYSIQLERIQQKIEYEGLDSLCPICGCVYDLKHDCLNQNNPSGSSGFDPHQHRPRPLQAIDPISSSG---SLSHPLIHSLPSSESGLGS

Query:  KSQVKDPFIELKLKDCPRLKMGECKPHVHVEAKVVENEKKTLPNFPKESSTTTMKTPESKHTNLIQSVRLASSVVVDQFRAAKASNPTKLAVHKNESQLS
        KSQ K+PF+ELKLKDCP+LKMG          KVVENEKK LPNFP+ESSTTT +TPE        SV LA+ +VVDQFRAAK S+PTKL V  N S  S
Subjt:  KSQVKDPFIELKLKDCPRLKMGECKPHVHVEAKVVENEKKTLPNFPKESSTTTMKTPESKHTNLIQSVRLASSVVVDQFRAAKASNPTKLAVHKNESQLS

Query:  SPVEAGLSFFPGVIQQSTVEKEMVNTPFGGINVVDSFPTVYTIDPTATSLGIDLSEVPTTRGSNQTQYATDFVLNSKGENENEVDSEVVPMPTLCSKKML
        S VEAG++ F   IQQ+  EK+M+NTPFGGI VVDS+PTVYTIDPT  SLGID SEVPT  GSNQ +YA +FVLNS+ EN+NEVDS+   MP LC KKML
Subjt:  SPVEAGLSFFPGVIQQSTVEKEMVNTPFGGINVVDSFPTVYTIDPTATSLGIDLSEVPTTRGSNQTQYATDFVLNSKGENENEVDSEVVPMPTLCSKKML

Query:  CWNFRGSENTKLIRASKDLIRLQEPSIVLIFGSKISSADADEVAQELAFDGSYCRKPNCYNGGVCKIMN
        CWNFRG +  KLI+ASK LIRL EPSIVLIFGSKISSADA+EV +ELAF+GSYCRKP+ YNGGV  I++
Subjt:  CWNFRGSENTKLIRASKDLIRLQEPSIVLIFGSKISSADADEVAQELAFDGSYCRKPNCYNGGVCKIMN

A0A6J1FN13 uncharacterized protein LOC111446932 isoform X21.3e-15356.83Show/hide
Query:  TTTAASVCNLTPLQTARITQHFDHSLIAWIVGKKIHPQRLAFRLHRHLHLAGDLDVIELGLGFFVLKFSNSSDYFKALEERPWSISHLCIHVFPWIPNFK
        +T  A+VCNLTP QTARI Q FD SLI W+VGKKIHP++LA RL R+LHLAGDLDV ELGLGFFVLKFSN+ DY++ALEERPWSI HLCI+VFPWIPNFK
Subjt:  TTTAASVCNLTPLQTARITQHFDHSLIAWIVGKKIHPQRLAFRLHRHLHLAGDLDVIELGLGFFVLKFSNSSDYFKALEERPWSISHLCIHVFPWIPNFK

Query:  PSEALVPFVDIWIRLPELSIEYYDKEVFEKISEAIGGCLVKIDPVTEKREKCMFARICIRITLCNPLIYSIQLERIQQKIEYEGLDSLCPICGCVYDLKH
        PSEA +PFVD+WIRLPELSIEYYDKEV EKI+E IGG LVKIDPVT  REKCM+ARICIR+ L  PL  S Q  +  QKI YEGLD LC +CGCV DLKH
Subjt:  PSEALVPFVDIWIRLPELSIEYYDKEVFEKISEAIGGCLVKIDPVTEKREKCMFARICIRITLCNPLIYSIQLERIQQKIEYEGLDSLCPICGCVYDLKH

Query:  DCLNQNNPSGSSGFDPHQHRPRPLQA--------IDPISSSGSLSHP-----------LIHSLPSSESGLGSKSQVKDPFIELKLKDCPRLKMGECKPHV
        DCL  +N S SSGFDPH H  RPLQA        ++P SSS    +P           LI S P+  S  GS+ QV    +EL L + P L + E    V
Subjt:  DCLNQNNPSGSSGFDPHQHRPRPLQA--------IDPISSSGSLSHP-----------LIHSLPSSESGLGSKSQVKDPFIELKLKDCPRLKMGECKPHV

Query:  HVEAKVVENEKKTLPNFPKESSTTTMKTPESKHTNLIQSVRLASSVVVD-QFRAAKASNPTKLAVHKNES-------------QLSSPVEAGLSFFPGVI
                          KES + TM  P  K TNLIQSV LA  V+ D QFR  K S+PT LAV  NE              Q SS +EAGL F+   I
Subjt:  HVEAKVVENEKKTLPNFPKESSTTTMKTPESKHTNLIQSVRLASSVVVD-QFRAAKASNPTKLAVHKNES-------------QLSSPVEAGLSFFPGVI

Query:  QQSTVEKEMVNTPFGGINVVDSFPTVYTIDPTATSLGIDLSEV-PTTRGSNQTQYATDFVLNSKGENENEVDSEVVPMPTLCSKKMLCWNFRGSENTKLI
        QQST++K + NTP   I+ VDS PT+YTIDPT TSL I+L E+  TT  SNQ ++A   V  S+          V    + CSKKMLCWNFR ++N KL+
Subjt:  QQSTVEKEMVNTPFGGINVVDSFPTVYTIDPTATSLGIDLSEV-PTTRGSNQTQYATDFVLNSKGENENEVDSEVVPMPTLCSKKMLCWNFRGSENTKLI

Query:  RASKDLIRLQEPSIVLIFGSKISSADADEVAQELAFDGSYCRKPNCYNGGVCKIMN
        RA KDLI+L +PSIVLIFG+KIS ADAD V +ELAFDGSYCRKP+ Y GG   +++
Subjt:  RASKDLIRLQEPSIVLIFGSKISSADADEVAQELAFDGSYCRKPNCYNGGVCKIMN

A0A6J1FU80 uncharacterized protein LOC111446932 isoform X13.0e-15356.63Show/hide
Query:  TTTAASVCNLTPLQTARITQHFDHSLIAWIVGKKIHPQRLAFRLHRHLHLAGDLDVIELGLGFFVLKFSNSSDYFKALEERPWSISHLCIHVFPWIPNFK
        +T  A+VCNLTP QTARI Q FD SLI W+VGKKIHP++LA RL R+LHLAGDLDV ELGLGFFVLKFSN+ DY++ALEERPWSI HLCI+VFPWIPNFK
Subjt:  TTTAASVCNLTPLQTARITQHFDHSLIAWIVGKKIHPQRLAFRLHRHLHLAGDLDVIELGLGFFVLKFSNSSDYFKALEERPWSISHLCIHVFPWIPNFK

Query:  PSEALVPFVDIWIRLPELSIEYYDKEVFEKISEAIGGCLVKIDPVTEKREKCMFARICIRITLCNPLIYSIQLERIQQKIEYEGLDSLCPICGCVYDLKH
        PSEA +PFVD+WIRLPELSIEYYDKEV EKI+E IGG LVKIDPVT  REKCM+ARICIR+ L  PL  S Q  +  QKI YEGLD LC +CGCV DLKH
Subjt:  PSEALVPFVDIWIRLPELSIEYYDKEVFEKISEAIGGCLVKIDPVTEKREKCMFARICIRITLCNPLIYSIQLERIQQKIEYEGLDSLCPICGCVYDLKH

Query:  DCLNQNNPSGSSGFDPHQHRPRPLQA--------IDPISSS-------------GSLSHPLIHSLPSSESGLGSKSQVKDPFIELKLKDCPRLKMGECKP
        DCL  +N S SSGFDPH H  RPLQA        ++P SSS              +L   LI S P+  S  GS+ QV    +EL L + P L + E   
Subjt:  DCLNQNNPSGSSGFDPHQHRPRPLQA--------IDPISSS-------------GSLSHPLIHSLPSSESGLGSKSQVKDPFIELKLKDCPRLKMGECKP

Query:  HVHVEAKVVENEKKTLPNFPKESSTTTMKTPESKHTNLIQSVRLASSVVVD-QFRAAKASNPTKLAVHKNES-------------QLSSPVEAGLSFFPG
         V                  KES + TM  P  K TNLIQSV LA  V+ D QFR  K S+PT LAV  NE              Q SS +EAGL F+  
Subjt:  HVHVEAKVVENEKKTLPNFPKESSTTTMKTPESKHTNLIQSVRLASSVVVD-QFRAAKASNPTKLAVHKNES-------------QLSSPVEAGLSFFPG

Query:  VIQQSTVEKEMVNTPFGGINVVDSFPTVYTIDPTATSLGIDLSEV-PTTRGSNQTQYATDFVLNSKGENENEVDSEVVPMPTLCSKKMLCWNFRGSENTK
         IQQST++K + NTP   I+ VDS PT+YTIDPT TSL I+L E+  TT  SNQ ++A   V  S+          V    + CSKKMLCWNFR ++N K
Subjt:  VIQQSTVEKEMVNTPFGGINVVDSFPTVYTIDPTATSLGIDLSEV-PTTRGSNQTQYATDFVLNSKGENENEVDSEVVPMPTLCSKKMLCWNFRGSENTK

Query:  LIRASKDLIRLQEPSIVLIFGSKISSADADEVAQELAFDGSYCRKPNCYNGGVCKIMN
        L+RA KDLI+L +PSIVLIFG+KIS ADAD V +ELAFDGSYCRKP+ Y GG   +++
Subjt:  LIRASKDLIRLQEPSIVLIFGSKISSADADEVAQELAFDGSYCRKPNCYNGGVCKIMN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G01050.1 zinc ion binding;nucleic acid binding2.1e-1828.25Show/hide
Query:  LIAWIVGKKIHPQRLAFRLHRHLHLAGDLDVIELGLGFFVLKFSNSSDYFKALEERPWSISHLCIHVFPWIPNFKPSEALVPFVDIWIRLPELSIEYYDK
        +I  ++G +I    L  +L      +G + V++L   FF+++F    +Y  AL   PW +    + V  W   F P    +    +W+RL  +   YY +
Subjt:  LIAWIVGKKIHPQRLAFRLHRHLHLAGDLDVIELGLGFFVLKFSNSSDYFKALEERPWSISHLCIHVFPWIPNFKPSEALVPFVDIWIRLPELSIEYYDK

Query:  EVFEKISEAIGGCLVKIDPVTEKREKCMFARICIRITLCNPLIYSIQLERIQQKIEYEGLDSLCPICGCVYDLKHDC
         +  +I+  +G  L K+D  T   +K  FAR+CI + L  PL  ++ +   +  + YEGL  +C  CG    L H C
Subjt:  EVFEKISEAIGGCLVKIDPVTEKREKCMFARICIRITLCNPLIYSIQLERIQQKIEYEGLDSLCPICGCVYDLKHDC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGCCGGTTCAATCCAAACATTCCCTTTCCGGCCGCCACGACGAACCCACCACCACTAGCAGGAAGAAGTACAAGACACCGATTTTTTCCTCATCGGACTTCAAACT
CCATCATCCAACCACCACCGCCGCCTCTGTCTGTAACCTTACTCCCTTACAAACAGCACGTATCACTCAACACTTCGATCACTCTCTCATAGCCTGGATCGTCGGCAAGA
AGATCCATCCACAGCGACTTGCCTTTCGTCTTCACCGTCATCTTCATCTCGCCGGAGATTTGGACGTCATCGAGCTAGGGCTTGGGTTTTTTGTCCTTAAATTTTCCAAC
TCTTCGGACTACTTCAAAGCTCTCGAAGAGCGTCCTTGGTCGATTTCTCACCTTTGCATCCATGTATTTCCATGGATTCCCAATTTCAAACCCTCGGAAGCCTTGGTTCC
TTTTGTTGATATCTGGATTCGCCTCCCGGAGCTGAGTATCGAGTATTACGACAAGGAGGTTTTCGAGAAAATTTCGGAAGCCATCGGCGGCTGTCTCGTGAAGATCGATC
CGGTAACTGAAAAACGAGAGAAATGTATGTTTGCTCGAATCTGTATTAGGATAACTCTATGTAATCCCCTTATTTATAGTATCCAACTTGAGCGAATCCAGCAAAAAATT
GAATATGAGGGTTTAGACTCTTTGTGCCCTATTTGTGGATGTGTTTATGATCTGAAACATGATTGTTTGAATCAGAATAACCCTTCTGGTTCTTCTGGATTTGATCCCCA
TCAACATAGACCTCGTCCATTGCAGGCAATTGACCCGATTTCGAGTTCGGGTTCGTTGTCGCACCCTTTGATTCATTCTCTACCTTCATCAGAATCAGGATTGGGATCCA
AATCCCAAGTAAAAGACCCATTTATTGAGTTGAAATTGAAGGATTGTCCAAGGCTTAAAATGGGTGAATGTAAGCCTCATGTTCATGTGGAAGCTAAAGTAGTTGAAAAT
GAAAAGAAAACTTTGCCCAACTTCCCAAAAGAATCTTCAACTACAACCATGAAAACTCCCGAGTCAAAACACACCAATTTGATTCAATCTGTGCGTTTAGCTTCTTCTGT
TGTTGTAGATCAGTTCAGGGCTGCAAAAGCCAGTAACCCCACAAAGCTTGCAGTCCATAAGAATGAGTCACAATTATCATCTCCTGTGGAGGCTGGCCTCAGTTTCTTTC
CGGGTGTGATCCAACAATCAACAGTAGAGAAAGAGATGGTCAACACACCATTTGGAGGAATCAATGTCGTTGATAGTTTTCCAACTGTTTACACGATTGATCCAACGGCC
ACAAGCCTCGGAATTGATCTGTCAGAAGTGCCAACAACCAGAGGATCAAACCAAACCCAGTATGCTACTGACTTTGTGCTGAATTCAAAAGGTGAAAACGAGAATGAAGT
TGATTCGGAGGTTGTACCAATGCCTACATTGTGTTCTAAGAAAATGTTGTGCTGGAATTTTCGTGGGAGTGAGAATACAAAGCTGATACGAGCATCGAAAGATTTGATTA
GACTCCAGGAGCCATCCATTGTGCTGATCTTTGGCTCCAAGATCAGCAGTGCTGATGCGGATGAGGTTGCGCAGGAGCTTGCTTTCGACGGCTCGTATTGTAGGAAGCCT
AATTGCTACAATGGTGGTGTTTGTAAGATTATGAATTGTTGGAGATATAGTGAATTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCGCCGGTTCAATCCAAACATTCCCTTTCCGGCCGCCACGACGAACCCACCACCACTAGCAGGAAGAAGTACAAGACACCGATTTTTTCCTCATCGGACTTCAAACT
CCATCATCCAACCACCACCGCCGCCTCTGTCTGTAACCTTACTCCCTTACAAACAGCACGTATCACTCAACACTTCGATCACTCTCTCATAGCCTGGATCGTCGGCAAGA
AGATCCATCCACAGCGACTTGCCTTTCGTCTTCACCGTCATCTTCATCTCGCCGGAGATTTGGACGTCATCGAGCTAGGGCTTGGGTTTTTTGTCCTTAAATTTTCCAAC
TCTTCGGACTACTTCAAAGCTCTCGAAGAGCGTCCTTGGTCGATTTCTCACCTTTGCATCCATGTATTTCCATGGATTCCCAATTTCAAACCCTCGGAAGCCTTGGTTCC
TTTTGTTGATATCTGGATTCGCCTCCCGGAGCTGAGTATCGAGTATTACGACAAGGAGGTTTTCGAGAAAATTTCGGAAGCCATCGGCGGCTGTCTCGTGAAGATCGATC
CGGTAACTGAAAAACGAGAGAAATGTATGTTTGCTCGAATCTGTATTAGGATAACTCTATGTAATCCCCTTATTTATAGTATCCAACTTGAGCGAATCCAGCAAAAAATT
GAATATGAGGGTTTAGACTCTTTGTGCCCTATTTGTGGATGTGTTTATGATCTGAAACATGATTGTTTGAATCAGAATAACCCTTCTGGTTCTTCTGGATTTGATCCCCA
TCAACATAGACCTCGTCCATTGCAGGCAATTGACCCGATTTCGAGTTCGGGTTCGTTGTCGCACCCTTTGATTCATTCTCTACCTTCATCAGAATCAGGATTGGGATCCA
AATCCCAAGTAAAAGACCCATTTATTGAGTTGAAATTGAAGGATTGTCCAAGGCTTAAAATGGGTGAATGTAAGCCTCATGTTCATGTGGAAGCTAAAGTAGTTGAAAAT
GAAAAGAAAACTTTGCCCAACTTCCCAAAAGAATCTTCAACTACAACCATGAAAACTCCCGAGTCAAAACACACCAATTTGATTCAATCTGTGCGTTTAGCTTCTTCTGT
TGTTGTAGATCAGTTCAGGGCTGCAAAAGCCAGTAACCCCACAAAGCTTGCAGTCCATAAGAATGAGTCACAATTATCATCTCCTGTGGAGGCTGGCCTCAGTTTCTTTC
CGGGTGTGATCCAACAATCAACAGTAGAGAAAGAGATGGTCAACACACCATTTGGAGGAATCAATGTCGTTGATAGTTTTCCAACTGTTTACACGATTGATCCAACGGCC
ACAAGCCTCGGAATTGATCTGTCAGAAGTGCCAACAACCAGAGGATCAAACCAAACCCAGTATGCTACTGACTTTGTGCTGAATTCAAAAGGTGAAAACGAGAATGAAGT
TGATTCGGAGGTTGTACCAATGCCTACATTGTGTTCTAAGAAAATGTTGTGCTGGAATTTTCGTGGGAGTGAGAATACAAAGCTGATACGAGCATCGAAAGATTTGATTA
GACTCCAGGAGCCATCCATTGTGCTGATCTTTGGCTCCAAGATCAGCAGTGCTGATGCGGATGAGGTTGCGCAGGAGCTTGCTTTCGACGGCTCGTATTGTAGGAAGCCT
AATTGCTACAATGGTGGTGTTTGTAAGATTATGAATTGTTGGAGATATAGTGAATTGTAA
Protein sequenceShow/hide protein sequence
MAPVQSKHSLSGRHDEPTTTSRKKYKTPIFSSSDFKLHHPTTTAASVCNLTPLQTARITQHFDHSLIAWIVGKKIHPQRLAFRLHRHLHLAGDLDVIELGLGFFVLKFSN
SSDYFKALEERPWSISHLCIHVFPWIPNFKPSEALVPFVDIWIRLPELSIEYYDKEVFEKISEAIGGCLVKIDPVTEKREKCMFARICIRITLCNPLIYSIQLERIQQKI
EYEGLDSLCPICGCVYDLKHDCLNQNNPSGSSGFDPHQHRPRPLQAIDPISSSGSLSHPLIHSLPSSESGLGSKSQVKDPFIELKLKDCPRLKMGECKPHVHVEAKVVEN
EKKTLPNFPKESSTTTMKTPESKHTNLIQSVRLASSVVVDQFRAAKASNPTKLAVHKNESQLSSPVEAGLSFFPGVIQQSTVEKEMVNTPFGGINVVDSFPTVYTIDPTA
TSLGIDLSEVPTTRGSNQTQYATDFVLNSKGENENEVDSEVVPMPTLCSKKMLCWNFRGSENTKLIRASKDLIRLQEPSIVLIFGSKISSADADEVAQELAFDGSYCRKP
NCYNGGVCKIMNCWRYSEL