CuGenDBv2

Lsi04G008030 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi04G008030
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: DUF4283 domain-containing protein
Genome location: chr04:7592879..7598481
RNA-Seq Expression: Lsi04G008030
Synteny: Lsi04G008030
Gene Ontology terms: NA
InterPro domains:
IPR025558 - Domain of unknown function DUF4283
IPR040256 - Uncharacterized protein At4g02000-like


Homology
GenBank top hits (e value, % identity, alignment)
KAA0034059.1 hypothetical protein E6C27_scaffold65G00450 [Cucumis melo var. makuwa] (e value: 5.4e-228; identity: 72.17%)
Query:  MAPVQSKNSFSSRQPTADDEPTITSRKKYKTPISSSSDFKPHHPTTTAASVCNLSPSQTARITQQFDHSLIAWIVGKKIHPRLLALRLRRHLCLTGELDV
        MAPV SK+S SSR+ T DDEPT TSRKKYK P+SSSS+  PH+PT       +L+PSQTARI Q F HSLIA + G  +H RLLA RLRR+L LTG+LDV
Subjt:  MAPVQSKNSFSSRQPTADDEPTITSRKKYKTPISSSSDFKPHHPTTTAASVCNLSPSQTARITQQFDHSLIAWIVGKKIHPRLLALRLRRHLCLTGELDV

Query:  FELGLGFFVLKFSNSSDYSKALEERPWSISHLCIHVFPWIPNFKPSEASIPFVDVWIRLPELSIEYYDKEVFEKIAKAIGVCLVKIDSVTERREKCMFAR
        FEL LGFFVLKFSNSSDY++ALEE PWSISHLCIHV PW+PNFKPSEA I  VDVWIRLPEL IEYYDKE+ EKIA+AIGVCLVKID VTERR+KCMFAR
Subjt:  FELGLGFFVLKFSNSSDYSKALEERPWSISHLCIHVFPWIPNFKPSEASIPFVDVWIRLPELSIEYYDKEVFEKIAKAIGVCLVKIDSVTERREKCMFAR

Query:  LCIRITLCNPLILSIQFGRILQKIEYEGLDSLCSVCGCVDDLKHDCL--NNPSGSSG-------------CGPSSSS-----VSNPLIHSLHSSESALGS
        +CIRITLCNPLI SIQFG+ LQK++YEGLDSLCSVCGC+D+LKH CL  NNPSGSSG               PSSSS        PLIHSL SSESALGS
Subjt:  LCIRITLCNPLILSIQFGRILQKIEYEGLDSLCSVCGCVDDLKHDCL--NNPSGSSG-------------CGPSSSS-----VSNPLIHSLHSSESALGS

Query:  QSQEKDPFIELKLKDCK-----PVVENEKKTLPNFPKESSTTTMRTPESKHTNLIQSVPLASSVVVDQFRAAKASSHTKLAVHKNESQSSSPVEAGLNFF
        +SQEK+PF+ELKLKDC       VVENEKK LPNFP+ESSTTT  TPE        SVPLA+ +VVDQFRAAK SS TKL V  N S SSS VEAG+N F
Subjt:  QSQEKDPFIELKLKDCK-----PVVENEKKTLPNFPKESSTTTMRTPESKHTNLIQSVPLASSVVVDQFRAAKASSHTKLAVHKNESQSSSPVEAGLNFF

Query:  SSVIQQSTVEKEMINTPFGGINVVDSFPTVYMIDPTTTSLGIDLSEVPTTTGSNQAQYATNFVLNSRGENENEVDSEVVSMSALCSKKMLCWNFRGTDNT
        S  IQQ+  EK+MINTPFGGI VVDS+PTVY IDPTT SLGID SEVPT TGSNQ +YA NFVLNSR EN+NEVDS+  SM  LC KKMLCWNFRG D  
Subjt:  SSVIQQSTVEKEMINTPFGGINVVDSFPTVYMIDPTTTSLGIDLSEVPTTTGSNQAQYATNFVLNSRGENENEVDSEVVSMSALCSKKMLCWNFRGTDNT

Query:  KLIRASKDLIRLHEPSIVLIFGSKISSADADEVAQELAFDGSYCRKPNCYNGGVWMLLSRQDVQIEVSSYSPKKVSASVHFHYKLNEPAQGLLDEDIETS
        KLI+ASK LIRL EPSIVLIFGSKISSADA+EV +ELAF+GSYCRKP+ YNGGVWM+LS QDV+IEVSSYSP+KVSASV+F  KLNEP   LLDED ETS
Subjt:  KLIRASKDLIRLHEPSIVLIFGSKISSADADEVAQELAFDGSYCRKPNCYNGGVWMLLSRQDVQIEVSSYSPKKVSASVHFHYKLNEPAQGLLDEDIETS

KAA0034060.1 hypothetical protein E6C27_scaffold65G00460 [Cucumis melo var. makuwa] (e value: 6.2e-192; identity: 59.32%)
Query:  RSSFLRKFAIPLLQSQSSPPPNSNPPFMAPVQSKHSLSGRH----DEPTTTSRKKYKTPIFSSSDFKLHHPTTTAASVCNLTPLQTARITQHFDHSLIAW
        RSS+L +  +PLL S SS   NSNP FMAPVQSK SLSG      D+  +T RK+Y  P+ SS + K HHPT       NLTP QTARI Q F H LIA 
Subjt:  RSSFLRKFAIPLLQSQSSPPPNSNPPFMAPVQSKHSLSGRH----DEPTTTSRKKYKTPIFSSSDFKLHHPTTTAASVCNLTPLQTARITQHFDHSLIAW

Query:  IVGKKIHPQRLAFRLHRHLHLAGDLDVIELGLGFFVLKFSNSSDYFKALEERPWSISHLCIHVFPWIPNFKPSEALVPFVDIWIRLPELSIEYYDKEVFE
        ++GK IH + L FRL  HL L GDL+V  LGLGFF L FSN SDY +AL+ERPW I  LCIHVFPWIPNFKPS+A + FVD+W+RLPEL +E+Y++E+FE
Subjt:  IVGKKIHPQRLAFRLHRHLHLAGDLDVIELGLGFFVLKFSNSSDYFKALEERPWSISHLCIHVFPWIPNFKPSEALVPFVDIWIRLPELSIEYYDKEVFE

Query:  KISEAIGGCLVKIDPVTEKREKCMFARICIRITLCNPLIYSIQLERIQQKIEYEGLDSLCPICGCVYDLKHDCLNQNN----------------------
         I++AIG  LVKIDPVTE+++K +FARICI ITL NPLI+ I +E  +Q I YEGLDSLC +CGCV DLKH CLNQNN                      
Subjt:  KISEAIGGCLVKIDPVTEKREKCMFARICIRITLCNPLIYSIQLERIQQKIEYEGLDSLCPICGCVYDLKHDCLNQNN----------------------

Query:  --------------------------PSGSSGFDPHQHRPRPLQAIDPISSSG---SLSHPLIHSLPSSESGLGSKSQVKDPFIELKLKDCPRLKMGECK
                                  PSGSSG DPHQ  P P Q ID  SSSG       PLIHSLPS ES L SKSQ KDPF EL LKD P+LKM    
Subjt:  --------------------------PSGSSGFDPHQHRPRPLQAIDPISSSG---SLSHPLIHSLPSSESGLGSKSQVKDPFIELKLKDCPRLKMGECK

Query:  PHVHVEAKVVENEKKTLPNFPKESSTTTMKTPESKHTNLIQSVRLASSVVVDQFRAAKASNPTKLAVHKNESQLSSPVEAGLSFFPGVIQQSTVEKEMVN
               KVVENEKKTLPNFP+ESSTTTMKT E        SV LA S+V DQFRAAKAS PTKLA+H N S+ SS VEAGL+ F  V QQ T  KEM+N
Subjt:  PHVHVEAKVVENEKKTLPNFPKESSTTTMKTPESKHTNLIQSVRLASSVVVDQFRAAKASNPTKLAVHKNESQLSSPVEAGLSFFPGVIQQSTVEKEMVN

Query:  TPFGGINVVDSFPTVYTIDPTATSLGIDLSEVPTTRGSNQTQYATDFVLNSKGENENEVDSEVVPMPTLCSKKMLCWNFRGSENTKLIRASKDLIRLQEP
        TPFG +N VDS+PTVYTIDPT  SL I+ SEVPTT GSNQTQYA +FVLN   ENENEVDS+   MP+LCSKKMLCWNF G +   L +A KDLI L EP
Subjt:  TPFGGINVVDSFPTVYTIDPTATSLGIDLSEVPTTRGSNQTQYATDFVLNSKGENENEVDSEVVPMPTLCSKKMLCWNFRGSENTKLIRASKDLIRLQEP

Query:  SIVLIFGSKISSADADEVAQELAFDGSYCRKPNCYNGGVCKIMN
        SIVLIFGSKISS+DADEV +EL FDG Y RKP+ YNGGV  +++
Subjt:  SIVLIFGSKISSADADEVAQELAFDGSYCRKPNCYNGGVCKIMN

KGN50456.1 hypothetical protein Csa_000264 [Cucumis sativus] (e value: 1.3e-168; identity: 52.99%)
Query:  MAPVQSKNSFSSRQPTADDEPTITSRKKYKTPISSSSDFKPHHPTTTAASVCNLSPSQTARITQQFDHSLIAWIVGKKIHPRLLALRLRRHLCLTGELDV
        MAPVQSK+S S  Q   +D+PT T RKKY  P+SSS + K HHPT       NL+PSQTAR   +F HSLIA ++GK IH   L  RLRRHL LTG+L+V
Subjt:  MAPVQSKNSFSSRQPTADDEPTITSRKKYKTPISSSSDFKPHHPTTTAASVCNLSPSQTARITQQFDHSLIAWIVGKKIHPRLLALRLRRHLCLTGELDV

Query:  FELGLGFFVLKFSNSSDYSKALEERPWSISHLCIHVFPWIPNFKPSEASIPFVDVWIRLPELSIEYYDKEVFEKIAKAIGVCLVKIDSVTERREKCMFAR
          LGLGFF L FSN  DY +AL+ERPW I HLCIH  PWIPNFKPS+A I FVDVWIRLPEL +E+Y++E+FE IAKAIGV LVKID VTER++KCMFAR
Subjt:  FELGLGFFVLKFSNSSDYSKALEERPWSISHLCIHVFPWIPNFKPSEASIPFVDVWIRLPELSIEYYDKEVFEKIAKAIGVCLVKIDSVTERREKCMFAR

Query:  LCIRITLCNPLILSIQFGRILQKIEYEGLDSLCSVCGCVDDLKHDCLN---------------NP------------------SGSSGCGPSSSSVS---
        +CI ITL NPLI  I      Q I YEGLDSLCSVCGCVD LKHDCLN               NP                  S  SG G SSSS S   
Subjt:  LCIRITLCNPLILSIQFGRILQKIEYEGLDSLCSVCGCVDDLKHDCLN---------------NP------------------SGSSGCGPSSSSVS---

Query:  ------------------------------------------------------NPLIHSLHSSESALGSQSQEKDPFIELKLKD-----CKPVVENEKK
                                                               PLIHSL S ES+  S+SQEKDPF EL LK+        VVENEKK
Subjt:  ------------------------------------------------------NPLIHSLHSSESALGSQSQEKDPFIELKLKD-----CKPVVENEKK

Query:  TLPNFPKESSTTTMRTPESKHTNLIQSVPLASSVVVDQFRAAKASSHTKLAVHKNESQSSSPVEAGLNFFSSVIQQSTVEKEMINTPFGGINVVDSFPTV
        TLPN P+ES                                AKAS  TKLA+H N S+S S VEAGL  FS+ +Q+ T  KEMINTPFG ++VVDS+PTV
Subjt:  TLPNFPKESSTTTMRTPESKHTNLIQSVPLASSVVVDQFRAAKASSHTKLAVHKNESQSSSPVEAGLNFFSSVIQQSTVEKEMINTPFGGINVVDSFPTV

Query:  YMIDPTTTSLGIDLSEVPTTTGSNQAQYATNFVLNSRGENENEVDSEVV-SMSALCSKKMLCWNFRGTDNTKLIRASKDLIRLHEPSIVLIFGSKISSAD
        Y I+PTT SLGI+ SEVPT TGSNQ QYA +FVLNS  EN+NEVDS+   S+   CSK MLC NF   D    IRA KDLI LH+PSIVLIFGSKISS+D
Subjt:  YMIDPTTTSLGIDLSEVPTTTGSNQAQYATNFVLNSRGENENEVDSEVV-SMSALCSKKMLCWNFRGTDNTKLIRASKDLIRLHEPSIVLIFGSKISSAD

Query:  ADEVAQELAFDGSYCRKPNCYNGGVWMLLSRQDVQIEVSSYSPKKVSASVHFHYKLNEPAQGLLDEDIETSPQPWGPNFFFASTR
        ADEV +E AF+G YCRKP+  NGGVW++LSR+DVQIE ++ SP+KV ASVHFH  LNE             P+ WG  FF+ASTR
Subjt:  ADEVAQELAFDGSYCRKPNCYNGGVWMLLSRQDVQIEVSSYSPKKVSASVHFHYKLNEPAQGLLDEDIETSPQPWGPNFFFASTR

XP_022941630.1 uncharacterized protein LOC111446932 isoform X1 [Cucurbita moschata] (e value: 1.6e-163; identity: 60.56%)
Query:  TTTAASVCNLSPSQTARITQQFDHSLIAWIVGKKIHPRLLALRLRRHLCLTGELDVFELGLGFFVLKFSNSSDYSKALEERPWSISHLCIHVFPWIPNFK
        +T  A+VCNL+PSQTARI QQFD SLI W+VGKKIHPR LA+RLRR+L L G+LDVFELGLGFFVLKFSN+ DY +ALEERPWSI HLCI+VFPWIPNFK
Subjt:  TTTAASVCNLSPSQTARITQQFDHSLIAWIVGKKIHPRLLALRLRRHLCLTGELDVFELGLGFFVLKFSNSSDYSKALEERPWSISHLCIHVFPWIPNFK

Query:  PSEASIPFVDVWIRLPELSIEYYDKEVFEKIAKAIGVCLVKIDSVTERREKCMFARLCIRITLCNPLILSIQFGRILQKIEYEGLDSLCSVCGCVDDLKH
        PSEASIPFVDVWIRLPELSIEYYDKEV EKIA+ IG  LVKID VT  REKCM+AR+CIR+ L  PL LS QFG+  QKI YEGLD LC VCGCVDDLKH
Subjt:  PSEASIPFVDVWIRLPELSIEYYDKEVFEKIAKAIGVCLVKIDSVTERREKCMFARLCIRITLCNPLILSIQFGRILQKIEYEGLDSLCSVCGCVDDLKH

Query:  DCLNNPSGSSG---------------------CGPSSSSVSNPLIHSLHSSESALGSQSQEKDP-----------FIELKLKD--CKPVVENEKKTLPNF
        DCL+N S SSG                       P SSS  NP + S  +S S L  Q     P            +EL L +    PV E++K+     
Subjt:  DCLNNPSGSSG---------------------CGPSSSSVSNPLIHSLHSSESALGSQSQEKDP-----------FIELKLKD--CKPVVENEKKTLPNF

Query:  PKESSTTTMRTPESKHTNLIQSVPLASSVVVD-QFRAAKASSHTKLAVHKNES-------------QSSSPVEAGLNFFSSVIQQSTVEKEMINTPFGGI
         KES + TM  P  K TNLIQSVPLA  V+ D QFR  K SS T LAV  NE              Q SS +EAGL F+S+ IQQST++K + NTP   I
Subjt:  PKESSTTTMRTPESKHTNLIQSVPLASSVVVD-QFRAAKASSHTKLAVHKNES-------------QSSSPVEAGLNFFSSVIQQSTVEKEMINTPFGGI

Query:  NVVDSFPTVYMIDPTTTSLGIDLSEV-PTTTGSNQAQYATNFVLNSRGENENEVDSEVVSMSA-LCSKKMLCWNFRGTDNTKLIRASKDLIRLHEPSIVL
        + VDS PT+Y IDPT TSL I+L E+  TTT SNQ ++A + V            SE VSMSA  CSKKMLCWNFR TDN KL+RA KDLI+LH+PSIVL
Subjt:  NVVDSFPTVYMIDPTTTSLGIDLSEV-PTTTGSNQAQYATNFVLNSRGENENEVDSEVVSMSA-LCSKKMLCWNFRGTDNTKLIRASKDLIRLHEPSIVL

Query:  IFGSKISSADADEVAQELAFDGSYCRKPNCYNGGVWMLLSRQDVQIEVSSYSPKKVSASVHFHYKLNE
        IFG+KIS ADAD V +ELAFDGSYCRKP+ Y GG W+LLS+QDVQIEVSSYSP++VSASV  H K N+
Subjt:  IFGSKISSADADEVAQELAFDGSYCRKPNCYNGGVWMLLSRQDVQIEVSSYSPKKVSASVHFHYKLNE

XP_022941632.1 uncharacterized protein LOC111446932 isoform X2 [Cucurbita moschata] (e value: 3.6e-163; identity: 60.92%)
Query:  TTTAASVCNLSPSQTARITQQFDHSLIAWIVGKKIHPRLLALRLRRHLCLTGELDVFELGLGFFVLKFSNSSDYSKALEERPWSISHLCIHVFPWIPNFK
        +T  A+VCNL+PSQTARI QQFD SLI W+VGKKIHPR LA+RLRR+L L G+LDVFELGLGFFVLKFSN+ DY +ALEERPWSI HLCI+VFPWIPNFK
Subjt:  TTTAASVCNLSPSQTARITQQFDHSLIAWIVGKKIHPRLLALRLRRHLCLTGELDVFELGLGFFVLKFSNSSDYSKALEERPWSISHLCIHVFPWIPNFK

Query:  PSEASIPFVDVWIRLPELSIEYYDKEVFEKIAKAIGVCLVKIDSVTERREKCMFARLCIRITLCNPLILSIQFGRILQKIEYEGLDSLCSVCGCVDDLKH
        PSEASIPFVDVWIRLPELSIEYYDKEV EKIA+ IG  LVKID VT  REKCM+AR+CIR+ L  PL LS QFG+  QKI YEGLD LC VCGCVDDLKH
Subjt:  PSEASIPFVDVWIRLPELSIEYYDKEVFEKIAKAIGVCLVKIDSVTERREKCMFARLCIRITLCNPLILSIQFGRILQKIEYEGLDSLCSVCGCVDDLKH

Query:  DCLNNPSGSSG------------------------C-------GPSSSSVSN---PLIHSLHSSESALGSQSQEKDPFIELKLKDCKPVVENEKKTLPNF
        DCL+N S SSG                        C        PSSSS SN    LI S  +  SA GS+ Q  +  + L  +   PV E++K+     
Subjt:  DCLNNPSGSSG------------------------C-------GPSSSSVSN---PLIHSLHSSESALGSQSQEKDPFIELKLKDCKPVVENEKKTLPNF

Query:  PKESSTTTMRTPESKHTNLIQSVPLASSVVVD-QFRAAKASSHTKLAVHKNES-------------QSSSPVEAGLNFFSSVIQQSTVEKEMINTPFGGI
         KES + TM  P  K TNLIQSVPLA  V+ D QFR  K SS T LAV  NE              Q SS +EAGL F+S+ IQQST++K + NTP   I
Subjt:  PKESSTTTMRTPESKHTNLIQSVPLASSVVVD-QFRAAKASSHTKLAVHKNES-------------QSSSPVEAGLNFFSSVIQQSTVEKEMINTPFGGI

Query:  NVVDSFPTVYMIDPTTTSLGIDLSEV-PTTTGSNQAQYATNFVLNSRGENENEVDSEVVSMSA-LCSKKMLCWNFRGTDNTKLIRASKDLIRLHEPSIVL
        + VDS PT+Y IDPT TSL I+L E+  TTT SNQ ++A + V            SE VSMSA  CSKKMLCWNFR TDN KL+RA KDLI+LH+PSIVL
Subjt:  NVVDSFPTVYMIDPTTTSLGIDLSEV-PTTTGSNQAQYATNFVLNSRGENENEVDSEVVSMSA-LCSKKMLCWNFRGTDNTKLIRASKDLIRLHEPSIVL

Query:  IFGSKISSADADEVAQELAFDGSYCRKPNCYNGGVWMLLSRQDVQIEVSSYSPKKVSASVHFHYKLNE
        IFG+KIS ADAD V +ELAFDGSYCRKP+ Y GG W+LLS+QDVQIEVSSYSP++VSASV  H K N+
Subjt:  IFGSKISSADADEVAQELAFDGSYCRKPNCYNGGVWMLLSRQDVQIEVSSYSPKKVSASVHFHYKLNE

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KRY0 DUF4283 domain-containing protein (e value: 6.1e-169; identity: 52.99%)
Query:  MAPVQSKNSFSSRQPTADDEPTITSRKKYKTPISSSSDFKPHHPTTTAASVCNLSPSQTARITQQFDHSLIAWIVGKKIHPRLLALRLRRHLCLTGELDV
        MAPVQSK+S S  Q   +D+PT T RKKY  P+SSS + K HHPT       NL+PSQTAR   +F HSLIA ++GK IH   L  RLRRHL LTG+L+V
Subjt:  MAPVQSKNSFSSRQPTADDEPTITSRKKYKTPISSSSDFKPHHPTTTAASVCNLSPSQTARITQQFDHSLIAWIVGKKIHPRLLALRLRRHLCLTGELDV

Query:  FELGLGFFVLKFSNSSDYSKALEERPWSISHLCIHVFPWIPNFKPSEASIPFVDVWIRLPELSIEYYDKEVFEKIAKAIGVCLVKIDSVTERREKCMFAR
          LGLGFF L FSN  DY +AL+ERPW I HLCIH  PWIPNFKPS+A I FVDVWIRLPEL +E+Y++E+FE IAKAIGV LVKID VTER++KCMFAR
Subjt:  FELGLGFFVLKFSNSSDYSKALEERPWSISHLCIHVFPWIPNFKPSEASIPFVDVWIRLPELSIEYYDKEVFEKIAKAIGVCLVKIDSVTERREKCMFAR

Query:  LCIRITLCNPLILSIQFGRILQKIEYEGLDSLCSVCGCVDDLKHDCLN---------------NP------------------SGSSGCGPSSSSVS---
        +CI ITL NPLI  I      Q I YEGLDSLCSVCGCVD LKHDCLN               NP                  S  SG G SSSS S   
Subjt:  LCIRITLCNPLILSIQFGRILQKIEYEGLDSLCSVCGCVDDLKHDCLN---------------NP------------------SGSSGCGPSSSSVS---

Query:  ------------------------------------------------------NPLIHSLHSSESALGSQSQEKDPFIELKLKD-----CKPVVENEKK
                                                               PLIHSL S ES+  S+SQEKDPF EL LK+        VVENEKK
Subjt:  ------------------------------------------------------NPLIHSLHSSESALGSQSQEKDPFIELKLKD-----CKPVVENEKK

Query:  TLPNFPKESSTTTMRTPESKHTNLIQSVPLASSVVVDQFRAAKASSHTKLAVHKNESQSSSPVEAGLNFFSSVIQQSTVEKEMINTPFGGINVVDSFPTV
        TLPN P+ES                                AKAS  TKLA+H N S+S S VEAGL  FS+ +Q+ T  KEMINTPFG ++VVDS+PTV
Subjt:  TLPNFPKESSTTTMRTPESKHTNLIQSVPLASSVVVDQFRAAKASSHTKLAVHKNESQSSSPVEAGLNFFSSVIQQSTVEKEMINTPFGGINVVDSFPTV

Query:  YMIDPTTTSLGIDLSEVPTTTGSNQAQYATNFVLNSRGENENEVDSEVV-SMSALCSKKMLCWNFRGTDNTKLIRASKDLIRLHEPSIVLIFGSKISSAD
        Y I+PTT SLGI+ SEVPT TGSNQ QYA +FVLNS  EN+NEVDS+   S+   CSK MLC NF   D    IRA KDLI LH+PSIVLIFGSKISS+D
Subjt:  YMIDPTTTSLGIDLSEVPTTTGSNQAQYATNFVLNSRGENENEVDSEVV-SMSALCSKKMLCWNFRGTDNTKLIRASKDLIRLHEPSIVLIFGSKISSAD

Query:  ADEVAQELAFDGSYCRKPNCYNGGVWMLLSRQDVQIEVSSYSPKKVSASVHFHYKLNEPAQGLLDEDIETSPQPWGPNFFFASTR
        ADEV +E AF+G YCRKP+  NGGVW++LSR+DVQIE ++ SP+KV ASVHFH  LNE             P+ WG  FF+ASTR
Subjt:  ADEVAQELAFDGSYCRKPNCYNGGVWMLLSRQDVQIEVSSYSPKKVSASVHFHYKLNEPAQGLLDEDIETSPQPWGPNFFFASTR

A0A5A7SSW6 DUF4283 domain-containing protein (e value: 3.0e-192; identity: 59.32%)
Query:  RSSFLRKFAIPLLQSQSSPPPNSNPPFMAPVQSKHSLSGRH----DEPTTTSRKKYKTPIFSSSDFKLHHPTTTAASVCNLTPLQTARITQHFDHSLIAW
        RSS+L +  +PLL S SS   NSNP FMAPVQSK SLSG      D+  +T RK+Y  P+ SS + K HHPT       NLTP QTARI Q F H LIA 
Subjt:  RSSFLRKFAIPLLQSQSSPPPNSNPPFMAPVQSKHSLSGRH----DEPTTTSRKKYKTPIFSSSDFKLHHPTTTAASVCNLTPLQTARITQHFDHSLIAW

Query:  IVGKKIHPQRLAFRLHRHLHLAGDLDVIELGLGFFVLKFSNSSDYFKALEERPWSISHLCIHVFPWIPNFKPSEALVPFVDIWIRLPELSIEYYDKEVFE
        ++GK IH + L FRL  HL L GDL+V  LGLGFF L FSN SDY +AL+ERPW I  LCIHVFPWIPNFKPS+A + FVD+W+RLPEL +E+Y++E+FE
Subjt:  IVGKKIHPQRLAFRLHRHLHLAGDLDVIELGLGFFVLKFSNSSDYFKALEERPWSISHLCIHVFPWIPNFKPSEALVPFVDIWIRLPELSIEYYDKEVFE

Query:  KISEAIGGCLVKIDPVTEKREKCMFARICIRITLCNPLIYSIQLERIQQKIEYEGLDSLCPICGCVYDLKHDCLNQNN----------------------
         I++AIG  LVKIDPVTE+++K +FARICI ITL NPLI+ I +E  +Q I YEGLDSLC +CGCV DLKH CLNQNN                      
Subjt:  KISEAIGGCLVKIDPVTEKREKCMFARICIRITLCNPLIYSIQLERIQQKIEYEGLDSLCPICGCVYDLKHDCLNQNN----------------------

Query:  --------------------------PSGSSGFDPHQHRPRPLQAIDPISSSG---SLSHPLIHSLPSSESGLGSKSQVKDPFIELKLKDCPRLKMGECK
                                  PSGSSG DPHQ  P P Q ID  SSSG       PLIHSLPS ES L SKSQ KDPF EL LKD P+LKM    
Subjt:  --------------------------PSGSSGFDPHQHRPRPLQAIDPISSSG---SLSHPLIHSLPSSESGLGSKSQVKDPFIELKLKDCPRLKMGECK

Query:  PHVHVEAKVVENEKKTLPNFPKESSTTTMKTPESKHTNLIQSVRLASSVVVDQFRAAKASNPTKLAVHKNESQLSSPVEAGLSFFPGVIQQSTVEKEMVN
               KVVENEKKTLPNFP+ESSTTTMKT E        SV LA S+V DQFRAAKAS PTKLA+H N S+ SS VEAGL+ F  V QQ T  KEM+N
Subjt:  PHVHVEAKVVENEKKTLPNFPKESSTTTMKTPESKHTNLIQSVRLASSVVVDQFRAAKASNPTKLAVHKNESQLSSPVEAGLSFFPGVIQQSTVEKEMVN

Query:  TPFGGINVVDSFPTVYTIDPTATSLGIDLSEVPTTRGSNQTQYATDFVLNSKGENENEVDSEVVPMPTLCSKKMLCWNFRGSENTKLIRASKDLIRLQEP
        TPFG +N VDS+PTVYTIDPT  SL I+ SEVPTT GSNQTQYA +FVLN   ENENEVDS+   MP+LCSKKMLCWNF G +   L +A KDLI L EP
Subjt:  TPFGGINVVDSFPTVYTIDPTATSLGIDLSEVPTTRGSNQTQYATDFVLNSKGENENEVDSEVVPMPTLCSKKMLCWNFRGSENTKLIRASKDLIRLQEP

Query:  SIVLIFGSKISSADADEVAQELAFDGSYCRKPNCYNGGVCKIMN
        SIVLIFGSKISS+DADEV +EL FDG Y RKP+ YNGGV  +++
Subjt:  SIVLIFGSKISSADADEVAQELAFDGSYCRKPNCYNGGVCKIMN

A0A5A7SY10 DUF4283 domain-containing protein (e value: 2.6e-228; identity: 72.17%)
Query:  MAPVQSKNSFSSRQPTADDEPTITSRKKYKTPISSSSDFKPHHPTTTAASVCNLSPSQTARITQQFDHSLIAWIVGKKIHPRLLALRLRRHLCLTGELDV
        MAPV SK+S SSR+ T DDEPT TSRKKYK P+SSSS+  PH+PT       +L+PSQTARI Q F HSLIA + G  +H RLLA RLRR+L LTG+LDV
Subjt:  MAPVQSKNSFSSRQPTADDEPTITSRKKYKTPISSSSDFKPHHPTTTAASVCNLSPSQTARITQQFDHSLIAWIVGKKIHPRLLALRLRRHLCLTGELDV

Query:  FELGLGFFVLKFSNSSDYSKALEERPWSISHLCIHVFPWIPNFKPSEASIPFVDVWIRLPELSIEYYDKEVFEKIAKAIGVCLVKIDSVTERREKCMFAR
        FEL LGFFVLKFSNSSDY++ALEE PWSISHLCIHV PW+PNFKPSEA I  VDVWIRLPEL IEYYDKE+ EKIA+AIGVCLVKID VTERR+KCMFAR
Subjt:  FELGLGFFVLKFSNSSDYSKALEERPWSISHLCIHVFPWIPNFKPSEASIPFVDVWIRLPELSIEYYDKEVFEKIAKAIGVCLVKIDSVTERREKCMFAR

Query:  LCIRITLCNPLILSIQFGRILQKIEYEGLDSLCSVCGCVDDLKHDCL--NNPSGSSG-------------CGPSSSS-----VSNPLIHSLHSSESALGS
        +CIRITLCNPLI SIQFG+ LQK++YEGLDSLCSVCGC+D+LKH CL  NNPSGSSG               PSSSS        PLIHSL SSESALGS
Subjt:  LCIRITLCNPLILSIQFGRILQKIEYEGLDSLCSVCGCVDDLKHDCL--NNPSGSSG-------------CGPSSSS-----VSNPLIHSLHSSESALGS

Query:  QSQEKDPFIELKLKDCK-----PVVENEKKTLPNFPKESSTTTMRTPESKHTNLIQSVPLASSVVVDQFRAAKASSHTKLAVHKNESQSSSPVEAGLNFF
        +SQEK+PF+ELKLKDC       VVENEKK LPNFP+ESSTTT  TPE        SVPLA+ +VVDQFRAAK SS TKL V  N S SSS VEAG+N F
Subjt:  QSQEKDPFIELKLKDCK-----PVVENEKKTLPNFPKESSTTTMRTPESKHTNLIQSVPLASSVVVDQFRAAKASSHTKLAVHKNESQSSSPVEAGLNFF

Query:  SSVIQQSTVEKEMINTPFGGINVVDSFPTVYMIDPTTTSLGIDLSEVPTTTGSNQAQYATNFVLNSRGENENEVDSEVVSMSALCSKKMLCWNFRGTDNT
        S  IQQ+  EK+MINTPFGGI VVDS+PTVY IDPTT SLGID SEVPT TGSNQ +YA NFVLNSR EN+NEVDS+  SM  LC KKMLCWNFRG D  
Subjt:  SSVIQQSTVEKEMINTPFGGINVVDSFPTVYMIDPTTTSLGIDLSEVPTTTGSNQAQYATNFVLNSRGENENEVDSEVVSMSALCSKKMLCWNFRGTDNT

Query:  KLIRASKDLIRLHEPSIVLIFGSKISSADADEVAQELAFDGSYCRKPNCYNGGVWMLLSRQDVQIEVSSYSPKKVSASVHFHYKLNEPAQGLLDEDIETS
        KLI+ASK LIRL EPSIVLIFGSKISSADA+EV +ELAF+GSYCRKP+ YNGGVWM+LS QDV+IEVSSYSP+KVSASV+F  KLNEP   LLDED ETS
Subjt:  KLIRASKDLIRLHEPSIVLIFGSKISSADADEVAQELAFDGSYCRKPNCYNGGVWMLLSRQDVQIEVSSYSPKKVSASVHFHYKLNEPAQGLLDEDIETS

A0A6J1FN13 uncharacterized protein LOC111446932 isoform X2 (e value: 1.7e-163; identity: 60.92%)
Query:  TTTAASVCNLSPSQTARITQQFDHSLIAWIVGKKIHPRLLALRLRRHLCLTGELDVFELGLGFFVLKFSNSSDYSKALEERPWSISHLCIHVFPWIPNFK
        +T  A+VCNL+PSQTARI QQFD SLI W+VGKKIHPR LA+RLRR+L L G+LDVFELGLGFFVLKFSN+ DY +ALEERPWSI HLCI+VFPWIPNFK
Subjt:  TTTAASVCNLSPSQTARITQQFDHSLIAWIVGKKIHPRLLALRLRRHLCLTGELDVFELGLGFFVLKFSNSSDYSKALEERPWSISHLCIHVFPWIPNFK

Query:  PSEASIPFVDVWIRLPELSIEYYDKEVFEKIAKAIGVCLVKIDSVTERREKCMFARLCIRITLCNPLILSIQFGRILQKIEYEGLDSLCSVCGCVDDLKH
        PSEASIPFVDVWIRLPELSIEYYDKEV EKIA+ IG  LVKID VT  REKCM+AR+CIR+ L  PL LS QFG+  QKI YEGLD LC VCGCVDDLKH
Subjt:  PSEASIPFVDVWIRLPELSIEYYDKEVFEKIAKAIGVCLVKIDSVTERREKCMFARLCIRITLCNPLILSIQFGRILQKIEYEGLDSLCSVCGCVDDLKH

Query:  DCLNNPSGSSG------------------------C-------GPSSSSVSN---PLIHSLHSSESALGSQSQEKDPFIELKLKDCKPVVENEKKTLPNF
        DCL+N S SSG                        C        PSSSS SN    LI S  +  SA GS+ Q  +  + L  +   PV E++K+     
Subjt:  DCLNNPSGSSG------------------------C-------GPSSSSVSN---PLIHSLHSSESALGSQSQEKDPFIELKLKDCKPVVENEKKTLPNF

Query:  PKESSTTTMRTPESKHTNLIQSVPLASSVVVD-QFRAAKASSHTKLAVHKNES-------------QSSSPVEAGLNFFSSVIQQSTVEKEMINTPFGGI
         KES + TM  P  K TNLIQSVPLA  V+ D QFR  K SS T LAV  NE              Q SS +EAGL F+S+ IQQST++K + NTP   I
Subjt:  PKESSTTTMRTPESKHTNLIQSVPLASSVVVD-QFRAAKASSHTKLAVHKNES-------------QSSSPVEAGLNFFSSVIQQSTVEKEMINTPFGGI

Query:  NVVDSFPTVYMIDPTTTSLGIDLSEV-PTTTGSNQAQYATNFVLNSRGENENEVDSEVVSMSA-LCSKKMLCWNFRGTDNTKLIRASKDLIRLHEPSIVL
        + VDS PT+Y IDPT TSL I+L E+  TTT SNQ ++A + V            SE VSMSA  CSKKMLCWNFR TDN KL+RA KDLI+LH+PSIVL
Subjt:  NVVDSFPTVYMIDPTTTSLGIDLSEV-PTTTGSNQAQYATNFVLNSRGENENEVDSEVVSMSA-LCSKKMLCWNFRGTDNTKLIRASKDLIRLHEPSIVL

Query:  IFGSKISSADADEVAQELAFDGSYCRKPNCYNGGVWMLLSRQDVQIEVSSYSPKKVSASVHFHYKLNE
        IFG+KIS ADAD V +ELAFDGSYCRKP+ Y GG W+LLS+QDVQIEVSSYSP++VSASV  H K N+
Subjt:  IFGSKISSADADEVAQELAFDGSYCRKPNCYNGGVWMLLSRQDVQIEVSSYSPKKVSASVHFHYKLNE

A0A6J1FU80 uncharacterized protein LOC111446932 isoform X1 (e value: 7.7e-164; identity: 60.56%)
Query:  TTTAASVCNLSPSQTARITQQFDHSLIAWIVGKKIHPRLLALRLRRHLCLTGELDVFELGLGFFVLKFSNSSDYSKALEERPWSISHLCIHVFPWIPNFK
        +T  A+VCNL+PSQTARI QQFD SLI W+VGKKIHPR LA+RLRR+L L G+LDVFELGLGFFVLKFSN+ DY +ALEERPWSI HLCI+VFPWIPNFK
Subjt:  TTTAASVCNLSPSQTARITQQFDHSLIAWIVGKKIHPRLLALRLRRHLCLTGELDVFELGLGFFVLKFSNSSDYSKALEERPWSISHLCIHVFPWIPNFK

Query:  PSEASIPFVDVWIRLPELSIEYYDKEVFEKIAKAIGVCLVKIDSVTERREKCMFARLCIRITLCNPLILSIQFGRILQKIEYEGLDSLCSVCGCVDDLKH
        PSEASIPFVDVWIRLPELSIEYYDKEV EKIA+ IG  LVKID VT  REKCM+AR+CIR+ L  PL LS QFG+  QKI YEGLD LC VCGCVDDLKH
Subjt:  PSEASIPFVDVWIRLPELSIEYYDKEVFEKIAKAIGVCLVKIDSVTERREKCMFARLCIRITLCNPLILSIQFGRILQKIEYEGLDSLCSVCGCVDDLKH

Query:  DCLNNPSGSSG---------------------CGPSSSSVSNPLIHSLHSSESALGSQSQEKDP-----------FIELKLKD--CKPVVENEKKTLPNF
        DCL+N S SSG                       P SSS  NP + S  +S S L  Q     P            +EL L +    PV E++K+     
Subjt:  DCLNNPSGSSG---------------------CGPSSSSVSNPLIHSLHSSESALGSQSQEKDP-----------FIELKLKD--CKPVVENEKKTLPNF

Query:  PKESSTTTMRTPESKHTNLIQSVPLASSVVVD-QFRAAKASSHTKLAVHKNES-------------QSSSPVEAGLNFFSSVIQQSTVEKEMINTPFGGI
         KES + TM  P  K TNLIQSVPLA  V+ D QFR  K SS T LAV  NE              Q SS +EAGL F+S+ IQQST++K + NTP   I
Subjt:  PKESSTTTMRTPESKHTNLIQSVPLASSVVVD-QFRAAKASSHTKLAVHKNES-------------QSSSPVEAGLNFFSSVIQQSTVEKEMINTPFGGI

Query:  NVVDSFPTVYMIDPTTTSLGIDLSEV-PTTTGSNQAQYATNFVLNSRGENENEVDSEVVSMSA-LCSKKMLCWNFRGTDNTKLIRASKDLIRLHEPSIVL
        + VDS PT+Y IDPT TSL I+L E+  TTT SNQ ++A + V            SE VSMSA  CSKKMLCWNFR TDN KL+RA KDLI+LH+PSIVL
Subjt:  NVVDSFPTVYMIDPTTTSLGIDLSEV-PTTTGSNQAQYATNFVLNSRGENENEVDSEVVSMSA-LCSKKMLCWNFRGTDNTKLIRASKDLIRLHEPSIVL

Query:  IFGSKISSADADEVAQELAFDGSYCRKPNCYNGGVWMLLSRQDVQIEVSSYSPKKVSASVHFHYKLNE
        IFG+KIS ADAD V +ELAFDGSYCRKP+ Y GG W+LLS+QDVQIEVSSYSP++VSASV  H K N+
Subjt:  IFGSKISSADADEVAQELAFDGSYCRKPNCYNGGVWMLLSRQDVQIEVSSYSPKKVSASVHFHYKLNE

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT2G01050.1 zinc ion binding; nucleic acid binding (e value: 2.3e-19; identity: 31.11%)
Query:  LIAWIVGKKIHPRLLALRLRRHLCLTGELDVFELGLGFFVLKFSNSSDYSKALEERPWSISHLCIHVFPWIPNFKPSEASIPFVDVWIRLPELSIEYYDK
        +I  ++G +I   +L  +LR     +G + V +L   FF+++F    +Y  AL   PW +    + V  W   F P    I    VW+RL  +   YY +
Subjt:  LIAWIVGKKIHPRLLALRLRRHLCLTGELDVFELGLGFFVLKFSNSSDYSKALEERPWSISHLCIHVFPWIPNFKPSEASIPFVDVWIRLPELSIEYYDK

Query:  EVFEKIAKAIGVCLVKIDSVTERREKCMFARLCIRITLCNPLILSIQFGRILQKIEYEGLDSLCSVCGCVDDLKHDCLNN
         +  +IA+ +G  L K+D  T   +K  FAR+CI + L  PL  ++        + YEGL  +CS CG    L H C  N
Subjt:  EVFEKIAKAIGVCLVKIDSVTERREKCMFARLCIRITLCNPLILSIQFGRILQKIEYEGLDSLCSVCGCVDDLKHDCLNN

AT5G36228.1 nucleic acid binding; zinc ion binding (e value: 5.9e-07; identity: 24.58%)
Query:  WIVGKKIHPRLLALRLRRHLCLTGELDVFELGLGFFVLKFSNSSDYSKALEERPWSISHLCIHVFPWIPNFKPSEASIPFVDVWIRLPELSIEYYDKEVF
        W +G ++H R+L  R                    F ++F +  D    L   PW  +   I +  W  +F P+E  + F+DVW+ +  + + Y  +   
Subjt:  WIVGKKIHPRLLALRLRRHLCLTGELDVFELGLGFFVLKFSNSSDYSKALEERPWSISHLCIHVFPWIPNFKPSEASIPFVDVWIRLPELSIEYYDKEVF

Query:  EKIAKAIGVCLVKIDSVTERREKCMFARLCIRITLCNPL--ILSIQFG---RILQKIEYEGLDSLCSVCGCVDDLKHDC
        E IA  +G  +V +D   E   +  F R+ +R+    PL     ++F    R +   EYE L  +C+ C  V+     C
Subjt:  EKIAKAIGVCLVKIDSVTERREKCMFARLCIRITLCNPL--ILSIQFG---RILQKIEYEGLDSLCSVCGCVDDLKHDC


Sequences
CDS sequence
ATGGCGCCGGTTCAATCCAAAAACTCCTTCTCCAGCCGCCAACCCACCGCCGACGACGAACCCACTATCACTAGCAGGAAGAAGTACAAGACACCGATTTCTTCCTCATC
GGACTTCAAACCCCATCATCCAACCACCACCGCCGCCTCCGTCTGTAACCTTAGTCCCTCACAAACAGCTCGTATCACTCAACAGTTCGATCACTCTCTCATAGCCTGGA
TCGTCGGCAAGAAGATCCATCCACGGCTACTCGCCCTTCGCCTTCGCCGCCATCTTTGTCTCACTGGAGAATTGGACGTCTTCGAGCTAGGGCTTGGCTTTTTCGTTCTC
AAATTCTCCAACTCTTCGGACTACTCCAAAGCTCTCGAAGAGCGTCCTTGGTCGATTTCTCACCTTTGCATCCATGTATTTCCATGGATTCCCAATTTCAAACCCTCGGA
AGCCTCGATTCCTTTTGTTGATGTCTGGATTCGCCTCCCGGAGCTTAGTATCGAGTATTACGACAAGGAGGTTTTCGAGAAAATTGCGAAAGCCATTGGCGTCTGTCTCG
TGAAGATCGATTCTGTTACTGAAAGACGAGAGAAATGTATGTTTGCTCGACTCTGTATTAGGATAACTTTATGCAATCCCCTTATTTTGAGTATCCAATTTGGGCGAATT
CTGCAGAAAATTGAATATGAGGGTTTAGATTCTTTGTGCTCTGTTTGTGGATGTGTTGATGATCTGAAACATGATTGTTTGAATAACCCTTCTGGTTCTTCTGGGTGTGG
CCCGAGTTCGAGTTCGGTGTCGAACCCATTGATTCATTCTTTACATTCATCAGAATCAGCATTGGGATCCCAATCCCAAGAAAAAGACCCATTTATTGAGTTGAAATTGA
AGGACTGTAAGCCAGTAGTTGAAAATGAAAAGAAAACTTTGCCCAACTTCCCAAAAGAATCTTCAACTACAACCATGAGAACTCCCGAGTCAAAACACACCAATTTGATT
CAATCTGTGCCTTTAGCTTCTTCTGTTGTTGTAGATCAGTTCAGGGCTGCAAAAGCCAGTAGCCACACAAAGCTTGCAGTCCATAAGAATGAATCACAATCATCATCTCC
TGTGGAGGCTGGCCTCAATTTCTTTTCGAGCGTGATCCAACAATCAACAGTAGAGAAAGAGATGATCAACACACCATTTGGAGGAATCAATGTTGTTGATAGTTTTCCGA
CTGTTTACATGATTGATCCAACGACCACGAGCCTTGGAATTGATCTGTCAGAAGTGCCAACAACCACAGGATCAAACCAAGCCCAGTATGCTACTAACTTTGTGCTGAAT
TCGAGAGGTGAAAACGAGAATGAAGTTGATTCGGAGGTTGTATCAATGTCTGCATTGTGTTCTAAGAAAATGTTGTGCTGGAATTTTCGTGGGACTGACAATACAAAGCT
GATACGAGCATCGAAAGATTTGATTCGACTCCATGAGCCATCCATTGTGCTGATCTTTGGCTCCAAGATCAGCAGTGCTGATGCAGATGAGGTTGCGCAGGAGCTTGCTT
TTGACGGCTCGTATTGTAGGAAGCCTAATTGCTACAATGGTGGTGTTTGGATGTTGTTGTCCCGGCAAGATGTTCAAATTGAAGTCAGTTCCTACAGCCCAAAGAAGGTT
TCTGCATCAGTGCATTTCCATTATAAACTCAATGAACCAGCGCAAGGTCTTTTGGATGAAGATATCGAAACATCACCGCAACCATGGGGACCAAACTTCTTCTTTGCTTC
AACAAGGTCTTCTTTCCTACGTAAATTTGCCATTCCATTGCTCCAATCTCAATCATCTCCTCCTCCAAATTCCAATCCTCCGTTCATGGCGCCGGTTCAATCCAAACATT
CCCTTTCCGGCCGCCACGACGAACCCACCACCACTAGCAGGAAGAAGTACAAGACACCGATTTTTTCCTCATCGGACTTCAAACTCCATCATCCAACCACCACCGCCGCC
TCTGTCTGTAACCTTACTCCCTTACAAACAGCACGTATCACTCAACACTTCGATCACTCTCTCATAGCCTGGATCGTCGGCAAGAAGATCCATCCACAGCGACTTGCCTT
TCGTCTTCACCGTCATCTTCATCTCGCCGGAGATTTGGACGTCATCGAGCTAGGGCTTGGGTTTTTTGTCCTTAAATTTTCCAACTCTTCGGACTACTTCAAAGCTCTCG
AAGAGCGTCCTTGGTCGATTTCTCACCTTTGCATCCATGTATTTCCATGGATTCCCAATTTCAAACCCTCGGAAGCCTTGGTTCCTTTTGTTGATATCTGGATTCGCCTC
CCGGAGCTGAGTATCGAGTATTACGACAAGGAGGTTTTCGAGAAAATTTCGGAAGCCATCGGCGGCTGTCTCGTGAAGATCGATCCGGTAACTGAAAAACGAGAGAAATG
TATGTTTGCTCGAATCTGTATTAGGATAACTCTATGTAATCCCCTTATTTATAGTATCCAACTTGAGCGAATCCAGCAAAAAATTGAATATGAGGGTTTAGACTCTTTGT
GCCCTATTTGTGGATGTGTTTATGATCTGAAACATGATTGTTTGAATCAGAATAACCCTTCTGGTTCTTCTGGATTTGATCCCCATCAACATAGACCTCGTCCATTGCAG
GCAATTGACCCGATTTCGAGTTCGGGTTCGTTGTCGCACCCTTTGATTCATTCTCTACCTTCATCAGAATCAGGATTGGGATCCAAATCCCAAGTAAAAGACCCATTTAT
TGAGTTGAAATTGAAGGATTGTCCAAGGCTTAAAATGGGTGAATGTAAGCCTCATGTTCATGTGGAAGCTAAAGTAGTTGAAAATGAAAAGAAAACTTTGCCCAACTTCC
CAAAAGAATCTTCAACTACAACCATGAAAACTCCCGAGTCAAAACACACCAATTTGATTCAATCTGTGCGTTTAGCTTCTTCTGTTGTTGTAGATCAGTTCAGGGCTGCA
AAAGCCAGTAACCCCACAAAGCTTGCAGTCCATAAGAATGAGTCACAATTATCATCTCCTGTGGAGGCTGGCCTCAGTTTCTTTCCGGGTGTGATCCAACAATCAACAGT
AGAGAAAGAGATGGTCAACACACCATTTGGAGGAATCAATGTCGTTGATAGTTTTCCAACTGTTTACACGATTGATCCAACGGCCACAAGCCTCGGAATTGATCTGTCAG
AAGTGCCAACAACCAGAGGATCAAACCAAACCCAGTATGCTACTGACTTTGTGCTGAATTCAAAAGGTGAAAACGAGAATGAAGTTGATTCGGAGGTTGTACCAATGCCT
ACATTGTGTTCTAAGAAAATGTTGTGCTGGAATTTTCGTGGGAGTGAGAATACAAAGCTGATACGAGCATCGAAAGATTTGATTAGACTCCAGGAGCCATCCATTGTGCT
GATCTTTGGCTCCAAGATCAGCAGTGCTGATGCGGATGAGGTTGCGCAGGAGCTTGCTTTCGACGGCTCGTATTGTAGGAAGCCTAATTGCTACAATGGTGGTGTTTGTA
AGATTATGAATTGTTGGAGATATAGTGAATTGTAA
mRNA sequence
ATGGCGCCGGTTCAATCCAAAAACTCCTTCTCCAGCCGCCAACCCACCGCCGACGACGAACCCACTATCACTAGCAGGAAGAAGTACAAGACACCGATTTCTTCCTCATC
GGACTTCAAACCCCATCATCCAACCACCACCGCCGCCTCCGTCTGTAACCTTAGTCCCTCACAAACAGCTCGTATCACTCAACAGTTCGATCACTCTCTCATAGCCTGGA
TCGTCGGCAAGAAGATCCATCCACGGCTACTCGCCCTTCGCCTTCGCCGCCATCTTTGTCTCACTGGAGAATTGGACGTCTTCGAGCTAGGGCTTGGCTTTTTCGTTCTC
AAATTCTCCAACTCTTCGGACTACTCCAAAGCTCTCGAAGAGCGTCCTTGGTCGATTTCTCACCTTTGCATCCATGTATTTCCATGGATTCCCAATTTCAAACCCTCGGA
AGCCTCGATTCCTTTTGTTGATGTCTGGATTCGCCTCCCGGAGCTTAGTATCGAGTATTACGACAAGGAGGTTTTCGAGAAAATTGCGAAAGCCATTGGCGTCTGTCTCG
TGAAGATCGATTCTGTTACTGAAAGACGAGAGAAATGTATGTTTGCTCGACTCTGTATTAGGATAACTTTATGCAATCCCCTTATTTTGAGTATCCAATTTGGGCGAATT
CTGCAGAAAATTGAATATGAGGGTTTAGATTCTTTGTGCTCTGTTTGTGGATGTGTTGATGATCTGAAACATGATTGTTTGAATAACCCTTCTGGTTCTTCTGGGTGTGG
CCCGAGTTCGAGTTCGGTGTCGAACCCATTGATTCATTCTTTACATTCATCAGAATCAGCATTGGGATCCCAATCCCAAGAAAAAGACCCATTTATTGAGTTGAAATTGA
AGGACTGTAAGCCAGTAGTTGAAAATGAAAAGAAAACTTTGCCCAACTTCCCAAAAGAATCTTCAACTACAACCATGAGAACTCCCGAGTCAAAACACACCAATTTGATT
CAATCTGTGCCTTTAGCTTCTTCTGTTGTTGTAGATCAGTTCAGGGCTGCAAAAGCCAGTAGCCACACAAAGCTTGCAGTCCATAAGAATGAATCACAATCATCATCTCC
TGTGGAGGCTGGCCTCAATTTCTTTTCGAGCGTGATCCAACAATCAACAGTAGAGAAAGAGATGATCAACACACCATTTGGAGGAATCAATGTTGTTGATAGTTTTCCGA
CTGTTTACATGATTGATCCAACGACCACGAGCCTTGGAATTGATCTGTCAGAAGTGCCAACAACCACAGGATCAAACCAAGCCCAGTATGCTACTAACTTTGTGCTGAAT
TCGAGAGGTGAAAACGAGAATGAAGTTGATTCGGAGGTTGTATCAATGTCTGCATTGTGTTCTAAGAAAATGTTGTGCTGGAATTTTCGTGGGACTGACAATACAAAGCT
GATACGAGCATCGAAAGATTTGATTCGACTCCATGAGCCATCCATTGTGCTGATCTTTGGCTCCAAGATCAGCAGTGCTGATGCAGATGAGGTTGCGCAGGAGCTTGCTT
TTGACGGCTCGTATTGTAGGAAGCCTAATTGCTACAATGGTGGTGTTTGGATGTTGTTGTCCCGGCAAGATGTTCAAATTGAAGTCAGTTCCTACAGCCCAAAGAAGGTT
TCTGCATCAGTGCATTTCCATTATAAACTCAATGAACCAGCGCAAGGTCTTTTGGATGAAGATATCGAAACATCACCGCAACCATGGGGACCAAACTTCTTCTTTGCTTC
AACAAGGTCTTCTTTCCTACGTAAATTTGCCATTCCATTGCTCCAATCTCAATCATCTCCTCCTCCAAATTCCAATCCTCCGTTCATGGCGCCGGTTCAATCCAAACATT
CCCTTTCCGGCCGCCACGACGAACCCACCACCACTAGCAGGAAGAAGTACAAGACACCGATTTTTTCCTCATCGGACTTCAAACTCCATCATCCAACCACCACCGCCGCC
TCTGTCTGTAACCTTACTCCCTTACAAACAGCACGTATCACTCAACACTTCGATCACTCTCTCATAGCCTGGATCGTCGGCAAGAAGATCCATCCACAGCGACTTGCCTT
TCGTCTTCACCGTCATCTTCATCTCGCCGGAGATTTGGACGTCATCGAGCTAGGGCTTGGGTTTTTTGTCCTTAAATTTTCCAACTCTTCGGACTACTTCAAAGCTCTCG
AAGAGCGTCCTTGGTCGATTTCTCACCTTTGCATCCATGTATTTCCATGGATTCCCAATTTCAAACCCTCGGAAGCCTTGGTTCCTTTTGTTGATATCTGGATTCGCCTC
CCGGAGCTGAGTATCGAGTATTACGACAAGGAGGTTTTCGAGAAAATTTCGGAAGCCATCGGCGGCTGTCTCGTGAAGATCGATCCGGTAACTGAAAAACGAGAGAAATG
TATGTTTGCTCGAATCTGTATTAGGATAACTCTATGTAATCCCCTTATTTATAGTATCCAACTTGAGCGAATCCAGCAAAAAATTGAATATGAGGGTTTAGACTCTTTGT
GCCCTATTTGTGGATGTGTTTATGATCTGAAACATGATTGTTTGAATCAGAATAACCCTTCTGGTTCTTCTGGATTTGATCCCCATCAACATAGACCTCGTCCATTGCAG
GCAATTGACCCGATTTCGAGTTCGGGTTCGTTGTCGCACCCTTTGATTCATTCTCTACCTTCATCAGAATCAGGATTGGGATCCAAATCCCAAGTAAAAGACCCATTTAT
TGAGTTGAAATTGAAGGATTGTCCAAGGCTTAAAATGGGTGAATGTAAGCCTCATGTTCATGTGGAAGCTAAAGTAGTTGAAAATGAAAAGAAAACTTTGCCCAACTTCC
CAAAAGAATCTTCAACTACAACCATGAAAACTCCCGAGTCAAAACACACCAATTTGATTCAATCTGTGCGTTTAGCTTCTTCTGTTGTTGTAGATCAGTTCAGGGCTGCA
AAAGCCAGTAACCCCACAAAGCTTGCAGTCCATAAGAATGAGTCACAATTATCATCTCCTGTGGAGGCTGGCCTCAGTTTCTTTCCGGGTGTGATCCAACAATCAACAGT
AGAGAAAGAGATGGTCAACACACCATTTGGAGGAATCAATGTCGTTGATAGTTTTCCAACTGTTTACACGATTGATCCAACGGCCACAAGCCTCGGAATTGATCTGTCAG
AAGTGCCAACAACCAGAGGATCAAACCAAACCCAGTATGCTACTGACTTTGTGCTGAATTCAAAAGGTGAAAACGAGAATGAAGTTGATTCGGAGGTTGTACCAATGCCT
ACATTGTGTTCTAAGAAAATGTTGTGCTGGAATTTTCGTGGGAGTGAGAATACAAAGCTGATACGAGCATCGAAAGATTTGATTAGACTCCAGGAGCCATCCATTGTGCT
GATCTTTGGCTCCAAGATCAGCAGTGCTGATGCGGATGAGGTTGCGCAGGAGCTTGCTTTCGACGGCTCGTATTGTAGGAAGCCTAATTGCTACAATGGTGGTGTTTGTA
AGATTATGAATTGTTGGAGATATAGTGAATTGTAA
Protein sequence
MAPVQSKNSFSSRQPTADDEPTITSRKKYKTPISSSSDFKPHHPTTTAASVCNLSPSQTARITQQFDHSLIAWIVGKKIHPRLLALRLRRHLCLTGELDVFELGLGFFVL
KFSNSSDYSKALEERPWSISHLCIHVFPWIPNFKPSEASIPFVDVWIRLPELSIEYYDKEVFEKIAKAIGVCLVKIDSVTERREKCMFARLCIRITLCNPLILSIQFGRI
LQKIEYEGLDSLCSVCGCVDDLKHDCLNNPSGSSGCGPSSSSVSNPLIHSLHSSESALGSQSQEKDPFIELKLKDCKPVVENEKKTLPNFPKESSTTTMRTPESKHTNLI
QSVPLASSVVVDQFRAAKASSHTKLAVHKNESQSSSPVEAGLNFFSSVIQQSTVEKEMINTPFGGINVVDSFPTVYMIDPTTTSLGIDLSEVPTTTGSNQAQYATNFVLN
SRGENENEVDSEVVSMSALCSKKMLCWNFRGTDNTKLIRASKDLIRLHEPSIVLIFGSKISSADADEVAQELAFDGSYCRKPNCYNGGVWMLLSRQDVQIEVSSYSPKKV
SASVHFHYKLNEPAQGLLDEDIETSPQPWGPNFFFASTRSSFLRKFAIPLLQSQSSPPPNSNPPFMAPVQSKHSLSGRHDEPTTTSRKKYKTPIFSSSDFKLHHPTTTAA
SVCNLTPLQTARITQHFDHSLIAWIVGKKIHPQRLAFRLHRHLHLAGDLDVIELGLGFFVLKFSNSSDYFKALEERPWSISHLCIHVFPWIPNFKPSEALVPFVDIWIRL
PELSIEYYDKEVFEKISEAIGGCLVKIDPVTEKREKCMFARICIRITLCNPLIYSIQLERIQQKIEYEGLDSLCPICGCVYDLKHDCLNQNNPSGSSGFDPHQHRPRPLQ
AIDPISSSGSLSHPLIHSLPSSESGLGSKSQVKDPFIELKLKDCPRLKMGECKPHVHVEAKVVENEKKTLPNFPKESSTTTMKTPESKHTNLIQSVRLASSVVVDQFRAA
KASNPTKLAVHKNESQLSSPVEAGLSFFPGVIQQSTVEKEMVNTPFGGINVVDSFPTVYTIDPTATSLGIDLSEVPTTRGSNQTQYATDFVLNSKGENENEVDSEVVPMP
TLCSKKMLCWNFRGSENTKLIRASKDLIRLQEPSIVLIFGSKISSADADEVAQELAFDGSYCRKPNCYNGGVCKIMNCWRYSEL