; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmaCh17G005160 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene IDCmaCh17G005160
OrganismCucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
DescriptionDUF761 domain-containing protein
Genome locationCma_Chr17:3497451..3499226
RNA-Seq ExpressionCmaCh17G005160
SyntenyCmaCh17G005160
Gene Ontology termsGO:0016020 - membrane (cellular component)
InterPro domainsIPR008480 - Protein of unknown function DUF761, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6575261.1 hypothetical protein SDJN03_25900, partial [Cucurbita argyrosperma subsp. sororia]3.6e-30594.6Show/hide
Query:  MASSASSPFTKLHFPHSPLPQPPATHHSNSCAQFLCKSLFFCFFLLLLPLFPSEAPDFVDQTLFTKFWELFHLMLVGIAVSYGLFSTRNNQMNVDEPRYS
        MASSASSPFTKLHFPHSPLPQPPA    NSCAQFLCKS+FFCFFLLLLPLFPSEAPDFVDQTLFTKFWELFHLM VGIAVSYGLFSTRNNQMNVDEPRYS
Subjt:  MASSASSPFTKLHFPHSPLPQPPATHHSNSCAQFLCKSLFFCFFLLLLPLFPSEAPDFVDQTLFTKFWELFHLMLVGIAVSYGLFSTRNNQMNVDEPRYS

Query:  SFENPQSYLSKMLYVASIFDDVDDFSVSDERKLSEVLYIQPNLGSASDLNAQSRQQEKLRYSIPKKRYENSYEFADTDNVAHACKSRYTRGGSVVVVPET
        SFENPQSYLSKMLYVASIFDDVDDF VSDERK+SEVLYIQP LGSASDLNAQSR QEKLRYS+PKKRYENSYEFADTDNVAHACKSRYTRGGSVVVVPET
Subjt:  SFENPQSYLSKMLYVASIFDDVDDFSVSDERKLSEVLYIQPNLGSASDLNAQSRQQEKLRYSIPKKRYENSYEFADTDNVAHACKSRYTRGGSVVVVPET

Query:  NRSSSGGIVNYKPLGLPVRSLKSSLTESDDVEFDCGDESCLSSKSSPKSSENNCEGNSEFGDNCCVNLEEKFDETAIASMSSFQLREKFGKKVIRERGFG
        NRSSSGGIVNYKPLGLPVRSL+SSLTESDDVEFDCGDESCLSSKSSPKSSENNCEGNSEFGDNCCVNLEEKFDETAIASMSSFQLREKFGKKVIRERGFG
Subjt:  NRSSSGGIVNYKPLGLPVRSLKSSLTESDDVEFDCGDESCLSSKSSPKSSENNCEGNSEFGDNCCVNLEEKFDETAIASMSSFQLREKFGKKVIRERGFG

Query:  NAVLRPSHFRPPSIDETQFESLKKSGSLHSNLSQSSQTSSLSSSLSSTTRKHHKMSSLSNISYKSLHSRQYSMSSLSENSRGSSEDPLIEQENSSECNES
        NAVLRPSHFRPPSIDETQFESL+KSGSLHS+LSQSSQTSSLSS LSSTTRKH KMSSLSNISYKSLHSRQYSMSSLSENSRGSSEDPLIEQENSSECNES
Subjt:  NAVLRPSHFRPPSIDETQFESLKKSGSLHSNLSQSSQTSSLSSSLSSTTRKHHKMSSLSNISYKSLHSRQYSMSSLSENSRGSSEDPLIEQENSSECNES

Query:  VVSSPRSDMNFRSIPKALSQGKSIRRIQANAAAIEDIKAQEMHRKQVKHDDIIGNKFEEGGTS-PYIREDGTGHGWPDVANPNASNMSRFPTTTFLGIKE
        VVSSPRSD NF SIPKALSQGKS+RRI+ANAAAIED+KAQEMHRKQVKHDDIIGNKFEEGG S PY+REDGTGHGWPDV NPNA NM+RFP TTFLGIKE
Subjt:  VVSSPRSDMNFRSIPKALSQGKSIRRIQANAAAIEDIKAQEMHRKQVKHDDIIGNKFEEGGTS-PYIREDGTGHGWPDVANPNASNMSRFPTTTFLGIKE

Query:  QKEETESLVADDSKDDSEGEDESFFASSDEEAASSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLR-GGGGWGSFSSTSSSYFS
        QKEETESLVADDSKD SEGEDES FASSDEEA SSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLR GGGGWGSFSSTSSSYFS
Subjt:  QKEETESLVADDSKDDSEGEDESFFASSDEEAASSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLR-GGGGWGSFSSTSSSYFS

KAG7013816.1 hypothetical protein SDJN02_23985, partial [Cucurbita argyrosperma subsp. argyrosperma]7.9e-30594.6Show/hide
Query:  MASSASSPFTKLHFPHSPLPQPPATHHSNSCAQFLCKSLFFCFFLLLLPLFPSEAPDFVDQTLFTKFWELFHLMLVGIAVSYGLFSTRNNQMNVDEPRYS
        MASSASSPFTKLHFPHSPLPQPPA    NSCAQFLCKS+FFCFFLLLLPLFPSEAPDFVDQTLFTKFWELFHLM VGIAVSYGLFSTRNNQMNVDEPRYS
Subjt:  MASSASSPFTKLHFPHSPLPQPPATHHSNSCAQFLCKSLFFCFFLLLLPLFPSEAPDFVDQTLFTKFWELFHLMLVGIAVSYGLFSTRNNQMNVDEPRYS

Query:  SFENPQSYLSKMLYVASIFDDVDDFSVSDERKLSEVLYIQPNLGSASDLNAQSRQQEKLRYSIPKKRYENSYEFADTDNVAHACKSRYTRGGSVVVVPET
        SFENPQSYLSKMLYVASIFDDVDDF VSDERK+SEVLYIQP LGSASDLNAQSR QEKLRYS+PKKRYENSYEFADTDNVAHACKSRYTRGGSVVVVPET
Subjt:  SFENPQSYLSKMLYVASIFDDVDDFSVSDERKLSEVLYIQPNLGSASDLNAQSRQQEKLRYSIPKKRYENSYEFADTDNVAHACKSRYTRGGSVVVVPET

Query:  NRSSSGGIVNYKPLGLPVRSLKSSLTESDDVEFDCGDESCLSSKSSPKSSENNCEGNSEFGDNCCVNLEEKFDETAIASMSSFQLREKFGKKVIRERGFG
        NRSSSGGIVNYKPLGLPVRSL+SSLTESDDVEFDCGDESCLSSKSSPKSSENNCEGNSEFGDNCCVNLEEKFDETAIASMSSFQLREKFGKKVIRERGFG
Subjt:  NRSSSGGIVNYKPLGLPVRSLKSSLTESDDVEFDCGDESCLSSKSSPKSSENNCEGNSEFGDNCCVNLEEKFDETAIASMSSFQLREKFGKKVIRERGFG

Query:  NAVLRPSHFRPPSIDETQFESLKKSGSLHSNLSQSSQTSSLSSSLSSTTRKHHKMSSLSNISYKSLHSRQYSMSSLSENSRGSSEDPLIEQENSSECNES
        NAVLRPSHFRPPSIDETQFESL+KSGSLHS+LSQSSQTSSLSS LSSTTRKH KMSSLSNISYKSLHSRQYSMSSLSENSRGSSEDPLIEQENSSECNES
Subjt:  NAVLRPSHFRPPSIDETQFESLKKSGSLHSNLSQSSQTSSLSSSLSSTTRKHHKMSSLSNISYKSLHSRQYSMSSLSENSRGSSEDPLIEQENSSECNES

Query:  VVSSPRSDMNFRSIPKALSQGKSIRRIQANAAAIEDIKAQEMHRKQVKHDDIIGNKFEEGGTS-PYIREDGTGHGWPDVANPNASNMSRFPTTTFLGIKE
        VVSSPRSD NF SIPKALSQGKS+RRI+ANAAAIED+KAQEMHRKQVKHDDIIGNKFEEGG S PY+REDGTGHGWPDV NPNA NM+RFP TTFLGIKE
Subjt:  VVSSPRSDMNFRSIPKALSQGKSIRRIQANAAAIEDIKAQEMHRKQVKHDDIIGNKFEEGGTS-PYIREDGTGHGWPDVANPNASNMSRFPTTTFLGIKE

Query:  QKEETESLVADDSKDDSEGEDESFFASSDEEAASSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLR-GGGGWGSFSSTSSSYFS
        QKEETESLVADDSKD SEGEDES FASSDEEA SSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLR GGGGWGSFSSTSSSYFS
Subjt:  QKEETESLVADDSKDDSEGEDESFFASSDEEAASSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLR-GGGGWGSFSSTSSSYFS

XP_022958845.1 uncharacterized protein LOC111459998 [Cucurbita moschata]7.4e-30393.96Show/hide
Query:  MASSASSPFTKLHFPHSPLPQPPATHHSNSCAQFLCKSLFFCFFLLLLPLFPSEAPDFVDQTLFTKFWELFHLMLVGIAVSYGLFSTRNNQMNVDEPRYS
        MASSASSPFTKLHFPHSPLPQPPA    NSCAQFLCKS+FFCFFLLLLPLFPSEAPDFVDQTLFTKFWELFHLM VGIAVSYGLFSTRNNQMNVDEPRYS
Subjt:  MASSASSPFTKLHFPHSPLPQPPATHHSNSCAQFLCKSLFFCFFLLLLPLFPSEAPDFVDQTLFTKFWELFHLMLVGIAVSYGLFSTRNNQMNVDEPRYS

Query:  SFENPQSYLSKMLYVASIFDDVDDFSVSDERKLSEVLYIQPNLGSASDLNAQSRQQEKLRYSIPKKRYENSYEFADTDNVAHACKSRYTRGGSVVVVPET
        SFENPQSYLSKMLYVASIFDDVDDF VSDERK+SEVLYIQP LGSASDLNAQSR QEKLRYS+PKKRYENSYEFADTDNVAHACKSRYTRGGSVVVVPET
Subjt:  SFENPQSYLSKMLYVASIFDDVDDFSVSDERKLSEVLYIQPNLGSASDLNAQSRQQEKLRYSIPKKRYENSYEFADTDNVAHACKSRYTRGGSVVVVPET

Query:  NRSSSGGIVNYKPLGLPVRSLKSSLTESDDVEFDCGDESCLSSKSSPKSSENNCEGNSEFGDNCCVNLEEKFDETAIASMSSFQLREKFGKKVIRERGFG
        NRSSSGGIVNYKPLGLPVRSL+SSLTESDDVEFDCGDESCLSSKSSPKSSENNCEGNSEFGDNCCVNLEEKFDETAIASMSSFQLREKFGKKVIRERGFG
Subjt:  NRSSSGGIVNYKPLGLPVRSLKSSLTESDDVEFDCGDESCLSSKSSPKSSENNCEGNSEFGDNCCVNLEEKFDETAIASMSSFQLREKFGKKVIRERGFG

Query:  NAVLRPSHFRPPSIDETQFESLKKSGSLHSNLSQSSQTSSLSSSLSSTTRKHHKMSSLSNISYKSLHSRQYSMSSLSENSRGSSEDPLIEQENSSECNES
        NAVLRPSHFRPPSIDETQFESLKKSGSLHS LSQSSQTSSLSS LSSTTRK  KMSSLSNISYKSLHSRQYS SSLSENSRGSSEDPLIEQENSSECNES
Subjt:  NAVLRPSHFRPPSIDETQFESLKKSGSLHSNLSQSSQTSSLSSSLSSTTRKHHKMSSLSNISYKSLHSRQYSMSSLSENSRGSSEDPLIEQENSSECNES

Query:  VVSSPRSDMNFRSIPKALSQGKSIRRIQANAAAIEDIKAQEMHRKQVKHDDIIGNKFEEGGTS-PYIREDGTGHGWPDVANPNASNMSRFPTTTFLGIKE
        VVSSPRSD NF SIPKALSQGKS+RRI+ANAAAIED+KAQEMHRKQVKHDDIIGNKFEEGG S PY+REDGTG GWPDV NPNA NM+RFP TTFLGIKE
Subjt:  VVSSPRSDMNFRSIPKALSQGKSIRRIQANAAAIEDIKAQEMHRKQVKHDDIIGNKFEEGGTS-PYIREDGTGHGWPDVANPNASNMSRFPTTTFLGIKE

Query:  QKEETESLVADDSKDDSEGEDESFFASSDEEAASSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLR----GGGGWGSFSSTSSSYFS
        QKEETESLVADDSKDDSEGEDES FASSDEEA SSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLR    GGGGWGSFSSTSSSYFS
Subjt:  QKEETESLVADDSKDDSEGEDESFFASSDEEAASSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLR----GGGGWGSFSSTSSSYFS

XP_023006022.1 uncharacterized protein LOC111498900 [Cucurbita maxima]0.0e+00100Show/hide
Query:  MASSASSPFTKLHFPHSPLPQPPATHHSNSCAQFLCKSLFFCFFLLLLPLFPSEAPDFVDQTLFTKFWELFHLMLVGIAVSYGLFSTRNNQMNVDEPRYS
        MASSASSPFTKLHFPHSPLPQPPATHHSNSCAQFLCKSLFFCFFLLLLPLFPSEAPDFVDQTLFTKFWELFHLMLVGIAVSYGLFSTRNNQMNVDEPRYS
Subjt:  MASSASSPFTKLHFPHSPLPQPPATHHSNSCAQFLCKSLFFCFFLLLLPLFPSEAPDFVDQTLFTKFWELFHLMLVGIAVSYGLFSTRNNQMNVDEPRYS

Query:  SFENPQSYLSKMLYVASIFDDVDDFSVSDERKLSEVLYIQPNLGSASDLNAQSRQQEKLRYSIPKKRYENSYEFADTDNVAHACKSRYTRGGSVVVVPET
        SFENPQSYLSKMLYVASIFDDVDDFSVSDERKLSEVLYIQPNLGSASDLNAQSRQQEKLRYSIPKKRYENSYEFADTDNVAHACKSRYTRGGSVVVVPET
Subjt:  SFENPQSYLSKMLYVASIFDDVDDFSVSDERKLSEVLYIQPNLGSASDLNAQSRQQEKLRYSIPKKRYENSYEFADTDNVAHACKSRYTRGGSVVVVPET

Query:  NRSSSGGIVNYKPLGLPVRSLKSSLTESDDVEFDCGDESCLSSKSSPKSSENNCEGNSEFGDNCCVNLEEKFDETAIASMSSFQLREKFGKKVIRERGFG
        NRSSSGGIVNYKPLGLPVRSLKSSLTESDDVEFDCGDESCLSSKSSPKSSENNCEGNSEFGDNCCVNLEEKFDETAIASMSSFQLREKFGKKVIRERGFG
Subjt:  NRSSSGGIVNYKPLGLPVRSLKSSLTESDDVEFDCGDESCLSSKSSPKSSENNCEGNSEFGDNCCVNLEEKFDETAIASMSSFQLREKFGKKVIRERGFG

Query:  NAVLRPSHFRPPSIDETQFESLKKSGSLHSNLSQSSQTSSLSSSLSSTTRKHHKMSSLSNISYKSLHSRQYSMSSLSENSRGSSEDPLIEQENSSECNES
        NAVLRPSHFRPPSIDETQFESLKKSGSLHSNLSQSSQTSSLSSSLSSTTRKHHKMSSLSNISYKSLHSRQYSMSSLSENSRGSSEDPLIEQENSSECNES
Subjt:  NAVLRPSHFRPPSIDETQFESLKKSGSLHSNLSQSSQTSSLSSSLSSTTRKHHKMSSLSNISYKSLHSRQYSMSSLSENSRGSSEDPLIEQENSSECNES

Query:  VVSSPRSDMNFRSIPKALSQGKSIRRIQANAAAIEDIKAQEMHRKQVKHDDIIGNKFEEGGTSPYIREDGTGHGWPDVANPNASNMSRFPTTTFLGIKEQ
        VVSSPRSDMNFRSIPKALSQGKSIRRIQANAAAIEDIKAQEMHRKQVKHDDIIGNKFEEGGTSPYIREDGTGHGWPDVANPNASNMSRFPTTTFLGIKEQ
Subjt:  VVSSPRSDMNFRSIPKALSQGKSIRRIQANAAAIEDIKAQEMHRKQVKHDDIIGNKFEEGGTSPYIREDGTGHGWPDVANPNASNMSRFPTTTFLGIKEQ

Query:  KEETESLVADDSKDDSEGEDESFFASSDEEAASSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGGGWGSFSSTSSSYFS
        KEETESLVADDSKDDSEGEDESFFASSDEEAASSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGGGWGSFSSTSSSYFS
Subjt:  KEETESLVADDSKDDSEGEDESFFASSDEEAASSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGGGWGSFSSTSSSYFS

XP_023548366.1 uncharacterized protein LOC111807030 [Cucurbita pepo subsp. pepo]7.4e-30393.61Show/hide
Query:  MASSASSPFTKLHFPHSPLPQPPATHHSNSCAQFLCKSLFFCFFLLLLPLFPSEAPDFVDQTLFTKFWELFHLMLVGIAVSYGLFSTRNNQMNVDEPRYS
        MASSASSPFTKLHFPHSPLPQPPA    NSCAQFLCKS+FFCFFLLLLPLFPSEAP FVD TLFTKFWELFHLM VGIAVSYGLFSTRNNQMNVDEPRYS
Subjt:  MASSASSPFTKLHFPHSPLPQPPATHHSNSCAQFLCKSLFFCFFLLLLPLFPSEAPDFVDQTLFTKFWELFHLMLVGIAVSYGLFSTRNNQMNVDEPRYS

Query:  SFENPQSYLSKMLYVASIFDDVDDFSVSDERKLSEVLYIQPNLGSASDLNAQSRQQEKLRYSIPKKRYENSYEFADTDNVAHACKSRYTRGGSVVVVPET
        SFENPQSYLSKMLYVASIFDDVDDF VSDERK+SEVLYIQP LGSASD NAQSR QEKLRYSIPKKRYENSYEFADTDNVAHACKSRYTRGGSVVVVPET
Subjt:  SFENPQSYLSKMLYVASIFDDVDDFSVSDERKLSEVLYIQPNLGSASDLNAQSRQQEKLRYSIPKKRYENSYEFADTDNVAHACKSRYTRGGSVVVVPET

Query:  NRSSSGGIVNYKPLGLPVRSLKSSLTESDDVEFDCGDESCLSSKSSPKSSENNCEGNSEFGDNCCVNLEEKFDETAIASMSSFQLREKFGKKVIRERGFG
        NRSSSGGIVNYKPLGLPVRSL+SSLTESDDVEFDCGDESCLSSKSSP+SSENNCEGNSEFGD+CCVNLEEKFDETAIASMSSFQLREKFGKKVIRERGFG
Subjt:  NRSSSGGIVNYKPLGLPVRSLKSSLTESDDVEFDCGDESCLSSKSSPKSSENNCEGNSEFGDNCCVNLEEKFDETAIASMSSFQLREKFGKKVIRERGFG

Query:  NAVLRPSHFRPPSIDETQFESLKKSGSLHSNLSQSSQTSSLSSSLSSTTRKHHKMSSLSNISYKSLHSRQYSMSSLSENSRGSSEDPLIEQENSSECNES
        NAVLRPSHFRPPSIDETQFESLKKSGSLHSNLSQSSQTSSLSSSLSSTTR+H KMSSLSNISYKSLHSRQYSMSS+SENSRGSSEDPLIEQENSSECNES
Subjt:  NAVLRPSHFRPPSIDETQFESLKKSGSLHSNLSQSSQTSSLSSSLSSTTRKHHKMSSLSNISYKSLHSRQYSMSSLSENSRGSSEDPLIEQENSSECNES

Query:  VVSSPRSDMNFRSIPKALSQGKSIRRIQANAAAIEDIKAQEMHRKQVKHDDIIGNKFEEGGTS-PYIREDGTGHGWPDVANPNASNMSRFPTTTFLGIKE
        VVSSPRSD NF SIPKALSQGKS+RRI+ANAAA+ED+KAQEMHRKQVK DDIIGNKFEEGG S PY+REDGTGHGWPDV NPNASNM+RFP TTFLGIKE
Subjt:  VVSSPRSDMNFRSIPKALSQGKSIRRIQANAAAIEDIKAQEMHRKQVKHDDIIGNKFEEGGTS-PYIREDGTGHGWPDVANPNASNMSRFPTTTFLGIKE

Query:  QKEETESLVADDSKDDSEGEDESFFASSDEEAASSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLR---GGGGWGSFSSTSSSYFS
        QKEETES+VADDSKDDSEGEDES FASSDEEA SSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLR   GGGGWGSFSSTSSSYFS
Subjt:  QKEETESLVADDSKDDSEGEDESFFASSDEEAASSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLR---GGGGWGSFSSTSSSYFS

TrEMBL top hitse value%identityAlignment
A0A0A0K9X1 Uncharacterized protein3.0e-24978.77Show/hide
Query:  MASSASSPFTKLHFPHSPLPQPPATHHSNSCAQFLCKSLFFCFFLLLLPLFPSEAPDFVDQTLFTKFWELFHLMLVGIAVSYGLFSTRNNQMNV--DEPR
        MA S S+PFTK HFPHSPLP    T HSNSC QF+CKSLFFC FLLLLPLFPSEAP+FV+QT  TKFWELFHLM +GIAVSYGLFS RN Q++V  DEPR
Subjt:  MASSASSPFTKLHFPHSPLPQPPATHHSNSCAQFLCKSLFFCFFLLLLPLFPSEAPDFVDQTLFTKFWELFHLMLVGIAVSYGLFSTRNNQMNV--DEPR

Query:  YSSFENPQSYLSKMLYVASIFDDVDDFSVSDERKLSEVLYIQPNLGSASDLNAQSRQQEKLRYSIPKKRYENSYEFADTDNVAHACKSRYTRGGSVVVVP
        +S+FENPQSYLSKM +VASIF+DVDDFSVSDERKLSEVLYIQPNLGS S LNA SRQQE   YSIPKKRYENS EFA+TDNV HACKSRYTRGGSVVVV 
Subjt:  YSSFENPQSYLSKMLYVASIFDDVDDFSVSDERKLSEVLYIQPNLGSASDLNAQSRQQEKLRYSIPKKRYENSYEFADTDNVAHACKSRYTRGGSVVVVP

Query:  ETNRSS------SGGIVNYKPLGLPVRSLKSSLTESDDVEFDCGDESCLSSKSSPKSSENNCEGNSEFGDNCCVNLEEKFDETAIASMSSFQLREKFGKK
        ETNRS+      SG IVNYKPLGLPVRSLKSSLTE DDVEFDCGDESCLSSKSS K+SE+NCE  SEFGDNCCVNLEEKFDET IASMS FQLREKF K 
Subjt:  ETNRSS------SGGIVNYKPLGLPVRSLKSSLTESDDVEFDCGDESCLSSKSSPKSSENNCEGNSEFGDNCCVNLEEKFDETAIASMSSFQLREKFGKK

Query:  VIRERGFGNAVLRPSHFRPPSIDETQFESLKKSGSLHSNLSQSSQTSSLSSSLSSTTRKHHKMSSLSNISYKSLHSRQYSMSSLSENSRGSSEDPLIEQE
        ++RER   NAVLRPSHFRP SIDETQFESLKKS SLHSNLSQSSQTSSLSS LSS TRKH KMSSL NISYKS HSRQYS+SSLSENSRGSSEDPLI+ E
Subjt:  VIRERGFGNAVLRPSHFRPPSIDETQFESLKKSGSLHSNLSQSSQTSSLSSSLSSTTRKHHKMSSLSNISYKSLHSRQYSMSSLSENSRGSSEDPLIEQE

Query:  NSSECNESVVSSPRSDMNFRSIPKALSQGKSIRRIQANAAAIEDIKAQEMHRKQVKHDDIIGNKFEEGGTSPYIREDGTGHGWPDVANPNASNMSRF---
        NSSECNESVVSSPR D NF + PKALS+GKS+R ++A+ +AIE++KAQEM+R QV+HDD + NKF EGG SPY+RED TGHGWP + N NA+  +R+   
Subjt:  NSSECNESVVSSPRSDMNFRSIPKALSQGKSIRRIQANAAAIEDIKAQEMHRKQVKHDDIIGNKFEEGGTSPYIREDGTGHGWPDVANPNASNMSRF---

Query:  -PTTTFLGIKEQKEETESLVADDSKDDSEGEDESFFASSDEEAASSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGGGWGSFSSTSSS
          TTTF GI+EQKE+TES V DD KD+SE ED+SFF SSDEEAA SM GDSESGA EVDKKAGEFIAKFREQIQLQRMASV+KRLR  GGWGSFSST+SS
Subjt:  -PTTTFLGIKEQKEETESLVADDSKDDSEGEDESFFASSDEEAASSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGGGWGSFSSTSSS

Query:  YFS
        YFS
Subjt:  YFS

A0A5D3DMA5 DUF761 domain-containing protein3.2e-25179.4Show/hide
Query:  MASSASSPFTKLHFPHSPLPQPPATHHSNSCAQFLCKSLFFCFFLLLLPLFPSEAPDFVDQTLFTKFWELFHLMLVGIAVSYGLFSTRNNQMNV--DEPR
        MASS S+PFTK HFPHSPLP    T HSNSC  FLCKSLFFC FLLLLPLFPSEAP+FV+QTL TKFWELFHLM VGIAVSYGLFS RN Q++V  DEPR
Subjt:  MASSASSPFTKLHFPHSPLPQPPATHHSNSCAQFLCKSLFFCFFLLLLPLFPSEAPDFVDQTLFTKFWELFHLMLVGIAVSYGLFSTRNNQMNV--DEPR

Query:  YSSFENPQSYLSKMLYVASIFDDVDDFSVSDERKLSEVLYIQPNLGSASDLNAQSRQQEKLRYSIPKKRYENSYEFADTDNVAHACKSRYTRGGSVVVVP
        +S+FENPQSYLSKML+VASIF+DVDDFSVSDERKLSEVLYIQPNLGS    NA SRQQE   YSIPKKRYENS EF DT++V HACKSRYTRGGSVVVV 
Subjt:  YSSFENPQSYLSKMLYVASIFDDVDDFSVSDERKLSEVLYIQPNLGSASDLNAQSRQQEKLRYSIPKKRYENSYEFADTDNVAHACKSRYTRGGSVVVVP

Query:  ETNRSS------SGGIVNYKPLGLPVRSLKSSLTESDDVEFDCGDESCLSSKSSPKSSENNCEGNSEFGDNCCVNLEEKFDETAIASMSSFQLREKFGKK
        ETNRS+      SG IVNYKPLGLPVRSL+S+LTE DDVEFDCGDESCLSSKSS K+SE+NCE  SEFGDNCCVNLEEKFDET IA MS FQLRE FGK 
Subjt:  ETNRSS------SGGIVNYKPLGLPVRSLKSSLTESDDVEFDCGDESCLSSKSSPKSSENNCEGNSEFGDNCCVNLEEKFDETAIASMSSFQLREKFGKK

Query:  VIRERGFGNAVLRPSHFRPPSIDETQFESLKKSGSLHSNLSQSSQTSSLSSSLSSTTRKHHKMSSLSNISYKSLHSRQYSMSSLSENSRGSSEDPLIEQE
        ++RERG  NAVLRPSHFRP SIDETQFESLKKS SLHSNLSQSSQTSSLS SLSSTTRKH KMSSL NISYKS HSRQYS+SSLSENSRGSSEDPLIE E
Subjt:  VIRERGFGNAVLRPSHFRPPSIDETQFESLKKSGSLHSNLSQSSQTSSLSSSLSSTTRKHHKMSSLSNISYKSLHSRQYSMSSLSENSRGSSEDPLIEQE

Query:  NSSECNESVVSSPRSDMNFRSIPKALSQGKSIRRIQANAAAIEDIKAQEMHRKQVKHDDIIGNKFEEGGTSPYIREDGTGHGWPDVANPNASNMSRFP-T
        NSSECNES++SSPR D NF  IPKALS+GKS+R I+AN +AIE++KAQEM+R QV+HDD +GNKF EGG SPY+REDGTGHGWP + +PNA   +R P T
Subjt:  NSSECNESVVSSPRSDMNFRSIPKALSQGKSIRRIQANAAAIEDIKAQEMHRKQVKHDDIIGNKFEEGGTSPYIREDGTGHGWPDVANPNASNMSRFP-T

Query:  TTFLGIKEQKEETESLVADD--SKDDSEGEDESFFASSDEEAASSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGGGWGSFSSTSSSY
        TTF GI+EQKE+ ES + DD   +D+SE ED SFF SSDEEAASSMAG+SESGA+EVDKKAGEFIAKFREQIQLQRMASV+KRLR  GGWGSFSSTSSSY
Subjt:  TTFLGIKEQKEETESLVADD--SKDDSEGEDESFFASSDEEAASSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGGGWGSFSSTSSSY

Query:  FS
        FS
Subjt:  FS

A0A6J1H4M0 uncharacterized protein LOC1114599983.6e-30393.96Show/hide
Query:  MASSASSPFTKLHFPHSPLPQPPATHHSNSCAQFLCKSLFFCFFLLLLPLFPSEAPDFVDQTLFTKFWELFHLMLVGIAVSYGLFSTRNNQMNVDEPRYS
        MASSASSPFTKLHFPHSPLPQPPA    NSCAQFLCKS+FFCFFLLLLPLFPSEAPDFVDQTLFTKFWELFHLM VGIAVSYGLFSTRNNQMNVDEPRYS
Subjt:  MASSASSPFTKLHFPHSPLPQPPATHHSNSCAQFLCKSLFFCFFLLLLPLFPSEAPDFVDQTLFTKFWELFHLMLVGIAVSYGLFSTRNNQMNVDEPRYS

Query:  SFENPQSYLSKMLYVASIFDDVDDFSVSDERKLSEVLYIQPNLGSASDLNAQSRQQEKLRYSIPKKRYENSYEFADTDNVAHACKSRYTRGGSVVVVPET
        SFENPQSYLSKMLYVASIFDDVDDF VSDERK+SEVLYIQP LGSASDLNAQSR QEKLRYS+PKKRYENSYEFADTDNVAHACKSRYTRGGSVVVVPET
Subjt:  SFENPQSYLSKMLYVASIFDDVDDFSVSDERKLSEVLYIQPNLGSASDLNAQSRQQEKLRYSIPKKRYENSYEFADTDNVAHACKSRYTRGGSVVVVPET

Query:  NRSSSGGIVNYKPLGLPVRSLKSSLTESDDVEFDCGDESCLSSKSSPKSSENNCEGNSEFGDNCCVNLEEKFDETAIASMSSFQLREKFGKKVIRERGFG
        NRSSSGGIVNYKPLGLPVRSL+SSLTESDDVEFDCGDESCLSSKSSPKSSENNCEGNSEFGDNCCVNLEEKFDETAIASMSSFQLREKFGKKVIRERGFG
Subjt:  NRSSSGGIVNYKPLGLPVRSLKSSLTESDDVEFDCGDESCLSSKSSPKSSENNCEGNSEFGDNCCVNLEEKFDETAIASMSSFQLREKFGKKVIRERGFG

Query:  NAVLRPSHFRPPSIDETQFESLKKSGSLHSNLSQSSQTSSLSSSLSSTTRKHHKMSSLSNISYKSLHSRQYSMSSLSENSRGSSEDPLIEQENSSECNES
        NAVLRPSHFRPPSIDETQFESLKKSGSLHS LSQSSQTSSLSS LSSTTRK  KMSSLSNISYKSLHSRQYS SSLSENSRGSSEDPLIEQENSSECNES
Subjt:  NAVLRPSHFRPPSIDETQFESLKKSGSLHSNLSQSSQTSSLSSSLSSTTRKHHKMSSLSNISYKSLHSRQYSMSSLSENSRGSSEDPLIEQENSSECNES

Query:  VVSSPRSDMNFRSIPKALSQGKSIRRIQANAAAIEDIKAQEMHRKQVKHDDIIGNKFEEGGTS-PYIREDGTGHGWPDVANPNASNMSRFPTTTFLGIKE
        VVSSPRSD NF SIPKALSQGKS+RRI+ANAAAIED+KAQEMHRKQVKHDDIIGNKFEEGG S PY+REDGTG GWPDV NPNA NM+RFP TTFLGIKE
Subjt:  VVSSPRSDMNFRSIPKALSQGKSIRRIQANAAAIEDIKAQEMHRKQVKHDDIIGNKFEEGGTS-PYIREDGTGHGWPDVANPNASNMSRFPTTTFLGIKE

Query:  QKEETESLVADDSKDDSEGEDESFFASSDEEAASSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLR----GGGGWGSFSSTSSSYFS
        QKEETESLVADDSKDDSEGEDES FASSDEEA SSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLR    GGGGWGSFSSTSSSYFS
Subjt:  QKEETESLVADDSKDDSEGEDESFFASSDEEAASSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLR----GGGGWGSFSSTSSSYFS

A0A6J1KUS4 uncharacterized protein LOC1114989000.0e+00100Show/hide
Query:  MASSASSPFTKLHFPHSPLPQPPATHHSNSCAQFLCKSLFFCFFLLLLPLFPSEAPDFVDQTLFTKFWELFHLMLVGIAVSYGLFSTRNNQMNVDEPRYS
        MASSASSPFTKLHFPHSPLPQPPATHHSNSCAQFLCKSLFFCFFLLLLPLFPSEAPDFVDQTLFTKFWELFHLMLVGIAVSYGLFSTRNNQMNVDEPRYS
Subjt:  MASSASSPFTKLHFPHSPLPQPPATHHSNSCAQFLCKSLFFCFFLLLLPLFPSEAPDFVDQTLFTKFWELFHLMLVGIAVSYGLFSTRNNQMNVDEPRYS

Query:  SFENPQSYLSKMLYVASIFDDVDDFSVSDERKLSEVLYIQPNLGSASDLNAQSRQQEKLRYSIPKKRYENSYEFADTDNVAHACKSRYTRGGSVVVVPET
        SFENPQSYLSKMLYVASIFDDVDDFSVSDERKLSEVLYIQPNLGSASDLNAQSRQQEKLRYSIPKKRYENSYEFADTDNVAHACKSRYTRGGSVVVVPET
Subjt:  SFENPQSYLSKMLYVASIFDDVDDFSVSDERKLSEVLYIQPNLGSASDLNAQSRQQEKLRYSIPKKRYENSYEFADTDNVAHACKSRYTRGGSVVVVPET

Query:  NRSSSGGIVNYKPLGLPVRSLKSSLTESDDVEFDCGDESCLSSKSSPKSSENNCEGNSEFGDNCCVNLEEKFDETAIASMSSFQLREKFGKKVIRERGFG
        NRSSSGGIVNYKPLGLPVRSLKSSLTESDDVEFDCGDESCLSSKSSPKSSENNCEGNSEFGDNCCVNLEEKFDETAIASMSSFQLREKFGKKVIRERGFG
Subjt:  NRSSSGGIVNYKPLGLPVRSLKSSLTESDDVEFDCGDESCLSSKSSPKSSENNCEGNSEFGDNCCVNLEEKFDETAIASMSSFQLREKFGKKVIRERGFG

Query:  NAVLRPSHFRPPSIDETQFESLKKSGSLHSNLSQSSQTSSLSSSLSSTTRKHHKMSSLSNISYKSLHSRQYSMSSLSENSRGSSEDPLIEQENSSECNES
        NAVLRPSHFRPPSIDETQFESLKKSGSLHSNLSQSSQTSSLSSSLSSTTRKHHKMSSLSNISYKSLHSRQYSMSSLSENSRGSSEDPLIEQENSSECNES
Subjt:  NAVLRPSHFRPPSIDETQFESLKKSGSLHSNLSQSSQTSSLSSSLSSTTRKHHKMSSLSNISYKSLHSRQYSMSSLSENSRGSSEDPLIEQENSSECNES

Query:  VVSSPRSDMNFRSIPKALSQGKSIRRIQANAAAIEDIKAQEMHRKQVKHDDIIGNKFEEGGTSPYIREDGTGHGWPDVANPNASNMSRFPTTTFLGIKEQ
        VVSSPRSDMNFRSIPKALSQGKSIRRIQANAAAIEDIKAQEMHRKQVKHDDIIGNKFEEGGTSPYIREDGTGHGWPDVANPNASNMSRFPTTTFLGIKEQ
Subjt:  VVSSPRSDMNFRSIPKALSQGKSIRRIQANAAAIEDIKAQEMHRKQVKHDDIIGNKFEEGGTSPYIREDGTGHGWPDVANPNASNMSRFPTTTFLGIKEQ

Query:  KEETESLVADDSKDDSEGEDESFFASSDEEAASSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGGGWGSFSSTSSSYFS
        KEETESLVADDSKDDSEGEDESFFASSDEEAASSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGGGWGSFSSTSSSYFS
Subjt:  KEETESLVADDSKDDSEGEDESFFASSDEEAASSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGGGWGSFSSTSSSYFS

E5GCN2 Uncharacterized protein3.2e-25179.4Show/hide
Query:  MASSASSPFTKLHFPHSPLPQPPATHHSNSCAQFLCKSLFFCFFLLLLPLFPSEAPDFVDQTLFTKFWELFHLMLVGIAVSYGLFSTRNNQMNV--DEPR
        MASS S+PFTK HFPHSPLP    T HSNSC  FLCKSLFFC FLLLLPLFPSEAP+FV+QTL TKFWELFHLM VGIAVSYGLFS RN Q++V  DEPR
Subjt:  MASSASSPFTKLHFPHSPLPQPPATHHSNSCAQFLCKSLFFCFFLLLLPLFPSEAPDFVDQTLFTKFWELFHLMLVGIAVSYGLFSTRNNQMNV--DEPR

Query:  YSSFENPQSYLSKMLYVASIFDDVDDFSVSDERKLSEVLYIQPNLGSASDLNAQSRQQEKLRYSIPKKRYENSYEFADTDNVAHACKSRYTRGGSVVVVP
        +S+FENPQSYLSKML+VASIF+DVDDFSVSDERKLSEVLYIQPNLGS    NA SRQQE   YSIPKKRYENS EF DT++V HACKSRYTRGGSVVVV 
Subjt:  YSSFENPQSYLSKMLYVASIFDDVDDFSVSDERKLSEVLYIQPNLGSASDLNAQSRQQEKLRYSIPKKRYENSYEFADTDNVAHACKSRYTRGGSVVVVP

Query:  ETNRSS------SGGIVNYKPLGLPVRSLKSSLTESDDVEFDCGDESCLSSKSSPKSSENNCEGNSEFGDNCCVNLEEKFDETAIASMSSFQLREKFGKK
        ETNRS+      SG IVNYKPLGLPVRSL+S+LTE DDVEFDCGDESCLSSKSS K+SE+NCE  SEFGDNCCVNLEEKFDET IA MS FQLRE FGK 
Subjt:  ETNRSS------SGGIVNYKPLGLPVRSLKSSLTESDDVEFDCGDESCLSSKSSPKSSENNCEGNSEFGDNCCVNLEEKFDETAIASMSSFQLREKFGKK

Query:  VIRERGFGNAVLRPSHFRPPSIDETQFESLKKSGSLHSNLSQSSQTSSLSSSLSSTTRKHHKMSSLSNISYKSLHSRQYSMSSLSENSRGSSEDPLIEQE
        ++RERG  NAVLRPSHFRP SIDETQFESLKKS SLHSNLSQSSQTSSLS SLSSTTRKH KMSSL NISYKS HSRQYS+SSLSENSRGSSEDPLIE E
Subjt:  VIRERGFGNAVLRPSHFRPPSIDETQFESLKKSGSLHSNLSQSSQTSSLSSSLSSTTRKHHKMSSLSNISYKSLHSRQYSMSSLSENSRGSSEDPLIEQE

Query:  NSSECNESVVSSPRSDMNFRSIPKALSQGKSIRRIQANAAAIEDIKAQEMHRKQVKHDDIIGNKFEEGGTSPYIREDGTGHGWPDVANPNASNMSRFP-T
        NSSECNES++SSPR D NF  IPKALS+GKS+R I+AN +AIE++KAQEM+R QV+HDD +GNKF EGG SPY+REDGTGHGWP + +PNA   +R P T
Subjt:  NSSECNESVVSSPRSDMNFRSIPKALSQGKSIRRIQANAAAIEDIKAQEMHRKQVKHDDIIGNKFEEGGTSPYIREDGTGHGWPDVANPNASNMSRFP-T

Query:  TTFLGIKEQKEETESLVADD--SKDDSEGEDESFFASSDEEAASSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGGGWGSFSSTSSSY
        TTF GI+EQKE+ ES + DD   +D+SE ED SFF SSDEEAASSMAG+SESGA+EVDKKAGEFIAKFREQIQLQRMASV+KRLR  GGWGSFSSTSSSY
Subjt:  TTFLGIKEQKEETESLVADD--SKDDSEGEDESFFASSDEEAASSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGGGWGSFSSTSSSY

Query:  FS
        FS
Subjt:  FS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G60380.1 FUNCTIONS IN: molecular_function unknown1.6e-3234.97Show/hide
Query:  SASSPFTKLHFPHSPLPQPPATHHSNSCAQFLCKSLFFCFFLLLLPLFPSEAPDFVDQTLFTKFWELFHLMLVGIAVSYGLFSTRNNQMNVDEPRYSSFE
        ++ +P+TK   P + +  P   + S     F CKS+ F  FLL LPLFPS+APDFV +T+ TKFWEL HL+ VGIAV+YGLFS RN +  VD       E
Subjt:  SASSPFTKLHFPHSPLPQPPATHHSNSCAQFLCKSLFFCFFLLLLPLFPSEAPDFVDQTLFTKFWELFHLMLVGIAVSYGLFSTRNNQMNVDEPRYSSFE

Query:  NPQSYLSKMLYVASIFD-DVDDFSVSDERKLSEVLYIQPNLGSASDLNAQSRQQEKLRYSIPKKRYENSYEFADTDNVAHACKSRYTRGGSVVVVPETNR
        +  SY+S++  V+S+FD + DD S        E + ++ +   ++  +   + +    + +     E S EF +T+ V  A  S+Y +G S VVV     
Subjt:  NPQSYLSKMLYVASIFD-DVDDFSVSDERKLSEVLYIQPNLGSASDLNAQSRQQEKLRYSIPKKRYENSYEFADTDNVAHACKSRYTRGGSVVVVPETNR

Query:  SSSGGIVNYKPLGLPVRSLKSSLTESDDVEFDCGDESCLSSKSSPKSSEN--NCEGNSEFGDNCCVNLEEKFDE--TAIASMSSFQLREKFGKKVIRERG
           G +V ++PLGLP+R L+SSL           D + L  KS   S +   N E  S   DN        FDE   A AS   +Q R +         G
Subjt:  SSSGGIVNYKPLGLPVRSLKSSLTESDDVEFDCGDESCLSSKSSPKSSEN--NCEGNSEFGDNCCVNLEEKFDE--TAIASMSSFQLREKFGKKVIRERG

Query:  FGNAVLRPSHFRPPSIDETQFESLKKSGSLHSNLSQSSQTSSLSSSLSSTTRKHHKMSSLSNISYKSLHSRQYSMSSLSENSRGSS
         G+    PS+F+P S+DET      KS S  S  S SSQTS  S       +  ++ S   ++S +SL+S    +  + E SR SS
Subjt:  FGNAVLRPSHFRPPSIDETQFESLKKSGSLHSNLSQSSQTSSLSSSLSSTTRKHHKMSSLSNISYKSLHSRQYSMSSLSENSRGSS

AT3G60380.1 FUNCTIONS IN: molecular_function unknown2.6e-0342.31Show/hide
Query:  KEETESLVADDSKDDSEGEDESFFASSDEEAASSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGGG
        K E E +  ++ +  +E + E  F   +EEAA     ++     EVD+KAGEFIAKFREQI+LQ++ S E+   GG G
Subjt:  KEETESLVADDSKDDSEGEDESFFASSDEEAASSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGGG

AT4G16790.1 hydroxyproline-rich glycoprotein family protein5.4e-0936.21Show/hide
Query:  AQFLCKSLFFCFFLLLLPLFPSEAPDFVDQTLFTKFWELFHLMLVGIAVSYGLFSTRN-------NQMNVDEPRYS-SFENPQSYLSKMLYVASIFD---
        ++F+ K+L       ++P+F S+ P+  +Q   T+  EL HL+ VGIAVSYGLFS RN          N D  +   S  N  SY+ K+L V+S+F+   
Subjt:  AQFLCKSLFFCFFLLLLPLFPSEAPDFVDQTLFTKFWELFHLMLVGIAVSYGLFSTRN-------NQMNVDEPRYS-SFENPQSYLSKMLYVASIFD---

Query:  ----DVDDFSVSDERK
            +  D S  D+RK
Subjt:  ----DVDDFSVSDERK

AT4G16790.1 hydroxyproline-rich glycoprotein family protein2.4e-0136.49Show/hide
Query:  EQKEETESLVADDSKDDSEGEDESFFASSDEEAASSMAGDSE-SGAFEVDKKAGEFIAKFREQIQLQRMASVEK
        +Q+    S   ++S++  +   E+      E+      G SE +   +VDKKA EFIAKFREQI+LQR+ S+++
Subjt:  EQKEETESLVADDSKDDSEGEDESFFASSDEEAASSMAGDSE-SGAFEVDKKAGEFIAKFREQIQLQRMASVEK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTCTTCAGCTTCTAGCCCGTTCACGAAGCTCCATTTCCCCCATTCTCCACTTCCACAACCACCAGCCACTCACCATTCCAACTCCTGCGCACAGTTTCTCTGTAA
ATCCCTCTTCTTCTGCTTTTTTCTCCTCCTCCTCCCTCTCTTCCCTTCCGAAGCTCCAGATTTCGTCGATCAGACTTTGTTCACCAAATTCTGGGAGCTTTTTCACCTCA
TGCTCGTCGGCATTGCTGTTTCCTATGGTCTCTTCAGTACAAGGAACAACCAGATGAATGTAGACGAACCTCGCTACTCCAGTTTTGAGAATCCGCAGTCTTATTTGTCT
AAGATGCTTTACGTCGCTTCAATTTTTGATGATGTTGACGATTTTAGTGTTTCTGATGAAAGGAAATTGAGTGAAGTTCTGTACATTCAGCCGAATCTTGGATCTGCGAG
TGATTTGAATGCGCAATCTCGCCAGCAGGAAAAACTCCGTTATTCAATACCGAAAAAAAGGTACGAAAATTCTTATGAATTTGCTGATACTGATAATGTCGCTCATGCTT
GTAAATCGAGATATACTCGTGGTGGATCTGTGGTGGTTGTGCCTGAAACAAATCGTAGTTCATCAGGAGGCATTGTAAATTATAAACCTCTAGGTTTGCCTGTTAGGAGT
CTGAAATCGAGTCTTACTGAATCCGATGATGTCGAATTCGATTGTGGTGATGAATCTTGTTTGAGTTCTAAAAGTTCACCCAAAAGCTCTGAGAATAATTGTGAAGGAAA
TAGTGAATTTGGTGATAATTGTTGTGTGAATTTGGAGGAGAAGTTTGATGAGACTGCAATTGCATCAATGTCCTCATTTCAATTGCGTGAGAAATTTGGAAAGAAGGTGA
TTAGAGAGAGAGGATTTGGGAATGCCGTTCTTCGCCCTTCCCATTTTAGACCTCCCTCCATTGATGAAACTCAATTTGAATCACTGAAAAAATCAGGATCTCTTCATTCT
AATCTATCTCAGTCATCACAAACTAGTTCCCTCTCTTCTTCATTGTCATCGACAACGAGAAAGCACCATAAAATGTCGTCACTCAGTAACATTTCTTACAAGTCGTTGCA
TTCTCGACAATACAGTATGAGTTCTCTGTCTGAAAACAGTAGAGGGAGCTCTGAAGACCCTCTGATTGAACAAGAAAACTCATCCGAGTGCAATGAATCCGTGGTGAGTT
CGCCACGTTCGGACATGAATTTCAGAAGTATTCCGAAAGCTTTATCCCAAGGAAAATCAATTCGAAGAATTCAAGCAAATGCAGCTGCCATAGAGGATATAAAAGCTCAA
GAGATGCATAGAAAGCAAGTTAAACATGATGACATTATAGGGAATAAGTTTGAAGAAGGTGGAACGTCACCATATATAAGAGAAGATGGAACGGGACACGGATGGCCTGA
TGTTGCTAACCCGAATGCTAGTAATATGAGTCGTTTTCCGACGACGACGTTCTTGGGGATTAAGGAGCAGAAGGAAGAGACTGAGAGTCTGGTGGCAGATGATAGTAAAG
ATGACTCTGAGGGGGAGGATGAAAGTTTTTTTGCAAGTTCAGATGAAGAAGCTGCTTCAAGTATGGCCGGAGATTCGGAGTCGGGGGCTTTCGAGGTCGACAAGAAGGCG
GGCGAGTTCATAGCCAAGTTCAGGGAGCAAATACAGCTTCAGAGGATGGCTTCAGTTGAAAAAAGATTGAGAGGAGGAGGAGGATGGGGGTCGTTCAGCAGCACAAGCAG
CAGCTATTTCAGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGTCTTCAGCTTCTAGCCCGTTCACGAAGCTCCATTTCCCCCATTCTCCACTTCCACAACCACCAGCCACTCACCATTCCAACTCCTGCGCACAGTTTCTCTGTAA
ATCCCTCTTCTTCTGCTTTTTTCTCCTCCTCCTCCCTCTCTTCCCTTCCGAAGCTCCAGATTTCGTCGATCAGACTTTGTTCACCAAATTCTGGGAGCTTTTTCACCTCA
TGCTCGTCGGCATTGCTGTTTCCTATGGTCTCTTCAGTACAAGGAACAACCAGATGAATGTAGACGAACCTCGCTACTCCAGTTTTGAGAATCCGCAGTCTTATTTGTCT
AAGATGCTTTACGTCGCTTCAATTTTTGATGATGTTGACGATTTTAGTGTTTCTGATGAAAGGAAATTGAGTGAAGTTCTGTACATTCAGCCGAATCTTGGATCTGCGAG
TGATTTGAATGCGCAATCTCGCCAGCAGGAAAAACTCCGTTATTCAATACCGAAAAAAAGGTACGAAAATTCTTATGAATTTGCTGATACTGATAATGTCGCTCATGCTT
GTAAATCGAGATATACTCGTGGTGGATCTGTGGTGGTTGTGCCTGAAACAAATCGTAGTTCATCAGGAGGCATTGTAAATTATAAACCTCTAGGTTTGCCTGTTAGGAGT
CTGAAATCGAGTCTTACTGAATCCGATGATGTCGAATTCGATTGTGGTGATGAATCTTGTTTGAGTTCTAAAAGTTCACCCAAAAGCTCTGAGAATAATTGTGAAGGAAA
TAGTGAATTTGGTGATAATTGTTGTGTGAATTTGGAGGAGAAGTTTGATGAGACTGCAATTGCATCAATGTCCTCATTTCAATTGCGTGAGAAATTTGGAAAGAAGGTGA
TTAGAGAGAGAGGATTTGGGAATGCCGTTCTTCGCCCTTCCCATTTTAGACCTCCCTCCATTGATGAAACTCAATTTGAATCACTGAAAAAATCAGGATCTCTTCATTCT
AATCTATCTCAGTCATCACAAACTAGTTCCCTCTCTTCTTCATTGTCATCGACAACGAGAAAGCACCATAAAATGTCGTCACTCAGTAACATTTCTTACAAGTCGTTGCA
TTCTCGACAATACAGTATGAGTTCTCTGTCTGAAAACAGTAGAGGGAGCTCTGAAGACCCTCTGATTGAACAAGAAAACTCATCCGAGTGCAATGAATCCGTGGTGAGTT
CGCCACGTTCGGACATGAATTTCAGAAGTATTCCGAAAGCTTTATCCCAAGGAAAATCAATTCGAAGAATTCAAGCAAATGCAGCTGCCATAGAGGATATAAAAGCTCAA
GAGATGCATAGAAAGCAAGTTAAACATGATGACATTATAGGGAATAAGTTTGAAGAAGGTGGAACGTCACCATATATAAGAGAAGATGGAACGGGACACGGATGGCCTGA
TGTTGCTAACCCGAATGCTAGTAATATGAGTCGTTTTCCGACGACGACGTTCTTGGGGATTAAGGAGCAGAAGGAAGAGACTGAGAGTCTGGTGGCAGATGATAGTAAAG
ATGACTCTGAGGGGGAGGATGAAAGTTTTTTTGCAAGTTCAGATGAAGAAGCTGCTTCAAGTATGGCCGGAGATTCGGAGTCGGGGGCTTTCGAGGTCGACAAGAAGGCG
GGCGAGTTCATAGCCAAGTTCAGGGAGCAAATACAGCTTCAGAGGATGGCTTCAGTTGAAAAAAGATTGAGAGGAGGAGGAGGATGGGGGTCGTTCAGCAGCACAAGCAG
CAGCTATTTCAGTTGA
Protein sequenceShow/hide protein sequence
MASSASSPFTKLHFPHSPLPQPPATHHSNSCAQFLCKSLFFCFFLLLLPLFPSEAPDFVDQTLFTKFWELFHLMLVGIAVSYGLFSTRNNQMNVDEPRYSSFENPQSYLS
KMLYVASIFDDVDDFSVSDERKLSEVLYIQPNLGSASDLNAQSRQQEKLRYSIPKKRYENSYEFADTDNVAHACKSRYTRGGSVVVVPETNRSSSGGIVNYKPLGLPVRS
LKSSLTESDDVEFDCGDESCLSSKSSPKSSENNCEGNSEFGDNCCVNLEEKFDETAIASMSSFQLREKFGKKVIRERGFGNAVLRPSHFRPPSIDETQFESLKKSGSLHS
NLSQSSQTSSLSSSLSSTTRKHHKMSSLSNISYKSLHSRQYSMSSLSENSRGSSEDPLIEQENSSECNESVVSSPRSDMNFRSIPKALSQGKSIRRIQANAAAIEDIKAQ
EMHRKQVKHDDIIGNKFEEGGTSPYIREDGTGHGWPDVANPNASNMSRFPTTTFLGIKEQKEETESLVADDSKDDSEGEDESFFASSDEEAASSMAGDSESGAFEVDKKA
GEFIAKFREQIQLQRMASVEKRLRGGGGWGSFSSTSSSYFS