; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi09G016780 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi09G016780
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionDUF761 domain-containing protein
Genome locationchr09:25214204..25215985
RNA-Seq ExpressionLsi09G016780
SyntenyLsi09G016780
Gene Ontology termsGO:0016020 - membrane (cellular component)
InterPro domainsIPR008480 - Protein of unknown function DUF761, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADN34231.1 hypothetical protein [Cucumis melo subsp. melo]3.4e-27987.5Show/hide
Query:  MASSTSSPFTKPHFPHSPLPSISTTHHSKSCTQFLCKSLFFCIFLLLLPLFPSEAPEFVNQTLLTKFWELFHLLFVGIAVSYGLFSRRNIQVSV--EEPR
        MASS S+PFTKPHFPHSPLP  STT HS SCT FLCKSLFFCIFLLLLPLFPSEAPEFVNQTLLTKFWELFHL+FVGIAVSYGLFSRRN+QVSV  +EPR
Subjt:  MASSTSSPFTKPHFPHSPLPSISTTHHSKSCTQFLCKSLFFCIFLLLLPLFPSEAPEFVNQTLLTKFWELFHLLFVGIAVSYGLFSRRNIQVSV--EEPR

Query:  FSNFENPQSYLSKMFQVASIFEDVDDLSVSDERKLSEVLYIQPNLGSVSDFNAQSHELEKYRYSIPKKRYENSHEFADTDNVGHVCKSRYTRGGSVVVVT
        FSNFENPQSYLSKM  VASIFEDVDD SVSDERKLSEVLYIQPNLGSV  FNA S + E + YSIPKKRYENS EF DT++VGH CKSRYTRGGSVVVV 
Subjt:  FSNFENPQSYLSKMFQVASIFEDVDDLSVSDERKLSEVLYIQPNLGSVSDFNAQSHELEKYRYSIPKKRYENSHEFADTDNVGHVCKSRYTRGGSVVVVT

Query:  ETNRS-SGEWLESGAIVNYKPLGLPVRSLRSSLTEPDDVEFDCNDESCLSSKSSSKSSENNCER-SEFGDNCCVNLEEKFDETVIASMSPFQLREKFGKK
        ETNRS SGEWLESGAIVNYKPLGLPVRSLRS+LTEPDDVEFDC DESCLSSKSSSK+SE+NCER SEFGDNCCVNLEEKFDETVIA MSPFQLRE FGK 
Subjt:  ETNRS-SGEWLESGAIVNYKPLGLPVRSLRSSLTEPDDVEFDCNDESCLSSKSSSKSSENNCER-SEFGDNCCVNLEEKFDETVIASMSPFQLREKFGKK

Query:  MMRERGVRNAVLRPSHFRPPSIDETQFESLKKSSSFHSNLSQSSQTSSLSSSLSSTTRKHRKMSSLGNISYKSLHSRQYSMSSLSENSRGSSEDPLIEPE
        MMRERGV+NAVLRPSHFRP SIDETQFESLKKS S HSNLSQSSQTSSLS SLSSTTRKHRKMSSLGNISYKS HSRQYS+SSLSENSRGSSEDPLIEPE
Subjt:  MMRERGVRNAVLRPSHFRPPSIDETQFESLKKSSSFHSNLSQSSQTSSLSSSLSSTTRKHRKMSSLGNISYKSLHSRQYSMSSLSENSRGSSEDPLIEPE

Query:  NSSECNESVVSSPRLDRNFASIPKALSRGKSVRTIRANAIAIEEMKAQEEMYRNQVEHDDNIGNKFEGGMSPYMREDGIGHGWSGVVNPNAGNSNRFSK-
        NSSECNES++SSPRLDRNFA IPKALSRGKSVRTIRAN  AIEEMKAQ EMYRNQVEHDDN+GNKFEGGMSPYMREDG GHGW G+ +PNAG SNR  K 
Subjt:  NSSECNESVVSSPRLDRNFASIPKALSRGKSVRTIRANAIAIEEMKAQEEMYRNQVEHDDNIGNKFEGGMSPYMREDGIGHGWSGVVNPNAGNSNRFSK-

Query:  MAFSGIEEQKEDTESQLTDDG--KDNSEREDESLFASSDEEAALSMAGDSESGAHEVDKKAGEFIAKFREQIQLQRMASVDKRLRGGWGSFSSTSSSYFS
          FSGIEEQKED ESQLTDD   +DNSERED S F SSDEEAA SMAG+SESGA+EVDKKAGEFIAKFREQIQLQRMASVDKRLRGGWGSFSSTSSSYFS
Subjt:  MAFSGIEEQKEDTESQLTDDG--KDNSEREDESLFASSDEEAALSMAGDSESGAHEVDKKAGEFIAKFREQIQLQRMASVDKRLRGGWGSFSSTSSSYFS

KAG6575261.1 hypothetical protein SDJN03_25900, partial [Cucurbita argyrosperma subsp. sororia]3.6e-25782.8Show/hide
Query:  MASSTSSPFTKPHFPHSPLPSISTTHHSKSCTQFLCKSLFFCIFLLLLPLFPSEAPEFVNQTLLTKFWELFHLLFVGIAVSYGLFSRRNIQVSVEEPRFS
        MASS SSPFTK HFPHSPLP       + SC QFLCKS+FFC FLLLLPLFPSEAP+FV+QTL TKFWELFHL+FVGIAVSYGLFS RN Q++V+EPR+S
Subjt:  MASSTSSPFTKPHFPHSPLPSISTTHHSKSCTQFLCKSLFFCIFLLLLPLFPSEAPEFVNQTLLTKFWELFHLLFVGIAVSYGLFSRRNIQVSVEEPRFS

Query:  NFENPQSYLSKMFQVASIFEDVDDLSVSDERKLSEVLYIQPNLGSVSDFNAQSHELEKYRYSIPKKRYENSHEFADTDNVGHVCKSRYTRGGSVVVVTET
        +FENPQSYLSKM  VASIF+DVDD  VSDERK+SEVLYIQP LGS SD NAQS   EK RYS+PKKRYENS+EFADTDNV H CKSRYTRGGSVVVV ET
Subjt:  NFENPQSYLSKMFQVASIFEDVDDLSVSDERKLSEVLYIQPNLGSVSDFNAQSHELEKYRYSIPKKRYENSHEFADTDNVGHVCKSRYTRGGSVVVVTET

Query:  NRSSGEWLESGAIVNYKPLGLPVRSLRSSLTEPDDVEFDCNDESCLSSKSSSKSSENNCE-RSEFGDNCCVNLEEKFDETVIASMSPFQLREKFGKKMMR
        NRSS     SG IVNYKPLGLPVRSLRSSLTE DDVEFDC DESCLSSKSS KSSENNCE  SEFGDNCCVNLEEKFDET IASMS FQLREKFGKK++R
Subjt:  NRSSGEWLESGAIVNYKPLGLPVRSLRSSLTEPDDVEFDCNDESCLSSKSSSKSSENNCE-RSEFGDNCCVNLEEKFDETVIASMSPFQLREKFGKKMMR

Query:  ERGVRNAVLRPSHFRPPSIDETQFESLKKSSSFHSNLSQSSQTSSLSSSLSSTTRKHRKMSSLGNISYKSLHSRQYSMSSLSENSRGSSEDPLIEPENSS
        ERG  NAVLRPSHFRPPSIDETQFESL+KS S HS+LSQSSQTSSLSS LSSTTRKH KMSSL NISYKSLHSRQYSMSSLSENSRGSSEDPLIE ENSS
Subjt:  ERGVRNAVLRPSHFRPPSIDETQFESLKKSSSFHSNLSQSSQTSSLSSSLSSTTRKHRKMSSLGNISYKSLHSRQYSMSSLSENSRGSSEDPLIEPENSS

Query:  ECNESVVSSPRLDRNFASIPKALSRGKSVRTIRANAIAIEEMKAQEEMYRNQVEHDDNIGNKF-EGGMS-PYMREDGIGHGWSGVVNPNAGNSNRFSKMA
        ECNESVVSSPR DRNFASIPKALS+GKSVR IRANA AIE+MKAQ EM+R QV+HDD IGNKF EGGMS PYMREDG GHGW  VVNPNAGN NRF K  
Subjt:  ECNESVVSSPRLDRNFASIPKALSRGKSVRTIRANAIAIEEMKAQEEMYRNQVEHDDNIGNKF-EGGMS-PYMREDGIGHGWSGVVNPNAGNSNRFSKMA

Query:  FSGIEEQKEDTESQLTDDGKDNSEREDESLFASSDEEAALSMAGDSESGAHEVDKKAGEFIAKFREQIQLQRMASVDKRLR---GGWGSFSSTSSSYFS
        F GI+EQKE+TES + DD KD SE EDESLFASSDEEA  SMAGDSESGA EVDKKAGEFIAKFREQIQLQRMASV+KRLR   GGWGSFSSTSSSYFS
Subjt:  FSGIEEQKEDTESQLTDDGKDNSEREDESLFASSDEEAALSMAGDSESGAHEVDKKAGEFIAKFREQIQLQRMASVDKRLR---GGWGSFSSTSSSYFS

KAG7013816.1 hypothetical protein SDJN02_23985, partial [Cucurbita argyrosperma subsp. argyrosperma]2.3e-25682.64Show/hide
Query:  MASSTSSPFTKPHFPHSPLPSISTTHHSKSCTQFLCKSLFFCIFLLLLPLFPSEAPEFVNQTLLTKFWELFHLLFVGIAVSYGLFSRRNIQVSVEEPRFS
        MASS SSPFTK HFPHSPLP       + SC QFLCKS+FFC FLLLLPLFPSEAP+FV+QTL TKFWELFHL+FVGIAVSYGLFS RN Q++V+EPR+S
Subjt:  MASSTSSPFTKPHFPHSPLPSISTTHHSKSCTQFLCKSLFFCIFLLLLPLFPSEAPEFVNQTLLTKFWELFHLLFVGIAVSYGLFSRRNIQVSVEEPRFS

Query:  NFENPQSYLSKMFQVASIFEDVDDLSVSDERKLSEVLYIQPNLGSVSDFNAQSHELEKYRYSIPKKRYENSHEFADTDNVGHVCKSRYTRGGSVVVVTET
        +FENPQSYLSKM  VASIF+DVDD  VSDERK+SEVLYIQP LGS SD NAQS   EK RYS+PKKRYENS+EFADTDNV H CKSRYTRGGSVVVV ET
Subjt:  NFENPQSYLSKMFQVASIFEDVDDLSVSDERKLSEVLYIQPNLGSVSDFNAQSHELEKYRYSIPKKRYENSHEFADTDNVGHVCKSRYTRGGSVVVVTET

Query:  NRSSGEWLESGAIVNYKPLGLPVRSLRSSLTEPDDVEFDCNDESCLSSKSSSKSSENNCE-RSEFGDNCCVNLEEKFDETVIASMSPFQLREKFGKKMMR
        NRSS     SG IVNYKPLGLPVRSLRSSLTE DDVEFDC DESCLSSKSS KSSENNCE  SEFGDNCCVNLEEKFDET IASMS FQLREKFGKK++R
Subjt:  NRSSGEWLESGAIVNYKPLGLPVRSLRSSLTEPDDVEFDCNDESCLSSKSSSKSSENNCE-RSEFGDNCCVNLEEKFDETVIASMSPFQLREKFGKKMMR

Query:  ERGVRNAVLRPSHFRPPSIDETQFESLKKSSSFHSNLSQSSQTSSLSSSLSSTTRKHRKMSSLGNISYKSLHSRQYSMSSLSENSRGSSEDPLIEPENSS
        ERG  NAVLRPSHFRPPSIDETQFESL+KS S HS+LSQSSQTSSLSS LSSTTRKH KMSSL NISYKSLHSRQYSMSSLSENSRGSSEDPLIE ENSS
Subjt:  ERGVRNAVLRPSHFRPPSIDETQFESLKKSSSFHSNLSQSSQTSSLSSSLSSTTRKHRKMSSLGNISYKSLHSRQYSMSSLSENSRGSSEDPLIEPENSS

Query:  ECNESVVSSPRLDRNFASIPKALSRGKSVRTIRANAIAIEEMKAQEEMYRNQVEHDDNIGNKF-EGGMS-PYMREDGIGHGWSGVVNPNAGNSNRFSKMA
        ECNESVVSSPR DRNFASIPKALS+GKSVR IRANA AIE+MKAQ EM+R QV+HDD IGNKF EGGMS PYMREDG GHGW  VVNPNAGN NRF K  
Subjt:  ECNESVVSSPRLDRNFASIPKALSRGKSVRTIRANAIAIEEMKAQEEMYRNQVEHDDNIGNKF-EGGMS-PYMREDGIGHGWSGVVNPNAGNSNRFSKMA

Query:  FSGIEEQKEDTESQLTDDGKDNSEREDESLFASSDEEAALSMAGDSESGAHEVDKKAGEFIAKFREQIQLQRMASVDKRLR---GGWGSFSSTSSSYFS
        F GI+EQKE+TES + DD KD SE EDES FASSDEEA  SMAGDSESGA EVDKKAGEFIAKFREQIQLQRMASV+KRLR   GGWGSFSSTSSSYFS
Subjt:  FSGIEEQKEDTESQLTDDGKDNSEREDESLFASSDEEAALSMAGDSESGAHEVDKKAGEFIAKFREQIQLQRMASVDKRLR---GGWGSFSSTSSSYFS

XP_004140631.1 uncharacterized protein LOC101220435 [Cucumis sativus]4.0e-28087.02Show/hide
Query:  MASSTSSPFTKPHFPHSPLPSISTTHHSKSCTQFLCKSLFFCIFLLLLPLFPSEAPEFVNQTLLTKFWELFHLLFVGIAVSYGLFSRRNIQVSV--EEPR
        MA S S+PFTKPHFPHSPLP  STT HS SCTQF+CKSLFFCIFLLLLPLFPSEAPEFVNQT LTKFWELFHL+F+GIAVSYGLFSRRN+QVSV  +EPR
Subjt:  MASSTSSPFTKPHFPHSPLPSISTTHHSKSCTQFLCKSLFFCIFLLLLPLFPSEAPEFVNQTLLTKFWELFHLLFVGIAVSYGLFSRRNIQVSV--EEPR

Query:  FSNFENPQSYLSKMFQVASIFEDVDDLSVSDERKLSEVLYIQPNLGSVSDFNAQSHELEKYRYSIPKKRYENSHEFADTDNVGHVCKSRYTRGGSVVVVT
        FSNFENPQSYLSKMF VASIFEDVDD SVSDERKLSEVLYIQPNLGSVS  NA S + E + YSIPKKRYENS EFA+TDNVGH CKSRYTRGGSVVVV 
Subjt:  FSNFENPQSYLSKMFQVASIFEDVDDLSVSDERKLSEVLYIQPNLGSVSDFNAQSHELEKYRYSIPKKRYENSHEFADTDNVGHVCKSRYTRGGSVVVVT

Query:  ETNRS-SGEWLESGAIVNYKPLGLPVRSLRSSLTEPDDVEFDCNDESCLSSKSSSKSSENNCER-SEFGDNCCVNLEEKFDETVIASMSPFQLREKFGKK
        ETNRS SGEWLESGAIVNYKPLGLPVRSL+SSLTEPDDVEFDC DESCLSSKSSSK+SE+NCER SEFGDNCCVNLEEKFDETVIASMSPFQLREKF K 
Subjt:  ETNRS-SGEWLESGAIVNYKPLGLPVRSLRSSLTEPDDVEFDCNDESCLSSKSSSKSSENNCER-SEFGDNCCVNLEEKFDETVIASMSPFQLREKFGKK

Query:  MMRERGVRNAVLRPSHFRPPSIDETQFESLKKSSSFHSNLSQSSQTSSLSSSLSSTTRKHRKMSSLGNISYKSLHSRQYSMSSLSENSRGSSEDPLIEPE
        MMRER V+NAVLRPSHFRP SIDETQFESLKKS+S HSNLSQSSQTSSLSS LSS TRKHRKMSSLGNISYKS HSRQYS+SSLSENSRGSSEDPLI+PE
Subjt:  MMRERGVRNAVLRPSHFRPPSIDETQFESLKKSSSFHSNLSQSSQTSSLSSSLSSTTRKHRKMSSLGNISYKSLHSRQYSMSSLSENSRGSSEDPLIEPE

Query:  NSSECNESVVSSPRLDRNFASIPKALSRGKSVRTIRANAIAIEEMKAQEEMYRNQVEHDDNIGNKFEGGMSPYMREDGIGHGWSGVVNPNAGNSNRFSK-
        NSSECNESVVSSPRLDRNFA+ PKALSRGKSVRT+RA+  AIEEMKAQ EMYRNQVEHDDN+ NKFEGGMSPYMRED  GHGW G+ N NA  SNR+SK 
Subjt:  NSSECNESVVSSPRLDRNFASIPKALSRGKSVRTIRANAIAIEEMKAQEEMYRNQVEHDDNIGNKFEGGMSPYMREDGIGHGWSGVVNPNAGNSNRFSK-

Query:  ---MAFSGIEEQKEDTESQLTDDGKDNSEREDESLFASSDEEAALSMAGDSESGAHEVDKKAGEFIAKFREQIQLQRMASVDKRLRGGWGSFSSTSSSYF
             FSGIEEQKEDTESQ+TDDGKDNSERED+S F SSDEEAALSM GDSESGAHEVDKKAGEFIAKFREQIQLQRMASVDKRLRGGWGSFSST+SSYF
Subjt:  ---MAFSGIEEQKEDTESQLTDDGKDNSEREDESLFASSDEEAALSMAGDSESGAHEVDKKAGEFIAKFREQIQLQRMASVDKRLRGGWGSFSSTSSSYF

Query:  S
        S
Subjt:  S

XP_023006022.1 uncharacterized protein LOC111498900 [Cucurbita maxima]4.6e-26082.91Show/hide
Query:  MASSTSSPFTKPHFPHSPLPSISTTHHSKSCTQFLCKSLFFCIFLLLLPLFPSEAPEFVNQTLLTKFWELFHLLFVGIAVSYGLFSRRNIQVSVEEPRFS
        MASS SSPFTK HFPHSPLP    THHS SC QFLCKSLFFC FLLLLPLFPSEAP+FV+QTL TKFWELFHL+ VGIAVSYGLFS RN Q++V+EPR+S
Subjt:  MASSTSSPFTKPHFPHSPLPSISTTHHSKSCTQFLCKSLFFCIFLLLLPLFPSEAPEFVNQTLLTKFWELFHLLFVGIAVSYGLFSRRNIQVSVEEPRFS

Query:  NFENPQSYLSKMFQVASIFEDVDDLSVSDERKLSEVLYIQPNLGSVSDFNAQSHELEKYRYSIPKKRYENSHEFADTDNVGHVCKSRYTRGGSVVVVTET
        +FENPQSYLSKM  VASIF+DVDD SVSDERKLSEVLYIQPNLGS SD NAQS + EK RYSIPKKRYENS+EFADTDNV H CKSRYTRGGSVVVV ET
Subjt:  NFENPQSYLSKMFQVASIFEDVDDLSVSDERKLSEVLYIQPNLGSVSDFNAQSHELEKYRYSIPKKRYENSHEFADTDNVGHVCKSRYTRGGSVVVVTET

Query:  NRSSGEWLESGAIVNYKPLGLPVRSLRSSLTEPDDVEFDCNDESCLSSKSSSKSSENNCE-RSEFGDNCCVNLEEKFDETVIASMSPFQLREKFGKKMMR
        NRSS     SG IVNYKPLGLPVRSL+SSLTE DDVEFDC DESCLSSKSS KSSENNCE  SEFGDNCCVNLEEKFDET IASMS FQLREKFGKK++R
Subjt:  NRSSGEWLESGAIVNYKPLGLPVRSLRSSLTEPDDVEFDCNDESCLSSKSSSKSSENNCE-RSEFGDNCCVNLEEKFDETVIASMSPFQLREKFGKKMMR

Query:  ERGVRNAVLRPSHFRPPSIDETQFESLKKSSSFHSNLSQSSQTSSLSSSLSSTTRKHRKMSSLGNISYKSLHSRQYSMSSLSENSRGSSEDPLIEPENSS
        ERG  NAVLRPSHFRPPSIDETQFESLKKS S HSNLSQSSQTSSLSSSLSSTTRKH KMSSL NISYKSLHSRQYSMSSLSENSRGSSEDPLIE ENSS
Subjt:  ERGVRNAVLRPSHFRPPSIDETQFESLKKSSSFHSNLSQSSQTSSLSSSLSSTTRKHRKMSSLGNISYKSLHSRQYSMSSLSENSRGSSEDPLIEPENSS

Query:  ECNESVVSSPRLDRNFASIPKALSRGKSVRTIRANAIAIEEMKAQEEMYRNQVEHDDNIGNKF-EGGMSPYMREDGIGHGWSGVVNPNAGNSNRFSKMAF
        ECNESVVSSPR D NF SIPKALS+GKS+R I+ANA AIE++KAQ EM+R QV+HDD IGNKF EGG SPY+REDG GHGW  V NPNA N +RF    F
Subjt:  ECNESVVSSPRLDRNFASIPKALSRGKSVRTIRANAIAIEEMKAQEEMYRNQVEHDDNIGNKF-EGGMSPYMREDGIGHGWSGVVNPNAGNSNRFSKMAF

Query:  SGIEEQKEDTESQLTDDGKDNSEREDESLFASSDEEAALSMAGDSESGAHEVDKKAGEFIAKFREQIQLQRMASVDKRLR--GGWGSFSSTSSSYFS
         GI+EQKE+TES + DD KD+SE EDES FASSDEEAA SMAGDSESGA EVDKKAGEFIAKFREQIQLQRMASV+KRLR  GGWGSFSSTSSSYFS
Subjt:  SGIEEQKEDTESQLTDDGKDNSEREDESLFASSDEEAALSMAGDSESGAHEVDKKAGEFIAKFREQIQLQRMASVDKRLR--GGWGSFSSTSSSYFS

TrEMBL top hitse value%identityAlignment
A0A0A0K9X1 Uncharacterized protein1.9e-28087.02Show/hide
Query:  MASSTSSPFTKPHFPHSPLPSISTTHHSKSCTQFLCKSLFFCIFLLLLPLFPSEAPEFVNQTLLTKFWELFHLLFVGIAVSYGLFSRRNIQVSV--EEPR
        MA S S+PFTKPHFPHSPLP  STT HS SCTQF+CKSLFFCIFLLLLPLFPSEAPEFVNQT LTKFWELFHL+F+GIAVSYGLFSRRN+QVSV  +EPR
Subjt:  MASSTSSPFTKPHFPHSPLPSISTTHHSKSCTQFLCKSLFFCIFLLLLPLFPSEAPEFVNQTLLTKFWELFHLLFVGIAVSYGLFSRRNIQVSV--EEPR

Query:  FSNFENPQSYLSKMFQVASIFEDVDDLSVSDERKLSEVLYIQPNLGSVSDFNAQSHELEKYRYSIPKKRYENSHEFADTDNVGHVCKSRYTRGGSVVVVT
        FSNFENPQSYLSKMF VASIFEDVDD SVSDERKLSEVLYIQPNLGSVS  NA S + E + YSIPKKRYENS EFA+TDNVGH CKSRYTRGGSVVVV 
Subjt:  FSNFENPQSYLSKMFQVASIFEDVDDLSVSDERKLSEVLYIQPNLGSVSDFNAQSHELEKYRYSIPKKRYENSHEFADTDNVGHVCKSRYTRGGSVVVVT

Query:  ETNRS-SGEWLESGAIVNYKPLGLPVRSLRSSLTEPDDVEFDCNDESCLSSKSSSKSSENNCER-SEFGDNCCVNLEEKFDETVIASMSPFQLREKFGKK
        ETNRS SGEWLESGAIVNYKPLGLPVRSL+SSLTEPDDVEFDC DESCLSSKSSSK+SE+NCER SEFGDNCCVNLEEKFDETVIASMSPFQLREKF K 
Subjt:  ETNRS-SGEWLESGAIVNYKPLGLPVRSLRSSLTEPDDVEFDCNDESCLSSKSSSKSSENNCER-SEFGDNCCVNLEEKFDETVIASMSPFQLREKFGKK

Query:  MMRERGVRNAVLRPSHFRPPSIDETQFESLKKSSSFHSNLSQSSQTSSLSSSLSSTTRKHRKMSSLGNISYKSLHSRQYSMSSLSENSRGSSEDPLIEPE
        MMRER V+NAVLRPSHFRP SIDETQFESLKKS+S HSNLSQSSQTSSLSS LSS TRKHRKMSSLGNISYKS HSRQYS+SSLSENSRGSSEDPLI+PE
Subjt:  MMRERGVRNAVLRPSHFRPPSIDETQFESLKKSSSFHSNLSQSSQTSSLSSSLSSTTRKHRKMSSLGNISYKSLHSRQYSMSSLSENSRGSSEDPLIEPE

Query:  NSSECNESVVSSPRLDRNFASIPKALSRGKSVRTIRANAIAIEEMKAQEEMYRNQVEHDDNIGNKFEGGMSPYMREDGIGHGWSGVVNPNAGNSNRFSK-
        NSSECNESVVSSPRLDRNFA+ PKALSRGKSVRT+RA+  AIEEMKAQ EMYRNQVEHDDN+ NKFEGGMSPYMRED  GHGW G+ N NA  SNR+SK 
Subjt:  NSSECNESVVSSPRLDRNFASIPKALSRGKSVRTIRANAIAIEEMKAQEEMYRNQVEHDDNIGNKFEGGMSPYMREDGIGHGWSGVVNPNAGNSNRFSK-

Query:  ---MAFSGIEEQKEDTESQLTDDGKDNSEREDESLFASSDEEAALSMAGDSESGAHEVDKKAGEFIAKFREQIQLQRMASVDKRLRGGWGSFSSTSSSYF
             FSGIEEQKEDTESQ+TDDGKDNSERED+S F SSDEEAALSM GDSESGAHEVDKKAGEFIAKFREQIQLQRMASVDKRLRGGWGSFSST+SSYF
Subjt:  ---MAFSGIEEQKEDTESQLTDDGKDNSEREDESLFASSDEEAALSMAGDSESGAHEVDKKAGEFIAKFREQIQLQRMASVDKRLRGGWGSFSSTSSSYF

Query:  S
        S
Subjt:  S

A0A5D3DMA5 DUF761 domain-containing protein1.6e-27987.5Show/hide
Query:  MASSTSSPFTKPHFPHSPLPSISTTHHSKSCTQFLCKSLFFCIFLLLLPLFPSEAPEFVNQTLLTKFWELFHLLFVGIAVSYGLFSRRNIQVSV--EEPR
        MASS S+PFTKPHFPHSPLP  STT HS SCT FLCKSLFFCIFLLLLPLFPSEAPEFVNQTLLTKFWELFHL+FVGIAVSYGLFSRRN+QVSV  +EPR
Subjt:  MASSTSSPFTKPHFPHSPLPSISTTHHSKSCTQFLCKSLFFCIFLLLLPLFPSEAPEFVNQTLLTKFWELFHLLFVGIAVSYGLFSRRNIQVSV--EEPR

Query:  FSNFENPQSYLSKMFQVASIFEDVDDLSVSDERKLSEVLYIQPNLGSVSDFNAQSHELEKYRYSIPKKRYENSHEFADTDNVGHVCKSRYTRGGSVVVVT
        FSNFENPQSYLSKM  VASIFEDVDD SVSDERKLSEVLYIQPNLGSV  FNA S + E + YSIPKKRYENS EF DT++VGH CKSRYTRGGSVVVV 
Subjt:  FSNFENPQSYLSKMFQVASIFEDVDDLSVSDERKLSEVLYIQPNLGSVSDFNAQSHELEKYRYSIPKKRYENSHEFADTDNVGHVCKSRYTRGGSVVVVT

Query:  ETNRS-SGEWLESGAIVNYKPLGLPVRSLRSSLTEPDDVEFDCNDESCLSSKSSSKSSENNCER-SEFGDNCCVNLEEKFDETVIASMSPFQLREKFGKK
        ETNRS SGEWLESGAIVNYKPLGLPVRSLRS+LTEPDDVEFDC DESCLSSKSSSK+SE+NCER SEFGDNCCVNLEEKFDETVIA MSPFQLRE FGK 
Subjt:  ETNRS-SGEWLESGAIVNYKPLGLPVRSLRSSLTEPDDVEFDCNDESCLSSKSSSKSSENNCER-SEFGDNCCVNLEEKFDETVIASMSPFQLREKFGKK

Query:  MMRERGVRNAVLRPSHFRPPSIDETQFESLKKSSSFHSNLSQSSQTSSLSSSLSSTTRKHRKMSSLGNISYKSLHSRQYSMSSLSENSRGSSEDPLIEPE
        MMRERGV+NAVLRPSHFRP SIDETQFESLKKS S HSNLSQSSQTSSLS SLSSTTRKHRKMSSLGNISYKS HSRQYS+SSLSENSRGSSEDPLIEPE
Subjt:  MMRERGVRNAVLRPSHFRPPSIDETQFESLKKSSSFHSNLSQSSQTSSLSSSLSSTTRKHRKMSSLGNISYKSLHSRQYSMSSLSENSRGSSEDPLIEPE

Query:  NSSECNESVVSSPRLDRNFASIPKALSRGKSVRTIRANAIAIEEMKAQEEMYRNQVEHDDNIGNKFEGGMSPYMREDGIGHGWSGVVNPNAGNSNRFSK-
        NSSECNES++SSPRLDRNFA IPKALSRGKSVRTIRAN  AIEEMKAQ EMYRNQVEHDDN+GNKFEGGMSPYMREDG GHGW G+ +PNAG SNR  K 
Subjt:  NSSECNESVVSSPRLDRNFASIPKALSRGKSVRTIRANAIAIEEMKAQEEMYRNQVEHDDNIGNKFEGGMSPYMREDGIGHGWSGVVNPNAGNSNRFSK-

Query:  MAFSGIEEQKEDTESQLTDDG--KDNSEREDESLFASSDEEAALSMAGDSESGAHEVDKKAGEFIAKFREQIQLQRMASVDKRLRGGWGSFSSTSSSYFS
          FSGIEEQKED ESQLTDD   +DNSERED S F SSDEEAA SMAG+SESGA+EVDKKAGEFIAKFREQIQLQRMASVDKRLRGGWGSFSSTSSSYFS
Subjt:  MAFSGIEEQKEDTESQLTDDG--KDNSEREDESLFASSDEEAALSMAGDSESGAHEVDKKAGEFIAKFREQIQLQRMASVDKRLRGGWGSFSSTSSSYFS

A0A6J1H4M0 uncharacterized protein LOC1114599984.8e-25582.23Show/hide
Query:  MASSTSSPFTKPHFPHSPLPSISTTHHSKSCTQFLCKSLFFCIFLLLLPLFPSEAPEFVNQTLLTKFWELFHLLFVGIAVSYGLFSRRNIQVSVEEPRFS
        MASS SSPFTK HFPHSPLP       + SC QFLCKS+FFC FLLLLPLFPSEAP+FV+QTL TKFWELFHL+FVGIAVSYGLFS RN Q++V+EPR+S
Subjt:  MASSTSSPFTKPHFPHSPLPSISTTHHSKSCTQFLCKSLFFCIFLLLLPLFPSEAPEFVNQTLLTKFWELFHLLFVGIAVSYGLFSRRNIQVSVEEPRFS

Query:  NFENPQSYLSKMFQVASIFEDVDDLSVSDERKLSEVLYIQPNLGSVSDFNAQSHELEKYRYSIPKKRYENSHEFADTDNVGHVCKSRYTRGGSVVVVTET
        +FENPQSYLSKM  VASIF+DVDD  VSDERK+SEVLYIQP LGS SD NAQS   EK RYS+PKKRYENS+EFADTDNV H CKSRYTRGGSVVVV ET
Subjt:  NFENPQSYLSKMFQVASIFEDVDDLSVSDERKLSEVLYIQPNLGSVSDFNAQSHELEKYRYSIPKKRYENSHEFADTDNVGHVCKSRYTRGGSVVVVTET

Query:  NRSSGEWLESGAIVNYKPLGLPVRSLRSSLTEPDDVEFDCNDESCLSSKSSSKSSENNCE-RSEFGDNCCVNLEEKFDETVIASMSPFQLREKFGKKMMR
        NRSS     SG IVNYKPLGLPVRSLRSSLTE DDVEFDC DESCLSSKSS KSSENNCE  SEFGDNCCVNLEEKFDET IASMS FQLREKFGKK++R
Subjt:  NRSSGEWLESGAIVNYKPLGLPVRSLRSSLTEPDDVEFDCNDESCLSSKSSSKSSENNCE-RSEFGDNCCVNLEEKFDETVIASMSPFQLREKFGKKMMR

Query:  ERGVRNAVLRPSHFRPPSIDETQFESLKKSSSFHSNLSQSSQTSSLSSSLSSTTRKHRKMSSLGNISYKSLHSRQYSMSSLSENSRGSSEDPLIEPENSS
        ERG  NAVLRPSHFRPPSIDETQFESLKKS S HS LSQSSQTSSLSS LSSTTRK RKMSSL NISYKSLHSRQYS SSLSENSRGSSEDPLIE ENSS
Subjt:  ERGVRNAVLRPSHFRPPSIDETQFESLKKSSSFHSNLSQSSQTSSLSSSLSSTTRKHRKMSSLGNISYKSLHSRQYSMSSLSENSRGSSEDPLIEPENSS

Query:  ECNESVVSSPRLDRNFASIPKALSRGKSVRTIRANAIAIEEMKAQEEMYRNQVEHDDNIGNKF-EGGMS-PYMREDGIGHGWSGVVNPNAGNSNRFSKMA
        ECNESVVSSPR DRNFASIPKALS+GKSVR IRANA AIE+MKAQ EM+R QV+HDD IGNKF EGGMS PYMREDG G GW  VVNPNAGN NRF K  
Subjt:  ECNESVVSSPRLDRNFASIPKALSRGKSVRTIRANAIAIEEMKAQEEMYRNQVEHDDNIGNKF-EGGMS-PYMREDGIGHGWSGVVNPNAGNSNRFSKMA

Query:  FSGIEEQKEDTESQLTDDGKDNSEREDESLFASSDEEAALSMAGDSESGAHEVDKKAGEFIAKFREQIQLQRMASVDKRLR------GGWGSFSSTSSSY
        F GI+EQKE+TES + DD KD+SE EDESLFASSDEEA  SMAGDSESGA EVDKKAGEFIAKFREQIQLQRMASV+KRLR      GGWGSFSSTSSSY
Subjt:  FSGIEEQKEDTESQLTDDGKDNSEREDESLFASSDEEAALSMAGDSESGAHEVDKKAGEFIAKFREQIQLQRMASVDKRLR------GGWGSFSSTSSSY

Query:  FS
        FS
Subjt:  FS

A0A6J1KUS4 uncharacterized protein LOC1114989002.2e-26082.91Show/hide
Query:  MASSTSSPFTKPHFPHSPLPSISTTHHSKSCTQFLCKSLFFCIFLLLLPLFPSEAPEFVNQTLLTKFWELFHLLFVGIAVSYGLFSRRNIQVSVEEPRFS
        MASS SSPFTK HFPHSPLP    THHS SC QFLCKSLFFC FLLLLPLFPSEAP+FV+QTL TKFWELFHL+ VGIAVSYGLFS RN Q++V+EPR+S
Subjt:  MASSTSSPFTKPHFPHSPLPSISTTHHSKSCTQFLCKSLFFCIFLLLLPLFPSEAPEFVNQTLLTKFWELFHLLFVGIAVSYGLFSRRNIQVSVEEPRFS

Query:  NFENPQSYLSKMFQVASIFEDVDDLSVSDERKLSEVLYIQPNLGSVSDFNAQSHELEKYRYSIPKKRYENSHEFADTDNVGHVCKSRYTRGGSVVVVTET
        +FENPQSYLSKM  VASIF+DVDD SVSDERKLSEVLYIQPNLGS SD NAQS + EK RYSIPKKRYENS+EFADTDNV H CKSRYTRGGSVVVV ET
Subjt:  NFENPQSYLSKMFQVASIFEDVDDLSVSDERKLSEVLYIQPNLGSVSDFNAQSHELEKYRYSIPKKRYENSHEFADTDNVGHVCKSRYTRGGSVVVVTET

Query:  NRSSGEWLESGAIVNYKPLGLPVRSLRSSLTEPDDVEFDCNDESCLSSKSSSKSSENNCE-RSEFGDNCCVNLEEKFDETVIASMSPFQLREKFGKKMMR
        NRSS     SG IVNYKPLGLPVRSL+SSLTE DDVEFDC DESCLSSKSS KSSENNCE  SEFGDNCCVNLEEKFDET IASMS FQLREKFGKK++R
Subjt:  NRSSGEWLESGAIVNYKPLGLPVRSLRSSLTEPDDVEFDCNDESCLSSKSSSKSSENNCE-RSEFGDNCCVNLEEKFDETVIASMSPFQLREKFGKKMMR

Query:  ERGVRNAVLRPSHFRPPSIDETQFESLKKSSSFHSNLSQSSQTSSLSSSLSSTTRKHRKMSSLGNISYKSLHSRQYSMSSLSENSRGSSEDPLIEPENSS
        ERG  NAVLRPSHFRPPSIDETQFESLKKS S HSNLSQSSQTSSLSSSLSSTTRKH KMSSL NISYKSLHSRQYSMSSLSENSRGSSEDPLIE ENSS
Subjt:  ERGVRNAVLRPSHFRPPSIDETQFESLKKSSSFHSNLSQSSQTSSLSSSLSSTTRKHRKMSSLGNISYKSLHSRQYSMSSLSENSRGSSEDPLIEPENSS

Query:  ECNESVVSSPRLDRNFASIPKALSRGKSVRTIRANAIAIEEMKAQEEMYRNQVEHDDNIGNKF-EGGMSPYMREDGIGHGWSGVVNPNAGNSNRFSKMAF
        ECNESVVSSPR D NF SIPKALS+GKS+R I+ANA AIE++KAQ EM+R QV+HDD IGNKF EGG SPY+REDG GHGW  V NPNA N +RF    F
Subjt:  ECNESVVSSPRLDRNFASIPKALSRGKSVRTIRANAIAIEEMKAQEEMYRNQVEHDDNIGNKF-EGGMSPYMREDGIGHGWSGVVNPNAGNSNRFSKMAF

Query:  SGIEEQKEDTESQLTDDGKDNSEREDESLFASSDEEAALSMAGDSESGAHEVDKKAGEFIAKFREQIQLQRMASVDKRLR--GGWGSFSSTSSSYFS
         GI+EQKE+TES + DD KD+SE EDES FASSDEEAA SMAGDSESGA EVDKKAGEFIAKFREQIQLQRMASV+KRLR  GGWGSFSSTSSSYFS
Subjt:  SGIEEQKEDTESQLTDDGKDNSEREDESLFASSDEEAALSMAGDSESGAHEVDKKAGEFIAKFREQIQLQRMASVDKRLR--GGWGSFSSTSSSYFS

E5GCN2 Uncharacterized protein1.6e-27987.5Show/hide
Query:  MASSTSSPFTKPHFPHSPLPSISTTHHSKSCTQFLCKSLFFCIFLLLLPLFPSEAPEFVNQTLLTKFWELFHLLFVGIAVSYGLFSRRNIQVSV--EEPR
        MASS S+PFTKPHFPHSPLP  STT HS SCT FLCKSLFFCIFLLLLPLFPSEAPEFVNQTLLTKFWELFHL+FVGIAVSYGLFSRRN+QVSV  +EPR
Subjt:  MASSTSSPFTKPHFPHSPLPSISTTHHSKSCTQFLCKSLFFCIFLLLLPLFPSEAPEFVNQTLLTKFWELFHLLFVGIAVSYGLFSRRNIQVSV--EEPR

Query:  FSNFENPQSYLSKMFQVASIFEDVDDLSVSDERKLSEVLYIQPNLGSVSDFNAQSHELEKYRYSIPKKRYENSHEFADTDNVGHVCKSRYTRGGSVVVVT
        FSNFENPQSYLSKM  VASIFEDVDD SVSDERKLSEVLYIQPNLGSV  FNA S + E + YSIPKKRYENS EF DT++VGH CKSRYTRGGSVVVV 
Subjt:  FSNFENPQSYLSKMFQVASIFEDVDDLSVSDERKLSEVLYIQPNLGSVSDFNAQSHELEKYRYSIPKKRYENSHEFADTDNVGHVCKSRYTRGGSVVVVT

Query:  ETNRS-SGEWLESGAIVNYKPLGLPVRSLRSSLTEPDDVEFDCNDESCLSSKSSSKSSENNCER-SEFGDNCCVNLEEKFDETVIASMSPFQLREKFGKK
        ETNRS SGEWLESGAIVNYKPLGLPVRSLRS+LTEPDDVEFDC DESCLSSKSSSK+SE+NCER SEFGDNCCVNLEEKFDETVIA MSPFQLRE FGK 
Subjt:  ETNRS-SGEWLESGAIVNYKPLGLPVRSLRSSLTEPDDVEFDCNDESCLSSKSSSKSSENNCER-SEFGDNCCVNLEEKFDETVIASMSPFQLREKFGKK

Query:  MMRERGVRNAVLRPSHFRPPSIDETQFESLKKSSSFHSNLSQSSQTSSLSSSLSSTTRKHRKMSSLGNISYKSLHSRQYSMSSLSENSRGSSEDPLIEPE
        MMRERGV+NAVLRPSHFRP SIDETQFESLKKS S HSNLSQSSQTSSLS SLSSTTRKHRKMSSLGNISYKS HSRQYS+SSLSENSRGSSEDPLIEPE
Subjt:  MMRERGVRNAVLRPSHFRPPSIDETQFESLKKSSSFHSNLSQSSQTSSLSSSLSSTTRKHRKMSSLGNISYKSLHSRQYSMSSLSENSRGSSEDPLIEPE

Query:  NSSECNESVVSSPRLDRNFASIPKALSRGKSVRTIRANAIAIEEMKAQEEMYRNQVEHDDNIGNKFEGGMSPYMREDGIGHGWSGVVNPNAGNSNRFSK-
        NSSECNES++SSPRLDRNFA IPKALSRGKSVRTIRAN  AIEEMKAQ EMYRNQVEHDDN+GNKFEGGMSPYMREDG GHGW G+ +PNAG SNR  K 
Subjt:  NSSECNESVVSSPRLDRNFASIPKALSRGKSVRTIRANAIAIEEMKAQEEMYRNQVEHDDNIGNKFEGGMSPYMREDGIGHGWSGVVNPNAGNSNRFSK-

Query:  MAFSGIEEQKEDTESQLTDDG--KDNSEREDESLFASSDEEAALSMAGDSESGAHEVDKKAGEFIAKFREQIQLQRMASVDKRLRGGWGSFSSTSSSYFS
          FSGIEEQKED ESQLTDD   +DNSERED S F SSDEEAA SMAG+SESGA+EVDKKAGEFIAKFREQIQLQRMASVDKRLRGGWGSFSSTSSSYFS
Subjt:  MAFSGIEEQKEDTESQLTDDG--KDNSEREDESLFASSDEEAALSMAGDSESGAHEVDKKAGEFIAKFREQIQLQRMASVDKRLRGGWGSFSSTSSSYFS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G60380.1 FUNCTIONS IN: molecular_function unknown2.8e-3435.34Show/hide
Query:  STSSPFTKPHFPHSPLPSISTTHHSKSCTQFLCKSLFFCIFLLLLPLFPSEAPEFVNQTLLTKFWELFHLLFVGIAVSYGLFSRRNIQVSVEEPRFSNFE
        ++ +P+TK   P + +      + S     F CKS+ F +FLL LPLFPS+AP+FV +T+LTKFWEL HLLFVGIAV+YGLFSRRN++ +V+       E
Subjt:  STSSPFTKPHFPHSPLPSISTTHHSKSCTQFLCKSLFFCIFLLLLPLFPSEAPEFVNQTLLTKFWELFHLLFVGIAVSYGLFSRRNIQVSVEEPRFSNFE

Query:  NPQSYLSKMFQVASIF-EDVDDLSVS--DERKLSEVLYIQPNLGSVSDFNAQSHELEKYRYSIPKKRYENSHEFADTDNVGHVCKSRYTRGGSVVVVTET
        +  SY+S++FQV+S+F E+ DD S    D R    V      +G    F  +S ELE+            S EF +T+ V     S+Y +G S VVV   
Subjt:  NPQSYLSKMFQVASIF-EDVDDLSVS--DERKLSEVLYIQPNLGSVSDFNAQSHELEKYRYSIPKKRYENSHEFADTDNVGHVCKSRYTRGGSVVVVTET

Query:  NRSSGEWLESGAIVNYKPLGLPVRSLRSSLTEPDDVEFDCNDESCLSSKSSSKSSENNCERSEFGDNCCVNLEEKFDETVIASMSPFQLREKFGKKMMRE
              +   G +V ++PLGLP+R LRSSL           D + L  KS + S    C+ +   +   +  +  FDE + A  SP   + +   +MM  
Subjt:  NRSSGEWLESGAIVNYKPLGLPVRSLRSSLTEPDDVEFDCNDESCLSSKSSSKSSENNCERSEFGDNCCVNLEEKFDETVIASMSPFQLREKFGKKMMRE

Query:  RGVRNAVLRPSHFRPPSIDETQFESLKKSSSFHSNLSQSSQTSSLSSSLSSTTRKHRKMSSLGNISYKSLHSR-QYSMSSLSENSRGSSEDPLIEPENS
         G+ +    PS+F+P S+DET      KS S  S  S SSQTS  S       +   + S   ++S +SL+S  +  +   S  S   S  P + P  S
Subjt:  RGVRNAVLRPSHFRPPSIDETQFESLKKSSSFHSNLSQSSQTSSLSSSLSSTTRKHRKMSSLGNISYKSLHSR-QYSMSSLSENSRGSSEDPLIEPENS

AT3G60380.1 FUNCTIONS IN: molecular_function unknown5.2e-0437.65Show/hide
Query:  EQKEDTESQLTDDGKDNSEREDESLFASSDEEAALSMAGDSESGAHEVDKKAGEFIAKFREQIQLQRMASVDKRLRGGWGSFSST
        E +  +E +     +  +E++ E  F   +EEAA     ++    +EVD+KAGEFIAKFREQI+LQ++ S ++   GG G F ++
Subjt:  EQKEDTESQLTDDGKDNSEREDESLFASSDEEAALSMAGDSESGAHEVDKKAGEFIAKFREQIQLQRMASVDKRLRGGWGSFSST

AT4G16790.1 hydroxyproline-rich glycoprotein family protein1.3e-1037.82Show/hide
Query:  KSCTQFLCKSLFFCIFLLLLPLFPSEAPEFVNQTLLTKFWELFHLLFVGIAVSYGLFSRRN--------IQVSVEEPRFSNFENPQSYLSKMFQVASIF-
        K  ++F+ K+L   +   ++P+F S+ PE  NQT L    EL HL+FVGIAVSYGLFSRRN           S       +  N  SY+ K+ +V+S+F 
Subjt:  KSCTQFLCKSLFFCIFLLLLPLFPSEAPEFVNQTLLTKFWELFHLLFVGIAVSYGLFSRRN--------IQVSVEEPRFSNFENPQSYLSKMFQVASIF-

Query:  ------EDVDDLSVSDERK
               +  D S  D+RK
Subjt:  ------EDVDDLSVSDERK

AT4G16790.1 hydroxyproline-rich glycoprotein family protein5.8e-0335.29Show/hide
Query:  IEEQKEDTE---------SQLTDDGKDNSEREDESLFASSDEEAALSMAGDSE-SGAHEVDKKAGEFIAKFREQIQLQRMASVDK
        +E +++DTE         S+  ++ ++  +R  E+      E+  +   G SE +   +VDKKA EFIAKFREQI+LQR+ S+ +
Subjt:  IEEQKEDTE---------SQLTDDGKDNSEREDESLFASSDEEAALSMAGDSE-SGAHEVDKKAGEFIAKFREQIQLQRMASVDK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTCTTCAACTTCCAGCCCTTTCACCAAACCCCATTTTCCCCATTCTCCACTTCCATCAATATCTACCACTCACCATAGCAAGTCCTGCACACAGTTCCTCTGTAA
ATCCCTCTTCTTCTGCATTTTTCTCCTCCTCCTTCCTCTCTTCCCTTCCGAAGCTCCAGAATTCGTCAATCAGACTTTGCTCACCAAATTCTGGGAGCTCTTTCATCTCT
TGTTCGTCGGCATTGCTGTTTCCTATGGTCTCTTCAGCAGAAGGAACATCCAGGTGAGTGTAGAAGAACCTCGCTTCTCCAATTTTGAAAATCCGCAGTCGTATTTGTCT
AAGATGTTTCAAGTCGCTTCGATTTTTGAAGATGTTGACGATTTGAGTGTTTCTGATGAGAGGAAATTGAGTGAAGTTTTGTACATTCAGCCGAATCTTGGATCTGTGAG
TGATTTTAATGCCCAATCTCACGAACTGGAAAAATACCGTTATTCAATACCCAAAAAAAGGTATGAAAATTCTCATGAATTTGCTGATACTGATAATGTCGGTCATGTTT
GTAAATCGAGATATACTCGGGGTGGATCTGTGGTGGTTGTTACTGAAACAAATCGTAGTTCTGGTGAATGGTTGGAATCAGGAGCCATTGTAAATTATAAACCTCTAGGT
TTGCCTGTTAGGAGTCTGAGGTCGAGTCTTACTGAGCCCGACGATGTTGAATTTGATTGTAATGATGAATCTTGTTTGAGTTCTAAAAGTTCATCCAAGAGCTCTGAGAA
TAATTGTGAAAGAAGTGAATTTGGTGATAATTGTTGTGTGAATTTGGAGGAGAAGTTTGATGAGACTGTAATTGCATCAATGTCCCCATTTCAATTGCGTGAGAAATTTG
GAAAGAAGATGATGAGAGAGAGAGGAGTTAGGAATGCTGTTCTTCGCCCTTCCCATTTTAGACCTCCTTCCATTGATGAAACTCAATTTGAATCACTAAAAAAATCAAGT
TCTTTTCATTCTAATCTTTCTCAGTCATCACAAACTAGTTCCCTCTCTTCTTCGTTGTCATCGACAACGAGAAAGCACCGTAAAATGTCGTCGCTCGGTAACATTTCCTA
TAAATCGTTGCATTCTCGACAATATAGTATGAGTTCTCTGTCTGAAAACAGTAGAGGGAGCTCTGAAGACCCTCTGATTGAACCAGAAAATTCATCTGAATGCAACGAAT
CCGTGGTAAGTTCCCCGCGTTTGGACAGGAATTTCGCAAGTATTCCGAAAGCTTTATCTCGGGGAAAATCCGTTAGAACAATTAGAGCAAATGCAATTGCCATAGAGGAA
ATGAAAGCTCAAGAGGAGATGTATAGAAACCAAGTTGAACATGATGACAATATAGGGAATAAGTTTGAAGGTGGAATGTCACCATATATGAGAGAAGATGGAATAGGACA
TGGATGGTCTGGTGTTGTTAACCCGAATGCTGGTAACTCTAATCGTTTTTCGAAGATGGCATTCTCGGGGATTGAGGAGCAAAAGGAAGACACTGAGAGTCAGCTAACAG
ATGATGGTAAAGATAACTCTGAGAGGGAGGATGAAAGTTTGTTTGCGAGTTCAGATGAAGAAGCTGCTTTGAGTATGGCGGGCGATTCGGAATCGGGGGCTCACGAGGTT
GACAAGAAGGCAGGCGAGTTCATAGCCAAGTTCAGGGAGCAAATACAGCTTCAGAGGATGGCTTCAGTAGATAAAAGATTGAGAGGAGGATGGGGCTCATTCAGCAGCAC
AAGCAGCAGCTATTTTAGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGTCTTCAACTTCCAGCCCTTTCACCAAACCCCATTTTCCCCATTCTCCACTTCCATCAATATCTACCACTCACCATAGCAAGTCCTGCACACAGTTCCTCTGTAA
ATCCCTCTTCTTCTGCATTTTTCTCCTCCTCCTTCCTCTCTTCCCTTCCGAAGCTCCAGAATTCGTCAATCAGACTTTGCTCACCAAATTCTGGGAGCTCTTTCATCTCT
TGTTCGTCGGCATTGCTGTTTCCTATGGTCTCTTCAGCAGAAGGAACATCCAGGTGAGTGTAGAAGAACCTCGCTTCTCCAATTTTGAAAATCCGCAGTCGTATTTGTCT
AAGATGTTTCAAGTCGCTTCGATTTTTGAAGATGTTGACGATTTGAGTGTTTCTGATGAGAGGAAATTGAGTGAAGTTTTGTACATTCAGCCGAATCTTGGATCTGTGAG
TGATTTTAATGCCCAATCTCACGAACTGGAAAAATACCGTTATTCAATACCCAAAAAAAGGTATGAAAATTCTCATGAATTTGCTGATACTGATAATGTCGGTCATGTTT
GTAAATCGAGATATACTCGGGGTGGATCTGTGGTGGTTGTTACTGAAACAAATCGTAGTTCTGGTGAATGGTTGGAATCAGGAGCCATTGTAAATTATAAACCTCTAGGT
TTGCCTGTTAGGAGTCTGAGGTCGAGTCTTACTGAGCCCGACGATGTTGAATTTGATTGTAATGATGAATCTTGTTTGAGTTCTAAAAGTTCATCCAAGAGCTCTGAGAA
TAATTGTGAAAGAAGTGAATTTGGTGATAATTGTTGTGTGAATTTGGAGGAGAAGTTTGATGAGACTGTAATTGCATCAATGTCCCCATTTCAATTGCGTGAGAAATTTG
GAAAGAAGATGATGAGAGAGAGAGGAGTTAGGAATGCTGTTCTTCGCCCTTCCCATTTTAGACCTCCTTCCATTGATGAAACTCAATTTGAATCACTAAAAAAATCAAGT
TCTTTTCATTCTAATCTTTCTCAGTCATCACAAACTAGTTCCCTCTCTTCTTCGTTGTCATCGACAACGAGAAAGCACCGTAAAATGTCGTCGCTCGGTAACATTTCCTA
TAAATCGTTGCATTCTCGACAATATAGTATGAGTTCTCTGTCTGAAAACAGTAGAGGGAGCTCTGAAGACCCTCTGATTGAACCAGAAAATTCATCTGAATGCAACGAAT
CCGTGGTAAGTTCCCCGCGTTTGGACAGGAATTTCGCAAGTATTCCGAAAGCTTTATCTCGGGGAAAATCCGTTAGAACAATTAGAGCAAATGCAATTGCCATAGAGGAA
ATGAAAGCTCAAGAGGAGATGTATAGAAACCAAGTTGAACATGATGACAATATAGGGAATAAGTTTGAAGGTGGAATGTCACCATATATGAGAGAAGATGGAATAGGACA
TGGATGGTCTGGTGTTGTTAACCCGAATGCTGGTAACTCTAATCGTTTTTCGAAGATGGCATTCTCGGGGATTGAGGAGCAAAAGGAAGACACTGAGAGTCAGCTAACAG
ATGATGGTAAAGATAACTCTGAGAGGGAGGATGAAAGTTTGTTTGCGAGTTCAGATGAAGAAGCTGCTTTGAGTATGGCGGGCGATTCGGAATCGGGGGCTCACGAGGTT
GACAAGAAGGCAGGCGAGTTCATAGCCAAGTTCAGGGAGCAAATACAGCTTCAGAGGATGGCTTCAGTAGATAAAAGATTGAGAGGAGGATGGGGCTCATTCAGCAGCAC
AAGCAGCAGCTATTTTAGTTGA
Protein sequenceShow/hide protein sequence
MASSTSSPFTKPHFPHSPLPSISTTHHSKSCTQFLCKSLFFCIFLLLLPLFPSEAPEFVNQTLLTKFWELFHLLFVGIAVSYGLFSRRNIQVSVEEPRFSNFENPQSYLS
KMFQVASIFEDVDDLSVSDERKLSEVLYIQPNLGSVSDFNAQSHELEKYRYSIPKKRYENSHEFADTDNVGHVCKSRYTRGGSVVVVTETNRSSGEWLESGAIVNYKPLG
LPVRSLRSSLTEPDDVEFDCNDESCLSSKSSSKSSENNCERSEFGDNCCVNLEEKFDETVIASMSPFQLREKFGKKMMRERGVRNAVLRPSHFRPPSIDETQFESLKKSS
SFHSNLSQSSQTSSLSSSLSSTTRKHRKMSSLGNISYKSLHSRQYSMSSLSENSRGSSEDPLIEPENSSECNESVVSSPRLDRNFASIPKALSRGKSVRTIRANAIAIEE
MKAQEEMYRNQVEHDDNIGNKFEGGMSPYMREDGIGHGWSGVVNPNAGNSNRFSKMAFSGIEEQKEDTESQLTDDGKDNSEREDESLFASSDEEAALSMAGDSESGAHEV
DKKAGEFIAKFREQIQLQRMASVDKRLRGGWGSFSSTSSSYFS