; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg034463 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg034463
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionDUF761 domain-containing protein
Genome locationscaffold4:14272894..14275686
RNA-Seq ExpressionSpg034463
SyntenySpg034463
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR008480 - Protein of unknown function DUF761, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADN34231.1 hypothetical protein [Cucumis melo subsp. melo]1.0e-27185.24Show/hide
Query:  MASSTSSPFTKPHFPHSPLPPTPTTHKRKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVNQTLLTKFWELFHLMFVGIAVSYGLFSRRNIQVSV--DEPR
        MASS S+PFTKPHFPHSPLPPT TT    SC  FLCKSLFFCIFLLLLPLFPSEAP+FVNQTLLTKFWELFHLMFVGIAVSYGLFSRRN+QVSV  DEPR
Subjt:  MASSTSSPFTKPHFPHSPLPPTPTTHKRKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVNQTLLTKFWELFHLMFVGIAVSYGLFSRRNIQVSV--DEPR

Query:  FSNFENPQSYLSKTFHVASIFEDVDDFSASDERKLSEVLYIQPNRGSVSDFGDLNAQSRQQEKLRYSIPKKRYENSYESADNDSVGHACKSRYTRGGSVV
        FSNFENPQSYLSK  HVASIFEDVDDFS SDERKLSEVLYIQPN GSV  F   NA SRQQE   YSIPKKRYENS E  D +SVGHACKSRYTRGGSVV
Subjt:  FSNFENPQSYLSKTFHVASIFEDVDDFSASDERKLSEVLYIQPNRGSVSDFGDLNAQSRQQEKLRYSIPKKRYENSYESADNDSVGHACKSRYTRGGSVV

Query:  VVAETNR-RSSEWLDSGAIVNYKPLGLPVRSLRSNLTEPDDVEFECGDESCLSSKSSSKSSENNCERRSEFGDNCCVNLEEKFDEAVIASMSQFQLREKF
        VVAETNR  S EWL+SGAIVNYKPLGLPVRSLRSNLTEPDDVEF+CGDESCLSSKSSSK+SE+NCER SEFGDNCCVNLEEKFDE VIA MS FQLRE F
Subjt:  VVAETNR-RSSEWLDSGAIVNYKPLGLPVRSLRSNLTEPDDVEFECGDESCLSSKSSSKSSENNCERRSEFGDNCCVNLEEKFDEAVIASMSQFQLREKF

Query:  GKKVMRERGVGNAALRPSHFRPPSIDETQFESLKKSRSLRSPLSQSSQTSSFSSSLSPTTRKHRKMSSLGNIPYKSLHSRQYSMSSLSENSRGSSEDPLI
        GK +MRERGV NA LRPSHFRP SIDETQFESLKKSRSL S LSQSSQTSS S SLS TTRKHRKMSSLGNI YKS HSRQYS+SSLSENSRGSSEDPLI
Subjt:  GKKVMRERGVGNAALRPSHFRPPSIDETQFESLKKSRSLRSPLSQSSQTSSFSSSLSPTTRKHRKMSSLGNIPYKSLHSRQYSMSSLSENSRGSSEDPLI

Query:  EPENSSECNESVLSSPNLDRNFASIPKALSRGKSVRTIRANAVSIEEMKAQEMYRNQVEHGENIGKKFAEGGESPYMREDGVGHGWSGIINPNAGNSNRL
        EPENSSECNES++SSP LDRNFA IPKALSRGKSVRTIRAN  +IEEMKAQEMYRNQVEH +N+G KF EGG SPYMREDG GHGW GI +PNAG SNR 
Subjt:  EPENSSECNESVLSSPNLDRNFASIPKALSRGKSVRTIRANAVSIEEMKAQEMYRNQVEHGENIGKKFAEGGESPYMREDGVGHGWSGIINPNAGNSNRL

Query:  PK-MTFSGIEEQKEETESLLTDD--CKDNSEREDESIFASSDEEAASSMAGESESGAHEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGWGSFSSTSSS
        PK  TFSGIEEQKE+ ES LTDD   +DNSERED S F SSDEEAASSMAGESESGA+EVDKKAGEFIAKFREQIQLQRMASV+KRLRGGWGSFSSTSSS
Subjt:  PK-MTFSGIEEQKEETESLLTDD--CKDNSEREDESIFASSDEEAASSMAGESESGAHEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGWGSFSSTSSS

Query:  YFS
        YFS
Subjt:  YFS

KAG6575261.1 hypothetical protein SDJN03_25900, partial [Cucurbita argyrosperma subsp. sororia]3.3e-25981.7Show/hide
Query:  MASSTSSPFTKPHFPHSPLPPTPTTHKRKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVNQTLLTKFWELFHLMFVGIAVSYGLFSRRNIQVSVDEPRFS
        MASS SSPFTK HFPHSPLP  P      SCAQFLCKS+FFC FLLLLPLFPSEAPDFV+QTL TKFWELFHLMFVGIAVSYGLFS RN Q++VDEPR+S
Subjt:  MASSTSSPFTKPHFPHSPLPPTPTTHKRKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVNQTLLTKFWELFHLMFVGIAVSYGLFSRRNIQVSVDEPRFS

Query:  NFENPQSYLSKTFHVASIFEDVDDFSASDERKLSEVLYIQPNRGSVSDFGDLNAQSRQQEKLRYSIPKKRYENSYESADNDSVGHACKSRYTRGGSVVVV
        +FENPQSYLSK  +VASIF+DVDDF  SDERK+SEVLYIQP  GS S   DLNAQSR QEKLRYS+PKKRYENSYE AD D+V HACKSRYTRGGSVVVV
Subjt:  NFENPQSYLSKTFHVASIFEDVDDFSASDERKLSEVLYIQPNRGSVSDFGDLNAQSRQQEKLRYSIPKKRYENSYESADNDSVGHACKSRYTRGGSVVVV

Query:  AETNRRSSEWLDSGAIVNYKPLGLPVRSLRSNLTEPDDVEFECGDESCLSSKSSSKSSENNCERRSEFGDNCCVNLEEKFDEAVIASMSQFQLREKFGKK
         ETNR S     SG IVNYKPLGLPVRSLRS+LTE DDVEF+CGDESCLSSKSS KSSENNCE  SEFGDNCCVNLEEKFDE  IASMS FQLREKFGKK
Subjt:  AETNRRSSEWLDSGAIVNYKPLGLPVRSLRSNLTEPDDVEFECGDESCLSSKSSSKSSENNCERRSEFGDNCCVNLEEKFDEAVIASMSQFQLREKFGKK

Query:  VMRERGVGNAALRPSHFRPPSIDETQFESLKKSRSLRSPLSQSSQTSSFSSSLSPTTRKHRKMSSLGNIPYKSLHSRQYSMSSLSENSRGSSEDPLIEPE
        V+RERG GNA LRPSHFRPPSIDETQFESL+KS SL S LSQSSQTSS SS LS TTRKH KMSSL NI YKSLHSRQYSMSSLSENSRGSSEDPLIE E
Subjt:  VMRERGVGNAALRPSHFRPPSIDETQFESLKKSRSLRSPLSQSSQTSSFSSSLSPTTRKHRKMSSLGNIPYKSLHSRQYSMSSLSENSRGSSEDPLIEPE

Query:  NSSECNESVLSSPNLDRNFASIPKALSRGKSVRTIRANAVSIEEMKAQEMYRNQVEHGENIGKKFAEGGES-PYMREDGVGHGWSGIINPNAGNSNRLPK
        NSSECNESV+SSP  DRNFASIPKALS+GKSVR IRANA +IE+MKAQEM+R QV+H + IG KF EGG S PYMREDG GHGW  ++NPNAGN NR PK
Subjt:  NSSECNESVLSSPNLDRNFASIPKALSRGKSVRTIRANAVSIEEMKAQEMYRNQVEHGENIGKKFAEGGES-PYMREDGVGHGWSGIINPNAGNSNRLPK

Query:  MTFSGIEEQKEETESLLTDDCKDNSEREDESIFASSDEEAASSMAGESESGAHEVDKKAGEFIAKFREQIQLQRMASVEKRLR---GGWGSFSSTSSSYF
         TF GI+EQKEETESL+ DD KD SE EDES+FASSDEEA SSMAG+SESGA EVDKKAGEFIAKFREQIQLQRMASVEKRLR   GGWGSFSSTSSSYF
Subjt:  MTFSGIEEQKEETESLLTDDCKDNSEREDESIFASSDEEAASSMAGESESGAHEVDKKAGEFIAKFREQIQLQRMASVEKRLR---GGWGSFSSTSSSYF

Query:  S
        S
Subjt:  S

KAG7013816.1 hypothetical protein SDJN02_23985, partial [Cucurbita argyrosperma subsp. argyrosperma]1.3e-25881.7Show/hide
Query:  MASSTSSPFTKPHFPHSPLPPTPTTHKRKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVNQTLLTKFWELFHLMFVGIAVSYGLFSRRNIQVSVDEPRFS
        MASS SSPFTK HFPHSPLP  P      SCAQFLCKS+FFC FLLLLPLFPSEAPDFV+QTL TKFWELFHLMFVGIAVSYGLFS RN Q++VDEPR+S
Subjt:  MASSTSSPFTKPHFPHSPLPPTPTTHKRKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVNQTLLTKFWELFHLMFVGIAVSYGLFSRRNIQVSVDEPRFS

Query:  NFENPQSYLSKTFHVASIFEDVDDFSASDERKLSEVLYIQPNRGSVSDFGDLNAQSRQQEKLRYSIPKKRYENSYESADNDSVGHACKSRYTRGGSVVVV
        +FENPQSYLSK  +VASIF+DVDDF  SDERK+SEVLYIQP  GS S   DLNAQSR QEKLRYS+PKKRYENSYE AD D+V HACKSRYTRGGSVVVV
Subjt:  NFENPQSYLSKTFHVASIFEDVDDFSASDERKLSEVLYIQPNRGSVSDFGDLNAQSRQQEKLRYSIPKKRYENSYESADNDSVGHACKSRYTRGGSVVVV

Query:  AETNRRSSEWLDSGAIVNYKPLGLPVRSLRSNLTEPDDVEFECGDESCLSSKSSSKSSENNCERRSEFGDNCCVNLEEKFDEAVIASMSQFQLREKFGKK
         ETNR S     SG IVNYKPLGLPVRSLRS+LTE DDVEF+CGDESCLSSKSS KSSENNCE  SEFGDNCCVNLEEKFDE  IASMS FQLREKFGKK
Subjt:  AETNRRSSEWLDSGAIVNYKPLGLPVRSLRSNLTEPDDVEFECGDESCLSSKSSSKSSENNCERRSEFGDNCCVNLEEKFDEAVIASMSQFQLREKFGKK

Query:  VMRERGVGNAALRPSHFRPPSIDETQFESLKKSRSLRSPLSQSSQTSSFSSSLSPTTRKHRKMSSLGNIPYKSLHSRQYSMSSLSENSRGSSEDPLIEPE
        V+RERG GNA LRPSHFRPPSIDETQFESL+KS SL S LSQSSQTSS SS LS TTRKH KMSSL NI YKSLHSRQYSMSSLSENSRGSSEDPLIE E
Subjt:  VMRERGVGNAALRPSHFRPPSIDETQFESLKKSRSLRSPLSQSSQTSSFSSSLSPTTRKHRKMSSLGNIPYKSLHSRQYSMSSLSENSRGSSEDPLIEPE

Query:  NSSECNESVLSSPNLDRNFASIPKALSRGKSVRTIRANAVSIEEMKAQEMYRNQVEHGENIGKKFAEGGES-PYMREDGVGHGWSGIINPNAGNSNRLPK
        NSSECNESV+SSP  DRNFASIPKALS+GKSVR IRANA +IE+MKAQEM+R QV+H + IG KF EGG S PYMREDG GHGW  ++NPNAGN NR PK
Subjt:  NSSECNESVLSSPNLDRNFASIPKALSRGKSVRTIRANAVSIEEMKAQEMYRNQVEHGENIGKKFAEGGES-PYMREDGVGHGWSGIINPNAGNSNRLPK

Query:  MTFSGIEEQKEETESLLTDDCKDNSEREDESIFASSDEEAASSMAGESESGAHEVDKKAGEFIAKFREQIQLQRMASVEKRLR---GGWGSFSSTSSSYF
         TF GI+EQKEETESL+ DD KD SE EDES FASSDEEA SSMAG+SESGA EVDKKAGEFIAKFREQIQLQRMASVEKRLR   GGWGSFSSTSSSYF
Subjt:  MTFSGIEEQKEETESLLTDDCKDNSEREDESIFASSDEEAASSMAGESESGAHEVDKKAGEFIAKFREQIQLQRMASVEKRLR---GGWGSFSSTSSSYF

Query:  S
        S
Subjt:  S

XP_004140631.1 uncharacterized protein LOC101220435 [Cucumis sativus]1.8e-26582.78Show/hide
Query:  MASSTSSPFTKPHFPHSPLPPTPTTHKRKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVNQTLLTKFWELFHLMFVGIAVSYGLFSRRNIQVSV--DEPR
        MA S S+PFTKPHFPHSPLPPT TT    SC QF+CKSLFFCIFLLLLPLFPSEAP+FVNQT LTKFWELFHLMF+GIAVSYGLFSRRN+QVSV  DEPR
Subjt:  MASSTSSPFTKPHFPHSPLPPTPTTHKRKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVNQTLLTKFWELFHLMFVGIAVSYGLFSRRNIQVSV--DEPR

Query:  FSNFENPQSYLSKTFHVASIFEDVDDFSASDERKLSEVLYIQPNRGSVSDFGDLNAQSRQQEKLRYSIPKKRYENSYESADNDSVGHACKSRYTRGGSVV
        FSNFENPQSYLSK FHVASIFEDVDDFS SDERKLSEVLYIQPN GSVS    LNA SRQQE   YSIPKKRYENS E A+ D+VGHACKSRYTRGGSVV
Subjt:  FSNFENPQSYLSKTFHVASIFEDVDDFSASDERKLSEVLYIQPNRGSVSDFGDLNAQSRQQEKLRYSIPKKRYENSYESADNDSVGHACKSRYTRGGSVV

Query:  VVAETNR-RSSEWLDSGAIVNYKPLGLPVRSLRSNLTEPDDVEFECGDESCLSSKSSSKSSENNCERRSEFGDNCCVNLEEKFDEAVIASMSQFQLREKF
        VVAETNR  S EWL+SGAIVNYKPLGLPVRSL+S+LTEPDDVEF+CGDESCLSSKSSSK+SE+NCER SEFGDNCCVNLEEKFDE VIASMS FQLREKF
Subjt:  VVAETNR-RSSEWLDSGAIVNYKPLGLPVRSLRSNLTEPDDVEFECGDESCLSSKSSSKSSENNCERRSEFGDNCCVNLEEKFDEAVIASMSQFQLREKF

Query:  GKKVMRERGVGNAALRPSHFRPPSIDETQFESLKKSRSLRSPLSQSSQTSSFSSSLSPTTRKHRKMSSLGNIPYKSLHSRQYSMSSLSENSRGSSEDPLI
         K +MRER V NA LRPSHFRP SIDETQFESLKKS SL S LSQSSQTSS SS LS  TRKHRKMSSLGNI YKS HSRQYS+SSLSENSRGSSEDPLI
Subjt:  GKKVMRERGVGNAALRPSHFRPPSIDETQFESLKKSRSLRSPLSQSSQTSSFSSSLSPTTRKHRKMSSLGNIPYKSLHSRQYSMSSLSENSRGSSEDPLI

Query:  EPENSSECNESVLSSPNLDRNFASIPKALSRGKSVRTIRANAVSIEEMKAQEMYRNQVEHGENIGKKFAEGGESPYMREDGVGHGWSGIINPNAGNSNRL
        +PENSSECNESV+SSP LDRNFA+ PKALSRGKSVRT+RA+  +IEEMKAQEMYRNQVEH +N+  KF EGG SPYMRED  GHGW GI N NA  SNR 
Subjt:  EPENSSECNESVLSSPNLDRNFASIPKALSRGKSVRTIRANAVSIEEMKAQEMYRNQVEHGENIGKKFAEGGESPYMREDGVGHGWSGIINPNAGNSNRL

Query:  PK----MTFSGIEEQKEETESLLTDDCKDNSEREDESIFASSDEEAASSMAGESESGAHEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGWGSFSSTSS
         K     TFSGIEEQKE+TES +TDD KDNSERED+S F SSDEEAA SM G+SESGAHEVDKKAGEFIAKFREQIQLQRMASV+KRLRGGWGSFSST+S
Subjt:  PK----MTFSGIEEQKEETESLLTDDCKDNSEREDESIFASSDEEAASSMAGESESGAHEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGWGSFSSTSS

Query:  SYFS
        SYFS
Subjt:  SYFS

XP_023006022.1 uncharacterized protein LOC111498900 [Cucurbita maxima]1.6e-26181.97Show/hide
Query:  MASSTSSPFTKPHFPHSPLPPTPTTHKRKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVNQTLLTKFWELFHLMFVGIAVSYGLFSRRNIQVSVDEPRFS
        MASS SSPFTK HFPHSPLP  P TH   SCAQFLCKSLFFC FLLLLPLFPSEAPDFV+QTL TKFWELFHLM VGIAVSYGLFS RN Q++VDEPR+S
Subjt:  MASSTSSPFTKPHFPHSPLPPTPTTHKRKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVNQTLLTKFWELFHLMFVGIAVSYGLFSRRNIQVSVDEPRFS

Query:  NFENPQSYLSKTFHVASIFEDVDDFSASDERKLSEVLYIQPNRGSVSDFGDLNAQSRQQEKLRYSIPKKRYENSYESADNDSVGHACKSRYTRGGSVVVV
        +FENPQSYLSK  +VASIF+DVDDFS SDERKLSEVLYIQPN GS S   DLNAQSRQQEKLRYSIPKKRYENSYE AD D+V HACKSRYTRGGSVVVV
Subjt:  NFENPQSYLSKTFHVASIFEDVDDFSASDERKLSEVLYIQPNRGSVSDFGDLNAQSRQQEKLRYSIPKKRYENSYESADNDSVGHACKSRYTRGGSVVVV

Query:  AETNRRSSEWLDSGAIVNYKPLGLPVRSLRSNLTEPDDVEFECGDESCLSSKSSSKSSENNCERRSEFGDNCCVNLEEKFDEAVIASMSQFQLREKFGKK
         ETNR S     SG IVNYKPLGLPVRSL+S+LTE DDVEF+CGDESCLSSKSS KSSENNCE  SEFGDNCCVNLEEKFDE  IASMS FQLREKFGKK
Subjt:  AETNRRSSEWLDSGAIVNYKPLGLPVRSLRSNLTEPDDVEFECGDESCLSSKSSSKSSENNCERRSEFGDNCCVNLEEKFDEAVIASMSQFQLREKFGKK

Query:  VMRERGVGNAALRPSHFRPPSIDETQFESLKKSRSLRSPLSQSSQTSSFSSSLSPTTRKHRKMSSLGNIPYKSLHSRQYSMSSLSENSRGSSEDPLIEPE
        V+RERG GNA LRPSHFRPPSIDETQFESLKKS SL S LSQSSQTSS SSSLS TTRKH KMSSL NI YKSLHSRQYSMSSLSENSRGSSEDPLIE E
Subjt:  VMRERGVGNAALRPSHFRPPSIDETQFESLKKSRSLRSPLSQSSQTSSFSSSLSPTTRKHRKMSSLGNIPYKSLHSRQYSMSSLSENSRGSSEDPLIEPE

Query:  NSSECNESVLSSPNLDRNFASIPKALSRGKSVRTIRANAVSIEEMKAQEMYRNQVEHGENIGKKFAEGGESPYMREDGVGHGWSGIINPNAGNSNRLPKM
        NSSECNESV+SSP  D NF SIPKALS+GKS+R I+ANA +IE++KAQEM+R QV+H + IG KF EGG SPY+REDG GHGW  + NPNA N +R P  
Subjt:  NSSECNESVLSSPNLDRNFASIPKALSRGKSVRTIRANAVSIEEMKAQEMYRNQVEHGENIGKKFAEGGESPYMREDGVGHGWSGIINPNAGNSNRLPKM

Query:  TFSGIEEQKEETESLLTDDCKDNSEREDESIFASSDEEAASSMAGESESGAHEVDKKAGEFIAKFREQIQLQRMASVEKRLR--GGWGSFSSTSSSYFS
        TF GI+EQKEETESL+ DD KD+SE EDES FASSDEEAASSMAG+SESGA EVDKKAGEFIAKFREQIQLQRMASVEKRLR  GGWGSFSSTSSSYFS
Subjt:  TFSGIEEQKEETESLLTDDCKDNSEREDESIFASSDEEAASSMAGESESGAHEVDKKAGEFIAKFREQIQLQRMASVEKRLR--GGWGSFSSTSSSYFS

TrEMBL top hitse value%identityAlignment
A0A0A0K9X1 Uncharacterized protein8.8e-26682.78Show/hide
Query:  MASSTSSPFTKPHFPHSPLPPTPTTHKRKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVNQTLLTKFWELFHLMFVGIAVSYGLFSRRNIQVSV--DEPR
        MA S S+PFTKPHFPHSPLPPT TT    SC QF+CKSLFFCIFLLLLPLFPSEAP+FVNQT LTKFWELFHLMF+GIAVSYGLFSRRN+QVSV  DEPR
Subjt:  MASSTSSPFTKPHFPHSPLPPTPTTHKRKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVNQTLLTKFWELFHLMFVGIAVSYGLFSRRNIQVSV--DEPR

Query:  FSNFENPQSYLSKTFHVASIFEDVDDFSASDERKLSEVLYIQPNRGSVSDFGDLNAQSRQQEKLRYSIPKKRYENSYESADNDSVGHACKSRYTRGGSVV
        FSNFENPQSYLSK FHVASIFEDVDDFS SDERKLSEVLYIQPN GSVS    LNA SRQQE   YSIPKKRYENS E A+ D+VGHACKSRYTRGGSVV
Subjt:  FSNFENPQSYLSKTFHVASIFEDVDDFSASDERKLSEVLYIQPNRGSVSDFGDLNAQSRQQEKLRYSIPKKRYENSYESADNDSVGHACKSRYTRGGSVV

Query:  VVAETNR-RSSEWLDSGAIVNYKPLGLPVRSLRSNLTEPDDVEFECGDESCLSSKSSSKSSENNCERRSEFGDNCCVNLEEKFDEAVIASMSQFQLREKF
        VVAETNR  S EWL+SGAIVNYKPLGLPVRSL+S+LTEPDDVEF+CGDESCLSSKSSSK+SE+NCER SEFGDNCCVNLEEKFDE VIASMS FQLREKF
Subjt:  VVAETNR-RSSEWLDSGAIVNYKPLGLPVRSLRSNLTEPDDVEFECGDESCLSSKSSSKSSENNCERRSEFGDNCCVNLEEKFDEAVIASMSQFQLREKF

Query:  GKKVMRERGVGNAALRPSHFRPPSIDETQFESLKKSRSLRSPLSQSSQTSSFSSSLSPTTRKHRKMSSLGNIPYKSLHSRQYSMSSLSENSRGSSEDPLI
         K +MRER V NA LRPSHFRP SIDETQFESLKKS SL S LSQSSQTSS SS LS  TRKHRKMSSLGNI YKS HSRQYS+SSLSENSRGSSEDPLI
Subjt:  GKKVMRERGVGNAALRPSHFRPPSIDETQFESLKKSRSLRSPLSQSSQTSSFSSSLSPTTRKHRKMSSLGNIPYKSLHSRQYSMSSLSENSRGSSEDPLI

Query:  EPENSSECNESVLSSPNLDRNFASIPKALSRGKSVRTIRANAVSIEEMKAQEMYRNQVEHGENIGKKFAEGGESPYMREDGVGHGWSGIINPNAGNSNRL
        +PENSSECNESV+SSP LDRNFA+ PKALSRGKSVRT+RA+  +IEEMKAQEMYRNQVEH +N+  KF EGG SPYMRED  GHGW GI N NA  SNR 
Subjt:  EPENSSECNESVLSSPNLDRNFASIPKALSRGKSVRTIRANAVSIEEMKAQEMYRNQVEHGENIGKKFAEGGESPYMREDGVGHGWSGIINPNAGNSNRL

Query:  PK----MTFSGIEEQKEETESLLTDDCKDNSEREDESIFASSDEEAASSMAGESESGAHEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGWGSFSSTSS
         K     TFSGIEEQKE+TES +TDD KDNSERED+S F SSDEEAA SM G+SESGAHEVDKKAGEFIAKFREQIQLQRMASV+KRLRGGWGSFSST+S
Subjt:  PK----MTFSGIEEQKEETESLLTDDCKDNSEREDESIFASSDEEAASSMAGESESGAHEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGWGSFSSTSS

Query:  SYFS
        SYFS
Subjt:  SYFS

A0A5D3DMA5 DUF761 domain-containing protein4.8e-27285.24Show/hide
Query:  MASSTSSPFTKPHFPHSPLPPTPTTHKRKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVNQTLLTKFWELFHLMFVGIAVSYGLFSRRNIQVSV--DEPR
        MASS S+PFTKPHFPHSPLPPT TT    SC  FLCKSLFFCIFLLLLPLFPSEAP+FVNQTLLTKFWELFHLMFVGIAVSYGLFSRRN+QVSV  DEPR
Subjt:  MASSTSSPFTKPHFPHSPLPPTPTTHKRKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVNQTLLTKFWELFHLMFVGIAVSYGLFSRRNIQVSV--DEPR

Query:  FSNFENPQSYLSKTFHVASIFEDVDDFSASDERKLSEVLYIQPNRGSVSDFGDLNAQSRQQEKLRYSIPKKRYENSYESADNDSVGHACKSRYTRGGSVV
        FSNFENPQSYLSK  HVASIFEDVDDFS SDERKLSEVLYIQPN GSV  F   NA SRQQE   YSIPKKRYENS E  D +SVGHACKSRYTRGGSVV
Subjt:  FSNFENPQSYLSKTFHVASIFEDVDDFSASDERKLSEVLYIQPNRGSVSDFGDLNAQSRQQEKLRYSIPKKRYENSYESADNDSVGHACKSRYTRGGSVV

Query:  VVAETNR-RSSEWLDSGAIVNYKPLGLPVRSLRSNLTEPDDVEFECGDESCLSSKSSSKSSENNCERRSEFGDNCCVNLEEKFDEAVIASMSQFQLREKF
        VVAETNR  S EWL+SGAIVNYKPLGLPVRSLRSNLTEPDDVEF+CGDESCLSSKSSSK+SE+NCER SEFGDNCCVNLEEKFDE VIA MS FQLRE F
Subjt:  VVAETNR-RSSEWLDSGAIVNYKPLGLPVRSLRSNLTEPDDVEFECGDESCLSSKSSSKSSENNCERRSEFGDNCCVNLEEKFDEAVIASMSQFQLREKF

Query:  GKKVMRERGVGNAALRPSHFRPPSIDETQFESLKKSRSLRSPLSQSSQTSSFSSSLSPTTRKHRKMSSLGNIPYKSLHSRQYSMSSLSENSRGSSEDPLI
        GK +MRERGV NA LRPSHFRP SIDETQFESLKKSRSL S LSQSSQTSS S SLS TTRKHRKMSSLGNI YKS HSRQYS+SSLSENSRGSSEDPLI
Subjt:  GKKVMRERGVGNAALRPSHFRPPSIDETQFESLKKSRSLRSPLSQSSQTSSFSSSLSPTTRKHRKMSSLGNIPYKSLHSRQYSMSSLSENSRGSSEDPLI

Query:  EPENSSECNESVLSSPNLDRNFASIPKALSRGKSVRTIRANAVSIEEMKAQEMYRNQVEHGENIGKKFAEGGESPYMREDGVGHGWSGIINPNAGNSNRL
        EPENSSECNES++SSP LDRNFA IPKALSRGKSVRTIRAN  +IEEMKAQEMYRNQVEH +N+G KF EGG SPYMREDG GHGW GI +PNAG SNR 
Subjt:  EPENSSECNESVLSSPNLDRNFASIPKALSRGKSVRTIRANAVSIEEMKAQEMYRNQVEHGENIGKKFAEGGESPYMREDGVGHGWSGIINPNAGNSNRL

Query:  PK-MTFSGIEEQKEETESLLTDD--CKDNSEREDESIFASSDEEAASSMAGESESGAHEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGWGSFSSTSSS
        PK  TFSGIEEQKE+ ES LTDD   +DNSERED S F SSDEEAASSMAGESESGA+EVDKKAGEFIAKFREQIQLQRMASV+KRLRGGWGSFSSTSSS
Subjt:  PK-MTFSGIEEQKEETESLLTDD--CKDNSEREDESIFASSDEEAASSMAGESESGAHEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGWGSFSSTSSS

Query:  YFS
        YFS
Subjt:  YFS

A0A6J1H4M0 uncharacterized protein LOC1114599983.4e-25781.13Show/hide
Query:  MASSTSSPFTKPHFPHSPLPPTPTTHKRKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVNQTLLTKFWELFHLMFVGIAVSYGLFSRRNIQVSVDEPRFS
        MASS SSPFTK HFPHSPLP  P      SCAQFLCKS+FFC FLLLLPLFPSEAPDFV+QTL TKFWELFHLMFVGIAVSYGLFS RN Q++VDEPR+S
Subjt:  MASSTSSPFTKPHFPHSPLPPTPTTHKRKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVNQTLLTKFWELFHLMFVGIAVSYGLFSRRNIQVSVDEPRFS

Query:  NFENPQSYLSKTFHVASIFEDVDDFSASDERKLSEVLYIQPNRGSVSDFGDLNAQSRQQEKLRYSIPKKRYENSYESADNDSVGHACKSRYTRGGSVVVV
        +FENPQSYLSK  +VASIF+DVDDF  SDERK+SEVLYIQP  GS S   DLNAQSR QEKLRYS+PKKRYENSYE AD D+V HACKSRYTRGGSVVVV
Subjt:  NFENPQSYLSKTFHVASIFEDVDDFSASDERKLSEVLYIQPNRGSVSDFGDLNAQSRQQEKLRYSIPKKRYENSYESADNDSVGHACKSRYTRGGSVVVV

Query:  AETNRRSSEWLDSGAIVNYKPLGLPVRSLRSNLTEPDDVEFECGDESCLSSKSSSKSSENNCERRSEFGDNCCVNLEEKFDEAVIASMSQFQLREKFGKK
         ETNR S     SG IVNYKPLGLPVRSLRS+LTE DDVEF+CGDESCLSSKSS KSSENNCE  SEFGDNCCVNLEEKFDE  IASMS FQLREKFGKK
Subjt:  AETNRRSSEWLDSGAIVNYKPLGLPVRSLRSNLTEPDDVEFECGDESCLSSKSSSKSSENNCERRSEFGDNCCVNLEEKFDEAVIASMSQFQLREKFGKK

Query:  VMRERGVGNAALRPSHFRPPSIDETQFESLKKSRSLRSPLSQSSQTSSFSSSLSPTTRKHRKMSSLGNIPYKSLHSRQYSMSSLSENSRGSSEDPLIEPE
        V+RERG GNA LRPSHFRPPSIDETQFESLKKS SL S LSQSSQTSS SS LS TTRK RKMSSL NI YKSLHSRQYS SSLSENSRGSSEDPLIE E
Subjt:  VMRERGVGNAALRPSHFRPPSIDETQFESLKKSRSLRSPLSQSSQTSSFSSSLSPTTRKHRKMSSLGNIPYKSLHSRQYSMSSLSENSRGSSEDPLIEPE

Query:  NSSECNESVLSSPNLDRNFASIPKALSRGKSVRTIRANAVSIEEMKAQEMYRNQVEHGENIGKKFAEGGES-PYMREDGVGHGWSGIINPNAGNSNRLPK
        NSSECNESV+SSP  DRNFASIPKALS+GKSVR IRANA +IE+MKAQEM+R QV+H + IG KF EGG S PYMREDG G GW  ++NPNAGN NR PK
Subjt:  NSSECNESVLSSPNLDRNFASIPKALSRGKSVRTIRANAVSIEEMKAQEMYRNQVEHGENIGKKFAEGGES-PYMREDGVGHGWSGIINPNAGNSNRLPK

Query:  MTFSGIEEQKEETESLLTDDCKDNSEREDESIFASSDEEAASSMAGESESGAHEVDKKAGEFIAKFREQIQLQRMASVEKRLR------GGWGSFSSTSS
         TF GI+EQKEETESL+ DD KD+SE EDES+FASSDEEA SSMAG+SESGA EVDKKAGEFIAKFREQIQLQRMASVEKRLR      GGWGSFSSTSS
Subjt:  MTFSGIEEQKEETESLLTDDCKDNSEREDESIFASSDEEAASSMAGESESGAHEVDKKAGEFIAKFREQIQLQRMASVEKRLR------GGWGSFSSTSS

Query:  SYFS
        SYFS
Subjt:  SYFS

A0A6J1KUS4 uncharacterized protein LOC1114989007.7e-26281.97Show/hide
Query:  MASSTSSPFTKPHFPHSPLPPTPTTHKRKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVNQTLLTKFWELFHLMFVGIAVSYGLFSRRNIQVSVDEPRFS
        MASS SSPFTK HFPHSPLP  P TH   SCAQFLCKSLFFC FLLLLPLFPSEAPDFV+QTL TKFWELFHLM VGIAVSYGLFS RN Q++VDEPR+S
Subjt:  MASSTSSPFTKPHFPHSPLPPTPTTHKRKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVNQTLLTKFWELFHLMFVGIAVSYGLFSRRNIQVSVDEPRFS

Query:  NFENPQSYLSKTFHVASIFEDVDDFSASDERKLSEVLYIQPNRGSVSDFGDLNAQSRQQEKLRYSIPKKRYENSYESADNDSVGHACKSRYTRGGSVVVV
        +FENPQSYLSK  +VASIF+DVDDFS SDERKLSEVLYIQPN GS S   DLNAQSRQQEKLRYSIPKKRYENSYE AD D+V HACKSRYTRGGSVVVV
Subjt:  NFENPQSYLSKTFHVASIFEDVDDFSASDERKLSEVLYIQPNRGSVSDFGDLNAQSRQQEKLRYSIPKKRYENSYESADNDSVGHACKSRYTRGGSVVVV

Query:  AETNRRSSEWLDSGAIVNYKPLGLPVRSLRSNLTEPDDVEFECGDESCLSSKSSSKSSENNCERRSEFGDNCCVNLEEKFDEAVIASMSQFQLREKFGKK
         ETNR S     SG IVNYKPLGLPVRSL+S+LTE DDVEF+CGDESCLSSKSS KSSENNCE  SEFGDNCCVNLEEKFDE  IASMS FQLREKFGKK
Subjt:  AETNRRSSEWLDSGAIVNYKPLGLPVRSLRSNLTEPDDVEFECGDESCLSSKSSSKSSENNCERRSEFGDNCCVNLEEKFDEAVIASMSQFQLREKFGKK

Query:  VMRERGVGNAALRPSHFRPPSIDETQFESLKKSRSLRSPLSQSSQTSSFSSSLSPTTRKHRKMSSLGNIPYKSLHSRQYSMSSLSENSRGSSEDPLIEPE
        V+RERG GNA LRPSHFRPPSIDETQFESLKKS SL S LSQSSQTSS SSSLS TTRKH KMSSL NI YKSLHSRQYSMSSLSENSRGSSEDPLIE E
Subjt:  VMRERGVGNAALRPSHFRPPSIDETQFESLKKSRSLRSPLSQSSQTSSFSSSLSPTTRKHRKMSSLGNIPYKSLHSRQYSMSSLSENSRGSSEDPLIEPE

Query:  NSSECNESVLSSPNLDRNFASIPKALSRGKSVRTIRANAVSIEEMKAQEMYRNQVEHGENIGKKFAEGGESPYMREDGVGHGWSGIINPNAGNSNRLPKM
        NSSECNESV+SSP  D NF SIPKALS+GKS+R I+ANA +IE++KAQEM+R QV+H + IG KF EGG SPY+REDG GHGW  + NPNA N +R P  
Subjt:  NSSECNESVLSSPNLDRNFASIPKALSRGKSVRTIRANAVSIEEMKAQEMYRNQVEHGENIGKKFAEGGESPYMREDGVGHGWSGIINPNAGNSNRLPKM

Query:  TFSGIEEQKEETESLLTDDCKDNSEREDESIFASSDEEAASSMAGESESGAHEVDKKAGEFIAKFREQIQLQRMASVEKRLR--GGWGSFSSTSSSYFS
        TF GI+EQKEETESL+ DD KD+SE EDES FASSDEEAASSMAG+SESGA EVDKKAGEFIAKFREQIQLQRMASVEKRLR  GGWGSFSSTSSSYFS
Subjt:  TFSGIEEQKEETESLLTDDCKDNSEREDESIFASSDEEAASSMAGESESGAHEVDKKAGEFIAKFREQIQLQRMASVEKRLR--GGWGSFSSTSSSYFS

E5GCN2 Uncharacterized protein4.8e-27285.24Show/hide
Query:  MASSTSSPFTKPHFPHSPLPPTPTTHKRKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVNQTLLTKFWELFHLMFVGIAVSYGLFSRRNIQVSV--DEPR
        MASS S+PFTKPHFPHSPLPPT TT    SC  FLCKSLFFCIFLLLLPLFPSEAP+FVNQTLLTKFWELFHLMFVGIAVSYGLFSRRN+QVSV  DEPR
Subjt:  MASSTSSPFTKPHFPHSPLPPTPTTHKRKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVNQTLLTKFWELFHLMFVGIAVSYGLFSRRNIQVSV--DEPR

Query:  FSNFENPQSYLSKTFHVASIFEDVDDFSASDERKLSEVLYIQPNRGSVSDFGDLNAQSRQQEKLRYSIPKKRYENSYESADNDSVGHACKSRYTRGGSVV
        FSNFENPQSYLSK  HVASIFEDVDDFS SDERKLSEVLYIQPN GSV  F   NA SRQQE   YSIPKKRYENS E  D +SVGHACKSRYTRGGSVV
Subjt:  FSNFENPQSYLSKTFHVASIFEDVDDFSASDERKLSEVLYIQPNRGSVSDFGDLNAQSRQQEKLRYSIPKKRYENSYESADNDSVGHACKSRYTRGGSVV

Query:  VVAETNR-RSSEWLDSGAIVNYKPLGLPVRSLRSNLTEPDDVEFECGDESCLSSKSSSKSSENNCERRSEFGDNCCVNLEEKFDEAVIASMSQFQLREKF
        VVAETNR  S EWL+SGAIVNYKPLGLPVRSLRSNLTEPDDVEF+CGDESCLSSKSSSK+SE+NCER SEFGDNCCVNLEEKFDE VIA MS FQLRE F
Subjt:  VVAETNR-RSSEWLDSGAIVNYKPLGLPVRSLRSNLTEPDDVEFECGDESCLSSKSSSKSSENNCERRSEFGDNCCVNLEEKFDEAVIASMSQFQLREKF

Query:  GKKVMRERGVGNAALRPSHFRPPSIDETQFESLKKSRSLRSPLSQSSQTSSFSSSLSPTTRKHRKMSSLGNIPYKSLHSRQYSMSSLSENSRGSSEDPLI
        GK +MRERGV NA LRPSHFRP SIDETQFESLKKSRSL S LSQSSQTSS S SLS TTRKHRKMSSLGNI YKS HSRQYS+SSLSENSRGSSEDPLI
Subjt:  GKKVMRERGVGNAALRPSHFRPPSIDETQFESLKKSRSLRSPLSQSSQTSSFSSSLSPTTRKHRKMSSLGNIPYKSLHSRQYSMSSLSENSRGSSEDPLI

Query:  EPENSSECNESVLSSPNLDRNFASIPKALSRGKSVRTIRANAVSIEEMKAQEMYRNQVEHGENIGKKFAEGGESPYMREDGVGHGWSGIINPNAGNSNRL
        EPENSSECNES++SSP LDRNFA IPKALSRGKSVRTIRAN  +IEEMKAQEMYRNQVEH +N+G KF EGG SPYMREDG GHGW GI +PNAG SNR 
Subjt:  EPENSSECNESVLSSPNLDRNFASIPKALSRGKSVRTIRANAVSIEEMKAQEMYRNQVEHGENIGKKFAEGGESPYMREDGVGHGWSGIINPNAGNSNRL

Query:  PK-MTFSGIEEQKEETESLLTDD--CKDNSEREDESIFASSDEEAASSMAGESESGAHEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGWGSFSSTSSS
        PK  TFSGIEEQKE+ ES LTDD   +DNSERED S F SSDEEAASSMAGESESGA+EVDKKAGEFIAKFREQIQLQRMASV+KRLRGGWGSFSSTSSS
Subjt:  PK-MTFSGIEEQKEETESLLTDD--CKDNSEREDESIFASSDEEAASSMAGESESGAHEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGWGSFSSTSSS

Query:  YFS
        YFS
Subjt:  YFS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G60380.1 FUNCTIONS IN: molecular_function unknown1.0e-3535.04Show/hide
Query:  STSSPFTKPHFPHSPLPPTPTTHKRKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVNQTLLTKFWELFHLMFVGIAVSYGLFSRRNIQVSVDEPRFSNFE
        ++ +P+TK   P + + P    +K      F CKS+ F +FLL LPLFPS+APDFV +T+LTKFWEL HL+FVGIAV+YGLFSRRN++ +VD       E
Subjt:  STSSPFTKPHFPHSPLPPTPTTHKRKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVNQTLLTKFWELFHLMFVGIAVSYGLFSRRNIQVSVDEPRFSNFE

Query:  NPQSYLSKTFHVASIF-EDVDDFSA------SDERKLSEVLYIQPNRGSVSDFGDLNAQSRQQEKLRYSIPKKRYENSYESADNDSVGHACKSRYTRGGS
        +  SY+S+ F V+S+F E+ DD S       SDE   +    +  +   V + G+L                   E S E  + + V  A  S+Y +G S
Subjt:  NPQSYLSKTFHVASIF-EDVDDFSA------SDERKLSEVLYIQPNRGSVSDFGDLNAQSRQQEKLRYSIPKKRYENSYESADNDSVGHACKSRYTRGGS

Query:  VVVVAETNRRSSEWLDSGAIVNYKPLGLPVRSLRSNLTEPDDVEFECGDESCLSSKSSSKSSEN--NCERRSEFGDNCCVNLEEKFDEAVIASMS--QFQ
         VVVA    R +  LD   +  ++PLGLP+R LRS+L           D + L  KS + S +   N E  S   DN        FDE + A  S   +Q
Subjt:  VVVVAETNRRSSEWLDSGAIVNYKPLGLPVRSLRSNLTEPDDVEFECGDESCLSSKSSSKSSEN--NCERRSEFGDNCCVNLEEKFDEAVIASMS--QFQ

Query:  LREKFGKKVMRERGVGNAALRPSHFRPPSIDETQFESLKKSRSLRSPLSQSSQTSSFSSSLSPTTRKHRKMSSLGNIPYKSLHSR-QYSMSSLSENSRGS
         R +         G+G+    PS+F+P S+DET      KS S RS  S SSQTS  S       +   + S   ++  +SL+S  +  +   S  S   
Subjt:  LREKFGKKVMRERGVGNAALRPSHFRPPSIDETQFESLKKSRSLRSPLSQSSQTSSFSSSLSPTTRKHRKMSSLGNIPYKSLHSR-QYSMSSLSENSRGS

Query:  SEDPLIEPENS
        S  P + P  S
Subjt:  SEDPLIEPENS

AT3G60380.1 FUNCTIONS IN: molecular_function unknown2.6e-0440.96Show/hide
Query:  KEETESLLTDDCKDNSEREDESIFASSDEEAASSMAGESESGAHEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGWGSFSST
        K E E +  ++    +E++ E  F   +EEAA      +    +EVD+KAGEFIAKFREQI+LQ++ S E+   GG G F ++
Subjt:  KEETESLLTDDCKDNSEREDESIFASSDEEAASSMAGESESGAHEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGWGSFSST

AT4G16790.1 hydroxyproline-rich glycoprotein family protein2.9e-1138.02Show/hide
Query:  RKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVNQTLLTKFWELFHLMFVGIAVSYGLFSRRNI---------QVSVDEPRFSNFENPQSYLSKTFHVASI
        RK  ++F+ K+L   +   ++P+F S+ P+  NQT L    EL HL+FVGIAVSYGLFSRRN              ++   SN  N  SY+ K   V+S+
Subjt:  RKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVNQTLLTKFWELFHLMFVGIAVSYGLFSRRNI---------QVSVDEPRFSNFENPQSYLSKTFHVASI

Query:  F-------EDVDDFSASDERK
        F        +  D S+ D+RK
Subjt:  F-------EDVDDFSASDERK

AT4G16790.1 hydroxyproline-rich glycoprotein family protein5.5e-0236.49Show/hide
Query:  EQKEETESLLTDDCKDNSEREDESIFASSDEEAASSMAGESE-SGAHEVDKKAGEFIAKFREQIQLQRMASVEK
        +Q+    S   ++ ++  +R  E+      E+      G SE +   +VDKKA EFIAKFREQI+LQR+ S+++
Subjt:  EQKEETESLLTDDCKDNSEREDESIFASSDEEAASSMAGESE-SGAHEVDKKAGEFIAKFREQIQLQRMASVEK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTCTTCAACTTCCAGCCCTTTCACCAAGCCCCATTTTCCACATTCTCCACTTCCACCAACACCTACTACTCACAAGCGCAAGTCCTGCGCACAATTTCTCTGTAA
ATCCCTCTTCTTCTGCATTTTCCTCCTCCTTCTCCCTCTCTTCCCTTCCGAAGCTCCAGATTTCGTCAATCAGACTTTGCTCACCAAATTCTGGGAGCTTTTTCACCTCA
TGTTCGTCGGCATTGCTGTTTCCTATGGTCTCTTTAGCAGAAGGAACATCCAGGTGAGTGTTGACGAACCTCGCTTCTCCAATTTTGAAAATCCGCAGTCCTATTTGTCT
AAGACGTTTCACGTCGCGTCCATTTTTGAAGATGTTGACGATTTCAGTGCTTCTGATGAGAGGAAACTTAGTGAAGTTTTGTACATTCAGCCGAATCGTGGATCCGTGAG
TGATTTTGGGGATTTGAATGCCCAATCTCGCCAACAGGAAAAACTCCGTTACTCCATACCCAAAAAGAGGTATGAAAACTCTTATGAATCTGCTGATAATGATAGTGTTG
GTCATGCTTGTAAATCGAGATATACTCGTGGTGGATCTGTTGTGGTAGTTGCTGAAACAAATCGTAGGTCTAGTGAATGGTTGGATTCAGGGGCCATTGTAAATTATAAA
CCTCTAGGTTTGCCTGTTAGGAGTTTGAGGTCGAATCTTACTGAACCCGACGATGTTGAATTTGAATGTGGTGATGAATCTTGTTTGAGTTCTAAAAGTTCATCCAAGAG
CTCTGAGAATAATTGTGAAAGAAGAAGTGAATTTGGTGATAACTGTTGTGTGAATTTGGAGGAGAAGTTTGATGAAGCTGTTATTGCGTCAATGTCCCAATTTCAATTGC
GTGAGAAATTTGGAAAGAAGGTGATGAGAGAGAGAGGAGTTGGGAATGCTGCTCTTCGTCCTTCCCATTTTAGACCTCCCTCCATTGATGAAACTCAATTTGAATCACTG
AAAAAATCAAGGTCTCTTCGTTCTCCTCTATCTCAGTCATCACAAACTAGTTCCTTCTCTTCTTCGTTGTCACCAACAACAAGAAAGCACCGTAAAATGTCGTCACTTGG
TAACATTCCATATAAATCATTGCATTCTCGACAATACAGTATGAGTTCTCTGTCTGAAAACAGTAGAGGGAGCTCTGAAGACCCTCTGATTGAACCAGAAAATTCATCTG
AGTGCAATGAATCCGTCTTAAGTTCCCCAAATTTGGACAGGAATTTCGCAAGTATTCCGAAAGCTTTATCCCGAGGAAAATCCGTTAGAACAATTAGAGCAAATGCAGTT
TCCATAGAGGAAATGAAAGCTCAAGAGATGTATAGAAACCAAGTTGAACATGGTGAAAATATAGGGAAGAAGTTTGCAGAAGGTGGAGAGTCACCATATATGAGAGAAGA
TGGAGTGGGGCATGGATGGTCTGGTATTATTAACCCGAATGCTGGTAATTCAAATCGCTTGCCGAAGATGACGTTCTCAGGGATTGAGGAGCAGAAGGAAGAGACTGAGA
GTCTCCTGACAGATGATTGTAAAGATAACTCTGAGAGGGAGGATGAAAGTATTTTTGCAAGTTCAGATGAAGAAGCTGCTTCAAGTATGGCGGGGGAGTCGGAATCGGGG
GCTCACGAGGTCGACAAGAAAGCTGGTGAGTTCATAGCCAAGTTCAGGGAGCAAATACAGCTTCAGAGGATGGCTTCAGTAGAAAAAAGATTGAGAGGAGGATGGGGGTC
ATTCAGCAGCACAAGCAGCAGCTATTTCAGTTTGAAGGCCAAAGTACAGTCAAAGAAAGGGAAAAGACTCCCTGCATTCTCCTCCTCGGCCTCGGCCCATTACCGAGGTC
GAGGAGAGGGTCGCCTCGGCCTAATGTTGAGACTGACCAGCAATGTGGGTCCAAGCCCATTGTGCATTGTTCTGTCGCCGTCTCGTAGAGCTTCGAGGAAAACCCTAGCA
GGGGAAGGAATGAATACGCACTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGTCTTCAACTTCCAGCCCTTTCACCAAGCCCCATTTTCCACATTCTCCACTTCCACCAACACCTACTACTCACAAGCGCAAGTCCTGCGCACAATTTCTCTGTAA
ATCCCTCTTCTTCTGCATTTTCCTCCTCCTTCTCCCTCTCTTCCCTTCCGAAGCTCCAGATTTCGTCAATCAGACTTTGCTCACCAAATTCTGGGAGCTTTTTCACCTCA
TGTTCGTCGGCATTGCTGTTTCCTATGGTCTCTTTAGCAGAAGGAACATCCAGGTGAGTGTTGACGAACCTCGCTTCTCCAATTTTGAAAATCCGCAGTCCTATTTGTCT
AAGACGTTTCACGTCGCGTCCATTTTTGAAGATGTTGACGATTTCAGTGCTTCTGATGAGAGGAAACTTAGTGAAGTTTTGTACATTCAGCCGAATCGTGGATCCGTGAG
TGATTTTGGGGATTTGAATGCCCAATCTCGCCAACAGGAAAAACTCCGTTACTCCATACCCAAAAAGAGGTATGAAAACTCTTATGAATCTGCTGATAATGATAGTGTTG
GTCATGCTTGTAAATCGAGATATACTCGTGGTGGATCTGTTGTGGTAGTTGCTGAAACAAATCGTAGGTCTAGTGAATGGTTGGATTCAGGGGCCATTGTAAATTATAAA
CCTCTAGGTTTGCCTGTTAGGAGTTTGAGGTCGAATCTTACTGAACCCGACGATGTTGAATTTGAATGTGGTGATGAATCTTGTTTGAGTTCTAAAAGTTCATCCAAGAG
CTCTGAGAATAATTGTGAAAGAAGAAGTGAATTTGGTGATAACTGTTGTGTGAATTTGGAGGAGAAGTTTGATGAAGCTGTTATTGCGTCAATGTCCCAATTTCAATTGC
GTGAGAAATTTGGAAAGAAGGTGATGAGAGAGAGAGGAGTTGGGAATGCTGCTCTTCGTCCTTCCCATTTTAGACCTCCCTCCATTGATGAAACTCAATTTGAATCACTG
AAAAAATCAAGGTCTCTTCGTTCTCCTCTATCTCAGTCATCACAAACTAGTTCCTTCTCTTCTTCGTTGTCACCAACAACAAGAAAGCACCGTAAAATGTCGTCACTTGG
TAACATTCCATATAAATCATTGCATTCTCGACAATACAGTATGAGTTCTCTGTCTGAAAACAGTAGAGGGAGCTCTGAAGACCCTCTGATTGAACCAGAAAATTCATCTG
AGTGCAATGAATCCGTCTTAAGTTCCCCAAATTTGGACAGGAATTTCGCAAGTATTCCGAAAGCTTTATCCCGAGGAAAATCCGTTAGAACAATTAGAGCAAATGCAGTT
TCCATAGAGGAAATGAAAGCTCAAGAGATGTATAGAAACCAAGTTGAACATGGTGAAAATATAGGGAAGAAGTTTGCAGAAGGTGGAGAGTCACCATATATGAGAGAAGA
TGGAGTGGGGCATGGATGGTCTGGTATTATTAACCCGAATGCTGGTAATTCAAATCGCTTGCCGAAGATGACGTTCTCAGGGATTGAGGAGCAGAAGGAAGAGACTGAGA
GTCTCCTGACAGATGATTGTAAAGATAACTCTGAGAGGGAGGATGAAAGTATTTTTGCAAGTTCAGATGAAGAAGCTGCTTCAAGTATGGCGGGGGAGTCGGAATCGGGG
GCTCACGAGGTCGACAAGAAAGCTGGTGAGTTCATAGCCAAGTTCAGGGAGCAAATACAGCTTCAGAGGATGGCTTCAGTAGAAAAAAGATTGAGAGGAGGATGGGGGTC
ATTCAGCAGCACAAGCAGCAGCTATTTCAGTTTGAAGGCCAAAGTACAGTCAAAGAAAGGGAAAAGACTCCCTGCATTCTCCTCCTCGGCCTCGGCCCATTACCGAGGTC
GAGGAGAGGGTCGCCTCGGCCTAATGTTGAGACTGACCAGCAATGTGGGTCCAAGCCCATTGTGCATTGTTCTGTCGCCGTCTCGTAGAGCTTCGAGGAAAACCCTAGCA
GGGGAAGGAATGAATACGCACTGA
Protein sequenceShow/hide protein sequence
MASSTSSPFTKPHFPHSPLPPTPTTHKRKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVNQTLLTKFWELFHLMFVGIAVSYGLFSRRNIQVSVDEPRFSNFENPQSYLS
KTFHVASIFEDVDDFSASDERKLSEVLYIQPNRGSVSDFGDLNAQSRQQEKLRYSIPKKRYENSYESADNDSVGHACKSRYTRGGSVVVVAETNRRSSEWLDSGAIVNYK
PLGLPVRSLRSNLTEPDDVEFECGDESCLSSKSSSKSSENNCERRSEFGDNCCVNLEEKFDEAVIASMSQFQLREKFGKKVMRERGVGNAALRPSHFRPPSIDETQFESL
KKSRSLRSPLSQSSQTSSFSSSLSPTTRKHRKMSSLGNIPYKSLHSRQYSMSSLSENSRGSSEDPLIEPENSSECNESVLSSPNLDRNFASIPKALSRGKSVRTIRANAV
SIEEMKAQEMYRNQVEHGENIGKKFAEGGESPYMREDGVGHGWSGIINPNAGNSNRLPKMTFSGIEEQKEETESLLTDDCKDNSEREDESIFASSDEEAASSMAGESESG
AHEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGWGSFSSTSSSYFSLKAKVQSKKGKRLPAFSSSASAHYRGRGEGRLGLMLRLTSNVGPSPLCIVLSPSRRASRKTLA
GEGMNTH