CuGenDBv2

Lag0015424 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0015424
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: DUF761 domain-containing protein
Genome location: chr12:12830380..12832173
RNA-Seq Expression: Lag0015424
Synteny: Lag0015424
Gene Ontology terms: GO:0016020 - membrane (cellular component)
InterPro domains: IPR008480 - Protein of unknown function DUF761, plant
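The genome location above is given as a chromosome followed by a start and an end coordinate. As a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for this `chrom:start..end` notation, but not something the page states), the length of the span could be computed as below. The helpers `parse_location` and `span_length` are hypothetical names used for illustration; they are not part of any CuGenDB tool.

```python
import re

def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

def span_length(location):
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    _, start, end = parse_location(location)
    return end - start + 1

# Location string copied from the record above.
print(span_length("chr12:12830380..12832173"))  # prints 1794
```

Under that assumption the gene spans 1794 bp on chr12.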


Homology
GenBank top hits (e-value, % identity, alignment)
ADN34231.1 hypothetical protein [Cucumis melo subsp. melo] (e-value: 4.0e-272; identity: 85.41%)
Query:  MASSTSSPFTKPHFPHSPLPPTPTTHQRKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVNQTLLTKFWELFHLMFVGIAVSYGLFSRRNIQVSV--DEPR
        MASS S+PFTKPHFPHSPLPPT TT    SC  FLCKSLFFCIFLLLLPLFPSEAP+FVNQTLLTKFWELFHLMFVGIAVSYGLFSRRN+QVSV  DEPR
Subjt:  MASSTSSPFTKPHFPHSPLPPTPTTHQRKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVNQTLLTKFWELFHLMFVGIAVSYGLFSRRNIQVSV--DEPR

Query:  FSNFENPQSYLSKTFHVASIFEDVDDFSASDERKLSEVLYIQPNRGSVSDFGDLNAQSRQQEKLRYSIPKKRYENSYESADNDSVGHACKSRYTRGGSVV
        FSNFENPQSYLSK  HVASIFEDVDDFS SDERKLSEVLYIQPN GSV  F   NA SRQQE   YSIPKKRYENS E  D +SVGHACKSRYTRGGSVV
Subjt:  FSNFENPQSYLSKTFHVASIFEDVDDFSASDERKLSEVLYIQPNRGSVSDFGDLNAQSRQQEKLRYSIPKKRYENSYESADNDSVGHACKSRYTRGGSVV

Query:  VVAETNR-RSSEWLDSGAIVNYKPLGLPVRSLRSNLTEPDDVEFECGDESCLSSKSSSKSSENNCERRSEFGDNCCVNLEEKFDEAVIASMSQFQLREKF
        VVAETNR  S EWL+SGAIVNYKPLGLPVRSLRSNLTEPDDVEF+CGDESCLSSKSSSK+SE+NCER SEFGDNCCVNLEEKFDE VIA MS FQLRE F
Subjt:  VVAETNR-RSSEWLDSGAIVNYKPLGLPVRSLRSNLTEPDDVEFECGDESCLSSKSSSKSSENNCERRSEFGDNCCVNLEEKFDEAVIASMSQFQLREKF

Query:  GKKVTRERGVGNAALRPSHFRPPSIDETQFESLKKSRSLRSPLSQSSQTSSFSSSLSPTTRKHRKMSSLGNIPYKSLHSRQYSMSSLSENSRGSSEDPLI
        GK + RERGV NA LRPSHFRP SIDETQFESLKKSRSL S LSQSSQTSS S SLS TTRKHRKMSSLGNI YKS HSRQYS+SSLSENSRGSSEDPLI
Subjt:  GKKVTRERGVGNAALRPSHFRPPSIDETQFESLKKSRSLRSPLSQSSQTSSFSSSLSPTTRKHRKMSSLGNIPYKSLHSRQYSMSSLSENSRGSSEDPLI

Query:  EPENSSECNESILSSPHLDRNFASIPKALSRGKSVRTIRTNAVSIEEMKAQEMYRNQVEHGENIGKKFAEGGESPYMREDGVGHGWSGINNPNAGNSNRL
        EPENSSECNESI+SSP LDRNFA IPKALSRGKSVRTIR N  +IEEMKAQEMYRNQVEH +N+G KF EGG SPYMREDG GHGW GIN+PNAG SNR 
Subjt:  EPENSSECNESILSSPHLDRNFASIPKALSRGKSVRTIRTNAVSIEEMKAQEMYRNQVEHGENIGKKFAEGGESPYMREDGVGHGWSGINNPNAGNSNRL

Query:  PK-TTFSGIEEQKEETESLLTDD--CKDNSEREDESIFASSDEEAASSMAGESESGAHEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGWGSFSSTSSS
        PK TTFSGIEEQKE+ ES LTDD   +DNSERED S F SSDEEAASSMAGESESGA+EVDKKAGEFIAKFREQIQLQRMASV+KRLRGGWGSFSSTSSS
Subjt:  PK-TTFSGIEEQKEETESLLTDD--CKDNSEREDESIFASSDEEAASSMAGESESGAHEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGWGSFSSTSSS

Query:  YFS
        YFS
Subjt:  YFS

KAG6575261.1 hypothetical protein SDJN03_25900, partial [Cucurbita argyrosperma subsp. sororia] (e-value: 5.6e-258; identity: 81.53%)
Query:  MASSTSSPFTKPHFPHSPLPPTPTTHQRKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVNQTLLTKFWELFHLMFVGIAVSYGLFSRRNIQVSVDEPRFS
        MASS SSPFTK HFPHSPLP  P      SCAQFLCKS+FFC FLLLLPLFPSEAPDFV+QTL TKFWELFHLMFVGIAVSYGLFS RN Q++VDEPR+S
Subjt:  MASSTSSPFTKPHFPHSPLPPTPTTHQRKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVNQTLLTKFWELFHLMFVGIAVSYGLFSRRNIQVSVDEPRFS

Query:  NFENPQSYLSKTFHVASIFEDVDDFSASDERKLSEVLYIQPNRGSVSDFGDLNAQSRQQEKLRYSIPKKRYENSYESADNDSVGHACKSRYTRGGSVVVV
        +FENPQSYLSK  +VASIF+DVDDF  SDERK+SEVLYIQP  GS S   DLNAQSR QEKLRYS+PKKRYENSYE AD D+V HACKSRYTRGGSVVVV
Subjt:  NFENPQSYLSKTFHVASIFEDVDDFSASDERKLSEVLYIQPNRGSVSDFGDLNAQSRQQEKLRYSIPKKRYENSYESADNDSVGHACKSRYTRGGSVVVV

Query:  AETNRRSSEWLDSGAIVNYKPLGLPVRSLRSNLTEPDDVEFECGDESCLSSKSSSKSSENNCERRSEFGDNCCVNLEEKFDEAVIASMSQFQLREKFGKK
         ETNR S     SG IVNYKPLGLPVRSLRS+LTE DDVEF+CGDESCLSSKSS KSSENNCE  SEFGDNCCVNLEEKFDE  IASMS FQLREKFGKK
Subjt:  AETNRRSSEWLDSGAIVNYKPLGLPVRSLRSNLTEPDDVEFECGDESCLSSKSSSKSSENNCERRSEFGDNCCVNLEEKFDEAVIASMSQFQLREKFGKK

Query:  VTRERGVGNAALRPSHFRPPSIDETQFESLKKSRSLRSPLSQSSQTSSFSSSLSPTTRKHRKMSSLGNIPYKSLHSRQYSMSSLSENSRGSSEDPLIEPE
        V RERG GNA LRPSHFRPPSIDETQFESL+KS SL S LSQSSQTSS SS LS TTRKH KMSSL NI YKSLHSRQYSMSSLSENSRGSSEDPLIE E
Subjt:  VTRERGVGNAALRPSHFRPPSIDETQFESLKKSRSLRSPLSQSSQTSSFSSSLSPTTRKHRKMSSLGNIPYKSLHSRQYSMSSLSENSRGSSEDPLIEPE

Query:  NSSECNESILSSPHLDRNFASIPKALSRGKSVRTIRTNAVSIEEMKAQEMYRNQVEHGENIGKKFAEGGES-PYMREDGVGHGWSGINNPNAGNSNRLPK
        NSSECNES++SSP  DRNFASIPKALS+GKSVR IR NA +IE+MKAQEM+R QV+H + IG KF EGG S PYMREDG GHGW  + NPNAGN NR PK
Subjt:  NSSECNESILSSPHLDRNFASIPKALSRGKSVRTIRTNAVSIEEMKAQEMYRNQVEHGENIGKKFAEGGES-PYMREDGVGHGWSGINNPNAGNSNRLPK

Query:  TTFSGIEEQKEETESLLTDDCKDNSEREDESIFASSDEEAASSMAGESESGAHEVDKKAGEFIAKFREQIQLQRMASVEKRLR---GGWGSFSSTSSSYF
        TTF GI+EQKEETESL+ DD KD SE EDES+FASSDEEA SSMAG+SESGA EVDKKAGEFIAKFREQIQLQRMASVEKRLR   GGWGSFSSTSSSYF
Subjt:  TTFSGIEEQKEETESLLTDDCKDNSEREDESIFASSDEEAASSMAGESESGAHEVDKKAGEFIAKFREQIQLQRMASVEKRLR---GGWGSFSSTSSSYF

Query:  S
        S
Subjt:  S

KAG7013816.1 hypothetical protein SDJN02_23985, partial [Cucurbita argyrosperma subsp. argyrosperma] (e-value: 2.1e-257; identity: 81.53%)
Query:  MASSTSSPFTKPHFPHSPLPPTPTTHQRKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVNQTLLTKFWELFHLMFVGIAVSYGLFSRRNIQVSVDEPRFS
        MASS SSPFTK HFPHSPLP  P      SCAQFLCKS+FFC FLLLLPLFPSEAPDFV+QTL TKFWELFHLMFVGIAVSYGLFS RN Q++VDEPR+S
Subjt:  MASSTSSPFTKPHFPHSPLPPTPTTHQRKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVNQTLLTKFWELFHLMFVGIAVSYGLFSRRNIQVSVDEPRFS

Query:  NFENPQSYLSKTFHVASIFEDVDDFSASDERKLSEVLYIQPNRGSVSDFGDLNAQSRQQEKLRYSIPKKRYENSYESADNDSVGHACKSRYTRGGSVVVV
        +FENPQSYLSK  +VASIF+DVDDF  SDERK+SEVLYIQP  GS S   DLNAQSR QEKLRYS+PKKRYENSYE AD D+V HACKSRYTRGGSVVVV
Subjt:  NFENPQSYLSKTFHVASIFEDVDDFSASDERKLSEVLYIQPNRGSVSDFGDLNAQSRQQEKLRYSIPKKRYENSYESADNDSVGHACKSRYTRGGSVVVV

Query:  AETNRRSSEWLDSGAIVNYKPLGLPVRSLRSNLTEPDDVEFECGDESCLSSKSSSKSSENNCERRSEFGDNCCVNLEEKFDEAVIASMSQFQLREKFGKK
         ETNR S     SG IVNYKPLGLPVRSLRS+LTE DDVEF+CGDESCLSSKSS KSSENNCE  SEFGDNCCVNLEEKFDE  IASMS FQLREKFGKK
Subjt:  AETNRRSSEWLDSGAIVNYKPLGLPVRSLRSNLTEPDDVEFECGDESCLSSKSSSKSSENNCERRSEFGDNCCVNLEEKFDEAVIASMSQFQLREKFGKK

Query:  VTRERGVGNAALRPSHFRPPSIDETQFESLKKSRSLRSPLSQSSQTSSFSSSLSPTTRKHRKMSSLGNIPYKSLHSRQYSMSSLSENSRGSSEDPLIEPE
        V RERG GNA LRPSHFRPPSIDETQFESL+KS SL S LSQSSQTSS SS LS TTRKH KMSSL NI YKSLHSRQYSMSSLSENSRGSSEDPLIE E
Subjt:  VTRERGVGNAALRPSHFRPPSIDETQFESLKKSRSLRSPLSQSSQTSSFSSSLSPTTRKHRKMSSLGNIPYKSLHSRQYSMSSLSENSRGSSEDPLIEPE

Query:  NSSECNESILSSPHLDRNFASIPKALSRGKSVRTIRTNAVSIEEMKAQEMYRNQVEHGENIGKKFAEGGES-PYMREDGVGHGWSGINNPNAGNSNRLPK
        NSSECNES++SSP  DRNFASIPKALS+GKSVR IR NA +IE+MKAQEM+R QV+H + IG KF EGG S PYMREDG GHGW  + NPNAGN NR PK
Subjt:  NSSECNESILSSPHLDRNFASIPKALSRGKSVRTIRTNAVSIEEMKAQEMYRNQVEHGENIGKKFAEGGES-PYMREDGVGHGWSGINNPNAGNSNRLPK

Query:  TTFSGIEEQKEETESLLTDDCKDNSEREDESIFASSDEEAASSMAGESESGAHEVDKKAGEFIAKFREQIQLQRMASVEKRLR---GGWGSFSSTSSSYF
        TTF GI+EQKEETESL+ DD KD SE EDES FASSDEEA SSMAG+SESGA EVDKKAGEFIAKFREQIQLQRMASVEKRLR   GGWGSFSSTSSSYF
Subjt:  TTFSGIEEQKEETESLLTDDCKDNSEREDESIFASSDEEAASSMAGESESGAHEVDKKAGEFIAKFREQIQLQRMASVEKRLR---GGWGSFSSTSSSYF

Query:  S
        S
Subjt:  S

XP_004140631.1 uncharacterized protein LOC101220435 [Cucumis sativus] (e-value: 1.6e-265; identity: 82.62%)
Query:  MASSTSSPFTKPHFPHSPLPPTPTTHQRKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVNQTLLTKFWELFHLMFVGIAVSYGLFSRRNIQVSV--DEPR
        MA S S+PFTKPHFPHSPLPPT TT    SC QF+CKSLFFCIFLLLLPLFPSEAP+FVNQT LTKFWELFHLMF+GIAVSYGLFSRRN+QVSV  DEPR
Subjt:  MASSTSSPFTKPHFPHSPLPPTPTTHQRKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVNQTLLTKFWELFHLMFVGIAVSYGLFSRRNIQVSV--DEPR

Query:  FSNFENPQSYLSKTFHVASIFEDVDDFSASDERKLSEVLYIQPNRGSVSDFGDLNAQSRQQEKLRYSIPKKRYENSYESADNDSVGHACKSRYTRGGSVV
        FSNFENPQSYLSK FHVASIFEDVDDFS SDERKLSEVLYIQPN GSVS    LNA SRQQE   YSIPKKRYENS E A+ D+VGHACKSRYTRGGSVV
Subjt:  FSNFENPQSYLSKTFHVASIFEDVDDFSASDERKLSEVLYIQPNRGSVSDFGDLNAQSRQQEKLRYSIPKKRYENSYESADNDSVGHACKSRYTRGGSVV

Query:  VVAETNR-RSSEWLDSGAIVNYKPLGLPVRSLRSNLTEPDDVEFECGDESCLSSKSSSKSSENNCERRSEFGDNCCVNLEEKFDEAVIASMSQFQLREKF
        VVAETNR  S EWL+SGAIVNYKPLGLPVRSL+S+LTEPDDVEF+CGDESCLSSKSSSK+SE+NCER SEFGDNCCVNLEEKFDE VIASMS FQLREKF
Subjt:  VVAETNR-RSSEWLDSGAIVNYKPLGLPVRSLRSNLTEPDDVEFECGDESCLSSKSSSKSSENNCERRSEFGDNCCVNLEEKFDEAVIASMSQFQLREKF

Query:  GKKVTRERGVGNAALRPSHFRPPSIDETQFESLKKSRSLRSPLSQSSQTSSFSSSLSPTTRKHRKMSSLGNIPYKSLHSRQYSMSSLSENSRGSSEDPLI
         K + RER V NA LRPSHFRP SIDETQFESLKKS SL S LSQSSQTSS SS LS  TRKHRKMSSLGNI YKS HSRQYS+SSLSENSRGSSEDPLI
Subjt:  GKKVTRERGVGNAALRPSHFRPPSIDETQFESLKKSRSLRSPLSQSSQTSSFSSSLSPTTRKHRKMSSLGNIPYKSLHSRQYSMSSLSENSRGSSEDPLI

Query:  EPENSSECNESILSSPHLDRNFASIPKALSRGKSVRTIRTNAVSIEEMKAQEMYRNQVEHGENIGKKFAEGGESPYMREDGVGHGWSGINNPNAGNSNRL
        +PENSSECNES++SSP LDRNFA+ PKALSRGKSVRT+R +  +IEEMKAQEMYRNQVEH +N+  KF EGG SPYMRED  GHGW GINN NA  SNR 
Subjt:  EPENSSECNESILSSPHLDRNFASIPKALSRGKSVRTIRTNAVSIEEMKAQEMYRNQVEHGENIGKKFAEGGESPYMREDGVGHGWSGINNPNAGNSNRL

Query:  PK----TTFSGIEEQKEETESLLTDDCKDNSEREDESIFASSDEEAASSMAGESESGAHEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGWGSFSSTSS
         K    TTFSGIEEQKE+TES +TDD KDNSERED+S F SSDEEAA SM G+SESGAHEVDKKAGEFIAKFREQIQLQRMASV+KRLRGGWGSFSST+S
Subjt:  PK----TTFSGIEEQKEETESLLTDDCKDNSEREDESIFASSDEEAASSMAGESESGAHEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGWGSFSSTSS

Query:  SYFS
        SYFS
Subjt:  SYFS

XP_023006022.1 uncharacterized protein LOC111498900 [Cucurbita maxima] (e-value: 7.1e-261; identity: 81.8%)
Query:  MASSTSSPFTKPHFPHSPLPPTPTTHQRKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVNQTLLTKFWELFHLMFVGIAVSYGLFSRRNIQVSVDEPRFS
        MASS SSPFTK HFPHSPLP  P TH   SCAQFLCKSLFFC FLLLLPLFPSEAPDFV+QTL TKFWELFHLM VGIAVSYGLFS RN Q++VDEPR+S
Subjt:  MASSTSSPFTKPHFPHSPLPPTPTTHQRKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVNQTLLTKFWELFHLMFVGIAVSYGLFSRRNIQVSVDEPRFS

Query:  NFENPQSYLSKTFHVASIFEDVDDFSASDERKLSEVLYIQPNRGSVSDFGDLNAQSRQQEKLRYSIPKKRYENSYESADNDSVGHACKSRYTRGGSVVVV
        +FENPQSYLSK  +VASIF+DVDDFS SDERKLSEVLYIQPN GS S   DLNAQSRQQEKLRYSIPKKRYENSYE AD D+V HACKSRYTRGGSVVVV
Subjt:  NFENPQSYLSKTFHVASIFEDVDDFSASDERKLSEVLYIQPNRGSVSDFGDLNAQSRQQEKLRYSIPKKRYENSYESADNDSVGHACKSRYTRGGSVVVV

Query:  AETNRRSSEWLDSGAIVNYKPLGLPVRSLRSNLTEPDDVEFECGDESCLSSKSSSKSSENNCERRSEFGDNCCVNLEEKFDEAVIASMSQFQLREKFGKK
         ETNR S     SG IVNYKPLGLPVRSL+S+LTE DDVEF+CGDESCLSSKSS KSSENNCE  SEFGDNCCVNLEEKFDE  IASMS FQLREKFGKK
Subjt:  AETNRRSSEWLDSGAIVNYKPLGLPVRSLRSNLTEPDDVEFECGDESCLSSKSSSKSSENNCERRSEFGDNCCVNLEEKFDEAVIASMSQFQLREKFGKK

Query:  VTRERGVGNAALRPSHFRPPSIDETQFESLKKSRSLRSPLSQSSQTSSFSSSLSPTTRKHRKMSSLGNIPYKSLHSRQYSMSSLSENSRGSSEDPLIEPE
        V RERG GNA LRPSHFRPPSIDETQFESLKKS SL S LSQSSQTSS SSSLS TTRKH KMSSL NI YKSLHSRQYSMSSLSENSRGSSEDPLIE E
Subjt:  VTRERGVGNAALRPSHFRPPSIDETQFESLKKSRSLRSPLSQSSQTSSFSSSLSPTTRKHRKMSSLGNIPYKSLHSRQYSMSSLSENSRGSSEDPLIEPE

Query:  NSSECNESILSSPHLDRNFASIPKALSRGKSVRTIRTNAVSIEEMKAQEMYRNQVEHGENIGKKFAEGGESPYMREDGVGHGWSGINNPNAGNSNRLPKT
        NSSECNES++SSP  D NF SIPKALS+GKS+R I+ NA +IE++KAQEM+R QV+H + IG KF EGG SPY+REDG GHGW  + NPNA N +R P T
Subjt:  NSSECNESILSSPHLDRNFASIPKALSRGKSVRTIRTNAVSIEEMKAQEMYRNQVEHGENIGKKFAEGGESPYMREDGVGHGWSGINNPNAGNSNRLPKT

Query:  TFSGIEEQKEETESLLTDDCKDNSEREDESIFASSDEEAASSMAGESESGAHEVDKKAGEFIAKFREQIQLQRMASVEKRLR--GGWGSFSSTSSSYFS
        TF GI+EQKEETESL+ DD KD+SE EDES FASSDEEAASSMAG+SESGA EVDKKAGEFIAKFREQIQLQRMASVEKRLR  GGWGSFSSTSSSYFS
Subjt:  TFSGIEEQKEETESLLTDDCKDNSEREDESIFASSDEEAASSMAGESESGAHEVDKKAGEFIAKFREQIQLQRMASVEKRLR--GGWGSFSSTSSSYFS

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0K9X1 Uncharacterized protein (e-value: 7.9e-266; identity: 82.62%)
Query:  MASSTSSPFTKPHFPHSPLPPTPTTHQRKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVNQTLLTKFWELFHLMFVGIAVSYGLFSRRNIQVSV--DEPR
        MA S S+PFTKPHFPHSPLPPT TT    SC QF+CKSLFFCIFLLLLPLFPSEAP+FVNQT LTKFWELFHLMF+GIAVSYGLFSRRN+QVSV  DEPR
Subjt:  MASSTSSPFTKPHFPHSPLPPTPTTHQRKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVNQTLLTKFWELFHLMFVGIAVSYGLFSRRNIQVSV--DEPR

Query:  FSNFENPQSYLSKTFHVASIFEDVDDFSASDERKLSEVLYIQPNRGSVSDFGDLNAQSRQQEKLRYSIPKKRYENSYESADNDSVGHACKSRYTRGGSVV
        FSNFENPQSYLSK FHVASIFEDVDDFS SDERKLSEVLYIQPN GSVS    LNA SRQQE   YSIPKKRYENS E A+ D+VGHACKSRYTRGGSVV
Subjt:  FSNFENPQSYLSKTFHVASIFEDVDDFSASDERKLSEVLYIQPNRGSVSDFGDLNAQSRQQEKLRYSIPKKRYENSYESADNDSVGHACKSRYTRGGSVV

Query:  VVAETNR-RSSEWLDSGAIVNYKPLGLPVRSLRSNLTEPDDVEFECGDESCLSSKSSSKSSENNCERRSEFGDNCCVNLEEKFDEAVIASMSQFQLREKF
        VVAETNR  S EWL+SGAIVNYKPLGLPVRSL+S+LTEPDDVEF+CGDESCLSSKSSSK+SE+NCER SEFGDNCCVNLEEKFDE VIASMS FQLREKF
Subjt:  VVAETNR-RSSEWLDSGAIVNYKPLGLPVRSLRSNLTEPDDVEFECGDESCLSSKSSSKSSENNCERRSEFGDNCCVNLEEKFDEAVIASMSQFQLREKF

Query:  GKKVTRERGVGNAALRPSHFRPPSIDETQFESLKKSRSLRSPLSQSSQTSSFSSSLSPTTRKHRKMSSLGNIPYKSLHSRQYSMSSLSENSRGSSEDPLI
         K + RER V NA LRPSHFRP SIDETQFESLKKS SL S LSQSSQTSS SS LS  TRKHRKMSSLGNI YKS HSRQYS+SSLSENSRGSSEDPLI
Subjt:  GKKVTRERGVGNAALRPSHFRPPSIDETQFESLKKSRSLRSPLSQSSQTSSFSSSLSPTTRKHRKMSSLGNIPYKSLHSRQYSMSSLSENSRGSSEDPLI

Query:  EPENSSECNESILSSPHLDRNFASIPKALSRGKSVRTIRTNAVSIEEMKAQEMYRNQVEHGENIGKKFAEGGESPYMREDGVGHGWSGINNPNAGNSNRL
        +PENSSECNES++SSP LDRNFA+ PKALSRGKSVRT+R +  +IEEMKAQEMYRNQVEH +N+  KF EGG SPYMRED  GHGW GINN NA  SNR 
Subjt:  EPENSSECNESILSSPHLDRNFASIPKALSRGKSVRTIRTNAVSIEEMKAQEMYRNQVEHGENIGKKFAEGGESPYMREDGVGHGWSGINNPNAGNSNRL

Query:  PK----TTFSGIEEQKEETESLLTDDCKDNSEREDESIFASSDEEAASSMAGESESGAHEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGWGSFSSTSS
         K    TTFSGIEEQKE+TES +TDD KDNSERED+S F SSDEEAA SM G+SESGAHEVDKKAGEFIAKFREQIQLQRMASV+KRLRGGWGSFSST+S
Subjt:  PK----TTFSGIEEQKEETESLLTDDCKDNSEREDESIFASSDEEAASSMAGESESGAHEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGWGSFSSTSS

Query:  SYFS
        SYFS
Subjt:  SYFS

A0A5D3DMA5 DUF761 domain-containing protein (e-value: 1.9e-272; identity: 85.41%)
Query:  MASSTSSPFTKPHFPHSPLPPTPTTHQRKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVNQTLLTKFWELFHLMFVGIAVSYGLFSRRNIQVSV--DEPR
        MASS S+PFTKPHFPHSPLPPT TT    SC  FLCKSLFFCIFLLLLPLFPSEAP+FVNQTLLTKFWELFHLMFVGIAVSYGLFSRRN+QVSV  DEPR
Subjt:  MASSTSSPFTKPHFPHSPLPPTPTTHQRKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVNQTLLTKFWELFHLMFVGIAVSYGLFSRRNIQVSV--DEPR

Query:  FSNFENPQSYLSKTFHVASIFEDVDDFSASDERKLSEVLYIQPNRGSVSDFGDLNAQSRQQEKLRYSIPKKRYENSYESADNDSVGHACKSRYTRGGSVV
        FSNFENPQSYLSK  HVASIFEDVDDFS SDERKLSEVLYIQPN GSV  F   NA SRQQE   YSIPKKRYENS E  D +SVGHACKSRYTRGGSVV
Subjt:  FSNFENPQSYLSKTFHVASIFEDVDDFSASDERKLSEVLYIQPNRGSVSDFGDLNAQSRQQEKLRYSIPKKRYENSYESADNDSVGHACKSRYTRGGSVV

Query:  VVAETNR-RSSEWLDSGAIVNYKPLGLPVRSLRSNLTEPDDVEFECGDESCLSSKSSSKSSENNCERRSEFGDNCCVNLEEKFDEAVIASMSQFQLREKF
        VVAETNR  S EWL+SGAIVNYKPLGLPVRSLRSNLTEPDDVEF+CGDESCLSSKSSSK+SE+NCER SEFGDNCCVNLEEKFDE VIA MS FQLRE F
Subjt:  VVAETNR-RSSEWLDSGAIVNYKPLGLPVRSLRSNLTEPDDVEFECGDESCLSSKSSSKSSENNCERRSEFGDNCCVNLEEKFDEAVIASMSQFQLREKF

Query:  GKKVTRERGVGNAALRPSHFRPPSIDETQFESLKKSRSLRSPLSQSSQTSSFSSSLSPTTRKHRKMSSLGNIPYKSLHSRQYSMSSLSENSRGSSEDPLI
        GK + RERGV NA LRPSHFRP SIDETQFESLKKSRSL S LSQSSQTSS S SLS TTRKHRKMSSLGNI YKS HSRQYS+SSLSENSRGSSEDPLI
Subjt:  GKKVTRERGVGNAALRPSHFRPPSIDETQFESLKKSRSLRSPLSQSSQTSSFSSSLSPTTRKHRKMSSLGNIPYKSLHSRQYSMSSLSENSRGSSEDPLI

Query:  EPENSSECNESILSSPHLDRNFASIPKALSRGKSVRTIRTNAVSIEEMKAQEMYRNQVEHGENIGKKFAEGGESPYMREDGVGHGWSGINNPNAGNSNRL
        EPENSSECNESI+SSP LDRNFA IPKALSRGKSVRTIR N  +IEEMKAQEMYRNQVEH +N+G KF EGG SPYMREDG GHGW GIN+PNAG SNR 
Subjt:  EPENSSECNESILSSPHLDRNFASIPKALSRGKSVRTIRTNAVSIEEMKAQEMYRNQVEHGENIGKKFAEGGESPYMREDGVGHGWSGINNPNAGNSNRL

Query:  PK-TTFSGIEEQKEETESLLTDD--CKDNSEREDESIFASSDEEAASSMAGESESGAHEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGWGSFSSTSSS
        PK TTFSGIEEQKE+ ES LTDD   +DNSERED S F SSDEEAASSMAGESESGA+EVDKKAGEFIAKFREQIQLQRMASV+KRLRGGWGSFSSTSSS
Subjt:  PK-TTFSGIEEQKEETESLLTDD--CKDNSEREDESIFASSDEEAASSMAGESESGAHEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGWGSFSSTSSS

Query:  YFS
        YFS
Subjt:  YFS

A0A6J1H4M0 uncharacterized protein LOC111459998 (e-value: 5.7e-256; identity: 80.96%)
Query:  MASSTSSPFTKPHFPHSPLPPTPTTHQRKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVNQTLLTKFWELFHLMFVGIAVSYGLFSRRNIQVSVDEPRFS
        MASS SSPFTK HFPHSPLP  P      SCAQFLCKS+FFC FLLLLPLFPSEAPDFV+QTL TKFWELFHLMFVGIAVSYGLFS RN Q++VDEPR+S
Subjt:  MASSTSSPFTKPHFPHSPLPPTPTTHQRKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVNQTLLTKFWELFHLMFVGIAVSYGLFSRRNIQVSVDEPRFS

Query:  NFENPQSYLSKTFHVASIFEDVDDFSASDERKLSEVLYIQPNRGSVSDFGDLNAQSRQQEKLRYSIPKKRYENSYESADNDSVGHACKSRYTRGGSVVVV
        +FENPQSYLSK  +VASIF+DVDDF  SDERK+SEVLYIQP  GS S   DLNAQSR QEKLRYS+PKKRYENSYE AD D+V HACKSRYTRGGSVVVV
Subjt:  NFENPQSYLSKTFHVASIFEDVDDFSASDERKLSEVLYIQPNRGSVSDFGDLNAQSRQQEKLRYSIPKKRYENSYESADNDSVGHACKSRYTRGGSVVVV

Query:  AETNRRSSEWLDSGAIVNYKPLGLPVRSLRSNLTEPDDVEFECGDESCLSSKSSSKSSENNCERRSEFGDNCCVNLEEKFDEAVIASMSQFQLREKFGKK
         ETNR S     SG IVNYKPLGLPVRSLRS+LTE DDVEF+CGDESCLSSKSS KSSENNCE  SEFGDNCCVNLEEKFDE  IASMS FQLREKFGKK
Subjt:  AETNRRSSEWLDSGAIVNYKPLGLPVRSLRSNLTEPDDVEFECGDESCLSSKSSSKSSENNCERRSEFGDNCCVNLEEKFDEAVIASMSQFQLREKFGKK

Query:  VTRERGVGNAALRPSHFRPPSIDETQFESLKKSRSLRSPLSQSSQTSSFSSSLSPTTRKHRKMSSLGNIPYKSLHSRQYSMSSLSENSRGSSEDPLIEPE
        V RERG GNA LRPSHFRPPSIDETQFESLKKS SL S LSQSSQTSS SS LS TTRK RKMSSL NI YKSLHSRQYS SSLSENSRGSSEDPLIE E
Subjt:  VTRERGVGNAALRPSHFRPPSIDETQFESLKKSRSLRSPLSQSSQTSSFSSSLSPTTRKHRKMSSLGNIPYKSLHSRQYSMSSLSENSRGSSEDPLIEPE

Query:  NSSECNESILSSPHLDRNFASIPKALSRGKSVRTIRTNAVSIEEMKAQEMYRNQVEHGENIGKKFAEGGES-PYMREDGVGHGWSGINNPNAGNSNRLPK
        NSSECNES++SSP  DRNFASIPKALS+GKSVR IR NA +IE+MKAQEM+R QV+H + IG KF EGG S PYMREDG G GW  + NPNAGN NR PK
Subjt:  NSSECNESILSSPHLDRNFASIPKALSRGKSVRTIRTNAVSIEEMKAQEMYRNQVEHGENIGKKFAEGGES-PYMREDGVGHGWSGINNPNAGNSNRLPK

Query:  TTFSGIEEQKEETESLLTDDCKDNSEREDESIFASSDEEAASSMAGESESGAHEVDKKAGEFIAKFREQIQLQRMASVEKRLR------GGWGSFSSTSS
        TTF GI+EQKEETESL+ DD KD+SE EDES+FASSDEEA SSMAG+SESGA EVDKKAGEFIAKFREQIQLQRMASVEKRLR      GGWGSFSSTSS
Subjt:  TTFSGIEEQKEETESLLTDDCKDNSEREDESIFASSDEEAASSMAGESESGAHEVDKKAGEFIAKFREQIQLQRMASVEKRLR------GGWGSFSSTSS

Query:  SYFS
        SYFS
Subjt:  SYFS

A0A6J1KUS4 uncharacterized protein LOC111498900 (e-value: 3.4e-261; identity: 81.8%)
Query:  MASSTSSPFTKPHFPHSPLPPTPTTHQRKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVNQTLLTKFWELFHLMFVGIAVSYGLFSRRNIQVSVDEPRFS
        MASS SSPFTK HFPHSPLP  P TH   SCAQFLCKSLFFC FLLLLPLFPSEAPDFV+QTL TKFWELFHLM VGIAVSYGLFS RN Q++VDEPR+S
Subjt:  MASSTSSPFTKPHFPHSPLPPTPTTHQRKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVNQTLLTKFWELFHLMFVGIAVSYGLFSRRNIQVSVDEPRFS

Query:  NFENPQSYLSKTFHVASIFEDVDDFSASDERKLSEVLYIQPNRGSVSDFGDLNAQSRQQEKLRYSIPKKRYENSYESADNDSVGHACKSRYTRGGSVVVV
        +FENPQSYLSK  +VASIF+DVDDFS SDERKLSEVLYIQPN GS S   DLNAQSRQQEKLRYSIPKKRYENSYE AD D+V HACKSRYTRGGSVVVV
Subjt:  NFENPQSYLSKTFHVASIFEDVDDFSASDERKLSEVLYIQPNRGSVSDFGDLNAQSRQQEKLRYSIPKKRYENSYESADNDSVGHACKSRYTRGGSVVVV

Query:  AETNRRSSEWLDSGAIVNYKPLGLPVRSLRSNLTEPDDVEFECGDESCLSSKSSSKSSENNCERRSEFGDNCCVNLEEKFDEAVIASMSQFQLREKFGKK
         ETNR S     SG IVNYKPLGLPVRSL+S+LTE DDVEF+CGDESCLSSKSS KSSENNCE  SEFGDNCCVNLEEKFDE  IASMS FQLREKFGKK
Subjt:  AETNRRSSEWLDSGAIVNYKPLGLPVRSLRSNLTEPDDVEFECGDESCLSSKSSSKSSENNCERRSEFGDNCCVNLEEKFDEAVIASMSQFQLREKFGKK

Query:  VTRERGVGNAALRPSHFRPPSIDETQFESLKKSRSLRSPLSQSSQTSSFSSSLSPTTRKHRKMSSLGNIPYKSLHSRQYSMSSLSENSRGSSEDPLIEPE
        V RERG GNA LRPSHFRPPSIDETQFESLKKS SL S LSQSSQTSS SSSLS TTRKH KMSSL NI YKSLHSRQYSMSSLSENSRGSSEDPLIE E
Subjt:  VTRERGVGNAALRPSHFRPPSIDETQFESLKKSRSLRSPLSQSSQTSSFSSSLSPTTRKHRKMSSLGNIPYKSLHSRQYSMSSLSENSRGSSEDPLIEPE

Query:  NSSECNESILSSPHLDRNFASIPKALSRGKSVRTIRTNAVSIEEMKAQEMYRNQVEHGENIGKKFAEGGESPYMREDGVGHGWSGINNPNAGNSNRLPKT
        NSSECNES++SSP  D NF SIPKALS+GKS+R I+ NA +IE++KAQEM+R QV+H + IG KF EGG SPY+REDG GHGW  + NPNA N +R P T
Subjt:  NSSECNESILSSPHLDRNFASIPKALSRGKSVRTIRTNAVSIEEMKAQEMYRNQVEHGENIGKKFAEGGESPYMREDGVGHGWSGINNPNAGNSNRLPKT

Query:  TFSGIEEQKEETESLLTDDCKDNSEREDESIFASSDEEAASSMAGESESGAHEVDKKAGEFIAKFREQIQLQRMASVEKRLR--GGWGSFSSTSSSYFS
        TF GI+EQKEETESL+ DD KD+SE EDES FASSDEEAASSMAG+SESGA EVDKKAGEFIAKFREQIQLQRMASVEKRLR  GGWGSFSSTSSSYFS
Subjt:  TFSGIEEQKEETESLLTDDCKDNSEREDESIFASSDEEAASSMAGESESGAHEVDKKAGEFIAKFREQIQLQRMASVEKRLR--GGWGSFSSTSSSYFS

E5GCN2 Uncharacterized protein (e-value: 1.9e-272; identity: 85.41%)
Query:  MASSTSSPFTKPHFPHSPLPPTPTTHQRKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVNQTLLTKFWELFHLMFVGIAVSYGLFSRRNIQVSV--DEPR
        MASS S+PFTKPHFPHSPLPPT TT    SC  FLCKSLFFCIFLLLLPLFPSEAP+FVNQTLLTKFWELFHLMFVGIAVSYGLFSRRN+QVSV  DEPR
Subjt:  MASSTSSPFTKPHFPHSPLPPTPTTHQRKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVNQTLLTKFWELFHLMFVGIAVSYGLFSRRNIQVSV--DEPR

Query:  FSNFENPQSYLSKTFHVASIFEDVDDFSASDERKLSEVLYIQPNRGSVSDFGDLNAQSRQQEKLRYSIPKKRYENSYESADNDSVGHACKSRYTRGGSVV
        FSNFENPQSYLSK  HVASIFEDVDDFS SDERKLSEVLYIQPN GSV  F   NA SRQQE   YSIPKKRYENS E  D +SVGHACKSRYTRGGSVV
Subjt:  FSNFENPQSYLSKTFHVASIFEDVDDFSASDERKLSEVLYIQPNRGSVSDFGDLNAQSRQQEKLRYSIPKKRYENSYESADNDSVGHACKSRYTRGGSVV

Query:  VVAETNR-RSSEWLDSGAIVNYKPLGLPVRSLRSNLTEPDDVEFECGDESCLSSKSSSKSSENNCERRSEFGDNCCVNLEEKFDEAVIASMSQFQLREKF
        VVAETNR  S EWL+SGAIVNYKPLGLPVRSLRSNLTEPDDVEF+CGDESCLSSKSSSK+SE+NCER SEFGDNCCVNLEEKFDE VIA MS FQLRE F
Subjt:  VVAETNR-RSSEWLDSGAIVNYKPLGLPVRSLRSNLTEPDDVEFECGDESCLSSKSSSKSSENNCERRSEFGDNCCVNLEEKFDEAVIASMSQFQLREKF

Query:  GKKVTRERGVGNAALRPSHFRPPSIDETQFESLKKSRSLRSPLSQSSQTSSFSSSLSPTTRKHRKMSSLGNIPYKSLHSRQYSMSSLSENSRGSSEDPLI
        GK + RERGV NA LRPSHFRP SIDETQFESLKKSRSL S LSQSSQTSS S SLS TTRKHRKMSSLGNI YKS HSRQYS+SSLSENSRGSSEDPLI
Subjt:  GKKVTRERGVGNAALRPSHFRPPSIDETQFESLKKSRSLRSPLSQSSQTSSFSSSLSPTTRKHRKMSSLGNIPYKSLHSRQYSMSSLSENSRGSSEDPLI

Query:  EPENSSECNESILSSPHLDRNFASIPKALSRGKSVRTIRTNAVSIEEMKAQEMYRNQVEHGENIGKKFAEGGESPYMREDGVGHGWSGINNPNAGNSNRL
        EPENSSECNESI+SSP LDRNFA IPKALSRGKSVRTIR N  +IEEMKAQEMYRNQVEH +N+G KF EGG SPYMREDG GHGW GIN+PNAG SNR 
Subjt:  EPENSSECNESILSSPHLDRNFASIPKALSRGKSVRTIRTNAVSIEEMKAQEMYRNQVEHGENIGKKFAEGGESPYMREDGVGHGWSGINNPNAGNSNRL

Query:  PK-TTFSGIEEQKEETESLLTDD--CKDNSEREDESIFASSDEEAASSMAGESESGAHEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGWGSFSSTSSS
        PK TTFSGIEEQKE+ ES LTDD   +DNSERED S F SSDEEAASSMAGESESGA+EVDKKAGEFIAKFREQIQLQRMASV+KRLRGGWGSFSSTSSS
Subjt:  PK-TTFSGIEEQKEETESLLTDD--CKDNSEREDESIFASSDEEAASSMAGESESGAHEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGWGSFSSTSSS

Query:  YFS
        YFS
Subjt:  YFS

SwissProt top hits (e-value, % identity, alignment)
No hits found
Arabidopsis top hits (e-value, % identity, alignment)
AT3G60380.1 FUNCTIONS IN: molecular_function unknown (e-value: 1.2e-35; identity: 34.47%)
Query:  STSSPFTKPHFPHSPLPPTPTTHQRKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVNQTLLTKFWELFHLMFVGIAVSYGLFSRRNIQVSVDEPRFSNFE
        ++ +P+TK   P + + P    ++      F CKS+ F +FLL LPLFPS+APDFV +T+LTKFWEL HL+FVGIAV+YGLFSRRN++ +VD       E
Subjt:  STSSPFTKPHFPHSPLPPTPTTHQRKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVNQTLLTKFWELFHLMFVGIAVSYGLFSRRNIQVSVDEPRFSNFE

Query:  NPQSYLSKTFHVASIF-EDVDDFSA------SDERKLSEVLYIQPNRGSVSDFGDLNAQSRQQEKLRYSIPKKRYENSYESADNDSVGHACKSRYTRGGS
        +  SY+S+ F V+S+F E+ DD S       SDE   +    +  +   V + G+L                   E S E  + + V  A  S+Y +G S
Subjt:  NPQSYLSKTFHVASIF-EDVDDFSA------SDERKLSEVLYIQPNRGSVSDFGDLNAQSRQQEKLRYSIPKKRYENSYESADNDSVGHACKSRYTRGGS

Query:  VVVVAETNRRSSEWLDSGAIVNYKPLGLPVRSLRSNLTEPDDVEFECGDESCLSSKSSSKSSEN--NCERRSEFGDNCCVNLEEKFDEAVIASMSQFQLR
         VVVA    R +  LD   +  ++PLGLP+R LRS+L           D + L  KS + S +   N E  S   DN        FDE + A  S    +
Subjt:  VVVVAETNRRSSEWLDSGAIVNYKPLGLPVRSLRSNLTEPDDVEFECGDESCLSSKSSSKSSEN--NCERRSEFGDNCCVNLEEKFDEAVIASMSQFQLR

Query:  EKFGKKVTRERGVGNAALRPSHFRPPSIDETQFESLKKSRSLRSPLSQSSQTSSFSSSLSPTTRKHRKMSSLGNIPYKSLHSR-QYSMSSLSENSRGSSE
                R   +G     PS+F+P S+DET      KS S RS  S SSQTS  S       +   + S   ++  +SL+S  +  +   S  S   S 
Subjt:  EKFGKKVTRERGVGNAALRPSHFRPPSIDETQFESLKKSRSLRSPLSQSSQTSSFSSSLSPTTRKHRKMSSLGNIPYKSLHSR-QYSMSSLSENSRGSSE

Query:  DPLIEPENS
         P + P  S
Subjt:  DPLIEPENS

AT3G60380.1 FUNCTIONS IN: molecular_function unknown (e-value: 3.1e-04; identity: 40.96%)
Query:  KEETESLLTDDCKDNSEREDESIFASSDEEAASSMAGESESGAHEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGWGSFSST
        K E E +  ++    +E++ E  F   +EEAA      +    +EVD+KAGEFIAKFREQI+LQ++ S E+   GG G F ++
Subjt:  KEETESLLTDDCKDNSEREDESIFASSDEEAASSMAGESESGAHEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGWGSFSST

AT4G16790.1 hydroxyproline-rich glycoprotein family protein (e-value: 2.6e-11; identity: 38.02%)
Query:  RKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVNQTLLTKFWELFHLMFVGIAVSYGLFSRRNI---------QVSVDEPRFSNFENPQSYLSKTFHVASI
        RK  ++F+ K+L   +   ++P+F S+ P+  NQT L    EL HL+FVGIAVSYGLFSRRN              ++   SN  N  SY+ K   V+S+
Subjt:  RKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVNQTLLTKFWELFHLMFVGIAVSYGLFSRRNI---------QVSVDEPRFSNFENPQSYLSKTFHVASI

Query:  F-------EDVDDFSASDERK
        F        +  D S+ D+RK
Subjt:  F-------EDVDDFSASDERK

AT4G16790.1 hydroxyproline-rich glycoprotein family protein (e-value: 6.4e-02; identity: 36.49%)
Query:  EQKEETESLLTDDCKDNSEREDESIFASSDEEAASSMAGESE-SGAHEVDKKAGEFIAKFREQIQLQRMASVEK
        +Q+    S   ++ ++  +R  E+      E+      G SE +   +VDKKA EFIAKFREQI+LQR+ S+++
Subjt:  EQKEETESLLTDDCKDNSEREDESIFASSDEEAASSMAGESE-SGAHEVDKKAGEFIAKFREQIQLQRMASVEK


Sequences
CDS sequence
ATGGCGTCTTCAACTTCCAGCCCTTTCACCAAGCCCCATTTTCCACATTCTCCACTTCCACCAACACCTACTACTCACCAGCGCAAGTCCTGCGCACAATTTCTCTGTAA
ATCCCTCTTCTTCTGCATTTTCCTCCTCCTTCTCCCTCTCTTCCCTTCCGAGGCTCCAGATTTCGTCAATCAGACTTTGCTCACCAAATTCTGGGAGCTTTTTCACCTCA
TGTTCGTCGGCATTGCTGTTTCCTATGGTCTCTTTAGCAGAAGGAACATCCAGGTGAGTGTTGACGAACCTCGCTTCTCCAATTTTGAAAATCCGCAGTCCTATTTGTCT
AAGACGTTTCACGTCGCGTCCATTTTTGAAGATGTTGACGATTTCAGTGCTTCTGATGAGAGGAAACTGAGTGAAGTTTTGTACATTCAGCCGAATCGTGGATCGGTGAG
TGATTTTGGGGATTTGAATGCCCAATCTCGCCAACAGGAAAAACTCCGTTACTCCATACCCAAAAAGAGGTATGAAAACTCTTATGAATCTGCTGATAATGATAGTGTTG
GTCATGCTTGTAAATCGAGATATACTCGTGGTGGATCTGTTGTGGTAGTTGCTGAAACAAATCGTAGGTCTAGTGAATGGTTGGATTCAGGGGCCATTGTAAATTATAAA
CCTCTAGGTTTGCCTGTTAGGAGTTTGAGGTCGAATCTTACTGAACCCGACGATGTTGAATTTGAATGTGGTGATGAATCTTGTTTGAGTTCTAAAAGTTCATCCAAGAG
CTCTGAGAATAATTGTGAAAGAAGAAGTGAATTTGGTGATAACTGTTGTGTGAATTTGGAGGAGAAGTTTGATGAAGCTGTTATTGCGTCAATGTCCCAATTTCAATTGC
GTGAGAAATTTGGAAAGAAGGTGACGAGAGAGAGAGGAGTTGGGAATGCTGCTCTTCGTCCTTCCCATTTTAGACCTCCCTCCATTGATGAAACTCAATTTGAATCACTG
AAAAAATCAAGGTCTCTTCGTTCTCCTCTATCTCAGTCATCACAAACTAGTTCCTTCTCTTCTTCGTTGTCACCAACAACAAGAAAGCACCGTAAAATGTCGTCACTTGG
TAACATTCCATATAAATCATTGCATTCTCGACAATACAGTATGAGTTCTCTGTCTGAAAACAGTAGAGGGAGCTCTGAAGACCCTCTGATTGAACCAGAAAATTCATCTG
AGTGCAATGAATCCATCTTAAGTTCCCCACATTTGGACAGGAATTTCGCAAGTATTCCGAAAGCTTTATCCCGAGGAAAATCCGTTAGAACAATTAGAACAAATGCAGTT
TCCATAGAGGAAATGAAAGCTCAAGAGATGTATAGAAACCAAGTTGAACATGGTGAAAATATAGGGAAGAAGTTTGCAGAAGGTGGAGAGTCACCATATATGAGAGAAGA
TGGAGTGGGGCATGGATGGTCTGGTATTAATAACCCGAATGCTGGTAATTCAAATCGCTTGCCGAAGACGACGTTCTCGGGGATTGAGGAGCAGAAGGAAGAGACTGAGA
GTCTCCTGACAGATGATTGTAAAGATAACTCTGAGAGGGAGGATGAAAGTATTTTTGCAAGTTCAGATGAAGAAGCTGCTTCAAGTATGGCGGGGGAGTCGGAATCGGGG
GCTCACGAGGTCGACAAGAAAGCTGGCGAGTTCATAGCCAAGTTCAGGGAGCAAATACAGCTTCAGAGGATGGCTTCAGTAGAAAAAAGATTGAGAGGAGGATGGGGGTC
ATTCAGCAGCACAAGCAGCAGCTATTTCAGTTGA
mRNA sequence
ATGGCGTCTTCAACTTCCAGCCCTTTCACCAAGCCCCATTTTCCACATTCTCCACTTCCACCAACACCTACTACTCACCAGCGCAAGTCCTGCGCACAATTTCTCTGTAA
ATCCCTCTTCTTCTGCATTTTCCTCCTCCTTCTCCCTCTCTTCCCTTCCGAGGCTCCAGATTTCGTCAATCAGACTTTGCTCACCAAATTCTGGGAGCTTTTTCACCTCA
TGTTCGTCGGCATTGCTGTTTCCTATGGTCTCTTTAGCAGAAGGAACATCCAGGTGAGTGTTGACGAACCTCGCTTCTCCAATTTTGAAAATCCGCAGTCCTATTTGTCT
AAGACGTTTCACGTCGCGTCCATTTTTGAAGATGTTGACGATTTCAGTGCTTCTGATGAGAGGAAACTGAGTGAAGTTTTGTACATTCAGCCGAATCGTGGATCGGTGAG
TGATTTTGGGGATTTGAATGCCCAATCTCGCCAACAGGAAAAACTCCGTTACTCCATACCCAAAAAGAGGTATGAAAACTCTTATGAATCTGCTGATAATGATAGTGTTG
GTCATGCTTGTAAATCGAGATATACTCGTGGTGGATCTGTTGTGGTAGTTGCTGAAACAAATCGTAGGTCTAGTGAATGGTTGGATTCAGGGGCCATTGTAAATTATAAA
CCTCTAGGTTTGCCTGTTAGGAGTTTGAGGTCGAATCTTACTGAACCCGACGATGTTGAATTTGAATGTGGTGATGAATCTTGTTTGAGTTCTAAAAGTTCATCCAAGAG
CTCTGAGAATAATTGTGAAAGAAGAAGTGAATTTGGTGATAACTGTTGTGTGAATTTGGAGGAGAAGTTTGATGAAGCTGTTATTGCGTCAATGTCCCAATTTCAATTGC
GTGAGAAATTTGGAAAGAAGGTGACGAGAGAGAGAGGAGTTGGGAATGCTGCTCTTCGTCCTTCCCATTTTAGACCTCCCTCCATTGATGAAACTCAATTTGAATCACTG
AAAAAATCAAGGTCTCTTCGTTCTCCTCTATCTCAGTCATCACAAACTAGTTCCTTCTCTTCTTCGTTGTCACCAACAACAAGAAAGCACCGTAAAATGTCGTCACTTGG
TAACATTCCATATAAATCATTGCATTCTCGACAATACAGTATGAGTTCTCTGTCTGAAAACAGTAGAGGGAGCTCTGAAGACCCTCTGATTGAACCAGAAAATTCATCTG
AGTGCAATGAATCCATCTTAAGTTCCCCACATTTGGACAGGAATTTCGCAAGTATTCCGAAAGCTTTATCCCGAGGAAAATCCGTTAGAACAATTAGAACAAATGCAGTT
TCCATAGAGGAAATGAAAGCTCAAGAGATGTATAGAAACCAAGTTGAACATGGTGAAAATATAGGGAAGAAGTTTGCAGAAGGTGGAGAGTCACCATATATGAGAGAAGA
TGGAGTGGGGCATGGATGGTCTGGTATTAATAACCCGAATGCTGGTAATTCAAATCGCTTGCCGAAGACGACGTTCTCGGGGATTGAGGAGCAGAAGGAAGAGACTGAGA
GTCTCCTGACAGATGATTGTAAAGATAACTCTGAGAGGGAGGATGAAAGTATTTTTGCAAGTTCAGATGAAGAAGCTGCTTCAAGTATGGCGGGGGAGTCGGAATCGGGG
GCTCACGAGGTCGACAAGAAAGCTGGCGAGTTCATAGCCAAGTTCAGGGAGCAAATACAGCTTCAGAGGATGGCTTCAGTAGAAAAAAGATTGAGAGGAGGATGGGGGTC
ATTCAGCAGCACAAGCAGCAGCTATTTCAGTTGA
Protein sequence
MASSTSSPFTKPHFPHSPLPPTPTTHQRKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVNQTLLTKFWELFHLMFVGIAVSYGLFSRRNIQVSVDEPRFSNFENPQSYLS
KTFHVASIFEDVDDFSASDERKLSEVLYIQPNRGSVSDFGDLNAQSRQQEKLRYSIPKKRYENSYESADNDSVGHACKSRYTRGGSVVVVAETNRRSSEWLDSGAIVNYK
PLGLPVRSLRSNLTEPDDVEFECGDESCLSSKSSSKSSENNCERRSEFGDNCCVNLEEKFDEAVIASMSQFQLREKFGKKVTRERGVGNAALRPSHFRPPSIDETQFESL
KKSRSLRSPLSQSSQTSSFSSSLSPTTRKHRKMSSLGNIPYKSLHSRQYSMSSLSENSRGSSEDPLIEPENSSECNESILSSPHLDRNFASIPKALSRGKSVRTIRTNAV
SIEEMKAQEMYRNQVEHGENIGKKFAEGGESPYMREDGVGHGWSGINNPNAGNSNRLPKTTFSGIEEQKEETESLLTDDCKDNSEREDESIFASSDEEAASSMAGESESG
AHEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGWGSFSSTSSSYFS
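
The CDS and protein sequences listed above can be cross-checked against each other by translating the nucleotides. The sketch below assumes the standard genetic code (NCBI translation table 1) and reading frame 0. The helper `translate` is a hypothetical name used for illustration, and for brevity it is applied only to the first 30 nucleotides of the CDS.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order at each position (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nt of the CDS sequence listed above.
print(translate("ATGGCGTCTTCAACTTCCAGCCCTTTCACC"))  # prints MASSTSSPFT
```

The result, MASSTSSPFT, matches the first ten residues of the protein sequence. Passing the full CDS to the same function would be the natural way to check the complete record.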