CuGenDBv2

Cp4.1LG12g04410 (gene) of Cucurbita pepo (MU-CU-16) v4.1 genome

Gene ID: Cp4.1LG12g04410
Organism: Cucurbita pepo var. pepo MU-CU-16 (Cucurbita pepo (MU-CU-16) v4.1)
Description: DUF761 domain-containing protein
Genome location: Cp4.1LG12:3731848..3733623
RNA-Seq Expression: Cp4.1LG12g04410
Synteny: Cp4.1LG12g04410
Gene Ontology terms: GO:0016020 - membrane (cellular component)
InterPro domains: IPR008480 - Protein of unknown function DUF761, plant
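The genome location field follows a `chromosome:start..end` pattern. As a minimal sketch, the snippet below splits such a string and reports the span length. The helper name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re


def parse_location(loc):
    """Split 'chrom:start..end' into (chrom, start, end, length).

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end, end - start + 1


if __name__ == "__main__":
    print(parse_location("Cp4.1LG12:3731848..3733623"))
```

Under that assumption the span is 1776 bp, the same length as the CDS listed on this page (591 codons plus a stop codon).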


Homology
GenBank top hits (columns: hit, e-value, % identity, alignment)
KAG6575261.1 hypothetical protein SDJN03_25900, partial [Cucurbita argyrosperma subsp. sororia] | e-value: 0.0 | identity: 96.79%
Query:  MASSASSPFTKLHFPHSPLPQPPANSCAQFLCKSIFFCFFLLLLPLFPSEAPHFVDHTLFTKFWELFHLMFVGIAVSYGLFSTRNNQMNVDEPRYSSFEN
        MASSASSPFTKLHFPHSPLPQPPANSCAQFLCKSIFFCFFLLLLPLFPSEAP FVD TLFTKFWELFHLMFVGIAVSYGLFSTRNNQMNVDEPRYSSFEN
Subjt:  MASSASSPFTKLHFPHSPLPQPPANSCAQFLCKSIFFCFFLLLLPLFPSEAPHFVDHTLFTKFWELFHLMFVGIAVSYGLFSTRNNQMNVDEPRYSSFEN

Query:  PQSYLSKMLYVASIFDDVDDFGVSDERKVSEVLYIQPKLGSASDFNAQSRHQEKLRYSIPKKRYENSYEFADTDNVAHACKSRYTRGGSVVVVPETNRSS
        PQSYLSKMLYVASIFDDVDDFGVSDERKVSEVLYIQPKLGSASD NAQSRHQEKLRYS+PKKRYENSYEFADTDNVAHACKSRYTRGGSVVVVPETNRSS
Subjt:  PQSYLSKMLYVASIFDDVDDFGVSDERKVSEVLYIQPKLGSASDFNAQSRHQEKLRYSIPKKRYENSYEFADTDNVAHACKSRYTRGGSVVVVPETNRSS

Query:  SGGIVNYKPLGLPVRSLRSSLTESDDVEFDCGDESCLSSKSSPRSSENNCEGNSEFGDDCCVNLEEKFDETAIASMSSFQLREKFGKKVIRERGFGNAVL
        SGGIVNYKPLGLPVRSLRSSLTESDDVEFDCGDESCLSSKSSP+SSENNCEGNSEFGD+CCVNLEEKFDETAIASMSSFQLREKFGKKVIRERGFGNAVL
Subjt:  SGGIVNYKPLGLPVRSLRSSLTESDDVEFDCGDESCLSSKSSPRSSENNCEGNSEFGDDCCVNLEEKFDETAIASMSSFQLREKFGKKVIRERGFGNAVL

Query:  RPSHFRPPSIDETQFESLKKSGSLHSNLSQSSQTSSLSSSLSSTTRQHRKMSSLSNISYKSLHSRQYSMSSVSENSRGSSEDPLIEQENSSECNESVVSS
        RPSHFRPPSIDETQFESL+KSGSLHS+LSQSSQTSSLSS LSSTTR+H KMSSLSNISYKSLHSRQYSMSS+SENSRGSSEDPLIEQENSSECNESVVSS
Subjt:  RPSHFRPPSIDETQFESLKKSGSLHSNLSQSSQTSSLSSSLSSTTRQHRKMSSLSNISYKSLHSRQYSMSSVSENSRGSSEDPLIEQENSSECNESVVSS

Query:  PRSDRNFASIPKALSQGKSVRRIRANAAAMEDMKAQEMHRKQVKQDDIIGNKFEEGGMSPPYMREDGTGHGWPDVVNPNASNMNRFPKTTFLGIKEQKEE
        PRSDRNFASIPKALSQGKSVRRIRANAAA+EDMKAQEMHRKQVK DDIIGNKFEEGGMSPPYMREDGTGHGWPDVVNPNA NMNRFPKTTFLGIKEQKEE
Subjt:  PRSDRNFASIPKALSQGKSVRRIRANAAAMEDMKAQEMHRKQVKQDDIIGNKFEEGGMSPPYMREDGTGHGWPDVVNPNASNMNRFPKTTFLGIKEQKEE

Query:  TESVVADDSKDDSEGEDESLFASSDEEAGSSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGGGGGGWGSFSSTSSSYFS
        TES+VADDSKD SEGEDESLFASSDEEAGSSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGGGG  WGSFSSTSSSYFS
Subjt:  TESVVADDSKDDSEGEDESLFASSDEEAGSSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGGGGGGWGSFSSTSSSYFS

KAG7013816.1 hypothetical protein SDJN02_23985, partial [Cucurbita argyrosperma subsp. argyrosperma] | e-value: 0.0 | identity: 96.62%
Query:  MASSASSPFTKLHFPHSPLPQPPANSCAQFLCKSIFFCFFLLLLPLFPSEAPHFVDHTLFTKFWELFHLMFVGIAVSYGLFSTRNNQMNVDEPRYSSFEN
        MASSASSPFTKLHFPHSPLPQPPANSCAQFLCKSIFFCFFLLLLPLFPSEAP FVD TLFTKFWELFHLMFVGIAVSYGLFSTRNNQMNVDEPRYSSFEN
Subjt:  MASSASSPFTKLHFPHSPLPQPPANSCAQFLCKSIFFCFFLLLLPLFPSEAPHFVDHTLFTKFWELFHLMFVGIAVSYGLFSTRNNQMNVDEPRYSSFEN

Query:  PQSYLSKMLYVASIFDDVDDFGVSDERKVSEVLYIQPKLGSASDFNAQSRHQEKLRYSIPKKRYENSYEFADTDNVAHACKSRYTRGGSVVVVPETNRSS
        PQSYLSKMLYVASIFDDVDDFGVSDERKVSEVLYIQPKLGSASD NAQSRHQEKLRYS+PKKRYENSYEFADTDNVAHACKSRYTRGGSVVVVPETNRSS
Subjt:  PQSYLSKMLYVASIFDDVDDFGVSDERKVSEVLYIQPKLGSASDFNAQSRHQEKLRYSIPKKRYENSYEFADTDNVAHACKSRYTRGGSVVVVPETNRSS

Query:  SGGIVNYKPLGLPVRSLRSSLTESDDVEFDCGDESCLSSKSSPRSSENNCEGNSEFGDDCCVNLEEKFDETAIASMSSFQLREKFGKKVIRERGFGNAVL
        SGGIVNYKPLGLPVRSLRSSLTESDDVEFDCGDESCLSSKSSP+SSENNCEGNSEFGD+CCVNLEEKFDETAIASMSSFQLREKFGKKVIRERGFGNAVL
Subjt:  SGGIVNYKPLGLPVRSLRSSLTESDDVEFDCGDESCLSSKSSPRSSENNCEGNSEFGDDCCVNLEEKFDETAIASMSSFQLREKFGKKVIRERGFGNAVL

Query:  RPSHFRPPSIDETQFESLKKSGSLHSNLSQSSQTSSLSSSLSSTTRQHRKMSSLSNISYKSLHSRQYSMSSVSENSRGSSEDPLIEQENSSECNESVVSS
        RPSHFRPPSIDETQFESL+KSGSLHS+LSQSSQTSSLSS LSSTTR+H KMSSLSNISYKSLHSRQYSMSS+SENSRGSSEDPLIEQENSSECNESVVSS
Subjt:  RPSHFRPPSIDETQFESLKKSGSLHSNLSQSSQTSSLSSSLSSTTRQHRKMSSLSNISYKSLHSRQYSMSSVSENSRGSSEDPLIEQENSSECNESVVSS

Query:  PRSDRNFASIPKALSQGKSVRRIRANAAAMEDMKAQEMHRKQVKQDDIIGNKFEEGGMSPPYMREDGTGHGWPDVVNPNASNMNRFPKTTFLGIKEQKEE
        PRSDRNFASIPKALSQGKSVRRIRANAAA+EDMKAQEMHRKQVK DDIIGNKFEEGGMSPPYMREDGTGHGWPDVVNPNA NMNRFPKTTFLGIKEQKEE
Subjt:  PRSDRNFASIPKALSQGKSVRRIRANAAAMEDMKAQEMHRKQVKQDDIIGNKFEEGGMSPPYMREDGTGHGWPDVVNPNASNMNRFPKTTFLGIKEQKEE

Query:  TESVVADDSKDDSEGEDESLFASSDEEAGSSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGGGGGGWGSFSSTSSSYFS
        TES+VADDSKD SEGEDES FASSDEEAGSSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGGGG  WGSFSSTSSSYFS
Subjt:  TESVVADDSKDDSEGEDESLFASSDEEAGSSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGGGGGGWGSFSSTSSSYFS

XP_022958845.1 uncharacterized protein LOC111459998 [Cucurbita moschata] | e-value: 0.0 | identity: 96.96%
Query:  MASSASSPFTKLHFPHSPLPQPPANSCAQFLCKSIFFCFFLLLLPLFPSEAPHFVDHTLFTKFWELFHLMFVGIAVSYGLFSTRNNQMNVDEPRYSSFEN
        MASSASSPFTKLHFPHSPLPQPPANSCAQFLCKSIFFCFFLLLLPLFPSEAP FVD TLFTKFWELFHLMFVGIAVSYGLFSTRNNQMNVDEPRYSSFEN
Subjt:  MASSASSPFTKLHFPHSPLPQPPANSCAQFLCKSIFFCFFLLLLPLFPSEAPHFVDHTLFTKFWELFHLMFVGIAVSYGLFSTRNNQMNVDEPRYSSFEN

Query:  PQSYLSKMLYVASIFDDVDDFGVSDERKVSEVLYIQPKLGSASDFNAQSRHQEKLRYSIPKKRYENSYEFADTDNVAHACKSRYTRGGSVVVVPETNRSS
        PQSYLSKMLYVASIFDDVDDFGVSDERKVSEVLYIQPKLGSASD NAQSRHQEKLRYS+PKKRYENSYEFADTDNVAHACKSRYTRGGSVVVVPETNRSS
Subjt:  PQSYLSKMLYVASIFDDVDDFGVSDERKVSEVLYIQPKLGSASDFNAQSRHQEKLRYSIPKKRYENSYEFADTDNVAHACKSRYTRGGSVVVVPETNRSS

Query:  SGGIVNYKPLGLPVRSLRSSLTESDDVEFDCGDESCLSSKSSPRSSENNCEGNSEFGDDCCVNLEEKFDETAIASMSSFQLREKFGKKVIRERGFGNAVL
        SGGIVNYKPLGLPVRSLRSSLTESDDVEFDCGDESCLSSKSSP+SSENNCEGNSEFGD+CCVNLEEKFDETAIASMSSFQLREKFGKKVIRERGFGNAVL
Subjt:  SGGIVNYKPLGLPVRSLRSSLTESDDVEFDCGDESCLSSKSSPRSSENNCEGNSEFGDDCCVNLEEKFDETAIASMSSFQLREKFGKKVIRERGFGNAVL

Query:  RPSHFRPPSIDETQFESLKKSGSLHSNLSQSSQTSSLSSSLSSTTRQHRKMSSLSNISYKSLHSRQYSMSSVSENSRGSSEDPLIEQENSSECNESVVSS
        RPSHFRPPSIDETQFESLKKSGSLHS LSQSSQTSSLSS LSSTTR+ RKMSSLSNISYKSLHSRQYS SS+SENSRGSSEDPLIEQENSSECNESVVSS
Subjt:  RPSHFRPPSIDETQFESLKKSGSLHSNLSQSSQTSSLSSSLSSTTRQHRKMSSLSNISYKSLHSRQYSMSSVSENSRGSSEDPLIEQENSSECNESVVSS

Query:  PRSDRNFASIPKALSQGKSVRRIRANAAAMEDMKAQEMHRKQVKQDDIIGNKFEEGGMSPPYMREDGTGHGWPDVVNPNASNMNRFPKTTFLGIKEQKEE
        PRSDRNFASIPKALSQGKSVRRIRANAAA+EDMKAQEMHRKQVK DDIIGNKFEEGGMSPPYMREDGTG GWPDVVNPNA NMNRFPKTTFLGIKEQKEE
Subjt:  PRSDRNFASIPKALSQGKSVRRIRANAAAMEDMKAQEMHRKQVKQDDIIGNKFEEGGMSPPYMREDGTGHGWPDVVNPNASNMNRFPKTTFLGIKEQKEE

Query:  TESVVADDSKDDSEGEDESLFASSDEEAGSSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGGGGGG-WGSFSSTSSSYFS
        TES+VADDSKDDSEGEDESLFASSDEEAGSSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGGGGGG WGSFSSTSSSYFS
Subjt:  TESVVADDSKDDSEGEDESLFASSDEEAGSSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGGGGGG-WGSFSSTSSSYFS

XP_023006022.1 uncharacterized protein LOC111498900 [Cucurbita maxima] | e-value: 0.0 | identity: 93.61%
Query:  MASSASSPFTKLHFPHSPLPQPPA----NSCAQFLCKSIFFCFFLLLLPLFPSEAPHFVDHTLFTKFWELFHLMFVGIAVSYGLFSTRNNQMNVDEPRYS
        MASSASSPFTKLHFPHSPLPQPPA    NSCAQFLCKS+FFCFFLLLLPLFPSEAP FVD TLFTKFWELFHLM VGIAVSYGLFSTRNNQMNVDEPRYS
Subjt:  MASSASSPFTKLHFPHSPLPQPPA----NSCAQFLCKSIFFCFFLLLLPLFPSEAPHFVDHTLFTKFWELFHLMFVGIAVSYGLFSTRNNQMNVDEPRYS

Query:  SFENPQSYLSKMLYVASIFDDVDDFGVSDERKVSEVLYIQPKLGSASDFNAQSRHQEKLRYSIPKKRYENSYEFADTDNVAHACKSRYTRGGSVVVVPET
        SFENPQSYLSKMLYVASIFDDVDDF VSDERK+SEVLYIQP LGSASD NAQSR QEKLRYSIPKKRYENSYEFADTDNVAHACKSRYTRGGSVVVVPET
Subjt:  SFENPQSYLSKMLYVASIFDDVDDFGVSDERKVSEVLYIQPKLGSASDFNAQSRHQEKLRYSIPKKRYENSYEFADTDNVAHACKSRYTRGGSVVVVPET

Query:  NRSSSGGIVNYKPLGLPVRSLRSSLTESDDVEFDCGDESCLSSKSSPRSSENNCEGNSEFGDDCCVNLEEKFDETAIASMSSFQLREKFGKKVIRERGFG
        NRSSSGGIVNYKPLGLPVRSL+SSLTESDDVEFDCGDESCLSSKSSP+SSENNCEGNSEFGD+CCVNLEEKFDETAIASMSSFQLREKFGKKVIRERGFG
Subjt:  NRSSSGGIVNYKPLGLPVRSLRSSLTESDDVEFDCGDESCLSSKSSPRSSENNCEGNSEFGDDCCVNLEEKFDETAIASMSSFQLREKFGKKVIRERGFG

Query:  NAVLRPSHFRPPSIDETQFESLKKSGSLHSNLSQSSQTSSLSSSLSSTTRQHRKMSSLSNISYKSLHSRQYSMSSVSENSRGSSEDPLIEQENSSECNES
        NAVLRPSHFRPPSIDETQFESLKKSGSLHSNLSQSSQTSSLSSSLSSTTR+H KMSSLSNISYKSLHSRQYSMSS+SENSRGSSEDPLIEQENSSECNES
Subjt:  NAVLRPSHFRPPSIDETQFESLKKSGSLHSNLSQSSQTSSLSSSLSSTTRQHRKMSSLSNISYKSLHSRQYSMSSVSENSRGSSEDPLIEQENSSECNES

Query:  VVSSPRSDRNFASIPKALSQGKSVRRIRANAAAMEDMKAQEMHRKQVKQDDIIGNKFEEGGMSPPYMREDGTGHGWPDVVNPNASNMNRFPKTTFLGIKE
        VVSSPRSD NF SIPKALSQGKS+RRI+ANAAA+ED+KAQEMHRKQVK DDIIGNKFEEGG SP Y+REDGTGHGWPDV NPNASNM+RFP TTFLGIKE
Subjt:  VVSSPRSDRNFASIPKALSQGKSVRRIRANAAAMEDMKAQEMHRKQVKQDDIIGNKFEEGGMSPPYMREDGTGHGWPDVVNPNASNMNRFPKTTFLGIKE

Query:  QKEETESVVADDSKDDSEGEDESLFASSDEEAGSSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGGGGGGWGSFSSTSSSYFS
        QKEETES+VADDSKDDSEGEDES FASSDEEA SSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGGG   WGSFSSTSSSYFS
Subjt:  QKEETESVVADDSKDDSEGEDESLFASSDEEAGSSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGGGGGGWGSFSSTSSSYFS

XP_023548366.1 uncharacterized protein LOC111807030 [Cucurbita pepo subsp. pepo] | e-value: 0.0 | identity: 100%
Query:  MASSASSPFTKLHFPHSPLPQPPANSCAQFLCKSIFFCFFLLLLPLFPSEAPHFVDHTLFTKFWELFHLMFVGIAVSYGLFSTRNNQMNVDEPRYSSFEN
        MASSASSPFTKLHFPHSPLPQPPANSCAQFLCKSIFFCFFLLLLPLFPSEAPHFVDHTLFTKFWELFHLMFVGIAVSYGLFSTRNNQMNVDEPRYSSFEN
Subjt:  MASSASSPFTKLHFPHSPLPQPPANSCAQFLCKSIFFCFFLLLLPLFPSEAPHFVDHTLFTKFWELFHLMFVGIAVSYGLFSTRNNQMNVDEPRYSSFEN

Query:  PQSYLSKMLYVASIFDDVDDFGVSDERKVSEVLYIQPKLGSASDFNAQSRHQEKLRYSIPKKRYENSYEFADTDNVAHACKSRYTRGGSVVVVPETNRSS
        PQSYLSKMLYVASIFDDVDDFGVSDERKVSEVLYIQPKLGSASDFNAQSRHQEKLRYSIPKKRYENSYEFADTDNVAHACKSRYTRGGSVVVVPETNRSS
Subjt:  PQSYLSKMLYVASIFDDVDDFGVSDERKVSEVLYIQPKLGSASDFNAQSRHQEKLRYSIPKKRYENSYEFADTDNVAHACKSRYTRGGSVVVVPETNRSS

Query:  SGGIVNYKPLGLPVRSLRSSLTESDDVEFDCGDESCLSSKSSPRSSENNCEGNSEFGDDCCVNLEEKFDETAIASMSSFQLREKFGKKVIRERGFGNAVL
        SGGIVNYKPLGLPVRSLRSSLTESDDVEFDCGDESCLSSKSSPRSSENNCEGNSEFGDDCCVNLEEKFDETAIASMSSFQLREKFGKKVIRERGFGNAVL
Subjt:  SGGIVNYKPLGLPVRSLRSSLTESDDVEFDCGDESCLSSKSSPRSSENNCEGNSEFGDDCCVNLEEKFDETAIASMSSFQLREKFGKKVIRERGFGNAVL

Query:  RPSHFRPPSIDETQFESLKKSGSLHSNLSQSSQTSSLSSSLSSTTRQHRKMSSLSNISYKSLHSRQYSMSSVSENSRGSSEDPLIEQENSSECNESVVSS
        RPSHFRPPSIDETQFESLKKSGSLHSNLSQSSQTSSLSSSLSSTTRQHRKMSSLSNISYKSLHSRQYSMSSVSENSRGSSEDPLIEQENSSECNESVVSS
Subjt:  RPSHFRPPSIDETQFESLKKSGSLHSNLSQSSQTSSLSSSLSSTTRQHRKMSSLSNISYKSLHSRQYSMSSVSENSRGSSEDPLIEQENSSECNESVVSS

Query:  PRSDRNFASIPKALSQGKSVRRIRANAAAMEDMKAQEMHRKQVKQDDIIGNKFEEGGMSPPYMREDGTGHGWPDVVNPNASNMNRFPKTTFLGIKEQKEE
        PRSDRNFASIPKALSQGKSVRRIRANAAAMEDMKAQEMHRKQVKQDDIIGNKFEEGGMSPPYMREDGTGHGWPDVVNPNASNMNRFPKTTFLGIKEQKEE
Subjt:  PRSDRNFASIPKALSQGKSVRRIRANAAAMEDMKAQEMHRKQVKQDDIIGNKFEEGGMSPPYMREDGTGHGWPDVVNPNASNMNRFPKTTFLGIKEQKEE

Query:  TESVVADDSKDDSEGEDESLFASSDEEAGSSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGGGGGGWGSFSSTSSSYFS
        TESVVADDSKDDSEGEDESLFASSDEEAGSSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGGGGGGWGSFSSTSSSYFS
Subjt:  TESVVADDSKDDSEGEDESLFASSDEEAGSSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGGGGGGWGSFSSTSSSYFS

TrEMBL top hits (columns: hit, e-value, % identity, alignment)
A0A0A0K9X1 Uncharacterized protein | e-value: 1.24e-304 | identity: 76.77%
Query:  MASSASSPFTKLHFPHSPLPQPP----ANSCAQFLCKSIFFCFFLLLLPLFPSEAPHFVDHTLFTKFWELFHLMFVGIAVSYGLFSTRNNQMNVD--EPR
        MA S S+PFTK HFPHSPLP       +NSC QF+CKS+FFC FLLLLPLFPSEAP FV+ T  TKFWELFHLMF+GIAVSYGLFS RN Q++VD  EPR
Subjt:  MASSASSPFTKLHFPHSPLPQPP----ANSCAQFLCKSIFFCFFLLLLPLFPSEAPHFVDHTLFTKFWELFHLMFVGIAVSYGLFSTRNNQMNVD--EPR

Query:  YSSFENPQSYLSKMLYVASIFDDVDDFGVSDERKVSEVLYIQPKLGSASDFNAQSRHQEKLRYSIPKKRYENSYEFADTDNVAHACKSRYTRGGSVVVVP
        +S+FENPQSYLSKM +VASIF+DVDDF VSDERK+SEVLYIQP LGS S  NA SR QE   YSIPKKRYENS EFA+TDNV HACKSRYTRGGSVVVV 
Subjt:  YSSFENPQSYLSKMLYVASIFDDVDDFGVSDERKVSEVLYIQPKLGSASDFNAQSRHQEKLRYSIPKKRYENSYEFADTDNVAHACKSRYTRGGSVVVVP

Query:  ETNRSSSG------GIVNYKPLGLPVRSLRSSLTESDDVEFDCGDESCLSSKSSPRSSENNCEGNSEFGDDCCVNLEEKFDETAIASMSSFQLREKFGKK
        ETNRS+SG       IVNYKPLGLPVRSL+SSLTE DDVEFDCGDESCLSSKSS ++SE+NCE  SEFGD+CCVNLEEKFDET IASMS FQLREKF K 
Subjt:  ETNRSSSG------GIVNYKPLGLPVRSLRSSLTESDDVEFDCGDESCLSSKSSPRSSENNCEGNSEFGDDCCVNLEEKFDETAIASMSSFQLREKFGKK

Query:  VIRERGFGNAVLRPSHFRPPSIDETQFESLKKSGSLHSNLSQSSQTSSLSSSLSSTTRQHRKMSSLSNISYKSLHSRQYSMSSVSENSRGSSEDPLIEQE
        ++RER   NAVLRPSHFRP SIDETQFESLKKS SLHSNLSQSSQTSSLSS LSS TR+HRKMSSL NISYKS HSRQYS+SS+SENSRGSSEDPLI+ E
Subjt:  VIRERGFGNAVLRPSHFRPPSIDETQFESLKKSGSLHSNLSQSSQTSSLSSSLSSTTRQHRKMSSLSNISYKSLHSRQYSMSSVSENSRGSSEDPLIEQE

Query:  NSSECNESVVSSPRSDRNFASIPKALSQGKSVRRIRANAAAMEDMKAQEMHRKQVKQDDIIGNKFEEGGMSPPYMREDGTGHGWPDVVNPNASNMNRFPK
        NSSECNESVVSSPR DRNFA+ PKALS+GKSVR +RA+ +A+E+MKAQEM+R QV+ DD + NKF EGGMSP YMRED TGHGWP + N NA+  NR+ K
Subjt:  NSSECNESVVSSPRSDRNFASIPKALSQGKSVRRIRANAAAMEDMKAQEMHRKQVKQDDIIGNKFEEGGMSPPYMREDGTGHGWPDVVNPNASNMNRFPK

Query:  TT----FLGIKEQKEETESVVADDSKDDSEGEDESLFASSDEEAGSSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGGGGGGWGSFSS
        TT    F GI+EQKE+TES V DD KD+SE ED+S F SSDEEA  SM GDSESGA EVDKKAGEFIAKFREQIQLQRMASV+KRLRGG     WGSFSS
Subjt:  TT----FLGIKEQKEETESVVADDSKDDSEGEDESLFASSDEEAGSSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGGGGGGWGSFSS

Query:  TSSSYFS
        T+SSYFS
Subjt:  TSSSYFS

A0A5D3DMA5 DUF761 domain-containing protein | e-value: 1.15e-309 | identity: 78.05%
Query:  MASSASSPFTKLHFPHSPLPQPP----ANSCAQFLCKSIFFCFFLLLLPLFPSEAPHFVDHTLFTKFWELFHLMFVGIAVSYGLFSTRNNQMNVD--EPR
        MASS S+PFTK HFPHSPLP       +NSC  FLCKS+FFC FLLLLPLFPSEAP FV+ TL TKFWELFHLMFVGIAVSYGLFS RN Q++VD  EPR
Subjt:  MASSASSPFTKLHFPHSPLPQPP----ANSCAQFLCKSIFFCFFLLLLPLFPSEAPHFVDHTLFTKFWELFHLMFVGIAVSYGLFSTRNNQMNVD--EPR

Query:  YSSFENPQSYLSKMLYVASIFDDVDDFGVSDERKVSEVLYIQPKLGSASDFNAQSRHQEKLRYSIPKKRYENSYEFADTDNVAHACKSRYTRGGSVVVVP
        +S+FENPQSYLSKML+VASIF+DVDDF VSDERK+SEVLYIQP LGS   FNA SR QE   YSIPKKRYENS EF DT++V HACKSRYTRGGSVVVV 
Subjt:  YSSFENPQSYLSKMLYVASIFDDVDDFGVSDERKVSEVLYIQPKLGSASDFNAQSRHQEKLRYSIPKKRYENSYEFADTDNVAHACKSRYTRGGSVVVVP

Query:  ETNRSSSG------GIVNYKPLGLPVRSLRSSLTESDDVEFDCGDESCLSSKSSPRSSENNCEGNSEFGDDCCVNLEEKFDETAIASMSSFQLREKFGKK
        ETNRS+SG       IVNYKPLGLPVRSLRS+LTE DDVEFDCGDESCLSSKSS ++SE+NCE  SEFGD+CCVNLEEKFDET IA MS FQLRE FGK 
Subjt:  ETNRSSSG------GIVNYKPLGLPVRSLRSSLTESDDVEFDCGDESCLSSKSSPRSSENNCEGNSEFGDDCCVNLEEKFDETAIASMSSFQLREKFGKK

Query:  VIRERGFGNAVLRPSHFRPPSIDETQFESLKKSGSLHSNLSQSSQTSSLSSSLSSTTRQHRKMSSLSNISYKSLHSRQYSMSSVSENSRGSSEDPLIEQE
        ++RERG  NAVLRPSHFRP SIDETQFESLKKS SLHSNLSQSSQTSSLS SLSSTTR+HRKMSSL NISYKS HSRQYS+SS+SENSRGSSEDPLIE E
Subjt:  VIRERGFGNAVLRPSHFRPPSIDETQFESLKKSGSLHSNLSQSSQTSSLSSSLSSTTRQHRKMSSLSNISYKSLHSRQYSMSSVSENSRGSSEDPLIEQE

Query:  NSSECNESVVSSPRSDRNFASIPKALSQGKSVRRIRANAAAMEDMKAQEMHRKQVKQDDIIGNKFEEGGMSPPYMREDGTGHGWPDVVNPNASNMNRFPK
        NSSECNES++SSPR DRNFA IPKALS+GKSVR IRAN +A+E+MKAQEM+R QV+ DD +GNKF EGGMSP YMREDGTGHGWP + +PNA   NR PK
Subjt:  NSSECNESVVSSPRSDRNFASIPKALSQGKSVRRIRANAAAMEDMKAQEMHRKQVKQDDIIGNKFEEGGMSPPYMREDGTGHGWPDVVNPNASNMNRFPK

Query:  TT-FLGIKEQKEETESVVADDS--KDDSEGEDESLFASSDEEAGSSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGGGGGGWGSFSST
        TT F GI+EQKE+ ES + DD   +D+SE ED S F SSDEEA SSMAG+SESGA+EVDKKAGEFIAKFREQIQLQRMASV+KRLRGG     WGSFSST
Subjt:  TT-FLGIKEQKEETESVVADDS--KDDSEGEDESLFASSDEEAGSSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGGGGGGWGSFSST

Query:  SSSYFS
        SSSYFS
Subjt:  SSSYFS

A0A6J1H4M0 uncharacterized protein LOC111459998 | e-value: 0.0 | identity: 96.96%
Query:  MASSASSPFTKLHFPHSPLPQPPANSCAQFLCKSIFFCFFLLLLPLFPSEAPHFVDHTLFTKFWELFHLMFVGIAVSYGLFSTRNNQMNVDEPRYSSFEN
        MASSASSPFTKLHFPHSPLPQPPANSCAQFLCKSIFFCFFLLLLPLFPSEAP FVD TLFTKFWELFHLMFVGIAVSYGLFSTRNNQMNVDEPRYSSFEN
Subjt:  MASSASSPFTKLHFPHSPLPQPPANSCAQFLCKSIFFCFFLLLLPLFPSEAPHFVDHTLFTKFWELFHLMFVGIAVSYGLFSTRNNQMNVDEPRYSSFEN

Query:  PQSYLSKMLYVASIFDDVDDFGVSDERKVSEVLYIQPKLGSASDFNAQSRHQEKLRYSIPKKRYENSYEFADTDNVAHACKSRYTRGGSVVVVPETNRSS
        PQSYLSKMLYVASIFDDVDDFGVSDERKVSEVLYIQPKLGSASD NAQSRHQEKLRYS+PKKRYENSYEFADTDNVAHACKSRYTRGGSVVVVPETNRSS
Subjt:  PQSYLSKMLYVASIFDDVDDFGVSDERKVSEVLYIQPKLGSASDFNAQSRHQEKLRYSIPKKRYENSYEFADTDNVAHACKSRYTRGGSVVVVPETNRSS

Query:  SGGIVNYKPLGLPVRSLRSSLTESDDVEFDCGDESCLSSKSSPRSSENNCEGNSEFGDDCCVNLEEKFDETAIASMSSFQLREKFGKKVIRERGFGNAVL
        SGGIVNYKPLGLPVRSLRSSLTESDDVEFDCGDESCLSSKSSP+SSENNCEGNSEFGD+CCVNLEEKFDETAIASMSSFQLREKFGKKVIRERGFGNAVL
Subjt:  SGGIVNYKPLGLPVRSLRSSLTESDDVEFDCGDESCLSSKSSPRSSENNCEGNSEFGDDCCVNLEEKFDETAIASMSSFQLREKFGKKVIRERGFGNAVL

Query:  RPSHFRPPSIDETQFESLKKSGSLHSNLSQSSQTSSLSSSLSSTTRQHRKMSSLSNISYKSLHSRQYSMSSVSENSRGSSEDPLIEQENSSECNESVVSS
        RPSHFRPPSIDETQFESLKKSGSLHS LSQSSQTSSLSS LSSTTR+ RKMSSLSNISYKSLHSRQYS SS+SENSRGSSEDPLIEQENSSECNESVVSS
Subjt:  RPSHFRPPSIDETQFESLKKSGSLHSNLSQSSQTSSLSSSLSSTTRQHRKMSSLSNISYKSLHSRQYSMSSVSENSRGSSEDPLIEQENSSECNESVVSS

Query:  PRSDRNFASIPKALSQGKSVRRIRANAAAMEDMKAQEMHRKQVKQDDIIGNKFEEGGMSPPYMREDGTGHGWPDVVNPNASNMNRFPKTTFLGIKEQKEE
        PRSDRNFASIPKALSQGKSVRRIRANAAA+EDMKAQEMHRKQVK DDIIGNKFEEGGMSPPYMREDGTG GWPDVVNPNA NMNRFPKTTFLGIKEQKEE
Subjt:  PRSDRNFASIPKALSQGKSVRRIRANAAAMEDMKAQEMHRKQVKQDDIIGNKFEEGGMSPPYMREDGTGHGWPDVVNPNASNMNRFPKTTFLGIKEQKEE

Query:  TESVVADDSKDDSEGEDESLFASSDEEAGSSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGGGGGG-WGSFSSTSSSYFS
        TES+VADDSKDDSEGEDESLFASSDEEAGSSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGGGGGG WGSFSSTSSSYFS
Subjt:  TESVVADDSKDDSEGEDESLFASSDEEAGSSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGGGGGG-WGSFSSTSSSYFS

A0A6J1KUS4 uncharacterized protein LOC111498900 | e-value: 0.0 | identity: 93.61%
Query:  MASSASSPFTKLHFPHSPLPQPPA----NSCAQFLCKSIFFCFFLLLLPLFPSEAPHFVDHTLFTKFWELFHLMFVGIAVSYGLFSTRNNQMNVDEPRYS
        MASSASSPFTKLHFPHSPLPQPPA    NSCAQFLCKS+FFCFFLLLLPLFPSEAP FVD TLFTKFWELFHLM VGIAVSYGLFSTRNNQMNVDEPRYS
Subjt:  MASSASSPFTKLHFPHSPLPQPPA----NSCAQFLCKSIFFCFFLLLLPLFPSEAPHFVDHTLFTKFWELFHLMFVGIAVSYGLFSTRNNQMNVDEPRYS

Query:  SFENPQSYLSKMLYVASIFDDVDDFGVSDERKVSEVLYIQPKLGSASDFNAQSRHQEKLRYSIPKKRYENSYEFADTDNVAHACKSRYTRGGSVVVVPET
        SFENPQSYLSKMLYVASIFDDVDDF VSDERK+SEVLYIQP LGSASD NAQSR QEKLRYSIPKKRYENSYEFADTDNVAHACKSRYTRGGSVVVVPET
Subjt:  SFENPQSYLSKMLYVASIFDDVDDFGVSDERKVSEVLYIQPKLGSASDFNAQSRHQEKLRYSIPKKRYENSYEFADTDNVAHACKSRYTRGGSVVVVPET

Query:  NRSSSGGIVNYKPLGLPVRSLRSSLTESDDVEFDCGDESCLSSKSSPRSSENNCEGNSEFGDDCCVNLEEKFDETAIASMSSFQLREKFGKKVIRERGFG
        NRSSSGGIVNYKPLGLPVRSL+SSLTESDDVEFDCGDESCLSSKSSP+SSENNCEGNSEFGD+CCVNLEEKFDETAIASMSSFQLREKFGKKVIRERGFG
Subjt:  NRSSSGGIVNYKPLGLPVRSLRSSLTESDDVEFDCGDESCLSSKSSPRSSENNCEGNSEFGDDCCVNLEEKFDETAIASMSSFQLREKFGKKVIRERGFG

Query:  NAVLRPSHFRPPSIDETQFESLKKSGSLHSNLSQSSQTSSLSSSLSSTTRQHRKMSSLSNISYKSLHSRQYSMSSVSENSRGSSEDPLIEQENSSECNES
        NAVLRPSHFRPPSIDETQFESLKKSGSLHSNLSQSSQTSSLSSSLSSTTR+H KMSSLSNISYKSLHSRQYSMSS+SENSRGSSEDPLIEQENSSECNES
Subjt:  NAVLRPSHFRPPSIDETQFESLKKSGSLHSNLSQSSQTSSLSSSLSSTTRQHRKMSSLSNISYKSLHSRQYSMSSVSENSRGSSEDPLIEQENSSECNES

Query:  VVSSPRSDRNFASIPKALSQGKSVRRIRANAAAMEDMKAQEMHRKQVKQDDIIGNKFEEGGMSPPYMREDGTGHGWPDVVNPNASNMNRFPKTTFLGIKE
        VVSSPRSD NF SIPKALSQGKS+RRI+ANAAA+ED+KAQEMHRKQVK DDIIGNKFEEGG SP Y+REDGTGHGWPDV NPNASNM+RFP TTFLGIKE
Subjt:  VVSSPRSDRNFASIPKALSQGKSVRRIRANAAAMEDMKAQEMHRKQVKQDDIIGNKFEEGGMSPPYMREDGTGHGWPDVVNPNASNMNRFPKTTFLGIKE

Query:  QKEETESVVADDSKDDSEGEDESLFASSDEEAGSSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGGGGGGWGSFSSTSSSYFS
        QKEETES+VADDSKDDSEGEDES FASSDEEA SSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGGG   WGSFSSTSSSYFS
Subjt:  QKEETESVVADDSKDDSEGEDESLFASSDEEAGSSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGGGGGGWGSFSSTSSSYFS

E5GCN2 Uncharacterized protein | e-value: 1.15e-309 | identity: 78.05%
Query:  MASSASSPFTKLHFPHSPLPQPP----ANSCAQFLCKSIFFCFFLLLLPLFPSEAPHFVDHTLFTKFWELFHLMFVGIAVSYGLFSTRNNQMNVD--EPR
        MASS S+PFTK HFPHSPLP       +NSC  FLCKS+FFC FLLLLPLFPSEAP FV+ TL TKFWELFHLMFVGIAVSYGLFS RN Q++VD  EPR
Subjt:  MASSASSPFTKLHFPHSPLPQPP----ANSCAQFLCKSIFFCFFLLLLPLFPSEAPHFVDHTLFTKFWELFHLMFVGIAVSYGLFSTRNNQMNVD--EPR

Query:  YSSFENPQSYLSKMLYVASIFDDVDDFGVSDERKVSEVLYIQPKLGSASDFNAQSRHQEKLRYSIPKKRYENSYEFADTDNVAHACKSRYTRGGSVVVVP
        +S+FENPQSYLSKML+VASIF+DVDDF VSDERK+SEVLYIQP LGS   FNA SR QE   YSIPKKRYENS EF DT++V HACKSRYTRGGSVVVV 
Subjt:  YSSFENPQSYLSKMLYVASIFDDVDDFGVSDERKVSEVLYIQPKLGSASDFNAQSRHQEKLRYSIPKKRYENSYEFADTDNVAHACKSRYTRGGSVVVVP

Query:  ETNRSSSG------GIVNYKPLGLPVRSLRSSLTESDDVEFDCGDESCLSSKSSPRSSENNCEGNSEFGDDCCVNLEEKFDETAIASMSSFQLREKFGKK
        ETNRS+SG       IVNYKPLGLPVRSLRS+LTE DDVEFDCGDESCLSSKSS ++SE+NCE  SEFGD+CCVNLEEKFDET IA MS FQLRE FGK 
Subjt:  ETNRSSSG------GIVNYKPLGLPVRSLRSSLTESDDVEFDCGDESCLSSKSSPRSSENNCEGNSEFGDDCCVNLEEKFDETAIASMSSFQLREKFGKK

Query:  VIRERGFGNAVLRPSHFRPPSIDETQFESLKKSGSLHSNLSQSSQTSSLSSSLSSTTRQHRKMSSLSNISYKSLHSRQYSMSSVSENSRGSSEDPLIEQE
        ++RERG  NAVLRPSHFRP SIDETQFESLKKS SLHSNLSQSSQTSSLS SLSSTTR+HRKMSSL NISYKS HSRQYS+SS+SENSRGSSEDPLIE E
Subjt:  VIRERGFGNAVLRPSHFRPPSIDETQFESLKKSGSLHSNLSQSSQTSSLSSSLSSTTRQHRKMSSLSNISYKSLHSRQYSMSSVSENSRGSSEDPLIEQE

Query:  NSSECNESVVSSPRSDRNFASIPKALSQGKSVRRIRANAAAMEDMKAQEMHRKQVKQDDIIGNKFEEGGMSPPYMREDGTGHGWPDVVNPNASNMNRFPK
        NSSECNES++SSPR DRNFA IPKALS+GKSVR IRAN +A+E+MKAQEM+R QV+ DD +GNKF EGGMSP YMREDGTGHGWP + +PNA   NR PK
Subjt:  NSSECNESVVSSPRSDRNFASIPKALSQGKSVRRIRANAAAMEDMKAQEMHRKQVKQDDIIGNKFEEGGMSPPYMREDGTGHGWPDVVNPNASNMNRFPK

Query:  TT-FLGIKEQKEETESVVADDS--KDDSEGEDESLFASSDEEAGSSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGGGGGGWGSFSST
        TT F GI+EQKE+ ES + DD   +D+SE ED S F SSDEEA SSMAG+SESGA+EVDKKAGEFIAKFREQIQLQRMASV+KRLRGG     WGSFSST
Subjt:  TT-FLGIKEQKEETESVVADDS--KDDSEGEDESLFASSDEEAGSSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGGGGGGWGSFSST

Query:  SSSYFS
        SSSYFS
Subjt:  SSSYFS

SwissProt top hits (columns: hit, e-value, % identity, alignment)
No hits found
Arabidopsis top hits (columns: hit, e-value, % identity, alignment)
AT3G60380.1 FUNCTIONS IN: molecular_function unknown | e-value: 2.0e-32 | identity: 36.6%
Query:  SASSPFTKLHFPHSPL--PQPPANSC--AQFLCKSIFFCFFLLLLPLFPSEAPHFVDHTLFTKFWELFHLMFVGIAVSYGLFSTRNNQMNVDEPRYSSFE
        ++ +P+TK   P + +  PQP   S     F CKS+ F  FLL LPLFPS+AP FV  T+ TKFWEL HL+FVGIAV+YGLFS RN +  VD       E
Subjt:  SASSPFTKLHFPHSPL--PQPPANSC--AQFLCKSIFFCFFLLLLPLFPSEAPHFVDHTLFTKFWELFHLMFVGIAVSYGLFSTRNNQMNVDEPRYSSFE

Query:  NPQSYLSKMLYVASIFD-DVDDFGVS--DERKVSEVLYIQPKLGSASDFNAQSRHQEKLRYSIPKKRYENSYEFADTDNVAHACKSRYTRGGSVVVVPET
        +  SY+S++  V+S+FD + DD      D R    V      +G +  F  +S               E S EF +T+ V  A  S+Y +G S VVV   
Subjt:  NPQSYLSKMLYVASIFD-DVDDFGVS--DERKVSEVLYIQPKLGSASDFNAQSRHQEKLRYSIPKKRYENSYEFADTDNVAHACKSRYTRGGSVVVVPET

Query:  NRSSSGGIVNYKPLGLPVRSLRSSLTESDDVEFDCGDESCLSSKSSPRSSEN--NCEGNSEFGDDCCVNLEEKFDE--TAIASMSSFQLREKFGKKVIRE
             G +V ++PLGLP+R LRSSL           D + L  KS   S +   N E  S   D+        FDE   A AS   +Q R +        
Subjt:  NRSSSGGIVNYKPLGLPVRSLRSSLTESDDVEFDCGDESCLSSKSSPRSSEN--NCEGNSEFGDDCCVNLEEKFDE--TAIASMSSFQLREKFGKKVIRE

Query:  RGFGNAVLRPSHFRPPSIDETQFESLKKSGSLHSNLSQSSQTSSLSSSLSSTTRQHRKMSSLSNISYKSLHSRQYSMSSVSENSRGSS
         G G+    PS+F+P S+DET      KS S  S  S SSQTS  S       +   + S   ++S +SL+S    +  V E SR SS
Subjt:  RGFGNAVLRPSHFRPPSIDETQFESLKKSGSLHSNLSQSSQTSSLSSSLSSTTRQHRKMSSLSNISYKSLHSRQYSMSSVSENSRGSS

AT3G60380.1 FUNCTIONS IN: molecular_function unknown | e-value: 8.9e-04 | identity: 45.57%
Query:  KEETESVVADDSKDDSEGEDESLFASSDEEAGSSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGGGG
        K E E V  ++ +  +E + E  F   +E A  S +  S     EVD+KAGEFIAKFREQI+LQ++ S E+  RGGG G
Subjt:  KEETESVVADDSKDDSEGEDESLFASSDEEAGSSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGGGG

AT4G16790.1 hydroxyproline-rich glycoprotein family protein | e-value: 9.2e-09 | identity: 34.15%
Query:  QPPANSCAQFLCKSIFFCFFLLLLPLFPSEAPHFVDHTLFTKFWELFHLMFVGIAVSYGLFSTRN-------NQMNVDEPRYS-SFENPQSYLSKMLYVA
        Q P    ++F+ K++       ++P+F S+ P   +    T+  EL HL+FVGIAVSYGLFS RN          N D  +   S  N  SY+ K+L V+
Subjt:  QPPANSCAQFLCKSIFFCFFLLLLPLFPSEAPHFVDHTLFTKFWELFHLMFVGIAVSYGLFSTRN-------NQMNVDEPRYS-SFENPQSYLSKMLYVA

Query:  SIFD-------DVDDFGVSDERK
        S+F+       +  D    D+RK
Subjt:  SIFD-------DVDDFGVSDERK

AT4G16790.1 hydroxyproline-rich glycoprotein family protein | e-value: 1.1e-01 | identity: 35.71%
Query:  WPD-VVNPNASNMNRFPKTTFLGIKEQKEETESVVADDSKDDSEGEDESLFASSDEEAGSSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEK
        W D +V     +  +  + + LG K   EE+E+   +  + ++E  DE      +EE  S +   S     +VDKKA EFIAKFREQI+LQR+ S+++
Subjt:  WPD-VVNPNASNMNRFPKTTFLGIKEQKEETESVVADDSKDDSEGEDESLFASSDEEAGSSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEK


Sequences
CDS sequence
ATGGCGTCTTCAGCTTCTAGCCCCTTCACGAAGCTCCATTTCCCCCATTCTCCACTTCCACAACCACCAGCCAACTCCTGCGCACAGTTTCTCTGTAAATCCATC
TTCTTCTGCTTTTTTCTCCTCCTCCTCCCTCTCTTCCCTTCCGAAGCTCCACATTTCGTCGATCACACTTTGTTCACCAAATTCTGGGAGCTTTTTCACCTCATG
TTCGTCGGCATTGCTGTTTCCTATGGTCTCTTCAGCACAAGGAACAACCAGATGAATGTAGACGAACCTCGCTACTCCAGTTTTGAGAATCCGCAGTCTTATTTG
TCTAAGATGCTTTACGTCGCTTCAATTTTTGATGATGTTGACGATTTTGGTGTTTCTGATGAAAGGAAAGTGAGTGAAGTTCTGTACATTCAGCCGAAACTTGGA
TCTGCGAGTGATTTCAATGCGCAATCTCGCCACCAGGAAAAACTCCGTTACTCAATACCGAAAAAAAGGTACGAAAACTCTTATGAATTTGCTGATACTGATAAT
GTCGCTCATGCTTGTAAATCGAGATATACTCGTGGTGGATCTGTGGTGGTTGTGCCTGAAACAAACCGTAGTTCATCAGGAGGCATTGTAAATTATAAACCTCTA
GGTTTGCCTGTTAGGAGTCTGAGATCGAGTCTTACTGAATCCGACGATGTCGAATTCGATTGTGGTGATGAATCTTGTTTGAGTTCTAAAAGTTCACCCAGAAGC
TCTGAGAATAATTGTGAAGGAAACAGTGAATTTGGTGATGATTGTTGTGTGAATTTGGAGGAGAAGTTTGATGAGACTGCAATTGCATCAATGTCCTCATTTCAA
TTGCGTGAGAAATTTGGAAAGAAGGTGATTAGAGAGAGAGGATTTGGGAATGCTGTTCTTCGCCCTTCCCATTTTAGACCTCCCTCCATTGATGAAACTCAATTT
GAATCACTGAAAAAATCAGGATCTCTTCATTCTAATCTATCTCAGTCATCACAAACTAGTTCCCTCTCTTCTTCGTTGTCATCGACGACGAGACAGCACCGTAAA
ATGTCGTCACTCAGTAACATTTCCTATAAGTCGTTGCATTCTCGACAATACAGTATGAGTTCTGTGTCTGAAAACAGTAGAGGGAGCTCTGAAGACCCTCTGATT
GAACAAGAAAACTCATCCGAGTGCAATGAATCCGTGGTGAGTTCGCCACGTTCGGACAGGAATTTCGCAAGTATTCCGAAAGCTTTATCCCAAGGAAAATCGGTT
CGAAGAATTCGAGCAAATGCAGCTGCCATGGAGGATATGAAAGCTCAAGAGATGCACAGAAAGCAAGTTAAACAAGATGACATTATAGGGAATAAGTTTGAAGAA
GGTGGAATGTCACCACCATATATGAGAGAAGATGGAACGGGACACGGATGGCCTGATGTTGTTAACCCGAATGCTAGTAATATGAATCGTTTTCCGAAGACGACG
TTCTTGGGGATTAAGGAGCAGAAGGAAGAGACAGAGAGTGTGGTGGCAGATGATAGTAAAGATGACTCTGAGGGGGAGGATGAAAGTTTGTTTGCAAGTTCAGAT
GAAGAAGCTGGTTCAAGTATGGCCGGAGATTCGGAGTCGGGGGCTTTCGAGGTCGACAAGAAGGCGGGCGAGTTCATAGCCAAGTTCAGGGAGCAAATACAGCTT
CAGAGGATGGCTTCAGTTGAAAAAAGATTGAGAGGAGGAGGAGGAGGAGGAGGGTGGGGGTCATTCAGCAGCACAAGCAGCAGCTATTTCAGTTGA
mRNA sequence
ATGGCGTCTTCAGCTTCTAGCCCCTTCACGAAGCTCCATTTCCCCCATTCTCCACTTCCACAACCACCAGCCAACTCCTGCGCACAGTTTCTCTGTAAATCCATC
TTCTTCTGCTTTTTTCTCCTCCTCCTCCCTCTCTTCCCTTCCGAAGCTCCACATTTCGTCGATCACACTTTGTTCACCAAATTCTGGGAGCTTTTTCACCTCATG
TTCGTCGGCATTGCTGTTTCCTATGGTCTCTTCAGCACAAGGAACAACCAGATGAATGTAGACGAACCTCGCTACTCCAGTTTTGAGAATCCGCAGTCTTATTTG
TCTAAGATGCTTTACGTCGCTTCAATTTTTGATGATGTTGACGATTTTGGTGTTTCTGATGAAAGGAAAGTGAGTGAAGTTCTGTACATTCAGCCGAAACTTGGA
TCTGCGAGTGATTTCAATGCGCAATCTCGCCACCAGGAAAAACTCCGTTACTCAATACCGAAAAAAAGGTACGAAAACTCTTATGAATTTGCTGATACTGATAAT
GTCGCTCATGCTTGTAAATCGAGATATACTCGTGGTGGATCTGTGGTGGTTGTGCCTGAAACAAACCGTAGTTCATCAGGAGGCATTGTAAATTATAAACCTCTA
GGTTTGCCTGTTAGGAGTCTGAGATCGAGTCTTACTGAATCCGACGATGTCGAATTCGATTGTGGTGATGAATCTTGTTTGAGTTCTAAAAGTTCACCCAGAAGC
TCTGAGAATAATTGTGAAGGAAACAGTGAATTTGGTGATGATTGTTGTGTGAATTTGGAGGAGAAGTTTGATGAGACTGCAATTGCATCAATGTCCTCATTTCAA
TTGCGTGAGAAATTTGGAAAGAAGGTGATTAGAGAGAGAGGATTTGGGAATGCTGTTCTTCGCCCTTCCCATTTTAGACCTCCCTCCATTGATGAAACTCAATTT
GAATCACTGAAAAAATCAGGATCTCTTCATTCTAATCTATCTCAGTCATCACAAACTAGTTCCCTCTCTTCTTCGTTGTCATCGACGACGAGACAGCACCGTAAA
ATGTCGTCACTCAGTAACATTTCCTATAAGTCGTTGCATTCTCGACAATACAGTATGAGTTCTGTGTCTGAAAACAGTAGAGGGAGCTCTGAAGACCCTCTGATT
GAACAAGAAAACTCATCCGAGTGCAATGAATCCGTGGTGAGTTCGCCACGTTCGGACAGGAATTTCGCAAGTATTCCGAAAGCTTTATCCCAAGGAAAATCGGTT
CGAAGAATTCGAGCAAATGCAGCTGCCATGGAGGATATGAAAGCTCAAGAGATGCACAGAAAGCAAGTTAAACAAGATGACATTATAGGGAATAAGTTTGAAGAA
GGTGGAATGTCACCACCATATATGAGAGAAGATGGAACGGGACACGGATGGCCTGATGTTGTTAACCCGAATGCTAGTAATATGAATCGTTTTCCGAAGACGACG
TTCTTGGGGATTAAGGAGCAGAAGGAAGAGACAGAGAGTGTGGTGGCAGATGATAGTAAAGATGACTCTGAGGGGGAGGATGAAAGTTTGTTTGCAAGTTCAGAT
GAAGAAGCTGGTTCAAGTATGGCCGGAGATTCGGAGTCGGGGGCTTTCGAGGTCGACAAGAAGGCGGGCGAGTTCATAGCCAAGTTCAGGGAGCAAATACAGCTT
CAGAGGATGGCTTCAGTTGAAAAAAGATTGAGAGGAGGAGGAGGAGGAGGAGGGTGGGGGTCATTCAGCAGCACAAGCAGCAGCTATTTCAGTTGA
Protein sequence
MASSASSPFTKLHFPHSPLPQPPANSCAQFLCKSIFFCFFLLLLPLFPSEAPHFVDHTLFTKFWELFHLMFVGIAVSYGLFSTRNNQMNVDEPRYSSFENPQSYL
SKMLYVASIFDDVDDFGVSDERKVSEVLYIQPKLGSASDFNAQSRHQEKLRYSIPKKRYENSYEFADTDNVAHACKSRYTRGGSVVVVPETNRSSSGGIVNYKPL
GLPVRSLRSSLTESDDVEFDCGDESCLSSKSSPRSSENNCEGNSEFGDDCCVNLEEKFDETAIASMSSFQLREKFGKKVIRERGFGNAVLRPSHFRPPSIDETQF
ESLKKSGSLHSNLSQSSQTSSLSSSLSSTTRQHRKMSSLSNISYKSLHSRQYSMSSVSENSRGSSEDPLIEQENSSECNESVVSSPRSDRNFASIPKALSQGKSV
RRIRANAAAMEDMKAQEMHRKQVKQDDIIGNKFEEGGMSPPYMREDGTGHGWPDVVNPNASNMNRFPKTTFLGIKEQKEETESVVADDSKDDSEGEDESLFASSD
EEAGSSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGGGGGGWGSFSSTSSSYFS
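The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code applies to this gene; the `translate` helper is hypothetical and is not part of any CuGenDB tooling.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate a CDS string into protein, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so the wrapped sequence lines
    can be pasted in as they appear. Codons outside the table (for example
    those containing N) are rendered as 'X'.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 33 nt and last 12 nt of the CDS listed above.
    print(translate("ATGGCGTCTTCAGCTTCTAGCCCCTTCACGAAG"))
    print(translate("TATTTCAGTTGA"))
```

Running `translate` on the full CDS and comparing the result with the protein sequence, after removing line breaks from both, is a quick way to confirm that the two records agree.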