CuGenDBv2

HG10023379 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10023379
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Protein of unknown function (DUF1666)
Genome location: Chr05:33570388..33572889
RNA-Seq Expression: HG10023379
Synteny: HG10023379
Gene Ontology terms: NA
InterPro domains: IPR012870 - Protein of unknown function DUF1666
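The genome location field above can be split into chromosome, start, and end for downstream use. The following is a minimal sketch, not part of the database record; the function names are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which the page does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'Chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("Chr05:33570388..33572889")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the locus spans 2502 bp.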


Homology
GenBank top hits (e value, % identity, alignment)
KAG7013928.1 hypothetical protein SDJN02_24097 [Cucurbita argyrosperma subsp. argyrosperma]; e value: 2.1e-180; % identity: 84.46
Query:  KEEDGTDSKCIAEAEA----------EEEEEDFIMEEVKRRLKELRRNSFMVLIPEEEEEEEGEEGEEGEEVGEAEPEWRDVEAEGRQWWGGFGAVYDDY
        KEEDG D KC+AEAEA          +E++EDFIMEEVKRRLKELRRNSFMVLIPEEEEEEE EE  EG E    EPEWRDVEAEGRQWWGGFGAVYDDY
Subjt:  KEEDGTDSKCIAEAEA----------EEEEEDFIMEEVKRRLKELRRNSFMVLIPEEEEEEEGEEGEEGEEVGEAEPEWRDVEAEGRQWWGGFGAVYDDY

Query:  CKRMFFFDRMSLRSGPDSTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTLIDSNHHIETAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHY
        C+RM FFDRMS +SG +S SQRSA KKSASPLRCLSLKRIEEPEDEMED+DPS  LIDSNHH+ETAYVAHICLSWEALHCQYTQLNHLISCQPQN TT Y
Subjt:  CKRMFFFDRMSLRSGPDSTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTLIDSNHHIETAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHY

Query:  NLTAQLFQQFQVLLQRFIENEPFQQALRPTLYARTRRTFPKMLHVPNIQASDPNGVQEQESDFLILAPDLLLIIEASIFTFHRFLKMDKKTSNSASLSFR
        NLTAQLFQQFQVLLQRFIENEPFQQALRP +YARTRRTFPKMLHVPNIQASD N  QEQESD LILA DLL+IIEASIFTFHRFLKMDKK S SASLS R
Subjt:  NLTAQLFQQFQVLLQRFIENEPFQQALRPTLYARTRRTFPKMLHVPNIQASDPNGVQEQESDFLILAPDLLLIIEASIFTFHRFLKMDKKTSNSASLSFR

Query:  NHTQDAALLARVRSSLDKKKTKLKEVRKKSRGWKQKTWPQTYEDMQLLFGIVDIKIISRLVKMSRTTKEQLLWCEEKMNKLDVSNGKLWRDPSPLLFPC
        N TQDA LLAR+RSSLDKKK KLKEVRKKSRGWKQKTWPQTYEDMQLLFG+VDIKIISRL+KM R +KEQLLWCEEK+ KLDVS+GKL RDPSPLLFPC
Subjt:  NHTQDAALLARVRSSLDKKKTKLKEVRKKSRGWKQKTWPQTYEDMQLLFGIVDIKIISRLVKMSRTTKEQLLWCEEKMNKLDVSNGKLWRDPSPLLFPC

XP_004141832.1 uncharacterized protein LOC101216166 [Cucumis sativus]; e value: 3.0e-235; % identity: 90.06
Query:  MVGISGSSSALDLIKKKLQDSGTSVASSPISAPAIAQLDVNLPRDVDVAVKALQLENNKDKPKDANGDGNVSDSFLDSEDVESGPTNEQLIIQFKEEDGT
        MVGISG SS L+LIKKKLQD+GT VASSPISAP  AQLD+NLPRDV+VA+KALQ EN KDKPK AN DGNVSDS LDSEDVESGPT+EQLIIQFKEEDGT
Subjt:  MVGISGSSSALDLIKKKLQDSGTSVASSPISAPAIAQLDVNLPRDVDVAVKALQLENNKDKPKDANGDGNVSDSFLDSEDVESGPTNEQLIIQFKEEDGT

Query:  DSKCIAEAEAEEEEEDFIMEEVKRRLKELRRNSFMVLIPEEEEEEEGEEGEEGEEVGEAEPEWRDVEAEGRQWWGGFGAVYDDYCKRMFFFDRMSLRSGP
        D K IAE EAEE+E+DFIMEEVKRRLKELRRNSFMVLIPEEEEEE   EG E EEVGE E EWRDVEAEGRQWWGGFGAVYDDYC+RM FFDR S+ SGP
Subjt:  DSKCIAEAEAEEEEEDFIMEEVKRRLKELRRNSFMVLIPEEEEEEEGEEGEEGEEVGEAEPEWRDVEAEGRQWWGGFGAVYDDYCKRMFFFDRMSLRSGP

Query:  DSTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTLIDSNHHIETAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQR
         STSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTLIDSNHHIE AYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQR
Subjt:  DSTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTLIDSNHHIETAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQR

Query:  FIENEPFQQALRPTLYARTRRTFPKMLHVPNIQASDPNGVQEQESDFLILAPDLLLIIEASIFTFHRFLKMDKKTSNSASLSFRNHTQDAALLARVRSSL
        FIENEPFQQALRPT+YARTRRT+PKMLHVPNIQASDPNGVQEQESD LILAPDLL IIEASIFTFHRFLKM+KKTS SASLSFRNHTQDAALLARVRSSL
Subjt:  FIENEPFQQALRPTLYARTRRTFPKMLHVPNIQASDPNGVQEQESDFLILAPDLLLIIEASIFTFHRFLKMDKKTSNSASLSFRNHTQDAALLARVRSSL

Query:  DKKKTKLKEVRKKSRGWKQKTWPQTYEDMQLLFGIVDIKIISRLVKMSRTTKEQLLWCEEKMNKLDVSNGKLWRDPSPLLFPC
        DKKKTKLKEVRKKS+GWKQKT PQTYEDMQLLFG+VDIKII+RL+KMSR TKEQLLWCEEKMNKLDVSNGKL RDPSPLLFPC
Subjt:  DKKKTKLKEVRKKSRGWKQKTWPQTYEDMQLLFGIVDIKIISRLVKMSRTTKEQLLWCEEKMNKLDVSNGKLWRDPSPLLFPC

XP_008462200.1 PREDICTED: uncharacterized protein LOC103500615 [Cucumis melo]; e value: 4.4e-234; % identity: 89.23
Query:  MVGISGSSSALDLIKKKLQDSGTSVASSPISAPAIAQLDVNLPRDVDVAVKALQLENNKDKPKDANGDGNVSDSFLDSEDVESGPTNEQLIIQFKEEDGT
        MVGISGSSS L+LIKKKLQD+GT VASSPISAP  AQLD+NL RDVDVA+KALQ+EN++DKPK AN DGNVSDS LDSEDVE+ PTNEQLI QFKEEDGT
Subjt:  MVGISGSSSALDLIKKKLQDSGTSVASSPISAPAIAQLDVNLPRDVDVAVKALQLENNKDKPKDANGDGNVSDSFLDSEDVESGPTNEQLIIQFKEEDGT

Query:  DSKCIAEAEAEEEEEDFIMEEVKRRLKELRRNSFMVLIPEEEEEEEGEEGEEGEEVGEAEPEWRDVEAEGRQWWGGFGAVYDDYCKRMFFFDRMSLRSGP
        D K IAE EAEE+E+DFIMEEVKRRLKELRRNSFMVLIPEEEEEE   EG E EEVGE EPEWRDVEAEGRQWWGGFGAVYDDYC+RM FFDR S+ SGP
Subjt:  DSKCIAEAEAEEEEEDFIMEEVKRRLKELRRNSFMVLIPEEEEEEEGEEGEEGEEVGEAEPEWRDVEAEGRQWWGGFGAVYDDYCKRMFFFDRMSLRSGP

Query:  DSTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTLIDSNHHIETAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQR
        +STSQRSASKKSASPLRCLSLKRIEEPEDEMED DP LT IDSNHHIE AYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQR
Subjt:  DSTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTLIDSNHHIETAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQR

Query:  FIENEPFQQALRPTLYARTRRTFPKMLHVPNIQASDPNGVQEQESDFLILAPDLLLIIEASIFTFHRFLKMDKKTSNSASLSFRNHTQDAALLARVRSSL
        FIENEPFQQALRPT+YARTRRTFPKMLHVPNIQASDPNGVQEQESD LILAPDLL I+EASIFTFHRFLKM+KKTSNSASLSFRNHTQDAAL ARVRSSL
Subjt:  FIENEPFQQALRPTLYARTRRTFPKMLHVPNIQASDPNGVQEQESDFLILAPDLLLIIEASIFTFHRFLKMDKKTSNSASLSFRNHTQDAALLARVRSSL

Query:  DKKKTKLKEVRKKSRGWKQKTWPQTYEDMQLLFGIVDIKIISRLVKMSRTTKEQLLWCEEKMNKLDVSNGKLWRDPSPLLFPC
        DKKKTKLKEVRKKS+GWKQKT PQTYEDMQLLFG+VDIKII+RL+KMSR TKEQLLWCEEKMNKLDVSNGKL RDPSPLLFPC
Subjt:  DKKKTKLKEVRKKSRGWKQKTWPQTYEDMQLLFGIVDIKIISRLVKMSRTTKEQLLWCEEKMNKLDVSNGKLWRDPSPLLFPC

XP_022954221.1 uncharacterized protein LOC111456540 [Cucurbita moschata]; e value: 9.6e-181; % identity: 84.71
Query:  KEEDGTDSKCIAEAEA----------EEEEEDFIMEEVKRRLKELRRNSFMVLIPEEEEEEEGEEGEEGEEVGEAEPEWRDVEAEGRQWWGGFGAVYDDY
        KEEDG D KC+AEAEA          +E++EDFIMEEVKRRLKELRRNSFMVLIPEEEEEEE EE  EG E    EPEWRDVEAEGRQWWGGFGAVYDDY
Subjt:  KEEDGTDSKCIAEAEA----------EEEEEDFIMEEVKRRLKELRRNSFMVLIPEEEEEEEGEEGEEGEEVGEAEPEWRDVEAEGRQWWGGFGAVYDDY

Query:  CKRMFFFDRMSLRSGPDSTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTLIDSNHHIETAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHY
        C+RM FFDRMS +SG +S SQRSA KKSASPLRCLSLKRIEEPEDEMED+DPS  LIDSNHH+ETAYVAHICLSWEALHCQYTQLNHLISCQPQN TT Y
Subjt:  CKRMFFFDRMSLRSGPDSTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTLIDSNHHIETAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHY

Query:  NLTAQLFQQFQVLLQRFIENEPFQQALRPTLYARTRRTFPKMLHVPNIQASDPNGVQEQESDFLILAPDLLLIIEASIFTFHRFLKMDKKTSNSASLSFR
        NLTAQLFQQFQVLLQRFIENEPFQQALRP +YARTRRTFPKMLHVPNIQASD N  QEQESD LILA DLL+IIEASIFTFHRFLKMDKK S SASLS R
Subjt:  NLTAQLFQQFQVLLQRFIENEPFQQALRPTLYARTRRTFPKMLHVPNIQASDPNGVQEQESDFLILAPDLLLIIEASIFTFHRFLKMDKKTSNSASLSFR

Query:  NHTQDAALLARVRSSLDKKKTKLKEVRKKSRGWKQKTWPQTYEDMQLLFGIVDIKIISRLVKMSRTTKEQLLWCEEKMNKLDVSNGKLWRDPSPLLFPC
        N TQDA LLAR+RSSLDKKK KLKEVRKKSRGWKQKTWPQTYEDMQLLFG+VDIKIISRLVKM R +KEQLLWCEEK+ KLDVS+GKL RDPSPLLFPC
Subjt:  NHTQDAALLARVRSSLDKKKTKLKEVRKKSRGWKQKTWPQTYEDMQLLFGIVDIKIISRLVKMSRTTKEQLLWCEEKMNKLDVSNGKLWRDPSPLLFPC

XP_038898126.1 uncharacterized protein LOC120085908 isoform X1 [Benincasa hispida]; e value: 6.3e-241; % identity: 91.93
Query:  MVGISGSSSALDLIKKKLQDSGTSVASSPISAPAIAQLDVNLPRDVDVAVKALQLENNKDKPKDANGDGNVSDSFLDSEDVESGPTNEQLIIQFKEEDGT
        MVGISGSSSALDLIKKKL DSGT VASSPISA  IAQLDVNLPRDVDVAVKALQ+EN K+KPKDANGDGNVS+S LDSE+V+SGPTNEQL IQF EEDG 
Subjt:  MVGISGSSSALDLIKKKLQDSGTSVASSPISAPAIAQLDVNLPRDVDVAVKALQLENNKDKPKDANGDGNVSDSFLDSEDVESGPTNEQLIIQFKEEDGT

Query:  DSKCIAEAEAEEEEEDFIMEEVKRRLKELRRNSFMVLIPEEEEEEEGEEGEEGEEVGEAEPEWRDVEAEGRQWWGGFGAVYDDYCKRMFFFDRMSLRSGP
        D KC AE EAEE+EEDFIMEEVKRRLKELRRNSFMVLIPEEEEEEE EEGEE  EVGE +PEWRDVEAEGRQWWGGFGAVYDDYC+RMFFFDRMS+RSGP
Subjt:  DSKCIAEAEAEEEEEDFIMEEVKRRLKELRRNSFMVLIPEEEEEEEGEEGEEGEEVGEAEPEWRDVEAEGRQWWGGFGAVYDDYCKRMFFFDRMSLRSGP

Query:  DSTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTLIDSNHHIETAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQR
        +STS+RSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTLIDSN+HIETAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQR
Subjt:  DSTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTLIDSNHHIETAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQR

Query:  FIENEPFQQALRPTLYARTRRTFPKMLHVPNIQASDPNGVQEQESDFLILAPDLLLIIEASIFTFHRFLKMDKKTSNSASLSFRNHTQDAALLARVRSSL
        FIENEPFQQ LRPT+YARTRRTFPKMLHVPNIQASDPNGVQEQESD LILAPDLLLIIEASIFTFHRFLKM+KKTSNSASLSF+NHTQDAALLARVRSSL
Subjt:  FIENEPFQQALRPTLYARTRRTFPKMLHVPNIQASDPNGVQEQESDFLILAPDLLLIIEASIFTFHRFLKMDKKTSNSASLSFRNHTQDAALLARVRSSL

Query:  DKKKTKLKEVRKKSRGWKQKTWPQTYEDMQLLFGIVDIKIISRLVKMSRTTKEQLLWCEEKMNKLDVSNGKLWRDPSPLLFPC
        DKKKTKLKEVRKKSRGWKQKT PQTYEDMQLLFGIVDIKIISRLVKMSR TKEQLLWCEEK+NKLD+SNGKL RD SPLLFPC
Subjt:  DKKKTKLKEVRKKSRGWKQKTWPQTYEDMQLLFGIVDIKIISRLVKMSRTTKEQLLWCEEKMNKLDVSNGKLWRDPSPLLFPC

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K6Q7 Uncharacterized protein; e value: 1.5e-235; % identity: 90.06
Query:  MVGISGSSSALDLIKKKLQDSGTSVASSPISAPAIAQLDVNLPRDVDVAVKALQLENNKDKPKDANGDGNVSDSFLDSEDVESGPTNEQLIIQFKEEDGT
        MVGISG SS L+LIKKKLQD+GT VASSPISAP  AQLD+NLPRDV+VA+KALQ EN KDKPK AN DGNVSDS LDSEDVESGPT+EQLIIQFKEEDGT
Subjt:  MVGISGSSSALDLIKKKLQDSGTSVASSPISAPAIAQLDVNLPRDVDVAVKALQLENNKDKPKDANGDGNVSDSFLDSEDVESGPTNEQLIIQFKEEDGT

Query:  DSKCIAEAEAEEEEEDFIMEEVKRRLKELRRNSFMVLIPEEEEEEEGEEGEEGEEVGEAEPEWRDVEAEGRQWWGGFGAVYDDYCKRMFFFDRMSLRSGP
        D K IAE EAEE+E+DFIMEEVKRRLKELRRNSFMVLIPEEEEEE   EG E EEVGE E EWRDVEAEGRQWWGGFGAVYDDYC+RM FFDR S+ SGP
Subjt:  DSKCIAEAEAEEEEEDFIMEEVKRRLKELRRNSFMVLIPEEEEEEEGEEGEEGEEVGEAEPEWRDVEAEGRQWWGGFGAVYDDYCKRMFFFDRMSLRSGP

Query:  DSTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTLIDSNHHIETAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQR
         STSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTLIDSNHHIE AYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQR
Subjt:  DSTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTLIDSNHHIETAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQR

Query:  FIENEPFQQALRPTLYARTRRTFPKMLHVPNIQASDPNGVQEQESDFLILAPDLLLIIEASIFTFHRFLKMDKKTSNSASLSFRNHTQDAALLARVRSSL
        FIENEPFQQALRPT+YARTRRT+PKMLHVPNIQASDPNGVQEQESD LILAPDLL IIEASIFTFHRFLKM+KKTS SASLSFRNHTQDAALLARVRSSL
Subjt:  FIENEPFQQALRPTLYARTRRTFPKMLHVPNIQASDPNGVQEQESDFLILAPDLLLIIEASIFTFHRFLKMDKKTSNSASLSFRNHTQDAALLARVRSSL

Query:  DKKKTKLKEVRKKSRGWKQKTWPQTYEDMQLLFGIVDIKIISRLVKMSRTTKEQLLWCEEKMNKLDVSNGKLWRDPSPLLFPC
        DKKKTKLKEVRKKS+GWKQKT PQTYEDMQLLFG+VDIKII+RL+KMSR TKEQLLWCEEKMNKLDVSNGKL RDPSPLLFPC
Subjt:  DKKKTKLKEVRKKSRGWKQKTWPQTYEDMQLLFGIVDIKIISRLVKMSRTTKEQLLWCEEKMNKLDVSNGKLWRDPSPLLFPC

A0A1S3CGF8 uncharacterized protein LOC103500615; e value: 2.1e-234; % identity: 89.23
Query:  MVGISGSSSALDLIKKKLQDSGTSVASSPISAPAIAQLDVNLPRDVDVAVKALQLENNKDKPKDANGDGNVSDSFLDSEDVESGPTNEQLIIQFKEEDGT
        MVGISGSSS L+LIKKKLQD+GT VASSPISAP  AQLD+NL RDVDVA+KALQ+EN++DKPK AN DGNVSDS LDSEDVE+ PTNEQLI QFKEEDGT
Subjt:  MVGISGSSSALDLIKKKLQDSGTSVASSPISAPAIAQLDVNLPRDVDVAVKALQLENNKDKPKDANGDGNVSDSFLDSEDVESGPTNEQLIIQFKEEDGT

Query:  DSKCIAEAEAEEEEEDFIMEEVKRRLKELRRNSFMVLIPEEEEEEEGEEGEEGEEVGEAEPEWRDVEAEGRQWWGGFGAVYDDYCKRMFFFDRMSLRSGP
        D K IAE EAEE+E+DFIMEEVKRRLKELRRNSFMVLIPEEEEEE   EG E EEVGE EPEWRDVEAEGRQWWGGFGAVYDDYC+RM FFDR S+ SGP
Subjt:  DSKCIAEAEAEEEEEDFIMEEVKRRLKELRRNSFMVLIPEEEEEEEGEEGEEGEEVGEAEPEWRDVEAEGRQWWGGFGAVYDDYCKRMFFFDRMSLRSGP

Query:  DSTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTLIDSNHHIETAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQR
        +STSQRSASKKSASPLRCLSLKRIEEPEDEMED DP LT IDSNHHIE AYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQR
Subjt:  DSTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTLIDSNHHIETAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQR

Query:  FIENEPFQQALRPTLYARTRRTFPKMLHVPNIQASDPNGVQEQESDFLILAPDLLLIIEASIFTFHRFLKMDKKTSNSASLSFRNHTQDAALLARVRSSL
        FIENEPFQQALRPT+YARTRRTFPKMLHVPNIQASDPNGVQEQESD LILAPDLL I+EASIFTFHRFLKM+KKTSNSASLSFRNHTQDAAL ARVRSSL
Subjt:  FIENEPFQQALRPTLYARTRRTFPKMLHVPNIQASDPNGVQEQESDFLILAPDLLLIIEASIFTFHRFLKMDKKTSNSASLSFRNHTQDAALLARVRSSL

Query:  DKKKTKLKEVRKKSRGWKQKTWPQTYEDMQLLFGIVDIKIISRLVKMSRTTKEQLLWCEEKMNKLDVSNGKLWRDPSPLLFPC
        DKKKTKLKEVRKKS+GWKQKT PQTYEDMQLLFG+VDIKII+RL+KMSR TKEQLLWCEEKMNKLDVSNGKL RDPSPLLFPC
Subjt:  DKKKTKLKEVRKKSRGWKQKTWPQTYEDMQLLFGIVDIKIISRLVKMSRTTKEQLLWCEEKMNKLDVSNGKLWRDPSPLLFPC

A0A5A7UTV1 DUF1666 domain-containing protein; e value: 2.1e-234; % identity: 89.23
Query:  MVGISGSSSALDLIKKKLQDSGTSVASSPISAPAIAQLDVNLPRDVDVAVKALQLENNKDKPKDANGDGNVSDSFLDSEDVESGPTNEQLIIQFKEEDGT
        MVGISGSSS L+LIKKKLQD+GT VASSPISAP  AQLD+NL RDVDVA+KALQ+EN++DKPK AN DGNVSDS LDSEDVE+ PTNEQLI QFKEEDGT
Subjt:  MVGISGSSSALDLIKKKLQDSGTSVASSPISAPAIAQLDVNLPRDVDVAVKALQLENNKDKPKDANGDGNVSDSFLDSEDVESGPTNEQLIIQFKEEDGT

Query:  DSKCIAEAEAEEEEEDFIMEEVKRRLKELRRNSFMVLIPEEEEEEEGEEGEEGEEVGEAEPEWRDVEAEGRQWWGGFGAVYDDYCKRMFFFDRMSLRSGP
        D K IAE EAEE+E+DFIMEEVKRRLKELRRNSFMVLIPEEEEEE   EG E EEVGE EPEWRDVEAEGRQWWGGFGAVYDDYC+RM FFDR S+ SGP
Subjt:  DSKCIAEAEAEEEEEDFIMEEVKRRLKELRRNSFMVLIPEEEEEEEGEEGEEGEEVGEAEPEWRDVEAEGRQWWGGFGAVYDDYCKRMFFFDRMSLRSGP

Query:  DSTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTLIDSNHHIETAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQR
        +STSQRSASKKSASPLRCLSLKRIEEPEDEMED DP LT IDSNHHIE AYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQR
Subjt:  DSTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTLIDSNHHIETAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQR

Query:  FIENEPFQQALRPTLYARTRRTFPKMLHVPNIQASDPNGVQEQESDFLILAPDLLLIIEASIFTFHRFLKMDKKTSNSASLSFRNHTQDAALLARVRSSL
        FIENEPFQQALRPT+YARTRRTFPKMLHVPNIQASDPNGVQEQESD LILAPDLL I+EASIFTFHRFLKM+KKTSNSASLSFRNHTQDAAL ARVRSSL
Subjt:  FIENEPFQQALRPTLYARTRRTFPKMLHVPNIQASDPNGVQEQESDFLILAPDLLLIIEASIFTFHRFLKMDKKTSNSASLSFRNHTQDAALLARVRSSL

Query:  DKKKTKLKEVRKKSRGWKQKTWPQTYEDMQLLFGIVDIKIISRLVKMSRTTKEQLLWCEEKMNKLDVSNGKLWRDPSPLLFPC
        DKKKTKLKEVRKKS+GWKQKT PQTYEDMQLLFG+VDIKII+RL+KMSR TKEQLLWCEEKMNKLDVSNGKL RDPSPLLFPC
Subjt:  DKKKTKLKEVRKKSRGWKQKTWPQTYEDMQLLFGIVDIKIISRLVKMSRTTKEQLLWCEEKMNKLDVSNGKLWRDPSPLLFPC

A0A6J1GQB8 uncharacterized protein LOC111456540; e value: 4.6e-181; % identity: 84.71
Query:  KEEDGTDSKCIAEAEA----------EEEEEDFIMEEVKRRLKELRRNSFMVLIPEEEEEEEGEEGEEGEEVGEAEPEWRDVEAEGRQWWGGFGAVYDDY
        KEEDG D KC+AEAEA          +E++EDFIMEEVKRRLKELRRNSFMVLIPEEEEEEE EE  EG E    EPEWRDVEAEGRQWWGGFGAVYDDY
Subjt:  KEEDGTDSKCIAEAEA----------EEEEEDFIMEEVKRRLKELRRNSFMVLIPEEEEEEEGEEGEEGEEVGEAEPEWRDVEAEGRQWWGGFGAVYDDY

Query:  CKRMFFFDRMSLRSGPDSTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTLIDSNHHIETAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHY
        C+RM FFDRMS +SG +S SQRSA KKSASPLRCLSLKRIEEPEDEMED+DPS  LIDSNHH+ETAYVAHICLSWEALHCQYTQLNHLISCQPQN TT Y
Subjt:  CKRMFFFDRMSLRSGPDSTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTLIDSNHHIETAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHY

Query:  NLTAQLFQQFQVLLQRFIENEPFQQALRPTLYARTRRTFPKMLHVPNIQASDPNGVQEQESDFLILAPDLLLIIEASIFTFHRFLKMDKKTSNSASLSFR
        NLTAQLFQQFQVLLQRFIENEPFQQALRP +YARTRRTFPKMLHVPNIQASD N  QEQESD LILA DLL+IIEASIFTFHRFLKMDKK S SASLS R
Subjt:  NLTAQLFQQFQVLLQRFIENEPFQQALRPTLYARTRRTFPKMLHVPNIQASDPNGVQEQESDFLILAPDLLLIIEASIFTFHRFLKMDKKTSNSASLSFR

Query:  NHTQDAALLARVRSSLDKKKTKLKEVRKKSRGWKQKTWPQTYEDMQLLFGIVDIKIISRLVKMSRTTKEQLLWCEEKMNKLDVSNGKLWRDPSPLLFPC
        N TQDA LLAR+RSSLDKKK KLKEVRKKSRGWKQKTWPQTYEDMQLLFG+VDIKIISRLVKM R +KEQLLWCEEK+ KLDVS+GKL RDPSPLLFPC
Subjt:  NHTQDAALLARVRSSLDKKKTKLKEVRKKSRGWKQKTWPQTYEDMQLLFGIVDIKIISRLVKMSRTTKEQLLWCEEKMNKLDVSNGKLWRDPSPLLFPC

A0A6J1JYQ1 uncharacterized protein LOC111488637; e value: 1.8e-180; % identity: 84.46
Query:  KEEDGTDSKCIAEAEA----------EEEEEDFIMEEVKRRLKELRRNSFMVLIPEEEEEEEGEEGEEGEEVGEAEPEWRDVEAEGRQWWGGFGAVYDDY
        KEEDG D KC+AEAEA          +E++EDFIMEEVKRRLKELRRNSFMVLIPEEEEEEE EE  EG E    EPEWRDVEAEGRQWWGGFGAVYDDY
Subjt:  KEEDGTDSKCIAEAEA----------EEEEEDFIMEEVKRRLKELRRNSFMVLIPEEEEEEEGEEGEEGEEVGEAEPEWRDVEAEGRQWWGGFGAVYDDY

Query:  CKRMFFFDRMSLRSGPDSTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTLIDSNHHIETAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHY
        C+RM FFDR+S +SG +S SQRSA KKSASPLRCLSLKRIEEPEDEMED+DPS TL DSNHH+ETAYVAHICLSWEALHCQYTQLNHLISCQPQN TT Y
Subjt:  CKRMFFFDRMSLRSGPDSTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTLIDSNHHIETAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHY

Query:  NLTAQLFQQFQVLLQRFIENEPFQQALRPTLYARTRRTFPKMLHVPNIQASDPNGVQEQESDFLILAPDLLLIIEASIFTFHRFLKMDKKTSNSASLSFR
        NLTAQLFQQFQVLLQRFIENEPFQQALRP +YARTRRTFPKMLHVPNIQASD N  QEQESD LILA DLL+IIEASIFTFHRFLKMDKK S SASLS R
Subjt:  NLTAQLFQQFQVLLQRFIENEPFQQALRPTLYARTRRTFPKMLHVPNIQASDPNGVQEQESDFLILAPDLLLIIEASIFTFHRFLKMDKKTSNSASLSFR

Query:  NHTQDAALLARVRSSLDKKKTKLKEVRKKSRGWKQKTWPQTYEDMQLLFGIVDIKIISRLVKMSRTTKEQLLWCEEKMNKLDVSNGKLWRDPSPLLFPC
        N TQDA LLAR+RSSLDKKK KLKEVRKKSRGWKQKTWPQTYEDMQLLFG+VDIKIISRLVKM R +KEQLLWCEEK+ KLDVS+GKL RDPSPLLFPC
Subjt:  NHTQDAALLARVRSSLDKKKTKLKEVRKKSRGWKQKTWPQTYEDMQLLFGIVDIKIISRLVKMSRTTKEQLLWCEEKMNKLDVSNGKLWRDPSPLLFPC

SwissProt top hits (e value, % identity, alignment)
Q9LT25 Pre-mRNA-processing protein 40C; e value: 2.7e-08; % identity: 33.8
Query:  QVSSRQIPNE--------DEKTKEHLAPLRNNKAFTDLGSSPININTPAINTGGCEATPLRMVGISGSSSALDLIKKKLQDSGTSVASSPISAPAIAQLD
        +VSS QIP E        +E+  E +A +  +   T+ GS   +++ PAI+ GG +A  L+       SSALDL+KKKL DSG  V+S+  S       +
Subjt:  QVSSRQIPNE--------DEKTKEHLAPLRNNKAFTDLGSSPININTPAINTGGCEATPLRMVGISGSSSALDLIKKKLQDSGTSVASSPISAPAIAQLD

Query:  VNLPRDVDVAVKALQLENNKDKPKDANGDGNVSDSFLDSEDVESGPTNEQLIIQFKEEDGTDSKCIAEAEAEEEEEDFIMEEVKRRL---KELRRNSFMV
         N  +  +V     +  N+  K KDA G G +SDS  DSED +SGP+ E+   QFKE      + IA     E+E   I+ + + +      +RR+ F  
Subjt:  VNLPRDVDVAVKALQLENNKDKPKDANGDGNVSDSFLDSEDVESGPTNEQLIIQFKEEDGTDSKCIAEAEAEEEEEDFIMEEVKRRL---KELRRNSFMV

Query:  LIPEEEEEEEGEE
         +    EEE  E+
Subjt:  LIPEEEEEEEGEE

Arabidopsis top hits (e value, % identity, alignment)
AT1G69610.1 Protein of unknown function (DUF1666); e value: 4.5e-27; % identity: 28.18
Query:  DSFLDSEDVESGPTNEQLIIQFKEEDGTDSKCIAEAEAEEEEEDFIMEEVKRRLKELRRNSFMVLIPEEEEEEEGEEGEEGEEVGEAEPEWRDVEAEGRQ
        + F+  E+      +EQ  +  + +DG+DS      + +E E   ++E++K  L+  R      ++   EE E   +  +  ++     + +D  AE   
Subjt:  DSFLDSEDVESGPTNEQLIIQFKEEDGTDSKCIAEAEAEEEEEDFIMEEVKRRLKELRRNSFMVLIPEEEEEEEGEEGEEGEEVGEAEPEWRDVEAEGRQ

Query:  WWGGFGAVYDDYCKRMFFFD--------RMSLRSGPDSTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTLI-DSNHHIETAYVAHICLSWEALH
               VY +Y  +M   D         +SL    DS+     S+ +  P +    + I   +    + DPS  L+ +++   ET YV  +CLSWE L 
Subjt:  WWGGFGAVYDDYCKRMFFFD--------RMSLRSGPDSTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTLI-DSNHHIETAYVAHICLSWEALH

Query:  CQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQRFIENEPFQQALRPTLYARTRRTFPKMLHVPNIQASDPNGVQ-EQESDFLILAPDLLLIIEASI
         QY   + ++    Q +T  YNL A  FQ FQVLLQRF+ENEPFQ + R   Y + RR F   L +P ++    +  +   E +F +    L  II  S+
Subjt:  CQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQRFIENEPFQQALRPTLYARTRRTFPKMLHVPNIQASDPNGVQ-EQESDFLILAPDLLLIIEASI

Query:  FTFHRFLKMDK-------KTSNSASLSFRNHTQDAALLARVRSSLDKKKTKLKEVRKKSRGWKQKTWPQTYEDM-----QLLFGIVDIKIISRLVKMSRT
          F  FL  DK       K S+   +S ++ + D  LL  +R+ L KK+ KLKE+++      +K      +       +LL   ++++++SR++ MS+ 
Subjt:  FTFHRFLKMDK-------KTSNSASLSFRNHTQDAALLARVRSSLDKKKTKLKEVRKKSRGWKQKTWPQTYEDM-----QLLFGIVDIKIISRLVKMSRT

Query:  TKEQLLWCEEKMNKLDVSNGKLWRDPSPLLFPC
        T E+L WC+EK+ K+  +  K+  +P   L PC
Subjt:  TKEQLLWCEEKMNKLDVSNGKLWRDPSPLLFPC

AT1G73850.1 Protein of unknown function (DUF1666); e value: 5.1e-31; % identity: 30.87
Query:  IPEEEEEEEGE---EGEEGEEVGEAEPEWRD-------VEAEGRQ---WWGGFGAVYDDYCKRMFFFDRMSLRSGPDSTSQRSASKKSASPLRCLSLKRI
        + EEEEEE G+   E        ++  EWR+            R+    W  +  V+  Y + M F  R+S +   ++ S +S   +  S    +  K  
Subjt:  IPEEEEEEEGE---EGEEGEEVGEAEPEWRD-------VEAEGRQ---WWGGFGAVYDDYCKRMFFFDRMSLRSGPDSTSQRSASKKSASPLRCLSLKRI

Query:  EEPEDEMEDVDPSLTLIDSNHHI--ETAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLT--------AQLFQQFQVLLQRFIENEPFQQALRPT
             + +   P       N ++  E+AYVA ICL+WEAL   Y         + + STT  +          A  F+ F +LLQR++ENEP++   RP 
Subjt:  EEPEDEMEDVDPSLTLIDSNHHI--ETAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLT--------AQLFQQFQVLLQRFIENEPFQQALRPT

Query:  LYARTRRTFPKMLHVPNIQASDPNGVQEQESD----FLILAPDLLLIIEASIFTFHRFLKMDK-KTSNSASLSFRNHTQ----DAALLARVRSSLDKKKT
        +YAR R   PK+L VP  Q  +    +E E++      I +   L+I+E  I TF  FL+ DK K       +F   ++    D  L+  ++    KKKT
Subjt:  LYARTRRTFPKMLHVPNIQASDPNGVQEQESD----FLILAPDLLLIIEASIFTFHRFLKMDK-KTSNSASLSFRNHTQ----DAALLARVRSSLDKKKT

Query:  KLKEVRKKSRGWKQKTWPQTYEDMQLLFGIVDIKIISRLVKMSRTTKEQLLWCEEKMNKLDVSNG--KLWRDPSPLLFP
        KLKE+R+  +  ++K      E+M++L G++D+K++SR+++M+   +E L WCEEKM+K+ +  G   L RD +PL FP
Subjt:  KLKEVRKKSRGWKQKTWPQTYEDMQLLFGIVDIKIISRLVKMSRTTKEQLLWCEEKMNKLDVSNG--KLWRDPSPLLFP

AT3G20260.1 Protein of unknown function (DUF1666); e value: 3.5e-112; % identity: 56.09
Query:  EAEEEEEDFIMEEVKRRLKELRRNSFMVLIPEEEEEEE-----GEEGEEGEEVGEAEPEWRDVEAEGRQWWGGFGAVYDDYCKRMFFFDRMS--------
        E E++++DFI  EVKRRLKELRRNSFMVLIPEEEEEEE      E+ ++GE+  +   EWRDV AEG QWWGGF AVY+ YC+RM FFDR+S        
Subjt:  EAEEEEEDFIMEEVKRRLKELRRNSFMVLIPEEEEEEE-----GEEGEEGEEVGEAEPEWRDVEAEGRQWWGGFGAVYDDYCKRMFFFDRMS--------

Query:  --LRSGPDSTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTLIDSNHHIETAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQ
          +   P + S RSASKK +SP RCLSLK+ + PE+++E + P+  + D    +ETAYVA +CL+WEALHCQYTQL+HLISCQP+  T  YN TAQLFQQ
Subjt:  --LRSGPDSTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTLIDSNHHIETAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQ

Query:  FQVLLQRFIENEPFQQALRPTLYARTRRTFPKMLHVPNIQASDPNGVQEQESDFLILAPDLLLIIEASIFTFHRFLKMDKKTSNSASLSF----RNHTQD
        F VLLQR+IENEPF+Q  R  LYAR R   PK+L  P IQ SD   + E+++ F++LA DL+ +IE+SI TF+ FLKMDKK  N     F     NH   
Subjt:  FQVLLQRFIENEPFQQALRPTLYARTRRTFPKMLHVPNIQASDPNGVQEQESDFLILAPDLLLIIEASIFTFHRFLKMDKKTSNSASLSF----RNHTQD

Query:  AALLARVRSSLDKKKTKLKEVRKKSRGWKQKTWPQTYEDMQLLFGIVDIKIISRLVKMSRTTKEQLLWCEEKMNKLDVSNGKLWRDPSPLLFPC
           L  V+SS+DKK+ K KE+ KK++G ++K+WPQT+E +QLLF  +DIK+ +R+++MS+ +KEQLLWCEEKM KL+ S GKL R PSP+LFPC
Subjt:  AALLARVRSSLDKKKTKLKEVRKKSRGWKQKTWPQTYEDMQLLFGIVDIKIISRLVKMSRTTKEQLLWCEEKMNKLDVSNGKLWRDPSPLLFPC

AT5G39785.1 Protein of unknown function (DUF1666); e value: 5.1e-31; % identity: 28.48
Query:  DGNVSDSFLDSEDVESGPTNEQLIIQFKEEDGTDSKCIAEAEAEEEEEDF--------IMEEVKRRLKELRRNSFMVLIPEEEEEEEGEEGEEGEEVGEA
        DG +SDS      ++ G        Q ++ D + S   +E E EE+   F        ++E++K  +K+++    +  I EEEEE+     ++  ++ E 
Subjt:  DGNVSDSFLDSEDVESGPTNEQLIIQFKEEDGTDSKCIAEAEAEEEEEDF--------IMEEVKRRLKELRRNSFMVLIPEEEEEEEGEEGEEGEEVGEA

Query:  EPEWRDVEAEGRQWWGGFGAVYD---DYCKRMFFFDRMSLR----------SGPDSTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTLI-DSNH
           WR  E +  +     G V+     Y +RM   D +S +            P   +    S  S +    +    I   + +  +++P +  + +   
Subjt:  EPEWRDVEAEGRQWWGGFGAVYD---DYCKRMFFFDRMSLR----------SGPDSTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTLI-DSNH

Query:  HIETAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQRFIENEPFQQALRPTLYARTRRTFPKMLHVPNIQ----ASDPNGVQ
         +E  YV  +CLSWE LH QY +   L+      S   YN  A  FQQFQVLLQRF+ENEPF++  R   Y + R     +L +P I+        NG +
Subjt:  HIETAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQRFIENEPFQQALRPTLYARTRRTFPKMLHVPNIQ----ASDPNGVQ

Query:  ---EQESDFLILAPDLLLIIEASIFTFHRFLKMDKKTSNSASLSFRNHTQ----------DAALLARVRSSLDKKKTKLKEVRKKS-----RGWKQKTWP
           E+ +D +I +  L+ I+E +I  F RF++ DK TS+      R  +Q          D  + A V+S L  K+ +L++V K       R  K K   
Subjt:  ---EQESDFLILAPDLLLIIEASIFTFHRFLKMDKKTSNSASLSFRNHTQ----------DAALLARVRSSLDKKKTKLKEVRKKS-----RGWKQKTWP

Query:  QTYEDMQLLFGIVDIKIISRLVKMSRTTKEQLLWCEEKMNKLDVSNGKLWRDPSPLLFPC
         T + +   F  VD+K+++R++ MS+ T++ L+WC  K+ K++  N +L  DPS  LFPC
Subjt:  QTYEDMQLLFGIVDIKIISRLVKMSRTTKEQLLWCEEKMNKLDVSNGKLWRDPSPLLFPC

AT5G39785.2 Protein of unknown function (DUF1666); e value: 4.8e-29; % identity: 28.2
Query:  DGNVSDSFLDSEDVESGPTNEQLIIQFKEEDGTDSKCIAEAEAEEEEEDF--------IMEEVKRRLKELRRNSFMVLIPEEEEEEEGEEGEEGEEVGEA
        DG +SDS      ++ G        Q ++ D + S   +E E EE+   F        ++E++K  +K+++    +  I EEEEE+     ++  ++ E 
Subjt:  DGNVSDSFLDSEDVESGPTNEQLIIQFKEEDGTDSKCIAEAEAEEEEEDF--------IMEEVKRRLKELRRNSFMVLIPEEEEEEEGEEGEEGEEVGEA

Query:  EPEWRDVEAEGRQWWGGFGAVYD---DYCKRMFFFDRMSLR----------SGPDSTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTLI-DSNH
           WR  E +  +     G V+     Y +RM   D +S +            P   +    S  S +    +    I   + +  +++P +  + +   
Subjt:  EPEWRDVEAEGRQWWGGFGAVYD---DYCKRMFFFDRMSLR----------SGPDSTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTLI-DSNH

Query:  HIETAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQRFIENEPFQQALRPTLYARTRRTFPKMLHVPNIQ----ASDPNGVQ
         +E  YV  +CLSWE LH QY +   L+      S   YN  A  FQQFQVLLQRF+ENEPF++  R   Y + R     +L +P I+        NG +
Subjt:  HIETAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQRFIENEPFQQALRPTLYARTRRTFPKMLHVPNIQ----ASDPNGVQ

Query:  ---EQESDFLILAPDLLLIIEASIFTFHRFLKMDKKTSNSASLSFRNHTQ----------DAALLARVRSSLDK-KKTKLKEVRKKS-----RGWKQKTW
           E+ +D +I +  L+ I+E +I  F RF++ DK TS+      R  +Q          D  + A V+S L    + +L++V K       R  K K  
Subjt:  ---EQESDFLILAPDLLLIIEASIFTFHRFLKMDKKTSNSASLSFRNHTQ----------DAALLARVRSSLDK-KKTKLKEVRKKS-----RGWKQKTW

Query:  PQTYEDMQLLFGIVDIKIISRLVKMSRTTKEQLLWCEEKMNKLDVSNGKLWRDPSPLLFPC
          T + +   F  VD+K+++R++ MS+ T++ L+WC  K+ K++  N +L  DPS  LFPC
Subjt:  PQTYEDMQLLFGIVDIKIISRLVKMSRTTKEQLLWCEEKMNKLDVSNGKLWRDPSPLLFPC
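The % identity values listed for each hit relate to the middle line of each alignment block, where a letter marks an identical residue, "+" marks a conservative substitution, and a space marks a mismatch or gap. As a rough illustration only, the sketch below computes percent identity from such a midline as identical positions divided by alignment length. The function name and the toy alignment are hypothetical and are not taken from the hits above; the reported figures cover each full alignment, not a single block.

```python
def percent_identity(midline: str, alignment_length: int) -> float:
    """Percent identity from a BLAST-style midline.

    Letters mark identical residues; '+' marks a conservative
    substitution and a space marks a mismatch or gap.
    """
    if alignment_length <= 0:
        raise ValueError("alignment_length must be positive")
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / alignment_length


# Hypothetical toy alignment, for illustration only.
query = "MKTEAYLV"
midline = "MK  A+LV"
subject = "MKGWAFLV"
print(round(percent_identity(midline, len(query)), 2))
```

Passing the alignment length separately matters because trailing spaces in a midline are easily lost when alignments are copied as plain text.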


Sequences
CDS sequence
ATGTTATATGAGGTAGTTTGGGGCTCAAAAGTGGATAGATTTATTAGAGTATTTATCTGCTACCTGATGGTCCATTTACCATTAGATATTGTTTATGTAAACTCA
TTGAATGAGAAAATGAACCTCAAAATCACATTATCTATTCTCTGTCCTTTACATCAAATGTTAGTGAAACTTGTGTCAATCCTGACCCTCACTCTGAATCAGGTA
AGCAGTCGCCAAATCCCAAATGAAGATGAGAAAACAAAAGAACATTTAGCTCCTTTGCGAAATAATAAAGCATTCACTGATCTAGGATCTTCTCCTATCAATATC
AATACTCCTGCCATCAACACAGGTGGTTGTGAAGCCACGCCCCTCAGAATGGTAGGAATATCAGGGTCATCTTCTGCCCTGGATTTGATCAAGAAAAAATTACAA
GACTCTGGAACTTCTGTAGCTTCCTCGCCTATTTCAGCTCCAGCAATAGCTCAATTAGATGTAAATCTACCGAGAGATGTTGATGTTGCAGTTAAGGCACTGCAG
CTAGAGAACAACAAGGATAAACCGAAAGATGCTAATGGTGATGGAAATGTATCCGACTCCTTCTTGGACTCTGAGGATGTAGAAAGTGGGCCAACTAATGAGCAA
TTAATTATCCAGTTTAAGGAGGAGGATGGCACAGATTCTAAGTGTATTGCAGAAGCGGAAGCTGAAGAGGAGGAGGAGGATTTCATTATGGAGGAGGTAAAAAGG
AGATTGAAAGAGCTGAGGAGGAACAGTTTCATGGTGTTGATTCCAGAGGAAGAAGAAGAAGAAGAAGGAGAAGAAGGAGAAGAAGGAGAAGAAGTAGGCGAAGCC
GAGCCCGAGTGGAGAGACGTGGAAGCAGAAGGCCGACAATGGTGGGGAGGGTTTGGTGCTGTTTATGATGATTACTGCAAGAGGATGTTTTTCTTTGATCGGATG
AGCCTTCGATCTGGTCCCGATTCAACCTCCCAAAGATCTGCATCGAAAAAGAGTGCATCTCCTCTTCGATGTCTTTCTCTGAAGAGGATCGAAGAACCTGAAGAT
GAAATGGAAGATGTTGACCCATCATTGACTCTGATTGACTCCAATCACCACATAGAAACAGCCTATGTTGCTCACATTTGCTTGTCCTGGGAGGCCCTTCACTGT
CAGTACACTCAACTTAACCACTTAATATCATGCCAACCCCAAAACTCTACTACTCATTATAATCTTACTGCTCAGCTCTTTCAGCAATTTCAAGTCCTCTTGCAA
AGGTTTATTGAAAATGAACCCTTCCAACAAGCTCTCAGGCCTACACTTTATGCCCGAACCCGTCGAACTTTTCCTAAAATGTTGCATGTTCCTAACATACAAGCT
TCAGATCCAAACGGGGTGCAGGAACAAGAATCTGATTTCCTCATCCTCGCTCCTGATCTGCTGCTCATTATTGAGGCTTCAATCTTTACTTTCCACCGCTTCCTG
AAGATGGACAAGAAAACCTCAAACTCTGCTTCTTTGTCGTTTCGGAACCACACACAGGATGCCGCTCTGCTTGCTCGTGTTCGGTCTTCTCTCGACAAGAAGAAG
ACGAAGCTGAAAGAGGTTAGGAAGAAGAGTAGAGGGTGGAAACAGAAAACGTGGCCTCAAACGTATGAAGACATGCAATTACTTTTTGGAATCGTGGACATTAAA
ATCATATCAAGGCTTGTTAAGATGTCGAGGACTACTAAAGAACAGCTGCTCTGGTGCGAGGAGAAAATGAACAAGTTAGATGTGTCTAATGGAAAATTGTGGAGA
GATCCGTCTCCTCTTCTTTTCCCATGTTAA
mRNA sequence
ATGTTATATGAGGTAGTTTGGGGCTCAAAAGTGGATAGATTTATTAGAGTATTTATCTGCTACCTGATGGTCCATTTACCATTAGATATTGTTTATGTAAACTCA
TTGAATGAGAAAATGAACCTCAAAATCACATTATCTATTCTCTGTCCTTTACATCAAATGTTAGTGAAACTTGTGTCAATCCTGACCCTCACTCTGAATCAGGTA
AGCAGTCGCCAAATCCCAAATGAAGATGAGAAAACAAAAGAACATTTAGCTCCTTTGCGAAATAATAAAGCATTCACTGATCTAGGATCTTCTCCTATCAATATC
AATACTCCTGCCATCAACACAGGTGGTTGTGAAGCCACGCCCCTCAGAATGGTAGGAATATCAGGGTCATCTTCTGCCCTGGATTTGATCAAGAAAAAATTACAA
GACTCTGGAACTTCTGTAGCTTCCTCGCCTATTTCAGCTCCAGCAATAGCTCAATTAGATGTAAATCTACCGAGAGATGTTGATGTTGCAGTTAAGGCACTGCAG
CTAGAGAACAACAAGGATAAACCGAAAGATGCTAATGGTGATGGAAATGTATCCGACTCCTTCTTGGACTCTGAGGATGTAGAAAGTGGGCCAACTAATGAGCAA
TTAATTATCCAGTTTAAGGAGGAGGATGGCACAGATTCTAAGTGTATTGCAGAAGCGGAAGCTGAAGAGGAGGAGGAGGATTTCATTATGGAGGAGGTAAAAAGG
AGATTGAAAGAGCTGAGGAGGAACAGTTTCATGGTGTTGATTCCAGAGGAAGAAGAAGAAGAAGAAGGAGAAGAAGGAGAAGAAGGAGAAGAAGTAGGCGAAGCC
GAGCCCGAGTGGAGAGACGTGGAAGCAGAAGGCCGACAATGGTGGGGAGGGTTTGGTGCTGTTTATGATGATTACTGCAAGAGGATGTTTTTCTTTGATCGGATG
AGCCTTCGATCTGGTCCCGATTCAACCTCCCAAAGATCTGCATCGAAAAAGAGTGCATCTCCTCTTCGATGTCTTTCTCTGAAGAGGATCGAAGAACCTGAAGAT
GAAATGGAAGATGTTGACCCATCATTGACTCTGATTGACTCCAATCACCACATAGAAACAGCCTATGTTGCTCACATTTGCTTGTCCTGGGAGGCCCTTCACTGT
CAGTACACTCAACTTAACCACTTAATATCATGCCAACCCCAAAACTCTACTACTCATTATAATCTTACTGCTCAGCTCTTTCAGCAATTTCAAGTCCTCTTGCAA
AGGTTTATTGAAAATGAACCCTTCCAACAAGCTCTCAGGCCTACACTTTATGCCCGAACCCGTCGAACTTTTCCTAAAATGTTGCATGTTCCTAACATACAAGCT
TCAGATCCAAACGGGGTGCAGGAACAAGAATCTGATTTCCTCATCCTCGCTCCTGATCTGCTGCTCATTATTGAGGCTTCAATCTTTACTTTCCACCGCTTCCTG
AAGATGGACAAGAAAACCTCAAACTCTGCTTCTTTGTCGTTTCGGAACCACACACAGGATGCCGCTCTGCTTGCTCGTGTTCGGTCTTCTCTCGACAAGAAGAAG
ACGAAGCTGAAAGAGGTTAGGAAGAAGAGTAGAGGGTGGAAACAGAAAACGTGGCCTCAAACGTATGAAGACATGCAATTACTTTTTGGAATCGTGGACATTAAA
ATCATATCAAGGCTTGTTAAGATGTCGAGGACTACTAAAGAACAGCTGCTCTGGTGCGAGGAGAAAATGAACAAGTTAGATGTGTCTAATGGAAAATTGTGGAGA
GATCCGTCTCCTCTTCTTTTCCCATGTTAA
Protein sequence
MLYEVVWGSKVDRFIRVFICYLMVHLPLDIVYVNSLNEKMNLKITLSILCPLHQMLVKLVSILTLTLNQVSSRQIPNEDEKTKEHLAPLRNNKAFTDLGSSPINI
NTPAINTGGCEATPLRMVGISGSSSALDLIKKKLQDSGTSVASSPISAPAIAQLDVNLPRDVDVAVKALQLENNKDKPKDANGDGNVSDSFLDSEDVESGPTNEQ
LIIQFKEEDGTDSKCIAEAEAEEEEEDFIMEEVKRRLKELRRNSFMVLIPEEEEEEEGEEGEEGEEVGEAEPEWRDVEAEGRQWWGGFGAVYDDYCKRMFFFDRM
SLRSGPDSTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTLIDSNHHIETAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQ
RFIENEPFQQALRPTLYARTRRTFPKMLHVPNIQASDPNGVQEQESDFLILAPDLLLIIEASIFTFHRFLKMDKKTSNSASLSFRNHTQDAALLARVRSSLDKKK
TKLKEVRKKSRGWKQKTWPQTYEDMQLLFGIVDIKIISRLVKMSRTTKEQLLWCEEKMNKLDVSNGKLWRDPSPLLFPC
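
The CDS and protein sequences above can be cross-checked by translating the CDS with the standard genetic code. The sketch below is a minimal illustration, assuming the standard nuclear code applies to this gene; the helper names are hypothetical. It translates only the first 15 nucleotides and the final line of the CDS, which correspond to the start (MLYEV) and end (DPSPLLFPC) of the protein sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence; '*' denotes a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 15 nt and final line of the CDS sequence listed above.
print(translate("ATGTTATATGAGGTA"))
print(translate("GATCCGTCTCCTCTTCTTTTCCCATGTTAA"))
```

The trailing "*" in the second result is the TAA stop codon, which is why the protein sequence is one residue shorter than the number of codons in the CDS.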