; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI07G21760 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI07G21760
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionProtein of unknown function (DUF1666)
Genome locationChr7:18533003..18536597
RNA-Seq ExpressionCSPI07G21760
SyntenyCSPI07G21760
Gene Ontology termsNA
InterPro domainsIPR012870 - Protein of unknown function DUF1666


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7013928.1 hypothetical protein SDJN02_24097 [Cucurbita argyrosperma subsp. argyrosperma]3.0e-17783.58Show/hide
Query:  KEEDGTDLKSIAEVEA----------EEDEDDFIMEEVKRRLKELRRNSFMVLIPEEEEEEIEGGEEEEVGE-----GEPEWRDVEAEGRQWWGGFGAVY
        KEEDG DLK +AE EA          +ED++DFIMEEVKRRLKELRRNSFMVLIPEEEEEE    EEEEVGE     GEPEWRDVEAEGRQWWGGFGAVY
Subjt:  KEEDGTDLKSIAEVEA----------EEDEDDFIMEEVKRRLKELRRNSFMVLIPEEEEEEIEGGEEEEVGE-----GEPEWRDVEAEGRQWWGGFGAVY

Query:  DDYCERMRFFDRKSIESGPASTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTPIDSNHHIEIAYVAHICLSWEALHCQYTQLNHLISCQPQNST
        DDYCERM FFDR S +SG  S SQRSA KKSASPLRCLSLKRIEEPEDEMED+DPS   IDSNHH+E AYVAHICLSWEALHCQYTQLNHLISCQPQN T
Subjt:  DDYCERMRFFDRKSIESGPASTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTPIDSNHHIEIAYVAHICLSWEALHCQYTQLNHLISCQPQNST

Query:  THYNLTAQLFQQFQVLLQRFIENEPFQQALRPTIYARTRRTYPKMLHVPNIQASDPNGVQEQESDSLILAPDLLFIIEASIFTFHRFLKMEKKTSTSASL
        T YNLTAQLFQQFQVLLQRFIENEPFQQALRP IYARTRRT+PKMLHVPNIQASD N  QEQESDSLILA DLL IIEASIFTFHRFLKM+KK S SASL
Subjt:  THYNLTAQLFQQFQVLLQRFIENEPFQQALRPTIYARTRRTYPKMLHVPNIQASDPNGVQEQESDSLILAPDLLFIIEASIFTFHRFLKMEKKTSTSASL

Query:  SFRNHTQDAALLARVRSSLDKKKTKLKEVRKKSKGWKQKTCPQTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKLDVSNGKLRRDPSPLLF
        S RN TQDA LLAR+RSSLDKKK KLKEVRKKS+GWKQKT PQTYEDMQLLFGVVDIKII+RL+KM RI+KEQLLWCEEK+ KLDVS+GKLRRDPSPLLF
Subjt:  SFRNHTQDAALLARVRSSLDKKKTKLKEVRKKSKGWKQKTCPQTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKLDVSNGKLRRDPSPLLF

Query:  PC
        PC
Subjt:  PC

XP_004141832.1 uncharacterized protein LOC101216166 [Cucumis sativus]3.6e-26399.38Show/hide
Query:  MVGISGPSSVLNLIKKKLQDTGTPVASSPISAPTTAQLDLNLPRDVNVALKALQKENGKDKPKYANADGNVSDSSLDSEDVESGPTDEQLIIQFKEEDGT
        MVGISGPSSVLNLIKKKLQDTGTPVASSPISAPTTAQLDLNLPRDVNVALKALQKENGKDKPKYANADGNVSDSSLDSEDVESGPTDEQLIIQFKEEDGT
Subjt:  MVGISGPSSVLNLIKKKLQDTGTPVASSPISAPTTAQLDLNLPRDVNVALKALQKENGKDKPKYANADGNVSDSSLDSEDVESGPTDEQLIIQFKEEDGT

Query:  DLKSIAEVEAEEDEDDFIMEEVKRRLKELRRNSFMVLIPEEEEEEIEGGEEEEVGEGEPEWRDVEAEGRQWWGGFGAVYDDYCERMRFFDRKSIESGPAS
        D KSIAEVEAEEDEDDFIMEEVKRRLKELRRNSFMVLIPEEEEEEIEGGEEEEVGEGE EWRDVEAEGRQWWGGFGAVYDDYCERMRFFDRKSIESGPAS
Subjt:  DLKSIAEVEAEEDEDDFIMEEVKRRLKELRRNSFMVLIPEEEEEEIEGGEEEEVGEGEPEWRDVEAEGRQWWGGFGAVYDDYCERMRFFDRKSIESGPAS

Query:  TSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTPIDSNHHIEIAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQRFI
        TSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLT IDSNHHIEIAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQRFI
Subjt:  TSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTPIDSNHHIEIAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQRFI

Query:  ENEPFQQALRPTIYARTRRTYPKMLHVPNIQASDPNGVQEQESDSLILAPDLLFIIEASIFTFHRFLKMEKKTSTSASLSFRNHTQDAALLARVRSSLDK
        ENEPFQQALRPTIYARTRRTYPKMLHVPNIQASDPNGVQEQESDSLILAPDLLFIIEASIFTFHRFLKMEKKTSTSASLSFRNHTQDAALLARVRSSLDK
Subjt:  ENEPFQQALRPTIYARTRRTYPKMLHVPNIQASDPNGVQEQESDSLILAPDLLFIIEASIFTFHRFLKMEKKTSTSASLSFRNHTQDAALLARVRSSLDK

Query:  KKTKLKEVRKKSKGWKQKTCPQTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKLDVSNGKLRRDPSPLLFPC
        KKTKLKEVRKKSKGWKQKTCPQTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKLDVSNGKLRRDPSPLLFPC
Subjt:  KKTKLKEVRKKSKGWKQKTCPQTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKLDVSNGKLRRDPSPLLFPC

XP_008462200.1 PREDICTED: uncharacterized protein LOC103500615 [Cucumis melo]5.2e-25496.05Show/hide
Query:  MVGISGPSSVLNLIKKKLQDTGTPVASSPISAPTTAQLDLNLPRDVNVALKALQKENGKDKPKYANADGNVSDSSLDSEDVESGPTDEQLIIQFKEEDGT
        MVGISG SSVLNLIKKKLQDTGTPVASSPISAPTTAQLDLNL RDV+VALKALQ EN +DKPKYANADGNVSDSSLDSEDVE+ PT+EQLI QFKEEDGT
Subjt:  MVGISGPSSVLNLIKKKLQDTGTPVASSPISAPTTAQLDLNLPRDVNVALKALQKENGKDKPKYANADGNVSDSSLDSEDVESGPTDEQLIIQFKEEDGT

Query:  DLKSIAEVEAEEDEDDFIMEEVKRRLKELRRNSFMVLIPEEEEEEIEGGEEEEVGEGEPEWRDVEAEGRQWWGGFGAVYDDYCERMRFFDRKSIESGPAS
        D KSIAEVEAEEDEDDFIMEEVKRRLKELRRNSFMVLIPEEEEEE EGGEEEEVGEGEPEWRDVEAEGRQWWGGFGAVYDDYCERMRFFDRKSIESGP S
Subjt:  DLKSIAEVEAEEDEDDFIMEEVKRRLKELRRNSFMVLIPEEEEEEIEGGEEEEVGEGEPEWRDVEAEGRQWWGGFGAVYDDYCERMRFFDRKSIESGPAS

Query:  TSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTPIDSNHHIEIAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQRFI
        TSQRSASKKSASPLRCLSLKRIEEPEDEMED DP LTPIDSNHHIEIAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQRFI
Subjt:  TSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTPIDSNHHIEIAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQRFI

Query:  ENEPFQQALRPTIYARTRRTYPKMLHVPNIQASDPNGVQEQESDSLILAPDLLFIIEASIFTFHRFLKMEKKTSTSASLSFRNHTQDAALLARVRSSLDK
        ENEPFQQALRPTIYARTRRT+PKMLHVPNIQASDPNGVQEQESDSLILAPDLLFI+EASIFTFHRFLKMEKKTS SASLSFRNHTQDAAL ARVRSSLDK
Subjt:  ENEPFQQALRPTIYARTRRTYPKMLHVPNIQASDPNGVQEQESDSLILAPDLLFIIEASIFTFHRFLKMEKKTSTSASLSFRNHTQDAALLARVRSSLDK

Query:  KKTKLKEVRKKSKGWKQKTCPQTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKLDVSNGKLRRDPSPLLFPC
        KKTKLKEVRKKSKGWKQKTCPQTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKLDVSNGKLRRDPSPLLFPC
Subjt:  KKTKLKEVRKKSKGWKQKTCPQTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKLDVSNGKLRRDPSPLLFPC

XP_022954221.1 uncharacterized protein LOC111456540 [Cucurbita moschata]3.9e-17783.58Show/hide
Query:  KEEDGTDLKSIAEVEA----------EEDEDDFIMEEVKRRLKELRRNSFMVLIPEEEEEEIEGGEEEEVGE-----GEPEWRDVEAEGRQWWGGFGAVY
        KEEDG DLK +AE EA          +ED++DFIMEEVKRRLKELRRNSFMVLIPEEEEEE    EEEEVGE     GEPEWRDVEAEGRQWWGGFGAVY
Subjt:  KEEDGTDLKSIAEVEA----------EEDEDDFIMEEVKRRLKELRRNSFMVLIPEEEEEEIEGGEEEEVGE-----GEPEWRDVEAEGRQWWGGFGAVY

Query:  DDYCERMRFFDRKSIESGPASTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTPIDSNHHIEIAYVAHICLSWEALHCQYTQLNHLISCQPQNST
        DDYCERM FFDR S +SG  S SQRSA KKSASPLRCLSLKRIEEPEDEMED+DPS   IDSNHH+E AYVAHICLSWEALHCQYTQLNHLISCQPQN T
Subjt:  DDYCERMRFFDRKSIESGPASTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTPIDSNHHIEIAYVAHICLSWEALHCQYTQLNHLISCQPQNST

Query:  THYNLTAQLFQQFQVLLQRFIENEPFQQALRPTIYARTRRTYPKMLHVPNIQASDPNGVQEQESDSLILAPDLLFIIEASIFTFHRFLKMEKKTSTSASL
        T YNLTAQLFQQFQVLLQRFIENEPFQQALRP IYARTRRT+PKMLHVPNIQASD N  QEQESDSLILA DLL IIEASIFTFHRFLKM+KK S SASL
Subjt:  THYNLTAQLFQQFQVLLQRFIENEPFQQALRPTIYARTRRTYPKMLHVPNIQASDPNGVQEQESDSLILAPDLLFIIEASIFTFHRFLKMEKKTSTSASL

Query:  SFRNHTQDAALLARVRSSLDKKKTKLKEVRKKSKGWKQKTCPQTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKLDVSNGKLRRDPSPLLF
        S RN TQDA LLAR+RSSLDKKK KLKEVRKKS+GWKQKT PQTYEDMQLLFGVVDIKII+RL+KM RI+KEQLLWCEEK+ KLDVS+GKLRRDPSPLLF
Subjt:  SFRNHTQDAALLARVRSSLDKKKTKLKEVRKKSKGWKQKTCPQTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKLDVSNGKLRRDPSPLLF

Query:  PC
        PC
Subjt:  PC

XP_038898126.1 uncharacterized protein LOC120085908 isoform X1 [Benincasa hispida]4.3e-23289.23Show/hide
Query:  MVGISGPSSVLNLIKKKLQDTGTPVASSPISAPTTAQLDLNLPRDVNVALKALQKENGKDKPKYANADGNVSDSSLDSEDVESGPTDEQLIIQFKEEDGT
        MVGISG SS L+LIKKKL D+GTPVASSPISA T AQLD+NLPRDV+VA+KALQ EN K+KPK AN DGNVS+SSLDSE+V+SGPT+EQL IQF EEDG 
Subjt:  MVGISGPSSVLNLIKKKLQDTGTPVASSPISAPTTAQLDLNLPRDVNVALKALQKENGKDKPKYANADGNVSDSSLDSEDVESGPTDEQLIIQFKEEDGT

Query:  DLKSIAEVEAEEDEDDFIMEEVKRRLKELRRNSFMVLIP--EEEEEEIEGGEEEEVGEGEPEWRDVEAEGRQWWGGFGAVYDDYCERMRFFDRKSIESGP
        D K  AEVEAEEDE+DFIMEEVKRRLKELRRNSFMVLIP  EEEEEE EG EE EVGEG+PEWRDVEAEGRQWWGGFGAVYDDYCERM FFDR SI SGP
Subjt:  DLKSIAEVEAEEDEDDFIMEEVKRRLKELRRNSFMVLIP--EEEEEEIEGGEEEEVGEGEPEWRDVEAEGRQWWGGFGAVYDDYCERMRFFDRKSIESGP

Query:  ASTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTPIDSNHHIEIAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQR
         STS+RSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLT IDSN+HIE AYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQR
Subjt:  ASTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTPIDSNHHIEIAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQR

Query:  FIENEPFQQALRPTIYARTRRTYPKMLHVPNIQASDPNGVQEQESDSLILAPDLLFIIEASIFTFHRFLKMEKKTSTSASLSFRNHTQDAALLARVRSSL
        FIENEPFQQ LRPTIYARTRRT+PKMLHVPNIQASDPNGVQEQESDSLILAPDLL IIEASIFTFHRFLKMEKKTS SASLSF+NHTQDAALLARVRSSL
Subjt:  FIENEPFQQALRPTIYARTRRTYPKMLHVPNIQASDPNGVQEQESDSLILAPDLLFIIEASIFTFHRFLKMEKKTSTSASLSFRNHTQDAALLARVRSSL

Query:  DKKKTKLKEVRKKSKGWKQKTCPQTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKLDVSNGKLRRDPSPLLFPC
        DKKKTKLKEVRKKS+GWKQKTCPQTYEDMQLLFG+VDIKII+RL+KMSRITKEQLLWCEEK+NKLD+SNGKLRRD SPLLFPC
Subjt:  DKKKTKLKEVRKKSKGWKQKTCPQTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKLDVSNGKLRRDPSPLLFPC

TrEMBL top hitse value%identityAlignment
A0A0A0K6Q7 Uncharacterized protein1.7e-26399.38Show/hide
Query:  MVGISGPSSVLNLIKKKLQDTGTPVASSPISAPTTAQLDLNLPRDVNVALKALQKENGKDKPKYANADGNVSDSSLDSEDVESGPTDEQLIIQFKEEDGT
        MVGISGPSSVLNLIKKKLQDTGTPVASSPISAPTTAQLDLNLPRDVNVALKALQKENGKDKPKYANADGNVSDSSLDSEDVESGPTDEQLIIQFKEEDGT
Subjt:  MVGISGPSSVLNLIKKKLQDTGTPVASSPISAPTTAQLDLNLPRDVNVALKALQKENGKDKPKYANADGNVSDSSLDSEDVESGPTDEQLIIQFKEEDGT

Query:  DLKSIAEVEAEEDEDDFIMEEVKRRLKELRRNSFMVLIPEEEEEEIEGGEEEEVGEGEPEWRDVEAEGRQWWGGFGAVYDDYCERMRFFDRKSIESGPAS
        D KSIAEVEAEEDEDDFIMEEVKRRLKELRRNSFMVLIPEEEEEEIEGGEEEEVGEGE EWRDVEAEGRQWWGGFGAVYDDYCERMRFFDRKSIESGPAS
Subjt:  DLKSIAEVEAEEDEDDFIMEEVKRRLKELRRNSFMVLIPEEEEEEIEGGEEEEVGEGEPEWRDVEAEGRQWWGGFGAVYDDYCERMRFFDRKSIESGPAS

Query:  TSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTPIDSNHHIEIAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQRFI
        TSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLT IDSNHHIEIAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQRFI
Subjt:  TSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTPIDSNHHIEIAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQRFI

Query:  ENEPFQQALRPTIYARTRRTYPKMLHVPNIQASDPNGVQEQESDSLILAPDLLFIIEASIFTFHRFLKMEKKTSTSASLSFRNHTQDAALLARVRSSLDK
        ENEPFQQALRPTIYARTRRTYPKMLHVPNIQASDPNGVQEQESDSLILAPDLLFIIEASIFTFHRFLKMEKKTSTSASLSFRNHTQDAALLARVRSSLDK
Subjt:  ENEPFQQALRPTIYARTRRTYPKMLHVPNIQASDPNGVQEQESDSLILAPDLLFIIEASIFTFHRFLKMEKKTSTSASLSFRNHTQDAALLARVRSSLDK

Query:  KKTKLKEVRKKSKGWKQKTCPQTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKLDVSNGKLRRDPSPLLFPC
        KKTKLKEVRKKSKGWKQKTCPQTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKLDVSNGKLRRDPSPLLFPC
Subjt:  KKTKLKEVRKKSKGWKQKTCPQTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKLDVSNGKLRRDPSPLLFPC

A0A1S3CGF8 uncharacterized protein LOC1035006152.5e-25496.05Show/hide
Query:  MVGISGPSSVLNLIKKKLQDTGTPVASSPISAPTTAQLDLNLPRDVNVALKALQKENGKDKPKYANADGNVSDSSLDSEDVESGPTDEQLIIQFKEEDGT
        MVGISG SSVLNLIKKKLQDTGTPVASSPISAPTTAQLDLNL RDV+VALKALQ EN +DKPKYANADGNVSDSSLDSEDVE+ PT+EQLI QFKEEDGT
Subjt:  MVGISGPSSVLNLIKKKLQDTGTPVASSPISAPTTAQLDLNLPRDVNVALKALQKENGKDKPKYANADGNVSDSSLDSEDVESGPTDEQLIIQFKEEDGT

Query:  DLKSIAEVEAEEDEDDFIMEEVKRRLKELRRNSFMVLIPEEEEEEIEGGEEEEVGEGEPEWRDVEAEGRQWWGGFGAVYDDYCERMRFFDRKSIESGPAS
        D KSIAEVEAEEDEDDFIMEEVKRRLKELRRNSFMVLIPEEEEEE EGGEEEEVGEGEPEWRDVEAEGRQWWGGFGAVYDDYCERMRFFDRKSIESGP S
Subjt:  DLKSIAEVEAEEDEDDFIMEEVKRRLKELRRNSFMVLIPEEEEEEIEGGEEEEVGEGEPEWRDVEAEGRQWWGGFGAVYDDYCERMRFFDRKSIESGPAS

Query:  TSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTPIDSNHHIEIAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQRFI
        TSQRSASKKSASPLRCLSLKRIEEPEDEMED DP LTPIDSNHHIEIAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQRFI
Subjt:  TSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTPIDSNHHIEIAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQRFI

Query:  ENEPFQQALRPTIYARTRRTYPKMLHVPNIQASDPNGVQEQESDSLILAPDLLFIIEASIFTFHRFLKMEKKTSTSASLSFRNHTQDAALLARVRSSLDK
        ENEPFQQALRPTIYARTRRT+PKMLHVPNIQASDPNGVQEQESDSLILAPDLLFI+EASIFTFHRFLKMEKKTS SASLSFRNHTQDAAL ARVRSSLDK
Subjt:  ENEPFQQALRPTIYARTRRTYPKMLHVPNIQASDPNGVQEQESDSLILAPDLLFIIEASIFTFHRFLKMEKKTSTSASLSFRNHTQDAALLARVRSSLDK

Query:  KKTKLKEVRKKSKGWKQKTCPQTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKLDVSNGKLRRDPSPLLFPC
        KKTKLKEVRKKSKGWKQKTCPQTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKLDVSNGKLRRDPSPLLFPC
Subjt:  KKTKLKEVRKKSKGWKQKTCPQTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKLDVSNGKLRRDPSPLLFPC

A0A5A7UTV1 DUF1666 domain-containing protein2.5e-25496.05Show/hide
Query:  MVGISGPSSVLNLIKKKLQDTGTPVASSPISAPTTAQLDLNLPRDVNVALKALQKENGKDKPKYANADGNVSDSSLDSEDVESGPTDEQLIIQFKEEDGT
        MVGISG SSVLNLIKKKLQDTGTPVASSPISAPTTAQLDLNL RDV+VALKALQ EN +DKPKYANADGNVSDSSLDSEDVE+ PT+EQLI QFKEEDGT
Subjt:  MVGISGPSSVLNLIKKKLQDTGTPVASSPISAPTTAQLDLNLPRDVNVALKALQKENGKDKPKYANADGNVSDSSLDSEDVESGPTDEQLIIQFKEEDGT

Query:  DLKSIAEVEAEEDEDDFIMEEVKRRLKELRRNSFMVLIPEEEEEEIEGGEEEEVGEGEPEWRDVEAEGRQWWGGFGAVYDDYCERMRFFDRKSIESGPAS
        D KSIAEVEAEEDEDDFIMEEVKRRLKELRRNSFMVLIPEEEEEE EGGEEEEVGEGEPEWRDVEAEGRQWWGGFGAVYDDYCERMRFFDRKSIESGP S
Subjt:  DLKSIAEVEAEEDEDDFIMEEVKRRLKELRRNSFMVLIPEEEEEEIEGGEEEEVGEGEPEWRDVEAEGRQWWGGFGAVYDDYCERMRFFDRKSIESGPAS

Query:  TSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTPIDSNHHIEIAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQRFI
        TSQRSASKKSASPLRCLSLKRIEEPEDEMED DP LTPIDSNHHIEIAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQRFI
Subjt:  TSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTPIDSNHHIEIAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQRFI

Query:  ENEPFQQALRPTIYARTRRTYPKMLHVPNIQASDPNGVQEQESDSLILAPDLLFIIEASIFTFHRFLKMEKKTSTSASLSFRNHTQDAALLARVRSSLDK
        ENEPFQQALRPTIYARTRRT+PKMLHVPNIQASDPNGVQEQESDSLILAPDLLFI+EASIFTFHRFLKMEKKTS SASLSFRNHTQDAAL ARVRSSLDK
Subjt:  ENEPFQQALRPTIYARTRRTYPKMLHVPNIQASDPNGVQEQESDSLILAPDLLFIIEASIFTFHRFLKMEKKTSTSASLSFRNHTQDAALLARVRSSLDK

Query:  KKTKLKEVRKKSKGWKQKTCPQTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKLDVSNGKLRRDPSPLLFPC
        KKTKLKEVRKKSKGWKQKTCPQTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKLDVSNGKLRRDPSPLLFPC
Subjt:  KKTKLKEVRKKSKGWKQKTCPQTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKLDVSNGKLRRDPSPLLFPC

A0A6J1GQB8 uncharacterized protein LOC1114565401.9e-17783.58Show/hide
Query:  KEEDGTDLKSIAEVEA----------EEDEDDFIMEEVKRRLKELRRNSFMVLIPEEEEEEIEGGEEEEVGE-----GEPEWRDVEAEGRQWWGGFGAVY
        KEEDG DLK +AE EA          +ED++DFIMEEVKRRLKELRRNSFMVLIPEEEEEE    EEEEVGE     GEPEWRDVEAEGRQWWGGFGAVY
Subjt:  KEEDGTDLKSIAEVEA----------EEDEDDFIMEEVKRRLKELRRNSFMVLIPEEEEEEIEGGEEEEVGE-----GEPEWRDVEAEGRQWWGGFGAVY

Query:  DDYCERMRFFDRKSIESGPASTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTPIDSNHHIEIAYVAHICLSWEALHCQYTQLNHLISCQPQNST
        DDYCERM FFDR S +SG  S SQRSA KKSASPLRCLSLKRIEEPEDEMED+DPS   IDSNHH+E AYVAHICLSWEALHCQYTQLNHLISCQPQN T
Subjt:  DDYCERMRFFDRKSIESGPASTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTPIDSNHHIEIAYVAHICLSWEALHCQYTQLNHLISCQPQNST

Query:  THYNLTAQLFQQFQVLLQRFIENEPFQQALRPTIYARTRRTYPKMLHVPNIQASDPNGVQEQESDSLILAPDLLFIIEASIFTFHRFLKMEKKTSTSASL
        T YNLTAQLFQQFQVLLQRFIENEPFQQALRP IYARTRRT+PKMLHVPNIQASD N  QEQESDSLILA DLL IIEASIFTFHRFLKM+KK S SASL
Subjt:  THYNLTAQLFQQFQVLLQRFIENEPFQQALRPTIYARTRRTYPKMLHVPNIQASDPNGVQEQESDSLILAPDLLFIIEASIFTFHRFLKMEKKTSTSASL

Query:  SFRNHTQDAALLARVRSSLDKKKTKLKEVRKKSKGWKQKTCPQTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKLDVSNGKLRRDPSPLLF
        S RN TQDA LLAR+RSSLDKKK KLKEVRKKS+GWKQKT PQTYEDMQLLFGVVDIKII+RL+KM RI+KEQLLWCEEK+ KLDVS+GKLRRDPSPLLF
Subjt:  SFRNHTQDAALLARVRSSLDKKKTKLKEVRKKSKGWKQKTCPQTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKLDVSNGKLRRDPSPLLF

Query:  PC
        PC
Subjt:  PC

A0A6J1JYQ1 uncharacterized protein LOC1114886374.2e-17783.58Show/hide
Query:  KEEDGTDLKSIAEVEA----------EEDEDDFIMEEVKRRLKELRRNSFMVLIPEEEEEEIEGGEEEEVGE-----GEPEWRDVEAEGRQWWGGFGAVY
        KEEDG DLK +AE EA          +ED++DFIMEEVKRRLKELRRNSFMVLIPEEEEEE    EEEEVGE     GEPEWRDVEAEGRQWWGGFGAVY
Subjt:  KEEDGTDLKSIAEVEA----------EEDEDDFIMEEVKRRLKELRRNSFMVLIPEEEEEEIEGGEEEEVGE-----GEPEWRDVEAEGRQWWGGFGAVY

Query:  DDYCERMRFFDRKSIESGPASTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTPIDSNHHIEIAYVAHICLSWEALHCQYTQLNHLISCQPQNST
        DDYCERM FFDR S +SG  S SQRSA KKSASPLRCLSLKRIEEPEDEMED+DPS T  DSNHH+E AYVAHICLSWEALHCQYTQLNHLISCQPQN T
Subjt:  DDYCERMRFFDRKSIESGPASTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTPIDSNHHIEIAYVAHICLSWEALHCQYTQLNHLISCQPQNST

Query:  THYNLTAQLFQQFQVLLQRFIENEPFQQALRPTIYARTRRTYPKMLHVPNIQASDPNGVQEQESDSLILAPDLLFIIEASIFTFHRFLKMEKKTSTSASL
        T YNLTAQLFQQFQVLLQRFIENEPFQQALRP IYARTRRT+PKMLHVPNIQASD N  QEQESDSLILA DLL IIEASIFTFHRFLKM+KK S SASL
Subjt:  THYNLTAQLFQQFQVLLQRFIENEPFQQALRPTIYARTRRTYPKMLHVPNIQASDPNGVQEQESDSLILAPDLLFIIEASIFTFHRFLKMEKKTSTSASL

Query:  SFRNHTQDAALLARVRSSLDKKKTKLKEVRKKSKGWKQKTCPQTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKLDVSNGKLRRDPSPLLF
        S RN TQDA LLAR+RSSLDKKK KLKEVRKKS+GWKQKT PQTYEDMQLLFGVVDIKII+RL+KM RI+KEQLLWCEEK+ KLDVS+GKLRRDPSPLLF
Subjt:  SFRNHTQDAALLARVRSSLDKKKTKLKEVRKKSKGWKQKTCPQTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKLDVSNGKLRRDPSPLLF

Query:  PC
        PC
Subjt:  PC

SwissProt top hitse value%identityAlignment
Q9LT25 Pre-mRNA-processing protein 40C8.4e-0540.45Show/hide
Query:  SSVLNLIKKKLQDTGTPVASSPISAPTTAQLDLNLPRDVNVALKALQKENGKDKPKYANADGNVSDSSLDSEDVESGPTDEQLIIQFKE
        SS L+L+KKKL D+G PV+S+  S   + +     P          +  N   K K A   G +SDSS DSED +SGP+ E+   QFKE
Subjt:  SSVLNLIKKKLQDTGTPVASSPISAPTTAQLDLNLPRDVNVALKALQKENGKDKPKYANADGNVSDSSLDSEDVESGPTDEQLIIQFKE

Arabidopsis top hitse value%identityAlignment
AT1G69610.1 Protein of unknown function (DUF1666)4.4e-2527.96Show/hide
Query:  VALKALQKEN--GKDKPKYANADGNVS-DSSLDSEDVESGPTDEQLIIQFKEEDGTDLKSIAEV------EAEEDEDDFIMEEVKRRLK-ELR--RNSFM
        V  ++L  EN  G +   + + DG +  + SL   +   G  +E+  I  +EE         EV       ++ D+D+F   +V  +LK ELR  R   +
Subjt:  VALKALQKEN--GKDKPKYANADGNVS-DSSLDSEDVESGPTDEQLIIQFKEEDGTDLKSIAEV------EAEEDEDDFIMEEVKRRLK-ELR--RNSFM

Query:  VLIPEEEEEEIEGGEEEEVGEGEPEWRDVEAEGRQWWGGFGAVYDDYCERMR---FFDRKSIESGPASTSQRSA--SKKSASPLRCLSLKRIEEPEDEME
          I EE E  ++  +  ++     + +D  AE          VY +Y  +MR     D +++ S      + S+  S+ +  P +    + I   +    
Subjt:  VLIPEEEEEEIEGGEEEEVGEGEPEWRDVEAEGRQWWGGFGAVYDDYCERMR---FFDRKSIESGPASTSQRSA--SKKSASPLRCLSLKRIEEPEDEME

Query:  DVDPSLTPI-DSNHHIEIAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQRFIENEPFQQALRPTIYARTRRTYPKMLHVPN
        + DPS   + +++   E  YV  +CLSWE L  QY   + ++    Q +T  YNL A  FQ FQVLLQRF+ENEPFQ + R   Y + RR +   L +P 
Subjt:  DVDPSLTPI-DSNHHIEIAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQRFIENEPFQQALRPTIYARTRRTYPKMLHVPN

Query:  IQASDPNGVQ-EQESDSLILAPDLLFIIEASIFTFHRFLKMEKKTSTSASLSFRNHTQ-------DAALLARVRSSLDKKKTKLKEVRKK-----SKGWK
        ++    +  +   E +  +    L  II  S+  F  FL  +K   TS  +   + TQ       D  LL  +R+ L KK+ KLKE+++       K  K
Subjt:  IQASDPNGVQ-EQESDSLILAPDLLFIIEASIFTFHRFLKMEKKTSTSASLSFRNHTQ-------DAALLARVRSSLDKKKTKLKEVRKK-----SKGWK

Query:  QKTCPQTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKLDVSNGKLRRDPSPLLFPC
         ++        +LL   +++++++R++ MS++T E+L WC+EK+ K+  +  K+  +P   L PC
Subjt:  QKTCPQTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKLDVSNGKLRRDPSPLLFPC

AT1G73850.1 Protein of unknown function (DUF1666)2.4e-3131.39Show/hide
Query:  EEEEEEIEGGEEEEVGE-------------GEPEWRD-------VEAEGRQ---WWGGFGAVYDDYCERMRFFDR------------KSIESGPASTSQR
        E + EE+E  EEEE G+                EWR+            R+    W  +  V+  Y E M F  R            KSI   P S S+R
Subjt:  EEEEEEIEGGEEEEVGE-------------GEPEWRD-------VEAEGRQ---WWGGFGAVYDDYCERMRFFDR------------KSIESGPASTSQR

Query:  SASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTPIDSNHHIEIAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLT--------AQLFQQFQVLL
           K S++  +    K+ + P       +P +        +E AYVA ICL+WEAL   Y         + + STT  +          A  F+ F +LL
Subjt:  SASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTPIDSNHHIEIAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLT--------AQLFQQFQVLL

Query:  QRFIENEPFQQALRPTIYARTRRTYPKMLHVPNIQASDPNGVQEQESD----SLILAPDLLFIIEASIFTFHRFLKMEKKTSTSASLS-----FRNHTQD
        QR++ENEP++   RP IYAR R   PK+L VP  Q  +    +E E++    S I +   L I+E  I TF  FL+ +K+      +       +    D
Subjt:  QRFIENEPFQQALRPTIYARTRRTYPKMLHVPNIQASDPNGVQEQESD----SLILAPDLLFIIEASIFTFHRFLKMEKKTSTSASLS-----FRNHTQD

Query:  AALLARVRSSLDKKKTKLKEVRKKSKGWKQKTCPQTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKLDVSNG--KLRRDPSPLLFP
          L+  ++    KKKTKLKE+R+  K  ++K      E+M++L G++D+K+++R+L+M+ + +E L WCEEKM+K+ +  G   L+RD +PL FP
Subjt:  AALLARVRSSLDKKKTKLKEVRKKSKGWKQKTCPQTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKLDVSNG--KLRRDPSPLLFP

AT3G20260.1 Protein of unknown function (DUF1666)2.0e-11057Show/hide
Query:  EAEEDEDDFIMEEVKRRLKELRRNSFMVLIPEEEEEEIEGG---EEEEVGEGE--PEWRDVEAEGRQWWGGFGAVYDDYCERMRFFDRKS----------
        E E+D+DDFI  EVKRRLKELRRNSFMVLIPEEEEEE E     E+++ GE +   EWRDV AEG QWWGGF AVY+ YCERM FFDR S          
Subjt:  EAEEDEDDFIMEEVKRRLKELRRNSFMVLIPEEEEEEIEGG---EEEEVGEGE--PEWRDVEAEGRQWWGGFGAVYDDYCERMRFFDRKS----------

Query:  IESGPASTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTPIDSNHH-IEIAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQF
        I   P++ S RSASKK +SP RCLSLK+ + PE+++E + P  T +D  +  +E AYVA +CL+WEALHCQYTQL+HLISCQP+  T  YN TAQLFQQF
Subjt:  IESGPASTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTPIDSNHH-IEIAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQF

Query:  QVLLQRFIENEPFQQALRPTIYARTRRTYPKMLHVPNIQASDPNGVQEQESDSLILAPDLLFIIEASIFTFHRFLKMEKKTSTSASLSF----RNHTQDA
         VLLQR+IENEPF+Q  R  +YAR R   PK+L  P IQ SD   + E+++  ++LA DL+ +IE+SI TF+ FLKM+KK        F     NH    
Subjt:  QVLLQRFIENEPFQQALRPTIYARTRRTYPKMLHVPNIQASDPNGVQEQESDSLILAPDLLFIIEASIFTFHRFLKMEKKTSTSASLSF----RNHTQDA

Query:  ALLARVRSSLDKKKTKLKEVRKKSKGWKQKTCPQTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKLDVSNGKLRRDPSPLLFPC
          L  V+SS+DKK+ K KE+ KK+KG ++K+ PQT+E +QLLF  +DIK+ TR+L+MS+I+KEQLLWCEEKM KL+ S GKL+R PSP+LFPC
Subjt:  ALLARVRSSLDKKKTKLKEVRKKSKGWKQKTCPQTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKLDVSNGKLRRDPSPLLFPC

AT5G39785.1 Protein of unknown function (DUF1666)3.3e-3330Show/hide
Query:  DGNVSDSSLDSEDVESGPTDEQLIIQFKEEDGTDLKSIAEVEAEEDEDDF--------IMEEVKRRLKELRRNSFMVLIPEEEEEEIEGGEEEEVGEGEP
        DG +SDS      ++ G        Q ++ D +   S +E E EED + F        ++E++K  +K+++    +  I EEEEE+    +  ++ E   
Subjt:  DGNVSDSSLDSEDVESGPTDEQLIIQFKEEDGTDLKSIAEVEAEEDEDDF--------IMEEVKRRLKELRRNSFMVLIPEEEEEEIEGGEEEEVGEGEP

Query:  EWRDVEAEGRQWWGGFGAVYD---DYCERMRFFDRKSIESGPA-------------STSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTPIDSNH
         WR  E +  +     G V+     Y ERMR  D  S +   A             ST   + S+ S S +  ++++  +  + E+E +   +  I    
Subjt:  EWRDVEAEGRQWWGGFGAVYD---DYCERMRFFDRKSIESGPA-------------STSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTPIDSNH

Query:  HIEIAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQRFIENEPFQQALRPTIYARTRRTYPKMLHVPNIQ----ASDPNGVQ
         +E  YV  +CLSWE LH QY +   L+      S   YN  A  FQQFQVLLQRF+ENEPF++  R   Y + R     +L +P I+        NG +
Subjt:  HIEIAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQRFIENEPFQQALRPTIYARTRRTYPKMLHVPNIQ----ASDPNGVQ

Query:  ---EQESDSLILAPDLLFIIEASIFTFHRFLKMEKKTSTSASLSFRNHTQ----------DAALLARVRSSLDKKKTKLKEVRKKS-----KGWKQKTCP
           E+ +D +I +  L+ I+E +I  F RF++ +K TS+      R  +Q          D  + A V+S L  K+ +L++V K       +  K K   
Subjt:  ---EQESDSLILAPDLLFIIEASIFTFHRFLKMEKKTSTSASLSFRNHTQ----------DAALLARVRSSLDKKKTKLKEVRKKS-----KGWKQKTCP

Query:  QTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKLDVSNGKLRRDPSPLLFPC
         T + +   F  VD+K++TR+L MS++T++ L+WC  K+ K++  N +L  DPS  LFPC
Subjt:  QTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKLDVSNGKLRRDPSPLLFPC

AT5G39785.2 Protein of unknown function (DUF1666)3.1e-3129.72Show/hide
Query:  DGNVSDSSLDSEDVESGPTDEQLIIQFKEEDGTDLKSIAEVEAEEDEDDF--------IMEEVKRRLKELRRNSFMVLIPEEEEEEIEGGEEEEVGEGEP
        DG +SDS      ++ G        Q ++ D +   S +E E EED + F        ++E++K  +K+++    +  I EEEEE+    +  ++ E   
Subjt:  DGNVSDSSLDSEDVESGPTDEQLIIQFKEEDGTDLKSIAEVEAEEDEDDF--------IMEEVKRRLKELRRNSFMVLIPEEEEEEIEGGEEEEVGEGEP

Query:  EWRDVEAEGRQWWGGFGAVYD---DYCERMRFFDRKSIESGPA-------------STSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTPIDSNH
         WR  E +  +     G V+     Y ERMR  D  S +   A             ST   + S+ S S +  ++++  +  + E+E +   +  I    
Subjt:  EWRDVEAEGRQWWGGFGAVYD---DYCERMRFFDRKSIESGPA-------------STSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTPIDSNH

Query:  HIEIAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQRFIENEPFQQALRPTIYARTRRTYPKMLHVPNIQ----ASDPNGVQ
         +E  YV  +CLSWE LH QY +   L+      S   YN  A  FQQFQVLLQRF+ENEPF++  R   Y + R     +L +P I+        NG +
Subjt:  HIEIAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQRFIENEPFQQALRPTIYARTRRTYPKMLHVPNIQ----ASDPNGVQ

Query:  ---EQESDSLILAPDLLFIIEASIFTFHRFLKMEKKTSTSASLSFRNHTQ----------DAALLARVRSSLDK-KKTKLKEVRKKS-----KGWKQKTC
           E+ +D +I +  L+ I+E +I  F RF++ +K TS+      R  +Q          D  + A V+S L    + +L++V K       +  K K  
Subjt:  ---EQESDSLILAPDLLFIIEASIFTFHRFLKMEKKTSTSASLSFRNHTQ----------DAALLARVRSSLDK-KKTKLKEVRKKS-----KGWKQKTC

Query:  PQTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKLDVSNGKLRRDPSPLLFPC
          T + +   F  VD+K++TR+L MS++T++ L+WC  K+ K++  N +L  DPS  LFPC
Subjt:  PQTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKLDVSNGKLRRDPSPLLFPC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTAGGAATATCAGGGCCATCTTCTGTTTTGAATTTGATCAAGAAGAAATTGCAAGACACTGGAACTCCTGTAGCTTCCTCACCTATTTCAGCTCCAACAACAGCTCA
ATTAGATCTAAATCTACCGAGAGATGTTAATGTTGCACTTAAGGCACTGCAAAAAGAGAACGGCAAAGATAAACCGAAATATGCTAATGCTGATGGAAATGTATCCGACT
CCTCTTTGGACTCTGAAGACGTAGAAAGTGGGCCAACTGATGAGCAATTAATCATCCAGTTTAAGGAAGAGGATGGCACAGATCTTAAGAGTATTGCAGAAGTGGAAGCT
GAAGAGGACGAGGATGATTTCATAATGGAGGAGGTAAAGAGGAGACTGAAGGAGCTGAGGAGGAACAGTTTCATGGTTTTGATTCCCGAGGAAGAAGAAGAAGAAATCGA
AGGAGGAGAAGAAGAAGAAGTGGGTGAAGGGGAGCCTGAGTGGAGAGACGTGGAAGCAGAAGGTCGACAATGGTGGGGAGGGTTTGGTGCTGTTTATGATGATTACTGTG
AGAGGATGCGTTTCTTTGATCGTAAGAGCATTGAATCTGGTCCTGCATCAACCTCCCAAAGATCTGCATCGAAAAAGAGTGCATCCCCTCTTCGGTGTCTTTCTCTGAAG
AGGATTGAAGAACCTGAAGACGAAATGGAGGATGTCGATCCTTCACTGACTCCGATTGACTCCAATCACCACATAGAAATAGCGTATGTTGCTCACATTTGCTTGTCCTG
GGAGGCCCTTCACTGTCAGTACACTCAACTTAACCACTTAATATCATGCCAACCCCAAAACTCTACTACTCATTATAATCTTACTGCTCAGCTCTTTCAGCAATTTCAAG
TCCTCTTGCAAAGGTTTATTGAAAATGAACCCTTTCAGCAAGCTCTCAGGCCTACAATTTATGCCCGAACCCGTCGAACTTATCCTAAAATGTTGCATGTTCCTAACATA
CAAGCTTCAGATCCAAACGGGGTGCAGGAACAGGAATCTGATTCCCTCATCCTTGCTCCTGACCTGCTGTTCATTATTGAGGCTTCAATCTTTACTTTCCACCGCTTCCT
GAAGATGGAGAAGAAAACCTCAACTTCTGCTTCATTATCATTTCGGAACCACACCCAGGATGCTGCTCTGCTTGCTCGTGTTCGATCTTCTCTTGACAAGAAGAAAACGA
AGCTGAAAGAGGTTAGGAAGAAAAGTAAAGGATGGAAACAGAAAACGTGTCCTCAAACCTATGAAGACATGCAATTACTTTTTGGTGTTGTGGACATTAAAATCATAACA
AGGCTTCTGAAGATGTCAAGGATTACTAAAGAGCAGCTGCTTTGGTGCGAGGAGAAAATGAATAAGTTAGATGTGTCTAATGGAAAATTGCGGAGAGATCCGTCTCCCCT
TCTTTTTCCATGTTAA
mRNA sequenceShow/hide mRNA sequence
CACTCACTCCCTCCACTCTCTTCCCATTTTCATCTCCATCCCCTCGCAAAATACAGATCTTTGCCACAACCCCGCCGAACAACCGCGAACGGACTTCAAGCGCGAACAAA
GCCAGCGAACAATGTCGCCAAACACGACTACAAAACGGACTTCCAAGTCGACGGGTGGGGTGGACTAAAGGGTGATCTGACAGGCAGGAATAGATACTTCGACCACAAAC
ACGCCGGAGGTATTGTGTGCTGTTAAAGACGGTAGCTAGAGTTCCGGTCGGTTTCCATCGGCGCTGGCTAGGCCAAATTTCTGAGGTTAGGATTGCGGTTTGGTTTCTAT
GGGCGATGGTAAAAAGTTTCACAAAAACAGAAATACGCAGATAAGCAGTTGGCAGATCCCAAATGAAGTAACTGAATTGAGCCAACATAATGATGAAAAACAAAAGAACA
TTCAACTCCTTTGCCAAATGATCAAGCATTGACCGATCTAGGAACTTCTTTTATCACTATCAACACTCCAGCCACTAACACAGGGGGTCGTGCAGCCACACGTCTTAGAA
TGGTAGGAATATCAGGGCCATCTTCTGTTTTGAATTTGATCAAGAAGAAATTGCAAGACACTGGAACTCCTGTAGCTTCCTCACCTATTTCAGCTCCAACAACAGCTCAA
TTAGATCTAAATCTACCGAGAGATGTTAATGTTGCACTTAAGGCACTGCAAAAAGAGAACGGCAAAGATAAACCGAAATATGCTAATGCTGATGGAAATGTATCCGACTC
CTCTTTGGACTCTGAAGACGTAGAAAGTGGGCCAACTGATGAGCAATTAATCATCCAGTTTAAGGAAGAGGATGGCACAGATCTTAAGAGTATTGCAGAAGTGGAAGCTG
AAGAGGACGAGGATGATTTCATAATGGAGGAGGTAAAGAGGAGACTGAAGGAGCTGAGGAGGAACAGTTTCATGGTTTTGATTCCCGAGGAAGAAGAAGAAGAAATCGAA
GGAGGAGAAGAAGAAGAAGTGGGTGAAGGGGAGCCTGAGTGGAGAGACGTGGAAGCAGAAGGTCGACAATGGTGGGGAGGGTTTGGTGCTGTTTATGATGATTACTGTGA
GAGGATGCGTTTCTTTGATCGTAAGAGCATTGAATCTGGTCCTGCATCAACCTCCCAAAGATCTGCATCGAAAAAGAGTGCATCCCCTCTTCGGTGTCTTTCTCTGAAGA
GGATTGAAGAACCTGAAGACGAAATGGAGGATGTCGATCCTTCACTGACTCCGATTGACTCCAATCACCACATAGAAATAGCGTATGTTGCTCACATTTGCTTGTCCTGG
GAGGCCCTTCACTGTCAGTACACTCAACTTAACCACTTAATATCATGCCAACCCCAAAACTCTACTACTCATTATAATCTTACTGCTCAGCTCTTTCAGCAATTTCAAGT
CCTCTTGCAAAGGTTTATTGAAAATGAACCCTTTCAGCAAGCTCTCAGGCCTACAATTTATGCCCGAACCCGTCGAACTTATCCTAAAATGTTGCATGTTCCTAACATAC
AAGCTTCAGATCCAAACGGGGTGCAGGAACAGGAATCTGATTCCCTCATCCTTGCTCCTGACCTGCTGTTCATTATTGAGGCTTCAATCTTTACTTTCCACCGCTTCCTG
AAGATGGAGAAGAAAACCTCAACTTCTGCTTCATTATCATTTCGGAACCACACCCAGGATGCTGCTCTGCTTGCTCGTGTTCGATCTTCTCTTGACAAGAAGAAAACGAA
GCTGAAAGAGGTTAGGAAGAAAAGTAAAGGATGGAAACAGAAAACGTGTCCTCAAACCTATGAAGACATGCAATTACTTTTTGGTGTTGTGGACATTAAAATCATAACAA
GGCTTCTGAAGATGTCAAGGATTACTAAAGAGCAGCTGCTTTGGTGCGAGGAGAAAATGAATAAGTTAGATGTGTCTAATGGAAAATTGCGGAGAGATCCGTCTCCCCTT
CTTTTTCCATGTTAATTACTTATTTTTCTGCTGCTGCACTGCATCACAAAGTTGTTGTATCTGTGATTATGTTTTATGTAACTGGATTATGGCTGCAGATGGACCCTACT
ACTAAGATGAATCTGAATTTTTGGGGGTTTTCGAACTCCTTTTTCCCAATTCATCTTTAGATAAGCTCAAGACTCCCCGGTTACCAACTCTGACC
Protein sequenceShow/hide protein sequence
MVGISGPSSVLNLIKKKLQDTGTPVASSPISAPTTAQLDLNLPRDVNVALKALQKENGKDKPKYANADGNVSDSSLDSEDVESGPTDEQLIIQFKEEDGTDLKSIAEVEA
EEDEDDFIMEEVKRRLKELRRNSFMVLIPEEEEEEIEGGEEEEVGEGEPEWRDVEAEGRQWWGGFGAVYDDYCERMRFFDRKSIESGPASTSQRSASKKSASPLRCLSLK
RIEEPEDEMEDVDPSLTPIDSNHHIEIAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQRFIENEPFQQALRPTIYARTRRTYPKMLHVPNI
QASDPNGVQEQESDSLILAPDLLFIIEASIFTFHRFLKMEKKTSTSASLSFRNHTQDAALLARVRSSLDKKKTKLKEVRKKSKGWKQKTCPQTYEDMQLLFGVVDIKIIT
RLLKMSRITKEQLLWCEEKMNKLDVSNGKLRRDPSPLLFPC