; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0027035 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0027035
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionProtein of unknown function (DUF1666)
Genome locationchr01:27926082..27928151
RNA-Seq ExpressionPI0027035
SyntenyPI0027035
Gene Ontology termsNA
InterPro domainsIPR012870 - Protein of unknown function DUF1666


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7013928.1 hypothetical protein SDJN02_24097 [Cucurbita argyrosperma subsp. argyrosperma]6.7e-17783.58Show/hide
Query:  KEEDGTDPKSIAQVEA----------EEDEEDFIMEEVKRRLKELRRNSFMVLIPEEEEEEIEGGEEEEVGE-----GEPEWRDVEAEGRQWWGGFGAVY
        KEEDG D K +A+ EA          +ED+EDFIMEEVKRRLKELRRNSFMVLIPEEEEEE    EEEEVGE     GEPEWRDVEAEGRQWWGGFGAVY
Subjt:  KEEDGTDPKSIAQVEA----------EEDEEDFIMEEVKRRLKELRRNSFMVLIPEEEEEEIEGGEEEEVGE-----GEPEWRDVEAEGRQWWGGFGAVY

Query:  DDYCERMRFFDRKSVESGPESTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTPIDSNYHIEIAYVAHICLSWEALHCQYTQLNHLISCQPQNST
        DDYCERM FFDR S +SG ES SQRSA KKSASPLRCLSLKRIEEPEDEMED+DPS   IDSN+H+E AYVAHICLSWEALHCQYTQLNHLISCQPQN T
Subjt:  DDYCERMRFFDRKSVESGPESTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTPIDSNYHIEIAYVAHICLSWEALHCQYTQLNHLISCQPQNST

Query:  THYNLTAQLFQQFQVLLQRFIENEPFQQALRPTIYARTRRTFPKMLHVPNIQASDPNGVQEQESDSLILAPDLLFIIEASIFTFHRFLKMEKKTSNSASL
        T YNLTAQLFQQFQVLLQRFIENEPFQQALRP IYARTRRTFPKMLHVPNIQASD N  QEQESDSLILA DLL IIEASIFTFHRFLKM+KK S SASL
Subjt:  THYNLTAQLFQQFQVLLQRFIENEPFQQALRPTIYARTRRTFPKMLHVPNIQASDPNGVQEQESDSLILAPDLLFIIEASIFTFHRFLKMEKKTSNSASL

Query:  SFRNHTQDAALLARVRSSLDKKKTKLKEVRKKSKGWKQKTCPQTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKLDVSNGKLRRDPSPLLF
        S RN TQDA LLAR+RSSLDKKK KLKEVRKKS+GWKQKT PQTYEDMQLLFGVVDIKII+RL+KM RI+KEQLLWCEEK+ KLDVS+GKLRRDPSPLLF
Subjt:  SFRNHTQDAALLARVRSSLDKKKTKLKEVRKKSKGWKQKTCPQTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKLDVSNGKLRRDPSPLLF

Query:  PC
        PC
Subjt:  PC

XP_004141832.1 uncharacterized protein LOC101216166 [Cucumis sativus]9.4e-25696.47Show/hide
Query:  MVGISGSSSVLNLIKKKLQDSGIPVASSPISAPTTAQLDLNLPRNVDVALKALQIENSKDKPKYANADGNVSDSSLDSEDVESGPTNEQLIIQFKEEDGT
        MVGISG SSVLNLIKKKLQD+G PVASSPISAPTTAQLDLNLPR+V+VALKALQ EN KDKPKYANADGNVSDSSLDSEDVESGPT+EQLIIQFKEEDGT
Subjt:  MVGISGSSSVLNLIKKKLQDSGIPVASSPISAPTTAQLDLNLPRNVDVALKALQIENSKDKPKYANADGNVSDSSLDSEDVESGPTNEQLIIQFKEEDGT

Query:  DPKSIAQVEAEEDEEDFIMEEVKRRLKELRRNSFMVLIPEEEEEEIEGGEEEEVGEGEPEWRDVEAEGRQWWGGFGAVYDDYCERMRFFDRKSVESGPES
        DPKSIA+VEAEEDE+DFIMEEVKRRLKELRRNSFMVLIPEEEEEEIEGGEEEEVGEGE EWRDVEAEGRQWWGGFGAVYDDYCERMRFFDRKS+ESGP S
Subjt:  DPKSIAQVEAEEDEEDFIMEEVKRRLKELRRNSFMVLIPEEEEEEIEGGEEEEVGEGEPEWRDVEAEGRQWWGGFGAVYDDYCERMRFFDRKSVESGPES

Query:  TSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTPIDSNYHIEIAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQRFI
        TSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLT IDSN+HIEIAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQRFI
Subjt:  TSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTPIDSNYHIEIAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQRFI

Query:  ENEPFQQALRPTIYARTRRTFPKMLHVPNIQASDPNGVQEQESDSLILAPDLLFIIEASIFTFHRFLKMEKKTSNSASLSFRNHTQDAALLARVRSSLDK
        ENEPFQQALRPTIYARTRRT+PKMLHVPNIQASDPNGVQEQESDSLILAPDLLFIIEASIFTFHRFLKMEKKTS SASLSFRNHTQDAALLARVRSSLDK
Subjt:  ENEPFQQALRPTIYARTRRTFPKMLHVPNIQASDPNGVQEQESDSLILAPDLLFIIEASIFTFHRFLKMEKKTSNSASLSFRNHTQDAALLARVRSSLDK

Query:  KKTKLKEVRKKSKGWKQKTCPQTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKLDVSNGKLRRDPSPLLFPC
        KKTKLKEVRKKSKGWKQKTCPQTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKLDVSNGKLRRDPSPLLFPC
Subjt:  KKTKLKEVRKKSKGWKQKTCPQTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKLDVSNGKLRRDPSPLLFPC

XP_008462200.1 PREDICTED: uncharacterized protein LOC103500615 [Cucumis melo]1.1e-25696.47Show/hide
Query:  MVGISGSSSVLNLIKKKLQDSGIPVASSPISAPTTAQLDLNLPRNVDVALKALQIENSKDKPKYANADGNVSDSSLDSEDVESGPTNEQLIIQFKEEDGT
        MVGISGSSSVLNLIKKKLQD+G PVASSPISAPTTAQLDLNL R+VDVALKALQIENS+DKPKYANADGNVSDSSLDSEDVE+ PTNEQLI QFKEEDGT
Subjt:  MVGISGSSSVLNLIKKKLQDSGIPVASSPISAPTTAQLDLNLPRNVDVALKALQIENSKDKPKYANADGNVSDSSLDSEDVESGPTNEQLIIQFKEEDGT

Query:  DPKSIAQVEAEEDEEDFIMEEVKRRLKELRRNSFMVLIPEEEEEEIEGGEEEEVGEGEPEWRDVEAEGRQWWGGFGAVYDDYCERMRFFDRKSVESGPES
        DPKSIA+VEAEEDE+DFIMEEVKRRLKELRRNSFMVLIPEEEEEE EGGEEEEVGEGEPEWRDVEAEGRQWWGGFGAVYDDYCERMRFFDRKS+ESGPES
Subjt:  DPKSIAQVEAEEDEEDFIMEEVKRRLKELRRNSFMVLIPEEEEEEIEGGEEEEVGEGEPEWRDVEAEGRQWWGGFGAVYDDYCERMRFFDRKSVESGPES

Query:  TSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTPIDSNYHIEIAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQRFI
        TSQRSASKKSASPLRCLSLKRIEEPEDEMED DP LTPIDSN+HIEIAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQRFI
Subjt:  TSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTPIDSNYHIEIAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQRFI

Query:  ENEPFQQALRPTIYARTRRTFPKMLHVPNIQASDPNGVQEQESDSLILAPDLLFIIEASIFTFHRFLKMEKKTSNSASLSFRNHTQDAALLARVRSSLDK
        ENEPFQQALRPTIYARTRRTFPKMLHVPNIQASDPNGVQEQESDSLILAPDLLFI+EASIFTFHRFLKMEKKTSNSASLSFRNHTQDAAL ARVRSSLDK
Subjt:  ENEPFQQALRPTIYARTRRTFPKMLHVPNIQASDPNGVQEQESDSLILAPDLLFIIEASIFTFHRFLKMEKKTSNSASLSFRNHTQDAALLARVRSSLDK

Query:  KKTKLKEVRKKSKGWKQKTCPQTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKLDVSNGKLRRDPSPLLFPC
        KKTKLKEVRKKSKGWKQKTCPQTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKLDVSNGKLRRDPSPLLFPC
Subjt:  KKTKLKEVRKKSKGWKQKTCPQTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKLDVSNGKLRRDPSPLLFPC

XP_022954221.1 uncharacterized protein LOC111456540 [Cucurbita moschata]8.7e-17783.58Show/hide
Query:  KEEDGTDPKSIAQVEA----------EEDEEDFIMEEVKRRLKELRRNSFMVLIPEEEEEEIEGGEEEEVGE-----GEPEWRDVEAEGRQWWGGFGAVY
        KEEDG D K +A+ EA          +ED+EDFIMEEVKRRLKELRRNSFMVLIPEEEEEE    EEEEVGE     GEPEWRDVEAEGRQWWGGFGAVY
Subjt:  KEEDGTDPKSIAQVEA----------EEDEEDFIMEEVKRRLKELRRNSFMVLIPEEEEEEIEGGEEEEVGE-----GEPEWRDVEAEGRQWWGGFGAVY

Query:  DDYCERMRFFDRKSVESGPESTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTPIDSNYHIEIAYVAHICLSWEALHCQYTQLNHLISCQPQNST
        DDYCERM FFDR S +SG ES SQRSA KKSASPLRCLSLKRIEEPEDEMED+DPS   IDSN+H+E AYVAHICLSWEALHCQYTQLNHLISCQPQN T
Subjt:  DDYCERMRFFDRKSVESGPESTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTPIDSNYHIEIAYVAHICLSWEALHCQYTQLNHLISCQPQNST

Query:  THYNLTAQLFQQFQVLLQRFIENEPFQQALRPTIYARTRRTFPKMLHVPNIQASDPNGVQEQESDSLILAPDLLFIIEASIFTFHRFLKMEKKTSNSASL
        T YNLTAQLFQQFQVLLQRFIENEPFQQALRP IYARTRRTFPKMLHVPNIQASD N  QEQESDSLILA DLL IIEASIFTFHRFLKM+KK S SASL
Subjt:  THYNLTAQLFQQFQVLLQRFIENEPFQQALRPTIYARTRRTFPKMLHVPNIQASDPNGVQEQESDSLILAPDLLFIIEASIFTFHRFLKMEKKTSNSASL

Query:  SFRNHTQDAALLARVRSSLDKKKTKLKEVRKKSKGWKQKTCPQTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKLDVSNGKLRRDPSPLLF
        S RN TQDA LLAR+RSSLDKKK KLKEVRKKS+GWKQKT PQTYEDMQLLFGVVDIKII+RL+KM RI+KEQLLWCEEK+ KLDVS+GKLRRDPSPLLF
Subjt:  SFRNHTQDAALLARVRSSLDKKKTKLKEVRKKSKGWKQKTCPQTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKLDVSNGKLRRDPSPLLF

Query:  PC
        PC
Subjt:  PC

XP_038898126.1 uncharacterized protein LOC120085908 isoform X1 [Benincasa hispida]2.0e-23790.68Show/hide
Query:  MVGISGSSSVLNLIKKKLQDSGIPVASSPISAPTTAQLDLNLPRNVDVALKALQIENSKDKPKYANADGNVSDSSLDSEDVESGPTNEQLIIQFKEEDGT
        MVGISGSSS L+LIKKKL DSG PVASSPISA T AQLD+NLPR+VDVA+KALQIEN K+KPK AN DGNVS+SSLDSE+V+SGPTNEQL IQF EEDG 
Subjt:  MVGISGSSSVLNLIKKKLQDSGIPVASSPISAPTTAQLDLNLPRNVDVALKALQIENSKDKPKYANADGNVSDSSLDSEDVESGPTNEQLIIQFKEEDGT

Query:  DPKSIAQVEAEEDEEDFIMEEVKRRLKELRRNSFMVLIP--EEEEEEIEGGEEEEVGEGEPEWRDVEAEGRQWWGGFGAVYDDYCERMRFFDRKSVESGP
        DPK  A+VEAEEDEEDFIMEEVKRRLKELRRNSFMVLIP  EEEEEE EG EE EVGEG+PEWRDVEAEGRQWWGGFGAVYDDYCERM FFDR S+ SGP
Subjt:  DPKSIAQVEAEEDEEDFIMEEVKRRLKELRRNSFMVLIP--EEEEEEIEGGEEEEVGEGEPEWRDVEAEGRQWWGGFGAVYDDYCERMRFFDRKSVESGP

Query:  ESTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTPIDSNYHIEIAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQR
        ESTS+RSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLT IDSNYHIE AYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQR
Subjt:  ESTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTPIDSNYHIEIAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQR

Query:  FIENEPFQQALRPTIYARTRRTFPKMLHVPNIQASDPNGVQEQESDSLILAPDLLFIIEASIFTFHRFLKMEKKTSNSASLSFRNHTQDAALLARVRSSL
        FIENEPFQQ LRPTIYARTRRTFPKMLHVPNIQASDPNGVQEQESDSLILAPDLL IIEASIFTFHRFLKMEKKTSNSASLSF+NHTQDAALLARVRSSL
Subjt:  FIENEPFQQALRPTIYARTRRTFPKMLHVPNIQASDPNGVQEQESDSLILAPDLLFIIEASIFTFHRFLKMEKKTSNSASLSFRNHTQDAALLARVRSSL

Query:  DKKKTKLKEVRKKSKGWKQKTCPQTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKLDVSNGKLRRDPSPLLFPC
        DKKKTKLKEVRKKS+GWKQKTCPQTYEDMQLLFG+VDIKII+RL+KMSRITKEQLLWCEEK+NKLD+SNGKLRRD SPLLFPC
Subjt:  DKKKTKLKEVRKKSKGWKQKTCPQTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKLDVSNGKLRRDPSPLLFPC

TrEMBL top hitse value%identityAlignment
A0A0A0K6Q7 Uncharacterized protein4.6e-25696.47Show/hide
Query:  MVGISGSSSVLNLIKKKLQDSGIPVASSPISAPTTAQLDLNLPRNVDVALKALQIENSKDKPKYANADGNVSDSSLDSEDVESGPTNEQLIIQFKEEDGT
        MVGISG SSVLNLIKKKLQD+G PVASSPISAPTTAQLDLNLPR+V+VALKALQ EN KDKPKYANADGNVSDSSLDSEDVESGPT+EQLIIQFKEEDGT
Subjt:  MVGISGSSSVLNLIKKKLQDSGIPVASSPISAPTTAQLDLNLPRNVDVALKALQIENSKDKPKYANADGNVSDSSLDSEDVESGPTNEQLIIQFKEEDGT

Query:  DPKSIAQVEAEEDEEDFIMEEVKRRLKELRRNSFMVLIPEEEEEEIEGGEEEEVGEGEPEWRDVEAEGRQWWGGFGAVYDDYCERMRFFDRKSVESGPES
        DPKSIA+VEAEEDE+DFIMEEVKRRLKELRRNSFMVLIPEEEEEEIEGGEEEEVGEGE EWRDVEAEGRQWWGGFGAVYDDYCERMRFFDRKS+ESGP S
Subjt:  DPKSIAQVEAEEDEEDFIMEEVKRRLKELRRNSFMVLIPEEEEEEIEGGEEEEVGEGEPEWRDVEAEGRQWWGGFGAVYDDYCERMRFFDRKSVESGPES

Query:  TSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTPIDSNYHIEIAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQRFI
        TSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLT IDSN+HIEIAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQRFI
Subjt:  TSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTPIDSNYHIEIAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQRFI

Query:  ENEPFQQALRPTIYARTRRTFPKMLHVPNIQASDPNGVQEQESDSLILAPDLLFIIEASIFTFHRFLKMEKKTSNSASLSFRNHTQDAALLARVRSSLDK
        ENEPFQQALRPTIYARTRRT+PKMLHVPNIQASDPNGVQEQESDSLILAPDLLFIIEASIFTFHRFLKMEKKTS SASLSFRNHTQDAALLARVRSSLDK
Subjt:  ENEPFQQALRPTIYARTRRTFPKMLHVPNIQASDPNGVQEQESDSLILAPDLLFIIEASIFTFHRFLKMEKKTSNSASLSFRNHTQDAALLARVRSSLDK

Query:  KKTKLKEVRKKSKGWKQKTCPQTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKLDVSNGKLRRDPSPLLFPC
        KKTKLKEVRKKSKGWKQKTCPQTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKLDVSNGKLRRDPSPLLFPC
Subjt:  KKTKLKEVRKKSKGWKQKTCPQTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKLDVSNGKLRRDPSPLLFPC

A0A1S3CGF8 uncharacterized protein LOC1035006155.4e-25796.47Show/hide
Query:  MVGISGSSSVLNLIKKKLQDSGIPVASSPISAPTTAQLDLNLPRNVDVALKALQIENSKDKPKYANADGNVSDSSLDSEDVESGPTNEQLIIQFKEEDGT
        MVGISGSSSVLNLIKKKLQD+G PVASSPISAPTTAQLDLNL R+VDVALKALQIENS+DKPKYANADGNVSDSSLDSEDVE+ PTNEQLI QFKEEDGT
Subjt:  MVGISGSSSVLNLIKKKLQDSGIPVASSPISAPTTAQLDLNLPRNVDVALKALQIENSKDKPKYANADGNVSDSSLDSEDVESGPTNEQLIIQFKEEDGT

Query:  DPKSIAQVEAEEDEEDFIMEEVKRRLKELRRNSFMVLIPEEEEEEIEGGEEEEVGEGEPEWRDVEAEGRQWWGGFGAVYDDYCERMRFFDRKSVESGPES
        DPKSIA+VEAEEDE+DFIMEEVKRRLKELRRNSFMVLIPEEEEEE EGGEEEEVGEGEPEWRDVEAEGRQWWGGFGAVYDDYCERMRFFDRKS+ESGPES
Subjt:  DPKSIAQVEAEEDEEDFIMEEVKRRLKELRRNSFMVLIPEEEEEEIEGGEEEEVGEGEPEWRDVEAEGRQWWGGFGAVYDDYCERMRFFDRKSVESGPES

Query:  TSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTPIDSNYHIEIAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQRFI
        TSQRSASKKSASPLRCLSLKRIEEPEDEMED DP LTPIDSN+HIEIAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQRFI
Subjt:  TSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTPIDSNYHIEIAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQRFI

Query:  ENEPFQQALRPTIYARTRRTFPKMLHVPNIQASDPNGVQEQESDSLILAPDLLFIIEASIFTFHRFLKMEKKTSNSASLSFRNHTQDAALLARVRSSLDK
        ENEPFQQALRPTIYARTRRTFPKMLHVPNIQASDPNGVQEQESDSLILAPDLLFI+EASIFTFHRFLKMEKKTSNSASLSFRNHTQDAAL ARVRSSLDK
Subjt:  ENEPFQQALRPTIYARTRRTFPKMLHVPNIQASDPNGVQEQESDSLILAPDLLFIIEASIFTFHRFLKMEKKTSNSASLSFRNHTQDAALLARVRSSLDK

Query:  KKTKLKEVRKKSKGWKQKTCPQTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKLDVSNGKLRRDPSPLLFPC
        KKTKLKEVRKKSKGWKQKTCPQTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKLDVSNGKLRRDPSPLLFPC
Subjt:  KKTKLKEVRKKSKGWKQKTCPQTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKLDVSNGKLRRDPSPLLFPC

A0A5A7UTV1 DUF1666 domain-containing protein5.4e-25796.47Show/hide
Query:  MVGISGSSSVLNLIKKKLQDSGIPVASSPISAPTTAQLDLNLPRNVDVALKALQIENSKDKPKYANADGNVSDSSLDSEDVESGPTNEQLIIQFKEEDGT
        MVGISGSSSVLNLIKKKLQD+G PVASSPISAPTTAQLDLNL R+VDVALKALQIENS+DKPKYANADGNVSDSSLDSEDVE+ PTNEQLI QFKEEDGT
Subjt:  MVGISGSSSVLNLIKKKLQDSGIPVASSPISAPTTAQLDLNLPRNVDVALKALQIENSKDKPKYANADGNVSDSSLDSEDVESGPTNEQLIIQFKEEDGT

Query:  DPKSIAQVEAEEDEEDFIMEEVKRRLKELRRNSFMVLIPEEEEEEIEGGEEEEVGEGEPEWRDVEAEGRQWWGGFGAVYDDYCERMRFFDRKSVESGPES
        DPKSIA+VEAEEDE+DFIMEEVKRRLKELRRNSFMVLIPEEEEEE EGGEEEEVGEGEPEWRDVEAEGRQWWGGFGAVYDDYCERMRFFDRKS+ESGPES
Subjt:  DPKSIAQVEAEEDEEDFIMEEVKRRLKELRRNSFMVLIPEEEEEEIEGGEEEEVGEGEPEWRDVEAEGRQWWGGFGAVYDDYCERMRFFDRKSVESGPES

Query:  TSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTPIDSNYHIEIAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQRFI
        TSQRSASKKSASPLRCLSLKRIEEPEDEMED DP LTPIDSN+HIEIAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQRFI
Subjt:  TSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTPIDSNYHIEIAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQRFI

Query:  ENEPFQQALRPTIYARTRRTFPKMLHVPNIQASDPNGVQEQESDSLILAPDLLFIIEASIFTFHRFLKMEKKTSNSASLSFRNHTQDAALLARVRSSLDK
        ENEPFQQALRPTIYARTRRTFPKMLHVPNIQASDPNGVQEQESDSLILAPDLLFI+EASIFTFHRFLKMEKKTSNSASLSFRNHTQDAAL ARVRSSLDK
Subjt:  ENEPFQQALRPTIYARTRRTFPKMLHVPNIQASDPNGVQEQESDSLILAPDLLFIIEASIFTFHRFLKMEKKTSNSASLSFRNHTQDAALLARVRSSLDK

Query:  KKTKLKEVRKKSKGWKQKTCPQTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKLDVSNGKLRRDPSPLLFPC
        KKTKLKEVRKKSKGWKQKTCPQTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKLDVSNGKLRRDPSPLLFPC
Subjt:  KKTKLKEVRKKSKGWKQKTCPQTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKLDVSNGKLRRDPSPLLFPC

A0A6J1GQB8 uncharacterized protein LOC1114565404.2e-17783.58Show/hide
Query:  KEEDGTDPKSIAQVEA----------EEDEEDFIMEEVKRRLKELRRNSFMVLIPEEEEEEIEGGEEEEVGE-----GEPEWRDVEAEGRQWWGGFGAVY
        KEEDG D K +A+ EA          +ED+EDFIMEEVKRRLKELRRNSFMVLIPEEEEEE    EEEEVGE     GEPEWRDVEAEGRQWWGGFGAVY
Subjt:  KEEDGTDPKSIAQVEA----------EEDEEDFIMEEVKRRLKELRRNSFMVLIPEEEEEEIEGGEEEEVGE-----GEPEWRDVEAEGRQWWGGFGAVY

Query:  DDYCERMRFFDRKSVESGPESTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTPIDSNYHIEIAYVAHICLSWEALHCQYTQLNHLISCQPQNST
        DDYCERM FFDR S +SG ES SQRSA KKSASPLRCLSLKRIEEPEDEMED+DPS   IDSN+H+E AYVAHICLSWEALHCQYTQLNHLISCQPQN T
Subjt:  DDYCERMRFFDRKSVESGPESTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTPIDSNYHIEIAYVAHICLSWEALHCQYTQLNHLISCQPQNST

Query:  THYNLTAQLFQQFQVLLQRFIENEPFQQALRPTIYARTRRTFPKMLHVPNIQASDPNGVQEQESDSLILAPDLLFIIEASIFTFHRFLKMEKKTSNSASL
        T YNLTAQLFQQFQVLLQRFIENEPFQQALRP IYARTRRTFPKMLHVPNIQASD N  QEQESDSLILA DLL IIEASIFTFHRFLKM+KK S SASL
Subjt:  THYNLTAQLFQQFQVLLQRFIENEPFQQALRPTIYARTRRTFPKMLHVPNIQASDPNGVQEQESDSLILAPDLLFIIEASIFTFHRFLKMEKKTSNSASL

Query:  SFRNHTQDAALLARVRSSLDKKKTKLKEVRKKSKGWKQKTCPQTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKLDVSNGKLRRDPSPLLF
        S RN TQDA LLAR+RSSLDKKK KLKEVRKKS+GWKQKT PQTYEDMQLLFGVVDIKII+RL+KM RI+KEQLLWCEEK+ KLDVS+GKLRRDPSPLLF
Subjt:  SFRNHTQDAALLARVRSSLDKKKTKLKEVRKKSKGWKQKTCPQTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKLDVSNGKLRRDPSPLLF

Query:  PC
        PC
Subjt:  PC

A0A6J1JYQ1 uncharacterized protein LOC1114886379.4e-17783.58Show/hide
Query:  KEEDGTDPKSIAQVEA----------EEDEEDFIMEEVKRRLKELRRNSFMVLIPEEEEEEIEGGEEEEVGE-----GEPEWRDVEAEGRQWWGGFGAVY
        KEEDG D K +A+ EA          +ED+EDFIMEEVKRRLKELRRNSFMVLIPEEEEEE    EEEEVGE     GEPEWRDVEAEGRQWWGGFGAVY
Subjt:  KEEDGTDPKSIAQVEA----------EEDEEDFIMEEVKRRLKELRRNSFMVLIPEEEEEEIEGGEEEEVGE-----GEPEWRDVEAEGRQWWGGFGAVY

Query:  DDYCERMRFFDRKSVESGPESTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTPIDSNYHIEIAYVAHICLSWEALHCQYTQLNHLISCQPQNST
        DDYCERM FFDR S +SG ES SQRSA KKSASPLRCLSLKRIEEPEDEMED+DPS T  DSN+H+E AYVAHICLSWEALHCQYTQLNHLISCQPQN T
Subjt:  DDYCERMRFFDRKSVESGPESTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTPIDSNYHIEIAYVAHICLSWEALHCQYTQLNHLISCQPQNST

Query:  THYNLTAQLFQQFQVLLQRFIENEPFQQALRPTIYARTRRTFPKMLHVPNIQASDPNGVQEQESDSLILAPDLLFIIEASIFTFHRFLKMEKKTSNSASL
        T YNLTAQLFQQFQVLLQRFIENEPFQQALRP IYARTRRTFPKMLHVPNIQASD N  QEQESDSLILA DLL IIEASIFTFHRFLKM+KK S SASL
Subjt:  THYNLTAQLFQQFQVLLQRFIENEPFQQALRPTIYARTRRTFPKMLHVPNIQASDPNGVQEQESDSLILAPDLLFIIEASIFTFHRFLKMEKKTSNSASL

Query:  SFRNHTQDAALLARVRSSLDKKKTKLKEVRKKSKGWKQKTCPQTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKLDVSNGKLRRDPSPLLF
        S RN TQDA LLAR+RSSLDKKK KLKEVRKKS+GWKQKT PQTYEDMQLLFGVVDIKII+RL+KM RI+KEQLLWCEEK+ KLDVS+GKLRRDPSPLLF
Subjt:  SFRNHTQDAALLARVRSSLDKKKTKLKEVRKKSKGWKQKTCPQTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKLDVSNGKLRRDPSPLLF

Query:  PC
        PC
Subjt:  PC

SwissProt top hitse value%identityAlignment
Q9LT25 Pre-mRNA-processing protein 40C9.2e-0436.17Show/hide
Query:  SSVLNLIKKKLQDSGIPVASSPISAPTTAQLDLNLPRNVDVALKALQIENSKDKPKYANADGNVSDSSLDSEDVESGPTNEQLIIQFKE---EDGTDPKS
        SS L+L+KKKL DSG+PV+S       T   + N  +  +V     +  NS  K K A   G +SDSS DSED +SGP+ E+   QFKE   E G  P S
Subjt:  SSVLNLIKKKLQDSGIPVASSPISAPTTAQLDLNLPRNVDVALKALQIENSKDKPKYANADGNVSDSSLDSEDVESGPTNEQLIIQFKE---EDGTDPKS

Query:  IAQVEAEEDEEDFIMEEVKRRLKELRRNSFMVLIPEEEEEE
          + E  +   D   + +      +RR+ F   +    EEE
Subjt:  IAQVEAEEDEEDFIMEEVKRRLKELRRNSFMVLIPEEEEEE

Arabidopsis top hitse value%identityAlignment
AT1G69610.1 Protein of unknown function (DUF1666)4.4e-2527.75Show/hide
Query:  NEQLIIQFKEEDGTDPKSIAQVEAEEDEEDFIMEEVKRRLKELRRNSFMVLIPEEEEEEIEGGEEEEVGEGEPEWRDVEAEGRQWWGGFGAVYDDYCERM
        +EQ  +  + +DG+D       + +E E   ++E++K  L+  R      ++ EE E  ++  +  ++     + +D  AE          VY +Y  +M
Subjt:  NEQLIIQFKEEDGTDPKSIAQVEAEEDEEDFIMEEVKRRLKELRRNSFMVLIPEEEEEEIEGGEEEEVGEGEPEWRDVEAEGRQWWGGFGAVYDDYCERM

Query:  R---FFDRKSVES------GPESTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTPI-DSNYHIEIAYVAHICLSWEALHCQYTQLNHLISCQPQ
        R     D +++ S         S   R+  K   S L     + I   +    + DPS   + +++   E  YV  +CLSWE L  QY   + ++    Q
Subjt:  R---FFDRKSVES------GPESTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTPI-DSNYHIEIAYVAHICLSWEALHCQYTQLNHLISCQPQ

Query:  NSTTHYNLTAQLFQQFQVLLQRFIENEPFQQALRPTIYARTRRTFPKMLHVPNIQASDPNGVQ-EQESDSLILAPDLLFIIEASIFTFHRFLKMEK----
         +T  YNL A  FQ FQVLLQRF+ENEPFQ + R   Y + RR F   L +P ++    +  +   E +  +    L  II  S+  F  FL  +K    
Subjt:  NSTTHYNLTAQLFQQFQVLLQRFIENEPFQQALRPTIYARTRRTFPKMLHVPNIQASDPNGVQ-EQESDSLILAPDLLFIIEASIFTFHRFLKMEK----

Query:  ---KTSNSASLSFRNHTQDAALLARVRSSLDKKKTKLKEVRKK-----SKGWKQKTCPQTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKL
           K S+   +S ++ + D  LL  +R+ L KK+ KLKE+++       K  K ++        +LL   +++++++R++ MS++T E+L WC+EK+ K+
Subjt:  ---KTSNSASLSFRNHTQDAALLARVRSSLDKKKTKLKEVRKK-----SKGWKQKTCPQTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKL

Query:  DVSNGKLRRDPSPLLFPC
          +  K+  +P   L PC
Subjt:  DVSNGKLRRDPSPLLFPC

AT1G73850.1 Protein of unknown function (DUF1666)2.8e-3231.69Show/hide
Query:  EEEEEEIEGGEEEEVGE-------------GEPEWRD-------VEAEGRQ---WWGGFGAVYDDYCERMRFFDRKSVESGPESTSQRSASKKSASPLRC
        E + EE+E  EEEE G+                EWR+            R+    W  +  V+  Y E M F  R S +   E+ S +S   +  S    
Subjt:  EEEEEEIEGGEEEEVGE-------------GEPEWRD-------VEAEGRQ---WWGGFGAVYDDYCERMRFFDRKSVESGPESTSQRSASKKSASPLRC

Query:  LSLKRIEEPEDEMEDVDPSLTPIDSNYHIEI--AYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLT--------AQLFQQFQVLLQRFIENEPFQ
        +  K       + +   P       N ++E+  AYVA ICL+WEAL   Y         + + STT  +          A  F+ F +LLQR++ENEP++
Subjt:  LSLKRIEEPEDEMEDVDPSLTPIDSNYHIEI--AYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLT--------AQLFQQFQVLLQRFIENEPFQ

Query:  QALRPTIYARTRRTFPKMLHVPNIQASDPNGVQEQESD----SLILAPDLLFIIEASIFTFHRFLKMEK-KTSNSASLSFRNHTQ----DAALLARVRSS
           RP IYAR R   PK+L VP  Q  +    +E E++    S I +   L I+E  I TF  FL+ +K K       +F   ++    D  L+  ++  
Subjt:  QALRPTIYARTRRTFPKMLHVPNIQASDPNGVQEQESD----SLILAPDLLFIIEASIFTFHRFLKMEK-KTSNSASLSFRNHTQ----DAALLARVRSS

Query:  LDKKKTKLKEVRKKSKGWKQKTCPQTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKLDVSNG--KLRRDPSPLLFP
          KKKTKLKE+R+  K  ++K      E+M++L G++D+K+++R+L+M+ + +E L WCEEKM+K+ +  G   L+RD +PL FP
Subjt:  LDKKKTKLKEVRKKSKGWKQKTCPQTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKLDVSNG--KLRRDPSPLLFP

AT3G20260.1 Protein of unknown function (DUF1666)4.0e-11157Show/hide
Query:  EAEEDEEDFIMEEVKRRLKELRRNSFMVLIPEEEEEEIEGG---EEEEVGEGE--PEWRDVEAEGRQWWGGFGAVYDDYCERMRFFDRKS----------
        E E+D++DFI  EVKRRLKELRRNSFMVLIPEEEEEE E     E+++ GE +   EWRDV AEG QWWGGF AVY+ YCERM FFDR S          
Subjt:  EAEEDEEDFIMEEVKRRLKELRRNSFMVLIPEEEEEEIEGG---EEEEVGEGE--PEWRDVEAEGRQWWGGFGAVYDDYCERMRFFDRKS----------

Query:  VESGPESTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTPIDSNYH-IEIAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQF
        +   P + S RSASKK +SP RCLSLK+ + PE+++E + P  T +D  Y  +E AYVA +CL+WEALHCQYTQL+HLISCQP+  T  YN TAQLFQQF
Subjt:  VESGPESTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTPIDSNYH-IEIAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQF

Query:  QVLLQRFIENEPFQQALRPTIYARTRRTFPKMLHVPNIQASDPNGVQEQESDSLILAPDLLFIIEASIFTFHRFLKMEKKTSNSASLSF----RNHTQDA
         VLLQR+IENEPF+Q  R  +YAR R   PK+L  P IQ SD   + E+++  ++LA DL+ +IE+SI TF+ FLKM+KK  N     F     NH    
Subjt:  QVLLQRFIENEPFQQALRPTIYARTRRTFPKMLHVPNIQASDPNGVQEQESDSLILAPDLLFIIEASIFTFHRFLKMEKKTSNSASLSF----RNHTQDA

Query:  ALLARVRSSLDKKKTKLKEVRKKSKGWKQKTCPQTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKLDVSNGKLRRDPSPLLFPC
          L  V+SS+DKK+ K KE+ KK+KG ++K+ PQT+E +QLLF  +DIK+ TR+L+MS+I+KEQLLWCEEKM KL+ S GKL+R PSP+LFPC
Subjt:  ALLARVRSSLDKKKTKLKEVRKKSKGWKQKTCPQTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKLDVSNGKLRRDPSPLLFPC

AT5G39785.1 Protein of unknown function (DUF1666)1.7e-3230Show/hide
Query:  DGNVSDSSLDSEDVESGPTNEQLIIQFKEEDGTDPKSIAQVEAEEDEEDF--------IMEEVKRRLKELRRNSFMVLIPEEEEEEIEGGEEEEVGEGEP
        DG +SDS      ++ G        Q ++ D +   S ++ E EED   F        ++E++K  +K+++    +  I EEEEE+    +  ++ E   
Subjt:  DGNVSDSSLDSEDVESGPTNEQLIIQFKEEDGTDPKSIAQVEAEEDEEDF--------IMEEVKRRLKELRRNSFMVLIPEEEEEEIEGGEEEEVGEGEP

Query:  EWRDVEAEGRQWWGGFGAVYD---DYCERMRFFD----RKSVESG---------PESTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTPIDSNY
         WR  E +  +     G V+     Y ERMR  D    +KS   G           ST   + S+ S S +  ++++  +  + E+E +   +  I    
Subjt:  EWRDVEAEGRQWWGGFGAVYD---DYCERMRFFD----RKSVESG---------PESTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTPIDSNY

Query:  HIEIAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQRFIENEPFQQALRPTIYARTRRTFPKMLHVPNIQ----ASDPNGVQ
         +E  YV  +CLSWE LH QY +   L+      S   YN  A  FQQFQVLLQRF+ENEPF++  R   Y + R     +L +P I+        NG +
Subjt:  HIEIAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQRFIENEPFQQALRPTIYARTRRTFPKMLHVPNIQ----ASDPNGVQ

Query:  ---EQESDSLILAPDLLFIIEASIFTFHRFLKMEKKTSNSASLSFRNHTQ----------DAALLARVRSSLDKKKTKLKEVRKKS-----KGWKQKTCP
           E+ +D +I +  L+ I+E +I  F RF++ +K TS+      R  +Q          D  + A V+S L  K+ +L++V K       +  K K   
Subjt:  ---EQESDSLILAPDLLFIIEASIFTFHRFLKMEKKTSNSASLSFRNHTQ----------DAALLARVRSSLDKKKTKLKEVRKKS-----KGWKQKTCP

Query:  QTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKLDVSNGKLRRDPSPLLFPC
         T + +   F  VD+K++TR+L MS++T++ L+WC  K+ K++  N +L  DPS  LFPC
Subjt:  QTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKLDVSNGKLRRDPSPLLFPC

AT5G39785.2 Protein of unknown function (DUF1666)1.5e-3029.72Show/hide
Query:  DGNVSDSSLDSEDVESGPTNEQLIIQFKEEDGTDPKSIAQVEAEEDEEDF--------IMEEVKRRLKELRRNSFMVLIPEEEEEEIEGGEEEEVGEGEP
        DG +SDS      ++ G        Q ++ D +   S ++ E EED   F        ++E++K  +K+++    +  I EEEEE+    +  ++ E   
Subjt:  DGNVSDSSLDSEDVESGPTNEQLIIQFKEEDGTDPKSIAQVEAEEDEEDF--------IMEEVKRRLKELRRNSFMVLIPEEEEEEIEGGEEEEVGEGEP

Query:  EWRDVEAEGRQWWGGFGAVYD---DYCERMRFFD----RKSVESG---------PESTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTPIDSNY
         WR  E +  +     G V+     Y ERMR  D    +KS   G           ST   + S+ S S +  ++++  +  + E+E +   +  I    
Subjt:  EWRDVEAEGRQWWGGFGAVYD---DYCERMRFFD----RKSVESG---------PESTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTPIDSNY

Query:  HIEIAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQRFIENEPFQQALRPTIYARTRRTFPKMLHVPNIQ----ASDPNGVQ
         +E  YV  +CLSWE LH QY +   L+      S   YN  A  FQQFQVLLQRF+ENEPF++  R   Y + R     +L +P I+        NG +
Subjt:  HIEIAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQRFIENEPFQQALRPTIYARTRRTFPKMLHVPNIQ----ASDPNGVQ

Query:  ---EQESDSLILAPDLLFIIEASIFTFHRFLKMEKKTSNSASLSFRNHTQ----------DAALLARVRSSLDK-KKTKLKEVRKKS-----KGWKQKTC
           E+ +D +I +  L+ I+E +I  F RF++ +K TS+      R  +Q          D  + A V+S L    + +L++V K       +  K K  
Subjt:  ---EQESDSLILAPDLLFIIEASIFTFHRFLKMEKKTSNSASLSFRNHTQ----------DAALLARVRSSLDK-KKTKLKEVRKKS-----KGWKQKTC

Query:  PQTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKLDVSNGKLRRDPSPLLFPC
          T + +   F  VD+K++TR+L MS++T++ L+WC  K+ K++  N +L  DPS  LFPC
Subjt:  PQTYEDMQLLFGVVDIKIITRLLKMSRITKEQLLWCEEKMNKLDVSNGKLRRDPSPLLFPC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTAGGAATATCAGGGTCATCTTCTGTTTTGAACTTGATCAAGAAGAAACTGCAAGACTCTGGAATTCCCGTAGCTTCCTCACCTATTTCAGCTCCAACAACAGCTCA
ATTAGATCTAAATCTACCAAGAAATGTCGATGTTGCACTGAAGGCACTGCAAATAGAGAACAGCAAAGATAAACCCAAATATGCTAATGCTGATGGAAATGTATCTGACT
CCTCCTTGGACTCTGAAGATGTAGAAAGCGGACCAACTAATGAGCAATTAATCATCCAGTTTAAGGAAGAGGATGGCACAGATCCTAAGAGTATTGCACAAGTGGAAGCT
GAAGAGGATGAGGAGGATTTCATAATGGAGGAGGTAAAGAGGAGACTGAAGGAGCTGAGGAGGAACAGTTTTATGGTGTTGATTCCAGAGGAAGAAGAAGAAGAAATCGA
AGGAGGAGAAGAAGAAGAAGTAGGTGAAGGGGAGCCTGAGTGGAGAGACGTGGAAGCAGAAGGTCGACAATGGTGGGGAGGGTTTGGTGCTGTTTATGATGATTACTGCG
AGAGGATGCGTTTCTTTGATCGTAAGAGCGTTGAATCTGGTCCTGAATCAACCTCCCAAAGATCTGCATCGAAAAAAAGTGCATCTCCTCTTCGGTGTCTTTCTCTGAAG
AGGATTGAAGAACCTGAAGATGAGATGGAGGATGTTGATCCTTCATTGACTCCGATTGACTCCAATTACCACATAGAAATAGCGTATGTTGCTCACATTTGCTTGTCCTG
GGAGGCCCTTCACTGTCAGTACACTCAACTTAACCACTTAATATCATGCCAACCCCAAAACTCTACTACTCATTATAATCTTACTGCTCAGCTCTTTCAGCAATTTCAAG
TCCTCTTGCAAAGGTTTATTGAAAACGAACCCTTTCAACAAGCTCTCAGGCCTACAATTTATGCCCGAACCCGTCGAACTTTTCCTAAAATGTTGCATGTTCCTAACATA
CAAGCTTCAGATCCAAACGGGGTGCAGGAACAGGAATCTGATTCCCTCATCCTCGCTCCTGACCTGCTGTTCATTATTGAGGCTTCAATCTTTACTTTCCACCGCTTCCT
GAAGATGGAGAAGAAAACCTCAAATTCTGCTTCTTTATCATTTCGGAACCACACCCAGGATGCTGCTCTGCTTGCTCGTGTTCGATCTTCTCTTGACAAGAAGAAAACGA
AGCTGAAAGAGGTTAGGAAGAAAAGTAAAGGGTGGAAACAGAAAACGTGTCCTCAAACGTATGAAGACATGCAATTACTTTTTGGAGTTGTGGACATTAAAATCATAACA
AGGCTTCTTAAGATGTCGAGGATTACTAAAGAGCAGCTGCTTTGGTGCGAGGAGAAAATGAACAAGTTAGATGTGTCTAATGGAAAATTGCGGAGAGATCCGTCTCCCCT
TCTTTTCCCATGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGTAGGAATATCAGGGTCATCTTCTGTTTTGAACTTGATCAAGAAGAAACTGCAAGACTCTGGAATTCCCGTAGCTTCCTCACCTATTTCAGCTCCAACAACAGCTCA
ATTAGATCTAAATCTACCAAGAAATGTCGATGTTGCACTGAAGGCACTGCAAATAGAGAACAGCAAAGATAAACCCAAATATGCTAATGCTGATGGAAATGTATCTGACT
CCTCCTTGGACTCTGAAGATGTAGAAAGCGGACCAACTAATGAGCAATTAATCATCCAGTTTAAGGAAGAGGATGGCACAGATCCTAAGAGTATTGCACAAGTGGAAGCT
GAAGAGGATGAGGAGGATTTCATAATGGAGGAGGTAAAGAGGAGACTGAAGGAGCTGAGGAGGAACAGTTTTATGGTGTTGATTCCAGAGGAAGAAGAAGAAGAAATCGA
AGGAGGAGAAGAAGAAGAAGTAGGTGAAGGGGAGCCTGAGTGGAGAGACGTGGAAGCAGAAGGTCGACAATGGTGGGGAGGGTTTGGTGCTGTTTATGATGATTACTGCG
AGAGGATGCGTTTCTTTGATCGTAAGAGCGTTGAATCTGGTCCTGAATCAACCTCCCAAAGATCTGCATCGAAAAAAAGTGCATCTCCTCTTCGGTGTCTTTCTCTGAAG
AGGATTGAAGAACCTGAAGATGAGATGGAGGATGTTGATCCTTCATTGACTCCGATTGACTCCAATTACCACATAGAAATAGCGTATGTTGCTCACATTTGCTTGTCCTG
GGAGGCCCTTCACTGTCAGTACACTCAACTTAACCACTTAATATCATGCCAACCCCAAAACTCTACTACTCATTATAATCTTACTGCTCAGCTCTTTCAGCAATTTCAAG
TCCTCTTGCAAAGGTTTATTGAAAACGAACCCTTTCAACAAGCTCTCAGGCCTACAATTTATGCCCGAACCCGTCGAACTTTTCCTAAAATGTTGCATGTTCCTAACATA
CAAGCTTCAGATCCAAACGGGGTGCAGGAACAGGAATCTGATTCCCTCATCCTCGCTCCTGACCTGCTGTTCATTATTGAGGCTTCAATCTTTACTTTCCACCGCTTCCT
GAAGATGGAGAAGAAAACCTCAAATTCTGCTTCTTTATCATTTCGGAACCACACCCAGGATGCTGCTCTGCTTGCTCGTGTTCGATCTTCTCTTGACAAGAAGAAAACGA
AGCTGAAAGAGGTTAGGAAGAAAAGTAAAGGGTGGAAACAGAAAACGTGTCCTCAAACGTATGAAGACATGCAATTACTTTTTGGAGTTGTGGACATTAAAATCATAACA
AGGCTTCTTAAGATGTCGAGGATTACTAAAGAGCAGCTGCTTTGGTGCGAGGAGAAAATGAACAAGTTAGATGTGTCTAATGGAAAATTGCGGAGAGATCCGTCTCCCCT
TCTTTTCCCATGTTAA
Protein sequenceShow/hide protein sequence
MVGISGSSSVLNLIKKKLQDSGIPVASSPISAPTTAQLDLNLPRNVDVALKALQIENSKDKPKYANADGNVSDSSLDSEDVESGPTNEQLIIQFKEEDGTDPKSIAQVEA
EEDEEDFIMEEVKRRLKELRRNSFMVLIPEEEEEEIEGGEEEEVGEGEPEWRDVEAEGRQWWGGFGAVYDDYCERMRFFDRKSVESGPESTSQRSASKKSASPLRCLSLK
RIEEPEDEMEDVDPSLTPIDSNYHIEIAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQRFIENEPFQQALRPTIYARTRRTFPKMLHVPNI
QASDPNGVQEQESDSLILAPDLLFIIEASIFTFHRFLKMEKKTSNSASLSFRNHTQDAALLARVRSSLDKKKTKLKEVRKKSKGWKQKTCPQTYEDMQLLFGVVDIKIIT
RLLKMSRITKEQLLWCEEKMNKLDVSNGKLRRDPSPLLFPC