CuGenDBv2

HG10014971 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10014971
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: DUF155 domain-containing protein
Genome location: Chr02:22464835..22469159
RNA-Seq Expression: HG10014971
Synteny: HG10014971
Gene Ontology terms:
GO:0007005 - mitochondrion organization (biological process)
GO:0009738 - abscisic acid-activated signaling pathway (biological process)
GO:0031930 - mitochondria-nucleus signaling pathway (biological process)
GO:0005739 - mitochondrion (cellular component)
InterPro domains: IPR003734 - Domain of unknown function DUF155
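
The genome location field can be split into a chromosome and a coordinate pair for downstream use. The following is a minimal sketch, not part of the database: the names `Location` and `parse_location` are hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a 'Chr:start..end' location string."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: 1-based, inclusive coordinates.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a location such as 'Chr02:22464835..22469159'."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    chrom, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return Location(chrom, start, end)


loc = parse_location("Chr02:22464835..22469159")
print(loc.chrom, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene region reported here spans 4,325 bp.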


Homology
GenBank top hits
XP_004143605.3 uncharacterized protein LOC101222647 [Cucumis sativus] | e value: 1.3e-219 | % identity: 92.74
Query:  MWRTIDAHLRSVRLLPNLPAHSSSSSASSLFSSGRSFLARS-NSTFLSPVPKPHSIALSKTLGFRATINSLSSVSCFGLGIQRFGGSSCGVLVLARCITS
        MWRTIDAHLRSVRLLP+L + SSSSS+SS FSSGRSF+ RS ++T  SP PKPHSI LSKTL F   IN LSSVSCF LGIQR  GS+ GVLVLARCITS
Subjt:  MWRTIDAHLRSVRLLPNLPAHSSSSSASSLFSSGRSFLARS-NSTFLSPVPKPHSIALSKTLGFRATINSLSSVSCFGLGIQRFGGSSCGVLVLARCITS

Query:  SGHTLEWNEPVSCSEVGDGGFRSVAEGISDGEGDEVEEDSRPSIPVRAYFFSTSVDLRSLVDQNRRNFIPPSSRMTNYVVLKFGDLCNVNTHGASISGSD
        S ++LEWNEPVSCSEVGDGGFRSV EGISDGEGDEVEEDSRPSIPVRAYFFSTSVDLRSLVDQN+RNFIPPSSRMTNYVVLKFGDLCNVNTHGASI GSD
Subjt:  SGHTLEWNEPVSCSEVGDGGFRSVAEGISDGEGDEVEEDSRPSIPVRAYFFSTSVDLRSLVDQNRRNFIPPSSRMTNYVVLKFGDLCNVNTHGASISGSD

Query:  CCFMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQVDGMVAE
        CCFMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQVDGMVAE
Subjt:  CCFMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQVDGMVAE

Query:  FTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQNRKSDFL
        FTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQNRKSDFL
Subjt:  FTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQNRKSDFL

Query:  EWLIIALIGAEILLSLYDIIHRSAANL
        EWLIIALIGAEILLSLYDIIHRSAANL
Subjt:  EWLIIALIGAEILLSLYDIIHRSAANL

XP_008445840.1 PREDICTED: uncharacterized protein LOC103488742 [Cucumis melo] | e value: 1.9e-220 | % identity: 91.94
Query:  MWRTIDAHLRSVRLLPNLPAHSSSS-------SASSLFSSGRSFLARS-NSTFLSPVPKPHSIALSKTLGFRATINSLSSVSCFGLGIQRFGGSSCGVLV
        MWRTIDAHLRSVRLLP+L +HSSSS       S+SSLFSSGRSFL RS ++T +SP+PKPHSI LSKTL F   IN  SSVSCF LGIQRF GS+ GVLV
Subjt:  MWRTIDAHLRSVRLLPNLPAHSSSS-------SASSLFSSGRSFLARS-NSTFLSPVPKPHSIALSKTLGFRATINSLSSVSCFGLGIQRFGGSSCGVLV

Query:  LARCITSSGHTLEWNEPVSCSEVGDGGFRSVAEGISDGEGDEVEEDSRPSIPVRAYFFSTSVDLRSLVDQNRRNFIPPSSRMTNYVVLKFGDLCNVNTHG
        LARCITSS +TLEWNEPVSCSEVGDGGFRSV EGISDGEGDEVEEDSRPSIPVRAYFFSTSVDLRSLVDQN+RNFIPPSSRMTNYVVLKFGDLCNVNTH 
Subjt:  LARCITSSGHTLEWNEPVSCSEVGDGGFRSVAEGISDGEGDEVEEDSRPSIPVRAYFFSTSVDLRSLVDQNRRNFIPPSSRMTNYVVLKFGDLCNVNTHG

Query:  ASISGSDCCFMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQ
        ASISGSDCCFMVVFQYGSIVLFNVREH+VDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQ
Subjt:  ASISGSDCCFMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQ

Query:  VDGMVAEFTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQ
        VDGMVAEFTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQ
Subjt:  VDGMVAEFTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQ

Query:  NRKSDFLEWLIIALIGAEILLSLYDIIHRSAANL
        NRKSDFLEWLIIALIGAEILLSLYDIIHRSAANL
Subjt:  NRKSDFLEWLIIALIGAEILLSLYDIIHRSAANL

XP_022139336.1 uncharacterized protein LOC111010276 [Momordica charantia] | e value: 1.0e-216 | % identity: 90.63
Query:  MWRTIDAHLRSVRLLPNLPAHSSSSSASS-LFSSGRSFLARSNSTFLSPVPKPHSIALSKTLGFRATINSLSSVSCFGLGIQRFGGSSCGVLVLARCITS
        MWRTIDAHLRSVRL+P L A+SSSSS+SS LF++GRSFL RS+S+ LSPVP+ HSI L +TL  RA +N  SS  C GLGI+RFG SSCG++VLARCITS
Subjt:  MWRTIDAHLRSVRLLPNLPAHSSSSSASS-LFSSGRSFLARSNSTFLSPVPKPHSIALSKTLGFRATINSLSSVSCFGLGIQRFGGSSCGVLVLARCITS

Query:  SGHTLEWNEPVSCSEVGDGGFRSVAEGISDGEGDEVEEDSRPSIPVRAYFFSTSVDLRSLVDQNRRNFIPPSSRMTNYVVLKFGDLCNVNTHGASISGSD
        S HTLEWNEPVSCSEVGDGGFRS+ EG+SDGEGDEVEEDSRPSIPVRAYFFSTSVDLRSLVDQN+ NFIPPSSRMTNYVVLKFGDLCN NT  ASI+GSD
Subjt:  SGHTLEWNEPVSCSEVGDGGFRSVAEGISDGEGDEVEEDSRPSIPVRAYFFSTSVDLRSLVDQNRRNFIPPSSRMTNYVVLKFGDLCNVNTHGASISGSD

Query:  CCFMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQVDGMVAE
        CCFMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQVDGMVAE
Subjt:  CCFMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQVDGMVAE

Query:  FTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQNRKSDFL
        FTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQNRKSDFL
Subjt:  FTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQNRKSDFL

Query:  EWLIIALIGAEILLSLYDIIHRSAANL
        EWLIIALIGAEILLSLYDIIHRSAANL
Subjt:  EWLIIALIGAEILLSLYDIIHRSAANL

XP_023513788.1 uncharacterized protein LOC111778014 [Cucurbita pepo subsp. pepo] | e value: 6.3e-211 | % identity: 90.38
Query:  MWRTIDAHLRSVRLLPNLPAHSSSSSASSLFSSGRSFLARSNSTFLSPVPKPHSIALSKTLGFRATINSLSSVSCFGLGIQRFGGSSCGVLVLARCITSS
        MWR IDAHLRSVRLLPNL A+SSS   S LFSSGRSFLARS+STF+SPVPKPHSI LSKTL F  TIN LSSVSC G+GI+RFGGSSCGV+VLARCITSS
Subjt:  MWRTIDAHLRSVRLLPNLPAHSSSSSASSLFSSGRSFLARSNSTFLSPVPKPHSIALSKTLGFRATINSLSSVSCFGLGIQRFGGSSCGVLVLARCITSS

Query:  GHTLEWNEPVSCSEVGDGGFRSVAEGISDGEGDEVEEDSRPSIPVRAYFFSTSVDLRSLVDQNRRNFIPPSSRMTNYVVLKFGDLCNVNTHGASISGSDC
         HTLEWNEPVSCSEVG        EGI +GE DEVEEDSRPSIPVRAYF STSVDLRSLVDQN+RNFIPPSSRMTNYVVLKFGDLCNVNT GAS+SGSD 
Subjt:  GHTLEWNEPVSCSEVGDGGFRSVAEGISDGEGDEVEEDSRPSIPVRAYFFSTSVDLRSLVDQNRRNFIPPSSRMTNYVVLKFGDLCNVNTHGASISGSDC

Query:  CFMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQVDGMVAEF
         +MVVFQYGSIVLFN+RE EVDGYLKIVEKHASGLLPEMRKDEYEVREK ALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQVDGMVAEF
Subjt:  CFMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQVDGMVAEF

Query:  TDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQNRKSDFLE
        TDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQNRKSDFLE
Subjt:  TDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQNRKSDFLE

Query:  WLIIALIGAEILLSLYDIIHRSAANL
        WLIIALIGAEILLS+YDIIHRSAANL
Subjt:  WLIIALIGAEILLSLYDIIHRSAANL

XP_038891437.1 uncharacterized protein LOC120080856 [Benincasa hispida] | e value: 2.7e-230 | % identity: 95.36
Query:  MWRTIDAHLRSVRLLPNLPAHSS-----SSSASSLFSSGRSFLARSNSTFLSPVPKPHSIALSKTLGFRATINSLSSVSCFGLGIQRFGGSSCGVLVLAR
        MWR+IDAHLRSVRLLPNL AHSS     SSSASSLFSSGRSF ARS+STFLSPVPKPHS+ L KTLG RATIN LSSVSCFGLGIQRFGGS+CGVLVLA+
Subjt:  MWRTIDAHLRSVRLLPNLPAHSS-----SSSASSLFSSGRSFLARSNSTFLSPVPKPHSIALSKTLGFRATINSLSSVSCFGLGIQRFGGSSCGVLVLAR

Query:  CITSSGHTLEWNEPVSCSEVGDGGFRSVAEGISDGEGDEVEEDSRPSIPVRAYFFSTSVDLRSLVDQNRRNFIPPSSRMTNYVVLKFGDLCNVNTHGASI
        CITSS HTLEWNEPV CSEVGDGGFRSV EGISDGEGDEVEEDSRPSIPVRAYFFSTSVDLRSLVDQN+RNFIPPSSRMTNYVVLKFGDLCNVNTHGASI
Subjt:  CITSSGHTLEWNEPVSCSEVGDGGFRSVAEGISDGEGDEVEEDSRPSIPVRAYFFSTSVDLRSLVDQNRRNFIPPSSRMTNYVVLKFGDLCNVNTHGASI

Query:  SGSDCCFMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQVDG
        SGSDCCFMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQVDG
Subjt:  SGSDCCFMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQVDG

Query:  MVAEFTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQNRK
        MVAEFTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQNRK
Subjt:  MVAEFTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQNRK

Query:  SDFLEWLIIALIGAEILLSLYDIIHRSAANL
        SDFLEWLIIALIGAEILLSLYDIIHRSAANL
Subjt:  SDFLEWLIIALIGAEILLSLYDIIHRSAANL

TrEMBL top hits
A0A0A0KNM3 DUF155 domain-containing protein | e value: 6.1e-220 | % identity: 92.74
Query:  MWRTIDAHLRSVRLLPNLPAHSSSSSASSLFSSGRSFLARS-NSTFLSPVPKPHSIALSKTLGFRATINSLSSVSCFGLGIQRFGGSSCGVLVLARCITS
        MWRTIDAHLRSVRLLP+L + SSSSS+SS FSSGRSF+ RS ++T  SP PKPHSI LSKTL F   IN LSSVSCF LGIQR  GS+ GVLVLARCITS
Subjt:  MWRTIDAHLRSVRLLPNLPAHSSSSSASSLFSSGRSFLARS-NSTFLSPVPKPHSIALSKTLGFRATINSLSSVSCFGLGIQRFGGSSCGVLVLARCITS

Query:  SGHTLEWNEPVSCSEVGDGGFRSVAEGISDGEGDEVEEDSRPSIPVRAYFFSTSVDLRSLVDQNRRNFIPPSSRMTNYVVLKFGDLCNVNTHGASISGSD
        S ++LEWNEPVSCSEVGDGGFRSV EGISDGEGDEVEEDSRPSIPVRAYFFSTSVDLRSLVDQN+RNFIPPSSRMTNYVVLKFGDLCNVNTHGASI GSD
Subjt:  SGHTLEWNEPVSCSEVGDGGFRSVAEGISDGEGDEVEEDSRPSIPVRAYFFSTSVDLRSLVDQNRRNFIPPSSRMTNYVVLKFGDLCNVNTHGASISGSD

Query:  CCFMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQVDGMVAE
        CCFMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQVDGMVAE
Subjt:  CCFMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQVDGMVAE

Query:  FTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQNRKSDFL
        FTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQNRKSDFL
Subjt:  FTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQNRKSDFL

Query:  EWLIIALIGAEILLSLYDIIHRSAANL
        EWLIIALIGAEILLSLYDIIHRSAANL
Subjt:  EWLIIALIGAEILLSLYDIIHRSAANL

A0A1S3BEH0 uncharacterized protein LOC103488742 | e value: 9.4e-221 | % identity: 91.94
Query:  MWRTIDAHLRSVRLLPNLPAHSSSS-------SASSLFSSGRSFLARS-NSTFLSPVPKPHSIALSKTLGFRATINSLSSVSCFGLGIQRFGGSSCGVLV
        MWRTIDAHLRSVRLLP+L +HSSSS       S+SSLFSSGRSFL RS ++T +SP+PKPHSI LSKTL F   IN  SSVSCF LGIQRF GS+ GVLV
Subjt:  MWRTIDAHLRSVRLLPNLPAHSSSS-------SASSLFSSGRSFLARS-NSTFLSPVPKPHSIALSKTLGFRATINSLSSVSCFGLGIQRFGGSSCGVLV

Query:  LARCITSSGHTLEWNEPVSCSEVGDGGFRSVAEGISDGEGDEVEEDSRPSIPVRAYFFSTSVDLRSLVDQNRRNFIPPSSRMTNYVVLKFGDLCNVNTHG
        LARCITSS +TLEWNEPVSCSEVGDGGFRSV EGISDGEGDEVEEDSRPSIPVRAYFFSTSVDLRSLVDQN+RNFIPPSSRMTNYVVLKFGDLCNVNTH 
Subjt:  LARCITSSGHTLEWNEPVSCSEVGDGGFRSVAEGISDGEGDEVEEDSRPSIPVRAYFFSTSVDLRSLVDQNRRNFIPPSSRMTNYVVLKFGDLCNVNTHG

Query:  ASISGSDCCFMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQ
        ASISGSDCCFMVVFQYGSIVLFNVREH+VDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQ
Subjt:  ASISGSDCCFMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQ

Query:  VDGMVAEFTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQ
        VDGMVAEFTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQ
Subjt:  VDGMVAEFTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQ

Query:  NRKSDFLEWLIIALIGAEILLSLYDIIHRSAANL
        NRKSDFLEWLIIALIGAEILLSLYDIIHRSAANL
Subjt:  NRKSDFLEWLIIALIGAEILLSLYDIIHRSAANL

A0A6J1CCC7 uncharacterized protein LOC111010276 | e value: 4.8e-217 | % identity: 90.63
Query:  MWRTIDAHLRSVRLLPNLPAHSSSSSASS-LFSSGRSFLARSNSTFLSPVPKPHSIALSKTLGFRATINSLSSVSCFGLGIQRFGGSSCGVLVLARCITS
        MWRTIDAHLRSVRL+P L A+SSSSS+SS LF++GRSFL RS+S+ LSPVP+ HSI L +TL  RA +N  SS  C GLGI+RFG SSCG++VLARCITS
Subjt:  MWRTIDAHLRSVRLLPNLPAHSSSSSASS-LFSSGRSFLARSNSTFLSPVPKPHSIALSKTLGFRATINSLSSVSCFGLGIQRFGGSSCGVLVLARCITS

Query:  SGHTLEWNEPVSCSEVGDGGFRSVAEGISDGEGDEVEEDSRPSIPVRAYFFSTSVDLRSLVDQNRRNFIPPSSRMTNYVVLKFGDLCNVNTHGASISGSD
        S HTLEWNEPVSCSEVGDGGFRS+ EG+SDGEGDEVEEDSRPSIPVRAYFFSTSVDLRSLVDQN+ NFIPPSSRMTNYVVLKFGDLCN NT  ASI+GSD
Subjt:  SGHTLEWNEPVSCSEVGDGGFRSVAEGISDGEGDEVEEDSRPSIPVRAYFFSTSVDLRSLVDQNRRNFIPPSSRMTNYVVLKFGDLCNVNTHGASISGSD

Query:  CCFMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQVDGMVAE
        CCFMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQVDGMVAE
Subjt:  CCFMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQVDGMVAE

Query:  FTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQNRKSDFL
        FTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQNRKSDFL
Subjt:  FTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQNRKSDFL

Query:  EWLIIALIGAEILLSLYDIIHRSAANL
        EWLIIALIGAEILLSLYDIIHRSAANL
Subjt:  EWLIIALIGAEILLSLYDIIHRSAANL

A0A6J1FTZ0 uncharacterized protein LOC111447285 | e value: 2.2e-209 | % identity: 87.33
Query:  MWRTIDAHLRSVRLLPNLPAHSSSSSASS--------LFSSGRSFLARSNSTFLSPVPKPHSIALSKTLGFRATINSLSSVSCFGLGIQRFGGSSCGVLV
        MWRTIDAHLRSVRLLP+L   SSSSS+SS        LF+SGRSF ARS+S+ LSPVPKPH I LSK L  RA  N LSSV CFGL   R GGSSCG +V
Subjt:  MWRTIDAHLRSVRLLPNLPAHSSSSSASS--------LFSSGRSFLARSNSTFLSPVPKPHSIALSKTLGFRATINSLSSVSCFGLGIQRFGGSSCGVLV

Query:  LARCITSSGHTLEWNEPVSCSEVGDGGFRSVAEGISDGEGDEVEEDSRPSIPVRAYFFSTSVDLRSLVDQNRRNFIPPSSRMTNYVVLKFGDLCNVNTHG
        LARCIT+S +TLEWNEPVSCSEVG+G FRS  +G SDGE DEV EDSRPSIPVRA+F STSVDLR LVDQN+ NFIPPSSRMTNYVVLKFGDLC+VN++G
Subjt:  LARCITSSGHTLEWNEPVSCSEVGDGGFRSVAEGISDGEGDEVEEDSRPSIPVRAYFFSTSVDLRSLVDQNRRNFIPPSSRMTNYVVLKFGDLCNVNTHG

Query:  ASISGSDCCFMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQ
        ASISGSDCCFMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVRE PAL+TWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQ
Subjt:  ASISGSDCCFMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQ

Query:  VDGMVAEFTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQ
        VDGMVAEFTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFERSDIAWK+AKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQ
Subjt:  VDGMVAEFTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQ

Query:  NRKSDFLEWLIIALIGAEILLSLYDIIHRSAANL
        NRKSDFLEWLIIALIGAEILLSLYDIIHRSAANL
Subjt:  NRKSDFLEWLIIALIGAEILLSLYDIIHRSAANL

A0A6J1GZZ1 uncharacterized protein LOC111458777 | e value: 3.4e-210 | % identity: 90.14
Query:  MWRTIDAHLRSVRLLPNLPAHSSSSSASSLFSSGRSFLARSNSTFLSPVPKPHSIALSKTLGFRATINSLSSVSCFGLGIQRFGGSSCGVLVLARCITSS
        MWR IDAHLRSVRLLPNL A+SSS   S LF SGRS LARS+STFLSPVPKPHSI LSKTL F  TIN LSSVSC G+GI+RFGGSSCGV+VLARCITSS
Subjt:  MWRTIDAHLRSVRLLPNLPAHSSSSSASSLFSSGRSFLARSNSTFLSPVPKPHSIALSKTLGFRATINSLSSVSCFGLGIQRFGGSSCGVLVLARCITSS

Query:  GHTLEWNEPVSCSEVGDGGFRSVAEGISDGEGDEVEEDSRPSIPVRAYFFSTSVDLRSLVDQNRRNFIPPSSRMTNYVVLKFGDLCNVNTHGASISGSDC
         HTLEWNEPVSCSEVG        EGI +GE DEVEEDSRPSIPVRAYF STSVDLRSLVDQN+RNFIPPSSRMTNYVVLKFGDLCNVNT GAS+SGSD 
Subjt:  GHTLEWNEPVSCSEVGDGGFRSVAEGISDGEGDEVEEDSRPSIPVRAYFFSTSVDLRSLVDQNRRNFIPPSSRMTNYVVLKFGDLCNVNTHGASISGSDC

Query:  CFMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQVDGMVAEF
         +MVVFQYGSIVLFN+RE EVDGYLKIVEKHASGLLPEMRKDEYEVREK ALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQVDGMVAEF
Subjt:  CFMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQVDGMVAEF

Query:  TDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQNRKSDFLE
        TDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQNRKSDFLE
Subjt:  TDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQNRKSDFLE

Query:  WLIIALIGAEILLSLYDIIHRSAANL
        WLIIALIGAEILLS+YDIIHRSAANL
Subjt:  WLIIALIGAEILLSLYDIIHRSAANL

SwissProt top hits
O74446 Sad1-interacting factor 2 | e value: 6.5e-09 | % identity: 21.1
Query:  FQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVRE-KPALNTWMEGGL--DYIMLQYLNIDGIR-TIGSVLGQSIALDYYGRQVDGMVAEFT
        F YG +VL+     E   +L+ + +     + +++ ++ EV E    + T  +  +  D+I L+  +   IR +I   + QS+ +  +   V+  +    
Subjt:  FQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVRE-KPALNTWMEGGL--DYIMLQYLNIDGIR-TIGSVLGQSIALDYYGRQVDGMVAEFT

Query:  DINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQNRKSDFLEW
        D  + +  TG+  +KR+++   VG+      ++ L+  + +  ++ W + +   I+   R   E+ QR A L+ +++ +   +  L+E + +   + LEW
Subjt:  DINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQNRKSDFLEW

Query:  LIIALIGAEILLSLYDII
        +++ L+G  +L++L+ I+
Subjt:  LIIALIGAEILLSLYDII

Q03441 Sporulation protein RMD1 | e value: 5.1e-14 | % identity: 22.95
Query:  DLCNVNTHGASISGSD-CCFMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPE--MRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIR-----T
        D+  ++  G  I  SD    + +F+YG +V++   E E   +L  +EK     L E  ++ +E+      +    +    D+I L+    DG       +
Subjt:  DLCNVNTHGASISGSD-CCFMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPE--MRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIR-----T

Query:  IGSVLGQSIALDYYGRQVDGMVAEFTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDF
        I   + QS+ +  +   VD  + +  DI +E+  +GK  M ++ + + +G+      ++ L   + +  +I W + +   I++  R   E+ QR + L+ 
Subjt:  IGSVLGQSIALDYYGRQVDGMVAEFTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDF

Query:  KLKFVEHNIRFLQEILQNRKSDFLEWLIIALIGAEILLSLYDII
        +L+ +   ++ L+E L +   ++LE+++I L+G E+L+S+ +I+
Subjt:  KLKFVEHNIRFLQEILQNRKSDFLEWLIIALIGAEILLSLYDII

Q05648 MIOREX complex component 10 | e value: 1.5e-05 | % identity: 20.66
Query:  VLKFGDLCNVNTHGASISGSDCCFMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNID----GIR
        V+  G   ++ + G S S    C + + +  S+   N  + E +  +  VE        ++   + +V  + A  +++ G  D I++  L+ D       
Subjt:  VLKFGDLCNVNTHGASISGSDCCFMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNID----GIR

Query:  TIGSVLGQSIALDYYGRQVDGMVAEFTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLD
           S L +S  L      ++  +++   I   +    K  ++     + +G+       + L   L E  D+ W + +  +I++ +    ++  R   L+
Subjt:  TIGSVLGQSIALDYYGRQVDGMVAEFTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLD

Query:  FKLKFVEHNIRFLQEILQNRKSDFLEWLIIALIGAEILLSLY
         KL +     R L  +L  R S FLEW+II LI  E+   +Y
Subjt:  FKLKFVEHNIRFLQEILQNRKSDFLEWLIIALIGAEILLSLY

Q9C565 Protein RETARDED ROOT GROWTH, mitochondrial | e value: 7.7e-95 | % identity: 59.86
Query:  GDEVEEDSRPSIPVRAYFFSTSVDLRSLVDQNRRNFIPPSSRMTNYVVLKFGDL--CNVNTHGASISGSDCCFMVVFQYGSIVLFNVREHEVDGYLKIVE
        G E EE +   IP++AYF STS+DL+++  +N  N +PP+SR TNY+ LKF D     + +     S S+C FMVVFQYGS +LFNV +++VD YL IV 
Subjt:  GDEVEEDSRPSIPVRAYFFSTSVDLRSLVDQNRRNFIPPSSRMTNYVVLKFGDL--CNVNTHGASISGSDCCFMVVFQYGSIVLFNVREHEVDGYLKIVE

Query:  KHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQVDGMVAEFTDINREMEATGKFKMKRKKLFQLVGKANS
        +HASGLL EMRKD+Y V+EKP L   M+GG DYI+L+ L+ + IR IGSVLGQSIALDY   QV+ +V EF DINR M  TG F M RKKLFQLVGKANS
Subjt:  KHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQVDGMVAEFTDINREMEATGKFKMKRKKLFQLVGKANS

Query:  NLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQNRKSDFLEWLIIALIGAEILLSLYDIIHRSA
        N+ADVILK+GLFERS+IAW++A+YAQI+EYLR+E+E++QRF  LD+KLKF+EHNI FLQE++QNR+SD LEW II L+  E  + +Y+I+  SA
Subjt:  NLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQNRKSDFLEWLIIALIGAEILLSLYDIIHRSA

Q9FNB2 Protein RETARDED ROOT GROWTH-LIKE | e value: 5.1e-123 | % identity: 75.42
Query:  VAEGISDGEGDEVEEDSRPSIPVRAYFFSTSVDLRSLVDQNRRNFIPPSSRMTNYVVLKFGDLCN-VNTHGASISGSDCCFMVVFQYGSIVLFNVREHEV
        V E IS G    +E++++ SIPVRAYFFSTSVDLRSL++QN++NFIPP+SRMTNYVVLKFG+  +  +T    ISGS+  +MVVF YGSIVLFNVREHEV
Subjt:  VAEGISDGEGDEVEEDSRPSIPVRAYFFSTSVDLRSLVDQNRRNFIPPSSRMTNYVVLKFGDLCN-VNTHGASISGSDCCFMVVFQYGSIVLFNVREHEV

Query:  DGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQVDGMVAEFTDINREMEATGKFKMKRKKLF
        D YLK+VE+HASGLLPEMRKDEYEVRE P L+TWME G D+I LQ+LN DGIRTIG VLGQSIALDYYGRQVDGMVAEFT+INR++E TG F MKRKKLF
Subjt:  DGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQVDGMVAEFTDINREMEATGKFKMKRKKLF

Query:  QLVGKANSNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQNRKSDFLEWLIIALIGAEILLSLYDIIHR
        QLVGKAN  LADVILKLGLFERSDIAWKDAKY QIWE+LRDEFELTQ FA+LD+KLKFVEHN+RFLQEILQNRKS  LEWLII LI  EI +S Y++   
Subjt:  QLVGKANSNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQNRKSDFLEWLIIALIGAEILLSLYDIIHR

Query:  S
        S
Subjt:  S

Arabidopsis top hits
AT1G69380.1 Protein of unknown function (DUF155) | e value: 5.5e-96 | % identity: 59.86
Query:  GDEVEEDSRPSIPVRAYFFSTSVDLRSLVDQNRRNFIPPSSRMTNYVVLKFGDL--CNVNTHGASISGSDCCFMVVFQYGSIVLFNVREHEVDGYLKIVE
        G E EE +   IP++AYF STS+DL+++  +N  N +PP+SR TNY+ LKF D     + +     S S+C FMVVFQYGS +LFNV +++VD YL IV 
Subjt:  GDEVEEDSRPSIPVRAYFFSTSVDLRSLVDQNRRNFIPPSSRMTNYVVLKFGDL--CNVNTHGASISGSDCCFMVVFQYGSIVLFNVREHEVDGYLKIVE

Query:  KHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQVDGMVAEFTDINREMEATGKFKMKRKKLFQLVGKANS
        +HASGLL EMRKD+Y V+EKP L   M+GG DYI+L+ L+ + IR IGSVLGQSIALDY   QV+ +V EF DINR M  TG F M RKKLFQLVGKANS
Subjt:  KHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQVDGMVAEFTDINREMEATGKFKMKRKKLFQLVGKANS

Query:  NLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQNRKSDFLEWLIIALIGAEILLSLYDIIHRSA
        N+ADVILK+GLFERS+IAW++A+YAQI+EYLR+E+E++QRF  LD+KLKF+EHNI FLQE++QNR+SD LEW II L+  E  + +Y+I+  SA
Subjt:  NLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQNRKSDFLEWLIIALIGAEILLSLYDIIHRSA

AT5G13610.1 Protein of unknown function (DUF155) | e value: 3.6e-124 | % identity: 75.42
Query:  VAEGISDGEGDEVEEDSRPSIPVRAYFFSTSVDLRSLVDQNRRNFIPPSSRMTNYVVLKFGDLCN-VNTHGASISGSDCCFMVVFQYGSIVLFNVREHEV
        V E IS G    +E++++ SIPVRAYFFSTSVDLRSL++QN++NFIPP+SRMTNYVVLKFG+  +  +T    ISGS+  +MVVF YGSIVLFNVREHEV
Subjt:  VAEGISDGEGDEVEEDSRPSIPVRAYFFSTSVDLRSLVDQNRRNFIPPSSRMTNYVVLKFGDLCN-VNTHGASISGSDCCFMVVFQYGSIVLFNVREHEV

Query:  DGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQVDGMVAEFTDINREMEATGKFKMKRKKLF
        D YLK+VE+HASGLLPEMRKDEYEVRE P L+TWME G D+I LQ+LN DGIRTIG VLGQSIALDYYGRQVDGMVAEFT+INR++E TG F MKRKKLF
Subjt:  DGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQVDGMVAEFTDINREMEATGKFKMKRKKLF

Query:  QLVGKANSNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQNRKSDFLEWLIIALIGAEILLSLYDIIHR
        QLVGKAN  LADVILKLGLFERSDIAWKDAKY QIWE+LRDEFELTQ FA+LD+KLKFVEHN+RFLQEILQNRKS  LEWLII LI  EI +S Y++   
Subjt:  QLVGKANSNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQNRKSDFLEWLIIALIGAEILLSLYDIIHR

Query:  S
        S
Subjt:  S


Sequences
CDS sequence
ATGTGGCGCACCATTGACGCCCATCTGAGGTCCGTACGCCTCCTTCCGAATCTCCCTGCTCATTCTTCTTCATCTTCAGCCTCTTCTCTCTTCTCCTCCGGCCGTTCATT
TCTTGCTCGCTCAAATTCGACTTTCCTCTCGCCTGTACCGAAGCCTCACTCAATTGCTCTCTCTAAAACCCTAGGATTTCGTGCTACCATTAATTCTTTGTCGAGTGTTT
CCTGTTTTGGCCTTGGAATTCAACGCTTCGGAGGATCGAGTTGCGGTGTGTTGGTGTTGGCGAGGTGCATTACCTCTTCAGGGCACACGTTGGAGTGGAATGAACCAGTG
TCGTGTTCAGAGGTTGGAGATGGTGGTTTTCGGAGTGTTGCTGAAGGAATTAGTGACGGTGAAGGGGATGAAGTCGAGGAGGATTCTAGACCGTCTATTCCTGTCAGAGC
TTATTTCTTCTCCACTAGTGTGGATTTGAGAAGCTTGGTGGATCAGAATAGACGTAACTTTATCCCGCCATCATCTCGTATGACGAATTATGTAGTCCTTAAGTTTGGGG
ATCTTTGTAATGTGAACACTCATGGCGCCAGCATAAGTGGAAGTGATTGCTGTTTCATGGTAGTTTTTCAGTATGGCTCTATTGTGCTGTTTAACGTTCGTGAACATGAG
GTTGATGGGTATTTGAAAATTGTAGAGAAACATGCATCGGGATTGCTGCCTGAAATGAGAAAGGATGAGTATGAGGTTAGAGAGAAGCCTGCCTTAAATACATGGATGGA
GGGGGGCTTGGACTACATAATGCTGCAGTACTTGAATATTGATGGCATACGTACCATAGGTAGTGTTCTTGGTCAGAGTATTGCTCTTGATTACTATGGGCGACAGGTTG
ATGGGATGGTTGCGGAATTCACTGACATCAATCGTGAAATGGAAGCAACTGGGAAGTTTAAAATGAAGAGGAAGAAACTTTTCCAGTTGGTGGGAAAGGCAAATTCTAAT
CTTGCCGATGTCATTCTCAAGCTTGGACTTTTTGAGAGATCTGACATCGCATGGAAGGATGCAAAATATGCTCAAATATGGGAATATCTCAGAGACGAGTTTGAGTTAAC
ACAGAGATTCGCAAGTCTGGACTTCAAATTGAAGTTTGTGGAGCATAATATTCGCTTCCTACAAGAGATTCTGCAAAACAGGAAATCAGATTTTTTGGAATGGCTGATCA
TTGCATTGATTGGTGCAGAAATTTTACTCTCACTTTACGACATTATTCATAGATCAGCAGCTAATCTTTAG
mRNA sequence
ATGTGGCGCACCATTGACGCCCATCTGAGGTCCGTACGCCTCCTTCCGAATCTCCCTGCTCATTCTTCTTCATCTTCAGCCTCTTCTCTCTTCTCCTCCGGCCGTTCATT
TCTTGCTCGCTCAAATTCGACTTTCCTCTCGCCTGTACCGAAGCCTCACTCAATTGCTCTCTCTAAAACCCTAGGATTTCGTGCTACCATTAATTCTTTGTCGAGTGTTT
CCTGTTTTGGCCTTGGAATTCAACGCTTCGGAGGATCGAGTTGCGGTGTGTTGGTGTTGGCGAGGTGCATTACCTCTTCAGGGCACACGTTGGAGTGGAATGAACCAGTG
TCGTGTTCAGAGGTTGGAGATGGTGGTTTTCGGAGTGTTGCTGAAGGAATTAGTGACGGTGAAGGGGATGAAGTCGAGGAGGATTCTAGACCGTCTATTCCTGTCAGAGC
TTATTTCTTCTCCACTAGTGTGGATTTGAGAAGCTTGGTGGATCAGAATAGACGTAACTTTATCCCGCCATCATCTCGTATGACGAATTATGTAGTCCTTAAGTTTGGGG
ATCTTTGTAATGTGAACACTCATGGCGCCAGCATAAGTGGAAGTGATTGCTGTTTCATGGTAGTTTTTCAGTATGGCTCTATTGTGCTGTTTAACGTTCGTGAACATGAG
GTTGATGGGTATTTGAAAATTGTAGAGAAACATGCATCGGGATTGCTGCCTGAAATGAGAAAGGATGAGTATGAGGTTAGAGAGAAGCCTGCCTTAAATACATGGATGGA
GGGGGGCTTGGACTACATAATGCTGCAGTACTTGAATATTGATGGCATACGTACCATAGGTAGTGTTCTTGGTCAGAGTATTGCTCTTGATTACTATGGGCGACAGGTTG
ATGGGATGGTTGCGGAATTCACTGACATCAATCGTGAAATGGAAGCAACTGGGAAGTTTAAAATGAAGAGGAAGAAACTTTTCCAGTTGGTGGGAAAGGCAAATTCTAAT
CTTGCCGATGTCATTCTCAAGCTTGGACTTTTTGAGAGATCTGACATCGCATGGAAGGATGCAAAATATGCTCAAATATGGGAATATCTCAGAGACGAGTTTGAGTTAAC
ACAGAGATTCGCAAGTCTGGACTTCAAATTGAAGTTTGTGGAGCATAATATTCGCTTCCTACAAGAGATTCTGCAAAACAGGAAATCAGATTTTTTGGAATGGCTGATCA
TTGCATTGATTGGTGCAGAAATTTTACTCTCACTTTACGACATTATTCATAGATCAGCAGCTAATCTTTAG
Protein sequence
MWRTIDAHLRSVRLLPNLPAHSSSSSASSLFSSGRSFLARSNSTFLSPVPKPHSIALSKTLGFRATINSLSSVSCFGLGIQRFGGSSCGVLVLARCITSSGHTLEWNEPV
SCSEVGDGGFRSVAEGISDGEGDEVEEDSRPSIPVRAYFFSTSVDLRSLVDQNRRNFIPPSSRMTNYVVLKFGDLCNVNTHGASISGSDCCFMVVFQYGSIVLFNVREHE
VDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQVDGMVAEFTDINREMEATGKFKMKRKKLFQLVGKANSN
LADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQNRKSDFLEWLIIALIGAEILLSLYDIIHRSAANL
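
A quick sanity check on a record like this is to confirm that the CDS length agrees with the protein length. The sketch below is illustrative only: the function name `cds_length_report` is hypothetical, it assumes the standard stop codons (TAA, TAG, TGA), and it checks lengths and frame without translating the sequence. The toy input is the first four codons of the CDS above followed by its stop codon, paired with the first four residues of the protein.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # assumption: standard genetic code


def cds_length_report(cds: str, protein: str) -> dict:
    """Compare a CDS with its protein by length and frame only (no translation)."""
    # Remove line breaks and spaces left over from wrapped sequence text.
    cds = "".join(cds.split()).upper()
    protein = "".join(protein.split()).upper()

    has_stop = len(cds) >= 3 and cds[-3:] in STOP_CODONS
    codons = len(cds) // 3
    expected_residues = codons - (1 if has_stop else 0)

    return {
        "in_frame": len(cds) % 3 == 0,
        "starts_with_atg": cds.startswith("ATG"),
        "ends_with_stop": has_stop,
        "expected_residues": expected_residues,
        "length_matches": expected_residues == len(protein),
    }


# Toy input: ATG TGG CGC ACC + TAG, paired with residues M W R T.
report = cds_length_report("ATGTGGCGCACCTAG", "MWRT")
print(report)
```

Passing the full CDS and protein strings from this record to the same function would run the check on the real data; a length match shows only that the two are consistent in size, not that the translation is correct.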