CuGenDBv2

Sgr025390 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr025390
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Protein of unknown function (DUF1666)
Genome location: tig00004836:2071222..2073873
RNA-Seq Expression: Sgr025390
Synteny: Sgr025390
Gene Ontology terms: NA
InterPro domains: IPR012870 - Protein of unknown function DUF1666
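
The genome location above follows a `contig:start..end` pattern. As a minimal sketch of how such a string could be split into its parts, the Python snippet below parses it and reports the span. The names `Location` and `parse_location` are hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which the page itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A contig name plus start and end coordinates (hypothetical helper type)."""
    contig: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'contig:start..end' string into a Location."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    contig, start, end = match.groups()
    return Location(contig, int(start), int(end))


loc = parse_location("tig00004836:2071222..2073873")
print(loc.contig, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption the gene spans 2652 bases on contig tig00004836.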


Homology
GenBank top hits (e value, % identity, alignment)
XP_022157097.1 uncharacterized protein LOC111023905 isoform X1 [Momordica charantia] | e value: 6.7e-178 | % identity: 76.33
Query:  KDDPAADPKYENGIDDESAKADKADYIAEAE------AEAEAEDDDEDFIMNEVKRRLKELRRNSFMVLIPEEDSCAEEEEEEEEGETSCGEREWRDVEA
        +D    DPK +NG  DES KA+K +   E E      AEAE EDD+EDFIMNEVKRRLKELRRN+FMVLIPEEDSCA  EEEEEEGETSCGE+EWRDVEA
Subjt:  KDDPAADPKYENGIDDESAKADKADYIAEAE------AEAEAEDDDEDFIMNEVKRRLKELRRNSFMVLIPEEDSCAEEEEEEEEGETSCGEREWRDVEA

Query:  EGRQWWGGFGAVYDKYCERMLFFDRMSMQPLIEAGNTQILFLINHPVGYFIEVRFVLHYITLAGSRTPSPSP--RSASKKTASPLRCLSLKRIEEPQDET
        EGR WWGGFG +YDKY ERMLF+DRMSMQPLIE                              GS TPS SP  RSASKKTASPLRCLSLKRIEEP+D+T
Subjt:  EGRQWWGGFGAVYDKYCERMLFFDRMSMQPLIEAGNTQILFLINHPVGYFIEVRFVLHYITLAGSRTPSPSP--RSASKKTASPLRCLSLKRIEEPQDET

Query:  EHLHP--PFTDSYHDIETAYVAHICLSWEALHCQYTQLNHLISCQSQNPTHYNLSAQQFQQFQVLLQRFIENEPFEQGPRPGIYARTRRSFPKMLQVPNI
        EHLHP  PFTDS H IETAYVAHICLSWEALHCQYTQLNHLISCQ Q P+HYNL+AQQFQQFQVLLQRFIE+EPF+QGPRPGIYARTRRSFPK+LQVPNI
Subjt:  EHLHP--PFTDSYHDIETAYVAHICLSWEALHCQYTQLNHLISCQSQNPTHYNLSAQQFQQFQVLLQRFIENEPFEQGPRPGIYARTRRSFPKMLQVPNI

Query:  QGSDPNRTQEEESDLVILAPELIVIIEASIFTFHRFLKMDKKISNCGSLSFGGNQSQDATLLARIRSSLDKKKTKLKELRKKSRRWKQKTWPQTYEDMQL
        QGSDP   QEEESDLV+LAP+LI+IIEA+IFTFHRFLKMDKK SN G     GN +QD TLLA IRSSLDKKKTKLKE+RKKSRR KQKTWPQTYEDMQL
Subjt:  QGSDPNRTQEEESDLVILAPELIVIIEASIFTFHRFLKMDKKISNCGSLSFGGNQSQDATLLARIRSSLDKKKTKLKELRKKSRRWKQKTWPQTYEDMQL

Query:  LFGVVDIKIISRLVKMSRITKEQLLWCEEKMKKLDVSDGKLQRDPSPLLFPC
        L GVVDIKIISRLVKMSRITKEQL+WCEEKM KLD+S+G+LQRDPSPLLFPC
Subjt:  LFGVVDIKIISRLVKMSRITKEQLLWCEEKMKKLDVSDGKLQRDPSPLLFPC

XP_022157098.1 uncharacterized protein LOC111023905 isoform X2 [Momordica charantia] | e value: 4.8e-176 | % identity: 76.11
Query:  KDDPAADPKYENGIDDESAKADKADYIAEAE------AEAEAEDDDEDFIMNEVKRRLKELRRNSFMVLIPEEDSCAEEEEEEEEGETSCGEREWRDVEA
        +D    DPK +NG  DES KA+K +   E E      AEAE EDD+EDFIMNEVKRRLKELRRN+FMVLIPEEDSCA  EEEEEEGETSCGE+EWRDVEA
Subjt:  KDDPAADPKYENGIDDESAKADKADYIAEAE------AEAEAEDDDEDFIMNEVKRRLKELRRNSFMVLIPEEDSCAEEEEEEEEGETSCGEREWRDVEA

Query:  EGRQWWGGFGAVYDKYCERMLFFDRMSMQPLIEAGNTQILFLINHPVGYFIEVRFVLHYITLAGSRTPSPSP--RSASKKTASPLRCLSLKRIEEPQDET
        EGR WWGGFG +YDKY ERMLF+DRMSMQPLIE                              GS TPS SP  RSASKKTASPLRCLSLKRIEEP+D+T
Subjt:  EGRQWWGGFGAVYDKYCERMLFFDRMSMQPLIEAGNTQILFLINHPVGYFIEVRFVLHYITLAGSRTPSPSP--RSASKKTASPLRCLSLKRIEEPQDET

Query:  EHLHP--PFTDSYHDIETAYVAHICLSWEALHCQYTQLNHLISCQSQNPTHYNLSAQQFQQFQVLLQRFIENEPFEQGPRPGIYARTRRSFPKMLQVPNI
        EHLHP  PFTDS H IETAYVAHICLSWEALHCQYTQLNHLISCQ Q P+HYNL+AQQFQQFQVLLQRFIE+EPF+QGPRPGIYARTRRSFPK+LQVPNI
Subjt:  EHLHP--PFTDSYHDIETAYVAHICLSWEALHCQYTQLNHLISCQSQNPTHYNLSAQQFQQFQVLLQRFIENEPFEQGPRPGIYARTRRSFPKMLQVPNI

Query:  QGSDPNRTQEEESDLVILAPELIVIIEASIFTFHRFLKMDKKISNCGSLSFGGNQSQDATLLARIRSSLDKKKTKLKELRKKSRRWKQKTWPQTYEDMQL
        QGSDP   QEEESDLV+LAP+LI+IIEA+IFTFHRFLKMDKK SN G     GN +QD TLLA IRSSLD KKTKLKE+RKKSRR KQKTWPQTYEDMQL
Subjt:  QGSDPNRTQEEESDLVILAPELIVIIEASIFTFHRFLKMDKKISNCGSLSFGGNQSQDATLLARIRSSLDKKKTKLKELRKKSRRWKQKTWPQTYEDMQL

Query:  LFGVVDIKIISRLVKMSRITKEQLLWCEEKMKKLDVSDGKLQRDPSPLLFPC
        L GVVDIKIISRLVKMSRITKEQL+WCEEKM KLD+S+G+LQRDPSPLLFPC
Subjt:  LFGVVDIKIISRLVKMSRITKEQLLWCEEKMKKLDVSDGKLQRDPSPLLFPC

XP_022157099.1 uncharacterized protein LOC111023905 isoform X3 [Momordica charantia] | e value: 6.7e-178 | % identity: 76.33
Query:  KDDPAADPKYENGIDDESAKADKADYIAEAE------AEAEAEDDDEDFIMNEVKRRLKELRRNSFMVLIPEEDSCAEEEEEEEEGETSCGEREWRDVEA
        +D    DPK +NG  DES KA+K +   E E      AEAE EDD+EDFIMNEVKRRLKELRRN+FMVLIPEEDSCA  EEEEEEGETSCGE+EWRDVEA
Subjt:  KDDPAADPKYENGIDDESAKADKADYIAEAE------AEAEAEDDDEDFIMNEVKRRLKELRRNSFMVLIPEEDSCAEEEEEEEEGETSCGEREWRDVEA

Query:  EGRQWWGGFGAVYDKYCERMLFFDRMSMQPLIEAGNTQILFLINHPVGYFIEVRFVLHYITLAGSRTPSPSP--RSASKKTASPLRCLSLKRIEEPQDET
        EGR WWGGFG +YDKY ERMLF+DRMSMQPLIE                              GS TPS SP  RSASKKTASPLRCLSLKRIEEP+D+T
Subjt:  EGRQWWGGFGAVYDKYCERMLFFDRMSMQPLIEAGNTQILFLINHPVGYFIEVRFVLHYITLAGSRTPSPSP--RSASKKTASPLRCLSLKRIEEPQDET

Query:  EHLHP--PFTDSYHDIETAYVAHICLSWEALHCQYTQLNHLISCQSQNPTHYNLSAQQFQQFQVLLQRFIENEPFEQGPRPGIYARTRRSFPKMLQVPNI
        EHLHP  PFTDS H IETAYVAHICLSWEALHCQYTQLNHLISCQ Q P+HYNL+AQQFQQFQVLLQRFIE+EPF+QGPRPGIYARTRRSFPK+LQVPNI
Subjt:  EHLHP--PFTDSYHDIETAYVAHICLSWEALHCQYTQLNHLISCQSQNPTHYNLSAQQFQQFQVLLQRFIENEPFEQGPRPGIYARTRRSFPKMLQVPNI

Query:  QGSDPNRTQEEESDLVILAPELIVIIEASIFTFHRFLKMDKKISNCGSLSFGGNQSQDATLLARIRSSLDKKKTKLKELRKKSRRWKQKTWPQTYEDMQL
        QGSDP   QEEESDLV+LAP+LI+IIEA+IFTFHRFLKMDKK SN G     GN +QD TLLA IRSSLDKKKTKLKE+RKKSRR KQKTWPQTYEDMQL
Subjt:  QGSDPNRTQEEESDLVILAPELIVIIEASIFTFHRFLKMDKKISNCGSLSFGGNQSQDATLLARIRSSLDKKKTKLKELRKKSRRWKQKTWPQTYEDMQL

Query:  LFGVVDIKIISRLVKMSRITKEQLLWCEEKMKKLDVSDGKLQRDPSPLLFPC
        L GVVDIKIISRLVKMSRITKEQL+WCEEKM KLD+S+G+LQRDPSPLLFPC
Subjt:  LFGVVDIKIISRLVKMSRITKEQLLWCEEKMKKLDVSDGKLQRDPSPLLFPC

XP_022157100.1 uncharacterized protein LOC111023905 isoform X4 [Momordica charantia] | e value: 6.7e-178 | % identity: 76.33
Query:  KDDPAADPKYENGIDDESAKADKADYIAEAE------AEAEAEDDDEDFIMNEVKRRLKELRRNSFMVLIPEEDSCAEEEEEEEEGETSCGEREWRDVEA
        +D    DPK +NG  DES KA+K +   E E      AEAE EDD+EDFIMNEVKRRLKELRRN+FMVLIPEEDSCA  EEEEEEGETSCGE+EWRDVEA
Subjt:  KDDPAADPKYENGIDDESAKADKADYIAEAE------AEAEAEDDDEDFIMNEVKRRLKELRRNSFMVLIPEEDSCAEEEEEEEEGETSCGEREWRDVEA

Query:  EGRQWWGGFGAVYDKYCERMLFFDRMSMQPLIEAGNTQILFLINHPVGYFIEVRFVLHYITLAGSRTPSPSP--RSASKKTASPLRCLSLKRIEEPQDET
        EGR WWGGFG +YDKY ERMLF+DRMSMQPLIE                              GS TPS SP  RSASKKTASPLRCLSLKRIEEP+D+T
Subjt:  EGRQWWGGFGAVYDKYCERMLFFDRMSMQPLIEAGNTQILFLINHPVGYFIEVRFVLHYITLAGSRTPSPSP--RSASKKTASPLRCLSLKRIEEPQDET

Query:  EHLHP--PFTDSYHDIETAYVAHICLSWEALHCQYTQLNHLISCQSQNPTHYNLSAQQFQQFQVLLQRFIENEPFEQGPRPGIYARTRRSFPKMLQVPNI
        EHLHP  PFTDS H IETAYVAHICLSWEALHCQYTQLNHLISCQ Q P+HYNL+AQQFQQFQVLLQRFIE+EPF+QGPRPGIYARTRRSFPK+LQVPNI
Subjt:  EHLHP--PFTDSYHDIETAYVAHICLSWEALHCQYTQLNHLISCQSQNPTHYNLSAQQFQQFQVLLQRFIENEPFEQGPRPGIYARTRRSFPKMLQVPNI

Query:  QGSDPNRTQEEESDLVILAPELIVIIEASIFTFHRFLKMDKKISNCGSLSFGGNQSQDATLLARIRSSLDKKKTKLKELRKKSRRWKQKTWPQTYEDMQL
        QGSDP   QEEESDLV+LAP+LI+IIEA+IFTFHRFLKMDKK SN G     GN +QD TLLA IRSSLDKKKTKLKE+RKKSRR KQKTWPQTYEDMQL
Subjt:  QGSDPNRTQEEESDLVILAPELIVIIEASIFTFHRFLKMDKKISNCGSLSFGGNQSQDATLLARIRSSLDKKKTKLKELRKKSRRWKQKTWPQTYEDMQL

Query:  LFGVVDIKIISRLVKMSRITKEQLLWCEEKMKKLDVSDGKLQRDPSPLLFPC
        L GVVDIKIISRLVKMSRITKEQL+WCEEKM KLD+S+G+LQRDPSPLLFPC
Subjt:  LFGVVDIKIISRLVKMSRITKEQLLWCEEKMKKLDVSDGKLQRDPSPLLFPC

XP_022157101.1 uncharacterized protein LOC111023905 isoform X5 [Momordica charantia] | e value: 6.7e-178 | % identity: 76.33
Query:  KDDPAADPKYENGIDDESAKADKADYIAEAE------AEAEAEDDDEDFIMNEVKRRLKELRRNSFMVLIPEEDSCAEEEEEEEEGETSCGEREWRDVEA
        +D    DPK +NG  DES KA+K +   E E      AEAE EDD+EDFIMNEVKRRLKELRRN+FMVLIPEEDSCA  EEEEEEGETSCGE+EWRDVEA
Subjt:  KDDPAADPKYENGIDDESAKADKADYIAEAE------AEAEAEDDDEDFIMNEVKRRLKELRRNSFMVLIPEEDSCAEEEEEEEEGETSCGEREWRDVEA

Query:  EGRQWWGGFGAVYDKYCERMLFFDRMSMQPLIEAGNTQILFLINHPVGYFIEVRFVLHYITLAGSRTPSPSP--RSASKKTASPLRCLSLKRIEEPQDET
        EGR WWGGFG +YDKY ERMLF+DRMSMQPLIE                              GS TPS SP  RSASKKTASPLRCLSLKRIEEP+D+T
Subjt:  EGRQWWGGFGAVYDKYCERMLFFDRMSMQPLIEAGNTQILFLINHPVGYFIEVRFVLHYITLAGSRTPSPSP--RSASKKTASPLRCLSLKRIEEPQDET

Query:  EHLHP--PFTDSYHDIETAYVAHICLSWEALHCQYTQLNHLISCQSQNPTHYNLSAQQFQQFQVLLQRFIENEPFEQGPRPGIYARTRRSFPKMLQVPNI
        EHLHP  PFTDS H IETAYVAHICLSWEALHCQYTQLNHLISCQ Q P+HYNL+AQQFQQFQVLLQRFIE+EPF+QGPRPGIYARTRRSFPK+LQVPNI
Subjt:  EHLHP--PFTDSYHDIETAYVAHICLSWEALHCQYTQLNHLISCQSQNPTHYNLSAQQFQQFQVLLQRFIENEPFEQGPRPGIYARTRRSFPKMLQVPNI

Query:  QGSDPNRTQEEESDLVILAPELIVIIEASIFTFHRFLKMDKKISNCGSLSFGGNQSQDATLLARIRSSLDKKKTKLKELRKKSRRWKQKTWPQTYEDMQL
        QGSDP   QEEESDLV+LAP+LI+IIEA+IFTFHRFLKMDKK SN G     GN +QD TLLA IRSSLDKKKTKLKE+RKKSRR KQKTWPQTYEDMQL
Subjt:  QGSDPNRTQEEESDLVILAPELIVIIEASIFTFHRFLKMDKKISNCGSLSFGGNQSQDATLLARIRSSLDKKKTKLKELRKKSRRWKQKTWPQTYEDMQL

Query:  LFGVVDIKIISRLVKMSRITKEQLLWCEEKMKKLDVSDGKLQRDPSPLLFPC
        L GVVDIKIISRLVKMSRITKEQL+WCEEKM KLD+S+G+LQRDPSPLLFPC
Subjt:  LFGVVDIKIISRLVKMSRITKEQLLWCEEKMKKLDVSDGKLQRDPSPLLFPC
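
In the standard BLAST pairwise layout used for these alignments, the unlabeled middle line carries a letter where the two residues are identical, a `+` where the substitution scores as conservative, and a space otherwise. The sketch below counts those positions for the final alignment block of the hit above. The function name `midline_stats` is hypothetical, and a single block's figures are not expected to equal the hit-wide % identity, which is computed over the whole alignment.

```python
def midline_stats(midline: str) -> dict:
    """Count identities and positives in one BLAST midline string."""
    identical = sum(ch.isalpha() for ch in midline)  # letters mark identical residues
    conservative = midline.count("+")                # '+' marks conservative substitutions
    length = len(midline)
    return {
        "length": length,
        "identical": identical,
        "positives": identical + conservative,
        "percent_identity": round(100.0 * identical / length, 2),
    }


# Midline of the last alignment block shown above (52 aligned columns).
midline = "L GVVDIKIISRLVKMSRITKEQL+WCEEKM KLD+S+G+LQRDPSPLLFPC"
stats = midline_stats(midline)
print(stats)
```

For this block the count is 46 identical columns out of 52, with 50 positives.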

TrEMBL top hits (e value, % identity, alignment)
A0A6J1DS57 uncharacterized protein LOC111023905 isoform X4 | e value: 3.3e-178 | % identity: 76.33
Query:  KDDPAADPKYENGIDDESAKADKADYIAEAE------AEAEAEDDDEDFIMNEVKRRLKELRRNSFMVLIPEEDSCAEEEEEEEEGETSCGEREWRDVEA
        +D    DPK +NG  DES KA+K +   E E      AEAE EDD+EDFIMNEVKRRLKELRRN+FMVLIPEEDSCA  EEEEEEGETSCGE+EWRDVEA
Subjt:  KDDPAADPKYENGIDDESAKADKADYIAEAE------AEAEAEDDDEDFIMNEVKRRLKELRRNSFMVLIPEEDSCAEEEEEEEEGETSCGEREWRDVEA

Query:  EGRQWWGGFGAVYDKYCERMLFFDRMSMQPLIEAGNTQILFLINHPVGYFIEVRFVLHYITLAGSRTPSPSP--RSASKKTASPLRCLSLKRIEEPQDET
        EGR WWGGFG +YDKY ERMLF+DRMSMQPLIE                              GS TPS SP  RSASKKTASPLRCLSLKRIEEP+D+T
Subjt:  EGRQWWGGFGAVYDKYCERMLFFDRMSMQPLIEAGNTQILFLINHPVGYFIEVRFVLHYITLAGSRTPSPSP--RSASKKTASPLRCLSLKRIEEPQDET

Query:  EHLHP--PFTDSYHDIETAYVAHICLSWEALHCQYTQLNHLISCQSQNPTHYNLSAQQFQQFQVLLQRFIENEPFEQGPRPGIYARTRRSFPKMLQVPNI
        EHLHP  PFTDS H IETAYVAHICLSWEALHCQYTQLNHLISCQ Q P+HYNL+AQQFQQFQVLLQRFIE+EPF+QGPRPGIYARTRRSFPK+LQVPNI
Subjt:  EHLHP--PFTDSYHDIETAYVAHICLSWEALHCQYTQLNHLISCQSQNPTHYNLSAQQFQQFQVLLQRFIENEPFEQGPRPGIYARTRRSFPKMLQVPNI

Query:  QGSDPNRTQEEESDLVILAPELIVIIEASIFTFHRFLKMDKKISNCGSLSFGGNQSQDATLLARIRSSLDKKKTKLKELRKKSRRWKQKTWPQTYEDMQL
        QGSDP   QEEESDLV+LAP+LI+IIEA+IFTFHRFLKMDKK SN G     GN +QD TLLA IRSSLDKKKTKLKE+RKKSRR KQKTWPQTYEDMQL
Subjt:  QGSDPNRTQEEESDLVILAPELIVIIEASIFTFHRFLKMDKKISNCGSLSFGGNQSQDATLLARIRSSLDKKKTKLKELRKKSRRWKQKTWPQTYEDMQL

Query:  LFGVVDIKIISRLVKMSRITKEQLLWCEEKMKKLDVSDGKLQRDPSPLLFPC
        L GVVDIKIISRLVKMSRITKEQL+WCEEKM KLD+S+G+LQRDPSPLLFPC
Subjt:  LFGVVDIKIISRLVKMSRITKEQLLWCEEKMKKLDVSDGKLQRDPSPLLFPC

A0A6J1DSH8 uncharacterized protein LOC111023905 isoform X2 | e value: 2.3e-176 | % identity: 76.11
Query:  KDDPAADPKYENGIDDESAKADKADYIAEAE------AEAEAEDDDEDFIMNEVKRRLKELRRNSFMVLIPEEDSCAEEEEEEEEGETSCGEREWRDVEA
        +D    DPK +NG  DES KA+K +   E E      AEAE EDD+EDFIMNEVKRRLKELRRN+FMVLIPEEDSCA  EEEEEEGETSCGE+EWRDVEA
Subjt:  KDDPAADPKYENGIDDESAKADKADYIAEAE------AEAEAEDDDEDFIMNEVKRRLKELRRNSFMVLIPEEDSCAEEEEEEEEGETSCGEREWRDVEA

Query:  EGRQWWGGFGAVYDKYCERMLFFDRMSMQPLIEAGNTQILFLINHPVGYFIEVRFVLHYITLAGSRTPSPSP--RSASKKTASPLRCLSLKRIEEPQDET
        EGR WWGGFG +YDKY ERMLF+DRMSMQPLIE                              GS TPS SP  RSASKKTASPLRCLSLKRIEEP+D+T
Subjt:  EGRQWWGGFGAVYDKYCERMLFFDRMSMQPLIEAGNTQILFLINHPVGYFIEVRFVLHYITLAGSRTPSPSP--RSASKKTASPLRCLSLKRIEEPQDET

Query:  EHLHP--PFTDSYHDIETAYVAHICLSWEALHCQYTQLNHLISCQSQNPTHYNLSAQQFQQFQVLLQRFIENEPFEQGPRPGIYARTRRSFPKMLQVPNI
        EHLHP  PFTDS H IETAYVAHICLSWEALHCQYTQLNHLISCQ Q P+HYNL+AQQFQQFQVLLQRFIE+EPF+QGPRPGIYARTRRSFPK+LQVPNI
Subjt:  EHLHP--PFTDSYHDIETAYVAHICLSWEALHCQYTQLNHLISCQSQNPTHYNLSAQQFQQFQVLLQRFIENEPFEQGPRPGIYARTRRSFPKMLQVPNI

Query:  QGSDPNRTQEEESDLVILAPELIVIIEASIFTFHRFLKMDKKISNCGSLSFGGNQSQDATLLARIRSSLDKKKTKLKELRKKSRRWKQKTWPQTYEDMQL
        QGSDP   QEEESDLV+LAP+LI+IIEA+IFTFHRFLKMDKK SN G     GN +QD TLLA IRSSLD KKTKLKE+RKKSRR KQKTWPQTYEDMQL
Subjt:  QGSDPNRTQEEESDLVILAPELIVIIEASIFTFHRFLKMDKKISNCGSLSFGGNQSQDATLLARIRSSLDKKKTKLKELRKKSRRWKQKTWPQTYEDMQL

Query:  LFGVVDIKIISRLVKMSRITKEQLLWCEEKMKKLDVSDGKLQRDPSPLLFPC
        L GVVDIKIISRLVKMSRITKEQL+WCEEKM KLD+S+G+LQRDPSPLLFPC
Subjt:  LFGVVDIKIISRLVKMSRITKEQLLWCEEKMKKLDVSDGKLQRDPSPLLFPC

A0A6J1DTN2 uncharacterized protein LOC111023905 isoform X1 | e value: 3.3e-178 | % identity: 76.33
Query:  KDDPAADPKYENGIDDESAKADKADYIAEAE------AEAEAEDDDEDFIMNEVKRRLKELRRNSFMVLIPEEDSCAEEEEEEEEGETSCGEREWRDVEA
        +D    DPK +NG  DES KA+K +   E E      AEAE EDD+EDFIMNEVKRRLKELRRN+FMVLIPEEDSCA  EEEEEEGETSCGE+EWRDVEA
Subjt:  KDDPAADPKYENGIDDESAKADKADYIAEAE------AEAEAEDDDEDFIMNEVKRRLKELRRNSFMVLIPEEDSCAEEEEEEEEGETSCGEREWRDVEA

Query:  EGRQWWGGFGAVYDKYCERMLFFDRMSMQPLIEAGNTQILFLINHPVGYFIEVRFVLHYITLAGSRTPSPSP--RSASKKTASPLRCLSLKRIEEPQDET
        EGR WWGGFG +YDKY ERMLF+DRMSMQPLIE                              GS TPS SP  RSASKKTASPLRCLSLKRIEEP+D+T
Subjt:  EGRQWWGGFGAVYDKYCERMLFFDRMSMQPLIEAGNTQILFLINHPVGYFIEVRFVLHYITLAGSRTPSPSP--RSASKKTASPLRCLSLKRIEEPQDET

Query:  EHLHP--PFTDSYHDIETAYVAHICLSWEALHCQYTQLNHLISCQSQNPTHYNLSAQQFQQFQVLLQRFIENEPFEQGPRPGIYARTRRSFPKMLQVPNI
        EHLHP  PFTDS H IETAYVAHICLSWEALHCQYTQLNHLISCQ Q P+HYNL+AQQFQQFQVLLQRFIE+EPF+QGPRPGIYARTRRSFPK+LQVPNI
Subjt:  EHLHP--PFTDSYHDIETAYVAHICLSWEALHCQYTQLNHLISCQSQNPTHYNLSAQQFQQFQVLLQRFIENEPFEQGPRPGIYARTRRSFPKMLQVPNI

Query:  QGSDPNRTQEEESDLVILAPELIVIIEASIFTFHRFLKMDKKISNCGSLSFGGNQSQDATLLARIRSSLDKKKTKLKELRKKSRRWKQKTWPQTYEDMQL
        QGSDP   QEEESDLV+LAP+LI+IIEA+IFTFHRFLKMDKK SN G     GN +QD TLLA IRSSLDKKKTKLKE+RKKSRR KQKTWPQTYEDMQL
Subjt:  QGSDPNRTQEEESDLVILAPELIVIIEASIFTFHRFLKMDKKISNCGSLSFGGNQSQDATLLARIRSSLDKKKTKLKELRKKSRRWKQKTWPQTYEDMQL

Query:  LFGVVDIKIISRLVKMSRITKEQLLWCEEKMKKLDVSDGKLQRDPSPLLFPC
        L GVVDIKIISRLVKMSRITKEQL+WCEEKM KLD+S+G+LQRDPSPLLFPC
Subjt:  LFGVVDIKIISRLVKMSRITKEQLLWCEEKMKKLDVSDGKLQRDPSPLLFPC

A0A6J1DVJ0 uncharacterized protein LOC111023905 isoform X3 | e value: 3.3e-178 | % identity: 76.33
Query:  KDDPAADPKYENGIDDESAKADKADYIAEAE------AEAEAEDDDEDFIMNEVKRRLKELRRNSFMVLIPEEDSCAEEEEEEEEGETSCGEREWRDVEA
        +D    DPK +NG  DES KA+K +   E E      AEAE EDD+EDFIMNEVKRRLKELRRN+FMVLIPEEDSCA  EEEEEEGETSCGE+EWRDVEA
Subjt:  KDDPAADPKYENGIDDESAKADKADYIAEAE------AEAEAEDDDEDFIMNEVKRRLKELRRNSFMVLIPEEDSCAEEEEEEEEGETSCGEREWRDVEA

Query:  EGRQWWGGFGAVYDKYCERMLFFDRMSMQPLIEAGNTQILFLINHPVGYFIEVRFVLHYITLAGSRTPSPSP--RSASKKTASPLRCLSLKRIEEPQDET
        EGR WWGGFG +YDKY ERMLF+DRMSMQPLIE                              GS TPS SP  RSASKKTASPLRCLSLKRIEEP+D+T
Subjt:  EGRQWWGGFGAVYDKYCERMLFFDRMSMQPLIEAGNTQILFLINHPVGYFIEVRFVLHYITLAGSRTPSPSP--RSASKKTASPLRCLSLKRIEEPQDET

Query:  EHLHP--PFTDSYHDIETAYVAHICLSWEALHCQYTQLNHLISCQSQNPTHYNLSAQQFQQFQVLLQRFIENEPFEQGPRPGIYARTRRSFPKMLQVPNI
        EHLHP  PFTDS H IETAYVAHICLSWEALHCQYTQLNHLISCQ Q P+HYNL+AQQFQQFQVLLQRFIE+EPF+QGPRPGIYARTRRSFPK+LQVPNI
Subjt:  EHLHP--PFTDSYHDIETAYVAHICLSWEALHCQYTQLNHLISCQSQNPTHYNLSAQQFQQFQVLLQRFIENEPFEQGPRPGIYARTRRSFPKMLQVPNI

Query:  QGSDPNRTQEEESDLVILAPELIVIIEASIFTFHRFLKMDKKISNCGSLSFGGNQSQDATLLARIRSSLDKKKTKLKELRKKSRRWKQKTWPQTYEDMQL
        QGSDP   QEEESDLV+LAP+LI+IIEA+IFTFHRFLKMDKK SN G     GN +QD TLLA IRSSLDKKKTKLKE+RKKSRR KQKTWPQTYEDMQL
Subjt:  QGSDPNRTQEEESDLVILAPELIVIIEASIFTFHRFLKMDKKISNCGSLSFGGNQSQDATLLARIRSSLDKKKTKLKELRKKSRRWKQKTWPQTYEDMQL

Query:  LFGVVDIKIISRLVKMSRITKEQLLWCEEKMKKLDVSDGKLQRDPSPLLFPC
        L GVVDIKIISRLVKMSRITKEQL+WCEEKM KLD+S+G+LQRDPSPLLFPC
Subjt:  LFGVVDIKIISRLVKMSRITKEQLLWCEEKMKKLDVSDGKLQRDPSPLLFPC

A0A6J1DWZ2 uncharacterized protein LOC111023905 isoform X5 | e value: 3.3e-178 | % identity: 76.33
Query:  KDDPAADPKYENGIDDESAKADKADYIAEAE------AEAEAEDDDEDFIMNEVKRRLKELRRNSFMVLIPEEDSCAEEEEEEEEGETSCGEREWRDVEA
        +D    DPK +NG  DES KA+K +   E E      AEAE EDD+EDFIMNEVKRRLKELRRN+FMVLIPEEDSCA  EEEEEEGETSCGE+EWRDVEA
Subjt:  KDDPAADPKYENGIDDESAKADKADYIAEAE------AEAEAEDDDEDFIMNEVKRRLKELRRNSFMVLIPEEDSCAEEEEEEEEGETSCGEREWRDVEA

Query:  EGRQWWGGFGAVYDKYCERMLFFDRMSMQPLIEAGNTQILFLINHPVGYFIEVRFVLHYITLAGSRTPSPSP--RSASKKTASPLRCLSLKRIEEPQDET
        EGR WWGGFG +YDKY ERMLF+DRMSMQPLIE                              GS TPS SP  RSASKKTASPLRCLSLKRIEEP+D+T
Subjt:  EGRQWWGGFGAVYDKYCERMLFFDRMSMQPLIEAGNTQILFLINHPVGYFIEVRFVLHYITLAGSRTPSPSP--RSASKKTASPLRCLSLKRIEEPQDET

Query:  EHLHP--PFTDSYHDIETAYVAHICLSWEALHCQYTQLNHLISCQSQNPTHYNLSAQQFQQFQVLLQRFIENEPFEQGPRPGIYARTRRSFPKMLQVPNI
        EHLHP  PFTDS H IETAYVAHICLSWEALHCQYTQLNHLISCQ Q P+HYNL+AQQFQQFQVLLQRFIE+EPF+QGPRPGIYARTRRSFPK+LQVPNI
Subjt:  EHLHP--PFTDSYHDIETAYVAHICLSWEALHCQYTQLNHLISCQSQNPTHYNLSAQQFQQFQVLLQRFIENEPFEQGPRPGIYARTRRSFPKMLQVPNI

Query:  QGSDPNRTQEEESDLVILAPELIVIIEASIFTFHRFLKMDKKISNCGSLSFGGNQSQDATLLARIRSSLDKKKTKLKELRKKSRRWKQKTWPQTYEDMQL
        QGSDP   QEEESDLV+LAP+LI+IIEA+IFTFHRFLKMDKK SN G     GN +QD TLLA IRSSLDKKKTKLKE+RKKSRR KQKTWPQTYEDMQL
Subjt:  QGSDPNRTQEEESDLVILAPELIVIIEASIFTFHRFLKMDKKISNCGSLSFGGNQSQDATLLARIRSSLDKKKTKLKELRKKSRRWKQKTWPQTYEDMQL

Query:  LFGVVDIKIISRLVKMSRITKEQLLWCEEKMKKLDVSDGKLQRDPSPLLFPC
        L GVVDIKIISRLVKMSRITKEQL+WCEEKM KLD+S+G+LQRDPSPLLFPC
Subjt:  LFGVVDIKIISRLVKMSRITKEQLLWCEEKMKKLDVSDGKLQRDPSPLLFPC

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G69610.1 Protein of unknown function (DUF1666) | e value: 3.5e-31 | % identity: 32.79
Query:  LHYITLAGSRTPSPSPRSASKKTASPL--RCLSLKRIEEPQDETEHLHPPFTDSYHDIETAYVAHICLSWEALHCQYTQLNHLISCQSQNPTH-YNLSAQ
        +H I+L   +  S   R+  K   S L       K+     D +E L     ++  D ET YV  +CLSWE L  QY   + ++   SQ  T+ YNL A 
Subjt:  LHYITLAGSRTPSPSPRSASKKTASPL--RCLSLKRIEEPQDETEHLHPPFTDSYHDIETAYVAHICLSWEALHCQYTQLNHLISCQSQNPTH-YNLSAQ

Query:  QFQQFQVLLQRFIENEPFEQGPRPGIYARTRRSFPKMLQVPNIQGSDPNRTQ-EEESDLVILAPELIVIIEASIFTFHRFLKMDK-------KISNCGSL
        +FQ FQVLLQRF+ENEPF+   R   Y + RR F   LQ+P ++    ++ +   E +  +    L  II  S+  F  FL  DK       K+S+   +
Subjt:  QFQQFQVLLQRFIENEPFEQGPRPGIYARTRRSFPKMLQVPNIQGSDPNRTQ-EEESDLVILAPELIVIIEASIFTFHRFLKMDK-------KISNCGSL

Query:  SFGGNQSQDATLLARIRSSLDKKKTKLKELRKK-----SRRWKQKTWPQTYEDMQLLFGVVDIKIISRLVKMSRITKEQLLWCEEKMKKLDVSDGKLQRD
        S     S D  LL  IR+ L KK+ KLKE+++       +  K ++        +LL   ++++++SR++ MS++T E+L WC+EK++K+  +  K+  +
Subjt:  SFGGNQSQDATLLARIRSSLDKKKTKLKELRKK-----SRRWKQKTWPQTYEDMQLLFGVVDIKIISRLVKMSRITKEQLLWCEEKMKKLDVSDGKLQRD

Query:  PSPLLFPC
        P   L PC
Subjt:  PSPLLFPC

AT1G73850.1 Protein of unknown function (DUF1666) | e value: 3.5e-39 | % identity: 31.26
Query:  DFIMN-EVKRRLKELRRNSFMVLIPEEDSCAEEEEEEEEGE----------TSCGEREWRD-------VEAEGRQ---WWGGFGAVYDKYCERMLFFDRM
        +F+ N  +K  + E++ +S   +   +    EEEEEEE G+          TS    EWR+            R+    W  +  V+ KY E M F  R+
Subjt:  DFIMN-EVKRRLKELRRNSFMVLIPEEDSCAEEEEEEEEGE----------TSCGEREWRD-------VEAEGRQ---WWGGFGAVYDKYCERMLFFDRM

Query:  SMQPLIEAGNTQILFLINHPVGYFIEVRFVLHYITLAGSRTPSPSPRSASKKTASPLRCLSLKRIEEPQDETEHLHPPFTDSYHDIETAYVAHICLSWEA
        S Q L EA + + + +                             PRS S++    L     K+ ++    +    P   + Y ++E+AYVA ICL+WEA
Subjt:  SMQPLIEAGNTQILFLINHPVGYFIEVRFVLHYITLAGSRTPSPSPRSASKKTASPLRCLSLKRIEEPQDETEHLHPPFTDSYHDIETAYVAHICLSWEA

Query:  LHCQYTQLNHLISCQSQNPTHYNLS---AQQFQQFQVLLQRFIENEPFEQGPRPGIYARTRRSFPKMLQVPNIQGSDPNRTQEEESD----LVILAPELI
        L   Y       S   ++          A QF+ F +LLQR++ENEP+E G RP IYAR R   PK+L VP  Q  +    +E+E++      I +   +
Subjt:  LHCQYTQLNHLISCQSQNPTHYNLS---AQQFQQFQVLLQRFIENEPFEQGPRPGIYARTRRSFPKMLQVPNIQGSDPNRTQEEESD----LVILAPELI

Query:  VIIEASIFTFHRFLKMDKKISNCGSL--SFGGNQSQ---DATLLARIRSSLDKKKTKLKELRKKSRRWKQKTWPQTYEDMQLLFGVVDIKIISRLVKMSR
        +I+E  I TF  FL+ DK+   C  +  +F G   +   D TL+  ++    KKKTKLKE+R+  +  ++K      E+M++L G++D+K++SR+++M+ 
Subjt:  VIIEASIFTFHRFLKMDKKISNCGSL--SFGGNQSQ---DATLLARIRSSLDKKKTKLKELRKKSRRWKQKTWPQTYEDMQLLFGVVDIKIISRLVKMSR

Query:  ITKEQLLWCEEKMKKLDVSDG--KLQRDPSPLLFP
        + +E L WCEEKM K+ +  G   LQRD +PL FP
Subjt:  ITKEQLLWCEEKMKKLDVSDG--KLQRDPSPLLFP

AT3G20260.1 Protein of unknown function (DUF1666) | e value: 1.3e-123 | % identity: 53.63
Query:  IQSLRKAPKHDTQRDLKDDPAADPKYENGIDDESAKAD--KADYIAEAEAEAEAEDDDEDFIMNEVKRRLKELRRNSFMVLIPEEDSCAEEE----EEEE
        ++  RK+ K + +++ +D   A  +     + E+   D  K D         E EDDD+DFI NEVKRRLKELRRNSFMVLIPEE+   EEE    E+++
Subjt:  IQSLRKAPKHDTQRDLKDDPAADPKYENGIDDESAKAD--KADYIAEAEAEAEAEDDDEDFIMNEVKRRLKELRRNSFMVLIPEEDSCAEEE----EEEE

Query:  EGETSCGEREWRDVEAEGRQWWGGFGAVYDKYCERMLFFDRMSMQPLIEAGNTQILFLINHPVGYFIEVRFVLHYITLAGSRTPSPSPRSASKKTASPLR
        +GE  C   EWRDV AEG QWWGGF AVY+KYCERMLFFDR+S Q L E G                        I +A S + +PSPRSASKK +SP R
Subjt:  EGETSCGEREWRDVEAEGRQWWGGFGAVYDKYCERMLFFDRMSMQPLIEAGNTQILFLINHPVGYFIEVRFVLHYITLAGSRTPSPSPRSASKKTASPLR

Query:  CLSLKRIEEPQDETEHLHP-PFTDSYHDIETAYVAHICLSWEALHCQYTQLNHLISCQSQNPTHYNLSAQQFQQFQVLLQRFIENEPFEQGPRPGIYART
        CLSLK+ + P+++ EHL P    D Y D+ETAYVA +CL+WEALHCQYTQL+HLISCQ + PT YN +AQ FQQF VLLQR+IENEPFEQG R  +YAR 
Subjt:  CLSLKRIEEPQDETEHLHP-PFTDSYHDIETAYVAHICLSWEALHCQYTQLNHLISCQSQNPTHYNLSAQQFQQFQVLLQRFIENEPFEQGPRPGIYART

Query:  RRSFPKMLQVPNIQGSDPNRTQEEESDLVILAPELIVIIEASIFTFHRFLKMDKKISNCGSLSFG---GNQSQDATLLARIRSSLDKKKTKLKELRKKSR
        R + PK+LQ P IQGSD  +  E+++  ++LA +LI +IE+SI TF+ FLKMDKK  N G   FG    N     T L  ++SS+DKK+ K KEL KK++
Subjt:  RRSFPKMLQVPNIQGSDPNRTQEEESDLVILAPELIVIIEASIFTFHRFLKMDKKISNCGSLSFG---GNQSQDATLLARIRSSLDKKKTKLKELRKKSR

Query:  RWKQKTWPQTYEDMQLLFGVVDIKIISRLVKMSRITKEQLLWCEEKMKKLDVSDGKLQRDPSPLLFPC
          ++K+WPQT+E +QLLF  +DIK+ +R+++MS+I+KEQLLWCEEKMKKL+ S GKLQR PSP+LFPC
Subjt:  RWKQKTWPQTYEDMQLLFGVVDIKIISRLVKMSRITKEQLLWCEEKMKKLDVSDGKLQRDPSPLLFPC

AT5G39785.1 Protein of unknown function (DUF1666) | e value: 2.7e-31 | % identity: 27.09
Query:  IQSLRKAPKHDTQRDLKDDPAADPKYENGIDDESAKADKADYIAEAEAEAEAEDDDEDF--------IMNEVKRRLKELRRNSFMVLIPEEDSCAEEEEE
        + S +    +D    L D   A+   + G   ++ K+D +   + +++E E E+D   F        ++ ++K  +K+++    +  I E     EEEE+
Subjt:  IQSLRKAPKHDTQRDLKDDPAADPKYENGIDDESAKADKADYIAEAEAEAEAEDDDEDF--------IMNEVKRRLKELRRNSFMVLIPEEDSCAEEEEE

Query:  EEEGETSCGEREWRDVEAEGRQWWGGFGAVYD---KYCERMLFFDRMSMQPLIEAGNTQILFLINHPVGYFIEVRFVLHYITLAGSRTPSPSPRSASKKT
        ++  +     + WR  E +  +     G V+     Y ERM   D +S Q     G  Q                      + +  +  S    + S+ +
Subjt:  EEEGETSCGEREWRDVEAEGRQWWGGFGAVYD---KYCERMLFFDRMSMQPLIEAGNTQILFLINHPVGYFIEVRFVLHYITLAGSRTPSPSPRSASKKT

Query:  ASPLRCLSLKRIEEPQDETEHLHPPFTDSYHDIETAYVAHICLSWEALHCQYTQLNHLISCQSQNPTHYNLSAQQFQQFQVLLQRFIENEPFEQGPRPGI
         S +  ++++  +  + E E +     +   ++E  YV  +CLSWE LH QY +   L+         YN  A +FQQFQVLLQRF+ENEPFE+ PR   
Subjt:  ASPLRCLSLKRIEEPQDETEHLHPPFTDSYHDIETAYVAHICLSWEALHCQYTQLNHLISCQSQNPTHYNLSAQQFQQFQVLLQRFIENEPFEQGPRPGI

Query:  YARTRRSFPKMLQVPNIQGSDPN--------RTQEEESDLVILAPELIVIIEASIFTFHRFLKMDKKISNCGSLSF---------GGNQSQDATLLARIR
        Y + R     +LQ+P I+  D N        R  EE +D VI + +L+ I+E +I  F RF++ DK  S+                   S+D  + A ++
Subjt:  YARTRRSFPKMLQVPNIQGSDPN--------RTQEEESDLVILAPELIVIIEASIFTFHRFLKMDKKISNCGSLSF---------GGNQSQDATLLARIR

Query:  SSLDKKKTKLKELRKKS----RRWKQKTWPQTYEDMQL-LFGVVDIKIISRLVKMSRITKEQLLWCEEKMKKLDVSDGKLQRDPSPLLFPC
        S L  K+ +L+++ K      RR+++     + ED  L  F  VD+K+++R++ MS++T++ L+WC  K+ K++  + +L  DPS  LFPC
Subjt:  SSLDKKKTKLKELRKKS----RRWKQKTWPQTYEDMQL-LFGVVDIKIISRLVKMSRITKEQLLWCEEKMKKLDVSDGKLQRDPSPLLFPC

AT5G39785.2 Protein of unknown function (DUF1666) | e value: 1.5e-29 | % identity: 27.24
Query:  IQSLRKAPKHDTQRDLKDDPAADPKYENGIDDESAKADKADYIAEAEAEAEAEDDDEDF--------IMNEVKRRLKELRRNSFMVLIPEEDSCAEEEEE
        + S +    +D    L D   A+   + G   ++ K+D +   + +++E E E+D   F        ++ ++K  +K+++    +  I E     EEEE+
Subjt:  IQSLRKAPKHDTQRDLKDDPAADPKYENGIDDESAKADKADYIAEAEAEAEAEDDDEDF--------IMNEVKRRLKELRRNSFMVLIPEEDSCAEEEEE

Query:  EEEGETSCGEREWRDVEAEGRQWWGGFGAVYD---KYCERMLFFDRMSMQPLIEAGNTQILFLINHPVGYFIEVRFVLHYITLAGSRTPSPSPRSASKKT
        ++  +     + WR  E +  +     G V+     Y ERM   D +S Q     G  Q                      + +  +  S    + S+ +
Subjt:  EEEGETSCGEREWRDVEAEGRQWWGGFGAVYD---KYCERMLFFDRMSMQPLIEAGNTQILFLINHPVGYFIEVRFVLHYITLAGSRTPSPSPRSASKKT

Query:  ASPLRCLSLKRIEEPQDETEHLHPPFTDSYHDIETAYVAHICLSWEALHCQYTQLNHLISCQSQNPTHYNLSAQQFQQFQVLLQRFIENEPFEQGPRPGI
         S +  ++++  +  + E E +     +   ++E  YV  +CLSWE LH QY +   L+         YN  A +FQQFQVLLQRF+ENEPFE+ PR   
Subjt:  ASPLRCLSLKRIEEPQDETEHLHPPFTDSYHDIETAYVAHICLSWEALHCQYTQLNHLISCQSQNPTHYNLSAQQFQQFQVLLQRFIENEPFEQGPRPGI

Query:  YARTRRSFPKMLQVPNIQGSDPN--------RTQEEESDLVILAPELIVIIEASIFTFHRFLKMDKKISNCGSLSF---------GGNQSQDATLLARIR
        Y + R     +LQ+P I+  D N        R  EE +D VI + +L+ I+E +I  F RF++ DK  S+                   S+D  + A ++
Subjt:  YARTRRSFPKMLQVPNIQGSDPN--------RTQEEESDLVILAPELIVIIEASIFTFHRFLKMDKKISNCGSLSF---------GGNQSQDATLLARIR

Query:  SSLDKKKTK-----LKELRKKSRRWKQKTWPQTYEDMQL-LFGVVDIKIISRLVKMSRITKEQLLWCEEKMKKLDVSDGKLQRDPSPLLFPC
        S L     K     LK  R   RR+++     + ED  L  F  VD+K+++R++ MS++T++ L+WC  K+ K++  + +L  DPS  LFPC
Subjt:  SSLDKKKTK-----LKELRKKSRRWKQKTWPQTYEDMQL-LFGVVDIKIISRLVKMSRITKEQLLWCEEKMKKLDVSDGKLQRDPSPLLFPC


Sequences
CDS sequence
ATGGAGAGTGGTCCGACCCGACCCCGATCCGGACGTAGTAGAACGCGTCCCTTCTCTCTCTCTACGCCTTCCACTGCCACGGGCGCCATTGCTTTGCATTCTACTTCTAG
ACACTACCAGTACGCATCCTGCATTGTATTTCCAGCAGCGATTAACACATGGATGGGCCAGGCCAGCAACTGTTTTCATTCCTCATCTCCTATAGACTATAGTTATACTA
TAGGTGCCCGTCCGATTCAGTCTTTAAGGAAAGCCCCTAAACACGATACACAGAGGGATTTGAAGGATGACCCGGCTGCAGATCCCAAGTACGAGAACGGCATTGACGAT
GAGTCGGCGAAAGCGGACAAGGCCGACTATATTGCAGAAGCTGAAGCTGAAGCTGAAGCTGAAGACGATGACGAGGATTTTATAATGAACGAGGTAAAGAGGAGATTGAA
GGAACTGAGGAGGAACAGTTTCATGGTATTGATTCCGGAGGAAGATTCGTGCGCGGAGGAGGAGGAAGAAGAAGAGGAAGGGGAGACGAGCTGTGGGGAGCGTGAGTGGA
GAGACGTGGAAGCAGAAGGTCGACAGTGGTGGGGTGGGTTTGGTGCTGTGTATGACAAGTACTGTGAGAGGATGCTGTTTTTTGATCGGATGAGCATGCAACCGCTGATT
GAAGCCGGTAACACTCAAATCTTGTTCCTAATCAATCATCCTGTTGGTTATTTTATTGAAGTGAGATTTGTTTTACATTACATTACACTTGCAGGCTCTCGAACACCCTC
CCCTTCCCCAAGATCTGCATCAAAAAAGACTGCATCTCCTCTTCGCTGTCTCTCTCTGAAGAGGATCGAAGAACCCCAAGATGAGACGGAGCATCTCCACCCTCCCTTCA
CCGACTCCTATCACGATATCGAAACAGCCTATGTCGCTCACATTTGCTTGTCTTGGGAGGCCCTCCACTGTCAGTACACTCAACTCAACCACTTAATATCATGCCAATCC
CAAAACCCTACTCATTATAATCTTAGTGCTCAGCAGTTTCAGCAATTTCAAGTCCTCTTGCAAAGGTTCATTGAAAACGAACCCTTCGAACAAGGTCCCAGGCCAGGAAT
TTATGCTCGAACCCGTCGAAGTTTTCCTAAAATGCTACAGGTTCCTAACATACAAGGTTCAGATCCAAACAGGACGCAGGAAGAAGAATCTGATTTGGTTATCCTTGCTC
CTGAGCTGATTGTGATTATTGAGGCCTCAATCTTTACTTTCCACCGTTTCCTGAAGATGGACAAGAAAATCTCAAATTGTGGTTCTTTATCATTTGGTGGGAACCAAAGC
CAGGATGCCACTCTACTTGCTCGCATTCGATCTTCGCTTGACAAGAAGAAGACGAAGCTGAAAGAACTCAGGAAAAAGAGTAGACGGTGGAAGCAGAAAACATGGCCTCA
AACGTATGAAGACATGCAGTTACTTTTTGGAGTGGTGGATATTAAAATCATATCAAGACTTGTTAAAATGTCGAGGATTACTAAAGAACAGTTGCTCTGGTGCGAGGAGA
AAATGAAGAAGTTAGATGTGTCTGATGGAAAATTGCAGAGAGATCCGTCTCCTCTTCTCTTCCCATGTTAA
mRNA sequence
ATGGAGAGTGGTCCGACCCGACCCCGATCCGGACGTAGTAGAACGCGTCCCTTCTCTCTCTCTACGCCTTCCACTGCCACGGGCGCCATTGCTTTGCATTCTACTTCTAG
ACACTACCAGTACGCATCCTGCATTGTATTTCCAGCAGCGATTAACACATGGATGGGCCAGGCCAGCAACTGTTTTCATTCCTCATCTCCTATAGACTATAGTTATACTA
TAGGTGCCCGTCCGATTCAGTCTTTAAGGAAAGCCCCTAAACACGATACACAGAGGGATTTGAAGGATGACCCGGCTGCAGATCCCAAGTACGAGAACGGCATTGACGAT
GAGTCGGCGAAAGCGGACAAGGCCGACTATATTGCAGAAGCTGAAGCTGAAGCTGAAGCTGAAGACGATGACGAGGATTTTATAATGAACGAGGTAAAGAGGAGATTGAA
GGAACTGAGGAGGAACAGTTTCATGGTATTGATTCCGGAGGAAGATTCGTGCGCGGAGGAGGAGGAAGAAGAAGAGGAAGGGGAGACGAGCTGTGGGGAGCGTGAGTGGA
GAGACGTGGAAGCAGAAGGTCGACAGTGGTGGGGTGGGTTTGGTGCTGTGTATGACAAGTACTGTGAGAGGATGCTGTTTTTTGATCGGATGAGCATGCAACCGCTGATT
GAAGCCGGTAACACTCAAATCTTGTTCCTAATCAATCATCCTGTTGGTTATTTTATTGAAGTGAGATTTGTTTTACATTACATTACACTTGCAGGCTCTCGAACACCCTC
CCCTTCCCCAAGATCTGCATCAAAAAAGACTGCATCTCCTCTTCGCTGTCTCTCTCTGAAGAGGATCGAAGAACCCCAAGATGAGACGGAGCATCTCCACCCTCCCTTCA
CCGACTCCTATCACGATATCGAAACAGCCTATGTCGCTCACATTTGCTTGTCTTGGGAGGCCCTCCACTGTCAGTACACTCAACTCAACCACTTAATATCATGCCAATCC
CAAAACCCTACTCATTATAATCTTAGTGCTCAGCAGTTTCAGCAATTTCAAGTCCTCTTGCAAAGGTTCATTGAAAACGAACCCTTCGAACAAGGTCCCAGGCCAGGAAT
TTATGCTCGAACCCGTCGAAGTTTTCCTAAAATGCTACAGGTTCCTAACATACAAGGTTCAGATCCAAACAGGACGCAGGAAGAAGAATCTGATTTGGTTATCCTTGCTC
CTGAGCTGATTGTGATTATTGAGGCCTCAATCTTTACTTTCCACCGTTTCCTGAAGATGGACAAGAAAATCTCAAATTGTGGTTCTTTATCATTTGGTGGGAACCAAAGC
CAGGATGCCACTCTACTTGCTCGCATTCGATCTTCGCTTGACAAGAAGAAGACGAAGCTGAAAGAACTCAGGAAAAAGAGTAGACGGTGGAAGCAGAAAACATGGCCTCA
AACGTATGAAGACATGCAGTTACTTTTTGGAGTGGTGGATATTAAAATCATATCAAGACTTGTTAAAATGTCGAGGATTACTAAAGAACAGTTGCTCTGGTGCGAGGAGA
AAATGAAGAAGTTAGATGTGTCTGATGGAAAATTGCAGAGAGATCCGTCTCCTCTTCTCTTCCCATGTTAA
Protein sequence
MESGPTRPRSGRSRTRPFSLSTPSTATGAIALHSTSRHYQYASCIVFPAAINTWMGQASNCFHSSSPIDYSYTIGARPIQSLRKAPKHDTQRDLKDDPAADPKYENGIDD
ESAKADKADYIAEAEAEAEAEDDDEDFIMNEVKRRLKELRRNSFMVLIPEEDSCAEEEEEEEEGETSCGEREWRDVEAEGRQWWGGFGAVYDKYCERMLFFDRMSMQPLI
EAGNTQILFLINHPVGYFIEVRFVLHYITLAGSRTPSPSPRSASKKTASPLRCLSLKRIEEPQDETEHLHPPFTDSYHDIETAYVAHICLSWEALHCQYTQLNHLISCQS
QNPTHYNLSAQQFQQFQVLLQRFIENEPFEQGPRPGIYARTRRSFPKMLQVPNIQGSDPNRTQEEESDLVILAPELIVIIEASIFTFHRFLKMDKKISNCGSLSFGGNQS
QDATLLARIRSSLDKKKTKLKELRKKSRRWKQKTWPQTYEDMQLLFGVVDIKIISRLVKMSRITKEQLLWCEEKMKKLDVSDGKLQRDPSPLLFPC
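
As a quick consistency check between the CDS and protein sequences listed above, the following sketch translates short excerpts of the CDS with the standard genetic code (NCBI translation table 1). The helper `translate` and the variable names are hypothetical, and only the first 30 and last 9 nucleotides of the CDS are used to keep the example short; the same function could be applied to the full sequence.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order
# with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and other whitespace
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


cds_start = "ATGGAGAGTGGTCCGACCCGACCCCGATCC"  # first 30 nt of the CDS above
cds_end = "CCATGTTAA"                         # last 9 nt of the CDS above

print(translate(cds_start))  # MESGPTRPRS
print(translate(cds_end))    # PC*
```

Both excerpts agree with the listed protein, which begins MESGPTRPRS and ends in PC before the stop codon.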