; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0021109 (gene) of Chayote v1 genome

Gene IDSed0021109
OrganismSechium edule (Chayote v1)
DescriptionPolymerase/histidinol phosphatase-like, putative isoform 1
Genome locationLG05:40117811..40121504
RNA-Seq ExpressionSed0021109
SyntenySed0021109
Gene Ontology termsGO:0008033 - tRNA processing (biological process)
GO:0032501 - multicellular organismal process (biological process)
GO:0048856 - anatomical structure development (biological process)
GO:0090502 - RNA phosphodiester bond hydrolysis, endonucleolytic (biological process)
GO:0005655 - nucleolar ribonuclease P complex (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR002738 - RNase P subunit p30
IPR016195 - Polymerase/histidinol phosphatase-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6605499.1 Protein GAMETOPHYTE DEFECTIVE 1, partial [Cucurbita argyrosperma subsp. sororia]3.0e-26970.57Show/hide
Query:  MGFFDLNIPYDTNSSSSSIRANRIKIVAKLMELGYSGIAYNRTVKGVMSDHDRCSIPLLNVSSLQSVLPSFSAAVELHRDLLGVPRSSPFRQYTRLTVSI
        MGFFDLNIPYD +SSSSSIR NRIKIVAKLMELGYSGIAYNRT+KGVMSD DRC+IPLLNVSSLQS+LP+FSA++E HR+LLGVPRSSPFRQYTRLT+ I
Subjt:  MGFFDLNIPYDTNSSSSSIRANRIKIVAKLMELGYSGIAYNRTVKGVMSDHDRCSIPLLNVSSLQSVLPSFSAAVELHRDLLGVPRSSPFRQYTRLTVSI

Query:  NSLQEVLAVNSGNLLLKTYDLIAVKPLSQHAFDQACDKLEIDIISIDFAEKLPFRLKQGQIKSAIQRGVYFEIMYSGLLSDVHERRQMISTTKLLVDWTK
        NSLQE+LAVNSGNL+LKTYDLIAVKPL+Q+AF+QAC+KLEIDII+IDFAEKLPFRLKQG IKSAIQRGVYFEIMYS LLSDVH RRQMIS TK+LVDWTK
Subjt:  NSLQEVLAVNSGNLLLKTYDLIAVKPLSQHAFDQACDKLEIDIISIDFAEKLPFRLKQGQIKSAIQRGVYFEIMYSGLLSDVHERRQMISTTKLLVDWTK

Query:  GKNLILSSAASSVNEIRGPYDVANLSSLFGVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVERISSDEKLDLNDPWSVDLFKWDPISSGEGDLLLDD
        GKNLILSSAASSVNEIRGP+DVANLSSL GVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVERISSD+KLDLNDPWSVDLFKWDPISSGEGDLLLDD
Subjt:  GKNLILSSAASSVNEIRGPYDVANLSSLFGVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVERISSDEKLDLNDPWSVDLFKWDPISSGEGDLLLDD

Query:  IAKSFATSNTKSKNVKAIDFTSVIDSMPPQGFLVKNAMSGSEAKLSLNDNRGLSHVTDGIEPPVSVNGVIQQSHP-DEEHFSTSDRQCSVIESYQILYSH
        IAKSFA SN KSK VKAIDFTSVID+MPPQGFLVKN  SGSEAK+SLNDNRG SHV D IEPP++VNGVIQQS P D +H STSDRQCS           
Subjt:  IAKSFATSNTKSKNVKAIDFTSVIDSMPPQGFLVKNAMSGSEAKLSLNDNRGLSHVTDGIEPPVSVNGVIQQSHP-DEEHFSTSDRQCSVIESYQILYSH

Query:  GDAGKVSSNSEEGKSTVEETVQLKTSMQEEPTEMDIDNVQHQSPLPSSELNVISTNELVHSPTSTRAALAVVSGNGGKETFEVSENVASHQNEYGLKSSN
        GDAG V SNS+E K TVEE VQ K SMQE+P  MDIDNVQ Q+PLPSSELNV STN  VH PTSTR  L VVS N GKETF +SE+VASH++EY      
Subjt:  GDAGKVSSNSEEGKSTVEETVQLKTSMQEEPTEMDIDNVQHQSPLPSSELNVISTNELVHSPTSTRAALAVVSGNGGKETFEVSENVASHQNEYGLKSSN

Query:  ALSGSENVQLQSPLPSSELNVVSVNELVHPASTRDVLANVSGNDGEESIKMFEDADYHQNECDLKSSDTLSDSETVLRDKSSTNMVPENQKNMTMVIDGT
                                                                     C LKSSDTLS  E+VLRDKSSTN+V EN KNMT V+DG+
Subjt:  ALSGSENVQLQSPLPSSELNVVSVNELVHPASTRDVLANVSGNDGEESIKMFEDADYHQNECDLKSSDTLSDSETVLRDKSSTNMVPENQKNMTMVIDGT

Query:  LTEAEGCLLGSRHGEPGDLAVAEDQEVSPHSSCINGIENDYLISTQQPTSEVLMEEQKSGGSDPHIESPPLDQSISGVMSTVNAGTFLTKGRHHPASKLP
         T AE CL+G+R GEP D+A  ED+ VSP SSC+  I++DYLIS +Q TSEVLMEEQKSG +DP I+ PPLDQSISGVMST        + +HHPA  L 
Subjt:  LTEAEGCLLGSRHGEPGDLAVAEDQEVSPHSSCINGIENDYLISTQQPTSEVLMEEQKSGGSDPHIESPPLDQSISGVMSTVNAGTFLTKGRHHPASKLP

Query:  LKQLIDPMVFKRV--------RSKRRIQQPTLWLPFKRSINPLAFKKVRKT
        L +L +P VFKRV        RSKRR  +P L LPFKR INPL FKKVRKT
Subjt:  LKQLIDPMVFKRV--------RSKRRIQQPTLWLPFKRSINPLAFKKVRKT

KAG7035438.1 Ribonuclease P protein subunit p30, partial [Cucurbita argyrosperma subsp. argyrosperma]4.4e-26870.31Show/hide
Query:  MGFFDLNIPYDTNSSSSSIRANRIKIVAKLMELGYSGIAYNRTVKGVMSDHDRCSIPLLNVSSLQSVLPSFSAAVELHRDLLGVPRSSPFRQYTRLTVSI
        MGFFDLNIPYD +SSSSSIR NRIKIVAKLMELGYSGIAYNRT+KGVMSD DRC+IPLLNVSSLQS+LP+FSA++E HR+LLGVPRSSPFRQYTRLT+ I
Subjt:  MGFFDLNIPYDTNSSSSSIRANRIKIVAKLMELGYSGIAYNRTVKGVMSDHDRCSIPLLNVSSLQSVLPSFSAAVELHRDLLGVPRSSPFRQYTRLTVSI

Query:  NSLQEVLAVNSGNLLLKTYDLIAVKPLSQHAFDQACDKLEIDIISIDFAEKLPFRLKQGQIKSAIQRGVYFEIMYSGLLSDVHERRQMISTTKLLVDWTK
        NSLQE+LAVNSGNL+LKTYDLIAVKPL+Q+AF+QAC+KLEIDII+IDFAEKLPFRLKQG IKSAIQRGVYFEIMYS LLSDVH RRQMIS TK+LVDWTK
Subjt:  NSLQEVLAVNSGNLLLKTYDLIAVKPLSQHAFDQACDKLEIDIISIDFAEKLPFRLKQGQIKSAIQRGVYFEIMYSGLLSDVHERRQMISTTKLLVDWTK

Query:  GKNLILSSAASSVNEIRGPYDVANLSSLFGVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVERISSDEKLDLNDPWSVDLFKWDPISSGEGDLLLDD
        GKNLILSSAASSVNEIRGP+DVAN SSL GVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVERISSD+KLDLNDPWSVDLFKWDPISSGEGDLLLDD
Subjt:  GKNLILSSAASSVNEIRGPYDVANLSSLFGVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVERISSDEKLDLNDPWSVDLFKWDPISSGEGDLLLDD

Query:  IAKSFATSNTKSKNVKAIDFTSVIDSMPPQGFLVKNAMSGSEAKLSLNDNRGLSHVTDGIEPPVSVNGVIQQSHP-DEEHFSTSDRQCSVIESYQILYSH
        IAKSFA SN KSK VKAIDFTSVID+MPPQGFLVKN  SGSEAK+SLNDNRG SHV D IEPP++VNGVIQQS P D +H STSDR CS           
Subjt:  IAKSFATSNTKSKNVKAIDFTSVIDSMPPQGFLVKNAMSGSEAKLSLNDNRGLSHVTDGIEPPVSVNGVIQQSHP-DEEHFSTSDRQCSVIESYQILYSH

Query:  GDAGKVSSNSEEGKSTVEETVQLKTSMQEEPTEMDIDNVQHQSPLPSSELNVISTNELVHSPTSTRAALAVVSGNGGKETFEVSENVASHQNEYGLKSSN
        GDAG V SNS+E K TVEE VQ K SMQE+P  MDIDNVQ Q+PLPSSELNV STN  VH PTSTR  L VVS N GKETF +SE+VASH++EY      
Subjt:  GDAGKVSSNSEEGKSTVEETVQLKTSMQEEPTEMDIDNVQHQSPLPSSELNVISTNELVHSPTSTRAALAVVSGNGGKETFEVSENVASHQNEYGLKSSN

Query:  ALSGSENVQLQSPLPSSELNVVSVNELVHPASTRDVLANVSGNDGEESIKMFEDADYHQNECDLKSSDTLSDSETVLRDKSSTNMVPENQKNMTMVIDGT
                                                                     C LKSSDTLS  E+VLRDKSSTN+V EN KNMT V+DG+
Subjt:  ALSGSENVQLQSPLPSSELNVVSVNELVHPASTRDVLANVSGNDGEESIKMFEDADYHQNECDLKSSDTLSDSETVLRDKSSTNMVPENQKNMTMVIDGT

Query:  LTEAEGCLLGSRHGEPGDLAVAEDQEVSPHSSCINGIENDYLISTQQPTSEVLMEEQKSGGSDPHIESPPLDQSISGVMSTVNAGTFLTKGRHHPASKLP
         T AE CL+G+R GEP D+A  ED+ VSP SSC+  I++DYLIS +Q TSEVLMEEQKSG +DP I+ PPLDQSISGVMST        + +HHPA  L 
Subjt:  LTEAEGCLLGSRHGEPGDLAVAEDQEVSPHSSCINGIENDYLISTQQPTSEVLMEEQKSGGSDPHIESPPLDQSISGVMSTVNAGTFLTKGRHHPASKLP

Query:  LKQLIDPMVFKRV--------RSKRRIQQPTLWLPFKRSINPLAFKKVRKT
        L +L +P VFKRV        RSKRR  +P L LPFKR INPL FKKVRKT
Subjt:  LKQLIDPMVFKRV--------RSKRRIQQPTLWLPFKRSINPLAFKKVRKT

XP_023007324.1 uncharacterized protein LOC111499856 isoform X1 [Cucurbita maxima]2.2e-27270.71Show/hide
Query:  MGFFDLNIPYDTNSSSSSIRANRIKIVAKLMELGYSGIAYNRTVKGVMSDHDRCSIPLLNVSSLQSVLPSFSAAVELHRDLLGVPRSSPFRQYTRLTVSI
        MGFFDLNIPYD +SSSSSIR NRIKIVAKLMELGYSGIAYNRT+KGVMSD DRC+IPLLNVSSLQS+LP+FSA++E HR+LLGVPRSSPFRQYTRLT+ I
Subjt:  MGFFDLNIPYDTNSSSSSIRANRIKIVAKLMELGYSGIAYNRTVKGVMSDHDRCSIPLLNVSSLQSVLPSFSAAVELHRDLLGVPRSSPFRQYTRLTVSI

Query:  NSLQEVLAVNSGNLLLKTYDLIAVKPLSQHAFDQACDKLEIDIISIDFAEKLPFRLKQGQIKSAIQRGVYFEIMYSGLLSDVHERRQMISTTKLLVDWTK
        NSLQE+LAVNSGNL+LKTYDLIAVKPL+Q+AF+QAC+KLEIDII+IDFAEKLPFRLKQG IKSAIQRGVYFEIMYS LLSDVH RRQMISTTK+LVDWTK
Subjt:  NSLQEVLAVNSGNLLLKTYDLIAVKPLSQHAFDQACDKLEIDIISIDFAEKLPFRLKQGQIKSAIQRGVYFEIMYSGLLSDVHERRQMISTTKLLVDWTK

Query:  GKNLILSSAASSVNEIRGPYDVANLSSLFGVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVERISSDEKLDLNDPWSVDLFKWDPISSGEGDLLLDD
        GKNLILSSAASSVNEIRGP+DVANLSSL GVSMERAK AVSKNCRNLIANALKRKQFYKETIRVERI+SD+KLD NDPWSVDLFKWDPISSGEGDLLLDD
Subjt:  GKNLILSSAASSVNEIRGPYDVANLSSLFGVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVERISSDEKLDLNDPWSVDLFKWDPISSGEGDLLLDD

Query:  IAKSFATSNTKSKNVKAIDFTSVIDSMPPQGFLVKNAMSGSEAKLSLNDNRGLSHVTDGIEPPVSVNGVIQQSHP-DEEHFSTSDRQCSVIESYQILYSH
        IAKSFA +N KSK VKAIDFTSVID+MPPQGFL+KN ++GSEAK+SLNDNRGLS V D IEPP++VNGVIQQS P D +H STSDRQCSVIE+ +I  SH
Subjt:  IAKSFATSNTKSKNVKAIDFTSVIDSMPPQGFLVKNAMSGSEAKLSLNDNRGLSHVTDGIEPPVSVNGVIQQSHP-DEEHFSTSDRQCSVIESYQILYSH

Query:  GDAGKVSSNSEEGKSTVEETVQLKTSMQEEPTEMDIDNVQHQSPLPSSELNVISTNELVHSPTSTRAALAVVSGNGGKETFEVSENVASHQNEYGLKSSN
         DAGKV SNS+E K TVEE VQ K S+QEEP EMDIDNVQ Q+PLPSSELNV+STN  VH PTSTR  L VVS N GKETF +SE+VASH++EY      
Subjt:  GDAGKVSSNSEEGKSTVEETVQLKTSMQEEPTEMDIDNVQHQSPLPSSELNVISTNELVHSPTSTRAALAVVSGNGGKETFEVSENVASHQNEYGLKSSN

Query:  ALSGSENVQLQSPLPSSELNVVSVNELVHPASTRDVLANVSGNDGEESIKMFEDADYHQNECDLKSSDTLSDSETVLRDKSSTNMVPENQKNMTMVIDGT
                                                                     C LKSSDTLS  E+VLRDKSSTN+V ENQ NMT V+DG+
Subjt:  ALSGSENVQLQSPLPSSELNVVSVNELVHPASTRDVLANVSGNDGEESIKMFEDADYHQNECDLKSSDTLSDSETVLRDKSSTNMVPENQKNMTMVIDGT

Query:  LTEAEGCLLGSRHGEPGDLAVAEDQEVSPHSSCINGIENDYLISTQQPTSEVLMEEQKSGGSDPHIESPPLDQSISGVMSTVNAGTFLTKGRHHPASKLP
         T AE CL+G+R GEP D+A  ED  VSP SSC+  I++DYLIS +Q TSEVLMEEQKS  +DP I+ PP+DQSISGVMST        + +HHPA  L 
Subjt:  LTEAEGCLLGSRHGEPGDLAVAEDQEVSPHSSCINGIENDYLISTQQPTSEVLMEEQKSGGSDPHIESPPLDQSISGVMSTVNAGTFLTKGRHHPASKLP

Query:  LKQLIDPMVFKRV--------RSKRRIQQPTLWLPFKRSINPLAFKKVRKT
        L QL +P VFKRV        RSKRR  +P L LPFKR INPL FKKVRKT
Subjt:  LKQLIDPMVFKRV--------RSKRRIQQPTLWLPFKRSINPLAFKKVRKT

XP_023007325.1 uncharacterized protein LOC111499856 isoform X2 [Cucurbita maxima]1.2e-27070.59Show/hide
Query:  MGFFDLNIPYDTNSSSSSIRANRIKIVAKLMELGYSGIAYNRTVKGVMSDHDRCSIPLLNVSSLQSVLPSFSAAVELHRDLLGVPRSSPFRQYTRLTVSI
        MGFFDLNIPYD +SSSSSIR NRIKIVAKLMELGYSGIAYNRT+KGVMSD DRC+IPLLNVSSLQS+LP+FSA++E HR+LLGVPRSSPFRQYTRLT+ I
Subjt:  MGFFDLNIPYDTNSSSSSIRANRIKIVAKLMELGYSGIAYNRTVKGVMSDHDRCSIPLLNVSSLQSVLPSFSAAVELHRDLLGVPRSSPFRQYTRLTVSI

Query:  NSLQEVLAVNSGNLLLKTYDLIAVKPLSQHAFDQACDKLEIDIISIDFAEKLPFRLKQGQIKSAIQRGVYFEIMYSGLLSDVHERRQMISTTKLLVDWTK
        NSLQE+LAVNSGNL+LKTYDLIAVKPL+Q+AF+QAC+KLEIDII+IDFAEKLPFRLKQG IKSAIQRGVYFEIMYS LLSDVH RRQMISTTK+LVDWTK
Subjt:  NSLQEVLAVNSGNLLLKTYDLIAVKPLSQHAFDQACDKLEIDIISIDFAEKLPFRLKQGQIKSAIQRGVYFEIMYSGLLSDVHERRQMISTTKLLVDWTK

Query:  GKNLILSSAASSVNEIRGPYDVANLSSLFGVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVERISSDEKLDLNDPWSVDLFKWDPISSGEGDLLLDD
        GKNLILSSAASSVNEIRGP+DVANLSSL GVSMERAK AVSKNCRNLIANALKRKQFYKETIRVERI+SD+KLD NDPWSVDLFKWDPISSGEGDLLLDD
Subjt:  GKNLILSSAASSVNEIRGPYDVANLSSLFGVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVERISSDEKLDLNDPWSVDLFKWDPISSGEGDLLLDD

Query:  IAKSFATSNTKSKNVKAIDFTSVIDSMPPQGFLVKNAMSGSEAKLSLNDNRGLSHVTDGIEPPVSVNGVIQQSHP-DEEHFSTSDRQCSVIESYQILYSH
        IAKSFA +N KSK VKAIDFTSVID+MPPQGFL+KN ++GSEAK+SLNDNRGLS V D IEPP++VNGVIQQS P D +H STSDRQCSVIE+ +I  SH
Subjt:  IAKSFATSNTKSKNVKAIDFTSVIDSMPPQGFLVKNAMSGSEAKLSLNDNRGLSHVTDGIEPPVSVNGVIQQSHP-DEEHFSTSDRQCSVIESYQILYSH

Query:  GDAGKVSSNSEEGKSTVEETVQLKTSMQEEPTEMDIDNVQHQSPLPSSELNVISTNELVHSPTSTRAALAVVSGNGGKETFEVSENVASHQNEYGLKSSN
         DAGKV SNS+E K TVEE VQ K S+QEEP EMDIDNVQ Q+PLPSSELNV+STN  VH PTSTR  L VVS N GKETF +SE+VASH++EY      
Subjt:  GDAGKVSSNSEEGKSTVEETVQLKTSMQEEPTEMDIDNVQHQSPLPSSELNVISTNELVHSPTSTRAALAVVSGNGGKETFEVSENVASHQNEYGLKSSN

Query:  ALSGSENVQLQSPLPSSELNVVSVNELVHPASTRDVLANVSGNDGEESIKMFEDADYHQNECDLKSSDTLSDSETVLRDKSSTNMVPENQKNMTMVIDGT
                                                                     C LKSSDTLS  E+VLRDKSSTN+V ENQ NMT V+DG+
Subjt:  ALSGSENVQLQSPLPSSELNVVSVNELVHPASTRDVLANVSGNDGEESIKMFEDADYHQNECDLKSSDTLSDSETVLRDKSSTNMVPENQKNMTMVIDGT

Query:  LTEAEGCLLGSRHGEPGDLAVAEDQEVSPHSSCINGIENDYLISTQQPTSEVLMEEQKSGGSDPHIESPPLDQSISGVMSTVNAGTFLTKGRHHPASKLP
         T AE CL+G+R GEP D+A  ED  VSP SSC+  I++DYLIS +Q TSEVLMEEQKS  +DP I+ PP+DQSISGVMST        + +HHPA  L 
Subjt:  LTEAEGCLLGSRHGEPGDLAVAEDQEVSPHSSCINGIENDYLISTQQPTSEVLMEEQKSGGSDPHIESPPLDQSISGVMSTVNAGTFLTKGRHHPASKLP

Query:  LKQLIDPMVFKRV--------RSKRRIQQPTLWLPFKRSINPLAFKKV
        L QL +P VFKRV        RSKRR  +P L LPFKR INPL FKKV
Subjt:  LKQLIDPMVFKRV--------RSKRRIQQPTLWLPFKRSINPLAFKKV

XP_023007326.1 uncharacterized protein LOC111499856 isoform X3 [Cucurbita maxima]4.8e-26769.91Show/hide
Query:  MGFFDLNIPYDTNSSSSSIRANRIKIVAKLMELGYSGIAYNRTVKGVMSDHDRCSIPLLNVSSLQSVLPSFSAAVELHRDLLGVPRSSPFRQYTRLTVSI
        MGFFDLNIPYD +SSSSSIR NRIKIVAKLMELGYSGIAYNRT+KGVMSD DRC+IPLLNVSSLQS+LP+FSA++E HR+LLGVPRSSPFRQYTRLT+ I
Subjt:  MGFFDLNIPYDTNSSSSSIRANRIKIVAKLMELGYSGIAYNRTVKGVMSDHDRCSIPLLNVSSLQSVLPSFSAAVELHRDLLGVPRSSPFRQYTRLTVSI

Query:  NSLQEVLAVNSGNLLLKTYDLIAVKPLSQHAFDQACDKLEIDIISIDFAEKLPFRLKQGQIKSAIQRGVYFEIMYSGLLSDVHERRQMISTTKLLVDWTK
        NSLQE+LAVNSGNL+LKTYDLIAVKPL+Q+AF+QAC+KLEIDII+IDFAEKLPFRLKQG IKSAIQRGVYFEIMYS LLSDVH RRQMISTTK+LVDWTK
Subjt:  NSLQEVLAVNSGNLLLKTYDLIAVKPLSQHAFDQACDKLEIDIISIDFAEKLPFRLKQGQIKSAIQRGVYFEIMYSGLLSDVHERRQMISTTKLLVDWTK

Query:  GKNLILSSAASSVNEIRGPYDVANLSSLFGVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVERISSDEKLDLNDPWSVDLFKWDPISSGEGDLLLDD
        GKNLILSSAASSVNEIRGP+DVANLSSL GVSMERAK AVSKNCRNLIANALKRKQFYKETIRVERI+SD+KLD NDPWSVDLFKWDPISSGEGDLLLDD
Subjt:  GKNLILSSAASSVNEIRGPYDVANLSSLFGVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVERISSDEKLDLNDPWSVDLFKWDPISSGEGDLLLDD

Query:  IAKSFATSNTKSKNVKAIDFTSVIDSMPPQGFLVKNAMSGSEAKLSLNDNRGLSHVTDGIEPPVSVNGVIQQSHP-DEEHFSTSDRQCSVIESYQILYSH
        IAKSFA +N KSK VKAIDFTSVID+MPPQGFL+KN ++GSEAK+SLNDNRGLS V D IEPP++VNGVIQQS P D +H STSDRQCS           
Subjt:  IAKSFATSNTKSKNVKAIDFTSVIDSMPPQGFLVKNAMSGSEAKLSLNDNRGLSHVTDGIEPPVSVNGVIQQSHP-DEEHFSTSDRQCSVIESYQILYSH

Query:  GDAGKVSSNSEEGKSTVEETVQLKTSMQEEPTEMDIDNVQHQSPLPSSELNVISTNELVHSPTSTRAALAVVSGNGGKETFEVSENVASHQNEYGLKSSN
         DAGKV SNS+E K TVEE VQ K S+QEEP EMDIDNVQ Q+PLPSSELNV+STN  VH PTSTR  L VVS N GKETF +SE+VASH++EY      
Subjt:  GDAGKVSSNSEEGKSTVEETVQLKTSMQEEPTEMDIDNVQHQSPLPSSELNVISTNELVHSPTSTRAALAVVSGNGGKETFEVSENVASHQNEYGLKSSN

Query:  ALSGSENVQLQSPLPSSELNVVSVNELVHPASTRDVLANVSGNDGEESIKMFEDADYHQNECDLKSSDTLSDSETVLRDKSSTNMVPENQKNMTMVIDGT
                                                                     C LKSSDTLS  E+VLRDKSSTN+V ENQ NMT V+DG+
Subjt:  ALSGSENVQLQSPLPSSELNVVSVNELVHPASTRDVLANVSGNDGEESIKMFEDADYHQNECDLKSSDTLSDSETVLRDKSSTNMVPENQKNMTMVIDGT

Query:  LTEAEGCLLGSRHGEPGDLAVAEDQEVSPHSSCINGIENDYLISTQQPTSEVLMEEQKSGGSDPHIESPPLDQSISGVMSTVNAGTFLTKGRHHPASKLP
         T AE CL+G+R GEP D+A  ED  VSP SSC+  I++DYLIS +Q TSEVLMEEQKS  +DP I+ PP+DQSISGVMST        + +HHPA  L 
Subjt:  LTEAEGCLLGSRHGEPGDLAVAEDQEVSPHSSCINGIENDYLISTQQPTSEVLMEEQKSGGSDPHIESPPLDQSISGVMSTVNAGTFLTKGRHHPASKLP

Query:  LKQLIDPMVFKRV--------RSKRRIQQPTLWLPFKRSINPLAFKKVRKT
        L QL +P VFKRV        RSKRR  +P L LPFKR INPL FKKVRKT
Subjt:  LKQLIDPMVFKRV--------RSKRRIQQPTLWLPFKRSINPLAFKKVRKT

TrEMBL top hitse value%identityAlignment
A0A6J1G517 uncharacterized protein LOC111450984 isoform X11.8e-25964.52Show/hide
Query:  MGFFDLNIPYDTNSSSSSIRANRIKIVAKLMELGYSGIAYNRTVKGVMSDHDRCSIPLLNVSSLQSVLPSFSAAVELHRDLLGVPRSSPFRQYTRLTVSI
        MGFFDLN+PYD +SSSSSIR NRIKIVAKLMELGYSGIAYNRT+KGVMSD DRC+IPLLNVSSLQS+LP+FSA++E HR+LLGVPRSSPFRQYTRLT+ I
Subjt:  MGFFDLNIPYDTNSSSSSIRANRIKIVAKLMELGYSGIAYNRTVKGVMSDHDRCSIPLLNVSSLQSVLPSFSAAVELHRDLLGVPRSSPFRQYTRLTVSI

Query:  NSLQEVLAVNSGNLLLKTYDLIAVKPLSQHAFDQACDKLEIDIISIDFAEKLPFRLKQGQIKSAIQRGVYFEIMYSGLLSDVHERRQMISTTKLLVDWTK
        NSLQE+LAVNSGNL+LKTYDLIAVKPL+Q+AF+QAC+KLEIDII+IDFAEKLPFRLKQG IKSAIQRGVYFEIMYS LLSDVH RRQMISTTK+LVDWTK
Subjt:  NSLQEVLAVNSGNLLLKTYDLIAVKPLSQHAFDQACDKLEIDIISIDFAEKLPFRLKQGQIKSAIQRGVYFEIMYSGLLSDVHERRQMISTTKLLVDWTK

Query:  GKNLILSSAASSVNEIRGPYDVANLSSLFGVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVERISSDEKLDLNDPWSVDLFKWDPISSGEGDLLLDD
        GKNLILSSAASSVNEIRGP+DVANLSSL GVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVERISSD+KLD NDPWSVDLFKWDP+SSGEGDLLLDD
Subjt:  GKNLILSSAASSVNEIRGPYDVANLSSLFGVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVERISSDEKLDLNDPWSVDLFKWDPISSGEGDLLLDD

Query:  IAKSFATSNTKSKNVKAIDFTSVIDSMPPQGFLVKNAMSGSEAKLSLNDNRGLSHVTDGIEPPVSVNGVIQQSHP-------------------------
        IA+SFA SN KSK VKAIDFTSVID+ PPQGFLVKN +SG EAK+SLNDNRG SHV D IEPP++VNGVIQ S P                         
Subjt:  IAKSFATSNTKSKNVKAIDFTSVIDSMPPQGFLVKNAMSGSEAKLSLNDNRGLSHVTDGIEPPVSVNGVIQQSHP-------------------------

Query:  ----------------------------------DEEHFSTSDRQCSVIESYQILYSHGDAGKVSSNSEEGKSTVEETVQLKTSMQEEPTEMDIDNVQHQ
                                          D +H STS+RQCS           GDAGKV SNS+E K TVEE VQ K SMQE+P  MDIDNVQ Q
Subjt:  ----------------------------------DEEHFSTSDRQCSVIESYQILYSHGDAGKVSSNSEEGKSTVEETVQLKTSMQEEPTEMDIDNVQHQ

Query:  SPLPSSELNVISTNELVHSPTSTRAALAVVSGNGGKETFEVSENVASHQNEYGLKSSNALSGSENVQLQSPLPSSELNVVSVNELVHPASTRDVLANVSG
        +PLPSSELNV STN  VH PTSTR  L V+S N GKE F +SE+VASH++EY                                                
Subjt:  SPLPSSELNVISTNELVHSPTSTRAALAVVSGNGGKETFEVSENVASHQNEYGLKSSNALSGSENVQLQSPLPSSELNVVSVNELVHPASTRDVLANVSG

Query:  NDGEESIKMFEDADYHQNECDLKSSDTLSDSETVLRDKSSTNMVPENQKNMTMVIDGTLTEAEGCLLGSRHGEPGDLAVAEDQEVSPHSSCINGIENDYL
                           C LKSSDTLS  E+VLRDKSSTN+V EN KNMT V+DG+ T AE CL+G+R GEP D+A  ED+ VSP SSC+  I++DYL
Subjt:  NDGEESIKMFEDADYHQNECDLKSSDTLSDSETVLRDKSSTNMVPENQKNMTMVIDGTLTEAEGCLLGSRHGEPGDLAVAEDQEVSPHSSCINGIENDYL

Query:  ISTQQPTSEVLMEEQKSGGSDPHIESPPLDQSISGVMSTVNAGTFLTKGRHHPASKLPLKQLIDPMVFKRV--------RSKRRIQQPTLWLPFKRSINP
        IS +Q TSEVLMEEQKSG +DP I+ PPLDQSISGVMST        + +HHPA  L L +L +P VFKRV        RSKRR  +P L LPFKR INP
Subjt:  ISTQQPTSEVLMEEQKSGGSDPHIESPPLDQSISGVMSTVNAGTFLTKGRHHPASKLPLKQLIDPMVFKRV--------RSKRRIQQPTLWLPFKRSINP

Query:  LAFKKVRKT
        L FKKVRKT
Subjt:  LAFKKVRKT

A0A6J1G550 uncharacterized protein LOC111450984 isoform X21.5e-25864.36Show/hide
Query:  MGFFDLNIPYDTNSSSSSIRANRIKIVAKLMELGYSGIAYNRTVKGVMSDHDRCSIPLLNVSSLQSVLPSFSAAVELHRDLLGVPRSSPFRQYTRLTVSI
        MGFFDLN+PYD +SSSSSIR NRIKIVAKLMELGYSGIAYNRT+KGVMSD DRC+IPLLNVSSLQS+LP+FSA++E HR+LLGVPRSSPFRQYTRLT+ I
Subjt:  MGFFDLNIPYDTNSSSSSIRANRIKIVAKLMELGYSGIAYNRTVKGVMSDHDRCSIPLLNVSSLQSVLPSFSAAVELHRDLLGVPRSSPFRQYTRLTVSI

Query:  NSLQEVLAVNSGNLLLKTYDLIAVKPLSQHAFDQACDKLEIDIISIDFAEKLPFRLKQGQIKSAIQRGVYFEIMYSGLLSDVHERRQMISTTKLLVDWTK
        NSLQE+LAVNSGNL+LKTYDLIAVKPL+Q+AF+QAC+KLEIDII+IDFAEKLPFRLKQG IKSAIQRGVYFEIMYS LLSDVH RRQMISTTK+LVDWTK
Subjt:  NSLQEVLAVNSGNLLLKTYDLIAVKPLSQHAFDQACDKLEIDIISIDFAEKLPFRLKQGQIKSAIQRGVYFEIMYSGLLSDVHERRQMISTTKLLVDWTK

Query:  GKNLILSSAASSVNEIRGPYDVANLSSLFGVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVERISSDEKLDLNDPWSVDLFKWDPISSGEGDLLLDD
        GKNLILSSAASSVNEIRGP+DVANLSSL GVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVERISSD+KLD NDPWSVDLFKWDP+SSGEGDLLLDD
Subjt:  GKNLILSSAASSVNEIRGPYDVANLSSLFGVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVERISSDEKLDLNDPWSVDLFKWDPISSGEGDLLLDD

Query:  IAKSFATSNTKSKNVKAIDFTSVIDSMPPQGFLVKNAMSGSEAKLSLNDNRGLSHVTDGIEPPVSVNGVIQQSHP-------------------------
        IA+SFA SN KSK VKAIDFTSVID+ PPQGFLVKN +SG EAK+SLNDNRG SHV D IEPP++VNGVIQ S P                         
Subjt:  IAKSFATSNTKSKNVKAIDFTSVIDSMPPQGFLVKNAMSGSEAKLSLNDNRGLSHVTDGIEPPVSVNGVIQQSHP-------------------------

Query:  ----------------------------------DEEHFSTSDRQCSVIESYQILYSHGDAGKVSSNSEEGKSTVEETVQLKTSMQEEPTEMDIDNVQHQ
                                          D +H STS+RQCS           GDAGKV SNS+E K TVEE VQ K SMQE+P  MDIDNVQ Q
Subjt:  ----------------------------------DEEHFSTSDRQCSVIESYQILYSHGDAGKVSSNSEEGKSTVEETVQLKTSMQEEPTEMDIDNVQHQ

Query:  SPLPSSELNVISTNELVHSPTSTRAALAVVSGNGGKETFEVSENVASHQNEYGLKSSNALSGSENVQLQSPLPSSELNVVSVNELVHPASTRDVLANVSG
        +PLPSSELNV STN  VH PTSTR  L V+S N GKE F +SE+VASH++EY                                                
Subjt:  SPLPSSELNVISTNELVHSPTSTRAALAVVSGNGGKETFEVSENVASHQNEYGLKSSNALSGSENVQLQSPLPSSELNVVSVNELVHPASTRDVLANVSG

Query:  NDGEESIKMFEDADYHQNECDLKSSDTLSDSETVLRDKSSTNMVPENQKNMTMVIDGTLTEAEGCLLGSRHGEPGDLAVAEDQEVSPHSSCINGIENDYL
                           C LKSSDTLS  E+VLRDKSSTN+V EN KNMT V+DG+ T AE CL+G+R GEP D+A  ED+ VSP SSC+  I++DYL
Subjt:  NDGEESIKMFEDADYHQNECDLKSSDTLSDSETVLRDKSSTNMVPENQKNMTMVIDGTLTEAEGCLLGSRHGEPGDLAVAEDQEVSPHSSCINGIENDYL

Query:  ISTQQPTSEVLMEEQKSGGSDPHIESPPLDQSISGVMSTVNAGTFLTKGRHHPASKLPLKQLIDPMVFKRV--------RSKRRIQQPTLWLPFKRSINP
        IS +Q TSEVLMEEQKSG +DP I+ PPLDQSISGVMST        + +HHPA  L L +L +P VFKRV        RSKRR  +P L LPFKR INP
Subjt:  ISTQQPTSEVLMEEQKSGGSDPHIESPPLDQSISGVMSTVNAGTFLTKGRHHPASKLPLKQLIDPMVFKRV--------RSKRRIQQPTLWLPFKRSINP

Query:  LAFKKVRK
        L FKKV+K
Subjt:  LAFKKVRK

A0A6J1L082 uncharacterized protein LOC111499856 isoform X32.3e-26769.91Show/hide
Query:  MGFFDLNIPYDTNSSSSSIRANRIKIVAKLMELGYSGIAYNRTVKGVMSDHDRCSIPLLNVSSLQSVLPSFSAAVELHRDLLGVPRSSPFRQYTRLTVSI
        MGFFDLNIPYD +SSSSSIR NRIKIVAKLMELGYSGIAYNRT+KGVMSD DRC+IPLLNVSSLQS+LP+FSA++E HR+LLGVPRSSPFRQYTRLT+ I
Subjt:  MGFFDLNIPYDTNSSSSSIRANRIKIVAKLMELGYSGIAYNRTVKGVMSDHDRCSIPLLNVSSLQSVLPSFSAAVELHRDLLGVPRSSPFRQYTRLTVSI

Query:  NSLQEVLAVNSGNLLLKTYDLIAVKPLSQHAFDQACDKLEIDIISIDFAEKLPFRLKQGQIKSAIQRGVYFEIMYSGLLSDVHERRQMISTTKLLVDWTK
        NSLQE+LAVNSGNL+LKTYDLIAVKPL+Q+AF+QAC+KLEIDII+IDFAEKLPFRLKQG IKSAIQRGVYFEIMYS LLSDVH RRQMISTTK+LVDWTK
Subjt:  NSLQEVLAVNSGNLLLKTYDLIAVKPLSQHAFDQACDKLEIDIISIDFAEKLPFRLKQGQIKSAIQRGVYFEIMYSGLLSDVHERRQMISTTKLLVDWTK

Query:  GKNLILSSAASSVNEIRGPYDVANLSSLFGVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVERISSDEKLDLNDPWSVDLFKWDPISSGEGDLLLDD
        GKNLILSSAASSVNEIRGP+DVANLSSL GVSMERAK AVSKNCRNLIANALKRKQFYKETIRVERI+SD+KLD NDPWSVDLFKWDPISSGEGDLLLDD
Subjt:  GKNLILSSAASSVNEIRGPYDVANLSSLFGVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVERISSDEKLDLNDPWSVDLFKWDPISSGEGDLLLDD

Query:  IAKSFATSNTKSKNVKAIDFTSVIDSMPPQGFLVKNAMSGSEAKLSLNDNRGLSHVTDGIEPPVSVNGVIQQSHP-DEEHFSTSDRQCSVIESYQILYSH
        IAKSFA +N KSK VKAIDFTSVID+MPPQGFL+KN ++GSEAK+SLNDNRGLS V D IEPP++VNGVIQQS P D +H STSDRQCS           
Subjt:  IAKSFATSNTKSKNVKAIDFTSVIDSMPPQGFLVKNAMSGSEAKLSLNDNRGLSHVTDGIEPPVSVNGVIQQSHP-DEEHFSTSDRQCSVIESYQILYSH

Query:  GDAGKVSSNSEEGKSTVEETVQLKTSMQEEPTEMDIDNVQHQSPLPSSELNVISTNELVHSPTSTRAALAVVSGNGGKETFEVSENVASHQNEYGLKSSN
         DAGKV SNS+E K TVEE VQ K S+QEEP EMDIDNVQ Q+PLPSSELNV+STN  VH PTSTR  L VVS N GKETF +SE+VASH++EY      
Subjt:  GDAGKVSSNSEEGKSTVEETVQLKTSMQEEPTEMDIDNVQHQSPLPSSELNVISTNELVHSPTSTRAALAVVSGNGGKETFEVSENVASHQNEYGLKSSN

Query:  ALSGSENVQLQSPLPSSELNVVSVNELVHPASTRDVLANVSGNDGEESIKMFEDADYHQNECDLKSSDTLSDSETVLRDKSSTNMVPENQKNMTMVIDGT
                                                                     C LKSSDTLS  E+VLRDKSSTN+V ENQ NMT V+DG+
Subjt:  ALSGSENVQLQSPLPSSELNVVSVNELVHPASTRDVLANVSGNDGEESIKMFEDADYHQNECDLKSSDTLSDSETVLRDKSSTNMVPENQKNMTMVIDGT

Query:  LTEAEGCLLGSRHGEPGDLAVAEDQEVSPHSSCINGIENDYLISTQQPTSEVLMEEQKSGGSDPHIESPPLDQSISGVMSTVNAGTFLTKGRHHPASKLP
         T AE CL+G+R GEP D+A  ED  VSP SSC+  I++DYLIS +Q TSEVLMEEQKS  +DP I+ PP+DQSISGVMST        + +HHPA  L 
Subjt:  LTEAEGCLLGSRHGEPGDLAVAEDQEVSPHSSCINGIENDYLISTQQPTSEVLMEEQKSGGSDPHIESPPLDQSISGVMSTVNAGTFLTKGRHHPASKLP

Query:  LKQLIDPMVFKRV--------RSKRRIQQPTLWLPFKRSINPLAFKKVRKT
        L QL +P VFKRV        RSKRR  +P L LPFKR INPL FKKVRKT
Subjt:  LKQLIDPMVFKRV--------RSKRRIQQPTLWLPFKRSINPLAFKKVRKT

A0A6J1L2N3 uncharacterized protein LOC111499856 isoform X25.9e-27170.59Show/hide
Query:  MGFFDLNIPYDTNSSSSSIRANRIKIVAKLMELGYSGIAYNRTVKGVMSDHDRCSIPLLNVSSLQSVLPSFSAAVELHRDLLGVPRSSPFRQYTRLTVSI
        MGFFDLNIPYD +SSSSSIR NRIKIVAKLMELGYSGIAYNRT+KGVMSD DRC+IPLLNVSSLQS+LP+FSA++E HR+LLGVPRSSPFRQYTRLT+ I
Subjt:  MGFFDLNIPYDTNSSSSSIRANRIKIVAKLMELGYSGIAYNRTVKGVMSDHDRCSIPLLNVSSLQSVLPSFSAAVELHRDLLGVPRSSPFRQYTRLTVSI

Query:  NSLQEVLAVNSGNLLLKTYDLIAVKPLSQHAFDQACDKLEIDIISIDFAEKLPFRLKQGQIKSAIQRGVYFEIMYSGLLSDVHERRQMISTTKLLVDWTK
        NSLQE+LAVNSGNL+LKTYDLIAVKPL+Q+AF+QAC+KLEIDII+IDFAEKLPFRLKQG IKSAIQRGVYFEIMYS LLSDVH RRQMISTTK+LVDWTK
Subjt:  NSLQEVLAVNSGNLLLKTYDLIAVKPLSQHAFDQACDKLEIDIISIDFAEKLPFRLKQGQIKSAIQRGVYFEIMYSGLLSDVHERRQMISTTKLLVDWTK

Query:  GKNLILSSAASSVNEIRGPYDVANLSSLFGVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVERISSDEKLDLNDPWSVDLFKWDPISSGEGDLLLDD
        GKNLILSSAASSVNEIRGP+DVANLSSL GVSMERAK AVSKNCRNLIANALKRKQFYKETIRVERI+SD+KLD NDPWSVDLFKWDPISSGEGDLLLDD
Subjt:  GKNLILSSAASSVNEIRGPYDVANLSSLFGVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVERISSDEKLDLNDPWSVDLFKWDPISSGEGDLLLDD

Query:  IAKSFATSNTKSKNVKAIDFTSVIDSMPPQGFLVKNAMSGSEAKLSLNDNRGLSHVTDGIEPPVSVNGVIQQSHP-DEEHFSTSDRQCSVIESYQILYSH
        IAKSFA +N KSK VKAIDFTSVID+MPPQGFL+KN ++GSEAK+SLNDNRGLS V D IEPP++VNGVIQQS P D +H STSDRQCSVIE+ +I  SH
Subjt:  IAKSFATSNTKSKNVKAIDFTSVIDSMPPQGFLVKNAMSGSEAKLSLNDNRGLSHVTDGIEPPVSVNGVIQQSHP-DEEHFSTSDRQCSVIESYQILYSH

Query:  GDAGKVSSNSEEGKSTVEETVQLKTSMQEEPTEMDIDNVQHQSPLPSSELNVISTNELVHSPTSTRAALAVVSGNGGKETFEVSENVASHQNEYGLKSSN
         DAGKV SNS+E K TVEE VQ K S+QEEP EMDIDNVQ Q+PLPSSELNV+STN  VH PTSTR  L VVS N GKETF +SE+VASH++EY      
Subjt:  GDAGKVSSNSEEGKSTVEETVQLKTSMQEEPTEMDIDNVQHQSPLPSSELNVISTNELVHSPTSTRAALAVVSGNGGKETFEVSENVASHQNEYGLKSSN

Query:  ALSGSENVQLQSPLPSSELNVVSVNELVHPASTRDVLANVSGNDGEESIKMFEDADYHQNECDLKSSDTLSDSETVLRDKSSTNMVPENQKNMTMVIDGT
                                                                     C LKSSDTLS  E+VLRDKSSTN+V ENQ NMT V+DG+
Subjt:  ALSGSENVQLQSPLPSSELNVVSVNELVHPASTRDVLANVSGNDGEESIKMFEDADYHQNECDLKSSDTLSDSETVLRDKSSTNMVPENQKNMTMVIDGT

Query:  LTEAEGCLLGSRHGEPGDLAVAEDQEVSPHSSCINGIENDYLISTQQPTSEVLMEEQKSGGSDPHIESPPLDQSISGVMSTVNAGTFLTKGRHHPASKLP
         T AE CL+G+R GEP D+A  ED  VSP SSC+  I++DYLIS +Q TSEVLMEEQKS  +DP I+ PP+DQSISGVMST        + +HHPA  L 
Subjt:  LTEAEGCLLGSRHGEPGDLAVAEDQEVSPHSSCINGIENDYLISTQQPTSEVLMEEQKSGGSDPHIESPPLDQSISGVMSTVNAGTFLTKGRHHPASKLP

Query:  LKQLIDPMVFKRV--------RSKRRIQQPTLWLPFKRSINPLAFKKV
        L QL +P VFKRV        RSKRR  +P L LPFKR INPL FKKV
Subjt:  LKQLIDPMVFKRV--------RSKRRIQQPTLWLPFKRSINPLAFKKV

A0A6J1L4M3 uncharacterized protein LOC111499856 isoform X11.1e-27270.71Show/hide
Query:  MGFFDLNIPYDTNSSSSSIRANRIKIVAKLMELGYSGIAYNRTVKGVMSDHDRCSIPLLNVSSLQSVLPSFSAAVELHRDLLGVPRSSPFRQYTRLTVSI
        MGFFDLNIPYD +SSSSSIR NRIKIVAKLMELGYSGIAYNRT+KGVMSD DRC+IPLLNVSSLQS+LP+FSA++E HR+LLGVPRSSPFRQYTRLT+ I
Subjt:  MGFFDLNIPYDTNSSSSSIRANRIKIVAKLMELGYSGIAYNRTVKGVMSDHDRCSIPLLNVSSLQSVLPSFSAAVELHRDLLGVPRSSPFRQYTRLTVSI

Query:  NSLQEVLAVNSGNLLLKTYDLIAVKPLSQHAFDQACDKLEIDIISIDFAEKLPFRLKQGQIKSAIQRGVYFEIMYSGLLSDVHERRQMISTTKLLVDWTK
        NSLQE+LAVNSGNL+LKTYDLIAVKPL+Q+AF+QAC+KLEIDII+IDFAEKLPFRLKQG IKSAIQRGVYFEIMYS LLSDVH RRQMISTTK+LVDWTK
Subjt:  NSLQEVLAVNSGNLLLKTYDLIAVKPLSQHAFDQACDKLEIDIISIDFAEKLPFRLKQGQIKSAIQRGVYFEIMYSGLLSDVHERRQMISTTKLLVDWTK

Query:  GKNLILSSAASSVNEIRGPYDVANLSSLFGVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVERISSDEKLDLNDPWSVDLFKWDPISSGEGDLLLDD
        GKNLILSSAASSVNEIRGP+DVANLSSL GVSMERAK AVSKNCRNLIANALKRKQFYKETIRVERI+SD+KLD NDPWSVDLFKWDPISSGEGDLLLDD
Subjt:  GKNLILSSAASSVNEIRGPYDVANLSSLFGVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVERISSDEKLDLNDPWSVDLFKWDPISSGEGDLLLDD

Query:  IAKSFATSNTKSKNVKAIDFTSVIDSMPPQGFLVKNAMSGSEAKLSLNDNRGLSHVTDGIEPPVSVNGVIQQSHP-DEEHFSTSDRQCSVIESYQILYSH
        IAKSFA +N KSK VKAIDFTSVID+MPPQGFL+KN ++GSEAK+SLNDNRGLS V D IEPP++VNGVIQQS P D +H STSDRQCSVIE+ +I  SH
Subjt:  IAKSFATSNTKSKNVKAIDFTSVIDSMPPQGFLVKNAMSGSEAKLSLNDNRGLSHVTDGIEPPVSVNGVIQQSHP-DEEHFSTSDRQCSVIESYQILYSH

Query:  GDAGKVSSNSEEGKSTVEETVQLKTSMQEEPTEMDIDNVQHQSPLPSSELNVISTNELVHSPTSTRAALAVVSGNGGKETFEVSENVASHQNEYGLKSSN
         DAGKV SNS+E K TVEE VQ K S+QEEP EMDIDNVQ Q+PLPSSELNV+STN  VH PTSTR  L VVS N GKETF +SE+VASH++EY      
Subjt:  GDAGKVSSNSEEGKSTVEETVQLKTSMQEEPTEMDIDNVQHQSPLPSSELNVISTNELVHSPTSTRAALAVVSGNGGKETFEVSENVASHQNEYGLKSSN

Query:  ALSGSENVQLQSPLPSSELNVVSVNELVHPASTRDVLANVSGNDGEESIKMFEDADYHQNECDLKSSDTLSDSETVLRDKSSTNMVPENQKNMTMVIDGT
                                                                     C LKSSDTLS  E+VLRDKSSTN+V ENQ NMT V+DG+
Subjt:  ALSGSENVQLQSPLPSSELNVVSVNELVHPASTRDVLANVSGNDGEESIKMFEDADYHQNECDLKSSDTLSDSETVLRDKSSTNMVPENQKNMTMVIDGT

Query:  LTEAEGCLLGSRHGEPGDLAVAEDQEVSPHSSCINGIENDYLISTQQPTSEVLMEEQKSGGSDPHIESPPLDQSISGVMSTVNAGTFLTKGRHHPASKLP
         T AE CL+G+R GEP D+A  ED  VSP SSC+  I++DYLIS +Q TSEVLMEEQKS  +DP I+ PP+DQSISGVMST        + +HHPA  L 
Subjt:  LTEAEGCLLGSRHGEPGDLAVAEDQEVSPHSSCINGIENDYLISTQQPTSEVLMEEQKSGGSDPHIESPPLDQSISGVMSTVNAGTFLTKGRHHPASKLP

Query:  LKQLIDPMVFKRV--------RSKRRIQQPTLWLPFKRSINPLAFKKVRKT
        L QL +P VFKRV        RSKRR  +P L LPFKR INPL FKKVRKT
Subjt:  LKQLIDPMVFKRV--------RSKRRIQQPTLWLPFKRSINPLAFKKVRKT

SwissProt top hitse value%identityAlignment
F4JXF1 Protein GAMETOPHYTE DEFECTIVE 14.3e-10957.31Show/hide
Query:  MGFFDLNIPYDTNSSS-----SSIRANRIKIVAKLMELGYSGIAYNRTVKGVMSDHDRCSIPLLNVSSLQSVLPSFSAAVELHRDLLGVPRSSPFRQYTR
        MGFFDL+IPY+    S     +  +  R+K+  K MELGY GIA+NR++KGVMSD D C+IPLL + SL  V P  +++V  HRDLLGVPR++PFRQYTR
Subjt:  MGFFDLNIPYDTNSSS-----SSIRANRIKIVAKLMELGYSGIAYNRTVKGVMSDHDRCSIPLLNVSSLQSVLPSFSAAVELHRDLLGVPRSSPFRQYTR

Query:  LTVSINSLQEVLAVNSGNLLLKTYDLIAVKPLSQHAFDQACDKLEIDIISIDFAEKLPFRLKQGQIKSAIQRGVYFEIMYSGLLSDVHERRQMISTTKLL
        LTV + S  +  ++NSGN +LK+YD+IAV+P++Q+AFD AC+K E+D+ISIDF +K+ FRLK   +K+AIQRG+YFEI YS +L D   RRQ+IS  KLL
Subjt:  LTVSINSLQEVLAVNSGNLLLKTYDLIAVKPLSQHAFDQACDKLEIDIISIDFAEKLPFRLKQGQIKSAIQRGVYFEIMYSGLLSDVHERRQMISTTKLL

Query:  VDWTKGKNLILSSAASSVNEIRGPYDVANLSSLFGVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVERISSDEKLDLNDPWSVDLFKWDPISSGEGD
        VDWT+GKNLI+SS A SV E+RGP DV NL  L G+S ERA+AA+SKNCRN+IA  LK+K+F+KE +RVE +S+ +   L  P S D  KWD +SSGEGD
Subjt:  VDWTKGKNLILSSAASSVNEIRGPYDVANLSSLFGVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVERISSDEKLDLNDPWSVDLFKWDPISSGEGD

Query:  LLLDDIAKSF-ATSNTKSKNVKAIDFTSVIDSMPPQGFLVKNAMSGSEA
        +LLDD+AK+F AT+    K+ KAIDFTSV+D +P  GF VK+ + G+E+
Subjt:  LLLDDIAKSF-ATSNTKSKNVKAIDFTSVIDSMPPQGFLVKNAMSGSEA

O88796 Ribonuclease P protein subunit p308.9e-3036.51Show/hide
Query:  SSSSIRANRIKIVAKLMELGYSGIAYNRTVKGVMSDHDRCSIPLLNVSSLQSVLPSFSAAVELHRDLLGVPRSSPFRQYTRLTVSINSLQEVLAVNSGNL
        + S ++A R  +V     LGYS +A N  V     +  R     + VS L + LP                +S P +  TRLT+ +        + + + 
Subjt:  SSSSIRANRIKIVAKLMELGYSGIAYNRTVKGVMSDHDRCSIPLLNVSSLQSVLPSFSAAVELHRDLLGVPRSSPFRQYTRLTVSINSLQEVLAVNSGNL

Query:  LLKTYDLIAVKPLSQHAFDQACDKLEIDIISIDFAEKLPFRLKQGQIKSAIQRGVYFEIMYSGLLSDVHERRQMISTTKLLVDWTKGKNLILSSAASSVN
         ++ YD++AV P ++  F  AC  L++D++ I   EKLPF  K+  +  AI+RG+ FE++Y   + D   RR  IS    L+   KGKN+ILSSAA    
Subjt:  LLKTYDLIAVKPLSQHAFDQACDKLEIDIISIDFAEKLPFRLKQGQIKSAIQRGVYFEIMYSGLLSDVHERRQMISTTKLLVDWTKGKNLILSSAASSVN

Query:  EIRGPYDVANLSSLFGVSMERAKAAVSKNCRNLIANALKRK
        EIRGPYDVANL  LFG+S    KAAVS NCR +  +   RK
Subjt:  EIRGPYDVANLSSLFGVSMERAKAAVSKNCRNLIANALKRK

P78346 Ribonuclease P protein subunit p308.9e-3037.34Show/hide
Query:  SSSSIRANRIKIVAKLMELGYSGIAYNRTVKGVMSDHDRCSIPLLNVSSLQSVLPSFSAAVELHRDLLGVPRSSPFRQYTRLTVSINSLQEVLAVNSGNL
        + S ++A R  +V     LGYS +A N  V            P+  VS L + LP                +S P +  TRLT+ ++       + + + 
Subjt:  SSSSIRANRIKIVAKLMELGYSGIAYNRTVKGVMSDHDRCSIPLLNVSSLQSVLPSFSAAVELHRDLLGVPRSSPFRQYTRLTVSINSLQEVLAVNSGNL

Query:  LLKTYDLIAVKPLSQHAFDQACDKLEIDIISIDFAEKLPFRLKQGQIKSAIQRGVYFEIMYSGLLSDVHERRQMISTTKLLVDWTKGKNLILSSAASSVN
          + YD++AV P ++  F  AC  L++D++ I   EKLPF  K+  I  AI RG+ FE++YS  + D   RR  IS+   L+   KGKN+I+SSAA    
Subjt:  LLKTYDLIAVKPLSQHAFDQACDKLEIDIISIDFAEKLPFRLKQGQIKSAIQRGVYFEIMYSGLLSDVHERRQMISTTKLLVDWTKGKNLILSSAASSVN

Query:  EIRGPYDVANLSSLFGVSMERAKAAVSKNCRNLIANALKRK
        EIRGPYDVANL  LFG+S   AKAAVS NCR  + +   RK
Subjt:  EIRGPYDVANLSSLFGVSMERAKAAVSKNCRNLIANALKRK

Q3SZ21 Ribonuclease P protein subunit p302.8e-3136.15Show/hide
Query:  SSSSIRANRIKIVAKLMELGYSGIAYNRTVKGVMSDHDRCSIPLLNVSSLQSVLPSFSAAVELHRDLLGVPRSSPFRQYTRLTVSINSLQEVLAVNSGNL
        + S ++A R  +V     LGYS +A N  V+           P+  VS L + LP                +S P +  TRLT+ ++       + + + 
Subjt:  SSSSIRANRIKIVAKLMELGYSGIAYNRTVKGVMSDHDRCSIPLLNVSSLQSVLPSFSAAVELHRDLLGVPRSSPFRQYTRLTVSINSLQEVLAVNSGNL

Query:  LLKTYDLIAVKPLSQHAFDQACDKLEIDIISIDFAEKLPFRLKQGQIKSAIQRGVYFEIMYSGLLSDVHERRQMISTTKLLVDWTKGKNLILSSAASSVN
         ++ YD++AV P ++  F  AC  L++D++ I   EKLPF  K+  I  AI RGV FE++YS  + D   RR  IS    L+   KGKN+I+SSAA    
Subjt:  LLKTYDLIAVKPLSQHAFDQACDKLEIDIISIDFAEKLPFRLKQGQIKSAIQRGVYFEIMYSGLLSDVHERRQMISTTKLLVDWTKGKNLILSSAASSVN

Query:  EIRGPYDVANLSSLFGVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVERISSDEKLD
        EIRGPYDVANL  LFG+S   AKAAVS NCR ++ +   RK  +     V++  + E  D
Subjt:  EIRGPYDVANLSSLFGVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVERISSDEKLD

Q3ZE13 Ribonuclease P protein subunit drpp304.9e-2832.03Show/hide
Query:  MGFFDLNIPYDTNSSSSSIRANRIKIVAKL-MELGYSGIAYNRTVKGVMSDHDRCSIPLLNVSSLQSVLPSFSAAVELHRDLLGVPRSSPFRQYTRLTVS
        M ++DLNI        SS+   +IK +  L  + GY  +A   TV+G +   D C I  + +    S   + S  +++         +   +QYTRL V 
Subjt:  MGFFDLNIPYDTNSSSSSIRANRIKIVAKL-MELGYSGIAYNRTVKGVMSDHDRCSIPLLNVSSLQSVLPSFSAAVELHRDLLGVPRSSPFRQYTRLTVS

Query:  INSLQEVLAVNSGNLLLKTYDLIAVKPLSQHAFDQACDKLEIDIISIDFAEKLPFRLKQGQIKSAIQRGVYFEIMYSGLLSDVHERRQMISTTKLLVDWT
          ++ E   + + N ++++YD+I+V P     F+ AC+  EIDII+ID   K  F +K  +++  I +G++ EI+Y  L     +R         LV  +
Subjt:  INSLQEVLAVNSGNLLLKTYDLIAVKPLSQHAFDQACDKLEIDIISIDFAEKLPFRLKQGQIKSAIQRGVYFEIMYSGLLSDVHERRQMISTTKLLVDWT

Query:  KGKNLILSSAASSVNEIRGPYDVANLSSLFGVSMERAKAAVSKNCRNLIANALKRK
         GKN+ILSS+  S   +R PYD++NL  LFG++ ++AKAAVSK+    + +A+ R+
Subjt:  KGKNLILSSAASSVNEIRGPYDVANLSSLFGVSMERAKAAVSKNCRNLIANALKRK

Arabidopsis top hitse value%identityAlignment
AT5G59980.1 Polymerase/histidinol phosphatase-like3.0e-11057.31Show/hide
Query:  MGFFDLNIPYDTNSSS-----SSIRANRIKIVAKLMELGYSGIAYNRTVKGVMSDHDRCSIPLLNVSSLQSVLPSFSAAVELHRDLLGVPRSSPFRQYTR
        MGFFDL+IPY+    S     +  +  R+K+  K MELGY GIA+NR++KGVMSD D C+IPLL + SL  V P  +++V  HRDLLGVPR++PFRQYTR
Subjt:  MGFFDLNIPYDTNSSS-----SSIRANRIKIVAKLMELGYSGIAYNRTVKGVMSDHDRCSIPLLNVSSLQSVLPSFSAAVELHRDLLGVPRSSPFRQYTR

Query:  LTVSINSLQEVLAVNSGNLLLKTYDLIAVKPLSQHAFDQACDKLEIDIISIDFAEKLPFRLKQGQIKSAIQRGVYFEIMYSGLLSDVHERRQMISTTKLL
        LTV + S  +  ++NSGN +LK+YD+IAV+P++Q+AFD AC+K E+D+ISIDF +K+ FRLK   +K+AIQRG+YFEI YS +L D   RRQ+IS  KLL
Subjt:  LTVSINSLQEVLAVNSGNLLLKTYDLIAVKPLSQHAFDQACDKLEIDIISIDFAEKLPFRLKQGQIKSAIQRGVYFEIMYSGLLSDVHERRQMISTTKLL

Query:  VDWTKGKNLILSSAASSVNEIRGPYDVANLSSLFGVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVERISSDEKLDLNDPWSVDLFKWDPISSGEGD
        VDWT+GKNLI+SS A SV E+RGP DV NL  L G+S ERA+AA+SKNCRN+IA  LK+K+F+KE +RVE +S+ +   L  P S D  KWD +SSGEGD
Subjt:  VDWTKGKNLILSSAASSVNEIRGPYDVANLSSLFGVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVERISSDEKLDLNDPWSVDLFKWDPISSGEGD

Query:  LLLDDIAKSF-ATSNTKSKNVKAIDFTSVIDSMPPQGFLVKNAMSGSEA
        +LLDD+AK+F AT+    K+ KAIDFTSV+D +P  GF VK+ + G+E+
Subjt:  LLLDDIAKSF-ATSNTKSKNVKAIDFTSVIDSMPPQGFLVKNAMSGSEA

AT5G59980.2 Polymerase/histidinol phosphatase-like3.0e-11057.31Show/hide
Query:  MGFFDLNIPYDTNSSS-----SSIRANRIKIVAKLMELGYSGIAYNRTVKGVMSDHDRCSIPLLNVSSLQSVLPSFSAAVELHRDLLGVPRSSPFRQYTR
        MGFFDL+IPY+    S     +  +  R+K+  K MELGY GIA+NR++KGVMSD D C+IPLL + SL  V P  +++V  HRDLLGVPR++PFRQYTR
Subjt:  MGFFDLNIPYDTNSSS-----SSIRANRIKIVAKLMELGYSGIAYNRTVKGVMSDHDRCSIPLLNVSSLQSVLPSFSAAVELHRDLLGVPRSSPFRQYTR

Query:  LTVSINSLQEVLAVNSGNLLLKTYDLIAVKPLSQHAFDQACDKLEIDIISIDFAEKLPFRLKQGQIKSAIQRGVYFEIMYSGLLSDVHERRQMISTTKLL
        LTV + S  +  ++NSGN +LK+YD+IAV+P++Q+AFD AC+K E+D+ISIDF +K+ FRLK   +K+AIQRG+YFEI YS +L D   RRQ+IS  KLL
Subjt:  LTVSINSLQEVLAVNSGNLLLKTYDLIAVKPLSQHAFDQACDKLEIDIISIDFAEKLPFRLKQGQIKSAIQRGVYFEIMYSGLLSDVHERRQMISTTKLL

Query:  VDWTKGKNLILSSAASSVNEIRGPYDVANLSSLFGVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVERISSDEKLDLNDPWSVDLFKWDPISSGEGD
        VDWT+GKNLI+SS A SV E+RGP DV NL  L G+S ERA+AA+SKNCRN+IA  LK+K+F+KE +RVE +S+ +   L  P S D  KWD +SSGEGD
Subjt:  VDWTKGKNLILSSAASSVNEIRGPYDVANLSSLFGVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVERISSDEKLDLNDPWSVDLFKWDPISSGEGD

Query:  LLLDDIAKSF-ATSNTKSKNVKAIDFTSVIDSMPPQGFLVKNAMSGSEA
        +LLDD+AK+F AT+    K+ KAIDFTSV+D +P  GF VK+ + G+E+
Subjt:  LLLDDIAKSF-ATSNTKSKNVKAIDFTSVIDSMPPQGFLVKNAMSGSEA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTTTCTTCGACCTCAATATCCCTTACGACACGAATTCATCTTCATCTTCAATTAGGGCTAATCGCATCAAGATCGTGGCCAAACTCATGGAGCTCGGCTACTCCGG
AATCGCCTACAACCGTACCGTCAAAGGAGTCATGTCCGATCACGACCGCTGTTCCATTCCCCTCCTCAACGTCTCCTCGCTTCAATCTGTCCTTCCGTCCTTCTCCGCCG
CCGTGGAGCTCCACCGCGACCTTCTCGGCGTCCCGCGCTCATCTCCTTTCCGCCAGTACACGCGCCTTACTGTCTCTATCAATAGCTTGCAGGAGGTTCTGGCTGTCAAT
TCTGGTAACCTCCTTCTGAAGACCTACGATTTGATTGCTGTTAAGCCTCTTAGTCAGCACGCGTTTGATCAGGCTTGTGACAAACTGGAGATAGATATAATTTCCATTGA
TTTTGCGGAGAAACTTCCTTTCAGGTTGAAGCAAGGTCAGATTAAATCTGCAATTCAGCGTGGAGTTTACTTCGAAATTATGTATTCTGGTCTTCTTTCGGACGTTCATG
AAAGGAGGCAAATGATATCCACTACTAAGCTATTGGTGGATTGGACAAAGGGAAAGAATCTCATATTATCCAGTGCTGCCTCCTCAGTAAATGAAATCAGAGGACCTTAT
GATGTTGCGAACTTGTCATCATTGTTTGGTGTCTCTATGGAACGAGCAAAAGCTGCTGTTTCAAAGAACTGTAGGAATCTTATAGCTAATGCTCTAAAGAGAAAGCAGTT
CTACAAGGAGACCATTCGAGTTGAAAGGATATCATCAGATGAGAAATTAGATCTGAACGACCCTTGGTCAGTGGATTTGTTCAAATGGGATCCAATATCAAGTGGGGAAG
GGGATTTGCTTTTGGATGATATAGCAAAATCTTTTGCTACCTCTAATACAAAATCAAAAAATGTGAAAGCCATTGATTTTACTTCAGTCATTGACAGCATGCCACCACAA
GGTTTTTTAGTCAAGAATGCAATGTCAGGCTCGGAGGCTAAATTGTCGTTGAATGATAACAGAGGCTTGTCGCATGTTACTGACGGCATTGAGCCACCGGTTTCAGTAAA
TGGAGTAATTCAACAGTCCCATCCTGATGAAGAACACTTTTCTACATCTGATCGTCAGTGTTCTGTAATTGAAAGTTATCAAATTTTATATTCACACGGTGATGCAGGAA
AAGTTTCAAGTAATTCTGAGGAAGGCAAAAGCACCGTTGAAGAAACTGTGCAGCTCAAGACCTCAATGCAGGAAGAACCTACTGAAATGGACATTGATAATGTGCAACAT
CAAAGCCCTCTCCCCAGTAGCGAGTTAAATGTAATATCGACAAATGAGCTTGTGCATTCTCCTACATCTACCAGAGCTGCATTAGCCGTTGTTTCCGGTAATGGCGGTAA
AGAAACTTTTGAAGTGTCAGAGAATGTTGCGTCACATCAGAATGAGTATGGTTTGAAAAGTTCTAATGCATTGTCAGGTTCAGAAAATGTGCAACTGCAAAGCCCGCTAC
CCAGCAGCGAGTTGAATGTAGTATCAGTAAATGAGCTTGTGCATCCTGCATCTACCAGAGATGTATTAGCTAATGTTTCTGGGAATGATGGAGAAGAAAGTATTAAAATG
TTTGAGGATGCTGATTATCATCAGAATGAGTGTGACTTAAAAAGTTCTGATACATTATCTGATTCAGAAACTGTGTTGAGGGACAAAAGCTCAACTAATATGGTTCCAGA
AAACCAAAAGAATATGACTATGGTAATAGATGGTACATTGACTGAAGCTGAAGGATGTTTACTCGGGTCAAGACATGGGGAGCCTGGGGATTTGGCAGTAGCCGAGGATC
AGGAGGTTTCTCCTCATAGTTCTTGTATTAATGGTATTGAAAATGATTATTTGATCTCAACTCAACAACCAACATCTGAGGTGTTAATGGAGGAGCAAAAAAGTGGAGGA
TCTGATCCTCATATCGAATCACCTCCTTTAGATCAGTCTATATCTGGTGTAATGTCAACAGTAAATGCAGGGACATTCCTAACAAAAGGGAGACATCATCCAGCATCTAA
GCTTCCGCTCAAGCAGTTAATCGATCCTATGGTTTTCAAGAGAGTCCGATCAAAGCGGAGGATACAACAGCCAACGTTATGGCTTCCATTCAAGCGCTCAATCAATCCTC
TGGCCTTCAAGAAGGTTCGCAAAACATAA
mRNA sequenceShow/hide mRNA sequence
ATGGGTTTCTTCGACCTCAATATCCCTTACGACACGAATTCATCTTCATCTTCAATTAGGGCTAATCGCATCAAGATCGTGGCCAAACTCATGGAGCTCGGCTACTCCGG
AATCGCCTACAACCGTACCGTCAAAGGAGTCATGTCCGATCACGACCGCTGTTCCATTCCCCTCCTCAACGTCTCCTCGCTTCAATCTGTCCTTCCGTCCTTCTCCGCCG
CCGTGGAGCTCCACCGCGACCTTCTCGGCGTCCCGCGCTCATCTCCTTTCCGCCAGTACACGCGCCTTACTGTCTCTATCAATAGCTTGCAGGAGGTTCTGGCTGTCAAT
TCTGGTAACCTCCTTCTGAAGACCTACGATTTGATTGCTGTTAAGCCTCTTAGTCAGCACGCGTTTGATCAGGCTTGTGACAAACTGGAGATAGATATAATTTCCATTGA
TTTTGCGGAGAAACTTCCTTTCAGGTTGAAGCAAGGTCAGATTAAATCTGCAATTCAGCGTGGAGTTTACTTCGAAATTATGTATTCTGGTCTTCTTTCGGACGTTCATG
AAAGGAGGCAAATGATATCCACTACTAAGCTATTGGTGGATTGGACAAAGGGAAAGAATCTCATATTATCCAGTGCTGCCTCCTCAGTAAATGAAATCAGAGGACCTTAT
GATGTTGCGAACTTGTCATCATTGTTTGGTGTCTCTATGGAACGAGCAAAAGCTGCTGTTTCAAAGAACTGTAGGAATCTTATAGCTAATGCTCTAAAGAGAAAGCAGTT
CTACAAGGAGACCATTCGAGTTGAAAGGATATCATCAGATGAGAAATTAGATCTGAACGACCCTTGGTCAGTGGATTTGTTCAAATGGGATCCAATATCAAGTGGGGAAG
GGGATTTGCTTTTGGATGATATAGCAAAATCTTTTGCTACCTCTAATACAAAATCAAAAAATGTGAAAGCCATTGATTTTACTTCAGTCATTGACAGCATGCCACCACAA
GGTTTTTTAGTCAAGAATGCAATGTCAGGCTCGGAGGCTAAATTGTCGTTGAATGATAACAGAGGCTTGTCGCATGTTACTGACGGCATTGAGCCACCGGTTTCAGTAAA
TGGAGTAATTCAACAGTCCCATCCTGATGAAGAACACTTTTCTACATCTGATCGTCAGTGTTCTGTAATTGAAAGTTATCAAATTTTATATTCACACGGTGATGCAGGAA
AAGTTTCAAGTAATTCTGAGGAAGGCAAAAGCACCGTTGAAGAAACTGTGCAGCTCAAGACCTCAATGCAGGAAGAACCTACTGAAATGGACATTGATAATGTGCAACAT
CAAAGCCCTCTCCCCAGTAGCGAGTTAAATGTAATATCGACAAATGAGCTTGTGCATTCTCCTACATCTACCAGAGCTGCATTAGCCGTTGTTTCCGGTAATGGCGGTAA
AGAAACTTTTGAAGTGTCAGAGAATGTTGCGTCACATCAGAATGAGTATGGTTTGAAAAGTTCTAATGCATTGTCAGGTTCAGAAAATGTGCAACTGCAAAGCCCGCTAC
CCAGCAGCGAGTTGAATGTAGTATCAGTAAATGAGCTTGTGCATCCTGCATCTACCAGAGATGTATTAGCTAATGTTTCTGGGAATGATGGAGAAGAAAGTATTAAAATG
TTTGAGGATGCTGATTATCATCAGAATGAGTGTGACTTAAAAAGTTCTGATACATTATCTGATTCAGAAACTGTGTTGAGGGACAAAAGCTCAACTAATATGGTTCCAGA
AAACCAAAAGAATATGACTATGGTAATAGATGGTACATTGACTGAAGCTGAAGGATGTTTACTCGGGTCAAGACATGGGGAGCCTGGGGATTTGGCAGTAGCCGAGGATC
AGGAGGTTTCTCCTCATAGTTCTTGTATTAATGGTATTGAAAATGATTATTTGATCTCAACTCAACAACCAACATCTGAGGTGTTAATGGAGGAGCAAAAAAGTGGAGGA
TCTGATCCTCATATCGAATCACCTCCTTTAGATCAGTCTATATCTGGTGTAATGTCAACAGTAAATGCAGGGACATTCCTAACAAAAGGGAGACATCATCCAGCATCTAA
GCTTCCGCTCAAGCAGTTAATCGATCCTATGGTTTTCAAGAGAGTCCGATCAAAGCGGAGGATACAACAGCCAACGTTATGGCTTCCATTCAAGCGCTCAATCAATCCTC
TGGCCTTCAAGAAGGTTCGCAAAACATAA
Protein sequenceShow/hide protein sequence
MGFFDLNIPYDTNSSSSSIRANRIKIVAKLMELGYSGIAYNRTVKGVMSDHDRCSIPLLNVSSLQSVLPSFSAAVELHRDLLGVPRSSPFRQYTRLTVSINSLQEVLAVN
SGNLLLKTYDLIAVKPLSQHAFDQACDKLEIDIISIDFAEKLPFRLKQGQIKSAIQRGVYFEIMYSGLLSDVHERRQMISTTKLLVDWTKGKNLILSSAASSVNEIRGPY
DVANLSSLFGVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVERISSDEKLDLNDPWSVDLFKWDPISSGEGDLLLDDIAKSFATSNTKSKNVKAIDFTSVIDSMPPQ
GFLVKNAMSGSEAKLSLNDNRGLSHVTDGIEPPVSVNGVIQQSHPDEEHFSTSDRQCSVIESYQILYSHGDAGKVSSNSEEGKSTVEETVQLKTSMQEEPTEMDIDNVQH
QSPLPSSELNVISTNELVHSPTSTRAALAVVSGNGGKETFEVSENVASHQNEYGLKSSNALSGSENVQLQSPLPSSELNVVSVNELVHPASTRDVLANVSGNDGEESIKM
FEDADYHQNECDLKSSDTLSDSETVLRDKSSTNMVPENQKNMTMVIDGTLTEAEGCLLGSRHGEPGDLAVAEDQEVSPHSSCINGIENDYLISTQQPTSEVLMEEQKSGG
SDPHIESPPLDQSISGVMSTVNAGTFLTKGRHHPASKLPLKQLIDPMVFKRVRSKRRIQQPTLWLPFKRSINPLAFKKVRKT