; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg001517 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg001517
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionPolymerase/histidinol phosphatase-like, putative isoform 1
Genome locationscaffold8:44823394..44826965
RNA-Seq ExpressionSpg001517
SyntenySpg001517
Gene Ontology termsGO:0008033 - tRNA processing (biological process)
GO:0032501 - multicellular organismal process (biological process)
GO:0048856 - anatomical structure development (biological process)
GO:0090502 - RNA phosphodiester bond hydrolysis, endonucleolytic (biological process)
GO:0005655 - nucleolar ribonuclease P complex (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR002738 - RNase P subunit p30
IPR016195 - Polymerase/histidinol phosphatase-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6605499.1 Protein GAMETOPHYTE DEFECTIVE 1, partial [Cucurbita argyrosperma subsp. sororia]4.3e-28772.43Show/hide
Query:  MGFFDLNIPYDEHSSSSSSSSSIRANRIKIVARLMELGYSGIAYNRTIKGVMSDQDRCTIPLLNVTSLHSILPSFSASVEFHRDLLGVPKSSPFRQYTRL
        MGFFDLNIPYDEH    SSSSSIR NRIKIVA+LMELGYSGIAYNRTIKGVMSD+DRCTIPLLNV+SL SILP+FSAS+EFHR+LLGVP+SSPFRQYTRL
Subjt:  MGFFDLNIPYDEHSSSSSSSSSIRANRIKIVARLMELGYSGIAYNRTIKGVMSDQDRCTIPLLNVTSLHSILPSFSASVEFHRDLLGVPKSSPFRQYTRL

Query:  TICINSQQEVMAVNSGNLILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFAEKRPFRLKQGQIKSAIQRGVYFEIMYSDLLMDVHERRQMISTSKLLV
        TICINS QE++AVNSGNLILKTYDLIAVKPLNQ+AF+QACE LEIDIIAIDFAEK PFRLKQG IKSAIQRGVYFEIMYSDLL DVH RRQMIS +K+LV
Subjt:  TICINSQQEVMAVNSGNLILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFAEKRPFRLKQGQIKSAIQRGVYFEIMYSDLLMDVHERRQMISTSKLLV

Query:  DWTKGKNLILSSAAPTVNEIRGPYDVANLSSLLGVSMERGKAAVSKNCRNLIANALKRKQFYKETIRVEKISSDDKLDLNDPWSVDLFKYDPISSGEGDL
        DWTKGKNLILSSAA +VNEIRGP+DVANLSSLLGVSMER KAAVSKNCRNLIANALKRKQFYKETIRVE+ISSDDKLDLNDPWSVDLFK+DPISSGEGDL
Subjt:  DWTKGKNLILSSAAPTVNEIRGPYDVANLSSLLGVSMERGKAAVSKNCRNLIANALKRKQFYKETIRVEKISSDDKLDLNDPWSVDLFKYDPISSGEGDL

Query:  LLDDIAKSFAASNNKSKNVKAIDFTSVIDNMPPQGFLVKNVISGSEPKLSLNDDRDLSHAADVIEPPIGLN--IEQPHTFDGEHCSTSDRQCSVIESFEI
        LLDDIAKSFAAS NKSK VKAIDFTSVIDNMPPQGFLVKNV SGSE K+SLND+R  SH  D IEPPI +N  I+Q    DG+HCSTSDRQCS       
Subjt:  LLDDIAKSFAASNNKSKNVKAIDFTSVIDNMPPQGFLVKNVISGSEPKLSLNDDRDLSHAADVIEPPIGLN--IEQPHTFDGEHCSTSDRQCSVIESFEI

Query:  LHSHGNAGRILSNSEEDKSTVEEILQPKTSMQEEIVEMDIDNVQPQNLVPSIELNVVSTNELVHSPTSTGDVLAVVFGNDGKETFKMSDDVESHQNEYGL
            G+AG + SNS+EDK TVEEI+QPK SMQE+ + MDIDNVQPQ                                                      
Subjt:  LHSHGNAGRILSNSEEDKSTVEEILQPKTSMQEEIVEMDIDNVQPQNLVPSIELNVVSTNELVHSPTSTGDVLAVVFGNDGKETFKMSDDVESHQNEYGL

Query:  KSSDTLSGSENELRENVLTRNPLPSSDLNVVSTNELVHSPTSTKDVLAVVCGNDGKETFKMLEDVDSHQNEYVLQSSDALSGLENVLRDKSSTNLVSENE
                            NPLPSS+LNV STN  VH PTST+DVL VV  NDGKETF M EDV SH++EY L+SSD LSGLE+VLRDKSSTNLVSEN 
Subjt:  KSSDTLSGSENELRENVLTRNPLPSSDLNVVSTNELVHSPTSTKDVLAVVCGNDGKETFKMLEDVDSHQNEYVLQSSDALSGLENVLRDKSSTNLVSENE

Query:  KSVTMVIDGSFTAEECLHGARLGEPGDVAVDEDQVSPLSSCMSDIKDDYLISIRQQPSEVLMEEQKSEEADRQITPPPLDQSISDVISTGKHGAKRRRHR
        K++T V+DGSFTAEECL GARLGEP DVA  ED+VSPLSSCM DIKDDYLISIRQQ SEVLMEEQKS EAD QI PPPLDQSIS V+STG   AK++RH+
Subjt:  KSVTMVIDGSFTAEECLHGARLGEPGDVAVDEDQVSPLSSCMSDIKDDYLISIRQQPSEVLMEEQKSEEADRQITPPPLDQSISDVISTGKHGAKRRRHR

Query:  PSLKLPLKRLINPLAFKKVRETNAEKYRSKHRRHHSALLLPFKRLISPLAFKKVRKTK
            L L RL NP  FK+V+ET +EKYRSK RRH  ALLLPFKRLI+PL FKKVRKTK
Subjt:  PSLKLPLKRLINPLAFKKVRETNAEKYRSKHRRHHSALLLPFKRLISPLAFKKVRKTK

KAG7035438.1 Ribonuclease P protein subunit p30, partial [Cucurbita argyrosperma subsp. argyrosperma]2.1e-28672.3Show/hide
Query:  MGFFDLNIPYDEHSSSSSSSSSIRANRIKIVARLMELGYSGIAYNRTIKGVMSDQDRCTIPLLNVTSLHSILPSFSASVEFHRDLLGVPKSSPFRQYTRL
        MGFFDLNIPYDEH    SSSSSIR NRIKIVA+LMELGYSGIAYNRTIKGVMSD+DRCTIPLLNV+SL SILP+FSAS+EFHR+LLGVP+SSPFRQYTRL
Subjt:  MGFFDLNIPYDEHSSSSSSSSSIRANRIKIVARLMELGYSGIAYNRTIKGVMSDQDRCTIPLLNVTSLHSILPSFSASVEFHRDLLGVPKSSPFRQYTRL

Query:  TICINSQQEVMAVNSGNLILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFAEKRPFRLKQGQIKSAIQRGVYFEIMYSDLLMDVHERRQMISTSKLLV
        TICINS QE++AVNSGNLILKTYDLIAVKPLNQ+AF+QACE LEIDIIAIDFAEK PFRLKQG IKSAIQRGVYFEIMYSDLL DVH RRQMIS +K+LV
Subjt:  TICINSQQEVMAVNSGNLILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFAEKRPFRLKQGQIKSAIQRGVYFEIMYSDLLMDVHERRQMISTSKLLV

Query:  DWTKGKNLILSSAAPTVNEIRGPYDVANLSSLLGVSMERGKAAVSKNCRNLIANALKRKQFYKETIRVEKISSDDKLDLNDPWSVDLFKYDPISSGEGDL
        DWTKGKNLILSSAA +VNEIRGP+DVAN SSLLGVSMER KAAVSKNCRNLIANALKRKQFYKETIRVE+ISSDDKLDLNDPWSVDLFK+DPISSGEGDL
Subjt:  DWTKGKNLILSSAAPTVNEIRGPYDVANLSSLLGVSMERGKAAVSKNCRNLIANALKRKQFYKETIRVEKISSDDKLDLNDPWSVDLFKYDPISSGEGDL

Query:  LLDDIAKSFAASNNKSKNVKAIDFTSVIDNMPPQGFLVKNVISGSEPKLSLNDDRDLSHAADVIEPPIGLN--IEQPHTFDGEHCSTSDRQCSVIESFEI
        LLDDIAKSFAAS NKSK VKAIDFTSVIDNMPPQGFLVKNV SGSE K+SLND+R  SH  D IEPPI +N  I+Q    DG+HCSTSDR CS       
Subjt:  LLDDIAKSFAASNNKSKNVKAIDFTSVIDNMPPQGFLVKNVISGSEPKLSLNDDRDLSHAADVIEPPIGLN--IEQPHTFDGEHCSTSDRQCSVIESFEI

Query:  LHSHGNAGRILSNSEEDKSTVEEILQPKTSMQEEIVEMDIDNVQPQNLVPSIELNVVSTNELVHSPTSTGDVLAVVFGNDGKETFKMSDDVESHQNEYGL
            G+AG + SNS+EDK TVEEI+QPK SMQE+ + MDIDNVQPQ                                                      
Subjt:  LHSHGNAGRILSNSEEDKSTVEEILQPKTSMQEEIVEMDIDNVQPQNLVPSIELNVVSTNELVHSPTSTGDVLAVVFGNDGKETFKMSDDVESHQNEYGL

Query:  KSSDTLSGSENELRENVLTRNPLPSSDLNVVSTNELVHSPTSTKDVLAVVCGNDGKETFKMLEDVDSHQNEYVLQSSDALSGLENVLRDKSSTNLVSENE
                            NPLPSS+LNV STN  VH PTST+DVL VV  NDGKETF M EDV SH++EY L+SSD LSGLE+VLRDKSSTNLVSEN 
Subjt:  KSSDTLSGSENELRENVLTRNPLPSSDLNVVSTNELVHSPTSTKDVLAVVCGNDGKETFKMLEDVDSHQNEYVLQSSDALSGLENVLRDKSSTNLVSENE

Query:  KSVTMVIDGSFTAEECLHGARLGEPGDVAVDEDQVSPLSSCMSDIKDDYLISIRQQPSEVLMEEQKSEEADRQITPPPLDQSISDVISTGKHGAKRRRHR
        K++T V+DGSFTAEECL GARLGEP DVA  ED+VSPLSSCM DIKDDYLISIRQQ SEVLMEEQKS EAD QI PPPLDQSIS V+STG   AKR+RH+
Subjt:  KSVTMVIDGSFTAEECLHGARLGEPGDVAVDEDQVSPLSSCMSDIKDDYLISIRQQPSEVLMEEQKSEEADRQITPPPLDQSISDVISTGKHGAKRRRHR

Query:  PSLKLPLKRLINPLAFKKVRETNAEKYRSKHRRHHSALLLPFKRLISPLAFKKVRKTK
            L L RL NP  FK+V+ET +EKYRSK RRH  ALLLPFKRLI+PL FKKVRKTK
Subjt:  PSLKLPLKRLINPLAFKKVRETNAEKYRSKHRRHHSALLLPFKRLISPLAFKKVRKTK

XP_023007324.1 uncharacterized protein LOC111499856 isoform X1 [Cucurbita maxima]4.9e-29172.69Show/hide
Query:  MGFFDLNIPYDEHSSSSSSSSSIRANRIKIVARLMELGYSGIAYNRTIKGVMSDQDRCTIPLLNVTSLHSILPSFSASVEFHRDLLGVPKSSPFRQYTRL
        MGFFDLNIPYDEH    SSSSSIR NRIKIVA+LMELGYSGIAYNRTIKGVMSD+DRCTIPLLNV+SL SILP+FSAS+EFHR+LLGVP+SSPFRQYTRL
Subjt:  MGFFDLNIPYDEHSSSSSSSSSIRANRIKIVARLMELGYSGIAYNRTIKGVMSDQDRCTIPLLNVTSLHSILPSFSASVEFHRDLLGVPKSSPFRQYTRL

Query:  TICINSQQEVMAVNSGNLILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFAEKRPFRLKQGQIKSAIQRGVYFEIMYSDLLMDVHERRQMISTSKLLV
        TICINS QE++AVNSGNLILKTYDLIAVKPLNQ+AF+QACE LEIDIIAIDFAEK PFRLKQG IKSAIQRGVYFEIMYSDLL DVH RRQMIST+K+LV
Subjt:  TICINSQQEVMAVNSGNLILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFAEKRPFRLKQGQIKSAIQRGVYFEIMYSDLLMDVHERRQMISTSKLLV

Query:  DWTKGKNLILSSAAPTVNEIRGPYDVANLSSLLGVSMERGKAAVSKNCRNLIANALKRKQFYKETIRVEKISSDDKLDLNDPWSVDLFKYDPISSGEGDL
        DWTKGKNLILSSAA +VNEIRGP+DVANLSSLLGVSMER K AVSKNCRNLIANALKRKQFYKETIRVE+I+SDDKLD NDPWSVDLFK+DPISSGEGDL
Subjt:  DWTKGKNLILSSAAPTVNEIRGPYDVANLSSLLGVSMERGKAAVSKNCRNLIANALKRKQFYKETIRVEKISSDDKLDLNDPWSVDLFKYDPISSGEGDL

Query:  LLDDIAKSFAASNNKSKNVKAIDFTSVIDNMPPQGFLVKNVISGSEPKLSLNDDRDLSHAADVIEPPIGLN--IEQPHTFDGEHCSTSDRQCSVIESFEI
        LLDDIAKSFAA+ NKSK VKAIDFTSVIDNMPPQGFL+KNVI+GSE K+SLND+R LS   D IEPPI +N  I+Q    DG+HCSTSDRQCSVIE+ +I
Subjt:  LLDDIAKSFAASNNKSKNVKAIDFTSVIDNMPPQGFLVKNVISGSEPKLSLNDDRDLSHAADVIEPPIGLN--IEQPHTFDGEHCSTSDRQCSVIESFEI

Query:  LHSHGNAGRILSNSEEDKSTVEEILQPKTSMQEEIVEMDIDNVQPQNLVPSIELNVVSTNELVHSPTSTGDVLAVVFGNDGKETFKMSDDVESHQNEYGL
          SH +AG++ SNS+EDK TVEEI+QPK S+QEE +EMDIDNVQPQ                                                      
Subjt:  LHSHGNAGRILSNSEEDKSTVEEILQPKTSMQEEIVEMDIDNVQPQNLVPSIELNVVSTNELVHSPTSTGDVLAVVFGNDGKETFKMSDDVESHQNEYGL

Query:  KSSDTLSGSENELRENVLTRNPLPSSDLNVVSTNELVHSPTSTKDVLAVVCGNDGKETFKMLEDVDSHQNEYVLQSSDALSGLENVLRDKSSTNLVSENE
                            NPLPSS+LNVVSTN  VH PTST+DVL VV  NDGKETF M EDV SH++EY L+SSD LSGLE+VLRDKSSTNLVSEN+
Subjt:  KSSDTLSGSENELRENVLTRNPLPSSDLNVVSTNELVHSPTSTKDVLAVVCGNDGKETFKMLEDVDSHQNEYVLQSSDALSGLENVLRDKSSTNLVSENE

Query:  KSVTMVIDGSFTAEECLHGARLGEPGDVAVDEDQVSPLSSCMSDIKDDYLISIRQQPSEVLMEEQKSEEADRQITPPPLDQSISDVISTGKHGAKRRRHR
         ++T V+DGSFTAEECL GARLGEP DVA  ED VSPLSSCM DIKDDYLISIRQQ SEVLMEEQKSEEAD QI PPP+DQSIS V+STG   AKR+RH+
Subjt:  KSVTMVIDGSFTAEECLHGARLGEPGDVAVDEDQVSPLSSCMSDIKDDYLISIRQQPSEVLMEEQKSEEADRQITPPPLDQSISDVISTGKHGAKRRRHR

Query:  PSLKLPLKRLINPLAFKKVRETNAEKYRSKHRRHHSALLLPFKRLISPLAFKKVRKTK
            L L +L NP  FK+V+ET +EKYRSK RRH  ALLLPFKRLI+PL FKKVRKTK
Subjt:  PSLKLPLKRLINPLAFKKVRETNAEKYRSKHRRHHSALLLPFKRLISPLAFKKVRKTK

XP_023007325.1 uncharacterized protein LOC111499856 isoform X2 [Cucurbita maxima]1.0e-28872.55Show/hide
Query:  MGFFDLNIPYDEHSSSSSSSSSIRANRIKIVARLMELGYSGIAYNRTIKGVMSDQDRCTIPLLNVTSLHSILPSFSASVEFHRDLLGVPKSSPFRQYTRL
        MGFFDLNIPYDEH    SSSSSIR NRIKIVA+LMELGYSGIAYNRTIKGVMSD+DRCTIPLLNV+SL SILP+FSAS+EFHR+LLGVP+SSPFRQYTRL
Subjt:  MGFFDLNIPYDEHSSSSSSSSSIRANRIKIVARLMELGYSGIAYNRTIKGVMSDQDRCTIPLLNVTSLHSILPSFSASVEFHRDLLGVPKSSPFRQYTRL

Query:  TICINSQQEVMAVNSGNLILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFAEKRPFRLKQGQIKSAIQRGVYFEIMYSDLLMDVHERRQMISTSKLLV
        TICINS QE++AVNSGNLILKTYDLIAVKPLNQ+AF+QACE LEIDIIAIDFAEK PFRLKQG IKSAIQRGVYFEIMYSDLL DVH RRQMIST+K+LV
Subjt:  TICINSQQEVMAVNSGNLILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFAEKRPFRLKQGQIKSAIQRGVYFEIMYSDLLMDVHERRQMISTSKLLV

Query:  DWTKGKNLILSSAAPTVNEIRGPYDVANLSSLLGVSMERGKAAVSKNCRNLIANALKRKQFYKETIRVEKISSDDKLDLNDPWSVDLFKYDPISSGEGDL
        DWTKGKNLILSSAA +VNEIRGP+DVANLSSLLGVSMER K AVSKNCRNLIANALKRKQFYKETIRVE+I+SDDKLD NDPWSVDLFK+DPISSGEGDL
Subjt:  DWTKGKNLILSSAAPTVNEIRGPYDVANLSSLLGVSMERGKAAVSKNCRNLIANALKRKQFYKETIRVEKISSDDKLDLNDPWSVDLFKYDPISSGEGDL

Query:  LLDDIAKSFAASNNKSKNVKAIDFTSVIDNMPPQGFLVKNVISGSEPKLSLNDDRDLSHAADVIEPPIGLN--IEQPHTFDGEHCSTSDRQCSVIESFEI
        LLDDIAKSFAA+ NKSK VKAIDFTSVIDNMPPQGFL+KNVI+GSE K+SLND+R LS   D IEPPI +N  I+Q    DG+HCSTSDRQCSVIE+ +I
Subjt:  LLDDIAKSFAASNNKSKNVKAIDFTSVIDNMPPQGFLVKNVISGSEPKLSLNDDRDLSHAADVIEPPIGLN--IEQPHTFDGEHCSTSDRQCSVIESFEI

Query:  LHSHGNAGRILSNSEEDKSTVEEILQPKTSMQEEIVEMDIDNVQPQNLVPSIELNVVSTNELVHSPTSTGDVLAVVFGNDGKETFKMSDDVESHQNEYGL
          SH +AG++ SNS+EDK TVEEI+QPK S+QEE +EMDIDNVQPQ                                                      
Subjt:  LHSHGNAGRILSNSEEDKSTVEEILQPKTSMQEEIVEMDIDNVQPQNLVPSIELNVVSTNELVHSPTSTGDVLAVVFGNDGKETFKMSDDVESHQNEYGL

Query:  KSSDTLSGSENELRENVLTRNPLPSSDLNVVSTNELVHSPTSTKDVLAVVCGNDGKETFKMLEDVDSHQNEYVLQSSDALSGLENVLRDKSSTNLVSENE
                            NPLPSS+LNVVSTN  VH PTST+DVL VV  NDGKETF M EDV SH++EY L+SSD LSGLE+VLRDKSSTNLVSEN+
Subjt:  KSSDTLSGSENELRENVLTRNPLPSSDLNVVSTNELVHSPTSTKDVLAVVCGNDGKETFKMLEDVDSHQNEYVLQSSDALSGLENVLRDKSSTNLVSENE

Query:  KSVTMVIDGSFTAEECLHGARLGEPGDVAVDEDQVSPLSSCMSDIKDDYLISIRQQPSEVLMEEQKSEEADRQITPPPLDQSISDVISTGKHGAKRRRHR
         ++T V+DGSFTAEECL GARLGEP DVA  ED VSPLSSCM DIKDDYLISIRQQ SEVLMEEQKSEEAD QI PPP+DQSIS V+STG   AKR+RH+
Subjt:  KSVTMVIDGSFTAEECLHGARLGEPGDVAVDEDQVSPLSSCMSDIKDDYLISIRQQPSEVLMEEQKSEEADRQITPPPLDQSISDVISTGKHGAKRRRHR

Query:  PSLKLPLKRLINPLAFKKVRETNAEKYRSKHRRHHSALLLPFKRLISPLAFKKV
            L L +L NP  FK+V+ET +EKYRSK RRH  ALLLPFKRLI+PL FKKV
Subjt:  PSLKLPLKRLINPLAFKKVRETNAEKYRSKHRRHHSALLLPFKRLISPLAFKKV

XP_038900491.1 protein GAMETOPHYTE DEFECTIVE 1 [Benincasa hispida]8.1e-28671.13Show/hide
Query:  MGFFDLNIPYDEHSSSSSSSSSIRANRIKIVARLMELGYSGIAYNRTIKGVMSDQDRCTIPLLNVTSLHSILPSFSASVEFHRDLLGVPKSSPFRQYTRL
        MGFFDLNIPYDEHSSSSS+    R+NRIKIVA+LMELGYSGIAYNRTIKGVMS++DRC+IPLLNV+SLHSILPSFSASVEFHRDLLGV +SSPFRQYTRL
Subjt:  MGFFDLNIPYDEHSSSSSSSSSIRANRIKIVARLMELGYSGIAYNRTIKGVMSDQDRCTIPLLNVTSLHSILPSFSASVEFHRDLLGVPKSSPFRQYTRL

Query:  TICINSQQEVMAVNSGNLILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFAEKRPFRLKQGQIKSAIQRGVYFEIMYSDLLMDVHERRQMISTSKLLV
        TI IN+  E++AVNSGNLILKTYDLIAVKPLNQHAF+QACE LEIDIIAIDFAEK PFRLKQG IKSAIQRGVYFEIMYSDLL DVHERRQMIST+K+LV
Subjt:  TICINSQQEVMAVNSGNLILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFAEKRPFRLKQGQIKSAIQRGVYFEIMYSDLLMDVHERRQMISTSKLLV

Query:  DWTKGKNLILSSAAPTVNEIRGPYDVANLSSLLGVSMERGKAAVSKNCRNLIANALKRKQFYKETIRVEKISSDDKLDLNDPWSVDLFKYDPISSGEGDL
        DWT GKNLILSSAAP+VNEIRGPYDVANLSSLLGVSMER KAAVSKNCRNLIANALKRK FYKETIR+E+ISSDDKLDLNDPWSVDLFK+DPISSGEGDL
Subjt:  DWTKGKNLILSSAAPTVNEIRGPYDVANLSSLLGVSMERGKAAVSKNCRNLIANALKRKQFYKETIRVEKISSDDKLDLNDPWSVDLFKYDPISSGEGDL

Query:  LLDDIAKSFAASNNKSKNVKAIDFTSVIDNMPPQGFLVKNVISGSEPKLSLNDDRDLSHAADVIEPPIGLN--IEQPHTFDGEHCSTSDRQCSVIESFEI
        LLDDIAKSFAA N+KSKNVKAIDF+S+IDN+PPQGFLVK+V++GSE KLSLND+RDL   +D IEP I +N  I+Q HT DGEHC  SD Q SVIESF+I
Subjt:  LLDDIAKSFAASNNKSKNVKAIDFTSVIDNMPPQGFLVKNVISGSEPKLSLNDDRDLSHAADVIEPPIGLN--IEQPHTFDGEHCSTSDRQCSVIESFEI

Query:  LHSHGNAGRILSNSEEDKSTVEEILQPKTSMQEEIVEMDIDNVQPQNLVPSIELNVVSTNELVHSPTSTGDVLAVVFGNDGKETFKMSDDVESHQNEYGL
         HSHG+AG + SNSEE KST+EEI+QPKTSMQEE ++MD+DN+QP+NL+ S +LNV STNELVHSPTST DV +VVF ND  ET KM             
Subjt:  LHSHGNAGRILSNSEEDKSTVEEILQPKTSMQEEIVEMDIDNVQPQNLVPSIELNVVSTNELVHSPTSTGDVLAVVFGNDGKETFKMSDDVESHQNEYGL

Query:  KSSDTLSGSENELRENVLTRNPLPSSDLNVVSTNELVHSPTSTKDVLAVVCGNDGKETFKMLEDVDSHQNEYVLQSSDALSGLENVLRDKSSTNLVSENE
                                                                      EDVDSH NEY L+SS  LSGLENV RDKSSTNL+SE++
Subjt:  KSSDTLSGSENELRENVLTRNPLPSSDLNVVSTNELVHSPTSTKDVLAVVCGNDGKETFKMLEDVDSHQNEYVLQSSDALSGLENVLRDKSSTNLVSENE

Query:  KSVTMVIDGSFTAEECLHGARLGEPGDVAVDEDQVSPLSSCMSDIKDDYLISIRQQPSEVLMEEQKSEEADRQITPPPLDQSISDVIST---GKHGAKRR
        K  TMV+D SFTAEECLH ARL EPGDVA+  DQVSPL SC SDIKDDYLISIRQQ SE LM++Q+S +AD QIT  P DQSI+ V+ST    K  +KR+
Subjt:  KSVTMVIDGSFTAEECLHGARLGEPGDVAVDEDQVSPLSSCMSDIKDDYLISIRQQPSEVLMEEQKSEEADRQITPPPLDQSISDVIST---GKHGAKRR

Query:  RHRPSLKLPLKRLINPLAFKKVRETNAEKYRSKHRRHHSALLLPFKRLISPLAFKKVRKTKC
        RH P  KLPL++LINPL FKKV +T+ EKYRSK RRHH ALLLPFKR I+ LAFKKV KTKC
Subjt:  RHRPSLKLPLKRLINPLAFKKVRETNAEKYRSKHRRHHSALLLPFKRLISPLAFKKVRKTKC

TrEMBL top hitse value%identityAlignment
A0A6J1G517 uncharacterized protein LOC111450984 isoform X11.3e-27866.54Show/hide
Query:  MGFFDLNIPYDEHSSSSSSSSSIRANRIKIVARLMELGYSGIAYNRTIKGVMSDQDRCTIPLLNVTSLHSILPSFSASVEFHRDLLGVPKSSPFRQYTRL
        MGFFDLN+PYDEH    SSSSSIR NRIKIVA+LMELGYSGIAYNRTIKGVMSD+DRCTIPLLNV+SL SILP+FSAS+EFHR+LLGVP+SSPFRQYTRL
Subjt:  MGFFDLNIPYDEHSSSSSSSSSIRANRIKIVARLMELGYSGIAYNRTIKGVMSDQDRCTIPLLNVTSLHSILPSFSASVEFHRDLLGVPKSSPFRQYTRL

Query:  TICINSQQEVMAVNSGNLILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFAEKRPFRLKQGQIKSAIQRGVYFEIMYSDLLMDVHERRQMISTSKLLV
        TICINS QE++AVNSGNLILKTYDLIAVKPLNQ+AF+QACE LEIDIIAIDFAEK PFRLKQG IKSAIQRGVYFEIMYSDLL DVH RRQMIST+K+LV
Subjt:  TICINSQQEVMAVNSGNLILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFAEKRPFRLKQGQIKSAIQRGVYFEIMYSDLLMDVHERRQMISTSKLLV

Query:  DWTKGKNLILSSAAPTVNEIRGPYDVANLSSLLGVSMERGKAAVSKNCRNLIANALKRKQFYKETIRVEKISSDDKLDLNDPWSVDLFKYDPISSGEGDL
        DWTKGKNLILSSAA +VNEIRGP+DVANLSSLLGVSMER KAAVSKNCRNLIANALKRKQFYKETIRVE+ISSDDKLD NDPWSVDLFK+DP+SSGEGDL
Subjt:  DWTKGKNLILSSAAPTVNEIRGPYDVANLSSLLGVSMERGKAAVSKNCRNLIANALKRKQFYKETIRVEKISSDDKLDLNDPWSVDLFKYDPISSGEGDL

Query:  LLDDIAKSFAASNNKSKNVKAIDFTSVIDNMPPQGFLVKNVISGSEPKLSLNDDRDLSHAADVIEPPIGLN-----------------------------
        LLDDIA+SFAAS NKSK VKAIDFTSVIDN PPQGFLVKNVISG E K+SLND+R  SH  D IEPPI +N                             
Subjt:  LLDDIAKSFAASNNKSKNVKAIDFTSVIDNMPPQGFLVKNVISGSEPKLSLNDDRDLSHAADVIEPPIGLN-----------------------------

Query:  -------------------------------IEQPHTFDGEHCSTSDRQCSVIESFEILHSHGNAGRILSNSEEDKSTVEEILQPKTSMQEEIVEMDIDN
                                       I+Q    DG+HCSTS+RQCS           G+AG++ SNS+EDK TVEEI+QPK SMQE+ + MDIDN
Subjt:  -------------------------------IEQPHTFDGEHCSTSDRQCSVIESFEILHSHGNAGRILSNSEEDKSTVEEILQPKTSMQEEIVEMDIDN

Query:  VQPQNLVPSIELNVVSTNELVHSPTSTGDVLAVVFGNDGKETFKMSDDVESHQNEYGLKSSDTLSGSENELRENVLTRNPLPSSDLNVVSTNELVHSPTS
        VQPQ                                                                          NPLPSS+LNV STN  VH PTS
Subjt:  VQPQNLVPSIELNVVSTNELVHSPTSTGDVLAVVFGNDGKETFKMSDDVESHQNEYGLKSSDTLSGSENELRENVLTRNPLPSSDLNVVSTNELVHSPTS

Query:  TKDVLAVVCGNDGKETFKMLEDVDSHQNEYVLQSSDALSGLENVLRDKSSTNLVSENEKSVTMVIDGSFTAEECLHGARLGEPGDVAVDEDQVSPLSSCM
        T+DVL V+  NDGKE F M EDV SH++EY L+SSD LSGLE+VLRDKSSTNLVSEN K++T V+DGSFTAEECL GARLGEP DVA  ED+VSPLSSCM
Subjt:  TKDVLAVVCGNDGKETFKMLEDVDSHQNEYVLQSSDALSGLENVLRDKSSTNLVSENEKSVTMVIDGSFTAEECLHGARLGEPGDVAVDEDQVSPLSSCM

Query:  SDIKDDYLISIRQQPSEVLMEEQKSEEADRQITPPPLDQSISDVISTGKHGAKRRRHRPSLKLPLKRLINPLAFKKVRETNAEKYRSKHRRHHSALLLPF
         DIKDDYLISIRQQ SEVLMEEQKS EAD QI PPPLDQSIS V+STG   AKR+RH+    L L RL NP  FK+V+ET +EKYRSK RRH  ALLLPF
Subjt:  SDIKDDYLISIRQQPSEVLMEEQKSEEADRQITPPPLDQSISDVISTGKHGAKRRRHRPSLKLPLKRLINPLAFKKVRETNAEKYRSKHRRHHSALLLPF

Query:  KRLISPLAFKKVRKTK
        KRLI+PL FKKVRKTK
Subjt:  KRLISPLAFKKVRKTK

A0A6J1G550 uncharacterized protein LOC111450984 isoform X23.3e-27766.34Show/hide
Query:  MGFFDLNIPYDEHSSSSSSSSSIRANRIKIVARLMELGYSGIAYNRTIKGVMSDQDRCTIPLLNVTSLHSILPSFSASVEFHRDLLGVPKSSPFRQYTRL
        MGFFDLN+PYDEH    SSSSSIR NRIKIVA+LMELGYSGIAYNRTIKGVMSD+DRCTIPLLNV+SL SILP+FSAS+EFHR+LLGVP+SSPFRQYTRL
Subjt:  MGFFDLNIPYDEHSSSSSSSSSIRANRIKIVARLMELGYSGIAYNRTIKGVMSDQDRCTIPLLNVTSLHSILPSFSASVEFHRDLLGVPKSSPFRQYTRL

Query:  TICINSQQEVMAVNSGNLILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFAEKRPFRLKQGQIKSAIQRGVYFEIMYSDLLMDVHERRQMISTSKLLV
        TICINS QE++AVNSGNLILKTYDLIAVKPLNQ+AF+QACE LEIDIIAIDFAEK PFRLKQG IKSAIQRGVYFEIMYSDLL DVH RRQMIST+K+LV
Subjt:  TICINSQQEVMAVNSGNLILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFAEKRPFRLKQGQIKSAIQRGVYFEIMYSDLLMDVHERRQMISTSKLLV

Query:  DWTKGKNLILSSAAPTVNEIRGPYDVANLSSLLGVSMERGKAAVSKNCRNLIANALKRKQFYKETIRVEKISSDDKLDLNDPWSVDLFKYDPISSGEGDL
        DWTKGKNLILSSAA +VNEIRGP+DVANLSSLLGVSMER KAAVSKNCRNLIANALKRKQFYKETIRVE+ISSDDKLD NDPWSVDLFK+DP+SSGEGDL
Subjt:  DWTKGKNLILSSAAPTVNEIRGPYDVANLSSLLGVSMERGKAAVSKNCRNLIANALKRKQFYKETIRVEKISSDDKLDLNDPWSVDLFKYDPISSGEGDL

Query:  LLDDIAKSFAASNNKSKNVKAIDFTSVIDNMPPQGFLVKNVISGSEPKLSLNDDRDLSHAADVIEPPIGLN-----------------------------
        LLDDIA+SFAAS NKSK VKAIDFTSVIDN PPQGFLVKNVISG E K+SLND+R  SH  D IEPPI +N                             
Subjt:  LLDDIAKSFAASNNKSKNVKAIDFTSVIDNMPPQGFLVKNVISGSEPKLSLNDDRDLSHAADVIEPPIGLN-----------------------------

Query:  -------------------------------IEQPHTFDGEHCSTSDRQCSVIESFEILHSHGNAGRILSNSEEDKSTVEEILQPKTSMQEEIVEMDIDN
                                       I+Q    DG+HCSTS+RQCS           G+AG++ SNS+EDK TVEEI+QPK SMQE+ + MDIDN
Subjt:  -------------------------------IEQPHTFDGEHCSTSDRQCSVIESFEILHSHGNAGRILSNSEEDKSTVEEILQPKTSMQEEIVEMDIDN

Query:  VQPQNLVPSIELNVVSTNELVHSPTSTGDVLAVVFGNDGKETFKMSDDVESHQNEYGLKSSDTLSGSENELRENVLTRNPLPSSDLNVVSTNELVHSPTS
        VQPQ                                                                          NPLPSS+LNV STN  VH PTS
Subjt:  VQPQNLVPSIELNVVSTNELVHSPTSTGDVLAVVFGNDGKETFKMSDDVESHQNEYGLKSSDTLSGSENELRENVLTRNPLPSSDLNVVSTNELVHSPTS

Query:  TKDVLAVVCGNDGKETFKMLEDVDSHQNEYVLQSSDALSGLENVLRDKSSTNLVSENEKSVTMVIDGSFTAEECLHGARLGEPGDVAVDEDQVSPLSSCM
        T+DVL V+  NDGKE F M EDV SH++EY L+SSD LSGLE+VLRDKSSTNLVSEN K++T V+DGSFTAEECL GARLGEP DVA  ED+VSPLSSCM
Subjt:  TKDVLAVVCGNDGKETFKMLEDVDSHQNEYVLQSSDALSGLENVLRDKSSTNLVSENEKSVTMVIDGSFTAEECLHGARLGEPGDVAVDEDQVSPLSSCM

Query:  SDIKDDYLISIRQQPSEVLMEEQKSEEADRQITPPPLDQSISDVISTGKHGAKRRRHRPSLKLPLKRLINPLAFKKVRETNAEKYRSKHRRHHSALLLPF
         DIKDDYLISIRQQ SEVLMEEQKS EAD QI PPPLDQSIS V+STG   AKR+RH+    L L RL NP  FK+V+ET +EKYRSK RRH  ALLLPF
Subjt:  SDIKDDYLISIRQQPSEVLMEEQKSEEADRQITPPPLDQSISDVISTGKHGAKRRRHRPSLKLPLKRLINPLAFKKVRETNAEKYRSKHRRHHSALLLPF

Query:  KRLISPLAFKKVRK
        KRLI+PL FKKV+K
Subjt:  KRLISPLAFKKVRK

A0A6J1L082 uncharacterized protein LOC111499856 isoform X38.7e-28671.9Show/hide
Query:  MGFFDLNIPYDEHSSSSSSSSSIRANRIKIVARLMELGYSGIAYNRTIKGVMSDQDRCTIPLLNVTSLHSILPSFSASVEFHRDLLGVPKSSPFRQYTRL
        MGFFDLNIPYDEH    SSSSSIR NRIKIVA+LMELGYSGIAYNRTIKGVMSD+DRCTIPLLNV+SL SILP+FSAS+EFHR+LLGVP+SSPFRQYTRL
Subjt:  MGFFDLNIPYDEHSSSSSSSSSIRANRIKIVARLMELGYSGIAYNRTIKGVMSDQDRCTIPLLNVTSLHSILPSFSASVEFHRDLLGVPKSSPFRQYTRL

Query:  TICINSQQEVMAVNSGNLILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFAEKRPFRLKQGQIKSAIQRGVYFEIMYSDLLMDVHERRQMISTSKLLV
        TICINS QE++AVNSGNLILKTYDLIAVKPLNQ+AF+QACE LEIDIIAIDFAEK PFRLKQG IKSAIQRGVYFEIMYSDLL DVH RRQMIST+K+LV
Subjt:  TICINSQQEVMAVNSGNLILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFAEKRPFRLKQGQIKSAIQRGVYFEIMYSDLLMDVHERRQMISTSKLLV

Query:  DWTKGKNLILSSAAPTVNEIRGPYDVANLSSLLGVSMERGKAAVSKNCRNLIANALKRKQFYKETIRVEKISSDDKLDLNDPWSVDLFKYDPISSGEGDL
        DWTKGKNLILSSAA +VNEIRGP+DVANLSSLLGVSMER K AVSKNCRNLIANALKRKQFYKETIRVE+I+SDDKLD NDPWSVDLFK+DPISSGEGDL
Subjt:  DWTKGKNLILSSAAPTVNEIRGPYDVANLSSLLGVSMERGKAAVSKNCRNLIANALKRKQFYKETIRVEKISSDDKLDLNDPWSVDLFKYDPISSGEGDL

Query:  LLDDIAKSFAASNNKSKNVKAIDFTSVIDNMPPQGFLVKNVISGSEPKLSLNDDRDLSHAADVIEPPIGLN--IEQPHTFDGEHCSTSDRQCSVIESFEI
        LLDDIAKSFAA+ NKSK VKAIDFTSVIDNMPPQGFL+KNVI+GSE K+SLND+R LS   D IEPPI +N  I+Q    DG+HCSTSDRQCS       
Subjt:  LLDDIAKSFAASNNKSKNVKAIDFTSVIDNMPPQGFLVKNVISGSEPKLSLNDDRDLSHAADVIEPPIGLN--IEQPHTFDGEHCSTSDRQCSVIESFEI

Query:  LHSHGNAGRILSNSEEDKSTVEEILQPKTSMQEEIVEMDIDNVQPQNLVPSIELNVVSTNELVHSPTSTGDVLAVVFGNDGKETFKMSDDVESHQNEYGL
             +AG++ SNS+EDK TVEEI+QPK S+QEE +EMDIDNVQPQ                                                      
Subjt:  LHSHGNAGRILSNSEEDKSTVEEILQPKTSMQEEIVEMDIDNVQPQNLVPSIELNVVSTNELVHSPTSTGDVLAVVFGNDGKETFKMSDDVESHQNEYGL

Query:  KSSDTLSGSENELRENVLTRNPLPSSDLNVVSTNELVHSPTSTKDVLAVVCGNDGKETFKMLEDVDSHQNEYVLQSSDALSGLENVLRDKSSTNLVSENE
                            NPLPSS+LNVVSTN  VH PTST+DVL VV  NDGKETF M EDV SH++EY L+SSD LSGLE+VLRDKSSTNLVSEN+
Subjt:  KSSDTLSGSENELRENVLTRNPLPSSDLNVVSTNELVHSPTSTKDVLAVVCGNDGKETFKMLEDVDSHQNEYVLQSSDALSGLENVLRDKSSTNLVSENE

Query:  KSVTMVIDGSFTAEECLHGARLGEPGDVAVDEDQVSPLSSCMSDIKDDYLISIRQQPSEVLMEEQKSEEADRQITPPPLDQSISDVISTGKHGAKRRRHR
         ++T V+DGSFTAEECL GARLGEP DVA  ED VSPLSSCM DIKDDYLISIRQQ SEVLMEEQKSEEAD QI PPP+DQSIS V+STG   AKR+RH+
Subjt:  KSVTMVIDGSFTAEECLHGARLGEPGDVAVDEDQVSPLSSCMSDIKDDYLISIRQQPSEVLMEEQKSEEADRQITPPPLDQSISDVISTGKHGAKRRRHR

Query:  PSLKLPLKRLINPLAFKKVRETNAEKYRSKHRRHHSALLLPFKRLISPLAFKKVRKTK
            L L +L NP  FK+V+ET +EKYRSK RRH  ALLLPFKRLI+PL FKKVRKTK
Subjt:  PSLKLPLKRLINPLAFKKVRETNAEKYRSKHRRHHSALLLPFKRLISPLAFKKVRKTK

A0A6J1L2N3 uncharacterized protein LOC111499856 isoform X24.9e-28972.55Show/hide
Query:  MGFFDLNIPYDEHSSSSSSSSSIRANRIKIVARLMELGYSGIAYNRTIKGVMSDQDRCTIPLLNVTSLHSILPSFSASVEFHRDLLGVPKSSPFRQYTRL
        MGFFDLNIPYDEH    SSSSSIR NRIKIVA+LMELGYSGIAYNRTIKGVMSD+DRCTIPLLNV+SL SILP+FSAS+EFHR+LLGVP+SSPFRQYTRL
Subjt:  MGFFDLNIPYDEHSSSSSSSSSIRANRIKIVARLMELGYSGIAYNRTIKGVMSDQDRCTIPLLNVTSLHSILPSFSASVEFHRDLLGVPKSSPFRQYTRL

Query:  TICINSQQEVMAVNSGNLILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFAEKRPFRLKQGQIKSAIQRGVYFEIMYSDLLMDVHERRQMISTSKLLV
        TICINS QE++AVNSGNLILKTYDLIAVKPLNQ+AF+QACE LEIDIIAIDFAEK PFRLKQG IKSAIQRGVYFEIMYSDLL DVH RRQMIST+K+LV
Subjt:  TICINSQQEVMAVNSGNLILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFAEKRPFRLKQGQIKSAIQRGVYFEIMYSDLLMDVHERRQMISTSKLLV

Query:  DWTKGKNLILSSAAPTVNEIRGPYDVANLSSLLGVSMERGKAAVSKNCRNLIANALKRKQFYKETIRVEKISSDDKLDLNDPWSVDLFKYDPISSGEGDL
        DWTKGKNLILSSAA +VNEIRGP+DVANLSSLLGVSMER K AVSKNCRNLIANALKRKQFYKETIRVE+I+SDDKLD NDPWSVDLFK+DPISSGEGDL
Subjt:  DWTKGKNLILSSAAPTVNEIRGPYDVANLSSLLGVSMERGKAAVSKNCRNLIANALKRKQFYKETIRVEKISSDDKLDLNDPWSVDLFKYDPISSGEGDL

Query:  LLDDIAKSFAASNNKSKNVKAIDFTSVIDNMPPQGFLVKNVISGSEPKLSLNDDRDLSHAADVIEPPIGLN--IEQPHTFDGEHCSTSDRQCSVIESFEI
        LLDDIAKSFAA+ NKSK VKAIDFTSVIDNMPPQGFL+KNVI+GSE K+SLND+R LS   D IEPPI +N  I+Q    DG+HCSTSDRQCSVIE+ +I
Subjt:  LLDDIAKSFAASNNKSKNVKAIDFTSVIDNMPPQGFLVKNVISGSEPKLSLNDDRDLSHAADVIEPPIGLN--IEQPHTFDGEHCSTSDRQCSVIESFEI

Query:  LHSHGNAGRILSNSEEDKSTVEEILQPKTSMQEEIVEMDIDNVQPQNLVPSIELNVVSTNELVHSPTSTGDVLAVVFGNDGKETFKMSDDVESHQNEYGL
          SH +AG++ SNS+EDK TVEEI+QPK S+QEE +EMDIDNVQPQ                                                      
Subjt:  LHSHGNAGRILSNSEEDKSTVEEILQPKTSMQEEIVEMDIDNVQPQNLVPSIELNVVSTNELVHSPTSTGDVLAVVFGNDGKETFKMSDDVESHQNEYGL

Query:  KSSDTLSGSENELRENVLTRNPLPSSDLNVVSTNELVHSPTSTKDVLAVVCGNDGKETFKMLEDVDSHQNEYVLQSSDALSGLENVLRDKSSTNLVSENE
                            NPLPSS+LNVVSTN  VH PTST+DVL VV  NDGKETF M EDV SH++EY L+SSD LSGLE+VLRDKSSTNLVSEN+
Subjt:  KSSDTLSGSENELRENVLTRNPLPSSDLNVVSTNELVHSPTSTKDVLAVVCGNDGKETFKMLEDVDSHQNEYVLQSSDALSGLENVLRDKSSTNLVSENE

Query:  KSVTMVIDGSFTAEECLHGARLGEPGDVAVDEDQVSPLSSCMSDIKDDYLISIRQQPSEVLMEEQKSEEADRQITPPPLDQSISDVISTGKHGAKRRRHR
         ++T V+DGSFTAEECL GARLGEP DVA  ED VSPLSSCM DIKDDYLISIRQQ SEVLMEEQKSEEAD QI PPP+DQSIS V+STG   AKR+RH+
Subjt:  KSVTMVIDGSFTAEECLHGARLGEPGDVAVDEDQVSPLSSCMSDIKDDYLISIRQQPSEVLMEEQKSEEADRQITPPPLDQSISDVISTGKHGAKRRRHR

Query:  PSLKLPLKRLINPLAFKKVRETNAEKYRSKHRRHHSALLLPFKRLISPLAFKKV
            L L +L NP  FK+V+ET +EKYRSK RRH  ALLLPFKRLI+PL FKKV
Subjt:  PSLKLPLKRLINPLAFKKVRETNAEKYRSKHRRHHSALLLPFKRLISPLAFKKV

A0A6J1L4M3 uncharacterized protein LOC111499856 isoform X12.4e-29172.69Show/hide
Query:  MGFFDLNIPYDEHSSSSSSSSSIRANRIKIVARLMELGYSGIAYNRTIKGVMSDQDRCTIPLLNVTSLHSILPSFSASVEFHRDLLGVPKSSPFRQYTRL
        MGFFDLNIPYDEH    SSSSSIR NRIKIVA+LMELGYSGIAYNRTIKGVMSD+DRCTIPLLNV+SL SILP+FSAS+EFHR+LLGVP+SSPFRQYTRL
Subjt:  MGFFDLNIPYDEHSSSSSSSSSIRANRIKIVARLMELGYSGIAYNRTIKGVMSDQDRCTIPLLNVTSLHSILPSFSASVEFHRDLLGVPKSSPFRQYTRL

Query:  TICINSQQEVMAVNSGNLILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFAEKRPFRLKQGQIKSAIQRGVYFEIMYSDLLMDVHERRQMISTSKLLV
        TICINS QE++AVNSGNLILKTYDLIAVKPLNQ+AF+QACE LEIDIIAIDFAEK PFRLKQG IKSAIQRGVYFEIMYSDLL DVH RRQMIST+K+LV
Subjt:  TICINSQQEVMAVNSGNLILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFAEKRPFRLKQGQIKSAIQRGVYFEIMYSDLLMDVHERRQMISTSKLLV

Query:  DWTKGKNLILSSAAPTVNEIRGPYDVANLSSLLGVSMERGKAAVSKNCRNLIANALKRKQFYKETIRVEKISSDDKLDLNDPWSVDLFKYDPISSGEGDL
        DWTKGKNLILSSAA +VNEIRGP+DVANLSSLLGVSMER K AVSKNCRNLIANALKRKQFYKETIRVE+I+SDDKLD NDPWSVDLFK+DPISSGEGDL
Subjt:  DWTKGKNLILSSAAPTVNEIRGPYDVANLSSLLGVSMERGKAAVSKNCRNLIANALKRKQFYKETIRVEKISSDDKLDLNDPWSVDLFKYDPISSGEGDL

Query:  LLDDIAKSFAASNNKSKNVKAIDFTSVIDNMPPQGFLVKNVISGSEPKLSLNDDRDLSHAADVIEPPIGLN--IEQPHTFDGEHCSTSDRQCSVIESFEI
        LLDDIAKSFAA+ NKSK VKAIDFTSVIDNMPPQGFL+KNVI+GSE K+SLND+R LS   D IEPPI +N  I+Q    DG+HCSTSDRQCSVIE+ +I
Subjt:  LLDDIAKSFAASNNKSKNVKAIDFTSVIDNMPPQGFLVKNVISGSEPKLSLNDDRDLSHAADVIEPPIGLN--IEQPHTFDGEHCSTSDRQCSVIESFEI

Query:  LHSHGNAGRILSNSEEDKSTVEEILQPKTSMQEEIVEMDIDNVQPQNLVPSIELNVVSTNELVHSPTSTGDVLAVVFGNDGKETFKMSDDVESHQNEYGL
          SH +AG++ SNS+EDK TVEEI+QPK S+QEE +EMDIDNVQPQ                                                      
Subjt:  LHSHGNAGRILSNSEEDKSTVEEILQPKTSMQEEIVEMDIDNVQPQNLVPSIELNVVSTNELVHSPTSTGDVLAVVFGNDGKETFKMSDDVESHQNEYGL

Query:  KSSDTLSGSENELRENVLTRNPLPSSDLNVVSTNELVHSPTSTKDVLAVVCGNDGKETFKMLEDVDSHQNEYVLQSSDALSGLENVLRDKSSTNLVSENE
                            NPLPSS+LNVVSTN  VH PTST+DVL VV  NDGKETF M EDV SH++EY L+SSD LSGLE+VLRDKSSTNLVSEN+
Subjt:  KSSDTLSGSENELRENVLTRNPLPSSDLNVVSTNELVHSPTSTKDVLAVVCGNDGKETFKMLEDVDSHQNEYVLQSSDALSGLENVLRDKSSTNLVSENE

Query:  KSVTMVIDGSFTAEECLHGARLGEPGDVAVDEDQVSPLSSCMSDIKDDYLISIRQQPSEVLMEEQKSEEADRQITPPPLDQSISDVISTGKHGAKRRRHR
         ++T V+DGSFTAEECL GARLGEP DVA  ED VSPLSSCM DIKDDYLISIRQQ SEVLMEEQKSEEAD QI PPP+DQSIS V+STG   AKR+RH+
Subjt:  KSVTMVIDGSFTAEECLHGARLGEPGDVAVDEDQVSPLSSCMSDIKDDYLISIRQQPSEVLMEEQKSEEADRQITPPPLDQSISDVISTGKHGAKRRRHR

Query:  PSLKLPLKRLINPLAFKKVRETNAEKYRSKHRRHHSALLLPFKRLISPLAFKKVRKTK
            L L +L NP  FK+V+ET +EKYRSK RRH  ALLLPFKRLI+PL FKKVRKTK
Subjt:  PSLKLPLKRLINPLAFKKVRETNAEKYRSKHRRHHSALLLPFKRLISPLAFKKVRKTK

SwissProt top hitse value%identityAlignment
F4JXF1 Protein GAMETOPHYTE DEFECTIVE 12.2e-11358.72Show/hide
Query:  MGFFDLNIPYDEHSSSSSSS-SSIRANRIKIVARLMELGYSGIAYNRTIKGVMSDQDRCTIPLLNVTSLHSILPSFSASVEFHRDLLGVPKSSPFRQYTR
        MGFFDL+IPY+E   S     +  +  R+K+  + MELGY GIA+NR+IKGVMSD+D CTIPLL + SL  + P  ++SV FHRDLLGVP+++PFRQYTR
Subjt:  MGFFDLNIPYDEHSSSSSSS-SSIRANRIKIVARLMELGYSGIAYNRTIKGVMSDQDRCTIPLLNVTSLHSILPSFSASVEFHRDLLGVPKSSPFRQYTR

Query:  LTICINSQQEVMAVNSGNLILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFAEKRPFRLKQGQIKSAIQRGVYFEIMYSDLLMDVHERRQMISTSKLL
        LT+ + S  +  ++NSGN ILK+YD+IAV+P+NQ+AFD ACE  E+D+I+IDF +K  FRLK   +K+AIQRG+YFEI YSD+LMD   RRQ+IS +KLL
Subjt:  LTICINSQQEVMAVNSGNLILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFAEKRPFRLKQGQIKSAIQRGVYFEIMYSDLLMDVHERRQMISTSKLL

Query:  VDWTKGKNLILSSAAPTVNEIRGPYDVANLSSLLGVSMERGKAAVSKNCRNLIANALKRKQFYKETIRVEKISSDDKLDLNDPWSVDLFKYDPISSGEGD
        VDWT+GKNLI+SS AP+V E+RGP DV NL  LLG+S ER +AA+SKNCRN+IA  LK+K+F+KE +RVE +S+ D   L  P S D  K+D +SSGEGD
Subjt:  VDWTKGKNLILSSAAPTVNEIRGPYDVANLSSLLGVSMERGKAAVSKNCRNLIANALKRKQFYKETIRVEKISSDDKLDLNDPWSVDLFKYDPISSGEGD

Query:  LLLDDIAKSFAASNNKS-KNVKAIDFTSVIDNMPPQGFLVKNVI
        +LLDD+AK+F A+N  + K+ KAIDFTSV+D +P  GF VK+++
Subjt:  LLLDDIAKSFAASNNKS-KNVKAIDFTSVIDNMPPQGFLVKNVI

O88796 Ribonuclease P protein subunit p301.4e-3035.32Show/hide
Query:  SSSSIRANRIKIVARLMELGYSGIAYNRTIKGVMSDQDRCTIPLLNVTSLHSILPSFSASVEFHRDLLGVPKSSPFRQYTRLTICINSQQEVMAVNSGNL
        + S ++A R  +V     LGYS +A N  +     ++ R     + V+ L + LP                KS P +  TRLTI +        + + + 
Subjt:  SSSSIRANRIKIVARLMELGYSGIAYNRTIKGVMSDQDRCTIPLLNVTSLHSILPSFSASVEFHRDLLGVPKSSPFRQYTRLTICINSQQEVMAVNSGNL

Query:  ILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFAEKRPFRLKQGQIKSAIQRGVYFEIMYSDLLMDVHERRQMISTSKLLVDWTKGKNLILSSAAPTVN
         ++ YD++AV P  +  F  AC +L++D++ I   EK PF  K+  +  AI+RG+ FE++Y   + D   RR  IS +  L+   KGKN+ILSSAA    
Subjt:  ILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFAEKRPFRLKQGQIKSAIQRGVYFEIMYSDLLMDVHERRQMISTSKLLVDWTKGKNLILSSAAPTVN

Query:  EIRGPYDVANLSSLLGVSMERGKAAVSKNCRNLIANALKRKQFYKETIRVEK
        EIRGPYDVANL  L G+S   GKAAVS NCR +  +   RK  +     V+K
Subjt:  EIRGPYDVANLSSLLGVSMERGKAAVSKNCRNLIANALKRKQFYKETIRVEK

P78346 Ribonuclease P protein subunit p303.4e-2934.92Show/hide
Query:  SSSSIRANRIKIVARLMELGYSGIAYNRTIKGVMSDQDRCTIPLLNVTSLHSILPSFSASVEFHRDLLGVPKSSPFRQYTRLTICINSQQEVMAVNSGNL
        + S ++A R  +V     LGYS +A N  +     ++ +     + V+ L + LP                KS P +  TRLTI ++       + + + 
Subjt:  SSSSIRANRIKIVARLMELGYSGIAYNRTIKGVMSDQDRCTIPLLNVTSLHSILPSFSASVEFHRDLLGVPKSSPFRQYTRLTICINSQQEVMAVNSGNL

Query:  ILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFAEKRPFRLKQGQIKSAIQRGVYFEIMYSDLLMDVHERRQMISTSKLLVDWTKGKNLILSSAAPTVN
          + YD++AV P  +  F  AC +L++D++ I   EK PF  K+  I  AI RG+ FE++YS  + D   RR  IS++  L+   KGKN+I+SSAA    
Subjt:  ILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFAEKRPFRLKQGQIKSAIQRGVYFEIMYSDLLMDVHERRQMISTSKLLVDWTKGKNLILSSAAPTVN

Query:  EIRGPYDVANLSSLLGVSMERGKAAVSKNCRNLIANALKRKQFYKETIRVEK
        EIRGPYDVANL  L G+S    KAAVS NCR  + +   RK  +     V+K
Subjt:  EIRGPYDVANLSSLLGVSMERGKAAVSKNCRNLIANALKRKQFYKETIRVEK

Q3SZ21 Ribonuclease P protein subunit p301.1e-3035.32Show/hide
Query:  SSSSIRANRIKIVARLMELGYSGIAYNRTIKGVMSDQDRCTIPLLNVTSLHSILPSFSASVEFHRDLLGVPKSSPFRQYTRLTICINSQQEVMAVNSGNL
        + S ++A R  +V     LGYS +A N  ++    ++ +     + V+ L + LP                KS P +  TRLTI ++       + + + 
Subjt:  SSSSIRANRIKIVARLMELGYSGIAYNRTIKGVMSDQDRCTIPLLNVTSLHSILPSFSASVEFHRDLLGVPKSSPFRQYTRLTICINSQQEVMAVNSGNL

Query:  ILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFAEKRPFRLKQGQIKSAIQRGVYFEIMYSDLLMDVHERRQMISTSKLLVDWTKGKNLILSSAAPTVN
         ++ YD++AV P  +  F  AC +L++D++ I   EK PF  K+  I  AI RGV FE++YS  + D   RR  IS +  L+   KGKN+I+SSAA    
Subjt:  ILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFAEKRPFRLKQGQIKSAIQRGVYFEIMYSDLLMDVHERRQMISTSKLLVDWTKGKNLILSSAAPTVN

Query:  EIRGPYDVANLSSLLGVSMERGKAAVSKNCRNLIANALKRKQFYKETIRVEK
        EIRGPYDVANL  L G+S    KAAVS NCR ++ +   RK  +     V+K
Subjt:  EIRGPYDVANLSSLLGVSMERGKAAVSKNCRNLIANALKRKQFYKETIRVEK

Q3ZE13 Ribonuclease P protein subunit drpp307.9e-2629.62Show/hide
Query:  MGFFDLNIPYDEHSSSSSSSSSIRANRIKIVARL-MELGYSGIAYNRTIKGVMSDQDRCTIPLLNVTSLHSILPSFSASVEFHRDLLGVPKSSPFRQYTR
        M ++DLNI            SS+   +IK +  L  + GY  +A   T++G +  +D C I  + +            S       +G   +   +QYTR
Subjt:  MGFFDLNIPYDEHSSSSSSSSSIRANRIKIVARL-MELGYSGIAYNRTIKGVMSDQDRCTIPLLNVTSLHSILPSFSASVEFHRDLLGVPKSSPFRQYTR

Query:  LTICINSQQEVMAVNSGNLILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFAEKRPFRLKQGQIKSAIQRGVYFEIMYSDLLMDVHERRQMISTSKLL
        L +   +  E   + + N ++++YD+I+V P +   F+ AC + EIDII ID   K  F +K  +++  I +G++ EI+Y +L     +R      +  L
Subjt:  LTICINSQQEVMAVNSGNLILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFAEKRPFRLKQGQIKSAIQRGVYFEIMYSDLLMDVHERRQMISTSKLL

Query:  VDWTKGKNLILSSAAPTVNEIRGPYDVANLSSLLGVSMERGKAAVSKNCRNLIANALKRK
        V  + GKN+ILSS+  +   +R PYD++NL  L G++ ++ KAAVSK+    + +A+ R+
Subjt:  VDWTKGKNLILSSAAPTVNEIRGPYDVANLSSLLGVSMERGKAAVSKNCRNLIANALKRK

Arabidopsis top hitse value%identityAlignment
AT5G59980.1 Polymerase/histidinol phosphatase-like1.6e-11458.72Show/hide
Query:  MGFFDLNIPYDEHSSSSSSS-SSIRANRIKIVARLMELGYSGIAYNRTIKGVMSDQDRCTIPLLNVTSLHSILPSFSASVEFHRDLLGVPKSSPFRQYTR
        MGFFDL+IPY+E   S     +  +  R+K+  + MELGY GIA+NR+IKGVMSD+D CTIPLL + SL  + P  ++SV FHRDLLGVP+++PFRQYTR
Subjt:  MGFFDLNIPYDEHSSSSSSS-SSIRANRIKIVARLMELGYSGIAYNRTIKGVMSDQDRCTIPLLNVTSLHSILPSFSASVEFHRDLLGVPKSSPFRQYTR

Query:  LTICINSQQEVMAVNSGNLILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFAEKRPFRLKQGQIKSAIQRGVYFEIMYSDLLMDVHERRQMISTSKLL
        LT+ + S  +  ++NSGN ILK+YD+IAV+P+NQ+AFD ACE  E+D+I+IDF +K  FRLK   +K+AIQRG+YFEI YSD+LMD   RRQ+IS +KLL
Subjt:  LTICINSQQEVMAVNSGNLILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFAEKRPFRLKQGQIKSAIQRGVYFEIMYSDLLMDVHERRQMISTSKLL

Query:  VDWTKGKNLILSSAAPTVNEIRGPYDVANLSSLLGVSMERGKAAVSKNCRNLIANALKRKQFYKETIRVEKISSDDKLDLNDPWSVDLFKYDPISSGEGD
        VDWT+GKNLI+SS AP+V E+RGP DV NL  LLG+S ER +AA+SKNCRN+IA  LK+K+F+KE +RVE +S+ D   L  P S D  K+D +SSGEGD
Subjt:  VDWTKGKNLILSSAAPTVNEIRGPYDVANLSSLLGVSMERGKAAVSKNCRNLIANALKRKQFYKETIRVEKISSDDKLDLNDPWSVDLFKYDPISSGEGD

Query:  LLLDDIAKSFAASNNKS-KNVKAIDFTSVIDNMPPQGFLVKNVI
        +LLDD+AK+F A+N  + K+ KAIDFTSV+D +P  GF VK+++
Subjt:  LLLDDIAKSFAASNNKS-KNVKAIDFTSVIDNMPPQGFLVKNVI

AT5G59980.2 Polymerase/histidinol phosphatase-like1.3e-11336.66Show/hide
Query:  MGFFDLNIPYDEHSSSSSSS-SSIRANRIKIVARLMELGYSGIAYNRTIKGVMSDQDRCTIPLLNVTSLHSILPSFSASVEFHRDLLGVPKSSPFRQYTR
        MGFFDL+IPY+E   S     +  +  R+K+  + MELGY GIA+NR+IKGVMSD+D CTIPLL + SL  + P  ++SV FHRDLLGVP+++PFRQYTR
Subjt:  MGFFDLNIPYDEHSSSSSSS-SSIRANRIKIVARLMELGYSGIAYNRTIKGVMSDQDRCTIPLLNVTSLHSILPSFSASVEFHRDLLGVPKSSPFRQYTR

Query:  LTICINSQQEVMAVNSGNLILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFAEKRPFRLKQGQIKSAIQRGVYFEIMYSDLLMDVHERRQMISTSKLL
        LT+ + S  +  ++NSGN ILK+YD+IAV+P+NQ+AFD ACE  E+D+I+IDF +K  FRLK   +K+AIQRG+YFEI YSD+LMD   RRQ+IS +KLL
Subjt:  LTICINSQQEVMAVNSGNLILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFAEKRPFRLKQGQIKSAIQRGVYFEIMYSDLLMDVHERRQMISTSKLL

Query:  VDWTKGKNLILSSAAPTVNEIRGPYDVANLSSLLGVSMERGKAAVSKNCRNLIANALKRKQFYKETIRVEKISSDDKLDLNDPWSVDLFKYDPISSGEGD
        VDWT+GKNLI+SS AP+V E+RGP DV NL  LLG+S ER +AA+SKNCRN+IA  LK+K+F+KE +RVE +S+ D   L  P S D  K+D +SSGEGD
Subjt:  VDWTKGKNLILSSAAPTVNEIRGPYDVANLSSLLGVSMERGKAAVSKNCRNLIANALKRKQFYKETIRVEKISSDDKLDLNDPWSVDLFKYDPISSGEGD

Query:  LLLDDIAKSFAASNNKS-KNVKAIDFTSVIDNMPPQGFLVKNVI---SGSEPKLSLNDDRDLSHAADVIEPPIGL-----NIEQPHTFDGEHCSTSDRQC
        +LLDD+AK+F A+N  + K+ KAIDFTSV+D +P  GF VK+++   S ++P  +   D  +  +  V E  +       N+ +  T        S+   
Subjt:  LLLDDIAKSFAASNNKS-KNVKAIDFTSVIDNMPPQGFLVKNVI---SGSEPKLSLNDDRDLSHAADVIEPPIGL-----NIEQPHTFDGEHCSTSDRQC

Query:  SVIESFEILHSHGNAGRILSNSEEDKSTVEEILQPKTSMQEEIVEMDIDNVQPQNLVPSIELNVVSTNELVHSPTSTGD----VLAVVFGNDGKETFKMS
         V  +  +L     A R  S S      V+          +      +           + +N+ ST+E      S  D       V   N G   F+  
Subjt:  SVIESFEILHSHGNAGRILSNSEEDKSTVEEILQPKTSMQEEIVEMDIDNVQPQNLVPSIELNVVSTNELVHSPTSTGD----VLAVVFGNDGKETFKMS

Query:  DDVESHQNE---YGLKSSDTLSGSENELRENVLTRNPLPSSDLNVVSTNELVHSPTSTKDVLAVVCGNDGKETFKMLEDVDSHQNEYVLQSSDALSGLEN
          V+ +  E    G  S+D +  +E+    ++     +P  +             TS  D + + C ++      M   ++   +E    + D+   L N
Subjt:  DDVESHQNE---YGLKSSDTLSGSENELRENVLTRNPLPSSDLNVVSTNELVHSPTSTKDVLAVVCGNDGKETFKMLEDVDSHQNEYVLQSSDALSGLEN

Query:  VLRDKSSTNLVSENEKSVTMVIDGSFTAEECLHGARLGEPGDVAVDEDQVSPLSSCMSDIKDDYLISIRQQPSEVLMEEQKSEE---ADRQITPPPLDQS
        +     +T+L+SE+ KS++                    P  V  D D+ S L S  + ++++  +       E+ ME+ K EE    D ++     + S
Subjt:  VLRDKSSTNLVSENEKSVTMVIDGSFTAEECLHGARLGEPGDVAVDEDQVSPLSSCMSDIKDDYLISIRQQPSEVLMEEQKSEE---ADRQITPPPLDQS

Query:  ISDVI---------STGKHGAKRRRHRPSLKLPLKRLINPLAFKKVRETNAEKYRSKHRRH
            +          +GK  AKR R R     P K  +    FK++         SK R+H
Subjt:  ISDVI---------STGKHGAKRRRHRPSLKLPLKRLINPLAFKKVRETNAEKYRSKHRRH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTTTCTTCGACCTCAACATACCTTACGATGAGCATTCATCTTCATCTTCATCTTCATCTTCAATCAGGGCTAATCGCATCAAGATTGTCGCCAGACTCATGGAGCT
CGGCTACTCTGGTATCGCCTACAACCGTACGATTAAAGGAGTGATGTCCGATCAGGACCGCTGTACTATTCCCCTCCTCAATGTCACATCGCTTCACAGTATCCTTCCGT
CGTTCTCCGCCTCCGTGGAGTTCCACCGCGATCTCCTCGGCGTCCCTAAATCCTCTCCTTTCCGTCAGTACACGCGTCTTACTATTTGTATCAACAGTCAGCAGGAGGTT
ATGGCAGTCAATTCTGGCAACCTCATTCTGAAGACCTACGATTTGATTGCTGTGAAGCCTCTTAATCAGCATGCTTTCGACCAGGCTTGCGAGAATTTGGAGATAGATAT
AATTGCCATTGATTTTGCGGAGAAACGGCCTTTCAGGTTGAAGCAAGGTCAGATAAAATCTGCAATTCAGCGTGGGGTTTACTTCGAAATTATGTACTCTGATCTTCTTA
TGGACGTTCATGAAAGGAGGCAAATGATATCCACTTCAAAGTTATTGGTGGATTGGACGAAAGGAAAGAATCTCATATTATCCAGTGCGGCCCCCACTGTAAATGAAATC
AGAGGACCTTATGATGTTGCAAACTTGTCATCATTGCTTGGTGTCTCTATGGAACGAGGAAAAGCTGCTGTTTCGAAAAATTGTAGGAATCTTATAGCTAATGCCCTAAA
GAGAAAGCAGTTCTACAAGGAGACAATTCGAGTTGAAAAGATATCATCAGATGATAAATTAGATCTGAATGACCCTTGGTCAGTGGATTTGTTCAAATATGATCCTATAT
CAAGTGGCGAAGGTGATTTGCTATTGGATGATATAGCAAAATCGTTTGCTGCCTCTAACAATAAATCAAAAAATGTGAAAGCCATTGATTTCACTTCAGTCATTGACAAC
ATGCCACCACAAGGTTTTCTAGTCAAGAATGTAATATCAGGCTCTGAGCCAAAACTGTCGTTGAATGATGACAGAGACTTGTCGCATGCTGCTGATGTCATTGAGCCACC
GATTGGATTAAATATTGAACAACCCCATACTTTTGATGGAGAACACTGTTCTACATCTGATCGTCAATGCTCTGTAATTGAAAGTTTTGAAATTTTACATTCACATGGTA
ATGCAGGAAGAATTTTAAGTAATTCTGAGGAAGACAAAAGCACTGTTGAAGAAATTCTGCAGCCCAAGACCTCAATGCAGGAAGAAATTGTTGAAATGGACATTGATAAT
GTGCAACCACAAAACCTGGTACCCAGTATTGAGTTGAATGTAGTATCGACAAATGAGCTTGTGCATTCTCCTACATCTACTGGAGATGTATTAGCTGTTGTCTTCGGGAA
TGATGGAAAAGAAACTTTTAAAATGTCAGATGATGTTGAATCTCATCAGAATGAGTATGGTTTAAAAAGTTCTGATACATTGTCTGGTTCAGAAAATGAGTTGAGGGAAA
ATGTGCTAACACGAAACCCACTACCCAGTAGCGATTTGAATGTTGTGTCAACAAATGAGCTTGTGCATTCTCCTACATCTACCAAAGATGTATTAGCTGTTGTTTGCGGG
AATGATGGAAAAGAAACTTTTAAAATGTTAGAGGATGTTGATTCTCATCAGAATGAGTATGTCTTACAAAGTTCTGATGCATTGTCTGGTTTAGAAAATGTGTTGAGGGA
CAAAAGTTCAACTAATTTGGTTTCAGAAAATGAAAAGAGTGTGACTATGGTAATAGATGGTTCATTTACAGCTGAAGAATGTTTACATGGTGCAAGGCTTGGAGAGCCTG
GGGATGTGGCAGTAGACGAGGATCAGGTTTCTCCTCTTAGTTCTTGTATGAGTGACATAAAGGATGATTATTTGATCTCGATTCGACAACAACCATCTGAGGTGTTGATG
GAAGAGCAAAAAAGTGAAGAAGCTGACCGTCAGATCACACCACCACCTTTAGATCAATCTATATCTGATGTGATTTCAACAGGGAAGCATGGAGCGAAACGGAGGAGGCA
TCGTCCATCATTAAAACTTCCGCTCAAGCGGCTAATCAATCCTCTAGCCTTCAAGAAAGTCCGTGAAACGAATGCAGAGAAATACCGATCAAAACATAGGAGACATCACT
CAGCACTATTGCTTCCATTCAAGCGGTTAATCAGTCCTCTAGCCTTCAAGAAGGTCCGCAAAACTAAATGCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGTTTCTTCGACCTCAACATACCTTACGATGAGCATTCATCTTCATCTTCATCTTCATCTTCAATCAGGGCTAATCGCATCAAGATTGTCGCCAGACTCATGGAGCT
CGGCTACTCTGGTATCGCCTACAACCGTACGATTAAAGGAGTGATGTCCGATCAGGACCGCTGTACTATTCCCCTCCTCAATGTCACATCGCTTCACAGTATCCTTCCGT
CGTTCTCCGCCTCCGTGGAGTTCCACCGCGATCTCCTCGGCGTCCCTAAATCCTCTCCTTTCCGTCAGTACACGCGTCTTACTATTTGTATCAACAGTCAGCAGGAGGTT
ATGGCAGTCAATTCTGGCAACCTCATTCTGAAGACCTACGATTTGATTGCTGTGAAGCCTCTTAATCAGCATGCTTTCGACCAGGCTTGCGAGAATTTGGAGATAGATAT
AATTGCCATTGATTTTGCGGAGAAACGGCCTTTCAGGTTGAAGCAAGGTCAGATAAAATCTGCAATTCAGCGTGGGGTTTACTTCGAAATTATGTACTCTGATCTTCTTA
TGGACGTTCATGAAAGGAGGCAAATGATATCCACTTCAAAGTTATTGGTGGATTGGACGAAAGGAAAGAATCTCATATTATCCAGTGCGGCCCCCACTGTAAATGAAATC
AGAGGACCTTATGATGTTGCAAACTTGTCATCATTGCTTGGTGTCTCTATGGAACGAGGAAAAGCTGCTGTTTCGAAAAATTGTAGGAATCTTATAGCTAATGCCCTAAA
GAGAAAGCAGTTCTACAAGGAGACAATTCGAGTTGAAAAGATATCATCAGATGATAAATTAGATCTGAATGACCCTTGGTCAGTGGATTTGTTCAAATATGATCCTATAT
CAAGTGGCGAAGGTGATTTGCTATTGGATGATATAGCAAAATCGTTTGCTGCCTCTAACAATAAATCAAAAAATGTGAAAGCCATTGATTTCACTTCAGTCATTGACAAC
ATGCCACCACAAGGTTTTCTAGTCAAGAATGTAATATCAGGCTCTGAGCCAAAACTGTCGTTGAATGATGACAGAGACTTGTCGCATGCTGCTGATGTCATTGAGCCACC
GATTGGATTAAATATTGAACAACCCCATACTTTTGATGGAGAACACTGTTCTACATCTGATCGTCAATGCTCTGTAATTGAAAGTTTTGAAATTTTACATTCACATGGTA
ATGCAGGAAGAATTTTAAGTAATTCTGAGGAAGACAAAAGCACTGTTGAAGAAATTCTGCAGCCCAAGACCTCAATGCAGGAAGAAATTGTTGAAATGGACATTGATAAT
GTGCAACCACAAAACCTGGTACCCAGTATTGAGTTGAATGTAGTATCGACAAATGAGCTTGTGCATTCTCCTACATCTACTGGAGATGTATTAGCTGTTGTCTTCGGGAA
TGATGGAAAAGAAACTTTTAAAATGTCAGATGATGTTGAATCTCATCAGAATGAGTATGGTTTAAAAAGTTCTGATACATTGTCTGGTTCAGAAAATGAGTTGAGGGAAA
ATGTGCTAACACGAAACCCACTACCCAGTAGCGATTTGAATGTTGTGTCAACAAATGAGCTTGTGCATTCTCCTACATCTACCAAAGATGTATTAGCTGTTGTTTGCGGG
AATGATGGAAAAGAAACTTTTAAAATGTTAGAGGATGTTGATTCTCATCAGAATGAGTATGTCTTACAAAGTTCTGATGCATTGTCTGGTTTAGAAAATGTGTTGAGGGA
CAAAAGTTCAACTAATTTGGTTTCAGAAAATGAAAAGAGTGTGACTATGGTAATAGATGGTTCATTTACAGCTGAAGAATGTTTACATGGTGCAAGGCTTGGAGAGCCTG
GGGATGTGGCAGTAGACGAGGATCAGGTTTCTCCTCTTAGTTCTTGTATGAGTGACATAAAGGATGATTATTTGATCTCGATTCGACAACAACCATCTGAGGTGTTGATG
GAAGAGCAAAAAAGTGAAGAAGCTGACCGTCAGATCACACCACCACCTTTAGATCAATCTATATCTGATGTGATTTCAACAGGGAAGCATGGAGCGAAACGGAGGAGGCA
TCGTCCATCATTAAAACTTCCGCTCAAGCGGCTAATCAATCCTCTAGCCTTCAAGAAAGTCCGTGAAACGAATGCAGAGAAATACCGATCAAAACATAGGAGACATCACT
CAGCACTATTGCTTCCATTCAAGCGGTTAATCAGTCCTCTAGCCTTCAAGAAGGTCCGCAAAACTAAATGCTAG
Protein sequenceShow/hide protein sequence
MGFFDLNIPYDEHSSSSSSSSSIRANRIKIVARLMELGYSGIAYNRTIKGVMSDQDRCTIPLLNVTSLHSILPSFSASVEFHRDLLGVPKSSPFRQYTRLTICINSQQEV
MAVNSGNLILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFAEKRPFRLKQGQIKSAIQRGVYFEIMYSDLLMDVHERRQMISTSKLLVDWTKGKNLILSSAAPTVNEI
RGPYDVANLSSLLGVSMERGKAAVSKNCRNLIANALKRKQFYKETIRVEKISSDDKLDLNDPWSVDLFKYDPISSGEGDLLLDDIAKSFAASNNKSKNVKAIDFTSVIDN
MPPQGFLVKNVISGSEPKLSLNDDRDLSHAADVIEPPIGLNIEQPHTFDGEHCSTSDRQCSVIESFEILHSHGNAGRILSNSEEDKSTVEEILQPKTSMQEEIVEMDIDN
VQPQNLVPSIELNVVSTNELVHSPTSTGDVLAVVFGNDGKETFKMSDDVESHQNEYGLKSSDTLSGSENELRENVLTRNPLPSSDLNVVSTNELVHSPTSTKDVLAVVCG
NDGKETFKMLEDVDSHQNEYVLQSSDALSGLENVLRDKSSTNLVSENEKSVTMVIDGSFTAEECLHGARLGEPGDVAVDEDQVSPLSSCMSDIKDDYLISIRQQPSEVLM
EEQKSEEADRQITPPPLDQSISDVISTGKHGAKRRRHRPSLKLPLKRLINPLAFKKVRETNAEKYRSKHRRHHSALLLPFKRLISPLAFKKVRKTKC