; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0010011 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0010011
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionPolymerase/histidinol phosphatase-like, putative isoform 1
Genome locationchr9:43922351..43925873
RNA-Seq ExpressionLag0010011
SyntenyLag0010011
Gene Ontology termsGO:0008033 - tRNA processing (biological process)
GO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR002738 - RNase P subunit p30
IPR016195 - Polymerase/histidinol phosphatase-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6605499.1 Protein GAMETOPHYTE DEFECTIVE 1, partial [Cucurbita argyrosperma subsp. sororia]1.5e-28471.97Show/hide
Query:  MGFFDLNIPYDEHSSSSSSSSSSSIRANRIKIVARLMELGYSGIAYNRTIKGVMSDQDRCTIPLLNVSSLHSILPSFSASVEFHRDLLGVPKSSPFRQYT
        MGFFDLNIPYDEH      SSSSSIR NRIKIVA+LMELGYSGIAYNRTIKGVMSD+DRCTIPLLNVSSL SILP+FSAS+EFHR+LLGVP+SSPFRQYT
Subjt:  MGFFDLNIPYDEHSSSSSSSSSSSIRANRIKIVARLMELGYSGIAYNRTIKGVMSDQDRCTIPLLNVSSLHSILPSFSASVEFHRDLLGVPKSSPFRQYT

Query:  RLTICINSMQEVMAVNSGNLILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFSVKRPFKLKLGPIKSAIQRGVYFEIMYSDFLMDVHGRRQMISTSKL
        RLTICINS+QE++AVNSGNLILKTYDLIAVKPLNQ+AF+QACE LEIDIIAIDF+ K PF+LK G IKSAIQRGVYFEIMYSD L DVH RRQMIS +K+
Subjt:  RLTICINSMQEVMAVNSGNLILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFSVKRPFKLKLGPIKSAIQRGVYFEIMYSDFLMDVHGRRQMISTSKL

Query:  LVDWTKGKNLILSSAAPTVNEIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVEKISSDDKLDLNDPWSVDLFKYDPISSGEG
        LVDWTKGKNLILSSAA +VNEIRGP+DVANLSSLLGVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVE+ISSDDKLDLNDPWSVDLFK+DPISSGEG
Subjt:  LVDWTKGKNLILSSAAPTVNEIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVEKISSDDKLDLNDPWSVDLFKYDPISSGEG

Query:  DLLLDDIAKSFAASNNKSKNVKAIDFTSVIDNMPPQGFLVKNVISGSEPKLPLNDDRDLSHAADVIEPPIGLN--IEQPHTFDGEHCSTSDRQCSVIESF
        DLLLDDIAKSFAAS NKSK VKAIDFTSVIDNMPPQGFLVKNV SGSE K+ LND+R  SH  D IEPPI +N  I+Q    DG+HCSTSDRQCS     
Subjt:  DLLLDDIAKSFAASNNKSKNVKAIDFTSVIDNMPPQGFLVKNVISGSEPKLPLNDDRDLSHAADVIEPPIGLN--IEQPHTFDGEHCSTSDRQCSVIESF

Query:  EILHSHGNAGRILSNSEEDRSTVEEILQPKTSMQEEIVEMDIDNVQPQNLVPSIELNVVSTNELVHSPTSTGDVLAVFFGNDGKETFKMSDDVESHQNEY
              G+AG + SNS+ED+ TVEEI+QPK SMQE+ + MDIDNVQPQ                                                    
Subjt:  EILHSHGNAGRILSNSEEDRSTVEEILQPKTSMQEEIVEMDIDNVQPQNLVPSIELNVVSTNELVHSPTSTGDVLAVFFGNDGKETFKMSDDVESHQNEY

Query:  GLKSSDTLSGSENVLRENVLTRNPLPSSDLNVVSTNELVHSPTSTEDVLAVVCGNDGKETFKMLEDVDSHQNEYVLKSSDTLSGLENVLRDKSSTNLVSE
                              NPLPSS+LNV STN  VH PTST DVL VV  NDGKETF M EDV SH++EY LKSSDTLSGLE+VLRDKSSTNLVSE
Subjt:  GLKSSDTLSGSENVLRENVLTRNPLPSSDLNVVSTNELVHSPTSTEDVLAVVCGNDGKETFKMLEDVDSHQNEYVLKSSDTLSGLENVLRDKSSTNLVSE

Query:  NEKSMTMVIDCSFTAEECLHGARLGEPGDVAVDEDQVSPLSSCMSDIKDDYLISIRQQTSEVLMEEQKSEEADRQITPPPSDQSISDVISTGKHGAKWRR
        N K+MT V+D SFTAEECL GARLGEP DVA  ED+VSPLSSCM DIKDDYLISIRQQTSEVLMEEQKS EAD QI PPP DQSIS V+STG   AK +R
Subjt:  NEKSMTMVIDCSFTAEECLHGARLGEPGDVAVDEDQVSPLSSCMSDIKDDYLISIRQQTSEVLMEEQKSEEADRQITPPPSDQSISDVISTGKHGAKWRR

Query:  HRPSLKLPLKRLINPLAFKKVRETNAEKYRSKRRRHHSALLLPFKRLISPLAFKKVRKTK
        H+    L L RL NP  FK+V+ET +EKYRSKRRRH  ALLLPFKRLI+PL FKKVRKTK
Subjt:  HRPSLKLPLKRLINPLAFKKVRETNAEKYRSKRRRHHSALLLPFKRLISPLAFKKVRKTK

KAG7035438.1 Ribonuclease P protein subunit p30, partial [Cucurbita argyrosperma subsp. argyrosperma]2.9e-28371.71Show/hide
Query:  MGFFDLNIPYDEHSSSSSSSSSSSIRANRIKIVARLMELGYSGIAYNRTIKGVMSDQDRCTIPLLNVSSLHSILPSFSASVEFHRDLLGVPKSSPFRQYT
        MGFFDLNIPYDEH      SSSSSIR NRIKIVA+LMELGYSGIAYNRTIKGVMSD+DRCTIPLLNVSSL SILP+FSAS+EFHR+LLGVP+SSPFRQYT
Subjt:  MGFFDLNIPYDEHSSSSSSSSSSSIRANRIKIVARLMELGYSGIAYNRTIKGVMSDQDRCTIPLLNVSSLHSILPSFSASVEFHRDLLGVPKSSPFRQYT

Query:  RLTICINSMQEVMAVNSGNLILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFSVKRPFKLKLGPIKSAIQRGVYFEIMYSDFLMDVHGRRQMISTSKL
        RLTICINS+QE++AVNSGNLILKTYDLIAVKPLNQ+AF+QACE LEIDIIAIDF+ K PF+LK G IKSAIQRGVYFEIMYSD L DVH RRQMIS +K+
Subjt:  RLTICINSMQEVMAVNSGNLILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFSVKRPFKLKLGPIKSAIQRGVYFEIMYSDFLMDVHGRRQMISTSKL

Query:  LVDWTKGKNLILSSAAPTVNEIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVEKISSDDKLDLNDPWSVDLFKYDPISSGEG
        LVDWTKGKNLILSSAA +VNEIRGP+DVAN SSLLGVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVE+ISSDDKLDLNDPWSVDLFK+DPISSGEG
Subjt:  LVDWTKGKNLILSSAAPTVNEIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVEKISSDDKLDLNDPWSVDLFKYDPISSGEG

Query:  DLLLDDIAKSFAASNNKSKNVKAIDFTSVIDNMPPQGFLVKNVISGSEPKLPLNDDRDLSHAADVIEPPIGLN--IEQPHTFDGEHCSTSDRQCSVIESF
        DLLLDDIAKSFAAS NKSK VKAIDFTSVIDNMPPQGFLVKNV SGSE K+ LND+R  SH  D IEPPI +N  I+Q    DG+HCSTSDR CS     
Subjt:  DLLLDDIAKSFAASNNKSKNVKAIDFTSVIDNMPPQGFLVKNVISGSEPKLPLNDDRDLSHAADVIEPPIGLN--IEQPHTFDGEHCSTSDRQCSVIESF

Query:  EILHSHGNAGRILSNSEEDRSTVEEILQPKTSMQEEIVEMDIDNVQPQNLVPSIELNVVSTNELVHSPTSTGDVLAVFFGNDGKETFKMSDDVESHQNEY
              G+AG + SNS+ED+ TVEEI+QPK SMQE+ + MDIDNVQPQ                                                    
Subjt:  EILHSHGNAGRILSNSEEDRSTVEEILQPKTSMQEEIVEMDIDNVQPQNLVPSIELNVVSTNELVHSPTSTGDVLAVFFGNDGKETFKMSDDVESHQNEY

Query:  GLKSSDTLSGSENVLRENVLTRNPLPSSDLNVVSTNELVHSPTSTEDVLAVVCGNDGKETFKMLEDVDSHQNEYVLKSSDTLSGLENVLRDKSSTNLVSE
                              NPLPSS+LNV STN  VH PTST DVL VV  NDGKETF M EDV SH++EY LKSSDTLSGLE+VLRDKSSTNLVSE
Subjt:  GLKSSDTLSGSENVLRENVLTRNPLPSSDLNVVSTNELVHSPTSTEDVLAVVCGNDGKETFKMLEDVDSHQNEYVLKSSDTLSGLENVLRDKSSTNLVSE

Query:  NEKSMTMVIDCSFTAEECLHGARLGEPGDVAVDEDQVSPLSSCMSDIKDDYLISIRQQTSEVLMEEQKSEEADRQITPPPSDQSISDVISTGKHGAKWRR
        N K+MT V+D SFTAEECL GARLGEP DVA  ED+VSPLSSCM DIKDDYLISIRQQTSEVLMEEQKS EAD QI PPP DQSIS V+STG   AK +R
Subjt:  NEKSMTMVIDCSFTAEECLHGARLGEPGDVAVDEDQVSPLSSCMSDIKDDYLISIRQQTSEVLMEEQKSEEADRQITPPPSDQSISDVISTGKHGAKWRR

Query:  HRPSLKLPLKRLINPLAFKKVRETNAEKYRSKRRRHHSALLLPFKRLISPLAFKKVRKTK
        H+    L L RL NP  FK+V+ET +EKYRSKRRRH  ALLLPFKRLI+PL FKKVRKTK
Subjt:  HRPSLKLPLKRLINPLAFKKVRETNAEKYRSKRRRHHSALLLPFKRLISPLAFKKVRKTK

XP_023007324.1 uncharacterized protein LOC111499856 isoform X1 [Cucurbita maxima]3.0e-28872.24Show/hide
Query:  MGFFDLNIPYDEHSSSSSSSSSSSIRANRIKIVARLMELGYSGIAYNRTIKGVMSDQDRCTIPLLNVSSLHSILPSFSASVEFHRDLLGVPKSSPFRQYT
        MGFFDLNIPYDEH      SSSSSIR NRIKIVA+LMELGYSGIAYNRTIKGVMSD+DRCTIPLLNVSSL SILP+FSAS+EFHR+LLGVP+SSPFRQYT
Subjt:  MGFFDLNIPYDEHSSSSSSSSSSSIRANRIKIVARLMELGYSGIAYNRTIKGVMSDQDRCTIPLLNVSSLHSILPSFSASVEFHRDLLGVPKSSPFRQYT

Query:  RLTICINSMQEVMAVNSGNLILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFSVKRPFKLKLGPIKSAIQRGVYFEIMYSDFLMDVHGRRQMISTSKL
        RLTICINS+QE++AVNSGNLILKTYDLIAVKPLNQ+AF+QACE LEIDIIAIDF+ K PF+LK G IKSAIQRGVYFEIMYSD L DVH RRQMIST+K+
Subjt:  RLTICINSMQEVMAVNSGNLILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFSVKRPFKLKLGPIKSAIQRGVYFEIMYSDFLMDVHGRRQMISTSKL

Query:  LVDWTKGKNLILSSAAPTVNEIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVEKISSDDKLDLNDPWSVDLFKYDPISSGEG
        LVDWTKGKNLILSSAA +VNEIRGP+DVANLSSLLGVSMERAK AVSKNCRNLIANALKRKQFYKETIRVE+I+SDDKLD NDPWSVDLFK+DPISSGEG
Subjt:  LVDWTKGKNLILSSAAPTVNEIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVEKISSDDKLDLNDPWSVDLFKYDPISSGEG

Query:  DLLLDDIAKSFAASNNKSKNVKAIDFTSVIDNMPPQGFLVKNVISGSEPKLPLNDDRDLSHAADVIEPPIGLN--IEQPHTFDGEHCSTSDRQCSVIESF
        DLLLDDIAKSFAA+ NKSK VKAIDFTSVIDNMPPQGFL+KNVI+GSE K+ LND+R LS   D IEPPI +N  I+Q    DG+HCSTSDRQCSVIE+ 
Subjt:  DLLLDDIAKSFAASNNKSKNVKAIDFTSVIDNMPPQGFLVKNVISGSEPKLPLNDDRDLSHAADVIEPPIGLN--IEQPHTFDGEHCSTSDRQCSVIESF

Query:  EILHSHGNAGRILSNSEEDRSTVEEILQPKTSMQEEIVEMDIDNVQPQNLVPSIELNVVSTNELVHSPTSTGDVLAVFFGNDGKETFKMSDDVESHQNEY
        +I  SH +AG++ SNS+ED+ TVEEI+QPK S+QEE +EMDIDNVQPQ                                                    
Subjt:  EILHSHGNAGRILSNSEEDRSTVEEILQPKTSMQEEIVEMDIDNVQPQNLVPSIELNVVSTNELVHSPTSTGDVLAVFFGNDGKETFKMSDDVESHQNEY

Query:  GLKSSDTLSGSENVLRENVLTRNPLPSSDLNVVSTNELVHSPTSTEDVLAVVCGNDGKETFKMLEDVDSHQNEYVLKSSDTLSGLENVLRDKSSTNLVSE
                              NPLPSS+LNVVSTN  VH PTST DVL VV  NDGKETF M EDV SH++EY LKSSDTLSGLE+VLRDKSSTNLVSE
Subjt:  GLKSSDTLSGSENVLRENVLTRNPLPSSDLNVVSTNELVHSPTSTEDVLAVVCGNDGKETFKMLEDVDSHQNEYVLKSSDTLSGLENVLRDKSSTNLVSE

Query:  NEKSMTMVIDCSFTAEECLHGARLGEPGDVAVDEDQVSPLSSCMSDIKDDYLISIRQQTSEVLMEEQKSEEADRQITPPPSDQSISDVISTGKHGAKWRR
        N+ +MT V+D SFTAEECL GARLGEP DVA  ED VSPLSSCM DIKDDYLISIRQQTSEVLMEEQKSEEAD QI PPP DQSIS V+STG   AK +R
Subjt:  NEKSMTMVIDCSFTAEECLHGARLGEPGDVAVDEDQVSPLSSCMSDIKDDYLISIRQQTSEVLMEEQKSEEADRQITPPPSDQSISDVISTGKHGAKWRR

Query:  HRPSLKLPLKRLINPLAFKKVRETNAEKYRSKRRRHHSALLLPFKRLISPLAFKKVRKTK
        H+    L L +L NP  FK+V+ET +EKYRSKRRRH  ALLLPFKRLI+PL FKKVRKTK
Subjt:  HRPSLKLPLKRLINPLAFKKVRETNAEKYRSKRRRHHSALLLPFKRLISPLAFKKVRKTK

XP_023007325.1 uncharacterized protein LOC111499856 isoform X2 [Cucurbita maxima]6.2e-28672.09Show/hide
Query:  MGFFDLNIPYDEHSSSSSSSSSSSIRANRIKIVARLMELGYSGIAYNRTIKGVMSDQDRCTIPLLNVSSLHSILPSFSASVEFHRDLLGVPKSSPFRQYT
        MGFFDLNIPYDEH      SSSSSIR NRIKIVA+LMELGYSGIAYNRTIKGVMSD+DRCTIPLLNVSSL SILP+FSAS+EFHR+LLGVP+SSPFRQYT
Subjt:  MGFFDLNIPYDEHSSSSSSSSSSSIRANRIKIVARLMELGYSGIAYNRTIKGVMSDQDRCTIPLLNVSSLHSILPSFSASVEFHRDLLGVPKSSPFRQYT

Query:  RLTICINSMQEVMAVNSGNLILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFSVKRPFKLKLGPIKSAIQRGVYFEIMYSDFLMDVHGRRQMISTSKL
        RLTICINS+QE++AVNSGNLILKTYDLIAVKPLNQ+AF+QACE LEIDIIAIDF+ K PF+LK G IKSAIQRGVYFEIMYSD L DVH RRQMIST+K+
Subjt:  RLTICINSMQEVMAVNSGNLILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFSVKRPFKLKLGPIKSAIQRGVYFEIMYSDFLMDVHGRRQMISTSKL

Query:  LVDWTKGKNLILSSAAPTVNEIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVEKISSDDKLDLNDPWSVDLFKYDPISSGEG
        LVDWTKGKNLILSSAA +VNEIRGP+DVANLSSLLGVSMERAK AVSKNCRNLIANALKRKQFYKETIRVE+I+SDDKLD NDPWSVDLFK+DPISSGEG
Subjt:  LVDWTKGKNLILSSAAPTVNEIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVEKISSDDKLDLNDPWSVDLFKYDPISSGEG

Query:  DLLLDDIAKSFAASNNKSKNVKAIDFTSVIDNMPPQGFLVKNVISGSEPKLPLNDDRDLSHAADVIEPPIGLN--IEQPHTFDGEHCSTSDRQCSVIESF
        DLLLDDIAKSFAA+ NKSK VKAIDFTSVIDNMPPQGFL+KNVI+GSE K+ LND+R LS   D IEPPI +N  I+Q    DG+HCSTSDRQCSVIE+ 
Subjt:  DLLLDDIAKSFAASNNKSKNVKAIDFTSVIDNMPPQGFLVKNVISGSEPKLPLNDDRDLSHAADVIEPPIGLN--IEQPHTFDGEHCSTSDRQCSVIESF

Query:  EILHSHGNAGRILSNSEEDRSTVEEILQPKTSMQEEIVEMDIDNVQPQNLVPSIELNVVSTNELVHSPTSTGDVLAVFFGNDGKETFKMSDDVESHQNEY
        +I  SH +AG++ SNS+ED+ TVEEI+QPK S+QEE +EMDIDNVQPQ                                                    
Subjt:  EILHSHGNAGRILSNSEEDRSTVEEILQPKTSMQEEIVEMDIDNVQPQNLVPSIELNVVSTNELVHSPTSTGDVLAVFFGNDGKETFKMSDDVESHQNEY

Query:  GLKSSDTLSGSENVLRENVLTRNPLPSSDLNVVSTNELVHSPTSTEDVLAVVCGNDGKETFKMLEDVDSHQNEYVLKSSDTLSGLENVLRDKSSTNLVSE
                              NPLPSS+LNVVSTN  VH PTST DVL VV  NDGKETF M EDV SH++EY LKSSDTLSGLE+VLRDKSSTNLVSE
Subjt:  GLKSSDTLSGSENVLRENVLTRNPLPSSDLNVVSTNELVHSPTSTEDVLAVVCGNDGKETFKMLEDVDSHQNEYVLKSSDTLSGLENVLRDKSSTNLVSE

Query:  NEKSMTMVIDCSFTAEECLHGARLGEPGDVAVDEDQVSPLSSCMSDIKDDYLISIRQQTSEVLMEEQKSEEADRQITPPPSDQSISDVISTGKHGAKWRR
        N+ +MT V+D SFTAEECL GARLGEP DVA  ED VSPLSSCM DIKDDYLISIRQQTSEVLMEEQKSEEAD QI PPP DQSIS V+STG   AK +R
Subjt:  NEKSMTMVIDCSFTAEECLHGARLGEPGDVAVDEDQVSPLSSCMSDIKDDYLISIRQQTSEVLMEEQKSEEADRQITPPPSDQSISDVISTGKHGAKWRR

Query:  HRPSLKLPLKRLINPLAFKKVRETNAEKYRSKRRRHHSALLLPFKRLISPLAFKKV
        H+    L L +L NP  FK+V+ET +EKYRSKRRRH  ALLLPFKRLI+PL FKKV
Subjt:  HRPSLKLPLKRLINPLAFKKVRETNAEKYRSKRRRHHSALLLPFKRLISPLAFKKV

XP_038900491.1 protein GAMETOPHYTE DEFECTIVE 1 [Benincasa hispida]1.7e-28370.42Show/hide
Query:  MGFFDLNIPYDEHSSSSSSSSSSSIRANRIKIVARLMELGYSGIAYNRTIKGVMSDQDRCTIPLLNVSSLHSILPSFSASVEFHRDLLGVPKSSPFRQYT
        MGFFDLNIPYDEHSSSSS+      R+NRIKIVA+LMELGYSGIAYNRTIKGVMS++DRC+IPLLNVSSLHSILPSFSASVEFHRDLLGV +SSPFRQYT
Subjt:  MGFFDLNIPYDEHSSSSSSSSSSSIRANRIKIVARLMELGYSGIAYNRTIKGVMSDQDRCTIPLLNVSSLHSILPSFSASVEFHRDLLGVPKSSPFRQYT

Query:  RLTICINSMQEVMAVNSGNLILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFSVKRPFKLKLGPIKSAIQRGVYFEIMYSDFLMDVHGRRQMISTSKL
        RLTI IN+  E++AVNSGNLILKTYDLIAVKPLNQHAF+QACE LEIDIIAIDF+ K PF+LK G IKSAIQRGVYFEIMYSD L DVH RRQMIST+K+
Subjt:  RLTICINSMQEVMAVNSGNLILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFSVKRPFKLKLGPIKSAIQRGVYFEIMYSDFLMDVHGRRQMISTSKL

Query:  LVDWTKGKNLILSSAAPTVNEIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVEKISSDDKLDLNDPWSVDLFKYDPISSGEG
        LVDWT GKNLILSSAAP+VNEIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALKRK FYKETIR+E+ISSDDKLDLNDPWSVDLFK+DPISSGEG
Subjt:  LVDWTKGKNLILSSAAPTVNEIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVEKISSDDKLDLNDPWSVDLFKYDPISSGEG

Query:  DLLLDDIAKSFAASNNKSKNVKAIDFTSVIDNMPPQGFLVKNVISGSEPKLPLNDDRDLSHAADVIEPPIGLN--IEQPHTFDGEHCSTSDRQCSVIESF
        DLLLDDIAKSFAA N+KSKNVKAIDF+S+IDN+PPQGFLVK+V++GSE KL LND+RDL   +D IEP I +N  I+Q HT DGEHC  SD Q SVIESF
Subjt:  DLLLDDIAKSFAASNNKSKNVKAIDFTSVIDNMPPQGFLVKNVISGSEPKLPLNDDRDLSHAADVIEPPIGLN--IEQPHTFDGEHCSTSDRQCSVIESF

Query:  EILHSHGNAGRILSNSEEDRSTVEEILQPKTSMQEEIVEMDIDNVQPQNLVPSIELNVVSTNELVHSPTSTGDVLAVFFGNDGKETFKMSDDVESHQNEY
        +I HSHG+AG + SNSEE +ST+EEI+QPKTSMQEE ++MD+DN+QP+NL+ S +LNV STNELVHSPTST DV  VF                      
Subjt:  EILHSHGNAGRILSNSEEDRSTVEEILQPKTSMQEEIVEMDIDNVQPQNLVPSIELNVVSTNELVHSPTSTGDVLAVFFGNDGKETFKMSDDVESHQNEY

Query:  GLKSSDTLSGSENVLRENVLTRNPLPSSDLNVVSTNELVHSPTSTEDVLAVVCGNDGKETFKMLEDVDSHQNEYVLKSSDTLSGLENVLRDKSSTNLVSE
                                                              ND  ET KM EDVDSH NEY L+SS  LSGLENV RDKSSTNL+SE
Subjt:  GLKSSDTLSGSENVLRENVLTRNPLPSSDLNVVSTNELVHSPTSTEDVLAVVCGNDGKETFKMLEDVDSHQNEYVLKSSDTLSGLENVLRDKSSTNLVSE

Query:  NEKSMTMVIDCSFTAEECLHGARLGEPGDVAVDEDQVSPLSSCMSDIKDDYLISIRQQTSEVLMEEQKSEEADRQITPPPSDQSISDVIST---GKHGAK
        ++K  TMV+DCSFTAEECLH ARL EPGDVA+  DQVSPL SC SDIKDDYLISIRQQTSE LM++Q+S +AD QIT  PSDQSI+ V+ST    K  +K
Subjt:  NEKSMTMVIDCSFTAEECLHGARLGEPGDVAVDEDQVSPLSSCMSDIKDDYLISIRQQTSEVLMEEQKSEEADRQITPPPSDQSISDVIST---GKHGAK

Query:  WRRHRPSLKLPLKRLINPLAFKKVRETNAEKYRSKRRRHHSALLLPFKRLISPLAFKKVRKTKC
         +RH P  KLPL++LINPL FKKV +T+ EKYRSKRRRHH ALLLPFKR I+ LAFKKV KTKC
Subjt:  WRRHRPSLKLPLKRLINPLAFKKVRETNAEKYRSKRRRHHSALLLPFKRLISPLAFKKVRKTKC

TrEMBL top hitse value%identityAlignment
A0A6J1G517 uncharacterized protein LOC111450984 isoform X11.8e-27566.01Show/hide
Query:  MGFFDLNIPYDEHSSSSSSSSSSSIRANRIKIVARLMELGYSGIAYNRTIKGVMSDQDRCTIPLLNVSSLHSILPSFSASVEFHRDLLGVPKSSPFRQYT
        MGFFDLN+PYDEH      SSSSSIR NRIKIVA+LMELGYSGIAYNRTIKGVMSD+DRCTIPLLNVSSL SILP+FSAS+EFHR+LLGVP+SSPFRQYT
Subjt:  MGFFDLNIPYDEHSSSSSSSSSSSIRANRIKIVARLMELGYSGIAYNRTIKGVMSDQDRCTIPLLNVSSLHSILPSFSASVEFHRDLLGVPKSSPFRQYT

Query:  RLTICINSMQEVMAVNSGNLILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFSVKRPFKLKLGPIKSAIQRGVYFEIMYSDFLMDVHGRRQMISTSKL
        RLTICINS+QE++AVNSGNLILKTYDLIAVKPLNQ+AF+QACE LEIDIIAIDF+ K PF+LK G IKSAIQRGVYFEIMYSD L DVH RRQMIST+K+
Subjt:  RLTICINSMQEVMAVNSGNLILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFSVKRPFKLKLGPIKSAIQRGVYFEIMYSDFLMDVHGRRQMISTSKL

Query:  LVDWTKGKNLILSSAAPTVNEIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVEKISSDDKLDLNDPWSVDLFKYDPISSGEG
        LVDWTKGKNLILSSAA +VNEIRGP+DVANLSSLLGVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVE+ISSDDKLD NDPWSVDLFK+DP+SSGEG
Subjt:  LVDWTKGKNLILSSAAPTVNEIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVEKISSDDKLDLNDPWSVDLFKYDPISSGEG

Query:  DLLLDDIAKSFAASNNKSKNVKAIDFTSVIDNMPPQGFLVKNVISGSEPKLPLNDDRDLSHAADVIEPPIGLN---------------------------
        DLLLDDIA+SFAAS NKSK VKAIDFTSVIDN PPQGFLVKNVISG E K+ LND+R  SH  D IEPPI +N                           
Subjt:  DLLLDDIAKSFAASNNKSKNVKAIDFTSVIDNMPPQGFLVKNVISGSEPKLPLNDDRDLSHAADVIEPPIGLN---------------------------

Query:  ---------------------------------IEQPHTFDGEHCSTSDRQCSVIESFEILHSHGNAGRILSNSEEDRSTVEEILQPKTSMQEEIVEMDI
                                         I+Q    DG+HCSTS+RQCS           G+AG++ SNS+ED+ TVEEI+QPK SMQE+ + MDI
Subjt:  ---------------------------------IEQPHTFDGEHCSTSDRQCSVIESFEILHSHGNAGRILSNSEEDRSTVEEILQPKTSMQEEIVEMDI

Query:  DNVQPQNLVPSIELNVVSTNELVHSPTSTGDVLAVFFGNDGKETFKMSDDVESHQNEYGLKSSDTLSGSENVLRENVLTRNPLPSSDLNVVSTNELVHSP
        DNVQPQ                                                                          NPLPSS+LNV STN  VH P
Subjt:  DNVQPQNLVPSIELNVVSTNELVHSPTSTGDVLAVFFGNDGKETFKMSDDVESHQNEYGLKSSDTLSGSENVLRENVLTRNPLPSSDLNVVSTNELVHSP

Query:  TSTEDVLAVVCGNDGKETFKMLEDVDSHQNEYVLKSSDTLSGLENVLRDKSSTNLVSENEKSMTMVIDCSFTAEECLHGARLGEPGDVAVDEDQVSPLSS
        TST DVL V+  NDGKE F M EDV SH++EY LKSSDTLSGLE+VLRDKSSTNLVSEN K+MT V+D SFTAEECL GARLGEP DVA  ED+VSPLSS
Subjt:  TSTEDVLAVVCGNDGKETFKMLEDVDSHQNEYVLKSSDTLSGLENVLRDKSSTNLVSENEKSMTMVIDCSFTAEECLHGARLGEPGDVAVDEDQVSPLSS

Query:  CMSDIKDDYLISIRQQTSEVLMEEQKSEEADRQITPPPSDQSISDVISTGKHGAKWRRHRPSLKLPLKRLINPLAFKKVRETNAEKYRSKRRRHHSALLL
        CM DIKDDYLISIRQQTSEVLMEEQKS EAD QI PPP DQSIS V+STG   AK +RH+    L L RL NP  FK+V+ET +EKYRSKRRRH  ALLL
Subjt:  CMSDIKDDYLISIRQQTSEVLMEEQKSEEADRQITPPPSDQSISDVISTGKHGAKWRRHRPSLKLPLKRLINPLAFKKVRETNAEKYRSKRRRHHSALLL

Query:  PFKRLISPLAFKKVRKTK
        PFKRLI+PL FKKVRKTK
Subjt:  PFKRLISPLAFKKVRKTK

A0A6J1G550 uncharacterized protein LOC111450984 isoform X25.9e-27465.81Show/hide
Query:  MGFFDLNIPYDEHSSSSSSSSSSSIRANRIKIVARLMELGYSGIAYNRTIKGVMSDQDRCTIPLLNVSSLHSILPSFSASVEFHRDLLGVPKSSPFRQYT
        MGFFDLN+PYDEH      SSSSSIR NRIKIVA+LMELGYSGIAYNRTIKGVMSD+DRCTIPLLNVSSL SILP+FSAS+EFHR+LLGVP+SSPFRQYT
Subjt:  MGFFDLNIPYDEHSSSSSSSSSSSIRANRIKIVARLMELGYSGIAYNRTIKGVMSDQDRCTIPLLNVSSLHSILPSFSASVEFHRDLLGVPKSSPFRQYT

Query:  RLTICINSMQEVMAVNSGNLILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFSVKRPFKLKLGPIKSAIQRGVYFEIMYSDFLMDVHGRRQMISTSKL
        RLTICINS+QE++AVNSGNLILKTYDLIAVKPLNQ+AF+QACE LEIDIIAIDF+ K PF+LK G IKSAIQRGVYFEIMYSD L DVH RRQMIST+K+
Subjt:  RLTICINSMQEVMAVNSGNLILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFSVKRPFKLKLGPIKSAIQRGVYFEIMYSDFLMDVHGRRQMISTSKL

Query:  LVDWTKGKNLILSSAAPTVNEIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVEKISSDDKLDLNDPWSVDLFKYDPISSGEG
        LVDWTKGKNLILSSAA +VNEIRGP+DVANLSSLLGVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVE+ISSDDKLD NDPWSVDLFK+DP+SSGEG
Subjt:  LVDWTKGKNLILSSAAPTVNEIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVEKISSDDKLDLNDPWSVDLFKYDPISSGEG

Query:  DLLLDDIAKSFAASNNKSKNVKAIDFTSVIDNMPPQGFLVKNVISGSEPKLPLNDDRDLSHAADVIEPPIGLN---------------------------
        DLLLDDIA+SFAAS NKSK VKAIDFTSVIDN PPQGFLVKNVISG E K+ LND+R  SH  D IEPPI +N                           
Subjt:  DLLLDDIAKSFAASNNKSKNVKAIDFTSVIDNMPPQGFLVKNVISGSEPKLPLNDDRDLSHAADVIEPPIGLN---------------------------

Query:  ---------------------------------IEQPHTFDGEHCSTSDRQCSVIESFEILHSHGNAGRILSNSEEDRSTVEEILQPKTSMQEEIVEMDI
                                         I+Q    DG+HCSTS+RQCS           G+AG++ SNS+ED+ TVEEI+QPK SMQE+ + MDI
Subjt:  ---------------------------------IEQPHTFDGEHCSTSDRQCSVIESFEILHSHGNAGRILSNSEEDRSTVEEILQPKTSMQEEIVEMDI

Query:  DNVQPQNLVPSIELNVVSTNELVHSPTSTGDVLAVFFGNDGKETFKMSDDVESHQNEYGLKSSDTLSGSENVLRENVLTRNPLPSSDLNVVSTNELVHSP
        DNVQPQ                                                                          NPLPSS+LNV STN  VH P
Subjt:  DNVQPQNLVPSIELNVVSTNELVHSPTSTGDVLAVFFGNDGKETFKMSDDVESHQNEYGLKSSDTLSGSENVLRENVLTRNPLPSSDLNVVSTNELVHSP

Query:  TSTEDVLAVVCGNDGKETFKMLEDVDSHQNEYVLKSSDTLSGLENVLRDKSSTNLVSENEKSMTMVIDCSFTAEECLHGARLGEPGDVAVDEDQVSPLSS
        TST DVL V+  NDGKE F M EDV SH++EY LKSSDTLSGLE+VLRDKSSTNLVSEN K+MT V+D SFTAEECL GARLGEP DVA  ED+VSPLSS
Subjt:  TSTEDVLAVVCGNDGKETFKMLEDVDSHQNEYVLKSSDTLSGLENVLRDKSSTNLVSENEKSMTMVIDCSFTAEECLHGARLGEPGDVAVDEDQVSPLSS

Query:  CMSDIKDDYLISIRQQTSEVLMEEQKSEEADRQITPPPSDQSISDVISTGKHGAKWRRHRPSLKLPLKRLINPLAFKKVRETNAEKYRSKRRRHHSALLL
        CM DIKDDYLISIRQQTSEVLMEEQKS EAD QI PPP DQSIS V+STG   AK +RH+    L L RL NP  FK+V+ET +EKYRSKRRRH  ALLL
Subjt:  CMSDIKDDYLISIRQQTSEVLMEEQKSEEADRQITPPPSDQSISDVISTGKHGAKWRRHRPSLKLPLKRLINPLAFKKVRETNAEKYRSKRRRHHSALLL

Query:  PFKRLISPLAFKKVRK
        PFKRLI+PL FKKV+K
Subjt:  PFKRLISPLAFKKVRK

A0A6J1L082 uncharacterized protein LOC111499856 isoform X35.3e-28371.45Show/hide
Query:  MGFFDLNIPYDEHSSSSSSSSSSSIRANRIKIVARLMELGYSGIAYNRTIKGVMSDQDRCTIPLLNVSSLHSILPSFSASVEFHRDLLGVPKSSPFRQYT
        MGFFDLNIPYDEH      SSSSSIR NRIKIVA+LMELGYSGIAYNRTIKGVMSD+DRCTIPLLNVSSL SILP+FSAS+EFHR+LLGVP+SSPFRQYT
Subjt:  MGFFDLNIPYDEHSSSSSSSSSSSIRANRIKIVARLMELGYSGIAYNRTIKGVMSDQDRCTIPLLNVSSLHSILPSFSASVEFHRDLLGVPKSSPFRQYT

Query:  RLTICINSMQEVMAVNSGNLILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFSVKRPFKLKLGPIKSAIQRGVYFEIMYSDFLMDVHGRRQMISTSKL
        RLTICINS+QE++AVNSGNLILKTYDLIAVKPLNQ+AF+QACE LEIDIIAIDF+ K PF+LK G IKSAIQRGVYFEIMYSD L DVH RRQMIST+K+
Subjt:  RLTICINSMQEVMAVNSGNLILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFSVKRPFKLKLGPIKSAIQRGVYFEIMYSDFLMDVHGRRQMISTSKL

Query:  LVDWTKGKNLILSSAAPTVNEIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVEKISSDDKLDLNDPWSVDLFKYDPISSGEG
        LVDWTKGKNLILSSAA +VNEIRGP+DVANLSSLLGVSMERAK AVSKNCRNLIANALKRKQFYKETIRVE+I+SDDKLD NDPWSVDLFK+DPISSGEG
Subjt:  LVDWTKGKNLILSSAAPTVNEIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVEKISSDDKLDLNDPWSVDLFKYDPISSGEG

Query:  DLLLDDIAKSFAASNNKSKNVKAIDFTSVIDNMPPQGFLVKNVISGSEPKLPLNDDRDLSHAADVIEPPIGLN--IEQPHTFDGEHCSTSDRQCSVIESF
        DLLLDDIAKSFAA+ NKSK VKAIDFTSVIDNMPPQGFL+KNVI+GSE K+ LND+R LS   D IEPPI +N  I+Q    DG+HCSTSDRQCS     
Subjt:  DLLLDDIAKSFAASNNKSKNVKAIDFTSVIDNMPPQGFLVKNVISGSEPKLPLNDDRDLSHAADVIEPPIGLN--IEQPHTFDGEHCSTSDRQCSVIESF

Query:  EILHSHGNAGRILSNSEEDRSTVEEILQPKTSMQEEIVEMDIDNVQPQNLVPSIELNVVSTNELVHSPTSTGDVLAVFFGNDGKETFKMSDDVESHQNEY
               +AG++ SNS+ED+ TVEEI+QPK S+QEE +EMDIDNVQPQ                                                    
Subjt:  EILHSHGNAGRILSNSEEDRSTVEEILQPKTSMQEEIVEMDIDNVQPQNLVPSIELNVVSTNELVHSPTSTGDVLAVFFGNDGKETFKMSDDVESHQNEY

Query:  GLKSSDTLSGSENVLRENVLTRNPLPSSDLNVVSTNELVHSPTSTEDVLAVVCGNDGKETFKMLEDVDSHQNEYVLKSSDTLSGLENVLRDKSSTNLVSE
                              NPLPSS+LNVVSTN  VH PTST DVL VV  NDGKETF M EDV SH++EY LKSSDTLSGLE+VLRDKSSTNLVSE
Subjt:  GLKSSDTLSGSENVLRENVLTRNPLPSSDLNVVSTNELVHSPTSTEDVLAVVCGNDGKETFKMLEDVDSHQNEYVLKSSDTLSGLENVLRDKSSTNLVSE

Query:  NEKSMTMVIDCSFTAEECLHGARLGEPGDVAVDEDQVSPLSSCMSDIKDDYLISIRQQTSEVLMEEQKSEEADRQITPPPSDQSISDVISTGKHGAKWRR
        N+ +MT V+D SFTAEECL GARLGEP DVA  ED VSPLSSCM DIKDDYLISIRQQTSEVLMEEQKSEEAD QI PPP DQSIS V+STG   AK +R
Subjt:  NEKSMTMVIDCSFTAEECLHGARLGEPGDVAVDEDQVSPLSSCMSDIKDDYLISIRQQTSEVLMEEQKSEEADRQITPPPSDQSISDVISTGKHGAKWRR

Query:  HRPSLKLPLKRLINPLAFKKVRETNAEKYRSKRRRHHSALLLPFKRLISPLAFKKVRKTK
        H+    L L +L NP  FK+V+ET +EKYRSKRRRH  ALLLPFKRLI+PL FKKVRKTK
Subjt:  HRPSLKLPLKRLINPLAFKKVRETNAEKYRSKRRRHHSALLLPFKRLISPLAFKKVRKTK

A0A6J1L2N3 uncharacterized protein LOC111499856 isoform X23.0e-28672.09Show/hide
Query:  MGFFDLNIPYDEHSSSSSSSSSSSIRANRIKIVARLMELGYSGIAYNRTIKGVMSDQDRCTIPLLNVSSLHSILPSFSASVEFHRDLLGVPKSSPFRQYT
        MGFFDLNIPYDEH      SSSSSIR NRIKIVA+LMELGYSGIAYNRTIKGVMSD+DRCTIPLLNVSSL SILP+FSAS+EFHR+LLGVP+SSPFRQYT
Subjt:  MGFFDLNIPYDEHSSSSSSSSSSSIRANRIKIVARLMELGYSGIAYNRTIKGVMSDQDRCTIPLLNVSSLHSILPSFSASVEFHRDLLGVPKSSPFRQYT

Query:  RLTICINSMQEVMAVNSGNLILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFSVKRPFKLKLGPIKSAIQRGVYFEIMYSDFLMDVHGRRQMISTSKL
        RLTICINS+QE++AVNSGNLILKTYDLIAVKPLNQ+AF+QACE LEIDIIAIDF+ K PF+LK G IKSAIQRGVYFEIMYSD L DVH RRQMIST+K+
Subjt:  RLTICINSMQEVMAVNSGNLILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFSVKRPFKLKLGPIKSAIQRGVYFEIMYSDFLMDVHGRRQMISTSKL

Query:  LVDWTKGKNLILSSAAPTVNEIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVEKISSDDKLDLNDPWSVDLFKYDPISSGEG
        LVDWTKGKNLILSSAA +VNEIRGP+DVANLSSLLGVSMERAK AVSKNCRNLIANALKRKQFYKETIRVE+I+SDDKLD NDPWSVDLFK+DPISSGEG
Subjt:  LVDWTKGKNLILSSAAPTVNEIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVEKISSDDKLDLNDPWSVDLFKYDPISSGEG

Query:  DLLLDDIAKSFAASNNKSKNVKAIDFTSVIDNMPPQGFLVKNVISGSEPKLPLNDDRDLSHAADVIEPPIGLN--IEQPHTFDGEHCSTSDRQCSVIESF
        DLLLDDIAKSFAA+ NKSK VKAIDFTSVIDNMPPQGFL+KNVI+GSE K+ LND+R LS   D IEPPI +N  I+Q    DG+HCSTSDRQCSVIE+ 
Subjt:  DLLLDDIAKSFAASNNKSKNVKAIDFTSVIDNMPPQGFLVKNVISGSEPKLPLNDDRDLSHAADVIEPPIGLN--IEQPHTFDGEHCSTSDRQCSVIESF

Query:  EILHSHGNAGRILSNSEEDRSTVEEILQPKTSMQEEIVEMDIDNVQPQNLVPSIELNVVSTNELVHSPTSTGDVLAVFFGNDGKETFKMSDDVESHQNEY
        +I  SH +AG++ SNS+ED+ TVEEI+QPK S+QEE +EMDIDNVQPQ                                                    
Subjt:  EILHSHGNAGRILSNSEEDRSTVEEILQPKTSMQEEIVEMDIDNVQPQNLVPSIELNVVSTNELVHSPTSTGDVLAVFFGNDGKETFKMSDDVESHQNEY

Query:  GLKSSDTLSGSENVLRENVLTRNPLPSSDLNVVSTNELVHSPTSTEDVLAVVCGNDGKETFKMLEDVDSHQNEYVLKSSDTLSGLENVLRDKSSTNLVSE
                              NPLPSS+LNVVSTN  VH PTST DVL VV  NDGKETF M EDV SH++EY LKSSDTLSGLE+VLRDKSSTNLVSE
Subjt:  GLKSSDTLSGSENVLRENVLTRNPLPSSDLNVVSTNELVHSPTSTEDVLAVVCGNDGKETFKMLEDVDSHQNEYVLKSSDTLSGLENVLRDKSSTNLVSE

Query:  NEKSMTMVIDCSFTAEECLHGARLGEPGDVAVDEDQVSPLSSCMSDIKDDYLISIRQQTSEVLMEEQKSEEADRQITPPPSDQSISDVISTGKHGAKWRR
        N+ +MT V+D SFTAEECL GARLGEP DVA  ED VSPLSSCM DIKDDYLISIRQQTSEVLMEEQKSEEAD QI PPP DQSIS V+STG   AK +R
Subjt:  NEKSMTMVIDCSFTAEECLHGARLGEPGDVAVDEDQVSPLSSCMSDIKDDYLISIRQQTSEVLMEEQKSEEADRQITPPPSDQSISDVISTGKHGAKWRR

Query:  HRPSLKLPLKRLINPLAFKKVRETNAEKYRSKRRRHHSALLLPFKRLISPLAFKKV
        H+    L L +L NP  FK+V+ET +EKYRSKRRRH  ALLLPFKRLI+PL FKKV
Subjt:  HRPSLKLPLKRLINPLAFKKVRETNAEKYRSKRRRHHSALLLPFKRLISPLAFKKV

A0A6J1L4M3 uncharacterized protein LOC111499856 isoform X11.4e-28872.24Show/hide
Query:  MGFFDLNIPYDEHSSSSSSSSSSSIRANRIKIVARLMELGYSGIAYNRTIKGVMSDQDRCTIPLLNVSSLHSILPSFSASVEFHRDLLGVPKSSPFRQYT
        MGFFDLNIPYDEH      SSSSSIR NRIKIVA+LMELGYSGIAYNRTIKGVMSD+DRCTIPLLNVSSL SILP+FSAS+EFHR+LLGVP+SSPFRQYT
Subjt:  MGFFDLNIPYDEHSSSSSSSSSSSIRANRIKIVARLMELGYSGIAYNRTIKGVMSDQDRCTIPLLNVSSLHSILPSFSASVEFHRDLLGVPKSSPFRQYT

Query:  RLTICINSMQEVMAVNSGNLILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFSVKRPFKLKLGPIKSAIQRGVYFEIMYSDFLMDVHGRRQMISTSKL
        RLTICINS+QE++AVNSGNLILKTYDLIAVKPLNQ+AF+QACE LEIDIIAIDF+ K PF+LK G IKSAIQRGVYFEIMYSD L DVH RRQMIST+K+
Subjt:  RLTICINSMQEVMAVNSGNLILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFSVKRPFKLKLGPIKSAIQRGVYFEIMYSDFLMDVHGRRQMISTSKL

Query:  LVDWTKGKNLILSSAAPTVNEIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVEKISSDDKLDLNDPWSVDLFKYDPISSGEG
        LVDWTKGKNLILSSAA +VNEIRGP+DVANLSSLLGVSMERAK AVSKNCRNLIANALKRKQFYKETIRVE+I+SDDKLD NDPWSVDLFK+DPISSGEG
Subjt:  LVDWTKGKNLILSSAAPTVNEIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVEKISSDDKLDLNDPWSVDLFKYDPISSGEG

Query:  DLLLDDIAKSFAASNNKSKNVKAIDFTSVIDNMPPQGFLVKNVISGSEPKLPLNDDRDLSHAADVIEPPIGLN--IEQPHTFDGEHCSTSDRQCSVIESF
        DLLLDDIAKSFAA+ NKSK VKAIDFTSVIDNMPPQGFL+KNVI+GSE K+ LND+R LS   D IEPPI +N  I+Q    DG+HCSTSDRQCSVIE+ 
Subjt:  DLLLDDIAKSFAASNNKSKNVKAIDFTSVIDNMPPQGFLVKNVISGSEPKLPLNDDRDLSHAADVIEPPIGLN--IEQPHTFDGEHCSTSDRQCSVIESF

Query:  EILHSHGNAGRILSNSEEDRSTVEEILQPKTSMQEEIVEMDIDNVQPQNLVPSIELNVVSTNELVHSPTSTGDVLAVFFGNDGKETFKMSDDVESHQNEY
        +I  SH +AG++ SNS+ED+ TVEEI+QPK S+QEE +EMDIDNVQPQ                                                    
Subjt:  EILHSHGNAGRILSNSEEDRSTVEEILQPKTSMQEEIVEMDIDNVQPQNLVPSIELNVVSTNELVHSPTSTGDVLAVFFGNDGKETFKMSDDVESHQNEY

Query:  GLKSSDTLSGSENVLRENVLTRNPLPSSDLNVVSTNELVHSPTSTEDVLAVVCGNDGKETFKMLEDVDSHQNEYVLKSSDTLSGLENVLRDKSSTNLVSE
                              NPLPSS+LNVVSTN  VH PTST DVL VV  NDGKETF M EDV SH++EY LKSSDTLSGLE+VLRDKSSTNLVSE
Subjt:  GLKSSDTLSGSENVLRENVLTRNPLPSSDLNVVSTNELVHSPTSTEDVLAVVCGNDGKETFKMLEDVDSHQNEYVLKSSDTLSGLENVLRDKSSTNLVSE

Query:  NEKSMTMVIDCSFTAEECLHGARLGEPGDVAVDEDQVSPLSSCMSDIKDDYLISIRQQTSEVLMEEQKSEEADRQITPPPSDQSISDVISTGKHGAKWRR
        N+ +MT V+D SFTAEECL GARLGEP DVA  ED VSPLSSCM DIKDDYLISIRQQTSEVLMEEQKSEEAD QI PPP DQSIS V+STG   AK +R
Subjt:  NEKSMTMVIDCSFTAEECLHGARLGEPGDVAVDEDQVSPLSSCMSDIKDDYLISIRQQTSEVLMEEQKSEEADRQITPPPSDQSISDVISTGKHGAKWRR

Query:  HRPSLKLPLKRLINPLAFKKVRETNAEKYRSKRRRHHSALLLPFKRLISPLAFKKVRKTK
        H+    L L +L NP  FK+V+ET +EKYRSKRRRH  ALLLPFKRLI+PL FKKVRKTK
Subjt:  HRPSLKLPLKRLINPLAFKKVRETNAEKYRSKRRRHHSALLLPFKRLISPLAFKKVRKTK

SwissProt top hitse value%identityAlignment
F4JXF1 Protein GAMETOPHYTE DEFECTIVE 15.5e-11258.55Show/hide
Query:  MGFFDLNIPYDEHSSSSSSSSSSSIRANRIKIVARLMELGYSGIAYNRTIKGVMSDQDRCTIPLLNVSSLHSILPSFSASVEFHRDLLGVPKSSPFRQYT
        MGFFDL+IPY+E   S     +   +  R+K+  + MELGY GIA+NR+IKGVMSD+D CTIPLL + SL  + P  ++SV FHRDLLGVP+++PFRQYT
Subjt:  MGFFDLNIPYDEHSSSSSSSSSSSIRANRIKIVARLMELGYSGIAYNRTIKGVMSDQDRCTIPLLNVSSLHSILPSFSASVEFHRDLLGVPKSSPFRQYT

Query:  RLTICINSMQEVMAVNSGNLILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFSVKRPFKLKLGPIKSAIQRGVYFEIMYSDFLMDVHGRRQMISTSKL
        RLT+ + S  +  ++NSGN ILK+YD+IAV+P+NQ+AFD ACE  E+D+I+IDF+ K  F+LK   +K+AIQRG+YFEI YSD LMD   RRQ+IS +KL
Subjt:  RLTICINSMQEVMAVNSGNLILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFSVKRPFKLKLGPIKSAIQRGVYFEIMYSDFLMDVHGRRQMISTSKL

Query:  LVDWTKGKNLILSSAAPTVNEIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVEKISSDDKLDLNDPWSVDLFKYDPISSGEG
        LVDWT+GKNLI+SS AP+V E+RGP DV NL  LLG+S ERA+AA+SKNCRN+IA  LK+K+F+KE +RVE +S+ D   L  P S D  K+D +SSGEG
Subjt:  LVDWTKGKNLILSSAAPTVNEIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVEKISSDDKLDLNDPWSVDLFKYDPISSGEG

Query:  DLLLDDIAKSFAASNNKS-KNVKAIDFTSVIDNMPPQGFLVKNVI
        D+LLDD+AK+F A+N  + K+ KAIDFTSV+D +P  GF VK+++
Subjt:  DLLLDDIAKSFAASNNKS-KNVKAIDFTSVIDNMPPQGFLVKNVI

O88796 Ribonuclease P protein subunit p301.2e-2935.32Show/hide
Query:  SSSSIRANRIKIVARLMELGYSGIAYNRTIKGVMSDQDRCTIPLLNVSSLHSILPSFSASVEFHRDLLGVPKSSPFRQYTRLTICINSMQEVMAVNSGNL
        + S ++A R  +V     LGYS +A N  +     ++ R     + VS L + LP                KS P +  TRLTI +        + + + 
Subjt:  SSSSIRANRIKIVARLMELGYSGIAYNRTIKGVMSDQDRCTIPLLNVSSLHSILPSFSASVEFHRDLLGVPKSSPFRQYTRLTICINSMQEVMAVNSGNL

Query:  ILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFSVKRPFKLKLGPIKSAIQRGVYFEIMYSDFLMDVHGRRQMISTSKLLVDWTKGKNLILSSAAPTVN
         ++ YD++AV P  +  F  AC +L++D++ I  + K PF  K  P+  AI+RG+ FE++Y   + D   RR  IS +  L+   KGKN+ILSSAA    
Subjt:  ILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFSVKRPFKLKLGPIKSAIQRGVYFEIMYSDFLMDVHGRRQMISTSKLLVDWTKGKNLILSSAAPTVN

Query:  EIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVEK
        EIRGPYDVANL  L G+S    KAAVS NCR +  +   RK  +     V+K
Subjt:  EIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVEK

P78346 Ribonuclease P protein subunit p302.0e-2935.71Show/hide
Query:  SSSSIRANRIKIVARLMELGYSGIAYNRTIKGVMSDQDRCTIPLLNVSSLHSILPSFSASVEFHRDLLGVPKSSPFRQYTRLTICINSMQEVMAVNSGNL
        + S ++A R  +V     LGYS +A N  +     ++ +     + VS L + LP                KS P +  TRLTI ++       + + + 
Subjt:  SSSSIRANRIKIVARLMELGYSGIAYNRTIKGVMSDQDRCTIPLLNVSSLHSILPSFSASVEFHRDLLGVPKSSPFRQYTRLTICINSMQEVMAVNSGNL

Query:  ILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFSVKRPFKLKLGPIKSAIQRGVYFEIMYSDFLMDVHGRRQMISTSKLLVDWTKGKNLILSSAAPTVN
          + YD++AV P  +  F  AC +L++D++ I  + K PF  K  PI  AI RG+ FE++YS  + D   RR  IS++  L+   KGKN+I+SSAA    
Subjt:  ILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFSVKRPFKLKLGPIKSAIQRGVYFEIMYSDFLMDVHGRRQMISTSKLLVDWTKGKNLILSSAAPTVN

Query:  EIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVEK
        EIRGPYDVANL  L G+S   AKAAVS NCR  + +   RK  +     V+K
Subjt:  EIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVEK

Q3SZ21 Ribonuclease P protein subunit p306.3e-3136.11Show/hide
Query:  SSSSIRANRIKIVARLMELGYSGIAYNRTIKGVMSDQDRCTIPLLNVSSLHSILPSFSASVEFHRDLLGVPKSSPFRQYTRLTICINSMQEVMAVNSGNL
        + S ++A R  +V     LGYS +A N  ++    ++ +     + VS L + LP                KS P +  TRLTI ++       + + + 
Subjt:  SSSSIRANRIKIVARLMELGYSGIAYNRTIKGVMSDQDRCTIPLLNVSSLHSILPSFSASVEFHRDLLGVPKSSPFRQYTRLTICINSMQEVMAVNSGNL

Query:  ILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFSVKRPFKLKLGPIKSAIQRGVYFEIMYSDFLMDVHGRRQMISTSKLLVDWTKGKNLILSSAAPTVN
         ++ YD++AV P  +  F  AC +L++D++ I  + K PF  K  PI  AI RGV FE++YS  + D   RR  IS +  L+   KGKN+I+SSAA    
Subjt:  ILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFSVKRPFKLKLGPIKSAIQRGVYFEIMYSDFLMDVHGRRQMISTSKLLVDWTKGKNLILSSAAPTVN

Query:  EIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVEK
        EIRGPYDVANL  L G+S   AKAAVS NCR ++ +   RK  +     V+K
Subjt:  EIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVEK

Q3ZE13 Ribonuclease P protein subunit drpp303.9e-2529.77Show/hide
Query:  MGFFDLNIPYDEHSSSSSSSSSSSIRANRIKIVARL-MELGYSGIAYNRTIKGVMSDQDRCTIPLLNVSSLHSILPSFSASVEFHRDLLGVPKSSPFRQY
        M ++DLNI              SS+   +IK +  L  + GY  +A   T++G +  +D C I  + +            S       +G   +   +QY
Subjt:  MGFFDLNIPYDEHSSSSSSSSSSSIRANRIKIVARL-MELGYSGIAYNRTIKGVMSDQDRCTIPLLNVSSLHSILPSFSASVEFHRDLLGVPKSSPFRQY

Query:  TRLTICINSMQEVMAVNSGNLILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFSVKRPFKLKLGPIKSAIQRGVYFEIMYSDFLMDVHGRRQMISTSK
        TRL +   +M E   + + N ++++YD+I+V P +   F+ AC + EIDII ID   K  F +K   ++  I +G++ EI+Y +       R      + 
Subjt:  TRLTICINSMQEVMAVNSGNLILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFSVKRPFKLKLGPIKSAIQRGVYFEIMYSDFLMDVHGRRQMISTSK

Query:  LLVDWTKGKNLILSSAAPTVNEIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALKRK
         LV  + GKN+ILSS+  +   +R PYD++NL  L G++ ++AKAAVSK+    + +A+ R+
Subjt:  LLVDWTKGKNLILSSAAPTVNEIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALKRK

Arabidopsis top hitse value%identityAlignment
AT5G59980.1 Polymerase/histidinol phosphatase-like3.9e-11358.55Show/hide
Query:  MGFFDLNIPYDEHSSSSSSSSSSSIRANRIKIVARLMELGYSGIAYNRTIKGVMSDQDRCTIPLLNVSSLHSILPSFSASVEFHRDLLGVPKSSPFRQYT
        MGFFDL+IPY+E   S     +   +  R+K+  + MELGY GIA+NR+IKGVMSD+D CTIPLL + SL  + P  ++SV FHRDLLGVP+++PFRQYT
Subjt:  MGFFDLNIPYDEHSSSSSSSSSSSIRANRIKIVARLMELGYSGIAYNRTIKGVMSDQDRCTIPLLNVSSLHSILPSFSASVEFHRDLLGVPKSSPFRQYT

Query:  RLTICINSMQEVMAVNSGNLILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFSVKRPFKLKLGPIKSAIQRGVYFEIMYSDFLMDVHGRRQMISTSKL
        RLT+ + S  +  ++NSGN ILK+YD+IAV+P+NQ+AFD ACE  E+D+I+IDF+ K  F+LK   +K+AIQRG+YFEI YSD LMD   RRQ+IS +KL
Subjt:  RLTICINSMQEVMAVNSGNLILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFSVKRPFKLKLGPIKSAIQRGVYFEIMYSDFLMDVHGRRQMISTSKL

Query:  LVDWTKGKNLILSSAAPTVNEIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVEKISSDDKLDLNDPWSVDLFKYDPISSGEG
        LVDWT+GKNLI+SS AP+V E+RGP DV NL  LLG+S ERA+AA+SKNCRN+IA  LK+K+F+KE +RVE +S+ D   L  P S D  K+D +SSGEG
Subjt:  LVDWTKGKNLILSSAAPTVNEIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVEKISSDDKLDLNDPWSVDLFKYDPISSGEG

Query:  DLLLDDIAKSFAASNNKS-KNVKAIDFTSVIDNMPPQGFLVKNVI
        D+LLDD+AK+F A+N  + K+ KAIDFTSV+D +P  GF VK+++
Subjt:  DLLLDDIAKSFAASNNKS-KNVKAIDFTSVIDNMPPQGFLVKNVI

AT5G59980.2 Polymerase/histidinol phosphatase-like1.1e-11236.52Show/hide
Query:  MGFFDLNIPYDEHSSSSSSSSSSSIRANRIKIVARLMELGYSGIAYNRTIKGVMSDQDRCTIPLLNVSSLHSILPSFSASVEFHRDLLGVPKSSPFRQYT
        MGFFDL+IPY+E   S     +   +  R+K+  + MELGY GIA+NR+IKGVMSD+D CTIPLL + SL  + P  ++SV FHRDLLGVP+++PFRQYT
Subjt:  MGFFDLNIPYDEHSSSSSSSSSSSIRANRIKIVARLMELGYSGIAYNRTIKGVMSDQDRCTIPLLNVSSLHSILPSFSASVEFHRDLLGVPKSSPFRQYT

Query:  RLTICINSMQEVMAVNSGNLILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFSVKRPFKLKLGPIKSAIQRGVYFEIMYSDFLMDVHGRRQMISTSKL
        RLT+ + S  +  ++NSGN ILK+YD+IAV+P+NQ+AFD ACE  E+D+I+IDF+ K  F+LK   +K+AIQRG+YFEI YSD LMD   RRQ+IS +KL
Subjt:  RLTICINSMQEVMAVNSGNLILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFSVKRPFKLKLGPIKSAIQRGVYFEIMYSDFLMDVHGRRQMISTSKL

Query:  LVDWTKGKNLILSSAAPTVNEIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVEKISSDDKLDLNDPWSVDLFKYDPISSGEG
        LVDWT+GKNLI+SS AP+V E+RGP DV NL  LLG+S ERA+AA+SKNCRN+IA  LK+K+F+KE +RVE +S+ D   L  P S D  K+D +SSGEG
Subjt:  LVDWTKGKNLILSSAAPTVNEIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVEKISSDDKLDLNDPWSVDLFKYDPISSGEG

Query:  DLLLDDIAKSFAASNNKS-KNVKAIDFTSVIDNMPPQGFLVKNVI---SGSEPKLPLNDDRDLSHAADVIEPPIGL-----NIEQPHTFDGEHCSTSDRQ
        D+LLDD+AK+F A+N  + K+ KAIDFTSV+D +P  GF VK+++   S ++P      D  +  +  V E  +       N+ +  T        S+  
Subjt:  DLLLDDIAKSFAASNNKS-KNVKAIDFTSVIDNMPPQGFLVKNVI---SGSEPKLPLNDDRDLSHAADVIEPPIGL-----NIEQPHTFDGEHCSTSDRQ

Query:  CSVIESFEILHSHGNAGRILSNSEEDRSTVEEILQPKTSMQEEIVEMDIDNVQPQNLVPSIELNVVSTNELVHSPTSTGD----VLAVFFGNDGKETFKM
          V  +  +L     A R  S S      V+          +      +           + +N+ ST+E      S  D       V   N G   F+ 
Subjt:  CSVIESFEILHSHGNAGRILSNSEEDRSTVEEILQPKTSMQEEIVEMDIDNVQPQNLVPSIELNVVSTNELVHSPTSTGD----VLAVFFGNDGKETFKM

Query:  SDDVESHQNE---YGLKSSDTLSGSENVLRENVLTRNPLPSSDLNVVSTNELVHSPTSTEDVLAVVCGNDGKETFKMLEDVDSHQNEYVLKSSDTLSGLE
           V+ +  E    G  S+D +  +E+    ++     +P  +             TS  D + + C ++      M   ++   +E    + D+   L 
Subjt:  SDDVESHQNE---YGLKSSDTLSGSENVLRENVLTRNPLPSSDLNVVSTNELVHSPTSTEDVLAVVCGNDGKETFKMLEDVDSHQNEYVLKSSDTLSGLE

Query:  NVLRDKSSTNLVSENEKSMTMVIDCSFTAEECLHGARLGEPGDVAVDEDQVSPLSSCMSDIKDDYLISIRQQTSEVLMEEQKSEE---ADRQITPPPSDQ
        N+     +T+L+SE+ KS++                    P  V  D D+ S L S  + ++++  +     T E+ ME+ K EE    D ++    ++ 
Subjt:  NVLRDKSSTNLVSENEKSMTMVIDCSFTAEECLHGARLGEPGDVAVDEDQVSPLSSCMSDIKDDYLISIRQQTSEVLMEEQKSEE---ADRQITPPPSDQ

Query:  SISDVISTGKHGAKWRRHRPSLKLPLKRLINPLAFKKVRETNAEKYRSKRRRH
        S    + + ++    +  +   K    RL     FK     +  K  SK+R+H
Subjt:  SISDVISTGKHGAKWRRHRPSLKLPLKRLINPLAFKKVRETNAEKYRSKRRRH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTTTCTTCGACCTCAACATACCTTACGATGAGCATTCATCTTCATCTTCATCTTCATCTTCATCTTCAATCAGGGCTAATCGCATCAAGATTGTGGCCAGACTCAT
GGAGCTCGGCTACTCAGGTATCGCCTACAACCGTACGATTAAAGGAGTGATGTCCGATCAGGACCGCTGTACTATTCCCCTCCTCAATGTCTCGTCGCTTCACAGTATCC
TTCCGTCGTTCTCCGCCTCCGTGGAGTTCCACCGCGATCTCCTCGGCGTCCCTAAATCCTCTCCTTTCCGTCAGTACACGCGTCTTACTATTTGTATCAACAGTATGCAG
GAGGTAATGGCAGTCAATTCTGGCAACCTCATTCTGAAGACCTACGATTTGATTGCTGTGAAGCCTCTTAATCAGCATGCTTTCGATCAGGCTTGCGAGAATTTGGAGAT
AGATATAATTGCCATTGATTTTTCGGTGAAACGGCCTTTCAAGTTGAAGCTAGGTCCGATAAAATCTGCAATTCAGCGTGGGGTTTACTTCGAAATTATGTACTCTGATT
TTCTTATGGACGTTCATGGAAGGAGGCAAATGATATCCACTTCAAAGTTATTGGTGGATTGGACGAAAGGAAAGAATCTCATATTATCCAGTGCGGCCCCCACTGTAAAT
GAAATCAGAGGACCTTATGATGTTGCAAACTTGTCATCATTGCTTGGTGTCTCTATGGAACGAGCAAAAGCTGCCGTTTCGAAAAATTGTAGGAATCTTATAGCTAATGC
CCTAAAGAGAAAGCAGTTCTACAAGGAGACAATTCGAGTTGAAAAGATATCATCAGATGATAAATTAGATCTGAATGACCCTTGGTCAGTGGATTTGTTCAAATATGATC
CTATATCAAGTGGCGAAGGTGATTTGCTATTGGATGATATAGCAAAATCGTTTGCTGCCTCTAACAATAAATCAAAAAATGTGAAAGCCATTGATTTCACTTCAGTCATT
GACAACATGCCACCACAAGGTTTTCTAGTCAAGAATGTAATATCAGGCTCTGAGCCAAAACTGCCATTGAATGATGACAGAGACTTGTCGCATGCTGCTGATGTTATTGA
GCCACCGATTGGATTAAATATTGAACAACCCCATACTTTTGATGGAGAACACTGTTCTACATCTGATCGTCAATGCTCTGTAATTGAAAGTTTTGAAATTTTACATTCAC
ATGGTAATGCAGGAAGAATTTTAAGTAATTCTGAGGAAGACAGAAGCACTGTTGAAGAAATTCTGCAGCCCAAGACCTCAATGCAGGAAGAAATTGTTGAAATGGACATT
GATAATGTGCAACCACAAAACCTGGTACCCAGTATTGAGTTGAATGTAGTATCGACAAATGAGCTTGTGCATTCTCCTACATCTACTGGAGATGTATTAGCTGTTTTCTT
CGGGAATGATGGAAAAGAAACTTTTAAAATGTCAGATGATGTTGAATCTCATCAGAATGAGTATGGTTTAAAAAGTTCTGATACATTGTCTGGTTCGGAAAATGTGTTGA
GGGAAAATGTGCTAACAAGAAACCCACTACCCAGTAGCGATTTGAATGTTGTGTCAACAAATGAGCTTGTGCATTCTCCTACATCTACCGAAGATGTATTAGCTGTTGTT
TGCGGGAATGATGGAAAAGAAACTTTTAAAATGTTAGAGGATGTTGATTCTCATCAGAATGAGTATGTCTTAAAAAGTTCTGATACATTGTCTGGTTTAGAAAATGTGTT
GAGGGACAAAAGTTCAACTAATTTGGTTTCAGAAAATGAAAAGAGTATGACTATGGTAATAGATTGTTCATTTACAGCTGAAGAATGTTTACATGGTGCAAGGCTTGGAG
AGCCTGGGGATGTGGCAGTAGACGAGGATCAGGTTTCTCCTCTTAGTTCTTGTATGAGTGACATAAAGGATGATTATTTGATCTCGATTCGACAACAAACATCTGAGGTG
TTAATGGAAGAGCAAAAAAGTGAAGAAGCTGACCGTCAGATCACACCACCACCTTCAGATCAATCTATATCTGATGTGATTTCAACAGGGAAGCATGGAGCGAAATGGAG
GAGGCATCGTCCATCATTAAAACTTCCGCTAAAGCGGCTAATCAATCCTCTAGCCTTCAAGAAAGTCCGTGAAACGAATGCAGAGAAATACCGATCAAAACGGAGGAGAC
ATCACTCAGCACTATTGCTTCCATTCAAGCGGTTAATCAGTCCTCTAGCCTTCAAGAAGGTCCGCAAAACTAAATGCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGTTTCTTCGACCTCAACATACCTTACGATGAGCATTCATCTTCATCTTCATCTTCATCTTCATCTTCAATCAGGGCTAATCGCATCAAGATTGTGGCCAGACTCAT
GGAGCTCGGCTACTCAGGTATCGCCTACAACCGTACGATTAAAGGAGTGATGTCCGATCAGGACCGCTGTACTATTCCCCTCCTCAATGTCTCGTCGCTTCACAGTATCC
TTCCGTCGTTCTCCGCCTCCGTGGAGTTCCACCGCGATCTCCTCGGCGTCCCTAAATCCTCTCCTTTCCGTCAGTACACGCGTCTTACTATTTGTATCAACAGTATGCAG
GAGGTAATGGCAGTCAATTCTGGCAACCTCATTCTGAAGACCTACGATTTGATTGCTGTGAAGCCTCTTAATCAGCATGCTTTCGATCAGGCTTGCGAGAATTTGGAGAT
AGATATAATTGCCATTGATTTTTCGGTGAAACGGCCTTTCAAGTTGAAGCTAGGTCCGATAAAATCTGCAATTCAGCGTGGGGTTTACTTCGAAATTATGTACTCTGATT
TTCTTATGGACGTTCATGGAAGGAGGCAAATGATATCCACTTCAAAGTTATTGGTGGATTGGACGAAAGGAAAGAATCTCATATTATCCAGTGCGGCCCCCACTGTAAAT
GAAATCAGAGGACCTTATGATGTTGCAAACTTGTCATCATTGCTTGGTGTCTCTATGGAACGAGCAAAAGCTGCCGTTTCGAAAAATTGTAGGAATCTTATAGCTAATGC
CCTAAAGAGAAAGCAGTTCTACAAGGAGACAATTCGAGTTGAAAAGATATCATCAGATGATAAATTAGATCTGAATGACCCTTGGTCAGTGGATTTGTTCAAATATGATC
CTATATCAAGTGGCGAAGGTGATTTGCTATTGGATGATATAGCAAAATCGTTTGCTGCCTCTAACAATAAATCAAAAAATGTGAAAGCCATTGATTTCACTTCAGTCATT
GACAACATGCCACCACAAGGTTTTCTAGTCAAGAATGTAATATCAGGCTCTGAGCCAAAACTGCCATTGAATGATGACAGAGACTTGTCGCATGCTGCTGATGTTATTGA
GCCACCGATTGGATTAAATATTGAACAACCCCATACTTTTGATGGAGAACACTGTTCTACATCTGATCGTCAATGCTCTGTAATTGAAAGTTTTGAAATTTTACATTCAC
ATGGTAATGCAGGAAGAATTTTAAGTAATTCTGAGGAAGACAGAAGCACTGTTGAAGAAATTCTGCAGCCCAAGACCTCAATGCAGGAAGAAATTGTTGAAATGGACATT
GATAATGTGCAACCACAAAACCTGGTACCCAGTATTGAGTTGAATGTAGTATCGACAAATGAGCTTGTGCATTCTCCTACATCTACTGGAGATGTATTAGCTGTTTTCTT
CGGGAATGATGGAAAAGAAACTTTTAAAATGTCAGATGATGTTGAATCTCATCAGAATGAGTATGGTTTAAAAAGTTCTGATACATTGTCTGGTTCGGAAAATGTGTTGA
GGGAAAATGTGCTAACAAGAAACCCACTACCCAGTAGCGATTTGAATGTTGTGTCAACAAATGAGCTTGTGCATTCTCCTACATCTACCGAAGATGTATTAGCTGTTGTT
TGCGGGAATGATGGAAAAGAAACTTTTAAAATGTTAGAGGATGTTGATTCTCATCAGAATGAGTATGTCTTAAAAAGTTCTGATACATTGTCTGGTTTAGAAAATGTGTT
GAGGGACAAAAGTTCAACTAATTTGGTTTCAGAAAATGAAAAGAGTATGACTATGGTAATAGATTGTTCATTTACAGCTGAAGAATGTTTACATGGTGCAAGGCTTGGAG
AGCCTGGGGATGTGGCAGTAGACGAGGATCAGGTTTCTCCTCTTAGTTCTTGTATGAGTGACATAAAGGATGATTATTTGATCTCGATTCGACAACAAACATCTGAGGTG
TTAATGGAAGAGCAAAAAAGTGAAGAAGCTGACCGTCAGATCACACCACCACCTTCAGATCAATCTATATCTGATGTGATTTCAACAGGGAAGCATGGAGCGAAATGGAG
GAGGCATCGTCCATCATTAAAACTTCCGCTAAAGCGGCTAATCAATCCTCTAGCCTTCAAGAAAGTCCGTGAAACGAATGCAGAGAAATACCGATCAAAACGGAGGAGAC
ATCACTCAGCACTATTGCTTCCATTCAAGCGGTTAATCAGTCCTCTAGCCTTCAAGAAGGTCCGCAAAACTAAATGCTAG
Protein sequenceShow/hide protein sequence
MGFFDLNIPYDEHSSSSSSSSSSSIRANRIKIVARLMELGYSGIAYNRTIKGVMSDQDRCTIPLLNVSSLHSILPSFSASVEFHRDLLGVPKSSPFRQYTRLTICINSMQ
EVMAVNSGNLILKTYDLIAVKPLNQHAFDQACENLEIDIIAIDFSVKRPFKLKLGPIKSAIQRGVYFEIMYSDFLMDVHGRRQMISTSKLLVDWTKGKNLILSSAAPTVN
EIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALKRKQFYKETIRVEKISSDDKLDLNDPWSVDLFKYDPISSGEGDLLLDDIAKSFAASNNKSKNVKAIDFTSVI
DNMPPQGFLVKNVISGSEPKLPLNDDRDLSHAADVIEPPIGLNIEQPHTFDGEHCSTSDRQCSVIESFEILHSHGNAGRILSNSEEDRSTVEEILQPKTSMQEEIVEMDI
DNVQPQNLVPSIELNVVSTNELVHSPTSTGDVLAVFFGNDGKETFKMSDDVESHQNEYGLKSSDTLSGSENVLRENVLTRNPLPSSDLNVVSTNELVHSPTSTEDVLAVV
CGNDGKETFKMLEDVDSHQNEYVLKSSDTLSGLENVLRDKSSTNLVSENEKSMTMVIDCSFTAEECLHGARLGEPGDVAVDEDQVSPLSSCMSDIKDDYLISIRQQTSEV
LMEEQKSEEADRQITPPPSDQSISDVISTGKHGAKWRRHRPSLKLPLKRLINPLAFKKVRETNAEKYRSKRRRHHSALLLPFKRLISPLAFKKVRKTKC