; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0003017 (gene) of Snake gourd v1 genome

Gene IDTan0003017
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPolymerase/histidinol phosphatase-like, putative isoform 1
Genome locationLG02:93232673..93238072
RNA-Seq ExpressionTan0003017
SyntenyTan0003017
Gene Ontology termsGO:0008033 - tRNA processing (biological process)
GO:0009553 - embryo sac development (biological process)
GO:0048868 - pollen tube development (biological process)
GO:0090502 - RNA phosphodiester bond hydrolysis, endonucleolytic (biological process)
GO:0005655 - nucleolar ribonuclease P complex (cellular component)
GO:0005739 - mitochondrion (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0003824 - catalytic activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6605499.1 Protein GAMETOPHYTE DEFECTIVE 1, partial [Cucurbita argyrosperma subsp. sororia]2.0e-28378.01Show/hide
Query:  MGFFDLNIPYNEHSSSSSIKANRIKIVAKLMEFGYSGIAYNRTIKGVMSDLDRCTIPLLNVSSLHSILPSFFASVEFHRDLLGVPKSSPFRQYTRLTICI
        MGFFDLNIPY+EHSSSSSI+ NRIKIVAKLME GYSGIAYNRTIKGVMSD DRCTIPLLNVSSL SILP+F AS+EFHR+LLGVP+SSPFRQYTRLTICI
Subjt:  MGFFDLNIPYNEHSSSSSIKANRIKIVAKLMEFGYSGIAYNRTIKGVMSDLDRCTIPLLNVSSLHSILPSFFASVEFHRDLLGVPKSSPFRQYTRLTICI

Query:  NNLQEVLAVNSGNLILKTYDLIAVKPLNQYAFEHACEKLEIDIIAIDFAEKLPFRLKQGPIKSAIQRGVYFEIMYSDLLLDVHERRQMISTTKFLVDWTK
        N+LQE+LAVNSGNLILKTYDLIAVKPLNQ AFE ACEKLEIDIIAIDFAEKLPFRLKQG IKSAIQRGVYFEIMYSDLL DVH RRQMIS TK LVDWTK
Subjt:  NNLQEVLAVNSGNLILKTYDLIAVKPLNQYAFEHACEKLEIDIIAIDFAEKLPFRLKQGPIKSAIQRGVYFEIMYSDLLLDVHERRQMISTTKFLVDWTK

Query:  GKNLIFSSAASSVNEIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALRRKQFYKETIRVERISSDDKLDLNNPWSVDLLNWDPISTGEGDLLLDD
        GKNLI SSAASSVNEIRGP+DVANLSSLLGVSMERAKAAVSKNCRNLIANAL+RKQFYKETIRVERISSDDKLDLN+PWSVDL  WDPIS+GEGDLLLDD
Subjt:  GKNLIFSSAASSVNEIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALRRKQFYKETIRVERISSDDKLDLNNPWSVDLLNWDPISTGEGDLLLDD

Query:  IAKSFAASNKKSKNVEAIDFTSVIDNMPPQGFLVNNVIFGSEAKLSLNDNRDLSHIANALEPPIAVNGVIQQPHPLGGDHYSTLDRQCSVIESFKVFHSH
        IAKSFAASN KSK V+AIDFTSVIDNMPPQGFLV NV  GSEAK+SLNDNR  SH+ +A+EPPIAVNGVIQQ  PL G H ST DRQCS           
Subjt:  IAKSFAASNKKSKNVEAIDFTSVIDNMPPQGFLVNNVIFGSEAKLSLNDNRDLSHIANALEPPIAVNGVIQQPHPLGGDHYSTLDRQCSVIESFKVFHSH

Query:  GDARKVSNNFEENKSTVEEIVEPKASFPEEPIEMDADNVQPQNQPSSSELNVVSTNELLLSPTFTRDILAVVFGNDEKNTVQMSEDVDSHRNEYDLKSSD
        GDA  V +N +E+K TVEEIV+PKAS  E+PI MD DNVQPQN   SSELNV STN  +  PT TRD+L VV  ND K T  MSEDV SHR+EY LKSSD
Subjt:  GDARKVSNNFEENKSTVEEIVEPKASFPEEPIEMDADNVQPQNQPSSSELNVVSTNELLLSPTFTRDILAVVFGNDEKNTVQMSEDVDSHRNEYDLKSSD

Query:  TSFGLENVLRNKSSITFVSENQKNMAVVMDGSFTSEECLLGGSLGEIEDVVVAADQVSTLSSCMSDRKDDYLISIRQQTSEVLMEEQKSGEADPQITPPP
        T  GLE+VLR+KSS   VSEN KNM  V+DGSFT+EECL+G  LGE  DV    D+VS LSSCM D KDDYLISIRQQTSEVLMEEQKSGEADPQI PPP
Subjt:  TSFGLENVLRNKSSITFVSENQKNMAVVMDGSFTSEECLLGGSLGEIEDVVVAADQVSTLSSCMSDRKDDYLISIRQQTSEVLMEEQKSGEADPQITPPP

Query:  LDQSICGVMSTVNAR----RRRHHLALKLPLKRLISPLAFKKVRETNAEKYRLSWRRHHPTLLLPFKRIMNSLALKKVRKTK
        LDQSI GVMST   R    R +HH ALK  L RL +P  FK+V+ET +EKYR   RRH P LLLPFKR++N L  KKVRKTK
Subjt:  LDQSICGVMSTVNAR----RRRHHLALKLPLKRLISPLAFKKVRETNAEKYRLSWRRHHPTLLLPFKRIMNSLALKKVRKTK

KAG7035438.1 Ribonuclease P protein subunit p30, partial [Cucurbita argyrosperma subsp. argyrosperma]2.8e-28277.65Show/hide
Query:  MGFFDLNIPYNEHSSSSSIKANRIKIVAKLMEFGYSGIAYNRTIKGVMSDLDRCTIPLLNVSSLHSILPSFFASVEFHRDLLGVPKSSPFRQYTRLTICI
        MGFFDLNIPY+EHSSSSSI+ NRIKIVAKLME GYSGIAYNRTIKGVMSD DRCTIPLLNVSSL SILP+F AS+EFHR+LLGVP+SSPFRQYTRLTICI
Subjt:  MGFFDLNIPYNEHSSSSSIKANRIKIVAKLMEFGYSGIAYNRTIKGVMSDLDRCTIPLLNVSSLHSILPSFFASVEFHRDLLGVPKSSPFRQYTRLTICI

Query:  NNLQEVLAVNSGNLILKTYDLIAVKPLNQYAFEHACEKLEIDIIAIDFAEKLPFRLKQGPIKSAIQRGVYFEIMYSDLLLDVHERRQMISTTKFLVDWTK
        N+LQE+LAVNSGNLILKTYDLIAVKPLNQ AFE ACEKLEIDIIAIDFAEKLPFRLKQG IKSAIQRGVYFEIMYSDLL DVH RRQMIS TK LVDWTK
Subjt:  NNLQEVLAVNSGNLILKTYDLIAVKPLNQYAFEHACEKLEIDIIAIDFAEKLPFRLKQGPIKSAIQRGVYFEIMYSDLLLDVHERRQMISTTKFLVDWTK

Query:  GKNLIFSSAASSVNEIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALRRKQFYKETIRVERISSDDKLDLNNPWSVDLLNWDPISTGEGDLLLDD
        GKNLI SSAASSVNEIRGP+DVAN SSLLGVSMERAKAAVSKNCRNLIANAL+RKQFYKETIRVERISSDDKLDLN+PWSVDL  WDPIS+GEGDLLLDD
Subjt:  GKNLIFSSAASSVNEIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALRRKQFYKETIRVERISSDDKLDLNNPWSVDLLNWDPISTGEGDLLLDD

Query:  IAKSFAASNKKSKNVEAIDFTSVIDNMPPQGFLVNNVIFGSEAKLSLNDNRDLSHIANALEPPIAVNGVIQQPHPLGGDHYSTLDRQCSVIESFKVFHSH
        IAKSFAASN KSK V+AIDFTSVIDNMPPQGFLV NV  GSEAK+SLNDNR  SH+ +A+EPPIAVNGVIQQ  PL G H ST DR CS           
Subjt:  IAKSFAASNKKSKNVEAIDFTSVIDNMPPQGFLVNNVIFGSEAKLSLNDNRDLSHIANALEPPIAVNGVIQQPHPLGGDHYSTLDRQCSVIESFKVFHSH

Query:  GDARKVSNNFEENKSTVEEIVEPKASFPEEPIEMDADNVQPQNQPSSSELNVVSTNELLLSPTFTRDILAVVFGNDEKNTVQMSEDVDSHRNEYDLKSSD
        GDA  V +N +E+K TVEEIV+PKAS  E+PI MD DNVQPQN   SSELNV STN  +  PT TRD+L VV  ND K T  MSEDV SHR+EY LKSSD
Subjt:  GDARKVSNNFEENKSTVEEIVEPKASFPEEPIEMDADNVQPQNQPSSSELNVVSTNELLLSPTFTRDILAVVFGNDEKNTVQMSEDVDSHRNEYDLKSSD

Query:  TSFGLENVLRNKSSITFVSENQKNMAVVMDGSFTSEECLLGGSLGEIEDVVVAADQVSTLSSCMSDRKDDYLISIRQQTSEVLMEEQKSGEADPQITPPP
        T  GLE+VLR+KSS   VSEN KNM  V+DGSFT+EECL+G  LGE  DV    D+VS LSSCM D KDDYLISIRQQTSEVLMEEQKSGEADPQI PPP
Subjt:  TSFGLENVLRNKSSITFVSENQKNMAVVMDGSFTSEECLLGGSLGEIEDVVVAADQVSTLSSCMSDRKDDYLISIRQQTSEVLMEEQKSGEADPQITPPP

Query:  LDQSICGVMS--TVNARRRRHHLALKLPLKRLISPLAFKKVRETNAEKYRLSWRRHHPTLLLPFKRIMNSLALKKVRKTK
        LDQSI GVMS  T  A+R+RH     L L RL +P  FK+V+ET +EKYR   RRH P LLLPFKR++N L  KKVRKTK
Subjt:  LDQSICGVMS--TVNARRRRHHLALKLPLKRLISPLAFKKVRETNAEKYRLSWRRHHPTLLLPFKRIMNSLALKKVRKTK

XP_023007324.1 uncharacterized protein LOC111499856 isoform X1 [Cucurbita maxima]2.3e-28778.38Show/hide
Query:  MGFFDLNIPYNEHSSSSSIKANRIKIVAKLMEFGYSGIAYNRTIKGVMSDLDRCTIPLLNVSSLHSILPSFFASVEFHRDLLGVPKSSPFRQYTRLTICI
        MGFFDLNIPY+EHSSSSSI+ NRIKIVAKLME GYSGIAYNRTIKGVMSD DRCTIPLLNVSSL SILP+F AS+EFHR+LLGVP+SSPFRQYTRLTICI
Subjt:  MGFFDLNIPYNEHSSSSSIKANRIKIVAKLMEFGYSGIAYNRTIKGVMSDLDRCTIPLLNVSSLHSILPSFFASVEFHRDLLGVPKSSPFRQYTRLTICI

Query:  NNLQEVLAVNSGNLILKTYDLIAVKPLNQYAFEHACEKLEIDIIAIDFAEKLPFRLKQGPIKSAIQRGVYFEIMYSDLLLDVHERRQMISTTKFLVDWTK
        N+LQE+LAVNSGNLILKTYDLIAVKPLNQ AFE ACEKLEIDIIAIDFAEKLPFRLKQG IKSAIQRGVYFEIMYSDLL DVH RRQMISTTK LVDWTK
Subjt:  NNLQEVLAVNSGNLILKTYDLIAVKPLNQYAFEHACEKLEIDIIAIDFAEKLPFRLKQGPIKSAIQRGVYFEIMYSDLLLDVHERRQMISTTKFLVDWTK

Query:  GKNLIFSSAASSVNEIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALRRKQFYKETIRVERISSDDKLDLNNPWSVDLLNWDPISTGEGDLLLDD
        GKNLI SSAASSVNEIRGP+DVANLSSLLGVSMERAK AVSKNCRNLIANAL+RKQFYKETIRVERI+SDDKLD N+PWSVDL  WDPIS+GEGDLLLDD
Subjt:  GKNLIFSSAASSVNEIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALRRKQFYKETIRVERISSDDKLDLNNPWSVDLLNWDPISTGEGDLLLDD

Query:  IAKSFAASNKKSKNVEAIDFTSVIDNMPPQGFLVNNVIFGSEAKLSLNDNRDLSHIANALEPPIAVNGVIQQPHPLGGDHYSTLDRQCSVIESFKVFHSH
        IAKSFAA+N KSK V+AIDFTSVIDNMPPQGFL+ NVI GSEAK+SLNDNR LS + +A+EPPIAVNGVIQQ  PL G H ST DRQCSVIE+ K+  SH
Subjt:  IAKSFAASNKKSKNVEAIDFTSVIDNMPPQGFLVNNVIFGSEAKLSLNDNRDLSHIANALEPPIAVNGVIQQPHPLGGDHYSTLDRQCSVIESFKVFHSH

Query:  GDARKVSNNFEENKSTVEEIVEPKASFPEEPIEMDADNVQPQNQPSSSELNVVSTNELLLSPTFTRDILAVVFGNDEKNTVQMSEDVDSHRNEYDLKSSD
         DA KV +N +E+K TVEEIV+PKAS  EEPIEMD DNVQPQN   SSELNVVSTN  +  PT TRD+L VV  ND K T  MSEDV SHR+EY LKSSD
Subjt:  GDARKVSNNFEENKSTVEEIVEPKASFPEEPIEMDADNVQPQNQPSSSELNVVSTNELLLSPTFTRDILAVVFGNDEKNTVQMSEDVDSHRNEYDLKSSD

Query:  TSFGLENVLRNKSSITFVSENQKNMAVVMDGSFTSEECLLGGSLGEIEDVVVAADQVSTLSSCMSDRKDDYLISIRQQTSEVLMEEQKSGEADPQITPPP
        T  GLE+VLR+KSS   VSENQ NM  V+DGSFT+EECL+G  LGE  DV    D VS LSSCM D KDDYLISIRQQTSEVLMEEQKS EADPQI PPP
Subjt:  TSFGLENVLRNKSSITFVSENQKNMAVVMDGSFTSEECLLGGSLGEIEDVVVAADQVSTLSSCMSDRKDDYLISIRQQTSEVLMEEQKSGEADPQITPPP

Query:  LDQSICGVMS--TVNARRRRHHLALKLPLKRLISPLAFKKVRETNAEKYRLSWRRHHPTLLLPFKRIMNSLALKKVRKTK
        +DQSI GVMS  T  A+R+RH     L L +L +P  FK+V+ET +EKYR   RRH P LLLPFKR++N L  KKVRKTK
Subjt:  LDQSICGVMS--TVNARRRRHHLALKLPLKRLISPLAFKKVRETNAEKYRLSWRRHHPTLLLPFKRIMNSLALKKVRKTK

XP_023007325.1 uncharacterized protein LOC111499856 isoform X2 [Cucurbita maxima]2.8e-28578.25Show/hide
Query:  MGFFDLNIPYNEHSSSSSIKANRIKIVAKLMEFGYSGIAYNRTIKGVMSDLDRCTIPLLNVSSLHSILPSFFASVEFHRDLLGVPKSSPFRQYTRLTICI
        MGFFDLNIPY+EHSSSSSI+ NRIKIVAKLME GYSGIAYNRTIKGVMSD DRCTIPLLNVSSL SILP+F AS+EFHR+LLGVP+SSPFRQYTRLTICI
Subjt:  MGFFDLNIPYNEHSSSSSIKANRIKIVAKLMEFGYSGIAYNRTIKGVMSDLDRCTIPLLNVSSLHSILPSFFASVEFHRDLLGVPKSSPFRQYTRLTICI

Query:  NNLQEVLAVNSGNLILKTYDLIAVKPLNQYAFEHACEKLEIDIIAIDFAEKLPFRLKQGPIKSAIQRGVYFEIMYSDLLLDVHERRQMISTTKFLVDWTK
        N+LQE+LAVNSGNLILKTYDLIAVKPLNQ AFE ACEKLEIDIIAIDFAEKLPFRLKQG IKSAIQRGVYFEIMYSDLL DVH RRQMISTTK LVDWTK
Subjt:  NNLQEVLAVNSGNLILKTYDLIAVKPLNQYAFEHACEKLEIDIIAIDFAEKLPFRLKQGPIKSAIQRGVYFEIMYSDLLLDVHERRQMISTTKFLVDWTK

Query:  GKNLIFSSAASSVNEIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALRRKQFYKETIRVERISSDDKLDLNNPWSVDLLNWDPISTGEGDLLLDD
        GKNLI SSAASSVNEIRGP+DVANLSSLLGVSMERAK AVSKNCRNLIANAL+RKQFYKETIRVERI+SDDKLD N+PWSVDL  WDPIS+GEGDLLLDD
Subjt:  GKNLIFSSAASSVNEIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALRRKQFYKETIRVERISSDDKLDLNNPWSVDLLNWDPISTGEGDLLLDD

Query:  IAKSFAASNKKSKNVEAIDFTSVIDNMPPQGFLVNNVIFGSEAKLSLNDNRDLSHIANALEPPIAVNGVIQQPHPLGGDHYSTLDRQCSVIESFKVFHSH
        IAKSFAA+N KSK V+AIDFTSVIDNMPPQGFL+ NVI GSEAK+SLNDNR LS + +A+EPPIAVNGVIQQ  PL G H ST DRQCSVIE+ K+  SH
Subjt:  IAKSFAASNKKSKNVEAIDFTSVIDNMPPQGFLVNNVIFGSEAKLSLNDNRDLSHIANALEPPIAVNGVIQQPHPLGGDHYSTLDRQCSVIESFKVFHSH

Query:  GDARKVSNNFEENKSTVEEIVEPKASFPEEPIEMDADNVQPQNQPSSSELNVVSTNELLLSPTFTRDILAVVFGNDEKNTVQMSEDVDSHRNEYDLKSSD
         DA KV +N +E+K TVEEIV+PKAS  EEPIEMD DNVQPQN   SSELNVVSTN  +  PT TRD+L VV  ND K T  MSEDV SHR+EY LKSSD
Subjt:  GDARKVSNNFEENKSTVEEIVEPKASFPEEPIEMDADNVQPQNQPSSSELNVVSTNELLLSPTFTRDILAVVFGNDEKNTVQMSEDVDSHRNEYDLKSSD

Query:  TSFGLENVLRNKSSITFVSENQKNMAVVMDGSFTSEECLLGGSLGEIEDVVVAADQVSTLSSCMSDRKDDYLISIRQQTSEVLMEEQKSGEADPQITPPP
        T  GLE+VLR+KSS   VSENQ NM  V+DGSFT+EECL+G  LGE  DV    D VS LSSCM D KDDYLISIRQQTSEVLMEEQKS EADPQI PPP
Subjt:  TSFGLENVLRNKSSITFVSENQKNMAVVMDGSFTSEECLLGGSLGEIEDVVVAADQVSTLSSCMSDRKDDYLISIRQQTSEVLMEEQKSGEADPQITPPP

Query:  LDQSICGVMS--TVNARRRRHHLALKLPLKRLISPLAFKKVRETNAEKYRLSWRRHHPTLLLPFKRIMNSLALKKV
        +DQSI GVMS  T  A+R+RH     L L +L +P  FK+V+ET +EKYR   RRH P LLLPFKR++N L  KKV
Subjt:  LDQSICGVMS--TVNARRRRHHLALKLPLKRLISPLAFKKVRETNAEKYRLSWRRHHPTLLLPFKRIMNSLALKKV

XP_023532173.1 uncharacterized protein LOC111794415 isoform X2 [Cucurbita pepo subsp. pepo]1.4e-28177.29Show/hide
Query:  MGFFDLNIPYNEHSSSSSIKANRIKIVAKLMEFGYSGIAYNRTIKGVMSDLDRCTIPLLNVSSLHSILPSFFASVEFHRDLLGVPKSSPFRQYTRLTICI
        MGFFDLNIPY+EHSSSSSI+ NRIKIVAKLME GYSGIAYNRTIKGVMSD DRCTIPLLNVSSL SILP+F AS+EFHR+LLGVP+SSPFRQYTRLTICI
Subjt:  MGFFDLNIPYNEHSSSSSIKANRIKIVAKLMEFGYSGIAYNRTIKGVMSDLDRCTIPLLNVSSLHSILPSFFASVEFHRDLLGVPKSSPFRQYTRLTICI

Query:  NNLQEVLAVNSGNLILKTYDLIAVKPLNQYAFEHACEKLEIDIIAIDFAEKLPFRLKQGPIKSAIQRGVYFEIMYSDLLLDVHERRQMISTTKFLVDWTK
        N+LQE+LAVNSGNL+LKTYDLIAVKPLNQ AFE ACEKLEIDIIAIDFAEKLPFRLKQG IKSAIQRGVYFEIMYSDLL DVH RRQMIS TK LVDWTK
Subjt:  NNLQEVLAVNSGNLILKTYDLIAVKPLNQYAFEHACEKLEIDIIAIDFAEKLPFRLKQGPIKSAIQRGVYFEIMYSDLLLDVHERRQMISTTKFLVDWTK

Query:  GKNLIFSSAASSVNEIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALRRKQFYKETIRVERISSDDKLDLNNPWSVDLLNWDPISTGEGDLLLDD
        GKNLI SSAASSVNEIRGP+DVANLSSLLGVSMERAKAAVSKNCRNLIANAL+RKQFYKETIRVERISSDDKLD N+PWSVDL  WDPIS+GEGDLLLDD
Subjt:  GKNLIFSSAASSVNEIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALRRKQFYKETIRVERISSDDKLDLNNPWSVDLLNWDPISTGEGDLLLDD

Query:  IAKSFAASNKKSKNVEAIDFTSVIDNMPPQGFLVNNVIFGSEAKLSLNDNRDLSHIANALEPPIAVNGVIQQPHPLGGDHYSTLDRQCSVIESFKVFHSH
        IAKSFAASN  SK V+AIDFTSVIDNMPPQGFLV NVI GSEAK+SLNDNR  SH+ +A+EPPIAVNGVIQQ  PL G H ST DRQCS           
Subjt:  IAKSFAASNKKSKNVEAIDFTSVIDNMPPQGFLVNNVIFGSEAKLSLNDNRDLSHIANALEPPIAVNGVIQQPHPLGGDHYSTLDRQCSVIESFKVFHSH

Query:  GDARKVSNNFEENKSTVEEIVEPKASFPEEPIEMDADNVQPQNQPSSSELNVVSTNELLLSPTFTRDILAVVFGNDEKNTVQMSEDVDSHRNEYDLKSSD
        GDA  V +N +E+K TVEEIV+PKAS  E+PI MD DNVQPQN   SSELNVVSTN  +  PT TRD+L VV  ND K T  MSEDVDSHRNEY L+SSD
Subjt:  GDARKVSNNFEENKSTVEEIVEPKASFPEEPIEMDADNVQPQNQPSSSELNVVSTNELLLSPTFTRDILAVVFGNDEKNTVQMSEDVDSHRNEYDLKSSD

Query:  TSFGLENVLRNKSSITFVSENQKNMAVVMDGSFTSEECLLGGSLGEIEDVVVAADQVSTLSSCMSDRKDDYLISIRQQTSEVLMEEQKSGEADPQITPPP
        T  GLE+VLR+KSS   VSENQKNM +V+DGSFT EECL+G  LGE  DV    D+VS LSSCM D KDDYLISIRQQTSEVL+EE+KSGE DPQI PPP
Subjt:  TSFGLENVLRNKSSITFVSENQKNMAVVMDGSFTSEECLLGGSLGEIEDVVVAADQVSTLSSCMSDRKDDYLISIRQQTSEVLMEEQKSGEADPQITPPP

Query:  LDQSICGVMSTVNARRRRHHLALKLPLKRLISPLAFKKVRETNAEKYRLSWRRHHPTLLLPFKRIMNSLALKKVRKTK
        LDQSI  VMST   R +R       P  RL +P+ FK+V+ET +EKYR   RRH P LLLPFKR++N L  KKVRKTK
Subjt:  LDQSICGVMSTVNARRRRHHLALKLPLKRLISPLAFKKVRETNAEKYRLSWRRHHPTLLLPFKRIMNSLALKKVRKTK

TrEMBL top hitse value%identityAlignment
A0A6J1G517 uncharacterized protein LOC111450984 isoform X11.5e-27371Show/hide
Query:  MGFFDLNIPYNEHSSSSSIKANRIKIVAKLMEFGYSGIAYNRTIKGVMSDLDRCTIPLLNVSSLHSILPSFFASVEFHRDLLGVPKSSPFRQYTRLTICI
        MGFFDLN+PY+EHSSSSSI+ NRIKIVAKLME GYSGIAYNRTIKGVMSD DRCTIPLLNVSSL SILP+F AS+EFHR+LLGVP+SSPFRQYTRLTICI
Subjt:  MGFFDLNIPYNEHSSSSSIKANRIKIVAKLMEFGYSGIAYNRTIKGVMSDLDRCTIPLLNVSSLHSILPSFFASVEFHRDLLGVPKSSPFRQYTRLTICI

Query:  NNLQEVLAVNSGNLILKTYDLIAVKPLNQYAFEHACEKLEIDIIAIDFAEKLPFRLKQGPIKSAIQRGVYFEIMYSDLLLDVHERRQMISTTKFLVDWTK
        N+LQE+LAVNSGNLILKTYDLIAVKPLNQ AFE ACEKLEIDIIAIDFAEKLPFRLKQG IKSAIQRGVYFEIMYSDLL DVH RRQMISTTK LVDWTK
Subjt:  NNLQEVLAVNSGNLILKTYDLIAVKPLNQYAFEHACEKLEIDIIAIDFAEKLPFRLKQGPIKSAIQRGVYFEIMYSDLLLDVHERRQMISTTKFLVDWTK

Query:  GKNLIFSSAASSVNEIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALRRKQFYKETIRVERISSDDKLDLNNPWSVDLLNWDPISTGEGDLLLDD
        GKNLI SSAASSVNEIRGP+DVANLSSLLGVSMERAKAAVSKNCRNLIANAL+RKQFYKETIRVERISSDDKLD N+PWSVDL  WDP+S+GEGDLLLDD
Subjt:  GKNLIFSSAASSVNEIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALRRKQFYKETIRVERISSDDKLDLNNPWSVDLLNWDPISTGEGDLLLDD

Query:  IAKSFAASNKKSKNVEAIDFTSVIDNMPPQGFLVNNVIFGSEAKLSLNDNRDLSHIANALEPPIAV----------------------------------
        IA+SFAASN KSK V+AIDFTSVIDN PPQGFLV NVI G EAK+SLNDNR  SH+ +A+EPPIAV                                  
Subjt:  IAKSFAASNKKSKNVEAIDFTSVIDNMPPQGFLVNNVIFGSEAKLSLNDNRDLSHIANALEPPIAV----------------------------------

Query:  ------------------------NGVIQQPHPLGGDHYSTLDRQCSVIESFKVFHSHGDARKVSNNFEENKSTVEEIVEPKASFPEEPIEMDADNVQPQ
                                NGVIQQ  PL G H ST +RQCS           GDA KV +N +E+K TVEEIV+PKAS  E+PI MD DNVQPQ
Subjt:  ------------------------NGVIQQPHPLGGDHYSTLDRQCSVIESFKVFHSHGDARKVSNNFEENKSTVEEIVEPKASFPEEPIEMDADNVQPQ

Query:  NQPSSSELNVVSTNELLLSPTFTRDILAVVFGNDEKNTVQMSEDVDSHRNEYDLKSSDTSFGLENVLRNKSSITFVSENQKNMAVVMDGSFTSEECLLGG
        N   SSELNV STN  +  PT TRD+L V+  ND K    MSEDV SHR+EY LKSSDT  GLE+VLR+KSS   VSEN KNM  V+DGSFT+EECL+G 
Subjt:  NQPSSSELNVVSTNELLLSPTFTRDILAVVFGNDEKNTVQMSEDVDSHRNEYDLKSSDTSFGLENVLRNKSSITFVSENQKNMAVVMDGSFTSEECLLGG

Query:  SLGEIEDVVVAADQVSTLSSCMSDRKDDYLISIRQQTSEVLMEEQKSGEADPQITPPPLDQSICGVMS--TVNARRRRHHLALKLPLKRLISPLAFKKVR
         LGE  DV    D+VS LSSCM D KDDYLISIRQQTSEVLMEEQKSGEADPQI PPPLDQSI GVMS  T  A+R+RH     L L RL +P  FK+V+
Subjt:  SLGEIEDVVVAADQVSTLSSCMSDRKDDYLISIRQQTSEVLMEEQKSGEADPQITPPPLDQSICGVMS--TVNARRRRHHLALKLPLKRLISPLAFKKVR

Query:  ETNAEKYRLSWRRHHPTLLLPFKRIMNSLALKKVRKTK
        ET +EKYR   RRH P LLLPFKR++N L  KKVRKTK
Subjt:  ETNAEKYRLSWRRHHPTLLLPFKRIMNSLALKKVRKTK

A0A6J1G550 uncharacterized protein LOC111450984 isoform X23.8e-27270.79Show/hide
Query:  MGFFDLNIPYNEHSSSSSIKANRIKIVAKLMEFGYSGIAYNRTIKGVMSDLDRCTIPLLNVSSLHSILPSFFASVEFHRDLLGVPKSSPFRQYTRLTICI
        MGFFDLN+PY+EHSSSSSI+ NRIKIVAKLME GYSGIAYNRTIKGVMSD DRCTIPLLNVSSL SILP+F AS+EFHR+LLGVP+SSPFRQYTRLTICI
Subjt:  MGFFDLNIPYNEHSSSSSIKANRIKIVAKLMEFGYSGIAYNRTIKGVMSDLDRCTIPLLNVSSLHSILPSFFASVEFHRDLLGVPKSSPFRQYTRLTICI

Query:  NNLQEVLAVNSGNLILKTYDLIAVKPLNQYAFEHACEKLEIDIIAIDFAEKLPFRLKQGPIKSAIQRGVYFEIMYSDLLLDVHERRQMISTTKFLVDWTK
        N+LQE+LAVNSGNLILKTYDLIAVKPLNQ AFE ACEKLEIDIIAIDFAEKLPFRLKQG IKSAIQRGVYFEIMYSDLL DVH RRQMISTTK LVDWTK
Subjt:  NNLQEVLAVNSGNLILKTYDLIAVKPLNQYAFEHACEKLEIDIIAIDFAEKLPFRLKQGPIKSAIQRGVYFEIMYSDLLLDVHERRQMISTTKFLVDWTK

Query:  GKNLIFSSAASSVNEIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALRRKQFYKETIRVERISSDDKLDLNNPWSVDLLNWDPISTGEGDLLLDD
        GKNLI SSAASSVNEIRGP+DVANLSSLLGVSMERAKAAVSKNCRNLIANAL+RKQFYKETIRVERISSDDKLD N+PWSVDL  WDP+S+GEGDLLLDD
Subjt:  GKNLIFSSAASSVNEIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALRRKQFYKETIRVERISSDDKLDLNNPWSVDLLNWDPISTGEGDLLLDD

Query:  IAKSFAASNKKSKNVEAIDFTSVIDNMPPQGFLVNNVIFGSEAKLSLNDNRDLSHIANALEPPIAV----------------------------------
        IA+SFAASN KSK V+AIDFTSVIDN PPQGFLV NVI G EAK+SLNDNR  SH+ +A+EPPIAV                                  
Subjt:  IAKSFAASNKKSKNVEAIDFTSVIDNMPPQGFLVNNVIFGSEAKLSLNDNRDLSHIANALEPPIAV----------------------------------

Query:  ------------------------NGVIQQPHPLGGDHYSTLDRQCSVIESFKVFHSHGDARKVSNNFEENKSTVEEIVEPKASFPEEPIEMDADNVQPQ
                                NGVIQQ  PL G H ST +RQCS           GDA KV +N +E+K TVEEIV+PKAS  E+PI MD DNVQPQ
Subjt:  ------------------------NGVIQQPHPLGGDHYSTLDRQCSVIESFKVFHSHGDARKVSNNFEENKSTVEEIVEPKASFPEEPIEMDADNVQPQ

Query:  NQPSSSELNVVSTNELLLSPTFTRDILAVVFGNDEKNTVQMSEDVDSHRNEYDLKSSDTSFGLENVLRNKSSITFVSENQKNMAVVMDGSFTSEECLLGG
        N   SSELNV STN  +  PT TRD+L V+  ND K    MSEDV SHR+EY LKSSDT  GLE+VLR+KSS   VSEN KNM  V+DGSFT+EECL+G 
Subjt:  NQPSSSELNVVSTNELLLSPTFTRDILAVVFGNDEKNTVQMSEDVDSHRNEYDLKSSDTSFGLENVLRNKSSITFVSENQKNMAVVMDGSFTSEECLLGG

Query:  SLGEIEDVVVAADQVSTLSSCMSDRKDDYLISIRQQTSEVLMEEQKSGEADPQITPPPLDQSICGVMS--TVNARRRRHHLALKLPLKRLISPLAFKKVR
         LGE  DV    D+VS LSSCM D KDDYLISIRQQTSEVLMEEQKSGEADPQI PPPLDQSI GVMS  T  A+R+RH     L L RL +P  FK+V+
Subjt:  SLGEIEDVVVAADQVSTLSSCMSDRKDDYLISIRQQTSEVLMEEQKSGEADPQITPPPLDQSICGVMS--TVNARRRRHHLALKLPLKRLISPLAFKKVR

Query:  ETNAEKYRLSWRRHHPTLLLPFKRIMNSLALKKVRK
        ET +EKYR   RRH P LLLPFKR++N L  KKV+K
Subjt:  ETNAEKYRLSWRRHHPTLLLPFKRIMNSLALKKVRK

A0A6J1L082 uncharacterized protein LOC111499856 isoform X36.8e-28277.5Show/hide
Query:  MGFFDLNIPYNEHSSSSSIKANRIKIVAKLMEFGYSGIAYNRTIKGVMSDLDRCTIPLLNVSSLHSILPSFFASVEFHRDLLGVPKSSPFRQYTRLTICI
        MGFFDLNIPY+EHSSSSSI+ NRIKIVAKLME GYSGIAYNRTIKGVMSD DRCTIPLLNVSSL SILP+F AS+EFHR+LLGVP+SSPFRQYTRLTICI
Subjt:  MGFFDLNIPYNEHSSSSSIKANRIKIVAKLMEFGYSGIAYNRTIKGVMSDLDRCTIPLLNVSSLHSILPSFFASVEFHRDLLGVPKSSPFRQYTRLTICI

Query:  NNLQEVLAVNSGNLILKTYDLIAVKPLNQYAFEHACEKLEIDIIAIDFAEKLPFRLKQGPIKSAIQRGVYFEIMYSDLLLDVHERRQMISTTKFLVDWTK
        N+LQE+LAVNSGNLILKTYDLIAVKPLNQ AFE ACEKLEIDIIAIDFAEKLPFRLKQG IKSAIQRGVYFEIMYSDLL DVH RRQMISTTK LVDWTK
Subjt:  NNLQEVLAVNSGNLILKTYDLIAVKPLNQYAFEHACEKLEIDIIAIDFAEKLPFRLKQGPIKSAIQRGVYFEIMYSDLLLDVHERRQMISTTKFLVDWTK

Query:  GKNLIFSSAASSVNEIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALRRKQFYKETIRVERISSDDKLDLNNPWSVDLLNWDPISTGEGDLLLDD
        GKNLI SSAASSVNEIRGP+DVANLSSLLGVSMERAK AVSKNCRNLIANAL+RKQFYKETIRVERI+SDDKLD N+PWSVDL  WDPIS+GEGDLLLDD
Subjt:  GKNLIFSSAASSVNEIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALRRKQFYKETIRVERISSDDKLDLNNPWSVDLLNWDPISTGEGDLLLDD

Query:  IAKSFAASNKKSKNVEAIDFTSVIDNMPPQGFLVNNVIFGSEAKLSLNDNRDLSHIANALEPPIAVNGVIQQPHPLGGDHYSTLDRQCSVIESFKVFHSH
        IAKSFAA+N KSK V+AIDFTSVIDNMPPQGFL+ NVI GSEAK+SLNDNR LS + +A+EPPIAVNGVIQQ  PL G H ST DRQCS           
Subjt:  IAKSFAASNKKSKNVEAIDFTSVIDNMPPQGFLVNNVIFGSEAKLSLNDNRDLSHIANALEPPIAVNGVIQQPHPLGGDHYSTLDRQCSVIESFKVFHSH

Query:  GDARKVSNNFEENKSTVEEIVEPKASFPEEPIEMDADNVQPQNQPSSSELNVVSTNELLLSPTFTRDILAVVFGNDEKNTVQMSEDVDSHRNEYDLKSSD
         DA KV +N +E+K TVEEIV+PKAS  EEPIEMD DNVQPQN   SSELNVVSTN  +  PT TRD+L VV  ND K T  MSEDV SHR+EY LKSSD
Subjt:  GDARKVSNNFEENKSTVEEIVEPKASFPEEPIEMDADNVQPQNQPSSSELNVVSTNELLLSPTFTRDILAVVFGNDEKNTVQMSEDVDSHRNEYDLKSSD

Query:  TSFGLENVLRNKSSITFVSENQKNMAVVMDGSFTSEECLLGGSLGEIEDVVVAADQVSTLSSCMSDRKDDYLISIRQQTSEVLMEEQKSGEADPQITPPP
        T  GLE+VLR+KSS   VSENQ NM  V+DGSFT+EECL+G  LGE  DV    D VS LSSCM D KDDYLISIRQQTSEVLMEEQKS EADPQI PPP
Subjt:  TSFGLENVLRNKSSITFVSENQKNMAVVMDGSFTSEECLLGGSLGEIEDVVVAADQVSTLSSCMSDRKDDYLISIRQQTSEVLMEEQKSGEADPQITPPP

Query:  LDQSICGVMS--TVNARRRRHHLALKLPLKRLISPLAFKKVRETNAEKYRLSWRRHHPTLLLPFKRIMNSLALKKVRKTK
        +DQSI GVMS  T  A+R+RH     L L +L +P  FK+V+ET +EKYR   RRH P LLLPFKR++N L  KKVRKTK
Subjt:  LDQSICGVMS--TVNARRRRHHLALKLPLKRLISPLAFKKVRETNAEKYRLSWRRHHPTLLLPFKRIMNSLALKKVRKTK

A0A6J1L2N3 uncharacterized protein LOC111499856 isoform X21.3e-28578.25Show/hide
Query:  MGFFDLNIPYNEHSSSSSIKANRIKIVAKLMEFGYSGIAYNRTIKGVMSDLDRCTIPLLNVSSLHSILPSFFASVEFHRDLLGVPKSSPFRQYTRLTICI
        MGFFDLNIPY+EHSSSSSI+ NRIKIVAKLME GYSGIAYNRTIKGVMSD DRCTIPLLNVSSL SILP+F AS+EFHR+LLGVP+SSPFRQYTRLTICI
Subjt:  MGFFDLNIPYNEHSSSSSIKANRIKIVAKLMEFGYSGIAYNRTIKGVMSDLDRCTIPLLNVSSLHSILPSFFASVEFHRDLLGVPKSSPFRQYTRLTICI

Query:  NNLQEVLAVNSGNLILKTYDLIAVKPLNQYAFEHACEKLEIDIIAIDFAEKLPFRLKQGPIKSAIQRGVYFEIMYSDLLLDVHERRQMISTTKFLVDWTK
        N+LQE+LAVNSGNLILKTYDLIAVKPLNQ AFE ACEKLEIDIIAIDFAEKLPFRLKQG IKSAIQRGVYFEIMYSDLL DVH RRQMISTTK LVDWTK
Subjt:  NNLQEVLAVNSGNLILKTYDLIAVKPLNQYAFEHACEKLEIDIIAIDFAEKLPFRLKQGPIKSAIQRGVYFEIMYSDLLLDVHERRQMISTTKFLVDWTK

Query:  GKNLIFSSAASSVNEIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALRRKQFYKETIRVERISSDDKLDLNNPWSVDLLNWDPISTGEGDLLLDD
        GKNLI SSAASSVNEIRGP+DVANLSSLLGVSMERAK AVSKNCRNLIANAL+RKQFYKETIRVERI+SDDKLD N+PWSVDL  WDPIS+GEGDLLLDD
Subjt:  GKNLIFSSAASSVNEIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALRRKQFYKETIRVERISSDDKLDLNNPWSVDLLNWDPISTGEGDLLLDD

Query:  IAKSFAASNKKSKNVEAIDFTSVIDNMPPQGFLVNNVIFGSEAKLSLNDNRDLSHIANALEPPIAVNGVIQQPHPLGGDHYSTLDRQCSVIESFKVFHSH
        IAKSFAA+N KSK V+AIDFTSVIDNMPPQGFL+ NVI GSEAK+SLNDNR LS + +A+EPPIAVNGVIQQ  PL G H ST DRQCSVIE+ K+  SH
Subjt:  IAKSFAASNKKSKNVEAIDFTSVIDNMPPQGFLVNNVIFGSEAKLSLNDNRDLSHIANALEPPIAVNGVIQQPHPLGGDHYSTLDRQCSVIESFKVFHSH

Query:  GDARKVSNNFEENKSTVEEIVEPKASFPEEPIEMDADNVQPQNQPSSSELNVVSTNELLLSPTFTRDILAVVFGNDEKNTVQMSEDVDSHRNEYDLKSSD
         DA KV +N +E+K TVEEIV+PKAS  EEPIEMD DNVQPQN   SSELNVVSTN  +  PT TRD+L VV  ND K T  MSEDV SHR+EY LKSSD
Subjt:  GDARKVSNNFEENKSTVEEIVEPKASFPEEPIEMDADNVQPQNQPSSSELNVVSTNELLLSPTFTRDILAVVFGNDEKNTVQMSEDVDSHRNEYDLKSSD

Query:  TSFGLENVLRNKSSITFVSENQKNMAVVMDGSFTSEECLLGGSLGEIEDVVVAADQVSTLSSCMSDRKDDYLISIRQQTSEVLMEEQKSGEADPQITPPP
        T  GLE+VLR+KSS   VSENQ NM  V+DGSFT+EECL+G  LGE  DV    D VS LSSCM D KDDYLISIRQQTSEVLMEEQKS EADPQI PPP
Subjt:  TSFGLENVLRNKSSITFVSENQKNMAVVMDGSFTSEECLLGGSLGEIEDVVVAADQVSTLSSCMSDRKDDYLISIRQQTSEVLMEEQKSGEADPQITPPP

Query:  LDQSICGVMS--TVNARRRRHHLALKLPLKRLISPLAFKKVRETNAEKYRLSWRRHHPTLLLPFKRIMNSLALKKV
        +DQSI GVMS  T  A+R+RH     L L +L +P  FK+V+ET +EKYR   RRH P LLLPFKR++N L  KKV
Subjt:  LDQSICGVMS--TVNARRRRHHLALKLPLKRLISPLAFKKVRETNAEKYRLSWRRHHPTLLLPFKRIMNSLALKKV

A0A6J1L4M3 uncharacterized protein LOC111499856 isoform X11.1e-28778.38Show/hide
Query:  MGFFDLNIPYNEHSSSSSIKANRIKIVAKLMEFGYSGIAYNRTIKGVMSDLDRCTIPLLNVSSLHSILPSFFASVEFHRDLLGVPKSSPFRQYTRLTICI
        MGFFDLNIPY+EHSSSSSI+ NRIKIVAKLME GYSGIAYNRTIKGVMSD DRCTIPLLNVSSL SILP+F AS+EFHR+LLGVP+SSPFRQYTRLTICI
Subjt:  MGFFDLNIPYNEHSSSSSIKANRIKIVAKLMEFGYSGIAYNRTIKGVMSDLDRCTIPLLNVSSLHSILPSFFASVEFHRDLLGVPKSSPFRQYTRLTICI

Query:  NNLQEVLAVNSGNLILKTYDLIAVKPLNQYAFEHACEKLEIDIIAIDFAEKLPFRLKQGPIKSAIQRGVYFEIMYSDLLLDVHERRQMISTTKFLVDWTK
        N+LQE+LAVNSGNLILKTYDLIAVKPLNQ AFE ACEKLEIDIIAIDFAEKLPFRLKQG IKSAIQRGVYFEIMYSDLL DVH RRQMISTTK LVDWTK
Subjt:  NNLQEVLAVNSGNLILKTYDLIAVKPLNQYAFEHACEKLEIDIIAIDFAEKLPFRLKQGPIKSAIQRGVYFEIMYSDLLLDVHERRQMISTTKFLVDWTK

Query:  GKNLIFSSAASSVNEIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALRRKQFYKETIRVERISSDDKLDLNNPWSVDLLNWDPISTGEGDLLLDD
        GKNLI SSAASSVNEIRGP+DVANLSSLLGVSMERAK AVSKNCRNLIANAL+RKQFYKETIRVERI+SDDKLD N+PWSVDL  WDPIS+GEGDLLLDD
Subjt:  GKNLIFSSAASSVNEIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALRRKQFYKETIRVERISSDDKLDLNNPWSVDLLNWDPISTGEGDLLLDD

Query:  IAKSFAASNKKSKNVEAIDFTSVIDNMPPQGFLVNNVIFGSEAKLSLNDNRDLSHIANALEPPIAVNGVIQQPHPLGGDHYSTLDRQCSVIESFKVFHSH
        IAKSFAA+N KSK V+AIDFTSVIDNMPPQGFL+ NVI GSEAK+SLNDNR LS + +A+EPPIAVNGVIQQ  PL G H ST DRQCSVIE+ K+  SH
Subjt:  IAKSFAASNKKSKNVEAIDFTSVIDNMPPQGFLVNNVIFGSEAKLSLNDNRDLSHIANALEPPIAVNGVIQQPHPLGGDHYSTLDRQCSVIESFKVFHSH

Query:  GDARKVSNNFEENKSTVEEIVEPKASFPEEPIEMDADNVQPQNQPSSSELNVVSTNELLLSPTFTRDILAVVFGNDEKNTVQMSEDVDSHRNEYDLKSSD
         DA KV +N +E+K TVEEIV+PKAS  EEPIEMD DNVQPQN   SSELNVVSTN  +  PT TRD+L VV  ND K T  MSEDV SHR+EY LKSSD
Subjt:  GDARKVSNNFEENKSTVEEIVEPKASFPEEPIEMDADNVQPQNQPSSSELNVVSTNELLLSPTFTRDILAVVFGNDEKNTVQMSEDVDSHRNEYDLKSSD

Query:  TSFGLENVLRNKSSITFVSENQKNMAVVMDGSFTSEECLLGGSLGEIEDVVVAADQVSTLSSCMSDRKDDYLISIRQQTSEVLMEEQKSGEADPQITPPP
        T  GLE+VLR+KSS   VSENQ NM  V+DGSFT+EECL+G  LGE  DV    D VS LSSCM D KDDYLISIRQQTSEVLMEEQKS EADPQI PPP
Subjt:  TSFGLENVLRNKSSITFVSENQKNMAVVMDGSFTSEECLLGGSLGEIEDVVVAADQVSTLSSCMSDRKDDYLISIRQQTSEVLMEEQKSGEADPQITPPP

Query:  LDQSICGVMS--TVNARRRRHHLALKLPLKRLISPLAFKKVRETNAEKYRLSWRRHHPTLLLPFKRIMNSLALKKVRKTK
        +DQSI GVMS  T  A+R+RH     L L +L +P  FK+V+ET +EKYR   RRH P LLLPFKR++N L  KKVRKTK
Subjt:  LDQSICGVMS--TVNARRRRHHLALKLPLKRLISPLAFKKVRETNAEKYRLSWRRHHPTLLLPFKRIMNSLALKKVRKTK

SwissProt top hitse value%identityAlignment
F4JXF1 Protein GAMETOPHYTE DEFECTIVE 14.2e-11157.31Show/hide
Query:  MGFFDLNIPYNEHSSS-----SSIKANRIKIVAKLMEFGYSGIAYNRTIKGVMSDLDRCTIPLLNVSSLHSILPSFFASVEFHRDLLGVPKSSPFRQYTR
        MGFFDL+IPYNE   S     +  K  R+K+  K ME GY GIA+NR+IKGVMSD D CTIPLL + SL  + P   +SV FHRDLLGVP+++PFRQYTR
Subjt:  MGFFDLNIPYNEHSSS-----SSIKANRIKIVAKLMEFGYSGIAYNRTIKGVMSDLDRCTIPLLNVSSLHSILPSFFASVEFHRDLLGVPKSSPFRQYTR

Query:  LTICINNLQEVLAVNSGNLILKTYDLIAVKPLNQYAFEHACEKLEIDIIAIDFAEKLPFRLKQGPIKSAIQRGVYFEIMYSDLLLDVHERRQMISTTKFL
        LT+ + +  +  ++NSGN ILK+YD+IAV+P+NQ AF++ACEK E+D+I+IDF +K+ FRLK   +K+AIQRG+YFEI YSD+L+D   RRQ+IS  K L
Subjt:  LTICINNLQEVLAVNSGNLILKTYDLIAVKPLNQYAFEHACEKLEIDIIAIDFAEKLPFRLKQGPIKSAIQRGVYFEIMYSDLLLDVHERRQMISTTKFL

Query:  VDWTKGKNLIFSSAASSVNEIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALRRKQFYKETIRVERISSDDKLDLNNPWSVDLLNWDPISTGEGD
        VDWT+GKNLI SS A SV E+RGP DV NL  LLG+S ERA+AA+SKNCRN+IA  L++K+F+KE +RVE +S+ D   L  P S D + WD +S+GEGD
Subjt:  VDWTKGKNLIFSSAASSVNEIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALRRKQFYKETIRVERISSDDKLDLNNPWSVDLLNWDPISTGEGD

Query:  LLLDDIAKSFAASNKKS-KNVEAIDFTSVIDNMPPQGFLVNNVIFGSEA
        +LLDD+AK+F A+N  + K+ +AIDFTSV+D +P  GF V +++ G+E+
Subjt:  LLLDDIAKSFAASNKKS-KNVEAIDFTSVIDNMPPQGFLVNNVIFGSEA

O88796 Ribonuclease P protein subunit p302.8e-3035.52Show/hide
Query:  SSSSIKANRIKIVAKLMEFGYSGIAYNRTIKGVMSDLDRCTIPLLNVSSLHSILPSFFASVEFHRDLLGVPKSSPFRQYTRLTICINNLQEVLAVNSGNL
        + S +KA R  +V      GYS +A N  +     +  R     + VS L + LP            +   KS P +  TRLTI + +      + + + 
Subjt:  SSSSIKANRIKIVAKLMEFGYSGIAYNRTIKGVMSDLDRCTIPLLNVSSLHSILPSFFASVEFHRDLLGVPKSSPFRQYTRLTICINNLQEVLAVNSGNL

Query:  ILKTYDLIAVKPLNQYAFEHACEKLEIDIIAIDFAEKLPFRLKQGPIKSAIQRGVYFEIMYSDLLLDVHERRQMISTTKFLVDWTKGKNLIFSSAASSVN
         ++ YD++AV P  +  F  AC  L++D++ I   EKLPF  K+ P+  AI+RG+ FE++Y   + D   RR  IS    L+   KGKN+I SSAA    
Subjt:  ILKTYDLIAVKPLNQYAFEHACEKLEIDIIAIDFAEKLPFRLKQGPIKSAIQRGVYFEIMYSDLLLDVHERRQMISTTKFLVDWTKGKNLIFSSAASSVN

Query:  EIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALRRKQFYK--ETIRVERISSDD
        EIRGPYDVANL  L G+S    KAAVS NCR +  +   RK  +    T++  R S  D
Subjt:  EIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALRRKQFYK--ETIRVERISSDD

P78346 Ribonuclease P protein subunit p305.6e-3136.26Show/hide
Query:  SSSSIKANRIKIVAKLMEFGYSGIAYNRTI--KGVMSDLDRCTIPLLNVSSLHSILPSFFASVEFHRDLLGVPKSSPFRQYTRLTICINNLQEVLAVNSG
        + S +KA R  +V      GYS +A N  +  K    ++++     + VS L + LP            +   KS P +  TRLTI +++      + + 
Subjt:  SSSSIKANRIKIVAKLMEFGYSGIAYNRTI--KGVMSDLDRCTIPLLNVSSLHSILPSFFASVEFHRDLLGVPKSSPFRQYTRLTICINNLQEVLAVNSG

Query:  NLILKTYDLIAVKPLNQYAFEHACEKLEIDIIAIDFAEKLPFRLKQGPIKSAIQRGVYFEIMYSDLLLDVHERRQMISTTKFLVDWTKGKNLIFSSAASS
        +   + YD++AV P  +  F  AC  L++D++ I   EKLPF  K+ PI  AI RG+ FE++YS  + D   RR  IS+   L+   KGKN+I SSAA  
Subjt:  NLILKTYDLIAVKPLNQYAFEHACEKLEIDIIAIDFAEKLPFRLKQGPIKSAIQRGVYFEIMYSDLLLDVHERRQMISTTKFLVDWTKGKNLIFSSAASS

Query:  VNEIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALRRKQFYK--ETIRVERISSDDK
          EIRGPYDVANL  L G+S   AKAAVS NCR  + +   RK  +    T++  R S  D+
Subjt:  VNEIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALRRKQFYK--ETIRVERISSDDK

Q3SZ21 Ribonuclease P protein subunit p303.0e-3236.78Show/hide
Query:  SSSSIKANRIKIVAKLMEFGYSGIAYNRTI--KGVMSDLDRCTIPLLNVSSLHSILPSFFASVEFHRDLLGVPKSSPFRQYTRLTICINNLQEVLAVNSG
        + S +KA R  +V      GYS +A N  +  K    ++++     + VS L + LP            +   KS P +  TRLTI +++      + + 
Subjt:  SSSSIKANRIKIVAKLMEFGYSGIAYNRTI--KGVMSDLDRCTIPLLNVSSLHSILPSFFASVEFHRDLLGVPKSSPFRQYTRLTICINNLQEVLAVNSG

Query:  NLILKTYDLIAVKPLNQYAFEHACEKLEIDIIAIDFAEKLPFRLKQGPIKSAIQRGVYFEIMYSDLLLDVHERRQMISTTKFLVDWTKGKNLIFSSAASS
        +  ++ YD++AV P  +  F  AC  L++D++ I   EKLPF  K+ PI  AI RGV FE++YS  + D   RR  IS    L+   KGKN+I SSAA  
Subjt:  NLILKTYDLIAVKPLNQYAFEHACEKLEIDIIAIDFAEKLPFRLKQGPIKSAIQRGVYFEIMYSDLLLDVHERRQMISTTKFLVDWTKGKNLIFSSAASS

Query:  VNEIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALRRKQFYK--ETIRVERISSDD
          EIRGPYDVANL  L G+S   AKAAVS NCR ++ +   RK  +    T++  R S  D
Subjt:  VNEIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALRRKQFYK--ETIRVERISSDD

Q3ZE13 Ribonuclease P protein subunit drpp305.4e-2630.23Show/hide
Query:  MGFFDLNIPYNEHSSSSSIKANRIKIVAKL-MEFGYSGIAYNRTIKGVMSDLDRCTIPLLNV--SSLHSILPSFFASVEFHRDLLGVPKSSPFRQYTRLT
        M ++DLNI        SS+   +IK +  L  ++GY  +A   T++G +   D C I  + +   S  S    +    + ++ +         +QYTRL 
Subjt:  MGFFDLNIPYNEHSSSSSIKANRIKIVAKL-MEFGYSGIAYNRTIKGVMSDLDRCTIPLLNV--SSLHSILPSFFASVEFHRDLLGVPKSSPFRQYTRLT

Query:  ICINNLQEVLAVNSGNLILKTYDLIAVKPLNQYAFEHACEKLEIDIIAIDFAEKLPFRLKQGPIKSAIQRGVYFEIMYSDLLLDVHERRQMISTTKFLVD
        +    + E   + + N ++++YD+I+V P +   F  AC   EIDII ID   K  F +K   ++  I +G++ EI+Y +L     +R         LV 
Subjt:  ICINNLQEVLAVNSGNLILKTYDLIAVKPLNQYAFEHACEKLEIDIIAIDFAEKLPFRLKQGPIKSAIQRGVYFEIMYSDLLLDVHERRQMISTTKFLVD

Query:  WTKGKNLIFSSAASSVNEIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALRRK
         + GKN+I SS+  S   +R PYD++NL  L G++ ++AKAAVSK+    + +A+ R+
Subjt:  WTKGKNLIFSSAASSVNEIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALRRK

Arabidopsis top hitse value%identityAlignment
AT5G59980.1 Polymerase/histidinol phosphatase-like3.0e-11257.31Show/hide
Query:  MGFFDLNIPYNEHSSS-----SSIKANRIKIVAKLMEFGYSGIAYNRTIKGVMSDLDRCTIPLLNVSSLHSILPSFFASVEFHRDLLGVPKSSPFRQYTR
        MGFFDL+IPYNE   S     +  K  R+K+  K ME GY GIA+NR+IKGVMSD D CTIPLL + SL  + P   +SV FHRDLLGVP+++PFRQYTR
Subjt:  MGFFDLNIPYNEHSSS-----SSIKANRIKIVAKLMEFGYSGIAYNRTIKGVMSDLDRCTIPLLNVSSLHSILPSFFASVEFHRDLLGVPKSSPFRQYTR

Query:  LTICINNLQEVLAVNSGNLILKTYDLIAVKPLNQYAFEHACEKLEIDIIAIDFAEKLPFRLKQGPIKSAIQRGVYFEIMYSDLLLDVHERRQMISTTKFL
        LT+ + +  +  ++NSGN ILK+YD+IAV+P+NQ AF++ACEK E+D+I+IDF +K+ FRLK   +K+AIQRG+YFEI YSD+L+D   RRQ+IS  K L
Subjt:  LTICINNLQEVLAVNSGNLILKTYDLIAVKPLNQYAFEHACEKLEIDIIAIDFAEKLPFRLKQGPIKSAIQRGVYFEIMYSDLLLDVHERRQMISTTKFL

Query:  VDWTKGKNLIFSSAASSVNEIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALRRKQFYKETIRVERISSDDKLDLNNPWSVDLLNWDPISTGEGD
        VDWT+GKNLI SS A SV E+RGP DV NL  LLG+S ERA+AA+SKNCRN+IA  L++K+F+KE +RVE +S+ D   L  P S D + WD +S+GEGD
Subjt:  VDWTKGKNLIFSSAASSVNEIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALRRKQFYKETIRVERISSDDKLDLNNPWSVDLLNWDPISTGEGD

Query:  LLLDDIAKSFAASNKKS-KNVEAIDFTSVIDNMPPQGFLVNNVIFGSEA
        +LLDD+AK+F A+N  + K+ +AIDFTSV+D +P  GF V +++ G+E+
Subjt:  LLLDDIAKSFAASNKKS-KNVEAIDFTSVIDNMPPQGFLVNNVIFGSEA

AT5G59980.2 Polymerase/histidinol phosphatase-like3.0e-11257.31Show/hide
Query:  MGFFDLNIPYNEHSSS-----SSIKANRIKIVAKLMEFGYSGIAYNRTIKGVMSDLDRCTIPLLNVSSLHSILPSFFASVEFHRDLLGVPKSSPFRQYTR
        MGFFDL+IPYNE   S     +  K  R+K+  K ME GY GIA+NR+IKGVMSD D CTIPLL + SL  + P   +SV FHRDLLGVP+++PFRQYTR
Subjt:  MGFFDLNIPYNEHSSS-----SSIKANRIKIVAKLMEFGYSGIAYNRTIKGVMSDLDRCTIPLLNVSSLHSILPSFFASVEFHRDLLGVPKSSPFRQYTR

Query:  LTICINNLQEVLAVNSGNLILKTYDLIAVKPLNQYAFEHACEKLEIDIIAIDFAEKLPFRLKQGPIKSAIQRGVYFEIMYSDLLLDVHERRQMISTTKFL
        LT+ + +  +  ++NSGN ILK+YD+IAV+P+NQ AF++ACEK E+D+I+IDF +K+ FRLK   +K+AIQRG+YFEI YSD+L+D   RRQ+IS  K L
Subjt:  LTICINNLQEVLAVNSGNLILKTYDLIAVKPLNQYAFEHACEKLEIDIIAIDFAEKLPFRLKQGPIKSAIQRGVYFEIMYSDLLLDVHERRQMISTTKFL

Query:  VDWTKGKNLIFSSAASSVNEIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALRRKQFYKETIRVERISSDDKLDLNNPWSVDLLNWDPISTGEGD
        VDWT+GKNLI SS A SV E+RGP DV NL  LLG+S ERA+AA+SKNCRN+IA  L++K+F+KE +RVE +S+ D   L  P S D + WD +S+GEGD
Subjt:  VDWTKGKNLIFSSAASSVNEIRGPYDVANLSSLLGVSMERAKAAVSKNCRNLIANALRRKQFYKETIRVERISSDDKLDLNNPWSVDLLNWDPISTGEGD

Query:  LLLDDIAKSFAASNKKS-KNVEAIDFTSVIDNMPPQGFLVNNVIFGSEA
        +LLDD+AK+F A+N  + K+ +AIDFTSV+D +P  GF V +++ G+E+
Subjt:  LLLDDIAKSFAASNKKS-KNVEAIDFTSVIDNMPPQGFLVNNVIFGSEA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTTTCTTCGACCTCAATATTCCTTACAACGAGCATTCATCTTCATCTTCAATTAAGGCTAATCGCATCAAGATTGTGGCCAAGCTCATGGAGTTCGGCTACTCTGG
AATCGCCTACAATCGTACGATCAAAGGAGTGATGTCCGATTTGGACCGATGTACCATTCCCCTCCTCAATGTCTCGTCGCTTCACAGTATCCTTCCATCCTTTTTCGCCT
CTGTTGAGTTCCACCGCGATCTCCTTGGTGTCCCAAAGTCCTCTCCTTTCCGTCAGTACACGCGTCTTACTATTTGTATCAACAATTTACAGGAGGTTCTGGCAGTCAAT
TCTGGCAACCTTATTCTGAAGACCTACGATTTGATTGCTGTGAAGCCTCTTAATCAATATGCTTTTGAGCATGCTTGCGAGAAATTGGAGATAGATATAATTGCCATTGA
TTTTGCAGAGAAACTGCCTTTCAGGTTGAAGCAAGGTCCGATTAAATCTGCAATTCAGCGTGGAGTTTACTTTGAAATTATGTACTCTGATCTTCTTTTGGATGTTCATG
AAAGGAGGCAAATGATATCTACTACTAAGTTCTTGGTGGATTGGACAAAAGGAAAGAATCTCATATTTTCGAGTGCTGCCTCTTCTGTAAATGAAATCAGAGGACCTTAT
GATGTTGCAAACTTGTCATCATTGCTTGGTGTCTCTATGGAACGAGCAAAAGCTGCTGTTTCAAAAAATTGTAGGAATCTTATAGCTAATGCACTAAGGAGAAAGCAGTT
CTACAAGGAGACCATTCGAGTTGAAAGGATATCATCGGATGATAAATTAGATTTGAACAACCCTTGGTCAGTGGATTTGCTTAACTGGGATCCTATATCAACTGGCGAAG
GTGATTTGCTATTGGATGATATAGCAAAATCTTTTGCTGCCTCTAATAAAAAATCTAAAAATGTGGAAGCAATTGATTTCACTTCAGTCATTGACAACATGCCACCACAA
GGTTTTCTAGTCAATAATGTAATATTTGGCTCTGAGGCAAAACTGTCATTGAATGATAACAGAGACTTGTCGCACATTGCAAATGCCCTTGAGCCACCAATTGCAGTGAA
TGGAGTAATTCAACAACCCCATCCTCTTGGTGGTGACCACTATTCTACATTAGATCGTCAATGCTCTGTAATTGAAAGTTTTAAAGTTTTTCATTCACATGGTGATGCAA
GAAAAGTTTCAAATAATTTTGAGGAAAACAAAAGTACTGTTGAAGAAATTGTGGAGCCCAAAGCCTCATTTCCGGAAGAACCTATTGAAATGGACGCTGATAATGTGCAA
CCACAAAACCAGCCATCCAGTAGCGAGTTAAATGTAGTATCAACAAATGAGCTTCTGCTTTCTCCTACATTTACCAGAGATATATTAGCTGTTGTTTTTGGGAATGATGA
AAAAAATACCGTTCAAATGTCAGAGGATGTTGATTCTCATCGGAATGAGTATGACTTAAAAAGTTCTGATACATCGTTTGGTTTGGAAAATGTGTTGAGGAACAAAAGCT
CAATTACTTTTGTTTCAGAAAATCAGAAGAATATGGCTGTTGTAATGGATGGTTCATTTACTTCTGAAGAATGTTTACTTGGTGGAAGCCTTGGGGAGATCGAGGATGTT
GTAGTAGCCGCTGATCAGGTTTCTACTCTTAGTTCTTGTATGAGTGATAGAAAAGATGATTATTTGATCTCAATTCGACAACAAACATCTGAGGTGTTAATGGAAGAGCA
AAAAAGTGGAGAAGCTGACCCTCAGATAACACCACCACCTTTAGATCAGTCTATATGTGGTGTAATGTCAACAGTAAACGCGAGGCGGAGGCGACATCATCTAGCATTAA
AGCTTCCCCTCAAGAGGTTAATCAGTCCTTTGGCCTTCAAGAAAGTCCGGGAAACAAATGCAGAGAAATACCGACTAAGCTGGAGGAGACATCACCCAACGCTATTGCTT
CCATTCAAGCGCATAATGAATTCTCTAGCATTGAAGAAGGTTCGCAAGACTAAATGCTAG
mRNA sequenceShow/hide mRNA sequence
CGCTAGTCCAGTCGTCCCTCGCAACGCAACAAAGTCGCAGACGCAAATCCTCGCAATCGAAGCAGCCAGCCAGATCCTGCCACCTGACTCGCAGCCGCCGCCGACGGCCC
GACAACTCCTCAAACGGCTCAGTCCTCAGTCCTCGGGCTCAGATCATATTTCCAGTCTCTTTCCTGCCTAAACCCTGTGCTTCTTCAGTTCAAAGCTCTCTCTCCATCGC
CATGGGTTTCTTCGACCTCAATATTCCTTACAACGAGCATTCATCTTCATCTTCAATTAAGGCTAATCGCATCAAGATTGTGGCCAAGCTCATGGAGTTCGGCTACTCTG
GAATCGCCTACAATCGTACGATCAAAGGAGTGATGTCCGATTTGGACCGATGTACCATTCCCCTCCTCAATGTCTCGTCGCTTCACAGTATCCTTCCATCCTTTTTCGCC
TCTGTTGAGTTCCACCGCGATCTCCTTGGTGTCCCAAAGTCCTCTCCTTTCCGTCAGTACACGCGTCTTACTATTTGTATCAACAATTTACAGGAGGTTCTGGCAGTCAA
TTCTGGCAACCTTATTCTGAAGACCTACGATTTGATTGCTGTGAAGCCTCTTAATCAATATGCTTTTGAGCATGCTTGCGAGAAATTGGAGATAGATATAATTGCCATTG
ATTTTGCAGAGAAACTGCCTTTCAGGTTGAAGCAAGGTCCGATTAAATCTGCAATTCAGCGTGGAGTTTACTTTGAAATTATGTACTCTGATCTTCTTTTGGATGTTCAT
GAAAGGAGGCAAATGATATCTACTACTAAGTTCTTGGTGGATTGGACAAAAGGAAAGAATCTCATATTTTCGAGTGCTGCCTCTTCTGTAAATGAAATCAGAGGACCTTA
TGATGTTGCAAACTTGTCATCATTGCTTGGTGTCTCTATGGAACGAGCAAAAGCTGCTGTTTCAAAAAATTGTAGGAATCTTATAGCTAATGCACTAAGGAGAAAGCAGT
TCTACAAGGAGACCATTCGAGTTGAAAGGATATCATCGGATGATAAATTAGATTTGAACAACCCTTGGTCAGTGGATTTGCTTAACTGGGATCCTATATCAACTGGCGAA
GGTGATTTGCTATTGGATGATATAGCAAAATCTTTTGCTGCCTCTAATAAAAAATCTAAAAATGTGGAAGCAATTGATTTCACTTCAGTCATTGACAACATGCCACCACA
AGGTTTTCTAGTCAATAATGTAATATTTGGCTCTGAGGCAAAACTGTCATTGAATGATAACAGAGACTTGTCGCACATTGCAAATGCCCTTGAGCCACCAATTGCAGTGA
ATGGAGTAATTCAACAACCCCATCCTCTTGGTGGTGACCACTATTCTACATTAGATCGTCAATGCTCTGTAATTGAAAGTTTTAAAGTTTTTCATTCACATGGTGATGCA
AGAAAAGTTTCAAATAATTTTGAGGAAAACAAAAGTACTGTTGAAGAAATTGTGGAGCCCAAAGCCTCATTTCCGGAAGAACCTATTGAAATGGACGCTGATAATGTGCA
ACCACAAAACCAGCCATCCAGTAGCGAGTTAAATGTAGTATCAACAAATGAGCTTCTGCTTTCTCCTACATTTACCAGAGATATATTAGCTGTTGTTTTTGGGAATGATG
AAAAAAATACCGTTCAAATGTCAGAGGATGTTGATTCTCATCGGAATGAGTATGACTTAAAAAGTTCTGATACATCGTTTGGTTTGGAAAATGTGTTGAGGAACAAAAGC
TCAATTACTTTTGTTTCAGAAAATCAGAAGAATATGGCTGTTGTAATGGATGGTTCATTTACTTCTGAAGAATGTTTACTTGGTGGAAGCCTTGGGGAGATCGAGGATGT
TGTAGTAGCCGCTGATCAGGTTTCTACTCTTAGTTCTTGTATGAGTGATAGAAAAGATGATTATTTGATCTCAATTCGACAACAAACATCTGAGGTGTTAATGGAAGAGC
AAAAAAGTGGAGAAGCTGACCCTCAGATAACACCACCACCTTTAGATCAGTCTATATGTGGTGTAATGTCAACAGTAAACGCGAGGCGGAGGCGACATCATCTAGCATTA
AAGCTTCCCCTCAAGAGGTTAATCAGTCCTTTGGCCTTCAAGAAAGTCCGGGAAACAAATGCAGAGAAATACCGACTAAGCTGGAGGAGACATCACCCAACGCTATTGCT
TCCATTCAAGCGCATAATGAATTCTCTAGCATTGAAGAAGGTTCGCAAGACTAAATGCTAGCAGTAATCTTTTCTTCAGTTTTACTTGGTTAATATATCCAATTTTGTTT
ATTTTACTTGGATACACATTTTGCATCAATTAATCTCAATTAATTATTTTTGGCTATCGGGTATCAAAAGGTTGCTATAGTTTTCTTGACCTCTTTTTAAGTTAAGGTGG
TTGTTTTGGGAAGTCTGTGTTGCATCGGTTATGCCCCCCTTTTTTTCCAGGGCCAAAGTCCATTCGCGAGTGGCTTACACAGTAACTTACTTTTACATACTAATGTCTCA
TGAGCTTGTTGCAACTTTTCCTTTCTAAATAA
Protein sequenceShow/hide protein sequence
MGFFDLNIPYNEHSSSSSIKANRIKIVAKLMEFGYSGIAYNRTIKGVMSDLDRCTIPLLNVSSLHSILPSFFASVEFHRDLLGVPKSSPFRQYTRLTICINNLQEVLAVN
SGNLILKTYDLIAVKPLNQYAFEHACEKLEIDIIAIDFAEKLPFRLKQGPIKSAIQRGVYFEIMYSDLLLDVHERRQMISTTKFLVDWTKGKNLIFSSAASSVNEIRGPY
DVANLSSLLGVSMERAKAAVSKNCRNLIANALRRKQFYKETIRVERISSDDKLDLNNPWSVDLLNWDPISTGEGDLLLDDIAKSFAASNKKSKNVEAIDFTSVIDNMPPQ
GFLVNNVIFGSEAKLSLNDNRDLSHIANALEPPIAVNGVIQQPHPLGGDHYSTLDRQCSVIESFKVFHSHGDARKVSNNFEENKSTVEEIVEPKASFPEEPIEMDADNVQ
PQNQPSSSELNVVSTNELLLSPTFTRDILAVVFGNDEKNTVQMSEDVDSHRNEYDLKSSDTSFGLENVLRNKSSITFVSENQKNMAVVMDGSFTSEECLLGGSLGEIEDV
VVAADQVSTLSSCMSDRKDDYLISIRQQTSEVLMEEQKSGEADPQITPPPLDQSICGVMSTVNARRRRHHLALKLPLKRLISPLAFKKVRETNAEKYRLSWRRHHPTLLL
PFKRIMNSLALKKVRKTKC