; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10014394 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10014394
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionProtein of unknown function (DUF300)
Genome locationChr02:10575847..10588530
RNA-Seq ExpressionHG10014394
SyntenyHG10014394
Gene Ontology termsGO:0016020 - membrane (cellular component)
GO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR005178 - Organic solute transporter subunit alpha/Transmembrane protein 184
IPR009651 - Putative methionine gamma-lyase
IPR015421 - Pyridoxal phosphate-dependent transferase, major domain
IPR015424 - Pyridoxal phosphate-dependent transferase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAB4107049.1 unnamed protein product [Lactuca saligna]2.1e-27667.37Show/hide
Query:  PSPNFRASVPAAAATLRTDTSLPVPLDRKHYTSDSPFAPEVIKAVDSLQYEFRAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHDEAGGREALDNA
        P+ NFR    ++       +S  + +   H  S   F PEV  AVD+L  EFRAVDNLVA+NS++VLKAFQNAR+GSHHF G TGYGH+EAGGREALD A
Subjt:  PSPNFRASVPAAAATLRTDTSLPVPLDRKHYTSDSPFAPEVIKAVDSLQYEFRAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHDEAGGREALDNA

Query:  FAEIVGAESAIVRSQFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGVEYREVPLADDGGLDWEKLASALKPQTKCALIQ
        FAEI GAESAIVRSQFFSGTHAITCALFA LRPGDELLAVAGAPYDTLEEVIG RD  GLGSLKDFG+ YREV LADDGGLDW+ L  ALKP+TKCALIQ
Subjt:  FAEIVGAESAIVRSQFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGVEYREVPLADDGGLDWEKLASALKPQTKCALIQ

Query:  RSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVETTEPPTVGADLIAGSLIKNPGGTLAPCGGYVAGRDKWVKAAAARLSAPGLGVDSGST
        RSCGYSWR+SLSV+EI +AI +IK QNP+CLVMVDNCYGEF ET EPP VGADLIAGSLIKNPGGT+APCGGYVAG++KWVKAAAARLSAPGLGVD GST
Subjt:  RSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVETTEPPTVGADLIAGSLIKNPGGTLAPCGGYVAGRDKWVKAAAARLSAPGLGVDSGST

Query:  PGDIMRTFFQGLFLSPQMVGEAVKGMILIAEVMASKGYKVQPLPRALRHDIVQAVQLGSREVLLAFCEAVQRSSPVASFTKPVPGITPGYASEVIFADGT
        PGDIMR FFQGL+LSPQMVGE++KG +LIAEVM++KGYKVQPLPR  RHDIVQAVQLGSRE LLAFCEAVQRSSPV+S+TKP+ G+T GYASEVIFADGT
Subjt:  PGDIMRTFFQGLFLSPQMVGEAVKGMILIAEVMASKGYKVQPLPRALRHDIVQAVQLGSREVLLAFCEAVQRSSPVASFTKPVPGITPGYASEVIFADGT

Query:  FIDGSTSELSCDGPLREPFAVFCQ----------VCAIFISHFLVTAV-----VNISAVTMDYGQMIFLGVTSSVVLTAIFSLWLLTQHLSNWKKPAEQK
        FIDGSTSELSCDGPLREPF VFCQ          V   F    L+T +     + +  V M   Q   +G  + V +T + +L L+  HLS+WKKP EQK
Subjt:  FIDGSTSELSCDGPLREPFAVFCQ----------VCAIFISHFLVTAV-----VNISAVTMDYGQMIFLGVTSSVVLTAIFSLWLLTQHLSNWKKPAEQK

Query:  AIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREIHHTFPMTLFQWLCEYPSMVNVNTD--
        AI++IILMAP+YA  SY+GLL+   S TFF+ L+SIKECYEALV++KFL+LLY+YLNISISKNIVPDEIKGREIHH+FPMTLFQ     P  V +N    
Subjt:  AIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREIHHTFPMTLFQWLCEYPSMVNVNTD--

Query:  ------------IR-TCTLSEYFVGQLRYFLEY------------VSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIK
                    IR  C++    +  L  + ++            VSLALY+LV+FYHVF KEL PH PLAKFLC+KGIVFFCFWQGIVL  L A+GIIK
Subjt:  ------------IR-TCTLSEYFVGQLRYFLEY------------VSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIK

Query:  AEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPYKSKSAAKPKLEKKE
        + H W DV HI +ALQN LV VEMVFFAM QM AY+A+PYK   AA  K +KKE
Subjt:  AEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPYKSKSAAKPKLEKKE

KAA8523841.1 hypothetical protein F0562_010264 [Nyssa sinensis]2.1e-28165.47Show/hide
Query:  LSCSLLPYPSPNFRASVPAAAATLRTDTSLPVPLDRKHYTSDSPFAPEVIKAVDSLQYEFRAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHDEAG
        LSC+   YP+   RAS   A   +R+   + VP  R H+  D+PFAPEV KAVDSL  EFRAVDNLVARN+++VL+A+QNAR+G HHFGG TGYGH+EAG
Subjt:  LSCSLLPYPSPNFRASVPAAAATLRTDTSLPVPLDRKHYTSDSPFAPEVIKAVDSLQYEFRAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHDEAG

Query:  GREALDNAFAEIVGAESAIVRSQFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGVEYREVPLADDGGLDWEKLASALKP
        GREALD  FAEI GAESAIVRSQFFSGTHAITCALFA LRPGDELLAVAGAPYDTLEEVIG RDS GLGSLKDFGV+YREVPLA+DGGLDW+ L  ALKP
Subjt:  GREALDNAFAEIVGAESAIVRSQFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGVEYREVPLADDGGLDWEKLASALKP

Query:  QTKCALIQRSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVETTEPPTVGADLIAGSLIKNPGGTLAPCGGYVAGRDKWVKAAAARLSAPG
        QTKCALIQRSCGYSWRRSLSV EIG+AI+++K QNPDCLVMVDNCYGEFVE  EPP VGADLIAGSLIKNPGGT+APCGGYVAG++KWVKAAAARLSAPG
Subjt:  QTKCALIQRSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVETTEPPTVGADLIAGSLIKNPGGTLAPCGGYVAGRDKWVKAAAARLSAPG

Query:  LGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVKGMILIAEVMASKGYKVQPLPRALRHDIVQAVQLGSREVLLAFCEAVQRSSPVASFTKPVPGITPGYAS
        LG+D GSTPGDIMRTFFQGLFLSPQMVGEA+KG  LIAEVMA+KGYKVQPLPR  RHD VQAVQLG+RE LLAFCEAVQRSSPV SFTKPV G TPGYAS
Subjt:  LGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVKGMILIAEVMASKGYKVQPLPRALRHDIVQAVQLGSREVLLAFCEAVQRSSPVASFTKPVPGITPGYAS

Query:  EVIFADGTFIDGSTSELSCDGPLREPFAVFCQ-------------------------------------------VCAIFISHFL---------------
        EVIFADGTFIDGSTSELSCDGPLREPF+VFCQ                                             ++ IS FL               
Subjt:  EVIFADGTFIDGSTSELSCDGPLREPFAVFCQ-------------------------------------------VCAIFISHFL---------------

Query:  VTAVVNISAVTMDYGQMIFLGVTSSVVLTAIFSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFL
        +T    IS   M+ GQ+  +G T  V+LT  F++ LL+QH   WKKP EQKAI+IIILMAP+YA  S++GLL+F  S  FF FL+S+KECYEALV++KFL
Subjt:  VTAVVNISAVTMDYGQMIFLGVTSSVVLTAIFSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFL

Query:  SLLYSYLNISISKNIVPDEIKGREIHHTFPMTLFQWLCEYPSMVNVN--------------TDIR-TCTLSEYFVGQLRYFLEY------------VSLA
        +L+Y+YLNISISKNIVPDEIKGREIHH+FPMTLFQ     P  V +N                IR  C++    +  L  +  +            VSLA
Subjt:  SLLYSYLNISISKNIVPDEIKGREIHHTFPMTLFQWLCEYPSMVNVN--------------TDIR-TCTLSEYFVGQLRYFLEY------------VSLA

Query:  LYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPYKSKSAAKPK
        LYSLVVFYHVF KEL+PH PLAKFLCIKGIVFFCFWQG+VLE+LAA+G+I++ H W DVE I EALQN LVCVEMVFF+  Q  AYSA+PY   +    K
Subjt:  LYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPYKSKSAAKPK

Query:  LEKKEHID
         +K E  D
Subjt:  LEKKEHID

KAG6591988.1 hypothetical protein SDJN03_14334, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0083.96Show/hide
Query:  MWGLSCSLLPYPSPNFRASVPAAAATLRTDTSLPVPLDRKHYTSDSPFAPEVIKAVDSLQYEFRAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHD
        MWGLSCS+  YPSPNFR S P  AATLR+ TSLPV LDRKHYTSD+PFAPEV+KAVDSLQYEFRAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHD
Subjt:  MWGLSCSLLPYPSPNFRASVPAAAATLRTDTSLPVPLDRKHYTSDSPFAPEVIKAVDSLQYEFRAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHD

Query:  EAGGREALDNAFAEIVGAESAIVRSQFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGVEYREVPLADDGGLDWEKLASA
        EAGGREALDNAFAEIVGAESAIVRSQFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGVEYREVPLA+DGGLDWEKLAS+
Subjt:  EAGGREALDNAFAEIVGAESAIVRSQFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGVEYREVPLADDGGLDWEKLASA

Query:  LKPQTKCALIQRSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVETTEPPTVGADLIAGSLIKNPGGTLAPCGGYVAGRDKWVKAAAARLS
        LKPQTKCALIQRSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVET EPPTVGADLIAGSLIKNPGGTLAPCGGYVAGRDKWVKAAAARLS
Subjt:  LKPQTKCALIQRSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVETTEPPTVGADLIAGSLIKNPGGTLAPCGGYVAGRDKWVKAAAARLS

Query:  APGLGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVKGMILIAEVMASKGYKVQPLPRALRHDIVQAVQLGSREVLLAFCEAVQRSSPVASFTKPVPGITPG
        APGLGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVKGMILIAEVMASKGYKVQPLPR  RHD VQAVQLGSRE+LLAFCEAVQRSSPVAS+TKPVPGITPG
Subjt:  APGLGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVKGMILIAEVMASKGYKVQPLPRALRHDIVQAVQLGSREVLLAFCEAVQRSSPVASFTKPVPGITPG

Query:  YASEVIFADGTFIDGSTSELSCDGPLREPFAVFCQVCAIFISHFLVT---AVVNISAVTMDYGQMIFLGVTSSVVLTAIFSLWLLTQHLSNWKKPAEQKA
        YASEVIFADGTFIDGSTSELSCDGPLREPFAVFCQV    + +   T    V N S V++ Y           ++   +FSLWLL+QHLSNW+KPAEQKA
Subjt:  YASEVIFADGTFIDGSTSELSCDGPLREPFAVFCQVCAIFISHFLVT---AVVNISAVTMDYGQMIFLGVTSSVVLTAIFSLWLLTQHLSNWKKPAEQKA

Query:  IVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREIHHTFPMTLFQ----------------W
        IV+IILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREIHH+FPMTLFQ                W
Subjt:  IVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREIHHTFPMTLFQ----------------W

Query:  LCEYPSMVNVNTDIR-TCTLSEYFVGQLRYFLEY-----VSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWF
          ++  +  V + +  +  L + +   L +         VSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWF
Subjt:  LCEYPSMVNVNTDIR-TCTLSEYFVGQLRYFLEY-----VSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWF

Query:  DVEHINEALQNTLVCVEMVFFAMIQMSAYSASPYKSKSAAKPKLEKKE
        DVEHINEA+QNTLVCVEMVFFAM+QMSAYSASPY+ +SAAK K EKKE
Subjt:  DVEHINEALQNTLVCVEMVFFAMIQMSAYSASPYKSKSAAKPKLEKKE

KAG7024863.1 ynbB [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0079.79Show/hide
Query:  MWGLSCSLLPYPSPNFRASVPAAAATLRTDTSLPVPLDRKHYTSDSPFAPEVIKAVDSLQYEFRAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHD
        MWGLSCS+  YPSPNFR S P  AATLR+ TSLPV LDRKHYTSD+PFAPEV+KAVDSLQYEFRAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHD
Subjt:  MWGLSCSLLPYPSPNFRASVPAAAATLRTDTSLPVPLDRKHYTSDSPFAPEVIKAVDSLQYEFRAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHD

Query:  EAGGREALDNAFAEIVGAESAIVRSQFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGVEYREVPLADDGGLDWEKLASA
        EAGGREALDNAFAEIVGAESAIVRSQFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGVEYREVPLA+DGGLDWEKLAS+
Subjt:  EAGGREALDNAFAEIVGAESAIVRSQFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGVEYREVPLADDGGLDWEKLASA

Query:  LKPQTKCALIQRSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVETTEPPTVGADLIAGSLIKNPGGTLAPCGGYVAGRDKWVKAAAARLS
        LKPQTKCALIQRSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVET EPPTVGADLIAGSLIKNPGGTLAPCGGYVAGRDKWVKAAAARLS
Subjt:  LKPQTKCALIQRSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVETTEPPTVGADLIAGSLIKNPGGTLAPCGGYVAGRDKWVKAAAARLS

Query:  APGLGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVKGMILIAEVMASKGYKVQPLPRALRHDIVQAVQLGSREVLLAFCEAVQRSSPVASFTKPVPGITPG
        APGLGVDSGSTPGDIMRTFFQGLFLSPQM                                   AVQLGSRE+LLAFCEAVQRSSPVAS+TKPVPGITPG
Subjt:  APGLGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVKGMILIAEVMASKGYKVQPLPRALRHDIVQAVQLGSREVLLAFCEAVQRSSPVASFTKPVPGITPG

Query:  YASEVIFADGTFIDGSTSELSCDGPLREPFAVFCQVCAIFISHFLVTAVVN----------------------ISAVTMDYGQMIFLGVTSSVVLTAIFS
        YASEVIFADGTFIDGSTSELSCDGPLREPFAVFCQ    +    LV   V+                      ISA+TMDYG MIFL VTSSVVLT++FS
Subjt:  YASEVIFADGTFIDGSTSELSCDGPLREPFAVFCQVCAIFISHFLVTAVVN----------------------ISAVTMDYGQMIFLGVTSSVVLTAIFS

Query:  LWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREIHHTFPMTL
        LWLL+QHLSNW+KPAEQKAIV+IILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREIHH+FPMTL
Subjt:  LWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREIHHTFPMTL

Query:  FQ----------------WLCEYPSMVNVNTDIR-TCTLSEYFVGQLRYFLEY-----VSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQG
        FQ                W  ++  +  V + +  +  L + +   L +         VSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQG
Subjt:  FQ----------------WLCEYPSMVNVNTDIR-TCTLSEYFVGQLRYFLEY-----VSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQG

Query:  IVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPYKSKSAAKPK
        IVLEMLAAVGIIKAEHAWFDVEHINEA+QNTLVCVEMVFFAM+QMSAYSASPY+ +SAAK K
Subjt:  IVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPYKSKSAAKPK

RXI03688.1 hypothetical protein DVH24_004340 [Malus domestica]5.4e-27767.14Show/hide
Query:  MWGLSCSLLPYPSPNFRASVPAAAATLRTDTSLPVPLDRKHYTSDSPFAPEVIKAVDSLQYEFRAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHD
        MW LSC    YP+   RASVP   AT R+ + L VP +   +  DSPF PEV  AVDSL  EFRAVDNLVARN+ +VLKAFQNAR+GSHHF G TGYGHD
Subjt:  MWGLSCSLLPYPSPNFRASVPAAAATLRTDTSLPVPLDRKHYTSDSPFAPEVIKAVDSLQYEFRAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHD

Query:  EAGGREALDNAFAEIVGAESAIVRS----------------------QFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFG
        EAGGREALD AFAEIVGAESAIVRS                      QFFSGTHAITCALFA LRPGDELLAVAG PYDTLEEVIGKRDS G+GSL DFG
Subjt:  EAGGREALDNAFAEIVGAESAIVRS----------------------QFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFG

Query:  VEYREVPLADDGGLDWEKLASALKPQTKCALIQRSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVETTEPPTVGADLIAGSLIKNPGGTL
        V+YREVPLA+DGGL+W+ L  AL+P+TKCALIQRSCGYSWRRSLSVDEIG+AI++IK QNP+CLVMVDNCYGEFVE+ EPP VGADLIAGSLIKNPGGT+
Subjt:  VEYREVPLADDGGLDWEKLASALKPQTKCALIQRSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVETTEPPTVGADLIAGSLIKNPGGTL

Query:  APCGGYVAGRDKWVKAAAARLSAPGLGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVKGMILIAEVMASKGYKVQPLPRALRHDIVQAVQLGSREVLLAFC
        APCGGYVAGR+KWVKAA+ARLSAPGLGVD G+TPGDIMR+FFQGLFLSPQMVGEA+KG +++AEVMA++GYKVQPLPR  RHD VQAVQLGSRE LLAFC
Subjt:  APCGGYVAGRDKWVKAAAARLSAPGLGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVKGMILIAEVMASKGYKVQPLPRALRHDIVQAVQLGSREVLLAFC

Query:  EAVQRSSPVASFTKPVPGITPGYASE--------------VIFADGTFIDGSTSELSCDGPLREPFAVFCQVCAIFISHFLVTAVVNISAVTMDYGQMIF
        EAVQR+SPV SFTKPV G TPGYASE              VIFADGTFIDGSTSELSCDGPLREPFAVFCQ      SH+    +V       +  Q++ 
Subjt:  EAVQRSSPVASFTKPVPGITPGYASE--------------VIFADGTFIDGSTSELSCDGPLREPFAVFCQVCAIFISHFLVTAVVNISAVTMDYGQMIF

Query:  LGVTSSVVLTAIFSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDE
        LG T  +++T  FSL LL++H   W KP EQKAIVIIILMAPLYA  S++GLL++  S   F  L+SIKECYEALVI+KFL+LLYSYLNISISKNIVPDE
Subjt:  LGVTSSVVLTAIFSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDE

Query:  IKGREIHHTFPMTLFQWLCEYPSMVNVNTDIRTCTLSEYFVGQ--------------LRYFLEY---------------VSLALYSLVVFYHVFDKELKP
        IKGREIHH+FPMTLF      P  V +N    T  L +Y+  Q              L+    Y               VSLALYSL+ FYHVF KEL P
Subjt:  IKGREIHHTFPMTLFQWLCEYPSMVNVNTDIRTCTLSEYFVGQ--------------LRYFLEY---------------VSLALYSLVVFYHVFDKELKP

Query:  HSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYS
        H PL KFLCIKGIVFFCFWQGIVL++LAA+ II++ H W DVEHI EALQN LVCVEMVFF+++Q  AY+
Subjt:  HSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYS

TrEMBL top hitse value%identityAlignment
A0A3Q7HH85 Uncharacterized protein6.9e-27066.85Show/hide
Query:  LSCSLLPYPSPNFRASVPAAAATLRTDTSLPVPLDRKHYTSDSPFAPEVIKAVDSLQYEFRAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHDEAG
        L C+   YP+   R  V +A A +R+ + + VP   + + SDSPF PEV KAVDSL  EFR VDNLVARN+A+VL+AFQ  ++GSHHFGGSTGYGH+EAG
Subjt:  LSCSLLPYPSPNFRASVPAAAATLRTDTSLPVPLDRKHYTSDSPFAPEVIKAVDSLQYEFRAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHDEAG

Query:  GREALDNAFAEIVGAESAIVRSQFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGVEYREVPLADDGGLDWEKLASALKP
        GREALD AFAEIVGAESAIVRSQFFSGTHAITCALFA LRPGDELLA+AGAPYDTLEEVIGKRDS G GSLKDFGVEYREVPLA+DGGLDW+ L ++++P
Subjt:  GREALDNAFAEIVGAESAIVRSQFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGVEYREVPLADDGGLDWEKLASALKP

Query:  QTKCALIQRSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVETTEPPTVGADLIAGSLIKNPGGTLAPCGGYVAGRDKWVKAAAARLSAPG
         TKCALIQRSCGYSWRRSLSV EIG+AI +IKMQNP C+VMVDNCYGEFV+  EPP VGADLIAGSLIKNPGGT+APCGGYVAGR KWV+AAAARLSAPG
Subjt:  QTKCALIQRSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVETTEPPTVGADLIAGSLIKNPGGTLAPCGGYVAGRDKWVKAAAARLSAPG

Query:  LGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVKGMILIAEVMASKGYKVQPLPRALRHDIVQAVQLGSREVLLAFCEAVQRSSPVASFTKPVPGITPGYAS
        LGVD GSTPGDIMRT FQGLFLSPQMVGEA+KG  LIAEVMA+KGYKVQPL R  RHD VQAVQLG+RE LL+FCEAVQRSSPV+SF +PV G T GYAS
Subjt:  LGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVKGMILIAEVMASKGYKVQPLPRALRHDIVQAVQLGSREVLLAFCEAVQRSSPVASFTKPVPGITPGYAS

Query:  EVIFADGTFIDGSTSELSCDGPLREPFAVFCQVCAIFISHFLVTAVVNISAVTMDYGQMIFLGVTSSVVLTAIFSLWLLTQHLSNWKKPAEQKAIVIIIL
        EVIFADGTFIDGSTSELSCDGPLREPF+VFCQ    +    LV   ++            + G+ +   L+A   + L+T+H ++WKKP EQKAI+II+L
Subjt:  EVIFADGTFIDGSTSELSCDGPLREPFAVFCQVCAIFISHFLVTAVVNISAVTMDYGQMIFLGVTSSVVLTAIFSLWLLTQHLSNWKKPAEQKAIVIIIL

Query:  MAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREIHHTFPMTLFQWLCEYPSMVNVNTDIRTCTLSEY
        MAPLYA +S+IGL++FM S  FF FLES+KECYEA+V++KFL L+Y+YLNISISKNIVPDEIKGR+IHH+FPMTLFQ     P   ++N    T  L + 
Subjt:  MAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREIHHTFPMTLFQWLCEYPSMVNVNTDIRTCTLSEY

Query:  FVGQ--------------LRYFLEY---------------VSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAW
        +  Q              L+ F  Y               VSLALYSLVVFYHVF KEL PH PLAKFLC+KGIVFF FWQGI+L +L ++GIIK+ + W
Subjt:  FVGQ--------------LRYFLEY---------------VSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAW

Query:  FDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPYKSKS
         +VE + E +QN LV +EMVFFA++   AYSA+PY++++
Subjt:  FDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPYKSKS

A0A498K8E2 Uncharacterized protein2.6e-27767.14Show/hide
Query:  MWGLSCSLLPYPSPNFRASVPAAAATLRTDTSLPVPLDRKHYTSDSPFAPEVIKAVDSLQYEFRAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHD
        MW LSC    YP+   RASVP   AT R+ + L VP +   +  DSPF PEV  AVDSL  EFRAVDNLVARN+ +VLKAFQNAR+GSHHF G TGYGHD
Subjt:  MWGLSCSLLPYPSPNFRASVPAAAATLRTDTSLPVPLDRKHYTSDSPFAPEVIKAVDSLQYEFRAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHD

Query:  EAGGREALDNAFAEIVGAESAIVRS----------------------QFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFG
        EAGGREALD AFAEIVGAESAIVRS                      QFFSGTHAITCALFA LRPGDELLAVAG PYDTLEEVIGKRDS G+GSL DFG
Subjt:  EAGGREALDNAFAEIVGAESAIVRS----------------------QFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFG

Query:  VEYREVPLADDGGLDWEKLASALKPQTKCALIQRSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVETTEPPTVGADLIAGSLIKNPGGTL
        V+YREVPLA+DGGL+W+ L  AL+P+TKCALIQRSCGYSWRRSLSVDEIG+AI++IK QNP+CLVMVDNCYGEFVE+ EPP VGADLIAGSLIKNPGGT+
Subjt:  VEYREVPLADDGGLDWEKLASALKPQTKCALIQRSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVETTEPPTVGADLIAGSLIKNPGGTL

Query:  APCGGYVAGRDKWVKAAAARLSAPGLGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVKGMILIAEVMASKGYKVQPLPRALRHDIVQAVQLGSREVLLAFC
        APCGGYVAGR+KWVKAA+ARLSAPGLGVD G+TPGDIMR+FFQGLFLSPQMVGEA+KG +++AEVMA++GYKVQPLPR  RHD VQAVQLGSRE LLAFC
Subjt:  APCGGYVAGRDKWVKAAAARLSAPGLGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVKGMILIAEVMASKGYKVQPLPRALRHDIVQAVQLGSREVLLAFC

Query:  EAVQRSSPVASFTKPVPGITPGYASE--------------VIFADGTFIDGSTSELSCDGPLREPFAVFCQVCAIFISHFLVTAVVNISAVTMDYGQMIF
        EAVQR+SPV SFTKPV G TPGYASE              VIFADGTFIDGSTSELSCDGPLREPFAVFCQ      SH+    +V       +  Q++ 
Subjt:  EAVQRSSPVASFTKPVPGITPGYASE--------------VIFADGTFIDGSTSELSCDGPLREPFAVFCQVCAIFISHFLVTAVVNISAVTMDYGQMIF

Query:  LGVTSSVVLTAIFSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDE
        LG T  +++T  FSL LL++H   W KP EQKAIVIIILMAPLYA  S++GLL++  S   F  L+SIKECYEALVI+KFL+LLYSYLNISISKNIVPDE
Subjt:  LGVTSSVVLTAIFSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDE

Query:  IKGREIHHTFPMTLFQWLCEYPSMVNVNTDIRTCTLSEYFVGQ--------------LRYFLEY---------------VSLALYSLVVFYHVFDKELKP
        IKGREIHH+FPMTLF      P  V +N    T  L +Y+  Q              L+    Y               VSLALYSL+ FYHVF KEL P
Subjt:  IKGREIHHTFPMTLFQWLCEYPSMVNVNTDIRTCTLSEYFVGQ--------------LRYFLEY---------------VSLALYSLVVFYHVFDKELKP

Query:  HSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYS
        H PL KFLCIKGIVFFCFWQGIVL++LAA+ II++ H W DVEHI EALQN LVCVEMVFF+++Q  AY+
Subjt:  HSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYS

A0A5J5A352 Uncharacterized protein1.0e-28165.47Show/hide
Query:  LSCSLLPYPSPNFRASVPAAAATLRTDTSLPVPLDRKHYTSDSPFAPEVIKAVDSLQYEFRAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHDEAG
        LSC+   YP+   RAS   A   +R+   + VP  R H+  D+PFAPEV KAVDSL  EFRAVDNLVARN+++VL+A+QNAR+G HHFGG TGYGH+EAG
Subjt:  LSCSLLPYPSPNFRASVPAAAATLRTDTSLPVPLDRKHYTSDSPFAPEVIKAVDSLQYEFRAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHDEAG

Query:  GREALDNAFAEIVGAESAIVRSQFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGVEYREVPLADDGGLDWEKLASALKP
        GREALD  FAEI GAESAIVRSQFFSGTHAITCALFA LRPGDELLAVAGAPYDTLEEVIG RDS GLGSLKDFGV+YREVPLA+DGGLDW+ L  ALKP
Subjt:  GREALDNAFAEIVGAESAIVRSQFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGVEYREVPLADDGGLDWEKLASALKP

Query:  QTKCALIQRSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVETTEPPTVGADLIAGSLIKNPGGTLAPCGGYVAGRDKWVKAAAARLSAPG
        QTKCALIQRSCGYSWRRSLSV EIG+AI+++K QNPDCLVMVDNCYGEFVE  EPP VGADLIAGSLIKNPGGT+APCGGYVAG++KWVKAAAARLSAPG
Subjt:  QTKCALIQRSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVETTEPPTVGADLIAGSLIKNPGGTLAPCGGYVAGRDKWVKAAAARLSAPG

Query:  LGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVKGMILIAEVMASKGYKVQPLPRALRHDIVQAVQLGSREVLLAFCEAVQRSSPVASFTKPVPGITPGYAS
        LG+D GSTPGDIMRTFFQGLFLSPQMVGEA+KG  LIAEVMA+KGYKVQPLPR  RHD VQAVQLG+RE LLAFCEAVQRSSPV SFTKPV G TPGYAS
Subjt:  LGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVKGMILIAEVMASKGYKVQPLPRALRHDIVQAVQLGSREVLLAFCEAVQRSSPVASFTKPVPGITPGYAS

Query:  EVIFADGTFIDGSTSELSCDGPLREPFAVFCQ-------------------------------------------VCAIFISHFL---------------
        EVIFADGTFIDGSTSELSCDGPLREPF+VFCQ                                             ++ IS FL               
Subjt:  EVIFADGTFIDGSTSELSCDGPLREPFAVFCQ-------------------------------------------VCAIFISHFL---------------

Query:  VTAVVNISAVTMDYGQMIFLGVTSSVVLTAIFSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFL
        +T    IS   M+ GQ+  +G T  V+LT  F++ LL+QH   WKKP EQKAI+IIILMAP+YA  S++GLL+F  S  FF FL+S+KECYEALV++KFL
Subjt:  VTAVVNISAVTMDYGQMIFLGVTSSVVLTAIFSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFL

Query:  SLLYSYLNISISKNIVPDEIKGREIHHTFPMTLFQWLCEYPSMVNVN--------------TDIR-TCTLSEYFVGQLRYFLEY------------VSLA
        +L+Y+YLNISISKNIVPDEIKGREIHH+FPMTLFQ     P  V +N                IR  C++    +  L  +  +            VSLA
Subjt:  SLLYSYLNISISKNIVPDEIKGREIHHTFPMTLFQWLCEYPSMVNVN--------------TDIR-TCTLSEYFVGQLRYFLEY------------VSLA

Query:  LYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPYKSKSAAKPK
        LYSLVVFYHVF KEL+PH PLAKFLCIKGIVFFCFWQG+VLE+LAA+G+I++ H W DVE I EALQN LVCVEMVFF+  Q  AYSA+PY   +    K
Subjt:  LYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPYKSKSAAKPK

Query:  LEKKEHID
         +K E  D
Subjt:  LEKKEHID

A0A5N5GL16 Uncharacterized protein1.4e-25963.97Show/hide
Query:  MWGLSCSLLPYPSPNFRASVPAAAATLRTDTSLPVPLDRKHYTSDSPFAPEVIKAVDSLQYEFRAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHD
        MW LSC+   YP+ + RASVP   AT R+ + L VP +   +  DSPF PEV  AVDSL  EFRAVDNLVARN+ +VLKAFQNAR+GSHHF G TGYGHD
Subjt:  MWGLSCSLLPYPSPNFRASVPAAAATLRTDTSLPVPLDRKHYTSDSPFAPEVIKAVDSLQYEFRAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHD

Query:  EAGGREALDNAFAEIVGAESAIVRSQFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGVEYREVPLADDGGLDWEKLASA
        EAGGREALD AFAEIVGAESAIVRSQFFSGTHAITCALFA LRPGDELLAVAG PYDTLEEVIGKRDS G+GSL DFGV+YREVPLA+DGGL+W+ L  A
Subjt:  EAGGREALDNAFAEIVGAESAIVRSQFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGVEYREVPLADDGGLDWEKLASA

Query:  LKPQTKCALIQRSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVETTEPPTVGADLIAGSLIKNPGGTLAPCGGYVAGRDKWVKAAAARLS
        L+P+TKCALIQRSCGYSWRRSLSVDEIG+AI++IK QN +CLVMVDNCYGEFVE+ EPP VGADLIAGSLIKNPGGT+APCGGYVAGR+KWVKAA+ARLS
Subjt:  LKPQTKCALIQRSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVETTEPPTVGADLIAGSLIKNPGGTLAPCGGYVAGRDKWVKAAAARLS

Query:  APGLGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVKGMILIAEVMASKGYKVQPLPRALRHDIVQAVQLGSREVLLAFCEAVQRSSPVASFTKPVPGITPG
        APGLGVD G+TPGDIMR FFQGLFLSPQMVGEA+KG +L+AEVMA++GYKVQPLPR  RHD VQAVQLGSRE LLAFCEAVQR+SPV SFTKPV G TPG
Subjt:  APGLGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVKGMILIAEVMASKGYKVQPLPRALRHDIVQAVQLGSREVLLAFCEAVQRSSPVASFTKPVPGITPG

Query:  YASE--------------VIFADGTFIDGSTSELSCDGPLREPFAVFCQVCAIFISHFLVTAVVNISAVTMDYGQMIFLGVTSSVVLTAIFSLWLLTQHL
        YASE              VIFADGTFIDGSTSELSCDGPLREPFAVFCQ                                                 H 
Subjt:  YASE--------------VIFADGTFIDGSTSELSCDGPLREPFAVFCQVCAIFISHFLVTAVVNISAVTMDYGQMIFLGVTSSVVLTAIFSLWLLTQHL

Query:  SNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREIHHTFPMTLFQWLCEYP
        + W                         GL+    S   F  L+SIKECYEALVI+KFL+LLYSYLNISISKNIVPDEIKGREIHH+FPMTLF      P
Subjt:  SNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREIHHTFPMTLFQWLCEYP

Query:  SMVNVNTDIRTCTLSEYFVGQ--------------LRYFLEY---------------VSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGI
          V +N    T  L +Y+  Q              L+    Y               VSLALYSLV FYHVF KEL PH PL KFLCIKGIVFFCFWQGI
Subjt:  SMVNVNTDIRTCTLSEYFVGQ--------------LRYFLEY---------------VSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGI

Query:  VLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPYKSKSAAKPKLEKKE
        VL++LAA+ II++ H W DVEHI EALQN LVCVEMVFF+++Q  AYSA PY+    +    ++K+
Subjt:  VLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPYKSKSAAKPKLEKKE

A0A6S7PJH8 Uncharacterized protein1.0e-27667.37Show/hide
Query:  PSPNFRASVPAAAATLRTDTSLPVPLDRKHYTSDSPFAPEVIKAVDSLQYEFRAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHDEAGGREALDNA
        P+ NFR    ++       +S  + +   H  S   F PEV  AVD+L  EFRAVDNLVA+NS++VLKAFQNAR+GSHHF G TGYGH+EAGGREALD A
Subjt:  PSPNFRASVPAAAATLRTDTSLPVPLDRKHYTSDSPFAPEVIKAVDSLQYEFRAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHDEAGGREALDNA

Query:  FAEIVGAESAIVRSQFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGVEYREVPLADDGGLDWEKLASALKPQTKCALIQ
        FAEI GAESAIVRSQFFSGTHAITCALFA LRPGDELLAVAGAPYDTLEEVIG RD  GLGSLKDFG+ YREV LADDGGLDW+ L  ALKP+TKCALIQ
Subjt:  FAEIVGAESAIVRSQFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGVEYREVPLADDGGLDWEKLASALKPQTKCALIQ

Query:  RSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVETTEPPTVGADLIAGSLIKNPGGTLAPCGGYVAGRDKWVKAAAARLSAPGLGVDSGST
        RSCGYSWR+SLSV+EI +AI +IK QNP+CLVMVDNCYGEF ET EPP VGADLIAGSLIKNPGGT+APCGGYVAG++KWVKAAAARLSAPGLGVD GST
Subjt:  RSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVETTEPPTVGADLIAGSLIKNPGGTLAPCGGYVAGRDKWVKAAAARLSAPGLGVDSGST

Query:  PGDIMRTFFQGLFLSPQMVGEAVKGMILIAEVMASKGYKVQPLPRALRHDIVQAVQLGSREVLLAFCEAVQRSSPVASFTKPVPGITPGYASEVIFADGT
        PGDIMR FFQGL+LSPQMVGE++KG +LIAEVM++KGYKVQPLPR  RHDIVQAVQLGSRE LLAFCEAVQRSSPV+S+TKP+ G+T GYASEVIFADGT
Subjt:  PGDIMRTFFQGLFLSPQMVGEAVKGMILIAEVMASKGYKVQPLPRALRHDIVQAVQLGSREVLLAFCEAVQRSSPVASFTKPVPGITPGYASEVIFADGT

Query:  FIDGSTSELSCDGPLREPFAVFCQ----------VCAIFISHFLVTAV-----VNISAVTMDYGQMIFLGVTSSVVLTAIFSLWLLTQHLSNWKKPAEQK
        FIDGSTSELSCDGPLREPF VFCQ          V   F    L+T +     + +  V M   Q   +G  + V +T + +L L+  HLS+WKKP EQK
Subjt:  FIDGSTSELSCDGPLREPFAVFCQ----------VCAIFISHFLVTAV-----VNISAVTMDYGQMIFLGVTSSVVLTAIFSLWLLTQHLSNWKKPAEQK

Query:  AIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREIHHTFPMTLFQWLCEYPSMVNVNTD--
        AI++IILMAP+YA  SY+GLL+   S TFF+ L+SIKECYEALV++KFL+LLY+YLNISISKNIVPDEIKGREIHH+FPMTLFQ     P  V +N    
Subjt:  AIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREIHHTFPMTLFQWLCEYPSMVNVNTD--

Query:  ------------IR-TCTLSEYFVGQLRYFLEY------------VSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIK
                    IR  C++    +  L  + ++            VSLALY+LV+FYHVF KEL PH PLAKFLC+KGIVFFCFWQGIVL  L A+GIIK
Subjt:  ------------IR-TCTLSEYFVGQLRYFLEY------------VSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIK

Query:  AEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPYKSKSAAKPKLEKKE
        + H W DV HI +ALQN LV VEMVFFAM QM AY+A+PYK   AA  K +KKE
Subjt:  AEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPYKSKSAAKPKLEKKE

SwissProt top hitse value%identityAlignment
P45624 Uncharacterized 33.9 kDa protein in glnA 5'region7.6e-6443.33Show/hide
Query:  PYDTLEEVIGKRDSQGLGSLKDFGVEYREVPLADDGGLDWEKLASALK-PQTKCALIQRSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFV
        PYDT+++VIG    +  G+L   G+ +  VPL ++GG+D+E+    LK  Q    +IQRS GY  R+S +VD+I K    +K  +P+ LV VDNCYGEF 
Subjt:  PYDTLEEVIGKRDSQGLGSLKDFGVEYREVPLADDGGLDWEKLASALK-PQTKCALIQRSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFV

Query:  ETTEPPTVGADLIAGSLIKNPGGTLAPCGGYVAGRDKWVKAAAARLSAPGLGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVKGMILIAEVMASKGYKVQP
        E  EP   G D  AGSLIKN GG +A  GGY+ G+++ V+ AA RL+APG+G + G+T  + M  F++G FL+P   GEA+KGMI  A ++   G +V P
Subjt:  ETTEPPTVGADLIAGSLIKNPGGTLAPCGGYVAGRDKWVKAAAARLSAPGLGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVKGMILIAEVMASKGYKVQP

Query:  LPRALRHDIVQAVQLGSREVLLAFCEAVQRSSPVASFTKPVPGITPGYASEVIFADGTFIDGSTSELSCDGPLREPFAVFCQVCAIFISH--FLVTAVVN
             R D++Q +     E ++ F + VQ++SP+ SF +P+P   PGY  +VI A G F+ GST E S DGP+R P+A++ Q C +  +H    VT  VN
Subjt:  LPRALRHDIVQAVQLGSREVLLAFCEAVQRSSPVASFTKPVPGITPGYASEVIFADGTFIDGSTSELSCDGPLREPFAVFCQVCAIFISH--FLVTAVVN

P94479 Uncharacterized protein YnbB2.9e-9545.16Show/hide
Query:  RAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHDEAGGREALDNAFAEIVGAESAIVRSQFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVI
        + ++ +  RN  +VL++++  ++   HF  STGYG+D+  GR+ L++ +A++ G E+ +VR Q  SGTHAI+ ALF +LRPGDELL + G PYDTLEE++
Subjt:  RAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHDEAGGREALDNAFAEIVGAESAIVRSQFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVI

Query:  GKRDSQGLGSLKDFGVEYREVPLADDGGLDWEKLASALKPQTKCALIQRSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVETTEPPTVGA
        G R  +  GSLKDF + Y  V L  DG +D++ +A+A+ P+TK   IQRS GY+ R S  + EI + IR +K  N + +V VDNCYGEFVE  EP  VGA
Subjt:  GKRDSQGLGSLKDFGVEYREVPLADDGGLDWEKLASALKPQTKCALIQRSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVETTEPPTVGA

Query:  DLIAGSLIKNPGGTLAPCGGYVAGRDKWVKAAAARLSAPGLGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVKGMILIAEVMASKGYKVQPLPRALRHDIV
        DL+AGSLIKNPGG LA  GGY+ G+ KW++A + R+++PG+G ++G++    ++  +QG FL+P +V +++KG +  A  +   G+   P   A R D++
Subjt:  DLIAGSLIKNPGGTLAPCGGYVAGRDKWVKAAAARLSAPGLGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVKGMILIAEVMASKGYKVQPLPRALRHDIV

Query:  QAVQLGSREVLLAFCEAVQRSSPVASFTKPVPGITPGYASEVIFADGTFIDGSTSELSCDGPLREPFAVFCQ
        Q+V+   RE ++AFC+A+Q +SP+ +   P P   PGY  +VI A GTFI G++ ELS DGP+R P+  + Q
Subjt:  QAVQLGSREVLLAFCEAVQRSSPVASFTKPVPGITPGYASEVIFADGTFIDGSTSELSCDGPLREPFAVFCQ

Q17QL9 Transmembrane protein 184C9.0e-2529.3Show/hide
Query:  VVLTAIFSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREI
        ++LT   SLW++ QHL ++ +P  QK I+ I+ M P+Y+  S+I L       +  +++++ +ECYEA VI  F+  L +YL       ++  E K ++ 
Subjt:  VVLTAIFSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREI

Query:  HHTFPMTLFQWLCEYPSMVNVNTDIRTCTLSEYFVGQLRYFLEYVSL--------------------------------ALYSLVVFYHVFDKELKPHSP
        H       F  LC  P        +  C L       +R F   ++L                                A+Y L++FY V  +EL P  P
Subjt:  HHTFPMTLFQWLCEYPSMVNVNTDIRTCTLSEYFVGQLRYFLEYVSL--------------------------------ALYSLVVFYHVFDKELKPHSP

Query:  LAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHA--WFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPY
        + KFLC+K +VF  FWQ +V+ +L  VG+I  +H   W  VE +   LQ+ ++C+EM   A+     +S  PY
Subjt:  LAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHA--WFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPY

Q5RET6 Transmembrane protein 184C2.0e-2429.67Show/hide
Query:  VVLTAIFSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREI
        ++LT   SLW++ QHL ++ +P  QK I+ I+ M P+Y+  S+I L          +++++ +ECYEA VI  F+  L +YL       ++  E K ++ 
Subjt:  VVLTAIFSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREI

Query:  HHTFPMTLFQWLCEYPSMVNVNTDIRTCTLSEYFVGQLRYFLEYVSL--------------------------------ALYSLVVFYHVFDKELKPHSP
        H       F  LC  P        +  C L       +R F   V+L                                A+Y L++FY V  +EL P  P
Subjt:  HHTFPMTLFQWLCEYPSMVNVNTDIRTCTLSEYFVGQLRYFLEYVSL--------------------------------ALYSLVVFYHVFDKELKPHSP

Query:  LAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHA--WFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPY
        + KFLC+K +VF  FWQ +V+ +L  VG+I  +H   W  VE +   LQ+ ++C+EM   A+     +S  PY
Subjt:  LAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHA--WFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPY

Q9NVA4 Transmembrane protein 184C2.0e-2429.67Show/hide
Query:  VVLTAIFSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREI
        ++LT   SLW++ QHL ++ +P  QK I+ I+ M P+Y+  S+I L          +++++ +ECYEA VI  F+  L +YL       ++  E K ++ 
Subjt:  VVLTAIFSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREI

Query:  HHTFPMTLFQWLCEYPSMVNVNTDIRTCTLSEYFVGQLRYFLEYVSL--------------------------------ALYSLVVFYHVFDKELKPHSP
        H       F  LC  P        +  C L       +R F   V+L                                A+Y L++FY V  +EL P  P
Subjt:  HHTFPMTLFQWLCEYPSMVNVNTDIRTCTLSEYFVGQLRYFLEYVSL--------------------------------ALYSLVVFYHVFDKELKPHSP

Query:  LAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHA--WFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPY
        + KFLC+K +VF  FWQ +V+ +L  VG+I  +H   W  VE +   LQ+ ++C+EM   A+     +S  PY
Subjt:  LAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHA--WFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPY

Arabidopsis top hitse value%identityAlignment
AT1G11200.1 Protein of unknown function (DUF300)2.7e-7248.15Show/hide
Query:  ISAVTMDYGQMIFLGVTSSVVLTAIFSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSY
        I   T+   ++  +G    V+L+  F++ L++QHL  WKKP EQ+AI+II+LMAP+YA  S++GLL+   S  FF+FL+++KECYEALVI+KFL+L+YSY
Subjt:  ISAVTMDYGQMIFLGVTSSVVLTAIFSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSY

Query:  LNISISKNIVPDEIKGREIHHTFPMTLF----------------QWLCEYPSMVNVNTDIRTCTLSEYFVGQLRYFLEY---------VSLALYSLVVFY
        +NIS+S  I+PDE KGREIHH+FPMTLF                QW  ++  ++     I   TL    +G    +L +         VSLALYSLV FY
Subjt:  LNISISKNIVPDEIKGREIHHTFPMTLF----------------QWLCEYPSMVNVNTDIRTCTLSEYFVGQLRYFLEY---------VSLALYSLVVFY

Query:  HVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPYKSKSAAKPKLEKKE
        HVF KEL+PH PL KF+C+KGIVFFCFWQGIVL++L  +G+IK+ H W +V+ + EALQN LVC+EM+ F++IQ  A+  +PY  ++ AK +  K++
Subjt:  HVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPYKSKSAAKPKLEKKE

AT1G77220.1 Protein of unknown function (DUF300)1.6e-2127.52Show/hide
Query:  MDYGQMI---FLGVTSSVVLTAIFSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLN
        +D GQ +    L  +  VV+  +  ++L+ +HL+++ +P EQK ++ +ILM P+YA  S++ L+   A+       E I++CYEA  +  F   L + L+
Subjt:  MDYGQMI---FLGVTSSVVLTAIFSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLN

Query:  --------------ISISKNIVPDEIKGREIHHTFPMTLF--QWLCEYPSMVNVNTDIRTCTLSEYFVGQLRYFLE-----------------YVSL---
                      I+ S  ++        + H FPM  F   W         V   I    + +     L   LE                 Y+++   
Subjt:  --------------ISISKNIVPDEIKGREIHHTFPMTLF--QWLCEYPSMVNVNTDIRTCTLSEYFVGQLRYFLE-----------------YVSL---

Query:  -----ALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPYK
             ALY LV FY+V   +L P  PLAKFL  K IVF  +WQGI++  L ++G++K   A    + +   +Q+ ++C+EM   A++ +  + A+PYK
Subjt:  -----ALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPYK

AT4G21570.1 Protein of unknown function (DUF300)6.5e-7956.1Show/hide
Query:  QMIFLGVTSSVVLTAIFSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNI
        Q+ F     SV+LT  F++ L++QHL +WK P EQKAI+II+LMAP+YA +S+IGLLE   S TFFLFLESIKECYEALVI+KFL+L+YSYLNIS+SKNI
Subjt:  QMIFLGVTSSVVLTAIFSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNI

Query:  VPDEIKGREIHHTFPMTLFQ----------------WLCEYPSMVNVNTDIRTCTLSEYFVGQLRYFLEY---------VSLALYSLVVFYHVFDKELKP
        +PD IKGREIHH+FPMTLFQ                W  ++   V +     T  ++   +G    +L +         VSLALYSLV+FYHVF KEL P
Subjt:  VPDEIKGREIHHTFPMTLFQ----------------WLCEYPSMVNVNTDIRTCTLSEYFVGQLRYFLEY---------VSLALYSLVVFYHVFDKELKP

Query:  HSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPYKSKSAAKPKLEKK
        H+PLAKFLCIKGIVFF FWQGI L++L A+G IK+ H W +VE I EA+QN LVC+EMV FA +Q  AY A PY  ++  K KL+KK
Subjt:  HSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPYKSKSAAKPKLEKK

AT5G26740.1 Protein of unknown function (DUF300)3.4e-1929.54Show/hide
Query:  TSSVVLTAIFSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKG
        T   +  AIF ++   +HL N+ +P  Q+ IV II M P+YA +S++ L+   +S    ++ +SI+E YEA VI  FLSL  +++        V   + G
Subjt:  TSSVVLTAIFSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKG

Query:  REIHHTFPMTLFQWLCEYPSMVNVNTDIRTC-----------------TLSEYFVGQLR----------------YFLEYVSLALYSLVVFYHVFDKELK
        R +  ++ +      C +P +      IR C                 TL  Y  G+ +                Y + Y ++ALY+LV+FY      L+
Subjt:  REIHHTFPMTLFQWLCEYPSMVNVNTDIRTC-----------------TLSEYFVGQLR----------------YFLEYVSLALYSLVVFYHVFDKELK

Query:  PHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPYKSKSAA
        P +P+ KF+ IK +VF  +WQG+++ + A  G IK+  A     H     QN ++CVEM+  A     A+   PYK  + A
Subjt:  PHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPYKSKSAA

AT5G26740.2 Protein of unknown function (DUF300)3.4e-1929.54Show/hide
Query:  TSSVVLTAIFSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKG
        T   +  AIF ++   +HL N+ +P  Q+ IV II M P+YA +S++ L+   +S    ++ +SI+E YEA VI  FLSL  +++        V   + G
Subjt:  TSSVVLTAIFSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKG

Query:  REIHHTFPMTLFQWLCEYPSMVNVNTDIRTC-----------------TLSEYFVGQLR----------------YFLEYVSLALYSLVVFYHVFDKELK
        R +  ++ +      C +P +      IR C                 TL  Y  G+ +                Y + Y ++ALY+LV+FY      L+
Subjt:  REIHHTFPMTLFQWLCEYPSMVNVNTDIRTC-----------------TLSEYFVGQLR----------------YFLEYVSLALYSLVVFYHVFDKELK

Query:  PHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPYKSKSAA
        P +P+ KF+ IK +VF  +WQG+++ + A  G IK+  A     H     QN ++CVEM+  A     A+   PYK  + A
Subjt:  PHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPYKSKSAA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGGGGCTTATCCTGCTCTTTGCTTCCTTATCCGTCGCCTAATTTCAGAGCGTCCGTTCCGGCAGCGGCAGCAACCCTTCGCACCGACACTTCTTTACCGGTGCCCCT
CGATCGGAAACATTACACTTCCGACTCCCCATTTGCTCCGGAGGTTATTAAGGCGGTAGACTCCTTGCAGTATGAATTCAGGGCAGTGGATAATTTGGTGGCACGTAATT
CTGCTAAAGTTCTCAAAGCCTTTCAGAATGCTCGGTTAGGATCTCATCATTTTGGAGGATCCACTGGTTATGGTCATGATGAAGCTGGAGGACGTGAGGCACTTGACAAC
GCTTTTGCTGAGATAGTTGGAGCAGAATCTGCAATAGTCCGATCACAGTTTTTCTCAGGTACTCATGCTATTACGTGTGCTTTATTTGCACTTTTGAGGCCAGGGGATGA
GCTTTTGGCAGTAGCTGGTGCTCCATATGACACACTAGAGGAGGTCATTGGGAAAAGAGATTCTCAGGGGCTGGGTTCCTTGAAAGATTTTGGAGTAGAGTATCGAGAAG
TTCCACTTGCTGATGACGGTGGACTCGACTGGGAAAAACTTGCAAGTGCTTTGAAACCTCAGACAAAATGTGCGCTCATACAAAGGTCATGTGGTTATTCTTGGCGGCGA
AGTTTAAGTGTTGACGAAATAGGAAAAGCAATAAGACTGATCAAGATGCAAAACCCTGATTGCTTGGTGATGGTGGATAACTGTTATGGAGAATTTGTGGAAACCACTGA
ACCTCCAACTGTGGGCGCGGACTTAATTGCAGGAAGTTTGATAAAAAATCCTGGTGGAACGCTTGCACCTTGTGGCGGATATGTTGCAGGTCGAGACAAATGGGTGAAAG
CGGCTGCAGCTCGTTTGTCTGCACCCGGCTTGGGGGTGGATTCGGGCTCTACCCCTGGTGATATCATGAGGACATTTTTTCAAGGATTATTCCTTTCACCTCAAATGGTT
GGTGAGGCAGTTAAGGGAATGATCCTAATAGCTGAAGTCATGGCATCAAAAGGCTACAAAGTGCAGCCACTTCCACGTGCACTCCGCCACGACATCGTACAGGCTGTACA
ACTTGGAAGTCGTGAAGTTTTGCTTGCATTCTGCGAGGCTGTACAGAGAAGCTCTCCTGTCGCTTCGTTTACTAAACCGGTTCCGGGAATAACTCCTGGATATGCATCAG
AGGTGATCTTTGCTGATGGAACTTTTATTGATGGGAGCACAAGTGAACTTTCTTGTGATGGACCTCTAAGAGAGCCATTTGCAGTCTTTTGCCAGGTTTGTGCCATTTTT
ATATCTCATTTCTTGGTAACCGCAGTGGTTAACATATCAGCAGTCACAATGGATTATGGACAGATGATTTTTCTTGGAGTTACTTCCTCTGTTGTTCTCACTGCAATATT
TTCATTATGGCTCCTTACCCAACATCTGTCTAACTGGAAAAAACCAGCGGAACAAAAGGCCATTGTTATTATAATTCTTATGGCTCCTTTATATGCTGGTATCTCCTATA
TTGGTCTGTTGGAATTTATGGCAAGCAGTACTTTCTTTTTGTTTTTGGAATCAATTAAGGAATGTTATGAGGCTTTGGTGATATCTAAGTTCTTGAGTTTACTCTACAGC
TACTTAAATATATCCATAAGCAAAAACATTGTGCCAGATGAGATCAAAGGTAGAGAAATTCACCATACTTTTCCGATGACCCTCTTTCAGTGGCTCTGTGAATATCCATC
TATGGTCAACGTTAACACTGACATTCGTACTTGTACATTATCTGAGTATTTTGTTGGTCAACTTCGCTATTTTTTGGAATATGTGTCGCTTGCTCTGTATTCCCTGGTGG
TTTTCTATCATGTATTTGATAAGGAGTTGAAACCACATAGCCCTCTTGCGAAGTTCTTGTGCATCAAAGGGATTGTCTTCTTCTGCTTCTGGCAGGGAATTGTTCTTGAG
ATGCTTGCTGCAGTGGGCATAATCAAAGCAGAACACGCTTGGTTTGATGTTGAGCACATAAATGAAGCCTTACAAAACACTCTAGTTTGTGTGGAGATGGTTTTCTTTGC
AATGATTCAGATGTCTGCATACAGTGCTAGCCCTTACAAATCTAAATCTGCAGCAAAACCTAAACTGGAGAAGAAGGAACACATCGATTATGTGAGAAATATCGAGTGTT
ATACTTGGAAACTCTATTTTGGCTATTATTACCACATCTCAGGCGATATTGCTGTGACATATCCTTCGTCGAAGTCGAGCTCTCTGGTATACAAACTTCCTTCACTTCTA
GCCTCTATTCTCTTCTTTCTGTGTCTATTCGAATCAGAATAG
mRNA sequenceShow/hide mRNA sequence
ATGTGGGGCTTATCCTGCTCTTTGCTTCCTTATCCGTCGCCTAATTTCAGAGCGTCCGTTCCGGCAGCGGCAGCAACCCTTCGCACCGACACTTCTTTACCGGTGCCCCT
CGATCGGAAACATTACACTTCCGACTCCCCATTTGCTCCGGAGGTTATTAAGGCGGTAGACTCCTTGCAGTATGAATTCAGGGCAGTGGATAATTTGGTGGCACGTAATT
CTGCTAAAGTTCTCAAAGCCTTTCAGAATGCTCGGTTAGGATCTCATCATTTTGGAGGATCCACTGGTTATGGTCATGATGAAGCTGGAGGACGTGAGGCACTTGACAAC
GCTTTTGCTGAGATAGTTGGAGCAGAATCTGCAATAGTCCGATCACAGTTTTTCTCAGGTACTCATGCTATTACGTGTGCTTTATTTGCACTTTTGAGGCCAGGGGATGA
GCTTTTGGCAGTAGCTGGTGCTCCATATGACACACTAGAGGAGGTCATTGGGAAAAGAGATTCTCAGGGGCTGGGTTCCTTGAAAGATTTTGGAGTAGAGTATCGAGAAG
TTCCACTTGCTGATGACGGTGGACTCGACTGGGAAAAACTTGCAAGTGCTTTGAAACCTCAGACAAAATGTGCGCTCATACAAAGGTCATGTGGTTATTCTTGGCGGCGA
AGTTTAAGTGTTGACGAAATAGGAAAAGCAATAAGACTGATCAAGATGCAAAACCCTGATTGCTTGGTGATGGTGGATAACTGTTATGGAGAATTTGTGGAAACCACTGA
ACCTCCAACTGTGGGCGCGGACTTAATTGCAGGAAGTTTGATAAAAAATCCTGGTGGAACGCTTGCACCTTGTGGCGGATATGTTGCAGGTCGAGACAAATGGGTGAAAG
CGGCTGCAGCTCGTTTGTCTGCACCCGGCTTGGGGGTGGATTCGGGCTCTACCCCTGGTGATATCATGAGGACATTTTTTCAAGGATTATTCCTTTCACCTCAAATGGTT
GGTGAGGCAGTTAAGGGAATGATCCTAATAGCTGAAGTCATGGCATCAAAAGGCTACAAAGTGCAGCCACTTCCACGTGCACTCCGCCACGACATCGTACAGGCTGTACA
ACTTGGAAGTCGTGAAGTTTTGCTTGCATTCTGCGAGGCTGTACAGAGAAGCTCTCCTGTCGCTTCGTTTACTAAACCGGTTCCGGGAATAACTCCTGGATATGCATCAG
AGGTGATCTTTGCTGATGGAACTTTTATTGATGGGAGCACAAGTGAACTTTCTTGTGATGGACCTCTAAGAGAGCCATTTGCAGTCTTTTGCCAGGTTTGTGCCATTTTT
ATATCTCATTTCTTGGTAACCGCAGTGGTTAACATATCAGCAGTCACAATGGATTATGGACAGATGATTTTTCTTGGAGTTACTTCCTCTGTTGTTCTCACTGCAATATT
TTCATTATGGCTCCTTACCCAACATCTGTCTAACTGGAAAAAACCAGCGGAACAAAAGGCCATTGTTATTATAATTCTTATGGCTCCTTTATATGCTGGTATCTCCTATA
TTGGTCTGTTGGAATTTATGGCAAGCAGTACTTTCTTTTTGTTTTTGGAATCAATTAAGGAATGTTATGAGGCTTTGGTGATATCTAAGTTCTTGAGTTTACTCTACAGC
TACTTAAATATATCCATAAGCAAAAACATTGTGCCAGATGAGATCAAAGGTAGAGAAATTCACCATACTTTTCCGATGACCCTCTTTCAGTGGCTCTGTGAATATCCATC
TATGGTCAACGTTAACACTGACATTCGTACTTGTACATTATCTGAGTATTTTGTTGGTCAACTTCGCTATTTTTTGGAATATGTGTCGCTTGCTCTGTATTCCCTGGTGG
TTTTCTATCATGTATTTGATAAGGAGTTGAAACCACATAGCCCTCTTGCGAAGTTCTTGTGCATCAAAGGGATTGTCTTCTTCTGCTTCTGGCAGGGAATTGTTCTTGAG
ATGCTTGCTGCAGTGGGCATAATCAAAGCAGAACACGCTTGGTTTGATGTTGAGCACATAAATGAAGCCTTACAAAACACTCTAGTTTGTGTGGAGATGGTTTTCTTTGC
AATGATTCAGATGTCTGCATACAGTGCTAGCCCTTACAAATCTAAATCTGCAGCAAAACCTAAACTGGAGAAGAAGGAACACATCGATTATGTGAGAAATATCGAGTGTT
ATACTTGGAAACTCTATTTTGGCTATTATTACCACATCTCAGGCGATATTGCTGTGACATATCCTTCGTCGAAGTCGAGCTCTCTGGTATACAAACTTCCTTCACTTCTA
GCCTCTATTCTCTTCTTTCTGTGTCTATTCGAATCAGAATAG
Protein sequenceShow/hide protein sequence
MWGLSCSLLPYPSPNFRASVPAAAATLRTDTSLPVPLDRKHYTSDSPFAPEVIKAVDSLQYEFRAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHDEAGGREALDN
AFAEIVGAESAIVRSQFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGVEYREVPLADDGGLDWEKLASALKPQTKCALIQRSCGYSWRR
SLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVETTEPPTVGADLIAGSLIKNPGGTLAPCGGYVAGRDKWVKAAAARLSAPGLGVDSGSTPGDIMRTFFQGLFLSPQMV
GEAVKGMILIAEVMASKGYKVQPLPRALRHDIVQAVQLGSREVLLAFCEAVQRSSPVASFTKPVPGITPGYASEVIFADGTFIDGSTSELSCDGPLREPFAVFCQVCAIF
ISHFLVTAVVNISAVTMDYGQMIFLGVTSSVVLTAIFSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYS
YLNISISKNIVPDEIKGREIHHTFPMTLFQWLCEYPSMVNVNTDIRTCTLSEYFVGQLRYFLEYVSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLE
MLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPYKSKSAAKPKLEKKEHIDYVRNIECYTWKLYFGYYYHISGDIAVTYPSSKSSSLVYKLPSLL
ASILFFLCLFESE