; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi04G012840 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi04G012840
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionProtein of unknown function (DUF300)
Genome locationchr04:20264860..20277753
RNA-Seq ExpressionLsi04G012840
SyntenyLsi04G012840
Gene Ontology termsGO:0016020 - membrane (cellular component)
GO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR005178 - Organic solute transporter subunit alpha/Transmembrane protein 184
IPR009651 - Putative methionine gamma-lyase
IPR015421 - Pyridoxal phosphate-dependent transferase, major domain
IPR015424 - Pyridoxal phosphate-dependent transferase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAB4107049.1 unnamed protein product [Lactuca saligna]3.0e-26164.02Show/hide
Query:  PSPNFRASVPAAAATLRTDTSLPVPLDRKHYTSDSPFAPEVIKAVDSLQYEFRAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHDEAGGREALDNA
        P+ NFR    ++       +S  + +   H  S   F PEV  AVD+L  EFRAVDNLVA+NS++VLKAFQNAR+GSHHF G TGYGH+EAGGREALD A
Subjt:  PSPNFRASVPAAAATLRTDTSLPVPLDRKHYTSDSPFAPEVIKAVDSLQYEFRAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHDEAGGREALDNA

Query:  FAEIVGAESAIVRSQFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGVEYREVPLADDGGLDWEKLASALKPQTKCALIQ
        FAEI GAESAIVRSQFFSGTHAITCALFA LRPGDELLAVAGAPYDTLEEVIG RD  GLGSLKDFG+ YREV LADDGGLDW+ L  ALKP+TKCALIQ
Subjt:  FAEIVGAESAIVRSQFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGVEYREVPLADDGGLDWEKLASALKPQTKCALIQ

Query:  RSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVETTEPPTV-----VRSLIKNPGGTLAPCGGYVAGRDKWVKAAAARLSAPGLGVDSGST
        RSCGYSWR+SLSV+EI +AI +IK QNP+CLVMVDNCYGEF ET EPP V       SLIKNPGGT+APCGGYVAG++KWVKAAAARLSAPGLGVD GST
Subjt:  RSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVETTEPPTV-----VRSLIKNPGGTLAPCGGYVAGRDKWVKAAAARLSAPGLGVDSGST

Query:  PGDIMRTFFQGLFLSPQMVGEAVK-----------------------------AVQLGSREVLLAFCEAVQRSSPVASFTKPVPGITPGYASEVIFADGT
        PGDIMR FFQGL+LSPQMVGE++K                             AVQLGSRE LLAFCEAVQRSSPV+S+TKP+ G+T GYASEVIFADGT
Subjt:  PGDIMRTFFQGLFLSPQMVGEAVK-----------------------------AVQLGSREVLLAFCEAVQRSSPVASFTKPVPGITPGYASEVIFADGT

Query:  FIDGSTSELSCDGPLREPFAVFCQGGTHWTQWGLVLGEIAADVVFLNDLCLLSEVISHFLVTAVVNISAVTMDYGQMIFLGVTSSVVLTAIFSLWLLTQH
        FIDGSTSELSCDGPLREPF VFCQGGTHWTQWGLVL       VF     LL+ +      T  + +  V M   Q   +G  + V +T + +L L+  H
Subjt:  FIDGSTSELSCDGPLREPFAVFCQGGTHWTQWGLVLGEIAADVVFLNDLCLLSEVISHFLVTAVVNISAVTMDYGQMIFLGVTSSVVLTAIFSLWLLTQH

Query:  LSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREIHHTFPMTLFQLWVLS
        LS+WKKP EQKAI++IILMAP+YA  SY+GLL+   S TFF+ L+SIKECYEALV++KFL+LLY+YLNISISKNIVPDEIKGREIHH+FPMTLFQ   + 
Subjt:  LSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREIHHTFPMTLFQLWVLS

Query:  LQLSKTKPVTVFIWVLLDDQRVTLNVDSLSTAASQCSIESSHVEASQELDLSIRGDSSLSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQG
        L     K +  + W  +  + V      L        I    +  +  + L++    S+SLALY+LV+FYHVF KEL PH PLAKFLC+KGIVFFCFWQG
Subjt:  LQLSKTKPVTVFIWVLLDDQRVTLNVDSLSTAASQCSIESSHVEASQELDLSIRGDSSLSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQG

Query:  IVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPYKSKSAAKPKLEKKE
        IVL  L A+GIIK+ H W DV HI +ALQN LV VEMVFFAM QM AY+A+PYK   AA  K +KKE
Subjt:  IVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPYKSKSAAKPKLEKKE

KAA8523841.1 hypothetical protein F0562_010264 [Nyssa sinensis]7.7e-27364.44Show/hide
Query:  LSCSLLPYPSPNFRASVPAAAATLRTDTSLPVPLDRKHYTSDSPFAPEVIKAVDSLQYEFRAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHDEAG
        LSC+   YP+   RAS   A   +R+   + VP  R H+  D+PFAPEV KAVDSL  EFRAVDNLVARN+++VL+A+QNAR+G HHFGG TGYGH+EAG
Subjt:  LSCSLLPYPSPNFRASVPAAAATLRTDTSLPVPLDRKHYTSDSPFAPEVIKAVDSLQYEFRAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHDEAG

Query:  GREALDNAFAEIVGAESAIVRSQFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGVEYREVPLADDGGLDWEKLASALKP
        GREALD  FAEI GAESAIVRSQFFSGTHAITCALFA LRPGDELLAVAGAPYDTLEEVIG RDS GLGSLKDFGV+YREVPLA+DGGLDW+ L  ALKP
Subjt:  GREALDNAFAEIVGAESAIVRSQFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGVEYREVPLADDGGLDWEKLASALKP

Query:  QTKCALIQRSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVETTEPPTV-----VRSLIKNPGGTLAPCGGYVAGRDKWVKAAAARLSAPG
        QTKCALIQRSCGYSWRRSLSV EIG+AI+++K QNPDCLVMVDNCYGEFVE  EPP V       SLIKNPGGT+APCGGYVAG++KWVKAAAARLSAPG
Subjt:  QTKCALIQRSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVETTEPPTV-----VRSLIKNPGGTLAPCGGYVAGRDKWVKAAAARLSAPG

Query:  LGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVK-----------------------------AVQLGSREVLLAFCEAVQRSSPVASFTKPVPGITPGYAS
        LG+D GSTPGDIMRTFFQGLFLSPQMVGEA+K                             AVQLG+RE LLAFCEAVQRSSPV SFTKPV G TPGYAS
Subjt:  LGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVK-----------------------------AVQLGSREVLLAFCEAVQRSSPVASFTKPVPGITPGYAS

Query:  EVIFADGTFIDGSTSELSCDGPLREPFAVFCQGGTHWTQWGLVLGEI-----AADVVFLNDLCLL------------SEVISHFL---------------
        EVIFADGTFIDGSTSELSCDGPLREPF+VFCQGGTHWTQWGLVLGE+       D+ F    C              S +IS FL               
Subjt:  EVIFADGTFIDGSTSELSCDGPLREPFAVFCQGGTHWTQWGLVLGEI-----AADVVFLNDLCLL------------SEVISHFL---------------

Query:  VTAVVNISAVTMDYGQMIFLGVTSSVVLTAIFSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFL
        +T    IS   M+ GQ+  +G T  V+LT  F++ LL+QH   WKKP EQKAI+IIILMAP+YA  S++GLL+F  S  FF FL+S+KECYEALV++KFL
Subjt:  VTAVVNISAVTMDYGQMIFLGVTSSVVLTAIFSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFL

Query:  SLLYSYLNISISKNIVPDEIKGREIHHTFPMTLFQLWVLSLQLSKTKPVTVFIWVLLDDQRVTLNVDSLSTAASQCSIESSHVEASQELDLSIRGDSSLS
        +L+Y+YLNISISKNIVPDEIKGREIHH+FPMTLFQ   + L     K +  + W  +  + V      L  A     I  S V  +  + L+I    S+S
Subjt:  SLLYSYLNISISKNIVPDEIKGREIHHTFPMTLFQLWVLSLQLSKTKPVTVFIWVLLDDQRVTLNVDSLSTAASQCSIESSHVEASQELDLSIRGDSSLS

Query:  LALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPYKSKSAAK
        LALYSLVVFYHVF KEL+PH PLAKFLCIKGIVFFCFWQG+VLE+LAA+G+I++ H W DVE I EALQN LVCVEMVFF+  Q  AYSA+PY   +   
Subjt:  LALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPYKSKSAAK

Query:  PKLEKKE
         K +K E
Subjt:  PKLEKKE

KAG6591988.1 hypothetical protein SDJN03_14334, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0076.14Show/hide
Query:  MWGLSCSLLPYPSPNFRASVPAAAATLRTDTSLPVPLDRKHYTSDSPFAPEVIKAVDSLQYEFRAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHD
        MWGLSCS+  YPSPNFR S P  AATLR+ TSLPV LDRKHYTSD+PFAPEV+KAVDSLQYEFRAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHD
Subjt:  MWGLSCSLLPYPSPNFRASVPAAAATLRTDTSLPVPLDRKHYTSDSPFAPEVIKAVDSLQYEFRAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHD

Query:  EAGGREALDNAFAEIVGAESAIVRSQFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGVEYREVPLADDGGLDWEKLASA
        EAGGREALDNAFAEIVGAESAIVRSQFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGVEYREVPLA+DGGLDWEKLAS+
Subjt:  EAGGREALDNAFAEIVGAESAIVRSQFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGVEYREVPLADDGGLDWEKLASA

Query:  LKPQTKCALIQRSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVETTEPPTV-----VRSLIKNPGGTLAPCGGYVAGRDKWVKAAAARLS
        LKPQTKCALIQRSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVET EPPTV       SLIKNPGGTLAPCGGYVAGRDKWVKAAAARLS
Subjt:  LKPQTKCALIQRSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVETTEPPTV-----VRSLIKNPGGTLAPCGGYVAGRDKWVKAAAARLS

Query:  APGLGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVK-----------------------------AVQLGSREVLLAFCEAVQRSSPVASFTKPVPGITPG
        APGLGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVK                             AVQLGSRE+LLAFCEAVQRSSPVAS+TKPVPGITPG
Subjt:  APGLGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVK-----------------------------AVQLGSREVLLAFCEAVQRSSPVASFTKPVPGITPG

Query:  YASEVIFADGTFIDGSTSELSCDGPLREPFAVFCQGGTHWTQWGLVLGEIAADVVFLNDLCLLSEVISHFLVTAVVNISAVTMDYGQMIFLGVTSSVVLT
        YASEVIFADGTFIDGSTSELSCDGPLREPFAVFCQ           + +   D VF                + V N S V++ Y           ++  
Subjt:  YASEVIFADGTFIDGSTSELSCDGPLREPFAVFCQGGTHWTQWGLVLGEIAADVVFLNDLCLLSEVISHFLVTAVVNISAVTMDYGQMIFLGVTSSVVLT

Query:  AIFSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREIHHTF
         +FSLWLL+QHLSNW+KPAEQKAIV+IILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREIHH+F
Subjt:  AIFSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREIHHTF

Query:  PMTLFQ-------------LWVLSLQLSKTKPVTVFIWVLLDDQRVTLNVDSLSTAASQCSIESSHVEASQELDLSIRGDSSLSLALYSLVVFYHVFDKE
        PMTLFQ             L + + Q    +PV   + + L  Q + +  D LS                     +I  + S+SLALYSLVVFYHVFDKE
Subjt:  PMTLFQ-------------LWVLSLQLSKTKPVTVFIWVLLDDQRVTLNVDSLSTAASQCSIESSHVEASQELDLSIRGDSSLSLALYSLVVFYHVFDKE

Query:  LKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPYKSKSAAKPKLEKKEQ
        LKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEA+QNTLVCVEMVFFAM+QMSAYSASPY+ +SAAK K EKKEQ
Subjt:  LKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPYKSKSAAKPKLEKKEQ

KAG7024863.1 ynbB [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0081.84Show/hide
Query:  MWGLSCSLLPYPSPNFRASVPAAAATLRTDTSLPVPLDRKHYTSDSPFAPEVIKAVDSLQYEFRAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHD
        MWGLSCS+  YPSPNFR S P  AATLR+ TSLPV LDRKHYTSD+PFAPEV+KAVDSLQYEFRAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHD
Subjt:  MWGLSCSLLPYPSPNFRASVPAAAATLRTDTSLPVPLDRKHYTSDSPFAPEVIKAVDSLQYEFRAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHD

Query:  EAGGREALDNAFAEIVGAESAIVRSQFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGVEYREVPLADDGGLDWEKLASA
        EAGGREALDNAFAEIVGAESAIVRSQFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGVEYREVPLA+DGGLDWEKLAS+
Subjt:  EAGGREALDNAFAEIVGAESAIVRSQFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGVEYREVPLADDGGLDWEKLASA

Query:  LKPQTKCALIQRSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVETTEPPTV-----VRSLIKNPGGTLAPCGGYVAGRDKWVKAAAARLS
        LKPQTKCALIQRSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVET EPPTV       SLIKNPGGTLAPCGGYVAGRDKWVKAAAARLS
Subjt:  LKPQTKCALIQRSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVETTEPPTV-----VRSLIKNPGGTLAPCGGYVAGRDKWVKAAAARLS

Query:  APGLGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVKAVQLGSREVLLAFCEAVQRSSPVASFTKPVPGITPGYASEVIFADGTFIDGSTSELSCDGPLREP
        APGLGVDSGSTPGDIMRTFFQGLFLSPQM      AVQLGSRE+LLAFCEAVQRSSPVAS+TKPVPGITPGYASEVIFADGTFIDGSTSELSCDGPLREP
Subjt:  APGLGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVKAVQLGSREVLLAFCEAVQRSSPVASFTKPVPGITPGYASEVIFADGTFIDGSTSELSCDGPLREP

Query:  FAVFCQGGTHWTQWGLVLGEIA---ADVVFLNDLCLLSEVISHFLVTAVVNISAVTMDYGQMIFLGVTSSVVLTAIFSLWLLTQHLSNWKKPAEQKAIVI
        FAVFCQGGTHWTQWGLVLGE++    D VF           S    +++  ISA+TMDYG MIFL VTSSVVLT++FSLWLL+QHLSNW+KPAEQKAIV+
Subjt:  FAVFCQGGTHWTQWGLVLGEIA---ADVVFLNDLCLLSEVISHFLVTAVVNISAVTMDYGQMIFLGVTSSVVLTAIFSLWLLTQHLSNWKKPAEQKAIVI

Query:  IILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREIHHTFPMTLFQ-------------LWVLSLQ
        IILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREIHH+FPMTLFQ             L + + Q
Subjt:  IILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREIHHTFPMTLFQ-------------LWVLSLQ

Query:  LSKTKPVTVFIWVLLDDQRVTLNVDSLSTAASQCSIESSHVEASQELDLSIRGDSSLSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIV
            +PV   + + L  Q + +  D LS                     +I  + S+SLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIV
Subjt:  LSKTKPVTVFIWVLLDDQRVTLNVDSLSTAASQCSIESSHVEASQELDLSIRGDSSLSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIV

Query:  LEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPYKSKSAAKPK
        LEMLAAVGIIKAEHAWFDVEHINEA+QNTLVCVEMVFFAM+QMSAYSASPY+ +SAAK K
Subjt:  LEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPYKSKSAAKPK

RXI03688.1 hypothetical protein DVH24_004340 [Malus domestica]9.4e-26362.81Show/hide
Query:  MWGLSCSLLPYPSPNFRASVPAAAATLRTDTSLPVPLDRKHYTSDSPFAPEVIKAVDSLQYEFRAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHD
        MW LSC    YP+   RASVP   AT R+ + L VP +   +  DSPF PEV  AVDSL  EFRAVDNLVARN+ +VLKAFQNAR+GSHHF G TGYGHD
Subjt:  MWGLSCSLLPYPSPNFRASVPAAAATLRTDTSLPVPLDRKHYTSDSPFAPEVIKAVDSLQYEFRAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHD

Query:  EAGGREALDNAFAEIVGAESAIVRS----------------------QFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFG
        EAGGREALD AFAEIVGAESAIVRS                      QFFSGTHAITCALFA LRPGDELLAVAG PYDTLEEVIGKRDS G+GSL DFG
Subjt:  EAGGREALDNAFAEIVGAESAIVRS----------------------QFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFG

Query:  VEYREVPLADDGGLDWEKLASALKPQTKCALIQRSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVETTEPPTV-----VRSLIKNPGGTL
        V+YREVPLA+DGGL+W+ L  AL+P+TKCALIQRSCGYSWRRSLSVDEIG+AI++IK QNP+CLVMVDNCYGEFVE+ EPP V       SLIKNPGGT+
Subjt:  VEYREVPLADDGGLDWEKLASALKPQTKCALIQRSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVETTEPPTV-----VRSLIKNPGGTL

Query:  APCGGYVAGRDKWVKAAAARLSAPGLGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVK-----------------------------AVQLGSREVLLAFC
        APCGGYVAGR+KWVKAA+ARLSAPGLGVD G+TPGDIMR+FFQGLFLSPQMVGEA+K                             AVQLGSRE LLAFC
Subjt:  APCGGYVAGRDKWVKAAAARLSAPGLGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVK-----------------------------AVQLGSREVLLAFC

Query:  EAVQRSSPVASFTKPVPGITPGYASE--------------VIFADGTFIDGSTSELSCDGPLREPFAVFCQGGTHWTQWGLVLGEIAADVVFLNDLCLLS
        EAVQR+SPV SFTKPV G TPGYASE              VIFADGTFIDGSTSELSCDGPLREPFAVFCQGG+HWTQWGLVLGE               
Subjt:  EAVQRSSPVASFTKPVPGITPGYASE--------------VIFADGTFIDGSTSELSCDGPLREPFAVFCQGGTHWTQWGLVLGEIAADVVFLNDLCLLS

Query:  EVISHFLVTAVVNISAVTMDYGQMIFLGVTSSVVLTAIFSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEA
                           +  Q++ LG T  +++T  FSL LL++H   W KP EQKAIVIIILMAPLYA  S++GLL++  S   F  L+SIKECYEA
Subjt:  EVISHFLVTAVVNISAVTMDYGQMIFLGVTSSVVLTAIFSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEA

Query:  LVISKFLSLLYSYLNISISKNIVPDEIKGREIHHTFPMTLFQLWVLSLQLSKTKPVTVFIWVLLDDQRVTLNVDSLSTAASQCSIESSHVEASQELDLSI
        LVI+KFL+LLYSYLNISISKNIVPDEIKGREIHH+FPMTLF    + L     K +  + W  +  + V      L        +  S V  +  + L+I
Subjt:  LVISKFLSLLYSYLNISISKNIVPDEIKGREIHHTFPMTLFQLWVLSLQLSKTKPVTVFIWVLLDDQRVTLNVDSLSTAASQCSIESSHVEASQELDLSI

Query:  RGDSSLSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYS
            S+SLALYSL+ FYHVF KEL PH PL KFLCIKGIVFFCFWQGIVL++LAA+ II++ H W DVEHI EALQN LVCVEMVFF+++Q  AY+
Subjt:  RGDSSLSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYS

TrEMBL top hitse value%identityAlignment
A0A3Q7HH85 Uncharacterized protein8.9e-25963.01Show/hide
Query:  LSCSLLPYPSPNFRASVPAAAATLRTDTSLPVPLDRKHYTSDSPFAPEVIKAVDSLQYEFRAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHDEAG
        L C+   YP+   R  V +A A +R+ + + VP   + + SDSPF PEV KAVDSL  EFR VDNLVARN+A+VL+AFQ  ++GSHHFGGSTGYGH+EAG
Subjt:  LSCSLLPYPSPNFRASVPAAAATLRTDTSLPVPLDRKHYTSDSPFAPEVIKAVDSLQYEFRAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHDEAG

Query:  GREALDNAFAEIVGAESAIVRSQFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGVEYREVPLADDGGLDWEKLASALKP
        GREALD AFAEIVGAESAIVRSQFFSGTHAITCALFA LRPGDELLA+AGAPYDTLEEVIGKRDS G GSLKDFGVEYREVPLA+DGGLDW+ L ++++P
Subjt:  GREALDNAFAEIVGAESAIVRSQFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGVEYREVPLADDGGLDWEKLASALKP

Query:  QTKCALIQRSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVETTEPPTV-----VRSLIKNPGGTLAPCGGYVAGRDKWVKAAAARLSAPG
         TKCALIQRSCGYSWRRSLSV EIG+AI +IKMQNP C+VMVDNCYGEFV+  EPP V       SLIKNPGGT+APCGGYVAGR KWV+AAAARLSAPG
Subjt:  QTKCALIQRSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVETTEPPTV-----VRSLIKNPGGTLAPCGGYVAGRDKWVKAAAARLSAPG

Query:  LGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVK-----------------------------AVQLGSREVLLAFCEAVQRSSPVASFTKPVPGITPGYAS
        LGVD GSTPGDIMRT FQGLFLSPQMVGEA+K                             AVQLG+RE LL+FCEAVQRSSPV+SF +PV G T GYAS
Subjt:  LGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVK-----------------------------AVQLGSREVLLAFCEAVQRSSPVASFTKPVPGITPGYAS

Query:  EVIFADGTFIDGSTSELSCDGPLREPFAVFCQGGTHWTQWGLVLGEIAADVVFLNDLCLLSEVISHFLVTAVVNISAVTMDYGQMIFLGVTSSVVLTAIF
        EVIFADGTFIDGSTSELSCDGPLREPF+VFCQGGTHWTQWGLVLGEI     +L +L  + +     L T                         L+A  
Subjt:  EVIFADGTFIDGSTSELSCDGPLREPFAVFCQGGTHWTQWGLVLGEIAADVVFLNDLCLLSEVISHFLVTAVVNISAVTMDYGQMIFLGVTSSVVLTAIF

Query:  SLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREIHHTFPMT
         + L+T+H ++WKKP EQKAI+II+LMAPLYA +S+IGL++FM S  FF FLES+KECYEA+V++KFL L+Y+YLNISISKNIVPDEIKGR+IHH+FPMT
Subjt:  SLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREIHHTFPMT

Query:  LFQLWVLSLQLSKTKPVTVFIWVLLDDQRVTLNVDSLSTAASQCSIESSHVEASQELDLSIRGDSSLSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGI
        LFQ     L     K +  + W  +  + V      L        +  S V  +  + L+I    S+SLALYSLVVFYHVF KEL PH PLAKFLC+KGI
Subjt:  LFQLWVLSLQLSKTKPVTVFIWVLLDDQRVTLNVDSLSTAASQCSIESSHVEASQELDLSIRGDSSLSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGI

Query:  VFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPYKSKS
        VFF FWQGI+L +L ++GIIK+ + W +VE + E +QN LV +EMVFFA++   AYSA+PY++++
Subjt:  VFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPYKSKS

A0A498K8E2 Uncharacterized protein4.6e-26362.81Show/hide
Query:  MWGLSCSLLPYPSPNFRASVPAAAATLRTDTSLPVPLDRKHYTSDSPFAPEVIKAVDSLQYEFRAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHD
        MW LSC    YP+   RASVP   AT R+ + L VP +   +  DSPF PEV  AVDSL  EFRAVDNLVARN+ +VLKAFQNAR+GSHHF G TGYGHD
Subjt:  MWGLSCSLLPYPSPNFRASVPAAAATLRTDTSLPVPLDRKHYTSDSPFAPEVIKAVDSLQYEFRAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHD

Query:  EAGGREALDNAFAEIVGAESAIVRS----------------------QFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFG
        EAGGREALD AFAEIVGAESAIVRS                      QFFSGTHAITCALFA LRPGDELLAVAG PYDTLEEVIGKRDS G+GSL DFG
Subjt:  EAGGREALDNAFAEIVGAESAIVRS----------------------QFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFG

Query:  VEYREVPLADDGGLDWEKLASALKPQTKCALIQRSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVETTEPPTV-----VRSLIKNPGGTL
        V+YREVPLA+DGGL+W+ L  AL+P+TKCALIQRSCGYSWRRSLSVDEIG+AI++IK QNP+CLVMVDNCYGEFVE+ EPP V       SLIKNPGGT+
Subjt:  VEYREVPLADDGGLDWEKLASALKPQTKCALIQRSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVETTEPPTV-----VRSLIKNPGGTL

Query:  APCGGYVAGRDKWVKAAAARLSAPGLGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVK-----------------------------AVQLGSREVLLAFC
        APCGGYVAGR+KWVKAA+ARLSAPGLGVD G+TPGDIMR+FFQGLFLSPQMVGEA+K                             AVQLGSRE LLAFC
Subjt:  APCGGYVAGRDKWVKAAAARLSAPGLGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVK-----------------------------AVQLGSREVLLAFC

Query:  EAVQRSSPVASFTKPVPGITPGYASE--------------VIFADGTFIDGSTSELSCDGPLREPFAVFCQGGTHWTQWGLVLGEIAADVVFLNDLCLLS
        EAVQR+SPV SFTKPV G TPGYASE              VIFADGTFIDGSTSELSCDGPLREPFAVFCQGG+HWTQWGLVLGE               
Subjt:  EAVQRSSPVASFTKPVPGITPGYASE--------------VIFADGTFIDGSTSELSCDGPLREPFAVFCQGGTHWTQWGLVLGEIAADVVFLNDLCLLS

Query:  EVISHFLVTAVVNISAVTMDYGQMIFLGVTSSVVLTAIFSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEA
                           +  Q++ LG T  +++T  FSL LL++H   W KP EQKAIVIIILMAPLYA  S++GLL++  S   F  L+SIKECYEA
Subjt:  EVISHFLVTAVVNISAVTMDYGQMIFLGVTSSVVLTAIFSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEA

Query:  LVISKFLSLLYSYLNISISKNIVPDEIKGREIHHTFPMTLFQLWVLSLQLSKTKPVTVFIWVLLDDQRVTLNVDSLSTAASQCSIESSHVEASQELDLSI
        LVI+KFL+LLYSYLNISISKNIVPDEIKGREIHH+FPMTLF    + L     K +  + W  +  + V      L        +  S V  +  + L+I
Subjt:  LVISKFLSLLYSYLNISISKNIVPDEIKGREIHHTFPMTLFQLWVLSLQLSKTKPVTVFIWVLLDDQRVTLNVDSLSTAASQCSIESSHVEASQELDLSI

Query:  RGDSSLSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYS
            S+SLALYSL+ FYHVF KEL PH PL KFLCIKGIVFFCFWQGIVL++LAA+ II++ H W DVEHI EALQN LVCVEMVFF+++Q  AY+
Subjt:  RGDSSLSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYS

A0A5J5A352 Uncharacterized protein3.7e-27364.44Show/hide
Query:  LSCSLLPYPSPNFRASVPAAAATLRTDTSLPVPLDRKHYTSDSPFAPEVIKAVDSLQYEFRAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHDEAG
        LSC+   YP+   RAS   A   +R+   + VP  R H+  D+PFAPEV KAVDSL  EFRAVDNLVARN+++VL+A+QNAR+G HHFGG TGYGH+EAG
Subjt:  LSCSLLPYPSPNFRASVPAAAATLRTDTSLPVPLDRKHYTSDSPFAPEVIKAVDSLQYEFRAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHDEAG

Query:  GREALDNAFAEIVGAESAIVRSQFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGVEYREVPLADDGGLDWEKLASALKP
        GREALD  FAEI GAESAIVRSQFFSGTHAITCALFA LRPGDELLAVAGAPYDTLEEVIG RDS GLGSLKDFGV+YREVPLA+DGGLDW+ L  ALKP
Subjt:  GREALDNAFAEIVGAESAIVRSQFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGVEYREVPLADDGGLDWEKLASALKP

Query:  QTKCALIQRSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVETTEPPTV-----VRSLIKNPGGTLAPCGGYVAGRDKWVKAAAARLSAPG
        QTKCALIQRSCGYSWRRSLSV EIG+AI+++K QNPDCLVMVDNCYGEFVE  EPP V       SLIKNPGGT+APCGGYVAG++KWVKAAAARLSAPG
Subjt:  QTKCALIQRSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVETTEPPTV-----VRSLIKNPGGTLAPCGGYVAGRDKWVKAAAARLSAPG

Query:  LGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVK-----------------------------AVQLGSREVLLAFCEAVQRSSPVASFTKPVPGITPGYAS
        LG+D GSTPGDIMRTFFQGLFLSPQMVGEA+K                             AVQLG+RE LLAFCEAVQRSSPV SFTKPV G TPGYAS
Subjt:  LGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVK-----------------------------AVQLGSREVLLAFCEAVQRSSPVASFTKPVPGITPGYAS

Query:  EVIFADGTFIDGSTSELSCDGPLREPFAVFCQGGTHWTQWGLVLGEI-----AADVVFLNDLCLL------------SEVISHFL---------------
        EVIFADGTFIDGSTSELSCDGPLREPF+VFCQGGTHWTQWGLVLGE+       D+ F    C              S +IS FL               
Subjt:  EVIFADGTFIDGSTSELSCDGPLREPFAVFCQGGTHWTQWGLVLGEI-----AADVVFLNDLCLL------------SEVISHFL---------------

Query:  VTAVVNISAVTMDYGQMIFLGVTSSVVLTAIFSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFL
        +T    IS   M+ GQ+  +G T  V+LT  F++ LL+QH   WKKP EQKAI+IIILMAP+YA  S++GLL+F  S  FF FL+S+KECYEALV++KFL
Subjt:  VTAVVNISAVTMDYGQMIFLGVTSSVVLTAIFSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFL

Query:  SLLYSYLNISISKNIVPDEIKGREIHHTFPMTLFQLWVLSLQLSKTKPVTVFIWVLLDDQRVTLNVDSLSTAASQCSIESSHVEASQELDLSIRGDSSLS
        +L+Y+YLNISISKNIVPDEIKGREIHH+FPMTLFQ   + L     K +  + W  +  + V      L  A     I  S V  +  + L+I    S+S
Subjt:  SLLYSYLNISISKNIVPDEIKGREIHHTFPMTLFQLWVLSLQLSKTKPVTVFIWVLLDDQRVTLNVDSLSTAASQCSIESSHVEASQELDLSIRGDSSLS

Query:  LALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPYKSKSAAK
        LALYSLVVFYHVF KEL+PH PLAKFLCIKGIVFFCFWQG+VLE+LAA+G+I++ H W DVE I EALQN LVCVEMVFF+  Q  AYSA+PY   +   
Subjt:  LALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPYKSKSAAK

Query:  PKLEKKE
         K +K E
Subjt:  PKLEKKE

A0A5N5GL16 Uncharacterized protein2.4e-24359.47Show/hide
Query:  MWGLSCSLLPYPSPNFRASVPAAAATLRTDTSLPVPLDRKHYTSDSPFAPEVIKAVDSLQYEFRAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHD
        MW LSC+   YP+ + RASVP   AT R+ + L VP +   +  DSPF PEV  AVDSL  EFRAVDNLVARN+ +VLKAFQNAR+GSHHF G TGYGHD
Subjt:  MWGLSCSLLPYPSPNFRASVPAAAATLRTDTSLPVPLDRKHYTSDSPFAPEVIKAVDSLQYEFRAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHD

Query:  EAGGREALDNAFAEIVGAESAIVRSQFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGVEYREVPLADDGGLDWEKLASA
        EAGGREALD AFAEIVGAESAIVRSQFFSGTHAITCALFA LRPGDELLAVAG PYDTLEEVIGKRDS G+GSL DFGV+YREVPLA+DGGL+W+ L  A
Subjt:  EAGGREALDNAFAEIVGAESAIVRSQFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGVEYREVPLADDGGLDWEKLASA

Query:  LKPQTKCALIQRSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVETTEPPTV-----VRSLIKNPGGTLAPCGGYVAGRDKWVKAAAARLS
        L+P+TKCALIQRSCGYSWRRSLSVDEIG+AI++IK QN +CLVMVDNCYGEFVE+ EPP V       SLIKNPGGT+APCGGYVAGR+KWVKAA+ARLS
Subjt:  LKPQTKCALIQRSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVETTEPPTV-----VRSLIKNPGGTLAPCGGYVAGRDKWVKAAAARLS

Query:  APGLGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVK-----------------------------AVQLGSREVLLAFCEAVQRSSPVASFTKPVPGITPG
        APGLGVD G+TPGDIMR FFQGLFLSPQMVGEA+K                             AVQLGSRE LLAFCEAVQR+SPV SFTKPV G TPG
Subjt:  APGLGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVK-----------------------------AVQLGSREVLLAFCEAVQRSSPVASFTKPVPGITPG

Query:  YASE--------------VIFADGTFIDGSTSELSCDGPLREPFAVFCQGGTHWTQWGLVLGEIAADVVFLNDLCLLSEVISHFLVTAVVNISAVTMDYG
        YASE              VIFADGTFIDGSTSELSCDGPLREPFAVFCQGG+HWTQWGLVLGE                                     
Subjt:  YASE--------------VIFADGTFIDGSTSELSCDGPLREPFAVFCQGGTHWTQWGLVLGEIAADVVFLNDLCLLSEVISHFLVTAVVNISAVTMDYG

Query:  QMIFLGVTSSVVLTAIFSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNI
                                                                     S   F  L+SIKECYEALVI+KFL+LLYSYLNISISKNI
Subjt:  QMIFLGVTSSVVLTAIFSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNI

Query:  VPDEIKGREIHHTFPMTLFQLWVLSLQLSKTKPVTVFIWVLLDDQRVTLNVDSLSTAASQCSIESSHVEASQELDLSIRGDSSLSLALYSLVVFYHVFDK
        VPDEIKGREIHH+FPMTLF    + L     K +  + W  +  + V      L        +  S V  +  + L+I    S+SLALYSLV FYHVF K
Subjt:  VPDEIKGREIHHTFPMTLFQLWVLSLQLSKTKPVTVFIWVLLDDQRVTLNVDSLSTAASQCSIESSHVEASQELDLSIRGDSSLSLALYSLVVFYHVFDK

Query:  ELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPYKSKSAAKPKLEKKE
        EL PH PL KFLCIKGIVFFCFWQGIVL++LAA+ II++ H W DVEHI EALQN LVCVEMVFF+++Q  AYSA PY+    +    ++K+
Subjt:  ELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPYKSKSAAKPKLEKKE

A0A6S7PJH8 Uncharacterized protein1.5e-26164.02Show/hide
Query:  PSPNFRASVPAAAATLRTDTSLPVPLDRKHYTSDSPFAPEVIKAVDSLQYEFRAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHDEAGGREALDNA
        P+ NFR    ++       +S  + +   H  S   F PEV  AVD+L  EFRAVDNLVA+NS++VLKAFQNAR+GSHHF G TGYGH+EAGGREALD A
Subjt:  PSPNFRASVPAAAATLRTDTSLPVPLDRKHYTSDSPFAPEVIKAVDSLQYEFRAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHDEAGGREALDNA

Query:  FAEIVGAESAIVRSQFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGVEYREVPLADDGGLDWEKLASALKPQTKCALIQ
        FAEI GAESAIVRSQFFSGTHAITCALFA LRPGDELLAVAGAPYDTLEEVIG RD  GLGSLKDFG+ YREV LADDGGLDW+ L  ALKP+TKCALIQ
Subjt:  FAEIVGAESAIVRSQFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGVEYREVPLADDGGLDWEKLASALKPQTKCALIQ

Query:  RSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVETTEPPTV-----VRSLIKNPGGTLAPCGGYVAGRDKWVKAAAARLSAPGLGVDSGST
        RSCGYSWR+SLSV+EI +AI +IK QNP+CLVMVDNCYGEF ET EPP V       SLIKNPGGT+APCGGYVAG++KWVKAAAARLSAPGLGVD GST
Subjt:  RSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVETTEPPTV-----VRSLIKNPGGTLAPCGGYVAGRDKWVKAAAARLSAPGLGVDSGST

Query:  PGDIMRTFFQGLFLSPQMVGEAVK-----------------------------AVQLGSREVLLAFCEAVQRSSPVASFTKPVPGITPGYASEVIFADGT
        PGDIMR FFQGL+LSPQMVGE++K                             AVQLGSRE LLAFCEAVQRSSPV+S+TKP+ G+T GYASEVIFADGT
Subjt:  PGDIMRTFFQGLFLSPQMVGEAVK-----------------------------AVQLGSREVLLAFCEAVQRSSPVASFTKPVPGITPGYASEVIFADGT

Query:  FIDGSTSELSCDGPLREPFAVFCQGGTHWTQWGLVLGEIAADVVFLNDLCLLSEVISHFLVTAVVNISAVTMDYGQMIFLGVTSSVVLTAIFSLWLLTQH
        FIDGSTSELSCDGPLREPF VFCQGGTHWTQWGLVL       VF     LL+ +      T  + +  V M   Q   +G  + V +T + +L L+  H
Subjt:  FIDGSTSELSCDGPLREPFAVFCQGGTHWTQWGLVLGEIAADVVFLNDLCLLSEVISHFLVTAVVNISAVTMDYGQMIFLGVTSSVVLTAIFSLWLLTQH

Query:  LSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREIHHTFPMTLFQLWVLS
        LS+WKKP EQKAI++IILMAP+YA  SY+GLL+   S TFF+ L+SIKECYEALV++KFL+LLY+YLNISISKNIVPDEIKGREIHH+FPMTLFQ   + 
Subjt:  LSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREIHHTFPMTLFQLWVLS

Query:  LQLSKTKPVTVFIWVLLDDQRVTLNVDSLSTAASQCSIESSHVEASQELDLSIRGDSSLSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQG
        L     K +  + W  +  + V      L        I    +  +  + L++    S+SLALY+LV+FYHVF KEL PH PLAKFLC+KGIVFFCFWQG
Subjt:  LQLSKTKPVTVFIWVLLDDQRVTLNVDSLSTAASQCSIESSHVEASQELDLSIRGDSSLSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQG

Query:  IVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPYKSKSAAKPKLEKKE
        IVL  L A+GIIK+ H W DV HI +ALQN LV VEMVFFAM QM AY+A+PYK   AA  K +KKE
Subjt:  IVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPYKSKSAAKPKLEKKE

SwissProt top hitse value%identityAlignment
P45624 Uncharacterized 33.9 kDa protein in glnA 5'region7.0e-5139.44Show/hide
Query:  PYDTLEEVIGKRDSQGLGSLKDFGVEYREVPLADDGGLDWEKLASALK-PQTKCALIQRSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFV
        PYDT+++VIG    +  G+L   G+ +  VPL ++GG+D+E+    LK  Q    +IQRS GY  R+S +VD+I K    +K  +P+ LV VDNCYGEF 
Subjt:  PYDTLEEVIGKRDSQGLGSLKDFGVEYREVPLADDGGLDWEKLASALK-PQTKCALIQRSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFV

Query:  ETTEPPT-----VVRSLIKNPGGTLAPCGGYVAGRDKWVKAAAARLSAPGLGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVKAV--------QLGSR---
        E  EP          SLIKN GG +A  GGY+ G+++ V+ AA RL+APG+G + G+T  + M  F++G FL+P   GEA+K +        ++G     
Subjt:  ETTEPPT-----VVRSLIKNPGGTLAPCGGYVAGRDKWVKAAAARLSAPGLGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVKAV--------QLGSR---

Query:  ------------------EVLLAFCEAVQRSSPVASFTKPVPGITPGYASEVIFADGTFIDGSTSELSCDGPLREPFAVFCQGG
                          E ++ F + VQ++SP+ SF +P+P   PGY  +VI A G F+ GST E S DGP+R P+A++ Q G
Subjt:  ------------------EVLLAFCEAVQRSSPVASFTKPVPGITPGYASEVIFADGTFIDGSTSELSCDGPLREPFAVFCQGG

P94479 Uncharacterized protein YnbB6.9e-8341.44Show/hide
Query:  RAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHDEAGGREALDNAFAEIVGAESAIVRSQFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVI
        + ++ +  RN  +VL++++  ++   HF  STGYG+D+  GR+ L++ +A++ G E+ +VR Q  SGTHAI+ ALF +LRPGDELL + G PYDTLEE++
Subjt:  RAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHDEAGGREALDNAFAEIVGAESAIVRSQFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVI

Query:  GKRDSQGLGSLKDFGVEYREVPLADDGGLDWEKLASALKPQTKCALIQRSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVETTEP-----
        G R  +  GSLKDF + Y  V L  DG +D++ +A+A+ P+TK   IQRS GY+ R S  + EI + IR +K  N + +V VDNCYGEFVE  EP     
Subjt:  GKRDSQGLGSLKDFGVEYREVPLADDGGLDWEKLASALKPQTKCALIQRSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVETTEP-----

Query:  PTVVRSLIKNPGGTLAPCGGYVAGRDKWVKAAAARLSAPGLGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVK----------------------------
          +  SLIKNPGG LA  GGY+ G+ KW++A + R+++PG+G ++G++    ++  +QG FL+P +V +++K                            
Subjt:  PTVVRSLIKNPGGTLAPCGGYVAGRDKWVKAAAARLSAPGLGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVK----------------------------

Query:  -AVQLGSREVLLAFCEAVQRSSPVASFTKPVPGITPGYASEVIFADGTFIDGSTSELSCDGPLREPFAVFCQGG
         +V+   RE ++AFC+A+Q +SP+ +   P P   PGY  +VI A GTFI G++ ELS DGP+R P+  + QGG
Subjt:  -AVQLGSREVLLAFCEAVQRSSPVASFTKPVPGITPGYASEVIFADGTFIDGSTSELSCDGPLREPFAVFCQGG

Q17QL9 Transmembrane protein 184C3.0e-2530Show/hide
Query:  VVLTAIFSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREI
        ++LT   SLW++ QHL ++ +P  QK I+ I+ M P+Y+  S+I L       +  +++++ +ECYEA VI  F+  L +YL       ++  E K ++ 
Subjt:  VVLTAIFSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREI

Query:  HHTFPMTLFQLWVLSLQLSKTKPVTVFIWVLLDDQRVTLNVDSLSTAASQCSIESSHVEASQELDLSIRGDSSLSLALYSLVVFYHVFDKELKPHSPLAK
        H   P+     W +   L     + V  + ++   R    + +L         E +   ++    L I  + S   A+Y L++FY V  +EL P  P+ K
Subjt:  HHTFPMTLFQLWVLSLQLSKTKPVTVFIWVLLDDQRVTLNVDSLSTAASQCSIESSHVEASQELDLSIRGDSSLSLALYSLVVFYHVFDKELKPHSPLAK

Query:  FLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHA--WFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPY
        FLC+K +VF  FWQ +V+ +L  VG+I  +H   W  VE +   LQ+ ++C+EM   A+     +S  PY
Subjt:  FLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHA--WFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPY

Q54WM0 Transmembrane protein 184 homolog DDB_G02795553.0e-2531.88Show/hide
Query:  IFLGVTSSVVLTAIFSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVP
        I  GV S V +  + S +L+ +HL N+  P  QK IV I++M P+Y+  S++  L F+  S +F   + +++ YEA V+  F SL+ +Y+      ++V 
Subjt:  IFLGVTSSVVLTAIFSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVP

Query:  DEIKGREIHHTFPMTLFQLWVLS-----------LQLSKTKPVTVFIWVLLDDQRVTLNVDSLSTAASQCSIESSHVEASQELDLSIRGDSSLSLALYSL
               + H FP+T      L            LQ    KP+   I ++L+ Q              +  + + +V       L++  + S+ L+LY L
Subjt:  DEIKGREIHHTFPMTLFQLWVLS-----------LQLSKTKPVTVFIWVLLDDQRVTLNVDSLSTAASQCSIESSHVEASQELDLSIRGDSSLSLALYSL

Query:  VVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAM
        V++Y   ++ELKP  PL KFLCIK I+FF FWQ I +  L   G+I    +W  V++I+ ALQ+ + CVEMV  A+
Subjt:  VVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAM

Q5ZMP3 Transmembrane protein 184C1.0e-2529.37Show/hide
Query:  VVLTAIFSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREI
        +++T   SLW + QHL ++ +P  QK I+ I+ M P+Y+  S+I L          +++++ +ECYEA VI  F+  L +YL       ++  E K ++ 
Subjt:  VVLTAIFSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREI

Query:  HHTFPMTLFQLWVLS-----------LQLSKTKPVTVFIWVLLDDQRVTLNVDSLSTAASQCSIESSHVEASQELD-----LSIRGDSSLSLALYSLVVF
         H  P+     W +            LQ +  +P T  I ++                   C +   + E +   D     L I  + S   A+Y LV+F
Subjt:  HHTFPMTLFQLWVLS-----------LQLSKTKPVTVFIWVLLDDQRVTLNVDSLSTAASQCSIESSHVEASQELD-----LSIRGDSSLSLALYSLVVF

Query:  YHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHA--WFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPY
        Y V  +EL P  P+ KFLC+K +VF  FWQ +++ +L  VG+I  +H   W  VE +   LQ+ ++CVEM   A+    ++S  PY
Subjt:  YHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHA--WFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPY

Arabidopsis top hitse value%identityAlignment
AT1G11200.1 Protein of unknown function (DUF300)3.1e-7047.51Show/hide
Query:  ISAVTMDYGQMIFLGVTSSVVLTAIFSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSY
        I   T+   ++  +G    V+L+  F++ L++QHL  WKKP EQ+AI+II+LMAP+YA  S++GLL+   S  FF+FL+++KECYEALVI+KFL+L+YSY
Subjt:  ISAVTMDYGQMIFLGVTSSVVLTAIFSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSY

Query:  LNISISKNIVPDEIKGREIHHTFPMTLFQLWVLSLQLSKTKPVTVFIWVLLDDQRVTLNVDSLSTAASQCSIESSHVEASQELDLSIRGDSSLSLALYSL
        +NIS+S  I+PDE KGREIHH+FPMTLF      L     K +  + W     + V      L        I    +       L++    S+SLALYSL
Subjt:  LNISISKNIVPDEIKGREIHHTFPMTLFQLWVLSLQLSKTKPVTVFIWVLLDDQRVTLNVDSLSTAASQCSIESSHVEASQELDLSIRGDSSLSLALYSL

Query:  VVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPYKSKSAAKPKLEKK
        V FYHVF KEL+PH PL KF+C+KGIVFFCFWQGIVL++L  +G+IK+ H W +V+ + EALQN LVC+EM+ F++IQ  A+  +PY  ++ AK +  K+
Subjt:  VVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPYKSKSAAKPKLEKK

Query:  E
        +
Subjt:  E

AT1G77220.1 Protein of unknown function (DUF300)2.0e-2126.73Show/hide
Query:  LSEVISHFLVTAVVNISAVTMDYGQMIFLGVTSSVVLTAIFSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECY
        + E  S F +    N+   +  Y     L  +  VV+  +  ++L+ +HL+++ +P EQK ++ +ILM P+YA  S++ L+   A+       E I++CY
Subjt:  LSEVISHFLVTAVVNISAVTMDYGQMIFLGVTSSVVLTAIFSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECY

Query:  EALVISKFLSLLYSYLN--------------ISISKNIVPDEIKGREIHHTFPMTLF-QLWVLSLQLSKTKPVTVFIWVLLDDQRVTLNVDSLSTAASQC
        EA  +  F   L + L+              I+ S  ++        + H FPM  F + W L  Q      + +  +++L   ++   + ++   A   
Subjt:  EALVISKFLSLLYSYLN--------------ISISKNIVPDEIKGREIHHTFPMTLF-QLWVLSLQLSKTKPVTVFIWVLLDDQRVTLNVDSLSTAASQC

Query:  SIESSHVEASQELDLSIRGDSSLSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVE
          E           L++  + S + ALY LV FY+V   +L P  PLAKFL  K IVF  +WQGI++  L ++G++K   A    + +   +Q+ ++C+E
Subjt:  SIESSHVEASQELDLSIRGDSSLSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVE

Query:  MVFFAMIQMSAYSASPYK
        M   A++ +  + A+PYK
Subjt:  MVFFAMIQMSAYSASPYK

AT4G21570.1 Protein of unknown function (DUF300)2.1e-7956.31Show/hide
Query:  QMIFLGVTSSVVLTAIFSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNI
        Q+ F     SV+LT  F++ L++QHL +WK P EQKAI+II+LMAP+YA +S+IGLLE   S TFFLFLESIKECYEALVI+KFL+L+YSYLNIS+SKNI
Subjt:  QMIFLGVTSSVVLTAIFSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNI

Query:  VPDEIKGREIHHTFPMTLFQLWVLSLQLSKTKPVTVFIWVLLDDQRVTLNVDSLSTAASQCSIESSHVEASQELDLSIRGDSSLSLALYSLVVFYHVFDK
        +PD IKGREIHH+FPMTLFQ  V+ L     K +  + W  +  + V     +L  A        S +  +     +I  + S+SLALYSLV+FYHVF K
Subjt:  VPDEIKGREIHHTFPMTLFQLWVLSLQLSKTKPVTVFIWVLLDDQRVTLNVDSLSTAASQCSIESSHVEASQELDLSIRGDSSLSLALYSLVVFYHVFDK

Query:  ELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPYKSKSAAKPKLEKKEQ
        EL PH+PLAKFLCIKGIVFF FWQGI L++L A+G IK+ H W +VE I EA+QN LVC+EMV FA +Q  AY A PY  ++  K KL+KK +
Subjt:  ELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPYKSKSAAKPKLEKKEQ

AT4G38360.2 Protein of unknown function (DUF300)2.1e-1828.18Show/hide
Query:  VVLTAIFSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGRE-
        +VLT   SL+L+  HLS +K P EQK ++ +ILM P Y+  S+  L++   S    +    +++CYE+  +  F   L + +     + I   E +GR+ 
Subjt:  VVLTAIFSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGRE-

Query:  --------------IHHTFPMTLF-QLWVLSLQLSKTKPVTVFIWVLLDDQRVTLNVDSLSTAASQCSIESSHVEASQELD-------LSIRGDSSLSLA
                      I H FPM LF + W LS    +     +  ++++             TA +   +E+  V    E         L++  + S S A
Subjt:  --------------IHHTFPMTLF-QLWVLSLQLSKTKPVTVFIWVLLDDQRVTLNVDSLSTAASQCSIESSHVEASQELD-------LSIRGDSSLSLA

Query:  LYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPY
        LY LV FY     EL    PLAKFL  K IVF  +WQG+ + +L+++G+ K+  A      +  ++Q+ ++C+EM   +++ +  + A PY
Subjt:  LYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPY

AT5G26740.1 Protein of unknown function (DUF300)2.8e-1829.55Show/hide
Query:  TSSVVLTAIFSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKG
        T   +  AIF ++   +HL N+ +P  Q+ IV II M P+YA +S++ L+   +S    ++ +SI+E YEA VI  FLSL  +++        V   + G
Subjt:  TSSVVLTAIFSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKG

Query:  REIHHTF--------PMTLFQLWVL-----SLQLSKTKPVTVFIWVLLDDQRVTLNVDSLSTAASQCSIESSHVEASQE-LDLSIRGDSSLSLALYSLVV
        R +  ++        P+TL   ++       LQ    KP+ V + ++L                ++   +  +    Q  L L+I    S ++ALY+LV+
Subjt:  REIHHTF--------PMTLFQLWVL-----SLQLSKTKPVTVFIWVLLDDQRVTLNVDSLSTAASQCSIESSHVEASQE-LDLSIRGDSSLSLALYSLVV

Query:  FYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPYKSKSAA
        FY      L+P +P+ KF+ IK +VF  +WQG+++ + A  G IK+  A     H     QN ++CVEM+  A     A+   PYK  + A
Subjt:  FYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPYKSKSAA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGGGGCTTATCCTGCTCTTTGCTTCCTTATCCGTCGCCTAATTTCAGAGCGTCCGTTCCGGCAGCGGCAGCAACCCTTCGCACCGACACTTCTTTACCGGTGCCCCT
CGATCGGAAACATTACACTTCCGACTCCCCATTTGCTCCGGAGGTTATTAAGGCGGTAGACTCCTTGCAGTATGAATTCAGGGCAGTGGATAATTTGGTGGCACGTAATT
CTGCTAAAGTTCTCAAAGCCTTTCAGAATGCTCGGTTAGGATCTCATCATTTTGGAGGATCCACTGGTTATGGTCATGATGAAGCTGGAGGACGTGAGGCACTTGACAAC
GCTTTTGCTGAGATAGTTGGAGCAGAATCTGCAATAGTCCGATCACAGTTTTTCTCAGGTACTCATGCTATTACGTGTGCTTTATTTGCACTTTTGAGGCCAGGGGATGA
GCTTTTGGCAGTAGCTGGTGCTCCATATGACACACTAGAGGAGGTCATTGGGAAAAGAGATTCTCAGGGGCTGGGTTCCTTGAAAGATTTTGGAGTAGAGTATCGAGAAG
TTCCACTTGCTGATGACGGTGGACTCGACTGGGAAAAACTTGCAAGTGCTTTGAAACCTCAGACAAAATGTGCGCTCATACAAAGGTCATGTGGTTATTCTTGGCGGCGA
AGTTTAAGTGTTGACGAAATAGGAAAAGCAATAAGACTGATCAAGATGCAAAACCCTGATTGCTTGGTGATGGTGGATAACTGTTATGGAGAATTTGTGGAAACCACTGA
ACCTCCAACTGTGGTAAGAAGTTTGATAAAAAATCCTGGTGGAACGCTTGCACCTTGTGGCGGATATGTTGCAGGTCGAGACAAATGGGTGAAAGCGGCTGCAGCTCGTT
TGTCTGCACCCGGCTTGGGGGTGGATTCGGGCTCTACCCCTGGTGATATCATGAGGACATTTTTTCAAGGATTATTCCTTTCACCTCAAATGGTTGGTGAGGCAGTTAAG
GCTGTACAACTTGGAAGTCGTGAAGTTTTGCTTGCATTCTGCGAGGCTGTACAGAGAAGCTCTCCTGTCGCTTCGTTTACTAAACCGGTTCCGGGAATAACTCCTGGATA
TGCATCAGAGGTGATCTTTGCTGATGGAACTTTTATTGATGGGAGCACAAGTGAACTTTCTTGTGATGGACCTCTAAGAGAGCCATTTGCAGTCTTTTGCCAGGGTGGAA
CACATTGGACCCAGTGGGGCTTAGTTCTAGGAGAGATTGCTGCTGATGTAGTATTTCTAAATGATTTATGCCTATTAAGCGAAGTTATATCTCATTTCTTGGTAACCGCA
GTGGTTAACATATCAGCAGTCACAATGGATTATGGACAGATGATTTTTCTTGGAGTTACTTCCTCTGTTGTTCTCACTGCAATATTTTCATTATGGCTCCTTACCCAACA
TCTGTCTAACTGGAAAAAACCAGCGGAACAAAAGGCCATTGTTATTATAATTCTTATGGCTCCTTTATATGCTGGTATCTCCTATATTGGTCTGTTGGAATTTATGGCAA
GCAGTACTTTCTTTTTGTTTTTGGAATCAATTAAGGAATGTTATGAGGCTTTGGTGATATCTAAGTTCTTGAGTTTACTCTACAGCTACTTAAATATATCCATAAGCAAA
AACATTGTGCCAGATGAGATCAAAGGTAGAGAAATTCACCATACTTTTCCGATGACCCTCTTTCAGCTGTGGGTCTTAAGTTTACAGCTTAGTAAAACTAAACCAGTCAC
TGTATTTATTTGGGTTTTATTGGATGATCAGAGAGTGACTTTGAATGTTGACTCCCTATCAACTGCAGCCTCACAGTGCTCGATTGAATCATCACACGTTGAAGCTTCTC
AAGAACTGGACCTATCAATTCGTGGTGATTCGTCCTTGTCGCTTGCTCTGTATTCCCTGGTGGTTTTCTATCATGTATTTGATAAGGAGTTGAAACCACATAGCCCTCTT
GCGAAGTTCTTGTGCATCAAAGGGATTGTCTTCTTCTGCTTCTGGCAGGGAATTGTTCTTGAGATGCTTGCTGCAGTGGGCATAATCAAAGCAGAACACGCTTGGTTTGA
TGTTGAGCACATAAATGAAGCCTTACAAAACACTCTAGTTTGTGTGGAGATGGTTTTCTTTGCAATGATTCAGATGTCTGCATACAGTGCTAGCCCTTACAAATCTAAAT
CTGCAGCAAAACCTAAACTGGAGAAGAAGGAACAGTGA
mRNA sequenceShow/hide mRNA sequence
TTAGCTAAATTTTCAATATTTAAGAACAGCGCAAAGAGAAAAAAAAAAAATGGACATTTTTTTATCTTCATAGTCAAAATCCCATAAAATTCAACCGGATATTTAGAGAG
ACGAAGTTATAATTTTCCAGACGAAGGTGGCAGTTTCTCGCGGAAGATGTGGGGCTTATCCTGCTCTTTGCTTCCTTATCCGTCGCCTAATTTCAGAGCGTCCGTTCCGG
CAGCGGCAGCAACCCTTCGCACCGACACTTCTTTACCGGTGCCCCTCGATCGGAAACATTACACTTCCGACTCCCCATTTGCTCCGGAGGTTATTAAGGCGGTAGACTCC
TTGCAGTATGAATTCAGGGCAGTGGATAATTTGGTGGCACGTAATTCTGCTAAAGTTCTCAAAGCCTTTCAGAATGCTCGGTTAGGATCTCATCATTTTGGAGGATCCAC
TGGTTATGGTCATGATGAAGCTGGAGGACGTGAGGCACTTGACAACGCTTTTGCTGAGATAGTTGGAGCAGAATCTGCAATAGTCCGATCACAGTTTTTCTCAGGTACTC
ATGCTATTACGTGTGCTTTATTTGCACTTTTGAGGCCAGGGGATGAGCTTTTGGCAGTAGCTGGTGCTCCATATGACACACTAGAGGAGGTCATTGGGAAAAGAGATTCT
CAGGGGCTGGGTTCCTTGAAAGATTTTGGAGTAGAGTATCGAGAAGTTCCACTTGCTGATGACGGTGGACTCGACTGGGAAAAACTTGCAAGTGCTTTGAAACCTCAGAC
AAAATGTGCGCTCATACAAAGGTCATGTGGTTATTCTTGGCGGCGAAGTTTAAGTGTTGACGAAATAGGAAAAGCAATAAGACTGATCAAGATGCAAAACCCTGATTGCT
TGGTGATGGTGGATAACTGTTATGGAGAATTTGTGGAAACCACTGAACCTCCAACTGTGGTAAGAAGTTTGATAAAAAATCCTGGTGGAACGCTTGCACCTTGTGGCGGA
TATGTTGCAGGTCGAGACAAATGGGTGAAAGCGGCTGCAGCTCGTTTGTCTGCACCCGGCTTGGGGGTGGATTCGGGCTCTACCCCTGGTGATATCATGAGGACATTTTT
TCAAGGATTATTCCTTTCACCTCAAATGGTTGGTGAGGCAGTTAAGGCTGTACAACTTGGAAGTCGTGAAGTTTTGCTTGCATTCTGCGAGGCTGTACAGAGAAGCTCTC
CTGTCGCTTCGTTTACTAAACCGGTTCCGGGAATAACTCCTGGATATGCATCAGAGGTGATCTTTGCTGATGGAACTTTTATTGATGGGAGCACAAGTGAACTTTCTTGT
GATGGACCTCTAAGAGAGCCATTTGCAGTCTTTTGCCAGGGTGGAACACATTGGACCCAGTGGGGCTTAGTTCTAGGAGAGATTGCTGCTGATGTAGTATTTCTAAATGA
TTTATGCCTATTAAGCGAAGTTATATCTCATTTCTTGGTAACCGCAGTGGTTAACATATCAGCAGTCACAATGGATTATGGACAGATGATTTTTCTTGGAGTTACTTCCT
CTGTTGTTCTCACTGCAATATTTTCATTATGGCTCCTTACCCAACATCTGTCTAACTGGAAAAAACCAGCGGAACAAAAGGCCATTGTTATTATAATTCTTATGGCTCCT
TTATATGCTGGTATCTCCTATATTGGTCTGTTGGAATTTATGGCAAGCAGTACTTTCTTTTTGTTTTTGGAATCAATTAAGGAATGTTATGAGGCTTTGGTGATATCTAA
GTTCTTGAGTTTACTCTACAGCTACTTAAATATATCCATAAGCAAAAACATTGTGCCAGATGAGATCAAAGGTAGAGAAATTCACCATACTTTTCCGATGACCCTCTTTC
AGCTGTGGGTCTTAAGTTTACAGCTTAGTAAAACTAAACCAGTCACTGTATTTATTTGGGTTTTATTGGATGATCAGAGAGTGACTTTGAATGTTGACTCCCTATCAACT
GCAGCCTCACAGTGCTCGATTGAATCATCACACGTTGAAGCTTCTCAAGAACTGGACCTATCAATTCGTGGTGATTCGTCCTTGTCGCTTGCTCTGTATTCCCTGGTGGT
TTTCTATCATGTATTTGATAAGGAGTTGAAACCACATAGCCCTCTTGCGAAGTTCTTGTGCATCAAAGGGATTGTCTTCTTCTGCTTCTGGCAGGGAATTGTTCTTGAGA
TGCTTGCTGCAGTGGGCATAATCAAAGCAGAACACGCTTGGTTTGATGTTGAGCACATAAATGAAGCCTTACAAAACACTCTAGTTTGTGTGGAGATGGTTTTCTTTGCA
ATGATTCAGATGTCTGCATACAGTGCTAGCCCTTACAAATCTAAATCTGCAGCAAAACCTAAACTGGAGAAGAAGGAACAGTGAGATGTCACTTGGGTTGGTAACACGCA
TCAATTCAGATGGGTACTGAATAATAAATCAGCCATAATGGTAGGTAGGTTTCAAACTTTCATGGATGCTTTCTGCTCTTCAGCATCGATTATGTGAGAAATATCGAGTG
TTATACTTGGAAACTCTATTTTGGCTATTATTACCACATCTCAGGCGATATTGCTGTGACATATCCTTCGTCGAAGTCGAGCTCTCTGGTATACAAACTTCCTTCACTTC
TAGCCTCTATTCTCTTCTTTCTGTGTCTATTCGAATCAGAATAGATCGTGTGGCTTATCAATCTTGCTTATGTTGTTTGCCGTT
Protein sequenceShow/hide protein sequence
MWGLSCSLLPYPSPNFRASVPAAAATLRTDTSLPVPLDRKHYTSDSPFAPEVIKAVDSLQYEFRAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHDEAGGREALDN
AFAEIVGAESAIVRSQFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGVEYREVPLADDGGLDWEKLASALKPQTKCALIQRSCGYSWRR
SLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVETTEPPTVVRSLIKNPGGTLAPCGGYVAGRDKWVKAAAARLSAPGLGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVK
AVQLGSREVLLAFCEAVQRSSPVASFTKPVPGITPGYASEVIFADGTFIDGSTSELSCDGPLREPFAVFCQGGTHWTQWGLVLGEIAADVVFLNDLCLLSEVISHFLVTA
VVNISAVTMDYGQMIFLGVTSSVVLTAIFSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISK
NIVPDEIKGREIHHTFPMTLFQLWVLSLQLSKTKPVTVFIWVLLDDQRVTLNVDSLSTAASQCSIESSHVEASQELDLSIRGDSSLSLALYSLVVFYHVFDKELKPHSPL
AKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPYKSKSAAKPKLEKKEQ