; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC11G201100 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC11G201100
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
DescriptionProtein of unknown function (DUF300)
Genome locationCiama_Chr11:678439..690262
RNA-Seq ExpressionCaUC11G201100
SyntenyCaUC11G201100
Gene Ontology termsGO:0016020 - membrane (cellular component)
GO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR005178 - Organic solute transporter subunit alpha/Transmembrane protein 184
IPR009651 - Putative methionine gamma-lyase
IPR015421 - Pyridoxal phosphate-dependent transferase, major domain
IPR015424 - Pyridoxal phosphate-dependent transferase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAB4107049.1 unnamed protein product [Lactuca saligna]5.8e-27567.11Show/hide
Query:  PSPNFRV----SVPAATLRSGTSLPVPLDRQHCTSDSPFAPEVVKAVDSLQYEFRAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHDEAGGREALD
        P+ NFRV    S  AA L+S + + +    QH  S   F PEV  AVD+L  EFRAVDNLVA+NS++VLKAFQNAR+GSHHF G TGYGH+EAGGREALD
Subjt:  PSPNFRV----SVPAATLRSGTSLPVPLDRQHCTSDSPFAPEVVKAVDSLQYEFRAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHDEAGGREALD

Query:  NAFAEIVGAESAIVRSQFFSGTHAITCALFSLLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGVEYREVPLAEDGGLDWEKLACALKPQTKCAL
         AFAEI GAESAIVRSQFFSGTHAITCALF+ LRPGDELLAVAGAPYDTLEEVIG RD  GLGSLKDFG+ YREV LA+DGGLDW+ L  ALKP+TKCAL
Subjt:  NAFAEIVGAESAIVRSQFFSGTHAITCALFSLLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGVEYREVPLAEDGGLDWEKLACALKPQTKCAL

Query:  IQRSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVETTEPPTVGADLIAGSLIKNPGGTLAPCGGYVAGRDKLVKAAAARLSAPGLGVDSG
        IQRSCGYSWR+SLSV+EI +AI +IK QNP+CLVMVDNCYGEF ET EPP VGADLIAGSLIKNPGGT+APCGGYVAG++K VKAAAARLSAPGLGVD G
Subjt:  IQRSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVETTEPPTVGADLIAGSLIKNPGGTLAPCGGYVAGRDKLVKAAAARLSAPGLGVDSG

Query:  STPGDIMRTFFQGLFLSPQMVG-----------------------------------------------EAVKRSSPVASFTKPVPGITPGYASEVIFAD
        STPGDIMR FFQGL+LSPQMVG                                               EAV+RSSPV+S+TKP+ G+T GYASEVIFAD
Subjt:  STPGDIMRTFFQGLFLSPQMVG-----------------------------------------------EAVKRSSPVASFTKPVPGITPGYASEVIFAD

Query:  GTFIDGSTSELSCDGPLREPFAVFCQS---------------VRGLVGFALMYGTSSMQGLERKPCLQNLPGS--------VFSLWLLTQHLSNWKKPAE
        GTFIDGSTSELSCDGPLREPF VFCQ                VR L+      G   ++ ++     + L GS        + +L L+  HLS+WKKP E
Subjt:  GTFIDGSTSELSCDGPLREPFAVFCQS---------------VRGLVGFALMYGTSSMQGLERKPCLQNLPGS--------VFSLWLLTQHLSNWKKPAE

Query:  QKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREIHHTFPMTLFQPHSARLNHHTLKLL
        QKAI++IILMAP+YA  SY+GLL+   S TFF+ L+SIKECYEALV++KFL+LLY+YLNISISKNIVPDEIKGREIHH+FPMTLFQPHS RLNH  LKLL
Subjt:  QKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREIHHTFPMTLFQPHSARLNHHTLKLL

Query:  KNWTYQFVVIRPVCSILMIALQLIDVYPSWLSWTFTIILNVSVSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEH
        K WT+QFVVIRPVCS+LMI LQL+++YP WLSWTFT+ILNVSVSLALY+LV+FYHVF KEL PH PLAKFLC+KGIVFFCFWQGIVL  L A+GIIK+ H
Subjt:  KNWTYQFVVIRPVCSILMIALQLIDVYPSWLSWTFTIILNVSVSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEH

Query:  AWFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPYKSKSAAKSKVE
         W DV HI +ALQN LV VEMVFFAM QM AY+A+PYK+    + K E
Subjt:  AWFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPYKSKSAAKSKVE

KAA8523841.1 hypothetical protein F0562_010264 [Nyssa sinensis]1.9e-28165.12Show/hide
Query:  MWSLSSSFLSYPSPNFRVSVPAATLRSGTSLPVPLDRQHCTSDSPFAPEVVKAVDSLQYEFRAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHDEA
        M  LS +  +YP+   R S     +RS   + VP  R H   D+PFAPEV KAVDSL  EFRAVDNLVARN+++VL+A+QNAR+G HHFGG TGYGH+EA
Subjt:  MWSLSSSFLSYPSPNFRVSVPAATLRSGTSLPVPLDRQHCTSDSPFAPEVVKAVDSLQYEFRAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHDEA

Query:  GGREALDNAFAEIVGAESAIVRSQFFSGTHAITCALFSLLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGVEYREVPLAEDGGLDWEKLACALK
        GGREALD  FAEI GAESAIVRSQFFSGTHAITCALF+ LRPGDELLAVAGAPYDTLEEVIG RDS GLGSLKDFGV+YREVPLAEDGGLDW+ L  ALK
Subjt:  GGREALDNAFAEIVGAESAIVRSQFFSGTHAITCALFSLLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGVEYREVPLAEDGGLDWEKLACALK

Query:  PQTKCALIQRSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVETTEPPTVGADLIAGSLIKNPGGTLAPCGGYVAGRDKLVKAAAARLSAP
        PQTKCALIQRSCGYSWRRSLSV EIG+AI+++K QNPDCLVMVDNCYGEFVE  EPP VGADLIAGSLIKNPGGT+APCGGYVAG++K VKAAAARLSAP
Subjt:  PQTKCALIQRSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVETTEPPTVGADLIAGSLIKNPGGTLAPCGGYVAGRDKLVKAAAARLSAP

Query:  GLGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVK-----------------------------------------------RSSPVASFTKPVPGITPGYA
        GLG+D GSTPGDIMRTFFQGLFLSPQMVGEA+K                                               RSSPV SFTKPV G TPGYA
Subjt:  GLGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVK-----------------------------------------------RSSPVASFTKPVPGITPGYA

Query:  SEVIFADGTFIDGSTSELSCDGPLREPFAVFCQSVRGLVGFALMYG------------------------------------TSSMQGLERKPCL-----
        SEVIFADGTFIDGSTSELSCDGPLREPF+VFCQ       + L+ G                                    +  +  ++R  CL     
Subjt:  SEVIFADGTFIDGSTSELSCDGPLREPFAVFCQSVRGLVGFALMYG------------------------------------TSSMQGLERKPCL-----

Query:  ---------------QNLPGSV--------FSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLS
                         L GS         F++ LL+QH   WKKP EQKAI+IIILMAP+YA  S++GLL+F  S  FF FL+S+KECYEALV++KFL+
Subjt:  ---------------QNLPGSV--------FSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLS

Query:  LLYSYLNISISKNIVPDEIKGREIHHTFPMTLFQPHSARLNHHTLKLLKNWTYQFVVIRPVCSILMIALQLIDVYPSWLSWTFTIILNVSVSLALYSLVV
        L+Y+YLNISISKNIVPDEIKGREIHH+FPMTLFQP + RLNHHTLKLLK+WT+QFVVIRPVCSILMIALQL+ +YPSW+SWTFTIILN+SVSLALYSLVV
Subjt:  LLYSYLNISISKNIVPDEIKGREIHHTFPMTLFQPHSARLNHHTLKLLKNWTYQFVVIRPVCSILMIALQLIDVYPSWLSWTFTIILNVSVSLALYSLVV

Query:  FYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPYKSKSAAKSKVEKKDE
        FYHVF KEL+PH PLAKFLCIKGIVFFCFWQG+VLE+LAA+G+I++ H W DVE I EALQN LVCVEMVFF+  Q  AYSA+PY +  A  S  +KK+E
Subjt:  FYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPYKSKSAAKSKVEKKDE

KAG6591988.1 hypothetical protein SDJN03_14334, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0085.19Show/hide
Query:  MWSLSSSFLSYPSPNFRVSVPAATLRSGTSLPVPLDRQHCTSDSPFAPEVVKAVDSLQYEFRAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHDEA
        MW LS S  SYPSPNFR S PAATLRS TSLPV LDR+H TSD+PFAPEVVKAVDSLQYEFRAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHDEA
Subjt:  MWSLSSSFLSYPSPNFRVSVPAATLRSGTSLPVPLDRQHCTSDSPFAPEVVKAVDSLQYEFRAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHDEA

Query:  GGREALDNAFAEIVGAESAIVRSQFFSGTHAITCALFSLLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGVEYREVPLAEDGGLDWEKLACALK
        GGREALDNAFAEIVGAESAIVRSQFFSGTHAITCALF+LLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGVEYREVPLAEDGGLDWEKLA +LK
Subjt:  GGREALDNAFAEIVGAESAIVRSQFFSGTHAITCALFSLLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGVEYREVPLAEDGGLDWEKLACALK

Query:  PQTKCALIQRSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVETTEPPTVGADLIAGSLIKNPGGTLAPCGGYVAGRDKLVKAAAARLSAP
        PQTKCALIQRSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVET EPPTVGADLIAGSLIKNPGGTLAPCGGYVAGRDK VKAAAARLSAP
Subjt:  PQTKCALIQRSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVETTEPPTVGADLIAGSLIKNPGGTLAPCGGYVAGRDKLVKAAAARLSAP

Query:  GLGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVK-----------------------------------------------RSSPVASFTKPVPGITPGYA
        GLGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVK                                               RSSPVAS+TKPVPGITPGYA
Subjt:  GLGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVK-----------------------------------------------RSSPVASFTKPVPGITPGYA

Query:  SEVIFADGTFIDGSTSELSCDGPLREPFAVFCQSVRGLVGFALMYGTSSMQGLERKPCLQNLPGSVFSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAG
        SEVIFADGTFIDGSTSELSCDGPLREPFAVFCQ  +  V        S +           +   VFSLWLL+QHLSNW+KPAEQKAIV+IILMAPLYAG
Subjt:  SEVIFADGTFIDGSTSELSCDGPLREPFAVFCQSVRGLVGFALMYGTSSMQGLERKPCLQNLPGSVFSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAG

Query:  ISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREIHHTFPMTLFQPHSARLNHHTLKLLKNWTYQFVVIRPVCSI
        ISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREIHH+FPMTLFQPHSARLNHHTLKLLK WTYQFVVIRPVCSI
Subjt:  ISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREIHHTFPMTLFQPHSARLNHHTLKLLKNWTYQFVVIRPVCSI

Query:  LMIALQLIDVYPSWLSWTFTIILNVSVSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTL
        LMI+LQLIDVYP WLSWTFTIILNVSVSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEA+QNTL
Subjt:  LMIALQLIDVYPSWLSWTFTIILNVSVSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTL

Query:  VCVEMVFFAMIQMSAYSASPYKSKSAAKSKVEKKDE
        VCVEMVFFAM+QMSAYSASPY+ +SAAKSK EKK++
Subjt:  VCVEMVFFAMIQMSAYSASPYKSKSAAKSKVEKKDE

KAG7024863.1 ynbB [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0086.21Show/hide
Query:  MWSLSSSFLSYPSPNFRVSVPAATLRSGTSLPVPLDRQHCTSDSPFAPEVVKAVDSLQYEFRAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHDEA
        MW LS S  SYPSPNFR S PAATLRS TSLPV LDR+H TSD+PFAPEVVKAVDSLQYEFRAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHDEA
Subjt:  MWSLSSSFLSYPSPNFRVSVPAATLRSGTSLPVPLDRQHCTSDSPFAPEVVKAVDSLQYEFRAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHDEA

Query:  GGREALDNAFAEIVGAESAIVRSQFFSGTHAITCALFSLLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGVEYREVPLAEDGGLDWEKLACALK
        GGREALDNAFAEIVGAESAIVRSQFFSGTHAITCALF+LLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGVEYREVPLAEDGGLDWEKLA +LK
Subjt:  GGREALDNAFAEIVGAESAIVRSQFFSGTHAITCALFSLLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGVEYREVPLAEDGGLDWEKLACALK

Query:  PQTKCALIQRSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVETTEPPTVGADLIAGSLIKNPGGTLAPCGGYVAGRDKLVKAAAARLSAP
        PQTKCALIQRSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVET EPPTVGADLIAGSLIKNPGGTLAPCGGYVAGRDK VKAAAARLSAP
Subjt:  PQTKCALIQRSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVETTEPPTVGADLIAGSLIKNPGGTLAPCGGYVAGRDKLVKAAAARLSAP

Query:  GLGVDSGSTPGDIMRTFFQGLFLSPQMV------------GEAVKRSSPVASFTKPVPGITPGYASEVIFADGTFIDGSTSELSCDGPLREPFAVFCQSV
        GLGVDSGSTPGDIMRTFFQGLFLSPQM              EAV+RSSPVAS+TKPVPGITPGYASEVIFADGTFIDGSTSELSCDGPLREPFAVFCQ  
Subjt:  GLGVDSGSTPGDIMRTFFQGLFLSPQMV------------GEAVKRSSPVASFTKPVPGITPGYASEVIFADGTFIDGSTSELSCDGPLREPFAVFCQSV

Query:  RGLVGFALMYGTSSMQGLE-----RKPCLQN-------------------------LPGSVFSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIG
             + L+ G  S + ++     R   + N                         +  SVFSLWLL+QHLSNW+KPAEQKAIV+IILMAPLYAGISYIG
Subjt:  RGLVGFALMYGTSSMQGLE-----RKPCLQN-------------------------LPGSVFSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIG

Query:  LLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREIHHTFPMTLFQPHSARLNHHTLKLLKNWTYQFVVIRPVCSILMIAL
        LLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREIHH+FPMTLFQPHSARLNHHTLKLLK WTYQFVVIRPVCSILMI+L
Subjt:  LLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREIHHTFPMTLFQPHSARLNHHTLKLLKNWTYQFVVIRPVCSILMIAL

Query:  QLIDVYPSWLSWTFTIILNVSVSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEM
        QLIDVYP WLSWTFTIILNVSVSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEA+QNTLVCVEM
Subjt:  QLIDVYPSWLSWTFTIILNVSVSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEM

Query:  VFFAMIQMSAYSASPYKSKSAAKSK
        VFFAM+QMSAYSASPY+ +SAAKSK
Subjt:  VFFAMIQMSAYSASPYKSKSAAKSK

RXI03688.1 hypothetical protein DVH24_004340 [Malus domestica]6.7e-27967.54Show/hide
Query:  MWSLSSSFLSYPSPNFRVSVPAATLRSGTSLPVPLDRQHCTSDSPFAPEVVKAVDSLQYEFRAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHDEA
        MW+LS    +YP+   R SVP AT RS + L VP +      DSPF PEV  AVDSL  EFRAVDNLVARN+ +VLKAFQNAR+GSHHF G TGYGHDEA
Subjt:  MWSLSSSFLSYPSPNFRVSVPAATLRSGTSLPVPLDRQHCTSDSPFAPEVVKAVDSLQYEFRAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHDEA

Query:  GGREALDNAFAEIVGAESAIVRS----------------------QFFSGTHAITCALFSLLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGVE
        GGREALD AFAEIVGAESAIVRS                      QFFSGTHAITCALF+ LRPGDELLAVAG PYDTLEEVIGKRDS G+GSL DFGV+
Subjt:  GGREALDNAFAEIVGAESAIVRS----------------------QFFSGTHAITCALFSLLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGVE

Query:  YREVPLAEDGGLDWEKLACALKPQTKCALIQRSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVETTEPPTVGADLIAGSLIKNPGGTLAP
        YREVPLAEDGGL+W+ L  AL+P+TKCALIQRSCGYSWRRSLSVDEIG+AI++IK QNP+CLVMVDNCYGEFVE+ EPP VGADLIAGSLIKNPGGT+AP
Subjt:  YREVPLAEDGGLDWEKLACALKPQTKCALIQRSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVETTEPPTVGADLIAGSLIKNPGGTLAP

Query:  CGGYVAGRDKLVKAAAARLSAPGLGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVK---------------------------------------------
        CGGYVAGR+K VKAA+ARLSAPGLGVD G+TPGDIMR+FFQGLFLSPQMVGEA+K                                             
Subjt:  CGGYVAGRDKLVKAAAARLSAPGLGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVK---------------------------------------------

Query:  --RSSPVASFTKPVPGITPGYASE--------------VIFADGTFIDGSTSELSCDGPLREPFAVFCQSVRGLVGFALMYGTSSMQGLERKPCLQNLPG
          R+SPV SFTKPV G TPGYASE              VIFADGTFIDGSTSELSCDGPLREPFAVFCQ       + L+ G   M      P    L G
Subjt:  --RSSPVASFTKPVPGITPGYASE--------------VIFADGTFIDGSTSELSCDGPLREPFAVFCQSVRGLVGFALMYGTSSMQGLERKPCLQNLPG

Query:  SV--------FSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIK
        S         FSL LL++H   W KP EQKAIVIIILMAPLYA  S++GLL++  S   F  L+SIKECYEALVI+KFL+LLYSYLNISISKNIVPDEIK
Subjt:  SV--------FSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIK

Query:  GREIHHTFPMTLFQPHSARLNHHTLKLLKNWTYQFVVIRPVCSILMIALQLIDVYPSWLSWTFTIILNVSVSLALYSLVVFYHVFDKELKPHSPLAKFLC
        GREIHH+FPMTLF P + RLNHHTLKLLK WT+QFVVIRPVCSILMI LQL+ VYPSW+SWTFTIILN+SVSLALYSL+ FYHVF KEL PH PL KFLC
Subjt:  GREIHHTFPMTLFQPHSARLNHHTLKLLKNWTYQFVVIRPVCSILMIALQLIDVYPSWLSWTFTIILNVSVSLALYSLVVFYHVFDKELKPHSPLAKFLC

Query:  IKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYS
        IKGIVFFCFWQGIVL++LAA+ II++ H W DVEHI EALQN LVCVEMVFF+++Q  AY+
Subjt:  IKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYS

TrEMBL top hitse value%identityAlignment
A0A3Q7HH85 Uncharacterized protein4.7e-27868.38Show/hide
Query:  SYPSPNFRVSVPAATLRSGTSLPVPLDRQHCTSDSPFAPEVVKAVDSLQYEFRAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHDEAGGREALDNA
        +YP+   RV+   A +RS + + VP   Q   SDSPF PEV KAVDSL  EFR VDNLVARN+A+VL+AFQ  ++GSHHFGGSTGYGH+EAGGREALD A
Subjt:  SYPSPNFRVSVPAATLRSGTSLPVPLDRQHCTSDSPFAPEVVKAVDSLQYEFRAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHDEAGGREALDNA

Query:  FAEIVGAESAIVRSQFFSGTHAITCALFSLLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGVEYREVPLAEDGGLDWEKLACALKPQTKCALIQ
        FAEIVGAESAIVRSQFFSGTHAITCALF+ LRPGDELLA+AGAPYDTLEEVIGKRDS G GSLKDFGVEYREVPLAEDGGLDW+ L  +++P TKCALIQ
Subjt:  FAEIVGAESAIVRSQFFSGTHAITCALFSLLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGVEYREVPLAEDGGLDWEKLACALKPQTKCALIQ

Query:  RSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVETTEPPTVGADLIAGSLIKNPGGTLAPCGGYVAGRDKLVKAAAARLSAPGLGVDSGST
        RSCGYSWRRSLSV EIG+AI +IKMQNP C+VMVDNCYGEFV+  EPP VGADLIAGSLIKNPGGT+APCGGYVAGR K V+AAAARLSAPGLGVD GST
Subjt:  RSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVETTEPPTVGADLIAGSLIKNPGGTLAPCGGYVAGRDKLVKAAAARLSAPGLGVDSGST

Query:  PGDIMRTFFQGLFLSPQMVGEAVK-----------------------------------------------RSSPVASFTKPVPGITPGYASEVIFADGT
        PGDIMRT FQGLFLSPQMVGEA+K                                               RSSPV+SF +PV G T GYASEVIFADGT
Subjt:  PGDIMRTFFQGLFLSPQMVGEAVK-----------------------------------------------RSSPVASFTKPVPGITPGYASEVIFADGT

Query:  FIDGSTSELSCDGPLREPFAVFCQSVRGLVGFALMYG-----TSSMQGLERKPCLQNLPGSVFSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYI
        FIDGSTSELSCDGPLREPF+VFCQ       + L+ G     T     ++ K        +   + L+T+H ++WKKP EQKAI+II+LMAPLYA +S+I
Subjt:  FIDGSTSELSCDGPLREPFAVFCQSVRGLVGFALMYG-----TSSMQGLERKPCLQNLPGSVFSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYI

Query:  GLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREIHHTFPMTLFQPHSARLNHHTLKLLKNWTYQFVVIRPVCSILMIA
        GL++FM S  FF FLES+KECYEA+V++KFL L+Y+YLNISISKNIVPDEIKGR+IHH+FPMTLFQPH+A LNHHTLKLLKNWT+QFVVIRPVCSILMI 
Subjt:  GLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREIHHTFPMTLFQPHSARLNHHTLKLLKNWTYQFVVIRPVCSILMIA

Query:  LQLIDVYPSWLSWTFTIILNVSVSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVE
        LQ+  VYPSW+SWTFTIILN+SVSLALYSLVVFYHVF KEL PH PLAKFLC+KGIVFF FWQGI+L +L ++GIIK+ + W +VE + E +QN LV +E
Subjt:  LQLIDVYPSWLSWTFTIILNVSVSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVE

Query:  MVFFAMIQMSAYSASPYKSKS
        MVFFA++   AYSA+PY++++
Subjt:  MVFFAMIQMSAYSASPYKSKS

A0A498K8E2 Uncharacterized protein3.2e-27967.54Show/hide
Query:  MWSLSSSFLSYPSPNFRVSVPAATLRSGTSLPVPLDRQHCTSDSPFAPEVVKAVDSLQYEFRAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHDEA
        MW+LS    +YP+   R SVP AT RS + L VP +      DSPF PEV  AVDSL  EFRAVDNLVARN+ +VLKAFQNAR+GSHHF G TGYGHDEA
Subjt:  MWSLSSSFLSYPSPNFRVSVPAATLRSGTSLPVPLDRQHCTSDSPFAPEVVKAVDSLQYEFRAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHDEA

Query:  GGREALDNAFAEIVGAESAIVRS----------------------QFFSGTHAITCALFSLLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGVE
        GGREALD AFAEIVGAESAIVRS                      QFFSGTHAITCALF+ LRPGDELLAVAG PYDTLEEVIGKRDS G+GSL DFGV+
Subjt:  GGREALDNAFAEIVGAESAIVRS----------------------QFFSGTHAITCALFSLLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGVE

Query:  YREVPLAEDGGLDWEKLACALKPQTKCALIQRSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVETTEPPTVGADLIAGSLIKNPGGTLAP
        YREVPLAEDGGL+W+ L  AL+P+TKCALIQRSCGYSWRRSLSVDEIG+AI++IK QNP+CLVMVDNCYGEFVE+ EPP VGADLIAGSLIKNPGGT+AP
Subjt:  YREVPLAEDGGLDWEKLACALKPQTKCALIQRSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVETTEPPTVGADLIAGSLIKNPGGTLAP

Query:  CGGYVAGRDKLVKAAAARLSAPGLGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVK---------------------------------------------
        CGGYVAGR+K VKAA+ARLSAPGLGVD G+TPGDIMR+FFQGLFLSPQMVGEA+K                                             
Subjt:  CGGYVAGRDKLVKAAAARLSAPGLGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVK---------------------------------------------

Query:  --RSSPVASFTKPVPGITPGYASE--------------VIFADGTFIDGSTSELSCDGPLREPFAVFCQSVRGLVGFALMYGTSSMQGLERKPCLQNLPG
          R+SPV SFTKPV G TPGYASE              VIFADGTFIDGSTSELSCDGPLREPFAVFCQ       + L+ G   M      P    L G
Subjt:  --RSSPVASFTKPVPGITPGYASE--------------VIFADGTFIDGSTSELSCDGPLREPFAVFCQSVRGLVGFALMYGTSSMQGLERKPCLQNLPG

Query:  SV--------FSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIK
        S         FSL LL++H   W KP EQKAIVIIILMAPLYA  S++GLL++  S   F  L+SIKECYEALVI+KFL+LLYSYLNISISKNIVPDEIK
Subjt:  SV--------FSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIK

Query:  GREIHHTFPMTLFQPHSARLNHHTLKLLKNWTYQFVVIRPVCSILMIALQLIDVYPSWLSWTFTIILNVSVSLALYSLVVFYHVFDKELKPHSPLAKFLC
        GREIHH+FPMTLF P + RLNHHTLKLLK WT+QFVVIRPVCSILMI LQL+ VYPSW+SWTFTIILN+SVSLALYSL+ FYHVF KEL PH PL KFLC
Subjt:  GREIHHTFPMTLFQPHSARLNHHTLKLLKNWTYQFVVIRPVCSILMIALQLIDVYPSWLSWTFTIILNVSVSLALYSLVVFYHVFDKELKPHSPLAKFLC

Query:  IKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYS
        IKGIVFFCFWQGIVL++LAA+ II++ H W DVEHI EALQN LVCVEMVFF+++Q  AY+
Subjt:  IKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYS

A0A5J5A352 Uncharacterized protein9.1e-28265.12Show/hide
Query:  MWSLSSSFLSYPSPNFRVSVPAATLRSGTSLPVPLDRQHCTSDSPFAPEVVKAVDSLQYEFRAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHDEA
        M  LS +  +YP+   R S     +RS   + VP  R H   D+PFAPEV KAVDSL  EFRAVDNLVARN+++VL+A+QNAR+G HHFGG TGYGH+EA
Subjt:  MWSLSSSFLSYPSPNFRVSVPAATLRSGTSLPVPLDRQHCTSDSPFAPEVVKAVDSLQYEFRAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHDEA

Query:  GGREALDNAFAEIVGAESAIVRSQFFSGTHAITCALFSLLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGVEYREVPLAEDGGLDWEKLACALK
        GGREALD  FAEI GAESAIVRSQFFSGTHAITCALF+ LRPGDELLAVAGAPYDTLEEVIG RDS GLGSLKDFGV+YREVPLAEDGGLDW+ L  ALK
Subjt:  GGREALDNAFAEIVGAESAIVRSQFFSGTHAITCALFSLLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGVEYREVPLAEDGGLDWEKLACALK

Query:  PQTKCALIQRSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVETTEPPTVGADLIAGSLIKNPGGTLAPCGGYVAGRDKLVKAAAARLSAP
        PQTKCALIQRSCGYSWRRSLSV EIG+AI+++K QNPDCLVMVDNCYGEFVE  EPP VGADLIAGSLIKNPGGT+APCGGYVAG++K VKAAAARLSAP
Subjt:  PQTKCALIQRSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVETTEPPTVGADLIAGSLIKNPGGTLAPCGGYVAGRDKLVKAAAARLSAP

Query:  GLGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVK-----------------------------------------------RSSPVASFTKPVPGITPGYA
        GLG+D GSTPGDIMRTFFQGLFLSPQMVGEA+K                                               RSSPV SFTKPV G TPGYA
Subjt:  GLGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVK-----------------------------------------------RSSPVASFTKPVPGITPGYA

Query:  SEVIFADGTFIDGSTSELSCDGPLREPFAVFCQSVRGLVGFALMYG------------------------------------TSSMQGLERKPCL-----
        SEVIFADGTFIDGSTSELSCDGPLREPF+VFCQ       + L+ G                                    +  +  ++R  CL     
Subjt:  SEVIFADGTFIDGSTSELSCDGPLREPFAVFCQSVRGLVGFALMYG------------------------------------TSSMQGLERKPCL-----

Query:  ---------------QNLPGSV--------FSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLS
                         L GS         F++ LL+QH   WKKP EQKAI+IIILMAP+YA  S++GLL+F  S  FF FL+S+KECYEALV++KFL+
Subjt:  ---------------QNLPGSV--------FSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLS

Query:  LLYSYLNISISKNIVPDEIKGREIHHTFPMTLFQPHSARLNHHTLKLLKNWTYQFVVIRPVCSILMIALQLIDVYPSWLSWTFTIILNVSVSLALYSLVV
        L+Y+YLNISISKNIVPDEIKGREIHH+FPMTLFQP + RLNHHTLKLLK+WT+QFVVIRPVCSILMIALQL+ +YPSW+SWTFTIILN+SVSLALYSLVV
Subjt:  LLYSYLNISISKNIVPDEIKGREIHHTFPMTLFQPHSARLNHHTLKLLKNWTYQFVVIRPVCSILMIALQLIDVYPSWLSWTFTIILNVSVSLALYSLVV

Query:  FYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPYKSKSAAKSKVEKKDE
        FYHVF KEL+PH PLAKFLCIKGIVFFCFWQG+VLE+LAA+G+I++ H W DVE I EALQN LVCVEMVFF+  Q  AYSA+PY +  A  S  +KK+E
Subjt:  FYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPYKSKSAAKSKVEKKDE

A0A5N5GL16 Uncharacterized protein5.5e-26365.47Show/hide
Query:  MWSLSSSFLSYPSPNFRVSVPAATLRSGTSLPVPLDRQHCTSDSPFAPEVVKAVDSLQYEFRAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHDEA
        MW+LS +  +YP+ + R SVP AT RS + L VP +      DSPF PEV  AVDSL  EFRAVDNLVARN+ +VLKAFQNAR+GSHHF G TGYGHDEA
Subjt:  MWSLSSSFLSYPSPNFRVSVPAATLRSGTSLPVPLDRQHCTSDSPFAPEVVKAVDSLQYEFRAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHDEA

Query:  GGREALDNAFAEIVGAESAIVRSQFFSGTHAITCALFSLLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGVEYREVPLAEDGGLDWEKLACALK
        GGREALD AFAEIVGAESAIVRSQFFSGTHAITCALF+ LRPGDELLAVAG PYDTLEEVIGKRDS G+GSL DFGV+YREVPLAEDGGL+W+ L  AL+
Subjt:  GGREALDNAFAEIVGAESAIVRSQFFSGTHAITCALFSLLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGVEYREVPLAEDGGLDWEKLACALK

Query:  PQTKCALIQRSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVETTEPPTVGADLIAGSLIKNPGGTLAPCGGYVAGRDKLVKAAAARLSAP
        P+TKCALIQRSCGYSWRRSLSVDEIG+AI++IK QN +CLVMVDNCYGEFVE+ EPP VGADLIAGSLIKNPGGT+APCGGYVAGR+K VKAA+ARLSAP
Subjt:  PQTKCALIQRSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVETTEPPTVGADLIAGSLIKNPGGTLAPCGGYVAGRDKLVKAAAARLSAP

Query:  GLGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVK-----------------------------------------------RSSPVASFTKPVPGITPGYA
        GLGVD G+TPGDIMR FFQGLFLSPQMVGEA+K                                               R+SPV SFTKPV G TPGYA
Subjt:  GLGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVK-----------------------------------------------RSSPVASFTKPVPGITPGYA

Query:  SE--------------VIFADGTFIDGSTSELSCDGPLREPFAVFCQSVRGLVGFALMYGTSSMQGLERKPCLQNLPGSVFSLWLLTQHLSNWKKPAEQK
        SE              VIFADGTFIDGSTSELSCDGPLREPFAVFCQ                              GS ++ W                
Subjt:  SE--------------VIFADGTFIDGSTSELSCDGPLREPFAVFCQSVRGLVGFALMYGTSSMQGLERKPCLQNLPGSVFSLWLLTQHLSNWKKPAEQK

Query:  AIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREIHHTFPMTLFQPHSARLNHHTLKLLKN
                          GL+    S   F  L+SIKECYEALVI+KFL+LLYSYLNISISKNIVPDEIKGREIHH+FPMTLF P + RLNHHTLKLLK 
Subjt:  AIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREIHHTFPMTLFQPHSARLNHHTLKLLKN

Query:  WTYQFVVIRPVCSILMIALQLIDVYPSWLSWTFTIILNVSVSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAW
        WT+QFVVIRPVCSILMI LQL+ VYPSW+SWTFTIILN+SVSLALYSLV FYHVF KEL PH PL KFLCIKGIVFFCFWQGIVL++LAA+ II++ H W
Subjt:  WTYQFVVIRPVCSILMIALQLIDVYPSWLSWTFTIILNVSVSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAW

Query:  FDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPYK-SKSAAKSKVEKKD
         DVEHI EALQN LVCVEMVFF+++Q  AYSA PY+ ++ + K+   KKD
Subjt:  FDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPYK-SKSAAKSKVEKKD

A0A6S7PJH8 Uncharacterized protein2.8e-27567.11Show/hide
Query:  PSPNFRV----SVPAATLRSGTSLPVPLDRQHCTSDSPFAPEVVKAVDSLQYEFRAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHDEAGGREALD
        P+ NFRV    S  AA L+S + + +    QH  S   F PEV  AVD+L  EFRAVDNLVA+NS++VLKAFQNAR+GSHHF G TGYGH+EAGGREALD
Subjt:  PSPNFRV----SVPAATLRSGTSLPVPLDRQHCTSDSPFAPEVVKAVDSLQYEFRAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHDEAGGREALD

Query:  NAFAEIVGAESAIVRSQFFSGTHAITCALFSLLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGVEYREVPLAEDGGLDWEKLACALKPQTKCAL
         AFAEI GAESAIVRSQFFSGTHAITCALF+ LRPGDELLAVAGAPYDTLEEVIG RD  GLGSLKDFG+ YREV LA+DGGLDW+ L  ALKP+TKCAL
Subjt:  NAFAEIVGAESAIVRSQFFSGTHAITCALFSLLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGVEYREVPLAEDGGLDWEKLACALKPQTKCAL

Query:  IQRSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVETTEPPTVGADLIAGSLIKNPGGTLAPCGGYVAGRDKLVKAAAARLSAPGLGVDSG
        IQRSCGYSWR+SLSV+EI +AI +IK QNP+CLVMVDNCYGEF ET EPP VGADLIAGSLIKNPGGT+APCGGYVAG++K VKAAAARLSAPGLGVD G
Subjt:  IQRSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVETTEPPTVGADLIAGSLIKNPGGTLAPCGGYVAGRDKLVKAAAARLSAPGLGVDSG

Query:  STPGDIMRTFFQGLFLSPQMVG-----------------------------------------------EAVKRSSPVASFTKPVPGITPGYASEVIFAD
        STPGDIMR FFQGL+LSPQMVG                                               EAV+RSSPV+S+TKP+ G+T GYASEVIFAD
Subjt:  STPGDIMRTFFQGLFLSPQMVG-----------------------------------------------EAVKRSSPVASFTKPVPGITPGYASEVIFAD

Query:  GTFIDGSTSELSCDGPLREPFAVFCQS---------------VRGLVGFALMYGTSSMQGLERKPCLQNLPGS--------VFSLWLLTQHLSNWKKPAE
        GTFIDGSTSELSCDGPLREPF VFCQ                VR L+      G   ++ ++     + L GS        + +L L+  HLS+WKKP E
Subjt:  GTFIDGSTSELSCDGPLREPFAVFCQS---------------VRGLVGFALMYGTSSMQGLERKPCLQNLPGS--------VFSLWLLTQHLSNWKKPAE

Query:  QKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREIHHTFPMTLFQPHSARLNHHTLKLL
        QKAI++IILMAP+YA  SY+GLL+   S TFF+ L+SIKECYEALV++KFL+LLY+YLNISISKNIVPDEIKGREIHH+FPMTLFQPHS RLNH  LKLL
Subjt:  QKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREIHHTFPMTLFQPHSARLNHHTLKLL

Query:  KNWTYQFVVIRPVCSILMIALQLIDVYPSWLSWTFTIILNVSVSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEH
        K WT+QFVVIRPVCS+LMI LQL+++YP WLSWTFT+ILNVSVSLALY+LV+FYHVF KEL PH PLAKFLC+KGIVFFCFWQGIVL  L A+GIIK+ H
Subjt:  KNWTYQFVVIRPVCSILMIALQLIDVYPSWLSWTFTIILNVSVSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEH

Query:  AWFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPYKSKSAAKSKVE
         W DV HI +ALQN LV VEMVFFAM QM AY+A+PYK+    + K E
Subjt:  AWFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPYKSKSAAKSKVE

SwissProt top hitse value%identityAlignment
P45624 Uncharacterized 33.9 kDa protein in glnA 5'region3.8e-5139.72Show/hide
Query:  PYDTLEEVIGKRDSQGLGSLKDFGVEYREVPLAEDGGLDWEKLACALK-PQTKCALIQRSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFV
        PYDT+++VIG    +  G+L   G+ +  VPL E+GG+D+E+    LK  Q    +IQRS GY  R+S +VD+I K    +K  +P+ LV VDNCYGEF 
Subjt:  PYDTLEEVIGKRDSQGLGSLKDFGVEYREVPLAEDGGLDWEKLACALK-PQTKCALIQRSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFV

Query:  ETTEPPTVGADLIAGSLIKNPGGTLAPCGGYVAGRDKLVKAAAARLSAPGLGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVK------------------
        E  EP   G D  AGSLIKN GG +A  GGY+ G+++LV+ AA RL+APG+G + G+T  + M  F++G FL+P   GEA+K                  
Subjt:  ETTEPPTVGADLIAGSLIKNPGGTLAPCGGYVAGRDKLVKAAAARLSAPGLGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVK------------------

Query:  -----------------------------RSSPVASFTKPVPGITPGYASEVIFADGTFIDGSTSELSCDGPLREPFAVFCQ
                                     ++SP+ SF +P+P   PGY  +VI A G F+ GST E S DGP+R P+A++ Q
Subjt:  -----------------------------RSSPVASFTKPVPGITPGYASEVIFADGTFIDGSTSELSCDGPLREPFAVFCQ

P94479 Uncharacterized protein YnbB1.8e-7740.59Show/hide
Query:  RAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHDEAGGREALDNAFAEIVGAESAIVRSQFFSGTHAITCALFSLLRPGDELLAVAGAPYDTLEEVI
        + ++ +  RN  +VL++++  ++   HF  STGYG+D+  GR+ L++ +A++ G E+ +VR Q  SGTHAI+ ALF +LRPGDELL + G PYDTLEE++
Subjt:  RAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHDEAGGREALDNAFAEIVGAESAIVRSQFFSGTHAITCALFSLLRPGDELLAVAGAPYDTLEEVI

Query:  GKRDSQGLGSLKDFGVEYREVPLAEDGGLDWEKLACALKPQTKCALIQRSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVETTEPPTVGA
        G R  +  GSLKDF + Y  V L +DG +D++ +A A+ P+TK   IQRS GY+ R S  + EI + IR +K  N + +V VDNCYGEFVE  EP  VGA
Subjt:  GKRDSQGLGSLKDFGVEYREVPLAEDGGLDWEKLACALKPQTKCALIQRSCGYSWRRSLSVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVETTEPPTVGA

Query:  DLIAGSLIKNPGGTLAPCGGYVAGRDKLVKAAAARLSAPGLGVDSGSTPGDIMRTFFQGLFLSPQMVG--------------------------------
        DL+AGSLIKNPGG LA  GGY+ G+ K ++A + R+++PG+G ++G++    ++  +QG FL+P +V                                 
Subjt:  DLIAGSLIKNPGGTLAPCGGYVAGRDKLVKAAAARLSAPGLGVDSGSTPGDIMRTFFQGLFLSPQMVG--------------------------------

Query:  ---------------EAVKRSSPVASFTKPVPGITPGYASEVIFADGTFIDGSTSELSCDGPLREPFAVFCQ
                       +A++ +SP+ +   P P   PGY  +VI A GTFI G++ ELS DGP+R P+  + Q
Subjt:  ---------------EAVKRSSPVASFTKPVPGITPGYASEVIFADGTFIDGSTSELSCDGPLREPFAVFCQ

Q17QL9 Transmembrane protein 184C8.8e-3232.82Show/hide
Query:  SLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREIHHTFPMT
        SLW++ QHL ++ +P  QK I+ I+ M P+Y+  S+I L       +  +++++ +ECYEA VI  F+  L +YL       ++  E K ++ H   P+ 
Subjt:  SLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREIHHTFPMT

Query:  LFQPHSARLNHHTLKLLKNWTYQFVVIRPVCSILMIALQLIDVYPSW-----LSWTFTIIL-NVSVSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIV
           P +  +    L   K    Q+ V+RP  +I+ +  +L+D+Y         +WT+ +I+ N+S   A+Y L++FY V  +EL P  P+ KFLC+K +V
Subjt:  LFQPHSARLNHHTLKLLKNWTYQFVVIRPVCSILMIALQLIDVYPSW-----LSWTFTIIL-NVSVSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIV

Query:  FFCFWQGIVLEMLAAVGIIKAEHA--WFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPY
        F  FWQ +V+ +L  VG+I  +H   W  VE +   LQ+ ++C+EM   A+     +S  PY
Subjt:  FFCFWQGIVLEMLAAVGIIKAEHA--WFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPY

Q3TPR7 Transmembrane protein 184C3.3e-3132.44Show/hide
Query:  SLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREIHHTFPMT
        SLW + QHL ++ +P  QK I+ I+ M P+Y+  S++ L+         +++++ +ECYEA VI  F+  L +YL I     I+  E K ++ +H  P+ 
Subjt:  SLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREIHHTFPMT

Query:  LFQPHSARLNHHTLKLLKNWTYQFVVIRPVCSILMIALQLIDVYPSW-----LSWTFTIIL-NVSVSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIV
           P +  +    L   K    Q+ V+RP+ ++  +  +++DVY         +WT+ +IL N+S   A+Y L++FY V  +EL P  P+ KFLC+K +V
Subjt:  LFQPHSARLNHHTLKLLKNWTYQFVVIRPVCSILMIALQLIDVYPSW-----LSWTFTIIL-NVSVSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIV

Query:  FFCFWQGIVLEMLAAVGII--KAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPY
        F  FWQ +++ +L  +G+I  K    W   E +   LQ+ ++C+EM F A+     +S  PY
Subjt:  FFCFWQGIVLEMLAAVGII--KAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPY

Q54WM0 Transmembrane protein 184 homolog DDB_G02795554.4e-3134.9Show/hide
Query:  VFSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREIHHTFP
        + S +L+ +HL N+  P  QK IV I++M P+Y+  S++  L F+  S +F   + +++ YEA V+  F SL+ +Y+      ++V        + H FP
Subjt:  VFSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREIHHTFP

Query:  MTLFQPHSARLNHHTLKLLKNWTYQFVVIRPVCSILMIALQLIDVYPS----------WLSWTFTIILNVSVSLALYSLVVFYHVFDKELKPHSPLAKFL
        +T       +L+   L   K +  QFV I+P+ +I+ + L+    Y            WL    T++ N+SV L+LY LV++Y   ++ELKP  PL KFL
Subjt:  MTLFQPHSARLNHHTLKLLKNWTYQFVVIRPVCSILMIALQLIDVYPS----------WLSWTFTIILNVSVSLALYSLVVFYHVFDKELKPHSPLAKFL

Query:  CIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAM
        CIK I+FF FWQ I +  L   G+I    +W  V++I+ ALQ+ + CVEMV  A+
Subjt:  CIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAM

Arabidopsis top hitse value%identityAlignment
AT1G11200.1 Protein of unknown function (DUF300)1.2e-9762.45Show/hide
Query:  FSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREIHHTFPM
        F++ L++QHL  WKKP EQ+AI+II+LMAP+YA  S++GLL+   S  FF+FL+++KECYEALVI+KFL+L+YSY+NIS+S  I+PDE KGREIHH+FPM
Subjt:  FSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREIHHTFPM

Query:  TLFQPHSARLNHHTLKLLKNWTYQFVVIRPVCSILMIALQLIDVYPSWLSWTFTIILNVSVSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFW
        TLF P +  L++ TLK LK WT+QF +IRPVCSILMI LQ++ +YP WLSW FT ILNVSVSLALYSLV FYHVF KEL+PH PL KF+C+KGIVFFCFW
Subjt:  TLFQPHSARLNHHTLKLLKNWTYQFVVIRPVCSILMIALQLIDVYPSWLSWTFTIILNVSVSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFW

Query:  QGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPYKSKSAAKSKVEKKD
        QGIVL++L  +G+IK+ H W +V+ + EALQN LVC+EM+ F++IQ  A+  +PY  ++ AK +  K+D
Subjt:  QGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPYKSKSAAKSKVEKKD

AT1G23070.1 Protein of unknown function (DUF300)3.4e-2331.14Show/hide
Query:  SLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYS----------YLNISISKNIVPD---E
        SL+ + QHL  +  PAEQK IV ++ M P+YA  S I L    ++S F L  + ++ CYEA  +  F S L +          YL     K ++ +   E
Subjt:  SLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYS----------YLNISISKNIVPD---E

Query:  IKGREIHHTFPMTLFQPHSARLNHHTLKLLKNWTYQFVVIRPVCSILMIALQLIDVY-PSWLSWTF-----TIILNVSVSLALYSLVVFYHVFDKELKPH
         K ++  ++F   L  P+   L      + K    Q+++++  C+ L   L+L+ VY      W +      ++LN S   AL+ LV FY+V  + LK  
Subjt:  IKGREIHHTFPMTLFQPHSARLNHHTLKLLKNWTYQFVVIRPVCSILMIALQLIDVY-PSWLSWTF-----TIILNVSVSLALYSLVVFYHVFDKELKPH

Query:  SPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPY
         PLAKF+  K IVF  +WQG  + +L   GI+  E            LQ+ L+C+EM   A+  +  + A PY
Subjt:  SPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPY

AT1G77220.1 Protein of unknown function (DUF300)2.3e-2729.56Show/hide
Query:  LWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLN--------------ISISKNIVPDE
        ++L+ +HL+++ +P EQK ++ +ILM P+YA  S++ L+   A+       E I++CYEA  +  F   L + L+              I+ S  ++   
Subjt:  LWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLN--------------ISISKNIVPDE

Query:  IKGREIHHTFPMTLFQPHSARLNHHTLKLLKNWTYQFVVIRPVCSILMIALQLIDVY-PSWLSWTF-----TIILNVSVSLALYSLVVFYHVFDKELKPH
             + H FPM  F    + L       +K    Q+++++ +C++L + L+   VY     +W +      ++LN S + ALY LV FY+V   +L P 
Subjt:  IKGREIHHTFPMTLFQPHSARLNHHTLKLLKNWTYQFVVIRPVCSILMIALQLIDVY-PSWLSWTF-----TIILNVSVSLALYSLVVFYHVFDKELKPH

Query:  SPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPYK
         PLAKFL  K IVF  +WQGI++  L ++G++K   A    + +   +Q+ ++C+EM   A++ +  + A+PYK
Subjt:  SPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPYK

AT4G21570.1 Protein of unknown function (DUF300)5.9e-10872.22Show/hide
Query:  FSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREIHHTFPM
        F++ L++QHL +WK P EQKAI+II+LMAP+YA +S+IGLLE   S TFFLFLESIKECYEALVI+KFL+L+YSYLNIS+SKNI+PD IKGREIHH+FPM
Subjt:  FSLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREIHHTFPM

Query:  TLFQPHSARLNHHTLKLLKNWTYQFVVIRPVCSILMIALQLIDVYPSWLSWTFTIILNVSVSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFW
        TLFQPH  RL+ HTLKLLK WT+QFVVIRPVCS LMIALQLI  YPSWLSWTFTII+N SVSLALYSLV+FYHVF KEL PH+PLAKFLCIKGIVFF FW
Subjt:  TLFQPHSARLNHHTLKLLKNWTYQFVVIRPVCSILMIALQLIDVYPSWLSWTFTIILNVSVSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFW

Query:  QGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPYKSKSAAKSKVEKKDE
        QGI L++L A+G IK+ H W +VE I EA+QN LVC+EMV FA +Q  AY A PY  ++  K K++KK E
Subjt:  QGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPYKSKSAAKSKVEKKDE

AT5G26740.1 Protein of unknown function (DUF300)7.6e-2330.45Show/hide
Query:  SLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREIHHTFPMT
        +++ + +HL N+ +P  Q+ IV II M P+YA +S++ L+   +S    ++ +SI+E YEA VI  FLSL  +++        V   + GR +  ++ + 
Subjt:  SLWLLTQHLSNWKKPAEQKAIVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREIHHTFPMT

Query:  LFQPHSARLNHHTLKLLKNWTYQFVVIRPVCSILMIALQLIDVY------PSWLSWTFTIILNVSVSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIV
                L+   ++  K    QFV+++P+   + + L     Y      P       TII  +S ++ALY+LV+FY      L+P +P+ KF+ IK +V
Subjt:  LFQPHSARLNHHTLKLLKNWTYQFVVIRPVCSILMIALQLIDVY------PSWLSWTFTIILNVSVSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIV

Query:  FFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPYKSKSAA
        F  +WQG+++ + A  G IK+  A     H     QN ++CVEM+  A     A+   PYK  + A
Subjt:  FFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFAMIQMSAYSASPYKSKSAA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGGAGCTTATCCTCCTCTTTCCTTTCTTATCCGTCGCCTAATTTCAGAGTTTCCGTTCCGGCGGCCACCCTTCGCTCCGGCACTTCTTTACCGGTGCCCCTCGATCG
GCAACATTGCACTTCCGACTCCCCATTTGCTCCGGAGGTTGTTAAGGCGGTAGACTCCTTGCAGTATGAATTCAGGGCAGTGGATAATTTGGTGGCTCGTAACTCTGCTA
AAGTTCTCAAAGCCTTTCAGAATGCTCGGTTAGGATCTCATCATTTTGGAGGATCCACTGGTTATGGTCATGATGAAGCTGGAGGTCGTGAGGCACTTGACAATGCTTTT
GCTGAGATAGTTGGAGCAGAATCCGCAATAGTCCGATCACAGTTTTTCTCAGGTACTCATGCTATTACGTGTGCTTTATTTTCACTTTTGAGGCCGGGGGATGAGCTTTT
GGCAGTAGCTGGTGCTCCATACGACACACTAGAAGAGGTCATTGGGAAAAGAGATTCTCAGGGGCTGGGTTCTTTGAAAGATTTTGGAGTAGAGTATCGAGAAGTTCCAC
TTGCTGAGGACGGTGGACTTGACTGGGAAAAACTTGCATGTGCTTTGAAACCTCAAACAAAATGTGCGCTCATACAAAGGTCATGTGGTTATTCTTGGCGGCGAAGTTTA
AGTGTTGACGAAATAGGAAAAGCAATAAGACTGATCAAGATGCAAAACCCTGATTGCTTGGTGATGGTGGATAACTGTTATGGAGAATTTGTGGAAACCACTGAACCTCC
AACAGTGGGCGCGGATTTAATTGCAGGAAGTTTGATAAAAAATCCTGGTGGAACGCTTGCACCTTGTGGCGGATATGTTGCAGGTCGTGACAAATTGGTGAAGGCGGCTG
CAGCTCGTTTATCTGCACCTGGCTTGGGGGTGGATTCAGGCTCTACCCCTGGTGATATCATGAGGACGTTTTTTCAAGGACTATTCCTTTCACCTCAAATGGTTGGTGAG
GCAGTTAAGAGAAGCTCCCCTGTTGCTTCGTTTACTAAACCAGTTCCAGGAATAACTCCTGGCTATGCATCAGAGGTGATCTTTGCTGATGGAACATTCATTGATGGGAG
CACTAGTGAGCTTTCTTGCGACGGACCTCTGAGAGAGCCATTTGCCGTCTTTTGCCAGAGTGTACGAGGCTTAGTTGGCTTCGCTTTAATGTATGGTACCTCTAGTATGC
AAGGGTTGGAGCGGAAACCTTGCCTACAAAATTTACCGGGATCAGTATTCTCATTATGGCTCCTTACCCAACATCTGTCCAACTGGAAAAAACCAGCGGAACAAAAGGCC
ATTGTTATTATAATTCTTATGGCTCCTTTATATGCCGGTATCTCCTACATTGGTCTGTTGGAATTTATGGCAAGCAGTACTTTCTTTTTGTTTTTGGAATCAATTAAGGA
ATGTTATGAGGCTTTGGTGATATCTAAGTTCTTGAGTTTACTCTACAGCTACTTGAATATATCCATAAGCAAAAACATTGTGCCAGATGAGATCAAAGGTAGAGAAATCC
ACCATACTTTTCCAATGACCCTCTTTCAGCCTCACAGTGCTCGATTGAATCATCACACGTTGAAGCTTCTCAAGAACTGGACCTATCAATTTGTGGTGATTCGTCCTGTA
TGTTCCATCTTGATGATTGCTCTTCAACTAATTGATGTATATCCAAGCTGGCTCAGTTGGACATTTACTATCATATTAAATGTCTCAGTGTCACTTGCACTGTATTCCCT
GGTGGTTTTCTATCATGTATTTGATAAGGAGTTGAAACCACATAGCCCTCTTGCGAAGTTCTTGTGCATCAAAGGGATTGTCTTTTTCTGCTTCTGGCAGGGAATCGTTC
TTGAGATGCTTGCTGCAGTGGGCATAATCAAAGCAGAACATGCCTGGTTTGATGTTGAACACATAAATGAAGCCTTACAAAATACTCTAGTTTGTGTGGAGATGGTTTTC
TTTGCGATGATTCAGATGTCTGCATACAGTGCTAGCCCCTACAAATCTAAATCTGCAGCAAAATCTAAAGTGGAGAAGAAGGATGAGTGA
mRNA sequenceShow/hide mRNA sequence
GACGAAGTTATAATTTTTCAGAAGAAGATGTGGAGCTTATCCTCCTCTTTCCTTTCTTATCCGTCGCCTAATTTCAGAGTTTCCGTTCCGGCGGCCACCCTTCGCTCCGG
CACTTCTTTACCGGTGCCCCTCGATCGGCAACATTGCACTTCCGACTCCCCATTTGCTCCGGAGGTTGTTAAGGCGGTAGACTCCTTGCAGTATGAATTCAGGGCAGTGG
ATAATTTGGTGGCTCGTAACTCTGCTAAAGTTCTCAAAGCCTTTCAGAATGCTCGGTTAGGATCTCATCATTTTGGAGGATCCACTGGTTATGGTCATGATGAAGCTGGA
GGTCGTGAGGCACTTGACAATGCTTTTGCTGAGATAGTTGGAGCAGAATCCGCAATAGTCCGATCACAGTTTTTCTCAGGTACTCATGCTATTACGTGTGCTTTATTTTC
ACTTTTGAGGCCGGGGGATGAGCTTTTGGCAGTAGCTGGTGCTCCATACGACACACTAGAAGAGGTCATTGGGAAAAGAGATTCTCAGGGGCTGGGTTCTTTGAAAGATT
TTGGAGTAGAGTATCGAGAAGTTCCACTTGCTGAGGACGGTGGACTTGACTGGGAAAAACTTGCATGTGCTTTGAAACCTCAAACAAAATGTGCGCTCATACAAAGGTCA
TGTGGTTATTCTTGGCGGCGAAGTTTAAGTGTTGACGAAATAGGAAAAGCAATAAGACTGATCAAGATGCAAAACCCTGATTGCTTGGTGATGGTGGATAACTGTTATGG
AGAATTTGTGGAAACCACTGAACCTCCAACAGTGGGCGCGGATTTAATTGCAGGAAGTTTGATAAAAAATCCTGGTGGAACGCTTGCACCTTGTGGCGGATATGTTGCAG
GTCGTGACAAATTGGTGAAGGCGGCTGCAGCTCGTTTATCTGCACCTGGCTTGGGGGTGGATTCAGGCTCTACCCCTGGTGATATCATGAGGACGTTTTTTCAAGGACTA
TTCCTTTCACCTCAAATGGTTGGTGAGGCAGTTAAGAGAAGCTCCCCTGTTGCTTCGTTTACTAAACCAGTTCCAGGAATAACTCCTGGCTATGCATCAGAGGTGATCTT
TGCTGATGGAACATTCATTGATGGGAGCACTAGTGAGCTTTCTTGCGACGGACCTCTGAGAGAGCCATTTGCCGTCTTTTGCCAGAGTGTACGAGGCTTAGTTGGCTTCG
CTTTAATGTATGGTACCTCTAGTATGCAAGGGTTGGAGCGGAAACCTTGCCTACAAAATTTACCGGGATCAGTATTCTCATTATGGCTCCTTACCCAACATCTGTCCAAC
TGGAAAAAACCAGCGGAACAAAAGGCCATTGTTATTATAATTCTTATGGCTCCTTTATATGCCGGTATCTCCTACATTGGTCTGTTGGAATTTATGGCAAGCAGTACTTT
CTTTTTGTTTTTGGAATCAATTAAGGAATGTTATGAGGCTTTGGTGATATCTAAGTTCTTGAGTTTACTCTACAGCTACTTGAATATATCCATAAGCAAAAACATTGTGC
CAGATGAGATCAAAGGTAGAGAAATCCACCATACTTTTCCAATGACCCTCTTTCAGCCTCACAGTGCTCGATTGAATCATCACACGTTGAAGCTTCTCAAGAACTGGACC
TATCAATTTGTGGTGATTCGTCCTGTATGTTCCATCTTGATGATTGCTCTTCAACTAATTGATGTATATCCAAGCTGGCTCAGTTGGACATTTACTATCATATTAAATGT
CTCAGTGTCACTTGCACTGTATTCCCTGGTGGTTTTCTATCATGTATTTGATAAGGAGTTGAAACCACATAGCCCTCTTGCGAAGTTCTTGTGCATCAAAGGGATTGTCT
TTTTCTGCTTCTGGCAGGGAATCGTTCTTGAGATGCTTGCTGCAGTGGGCATAATCAAAGCAGAACATGCCTGGTTTGATGTTGAACACATAAATGAAGCCTTACAAAAT
ACTCTAGTTTGTGTGGAGATGGTTTTCTTTGCGATGATTCAGATGTCTGCATACAGTGCTAGCCCCTACAAATCTAAATCTGCAGCAAAATCTAAAGTGGAGAAGAAGGA
TGAGTGACAATGTCACTTGGTCTGGTAACACGCATCACTTCAGATGAGTATTAAACAATAAATCAGCCATAATGGTAGGTAGGTTTCAAACTTTCATGGATACTTTCTGC
TCCTCAGCGTCTATTATGCGTGAAATATTGAGTTTTATATTTGGGAACTCTATTTTGGCTACTACCACATCGCAGGCGATATTGTTGTGCATATCCTTTGTCAAAGTCAA
GCTCTCTGGTATACAAACTTCCTTCACTTCAAGCCCCTATTCTCTTCTTTCTGTGTCAATTTGCATCACATCGTGTGGCTGATCTATGTTTGTTTGCCGCTTGCCTGTTT
TGTGTTTCCGTGTCTCAATATATTATAGCATAAATGGACCTGCCTGTCTTGTTCATGATGCTAATTAGGCGGAAAAGAAAAATAATAATAAAAGGTAACAATAGAATTGG
GGATATCTTTCAGTCTTCTATGAATATAAACCTTCTGTTCGGTATTACTTTGCTTCCCTTTTTAATCCAAAAAGAAGATTAGATAGAAATATACCTAGAATATCCCGAAG
TTGATCGTACAAACTCTATGCTAATTAACCCACGCTCATTTTGAAAGGATACATATATGTATTAATGCCACAGTTCATGATTGCTATCTCTACTTTATAAAGGAGGTTAA
CAATACCTTATCTTGTTCCGTTATGCTTAGATTGGTATTCTGATGTGGAATATTTTAGGATGAGTGCCAAAAAACTTTTACTAAAAATTAGTGTAGGGATATGAGTTTCC
TGAATACTTGTGCCAACTAAATGCAGAAACTAGCATTTTCGTATTAAAAAGAAATCCTAGTAACATATATTCTCCCATTGACCTATAATTCAAGAACCCTCCCATTTTAC
CAATAAACACTGATAAATGACCAATGCAGGCTTCAAACAATAGTATTCAAGTCAAACATGGAAAATTTTGGTAGTTAAAAATAGAACGCATGATGTTCAATGGTCGAGAC
CTCCTACTATTACACCAAACACATAATTAAAGTAATGTAAGTGAAACGATCATAGATAGGAGATGAGGTGAAAGGAACCAAGAATATAAGATAATCACACTGAAAGTTTG
TTTCGTAACTTAAACATTTAAGGATATTGATACAAAATCGTATATCAGACCGGGGCGACCAACAAAATCCTAGGAACCCTGCATTCTATAATTTTACTTGTATAGAGAAG
TTATACTGTACCAACCATTAACAGGTTACTACTGTATCACAGAGAAAGCCCAGCAAACAGCAACTCCAGTTCCTCTTCCTGAGATACAGTTCTCTGACAAAAGAAAATGA
GCGAACAAACACAAAATATATCACAGATATCAGATACCAATGTTCATCAAATTCCACAATACTTGTAACTTTGCAAGAAATTTAAGCAACCATGAGAAAGTTTATGACTC
GAATACAGAACAGAACACGTTGAAATTATGGTGTATATACTCTTTGGAACAAAACTAAGATCGTTTTGGTTATGATGTTAT
Protein sequenceShow/hide protein sequence
MWSLSSSFLSYPSPNFRVSVPAATLRSGTSLPVPLDRQHCTSDSPFAPEVVKAVDSLQYEFRAVDNLVARNSAKVLKAFQNARLGSHHFGGSTGYGHDEAGGREALDNAF
AEIVGAESAIVRSQFFSGTHAITCALFSLLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGVEYREVPLAEDGGLDWEKLACALKPQTKCALIQRSCGYSWRRSL
SVDEIGKAIRLIKMQNPDCLVMVDNCYGEFVETTEPPTVGADLIAGSLIKNPGGTLAPCGGYVAGRDKLVKAAAARLSAPGLGVDSGSTPGDIMRTFFQGLFLSPQMVGE
AVKRSSPVASFTKPVPGITPGYASEVIFADGTFIDGSTSELSCDGPLREPFAVFCQSVRGLVGFALMYGTSSMQGLERKPCLQNLPGSVFSLWLLTQHLSNWKKPAEQKA
IVIIILMAPLYAGISYIGLLEFMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREIHHTFPMTLFQPHSARLNHHTLKLLKNWTYQFVVIRPV
CSILMIALQLIDVYPSWLSWTFTIILNVSVSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVF
FAMIQMSAYSASPYKSKSAAKSKVEKKDE