; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0037943 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0037943
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionProtein of unknown function (DUF300)
Genome locationchr2:10856868..10869003
RNA-Seq ExpressionLag0037943
SyntenyLag0037943
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR005178 - Organic solute transporter subunit alpha/Transmembrane protein 184
IPR009651 - Putative methionine gamma-lyase
IPR015421 - Pyridoxal phosphate-dependent transferase, major domain
IPR015424 - Pyridoxal phosphate-dependent transferase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAB4107049.1 unnamed protein product [Lactuca saligna]0.0e+0073.14Show/hide
Query:  PSPNF----GASPPAATLHSGASLPVPVDRKHYTSDSPFAPEVVKAVDSLQYEFRAVDNLVARNTAKVLKAFQNARLGSHHFGGSTGYGHDEAGGREALD
        P+ NF     +S  AA L S + + +    +H  S   F PEV  AVD+L  EFRAVDNLVA+N+++VLKAFQNAR+GSHHF G TGYGH+EAGGREALD
Subjt:  PSPNF----GASPPAATLHSGASLPVPVDRKHYTSDSPFAPEVVKAVDSLQYEFRAVDNLVARNTAKVLKAFQNARLGSHHFGGSTGYGHDEAGGREALD

Query:  NAFAEIVGAESAIVRSQFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGIEYREVPLAEDGGLDWEKLASSLKPQTKCAL
         AFAEI GAESAIVRSQFFSGTHAITCALFA LRPGDELLAVAGAPYDTLEEVIG RD  GLGSLKDFGI YREV LA+DGGLDW+ L  +LKP+TKCAL
Subjt:  NAFAEIVGAESAIVRSQFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGIEYREVPLAEDGGLDWEKLASSLKPQTKCAL

Query:  IQRSCGYSWRRSLSVDEIGRAIKLIKMQNPDCLVMVDNCYGEFVETTEPPTVGADLIAGSLIKNPGGTLAPCGGYVAGRDKWVRAAAARLSAPGLGVDSG
        IQRSCGYSWR+SLSV+EI RAI +IK QNP+CLVMVDNCYGEF ET EPP VGADLIAGSLIKNPGGT+APCGGYVAG++KWV+AAAARLSAPGLGVD G
Subjt:  IQRSCGYSWRRSLSVDEIGRAIKLIKMQNPDCLVMVDNCYGEFVETTEPPTVGADLIAGSLIKNPGGTLAPCGGYVAGRDKWVRAAAARLSAPGLGVDSG

Query:  STPGDIMRTFFQGLFLSPQMVGEAVKGMILIAEVMASKGYKVQPLPRAPRHDTVQAVQLESRELLLAFCEAVQRSSPVASFTKPVPGITPGYASEVIFAD
        STPGDIMR FFQGL+LSPQMVGE++KG +LIAEVM++KGYKVQPLPR PRHD VQAVQL SRE LLAFCEAVQRSSPV+S+TKP+ G+T GYASEVIFAD
Subjt:  STPGDIMRTFFQGLFLSPQMVGEAVKGMILIAEVMASKGYKVQPLPRAPRHDTVQAVQLESRELLLAFCEAVQRSSPVASFTKPVPGITPGYASEVIFAD

Query:  GTFIDGSTSELSCDGPLREPFAVFCQGGTHWTQWG---SHFWVASVVS---------ILAITMHFGQMIFLGVTSSVVLTAILSSWLLVQHLSNWKKPTE
        GTFIDGSTSELSCDGPLREPF VFCQGGTHWTQWG     F+V  +++         +  + M   Q   +G  + V +T +L+  L+  HLS+WKKP E
Subjt:  GTFIDGSTSELSCDGPLREPFAVFCQGGTHWTQWG---SHFWVASVVS---------ILAITMHFGQMIFLGVTSSVVLTAILSSWLLVQHLSNWKKPTE

Query:  QKAIVIIILMAPLYAGISYIGLLEAMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREIHHTFPMTLFQPHSARLNHHTLKLL
        QKAI++IILMAP+YA  SY+GLL+   S TFF+ L+SIKECYEALV++KFL+LLY+YLNISISKNIVPDEIKGREIHH+FPMTLFQPHS RLNH  LKLL
Subjt:  QKAIVIIILMAPLYAGISYIGLLEAMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREIHHTFPMTLFQPHSARLNHHTLKLL

Query:  KYWTYQFVVIRPVCSILMITFELIDVYPSWLSWTFTIILNVSVSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEH
        KYWT+QFVVIRPVCS+LMI  +L+++YP WLSWTFT+ILNVSVSLALY+LV+FYHVF KEL PH PLAKFLC+KGIVFFCFWQGIVL  L A+GIIK+ H
Subjt:  KYWTYQFVVIRPVCSILMITFELIDVYPSWLSWTFTIILNVSVSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEH

Query:  AWFDVEHINEALQNTLVCVEMVFFALVQMSAYSASPYKPQSAAKSKLEKKKD
         W DV HI +ALQN LV VEMVFFA+ QM AY+A+PYK   AA  K EKKK+
Subjt:  AWFDVEHINEALQNTLVCVEMVFFALVQMSAYSASPYKPQSAAKSKLEKKKD

KAA8523841.1 hypothetical protein F0562_010264 [Nyssa sinensis]0.0e+0071.09Show/hide
Query:  LSCSFLPYPSPNFGASPPAATLHSGASLPVPVDRKHYTSDSPFAPEVVKAVDSLQYEFRAVDNLVARNTAKVLKAFQNARLGSHHFGGSTGYGHDEAGGR
        LSC+   YP+    AS     + S A + VP  R H+  D+PFAPEV KAVDSL  EFRAVDNLVARNT++VL+A+QNAR+G HHFGG TGYGH+EAGGR
Subjt:  LSCSFLPYPSPNFGASPPAATLHSGASLPVPVDRKHYTSDSPFAPEVVKAVDSLQYEFRAVDNLVARNTAKVLKAFQNARLGSHHFGGSTGYGHDEAGGR

Query:  EALDNAFAEIVGAESAIVRSQFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGIEYREVPLAEDGGLDWEKLASSLKPQT
        EALD  FAEI GAESAIVRSQFFSGTHAITCALFA LRPGDELLAVAGAPYDTLEEVIG RDS GLGSLKDFG++YREVPLAEDGGLDW+ L  +LKPQT
Subjt:  EALDNAFAEIVGAESAIVRSQFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGIEYREVPLAEDGGLDWEKLASSLKPQT

Query:  KCALIQRSCGYSWRRSLSVDEIGRAIKLIKMQNPDCLVMVDNCYGEFVETTEPPTVGADLIAGSLIKNPGGTLAPCGGYVAGRDKWVRAAAARLSAPGLG
        KCALIQRSCGYSWRRSLSV EIGRAIK++K QNPDCLVMVDNCYGEFVE  EPP VGADLIAGSLIKNPGGT+APCGGYVAG++KWV+AAAARLSAPGLG
Subjt:  KCALIQRSCGYSWRRSLSVDEIGRAIKLIKMQNPDCLVMVDNCYGEFVETTEPPTVGADLIAGSLIKNPGGTLAPCGGYVAGRDKWVRAAAARLSAPGLG

Query:  VDSGSTPGDIMRTFFQGLFLSPQMVGEAVKGMILIAEVMASKGYKVQPLPRAPRHDTVQAVQLESRELLLAFCEAVQRSSPVASFTKPVPGITPGYASEV
        +D GSTPGDIMRTFFQGLFLSPQMVGEA+KG  LIAEVMA+KGYKVQPLPR PRHDTVQAVQL +RE LLAFCEAVQRSSPV SFTKPV G TPGYASEV
Subjt:  VDSGSTPGDIMRTFFQGLFLSPQMVGEAVKGMILIAEVMASKGYKVQPLPRAPRHDTVQAVQLESRELLLAFCEAVQRSSPVASFTKPVPGITPGYASEV

Query:  IFADGTFIDGSTSELSCDGPLREPFAVFCQGGTHWTQWG-----------------------------SHFWVASVVSILAI------------------
        IFADGTFIDGSTSELSCDGPLREPF+VFCQGGTHWTQWG                             +H +    + I                     
Subjt:  IFADGTFIDGSTSELSCDGPLREPFAVFCQGGTHWTQWG-----------------------------SHFWVASVVSILAI------------------

Query:  ------TMHFGQMIFLGVTSSVVLTAILSSWLLVQHLSNWKKPTEQKAIVIIILMAPLYAGISYIGLLEAMASSTFFLFLESIKECYEALVISKFLSLLY
               M+ GQ+  +G T  V+LT   +  LL QH   WKKP EQKAI+IIILMAP+YA  S++GLL+   S  FF FL+S+KECYEALV++KFL+L+Y
Subjt:  ------TMHFGQMIFLGVTSSVVLTAILSSWLLVQHLSNWKKPTEQKAIVIIILMAPLYAGISYIGLLEAMASSTFFLFLESIKECYEALVISKFLSLLY

Query:  SYLNISISKNIVPDEIKGREIHHTFPMTLFQPHSARLNHHTLKLLKYWTYQFVVIRPVCSILMITFELIDVYPSWLSWTFTIILNVSVSLALYSLVVFYH
        +YLNISISKNIVPDEIKGREIHH+FPMTLFQP + RLNHHTLKLLK WT+QFVVIRPVCSILMI  +L+ +YPSW+SWTFTIILN+SVSLALYSLVVFYH
Subjt:  SYLNISISKNIVPDEIKGREIHHTFPMTLFQPHSARLNHHTLKLLKYWTYQFVVIRPVCSILMITFELIDVYPSWLSWTFTIILNVSVSLALYSLVVFYH

Query:  VFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFALVQMSAYSASPYK--PQSAAKSKLEKKKD
        VF KEL+PH PLAKFLCIKGIVFFCFWQG+VLE+LAA+G+I++ H W DVE I EALQN LVCVEMVFF+  Q  AYSA+PY     ++ K K  + KD
Subjt:  VFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFALVQMSAYSASPYK--PQSAAKSKLEKKKD

KAG6591988.1 hypothetical protein SDJN03_14334, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0088.95Show/hide
Query:  MWGLSCSFLPYPSPNFGASPPAATLHSGASLPVPVDRKHYTSDSPFAPEVVKAVDSLQYEFRAVDNLVARNTAKVLKAFQNARLGSHHFGGSTGYGHDEA
        MWGLSCS   YPSPNF  S PAATL S  SLPV +DRKHYTSD+PFAPEVVKAVDSLQYEFRAVDNLVARN+AKVLKAFQNARLGSHHFGGSTGYGHDEA
Subjt:  MWGLSCSFLPYPSPNFGASPPAATLHSGASLPVPVDRKHYTSDSPFAPEVVKAVDSLQYEFRAVDNLVARNTAKVLKAFQNARLGSHHFGGSTGYGHDEA

Query:  GGREALDNAFAEIVGAESAIVRSQFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGIEYREVPLAEDGGLDWEKLASSLK
        GGREALDNAFAEIVGAESAIVRSQFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFG+EYREVPLAEDGGLDWEKLASSLK
Subjt:  GGREALDNAFAEIVGAESAIVRSQFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGIEYREVPLAEDGGLDWEKLASSLK

Query:  PQTKCALIQRSCGYSWRRSLSVDEIGRAIKLIKMQNPDCLVMVDNCYGEFVETTEPPTVGADLIAGSLIKNPGGTLAPCGGYVAGRDKWVRAAAARLSAP
        PQTKCALIQRSCGYSWRRSLSVDEIG+AI+LIKMQNPDCLVMVDNCYGEFVET EPPTVGADLIAGSLIKNPGGTLAPCGGYVAGRDKWV+AAAARLSAP
Subjt:  PQTKCALIQRSCGYSWRRSLSVDEIGRAIKLIKMQNPDCLVMVDNCYGEFVETTEPPTVGADLIAGSLIKNPGGTLAPCGGYVAGRDKWVRAAAARLSAP

Query:  GLGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVKGMILIAEVMASKGYKVQPLPRAPRHDTVQAVQLESRELLLAFCEAVQRSSPVASFTKPVPGITPGYA
        GLGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVKGMILIAEVMASKGYKVQPLPR PRHDTVQAVQL SRELLLAFCEAVQRSSPVAS+TKPVPGITPGYA
Subjt:  GLGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVKGMILIAEVMASKGYKVQPLPRAPRHDTVQAVQLESRELLLAFCEAVQRSSPVASFTKPVPGITPGYA

Query:  SEVIFADGTFIDGSTSELSCDGPLREPFAVFCQGGTH-----WTQWGSHFWVASVVSILAITMHFGQMIFLGVTSSVVLTAILSSWLLVQHLSNWKKPTE
        SEVIFADGTFIDGSTSELSCDGPLREPFAVFCQ         +    S    +S+VSI                S ++   + S WLL QHLSNW+KP E
Subjt:  SEVIFADGTFIDGSTSELSCDGPLREPFAVFCQGGTH-----WTQWGSHFWVASVVSILAITMHFGQMIFLGVTSSVVLTAILSSWLLVQHLSNWKKPTE

Query:  QKAIVIIILMAPLYAGISYIGLLEAMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREIHHTFPMTLFQPHSARLNHHTLKLL
        QKAIV+IILMAPLYAGISYIGLLE MASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREIHH+FPMTLFQPHSARLNHHTLKLL
Subjt:  QKAIVIIILMAPLYAGISYIGLLEAMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREIHHTFPMTLFQPHSARLNHHTLKLL

Query:  KYWTYQFVVIRPVCSILMITFELIDVYPSWLSWTFTIILNVSVSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEH
        K WTYQFVVIRPVCSILMI+ +LIDVYP WLSWTFTIILNVSVSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEH
Subjt:  KYWTYQFVVIRPVCSILMITFELIDVYPSWLSWTFTIILNVSVSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEH

Query:  AWFDVEHINEALQNTLVCVEMVFFALVQMSAYSASPYKPQSAAKSKLEKKK
        AWFDVEHINEA+QNTLVCVEMVFFA+VQMSAYSASPY+ QSAAKSK EKK+
Subjt:  AWFDVEHINEALQNTLVCVEMVFFALVQMSAYSASPYKPQSAAKSKLEKKK

KAG7024863.1 ynbB [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0086.05Show/hide
Query:  MWGLSCSFLPYPSPNFGASPPAATLHSGASLPVPVDRKHYTSDSPFAPEVVKAVDSLQYEFRAVDNLVARNTAKVLKAFQNARLGSHHFGGSTGYGHDEA
        MWGLSCS   YPSPNF  S PAATL S  SLPV +DRKHYTSD+PFAPEVVKAVDSLQYEFRAVDNLVARN+AKVLKAFQNARLGSHHFGGSTGYGHDEA
Subjt:  MWGLSCSFLPYPSPNFGASPPAATLHSGASLPVPVDRKHYTSDSPFAPEVVKAVDSLQYEFRAVDNLVARNTAKVLKAFQNARLGSHHFGGSTGYGHDEA

Query:  GGREALDNAFAEIVGAESAIVRSQFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGIEYREVPLAEDGGLDWEKLASSLK
        GGREALDNAFAEIVGAESAIVRSQFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFG+EYREVPLAEDGGLDWEKLASSLK
Subjt:  GGREALDNAFAEIVGAESAIVRSQFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGIEYREVPLAEDGGLDWEKLASSLK

Query:  PQTKCALIQRSCGYSWRRSLSVDEIGRAIKLIKMQNPDCLVMVDNCYGEFVETTEPPTVGADLIAGSLIKNPGGTLAPCGGYVAGRDKWVRAAAARLSAP
        PQTKCALIQRSCGYSWRRSLSVDEIG+AI+LIKMQNPDCLVMVDNCYGEFVET EPPTVGADLIAGSLIKNPGGTLAPCGGYVAGRDKWV+AAAARLSAP
Subjt:  PQTKCALIQRSCGYSWRRSLSVDEIGRAIKLIKMQNPDCLVMVDNCYGEFVETTEPPTVGADLIAGSLIKNPGGTLAPCGGYVAGRDKWVRAAAARLSAP

Query:  GLGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVKGMILIAEVMASKGYKVQPLPRAPRHDTVQAVQLESRELLLAFCEAVQRSSPVASFTKPVPGITPGYA
        GLGVDSGSTPGDIMRTFFQGLFLSPQM                                   AVQL SRELLLAFCEAVQRSSPVAS+TKPVPGITPGYA
Subjt:  GLGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVKGMILIAEVMASKGYKVQPLPRAPRHDTVQAVQLESRELLLAFCEAVQRSSPVASFTKPVPGITPGYA

Query:  SEVIFADGTFIDGSTSELSCDGPLREPFAVFCQGGTHWTQWG-------------------SHFWVASVVSILAITMHFGQMIFLGVTSSVVLTAILSSW
        SEVIFADGTFIDGSTSELSCDGPLREPFAVFCQGGTHWTQWG                   S    +S+  I AITM +G MIFL VTSSVVLT++ S W
Subjt:  SEVIFADGTFIDGSTSELSCDGPLREPFAVFCQGGTHWTQWG-------------------SHFWVASVVSILAITMHFGQMIFLGVTSSVVLTAILSSW

Query:  LLVQHLSNWKKPTEQKAIVIIILMAPLYAGISYIGLLEAMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREIHHTFPMTLFQ
        LL QHLSNW+KP EQKAIV+IILMAPLYAGISYIGLLE MASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREIHH+FPMTLFQ
Subjt:  LLVQHLSNWKKPTEQKAIVIIILMAPLYAGISYIGLLEAMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREIHHTFPMTLFQ

Query:  PHSARLNHHTLKLLKYWTYQFVVIRPVCSILMITFELIDVYPSWLSWTFTIILNVSVSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIV
        PHSARLNHHTLKLLK WTYQFVVIRPVCSILMI+ +LIDVYP WLSWTFTIILNVSVSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIV
Subjt:  PHSARLNHHTLKLLKYWTYQFVVIRPVCSILMITFELIDVYPSWLSWTFTIILNVSVSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIV

Query:  LEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFALVQMSAYSASPYKPQSAAKSK
        LEMLAAVGIIKAEHAWFDVEHINEA+QNTLVCVEMVFFA+VQMSAYSASPY+ QSAAKSK
Subjt:  LEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFALVQMSAYSASPYKPQSAAKSK

RXI03688.1 hypothetical protein DVH24_004340 [Malus domestica]0.0e+0073.04Show/hide
Query:  MWGLSCSFLPYPSPNFGASPPAATLHSGASLPVPVDRKHYTSDSPFAPEVVKAVDSLQYEFRAVDNLVARNTAKVLKAFQNARLGSHHFGGSTGYGHDEA
        MW LSC    YP+    AS P AT  S + L VP +   +  DSPF PEV  AVDSL  EFRAVDNLVARNT +VLKAFQNAR+GSHHF G TGYGHDEA
Subjt:  MWGLSCSFLPYPSPNFGASPPAATLHSGASLPVPVDRKHYTSDSPFAPEVVKAVDSLQYEFRAVDNLVARNTAKVLKAFQNARLGSHHFGGSTGYGHDEA

Query:  GGREALDNAFAEIVGAESAIVRS----------------------QFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGIE
        GGREALD AFAEIVGAESAIVRS                      QFFSGTHAITCALFA LRPGDELLAVAG PYDTLEEVIGKRDS G+GSL DFG++
Subjt:  GGREALDNAFAEIVGAESAIVRS----------------------QFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGIE

Query:  YREVPLAEDGGLDWEKLASSLKPQTKCALIQRSCGYSWRRSLSVDEIGRAIKLIKMQNPDCLVMVDNCYGEFVETTEPPTVGADLIAGSLIKNPGGTLAP
        YREVPLAEDGGL+W+ L  +L+P+TKCALIQRSCGYSWRRSLSVDEIGRAIK+IK QNP+CLVMVDNCYGEFVE+ EPP VGADLIAGSLIKNPGGT+AP
Subjt:  YREVPLAEDGGLDWEKLASSLKPQTKCALIQRSCGYSWRRSLSVDEIGRAIKLIKMQNPDCLVMVDNCYGEFVETTEPPTVGADLIAGSLIKNPGGTLAP

Query:  CGGYVAGRDKWVRAAAARLSAPGLGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVKGMILIAEVMASKGYKVQPLPRAPRHDTVQAVQLESRELLLAFCEA
        CGGYVAGR+KWV+AA+ARLSAPGLGVD G+TPGDIMR+FFQGLFLSPQMVGEA+KG +++AEVMA++GYKVQPLPR PRHDTVQAVQL SRE LLAFCEA
Subjt:  CGGYVAGRDKWVRAAAARLSAPGLGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVKGMILIAEVMASKGYKVQPLPRAPRHDTVQAVQLESRELLLAFCEA

Query:  VQRSSPVASFTKPVPGITPGYASE--------------VIFADGTFIDGSTSELSCDGPLREPFAVFCQGGTHWTQWGSHFWVASVVSILAITMHFGQMI
        VQR+SPV SFTKPV G TPGYASE              VIFADGTFIDGSTSELSCDGPLREPFAVFCQGG+HWTQWG       + +         Q++
Subjt:  VQRSSPVASFTKPVPGITPGYASE--------------VIFADGTFIDGSTSELSCDGPLREPFAVFCQGGTHWTQWGSHFWVASVVSILAITMHFGQMI

Query:  FLGVTSSVVLTAILSSWLLVQHLSNWKKPTEQKAIVIIILMAPLYAGISYIGLLEAMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPD
         LG T  +++T   S  LL +H   W KP EQKAIVIIILMAPLYA  S++GLL+   S   F  L+SIKECYEALVI+KFL+LLYSYLNISISKNIVPD
Subjt:  FLGVTSSVVLTAILSSWLLVQHLSNWKKPTEQKAIVIIILMAPLYAGISYIGLLEAMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPD

Query:  EIKGREIHHTFPMTLFQPHSARLNHHTLKLLKYWTYQFVVIRPVCSILMITFELIDVYPSWLSWTFTIILNVSVSLALYSLVVFYHVFDKELKPHSPLAK
        EIKGREIHH+FPMTLF P + RLNHHTLKLLKYWT+QFVVIRPVCSILMIT +L+ VYPSW+SWTFTIILN+SVSLALYSL+ FYHVF KEL PH PL K
Subjt:  EIKGREIHHTFPMTLFQPHSARLNHHTLKLLKYWTYQFVVIRPVCSILMITFELIDVYPSWLSWTFTIILNVSVSLALYSLVVFYHVFDKELKPHSPLAK

Query:  FLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFALVQMSAYS
        FLCIKGIVFFCFWQGIVL++LAA+ II++ H W DVEHI EALQN LVCVEMVFF++VQ  AY+
Subjt:  FLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFALVQMSAYS

TrEMBL top hitse value%identityAlignment
A0A3Q7HH85 Uncharacterized protein1.1e-30973Show/hide
Query:  LSCSFLPYPSPNFGASPPAATLHSGASLPVPVDRKHYTSDSPFAPEVVKAVDSLQYEFRAVDNLVARNTAKVLKAFQNARLGSHHFGGSTGYGHDEAGGR
        L C+   YP+     +   A + S + + VP   + + SDSPF PEV KAVDSL  EFR VDNLVARNTA+VL+AFQ  ++GSHHFGGSTGYGH+EAGGR
Subjt:  LSCSFLPYPSPNFGASPPAATLHSGASLPVPVDRKHYTSDSPFAPEVVKAVDSLQYEFRAVDNLVARNTAKVLKAFQNARLGSHHFGGSTGYGHDEAGGR

Query:  EALDNAFAEIVGAESAIVRSQFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGIEYREVPLAEDGGLDWEKLASSLKPQT
        EALD AFAEIVGAESAIVRSQFFSGTHAITCALFA LRPGDELLA+AGAPYDTLEEVIGKRDS G GSLKDFG+EYREVPLAEDGGLDW+ L +S++P T
Subjt:  EALDNAFAEIVGAESAIVRSQFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGIEYREVPLAEDGGLDWEKLASSLKPQT

Query:  KCALIQRSCGYSWRRSLSVDEIGRAIKLIKMQNPDCLVMVDNCYGEFVETTEPPTVGADLIAGSLIKNPGGTLAPCGGYVAGRDKWVRAAAARLSAPGLG
        KCALIQRSCGYSWRRSLSV EIGRAI +IKMQNP C+VMVDNCYGEFV+  EPP VGADLIAGSLIKNPGGT+APCGGYVAGR KWV AAAARLSAPGLG
Subjt:  KCALIQRSCGYSWRRSLSVDEIGRAIKLIKMQNPDCLVMVDNCYGEFVETTEPPTVGADLIAGSLIKNPGGTLAPCGGYVAGRDKWVRAAAARLSAPGLG

Query:  VDSGSTPGDIMRTFFQGLFLSPQMVGEAVKGMILIAEVMASKGYKVQPLPRAPRHDTVQAVQLESRELLLAFCEAVQRSSPVASFTKPVPGITPGYASEV
        VD GSTPGDIMRT FQGLFLSPQMVGEA+KG  LIAEVMA+KGYKVQPL R  RHDTVQAVQL +RE LL+FCEAVQRSSPV+SF +PV G T GYASEV
Subjt:  VDSGSTPGDIMRTFFQGLFLSPQMVGEAVKGMILIAEVMASKGYKVQPLPRAPRHDTVQAVQLESRELLLAFCEAVQRSSPVASFTKPVPGITPGYASEV

Query:  IFADGTFIDGSTSELSCDGPLREPFAVFCQGGTHWTQWGSHF----WVASVVSILAITMHFGQMIFLGVTSSVVLTAILSSWLLVQHLSNWKKPTEQKAI
        IFADGTFIDGSTSELSCDGPLREPF+VFCQGGTHWTQWG       W+  +  I+       +   LG      L+A     L+ +H ++WKKP EQKAI
Subjt:  IFADGTFIDGSTSELSCDGPLREPFAVFCQGGTHWTQWGSHF----WVASVVSILAITMHFGQMIFLGVTSSVVLTAILSSWLLVQHLSNWKKPTEQKAI

Query:  VIIILMAPLYAGISYIGLLEAMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREIHHTFPMTLFQPHSARLNHHTLKLLKYWT
        +II+LMAPLYA +S+IGL++ M S  FF FLES+KECYEA+V++KFL L+Y+YLNISISKNIVPDEIKGR+IHH+FPMTLFQPH+A LNHHTLKLLK WT
Subjt:  VIIILMAPLYAGISYIGLLEAMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREIHHTFPMTLFQPHSARLNHHTLKLLKYWT

Query:  YQFVVIRPVCSILMITFELIDVYPSWLSWTFTIILNVSVSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFD
        +QFVVIRPVCSILMI  ++  VYPSW+SWTFTIILN+SVSLALYSLVVFYHVF KEL PH PLAKFLC+KGIVFF FWQGI+L +L ++GIIK+ + W +
Subjt:  YQFVVIRPVCSILMITFELIDVYPSWLSWTFTIILNVSVSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFD

Query:  VEHINEALQNTLVCVEMVFFALVQMSAYSASPYKPQS
        VE + E +QN LV +EMVFFA++   AYSA+PY+ ++
Subjt:  VEHINEALQNTLVCVEMVFFALVQMSAYSASPYKPQS

A0A498K8E2 Uncharacterized protein0.0e+0073.04Show/hide
Query:  MWGLSCSFLPYPSPNFGASPPAATLHSGASLPVPVDRKHYTSDSPFAPEVVKAVDSLQYEFRAVDNLVARNTAKVLKAFQNARLGSHHFGGSTGYGHDEA
        MW LSC    YP+    AS P AT  S + L VP +   +  DSPF PEV  AVDSL  EFRAVDNLVARNT +VLKAFQNAR+GSHHF G TGYGHDEA
Subjt:  MWGLSCSFLPYPSPNFGASPPAATLHSGASLPVPVDRKHYTSDSPFAPEVVKAVDSLQYEFRAVDNLVARNTAKVLKAFQNARLGSHHFGGSTGYGHDEA

Query:  GGREALDNAFAEIVGAESAIVRS----------------------QFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGIE
        GGREALD AFAEIVGAESAIVRS                      QFFSGTHAITCALFA LRPGDELLAVAG PYDTLEEVIGKRDS G+GSL DFG++
Subjt:  GGREALDNAFAEIVGAESAIVRS----------------------QFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGIE

Query:  YREVPLAEDGGLDWEKLASSLKPQTKCALIQRSCGYSWRRSLSVDEIGRAIKLIKMQNPDCLVMVDNCYGEFVETTEPPTVGADLIAGSLIKNPGGTLAP
        YREVPLAEDGGL+W+ L  +L+P+TKCALIQRSCGYSWRRSLSVDEIGRAIK+IK QNP+CLVMVDNCYGEFVE+ EPP VGADLIAGSLIKNPGGT+AP
Subjt:  YREVPLAEDGGLDWEKLASSLKPQTKCALIQRSCGYSWRRSLSVDEIGRAIKLIKMQNPDCLVMVDNCYGEFVETTEPPTVGADLIAGSLIKNPGGTLAP

Query:  CGGYVAGRDKWVRAAAARLSAPGLGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVKGMILIAEVMASKGYKVQPLPRAPRHDTVQAVQLESRELLLAFCEA
        CGGYVAGR+KWV+AA+ARLSAPGLGVD G+TPGDIMR+FFQGLFLSPQMVGEA+KG +++AEVMA++GYKVQPLPR PRHDTVQAVQL SRE LLAFCEA
Subjt:  CGGYVAGRDKWVRAAAARLSAPGLGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVKGMILIAEVMASKGYKVQPLPRAPRHDTVQAVQLESRELLLAFCEA

Query:  VQRSSPVASFTKPVPGITPGYASE--------------VIFADGTFIDGSTSELSCDGPLREPFAVFCQGGTHWTQWGSHFWVASVVSILAITMHFGQMI
        VQR+SPV SFTKPV G TPGYASE              VIFADGTFIDGSTSELSCDGPLREPFAVFCQGG+HWTQWG       + +         Q++
Subjt:  VQRSSPVASFTKPVPGITPGYASE--------------VIFADGTFIDGSTSELSCDGPLREPFAVFCQGGTHWTQWGSHFWVASVVSILAITMHFGQMI

Query:  FLGVTSSVVLTAILSSWLLVQHLSNWKKPTEQKAIVIIILMAPLYAGISYIGLLEAMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPD
         LG T  +++T   S  LL +H   W KP EQKAIVIIILMAPLYA  S++GLL+   S   F  L+SIKECYEALVI+KFL+LLYSYLNISISKNIVPD
Subjt:  FLGVTSSVVLTAILSSWLLVQHLSNWKKPTEQKAIVIIILMAPLYAGISYIGLLEAMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPD

Query:  EIKGREIHHTFPMTLFQPHSARLNHHTLKLLKYWTYQFVVIRPVCSILMITFELIDVYPSWLSWTFTIILNVSVSLALYSLVVFYHVFDKELKPHSPLAK
        EIKGREIHH+FPMTLF P + RLNHHTLKLLKYWT+QFVVIRPVCSILMIT +L+ VYPSW+SWTFTIILN+SVSLALYSL+ FYHVF KEL PH PL K
Subjt:  EIKGREIHHTFPMTLFQPHSARLNHHTLKLLKYWTYQFVVIRPVCSILMITFELIDVYPSWLSWTFTIILNVSVSLALYSLVVFYHVFDKELKPHSPLAK

Query:  FLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFALVQMSAYS
        FLCIKGIVFFCFWQGIVL++LAA+ II++ H W DVEHI EALQN LVCVEMVFF++VQ  AY+
Subjt:  FLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFALVQMSAYS

A0A5J5A352 Uncharacterized protein0.0e+0071.09Show/hide
Query:  LSCSFLPYPSPNFGASPPAATLHSGASLPVPVDRKHYTSDSPFAPEVVKAVDSLQYEFRAVDNLVARNTAKVLKAFQNARLGSHHFGGSTGYGHDEAGGR
        LSC+   YP+    AS     + S A + VP  R H+  D+PFAPEV KAVDSL  EFRAVDNLVARNT++VL+A+QNAR+G HHFGG TGYGH+EAGGR
Subjt:  LSCSFLPYPSPNFGASPPAATLHSGASLPVPVDRKHYTSDSPFAPEVVKAVDSLQYEFRAVDNLVARNTAKVLKAFQNARLGSHHFGGSTGYGHDEAGGR

Query:  EALDNAFAEIVGAESAIVRSQFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGIEYREVPLAEDGGLDWEKLASSLKPQT
        EALD  FAEI GAESAIVRSQFFSGTHAITCALFA LRPGDELLAVAGAPYDTLEEVIG RDS GLGSLKDFG++YREVPLAEDGGLDW+ L  +LKPQT
Subjt:  EALDNAFAEIVGAESAIVRSQFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGIEYREVPLAEDGGLDWEKLASSLKPQT

Query:  KCALIQRSCGYSWRRSLSVDEIGRAIKLIKMQNPDCLVMVDNCYGEFVETTEPPTVGADLIAGSLIKNPGGTLAPCGGYVAGRDKWVRAAAARLSAPGLG
        KCALIQRSCGYSWRRSLSV EIGRAIK++K QNPDCLVMVDNCYGEFVE  EPP VGADLIAGSLIKNPGGT+APCGGYVAG++KWV+AAAARLSAPGLG
Subjt:  KCALIQRSCGYSWRRSLSVDEIGRAIKLIKMQNPDCLVMVDNCYGEFVETTEPPTVGADLIAGSLIKNPGGTLAPCGGYVAGRDKWVRAAAARLSAPGLG

Query:  VDSGSTPGDIMRTFFQGLFLSPQMVGEAVKGMILIAEVMASKGYKVQPLPRAPRHDTVQAVQLESRELLLAFCEAVQRSSPVASFTKPVPGITPGYASEV
        +D GSTPGDIMRTFFQGLFLSPQMVGEA+KG  LIAEVMA+KGYKVQPLPR PRHDTVQAVQL +RE LLAFCEAVQRSSPV SFTKPV G TPGYASEV
Subjt:  VDSGSTPGDIMRTFFQGLFLSPQMVGEAVKGMILIAEVMASKGYKVQPLPRAPRHDTVQAVQLESRELLLAFCEAVQRSSPVASFTKPVPGITPGYASEV

Query:  IFADGTFIDGSTSELSCDGPLREPFAVFCQGGTHWTQWG-----------------------------SHFWVASVVSILAI------------------
        IFADGTFIDGSTSELSCDGPLREPF+VFCQGGTHWTQWG                             +H +    + I                     
Subjt:  IFADGTFIDGSTSELSCDGPLREPFAVFCQGGTHWTQWG-----------------------------SHFWVASVVSILAI------------------

Query:  ------TMHFGQMIFLGVTSSVVLTAILSSWLLVQHLSNWKKPTEQKAIVIIILMAPLYAGISYIGLLEAMASSTFFLFLESIKECYEALVISKFLSLLY
               M+ GQ+  +G T  V+LT   +  LL QH   WKKP EQKAI+IIILMAP+YA  S++GLL+   S  FF FL+S+KECYEALV++KFL+L+Y
Subjt:  ------TMHFGQMIFLGVTSSVVLTAILSSWLLVQHLSNWKKPTEQKAIVIIILMAPLYAGISYIGLLEAMASSTFFLFLESIKECYEALVISKFLSLLY

Query:  SYLNISISKNIVPDEIKGREIHHTFPMTLFQPHSARLNHHTLKLLKYWTYQFVVIRPVCSILMITFELIDVYPSWLSWTFTIILNVSVSLALYSLVVFYH
        +YLNISISKNIVPDEIKGREIHH+FPMTLFQP + RLNHHTLKLLK WT+QFVVIRPVCSILMI  +L+ +YPSW+SWTFTIILN+SVSLALYSLVVFYH
Subjt:  SYLNISISKNIVPDEIKGREIHHTFPMTLFQPHSARLNHHTLKLLKYWTYQFVVIRPVCSILMITFELIDVYPSWLSWTFTIILNVSVSLALYSLVVFYH

Query:  VFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFALVQMSAYSASPYK--PQSAAKSKLEKKKD
        VF KEL+PH PLAKFLCIKGIVFFCFWQG+VLE+LAA+G+I++ H W DVE I EALQN LVCVEMVFF+  Q  AYSA+PY     ++ K K  + KD
Subjt:  VFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFALVQMSAYSASPYK--PQSAAKSKLEKKKD

A0A5N5GL16 Uncharacterized protein4.2e-30170.43Show/hide
Query:  MWGLSCSFLPYPSPNFGASPPAATLHSGASLPVPVDRKHYTSDSPFAPEVVKAVDSLQYEFRAVDNLVARNTAKVLKAFQNARLGSHHFGGSTGYGHDEA
        MW LSC+   YP+ +  AS P AT  S + L VP +   +  DSPF PEV  AVDSL  EFRAVDNLVARNT +VLKAFQNAR+GSHHF G TGYGHDEA
Subjt:  MWGLSCSFLPYPSPNFGASPPAATLHSGASLPVPVDRKHYTSDSPFAPEVVKAVDSLQYEFRAVDNLVARNTAKVLKAFQNARLGSHHFGGSTGYGHDEA

Query:  GGREALDNAFAEIVGAESAIVRSQFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGIEYREVPLAEDGGLDWEKLASSLK
        GGREALD AFAEIVGAESAIVRSQFFSGTHAITCALFA LRPGDELLAVAG PYDTLEEVIGKRDS G+GSL DFG++YREVPLAEDGGL+W+ L  +L+
Subjt:  GGREALDNAFAEIVGAESAIVRSQFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGIEYREVPLAEDGGLDWEKLASSLK

Query:  PQTKCALIQRSCGYSWRRSLSVDEIGRAIKLIKMQNPDCLVMVDNCYGEFVETTEPPTVGADLIAGSLIKNPGGTLAPCGGYVAGRDKWVRAAAARLSAP
        P+TKCALIQRSCGYSWRRSLSVDEIG+AIK+IK QN +CLVMVDNCYGEFVE+ EPP VGADLIAGSLIKNPGGT+APCGGYVAGR+KWV+AA+ARLSAP
Subjt:  PQTKCALIQRSCGYSWRRSLSVDEIGRAIKLIKMQNPDCLVMVDNCYGEFVETTEPPTVGADLIAGSLIKNPGGTLAPCGGYVAGRDKWVRAAAARLSAP

Query:  GLGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVKGMILIAEVMASKGYKVQPLPRAPRHDTVQAVQLESRELLLAFCEAVQRSSPVASFTKPVPGITPGYA
        GLGVD G+TPGDIMR FFQGLFLSPQMVGEA+KG +L+AEVMA++GYKVQPLPR PRHDTVQAVQL SRE LLAFCEAVQR+SPV SFTKPV G TPGYA
Subjt:  GLGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVKGMILIAEVMASKGYKVQPLPRAPRHDTVQAVQLESRELLLAFCEAVQRSSPVASFTKPVPGITPGYA

Query:  SE--------------VIFADGTFIDGSTSELSCDGPLREPFAVFCQGGTHWTQWGSHFWVASVVSILAITMHFGQMIFLGVTSSVVLTAILSSWLLVQH
        SE              VIFADGTFIDGSTSELSCDGPLREPFAVFCQGG+HWTQWG                     + LG  S V  T           
Subjt:  SE--------------VIFADGTFIDGSTSELSCDGPLREPFAVFCQGGTHWTQWGSHFWVASVVSILAITMHFGQMIFLGVTSSVVLTAILSSWLLVQH

Query:  LSNWKKPTEQKAIVIIILMAPLYAGISYIGLLEAMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREIHHTFPMTLFQPHSAR
                                                   L+SIKECYEALVI+KFL+LLYSYLNISISKNIVPDEIKGREIHH+FPMTLF P + R
Subjt:  LSNWKKPTEQKAIVIIILMAPLYAGISYIGLLEAMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREIHHTFPMTLFQPHSAR

Query:  LNHHTLKLLKYWTYQFVVIRPVCSILMITFELIDVYPSWLSWTFTIILNVSVSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLA
        LNHHTLKLLKYWT+QFVVIRPVCSILMIT +L+ VYPSW+SWTFTIILN+SVSLALYSLV FYHVF KEL PH PL KFLCIKGIVFFCFWQGIVL++LA
Subjt:  LNHHTLKLLKYWTYQFVVIRPVCSILMITFELIDVYPSWLSWTFTIILNVSVSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLA

Query:  AVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFALVQMSAYSASPYKPQSAAKSKLEKKKD
        A+ II++ H W DVEHI EALQN LVCVEMVFF++VQ  AYSA PY+    +    ++KKD
Subjt:  AVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFALVQMSAYSASPYKPQSAAKSKLEKKKD

A0A6S7PJH8 Uncharacterized protein0.0e+0073.14Show/hide
Query:  PSPNF----GASPPAATLHSGASLPVPVDRKHYTSDSPFAPEVVKAVDSLQYEFRAVDNLVARNTAKVLKAFQNARLGSHHFGGSTGYGHDEAGGREALD
        P+ NF     +S  AA L S + + +    +H  S   F PEV  AVD+L  EFRAVDNLVA+N+++VLKAFQNAR+GSHHF G TGYGH+EAGGREALD
Subjt:  PSPNF----GASPPAATLHSGASLPVPVDRKHYTSDSPFAPEVVKAVDSLQYEFRAVDNLVARNTAKVLKAFQNARLGSHHFGGSTGYGHDEAGGREALD

Query:  NAFAEIVGAESAIVRSQFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGIEYREVPLAEDGGLDWEKLASSLKPQTKCAL
         AFAEI GAESAIVRSQFFSGTHAITCALFA LRPGDELLAVAGAPYDTLEEVIG RD  GLGSLKDFGI YREV LA+DGGLDW+ L  +LKP+TKCAL
Subjt:  NAFAEIVGAESAIVRSQFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGIEYREVPLAEDGGLDWEKLASSLKPQTKCAL

Query:  IQRSCGYSWRRSLSVDEIGRAIKLIKMQNPDCLVMVDNCYGEFVETTEPPTVGADLIAGSLIKNPGGTLAPCGGYVAGRDKWVRAAAARLSAPGLGVDSG
        IQRSCGYSWR+SLSV+EI RAI +IK QNP+CLVMVDNCYGEF ET EPP VGADLIAGSLIKNPGGT+APCGGYVAG++KWV+AAAARLSAPGLGVD G
Subjt:  IQRSCGYSWRRSLSVDEIGRAIKLIKMQNPDCLVMVDNCYGEFVETTEPPTVGADLIAGSLIKNPGGTLAPCGGYVAGRDKWVRAAAARLSAPGLGVDSG

Query:  STPGDIMRTFFQGLFLSPQMVGEAVKGMILIAEVMASKGYKVQPLPRAPRHDTVQAVQLESRELLLAFCEAVQRSSPVASFTKPVPGITPGYASEVIFAD
        STPGDIMR FFQGL+LSPQMVGE++KG +LIAEVM++KGYKVQPLPR PRHD VQAVQL SRE LLAFCEAVQRSSPV+S+TKP+ G+T GYASEVIFAD
Subjt:  STPGDIMRTFFQGLFLSPQMVGEAVKGMILIAEVMASKGYKVQPLPRAPRHDTVQAVQLESRELLLAFCEAVQRSSPVASFTKPVPGITPGYASEVIFAD

Query:  GTFIDGSTSELSCDGPLREPFAVFCQGGTHWTQWG---SHFWVASVVS---------ILAITMHFGQMIFLGVTSSVVLTAILSSWLLVQHLSNWKKPTE
        GTFIDGSTSELSCDGPLREPF VFCQGGTHWTQWG     F+V  +++         +  + M   Q   +G  + V +T +L+  L+  HLS+WKKP E
Subjt:  GTFIDGSTSELSCDGPLREPFAVFCQGGTHWTQWG---SHFWVASVVS---------ILAITMHFGQMIFLGVTSSVVLTAILSSWLLVQHLSNWKKPTE

Query:  QKAIVIIILMAPLYAGISYIGLLEAMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREIHHTFPMTLFQPHSARLNHHTLKLL
        QKAI++IILMAP+YA  SY+GLL+   S TFF+ L+SIKECYEALV++KFL+LLY+YLNISISKNIVPDEIKGREIHH+FPMTLFQPHS RLNH  LKLL
Subjt:  QKAIVIIILMAPLYAGISYIGLLEAMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREIHHTFPMTLFQPHSARLNHHTLKLL

Query:  KYWTYQFVVIRPVCSILMITFELIDVYPSWLSWTFTIILNVSVSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEH
        KYWT+QFVVIRPVCS+LMI  +L+++YP WLSWTFT+ILNVSVSLALY+LV+FYHVF KEL PH PLAKFLC+KGIVFFCFWQGIVL  L A+GIIK+ H
Subjt:  KYWTYQFVVIRPVCSILMITFELIDVYPSWLSWTFTIILNVSVSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEH

Query:  AWFDVEHINEALQNTLVCVEMVFFALVQMSAYSASPYKPQSAAKSKLEKKKD
         W DV HI +ALQN LV VEMVFFA+ QM AY+A+PYK   AA  K EKKK+
Subjt:  AWFDVEHINEALQNTLVCVEMVFFALVQMSAYSASPYKPQSAAKSKLEKKKD

SwissProt top hitse value%identityAlignment
P45624 Uncharacterized 33.9 kDa protein in glnA 5'region1.1e-6444.72Show/hide
Query:  PYDTLEEVIGKRDSQGLGSLKDFGIEYREVPLAEDGGLDWEKLASSLK-PQTKCALIQRSCGYSWRRSLSVDEIGRAIKLIKMQNPDCLVMVDNCYGEFV
        PYDT+++VIG    +  G+L   GI +  VPL E+GG+D+E+    LK  Q    +IQRS GY  R+S +VD+I +    +K  +P+ LV VDNCYGEF 
Subjt:  PYDTLEEVIGKRDSQGLGSLKDFGIEYREVPLAEDGGLDWEKLASSLK-PQTKCALIQRSCGYSWRRSLSVDEIGRAIKLIKMQNPDCLVMVDNCYGEFV

Query:  ETTEPPTVGADLIAGSLIKNPGGTLAPCGGYVAGRDKWVRAAAARLSAPGLGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVKGMILIAEVMASKGYKVQP
        E  EP   G D  AGSLIKN GG +A  GGY+ G+++ V  AA RL+APG+G + G+T  + M  F++G FL+P   GEA+KGMI  A ++   G +V P
Subjt:  ETTEPPTVGADLIAGSLIKNPGGTLAPCGGYVAGRDKWVRAAAARLSAPGLGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVKGMILIAEVMASKGYKVQP

Query:  LPRAPRHDTVQAVQLESRELLLAFCEAVQRSSPVASFTKPVPGITPGYASEVIFADGTFIDGSTSELSCDGPLREPFAVFCQGG
            PR D +Q +     E ++ F + VQ++SP+ SF +P+P   PGY  +VI A G F+ GST E S DGP+R P+A++ Q G
Subjt:  LPRAPRHDTVQAVQLESRELLLAFCEAVQRSSPVASFTKPVPGITPGYASEVIFADGTFIDGSTSELSCDGPLREPFAVFCQGG

P94479 Uncharacterized protein YnbB4.2e-9645.19Show/hide
Query:  RAVDNLVARNTAKVLKAFQNARLGSHHFGGSTGYGHDEAGGREALDNAFAEIVGAESAIVRSQFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVI
        + ++ +  RN  +VL++++  ++   HF  STGYG+D+  GR+ L++ +A++ G E+ +VR Q  SGTHAI+ ALF +LRPGDELL + G PYDTLEE++
Subjt:  RAVDNLVARNTAKVLKAFQNARLGSHHFGGSTGYGHDEAGGREALDNAFAEIVGAESAIVRSQFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVI

Query:  GKRDSQGLGSLKDFGIEYREVPLAEDGGLDWEKLASSLKPQTKCALIQRSCGYSWRRSLSVDEIGRAIKLIKMQNPDCLVMVDNCYGEFVETTEPPTVGA
        G R  +  GSLKDF I Y  V L +DG +D++ +A+++ P+TK   IQRS GY+ R S  + EI   I+ +K  N + +V VDNCYGEFVE  EP  VGA
Subjt:  GKRDSQGLGSLKDFGIEYREVPLAEDGGLDWEKLASSLKPQTKCALIQRSCGYSWRRSLSVDEIGRAIKLIKMQNPDCLVMVDNCYGEFVETTEPPTVGA

Query:  DLIAGSLIKNPGGTLAPCGGYVAGRDKWVRAAAARLSAPGLGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVKGMILIAEVMASKGYKVQPLPRAPRHDTV
        DL+AGSLIKNPGG LA  GGY+ G+ KW+ A + R+++PG+G ++G++    ++  +QG FL+P +V +++KG +  A  +   G+   P   A R D +
Subjt:  DLIAGSLIKNPGGTLAPCGGYVAGRDKWVRAAAARLSAPGLGVDSGSTPGDIMRTFFQGLFLSPQMVGEAVKGMILIAEVMASKGYKVQPLPRAPRHDTV

Query:  QAVQLESRELLLAFCEAVQRSSPVASFTKPVPGITPGYASEVIFADGTFIDGSTSELSCDGPLREPFAVFCQGG
        Q+V+   RE ++AFC+A+Q +SP+ +   P P   PGY  +VI A GTFI G++ ELS DGP+R P+  + QGG
Subjt:  QAVQLESRELLLAFCEAVQRSSPVASFTKPVPGITPGYASEVIFADGTFIDGSTSELSCDGPLREPFAVFCQGG

Q17QL9 Transmembrane protein 184C3.3e-3230.06Show/hide
Query:  HWTQW----GSHFWVASVVSILAITMHFGQMIFLGVTSS--------VVLTAILSSWLLVQHLSNWKKPTEQKAIVIIILMAPLYAGISYIGLLEAMASS
        +W QW        ++ S+V  + + +   Q + +G+ +         ++LT  +S W+++QHL ++ +P  QK I+ I+ M P+Y+  S+I    A+   
Subjt:  HWTQW----GSHFWVASVVSILAITMHFGQMIFLGVTSS--------VVLTAILSSWLLVQHLSNWKKPTEQKAIVIIILMAPLYAGISYIGLLEAMASS

Query:  TFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREIHHTFPMTLFQPHSARLNHHTLKLLKYWTYQFVVIRPVCSILMITFELIDVYPS
        +  +++++ +ECYEA VI  F+  L +YL       ++  E K ++ H   P+    P +  +    L   K    Q+ V+RP  +I+ +  EL+D+Y  
Subjt:  TFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREIHHTFPMTLFQPHSARLNHHTLKLLKYWTYQFVVIRPVCSILMITFELIDVYPS

Query:  W-----LSWTFTIIL-NVSVSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHA--WFDVEHINEALQNTLVCVEM
               +WT+ +I+ N+S   A+Y L++FY V  +EL P  P+ KFLC+K +VF  FWQ +V+ +L  VG+I  +H   W  VE +   LQ+ ++C+EM
Subjt:  W-----LSWTFTIIL-NVSVSLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHA--WFDVEHINEALQNTLVCVEM

Query:  VFFALVQMSAYSASPY
           A+     +S  PY
Subjt:  VFFALVQMSAYSASPY

Q3TPR7 Transmembrane protein 184C5.6e-3231.36Show/hide
Query:  LAITMHFGQMIFLGVTSSVVLTAILSSWLLVQHLSNWKKPTEQKAIVIIILMAPLYAGISYIGLLEAMASSTFFLFLESIKECYEALVISKFLSLLYSYL
        + + MH       G+   ++LT  +S W ++QHL ++ +P  QK I+ I+ M P+Y+  S++ L+    +    +++++ +ECYEA VI  F+  L +YL
Subjt:  LAITMHFGQMIFLGVTSSVVLTAILSSWLLVQHLSNWKKPTEQKAIVIIILMAPLYAGISYIGLLEAMASSTFFLFLESIKECYEALVISKFLSLLYSYL

Query:  NISISKNIVPDEIKGREIHHTFPMTLFQPHSARLNHHTLKLLKYWTYQFVVIRPVCSILMITFELIDVYPSW-----LSWTFTIIL-NVSVSLALYSLVV
         I     I+  E K ++ +H  P+    P +  +    L   K    Q+ V+RP+ ++  +  E++DVY         +WT+ +IL N+S   A+Y L++
Subjt:  NISISKNIVPDEIKGREIHHTFPMTLFQPHSARLNHHTLKLLKYWTYQFVVIRPVCSILMITFELIDVYPSW-----LSWTFTIIL-NVSVSLALYSLVV

Query:  FYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGII--KAEHAWFDVEHINEALQNTLVCVEMVFFALVQMSAYSASPY
        FY V  +EL P  P+ KFLC+K +VF  FWQ +++ +L  +G+I  K    W   E +   LQ+ ++C+EM F A+     +S  PY
Subjt:  FYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGII--KAEHAWFDVEHINEALQNTLVCVEMVFFALVQMSAYSASPY

Q5ZMP3 Transmembrane protein 184C3.6e-3133.46Show/hide
Query:  VVLTAILSSWLLVQHLSNWKKPTEQKAIVIIILMAPLYAGISYIGLLEAMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREI
        +++T  +S W ++QHL ++ +P  QK I+ I+ M P+Y+  S+I    A+      +++++ +ECYEA VI  F+  L +YL       ++  E K ++ 
Subjt:  VVLTAILSSWLLVQHLSNWKKPTEQKAIVIIILMAPLYAGISYIGLLEAMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGREI

Query:  HHTFPMTLFQPHSARLNHHTLKLLKYWTYQFVVIRPVCSILMITFELIDVYPSW-----LSWTFTIIL-NVSVSLALYSLVVFYHVFDKELKPHSPLAKF
        H   P     P S  +    L   K    Q+ V+RP  +I+ +  EL+ VY         +WT+ +IL N+S   A+Y LV+FY V  +EL P  P+ KF
Subjt:  HHTFPMTLFQPHSARLNHHTLKLLKYWTYQFVVIRPVCSILMITFELIDVYPSW-----LSWTFTIIL-NVSVSLALYSLVVFYHVFDKELKPHSPLAKF

Query:  LCIKGIVFFCFWQGIVLEMLAAVGIIKAEHA--WFDVEHINEALQNTLVCVEMVFFALVQMSAYSASPY
        LC+K +VF  FWQ +++ +L  VG+I  +H   W  VE +   LQ+ ++CVEM   A+    ++S  PY
Subjt:  LCIKGIVFFCFWQGIVLEMLAAVGIIKAEHA--WFDVEHINEALQNTLVCVEMVFFALVQMSAYSASPY

Arabidopsis top hitse value%identityAlignment
AT1G11200.1 Protein of unknown function (DUF300)8.7e-9758.48Show/hide
Query:  TMHFGQMIFLGVTSSVVLTAILSSWLLVQHLSNWKKPTEQKAIVIIILMAPLYAGISYIGLLEAMASSTFFLFLESIKECYEALVISKFLSLLYSYLNIS
        T+   ++  +G    V+L+   +  L+ QHL  WKKP EQ+AI+II+LMAP+YA  S++GLL+A  S  FF+FL+++KECYEALVI+KFL+L+YSY+NIS
Subjt:  TMHFGQMIFLGVTSSVVLTAILSSWLLVQHLSNWKKPTEQKAIVIIILMAPLYAGISYIGLLEAMASSTFFLFLESIKECYEALVISKFLSLLYSYLNIS

Query:  ISKNIVPDEIKGREIHHTFPMTLFQPHSARLNHHTLKLLKYWTYQFVVIRPVCSILMITFELIDVYPSWLSWTFTIILNVSVSLALYSLVVFYHVFDKEL
        +S  I+PDE KGREIHH+FPMTLF P +  L++ TLK LK WT+QF +IRPVCSILMIT +++ +YP WLSW FT ILNVSVSLALYSLV FYHVF KEL
Subjt:  ISKNIVPDEIKGREIHHTFPMTLFQPHSARLNHHTLKLLKYWTYQFVVIRPVCSILMITFELIDVYPSWLSWTFTIILNVSVSLALYSLVVFYHVFDKEL

Query:  KPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFALVQMSAYSASPYKPQSAAKSKLEKK
        +PH PL KF+C+KGIVFFCFWQGIVL++L  +G+IK+ H W +V+ + EALQN LVC+EM+ F+++Q  A+  +PY  ++ AK +  K+
Subjt:  KPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFALVQMSAYSASPYKPQSAAKSKLEKK

AT1G23070.1 Protein of unknown function (DUF300)4.9e-2329.93Show/hide
Query:  MHFGQMIFLGVTSSVVLTAILSSWLLVQHLSNWKKPTEQKAIVIIILMAPLYAGISYIGLLEAMASSTFFLFLESIKECYEALVISKFLSLLYS------
        +H   +I  G  ++V +   LS + ++QHL  +  P EQK IV ++ M P+YA  S I L    ++S F L  + ++ CYEA  +  F S L +      
Subjt:  MHFGQMIFLGVTSSVVLTAILSSWLLVQHLSNWKKPTEQKAIVIIILMAPLYAGISYIGLLEAMASSTFFLFLESIKECYEALVISKFLSLLYS------

Query:  ----YLNISISKNIVPD---EIKGREIHHTFPMTLFQPHSARLNHHTLKLLKYWTYQFVVIRPVCSILMITFELIDVY-PSWLSWTF-----TIILNVSV
            YL     K ++ +   E K ++  ++F   L  P+   L      + K+   Q+++++  C+ L    EL+ VY      W +      ++LN S 
Subjt:  ----YLNISISKNIVPD---EIKGREIHHTFPMTLFQPHSARLNHHTLKLLKYWTYQFVVIRPVCSILMITFELIDVY-PSWLSWTF-----TIILNVSV

Query:  SLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFALVQMSAYSASPY
          AL+ LV FY+V  + LK   PLAKF+  K IVF  +WQG  + +L   GI+  E            LQ+ L+C+EM   A+  +  + A PY
Subjt:  SLALYSLVVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFALVQMSAYSASPY

AT1G77220.1 Protein of unknown function (DUF300)1.0e-2829.86Show/hide
Query:  LGVTSSVVLTAILSSWLLVQHLSNWKKPTEQKAIVIIILMAPLYAGISYIGLLEAMASSTFFLFLESIKECYEALVISKFLSLLYSYLN-----------
        L  +  VV+  +L  +L+ +HL+++ +P EQK ++ +ILM P+YA  S++ L+ + A+       E I++CYEA  +  F   L + L+           
Subjt:  LGVTSSVVLTAILSSWLLVQHLSNWKKPTEQKAIVIIILMAPLYAGISYIGLLEAMASSTFFLFLESIKECYEALVISKFLSLLYSYLN-----------

Query:  ---ISISKNIVPDEIKGREIHHTFPMTLFQPHSARLNHHTLKLLKYWTYQFVVIRPVCSILMITFELIDVY-PSWLSWTF-----TIILNVSVSLALYSL
           I+ S  ++        + H FPM  F    + L       +K    Q+++++ +C++L +  E   VY     +W +      ++LN S + ALY L
Subjt:  ---ISISKNIVPDEIKGREIHHTFPMTLFQPHSARLNHHTLKLLKYWTYQFVVIRPVCSILMITFELIDVY-PSWLSWTF-----TIILNVSVSLALYSL

Query:  VVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFALVQMSAYSASPYK
        V FY+V   +L P  PLAKFL  K IVF  +WQGI++  L ++G++K   A    + +   +Q+ ++C+EM   A+V +  + A+PYK
Subjt:  VVFYHVFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFALVQMSAYSASPYK

AT4G21570.1 Protein of unknown function (DUF300)1.4e-10769.58Show/hide
Query:  QMIFLGVTSSVVLTAILSSWLLVQHLSNWKKPTEQKAIVIIILMAPLYAGISYIGLLEAMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNI
        Q+ F     SV+LT   +  L+ QHL +WK P EQKAI+II+LMAP+YA +S+IGLLE   S TFFLFLESIKECYEALVI+KFL+L+YSYLNIS+SKNI
Subjt:  QMIFLGVTSSVVLTAILSSWLLVQHLSNWKKPTEQKAIVIIILMAPLYAGISYIGLLEAMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNI

Query:  VPDEIKGREIHHTFPMTLFQPHSARLNHHTLKLLKYWTYQFVVIRPVCSILMITFELIDVYPSWLSWTFTIILNVSVSLALYSLVVFYHVFDKELKPHSP
        +PD IKGREIHH+FPMTLFQPH  RL+ HTLKLLKYWT+QFVVIRPVCS LMI  +LI  YPSWLSWTFTII+N SVSLALYSLV+FYHVF KEL PH+P
Subjt:  VPDEIKGREIHHTFPMTLFQPHSARLNHHTLKLLKYWTYQFVVIRPVCSILMITFELIDVYPSWLSWTFTIILNVSVSLALYSLVVFYHVFDKELKPHSP

Query:  LAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFALVQMSAYSASPYKPQSAAKSKLEKKKD
        LAKFLCIKGIVFF FWQGI L++L A+G IK+ H W +VE I EA+QN LVC+EMV FA VQ  AY A PY  ++  K KL+KK +
Subjt:  LAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFALVQMSAYSASPYKPQSAAKSKLEKKKD

AT4G38360.2 Protein of unknown function (DUF300)2.8e-2330.14Show/hide
Query:  VVLTAILSSWLLVQHLSNWKKPTEQKAIVIIILMAPLYAGISYIGLLEAMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGRE-
        +VLT  LS +L+  HLS +K P EQK ++ +ILM P Y+  S+  L++   S    +    +++CYE+  +  F   L + +     + I   E +GR+ 
Subjt:  VVLTAILSSWLLVQHLSNWKKPTEQKAIVIIILMAPLYAGISYIGLLEAMASSTFFLFLESIKECYEALVISKFLSLLYSYLNISISKNIVPDEIKGRE-

Query:  --------------IHHTFPMTLFQPHSARLNHHTLKLLKYWTYQFVVIRPVCSILMITFELIDVY-PSWLSW-----TFTIILNVSVSLALYSLVVFYH
                      I H FPM LF     RL+    +++K+   Q+++I+ + ++  +  E   VY      W        ++LN S S ALY LV FY 
Subjt:  --------------IHHTFPMTLFQPHSARLNHHTLKLLKYWTYQFVVIRPVCSILMITFELIDVY-PSWLSW-----TFTIILNVSVSLALYSLVVFYH

Query:  VFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFALVQMSAYSASPY
            EL    PLAKFL  K IVF  +WQG+ + +L+++G+ K+  A      +  ++Q+ ++C+EM   ++V +  + A PY
Subjt:  VFDKELKPHSPLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFALVQMSAYSASPY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGGGGCTTATCCTGCAGCTTCCTGCCTTATCCTTCGCCTAATTTCGGAGCTTCACCACCGGCGGCAACCCTTCACTCCGGCGCTTCTCTACCGGTGCCCGTCGATCG
GAAACATTACACTTCCGACAGCCCATTTGCTCCAGAGGTTGTTAAGGCGGTAGACTCCTTGCAGTATGAATTCAGGGCAGTGGATAATTTGGTGGCACGTAATACTGCTA
AAGTTCTCAAAGCCTTTCAGAATGCTCGGTTAGGATCTCATCATTTTGGAGGATCCACTGGTTATGGTCATGATGAAGCTGGAGGACGTGAGGCACTTGACAATGCTTTT
GCCGAGATAGTTGGAGCTGAATCTGCAATAGTCCGATCACAGTTTTTCTCAGGTACTCATGCTATTACGTGTGCCTTATTTGCGCTTTTGAGGCCAGGAGATGAGCTTTT
GGCAGTAGCTGGTGCTCCATATGACACATTGGAGGAGGTCATTGGAAAAAGAGATTCTCAGGGGCTGGGTTCCTTGAAAGATTTTGGAATTGAGTATCGAGAAGTTCCAC
TTGCTGAGGACGGTGGACTGGACTGGGAAAAACTTGCAAGTTCTTTAAAACCTCAAACAAAATGTGCACTAATACAAAGATCATGTGGTTATTCTTGGCGTCGAAGTTTA
AGTGTGGACGAAATAGGAAGAGCAATAAAACTGATCAAGATGCAAAACCCTGATTGCTTGGTGATGGTGGATAACTGTTATGGAGAATTTGTGGAAACCACTGAACCTCC
AACTGTGGGCGCGGACTTAATTGCAGGAAGTTTGATAAAAAATCCTGGTGGAACACTCGCACCTTGTGGGGGATATGTCGCAGGTCGAGACAAATGGGTGAGAGCAGCAG
CAGCTCGTTTATCTGCACCCGGCTTGGGCGTGGATTCGGGCTCTACCCCTGGCGATATCATGAGGACATTCTTTCAAGGGTTATTCCTTTCACCTCAAATGGTTGGTGAG
GCAGTTAAGGGGATGATCCTAATAGCCGAAGTCATGGCATCAAAAGGCTACAAAGTGCAACCACTTCCACGCGCTCCCCGCCACGACACTGTACAGGCAGTACAACTTGA
AAGTCGCGAACTTTTGCTTGCATTTTGTGAGGCCGTGCAGAGAAGCTCTCCTGTCGCTTCGTTTACTAAACCAGTTCCGGGAATAACTCCTGGATATGCATCAGAGGTGA
TCTTTGCTGATGGAACTTTCATTGATGGGAGCACAAGTGAACTTTCTTGTGATGGACCTTTGAGGGAGCCATTTGCAGTCTTTTGCCAGGGTGGAACACATTGGACCCAG
TGGGGATCTCATTTCTGGGTAGCCTCAGTGGTTAGCATATTAGCAATCACAATGCATTTTGGACAGATGATTTTTCTGGGAGTTACTTCCTCTGTTGTTCTTACAGCAAT
ACTTTCATCATGGCTCCTTGTCCAACATCTGTCCAACTGGAAAAAACCAACGGAGCAAAAGGCCATTGTTATTATAATTCTTATGGCTCCTTTATATGCTGGTATCTCCT
ACATTGGTCTGTTGGAAGCTATGGCTAGCAGTACTTTCTTTTTGTTCTTGGAATCAATTAAGGAATGTTATGAGGCTTTGGTGATATCTAAGTTCTTGAGTTTACTCTAC
AGCTACTTGAATATATCCATAAGCAAAAACATTGTGCCAGATGAGATCAAAGGCAGAGAAATCCACCATACTTTTCCAATGACCCTTTTTCAGCCTCACAGTGCTCGATT
GAATCATCACACGTTGAAGCTTCTCAAGTACTGGACCTATCAATTTGTGGTGATTCGTCCTGTATGTTCCATCTTGATGATCACTTTTGAACTAATTGATGTATATCCAA
GCTGGCTCAGTTGGACATTTACTATCATATTAAATGTCTCAGTGTCGCTTGCTTTGTATTCCTTGGTGGTTTTCTATCATGTATTTGATAAGGAGTTGAAACCACATAGC
CCCCTTGCGAAGTTCTTGTGCATCAAAGGGATTGTCTTTTTCTGCTTCTGGCAGGGAATCGTTCTTGAGATGCTTGCTGCAGTGGGCATAATCAAAGCGGAACATGCCTG
GTTTGATGTGGAGCACATAAATGAAGCCTTACAAAATACTCTGGTTTGTGTGGAGATGGTTTTCTTTGCGCTCGTCCAGATGTCTGCATACAGTGCTAGCCCGTATAAAC
CTCAATCTGCAGCAAAATCTAAATTGGAGAAGAAGAAAGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGTGGGGCTTATCCTGCAGCTTCCTGCCTTATCCTTCGCCTAATTTCGGAGCTTCACCACCGGCGGCAACCCTTCACTCCGGCGCTTCTCTACCGGTGCCCGTCGATCG
GAAACATTACACTTCCGACAGCCCATTTGCTCCAGAGGTTGTTAAGGCGGTAGACTCCTTGCAGTATGAATTCAGGGCAGTGGATAATTTGGTGGCACGTAATACTGCTA
AAGTTCTCAAAGCCTTTCAGAATGCTCGGTTAGGATCTCATCATTTTGGAGGATCCACTGGTTATGGTCATGATGAAGCTGGAGGACGTGAGGCACTTGACAATGCTTTT
GCCGAGATAGTTGGAGCTGAATCTGCAATAGTCCGATCACAGTTTTTCTCAGGTACTCATGCTATTACGTGTGCCTTATTTGCGCTTTTGAGGCCAGGAGATGAGCTTTT
GGCAGTAGCTGGTGCTCCATATGACACATTGGAGGAGGTCATTGGAAAAAGAGATTCTCAGGGGCTGGGTTCCTTGAAAGATTTTGGAATTGAGTATCGAGAAGTTCCAC
TTGCTGAGGACGGTGGACTGGACTGGGAAAAACTTGCAAGTTCTTTAAAACCTCAAACAAAATGTGCACTAATACAAAGATCATGTGGTTATTCTTGGCGTCGAAGTTTA
AGTGTGGACGAAATAGGAAGAGCAATAAAACTGATCAAGATGCAAAACCCTGATTGCTTGGTGATGGTGGATAACTGTTATGGAGAATTTGTGGAAACCACTGAACCTCC
AACTGTGGGCGCGGACTTAATTGCAGGAAGTTTGATAAAAAATCCTGGTGGAACACTCGCACCTTGTGGGGGATATGTCGCAGGTCGAGACAAATGGGTGAGAGCAGCAG
CAGCTCGTTTATCTGCACCCGGCTTGGGCGTGGATTCGGGCTCTACCCCTGGCGATATCATGAGGACATTCTTTCAAGGGTTATTCCTTTCACCTCAAATGGTTGGTGAG
GCAGTTAAGGGGATGATCCTAATAGCCGAAGTCATGGCATCAAAAGGCTACAAAGTGCAACCACTTCCACGCGCTCCCCGCCACGACACTGTACAGGCAGTACAACTTGA
AAGTCGCGAACTTTTGCTTGCATTTTGTGAGGCCGTGCAGAGAAGCTCTCCTGTCGCTTCGTTTACTAAACCAGTTCCGGGAATAACTCCTGGATATGCATCAGAGGTGA
TCTTTGCTGATGGAACTTTCATTGATGGGAGCACAAGTGAACTTTCTTGTGATGGACCTTTGAGGGAGCCATTTGCAGTCTTTTGCCAGGGTGGAACACATTGGACCCAG
TGGGGATCTCATTTCTGGGTAGCCTCAGTGGTTAGCATATTAGCAATCACAATGCATTTTGGACAGATGATTTTTCTGGGAGTTACTTCCTCTGTTGTTCTTACAGCAAT
ACTTTCATCATGGCTCCTTGTCCAACATCTGTCCAACTGGAAAAAACCAACGGAGCAAAAGGCCATTGTTATTATAATTCTTATGGCTCCTTTATATGCTGGTATCTCCT
ACATTGGTCTGTTGGAAGCTATGGCTAGCAGTACTTTCTTTTTGTTCTTGGAATCAATTAAGGAATGTTATGAGGCTTTGGTGATATCTAAGTTCTTGAGTTTACTCTAC
AGCTACTTGAATATATCCATAAGCAAAAACATTGTGCCAGATGAGATCAAAGGCAGAGAAATCCACCATACTTTTCCAATGACCCTTTTTCAGCCTCACAGTGCTCGATT
GAATCATCACACGTTGAAGCTTCTCAAGTACTGGACCTATCAATTTGTGGTGATTCGTCCTGTATGTTCCATCTTGATGATCACTTTTGAACTAATTGATGTATATCCAA
GCTGGCTCAGTTGGACATTTACTATCATATTAAATGTCTCAGTGTCGCTTGCTTTGTATTCCTTGGTGGTTTTCTATCATGTATTTGATAAGGAGTTGAAACCACATAGC
CCCCTTGCGAAGTTCTTGTGCATCAAAGGGATTGTCTTTTTCTGCTTCTGGCAGGGAATCGTTCTTGAGATGCTTGCTGCAGTGGGCATAATCAAAGCGGAACATGCCTG
GTTTGATGTGGAGCACATAAATGAAGCCTTACAAAATACTCTGGTTTGTGTGGAGATGGTTTTCTTTGCGCTCGTCCAGATGTCTGCATACAGTGCTAGCCCGTATAAAC
CTCAATCTGCAGCAAAATCTAAATTGGAGAAGAAGAAAGATTGA
Protein sequenceShow/hide protein sequence
MWGLSCSFLPYPSPNFGASPPAATLHSGASLPVPVDRKHYTSDSPFAPEVVKAVDSLQYEFRAVDNLVARNTAKVLKAFQNARLGSHHFGGSTGYGHDEAGGREALDNAF
AEIVGAESAIVRSQFFSGTHAITCALFALLRPGDELLAVAGAPYDTLEEVIGKRDSQGLGSLKDFGIEYREVPLAEDGGLDWEKLASSLKPQTKCALIQRSCGYSWRRSL
SVDEIGRAIKLIKMQNPDCLVMVDNCYGEFVETTEPPTVGADLIAGSLIKNPGGTLAPCGGYVAGRDKWVRAAAARLSAPGLGVDSGSTPGDIMRTFFQGLFLSPQMVGE
AVKGMILIAEVMASKGYKVQPLPRAPRHDTVQAVQLESRELLLAFCEAVQRSSPVASFTKPVPGITPGYASEVIFADGTFIDGSTSELSCDGPLREPFAVFCQGGTHWTQ
WGSHFWVASVVSILAITMHFGQMIFLGVTSSVVLTAILSSWLLVQHLSNWKKPTEQKAIVIIILMAPLYAGISYIGLLEAMASSTFFLFLESIKECYEALVISKFLSLLY
SYLNISISKNIVPDEIKGREIHHTFPMTLFQPHSARLNHHTLKLLKYWTYQFVVIRPVCSILMITFELIDVYPSWLSWTFTIILNVSVSLALYSLVVFYHVFDKELKPHS
PLAKFLCIKGIVFFCFWQGIVLEMLAAVGIIKAEHAWFDVEHINEALQNTLVCVEMVFFALVQMSAYSASPYKPQSAAKSKLEKKKD