CuGenDBv2

Sgr024208 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr024208
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: hydrolase family protein / HAD-superfamily protein
Genome location: tig00001047:4142262..4157766
RNA-Seq Expression: Sgr024208
Synteny: Sgr024208
Gene Ontology terms:
GO:0046474 - glycerophospholipid biosynthetic process (biological process)
GO:0005739 - mitochondrion (cellular component)
GO:0005515 - protein binding (molecular function)
GO:0016787 - hydrolase activity (molecular function)
InterPro domains:
IPR006353 - HAD-superfamily hydrolase, subfamily IIA, CECR5
IPR006357 - HAD-superfamily hydrolase, subfamily IIA
IPR006594 - LIS1 homology motif
IPR023214 - HAD superfamily
IPR036412 - HAD-like superfamily
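
As an illustration, the genome location string in the record above can be split into contig, start, and end. This is only a minimal sketch: the names `parse_location` and `span_length` are hypothetical (they are not part of CuGenDB), and the code assumes the `contig:start..end` coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re

# Hypothetical helper, not part of CuGenDB: matches strings such as
# "tig00001047:4142262..4157766".
LOCATION_RE = re.compile(r"^(?P<contig>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(location):
    """Split 'contig:start..end' into (contig, start, end)."""
    match = LOCATION_RE.match(location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match["contig"], int(match["start"]), int(match["end"])


def span_length(start, end):
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


contig, start, end = parse_location("tig00001047:4142262..4157766")
print(contig, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the gene spans 15,505 bp on contig tig00001047.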


Homology
GenBank top hits (e value, % identity, alignment)
KAG6578738.1 hypothetical protein SDJN03_23186, partial [Cucurbita argyrosperma subsp. sororia] (e value: 2.5e-166; identity: 83.1%)
Query:  MKILTVFRVSQGKNREPFLFQSQASAS-RSFSRIQSRPQRSSFGIAFDIDGVVLRGNLPIGGSPRALRRLYADSTSSGTLKVPFLFLTNGGGTPESRRAI
        MKILTVFRVSQG  R    FQSQASA   SFS IQSR +RSSFGIAFDIDGVVLRG+ PIGGS +AL+RLYADSTSSGTLKVPFLFLTNGGGTPES+RAI
Subjt:  MKILTVFRVSQGKNREPFLFQSQASAS-RSFSRIQSRPQRSSFGIAFDIDGVVLRGNLPIGGSPRALRRLYADSTSSGTLKVPFLFLTNGGGTPESRRAI

Query:  ELSDLLGVNILPSQVVQGHSTFKSLLNSFENEFIIATGKGQPALVMSEYGFKKVFSIDEYASFFENIDPVSQYKNWATKQGFVLSCNSEKLMRRQSVLSE
        ELS+LLGVN+LPSQVVQGHS+F+SLLNSFENE IIATGKGQPALVMSEYGFKKVFS+DEYASFFENIDPVSQYKNW ++Q F  +CN EKLMR+QSVLSE
Subjt:  ELSDLLGVNILPSQVVQGHSTFKSLLNSFENEFIIATGKGQPALVMSEYGFKKVFSIDEYASFFENIDPVSQYKNWATKQGFVLSCNSEKLMRRQSVLSE

Query:  RVKAAFVVSDPVDWGRDIQVLCDILRSGGLPGRENGIQVPLYFAADDFEYQAAFPFKRLGIGAFRIALESVFNRIHHRPLEYVSYGKPNPLVFKIVETVL
        RVKAAFVVSDPVDWGRDIQVLCD+LRSGGLPG+ENG QVPLYF+ADDFEYQAAFP KRLGIGAF+IALES+FNRIHH+PLEYVSYGKPNPLVFK VETVL
Subjt:  RVKAAFVVSDPVDWGRDIQVLCDILRSGGLPGRENGIQVPLYFAADDFEYQAAFPFKRLGIGAFRIALESVFNRIHHRPLEYVSYGKPNPLVFKIVETVL

Query:  RQILSSHCDDHFVNNGDTEFNHFKTLYMIGDNPVVDIKGAREDTLVFYS--NKDGVFRAKK
        RQILSSHCDDHFVN  D EFN F+TLYMIGDNPVVDIKGARE    ++S   + GVFR ++
Subjt:  RQILSSHCDDHFVNNGDTEFNHFKTLYMIGDNPVVDIKGAREDTLVFYS--NKDGVFRAKK

KAG7016268.1 hypothetical protein SDJN02_21374 [Cucurbita argyrosperma subsp. argyrosperma] (e value: 3.3e-166; identity: 83.1%)
Query:  MKILTVFRVSQGKNREPFLFQSQASAS-RSFSRIQSRPQRSSFGIAFDIDGVVLRGNLPIGGSPRALRRLYADSTSSGTLKVPFLFLTNGGGTPESRRAI
        MKILTVFRVSQG  R    FQSQASA   SFS IQSR +RSSFGIAFDIDGVVLRG+ PIGGS +AL+RLYADSTSSGTLKVPFLFLTNGGGTPES+RAI
Subjt:  MKILTVFRVSQGKNREPFLFQSQASAS-RSFSRIQSRPQRSSFGIAFDIDGVVLRGNLPIGGSPRALRRLYADSTSSGTLKVPFLFLTNGGGTPESRRAI

Query:  ELSDLLGVNILPSQVVQGHSTFKSLLNSFENEFIIATGKGQPALVMSEYGFKKVFSIDEYASFFENIDPVSQYKNWATKQGFVLSCNSEKLMRRQSVLSE
        ELS+LLGVN+LPSQVVQGHS+F+SLLNSFENE IIATGKGQPALVMSEYGFKKVFS+DEYASFFENIDPVSQYKNW ++Q F  +CN EKLMR+Q VLSE
Subjt:  ELSDLLGVNILPSQVVQGHSTFKSLLNSFENEFIIATGKGQPALVMSEYGFKKVFSIDEYASFFENIDPVSQYKNWATKQGFVLSCNSEKLMRRQSVLSE

Query:  RVKAAFVVSDPVDWGRDIQVLCDILRSGGLPGRENGIQVPLYFAADDFEYQAAFPFKRLGIGAFRIALESVFNRIHHRPLEYVSYGKPNPLVFKIVETVL
        RVKAAFVVSDPVDWGRDIQVLCD+LRSGGLPG+ENG QVPLYF+ADDFEYQAAFP KRLGIGAF+IALES+FNRIHHRPLEYVSYGKPNPLVFK VETVL
Subjt:  RVKAAFVVSDPVDWGRDIQVLCDILRSGGLPGRENGIQVPLYFAADDFEYQAAFPFKRLGIGAFRIALESVFNRIHHRPLEYVSYGKPNPLVFKIVETVL

Query:  RQILSSHCDDHFVNNGDTEFNHFKTLYMIGDNPVVDIKGAREDTLVFYS--NKDGVFRAKK
        RQILSSHCDDHFVN  D EFN F+TLYMIGDNPVVDIKGARE    ++S   + GVFR ++
Subjt:  RQILSSHCDDHFVNNGDTEFNHFKTLYMIGDNPVVDIKGAREDTLVFYS--NKDGVFRAKK

XP_022938837.1 uncharacterized protein YKR070W-like [Cucurbita moschata] (e value: 1.5e-166; identity: 83.38%)
Query:  MKILTVFRVSQGKNREPFLFQSQASAS-RSFSRIQSRPQRSSFGIAFDIDGVVLRGNLPIGGSPRALRRLYADSTSSGTLKVPFLFLTNGGGTPESRRAI
        MKILTVFRVSQG  R    FQSQASA   SFS IQSR +RSSFGIAFDIDGVVLRG+ PIGGS +AL+RLYADSTSSGTLKVPFLFLTNGGGTPES+RAI
Subjt:  MKILTVFRVSQGKNREPFLFQSQASAS-RSFSRIQSRPQRSSFGIAFDIDGVVLRGNLPIGGSPRALRRLYADSTSSGTLKVPFLFLTNGGGTPESRRAI

Query:  ELSDLLGVNILPSQVVQGHSTFKSLLNSFENEFIIATGKGQPALVMSEYGFKKVFSIDEYASFFENIDPVSQYKNWATKQGFVLSCNSEKLMRRQSVLSE
        ELS+LLGVN+LPSQVVQGHS+F+SLLNSFENE IIATGKGQPALVMSEYGFKKVFS+DEYASFFENIDPVSQYKNW ++Q F  +CN EKLMR+QSVLSE
Subjt:  ELSDLLGVNILPSQVVQGHSTFKSLLNSFENEFIIATGKGQPALVMSEYGFKKVFSIDEYASFFENIDPVSQYKNWATKQGFVLSCNSEKLMRRQSVLSE

Query:  RVKAAFVVSDPVDWGRDIQVLCDILRSGGLPGRENGIQVPLYFAADDFEYQAAFPFKRLGIGAFRIALESVFNRIHHRPLEYVSYGKPNPLVFKIVETVL
        RVKAAFVVSDPVDWGRDIQVLCD+LRSGGLPG+ENG QVPLYFAADDFEYQAAFP KRLGIGAF+IALES+FNRIHHRPLEYVSYGKPNPLVFK VETVL
Subjt:  RVKAAFVVSDPVDWGRDIQVLCDILRSGGLPGRENGIQVPLYFAADDFEYQAAFPFKRLGIGAFRIALESVFNRIHHRPLEYVSYGKPNPLVFKIVETVL

Query:  RQILSSHCDDHFVNNGDTEFNHFKTLYMIGDNPVVDIKGAREDTLVFYS--NKDGVFRAKK
        RQILSSHCDDHFVN  D EFN F+T YMIGDNPVVDIKGARE    ++S   + GVFR ++
Subjt:  RQILSSHCDDHFVNNGDTEFNHFKTLYMIGDNPVVDIKGAREDTLVFYS--NKDGVFRAKK

XP_022993080.1 uncharacterized protein YKR070W-like [Cucurbita maxima] (e value: 5.9e-168; identity: 84.21%)
Query:  MKILTVFRVSQGKNREPFLFQSQASAS-RSFSRIQSRPQRSSFGIAFDIDGVVLRGNLPIGGSPRALRRLYADSTSSGTLKVPFLFLTNGGGTPESRRAI
        MKILTVFRVSQG  R    FQSQASA   SFS IQSR +RSSFGIAFDIDGVVLRG+ PIGGS +AL+RLYADSTSSGTLKVPFLFLTNGGGTPESRRAI
Subjt:  MKILTVFRVSQGKNREPFLFQSQASAS-RSFSRIQSRPQRSSFGIAFDIDGVVLRGNLPIGGSPRALRRLYADSTSSGTLKVPFLFLTNGGGTPESRRAI

Query:  ELSDLLGVNILPSQVVQGHSTFKSLLNSFENEFIIATGKGQPALVMSEYGFKKVFSIDEYASFFENIDPVSQYKNWATKQGFVLSCNSEKLMRRQSVLSE
        ELS+LLGVN+LPSQVVQGHS+F+SLLNSFENE IIATGKGQPALVMSEYGFKKVFS+DEYASFFENIDPVSQYKNW ++Q F L+CN EKLMR+QSVLSE
Subjt:  ELSDLLGVNILPSQVVQGHSTFKSLLNSFENEFIIATGKGQPALVMSEYGFKKVFSIDEYASFFENIDPVSQYKNWATKQGFVLSCNSEKLMRRQSVLSE

Query:  RVKAAFVVSDPVDWGRDIQVLCDILRSGGLPGRENGIQVPLYFAADDFEYQAAFPFKRLGIGAFRIALESVFNRIHHRPLEYVSYGKPNPLVFKIVETVL
        RVKAAFVVSDPVDWGRDIQVLCD+LRSGGLPG+ENG QVPLYFAADDFEYQAAFPFKRLGIGAF+IALES+FNRIHHRPLEYVSYGKPNPLVFK VETVL
Subjt:  RVKAAFVVSDPVDWGRDIQVLCDILRSGGLPGRENGIQVPLYFAADDFEYQAAFPFKRLGIGAFRIALESVFNRIHHRPLEYVSYGKPNPLVFKIVETVL

Query:  RQILSSHCDDHFVNNGDTEFNHFKTLYMIGDNPVVDIKGAREDTLVFYS--NKDGVFRAKK
        RQILSSHCDDH VN  D EFN F+TLYMIGDNPVVDIKGARE    ++S   + GVFR ++
Subjt:  RQILSSHCDDHFVNNGDTEFNHFKTLYMIGDNPVVDIKGAREDTLVFYS--NKDGVFRAKK

XP_023549962.1 uncharacterized protein YKR070W-like isoform X1 [Cucurbita pepo subsp. pepo] (e value: 2.0e-168; identity: 81.87%)
Query:  MKILTVFRVSQGKNREPFLFQSQASAS-RSFSRIQSRPQRSSFGIAFDIDGVVLRGNLPIGGSPRALRRLYADSTSSGTLKVPFLFLTNGGGTPESRRAI
        MKILTVFRVSQG  R    FQS ASA   SFS IQSR +RSSFGIAFDIDGVVLRG+ PIGGS +AL+RLYADSTSSGTLKVPFLFLTNGGGTPESRRAI
Subjt:  MKILTVFRVSQGKNREPFLFQSQASAS-RSFSRIQSRPQRSSFGIAFDIDGVVLRGNLPIGGSPRALRRLYADSTSSGTLKVPFLFLTNGGGTPESRRAI

Query:  ELSDLLGVNILPSQVVQGHSTFKSLLNSFENEFIIATGKGQPALVMSEYGFKKVFSIDEYASFFENIDPVSQYKNWATKQGFVLSCNSEKLMRRQSVLSE
        ELS+LLGVN+LPSQVVQGHS+F+SLLNSFENE IIATGKGQPALVMSEYGFKKVFS+DEYASFFENIDPVSQYKNW ++Q F L+CN EKLMR+QSVLSE
Subjt:  ELSDLLGVNILPSQVVQGHSTFKSLLNSFENEFIIATGKGQPALVMSEYGFKKVFSIDEYASFFENIDPVSQYKNWATKQGFVLSCNSEKLMRRQSVLSE

Query:  RVKAAFVVSDPVDWGRDIQVLCDILRSGGLPGRENGIQVPLYFAADDFEYQAAFPFKRLGIGAFRIALESVFNRIHHRPLEYVSYGKPNPLVFKIVETVL
        RVKAAFVVSDPVDWGRDIQVLCD+LRSGGLPG+ENG QVPLYFAADDFEYQAAFPFKRLGIGAF+IALES+FNRIHHRPLEYVSYGKPNPLVFK VETVL
Subjt:  RVKAAFVVSDPVDWGRDIQVLCDILRSGGLPGRENGIQVPLYFAADDFEYQAAFPFKRLGIGAFRIALESVFNRIHHRPLEYVSYGKPNPLVFKIVETVL

Query:  RQILSSHCDDHFVNNGDTEFNHFKTLYMIGDNPVVDIKGAREDTLVFYS--NKDGVFRAKKITQTFQLILYLHVL
        RQILSSHCDDHFVN  D EFN FKTLYMIGDNPVVDIKGARE    ++S   + GVFR ++    F   L ++ +
Subjt:  RQILSSHCDDHFVNNGDTEFNHFKTLYMIGDNPVVDIKGAREDTLVFYS--NKDGVFRAKKITQTFQLILYLHVL

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KU31 Uncharacterized protein (e value: 1.8e-146; identity: 78.55%)
Query:  FQSQASASRSFSRIQSR--PQRSSFGIAFDIDGVVLRGNLPIGGSPRALRRLYADSTSSGTLKVPFLFLTNGGGTPESRRAIELSDLLGVNILPSQVVQG
        FQS    S + SRIQ R    R SFGIAFDIDGV+LRG  PIGGS +ALRRLY DST SGTLKVPFLFLTNGGGTPESRRAIELS+LLGVN+LPSQVVQG
Subjt:  FQSQASASRSFSRIQSR--PQRSSFGIAFDIDGVVLRGNLPIGGSPRALRRLYADSTSSGTLKVPFLFLTNGGGTPESRRAIELSDLLGVNILPSQVVQG

Query:  HSTFKSLLNSFENEFIIATGKGQPALVMSEYGFKKVFSIDEYASFFENIDPVSQYKNWATKQGFVLSCNSEKLMRRQSVLSERVKAAFVVSDPVDWGRDI
        HS+FKSLLNSFENE IIATGKGQP LVMSEYGFKKVFSI EYASFFENIDPVS YK+W +KQ F  +CN  +LMRRQSVLSERVKAAFVVSDPVDWGRDI
Subjt:  HSTFKSLLNSFENEFIIATGKGQPALVMSEYGFKKVFSIDEYASFFENIDPVSQYKNWATKQGFVLSCNSEKLMRRQSVLSERVKAAFVVSDPVDWGRDI

Query:  QVLCDILRSGGLPGRENG--IQVPLYFAADDFEYQAAFPFKRLGIGAFRIALESVFNRIHHRPLEYVSYGKPNPLVFKIVETVLRQILSSHCDDHFVNNG
        QVLCD+LRSGGLPG +NG   QVPLYFAADD EYQ AFP KRLGIGAF+IALES+FNRIHH PLEYV YGKPNPLVFK VE V +QIL SHCDDHFVN G
Subjt:  QVLCDILRSGGLPGRENG--IQVPLYFAADDFEYQAAFPFKRLGIGAFRIALESVFNRIHHRPLEYVSYGKPNPLVFKIVETVLRQILSSHCDDHFVNNG

Query:  DTEFNHFKTLYMIGDNPVVDIKGAREDTLVFYS--NKDGVFRAKK
        D E N FK LYMIGDNPVVDIKGARE    ++S   + GVF+ K+
Subjt:  DTEFNHFKTLYMIGDNPVVDIKGAREDTLVFYS--NKDGVFRAKK

A0A1S3CK22 uncharacterized protein YKR070W-like (e value: 1.8e-146; identity: 78.55%)
Query:  FQSQASASRSFSRIQSR--PQRSSFGIAFDIDGVVLRGNLPIGGSPRALRRLYADSTSSGTLKVPFLFLTNGGGTPESRRAIELSDLLGVNILPSQVVQG
        FQS    S + SRIQ R    R SFGIAFDIDGV+LRG  PIGGS +ALRRLY DST SGTLKVPFLFLTNGGGTPESRRAIELS+LLGVN+LPSQVVQG
Subjt:  FQSQASASRSFSRIQSR--PQRSSFGIAFDIDGVVLRGNLPIGGSPRALRRLYADSTSSGTLKVPFLFLTNGGGTPESRRAIELSDLLGVNILPSQVVQG

Query:  HSTFKSLLNSFENEFIIATGKGQPALVMSEYGFKKVFSIDEYASFFENIDPVSQYKNWATKQGFVLSCNSEKLMRRQSVLSERVKAAFVVSDPVDWGRDI
        HS+FKSLLN FENE IIATGKGQP LVMSEYGFKKVFSI EYASFFENIDPVSQYK+W +KQ F  +C+    MRRQSVLSERVKAAFVVSDPVDWGRDI
Subjt:  HSTFKSLLNSFENEFIIATGKGQPALVMSEYGFKKVFSIDEYASFFENIDPVSQYKNWATKQGFVLSCNSEKLMRRQSVLSERVKAAFVVSDPVDWGRDI

Query:  QVLCDILRSGGLPGRENG--IQVPLYFAADDFEYQAAFPFKRLGIGAFRIALESVFNRIHHRPLEYVSYGKPNPLVFKIVETVLRQILSSHCDDHFVNNG
        QVLCD+LRSGGLPG +NG   QVPLYFAADD EYQ AFPFKRLGIGAF+IALES+F RIHH PLEYV YGKPNPLVFK VE V RQIL SHCDDHFVN G
Subjt:  QVLCDILRSGGLPGRENG--IQVPLYFAADDFEYQAAFPFKRLGIGAFRIALESVFNRIHHRPLEYVSYGKPNPLVFKIVETVLRQILSSHCDDHFVNNG

Query:  DTEFNHFKTLYMIGDNPVVDIKGAREDTLVFYS--NKDGVFRAKK
        D E N FKTLYMIGDNPVVDIKGARE    ++S   + GVF+ K+
Subjt:  DTEFNHFKTLYMIGDNPVVDIKGAREDTLVFYS--NKDGVFRAKK

A0A5D3C5R2 Hydrolase family protein / HAD-superfamily protein isoform 2 (e value: 1.5e-145; identity: 78.11%)
Query:  SASRSFSRIQSRPQRSSFGIAFDIDGVVLRGNLPIGGSPRALRRLYADSTSSGTLKVPFLFLTNGGGTPESRRAIELSDLLGVNILPSQVVQGHSTFKSL
        S+ +  + I ++P   SFGIAFDIDGV+LRG  PIGGS +ALRRLY DST SGTLKVPFLFLTNGGGTPESRRAIELS+LLGVN+LPSQVVQGHS+FKSL
Subjt:  SASRSFSRIQSRPQRSSFGIAFDIDGVVLRGNLPIGGSPRALRRLYADSTSSGTLKVPFLFLTNGGGTPESRRAIELSDLLGVNILPSQVVQGHSTFKSL

Query:  LNSFENEFIIATGKGQPALVMSEYGFKKVFSIDEYASFFENIDPVSQYKNWATKQGFVLSCNSEKLMRRQSVLSERVKAAFVVSDPVDWGRDIQVLCDIL
        LN FENE IIATGKGQP LVMSEYGFKKVFSI EYASFFENIDPVSQYK+W +KQ F  +C+    MRRQSVLSERVKAAFVVSDPVDWGRDIQVLCD+L
Subjt:  LNSFENEFIIATGKGQPALVMSEYGFKKVFSIDEYASFFENIDPVSQYKNWATKQGFVLSCNSEKLMRRQSVLSERVKAAFVVSDPVDWGRDIQVLCDIL

Query:  RSGGLPGRENG--IQVPLYFAADDFEYQAAFPFKRLGIGAFRIALESVFNRIHHRPLEYVSYGKPNPLVFKIVETVLRQILSSHCDDHFVNNGDTEFNHF
        RSGGLPG +NG   QVPLYFAADD EYQ AFPFKRLGIGAF+IALES+F RIHH PLEYV YGKPNPLVFK VE V RQIL SHCDDHFVN GD E N F
Subjt:  RSGGLPGRENG--IQVPLYFAADDFEYQAAFPFKRLGIGAFRIALESVFNRIHHRPLEYVSYGKPNPLVFKIVETVLRQILSSHCDDHFVNNGDTEFNHF

Query:  KTLYMIGDNPVVDIKGAREDTLVFYS--NKDGVFRAKK
        KTLYMIGDNPVVDIKGARE    ++S   + GVF+ K+
Subjt:  KTLYMIGDNPVVDIKGAREDTLVFYS--NKDGVFRAKK

A0A6J1FF73 uncharacterized protein YKR070W-like (e value: 7.1e-167; identity: 83.38%)
Query:  MKILTVFRVSQGKNREPFLFQSQASAS-RSFSRIQSRPQRSSFGIAFDIDGVVLRGNLPIGGSPRALRRLYADSTSSGTLKVPFLFLTNGGGTPESRRAI
        MKILTVFRVSQG  R    FQSQASA   SFS IQSR +RSSFGIAFDIDGVVLRG+ PIGGS +AL+RLYADSTSSGTLKVPFLFLTNGGGTPES+RAI
Subjt:  MKILTVFRVSQGKNREPFLFQSQASAS-RSFSRIQSRPQRSSFGIAFDIDGVVLRGNLPIGGSPRALRRLYADSTSSGTLKVPFLFLTNGGGTPESRRAI

Query:  ELSDLLGVNILPSQVVQGHSTFKSLLNSFENEFIIATGKGQPALVMSEYGFKKVFSIDEYASFFENIDPVSQYKNWATKQGFVLSCNSEKLMRRQSVLSE
        ELS+LLGVN+LPSQVVQGHS+F+SLLNSFENE IIATGKGQPALVMSEYGFKKVFS+DEYASFFENIDPVSQYKNW ++Q F  +CN EKLMR+QSVLSE
Subjt:  ELSDLLGVNILPSQVVQGHSTFKSLLNSFENEFIIATGKGQPALVMSEYGFKKVFSIDEYASFFENIDPVSQYKNWATKQGFVLSCNSEKLMRRQSVLSE

Query:  RVKAAFVVSDPVDWGRDIQVLCDILRSGGLPGRENGIQVPLYFAADDFEYQAAFPFKRLGIGAFRIALESVFNRIHHRPLEYVSYGKPNPLVFKIVETVL
        RVKAAFVVSDPVDWGRDIQVLCD+LRSGGLPG+ENG QVPLYFAADDFEYQAAFP KRLGIGAF+IALES+FNRIHHRPLEYVSYGKPNPLVFK VETVL
Subjt:  RVKAAFVVSDPVDWGRDIQVLCDILRSGGLPGRENGIQVPLYFAADDFEYQAAFPFKRLGIGAFRIALESVFNRIHHRPLEYVSYGKPNPLVFKIVETVL

Query:  RQILSSHCDDHFVNNGDTEFNHFKTLYMIGDNPVVDIKGAREDTLVFYS--NKDGVFRAKK
        RQILSSHCDDHFVN  D EFN F+T YMIGDNPVVDIKGARE    ++S   + GVFR ++
Subjt:  RQILSSHCDDHFVNNGDTEFNHFKTLYMIGDNPVVDIKGAREDTLVFYS--NKDGVFRAKK

A0A6J1JXI7 uncharacterized protein YKR070W-like (e value: 2.9e-168; identity: 84.21%)
Query:  MKILTVFRVSQGKNREPFLFQSQASAS-RSFSRIQSRPQRSSFGIAFDIDGVVLRGNLPIGGSPRALRRLYADSTSSGTLKVPFLFLTNGGGTPESRRAI
        MKILTVFRVSQG  R    FQSQASA   SFS IQSR +RSSFGIAFDIDGVVLRG+ PIGGS +AL+RLYADSTSSGTLKVPFLFLTNGGGTPESRRAI
Subjt:  MKILTVFRVSQGKNREPFLFQSQASAS-RSFSRIQSRPQRSSFGIAFDIDGVVLRGNLPIGGSPRALRRLYADSTSSGTLKVPFLFLTNGGGTPESRRAI

Query:  ELSDLLGVNILPSQVVQGHSTFKSLLNSFENEFIIATGKGQPALVMSEYGFKKVFSIDEYASFFENIDPVSQYKNWATKQGFVLSCNSEKLMRRQSVLSE
        ELS+LLGVN+LPSQVVQGHS+F+SLLNSFENE IIATGKGQPALVMSEYGFKKVFS+DEYASFFENIDPVSQYKNW ++Q F L+CN EKLMR+QSVLSE
Subjt:  ELSDLLGVNILPSQVVQGHSTFKSLLNSFENEFIIATGKGQPALVMSEYGFKKVFSIDEYASFFENIDPVSQYKNWATKQGFVLSCNSEKLMRRQSVLSE

Query:  RVKAAFVVSDPVDWGRDIQVLCDILRSGGLPGRENGIQVPLYFAADDFEYQAAFPFKRLGIGAFRIALESVFNRIHHRPLEYVSYGKPNPLVFKIVETVL
        RVKAAFVVSDPVDWGRDIQVLCD+LRSGGLPG+ENG QVPLYFAADDFEYQAAFPFKRLGIGAF+IALES+FNRIHHRPLEYVSYGKPNPLVFK VETVL
Subjt:  RVKAAFVVSDPVDWGRDIQVLCDILRSGGLPGRENGIQVPLYFAADDFEYQAAFPFKRLGIGAFRIALESVFNRIHHRPLEYVSYGKPNPLVFKIVETVL

Query:  RQILSSHCDDHFVNNGDTEFNHFKTLYMIGDNPVVDIKGAREDTLVFYS--NKDGVFRAKK
        RQILSSHCDDH VN  D EFN F+TLYMIGDNPVVDIKGARE    ++S   + GVFR ++
Subjt:  RQILSSHCDDHFVNNGDTEFNHFKTLYMIGDNPVVDIKGAREDTLVFYS--NKDGVFRAKK

SwissProt top hits (e value, % identity, alignment)
P36151 Uncharacterized protein YKR070W (e value: 1.1e-31; identity: 29.06%)
Query:  AFDIDGVVLRGNLPIGGSPRALRRLYADSTSSGTLKVPFLFLTNGGGTPESRRAIELSDLLGVNILPSQVVQGHSTFKSLLNSFENEFIIATGKGQPALV
        AFDIDGV+ RG  PI G+  AL+ L  +       K+P++ LTNGGG  E  R   +S  L V++ P Q++Q H+ +KSL+N +    I+A G      V
Subjt:  AFDIDGVVLRGNLPIGGSPRALRRLYADSTSSGTLKVPFLFLTNGGGTPESRRAIELSDLLGVNILPSQVVQGHSTFKSLLNSFENEFIIATGKGQPALV

Query:  MSEYGFKKVFSIDEYASFFENIDPVSQYKNWATKQGFVLSCNSEKLMRRQ----SVLSERVKAAFVVSDPVDWGRDIQVLCDILRSGGLPGRENGI----
           YGF+ V    +   +  +I P S               + E++M        + +++  A  V +DP DW  DIQ++ D + S      ENG+    
Subjt:  MSEYGFKKVFSIDEYASFFENIDPVSQYKNWATKQGFVLSCNSEKLMRRQ----SVLSERVKAAFVVSDPVDWGRDIQVLCDILRSGGLPGRENGI----

Query:  --------QVPLYFAADDFEYQAAFPFKRLGIGAFRIALESVFNRIHHRPLEYVSYGKPNPLVFKIVETVL---RQILSSHCDDHF-----VNNGDTEFN
                 +P+YF+  D  +   +   R G GAFR+ +  ++  ++  PL+  + GKP  L +     VL    + LS            +       +
Subjt:  --------QVPLYFAADDFEYQAAFPFKRLGIGAFRIALESVFNRIHHRPLEYVSYGKPNPLVFKIVETVL---RQILSSHCDDHF-----VNNGDTEFN

Query:  HFKTLYMIGDNPVVDIKGAR
         F  ++M+GDNP  DI GA+
Subjt:  HFKTLYMIGDNPVVDIKGAR

Q91WM2 Haloacid dehalogenase-like hydrolase domain-containing 5 (e value: 7.8e-30; identity: 30%)
Query:  SRPQRSSFGIAFDIDGVVLRGNLPIGGSPRALRRLYADSTSSGTLKVPFLFLTNGGGTPESRRAIELSDLLGVNILPSQVVQGHSTFKSLLNSFENEFII
        S  +  +FG+ FDIDGV++RG+  I   P AL        S G L+VP +F+TN G   +  +A ELSDLL   + P QV+  HS  K  L  + ++ ++
Subjt:  SRPQRSSFGIAFDIDGVVLRGNLPIGGSPRALRRLYADSTSSGTLKVPFLFLTNGGGTPESRRAIELSDLLGVNILPSQVVQGHSTFKSLLNSFENEFII

Query:  ATGKGQPALVMSEYGFKKVFSIDEYASFFENIDPVSQYKNWATKQGFVLSCNSEKLMRRQSVLSERVKAAFVVSDPVDWGRDIQVLCDILRSGGLPGREN
         +G+G         GF+ V +IDE    F  +D V   +               K MR +S     ++   ++ +PV W  ++Q++ D+L S G PG   
Subjt:  ATGKGQPALVMSEYGFKKVFSIDEYASFFENIDPVSQYKNWATKQGFVLSCNSEKLMRRQSVLSERVKAAFVVSDPVDWGRDIQVLCDILRSGGLPGREN

Query:  GI----QVPLYFAADDFEYQAAFPFKRLGIGAFRIALESVFNRIHHRPLEYVS-YGKPNPLVFKIVETVLRQILSSHCDDHFVNNGDTEFNHFKTLYMIG
               +P+  +  D  + A     R G G F + LE+++ +I    L+Y    GKP+ L ++  E V+RQ                     + LY IG
Subjt:  GI----QVPLYFAADDFEYQAAFPFKRLGIGAFRIALESVFNRIHHRPLEYVS-YGKPNPLVFKIVETVLRQILSSHCDDHFVNNGDTEFNHFKTLYMIG

Query:  DNPVVDIKGA
        DNP+ D+ GA
Subjt:  DNPVVDIKGA

Q9BXW7 Haloacid dehalogenase-like hydrolase domain-containing 5 (e value: 1.5e-28; identity: 28.61%)
Query:  ASRSFSRIQSRPQR-----------SSFGIAFDIDGVVLRGNLPIGGSPRALRRLYADSTSSGTLKVPFLFLTNGGGTPESRRAIELSDLLGVNILPSQV
        A+R+ + +Q RP R            +FG   DIDGV++RG+  I  + +A RRL     S G L+VP +F+TN G   +  +A ELS LLG  +   QV
Subjt:  ASRSFSRIQSRPQR-----------SSFGIAFDIDGVVLRGNLPIGGSPRALRRLYADSTSSGTLKVPFLFLTNGGGTPESRRAIELSDLLGVNILPSQV

Query:  VQGHSTFKSLLNSFENEFIIATGKGQPALVMSEYGFKKVFSIDEYASFFENIDPVSQYKNWATKQGFVLSCNSEKLMRRQSVLSERVKAAFVVSDPVDWG
        +  HS  K L + +  + ++ +G+G         GF+ V ++DE    F  +D V   +   T            L R       R++   ++ +PV W 
Subjt:  VQGHSTFKSLLNSFENEFIIATGKGQPALVMSEYGFKKVFSIDEYASFFENIDPVSQYKNWATKQGFVLSCNSEKLMRRQSVLSERVKAAFVVSDPVDWG

Query:  RDIQVLCDILRSGGLPGRENGIQVPLY-----FAAD-DFEYQAAFPFKRLGIGAFRIALESVFNRIHHRPLEYVS-YGKPNPLVFKIVETVLRQILSSHC
          +Q++ D+L S G PG   G+  P Y      A++ D  + A     R G G F + LE+++ ++  + L Y    GKP+ L ++  E ++R+      
Subjt:  RDIQVLCDILRSGGLPGRENGIQVPLY-----FAAD-DFEYQAAFPFKRLGIGAFRIALESVFNRIHHRPLEYVS-YGKPNPLVFKIVETVLRQILSSHC

Query:  DDHFVNNGDTEFNHFKTLYMIGDNPVVDIKGA
                       + LY +GDNP+ D+ GA
Subjt:  DDHFVNNGDTEFNHFKTLYMIGDNPVVDIKGA

Q9FQ24 Protein TONNEAU 1b (e value: 6.7e-82; identity: 54.44%)
Query:  MDDYTREMMDLKTLVTRTLEKKGVLAKIRVIFYFFLLLQFCVISFRFPPFPASTDLLLVFLLDFLEFGGYSAVGWVLCPWYLGRLRPSLFMGVLELLDLG
        MDDYTREMMDLKTLVTRTLEKKGVLAKIR                                                                       
Subjt:  MDDYTREMMDLKTLVTRTLEKKGVLAKIRVIFYFFLLLQFCVISFRFPPFPASTDLLLVFLLDFLEFGGYSAVGWVLCPWYLGRLRPSLFMGVLELLDLG

Query:  LIEKKNFLVLIGVRMCSEAELRASVFEAIEEEDRVIEKDEGLPPALLGSCNDRAKQLHASPSGRLLTALICEYLDWAQLNHTLKVYLPECNMQKDSWKAE
                          AELRASVFEAIEEEDRVIE +EGLPPALLGSCNDRA+QLHASPSGRLL+ALICEYLDWAQLNHTLKVY PECN  KDSWK+E
Subjt:  LIEKKNFLVLIGVRMCSEAELRASVFEAIEEEDRVIEKDEGLPPALLGSCNDRAKQLHASPSGRLLTALICEYLDWAQLNHTLKVYLPECNMQKDSWKAE

Query:  LKEFSSKNGYDLNRNGDSGPLLLDVLEGFLKFENLSQTRGTGRRITTSESDSMSSHESRNSRRPSSSVAGGLPPLGRPGAGSQASDRRVGSSMSGYRKDE
        +++FS  NGY+LNRN DS PLLLDVLEGFLKFEN++Q  G   R   SE++S  S ++RN  R  SS +  LP   R  + SQAS    G++ SGYRKDE
Subjt:  LKEFSSKNGYDLNRNGDSGPLLLDVLEGFLKFENLSQTRGTGRRITTSESDSMSSHESRNSRRPSSSVAGGLPPLGRPGAGSQASDRRVGSSMSGYRKDE

Query:  YSWRYDGNELPEDVIRTSTALENLQLDRKARNLTSSWR
         +WRYD  ++PE+V+R STALENLQLDRK RNLTSSWR
Subjt:  YSWRYDGNELPEDVIRTSTALENLQLDRKARNLTSSWR

Q9FQ25 Protein TONNEAU 1a (e value: 1.9e-84; identity: 54.84%)
Query:  MDDYTREMMDLKTLVTRTLEKKGVLAKIRVIFYFFLLLQFCVISFRFPPFPASTDLLLVFLLDFLEFGGYSAVGWVLCPWYLGRLRPSLFMGVLELLDLG
        MDDYTREMMDLKTLVTRTLEKKGVLAKIR                                                                       
Subjt:  MDDYTREMMDLKTLVTRTLEKKGVLAKIRVIFYFFLLLQFCVISFRFPPFPASTDLLLVFLLDFLEFGGYSAVGWVLCPWYLGRLRPSLFMGVLELLDLG

Query:  LIEKKNFLVLIGVRMCSEAELRASVFEAIEEEDRVIEKDEGLPPALLGSCNDRAKQLHASPSGRLLTALICEYLDWAQLNHTLKVYLPECNMQKDSWKAE
                          AELRASVFEAIEEEDRVIE +EG PPALLGSCNDRA++LHASPSGRLL+ALICEYLDWAQLNHTL VY PE N+ KDSWK+E
Subjt:  LIEKKNFLVLIGVRMCSEAELRASVFEAIEEEDRVIEKDEGLPPALLGSCNDRAKQLHASPSGRLLTALICEYLDWAQLNHTLKVYLPECNMQKDSWKAE

Query:  LKEFSSKNGYDLNRNGDSGPLLLDVLEGFLKFENLSQTRGTGRRITTSESDSMSSHESRNSRRPSSSVAGGLPPLGRPGAGSQASDRRVGSSMSGYRKDE
        L++F+S NG++LNRNGDSGPLLLDVLEGFLKFE+++Q  G+  R   SE++S SS ESRN  R  SS +  LPP  RP + SQASDRR G S SGYRKDE
Subjt:  LKEFSSKNGYDLNRNGDSGPLLLDVLEGFLKFENLSQTRGTGRRITTSESDSMSSHESRNSRRPSSSVAGGLPPLGRPGAGSQASDRRVGSSMSGYRKDE

Query:  YSWRYDGNELPEDVIRTSTALENLQLDRKARNLTSSWRKER
        ++WR    +  E+V R S ALENLQLDRK RNLTSSWR  R
Subjt:  YSWRYDGNELPEDVIRTSTALENLQLDRKARNLTSSWRKER

Arabidopsis top hits (e value, % identity, alignment)
AT3G45740.1 hydrolase family protein / HAD-superfamily protein (e value: 4.2e-111; identity: 60.78%)
Query:  SRSF-SRIQSRPQRSSFGIAFDIDGVVLRGNLPIGGSPRALRRLYADSTSSGTLKVPFLFLTNGGGTPESRRAIELSDLLGVNILPSQVVQGHSTFKSLL
        SRSF S I     RSSFGIAFDIDGV+L G+ P+GGSP ALRRLY D   SG LK+PFLFLTNGGG PES+RA E+S LLGV + P QV+Q HS F+ L+
Subjt:  SRSF-SRIQSRPQRSSFGIAFDIDGVVLRGNLPIGGSPRALRRLYADSTSSGTLKVPFLFLTNGGGTPESRRAIELSDLLGVNILPSQVVQGHSTFKSLL

Query:  NSFENEFIIATGKGQPALVMSEYGFKKVFSIDEYASFFENIDPVSQYKNWATKQGFVLSCNSEKLMRRQSVLSERVKAAFVVSDPVDWGRDIQVLCDILR
        N FENE ++A GKG+PA VMS YGFK V S+DEYAS+F+NIDP++ YK    +Q        ++L  R+ VLS+RV+AAF+VSDPVDW RDIQVLCDILR
Subjt:  NSFENEFIIATGKGQPALVMSEYGFKKVFSIDEYASFFENIDPVSQYKNWATKQGFVLSCNSEKLMRRQSVLSERVKAAFVVSDPVDWGRDIQVLCDILR

Query:  SGGLPGRENGIQVPLYFAADDFEYQAAFPFKRLGIGAFRIALESVFNRIHHRPLEYVSYGKPNPLVFKIVETVLRQILSSHCDDHFVNNGDTEFNHFKTL
        +GGLPG+E G Q  LY A DD +YQ  FP +RLG+GAFRIALES+FNRIH +PLEY S+GKPNP VFK  E VL++I +S    +  N G    +HFKTL
Subjt:  SGGLPGRENGIQVPLYFAADDFEYQAAFPFKRLGIGAFRIALESVFNRIHHRPLEYVSYGKPNPLVFKIVETVLRQILSSHCDDHFVNNGDTEFNHFKTL

Query:  YMIGDNPVVDIKGAREDTLVFYS--NKDGVFRAK
        YMIGDNP +DI+GAR+    ++S   + GVF+ K
Subjt:  YMIGDNPVVDIKGAREDTLVFYS--NKDGVFRAK

AT3G55000.1 tonneau family protein (e value: 1.3e-85; identity: 54.84%)
Query:  MDDYTREMMDLKTLVTRTLEKKGVLAKIRVIFYFFLLLQFCVISFRFPPFPASTDLLLVFLLDFLEFGGYSAVGWVLCPWYLGRLRPSLFMGVLELLDLG
        MDDYTREMMDLKTLVTRTLEKKGVLAKIR                                                                       
Subjt:  MDDYTREMMDLKTLVTRTLEKKGVLAKIRVIFYFFLLLQFCVISFRFPPFPASTDLLLVFLLDFLEFGGYSAVGWVLCPWYLGRLRPSLFMGVLELLDLG

Query:  LIEKKNFLVLIGVRMCSEAELRASVFEAIEEEDRVIEKDEGLPPALLGSCNDRAKQLHASPSGRLLTALICEYLDWAQLNHTLKVYLPECNMQKDSWKAE
                          AELRASVFEAIEEEDRVIE +EG PPALLGSCNDRA++LHASPSGRLL+ALICEYLDWAQLNHTL VY PE N+ KDSWK+E
Subjt:  LIEKKNFLVLIGVRMCSEAELRASVFEAIEEEDRVIEKDEGLPPALLGSCNDRAKQLHASPSGRLLTALICEYLDWAQLNHTLKVYLPECNMQKDSWKAE

Query:  LKEFSSKNGYDLNRNGDSGPLLLDVLEGFLKFENLSQTRGTGRRITTSESDSMSSHESRNSRRPSSSVAGGLPPLGRPGAGSQASDRRVGSSMSGYRKDE
        L++F+S NG++LNRNGDSGPLLLDVLEGFLKFE+++Q  G+  R   SE++S SS ESRN  R  SS +  LPP  RP + SQASDRR G S SGYRKDE
Subjt:  LKEFSSKNGYDLNRNGDSGPLLLDVLEGFLKFENLSQTRGTGRRITTSESDSMSSHESRNSRRPSSSVAGGLPPLGRPGAGSQASDRRVGSSMSGYRKDE

Query:  YSWRYDGNELPEDVIRTSTALENLQLDRKARNLTSSWRKER
        ++WR    +  E+V R S ALENLQLDRK RNLTSSWR  R
Subjt:  YSWRYDGNELPEDVIRTSTALENLQLDRKARNLTSSWRKER

AT3G55005.1 tonneau 1b (TON1b) (e value: 4.8e-83; identity: 54.44%)
Query:  MDDYTREMMDLKTLVTRTLEKKGVLAKIRVIFYFFLLLQFCVISFRFPPFPASTDLLLVFLLDFLEFGGYSAVGWVLCPWYLGRLRPSLFMGVLELLDLG
        MDDYTREMMDLKTLVTRTLEKKGVLAKIR                                                                       
Subjt:  MDDYTREMMDLKTLVTRTLEKKGVLAKIRVIFYFFLLLQFCVISFRFPPFPASTDLLLVFLLDFLEFGGYSAVGWVLCPWYLGRLRPSLFMGVLELLDLG

Query:  LIEKKNFLVLIGVRMCSEAELRASVFEAIEEEDRVIEKDEGLPPALLGSCNDRAKQLHASPSGRLLTALICEYLDWAQLNHTLKVYLPECNMQKDSWKAE
                          AELRASVFEAIEEEDRVIE +EGLPPALLGSCNDRA+QLHASPSGRLL+ALICEYLDWAQLNHTLKVY PECN  KDSWK+E
Subjt:  LIEKKNFLVLIGVRMCSEAELRASVFEAIEEEDRVIEKDEGLPPALLGSCNDRAKQLHASPSGRLLTALICEYLDWAQLNHTLKVYLPECNMQKDSWKAE

Query:  LKEFSSKNGYDLNRNGDSGPLLLDVLEGFLKFENLSQTRGTGRRITTSESDSMSSHESRNSRRPSSSVAGGLPPLGRPGAGSQASDRRVGSSMSGYRKDE
        +++FS  NGY+LNRN DS PLLLDVLEGFLKFEN++Q  G   R   SE++S  S ++RN  R  SS +  LP   R  + SQAS    G++ SGYRKDE
Subjt:  LKEFSSKNGYDLNRNGDSGPLLLDVLEGFLKFENLSQTRGTGRRITTSESDSMSSHESRNSRRPSSSVAGGLPPLGRPGAGSQASDRRVGSSMSGYRKDE

Query:  YSWRYDGNELPEDVIRTSTALENLQLDRKARNLTSSWR
         +WRYD  ++PE+V+R STALENLQLDRK RNLTSSWR
Subjt:  YSWRYDGNELPEDVIRTSTALENLQLDRKARNLTSSWR
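
The % identity figures in the hit lists summarise the alignment midlines (the unlabelled line between each Query and Subjt pair), where a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. A rough sketch of counting identities in one midline segment follows. The function name `midline_identity` is hypothetical, the example segment is the final block of the P36151 alignment, and a single segment will not reproduce the whole-alignment percentage reported by the database.

```python
def midline_identity(query, midline):
    """Return (identical, aligned_columns, percent) for one alignment segment.

    Hypothetical helper: counts letters in a BLAST-style midline, where a
    letter means an identical residue at that column.
    """
    # Trailing spaces of a midline are often lost in text dumps, so pad it.
    midline = midline.ljust(len(query))
    if len(midline) != len(query):
        raise ValueError("midline is longer than the query segment")
    identical = sum(1 for ch in midline if ch.isalpha())
    return identical, len(query), 100.0 * identical / len(query)


# Final segment of the P36151 (YKR070W) alignment shown above.
query = "HFKTLYMIGDNPVVDIKGAR"
midline = " F  ++M+GDNP  DI GA+"
print(midline_identity(query, midline))
```

Summing the identical and aligned-column counts over every segment of a hit, rather than averaging per-segment percentages, gives the figure comparable to the reported % identity.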


Sequences
CDS sequence
ATGAAAATTCTTACGGTCTTCAGAGTCTCTCAAGGCAAGAACCGAGAGCCATTTTTGTTTCAATCTCAGGCTTCTGCTTCCCGTTCCTTCTCTCGGATTCAATCTCGACC
TCAACGTTCTTCTTTCGGGATTGCATTTGACATTGATGGGGTTGTTCTACGTGGCAACCTTCCTATTGGAGGGTCTCCCCGAGCTTTGAGAAGGTTGTATGCGGATTCTA
CATCCTCTGGTACTTTGAAAGTTCCTTTTCTTTTCTTGACCAATGGAGGCGGTACCCCAGAATCCAGACGTGCCATTGAATTGAGTGACCTCTTGGGAGTGAATATATTA
CCTTCTCAGGTTGTACAGGGCCACTCAACTTTTAAAAGTTTGTTGAATAGTTTTGAGAATGAATTCATTATTGCAACTGGGAAGGGGCAGCCAGCTCTCGTTATGTCTGA
GTATGGTTTCAAAAAAGTTTTCTCCATTGATGAGTATGCCTCCTTCTTTGAGAATATTGATCCTGTATCACAATACAAGAACTGGGCAACCAAACAAGGTTTTGTTTTGA
GCTGCAACTCCGAGAAGTTGATGAGAAGGCAGAGTGTTCTCTCCGAAAGAGTGAAGGCCGCTTTTGTTGTCAGTGATCCTGTTGATTGGGGCAGAGATATCCAAGTTCTT
TGTGATATTTTAAGATCTGGAGGCCTTCCAGGACGGGAGAATGGGATTCAAGTGCCTCTATATTTTGCTGCAGATGATTTTGAATATCAGGCTGCATTTCCGTTTAAGCG
GCTTGGGATAGGTGCCTTCAGAATTGCACTGGAAAGTGTATTTAATAGGATTCACCATAGACCTCTCGAATATGTATCTTATGGGAAACCAAATCCTCTTGTGTTTAAGA
TTGTTGAAACTGTATTGAGACAAATTCTCTCATCTCATTGTGATGATCATTTTGTGAACAATGGAGATACTGAATTTAATCATTTTAAGACACTTTATATGATTGGTGAC
AATCCCGTGGTAGACATCAAAGGTGCACGAGAGGACACCTTGGTTTTCTATTCTAACAAGGACGGAGTTTTCAGGGCAAAGAAAATCACGCAGACTTTCCAGCTAATCTT
ATACTTACATGTACTTGGAAAAGATGGACTCAACAACCCTTTGAGCGATCGGGTGATAACTTTCGCGGGCTTCGGAATATATCCTCTTCGCCAAAATCTTCTCTTCTTCC
ATCCCGGGGCCTTGCGTAAGAGCGGTGTAGAGCGGGCGGAGGTTCCCAGTCCAGCTATGAGCAAGCTCATGAGCAACCACATGCGCACCCGTAGCATCCCCCTTTATCAC
CGTCGGTGTCAAGAATACCATCTTCGGATTTTCCATCCCACCGTAAGGAAAGCTCGGAGCTGCAGCGTCGAGCACCGCGGGCACCGATTCCGCATAAACCCTAGTCCTCG
GCCCCACTTCCCTGAATGCGATCTCTCCGACGGCAAAAGCGAACAAGTACGGCGGAACTGGCTGCTCCATCACAAACTCTTCCACCACTCTCCCTTCACCGCACCACAGA
GAATCAACCCCTCCGGCCAACATCGAACCCTCGCCAACGCCTGGTGGGCGGCGCTCCACATTCCGTGCAGCCATCACCGTGACAGACTGATCAGAAAGCGAAACGGAGAG
AAGAGACCCTTTGATCGCATCGGGAGGAGAAAGCGAAAAGGGTATCGGGGCAAGGGATTGAGGGTCGATAACAGAATGAATAGTAAGAGAACGAGTATCGAGCGAGAGAG
GACCAGAGTGCGAACTCGAGAGAGAAATCAGGGCAGAAGCATGGATGGTAGTGGAGGGAAAGTCGAAGAAGAGAGAGAGGGAAATGTGAACGGTGAGAGGGTGGGAAGAA
TCGGTGAAAGAGTGTGGGTCAACGGGCGCCATTGCGTCGAAGAAGAAGAAGGAGAAGAACCAAAATGGCTTTCTGCGTTGAGACACTTCCATTTCCGCTTTCTTCCTCCC
CACGGCCGTGTAGTTGATGCTGTCACGCGCCTTCCTGCTGAGCAGCAGAGAACGGGAACCGTCAGAGCGAGCGAGCGAGCAGCGAAGCGAAAGCCTGAGAAGTGGAGAAC
GATGGACGACTATACGAGAGAGATGATGGACTTGAAGACCCTTGTCACTCGGACTCTCGAGAAGAAGGGCGTCCTCGCCAAGATCCGTGTAATTTTTTACTTTTTCCTCC
TTCTTCAATTTTGTGTTATTTCGTTCAGATTTCCCCCTTTTCCGGCCTCTACAGATCTTCTGCTCGTTTTTCTACTGGATTTCTTGGAATTTGGGGGTTATTCTGCCGTT
GGTTGGGTTTTGTGCCCTTGGTATCTGGGTAGACTGCGACCCTCGTTATTCATGGGTGTTCTAGAGCTCTTGGATCTTGGACTGATTGAAAAGAAGAATTTCCTGGTTCT
CATTGGAGTTCGAATGTGCTCTGAGGCTGAACTTAGAGCTAGTGTATTCGAGGCAATCGAAGAAGAGGATCGTGTTATTGAGAAAGATGAAGGCTTGCCACCTGCATTAC
TTGGTAGTTGCAATGATCGAGCAAAGCAGCTTCATGCTTCACCATCAGGGAGGCTGCTTACTGCTCTAATTTGTGAATACTTAGATTGGGCTCAGCTAAACCACACTCTA
AAAGTTTATCTCCCGGAGTGTAATATGCAAAAAGATTCTTGGAAAGCTGAATTGAAGGAATTTAGTAGTAAAAATGGATATGATCTTAACAGAAATGGAGATAGTGGACC
GTTGCTCTTGGATGTTCTTGAAGGATTCTTGAAATTTGAGAATTTATCTCAAACAAGGGGTACTGGAAGGAGAATAACCACTTCAGAATCTGATTCAATGTCCAGTCATG
AGTCTCGAAACAGTAGGAGACCTTCATCATCTGTTGCAGGTGGTTTACCTCCACTAGGAAGGCCTGGTGCTGGTTCACAGGCATCCGATAGGAGGGTAGGATCTTCAATG
TCTGGTTATCGAAAAGATGAATACAGTTGGAGATATGATGGCAATGAGCTTCCAGAGGATGTTATAAGAACCTCAACTGCTTTGGAAAACCTGCAACTGGATCGAAAAGC
AAGAAATTTAACCTCATCTTGGCGGAAAGAAAGACACTTCGCACCTTCTCAACAATGGAACAATCAGATCTTGAAAGAGGAGTTGGGCTACAACAGAAAGGATTTCCTAG
GCCGGCCTAGAAAGTTTTGGAGGATTGCAACTTATGTCAGCCAAAGAGCATTTAGCCCTAGTGATACTTGGGCCAGAAGCTCCTCCAAGGGAATGAAGTGCACTATTGAG
GCCCAAAGAATACACCCATCCTTCTCAAAACAAAAGAAAAGAGCATCATCTCTGATCTAA
mRNA sequence
ATGAAAATTCTTACGGTCTTCAGAGTCTCTCAAGGCAAGAACCGAGAGCCATTTTTGTTTCAATCTCAGGCTTCTGCTTCCCGTTCCTTCTCTCGGATTCAATCTCGACC
TCAACGTTCTTCTTTCGGGATTGCATTTGACATTGATGGGGTTGTTCTACGTGGCAACCTTCCTATTGGAGGGTCTCCCCGAGCTTTGAGAAGGTTGTATGCGGATTCTA
CATCCTCTGGTACTTTGAAAGTTCCTTTTCTTTTCTTGACCAATGGAGGCGGTACCCCAGAATCCAGACGTGCCATTGAATTGAGTGACCTCTTGGGAGTGAATATATTA
CCTTCTCAGGTTGTACAGGGCCACTCAACTTTTAAAAGTTTGTTGAATAGTTTTGAGAATGAATTCATTATTGCAACTGGGAAGGGGCAGCCAGCTCTCGTTATGTCTGA
GTATGGTTTCAAAAAAGTTTTCTCCATTGATGAGTATGCCTCCTTCTTTGAGAATATTGATCCTGTATCACAATACAAGAACTGGGCAACCAAACAAGGTTTTGTTTTGA
GCTGCAACTCCGAGAAGTTGATGAGAAGGCAGAGTGTTCTCTCCGAAAGAGTGAAGGCCGCTTTTGTTGTCAGTGATCCTGTTGATTGGGGCAGAGATATCCAAGTTCTT
TGTGATATTTTAAGATCTGGAGGCCTTCCAGGACGGGAGAATGGGATTCAAGTGCCTCTATATTTTGCTGCAGATGATTTTGAATATCAGGCTGCATTTCCGTTTAAGCG
GCTTGGGATAGGTGCCTTCAGAATTGCACTGGAAAGTGTATTTAATAGGATTCACCATAGACCTCTCGAATATGTATCTTATGGGAAACCAAATCCTCTTGTGTTTAAGA
TTGTTGAAACTGTATTGAGACAAATTCTCTCATCTCATTGTGATGATCATTTTGTGAACAATGGAGATACTGAATTTAATCATTTTAAGACACTTTATATGATTGGTGAC
AATCCCGTGGTAGACATCAAAGGTGCACGAGAGGACACCTTGGTTTTCTATTCTAACAAGGACGGAGTTTTCAGGGCAAAGAAAATCACGCAGACTTTCCAGCTAATCTT
ATACTTACATGTACTTGGAAAAGATGGACTCAACAACCCTTTGAGCGATCGGGTGATAACTTTCGCGGGCTTCGGAATATATCCTCTTCGCCAAAATCTTCTCTTCTTCC
ATCCCGGGGCCTTGCGTAAGAGCGGTGTAGAGCGGGCGGAGGTTCCCAGTCCAGCTATGAGCAAGCTCATGAGCAACCACATGCGCACCCGTAGCATCCCCCTTTATCAC
CGTCGGTGTCAAGAATACCATCTTCGGATTTTCCATCCCACCGTAAGGAAAGCTCGGAGCTGCAGCGTCGAGCACCGCGGGCACCGATTCCGCATAAACCCTAGTCCTCG
GCCCCACTTCCCTGAATGCGATCTCTCCGACGGCAAAAGCGAACAAGTACGGCGGAACTGGCTGCTCCATCACAAACTCTTCCACCACTCTCCCTTCACCGCACCACAGA
GAATCAACCCCTCCGGCCAACATCGAACCCTCGCCAACGCCTGGTGGGCGGCGCTCCACATTCCGTGCAGCCATCACCGTGACAGACTGATCAGAAAGCGAAACGGAGAG
AAGAGACCCTTTGATCGCATCGGGAGGAGAAAGCGAAAAGGGTATCGGGGCAAGGGATTGAGGGTCGATAACAGAATGAATAGTAAGAGAACGAGTATCGAGCGAGAGAG
GACCAGAGTGCGAACTCGAGAGAGAAATCAGGGCAGAAGCATGGATGGTAGTGGAGGGAAAGTCGAAGAAGAGAGAGAGGGAAATGTGAACGGTGAGAGGGTGGGAAGAA
TCGGTGAAAGAGTGTGGGTCAACGGGCGCCATTGCGTCGAAGAAGAAGAAGGAGAAGAACCAAAATGGCTTTCTGCGTTGAGACACTTCCATTTCCGCTTTCTTCCTCCC
CACGGCCGTGTAGTTGATGCTGTCACGCGCCTTCCTGCTGAGCAGCAGAGAACGGGAACCGTCAGAGCGAGCGAGCGAGCAGCGAAGCGAAAGCCTGAGAAGTGGAGAAC
GATGGACGACTATACGAGAGAGATGATGGACTTGAAGACCCTTGTCACTCGGACTCTCGAGAAGAAGGGCGTCCTCGCCAAGATCCGTGTAATTTTTTACTTTTTCCTCC
TTCTTCAATTTTGTGTTATTTCGTTCAGATTTCCCCCTTTTCCGGCCTCTACAGATCTTCTGCTCGTTTTTCTACTGGATTTCTTGGAATTTGGGGGTTATTCTGCCGTT
GGTTGGGTTTTGTGCCCTTGGTATCTGGGTAGACTGCGACCCTCGTTATTCATGGGTGTTCTAGAGCTCTTGGATCTTGGACTGATTGAAAAGAAGAATTTCCTGGTTCT
CATTGGAGTTCGAATGTGCTCTGAGGCTGAACTTAGAGCTAGTGTATTCGAGGCAATCGAAGAAGAGGATCGTGTTATTGAGAAAGATGAAGGCTTGCCACCTGCATTAC
TTGGTAGTTGCAATGATCGAGCAAAGCAGCTTCATGCTTCACCATCAGGGAGGCTGCTTACTGCTCTAATTTGTGAATACTTAGATTGGGCTCAGCTAAACCACACTCTA
AAAGTTTATCTCCCGGAGTGTAATATGCAAAAAGATTCTTGGAAAGCTGAATTGAAGGAATTTAGTAGTAAAAATGGATATGATCTTAACAGAAATGGAGATAGTGGACC
GTTGCTCTTGGATGTTCTTGAAGGATTCTTGAAATTTGAGAATTTATCTCAAACAAGGGGTACTGGAAGGAGAATAACCACTTCAGAATCTGATTCAATGTCCAGTCATG
AGTCTCGAAACAGTAGGAGACCTTCATCATCTGTTGCAGGTGGTTTACCTCCACTAGGAAGGCCTGGTGCTGGTTCACAGGCATCCGATAGGAGGGTAGGATCTTCAATG
TCTGGTTATCGAAAAGATGAATACAGTTGGAGATATGATGGCAATGAGCTTCCAGAGGATGTTATAAGAACCTCAACTGCTTTGGAAAACCTGCAACTGGATCGAAAAGC
AAGAAATTTAACCTCATCTTGGCGGAAAGAAAGACACTTCGCACCTTCTCAACAATGGAACAATCAGATCTTGAAAGAGGAGTTGGGCTACAACAGAAAGGATTTCCTAG
GCCGGCCTAGAAAGTTTTGGAGGATTGCAACTTATGTCAGCCAAAGAGCATTTAGCCCTAGTGATACTTGGGCCAGAAGCTCCTCCAAGGGAATGAAGTGCACTATTGAG
GCCCAAAGAATACACCCATCCTTCTCAAAACAAAAGAAAAGAGCATCATCTCTGATCTAA
Protein sequence
MKILTVFRVSQGKNREPFLFQSQASASRSFSRIQSRPQRSSFGIAFDIDGVVLRGNLPIGGSPRALRRLYADSTSSGTLKVPFLFLTNGGGTPESRRAIELSDLLGVNIL
PSQVVQGHSTFKSLLNSFENEFIIATGKGQPALVMSEYGFKKVFSIDEYASFFENIDPVSQYKNWATKQGFVLSCNSEKLMRRQSVLSERVKAAFVVSDPVDWGRDIQVL
CDILRSGGLPGRENGIQVPLYFAADDFEYQAAFPFKRLGIGAFRIALESVFNRIHHRPLEYVSYGKPNPLVFKIVETVLRQILSSHCDDHFVNNGDTEFNHFKTLYMIGD
NPVVDIKGAREDTLVFYSNKDGVFRAKKITQTFQLILYLHVLGKDGLNNPLSDRVITFAGFGIYPLRQNLLFFHPGALRKSGVERAEVPSPAMSKLMSNHMRTRSIPLYH
RRCQEYHLRIFHPTVRKARSCSVEHRGHRFRINPSPRPHFPECDLSDGKSEQVRRNWLLHHKLFHHSPFTAPQRINPSGQHRTLANAWWAALHIPCSHHRDRLIRKRNGE
KRPFDRIGRRKRKGYRGKGLRVDNRMNSKRTSIERERTRVRTRERNQGRSMDGSGGKVEEEREGNVNGERVGRIGERVWVNGRHCVEEEEGEEPKWLSALRHFHFRFLPP
HGRVVDAVTRLPAEQQRTGTVRASERAAKRKPEKWRTMDDYTREMMDLKTLVTRTLEKKGVLAKIRVIFYFFLLLQFCVISFRFPPFPASTDLLLVFLLDFLEFGGYSAV
GWVLCPWYLGRLRPSLFMGVLELLDLGLIEKKNFLVLIGVRMCSEAELRASVFEAIEEEDRVIEKDEGLPPALLGSCNDRAKQLHASPSGRLLTALICEYLDWAQLNHTL
KVYLPECNMQKDSWKAELKEFSSKNGYDLNRNGDSGPLLLDVLEGFLKFENLSQTRGTGRRITTSESDSMSSHESRNSRRPSSSVAGGLPPLGRPGAGSQASDRRVGSSM
SGYRKDEYSWRYDGNELPEDVIRTSTALENLQLDRKARNLTSSWRKERHFAPSQQWNNQILKEELGYNRKDFLGRPRKFWRIATYVSQRAFSPSDTWARSSSKGMKCTIE
AQRIHPSFSKQKKRASSLI