; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr016472 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr016472
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description1,2-dihydroxy-3-keto-5-methylthiopentene dioxygenase
Genome locationtig00152936:287846..297516
RNA-Seq ExpressionSgr016472
SyntenySgr016472
Gene Ontology termsGO:0009611 - response to wounding (biological process)
GO:0019509 - L-methionine salvage from methylthioadenosine (biological process)
GO:0031347 - regulation of defense response (biological process)
GO:2000022 - regulation of jasmonic acid mediated signaling pathway (biological process)
GO:0005634 - nucleus (cellular component)
GO:0010309 - acireductone dioxygenase [iron(II)-requiring] activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR004313 - Acireductone dioxygenase ARD family
IPR010399 - Tify domain
IPR011051 - RmlC-like cupin domain superfamily
IPR014710 - RmlC-like jelly roll fold
IPR018467 - CO/COL/TOC1, conserved site
IPR027496 - 1,2-dihydroxy-3-keto-5-methylthiopentene dioxygenase, eukaryotes
IPR040390 - TIFY/JAZ family


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAB4292121.1 unnamed protein product [Prunus armeniaca]3.9e-17365.42Show/hide
Query:  MSAGAATFRSILDKPLNQLTEDDISQLTREDCRKYLKEKGMRRPSWNKSQAIQQVISLKALLEPCDDSGAGALKKVVV-----LPRVNSNQGDSPKEPSD
        M+A   +FRSIL+KPLNQLTEDDISQLTREDCRKYLKEKGMRRPSWNKSQAIQQVISLKALLEP DDSGAGAL+++VV      PR  SN   S KE S 
Subjt:  MSAGAATFRSILDKPLNQLTEDDISQLTREDCRKYLKEKGMRRPSWNKSQAIQQVISLKALLEPCDDSGAGALKKVVV-----LPRVNSNQGDSPKEPSD

Query:  DAQVTMSVDESAYSNVETAKSTPEDPPAEPDNKVASPRDLCDTNGVDGQMTIFYCGKVNVYDGVPPDKAWAIMHLAASPIHFPQNHPLGGTAACQSPPCL
        D QV++S DE     V   K   ED PA+ D K  SPR+ C T+    QMTIFYCGKVNVYDGVPPDKA AIMHLAA P H P ++  GGTAA +S  C 
Subjt:  DAQVTMSVDESAYSNVETAKSTPEDPPAEPDNKVASPRDLCDTNGVDGQMTIFYCGKVNVYDGVPPDKAWAIMHLAASPIHFPQNHPLGGTAACQSPPCL

Query:  LQTANERDDFFPPSATIYRNVHTEKMVEYPQQQHVKGTSTRDSDVEGQASRKVSLQRYLEKRKDRGRLKNKKNTGLPSSSLEGYMNHQMRTHISNKNLGQ
         QTA ++D F PPSAT  + + TEK+ EY QQ   KG STRD D EGQASRKVSL+RY EKRKDRGRLK KKN G  SSSLE ++NHQ+RTH SN N  Q
Subjt:  LQTANERDDFFPPSATIYRNVHTEKMVEYPQQQHVKGTSTRDSDVEGQASRKVSLQRYLEKRKDRGRLKNKKNTGLPSSSLEGYMNHQMRTHISNKNLGQ

Query:  IVTSSLSPTGVAKAFVGTADNQPKLACFPVDLNVKGRGVYIRMNGMSISSADQFCNSDTSGVFSTIQLSEQVRRARSSMAIEAWFMDDTNEDQRLPHHRD
          TSS    G+ +    TADNQPK  C PVDLN KG G +    GM++ S            F   +L  +     S+MAIEAWFMD+++ED RLPHHR+
Subjt:  IVTSSLSPTGVAKAFVGTADNQPKLACFPVDLNVKGRGVYIRMNGMSISSADQFCNSDTSGVFSTIQLSEQVRRARSSMAIEAWFMDDTNEDQRLPHHRD

Query:  PKEFVSMDQLEELGVLYWKLNPKDYENDEELQKIREDRGYNYVDLLDICPEKLANYEGKLKDFYTEHIHANEEIRYCLDGSGYFDVRDKNDRWIRIWIKL
        PKE V +D L ELGVLYW+LNPKDYENDE+L+ IRE RGYNY+DLLDICPEKL NYE KLK+FYTEHIHA+EEIRYCL+GSGYFDVRDKNDRWIRIWIK 
Subjt:  PKEFVSMDQLEELGVLYWKLNPKDYENDEELQKIREDRGYNYVDLLDICPEKLANYEGKLKDFYTEHIHANEEIRYCLDGSGYFDVRDKNDRWIRIWIKL

Query:  GDLIILPAG
        GDLIILPAG
Subjt:  GDLIILPAG

XP_022138751.1 protein TIFY 4B isoform X2 [Momordica charantia]1.8e-17394.56Show/hide
Query:  MSAGAATFRSILDKPLNQLTEDDISQLTREDCRKYLKEKGMRRPSWNKSQAIQQVISLKALLEPCDDSGAGALKKVVVLPRVNSNQGDSPKEPSDDAQVT
        MSAGAATFRSILDKPLNQLTEDDISQLTREDCRKYLKEKGMRRPSWNKSQAIQQVISLKALLEPCDDSGAGAL+KVVV PRVNSNQGDSPKEPSDDAQ T
Subjt:  MSAGAATFRSILDKPLNQLTEDDISQLTREDCRKYLKEKGMRRPSWNKSQAIQQVISLKALLEPCDDSGAGALKKVVVLPRVNSNQGDSPKEPSDDAQVT

Query:  MSVDESAYSNVETAKSTPEDPPAEPDNKVASPRDLCDTNGVDGQMTIFYCGKVNVYDGVPPDKAWAIMHLAASPIHFPQNHPLGGTAACQSPPCLLQTAN
        MSVDESAYSNVETAKSTPEDPP EPDNKV SPRD C+TNG DGQMTIFYCGKVNVYDGV PDKA AIMHLAASP+ FPQNHPLGGTAACQSPPCLLQTAN
Subjt:  MSVDESAYSNVETAKSTPEDPPAEPDNKVASPRDLCDTNGVDGQMTIFYCGKVNVYDGVPPDKAWAIMHLAASPIHFPQNHPLGGTAACQSPPCLLQTAN

Query:  ERDDFFPPSATIYRNVHT-EKMVEYPQQQHVKGTSTRDSDVEGQASRKVSLQRYLEKRKDRGRLKNKKNTGLPSSSLEGYMNHQMRTHISNKNLGQIVTS
        ERDDFFPPSA+IYRNVHT EKMVEYPQQQHVKG STRDSDVEGQASRKVSLQRYLEKRKDRGRLKNKKNTGLPSSSLEGYMNHQMRTHISNKNLGQ VTS
Subjt:  ERDDFFPPSATIYRNVHT-EKMVEYPQQQHVKGTSTRDSDVEGQASRKVSLQRYLEKRKDRGRLKNKKNTGLPSSSLEGYMNHQMRTHISNKNLGQIVTS

Query:  SLSPTGVAKAFVGTADNQPKLACFPVDLNVK
        SLSPTGVAKAFVGTADNQ K ACFPVDLNVK
Subjt:  SLSPTGVAKAFVGTADNQPKLACFPVDLNVK

XP_023000521.1 protein TIFY 4B-like isoform X1 [Cucurbita maxima]6.7e-17392.45Show/hide
Query:  MSAGAATFRSILDKPLNQLTEDDISQLTREDCRKYLKEKGMRRPSWNKSQAIQQVISLKALLEPCDDSGAGALKKVVVLPRVNSNQGDSPKEPSDDAQVT
        MSAGAATFRSILDKPLNQLTEDDISQLTREDCRKYLKEKGMRRPSWNKSQAIQQVI LKALLEPCDDSGA AL+KV V PRVNSNQG SP+EPSDDAQVT
Subjt:  MSAGAATFRSILDKPLNQLTEDDISQLTREDCRKYLKEKGMRRPSWNKSQAIQQVISLKALLEPCDDSGAGALKKVVVLPRVNSNQGDSPKEPSDDAQVT

Query:  MSVDESAYSNVETAKSTPEDPPAEPDNKVASPRDLCDTNGVDGQMTIFYCGKVNVYDGVPPDKAWAIMHLAASPIHFPQNHPLGGTAACQSPPCLLQTAN
        +SVDESAYSN+ETAKSTPEDPPA PDNKV SPRD CDTNG+DGQMTIFYCGKVNVYDGVP DKAWAIMHLAASP+HFPQNH +GGTA+CQSPPC+LQ AN
Subjt:  MSVDESAYSNVETAKSTPEDPPAEPDNKVASPRDLCDTNGVDGQMTIFYCGKVNVYDGVPPDKAWAIMHLAASPIHFPQNHPLGGTAACQSPPCLLQTAN

Query:  ERDDFFPPSATIYRNVHTEKMVEYP-QQQHVKGTSTRDSDVEGQASRKVSLQRYLEKRKDRGRLKNKKNTGLPSSSLEGYMNHQMRTHISNKNLGQIVTS
        ERDDFFPPS TIYRNVHTEKMVEYP QQQH+KGTSTRDSD+EGQASRKVSLQRYLEKRKDRGRLKNKKNTGL SSSLEGYMNHQMRTHISNKNLGQIVTS
Subjt:  ERDDFFPPSATIYRNVHTEKMVEYP-QQQHVKGTSTRDSDVEGQASRKVSLQRYLEKRKDRGRLKNKKNTGLPSSSLEGYMNHQMRTHISNKNLGQIVTS

Query:  SLSPTGVAKAFVGTADNQPKLACFPVDLNVK
        SLSPTGVAKAFVGTADNQPKLACFPVDLNVK
Subjt:  SLSPTGVAKAFVGTADNQPKLACFPVDLNVK

XP_023514342.1 protein TIFY 4B-like isoform X1 [Cucurbita pepo subsp. pepo]1.0e-17392.75Show/hide
Query:  MSAGAATFRSILDKPLNQLTEDDISQLTREDCRKYLKEKGMRRPSWNKSQAIQQVISLKALLEPCDDSGAGALKKVVVLPRVNSNQGDSPKEPSDDAQVT
        MSAGAATFRSILDKPLNQLTEDDISQLTREDCRKYLKEKGMRRPSWNKSQAIQQVI LKALLEPCDDSGA AL+KV V PRVNSNQG SP++PSDDAQVT
Subjt:  MSAGAATFRSILDKPLNQLTEDDISQLTREDCRKYLKEKGMRRPSWNKSQAIQQVISLKALLEPCDDSGAGALKKVVVLPRVNSNQGDSPKEPSDDAQVT

Query:  MSVDESAYSNVETAKSTPEDPPAEPDNKVASPRDLCDTNGVDGQMTIFYCGKVNVYDGVPPDKAWAIMHLAASPIHFPQNHPLGGTAACQSPPCLLQTAN
        +SVDESAYSN+ETAKSTPEDPPAEPDNKV SPRD CDTNG+DGQMTIFYCGKVNVYDGVP DKAWAIMHLAASP+HFPQNH +GGTA+CQSPPCLLQ AN
Subjt:  MSVDESAYSNVETAKSTPEDPPAEPDNKVASPRDLCDTNGVDGQMTIFYCGKVNVYDGVPPDKAWAIMHLAASPIHFPQNHPLGGTAACQSPPCLLQTAN

Query:  ERDDFFPPSATIYRNVHTEKMVEYP-QQQHVKGTSTRDSDVEGQASRKVSLQRYLEKRKDRGRLKNKKNTGLPSSSLEGYMNHQMRTHISNKNLGQIVTS
        ERDDFFPPS TIYRNVHTEKMVEYP QQQH+KGTSTRDSD+EGQASRKVSLQRYLEKRKDRGRLKNKKNTGL SSSLEGYMNHQMRTHISNKNLGQIVTS
Subjt:  ERDDFFPPSATIYRNVHTEKMVEYP-QQQHVKGTSTRDSDVEGQASRKVSLQRYLEKRKDRGRLKNKKNTGLPSSSLEGYMNHQMRTHISNKNLGQIVTS

Query:  SLSPTGVAKAFVGTADNQPKLACFPVDLNVK
        SLSPTGVAKAFVGTADNQPKLACFPVDLNVK
Subjt:  SLSPTGVAKAFVGTADNQPKLACFPVDLNVK

XP_038906698.1 protein TIFY 4B-like isoform X3 [Benincasa hispida]3.0e-17393.07Show/hide
Query:  MSAGAATFRSILDKPLNQLTEDDISQLTREDCRKYLKEKGMRRPSWNKSQAIQQVISLKALLEPCDDSGAGALKKVVVLPRVNSNQGDSPKEPSDDAQVT
        MSAG ATFRSILDKPLNQLTEDDISQLTREDCRKYLKEKGMRRPSWNKSQAIQQVISLKALLEPCDDSGAGAL+KVVV PRVNSNQGDSPKEPSDDAQVT
Subjt:  MSAGAATFRSILDKPLNQLTEDDISQLTREDCRKYLKEKGMRRPSWNKSQAIQQVISLKALLEPCDDSGAGALKKVVVLPRVNSNQGDSPKEPSDDAQVT

Query:  MSVDESAYSNVETAKSTPEDPPAEPDNKVASPRDLCDTNGVDGQMTIFYCGKVNVYDGVPPDKAWAIMHLAASPIHFPQNHPLGGTAACQSPPCLLQTAN
        MSVDESAYSNVETAKSTPEDPP EPDN V SPRD  DTNGVDGQMTIFYCGKVNVYDGVPPDKAWAIMHLAASPIHFPQNHPL GTA CQSPPCLLQ ++
Subjt:  MSVDESAYSNVETAKSTPEDPPAEPDNKVASPRDLCDTNGVDGQMTIFYCGKVNVYDGVPPDKAWAIMHLAASPIHFPQNHPLGGTAACQSPPCLLQTAN

Query:  ERDDFFPPSATIYRNVHTEKMVEYP--QQQHVKGTSTRDSDVEGQASRKVSLQRYLEKRKDRGRLKNKKNTGLPSSSLEGYMNHQMRTHISNKNLGQIVT
        +RDDFFPPSAT +RNVHTEKMVE+P  QQQH KGTSTRDSDVEGQASRKVSLQRYLEKRKDRGRLKNKKNTGL SSSLEGYMNHQMRTH+SNKNLGQIVT
Subjt:  ERDDFFPPSATIYRNVHTEKMVEYP--QQQHVKGTSTRDSDVEGQASRKVSLQRYLEKRKDRGRLKNKKNTGLPSSSLEGYMNHQMRTHISNKNLGQIVT

Query:  SSLSPTGVAKAFVGTADNQPKLACFPVDLNVK
        SSLSPTGVAKAFVGTADNQPKL CFPVDLNVK
Subjt:  SSLSPTGVAKAFVGTADNQPKLACFPVDLNVK

TrEMBL top hitse value%identityAlignment
A0A6J1CAB6 protein TIFY 4B isoform X18.0e-17292.88Show/hide
Query:  MSAGAATFRSILDKPLNQLTEDDISQLTREDCRKYLKEKGMRRPSWNKSQAIQQVISLKALLEPCDDSGAGALKKVVVLPRVNSNQGDSPKEPSDDAQVT
        MSAGAATFRSILDKPLNQLTEDDISQLTREDCRKYLKEKGMRRPSWNKSQAIQQVISLKALLEPCDDSGAGAL+KVVV PRVNSNQGDSPKEPSDDAQ T
Subjt:  MSAGAATFRSILDKPLNQLTEDDISQLTREDCRKYLKEKGMRRPSWNKSQAIQQVISLKALLEPCDDSGAGALKKVVVLPRVNSNQGDSPKEPSDDAQVT

Query:  MSVDESAYSNVETAKSTPEDPPAEPDNKVASPRDLCDTNGVDGQMTIFYCGKVNVYDGVPPDK------AWAIMHLAASPIHFPQNHPLGGTAACQSPPC
        MSVDESAYSNVETAKSTPEDPP EPDNKV SPRD C+TNG DGQMTIFYCGKVNVYDGV PDK      A AIMHLAASP+ FPQNHPLGGTAACQSPPC
Subjt:  MSVDESAYSNVETAKSTPEDPPAEPDNKVASPRDLCDTNGVDGQMTIFYCGKVNVYDGVPPDK------AWAIMHLAASPIHFPQNHPLGGTAACQSPPC

Query:  LLQTANERDDFFPPSATIYRNVHT-EKMVEYPQQQHVKGTSTRDSDVEGQASRKVSLQRYLEKRKDRGRLKNKKNTGLPSSSLEGYMNHQMRTHISNKNL
        LLQTANERDDFFPPSA+IYRNVHT EKMVEYPQQQHVKG STRDSDVEGQASRKVSLQRYLEKRKDRGRLKNKKNTGLPSSSLEGYMNHQMRTHISNKNL
Subjt:  LLQTANERDDFFPPSATIYRNVHT-EKMVEYPQQQHVKGTSTRDSDVEGQASRKVSLQRYLEKRKDRGRLKNKKNTGLPSSSLEGYMNHQMRTHISNKNL

Query:  GQIVTSSLSPTGVAKAFVGTADNQPKLACFPVDLNVK
        GQ VTSSLSPTGVAKAFVGTADNQ K ACFPVDLNVK
Subjt:  GQIVTSSLSPTGVAKAFVGTADNQPKLACFPVDLNVK

A0A6J1CAL8 protein TIFY 4B isoform X28.6e-17494.56Show/hide
Query:  MSAGAATFRSILDKPLNQLTEDDISQLTREDCRKYLKEKGMRRPSWNKSQAIQQVISLKALLEPCDDSGAGALKKVVVLPRVNSNQGDSPKEPSDDAQVT
        MSAGAATFRSILDKPLNQLTEDDISQLTREDCRKYLKEKGMRRPSWNKSQAIQQVISLKALLEPCDDSGAGAL+KVVV PRVNSNQGDSPKEPSDDAQ T
Subjt:  MSAGAATFRSILDKPLNQLTEDDISQLTREDCRKYLKEKGMRRPSWNKSQAIQQVISLKALLEPCDDSGAGALKKVVVLPRVNSNQGDSPKEPSDDAQVT

Query:  MSVDESAYSNVETAKSTPEDPPAEPDNKVASPRDLCDTNGVDGQMTIFYCGKVNVYDGVPPDKAWAIMHLAASPIHFPQNHPLGGTAACQSPPCLLQTAN
        MSVDESAYSNVETAKSTPEDPP EPDNKV SPRD C+TNG DGQMTIFYCGKVNVYDGV PDKA AIMHLAASP+ FPQNHPLGGTAACQSPPCLLQTAN
Subjt:  MSVDESAYSNVETAKSTPEDPPAEPDNKVASPRDLCDTNGVDGQMTIFYCGKVNVYDGVPPDKAWAIMHLAASPIHFPQNHPLGGTAACQSPPCLLQTAN

Query:  ERDDFFPPSATIYRNVHT-EKMVEYPQQQHVKGTSTRDSDVEGQASRKVSLQRYLEKRKDRGRLKNKKNTGLPSSSLEGYMNHQMRTHISNKNLGQIVTS
        ERDDFFPPSA+IYRNVHT EKMVEYPQQQHVKG STRDSDVEGQASRKVSLQRYLEKRKDRGRLKNKKNTGLPSSSLEGYMNHQMRTHISNKNLGQ VTS
Subjt:  ERDDFFPPSATIYRNVHT-EKMVEYPQQQHVKGTSTRDSDVEGQASRKVSLQRYLEKRKDRGRLKNKKNTGLPSSSLEGYMNHQMRTHISNKNLGQIVTS

Query:  SLSPTGVAKAFVGTADNQPKLACFPVDLNVK
        SLSPTGVAKAFVGTADNQ K ACFPVDLNVK
Subjt:  SLSPTGVAKAFVGTADNQPKLACFPVDLNVK

A0A6J1KG25 protein TIFY 4B-like isoform X13.2e-17392.45Show/hide
Query:  MSAGAATFRSILDKPLNQLTEDDISQLTREDCRKYLKEKGMRRPSWNKSQAIQQVISLKALLEPCDDSGAGALKKVVVLPRVNSNQGDSPKEPSDDAQVT
        MSAGAATFRSILDKPLNQLTEDDISQLTREDCRKYLKEKGMRRPSWNKSQAIQQVI LKALLEPCDDSGA AL+KV V PRVNSNQG SP+EPSDDAQVT
Subjt:  MSAGAATFRSILDKPLNQLTEDDISQLTREDCRKYLKEKGMRRPSWNKSQAIQQVISLKALLEPCDDSGAGALKKVVVLPRVNSNQGDSPKEPSDDAQVT

Query:  MSVDESAYSNVETAKSTPEDPPAEPDNKVASPRDLCDTNGVDGQMTIFYCGKVNVYDGVPPDKAWAIMHLAASPIHFPQNHPLGGTAACQSPPCLLQTAN
        +SVDESAYSN+ETAKSTPEDPPA PDNKV SPRD CDTNG+DGQMTIFYCGKVNVYDGVP DKAWAIMHLAASP+HFPQNH +GGTA+CQSPPC+LQ AN
Subjt:  MSVDESAYSNVETAKSTPEDPPAEPDNKVASPRDLCDTNGVDGQMTIFYCGKVNVYDGVPPDKAWAIMHLAASPIHFPQNHPLGGTAACQSPPCLLQTAN

Query:  ERDDFFPPSATIYRNVHTEKMVEYP-QQQHVKGTSTRDSDVEGQASRKVSLQRYLEKRKDRGRLKNKKNTGLPSSSLEGYMNHQMRTHISNKNLGQIVTS
        ERDDFFPPS TIYRNVHTEKMVEYP QQQH+KGTSTRDSD+EGQASRKVSLQRYLEKRKDRGRLKNKKNTGL SSSLEGYMNHQMRTHISNKNLGQIVTS
Subjt:  ERDDFFPPSATIYRNVHTEKMVEYP-QQQHVKGTSTRDSDVEGQASRKVSLQRYLEKRKDRGRLKNKKNTGLPSSSLEGYMNHQMRTHISNKNLGQIVTS

Query:  SLSPTGVAKAFVGTADNQPKLACFPVDLNVK
        SLSPTGVAKAFVGTADNQPKLACFPVDLNVK
Subjt:  SLSPTGVAKAFVGTADNQPKLACFPVDLNVK

A0A6J5TCG8 1,2-dihydroxy-3-keto-5-methylthiopentene dioxygenase4.2e-17365.23Show/hide
Query:  MSAGAATFRSILDKPLNQLTEDDISQLTREDCRKYLKEKGMRRPSWNKSQAIQQVISLKALLEPCDDSGAGALKKVVV-----LPRVNSNQGDSPKEPSD
        M+A   +FRSIL+KPLNQLTEDDISQLTREDCRKYLKEKGMRRPSWNKSQAIQQVISLKALLEP DDSGAGAL+++VV      PR  SN   S KE S 
Subjt:  MSAGAATFRSILDKPLNQLTEDDISQLTREDCRKYLKEKGMRRPSWNKSQAIQQVISLKALLEPCDDSGAGALKKVVV-----LPRVNSNQGDSPKEPSD

Query:  DAQVTMSVDESAYSNVETAKSTPEDPPAEPDNKVASPRDLCDTNGVDGQMTIFYCGKVNVYDGVPPDKAWAIMHLAASPIHFPQNHPLGGTAACQSPPCL
        D QV++S DE     V   K   ED PA+ D K  SPR+ C T+    QMTIFYCGKVNVYDGVPPDKA AIMHLAA P H P ++  GGTAA +S  C 
Subjt:  DAQVTMSVDESAYSNVETAKSTPEDPPAEPDNKVASPRDLCDTNGVDGQMTIFYCGKVNVYDGVPPDKAWAIMHLAASPIHFPQNHPLGGTAACQSPPCL

Query:  LQTANERDDFFPPSATIYRNVHTEKMVEYPQQQHVKGTSTRDSDVEGQASRKVSLQRYLEKRKDRGRLKNKKNTGLPSSSLEGYMNHQMRTHISNKNLGQ
         QTA ++D F PPSAT  + + TEK+ EY QQ   KG STRD D EGQASRKVSL+RY EKRKDRGRLK KKN G  SSSLE ++NHQ+RTH SN N  Q
Subjt:  LQTANERDDFFPPSATIYRNVHTEKMVEYPQQQHVKGTSTRDSDVEGQASRKVSLQRYLEKRKDRGRLKNKKNTGLPSSSLEGYMNHQMRTHISNKNLGQ

Query:  IVTSSLSPTGVAKAFVGTADNQPKLACFPVDLNVKGRGVYIRMNGMSISSADQFCNSDTSGVFSTIQLSEQVRRARSSMAIEAWFMDDTNEDQRLPHHRD
          TSS    G+ +    TADNQPK  C PVDLN KG G +    GM++ S            F   +L  +     S+MAIEAWFMD+++ED RLPHHR+
Subjt:  IVTSSLSPTGVAKAFVGTADNQPKLACFPVDLNVKGRGVYIRMNGMSISSADQFCNSDTSGVFSTIQLSEQVRRARSSMAIEAWFMDDTNEDQRLPHHRD

Query:  PKEFVSMDQLEELGVLYWKLNPKDYENDEELQKIREDRGYNYVDLLDICPEKLANYEGKLKDFYTEHIHANEEIRYCLDGSGYFDVRDKNDRWIRIWIKL
        PKE V +D L ELGVLYW+LNPKDYEND++L+ IRE RGYNY+DLLDICPEKL NYE KLK+FYTEHIHA+EEIRYCL+GSGYFDVRDKNDRWIRIWIK 
Subjt:  PKEFVSMDQLEELGVLYWKLNPKDYENDEELQKIREDRGYNYVDLLDICPEKLANYEGKLKDFYTEHIHANEEIRYCLDGSGYFDVRDKNDRWIRIWIKL

Query:  GDLIILPAG
        GDLIILPAG
Subjt:  GDLIILPAG

A0A6J5VVP3 1,2-dihydroxy-3-keto-5-methylthiopentene dioxygenase1.9e-17365.42Show/hide
Query:  MSAGAATFRSILDKPLNQLTEDDISQLTREDCRKYLKEKGMRRPSWNKSQAIQQVISLKALLEPCDDSGAGALKKVVV-----LPRVNSNQGDSPKEPSD
        M+A   +FRSIL+KPLNQLTEDDISQLTREDCRKYLKEKGMRRPSWNKSQAIQQVISLKALLEP DDSGAGAL+++VV      PR  SN   S KE S 
Subjt:  MSAGAATFRSILDKPLNQLTEDDISQLTREDCRKYLKEKGMRRPSWNKSQAIQQVISLKALLEPCDDSGAGALKKVVV-----LPRVNSNQGDSPKEPSD

Query:  DAQVTMSVDESAYSNVETAKSTPEDPPAEPDNKVASPRDLCDTNGVDGQMTIFYCGKVNVYDGVPPDKAWAIMHLAASPIHFPQNHPLGGTAACQSPPCL
        D QV++S DE     V   K   ED PA+ D K  SPR+ C T+    QMTIFYCGKVNVYDGVPPDKA AIMHLAA P H P ++  GGTAA +S  C 
Subjt:  DAQVTMSVDESAYSNVETAKSTPEDPPAEPDNKVASPRDLCDTNGVDGQMTIFYCGKVNVYDGVPPDKAWAIMHLAASPIHFPQNHPLGGTAACQSPPCL

Query:  LQTANERDDFFPPSATIYRNVHTEKMVEYPQQQHVKGTSTRDSDVEGQASRKVSLQRYLEKRKDRGRLKNKKNTGLPSSSLEGYMNHQMRTHISNKNLGQ
         QTA ++D F PPSAT  + + TEK+ EY QQ   KG STRD D EGQASRKVSL+RY EKRKDRGRLK KKN G  SSSLE ++NHQ+RTH SN N  Q
Subjt:  LQTANERDDFFPPSATIYRNVHTEKMVEYPQQQHVKGTSTRDSDVEGQASRKVSLQRYLEKRKDRGRLKNKKNTGLPSSSLEGYMNHQMRTHISNKNLGQ

Query:  IVTSSLSPTGVAKAFVGTADNQPKLACFPVDLNVKGRGVYIRMNGMSISSADQFCNSDTSGVFSTIQLSEQVRRARSSMAIEAWFMDDTNEDQRLPHHRD
          TSS    G+ +    TADNQPK  C PVDLN KG G +    GM++ S            F   +L  +     S+MAIEAWFMD+++ED RLPHHR+
Subjt:  IVTSSLSPTGVAKAFVGTADNQPKLACFPVDLNVKGRGVYIRMNGMSISSADQFCNSDTSGVFSTIQLSEQVRRARSSMAIEAWFMDDTNEDQRLPHHRD

Query:  PKEFVSMDQLEELGVLYWKLNPKDYENDEELQKIREDRGYNYVDLLDICPEKLANYEGKLKDFYTEHIHANEEIRYCLDGSGYFDVRDKNDRWIRIWIKL
        PKE V +D L ELGVLYW+LNPKDYENDE+L+ IRE RGYNY+DLLDICPEKL NYE KLK+FYTEHIHA+EEIRYCL+GSGYFDVRDKNDRWIRIWIK 
Subjt:  PKEFVSMDQLEELGVLYWKLNPKDYENDEELQKIREDRGYNYVDLLDICPEKLANYEGKLKDFYTEHIHANEEIRYCLDGSGYFDVRDKNDRWIRIWIKL

Query:  GDLIILPAG
        GDLIILPAG
Subjt:  GDLIILPAG

SwissProt top hitse value%identityAlignment
A2XCT8 1,2-dihydroxy-3-keto-5-methylthiopentene dioxygenase 29.4e-5367.44Show/hide
Query:  IEAWFMDDTNEDQRLPHHRDPKEFVSMDQLEELGVLYWKLNPKDYENDEELQKIREDRGYNYVDLLDICPEKLANYEGKLKDFYTEHIHANEEIRYCLDG
        IEAW+MDD+ EDQRLPHHR+PKEF+ + +L ELG+L W+LN  D+ENDE L+KIRE RGY+Y+D+ D+CPEKL NYE KLK+F+ EH+H +EEIRYCL+G
Subjt:  IEAWFMDDTNEDQRLPHHRDPKEFVSMDQLEELGVLYWKLNPKDYENDEELQKIREDRGYNYVDLLDICPEKLANYEGKLKDFYTEHIHANEEIRYCLDG

Query:  SGYFDVRDKNDRWIRIWIKLGDLIILPAG
        SGYFDVRD+ND+WIR+ +K G +I+LPAG
Subjt:  SGYFDVRDKNDRWIRIWIKLGDLIILPAG

D7T737 1,2-dihydroxy-3-keto-5-methylthiopentene dioxygenase 12.1e-6078.03Show/hide
Query:  MAIE-AWFMDDTNEDQRLPHHRDPKEFVSMDQLEELGVLYWKLNPKDYENDEELQKIREDRGYNYVDLLDICPEKLANYEGKLKDFYTEHIHANEEIRYC
        MAIE AWFM++ +EDQRLPHHR+PK+FVS+D L +LGVLYWKLNPKDYEND+EL++IRE RGYNY+DLLD+CPE++ NYE KLK+FYTEHIH +EEIRYC
Subjt:  MAIE-AWFMDDTNEDQRLPHHRDPKEFVSMDQLEELGVLYWKLNPKDYENDEELQKIREDRGYNYVDLLDICPEKLANYEGKLKDFYTEHIHANEEIRYC

Query:  LDGSGYFDVRDKNDRWIRIWIKLGDLIILPAG
        L+GSGYFDVRDK DRWIRIWIK GD+I+LPAG
Subjt:  LDGSGYFDVRDKNDRWIRIWIKLGDLIILPAG

Q7XA73 Protein TIFY 4A5.2e-5944.02Show/hide
Query:  MSAGAATFRSILDKPLNQLTEDDISQLTREDCRKYLKEKGMRRPSWNKSQAIQQVISLKALLEPCDDSGAGALKKVVVL-----PRVNSNQGDSPKEPSD
        M  G +  +SIL KPL  LTE+DISQLTREDCRK+LK+KGMRRPSWNKSQAIQQV+SLKAL EP DDSGAG  +K++V      PRV +   +   E   
Subjt:  MSAGAATFRSILDKPLNQLTEDDISQLTREDCRKYLKEKGMRRPSWNKSQAIQQVISLKALLEPCDDSGAGALKKVVVL-----PRVNSNQGDSPKEPSD

Query:  DAQVTMSVDESAYSNVETAKSTPED-------PPAEPDNKVASPRDLCDTNGVDGQMTIFYCGKVNVYDGVPPDKAWAIMHLAASPIHFPQNHPLGGTAA
          +V+   D  A   +++ +S              +      SPR   +T+ + GQMTIFY GKVNVYDG+PP+KA +IMH AA+PI  P+N        
Subjt:  DAQVTMSVDESAYSNVETAKSTPED-------PPAEPDNKVASPRDLCDTNGVDGQMTIFYCGKVNVYDGVPPDKAWAIMHLAASPIHFPQNHPLGGTAA

Query:  CQSPPCLLQTANERDDFFPPSATIYRNVHTEKMVEYPQQQHVKGTSTRDSDVEGQASRKVSLQRYLEKRKDRGRLKNKKNTGLPSSSLEGYMNHQMRTHI
                         F  S  I + +  EKM+E PQ+   K  S+RDS +EGQA+RKVSLQRY EKRKDR   K KK  G+ SSSLE ++N Q R   
Subjt:  CQSPPCLLQTANERDDFFPPSATIYRNVHTEKMVEYPQQQHVKGTSTRDSDVEGQASRKVSLQRYLEKRKDRGRLKNKKNTGLPSSSLEGYMNHQMRTHI

Query:  SNKNLGQIVTSSLSPTGVAKAFVGTADNQPKLACFPVDLNVKG
            +    + +L  TG +     + ++Q K     VDLN +G
Subjt:  SNKNLGQIVTSSLSPTGVAKAFVGTADNQPKLACFPVDLNVKG

Q8GY55 Protein TIFY 4B6.3e-6547.51Show/hide
Query:  MSAGAATFRSILDKPLNQLTEDDISQLTREDCRKYLKEKGMRRPSWNKSQAIQQVISLKALLEPCDDSGAGALKKVVVL-----PRVNSNQGDSPKEPSD
        M  G  T +SIL+KPL  LTE+DISQLTREDCRK+LKEKGMRRPSWNKSQAIQQV+SLKAL EP DDSGAG L+K++V      PRV +   +   E   
Subjt:  MSAGAATFRSILDKPLNQLTEDDISQLTREDCRKYLKEKGMRRPSWNKSQAIQQVISLKALLEPCDDSGAGALKKVVVL-----PRVNSNQGDSPKEPSD

Query:  DAQVTMSVDESAYSNVETAKSTPEDPP-----AEPDNK---VASPRDLCDTNGVDGQMTIFYCGKVNVYDGVPPDKAWAIMHLAASPIHFPQNHPLGGTA
          ++ +  D+ A    ++ +S           A+ D+      SPR   +TN V GQMTIFY GKVNVYDGVPP+KA +IMH AA+PI  P+N       
Subjt:  DAQVTMSVDESAYSNVETAKSTPEDPP-----AEPDNK---VASPRDLCDTNGVDGQMTIFYCGKVNVYDGVPPDKAWAIMHLAASPIHFPQNHPLGGTA

Query:  ACQSPPCLLQTANERDDFFPPSATIYRNVHTEKMVEYPQQQHVKGTSTRDSDVEGQASRKVSLQRYLEKRKDRGRLKNKKNTGLPSSSLEGYMNHQMRTH
                          F  S  I + +  EKMVE PQ    K  ++RDSDVEGQA+RKVSLQRYLEKRKDR   K KK  G+ SSSLE ++N Q R  
Subjt:  ACQSPPCLLQTANERDDFFPPSATIYRNVHTEKMVEYPQQQHVKGTSTRDSDVEGQASRKVSLQRYLEKRKDRGRLKNKKNTGLPSSSLEGYMNHQMRTH

Query:  ISNKNLGQIVTSSLSPTGVAKAFVGTADNQPKLACFPVDLN
             +    + +LS TG  +    + +NQ K     VDLN
Subjt:  ISNKNLGQIVTSSLSPTGVAKAFVGTADNQPKLACFPVDLN

Q8H185 1,2-dihydroxy-3-keto-5-methylthiopentene dioxygenase 42.5e-6179.39Show/hide
Query:  MAIEAWFMDDTNEDQRLPHHRDPKEFVSMDQLEELGVLYWKLNPKDYENDEELQKIREDRGYNYVDLLDICPEKLANYEGKLKDFYTEHIHANEEIRYCL
        MA+EAWFMDD+NEDQRLPHHR+PKE VS+D L ELGVLYWKLNP++YEND EL KIREDRGY+Y+DLLD+CPEK++NYE KLK+F+TEHIH +EEIRYCL
Subjt:  MAIEAWFMDDTNEDQRLPHHRDPKEFVSMDQLEELGVLYWKLNPKDYENDEELQKIREDRGYNYVDLLDICPEKLANYEGKLKDFYTEHIHANEEIRYCL

Query:  DGSGYFDVRDKNDRWIRIWIKLGDLIILPAG
         GSGYFDVRDK+DRWIRIW++ GDLI+LPAG
Subjt:  DGSGYFDVRDKNDRWIRIWIKLGDLIILPAG

Arabidopsis top hitse value%identityAlignment
AT4G14710.1 RmlC-like cupins superfamily protein7.4e-5361.27Show/hide
Query:  LSEQVRRARSSMAIEAWFMDDTNEDQRLPHHRDPKEFVSMDQLEELGVLYWKLNPKDYENDEELQKIREDRGYNYVDLLDICPEKLANYEGKLKDFYTEH
        + E V+  R  + I+AW+MDD+ EDQRLPHH+DPKEF+S+D+L ELGVL W+L+  +YE DE+L+KIRE RGY+Y+D  ++CPEKL NYE K+K F+ EH
Subjt:  LSEQVRRARSSMAIEAWFMDDTNEDQRLPHHRDPKEFVSMDQLEELGVLYWKLNPKDYENDEELQKIREDRGYNYVDLLDICPEKLANYEGKLKDFYTEH

Query:  IHANEEIRYCLDGSGYFDVRDKNDRWIRIWIKLGDLIILPAG
        +H +EEIRYC+ GSGYFDVRD+N+ WIR+W+K G +I+LPAG
Subjt:  IHANEEIRYCLDGSGYFDVRDKNDRWIRIWIKLGDLIILPAG

AT4G14713.1 TIFY domain/Divergent CCT motif family protein3.7e-6044.02Show/hide
Query:  MSAGAATFRSILDKPLNQLTEDDISQLTREDCRKYLKEKGMRRPSWNKSQAIQQVISLKALLEPCDDSGAGALKKVVVL-----PRVNSNQGDSPKEPSD
        M  G +  +SIL KPL  LTE+DISQLTREDCRK+LK+KGMRRPSWNKSQAIQQV+SLKAL EP DDSGAG  +K++V      PRV +   +   E   
Subjt:  MSAGAATFRSILDKPLNQLTEDDISQLTREDCRKYLKEKGMRRPSWNKSQAIQQVISLKALLEPCDDSGAGALKKVVVL-----PRVNSNQGDSPKEPSD

Query:  DAQVTMSVDESAYSNVETAKSTPED-------PPAEPDNKVASPRDLCDTNGVDGQMTIFYCGKVNVYDGVPPDKAWAIMHLAASPIHFPQNHPLGGTAA
          +V+   D  A   +++ +S              +      SPR   +T+ + GQMTIFY GKVNVYDG+PP+KA +IMH AA+PI  P+N        
Subjt:  DAQVTMSVDESAYSNVETAKSTPED-------PPAEPDNKVASPRDLCDTNGVDGQMTIFYCGKVNVYDGVPPDKAWAIMHLAASPIHFPQNHPLGGTAA

Query:  CQSPPCLLQTANERDDFFPPSATIYRNVHTEKMVEYPQQQHVKGTSTRDSDVEGQASRKVSLQRYLEKRKDRGRLKNKKNTGLPSSSLEGYMNHQMRTHI
                         F  S  I + +  EKM+E PQ+   K  S+RDS +EGQA+RKVSLQRY EKRKDR   K KK  G+ SSSLE ++N Q R   
Subjt:  CQSPPCLLQTANERDDFFPPSATIYRNVHTEKMVEYPQQQHVKGTSTRDSDVEGQASRKVSLQRYLEKRKDRGRLKNKKNTGLPSSSLEGYMNHQMRTHI

Query:  SNKNLGQIVTSSLSPTGVAKAFVGTADNQPKLACFPVDLNVKG
            +    + +L  TG +     + ++Q K     VDLN +G
Subjt:  SNKNLGQIVTSSLSPTGVAKAFVGTADNQPKLACFPVDLNVKG

AT4G14713.2 TIFY domain/Divergent CCT motif family protein6.1e-5547.43Show/hide
Query:  MSAGAATFRSILDKPLNQLTEDDISQLTREDCRKYLKEKGMRRPSWNKSQAIQQVISLKALLEPCDDSGAGALKKVVVL-----PRVNSNQGDSPKEPSD
        M  G +  +SIL KPL  LTE+DISQLTREDCRK+LK+KGMRRPSWNKSQAIQQV+SLKAL EP DDSGAG  +K++V      PRV +   +   E   
Subjt:  MSAGAATFRSILDKPLNQLTEDDISQLTREDCRKYLKEKGMRRPSWNKSQAIQQVISLKALLEPCDDSGAGALKKVVVL-----PRVNSNQGDSPKEPSD

Query:  DAQVTMSVDESAYSNVETAKSTPED-------PPAEPDNKVASPRDLCDTNGVDGQMTIFYCGKVNVYDGVPPDKAWAIMHLAASPIHFPQNHPLGGTAA
          +V+   D  A   +++ +S              +      SPR   +T+ + GQMTIFY GKVNVYDG+PP+KA +IMH AA+PI  P+N        
Subjt:  DAQVTMSVDESAYSNVETAKSTPED-------PPAEPDNKVASPRDLCDTNGVDGQMTIFYCGKVNVYDGVPPDKAWAIMHLAASPIHFPQNHPLGGTAA

Query:  CQSPPCLLQTANERDDFFPPSATIYRNVHTEKMVEYPQQQHVKGTSTRDSDVEGQASRKVSLQRYLEKRKDR
                         F  S  I + +  EKM+E PQ+   K  S+RDS +EGQA+RKVSLQRY EKRKDR
Subjt:  CQSPPCLLQTANERDDFFPPSATIYRNVHTEKMVEYPQQQHVKGTSTRDSDVEGQASRKVSLQRYLEKRKDR

AT4G14720.1 TIFY domain/Divergent CCT motif family protein4.5e-6647.51Show/hide
Query:  MSAGAATFRSILDKPLNQLTEDDISQLTREDCRKYLKEKGMRRPSWNKSQAIQQVISLKALLEPCDDSGAGALKKVVVL-----PRVNSNQGDSPKEPSD
        M  G  T +SIL+KPL  LTE+DISQLTREDCRK+LKEKGMRRPSWNKSQAIQQV+SLKAL EP DDSGAG L+K++V      PRV +   +   E   
Subjt:  MSAGAATFRSILDKPLNQLTEDDISQLTREDCRKYLKEKGMRRPSWNKSQAIQQVISLKALLEPCDDSGAGALKKVVVL-----PRVNSNQGDSPKEPSD

Query:  DAQVTMSVDESAYSNVETAKSTPEDPP-----AEPDNK---VASPRDLCDTNGVDGQMTIFYCGKVNVYDGVPPDKAWAIMHLAASPIHFPQNHPLGGTA
          ++ +  D+ A    ++ +S           A+ D+      SPR   +TN V GQMTIFY GKVNVYDGVPP+KA +IMH AA+PI  P+N       
Subjt:  DAQVTMSVDESAYSNVETAKSTPEDPP-----AEPDNK---VASPRDLCDTNGVDGQMTIFYCGKVNVYDGVPPDKAWAIMHLAASPIHFPQNHPLGGTA

Query:  ACQSPPCLLQTANERDDFFPPSATIYRNVHTEKMVEYPQQQHVKGTSTRDSDVEGQASRKVSLQRYLEKRKDRGRLKNKKNTGLPSSSLEGYMNHQMRTH
                          F  S  I + +  EKMVE PQ    K  ++RDSDVEGQA+RKVSLQRYLEKRKDR   K KK  G+ SSSLE ++N Q R  
Subjt:  ACQSPPCLLQTANERDDFFPPSATIYRNVHTEKMVEYPQQQHVKGTSTRDSDVEGQASRKVSLQRYLEKRKDRGRLKNKKNTGLPSSSLEGYMNHQMRTH

Query:  ISNKNLGQIVTSSLSPTGVAKAFVGTADNQPKLACFPVDLN
             +    + +LS TG  +    + +NQ K     VDLN
Subjt:  ISNKNLGQIVTSSLSPTGVAKAFVGTADNQPKLACFPVDLN

AT5G43850.1 RmlC-like cupins superfamily protein1.8e-6279.39Show/hide
Query:  MAIEAWFMDDTNEDQRLPHHRDPKEFVSMDQLEELGVLYWKLNPKDYENDEELQKIREDRGYNYVDLLDICPEKLANYEGKLKDFYTEHIHANEEIRYCL
        MA+EAWFMDD+NEDQRLPHHR+PKE VS+D L ELGVLYWKLNP++YEND EL KIREDRGY+Y+DLLD+CPEK++NYE KLK+F+TEHIH +EEIRYCL
Subjt:  MAIEAWFMDDTNEDQRLPHHRDPKEFVSMDQLEELGVLYWKLNPKDYENDEELQKIREDRGYNYVDLLDICPEKLANYEGKLKDFYTEHIHANEEIRYCL

Query:  DGSGYFDVRDKNDRWIRIWIKLGDLIILPAG
         GSGYFDVRDK+DRWIRIW++ GDLI+LPAG
Subjt:  DGSGYFDVRDKNDRWIRIWIKLGDLIILPAG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTGCCGGCGCGGCGACGTTCCGGTCGATACTTGACAAACCCCTCAACCAGCTGACGGAGGATGACATTTCGCAGCTCACTCGGGAGGATTGTCGGAAATACCTCAA
GGAAAAAGGAATGCGGCGGCCCTCGTGGAACAAATCTCAGGCGATCCAGCAGGTTATTTCCCTCAAAGCATTGCTCGAACCTTGTGATGATTCCGGCGCCGGTGCTCTCA
AGAAGGTCGTCGTTTTGCCTCGGGTGAATTCAAATCAAGGCGATTCACCTAAAGAACCAAGTGACGATGCTCAGGTTACGATGTCAGTTGATGAATCTGCGTATAGCAAT
GTAGAGACTGCTAAATCTACTCCAGAGGATCCACCAGCTGAACCAGACAACAAGGTCGCCAGTCCCAGAGATCTATGCGACACAAATGGAGTGGATGGGCAGATGACAAT
TTTCTATTGTGGCAAGGTGAATGTGTATGATGGAGTTCCACCAGATAAGGCATGGGCAATCATGCATCTTGCAGCAAGCCCAATTCATTTCCCTCAGAATCACCCATTGG
GTGGAACTGCTGCATGTCAGTCTCCACCATGTCTTTTGCAGACTGCCAATGAGAGAGATGACTTTTTCCCTCCCAGTGCCACTATCTATCGAAATGTGCATACAGAGAAG
ATGGTTGAGTACCCTCAGCAGCAGCATGTAAAAGGAACCAGTACTCGAGATTCTGATGTTGAGGGTCAGGCGAGTCGGAAAGTTTCATTACAGAGATATCTTGAAAAGCG
AAAAGACAGGGGAAGGTTAAAGAACAAGAAAAATACAGGATTGCCTTCTTCTAGCCTGGAGGGGTATATGAACCATCAAATGAGGACGCACATATCCAATAAGAATTTAG
GTCAGATTGTGACAAGCTCTTTATCCCCTACTGGAGTAGCAAAAGCCTTCGTTGGAACAGCTGACAATCAGCCAAAACTTGCATGTTTTCCTGTCGACCTTAATGTCAAA
GGGAGAGGTGTCTATATCCGAATGAACGGTATGAGTATCTCCTCGGCTGATCAATTTTGTAACTCGGACACGTCGGGCGTATTCTCGACGATTCAACTCAGTGAACAAGT
TCGGAGAGCTCGTTCGAGCATGGCGATCGAGGCTTGGTTTATGGACGATACTAATGAAGATCAAAGGCTTCCGCACCACCGCGACCCTAAAGAGTTTGTCTCTATGGACC
AATTGGAAGAATTGGGAGTGTTGTACTGGAAATTGAACCCTAAGGACTATGAAAACGATGAGGAATTGCAAAAAATCAGAGAAGACAGAGGATACAATTACGTGGATTTA
CTTGATATATGCCCAGAGAAACTTGCCAATTACGAAGGGAAGCTGAAGGACTTCTACACAGAGCACATTCATGCCAACGAGGAAATTCGCTACTGTTTGGATGGAAGTGG
CTACTTCGATGTCCGGGACAAGAACGACCGTTGGATTCGAATCTGGATCAAGCTCGGCGATCTTATCATCTTGCCGGCCGGAAGAGGAAGAGCGTACCTGGGATGGGAAC
TATATCAATACCGTGATGGAGCAGCCACGCTAAAGCCAAATGAGCTGTCGTACAGTGATGCTTCCCAGCCAGATCAACAAGCCGTTTCAAGCTCTCAGTCGGCAAGCTCT
CTGCAACTGCCTTTCCGCCAAAGAACCCTCGACCCAGAGGACTGTAAGCCACCATCCCAATTCCAAGCTCTCTGCAAAACACCAGGTCTTCATCCACTGAGTCTATCAAG
ATATAACTTTGTATCATCTGGCTGTCTTTCTCACCTGCAAAGTGGGATTATCTCATCTTCAATGTCACGGCTCCACAGCGAGTACTCCATCTGTAAGGCAACAAACCGAT
GTACTTTATCTTCCCCTCCTCCACCAGCTTCTTCAGCTCTCCCATCTTCACGACCAGCCATTGAAGTTTTAAAAGTTGTTTTCTTTTCTTTCTCTGAAACAGTGCGAAAA
GAAGACTCAATACAGTCGTCAATGGAAGAAAGAGCTCACCGTTTCCTCAATGGGCACAGATGGGTCGACACGATGCTGATAGTAGAGGTCGATGTGATTAACTTGAAGCC
GCTCGAGACTCGCCTCACAGCACTTCCTTACATATTTGGGTGTCCCGTTTACTGCAAATTGAAAGCCTCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGTGCCGGCGCGGCGACGTTCCGGTCGATACTTGACAAACCCCTCAACCAGCTGACGGAGGATGACATTTCGCAGCTCACTCGGGAGGATTGTCGGAAATACCTCAA
GGAAAAAGGAATGCGGCGGCCCTCGTGGAACAAATCTCAGGCGATCCAGCAGGTTATTTCCCTCAAAGCATTGCTCGAACCTTGTGATGATTCCGGCGCCGGTGCTCTCA
AGAAGGTCGTCGTTTTGCCTCGGGTGAATTCAAATCAAGGCGATTCACCTAAAGAACCAAGTGACGATGCTCAGGTTACGATGTCAGTTGATGAATCTGCGTATAGCAAT
GTAGAGACTGCTAAATCTACTCCAGAGGATCCACCAGCTGAACCAGACAACAAGGTCGCCAGTCCCAGAGATCTATGCGACACAAATGGAGTGGATGGGCAGATGACAAT
TTTCTATTGTGGCAAGGTGAATGTGTATGATGGAGTTCCACCAGATAAGGCATGGGCAATCATGCATCTTGCAGCAAGCCCAATTCATTTCCCTCAGAATCACCCATTGG
GTGGAACTGCTGCATGTCAGTCTCCACCATGTCTTTTGCAGACTGCCAATGAGAGAGATGACTTTTTCCCTCCCAGTGCCACTATCTATCGAAATGTGCATACAGAGAAG
ATGGTTGAGTACCCTCAGCAGCAGCATGTAAAAGGAACCAGTACTCGAGATTCTGATGTTGAGGGTCAGGCGAGTCGGAAAGTTTCATTACAGAGATATCTTGAAAAGCG
AAAAGACAGGGGAAGGTTAAAGAACAAGAAAAATACAGGATTGCCTTCTTCTAGCCTGGAGGGGTATATGAACCATCAAATGAGGACGCACATATCCAATAAGAATTTAG
GTCAGATTGTGACAAGCTCTTTATCCCCTACTGGAGTAGCAAAAGCCTTCGTTGGAACAGCTGACAATCAGCCAAAACTTGCATGTTTTCCTGTCGACCTTAATGTCAAA
GGGAGAGGTGTCTATATCCGAATGAACGGTATGAGTATCTCCTCGGCTGATCAATTTTGTAACTCGGACACGTCGGGCGTATTCTCGACGATTCAACTCAGTGAACAAGT
TCGGAGAGCTCGTTCGAGCATGGCGATCGAGGCTTGGTTTATGGACGATACTAATGAAGATCAAAGGCTTCCGCACCACCGCGACCCTAAAGAGTTTGTCTCTATGGACC
AATTGGAAGAATTGGGAGTGTTGTACTGGAAATTGAACCCTAAGGACTATGAAAACGATGAGGAATTGCAAAAAATCAGAGAAGACAGAGGATACAATTACGTGGATTTA
CTTGATATATGCCCAGAGAAACTTGCCAATTACGAAGGGAAGCTGAAGGACTTCTACACAGAGCACATTCATGCCAACGAGGAAATTCGCTACTGTTTGGATGGAAGTGG
CTACTTCGATGTCCGGGACAAGAACGACCGTTGGATTCGAATCTGGATCAAGCTCGGCGATCTTATCATCTTGCCGGCCGGAAGAGGAAGAGCGTACCTGGGATGGGAAC
TATATCAATACCGTGATGGAGCAGCCACGCTAAAGCCAAATGAGCTGTCGTACAGTGATGCTTCCCAGCCAGATCAACAAGCCGTTTCAAGCTCTCAGTCGGCAAGCTCT
CTGCAACTGCCTTTCCGCCAAAGAACCCTCGACCCAGAGGACTGTAAGCCACCATCCCAATTCCAAGCTCTCTGCAAAACACCAGGTCTTCATCCACTGAGTCTATCAAG
ATATAACTTTGTATCATCTGGCTGTCTTTCTCACCTGCAAAGTGGGATTATCTCATCTTCAATGTCACGGCTCCACAGCGAGTACTCCATCTGTAAGGCAACAAACCGAT
GTACTTTATCTTCCCCTCCTCCACCAGCTTCTTCAGCTCTCCCATCTTCACGACCAGCCATTGAAGTTTTAAAAGTTGTTTTCTTTTCTTTCTCTGAAACAGTGCGAAAA
GAAGACTCAATACAGTCGTCAATGGAAGAAAGAGCTCACCGTTTCCTCAATGGGCACAGATGGGTCGACACGATGCTGATAGTAGAGGTCGATGTGATTAACTTGAAGCC
GCTCGAGACTCGCCTCACAGCACTTCCTTACATATTTGGGTGTCCCGTTTACTGCAAATTGAAAGCCTCCTAA
Protein sequenceShow/hide protein sequence
MSAGAATFRSILDKPLNQLTEDDISQLTREDCRKYLKEKGMRRPSWNKSQAIQQVISLKALLEPCDDSGAGALKKVVVLPRVNSNQGDSPKEPSDDAQVTMSVDESAYSN
VETAKSTPEDPPAEPDNKVASPRDLCDTNGVDGQMTIFYCGKVNVYDGVPPDKAWAIMHLAASPIHFPQNHPLGGTAACQSPPCLLQTANERDDFFPPSATIYRNVHTEK
MVEYPQQQHVKGTSTRDSDVEGQASRKVSLQRYLEKRKDRGRLKNKKNTGLPSSSLEGYMNHQMRTHISNKNLGQIVTSSLSPTGVAKAFVGTADNQPKLACFPVDLNVK
GRGVYIRMNGMSISSADQFCNSDTSGVFSTIQLSEQVRRARSSMAIEAWFMDDTNEDQRLPHHRDPKEFVSMDQLEELGVLYWKLNPKDYENDEELQKIREDRGYNYVDL
LDICPEKLANYEGKLKDFYTEHIHANEEIRYCLDGSGYFDVRDKNDRWIRIWIKLGDLIILPAGRGRAYLGWELYQYRDGAATLKPNELSYSDASQPDQQAVSSSQSASS
LQLPFRQRTLDPEDCKPPSQFQALCKTPGLHPLSLSRYNFVSSGCLSHLQSGIISSSMSRLHSEYSICKATNRCTLSSPPPPASSALPSSRPAIEVLKVVFFSFSETVRK
EDSIQSSMEERAHRFLNGHRWVDTMLIVEVDVINLKPLETRLTALPYIFGCPVYCKLKAS