; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh06G013840 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh06G013840
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description1,2-dihydroxy-3-keto-5-methylthiopentene dioxygenase
Genome locationCmo_Chr06:10166157..10173534
RNA-Seq ExpressionCmoCh06G013840
SyntenyCmoCh06G013840
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0019509 - L-methionine salvage from methylthioadenosine (biological process)
GO:0034605 - cellular response to heat (biological process)
GO:0005634 - nucleus (cellular component)
GO:0046872 - metal ion binding (molecular function)
GO:0000978 - RNA polymerase II proximal promoter sequence-specific DNA binding (molecular function)
GO:0010309 - acireductone dioxygenase [iron(II)-requiring] activity (molecular function)
GO:0008168 - methyltransferase activity (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domainsIPR036390 - Winged helix DNA-binding domain superfamily
IPR036388 - Winged helix-like DNA-binding domain superfamily
IPR027725 - Heat shock transcription factor family
IPR027496 - 1,2-dihydroxy-3-keto-5-methylthiopentene dioxygenase, eukaryotes
IPR000232 - Heat shock factor (HSF)-type, DNA-binding
IPR004313 - Acireductone dioxygenase ARD family
IPR011051 - RmlC-like cupin domain superfamily
IPR014710 - RmlC-like jelly roll fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6597418.1 Heat stress transcription factor A-6b, partial [Cucurbita argyrosperma subsp. sororia]2.4e-18694.94Show/hide
Query:  MNRLRRVKEEFPASNSAFSNGAPSPTVAPQPMEGLHEAGPPPFLTKTYEFVEDSTTNHIVSWSRDNNSFIVWDPQSFSLTLLPKYFKHSNFSSFVRQLNT
        MNRLRRVKEEFPASNSAFSNGAPSPTVAPQPMEGLHEAGPPPFLTKTYEFVEDSTTNHIVSWSRDNNSFIVWDPQSFSLTLLPKYFKHSNFSSFVRQLNT
Subjt:  MNRLRRVKEEFPASNSAFSNGAPSPTVAPQPMEGLHEAGPPPFLTKTYEFVEDSTTNHIVSWSRDNNSFIVWDPQSFSLTLLPKYFKHSNFSSFVRQLNT

Query:  Y------------GFRKIDPDKWEFAHEGFLRGQKHLLKLIRRRKTVQPNATLQAPDPCVEVGRFGLDGEIDRLQRDKQVLMGELVKLRQQQQTTKGQLQ
        Y            GFRKIDPDKWEFAHEGFLRGQKHLLKLIRRRKTVQPNATLQAPDPCVEVGRFGLDGEIDRLQRDKQVLMGELVKLRQQQQTTKGQLQ
Subjt:  Y------------GFRKIDPDKWEFAHEGFLRGQKHLLKLIRRRKTVQPNATLQAPDPCVEVGRFGLDGEIDRLQRDKQVLMGELVKLRQQQQTTKGQLQ

Query:  TMERRLKKTETRQQLMMNFLARAIQFPDFIQQLIHQKDKHKELEEAINRKRRRHIDQGLSPTSDEYDEEDKALFSNDIHVDVELLAVEMNQNNQHFAKEE
        TMERRLKKTETRQQLMMNFLARAIQFPDFIQQLIHQKDKHKELEEAINRKRRRHIDQGLSPTSDEYD+EDKA FSNDIHVDVELLAVEM+QNNQHFAKEE
Subjt:  TMERRLKKTETRQQLMMNFLARAIQFPDFIQQLIHQKDKHKELEEAINRKRRRHIDQGLSPTSDEYDEEDKALFSNDIHVDVELLAVEMNQNNQHFAKEE

Query:  MGEKGDEVMDDGFWENLLNEANEEGFGVHGFDEQDDEVEDGDELVAHFGFLNYNLK
        MGEKGDEVMDDGFWENLLNEANEEGFGVHGFDEQDD  EDGDELVAHFGFLN NLK
Subjt:  MGEKGDEVMDDGFWENLLNEANEEGFGVHGFDEQDDEVEDGDELVAHFGFLNYNLK

KAG7028877.1 Heat stress transcription factor A-6b, partial [Cucurbita argyrosperma subsp. argyrosperma]2.8e-18798.81Show/hide
Query:  MNRLRRVKEEFPASNSAFSNGAPSPTVAPQPMEGLHEAGPPPFLTKTYEFVEDSTTNHIVSWSRDNNSFIVWDPQSFSLTLLPKYFKHSNFSSFVRQLNT
        MNRLRRVKEEFPASNSAFSNGAPSPTVAPQPMEGLHEAGPPPFLTKTYEFVEDSTTNHIVSWSRDNNSFIVWDPQSFSLTLLPKYFKHSNFSSFVRQLNT
Subjt:  MNRLRRVKEEFPASNSAFSNGAPSPTVAPQPMEGLHEAGPPPFLTKTYEFVEDSTTNHIVSWSRDNNSFIVWDPQSFSLTLLPKYFKHSNFSSFVRQLNT

Query:  YGFRKIDPDKWEFAHEGFLRGQKHLLKLIRRRKTVQPNATLQAPDPCVEVGRFGLDGEIDRLQRDKQVLMGELVKLRQQQQTTKGQLQTMERRLKKTETR
        YGFRKIDPDKWEFAHEGFLRGQKHLLKLIRRRKTVQPNATLQAPDPCVEVGRFGLDGEIDRLQRDKQVLMGELVKLRQQQQTTKGQLQTMERRLKKTETR
Subjt:  YGFRKIDPDKWEFAHEGFLRGQKHLLKLIRRRKTVQPNATLQAPDPCVEVGRFGLDGEIDRLQRDKQVLMGELVKLRQQQQTTKGQLQTMERRLKKTETR

Query:  QQLMMNFLARAIQFPDFIQQLIHQKDKHKELEEAINRKRRRHIDQGLSPTSDEYDEEDKALFSNDIHVDVELLAVEMNQNNQHFAKEEMGEKGDEVMDDG
        QQLMMNFLARAIQFPDFIQQLIHQKDKHKELEEAINRKRRRHIDQGLSPTSDEYD+EDKA FSNDIHVDVELLA+EM+QNNQHFAKEEMGEKGDEVMDDG
Subjt:  QQLMMNFLARAIQFPDFIQQLIHQKDKHKELEEAINRKRRRHIDQGLSPTSDEYDEEDKALFSNDIHVDVELLAVEMNQNNQHFAKEEMGEKGDEVMDDG

Query:  FWENLLNEANEEGFGVHGFDEQDDEVEDGDELVAHF
        FWENLLNEANEEGFGVHGFDEQDDEVEDGDELVAHF
Subjt:  FWENLLNEANEEGFGVHGFDEQDDEVEDGDELVAHF

XP_022921481.1 heat stress transcription factor A-6b-like [Cucurbita moschata]2.4e-194100Show/hide
Query:  MNRLRRVKEEFPASNSAFSNGAPSPTVAPQPMEGLHEAGPPPFLTKTYEFVEDSTTNHIVSWSRDNNSFIVWDPQSFSLTLLPKYFKHSNFSSFVRQLNT
        MNRLRRVKEEFPASNSAFSNGAPSPTVAPQPMEGLHEAGPPPFLTKTYEFVEDSTTNHIVSWSRDNNSFIVWDPQSFSLTLLPKYFKHSNFSSFVRQLNT
Subjt:  MNRLRRVKEEFPASNSAFSNGAPSPTVAPQPMEGLHEAGPPPFLTKTYEFVEDSTTNHIVSWSRDNNSFIVWDPQSFSLTLLPKYFKHSNFSSFVRQLNT

Query:  YGFRKIDPDKWEFAHEGFLRGQKHLLKLIRRRKTVQPNATLQAPDPCVEVGRFGLDGEIDRLQRDKQVLMGELVKLRQQQQTTKGQLQTMERRLKKTETR
        YGFRKIDPDKWEFAHEGFLRGQKHLLKLIRRRKTVQPNATLQAPDPCVEVGRFGLDGEIDRLQRDKQVLMGELVKLRQQQQTTKGQLQTMERRLKKTETR
Subjt:  YGFRKIDPDKWEFAHEGFLRGQKHLLKLIRRRKTVQPNATLQAPDPCVEVGRFGLDGEIDRLQRDKQVLMGELVKLRQQQQTTKGQLQTMERRLKKTETR

Query:  QQLMMNFLARAIQFPDFIQQLIHQKDKHKELEEAINRKRRRHIDQGLSPTSDEYDEEDKALFSNDIHVDVELLAVEMNQNNQHFAKEEMGEKGDEVMDDG
        QQLMMNFLARAIQFPDFIQQLIHQKDKHKELEEAINRKRRRHIDQGLSPTSDEYDEEDKALFSNDIHVDVELLAVEMNQNNQHFAKEEMGEKGDEVMDDG
Subjt:  QQLMMNFLARAIQFPDFIQQLIHQKDKHKELEEAINRKRRRHIDQGLSPTSDEYDEEDKALFSNDIHVDVELLAVEMNQNNQHFAKEEMGEKGDEVMDDG

Query:  FWENLLNEANEEGFGVHGFDEQDDEVEDGDELVAHFGFLNYNLK
        FWENLLNEANEEGFGVHGFDEQDDEVEDGDELVAHFGFLNYNLK
Subjt:  FWENLLNEANEEGFGVHGFDEQDDEVEDGDELVAHFGFLNYNLK

XP_022974463.1 heat stress transcription factor A-6b-like [Cucurbita maxima]3.9e-18997.38Show/hide
Query:  MNRLRRVKEEFPASNSAFSNGAPSPTVAPQPMEGLHEAGPPPFLTKTYEFVEDSTTNHIVSWSRDNNSFIVWDPQSFSLTLLPKYFKHSNFSSFVRQLNT
        MNRLRRVKEEFPASNSAFSNGAPSPTVAPQPMEGLHEAGPPPFLTKTYEFVEDSTTNHIVSWSRDNNSFIVWDPQSFSLTLLPKYFKHSNFSSFVRQLNT
Subjt:  MNRLRRVKEEFPASNSAFSNGAPSPTVAPQPMEGLHEAGPPPFLTKTYEFVEDSTTNHIVSWSRDNNSFIVWDPQSFSLTLLPKYFKHSNFSSFVRQLNT

Query:  YGFRKIDPDKWEFAHEGFLRGQKHLLKLIRRRKTVQPNATLQAPDPCVEVGRFGLDGEIDRLQRDKQVLMGELVKLRQQQQTTKGQLQTMERRLKKTETR
        YGFRKIDPDKWEFAHEGFLRGQKHLLKLIRRRKTVQPNATLQAPDPCVEV RFGLDGEIDRLQRDKQVLMGELVKLRQQQQTTKGQLQTMERRLKKTETR
Subjt:  YGFRKIDPDKWEFAHEGFLRGQKHLLKLIRRRKTVQPNATLQAPDPCVEVGRFGLDGEIDRLQRDKQVLMGELVKLRQQQQTTKGQLQTMERRLKKTETR

Query:  QQLMMNFLARAIQFPDFIQQLIHQKDKHKELEEAINRKRRRHIDQGLSPTSDEYDEEDKALFSNDIHVDVELLAVEMNQNNQHFAKEEMGEKGDEVMDDG
        QQLMMNFLARAIQFPDFIQQLIHQKDKHKELEE INRKRRRHIDQGLSPTSDEYDEED+A F+NDIHVDVELLAVEMNQNNQHFAKEEMGEKGDE MDDG
Subjt:  QQLMMNFLARAIQFPDFIQQLIHQKDKHKELEEAINRKRRRHIDQGLSPTSDEYDEEDKALFSNDIHVDVELLAVEMNQNNQHFAKEEMGEKGDEVMDDG

Query:  FWENLLNEANEEGFGVHGFDEQDDEVEDGDELVAHFGFLNYNLK
        FWENL+NEANEEGFGVHGF+EQDDEVEDGDELVAHFGFLN NLK
Subjt:  FWENLLNEANEEGFGVHGFDEQDDEVEDGDELVAHFGFLNYNLK

XP_023540223.1 heat stress transcription factor A-6b-like [Cucurbita pepo subsp. pepo]2.3e-18997.38Show/hide
Query:  MNRLRRVKEEFPASNSAFSNGAPSPTVAPQPMEGLHEAGPPPFLTKTYEFVEDSTTNHIVSWSRDNNSFIVWDPQSFSLTLLPKYFKHSNFSSFVRQLNT
        MNRLRRVKEEFPASNSAFSNGAPSPTVAPQPMEGLHEAGPPPFLTKTYEFVEDSTTNHIVSWSRDNNSFIVWDPQSFSLTLLPKYFKHSNFSSFVRQLNT
Subjt:  MNRLRRVKEEFPASNSAFSNGAPSPTVAPQPMEGLHEAGPPPFLTKTYEFVEDSTTNHIVSWSRDNNSFIVWDPQSFSLTLLPKYFKHSNFSSFVRQLNT

Query:  YGFRKIDPDKWEFAHEGFLRGQKHLLKLIRRRKTVQPNATLQAPDPCVEVGRFGLDGEIDRLQRDKQVLMGELVKLRQQQQTTKGQLQTMERRLKKTETR
        YGF+KIDPDKWEFAHEGFLRGQKHLLKLIRRRKTVQPNATLQAPDPCVEVGRFGLDGEIDRLQRDKQVLMGELVKLRQQQQTTKGQLQTMERRLKKTETR
Subjt:  YGFRKIDPDKWEFAHEGFLRGQKHLLKLIRRRKTVQPNATLQAPDPCVEVGRFGLDGEIDRLQRDKQVLMGELVKLRQQQQTTKGQLQTMERRLKKTETR

Query:  QQLMMNFLARAIQFPDFIQQLIHQKDKHKELEEAINRKRRRHIDQGLSPTSDEYDEEDKALFSNDIHVDVELLAVEMNQNNQHFAKEEMGEKGDEVMDDG
        QQLMMNFLARAIQFPDFIQQLIHQKDKHKELEEAINRKRRRHIDQGLSPTSDEYDEEDKA F+N+IHVDVELLAV+MNQNN HFAKEEMGEKGD +MDDG
Subjt:  QQLMMNFLARAIQFPDFIQQLIHQKDKHKELEEAINRKRRRHIDQGLSPTSDEYDEEDKALFSNDIHVDVELLAVEMNQNNQHFAKEEMGEKGDEVMDDG

Query:  FWENLLNEANEEGFGVHGFDEQDDEVEDGDELVAHFGFLNYNLK
        FWENLLNEANEEGFGVHGFDEQDDEVEDGDELVAHFGFLN NLK
Subjt:  FWENLLNEANEEGFGVHGFDEQDDEVEDGDELVAHFGFLNYNLK

TrEMBL top hitse value%identityAlignment
A0A0D9VPV2 1,2-dihydroxy-3-keto-5-methylthiopentene dioxygenase1.5e-15753.56Show/hide
Query:  KDAREEVIQAWYMDDSNEDQRLPHHLEPKQYLSLQQLDELGVLSWRLDADKYETDEELKKIRRDRNYSYMDFCEVCPEKLPNYEEKIKNFYEEHLHTDEE
        +D +EEVI+AWYMDDS EDQRLPHH EPK+++ L +L ELG+LSWRL+AD +E DE LKKIR  R YSYMD C+VCPEKLPNYE KIKNF+EEHLHTDEE
Subjt:  KDAREEVIQAWYMDDSNEDQRLPHHLEPKQYLSLQQLDELGVLSWRLDADKYETDEELKKIRRDRNYSYMDFCEVCPEKLPNYEEKIKNFYEEHLHTDEE

Query:  IRYCVAGSGYFDVRDLNDKWIRIWVKKGAMIVLPAGIYHRFTLDSDNYIKAMRLFVGDPVWTPHNRPNDHLPASL-----NSARASEVGSGLTCRTRYYL
        IRYC+ GSGYFDVRD ND+WIR+ VKKG MIVLPAG+YHRFTLDSDNYIKAMRLFVG+PVWTP+NRP+DHLPA L          SE  +     T    
Subjt:  IRYCVAGSGYFDVRDLNDKWIRIWVKKGAMIVLPAGIYHRFTLDSDNYIKAMRLFVGDPVWTPHNRPNDHLPASL-----NSARASEVGSGLTCRTRYYL

Query:  YGPIMNRLRRVKEEFPASNSAFSNGAPSPTVAPQPMEGLHEAGPPPFLTKTYEFVEDSTTNHIVSWSRDNNSFIVWDPQSFSLTLLPKYFKHSNFSSFVR
           I+  +  VKEE+                AP+PMEGLHE G PPFLTKT++ V D  T+ +VSW R  NSF+VWDP  F+  LLP++FKH+NFSSFVR
Subjt:  YGPIMNRLRRVKEEFPASNSAFSNGAPSPTVAPQPMEGLHEAGPPPFLTKTYEFVEDSTTNHIVSWSRDNNSFIVWDPQSFSLTLLPKYFKHSNFSSFVR

Query:  QLNTYGFRKIDPDKWEFAHEGFLRGQKHLLKLIRRRKTVQPNATLQAPDPCVEVGRFGLDGEIDRLQRDKQVLMGELVKLRQQQQTTKGQLQTMERRLKK
        QLNTYGFRKIDPD+WEFA++GFLRGQ+HLLK+I+RR+ +      QA   C+EVG+FG+D EIDRL+RDK +L+ E+VKLR +QQ+TK  ++ ME RL+ 
Subjt:  QLNTYGFRKIDPDKWEFAHEGFLRGQKHLLKLIRRRKTVQPNATLQAPDPCVEVGRFGLDGEIDRLQRDKQVLMGELVKLRQQQQTTKGQLQTMERRLKK

Query:  TETRQQLMMNFLARAIQFPDFIQQLIHQKDKHKELEEAINRKRRRHID--QGLSPTSDEYDEEDKALFSNDIHVDVELLAVEMNQNNQHFAKEEMG-EKG
         E +Q  MM FLARA+Q PDF  QLI  +DK K LE+  ++KR R ID    L P      ++ ++ F  D     EL         ++ A    G  KG
Subjt:  TETRQQLMMNFLARAIQFPDFIQQLIHQKDKHKELEEAINRKRRRHID--QGLSPTSDEYDEEDKALFSNDIHVDVELLAVEMNQNNQHFAKEEMG-EKG

Query:  -------------DEVMDDGFWENLLNEANEEGFGVHGFDEQDDEVEDGDELVAHFGFLNYN
                     +  + D FWE LLNE   +  G    + +     D   L    G+L+ N
Subjt:  -------------DEVMDDGFWENLLNEANEEGFGVHGFDEQDDEVEDGDELVAHFGFLNYN

A0A498I5L8 1,2-dihydroxy-3-keto-5-methylthiopentene dioxygenase2.4e-17659.07Show/hide
Query:  DAREEVIQAWYMDDSNEDQRLPHHLEPKQYLSLQQLDELGVLSWRLDADKYETDEELKKIRRDRNYSYMDFCEVCPEKLPNYEEKIKNFYEEHLHTDEEI
        D REEVIQAWYMDDS+EDQRLPHH EPK+++SL QL ELGVLSW LDAD YETDEELKKIR +R YSYMDFCEVCPEKLPNYEEKIKNF+EEHLHTDEEI
Subjt:  DAREEVIQAWYMDDSNEDQRLPHHLEPKQYLSLQQLDELGVLSWRLDADKYETDEELKKIRRDRNYSYMDFCEVCPEKLPNYEEKIKNFYEEHLHTDEEI

Query:  RYCVAGSGYFDVRDLNDKWIRIWVKKGAMIVLPAGIYHRFTLDSDNYIKAMRLFVGDPVWTPHNRPNDHLPASLNSARASEVGSGLTCRTRYYLYGPIMN
        RY V GSGYFDVRD +++WIR+W+KKG MIVLPAGIYHRFTLD++NYIKAMRLF+GDPVWTP NRP+DHLPA     +A                     
Subjt:  RYCVAGSGYFDVRDLNDKWIRIWVKKGAMIVLPAGIYHRFTLDSDNYIKAMRLFVGDPVWTPHNRPNDHLPASLNSARASEVGSGLTCRTRYYLYGPIMN

Query:  RLRRVKEEFPASNSAFSNGAPSPTVAPQPMEGLHEAGPPPFLTKTYEFVEDSTTNHIVSWSRDNNSFIVWDPQSFSLTLLPKYFKHSNFSSFVRQLNTYG
         + +    F +S++   +        P P+E L++ GPPPFLTKTY+ V+D +TNHIVSWSRDNNSF+V DP SF+++LLP+YFKH+NFSSFVRQLNTYG
Subjt:  RLRRVKEEFPASNSAFSNGAPSPTVAPQPMEGLHEAGPPPFLTKTYEFVEDSTTNHIVSWSRDNNSFIVWDPQSFSLTLLPKYFKHSNFSSFVRQLNTYG

Query:  FRKIDPDKWEFAHEGFLRGQKHLLKLIRRRKTVQ-PNATLQAPDPCVEVGRFGLDGEIDRLQRDKQVLMGELVKLRQQQQTTKGQLQTMERRLKKTETRQ
        FRK+D D+WEFA+E FLRG+KHLLK IRRRK    P ++ +  D CVEVGRFGLD EID+L+RDKQVLMGELVKLRQQQQ T+  +Q ME RLK+TE +Q
Subjt:  FRKIDPDKWEFAHEGFLRGQKHLLKLIRRRKTVQ-PNATLQAPDPCVEVGRFGLDGEIDRLQRDKQVLMGELVKLRQQQQTTKGQLQTMERRLKKTETRQ

Query:  QLMMNFLARAIQFPDFIQQLIHQKDKHKELEEAINRKRRRHIDQGLSPTSDEYDE--------------EDKALFSNDIHV------DVELLAVEMNQN-
        Q M++FLA+A+Q P+F+QQL  QKD  KELEE I +KRRR I+QG  P++ E DE              ++    SN   V       +++L V   QN 
Subjt:  QLMMNFLARAIQFPDFIQQLIHQKDKHKELEEAINRKRRRHIDQGLSPTSDEYDE--------------EDKALFSNDIHV------DVELLAVEMNQN-

Query:  ---NQHFAKEEMGEKGDEVMDDGFWENLLNEANEEGFGVHGFDEQDDEVEDGDELVAHFGFL
            +   KEE  E   + +D GFW++L NE+ +E  G+ G  E+D+  ED D L+   GFL
Subjt:  ---NQHFAKEEMGEKGDEVMDDGFWENLLNEANEEGFGVHGFDEQDDEVEDGDELVAHFGFL

A0A5J5BZY0 1,2-dihydroxy-3-keto-5-methylthiopentene dioxygenase2.9e-16657.69Show/hide
Query:  KDAREEVIQAWYMDDSNEDQRLPHHLEPKQYLSLQQLDELGVLSWRLDADKYETDEELKKIRRDRNYSYMDFCEVCPEKLPNYEEKIKNFYEEHLHTDEE
        KD REEVIQAWYMDDS+ DQRLPHH EPK+++SL QL ELGVLSWRLDAD YETDEELKKIR  R YSY+D C VCPE LPNYEEKIK+ + EHLHTDEE
Subjt:  KDAREEVIQAWYMDDSNEDQRLPHHLEPKQYLSLQQLDELGVLSWRLDADKYETDEELKKIRRDRNYSYMDFCEVCPEKLPNYEEKIKNFYEEHLHTDEE

Query:  IRYCVAGSGYFDVRDLNDKWIRIWVKKGAMIVLPAGIYHRFTLDSDNYIKAMRLFVGDPVWTPHNRPNDHLPASLNSARASEVGSGLTCRTRYYLYGPIM
        IR+CVAGSGYFDVRD ND WIR+W+KKG MIVLPAGIYHRFTLD++NYIK MRLFVGDP+WTP+ RP+DHLPA      A     G             +
Subjt:  IRYCVAGSGYFDVRDLNDKWIRIWVKKGAMIVLPAGIYHRFTLDSDNYIKAMRLFVGDPVWTPHNRPNDHLPASLNSARASEVGSGLTCRTRYYLYGPIM

Query:  NRLRRVKEEFPASNSAFSNGAPSPTVAPQPMEGLHEAGPPPFLTKTYEFVEDSTTNHIVSWSRDNNSFIVWDPQSFSLTLLPKYFKHSNFSSFVRQLNTY
        +    VKEEFP  + A+      P   P+PMEGL++ GPPPFLTKTY+ V+D  T+ I+SW R +NSF+VWDPQ+F++ LLP+YFKH+NFSSFVRQLNTY
Subjt:  NRLRRVKEEFPASNSAFSNGAPSPTVAPQPMEGLHEAGPPPFLTKTYEFVEDSTTNHIVSWSRDNNSFIVWDPQSFSLTLLPKYFKHSNFSSFVRQLNTY

Query:  GFRKIDPDKWEFAHEGFLRGQKHLLKLIRRRK-TVQPNATLQAPDPCVEVGRFGLDGEIDRLQRDKQVLMGELVKLRQQQQTTKGQLQTMERRLKKTETR
        GFRK+DPD+WEFA+E F+RGQKHLLK IRRRK +     + Q  DPCVEVG FGLD E+DRL+RDKQ               TKG           TE +
Subjt:  GFRKIDPDKWEFAHEGFLRGQKHLLKLIRRRK-TVQPNATLQAPDPCVEVGRFGLDGEIDRLQRDKQVLMGELVKLRQQQQTTKGQLQTMERRLKKTETR

Query:  QQLMMNFLARAIQFPDFIQQLIHQKDKHKELEEAINRKRRRHIDQGLSPTS----DEYDEE---------DKALFSNDIHVDVELLAVEM--------NQ
        QQ MM FLARAIQ P FIQQL+ QK++ KELEEAI++KRRR I+QG         D+  EE         D    S    V++E LA +M        N 
Subjt:  QQLMMNFLARAIQFPDFIQQLIHQKDKHKELEEAINRKRRRHIDQGLSPTS----DEYDEE---------DKALFSNDIHVDVELLAVEM--------NQ

Query:  NNQHFAKEEMGEKGDEVMDDGFWENLLNEANEEGFG-VHGFDEQDD
          ++  KEE  + GD+ +D GFWE+LLNE  EE  G + G  E++D
Subjt:  NNQHFAKEEMGEKGDEVMDDGFWENLLNEANEEGFG-VHGFDEQDD

A0A6J1E1I0 heat stress transcription factor A-6b-like1.1e-194100Show/hide
Query:  MNRLRRVKEEFPASNSAFSNGAPSPTVAPQPMEGLHEAGPPPFLTKTYEFVEDSTTNHIVSWSRDNNSFIVWDPQSFSLTLLPKYFKHSNFSSFVRQLNT
        MNRLRRVKEEFPASNSAFSNGAPSPTVAPQPMEGLHEAGPPPFLTKTYEFVEDSTTNHIVSWSRDNNSFIVWDPQSFSLTLLPKYFKHSNFSSFVRQLNT
Subjt:  MNRLRRVKEEFPASNSAFSNGAPSPTVAPQPMEGLHEAGPPPFLTKTYEFVEDSTTNHIVSWSRDNNSFIVWDPQSFSLTLLPKYFKHSNFSSFVRQLNT

Query:  YGFRKIDPDKWEFAHEGFLRGQKHLLKLIRRRKTVQPNATLQAPDPCVEVGRFGLDGEIDRLQRDKQVLMGELVKLRQQQQTTKGQLQTMERRLKKTETR
        YGFRKIDPDKWEFAHEGFLRGQKHLLKLIRRRKTVQPNATLQAPDPCVEVGRFGLDGEIDRLQRDKQVLMGELVKLRQQQQTTKGQLQTMERRLKKTETR
Subjt:  YGFRKIDPDKWEFAHEGFLRGQKHLLKLIRRRKTVQPNATLQAPDPCVEVGRFGLDGEIDRLQRDKQVLMGELVKLRQQQQTTKGQLQTMERRLKKTETR

Query:  QQLMMNFLARAIQFPDFIQQLIHQKDKHKELEEAINRKRRRHIDQGLSPTSDEYDEEDKALFSNDIHVDVELLAVEMNQNNQHFAKEEMGEKGDEVMDDG
        QQLMMNFLARAIQFPDFIQQLIHQKDKHKELEEAINRKRRRHIDQGLSPTSDEYDEEDKALFSNDIHVDVELLAVEMNQNNQHFAKEEMGEKGDEVMDDG
Subjt:  QQLMMNFLARAIQFPDFIQQLIHQKDKHKELEEAINRKRRRHIDQGLSPTSDEYDEEDKALFSNDIHVDVELLAVEMNQNNQHFAKEEMGEKGDEVMDDG

Query:  FWENLLNEANEEGFGVHGFDEQDDEVEDGDELVAHFGFLNYNLK
        FWENLLNEANEEGFGVHGFDEQDDEVEDGDELVAHFGFLNYNLK
Subjt:  FWENLLNEANEEGFGVHGFDEQDDEVEDGDELVAHFGFLNYNLK

A0A6J1IGA5 heat stress transcription factor A-6b-like1.9e-18997.38Show/hide
Query:  MNRLRRVKEEFPASNSAFSNGAPSPTVAPQPMEGLHEAGPPPFLTKTYEFVEDSTTNHIVSWSRDNNSFIVWDPQSFSLTLLPKYFKHSNFSSFVRQLNT
        MNRLRRVKEEFPASNSAFSNGAPSPTVAPQPMEGLHEAGPPPFLTKTYEFVEDSTTNHIVSWSRDNNSFIVWDPQSFSLTLLPKYFKHSNFSSFVRQLNT
Subjt:  MNRLRRVKEEFPASNSAFSNGAPSPTVAPQPMEGLHEAGPPPFLTKTYEFVEDSTTNHIVSWSRDNNSFIVWDPQSFSLTLLPKYFKHSNFSSFVRQLNT

Query:  YGFRKIDPDKWEFAHEGFLRGQKHLLKLIRRRKTVQPNATLQAPDPCVEVGRFGLDGEIDRLQRDKQVLMGELVKLRQQQQTTKGQLQTMERRLKKTETR
        YGFRKIDPDKWEFAHEGFLRGQKHLLKLIRRRKTVQPNATLQAPDPCVEV RFGLDGEIDRLQRDKQVLMGELVKLRQQQQTTKGQLQTMERRLKKTETR
Subjt:  YGFRKIDPDKWEFAHEGFLRGQKHLLKLIRRRKTVQPNATLQAPDPCVEVGRFGLDGEIDRLQRDKQVLMGELVKLRQQQQTTKGQLQTMERRLKKTETR

Query:  QQLMMNFLARAIQFPDFIQQLIHQKDKHKELEEAINRKRRRHIDQGLSPTSDEYDEEDKALFSNDIHVDVELLAVEMNQNNQHFAKEEMGEKGDEVMDDG
        QQLMMNFLARAIQFPDFIQQLIHQKDKHKELEE INRKRRRHIDQGLSPTSDEYDEED+A F+NDIHVDVELLAVEMNQNNQHFAKEEMGEKGDE MDDG
Subjt:  QQLMMNFLARAIQFPDFIQQLIHQKDKHKELEEAINRKRRRHIDQGLSPTSDEYDEEDKALFSNDIHVDVELLAVEMNQNNQHFAKEEMGEKGDEVMDDG

Query:  FWENLLNEANEEGFGVHGFDEQDDEVEDGDELVAHFGFLNYNLK
        FWENL+NEANEEGFGVHGF+EQDDEVEDGDELVAHFGFLN NLK
Subjt:  FWENLLNEANEEGFGVHGFDEQDDEVEDGDELVAHFGFLNYNLK

SwissProt top hitse value%identityAlignment
A2XCT8 1,2-dihydroxy-3-keto-5-methylthiopentene dioxygenase 25.6e-8278.03Show/hide
Query:  KDAREEVIQAWYMDDSNEDQRLPHHLEPKQYLSLQQLDELGVLSWRLDADKYETDEELKKIRRDRNYSYMDFCEVCPEKLPNYEEKIKNFYEEHLHTDEE
        +D +EEVI+AWYMDDS EDQRLPHH EPK+++ L +L ELG+LSWRL+AD +E DE LKKIR  R YSYMD C+VCPEKLPNYE K+KNF+EEHLHTDEE
Subjt:  KDAREEVIQAWYMDDSNEDQRLPHHLEPKQYLSLQQLDELGVLSWRLDADKYETDEELKKIRRDRNYSYMDFCEVCPEKLPNYEEKIKNFYEEHLHTDEE

Query:  IRYCVAGSGYFDVRDLNDKWIRIWVKKGAMIVLPAGIYHRFTLDSDNYIKAMRLFVGDPVWTPHNRPNDHLPA
        IRYC+ GSGYFDVRD ND+WIR+ VKKG MIVLPAG+YHRFTLDSDNYIKAMRLFVG+PVWTP+NRP+DHLPA
Subjt:  IRYCVAGSGYFDVRDLNDKWIRIWVKKGAMIVLPAGIYHRFTLDSDNYIKAMRLFVGDPVWTPHNRPNDHLPA

F6HDT7 1,2-dihydroxy-3-keto-5-methylthiopentene dioxygenase 22.4e-8885.55Show/hide
Query:  KDAREEVIQAWYMDDSNEDQRLPHHLEPKQYLSLQQLDELGVLSWRLDADKYETDEELKKIRRDRNYSYMDFCEVCPEKLPNYEEKIKNFYEEHLHTDEE
        KD REEV+QAWYMDDS+EDQRLPHH +PK+++SL QL +LGVLSWRLDAD YETDEELKKIR  R YSYMDFCEVCPEKLPNYEEKIKNF+EEHLHTDEE
Subjt:  KDAREEVIQAWYMDDSNEDQRLPHHLEPKQYLSLQQLDELGVLSWRLDADKYETDEELKKIRRDRNYSYMDFCEVCPEKLPNYEEKIKNFYEEHLHTDEE

Query:  IRYCVAGSGYFDVRDLNDKWIRIWVKKGAMIVLPAGIYHRFTLDSDNYIKAMRLFVGDPVWTPHNRPNDHLPA
        IRYCVAGSGYFDVRD ND WIR+W+KKG MIVLPAGIYHRFTLDS+NYIKAMRLFVGDPVWTP NRP+D+LPA
Subjt:  IRYCVAGSGYFDVRDLNDKWIRIWVKKGAMIVLPAGIYHRFTLDSDNYIKAMRLFVGDPVWTPHNRPNDHLPA

Q8GXE2 1,2-dihydroxy-3-keto-5-methylthiopentene dioxygenase 26.4e-8682.66Show/hide
Query:  KDAREEVIQAWYMDDSNEDQRLPHHLEPKQYLSLQQLDELGVLSWRLDADKYETDEELKKIRRDRNYSYMDFCEVCPEKLPNYEEKIKNFYEEHLHTDEE
        KD REEVIQAWYMDDS EDQRLPHH +PK+++SL +L ELGVLSWRLDAD YETDE+LKKIR  R YSYMDFCEVCPEKLPNYE K+K+F+EEHLHTDEE
Subjt:  KDAREEVIQAWYMDDSNEDQRLPHHLEPKQYLSLQQLDELGVLSWRLDADKYETDEELKKIRRDRNYSYMDFCEVCPEKLPNYEEKIKNFYEEHLHTDEE

Query:  IRYCVAGSGYFDVRDLNDKWIRIWVKKGAMIVLPAGIYHRFTLDSDNYIKAMRLFVGDPVWTPHNRPNDHLPA
        IRYCVAG+GYFDVRD N+ WIR+ VKKG MIVLPAGIYHRFT+DSDNYIKAMRLFVG+PVWTP+NRP+DHLPA
Subjt:  IRYCVAGSGYFDVRDLNDKWIRIWVKKGAMIVLPAGIYHRFTLDSDNYIKAMRLFVGDPVWTPHNRPNDHLPA

Q8W108 1,2-dihydroxy-3-keto-5-methylthiopentene dioxygenase 34.0e-8884.39Show/hide
Query:  KDAREEVIQAWYMDDSNEDQRLPHHLEPKQYLSLQQLDELGVLSWRLDADKYETDEELKKIRRDRNYSYMDFCEVCPEKLPNYEEKIKNFYEEHLHTDEE
        KD REEVIQAWYMDDS EDQRLPHH +PK++LSL +L ELGVLSWRLDAD YETDE+LKKIR  R YSYMDFCEVCPEKLPNYE K+K+F+EEHLHTDEE
Subjt:  KDAREEVIQAWYMDDSNEDQRLPHHLEPKQYLSLQQLDELGVLSWRLDADKYETDEELKKIRRDRNYSYMDFCEVCPEKLPNYEEKIKNFYEEHLHTDEE

Query:  IRYCVAGSGYFDVRDLNDKWIRIWVKKGAMIVLPAGIYHRFTLDSDNYIKAMRLFVGDPVWTPHNRPNDHLPA
        IRYCVAGSGYFDVRD N+ WIR+WVKKG MIVLPAGIYHRFT+DSDNYIKAMRLFVG+PVWTP+NRP+DHLPA
Subjt:  IRYCVAGSGYFDVRDLNDKWIRIWVKKGAMIVLPAGIYHRFTLDSDNYIKAMRLFVGDPVWTPHNRPNDHLPA

Q9LUH8 Heat stress transcription factor A-6b2.3e-9148.64Show/hide
Query:  RRVKEEFPAS---------------NSAFSNGA---PSPTVAPQPMEGLHEAGPPPFLTKTYEFVEDSTTNHIVSWSRDNNSFIVWDPQSFSLTLLPKYF
        R +KEEFPA                +S+ +  A   P+    PQP+EGLHE+GPPPFLTKTY+ VEDS TNH+VSWS+ NNSFIVWDPQ+FS+TLLP++F
Subjt:  RRVKEEFPAS---------------NSAFSNGA---PSPTVAPQPMEGLHEAGPPPFLTKTYEFVEDSTTNHIVSWSRDNNSFIVWDPQSFSLTLLPKYF

Query:  KHSNFSSFVRQLNTYGFRKIDPDKWEFAHEGFLRGQKHLLKLIRRRKTVQPNATLQAPDP---------CVEVGRFGLDGEIDRLQRDKQVLMGELVKLR
        KH+NFSSFVRQLNTYGFRK++PD+WEFA+EGFLRGQKHLLK IRRRKT   +  +Q P           C+EVGR+GLDGE+D L+RDKQVLM ELV+LR
Subjt:  KHSNFSSFVRQLNTYGFRKIDPDKWEFAHEGFLRGQKHLLKLIRRRKTVQPNATLQAPDP---------CVEVGRFGLDGEIDRLQRDKQVLMGELVKLR

Query:  QQQQTTKGQLQTMERRLKKTETRQQLMMNFLARAIQFPDFIQQLIHQKDKHKELEEAINRKRRRHIDQGLSPTSDEYDEEDKALFSNDI-----------
        QQQQ+TK  L  +E +LKKTE++Q+ MM+FLARA+Q PDFIQQL+ QK+K KE+EEAI++KR+R IDQG     D  DE     + ND+           
Subjt:  QQQQTTKGQLQTMERRLKKTETRQQLMMNFLARAIQFPDFIQQLIHQKDKHKELEEAINRKRRRHIDQGLSPTSDEYDEEDKALFSNDI-----------

Query:  ---------------------HVD---------VELLAVEMNQNNQHFAKEEMG--EKGDEVMDDGFWENLLNEANEEGFGVHGFDEQDDEVEDGDELVA
                             H+           E+L VE   + +    ++ G  ++ +E+  +GFWE+LLNE          FD + D+ E+ D L+ 
Subjt:  ---------------------HVD---------VELLAVEMNQNNQHFAKEEMG--EKGDEVMDDGFWENLLNEANEEGFGVHGFDEQDDEVEDGDELVA

Query:  HFGFL
          G+L
Subjt:  HFGFL

Arabidopsis top hitse value%identityAlignment
AT3G22830.1 heat shock transcription factor A6B1.6e-9248.64Show/hide
Query:  RRVKEEFPAS---------------NSAFSNGA---PSPTVAPQPMEGLHEAGPPPFLTKTYEFVEDSTTNHIVSWSRDNNSFIVWDPQSFSLTLLPKYF
        R +KEEFPA                +S+ +  A   P+    PQP+EGLHE+GPPPFLTKTY+ VEDS TNH+VSWS+ NNSFIVWDPQ+FS+TLLP++F
Subjt:  RRVKEEFPAS---------------NSAFSNGA---PSPTVAPQPMEGLHEAGPPPFLTKTYEFVEDSTTNHIVSWSRDNNSFIVWDPQSFSLTLLPKYF

Query:  KHSNFSSFVRQLNTYGFRKIDPDKWEFAHEGFLRGQKHLLKLIRRRKTVQPNATLQAPDP---------CVEVGRFGLDGEIDRLQRDKQVLMGELVKLR
        KH+NFSSFVRQLNTYGFRK++PD+WEFA+EGFLRGQKHLLK IRRRKT   +  +Q P           C+EVGR+GLDGE+D L+RDKQVLM ELV+LR
Subjt:  KHSNFSSFVRQLNTYGFRKIDPDKWEFAHEGFLRGQKHLLKLIRRRKTVQPNATLQAPDP---------CVEVGRFGLDGEIDRLQRDKQVLMGELVKLR

Query:  QQQQTTKGQLQTMERRLKKTETRQQLMMNFLARAIQFPDFIQQLIHQKDKHKELEEAINRKRRRHIDQGLSPTSDEYDEEDKALFSNDI-----------
        QQQQ+TK  L  +E +LKKTE++Q+ MM+FLARA+Q PDFIQQL+ QK+K KE+EEAI++KR+R IDQG     D  DE     + ND+           
Subjt:  QQQQTTKGQLQTMERRLKKTETRQQLMMNFLARAIQFPDFIQQLIHQKDKHKELEEAINRKRRRHIDQGLSPTSDEYDEEDKALFSNDI-----------

Query:  ---------------------HVD---------VELLAVEMNQNNQHFAKEEMG--EKGDEVMDDGFWENLLNEANEEGFGVHGFDEQDDEVEDGDELVA
                             H+           E+L VE   + +    ++ G  ++ +E+  +GFWE+LLNE          FD + D+ E+ D L+ 
Subjt:  ---------------------HVD---------VELLAVEMNQNNQHFAKEEMG--EKGDEVMDDGFWENLLNEANEEGFGVHGFDEQDDEVEDGDELVA

Query:  HFGFL
          G+L
Subjt:  HFGFL

AT4G14710.1 RmlC-like cupins superfamily protein2.9e-8984.39Show/hide
Query:  KDAREEVIQAWYMDDSNEDQRLPHHLEPKQYLSLQQLDELGVLSWRLDADKYETDEELKKIRRDRNYSYMDFCEVCPEKLPNYEEKIKNFYEEHLHTDEE
        KD REEVIQAWYMDDS EDQRLPHH +PK++LSL +L ELGVLSWRLDAD YETDE+LKKIR  R YSYMDFCEVCPEKLPNYE K+K+F+EEHLHTDEE
Subjt:  KDAREEVIQAWYMDDSNEDQRLPHHLEPKQYLSLQQLDELGVLSWRLDADKYETDEELKKIRRDRNYSYMDFCEVCPEKLPNYEEKIKNFYEEHLHTDEE

Query:  IRYCVAGSGYFDVRDLNDKWIRIWVKKGAMIVLPAGIYHRFTLDSDNYIKAMRLFVGDPVWTPHNRPNDHLPA
        IRYCVAGSGYFDVRD N+ WIR+WVKKG MIVLPAGIYHRFT+DSDNYIKAMRLFVG+PVWTP+NRP+DHLPA
Subjt:  IRYCVAGSGYFDVRDLNDKWIRIWVKKGAMIVLPAGIYHRFTLDSDNYIKAMRLFVGDPVWTPHNRPNDHLPA

AT4G14710.2 RmlC-like cupins superfamily protein2.9e-8984.39Show/hide
Query:  KDAREEVIQAWYMDDSNEDQRLPHHLEPKQYLSLQQLDELGVLSWRLDADKYETDEELKKIRRDRNYSYMDFCEVCPEKLPNYEEKIKNFYEEHLHTDEE
        KD REEVIQAWYMDDS EDQRLPHH +PK++LSL +L ELGVLSWRLDAD YETDE+LKKIR  R YSYMDFCEVCPEKLPNYE K+K+F+EEHLHTDEE
Subjt:  KDAREEVIQAWYMDDSNEDQRLPHHLEPKQYLSLQQLDELGVLSWRLDADKYETDEELKKIRRDRNYSYMDFCEVCPEKLPNYEEKIKNFYEEHLHTDEE

Query:  IRYCVAGSGYFDVRDLNDKWIRIWVKKGAMIVLPAGIYHRFTLDSDNYIKAMRLFVGDPVWTPHNRPNDHLPA
        IRYCVAGSGYFDVRD N+ WIR+WVKKG MIVLPAGIYHRFT+DSDNYIKAMRLFVG+PVWTP+NRP+DHLPA
Subjt:  IRYCVAGSGYFDVRDLNDKWIRIWVKKGAMIVLPAGIYHRFTLDSDNYIKAMRLFVGDPVWTPHNRPNDHLPA

AT4G14710.4 RmlC-like cupins superfamily protein2.2e-8982.95Show/hide
Query:  KDAREEVIQAWYMDDSNEDQRLPHHLEPKQYLSLQQLDELGVLSWRLDADKYETDEELKKIRRDRNYSYMDFCEVCPEKLPNYEEKIKNFYEEHLHTDEE
        KD REEVIQAWYMDDS EDQRLPHH +PK++LSL +L ELGVLSWRLDAD YETDE+LKKIR  R YSYMDFCEVCPEKLPNYE K+K+F+EEHLHTDEE
Subjt:  KDAREEVIQAWYMDDSNEDQRLPHHLEPKQYLSLQQLDELGVLSWRLDADKYETDEELKKIRRDRNYSYMDFCEVCPEKLPNYEEKIKNFYEEHLHTDEE

Query:  IRYCVAGSGYFDVRDLNDKWIRIWVKKGAMIVLPAGIYHRFTLDSDNYIKAMRLFVGDPVWTPHNRPNDHLPASLN
        IRYCVAGSGYFDVRD N+ WIR+WVKKG MIVLPAGIYHRFT+DSDNYIKAMRLFVG+PVWTP+NRP+DHLPA ++
Subjt:  IRYCVAGSGYFDVRDLNDKWIRIWVKKGAMIVLPAGIYHRFTLDSDNYIKAMRLFVGDPVWTPHNRPNDHLPASLN

AT4G14710.5 RmlC-like cupins superfamily protein1.1e-8884.3Show/hide
Query:  DAREEVIQAWYMDDSNEDQRLPHHLEPKQYLSLQQLDELGVLSWRLDADKYETDEELKKIRRDRNYSYMDFCEVCPEKLPNYEEKIKNFYEEHLHTDEEI
        D REEVIQAWYMDDS EDQRLPHH +PK++LSL +L ELGVLSWRLDAD YETDE+LKKIR  R YSYMDFCEVCPEKLPNYE K+K+F+EEHLHTDEEI
Subjt:  DAREEVIQAWYMDDSNEDQRLPHHLEPKQYLSLQQLDELGVLSWRLDADKYETDEELKKIRRDRNYSYMDFCEVCPEKLPNYEEKIKNFYEEHLHTDEEI

Query:  RYCVAGSGYFDVRDLNDKWIRIWVKKGAMIVLPAGIYHRFTLDSDNYIKAMRLFVGDPVWTPHNRPNDHLPA
        RYCVAGSGYFDVRD N+ WIR+WVKKG MIVLPAGIYHRFT+DSDNYIKAMRLFVG+PVWTP+NRP+DHLPA
Subjt:  RYCVAGSGYFDVRDLNDKWIRIWVKKGAMIVLPAGIYHRFTLDSDNYIKAMRLFVGDPVWTPHNRPNDHLPA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGTTCCCGATAAGGATGCGAGAGAGGAAGTGATTCAGGCATGGTATATGGATGATAGTAATGAAGACCAGAGGCTTCCTCATCACCTTGAACCGAAGCAATATTT
GTCCCTCCAACAACTTGATGAACTTGGAGTTCTGAGCTGGAGACTGGATGCAGACAAATACGAAACAGATGAGGAATTGAAGAAGATACGTCGTGATCGTAATTACTCCT
ACATGGACTTTTGCGAGGTCTGTCCAGAAAAGCTTCCTAATTATGAAGAGAAGATTAAGAACTTTTACGAGGAACATTTGCACACCGATGAGGAGATCCGTTACTGTGTG
GCAGGGAGTGGTTATTTTGATGTTAGGGATCTGAATGATAAATGGATTCGCATTTGGGTAAAGAAGGGAGCAATGATTGTCTTACCTGCTGGAATTTATCATCGCTTCAC
TCTGGATTCTGACAACTACATCAAGGCTATGCGACTCTTTGTTGGCGATCCTGTCTGGACTCCTCACAACCGTCCCAACGATCATCTTCCCGCGAGCTTGAACTCTGCTC
GCGCTTCTGAAGTTGGATCGGGTCTTACCTGTCGAACCCGTTACTATTTGTATGGTCCGATCATGAATCGTCTCCGCCGAGTGAAGGAGGAGTTTCCGGCGTCGAATTCT
GCATTTTCTAACGGAGCGCCGTCGCCCACGGTGGCTCCTCAGCCGATGGAGGGTCTTCACGAGGCCGGTCCTCCGCCATTTCTGACCAAAACATACGAATTCGTCGAAGA
TTCAACCACCAATCATATTGTCTCTTGGAGTAGAGACAACAACAGCTTCATCGTTTGGGATCCTCAATCCTTTTCCTTGACCCTTCTCCCCAAATACTTCAAACACAGCA
ATTTTTCCAGCTTTGTTCGACAACTTAACACCTATGGATTTAGGAAGATTGATCCAGACAAGTGGGAGTTTGCTCACGAAGGTTTCTTGAGAGGGCAGAAGCATCTTCTG
AAGCTGATTCGGAGGAGGAAAACGGTGCAACCAAATGCCACTCTCCAAGCTCCGGACCCTTGCGTCGAAGTCGGGCGCTTCGGCCTTGACGGAGAAATCGATCGGCTGCA
GAGAGACAAGCAGGTTTTAATGGGGGAGCTGGTAAAGCTAAGACAACAGCAGCAAACCACCAAAGGGCAGCTCCAAACAATGGAGCGCCGCCTCAAAAAGACAGAGACAA
GGCAGCAACTAATGATGAACTTCTTAGCAAGAGCAATTCAATTCCCTGATTTCATCCAACAACTCATCCACCAAAAGGACAAGCATAAAGAGCTTGAAGAAGCCATCAAT
AGAAAACGAAGACGACACATCGACCAAGGCCTTTCCCCAACCTCCGACGAATATGACGAAGAAGACAAGGCGCTGTTCTCGAACGACATCCATGTGGATGTCGAGCTACT
AGCCGTTGAGATGAACCAAAATAACCAGCATTTTGCTAAAGAAGAAATGGGTGAAAAAGGAGATGAAGTAATGGATGATGGGTTTTGGGAGAATTTGTTGAATGAGGCTA
ATGAGGAAGGGTTTGGAGTTCATGGGTTTGATGAACAAGATGATGAGGTTGAAGATGGAGATGAATTGGTCGCTCATTTTGGGTTTTTGAATTATAATCTTAAATAG
mRNA sequenceShow/hide mRNA sequence
GCTTCTTCGTTTTCGCCTCTCTGCGTTGCCTTGAAGCGTTCCAATTTGATTGGGATTATTATTCGCAGAGGCCGCAGGTTGCAGTTGACGATGGCTGTTCCCGATAAGGA
TGCGAGAGAGGAAGTGATTCAGGCATGGTATATGGATGATAGTAATGAAGACCAGAGGCTTCCTCATCACCTTGAACCGAAGCAATATTTGTCCCTCCAACAACTTGATG
AACTTGGAGTTCTGAGCTGGAGACTGGATGCAGACAAATACGAAACAGATGAGGAATTGAAGAAGATACGTCGTGATCGTAATTACTCCTACATGGACTTTTGCGAGGTC
TGTCCAGAAAAGCTTCCTAATTATGAAGAGAAGATTAAGAACTTTTACGAGGAACATTTGCACACCGATGAGGAGATCCGTTACTGTGTGGCAGGGAGTGGTTATTTTGA
TGTTAGGGATCTGAATGATAAATGGATTCGCATTTGGGTAAAGAAGGGAGCAATGATTGTCTTACCTGCTGGAATTTATCATCGCTTCACTCTGGATTCTGACAACTACA
TCAAGGCTATGCGACTCTTTGTTGGCGATCCTGTCTGGACTCCTCACAACCGTCCCAACGATCATCTTCCCGCGAGCTTGAACTCTGCTCGCGCTTCTGAAGTTGGATCG
GGTCTTACCTGTCGAACCCGTTACTATTTGTATGGTCCGATCATGAATCGTCTCCGCCGAGTGAAGGAGGAGTTTCCGGCGTCGAATTCTGCATTTTCTAACGGAGCGCC
GTCGCCCACGGTGGCTCCTCAGCCGATGGAGGGTCTTCACGAGGCCGGTCCTCCGCCATTTCTGACCAAAACATACGAATTCGTCGAAGATTCAACCACCAATCATATTG
TCTCTTGGAGTAGAGACAACAACAGCTTCATCGTTTGGGATCCTCAATCCTTTTCCTTGACCCTTCTCCCCAAATACTTCAAACACAGCAATTTTTCCAGCTTTGTTCGA
CAACTTAACACCTATGGATTTAGGAAGATTGATCCAGACAAGTGGGAGTTTGCTCACGAAGGTTTCTTGAGAGGGCAGAAGCATCTTCTGAAGCTGATTCGGAGGAGGAA
AACGGTGCAACCAAATGCCACTCTCCAAGCTCCGGACCCTTGCGTCGAAGTCGGGCGCTTCGGCCTTGACGGAGAAATCGATCGGCTGCAGAGAGACAAGCAGGTTTTAA
TGGGGGAGCTGGTAAAGCTAAGACAACAGCAGCAAACCACCAAAGGGCAGCTCCAAACAATGGAGCGCCGCCTCAAAAAGACAGAGACAAGGCAGCAACTAATGATGAAC
TTCTTAGCAAGAGCAATTCAATTCCCTGATTTCATCCAACAACTCATCCACCAAAAGGACAAGCATAAAGAGCTTGAAGAAGCCATCAATAGAAAACGAAGACGACACAT
CGACCAAGGCCTTTCCCCAACCTCCGACGAATATGACGAAGAAGACAAGGCGCTGTTCTCGAACGACATCCATGTGGATGTCGAGCTACTAGCCGTTGAGATGAACCAAA
ATAACCAGCATTTTGCTAAAGAAGAAATGGGTGAAAAAGGAGATGAAGTAATGGATGATGGGTTTTGGGAGAATTTGTTGAATGAGGCTAATGAGGAAGGGTTTGGAGTT
CATGGGTTTGATGAACAAGATGATGAGGTTGAAGATGGAGATGAATTGGTCGCTCATTTTGGGTTTTTGAATTATAATCTTAAATAGATGATGGATCGTAATTTTTTTAT
TTTTATAAATAATCGGGGTTCGGGAGAGCGGAAATATCTCAAGTGGGCTCGTTTGGGAAGAAAGATGAAGTGCAATATGTAGAGCACATGTAAGATACAGTCGGATTGAT
CACACACAGAGATATCTAAAGATCGAGAGACATGAAAGTAACAACCTTCTTTTATTGCTGCCTCCCACTTAGGCTACTACTAC
Protein sequenceShow/hide protein sequence
MAVPDKDAREEVIQAWYMDDSNEDQRLPHHLEPKQYLSLQQLDELGVLSWRLDADKYETDEELKKIRRDRNYSYMDFCEVCPEKLPNYEEKIKNFYEEHLHTDEEIRYCV
AGSGYFDVRDLNDKWIRIWVKKGAMIVLPAGIYHRFTLDSDNYIKAMRLFVGDPVWTPHNRPNDHLPASLNSARASEVGSGLTCRTRYYLYGPIMNRLRRVKEEFPASNS
AFSNGAPSPTVAPQPMEGLHEAGPPPFLTKTYEFVEDSTTNHIVSWSRDNNSFIVWDPQSFSLTLLPKYFKHSNFSSFVRQLNTYGFRKIDPDKWEFAHEGFLRGQKHLL
KLIRRRKTVQPNATLQAPDPCVEVGRFGLDGEIDRLQRDKQVLMGELVKLRQQQQTTKGQLQTMERRLKKTETRQQLMMNFLARAIQFPDFIQQLIHQKDKHKELEEAIN
RKRRRHIDQGLSPTSDEYDEEDKALFSNDIHVDVELLAVEMNQNNQHFAKEEMGEKGDEVMDDGFWENLLNEANEEGFGVHGFDEQDDEVEDGDELVAHFGFLNYNLK