; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi05G011080 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi05G011080
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Descriptionprotein SULFUR DEFICIENCY-INDUCED 1
Genome locationchr05:18846769..18862369
RNA-Seq ExpressionLsi05G011080
SyntenyLsi05G011080
Gene Ontology termsGO:0015940 - pantothenate biosynthetic process (biological process)
GO:0004592 - pantoate-beta-alanine ligase activity (molecular function)
GO:0005515 - protein binding (molecular function)
GO:0005524 - ATP binding (molecular function)
InterPro domainsIPR003721 - Pantoate-beta-alanine ligase
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR014729 - Rossmann-like alpha/beta/alpha sandwich fold
IPR019734 - Tetratricopeptide repeat
IPR042176 - Pantoate-beta-alanine ligase, C-terminal domain
IPR044961 - Tetratricopeptide repeat protein POLLENLESS 3/SULFUR DEFICIENCY-INDUCED 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7011320.1 Protein SULFUR DEFICIENCY-INDUCED 1, partial [Cucurbita argyrosperma subsp. argyrosperma]1.7e-14587.99Show/hide
Query:  MVLEEEKSKEREEMEGRLKIKGSKGKKEELFHVIHKVPPGDSPYVRAKYAQLIQKDPESAIVLFWEAINSGDRVESALKDMAVVMKQLDRAQEAIHILKT
        M LE EK  ERE++  R      K  K+ELFHVIHKVPPGD+PYVRAKYAQLI+KDPESAI LFWEAIN+GDRVESALKDMAVVMKQ+DRA+EAIHIL+T
Subjt:  MVLEEEKSKEREEMEGRLKIKGSKGKKEELFHVIHKVPPGDSPYVRAKYAQLIQKDPESAIVLFWEAINSGDRVESALKDMAVVMKQLDRAQEAIHILKT

Query:  FRFLCSKHSQESLDNVLIDLFKKCGRIEEQIELLKRKLRMIYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQMI
        +RFLCSKHSQ+SLDNVLIDLFKKCGRIEEQIE+LKRKLR IYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQ+I
Subjt:  FRFLCSKHSQESLDNVLIDLFKKCGRIEEQIELLKRKLRMIYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQMI

Query:  DPDANKACNLGLCLMKQGRLNEAISVLEQVQQGQIPGSDENKAQKRAADLLTEIRSRQSLPDSIELLGLSIDADLLNGLEQLVNKRGPFRSKRLPVFEEI
        DPDANKACNLGLCLMKQGRLNEAISVL+QVQQG+IPGSDE KAQKRA DLLT+IRSRQSLPDSIELLGLSID DLLNGLEQLV++RGPFRSKRLPVFEEI
Subjt:  DPDANKACNLGLCLMKQGRLNEAISVLEQVQQGQIPGSDENKAQKRAADLLTEIRSRQSLPDSIELLGLSIDADLLNGLEQLVNKRGPFRSKRLPVFEEI

Query:  SSFRDQLA
        SSFRDQLA
Subjt:  SSFRDQLA

XP_022963805.1 protein SULFUR DEFICIENCY-INDUCED 1-like isoform X1 [Cucurbita moschata]1.7e-14587.99Show/hide
Query:  MVLEEEKSKEREEMEGRLKIKGSKGKKEELFHVIHKVPPGDSPYVRAKYAQLIQKDPESAIVLFWEAINSGDRVESALKDMAVVMKQLDRAQEAIHILKT
        M LE EK  ERE++  R      K  K+ELFHVIHKVPPGD+PYVRAKYAQLI+KDPESAI LFWEAIN+GDRVESALKDMAVVMKQ+DRA+EAIHIL+T
Subjt:  MVLEEEKSKEREEMEGRLKIKGSKGKKEELFHVIHKVPPGDSPYVRAKYAQLIQKDPESAIVLFWEAINSGDRVESALKDMAVVMKQLDRAQEAIHILKT

Query:  FRFLCSKHSQESLDNVLIDLFKKCGRIEEQIELLKRKLRMIYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQMI
        +RFLCSKHSQ+SLDNVLIDLFKKCGRIEEQIE+LKRKLR IYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQ+I
Subjt:  FRFLCSKHSQESLDNVLIDLFKKCGRIEEQIELLKRKLRMIYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQMI

Query:  DPDANKACNLGLCLMKQGRLNEAISVLEQVQQGQIPGSDENKAQKRAADLLTEIRSRQSLPDSIELLGLSIDADLLNGLEQLVNKRGPFRSKRLPVFEEI
        DPDANKACNLGLCLMKQGRLNEAISVL+QVQQG+IPGSDE KAQKRA DLLT+IRSRQSLPDSIELLGLSID DLLNGLEQLV++RGPFRSKRLPVFEEI
Subjt:  DPDANKACNLGLCLMKQGRLNEAISVLEQVQQGQIPGSDENKAQKRAADLLTEIRSRQSLPDSIELLGLSIDADLLNGLEQLVNKRGPFRSKRLPVFEEI

Query:  SSFRDQLA
        SSFRDQLA
Subjt:  SSFRDQLA

XP_022967303.1 protein SULFUR DEFICIENCY-INDUCED 1-like isoform X1 [Cucurbita maxima]5.4e-14487.99Show/hide
Query:  MVLEEEKSKEREEMEGRLKIKGSKGKKEELFHVIHKVPPGDSPYVRAKYAQLIQKDPESAIVLFWEAINSGDRVESALKDMAVVMKQLDRAQEAIHILKT
        M LE EK  ERE+          K  K+ELFHVIHKVPPGD+PYVRAKYAQLI+KDPESAI LFWEAIN+GDRVESALKDMAVVMKQ+DRA+EAI ILKT
Subjt:  MVLEEEKSKEREEMEGRLKIKGSKGKKEELFHVIHKVPPGDSPYVRAKYAQLIQKDPESAIVLFWEAINSGDRVESALKDMAVVMKQLDRAQEAIHILKT

Query:  FRFLCSKHSQESLDNVLIDLFKKCGRIEEQIELLKRKLRMIYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQMI
        +RFLCSKHSQESLDNVLIDLFKKCGRIEEQIELLKRKLR IYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPN+MMAEAVYKKAQ+I
Subjt:  FRFLCSKHSQESLDNVLIDLFKKCGRIEEQIELLKRKLRMIYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQMI

Query:  DPDANKACNLGLCLMKQGRLNEAISVLEQVQQGQIPGSDENKAQKRAADLLTEIRSRQSLPDSIELLGLSIDADLLNGLEQLVNKRGPFRSKRLPVFEEI
        DPDANKACNLGLCLMKQGRLNEAISVL+QVQQG IPGSDE KAQKRA DLLT+IRSRQSLPDSIELLGLSID DLLNGLEQLV++RGPFRSKRLPVFEEI
Subjt:  DPDANKACNLGLCLMKQGRLNEAISVLEQVQQGQIPGSDENKAQKRAADLLTEIRSRQSLPDSIELLGLSIDADLLNGLEQLVNKRGPFRSKRLPVFEEI

Query:  SSFRDQLA
        SSFRDQLA
Subjt:  SSFRDQLA

XP_023511525.1 protein SULFUR DEFICIENCY-INDUCED 1-like [Cucurbita pepo subsp. pepo]2.1e-14387.79Show/hide
Query:  EKSKEREEMEGRLKIKGSKGKKEELFHVIHKVPPGDSPYVRAKYAQLIQKDPESAIVLFWEAINSGDRVESALKDMAVVMKQLDRAQEAIHILKTFRFLC
        E+ +E+ E E +L  +  K  K+ELFHVIHKVPPGD+PYVRAKYAQLI+KDPESAI LFWEAIN+GDRVESALKDMAVVMKQ+DRA+EAI ILKT+RFLC
Subjt:  EKSKEREEMEGRLKIKGSKGKKEELFHVIHKVPPGDSPYVRAKYAQLIQKDPESAIVLFWEAINSGDRVESALKDMAVVMKQLDRAQEAIHILKTFRFLC

Query:  SKHSQESLDNVLIDLFKKCGRIEEQIELLKRKLRMIYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQMIDPDAN
        SKHSQESLDNVLIDLFKKCGRIEEQIE++KRKLR IYEGE FNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQ+IDPDAN
Subjt:  SKHSQESLDNVLIDLFKKCGRIEEQIELLKRKLRMIYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQMIDPDAN

Query:  KACNLGLCLMKQGRLNEAISVLEQVQQGQIPGSDENKAQKRAADLLTEIRSRQSLPDSIELLGLSIDADLLNGLEQLVNKRGPFRSKRLPVFEEISSFRD
        KACNLGLCLMKQGRLNEAISVL+QVQQG+IPGSDE KAQKRA DLLT+IRSRQSLPDSIELLGLSID DLLNGLEQLV++RGPFRSKRLPVFEEISSFRD
Subjt:  KACNLGLCLMKQGRLNEAISVLEQVQQGQIPGSDENKAQKRAADLLTEIRSRQSLPDSIELLGLSIDADLLNGLEQLVNKRGPFRSKRLPVFEEISSFRD

Query:  QLA
        QLA
Subjt:  QLA

XP_038888420.1 protein SULFUR DEFICIENCY-INDUCED 1 [Benincasa hispida]1.8e-14790.58Show/hide
Query:  MVLEEEKSKEREEMEGRLKIKGSKGKKEELFHVIHKVPPGDSPYVRAKYAQLIQKDPESAIVLFWEAINSGDRVESALKDMAVVMKQLDRAQEAIHILKT
        MVLEEEKS+E+EE++     +GSKG KEE FHV HKVPPGDSPYVRAKYAQLI+KDPESAI LFWEAIN  DRVESALKDM VVMKQL+RA+EAIHILKT
Subjt:  MVLEEEKSKEREEMEGRLKIKGSKGKKEELFHVIHKVPPGDSPYVRAKYAQLIQKDPESAIVLFWEAINSGDRVESALKDMAVVMKQLDRAQEAIHILKT

Query:  FRFLCSKHSQESLDNVLIDLFKKCGRIEEQIELLKRKLRMIYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQMI
        FRFLCSK SQES+DNVLIDLFKKCGRIEEQIELLKRKLRMIY+GEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQMI
Subjt:  FRFLCSKHSQESLDNVLIDLFKKCGRIEEQIELLKRKLRMIYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQMI

Query:  DPDANKACNLGLCLMKQGRLNEAISVLEQVQQGQIPGSDENKAQKRAADLLTEIRSRQSLPDSIELLGLSIDADLLNGLEQLVNKRGPFRSKRLPVFEEI
        DPDANKACNLGLCLMKQGRL+EAI VLEQVQQG IPGSDE KAQKRAADLLTEIRSRQSLP+SIELLGLSIDADLLNGLEQLVNK+GPFRSKRLPVFEEI
Subjt:  DPDANKACNLGLCLMKQGRLNEAISVLEQVQQGQIPGSDENKAQKRAADLLTEIRSRQSLPDSIELLGLSIDADLLNGLEQLVNKRGPFRSKRLPVFEEI

Query:  SSFRDQLA
        SSFRDQLA
Subjt:  SSFRDQLA

TrEMBL top hitse value%identityAlignment
A0A0A0L929 TPR_REGION domain-containing protein2.9e-14388.06Show/hide
Query:  VLEEEK--SKEREEMEGRLKIKGSKGKKEELFHVIHKVPPGDSPYVRAKYAQLIQKDPESAIVLFWEAINSGDRVESALKDMAVVMKQLDRAQEAIHILK
        VLEEE+  SK+ E +EG    KGS   K+ELFHVIHKVPPGD+PYVRAKYAQLI+KDPESAI LFWEAIN GDRVESALKDMAVVMKQ+DRA+EAIHIL+
Subjt:  VLEEEK--SKEREEMEGRLKIKGSKGKKEELFHVIHKVPPGDSPYVRAKYAQLIQKDPESAIVLFWEAINSGDRVESALKDMAVVMKQLDRAQEAIHILK

Query:  TFRFLCSKHSQESLDNVLIDLFKKCGRIEEQIELLKRKLRMIYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQM
        TFRFLCSKHSQ SLDNVLIDLFKKCGRIEEQIELLKRKLRMIY+GEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQM
Subjt:  TFRFLCSKHSQESLDNVLIDLFKKCGRIEEQIELLKRKLRMIYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQM

Query:  IDPDANKACNLGLCLMKQGRLNEAISVLEQVQQGQIPGSDENKAQKRAADLLTEIRSRQSLPDSIELLGLSIDADLLNGLEQLVNKRGPF-RSKRLPVFE
        IDPDANKACNLGLCLMKQGRL+EAI VLEQVQQ QIPGS E KAQKR+ADLLTEIRSRQSLPDSI+LLGLS+D D LNGLE LVNK+GPF RSKRLPVFE
Subjt:  IDPDANKACNLGLCLMKQGRLNEAISVLEQVQQGQIPGSDENKAQKRAADLLTEIRSRQSLPDSIELLGLSIDADLLNGLEQLVNKRGPF-RSKRLPVFE

Query:  EISSFRDQLA
        EISSFRDQLA
Subjt:  EISSFRDQLA

A0A1S3CN21 protein SULFUR DEFICIENCY-INDUCED 1-like isoform X11.1e-14287.82Show/hide
Query:  MVLEEE-----KSKEREEMEGRLKIKGSKGKKEELFHVIHKVPPGDSPYVRAKYAQLIQKDPESAIVLFWEAINSGDRVESALKDMAVVMKQLDRAQEAI
        M +EEE     + K+ E MEG    KGS   K+ELFHVIHKVPPGDSPYVRAKYAQLI+KDPESAI LFWEAIN GDRVESALKDMAVVMKQ+DRA+EAI
Subjt:  MVLEEE-----KSKEREEMEGRLKIKGSKGKKEELFHVIHKVPPGDSPYVRAKYAQLIQKDPESAIVLFWEAINSGDRVESALKDMAVVMKQLDRAQEAI

Query:  HILKTFRFLCSKHSQESLDNVLIDLFKKCGRIEEQIELLKRKLRMIYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYK
         IL+TFRFLCSKHSQ SLDNVLIDLFKKCGRIEEQIELLKRKLRMIY+GEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYK
Subjt:  HILKTFRFLCSKHSQESLDNVLIDLFKKCGRIEEQIELLKRKLRMIYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYK

Query:  KAQMIDPDANKACNLGLCLMKQGRLNEAISVLEQVQQGQIPGSDENKAQKRAADLLTEIRSRQSLPDSIELLGLSIDADLLNGLEQLVNKRGPF-RSKRL
        KAQMIDPDANKACNLGLCLMKQGRLNEA  VLEQVQQ QIPGSDE KAQKRAADLLTEIRSRQSLPDSIELLGLS+D DLLNGLE LVNK+GPF RSKRL
Subjt:  KAQMIDPDANKACNLGLCLMKQGRLNEAISVLEQVQQGQIPGSDENKAQKRAADLLTEIRSRQSLPDSIELLGLSIDADLLNGLEQLVNKRGPF-RSKRL

Query:  PVFEEISSFRDQ
        PVFEEISSFRDQ
Subjt:  PVFEEISSFRDQ

A0A1S3CNG1 protein SULFUR DEFICIENCY-INDUCED 1-like isoform X25.1e-14087.18Show/hide
Query:  MVLEEE-----KSKEREEMEGRLKIKGSKGKKEELFHVIHKVPPGDSPYVRAKYAQLIQKDPESAIVLFWEAINSGDRVESALKDMAVVMKQLDRAQEAI
        M +EEE     + K+ E MEG    KGS   K+ELFHVIHKVPPGDSPYVRAKYA   QKDPESAI LFWEAIN GDRVESALKDMAVVMKQ+DRA+EAI
Subjt:  MVLEEE-----KSKEREEMEGRLKIKGSKGKKEELFHVIHKVPPGDSPYVRAKYAQLIQKDPESAIVLFWEAINSGDRVESALKDMAVVMKQLDRAQEAI

Query:  HILKTFRFLCSKHSQESLDNVLIDLFKKCGRIEEQIELLKRKLRMIYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYK
         IL+TFRFLCSKHSQ SLDNVLIDLFKKCGRIEEQIELLKRKLRMIY+GEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYK
Subjt:  HILKTFRFLCSKHSQESLDNVLIDLFKKCGRIEEQIELLKRKLRMIYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYK

Query:  KAQMIDPDANKACNLGLCLMKQGRLNEAISVLEQVQQGQIPGSDENKAQKRAADLLTEIRSRQSLPDSIELLGLSIDADLLNGLEQLVNKRGPF-RSKRL
        KAQMIDPDANKACNLGLCLMKQGRLNEA  VLEQVQQ QIPGSDE KAQKRAADLLTEIRSRQSLPDSIELLGLS+D DLLNGLE LVNK+GPF RSKRL
Subjt:  KAQMIDPDANKACNLGLCLMKQGRLNEAISVLEQVQQGQIPGSDENKAQKRAADLLTEIRSRQSLPDSIELLGLSIDADLLNGLEQLVNKRGPF-RSKRL

Query:  PVFEEISSFRDQ
        PVFEEISSFRDQ
Subjt:  PVFEEISSFRDQ

A0A6J1HJ12 protein SULFUR DEFICIENCY-INDUCED 1-like isoform X18.1e-14687.99Show/hide
Query:  MVLEEEKSKEREEMEGRLKIKGSKGKKEELFHVIHKVPPGDSPYVRAKYAQLIQKDPESAIVLFWEAINSGDRVESALKDMAVVMKQLDRAQEAIHILKT
        M LE EK  ERE++  R      K  K+ELFHVIHKVPPGD+PYVRAKYAQLI+KDPESAI LFWEAIN+GDRVESALKDMAVVMKQ+DRA+EAIHIL+T
Subjt:  MVLEEEKSKEREEMEGRLKIKGSKGKKEELFHVIHKVPPGDSPYVRAKYAQLIQKDPESAIVLFWEAINSGDRVESALKDMAVVMKQLDRAQEAIHILKT

Query:  FRFLCSKHSQESLDNVLIDLFKKCGRIEEQIELLKRKLRMIYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQMI
        +RFLCSKHSQ+SLDNVLIDLFKKCGRIEEQIE+LKRKLR IYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQ+I
Subjt:  FRFLCSKHSQESLDNVLIDLFKKCGRIEEQIELLKRKLRMIYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQMI

Query:  DPDANKACNLGLCLMKQGRLNEAISVLEQVQQGQIPGSDENKAQKRAADLLTEIRSRQSLPDSIELLGLSIDADLLNGLEQLVNKRGPFRSKRLPVFEEI
        DPDANKACNLGLCLMKQGRLNEAISVL+QVQQG+IPGSDE KAQKRA DLLT+IRSRQSLPDSIELLGLSID DLLNGLEQLV++RGPFRSKRLPVFEEI
Subjt:  DPDANKACNLGLCLMKQGRLNEAISVLEQVQQGQIPGSDENKAQKRAADLLTEIRSRQSLPDSIELLGLSIDADLLNGLEQLVNKRGPFRSKRLPVFEEI

Query:  SSFRDQLA
        SSFRDQLA
Subjt:  SSFRDQLA

A0A6J1HUP6 protein SULFUR DEFICIENCY-INDUCED 1-like isoform X12.6e-14487.99Show/hide
Query:  MVLEEEKSKEREEMEGRLKIKGSKGKKEELFHVIHKVPPGDSPYVRAKYAQLIQKDPESAIVLFWEAINSGDRVESALKDMAVVMKQLDRAQEAIHILKT
        M LE EK  ERE+          K  K+ELFHVIHKVPPGD+PYVRAKYAQLI+KDPESAI LFWEAIN+GDRVESALKDMAVVMKQ+DRA+EAI ILKT
Subjt:  MVLEEEKSKEREEMEGRLKIKGSKGKKEELFHVIHKVPPGDSPYVRAKYAQLIQKDPESAIVLFWEAINSGDRVESALKDMAVVMKQLDRAQEAIHILKT

Query:  FRFLCSKHSQESLDNVLIDLFKKCGRIEEQIELLKRKLRMIYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQMI
        +RFLCSKHSQESLDNVLIDLFKKCGRIEEQIELLKRKLR IYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPN+MMAEAVYKKAQ+I
Subjt:  FRFLCSKHSQESLDNVLIDLFKKCGRIEEQIELLKRKLRMIYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQMI

Query:  DPDANKACNLGLCLMKQGRLNEAISVLEQVQQGQIPGSDENKAQKRAADLLTEIRSRQSLPDSIELLGLSIDADLLNGLEQLVNKRGPFRSKRLPVFEEI
        DPDANKACNLGLCLMKQGRLNEAISVL+QVQQG IPGSDE KAQKRA DLLT+IRSRQSLPDSIELLGLSID DLLNGLEQLV++RGPFRSKRLPVFEEI
Subjt:  DPDANKACNLGLCLMKQGRLNEAISVLEQVQQGQIPGSDENKAQKRAADLLTEIRSRQSLPDSIELLGLSIDADLLNGLEQLVNKRGPFRSKRLPVFEEI

Query:  SSFRDQLA
        SSFRDQLA
Subjt:  SSFRDQLA

SwissProt top hitse value%identityAlignment
O24035 Pantoate--beta-alanine ligase1.8e-10558.98Show/hide
Query:  PKIITDKNQMRAWTRAMRSQGKTIAFVPTMGFLHDGHLSLIQEAHKHSQLVVVSIYVNPSQFGPSEDLSTYPSDFEGDIRKLMAVPRGIDAVFHPRNLYD
        P +I+DK++MR W+R+MRSQGK IA VPTMGFLH+GHLSL+++AH H+ LV VSIYVNP QF P+EDLS YPSDF+GD++KLM+VP G+D VFHP NLYD
Subjt:  PKIITDKNQMRAWTRAMRSQGKTIAFVPTMGFLHDGHLSLIQEAHKHSQLVVVSIYVNPSQFGPSEDLSTYPSDFEGDIRKLMAVPRGIDAVFHPRNLYD

Query:  YGVESAKDCVGNGSGCGGMAAVSCLE-ESGSGHETWVRVERLEKGMCGRSRPVFFRGVATIVAKLFNIVEPDVAVFGKKDYQQWRIIMRMGTKSCIFVRL
        YG +   D V     CGG   VSC++  SG GHETWVR E+LEK +CG+SRPVFFRGVATIV KLFNIVEPDVAVFGKKDYQQW+II RM          
Subjt:  YGVESAKDCVGNGSGCGGMAAVSCLE-ESGSGHETWVRVERLEKGMCGRSRPVFFRGVATIVAKLFNIVEPDVAVFGKKDYQQWRIIMRMGTKSCIFVRL

Query:  SSNFEKLLRSILLDELVRDLDFSINIVGSEIVRDADGLAMSSRNVRLSPEERQKALSINRSLSKAKSAAESGELNCKRLKNLIVDEVREAGGELDYA---
                        VRDLDFSI ++GSE++R+ DGLAMSSRNV LSPEER+KA+SIN+SL +AKSAAE G+++C++L NL+V  + EAGG +DYA   
Subjt:  SSNFEKLLRSILLDELVRDLDFSINIVGSEIVRDADGLAMSSRNVRLSPEERQKALSINRSLSKAKSAAESGELNCKRLKNLIVDEVREAGGELDYA---

Query:  -----ESMFWKEKQLQF-----FSKTRRIENIVI
             E + W +  + F     F K R I+NI I
Subjt:  -----ESMFWKEKQLQF-----FSKTRRIENIVI

O24210 Pantoate--beta-alanine ligase2.3e-8955.29Show/hide
Query:  KQPKIITDKNQMRAWTRAMRSQGKTIAFVPTMGFLHDGHLSLIQEAHKHSQ----LVVVSIYVNPSQFGPSEDLSTYPSDFEGDIRKLMAVPRGIDAVFH
        ++P++I DK  MRAW+R  R++GKT+A VPTMG+LH GHLSLI  A   +      +VV+IYVNPSQF PSEDL+TYPSDF GD+RKL A    +DAVF+
Subjt:  KQPKIITDKNQMRAWTRAMRSQGKTIAFVPTMGFLHDGHLSLIQEAHKHSQ----LVVVSIYVNPSQFGPSEDLSTYPSDFEGDIRKLMAVPRGIDAVFH

Query:  PRNLYDYGVESAKDCVGNGSGCGGMAAVSCLEE-SGSGHETWVRVERLEKGMCGRSRPVFFRGVATIVAKLFNIVEPDVAVFGKKDYQQWRIIMRMGTKS
        P +LY  G          G+  GG  A+SCLEE +G GHETWVRVERLEKGMCG SRPVFFRGVATIV+KLFNI+EPDVAVFGKKDYQQWR+I RM    
Subjt:  PRNLYDYGVESAKDCVGNGSGCGGMAAVSCLEE-SGSGHETWVRVERLEKGMCGRSRPVFFRGVATIVAKLFNIVEPDVAVFGKKDYQQWRIIMRMGTKS

Query:  CIFVRLSSNFEKLLRSILLDELVRDLDFSINIVGSEIVRDADGLAMSSRNVRLSPEERQKALSINRSLSKAKSAAESGELNCKRLKNLIVDEVREAGGEL
                              VRDLDF+I I+GSEIVR+ADGLAMSSRNV LS EER+KALSI+RSL  A++ A  G  +CK++KN IV  + E GG++
Subjt:  CIFVRLSSNFEKLLRSILLDELVRDLDFSINIVGSEIVRDADGLAMSSRNVRLSPEERQKALSINRSLSKAKSAAESGELNCKRLKNLIVDEVREAGGEL

Query:  DYAESMFWKE----KQLQ---------FFSKTRRIENIVI
        DY E +  +     +Q+          +F K R I+NI I
Subjt:  DYAESMFWKE----KQLQ---------FFSKTRRIENIVI

Q8GXU5 Protein SULFUR DEFICIENCY-INDUCED 11.0e-10062.21Show/hide
Query:  EKSKEREEMEGRLKIKGSKGKKEELFHVIHKVPPGDSPYVRAKYAQLIQKDPESAIVLFWEAINSGDRVESALKDMAVVMKQLDRAQEAIHILKTFRFLC
        E+S ++ +      IK +  K +ELFHVIHKVP GD+PYVRAK+AQLI+K+PE AIV FW+AIN+GDRV+SALKDMAVVMKQLDR++EAI  +K+FR  C
Subjt:  EKSKEREEMEGRLKIKGSKGKKEELFHVIHKVPPGDSPYVRAKYAQLIQKDPESAIVLFWEAINSGDRVESALKDMAVVMKQLDRAQEAIHILKTFRFLC

Query:  SKHSQESLDNVLIDLFKKCGRIEEQIELLKRKLRMIYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQMIDPDAN
        SK+SQ+SLDNVLIDL+KKCGR+EEQ+ELLKRKLR IY+GEAFNGKPT+TARSHGKKFQV+V+QE SRLLGNLGWAYMQ+  Y+ AEAVY+KAQM++PDAN
Subjt:  SKHSQESLDNVLIDLFKKCGRIEEQIELLKRKLRMIYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQMIDPDAN

Query:  KACNLGLCLMKQGRLNEAISVLEQVQQGQIPGSDENKAQKRAADLLTEIRSRQSLP-----DSIELLGLSIDADLLNGLEQLVNKRGPFRSKRLPVFEEI
        K+CNL +CL+KQGR  E   VL+ V + ++ G+D+ + ++RA +LL+E+ S  SLP     +  ++LG  +D D + GLE++ +    F+SKRLP+FE+I
Subjt:  KACNLGLCLMKQGRLNEAISVLEQVQQGQIPGSDENKAQKRAADLLTEIRSRQSLP-----DSIELLGLSIDADLLNGLEQLVNKRGPFRSKRLPVFEEI

Query:  SSFRDQL
        SSFR+ L
Subjt:  SSFRDQL

Q8L730 Protein SULFUR DEFICIENCY-INDUCED 21.7e-8758.66Show/hide
Query:  FHVIHKVPPGDSPYVRAKYAQLIQKDPESAIVLFWEAINSGDRVESALKDMAVVMKQLDRAQEAIHILKTFRFLCSKHSQESLDNVLIDLFKKCGRIEEQ
        ++V+HK+P GDSPYVRAK+ QL++KD E+AI LFW AI + DRV+SALKDMA++MKQ +RA+EAI  +++FR LCS+ +QESLDNVLIDL+KKCGRIEEQ
Subjt:  FHVIHKVPPGDSPYVRAKYAQLIQKDPESAIVLFWEAINSGDRVESALKDMAVVMKQLDRAQEAIHILKTFRFLCSKHSQESLDNVLIDLFKKCGRIEEQ

Query:  IELLKRKLRMIYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQMIDPDANKACNLGLCLMKQGRLNEAISVL-EQ
        +ELLK+KL MIY+GEAFNGKPT+TARSHGKKFQV+V++ETSR+LGNLGWAYMQ  +Y  AEAVY+KAQ+I+PDANKACNL  CL+KQG+ +EA S+L   
Subjt:  IELLKRKLRMIYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQMIDPDANKACNLGLCLMKQGRLNEAISVL-EQ

Query:  VQQGQIPGSDENKAQKRAADLLTEIRSRQSLPDSIELLGLSIDAD---LLNGLEQLVNK-RGPFRSKRLPVFEEISSFRDQLA
        V      GS + +   R  +LL+E++ ++    +   +   +  D   ++ GL++ V + R P+R++RLP+FEEI   RDQLA
Subjt:  VQQGQIPGSDENKAQKRAADLLTEIRSRQSLPDSIELLGLSIDAD---LLNGLEQLVNK-RGPFRSKRLPVFEEISSFRDQLA

Q9FKB3 Pantoate--beta-alanine ligase9.7e-9655.93Show/hide
Query:  QPKIITDKNQMRAWTRAMRSQGKTIAFVPTMGFLHDGHLSLIQEAHKHSQLVVVSIYVNPSQFGPSEDLSTYPSDFEGDIRKLMAVPRGIDAVFHPRNLY
        +P++I DK+ MR W+RAMRSQGKTI  VPTMG+LH+GHLSL++++   + + VVSIYVNP QF P+EDLSTYPSDF GD+ KL A+  G   VF+P+NLY
Subjt:  QPKIITDKNQMRAWTRAMRSQGKTIAFVPTMGFLHDGHLSLIQEAHKHSQLVVVSIYVNPSQFGPSEDLSTYPSDFEGDIRKLMAVPRGIDAVFHPRNLY

Query:  DYGVESAKDCVGNGSGCGGMAAVSCLEESGSGHETWVRVERLEKGMCGRSRPVFFRGVATIVAKLFNIVEPDVAVFGKKDYQQWRIIMRMGTKSCIFVRL
        DYG E+ K  + +G G GG   VSC+EE G GHETW+RVERLEKG CG+SRPVFFRGVATIV KLFNIVEPDVA+FGKKDYQQWRII RM          
Subjt:  DYGVESAKDCVGNGSGCGGMAAVSCLEESGSGHETWVRVERLEKGMCGRSRPVFFRGVATIVAKLFNIVEPDVAVFGKKDYQQWRIIMRMGTKSCIFVRL

Query:  SSNFEKLLRSILLDELVRDLDFSINIVGSEIVRDADGLAMSSRNVRLSPEERQKALSINRSLSKAKSAAESGELNCKRLKNLIVDEVREAGGELDYAESM
                        VRDL+F I IVGS+I R+ DGLAMSSRNVRLS EERQ+ALSI+RSL+ AK++   G+ NC  LK++I+ +V  + G +DY E +
Subjt:  SSNFEKLLRSILLDELVRDLDFSINIVGSEIVRDADGLAMSSRNVRLSPEERQKALSINRSLSKAKSAAESGELNCKRLKNLIVDEVREAGGELDYAESM

Query:  FWKEKQLQFFSKTRRIENIVIAIHTYCGT
           ++ L+   + +    +VI +  + GT
Subjt:  FWKEKQLQFFSKTRRIENIVIAIHTYCGT

Arabidopsis top hitse value%identityAlignment
AT1G04770.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.2e-8858.66Show/hide
Query:  FHVIHKVPPGDSPYVRAKYAQLIQKDPESAIVLFWEAINSGDRVESALKDMAVVMKQLDRAQEAIHILKTFRFLCSKHSQESLDNVLIDLFKKCGRIEEQ
        ++V+HK+P GDSPYVRAK+ QL++KD E+AI LFW AI + DRV+SALKDMA++MKQ +RA+EAI  +++FR LCS+ +QESLDNVLIDL+KKCGRIEEQ
Subjt:  FHVIHKVPPGDSPYVRAKYAQLIQKDPESAIVLFWEAINSGDRVESALKDMAVVMKQLDRAQEAIHILKTFRFLCSKHSQESLDNVLIDLFKKCGRIEEQ

Query:  IELLKRKLRMIYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQMIDPDANKACNLGLCLMKQGRLNEAISVL-EQ
        +ELLK+KL MIY+GEAFNGKPT+TARSHGKKFQV+V++ETSR+LGNLGWAYMQ  +Y  AEAVY+KAQ+I+PDANKACNL  CL+KQG+ +EA S+L   
Subjt:  IELLKRKLRMIYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQMIDPDANKACNLGLCLMKQGRLNEAISVL-EQ

Query:  VQQGQIPGSDENKAQKRAADLLTEIRSRQSLPDSIELLGLSIDAD---LLNGLEQLVNK-RGPFRSKRLPVFEEISSFRDQLA
        V      GS + +   R  +LL+E++ ++    +   +   +  D   ++ GL++ V + R P+R++RLP+FEEI   RDQLA
Subjt:  VQQGQIPGSDENKAQKRAADLLTEIRSRQSLPDSIELLGLSIDAD---LLNGLEQLVNK-RGPFRSKRLPVFEEISSFRDQLA

AT3G51280.1 Tetratricopeptide repeat (TPR)-like superfamily protein5.7e-7557.37Show/hide
Query:  GSKGKKEELFHVIHKVPPGDSPYVRAKYAQLIQKDPESAIVLFWEAINSGDRVESALKDMAVVMKQLDRAQEAIHILKTFRFLCSKHSQESLDNVLIDLF
        G    + E FH IHKVP GDSPYVRAK  QL++KDPE AI LFW+AIN+GDRV+SALKDMA+VMKQ +RA+EAI  +K+ R  CS  +QESLDN+L+DL+
Subjt:  GSKGKKEELFHVIHKVPPGDSPYVRAKYAQLIQKDPESAIVLFWEAINSGDRVESALKDMAVVMKQLDRAQEAIHILKTFRFLCSKHSQESLDNVLIDLF

Query:  KKCGRIEEQIELLKRKLRMIYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQMIDPDANKACNLGLCLMKQGRLN
        K+CGR+++QI LLK KL +I +G AFNGK T+TARS GKKFQVSV+QE +RLLGNLGWA MQ+ N++ AE  Y++A  I PD NK CNLG+CLMKQGR++
Subjt:  KKCGRIEEQIELLKRKLRMIYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQMIDPDANKACNLGLCLMKQGRLN

Query:  EAISVLEQVQQGQIPG----SDENKAQKRAADLLTEIRS---RQSLPDSIE
        EA   L +V+   + G        KA +RA  +L ++ S   R+   D +E
Subjt:  EAISVLEQVQQGQIPG----SDENKAQKRAADLLTEIRS---RQSLPDSIE

AT4G20900.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.4e-6047.15Show/hide
Query:  SKGKKEELFHVIHKVPPGDSPYVRAKYAQLIQKDPESAIVLFWEAINSGDRVESALKDMAVVMKQLDRAQEAIHILKTFRFLCSKHSQESLDNVLIDLFK
        S  ++ + FH++HKVP GDSPYVRAK+AQLI KDP  AI LFW AIN+GDRV+SALKDMAVVMKQL R+ E I  +K+FR+LCS  SQ+S+DN+L++L+K
Subjt:  SKGKKEELFHVIHKVPPGDSPYVRAKYAQLIQKDPESAIVLFWEAINSGDRVESALKDMAVVMKQLDRAQEAIHILKTFRFLCSKHSQESLDNVLIDLFK

Query:  KCGRIEEQIELLKRKLRMIYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYK----------------KAQMIDPDANK
        K GRIEE+  LL+ KL+ + +G  F G+ +R  R  GK   ++++QE +R+LGNLGW ++Q  NY +AE  Y+                +A  ++ D NK
Subjt:  KCGRIEEQIELLKRKLRMIYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYK----------------KAQMIDPDANK

Query:  ACNLGLCLMKQGRLNEAISVLEQVQQGQIPG--SDE--NKAQKRAADLLTEIRSRQSLPDSIE
         CNL +CLM+  R+ EA S+L+ V+         DE   K+  RA ++L EI S++   D  E
Subjt:  ACNLGLCLMKQGRLNEAISVLEQVQQGQIPG--SDE--NKAQKRAADLLTEIRSRQSLPDSIE

AT5G48840.1 homolog of bacterial PANC6.9e-9755.93Show/hide
Query:  QPKIITDKNQMRAWTRAMRSQGKTIAFVPTMGFLHDGHLSLIQEAHKHSQLVVVSIYVNPSQFGPSEDLSTYPSDFEGDIRKLMAVPRGIDAVFHPRNLY
        +P++I DK+ MR W+RAMRSQGKTI  VPTMG+LH+GHLSL++++   + + VVSIYVNP QF P+EDLSTYPSDF GD+ KL A+  G   VF+P+NLY
Subjt:  QPKIITDKNQMRAWTRAMRSQGKTIAFVPTMGFLHDGHLSLIQEAHKHSQLVVVSIYVNPSQFGPSEDLSTYPSDFEGDIRKLMAVPRGIDAVFHPRNLY

Query:  DYGVESAKDCVGNGSGCGGMAAVSCLEESGSGHETWVRVERLEKGMCGRSRPVFFRGVATIVAKLFNIVEPDVAVFGKKDYQQWRIIMRMGTKSCIFVRL
        DYG E+ K  + +G G GG   VSC+EE G GHETW+RVERLEKG CG+SRPVFFRGVATIV KLFNIVEPDVA+FGKKDYQQWRII RM          
Subjt:  DYGVESAKDCVGNGSGCGGMAAVSCLEESGSGHETWVRVERLEKGMCGRSRPVFFRGVATIVAKLFNIVEPDVAVFGKKDYQQWRIIMRMGTKSCIFVRL

Query:  SSNFEKLLRSILLDELVRDLDFSINIVGSEIVRDADGLAMSSRNVRLSPEERQKALSINRSLSKAKSAAESGELNCKRLKNLIVDEVREAGGELDYAESM
                        VRDL+F I IVGS+I R+ DGLAMSSRNVRLS EERQ+ALSI+RSL+ AK++   G+ NC  LK++I+ +V  + G +DY E +
Subjt:  SSNFEKLLRSILLDELVRDLDFSINIVGSEIVRDADGLAMSSRNVRLSPEERQKALSINRSLSKAKSAAESGELNCKRLKNLIVDEVREAGGELDYAESM

Query:  FWKEKQLQFFSKTRRIENIVIAIHTYCGT
           ++ L+   + +    +VI +  + GT
Subjt:  FWKEKQLQFFSKTRRIENIVIAIHTYCGT

AT5G48850.1 Tetratricopeptide repeat (TPR)-like superfamily protein7.1e-10262.21Show/hide
Query:  EKSKEREEMEGRLKIKGSKGKKEELFHVIHKVPPGDSPYVRAKYAQLIQKDPESAIVLFWEAINSGDRVESALKDMAVVMKQLDRAQEAIHILKTFRFLC
        E+S ++ +      IK +  K +ELFHVIHKVP GD+PYVRAK+AQLI+K+PE AIV FW+AIN+GDRV+SALKDMAVVMKQLDR++EAI  +K+FR  C
Subjt:  EKSKEREEMEGRLKIKGSKGKKEELFHVIHKVPPGDSPYVRAKYAQLIQKDPESAIVLFWEAINSGDRVESALKDMAVVMKQLDRAQEAIHILKTFRFLC

Query:  SKHSQESLDNVLIDLFKKCGRIEEQIELLKRKLRMIYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQMIDPDAN
        SK+SQ+SLDNVLIDL+KKCGR+EEQ+ELLKRKLR IY+GEAFNGKPT+TARSHGKKFQV+V+QE SRLLGNLGWAYMQ+  Y+ AEAVY+KAQM++PDAN
Subjt:  SKHSQESLDNVLIDLFKKCGRIEEQIELLKRKLRMIYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQMIDPDAN

Query:  KACNLGLCLMKQGRLNEAISVLEQVQQGQIPGSDENKAQKRAADLLTEIRSRQSLP-----DSIELLGLSIDADLLNGLEQLVNKRGPFRSKRLPVFEEI
        K+CNL +CL+KQGR  E   VL+ V + ++ G+D+ + ++RA +LL+E+ S  SLP     +  ++LG  +D D + GLE++ +    F+SKRLP+FE+I
Subjt:  KACNLGLCLMKQGRLNEAISVLEQVQQGQIPGSDENKAQKRAADLLTEIRSRQSLP-----DSIELLGLSIDADLLNGLEQLVNKRGPFRSKRLPVFEEI

Query:  SSFRDQL
        SSFR+ L
Subjt:  SSFRDQL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTTTAGAAGAAGAAAAATCAAAAGAGAGAGAGGAAATGGAAGGAAGGCTAAAAATTAAGGGCTCAAAAGGGAAAAAAGAAGAACTCTTTCATGTCATTCATAAGGT
TCCACCTGGTGATAGTCCTTATGTTAGAGCAAAATATGCTCAGTTGATACAGAAGGATCCCGAGAGTGCAATTGTATTATTTTGGGAAGCAATAAATAGTGGGGACAGAG
TAGAAAGTGCACTAAAAGATATGGCTGTGGTTATGAAGCAACTTGATAGAGCACAAGAAGCCATTCATATCCTCAAAACCTTTAGATTTCTTTGCTCAAAACACTCTCAA
GAATCTCTTGACAATGTTCTTATCGATTTGTTTAAGAAATGTGGGAGAATTGAAGAGCAAATTGAGCTGTTGAAGAGAAAATTGAGAATGATTTATGAAGGAGAAGCTTT
TAATGGAAAGCCAACAAGAACAGCTCGTTCTCATGGCAAGAAATTTCAGGTTTCTGTCAAACAAGAAACCTCTAGATTATTGGGAAATCTTGGGTGGGCCTACATGCAAA
AGCCCAATTACATGATGGCTGAAGCAGTGTACAAGAAGGCCCAAATGATAGATCCAGACGCAAACAAGGCCTGCAATTTGGGCCTTTGTTTAATGAAACAGGGCCGTCTC
AATGAGGCCATTTCGGTTCTCGAACAAGTCCAGCAAGGCCAAATTCCGGGCTCAGATGAAAACAAAGCCCAAAAACGGGCAGCAGATTTGCTGACCGAAATCAGGTCAAG
GCAATCTCTGCCCGATTCTATTGAACTATTGGGCCTCAGTATCGATGCTGATTTGCTTAATGGGCTTGAGCAATTGGTCAACAAAAGAGGCCCATTTAGGTCCAAGAGGC
TCCCTGTTTTTGAGGAAATTTCTTCATTTAGGGATCAATTAGCTTATAAAGAAAAGTGGAGGTCAAGTGGAATGGTCAAAGGAAATGTTTGTATTATTCTGTTAATTGTG
TATTTACAGTTTGGAGAAAGATGTAAATTTAGTATTAGGTTCGATTCTCCAATGGCGGGGAAGGAGAAGCAGCCGAAGATAATCACAGACAAGAACCAGATGAGGGCATG
GACGAGAGCCATGAGATCTCAAGGCAAAACCATCGCCTTCGTTCCCACCATGGGATTCCTTCACGACGGCCATCTCTCTCTAATCCAAGAAGCTCACAAGCACTCACAAC
TCGTCGTCGTTTCAATCTATGTAAACCCTAGCCAATTTGGCCCTTCTGAGGACCTCTCAACCTATCCTTCAGATTTCGAGGGCGACATTCGGAAGCTCATGGCTGTTCCT
CGAGGAATTGATGCCGTTTTCCATCCTCGCAATCTCTACGACTATGGAGTTGAATCAGCTAAGGATTGTGTCGGAAATGGCAGCGGTTGTGGTGGAATGGCGGCGGTTTC
TTGCTTGGAGGAGTCTGGTTCAGGGCATGAGACATGGGTGAGGGTCGAGCGTTTGGAGAAGGGGATGTGTGGGAGAAGCAGGCCTGTTTTCTTCAGAGGGGTTGCTACCA
TTGTTGCCAAGTTGTTCAATATTGTGGAGCCTGATGTTGCAGTGTTTGGGAAGAAAGATTATCAGCAATGGCGGATTATCATGCGGATGGGTACTAAATCTTGTATATTT
GTAAGACTATCTTCAAATTTTGAAAAGCTGCTTCGATCAATACTATTAGACGAACTGGTTCGAGATCTTGATTTTTCTATAAACATTGTGGGGTCTGAAATCGTGCGCGA
CGCCGATGGTCTTGCAATGAGTTCTCGCAATGTGCGGCTCTCACCTGAAGAAAGACAGAAGGCATTGTCTATAAACAGGTCATTGTCAAAAGCAAAATCTGCAGCAGAAA
GTGGTGAACTCAATTGCAAAAGATTAAAGAACCTGATTGTTGATGAAGTACGGGAAGCCGGTGGAGAACTTGATTATGCTGAGTCTATGTTTTGGAAGGAAAAACAACTT
CAATTCTTCTCAAAAACGAGAAGAATAGAAAACATCGTTATTGCTATCCACACATATTGTGGAACAAGAGAGTTTGGAGGTGGTGGAGGAGATCAAGAGTCCAGTTGTGA
TTTTGATTGCTGCCTTGTTTGGCAAAATTCTAAGTTAACTCATCCCTCTGCAGCACTTCAATCAAATCCTCCCCTTAAAGACTTATTTCCACCCTTCATTCTTCCACAAA
CAAAGGAGGAGGAAAAGAAGAAGTTTGAGCAGAAGAAGTCTAAGGAGAAGGAGGAAAAGAAGAAGATGTTTGAAAAGAAGAAGAAGTCTGACAAGATATATTTGTACACA
TTGCCGATCAAATTAAAATGA
mRNA sequenceShow/hide mRNA sequence
GGATGTTGTCTTCATTATCTCTTTTTTGCTAATTTATAAGAGTTTTTCCATTTTCAAAGGAAGAGAGAGAGCTTCATTTTTGCAGCTAATTTTAAATTTGGGAATTAGGG
TTTCCATTGAAGAAGAAGAACGAAGAGAATGGTTTTAGAAGAAGAAAAATCAAAAGAGAGAGAGGAAATGGAAGGAAGGCTAAAAATTAAGGGCTCAAAAGGGAAAAAAG
AAGAACTCTTTCATGTCATTCATAAGGTTCCACCTGGTGATAGTCCTTATGTTAGAGCAAAATATGCTCAGTTGATACAGAAGGATCCCGAGAGTGCAATTGTATTATTT
TGGGAAGCAATAAATAGTGGGGACAGAGTAGAAAGTGCACTAAAAGATATGGCTGTGGTTATGAAGCAACTTGATAGAGCACAAGAAGCCATTCATATCCTCAAAACCTT
TAGATTTCTTTGCTCAAAACACTCTCAAGAATCTCTTGACAATGTTCTTATCGATTTGTTTAAGAAATGTGGGAGAATTGAAGAGCAAATTGAGCTGTTGAAGAGAAAAT
TGAGAATGATTTATGAAGGAGAAGCTTTTAATGGAAAGCCAACAAGAACAGCTCGTTCTCATGGCAAGAAATTTCAGGTTTCTGTCAAACAAGAAACCTCTAGATTATTG
GGAAATCTTGGGTGGGCCTACATGCAAAAGCCCAATTACATGATGGCTGAAGCAGTGTACAAGAAGGCCCAAATGATAGATCCAGACGCAAACAAGGCCTGCAATTTGGG
CCTTTGTTTAATGAAACAGGGCCGTCTCAATGAGGCCATTTCGGTTCTCGAACAAGTCCAGCAAGGCCAAATTCCGGGCTCAGATGAAAACAAAGCCCAAAAACGGGCAG
CAGATTTGCTGACCGAAATCAGGTCAAGGCAATCTCTGCCCGATTCTATTGAACTATTGGGCCTCAGTATCGATGCTGATTTGCTTAATGGGCTTGAGCAATTGGTCAAC
AAAAGAGGCCCATTTAGGTCCAAGAGGCTCCCTGTTTTTGAGGAAATTTCTTCATTTAGGGATCAATTAGCTTATAAAGAAAAGTGGAGGTCAAGTGGAATGGTCAAAGG
AAATGTTTGTATTATTCTGTTAATTGTGTATTTACAGTTTGGAGAAAGATGTAAATTTAGTATTAGGTTCGATTCTCCAATGGCGGGGAAGGAGAAGCAGCCGAAGATAA
TCACAGACAAGAACCAGATGAGGGCATGGACGAGAGCCATGAGATCTCAAGGCAAAACCATCGCCTTCGTTCCCACCATGGGATTCCTTCACGACGGCCATCTCTCTCTA
ATCCAAGAAGCTCACAAGCACTCACAACTCGTCGTCGTTTCAATCTATGTAAACCCTAGCCAATTTGGCCCTTCTGAGGACCTCTCAACCTATCCTTCAGATTTCGAGGG
CGACATTCGGAAGCTCATGGCTGTTCCTCGAGGAATTGATGCCGTTTTCCATCCTCGCAATCTCTACGACTATGGAGTTGAATCAGCTAAGGATTGTGTCGGAAATGGCA
GCGGTTGTGGTGGAATGGCGGCGGTTTCTTGCTTGGAGGAGTCTGGTTCAGGGCATGAGACATGGGTGAGGGTCGAGCGTTTGGAGAAGGGGATGTGTGGGAGAAGCAGG
CCTGTTTTCTTCAGAGGGGTTGCTACCATTGTTGCCAAGTTGTTCAATATTGTGGAGCCTGATGTTGCAGTGTTTGGGAAGAAAGATTATCAGCAATGGCGGATTATCAT
GCGGATGGGTACTAAATCTTGTATATTTGTAAGACTATCTTCAAATTTTGAAAAGCTGCTTCGATCAATACTATTAGACGAACTGGTTCGAGATCTTGATTTTTCTATAA
ACATTGTGGGGTCTGAAATCGTGCGCGACGCCGATGGTCTTGCAATGAGTTCTCGCAATGTGCGGCTCTCACCTGAAGAAAGACAGAAGGCATTGTCTATAAACAGGTCA
TTGTCAAAAGCAAAATCTGCAGCAGAAAGTGGTGAACTCAATTGCAAAAGATTAAAGAACCTGATTGTTGATGAAGTACGGGAAGCCGGTGGAGAACTTGATTATGCTGA
GTCTATGTTTTGGAAGGAAAAACAACTTCAATTCTTCTCAAAAACGAGAAGAATAGAAAACATCGTTATTGCTATCCACACATATTGTGGAACAAGAGAGTTTGGAGGTG
GTGGAGGAGATCAAGAGTCCAGTTGTGATTTTGATTGCTGCCTTGTTTGGCAAAATTCTAAGTTAACTCATCCCTCTGCAGCACTTCAATCAAATCCTCCCCTTAAAGAC
TTATTTCCACCCTTCATTCTTCCACAAACAAAGGAGGAGGAAAAGAAGAAGTTTGAGCAGAAGAAGTCTAAGGAGAAGGAGGAAAAGAAGAAGATGTTTGAAAAGAAGAA
GAAGTCTGACAAGATATATTTGTACACATTGCCGATCAAATTAAAATGA
Protein sequenceShow/hide protein sequence
MVLEEEKSKEREEMEGRLKIKGSKGKKEELFHVIHKVPPGDSPYVRAKYAQLIQKDPESAIVLFWEAINSGDRVESALKDMAVVMKQLDRAQEAIHILKTFRFLCSKHSQ
ESLDNVLIDLFKKCGRIEEQIELLKRKLRMIYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQMIDPDANKACNLGLCLMKQGRL
NEAISVLEQVQQGQIPGSDENKAQKRAADLLTEIRSRQSLPDSIELLGLSIDADLLNGLEQLVNKRGPFRSKRLPVFEEISSFRDQLAYKEKWRSSGMVKGNVCIILLIV
YLQFGERCKFSIRFDSPMAGKEKQPKIITDKNQMRAWTRAMRSQGKTIAFVPTMGFLHDGHLSLIQEAHKHSQLVVVSIYVNPSQFGPSEDLSTYPSDFEGDIRKLMAVP
RGIDAVFHPRNLYDYGVESAKDCVGNGSGCGGMAAVSCLEESGSGHETWVRVERLEKGMCGRSRPVFFRGVATIVAKLFNIVEPDVAVFGKKDYQQWRIIMRMGTKSCIF
VRLSSNFEKLLRSILLDELVRDLDFSINIVGSEIVRDADGLAMSSRNVRLSPEERQKALSINRSLSKAKSAAESGELNCKRLKNLIVDEVREAGGELDYAESMFWKEKQL
QFFSKTRRIENIVIAIHTYCGTREFGGGGGDQESSCDFDCCLVWQNSKLTHPSAALQSNPPLKDLFPPFILPQTKEEEKKKFEQKKSKEKEEKKKMFEKKKKSDKIYLYT
LPIKLK