; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh19G001630 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh19G001630
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Descriptionprotein SULFUR DEFICIENCY-INDUCED 1-like isoform X1
Genome locationCmo_Chr19:1023231..1034099
RNA-Seq ExpressionCmoCh19G001630
SyntenyCmoCh19G001630
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002775 - DNA/RNA-binding protein Alba-like
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR019734 - Tetratricopeptide repeat
IPR036882 - Alba-like domain superfamily
IPR044961 - Tetratricopeptide repeat protein POLLENLESS 3/SULFUR DEFICIENCY-INDUCED 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6571577.1 Protein SULFUR DEFICIENCY-INDUCED 1, partial [Cucurbita argyrosperma subsp. sororia]3.9e-30999.64Show/hide
Query:  MDRYQKVEKPKPESPINENEIRITSQGAIRNYITYASTLLQDKRVREIVLKAMGQAISKTVSIAEILKKRTPQLHQETSISSVSIIDVWEPIEEGLVPVE
        MDRYQKVEKPKPESPINENEIRITSQGAIRNYITYASTLLQDKRVREIVLKAMGQAISKTVSIAEILKKRTPQLHQETSISSVSIIDVWEPIEEGLVPVE
Subjt:  MDRYQKVEKPKPESPINENEIRITSQGAIRNYITYASTLLQDKRVREIVLKAMGQAISKTVSIAEILKKRTPQLHQETSISSVSIIDVWEPIEEGLVPVE

Query:  MTRHVSMISITLSNRELNKDSPGYQAPQFVEQPKQQYKQQQQQQPRQSRPYYNAVNEDPYGQSRGRGRGGGRGRGWGRGGYGNYQGGYGNYQGGYGHYQG
        MTRHVSMISITLSNRELNKDSPGYQAPQFVEQPKQQYKQQQQQQPRQSRPYYNAVNEDPYGQSRGRGRGGGRGRGWGRGGYGNYQGGYGNYQGGYGHYQG
Subjt:  MTRHVSMISITLSNRELNKDSPGYQAPQFVEQPKQQYKQQQQQQPRQSRPYYNAVNEDPYGQSRGRGRGGGRGRGWGRGGYGNYQGGYGNYQGGYGHYQG

Query:  GYGHYQGGYGHYQGGSGGYQGGGPGGYQGNYQDNGGYSNWSRGGGRGRGWTYRGTGYDRGRGGRGYGRGRGRGQIDGQTRDNGGSRNQVPPGDTPYVRAK
        GYGHYQGGYGHYQGGSGGYQGGGPGGYQGNYQDNGGYSNWSRGGGRGRGWTYRGTGYDRGRGGRGYGRGRGRGQIDGQTRDNGGSR QVPPGDTPYVRAK
Subjt:  GYGHYQGGYGHYQGGSGGYQGGGPGGYQGNYQDNGGYSNWSRGGGRGRGWTYRGTGYDRGRGGRGYGRGRGRGQIDGQTRDNGGSRNQVPPGDTPYVRAK

Query:  YAQLIEKDPESAISLFWEAINAGDRVESALKDMAVVMKQIDRAEEAIHILQTYRFLCSKHSQQSLDNVLIDLFKKCGRIEEQIEMLKRKLRKIYEGEAFN
        YAQLIEKDPESAISLFWEAINAGDRVESALKDMAVVMKQIDRAEEAIHILQTYRFLCSKHSQQSLDNVLIDLFKKCGRIEEQIEMLKRKLRKIYEGEAFN
Subjt:  YAQLIEKDPESAISLFWEAINAGDRVESALKDMAVVMKQIDRAEEAIHILQTYRFLCSKHSQQSLDNVLIDLFKKCGRIEEQIEMLKRKLRKIYEGEAFN

Query:  GKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQIIDPDANKACNLGLCLMKQGRLNEAISVLQQVQQGRIPGSDEIKAQKRAG
        GKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQIIDPDANKACNLGLCLMKQGRLNEAISVLQQVQQG+IPGSDEIKAQKRAG
Subjt:  GKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQIIDPDANKACNLGLCLMKQGRLNEAISVLQQVQQGRIPGSDEIKAQKRAG

Query:  DLLTQIRSRQSLPDSIELLGLSIDGDLLNGLEQLVHERGPFRSKRLPVFEEISSFRDQLACN
        DLLTQIRSRQSLPDSIELLGLSIDGDLLNGLEQLVHERGPFRSKRLPVFEEISSFRDQLACN
Subjt:  DLLTQIRSRQSLPDSIELLGLSIDGDLLNGLEQLVHERGPFRSKRLPVFEEISSFRDQLACN

KAG7011320.1 Protein SULFUR DEFICIENCY-INDUCED 1, partial [Cucurbita argyrosperma subsp. argyrosperma]4.8e-15098.91Show/hide
Query:  NQVPPGDTPYVRAKYAQLIEKDPESAISLFWEAINAGDRVESALKDMAVVMKQIDRAEEAIHILQTYRFLCSKHSQQSLDNVLIDLFKKCGRIEEQIEML
        ++VPPGDTPYVRAKYAQLIEKDPESAISLFWEAINAGDRVESALKDMAVVMKQIDRAEEAIHILQTYRFLCSKHSQQSLDNVLIDLFKKCGRIEEQIEML
Subjt:  NQVPPGDTPYVRAKYAQLIEKDPESAISLFWEAINAGDRVESALKDMAVVMKQIDRAEEAIHILQTYRFLCSKHSQQSLDNVLIDLFKKCGRIEEQIEML

Query:  KRKLRKIYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQIIDPDANKACNLGLCLMKQGRLNEAISVLQQVQQGR
        KRKLRKIYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQIIDPDANKACNLGLCLMKQGRLNEAISVLQQVQQG+
Subjt:  KRKLRKIYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQIIDPDANKACNLGLCLMKQGRLNEAISVLQQVQQGR

Query:  IPGSDEIKAQKRAGDLLTQIRSRQSLPDSIELLGLSIDGDLLNGLEQLVHERGPFRSKRLPVFEEISSFRDQLACN
        IPGSDEIKAQKRAGDLLTQIRSRQSLPDSIELLGLSIDGDLLNGLEQLVHERGPFRSKRLPVFEEISSFRDQLACN
Subjt:  IPGSDEIKAQKRAGDLLTQIRSRQSLPDSIELLGLSIDGDLLNGLEQLVHERGPFRSKRLPVFEEISSFRDQLACN

XP_022963612.1 protein FAM98A isoform X3 [Cucurbita moschata]1.2e-148100Show/hide
Query:  MDRYQKVEKPKPESPINENEIRITSQGAIRNYITYASTLLQDKRVREIVLKAMGQAISKTVSIAEILKKRTPQLHQETSISSVSIIDVWEPIEEGLVPVE
        MDRYQKVEKPKPESPINENEIRITSQGAIRNYITYASTLLQDKRVREIVLKAMGQAISKTVSIAEILKKRTPQLHQETSISSVSIIDVWEPIEEGLVPVE
Subjt:  MDRYQKVEKPKPESPINENEIRITSQGAIRNYITYASTLLQDKRVREIVLKAMGQAISKTVSIAEILKKRTPQLHQETSISSVSIIDVWEPIEEGLVPVE

Query:  MTRHVSMISITLSNRELNKDSPGYQAPQFVEQPKQQYKQQQQQQPRQSRPYYNAVNEDPYGQSRGRGRGGGRGRGWGRGGYGNYQGGYGNYQGGYGHYQG
        MTRHVSMISITLSNRELNKDSPGYQAPQFVEQPKQQYKQQQQQQPRQSRPYYNAVNEDPYGQSRGRGRGGGRGRGWGRGGYGNYQGGYGNYQGGYGHYQG
Subjt:  MTRHVSMISITLSNRELNKDSPGYQAPQFVEQPKQQYKQQQQQQPRQSRPYYNAVNEDPYGQSRGRGRGGGRGRGWGRGGYGNYQGGYGNYQGGYGHYQG

Query:  GYGHYQGGYGHYQGGSGGYQGGGPGGYQGNYQDNGGYSNWSRGGGRGRGWTYRGTGYDRGRGGRGYGRGRGRGQIDGQTRDNGGSRNQV
        GYGHYQGGYGHYQGGSGGYQGGGPGGYQGNYQDNGGYSNWSRGGGRGRGWTYRGTGYDRGRGGRGYGRGRGRGQIDGQTRDNGGSRNQV
Subjt:  GYGHYQGGYGHYQGGSGGYQGGGPGGYQGNYQDNGGYSNWSRGGGRGRGWTYRGTGYDRGRGGRGYGRGRGRGQIDGQTRDNGGSRNQV

XP_022963805.1 protein SULFUR DEFICIENCY-INDUCED 1-like isoform X1 [Cucurbita moschata]2.1e-15099.28Show/hide
Query:  NQVPPGDTPYVRAKYAQLIEKDPESAISLFWEAINAGDRVESALKDMAVVMKQIDRAEEAIHILQTYRFLCSKHSQQSLDNVLIDLFKKCGRIEEQIEML
        ++VPPGDTPYVRAKYAQLIEKDPESAISLFWEAINAGDRVESALKDMAVVMKQIDRAEEAIHILQTYRFLCSKHSQQSLDNVLIDLFKKCGRIEEQIEML
Subjt:  NQVPPGDTPYVRAKYAQLIEKDPESAISLFWEAINAGDRVESALKDMAVVMKQIDRAEEAIHILQTYRFLCSKHSQQSLDNVLIDLFKKCGRIEEQIEML

Query:  KRKLRKIYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQIIDPDANKACNLGLCLMKQGRLNEAISVLQQVQQGR
        KRKLRKIYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQIIDPDANKACNLGLCLMKQGRLNEAISVLQQVQQGR
Subjt:  KRKLRKIYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQIIDPDANKACNLGLCLMKQGRLNEAISVLQQVQQGR

Query:  IPGSDEIKAQKRAGDLLTQIRSRQSLPDSIELLGLSIDGDLLNGLEQLVHERGPFRSKRLPVFEEISSFRDQLACN
        IPGSDEIKAQKRAGDLLTQIRSRQSLPDSIELLGLSIDGDLLNGLEQLVHERGPFRSKRLPVFEEISSFRDQLACN
Subjt:  IPGSDEIKAQKRAGDLLTQIRSRQSLPDSIELLGLSIDGDLLNGLEQLVHERGPFRSKRLPVFEEISSFRDQLACN

XP_022967303.1 protein SULFUR DEFICIENCY-INDUCED 1-like isoform X1 [Cucurbita maxima]1.1e-14696.74Show/hide
Query:  NQVPPGDTPYVRAKYAQLIEKDPESAISLFWEAINAGDRVESALKDMAVVMKQIDRAEEAIHILQTYRFLCSKHSQQSLDNVLIDLFKKCGRIEEQIEML
        ++VPPGDTPYVRAKYAQLI+KDPESAISLFWEAINAGDRVESALKDMAVVMKQIDRAEEAI IL+TYRFLCSKHSQ+SLDNVLIDLFKKCGRIEEQIE+L
Subjt:  NQVPPGDTPYVRAKYAQLIEKDPESAISLFWEAINAGDRVESALKDMAVVMKQIDRAEEAIHILQTYRFLCSKHSQQSLDNVLIDLFKKCGRIEEQIEML

Query:  KRKLRKIYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQIIDPDANKACNLGLCLMKQGRLNEAISVLQQVQQGR
        KRKLRKIYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPN+MMAEAVYKKAQIIDPDANKACNLGLCLMKQGRLNEAISVLQQVQQG 
Subjt:  KRKLRKIYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQIIDPDANKACNLGLCLMKQGRLNEAISVLQQVQQGR

Query:  IPGSDEIKAQKRAGDLLTQIRSRQSLPDSIELLGLSIDGDLLNGLEQLVHERGPFRSKRLPVFEEISSFRDQLACN
        IPGSDEIKAQKRAGDLLTQIRSRQSLPDSIELLGLSIDGDLLNGLEQLVHERGPFRSKRLPVFEEISSFRDQLACN
Subjt:  IPGSDEIKAQKRAGDLLTQIRSRQSLPDSIELLGLSIDGDLLNGLEQLVHERGPFRSKRLPVFEEISSFRDQLACN

TrEMBL top hitse value%identityAlignment
A0A1R3J0F8 Tetratricopeptide TPR-16.5e-14556.1Show/hide
Query:  MDRYQKVEKPKPESPINENEIRITSQGAIRNYITYASTLLQDKRVREIVLKAMGQAISKTVSIAEILKKRTPQLHQETSISSVSIIDVWEPIEEGLVPVE
        MDRYQ+VEK + E PINENEIRIT+QG IRNYITYA +LLQ+K+   IVLKAMGQAISKTV+IAEI+KKR P LHQETSISS+SI D+WEP+EEGLVP+E
Subjt:  MDRYQKVEKPKPESPINENEIRITSQGAIRNYITYASTLLQDKRVREIVLKAMGQAISKTVSIAEILKKRTPQLHQETSISSVSIIDVWEPIEEGLVPVE

Query:  MTRHVSMISITLSNRELNKDSPGYQAPQFVEQPKQQYKQQQQQQPRQSRPYYNAVNEDPYGQSRGRGRGGGRGRGWGRGGYGNY--QGGYGNYQGGYGHY
        MTR VS+ISI+LS + LNK+SPGYQ+P   EQPK QY+  QQQ+   +   +   NED  G+ + RGRG GRGRGWGR  Y NY   GGY N   G G  
Subjt:  MTRHVSMISITLSNRELNKDSPGYQAPQFVEQPKQQYKQQQQQQPRQSRPYYNAVNEDPYGQSRGRGRGGGRGRGWGRGGYGNY--QGGYGNYQGGYGHY

Query:  QGGYGHYQGGYGHYQGGSGGYQGG----GPGGYQGNYQDNGGYSNWSRGGGRGRGWTYRGTGYDRGRGGRGYGRGRGRGQIDGQTRDNGGS------RNQ
        +G       GY  YQ     +        P      Y     +S                  Y+         RG     +D    ++  S       ++
Subjt:  QGGYGHYQGGYGHYQGGSGGYQGG----GPGGYQGNYQDNGGYSNWSRGGGRGRGWTYRGTGYDRGRGGRGYGRGRGRGQIDGQTRDNGGS------RNQ

Query:  VPPGDTPYVRAKYAQLIEKDPESAISLFWEAINAGDRVESALKDMAVVMKQIDRAEEAIHILQTYRFLCSKHSQQSLDNVLIDLFKKCGRIEEQIEMLKR
        +PPGD+PYVRAK+ QL++KDPE+A+ LFW+AINAGDRV+SALKDMAVV+KQ DRAEEAI  ++++R  CSK +Q+SLDNVLIDL+KKCGRI+EQI++LK+
Subjt:  VPPGDTPYVRAKYAQLIEKDPESAISLFWEAINAGDRVESALKDMAVVMKQIDRAEEAIHILQTYRFLCSKHSQQSLDNVLIDLFKKCGRIEEQIEMLKR

Query:  KLRKIYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQIIDPDANKACNLGLCLMKQGRLNEAISVLQQVQQGRIP
        KLR IY+G AFNGKPT+TARSHGKKFQV++KQETSR+LGNLGWAYMQ+ NY+ AEAVY+KAQIIDPDANKACNL  CL+KQ R  EA S+L+ V Q ++P
Subjt:  KLRKIYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQIIDPDANKACNLGLCLMKQGRLNEAISVLQQVQQGRIP

Query:  GSDEIKAQKRAGDLLTQIRSRQSLP-DSIELLGLSIDGDLLNGLEQLVHERGPFRSKRLPVFEEISSFRDQLAC
        GS + K++ R  +L+ ++ S+Q +   S   +GL+++   L GL+QL+ +  P+RS+RLP+FEEISSFRDQLAC
Subjt:  GSDEIKAQKRAGDLLTQIRSRQSLP-DSIELLGLSIDGDLLNGLEQLVHERGPFRSKRLPVFEEISSFRDQLAC

A0A3Q7I4K4 Alba domain-containing protein2.3e-15853.92Show/hide
Query:  MDRYQKVEKPKPESPINENEIRITSQGAIRNYITYASTLLQDKRVREIVLKAMGQAISKTVSIAEILKKRTPQLHQETSISSVSIIDVWEPIEEGLVPVE
        MDRYQKVEKPKPESPINE EIRITSQG +RNYI YA+TLLQ++  +EIVLKAMGQAISK V+IAEI+K+R P LHQ+T+ISSVSI DVWEPIEEGL+PVE
Subjt:  MDRYQKVEKPKPESPINENEIRITSQGAIRNYITYASTLLQDKRVREIVLKAMGQAISKTVSIAEILKKRTPQLHQETSISSVSIIDVWEPIEEGLVPVE

Query:  MTRHVSMISITLSNRELNKDSPGYQAPQFVEQPKQQYK----QQQQQQPRQSRPYYNAVNEDPYGQSRGRGRGGGRGRGWGRGGYGNYQ--GGYGNYQGG
         TRHVS+ISITLS +ELN +SPGYQAP  VEQ K QY     Q QQQ PRQ+R  Y A NED YG  RGRGRG GRGRGW RGGYGNYQ  G Y N    
Subjt:  MTRHVSMISITLSNRELNKDSPGYQAPQFVEQPKQQYK----QQQQQQPRQSRPYYNAVNEDPYGQSRGRGRGGGRGRGWGRGGYGNYQ--GGYGNYQGG

Query:  YGHYQGGY---GHYQGGYGHYQGGSGGY--QGGGPGGYQGNYQDNGGYSNWSRGGGRGRGWTYRGTGY-DRGRGGRGYGR--------------------
        +G    GY   G   GG  +   G+GGY  +G G GGY    + NGGYSNW    G    W     GY +RGRG  G  +                    
Subjt:  YGHYQGGY---GHYQGGYGHYQGGSGGY--QGGGPGGYQGNYQDNGGYSNWSRGGGRGRGWTYRGTGY-DRGRGGRGYGR--------------------

Query:  ----------------GRGR-----------------GQIDGQ---------------------------TRDNGGSRN----QVP-------PGDTPYV
                        G G+                  ++DG+                           TR + G  +    Q+P         D PYV
Subjt:  ----------------GRGR-----------------GQIDGQ---------------------------TRDNGGSRN----QVP-------PGDTPYV

Query:  RAKYAQLIEKDPESAISLFWEAINAGDRVESALKDMAVVMKQIDRAEEAIHILQTYRFLCSKHSQQSLDNVLIDLFKKCGRIEEQIEMLKRKLRKIYEGE
        RAKYAQ++EK+PE+AI  FW+AIN GDRV+SALKDMA+VMKQ+DR+EEAI  + ++R LCSK +QQSLDNVL+DLFKKCG+++EQI +LK+KLR+IY+G+
Subjt:  RAKYAQLIEKDPESAISLFWEAINAGDRVESALKDMAVVMKQIDRAEEAIHILQTYRFLCSKHSQQSLDNVLIDLFKKCGRIEEQIEMLKRKLRKIYEGE

Query:  AFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQIIDPDANKACNLGLCLMKQGRLNEAISVLQQVQQGRIPGSDEIKAQK
         FNGKP++TARSHGKKFQVSV+QET+R+LGNLGWAY QK N+M AE VYKKAQIID D+NKACNL  CL+KQ R +EA ++L++V +G   GS++ K +K
Subjt:  AFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQIIDPDANKACNLGLCLMKQGRLNEAISVLQQVQQGRIPGSDEIKAQK

Query:  RAGDLLTQIRSRQSLPDSIELLGLSIDGDLLNGLEQLVHERGPFRSKRLPVFEEISSFRDQLAC
        R  +LL ++ SRQ  P    L GL++D D +NGLE L++E  P +S+RLP+FEEIS+F+DQLAC
Subjt:  RAGDLLTQIRSRQSLPDSIELLGLSIDGDLLNGLEQLVHERGPFRSKRLPVFEEISSFRDQLAC

A0A6J1HFQ0 protein FAM98A isoform X35.7e-149100Show/hide
Query:  MDRYQKVEKPKPESPINENEIRITSQGAIRNYITYASTLLQDKRVREIVLKAMGQAISKTVSIAEILKKRTPQLHQETSISSVSIIDVWEPIEEGLVPVE
        MDRYQKVEKPKPESPINENEIRITSQGAIRNYITYASTLLQDKRVREIVLKAMGQAISKTVSIAEILKKRTPQLHQETSISSVSIIDVWEPIEEGLVPVE
Subjt:  MDRYQKVEKPKPESPINENEIRITSQGAIRNYITYASTLLQDKRVREIVLKAMGQAISKTVSIAEILKKRTPQLHQETSISSVSIIDVWEPIEEGLVPVE

Query:  MTRHVSMISITLSNRELNKDSPGYQAPQFVEQPKQQYKQQQQQQPRQSRPYYNAVNEDPYGQSRGRGRGGGRGRGWGRGGYGNYQGGYGNYQGGYGHYQG
        MTRHVSMISITLSNRELNKDSPGYQAPQFVEQPKQQYKQQQQQQPRQSRPYYNAVNEDPYGQSRGRGRGGGRGRGWGRGGYGNYQGGYGNYQGGYGHYQG
Subjt:  MTRHVSMISITLSNRELNKDSPGYQAPQFVEQPKQQYKQQQQQQPRQSRPYYNAVNEDPYGQSRGRGRGGGRGRGWGRGGYGNYQGGYGNYQGGYGHYQG

Query:  GYGHYQGGYGHYQGGSGGYQGGGPGGYQGNYQDNGGYSNWSRGGGRGRGWTYRGTGYDRGRGGRGYGRGRGRGQIDGQTRDNGGSRNQV
        GYGHYQGGYGHYQGGSGGYQGGGPGGYQGNYQDNGGYSNWSRGGGRGRGWTYRGTGYDRGRGGRGYGRGRGRGQIDGQTRDNGGSRNQV
Subjt:  GYGHYQGGYGHYQGGSGGYQGGGPGGYQGNYQDNGGYSNWSRGGGRGRGWTYRGTGYDRGRGGRGYGRGRGRGQIDGQTRDNGGSRNQV

A0A6J1HJ12 protein SULFUR DEFICIENCY-INDUCED 1-like isoform X11.0e-15099.28Show/hide
Query:  NQVPPGDTPYVRAKYAQLIEKDPESAISLFWEAINAGDRVESALKDMAVVMKQIDRAEEAIHILQTYRFLCSKHSQQSLDNVLIDLFKKCGRIEEQIEML
        ++VPPGDTPYVRAKYAQLIEKDPESAISLFWEAINAGDRVESALKDMAVVMKQIDRAEEAIHILQTYRFLCSKHSQQSLDNVLIDLFKKCGRIEEQIEML
Subjt:  NQVPPGDTPYVRAKYAQLIEKDPESAISLFWEAINAGDRVESALKDMAVVMKQIDRAEEAIHILQTYRFLCSKHSQQSLDNVLIDLFKKCGRIEEQIEML

Query:  KRKLRKIYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQIIDPDANKACNLGLCLMKQGRLNEAISVLQQVQQGR
        KRKLRKIYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQIIDPDANKACNLGLCLMKQGRLNEAISVLQQVQQGR
Subjt:  KRKLRKIYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQIIDPDANKACNLGLCLMKQGRLNEAISVLQQVQQGR

Query:  IPGSDEIKAQKRAGDLLTQIRSRQSLPDSIELLGLSIDGDLLNGLEQLVHERGPFRSKRLPVFEEISSFRDQLACN
        IPGSDEIKAQKRAGDLLTQIRSRQSLPDSIELLGLSIDGDLLNGLEQLVHERGPFRSKRLPVFEEISSFRDQLACN
Subjt:  IPGSDEIKAQKRAGDLLTQIRSRQSLPDSIELLGLSIDGDLLNGLEQLVHERGPFRSKRLPVFEEISSFRDQLACN

A0A6J1HUP6 protein SULFUR DEFICIENCY-INDUCED 1-like isoform X15.3e-14796.74Show/hide
Query:  NQVPPGDTPYVRAKYAQLIEKDPESAISLFWEAINAGDRVESALKDMAVVMKQIDRAEEAIHILQTYRFLCSKHSQQSLDNVLIDLFKKCGRIEEQIEML
        ++VPPGDTPYVRAKYAQLI+KDPESAISLFWEAINAGDRVESALKDMAVVMKQIDRAEEAI IL+TYRFLCSKHSQ+SLDNVLIDLFKKCGRIEEQIE+L
Subjt:  NQVPPGDTPYVRAKYAQLIEKDPESAISLFWEAINAGDRVESALKDMAVVMKQIDRAEEAIHILQTYRFLCSKHSQQSLDNVLIDLFKKCGRIEEQIEML

Query:  KRKLRKIYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQIIDPDANKACNLGLCLMKQGRLNEAISVLQQVQQGR
        KRKLRKIYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPN+MMAEAVYKKAQIIDPDANKACNLGLCLMKQGRLNEAISVLQQVQQG 
Subjt:  KRKLRKIYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQIIDPDANKACNLGLCLMKQGRLNEAISVLQQVQQGR

Query:  IPGSDEIKAQKRAGDLLTQIRSRQSLPDSIELLGLSIDGDLLNGLEQLVHERGPFRSKRLPVFEEISSFRDQLACN
        IPGSDEIKAQKRAGDLLTQIRSRQSLPDSIELLGLSIDGDLLNGLEQLVHERGPFRSKRLPVFEEISSFRDQLACN
Subjt:  IPGSDEIKAQKRAGDLLTQIRSRQSLPDSIELLGLSIDGDLLNGLEQLVHERGPFRSKRLPVFEEISSFRDQLACN

SwissProt top hitse value%identityAlignment
Q8GXU5 Protein SULFUR DEFICIENCY-INDUCED 19.2e-9662.86Show/hide
Query:  NQVPPGDTPYVRAKYAQLIEKDPESAISLFWEAINAGDRVESALKDMAVVMKQIDRAEEAIHILQTYRFLCSKHSQQSLDNVLIDLFKKCGRIEEQIEML
        ++VP GDTPYVRAK+AQLIEK+PE AI  FW+AIN GDRV+SALKDMAVVMKQ+DR+EEAI  ++++R  CSK+SQ SLDNVLIDL+KKCGR+EEQ+E+L
Subjt:  NQVPPGDTPYVRAKYAQLIEKDPESAISLFWEAINAGDRVESALKDMAVVMKQIDRAEEAIHILQTYRFLCSKHSQQSLDNVLIDLFKKCGRIEEQIEML

Query:  KRKLRKIYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQIIDPDANKACNLGLCLMKQGRLNEAISVLQQVQQGR
        KRKLR+IY+GEAFNGKPT+TARSHGKKFQV+V+QE SRLLGNLGWAYMQ+  Y+ AEAVY+KAQ+++PDANK+CNL +CL+KQGR  E   VL  V + R
Subjt:  KRKLRKIYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQIIDPDANKACNLGLCLMKQGRLNEAISVLQQVQQGR

Query:  IPGSDEIKAQKRAGDLLTQIRSRQSLP-----DSIELLGLSIDGDLLNGLEQLVHERGPFRSKRLPVFEEISSFRDQLAC
        + G+D+ + ++RA +LL+++ S  SLP     +  ++LG  +D D + GLE++      F+SKRLP+FE+ISSFR+ L C
Subjt:  IPGSDEIKAQKRAGDLLTQIRSRQSLP-----DSIELLGLSIDGDLLNGLEQLVHERGPFRSKRLPVFEEISSFRDQLAC

Q8L730 Protein SULFUR DEFICIENCY-INDUCED 21.1e-8558.21Show/hide
Query:  NQVPPGDTPYVRAKYAQLIEKDPESAISLFWEAINAGDRVESALKDMAVVMKQIDRAEEAIHILQTYRFLCSKHSQQSLDNVLIDLFKKCGRIEEQIEML
        +++P GD+PYVRAK+ QL+EKD E+AI LFW AI A DRV+SALKDMA++MKQ +RAEEAI  +Q++R LCS+ +Q+SLDNVLIDL+KKCGRIEEQ+E+L
Subjt:  NQVPPGDTPYVRAKYAQLIEKDPESAISLFWEAINAGDRVESALKDMAVVMKQIDRAEEAIHILQTYRFLCSKHSQQSLDNVLIDLFKKCGRIEEQIEML

Query:  KRKLRKIYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQIIDPDANKACNLGLCLMKQGRLNEAISVL-QQVQQG
        K+KL  IY+GEAFNGKPT+TARSHGKKFQV+V++ETSR+LGNLGWAYMQ  +Y  AEAVY+KAQ+I+PDANKACNL  CL+KQG+ +EA S+L + V   
Subjt:  KRKLRKIYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQIIDPDANKACNLGLCLMKQGRLNEAISVL-QQVQQG

Query:  RIPGSDEIKAQKRAGDLLTQIRSRQSLPDSIELLGLSIDGD---LLNGLEQLVHE-RGPFRSKRLPVFEEISSFRDQLAC
           GS + +   R  +LL++++ ++    +   +   +  D   ++ GL++ V E R P+R++RLP+FEEI   RDQLAC
Subjt:  RIPGSDEIKAQKRAGDLLTQIRSRQSLPDSIELLGLSIDGD---LLNGLEQLVHE-RGPFRSKRLPVFEEISSFRDQLAC

Q9FKV5 Protein POLLENLESS 3-LIKE 15.3e-5144.64Show/hide
Query:  QVPPGDTPYVRAKYAQLIEKDPESAISLFWEAINAGDRVESALKDMAVVMKQIDRAEEAIHILQTYRFLCSKHSQQSLDNVLIDLFKKCGRIEEQIEMLK
        +V  GD+PYVRAK+AQL+ KDP  AISLFW AINAGDRV+SALKDM VV+KQ++R +E I  ++++R+LC   SQ S+DN+L++L+ K GRI E  E+L+
Subjt:  QVPPGDTPYVRAKYAQLIEKDPESAISLFWEAINAGDRVESALKDMAVVMKQIDRAEEAIHILQTYRFLCSKHSQQSLDNVLIDLFKKCGRIEEQIEMLK

Query:  RKLRKIYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQIIDPDANKACNLGLCLMKQGRLNEAISVLQQVQQ---
         KLR + + + + G+     RSH ++   +++QE +R+LGNL W ++Q  NY +AE  Y+ A  ++PD NK CNL +CL++  R +EA S+L+ V+Q   
Subjt:  RKLRKIYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQIIDPDANKACNLGLCLMKQGRLNEAISVLQQVQQ---

Query:  GRIPGSDEIKAQKRAGDLLTQIRSRQSLPDSIE
         +       K+ +RA ++L + R + ++ D  E
Subjt:  GRIPGSDEIKAQKRAGDLLTQIRSRQSLPDSIE

Q9SD20 Protein POLLENLESS 3-LIKE 21.1e-6957.14Show/hide
Query:  NQVPPGDTPYVRAKYAQLIEKDPESAISLFWEAINAGDRVESALKDMAVVMKQIDRAEEAIHILQTYRFLCSKHSQQSLDNVLIDLFKKCGRIEEQIEML
        ++VP GD+PYVRAK  QL+EKDPE AI LFW+AINAGDRV+SALKDMA+VMKQ +RAEEAI  +++ R  CS  +Q+SLDN+L+DL+K+CGR+++QI +L
Subjt:  NQVPPGDTPYVRAKYAQLIEKDPESAISLFWEAINAGDRVESALKDMAVVMKQIDRAEEAIHILQTYRFLCSKHSQQSLDNVLIDLFKKCGRIEEQIEML

Query:  KRKLRKIYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQIIDPDANKACNLGLCLMKQGRLNEAISVLQQVQQGR
        K KL  I +G AFNGK T+TARS GKKFQVSV+QE +RLLGNLGWA MQ+ N++ AE  Y++A  I PD NK CNLG+CLMKQGR++EA   L++V+   
Subjt:  KRKLRKIYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQIIDPDANKACNLGLCLMKQGRLNEAISVLQQVQQGR

Query:  IPG----SDEIKAQKRAGDLLTQIRS---RQSLPDSIE
        + G       +KA +RA  +L  + S   R+   D +E
Subjt:  IPG----SDEIKAQKRAGDLLTQIRS---RQSLPDSIE

Q9SUC3 Protein POLLENLESS 31.3e-5748.52Show/hide
Query:  NQVPPGDTPYVRAKYAQLIEKDPESAISLFWEAINAGDRVESALKDMAVVMKQIDRAEEAIHILQTYRFLCSKHSQQSLDNVLIDLFKKCGRIEEQIEML
        ++VP GD+PYVRAK+AQLI+KDP  AISLFW AINAGDRV+SALKDMAVVMKQ+ R++E I  ++++R+LCS  SQ S+DN+L++L+KK GRIEE+  +L
Subjt:  NQVPPGDTPYVRAKYAQLIEKDPESAISLFWEAINAGDRVESALKDMAVVMKQIDRAEEAIHILQTYRFLCSKHSQQSLDNVLIDLFKKCGRIEEQIEML

Query:  KRKLRKIYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQIIDPDANKACNLGLCLMKQGRLNEAISVLQQVQQGR
        + KL+ + +G  F G+ +R  R  GK   ++++QE +R+LGNLGW ++Q  NY +AE  Y++A  ++ D NK CNL +CLM+  R+ EA S+L  V+   
Subjt:  KRKLRKIYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQIIDPDANKACNLGLCLMKQGRLNEAISVLQQVQQGR

Query:  IPGSDE------IKAQKRAGDLLTQIRSRQSLPDSIE
         P   E       K+  RA ++L +I S++   D  E
Subjt:  IPGSDE------IKAQKRAGDLLTQIRSRQSLPDSIE

Arabidopsis top hitse value%identityAlignment
AT1G04770.1 Tetratricopeptide repeat (TPR)-like superfamily protein8.0e-8758.21Show/hide
Query:  NQVPPGDTPYVRAKYAQLIEKDPESAISLFWEAINAGDRVESALKDMAVVMKQIDRAEEAIHILQTYRFLCSKHSQQSLDNVLIDLFKKCGRIEEQIEML
        +++P GD+PYVRAK+ QL+EKD E+AI LFW AI A DRV+SALKDMA++MKQ +RAEEAI  +Q++R LCS+ +Q+SLDNVLIDL+KKCGRIEEQ+E+L
Subjt:  NQVPPGDTPYVRAKYAQLIEKDPESAISLFWEAINAGDRVESALKDMAVVMKQIDRAEEAIHILQTYRFLCSKHSQQSLDNVLIDLFKKCGRIEEQIEML

Query:  KRKLRKIYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQIIDPDANKACNLGLCLMKQGRLNEAISVL-QQVQQG
        K+KL  IY+GEAFNGKPT+TARSHGKKFQV+V++ETSR+LGNLGWAYMQ  +Y  AEAVY+KAQ+I+PDANKACNL  CL+KQG+ +EA S+L + V   
Subjt:  KRKLRKIYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQIIDPDANKACNLGLCLMKQGRLNEAISVL-QQVQQG

Query:  RIPGSDEIKAQKRAGDLLTQIRSRQSLPDSIELLGLSIDGD---LLNGLEQLVHE-RGPFRSKRLPVFEEISSFRDQLAC
           GS + +   R  +LL++++ ++    +   +   +  D   ++ GL++ V E R P+R++RLP+FEEI   RDQLAC
Subjt:  RIPGSDEIKAQKRAGDLLTQIRSRQSLPDSIELLGLSIDGD---LLNGLEQLVHE-RGPFRSKRLPVFEEISSFRDQLAC

AT1G76010.1 Alba DNA/RNA-binding protein1.0e-4147.29Show/hide
Query:  MDRYQKVEKPKPESPINENEIRITSQGAIRNYITYASTLLQDKRVREIVLKAMGQAISKTVSIAEILKKRTPQLHQETSISSVSIIDVWEPIEEGLVPVE
        MD+YQ+V KPK ++PI+ NEIRITSQG  RNYITYA TLLQDK   E+V KAMG+AI+KTV+I E++K+R P LHQ TSI S  I D WEP EEGL+P+E
Subjt:  MDRYQKVEKPKPESPINENEIRITSQGAIRNYITYASTLLQDKRVREIVLKAMGQAISKTVSIAEILKKRTPQLHQETSISSVSIIDVWEPIEEGLVPVE

Query:  MTRHVSMISITLSNRELNKDSPGYQAPQFVEQPKQ----QYKQQQQQQPRQSR-----------------PYYNAVNED---PYGQSRGRGRGGGRG---
         TRHVSMI+ITLS  ELN  S GYQ P  +E  K      Y+ ++     + R                  Y N  +ED      QS GRGRG GRG   
Subjt:  MTRHVSMISITLSNRELNKDSPGYQAPQFVEQPKQ----QYKQQQQQQPRQSR-----------------PYYNAVNED---PYGQSRGRGRGGGRG---

Query:  RGWGRGGYGN--------YQGGYG----------NYQGGYGHYQGGYGHYQG--GYGHYQG------------GSGGYQG--GGPGGYQGNYQDNGGYSN
        RG GRGGY            GGYG          + +GGY     G G Y G  G G Y G            G GGY G   G GGY G  Q  GGY  
Subjt:  RGWGRGGYGN--------YQGGYG----------NYQGGYGHYQGGYGHYQG--GYGHYQG------------GSGGYQG--GGPGGYQGNYQDNGGYSN

Query:  WSRGGGRGRGWTYRGTGYDRGRGGRGYGRGRG
         S+G G   G   RG G  RGRGGRG G GRG
Subjt:  WSRGGGRGRGWTYRGTGYDRGRGGRGYGRGRG

AT3G07030.1 Alba DNA/RNA-binding protein1.5e-4843.98Show/hide
Query:  MDRYQKVEKPKPESPINENEIRITSQGAIRNYITYASTLLQDKRVREIVLKAMGQAISKTVSIAEILKKRTPQLHQETSISSVSIIDVWEPIEEGLVPVE
        MDRYQ+V KPKPESPINENEIRITS+G IRNYI+YA++LLQ+K V++IVLKAMGQAISKTV+I+EILK + P LHQ+ +ISS+SI DVWEP EEGL PVE
Subjt:  MDRYQKVEKPKPESPINENEIRITSQGAIRNYITYASTLLQDKRVREIVLKAMGQAISKTVSIAEILKKRTPQLHQETSISSVSIIDVWEPIEEGLVPVE

Query:  MTRHVSMISITLSNRELNKDSPGYQAPQFVEQPKQQYKQQQQQQPRQSRPYYNAVNE-------------------------------------------
        +TRHVSMISITLS  ELNKDSPGYQAP   +Q K QY   Q QQ RQ+R  YNA  E                                           
Subjt:  MTRHVSMISITLSNRELNKDSPGYQAPQFVEQPKQQYKQQQQQQPRQSRPYYNAVNE-------------------------------------------

Query:  ---------------------------------------------------DPYGQSRGRGRGGGRGRGWGRG---GYGNYQGGY-----GNYQGGYGHY
                                                           D YG  R  G GGGR  G+G G   GYG  +GG+     G    GYG  
Subjt:  ---------------------------------------------------DPYGQSRGRGRGGGRGRGWGRG---GYGNYQGGY-----GNYQGGYGHY

Query:  QGGYGHYQGGYGH-YQGGSG-GYQGGGPGGYQGNYQDNGGYSNWSR---------GGGRGRGW-TYRGTGYDRGRGG-----RGYGRGRGRGQIDGQTRD
        +GGYG   GG G  Y GG G GY GG   GY G   D  G     R         GGGR  G+   +  GY  GRGG      GYGRGRGR    G++RD
Subjt:  QGGYGHYQGGYGH-YQGGSG-GYQGGGPGGYQGNYQDNGGYSNWSR---------GGGRGRGW-TYRGTGYDRGRGG-----RGYGRGRGRGQIDGQTRD

Query:  NGGSRNQ
           ++N+
Subjt:  NGGSRNQ

AT3G51280.1 Tetratricopeptide repeat (TPR)-like superfamily protein8.0e-7157.14Show/hide
Query:  NQVPPGDTPYVRAKYAQLIEKDPESAISLFWEAINAGDRVESALKDMAVVMKQIDRAEEAIHILQTYRFLCSKHSQQSLDNVLIDLFKKCGRIEEQIEML
        ++VP GD+PYVRAK  QL+EKDPE AI LFW+AINAGDRV+SALKDMA+VMKQ +RAEEAI  +++ R  CS  +Q+SLDN+L+DL+K+CGR+++QI +L
Subjt:  NQVPPGDTPYVRAKYAQLIEKDPESAISLFWEAINAGDRVESALKDMAVVMKQIDRAEEAIHILQTYRFLCSKHSQQSLDNVLIDLFKKCGRIEEQIEML

Query:  KRKLRKIYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQIIDPDANKACNLGLCLMKQGRLNEAISVLQQVQQGR
        K KL  I +G AFNGK T+TARS GKKFQVSV+QE +RLLGNLGWA MQ+ N++ AE  Y++A  I PD NK CNLG+CLMKQGR++EA   L++V+   
Subjt:  KRKLRKIYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQIIDPDANKACNLGLCLMKQGRLNEAISVLQQVQQGR

Query:  IPG----SDEIKAQKRAGDLLTQIRS---RQSLPDSIE
        + G       +KA +RA  +L  + S   R+   D +E
Subjt:  IPG----SDEIKAQKRAGDLLTQIRS---RQSLPDSIE

AT5G48850.1 Tetratricopeptide repeat (TPR)-like superfamily protein6.5e-9762.86Show/hide
Query:  NQVPPGDTPYVRAKYAQLIEKDPESAISLFWEAINAGDRVESALKDMAVVMKQIDRAEEAIHILQTYRFLCSKHSQQSLDNVLIDLFKKCGRIEEQIEML
        ++VP GDTPYVRAK+AQLIEK+PE AI  FW+AIN GDRV+SALKDMAVVMKQ+DR+EEAI  ++++R  CSK+SQ SLDNVLIDL+KKCGR+EEQ+E+L
Subjt:  NQVPPGDTPYVRAKYAQLIEKDPESAISLFWEAINAGDRVESALKDMAVVMKQIDRAEEAIHILQTYRFLCSKHSQQSLDNVLIDLFKKCGRIEEQIEML

Query:  KRKLRKIYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQIIDPDANKACNLGLCLMKQGRLNEAISVLQQVQQGR
        KRKLR+IY+GEAFNGKPT+TARSHGKKFQV+V+QE SRLLGNLGWAYMQ+  Y+ AEAVY+KAQ+++PDANK+CNL +CL+KQGR  E   VL  V + R
Subjt:  KRKLRKIYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQIIDPDANKACNLGLCLMKQGRLNEAISVLQQVQQGR

Query:  IPGSDEIKAQKRAGDLLTQIRSRQSLP-----DSIELLGLSIDGDLLNGLEQLVHERGPFRSKRLPVFEEISSFRDQLAC
        + G+D+ + ++RA +LL+++ S  SLP     +  ++LG  +D D + GLE++      F+SKRLP+FE+ISSFR+ L C
Subjt:  IPGSDEIKAQKRAGDLLTQIRSRQSLP-----DSIELLGLSIDGDLLNGLEQLVHERGPFRSKRLPVFEEISSFRDQLAC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATAGGTACCAGAAGGTGGAGAAGCCTAAACCTGAATCGCCGATAAACGAAAATGAAATCCGCATCACAAGTCAAGGCGCTATTCGAAACTACATCACTTATGCTTC
TACTCTACTTCAGGACAAACGTGTTAGAGAGATAGTGTTGAAAGCCATGGGGCAGGCAATCAGTAAAACAGTTTCCATTGCAGAAATTTTAAAGAAGAGAACTCCACAAT
TGCATCAAGAAACTTCCATCAGCTCAGTAAGCATAATTGATGTATGGGAGCCTATTGAAGAAGGTCTTGTACCTGTGGAGATGACTCGACACGTCTCAATGATATCAATT
ACCCTGTCAAATCGAGAACTAAATAAAGATTCTCCTGGGTATCAAGCTCCACAATTTGTGGAGCAGCCAAAGCAACAATATAAGCAGCAGCAACAACAACAGCCCAGACA
ATCTCGTCCTTATTACAATGCTGTTAACGAAGATCCATATGGTCAAAGCCGAGGTCGAGGTCGAGGTGGAGGGAGAGGGCGTGGTTGGGGCAGAGGTGGATATGGAAACT
ACCAAGGGGGATATGGCAACTACCAGGGCGGATATGGCCATTACCAAGGCGGATATGGCCATTACCAAGGCGGATATGGCCACTATCAAGGTGGATCTGGTGGCTACCAA
GGTGGTGGCCCTGGCGGCTACCAAGGCAACTATCAAGATAATGGTGGGTACTCGAATTGGAGTAGAGGTGGTGGGCGAGGCCGAGGTTGGACATATCGTGGAACTGGCTA
TGATAGAGGCAGAGGGGGGAGGGGTTATGGTCGAGGTCGAGGTCGGGGTCAGATAGACGGACAAACAAGGGACAATGGTGGCAGTAGAAACCAGGTTCCACCTGGTGATA
CTCCTTATGTTAGAGCAAAATATGCTCAGTTGATAGAGAAGGATCCAGAGAGTGCAATTTCATTATTTTGGGAAGCAATAAATGCAGGAGACAGAGTTGAAAGTGCTCTA
AAAGACATGGCTGTGGTTATGAAGCAAATTGATAGAGCTGAAGAAGCCATTCATATCCTCCAAACCTACAGATTTCTCTGCTCCAAACACTCCCAACAATCGCTCGATAA
CGTTCTTATCGATTTGTTTAAGAAATGTGGAAGAATAGAAGAACAAATTGAGATGCTGAAGAGAAAACTAAGAAAGATATATGAAGGAGAAGCTTTTAATGGAAAGCCCA
CAAGAACAGCTCGTTCTCATGGCAAGAAATTTCAGGTTTCTGTCAAACAAGAAACCTCTAGATTATTGGGGAATCTGGGGTGGGCCTACATGCAAAAGCCCAATTACATG
ATGGCGGAGGCAGTGTACAAAAAGGCCCAAATTATCGATCCAGACGCCAACAAAGCATGCAATTTGGGCCTTTGTTTGATGAAACAGGGCCGCCTCAATGAGGCCATTTC
GGTTCTCCAACAAGTCCAGCAAGGCAGAATTCCGGGCTCTGATGAGATCAAAGCCCAAAAACGGGCAGGAGATTTGCTGACCCAAATTAGGTCGAGACAATCGCTGCCCG
ATTCCATCGAGCTATTGGGCCTCAGTATTGATGGCGATTTGCTTAATGGGCTTGAGCAATTGGTCCATGAACGAGGCCCATTTAGGTCCAAGAGGCTTCCGGTTTTTGAG
GAAATTTCTTCATTTAGGGATCAATTAGCATGTAATTAA
mRNA sequenceShow/hide mRNA sequence
TTCCCCTTCCTCTTTCCTTTAGCTCAGAACCACACTCTCAAGCACTTCACTGTCAAGCTTTCTCCTCTTTCCCGTTAGGGTTTCTGTCTTCTTCTTCTTCGTCTTCCTCT
TCCCTCTCTCGCTTCCCGCCTTGCTGAAGAACCGCCATGGATAGGTACCAGAAGGTGGAGAAGCCTAAACCTGAATCGCCGATAAACGAAAATGAAATCCGCATCACAAG
TCAAGGCGCTATTCGAAACTACATCACTTATGCTTCTACTCTACTTCAGGACAAACGTGTTAGAGAGATAGTGTTGAAAGCCATGGGGCAGGCAATCAGTAAAACAGTTT
CCATTGCAGAAATTTTAAAGAAGAGAACTCCACAATTGCATCAAGAAACTTCCATCAGCTCAGTAAGCATAATTGATGTATGGGAGCCTATTGAAGAAGGTCTTGTACCT
GTGGAGATGACTCGACACGTCTCAATGATATCAATTACCCTGTCAAATCGAGAACTAAATAAAGATTCTCCTGGGTATCAAGCTCCACAATTTGTGGAGCAGCCAAAGCA
ACAATATAAGCAGCAGCAACAACAACAGCCCAGACAATCTCGTCCTTATTACAATGCTGTTAACGAAGATCCATATGGTCAAAGCCGAGGTCGAGGTCGAGGTGGAGGGA
GAGGGCGTGGTTGGGGCAGAGGTGGATATGGAAACTACCAAGGGGGATATGGCAACTACCAGGGCGGATATGGCCATTACCAAGGCGGATATGGCCATTACCAAGGCGGA
TATGGCCACTATCAAGGTGGATCTGGTGGCTACCAAGGTGGTGGCCCTGGCGGCTACCAAGGCAACTATCAAGATAATGGTGGGTACTCGAATTGGAGTAGAGGTGGTGG
GCGAGGCCGAGGTTGGACATATCGTGGAACTGGCTATGATAGAGGCAGAGGGGGGAGGGGTTATGGTCGAGGTCGAGGTCGGGGTCAGATAGACGGACAAACAAGGGACA
ATGGTGGCAGTAGAAACCAGGTTCCACCTGGTGATACTCCTTATGTTAGAGCAAAATATGCTCAGTTGATAGAGAAGGATCCAGAGAGTGCAATTTCATTATTTTGGGAA
GCAATAAATGCAGGAGACAGAGTTGAAAGTGCTCTAAAAGACATGGCTGTGGTTATGAAGCAAATTGATAGAGCTGAAGAAGCCATTCATATCCTCCAAACCTACAGATT
TCTCTGCTCCAAACACTCCCAACAATCGCTCGATAACGTTCTTATCGATTTGTTTAAGAAATGTGGAAGAATAGAAGAACAAATTGAGATGCTGAAGAGAAAACTAAGAA
AGATATATGAAGGAGAAGCTTTTAATGGAAAGCCCACAAGAACAGCTCGTTCTCATGGCAAGAAATTTCAGGTTTCTGTCAAACAAGAAACCTCTAGATTATTGGGGAAT
CTGGGGTGGGCCTACATGCAAAAGCCCAATTACATGATGGCGGAGGCAGTGTACAAAAAGGCCCAAATTATCGATCCAGACGCCAACAAAGCATGCAATTTGGGCCTTTG
TTTGATGAAACAGGGCCGCCTCAATGAGGCCATTTCGGTTCTCCAACAAGTCCAGCAAGGCAGAATTCCGGGCTCTGATGAGATCAAAGCCCAAAAACGGGCAGGAGATT
TGCTGACCCAAATTAGGTCGAGACAATCGCTGCCCGATTCCATCGAGCTATTGGGCCTCAGTATTGATGGCGATTTGCTTAATGGGCTTGAGCAATTGGTCCATGAACGA
GGCCCATTTAGGTCCAAGAGGCTTCCGGTTTTTGAGGAAATTTCTTCATTTAGGGATCAATTAGCATGTAATTAATTAATTAATTAACTTTTCCTTTTTTCCTTTTTATT
TATTTATTTTTAATGTAGATTGGGGTTGGTGTTGGGATTTAGAATAATGTGTGTCTCGGTTTTGTAAATAAAGAAAAACGAAGGAGCTGAATGCAAATTTGAAAAGGAAA
TGTTTTGTGTAAATTTAGTATTGATGTTGGAATAAATATTTGTTTTTTTTT
Protein sequenceShow/hide protein sequence
MDRYQKVEKPKPESPINENEIRITSQGAIRNYITYASTLLQDKRVREIVLKAMGQAISKTVSIAEILKKRTPQLHQETSISSVSIIDVWEPIEEGLVPVEMTRHVSMISI
TLSNRELNKDSPGYQAPQFVEQPKQQYKQQQQQQPRQSRPYYNAVNEDPYGQSRGRGRGGGRGRGWGRGGYGNYQGGYGNYQGGYGHYQGGYGHYQGGYGHYQGGSGGYQ
GGGPGGYQGNYQDNGGYSNWSRGGGRGRGWTYRGTGYDRGRGGRGYGRGRGRGQIDGQTRDNGGSRNQVPPGDTPYVRAKYAQLIEKDPESAISLFWEAINAGDRVESAL
KDMAVVMKQIDRAEEAIHILQTYRFLCSKHSQQSLDNVLIDLFKKCGRIEEQIEMLKRKLRKIYEGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYM
MAEAVYKKAQIIDPDANKACNLGLCLMKQGRLNEAISVLQQVQQGRIPGSDEIKAQKRAGDLLTQIRSRQSLPDSIELLGLSIDGDLLNGLEQLVHERGPFRSKRLPVFE
EISSFRDQLACN