; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0023505 (gene) of Chayote v1 genome

Gene IDSed0023505
OrganismSechium edule (Chayote v1)
DescriptionSmr domain-containing protein
Genome locationLG07:3757016..3761171
RNA-Seq ExpressionSed0023505
SyntenySed0023505
Gene Ontology termsNA
InterPro domainsIPR002625 - Smr domain
IPR013899 - Domain of unknown function DUF1771
IPR036063 - Smr domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6576051.1 hypothetical protein SDJN03_26690, partial [Cucurbita argyrosperma subsp. sororia]3.1e-21671.5Show/hide
Query:  RGKSLG-AAVNLKTQTRGFQDEIGLEPLPPVPSGLSSLPPRENLPRFNGHSGRSLSPAPLPSGNSLTSAENYGARKTVLGNSSTRNGKKVVEETTDA-TF
        RGKS G AAVNLK    G QDEI  +P PP+ + LS LPPREN+ R NG SGRS S  PLPS +SL S EN+GA+KT+ GNSS ++ KK+VEE+TD   F
Subjt:  RGKSLG-AAVNLKTQTRGFQDEIGLEPLPPVPSGLSSLPPRENLPRFNGHSGRSLSPAPLPSGNSLTSAENYGARKTVLGNSSTRNGKKVVEETTDA-TF

Query:  WKLKELHPWADYSLIMDVMEAVNNNFNDASTLLKTMVSSDKFEINNEISTFGLHSSNDLSSVRGESPG-AAANLKTQTRGFQDEVGLVPLAPISSSLSSL
        WKLKELH WAD SLI+D+MEAVNNNFN+AS LLKTMVSSD FEINNE+ST GLHSSND+S VRG+SPG    NLK Q RG QD +   P  P+ S+LSSL
Subjt:  WKLKELHPWADYSLIMDVMEAVNNNFNDASTLLKTMVSSDKFEINNEISTFGLHSSNDLSSVRGESPG-AAANLKTQTRGFQDEVGLVPLAPISSSLSSL

Query:  PPREKLPRVDGHSRRSFSSAPLPSGNSPTLAENCGAIKTLHGNFSTRGDKKVIEETTDV-TFWKLKELHPWADFSLIVGIMEAVNNNFNEASTLLKILVS
        PPRE L  V+G   RS SS+PLPS +S T  EN  A K L G+ S +  +KV+EETTDV  FWKLKELH WADFSLIV IMEAV+NNFNEAST L  +VS
Subjt:  PPREKLPRVDGHSRRSFSSAPLPSGNSPTLAENCGAIKTLHGNFSTRGDKKVIEETTDV-TFWKLKELHPWADFSLIVGIMEAVNNNFNEASTLLKILVS

Query:  SDTFEINNEMSALGLHSSNNQLCNRKNDASISLEKMVNIPGIGSTLKDVHQDNNACQEDDTKLIENNYYERNFFHDAGNPKRAL-GSTSIPIEPEWEEDD
        SD  EI NEMS LGLHS++   CN KND +ISL + VN P   STLKDV QD +  Q  + KL ENNY+ERNFFH+ GNPK AL  S S PIEPEWEEDD
Subjt:  SDTFEINNEMSALGLHSSNNQLCNRKNDASISLEKMVNIPGIGSTLKDVHQDNNACQEDDTKLIENNYYERNFFHDAGNPKRAL-GSTSIPIEPEWEEDD

Query:  IYLTHRKDAVAMMRSASQHSRAATNAYLKKDHASAKYHSSRAQEQWLAAKKLNDKAAKKILRTRNSKNGLWKLDLHGLHAAEAVQALQENLLKIETQYAS
        IYL+HRKDA+AMMRSASQHSRAATNAYL+KDHASAKYHSSRAQEQWLAAK LN KAA +IL+TRNS+NGLWKLDLHGLHAAEAVQALQ++LLKIET+ AS
Subjt:  IYLTHRKDAVAMMRSASQHSRAATNAYLKKDHASAKYHSSRAQEQWLAAKKLNDKAAKKILRTRNSKNGLWKLDLHGLHAAEAVQALQENLLKIETQYAS

Query:  NRSLSPKKAERKGFLRASSLESLSCFDSKLNKASQSPSSRHRPTSLEVITGRGKHSRGEAALPKAVTNYLSENGYRFEQLRPGTISVRPKFRR
        NRSLSPKKAERKGF R SSLE LSC   KL+K  QSP  RHRPTSLEVITG GKHSRGEAALPKAVT++LSENGYRFEQLRPGTISVRPKFRR
Subjt:  NRSLSPKKAERKGFLRASSLESLSCFDSKLNKASQSPSSRHRPTSLEVITGRGKHSRGEAALPKAVTNYLSENGYRFEQLRPGTISVRPKFRR

XP_004148966.1 uncharacterized protein LOC101223137 [Cucumis sativus]7.5e-21868.03Show/hide
Query:  LSSVRGKSLG-AAVNLKTQTRGFQDEIGLEPLPPVPSGLSSLPPRENLPRFNGHSGRSLSPAPLPSGNS----------LTSAENYGARKTVLGNSSTRN
        +S VRGKS G AA NLK Q  G QDE+  +P PP+ + LSSLPPRENL   NGHSG+S S AP+PS +S           T+  N+GA+KT+LG ++ ++
Subjt:  LSSVRGKSLG-AAVNLKTQTRGFQDEIGLEPLPPVPSGLSSLPPRENLPRFNGHSGRSLSPAPLPSGNS----------LTSAENYGARKTVLGNSSTRN

Query:  GKKVVEETTDA-TFWKLKELHPWADYSLIMDVMEAVNNNFNDASTLLKTMVSSDKFEINNEISTFGLHSSNDLSSVRGESPG-AAANLKTQTRGFQDEVG
        GKK+VEET D  +FWKLKELHPWAD SLIMD+MEAVNN+FN+ASTLL TMVSSD  EINN++ST GLHSSNDL  + G+SPG    NLK   +G QDE+ 
Subjt:  GKKVVEETTDA-TFWKLKELHPWADYSLIMDVMEAVNNNFNDASTLLKTMVSSDKFEINNEISTFGLHSSNDLSSVRGESPG-AAANLKTQTRGFQDEVG

Query:  LVPLAPISSSLSSLPPREKLPRVDGHSRRSFSSAPLPSGNSPTLAENCGAIKTLHGNFSTRGDKKVIEETTDV-TFWKLKELHPWADFSLIVGIMEAVNN
        L    P+ ++ SSLPP E L  V G S RSF+S PLPS +S T  EN GA  T+  + S +  KKV+EE TDV  FWKLKE+H WADFSLIV IM+AVNN
Subjt:  LVPLAPISSSLSSLPPREKLPRVDGHSRRSFSSAPLPSGNSPTLAENCGAIKTLHGNFSTRGDKKVIEETTDV-TFWKLKELHPWADFSLIVGIMEAVNN

Query:  NFNEASTLLKILVSSDTFEINNEMSALGLHSSNNQLCNRKNDASISLEKMVNIPGIGSTLK---DVHQDNNACQEDDTKLIENNYYERNFFHDAGNPKRA
        NF+EASTLLK +VSSD FEINNE+S LGLHS+N+ LCN  ND SI+ E+M+N P + ST+K    +HQ+NN  +ED TKL  N+Y+ERN FH+ GN K A
Subjt:  NFNEASTLLKILVSSDTFEINNEMSALGLHSSNNQLCNRKNDASISLEKMVNIPGIGSTLK---DVHQDNNACQEDDTKLIENNYYERNFFHDAGNPKRA

Query:  LG-STSIPIEPEWEEDDIYLTHRKDAVAMMRSASQHSRAATNAYLKKDHASAKYHSSRAQEQWLAAKKLNDKAAKKILRTRNSKNGLWKLDLHGLHAAEA
        LG S S+PIEPEWEEDDIYL+HRKDA+AMMRSASQHSRAATNAY +KDHASAKYHSSRA+EQWLAAK LNDKAA +IL+TRNSKNGLWKLDLHGLHAAEA
Subjt:  LG-STSIPIEPEWEEDDIYLTHRKDAVAMMRSASQHSRAATNAYLKKDHASAKYHSSRAQEQWLAAKKLNDKAAKKILRTRNSKNGLWKLDLHGLHAAEA

Query:  VQALQENLLKIETQYASNRSLSPKKAERKGFLRASSLESLSCFDSKLNKASQSPSSRHRPTSLEVITGRGKHSRGEAALPKAVTNYLSENGYRFEQLRPG
        VQAL ++LLKIETQ ASNRSLSPKKAERKGF RASSLE LSC +SKL+K  +SPSSRHRPTSLEVITG GKHS+GEAALPKAV ++L+ENGYRFEQ RPG
Subjt:  VQALQENLLKIETQYASNRSLSPKKAERKGFLRASSLESLSCFDSKLNKASQSPSSRHRPTSLEVITGRGKHSRGEAALPKAVTNYLSENGYRFEQLRPG

Query:  TISVRPKFRR
        TISVRPKFRR
Subjt:  TISVRPKFRR

XP_022953352.1 uncharacterized protein LOC111455928 isoform X1 [Cucurbita moschata]5.2e-21972.01Show/hide
Query:  RGKSLG-AAVNLKTQTRGFQDEIGLEPLPPVPSGLSSLPPRENLPRFNGHSGRSLSPAPLPSGNSLTSAENYGARKTVLGNSSTRNGKKVVEETTDA-TF
        RGKS G AAVNLK Q  G QDEI  +P PP+ + LS LPPREN+ R NG SGRS S  PLPS +SL S EN+G +KT+ GNSS R+GKK+VEE+TD   F
Subjt:  RGKSLG-AAVNLKTQTRGFQDEIGLEPLPPVPSGLSSLPPRENLPRFNGHSGRSLSPAPLPSGNSLTSAENYGARKTVLGNSSTRNGKKVVEETTDA-TF

Query:  WKLKELHPWADYSLIMDVMEAVNNNFNDASTLLKTMVSSDKFEINNEISTFGLHSSNDLSSVRGESPG-AAANLKTQTRGFQDEVGLVPLAPISSSLSSL
        WKLKELH WAD SLI+D+MEAVNNNFN+AS LLKTMVSSD FEINNE+ST GLHSSND+S VRG+SPG    NLK Q RG QD +   P  P+ S+LSSL
Subjt:  WKLKELHPWADYSLIMDVMEAVNNNFNDASTLLKTMVSSDKFEINNEISTFGLHSSNDLSSVRGESPG-AAANLKTQTRGFQDEVGLVPLAPISSSLSSL

Query:  PPREKLPRVDGHSRRSFSSAPLPSGNSPTLAENCGAIKTLHGNFSTRGDKKVIEETTDV-TFWKLKELHPWADFSLIVGIMEAVNNNFNEASTLLKILVS
        PPRE L  V+G   RS SS+PLPS +S TL EN  A K L G+ S +  +KV+EETTDV  FWKLKELH WADFSLIV IMEAV+NNFNEAST L  +VS
Subjt:  PPREKLPRVDGHSRRSFSSAPLPSGNSPTLAENCGAIKTLHGNFSTRGDKKVIEETTDV-TFWKLKELHPWADFSLIVGIMEAVNNNFNEASTLLKILVS

Query:  SDTFEINNEMSALGLHSSNNQLCNRKNDASISLEKMVNIPGIGSTLKDVHQDNNACQEDDTKLIENNYYERNFFHDAGNPKRAL-GSTSIPIEPEWEEDD
        SD  EI NEMS LGLHS++   CN KND +ISL + VN P   STLKDV QD +  Q  + KL ENNY+ERNFFH+ GNPK AL  S S PIEPEWEEDD
Subjt:  SDTFEINNEMSALGLHSSNNQLCNRKNDASISLEKMVNIPGIGSTLKDVHQDNNACQEDDTKLIENNYYERNFFHDAGNPKRAL-GSTSIPIEPEWEEDD

Query:  IYLTHRKDAVAMMRSASQHSRAATNAYLKKDHASAKYHSSRAQEQWLAAKKLNDKAAKKILRTRNSKNGLWKLDLHGLHAAEAVQALQENLLKIETQYAS
        IYL+HRKDA+AMMRSASQHSRAATNAYL+KDHASAKYHSSRAQEQWLAAK LN KAA +IL+TRNS+NGLWKLDLHGLHAAEAVQALQ++LLKIET+ AS
Subjt:  IYLTHRKDAVAMMRSASQHSRAATNAYLKKDHASAKYHSSRAQEQWLAAKKLNDKAAKKILRTRNSKNGLWKLDLHGLHAAEAVQALQENLLKIETQYAS

Query:  NRSLSPKKAERKGFLRASSLESLSCFDSKLNKASQSPSSRHRPTSLEVITGRGKHSRGEAALPKAVTNYLSENGYRFEQLRPGTISVRPKFRR
        NRSLSPKKAERKGF R SSLE LSC   KL+K  QSP  RHRPTSLEVITG GKHSRGEAALPKAVT++LSENGYRFEQLRPGTISVRPKFRR
Subjt:  NRSLSPKKAERKGFLRASSLESLSCFDSKLNKASQSPSSRHRPTSLEVITGRGKHSRGEAALPKAVTNYLSENGYRFEQLRPGTISVRPKFRR

XP_023548349.1 uncharacterized protein LOC111807017 [Cucurbita pepo subsp. pepo]2.8e-21771.67Show/hide
Query:  RGKSLG-AAVNLKTQTRGFQDEIGLEPLPPVPSGLSSLPPRENLPRFNGHSGRSLSPAPLPSGNSLTSAENYGARKTVLGNSSTRNGKKVVEETTDA-TF
        RGKS G AAVNLK Q  G QDEI  +P PP+ + LS LPPREN+ R NG SGRS S  PLPS +SL S +N+GA+KT+ GNSS R+GKK+VEE+TD   F
Subjt:  RGKSLG-AAVNLKTQTRGFQDEIGLEPLPPVPSGLSSLPPRENLPRFNGHSGRSLSPAPLPSGNSLTSAENYGARKTVLGNSSTRNGKKVVEETTDA-TF

Query:  WKLKELHPWADYSLIMDVMEAVNNNFNDASTLLKTMVSSDKFEINNEISTFGLHSSNDLSSVRGESPG-AAANLKTQTRGFQDEVGLVPLAPISSSLSSL
        WKLKELH WAD SLI+D+MEAVNNNFN+AS LLKTMVSSD FEINNE+ST GLHSSND+S VRG+SPG    NLK Q RG QD +   P  P+ S+LSSL
Subjt:  WKLKELHPWADYSLIMDVMEAVNNNFNDASTLLKTMVSSDKFEINNEISTFGLHSSNDLSSVRGESPG-AAANLKTQTRGFQDEVGLVPLAPISSSLSSL

Query:  PPREKLPRVDGHSRRSFSSAPLPSGNSPTLAENCGAIKTLHGNFSTRGDKKVIEETTDV-TFWKLKELHPWADFSLIVGIMEAVNNNFNEASTLLKILVS
        PPRE L  V G   RS SS+PLPS +S T  EN  A K L G+ S +  +KV+EETTDV  FWKLKELH WADFSLIV IMEAV+NNFNEAST L  +VS
Subjt:  PPREKLPRVDGHSRRSFSSAPLPSGNSPTLAENCGAIKTLHGNFSTRGDKKVIEETTDV-TFWKLKELHPWADFSLIVGIMEAVNNNFNEASTLLKILVS

Query:  SDTFEINNEMSALGLHSSNNQLCNRKNDASISLEKMVNIPGIGSTLKDVHQDNNACQEDDTKLIENNYYERNFFHDAGNPKRAL-GSTSIPIEPEWEEDD
        SD  EI NEMS LGLHS++   C  KND +ISL + VN P   STLKDV QD +  Q  + KL ENNY+ERNFFH+ GNPK AL  S S PIEPEWEEDD
Subjt:  SDTFEINNEMSALGLHSSNNQLCNRKNDASISLEKMVNIPGIGSTLKDVHQDNNACQEDDTKLIENNYYERNFFHDAGNPKRAL-GSTSIPIEPEWEEDD

Query:  IYLTHRKDAVAMMRSASQHSRAATNAYLKKDHASAKYHSSRAQEQWLAAKKLNDKAAKKILRTRNSKNGLWKLDLHGLHAAEAVQALQENLLKIETQYAS
        IYL+HRKDA+AMMRSASQHSRAATNAYL+KDHASAKYHSSRAQEQWLAAK LN KAA +IL+TRNS+NGLWKLDLHGLHAAEAVQALQ++LLKIET+ AS
Subjt:  IYLTHRKDAVAMMRSASQHSRAATNAYLKKDHASAKYHSSRAQEQWLAAKKLNDKAAKKILRTRNSKNGLWKLDLHGLHAAEAVQALQENLLKIETQYAS

Query:  NRSLSPKKAERKGFLRASSLESLSCFDSKLNKASQSPSSRHRPTSLEVITGRGKHSRGEAALPKAVTNYLSENGYRFEQLRPGTISVRPKFRR
        NRSLSPKKAERKGF R SSLE LSC   KL+K  QSP  RHRPTSLEVITG GKHSRGEAALPKAVT++LSENGYRFEQLRPGTISVRPKFRR
Subjt:  NRSLSPKKAERKGFLRASSLESLSCFDSKLNKASQSPSSRHRPTSLEVITGRGKHSRGEAALPKAVTNYLSENGYRFEQLRPGTISVRPKFRR

XP_038898473.1 uncharacterized protein LOC120086100 [Benincasa hispida]2.5e-22170.98Show/hide
Query:  LSSVRGKSLG-AAVNLKTQTRGFQDEIGLEPLPPVPSGLSSLPPRENLPRFNGHSGRSLSPAPLPSGNSLTS----------AENYGARKTVLGNSSTRN
        +S V+GKS G AA NLK Q  G QDE+  +P PPV + LSSLPP EN    NG SGRS S AP PS NSLTS           EN GA+KT+L  S+ +N
Subjt:  LSSVRGKSLG-AAVNLKTQTRGFQDEIGLEPLPPVPSGLSSLPPRENLPRFNGHSGRSLSPAPLPSGNSLTS----------AENYGARKTVLGNSSTRN

Query:  GKKVVEETTDA-TFWKLKELHPWADYSLIMDVMEAVNNNFNDASTLLKTMVSSDKFEINNEISTFGLHSSNDLSSVRGESPG-AAANLKTQTRGFQDEVG
        GKKVVEET D  +FWKLKELH WAD SLIMDVMEAVNNNF++ASTLLKTMV+SD FEINNE+ST GL  SNDLS V G  PG    NLK   RG QDE  
Subjt:  GKKVVEETTDA-TFWKLKELHPWADYSLIMDVMEAVNNNFNDASTLLKTMVSSDKFEINNEISTFGLHSSNDLSSVRGESPG-AAANLKTQTRGFQDEVG

Query:  LVPLAPISSSLSSLPPREKLPRVDGHSRRSFSSAPLPSGNSPTLAENCGAIKTLHGNFSTRGDKKVIEETTD-VTFWKLKELHPWADFSLIVGIMEAVNN
        L PL P+ +  SSLPP E L RV G S +SFSS P  S +S T  EN GA KT+  + S +  KKV+EE+ D + FWKLKELH WADFSLIV IMEAVNN
Subjt:  LVPLAPISSSLSSLPPREKLPRVDGHSRRSFSSAPLPSGNSPTLAENCGAIKTLHGNFSTRGDKKVIEETTD-VTFWKLKELHPWADFSLIVGIMEAVNN

Query:  NFNEASTLLKILVSSDTFEINNEMSALGLHSSNNQLCNRKNDASISLEKMVNIPGIGSTLKD---VHQDNNACQEDDTKLIENNYYERNFFHDAGNPKRA
        NFNEASTLLK +VSSD F+IN+EMS L L S+N+ LCN KND S SLE+  NIP   STLKD   VHQ+NNAC+E+ TKL ENNY+ERNFFH+AG PK  
Subjt:  NFNEASTLLKILVSSDTFEINNEMSALGLHSSNNQLCNRKNDASISLEKMVNIPGIGSTLKD---VHQDNNACQEDDTKLIENNYYERNFFHDAGNPKRA

Query:  LG-STSIPIEPEWEEDDIYLTHRKDAVAMMRSASQHSRAATNAYLKKDHASAKYHSSRAQEQWLAAKKLNDKAAKKILRTRNSKNGLWKLDLHGLHAAEA
        LG S S+PIEPEWEEDDIYL+HRKDA+AMMRSASQHSRAATNAYL+KDHASAKYHSSRAQEQWLAAK LNDKAA +IL+TRNSKNGLWKLDLHGLHAAEA
Subjt:  LG-STSIPIEPEWEEDDIYLTHRKDAVAMMRSASQHSRAATNAYLKKDHASAKYHSSRAQEQWLAAKKLNDKAAKKILRTRNSKNGLWKLDLHGLHAAEA

Query:  VQALQENLLKIETQYASNRSLSPKKAERKGFLRASSLESLSCFDSKLNKASQSPSSRHRPTSLEVITGRGKHSRGEAALPKAVTNYLSENGYRFEQLRPG
        VQALQE+LLKIET+ ASNRSLSPKK+ERKGF  ASSLE LSC DSK++K  +SPSSRHRPTSLEVITG GKHSRGEA LPKAVT++LSENGYRFEQLRPG
Subjt:  VQALQENLLKIETQYASNRSLSPKKAERKGFLRASSLESLSCFDSKLNKASQSPSSRHRPTSLEVITGRGKHSRGEAALPKAVTNYLSENGYRFEQLRPG

Query:  TISVRPKFRR
        TIS+RPKFRR
Subjt:  TISVRPKFRR

TrEMBL top hitse value%identityAlignment
A0A0A0KA90 Smr domain-containing protein3.6e-21868.03Show/hide
Query:  LSSVRGKSLG-AAVNLKTQTRGFQDEIGLEPLPPVPSGLSSLPPRENLPRFNGHSGRSLSPAPLPSGNS----------LTSAENYGARKTVLGNSSTRN
        +S VRGKS G AA NLK Q  G QDE+  +P PP+ + LSSLPPRENL   NGHSG+S S AP+PS +S           T+  N+GA+KT+LG ++ ++
Subjt:  LSSVRGKSLG-AAVNLKTQTRGFQDEIGLEPLPPVPSGLSSLPPRENLPRFNGHSGRSLSPAPLPSGNS----------LTSAENYGARKTVLGNSSTRN

Query:  GKKVVEETTDA-TFWKLKELHPWADYSLIMDVMEAVNNNFNDASTLLKTMVSSDKFEINNEISTFGLHSSNDLSSVRGESPG-AAANLKTQTRGFQDEVG
        GKK+VEET D  +FWKLKELHPWAD SLIMD+MEAVNN+FN+ASTLL TMVSSD  EINN++ST GLHSSNDL  + G+SPG    NLK   +G QDE+ 
Subjt:  GKKVVEETTDA-TFWKLKELHPWADYSLIMDVMEAVNNNFNDASTLLKTMVSSDKFEINNEISTFGLHSSNDLSSVRGESPG-AAANLKTQTRGFQDEVG

Query:  LVPLAPISSSLSSLPPREKLPRVDGHSRRSFSSAPLPSGNSPTLAENCGAIKTLHGNFSTRGDKKVIEETTDV-TFWKLKELHPWADFSLIVGIMEAVNN
        L    P+ ++ SSLPP E L  V G S RSF+S PLPS +S T  EN GA  T+  + S +  KKV+EE TDV  FWKLKE+H WADFSLIV IM+AVNN
Subjt:  LVPLAPISSSLSSLPPREKLPRVDGHSRRSFSSAPLPSGNSPTLAENCGAIKTLHGNFSTRGDKKVIEETTDV-TFWKLKELHPWADFSLIVGIMEAVNN

Query:  NFNEASTLLKILVSSDTFEINNEMSALGLHSSNNQLCNRKNDASISLEKMVNIPGIGSTLK---DVHQDNNACQEDDTKLIENNYYERNFFHDAGNPKRA
        NF+EASTLLK +VSSD FEINNE+S LGLHS+N+ LCN  ND SI+ E+M+N P + ST+K    +HQ+NN  +ED TKL  N+Y+ERN FH+ GN K A
Subjt:  NFNEASTLLKILVSSDTFEINNEMSALGLHSSNNQLCNRKNDASISLEKMVNIPGIGSTLK---DVHQDNNACQEDDTKLIENNYYERNFFHDAGNPKRA

Query:  LG-STSIPIEPEWEEDDIYLTHRKDAVAMMRSASQHSRAATNAYLKKDHASAKYHSSRAQEQWLAAKKLNDKAAKKILRTRNSKNGLWKLDLHGLHAAEA
        LG S S+PIEPEWEEDDIYL+HRKDA+AMMRSASQHSRAATNAY +KDHASAKYHSSRA+EQWLAAK LNDKAA +IL+TRNSKNGLWKLDLHGLHAAEA
Subjt:  LG-STSIPIEPEWEEDDIYLTHRKDAVAMMRSASQHSRAATNAYLKKDHASAKYHSSRAQEQWLAAKKLNDKAAKKILRTRNSKNGLWKLDLHGLHAAEA

Query:  VQALQENLLKIETQYASNRSLSPKKAERKGFLRASSLESLSCFDSKLNKASQSPSSRHRPTSLEVITGRGKHSRGEAALPKAVTNYLSENGYRFEQLRPG
        VQAL ++LLKIETQ ASNRSLSPKKAERKGF RASSLE LSC +SKL+K  +SPSSRHRPTSLEVITG GKHS+GEAALPKAV ++L+ENGYRFEQ RPG
Subjt:  VQALQENLLKIETQYASNRSLSPKKAERKGFLRASSLESLSCFDSKLNKASQSPSSRHRPTSLEVITGRGKHSRGEAALPKAVTNYLSENGYRFEQLRPG

Query:  TISVRPKFRR
        TISVRPKFRR
Subjt:  TISVRPKFRR

A0A1S3BRS7 uncharacterized protein LOC1034925903.9e-21267.7Show/hide
Query:  LSSVRGKSLG-AAVNLKTQTRGFQDEIGLEPLPPVPSGLSSLPPRENLPRFNGHSGRSLSPAPLPSGNS----------LTSAENYGARKTVLGNSSTRN
        +S VRGKS G AA NLK Q  G QDE+  +P PP+ + LSSLPPRENL   NG SGRS S AP+PS +S           T+  N+ A+KT+LG S+ ++
Subjt:  LSSVRGKSLG-AAVNLKTQTRGFQDEIGLEPLPPVPSGLSSLPPRENLPRFNGHSGRSLSPAPLPSGNS----------LTSAENYGARKTVLGNSSTRN

Query:  GKKVVEETTDA-TFWKLKELHPWADYSLIMDVMEAVNNNFNDASTLLKTMVSSDKFEINNEISTFGLHSSNDLSSVRGESPG-AAANLKTQTRGFQDEVG
        GKK+VEET D  +FWKLKELHPWAD SLIMD+MEAVNN+FN+ASTLL TMVSSD  EINNE+S  GLHSSNDLS + G+SPG    NL+   RG Q E  
Subjt:  GKKVVEETTDA-TFWKLKELHPWADYSLIMDVMEAVNNNFNDASTLLKTMVSSDKFEINNEISTFGLHSSNDLSSVRGESPG-AAANLKTQTRGFQDEVG

Query:  LVPLAPISSSLSSLPPREKLPRVDGHSRRSFSSAPLPSGNSPTLAENCGAIKTLHGNFSTRGDKKVIEETTDV-TFWKLKELHPWADFSLIVGIMEAVNN
             P+ ++  SLPP E L  V G   RSF+S PLPS +S T   N GA  T+  +   +  KKV+EE TDV  FWKLKE+H WADFSLIV IM+AVNN
Subjt:  LVPLAPISSSLSSLPPREKLPRVDGHSRRSFSSAPLPSGNSPTLAENCGAIKTLHGNFSTRGDKKVIEETTDV-TFWKLKELHPWADFSLIVGIMEAVNN

Query:  NFNEASTLLKILVSSDTFEINNEMSALGLHSSNNQLCNRKNDASISLEKMVNIPGIGSTLK---DVHQDNNACQEDDTKLIENNYYERNFFHDAGNPKRA
        NF+EASTLLK +VSSD FEINNE+S LGLHS+N+ LCN  ND SIS E+ +N P +  TLK    +HQ++N   ED TKL  N+Y+ERNFF +AGN K A
Subjt:  NFNEASTLLKILVSSDTFEINNEMSALGLHSSNNQLCNRKNDASISLEKMVNIPGIGSTLK---DVHQDNNACQEDDTKLIENNYYERNFFHDAGNPKRA

Query:  LG-STSIPIEPEWEEDDIYLTHRKDAVAMMRSASQHSRAATNAYLKKDHASAKYHSSRAQEQWLAAKKLNDKAAKKILRTRNSKNGLWKLDLHGLHAAEA
        LG S S+PIEPEWEEDDIYL+HRKDA+AMMRSASQHSRAATNAY +KDHASAKYHSSRAQEQWLAAK LNDKAA +IL+TRNSKNGLWKLDLHGLHAAEA
Subjt:  LG-STSIPIEPEWEEDDIYLTHRKDAVAMMRSASQHSRAATNAYLKKDHASAKYHSSRAQEQWLAAKKLNDKAAKKILRTRNSKNGLWKLDLHGLHAAEA

Query:  VQALQENLLKIETQYASNRSLSPKKAERKGFLRASSLESLSCFDSKLNKASQSPSSRHRPTSLEVITGRGKHSRGEAALPKAVTNYLSENGYRFEQLRPG
        VQALQ++LLKIETQ ASNRSLSPKKAERKGF RASSLE LSC D+KL+K  +SPSSRHRPTSLEVITG GKHS+GEAALPKAVT++L+ENGYRFEQ RPG
Subjt:  VQALQENLLKIETQYASNRSLSPKKAERKGFLRASSLESLSCFDSKLNKASQSPSSRHRPTSLEVITGRGKHSRGEAALPKAVTNYLSENGYRFEQLRPG

Query:  TISVRPKFRR
        TISVRPKFRR
Subjt:  TISVRPKFRR

A0A5D3CAF0 Smr (Small MutS Related) domain-containing protein, putative isoform 11.3e-21267.7Show/hide
Query:  LSSVRGKSLG-AAVNLKTQTRGFQDEIGLEPLPPVPSGLSSLPPRENLPRFNGHSGRSLSPAPLPSGNS----------LTSAENYGARKTVLGNSSTRN
        +S VRGKS G AA NLK Q  G QDE+  +P PP+ + LSSLPPRENL   NG SGRS S AP+PS +S           T+  N+ A+KT+LG S+ ++
Subjt:  LSSVRGKSLG-AAVNLKTQTRGFQDEIGLEPLPPVPSGLSSLPPRENLPRFNGHSGRSLSPAPLPSGNS----------LTSAENYGARKTVLGNSSTRN

Query:  GKKVVEETTDA-TFWKLKELHPWADYSLIMDVMEAVNNNFNDASTLLKTMVSSDKFEINNEISTFGLHSSNDLSSVRGESPG-AAANLKTQTRGFQDEVG
        GKK+VEET D  +FWKLKELHPWAD SLIMD+MEAVNN+FN+ASTLL TMVSSD  EINNE+S  GLHSSNDLS + G+SPG    NL+   RG Q E  
Subjt:  GKKVVEETTDA-TFWKLKELHPWADYSLIMDVMEAVNNNFNDASTLLKTMVSSDKFEINNEISTFGLHSSNDLSSVRGESPG-AAANLKTQTRGFQDEVG

Query:  LVPLAPISSSLSSLPPREKLPRVDGHSRRSFSSAPLPSGNSPTLAENCGAIKTLHGNFSTRGDKKVIEETTDV-TFWKLKELHPWADFSLIVGIMEAVNN
             P+ ++  SLPP E L  V GH  RSF+S PLPS +S T   N GA  T+  +   +  KKV+EE TDV  FWKLKE+H WADFSLIV IM+AVNN
Subjt:  LVPLAPISSSLSSLPPREKLPRVDGHSRRSFSSAPLPSGNSPTLAENCGAIKTLHGNFSTRGDKKVIEETTDV-TFWKLKELHPWADFSLIVGIMEAVNN

Query:  NFNEASTLLKILVSSDTFEINNEMSALGLHSSNNQLCNRKNDASISLEKMVNIPGIGSTLK---DVHQDNNACQEDDTKLIENNYYERNFFHDAGNPKRA
        NF+EASTLLK +VSSD FEINNE+S LGLH +N+ LCN  ND SIS E+ +N P +  TLK    +HQ++N   ED TKL  N+Y+ERNFF +AGN K A
Subjt:  NFNEASTLLKILVSSDTFEINNEMSALGLHSSNNQLCNRKNDASISLEKMVNIPGIGSTLK---DVHQDNNACQEDDTKLIENNYYERNFFHDAGNPKRA

Query:  LG-STSIPIEPEWEEDDIYLTHRKDAVAMMRSASQHSRAATNAYLKKDHASAKYHSSRAQEQWLAAKKLNDKAAKKILRTRNSKNGLWKLDLHGLHAAEA
        LG S S+PIEPEWEEDD+YL+HRKDA+AMMRSASQHSRAATNAY +KDHASAKYHSSRAQEQWLAAK LNDKAA +IL+TRNSKNGLWKLDLHGLHAAEA
Subjt:  LG-STSIPIEPEWEEDDIYLTHRKDAVAMMRSASQHSRAATNAYLKKDHASAKYHSSRAQEQWLAAKKLNDKAAKKILRTRNSKNGLWKLDLHGLHAAEA

Query:  VQALQENLLKIETQYASNRSLSPKKAERKGFLRASSLESLSCFDSKLNKASQSPSSRHRPTSLEVITGRGKHSRGEAALPKAVTNYLSENGYRFEQLRPG
        VQALQ++LLKIETQ ASNRSLSPKKAERKGF RASSLE LSC DSKL+K  +SPSSRHRPTSLEVITG GKHS+GEAALPKAVT++L+ENGYRFEQ RPG
Subjt:  VQALQENLLKIETQYASNRSLSPKKAERKGFLRASSLESLSCFDSKLNKASQSPSSRHRPTSLEVITGRGKHSRGEAALPKAVTNYLSENGYRFEQLRPG

Query:  TISVRPKFRR
        TISVRPKFRR
Subjt:  TISVRPKFRR

A0A6J1GN51 uncharacterized protein LOC111455928 isoform X12.5e-21972.01Show/hide
Query:  RGKSLG-AAVNLKTQTRGFQDEIGLEPLPPVPSGLSSLPPRENLPRFNGHSGRSLSPAPLPSGNSLTSAENYGARKTVLGNSSTRNGKKVVEETTDA-TF
        RGKS G AAVNLK Q  G QDEI  +P PP+ + LS LPPREN+ R NG SGRS S  PLPS +SL S EN+G +KT+ GNSS R+GKK+VEE+TD   F
Subjt:  RGKSLG-AAVNLKTQTRGFQDEIGLEPLPPVPSGLSSLPPRENLPRFNGHSGRSLSPAPLPSGNSLTSAENYGARKTVLGNSSTRNGKKVVEETTDA-TF

Query:  WKLKELHPWADYSLIMDVMEAVNNNFNDASTLLKTMVSSDKFEINNEISTFGLHSSNDLSSVRGESPG-AAANLKTQTRGFQDEVGLVPLAPISSSLSSL
        WKLKELH WAD SLI+D+MEAVNNNFN+AS LLKTMVSSD FEINNE+ST GLHSSND+S VRG+SPG    NLK Q RG QD +   P  P+ S+LSSL
Subjt:  WKLKELHPWADYSLIMDVMEAVNNNFNDASTLLKTMVSSDKFEINNEISTFGLHSSNDLSSVRGESPG-AAANLKTQTRGFQDEVGLVPLAPISSSLSSL

Query:  PPREKLPRVDGHSRRSFSSAPLPSGNSPTLAENCGAIKTLHGNFSTRGDKKVIEETTDV-TFWKLKELHPWADFSLIVGIMEAVNNNFNEASTLLKILVS
        PPRE L  V+G   RS SS+PLPS +S TL EN  A K L G+ S +  +KV+EETTDV  FWKLKELH WADFSLIV IMEAV+NNFNEAST L  +VS
Subjt:  PPREKLPRVDGHSRRSFSSAPLPSGNSPTLAENCGAIKTLHGNFSTRGDKKVIEETTDV-TFWKLKELHPWADFSLIVGIMEAVNNNFNEASTLLKILVS

Query:  SDTFEINNEMSALGLHSSNNQLCNRKNDASISLEKMVNIPGIGSTLKDVHQDNNACQEDDTKLIENNYYERNFFHDAGNPKRAL-GSTSIPIEPEWEEDD
        SD  EI NEMS LGLHS++   CN KND +ISL + VN P   STLKDV QD +  Q  + KL ENNY+ERNFFH+ GNPK AL  S S PIEPEWEEDD
Subjt:  SDTFEINNEMSALGLHSSNNQLCNRKNDASISLEKMVNIPGIGSTLKDVHQDNNACQEDDTKLIENNYYERNFFHDAGNPKRAL-GSTSIPIEPEWEEDD

Query:  IYLTHRKDAVAMMRSASQHSRAATNAYLKKDHASAKYHSSRAQEQWLAAKKLNDKAAKKILRTRNSKNGLWKLDLHGLHAAEAVQALQENLLKIETQYAS
        IYL+HRKDA+AMMRSASQHSRAATNAYL+KDHASAKYHSSRAQEQWLAAK LN KAA +IL+TRNS+NGLWKLDLHGLHAAEAVQALQ++LLKIET+ AS
Subjt:  IYLTHRKDAVAMMRSASQHSRAATNAYLKKDHASAKYHSSRAQEQWLAAKKLNDKAAKKILRTRNSKNGLWKLDLHGLHAAEAVQALQENLLKIETQYAS

Query:  NRSLSPKKAERKGFLRASSLESLSCFDSKLNKASQSPSSRHRPTSLEVITGRGKHSRGEAALPKAVTNYLSENGYRFEQLRPGTISVRPKFRR
        NRSLSPKKAERKGF R SSLE LSC   KL+K  QSP  RHRPTSLEVITG GKHSRGEAALPKAVT++LSENGYRFEQLRPGTISVRPKFRR
Subjt:  NRSLSPKKAERKGFLRASSLESLSCFDSKLNKASQSPSSRHRPTSLEVITGRGKHSRGEAALPKAVTNYLSENGYRFEQLRPGTISVRPKFRR

A0A6J1GPE7 uncharacterized protein LOC111455928 isoform X24.6e-21370.83Show/hide
Query:  RGKSLG-AAVNLKTQTRGFQDEIGLEPLPPVPSGLSSLPPRENLPRFNGHSGRSLSPAPLPSGNSLTSAENYGARKTVLGNSSTRNGKKVVEETTDA-TF
        RGKS G AAVNLK Q  G QDEI  +P PP+ + LS LPPREN+ R NG SGRS             S EN+G +KT+ GNSS R+GKK+VEE+TD   F
Subjt:  RGKSLG-AAVNLKTQTRGFQDEIGLEPLPPVPSGLSSLPPRENLPRFNGHSGRSLSPAPLPSGNSLTSAENYGARKTVLGNSSTRNGKKVVEETTDA-TF

Query:  WKLKELHPWADYSLIMDVMEAVNNNFNDASTLLKTMVSSDKFEINNEISTFGLHSSNDLSSVRGESPG-AAANLKTQTRGFQDEVGLVPLAPISSSLSSL
        WKLKELH WAD SLI+D+MEAVNNNFN+AS LLKTMVSSD FEINNE+ST GLHSSND+S VRG+SPG    NLK Q RG QD +   P  P+ S+LSSL
Subjt:  WKLKELHPWADYSLIMDVMEAVNNNFNDASTLLKTMVSSDKFEINNEISTFGLHSSNDLSSVRGESPG-AAANLKTQTRGFQDEVGLVPLAPISSSLSSL

Query:  PPREKLPRVDGHSRRSFSSAPLPSGNSPTLAENCGAIKTLHGNFSTRGDKKVIEETTDV-TFWKLKELHPWADFSLIVGIMEAVNNNFNEASTLLKILVS
        PPRE L  V+G   RS SS+PLPS +S TL EN  A K L G+ S +  +KV+EETTDV  FWKLKELH WADFSLIV IMEAV+NNFNEAST L  +VS
Subjt:  PPREKLPRVDGHSRRSFSSAPLPSGNSPTLAENCGAIKTLHGNFSTRGDKKVIEETTDV-TFWKLKELHPWADFSLIVGIMEAVNNNFNEASTLLKILVS

Query:  SDTFEINNEMSALGLHSSNNQLCNRKNDASISLEKMVNIPGIGSTLKDVHQDNNACQEDDTKLIENNYYERNFFHDAGNPKRAL-GSTSIPIEPEWEEDD
        SD  EI NEMS LGLHS++   CN KND +ISL + VN P   STLKDV QD +  Q  + KL ENNY+ERNFFH+ GNPK AL  S S PIEPEWEEDD
Subjt:  SDTFEINNEMSALGLHSSNNQLCNRKNDASISLEKMVNIPGIGSTLKDVHQDNNACQEDDTKLIENNYYERNFFHDAGNPKRAL-GSTSIPIEPEWEEDD

Query:  IYLTHRKDAVAMMRSASQHSRAATNAYLKKDHASAKYHSSRAQEQWLAAKKLNDKAAKKILRTRNSKNGLWKLDLHGLHAAEAVQALQENLLKIETQYAS
        IYL+HRKDA+AMMRSASQHSRAATNAYL+KDHASAKYHSSRAQEQWLAAK LN KAA +IL+TRNS+NGLWKLDLHGLHAAEAVQALQ++LLKIET+ AS
Subjt:  IYLTHRKDAVAMMRSASQHSRAATNAYLKKDHASAKYHSSRAQEQWLAAKKLNDKAAKKILRTRNSKNGLWKLDLHGLHAAEAVQALQENLLKIETQYAS

Query:  NRSLSPKKAERKGFLRASSLESLSCFDSKLNKASQSPSSRHRPTSLEVITGRGKHSRGEAALPKAVTNYLSENGYRFEQLRPGTISVRPKFRR
        NRSLSPKKAERKGF R SSLE LSC   KL+K  QSP  RHRPTSLEVITG GKHSRGEAALPKAVT++LSENGYRFEQLRPGTISVRPKFRR
Subjt:  NRSLSPKKAERKGFLRASSLESLSCFDSKLNKASQSPSSRHRPTSLEVITGRGKHSRGEAALPKAVTNYLSENGYRFEQLRPGTISVRPKFRR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G23520.1 smr (Small MutS Related) domain-containing protein1.7e-7141.78Show/hide
Query:  LSSVRGESPG-AAANLK-TQTRGFQDEVGLVPLAPISSSL-SSLPPREKLPRVDGHSRRSFSSAPLPSGNSPTLAENCGAIKTLHGNFSTR--GDKKVIE
        +S ++G+S G  A +LK  Q +G + EV   P  P+S+S+ +S   R +L R    S +SFSS  LP    P L EN        G    R      +  
Subjt:  LSSVRGESPG-AAANLK-TQTRGFQDEVGLVPLAPISSSL-SSLPPREKLPRVDGHSRRSFSSAPLPSGNSPTLAENCGAIKTLHGNFSTR--GDKKVIE

Query:  ETTDVTFWKLKELHPWADFSLIVGIMEAVNNNFNEASTLLKILVSSDTFEINNEMSALGLHSSNNQLCNRKNDASISLEKMVNIPGIGSTLKDVHQDNNA
         + D+ F KLKE++ WAD +LI  ++ +  ++F  A   LK +VSS   +        G  S N +   R  + +++    +      ST +D  +  + 
Subjt:  ETTDVTFWKLKELHPWADFSLIVGIMEAVNNNFNEASTLLKILVSSDTFEINNEMSALGLHSSNNQLCNRKNDASISLEKMVNIPGIGSTLKDVHQDNNA

Query:  CQEDDTKLIENNYYERNFFHDAGNPKRALGS-TSIPIEPEWEEDDIYLTHRKDAVAMMRSASQHSRAATNAYLKKDHASAKYHSSRAQEQWLAAKKLNDK
           D +  + N      F  D       +    SIPIEPEWEEDD+YL+HRKDA+ +MRSAS HSRAA NA+ + DHASAK HS +A+E WLAA+KLN +
Subjt:  CQEDDTKLIENNYYERNFFHDAGNPKRALGS-TSIPIEPEWEEDDIYLTHRKDAVAMMRSASQHSRAATNAYLKKDHASAKYHSSRAQEQWLAAKKLNDK

Query:  AAKKILRTRNSKNGLWKLDLHGLHAAEAVQALQENLLKIETQYASNRSLSPKKAERK-GFLRASSLESLSCFDSKLNKASQSPSSRHRPTSLEVITGRGK
        AAKKI+   N  N +WKLDLHGLHA EAVQALQE L  IE  +  NRS+SP +   K   LR++S E     D +     Q  SSR    SL+VITG GK
Subjt:  AAKKILRTRNSKNGLWKLDLHGLHAAEAVQALQENLLKIETQYASNRSLSPKKAERK-GFLRASSLESLSCFDSKLNKASQSPSSRHRPTSLEVITGRGK

Query:  HSRGEAALPKAVTNYLSENGYRFEQLRPGTISVRPKFR
        HSRG+A+LP AV  +  +N YRF++ RPG I+VRPKFR
Subjt:  HSRGEAALPKAVTNYLSENGYRFEQLRPGTISVRPKFR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACACCTTAGGACTGCATTCCTCTAATGATCTATCATCGGTCAGGGGCAAATCTCTTGGGGCAGCAGTTAACCTTAAGACACAGACTAGAGGCTTTCAAGATGAAAT
TGGTCTGGAACCACTTCCACCAGTACCATCCGGCCTTTCTTCTCTGCCACCCCGTGAAAACTTGCCCAGATTTAATGGTCATTCGGGGAGATCTTTGTCACCTGCACCAC
TTCCTTCTGGCAATTCTCTAACTTCGGCTGAAAACTATGGTGCAAGAAAGACTGTACTTGGAAATTCTAGCACTCGAAATGGCAAGAAGGTGGTTGAAGAAACCACTGAT
GCTACCTTTTGGAAGCTTAAGGAGCTTCATCCTTGGGCTGATTATAGCTTGATTATGGATGTAATGGAAGCTGTAAATAATAACTTTAACGATGCGTCTACTTTATTGAA
AACAATGGTTTCGAGTGACAAGTTTGAGATTAATAATGAGATAAGCACCTTTGGACTGCATTCCTCTAATGATCTATCTTCGGTTAGGGGTGAATCCCCTGGGGCAGCAG
CTAACCTTAAGACACAGACTAGAGGCTTTCAAGATGAAGTTGGTCTGGTTCCACTTGCACCAATATCATCCAGCCTTTCCTCTTTGCCACCTCGTGAAAAGTTGCCCAGA
GTTGATGGGCATTCAAGGAGATCTTTCTCATCTGCACCGCTTCCTTCTGGCAATTCTCCAACTTTGGCAGAAAACTGTGGTGCAATAAAGACACTACATGGAAATTTTAG
CACTCGAGGTGACAAGAAGGTAATTGAAGAAACCACCGATGTAACCTTTTGGAAACTTAAGGAACTTCATCCTTGGGCTGATTTTAGCTTGATTGTGGGTATAATGGAAG
CTGTAAATAATAACTTTAATGAGGCATCTACTTTATTAAAAATATTGGTTTCTAGCGACACTTTTGAGATCAATAATGAGATGAGTGCCTTAGGACTGCATTCCTCCAAT
AATCAATTGTGCAATCGGAAGAATGATGCAAGTATATCATTAGAAAAAATGGTCAATATTCCTGGTATTGGTTCCACACTAAAGGACGTGCATCAAGATAACAATGCATG
TCAAGAAGATGATACAAAATTGATTGAAAATAATTATTACGAAAGGAACTTCTTTCATGATGCTGGAAACCCAAAACGGGCTCTTGGCTCAACGTCTATTCCTATTGAGC
CTGAGTGGGAAGAAGACGATATTTACCTGACCCATCGGAAAGATGCCGTAGCAATGATGAGGTCGGCATCTCAACATTCAAGGGCAGCCACTAATGCATATCTTAAGAAA
GACCATGCTTCTGCCAAGTATCATTCATCAAGGGCTCAAGAACAATGGCTAGCTGCAAAAAAGTTAAATGACAAGGCAGCTAAAAAAATTTTACGAACGAGGAATAGTAA
AAATGGGCTCTGGAAATTGGACCTACATGGGCTTCATGCAGCAGAAGCTGTTCAAGCTTTGCAAGAAAACTTGCTGAAAATAGAAACTCAGTATGCTTCTAATCGGTCGT
TGTCCCCAAAGAAAGCAGAAAGGAAAGGGTTCCTACGAGCTTCATCCCTCGAGTCTCTTAGTTGTTTCGACTCAAAGTTGAACAAAGCATCACAATCACCATCATCTAGG
CATAGGCCCACATCATTGGAAGTCATAACAGGCAGAGGTAAGCATAGCAGGGGGGAAGCTGCTCTACCAAAGGCTGTGACAAATTATCTTAGTGAAAATGGGTACCGTTT
TGAGCAGTTGAGGCCTGGGACGATCAGCGTTCGGCCGAAGTTTCGTAGGTAA
mRNA sequenceShow/hide mRNA sequence
ATGAACACCTTAGGACTGCATTCCTCTAATGATCTATCATCGGTCAGGGGCAAATCTCTTGGGGCAGCAGTTAACCTTAAGACACAGACTAGAGGCTTTCAAGATGAAAT
TGGTCTGGAACCACTTCCACCAGTACCATCCGGCCTTTCTTCTCTGCCACCCCGTGAAAACTTGCCCAGATTTAATGGTCATTCGGGGAGATCTTTGTCACCTGCACCAC
TTCCTTCTGGCAATTCTCTAACTTCGGCTGAAAACTATGGTGCAAGAAAGACTGTACTTGGAAATTCTAGCACTCGAAATGGCAAGAAGGTGGTTGAAGAAACCACTGAT
GCTACCTTTTGGAAGCTTAAGGAGCTTCATCCTTGGGCTGATTATAGCTTGATTATGGATGTAATGGAAGCTGTAAATAATAACTTTAACGATGCGTCTACTTTATTGAA
AACAATGGTTTCGAGTGACAAGTTTGAGATTAATAATGAGATAAGCACCTTTGGACTGCATTCCTCTAATGATCTATCTTCGGTTAGGGGTGAATCCCCTGGGGCAGCAG
CTAACCTTAAGACACAGACTAGAGGCTTTCAAGATGAAGTTGGTCTGGTTCCACTTGCACCAATATCATCCAGCCTTTCCTCTTTGCCACCTCGTGAAAAGTTGCCCAGA
GTTGATGGGCATTCAAGGAGATCTTTCTCATCTGCACCGCTTCCTTCTGGCAATTCTCCAACTTTGGCAGAAAACTGTGGTGCAATAAAGACACTACATGGAAATTTTAG
CACTCGAGGTGACAAGAAGGTAATTGAAGAAACCACCGATGTAACCTTTTGGAAACTTAAGGAACTTCATCCTTGGGCTGATTTTAGCTTGATTGTGGGTATAATGGAAG
CTGTAAATAATAACTTTAATGAGGCATCTACTTTATTAAAAATATTGGTTTCTAGCGACACTTTTGAGATCAATAATGAGATGAGTGCCTTAGGACTGCATTCCTCCAAT
AATCAATTGTGCAATCGGAAGAATGATGCAAGTATATCATTAGAAAAAATGGTCAATATTCCTGGTATTGGTTCCACACTAAAGGACGTGCATCAAGATAACAATGCATG
TCAAGAAGATGATACAAAATTGATTGAAAATAATTATTACGAAAGGAACTTCTTTCATGATGCTGGAAACCCAAAACGGGCTCTTGGCTCAACGTCTATTCCTATTGAGC
CTGAGTGGGAAGAAGACGATATTTACCTGACCCATCGGAAAGATGCCGTAGCAATGATGAGGTCGGCATCTCAACATTCAAGGGCAGCCACTAATGCATATCTTAAGAAA
GACCATGCTTCTGCCAAGTATCATTCATCAAGGGCTCAAGAACAATGGCTAGCTGCAAAAAAGTTAAATGACAAGGCAGCTAAAAAAATTTTACGAACGAGGAATAGTAA
AAATGGGCTCTGGAAATTGGACCTACATGGGCTTCATGCAGCAGAAGCTGTTCAAGCTTTGCAAGAAAACTTGCTGAAAATAGAAACTCAGTATGCTTCTAATCGGTCGT
TGTCCCCAAAGAAAGCAGAAAGGAAAGGGTTCCTACGAGCTTCATCCCTCGAGTCTCTTAGTTGTTTCGACTCAAAGTTGAACAAAGCATCACAATCACCATCATCTAGG
CATAGGCCCACATCATTGGAAGTCATAACAGGCAGAGGTAAGCATAGCAGGGGGGAAGCTGCTCTACCAAAGGCTGTGACAAATTATCTTAGTGAAAATGGGTACCGTTT
TGAGCAGTTGAGGCCTGGGACGATCAGCGTTCGGCCGAAGTTTCGTAGGTAAATGGCTAAAACCCCCATGATCAGTTGGCTTCAAGAGAATGTAAGTTATTTAGTTAGTT
TAGAGTAGTTGTTGAGATTGACTATAAAATGACAGAATGTTATTTAGGAATTTTATCTGTTGAATTGAAATCTTGAACCTTTGCAGTTTGTAAGGTTGTACAGTTGGTGG
TTTGATTGAGAACCACATTACTGTATCACTATGTTCTTTTTCTTGGAAAAAATTGCACTGGCAACATATTCCAAGATTCCTGAGTATTTTATCTTATTTTTTGTATTGAA
ATCCTGAGATGATTTTAGTATGAATCAGTAAGAATATCCAG
Protein sequenceShow/hide protein sequence
MNTLGLHSSNDLSSVRGKSLGAAVNLKTQTRGFQDEIGLEPLPPVPSGLSSLPPRENLPRFNGHSGRSLSPAPLPSGNSLTSAENYGARKTVLGNSSTRNGKKVVEETTD
ATFWKLKELHPWADYSLIMDVMEAVNNNFNDASTLLKTMVSSDKFEINNEISTFGLHSSNDLSSVRGESPGAAANLKTQTRGFQDEVGLVPLAPISSSLSSLPPREKLPR
VDGHSRRSFSSAPLPSGNSPTLAENCGAIKTLHGNFSTRGDKKVIEETTDVTFWKLKELHPWADFSLIVGIMEAVNNNFNEASTLLKILVSSDTFEINNEMSALGLHSSN
NQLCNRKNDASISLEKMVNIPGIGSTLKDVHQDNNACQEDDTKLIENNYYERNFFHDAGNPKRALGSTSIPIEPEWEEDDIYLTHRKDAVAMMRSASQHSRAATNAYLKK
DHASAKYHSSRAQEQWLAAKKLNDKAAKKILRTRNSKNGLWKLDLHGLHAAEAVQALQENLLKIETQYASNRSLSPKKAERKGFLRASSLESLSCFDSKLNKASQSPSSR
HRPTSLEVITGRGKHSRGEAALPKAVTNYLSENGYRFEQLRPGTISVRPKFRR