; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0002288 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0002288
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionSmr domain-containing protein
Genome locationchr4:41324468..41328271
RNA-Seq ExpressionLag0002288
SyntenyLag0002288
Gene Ontology termsNA
InterPro domainsIPR002625 - Smr domain
IPR013899 - Domain of unknown function DUF1771
IPR036063 - Smr domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0059625.1 Smr (Small MutS Related) domain-containing protein, putative isoform 1 [Cucumis melo var. makuwa]1.6e-26879.11Show/hide
Query:  MSWVRGKSPGWASFNLKQQNSDLQDEIDPEPFPPMSTGLSSLPPRENLHRVNGRSGGSFSSAPLPSADSLYLP----------KYFGAKKTILGDSSTQS
        MSWVRGKS GWA+FNLKQQN+ +QDE+D +PFPPMST LSSLPPRENL  VNGRSG SFS AP+PSADS  LP            F AKKTILG S+ QS
Subjt:  MSWVRGKSPGWASFNLKQQNSDLQDEIDPEPFPPMSTGLSSLPPRENLHRVNGRSGGSFSSAPLPSADSLYLP----------KYFGAKKTILGDSSTQS

Query:  GKKVVEETTDVLAFWKLKELHSWADISLIVDIMEAVNNDFNEASTLLKTMVSSDNFEINNEMSTLGLHSSNDLSWMRGKSPGWAAFDLKQQNKGIQDEID
        GKK+VEET DVL+FWKLKELH WADISLI+DIMEAVNNDFNEASTLL TMVSSDN EINNEMS LGLHSSNDLSWM GKSPGW  F+L+Q N+G+Q E D
Subjt:  GKKVVEETTDVLAFWKLKELHSWADISLIVDIMEAVNNDFNEASTLLKTMVSSDNFEINNEMSTLGLHSSNDLSWMRGKSPGWAAFDLKQQNKGIQDEID

Query:  PESFPPISNGLSSLPPCENLHRVNGRSGRSFSSAPLPSVDSLTSPDNYGAKKTILGDSSVQSGKKVVEETTDVLAFWKLKELHSWAGISLIVDIMEAVNN
        PE+FPP+     SLPP ENLH V G  GRSF+S PLPS DSLTSP NYGAK TI  DS +QSGKKVVEE TDVLAFWKLKE+HSWA  SLIVDIM+AVNN
Subjt:  PESFPPISNGLSSLPPCENLHRVNGRSGRSFSSAPLPSVDSLTSPDNYGAKKTILGDSSVQSGKKVVEETTDVLAFWKLKELHSWAGISLIVDIMEAVNN

Query:  NFNEASTLLKTMVSSDNFEINNKMSTLGLHSSNDLSCNGNNDVSISLEKTVNIPILSCSLK---DVHQNNNACQEDDTKLFENNYYEWNFFHDVGNPKLA
        NF+EASTLLKTMVSSDNFEINN++STLGLH +NDL CNG+NDVSIS E+T+N PILS +LK    +HQN+N   ED TKLF N+Y+E NFF + GN K+A
Subjt:  NFNEASTLLKTMVSSDNFEINNKMSTLGLHSSNDLSCNGNNDVSISLEKTVNIPILSCSLK---DVHQNNNACQEDDTKLFENNYYEWNFFHDVGNPKLA

Query:  LGCPKSVPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEEWLAAKRLNDKAANEILRTRNSKNGIWKLDLHGLHSAEA
        LGC KSVPIEPEWEEDD+YLSHRKDAIAMMRSASQHSRAATNAY RKDHASAKYHSSRAQE+WLAAK LNDKAANEIL+TRNSKNG+WKLDLHGLH+AEA
Subjt:  LGCPKSVPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEEWLAAKRLNDKAANEILRTRNSKNGIWKLDLHGLHSAEA

Query:  VQALREHLLKIETQNASNRSLSPKKAERKGFLRASSLESFSCIDSKLDKESPSSRHRPISLEVITGIGKHSKGEAALPKAVTNFLSENGYRFEQLRPGTI
        VQAL++HLLKIETQNASNRSLSPKKAERKGF RASSLE  SC+DSKLDKESPSSRHRP SLEVITGIGKHSKGEAALPKAVT+FL+ENGYRFEQ RPGTI
Subjt:  VQALREHLLKIETQNASNRSLSPKKAERKGFLRASSLESFSCIDSKLDKESPSSRHRPISLEVITGIGKHSKGEAALPKAVTNFLSENGYRFEQLRPGTI

Query:  SVRPKFHR
        SVRPKF R
Subjt:  SVRPKFHR

XP_004148966.1 uncharacterized protein LOC101223137 [Cucumis sativus]2.1e-27379.93Show/hide
Query:  MSWVRGKSPGWASFNLKQQNSDLQDEIDPEPFPPMSTGLSSLPPRENLHRVNGRSGGSFSSAPLPSADSLYLP-KY---------FGAKKTILGDSSTQS
        MSWVRGKS GWA+FNLKQQN+ LQDE+D +PFPPMST LSSLPPRENL  VNG SG SFS AP+PSADS  LP K+         FGAKKTILG ++ QS
Subjt:  MSWVRGKSPGWASFNLKQQNSDLQDEIDPEPFPPMSTGLSSLPPRENLHRVNGRSGGSFSSAPLPSADSLYLP-KY---------FGAKKTILGDSSTQS

Query:  GKKVVEETTDVLAFWKLKELHSWADISLIVDIMEAVNNDFNEASTLLKTMVSSDNFEINNEMSTLGLHSSNDLSWMRGKSPGWAAFDLKQQNKGIQDEID
        GKK+VEET DVL+FWKLKELH WADISLI+DIMEAVNNDFNEASTLL TMVSSDN EINN+MSTLGLHSSNDL WM GKSPGW  F+LKQ NKG+QDE+D
Subjt:  GKKVVEETTDVLAFWKLKELHSWADISLIVDIMEAVNNDFNEASTLLKTMVSSDNFEINNEMSTLGLHSSNDLSWMRGKSPGWAAFDLKQQNKGIQDEID

Query:  PESFPPISNGLSSLPPCENLHRVNGRSGRSFSSAPLPSVDSLTSPDNYGAKKTILGDSSVQSGKKVVEETTDVLAFWKLKELHSWAGISLIVDIMEAVNN
         E+FPP+    SSLPP ENLH V GRSGRSF+S PLPSVDSLTSP+NYGAK TI  DSS+QSGKKVVEE TDVLAFWKLKE+HSWA  SLIVDIM+AVNN
Subjt:  PESFPPISNGLSSLPPCENLHRVNGRSGRSFSSAPLPSVDSLTSPDNYGAKKTILGDSSVQSGKKVVEETTDVLAFWKLKELHSWAGISLIVDIMEAVNN

Query:  NFNEASTLLKTMVSSDNFEINNKMSTLGLHSSNDLSCNGNNDVSISLEKTVNIPILSCSLK---DVHQNNNACQEDDTKLFENNYYEWNFFHDVGNPKLA
        NF+EASTLLKTMVSSDNFEINN++STLGLHS+NDL CNGNNDVSI+ E+ +N PILS ++K    +HQNNN  +ED TKLF N+Y+E N FH+ GN K+A
Subjt:  NFNEASTLLKTMVSSDNFEINNKMSTLGLHSSNDLSCNGNNDVSISLEKTVNIPILSCSLK---DVHQNNNACQEDDTKLFENNYYEWNFFHDVGNPKLA

Query:  LGCPKSVPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEEWLAAKRLNDKAANEILRTRNSKNGIWKLDLHGLHSAEA
        LGC KSVPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAY RKDHASAKYHSSRA+E+WLAAK LNDKAANEIL+TRNSKNG+WKLDLHGLH+AEA
Subjt:  LGCPKSVPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEEWLAAKRLNDKAANEILRTRNSKNGIWKLDLHGLHSAEA

Query:  VQALREHLLKIETQNASNRSLSPKKAERKGFLRASSLESFSCIDSKLDKESPSSRHRPISLEVITGIGKHSKGEAALPKAVTNFLSENGYRFEQLRPGTI
        VQAL +HLLKIETQNASNRSLSPKKAERKGF RASSLE  SC++SKLDKESPSSRHRP SLEVITGIGKHSKGEAALPKAV +FL+ENGYRFEQ RPGTI
Subjt:  VQALREHLLKIETQNASNRSLSPKKAERKGFLRASSLESFSCIDSKLDKESPSSRHRPISLEVITGIGKHSKGEAALPKAVTNFLSENGYRFEQLRPGTI

Query:  SVRPKFHR
        SVRPKF R
Subjt:  SVRPKFHR

XP_008451240.1 PREDICTED: uncharacterized protein LOC103492590 [Cucumis melo]1.4e-26979.44Show/hide
Query:  MSWVRGKSPGWASFNLKQQNSDLQDEIDPEPFPPMSTGLSSLPPRENLHRVNGRSGGSFSSAPLPSADSLYLP----------KYFGAKKTILGDSSTQS
        MSWVRGKS GWA+FNLKQQN+ +QDE+D +PFPPMST LSSLPPRENL  VNGRSG SFS AP+PSADS  LP            F AKKTILG S+ QS
Subjt:  MSWVRGKSPGWASFNLKQQNSDLQDEIDPEPFPPMSTGLSSLPPRENLHRVNGRSGGSFSSAPLPSADSLYLP----------KYFGAKKTILGDSSTQS

Query:  GKKVVEETTDVLAFWKLKELHSWADISLIVDIMEAVNNDFNEASTLLKTMVSSDNFEINNEMSTLGLHSSNDLSWMRGKSPGWAAFDLKQQNKGIQDEID
        GKK+VEET DVL+FWKLKELH WADISLI+DIMEAVNNDFNEASTLL TMVSSDN EINNEMS LGLHSSNDLSWM GKSPGW  F+L+Q N+G+Q E D
Subjt:  GKKVVEETTDVLAFWKLKELHSWADISLIVDIMEAVNNDFNEASTLLKTMVSSDNFEINNEMSTLGLHSSNDLSWMRGKSPGWAAFDLKQQNKGIQDEID

Query:  PESFPPISNGLSSLPPCENLHRVNGRSGRSFSSAPLPSVDSLTSPDNYGAKKTILGDSSVQSGKKVVEETTDVLAFWKLKELHSWAGISLIVDIMEAVNN
        PE+FPP+     SLPP ENLH V GR GRSF+S PLPS DSLTSP NYGAK TI  DS +QSGKKVVEE TDVLAFWKLKE+HSWA  SLIVDIM+AVNN
Subjt:  PESFPPISNGLSSLPPCENLHRVNGRSGRSFSSAPLPSVDSLTSPDNYGAKKTILGDSSVQSGKKVVEETTDVLAFWKLKELHSWAGISLIVDIMEAVNN

Query:  NFNEASTLLKTMVSSDNFEINNKMSTLGLHSSNDLSCNGNNDVSISLEKTVNIPILSCSLK---DVHQNNNACQEDDTKLFENNYYEWNFFHDVGNPKLA
        NF+EASTLLKTMVSSDNFEINN++STLGLHS+NDL CNG+NDVSIS E+T+N PILS +LK    +HQN+N   ED TKLF N+Y+E NFF + GN K+A
Subjt:  NFNEASTLLKTMVSSDNFEINNKMSTLGLHSSNDLSCNGNNDVSISLEKTVNIPILSCSLK---DVHQNNNACQEDDTKLFENNYYEWNFFHDVGNPKLA

Query:  LGCPKSVPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEEWLAAKRLNDKAANEILRTRNSKNGIWKLDLHGLHSAEA
        LGC KSVPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAY RKDHASAKYHSSRAQE+WLAAK LNDKAANEIL+TRNSKNG+WKLDLHGLH+AEA
Subjt:  LGCPKSVPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEEWLAAKRLNDKAANEILRTRNSKNGIWKLDLHGLHSAEA

Query:  VQALREHLLKIETQNASNRSLSPKKAERKGFLRASSLESFSCIDSKLDKESPSSRHRPISLEVITGIGKHSKGEAALPKAVTNFLSENGYRFEQLRPGTI
        VQAL++HLLKIETQNASNRSLSPKKAERKGF RASSLE  SC+D+KLDKESPSSRHRP SLEVITGIGKHSKGEAALPKAVT+FL+ENGYRFEQ RPGTI
Subjt:  VQALREHLLKIETQNASNRSLSPKKAERKGFLRASSLESFSCIDSKLDKESPSSRHRPISLEVITGIGKHSKGEAALPKAVTNFLSENGYRFEQLRPGTI

Query:  SVRPKFHR
        SVRPKF R
Subjt:  SVRPKFHR

XP_023548349.1 uncharacterized protein LOC111807017 [Cucurbita pepo subsp. pepo]4.6e-26880.83Show/hide
Query:  MSWVRGKSPGWASFNLKQQNSDLQDEIDPEPFPPMSTGLSSLPPRENLHRVNGRSGGSFSSAPLPSADSLYLPKYFGAKKTILGDSSTQSGKKVVEETTD
        MSW RGKSPGWA+ NLKQQNS LQDEIDP+PFPPMST LS LPPREN+HRVNGRSG SFSS PLPSADSL  PK FGAKKTI G+SS +SGKK+VEE+TD
Subjt:  MSWVRGKSPGWASFNLKQQNSDLQDEIDPEPFPPMSTGLSSLPPRENLHRVNGRSGGSFSSAPLPSADSLYLPKYFGAKKTILGDSSTQSGKKVVEETTD

Query:  VLAFWKLKELHSWADISLIVDIMEAVNNDFNEASTLLKTMVSSDNFEINNEMSTLGLHSSNDLSWMRGKSPGWAAFDLKQQNKGIQDEIDPESFPPISNG
        VLAFWKLKELHSWADISLIVDIMEAVNN+FNEAS LLKTMVSSDNFEINNEMSTLGLHSSND+S +RGKSPGW  ++LKQQN+G+QD IDP+ FPP+ + 
Subjt:  VLAFWKLKELHSWADISLIVDIMEAVNNDFNEASTLLKTMVSSDNFEINNEMSTLGLHSSNDLSWMRGKSPGWAAFDLKQQNKGIQDEIDPESFPPISNG

Query:  LSSLPPCENLHRVNGRSGRSFSSAPLPSVDSLTSPDNYGAKKTILGDSSVQSGKKVVEETTDVLAFWKLKELHSWAGISLIVDIMEAVNNNFNEASTLLK
        LSSLPP ENLH V GR GRS SS+PLPS DSLTSP+NY AKK ILGDSS+Q+G+KVVEETTDVLAFWKLKELH+WA  SLIVDIMEAV+NNFNEAST L 
Subjt:  LSSLPPCENLHRVNGRSGRSFSSAPLPSVDSLTSPDNYGAKKTILGDSSVQSGKKVVEETTDVLAFWKLKELHSWAGISLIVDIMEAVNNNFNEASTLLK

Query:  TMVSSDNFEINNKMSTLGLHSSNDLSCNGNNDVSISLEKTVNIPILSCSLKDV---HQNNNACQEDDTKLFENNYYEWNFFHDVGNPKLALGCPKSVPIE
         MVSSDN EI N+MSTLGLHS++ LSC G NDV+ISL +TVN PI S +LKDV   HQN N       KLFENNY+E NFFH+VGNPK+AL C KS PIE
Subjt:  TMVSSDNFEINNKMSTLGLHSSNDLSCNGNNDVSISLEKTVNIPILSCSLKDV---HQNNNACQEDDTKLFENNYYEWNFFHDVGNPKLALGCPKSVPIE

Query:  PEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEEWLAAKRLNDKAANEILRTRNSKNGIWKLDLHGLHSAEAVQALREHLLK
        PEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQE+WLAAK LN KAANEIL+TRNS+NG+WKLDLHGLH+AEAVQAL++HLLK
Subjt:  PEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEEWLAAKRLNDKAANEILRTRNSKNGIWKLDLHGLHSAEAVQALREHLLK

Query:  IETQNASNRSLSPKKAERKGFLRASSLESFSCIDSKLDKE--SPSSRHRPISLEVITGIGKHSKGEAALPKAVTNFLSENGYRFEQLRPGTISVRPKFHR
        IET+NASNRSLSPKKAERKGF R SSLE  SC+  KLDKE  SP  RHRP SLEVITGIGKHS+GEAALPKAVT+FLSENGYRFEQLRPGTISVRPKF R
Subjt:  IETQNASNRSLSPKKAERKGFLRASSLESFSCIDSKLDKE--SPSSRHRPISLEVITGIGKHSKGEAALPKAVTNFLSENGYRFEQLRPGTISVRPKFHR

XP_038898473.1 uncharacterized protein LOC120086100 [Benincasa hispida]1.2e-26878.95Show/hide
Query:  MSWVRGKSPGWASFNLKQQNSDLQDEIDPEPFPPMSTGLSSLPPRENLHRVNGRSGGSFSSAPLPSADSLYLPKYF----------GAKKTILGDSSTQS
        MSWV+GKSPGWA+FNLKQQN+ LQDE+D +PFPP+ST LSSLPP EN H VNGRSG SFS AP PSA+SL  P+ F          GAKKTIL  S+ Q+
Subjt:  MSWVRGKSPGWASFNLKQQNSDLQDEIDPEPFPPMSTGLSSLPPRENLHRVNGRSGGSFSSAPLPSADSLYLPKYF----------GAKKTILGDSSTQS

Query:  GKKVVEETTDVLAFWKLKELHSWADISLIVDIMEAVNNDFNEASTLLKTMVSSDNFEINNEMSTLGLHSSNDLSWMRGKSPGWAAFDLKQQNKGIQDEID
        GKKVVEET DVL+FWKLKELHSWADISLI+D+MEAVNN+F+EASTLLKTMV+SDNFEINNEMSTLGL  SNDLSW+ G  PGW  F+LKQ N+G+QDE D
Subjt:  GKKVVEETTDVLAFWKLKELHSWADISLIVDIMEAVNNDFNEASTLLKTMVSSDNFEINNEMSTLGLHSSNDLSWMRGKSPGWAAFDLKQQNKGIQDEID

Query:  PESFPPISNGLSSLPPCENLHRVNGRSGRSFSSAPLPSVDSLTSPDNYGAKKTILGDSSVQSGKKVVEETTDVLAFWKLKELHSWAGISLIVDIMEAVNN
         E  PP+  G SSLPPCE+LHRV G SG+SFSS P  S DSLTSP+NYGAKKTI  DSS+QSGKKVVEE+ D LAFWKLKELHSWA  SLIVDIMEAVNN
Subjt:  PESFPPISNGLSSLPPCENLHRVNGRSGRSFSSAPLPSVDSLTSPDNYGAKKTILGDSSVQSGKKVVEETTDVLAFWKLKELHSWAGISLIVDIMEAVNN

Query:  NFNEASTLLKTMVSSDNFEINNKMSTLGLHSSNDLSCNGNNDVSISLEKTVNIPILSCSLKD---VHQNNNACQEDDTKLFENNYYEWNFFHDVGNPKLA
        NFNEASTLLKTMVSSDNF+IN++MSTL L S+NDL CNG NDVS SLE+T NIPI S +LKD   VHQNNNAC+E+ TKLFENNY+E NFFH+ G PK+ 
Subjt:  NFNEASTLLKTMVSSDNFEINNKMSTLGLHSSNDLSCNGNNDVSISLEKTVNIPILSCSLKD---VHQNNNACQEDDTKLFENNYYEWNFFHDVGNPKLA

Query:  LGCPKSVPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEEWLAAKRLNDKAANEILRTRNSKNGIWKLDLHGLHSAEA
        LG  KSVPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQE+WLAAK LNDKAANEIL+TRNSKNG+WKLDLHGLH+AEA
Subjt:  LGCPKSVPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEEWLAAKRLNDKAANEILRTRNSKNGIWKLDLHGLHSAEA

Query:  VQALREHLLKIETQNASNRSLSPKKAERKGFLRASSLESFSCIDSKLDKESPSSRHRPISLEVITGIGKHSKGEAALPKAVTNFLSENGYRFEQLRPGTI
        VQAL+EHLLKIET+NASNRSLSPKK+ERKGF  ASSLE  SC+DSK+DKESPSSRHRP SLEVITGIGKHS+GEA LPKAVT+FLSENGYRFEQLRPGTI
Subjt:  VQALREHLLKIETQNASNRSLSPKKAERKGFLRASSLESFSCIDSKLDKESPSSRHRPISLEVITGIGKHSKGEAALPKAVTNFLSENGYRFEQLRPGTI

Query:  SVRPKFHR
        S+RPKF R
Subjt:  SVRPKFHR

TrEMBL top hitse value%identityAlignment
A0A0A0KA90 Smr domain-containing protein1.0e-27379.93Show/hide
Query:  MSWVRGKSPGWASFNLKQQNSDLQDEIDPEPFPPMSTGLSSLPPRENLHRVNGRSGGSFSSAPLPSADSLYLP-KY---------FGAKKTILGDSSTQS
        MSWVRGKS GWA+FNLKQQN+ LQDE+D +PFPPMST LSSLPPRENL  VNG SG SFS AP+PSADS  LP K+         FGAKKTILG ++ QS
Subjt:  MSWVRGKSPGWASFNLKQQNSDLQDEIDPEPFPPMSTGLSSLPPRENLHRVNGRSGGSFSSAPLPSADSLYLP-KY---------FGAKKTILGDSSTQS

Query:  GKKVVEETTDVLAFWKLKELHSWADISLIVDIMEAVNNDFNEASTLLKTMVSSDNFEINNEMSTLGLHSSNDLSWMRGKSPGWAAFDLKQQNKGIQDEID
        GKK+VEET DVL+FWKLKELH WADISLI+DIMEAVNNDFNEASTLL TMVSSDN EINN+MSTLGLHSSNDL WM GKSPGW  F+LKQ NKG+QDE+D
Subjt:  GKKVVEETTDVLAFWKLKELHSWADISLIVDIMEAVNNDFNEASTLLKTMVSSDNFEINNEMSTLGLHSSNDLSWMRGKSPGWAAFDLKQQNKGIQDEID

Query:  PESFPPISNGLSSLPPCENLHRVNGRSGRSFSSAPLPSVDSLTSPDNYGAKKTILGDSSVQSGKKVVEETTDVLAFWKLKELHSWAGISLIVDIMEAVNN
         E+FPP+    SSLPP ENLH V GRSGRSF+S PLPSVDSLTSP+NYGAK TI  DSS+QSGKKVVEE TDVLAFWKLKE+HSWA  SLIVDIM+AVNN
Subjt:  PESFPPISNGLSSLPPCENLHRVNGRSGRSFSSAPLPSVDSLTSPDNYGAKKTILGDSSVQSGKKVVEETTDVLAFWKLKELHSWAGISLIVDIMEAVNN

Query:  NFNEASTLLKTMVSSDNFEINNKMSTLGLHSSNDLSCNGNNDVSISLEKTVNIPILSCSLK---DVHQNNNACQEDDTKLFENNYYEWNFFHDVGNPKLA
        NF+EASTLLKTMVSSDNFEINN++STLGLHS+NDL CNGNNDVSI+ E+ +N PILS ++K    +HQNNN  +ED TKLF N+Y+E N FH+ GN K+A
Subjt:  NFNEASTLLKTMVSSDNFEINNKMSTLGLHSSNDLSCNGNNDVSISLEKTVNIPILSCSLK---DVHQNNNACQEDDTKLFENNYYEWNFFHDVGNPKLA

Query:  LGCPKSVPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEEWLAAKRLNDKAANEILRTRNSKNGIWKLDLHGLHSAEA
        LGC KSVPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAY RKDHASAKYHSSRA+E+WLAAK LNDKAANEIL+TRNSKNG+WKLDLHGLH+AEA
Subjt:  LGCPKSVPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEEWLAAKRLNDKAANEILRTRNSKNGIWKLDLHGLHSAEA

Query:  VQALREHLLKIETQNASNRSLSPKKAERKGFLRASSLESFSCIDSKLDKESPSSRHRPISLEVITGIGKHSKGEAALPKAVTNFLSENGYRFEQLRPGTI
        VQAL +HLLKIETQNASNRSLSPKKAERKGF RASSLE  SC++SKLDKESPSSRHRP SLEVITGIGKHSKGEAALPKAV +FL+ENGYRFEQ RPGTI
Subjt:  VQALREHLLKIETQNASNRSLSPKKAERKGFLRASSLESFSCIDSKLDKESPSSRHRPISLEVITGIGKHSKGEAALPKAVTNFLSENGYRFEQLRPGTI

Query:  SVRPKFHR
        SVRPKF R
Subjt:  SVRPKFHR

A0A1S3BRS7 uncharacterized protein LOC1034925906.9e-27079.44Show/hide
Query:  MSWVRGKSPGWASFNLKQQNSDLQDEIDPEPFPPMSTGLSSLPPRENLHRVNGRSGGSFSSAPLPSADSLYLP----------KYFGAKKTILGDSSTQS
        MSWVRGKS GWA+FNLKQQN+ +QDE+D +PFPPMST LSSLPPRENL  VNGRSG SFS AP+PSADS  LP            F AKKTILG S+ QS
Subjt:  MSWVRGKSPGWASFNLKQQNSDLQDEIDPEPFPPMSTGLSSLPPRENLHRVNGRSGGSFSSAPLPSADSLYLP----------KYFGAKKTILGDSSTQS

Query:  GKKVVEETTDVLAFWKLKELHSWADISLIVDIMEAVNNDFNEASTLLKTMVSSDNFEINNEMSTLGLHSSNDLSWMRGKSPGWAAFDLKQQNKGIQDEID
        GKK+VEET DVL+FWKLKELH WADISLI+DIMEAVNNDFNEASTLL TMVSSDN EINNEMS LGLHSSNDLSWM GKSPGW  F+L+Q N+G+Q E D
Subjt:  GKKVVEETTDVLAFWKLKELHSWADISLIVDIMEAVNNDFNEASTLLKTMVSSDNFEINNEMSTLGLHSSNDLSWMRGKSPGWAAFDLKQQNKGIQDEID

Query:  PESFPPISNGLSSLPPCENLHRVNGRSGRSFSSAPLPSVDSLTSPDNYGAKKTILGDSSVQSGKKVVEETTDVLAFWKLKELHSWAGISLIVDIMEAVNN
        PE+FPP+     SLPP ENLH V GR GRSF+S PLPS DSLTSP NYGAK TI  DS +QSGKKVVEE TDVLAFWKLKE+HSWA  SLIVDIM+AVNN
Subjt:  PESFPPISNGLSSLPPCENLHRVNGRSGRSFSSAPLPSVDSLTSPDNYGAKKTILGDSSVQSGKKVVEETTDVLAFWKLKELHSWAGISLIVDIMEAVNN

Query:  NFNEASTLLKTMVSSDNFEINNKMSTLGLHSSNDLSCNGNNDVSISLEKTVNIPILSCSLK---DVHQNNNACQEDDTKLFENNYYEWNFFHDVGNPKLA
        NF+EASTLLKTMVSSDNFEINN++STLGLHS+NDL CNG+NDVSIS E+T+N PILS +LK    +HQN+N   ED TKLF N+Y+E NFF + GN K+A
Subjt:  NFNEASTLLKTMVSSDNFEINNKMSTLGLHSSNDLSCNGNNDVSISLEKTVNIPILSCSLK---DVHQNNNACQEDDTKLFENNYYEWNFFHDVGNPKLA

Query:  LGCPKSVPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEEWLAAKRLNDKAANEILRTRNSKNGIWKLDLHGLHSAEA
        LGC KSVPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAY RKDHASAKYHSSRAQE+WLAAK LNDKAANEIL+TRNSKNG+WKLDLHGLH+AEA
Subjt:  LGCPKSVPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEEWLAAKRLNDKAANEILRTRNSKNGIWKLDLHGLHSAEA

Query:  VQALREHLLKIETQNASNRSLSPKKAERKGFLRASSLESFSCIDSKLDKESPSSRHRPISLEVITGIGKHSKGEAALPKAVTNFLSENGYRFEQLRPGTI
        VQAL++HLLKIETQNASNRSLSPKKAERKGF RASSLE  SC+D+KLDKESPSSRHRP SLEVITGIGKHSKGEAALPKAVT+FL+ENGYRFEQ RPGTI
Subjt:  VQALREHLLKIETQNASNRSLSPKKAERKGFLRASSLESFSCIDSKLDKESPSSRHRPISLEVITGIGKHSKGEAALPKAVTNFLSENGYRFEQLRPGTI

Query:  SVRPKFHR
        SVRPKF R
Subjt:  SVRPKFHR

A0A5D3CAF0 Smr (Small MutS Related) domain-containing protein, putative isoform 17.6e-26979.11Show/hide
Query:  MSWVRGKSPGWASFNLKQQNSDLQDEIDPEPFPPMSTGLSSLPPRENLHRVNGRSGGSFSSAPLPSADSLYLP----------KYFGAKKTILGDSSTQS
        MSWVRGKS GWA+FNLKQQN+ +QDE+D +PFPPMST LSSLPPRENL  VNGRSG SFS AP+PSADS  LP            F AKKTILG S+ QS
Subjt:  MSWVRGKSPGWASFNLKQQNSDLQDEIDPEPFPPMSTGLSSLPPRENLHRVNGRSGGSFSSAPLPSADSLYLP----------KYFGAKKTILGDSSTQS

Query:  GKKVVEETTDVLAFWKLKELHSWADISLIVDIMEAVNNDFNEASTLLKTMVSSDNFEINNEMSTLGLHSSNDLSWMRGKSPGWAAFDLKQQNKGIQDEID
        GKK+VEET DVL+FWKLKELH WADISLI+DIMEAVNNDFNEASTLL TMVSSDN EINNEMS LGLHSSNDLSWM GKSPGW  F+L+Q N+G+Q E D
Subjt:  GKKVVEETTDVLAFWKLKELHSWADISLIVDIMEAVNNDFNEASTLLKTMVSSDNFEINNEMSTLGLHSSNDLSWMRGKSPGWAAFDLKQQNKGIQDEID

Query:  PESFPPISNGLSSLPPCENLHRVNGRSGRSFSSAPLPSVDSLTSPDNYGAKKTILGDSSVQSGKKVVEETTDVLAFWKLKELHSWAGISLIVDIMEAVNN
        PE+FPP+     SLPP ENLH V G  GRSF+S PLPS DSLTSP NYGAK TI  DS +QSGKKVVEE TDVLAFWKLKE+HSWA  SLIVDIM+AVNN
Subjt:  PESFPPISNGLSSLPPCENLHRVNGRSGRSFSSAPLPSVDSLTSPDNYGAKKTILGDSSVQSGKKVVEETTDVLAFWKLKELHSWAGISLIVDIMEAVNN

Query:  NFNEASTLLKTMVSSDNFEINNKMSTLGLHSSNDLSCNGNNDVSISLEKTVNIPILSCSLK---DVHQNNNACQEDDTKLFENNYYEWNFFHDVGNPKLA
        NF+EASTLLKTMVSSDNFEINN++STLGLH +NDL CNG+NDVSIS E+T+N PILS +LK    +HQN+N   ED TKLF N+Y+E NFF + GN K+A
Subjt:  NFNEASTLLKTMVSSDNFEINNKMSTLGLHSSNDLSCNGNNDVSISLEKTVNIPILSCSLK---DVHQNNNACQEDDTKLFENNYYEWNFFHDVGNPKLA

Query:  LGCPKSVPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEEWLAAKRLNDKAANEILRTRNSKNGIWKLDLHGLHSAEA
        LGC KSVPIEPEWEEDD+YLSHRKDAIAMMRSASQHSRAATNAY RKDHASAKYHSSRAQE+WLAAK LNDKAANEIL+TRNSKNG+WKLDLHGLH+AEA
Subjt:  LGCPKSVPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEEWLAAKRLNDKAANEILRTRNSKNGIWKLDLHGLHSAEA

Query:  VQALREHLLKIETQNASNRSLSPKKAERKGFLRASSLESFSCIDSKLDKESPSSRHRPISLEVITGIGKHSKGEAALPKAVTNFLSENGYRFEQLRPGTI
        VQAL++HLLKIETQNASNRSLSPKKAERKGF RASSLE  SC+DSKLDKESPSSRHRP SLEVITGIGKHSKGEAALPKAVT+FL+ENGYRFEQ RPGTI
Subjt:  VQALREHLLKIETQNASNRSLSPKKAERKGFLRASSLESFSCIDSKLDKESPSSRHRPISLEVITGIGKHSKGEAALPKAVTNFLSENGYRFEQLRPGTI

Query:  SVRPKFHR
        SVRPKF R
Subjt:  SVRPKFHR

A0A6J1GN51 uncharacterized protein LOC111455928 isoform X14.9e-26880.5Show/hide
Query:  MSWVRGKSPGWASFNLKQQNSDLQDEIDPEPFPPMSTGLSSLPPRENLHRVNGRSGGSFSSAPLPSADSLYLPKYFGAKKTILGDSSTQSGKKVVEETTD
        MSW RGKSPGWA+ NLKQQNS LQDEIDP+PFPPMST LS LPPREN+HRVNGRSG SFSS PLPSADSL  P+ FG KKTI G+SS +SGKK+VEE+TD
Subjt:  MSWVRGKSPGWASFNLKQQNSDLQDEIDPEPFPPMSTGLSSLPPRENLHRVNGRSGGSFSSAPLPSADSLYLPKYFGAKKTILGDSSTQSGKKVVEETTD

Query:  VLAFWKLKELHSWADISLIVDIMEAVNNDFNEASTLLKTMVSSDNFEINNEMSTLGLHSSNDLSWMRGKSPGWAAFDLKQQNKGIQDEIDPESFPPISNG
        VLAFWKLKELHSWADISLIVDIMEAVNN+FNEAS LLKTMVSSDNFEINNEMSTLGLHSSND+S +RGKSPGW  F+LKQQN+G+QD IDP+ FPP+ + 
Subjt:  VLAFWKLKELHSWADISLIVDIMEAVNNDFNEASTLLKTMVSSDNFEINNEMSTLGLHSSNDLSWMRGKSPGWAAFDLKQQNKGIQDEIDPESFPPISNG

Query:  LSSLPPCENLHRVNGRSGRSFSSAPLPSVDSLTSPDNYGAKKTILGDSSVQSGKKVVEETTDVLAFWKLKELHSWAGISLIVDIMEAVNNNFNEASTLLK
        LSSLPP ENLH VNGR GRS SS+PLPS DSLT P+NY AKK ILGDSS+Q+G+KVVEETTDVLAFWKLKELH+WA  SLIVDIMEAV+NNFNEAST L 
Subjt:  LSSLPPCENLHRVNGRSGRSFSSAPLPSVDSLTSPDNYGAKKTILGDSSVQSGKKVVEETTDVLAFWKLKELHSWAGISLIVDIMEAVNNNFNEASTLLK

Query:  TMVSSDNFEINNKMSTLGLHSSNDLSCNGNNDVSISLEKTVNIPILSCSLKDV---HQNNNACQEDDTKLFENNYYEWNFFHDVGNPKLALGCPKSVPIE
         MVSSDN EI N+MSTLGLHS++ L CNG NDV+ISL +TVN PI S +LKDV   HQN N       KLFENNY+E NFFH+VGNPK+AL C KS PIE
Subjt:  TMVSSDNFEINNKMSTLGLHSSNDLSCNGNNDVSISLEKTVNIPILSCSLKDV---HQNNNACQEDDTKLFENNYYEWNFFHDVGNPKLALGCPKSVPIE

Query:  PEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEEWLAAKRLNDKAANEILRTRNSKNGIWKLDLHGLHSAEAVQALREHLLK
        PEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQE+WLAAK LN KAANEIL+TRNS+NG+WKLDLHGLH+AEAVQAL++HLLK
Subjt:  PEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEEWLAAKRLNDKAANEILRTRNSKNGIWKLDLHGLHSAEAVQALREHLLK

Query:  IETQNASNRSLSPKKAERKGFLRASSLESFSCIDSKLDKE--SPSSRHRPISLEVITGIGKHSKGEAALPKAVTNFLSENGYRFEQLRPGTISVRPKFHR
        IET+NASNRSLSPKKAERKGF R SSLE  SC+  KLDKE  SP  RHRP SLEVITG+GKHS+GEAALPKAVT+FLSENGYRFEQLRPGTISVRPKF R
Subjt:  IETQNASNRSLSPKKAERKGFLRASSLESFSCIDSKLDKE--SPSSRHRPISLEVITGIGKHSKGEAALPKAVTNFLSENGYRFEQLRPGTISVRPKFHR

A0A6J1GPE7 uncharacterized protein LOC111455928 isoform X24.2e-25978.67Show/hide
Query:  MSWVRGKSPGWASFNLKQQNSDLQDEIDPEPFPPMSTGLSSLPPRENLHRVNGRSGGSFSSAPLPSADSLYLPKYFGAKKTILGDSSTQSGKKVVEETTD
        MSW RGKSPGWA+ NLKQQNS LQDEIDP+PFPPMST LS LPPREN+HRVNGRSG S              P+ FG KKTI G+SS +SGKK+VEE+TD
Subjt:  MSWVRGKSPGWASFNLKQQNSDLQDEIDPEPFPPMSTGLSSLPPRENLHRVNGRSGGSFSSAPLPSADSLYLPKYFGAKKTILGDSSTQSGKKVVEETTD

Query:  VLAFWKLKELHSWADISLIVDIMEAVNNDFNEASTLLKTMVSSDNFEINNEMSTLGLHSSNDLSWMRGKSPGWAAFDLKQQNKGIQDEIDPESFPPISNG
        VLAFWKLKELHSWADISLIVDIMEAVNN+FNEAS LLKTMVSSDNFEINNEMSTLGLHSSND+S +RGKSPGW  F+LKQQN+G+QD IDP+ FPP+ + 
Subjt:  VLAFWKLKELHSWADISLIVDIMEAVNNDFNEASTLLKTMVSSDNFEINNEMSTLGLHSSNDLSWMRGKSPGWAAFDLKQQNKGIQDEIDPESFPPISNG

Query:  LSSLPPCENLHRVNGRSGRSFSSAPLPSVDSLTSPDNYGAKKTILGDSSVQSGKKVVEETTDVLAFWKLKELHSWAGISLIVDIMEAVNNNFNEASTLLK
        LSSLPP ENLH VNGR GRS SS+PLPS DSLT P+NY AKK ILGDSS+Q+G+KVVEETTDVLAFWKLKELH+WA  SLIVDIMEAV+NNFNEAST L 
Subjt:  LSSLPPCENLHRVNGRSGRSFSSAPLPSVDSLTSPDNYGAKKTILGDSSVQSGKKVVEETTDVLAFWKLKELHSWAGISLIVDIMEAVNNNFNEASTLLK

Query:  TMVSSDNFEINNKMSTLGLHSSNDLSCNGNNDVSISLEKTVNIPILSCSLKDV---HQNNNACQEDDTKLFENNYYEWNFFHDVGNPKLALGCPKSVPIE
         MVSSDN EI N+MSTLGLHS++ L CNG NDV+ISL +TVN PI S +LKDV   HQN N       KLFENNY+E NFFH+VGNPK+AL C KS PIE
Subjt:  TMVSSDNFEINNKMSTLGLHSSNDLSCNGNNDVSISLEKTVNIPILSCSLKDV---HQNNNACQEDDTKLFENNYYEWNFFHDVGNPKLALGCPKSVPIE

Query:  PEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEEWLAAKRLNDKAANEILRTRNSKNGIWKLDLHGLHSAEAVQALREHLLK
        PEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQE+WLAAK LN KAANEIL+TRNS+NG+WKLDLHGLH+AEAVQAL++HLLK
Subjt:  PEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEEWLAAKRLNDKAANEILRTRNSKNGIWKLDLHGLHSAEAVQALREHLLK

Query:  IETQNASNRSLSPKKAERKGFLRASSLESFSCIDSKLDKE--SPSSRHRPISLEVITGIGKHSKGEAALPKAVTNFLSENGYRFEQLRPGTISVRPKFHR
        IET+NASNRSLSPKKAERKGF R SSLE  SC+  KLDKE  SP  RHRP SLEVITG+GKHS+GEAALPKAVT+FLSENGYRFEQLRPGTISVRPKF R
Subjt:  IETQNASNRSLSPKKAERKGFLRASSLESFSCIDSKLDKE--SPSSRHRPISLEVITGIGKHSKGEAALPKAVTNFLSENGYRFEQLRPGTISVRPKFHR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G23520.1 smr (Small MutS Related) domain-containing protein3.2e-7842.6Show/hide
Query:  LSWMRGKSPGWAAFDLKQ-QNKGIQDEIDPESFPPISNGL-SSLPPCENLHRVNGRSGRSFSSAPLP--SVDSLTSPDNYGAKKTILGDSSVQSGKKVVE
        +SWM+GKS GW AFDLKQ Q +G++ E++ + FPP+S  + +S      L R +  S +SFSS  LP     +LT   + G ++   G    +     + 
Subjt:  LSWMRGKSPGWAAFDLKQ-QNKGIQDEIDPESFPPISNGL-SSLPPCENLHRVNGRSGRSFSSAPLP--SVDSLTSPDNYGAKKTILGDSSVQSGKKVVE

Query:  ETTDVLAFWKLKELHSWAGISLIVDIMEAVNNNFNEASTLLKTMVSSDNFEINNKMSTLGLHSSNDLSCNGNNDVSISLEKTV--NIPILSCSLKDVHQN
          +  LAF KLKE++SWA  +LI D++ +  ++F  A   LK MVSS   +        G  S N            + EKTV  ++ + + S  +    
Subjt:  ETTDVLAFWKLKELHSWAGISLIVDIMEAVNNNFNEASTLLKTMVSSDNFEINNKMSTLGLHSSNDLSCNGNNDVSISLEKTV--NIPILSCSLKDVHQN

Query:  NNACQEDDTKLFENNYYEWNFFHDVGNPKLALGCPKSVPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEEWLAAKRL
         +    D +    N      F  D+      +   +S+PIEPEWEEDD+YLSHRKDA+ +MRSAS HSRAA NA+ R DHASAK HS +A+E+WLAA++L
Subjt:  NNACQEDDTKLFENNYYEWNFFHDVGNPKLALGCPKSVPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEEWLAAKRL

Query:  NDKAANEILRTRNSKNGIWKLDLHGLHSAEAVQALREHLLKIETQNASNRSLSPKKAERK-GFLRASSLESFSCIDSK-LDKESPSSRHRPISLEVITGI
        N +AA +I+   N  N IWKLDLHGLH+ EAVQAL+E L  IE     NRS+SP +   K   LR++S E F  +D + +  +  SSR    SL+VITGI
Subjt:  NDKAANEILRTRNSKNGIWKLDLHGLHSAEAVQALREHLLKIETQNASNRSLSPKKAERK-GFLRASSLESFSCIDSK-LDKESPSSRHRPISLEVITGI

Query:  GKHSKGEAALPKAVTNFLSENGYRFEQLRPGTISVRPKF
        GKHS+G+A+LP AV  F  +N YRF++ RPG I+VRPKF
Subjt:  GKHSKGEAALPKAVTNFLSENGYRFEQLRPGTISVRPKF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGTGGGTGAGGGGTAAATCTCCTGGCTGGGCCTCATTTAACCTTAAGCAACAGAATAGTGACCTTCAAGATGAAATTGACCCGGAACCATTCCCACCAATGTCAAC
TGGCCTTTCCTCTCTGCCACCCCGTGAAAACCTCCACAGAGTTAATGGTCGTTCGGGGGGATCTTTCTCATCTGCACCCCTTCCTTCTGCTGATTCTCTATATTTGCCAA
AATATTTTGGTGCAAAAAAGACAATACTTGGTGATTCTAGTACTCAAAGTGGTAAGAAGGTGGTTGAAGAAACCACTGATGTTTTAGCATTTTGGAAGCTTAAAGAGCTT
CATTCTTGGGCTGATATTAGCTTGATTGTGGATATAATGGAAGCTGTAAATAATGACTTCAATGAGGCATCTACTTTATTAAAAACAATGGTTTCTAGTGACAATTTTGA
GATCAATAATGAGATGAGCACTTTAGGACTGCATTCCTCTAATGATCTATCGTGGATGAGGGGTAAATCTCCTGGCTGGGCAGCATTTGACCTTAAGCAACAGAATAAAG
GCATTCAAGATGAAATTGACCCGGAATCATTCCCACCGATATCAAACGGCCTTTCCTCTCTGCCACCCTGTGAAAACTTGCACAGAGTTAATGGTCGTTCAGGGAGATCT
TTCTCATCTGCACCCCTTCCTTCTGTCGATTCTCTAACTTCGCCTGACAATTATGGTGCAAAAAAGACAATACTTGGTGATTCTAGTGTTCAAAGTGGCAAGAAGGTGGT
CGAAGAAACTACTGATGTTTTAGCTTTTTGGAAGCTTAAAGAGCTTCATTCTTGGGCTGGTATTAGCTTGATTGTGGATATAATGGAAGCTGTAAATAATAACTTCAATG
AGGCGTCTACTTTATTAAAAACAATGGTTTCTAGCGACAATTTTGAGATCAATAATAAGATGAGTACCTTAGGACTGCATTCCTCTAATGATCTATCGTGCAATGGGAAT
AATGATGTAAGTATATCATTAGAAAAAACGGTTAATATTCCCATCCTTAGTTGCTCATTAAAGGATGTGCATCAAAATAATAATGCATGTCAAGAAGATGATACAAAATT
GTTTGAAAATAATTATTATGAATGGAACTTCTTTCATGATGTTGGAAATCCAAAACTAGCTCTTGGTTGCCCAAAGTCTGTTCCTATTGAGCCTGAGTGGGAAGAAGATG
ATATTTACCTGAGCCATCGGAAAGATGCTATAGCAATGATGAGGTCGGCATCTCAACATTCAAGGGCAGCCACTAATGCTTATCTTAGGAAAGATCATGCTTCTGCCAAG
TATCATTCATCAAGGGCTCAAGAAGAATGGCTAGCTGCAAAAAGGTTAAATGATAAGGCAGCTAATGAAATTTTACGAACGAGGAATAGTAAAAATGGGATCTGGAAGTT
GGACTTACATGGGCTTCATTCAGCTGAGGCTGTTCAAGCCTTGCGAGAACACTTGCTGAAAATTGAGACTCAGAATGCCTCCAATAGGTCGTTGTCGCCAAAGAAAGCTG
AAAGGAAAGGGTTCCTACGTGCTTCATCCCTTGAGTCTTTTAGTTGTATAGACTCAAAGTTGGACAAAGAATCACCATCGTCTAGGCATAGGCCAATATCATTGGAAGTC
ATAACAGGTATAGGTAAACATAGCAAGGGAGAAGCTGCTCTACCAAAGGCTGTTACCAATTTTCTTAGTGAAAATGGGTACCGTTTTGAGCAGTTGAGGCCTGGGACGAT
CAGTGTTCGACCAAAGTTTCATAGGTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCGTGGGTGAGGGGTAAATCTCCTGGCTGGGCCTCATTTAACCTTAAGCAACAGAATAGTGACCTTCAAGATGAAATTGACCCGGAACCATTCCCACCAATGTCAAC
TGGCCTTTCCTCTCTGCCACCCCGTGAAAACCTCCACAGAGTTAATGGTCGTTCGGGGGGATCTTTCTCATCTGCACCCCTTCCTTCTGCTGATTCTCTATATTTGCCAA
AATATTTTGGTGCAAAAAAGACAATACTTGGTGATTCTAGTACTCAAAGTGGTAAGAAGGTGGTTGAAGAAACCACTGATGTTTTAGCATTTTGGAAGCTTAAAGAGCTT
CATTCTTGGGCTGATATTAGCTTGATTGTGGATATAATGGAAGCTGTAAATAATGACTTCAATGAGGCATCTACTTTATTAAAAACAATGGTTTCTAGTGACAATTTTGA
GATCAATAATGAGATGAGCACTTTAGGACTGCATTCCTCTAATGATCTATCGTGGATGAGGGGTAAATCTCCTGGCTGGGCAGCATTTGACCTTAAGCAACAGAATAAAG
GCATTCAAGATGAAATTGACCCGGAATCATTCCCACCGATATCAAACGGCCTTTCCTCTCTGCCACCCTGTGAAAACTTGCACAGAGTTAATGGTCGTTCAGGGAGATCT
TTCTCATCTGCACCCCTTCCTTCTGTCGATTCTCTAACTTCGCCTGACAATTATGGTGCAAAAAAGACAATACTTGGTGATTCTAGTGTTCAAAGTGGCAAGAAGGTGGT
CGAAGAAACTACTGATGTTTTAGCTTTTTGGAAGCTTAAAGAGCTTCATTCTTGGGCTGGTATTAGCTTGATTGTGGATATAATGGAAGCTGTAAATAATAACTTCAATG
AGGCGTCTACTTTATTAAAAACAATGGTTTCTAGCGACAATTTTGAGATCAATAATAAGATGAGTACCTTAGGACTGCATTCCTCTAATGATCTATCGTGCAATGGGAAT
AATGATGTAAGTATATCATTAGAAAAAACGGTTAATATTCCCATCCTTAGTTGCTCATTAAAGGATGTGCATCAAAATAATAATGCATGTCAAGAAGATGATACAAAATT
GTTTGAAAATAATTATTATGAATGGAACTTCTTTCATGATGTTGGAAATCCAAAACTAGCTCTTGGTTGCCCAAAGTCTGTTCCTATTGAGCCTGAGTGGGAAGAAGATG
ATATTTACCTGAGCCATCGGAAAGATGCTATAGCAATGATGAGGTCGGCATCTCAACATTCAAGGGCAGCCACTAATGCTTATCTTAGGAAAGATCATGCTTCTGCCAAG
TATCATTCATCAAGGGCTCAAGAAGAATGGCTAGCTGCAAAAAGGTTAAATGATAAGGCAGCTAATGAAATTTTACGAACGAGGAATAGTAAAAATGGGATCTGGAAGTT
GGACTTACATGGGCTTCATTCAGCTGAGGCTGTTCAAGCCTTGCGAGAACACTTGCTGAAAATTGAGACTCAGAATGCCTCCAATAGGTCGTTGTCGCCAAAGAAAGCTG
AAAGGAAAGGGTTCCTACGTGCTTCATCCCTTGAGTCTTTTAGTTGTATAGACTCAAAGTTGGACAAAGAATCACCATCGTCTAGGCATAGGCCAATATCATTGGAAGTC
ATAACAGGTATAGGTAAACATAGCAAGGGAGAAGCTGCTCTACCAAAGGCTGTTACCAATTTTCTTAGTGAAAATGGGTACCGTTTTGAGCAGTTGAGGCCTGGGACGAT
CAGTGTTCGACCAAAGTTTCATAGGTAA
Protein sequenceShow/hide protein sequence
MSWVRGKSPGWASFNLKQQNSDLQDEIDPEPFPPMSTGLSSLPPRENLHRVNGRSGGSFSSAPLPSADSLYLPKYFGAKKTILGDSSTQSGKKVVEETTDVLAFWKLKEL
HSWADISLIVDIMEAVNNDFNEASTLLKTMVSSDNFEINNEMSTLGLHSSNDLSWMRGKSPGWAAFDLKQQNKGIQDEIDPESFPPISNGLSSLPPCENLHRVNGRSGRS
FSSAPLPSVDSLTSPDNYGAKKTILGDSSVQSGKKVVEETTDVLAFWKLKELHSWAGISLIVDIMEAVNNNFNEASTLLKTMVSSDNFEINNKMSTLGLHSSNDLSCNGN
NDVSISLEKTVNIPILSCSLKDVHQNNNACQEDDTKLFENNYYEWNFFHDVGNPKLALGCPKSVPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAK
YHSSRAQEEWLAAKRLNDKAANEILRTRNSKNGIWKLDLHGLHSAEAVQALREHLLKIETQNASNRSLSPKKAERKGFLRASSLESFSCIDSKLDKESPSSRHRPISLEV
ITGIGKHSKGEAALPKAVTNFLSENGYRFEQLRPGTISVRPKFHR