CuGenDBv2

PI0028243 (gene) of Melon (PI 482460) v1 genome

Gene ID: PI0028243
Organism: Cucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Description: Smr domain-containing protein
Genome location: chr01:22889161..22892719
RNA-Seq Expression: PI0028243
Synteny: PI0028243
Gene Ontology terms: NA
InterPro domains:
IPR002625 - Smr domain
IPR013899 - Domain of unknown function DUF1771
IPR036063 - Smr domain superfamily
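
The genome location field above packs a sequence name and a coordinate pair into one string. The following is a minimal Python sketch for splitting it and computing the span length. It assumes the coordinates are 1-based and inclusive, which this record does not state; `Region` and `parse_location` are hypothetical names introduced only for illustration.

```python
import re
from typing import NamedTuple


class Region(NamedTuple):
    """A genomic interval; coordinates are ASSUMED 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive interval, so both end points count.
        return self.end - self.start + 1


def parse_location(text: str) -> Region:
    """Parse a string of the form 'chr01:22889161..22892719'."""
    match = re.fullmatch(r"\s*([^:\s]+):(\d+)\.\.(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    # Normalise so that start <= end even if the pair is given in reverse.
    if start > end:
        start, end = end, start
    return Region(seqid, start, end)


region = parse_location("chr01:22889161..22892719")
print(region.seqid, region.length)  # chr01 3559
```

Under the inclusive-coordinate assumption the gene spans 3559 bp on chr01; with half-open coordinates the figure would be 3558.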


Homology
GenBank top hits (e value, % identity, alignment)
KAA0059625.1 Smr (Small MutS Related) domain-containing protein, putative isoform 1 [Cucumis melo var. makuwa] | e value: 5.7e-303 | % identity: 89.31
Query:  MSWVRGKSSGWAAFNLKQQNNGLQDEVDPDPFPPMSTTLSSLPPRENLHGVNGRSGRSFSFAPVPSADSLTVPGKFGAKKTTSENFGAKKTILGASNIQS
        MSWVRGKSSGWAAFNLKQQNNG+QDEVD DPFPPMSTTLSSLPPRENL GVNGRSGRSFSFAP+PSADS T+PGK GAKKTT  NF AKKTILGASNIQS
Subjt:  MSWVRGKSSGWAAFNLKQQNNGLQDEVDPDPFPPMSTTLSSLPPRENLHGVNGRSGRSFSFAPVPSADSLTVPGKFGAKKTTSENFGAKKTILGASNIQS

Query:  GKKMVEETNDVLSFWKLKELHSWADISLIMDIMEAVNNDFNEASTLLNAMVSSDNLEINNEMSALGLHSSNDLSWVTGKSPGWEEFNLKQRNRGLQNETG
        GKKMVEETNDVLSFWKLKELH WADISLIMDIMEAVNNDFNEASTLLN MVSSDNLEINNEMS LGLHSSNDLSW+ GKSPGWEEFNL+Q NRGLQ E  
Subjt:  GKKMVEETNDVLSFWKLKELHSWADISLIMDIMEAVNNDFNEASTLLNAMVSSDNLEINNEMSALGLHSSNDLSWVTGKSPGWEEFNLKQRNRGLQNETG

Query:  LEAFPPMLTDHS-----------FG----SFASEPLPSADSLTSSENYGAKKTIPDDSSIQSGKKVVEENTDVLAFWKLKEIHSWADFSLIVDIMDAVNN
         EAFPPMLT+H            +G    SFASEPLPSADSLTS  NYGAK TIPDDS IQSGKKVVEENTDVLAFWKLKEIHSWADFSLIVDIMDAVNN
Subjt:  LEAFPPMLTDHS-----------FG----SFASEPLPSADSLTSSENYGAKKTIPDDSSIQSGKKVVEENTDVLAFWKLKEIHSWADFSLIVDIMDAVNN

Query:  NFDEASTLLRTMVSSDNFEINNEISTLGLHSANDLLCNENNDVSISSERMINAPILSSTLKDVRGAHQNNSTGGENYTKLFVNNYFERNFFHNAGNSKIA
        NFDEASTLL+TMVSSDNFEINNEISTLGLH ANDLLCN +NDVSISSER IN PILS TLK  +G HQN++TGGE+ TKLFVN+YFERNFF NAGNSKIA
Subjt:  NFDEASTLLRTMVSSDNFEINNEISTLGLHSANDLLCNENNDVSISSERMINAPILSSTLKDVRGAHQNNSTGGENYTKLFVNNYFERNFFHNAGNSKIA

Query:  LGCPNSVPIEPEWEEDDVYLTHRKDAIAMMRSASQHSRAATNAYRRKDHASAKFHSSRAQEQWLAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAAEA
        LGC  SVPIEPEWEEDDVYL+HRKDAIAMMRSASQHSRAATNAYRRKDHASAK+HSSRAQEQWLAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAAEA
Subjt:  LGCPNSVPIEPEWEEDDVYLTHRKDAIAMMRSASQHSRAATNAYRRKDHASAKFHSSRAQEQWLAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAAEA

Query:  VQALQDHLLKIETQNASNRSLSPKKAERKGFQRASSLEYLSCMDSKLDKESPSSRHRPTSLEVITGIGKHSKGEAALPKAVTSFLTENGYRFEQLRPGTI
        VQALQDHLLKIETQNASNRSLSPKKAERKGFQRASSLEYLSCMDSKLDKESPSSRHRPTSLEVITGIGKHSKGEAALPKAVTSFLTENGYRFEQ RPGTI
Subjt:  VQALQDHLLKIETQNASNRSLSPKKAERKGFQRASSLEYLSCMDSKLDKESPSSRHRPTSLEVITGIGKHSKGEAALPKAVTSFLTENGYRFEQLRPGTI

Query:  SVRPKFRR
        SVRPKFRR
Subjt:  SVRPKFRR

XP_004148966.1 uncharacterized protein LOC101223137 [Cucumis sativus] | e value: 9.1e-301 | % identity: 87.99
Query:  MSWVRGKSSGWAAFNLKQQNNGLQDEVDPDPFPPMSTTLSSLPPRENLHGVNGRSGRSFSFAPVPSADSLTVPGKFGAKKTTSENFGAKKTILGASNIQS
        MSWVRGKSSGWAAFNLKQQNNGLQDEVD DPFPPMSTTLSSLPPRENL GVNG SG+SFS AP+PSADS T+P KFGAKKTT  NFGAKKTILG +NIQS
Subjt:  MSWVRGKSSGWAAFNLKQQNNGLQDEVDPDPFPPMSTTLSSLPPRENLHGVNGRSGRSFSFAPVPSADSLTVPGKFGAKKTTSENFGAKKTILGASNIQS

Query:  GKKMVEETNDVLSFWKLKELHSWADISLIMDIMEAVNNDFNEASTLLNAMVSSDNLEINNEMSALGLHSSNDLSWVTGKSPGWEEFNLKQRNRGLQNETG
        GKK+VEETNDVLSFWKLKELH WADISLIMDIMEAVNNDFNEASTLLN MVSSDNLEINN+MS LGLHSSNDL W+ GKSPGWEEFNLKQ N+GLQ+E  
Subjt:  GKKMVEETNDVLSFWKLKELHSWADISLIMDIMEAVNNDFNEASTLLNAMVSSDNLEINNEMSALGLHSSNDLSWVTGKSPGWEEFNLKQRNRGLQNETG

Query:  LEAFPPMLTDHS-----------FG----SFASEPLPSADSLTSSENYGAKKTIPDDSSIQSGKKVVEENTDVLAFWKLKEIHSWADFSLIVDIMDAVNN
        LEAFPPMLT+ S           +G    SFASEPLPS DSLTS ENYGAK TI DDSSIQSGKKVVEENTDVLAFWKLKEIHSWADFSLIVDIMDAVNN
Subjt:  LEAFPPMLTDHS-----------FG----SFASEPLPSADSLTSSENYGAKKTIPDDSSIQSGKKVVEENTDVLAFWKLKEIHSWADFSLIVDIMDAVNN

Query:  NFDEASTLLRTMVSSDNFEINNEISTLGLHSANDLLCNENNDVSISSERMINAPILSSTLKDVRGAHQNNSTGGENYTKLFVNNYFERNFFHNAGNSKIA
        NFDEASTLL+TMVSSDNFEINNEISTLGLHSANDLLCN NNDVSI+SERMINAPILSST+K V+G HQNN+T  E+YTKLF N+YFERN FHN GNSKIA
Subjt:  NFDEASTLLRTMVSSDNFEINNEISTLGLHSANDLLCNENNDVSISSERMINAPILSSTLKDVRGAHQNNSTGGENYTKLFVNNYFERNFFHNAGNSKIA

Query:  LGCPNSVPIEPEWEEDDVYLTHRKDAIAMMRSASQHSRAATNAYRRKDHASAKFHSSRAQEQWLAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAAEA
        LGC  SVPIEPEWEEDD+YL+HRKDAIAMMRSASQHSRAATNAYRRKDHASAK+HSSRA+EQWLAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAAEA
Subjt:  LGCPNSVPIEPEWEEDDVYLTHRKDAIAMMRSASQHSRAATNAYRRKDHASAKFHSSRAQEQWLAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAAEA

Query:  VQALQDHLLKIETQNASNRSLSPKKAERKGFQRASSLEYLSCMDSKLDKESPSSRHRPTSLEVITGIGKHSKGEAALPKAVTSFLTENGYRFEQLRPGTI
        VQAL DHLLKIETQNASNRSLSPKKAERKGFQRASSLEYLSCM+SKLDKESPSSRHRPTSLEVITGIGKHSKGEAALPKAV SFLTENGYRFEQ RPGTI
Subjt:  VQALQDHLLKIETQNASNRSLSPKKAERKGFQRASSLEYLSCMDSKLDKESPSSRHRPTSLEVITGIGKHSKGEAALPKAVTSFLTENGYRFEQLRPGTI

Query:  SVRPKFRR
        SVRPKFRR
Subjt:  SVRPKFRR

XP_008451240.1 PREDICTED: uncharacterized protein LOC103492590 [Cucumis melo] | e value: 3.3e-303 | % identity: 89.14
Query:  MSWVRGKSSGWAAFNLKQQNNGLQDEVDPDPFPPMSTTLSSLPPRENLHGVNGRSGRSFSFAPVPSADSLTVPGKFGAKKTTSENFGAKKTILGASNIQS
        MSWVRGKSSGWAAFNLKQQNNG+QDEVD DPFPPMSTTLSSLPPRENL GVNGRSGRSFSFAP+PSADS T+PGK GAKKTT  NF AKKTILGASNIQS
Subjt:  MSWVRGKSSGWAAFNLKQQNNGLQDEVDPDPFPPMSTTLSSLPPRENLHGVNGRSGRSFSFAPVPSADSLTVPGKFGAKKTTSENFGAKKTILGASNIQS

Query:  GKKMVEETNDVLSFWKLKELHSWADISLIMDIMEAVNNDFNEASTLLNAMVSSDNLEINNEMSALGLHSSNDLSWVTGKSPGWEEFNLKQRNRGLQNETG
        GKKMVEETNDVLSFWKLKELH WADISLIMDIMEAVNNDFNEASTLLN MVSSDNLEINNEMS LGLHSSNDLSW+ GKSPGWEEFNL+Q NRGLQ E  
Subjt:  GKKMVEETNDVLSFWKLKELHSWADISLIMDIMEAVNNDFNEASTLLNAMVSSDNLEINNEMSALGLHSSNDLSWVTGKSPGWEEFNLKQRNRGLQNETG

Query:  LEAFPPMLTDHS-----------FG----SFASEPLPSADSLTSSENYGAKKTIPDDSSIQSGKKVVEENTDVLAFWKLKEIHSWADFSLIVDIMDAVNN
         EAFPPMLT+H            +G    SFASEPLPSADSLTS  NYGAK TIPDDS IQSGKKVVEENTDVLAFWKLKEIHSWADFSLIVDIMDAVNN
Subjt:  LEAFPPMLTDHS-----------FG----SFASEPLPSADSLTSSENYGAKKTIPDDSSIQSGKKVVEENTDVLAFWKLKEIHSWADFSLIVDIMDAVNN

Query:  NFDEASTLLRTMVSSDNFEINNEISTLGLHSANDLLCNENNDVSISSERMINAPILSSTLKDVRGAHQNNSTGGENYTKLFVNNYFERNFFHNAGNSKIA
        NFDEASTLL+TMVSSDNFEINNEISTLGLHSANDLLCN +NDVSISSER IN PILS TLK  +G HQN++TGGE+ TKLFVN+YFERNFF NAGNSKIA
Subjt:  NFDEASTLLRTMVSSDNFEINNEISTLGLHSANDLLCNENNDVSISSERMINAPILSSTLKDVRGAHQNNSTGGENYTKLFVNNYFERNFFHNAGNSKIA

Query:  LGCPNSVPIEPEWEEDDVYLTHRKDAIAMMRSASQHSRAATNAYRRKDHASAKFHSSRAQEQWLAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAAEA
        LGC  SVPIEPEWEEDD+YL+HRKDAIAMMRSASQHSRAATNAYRRKDHASAK+HSSRAQEQWLAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAAEA
Subjt:  LGCPNSVPIEPEWEEDDVYLTHRKDAIAMMRSASQHSRAATNAYRRKDHASAKFHSSRAQEQWLAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAAEA

Query:  VQALQDHLLKIETQNASNRSLSPKKAERKGFQRASSLEYLSCMDSKLDKESPSSRHRPTSLEVITGIGKHSKGEAALPKAVTSFLTENGYRFEQLRPGTI
        VQALQDHLLKIETQNASNRSLSPKKAERKGFQRASSLEYLSCMD+KLDKESPSSRHRPTSLEVITGIGKHSKGEAALPKAVTSFLTENGYRFEQ RPGTI
Subjt:  VQALQDHLLKIETQNASNRSLSPKKAERKGFQRASSLEYLSCMDSKLDKESPSSRHRPTSLEVITGIGKHSKGEAALPKAVTSFLTENGYRFEQLRPGTI

Query:  SVRPKFRR
        SVRPKFRR
Subjt:  SVRPKFRR

XP_023548349.1 uncharacterized protein LOC111807017 [Cucurbita pepo subsp. pepo] | e value: 4.7e-249 | % identity: 76.23
Query:  MSWVRGKSSGWAAFNLKQQNNGLQDEVDPDPFPPMSTTLSSLPPRENLHGVNGRSGRSFSFAPVPSADSLTVPGKFGAKKTTSENFGAKKTILGASNIQS
        MSW RGKS GWAA NLKQQN+GLQDE+DPDPFPPMST LS LPPREN+H VNGRSGRSFS  P+PSADSL  P          +NFGAKKTI G S+I+S
Subjt:  MSWVRGKSSGWAAFNLKQQNNGLQDEVDPDPFPPMSTTLSSLPPRENLHGVNGRSGRSFSFAPVPSADSLTVPGKFGAKKTTSENFGAKKTILGASNIQS

Query:  GKKMVEETNDVLSFWKLKELHSWADISLIMDIMEAVNNDFNEASTLLNAMVSSDNLEINNEMSALGLHSSNDLSWVTGKSPGWEEFNLKQRNRGLQNETG
        GKK+VEE+ DVL+FWKLKELHSWADISLI+DIMEAVNN+FNEAS LL  MVSSDN EINNEMS LGLHSSND+S V GKSPGWEE+NLKQ+NRGLQ+   
Subjt:  GKKMVEETNDVLSFWKLKELHSWADISLIMDIMEAVNNDFNEASTLLNAMVSSDNLEINNEMSALGLHSSNDLSWVTGKSPGWEEFNLKQRNRGLQNETG

Query:  LEAFPPM---------------LTDHSFGSFASEPLPSADSLTSSENYGAKKTIPDDSSIQSGKKVVEENTDVLAFWKLKEIHSWADFSLIVDIMDAVNN
         + FPPM               +T     S +S PLPSADSLTS ENY AKK I  DSSIQ+G+KVVEE TDVLAFWKLKE+H+WADFSLIVDIM+AV+N
Subjt:  LEAFPPM---------------LTDHSFGSFASEPLPSADSLTSSENYGAKKTIPDDSSIQSGKKVVEENTDVLAFWKLKEIHSWADFSLIVDIMDAVNN

Query:  NFDEASTLLRTMVSSDNFEINNEISTLGLHSANDLLCNENNDVSISSERMINAPILSSTLKDVRGAHQNNSTGGENYTKLFVNNYFERNFFHNAGNSKIA
        NF+EAST L  MVSSDN EI NE+STLGLHSA+ L C   NDV+IS  R +N PI SSTLKDV+  HQN +       KLF NNY ERNFFHN GN KIA
Subjt:  NFDEASTLLRTMVSSDNFEINNEISTLGLHSANDLLCNENNDVSISSERMINAPILSSTLKDVRGAHQNNSTGGENYTKLFVNNYFERNFFHNAGNSKIA

Query:  LGCPNSVPIEPEWEEDDVYLTHRKDAIAMMRSASQHSRAATNAYRRKDHASAKFHSSRAQEQWLAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAAEA
        L C  S PIEPEWEEDD+YL+HRKDAIAMMRSASQHSRAATNAY RKDHASAK+HSSRAQEQWLAAKMLN KAANEILQTRNS+NGLWKLDLHGLHAAEA
Subjt:  LGCPNSVPIEPEWEEDDVYLTHRKDAIAMMRSASQHSRAATNAYRRKDHASAKFHSSRAQEQWLAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAAEA

Query:  VQALQDHLLKIETQNASNRSLSPKKAERKGFQRASSLEYLSCMDSKLDKE--SPSSRHRPTSLEVITGIGKHSKGEAALPKAVTSFLTENGYRFEQLRPG
        VQALQDHLLKIET+NASNRSLSPKKAERKGF R SSLEYLSCM  KLDKE  SP  RHRPTSLEVITGIGKHS+GEAALPKAVTSFL+ENGYRFEQLRPG
Subjt:  VQALQDHLLKIETQNASNRSLSPKKAERKGFQRASSLEYLSCMDSKLDKE--SPSSRHRPTSLEVITGIGKHSKGEAALPKAVTSFLTENGYRFEQLRPG

Query:  TISVRPKFRR
        TISVRPKFRR
Subjt:  TISVRPKFRR

XP_038898473.1 uncharacterized protein LOC120086100 [Benincasa hispida] | e value: 7.5e-279 | % identity: 82.73
Query:  MSWVRGKSSGWAAFNLKQQNNGLQDEVDPDPFPPMSTTLSSLPPRENLHGVNGRSGRSFSFAPVPSADSLTVPGKFGAKKTTSENFGAKKTILGASNIQS
        MSWV+GKS GWAAFNLKQQNNGLQDEVD DPFPP+STTLSSLPP EN H VNGRSGRSFSFAP PSA+SLT P KF AKKTT EN GAKKTIL  SN+Q+
Subjt:  MSWVRGKSSGWAAFNLKQQNNGLQDEVDPDPFPPMSTTLSSLPPRENLHGVNGRSGRSFSFAPVPSADSLTVPGKFGAKKTTSENFGAKKTILGASNIQS

Query:  GKKMVEETNDVLSFWKLKELHSWADISLIMDIMEAVNNDFNEASTLLNAMVSSDNLEINNEMSALGLHSSNDLSWVTGKSPGWEEFNLKQRNRGLQNETG
        GKK+VEET DVLSFWKLKELHSWADISLIMD+MEAVNN+F+EASTLL  MV+SDN EINNEMS LGL  SNDLSWVTG  PGWEEFNLKQ NRGLQ+ET 
Subjt:  GKKMVEETNDVLSFWKLKELHSWADISLIMDIMEAVNNDFNEASTLLNAMVSSDNLEINNEMSALGLHSSNDLSWVTGKSPGWEEFNLKQRNRGLQNETG

Query:  LEAFPPMLTDHS-----------FG----SFASEPLPSADSLTSSENYGAKKTIPDDSSIQSGKKVVEENTDVLAFWKLKEIHSWADFSLIVDIMDAVNN
        LE  PPMLT HS           +G    SF+S P  SADSLTS ENYGAKKTIPDDSSIQSGKKVVEE+ D LAFWKLKE+HSWADFSLIVDIM+AVNN
Subjt:  LEAFPPMLTDHS-----------FG----SFASEPLPSADSLTSSENYGAKKTIPDDSSIQSGKKVVEENTDVLAFWKLKEIHSWADFSLIVDIMDAVNN

Query:  NFDEASTLLRTMVSSDNFEINNEISTLGLHSANDLLCNENNDVSISSERMINAPILSSTLKDVRGAHQNNSTGGENYTKLFVNNYFERNFFHNAGNSKIA
        NF+EASTLL+TMVSSDNF+IN+E+STL L SANDLLCN  NDVS S ER  N PI SSTLKDV+G HQNN+   ENYTKLF NNYFERNFFHNAG  KI 
Subjt:  NFDEASTLLRTMVSSDNFEINNEISTLGLHSANDLLCNENNDVSISSERMINAPILSSTLKDVRGAHQNNSTGGENYTKLFVNNYFERNFFHNAGNSKIA

Query:  LGCPNSVPIEPEWEEDDVYLTHRKDAIAMMRSASQHSRAATNAYRRKDHASAKFHSSRAQEQWLAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAAEA
        LG   SVPIEPEWEEDD+YL+HRKDAIAMMRSASQHSRAATNAY RKDHASAK+HSSRAQEQWLAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAAEA
Subjt:  LGCPNSVPIEPEWEEDDVYLTHRKDAIAMMRSASQHSRAATNAYRRKDHASAKFHSSRAQEQWLAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAAEA

Query:  VQALQDHLLKIETQNASNRSLSPKKAERKGFQRASSLEYLSCMDSKLDKESPSSRHRPTSLEVITGIGKHSKGEAALPKAVTSFLTENGYRFEQLRPGTI
        VQALQ+HLLKIET+NASNRSLSPKK+ERKGFQ ASSLEYLSCMDSK+DKESPSSRHRPTSLEVITGIGKHS+GEA LPKAVTSFL+ENGYRFEQLRPGTI
Subjt:  VQALQDHLLKIETQNASNRSLSPKKAERKGFQRASSLEYLSCMDSKLDKESPSSRHRPTSLEVITGIGKHSKGEAALPKAVTSFLTENGYRFEQLRPGTI

Query:  SVRPKFRR
        S+RPKFRR
Subjt:  SVRPKFRR

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KA90 Smr domain-containing protein | e value: 4.4e-301 | % identity: 87.99
Query:  MSWVRGKSSGWAAFNLKQQNNGLQDEVDPDPFPPMSTTLSSLPPRENLHGVNGRSGRSFSFAPVPSADSLTVPGKFGAKKTTSENFGAKKTILGASNIQS
        MSWVRGKSSGWAAFNLKQQNNGLQDEVD DPFPPMSTTLSSLPPRENL GVNG SG+SFS AP+PSADS T+P KFGAKKTT  NFGAKKTILG +NIQS
Subjt:  MSWVRGKSSGWAAFNLKQQNNGLQDEVDPDPFPPMSTTLSSLPPRENLHGVNGRSGRSFSFAPVPSADSLTVPGKFGAKKTTSENFGAKKTILGASNIQS

Query:  GKKMVEETNDVLSFWKLKELHSWADISLIMDIMEAVNNDFNEASTLLNAMVSSDNLEINNEMSALGLHSSNDLSWVTGKSPGWEEFNLKQRNRGLQNETG
        GKK+VEETNDVLSFWKLKELH WADISLIMDIMEAVNNDFNEASTLLN MVSSDNLEINN+MS LGLHSSNDL W+ GKSPGWEEFNLKQ N+GLQ+E  
Subjt:  GKKMVEETNDVLSFWKLKELHSWADISLIMDIMEAVNNDFNEASTLLNAMVSSDNLEINNEMSALGLHSSNDLSWVTGKSPGWEEFNLKQRNRGLQNETG

Query:  LEAFPPMLTDHS-----------FG----SFASEPLPSADSLTSSENYGAKKTIPDDSSIQSGKKVVEENTDVLAFWKLKEIHSWADFSLIVDIMDAVNN
        LEAFPPMLT+ S           +G    SFASEPLPS DSLTS ENYGAK TI DDSSIQSGKKVVEENTDVLAFWKLKEIHSWADFSLIVDIMDAVNN
Subjt:  LEAFPPMLTDHS-----------FG----SFASEPLPSADSLTSSENYGAKKTIPDDSSIQSGKKVVEENTDVLAFWKLKEIHSWADFSLIVDIMDAVNN

Query:  NFDEASTLLRTMVSSDNFEINNEISTLGLHSANDLLCNENNDVSISSERMINAPILSSTLKDVRGAHQNNSTGGENYTKLFVNNYFERNFFHNAGNSKIA
        NFDEASTLL+TMVSSDNFEINNEISTLGLHSANDLLCN NNDVSI+SERMINAPILSST+K V+G HQNN+T  E+YTKLF N+YFERN FHN GNSKIA
Subjt:  NFDEASTLLRTMVSSDNFEINNEISTLGLHSANDLLCNENNDVSISSERMINAPILSSTLKDVRGAHQNNSTGGENYTKLFVNNYFERNFFHNAGNSKIA

Query:  LGCPNSVPIEPEWEEDDVYLTHRKDAIAMMRSASQHSRAATNAYRRKDHASAKFHSSRAQEQWLAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAAEA
        LGC  SVPIEPEWEEDD+YL+HRKDAIAMMRSASQHSRAATNAYRRKDHASAK+HSSRA+EQWLAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAAEA
Subjt:  LGCPNSVPIEPEWEEDDVYLTHRKDAIAMMRSASQHSRAATNAYRRKDHASAKFHSSRAQEQWLAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAAEA

Query:  VQALQDHLLKIETQNASNRSLSPKKAERKGFQRASSLEYLSCMDSKLDKESPSSRHRPTSLEVITGIGKHSKGEAALPKAVTSFLTENGYRFEQLRPGTI
        VQAL DHLLKIETQNASNRSLSPKKAERKGFQRASSLEYLSCM+SKLDKESPSSRHRPTSLEVITGIGKHSKGEAALPKAV SFLTENGYRFEQ RPGTI
Subjt:  VQALQDHLLKIETQNASNRSLSPKKAERKGFQRASSLEYLSCMDSKLDKESPSSRHRPTSLEVITGIGKHSKGEAALPKAVTSFLTENGYRFEQLRPGTI

Query:  SVRPKFRR
        SVRPKFRR
Subjt:  SVRPKFRR

A0A1S3BRS7 uncharacterized protein LOC103492590 | e value: 1.6e-303 | % identity: 89.14
Query:  MSWVRGKSSGWAAFNLKQQNNGLQDEVDPDPFPPMSTTLSSLPPRENLHGVNGRSGRSFSFAPVPSADSLTVPGKFGAKKTTSENFGAKKTILGASNIQS
        MSWVRGKSSGWAAFNLKQQNNG+QDEVD DPFPPMSTTLSSLPPRENL GVNGRSGRSFSFAP+PSADS T+PGK GAKKTT  NF AKKTILGASNIQS
Subjt:  MSWVRGKSSGWAAFNLKQQNNGLQDEVDPDPFPPMSTTLSSLPPRENLHGVNGRSGRSFSFAPVPSADSLTVPGKFGAKKTTSENFGAKKTILGASNIQS

Query:  GKKMVEETNDVLSFWKLKELHSWADISLIMDIMEAVNNDFNEASTLLNAMVSSDNLEINNEMSALGLHSSNDLSWVTGKSPGWEEFNLKQRNRGLQNETG
        GKKMVEETNDVLSFWKLKELH WADISLIMDIMEAVNNDFNEASTLLN MVSSDNLEINNEMS LGLHSSNDLSW+ GKSPGWEEFNL+Q NRGLQ E  
Subjt:  GKKMVEETNDVLSFWKLKELHSWADISLIMDIMEAVNNDFNEASTLLNAMVSSDNLEINNEMSALGLHSSNDLSWVTGKSPGWEEFNLKQRNRGLQNETG

Query:  LEAFPPMLTDHS-----------FG----SFASEPLPSADSLTSSENYGAKKTIPDDSSIQSGKKVVEENTDVLAFWKLKEIHSWADFSLIVDIMDAVNN
         EAFPPMLT+H            +G    SFASEPLPSADSLTS  NYGAK TIPDDS IQSGKKVVEENTDVLAFWKLKEIHSWADFSLIVDIMDAVNN
Subjt:  LEAFPPMLTDHS-----------FG----SFASEPLPSADSLTSSENYGAKKTIPDDSSIQSGKKVVEENTDVLAFWKLKEIHSWADFSLIVDIMDAVNN

Query:  NFDEASTLLRTMVSSDNFEINNEISTLGLHSANDLLCNENNDVSISSERMINAPILSSTLKDVRGAHQNNSTGGENYTKLFVNNYFERNFFHNAGNSKIA
        NFDEASTLL+TMVSSDNFEINNEISTLGLHSANDLLCN +NDVSISSER IN PILS TLK  +G HQN++TGGE+ TKLFVN+YFERNFF NAGNSKIA
Subjt:  NFDEASTLLRTMVSSDNFEINNEISTLGLHSANDLLCNENNDVSISSERMINAPILSSTLKDVRGAHQNNSTGGENYTKLFVNNYFERNFFHNAGNSKIA

Query:  LGCPNSVPIEPEWEEDDVYLTHRKDAIAMMRSASQHSRAATNAYRRKDHASAKFHSSRAQEQWLAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAAEA
        LGC  SVPIEPEWEEDD+YL+HRKDAIAMMRSASQHSRAATNAYRRKDHASAK+HSSRAQEQWLAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAAEA
Subjt:  LGCPNSVPIEPEWEEDDVYLTHRKDAIAMMRSASQHSRAATNAYRRKDHASAKFHSSRAQEQWLAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAAEA

Query:  VQALQDHLLKIETQNASNRSLSPKKAERKGFQRASSLEYLSCMDSKLDKESPSSRHRPTSLEVITGIGKHSKGEAALPKAVTSFLTENGYRFEQLRPGTI
        VQALQDHLLKIETQNASNRSLSPKKAERKGFQRASSLEYLSCMD+KLDKESPSSRHRPTSLEVITGIGKHSKGEAALPKAVTSFLTENGYRFEQ RPGTI
Subjt:  VQALQDHLLKIETQNASNRSLSPKKAERKGFQRASSLEYLSCMDSKLDKESPSSRHRPTSLEVITGIGKHSKGEAALPKAVTSFLTENGYRFEQLRPGTI

Query:  SVRPKFRR
        SVRPKFRR
Subjt:  SVRPKFRR

A0A5D3CAF0 Smr (Small MutS Related) domain-containing protein, putative isoform 1 | e value: 2.8e-303 | % identity: 89.31
Query:  MSWVRGKSSGWAAFNLKQQNNGLQDEVDPDPFPPMSTTLSSLPPRENLHGVNGRSGRSFSFAPVPSADSLTVPGKFGAKKTTSENFGAKKTILGASNIQS
        MSWVRGKSSGWAAFNLKQQNNG+QDEVD DPFPPMSTTLSSLPPRENL GVNGRSGRSFSFAP+PSADS T+PGK GAKKTT  NF AKKTILGASNIQS
Subjt:  MSWVRGKSSGWAAFNLKQQNNGLQDEVDPDPFPPMSTTLSSLPPRENLHGVNGRSGRSFSFAPVPSADSLTVPGKFGAKKTTSENFGAKKTILGASNIQS

Query:  GKKMVEETNDVLSFWKLKELHSWADISLIMDIMEAVNNDFNEASTLLNAMVSSDNLEINNEMSALGLHSSNDLSWVTGKSPGWEEFNLKQRNRGLQNETG
        GKKMVEETNDVLSFWKLKELH WADISLIMDIMEAVNNDFNEASTLLN MVSSDNLEINNEMS LGLHSSNDLSW+ GKSPGWEEFNL+Q NRGLQ E  
Subjt:  GKKMVEETNDVLSFWKLKELHSWADISLIMDIMEAVNNDFNEASTLLNAMVSSDNLEINNEMSALGLHSSNDLSWVTGKSPGWEEFNLKQRNRGLQNETG

Query:  LEAFPPMLTDHS-----------FG----SFASEPLPSADSLTSSENYGAKKTIPDDSSIQSGKKVVEENTDVLAFWKLKEIHSWADFSLIVDIMDAVNN
         EAFPPMLT+H            +G    SFASEPLPSADSLTS  NYGAK TIPDDS IQSGKKVVEENTDVLAFWKLKEIHSWADFSLIVDIMDAVNN
Subjt:  LEAFPPMLTDHS-----------FG----SFASEPLPSADSLTSSENYGAKKTIPDDSSIQSGKKVVEENTDVLAFWKLKEIHSWADFSLIVDIMDAVNN

Query:  NFDEASTLLRTMVSSDNFEINNEISTLGLHSANDLLCNENNDVSISSERMINAPILSSTLKDVRGAHQNNSTGGENYTKLFVNNYFERNFFHNAGNSKIA
        NFDEASTLL+TMVSSDNFEINNEISTLGLH ANDLLCN +NDVSISSER IN PILS TLK  +G HQN++TGGE+ TKLFVN+YFERNFF NAGNSKIA
Subjt:  NFDEASTLLRTMVSSDNFEINNEISTLGLHSANDLLCNENNDVSISSERMINAPILSSTLKDVRGAHQNNSTGGENYTKLFVNNYFERNFFHNAGNSKIA

Query:  LGCPNSVPIEPEWEEDDVYLTHRKDAIAMMRSASQHSRAATNAYRRKDHASAKFHSSRAQEQWLAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAAEA
        LGC  SVPIEPEWEEDDVYL+HRKDAIAMMRSASQHSRAATNAYRRKDHASAK+HSSRAQEQWLAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAAEA
Subjt:  LGCPNSVPIEPEWEEDDVYLTHRKDAIAMMRSASQHSRAATNAYRRKDHASAKFHSSRAQEQWLAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAAEA

Query:  VQALQDHLLKIETQNASNRSLSPKKAERKGFQRASSLEYLSCMDSKLDKESPSSRHRPTSLEVITGIGKHSKGEAALPKAVTSFLTENGYRFEQLRPGTI
        VQALQDHLLKIETQNASNRSLSPKKAERKGFQRASSLEYLSCMDSKLDKESPSSRHRPTSLEVITGIGKHSKGEAALPKAVTSFLTENGYRFEQ RPGTI
Subjt:  VQALQDHLLKIETQNASNRSLSPKKAERKGFQRASSLEYLSCMDSKLDKESPSSRHRPTSLEVITGIGKHSKGEAALPKAVTSFLTENGYRFEQLRPGTI

Query:  SVRPKFRR
        SVRPKFRR
Subjt:  SVRPKFRR

A0A6J1GN51 uncharacterized protein LOC111455928 isoform X1 | e value: 3.9e-249 | % identity: 76.07
Query:  MSWVRGKSSGWAAFNLKQQNNGLQDEVDPDPFPPMSTTLSSLPPRENLHGVNGRSGRSFSFAPVPSADSLTVPGKFGAKKTTSENFGAKKTILGASNIQS
        MSW RGKS GWAA NLKQQN+GLQDE+DPDPFPPMST LS LPPREN+H VNGRSGRSFS  P+PSADSL  P          ENFG KKTI G S+I+S
Subjt:  MSWVRGKSSGWAAFNLKQQNNGLQDEVDPDPFPPMSTTLSSLPPRENLHGVNGRSGRSFSFAPVPSADSLTVPGKFGAKKTTSENFGAKKTILGASNIQS

Query:  GKKMVEETNDVLSFWKLKELHSWADISLIMDIMEAVNNDFNEASTLLNAMVSSDNLEINNEMSALGLHSSNDLSWVTGKSPGWEEFNLKQRNRGLQNETG
        GKK+VEE+ DVL+FWKLKELHSWADISLI+DIMEAVNN+FNEAS LL  MVSSDN EINNEMS LGLHSSND+S V GKSPGWEEFNLKQ+NRGLQ+   
Subjt:  GKKMVEETNDVLSFWKLKELHSWADISLIMDIMEAVNNDFNEASTLLNAMVSSDNLEINNEMSALGLHSSNDLSWVTGKSPGWEEFNLKQRNRGLQNETG

Query:  LEAFPPM---------------LTDHSFGSFASEPLPSADSLTSSENYGAKKTIPDDSSIQSGKKVVEENTDVLAFWKLKEIHSWADFSLIVDIMDAVNN
         + FPPM               +      S +S PLPSADSLT  ENY AKK I  DSSIQ+G+KVVEE TDVLAFWKLKE+H+WADFSLIVDIM+AV+N
Subjt:  LEAFPPM---------------LTDHSFGSFASEPLPSADSLTSSENYGAKKTIPDDSSIQSGKKVVEENTDVLAFWKLKEIHSWADFSLIVDIMDAVNN

Query:  NFDEASTLLRTMVSSDNFEINNEISTLGLHSANDLLCNENNDVSISSERMINAPILSSTLKDVRGAHQNNSTGGENYTKLFVNNYFERNFFHNAGNSKIA
        NF+EAST L  MVSSDN EI NE+STLGLHSA+ L CN  NDV+IS  R +N PI SSTLKDV+  HQN +       KLF NNY ERNFFHN GN KIA
Subjt:  NFDEASTLLRTMVSSDNFEINNEISTLGLHSANDLLCNENNDVSISSERMINAPILSSTLKDVRGAHQNNSTGGENYTKLFVNNYFERNFFHNAGNSKIA

Query:  LGCPNSVPIEPEWEEDDVYLTHRKDAIAMMRSASQHSRAATNAYRRKDHASAKFHSSRAQEQWLAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAAEA
        L C  S PIEPEWEEDD+YL+HRKDAIAMMRSASQHSRAATNAY RKDHASAK+HSSRAQEQWLAAKMLN KAANEILQTRNS+NGLWKLDLHGLHAAEA
Subjt:  LGCPNSVPIEPEWEEDDVYLTHRKDAIAMMRSASQHSRAATNAYRRKDHASAKFHSSRAQEQWLAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAAEA

Query:  VQALQDHLLKIETQNASNRSLSPKKAERKGFQRASSLEYLSCMDSKLDKE--SPSSRHRPTSLEVITGIGKHSKGEAALPKAVTSFLTENGYRFEQLRPG
        VQALQDHLLKIET+NASNRSLSPKKAERKGF R SSLEYLSCM  KLDKE  SP  RHRPTSLEVITG+GKHS+GEAALPKAVTSFL+ENGYRFEQLRPG
Subjt:  VQALQDHLLKIETQNASNRSLSPKKAERKGFQRASSLEYLSCMDSKLDKE--SPSSRHRPTSLEVITGIGKHSKGEAALPKAVTSFLTENGYRFEQLRPG

Query:  TISVRPKFRR
        TISVRPKFRR
Subjt:  TISVRPKFRR

A0A6J1GPE7 uncharacterized protein LOC111455928 isoform X2 | e value: 1.0e-241 | % identity: 74.43
Query:  MSWVRGKSSGWAAFNLKQQNNGLQDEVDPDPFPPMSTTLSSLPPRENLHGVNGRSGRSFSFAPVPSADSLTVPGKFGAKKTTSENFGAKKTILGASNIQS
        MSW RGKS GWAA NLKQQN+GLQDE+DPDPFPPMST LS LPPREN+H VNGRSGRS                       + ENFG KKTI G S+I+S
Subjt:  MSWVRGKSSGWAAFNLKQQNNGLQDEVDPDPFPPMSTTLSSLPPRENLHGVNGRSGRSFSFAPVPSADSLTVPGKFGAKKTTSENFGAKKTILGASNIQS

Query:  GKKMVEETNDVLSFWKLKELHSWADISLIMDIMEAVNNDFNEASTLLNAMVSSDNLEINNEMSALGLHSSNDLSWVTGKSPGWEEFNLKQRNRGLQNETG
        GKK+VEE+ DVL+FWKLKELHSWADISLI+DIMEAVNN+FNEAS LL  MVSSDN EINNEMS LGLHSSND+S V GKSPGWEEFNLKQ+NRGLQ+   
Subjt:  GKKMVEETNDVLSFWKLKELHSWADISLIMDIMEAVNNDFNEASTLLNAMVSSDNLEINNEMSALGLHSSNDLSWVTGKSPGWEEFNLKQRNRGLQNETG

Query:  LEAFPPM---------------LTDHSFGSFASEPLPSADSLTSSENYGAKKTIPDDSSIQSGKKVVEENTDVLAFWKLKEIHSWADFSLIVDIMDAVNN
         + FPPM               +      S +S PLPSADSLT  ENY AKK I  DSSIQ+G+KVVEE TDVLAFWKLKE+H+WADFSLIVDIM+AV+N
Subjt:  LEAFPPM---------------LTDHSFGSFASEPLPSADSLTSSENYGAKKTIPDDSSIQSGKKVVEENTDVLAFWKLKEIHSWADFSLIVDIMDAVNN

Query:  NFDEASTLLRTMVSSDNFEINNEISTLGLHSANDLLCNENNDVSISSERMINAPILSSTLKDVRGAHQNNSTGGENYTKLFVNNYFERNFFHNAGNSKIA
        NF+EAST L  MVSSDN EI NE+STLGLHSA+ L CN  NDV+IS  R +N PI SSTLKDV+  HQN +       KLF NNY ERNFFHN GN KIA
Subjt:  NFDEASTLLRTMVSSDNFEINNEISTLGLHSANDLLCNENNDVSISSERMINAPILSSTLKDVRGAHQNNSTGGENYTKLFVNNYFERNFFHNAGNSKIA

Query:  LGCPNSVPIEPEWEEDDVYLTHRKDAIAMMRSASQHSRAATNAYRRKDHASAKFHSSRAQEQWLAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAAEA
        L C  S PIEPEWEEDD+YL+HRKDAIAMMRSASQHSRAATNAY RKDHASAK+HSSRAQEQWLAAKMLN KAANEILQTRNS+NGLWKLDLHGLHAAEA
Subjt:  LGCPNSVPIEPEWEEDDVYLTHRKDAIAMMRSASQHSRAATNAYRRKDHASAKFHSSRAQEQWLAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAAEA

Query:  VQALQDHLLKIETQNASNRSLSPKKAERKGFQRASSLEYLSCMDSKLDKE--SPSSRHRPTSLEVITGIGKHSKGEAALPKAVTSFLTENGYRFEQLRPG
        VQALQDHLLKIET+NASNRSLSPKKAERKGF R SSLEYLSCM  KLDKE  SP  RHRPTSLEVITG+GKHS+GEAALPKAVTSFL+ENGYRFEQLRPG
Subjt:  VQALQDHLLKIETQNASNRSLSPKKAERKGFQRASSLEYLSCMDSKLDKE--SPSSRHRPTSLEVITGIGKHSKGEAALPKAVTSFLTENGYRFEQLRPG

Query:  TISVRPKFRR
        TISVRPKFRR
Subjt:  TISVRPKFRR

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT5G23520.1 smr (Small MutS Related) domain-containing protein | e value: 3.1e-73 | % identity: 41.83
Query:  LSWVTGKSPGWEEFNLKQRNR-GLQNETGLEAFPPMLT--DHSFG--------------SFASEPLPSA--DSLT------SSENYGAKKTIPDDSSIQS
        +SW+ GKS GW  F+LKQR + GL++E   + FPP+ T  + SFG              SF+S  LP +   +LT      + E  G  +  PD  S+  
Subjt:  LSWVTGKSPGWEEFNLKQRNR-GLQNETGLEAFPPMLT--DHSFG--------------SFASEPLPSA--DSLT------SSENYGAKKTIPDDSSIQS

Query:  GKKVVEENTDVLAFWKLKEIHSWADFSLIVDIMDAVNNNFDEASTLLRTMVSSDNFEINNEISTLGLHSANDLLCNENNDVSISSERMINAPILSSTLKD
               N+  LAF KLKE++SWAD +LI D++ +  ++F+ A   L+ MVSS   +        G  S N        + +++S   + A    ST +D
Subjt:  GKKVVEENTDVLAFWKLKEIHSWADFSLIVDIMDAVNNNFDEASTLLRTMVSSDNFEINNEISTLGLHSANDLLCNENNDVSISSERMINAPILSSTLKD

Query:  VRGAHQNNSTGGENYTKLFVNNYFERNFFHNAGNSKIALGCPNSVPIEPEWEEDDVYLTHRKDAIAMMRSASQHSRAATNAYRRKDHASAKFHSSRAQEQ
               NS G    +   VN      F  +       +    S+PIEPEWEEDD+YL+HRKDA+ +MRSAS HSRAA NA++R DHASAK HS +A+E 
Subjt:  VRGAHQNNSTGGENYTKLFVNNYFERNFFHNAGNSKIALGCPNSVPIEPEWEEDDVYLTHRKDAIAMMRSASQHSRAATNAYRRKDHASAKFHSSRAQEQ

Query:  WLAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAAEAVQALQDHLLKIETQNASNRSLSPKKAERKGFQ-RASSLEYLSCMDSK-LDKESPSSRHRPTS
        WLAA+ LN +AA +I+   N  N +WKLDLHGLHA EAVQALQ+ L  IE     NRS+SP +   K    R++S E    +D + +  +  SSR    S
Subjt:  WLAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAAEAVQALQDHLLKIETQNASNRSLSPKKAERKGFQ-RASSLEYLSCMDSK-LDKESPSSRHRPTS

Query:  LEVITGIGKHSKGEAALPKAVTSFLTENGYRFEQLRPGTISVRPKFR
        L+VITGIGKHS+G+A+LP AV +F  +N YRF++ RPG I+VRPKFR
Subjt:  LEVITGIGKHSKGEAALPKAVTSFLTENGYRFEQLRPGTISVRPKFR


Sequences
CDS sequence
ATGTCGTGGGTGAGGGGTAAATCTTCTGGTTGGGCAGCTTTTAATCTTAAGCAACAGAATAATGGCCTTCAGGATGAAGTTGACCCGGATCCATTCCCACCAATGTCAAC
TACCCTCTCCTCTCTGCCACCCCGTGAAAACTTGCACGGAGTTAATGGTCGTTCAGGGAGATCTTTCTCATTTGCTCCTGTTCCTTCGGCCGATTCTCTGACTGTACCAG
GAAAATTTGGTGCAAAAAAGACAACATCAGAAAATTTTGGTGCAAAAAAGACGATACTTGGTGCTTCTAACATTCAAAGTGGCAAGAAGATGGTTGAAGAAACCAATGAT
GTTTTATCCTTTTGGAAGCTTAAAGAGCTTCATTCCTGGGCTGATATTAGCTTGATTATGGATATAATGGAAGCTGTAAATAATGACTTCAATGAGGCATCTACTTTATT
AAATGCAATGGTCTCTAGTGACAATTTAGAGATCAATAATGAGATGAGTGCCTTAGGACTGCATTCCTCTAATGATCTATCGTGGGTGACGGGTAAATCTCCTGGCTGGG
AAGAGTTTAACCTTAAGCAACGTAATAGAGGCCTTCAAAATGAAACGGGCCTGGAAGCTTTCCCACCAATGCTAACTGACCATTCCTTTGGATCCTTCGCATCTGAACCC
CTTCCTTCTGCCGATTCTCTGACTTCTTCAGAAAATTATGGTGCAAAGAAGACAATACCTGATGATTCTAGCATTCAAAGTGGCAAGAAGGTGGTTGAAGAGAACACTGA
TGTTTTAGCCTTTTGGAAGCTTAAAGAGATTCATTCTTGGGCCGACTTTAGCTTGATTGTGGATATAATGGACGCTGTGAATAATAACTTCGATGAGGCATCTACTTTAT
TAAGAACAATGGTTTCAAGTGACAATTTTGAGATCAATAATGAGATAAGCACCTTAGGACTGCATTCCGCTAATGATTTGTTGTGCAATGAGAATAATGATGTAAGCATA
TCATCAGAAAGAATGATCAATGCTCCCATCCTTAGTTCCACACTTAAGGATGTGCGAGGGGCGCATCAAAACAATAGTACGGGTGGAGAAAATTATACTAAATTGTTTGT
AAATAATTATTTTGAAAGAAATTTCTTTCATAATGCTGGAAATTCAAAAATAGCTCTTGGTTGCCCAAATTCTGTTCCTATTGAGCCTGAGTGGGAAGAAGATGATGTTT
ACCTGACCCATCGGAAAGATGCCATAGCAATGATGAGGTCTGCATCTCAACATTCAAGGGCAGCCACTAATGCCTATCGTAGGAAGGACCATGCTTCTGCCAAGTTTCAT
TCTTCAAGGGCTCAAGAACAATGGCTTGCTGCAAAAATGTTGAATGATAAGGCGGCTAATGAAATTTTACAAACAAGGAATAGTAAAAATGGGCTCTGGAAGTTGGACTT
ACATGGGCTCCATGCAGCCGAGGCTGTTCAAGCCTTGCAAGACCACTTGCTGAAAATTGAAACTCAGAATGCCTCCAATAGGTCCTTGTCACCAAAGAAAGCTGAAAGGA
AAGGATTCCAACGTGCTTCATCCCTTGAGTATCTTAGTTGTATGGACTCAAAGTTGGACAAAGAATCACCATCATCTAGGCATAGGCCGACATCATTGGAAGTCATAACA
GGTATAGGTAAACATAGTAAGGGGGAAGCTGCTCTACCAAAGGCTGTGACAAGTTTTCTTACTGAAAATGGGTACCGTTTCGAACAGTTAAGGCCTGGAACGATTAGTGT
CCGACCAAAATTTCGTAGGTAA
mRNA sequence
ATGTCGTGGGTGAGGGGTAAATCTTCTGGTTGGGCAGCTTTTAATCTTAAGCAACAGAATAATGGCCTTCAGGATGAAGTTGACCCGGATCCATTCCCACCAATGTCAAC
TACCCTCTCCTCTCTGCCACCCCGTGAAAACTTGCACGGAGTTAATGGTCGTTCAGGGAGATCTTTCTCATTTGCTCCTGTTCCTTCGGCCGATTCTCTGACTGTACCAG
GAAAATTTGGTGCAAAAAAGACAACATCAGAAAATTTTGGTGCAAAAAAGACGATACTTGGTGCTTCTAACATTCAAAGTGGCAAGAAGATGGTTGAAGAAACCAATGAT
GTTTTATCCTTTTGGAAGCTTAAAGAGCTTCATTCCTGGGCTGATATTAGCTTGATTATGGATATAATGGAAGCTGTAAATAATGACTTCAATGAGGCATCTACTTTATT
AAATGCAATGGTCTCTAGTGACAATTTAGAGATCAATAATGAGATGAGTGCCTTAGGACTGCATTCCTCTAATGATCTATCGTGGGTGACGGGTAAATCTCCTGGCTGGG
AAGAGTTTAACCTTAAGCAACGTAATAGAGGCCTTCAAAATGAAACGGGCCTGGAAGCTTTCCCACCAATGCTAACTGACCATTCCTTTGGATCCTTCGCATCTGAACCC
CTTCCTTCTGCCGATTCTCTGACTTCTTCAGAAAATTATGGTGCAAAGAAGACAATACCTGATGATTCTAGCATTCAAAGTGGCAAGAAGGTGGTTGAAGAGAACACTGA
TGTTTTAGCCTTTTGGAAGCTTAAAGAGATTCATTCTTGGGCCGACTTTAGCTTGATTGTGGATATAATGGACGCTGTGAATAATAACTTCGATGAGGCATCTACTTTAT
TAAGAACAATGGTTTCAAGTGACAATTTTGAGATCAATAATGAGATAAGCACCTTAGGACTGCATTCCGCTAATGATTTGTTGTGCAATGAGAATAATGATGTAAGCATA
TCATCAGAAAGAATGATCAATGCTCCCATCCTTAGTTCCACACTTAAGGATGTGCGAGGGGCGCATCAAAACAATAGTACGGGTGGAGAAAATTATACTAAATTGTTTGT
AAATAATTATTTTGAAAGAAATTTCTTTCATAATGCTGGAAATTCAAAAATAGCTCTTGGTTGCCCAAATTCTGTTCCTATTGAGCCTGAGTGGGAAGAAGATGATGTTT
ACCTGACCCATCGGAAAGATGCCATAGCAATGATGAGGTCTGCATCTCAACATTCAAGGGCAGCCACTAATGCCTATCGTAGGAAGGACCATGCTTCTGCCAAGTTTCAT
TCTTCAAGGGCTCAAGAACAATGGCTTGCTGCAAAAATGTTGAATGATAAGGCGGCTAATGAAATTTTACAAACAAGGAATAGTAAAAATGGGCTCTGGAAGTTGGACTT
ACATGGGCTCCATGCAGCCGAGGCTGTTCAAGCCTTGCAAGACCACTTGCTGAAAATTGAAACTCAGAATGCCTCCAATAGGTCCTTGTCACCAAAGAAAGCTGAAAGGA
AAGGATTCCAACGTGCTTCATCCCTTGAGTATCTTAGTTGTATGGACTCAAAGTTGGACAAAGAATCACCATCATCTAGGCATAGGCCGACATCATTGGAAGTCATAACA
GGTATAGGTAAACATAGTAAGGGGGAAGCTGCTCTACCAAAGGCTGTGACAAGTTTTCTTACTGAAAATGGGTACCGTTTCGAACAGTTAAGGCCTGGAACGATTAGTGT
CCGACCAAAATTTCGTAGGTAA
Protein sequence
MSWVRGKSSGWAAFNLKQQNNGLQDEVDPDPFPPMSTTLSSLPPRENLHGVNGRSGRSFSFAPVPSADSLTVPGKFGAKKTTSENFGAKKTILGASNIQSGKKMVEETND
VLSFWKLKELHSWADISLIMDIMEAVNNDFNEASTLLNAMVSSDNLEINNEMSALGLHSSNDLSWVTGKSPGWEEFNLKQRNRGLQNETGLEAFPPMLTDHSFGSFASEP
LPSADSLTSSENYGAKKTIPDDSSIQSGKKVVEENTDVLAFWKLKEIHSWADFSLIVDIMDAVNNNFDEASTLLRTMVSSDNFEINNEISTLGLHSANDLLCNENNDVSI
SSERMINAPILSSTLKDVRGAHQNNSTGGENYTKLFVNNYFERNFFHNAGNSKIALGCPNSVPIEPEWEEDDVYLTHRKDAIAMMRSASQHSRAATNAYRRKDHASAKFH
SSRAQEQWLAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAAEAVQALQDHLLKIETQNASNRSLSPKKAERKGFQRASSLEYLSCMDSKLDKESPSSRHRPTSLEVIT
GIGKHSKGEAALPKAVTSFLTENGYRFEQLRPGTISVRPKFRR
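
The CDS and protein sequences above appear to be related by ordinary translation: the CDS opens with ATG TCG TGG and the protein with MSW. The following is a minimal Python sketch for checking that relationship. It assumes the standard genetic code and reading frame 1, neither of which this record states; `translate` and `CODON_TABLE` are hypothetical names introduced only for illustration.

```python
from itertools import product

# Standard genetic code (an assumption; the record does not name a table).
# Codons are enumerated in TCAG order with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    # Strip line breaks and spaces so wrapped sequence text can be pasted in.
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        # Codons containing ambiguous bases are reported as 'X'.
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 24 bases of the CDS listed above.
print(translate("ATGTCGTGGGTGAGGGGTAAATCT"))  # MSWVRGKS
```

Pasting the full CDS into `translate` and comparing the result with the listed protein sequence is one way to confirm that the two fields agree.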