CuGenDBv2

Clc09G08170 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc09G08170
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Smr domain-containing protein
Genome location: ClcChr09:6784124..6791974
RNA-Seq Expression: Clc09G08170
Synteny: Clc09G08170
Gene Ontology terms: NA
InterPro domains: IPR002625 - Smr domain
                  IPR013899 - Domain of unknown function DUF1771
                  IPR036063 - Smr domain superfamily
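
The genome location above is written as `chromosome:start..end`. The sketch below parses that string and derives the span of the gene model. It assumes the coordinates are 1-based and inclusive, which this page does not state; the function name `parse_location` is hypothetical.

```python
import re

def parse_location(loc):
    """Split a 'chrom:start..end' string into (chrom, start, end, length)."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location: {loc!r}")
    chrom, start, end = m.group(1), int(m.group(2)), int(m.group(3))
    # Assumption: 1-based inclusive coordinates, so length = end - start + 1.
    return chrom, start, end, end - start + 1

chrom, start, end, length = parse_location("ClcChr09:6784124..6791974")
print(chrom, start, end, length)
```

Under that assumption the gene model spans 7851 bp on ClcChr09, introns and untranslated regions included.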


Homology
GenBank top hits (e value, % identity, alignment)
KAA0059625.1 Smr (Small MutS Related) domain-containing protein, putative isoform 1 [Cucumis melo var. makuwa] (e value: 7.4e-277; % identity: 81.79)
Query:  MSWVRGKSPGWAAFNLKEQNNGLQDEVDPDPFPPMSTTLSSLPPRENLHRVNGRSGRSFTFSPLPSTDSMTSPEKIG----------AKKTVLGASNIQN
        MSWVRGKS GWAAFNLK+QNNG+QDEVD DPFPPMSTTLSSLPPRENL  VNGRSGRSF+F+P+PS DS T P K G          AKKT+LGASNIQ+
Subjt:  MSWVRGKSPGWAAFNLKEQNNGLQDEVDPDPFPPMSTTLSSLPPRENLHRVNGRSGRSFTFSPLPSTDSMTSPEKIG----------AKKTVLGASNIQN

Query:  GKKVVEETADLLSVWKLKELHSWADISLIMDIMEAVNNNFNEASTLLKTMVSSDNFVINNEMSTLGLHFSNDLSWVRGKSPGGAEFNLKQHDRGLQDETD
        GKK+VEET D+LS WKLKELH WADISLIMDIMEAVNN+FNEASTLL TMVSSDN  INNEMS LGLH SNDLSW+ GKSPG  EFNL+QH+RGLQ E D
Subjt:  GKKVVEETADLLSVWKLKELHSWADISLIMDIMEAVNNNFNEASTLLKTMVSSDNFVINNEMSTLGLHFSNDLSWVRGKSPGGAEFNLKQHDRGLQDETD

Query:  PEPFPPMSTGNSSLPPCENLHGVYGCSGGSFASVPLPPADSLTSPENYGAEKTKPDDSISSIQSGKKVVEESTDVLAFWKLKELHSWADFSLIVDIMEAV
        PE FPPM T + SLPP ENLHGVYG  G SFAS PLP ADSLTSP NYGA+ T PDD  S IQSGKKVVEE+TDVLAFWKLKE+HSWADFSLIVDIM+AV
Subjt:  PEPFPPMSTGNSSLPPCENLHGVYGCSGGSFASVPLPPADSLTSPENYGAEKTKPDDSISSIQSGKKVVEESTDVLAFWKLKELHSWADFSLIVDIMEAV

Query:  NNNFDEASTLLKTMVSSDNFEVSNEMSTLGLHSANDLLYNEKNDLSTSLGRTVNTPILSSTLKDVQGVHKNNNACEENCTKLFENNYFERNFFHNVANTN
        NNNFDEASTLLKTMVSSDNFE++NE+STLGLH ANDLL N  ND+S S  RT+N PILS TLK  QG+H+N+N   E+CTKLF N+YFERNFF N  N+ 
Subjt:  NNNFDEASTLLKTMVSSDNFEVSNEMSTLGLHSANDLLYNEKNDLSTSLGRTVNTPILSSTLKDVQGVHKNNNACEENCTKLFENNYFERNFFHNVANTN

Query:  IALGSSKSVPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHTSAKYHSSRAQEQWQAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAA
        IALG SKSVPIEPEWEEDD+YLSHRKDAIAMMRSASQHSRAATNAY RKDH SAKYHSSRAQEQW AAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAA
Subjt:  IALGSSKSVPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHTSAKYHSSRAQEQWQAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAA

Query:  EAIQALQEHLLKIETQNASNRSLSPKKAERKGFQRASSLEYLSCLEYLSCMDSKLDKESPSSRPRPTSLEVITGIGKHSKGEAALPKAVTSFLSENGYRF
        EA+QALQ+HLLKIETQNASNRSLSPKKAERKGFQRASS      LEYLSCMDSKLDKESPSSR RPTSLEVITGIGKHSKGEAALPKAVTSFL+ENGYRF
Subjt:  EAIQALQEHLLKIETQNASNRSLSPKKAERKGFQRASSLEYLSCLEYLSCMDSKLDKESPSSRPRPTSLEVITGIGKHSKGEAALPKAVTSFLSENGYRF

Query:  EQLRPGTISVRPKFR
        EQ RPGTISVRPKFR
Subjt:  EQLRPGTISVRPKFR
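
In the alignment blocks, the unlabelled middle line marks each column: a letter is an identical residue, `+` is a conservative substitution, and a space is a mismatch or gap. The sketch below counts these for one block, assuming the usual BLAST convention that % identity is identities divided by alignment length. The helper name `midline_stats` is hypothetical, and the figure for a single block will not match the whole-alignment value reported in the hit header.

```python
def midline_stats(query, midline):
    """Return (identities, positives, length) for one alignment block."""
    # Trailing spaces on the midline are often stripped; pad to query length.
    midline = midline.ljust(len(query))
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives, len(query)

# Last block of the alignment above.
ident, pos, length = midline_stats("EQLRPGTISVRPKFR", "EQ RPGTISVRPKFR")
print(ident, pos, length, round(100 * ident / length, 2))
```
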

KAG6576051.1 hypothetical protein SDJN03_26690, partial [Cucurbita argyrosperma subsp. sororia] (e value: 2.6e-261; % identity: 79.24)
Query:  MSWVRGKSPGWAAFNLKEQNNGLQDEVDPDPFPPMSTTLSSLPPRENLHRVNGRSGRSFTFSPLPSTDSMTSPEKIGAKKTVLGASNIQNGKKVVEETAD
        MSW RGKSPGWAA NLK+ N+GLQDE+DPDPFPPMST LS LPPREN+HRVNGRSGRSF+ +PLPS DS+ SPE  GAKKT+ G S+IQ+ KK+VEE+ D
Subjt:  MSWVRGKSPGWAAFNLKEQNNGLQDEVDPDPFPPMSTTLSSLPPRENLHRVNGRSGRSFTFSPLPSTDSMTSPEKIGAKKTVLGASNIQNGKKVVEETAD

Query:  LLSVWKLKELHSWADISLIMDIMEAVNNNFNEASTLLKTMVSSDNFVINNEMSTLGLHFSNDLSWVRGKSPGGAEFNLKQHDRGLQDETDPEPFPPMSTG
        +L+ WKLKELHSWADISLI+DIMEAVNNNFNEAS LLKTMVSSDNF INNEMSTLGLH SND+S VRGKSPG  EFNLKQ +RGLQD  DP+PFPPM + 
Subjt:  LLSVWKLKELHSWADISLIMDIMEAVNNNFNEASTLLKTMVSSDNFVINNEMSTLGLHFSNDLSWVRGKSPGGAEFNLKQHDRGLQDETDPEPFPPMSTG

Query:  NSSLPPCENLHGVYGCSGGSFASVPLPPADSLTSPENYGAEKTKPDDSISSIQSGKKVVEESTDVLAFWKLKELHSWADFSLIVDIMEAVNNNFDEASTL
         SSLPP ENLHGV GC G S +S PLP ADSLTSPENY A+K   D   SSIQ+G+KVVEE+TDVLAFWKLKELH+WADFSLIVDIMEAV+NNF+EAST 
Subjt:  NSSLPPCENLHGVYGCSGGSFASVPLPPADSLTSPENYGAEKTKPDDSISSIQSGKKVVEESTDVLAFWKLKELHSWADFSLIVDIMEAVNNNFDEASTL

Query:  LKTMVSSDNFEVSNEMSTLGLHSANDLLYNEKNDLSTSLGRTVNTPILSSTLKDVQGVHKNNNACEENCTKLFENNYFERNFFHNVANTNIALGSSKSVP
        L  MVSSDN E+ NEMSTLGLHSA+ L  N KND++ SLGRTVN PI SSTLKDVQ +H+N N       KLFENNY ERNFFHNV N  IAL  SKS P
Subjt:  LKTMVSSDNFEVSNEMSTLGLHSANDLLYNEKNDLSTSLGRTVNTPILSSTLKDVQGVHKNNNACEENCTKLFENNYFERNFFHNVANTNIALGSSKSVP

Query:  IEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHTSAKYHSSRAQEQWQAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAAEAIQALQEHL
        IEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDH SAKYHSSRAQEQW AAKMLN KAANEILQTRNS+NGLWKLDLHGLHAAEA+QALQ+HL
Subjt:  IEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHTSAKYHSSRAQEQWQAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAAEAIQALQEHL

Query:  LKIETQNASNRSLSPKKAERKGFQRASSLEYLSCLEYLSCMDSKLDKE--SPSSRPRPTSLEVITGIGKHSKGEAALPKAVTSFLSENGYRFEQLRPGTI
        LKIET+NASNRSLSPKKAERKGF R SS      LEYLSCM  KLDKE  SP  R RPTSLEVITG+GKHS+GEAALPKAVTSFLSENGYRFEQLRPGTI
Subjt:  LKIETQNASNRSLSPKKAERKGFQRASSLEYLSCLEYLSCMDSKLDKE--SPSSRPRPTSLEVITGIGKHSKGEAALPKAVTSFLSENGYRFEQLRPGTI

Query:  SVRPKFR
        SVRPKFR
Subjt:  SVRPKFR

XP_004148966.1 uncharacterized protein LOC101223137 [Cucumis sativus] (e value: 3.5e-274; % identity: 80.81)
Query:  MSWVRGKSPGWAAFNLKEQNNGLQDEVDPDPFPPMSTTLSSLPPRENLHRVNGRSGRSFTFSPLPSTDSMTSPEK----------IGAKKTVLGASNIQN
        MSWVRGKS GWAAFNLK+QNNGLQDEVD DPFPPMSTTLSSLPPRENL  VNG SG+SF+ +P+PS DS T P K           GAKKT+LG +NIQ+
Subjt:  MSWVRGKSPGWAAFNLKEQNNGLQDEVDPDPFPPMSTTLSSLPPRENLHRVNGRSGRSFTFSPLPSTDSMTSPEK----------IGAKKTVLGASNIQN

Query:  GKKVVEETADLLSVWKLKELHSWADISLIMDIMEAVNNNFNEASTLLKTMVSSDNFVINNEMSTLGLHFSNDLSWVRGKSPGGAEFNLKQHDRGLQDETD
        GKK+VEET D+LS WKLKELH WADISLIMDIMEAVNN+FNEASTLL TMVSSDN  INN+MSTLGLH SNDL W+ GKSPG  EFNLKQH++GLQDE D
Subjt:  GKKVVEETADLLSVWKLKELHSWADISLIMDIMEAVNNNFNEASTLLKTMVSSDNFVINNEMSTLGLHFSNDLSWVRGKSPGGAEFNLKQHDRGLQDETD

Query:  PEPFPPMSTGNSSLPPCENLHGVYGCSGGSFASVPLPPADSLTSPENYGAEKTKPDDSISSIQSGKKVVEESTDVLAFWKLKELHSWADFSLIVDIMEAV
         E FPPM T  SSLPP ENLHGVYG SG SFAS PLP  DSLTSPENYGA+ T  DD  SSIQSGKKVVEE+TDVLAFWKLKE+HSWADFSLIVDIM+AV
Subjt:  PEPFPPMSTGNSSLPPCENLHGVYGCSGGSFASVPLPPADSLTSPENYGAEKTKPDDSISSIQSGKKVVEESTDVLAFWKLKELHSWADFSLIVDIMEAV

Query:  NNNFDEASTLLKTMVSSDNFEVSNEMSTLGLHSANDLLYNEKNDLSTSLGRTVNTPILSSTLKDVQGVHKNNNACEENCTKLFENNYFERNFFHNVANTN
        NNNFDEASTLLKTMVSSDNFE++NE+STLGLHSANDLL N  ND+S +  R +N PILSST+K VQG+H+NNN   E+ TKLF N+YFERN FHN  N+ 
Subjt:  NNNFDEASTLLKTMVSSDNFEVSNEMSTLGLHSANDLLYNEKNDLSTSLGRTVNTPILSSTLKDVQGVHKNNNACEENCTKLFENNYFERNFFHNVANTN

Query:  IALGSSKSVPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHTSAKYHSSRAQEQWQAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAA
        IALG SKSVPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAY RKDH SAKYHSSRA+EQW AAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAA
Subjt:  IALGSSKSVPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHTSAKYHSSRAQEQWQAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAA

Query:  EAIQALQEHLLKIETQNASNRSLSPKKAERKGFQRASSLEYLSCLEYLSCMDSKLDKESPSSRPRPTSLEVITGIGKHSKGEAALPKAVTSFLSENGYRF
        EA+QAL +HLLKIETQNASNRSLSPKKAERKGFQRASS      LEYLSCM+SKLDKESPSSR RPTSLEVITGIGKHSKGEAALPKAV SFL+ENGYRF
Subjt:  EAIQALQEHLLKIETQNASNRSLSPKKAERKGFQRASSLEYLSCLEYLSCMDSKLDKESPSSRPRPTSLEVITGIGKHSKGEAALPKAVTSFLSENGYRF

Query:  EQLRPGTISVRPKFR
        EQ RPGTISVRPKFR
Subjt:  EQLRPGTISVRPKFR

XP_008451240.1 PREDICTED: uncharacterized protein LOC103492590 [Cucumis melo] (e value: 2.6e-277; % identity: 81.95)
Query:  MSWVRGKSPGWAAFNLKEQNNGLQDEVDPDPFPPMSTTLSSLPPRENLHRVNGRSGRSFTFSPLPSTDSMTSPEKIG----------AKKTVLGASNIQN
        MSWVRGKS GWAAFNLK+QNNG+QDEVD DPFPPMSTTLSSLPPRENL  VNGRSGRSF+F+P+PS DS T P K G          AKKT+LGASNIQ+
Subjt:  MSWVRGKSPGWAAFNLKEQNNGLQDEVDPDPFPPMSTTLSSLPPRENLHRVNGRSGRSFTFSPLPSTDSMTSPEKIG----------AKKTVLGASNIQN

Query:  GKKVVEETADLLSVWKLKELHSWADISLIMDIMEAVNNNFNEASTLLKTMVSSDNFVINNEMSTLGLHFSNDLSWVRGKSPGGAEFNLKQHDRGLQDETD
        GKK+VEET D+LS WKLKELH WADISLIMDIMEAVNN+FNEASTLL TMVSSDN  INNEMS LGLH SNDLSW+ GKSPG  EFNL+QH+RGLQ E D
Subjt:  GKKVVEETADLLSVWKLKELHSWADISLIMDIMEAVNNNFNEASTLLKTMVSSDNFVINNEMSTLGLHFSNDLSWVRGKSPGGAEFNLKQHDRGLQDETD

Query:  PEPFPPMSTGNSSLPPCENLHGVYGCSGGSFASVPLPPADSLTSPENYGAEKTKPDDSISSIQSGKKVVEESTDVLAFWKLKELHSWADFSLIVDIMEAV
        PE FPPM T + SLPP ENLHGVYG  G SFAS PLP ADSLTSP NYGA+ T PDD  S IQSGKKVVEE+TDVLAFWKLKE+HSWADFSLIVDIM+AV
Subjt:  PEPFPPMSTGNSSLPPCENLHGVYGCSGGSFASVPLPPADSLTSPENYGAEKTKPDDSISSIQSGKKVVEESTDVLAFWKLKELHSWADFSLIVDIMEAV

Query:  NNNFDEASTLLKTMVSSDNFEVSNEMSTLGLHSANDLLYNEKNDLSTSLGRTVNTPILSSTLKDVQGVHKNNNACEENCTKLFENNYFERNFFHNVANTN
        NNNFDEASTLLKTMVSSDNFE++NE+STLGLHSANDLL N  ND+S S  RT+N PILS TLK  QG+H+N+N   E+CTKLF N+YFERNFF N  N+ 
Subjt:  NNNFDEASTLLKTMVSSDNFEVSNEMSTLGLHSANDLLYNEKNDLSTSLGRTVNTPILSSTLKDVQGVHKNNNACEENCTKLFENNYFERNFFHNVANTN

Query:  IALGSSKSVPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHTSAKYHSSRAQEQWQAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAA
        IALG SKSVPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAY RKDH SAKYHSSRAQEQW AAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAA
Subjt:  IALGSSKSVPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHTSAKYHSSRAQEQWQAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAA

Query:  EAIQALQEHLLKIETQNASNRSLSPKKAERKGFQRASSLEYLSCLEYLSCMDSKLDKESPSSRPRPTSLEVITGIGKHSKGEAALPKAVTSFLSENGYRF
        EA+QALQ+HLLKIETQNASNRSLSPKKAERKGFQRASS      LEYLSCMD+KLDKESPSSR RPTSLEVITGIGKHSKGEAALPKAVTSFL+ENGYRF
Subjt:  EAIQALQEHLLKIETQNASNRSLSPKKAERKGFQRASSLEYLSCLEYLSCMDSKLDKESPSSRPRPTSLEVITGIGKHSKGEAALPKAVTSFLSENGYRF

Query:  EQLRPGTISVRPKFR
        EQ RPGTISVRPKFR
Subjt:  EQLRPGTISVRPKFR

XP_038898473.1 uncharacterized protein LOC120086100 [Benincasa hispida] (e value: 2.3e-286; % identity: 84.07)
Query:  MSWVRGKSPGWAAFNLKEQNNGLQDEVDPDPFPPMSTTLSSLPPRENLHRVNGRSGRSFTFSPLPSTDSMTSPEK----------IGAKKTVLGASNIQN
        MSWV+GKSPGWAAFNLK+QNNGLQDEVD DPFPP+STTLSSLPP EN H VNGRSGRSF+F+P PS +S+TSPEK          IGAKKT+L  SN+QN
Subjt:  MSWVRGKSPGWAAFNLKEQNNGLQDEVDPDPFPPMSTTLSSLPPRENLHRVNGRSGRSFTFSPLPSTDSMTSPEK----------IGAKKTVLGASNIQN

Query:  GKKVVEETADLLSVWKLKELHSWADISLIMDIMEAVNNNFNEASTLLKTMVSSDNFVINNEMSTLGLHFSNDLSWVRGKSPGGAEFNLKQHDRGLQDETD
        GKKVVEETAD+LS WKLKELHSWADISLIMD+MEAVNNNF+EASTLLKTMV+SDNF INNEMSTLGL +SNDLSWV G  PG  EFNLKQH+RGLQDETD
Subjt:  GKKVVEETADLLSVWKLKELHSWADISLIMDIMEAVNNNFNEASTLLKTMVSSDNFVINNEMSTLGLHFSNDLSWVRGKSPGGAEFNLKQHDRGLQDETD

Query:  PEPFPPMSTGNSSLPPCENLHGVYGCSGGSFASVPLPPADSLTSPENYGAEKTKPDDSISSIQSGKKVVEESTDVLAFWKLKELHSWADFSLIVDIMEAV
         EP PPM TG+SSLPPCE+LH VYGCSG SF+SVP   ADSLTSPENYGA+KT PDD  SSIQSGKKVVEES D LAFWKLKELHSWADFSLIVDIMEAV
Subjt:  PEPFPPMSTGNSSLPPCENLHGVYGCSGGSFASVPLPPADSLTSPENYGAEKTKPDDSISSIQSGKKVVEESTDVLAFWKLKELHSWADFSLIVDIMEAV

Query:  NNNFDEASTLLKTMVSSDNFEVSNEMSTLGLHSANDLLYNEKNDLSTSLGRTVNTPILSSTLKDVQGVHKNNNACEENCTKLFENNYFERNFFHNVANTN
        NNNF+EASTLLKTMVSSDNF++++EMSTL L SANDLL N KND+STSL RT N PI SSTLKDVQGVH+NNNACEEN TKLFENNYFERNFFHN     
Subjt:  NNNFDEASTLLKTMVSSDNFEVSNEMSTLGLHSANDLLYNEKNDLSTSLGRTVNTPILSSTLKDVQGVHKNNNACEENCTKLFENNYFERNFFHNVANTN

Query:  IALGSSKSVPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHTSAKYHSSRAQEQWQAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAA
        I LG SKSVPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDH SAKYHSSRAQEQW AAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAA
Subjt:  IALGSSKSVPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHTSAKYHSSRAQEQWQAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAA

Query:  EAIQALQEHLLKIETQNASNRSLSPKKAERKGFQRASSLEYLSCLEYLSCMDSKLDKESPSSRPRPTSLEVITGIGKHSKGEAALPKAVTSFLSENGYRF
        EA+QALQEHLLKIET+NASNRSLSPKK+ERKGFQ ASS      LEYLSCMDSK+DKESPSSR RPTSLEVITGIGKHS+GEA LPKAVTSFLSENGYRF
Subjt:  EAIQALQEHLLKIETQNASNRSLSPKKAERKGFQRASSLEYLSCLEYLSCMDSKLDKESPSSRPRPTSLEVITGIGKHSKGEAALPKAVTSFLSENGYRF

Query:  EQLRPGTISVRPKFR
        EQLRPGTIS+RPKFR
Subjt:  EQLRPGTISVRPKFR

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KA90 Smr domain-containing protein (e value: 1.7e-274; % identity: 80.81)
Query:  MSWVRGKSPGWAAFNLKEQNNGLQDEVDPDPFPPMSTTLSSLPPRENLHRVNGRSGRSFTFSPLPSTDSMTSPEK----------IGAKKTVLGASNIQN
        MSWVRGKS GWAAFNLK+QNNGLQDEVD DPFPPMSTTLSSLPPRENL  VNG SG+SF+ +P+PS DS T P K           GAKKT+LG +NIQ+
Subjt:  MSWVRGKSPGWAAFNLKEQNNGLQDEVDPDPFPPMSTTLSSLPPRENLHRVNGRSGRSFTFSPLPSTDSMTSPEK----------IGAKKTVLGASNIQN

Query:  GKKVVEETADLLSVWKLKELHSWADISLIMDIMEAVNNNFNEASTLLKTMVSSDNFVINNEMSTLGLHFSNDLSWVRGKSPGGAEFNLKQHDRGLQDETD
        GKK+VEET D+LS WKLKELH WADISLIMDIMEAVNN+FNEASTLL TMVSSDN  INN+MSTLGLH SNDL W+ GKSPG  EFNLKQH++GLQDE D
Subjt:  GKKVVEETADLLSVWKLKELHSWADISLIMDIMEAVNNNFNEASTLLKTMVSSDNFVINNEMSTLGLHFSNDLSWVRGKSPGGAEFNLKQHDRGLQDETD

Query:  PEPFPPMSTGNSSLPPCENLHGVYGCSGGSFASVPLPPADSLTSPENYGAEKTKPDDSISSIQSGKKVVEESTDVLAFWKLKELHSWADFSLIVDIMEAV
         E FPPM T  SSLPP ENLHGVYG SG SFAS PLP  DSLTSPENYGA+ T  DD  SSIQSGKKVVEE+TDVLAFWKLKE+HSWADFSLIVDIM+AV
Subjt:  PEPFPPMSTGNSSLPPCENLHGVYGCSGGSFASVPLPPADSLTSPENYGAEKTKPDDSISSIQSGKKVVEESTDVLAFWKLKELHSWADFSLIVDIMEAV

Query:  NNNFDEASTLLKTMVSSDNFEVSNEMSTLGLHSANDLLYNEKNDLSTSLGRTVNTPILSSTLKDVQGVHKNNNACEENCTKLFENNYFERNFFHNVANTN
        NNNFDEASTLLKTMVSSDNFE++NE+STLGLHSANDLL N  ND+S +  R +N PILSST+K VQG+H+NNN   E+ TKLF N+YFERN FHN  N+ 
Subjt:  NNNFDEASTLLKTMVSSDNFEVSNEMSTLGLHSANDLLYNEKNDLSTSLGRTVNTPILSSTLKDVQGVHKNNNACEENCTKLFENNYFERNFFHNVANTN

Query:  IALGSSKSVPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHTSAKYHSSRAQEQWQAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAA
        IALG SKSVPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAY RKDH SAKYHSSRA+EQW AAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAA
Subjt:  IALGSSKSVPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHTSAKYHSSRAQEQWQAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAA

Query:  EAIQALQEHLLKIETQNASNRSLSPKKAERKGFQRASSLEYLSCLEYLSCMDSKLDKESPSSRPRPTSLEVITGIGKHSKGEAALPKAVTSFLSENGYRF
        EA+QAL +HLLKIETQNASNRSLSPKKAERKGFQRASS      LEYLSCM+SKLDKESPSSR RPTSLEVITGIGKHSKGEAALPKAV SFL+ENGYRF
Subjt:  EAIQALQEHLLKIETQNASNRSLSPKKAERKGFQRASSLEYLSCLEYLSCMDSKLDKESPSSRPRPTSLEVITGIGKHSKGEAALPKAVTSFLSENGYRF

Query:  EQLRPGTISVRPKFR
        EQ RPGTISVRPKFR
Subjt:  EQLRPGTISVRPKFR

A0A1S3BRS7 uncharacterized protein LOC103492590 (e value: 1.2e-277; % identity: 81.95)
Query:  MSWVRGKSPGWAAFNLKEQNNGLQDEVDPDPFPPMSTTLSSLPPRENLHRVNGRSGRSFTFSPLPSTDSMTSPEKIG----------AKKTVLGASNIQN
        MSWVRGKS GWAAFNLK+QNNG+QDEVD DPFPPMSTTLSSLPPRENL  VNGRSGRSF+F+P+PS DS T P K G          AKKT+LGASNIQ+
Subjt:  MSWVRGKSPGWAAFNLKEQNNGLQDEVDPDPFPPMSTTLSSLPPRENLHRVNGRSGRSFTFSPLPSTDSMTSPEKIG----------AKKTVLGASNIQN

Query:  GKKVVEETADLLSVWKLKELHSWADISLIMDIMEAVNNNFNEASTLLKTMVSSDNFVINNEMSTLGLHFSNDLSWVRGKSPGGAEFNLKQHDRGLQDETD
        GKK+VEET D+LS WKLKELH WADISLIMDIMEAVNN+FNEASTLL TMVSSDN  INNEMS LGLH SNDLSW+ GKSPG  EFNL+QH+RGLQ E D
Subjt:  GKKVVEETADLLSVWKLKELHSWADISLIMDIMEAVNNNFNEASTLLKTMVSSDNFVINNEMSTLGLHFSNDLSWVRGKSPGGAEFNLKQHDRGLQDETD

Query:  PEPFPPMSTGNSSLPPCENLHGVYGCSGGSFASVPLPPADSLTSPENYGAEKTKPDDSISSIQSGKKVVEESTDVLAFWKLKELHSWADFSLIVDIMEAV
        PE FPPM T + SLPP ENLHGVYG  G SFAS PLP ADSLTSP NYGA+ T PDD  S IQSGKKVVEE+TDVLAFWKLKE+HSWADFSLIVDIM+AV
Subjt:  PEPFPPMSTGNSSLPPCENLHGVYGCSGGSFASVPLPPADSLTSPENYGAEKTKPDDSISSIQSGKKVVEESTDVLAFWKLKELHSWADFSLIVDIMEAV

Query:  NNNFDEASTLLKTMVSSDNFEVSNEMSTLGLHSANDLLYNEKNDLSTSLGRTVNTPILSSTLKDVQGVHKNNNACEENCTKLFENNYFERNFFHNVANTN
        NNNFDEASTLLKTMVSSDNFE++NE+STLGLHSANDLL N  ND+S S  RT+N PILS TLK  QG+H+N+N   E+CTKLF N+YFERNFF N  N+ 
Subjt:  NNNFDEASTLLKTMVSSDNFEVSNEMSTLGLHSANDLLYNEKNDLSTSLGRTVNTPILSSTLKDVQGVHKNNNACEENCTKLFENNYFERNFFHNVANTN

Query:  IALGSSKSVPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHTSAKYHSSRAQEQWQAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAA
        IALG SKSVPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAY RKDH SAKYHSSRAQEQW AAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAA
Subjt:  IALGSSKSVPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHTSAKYHSSRAQEQWQAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAA

Query:  EAIQALQEHLLKIETQNASNRSLSPKKAERKGFQRASSLEYLSCLEYLSCMDSKLDKESPSSRPRPTSLEVITGIGKHSKGEAALPKAVTSFLSENGYRF
        EA+QALQ+HLLKIETQNASNRSLSPKKAERKGFQRASS      LEYLSCMD+KLDKESPSSR RPTSLEVITGIGKHSKGEAALPKAVTSFL+ENGYRF
Subjt:  EAIQALQEHLLKIETQNASNRSLSPKKAERKGFQRASSLEYLSCLEYLSCMDSKLDKESPSSRPRPTSLEVITGIGKHSKGEAALPKAVTSFLSENGYRF

Query:  EQLRPGTISVRPKFR
        EQ RPGTISVRPKFR
Subjt:  EQLRPGTISVRPKFR

A0A5D3CAF0 Smr (Small MutS Related) domain-containing protein, putative isoform 1 (e value: 3.6e-277; % identity: 81.79)
Query:  MSWVRGKSPGWAAFNLKEQNNGLQDEVDPDPFPPMSTTLSSLPPRENLHRVNGRSGRSFTFSPLPSTDSMTSPEKIG----------AKKTVLGASNIQN
        MSWVRGKS GWAAFNLK+QNNG+QDEVD DPFPPMSTTLSSLPPRENL  VNGRSGRSF+F+P+PS DS T P K G          AKKT+LGASNIQ+
Subjt:  MSWVRGKSPGWAAFNLKEQNNGLQDEVDPDPFPPMSTTLSSLPPRENLHRVNGRSGRSFTFSPLPSTDSMTSPEKIG----------AKKTVLGASNIQN

Query:  GKKVVEETADLLSVWKLKELHSWADISLIMDIMEAVNNNFNEASTLLKTMVSSDNFVINNEMSTLGLHFSNDLSWVRGKSPGGAEFNLKQHDRGLQDETD
        GKK+VEET D+LS WKLKELH WADISLIMDIMEAVNN+FNEASTLL TMVSSDN  INNEMS LGLH SNDLSW+ GKSPG  EFNL+QH+RGLQ E D
Subjt:  GKKVVEETADLLSVWKLKELHSWADISLIMDIMEAVNNNFNEASTLLKTMVSSDNFVINNEMSTLGLHFSNDLSWVRGKSPGGAEFNLKQHDRGLQDETD

Query:  PEPFPPMSTGNSSLPPCENLHGVYGCSGGSFASVPLPPADSLTSPENYGAEKTKPDDSISSIQSGKKVVEESTDVLAFWKLKELHSWADFSLIVDIMEAV
        PE FPPM T + SLPP ENLHGVYG  G SFAS PLP ADSLTSP NYGA+ T PDD  S IQSGKKVVEE+TDVLAFWKLKE+HSWADFSLIVDIM+AV
Subjt:  PEPFPPMSTGNSSLPPCENLHGVYGCSGGSFASVPLPPADSLTSPENYGAEKTKPDDSISSIQSGKKVVEESTDVLAFWKLKELHSWADFSLIVDIMEAV

Query:  NNNFDEASTLLKTMVSSDNFEVSNEMSTLGLHSANDLLYNEKNDLSTSLGRTVNTPILSSTLKDVQGVHKNNNACEENCTKLFENNYFERNFFHNVANTN
        NNNFDEASTLLKTMVSSDNFE++NE+STLGLH ANDLL N  ND+S S  RT+N PILS TLK  QG+H+N+N   E+CTKLF N+YFERNFF N  N+ 
Subjt:  NNNFDEASTLLKTMVSSDNFEVSNEMSTLGLHSANDLLYNEKNDLSTSLGRTVNTPILSSTLKDVQGVHKNNNACEENCTKLFENNYFERNFFHNVANTN

Query:  IALGSSKSVPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHTSAKYHSSRAQEQWQAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAA
        IALG SKSVPIEPEWEEDD+YLSHRKDAIAMMRSASQHSRAATNAY RKDH SAKYHSSRAQEQW AAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAA
Subjt:  IALGSSKSVPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHTSAKYHSSRAQEQWQAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAA

Query:  EAIQALQEHLLKIETQNASNRSLSPKKAERKGFQRASSLEYLSCLEYLSCMDSKLDKESPSSRPRPTSLEVITGIGKHSKGEAALPKAVTSFLSENGYRF
        EA+QALQ+HLLKIETQNASNRSLSPKKAERKGFQRASS      LEYLSCMDSKLDKESPSSR RPTSLEVITGIGKHSKGEAALPKAVTSFL+ENGYRF
Subjt:  EAIQALQEHLLKIETQNASNRSLSPKKAERKGFQRASSLEYLSCLEYLSCMDSKLDKESPSSRPRPTSLEVITGIGKHSKGEAALPKAVTSFLSENGYRF

Query:  EQLRPGTISVRPKFR
        EQ RPGTISVRPKFR
Subjt:  EQLRPGTISVRPKFR

A0A6J1GN51 uncharacterized protein LOC111455928 isoform X1 (e value: 2.3e-260; % identity: 78.91)
Query:  MSWVRGKSPGWAAFNLKEQNNGLQDEVDPDPFPPMSTTLSSLPPRENLHRVNGRSGRSFTFSPLPSTDSMTSPEKIGAKKTVLGASNIQNGKKVVEETAD
        MSW RGKSPGWAA NLK+QN+GLQDE+DPDPFPPMST LS LPPREN+HRVNGRSGRSF+ +PLPS DS+ SPE  G KKT+ G S+I++GKK+VEE+ D
Subjt:  MSWVRGKSPGWAAFNLKEQNNGLQDEVDPDPFPPMSTTLSSLPPRENLHRVNGRSGRSFTFSPLPSTDSMTSPEKIGAKKTVLGASNIQNGKKVVEETAD

Query:  LLSVWKLKELHSWADISLIMDIMEAVNNNFNEASTLLKTMVSSDNFVINNEMSTLGLHFSNDLSWVRGKSPGGAEFNLKQHDRGLQDETDPEPFPPMSTG
        +L+ WKLKELHSWADISLI+DIMEAVNNNFNEAS LLKTMVSSDNF INNEMSTLGLH SND+S VRGKSPG  EFNLKQ +RGLQD  DP+PFPPM + 
Subjt:  LLSVWKLKELHSWADISLIMDIMEAVNNNFNEASTLLKTMVSSDNFVINNEMSTLGLHFSNDLSWVRGKSPGGAEFNLKQHDRGLQDETDPEPFPPMSTG

Query:  NSSLPPCENLHGVYGCSGGSFASVPLPPADSLTSPENYGAEKTKPDDSISSIQSGKKVVEESTDVLAFWKLKELHSWADFSLIVDIMEAVNNNFDEASTL
         SSLPP ENLHGV G  G S +S PLP ADSLT PENY A+K   D   SSIQ+G+KVVEE+TDVLAFWKLKELH+WADFSLIVDIMEAV+NNF+EAST 
Subjt:  NSSLPPCENLHGVYGCSGGSFASVPLPPADSLTSPENYGAEKTKPDDSISSIQSGKKVVEESTDVLAFWKLKELHSWADFSLIVDIMEAVNNNFDEASTL

Query:  LKTMVSSDNFEVSNEMSTLGLHSANDLLYNEKNDLSTSLGRTVNTPILSSTLKDVQGVHKNNNACEENCTKLFENNYFERNFFHNVANTNIALGSSKSVP
        L  MVSSDN E+ NEMSTLGLHSA+ L  N KND++ SLGRTVN PI SSTLKDVQ +H+N N       KLFENNY ERNFFHNV N  IAL  SKS P
Subjt:  LKTMVSSDNFEVSNEMSTLGLHSANDLLYNEKNDLSTSLGRTVNTPILSSTLKDVQGVHKNNNACEENCTKLFENNYFERNFFHNVANTNIALGSSKSVP

Query:  IEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHTSAKYHSSRAQEQWQAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAAEAIQALQEHL
        IEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDH SAKYHSSRAQEQW AAKMLN KAANEILQTRNS+NGLWKLDLHGLHAAEA+QALQ+HL
Subjt:  IEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHTSAKYHSSRAQEQWQAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAAEAIQALQEHL

Query:  LKIETQNASNRSLSPKKAERKGFQRASSLEYLSCLEYLSCMDSKLDKE--SPSSRPRPTSLEVITGIGKHSKGEAALPKAVTSFLSENGYRFEQLRPGTI
        LKIET+NASNRSLSPKKAERKGF R SS      LEYLSCM  KLDKE  SP  R RPTSLEVITG+GKHS+GEAALPKAVTSFLSENGYRFEQLRPGTI
Subjt:  LKIETQNASNRSLSPKKAERKGFQRASSLEYLSCLEYLSCMDSKLDKE--SPSSRPRPTSLEVITGIGKHSKGEAALPKAVTSFLSENGYRFEQLRPGTI

Query:  SVRPKFR
        SVRPKFR
Subjt:  SVRPKFR

A0A6J1GPE7 uncharacterized protein LOC111455928 isoform X2 (e value: 3.6e-253; % identity: 77.76)
Query:  MSWVRGKSPGWAAFNLKEQNNGLQDEVDPDPFPPMSTTLSSLPPRENLHRVNGRSGRSFTFSPLPSTDSMTSPEKIGAKKTVLGASNIQNGKKVVEETAD
        MSW RGKSPGWAA NLK+QN+GLQDE+DPDPFPPMST LS LPPREN+HRVNGRSGRS             SPE  G KKT+ G S+I++GKK+VEE+ D
Subjt:  MSWVRGKSPGWAAFNLKEQNNGLQDEVDPDPFPPMSTTLSSLPPRENLHRVNGRSGRSFTFSPLPSTDSMTSPEKIGAKKTVLGASNIQNGKKVVEETAD

Query:  LLSVWKLKELHSWADISLIMDIMEAVNNNFNEASTLLKTMVSSDNFVINNEMSTLGLHFSNDLSWVRGKSPGGAEFNLKQHDRGLQDETDPEPFPPMSTG
        +L+ WKLKELHSWADISLI+DIMEAVNNNFNEAS LLKTMVSSDNF INNEMSTLGLH SND+S VRGKSPG  EFNLKQ +RGLQD  DP+PFPPM + 
Subjt:  LLSVWKLKELHSWADISLIMDIMEAVNNNFNEASTLLKTMVSSDNFVINNEMSTLGLHFSNDLSWVRGKSPGGAEFNLKQHDRGLQDETDPEPFPPMSTG

Query:  NSSLPPCENLHGVYGCSGGSFASVPLPPADSLTSPENYGAEKTKPDDSISSIQSGKKVVEESTDVLAFWKLKELHSWADFSLIVDIMEAVNNNFDEASTL
         SSLPP ENLHGV G  G S +S PLP ADSLT PENY A+K   D   SSIQ+G+KVVEE+TDVLAFWKLKELH+WADFSLIVDIMEAV+NNF+EAST 
Subjt:  NSSLPPCENLHGVYGCSGGSFASVPLPPADSLTSPENYGAEKTKPDDSISSIQSGKKVVEESTDVLAFWKLKELHSWADFSLIVDIMEAVNNNFDEASTL

Query:  LKTMVSSDNFEVSNEMSTLGLHSANDLLYNEKNDLSTSLGRTVNTPILSSTLKDVQGVHKNNNACEENCTKLFENNYFERNFFHNVANTNIALGSSKSVP
        L  MVSSDN E+ NEMSTLGLHSA+ L  N KND++ SLGRTVN PI SSTLKDVQ +H+N N       KLFENNY ERNFFHNV N  IAL  SKS P
Subjt:  LKTMVSSDNFEVSNEMSTLGLHSANDLLYNEKNDLSTSLGRTVNTPILSSTLKDVQGVHKNNNACEENCTKLFENNYFERNFFHNVANTNIALGSSKSVP

Query:  IEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHTSAKYHSSRAQEQWQAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAAEAIQALQEHL
        IEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDH SAKYHSSRAQEQW AAKMLN KAANEILQTRNS+NGLWKLDLHGLHAAEA+QALQ+HL
Subjt:  IEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHTSAKYHSSRAQEQWQAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAAEAIQALQEHL

Query:  LKIETQNASNRSLSPKKAERKGFQRASSLEYLSCLEYLSCMDSKLDKE--SPSSRPRPTSLEVITGIGKHSKGEAALPKAVTSFLSENGYRFEQLRPGTI
        LKIET+NASNRSLSPKKAERKGF R SS      LEYLSCM  KLDKE  SP  R RPTSLEVITG+GKHS+GEAALPKAVTSFLSENGYRFEQLRPGTI
Subjt:  LKIETQNASNRSLSPKKAERKGFQRASSLEYLSCLEYLSCMDSKLDKE--SPSSRPRPTSLEVITGIGKHSKGEAALPKAVTSFLSENGYRFEQLRPGTI

Query:  SVRPKFR
        SVRPKFR
Subjt:  SVRPKFR

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT5G23520.1 smr (Small MutS Related) domain-containing protein (e value: 5.1e-74; % identity: 41.36)
Query:  LSWVRGKSPGGAEFNLKQHDR-GLQDETDPEPFPPMSTG-NSSLPPCENLHGVYGCSGGSFASVPLPPA--DSLT------SPENYGAEKTKPDDSISSI
        +SW++GKS G   F+LKQ  + GL+ E + +PFPP+ST  N+S      L   +  S  SF+SV LPP+   +LT      + E  G  + KPD     +
Subjt:  LSWVRGKSPGGAEFNLKQHDR-GLQDETDPEPFPPMSTG-NSSLPPCENLHGVYGCSGGSFASVPLPPA--DSLT------SPENYGAEKTKPDDSISSI

Query:  QSGKKVVEESTDVLAFWKLKELHSWADFSLIVDIMEAVNNNFDEASTLLKTMVSSDNFEVSNEMSTLGLHSANDLLYNEKNDLSTSLGRTVNTPI---LS
         S           LAF KLKE++SWAD +LI D++ +  ++F+ A   LK MVSS   E     S +  +S++    N +++  T   +TV + +     
Subjt:  QSGKKVVEESTDVLAFWKLKELHSWADFSLIVDIMEAVNNNFDEASTLLKTMVSSDNFEVSNEMSTLGLHSANDLLYNEKNDLSTSLGRTVNTPI---LS

Query:  STLKDVQGVHKNNNACEENCTKLFENNYFERNFFHNVANTNIALGSSKSVPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHTSAKYHSS
        ST +D       N+          +N  F      +++  +  +   +S+PIEPEWEEDD+YLSHRKDA+ +MRSAS HSRAA NA+ R DH SAK HS 
Subjt:  STLKDVQGVHKNNNACEENCTKLFENNYFERNFFHNVANTNIALGSSKSVPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHTSAKYHSS

Query:  RAQEQWQAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAAEAIQALQEHLLKIETQNASNRSLSPKKAERKGFQ-RASSLEYLSCLEYLSCMDSKLDKE
        +A+E W AA+ LN +AA +I+   N  N +WKLDLHGLHA EA+QALQE L  IE     NRS+SP +   K    R++S E    L+     +  +  +
Subjt:  RAQEQWQAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAAEAIQALQEHLLKIETQNASNRSLSPKKAERKGFQ-RASSLEYLSCLEYLSCMDSKLDKE

Query:  SPSSRPRPTSLEVITGIGKHSKGEAALPKAVTSFLSENGYRFEQLRPGTISVRPKFR
          SSR    SL+VITGIGKHS+G+A+LP AV +F  +N YRF++ RPG I+VRPKFR
Subjt:  SPSSRPRPTSLEVITGIGKHSKGEAALPKAVTSFLSENGYRFEQLRPGTISVRPKFR


Sequences
CDS sequence
ATGTCGTGGGTGAGGGGTAAATCTCCTGGTTGGGCAGCTTTTAACCTTAAGGAACAGAACAATGGACTTCAAGATGAAGTTGACCCGGATCCATTCCCACCAATGTCAAC
CACCCTCTCCTCTCTGCCACCCCGTGAAAATTTGCATCGAGTTAATGGTCGTTCGGGGAGATCTTTCACATTTTCTCCCCTTCCTTCTACTGATTCTATGACTTCACCAG
AAAAAATTGGTGCAAAAAAAACAGTACTTGGTGCTTCTAACATTCAAAATGGCAAGAAGGTGGTTGAAGAAACTGCTGACCTTTTATCCGTTTGGAAGCTTAAAGAGCTT
CATTCCTGGGCTGATATTAGCTTGATTATGGATATAATGGAAGCTGTAAATAATAACTTCAATGAGGCATCTACTTTATTAAAAACTATGGTTTCTAGCGACAATTTTGT
GATCAATAATGAGATGAGCACCTTAGGGCTGCATTTCTCTAATGATCTATCCTGGGTGAGGGGTAAATCTCCTGGTGGTGCTGAATTTAACCTTAAGCAACATGATAGAG
GCCTTCAAGATGAAACTGACCCGGAACCATTCCCACCAATGTCAACCGGCAATTCCTCTCTGCCACCCTGTGAAAACTTGCATGGAGTTTATGGATGTTCAGGGGGATCC
TTCGCATCTGTACCCCTTCCTCCTGCTGATTCTCTAACTTCTCCGGAAAATTATGGTGCAGAGAAGACAAAACCTGATGATTCTATTTCTAGCATTCAAAGTGGCAAGAA
GGTGGTTGAAGAAAGCACTGATGTTTTAGCCTTTTGGAAGCTTAAAGAGCTTCATTCTTGGGCTGATTTTAGCTTAATTGTGGATATAATGGAAGCTGTCAATAATAACT
TCGATGAGGCATCTACTTTATTAAAGACAATGGTTTCAAGTGACAATTTTGAGGTCAGTAATGAGATGAGCACCTTAGGACTGCATTCCGCTAATGATTTATTATACAAC
GAGAAGAATGATTTAAGTACATCATTAGGAAGAACGGTCAATACTCCCATCCTTAGTTCCACACTAAAGGATGTGCAAGGCGTGCATAAAAATAATAATGCATGTGAAGA
AAATTGTACCAAATTGTTTGAAAATAATTATTTTGAAAGAAATTTCTTTCATAATGTTGCAAATACAAACATAGCTCTAGGTAGCTCAAAGTCTGTTCCTATTGAGCCTG
AGTGGGAAGAAGATGATATTTACCTGAGCCATCGGAAAGATGCTATAGCAATGATGAGGTCTGCATCTCAACATTCAAGGGCAGCCACTAATGCCTATCTTAGGAAAGAT
CATACTTCTGCCAAGTATCATTCATCAAGAGCTCAAGAACAATGGCAAGCTGCAAAAATGTTAAATGATAAGGCGGCTAATGAAATTTTACAAACAAGGAATAGTAAAAA
TGGGCTGTGGAAGTTGGACTTACATGGGCTTCACGCAGCAGAGGCTATTCAAGCCTTGCAAGAACACTTGCTGAAAATTGAAACTCAGAATGCCTCCAATCGATCGTTGT
CGCCAAAGAAAGCTGAAAGGAAAGGATTTCAACGTGCTTCATCCCTTGAGTATCTTAGTTGCCTTGAGTATCTTAGTTGTATGGACTCAAAGTTGGACAAAGAATCACCA
TCATCTAGGCCTAGGCCGACATCATTGGAAGTCATAACAGGTATAGGTAAACATAGTAAAGGGGAAGCTGCTCTACCAAAGGCTGTGACAAGTTTTCTTAGTGAAAATGG
GTACCGTTTCGAACAGTTAAGGCCTGGGACAATCAGCGTTCGACCAAAGTTTCGTAGCCGTCTCATTCATACAATTCTTTCCGATTCGAAACCCGAAAAGAAAAAAAAGG
AAAGAAAAGGTGACCTGAATTGA
mRNA sequence
GTTGGATTAAGATGCCATTTTAACCTAATAGCAAGAAACACACGCAAAAGGGAAACAGCCGCTTGCAAATAGAGAGAAAATCTTGGTTTTCCTGCAGCAGATTAAACTTG
GTGAAGTAAGAATTTTGATAGAAAAGCATTTGAATCTCCTTATCCATTTGATCTTTTGAAGGAATTCGTAATCCAAGTCAACGATCAGAGTATGTGGGGTAGGGTTGAAG
TTGAATTTTGCCCTCACAAGATTTTCCGATTTCTCTCTTTCCTTTTGCGCGCTGGAATTCGATCTATGCCGCAAACTTTCCAGCAAAATTTCTCTACTTTCTATCTCCCA
ACTGCATTTCCGTTTCCAAAATCGGTTCCGAAGAGGCACAAACGGCGGACTTGGGGATTCGACATAGTGAGAAGACTCGGTTAGGGGATTTGGGAAGTCAAATCTTTTCA
ACTTTTTCAGTTTGAACCCTAATTTTCTTGTCGAAACTGGAAGGTAATGAACGGGTCTGTTTCAACCAGTTGTGCTACCTCTTCGAGCTGCGGTCTAATTGGAATGGTGG
ATCGGAACAACAGGACATATTCATAGTAGGAGGGGTTTGAGTTAGACCCGGAGTGTAAAATTATAATTTACTGGTGCTCAACTCTATATACTTTATTTTCAGTTCTTTGC
ATTGGAAGATGTCGTGGGTGAGGGGTAAATCTCCTGGTTGGGCAGCTTTTAACCTTAAGGAACAGAACAATGGACTTCAAGATGAAGTTGACCCGGATCCATTCCCACCA
ATGTCAACCACCCTCTCCTCTCTGCCACCCCGTGAAAATTTGCATCGAGTTAATGGTCGTTCGGGGAGATCTTTCACATTTTCTCCCCTTCCTTCTACTGATTCTATGAC
TTCACCAGAAAAAATTGGTGCAAAAAAAACAGTACTTGGTGCTTCTAACATTCAAAATGGCAAGAAGGTGGTTGAAGAAACTGCTGACCTTTTATCCGTTTGGAAGCTTA
AAGAGCTTCATTCCTGGGCTGATATTAGCTTGATTATGGATATAATGGAAGCTGTAAATAATAACTTCAATGAGGCATCTACTTTATTAAAAACTATGGTTTCTAGCGAC
AATTTTGTGATCAATAATGAGATGAGCACCTTAGGGCTGCATTTCTCTAATGATCTATCCTGGGTGAGGGGTAAATCTCCTGGTGGTGCTGAATTTAACCTTAAGCAACA
TGATAGAGGCCTTCAAGATGAAACTGACCCGGAACCATTCCCACCAATGTCAACCGGCAATTCCTCTCTGCCACCCTGTGAAAACTTGCATGGAGTTTATGGATGTTCAG
GGGGATCCTTCGCATCTGTACCCCTTCCTCCTGCTGATTCTCTAACTTCTCCGGAAAATTATGGTGCAGAGAAGACAAAACCTGATGATTCTATTTCTAGCATTCAAAGT
GGCAAGAAGGTGGTTGAAGAAAGCACTGATGTTTTAGCCTTTTGGAAGCTTAAAGAGCTTCATTCTTGGGCTGATTTTAGCTTAATTGTGGATATAATGGAAGCTGTCAA
TAATAACTTCGATGAGGCATCTACTTTATTAAAGACAATGGTTTCAAGTGACAATTTTGAGGTCAGTAATGAGATGAGCACCTTAGGACTGCATTCCGCTAATGATTTAT
TATACAACGAGAAGAATGATTTAAGTACATCATTAGGAAGAACGGTCAATACTCCCATCCTTAGTTCCACACTAAAGGATGTGCAAGGCGTGCATAAAAATAATAATGCA
TGTGAAGAAAATTGTACCAAATTGTTTGAAAATAATTATTTTGAAAGAAATTTCTTTCATAATGTTGCAAATACAAACATAGCTCTAGGTAGCTCAAAGTCTGTTCCTAT
TGAGCCTGAGTGGGAAGAAGATGATATTTACCTGAGCCATCGGAAAGATGCTATAGCAATGATGAGGTCTGCATCTCAACATTCAAGGGCAGCCACTAATGCCTATCTTA
GGAAAGATCATACTTCTGCCAAGTATCATTCATCAAGAGCTCAAGAACAATGGCAAGCTGCAAAAATGTTAAATGATAAGGCGGCTAATGAAATTTTACAAACAAGGAAT
AGTAAAAATGGGCTGTGGAAGTTGGACTTACATGGGCTTCACGCAGCAGAGGCTATTCAAGCCTTGCAAGAACACTTGCTGAAAATTGAAACTCAGAATGCCTCCAATCG
ATCGTTGTCGCCAAAGAAAGCTGAAAGGAAAGGATTTCAACGTGCTTCATCCCTTGAGTATCTTAGTTGCCTTGAGTATCTTAGTTGTATGGACTCAAAGTTGGACAAAG
AATCACCATCATCTAGGCCTAGGCCGACATCATTGGAAGTCATAACAGGTATAGGTAAACATAGTAAAGGGGAAGCTGCTCTACCAAAGGCTGTGACAAGTTTTCTTAGT
GAAAATGGGTACCGTTTCGAACAGTTAAGGCCTGGGACAATCAGCGTTCGACCAAAGTTTCGTAGCCGTCTCATTCATACAATTCTTTCCGATTCGAAACCCGAAAAGAA
AAAAAAGGAAAGAAAAGGTGACCTGAATTGAGTAGCTTGGTGGCCAAATAAGGAGTCAGTAAAACCATGAGCAATAGCAGAAGCAGAGAAGTTTGGGCCTAAAACCTTGA
AGCTCTCCAGGGATGGTGACACCACAAATCTAGGGGTGTAAGAGGAAGCTCCCACTTTCAGTTTCCGCTTCGTCTCGTCGGGAAAAGTAAACAGTCGTCGGTCGGCGTCG
GAGATGAGTTTTCTGAAAAACTCCTTAGGAACGCCATGGTTTCGAATGAAGAAGAATCCCCATTCCTTGCAAGCTCTGCGTAGGGAACATATAGCTGATTCGCTAAAGCT
ATGAGCATCAATGTCCATTACTGGTATTTCATCTATCCATTCTTTTGATGACATTTTCTTTTATTGTAAACAAATACTATGATTATTGCTTTTGGAATTTCATTGATCAC
TTGTGTTGTTCCTTATATA
Protein sequence
MSWVRGKSPGWAAFNLKEQNNGLQDEVDPDPFPPMSTTLSSLPPRENLHRVNGRSGRSFTFSPLPSTDSMTSPEKIGAKKTVLGASNIQNGKKVVEETADLLSVWKLKEL
HSWADISLIMDIMEAVNNNFNEASTLLKTMVSSDNFVINNEMSTLGLHFSNDLSWVRGKSPGGAEFNLKQHDRGLQDETDPEPFPPMSTGNSSLPPCENLHGVYGCSGGS
FASVPLPPADSLTSPENYGAEKTKPDDSISSIQSGKKVVEESTDVLAFWKLKELHSWADFSLIVDIMEAVNNNFDEASTLLKTMVSSDNFEVSNEMSTLGLHSANDLLYN
EKNDLSTSLGRTVNTPILSSTLKDVQGVHKNNNACEENCTKLFENNYFERNFFHNVANTNIALGSSKSVPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKD
HTSAKYHSSRAQEQWQAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAAEAIQALQEHLLKIETQNASNRSLSPKKAERKGFQRASSLEYLSCLEYLSCMDSKLDKESP
SSRPRPTSLEVITGIGKHSKGEAALPKAVTSFLSENGYRFEQLRPGTISVRPKFRSRLIHTILSDSKPEKKKKERKGDLN
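
The protein sequence above corresponds to the CDS read codon by codon. The sketch below shows that relationship on the first 30 and last 15 nucleotides of the CDS, assuming the standard genetic code (NCBI translation table 1); the page does not say which table was used. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate a DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

print(translate("ATGTCGTGGGTGAGGGGTAAATCTCCTGGT"))  # first 30 nt of the CDS
print(translate("GGTGACCTGAATTGA"))                 # last 15 nt of the CDS
```

The first call reproduces the opening residues of the protein (MSWVRGKSPG), and the second ends in `*` because the CDS terminates with the stop codon TGA.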