; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG09G007860 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG09G007860
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionSmr domain-containing protein
Genome locationCG_Chr09:7179357..7184372
RNA-Seq ExpressionClCG09G007860
SyntenyClCG09G007860
Gene Ontology termsNA
InterPro domainsIPR002625 - Smr domain
IPR013899 - Domain of unknown function DUF1771
IPR036063 - Smr domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0059625.1 Smr (Small MutS Related) domain-containing protein, putative isoform 1 [Cucumis melo var. makuwa]2.8e-27781.82Show/hide
Query:  MSWVRGKSPGWAAFNLKEQNNGLQDEVDPDPFPPMSTTLSSLPPRENLHRVNGRSGRSFTFSPLPSTDSMTSPEKIG----------AKKTVLGASNIQN
        MSWVRGKS GWAAFNLK+QNNG+QDEVD DPFPPMSTTLSSLPPRENL  VNGRSGRSF+F+P+PS DS T P K G          AKKT+LGASNIQ+
Subjt:  MSWVRGKSPGWAAFNLKEQNNGLQDEVDPDPFPPMSTTLSSLPPRENLHRVNGRSGRSFTFSPLPSTDSMTSPEKIG----------AKKTVLGASNIQN

Query:  GKKVVEETADLLSVWKLKELHSWADISLIMDIMEAVNNNFNEASTLLKTMVSSDNFVINNEMSTLGLHFSNDLSWVRGKSPGGAEFNLKQHDRGLQDETD
        GKK+VEET D+LS WKLKELH WADISLIMDIMEAVNN+FNEASTLL TMVSSDN  INNEMS LGLH SNDLSW+ GKSPG  EFNL+QH+RGLQ E D
Subjt:  GKKVVEETADLLSVWKLKELHSWADISLIMDIMEAVNNNFNEASTLLKTMVSSDNFVINNEMSTLGLHFSNDLSWVRGKSPGGAEFNLKQHDRGLQDETD

Query:  PEPFPPMSTGNSSLPPCENLHGVYGCSGGSFASVPLPPADSLTSPENYGAEKTKPDDSISSIQSGKKVVEESTDVLAFWKLKELHSWADFSLIVDIMEAV
        PE FPPM T + SLPP ENLHGVYG  G SFAS PLP ADSLTSP NYGA+ T PDD  S IQSGKKVVEE+TDVLAFWKLKE+HSWADFSLIVDIM+AV
Subjt:  PEPFPPMSTGNSSLPPCENLHGVYGCSGGSFASVPLPPADSLTSPENYGAEKTKPDDSISSIQSGKKVVEESTDVLAFWKLKELHSWADFSLIVDIMEAV

Query:  NNNFDEASTLLKTMVSSDNFEVSNEMSTLGLHSANDLLYNEKNDLSTSLGRTVNTPILSSTLKDVQGVHKNNNACEENCTKLFENNYFERNFFHNVANTN
        NNNFDEASTLLKTMVSSDNFE++NE+STLGLH ANDLL N  ND+S S  RT+N PILS TLK  QG+H+N+N   E+CTKLF N+YFERNFF N  N+ 
Subjt:  NNNFDEASTLLKTMVSSDNFEVSNEMSTLGLHSANDLLYNEKNDLSTSLGRTVNTPILSSTLKDVQGVHKNNNACEENCTKLFENNYFERNFFHNVANTN

Query:  IALGSSKSVPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHTSAKYHSSRAQEQWQAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAA
        IALG SKSVPIEPEWEEDD+YLSHRKDAIAMMRSASQHSRAATNAY RKDH SAKYHSSRAQEQW AAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAA
Subjt:  IALGSSKSVPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHTSAKYHSSRAQEQWQAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAA

Query:  EAIQALQEHLLKIETQNASNRSLSPKKAERKGFQRASSLEYLSCLEYLSCMDSKLDKESPSSRPRPTSLEVITGIGKHSKGEAALPKAVTSFLSENGYRF
        EA+QALQ+HLLKIETQNASNRSLSPKKAERKGFQRASS      LEYLSCMDSKLDKESPSSR RPTSLEVITGIGKHSKGEAALPKAVTSFL+ENGYRF
Subjt:  EAIQALQEHLLKIETQNASNRSLSPKKAERKGFQRASSLEYLSCLEYLSCMDSKLDKESPSSRPRPTSLEVITGIGKHSKGEAALPKAVTSFLSENGYRF

Query:  EQLRPGTISVRPKFRR
        EQ RPGTISVRPKFRR
Subjt:  EQLRPGTISVRPKFRR

KAG6576051.1 hypothetical protein SDJN03_26690, partial [Cucurbita argyrosperma subsp. sororia]9.6e-26279.28Show/hide
Query:  MSWVRGKSPGWAAFNLKEQNNGLQDEVDPDPFPPMSTTLSSLPPRENLHRVNGRSGRSFTFSPLPSTDSMTSPEKIGAKKTVLGASNIQNGKKVVEETAD
        MSW RGKSPGWAA NLK+ N+GLQDE+DPDPFPPMST LS LPPREN+HRVNGRSGRSF+ +PLPS DS+ SPE  GAKKT+ G S+IQ+ KK+VEE+ D
Subjt:  MSWVRGKSPGWAAFNLKEQNNGLQDEVDPDPFPPMSTTLSSLPPRENLHRVNGRSGRSFTFSPLPSTDSMTSPEKIGAKKTVLGASNIQNGKKVVEETAD

Query:  LLSVWKLKELHSWADISLIMDIMEAVNNNFNEASTLLKTMVSSDNFVINNEMSTLGLHFSNDLSWVRGKSPGGAEFNLKQHDRGLQDETDPEPFPPMSTG
        +L+ WKLKELHSWADISLI+DIMEAVNNNFNEAS LLKTMVSSDNF INNEMSTLGLH SND+S VRGKSPG  EFNLKQ +RGLQD  DP+PFPPM + 
Subjt:  LLSVWKLKELHSWADISLIMDIMEAVNNNFNEASTLLKTMVSSDNFVINNEMSTLGLHFSNDLSWVRGKSPGGAEFNLKQHDRGLQDETDPEPFPPMSTG

Query:  NSSLPPCENLHGVYGCSGGSFASVPLPPADSLTSPENYGAEKTKPDDSISSIQSGKKVVEESTDVLAFWKLKELHSWADFSLIVDIMEAVNNNFDEASTL
         SSLPP ENLHGV GC G S +S PLP ADSLTSPENY A+K   D   SSIQ+G+KVVEE+TDVLAFWKLKELH+WADFSLIVDIMEAV+NNF+EAST 
Subjt:  NSSLPPCENLHGVYGCSGGSFASVPLPPADSLTSPENYGAEKTKPDDSISSIQSGKKVVEESTDVLAFWKLKELHSWADFSLIVDIMEAVNNNFDEASTL

Query:  LKTMVSSDNFEVSNEMSTLGLHSANDLLYNEKNDLSTSLGRTVNTPILSSTLKDVQGVHKNNNACEENCTKLFENNYFERNFFHNVANTNIALGSSKSVP
        L  MVSSDN E+ NEMSTLGLHSA+ L  N KND++ SLGRTVN PI SSTLKDVQ +H+N N       KLFENNY ERNFFHNV N  IAL  SKS P
Subjt:  LKTMVSSDNFEVSNEMSTLGLHSANDLLYNEKNDLSTSLGRTVNTPILSSTLKDVQGVHKNNNACEENCTKLFENNYFERNFFHNVANTNIALGSSKSVP

Query:  IEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHTSAKYHSSRAQEQWQAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAAEAIQALQEHL
        IEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDH SAKYHSSRAQEQW AAKMLN KAANEILQTRNS+NGLWKLDLHGLHAAEA+QALQ+HL
Subjt:  IEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHTSAKYHSSRAQEQWQAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAAEAIQALQEHL

Query:  LKIETQNASNRSLSPKKAERKGFQRASSLEYLSCLEYLSCMDSKLDKE--SPSSRPRPTSLEVITGIGKHSKGEAALPKAVTSFLSENGYRFEQLRPGTI
        LKIET+NASNRSLSPKKAERKGF R SS      LEYLSCM  KLDKE  SP  R RPTSLEVITG+GKHS+GEAALPKAVTSFLSENGYRFEQLRPGTI
Subjt:  LKIETQNASNRSLSPKKAERKGFQRASSLEYLSCLEYLSCMDSKLDKE--SPSSRPRPTSLEVITGIGKHSKGEAALPKAVTSFLSENGYRFEQLRPGTI

Query:  SVRPKFRR
        SVRPKFRR
Subjt:  SVRPKFRR

XP_004148966.1 uncharacterized protein LOC101223137 [Cucumis sativus]1.3e-27480.84Show/hide
Query:  MSWVRGKSPGWAAFNLKEQNNGLQDEVDPDPFPPMSTTLSSLPPRENLHRVNGRSGRSFTFSPLPSTDSMTSPEK----------IGAKKTVLGASNIQN
        MSWVRGKS GWAAFNLK+QNNGLQDEVD DPFPPMSTTLSSLPPRENL  VNG SG+SF+ +P+PS DS T P K           GAKKT+LG +NIQ+
Subjt:  MSWVRGKSPGWAAFNLKEQNNGLQDEVDPDPFPPMSTTLSSLPPRENLHRVNGRSGRSFTFSPLPSTDSMTSPEK----------IGAKKTVLGASNIQN

Query:  GKKVVEETADLLSVWKLKELHSWADISLIMDIMEAVNNNFNEASTLLKTMVSSDNFVINNEMSTLGLHFSNDLSWVRGKSPGGAEFNLKQHDRGLQDETD
        GKK+VEET D+LS WKLKELH WADISLIMDIMEAVNN+FNEASTLL TMVSSDN  INN+MSTLGLH SNDL W+ GKSPG  EFNLKQH++GLQDE D
Subjt:  GKKVVEETADLLSVWKLKELHSWADISLIMDIMEAVNNNFNEASTLLKTMVSSDNFVINNEMSTLGLHFSNDLSWVRGKSPGGAEFNLKQHDRGLQDETD

Query:  PEPFPPMSTGNSSLPPCENLHGVYGCSGGSFASVPLPPADSLTSPENYGAEKTKPDDSISSIQSGKKVVEESTDVLAFWKLKELHSWADFSLIVDIMEAV
         E FPPM T  SSLPP ENLHGVYG SG SFAS PLP  DSLTSPENYGA+ T  DD  SSIQSGKKVVEE+TDVLAFWKLKE+HSWADFSLIVDIM+AV
Subjt:  PEPFPPMSTGNSSLPPCENLHGVYGCSGGSFASVPLPPADSLTSPENYGAEKTKPDDSISSIQSGKKVVEESTDVLAFWKLKELHSWADFSLIVDIMEAV

Query:  NNNFDEASTLLKTMVSSDNFEVSNEMSTLGLHSANDLLYNEKNDLSTSLGRTVNTPILSSTLKDVQGVHKNNNACEENCTKLFENNYFERNFFHNVANTN
        NNNFDEASTLLKTMVSSDNFE++NE+STLGLHSANDLL N  ND+S +  R +N PILSST+K VQG+H+NNN   E+ TKLF N+YFERN FHN  N+ 
Subjt:  NNNFDEASTLLKTMVSSDNFEVSNEMSTLGLHSANDLLYNEKNDLSTSLGRTVNTPILSSTLKDVQGVHKNNNACEENCTKLFENNYFERNFFHNVANTN

Query:  IALGSSKSVPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHTSAKYHSSRAQEQWQAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAA
        IALG SKSVPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAY RKDH SAKYHSSRA+EQW AAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAA
Subjt:  IALGSSKSVPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHTSAKYHSSRAQEQWQAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAA

Query:  EAIQALQEHLLKIETQNASNRSLSPKKAERKGFQRASSLEYLSCLEYLSCMDSKLDKESPSSRPRPTSLEVITGIGKHSKGEAALPKAVTSFLSENGYRF
        EA+QAL +HLLKIETQNASNRSLSPKKAERKGFQRASS      LEYLSCM+SKLDKESPSSR RPTSLEVITGIGKHSKGEAALPKAV SFL+ENGYRF
Subjt:  EAIQALQEHLLKIETQNASNRSLSPKKAERKGFQRASSLEYLSCLEYLSCMDSKLDKESPSSRPRPTSLEVITGIGKHSKGEAALPKAVTSFLSENGYRF

Query:  EQLRPGTISVRPKFRR
        EQ RPGTISVRPKFRR
Subjt:  EQLRPGTISVRPKFRR

XP_008451240.1 PREDICTED: uncharacterized protein LOC103492590 [Cucumis melo]9.5e-27881.98Show/hide
Query:  MSWVRGKSPGWAAFNLKEQNNGLQDEVDPDPFPPMSTTLSSLPPRENLHRVNGRSGRSFTFSPLPSTDSMTSPEKIG----------AKKTVLGASNIQN
        MSWVRGKS GWAAFNLK+QNNG+QDEVD DPFPPMSTTLSSLPPRENL  VNGRSGRSF+F+P+PS DS T P K G          AKKT+LGASNIQ+
Subjt:  MSWVRGKSPGWAAFNLKEQNNGLQDEVDPDPFPPMSTTLSSLPPRENLHRVNGRSGRSFTFSPLPSTDSMTSPEKIG----------AKKTVLGASNIQN

Query:  GKKVVEETADLLSVWKLKELHSWADISLIMDIMEAVNNNFNEASTLLKTMVSSDNFVINNEMSTLGLHFSNDLSWVRGKSPGGAEFNLKQHDRGLQDETD
        GKK+VEET D+LS WKLKELH WADISLIMDIMEAVNN+FNEASTLL TMVSSDN  INNEMS LGLH SNDLSW+ GKSPG  EFNL+QH+RGLQ E D
Subjt:  GKKVVEETADLLSVWKLKELHSWADISLIMDIMEAVNNNFNEASTLLKTMVSSDNFVINNEMSTLGLHFSNDLSWVRGKSPGGAEFNLKQHDRGLQDETD

Query:  PEPFPPMSTGNSSLPPCENLHGVYGCSGGSFASVPLPPADSLTSPENYGAEKTKPDDSISSIQSGKKVVEESTDVLAFWKLKELHSWADFSLIVDIMEAV
        PE FPPM T + SLPP ENLHGVYG  G SFAS PLP ADSLTSP NYGA+ T PDD  S IQSGKKVVEE+TDVLAFWKLKE+HSWADFSLIVDIM+AV
Subjt:  PEPFPPMSTGNSSLPPCENLHGVYGCSGGSFASVPLPPADSLTSPENYGAEKTKPDDSISSIQSGKKVVEESTDVLAFWKLKELHSWADFSLIVDIMEAV

Query:  NNNFDEASTLLKTMVSSDNFEVSNEMSTLGLHSANDLLYNEKNDLSTSLGRTVNTPILSSTLKDVQGVHKNNNACEENCTKLFENNYFERNFFHNVANTN
        NNNFDEASTLLKTMVSSDNFE++NE+STLGLHSANDLL N  ND+S S  RT+N PILS TLK  QG+H+N+N   E+CTKLF N+YFERNFF N  N+ 
Subjt:  NNNFDEASTLLKTMVSSDNFEVSNEMSTLGLHSANDLLYNEKNDLSTSLGRTVNTPILSSTLKDVQGVHKNNNACEENCTKLFENNYFERNFFHNVANTN

Query:  IALGSSKSVPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHTSAKYHSSRAQEQWQAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAA
        IALG SKSVPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAY RKDH SAKYHSSRAQEQW AAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAA
Subjt:  IALGSSKSVPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHTSAKYHSSRAQEQWQAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAA

Query:  EAIQALQEHLLKIETQNASNRSLSPKKAERKGFQRASSLEYLSCLEYLSCMDSKLDKESPSSRPRPTSLEVITGIGKHSKGEAALPKAVTSFLSENGYRF
        EA+QALQ+HLLKIETQNASNRSLSPKKAERKGFQRASS      LEYLSCMD+KLDKESPSSR RPTSLEVITGIGKHSKGEAALPKAVTSFL+ENGYRF
Subjt:  EAIQALQEHLLKIETQNASNRSLSPKKAERKGFQRASSLEYLSCLEYLSCMDSKLDKESPSSRPRPTSLEVITGIGKHSKGEAALPKAVTSFLSENGYRF

Query:  EQLRPGTISVRPKFRR
        EQ RPGTISVRPKFRR
Subjt:  EQLRPGTISVRPKFRR

XP_038898473.1 uncharacterized protein LOC120086100 [Benincasa hispida]8.6e-28784.09Show/hide
Query:  MSWVRGKSPGWAAFNLKEQNNGLQDEVDPDPFPPMSTTLSSLPPRENLHRVNGRSGRSFTFSPLPSTDSMTSPEK----------IGAKKTVLGASNIQN
        MSWV+GKSPGWAAFNLK+QNNGLQDEVD DPFPP+STTLSSLPP EN H VNGRSGRSF+F+P PS +S+TSPEK          IGAKKT+L  SN+QN
Subjt:  MSWVRGKSPGWAAFNLKEQNNGLQDEVDPDPFPPMSTTLSSLPPRENLHRVNGRSGRSFTFSPLPSTDSMTSPEK----------IGAKKTVLGASNIQN

Query:  GKKVVEETADLLSVWKLKELHSWADISLIMDIMEAVNNNFNEASTLLKTMVSSDNFVINNEMSTLGLHFSNDLSWVRGKSPGGAEFNLKQHDRGLQDETD
        GKKVVEETAD+LS WKLKELHSWADISLIMD+MEAVNNNF+EASTLLKTMV+SDNF INNEMSTLGL +SNDLSWV G  PG  EFNLKQH+RGLQDETD
Subjt:  GKKVVEETADLLSVWKLKELHSWADISLIMDIMEAVNNNFNEASTLLKTMVSSDNFVINNEMSTLGLHFSNDLSWVRGKSPGGAEFNLKQHDRGLQDETD

Query:  PEPFPPMSTGNSSLPPCENLHGVYGCSGGSFASVPLPPADSLTSPENYGAEKTKPDDSISSIQSGKKVVEESTDVLAFWKLKELHSWADFSLIVDIMEAV
         EP PPM TG+SSLPPCE+LH VYGCSG SF+SVP   ADSLTSPENYGA+KT PDD  SSIQSGKKVVEES D LAFWKLKELHSWADFSLIVDIMEAV
Subjt:  PEPFPPMSTGNSSLPPCENLHGVYGCSGGSFASVPLPPADSLTSPENYGAEKTKPDDSISSIQSGKKVVEESTDVLAFWKLKELHSWADFSLIVDIMEAV

Query:  NNNFDEASTLLKTMVSSDNFEVSNEMSTLGLHSANDLLYNEKNDLSTSLGRTVNTPILSSTLKDVQGVHKNNNACEENCTKLFENNYFERNFFHNVANTN
        NNNF+EASTLLKTMVSSDNF++++EMSTL L SANDLL N KND+STSL RT N PI SSTLKDVQGVH+NNNACEEN TKLFENNYFERNFFHN     
Subjt:  NNNFDEASTLLKTMVSSDNFEVSNEMSTLGLHSANDLLYNEKNDLSTSLGRTVNTPILSSTLKDVQGVHKNNNACEENCTKLFENNYFERNFFHNVANTN

Query:  IALGSSKSVPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHTSAKYHSSRAQEQWQAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAA
        I LG SKSVPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDH SAKYHSSRAQEQW AAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAA
Subjt:  IALGSSKSVPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHTSAKYHSSRAQEQWQAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAA

Query:  EAIQALQEHLLKIETQNASNRSLSPKKAERKGFQRASSLEYLSCLEYLSCMDSKLDKESPSSRPRPTSLEVITGIGKHSKGEAALPKAVTSFLSENGYRF
        EA+QALQEHLLKIET+NASNRSLSPKK+ERKGFQ ASS      LEYLSCMDSK+DKESPSSR RPTSLEVITGIGKHS+GEA LPKAVTSFLSENGYRF
Subjt:  EAIQALQEHLLKIETQNASNRSLSPKKAERKGFQRASSLEYLSCLEYLSCMDSKLDKESPSSRPRPTSLEVITGIGKHSKGEAALPKAVTSFLSENGYRF

Query:  EQLRPGTISVRPKFRR
        EQLRPGTIS+RPKFRR
Subjt:  EQLRPGTISVRPKFRR

TrEMBL top hitse value%identityAlignment
A0A0A0KA90 Smr domain-containing protein6.2e-27580.84Show/hide
Query:  MSWVRGKSPGWAAFNLKEQNNGLQDEVDPDPFPPMSTTLSSLPPRENLHRVNGRSGRSFTFSPLPSTDSMTSPEK----------IGAKKTVLGASNIQN
        MSWVRGKS GWAAFNLK+QNNGLQDEVD DPFPPMSTTLSSLPPRENL  VNG SG+SF+ +P+PS DS T P K           GAKKT+LG +NIQ+
Subjt:  MSWVRGKSPGWAAFNLKEQNNGLQDEVDPDPFPPMSTTLSSLPPRENLHRVNGRSGRSFTFSPLPSTDSMTSPEK----------IGAKKTVLGASNIQN

Query:  GKKVVEETADLLSVWKLKELHSWADISLIMDIMEAVNNNFNEASTLLKTMVSSDNFVINNEMSTLGLHFSNDLSWVRGKSPGGAEFNLKQHDRGLQDETD
        GKK+VEET D+LS WKLKELH WADISLIMDIMEAVNN+FNEASTLL TMVSSDN  INN+MSTLGLH SNDL W+ GKSPG  EFNLKQH++GLQDE D
Subjt:  GKKVVEETADLLSVWKLKELHSWADISLIMDIMEAVNNNFNEASTLLKTMVSSDNFVINNEMSTLGLHFSNDLSWVRGKSPGGAEFNLKQHDRGLQDETD

Query:  PEPFPPMSTGNSSLPPCENLHGVYGCSGGSFASVPLPPADSLTSPENYGAEKTKPDDSISSIQSGKKVVEESTDVLAFWKLKELHSWADFSLIVDIMEAV
         E FPPM T  SSLPP ENLHGVYG SG SFAS PLP  DSLTSPENYGA+ T  DD  SSIQSGKKVVEE+TDVLAFWKLKE+HSWADFSLIVDIM+AV
Subjt:  PEPFPPMSTGNSSLPPCENLHGVYGCSGGSFASVPLPPADSLTSPENYGAEKTKPDDSISSIQSGKKVVEESTDVLAFWKLKELHSWADFSLIVDIMEAV

Query:  NNNFDEASTLLKTMVSSDNFEVSNEMSTLGLHSANDLLYNEKNDLSTSLGRTVNTPILSSTLKDVQGVHKNNNACEENCTKLFENNYFERNFFHNVANTN
        NNNFDEASTLLKTMVSSDNFE++NE+STLGLHSANDLL N  ND+S +  R +N PILSST+K VQG+H+NNN   E+ TKLF N+YFERN FHN  N+ 
Subjt:  NNNFDEASTLLKTMVSSDNFEVSNEMSTLGLHSANDLLYNEKNDLSTSLGRTVNTPILSSTLKDVQGVHKNNNACEENCTKLFENNYFERNFFHNVANTN

Query:  IALGSSKSVPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHTSAKYHSSRAQEQWQAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAA
        IALG SKSVPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAY RKDH SAKYHSSRA+EQW AAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAA
Subjt:  IALGSSKSVPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHTSAKYHSSRAQEQWQAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAA

Query:  EAIQALQEHLLKIETQNASNRSLSPKKAERKGFQRASSLEYLSCLEYLSCMDSKLDKESPSSRPRPTSLEVITGIGKHSKGEAALPKAVTSFLSENGYRF
        EA+QAL +HLLKIETQNASNRSLSPKKAERKGFQRASS      LEYLSCM+SKLDKESPSSR RPTSLEVITGIGKHSKGEAALPKAV SFL+ENGYRF
Subjt:  EAIQALQEHLLKIETQNASNRSLSPKKAERKGFQRASSLEYLSCLEYLSCMDSKLDKESPSSRPRPTSLEVITGIGKHSKGEAALPKAVTSFLSENGYRF

Query:  EQLRPGTISVRPKFRR
        EQ RPGTISVRPKFRR
Subjt:  EQLRPGTISVRPKFRR

A0A1S3BRS7 uncharacterized protein LOC1034925904.6e-27881.98Show/hide
Query:  MSWVRGKSPGWAAFNLKEQNNGLQDEVDPDPFPPMSTTLSSLPPRENLHRVNGRSGRSFTFSPLPSTDSMTSPEKIG----------AKKTVLGASNIQN
        MSWVRGKS GWAAFNLK+QNNG+QDEVD DPFPPMSTTLSSLPPRENL  VNGRSGRSF+F+P+PS DS T P K G          AKKT+LGASNIQ+
Subjt:  MSWVRGKSPGWAAFNLKEQNNGLQDEVDPDPFPPMSTTLSSLPPRENLHRVNGRSGRSFTFSPLPSTDSMTSPEKIG----------AKKTVLGASNIQN

Query:  GKKVVEETADLLSVWKLKELHSWADISLIMDIMEAVNNNFNEASTLLKTMVSSDNFVINNEMSTLGLHFSNDLSWVRGKSPGGAEFNLKQHDRGLQDETD
        GKK+VEET D+LS WKLKELH WADISLIMDIMEAVNN+FNEASTLL TMVSSDN  INNEMS LGLH SNDLSW+ GKSPG  EFNL+QH+RGLQ E D
Subjt:  GKKVVEETADLLSVWKLKELHSWADISLIMDIMEAVNNNFNEASTLLKTMVSSDNFVINNEMSTLGLHFSNDLSWVRGKSPGGAEFNLKQHDRGLQDETD

Query:  PEPFPPMSTGNSSLPPCENLHGVYGCSGGSFASVPLPPADSLTSPENYGAEKTKPDDSISSIQSGKKVVEESTDVLAFWKLKELHSWADFSLIVDIMEAV
        PE FPPM T + SLPP ENLHGVYG  G SFAS PLP ADSLTSP NYGA+ T PDD  S IQSGKKVVEE+TDVLAFWKLKE+HSWADFSLIVDIM+AV
Subjt:  PEPFPPMSTGNSSLPPCENLHGVYGCSGGSFASVPLPPADSLTSPENYGAEKTKPDDSISSIQSGKKVVEESTDVLAFWKLKELHSWADFSLIVDIMEAV

Query:  NNNFDEASTLLKTMVSSDNFEVSNEMSTLGLHSANDLLYNEKNDLSTSLGRTVNTPILSSTLKDVQGVHKNNNACEENCTKLFENNYFERNFFHNVANTN
        NNNFDEASTLLKTMVSSDNFE++NE+STLGLHSANDLL N  ND+S S  RT+N PILS TLK  QG+H+N+N   E+CTKLF N+YFERNFF N  N+ 
Subjt:  NNNFDEASTLLKTMVSSDNFEVSNEMSTLGLHSANDLLYNEKNDLSTSLGRTVNTPILSSTLKDVQGVHKNNNACEENCTKLFENNYFERNFFHNVANTN

Query:  IALGSSKSVPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHTSAKYHSSRAQEQWQAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAA
        IALG SKSVPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAY RKDH SAKYHSSRAQEQW AAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAA
Subjt:  IALGSSKSVPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHTSAKYHSSRAQEQWQAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAA

Query:  EAIQALQEHLLKIETQNASNRSLSPKKAERKGFQRASSLEYLSCLEYLSCMDSKLDKESPSSRPRPTSLEVITGIGKHSKGEAALPKAVTSFLSENGYRF
        EA+QALQ+HLLKIETQNASNRSLSPKKAERKGFQRASS      LEYLSCMD+KLDKESPSSR RPTSLEVITGIGKHSKGEAALPKAVTSFL+ENGYRF
Subjt:  EAIQALQEHLLKIETQNASNRSLSPKKAERKGFQRASSLEYLSCLEYLSCMDSKLDKESPSSRPRPTSLEVITGIGKHSKGEAALPKAVTSFLSENGYRF

Query:  EQLRPGTISVRPKFRR
        EQ RPGTISVRPKFRR
Subjt:  EQLRPGTISVRPKFRR

A0A5D3CAF0 Smr (Small MutS Related) domain-containing protein, putative isoform 11.3e-27781.82Show/hide
Query:  MSWVRGKSPGWAAFNLKEQNNGLQDEVDPDPFPPMSTTLSSLPPRENLHRVNGRSGRSFTFSPLPSTDSMTSPEKIG----------AKKTVLGASNIQN
        MSWVRGKS GWAAFNLK+QNNG+QDEVD DPFPPMSTTLSSLPPRENL  VNGRSGRSF+F+P+PS DS T P K G          AKKT+LGASNIQ+
Subjt:  MSWVRGKSPGWAAFNLKEQNNGLQDEVDPDPFPPMSTTLSSLPPRENLHRVNGRSGRSFTFSPLPSTDSMTSPEKIG----------AKKTVLGASNIQN

Query:  GKKVVEETADLLSVWKLKELHSWADISLIMDIMEAVNNNFNEASTLLKTMVSSDNFVINNEMSTLGLHFSNDLSWVRGKSPGGAEFNLKQHDRGLQDETD
        GKK+VEET D+LS WKLKELH WADISLIMDIMEAVNN+FNEASTLL TMVSSDN  INNEMS LGLH SNDLSW+ GKSPG  EFNL+QH+RGLQ E D
Subjt:  GKKVVEETADLLSVWKLKELHSWADISLIMDIMEAVNNNFNEASTLLKTMVSSDNFVINNEMSTLGLHFSNDLSWVRGKSPGGAEFNLKQHDRGLQDETD

Query:  PEPFPPMSTGNSSLPPCENLHGVYGCSGGSFASVPLPPADSLTSPENYGAEKTKPDDSISSIQSGKKVVEESTDVLAFWKLKELHSWADFSLIVDIMEAV
        PE FPPM T + SLPP ENLHGVYG  G SFAS PLP ADSLTSP NYGA+ T PDD  S IQSGKKVVEE+TDVLAFWKLKE+HSWADFSLIVDIM+AV
Subjt:  PEPFPPMSTGNSSLPPCENLHGVYGCSGGSFASVPLPPADSLTSPENYGAEKTKPDDSISSIQSGKKVVEESTDVLAFWKLKELHSWADFSLIVDIMEAV

Query:  NNNFDEASTLLKTMVSSDNFEVSNEMSTLGLHSANDLLYNEKNDLSTSLGRTVNTPILSSTLKDVQGVHKNNNACEENCTKLFENNYFERNFFHNVANTN
        NNNFDEASTLLKTMVSSDNFE++NE+STLGLH ANDLL N  ND+S S  RT+N PILS TLK  QG+H+N+N   E+CTKLF N+YFERNFF N  N+ 
Subjt:  NNNFDEASTLLKTMVSSDNFEVSNEMSTLGLHSANDLLYNEKNDLSTSLGRTVNTPILSSTLKDVQGVHKNNNACEENCTKLFENNYFERNFFHNVANTN

Query:  IALGSSKSVPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHTSAKYHSSRAQEQWQAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAA
        IALG SKSVPIEPEWEEDD+YLSHRKDAIAMMRSASQHSRAATNAY RKDH SAKYHSSRAQEQW AAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAA
Subjt:  IALGSSKSVPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHTSAKYHSSRAQEQWQAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAA

Query:  EAIQALQEHLLKIETQNASNRSLSPKKAERKGFQRASSLEYLSCLEYLSCMDSKLDKESPSSRPRPTSLEVITGIGKHSKGEAALPKAVTSFLSENGYRF
        EA+QALQ+HLLKIETQNASNRSLSPKKAERKGFQRASS      LEYLSCMDSKLDKESPSSR RPTSLEVITGIGKHSKGEAALPKAVTSFL+ENGYRF
Subjt:  EAIQALQEHLLKIETQNASNRSLSPKKAERKGFQRASSLEYLSCLEYLSCMDSKLDKESPSSRPRPTSLEVITGIGKHSKGEAALPKAVTSFLSENGYRF

Query:  EQLRPGTISVRPKFRR
        EQ RPGTISVRPKFRR
Subjt:  EQLRPGTISVRPKFRR

A0A6J1GN51 uncharacterized protein LOC111455928 isoform X18.7e-26178.95Show/hide
Query:  MSWVRGKSPGWAAFNLKEQNNGLQDEVDPDPFPPMSTTLSSLPPRENLHRVNGRSGRSFTFSPLPSTDSMTSPEKIGAKKTVLGASNIQNGKKVVEETAD
        MSW RGKSPGWAA NLK+QN+GLQDE+DPDPFPPMST LS LPPREN+HRVNGRSGRSF+ +PLPS DS+ SPE  G KKT+ G S+I++GKK+VEE+ D
Subjt:  MSWVRGKSPGWAAFNLKEQNNGLQDEVDPDPFPPMSTTLSSLPPRENLHRVNGRSGRSFTFSPLPSTDSMTSPEKIGAKKTVLGASNIQNGKKVVEETAD

Query:  LLSVWKLKELHSWADISLIMDIMEAVNNNFNEASTLLKTMVSSDNFVINNEMSTLGLHFSNDLSWVRGKSPGGAEFNLKQHDRGLQDETDPEPFPPMSTG
        +L+ WKLKELHSWADISLI+DIMEAVNNNFNEAS LLKTMVSSDNF INNEMSTLGLH SND+S VRGKSPG  EFNLKQ +RGLQD  DP+PFPPM + 
Subjt:  LLSVWKLKELHSWADISLIMDIMEAVNNNFNEASTLLKTMVSSDNFVINNEMSTLGLHFSNDLSWVRGKSPGGAEFNLKQHDRGLQDETDPEPFPPMSTG

Query:  NSSLPPCENLHGVYGCSGGSFASVPLPPADSLTSPENYGAEKTKPDDSISSIQSGKKVVEESTDVLAFWKLKELHSWADFSLIVDIMEAVNNNFDEASTL
         SSLPP ENLHGV G  G S +S PLP ADSLT PENY A+K   D   SSIQ+G+KVVEE+TDVLAFWKLKELH+WADFSLIVDIMEAV+NNF+EAST 
Subjt:  NSSLPPCENLHGVYGCSGGSFASVPLPPADSLTSPENYGAEKTKPDDSISSIQSGKKVVEESTDVLAFWKLKELHSWADFSLIVDIMEAVNNNFDEASTL

Query:  LKTMVSSDNFEVSNEMSTLGLHSANDLLYNEKNDLSTSLGRTVNTPILSSTLKDVQGVHKNNNACEENCTKLFENNYFERNFFHNVANTNIALGSSKSVP
        L  MVSSDN E+ NEMSTLGLHSA+ L  N KND++ SLGRTVN PI SSTLKDVQ +H+N N       KLFENNY ERNFFHNV N  IAL  SKS P
Subjt:  LKTMVSSDNFEVSNEMSTLGLHSANDLLYNEKNDLSTSLGRTVNTPILSSTLKDVQGVHKNNNACEENCTKLFENNYFERNFFHNVANTNIALGSSKSVP

Query:  IEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHTSAKYHSSRAQEQWQAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAAEAIQALQEHL
        IEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDH SAKYHSSRAQEQW AAKMLN KAANEILQTRNS+NGLWKLDLHGLHAAEA+QALQ+HL
Subjt:  IEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHTSAKYHSSRAQEQWQAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAAEAIQALQEHL

Query:  LKIETQNASNRSLSPKKAERKGFQRASSLEYLSCLEYLSCMDSKLDKE--SPSSRPRPTSLEVITGIGKHSKGEAALPKAVTSFLSENGYRFEQLRPGTI
        LKIET+NASNRSLSPKKAERKGF R SS      LEYLSCM  KLDKE  SP  R RPTSLEVITG+GKHS+GEAALPKAVTSFLSENGYRFEQLRPGTI
Subjt:  LKIETQNASNRSLSPKKAERKGFQRASSLEYLSCLEYLSCMDSKLDKE--SPSSRPRPTSLEVITGIGKHSKGEAALPKAVTSFLSENGYRFEQLRPGTI

Query:  SVRPKFRR
        SVRPKFRR
Subjt:  SVRPKFRR

A0A6J1GPE7 uncharacterized protein LOC111455928 isoform X21.4e-25377.8Show/hide
Query:  MSWVRGKSPGWAAFNLKEQNNGLQDEVDPDPFPPMSTTLSSLPPRENLHRVNGRSGRSFTFSPLPSTDSMTSPEKIGAKKTVLGASNIQNGKKVVEETAD
        MSW RGKSPGWAA NLK+QN+GLQDE+DPDPFPPMST LS LPPREN+HRVNGRSGRS             SPE  G KKT+ G S+I++GKK+VEE+ D
Subjt:  MSWVRGKSPGWAAFNLKEQNNGLQDEVDPDPFPPMSTTLSSLPPRENLHRVNGRSGRSFTFSPLPSTDSMTSPEKIGAKKTVLGASNIQNGKKVVEETAD

Query:  LLSVWKLKELHSWADISLIMDIMEAVNNNFNEASTLLKTMVSSDNFVINNEMSTLGLHFSNDLSWVRGKSPGGAEFNLKQHDRGLQDETDPEPFPPMSTG
        +L+ WKLKELHSWADISLI+DIMEAVNNNFNEAS LLKTMVSSDNF INNEMSTLGLH SND+S VRGKSPG  EFNLKQ +RGLQD  DP+PFPPM + 
Subjt:  LLSVWKLKELHSWADISLIMDIMEAVNNNFNEASTLLKTMVSSDNFVINNEMSTLGLHFSNDLSWVRGKSPGGAEFNLKQHDRGLQDETDPEPFPPMSTG

Query:  NSSLPPCENLHGVYGCSGGSFASVPLPPADSLTSPENYGAEKTKPDDSISSIQSGKKVVEESTDVLAFWKLKELHSWADFSLIVDIMEAVNNNFDEASTL
         SSLPP ENLHGV G  G S +S PLP ADSLT PENY A+K   D   SSIQ+G+KVVEE+TDVLAFWKLKELH+WADFSLIVDIMEAV+NNF+EAST 
Subjt:  NSSLPPCENLHGVYGCSGGSFASVPLPPADSLTSPENYGAEKTKPDDSISSIQSGKKVVEESTDVLAFWKLKELHSWADFSLIVDIMEAVNNNFDEASTL

Query:  LKTMVSSDNFEVSNEMSTLGLHSANDLLYNEKNDLSTSLGRTVNTPILSSTLKDVQGVHKNNNACEENCTKLFENNYFERNFFHNVANTNIALGSSKSVP
        L  MVSSDN E+ NEMSTLGLHSA+ L  N KND++ SLGRTVN PI SSTLKDVQ +H+N N       KLFENNY ERNFFHNV N  IAL  SKS P
Subjt:  LKTMVSSDNFEVSNEMSTLGLHSANDLLYNEKNDLSTSLGRTVNTPILSSTLKDVQGVHKNNNACEENCTKLFENNYFERNFFHNVANTNIALGSSKSVP

Query:  IEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHTSAKYHSSRAQEQWQAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAAEAIQALQEHL
        IEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDH SAKYHSSRAQEQW AAKMLN KAANEILQTRNS+NGLWKLDLHGLHAAEA+QALQ+HL
Subjt:  IEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHTSAKYHSSRAQEQWQAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAAEAIQALQEHL

Query:  LKIETQNASNRSLSPKKAERKGFQRASSLEYLSCLEYLSCMDSKLDKE--SPSSRPRPTSLEVITGIGKHSKGEAALPKAVTSFLSENGYRFEQLRPGTI
        LKIET+NASNRSLSPKKAERKGF R SS      LEYLSCM  KLDKE  SP  R RPTSLEVITG+GKHS+GEAALPKAVTSFLSENGYRFEQLRPGTI
Subjt:  LKIETQNASNRSLSPKKAERKGFQRASSLEYLSCLEYLSCMDSKLDKE--SPSSRPRPTSLEVITGIGKHSKGEAALPKAVTSFLSENGYRFEQLRPGTI

Query:  SVRPKFRR
        SVRPKFRR
Subjt:  SVRPKFRR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G23520.1 smr (Small MutS Related) domain-containing protein3.2e-7441.36Show/hide
Query:  LSWVRGKSPGGAEFNLKQHDR-GLQDETDPEPFPPMSTG-NSSLPPCENLHGVYGCSGGSFASVPLPPA--DSLT------SPENYGAEKTKPDDSISSI
        +SW++GKS G   F+LKQ  + GL+ E + +PFPP+ST  N+S      L   +  S  SF+SV LPP+   +LT      + E  G  + KPD     +
Subjt:  LSWVRGKSPGGAEFNLKQHDR-GLQDETDPEPFPPMSTG-NSSLPPCENLHGVYGCSGGSFASVPLPPA--DSLT------SPENYGAEKTKPDDSISSI

Query:  QSGKKVVEESTDVLAFWKLKELHSWADFSLIVDIMEAVNNNFDEASTLLKTMVSSDNFEVSNEMSTLGLHSANDLLYNEKNDLSTSLGRTVNTPI---LS
         S           LAF KLKE++SWAD +LI D++ +  ++F+ A   LK MVSS   E     S +  +S++    N +++  T   +TV + +     
Subjt:  QSGKKVVEESTDVLAFWKLKELHSWADFSLIVDIMEAVNNNFDEASTLLKTMVSSDNFEVSNEMSTLGLHSANDLLYNEKNDLSTSLGRTVNTPI---LS

Query:  STLKDVQGVHKNNNACEENCTKLFENNYFERNFFHNVANTNIALGSSKSVPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHTSAKYHSS
        ST +D       N+          +N  F      +++  +  +   +S+PIEPEWEEDD+YLSHRKDA+ +MRSAS HSRAA NA+ R DH SAK HS 
Subjt:  STLKDVQGVHKNNNACEENCTKLFENNYFERNFFHNVANTNIALGSSKSVPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHTSAKYHSS

Query:  RAQEQWQAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAAEAIQALQEHLLKIETQNASNRSLSPKKAERKGFQ-RASSLEYLSCLEYLSCMDSKLDKE
        +A+E W AA+ LN +AA +I+   N  N +WKLDLHGLHA EA+QALQE L  IE     NRS+SP +   K    R++S E    L+     +  +  +
Subjt:  RAQEQWQAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAAEAIQALQEHLLKIETQNASNRSLSPKKAERKGFQ-RASSLEYLSCLEYLSCMDSKLDKE

Query:  SPSSRPRPTSLEVITGIGKHSKGEAALPKAVTSFLSENGYRFEQLRPGTISVRPKFR
          SSR    SL+VITGIGKHS+G+A+LP AV +F  +N YRF++ RPG I+VRPKFR
Subjt:  SPSSRPRPTSLEVITGIGKHSKGEAALPKAVTSFLSENGYRFEQLRPGTISVRPKFR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCGCAAACTTTCCAGCAAAATTTCTCTACTTTCTATCTCCCAACTGCATTTCCGTTTCCAAAATCGGTTCCGAAGAGGCACAAACGGCGGACTTGGGGATTC
GACATAGTGAGAAGACTCGGTAATGAACGGGTCTGTTTCAACCAGTTTGCTACCTCTTCGAGCTGCGGTCTAATTGGAATGGTGGATCGGAACAACAGGACATAT
TCATATTCTTTGCATTGGAAGATGTCGTGGGTGAGGGGTAAATCTCCTGGTTGGGCAGCTTTTAACCTTAAGGAACAGAACAATGGACTTCAAGATGAAGTTGAC
CCGGATCCATTCCCACCAATGTCAACCACCCTCTCCTCTCTGCCACCCCGTGAAAATTTGCATCGAGTTAATGGTCGTTCGGGGAGATCTTTCACATTTTCTCCC
CTTCCTTCTACCGATTCTATGACTTCACCAGAAAAAATTGGTGCAAAAAAAACAGTACTTGGTGCTTCTAACATTCAAAATGGCAAGAAGGTGGTTGAAGAAACT
GCTGACCTTTTATCCGTTTGGAAGCTTAAAGAGCTTCATTCCTGGGCTGATATTAGCTTGATTATGGATATAATGGAAGCTGTAAATAATAACTTCAATGAGGCA
TCTACTTTATTAAAAACTATGGTTTCTAGCGACAATTTTGTGATCAATAATGAGATGAGCACCTTAGGGCTGCATTTCTCTAATGATCTATCCTGGGTGAGGGGT
AAATCTCCTGGTGGTGCTGAATTTAACCTTAAGCAACATGATAGAGGCCTTCAAGATGAAACTGACCCGGAACCATTCCCACCAATGTCAACCGGCAATTCCTCT
CTGCCACCCTGTGAAAACTTGCATGGAGTTTATGGATGTTCAGGGGGATCCTTCGCATCTGTACCCCTTCCTCCTGCTGATTCTCTAACTTCTCCGGAAAATTAT
GGTGCAGAGAAGACAAAACCTGATGATTCTATTTCTAGCATTCAAAGTGGCAAGAAGGTGGTTGAAGAAAGCACTGATGTTTTAGCCTTTTGGAAGCTTAAAGAG
CTTCATTCTTGGGCTGATTTTAGCTTAATTGTGGATATAATGGAAGCTGTCAATAATAACTTCGATGAGGCATCTACTTTATTAAAGACAATGGTTTCAAGTGAC
AATTTTGAGGTCAGTAATGAGATGAGCACCTTAGGACTGCATTCCGCTAATGATTTATTATACAACGAGAAGAATGATTTAAGTACATCATTAGGAAGAACGGTC
AATACTCCCATCCTTAGTTCCACACTAAAGGATGTGCAAGGCGTGCATAAAAATAATAATGCATGTGAAGAAAATTGTACCAAATTGTTTGAAAATAATTATTTT
GAAAGAAATTTCTTTCATAATGTTGCAAATACAAACATAGCTCTAGGTAGCTCAAAGTCTGTTCCTATTGAGCCTGAGTGGGAAGAAGATGATATTTACCTGAGC
CATCGGAAAGATGCTATAGCAATGATGAGGTCTGCATCTCAACATTCAAGGGCAGCCACTAATGCCTATCTTAGGAAAGATCATACTTCTGCCAAGTATCATTCA
TCAAGAGCTCAAGAACAATGGCAAGCTGCAAAAATGTTAAATGATAAGGCGGCTAATGAAATTTTACAAACAAGGAATAGTAAAAATGGGCTGTGGAAGTTGGAC
TTACATGGGCTTCACGCAGCAGAGGCTATTCAAGCCTTGCAAGAACACTTGCTGAAAATTGAAACTCAGAATGCCTCCAATCGGTCGTTGTCGCCAAAGAAAGCT
GAAAGGAAAGGATTTCAACGTGCTTCATCCCTTGAGTATCTTAGTTGCCTTGAGTATCTTAGTTGTATGGACTCAAAGTTGGACAAAGAATCACCATCATCTAGG
CCTAGGCCGACATCATTGGAAGTCATAACAGGTATAGGTAAACATAGTAAAGGGGAAGCTGCTCTACCAAAGGCTGTGACAAGTTTTCTTAGTGAAAATGGGTAC
CGTTTCGAACAGTTAAGGCCTGGGACAATCAGCGTTCGACCAAAGTTTCGTAGGTAA
mRNA sequenceShow/hide mRNA sequence
ATGCCGCAAACTTTCCAGCAAAATTTCTCTACTTTCTATCTCCCAACTGCATTTCCGTTTCCAAAATCGGTTCCGAAGAGGCACAAACGGCGGACTTGGGGATTC
GACATAGTGAGAAGACTCGGTAATGAACGGGTCTGTTTCAACCAGTTTGCTACCTCTTCGAGCTGCGGTCTAATTGGAATGGTGGATCGGAACAACAGGACATAT
TCATATTCTTTGCATTGGAAGATGTCGTGGGTGAGGGGTAAATCTCCTGGTTGGGCAGCTTTTAACCTTAAGGAACAGAACAATGGACTTCAAGATGAAGTTGAC
CCGGATCCATTCCCACCAATGTCAACCACCCTCTCCTCTCTGCCACCCCGTGAAAATTTGCATCGAGTTAATGGTCGTTCGGGGAGATCTTTCACATTTTCTCCC
CTTCCTTCTACCGATTCTATGACTTCACCAGAAAAAATTGGTGCAAAAAAAACAGTACTTGGTGCTTCTAACATTCAAAATGGCAAGAAGGTGGTTGAAGAAACT
GCTGACCTTTTATCCGTTTGGAAGCTTAAAGAGCTTCATTCCTGGGCTGATATTAGCTTGATTATGGATATAATGGAAGCTGTAAATAATAACTTCAATGAGGCA
TCTACTTTATTAAAAACTATGGTTTCTAGCGACAATTTTGTGATCAATAATGAGATGAGCACCTTAGGGCTGCATTTCTCTAATGATCTATCCTGGGTGAGGGGT
AAATCTCCTGGTGGTGCTGAATTTAACCTTAAGCAACATGATAGAGGCCTTCAAGATGAAACTGACCCGGAACCATTCCCACCAATGTCAACCGGCAATTCCTCT
CTGCCACCCTGTGAAAACTTGCATGGAGTTTATGGATGTTCAGGGGGATCCTTCGCATCTGTACCCCTTCCTCCTGCTGATTCTCTAACTTCTCCGGAAAATTAT
GGTGCAGAGAAGACAAAACCTGATGATTCTATTTCTAGCATTCAAAGTGGCAAGAAGGTGGTTGAAGAAAGCACTGATGTTTTAGCCTTTTGGAAGCTTAAAGAG
CTTCATTCTTGGGCTGATTTTAGCTTAATTGTGGATATAATGGAAGCTGTCAATAATAACTTCGATGAGGCATCTACTTTATTAAAGACAATGGTTTCAAGTGAC
AATTTTGAGGTCAGTAATGAGATGAGCACCTTAGGACTGCATTCCGCTAATGATTTATTATACAACGAGAAGAATGATTTAAGTACATCATTAGGAAGAACGGTC
AATACTCCCATCCTTAGTTCCACACTAAAGGATGTGCAAGGCGTGCATAAAAATAATAATGCATGTGAAGAAAATTGTACCAAATTGTTTGAAAATAATTATTTT
GAAAGAAATTTCTTTCATAATGTTGCAAATACAAACATAGCTCTAGGTAGCTCAAAGTCTGTTCCTATTGAGCCTGAGTGGGAAGAAGATGATATTTACCTGAGC
CATCGGAAAGATGCTATAGCAATGATGAGGTCTGCATCTCAACATTCAAGGGCAGCCACTAATGCCTATCTTAGGAAAGATCATACTTCTGCCAAGTATCATTCA
TCAAGAGCTCAAGAACAATGGCAAGCTGCAAAAATGTTAAATGATAAGGCGGCTAATGAAATTTTACAAACAAGGAATAGTAAAAATGGGCTGTGGAAGTTGGAC
TTACATGGGCTTCACGCAGCAGAGGCTATTCAAGCCTTGCAAGAACACTTGCTGAAAATTGAAACTCAGAATGCCTCCAATCGGTCGTTGTCGCCAAAGAAAGCT
GAAAGGAAAGGATTTCAACGTGCTTCATCCCTTGAGTATCTTAGTTGCCTTGAGTATCTTAGTTGTATGGACTCAAAGTTGGACAAAGAATCACCATCATCTAGG
CCTAGGCCGACATCATTGGAAGTCATAACAGGTATAGGTAAACATAGTAAAGGGGAAGCTGCTCTACCAAAGGCTGTGACAAGTTTTCTTAGTGAAAATGGGTAC
CGTTTCGAACAGTTAAGGCCTGGGACAATCAGCGTTCGACCAAAGTTTCGTAGGTAAATATCTAGAGCACCCATTATTATTTGTTAGTATTGGCTTCAGGAGAAT
ATAATGCAAGTTATTTAGTTAGAACTTAGATTAATTCGAGTAGTTGGAGATTGGCTAAGTGAAACTGAAACGGGTAAAATGACTGAAAATGTTATCTAGGAATTT
GTCTATTGAATGAAATCTTGAACCTTAGAAGGGTGTACAGTAGGGAAGGATTAATATTGGTGGATTGATTGTTTACAAACGTTACTGTATTGTTATGATGTTCTC
TTTCTTGAAGAAGATAGCACTAAGAAAGAGGTTACAACAACTTTAGAGGCTGATCCTGAGGAAAATCAATTTTGACTAATG
Protein sequenceShow/hide protein sequence
MPQTFQQNFSTFYLPTAFPFPKSVPKRHKRRTWGFDIVRRLGNERVCFNQFATSSSCGLIGMVDRNNRTYSYSLHWKMSWVRGKSPGWAAFNLKEQNNGLQDEVD
PDPFPPMSTTLSSLPPRENLHRVNGRSGRSFTFSPLPSTDSMTSPEKIGAKKTVLGASNIQNGKKVVEETADLLSVWKLKELHSWADISLIMDIMEAVNNNFNEA
STLLKTMVSSDNFVINNEMSTLGLHFSNDLSWVRGKSPGGAEFNLKQHDRGLQDETDPEPFPPMSTGNSSLPPCENLHGVYGCSGGSFASVPLPPADSLTSPENY
GAEKTKPDDSISSIQSGKKVVEESTDVLAFWKLKELHSWADFSLIVDIMEAVNNNFDEASTLLKTMVSSDNFEVSNEMSTLGLHSANDLLYNEKNDLSTSLGRTV
NTPILSSTLKDVQGVHKNNNACEENCTKLFENNYFERNFFHNVANTNIALGSSKSVPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHTSAKYHS
SRAQEQWQAAKMLNDKAANEILQTRNSKNGLWKLDLHGLHAAEAIQALQEHLLKIETQNASNRSLSPKKAERKGFQRASSLEYLSCLEYLSCMDSKLDKESPSSR
PRPTSLEVITGIGKHSKGEAALPKAVTSFLSENGYRFEQLRPGTISVRPKFRR