CuGenDBv2

HG10022752 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10022752
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Smr domain-containing protein
Genome location: Chr05:27911091..27914082
RNA-Seq Expression: HG10022752
Synteny: HG10022752
Gene Ontology terms: NA
InterPro domains:
  IPR002625 - Smr domain
  IPR013899 - Domain of unknown function DUF1771
  IPR036063 - Smr domain superfamily
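
The genome location field uses a `chromosome:start..end` form. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this), the span of the gene on the chromosome can be computed as below. The helper name `parse_location` is hypothetical and is not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'Chr:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    return chrom, start, end


chrom, start, end = parse_location("Chr05:27911091..27914082")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene spans 2992 bp on Chr05. If the coordinates were 0-based and half-open, the span would be one less.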


Homology
GenBank top hits | e value | % identity | Alignment
KAA0059625.1 Smr (Small MutS Related) domain-containing protein, putative isoform 1 [Cucumis melo var. makuwa] | 1.5e-293 | 85.36
Query:  MSWVRGKSPGWAAFNLKEQNNDLRDEVDPDPFPPMSTTLSSLPPRENLHRVNGRSGRSFSFAPLPSADSLTSPEKFGAKKTTLENFGAKKTILGASNIQN
        MSWVRGKS GWAAFNLK+QNN ++DEVD DPFPPMSTTLSSLPPRENL  VNGRSGRSFSFAP+PSADS T P K GAKKTTL NF AKKTILGASNIQ+
Subjt:  MSWVRGKSPGWAAFNLKEQNNDLRDEVDPDPFPPMSTTLSSLPPRENLHRVNGRSGRSFSFAPLPSADSLTSPEKFGAKKTTLENFGAKKTILGASNIQN

Query:  GKKVVEETAEVLSFWKLKELHSWADISLIMDIMEAVNNNFNEASTLLKTMVSSDNLEVSNEMSTLGLHSSNDLSGVRGKSPGWVEFNLEHHNRGLQDETV
        GKK+VEET +VLSFWKLKELH WADISLIMDIMEAVNN+FNEASTLL TMVSSDNLE++NEMS LGLHSSNDLS + GKSPGW EFNL+ HNRGLQ E  
Subjt:  GKKVVEETAEVLSFWKLKELHSWADISLIMDIMEAVNNNFNEASTLLKTMVSSDNLEVSNEMSTLGLHSSNDLSGVRGKSPGWVEFNLEHHNRGLQDETV

Query:  PEPFPPMLTGHSSLPPCENMHGVYGCSGKSFSSVPLASADSLTSPENYDAKKTIPDDSSIQSGKKVVEGSTDVVSFWKLKELHSWADFSLIVDIMEAVNN
        PE FPPMLT H SLPP EN+HGVYG  G+SF+S PL SADSLTSP NY AK TIPDDS IQSGKKVVE +TDV++FWKLKE+HSWADFSLIVDIM+AVNN
Subjt:  PEPFPPMLTGHSSLPPCENMHGVYGCSGKSFSSVPLASADSLTSPENYDAKKTIPDDSSIQSGKKVVEGSTDVVSFWKLKELHSWADFSLIVDIMEAVNN

Query:  NFDEASTLLNTMVSRDNFEISNEMSTLGLHSANDLLCNGKIDLSISLERMVNTPILSSTLKDVQGVHQNNNACEEDYTKLFENNYFERNFFHNVGNTKIA
        NFDEASTLL TMVS DNFEI+NE+STLGLH ANDLLCNG  D+SIS ER +N PILS TLK  QG+HQN+N   ED TKLF N+YFERNFF N GN+KIA
Subjt:  NFDEASTLLNTMVSRDNFEISNEMSTLGLHSANDLLCNGKIDLSISLERMVNTPILSSTLKDVQGVHQNNNACEEDYTKLFENNYFERNFFHNVGNTKIA

Query:  LGCSKFVPIEPEWEEDDVYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEQWLAAKMLNDKAANEILRSRNSKNGLWKLDLHGLHAAEA
        LGCSK VPIEPEWEEDDVYLSHRKDAIAMMRSASQHSRAATNAY RKDHASAKYHSSRAQEQWLAAKMLNDKAANEIL++RNSKNGLWKLDLHGLHAAEA
Subjt:  LGCSKFVPIEPEWEEDDVYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEQWLAAKMLNDKAANEILRSRNSKNGLWKLDLHGLHAAEA

Query:  VQALQEHLLKIETRNASNRSLSPKKAERKGFQRASSLEYLSCMDSKLDKESPSSRHRPTSLEVITGIGKHSKGEAALPKAVTSFLSENGYRFEQLRPGTI
        VQALQ+HLLKIET+NASNRSLSPKKAERKGFQRASSLEYLSCMDSKLDKESPSSRHRPTSLEVITGIGKHSKGEAALPKAVTSFL+ENGYRFEQ RPGTI
Subjt:  VQALQEHLLKIETRNASNRSLSPKKAERKGFQRASSLEYLSCMDSKLDKESPSSRHRPTSLEVITGIGKHSKGEAALPKAVTSFLSENGYRFEQLRPGTI

Query:  SVRPKFRR
        SVRPKFRR
Subjt:  SVRPKFRR

KAG6576051.1 hypothetical protein SDJN03_26690, partial [Cucurbita argyrosperma subsp. sororia] | 5.8e-269 | 80.33
Query:  MSWVRGKSPGWAAFNLKEQNNDLRDEVDPDPFPPMSTTLSSLPPRENLHRVNGRSGRSFSFAPLPSADSLTSPEKFGAKKTTLENFGAKKTILGASNIQN
        MSW RGKSPGWAA NLK+ N+ L+DE+DPDPFPPMST LS LPPREN+HRVNGRSGRSFS  PLPSADSL SP          ENFGAKKTI G S+IQ+
Subjt:  MSWVRGKSPGWAAFNLKEQNNDLRDEVDPDPFPPMSTTLSSLPPRENLHRVNGRSGRSFSFAPLPSADSLTSPEKFGAKKTTLENFGAKKTILGASNIQN

Query:  GKKVVEETAEVLSFWKLKELHSWADISLIMDIMEAVNNNFNEASTLLKTMVSSDNLEVSNEMSTLGLHSSNDLSGVRGKSPGWVEFNLEHHNRGLQDETV
         KK+VEE+ +VL+FWKLKELHSWADISLI+DIMEAVNNNFNEAS LLKTMVSSDN E++NEMSTLGLHSSND+S VRGKSPGW EFNL+  NRGLQD   
Subjt:  GKKVVEETAEVLSFWKLKELHSWADISLIMDIMEAVNNNFNEASTLLKTMVSSDNLEVSNEMSTLGLHSSNDLSGVRGKSPGWVEFNLEHHNRGLQDETV

Query:  PEPFPPMLTGHSSLPPCENMHGVYGCSGKSFSSVPLASADSLTSPENYDAKKTIPDDSSIQSGKKVVEGSTDVVSFWKLKELHSWADFSLIVDIMEAVNN
        P+PFPPM +  SSLPP EN+HGV GC G+S SS PL SADSLTSPENY AKK I  DSSIQ+G+KVVE +TDV++FWKLKELH+WADFSLIVDIMEAV+N
Subjt:  PEPFPPMLTGHSSLPPCENMHGVYGCSGKSFSSVPLASADSLTSPENYDAKKTIPDDSSIQSGKKVVEGSTDVVSFWKLKELHSWADFSLIVDIMEAVNN

Query:  NFDEASTLLNTMVSRDNFEISNEMSTLGLHSANDLLCNGKIDLSISLERMVNTPILSSTLKDVQGVHQNNNACEEDYTKLFENNYFERNFFHNVGNTKIA
        NF+EAST LN MVS DN EI NEMSTLGLHSA+ L CNGK D++ISL R VN PI SSTLKDVQ +HQN N       KLFENNY ERNFFHNVGN KIA
Subjt:  NFDEASTLLNTMVSRDNFEISNEMSTLGLHSANDLLCNGKIDLSISLERMVNTPILSSTLKDVQGVHQNNNACEEDYTKLFENNYFERNFFHNVGNTKIA

Query:  LGCSKFVPIEPEWEEDDVYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEQWLAAKMLNDKAANEILRSRNSKNGLWKLDLHGLHAAEA
        L CSK  PIEPEWEEDD+YLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEQWLAAKMLN KAANEIL++RNS+NGLWKLDLHGLHAAEA
Subjt:  LGCSKFVPIEPEWEEDDVYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEQWLAAKMLNDKAANEILRSRNSKNGLWKLDLHGLHAAEA

Query:  VQALQEHLLKIETRNASNRSLSPKKAERKGFQRASSLEYLSCMDSKLDKE--SPSSRHRPTSLEVITGIGKHSKGEAALPKAVTSFLSENGYRFEQLRPG
        VQALQ+HLLKIETRNASNRSLSPKKAERKGF R SSLEYLSCM  KLDKE  SP  RHRPTSLEVITG+GKHS+GEAALPKAVTSFLSENGYRFEQLRPG
Subjt:  VQALQEHLLKIETRNASNRSLSPKKAERKGFQRASSLEYLSCMDSKLDKE--SPSSRHRPTSLEVITGIGKHSKGEAALPKAVTSFLSENGYRFEQLRPG

Query:  TISVRPKFRR
        TISVRPKFRR
Subjt:  TISVRPKFRR

XP_004148966.1 uncharacterized protein LOC101223137 [Cucumis sativus] | 1.8e-294 | 84.7
Query:  MSWVRGKSPGWAAFNLKEQNNDLRDEVDPDPFPPMSTTLSSLPPRENLHRVNGRSGRSFSFAPLPSADSLTSPEKFGAKKTTLENFGAKKTILGASNIQN
        MSWVRGKS GWAAFNLK+QNN L+DEVD DPFPPMSTTLSSLPPRENL  VNG SG+SFS AP+PSADS T P KFGAKKTTL NFGAKKTILG +NIQ+
Subjt:  MSWVRGKSPGWAAFNLKEQNNDLRDEVDPDPFPPMSTTLSSLPPRENLHRVNGRSGRSFSFAPLPSADSLTSPEKFGAKKTTLENFGAKKTILGASNIQN

Query:  GKKVVEETAEVLSFWKLKELHSWADISLIMDIMEAVNNNFNEASTLLKTMVSSDNLEVSNEMSTLGLHSSNDLSGVRGKSPGWVEFNLEHHNRGLQDETV
        GKK+VEET +VLSFWKLKELH WADISLIMDIMEAVNN+FNEASTLL TMVSSDNLE++N+MSTLGLHSSNDL  + GKSPGW EFNL+ HN+GLQDE  
Subjt:  GKKVVEETAEVLSFWKLKELHSWADISLIMDIMEAVNNNFNEASTLLKTMVSSDNLEVSNEMSTLGLHSSNDLSGVRGKSPGWVEFNLEHHNRGLQDETV

Query:  PEPFPPMLTGHSSLPPCENMHGVYGCSGKSFSSVPLASADSLTSPENYDAKKTIPDDSSIQSGKKVVEGSTDVVSFWKLKELHSWADFSLIVDIMEAVNN
         E FPPMLT  SSLPP EN+HGVYG SG+SF+S PL S DSLTSPENY AK TI DDSSIQSGKKVVE +TDV++FWKLKE+HSWADFSLIVDIM+AVNN
Subjt:  PEPFPPMLTGHSSLPPCENMHGVYGCSGKSFSSVPLASADSLTSPENYDAKKTIPDDSSIQSGKKVVEGSTDVVSFWKLKELHSWADFSLIVDIMEAVNN

Query:  NFDEASTLLNTMVSRDNFEISNEMSTLGLHSANDLLCNGKIDLSISLERMVNTPILSSTLKDVQGVHQNNNACEEDYTKLFENNYFERNFFHNVGNTKIA
        NFDEASTLL TMVS DNFEI+NE+STLGLHSANDLLCNG  D+SI+ ERM+N PILSST+K VQG+HQNNN   EDYTKLF N+YFERN FHN GN+KIA
Subjt:  NFDEASTLLNTMVSRDNFEISNEMSTLGLHSANDLLCNGKIDLSISLERMVNTPILSSTLKDVQGVHQNNNACEEDYTKLFENNYFERNFFHNVGNTKIA

Query:  LGCSKFVPIEPEWEEDDVYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEQWLAAKMLNDKAANEILRSRNSKNGLWKLDLHGLHAAEA
        LGCSK VPIEPEWEEDD+YLSHRKDAIAMMRSASQHSRAATNAY RKDHASAKYHSSRA+EQWLAAKMLNDKAANEIL++RNSKNGLWKLDLHGLHAAEA
Subjt:  LGCSKFVPIEPEWEEDDVYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEQWLAAKMLNDKAANEILRSRNSKNGLWKLDLHGLHAAEA

Query:  VQALQEHLLKIETRNASNRSLSPKKAERKGFQRASSLEYLSCMDSKLDKESPSSRHRPTSLEVITGIGKHSKGEAALPKAVTSFLSENGYRFEQLRPGTI
        VQAL +HLLKIET+NASNRSLSPKKAERKGFQRASSLEYLSCM+SKLDKESPSSRHRPTSLEVITGIGKHSKGEAALPKAV SFL+ENGYRFEQ RPGTI
Subjt:  VQALQEHLLKIETRNASNRSLSPKKAERKGFQRASSLEYLSCMDSKLDKESPSSRHRPTSLEVITGIGKHSKGEAALPKAVTSFLSENGYRFEQLRPGTI

Query:  SVRPKFRR
        SVRPKFRR
Subjt:  SVRPKFRR

XP_008451240.1 PREDICTED: uncharacterized protein LOC103492590 [Cucumis melo] | 9.0e-294 | 85.2
Query:  MSWVRGKSPGWAAFNLKEQNNDLRDEVDPDPFPPMSTTLSSLPPRENLHRVNGRSGRSFSFAPLPSADSLTSPEKFGAKKTTLENFGAKKTILGASNIQN
        MSWVRGKS GWAAFNLK+QNN ++DEVD DPFPPMSTTLSSLPPRENL  VNGRSGRSFSFAP+PSADS T P K GAKKTTL NF AKKTILGASNIQ+
Subjt:  MSWVRGKSPGWAAFNLKEQNNDLRDEVDPDPFPPMSTTLSSLPPRENLHRVNGRSGRSFSFAPLPSADSLTSPEKFGAKKTTLENFGAKKTILGASNIQN

Query:  GKKVVEETAEVLSFWKLKELHSWADISLIMDIMEAVNNNFNEASTLLKTMVSSDNLEVSNEMSTLGLHSSNDLSGVRGKSPGWVEFNLEHHNRGLQDETV
        GKK+VEET +VLSFWKLKELH WADISLIMDIMEAVNN+FNEASTLL TMVSSDNLE++NEMS LGLHSSNDLS + GKSPGW EFNL+ HNRGLQ E  
Subjt:  GKKVVEETAEVLSFWKLKELHSWADISLIMDIMEAVNNNFNEASTLLKTMVSSDNLEVSNEMSTLGLHSSNDLSGVRGKSPGWVEFNLEHHNRGLQDETV

Query:  PEPFPPMLTGHSSLPPCENMHGVYGCSGKSFSSVPLASADSLTSPENYDAKKTIPDDSSIQSGKKVVEGSTDVVSFWKLKELHSWADFSLIVDIMEAVNN
        PE FPPMLT H SLPP EN+HGVYG  G+SF+S PL SADSLTSP NY AK TIPDDS IQSGKKVVE +TDV++FWKLKE+HSWADFSLIVDIM+AVNN
Subjt:  PEPFPPMLTGHSSLPPCENMHGVYGCSGKSFSSVPLASADSLTSPENYDAKKTIPDDSSIQSGKKVVEGSTDVVSFWKLKELHSWADFSLIVDIMEAVNN

Query:  NFDEASTLLNTMVSRDNFEISNEMSTLGLHSANDLLCNGKIDLSISLERMVNTPILSSTLKDVQGVHQNNNACEEDYTKLFENNYFERNFFHNVGNTKIA
        NFDEASTLL TMVS DNFEI+NE+STLGLHSANDLLCNG  D+SIS ER +N PILS TLK  QG+HQN+N   ED TKLF N+YFERNFF N GN+KIA
Subjt:  NFDEASTLLNTMVSRDNFEISNEMSTLGLHSANDLLCNGKIDLSISLERMVNTPILSSTLKDVQGVHQNNNACEEDYTKLFENNYFERNFFHNVGNTKIA

Query:  LGCSKFVPIEPEWEEDDVYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEQWLAAKMLNDKAANEILRSRNSKNGLWKLDLHGLHAAEA
        LGCSK VPIEPEWEEDD+YLSHRKDAIAMMRSASQHSRAATNAY RKDHASAKYHSSRAQEQWLAAKMLNDKAANEIL++RNSKNGLWKLDLHGLHAAEA
Subjt:  LGCSKFVPIEPEWEEDDVYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEQWLAAKMLNDKAANEILRSRNSKNGLWKLDLHGLHAAEA

Query:  VQALQEHLLKIETRNASNRSLSPKKAERKGFQRASSLEYLSCMDSKLDKESPSSRHRPTSLEVITGIGKHSKGEAALPKAVTSFLSENGYRFEQLRPGTI
        VQALQ+HLLKIET+NASNRSLSPKKAERKGFQRASSLEYLSCMD+KLDKESPSSRHRPTSLEVITGIGKHSKGEAALPKAVTSFL+ENGYRFEQ RPGTI
Subjt:  VQALQEHLLKIETRNASNRSLSPKKAERKGFQRASSLEYLSCMDSKLDKESPSSRHRPTSLEVITGIGKHSKGEAALPKAVTSFLSENGYRFEQLRPGTI

Query:  SVRPKFRR
        SVRPKFRR
Subjt:  SVRPKFRR

XP_038898473.1 uncharacterized protein LOC120086100 [Benincasa hispida] | 8.1e-303 | 87.5
Query:  MSWVRGKSPGWAAFNLKEQNNDLRDEVDPDPFPPMSTTLSSLPPRENLHRVNGRSGRSFSFAPLPSADSLTSPEKFGAKKTTLENFGAKKTILGASNIQN
        MSWV+GKSPGWAAFNLK+QNN L+DEVD DPFPP+STTLSSLPP EN H VNGRSGRSFSFAP PSA+SLTSPEKF AKKTTLEN GAKKTIL  SN+QN
Subjt:  MSWVRGKSPGWAAFNLKEQNNDLRDEVDPDPFPPMSTTLSSLPPRENLHRVNGRSGRSFSFAPLPSADSLTSPEKFGAKKTTLENFGAKKTILGASNIQN

Query:  GKKVVEETAEVLSFWKLKELHSWADISLIMDIMEAVNNNFNEASTLLKTMVSSDNLEVSNEMSTLGLHSSNDLSGVRGKSPGWVEFNLEHHNRGLQDETV
        GKKVVEETA+VLSFWKLKELHSWADISLIMD+MEAVNNNF+EASTLLKTMV+SDN E++NEMSTLGL  SNDLS V G  PGW EFNL+ HNRGLQDET 
Subjt:  GKKVVEETAEVLSFWKLKELHSWADISLIMDIMEAVNNNFNEASTLLKTMVSSDNLEVSNEMSTLGLHSSNDLSGVRGKSPGWVEFNLEHHNRGLQDETV

Query:  PEPFPPMLTGHSSLPPCENMHGVYGCSGKSFSSVPLASADSLTSPENYDAKKTIPDDSSIQSGKKVVEGSTDVVSFWKLKELHSWADFSLIVDIMEAVNN
         EP PPMLTGHSSLPPCE++H VYGCSGKSFSSVP ASADSLTSPENY AKKTIPDDSSIQSGKKVVE S D ++FWKLKELHSWADFSLIVDIMEAVNN
Subjt:  PEPFPPMLTGHSSLPPCENMHGVYGCSGKSFSSVPLASADSLTSPENYDAKKTIPDDSSIQSGKKVVEGSTDVVSFWKLKELHSWADFSLIVDIMEAVNN

Query:  NFDEASTLLNTMVSRDNFEISNEMSTLGLHSANDLLCNGKIDLSISLERMVNTPILSSTLKDVQGVHQNNNACEEDYTKLFENNYFERNFFHNVGNTKIA
        NF+EASTLL TMVS DNF+I++EMSTL L SANDLLCNGK D+S SLER  N PI SSTLKDVQGVHQNNNACEE+YTKLFENNYFERNFFHN G  KI 
Subjt:  NFDEASTLLNTMVSRDNFEISNEMSTLGLHSANDLLCNGKIDLSISLERMVNTPILSSTLKDVQGVHQNNNACEEDYTKLFENNYFERNFFHNVGNTKIA

Query:  LGCSKFVPIEPEWEEDDVYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEQWLAAKMLNDKAANEILRSRNSKNGLWKLDLHGLHAAEA
        LG SK VPIEPEWEEDD+YLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEQWLAAKMLNDKAANEIL++RNSKNGLWKLDLHGLHAAEA
Subjt:  LGCSKFVPIEPEWEEDDVYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEQWLAAKMLNDKAANEILRSRNSKNGLWKLDLHGLHAAEA

Query:  VQALQEHLLKIETRNASNRSLSPKKAERKGFQRASSLEYLSCMDSKLDKESPSSRHRPTSLEVITGIGKHSKGEAALPKAVTSFLSENGYRFEQLRPGTI
        VQALQEHLLKIETRNASNRSLSPKK+ERKGFQ ASSLEYLSCMDSK+DKESPSSRHRPTSLEVITGIGKHS+GEA LPKAVTSFLSENGYRFEQLRPGTI
Subjt:  VQALQEHLLKIETRNASNRSLSPKKAERKGFQRASSLEYLSCMDSKLDKESPSSRHRPTSLEVITGIGKHSKGEAALPKAVTSFLSENGYRFEQLRPGTI

Query:  SVRPKFRR
        S+RPKFRR
Subjt:  SVRPKFRR

TrEMBL top hits | e value | % identity | Alignment
A0A0A0KA90 Smr domain-containing protein | 8.7e-295 | 84.7
Query:  MSWVRGKSPGWAAFNLKEQNNDLRDEVDPDPFPPMSTTLSSLPPRENLHRVNGRSGRSFSFAPLPSADSLTSPEKFGAKKTTLENFGAKKTILGASNIQN
        MSWVRGKS GWAAFNLK+QNN L+DEVD DPFPPMSTTLSSLPPRENL  VNG SG+SFS AP+PSADS T P KFGAKKTTL NFGAKKTILG +NIQ+
Subjt:  MSWVRGKSPGWAAFNLKEQNNDLRDEVDPDPFPPMSTTLSSLPPRENLHRVNGRSGRSFSFAPLPSADSLTSPEKFGAKKTTLENFGAKKTILGASNIQN

Query:  GKKVVEETAEVLSFWKLKELHSWADISLIMDIMEAVNNNFNEASTLLKTMVSSDNLEVSNEMSTLGLHSSNDLSGVRGKSPGWVEFNLEHHNRGLQDETV
        GKK+VEET +VLSFWKLKELH WADISLIMDIMEAVNN+FNEASTLL TMVSSDNLE++N+MSTLGLHSSNDL  + GKSPGW EFNL+ HN+GLQDE  
Subjt:  GKKVVEETAEVLSFWKLKELHSWADISLIMDIMEAVNNNFNEASTLLKTMVSSDNLEVSNEMSTLGLHSSNDLSGVRGKSPGWVEFNLEHHNRGLQDETV

Query:  PEPFPPMLTGHSSLPPCENMHGVYGCSGKSFSSVPLASADSLTSPENYDAKKTIPDDSSIQSGKKVVEGSTDVVSFWKLKELHSWADFSLIVDIMEAVNN
         E FPPMLT  SSLPP EN+HGVYG SG+SF+S PL S DSLTSPENY AK TI DDSSIQSGKKVVE +TDV++FWKLKE+HSWADFSLIVDIM+AVNN
Subjt:  PEPFPPMLTGHSSLPPCENMHGVYGCSGKSFSSVPLASADSLTSPENYDAKKTIPDDSSIQSGKKVVEGSTDVVSFWKLKELHSWADFSLIVDIMEAVNN

Query:  NFDEASTLLNTMVSRDNFEISNEMSTLGLHSANDLLCNGKIDLSISLERMVNTPILSSTLKDVQGVHQNNNACEEDYTKLFENNYFERNFFHNVGNTKIA
        NFDEASTLL TMVS DNFEI+NE+STLGLHSANDLLCNG  D+SI+ ERM+N PILSST+K VQG+HQNNN   EDYTKLF N+YFERN FHN GN+KIA
Subjt:  NFDEASTLLNTMVSRDNFEISNEMSTLGLHSANDLLCNGKIDLSISLERMVNTPILSSTLKDVQGVHQNNNACEEDYTKLFENNYFERNFFHNVGNTKIA

Query:  LGCSKFVPIEPEWEEDDVYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEQWLAAKMLNDKAANEILRSRNSKNGLWKLDLHGLHAAEA
        LGCSK VPIEPEWEEDD+YLSHRKDAIAMMRSASQHSRAATNAY RKDHASAKYHSSRA+EQWLAAKMLNDKAANEIL++RNSKNGLWKLDLHGLHAAEA
Subjt:  LGCSKFVPIEPEWEEDDVYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEQWLAAKMLNDKAANEILRSRNSKNGLWKLDLHGLHAAEA

Query:  VQALQEHLLKIETRNASNRSLSPKKAERKGFQRASSLEYLSCMDSKLDKESPSSRHRPTSLEVITGIGKHSKGEAALPKAVTSFLSENGYRFEQLRPGTI
        VQAL +HLLKIET+NASNRSLSPKKAERKGFQRASSLEYLSCM+SKLDKESPSSRHRPTSLEVITGIGKHSKGEAALPKAV SFL+ENGYRFEQ RPGTI
Subjt:  VQALQEHLLKIETRNASNRSLSPKKAERKGFQRASSLEYLSCMDSKLDKESPSSRHRPTSLEVITGIGKHSKGEAALPKAVTSFLSENGYRFEQLRPGTI

Query:  SVRPKFRR
        SVRPKFRR
Subjt:  SVRPKFRR

A0A1S3BRS7 uncharacterized protein LOC103492590 | 4.3e-294 | 85.2
Query:  MSWVRGKSPGWAAFNLKEQNNDLRDEVDPDPFPPMSTTLSSLPPRENLHRVNGRSGRSFSFAPLPSADSLTSPEKFGAKKTTLENFGAKKTILGASNIQN
        MSWVRGKS GWAAFNLK+QNN ++DEVD DPFPPMSTTLSSLPPRENL  VNGRSGRSFSFAP+PSADS T P K GAKKTTL NF AKKTILGASNIQ+
Subjt:  MSWVRGKSPGWAAFNLKEQNNDLRDEVDPDPFPPMSTTLSSLPPRENLHRVNGRSGRSFSFAPLPSADSLTSPEKFGAKKTTLENFGAKKTILGASNIQN

Query:  GKKVVEETAEVLSFWKLKELHSWADISLIMDIMEAVNNNFNEASTLLKTMVSSDNLEVSNEMSTLGLHSSNDLSGVRGKSPGWVEFNLEHHNRGLQDETV
        GKK+VEET +VLSFWKLKELH WADISLIMDIMEAVNN+FNEASTLL TMVSSDNLE++NEMS LGLHSSNDLS + GKSPGW EFNL+ HNRGLQ E  
Subjt:  GKKVVEETAEVLSFWKLKELHSWADISLIMDIMEAVNNNFNEASTLLKTMVSSDNLEVSNEMSTLGLHSSNDLSGVRGKSPGWVEFNLEHHNRGLQDETV

Query:  PEPFPPMLTGHSSLPPCENMHGVYGCSGKSFSSVPLASADSLTSPENYDAKKTIPDDSSIQSGKKVVEGSTDVVSFWKLKELHSWADFSLIVDIMEAVNN
        PE FPPMLT H SLPP EN+HGVYG  G+SF+S PL SADSLTSP NY AK TIPDDS IQSGKKVVE +TDV++FWKLKE+HSWADFSLIVDIM+AVNN
Subjt:  PEPFPPMLTGHSSLPPCENMHGVYGCSGKSFSSVPLASADSLTSPENYDAKKTIPDDSSIQSGKKVVEGSTDVVSFWKLKELHSWADFSLIVDIMEAVNN

Query:  NFDEASTLLNTMVSRDNFEISNEMSTLGLHSANDLLCNGKIDLSISLERMVNTPILSSTLKDVQGVHQNNNACEEDYTKLFENNYFERNFFHNVGNTKIA
        NFDEASTLL TMVS DNFEI+NE+STLGLHSANDLLCNG  D+SIS ER +N PILS TLK  QG+HQN+N   ED TKLF N+YFERNFF N GN+KIA
Subjt:  NFDEASTLLNTMVSRDNFEISNEMSTLGLHSANDLLCNGKIDLSISLERMVNTPILSSTLKDVQGVHQNNNACEEDYTKLFENNYFERNFFHNVGNTKIA

Query:  LGCSKFVPIEPEWEEDDVYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEQWLAAKMLNDKAANEILRSRNSKNGLWKLDLHGLHAAEA
        LGCSK VPIEPEWEEDD+YLSHRKDAIAMMRSASQHSRAATNAY RKDHASAKYHSSRAQEQWLAAKMLNDKAANEIL++RNSKNGLWKLDLHGLHAAEA
Subjt:  LGCSKFVPIEPEWEEDDVYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEQWLAAKMLNDKAANEILRSRNSKNGLWKLDLHGLHAAEA

Query:  VQALQEHLLKIETRNASNRSLSPKKAERKGFQRASSLEYLSCMDSKLDKESPSSRHRPTSLEVITGIGKHSKGEAALPKAVTSFLSENGYRFEQLRPGTI
        VQALQ+HLLKIET+NASNRSLSPKKAERKGFQRASSLEYLSCMD+KLDKESPSSRHRPTSLEVITGIGKHSKGEAALPKAVTSFL+ENGYRFEQ RPGTI
Subjt:  VQALQEHLLKIETRNASNRSLSPKKAERKGFQRASSLEYLSCMDSKLDKESPSSRHRPTSLEVITGIGKHSKGEAALPKAVTSFLSENGYRFEQLRPGTI

Query:  SVRPKFRR
        SVRPKFRR
Subjt:  SVRPKFRR

A0A5D3CAF0 Smr (Small MutS Related) domain-containing protein, putative isoform 1 | 7.4e-294 | 85.36
Query:  MSWVRGKSPGWAAFNLKEQNNDLRDEVDPDPFPPMSTTLSSLPPRENLHRVNGRSGRSFSFAPLPSADSLTSPEKFGAKKTTLENFGAKKTILGASNIQN
        MSWVRGKS GWAAFNLK+QNN ++DEVD DPFPPMSTTLSSLPPRENL  VNGRSGRSFSFAP+PSADS T P K GAKKTTL NF AKKTILGASNIQ+
Subjt:  MSWVRGKSPGWAAFNLKEQNNDLRDEVDPDPFPPMSTTLSSLPPRENLHRVNGRSGRSFSFAPLPSADSLTSPEKFGAKKTTLENFGAKKTILGASNIQN

Query:  GKKVVEETAEVLSFWKLKELHSWADISLIMDIMEAVNNNFNEASTLLKTMVSSDNLEVSNEMSTLGLHSSNDLSGVRGKSPGWVEFNLEHHNRGLQDETV
        GKK+VEET +VLSFWKLKELH WADISLIMDIMEAVNN+FNEASTLL TMVSSDNLE++NEMS LGLHSSNDLS + GKSPGW EFNL+ HNRGLQ E  
Subjt:  GKKVVEETAEVLSFWKLKELHSWADISLIMDIMEAVNNNFNEASTLLKTMVSSDNLEVSNEMSTLGLHSSNDLSGVRGKSPGWVEFNLEHHNRGLQDETV

Query:  PEPFPPMLTGHSSLPPCENMHGVYGCSGKSFSSVPLASADSLTSPENYDAKKTIPDDSSIQSGKKVVEGSTDVVSFWKLKELHSWADFSLIVDIMEAVNN
        PE FPPMLT H SLPP EN+HGVYG  G+SF+S PL SADSLTSP NY AK TIPDDS IQSGKKVVE +TDV++FWKLKE+HSWADFSLIVDIM+AVNN
Subjt:  PEPFPPMLTGHSSLPPCENMHGVYGCSGKSFSSVPLASADSLTSPENYDAKKTIPDDSSIQSGKKVVEGSTDVVSFWKLKELHSWADFSLIVDIMEAVNN

Query:  NFDEASTLLNTMVSRDNFEISNEMSTLGLHSANDLLCNGKIDLSISLERMVNTPILSSTLKDVQGVHQNNNACEEDYTKLFENNYFERNFFHNVGNTKIA
        NFDEASTLL TMVS DNFEI+NE+STLGLH ANDLLCNG  D+SIS ER +N PILS TLK  QG+HQN+N   ED TKLF N+YFERNFF N GN+KIA
Subjt:  NFDEASTLLNTMVSRDNFEISNEMSTLGLHSANDLLCNGKIDLSISLERMVNTPILSSTLKDVQGVHQNNNACEEDYTKLFENNYFERNFFHNVGNTKIA

Query:  LGCSKFVPIEPEWEEDDVYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEQWLAAKMLNDKAANEILRSRNSKNGLWKLDLHGLHAAEA
        LGCSK VPIEPEWEEDDVYLSHRKDAIAMMRSASQHSRAATNAY RKDHASAKYHSSRAQEQWLAAKMLNDKAANEIL++RNSKNGLWKLDLHGLHAAEA
Subjt:  LGCSKFVPIEPEWEEDDVYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEQWLAAKMLNDKAANEILRSRNSKNGLWKLDLHGLHAAEA

Query:  VQALQEHLLKIETRNASNRSLSPKKAERKGFQRASSLEYLSCMDSKLDKESPSSRHRPTSLEVITGIGKHSKGEAALPKAVTSFLSENGYRFEQLRPGTI
        VQALQ+HLLKIET+NASNRSLSPKKAERKGFQRASSLEYLSCMDSKLDKESPSSRHRPTSLEVITGIGKHSKGEAALPKAVTSFL+ENGYRFEQ RPGTI
Subjt:  VQALQEHLLKIETRNASNRSLSPKKAERKGFQRASSLEYLSCMDSKLDKESPSSRHRPTSLEVITGIGKHSKGEAALPKAVTSFLSENGYRFEQLRPGTI

Query:  SVRPKFRR
        SVRPKFRR
Subjt:  SVRPKFRR

A0A6J1GN51 uncharacterized protein LOC111455928 isoform X1 | 7.0e-268 | 80
Query:  MSWVRGKSPGWAAFNLKEQNNDLRDEVDPDPFPPMSTTLSSLPPRENLHRVNGRSGRSFSFAPLPSADSLTSPEKFGAKKTTLENFGAKKTILGASNIQN
        MSW RGKSPGWAA NLK+QN+ L+DE+DPDPFPPMST LS LPPREN+HRVNGRSGRSFS  PLPSADSL SP          ENFG KKTI G S+I++
Subjt:  MSWVRGKSPGWAAFNLKEQNNDLRDEVDPDPFPPMSTTLSSLPPRENLHRVNGRSGRSFSFAPLPSADSLTSPEKFGAKKTTLENFGAKKTILGASNIQN

Query:  GKKVVEETAEVLSFWKLKELHSWADISLIMDIMEAVNNNFNEASTLLKTMVSSDNLEVSNEMSTLGLHSSNDLSGVRGKSPGWVEFNLEHHNRGLQDETV
        GKK+VEE+ +VL+FWKLKELHSWADISLI+DIMEAVNNNFNEAS LLKTMVSSDN E++NEMSTLGLHSSND+S VRGKSPGW EFNL+  NRGLQD   
Subjt:  GKKVVEETAEVLSFWKLKELHSWADISLIMDIMEAVNNNFNEASTLLKTMVSSDNLEVSNEMSTLGLHSSNDLSGVRGKSPGWVEFNLEHHNRGLQDETV

Query:  PEPFPPMLTGHSSLPPCENMHGVYGCSGKSFSSVPLASADSLTSPENYDAKKTIPDDSSIQSGKKVVEGSTDVVSFWKLKELHSWADFSLIVDIMEAVNN
        P+PFPPM +  SSLPP EN+HGV G  G+S SS PL SADSLT PENY AKK I  DSSIQ+G+KVVE +TDV++FWKLKELH+WADFSLIVDIMEAV+N
Subjt:  PEPFPPMLTGHSSLPPCENMHGVYGCSGKSFSSVPLASADSLTSPENYDAKKTIPDDSSIQSGKKVVEGSTDVVSFWKLKELHSWADFSLIVDIMEAVNN

Query:  NFDEASTLLNTMVSRDNFEISNEMSTLGLHSANDLLCNGKIDLSISLERMVNTPILSSTLKDVQGVHQNNNACEEDYTKLFENNYFERNFFHNVGNTKIA
        NF+EAST LN MVS DN EI NEMSTLGLHSA+ L CNGK D++ISL R VN PI SSTLKDVQ +HQN N       KLFENNY ERNFFHNVGN KIA
Subjt:  NFDEASTLLNTMVSRDNFEISNEMSTLGLHSANDLLCNGKIDLSISLERMVNTPILSSTLKDVQGVHQNNNACEEDYTKLFENNYFERNFFHNVGNTKIA

Query:  LGCSKFVPIEPEWEEDDVYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEQWLAAKMLNDKAANEILRSRNSKNGLWKLDLHGLHAAEA
        L CSK  PIEPEWEEDD+YLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEQWLAAKMLN KAANEIL++RNS+NGLWKLDLHGLHAAEA
Subjt:  LGCSKFVPIEPEWEEDDVYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEQWLAAKMLNDKAANEILRSRNSKNGLWKLDLHGLHAAEA

Query:  VQALQEHLLKIETRNASNRSLSPKKAERKGFQRASSLEYLSCMDSKLDKE--SPSSRHRPTSLEVITGIGKHSKGEAALPKAVTSFLSENGYRFEQLRPG
        VQALQ+HLLKIETRNASNRSLSPKKAERKGF R SSLEYLSCM  KLDKE  SP  RHRPTSLEVITG+GKHS+GEAALPKAVTSFLSENGYRFEQLRPG
Subjt:  VQALQEHLLKIETRNASNRSLSPKKAERKGFQRASSLEYLSCMDSKLDKE--SPSSRHRPTSLEVITGIGKHSKGEAALPKAVTSFLSENGYRFEQLRPG

Query:  TISVRPKFRR
        TISVRPKFRR
Subjt:  TISVRPKFRR

A0A6J1GPE7 uncharacterized protein LOC111455928 isoform X2 | 1.2e-259 | 78.36
Query:  MSWVRGKSPGWAAFNLKEQNNDLRDEVDPDPFPPMSTTLSSLPPRENLHRVNGRSGRSFSFAPLPSADSLTSPEKFGAKKTTLENFGAKKTILGASNIQN
        MSW RGKSPGWAA NLK+QN+ L+DE+DPDPFPPMST LS LPPREN+HRVNGRSGRS             SP          ENFG KKTI G S+I++
Subjt:  MSWVRGKSPGWAAFNLKEQNNDLRDEVDPDPFPPMSTTLSSLPPRENLHRVNGRSGRSFSFAPLPSADSLTSPEKFGAKKTTLENFGAKKTILGASNIQN

Query:  GKKVVEETAEVLSFWKLKELHSWADISLIMDIMEAVNNNFNEASTLLKTMVSSDNLEVSNEMSTLGLHSSNDLSGVRGKSPGWVEFNLEHHNRGLQDETV
        GKK+VEE+ +VL+FWKLKELHSWADISLI+DIMEAVNNNFNEAS LLKTMVSSDN E++NEMSTLGLHSSND+S VRGKSPGW EFNL+  NRGLQD   
Subjt:  GKKVVEETAEVLSFWKLKELHSWADISLIMDIMEAVNNNFNEASTLLKTMVSSDNLEVSNEMSTLGLHSSNDLSGVRGKSPGWVEFNLEHHNRGLQDETV

Query:  PEPFPPMLTGHSSLPPCENMHGVYGCSGKSFSSVPLASADSLTSPENYDAKKTIPDDSSIQSGKKVVEGSTDVVSFWKLKELHSWADFSLIVDIMEAVNN
        P+PFPPM +  SSLPP EN+HGV G  G+S SS PL SADSLT PENY AKK I  DSSIQ+G+KVVE +TDV++FWKLKELH+WADFSLIVDIMEAV+N
Subjt:  PEPFPPMLTGHSSLPPCENMHGVYGCSGKSFSSVPLASADSLTSPENYDAKKTIPDDSSIQSGKKVVEGSTDVVSFWKLKELHSWADFSLIVDIMEAVNN

Query:  NFDEASTLLNTMVSRDNFEISNEMSTLGLHSANDLLCNGKIDLSISLERMVNTPILSSTLKDVQGVHQNNNACEEDYTKLFENNYFERNFFHNVGNTKIA
        NF+EAST LN MVS DN EI NEMSTLGLHSA+ L CNGK D++ISL R VN PI SSTLKDVQ +HQN N       KLFENNY ERNFFHNVGN KIA
Subjt:  NFDEASTLLNTMVSRDNFEISNEMSTLGLHSANDLLCNGKIDLSISLERMVNTPILSSTLKDVQGVHQNNNACEEDYTKLFENNYFERNFFHNVGNTKIA

Query:  LGCSKFVPIEPEWEEDDVYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEQWLAAKMLNDKAANEILRSRNSKNGLWKLDLHGLHAAEA
        L CSK  PIEPEWEEDD+YLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEQWLAAKMLN KAANEIL++RNS+NGLWKLDLHGLHAAEA
Subjt:  LGCSKFVPIEPEWEEDDVYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEQWLAAKMLNDKAANEILRSRNSKNGLWKLDLHGLHAAEA

Query:  VQALQEHLLKIETRNASNRSLSPKKAERKGFQRASSLEYLSCMDSKLDKE--SPSSRHRPTSLEVITGIGKHSKGEAALPKAVTSFLSENGYRFEQLRPG
        VQALQ+HLLKIETRNASNRSLSPKKAERKGF R SSLEYLSCM  KLDKE  SP  RHRPTSLEVITG+GKHS+GEAALPKAVTSFLSENGYRFEQLRPG
Subjt:  VQALQEHLLKIETRNASNRSLSPKKAERKGFQRASSLEYLSCMDSKLDKE--SPSSRHRPTSLEVITGIGKHSKGEAALPKAVTSFLSENGYRFEQLRPG

Query:  TISVRPKFRR
        TISVRPKFRR
Subjt:  TISVRPKFRR

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
AT5G23520.1 smr (Small MutS Related) domain-containing protein | 2.7e-70 | 40
Query:  LSGVRGKSPGWVEFNL-EHHNRGLQDETVPEPFPPMLTG-HSSLPPCENMHGVYGCSGKSFSSVPLASADSLTSPENYDAK--------KTIPDDSSIQS
        +S ++GKS GW  F+L +   +GL+ E   +PFPP+ T  ++S      +   +  S KSFSSV L  +      EN D          +  PD  S+  
Subjt:  LSGVRGKSPGWVEFNL-EHHNRGLQDETVPEPFPPMLTG-HSSLPPCENMHGVYGCSGKSFSSVPLASADSLTSPENYDAK--------KTIPDDSSIQS

Query:  GKKVVEGSTDVVSFWKLKELHSWADFSLIVDIMEAVNNNFDEASTLLNTMVSRDNFEISNEMSTLGLHSANDLLCNGKIDLSISLERMVNTPI---LSST
               ++  ++F KLKE++SWAD +LI D++ +  ++F+ A   L  MVS    +        G  S N      +     + E+ V + +     ST
Subjt:  GKKVVEGSTDVVSFWKLKELHSWADFSLIVDIMEAVNNNFDEASTLLNTMVSRDNFEISNEMSTLGLHSANDLLCNGKIDLSISLERMVNTPI---LSST

Query:  LKDVQGVHQNNNACEEDYTKLFENNYFERNFFHNVGNTKIALGCSKFVPIEPEWEEDDVYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRA
         +D  G +   N+   D +    N      F  ++      +   + +PIEPEWEEDD+YLSHRKDA+ +MRSAS HSRAA NA+ R DHASAK HS +A
Subjt:  LKDVQGVHQNNNACEEDYTKLFENNYFERNFFHNVGNTKIALGCSKFVPIEPEWEEDDVYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRA

Query:  QEQWLAAKMLNDKAANEILRSRNSKNGLWKLDLHGLHAAEAVQALQEHLLKIETRNASNRSLSPKKAERKGFQ-RASSLEYLSCMDSK-LDKESPSSRHR
        +E WLAA+ LN +AA +I+   N  N +WKLDLHGLHA EAVQALQE L  IE     NRS+SP +   K    R++S E    +D + +  +  SSR  
Subjt:  QEQWLAAKMLNDKAANEILRSRNSKNGLWKLDLHGLHAAEAVQALQEHLLKIETRNASNRSLSPKKAERKGFQ-RASSLEYLSCMDSK-LDKESPSSRHR

Query:  PTSLEVITGIGKHSKGEAALPKAVTSFLSENGYRFEQLRPGTISVRPKFR
          SL+VITGIGKHS+G+A+LP AV +F  +N YRF++ RPG I+VRPKFR
Subjt:  PTSLEVITGIGKHSKGEAALPKAVTSFLSENGYRFEQLRPGTISVRPKFR


Sequences
CDS sequence
ATGTCGTGGGTGAGGGGTAAATCTCCTGGCTGGGCAGCTTTTAACCTTAAGGAACAGAATAATGACCTTCGAGACGAAGTTGACCCGGATCCATTCCCACCAATGTCAAC
CACCCTCTCCTCTCTGCCACCCCGTGAAAACTTACACAGAGTTAATGGTCGTTCAGGGAGATCTTTCTCATTTGCTCCCCTTCCTTCTGCTGATTCTCTGACTTCACCAG
AAAAATTTGGTGCAAAAAAGACAACACTGGAAAATTTTGGTGCAAAAAAAACAATACTCGGTGCTTCTAACATTCAAAATGGCAAGAAGGTGGTTGAAGAAACCGCTGAA
GTTTTATCCTTTTGGAAGCTTAAAGAACTCCATTCCTGGGCTGATATTAGCTTGATTATGGATATAATGGAAGCTGTAAATAATAACTTCAATGAGGCATCTACTTTATT
AAAAACTATGGTTTCTAGCGACAATCTTGAGGTCAGTAATGAGATGAGCACCTTAGGGTTGCATTCCTCTAATGATCTATCGGGGGTGAGGGGTAAATCTCCTGGGTGGG
TCGAATTTAACCTTGAGCATCATAACAGAGGTCTTCAAGATGAAACTGTCCCGGAACCATTCCCACCAATGTTAACTGGCCATTCCTCTCTGCCACCCTGTGAAAACATG
CATGGAGTTTATGGTTGTTCAGGGAAATCCTTCTCATCTGTACCCCTTGCTTCTGCCGATTCTCTAACTTCTCCAGAAAATTATGATGCAAAGAAGACAATACCTGATGA
TTCTAGCATTCAAAGTGGCAAGAAGGTGGTTGAAGGAAGCACTGATGTTGTATCCTTTTGGAAGCTTAAAGAGCTTCATTCTTGGGCTGATTTTAGCTTGATTGTGGATA
TAATGGAAGCTGTAAATAATAACTTCGATGAGGCATCTACTTTACTAAATACCATGGTTTCAAGAGACAATTTTGAGATCAGTAATGAGATGAGCACCTTAGGACTGCAT
TCCGCAAATGATTTATTGTGCAATGGGAAGATTGATTTAAGTATATCATTAGAAAGAATGGTCAATACTCCCATCCTTAGTTCCACACTAAAGGATGTGCAAGGCGTGCA
TCAAAATAATAATGCATGTGAAGAAGATTATACCAAATTGTTTGAAAATAATTATTTTGAAAGAAATTTCTTTCATAATGTTGGAAATACAAAAATAGCTCTAGGTTGCT
CGAAGTTCGTTCCTATTGAGCCGGAGTGGGAAGAAGATGATGTTTACCTGAGCCATCGAAAAGATGCTATAGCAATGATGAGGTCTGCATCTCAACATTCAAGGGCAGCC
ACTAATGCCTATCTTAGGAAAGATCATGCTTCTGCCAAGTATCATTCATCAAGAGCTCAAGAACAATGGCTAGCTGCAAAAATGTTAAATGATAAGGCAGCTAATGAAAT
ATTACGATCAAGGAATAGTAAAAATGGGCTTTGGAAGTTGGACTTACATGGGCTTCACGCAGCAGAAGCTGTTCAAGCCTTGCAAGAACACTTACTGAAAATTGAAACTC
GGAACGCCTCCAATCGGTCGTTGTCGCCAAAGAAAGCTGAAAGGAAAGGATTTCAACGTGCTTCATCCCTCGAGTATCTTAGTTGTATGGACTCAAAGTTGGACAAAGAA
TCACCATCATCTAGGCATAGGCCGACATCATTGGAAGTCATAACAGGTATAGGTAAACATAGCAAGGGGGAAGCTGCTTTACCAAAGGCTGTGACAAGTTTTCTTAGTGA
AAATGGGTACCGTTTCGAACAGTTAAGGCCTGGGACGATCAGCGTCCGACCAAAGTTTCGTAGGCTGGCTGTAGAGGAGGGAAGGATTAACAATGGTGGTTTGATTGGTT
TACAAACATTACTGTATTGTTATGATGTTCTCAGTTCTGCATTTTCCAAGAAAGAGAACATCACATAA
mRNA sequence
ATGTCGTGGGTGAGGGGTAAATCTCCTGGCTGGGCAGCTTTTAACCTTAAGGAACAGAATAATGACCTTCGAGACGAAGTTGACCCGGATCCATTCCCACCAATGTCAAC
CACCCTCTCCTCTCTGCCACCCCGTGAAAACTTACACAGAGTTAATGGTCGTTCAGGGAGATCTTTCTCATTTGCTCCCCTTCCTTCTGCTGATTCTCTGACTTCACCAG
AAAAATTTGGTGCAAAAAAGACAACACTGGAAAATTTTGGTGCAAAAAAAACAATACTCGGTGCTTCTAACATTCAAAATGGCAAGAAGGTGGTTGAAGAAACCGCTGAA
GTTTTATCCTTTTGGAAGCTTAAAGAACTCCATTCCTGGGCTGATATTAGCTTGATTATGGATATAATGGAAGCTGTAAATAATAACTTCAATGAGGCATCTACTTTATT
AAAAACTATGGTTTCTAGCGACAATCTTGAGGTCAGTAATGAGATGAGCACCTTAGGGTTGCATTCCTCTAATGATCTATCGGGGGTGAGGGGTAAATCTCCTGGGTGGG
TCGAATTTAACCTTGAGCATCATAACAGAGGTCTTCAAGATGAAACTGTCCCGGAACCATTCCCACCAATGTTAACTGGCCATTCCTCTCTGCCACCCTGTGAAAACATG
CATGGAGTTTATGGTTGTTCAGGGAAATCCTTCTCATCTGTACCCCTTGCTTCTGCCGATTCTCTAACTTCTCCAGAAAATTATGATGCAAAGAAGACAATACCTGATGA
TTCTAGCATTCAAAGTGGCAAGAAGGTGGTTGAAGGAAGCACTGATGTTGTATCCTTTTGGAAGCTTAAAGAGCTTCATTCTTGGGCTGATTTTAGCTTGATTGTGGATA
TAATGGAAGCTGTAAATAATAACTTCGATGAGGCATCTACTTTACTAAATACCATGGTTTCAAGAGACAATTTTGAGATCAGTAATGAGATGAGCACCTTAGGACTGCAT
TCCGCAAATGATTTATTGTGCAATGGGAAGATTGATTTAAGTATATCATTAGAAAGAATGGTCAATACTCCCATCCTTAGTTCCACACTAAAGGATGTGCAAGGCGTGCA
TCAAAATAATAATGCATGTGAAGAAGATTATACCAAATTGTTTGAAAATAATTATTTTGAAAGAAATTTCTTTCATAATGTTGGAAATACAAAAATAGCTCTAGGTTGCT
CGAAGTTCGTTCCTATTGAGCCGGAGTGGGAAGAAGATGATGTTTACCTGAGCCATCGAAAAGATGCTATAGCAATGATGAGGTCTGCATCTCAACATTCAAGGGCAGCC
ACTAATGCCTATCTTAGGAAAGATCATGCTTCTGCCAAGTATCATTCATCAAGAGCTCAAGAACAATGGCTAGCTGCAAAAATGTTAAATGATAAGGCAGCTAATGAAAT
ATTACGATCAAGGAATAGTAAAAATGGGCTTTGGAAGTTGGACTTACATGGGCTTCACGCAGCAGAAGCTGTTCAAGCCTTGCAAGAACACTTACTGAAAATTGAAACTC
GGAACGCCTCCAATCGGTCGTTGTCGCCAAAGAAAGCTGAAAGGAAAGGATTTCAACGTGCTTCATCCCTCGAGTATCTTAGTTGTATGGACTCAAAGTTGGACAAAGAA
TCACCATCATCTAGGCATAGGCCGACATCATTGGAAGTCATAACAGGTATAGGTAAACATAGCAAGGGGGAAGCTGCTTTACCAAAGGCTGTGACAAGTTTTCTTAGTGA
AAATGGGTACCGTTTCGAACAGTTAAGGCCTGGGACGATCAGCGTCCGACCAAAGTTTCGTAGGCTGGCTGTAGAGGAGGGAAGGATTAACAATGGTGGTTTGATTGGTT
TACAAACATTACTGTATTGTTATGATGTTCTCAGTTCTGCATTTTCCAAGAAAGAGAACATCACATAA
Protein sequence
MSWVRGKSPGWAAFNLKEQNNDLRDEVDPDPFPPMSTTLSSLPPRENLHRVNGRSGRSFSFAPLPSADSLTSPEKFGAKKTTLENFGAKKTILGASNIQNGKKVVEETAE
VLSFWKLKELHSWADISLIMDIMEAVNNNFNEASTLLKTMVSSDNLEVSNEMSTLGLHSSNDLSGVRGKSPGWVEFNLEHHNRGLQDETVPEPFPPMLTGHSSLPPCENM
HGVYGCSGKSFSSVPLASADSLTSPENYDAKKTIPDDSSIQSGKKVVEGSTDVVSFWKLKELHSWADFSLIVDIMEAVNNNFDEASTLLNTMVSRDNFEISNEMSTLGLH
SANDLLCNGKIDLSISLERMVNTPILSSTLKDVQGVHQNNNACEEDYTKLFENNYFERNFFHNVGNTKIALGCSKFVPIEPEWEEDDVYLSHRKDAIAMMRSASQHSRAA
TNAYLRKDHASAKYHSSRAQEQWLAAKMLNDKAANEILRSRNSKNGLWKLDLHGLHAAEAVQALQEHLLKIETRNASNRSLSPKKAERKGFQRASSLEYLSCMDSKLDKE
SPSSRHRPTSLEVITGIGKHSKGEAALPKAVTSFLSENGYRFEQLRPGTISVRPKFRRLAVEEGRINNGGLIGLQTLLYCYDVLSSAFSKKENIT
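
The protein sequence can be cross-checked against the CDS by translating codon by codon. The sketch below assumes the standard genetic code (NCBI translation table 1), which the page does not state explicitly. The function name `translate` is illustrative. Only the first ten codons and the last four codons of the CDS listed above are used, to keep the example short.

```python
# Standard genetic code (NCBI translation table 1), built compactly:
# codons are enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS string; '*' marks a stop codon.

    Raises ValueError if the length is not a multiple of three and
    KeyError on any codon containing a character other than A, C, G, T.
    """
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


cds_start = "ATGTCGTGGGTGAGGGGTAAATCTCCTGGC"  # first 30 nt of the CDS above
cds_end = "AACATCACATAA"                      # last 12 nt of the CDS above

print(translate(cds_start))  # MSWVRGKSPG
print(translate(cds_end))    # NIT*
```

The two fragments translate to `MSWVRGKSPG` and `NIT*`, which agree with the start and end of the listed protein. The terminal `*` is the TAA stop codon, which the protein listing omits.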