; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmaCh17G013380 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene IDCmaCh17G013380
OrganismCucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
DescriptionSmr domain-containing protein
Genome locationCma_Chr17:8988334..8992506
RNA-Seq ExpressionCmaCh17G013380
SyntenyCmaCh17G013380
Gene Ontology termsNA
InterPro domainsIPR002625 - Smr domain
IPR013899 - Domain of unknown function DUF1771
IPR036063 - Smr domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6576051.1 hypothetical protein SDJN03_26690, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0093.76Show/hide
Query:  MSWGRGKSPGWAAVNLKQQNSGLQDEIDPDPFPPMSTAHSFLPPCENVHRVNGRSGRS-------------SQENFGAEKTILGNSSIRSSKKLVEESTD
        MSWGRGKSPGWAAVNLKQ NSGLQDEIDPDPFPPMSTA SFLPP ENVHRVNGRSGRS             S ENFGA+KTI GNSSI+SSKKLVEESTD
Subjt:  MSWGRGKSPGWAAVNLKQQNSGLQDEIDPDPFPPMSTAHSFLPPCENVHRVNGRSGRS-------------SQENFGAEKTILGNSSIRSSKKLVEESTD

Query:  VLAFWKLKELHSWADISLIVDIMEAVNNNFNEASKLLKTMVSSDNFEINNEMSTLGLHSSNDVSLVRGKSSGWEEFNLKQQNRGFQDRIDPKPFPPMPSS
        VLAFWKLKELHSWADISLIVDIMEAVNNNFNEASKLLKTMVSSDNFEINNEMSTLGLHSSNDVSLVRGKS GWEEFNLKQQNRG QDRIDPKPFPPMPS+
Subjt:  VLAFWKLKELHSWADISLIVDIMEAVNNNFNEASKLLKTMVSSDNFEINNEMSTLGLHSSNDVSLVRGKSSGWEEFNLKQQNRGFQDRIDPKPFPPMPSS

Query:  LSSLPPRENLHGVNGRPGGSSSSSPLPSADSLTLPENYGAKKILGDSTIQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDIMEAVDNNFNEASTFLNK
        LSSLPPRENLHGVNG PG SSSSSPLPSADSLT PENY AKKILGDS+IQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDIMEAVDNNFNEAST+LNK
Subjt:  LSSLPPRENLHGVNGRPGGSSSSSPLPSADSLTLPENYGAKKILGDSTIQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDIMEAVDNNFNEASTFLNK

Query:  MVSSDNVEICNEMSTLGLHSADGLSCNGKNDVTISSERTVNNPIPSSTLKDVQDMHQNINDKLFENNYHERNFFHNVGNPKIALYCSKSAPIEPEWEEDD
        MVSSDNVEICNEMSTLGLHSADGL CNGKNDVTIS  RTVNNPIPSSTLKDVQDMHQNINDKLFENNYHERNFFHNVGNPKIALYCSKSAPIEPEWEEDD
Subjt:  MVSSDNVEICNEMSTLGLHSADGLSCNGKNDVTISSERTVNNPIPSSTLKDVQDMHQNINDKLFENNYHERNFFHNVGNPKIALYCSKSAPIEPEWEEDD

Query:  IYLSHRKDAIAMMRSASQHSRVATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEILQTRNSENGLWKLDLHGLHAAEAVQALQDHLLKIETWNAS
        IYLSHRKDAIAMMRSASQHSR ATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEILQTRNSENGLWKLDLHGLHAAEAVQALQDHLLKIET NAS
Subjt:  IYLSHRKDAIAMMRSASQHSRVATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEILQTRNSENGLWKLDLHGLHAAEAVQALQDHLLKIETWNAS

Query:  NRSLSPKKAERKGFYRVSSLEYLSCLGVKLDKELQSPLPRHRPTSLEVITGIGKHSRGEAALPKAVTSFLSENGYRFEQLRPGTISVRPKFRR
        NRSLSPKKAERKGF+RVSSLEYLSC+GVKLDKELQSPLPRHRPTSLEVITG+GKHSRGEAALPKAVTSFLSENGYRFEQLRPGTISVRPKFRR
Subjt:  NRSLSPKKAERKGFYRVSSLEYLSCLGVKLDKELQSPLPRHRPTSLEVITGIGKHSRGEAALPKAVTSFLSENGYRFEQLRPGTISVRPKFRR

XP_022953352.1 uncharacterized protein LOC111455928 isoform X1 [Cucurbita moschata]0.0e+0094.1Show/hide
Query:  MSWGRGKSPGWAAVNLKQQNSGLQDEIDPDPFPPMSTAHSFLPPCENVHRVNGRSGRS-------------SQENFGAEKTILGNSSIRSSKKLVEESTD
        MSWGRGKSPGWAAVNLKQQNSGLQDEIDPDPFPPMSTA SFLPP ENVHRVNGRSGRS             S ENFG +KTI GNSSIRS KKLVEESTD
Subjt:  MSWGRGKSPGWAAVNLKQQNSGLQDEIDPDPFPPMSTAHSFLPPCENVHRVNGRSGRS-------------SQENFGAEKTILGNSSIRSSKKLVEESTD

Query:  VLAFWKLKELHSWADISLIVDIMEAVNNNFNEASKLLKTMVSSDNFEINNEMSTLGLHSSNDVSLVRGKSSGWEEFNLKQQNRGFQDRIDPKPFPPMPSS
        VLAFWKLKELHSWADISLIVDIMEAVNNNFNEASKLLKTMVSSDNFEINNEMSTLGLHSSNDVSLVRGKS GWEEFNLKQQNRG QDRIDPKPFPPMPS+
Subjt:  VLAFWKLKELHSWADISLIVDIMEAVNNNFNEASKLLKTMVSSDNFEINNEMSTLGLHSSNDVSLVRGKSSGWEEFNLKQQNRGFQDRIDPKPFPPMPSS

Query:  LSSLPPRENLHGVNGRPGGSSSSSPLPSADSLTLPENYGAKKILGDSTIQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDIMEAVDNNFNEASTFLNK
        LSSLPPRENLHGVNGRPG SSSSSPLPSADSLTLPENY AKKILGDS+IQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDIMEAVDNNFNEAST+LNK
Subjt:  LSSLPPRENLHGVNGRPGGSSSSSPLPSADSLTLPENYGAKKILGDSTIQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDIMEAVDNNFNEASTFLNK

Query:  MVSSDNVEICNEMSTLGLHSADGLSCNGKNDVTISSERTVNNPIPSSTLKDVQDMHQNINDKLFENNYHERNFFHNVGNPKIALYCSKSAPIEPEWEEDD
        MVSSDNVEICNEMSTLGLHSADGL CNGKNDVTIS  RTVNNPIPSSTLKDVQDMHQNINDKLFENNYHERNFFHNVGNPKIALYCSKSAPIEPEWEEDD
Subjt:  MVSSDNVEICNEMSTLGLHSADGLSCNGKNDVTISSERTVNNPIPSSTLKDVQDMHQNINDKLFENNYHERNFFHNVGNPKIALYCSKSAPIEPEWEEDD

Query:  IYLSHRKDAIAMMRSASQHSRVATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEILQTRNSENGLWKLDLHGLHAAEAVQALQDHLLKIETWNAS
        IYLSHRKDAIAMMRSASQHSR ATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEILQTRNSENGLWKLDLHGLHAAEAVQALQDHLLKIET NAS
Subjt:  IYLSHRKDAIAMMRSASQHSRVATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEILQTRNSENGLWKLDLHGLHAAEAVQALQDHLLKIETWNAS

Query:  NRSLSPKKAERKGFYRVSSLEYLSCLGVKLDKELQSPLPRHRPTSLEVITGIGKHSRGEAALPKAVTSFLSENGYRFEQLRPGTISVRPKFRR
        NRSLSPKKAERKGF+RVSSLEYLSC+GVKLDKELQSPLPRHRPTSLEVITG+GKHSRGEAALPKAVTSFLSENGYRFEQLRPGTISVRPKFRR
Subjt:  NRSLSPKKAERKGFYRVSSLEYLSCLGVKLDKELQSPLPRHRPTSLEVITGIGKHSRGEAALPKAVTSFLSENGYRFEQLRPGTISVRPKFRR

XP_022953355.1 uncharacterized protein LOC111455928 isoform X2 [Cucurbita moschata]0.0e+0096.21Show/hide
Query:  MSWGRGKSPGWAAVNLKQQNSGLQDEIDPDPFPPMSTAHSFLPPCENVHRVNGRSGRSSQENFGAEKTILGNSSIRSSKKLVEESTDVLAFWKLKELHSW
        MSWGRGKSPGWAAVNLKQQNSGLQDEIDPDPFPPMSTA SFLPP ENVHRVNGRSGRSS ENFG +KTI GNSSIRS KKLVEESTDVLAFWKLKELHSW
Subjt:  MSWGRGKSPGWAAVNLKQQNSGLQDEIDPDPFPPMSTAHSFLPPCENVHRVNGRSGRSSQENFGAEKTILGNSSIRSSKKLVEESTDVLAFWKLKELHSW

Query:  ADISLIVDIMEAVNNNFNEASKLLKTMVSSDNFEINNEMSTLGLHSSNDVSLVRGKSSGWEEFNLKQQNRGFQDRIDPKPFPPMPSSLSSLPPRENLHGV
        ADISLIVDIMEAVNNNFNEASKLLKTMVSSDNFEINNEMSTLGLHSSNDVSLVRGKS GWEEFNLKQQNRG QDRIDPKPFPPMPS+LSSLPPRENLHGV
Subjt:  ADISLIVDIMEAVNNNFNEASKLLKTMVSSDNFEINNEMSTLGLHSSNDVSLVRGKSSGWEEFNLKQQNRGFQDRIDPKPFPPMPSSLSSLPPRENLHGV

Query:  NGRPGGSSSSSPLPSADSLTLPENYGAKKILGDSTIQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDIMEAVDNNFNEASTFLNKMVSSDNVEICNEM
        NGRPG SSSSSPLPSADSLTLPENY AKKILGDS+IQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDIMEAVDNNFNEAST+LNKMVSSDNVEICNEM
Subjt:  NGRPGGSSSSSPLPSADSLTLPENYGAKKILGDSTIQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDIMEAVDNNFNEASTFLNKMVSSDNVEICNEM

Query:  STLGLHSADGLSCNGKNDVTISSERTVNNPIPSSTLKDVQDMHQNINDKLFENNYHERNFFHNVGNPKIALYCSKSAPIEPEWEEDDIYLSHRKDAIAMM
        STLGLHSADGL CNGKNDVTIS  RTVNNPIPSSTLKDVQDMHQNINDKLFENNYHERNFFHNVGNPKIALYCSKSAPIEPEWEEDDIYLSHRKDAIAMM
Subjt:  STLGLHSADGLSCNGKNDVTISSERTVNNPIPSSTLKDVQDMHQNINDKLFENNYHERNFFHNVGNPKIALYCSKSAPIEPEWEEDDIYLSHRKDAIAMM

Query:  RSASQHSRVATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEILQTRNSENGLWKLDLHGLHAAEAVQALQDHLLKIETWNASNRSLSPKKAERKG
        RSASQHSR ATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEILQTRNSENGLWKLDLHGLHAAEAVQALQDHLLKIET NASNRSLSPKKAERKG
Subjt:  RSASQHSRVATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEILQTRNSENGLWKLDLHGLHAAEAVQALQDHLLKIETWNASNRSLSPKKAERKG

Query:  FYRVSSLEYLSCLGVKLDKELQSPLPRHRPTSLEVITGIGKHSRGEAALPKAVTSFLSENGYRFEQLRPGTISVRPKFRR
        F+RVSSLEYLSC+GVKLDKELQSPLPRHRPTSLEVITG+GKHSRGEAALPKAVTSFLSENGYRFEQLRPGTISVRPKFRR
Subjt:  FYRVSSLEYLSCLGVKLDKELQSPLPRHRPTSLEVITGIGKHSRGEAALPKAVTSFLSENGYRFEQLRPGTISVRPKFRR

XP_022991447.1 uncharacterized protein LOC111488062 isoform X1 [Cucurbita maxima]0.0e+00100Show/hide
Query:  MSWGRGKSPGWAAVNLKQQNSGLQDEIDPDPFPPMSTAHSFLPPCENVHRVNGRSGRSSQENFGAEKTILGNSSIRSSKKLVEESTDVLAFWKLKELHSW
        MSWGRGKSPGWAAVNLKQQNSGLQDEIDPDPFPPMSTAHSFLPPCENVHRVNGRSGRSSQENFGAEKTILGNSSIRSSKKLVEESTDVLAFWKLKELHSW
Subjt:  MSWGRGKSPGWAAVNLKQQNSGLQDEIDPDPFPPMSTAHSFLPPCENVHRVNGRSGRSSQENFGAEKTILGNSSIRSSKKLVEESTDVLAFWKLKELHSW

Query:  ADISLIVDIMEAVNNNFNEASKLLKTMVSSDNFEINNEMSTLGLHSSNDVSLVRGKSSGWEEFNLKQQNRGFQDRIDPKPFPPMPSSLSSLPPRENLHGV
        ADISLIVDIMEAVNNNFNEASKLLKTMVSSDNFEINNEMSTLGLHSSNDVSLVRGKSSGWEEFNLKQQNRGFQDRIDPKPFPPMPSSLSSLPPRENLHGV
Subjt:  ADISLIVDIMEAVNNNFNEASKLLKTMVSSDNFEINNEMSTLGLHSSNDVSLVRGKSSGWEEFNLKQQNRGFQDRIDPKPFPPMPSSLSSLPPRENLHGV

Query:  NGRPGGSSSSSPLPSADSLTLPENYGAKKILGDSTIQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDIMEAVDNNFNEASTFLNKMVSSDNVEICNEM
        NGRPGGSSSSSPLPSADSLTLPENYGAKKILGDSTIQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDIMEAVDNNFNEASTFLNKMVSSDNVEICNEM
Subjt:  NGRPGGSSSSSPLPSADSLTLPENYGAKKILGDSTIQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDIMEAVDNNFNEASTFLNKMVSSDNVEICNEM

Query:  STLGLHSADGLSCNGKNDVTISSERTVNNPIPSSTLKDVQDMHQNINDKLFENNYHERNFFHNVGNPKIALYCSKSAPIEPEWEEDDIYLSHRKDAIAMM
        STLGLHSADGLSCNGKNDVTISSERTVNNPIPSSTLKDVQDMHQNINDKLFENNYHERNFFHNVGNPKIALYCSKSAPIEPEWEEDDIYLSHRKDAIAMM
Subjt:  STLGLHSADGLSCNGKNDVTISSERTVNNPIPSSTLKDVQDMHQNINDKLFENNYHERNFFHNVGNPKIALYCSKSAPIEPEWEEDDIYLSHRKDAIAMM

Query:  RSASQHSRVATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEILQTRNSENGLWKLDLHGLHAAEAVQALQDHLLKIETWNASNRSLSPKKAERKG
        RSASQHSRVATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEILQTRNSENGLWKLDLHGLHAAEAVQALQDHLLKIETWNASNRSLSPKKAERKG
Subjt:  RSASQHSRVATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEILQTRNSENGLWKLDLHGLHAAEAVQALQDHLLKIETWNASNRSLSPKKAERKG

Query:  FYRVSSLEYLSCLGVKLDKELQSPLPRHRPTSLEVITGIGKHSRGEAALPKAVTSFLSENGYRFEQLRPGTISVRPKFRR
        FYRVSSLEYLSCLGVKLDKELQSPLPRHRPTSLEVITGIGKHSRGEAALPKAVTSFLSENGYRFEQLRPGTISVRPKFRR
Subjt:  FYRVSSLEYLSCLGVKLDKELQSPLPRHRPTSLEVITGIGKHSRGEAALPKAVTSFLSENGYRFEQLRPGTISVRPKFRR

XP_023548349.1 uncharacterized protein LOC111807017 [Cucurbita pepo subsp. pepo]0.0e+0093.76Show/hide
Query:  MSWGRGKSPGWAAVNLKQQNSGLQDEIDPDPFPPMSTAHSFLPPCENVHRVNGRSGRS-------------SQENFGAEKTILGNSSIRSSKKLVEESTD
        MSWGRGKSPGWAAVNLKQQNSGLQDEIDPDPFPPMSTA SFLPP ENVHRVNGRSGRS             S +NFGA+KTI GNSSIRS KKLVEESTD
Subjt:  MSWGRGKSPGWAAVNLKQQNSGLQDEIDPDPFPPMSTAHSFLPPCENVHRVNGRSGRS-------------SQENFGAEKTILGNSSIRSSKKLVEESTD

Query:  VLAFWKLKELHSWADISLIVDIMEAVNNNFNEASKLLKTMVSSDNFEINNEMSTLGLHSSNDVSLVRGKSSGWEEFNLKQQNRGFQDRIDPKPFPPMPSS
        VLAFWKLKELHSWADISLIVDIMEAVNNNFNEASKLLKTMVSSDNFEINNEMSTLGLHSSNDVSLVRGKS GWEE+NLKQQNRG QDRIDPKPFPPMPS+
Subjt:  VLAFWKLKELHSWADISLIVDIMEAVNNNFNEASKLLKTMVSSDNFEINNEMSTLGLHSSNDVSLVRGKSSGWEEFNLKQQNRGFQDRIDPKPFPPMPSS

Query:  LSSLPPRENLHGVNGRPGGSSSSSPLPSADSLTLPENYGAKKILGDSTIQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDIMEAVDNNFNEASTFLNK
        LSSLPPRENLHGV GRPG SSSSSPLPSADSLT PENY AKKILGDS+IQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDIMEAVDNNFNEAST+LNK
Subjt:  LSSLPPRENLHGVNGRPGGSSSSSPLPSADSLTLPENYGAKKILGDSTIQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDIMEAVDNNFNEASTFLNK

Query:  MVSSDNVEICNEMSTLGLHSADGLSCNGKNDVTISSERTVNNPIPSSTLKDVQDMHQNINDKLFENNYHERNFFHNVGNPKIALYCSKSAPIEPEWEEDD
        MVSSDNVEICNEMSTLGLHSADGLSC GKNDVTIS  RTVNNPIPSSTLKDVQDMHQNINDKLFENNYHERNFFHNVGNPKIALYCSKSAPIEPEWEEDD
Subjt:  MVSSDNVEICNEMSTLGLHSADGLSCNGKNDVTISSERTVNNPIPSSTLKDVQDMHQNINDKLFENNYHERNFFHNVGNPKIALYCSKSAPIEPEWEEDD

Query:  IYLSHRKDAIAMMRSASQHSRVATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEILQTRNSENGLWKLDLHGLHAAEAVQALQDHLLKIETWNAS
        IYLSHRKDAIAMMRSASQHSR ATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEILQTRNSENGLWKLDLHGLHAAEAVQALQDHLLKIET NAS
Subjt:  IYLSHRKDAIAMMRSASQHSRVATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEILQTRNSENGLWKLDLHGLHAAEAVQALQDHLLKIETWNAS

Query:  NRSLSPKKAERKGFYRVSSLEYLSCLGVKLDKELQSPLPRHRPTSLEVITGIGKHSRGEAALPKAVTSFLSENGYRFEQLRPGTISVRPKFRR
        NRSLSPKKAERKGF+RVSSLEYLSC+GVKLDKELQSPLPRHRPTSLEVITGIGKHSRGEAALPKAVTSFLSENGYRFEQLRPGTISVRPKFRR
Subjt:  NRSLSPKKAERKGFYRVSSLEYLSCLGVKLDKELQSPLPRHRPTSLEVITGIGKHSRGEAALPKAVTSFLSENGYRFEQLRPGTISVRPKFRR

TrEMBL top hitse value%identityAlignment
A0A0A0KA90 Smr domain-containing protein9.4e-24874.92Show/hide
Query:  MSWGRGKSPGWAAVNLKQQNSGLQDEIDPDPFPPMSTAHSFLPPCENVHRVNGRSGRS-----------------------SQENFGAEKTILGNSSIRS
        MSW RGKS GWAA NLKQQN+GLQDE+D DPFPPMST  S LPP EN+  VNG SG+S                       +  NFGA+KTILG ++I+S
Subjt:  MSWGRGKSPGWAAVNLKQQNSGLQDEIDPDPFPPMSTAHSFLPPCENVHRVNGRSGRS-----------------------SQENFGAEKTILGNSSIRS

Query:  SKKLVEESTDVLAFWKLKELHSWADISLIVDIMEAVNNNFNEASKLLKTMVSSDNFEINNEMSTLGLHSSNDVSLVRGKSSGWEEFNLKQQNRGFQDRID
         KKLVEE+ DVL+FWKLKELH WADISLI+DIMEAVNN+FNEAS LL TMVSSDN EINN+MSTLGLHSSND+  + GKS GWEEFNLKQ N+G QD +D
Subjt:  SKKLVEESTDVLAFWKLKELHSWADISLIVDIMEAVNNNFNEASKLLKTMVSSDNFEINNEMSTLGLHSSNDVSLVRGKSSGWEEFNLKQQNRGFQDRID

Query:  PKPFPPMPSSLSSLPPRENLHGVNGRPGGSSSSSPLPSADSLTLPENYGAKK-ILGDSTIQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDIMEAVDN
         + FPPM ++ SSLPP ENLHGV GR G S +S PLPS DSLT PENYGAK  I  DS+IQ+G+KVVEE TDVLAFWKLKE+H+WADFSLIVDIM+AV+N
Subjt:  PKPFPPMPSSLSSLPPRENLHGVNGRPGGSSSSSPLPSADSLTLPENYGAKK-ILGDSTIQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDIMEAVDN

Query:  NFNEASTFLNKMVSSDNVEICNEMSTLGLHSADGLSCNGKNDVTISSERTVNNPIPSSTLKDVQDMHQNIN------DKLFENNYHERNFFHNVGNPKIA
        NF+EAST L  MVSSDN EI NE+STLGLHSA+ L CNG NDV+I+SER +N PI SST+K VQ +HQN N       KLF N+Y ERN FHN GN KIA
Subjt:  NFNEASTFLNKMVSSDNVEICNEMSTLGLHSADGLSCNGKNDVTISSERTVNNPIPSSTLKDVQDMHQNIN------DKLFENNYHERNFFHNVGNPKIA

Query:  LYCSKSAPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRVATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEILQTRNSENGLWKLDLHGLHAAEA
        L CSKS PIEPEWEEDDIYLSHRKDAIAMMRSASQHSR ATNAY RKDHASAKYHSSRA+EQWLAAKMLN KAANEILQTRNS+NGLWKLDLHGLHAAEA
Subjt:  LYCSKSAPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRVATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEILQTRNSENGLWKLDLHGLHAAEA

Query:  VQALQDHLLKIETWNASNRSLSPKKAERKGFYRVSSLEYLSCLGVKLDKELQSPLPRHRPTSLEVITGIGKHSRGEAALPKAVTSFLSENGYRFEQLRPG
        VQAL DHLLKIET NASNRSLSPKKAERKGF R SSLEYLSC+  KLDKE  SP  RHRPTSLEVITGIGKHS+GEAALPKAV SFL+ENGYRFEQ RPG
Subjt:  VQALQDHLLKIETWNASNRSLSPKKAERKGFYRVSSLEYLSCLGVKLDKELQSPLPRHRPTSLEVITGIGKHSRGEAALPKAVTSFLSENGYRFEQLRPG

Query:  TISVRPKFRR
        TISVRPKFRR
Subjt:  TISVRPKFRR

A0A1S3BRS7 uncharacterized protein LOC1034925901.5e-24875.74Show/hide
Query:  MSWGRGKSPGWAAVNLKQQNSGLQDEIDPDPFPPMSTAHSFLPPCENVHRVNGRSGRS-----------------------SQENFGAEKTILGNSSIRS
        MSW RGKS GWAA NLKQQN+G+QDE+D DPFPPMST  S LPP EN+  VNGRSGRS                       +  NF A+KTILG S+I+S
Subjt:  MSWGRGKSPGWAAVNLKQQNSGLQDEIDPDPFPPMSTAHSFLPPCENVHRVNGRSGRS-----------------------SQENFGAEKTILGNSSIRS

Query:  SKKLVEESTDVLAFWKLKELHSWADISLIVDIMEAVNNNFNEASKLLKTMVSSDNFEINNEMSTLGLHSSNDVSLVRGKSSGWEEFNLKQQNRGFQDRID
         KK+VEE+ DVL+FWKLKELH WADISLI+DIMEAVNN+FNEAS LL TMVSSDN EINNEMS LGLHSSND+S + GKS GWEEFNL+Q NRG Q   D
Subjt:  SKKLVEESTDVLAFWKLKELHSWADISLIVDIMEAVNNNFNEASKLLKTMVSSDNFEINNEMSTLGLHSSNDVSLVRGKSSGWEEFNLKQQNRGFQDRID

Query:  PKPFPPMPSSLSSLPPRENLHGVNGRPGGSSSSSPLPSADSLTLPENYGAKK-ILGDSTIQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDIMEAVDN
        P+ FPPM ++  SLPP ENLHGV GR G S +S PLPSADSLT P NYGAK  I  DS IQ+G+KVVEE TDVLAFWKLKE+H+WADFSLIVDIM+AV+N
Subjt:  PKPFPPMPSSLSSLPPRENLHGVNGRPGGSSSSSPLPSADSLTLPENYGAKK-ILGDSTIQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDIMEAVDN

Query:  NFNEASTFLNKMVSSDNVEICNEMSTLGLHSADGLSCNGKNDVTISSERTVNNPIPSSTLKDVQDMHQNIN------DKLFENNYHERNFFHNVGNPKIA
        NF+EAST L  MVSSDN EI NE+STLGLHSA+ L CNG NDV+ISSERT+N PI S TLK  Q MHQN N       KLF N+Y ERNFF N GN KIA
Subjt:  NFNEASTFLNKMVSSDNVEICNEMSTLGLHSADGLSCNGKNDVTISSERTVNNPIPSSTLKDVQDMHQNIN------DKLFENNYHERNFFHNVGNPKIA

Query:  LYCSKSAPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRVATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEILQTRNSENGLWKLDLHGLHAAEA
        L CSKS PIEPEWEEDDIYLSHRKDAIAMMRSASQHSR ATNAY RKDHASAKYHSSRAQEQWLAAKMLN KAANEILQTRNS+NGLWKLDLHGLHAAEA
Subjt:  LYCSKSAPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRVATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEILQTRNSENGLWKLDLHGLHAAEA

Query:  VQALQDHLLKIETWNASNRSLSPKKAERKGFYRVSSLEYLSCLGVKLDKELQSPLPRHRPTSLEVITGIGKHSRGEAALPKAVTSFLSENGYRFEQLRPG
        VQALQDHLLKIET NASNRSLSPKKAERKGF R SSLEYLSC+  KLDKE  SP  RHRPTSLEVITGIGKHS+GEAALPKAVTSFL+ENGYRFEQ RPG
Subjt:  VQALQDHLLKIETWNASNRSLSPKKAERKGFYRVSSLEYLSCLGVKLDKELQSPLPRHRPTSLEVITGIGKHSRGEAALPKAVTSFLSENGYRFEQLRPG

Query:  TISVRPKFRR
        TISVRPKFRR
Subjt:  TISVRPKFRR

A0A6J1GN51 uncharacterized protein LOC111455928 isoform X10.0e+0094.1Show/hide
Query:  MSWGRGKSPGWAAVNLKQQNSGLQDEIDPDPFPPMSTAHSFLPPCENVHRVNGRSGRS-------------SQENFGAEKTILGNSSIRSSKKLVEESTD
        MSWGRGKSPGWAAVNLKQQNSGLQDEIDPDPFPPMSTA SFLPP ENVHRVNGRSGRS             S ENFG +KTI GNSSIRS KKLVEESTD
Subjt:  MSWGRGKSPGWAAVNLKQQNSGLQDEIDPDPFPPMSTAHSFLPPCENVHRVNGRSGRS-------------SQENFGAEKTILGNSSIRSSKKLVEESTD

Query:  VLAFWKLKELHSWADISLIVDIMEAVNNNFNEASKLLKTMVSSDNFEINNEMSTLGLHSSNDVSLVRGKSSGWEEFNLKQQNRGFQDRIDPKPFPPMPSS
        VLAFWKLKELHSWADISLIVDIMEAVNNNFNEASKLLKTMVSSDNFEINNEMSTLGLHSSNDVSLVRGKS GWEEFNLKQQNRG QDRIDPKPFPPMPS+
Subjt:  VLAFWKLKELHSWADISLIVDIMEAVNNNFNEASKLLKTMVSSDNFEINNEMSTLGLHSSNDVSLVRGKSSGWEEFNLKQQNRGFQDRIDPKPFPPMPSS

Query:  LSSLPPRENLHGVNGRPGGSSSSSPLPSADSLTLPENYGAKKILGDSTIQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDIMEAVDNNFNEASTFLNK
        LSSLPPRENLHGVNGRPG SSSSSPLPSADSLTLPENY AKKILGDS+IQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDIMEAVDNNFNEAST+LNK
Subjt:  LSSLPPRENLHGVNGRPGGSSSSSPLPSADSLTLPENYGAKKILGDSTIQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDIMEAVDNNFNEASTFLNK

Query:  MVSSDNVEICNEMSTLGLHSADGLSCNGKNDVTISSERTVNNPIPSSTLKDVQDMHQNINDKLFENNYHERNFFHNVGNPKIALYCSKSAPIEPEWEEDD
        MVSSDNVEICNEMSTLGLHSADGL CNGKNDVTIS  RTVNNPIPSSTLKDVQDMHQNINDKLFENNYHERNFFHNVGNPKIALYCSKSAPIEPEWEEDD
Subjt:  MVSSDNVEICNEMSTLGLHSADGLSCNGKNDVTISSERTVNNPIPSSTLKDVQDMHQNINDKLFENNYHERNFFHNVGNPKIALYCSKSAPIEPEWEEDD

Query:  IYLSHRKDAIAMMRSASQHSRVATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEILQTRNSENGLWKLDLHGLHAAEAVQALQDHLLKIETWNAS
        IYLSHRKDAIAMMRSASQHSR ATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEILQTRNSENGLWKLDLHGLHAAEAVQALQDHLLKIET NAS
Subjt:  IYLSHRKDAIAMMRSASQHSRVATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEILQTRNSENGLWKLDLHGLHAAEAVQALQDHLLKIETWNAS

Query:  NRSLSPKKAERKGFYRVSSLEYLSCLGVKLDKELQSPLPRHRPTSLEVITGIGKHSRGEAALPKAVTSFLSENGYRFEQLRPGTISVRPKFRR
        NRSLSPKKAERKGF+RVSSLEYLSC+GVKLDKELQSPLPRHRPTSLEVITG+GKHSRGEAALPKAVTSFLSENGYRFEQLRPGTISVRPKFRR
Subjt:  NRSLSPKKAERKGFYRVSSLEYLSCLGVKLDKELQSPLPRHRPTSLEVITGIGKHSRGEAALPKAVTSFLSENGYRFEQLRPGTISVRPKFRR

A0A6J1GPE7 uncharacterized protein LOC111455928 isoform X20.0e+0096.21Show/hide
Query:  MSWGRGKSPGWAAVNLKQQNSGLQDEIDPDPFPPMSTAHSFLPPCENVHRVNGRSGRSSQENFGAEKTILGNSSIRSSKKLVEESTDVLAFWKLKELHSW
        MSWGRGKSPGWAAVNLKQQNSGLQDEIDPDPFPPMSTA SFLPP ENVHRVNGRSGRSS ENFG +KTI GNSSIRS KKLVEESTDVLAFWKLKELHSW
Subjt:  MSWGRGKSPGWAAVNLKQQNSGLQDEIDPDPFPPMSTAHSFLPPCENVHRVNGRSGRSSQENFGAEKTILGNSSIRSSKKLVEESTDVLAFWKLKELHSW

Query:  ADISLIVDIMEAVNNNFNEASKLLKTMVSSDNFEINNEMSTLGLHSSNDVSLVRGKSSGWEEFNLKQQNRGFQDRIDPKPFPPMPSSLSSLPPRENLHGV
        ADISLIVDIMEAVNNNFNEASKLLKTMVSSDNFEINNEMSTLGLHSSNDVSLVRGKS GWEEFNLKQQNRG QDRIDPKPFPPMPS+LSSLPPRENLHGV
Subjt:  ADISLIVDIMEAVNNNFNEASKLLKTMVSSDNFEINNEMSTLGLHSSNDVSLVRGKSSGWEEFNLKQQNRGFQDRIDPKPFPPMPSSLSSLPPRENLHGV

Query:  NGRPGGSSSSSPLPSADSLTLPENYGAKKILGDSTIQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDIMEAVDNNFNEASTFLNKMVSSDNVEICNEM
        NGRPG SSSSSPLPSADSLTLPENY AKKILGDS+IQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDIMEAVDNNFNEAST+LNKMVSSDNVEICNEM
Subjt:  NGRPGGSSSSSPLPSADSLTLPENYGAKKILGDSTIQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDIMEAVDNNFNEASTFLNKMVSSDNVEICNEM

Query:  STLGLHSADGLSCNGKNDVTISSERTVNNPIPSSTLKDVQDMHQNINDKLFENNYHERNFFHNVGNPKIALYCSKSAPIEPEWEEDDIYLSHRKDAIAMM
        STLGLHSADGL CNGKNDVTIS  RTVNNPIPSSTLKDVQDMHQNINDKLFENNYHERNFFHNVGNPKIALYCSKSAPIEPEWEEDDIYLSHRKDAIAMM
Subjt:  STLGLHSADGLSCNGKNDVTISSERTVNNPIPSSTLKDVQDMHQNINDKLFENNYHERNFFHNVGNPKIALYCSKSAPIEPEWEEDDIYLSHRKDAIAMM

Query:  RSASQHSRVATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEILQTRNSENGLWKLDLHGLHAAEAVQALQDHLLKIETWNASNRSLSPKKAERKG
        RSASQHSR ATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEILQTRNSENGLWKLDLHGLHAAEAVQALQDHLLKIET NASNRSLSPKKAERKG
Subjt:  RSASQHSRVATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEILQTRNSENGLWKLDLHGLHAAEAVQALQDHLLKIETWNASNRSLSPKKAERKG

Query:  FYRVSSLEYLSCLGVKLDKELQSPLPRHRPTSLEVITGIGKHSRGEAALPKAVTSFLSENGYRFEQLRPGTISVRPKFRR
        F+RVSSLEYLSC+GVKLDKELQSPLPRHRPTSLEVITG+GKHSRGEAALPKAVTSFLSENGYRFEQLRPGTISVRPKFRR
Subjt:  FYRVSSLEYLSCLGVKLDKELQSPLPRHRPTSLEVITGIGKHSRGEAALPKAVTSFLSENGYRFEQLRPGTISVRPKFRR

A0A6J1JLU6 uncharacterized protein LOC111488062 isoform X10.0e+00100Show/hide
Query:  MSWGRGKSPGWAAVNLKQQNSGLQDEIDPDPFPPMSTAHSFLPPCENVHRVNGRSGRSSQENFGAEKTILGNSSIRSSKKLVEESTDVLAFWKLKELHSW
        MSWGRGKSPGWAAVNLKQQNSGLQDEIDPDPFPPMSTAHSFLPPCENVHRVNGRSGRSSQENFGAEKTILGNSSIRSSKKLVEESTDVLAFWKLKELHSW
Subjt:  MSWGRGKSPGWAAVNLKQQNSGLQDEIDPDPFPPMSTAHSFLPPCENVHRVNGRSGRSSQENFGAEKTILGNSSIRSSKKLVEESTDVLAFWKLKELHSW

Query:  ADISLIVDIMEAVNNNFNEASKLLKTMVSSDNFEINNEMSTLGLHSSNDVSLVRGKSSGWEEFNLKQQNRGFQDRIDPKPFPPMPSSLSSLPPRENLHGV
        ADISLIVDIMEAVNNNFNEASKLLKTMVSSDNFEINNEMSTLGLHSSNDVSLVRGKSSGWEEFNLKQQNRGFQDRIDPKPFPPMPSSLSSLPPRENLHGV
Subjt:  ADISLIVDIMEAVNNNFNEASKLLKTMVSSDNFEINNEMSTLGLHSSNDVSLVRGKSSGWEEFNLKQQNRGFQDRIDPKPFPPMPSSLSSLPPRENLHGV

Query:  NGRPGGSSSSSPLPSADSLTLPENYGAKKILGDSTIQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDIMEAVDNNFNEASTFLNKMVSSDNVEICNEM
        NGRPGGSSSSSPLPSADSLTLPENYGAKKILGDSTIQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDIMEAVDNNFNEASTFLNKMVSSDNVEICNEM
Subjt:  NGRPGGSSSSSPLPSADSLTLPENYGAKKILGDSTIQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDIMEAVDNNFNEASTFLNKMVSSDNVEICNEM

Query:  STLGLHSADGLSCNGKNDVTISSERTVNNPIPSSTLKDVQDMHQNINDKLFENNYHERNFFHNVGNPKIALYCSKSAPIEPEWEEDDIYLSHRKDAIAMM
        STLGLHSADGLSCNGKNDVTISSERTVNNPIPSSTLKDVQDMHQNINDKLFENNYHERNFFHNVGNPKIALYCSKSAPIEPEWEEDDIYLSHRKDAIAMM
Subjt:  STLGLHSADGLSCNGKNDVTISSERTVNNPIPSSTLKDVQDMHQNINDKLFENNYHERNFFHNVGNPKIALYCSKSAPIEPEWEEDDIYLSHRKDAIAMM

Query:  RSASQHSRVATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEILQTRNSENGLWKLDLHGLHAAEAVQALQDHLLKIETWNASNRSLSPKKAERKG
        RSASQHSRVATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEILQTRNSENGLWKLDLHGLHAAEAVQALQDHLLKIETWNASNRSLSPKKAERKG
Subjt:  RSASQHSRVATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEILQTRNSENGLWKLDLHGLHAAEAVQALQDHLLKIETWNASNRSLSPKKAERKG

Query:  FYRVSSLEYLSCLGVKLDKELQSPLPRHRPTSLEVITGIGKHSRGEAALPKAVTSFLSENGYRFEQLRPGTISVRPKFRR
        FYRVSSLEYLSCLGVKLDKELQSPLPRHRPTSLEVITGIGKHSRGEAALPKAVTSFLSENGYRFEQLRPGTISVRPKFRR
Subjt:  FYRVSSLEYLSCLGVKLDKELQSPLPRHRPTSLEVITGIGKHSRGEAALPKAVTSFLSENGYRFEQLRPGTISVRPKFRR

SwissProt top hitse value%identityAlignment
Q86UW6 NEDD4-binding protein 21.7e-0431.3Show/hide
Query:  LHAAEAVQALQDHLLKIETWNASNRSLSPKKA-ERKGFYRVSSLEYLSCLGVKLDKELQSPLPRHRPTSLEVITGIGKHSRGEAA-LPKAVTSFLSENGY
        LH  +  +A  +HL  IE +   N SL P+   +  G +   +LE+L  +   L+K+ +          L VITG G HS+G  A +  AV  +L  + +
Subjt:  LHAAEAVQALQDHLLKIETWNASNRSLSPKKA-ERKGFYRVSSLEYLSCLGVKLDKELQSPLPRHRPTSLEVITGIGKHSRGEAA-LPKAVTSFLSENGY

Query:  RFEQLRPGTISVRPK
        RF +++PG + V  K
Subjt:  RFEQLRPGTISVRPK

Arabidopsis top hitse value%identityAlignment
AT5G23520.1 smr (Small MutS Related) domain-containing protein1.2e-7442.12Show/hide
Query:  VSLVRGKSSGWEEFNLKQ-QNRGFQDRIDPKPFPPMPSSL-SSLPPRENLHGVNGRPGGSSSSSPL--PSA-DSLTLPENYGAKKILGDSTIQNGRKVVE
        +S ++GKSSGW  F+LKQ Q +G +  ++  PFPP+ +S+ +S   R  L   N  P   S SS L  PS   +LT  ++ G ++  G    +     + 
Subjt:  VSLVRGKSSGWEEFNLKQ-QNRGFQDRIDPKPFPPMPSSL-SSLPPRENLHGVNGRPGGSSSSSPL--PSA-DSLTLPENYGAKKILGDSTIQNGRKVVE

Query:  ETTDVLAFWKLKELHTWADFSLIVDIMEAVDNNFNEASTFLNKMVSSDNVEICNEMSTLGLHSADGLSCNGKNDVTISSERTVNNPI---PSSTLKDV--
          +  LAF KLKE+++WAD +LI D++ + +++F  A  FL  MVSS   +   E  T      +G S + +     + E+TV + +     ST +D   
Subjt:  ETTDVLAFWKLKELHTWADFSLIVDIMEAVDNNFNEASTFLNKMVSSDNVEICNEMSTLGLHSADGLSCNGKNDVTISSERTVNNPI---PSSTLKDV--

Query:  QDMHQNINDKLFENNYHERNFFHNVGNPKIALYCSKSAPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRVATNAYLRKDHASAKYHSSRAQEQWLAAKML
         D+  +       N      F  ++      +   +S PIEPEWEEDD+YLSHRKDA+ +MRSAS HSR A NA+ R DHASAK HS +A+E WLAA+ L
Subjt:  QDMHQNINDKLFENNYHERNFFHNVGNPKIALYCSKSAPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRVATNAYLRKDHASAKYHSSRAQEQWLAAKML

Query:  NAKAANEILQTRNSENGLWKLDLHGLHAAEAVQALQDHLLKIETWNASNRSLSPKKAERK-GFYRVSSLEYLSCLGVKLDKE---LQSPLPRHRPTSLEV
        NA+AA +I+   N +N +WKLDLHGLHA EAVQALQ+ L  IE     NRS+SP +   K    R +S E       +LD+E    Q    R    SL+V
Subjt:  NAKAANEILQTRNSENGLWKLDLHGLHAAEAVQALQDHLLKIETWNASNRSLSPKKAERK-GFYRVSSLEYLSCLGVKLDKE---LQSPLPRHRPTSLEV

Query:  ITGIGKHSRGEAALPKAVTSFLSENGYRFEQLRPGTISVRPKFR
        ITGIGKHSRG+A+LP AV +F  +N YRF++ RPG I+VRPKFR
Subjt:  ITGIGKHSRGEAALPKAVTSFLSENGYRFEQLRPGTISVRPKFR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGTGGGGGAGGGGTAAATCTCCTGGCTGGGCAGCGGTTAACCTTAAGCAACAGAATAGTGGCCTTCAAGATGAAATTGACCCGGACCCATTCCCACCAATGTCTAC
CGCTCATTCCTTTCTGCCACCCTGTGAAAACGTACACAGAGTTAATGGTCGTTCAGGGAGGTCTTCACAAGAAAATTTTGGTGCAGAAAAGACAATACTTGGTAATTCTA
GCATTCGAAGTAGCAAGAAGTTGGTTGAAGAATCCACTGATGTTTTAGCCTTCTGGAAGCTTAAAGAGCTTCATTCTTGGGCTGATATCAGCTTGATTGTGGATATAATG
GAAGCTGTAAATAATAACTTCAACGAGGCGTCTAAGTTATTAAAAACAATGGTTTCTAGTGACAATTTTGAGATCAATAATGAGATGAGCACCTTAGGACTGCATTCCTC
TAATGATGTATCATTGGTGAGGGGTAAATCTTCTGGCTGGGAAGAATTTAACCTTAAGCAACAAAATAGAGGCTTTCAAGATAGAATTGATCCGAAACCATTCCCACCGA
TGCCAAGTTCCCTTTCCTCTTTGCCACCCCGTGAAAACTTGCACGGTGTTAATGGCCGTCCAGGGGGATCCTCCTCATCTTCACCTCTTCCTTCTGCTGATTCTCTAACT
TTGCCAGAAAATTACGGTGCAAAGAAAATACTTGGTGATTCTACCATTCAAAATGGAAGGAAGGTGGTTGAAGAAACCACTGACGTTTTAGCCTTTTGGAAGCTTAAGGA
GCTTCATACTTGGGCTGATTTTAGCTTGATTGTGGATATAATGGAAGCTGTAGATAATAACTTCAATGAGGCGTCTACTTTTTTAAACAAAATGGTTTCTAGTGACAATG
TTGAGATCTGTAACGAGATGAGCACCCTAGGACTGCATTCTGCTGATGGATTATCGTGCAATGGGAAGAATGATGTAACCATATCATCAGAAAGAACTGTTAATAATCCC
ATCCCTAGTTCCACACTAAAGGATGTGCAAGACATGCATCAAAATATTAATGATAAATTGTTTGAAAATAATTATCATGAAAGAAATTTCTTTCACAATGTTGGAAATCC
AAAAATAGCTCTTTATTGCTCAAAGTCTGCTCCTATTGAGCCCGAGTGGGAAGAAGATGATATTTACCTGAGCCATCGGAAAGATGCTATAGCAATGATGAGGTCTGCAT
CTCAACATTCAAGGGTAGCCACTAATGCCTATCTTCGGAAGGATCATGCTTCTGCCAAGTATCATTCATCAAGGGCTCAAGAACAATGGCTAGCTGCAAAAATGTTAAAT
GCTAAGGCAGCCAATGAAATTTTACAAACAAGGAATAGTGAAAATGGGCTCTGGAAGTTGGACTTACATGGGCTTCATGCAGCAGAGGCTGTTCAAGCCTTGCAAGATCA
CTTGCTGAAAATCGAAACTTGGAATGCCTCCAATCGGTCGTTGTCGCCAAAGAAAGCTGAAAGGAAGGGTTTCTATCGTGTTTCATCCCTTGAGTATCTTAGTTGTCTGG
GCGTAAAGTTGGACAAAGAATTACAATCACCATTACCTAGGCATAGGCCGACATCATTGGAAGTCATAACAGGAATAGGTAAACATAGCAGGGGGGAGGCTGCTCTACCA
AAGGCCGTGACAAGTTTTCTTAGTGAAAATGGGTACCGTTTTGAACAGTTGAGGCCTGGAACGATCAGCGTTCGACCGAAGTTTCGTAGGTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCGTGGGGGAGGGGTAAATCTCCTGGCTGGGCAGCGGTTAACCTTAAGCAACAGAATAGTGGCCTTCAAGATGAAATTGACCCGGACCCATTCCCACCAATGTCTAC
CGCTCATTCCTTTCTGCCACCCTGTGAAAACGTACACAGAGTTAATGGTCGTTCAGGGAGGTCTTCACAAGAAAATTTTGGTGCAGAAAAGACAATACTTGGTAATTCTA
GCATTCGAAGTAGCAAGAAGTTGGTTGAAGAATCCACTGATGTTTTAGCCTTCTGGAAGCTTAAAGAGCTTCATTCTTGGGCTGATATCAGCTTGATTGTGGATATAATG
GAAGCTGTAAATAATAACTTCAACGAGGCGTCTAAGTTATTAAAAACAATGGTTTCTAGTGACAATTTTGAGATCAATAATGAGATGAGCACCTTAGGACTGCATTCCTC
TAATGATGTATCATTGGTGAGGGGTAAATCTTCTGGCTGGGAAGAATTTAACCTTAAGCAACAAAATAGAGGCTTTCAAGATAGAATTGATCCGAAACCATTCCCACCGA
TGCCAAGTTCCCTTTCCTCTTTGCCACCCCGTGAAAACTTGCACGGTGTTAATGGCCGTCCAGGGGGATCCTCCTCATCTTCACCTCTTCCTTCTGCTGATTCTCTAACT
TTGCCAGAAAATTACGGTGCAAAGAAAATACTTGGTGATTCTACCATTCAAAATGGAAGGAAGGTGGTTGAAGAAACCACTGACGTTTTAGCCTTTTGGAAGCTTAAGGA
GCTTCATACTTGGGCTGATTTTAGCTTGATTGTGGATATAATGGAAGCTGTAGATAATAACTTCAATGAGGCGTCTACTTTTTTAAACAAAATGGTTTCTAGTGACAATG
TTGAGATCTGTAACGAGATGAGCACCCTAGGACTGCATTCTGCTGATGGATTATCGTGCAATGGGAAGAATGATGTAACCATATCATCAGAAAGAACTGTTAATAATCCC
ATCCCTAGTTCCACACTAAAGGATGTGCAAGACATGCATCAAAATATTAATGATAAATTGTTTGAAAATAATTATCATGAAAGAAATTTCTTTCACAATGTTGGAAATCC
AAAAATAGCTCTTTATTGCTCAAAGTCTGCTCCTATTGAGCCCGAGTGGGAAGAAGATGATATTTACCTGAGCCATCGGAAAGATGCTATAGCAATGATGAGGTCTGCAT
CTCAACATTCAAGGGTAGCCACTAATGCCTATCTTCGGAAGGATCATGCTTCTGCCAAGTATCATTCATCAAGGGCTCAAGAACAATGGCTAGCTGCAAAAATGTTAAAT
GCTAAGGCAGCCAATGAAATTTTACAAACAAGGAATAGTGAAAATGGGCTCTGGAAGTTGGACTTACATGGGCTTCATGCAGCAGAGGCTGTTCAAGCCTTGCAAGATCA
CTTGCTGAAAATCGAAACTTGGAATGCCTCCAATCGGTCGTTGTCGCCAAAGAAAGCTGAAAGGAAGGGTTTCTATCGTGTTTCATCCCTTGAGTATCTTAGTTGTCTGG
GCGTAAAGTTGGACAAAGAATTACAATCACCATTACCTAGGCATAGGCCGACATCATTGGAAGTCATAACAGGAATAGGTAAACATAGCAGGGGGGAGGCTGCTCTACCA
AAGGCCGTGACAAGTTTTCTTAGTGAAAATGGGTACCGTTTTGAACAGTTGAGGCCTGGAACGATCAGCGTTCGACCGAAGTTTCGTAGGTAAACGACTCTCCCTTATTA
GTAGTAGCTTCAAAGTTATTCAGTCAGAACGTAGATTAGAGTTGTTGGAGGTTGGGTAAGTGAAACGTGTAAAATGACTGAAAATGTTCGTTATTTAGAAGGAGCTCTGT
AGGATTGGTTGTTAACCACATTACTGCATTGTTATTTTGTTGTCTTTCCTGGAAAAGATAGCACCTAGGAAGAAGAGGTTAGAAGAACTCTGGAATCCTGGGATTGTATT
CTATTATGATTCATTATGGCGATTGCACTAGTTGAGTCTACTTCTCTAATCCATTGAAAATATAATAGCTG
Protein sequenceShow/hide protein sequence
MSWGRGKSPGWAAVNLKQQNSGLQDEIDPDPFPPMSTAHSFLPPCENVHRVNGRSGRSSQENFGAEKTILGNSSIRSSKKLVEESTDVLAFWKLKELHSWADISLIVDIM
EAVNNNFNEASKLLKTMVSSDNFEINNEMSTLGLHSSNDVSLVRGKSSGWEEFNLKQQNRGFQDRIDPKPFPPMPSSLSSLPPRENLHGVNGRPGGSSSSSPLPSADSLT
LPENYGAKKILGDSTIQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDIMEAVDNNFNEASTFLNKMVSSDNVEICNEMSTLGLHSADGLSCNGKNDVTISSERTVNNP
IPSSTLKDVQDMHQNINDKLFENNYHERNFFHNVGNPKIALYCSKSAPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRVATNAYLRKDHASAKYHSSRAQEQWLAAKMLN
AKAANEILQTRNSENGLWKLDLHGLHAAEAVQALQDHLLKIETWNASNRSLSPKKAERKGFYRVSSLEYLSCLGVKLDKELQSPLPRHRPTSLEVITGIGKHSRGEAALP
KAVTSFLSENGYRFEQLRPGTISVRPKFRR