CuGenDBv2

Carg11515 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene ID: Carg11515
Organism: Cucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Description: Smr domain-containing protein
Genome location: Carg_Chr17:9443873..9447918
RNA-Seq Expression: Carg11515
Synteny: Carg11515
Gene Ontology terms: NA
InterPro domains:
IPR002625 - Smr domain
IPR013899 - Domain of unknown function DUF1771
IPR036063 - Smr domain superfamily
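
The genome location field above uses a `seqid:start..end` layout. As a minimal sketch, the string can be split into its parts and the locus span computed. The names `GenomeLocation` and `parse_location` are hypothetical helpers introduced here for illustration, and the span assumes 1-based, inclusive coordinates, which the page does not state explicitly.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """Hypothetical container for a 'seqid:start..end' location."""
    seqid: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Parse a 'seqid:start..end' string into its three parts."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.groups()
    return GenomeLocation(seqid, int(start), int(end))


loc = parse_location("Carg_Chr17:9443873..9447918")
print(loc.seqid, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, the locus spans 4046 bp on Carg_Chr17, which is longer than the mRNA sequence listed for this gene, as expected if the gene model contains introns.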


Homology
GenBank top hits (e value, % identity, alignment)
KAG6576051.1 hypothetical protein SDJN03_26690, partial [Cucurbita argyrosperma subsp. sororia] (e value: 0.0e+00; % identity: 100)
Query:  MSWGRGKSPGWAAVNLKQHNSGLQDEIDPDPFPPMSTALSFLPPRENVHRVNGRSGRSFSSTPLPSADSLMSPENFGAKKTIPGNSSIQSSKKLVEESTD
        MSWGRGKSPGWAAVNLKQHNSGLQDEIDPDPFPPMSTALSFLPPRENVHRVNGRSGRSFSSTPLPSADSLMSPENFGAKKTIPGNSSIQSSKKLVEESTD
Subjt:  MSWGRGKSPGWAAVNLKQHNSGLQDEIDPDPFPPMSTALSFLPPRENVHRVNGRSGRSFSSTPLPSADSLMSPENFGAKKTIPGNSSIQSSKKLVEESTD

Query:  VLAFWKLKELHSWADISLIVDIMEAVNNNFNEASKLLKTMVSSDNFEINNEMSTLGLHSSNDVSLVRGKSPGWEEFNLKQQNRGLQDRIDPKPFPPMPSA
        VLAFWKLKELHSWADISLIVDIMEAVNNNFNEASKLLKTMVSSDNFEINNEMSTLGLHSSNDVSLVRGKSPGWEEFNLKQQNRGLQDRIDPKPFPPMPSA
Subjt:  VLAFWKLKELHSWADISLIVDIMEAVNNNFNEASKLLKTMVSSDNFEINNEMSTLGLHSSNDVSLVRGKSPGWEEFNLKQQNRGLQDRIDPKPFPPMPSA

Query:  LSSLPPRENLHGVNGCPGRSSSSSPLPSADSLTSPENYIAKKILGDSSIQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDIMEAVDNNFNEASTYLNK
        LSSLPPRENLHGVNGCPGRSSSSSPLPSADSLTSPENYIAKKILGDSSIQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDIMEAVDNNFNEASTYLNK
Subjt:  LSSLPPRENLHGVNGCPGRSSSSSPLPSADSLTSPENYIAKKILGDSSIQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDIMEAVDNNFNEASTYLNK

Query:  MVSSDNVEICNEMSTLGLHSADGLPCNGKNDVTISLGRTVNNPIPSSTLKDVQDMHQNINDKLFENNYHERNFFHNVGNPKIALYCSKSAPIEPEWEEDD
        MVSSDNVEICNEMSTLGLHSADGLPCNGKNDVTISLGRTVNNPIPSSTLKDVQDMHQNINDKLFENNYHERNFFHNVGNPKIALYCSKSAPIEPEWEEDD
Subjt:  MVSSDNVEICNEMSTLGLHSADGLPCNGKNDVTISLGRTVNNPIPSSTLKDVQDMHQNINDKLFENNYHERNFFHNVGNPKIALYCSKSAPIEPEWEEDD

Query:  IYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEILQTRNSENGLWKLDLHGLHAAEAVQALQDHLLKIETRNAS
        IYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEILQTRNSENGLWKLDLHGLHAAEAVQALQDHLLKIETRNAS
Subjt:  IYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEILQTRNSENGLWKLDLHGLHAAEAVQALQDHLLKIETRNAS

Query:  NRSLSPKKAERKGFHRVSSLEYLSCMGVKLDKELQSPLPRHRPTSLEVITGVGKHSRGEAALPKAVTSFLSENGYRFEQLRPGTISVRPKFRR
        NRSLSPKKAERKGFHRVSSLEYLSCMGVKLDKELQSPLPRHRPTSLEVITGVGKHSRGEAALPKAVTSFLSENGYRFEQLRPGTISVRPKFRR
Subjt:  NRSLSPKKAERKGFHRVSSLEYLSCMGVKLDKELQSPLPRHRPTSLEVITGVGKHSRGEAALPKAVTSFLSENGYRFEQLRPGTISVRPKFRR

XP_022953352.1 uncharacterized protein LOC111455928 isoform X1 [Cucurbita moschata] (e value: 0.0e+00; % identity: 98.82)
Query:  MSWGRGKSPGWAAVNLKQHNSGLQDEIDPDPFPPMSTALSFLPPRENVHRVNGRSGRSFSSTPLPSADSLMSPENFGAKKTIPGNSSIQSSKKLVEESTD
        MSWGRGKSPGWAAVNLKQ NSGLQDEIDPDPFPPMSTALSFLPPRENVHRVNGRSGRSFSSTPLPSADSLMSPENFG KKTIPGNSSI+S KKLVEESTD
Subjt:  MSWGRGKSPGWAAVNLKQHNSGLQDEIDPDPFPPMSTALSFLPPRENVHRVNGRSGRSFSSTPLPSADSLMSPENFGAKKTIPGNSSIQSSKKLVEESTD

Query:  VLAFWKLKELHSWADISLIVDIMEAVNNNFNEASKLLKTMVSSDNFEINNEMSTLGLHSSNDVSLVRGKSPGWEEFNLKQQNRGLQDRIDPKPFPPMPSA
        VLAFWKLKELHSWADISLIVDIMEAVNNNFNEASKLLKTMVSSDNFEINNEMSTLGLHSSNDVSLVRGKSPGWEEFNLKQQNRGLQDRIDPKPFPPMPSA
Subjt:  VLAFWKLKELHSWADISLIVDIMEAVNNNFNEASKLLKTMVSSDNFEINNEMSTLGLHSSNDVSLVRGKSPGWEEFNLKQQNRGLQDRIDPKPFPPMPSA

Query:  LSSLPPRENLHGVNGCPGRSSSSSPLPSADSLTSPENYIAKKILGDSSIQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDIMEAVDNNFNEASTYLNK
        LSSLPPRENLHGVNG PGRSSSSSPLPSADSLT PENY AKKILGDSSIQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDIMEAVDNNFNEASTYLNK
Subjt:  LSSLPPRENLHGVNGCPGRSSSSSPLPSADSLTSPENYIAKKILGDSSIQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDIMEAVDNNFNEASTYLNK

Query:  MVSSDNVEICNEMSTLGLHSADGLPCNGKNDVTISLGRTVNNPIPSSTLKDVQDMHQNINDKLFENNYHERNFFHNVGNPKIALYCSKSAPIEPEWEEDD
        MVSSDNVEICNEMSTLGLHSADGLPCNGKNDVTISLGRTVNNPIPSSTLKDVQDMHQNINDKLFENNYHERNFFHNVGNPKIALYCSKSAPIEPEWEEDD
Subjt:  MVSSDNVEICNEMSTLGLHSADGLPCNGKNDVTISLGRTVNNPIPSSTLKDVQDMHQNINDKLFENNYHERNFFHNVGNPKIALYCSKSAPIEPEWEEDD

Query:  IYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEILQTRNSENGLWKLDLHGLHAAEAVQALQDHLLKIETRNAS
        IYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEILQTRNSENGLWKLDLHGLHAAEAVQALQDHLLKIETRNAS
Subjt:  IYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEILQTRNSENGLWKLDLHGLHAAEAVQALQDHLLKIETRNAS

Query:  NRSLSPKKAERKGFHRVSSLEYLSCMGVKLDKELQSPLPRHRPTSLEVITGVGKHSRGEAALPKAVTSFLSENGYRFEQLRPGTISVRPKFRR
        NRSLSPKKAERKGFHRVSSLEYLSCMGVKLDKELQSPLPRHRPTSLEVITGVGKHSRGEAALPKAVTSFLSENGYRFEQLRPGTISVRPKFRR
Subjt:  NRSLSPKKAERKGFHRVSSLEYLSCMGVKLDKELQSPLPRHRPTSLEVITGVGKHSRGEAALPKAVTSFLSENGYRFEQLRPGTISVRPKFRR

XP_022953355.1 uncharacterized protein LOC111455928 isoform X2 [Cucurbita moschata] (e value: 0.0e+00; % identity: 96.63)
Query:  MSWGRGKSPGWAAVNLKQHNSGLQDEIDPDPFPPMSTALSFLPPRENVHRVNGRSGRSFSSTPLPSADSLMSPENFGAKKTIPGNSSIQSSKKLVEESTD
        MSWGRGKSPGWAAVNLKQ NSGLQDEIDPDPFPPMSTALSFLPPRENVHRVNGRSGRS             SPENFG KKTIPGNSSI+S KKLVEESTD
Subjt:  MSWGRGKSPGWAAVNLKQHNSGLQDEIDPDPFPPMSTALSFLPPRENVHRVNGRSGRSFSSTPLPSADSLMSPENFGAKKTIPGNSSIQSSKKLVEESTD

Query:  VLAFWKLKELHSWADISLIVDIMEAVNNNFNEASKLLKTMVSSDNFEINNEMSTLGLHSSNDVSLVRGKSPGWEEFNLKQQNRGLQDRIDPKPFPPMPSA
        VLAFWKLKELHSWADISLIVDIMEAVNNNFNEASKLLKTMVSSDNFEINNEMSTLGLHSSNDVSLVRGKSPGWEEFNLKQQNRGLQDRIDPKPFPPMPSA
Subjt:  VLAFWKLKELHSWADISLIVDIMEAVNNNFNEASKLLKTMVSSDNFEINNEMSTLGLHSSNDVSLVRGKSPGWEEFNLKQQNRGLQDRIDPKPFPPMPSA

Query:  LSSLPPRENLHGVNGCPGRSSSSSPLPSADSLTSPENYIAKKILGDSSIQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDIMEAVDNNFNEASTYLNK
        LSSLPPRENLHGVNG PGRSSSSSPLPSADSLT PENY AKKILGDSSIQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDIMEAVDNNFNEASTYLNK
Subjt:  LSSLPPRENLHGVNGCPGRSSSSSPLPSADSLTSPENYIAKKILGDSSIQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDIMEAVDNNFNEASTYLNK

Query:  MVSSDNVEICNEMSTLGLHSADGLPCNGKNDVTISLGRTVNNPIPSSTLKDVQDMHQNINDKLFENNYHERNFFHNVGNPKIALYCSKSAPIEPEWEEDD
        MVSSDNVEICNEMSTLGLHSADGLPCNGKNDVTISLGRTVNNPIPSSTLKDVQDMHQNINDKLFENNYHERNFFHNVGNPKIALYCSKSAPIEPEWEEDD
Subjt:  MVSSDNVEICNEMSTLGLHSADGLPCNGKNDVTISLGRTVNNPIPSSTLKDVQDMHQNINDKLFENNYHERNFFHNVGNPKIALYCSKSAPIEPEWEEDD

Query:  IYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEILQTRNSENGLWKLDLHGLHAAEAVQALQDHLLKIETRNAS
        IYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEILQTRNSENGLWKLDLHGLHAAEAVQALQDHLLKIETRNAS
Subjt:  IYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEILQTRNSENGLWKLDLHGLHAAEAVQALQDHLLKIETRNAS

Query:  NRSLSPKKAERKGFHRVSSLEYLSCMGVKLDKELQSPLPRHRPTSLEVITGVGKHSRGEAALPKAVTSFLSENGYRFEQLRPGTISVRPKFRR
        NRSLSPKKAERKGFHRVSSLEYLSCMGVKLDKELQSPLPRHRPTSLEVITGVGKHSRGEAALPKAVTSFLSENGYRFEQLRPGTISVRPKFRR
Subjt:  NRSLSPKKAERKGFHRVSSLEYLSCMGVKLDKELQSPLPRHRPTSLEVITGVGKHSRGEAALPKAVTSFLSENGYRFEQLRPGTISVRPKFRR

XP_022991447.1 uncharacterized protein LOC111488062 isoform X1 [Cucurbita maxima] (e value: 0.0e+00; % identity: 93.76)
Query:  MSWGRGKSPGWAAVNLKQHNSGLQDEIDPDPFPPMSTALSFLPPRENVHRVNGRSGRSFSSTPLPSADSLMSPENFGAKKTIPGNSSIQSSKKLVEESTD
        MSWGRGKSPGWAAVNLKQ NSGLQDEIDPDPFPPMSTA SFLPP ENVHRVNGRSGRS             S ENFGA+KTI GNSSI+SSKKLVEESTD
Subjt:  MSWGRGKSPGWAAVNLKQHNSGLQDEIDPDPFPPMSTALSFLPPRENVHRVNGRSGRSFSSTPLPSADSLMSPENFGAKKTIPGNSSIQSSKKLVEESTD

Query:  VLAFWKLKELHSWADISLIVDIMEAVNNNFNEASKLLKTMVSSDNFEINNEMSTLGLHSSNDVSLVRGKSPGWEEFNLKQQNRGLQDRIDPKPFPPMPSA
        VLAFWKLKELHSWADISLIVDIMEAVNNNFNEASKLLKTMVSSDNFEINNEMSTLGLHSSNDVSLVRGKS GWEEFNLKQQNRG QDRIDPKPFPPMPS+
Subjt:  VLAFWKLKELHSWADISLIVDIMEAVNNNFNEASKLLKTMVSSDNFEINNEMSTLGLHSSNDVSLVRGKSPGWEEFNLKQQNRGLQDRIDPKPFPPMPSA

Query:  LSSLPPRENLHGVNGCPGRSSSSSPLPSADSLTSPENYIAKKILGDSSIQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDIMEAVDNNFNEASTYLNK
        LSSLPPRENLHGVNG PG SSSSSPLPSADSLT PENY AKKILGDS+IQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDIMEAVDNNFNEAST+LNK
Subjt:  LSSLPPRENLHGVNGCPGRSSSSSPLPSADSLTSPENYIAKKILGDSSIQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDIMEAVDNNFNEASTYLNK

Query:  MVSSDNVEICNEMSTLGLHSADGLPCNGKNDVTISLGRTVNNPIPSSTLKDVQDMHQNINDKLFENNYHERNFFHNVGNPKIALYCSKSAPIEPEWEEDD
        MVSSDNVEICNEMSTLGLHSADGL CNGKNDVTIS  RTVNNPIPSSTLKDVQDMHQNINDKLFENNYHERNFFHNVGNPKIALYCSKSAPIEPEWEEDD
Subjt:  MVSSDNVEICNEMSTLGLHSADGLPCNGKNDVTISLGRTVNNPIPSSTLKDVQDMHQNINDKLFENNYHERNFFHNVGNPKIALYCSKSAPIEPEWEEDD

Query:  IYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEILQTRNSENGLWKLDLHGLHAAEAVQALQDHLLKIETRNAS
        IYLSHRKDAIAMMRSASQHSR ATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEILQTRNSENGLWKLDLHGLHAAEAVQALQDHLLKIET NAS
Subjt:  IYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEILQTRNSENGLWKLDLHGLHAAEAVQALQDHLLKIETRNAS

Query:  NRSLSPKKAERKGFHRVSSLEYLSCMGVKLDKELQSPLPRHRPTSLEVITGVGKHSRGEAALPKAVTSFLSENGYRFEQLRPGTISVRPKFRR
        NRSLSPKKAERKGF+RVSSLEYLSC+GVKLDKELQSPLPRHRPTSLEVITG+GKHSRGEAALPKAVTSFLSENGYRFEQLRPGTISVRPKFRR
Subjt:  NRSLSPKKAERKGFHRVSSLEYLSCMGVKLDKELQSPLPRHRPTSLEVITGVGKHSRGEAALPKAVTSFLSENGYRFEQLRPGTISVRPKFRR

XP_023548349.1 uncharacterized protein LOC111807017 [Cucurbita pepo subsp. pepo] (e value: 0.0e+00; % identity: 98.15)
Query:  MSWGRGKSPGWAAVNLKQHNSGLQDEIDPDPFPPMSTALSFLPPRENVHRVNGRSGRSFSSTPLPSADSLMSPENFGAKKTIPGNSSIQSSKKLVEESTD
        MSWGRGKSPGWAAVNLKQ NSGLQDEIDPDPFPPMSTALSFLPPRENVHRVNGRSGRSFSSTPLPSADSLMSP+NFGAKKTIPGNSSI+S KKLVEESTD
Subjt:  MSWGRGKSPGWAAVNLKQHNSGLQDEIDPDPFPPMSTALSFLPPRENVHRVNGRSGRSFSSTPLPSADSLMSPENFGAKKTIPGNSSIQSSKKLVEESTD

Query:  VLAFWKLKELHSWADISLIVDIMEAVNNNFNEASKLLKTMVSSDNFEINNEMSTLGLHSSNDVSLVRGKSPGWEEFNLKQQNRGLQDRIDPKPFPPMPSA
        VLAFWKLKELHSWADISLIVDIMEAVNNNFNEASKLLKTMVSSDNFEINNEMSTLGLHSSNDVSLVRGKSPGWEE+NLKQQNRGLQDRIDPKPFPPMPSA
Subjt:  VLAFWKLKELHSWADISLIVDIMEAVNNNFNEASKLLKTMVSSDNFEINNEMSTLGLHSSNDVSLVRGKSPGWEEFNLKQQNRGLQDRIDPKPFPPMPSA

Query:  LSSLPPRENLHGVNGCPGRSSSSSPLPSADSLTSPENYIAKKILGDSSIQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDIMEAVDNNFNEASTYLNK
        LSSLPPRENLHGV G PGRSSSSSPLPSADSLTSPENY AKKILGDSSIQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDIMEAVDNNFNEASTYLNK
Subjt:  LSSLPPRENLHGVNGCPGRSSSSSPLPSADSLTSPENYIAKKILGDSSIQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDIMEAVDNNFNEASTYLNK

Query:  MVSSDNVEICNEMSTLGLHSADGLPCNGKNDVTISLGRTVNNPIPSSTLKDVQDMHQNINDKLFENNYHERNFFHNVGNPKIALYCSKSAPIEPEWEEDD
        MVSSDNVEICNEMSTLGLHSADGL C GKNDVTISLGRTVNNPIPSSTLKDVQDMHQNINDKLFENNYHERNFFHNVGNPKIALYCSKSAPIEPEWEEDD
Subjt:  MVSSDNVEICNEMSTLGLHSADGLPCNGKNDVTISLGRTVNNPIPSSTLKDVQDMHQNINDKLFENNYHERNFFHNVGNPKIALYCSKSAPIEPEWEEDD

Query:  IYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEILQTRNSENGLWKLDLHGLHAAEAVQALQDHLLKIETRNAS
        IYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEILQTRNSENGLWKLDLHGLHAAEAVQALQDHLLKIETRNAS
Subjt:  IYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEILQTRNSENGLWKLDLHGLHAAEAVQALQDHLLKIETRNAS

Query:  NRSLSPKKAERKGFHRVSSLEYLSCMGVKLDKELQSPLPRHRPTSLEVITGVGKHSRGEAALPKAVTSFLSENGYRFEQLRPGTISVRPKFRR
        NRSLSPKKAERKGFHRVSSLEYLSCMGVKLDKELQSPLPRHRPTSLEVITG+GKHSRGEAALPKAVTSFLSENGYRFEQLRPGTISVRPKFRR
Subjt:  NRSLSPKKAERKGFHRVSSLEYLSCMGVKLDKELQSPLPRHRPTSLEVITGVGKHSRGEAALPKAVTSFLSENGYRFEQLRPGTISVRPKFRR

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KA90 Smr domain-containing protein (e value: 5.6e-256; % identity: 77.05)
Query:  MSWGRGKSPGWAAVNLKQHNSGLQDEIDPDPFPPMSTALSFLPPRENVHRVNGRSGRSFSSTPLPSADSLMSP----------ENFGAKKTIPGNSSIQS
        MSW RGKS GWAA NLKQ N+GLQDE+D DPFPPMST LS LPPREN+  VNG SG+SFS  P+PSADS   P           NFGAKKTI G ++IQS
Subjt:  MSWGRGKSPGWAAVNLKQHNSGLQDEIDPDPFPPMSTALSFLPPRENVHRVNGRSGRSFSSTPLPSADSLMSP----------ENFGAKKTIPGNSSIQS

Query:  SKKLVEESTDVLAFWKLKELHSWADISLIVDIMEAVNNNFNEASKLLKTMVSSDNFEINNEMSTLGLHSSNDVSLVRGKSPGWEEFNLKQQNRGLQDRID
         KKLVEE+ DVL+FWKLKELH WADISLI+DIMEAVNN+FNEAS LL TMVSSDN EINN+MSTLGLHSSND+  + GKSPGWEEFNLKQ N+GLQD +D
Subjt:  SKKLVEESTDVLAFWKLKELHSWADISLIVDIMEAVNNNFNEASKLLKTMVSSDNFEINNEMSTLGLHSSNDVSLVRGKSPGWEEFNLKQQNRGLQDRID

Query:  PKPFPPMPSALSSLPPRENLHGVNGCPGRSSSSSPLPSADSLTSPENYIAKK-ILGDSSIQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDIMEAVDN
         + FPPM +  SSLPP ENLHGV G  GRS +S PLPS DSLTSPENY AK  I  DSSIQ+G+KVVEE TDVLAFWKLKE+H+WADFSLIVDIM+AV+N
Subjt:  PKPFPPMPSALSSLPPRENLHGVNGCPGRSSSSSPLPSADSLTSPENYIAKK-ILGDSSIQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDIMEAVDN

Query:  NFNEASTYLNKMVSSDNVEICNEMSTLGLHSADGLPCNGKNDVTISLGRTVNNPIPSSTLKDVQDMHQNIN------DKLFENNYHERNFFHNVGNPKIA
        NF+EAST L  MVSSDN EI NE+STLGLHSA+ L CNG NDV+I+  R +N PI SST+K VQ +HQN N       KLF N+Y ERN FHN GN KIA
Subjt:  NFNEASTYLNKMVSSDNVEICNEMSTLGLHSADGLPCNGKNDVTISLGRTVNNPIPSSTLKDVQDMHQNIN------DKLFENNYHERNFFHNVGNPKIA

Query:  LYCSKSAPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEILQTRNSENGLWKLDLHGLHAAEA
        L CSKS PIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAY RKDHASAKYHSSRA+EQWLAAKMLN KAANEILQTRNS+NGLWKLDLHGLHAAEA
Subjt:  LYCSKSAPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEILQTRNSENGLWKLDLHGLHAAEA

Query:  VQALQDHLLKIETRNASNRSLSPKKAERKGFHRVSSLEYLSCMGVKLDKELQSPLPRHRPTSLEVITGVGKHSRGEAALPKAVTSFLSENGYRFEQLRPG
        VQAL DHLLKIET+NASNRSLSPKKAERKGF R SSLEYLSCM  KLDKE  SP  RHRPTSLEVITG+GKHS+GEAALPKAV SFL+ENGYRFEQ RPG
Subjt:  VQALQDHLLKIETRNASNRSLSPKKAERKGFHRVSSLEYLSCMGVKLDKELQSPLPRHRPTSLEVITGVGKHSRGEAALPKAVTSFLSENGYRFEQLRPG

Query:  TISVRPKFRR
        TISVRPKFRR
Subjt:  TISVRPKFRR

A0A1S3BRS7 uncharacterized protein LOC103492590 (e value: 1.1e-256; % identity: 77.7)
Query:  MSWGRGKSPGWAAVNLKQHNSGLQDEIDPDPFPPMSTALSFLPPRENVHRVNGRSGRSFSSTPLPSADSLMSP----------ENFGAKKTIPGNSSIQS
        MSW RGKS GWAA NLKQ N+G+QDE+D DPFPPMST LS LPPREN+  VNGRSGRSFS  P+PSADS   P           NF AKKTI G S+IQS
Subjt:  MSWGRGKSPGWAAVNLKQHNSGLQDEIDPDPFPPMSTALSFLPPRENVHRVNGRSGRSFSSTPLPSADSLMSP----------ENFGAKKTIPGNSSIQS

Query:  SKKLVEESTDVLAFWKLKELHSWADISLIVDIMEAVNNNFNEASKLLKTMVSSDNFEINNEMSTLGLHSSNDVSLVRGKSPGWEEFNLKQQNRGLQDRID
         KK+VEE+ DVL+FWKLKELH WADISLI+DIMEAVNN+FNEAS LL TMVSSDN EINNEMS LGLHSSND+S + GKSPGWEEFNL+Q NRGLQ   D
Subjt:  SKKLVEESTDVLAFWKLKELHSWADISLIVDIMEAVNNNFNEASKLLKTMVSSDNFEINNEMSTLGLHSSNDVSLVRGKSPGWEEFNLKQQNRGLQDRID

Query:  PKPFPPMPSALSSLPPRENLHGVNGCPGRSSSSSPLPSADSLTSPENYIAKK-ILGDSSIQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDIMEAVDN
        P+ FPPM +   SLPP ENLHGV G  GRS +S PLPSADSLTSP NY AK  I  DS IQ+G+KVVEE TDVLAFWKLKE+H+WADFSLIVDIM+AV+N
Subjt:  PKPFPPMPSALSSLPPRENLHGVNGCPGRSSSSSPLPSADSLTSPENYIAKK-ILGDSSIQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDIMEAVDN

Query:  NFNEASTYLNKMVSSDNVEICNEMSTLGLHSADGLPCNGKNDVTISLGRTVNNPIPSSTLKDVQDMHQNIN------DKLFENNYHERNFFHNVGNPKIA
        NF+EAST L  MVSSDN EI NE+STLGLHSA+ L CNG NDV+IS  RT+N PI S TLK  Q MHQN N       KLF N+Y ERNFF N GN KIA
Subjt:  NFNEASTYLNKMVSSDNVEICNEMSTLGLHSADGLPCNGKNDVTISLGRTVNNPIPSSTLKDVQDMHQNIN------DKLFENNYHERNFFHNVGNPKIA

Query:  LYCSKSAPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEILQTRNSENGLWKLDLHGLHAAEA
        L CSKS PIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAY RKDHASAKYHSSRAQEQWLAAKMLN KAANEILQTRNS+NGLWKLDLHGLHAAEA
Subjt:  LYCSKSAPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEILQTRNSENGLWKLDLHGLHAAEA

Query:  VQALQDHLLKIETRNASNRSLSPKKAERKGFHRVSSLEYLSCMGVKLDKELQSPLPRHRPTSLEVITGVGKHSRGEAALPKAVTSFLSENGYRFEQLRPG
        VQALQDHLLKIET+NASNRSLSPKKAERKGF R SSLEYLSCM  KLDKE  SP  RHRPTSLEVITG+GKHS+GEAALPKAVTSFL+ENGYRFEQ RPG
Subjt:  VQALQDHLLKIETRNASNRSLSPKKAERKGFHRVSSLEYLSCMGVKLDKELQSPLPRHRPTSLEVITGVGKHSRGEAALPKAVTSFLSENGYRFEQLRPG

Query:  TISVRPKFRR
        TISVRPKFRR
Subjt:  TISVRPKFRR

A0A6J1GN51 uncharacterized protein LOC111455928 isoform X1 (e value: 0.0e+00; % identity: 98.82)
Query:  MSWGRGKSPGWAAVNLKQHNSGLQDEIDPDPFPPMSTALSFLPPRENVHRVNGRSGRSFSSTPLPSADSLMSPENFGAKKTIPGNSSIQSSKKLVEESTD
        MSWGRGKSPGWAAVNLKQ NSGLQDEIDPDPFPPMSTALSFLPPRENVHRVNGRSGRSFSSTPLPSADSLMSPENFG KKTIPGNSSI+S KKLVEESTD
Subjt:  MSWGRGKSPGWAAVNLKQHNSGLQDEIDPDPFPPMSTALSFLPPRENVHRVNGRSGRSFSSTPLPSADSLMSPENFGAKKTIPGNSSIQSSKKLVEESTD

Query:  VLAFWKLKELHSWADISLIVDIMEAVNNNFNEASKLLKTMVSSDNFEINNEMSTLGLHSSNDVSLVRGKSPGWEEFNLKQQNRGLQDRIDPKPFPPMPSA
        VLAFWKLKELHSWADISLIVDIMEAVNNNFNEASKLLKTMVSSDNFEINNEMSTLGLHSSNDVSLVRGKSPGWEEFNLKQQNRGLQDRIDPKPFPPMPSA
Subjt:  VLAFWKLKELHSWADISLIVDIMEAVNNNFNEASKLLKTMVSSDNFEINNEMSTLGLHSSNDVSLVRGKSPGWEEFNLKQQNRGLQDRIDPKPFPPMPSA

Query:  LSSLPPRENLHGVNGCPGRSSSSSPLPSADSLTSPENYIAKKILGDSSIQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDIMEAVDNNFNEASTYLNK
        LSSLPPRENLHGVNG PGRSSSSSPLPSADSLT PENY AKKILGDSSIQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDIMEAVDNNFNEASTYLNK
Subjt:  LSSLPPRENLHGVNGCPGRSSSSSPLPSADSLTSPENYIAKKILGDSSIQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDIMEAVDNNFNEASTYLNK

Query:  MVSSDNVEICNEMSTLGLHSADGLPCNGKNDVTISLGRTVNNPIPSSTLKDVQDMHQNINDKLFENNYHERNFFHNVGNPKIALYCSKSAPIEPEWEEDD
        MVSSDNVEICNEMSTLGLHSADGLPCNGKNDVTISLGRTVNNPIPSSTLKDVQDMHQNINDKLFENNYHERNFFHNVGNPKIALYCSKSAPIEPEWEEDD
Subjt:  MVSSDNVEICNEMSTLGLHSADGLPCNGKNDVTISLGRTVNNPIPSSTLKDVQDMHQNINDKLFENNYHERNFFHNVGNPKIALYCSKSAPIEPEWEEDD

Query:  IYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEILQTRNSENGLWKLDLHGLHAAEAVQALQDHLLKIETRNAS
        IYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEILQTRNSENGLWKLDLHGLHAAEAVQALQDHLLKIETRNAS
Subjt:  IYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEILQTRNSENGLWKLDLHGLHAAEAVQALQDHLLKIETRNAS

Query:  NRSLSPKKAERKGFHRVSSLEYLSCMGVKLDKELQSPLPRHRPTSLEVITGVGKHSRGEAALPKAVTSFLSENGYRFEQLRPGTISVRPKFRR
        NRSLSPKKAERKGFHRVSSLEYLSCMGVKLDKELQSPLPRHRPTSLEVITGVGKHSRGEAALPKAVTSFLSENGYRFEQLRPGTISVRPKFRR
Subjt:  NRSLSPKKAERKGFHRVSSLEYLSCMGVKLDKELQSPLPRHRPTSLEVITGVGKHSRGEAALPKAVTSFLSENGYRFEQLRPGTISVRPKFRR

A0A6J1GPE7 uncharacterized protein LOC111455928 isoform X2 (e value: 0.0e+00; % identity: 96.63)
Query:  MSWGRGKSPGWAAVNLKQHNSGLQDEIDPDPFPPMSTALSFLPPRENVHRVNGRSGRSFSSTPLPSADSLMSPENFGAKKTIPGNSSIQSSKKLVEESTD
        MSWGRGKSPGWAAVNLKQ NSGLQDEIDPDPFPPMSTALSFLPPRENVHRVNGRSGRS             SPENFG KKTIPGNSSI+S KKLVEESTD
Subjt:  MSWGRGKSPGWAAVNLKQHNSGLQDEIDPDPFPPMSTALSFLPPRENVHRVNGRSGRSFSSTPLPSADSLMSPENFGAKKTIPGNSSIQSSKKLVEESTD

Query:  VLAFWKLKELHSWADISLIVDIMEAVNNNFNEASKLLKTMVSSDNFEINNEMSTLGLHSSNDVSLVRGKSPGWEEFNLKQQNRGLQDRIDPKPFPPMPSA
        VLAFWKLKELHSWADISLIVDIMEAVNNNFNEASKLLKTMVSSDNFEINNEMSTLGLHSSNDVSLVRGKSPGWEEFNLKQQNRGLQDRIDPKPFPPMPSA
Subjt:  VLAFWKLKELHSWADISLIVDIMEAVNNNFNEASKLLKTMVSSDNFEINNEMSTLGLHSSNDVSLVRGKSPGWEEFNLKQQNRGLQDRIDPKPFPPMPSA

Query:  LSSLPPRENLHGVNGCPGRSSSSSPLPSADSLTSPENYIAKKILGDSSIQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDIMEAVDNNFNEASTYLNK
        LSSLPPRENLHGVNG PGRSSSSSPLPSADSLT PENY AKKILGDSSIQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDIMEAVDNNFNEASTYLNK
Subjt:  LSSLPPRENLHGVNGCPGRSSSSSPLPSADSLTSPENYIAKKILGDSSIQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDIMEAVDNNFNEASTYLNK

Query:  MVSSDNVEICNEMSTLGLHSADGLPCNGKNDVTISLGRTVNNPIPSSTLKDVQDMHQNINDKLFENNYHERNFFHNVGNPKIALYCSKSAPIEPEWEEDD
        MVSSDNVEICNEMSTLGLHSADGLPCNGKNDVTISLGRTVNNPIPSSTLKDVQDMHQNINDKLFENNYHERNFFHNVGNPKIALYCSKSAPIEPEWEEDD
Subjt:  MVSSDNVEICNEMSTLGLHSADGLPCNGKNDVTISLGRTVNNPIPSSTLKDVQDMHQNINDKLFENNYHERNFFHNVGNPKIALYCSKSAPIEPEWEEDD

Query:  IYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEILQTRNSENGLWKLDLHGLHAAEAVQALQDHLLKIETRNAS
        IYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEILQTRNSENGLWKLDLHGLHAAEAVQALQDHLLKIETRNAS
Subjt:  IYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEILQTRNSENGLWKLDLHGLHAAEAVQALQDHLLKIETRNAS

Query:  NRSLSPKKAERKGFHRVSSLEYLSCMGVKLDKELQSPLPRHRPTSLEVITGVGKHSRGEAALPKAVTSFLSENGYRFEQLRPGTISVRPKFRR
        NRSLSPKKAERKGFHRVSSLEYLSCMGVKLDKELQSPLPRHRPTSLEVITGVGKHSRGEAALPKAVTSFLSENGYRFEQLRPGTISVRPKFRR
Subjt:  NRSLSPKKAERKGFHRVSSLEYLSCMGVKLDKELQSPLPRHRPTSLEVITGVGKHSRGEAALPKAVTSFLSENGYRFEQLRPGTISVRPKFRR

A0A6J1JLU6 uncharacterized protein LOC111488062 isoform X1 (e value: 0.0e+00; % identity: 93.76)
Query:  MSWGRGKSPGWAAVNLKQHNSGLQDEIDPDPFPPMSTALSFLPPRENVHRVNGRSGRSFSSTPLPSADSLMSPENFGAKKTIPGNSSIQSSKKLVEESTD
        MSWGRGKSPGWAAVNLKQ NSGLQDEIDPDPFPPMSTA SFLPP ENVHRVNGRSGRS             S ENFGA+KTI GNSSI+SSKKLVEESTD
Subjt:  MSWGRGKSPGWAAVNLKQHNSGLQDEIDPDPFPPMSTALSFLPPRENVHRVNGRSGRSFSSTPLPSADSLMSPENFGAKKTIPGNSSIQSSKKLVEESTD

Query:  VLAFWKLKELHSWADISLIVDIMEAVNNNFNEASKLLKTMVSSDNFEINNEMSTLGLHSSNDVSLVRGKSPGWEEFNLKQQNRGLQDRIDPKPFPPMPSA
        VLAFWKLKELHSWADISLIVDIMEAVNNNFNEASKLLKTMVSSDNFEINNEMSTLGLHSSNDVSLVRGKS GWEEFNLKQQNRG QDRIDPKPFPPMPS+
Subjt:  VLAFWKLKELHSWADISLIVDIMEAVNNNFNEASKLLKTMVSSDNFEINNEMSTLGLHSSNDVSLVRGKSPGWEEFNLKQQNRGLQDRIDPKPFPPMPSA

Query:  LSSLPPRENLHGVNGCPGRSSSSSPLPSADSLTSPENYIAKKILGDSSIQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDIMEAVDNNFNEASTYLNK
        LSSLPPRENLHGVNG PG SSSSSPLPSADSLT PENY AKKILGDS+IQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDIMEAVDNNFNEAST+LNK
Subjt:  LSSLPPRENLHGVNGCPGRSSSSSPLPSADSLTSPENYIAKKILGDSSIQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDIMEAVDNNFNEASTYLNK

Query:  MVSSDNVEICNEMSTLGLHSADGLPCNGKNDVTISLGRTVNNPIPSSTLKDVQDMHQNINDKLFENNYHERNFFHNVGNPKIALYCSKSAPIEPEWEEDD
        MVSSDNVEICNEMSTLGLHSADGL CNGKNDVTIS  RTVNNPIPSSTLKDVQDMHQNINDKLFENNYHERNFFHNVGNPKIALYCSKSAPIEPEWEEDD
Subjt:  MVSSDNVEICNEMSTLGLHSADGLPCNGKNDVTISLGRTVNNPIPSSTLKDVQDMHQNINDKLFENNYHERNFFHNVGNPKIALYCSKSAPIEPEWEEDD

Query:  IYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEILQTRNSENGLWKLDLHGLHAAEAVQALQDHLLKIETRNAS
        IYLSHRKDAIAMMRSASQHSR ATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEILQTRNSENGLWKLDLHGLHAAEAVQALQDHLLKIET NAS
Subjt:  IYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEQWLAAKMLNAKAANEILQTRNSENGLWKLDLHGLHAAEAVQALQDHLLKIETRNAS

Query:  NRSLSPKKAERKGFHRVSSLEYLSCMGVKLDKELQSPLPRHRPTSLEVITGVGKHSRGEAALPKAVTSFLSENGYRFEQLRPGTISVRPKFRR
        NRSLSPKKAERKGF+RVSSLEYLSC+GVKLDKELQSPLPRHRPTSLEVITG+GKHSRGEAALPKAVTSFLSENGYRFEQLRPGTISVRPKFRR
Subjt:  NRSLSPKKAERKGFHRVSSLEYLSCMGVKLDKELQSPLPRHRPTSLEVITGVGKHSRGEAALPKAVTSFLSENGYRFEQLRPGTISVRPKFRR

SwissProt top hits (e value, % identity, alignment)
Q86UW6 NEDD4-binding protein 2 (e value: 3.9e-04; % identity: 32.17)
Query:  LHAAEAVQALQDHLLKIETRNASNRSLSPKKA-ERKGFHRVSSLEYLSCMGVKLDKELQSPLPRHRPTSLEVITGVGKHSRGEAA-LPKAVTSFLSENGY
        LH  +  +A  +HL  IE     N SL P+   +  G H   +LE+L  +   L+K+ +          L VITG G HS+G  A +  AV  +L  + +
Subjt:  LHAAEAVQALQDHLLKIETRNASNRSLSPKKA-ERKGFHRVSSLEYLSCMGVKLDKELQSPLPRHRPTSLEVITGVGKHSRGEAA-LPKAVTSFLSENGY

Query:  RFEQLRPGTISVRPK
        RF +++PG + V  K
Subjt:  RFEQLRPGTISVRPK

Arabidopsis top hits (e value, % identity, alignment)
AT5G23520.1 smr (Small MutS Related) domain-containing protein (e value: 5.3e-73; % identity: 39.87)
Query:  VSLVRGKSPGWEEFNLKQ-QNRGLQDRIDPKPFPPMPSALSSLPPRENLHGVNGCPGRSSSSSPLPSADSLTSPENYIAKKILGDSSIQNGRKVVEETTD
        +S ++GKS GW  F+LKQ Q +GL+  ++  PFPP+ +++++        GV G   R+   S    +  L  P  + A     D   Q          D
Subjt:  VSLVRGKSPGWEEFNLKQ-QNRGLQDRIDPKPFPPMPSALSSLPPRENLHGVNGCPGRSSSSSPLPSADSLTSPENYIAKKILGDSSIQNGRKVVEETTD

Query:  V---------LAFWKLKELHTWADFSLIVDIMEAVDNNFNEASTYLNKMVSSDNVEICNEMSTLGLHSADGLPCNGKNDVTISLGRTVNNPI---PSSTL
                  LAF KLKE+++WAD +LI D++ + +++F  A  +L  MVSS   +   E  T      +G   + +     +  +TV + +     ST 
Subjt:  V---------LAFWKLKELHTWADFSLIVDIMEAVDNNFNEASTYLNKMVSSDNVEICNEMSTLGLHSADGLPCNGKNDVTISLGRTVNNPI---PSSTL

Query:  KDV--QDMHQNINDKLFENNYHERNFFHNVGNPKIALYCSKSAPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEQWL
        +D    D+  +       N      F  ++      +   +S PIEPEWEEDD+YLSHRKDA+ +MRSAS HSRAA NA+ R DHASAK HS +A+E WL
Subjt:  KDV--QDMHQNINDKLFENNYHERNFFHNVGNPKIALYCSKSAPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSSRAQEQWL

Query:  AAKMLNAKAANEILQTRNSENGLWKLDLHGLHAAEAVQALQDHLLKIETRNASNRSLSPKKAERKGFH-RVSSLEYLSCMGVKLDKE---LQSPLPRHRP
        AA+ LNA+AA +I+   N +N +WKLDLHGLHA EAVQALQ+ L  IE     NRS+SP +   K    R +S E       +LD+E    Q    R   
Subjt:  AAKMLNAKAANEILQTRNSENGLWKLDLHGLHAAEAVQALQDHLLKIETRNASNRSLSPKKAERKGFH-RVSSLEYLSCMGVKLDKE---LQSPLPRHRP

Query:  TSLEVITGVGKHSRGEAALPKAVTSFLSENGYRFEQLRPGTISVRPKFR
         SL+VITG+GKHSRG+A+LP AV +F  +N YRF++ RPG I+VRPKFR
Subjt:  TSLEVITGVGKHSRGEAALPKAVTSFLSENGYRFEQLRPGTISVRPKFR
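
The unlabeled middle line of each alignment block above appears to follow the usual BLASTP midline convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. Assuming that convention holds for this page, a rough sketch of how identities could be tallied for a single block is shown below. The function name `midline_stats` is a hypothetical helper, and the example uses only the final 15-residue fragment of the Q86UW6 alignment, so its percentage is not comparable to the whole-alignment figure reported for that hit.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Tally identities and positives from one BLAST-style alignment block.

    Assumes the midline has the same length as the query line, with
    letters for identities, '+' for conservative substitutions, and
    spaces for mismatches or gaps.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    length = len(midline)
    return {
        "length": length,
        "identities": identities,
        "positives": positives,
        "percent_identity": round(100 * identities / length, 2),
    }


# Final fragment of the SwissProt Q86UW6 alignment shown above.
query = "RFEQLRPGTISVRPK"
midline = "RF +++PG + V  K"
print(midline_stats(query, midline))
```

Summing `identities` and `length` over every block of a hit before dividing would give a whole-alignment estimate, provided the midline whitespace has been preserved exactly.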


Sequences
CDS sequence
ATGTCGTGGGGGAGGGGTAAATCTCCTGGCTGGGCAGCGGTTAACCTTAAGCAACATAATAGTGGCCTTCAAGATGAAATTGACCCGGACCCATTCCCACCAATGTCTAC
CGCTCTTTCCTTTCTGCCACCCCGTGAAAACGTACACAGAGTTAATGGTCGTTCAGGGAGATCTTTCTCATCGACACCCCTCCCTTCTGCCGATTCTTTAATGTCACCAG
AAAATTTTGGTGCAAAAAAGACCATACCTGGTAATTCTAGCATTCAAAGTAGCAAGAAGTTGGTTGAAGAATCCACGGATGTTTTAGCCTTCTGGAAGCTTAAAGAGCTT
CATTCTTGGGCTGATATCAGCTTGATTGTGGATATAATGGAAGCTGTAAATAATAACTTCAACGAGGCGTCTAAGTTATTAAAAACAATGGTTTCTAGTGACAATTTTGA
GATCAATAATGAGATGAGCACCTTAGGACTGCATTCCTCTAATGATGTATCATTGGTGAGGGGTAAATCTCCTGGCTGGGAAGAATTTAACCTTAAGCAACAAAATAGAG
GCCTTCAAGATAGAATTGATCCGAAACCATTCCCACCGATGCCAAGTGCCCTCTCCTCTTTGCCACCCCGTGAAAACTTGCACGGAGTTAATGGCTGTCCAGGGAGATCC
TCCTCATCTTCACCTCTTCCTTCTGCTGATTCTCTAACTTCGCCAGAAAATTACATTGCAAAGAAAATACTTGGTGATTCTAGCATTCAAAATGGAAGGAAGGTGGTTGA
AGAAACCACTGACGTTTTAGCCTTTTGGAAGCTTAAGGAGCTTCATACTTGGGCTGATTTTAGCTTGATTGTGGATATAATGGAAGCTGTAGATAATAACTTCAATGAGG
CATCTACTTATTTAAACAAAATGGTTTCTAGTGACAATGTTGAGATCTGTAACGAGATGAGCACCTTAGGACTGCATTCTGCTGATGGTCTACCGTGCAATGGGAAGAAT
GATGTAACTATATCATTAGGAAGAACTGTTAATAATCCCATCCCTAGTTCCACACTAAAGGATGTGCAAGACATGCATCAAAATATTAATGATAAATTGTTTGAAAATAA
TTATCATGAAAGAAATTTCTTTCATAATGTTGGAAATCCAAAAATAGCTCTTTATTGCTCAAAGTCTGCTCCTATTGAGCCCGAGTGGGAAGAAGATGATATTTACCTGA
GCCATCGGAAAGATGCTATAGCAATGATGAGGTCTGCATCTCAACATTCAAGGGCAGCCACTAATGCCTATCTTCGGAAGGATCATGCTTCAGCCAAGTATCATTCATCA
AGGGCTCAAGAACAATGGCTAGCTGCAAAAATGTTAAATGCTAAGGCAGCCAATGAAATTTTACAAACAAGGAATAGTGAAAATGGGCTCTGGAAGTTGGACTTACATGG
GCTTCATGCAGCAGAGGCTGTTCAAGCCTTGCAAGATCACTTGCTGAAAATCGAAACTCGGAATGCCTCCAATCGGTCGTTGTCGCCAAAGAAAGCTGAAAGGAAGGGTT
TCCATCGTGTTTCATCCCTTGAGTATCTTAGTTGTATGGGCGTAAAGTTGGACAAAGAATTACAATCACCATTACCTAGGCATAGGCCGACATCATTGGAAGTCATAACA
GGAGTAGGCAAACATAGCAGGGGGGAAGCTGCTCTACCAAAGGCCGTGACAAGTTTTCTTAGTGAAAATGGGTACCGTTTTGAACAGTTGAGGCCCGGGACGATCAGCGT
TCGCCCAAAGTTTCGTAGGTAA
mRNA sequence
ATGTCGTGGGGGAGGGGTAAATCTCCTGGCTGGGCAGCGGTTAACCTTAAGCAACATAATAGTGGCCTTCAAGATGAAATTGACCCGGACCCATTCCCACCAATGTCTAC
CGCTCTTTCCTTTCTGCCACCCCGTGAAAACGTACACAGAGTTAATGGTCGTTCAGGGAGATCTTTCTCATCGACACCCCTCCCTTCTGCCGATTCTTTAATGTCACCAG
AAAATTTTGGTGCAAAAAAGACCATACCTGGTAATTCTAGCATTCAAAGTAGCAAGAAGTTGGTTGAAGAATCCACGGATGTTTTAGCCTTCTGGAAGCTTAAAGAGCTT
CATTCTTGGGCTGATATCAGCTTGATTGTGGATATAATGGAAGCTGTAAATAATAACTTCAACGAGGCGTCTAAGTTATTAAAAACAATGGTTTCTAGTGACAATTTTGA
GATCAATAATGAGATGAGCACCTTAGGACTGCATTCCTCTAATGATGTATCATTGGTGAGGGGTAAATCTCCTGGCTGGGAAGAATTTAACCTTAAGCAACAAAATAGAG
GCCTTCAAGATAGAATTGATCCGAAACCATTCCCACCGATGCCAAGTGCCCTCTCCTCTTTGCCACCCCGTGAAAACTTGCACGGAGTTAATGGCTGTCCAGGGAGATCC
TCCTCATCTTCACCTCTTCCTTCTGCTGATTCTCTAACTTCGCCAGAAAATTACATTGCAAAGAAAATACTTGGTGATTCTAGCATTCAAAATGGAAGGAAGGTGGTTGA
AGAAACCACTGACGTTTTAGCCTTTTGGAAGCTTAAGGAGCTTCATACTTGGGCTGATTTTAGCTTGATTGTGGATATAATGGAAGCTGTAGATAATAACTTCAATGAGG
CATCTACTTATTTAAACAAAATGGTTTCTAGTGACAATGTTGAGATCTGTAACGAGATGAGCACCTTAGGACTGCATTCTGCTGATGGTCTACCGTGCAATGGGAAGAAT
GATGTAACTATATCATTAGGAAGAACTGTTAATAATCCCATCCCTAGTTCCACACTAAAGGATGTGCAAGACATGCATCAAAATATTAATGATAAATTGTTTGAAAATAA
TTATCATGAAAGAAATTTCTTTCATAATGTTGGAAATCCAAAAATAGCTCTTTATTGCTCAAAGTCTGCTCCTATTGAGCCCGAGTGGGAAGAAGATGATATTTACCTGA
GCCATCGGAAAGATGCTATAGCAATGATGAGGTCTGCATCTCAACATTCAAGGGCAGCCACTAATGCCTATCTTCGGAAGGATCATGCTTCAGCCAAGTATCATTCATCA
AGGGCTCAAGAACAATGGCTAGCTGCAAAAATGTTAAATGCTAAGGCAGCCAATGAAATTTTACAAACAAGGAATAGTGAAAATGGGCTCTGGAAGTTGGACTTACATGG
GCTTCATGCAGCAGAGGCTGTTCAAGCCTTGCAAGATCACTTGCTGAAAATCGAAACTCGGAATGCCTCCAATCGGTCGTTGTCGCCAAAGAAAGCTGAAAGGAAGGGTT
TCCATCGTGTTTCATCCCTTGAGTATCTTAGTTGTATGGGCGTAAAGTTGGACAAAGAATTACAATCACCATTACCTAGGCATAGGCCGACATCATTGGAAGTCATAACA
GGAGTAGGCAAACATAGCAGGGGGGAAGCTGCTCTACCAAAGGCCGTGACAAGTTTTCTTAGTGAAAATGGGTACCGTTTTGAACAGTTGAGGCCCGGGACGATCAGCGT
TCGCCCAAAGTTTCGTAGGTAAATGAGTACCTCTTATTAGTAATAGCTTCAAAGTTATTCAGTCAGAACGTAGATTAGAGTCGTTGGAGGTTAAGTGAAACGTGTAAAAT
GACTGAAAATGTTATTTAGAAGGAGCTCTGTAGGATTGATTGTTAACCACATTACTGCATTGTTATTTTGTTGTCTTTCCTGGAAAAGATGGCACCTAGGAAGAAGAGGT
TAGAAGAACTCTGGAATCCTGGGATTGTATTCTATTATGATTCATTATGGCGATTGCACTAGTCGAGTCTACTTCTCTAATCTTTTGAAAATATAATAGCTGATTTGCTT
ATGACTTCATCAATGTTGATGCAAATAAACTGATGTGGTAATATA
Protein sequence
MSWGRGKSPGWAAVNLKQHNSGLQDEIDPDPFPPMSTALSFLPPRENVHRVNGRSGRSFSSTPLPSADSLMSPENFGAKKTIPGNSSIQSSKKLVEESTDVLAFWKLKEL
HSWADISLIVDIMEAVNNNFNEASKLLKTMVSSDNFEINNEMSTLGLHSSNDVSLVRGKSPGWEEFNLKQQNRGLQDRIDPKPFPPMPSALSSLPPRENLHGVNGCPGRS
SSSSPLPSADSLTSPENYIAKKILGDSSIQNGRKVVEETTDVLAFWKLKELHTWADFSLIVDIMEAVDNNFNEASTYLNKMVSSDNVEICNEMSTLGLHSADGLPCNGKN
DVTISLGRTVNNPIPSSTLKDVQDMHQNINDKLFENNYHERNFFHNVGNPKIALYCSKSAPIEPEWEEDDIYLSHRKDAIAMMRSASQHSRAATNAYLRKDHASAKYHSS
RAQEQWLAAKMLNAKAANEILQTRNSENGLWKLDLHGLHAAEAVQALQDHLLKIETRNASNRSLSPKKAERKGFHRVSSLEYLSCMGVKLDKELQSPLPRHRPTSLEVIT
GVGKHSRGEAALPKAVTSFLSENGYRFEQLRPGTISVRPKFRR
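
The CDS and protein sequences above can be cross-checked by translating the CDS with the standard genetic code. The sketch below is illustrative only: it assumes the standard nuclear code applies, builds the codon table from the conventional TCAG ordering, and translates just the first 30 and last 18 nucleotides of the CDS as copied from this record. The helper name `translate` is introduced here and is not part of any CuGenDB tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 nt and last 18 nt of the CDS listed in this record.
cds_head = "ATGTCGTGGGGGAGGGGTAAATCTCCTGGC"
cds_tail = "CCAAAGTTTCGTAGGTAA"

print(translate(cds_head))  # compare with the start of the protein sequence
print(translate(cds_tail))  # compare with the end, plus the stop codon
```

The two fragments translate to `MSWGRGKSPG` and `PKFRR*`, which agree with the start and end of the protein sequence listed above. Running the full CDS through `translate` would extend the same check to every residue.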