; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg033616 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg033616
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
Descriptioncell wall protein RBR3-like
Genome locationscaffold13:38737946..38744827
RNA-Seq ExpressionSpg033616
SyntenySpg033616
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6585587.1 hypothetical protein SDJN03_18320, partial [Cucurbita argyrosperma subsp. sororia]1.9e-20165.99Show/hide
Query:  MSYRQFRFRLPWQSVKASSHPADESSRRSSEPTDETKASSSVADTVPNIRHQPGQSPEIKPEQPPLAPAQAAEKSETMPPSKSHKEAKVHAQPPSHSRAK
        M+YRQFRFRLPWQS+KASS   +ESS RSSEPTDE + S+S ADTVP ++H         PE  PL  AQA E+SETM PSKSHK+AKVH+QP SHSRAK
Subjt:  MSYRQFRFRLPWQSVKASSHPADESSRRSSEPTDETKASSSVADTVPNIRHQPGQSPEIKPEQPPLAPAQAAEKSETMPPSKSHKEAKVHAQPPSHSRAK

Query:  NQTRTASKPPSPSKAIPQSSVALNKSPSASGKASPYHDTSKPSSPAGKASPSSKPSSPASKASPSQNTSKPSSLAGKASPSKDASKPSSLAGKASPSKDA
         QTRTA+KPPS SK  PQSSV+ NKSP+ S KASP HD SKPSS AG             K SPS +TSK SS AGK              GK SPS D 
Subjt:  NQTRTASKPPSPSKAIPQSSVALNKSPSASGKASPYHDTSKPSSPAGKASPSSKPSSPASKASPSQNTSKPSSLAGKASPSKDASKPSSLAGKASPSKDA

Query:  SVGKKLFGDWRAQILNIQNQVYFSTASKPSSPAGKASPSQDASKPSS--PAAPRSRIASKPPSPSQTSSKNHPNSKPTSQSRLKADS-----------QP
                                  SKPSSPAGKA PS+DAS+PSS   AAPRS+I SKPPSPSQTSSKNHP SKPTSQSRLKADS            P
Subjt:  SVGKKLFGDWRAQILNIQNQVYFSTASKPSSPAGKASPSQDASKPSS--PAAPRSRIASKPPSPSQTSSKNHPNSKPTSQSRLKADS-----------QP

Query:  QVSSMPRSPSQENSRQQRSKKTSRVQSPSHLSSKPIAQSTSPQHTKSPAAIGDQTTKEIVSHPANQSPQARPRSRESQLETKSKQSPKPDSKPVESKASK
        Q SS+PRSPS ENSRQQ SKK SRVQSPSHLSSKP AQSTS Q T+SPA I DQTTK +VSHPA+QSP AR + +E+Q++TKSKQSPKPD KPVE KASK
Subjt:  QVSSMPRSPSQENSRQQRSKKTSRVQSPSHLSSKPIAQSTSPQHTKSPAAIGDQTTKEIVSHPANQSPQARPRSRESQLETKSKQSPKPDSKPVESKASK

Query:  DEPETNEEPTLTSKNTSHPHSNQDSSEIPMQPNQTIENGPEPSLESQEESKENKERKNDKEDQARTINALQTSASKSTWITSSESPSPFEQADKDSKQEE
         +PET EE    SKNTS+PHS+QD SEIP+  ++T ENGPEPSLESQ ES+E+KE K+ +ED  +T NALQ +ASKS  ITS+E  SPFE  + DS+QE 
Subjt:  DEPETNEEPTLTSKNTSHPHSNQDSSEIPMQPNQTIENGPEPSLESQEESKENKERKNDKEDQARTINALQTSASKSTWITSSESPSPFEQADKDSKQEE

Query:  TMEDLSKAFQKLNIKYSDEENPKSFTTLIGDNKGASMHVHSGEAKRESSIHIHREYKSDPDQSPESSTDDEGNFNHETPQESRTEEDPPLELYININVQG
        TMEDLSKAFQ LNIKY  EENPKSFTTL GDNKGASMH+ SGEA +ES+IHIHR+YKSDPD+ PESSTD EGN N ETPQ+S+TEEDPPLELYININVQG
Subjt:  TMEDLSKAFQKLNIKYSDEENPKSFTTLIGDNKGASMHVHSGEAKRESSIHIHREYKSDPDQSPESSTDDEGNFNHETPQESRTEEDPPLELYININVQG

Query:  INNSILSNTSFTENNPGIKLKFVREPTRSEDELQSHDAREAKYTAKPAERLTYEPTVRRRCLRGLLMESSDSELDDPEKPRRHGCRFSRSCEGK
        INNS+LSN+SFTENNPGIKLKFV + T+SED+  S  A++AKYTAK  E  TYEPTVRRRCL GLLMESSDS+ D+ EKPRRHGCR+  S EGK
Subjt:  INNSILSNTSFTENNPGIKLKFVREPTRSEDELQSHDAREAKYTAKPAERLTYEPTVRRRCLRGLLMESSDSELDDPEKPRRHGCRFSRSCEGK

KAG7020502.1 hypothetical protein SDJN02_17187, partial [Cucurbita argyrosperma subsp. argyrosperma]6.6e-20266.14Show/hide
Query:  MSYRQFRFRLPWQSVKASSHPADESSRRSSEPTDETKASSSVADTVPNIRHQPGQSPEIKPEQPPLAPAQAAEKSETMPPSKSHKEAKVHAQPPSHSRAK
        M+YRQFRFRLPWQS+KASS   +ESS RSSEPTDE + S+S ADTVP ++H         PE  PL  AQA E+SETM PSKSHK+AKVH+QP SHSRAK
Subjt:  MSYRQFRFRLPWQSVKASSHPADESSRRSSEPTDETKASSSVADTVPNIRHQPGQSPEIKPEQPPLAPAQAAEKSETMPPSKSHKEAKVHAQPPSHSRAK

Query:  NQTRTASKPPSPSKAIPQSSVALNKSPSASGKASPYHDTSKPSSPAGKASPSSKPSSPASKASPSQNTSKPSSLAGKASPSKDASKPSSLAGKASPSKDA
         QTRTA+KPPS SK  PQSSV+ NKSP+ S KASP HD SKPSS AG             K SPS +TSK SS AGK              GK SPS D 
Subjt:  NQTRTASKPPSPSKAIPQSSVALNKSPSASGKASPYHDTSKPSSPAGKASPSSKPSSPASKASPSQNTSKPSSLAGKASPSKDASKPSSLAGKASPSKDA

Query:  SVGKKLFGDWRAQILNIQNQVYFSTASKPSSPAGKASPSQDASKPSS--PAAPRSRIASKPPSPSQTSSKNHPNSKPTSQSRLKADS-----------QP
                                  SKPSSPAGKA PS+DAS+PSS   AAPRS+I SKPPSPSQTSSKNHP SKPTSQSRLKADS            P
Subjt:  SVGKKLFGDWRAQILNIQNQVYFSTASKPSSPAGKASPSQDASKPSS--PAAPRSRIASKPPSPSQTSSKNHPNSKPTSQSRLKADS-----------QP

Query:  QVSSMPRSPSQENSRQQRSKKTSRVQSPSHLSSKPIAQSTSPQHTKSPAAIGDQTTKEIVSHPANQSPQARPRSRESQLETKSKQSPKPDSKPVESKASK
        Q SS+PRSPS ENSRQQ SKK S VQSPSHLSSKP AQSTS Q T+SPA IGDQTTK +VSHPA+QSP AR + RE+Q++TKSKQSPKPD KPVE KASK
Subjt:  QVSSMPRSPSQENSRQQRSKKTSRVQSPSHLSSKPIAQSTSPQHTKSPAAIGDQTTKEIVSHPANQSPQARPRSRESQLETKSKQSPKPDSKPVESKASK

Query:  DEPETNEEPTLTSKNTSHPHSNQDSSEIPMQPNQTIENGPEPSLESQEESKENKERKNDKEDQARTINALQTSASKSTWITSSESPSPFEQADKDSKQEE
         +PET EE    SKNTS+PHS+QD SEIP+  ++T ENGPEPSLESQ ES+E+KE K+ +ED  +T NALQ +ASKS  ITS+E  SPFE  + DS+QE 
Subjt:  DEPETNEEPTLTSKNTSHPHSNQDSSEIPMQPNQTIENGPEPSLESQEESKENKERKNDKEDQARTINALQTSASKSTWITSSESPSPFEQADKDSKQEE

Query:  TMEDLSKAFQKLNIKYSDEENPKSFTTLIGDNKGASMHVHSGEAKRESSIHIHREYKSDPDQSPESSTDDEGNFNHETPQESRTEEDPPLELYININVQG
        TMEDLSKAFQ LNIKY  EENPKSFTTL GDNKGASMH+ SGEA +ES+IHIHR+YKSDPD+ PESSTD EGN N ETPQ+S+TEEDPPLELYININVQG
Subjt:  TMEDLSKAFQKLNIKYSDEENPKSFTTLIGDNKGASMHVHSGEAKRESSIHIHREYKSDPDQSPESSTDDEGNFNHETPQESRTEEDPPLELYININVQG

Query:  INNSILSNTSFTENNPGIKLKFVREPTRSEDELQSHDAREAKYTAKPAERLTYEPTVRRRCLRGLLMESSDSELDDPEKPRRHGCRFSRSCEGK
        INNS+LSN+SFTENNPGIKLKFV + T+SED+  S  A++AKYTAK  E  TYEPTVRRRCL GLLMESSDS+ D+ EKPRRHGCR+  S EGK
Subjt:  INNSILSNTSFTENNPGIKLKFVREPTRSEDELQSHDAREAKYTAKPAERLTYEPTVRRRCLRGLLMESSDSELDDPEKPRRHGCRFSRSCEGK

XP_022951875.1 cell wall protein RBR3-like [Cucurbita moschata]7.0e-20466.62Show/hide
Query:  MSYRQFRFRLPWQSVKASSHPADESSRRSSEPTDETKASSSVADTVPNIRHQPGQSPEIKPEQPPLAPAQAAEKSETMPPSKSHKEAKVHAQPPSHSRAK
        M+YRQFRFRLPWQS+KASS   +ESS RSSEPTDE + S+S ADTVP ++H         PE  PL  AQA E+SETM PSKSHK+AKVH+QP SHSRAK
Subjt:  MSYRQFRFRLPWQSVKASSHPADESSRRSSEPTDETKASSSVADTVPNIRHQPGQSPEIKPEQPPLAPAQAAEKSETMPPSKSHKEAKVHAQPPSHSRAK

Query:  NQTRTASKPPSPSKAIPQSSVALNKSPSASGKASPYHDTSKPSSPAGKASPSSKPSSPASKASPSQNTSKPSSLAGKASPSKDASKPSSLAGKASPSKDA
         QTRTA+KPPS SK  PQSSV+ NKSP+ S KASP HD SKPSS AG             K SPS +TSK SS AGK              GK SPS D 
Subjt:  NQTRTASKPPSPSKAIPQSSVALNKSPSASGKASPYHDTSKPSSPAGKASPSSKPSSPASKASPSQNTSKPSSLAGKASPSKDASKPSSLAGKASPSKDA

Query:  SVGKKLFGDWRAQILNIQNQVYFSTASKPSSPAGKASPSQDASKPSS--PAAPRSRIASKPPSPSQTSSKNHPNSKPTSQSRLKADSQ----------PQ
                                  SKPSSPAGKA PS+DAS+PSS   AAPRS+I SKPPSPSQTSSKNHP SKPTSQSRLKADSQ          PQ
Subjt:  SVGKKLFGDWRAQILNIQNQVYFSTASKPSSPAGKASPSQDASKPSS--PAAPRSRIASKPPSPSQTSSKNHPNSKPTSQSRLKADSQ----------PQ

Query:  VSSMPRSPSQENSRQQRSKKTSRVQSPSHLSSKPIAQSTSPQHTKSPAAIGDQTTKEIVSHPANQSPQARPRSRESQLETKSKQSPKPDSKPVESKASKD
         SS+PRSPS ENSRQQ SKK SRVQSPSHLSSKP AQSTS Q T+SPA IGDQTTK +VSHPA+QSP+AR + RE+Q++TKSKQSPKPD KPVE KASK 
Subjt:  VSSMPRSPSQENSRQQRSKKTSRVQSPSHLSSKPIAQSTSPQHTKSPAAIGDQTTKEIVSHPANQSPQARPRSRESQLETKSKQSPKPDSKPVESKASKD

Query:  EPETNEEPTLTSKNTSHPHSNQDSSEIPMQPNQTIENGPEPSLESQEESKENKERKNDKEDQARTINALQTSASKSTWITSSESPSPFEQADKDSKQEET
        +PET EE    SKNTS+PHS+QD SEIP+  ++TIENGPE SLESQ ES+E+KE K+ +ED  +T NALQ +ASKS  ITS+E  SPFE  + DS+QE T
Subjt:  EPETNEEPTLTSKNTSHPHSNQDSSEIPMQPNQTIENGPEPSLESQEESKENKERKNDKEDQARTINALQTSASKSTWITSSESPSPFEQADKDSKQEET

Query:  MEDLSKAFQKLNIKYSDEENPKSFTTLIGDNKGASMHVHSGEAKRESSIHIHREYKSDPDQSPESSTDDEGNFNHETPQESRTEEDPPLELYININVQGI
        MEDLSKAFQ LNIKY  EENPKSFTTL GDNKGASMH+ SGEA +ESSIHIHR+YKSDPD+ PESSTD EGN N ETPQ+S+TEEDPPLELYININVQGI
Subjt:  MEDLSKAFQKLNIKYSDEENPKSFTTLIGDNKGASMHVHSGEAKRESSIHIHREYKSDPDQSPESSTDDEGNFNHETPQESRTEEDPPLELYININVQGI

Query:  NNSILSNTSFTENNPGIKLKFVREPTRSEDELQSHDAREAKYTAKPAERLTYEPTVRRRCLRGLLMESSDSELDDPEKPRRHGCRFSRSCEG
        NNS+LSN+SFTENNPGIKLKFV + T+SED+  S  A++AKYTAK  E  TYEPTVRRRCL GLLMESSDS+ D+ EKPRRHGCR+  S EG
Subjt:  NNSILSNTSFTENNPGIKLKFVREPTRSEDELQSHDAREAKYTAKPAERLTYEPTVRRRCLRGLLMESSDSELDDPEKPRRHGCRFSRSCEG

XP_023002262.1 cell wall protein RBR3-like [Cucurbita maxima]1.4e-20466.52Show/hide
Query:  MSYRQFRFRLPWQSVKASSHPADESSRRSSEPTDETKASSSVADTVPNIRHQPGQSPEIKPEQPPLAPAQAAEKSETMPPSKSHKEAKVHAQPPSHSRAK
        M+YRQFRFRLPWQS+KASS P +ESS RSSEPTDE + S+S ADTVP ++H P QS E KPE  PL  AQA E+SETM PSKSHK+AKVH+QP SHSRAK
Subjt:  MSYRQFRFRLPWQSVKASSHPADESSRRSSEPTDETKASSSVADTVPNIRHQPGQSPEIKPEQPPLAPAQAAEKSETMPPSKSHKEAKVHAQPPSHSRAK

Query:  NQTRTASKPPSPSKAIPQSSVALNKSPSASGKASPYHDTSKPSSPAGKASPSSKPSSPASKASPSQNTSKPSSLAGKASPSKDASKPSSLAGKASPSKDA
         QTRTA+KPPS SK  PQSSV+ NKSP+ S KASP HD SKPSS AG             K SPS +TSK SS AGK              GK SPS+D 
Subjt:  NQTRTASKPPSPSKAIPQSSVALNKSPSASGKASPYHDTSKPSSPAGKASPSSKPSSPASKASPSQNTSKPSSLAGKASPSKDASKPSSLAGKASPSKDA

Query:  SVGKKLFGDWRAQILNIQNQVYFSTASKPSSPAGKASPSQDASKPSS--PAAPRSRIASKPPSPSQTSSKNHPNSKPTSQSRLKADSQ----------PQ
        S+                          PS PAGKA PS+DAS+PSS   AAPRS I SKPPSPSQTSSKNH +SK TSQSRLKADSQ          PQ
Subjt:  SVGKKLFGDWRAQILNIQNQVYFSTASKPSSPAGKASPSQDASKPSS--PAAPRSRIASKPPSPSQTSSKNHPNSKPTSQSRLKADSQ----------PQ

Query:  VSSMPRSPSQENSRQQRSKKTSRVQSPSHLSSKPIAQSTSPQHTKSPAAIGDQTTKEIVSHPANQSPQARPRSRESQLETKSKQSPKPDSKPVESKASKD
         SS+PRSPS ENSRQQ SKK SRVQSPSHLSSK  AQSTS Q T+SPA IGDQTTK +VSHPA+QSP+AR +S+E+Q++TKSKQSPKPD KPVE KASK 
Subjt:  VSSMPRSPSQENSRQQRSKKTSRVQSPSHLSSKPIAQSTSPQHTKSPAAIGDQTTKEIVSHPANQSPQARPRSRESQLETKSKQSPKPDSKPVESKASKD

Query:  EPETNEEPTLTSKNTSHPHSNQDSSEIPMQPNQTIENGPEPSLESQEESKENKERKNDKEDQARTINALQTSASKSTWITSSESPSPFEQADKDSKQEET
        +PET EE    SKNTS+P SN+D SEIP+  ++TIENGPEPSLESQ ES+E+KE K+ +ED  +T NALQ +ASKS  ITS+E  SPFE  + DS+QE T
Subjt:  EPETNEEPTLTSKNTSHPHSNQDSSEIPMQPNQTIENGPEPSLESQEESKENKERKNDKEDQARTINALQTSASKSTWITSSESPSPFEQADKDSKQEET

Query:  MEDLSKAFQKLNIKYSDEENPKSFTTLIGDNKGASMHVHSGEAKRESSIHIHREYKSDPDQSPESSTDDEGNFNHETPQESRTEEDPPLELYININVQGI
        MEDL KAFQ LNIKY  EENPKSFTTL GDNKGASMH+ SGEA +ESSIHIHR+YKSDPD+ PESSTD EGN N ETPQ+S+TEEDPPLELYININVQGI
Subjt:  MEDLSKAFQKLNIKYSDEENPKSFTTLIGDNKGASMHVHSGEAKRESSIHIHREYKSDPDQSPESSTDDEGNFNHETPQESRTEEDPPLELYININVQGI

Query:  NNSILSNTSFTENNPGIKLKFVREPTRSEDELQSHDAREAKYTAKPAERLTYEPTVRRRCLRGLLMESSDSELDDPEKPRRHGCRFSRSCEGK
        NNS+LSN+SFTENNPGIKLKFV + T+SE++  S  A++AKYTAK  E  TYEPTVRRRCL GLLMESSDS+ D+ EKPRRHGCR+  S EGK
Subjt:  NNSILSNTSFTENNPGIKLKFVREPTRSEDELQSHDAREAKYTAKPAERLTYEPTVRRRCLRGLLMESSDSELDDPEKPRRHGCRFSRSCEGK

XP_023537866.1 cell wall protein RBR3-like [Cucurbita pepo subsp. pepo]1.7e-20266.52Show/hide
Query:  MSYRQFRFRLPWQSVKASSHPADESSRRSSEPTDETKASSSVADTVPNIRHQPGQSPEIKPEQPPLAPAQAAEKSETMPPSKSHKEAKVHAQPPSHSRAK
        M+ RQFRFRLPWQSVKASS P +ESS RSSEPTDE + S+S ADTVP ++H         PE  PL  AQA E SETM PSKSHK+AKVH+QP SHSRAK
Subjt:  MSYRQFRFRLPWQSVKASSHPADESSRRSSEPTDETKASSSVADTVPNIRHQPGQSPEIKPEQPPLAPAQAAEKSETMPPSKSHKEAKVHAQPPSHSRAK

Query:  NQTRTASKPPSPSKAIPQSSVALNKSPSASGKASPYHDTSKPSSPAGKASPSSKPSSPASKASPSQNTSKPSSLAGKASPSKDASKPSSLAGKASPSKDA
         QTRTA+KPPS SK  PQSSV+ NKSP+ S KASP HD SKPSS AG             K SPS +TSK SS AGK              GK SPS D 
Subjt:  NQTRTASKPPSPSKAIPQSSVALNKSPSASGKASPYHDTSKPSSPAGKASPSSKPSSPASKASPSQNTSKPSSLAGKASPSKDASKPSSLAGKASPSKDA

Query:  SVGKKLFGDWRAQILNIQNQVYFSTASKPSSPAGKASPSQDASKPSS--PAAPRSRIASKPPSPSQTSSKNHPNSKPTSQSRLKADSQ----------PQ
                                  SKPSSPAGKA PS+DAS+PSS   AAPRS+I SKPPSPSQTSSKNHP SKPTSQSRLKADSQ          PQ
Subjt:  SVGKKLFGDWRAQILNIQNQVYFSTASKPSSPAGKASPSQDASKPSS--PAAPRSRIASKPPSPSQTSSKNHPNSKPTSQSRLKADSQ----------PQ

Query:  VSSMPRSPSQENSRQQRSKKTSRVQSPSHLSSKPIAQSTSPQHTKSPAAIGDQTTKEIVSHPANQSPQARPRSRESQLETKSKQSPKPDSKPVESKASKD
         SS+PRSPS ENSRQQ SKK SRVQSPSHLSSKP AQSTS Q T+SPA IGDQTTK +VSHPA+QSP+AR +SRE+Q++TKSKQSPKPD KPVE KASK 
Subjt:  VSSMPRSPSQENSRQQRSKKTSRVQSPSHLSSKPIAQSTSPQHTKSPAAIGDQTTKEIVSHPANQSPQARPRSRESQLETKSKQSPKPDSKPVESKASKD

Query:  EPETNEEPTLTSKNTSHPHSNQDSSEIPMQPNQTIENGPEPSLESQEESKENKERKNDKEDQARTINALQTSASKSTWITSSESPSPFEQADKDSKQEET
        +PET EE    SKNTS+PHS+QD SEIP+  ++TIENGPE SLESQ ES+E+KE K+ +ED  +T NALQ +ASKS  ITS+E  SPFE  + DS+QE T
Subjt:  EPETNEEPTLTSKNTSHPHSNQDSSEIPMQPNQTIENGPEPSLESQEESKENKERKNDKEDQARTINALQTSASKSTWITSSESPSPFEQADKDSKQEET

Query:  MEDLSKAFQKLNIKYSDEENPKSFTTLIGDNKGASMHVHSGEAKRESSIHIHREYKSDPDQSPESSTDDEGNFNHETPQESRTEEDPPLELYININVQGI
        MEDLSKAFQ LNIKY  EENPKSFTTL GDNKGASMH+ SGEA +ESSIHIHR+YKSDPD+ PESSTD EGN N ETPQ+S+TEEDPPLELYININVQGI
Subjt:  MEDLSKAFQKLNIKYSDEENPKSFTTLIGDNKGASMHVHSGEAKRESSIHIHREYKSDPDQSPESSTDDEGNFNHETPQESRTEEDPPLELYININVQGI

Query:  NNSILSNTSFTENNPGIKLKFVREPTRSEDELQSHDAREAKYTAKPAERLTYEPTVRRRCLRGLLMESSDSELDDPEKPRRHGCRFSRSCEGK
        NNS+LSN+SFTENNPGIKL FV + T+SED+  S  A++AKYTAK  E   YE TVRRRCL GLLMESSDS+ D+ EKPRRHGCR+  S EGK
Subjt:  NNSILSNTSFTENNPGIKLKFVREPTRSEDELQSHDAREAKYTAKPAERLTYEPTVRRRCLRGLLMESSDSELDDPEKPRRHGCRFSRSCEGK

TrEMBL top hitse value%identityAlignment
A0A0A0LLH1 Uncharacterized protein2.5e-11464.04Show/hide
Query:  PRSPSQENSRQQRSKKTSRVQSPSHLSSKPIAQSTSPQHTKSPAAIGDQTTKEIVSHPANQSPQARPRSRESQLETKSKQSPKPDSKPVESKASKDEPET
        PRSPSQENS Q  S+KT RVQSPSHLS KP AQSTS Q  +  A+IGDQTT  I+S PAN SP+A P S ESQ++ +SK+SPKP+ KPVE + SK + ET
Subjt:  PRSPSQENSRQQRSKKTSRVQSPSHLSSKPIAQSTSPQHTKSPAAIGDQTTKEIVSHPANQSPQARPRSRESQLETKSKQSPKPDSKPVESKASKDEPET

Query:  NEEPT--------LTSKNTSHPHSNQDSSEIPMQPNQTIENGPEPSLESQEESKENKERKNDKEDQARTINALQTSASKSTWITSSESPSPFEQADKDSK
         EE T        L SKNTS+PHS +DSSE P Q +Q IE G + SLESQ ESKE       KED A+T NA QT AS+ST ITSS+S S FE  + +++
Subjt:  NEEPT--------LTSKNTSHPHSNQDSSEIPMQPNQTIENGPEPSLESQEESKENKERKNDKEDQARTINALQTSASKSTWITSSESPSPFEQADKDSK

Query:  QEETMEDLSKAFQKLNIKYSDEENPKSFTTLIGDNKGASMHVHSGEAKRESSIHIHREYKSDPDQSPESSTDDEGNFNHETPQESRTEED---PPLELYI
        Q+E+MEDLSKAF KLNIKYSDEENPKS TT+IGDNKG SMH+ S EAK ESSIH++  YKS+PDQSPESSTD + N N+ET ++S TEE+   PPLELYI
Subjt:  QEETMEDLSKAFQKLNIKYSDEENPKSFTTLIGDNKGASMHVHSGEAKRESSIHIHREYKSDPDQSPESSTDDEGNFNHETPQESRTEED---PPLELYI

Query:  NINVQGINNSILSNTSFTENNPGIKLKFVREPTRSEDELQS-HDAREAKYTAKPAERLTYEPTVRRRCLRGLLMESSDSELDDPEKPRRHGCRFSRSCEG
        N+NVQGINNSI  NTSFTENNPGIKLKF  EPT  +DEL+S H  R++KY A PAE++TY+P +RRRCL GLLMESSDSE ++P K + HGCR+S S +G
Subjt:  NINVQGINNSILSNTSFTENNPGIKLKFVREPTRSEDELQS-HDAREAKYTAKPAERLTYEPTVRRRCLRGLLMESSDSELDDPEKPRRHGCRFSRSCEG

Query:  KEVEIL
        KEVE L
Subjt:  KEVEIL

A0A5A7VAN0 Flocculation protein FLO117.6e-16453.07Show/hide
Query:  MSYRQFRFRLPWQSVKASSHPADESSRRSSEPTDETKASSSVADTVPNIRHQPGQSPEIKPEQPPLAPAQAAEKSETMPPSKSHKEAKVHAQPPSHSRAK
        MS  Q R  LPWQS+KAS  PA+ES   S  PTDE+++S+S ADT PNIRHQP QSPEIKPE+PPLA AQAAE+SETMPPSKSHKE K+H+Q  ++SRAK
Subjt:  MSYRQFRFRLPWQSVKASSHPADESSRRSSEPTDETKASSSVADTVPNIRHQPGQSPEIKPEQPPLAPAQAAEKSETMPPSKSHKEAKVHAQPPSHSRAK

Query:  NQTRTASKPPSPSKAIPQSSVALNKSPSASGKASPYHDTSKPSSPAGKA-SP----SSKPSSPASKAS-----PSQNTSKPSSLAGKASPS---------
        N++RTASKP SP  AIPQS +A NK PS SGK S   D+SKPSSPAGK  SP    SSKPSSPA+ A+      S+ +S  S  + K  PS         
Subjt:  NQTRTASKPPSPSKAIPQSSVALNKSPSASGKASPYHDTSKPSSPAGKA-SP----SSKPSSPASKAS-----PSQNTSKPSSLAGKASPS---------

Query:  KDASKPSSLAGKASPSKDASVGKKLFGDWRAQ----------------------------------ILNIQ--------NQVYFSTASKPSSPAGKASPS
        K  S+PSS +  A PS+D S+  +     +++                                   + IQ        +Q  F T S+PS  +    PS
Subjt:  KDASKPSSLAGKASPSKDASVGKKLFGDWRAQ----------------------------------ILNIQ--------NQVYFSTASKPSSPAGKASPS

Query:  QDASKP------------------------SSPAAPRSRIASKPP--SPSQTSSKNHPNSKPTSQSRLKADSQPQVSSM-----------PRSPSQENSR
        QD S P                         S   P ++  SK P  SP+    ++HPN KP+SQSR KA+S+P  SS            PRSPSQENS 
Subjt:  QDASKP------------------------SSPAAPRSRIASKPP--SPSQTSSKNHPNSKPTSQSRLKADSQPQVSSM-----------PRSPSQENSR

Query:  QQRSKKTSRVQSPSHLSSKPIAQSTSPQHTKSPAAIGDQTTKEIVSHPANQSPQARPRSRESQLETKSKQSPKPDSKPVESKASKDEPETNEEPT-----
        Q  S+KTSRVQSPS+LS KP A STS Q  +S A+IGDQTT  I+S PA  SP+A P S E Q++ KSK+SP+P+ KPVE KASK++ +T EE T     
Subjt:  QQRSKKTSRVQSPSHLSSKPIAQSTSPQHTKSPAAIGDQTTKEIVSHPANQSPQARPRSRESQLETKSKQSPKPDSKPVESKASKDEPETNEEPT-----

Query:  ---LTSKNTSHPHSNQDSSEIPMQPNQTIENGPEPSLESQEESKENKERKNDKEDQARTINALQTSASKSTWITSSESPSPFEQADKDSKQEETMEDLSK
           L SKNTS+PHS++DSSE P Q ++T+E G + SLESQ ESKE       KED  +T NALQ  AS+ST ITSS+S S FE  +K+++Q+E+MEDLSK
Subjt:  ---LTSKNTSHPHSNQDSSEIPMQPNQTIENGPEPSLESQEESKENKERKNDKEDQARTINALQTSASKSTWITSSESPSPFEQADKDSKQEETMEDLSK

Query:  AFQKLNIKYSDEENPKSFTTLIGDNKGASMHVHSGEAKRESSIHIHREYKSDPDQSPESSTDDEGNFNHETPQESRTEED---PPLELYININVQGINNS
        AF KLNIKYSDEENPKSFTT+IGDNKG+S+H+ SGEAK ESSIH++  YKS+PDQSP+SST+ + N N+ETPQ+S TEE+   PPLELYIN NVQGINNS
Subjt:  AFQKLNIKYSDEENPKSFTTLIGDNKGASMHVHSGEAKRESSIHIHREYKSDPDQSPESSTDDEGNFNHETPQESRTEED---PPLELYININVQGINNS

Query:  ILSNTSFTENNPGIKLKFV--REPTRSEDELQSHDAREAKYTAKPAERLTYEPTVRRRCLRGLLMESSDSELDDPEKPRRHGCRFSRSCEGKEVEIL
        I+ NTSFTENNPGIKLKF    EPT S+DEL+SH  R++ Y   PAE++TYEP +RRR L GLLMES DSE ++P K R HGCR+SRS +GK+VE L
Subjt:  ILSNTSFTENNPGIKLKFV--REPTRSEDELQSHDAREAKYTAKPAERLTYEPTVRRRCLRGLLMESSDSELDDPEKPRRHGCRFSRSCEGKEVEIL

A0A6J1CRH0 cell wall protein RBR32.4e-12553.61Show/hide
Query:  MPPSKSHKEAKVHAQPPSHSRAKNQTRTASKPPSPSKAIPQSSVALNKSPSASGKASPYHDTSKPSSPAGKASPSSKPSSPASKASPSQNTSKPSSLAGK
        MPPS+S KE++VH+  PS+SRAKNQ R ASK PS  KA P  +VA NKS                            PSSPAS               GK
Subjt:  MPPSKSHKEAKVHAQPPSHSRAKNQTRTASKPPSPSKAIPQSSVALNKSPSASGKASPYHDTSKPSSPAGKASPSSKPSSPASKASPSQNTSKPSSLAGK

Query:  ASPSKDASKPSSLAGKASPSKDASVGKKLFGDWRAQILNIQNQVYFSTASKPSSPAGKASPSQDASKPSSPAAPRSRIASKPPSPSQTSSKNHPNSKPTS
        ASPSKDASKPSS A                                                         AAPR RI+S PPSPSQTSS+NH N KPTS
Subjt:  ASPSKDASKPSSLAGKASPSKDASVGKKLFGDWRAQILNIQNQVYFSTASKPSSPAGKASPSQDASKPSSPAAPRSRIASKPPSPSQTSSKNHPNSKPTS

Query:  -QSRLKADSQ----------PQVSSMPRSPSQENSRQQRSKKTSRVQSPSHLSSKPIAQSTSPQHTKSPAAIGDQTTKEIVSHPANQSPQARPRSRESQL
         QS+LKADSQ          PQ SS  RSPSQ NS+QQ SKKT+             AQSTS QHT   AA  DQTT  + SH AN+S QARP+ RESQ 
Subjt:  -QSRLKADSQ----------PQVSSMPRSPSQENSRQQRSKKTSRVQSPSHLSSKPIAQSTSPQHTKSPAAIGDQTTKEIVSHPANQSPQARPRSRESQL

Query:  ETKSKQSPKPDSKPVESKASKDEPETNEEPTLTSKNTSHPHSNQDSSEIPMQPNQTIENGPEPSLESQEESKENKERKNDKEDQARTINALQTSASKSTW
        +TKSKQSPK       SKASK++P+  EE  LTSKNTS+P SNQ+SSE P + +Q+IENG +PSLESQ ESKE      D+E + +  NA  T    ST 
Subjt:  ETKSKQSPKPDSKPVESKASKDEPETNEEPTLTSKNTSHPHSNQDSSEIPMQPNQTIENGPEPSLESQEESKENKERKNDKEDQARTINALQTSASKSTW

Query:  ITSSESPSPFEQ-ADKDSKQEETM--EDLSKAFQKLNIKYSDEENPKSFTTLIGDNKGASMHVHSGEAKRESSIHIHREYKSDPDQSPESSTDDEGNFNH
        I+SSES SP+E+  D+DS+++E M   D+SKAF KL I YS EENPKSF TLIGDNKG SM++ SG+  RESSIHI REY+S+PDQSP+SST+ EGNFNH
Subjt:  ITSSESPSPFEQ-ADKDSKQEETM--EDLSKAFQKLNIKYSDEENPKSFTTLIGDNKGASMHVHSGEAKRESSIHIHREYKSDPDQSPESSTDDEGNFNH

Query:  ETPQESRTEEDPPLELYININVQGINNSILSNTSFTENNPGIKLKFVREPTRSEDELQSHDAREAKYTAKPAERLTYEPTVRRRCLRGLLMESSDSELDD
        +T ++SRT EDPPL LYIN N QGINNSILSN+SFTE NPG +LKF REPT+SE+  +S   ++AKY AKPAERLTY+PTVRRRCLRGL MESSDSE ++
Subjt:  ETPQESRTEEDPPLELYININVQGINNSILSNTSFTENNPGIKLKFVREPTRSEDELQSHDAREAKYTAKPAERLTYEPTVRRRCLRGLLMESSDSELDD

Query:  PEKPRRHGCRFSRSCEGKEVEIL
        PEKPRRHGCR+S +C+GK+ EI+
Subjt:  PEKPRRHGCRFSRSCEGKEVEIL

A0A6J1GK50 cell wall protein RBR3-like3.4e-20466.62Show/hide
Query:  MSYRQFRFRLPWQSVKASSHPADESSRRSSEPTDETKASSSVADTVPNIRHQPGQSPEIKPEQPPLAPAQAAEKSETMPPSKSHKEAKVHAQPPSHSRAK
        M+YRQFRFRLPWQS+KASS   +ESS RSSEPTDE + S+S ADTVP ++H         PE  PL  AQA E+SETM PSKSHK+AKVH+QP SHSRAK
Subjt:  MSYRQFRFRLPWQSVKASSHPADESSRRSSEPTDETKASSSVADTVPNIRHQPGQSPEIKPEQPPLAPAQAAEKSETMPPSKSHKEAKVHAQPPSHSRAK

Query:  NQTRTASKPPSPSKAIPQSSVALNKSPSASGKASPYHDTSKPSSPAGKASPSSKPSSPASKASPSQNTSKPSSLAGKASPSKDASKPSSLAGKASPSKDA
         QTRTA+KPPS SK  PQSSV+ NKSP+ S KASP HD SKPSS AG             K SPS +TSK SS AGK              GK SPS D 
Subjt:  NQTRTASKPPSPSKAIPQSSVALNKSPSASGKASPYHDTSKPSSPAGKASPSSKPSSPASKASPSQNTSKPSSLAGKASPSKDASKPSSLAGKASPSKDA

Query:  SVGKKLFGDWRAQILNIQNQVYFSTASKPSSPAGKASPSQDASKPSS--PAAPRSRIASKPPSPSQTSSKNHPNSKPTSQSRLKADSQ----------PQ
                                  SKPSSPAGKA PS+DAS+PSS   AAPRS+I SKPPSPSQTSSKNHP SKPTSQSRLKADSQ          PQ
Subjt:  SVGKKLFGDWRAQILNIQNQVYFSTASKPSSPAGKASPSQDASKPSS--PAAPRSRIASKPPSPSQTSSKNHPNSKPTSQSRLKADSQ----------PQ

Query:  VSSMPRSPSQENSRQQRSKKTSRVQSPSHLSSKPIAQSTSPQHTKSPAAIGDQTTKEIVSHPANQSPQARPRSRESQLETKSKQSPKPDSKPVESKASKD
         SS+PRSPS ENSRQQ SKK SRVQSPSHLSSKP AQSTS Q T+SPA IGDQTTK +VSHPA+QSP+AR + RE+Q++TKSKQSPKPD KPVE KASK 
Subjt:  VSSMPRSPSQENSRQQRSKKTSRVQSPSHLSSKPIAQSTSPQHTKSPAAIGDQTTKEIVSHPANQSPQARPRSRESQLETKSKQSPKPDSKPVESKASKD

Query:  EPETNEEPTLTSKNTSHPHSNQDSSEIPMQPNQTIENGPEPSLESQEESKENKERKNDKEDQARTINALQTSASKSTWITSSESPSPFEQADKDSKQEET
        +PET EE    SKNTS+PHS+QD SEIP+  ++TIENGPE SLESQ ES+E+KE K+ +ED  +T NALQ +ASKS  ITS+E  SPFE  + DS+QE T
Subjt:  EPETNEEPTLTSKNTSHPHSNQDSSEIPMQPNQTIENGPEPSLESQEESKENKERKNDKEDQARTINALQTSASKSTWITSSESPSPFEQADKDSKQEET

Query:  MEDLSKAFQKLNIKYSDEENPKSFTTLIGDNKGASMHVHSGEAKRESSIHIHREYKSDPDQSPESSTDDEGNFNHETPQESRTEEDPPLELYININVQGI
        MEDLSKAFQ LNIKY  EENPKSFTTL GDNKGASMH+ SGEA +ESSIHIHR+YKSDPD+ PESSTD EGN N ETPQ+S+TEEDPPLELYININVQGI
Subjt:  MEDLSKAFQKLNIKYSDEENPKSFTTLIGDNKGASMHVHSGEAKRESSIHIHREYKSDPDQSPESSTDDEGNFNHETPQESRTEEDPPLELYININVQGI

Query:  NNSILSNTSFTENNPGIKLKFVREPTRSEDELQSHDAREAKYTAKPAERLTYEPTVRRRCLRGLLMESSDSELDDPEKPRRHGCRFSRSCEG
        NNS+LSN+SFTENNPGIKLKFV + T+SED+  S  A++AKYTAK  E  TYEPTVRRRCL GLLMESSDS+ D+ EKPRRHGCR+  S EG
Subjt:  NNSILSNTSFTENNPGIKLKFVREPTRSEDELQSHDAREAKYTAKPAERLTYEPTVRRRCLRGLLMESSDSELDDPEKPRRHGCRFSRSCEG

A0A6J1KJ10 cell wall protein RBR3-like6.8e-20566.52Show/hide
Query:  MSYRQFRFRLPWQSVKASSHPADESSRRSSEPTDETKASSSVADTVPNIRHQPGQSPEIKPEQPPLAPAQAAEKSETMPPSKSHKEAKVHAQPPSHSRAK
        M+YRQFRFRLPWQS+KASS P +ESS RSSEPTDE + S+S ADTVP ++H P QS E KPE  PL  AQA E+SETM PSKSHK+AKVH+QP SHSRAK
Subjt:  MSYRQFRFRLPWQSVKASSHPADESSRRSSEPTDETKASSSVADTVPNIRHQPGQSPEIKPEQPPLAPAQAAEKSETMPPSKSHKEAKVHAQPPSHSRAK

Query:  NQTRTASKPPSPSKAIPQSSVALNKSPSASGKASPYHDTSKPSSPAGKASPSSKPSSPASKASPSQNTSKPSSLAGKASPSKDASKPSSLAGKASPSKDA
         QTRTA+KPPS SK  PQSSV+ NKSP+ S KASP HD SKPSS AG             K SPS +TSK SS AGK              GK SPS+D 
Subjt:  NQTRTASKPPSPSKAIPQSSVALNKSPSASGKASPYHDTSKPSSPAGKASPSSKPSSPASKASPSQNTSKPSSLAGKASPSKDASKPSSLAGKASPSKDA

Query:  SVGKKLFGDWRAQILNIQNQVYFSTASKPSSPAGKASPSQDASKPSS--PAAPRSRIASKPPSPSQTSSKNHPNSKPTSQSRLKADSQ----------PQ
        S+                          PS PAGKA PS+DAS+PSS   AAPRS I SKPPSPSQTSSKNH +SK TSQSRLKADSQ          PQ
Subjt:  SVGKKLFGDWRAQILNIQNQVYFSTASKPSSPAGKASPSQDASKPSS--PAAPRSRIASKPPSPSQTSSKNHPNSKPTSQSRLKADSQ----------PQ

Query:  VSSMPRSPSQENSRQQRSKKTSRVQSPSHLSSKPIAQSTSPQHTKSPAAIGDQTTKEIVSHPANQSPQARPRSRESQLETKSKQSPKPDSKPVESKASKD
         SS+PRSPS ENSRQQ SKK SRVQSPSHLSSK  AQSTS Q T+SPA IGDQTTK +VSHPA+QSP+AR +S+E+Q++TKSKQSPKPD KPVE KASK 
Subjt:  VSSMPRSPSQENSRQQRSKKTSRVQSPSHLSSKPIAQSTSPQHTKSPAAIGDQTTKEIVSHPANQSPQARPRSRESQLETKSKQSPKPDSKPVESKASKD

Query:  EPETNEEPTLTSKNTSHPHSNQDSSEIPMQPNQTIENGPEPSLESQEESKENKERKNDKEDQARTINALQTSASKSTWITSSESPSPFEQADKDSKQEET
        +PET EE    SKNTS+P SN+D SEIP+  ++TIENGPEPSLESQ ES+E+KE K+ +ED  +T NALQ +ASKS  ITS+E  SPFE  + DS+QE T
Subjt:  EPETNEEPTLTSKNTSHPHSNQDSSEIPMQPNQTIENGPEPSLESQEESKENKERKNDKEDQARTINALQTSASKSTWITSSESPSPFEQADKDSKQEET

Query:  MEDLSKAFQKLNIKYSDEENPKSFTTLIGDNKGASMHVHSGEAKRESSIHIHREYKSDPDQSPESSTDDEGNFNHETPQESRTEEDPPLELYININVQGI
        MEDL KAFQ LNIKY  EENPKSFTTL GDNKGASMH+ SGEA +ESSIHIHR+YKSDPD+ PESSTD EGN N ETPQ+S+TEEDPPLELYININVQGI
Subjt:  MEDLSKAFQKLNIKYSDEENPKSFTTLIGDNKGASMHVHSGEAKRESSIHIHREYKSDPDQSPESSTDDEGNFNHETPQESRTEEDPPLELYININVQGI

Query:  NNSILSNTSFTENNPGIKLKFVREPTRSEDELQSHDAREAKYTAKPAERLTYEPTVRRRCLRGLLMESSDSELDDPEKPRRHGCRFSRSCEGK
        NNS+LSN+SFTENNPGIKLKFV + T+SE++  S  A++AKYTAK  E  TYEPTVRRRCL GLLMESSDS+ D+ EKPRRHGCR+  S EGK
Subjt:  NNSILSNTSFTENNPGIKLKFVREPTRSEDELQSHDAREAKYTAKPAERLTYEPTVRRRCLRGLLMESSDSELDDPEKPRRHGCRFSRSCEGK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G75260.1 oxidoreductases, acting on NADH or NADPH7.4e-1828.39Show/hide
Query:  SPYHDTSKPSSPAGKASPSSKPS-SPASKASPSQNTSKPSSLAGKASP--SKDASKPSSLAGKASPSKDASVGKKLFGDWRAQILNIQNQVYFSTASKPS
        SP   +S  SSP+   +P S+P   PA  A PS++ +KP     KASP  S+  S  ++LA  +S S+  S+G                    +T ++ +
Subjt:  SPYHDTSKPSSPAGKASPSSKPS-SPASKASPSQNTSKPSSLAGKASP--SKDASKPSSLAGKASPSKDASVGKKLFGDWRAQILNIQNQVYFSTASKPS

Query:  SPAGKASPSQDASKPSSPAAPRSRIASKPPSPSQTSSKNHPNSKPTSQSRLKADSQPQVSSMPRSPSQENSR---QQRSKKTSRVQSPSHLSSKPIAQST
            + S S  + K  S      ++A+K   P +T      N  P  +       +P + + P    ++      Q++ + T   +     + K + + +
Subjt:  SPAGKASPSQDASKPSSPAAPRSRIASKPPSPSQTSSKNHPNSKPTSQSRLKADSQPQVSSMPRSPSQENSR---QQRSKKTSRVQSPSHLSSKPIAQST

Query:  SPQHTKSPAAIGDQTTKEIVSHPANQSPQARPRSRESQLET-KSKQSPKPDSKPVESKASKDEPETNEEPTLTSKNTSHPHSNQDSSEIPMQPNQTIENG
             KS A  G Q  +EI      +  +        +LE  + +Q  K   K    +  +       E    SK T H  +  +++            G
Subjt:  SPQHTKSPAAIGDQTTKEIVSHPANQSPQARPRSRESQLET-KSKQSPKPDSKPVESKASKDEPETNEEPTLTSKNTSHPHSNQDSSEIPMQPNQTIENG

Query:  PEPSLESQEESKENKERKNDKEDQARTINALQTSASKSTWITSSESPSPFEQADKDSKQEETMEDLSKAFQKLNI-KYSDEENPKSFTTLIGDNKGASMH
        P    E + E++   E   D   Q +T  AL TS   +  +T+ E  S        S   +  ED+     KL   K + ++   S  TL G+NKGA+M 
Subjt:  PEPSLESQEESKENKERKNDKEDQARTINALQTSASKSTWITSSESPSPFEQADKDSKQEETMEDLSKAFQKLNI-KYSDEENPKSFTTLIGDNKGASMH

Query:  VHSGEAKRESSIHIHREYKSDPDQSPESSTDDEGNFNHETPQESRTEEDPPLELYININVQGINNSILSNTSFTENNPGIKLKFVREPTRSEDELQSHDA
        + S + K++  +HI R Y+S+PD+S  ++  +      E P++   EE+     YIN N QGINNSI+  +S +EN+PG+ + F  E  + E      + 
Subjt:  VHSGEAKRESSIHIHREYKSDPDQSPESSTDDEGNFNHETPQESRTEEDPPLELYININVQGINNSILSNTSFTENNPGIKLKFVREPTRSEDELQSHDA

Query:  REAK-YTAKPAERLTYEPTVRRRCLRGLLMESSDSELDDPEKPRRHGCRFSRSCEGKEVE
         E K  T    ++L  EP VRRRCLRGLL ESS+SE D+P KPRRHGCRF  +C+ K++E
Subjt:  REAK-YTAKPAERLTYEPTVRRRCLRGLLMESSDSELDDPEKPRRHGCRFSRSCEGKEVE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCATACCGACAATTTCGCTTTCGACTTCCTTGGCAATCTGTAAAAGCTTCTTCTCATCCTGCAGATGAGTCATCAAGACGCAGTTCTGAGCCTACAGATGAAACCAA
AGCTTCTTCTTCAGTGGCTGATACCGTGCCAAATATTCGGCATCAACCAGGCCAGTCTCCTGAGATAAAACCAGAGCAGCCTCCTCTAGCACCAGCTCAGGCAGCTGAAA
AAAGTGAAACTATGCCACCTTCAAAATCTCACAAGGAAGCCAAAGTTCATGCTCAACCACCATCACATTCCCGAGCCAAAAACCAAACCCGAACGGCTTCGAAGCCTCCA
TCACCATCGAAAGCAATCCCTCAATCCTCAGTTGCTTTGAACAAGTCTCCATCAGCATCAGGCAAAGCCTCTCCATATCATGATACTTCAAAGCCTTCATCACCAGCAGG
TAAAGCCTCTCCATCTTCAAAACCTTCATCACCAGCAAGCAAAGCCTCTCCATCTCAAAATACTTCAAAGCCTTCATCACTAGCAGGCAAAGCCTCCCCATCTAAGGATG
CTTCAAAGCCTTCATCACTAGCAGGCAAAGCCTCCCCATCTAAGGATGCTTCTGTTGGAAAGAAACTTTTTGGAGATTGGAGGGCCCAAATCCTAAATATCCAAAATCAA
GTTTATTTTTCAACAGCTTCAAAGCCTTCATCACCAGCAGGCAAAGCCTCTCCATCTCAGGATGCTTCAAAGCCTTCATCACCAGCAGCTCCTCGATCCCGAATTGCTTC
GAAGCCACCGTCTCCATCTCAAACATCCAGTAAAAACCATCCAAATTCAAAACCAACATCACAATCGAGACTGAAAGCTGATTCTCAACCTCAAGTTTCTTCAATGCCGC
GGTCACCATCTCAAGAAAATTCTCGACAACAACGATCGAAAAAAACCTCCCGAGTTCAGTCTCCATCTCATCTGTCCAGTAAACCTATTGCACAATCAACATCACCACAG
CATACCAAATCTCCTGCAGCCATTGGAGACCAAACAACAAAAGAAATTGTTTCTCATCCTGCTAATCAATCGCCACAAGCAAGACCTAGAAGCAGGGAAAGTCAGTTGGA
AACGAAATCGAAGCAGTCTCCAAAACCCGACTCGAAACCAGTGGAGTCCAAAGCATCAAAAGATGAGCCTGAAACCAATGAAGAGCCCACACTCACATCTAAGAACACTT
CCCATCCCCATTCAAACCAGGACTCTTCTGAAATCCCAATGCAACCCAATCAAACCATTGAAAATGGTCCAGAGCCCTCTCTAGAATCACAGGAAGAGTCAAAGGAAAAT
AAGGAAAGAAAGAATGACAAGGAAGATCAGGCAAGAACAATCAATGCACTTCAAACCAGTGCATCTAAAAGCACATGGATCACATCTTCCGAAAGCCCTTCACCATTTGA
ACAAGCAGATAAGGACTCAAAACAGGAAGAAACCATGGAAGACTTATCAAAAGCTTTTCAGAAACTAAACATCAAATATTCAGACGAAGAAAATCCAAAAAGTTTCACAA
CACTCATCGGCGATAACAAAGGGGCTTCAATGCACGTACATTCCGGTGAAGCCAAGAGAGAGAGTTCAATCCACATCCACCGTGAGTACAAGAGCGATCCAGATCAAAGC
CCTGAAAGTTCCACAGACGACGAAGGAAACTTCAATCACGAAACACCTCAAGAATCAAGAACAGAAGAGGATCCACCACTGGAATTATACATAAACATCAACGTACAAGG
TATCAACAACTCAATCCTGTCGAATACCTCATTCACTGAGAATAATCCTGGAATCAAGTTGAAATTCGTTCGAGAACCAACTAGATCTGAAGATGAATTACAGTCTCATG
ACGCTCGAGAGGCCAAGTATACTGCGAAACCTGCCGAGAGGCTTACATATGAGCCCACAGTAAGAAGAAGATGCCTTAGAGGGCTGTTAATGGAGTCGAGCGATTCTGAG
CTCGACGATCCAGAAAAGCCCCGACGCCATGGCTGCCGCTTCAGTCGAAGTTGCGAAGGAAAAGAGGTCGAAATTCTGTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCATACCGACAATTTCGCTTTCGACTTCCTTGGCAATCTGTAAAAGCTTCTTCTCATCCTGCAGATGAGTCATCAAGACGCAGTTCTGAGCCTACAGATGAAACCAA
AGCTTCTTCTTCAGTGGCTGATACCGTGCCAAATATTCGGCATCAACCAGGCCAGTCTCCTGAGATAAAACCAGAGCAGCCTCCTCTAGCACCAGCTCAGGCAGCTGAAA
AAAGTGAAACTATGCCACCTTCAAAATCTCACAAGGAAGCCAAAGTTCATGCTCAACCACCATCACATTCCCGAGCCAAAAACCAAACCCGAACGGCTTCGAAGCCTCCA
TCACCATCGAAAGCAATCCCTCAATCCTCAGTTGCTTTGAACAAGTCTCCATCAGCATCAGGCAAAGCCTCTCCATATCATGATACTTCAAAGCCTTCATCACCAGCAGG
TAAAGCCTCTCCATCTTCAAAACCTTCATCACCAGCAAGCAAAGCCTCTCCATCTCAAAATACTTCAAAGCCTTCATCACTAGCAGGCAAAGCCTCCCCATCTAAGGATG
CTTCAAAGCCTTCATCACTAGCAGGCAAAGCCTCCCCATCTAAGGATGCTTCTGTTGGAAAGAAACTTTTTGGAGATTGGAGGGCCCAAATCCTAAATATCCAAAATCAA
GTTTATTTTTCAACAGCTTCAAAGCCTTCATCACCAGCAGGCAAAGCCTCTCCATCTCAGGATGCTTCAAAGCCTTCATCACCAGCAGCTCCTCGATCCCGAATTGCTTC
GAAGCCACCGTCTCCATCTCAAACATCCAGTAAAAACCATCCAAATTCAAAACCAACATCACAATCGAGACTGAAAGCTGATTCTCAACCTCAAGTTTCTTCAATGCCGC
GGTCACCATCTCAAGAAAATTCTCGACAACAACGATCGAAAAAAACCTCCCGAGTTCAGTCTCCATCTCATCTGTCCAGTAAACCTATTGCACAATCAACATCACCACAG
CATACCAAATCTCCTGCAGCCATTGGAGACCAAACAACAAAAGAAATTGTTTCTCATCCTGCTAATCAATCGCCACAAGCAAGACCTAGAAGCAGGGAAAGTCAGTTGGA
AACGAAATCGAAGCAGTCTCCAAAACCCGACTCGAAACCAGTGGAGTCCAAAGCATCAAAAGATGAGCCTGAAACCAATGAAGAGCCCACACTCACATCTAAGAACACTT
CCCATCCCCATTCAAACCAGGACTCTTCTGAAATCCCAATGCAACCCAATCAAACCATTGAAAATGGTCCAGAGCCCTCTCTAGAATCACAGGAAGAGTCAAAGGAAAAT
AAGGAAAGAAAGAATGACAAGGAAGATCAGGCAAGAACAATCAATGCACTTCAAACCAGTGCATCTAAAAGCACATGGATCACATCTTCCGAAAGCCCTTCACCATTTGA
ACAAGCAGATAAGGACTCAAAACAGGAAGAAACCATGGAAGACTTATCAAAAGCTTTTCAGAAACTAAACATCAAATATTCAGACGAAGAAAATCCAAAAAGTTTCACAA
CACTCATCGGCGATAACAAAGGGGCTTCAATGCACGTACATTCCGGTGAAGCCAAGAGAGAGAGTTCAATCCACATCCACCGTGAGTACAAGAGCGATCCAGATCAAAGC
CCTGAAAGTTCCACAGACGACGAAGGAAACTTCAATCACGAAACACCTCAAGAATCAAGAACAGAAGAGGATCCACCACTGGAATTATACATAAACATCAACGTACAAGG
TATCAACAACTCAATCCTGTCGAATACCTCATTCACTGAGAATAATCCTGGAATCAAGTTGAAATTCGTTCGAGAACCAACTAGATCTGAAGATGAATTACAGTCTCATG
ACGCTCGAGAGGCCAAGTATACTGCGAAACCTGCCGAGAGGCTTACATATGAGCCCACAGTAAGAAGAAGATGCCTTAGAGGGCTGTTAATGGAGTCGAGCGATTCTGAG
CTCGACGATCCAGAAAAGCCCCGACGCCATGGCTGCCGCTTCAGTCGAAGTTGCGAAGGAAAAGAGGTCGAAATTCTGTAG
Protein sequenceShow/hide protein sequence
MSYRQFRFRLPWQSVKASSHPADESSRRSSEPTDETKASSSVADTVPNIRHQPGQSPEIKPEQPPLAPAQAAEKSETMPPSKSHKEAKVHAQPPSHSRAKNQTRTASKPP
SPSKAIPQSSVALNKSPSASGKASPYHDTSKPSSPAGKASPSSKPSSPASKASPSQNTSKPSSLAGKASPSKDASKPSSLAGKASPSKDASVGKKLFGDWRAQILNIQNQ
VYFSTASKPSSPAGKASPSQDASKPSSPAAPRSRIASKPPSPSQTSSKNHPNSKPTSQSRLKADSQPQVSSMPRSPSQENSRQQRSKKTSRVQSPSHLSSKPIAQSTSPQ
HTKSPAAIGDQTTKEIVSHPANQSPQARPRSRESQLETKSKQSPKPDSKPVESKASKDEPETNEEPTLTSKNTSHPHSNQDSSEIPMQPNQTIENGPEPSLESQEESKEN
KERKNDKEDQARTINALQTSASKSTWITSSESPSPFEQADKDSKQEETMEDLSKAFQKLNIKYSDEENPKSFTTLIGDNKGASMHVHSGEAKRESSIHIHREYKSDPDQS
PESSTDDEGNFNHETPQESRTEEDPPLELYININVQGINNSILSNTSFTENNPGIKLKFVREPTRSEDELQSHDAREAKYTAKPAERLTYEPTVRRRCLRGLLMESSDSE
LDDPEKPRRHGCRFSRSCEGKEVEIL