; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10004848 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10004848
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Descriptioncell wall protein RBR3-like
Genome locationChr08:20903816..20905897
RNA-Seq ExpressionHG10004848
SyntenyHG10004848
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0065223.1 flocculation protein FLO11 [Cucumis melo var. makuwa]5.4e-23666.58Show/hide
Query:  MSHSQLRILLPWQSLKASPHPANESSGWSFEPTDETETSASAADTVPNIRHQPVQSLEINPEQPPLAPAQALEKSETMPPSKSHKASKVQSQPPSNSRAK
        MS SQLRILLPWQSLKASP PANES   SF PTDE+E+SAS ADT PNIRHQP QS EI PE+PPLA AQA E+SETMPPSKSHK  K+ SQ  +NSRAK
Subjt:  MSHSQLRILLPWQSLKASPHPANESSGWSFEPTDETETSASAADTVPNIRHQPVQSLEINPEQPPLAPAQALEKSETMPPSKSHKASKVQSQPPSNSRAK

Query:  NRSRTASKPPSQSKTIPQSSVAS-KSPSTSSKGSPSQDTSKPSSPAGKA-SPSQDASSKPSSPAAVASTAPRTRIALKPSSPSSQTSSKSHPNKKPT---
        NRSRTASKP S    IPQS +AS K PSTS KGS SQD+SKPSSPAGK  SPSQDASSKPSSPA VA+TAP  RIA K SS SSQ S+K HP+ KPT   
Subjt:  NRSRTASKPPSQSKTIPQSSVAS-KSPSTSSKGSPSQDTSKPSSPAGKA-SPSQDASSKPSSPAAVASTAPRTRIALKPSSPSSQTSSKSHPNKKPT---

Query:  ---------------------------------------------------------------------------------SQSRIKADSQPSSSSRSAF
                                                                                         SQSR K DSQPS SSRS F
Subjt:  ---------------------------------------------------------------------------------SQSRIKADSQPSSSSRSAF

Query:  PSQDFSMPLRSPSQENSRQQPLEKTSRVQSPSHLSSKPTAQLTSQQPIKSPAAIGTQIHPNSKPSSQSRFKADSQPSSPSRSAFPSQDSSMPPRSPSQEI
        PSQDFSMP RSPS ENSRQQP +KTS VQSPSH S KPTAQ TS+QPI+SPA IG Q HPN KPSSQSRFKA+S+PSS S+S FPSQDSSMPPRSPSQE 
Subjt:  PSQDFSMPLRSPSQENSRQQPLEKTSRVQSPSHLSSKPTAQLTSQQPIKSPAAIGTQIHPNSKPSSQSRFKADSQPSSPSRSAFPSQDSSMPPRSPSQEI

Query:  SRQQASEKTSRVQSPSHLSGKPTAQSTTQQPIESPTAIGDQTTDGIISHPANQSPKARPTNRETQLQTKSTQPLKPDRKPVESKASKNQPETKEELT---
        S Q  SEKTSRVQSPS+LS KPTA ST+QQPIES  +IGDQTTDGI+S PA  SPKA PT+ E Q+Q KS +  +P+ KPVE KASKNQ +TKEELT   
Subjt:  SRQQASEKTSRVQSPSHLSGKPTAQSTTQQPIESPTAIGDQTTDGIISHPANQSPKARPTNRETQLQTKSTQPLKPDRKPVESKASKNQPETKEELT---

Query:  -------SKNTSNPHPNQDYSEIPTQSDQNMENGLHSFLESQAESKETKEDLAKTTNALQTKASRSTLITSSKSRSSFEPEKWYSQQEESMEDLSKVFQK
               SKNTSNPH ++D SE PTQSD+ +E GL S LESQ ESKETKED  KTTNALQ KASRSTLITSSKSRSSFEPEK  +QQ+ESMEDLSK F K
Subjt:  -------SKNTSNPHPNQDYSEIPTQSDQNMENGLHSFLESQAESKETKEDLAKTTNALQTKASRSTLITSSKSRSSFEPEKWYSQQEESMEDLSKVFQK

Query:  LNIKYSDKENQKSFTTLIGDNKGSSMHLLSGEAKSESPIHIHRQYKSDPDQSPKSSTEIEGNFNNETPQDSRTEENPS--PLELYINVNVQGINNSIMCN
        LNIKYSD+EN KSFTT+IGDNKGSS+HLLSGEAKSES IH++ +YKS+PDQSPKSST I+ N NNETPQDS TEENP   PLELYIN NVQGINNSIM N
Subjt:  LNIKYSDKENQKSFTTLIGDNKGSSMHLLSGEAKSESPIHIHRQYKSDPDQSPKSSTEIEGNFNNETPQDSRTEENPS--PLELYINVNVQGINNSIMCN

Query:  TSFIENDPGVKLKFP--REPTKSEDELEAHHARKAEYSPRPAEKLTYEPRVRRRCLRGMLMESSDSEVENPEKSRRHGCRYSRNSKGKEVETL
        TSF EN+PG+KLKFP   EPT S+DELE+HH RK+ Y P PAEK+TYEPR+RRR L G+LMES DSE ENP K R HGCRYSR+SKGK+VETL
Subjt:  TSFIENDPGVKLKFP--REPTKSEDELEAHHARKAEYSPRPAEKLTYEPRVRRRCLRGMLMESSDSEVENPEKSRRHGCRYSRNSKGKEVETL

XP_011649631.1 flocculation protein FLO11 [Cucumis sativus]7.0e-22865.66Show/hide
Query:  MSHSQLRILLPWQSLKASPHPANESSGWSFEPTDETETSASAADTVPNIRHQPVQSLEINPEQPPLAPAQALEKSETMPPSKSHKASKVQSQPPSNSRAK
        MS SQLRILLPWQSLKAS   ANES   SF PTDE+E SASAADTVPNIRHQP QS E  PE+PPLA AQA E+SETMPPSKSHKA KV SQ PS +RAK
Subjt:  MSHSQLRILLPWQSLKASPHPANESSGWSFEPTDETETSASAADTVPNIRHQPVQSLEINPEQPPLAPAQALEKSETMPPSKSHKASKVQSQPPSNSRAK

Query:  NRSRTASKPPSQSKTIPQSSVASKSPSTSSKGSPSQDTSKPSSPAGKA-SPSQDASSKPSSPAAVASTAPRTRIALKPSSPSSQTSSKSHPNKKPTSQSR
        NRSR ASKP S SK IPQ SVAS  PSTS KGS SQD+SKPSSPAGK  SPS+DASSKPSSPA VA+T P  RIA K SS SSQTS+K HPN KPTSQ +
Subjt:  NRSRTASKPPSQSKTIPQSSVASKSPSTSSKGSPSQDTSKPSSPAGKA-SPSQDASSKPSSPAAVASTAPRTRIALKPSSPSSQTSSKSHPNKKPTSQSR

Query:  IKADSQPSSSSRSA------------------------------------------------------------------------------------FP
        +KADSQPSSSSRSA                                                                                    FP
Subjt:  IKADSQPSSSSRSA------------------------------------------------------------------------------------FP

Query:  SQDFSMPLRSPSQENSRQQPLEKTSRVQSPSHLSSKPTAQLTSQQPIKSPAAIGTQIHPNSKPS-SQSRFKADSQPSSPSRSAFPSQDSSMPPRSPSQEI
        SQDFS P RSPS E SRQQP  KTSRVQSPSH S K TAQ T+QQP +SPA IG Q HPN KPS SQSRFKADSQPSS S+  FPSQDSSMPPRSPSQE 
Subjt:  SQDFSMPLRSPSQENSRQQPLEKTSRVQSPSHLSSKPTAQLTSQQPIKSPAAIGTQIHPNSKPS-SQSRFKADSQPSSPSRSAFPSQDSSMPPRSPSQEI

Query:  SRQQASEKTSRVQSPSHLSGKPTAQSTTQQPIESPTAIGDQTTDGIISHPANQSPKARPTNRETQLQTKSTQPLKPDRKPVESKASKNQPETKEELT---
        S Q  SEKT RVQSPSHLS KPTAQST+QQPIE   +IGDQTTD I+S PAN SPKA PT+ E+Q+Q +S +  KP+ KPVE + SK Q ETKEELT   
Subjt:  SRQQASEKTSRVQSPSHLSGKPTAQSTTQQPIESPTAIGDQTTDGIISHPANQSPKARPTNRETQLQTKSTQPLKPDRKPVESKASKNQPETKEELT---

Query:  -------SKNTSNPHPNQDYSEIPTQSDQNMENGLHSFLESQAESKETKEDLAKTTNALQTKASRSTLITSSKSRSSFEPEKWYSQQEESMEDLSKVFQK
               SKNTSNPH  +D SE PTQSDQ +E GL S LESQ ESKETKED AKTTNA QTKASRSTLITSSKSRSSFEPE   +QQ+ESMEDLSK F K
Subjt:  -------SKNTSNPHPNQDYSEIPTQSDQNMENGLHSFLESQAESKETKEDLAKTTNALQTKASRSTLITSSKSRSSFEPEKWYSQQEESMEDLSKVFQK

Query:  LNIKYSDKENQKSFTTLIGDNKGSSMHLLSGEAKSESPIHIHRQYKSDPDQSPKSSTEIEGNFNNETPQDSRTEENPS--PLELYINVNVQGINNSIMCN
        LNIKYSD+EN KS TT+IGDNKG+SMHLLS EAKSES IH++  YKS+PDQSP+SST+I+ N NNET +DS TEENP   PLELYIN+NVQGINNSI  N
Subjt:  LNIKYSDKENQKSFTTLIGDNKGSSMHLLSGEAKSESPIHIHRQYKSDPDQSPKSSTEIEGNFNNETPQDSRTEENPS--PLELYINVNVQGINNSIMCN

Query:  TSFIENDPGVKLKFPREPTKSEDELEA-HHARKAEYSPRPAEKLTYEPRVRRRCLRGMLMESSDSEVENPEKSRRHGCRYSRNSKGKEVETL
        TSF EN+PG+KLKFP EPT  +DELE+ HH RK++Y   PAEK+TY+PR+RRRCL G+LMESSDSE ENP K + HGCRYS +SKGKEVETL
Subjt:  TSFIENDPGVKLKFPREPTKSEDELEA-HHARKAEYSPRPAEKLTYEPRVRRRCLRGMLMESSDSEVENPEKSRRHGCRYSRNSKGKEVETL

XP_022951875.1 cell wall protein RBR3-like [Cucurbita moschata]1.8e-17058.65Show/hide
Query:  MSHSQLRILLPWQSLKASPHPANESSGWSFEPTDETETSASAADTVPNIRHQPVQSLEINPEQPPLAPAQALEKSETMPPSKSHKASKVQSQPPSNSRAK
        M++ Q R  LPWQS+KAS    NESS  S EPTDE ETS SAADTVP ++H         PE  PL  AQA E+SETM PSKSHK +KV SQP S+SRAK
Subjt:  MSHSQLRILLPWQSLKASPHPANESSGWSFEPTDETETSASAADTVPNIRHQPVQSLEINPEQPPLAPAQALEKSETMPPSKSHKASKVQSQPPSNSRAK

Query:  NRSRTASKPPSQSKTIPQSSVAS-KSPSTSSKGSPSQDTSKPSSPAGKASPSQDASSKPSSPAAVASTAPRTRIALKPSSPSSQTSSKSHPNKKPTSQSR
         ++RTA+KPPS SK  PQSSV+S KSP+TS+K SPS D SKPSS AGK SPS D +SK SSPA     +P                  SH          
Subjt:  NRSRTASKPPSQSKTIPQSSVAS-KSPSTSSKGSPSQDTSKPSSPAGKASPSQDASSKPSSPAAVASTAPRTRIALKPSSPSSQTSSKSHPNKKPTSQSR

Query:  IKADSQPSSSSRSAFPSQDFSMPLRSPSQENSRQQPLEKTSRVQSPSHLSSKPTAQLTSQQPIKSPAAIGTQIHPNSKPSSQSRFKADSQPSSPSRSAFP
            S+PSS +  AFPS+D S P                       S  ++ P +Q+ S+ P  SP+   ++ HP SKP+SQSR KADSQPSSPSR AF 
Subjt:  IKADSQPSSSSRSAFPSQDFSMPLRSPSQENSRQQPLEKTSRVQSPSHLSSKPTAQLTSQQPIKSPAAIGTQIHPNSKPSSQSRFKADSQPSSPSRSAFP

Query:  SQDSSMPPRSPSQEISRQQASEKTSRVQSPSHLSGKPTAQSTTQQPIESPTAIGDQTTDGIISHPANQSPKARPTNRETQLQTKSTQPLKPDRKPVESKA
         Q SS+ PRSPS E SRQQ S+K SRVQSPSHLS KPTAQST+QQ  ESP  IGDQTT  ++SHPA+QSP+AR   RE Q+QTKS Q  KPD KPVE KA
Subjt:  SQDSSMPPRSPSQEISRQQASEKTSRVQSPSHLSGKPTAQSTTQQPIESPTAIGDQTTDGIISHPANQSPKARPTNRETQLQTKSTQPLKPDRKPVESKA

Query:  SKNQPETKEELTSKNTSNPHPNQDYSEIPTQSDQNMENGLHSFLESQAESKETK------EDLAKTTNALQTKASRSTLITSSKSRSSFEPEKWYSQQEE
        SK+QPET EE  SKNTS PH +QD+SEIP   D+ +ENG  + LESQ ES+E+K      EDL KTTNALQ  AS+S LITS++  S FEPE   SQQE 
Subjt:  SKNQPETKEELTSKNTSNPHPNQDYSEIPTQSDQNMENGLHSFLESQAESKETK------EDLAKTTNALQTKASRSTLITSSKSRSSFEPEKWYSQQEE

Query:  SMEDLSKVFQKLNIKYSDKENQKSFTTLIGDNKGSSMHLLSGEAKSESPIHIHRQYKSDPDQSPKSSTEIEGNFNNETPQDSRTEENPSPLELYINVNVQ
        +MEDLSK FQ LNIKY + EN KSFTTL GDNKG+SMHLLSGEA  ES IHIHRQYKSDPD+ P+SST+IEGN N ETPQDS+TEE+P PLELYIN+NVQ
Subjt:  SMEDLSKVFQKLNIKYSDKENQKSFTTLIGDNKGSSMHLLSGEAKSESPIHIHRQYKSDPDQSPKSSTEIEGNFNNETPQDSRTEENPSPLELYINVNVQ

Query:  GINNSIMCNTSFIENDPGVKLKFPREPTKSEDELEAHHARKAEYSPRPAEKLTYEPRVRRRCLRGMLMESSDSEVENPEKSRRHGCRYSRNSKG
        GINNS++ N+SF EN+PG+KLKF  + TKSED+  +  A+KA+Y+ +  E  TYEP VRRRCL G+LMESSDS+ +N EK RRHGCRY  + +G
Subjt:  GINNSIMCNTSFIENDPGVKLKFPREPTKSEDELEAHHARKAEYSPRPAEKLTYEPRVRRRCLRGMLMESSDSEVENPEKSRRHGCRYSRNSKG

XP_023002262.1 cell wall protein RBR3-like [Cucurbita maxima]4.3e-16958.27Show/hide
Query:  MSHSQLRILLPWQSLKASPHPANESSGWSFEPTDETETSASAADTVPNIRHQPVQSLEINPEQPPLAPAQALEKSETMPPSKSHKASKVQSQPPSNSRAK
        M++ Q R  LPWQS+KAS  P NESS  S EPTDE ETS SAADTVP ++H P+QS E  PE  PL  AQA E+SETM PSKSHK +KV SQP S+SRAK
Subjt:  MSHSQLRILLPWQSLKASPHPANESSGWSFEPTDETETSASAADTVPNIRHQPVQSLEINPEQPPLAPAQALEKSETMPPSKSHKASKVQSQPPSNSRAK

Query:  NRSRTASKPPSQSKTIPQSSVAS-KSPSTSSKGSPSQDTSKPSSPAGKASPSQDASSKPSSPAAVASTAPRTRIALKPSSPSSQTSSKSHPNKKPTSQSR
         ++RTA+KPPS SK  PQSSV+S KSP+TS+K SPS D SKPSS AGK SPS D +SK SSPA                SPS  T               
Subjt:  NRSRTASKPPSQSKTIPQSSVAS-KSPSTSSKGSPSQDTSKPSSPAGKASPSQDASSKPSSPAAVASTAPRTRIALKPSSPSSQTSSKSHPNKKPTSQSR

Query:  IKADSQPSSSSRSAFPSQDFSMPLRSPSQENSRQQPLEKTSRVQSPSHLSSKPTAQLTSQQPIKSPAAIGTQIHPNSKPSSQSRFKADSQPSSPSRSAFP
            S PS  +  AFPS+D S P                 +     SH+ SKP           SP+   ++ H +SK +SQSR KADSQPSSPSR AF 
Subjt:  IKADSQPSSSSRSAFPSQDFSMPLRSPSQENSRQQPLEKTSRVQSPSHLSSKPTAQLTSQQPIKSPAAIGTQIHPNSKPSSQSRFKADSQPSSPSRSAFP

Query:  SQDSSMPPRSPSQEISRQQASEKTSRVQSPSHLSGKPTAQSTTQQPIESPTAIGDQTTDGIISHPANQSPKARPTNRETQLQTKSTQPLKPDRKPVESKA
         Q SS+ PRSPS E SRQQ S+K SRVQSPSHLS K TAQST+QQ  ESP  IGDQTT  ++SHPA+QSP+AR  ++E Q+QTKS Q  KPD KPVE KA
Subjt:  SQDSSMPPRSPSQEISRQQASEKTSRVQSPSHLSGKPTAQSTTQQPIESPTAIGDQTTDGIISHPANQSPKARPTNRETQLQTKSTQPLKPDRKPVESKA

Query:  SKNQPETKEELTSKNTSNPHPNQDYSEIPTQSDQNMENGLHSFLESQAESKETK------EDLAKTTNALQTKASRSTLITSSKSRSSFEPEKWYSQQEE
        SK+QPET EE  SKNTS P  N+D+SEIP   D+ +ENG    LESQ ES+E+K      EDL KTTNALQ  AS+S LITS++  S FEPE   SQQE 
Subjt:  SKNQPETKEELTSKNTSNPHPNQDYSEIPTQSDQNMENGLHSFLESQAESKETK------EDLAKTTNALQTKASRSTLITSSKSRSSFEPEKWYSQQEE

Query:  SMEDLSKVFQKLNIKYSDKENQKSFTTLIGDNKGSSMHLLSGEAKSESPIHIHRQYKSDPDQSPKSSTEIEGNFNNETPQDSRTEENPSPLELYINVNVQ
        +MEDL K FQ LNIKY + EN KSFTTL GDNKG+SMHL+SGEA  ES IHIHRQYKSDPD+ P+SST+IEGN N ETPQDS+TEE+P PLELYIN+NVQ
Subjt:  SMEDLSKVFQKLNIKYSDKENQKSFTTLIGDNKGSSMHLLSGEAKSESPIHIHRQYKSDPDQSPKSSTEIEGNFNNETPQDSRTEENPSPLELYINVNVQ

Query:  GINNSIMCNTSFIENDPGVKLKFPREPTKSEDELEAHHARKAEYSPRPAEKLTYEPRVRRRCLRGMLMESSDSEVENPEKSRRHGCRYSRNSKGK
        GINNS++ N+SF EN+PG+KLKF  + TKSE++  +  A+KA+Y+ +  E  TYEP VRRRCL G+LMESSDS+ +N EK RRHGCRY  + +GK
Subjt:  GINNSIMCNTSFIENDPGVKLKFPREPTKSEDELEAHHARKAEYSPRPAEKLTYEPRVRRRCLRGMLMESSDSEVENPEKSRRHGCRYSRNSKGK

XP_038886773.1 flocculation protein FLO11 [Benincasa hispida]5.5e-26579.86Show/hide
Query:  MSHSQLRILLPWQSLKASPHPANESSGWSFEPTDETETSASAADTVPNIRHQPVQSLEINPEQPPLAPAQALEKSETMPPSKSHKASKVQSQPPSNSRAK
        MS SQLRILLPWQSLKASP P NES G SFEPTDE ETSASAADT  NIRHQP QS EI PEQPPLA A A E+SETMPPSKSHKA KV SQPP NSRAK
Subjt:  MSHSQLRILLPWQSLKASPHPANESSGWSFEPTDETETSASAADTVPNIRHQPVQSLEINPEQPPLAPAQALEKSETMPPSKSHKASKVQSQPPSNSRAK

Query:  NRSRTASKPPSQSKTIPQSSVAS-KSPSTSSKGSPSQDTSKPSSPAGKASPSQDASSKPSSPAAVASTAPRTRIALKP------SSPSSQTSSKSHPNKK
        NRSRTASKP   SK IPQSSVAS KSPSTS K S SQDTSKPSSPAGK+S SQDASSKPSSPAAVA+TAPR+RI  KP      SSPSSQTSSK+HP  K
Subjt:  NRSRTASKPPSQSKTIPQSSVAS-KSPSTSSKGSPSQDTSKPSSPAGKASPSQDASSKPSSPAAVASTAPRTRIALKP------SSPSSQTSSKSHPNKK

Query:  PTSQSRIKADSQPSSSSRSAFPSQDFSMPLRSPSQENSRQQPLEKTSRVQSPSHLSSKPTAQLTSQQPIKSPAAIGTQIHPNSKPSSQSRFKADSQPSSP
        P+SQSR KADSQP SSSRSAFPSQD S+P R PS ENSR QP E+TSRVQSPSH SSKPTAQ TSQQP +SPA IG Q HPNSKPSSQSRFKADSQPSS 
Subjt:  PTSQSRIKADSQPSSSSRSAFPSQDFSMPLRSPSQENSRQQPLEKTSRVQSPSHLSSKPTAQLTSQQPIKSPAAIGTQIHPNSKPSSQSRFKADSQPSSP

Query:  SRSAFPSQDSSMPPRSPSQEISRQQASEKTSRVQSPSHLSGKPTAQSTTQQPIESPTAIGDQTTDGIISHPANQSPKARPTNRETQLQTKSTQPLKPDRK
        SRSAF SQDSSM P SPS+E SRQQ  EKTSRVQSPSHLS KP AQST+QQPIESP AIG+QTT+  ISHP NQSPKARPT+RE+Q+QTKS Q LKP+ K
Subjt:  SRSAFPSQDSSMPPRSPSQEISRQQASEKTSRVQSPSHLSGKPTAQSTTQQPIESPTAIGDQTTDGIISHPANQSPKARPTNRETQLQTKSTQPLKPDRK

Query:  PVESKASKNQPETKEELTSKNTSNPHPNQDYSEIPTQSDQNMENGLHSFLESQAESKETKEDLAKTTNALQTKASRSTLITSSKSRSSFEPEKWYSQQEE
         VE KASKN+ ETKEEL+SKNTSNPH NQD  E PT+SDQ +EN L   LESQAES+ET+E+LAKTTNALQTKASRSTLITSSK   SFEPE    QQEE
Subjt:  PVESKASKNQPETKEELTSKNTSNPHPNQDYSEIPTQSDQNMENGLHSFLESQAESKETKEDLAKTTNALQTKASRSTLITSSKSRSSFEPEKWYSQQEE

Query:  SMEDLSKVFQKLNIKYSDKENQKSFTTLIGDNKGSSMHLLSGEAKSESPIHIHRQYKSDPDQSPKSSTEIEGNFNNETPQDSRTEENPSPLELYINVNVQ
        SM+D SK FQKLNIKYSD+EN KSFTTLIG NKGSSMHL+SGEAKSES IHIHRQYKS+PDQSPK STEIEGNF NET +DSRTEENP  +E+YIN+NVQ
Subjt:  SMEDLSKVFQKLNIKYSDKENQKSFTTLIGDNKGSSMHLLSGEAKSESPIHIHRQYKSDPDQSPKSSTEIEGNFNNETPQDSRTEENPSPLELYINVNVQ

Query:  GINNSIMCNTSFIENDPGVKLKFPREPTKSEDELEAHHARKAEYSPRPAEKLTYEPRVRRRCLRGMLMESSDSEVENPEKSRRHGCRYSRNSKGKEVETL
        GINNSIMCNTSF ENDPG+KLK  RE  KSEDELE+HHARKAEYS +PAEK+TYEPRVRRRCLRGMLMESSDSEVENP KSRRHGCRY  +SKGKEVETL
Subjt:  GINNSIMCNTSFIENDPGVKLKFPREPTKSEDELEAHHARKAEYSPRPAEKLTYEPRVRRRCLRGMLMESSDSEVENPEKSRRHGCRYSRNSKGKEVETL

TrEMBL top hitse value%identityAlignment
A0A0A0LLH1 Uncharacterized protein2.2e-13471.14Show/hide
Query:  MPPRSPSQEISRQQASEKTSRVQSPSHLSGKPTAQSTTQQPIESPTAIGDQTTDGIISHPANQSPKARPTNRETQLQTKSTQPLKPDRKPVESKASKNQP
        MPPRSPSQE S Q  SEKT RVQSPSHLS KPTAQST+QQPIE   +IGDQTTD I+S PAN SPKA PT+ E+Q+Q +S +  KP+ KPVE + SK Q 
Subjt:  MPPRSPSQEISRQQASEKTSRVQSPSHLSGKPTAQSTTQQPIESPTAIGDQTTDGIISHPANQSPKARPTNRETQLQTKSTQPLKPDRKPVESKASKNQP

Query:  ETKEELT----------SKNTSNPHPNQDYSEIPTQSDQNMENGLHSFLESQAESKETKEDLAKTTNALQTKASRSTLITSSKSRSSFEPEKWYSQQEES
        ETKEELT          SKNTSNPH  +D SE PTQSDQ +E GL S LESQ ESKETKED AKTTNA QTKASRSTLITSSKSRSSFEPE   +QQ+ES
Subjt:  ETKEELT----------SKNTSNPHPNQDYSEIPTQSDQNMENGLHSFLESQAESKETKEDLAKTTNALQTKASRSTLITSSKSRSSFEPEKWYSQQEES

Query:  MEDLSKVFQKLNIKYSDKENQKSFTTLIGDNKGSSMHLLSGEAKSESPIHIHRQYKSDPDQSPKSSTEIEGNFNNETPQDSRTEENPS--PLELYINVNV
        MEDLSK F KLNIKYSD+EN KS TT+IGDNKG+SMHLLS EAKSES IH++  YKS+PDQSP+SST+I+ N NNET +DS TEENP   PLELYIN+NV
Subjt:  MEDLSKVFQKLNIKYSDKENQKSFTTLIGDNKGSSMHLLSGEAKSESPIHIHRQYKSDPDQSPKSSTEIEGNFNNETPQDSRTEENPS--PLELYINVNV

Query:  QGINNSIMCNTSFIENDPGVKLKFPREPTKSEDELEA-HHARKAEYSPRPAEKLTYEPRVRRRCLRGMLMESSDSEVENPEKSRRHGCRYSRNSKGKEVE
        QGINNSI  NTSF EN+PG+KLKFP EPT  +DELE+ HH RK++Y   PAEK+TY+PR+RRRCL G+LMESSDSE ENP K + HGCRYS +SKGKEVE
Subjt:  QGINNSIMCNTSFIENDPGVKLKFPREPTKSEDELEA-HHARKAEYSPRPAEKLTYEPRVRRRCLRGMLMESSDSEVENPEKSRRHGCRYSRNSKGKEVE

Query:  TL
        TL
Subjt:  TL

A0A1S4DVD0 micronuclear linker histone polyprotein1.0e-13973.2Show/hide
Query:  MPPRSPSQEISRQQASEKTSRVQSPSHLSGKPTAQSTTQQPIESPTAIGDQTTDGIISHPANQSPKARPTNRETQLQTKSTQPLKPDRKPVESKASKNQP
        MPPRSPSQE S Q  SEKTSRVQSPS+LS KPTA ST+QQPIES  +IGDQTTDGI+S PA  SPKA PT+ E Q+Q KS +  +P+ KPVE KASKNQ 
Subjt:  MPPRSPSQEISRQQASEKTSRVQSPSHLSGKPTAQSTTQQPIESPTAIGDQTTDGIISHPANQSPKARPTNRETQLQTKSTQPLKPDRKPVESKASKNQP

Query:  ETKEELT----------SKNTSNPHPNQDYSEIPTQSDQNMENGLHSFLESQAESKETKEDLAKTTNALQTKASRSTLITSSKSRSSFEPEKWYSQQEES
        +TKEELT          SKNTSNPH ++D SE PTQSD+ +E GL S LESQ ESKETKED  KTTNALQ KASRSTLITSSKSRSSFEPEK  +QQ+ES
Subjt:  ETKEELT----------SKNTSNPHPNQDYSEIPTQSDQNMENGLHSFLESQAESKETKEDLAKTTNALQTKASRSTLITSSKSRSSFEPEKWYSQQEES

Query:  MEDLSKVFQKLNIKYSDKENQKSFTTLIGDNKGSSMHLLSGEAKSESPIHIHRQYKSDPDQSPKSSTEIEGNFNNETPQDSRTEENPS--PLELYINVNV
        MEDLSK F KLNIKYSD+EN KSFTT+IGDNKGSS+HLLSGEAKSES IH++ +YKS+PDQSPKSST I+ N NNETPQDS TEENP   PLELYIN NV
Subjt:  MEDLSKVFQKLNIKYSDKENQKSFTTLIGDNKGSSMHLLSGEAKSESPIHIHRQYKSDPDQSPKSSTEIEGNFNNETPQDSRTEENPS--PLELYINVNV

Query:  QGINNSIMCNTSFIENDPGVKLKFP--REPTKSEDELEAHHARKAEYSPRPAEKLTYEPRVRRRCLRGMLMESSDSEVENPEKSRRHGCRYSRNSKGKEV
        QGINNSIM NTSF EN+PG+KLKFP   EPT S+DELE+HH RK+ Y P PAEK+TYEPR+RRR L G+LMES DSE ENP K R HGCRYSR+SKGK+V
Subjt:  QGINNSIMCNTSFIENDPGVKLKFP--REPTKSEDELEAHHARKAEYSPRPAEKLTYEPRVRRRCLRGMLMESSDSEVENPEKSRRHGCRYSRNSKGKEV

Query:  ETL
        ETL
Subjt:  ETL

A0A5A7VAN0 Flocculation protein FLO112.6e-23666.58Show/hide
Query:  MSHSQLRILLPWQSLKASPHPANESSGWSFEPTDETETSASAADTVPNIRHQPVQSLEINPEQPPLAPAQALEKSETMPPSKSHKASKVQSQPPSNSRAK
        MS SQLRILLPWQSLKASP PANES   SF PTDE+E+SAS ADT PNIRHQP QS EI PE+PPLA AQA E+SETMPPSKSHK  K+ SQ  +NSRAK
Subjt:  MSHSQLRILLPWQSLKASPHPANESSGWSFEPTDETETSASAADTVPNIRHQPVQSLEINPEQPPLAPAQALEKSETMPPSKSHKASKVQSQPPSNSRAK

Query:  NRSRTASKPPSQSKTIPQSSVAS-KSPSTSSKGSPSQDTSKPSSPAGKA-SPSQDASSKPSSPAAVASTAPRTRIALKPSSPSSQTSSKSHPNKKPT---
        NRSRTASKP S    IPQS +AS K PSTS KGS SQD+SKPSSPAGK  SPSQDASSKPSSPA VA+TAP  RIA K SS SSQ S+K HP+ KPT   
Subjt:  NRSRTASKPPSQSKTIPQSSVAS-KSPSTSSKGSPSQDTSKPSSPAGKA-SPSQDASSKPSSPAAVASTAPRTRIALKPSSPSSQTSSKSHPNKKPT---

Query:  ---------------------------------------------------------------------------------SQSRIKADSQPSSSSRSAF
                                                                                         SQSR K DSQPS SSRS F
Subjt:  ---------------------------------------------------------------------------------SQSRIKADSQPSSSSRSAF

Query:  PSQDFSMPLRSPSQENSRQQPLEKTSRVQSPSHLSSKPTAQLTSQQPIKSPAAIGTQIHPNSKPSSQSRFKADSQPSSPSRSAFPSQDSSMPPRSPSQEI
        PSQDFSMP RSPS ENSRQQP +KTS VQSPSH S KPTAQ TS+QPI+SPA IG Q HPN KPSSQSRFKA+S+PSS S+S FPSQDSSMPPRSPSQE 
Subjt:  PSQDFSMPLRSPSQENSRQQPLEKTSRVQSPSHLSSKPTAQLTSQQPIKSPAAIGTQIHPNSKPSSQSRFKADSQPSSPSRSAFPSQDSSMPPRSPSQEI

Query:  SRQQASEKTSRVQSPSHLSGKPTAQSTTQQPIESPTAIGDQTTDGIISHPANQSPKARPTNRETQLQTKSTQPLKPDRKPVESKASKNQPETKEELT---
        S Q  SEKTSRVQSPS+LS KPTA ST+QQPIES  +IGDQTTDGI+S PA  SPKA PT+ E Q+Q KS +  +P+ KPVE KASKNQ +TKEELT   
Subjt:  SRQQASEKTSRVQSPSHLSGKPTAQSTTQQPIESPTAIGDQTTDGIISHPANQSPKARPTNRETQLQTKSTQPLKPDRKPVESKASKNQPETKEELT---

Query:  -------SKNTSNPHPNQDYSEIPTQSDQNMENGLHSFLESQAESKETKEDLAKTTNALQTKASRSTLITSSKSRSSFEPEKWYSQQEESMEDLSKVFQK
               SKNTSNPH ++D SE PTQSD+ +E GL S LESQ ESKETKED  KTTNALQ KASRSTLITSSKSRSSFEPEK  +QQ+ESMEDLSK F K
Subjt:  -------SKNTSNPHPNQDYSEIPTQSDQNMENGLHSFLESQAESKETKEDLAKTTNALQTKASRSTLITSSKSRSSFEPEKWYSQQEESMEDLSKVFQK

Query:  LNIKYSDKENQKSFTTLIGDNKGSSMHLLSGEAKSESPIHIHRQYKSDPDQSPKSSTEIEGNFNNETPQDSRTEENPS--PLELYINVNVQGINNSIMCN
        LNIKYSD+EN KSFTT+IGDNKGSS+HLLSGEAKSES IH++ +YKS+PDQSPKSST I+ N NNETPQDS TEENP   PLELYIN NVQGINNSIM N
Subjt:  LNIKYSDKENQKSFTTLIGDNKGSSMHLLSGEAKSESPIHIHRQYKSDPDQSPKSSTEIEGNFNNETPQDSRTEENPS--PLELYINVNVQGINNSIMCN

Query:  TSFIENDPGVKLKFP--REPTKSEDELEAHHARKAEYSPRPAEKLTYEPRVRRRCLRGMLMESSDSEVENPEKSRRHGCRYSRNSKGKEVETL
        TSF EN+PG+KLKFP   EPT S+DELE+HH RK+ Y P PAEK+TYEPR+RRR L G+LMES DSE ENP K R HGCRYSR+SKGK+VETL
Subjt:  TSFIENDPGVKLKFP--REPTKSEDELEAHHARKAEYSPRPAEKLTYEPRVRRRCLRGMLMESSDSEVENPEKSRRHGCRYSRNSKGKEVETL

A0A6J1GK50 cell wall protein RBR3-like8.5e-17158.65Show/hide
Query:  MSHSQLRILLPWQSLKASPHPANESSGWSFEPTDETETSASAADTVPNIRHQPVQSLEINPEQPPLAPAQALEKSETMPPSKSHKASKVQSQPPSNSRAK
        M++ Q R  LPWQS+KAS    NESS  S EPTDE ETS SAADTVP ++H         PE  PL  AQA E+SETM PSKSHK +KV SQP S+SRAK
Subjt:  MSHSQLRILLPWQSLKASPHPANESSGWSFEPTDETETSASAADTVPNIRHQPVQSLEINPEQPPLAPAQALEKSETMPPSKSHKASKVQSQPPSNSRAK

Query:  NRSRTASKPPSQSKTIPQSSVAS-KSPSTSSKGSPSQDTSKPSSPAGKASPSQDASSKPSSPAAVASTAPRTRIALKPSSPSSQTSSKSHPNKKPTSQSR
         ++RTA+KPPS SK  PQSSV+S KSP+TS+K SPS D SKPSS AGK SPS D +SK SSPA     +P                  SH          
Subjt:  NRSRTASKPPSQSKTIPQSSVAS-KSPSTSSKGSPSQDTSKPSSPAGKASPSQDASSKPSSPAAVASTAPRTRIALKPSSPSSQTSSKSHPNKKPTSQSR

Query:  IKADSQPSSSSRSAFPSQDFSMPLRSPSQENSRQQPLEKTSRVQSPSHLSSKPTAQLTSQQPIKSPAAIGTQIHPNSKPSSQSRFKADSQPSSPSRSAFP
            S+PSS +  AFPS+D S P                       S  ++ P +Q+ S+ P  SP+   ++ HP SKP+SQSR KADSQPSSPSR AF 
Subjt:  IKADSQPSSSSRSAFPSQDFSMPLRSPSQENSRQQPLEKTSRVQSPSHLSSKPTAQLTSQQPIKSPAAIGTQIHPNSKPSSQSRFKADSQPSSPSRSAFP

Query:  SQDSSMPPRSPSQEISRQQASEKTSRVQSPSHLSGKPTAQSTTQQPIESPTAIGDQTTDGIISHPANQSPKARPTNRETQLQTKSTQPLKPDRKPVESKA
         Q SS+ PRSPS E SRQQ S+K SRVQSPSHLS KPTAQST+QQ  ESP  IGDQTT  ++SHPA+QSP+AR   RE Q+QTKS Q  KPD KPVE KA
Subjt:  SQDSSMPPRSPSQEISRQQASEKTSRVQSPSHLSGKPTAQSTTQQPIESPTAIGDQTTDGIISHPANQSPKARPTNRETQLQTKSTQPLKPDRKPVESKA

Query:  SKNQPETKEELTSKNTSNPHPNQDYSEIPTQSDQNMENGLHSFLESQAESKETK------EDLAKTTNALQTKASRSTLITSSKSRSSFEPEKWYSQQEE
        SK+QPET EE  SKNTS PH +QD+SEIP   D+ +ENG  + LESQ ES+E+K      EDL KTTNALQ  AS+S LITS++  S FEPE   SQQE 
Subjt:  SKNQPETKEELTSKNTSNPHPNQDYSEIPTQSDQNMENGLHSFLESQAESKETK------EDLAKTTNALQTKASRSTLITSSKSRSSFEPEKWYSQQEE

Query:  SMEDLSKVFQKLNIKYSDKENQKSFTTLIGDNKGSSMHLLSGEAKSESPIHIHRQYKSDPDQSPKSSTEIEGNFNNETPQDSRTEENPSPLELYINVNVQ
        +MEDLSK FQ LNIKY + EN KSFTTL GDNKG+SMHLLSGEA  ES IHIHRQYKSDPD+ P+SST+IEGN N ETPQDS+TEE+P PLELYIN+NVQ
Subjt:  SMEDLSKVFQKLNIKYSDKENQKSFTTLIGDNKGSSMHLLSGEAKSESPIHIHRQYKSDPDQSPKSSTEIEGNFNNETPQDSRTEENPSPLELYINVNVQ

Query:  GINNSIMCNTSFIENDPGVKLKFPREPTKSEDELEAHHARKAEYSPRPAEKLTYEPRVRRRCLRGMLMESSDSEVENPEKSRRHGCRYSRNSKG
        GINNS++ N+SF EN+PG+KLKF  + TKSED+  +  A+KA+Y+ +  E  TYEP VRRRCL G+LMESSDS+ +N EK RRHGCRY  + +G
Subjt:  GINNSIMCNTSFIENDPGVKLKFPREPTKSEDELEAHHARKAEYSPRPAEKLTYEPRVRRRCLRGMLMESSDSEVENPEKSRRHGCRYSRNSKG

A0A6J1KJ10 cell wall protein RBR3-like2.1e-16958.27Show/hide
Query:  MSHSQLRILLPWQSLKASPHPANESSGWSFEPTDETETSASAADTVPNIRHQPVQSLEINPEQPPLAPAQALEKSETMPPSKSHKASKVQSQPPSNSRAK
        M++ Q R  LPWQS+KAS  P NESS  S EPTDE ETS SAADTVP ++H P+QS E  PE  PL  AQA E+SETM PSKSHK +KV SQP S+SRAK
Subjt:  MSHSQLRILLPWQSLKASPHPANESSGWSFEPTDETETSASAADTVPNIRHQPVQSLEINPEQPPLAPAQALEKSETMPPSKSHKASKVQSQPPSNSRAK

Query:  NRSRTASKPPSQSKTIPQSSVAS-KSPSTSSKGSPSQDTSKPSSPAGKASPSQDASSKPSSPAAVASTAPRTRIALKPSSPSSQTSSKSHPNKKPTSQSR
         ++RTA+KPPS SK  PQSSV+S KSP+TS+K SPS D SKPSS AGK SPS D +SK SSPA                SPS  T               
Subjt:  NRSRTASKPPSQSKTIPQSSVAS-KSPSTSSKGSPSQDTSKPSSPAGKASPSQDASSKPSSPAAVASTAPRTRIALKPSSPSSQTSSKSHPNKKPTSQSR

Query:  IKADSQPSSSSRSAFPSQDFSMPLRSPSQENSRQQPLEKTSRVQSPSHLSSKPTAQLTSQQPIKSPAAIGTQIHPNSKPSSQSRFKADSQPSSPSRSAFP
            S PS  +  AFPS+D S P                 +     SH+ SKP           SP+   ++ H +SK +SQSR KADSQPSSPSR AF 
Subjt:  IKADSQPSSSSRSAFPSQDFSMPLRSPSQENSRQQPLEKTSRVQSPSHLSSKPTAQLTSQQPIKSPAAIGTQIHPNSKPSSQSRFKADSQPSSPSRSAFP

Query:  SQDSSMPPRSPSQEISRQQASEKTSRVQSPSHLSGKPTAQSTTQQPIESPTAIGDQTTDGIISHPANQSPKARPTNRETQLQTKSTQPLKPDRKPVESKA
         Q SS+ PRSPS E SRQQ S+K SRVQSPSHLS K TAQST+QQ  ESP  IGDQTT  ++SHPA+QSP+AR  ++E Q+QTKS Q  KPD KPVE KA
Subjt:  SQDSSMPPRSPSQEISRQQASEKTSRVQSPSHLSGKPTAQSTTQQPIESPTAIGDQTTDGIISHPANQSPKARPTNRETQLQTKSTQPLKPDRKPVESKA

Query:  SKNQPETKEELTSKNTSNPHPNQDYSEIPTQSDQNMENGLHSFLESQAESKETK------EDLAKTTNALQTKASRSTLITSSKSRSSFEPEKWYSQQEE
        SK+QPET EE  SKNTS P  N+D+SEIP   D+ +ENG    LESQ ES+E+K      EDL KTTNALQ  AS+S LITS++  S FEPE   SQQE 
Subjt:  SKNQPETKEELTSKNTSNPHPNQDYSEIPTQSDQNMENGLHSFLESQAESKETK------EDLAKTTNALQTKASRSTLITSSKSRSSFEPEKWYSQQEE

Query:  SMEDLSKVFQKLNIKYSDKENQKSFTTLIGDNKGSSMHLLSGEAKSESPIHIHRQYKSDPDQSPKSSTEIEGNFNNETPQDSRTEENPSPLELYINVNVQ
        +MEDL K FQ LNIKY + EN KSFTTL GDNKG+SMHL+SGEA  ES IHIHRQYKSDPD+ P+SST+IEGN N ETPQDS+TEE+P PLELYIN+NVQ
Subjt:  SMEDLSKVFQKLNIKYSDKENQKSFTTLIGDNKGSSMHLLSGEAKSESPIHIHRQYKSDPDQSPKSSTEIEGNFNNETPQDSRTEENPSPLELYINVNVQ

Query:  GINNSIMCNTSFIENDPGVKLKFPREPTKSEDELEAHHARKAEYSPRPAEKLTYEPRVRRRCLRGMLMESSDSEVENPEKSRRHGCRYSRNSKGK
        GINNS++ N+SF EN+PG+KLKF  + TKSE++  +  A+KA+Y+ +  E  TYEP VRRRCL G+LMESSDS+ +N EK RRHGCRY  + +GK
Subjt:  GINNSIMCNTSFIENDPGVKLKFPREPTKSEDELEAHHARKAEYSPRPAEKLTYEPRVRRRCLRGMLMESSDSEVENPEKSRRHGCRYSRNSKGK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G75260.1 oxidoreductases, acting on NADH or NADPH1.0e-1427.69Show/hide
Query:  SKGSPSQDTSKPSSPAGKASPSQDASSKPSSPAAVASTAPRTRIALKPSSPSSQTSSKSHPNKKPTSQSRIKADSQPSSSSRSAFPSQDFSMPLRSPSQE
        S  SPS+ +S  SSP+   +P    S  P  PA +A          +PS   S+T  K+ P+    S+SR    +  +SSS S  PS   + P R   Q 
Subjt:  SKGSPSQDTSKPSSPAGKASPSQDASSKPSSPAAVASTAPRTRIALKPSSPSSQTSSKSHPNKKPTSQSRIKADSQPSSSSRSAFPSQDFSMPLRSPSQE

Query:  NSRQQPLEKTSRVQSPSHLSSKPTAQLTSQQPIKSPAAIGTQIHPNSKPSSQSRFKADSQPSSPSRSAFPSQDSSMPPRSPSQEISRQQASEKTSRVQSP
        N +                S  P+ +L S +       + T+  P  +    +          P   A P +          QE  R   + +    ++ 
Subjt:  NSRQQPLEKTSRVQSPSHLSSKPTAQLTSQQPIKSPAAIGTQIHPNSKPSSQSRFKADSQPSSPSRSAFPSQDSSMPPRSPSQEISRQQASEKTSRVQSP

Query:  SHL---SGKPTAQSTTQQPIESPTAIGDQTTDGIISHPANQSPKARPTNRETQLQTKSTQPLKPDRKPVESKASKNQPETKEELTSKNTSNPHPNQDYSE
          L   SGK +A +  QQ IE    I  Q    ++     Q  +A     + Q ++K T+ L         +A   +  T+ + T    +     +   +
Subjt:  SHL---SGKPTAQSTTQQPIESPTAIGDQTTDGIISHPANQSPKARPTNRETQLQTKSTQPLKPDRKPVESKASKNQPETKEELTSKNTSNPHPNQDYSE

Query:  IPTQSDQNMENGLHSFLESQAESKETKEDLAKTTNALQTKASRSTLITSSKSRSSFEPEKWYSQQEESMEDLSKVFQKLNIKYSDKENQK-SFTTLIGDN
        +P +             E+Q  ++   +D  + T    T    +  +T+ +  S        S   +  ED+     KL    S+ +++  S  TL G+N
Subjt:  IPTQSDQNMENGLHSFLESQAESKETKEDLAKTTNALQTKASRSTLITSSKSRSSFEPEKWYSQQEESMEDLSKVFQKLNIKYSDKENQK-SFTTLIGDN

Query:  KGSSMHLLSGEAKSESPIHIHRQYKSDPDQSPKSSTEIEGNFNNETPQDSRTEENPSPLELYINVNVQGINNSIMCNTSFIENDPGVKLKFPREPTKSED
        KG++M + S + K +  +HI R Y+S+PD+S  ++         E P+D   EE  S    YIN N QGINNSI+  +S  ENDPGV + F  E  K E 
Subjt:  KGSSMHLLSGEAKSESPIHIHRQYKSDPDQSPKSSTEIEGNFNNETPQDSRTEENPSPLELYINVNVQGINNSIMCNTSFIENDPGVKLKFPREPTKSED

Query:  ELEAHHA-RKAEYSPRPAEKLTYEPRVRRRCLRGMLMESSDSEVENPEKSRRHGCRYSRNSKGKEVE
             +   K   +    +KL  EPRVRRRCLRG+L ESS+SE +NP K RRHGCR++   K K++E
Subjt:  ELEAHHA-RKAEYSPRPAEKLTYEPRVRRRCLRGMLMESSDSEVENPEKSRRHGCRYSRNSKGKEVE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCACACTCACAATTGCGCATTCTACTTCCTTGGCAATCGTTAAAAGCTTCTCCTCATCCTGCAAATGAGTCGTCAGGATGGAGTTTTGAGCCTACAGATGAAACGGA
AACTTCTGCTTCTGCAGCTGATACCGTGCCAAATATTCGACATCAACCGGTACAGTCTCTTGAGATAAATCCAGAACAACCTCCTTTAGCACCAGCTCAGGCACTTGAAA
AAAGTGAAACTATGCCACCTTCAAAATCTCATAAGGCAAGCAAAGTTCAATCTCAGCCACCATCAAATTCTCGAGCCAAAAATCGGTCCCGAACAGCTTCCAAGCCTCCA
TCACAATCTAAAACAATCCCTCAATCTTCAGTTGCTTCCAAGTCTCCTTCAACATCAAGCAAAGGCTCCCCATCTCAGGATACTTCAAAGCCATCATCACCAGCAGGCAA
AGCCTCTCCATCTCAGGATGCTTCTTCAAAGCCTTCATCACCTGCAGCAGTTGCATCTACTGCTCCTCGAACCCGGATTGCTTTGAAGCCATCATCTCCATCGTCTCAAA
CATCCAGTAAAAGCCATCCAAATAAAAAACCAACATCACAATCAAGAATTAAAGCTGATTCTCAGCCTTCATCATCTTCAAGGTCAGCATTTCCATCTCAAGATTTTTCT
ATGCCACTGCGGTCGCCATCTCAAGAAAATTCTCGACAACAACCATTGGAAAAAACCTCTCGGGTTCAGTCTCCATCTCATTTGTCCAGTAAACCTACTGCACAATTAAC
ATCACAACAACCTATTAAGTCTCCAGCAGCCATTGGAACTCAAATCCATCCAAATTCAAAACCATCATCCCAATCAAGATTTAAAGCTGATTCTCAGCCTTCATCACCTT
CAAGGTCAGCATTTCCATCTCAAGATTCTTCTATGCCACCACGGTCGCCATCTCAAGAAATTTCTCGACAACAAGCATCGGAAAAAACCTCTCGAGTTCAGTCTCCATCT
CATTTGTCCGGTAAACCTACTGCACAATCAACAACACAACAACCTATTGAATCTCCTACAGCCATTGGAGACCAAACAACAGATGGAATCATTTCTCATCCCGCAAATCA
ATCCCCAAAAGCAAGACCTACAAACAGGGAAACTCAGTTGCAAACCAAATCAACGCAGCCTCTGAAACCAGACAGGAAACCAGTGGAATCGAAAGCATCAAAAAATCAGC
CTGAAACCAAGGAAGAGCTCACATCTAAGAACACTTCCAATCCCCATCCGAACCAGGACTATTCTGAAATCCCAACACAATCCGATCAAAACATGGAAAATGGCTTACAT
TCCTTTCTAGAATCACAGGCAGAGTCAAAAGAAACTAAGGAAGATCTGGCAAAGACAACCAATGCACTTCAAACCAAAGCATCTAGAAGCACATTAATCACATCTTCCAA
AAGTCGTTCATCATTTGAACCAGAAAAGTGGTACTCACAACAGGAAGAATCCATGGAAGACTTATCCAAAGTTTTTCAGAAACTAAACATCAAATATTCAGACAAGGAAA
ATCAAAAGAGCTTCACAACATTGATCGGCGATAACAAAGGGTCGTCAATGCACTTACTCTCCGGTGAAGCCAAAAGCGAAAGCCCAATCCACATCCACCGTCAATATAAG
AGCGATCCAGATCAAAGCCCTAAAAGTTCCACAGAAATCGAAGGAAATTTCAATAACGAAACACCGCAAGATTCAAGAACAGAGGAGAATCCATCACCCCTGGAATTATA
TATCAACGTCAATGTACAAGGTATCAACAACTCAATCATGTGCAATACCTCATTTATAGAGAATGATCCTGGAGTCAAGTTGAAATTCCCTCGAGAACCAACAAAATCTG
AAGATGAATTAGAGGCTCATCACGCTAGAAAAGCAGAATACAGTCCGAGACCGGCGGAGAAGCTTACGTATGAACCCAGAGTAAGACGAAGATGCCTCAGAGGGATGTTA
ATGGAGTCGAGCGATTCTGAGGTCGAGAATCCAGAAAAGTCTCGACGCCATGGCTGCCGGTATAGTCGTAATAGCAAAGGAAAAGAGGTCGAAACTCTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCACACTCACAATTGCGCATTCTACTTCCTTGGCAATCGTTAAAAGCTTCTCCTCATCCTGCAAATGAGTCGTCAGGATGGAGTTTTGAGCCTACAGATGAAACGGA
AACTTCTGCTTCTGCAGCTGATACCGTGCCAAATATTCGACATCAACCGGTACAGTCTCTTGAGATAAATCCAGAACAACCTCCTTTAGCACCAGCTCAGGCACTTGAAA
AAAGTGAAACTATGCCACCTTCAAAATCTCATAAGGCAAGCAAAGTTCAATCTCAGCCACCATCAAATTCTCGAGCCAAAAATCGGTCCCGAACAGCTTCCAAGCCTCCA
TCACAATCTAAAACAATCCCTCAATCTTCAGTTGCTTCCAAGTCTCCTTCAACATCAAGCAAAGGCTCCCCATCTCAGGATACTTCAAAGCCATCATCACCAGCAGGCAA
AGCCTCTCCATCTCAGGATGCTTCTTCAAAGCCTTCATCACCTGCAGCAGTTGCATCTACTGCTCCTCGAACCCGGATTGCTTTGAAGCCATCATCTCCATCGTCTCAAA
CATCCAGTAAAAGCCATCCAAATAAAAAACCAACATCACAATCAAGAATTAAAGCTGATTCTCAGCCTTCATCATCTTCAAGGTCAGCATTTCCATCTCAAGATTTTTCT
ATGCCACTGCGGTCGCCATCTCAAGAAAATTCTCGACAACAACCATTGGAAAAAACCTCTCGGGTTCAGTCTCCATCTCATTTGTCCAGTAAACCTACTGCACAATTAAC
ATCACAACAACCTATTAAGTCTCCAGCAGCCATTGGAACTCAAATCCATCCAAATTCAAAACCATCATCCCAATCAAGATTTAAAGCTGATTCTCAGCCTTCATCACCTT
CAAGGTCAGCATTTCCATCTCAAGATTCTTCTATGCCACCACGGTCGCCATCTCAAGAAATTTCTCGACAACAAGCATCGGAAAAAACCTCTCGAGTTCAGTCTCCATCT
CATTTGTCCGGTAAACCTACTGCACAATCAACAACACAACAACCTATTGAATCTCCTACAGCCATTGGAGACCAAACAACAGATGGAATCATTTCTCATCCCGCAAATCA
ATCCCCAAAAGCAAGACCTACAAACAGGGAAACTCAGTTGCAAACCAAATCAACGCAGCCTCTGAAACCAGACAGGAAACCAGTGGAATCGAAAGCATCAAAAAATCAGC
CTGAAACCAAGGAAGAGCTCACATCTAAGAACACTTCCAATCCCCATCCGAACCAGGACTATTCTGAAATCCCAACACAATCCGATCAAAACATGGAAAATGGCTTACAT
TCCTTTCTAGAATCACAGGCAGAGTCAAAAGAAACTAAGGAAGATCTGGCAAAGACAACCAATGCACTTCAAACCAAAGCATCTAGAAGCACATTAATCACATCTTCCAA
AAGTCGTTCATCATTTGAACCAGAAAAGTGGTACTCACAACAGGAAGAATCCATGGAAGACTTATCCAAAGTTTTTCAGAAACTAAACATCAAATATTCAGACAAGGAAA
ATCAAAAGAGCTTCACAACATTGATCGGCGATAACAAAGGGTCGTCAATGCACTTACTCTCCGGTGAAGCCAAAAGCGAAAGCCCAATCCACATCCACCGTCAATATAAG
AGCGATCCAGATCAAAGCCCTAAAAGTTCCACAGAAATCGAAGGAAATTTCAATAACGAAACACCGCAAGATTCAAGAACAGAGGAGAATCCATCACCCCTGGAATTATA
TATCAACGTCAATGTACAAGGTATCAACAACTCAATCATGTGCAATACCTCATTTATAGAGAATGATCCTGGAGTCAAGTTGAAATTCCCTCGAGAACCAACAAAATCTG
AAGATGAATTAGAGGCTCATCACGCTAGAAAAGCAGAATACAGTCCGAGACCGGCGGAGAAGCTTACGTATGAACCCAGAGTAAGACGAAGATGCCTCAGAGGGATGTTA
ATGGAGTCGAGCGATTCTGAGGTCGAGAATCCAGAAAAGTCTCGACGCCATGGCTGCCGGTATAGTCGTAATAGCAAAGGAAAAGAGGTCGAAACTCTGTAA
Protein sequenceShow/hide protein sequence
MSHSQLRILLPWQSLKASPHPANESSGWSFEPTDETETSASAADTVPNIRHQPVQSLEINPEQPPLAPAQALEKSETMPPSKSHKASKVQSQPPSNSRAKNRSRTASKPP
SQSKTIPQSSVASKSPSTSSKGSPSQDTSKPSSPAGKASPSQDASSKPSSPAAVASTAPRTRIALKPSSPSSQTSSKSHPNKKPTSQSRIKADSQPSSSSRSAFPSQDFS
MPLRSPSQENSRQQPLEKTSRVQSPSHLSSKPTAQLTSQQPIKSPAAIGTQIHPNSKPSSQSRFKADSQPSSPSRSAFPSQDSSMPPRSPSQEISRQQASEKTSRVQSPS
HLSGKPTAQSTTQQPIESPTAIGDQTTDGIISHPANQSPKARPTNRETQLQTKSTQPLKPDRKPVESKASKNQPETKEELTSKNTSNPHPNQDYSEIPTQSDQNMENGLH
SFLESQAESKETKEDLAKTTNALQTKASRSTLITSSKSRSSFEPEKWYSQQEESMEDLSKVFQKLNIKYSDKENQKSFTTLIGDNKGSSMHLLSGEAKSESPIHIHRQYK
SDPDQSPKSSTEIEGNFNNETPQDSRTEENPSPLELYINVNVQGINNSIMCNTSFIENDPGVKLKFPREPTKSEDELEAHHARKAEYSPRPAEKLTYEPRVRRRCLRGML
MESSDSEVENPEKSRRHGCRYSRNSKGKEVETL