CuGenDBv2

Tan0004839 (gene) of Snake gourd v1 genome

Gene ID: Tan0004839
Organism: Trichosanthes anguina (Snake gourd v1)
Description: cell wall protein RBR3-like
Genome location: LG02:88714583..88716955
RNA-Seq Expression: Tan0004839
Synteny: Tan0004839
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits | e value | %identity | Alignment
KAG6585587.1 hypothetical protein SDJN03_18320, partial [Cucurbita argyrosperma subsp. sororia] | 2.9e-234 | 73.97
Query:  MSYRQFRFRLPWQTLKAFARPEKESSRRSSEPKDEAETSASAADAVPYIRHLPVQSTDIKPEQPPPLEPAQAPESSETMPPSKSHKEAKVQSQPSSHSRA
        M+YRQFRFRLPWQ++KA +R E ESS RSSEP DEAETS SAAD VPY++H         P +  PLE AQAPE SETM PSKSHK+AKV SQPSSHSRA
Subjt:  MSYRQFRFRLPWQTLKAFARPEKESSRRSSEPKDEAETSASAADAVPYIRHLPVQSTDIKPEQPPPLEPAQAPESSETMPPSKSHKEAKVQSQPSSHSRA

Query:  KNQTRTASKPPSPSKATPQSSVASSKSPAASGKDSPSHDTSKPSSPAGKASPTHDTSKPSSPAGKVSQSHNTSKPSSPAGKASPTHDTSKPSSPAGKAFP
        K QTRTA+KPPS SK TPQSSV+S+KSP  S K SPSHD SKPSS AGK SP+HDTSK SSPAGK              GK SP+HDTSKPSSPAG    
Subjt:  KNQTRTASKPPSPSKATPQSSVASSKSPAASGKDSPSHDTSKPSSPAGKASPTHDTSKPSSPAGKVSQSHNTSKPSSPAGKASPTHDTSKPSSPAGKAFP

Query:  SPDASKPSSPAGKAFPSPDASKPSSAARKAFPSPDASKPSSPAAAAPRSRNASKPPSPSQTSSKNHLHSKPTSQSRLKADS-QPSSPSRSAFSPQASSIP
                                    KAFPS DAS+PSS AAAAPRS+  SKPPSPSQTSSKNH  SKPTSQSRLKADS QPSSPS  AFSPQASSIP
Subjt:  SPDASKPSSPAGKAFPSPDASKPSSAARKAFPSPDASKPSSPAAAAPRSRNASKPPSPSQTSSKNHLHSKPTSQSRLKADS-QPSSPSRSAFSPQASSIP

Query:  RSPSHENSRQQPSKKTSRVQSPSHLSSKPTAQSTSQQPIESPAAIGDQTTNRIVSHPANQSPQARPKSRESQSQTKSKQSPKPDLKPVESIASKDQTETK
        RSPSHENSRQQPSKK SRVQSPSHLSSKPTAQSTSQQ  ESPA I DQTT R+VSHPA+QSP AR K +E+Q QTKSKQSPKPDLKPVE  ASK Q ET 
Subjt:  RSPSHENSRQQPSKKTSRVQSPSHLSSKPTAQSTSQQPIESPAAIGDQTTNRIVSHPANQSPQARPKSRESQSQTKSKQSPKPDLKPVESIASKDQTETK

Query:  EDLTSKNSSYPHFDQDSSEIPIVIDQTIDNGLEPSLESQAESKENEEIKSDKEDLAKTTNALQTNASKSKLITSAEIPSPFEPEKRDSQQEETMEDLSKA
        E+  SKN+SYPH DQD SEIPI+ID+T +NG EPSLESQ ES+E++EIKS +EDL KTTNALQ NASKSKLITSAEI SPFEPE  DSQQE TMEDLSKA
Subjt:  EDLTSKNSSYPHFDQDSSEIPIVIDQTIDNGLEPSLESQAESKENEEIKSDKEDLAKTTNALQTNASKSKLITSAEIPSPFEPEKRDSQQEETMEDLSKA

Query:  FQKLNIKYPDEENPKSFTTLTGDNKGASMHLLSGEAKTESAIHIHRQYKSDPDQSPKSSTDIEGNFNHETPQDSRTEEDPPLELYINMNVQAINNSILSN
        FQ LNIKYP EENPKSFTTLTGDNKGASMHLLSGEA  ESAIHIHRQYKSDPD+ P+SSTDIEGN N ETPQDS+TEEDPPLELYIN+NVQ INNS+LSN
Subjt:  FQKLNIKYPDEENPKSFTTLTGDNKGASMHLLSGEAKTESAIHIHRQYKSDPDQSPKSSTDIEGNFNHETPQDSRTEEDPPLELYINMNVQAINNSILSN

Query:  SSFTENNPGIKLKFGREPTKSEDKSQSFHARKAKYTAKAAERLTYESTVRRRCLGGLLMESSDSELDNPEKPRRHGCRYR
        SSFTENNPGIKLKF  + TKSEDK  S HA+KAKYTAK  E  TYE TVRRRCL GLLMESSDS+ DN EKPRRHGCRYR
Subjt:  SSFTENNPGIKLKFGREPTKSEDKSQSFHARKAKYTAKAAERLTYESTVRRRCLGGLLMESSDSELDNPEKPRRHGCRYR

KAG7020502.1 hypothetical protein SDJN02_17187, partial [Cucurbita argyrosperma subsp. argyrosperma] | 9.9e-235 | 74.12
Query:  MSYRQFRFRLPWQTLKAFARPEKESSRRSSEPKDEAETSASAADAVPYIRHLPVQSTDIKPEQPPPLEPAQAPESSETMPPSKSHKEAKVQSQPSSHSRA
        M+YRQFRFRLPWQ++KA +R E ESS RSSEP DEAETS SAAD VPY++H         P +  PLE AQAPE SETM PSKSHK+AKV SQPSSHSRA
Subjt:  MSYRQFRFRLPWQTLKAFARPEKESSRRSSEPKDEAETSASAADAVPYIRHLPVQSTDIKPEQPPPLEPAQAPESSETMPPSKSHKEAKVQSQPSSHSRA

Query:  KNQTRTASKPPSPSKATPQSSVASSKSPAASGKDSPSHDTSKPSSPAGKASPTHDTSKPSSPAGKVSQSHNTSKPSSPAGKASPTHDTSKPSSPAGKAFP
        K QTRTA+KPPS SK TPQSSV+S+KSP  S K SPSHD SKPSS AGK SP+HDTSK SSPAGK              GK SP+HDTSKPSSPAG    
Subjt:  KNQTRTASKPPSPSKATPQSSVASSKSPAASGKDSPSHDTSKPSSPAGKASPTHDTSKPSSPAGKVSQSHNTSKPSSPAGKASPTHDTSKPSSPAGKAFP

Query:  SPDASKPSSPAGKAFPSPDASKPSSAARKAFPSPDASKPSSPAAAAPRSRNASKPPSPSQTSSKNHLHSKPTSQSRLKADS-QPSSPSRSAFSPQASSIP
                                    KAFPS DAS+PSS AAAAPRS+  SKPPSPSQTSSKNH  SKPTSQSRLKADS Q SSPSR AFSPQASSIP
Subjt:  SPDASKPSSPAGKAFPSPDASKPSSAARKAFPSPDASKPSSPAAAAPRSRNASKPPSPSQTSSKNHLHSKPTSQSRLKADS-QPSSPSRSAFSPQASSIP

Query:  RSPSHENSRQQPSKKTSRVQSPSHLSSKPTAQSTSQQPIESPAAIGDQTTNRIVSHPANQSPQARPKSRESQSQTKSKQSPKPDLKPVESIASKDQTETK
        RSPSHENSRQQPSKK S VQSPSHLSSKPTAQSTSQQ  ESPA IGDQTT R+VSHPA+QSP AR K RE+Q QTKSKQSPKPDLKPVE  ASK Q ET 
Subjt:  RSPSHENSRQQPSKKTSRVQSPSHLSSKPTAQSTSQQPIESPAAIGDQTTNRIVSHPANQSPQARPKSRESQSQTKSKQSPKPDLKPVESIASKDQTETK

Query:  EDLTSKNSSYPHFDQDSSEIPIVIDQTIDNGLEPSLESQAESKENEEIKSDKEDLAKTTNALQTNASKSKLITSAEIPSPFEPEKRDSQQEETMEDLSKA
        E+  SKN+SYPH DQD SEIPI+ID+T +NG EPSLESQ ES+E++EIKS +EDL KTTNALQ NASKSKLITSAEI SPFEPE  DSQQE TMEDLSKA
Subjt:  EDLTSKNSSYPHFDQDSSEIPIVIDQTIDNGLEPSLESQAESKENEEIKSDKEDLAKTTNALQTNASKSKLITSAEIPSPFEPEKRDSQQEETMEDLSKA

Query:  FQKLNIKYPDEENPKSFTTLTGDNKGASMHLLSGEAKTESAIHIHRQYKSDPDQSPKSSTDIEGNFNHETPQDSRTEEDPPLELYINMNVQAINNSILSN
        FQ LNIKYP EENPKSFTTLTGDNKGASMHLLSGEA  ESAIHIHRQYKSDPD+ P+SSTDIEGN N ETPQDS+TEEDPPLELYIN+NVQ INNS+LSN
Subjt:  FQKLNIKYPDEENPKSFTTLTGDNKGASMHLLSGEAKTESAIHIHRQYKSDPDQSPKSSTDIEGNFNHETPQDSRTEEDPPLELYINMNVQAINNSILSN

Query:  SSFTENNPGIKLKFGREPTKSEDKSQSFHARKAKYTAKAAERLTYESTVRRRCLGGLLMESSDSELDNPEKPRRHGCRYR
        SSFTENNPGIKLKF  + TKSEDK  S HA+KAKYTAK  E  TYE TVRRRCL GLLMESSDS+ DN EKPRRHGCRYR
Subjt:  SSFTENNPGIKLKFGREPTKSEDKSQSFHARKAKYTAKAAERLTYESTVRRRCLGGLLMESSDSELDNPEKPRRHGCRYR

XP_022951875.1 cell wall protein RBR3-like [Cucurbita moschata] | 7.9e-240 | 74.82
Query:  MSYRQFRFRLPWQTLKAFARPEKESSRRSSEPKDEAETSASAADAVPYIRHLPVQSTDIKPEQPPPLEPAQAPESSETMPPSKSHKEAKVQSQPSSHSRA
        M+YRQFRFRLPWQ++KA +R E ESS RSSEP DEAETS SAAD VPY++HLP  S         PLE AQAPE SETM PSKSHK+AKV SQPSSHSRA
Subjt:  MSYRQFRFRLPWQTLKAFARPEKESSRRSSEPKDEAETSASAADAVPYIRHLPVQSTDIKPEQPPPLEPAQAPESSETMPPSKSHKEAKVQSQPSSHSRA

Query:  KNQTRTASKPPSPSKATPQSSVASSKSPAASGKDSPSHDTSKPSSPAGKASPTHDTSKPSSPAGKVSQSHNTSKPSSPAGKASPTHDTSKPSSPAGKAFP
        K QTRTA+KPPS SK TPQSSV+S+KSP  S K SPSHD SKPSS AGK SP+HDTSK SSPAGK              GK SP+HDTSKPSSPAG    
Subjt:  KNQTRTASKPPSPSKATPQSSVASSKSPAASGKDSPSHDTSKPSSPAGKASPTHDTSKPSSPAGKVSQSHNTSKPSSPAGKASPTHDTSKPSSPAGKAFP

Query:  SPDASKPSSPAGKAFPSPDASKPSSAARKAFPSPDASKPSSPAAAAPRSRNASKPPSPSQTSSKNHLHSKPTSQSRLKADSQPSSPSRSAFSPQASSIPR
                                    KAFPS DAS+PSS AAAAPRS+  SKPPSPSQTSSKNH  SKPTSQSRLKADSQPSSPSR AFSPQASSIPR
Subjt:  SPDASKPSSPAGKAFPSPDASKPSSAARKAFPSPDASKPSSPAAAAPRSRNASKPPSPSQTSSKNHLHSKPTSQSRLKADSQPSSPSRSAFSPQASSIPR

Query:  SPSHENSRQQPSKKTSRVQSPSHLSSKPTAQSTSQQPIESPAAIGDQTTNRIVSHPANQSPQARPKSRESQSQTKSKQSPKPDLKPVESIASKDQTETKE
        SPSHENSRQQPSKK SRVQSPSHLSSKPTAQSTSQQ  ESPA IGDQTT R+VSHPA+QSP+AR K RE+Q QTKSKQSPKPDLKPVE  ASK Q ET E
Subjt:  SPSHENSRQQPSKKTSRVQSPSHLSSKPTAQSTSQQPIESPAAIGDQTTNRIVSHPANQSPQARPKSRESQSQTKSKQSPKPDLKPVESIASKDQTETKE

Query:  DLTSKNSSYPHFDQDSSEIPIVIDQTIDNGLEPSLESQAESKENEEIKSDKEDLAKTTNALQTNASKSKLITSAEIPSPFEPEKRDSQQEETMEDLSKAF
        +  SKN+SYPH DQD SEIPI+ID+TI+NG E SLESQ ES+E++EIKS +EDL KTTNALQ NASKSKLITSAEI SPFEPE  DSQQE TMEDLSKAF
Subjt:  DLTSKNSSYPHFDQDSSEIPIVIDQTIDNGLEPSLESQAESKENEEIKSDKEDLAKTTNALQTNASKSKLITSAEIPSPFEPEKRDSQQEETMEDLSKAF

Query:  QKLNIKYPDEENPKSFTTLTGDNKGASMHLLSGEAKTESAIHIHRQYKSDPDQSPKSSTDIEGNFNHETPQDSRTEEDPPLELYINMNVQAINNSILSNS
        Q LNIKYP EENPKSFTTLTGDNKGASMHLLSGEA  ES+IHIHRQYKSDPD+ P+SSTDIEGN N ETPQDS+TEEDPPLELYIN+NVQ INNS+LSNS
Subjt:  QKLNIKYPDEENPKSFTTLTGDNKGASMHLLSGEAKTESAIHIHRQYKSDPDQSPKSSTDIEGNFNHETPQDSRTEEDPPLELYINMNVQAINNSILSNS

Query:  SFTENNPGIKLKFGREPTKSEDKSQSFHARKAKYTAKAAERLTYESTVRRRCLGGLLMESSDSELDNPEKPRRHGCRYR
        SFTENNPGIKLKF  + TKSEDKS S  A+KAKYTAK  E  TYE TVRRRCLGGLLMESSDS+ DN EKPRRHGCRYR
Subjt:  SFTENNPGIKLKFGREPTKSEDKSQSFHARKAKYTAKAAERLTYESTVRRRCLGGLLMESSDSELDNPEKPRRHGCRYR

XP_023002262.1 cell wall protein RBR3-like [Cucurbita maxima] | 1.9e-238 | 74.23
Query:  MSYRQFRFRLPWQTLKAFARPEKESSRRSSEPKDEAETSASAADAVPYIRHLPVQSTDIKPEQPPPLEPAQAPESSETMPPSKSHKEAKVQSQPSSHSRA
        M+YRQFRFRLPWQ++KA +RPE ESS RSSEP DEAETS SAAD VPY++HLP+QS + KPE   PLE AQAPE SETM PSKSHK+AKV SQPSSHSRA
Subjt:  MSYRQFRFRLPWQTLKAFARPEKESSRRSSEPKDEAETSASAADAVPYIRHLPVQSTDIKPEQPPPLEPAQAPESSETMPPSKSHKEAKVQSQPSSHSRA

Query:  KNQTRTASKPPSPSKATPQSSVASSKSPAASGKDSPSHDTSKPSSPAGKASPTHDTSKPSSPAGKVSQSHNTSKPSSPAGKASPTHDTSKPSSPAGKAFP
        K QTRTA+KPPS SK TPQSSV+S+KSP  S K SPSHD SKPSS AGK SP+HDTSK SSPAGK              GK SP+ DTS PS PAG    
Subjt:  KNQTRTASKPPSPSKATPQSSVASSKSPAASGKDSPSHDTSKPSSPAGKASPTHDTSKPSSPAGKVSQSHNTSKPSSPAGKASPTHDTSKPSSPAGKAFP

Query:  SPDASKPSSPAGKAFPSPDASKPSSAARKAFPSPDASKPSSPAAAAPRSRNASKPPSPSQTSSKNHLHSKPTSQSRLKADSQPSSPSRSAFSPQASSIPR
                                    KAFPS DAS+PSS AAAAPRS   SKPPSPSQTSSKNHLHSK TSQSRLKADSQPSSPSR AFSPQASSIPR
Subjt:  SPDASKPSSPAGKAFPSPDASKPSSAARKAFPSPDASKPSSPAAAAPRSRNASKPPSPSQTSSKNHLHSKPTSQSRLKADSQPSSPSRSAFSPQASSIPR

Query:  SPSHENSRQQPSKKTSRVQSPSHLSSKPTAQSTSQQPIESPAAIGDQTTNRIVSHPANQSPQARPKSRESQSQTKSKQSPKPDLKPVESIASKDQTETKE
        SPSHENSRQQPSKK SRVQSPSHLSSK TAQSTSQQ  ESPA IGDQTT R+VSHPA+QSP+AR KS+E+Q QTKSKQSPKPDLKPVE  ASK Q ET E
Subjt:  SPSHENSRQQPSKKTSRVQSPSHLSSKPTAQSTSQQPIESPAAIGDQTTNRIVSHPANQSPQARPKSRESQSQTKSKQSPKPDLKPVESIASKDQTETKE

Query:  DLTSKNSSYPHFDQDSSEIPIVIDQTIDNGLEPSLESQAESKENEEIKSDKEDLAKTTNALQTNASKSKLITSAEIPSPFEPEKRDSQQEETMEDLSKAF
        +  SKN+SYP  ++D SEIPI+ID+TI+NG EPSLESQ ES+E++EIKS +EDL KTTNALQ NASKSKLITSAEI SPFEPE  DSQQE TMEDL KAF
Subjt:  DLTSKNSSYPHFDQDSSEIPIVIDQTIDNGLEPSLESQAESKENEEIKSDKEDLAKTTNALQTNASKSKLITSAEIPSPFEPEKRDSQQEETMEDLSKAF

Query:  QKLNIKYPDEENPKSFTTLTGDNKGASMHLLSGEAKTESAIHIHRQYKSDPDQSPKSSTDIEGNFNHETPQDSRTEEDPPLELYINMNVQAINNSILSNS
        Q LNIKYP EENPKSFTTLTGDNKGASMHL+SGEA  ES+IHIHRQYKSDPD+ P+SSTDIEGN N ETPQDS+TEEDPPLELYIN+NVQ INNS+LSNS
Subjt:  QKLNIKYPDEENPKSFTTLTGDNKGASMHLLSGEAKTESAIHIHRQYKSDPDQSPKSSTDIEGNFNHETPQDSRTEEDPPLELYINMNVQAINNSILSNS

Query:  SFTENNPGIKLKFGREPTKSEDKSQSFHARKAKYTAKAAERLTYESTVRRRCLGGLLMESSDSELDNPEKPRRHGCRYR
        SFTENNPGIKLKF  + TKSE+K  S  A+KAKYTAK  E  TYE TVRRRCLGGLLMESSDS+ DN EKPRRHGCRYR
Subjt:  SFTENNPGIKLKFGREPTKSEDKSQSFHARKAKYTAKAAERLTYESTVRRRCLGGLLMESSDSELDNPEKPRRHGCRYR

XP_023537866.1 cell wall protein RBR3-like [Cucurbita pepo subsp. pepo] | 1.1e-238 | 74.52
Query:  MSYRQFRFRLPWQTLKAFARPEKESSRRSSEPKDEAETSASAADAVPYIRHLPVQSTDIKPEQPPPLEPAQAPESSETMPPSKSHKEAKVQSQPSSHSRA
        M+ RQFRFRLPWQ++KA +RPE ESS RSSEP DEAETS SAAD VPY++HLP  S         PLE AQAPE+SETM PSKSHK+AKV SQPSSHSRA
Subjt:  MSYRQFRFRLPWQTLKAFARPEKESSRRSSEPKDEAETSASAADAVPYIRHLPVQSTDIKPEQPPPLEPAQAPESSETMPPSKSHKEAKVQSQPSSHSRA

Query:  KNQTRTASKPPSPSKATPQSSVASSKSPAASGKDSPSHDTSKPSSPAGKASPTHDTSKPSSPAGKVSQSHNTSKPSSPAGKASPTHDTSKPSSPAGKAFP
        K QTRTA+KPPS SK TPQSSV+S+KSP  S K SPSHD SKPSS AGK SP+HDTSK SSPAGK              GK SP+HDTSKPSSPAG    
Subjt:  KNQTRTASKPPSPSKATPQSSVASSKSPAASGKDSPSHDTSKPSSPAGKASPTHDTSKPSSPAGKVSQSHNTSKPSSPAGKASPTHDTSKPSSPAGKAFP

Query:  SPDASKPSSPAGKAFPSPDASKPSSAARKAFPSPDASKPSSPAAAAPRSRNASKPPSPSQTSSKNHLHSKPTSQSRLKADSQPSSPSRSAFSPQASSIPR
                                    KAFPS DAS+PSS AAAAPRS+  SKPPSPSQTSSKNH  SKPTSQSRLKADSQPSSPSR A SPQASSIPR
Subjt:  SPDASKPSSPAGKAFPSPDASKPSSAARKAFPSPDASKPSSPAAAAPRSRNASKPPSPSQTSSKNHLHSKPTSQSRLKADSQPSSPSRSAFSPQASSIPR

Query:  SPSHENSRQQPSKKTSRVQSPSHLSSKPTAQSTSQQPIESPAAIGDQTTNRIVSHPANQSPQARPKSRESQSQTKSKQSPKPDLKPVESIASKDQTETKE
        SPSHENSRQQPSKK SRVQSPSHLSSKPTAQSTSQQ  ESPA IGDQTT R+VSHPA+QSP+AR KSRE+Q QTKSKQSPKPDLKPVE  ASK Q ET E
Subjt:  SPSHENSRQQPSKKTSRVQSPSHLSSKPTAQSTSQQPIESPAAIGDQTTNRIVSHPANQSPQARPKSRESQSQTKSKQSPKPDLKPVESIASKDQTETKE

Query:  DLTSKNSSYPHFDQDSSEIPIVIDQTIDNGLEPSLESQAESKENEEIKSDKEDLAKTTNALQTNASKSKLITSAEIPSPFEPEKRDSQQEETMEDLSKAF
        +  SKN+SYPH DQD SEIPI+ID+TI+NG E SLESQ ES+E++EIKS +EDL KTTNALQ NASKSKLITSAEI SPFEPE  DSQQE TMEDLSKAF
Subjt:  DLTSKNSSYPHFDQDSSEIPIVIDQTIDNGLEPSLESQAESKENEEIKSDKEDLAKTTNALQTNASKSKLITSAEIPSPFEPEKRDSQQEETMEDLSKAF

Query:  QKLNIKYPDEENPKSFTTLTGDNKGASMHLLSGEAKTESAIHIHRQYKSDPDQSPKSSTDIEGNFNHETPQDSRTEEDPPLELYINMNVQAINNSILSNS
        Q LNIKYP EENPKSFTTLTGDNKGASMHLLSGEA  ES+IHIHRQYKSDPD+ P+SSTDIEGN N ETPQDS+TEEDPPLELYIN+NVQ INNS+LSNS
Subjt:  QKLNIKYPDEENPKSFTTLTGDNKGASMHLLSGEAKTESAIHIHRQYKSDPDQSPKSSTDIEGNFNHETPQDSRTEEDPPLELYINMNVQAINNSILSNS

Query:  SFTENNPGIKLKFGREPTKSEDKSQSFHARKAKYTAKAAERLTYESTVRRRCLGGLLMESSDSELDNPEKPRRHGCRYR
        SFTENNPGIKL F  + TKSEDK  S  A+KAKYTAK  E   YESTVRRRCLGGLLMESSDS+ DN EKPRRHGCRYR
Subjt:  SFTENNPGIKLKFGREPTKSEDKSQSFHARKAKYTAKAAERLTYESTVRRRCLGGLLMESSDSELDNPEKPRRHGCRYR

TrEMBL top hits | e value | %identity | Alignment
A0A1S4DVD0 micronuclear linker histone polyprotein | 2.2e-118 | 65.32
Query:  PRSPSHENSRQQPSKKTSRVQSPSHLSSKPTAQSTSQQPIESPAAIGDQTTNRIVSHPANQSPQARPKSRESQSQTKSKQSPKPDLKPVESIASKDQTET
        PRSPS ENS Q PS+KTSRVQSPS+LS KPTA STSQQPIES A+IGDQTT+ I+S PA  SP+A P S E Q Q KSK+SP+P++KPVE  ASK+Q +T
Subjt:  PRSPSHENSRQQPSKKTSRVQSPSHLSSKPTAQSTSQQPIESPAAIGDQTTNRIVSHPANQSPQARPKSRESQSQTKSKQSPKPDLKPVESIASKDQTET

Query:  KEDLT----------SKNSSYPHFDQDSSEIPIVIDQTIDNGLEPSLESQAESKENEEIKSDKEDLAKTTNALQTNASKSKLITSAEIPSPFEPEKRDSQ
        KE+LT          SKN+S PH D+DSSE P   D+T++ GL+ SLESQ ESKE       KED  KTTNALQ  AS+S LITS++  S FEPEK ++Q
Subjt:  KEDLT----------SKNSSYPHFDQDSSEIPIVIDQTIDNGLEPSLESQAESKENEEIKSDKEDLAKTTNALQTNASKSKLITSAEIPSPFEPEKRDSQ

Query:  QEETMEDLSKAFQKLNIKYPDEENPKSFTTLTGDNKGASMHLLSGEAKTESAIHIHRQYKSDPDQSPKSSTDIEGNFNHETPQDSRTEED---PPLELYI
        Q+E+MEDLSKAF KLNIKY DEENPKSFTT+ GDNKG+S+HLLSGEAK+ES+IH++ +YKS+PDQSPKSST+I+ N N+ETPQDS TEE+   PPLELYI
Subjt:  QEETMEDLSKAFQKLNIKYPDEENPKSFTTLTGDNKGASMHLLSGEAKTESAIHIHRQYKSDPDQSPKSSTDIEGNFNHETPQDSRTEED---PPLELYI

Query:  NMNVQAINNSILSNSSFTENNPGIKLKF--GREPTKSEDKSQSFHARKAKYTAKAAERLTYESTVRRRCLGGLLMESSDSELDNPEKPRRHGCRY
        N NVQ INNSI+ N+SFTENNPGIKLKF    EPT S+D+ +S H RK+ Y    AE++TYE  +RRR LGGLLMES DSE +NP K R HGCRY
Subjt:  NMNVQAINNSILSNSSFTENNPGIKLKF--GREPTKSEDKSQSFHARKAKYTAKAAERLTYESTVRRRCLGGLLMESSDSELDNPEKPRRHGCRY

A0A5A7VAN0 Flocculation protein FLO11 | 1.6e-161 | 53.75
Query:  MSYRQFRFRLPWQTLKAFARPEKESSRRSSEPKDEAETSASAADAVPYIRHLPVQSTDIKPEQPPPLEPAQAPESSETMPPSKSHKEAKVQSQPSSHSRA
        MS  Q R  LPWQ+LKA  RP  ES   S  P DE+E+SAS AD  P IRH P QS +IKPE+ PPL  AQA E SETMPPSKSHKE K+ SQ S++SRA
Subjt:  MSYRQFRFRLPWQTLKAFARPEKESSRRSSEPKDEAETSASAADAVPYIRHLPVQSTDIKPEQPPPLEPAQAPESSETMPPSKSHKEAKVQSQPSSHSRA

Query:  KNQTRTASKPPSPSKATPQSSVASSKSPAASGKDSPSHDTSKPSSPAGKA-SPTHD-TSKPSSPA--------------GKVSQSHNTSKPSSPAGKASP
        KN++RTASKP SP  A PQS +AS+K P+ SGK S S D+SKPSSPAGK  SP+ D +SKPSSPA                 SQ+ N   PSS       
Subjt:  KNQTRTASKPPSPSKATPQSSVASSKSPAASGKDSPSHDTSKPSSPAGKA-SPTHD-TSKPSSPA--------------GKVSQSHNTSKPSSPAGKASP

Query:  THDTSKPSSPAGKAFPSPDASKP-------------------------------------------------SSPAGKAFPSPDA----SKPSSAARKAF
            S+PSSP+  AFPS D S P                                                 S P  KA PS       S+PS ++R  F
Subjt:  THDTSKPSSPAGKAFPSPDASKP-------------------------------------------------SSPAGKAFPSPDA----SKPSSAARKAF

Query:  PSPDASKP----------------------SSPAAAAPRSRNASKPP--SPSQTSSKNHLHSKPTSQSRLKADSQPSSPSRSAFSPQASSI-PRSPSHEN
        PS D S P                       S ++  P +++ SK P  SP+    ++H + KP+SQSR KA+S+PSS S+S F  Q SS+ PRSPS EN
Subjt:  PSPDASKP----------------------SSPAAAAPRSRNASKPP--SPSQTSSKNHLHSKPTSQSRLKADSQPSSPSRSAFSPQASSI-PRSPSHEN

Query:  SRQQPSKKTSRVQSPSHLSSKPTAQSTSQQPIESPAAIGDQTTNRIVSHPANQSPQARPKSRESQSQTKSKQSPKPDLKPVESIASKDQTETKEDLT---
        S Q PS+KTSRVQSPS+LS KPTA STSQQPIES A+IGDQTT+ I+S PA  SP+A P S E Q Q KSK+SP+P++KPVE  ASK+Q +TKE+LT   
Subjt:  SRQQPSKKTSRVQSPSHLSSKPTAQSTSQQPIESPAAIGDQTTNRIVSHPANQSPQARPKSRESQSQTKSKQSPKPDLKPVESIASKDQTETKEDLT---

Query:  -------SKNSSYPHFDQDSSEIPIVIDQTIDNGLEPSLESQAESKENEEIKSDKEDLAKTTNALQTNASKSKLITSAEIPSPFEPEKRDSQQEETMEDL
               SKN+S PH D+DSSE P   D+T++ GL+ SLESQ ESKE       KED  KTTNALQ  AS+S LITS++  S FEPEK ++QQ+E+MEDL
Subjt:  -------SKNSSYPHFDQDSSEIPIVIDQTIDNGLEPSLESQAESKENEEIKSDKEDLAKTTNALQTNASKSKLITSAEIPSPFEPEKRDSQQEETMEDL

Query:  SKAFQKLNIKYPDEENPKSFTTLTGDNKGASMHLLSGEAKTESAIHIHRQYKSDPDQSPKSSTDIEGNFNHETPQDSRTEED---PPLELYINMNVQAIN
        SKAF KLNIKY DEENPKSFTT+ GDNKG+S+HLLSGEAK+ES+IH++ +YKS+PDQSPKSST+I+ N N+ETPQDS TEE+   PPLELYIN NVQ IN
Subjt:  SKAFQKLNIKYPDEENPKSFTTLTGDNKGASMHLLSGEAKTESAIHIHRQYKSDPDQSPKSSTDIEGNFNHETPQDSRTEED---PPLELYINMNVQAIN

Query:  NSILSNSSFTENNPGIKLKF--GREPTKSEDKSQSFHARKAKYTAKAAERLTYESTVRRRCLGGLLMESSDSELDNPEKPRRHGCRY
        NSI+ N+SFTENNPGIKLKF    EPT S+D+ +S H RK+ Y    AE++TYE  +RRR LGGLLMES DSE +NP K R HGCRY
Subjt:  NSILSNSSFTENNPGIKLKF--GREPTKSEDKSQSFHARKAKYTAKAAERLTYESTVRRRCLGGLLMESSDSELDNPEKPRRHGCRY

A0A6J1CRH0 cell wall protein RBR3 | 5.0e-131 | 56.29
Query:  MPPSKSHKEAKVQSQPSSHSRAKNQTRTASKPPSPSKATPQSSVASSKSPAASGKDSPSHDTSKPSSPAGKASPTHDTSKPSSPAGKVSQSHNTSKPSSP
        MPPS+S KE++V S   S+SRAKNQ R ASK PS  KA+P  +VAS+KS                               PSSPA               
Subjt:  MPPSKSHKEAKVQSQPSSHSRAKNQTRTASKPPSPSKATPQSSVASSKSPAASGKDSPSHDTSKPSSPAGKASPTHDTSKPSSPAGKVSQSHNTSKPSSP

Query:  AGKASPTHDTSKPSSPAGKAFPSPDASKPSSPAGKAFPSPDASKPSSAARKAFPSPDASKPSSPAAAAPRSRNASKPPSPSQTSSKNHLHSKPTS-QSRL
        +GKAS                                                PS DASKPSSPA AAPR R +S PPSPSQTSS+NHL+ KPTS QS+L
Subjt:  AGKASPTHDTSKPSSPAGKAFPSPDASKPSSPAGKAFPSPDASKPSSAARKAFPSPDASKPSSPAAAAPRSRNASKPPSPSQTSSKNHLHSKPTS-QSRL

Query:  KADSQPSSPSRSAFSPQASSIPRSPSHENSRQQPSKKTSRVQSPSHLSSKPTAQSTSQQPIESPAAIGDQTTNRIVSHPANQSPQARPKSRESQSQTKSK
        KADSQPSS SRSAF PQASS  RSPS  NS+QQPSKKT             TAQSTS Q     AA  DQTTN + SH AN+S QARPK RESQSQTKSK
Subjt:  KADSQPSSPSRSAFSPQASSIPRSPSHENSRQQPSKKTSRVQSPSHLSSKPTAQSTSQQPIESPAAIGDQTTNRIVSHPANQSPQARPKSRESQSQTKSK

Query:  QSPKPDLKPVESIASKDQTETKEDLTSKNSSYPHFDQDSSEIPIVIDQTIDNGLEPSLESQAESKENEEIKSDKEDLAKTTNALQTNASKSKLITSAEIP
        QSPK       S ASK+Q + KE+LTSKN+S P  +Q+SSE P   DQ+I+NG +PSLESQAESKE+EE K       K TNA  T    S LI+S+E  
Subjt:  QSPKPDLKPVESIASKDQTETKEDLTSKNSSYPHFDQDSSEIPIVIDQTIDNGLEPSLESQAESKENEEIKSDKEDLAKTTNALQTNASKSKLITSAEIP

Query:  SPF-EPEKRDSQQEETM--EDLSKAFQKLNIKYPDEENPKSFTTLTGDNKGASMHLLSGEAKTESAIHIHRQYKSDPDQSPKSSTDIEGNFNHETPQDSR
        SP+ EPE RDSQ++E M   D+SKAF KL I Y  EENPKSF TL GDNKG SM+LLSG+   ES+IHI R+Y+S+PDQSPKSST+IEGNFNH+T +DSR
Subjt:  SPF-EPEKRDSQQEETM--EDLSKAFQKLNIKYPDEENPKSFTTLTGDNKGASMHLLSGEAKTESAIHIHRQYKSDPDQSPKSSTDIEGNFNHETPQDSR

Query:  TEEDPPLELYINMNVQAINNSILSNSSFTENNPGIKLKFGREPTKSEDKSQSFHARKAKYTAKAAERLTYESTVRRRCLGGLLMESSDSELDNPEKPRRH
        T EDPPL LYIN N Q INNSILSNSSFTE NPG +LKF REPTKSE+ S+S   +KAKY AK AERLTY+ TVRRRCL GL MESSDSE +NPEKPRRH
Subjt:  TEEDPPLELYINMNVQAINNSILSNSSFTENNPGIKLKFGREPTKSEDKSQSFHARKAKYTAKAAERLTYESTVRRRCLGGLLMESSDSELDNPEKPRRH

Query:  GCRY
        GCRY
Subjt:  GCRY

A0A6J1GK50 cell wall protein RBR3-like | 3.8e-240 | 74.82
Query:  MSYRQFRFRLPWQTLKAFARPEKESSRRSSEPKDEAETSASAADAVPYIRHLPVQSTDIKPEQPPPLEPAQAPESSETMPPSKSHKEAKVQSQPSSHSRA
        M+YRQFRFRLPWQ++KA +R E ESS RSSEP DEAETS SAAD VPY++HLP  S         PLE AQAPE SETM PSKSHK+AKV SQPSSHSRA
Subjt:  MSYRQFRFRLPWQTLKAFARPEKESSRRSSEPKDEAETSASAADAVPYIRHLPVQSTDIKPEQPPPLEPAQAPESSETMPPSKSHKEAKVQSQPSSHSRA

Query:  KNQTRTASKPPSPSKATPQSSVASSKSPAASGKDSPSHDTSKPSSPAGKASPTHDTSKPSSPAGKVSQSHNTSKPSSPAGKASPTHDTSKPSSPAGKAFP
        K QTRTA+KPPS SK TPQSSV+S+KSP  S K SPSHD SKPSS AGK SP+HDTSK SSPAGK              GK SP+HDTSKPSSPAG    
Subjt:  KNQTRTASKPPSPSKATPQSSVASSKSPAASGKDSPSHDTSKPSSPAGKASPTHDTSKPSSPAGKVSQSHNTSKPSSPAGKASPTHDTSKPSSPAGKAFP

Query:  SPDASKPSSPAGKAFPSPDASKPSSAARKAFPSPDASKPSSPAAAAPRSRNASKPPSPSQTSSKNHLHSKPTSQSRLKADSQPSSPSRSAFSPQASSIPR
                                    KAFPS DAS+PSS AAAAPRS+  SKPPSPSQTSSKNH  SKPTSQSRLKADSQPSSPSR AFSPQASSIPR
Subjt:  SPDASKPSSPAGKAFPSPDASKPSSAARKAFPSPDASKPSSPAAAAPRSRNASKPPSPSQTSSKNHLHSKPTSQSRLKADSQPSSPSRSAFSPQASSIPR

Query:  SPSHENSRQQPSKKTSRVQSPSHLSSKPTAQSTSQQPIESPAAIGDQTTNRIVSHPANQSPQARPKSRESQSQTKSKQSPKPDLKPVESIASKDQTETKE
        SPSHENSRQQPSKK SRVQSPSHLSSKPTAQSTSQQ  ESPA IGDQTT R+VSHPA+QSP+AR K RE+Q QTKSKQSPKPDLKPVE  ASK Q ET E
Subjt:  SPSHENSRQQPSKKTSRVQSPSHLSSKPTAQSTSQQPIESPAAIGDQTTNRIVSHPANQSPQARPKSRESQSQTKSKQSPKPDLKPVESIASKDQTETKE

Query:  DLTSKNSSYPHFDQDSSEIPIVIDQTIDNGLEPSLESQAESKENEEIKSDKEDLAKTTNALQTNASKSKLITSAEIPSPFEPEKRDSQQEETMEDLSKAF
        +  SKN+SYPH DQD SEIPI+ID+TI+NG E SLESQ ES+E++EIKS +EDL KTTNALQ NASKSKLITSAEI SPFEPE  DSQQE TMEDLSKAF
Subjt:  DLTSKNSSYPHFDQDSSEIPIVIDQTIDNGLEPSLESQAESKENEEIKSDKEDLAKTTNALQTNASKSKLITSAEIPSPFEPEKRDSQQEETMEDLSKAF

Query:  QKLNIKYPDEENPKSFTTLTGDNKGASMHLLSGEAKTESAIHIHRQYKSDPDQSPKSSTDIEGNFNHETPQDSRTEEDPPLELYINMNVQAINNSILSNS
        Q LNIKYP EENPKSFTTLTGDNKGASMHLLSGEA  ES+IHIHRQYKSDPD+ P+SSTDIEGN N ETPQDS+TEEDPPLELYIN+NVQ INNS+LSNS
Subjt:  QKLNIKYPDEENPKSFTTLTGDNKGASMHLLSGEAKTESAIHIHRQYKSDPDQSPKSSTDIEGNFNHETPQDSRTEEDPPLELYINMNVQAINNSILSNS

Query:  SFTENNPGIKLKFGREPTKSEDKSQSFHARKAKYTAKAAERLTYESTVRRRCLGGLLMESSDSELDNPEKPRRHGCRYR
        SFTENNPGIKLKF  + TKSEDKS S  A+KAKYTAK  E  TYE TVRRRCLGGLLMESSDS+ DN EKPRRHGCRYR
Subjt:  SFTENNPGIKLKFGREPTKSEDKSQSFHARKAKYTAKAAERLTYESTVRRRCLGGLLMESSDSELDNPEKPRRHGCRYR

A0A6J1KJ10 cell wall protein RBR3-like | 9.4e-239 | 74.23
Query:  MSYRQFRFRLPWQTLKAFARPEKESSRRSSEPKDEAETSASAADAVPYIRHLPVQSTDIKPEQPPPLEPAQAPESSETMPPSKSHKEAKVQSQPSSHSRA
        M+YRQFRFRLPWQ++KA +RPE ESS RSSEP DEAETS SAAD VPY++HLP+QS + KPE   PLE AQAPE SETM PSKSHK+AKV SQPSSHSRA
Subjt:  MSYRQFRFRLPWQTLKAFARPEKESSRRSSEPKDEAETSASAADAVPYIRHLPVQSTDIKPEQPPPLEPAQAPESSETMPPSKSHKEAKVQSQPSSHSRA

Query:  KNQTRTASKPPSPSKATPQSSVASSKSPAASGKDSPSHDTSKPSSPAGKASPTHDTSKPSSPAGKVSQSHNTSKPSSPAGKASPTHDTSKPSSPAGKAFP
        K QTRTA+KPPS SK TPQSSV+S+KSP  S K SPSHD SKPSS AGK SP+HDTSK SSPAGK              GK SP+ DTS PS PAG    
Subjt:  KNQTRTASKPPSPSKATPQSSVASSKSPAASGKDSPSHDTSKPSSPAGKASPTHDTSKPSSPAGKVSQSHNTSKPSSPAGKASPTHDTSKPSSPAGKAFP

Query:  SPDASKPSSPAGKAFPSPDASKPSSAARKAFPSPDASKPSSPAAAAPRSRNASKPPSPSQTSSKNHLHSKPTSQSRLKADSQPSSPSRSAFSPQASSIPR
                                    KAFPS DAS+PSS AAAAPRS   SKPPSPSQTSSKNHLHSK TSQSRLKADSQPSSPSR AFSPQASSIPR
Subjt:  SPDASKPSSPAGKAFPSPDASKPSSAARKAFPSPDASKPSSPAAAAPRSRNASKPPSPSQTSSKNHLHSKPTSQSRLKADSQPSSPSRSAFSPQASSIPR

Query:  SPSHENSRQQPSKKTSRVQSPSHLSSKPTAQSTSQQPIESPAAIGDQTTNRIVSHPANQSPQARPKSRESQSQTKSKQSPKPDLKPVESIASKDQTETKE
        SPSHENSRQQPSKK SRVQSPSHLSSK TAQSTSQQ  ESPA IGDQTT R+VSHPA+QSP+AR KS+E+Q QTKSKQSPKPDLKPVE  ASK Q ET E
Subjt:  SPSHENSRQQPSKKTSRVQSPSHLSSKPTAQSTSQQPIESPAAIGDQTTNRIVSHPANQSPQARPKSRESQSQTKSKQSPKPDLKPVESIASKDQTETKE

Query:  DLTSKNSSYPHFDQDSSEIPIVIDQTIDNGLEPSLESQAESKENEEIKSDKEDLAKTTNALQTNASKSKLITSAEIPSPFEPEKRDSQQEETMEDLSKAF
        +  SKN+SYP  ++D SEIPI+ID+TI+NG EPSLESQ ES+E++EIKS +EDL KTTNALQ NASKSKLITSAEI SPFEPE  DSQQE TMEDL KAF
Subjt:  DLTSKNSSYPHFDQDSSEIPIVIDQTIDNGLEPSLESQAESKENEEIKSDKEDLAKTTNALQTNASKSKLITSAEIPSPFEPEKRDSQQEETMEDLSKAF

Query:  QKLNIKYPDEENPKSFTTLTGDNKGASMHLLSGEAKTESAIHIHRQYKSDPDQSPKSSTDIEGNFNHETPQDSRTEEDPPLELYINMNVQAINNSILSNS
        Q LNIKYP EENPKSFTTLTGDNKGASMHL+SGEA  ES+IHIHRQYKSDPD+ P+SSTDIEGN N ETPQDS+TEEDPPLELYIN+NVQ INNS+LSNS
Subjt:  QKLNIKYPDEENPKSFTTLTGDNKGASMHLLSGEAKTESAIHIHRQYKSDPDQSPKSSTDIEGNFNHETPQDSRTEEDPPLELYINMNVQAINNSILSNS

Query:  SFTENNPGIKLKFGREPTKSEDKSQSFHARKAKYTAKAAERLTYESTVRRRCLGGLLMESSDSELDNPEKPRRHGCRYR
        SFTENNPGIKLKF  + TKSE+K  S  A+KAKYTAK  E  TYE TVRRRCLGGLLMESSDS+ DN EKPRRHGCRYR
Subjt:  SFTENNPGIKLKFGREPTKSEDKSQSFHARKAKYTAKAAERLTYESTVRRRCLGGLLMESSDSELDNPEKPRRHGCRYR

SwissProt top hits | e value | %identity | Alignment
No hits found
Arabidopsis top hits | e value | %identity | Alignment
AT1G75260.1 oxidoreductases, acting on NADH or NADPH | 4.3e-18 | 29.07
Query:  SPTHDTSKPSSPAGKAFPSPDASKPSSPAGKAFPSPDASKPSSAARKAFPSPDASKPSSPAAAAPRSRNASKPPS-----PSQTSSKNHLHSKPTSQSRL
        SP+  +S  SSP+    P+P +  P  PAG A PS   +KP +       SP  S+  S  AA   S +AS+ PS     P++ + + +  S   S+   
Subjt:  SPTHDTSKPSSPAGKAFPSPDASKPSSPAGKAFPSPDASKPSSAARKAFPSPDASKPSSPAAAAPRSRNASKPPS-----PSQTSSKNHLHSKPTSQSRL

Query:  KADSQPSSPSRSAFSPQASSIPRSPSHENSRQQPSKKTSRVQSPSHLSSKPTAQSTSQQPIESPAAIGDQTTNRIV------SHPANQSPQARPKSRESQ
        K DS      + A   +        + EN      K     +   HL  K T     Q      A +  Q   +++         ANQ  Q   +  +  
Subjt:  KADSQPSSPSRSAFSPQASSIPRSPSHENSRQQPSKKTSRVQSPSHLSSKPTAQSTSQQPIESPAAIGDQTTNRIV------SHPANQSPQARPKSRESQ

Query:  SQTKSKQSPKPDLKPVESIASKDQTETKEDLT---SKNSSYPHFDQDSSEIP----IVIDQTIDNGLEPSLESQAESKENEEIKSDKEDLAKTTNALQTN
         Q + K      ++ +E+   + +++  E L    +K S      +D++       +        G     E + E++   EI +D ++  KT  AL +N
Subjt:  SQTKSKQSPKPDLKPVESIASKDQTETKEDLT---SKNSSYPHFDQDSSEIP----IVIDQTIDNGLEPSLESQAESKENEEIKSDKEDLAKTTNALQTN

Query:  ASKSKLITSAEIPSPFEPEKRDSQQEETMEDLSKAFQKLNI-KYPDEENPKSFTTLTGDNKGASMHLLSGEAKTESAIHIHRQYKSDPDQSPKSSTDIEG
                    P     E   S   +  ED+     KL   K   ++   S  TLTG+NKGA+M + S + K +  +HI R Y+S+PD+S  ++     
Subjt:  ASKSKLITSAEIPSPFEPEKRDSQQEETMEDLSKAFQKLNI-KYPDEENPKSFTTLTGDNKGASMHLLSGEAKTESAIHIHRQYKSDPDQSPKSSTDIEG

Query:  NFNHETPQDSRTEEDPPLELYINMNVQAINNSILSNSSFTENNPGIKLKFGREPTKSEDKSQSFHARKAK-YTAKAAERLTYESTVRRRCLGGLLMESSD
            E P+D   EE+     YIN N Q INNSI+  SS +EN+PG+ + F  E  K E      +  + K  T    ++L  E  VRRRCL GLL ESS+
Subjt:  NFNHETPQDSRTEEDPPLELYINMNVQAINNSILSNSSFTENNPGIKLKFGREPTKSEDKSQSFHARKAK-YTAKAAERLTYESTVRRRCLGGLLMESSD

Query:  SELDNPEKPRRHGCRY
        SE DNP KPRRHGCR+
Subjt:  SELDNPEKPRRHGCRY


Sequences
CDS sequence
ATGTCATACCGACAGTTTCGCTTTCGACTTCCTTGGCAAACTCTCAAAGCTTTCGCTCGTCCTGAAAAGGAGTCATCAAGACGCAGTTCTGAGCCTAAAGATGAAGCTGA
AACTTCTGCTTCAGCAGCCGATGCCGTGCCATATATTCGGCATCTACCAGTCCAATCTACTGACATAAAACCTGAACAGCCTCCTCCTTTAGAACCAGCTCAGGCACCTG
AAAGTAGTGAAACTATGCCACCTTCAAAATCTCACAAGGAAGCCAAAGTTCAATCTCAACCATCATCACATTCCCGAGCCAAAAATCAGACCCGAACGGCTTCCAAGCCT
CCATCGCCATCAAAAGCTACCCCGCAATCTTCAGTTGCTTCCAGCAAGTCTCCAGCAGCATCAGGCAAAGACTCTCCATCTCACGATACTTCAAAGCCTTCATCACCAGC
AGGCAAAGCCTCTCCAACTCATGATACTTCAAAGCCTTCATCACCAGCAGGCAAAGTCTCTCAATCTCATAATACTTCAAAGCCTTCATCACCAGCAGGCAAAGCCTCTC
CAACTCATGATACTTCAAAGCCTTCATCACCAGCAGGCAAAGCCTTTCCGTCTCCGGATGCTTCAAAGCCTTCATCGCCAGCAGGGAAAGCCTTTCCGTCTCCGGATGCA
TCAAAGCCTTCATCAGCAGCAAGGAAAGCCTTTCCGTCTCCGGATGCTTCAAAGCCTTCATCACCTGCAGCTGCAGCTCCTCGATCCCGAAATGCTTCGAAGCCCCCATC
TCCATCTCAAACATCCAGTAAAAACCATCTACATTCAAAACCAACATCACAATCAAGATTGAAAGCTGATTCTCAACCTTCATCACCTTCACGGTCAGCATTTTCACCTC
AAGCTTCTTCTATACCACGGTCCCCATCTCATGAAAATTCTCGACAACAACCATCGAAAAAGACCTCTCGGGTTCAATCTCCATCTCATTTGTCCAGTAAACCTACTGCA
CAATCAACATCACAACAGCCCATTGAATCTCCTGCTGCCATTGGAGACCAAACAACAAATAGAATTGTCTCTCATCCCGCAAATCAATCGCCACAAGCAAGACCAAAAAG
CAGGGAAAGTCAGTCACAAACCAAATCAAAGCAGTCACCAAAACCAGACTTGAAACCAGTGGAATCCATAGCATCAAAAGATCAGACCGAAACCAAGGAAGATCTCACAT
CTAAGAACAGTTCCTATCCCCATTTCGACCAGGACTCTTCTGAAATCCCAATAGTAATCGATCAAACCATTGATAATGGCCTAGAGCCCTCTCTAGAATCACAGGCAGAA
TCAAAAGAAAATGAGGAAATAAAGAGTGACAAGGAAGATCTGGCAAAGACAACCAATGCACTTCAAACCAATGCATCTAAAAGCAAATTAATCACATCTGCTGAAATCCC
TTCACCGTTTGAACCAGAAAAGAGGGACTCACAACAGGAAGAAACCATGGAAGACTTATCAAAAGCTTTTCAGAAACTAAACATCAAATATCCAGACGAAGAAAATCCAA
AGAGTTTCACAACACTCACCGGCGATAACAAAGGGGCGTCAATGCACTTACTCTCCGGCGAAGCCAAAACAGAAAGTGCAATCCACATCCACCGTCAGTATAAGAGCGAT
CCAGATCAAAGCCCTAAAAGTTCCACAGACATCGAAGGAAATTTCAATCACGAAACACCTCAAGATTCAAGAACAGAAGAGGATCCACCCCTGGAATTATACATAAACAT
GAATGTACAAGCTATCAACAACTCAATCCTGTCGAATAGCTCATTTACTGAGAATAATCCTGGAATCAAGTTGAAATTCGGTCGAGAACCAACTAAATCTGAAGATAAAT
CACAGTCATTCCACGCTCGAAAGGCGAAATATACAGCGAAAGCTGCAGAGAGGCTTACCTATGAATCCACAGTAAGACGAAGATGCCTCGGAGGGCTGTTAATGGAGTCG
AGCGATTCTGAGCTCGACAATCCAGAAAAGCCCCGACGCCATGGCTGCCGCTACAGGACGTAA
mRNA sequence
CTTTTGGACAGGATAACTTCCATGACAAACTCATCATTCTTTTATTTTTAGTTATGTCATACCGACAGTTTCGCTTTCGACTTCCTTGGCAAACTCTCAAAGCTTTCGCT
CGTCCTGAAAAGGAGTCATCAAGACGCAGTTCTGAGCCTAAAGATGAAGCTGAAACTTCTGCTTCAGCAGCCGATGCCGTGCCATATATTCGGCATCTACCAGTCCAATC
TACTGACATAAAACCTGAACAGCCTCCTCCTTTAGAACCAGCTCAGGCACCTGAAAGTAGTGAAACTATGCCACCTTCAAAATCTCACAAGGAAGCCAAAGTTCAATCTC
AACCATCATCACATTCCCGAGCCAAAAATCAGACCCGAACGGCTTCCAAGCCTCCATCGCCATCAAAAGCTACCCCGCAATCTTCAGTTGCTTCCAGCAAGTCTCCAGCA
GCATCAGGCAAAGACTCTCCATCTCACGATACTTCAAAGCCTTCATCACCAGCAGGCAAAGCCTCTCCAACTCATGATACTTCAAAGCCTTCATCACCAGCAGGCAAAGT
CTCTCAATCTCATAATACTTCAAAGCCTTCATCACCAGCAGGCAAAGCCTCTCCAACTCATGATACTTCAAAGCCTTCATCACCAGCAGGCAAAGCCTTTCCGTCTCCGG
ATGCTTCAAAGCCTTCATCGCCAGCAGGGAAAGCCTTTCCGTCTCCGGATGCATCAAAGCCTTCATCAGCAGCAAGGAAAGCCTTTCCGTCTCCGGATGCTTCAAAGCCT
TCATCACCTGCAGCTGCAGCTCCTCGATCCCGAAATGCTTCGAAGCCCCCATCTCCATCTCAAACATCCAGTAAAAACCATCTACATTCAAAACCAACATCACAATCAAG
ATTGAAAGCTGATTCTCAACCTTCATCACCTTCACGGTCAGCATTTTCACCTCAAGCTTCTTCTATACCACGGTCCCCATCTCATGAAAATTCTCGACAACAACCATCGA
AAAAGACCTCTCGGGTTCAATCTCCATCTCATTTGTCCAGTAAACCTACTGCACAATCAACATCACAACAGCCCATTGAATCTCCTGCTGCCATTGGAGACCAAACAACA
AATAGAATTGTCTCTCATCCCGCAAATCAATCGCCACAAGCAAGACCAAAAAGCAGGGAAAGTCAGTCACAAACCAAATCAAAGCAGTCACCAAAACCAGACTTGAAACC
AGTGGAATCCATAGCATCAAAAGATCAGACCGAAACCAAGGAAGATCTCACATCTAAGAACAGTTCCTATCCCCATTTCGACCAGGACTCTTCTGAAATCCCAATAGTAA
TCGATCAAACCATTGATAATGGCCTAGAGCCCTCTCTAGAATCACAGGCAGAATCAAAAGAAAATGAGGAAATAAAGAGTGACAAGGAAGATCTGGCAAAGACAACCAAT
GCACTTCAAACCAATGCATCTAAAAGCAAATTAATCACATCTGCTGAAATCCCTTCACCGTTTGAACCAGAAAAGAGGGACTCACAACAGGAAGAAACCATGGAAGACTT
ATCAAAAGCTTTTCAGAAACTAAACATCAAATATCCAGACGAAGAAAATCCAAAGAGTTTCACAACACTCACCGGCGATAACAAAGGGGCGTCAATGCACTTACTCTCCG
GCGAAGCCAAAACAGAAAGTGCAATCCACATCCACCGTCAGTATAAGAGCGATCCAGATCAAAGCCCTAAAAGTTCCACAGACATCGAAGGAAATTTCAATCACGAAACA
CCTCAAGATTCAAGAACAGAAGAGGATCCACCCCTGGAATTATACATAAACATGAATGTACAAGCTATCAACAACTCAATCCTGTCGAATAGCTCATTTACTGAGAATAA
TCCTGGAATCAAGTTGAAATTCGGTCGAGAACCAACTAAATCTGAAGATAAATCACAGTCATTCCACGCTCGAAAGGCGAAATATACAGCGAAAGCTGCAGAGAGGCTTA
CCTATGAATCCACAGTAAGACGAAGATGCCTCGGAGGGCTGTTAATGGAGTCGAGCGATTCTGAGCTCGACAATCCAGAAAAGCCCCGACGCCATGGCTGCCGCTACAGG
ACGTAATTGTGAAGGAAAATAGGTCGAAATTCTGCAACAGAACAATGGCGAAAGTCTTGAAGCATCATCCTACAATCAAAATACATTCAACGCAGAAAAGAAATTTGAGA
GTGTGTCTGTGTGTGTGAAAGAATTTTTTTCTTACAATGTGTGTGAAAGAATTGGAAATCTGCGTGCGAAATATTCAAAATGTGTGGAGCTTTGATTGTAATAACCCAGA
TCTTCCATGAAAATCAGCTTGAAGTTTGCAAATATTTGACCAACTCTTTGATTTCATATTTTG
Protein sequence
MSYRQFRFRLPWQTLKAFARPEKESSRRSSEPKDEAETSASAADAVPYIRHLPVQSTDIKPEQPPPLEPAQAPESSETMPPSKSHKEAKVQSQPSSHSRAKNQTRTASKP
PSPSKATPQSSVASSKSPAASGKDSPSHDTSKPSSPAGKASPTHDTSKPSSPAGKVSQSHNTSKPSSPAGKASPTHDTSKPSSPAGKAFPSPDASKPSSPAGKAFPSPDA
SKPSSAARKAFPSPDASKPSSPAAAAPRSRNASKPPSPSQTSSKNHLHSKPTSQSRLKADSQPSSPSRSAFSPQASSIPRSPSHENSRQQPSKKTSRVQSPSHLSSKPTA
QSTSQQPIESPAAIGDQTTNRIVSHPANQSPQARPKSRESQSQTKSKQSPKPDLKPVESIASKDQTETKEDLTSKNSSYPHFDQDSSEIPIVIDQTIDNGLEPSLESQAE
SKENEEIKSDKEDLAKTTNALQTNASKSKLITSAEIPSPFEPEKRDSQQEETMEDLSKAFQKLNIKYPDEENPKSFTTLTGDNKGASMHLLSGEAKTESAIHIHRQYKSD
PDQSPKSSTDIEGNFNHETPQDSRTEEDPPLELYINMNVQAINNSILSNSSFTENNPGIKLKFGREPTKSEDKSQSFHARKAKYTAKAAERLTYESTVRRRCLGGLLMES
SDSELDNPEKPRRHGCRYRT
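
The CDS and protein sequences above can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal sketch, not part of the database record: the function name `translate` and the table construction are illustrative assumptions, and it assumes the standard nuclear code (NCBI translation table 1) applies to this gene. The two test fragments are the first 45 and the last 15 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order at each position. '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Hypothetical helper for illustration; whitespace and line breaks are
    ignored so the wrapped sequence can be pasted in directly.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 45 nt of the CDS and the final 15 nt (including the TAA stop).
cds_start = "ATGTCATACCGACAGTTTCGCTTTCGACTTCCTTGGCAAACTCTC"
cds_end = "CGCTACAGGACGTAA"
print(translate(cds_start))  # MSYRQFRFRLPWQTL
print(translate(cds_end))    # RYRT
```

Both fragments agree with the start (MSYRQFRFRLPWQTL) and end (RYRT) of the protein sequence listed above. Passing the full CDS to the same function would allow a complete comparison.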