; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc08G15470 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc08G15470
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Descriptioncell wall protein RBR3-like
Genome locationClcChr08:26279889..26281973
RNA-Seq ExpressionClc08G15470
SyntenyClc08G15470
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0065223.1 flocculation protein FLO11 [Cucumis melo var. makuwa]6.3e-24568.31Show/hide
Query:  MSYSQLRILLPWQSLKASPRPANDSSGRSFELTDEAETSASAADTMPNIWHPPVQSPEIKPEQPPLAPVQAPENSETMPPSKSHKASKVQSQPPSNSRAK
        MS SQLRILLPWQSLKASPRPAN+S   SF  TDE+E+SAS ADT PNI H P QSPEIKPE+PPLA  QA E SETMPPSKSHK  K+ SQ  +NSRAK
Subjt:  MSYSQLRILLPWQSLKASPRPANDSSGRSFELTDEAETSASAADTMPNIWHPPVQSPEIKPEQPPLAPVQAPENSETMPPSKSHKASKVQSQPPSNSRAK

Query:  NRSRTASKPPSPSKAIPQSSVASNKSPSTSGKGSLSQDTSKPSSPAGKA-SSSQDASSKPSSFAAVAATAPRSRIASKPLSPSSQTSSKNHPNSKPTSQS
        NRSRTASKP SP  AIPQS +ASNK PSTSGKGS SQD+SKPSSPAGK  S SQDASSKPSS A VAATAP  RIASK  S SSQ S+K HP+SKPTS+ 
Subjt:  NRSRTASKPPSPSKAIPQSSVASNKSPSTSGKGSLSQDTSKPSSPAGKA-SSSQDASSKPSSFAAVAATAPRSRIASKPLSPSSQTSSKNHPNSKPTSQS

Query:  RIKADSQPSSPSKSAFPSQGS-------------------------------------------------------------------------------
        R KADSQPSSPS+SAFPSQ                                                                                 
Subjt:  RIKADSQPSSPSKSAFPSQGS-------------------------------------------------------------------------------

Query:  -----SMPPRSPSQENSRQQPSEKTSRVQSPSHLSSKPTAQSTSQQPVKSPAAIGIQNHPNSKPSSQSRFKADSQPSSSSKSTFPSQHSSMPPRSPSQEN
             SMPPRSPS ENSRQQPS+KTS VQSPSH S KPTAQSTS+QP++SPA IGIQ+HPN KPSSQSRFKA+S+PSSSSKS FPSQ SSMPPRSPSQEN
Subjt:  -----SMPPRSPSQENSRQQPSEKTSRVQSPSHLSSKPTAQSTSQQPVKSPAAIGIQNHPNSKPSSQSRFKADSQPSSSSKSTFPSQHSSMPPRSPSQEN

Query:  SQQQPSEKTSRVQSPSHLSAKPTAQSTTQQPIESPTAIGDQTTDGIIFHPANQSPKARPTSRESQLQTKSKQSSKPNAKPVESKASKYQPETSEELT---
        S Q PSEKTSRVQSPS+LS KPTA ST+QQPIES  +IGDQTTDGI+  PA  SPKA PTS E Q+Q KSK+S +PN KPVE KASK Q +T EELT   
Subjt:  SQQQPSEKTSRVQSPSHLSAKPTAQSTTQQPIESPTAIGDQTTDGIIFHPANQSPKARPTSRESQLQTKSKQSSKPNAKPVESKASKYQPETSEELT---

Query:  -------SKNTSNPHPNQDYSENPTQSDQTIENGLDSSLESEAESKETKEDLAKTTNALQTKAARSTLITSSKSRSSFEPEKWDSQQEESMEDLSKAFQK
               SKNTSNPH ++D SENPTQSD+T+E GLDSSLES+ ESKETKED  KTTNALQ KA+RSTLITSSKSRSSFEPEK ++QQ+ESMEDLSKAF K
Subjt:  -------SKNTSNPHPNQDYSENPTQSDQTIENGLDSSLESEAESKETKEDLAKTTNALQTKAARSTLITSSKSRSSFEPEKWDSQQEESMEDLSKAFQK

Query:  LNIKYSDKENPKSFTTLIGDNKGSSMHLLSGEAKSDSPIHIHRQYKSNPDQSPKSSTDIEGNFNNGTPHDSRTEEN--PPPLELYINVNVQGINNSIMCN
        LNIKYSD+ENPKSFTT+IGDNKGSS+HLLSGEAKS+S IH++ +YKSNPDQSPKSST+I+ N NN TP DS TEEN  PPPLELYIN NVQGINNSIM N
Subjt:  LNIKYSDKENPKSFTTLIGDNKGSSMHLLSGEAKSDSPIHIHRQYKSNPDQSPKSSTDIEGNFNNGTPHDSRTEEN--PPPLELYINVNVQGINNSIMCN

Query:  TSFTENDPGIKLKFP--GEPTKSEDELESHHARKANYSAKPAEKLTYEPRVRRRCLRGMLMESSDSEAENPGKSRRHGCRYSRSSKGKEVET
        TSFTEN+PGIKLKFP  GEPT S+DELESHH RK+ Y   PAEK+TYEPR+RRR L G+LMES DSE ENP K R HGCRYSRSSKGK+VET
Subjt:  TSFTENDPGIKLKFP--GEPTKSEDELESHHARKANYSAKPAEKLTYEPRVRRRCLRGMLMESSDSEAENPGKSRRHGCRYSRSSKGKEVET

XP_011649631.1 flocculation protein FLO11 [Cucumis sativus]8.3e-23767.93Show/hide
Query:  MSYSQLRILLPWQSLKASPRPANDSSGRSFELTDEAETSASAADTMPNIWHPPVQSPEIKPEQPPLAPVQAPENSETMPPSKSHKASKVQSQPPSNSRAK
        MS SQLRILLPWQSLKAS R AN+S  RSF  TDE+E SASAADT+PNI H P QSPE KPE+PPLA  QA E SETMPPSKSHKA KV SQ PS +RAK
Subjt:  MSYSQLRILLPWQSLKASPRPANDSSGRSFELTDEAETSASAADTMPNIWHPPVQSPEIKPEQPPLAPVQAPENSETMPPSKSHKASKVQSQPPSNSRAK

Query:  NRSRTASKPPSPSKAIPQSSVASNKSPSTSGKGSLSQDTSKPSSPAGKA-SSSQDASSKPSSFAAVAATAPRSRIASKPLSPSSQTSSKNHPNSKPTSQS
        NRSR ASKP S SKAIPQ SVAS+K PSTSGKGS SQD+SKPSSPAGK  S S+DASSKPSS A VAAT P  RIASK  S SSQTS+K HPNSKPTSQ 
Subjt:  NRSRTASKPPSPSKAIPQSSVASNKSPSTSGKGSLSQDTSKPSSPAGKA-SSSQDASSKPSSFAAVAATAPRSRIASKPLSPSSQTSSKNHPNSKPTSQS

Query:  RIKADSQPSSPSKSA------------------------------------------------------------------------------------F
        ++KADSQPSS S+SA                                                                                    F
Subjt:  RIKADSQPSSPSKSA------------------------------------------------------------------------------------F

Query:  PSQGSSMPPRSPSQENSRQQPSEKTSRVQSPSHLSSKPTAQSTSQQPVKSPAAIGIQNHPNSKPS-SQSRFKADSQPSSSSKSTFPSQHSSMPPRSPSQE
        PSQ  S PPRSPS E SRQQPS KTSRVQSPSH S K TAQST+QQP +SPA IGIQ+HPN KPS SQSRFKADSQPSSSSK  FPSQ SSMPPRSPSQE
Subjt:  PSQGSSMPPRSPSQENSRQQPSEKTSRVQSPSHLSSKPTAQSTSQQPVKSPAAIGIQNHPNSKPS-SQSRFKADSQPSSSSKSTFPSQHSSMPPRSPSQE

Query:  NSQQQPSEKTSRVQSPSHLSAKPTAQSTTQQPIESPTAIGDQTTDGIIFHPANQSPKARPTSRESQLQTKSKQSSKPNAKPVESKASKYQPETSEELT--
        NS Q PSEKT RVQSPSHLS KPTAQST+QQPIE   +IGDQTTD I+  PAN SPKA PTS ESQ+Q +SK+S KPN KPVE + SK Q ET EELT  
Subjt:  NSQQQPSEKTSRVQSPSHLSAKPTAQSTTQQPIESPTAIGDQTTDGIIFHPANQSPKARPTSRESQLQTKSKQSSKPNAKPVESKASKYQPETSEELT--

Query:  --------SKNTSNPHPNQDYSENPTQSDQTIENGLDSSLESEAESKETKEDLAKTTNALQTKAARSTLITSSKSRSSFEPEKWDSQQEESMEDLSKAFQ
                SKNTSNPH  +D SENPTQSDQ IE GLDSSLES+ ESKETKED AKTTNA QTKA+RSTLITSSKSRSSFEPE  ++QQ+ESMEDLSKAF 
Subjt:  --------SKNTSNPHPNQDYSENPTQSDQTIENGLDSSLESEAESKETKEDLAKTTNALQTKAARSTLITSSKSRSSFEPEKWDSQQEESMEDLSKAFQ

Query:  KLNIKYSDKENPKSFTTLIGDNKGSSMHLLSGEAKSDSPIHIHRQYKSNPDQSPKSSTDIEGNFNNGTPHDSRTEEN--PPPLELYINVNVQGINNSIMC
        KLNIKYSD+ENPKS TT+IGDNKG+SMHLLS EAKS+S IH++  YKSNPDQSP+SSTDI+ N NN T  DS TEEN  PPPLELYIN+NVQGINNSI  
Subjt:  KLNIKYSDKENPKSFTTLIGDNKGSSMHLLSGEAKSDSPIHIHRQYKSNPDQSPKSSTDIEGNFNNGTPHDSRTEEN--PPPLELYINVNVQGINNSIMC

Query:  NTSFTENDPGIKLKFPGEPTKSEDELES-HHARKANYSAKPAEKLTYEPRVRRRCLRGMLMESSDSEAENPGKSRRHGCRYSRSSKGKEVET
        NTSFTEN+PGIKLKFPGEPT  +DELES HH RK+ Y A PAEK+TY+PR+RRRCL G+LMESSDSE ENPGK + HGCRYS SSKGKEVET
Subjt:  NTSFTENDPGIKLKFPGEPTKSEDELES-HHARKANYSAKPAEKLTYEPRVRRRCLRGMLMESSDSEAENPGKSRRHGCRYSRSSKGKEVET

XP_022951875.1 cell wall protein RBR3-like [Cucurbita moschata]6.5e-17358.79Show/hide
Query:  MSYSQLRILLPWQSLKASPRPANDSSGRSFELTDEAETSASAADTMPNIWHPPVQSPEIKPEQPPLAPVQAPENSETMPPSKSHKASKVQSQPPSNSRAK
        M+Y Q R  LPWQS+KAS R  N+SS RS E TDEAETS SAADT+P + H         PE  PL   QAPE SETM PSKSHK +KV SQP S+SRAK
Subjt:  MSYSQLRILLPWQSLKASPRPANDSSGRSFELTDEAETSASAADTMPNIWHPPVQSPEIKPEQPPLAPVQAPENSETMPPSKSHKASKVQSQPPSNSRAK

Query:  NRSRTASKPPSPSKAIPQSSVASNKSPSTSGKGSLSQDTSKPSSPAGKASSSQDASSKPSSFAAVAATAPRSRIASKPLSPSSQTSSKNHPNSKPTSQSR
         ++RTA+KPPS SK  PQSSV+SNKSP+TS K S S D SKPSS AGK S S D S        +++ A + ++     SPS  T               
Subjt:  NRSRTASKPPSPSKAIPQSSVASNKSPSTSGKGSLSQDTSKPSSPAGKASSSQDASSKPSSFAAVAATAPRSRIASKPLSPSSQTSSKNHPNSKPTSQSR

Query:  IKADSQPSSPSKSAFPSQGSSMPPRSPSQENSRQQPSEKTSRVQSPSHLSSKPTAQSTSQQPVKSPAAIGIQNHPNSKPSSQSRFKADSQPSSSSKSTFP
            S+PSSP+  AFPS+ +S P                       S  ++ P +Q  S+ P  SP+    +NHP SKP+SQSR KADSQPSS S+  F 
Subjt:  IKADSQPSSPSKSAFPSQGSSMPPRSPSQENSRQQPSEKTSRVQSPSHLSSKPTAQSTSQQPVKSPAAIGIQNHPNSKPSSQSRFKADSQPSSSSKSTFP

Query:  SQHSSMPPRSPSQENSQQQPSEKTSRVQSPSHLSAKPTAQSTTQQPIESPTAIGDQTTDGIIFHPANQSPKARPTSRESQLQTKSKQSSKPNAKPVESKA
         Q SS+ PRSPS ENS+QQPS+K SRVQSPSHLS+KPTAQST+QQ  ESP  IGDQTT  ++ HPA+QSP+AR   RE+Q+QTKSKQS KP+ KPVE KA
Subjt:  SQHSSMPPRSPSQENSQQQPSEKTSRVQSPSHLSAKPTAQSTTQQPIESPTAIGDQTTDGIIFHPANQSPKARPTSRESQLQTKSKQSSKPNAKPVESKA

Query:  SKYQPETSEELTSKNTSNPHPNQDYSENPTQSDQTIENGLDSSLESEAESKETK------EDLAKTTNALQTKAARSTLITSSKSRSSFEPEKWDSQQEE
        SK+QPET EE  SKNTS PH +QD+SE P   D+TIENG ++SLES+ ES+E+K      EDL KTTNALQ  A++S LITS++  S FEPE  DSQQE 
Subjt:  SKYQPETSEELTSKNTSNPHPNQDYSENPTQSDQTIENGLDSSLESEAESKETK------EDLAKTTNALQTKAARSTLITSSKSRSSFEPEKWDSQQEE

Query:  SMEDLSKAFQKLNIKYSDKENPKSFTTLIGDNKGSSMHLLSGEAKSDSPIHIHRQYKSNPDQSPKSSTDIEGNFNNGTPHDSRTEENPPPLELYINVNVQ
        +MEDLSKAFQ LNIKY + ENPKSFTTL GDNKG+SMHLLSGEA  +S IHIHRQYKS+PD+ P+SSTDIEGN N  TP DS+TEE+ PPLELYIN+NVQ
Subjt:  SMEDLSKAFQKLNIKYSDKENPKSFTTLIGDNKGSSMHLLSGEAKSDSPIHIHRQYKSNPDQSPKSSTDIEGNFNNGTPHDSRTEENPPPLELYINVNVQ

Query:  GINNSIMCNTSFTENDPGIKLKFPGEPTKSEDELESHHARKANYSAKPAEKLTYEPRVRRRCLRGMLMESSDSEAENPGKSRRHGCRYSRSSKG
        GINNS++ N+SFTEN+PGIKLKF  + TKSED+  S  A+KA Y+AK  E  TYEP VRRRCL G+LMESSDS+ +N  K RRHGCRY  S +G
Subjt:  GINNSIMCNTSFTENDPGIKLKFPGEPTKSEDELESHHARKANYSAKPAEKLTYEPRVRRRCLRGMLMESSDSEAENPGKSRRHGCRYSRSSKG

XP_023002262.1 cell wall protein RBR3-like [Cucurbita maxima]1.1e-17258.56Show/hide
Query:  MSYSQLRILLPWQSLKASPRPANDSSGRSFELTDEAETSASAADTMPNIWHPPVQSPEIKPEQPPLAPVQAPENSETMPPSKSHKASKVQSQPPSNSRAK
        M+Y Q R  LPWQS+KAS RP N+SS RS E TDEAETS SAADT+P + H P+QS E KPE  PL   QAPE SETM PSKSHK +KV SQP S+SRAK
Subjt:  MSYSQLRILLPWQSLKASPRPANDSSGRSFELTDEAETSASAADTMPNIWHPPVQSPEIKPEQPPLAPVQAPENSETMPPSKSHKASKVQSQPPSNSRAK

Query:  NRSRTASKPPSPSKAIPQSSVASNKSPSTSGKGSLSQDTSKPSSPAGKASSSQDASSKPSSFAAVAATAPRSRIASKPLSPSSQTSSKNHPNSKPTSQSR
         ++RTA+KPPS SK  PQSSV+SNKSP+TS K S S D SKPSS AGK S S D S        +++ A + ++     SPS  T               
Subjt:  NRSRTASKPPSPSKAIPQSSVASNKSPSTSGKGSLSQDTSKPSSPAGKASSSQDASSKPSSFAAVAATAPRSRIASKPLSPSSQTSSKNHPNSKPTSQSR

Query:  IKADSQPSSPSKSAFPSQGSSMPPRSPSQENSRQQPSEKTSRVQSPSHLSSKPTAQSTSQQPVKSPAAIGIQNHPNSKPSSQSRFKADSQPSSSSKSTFP
            S PS P+  AFPS+ +S P                 +     SH+ SKP           SP+    +NH +SK +SQSR KADSQPSS S+  F 
Subjt:  IKADSQPSSPSKSAFPSQGSSMPPRSPSQENSRQQPSEKTSRVQSPSHLSSKPTAQSTSQQPVKSPAAIGIQNHPNSKPSSQSRFKADSQPSSSSKSTFP

Query:  SQHSSMPPRSPSQENSQQQPSEKTSRVQSPSHLSAKPTAQSTTQQPIESPTAIGDQTTDGIIFHPANQSPKARPTSRESQLQTKSKQSSKPNAKPVESKA
         Q SS+ PRSPS ENS+QQPS+K SRVQSPSHLS+K TAQST+QQ  ESP  IGDQTT  ++ HPA+QSP+AR  S+E+Q+QTKSKQS KP+ KPVE KA
Subjt:  SQHSSMPPRSPSQENSQQQPSEKTSRVQSPSHLSAKPTAQSTTQQPIESPTAIGDQTTDGIIFHPANQSPKARPTSRESQLQTKSKQSSKPNAKPVESKA

Query:  SKYQPETSEELTSKNTSNPHPNQDYSENPTQSDQTIENGLDSSLESEAESKETK------EDLAKTTNALQTKAARSTLITSSKSRSSFEPEKWDSQQEE
        SK+QPET EE  SKNTS P  N+D+SE P   D+TIENG + SLES+ ES+E+K      EDL KTTNALQ  A++S LITS++  S FEPE  DSQQE 
Subjt:  SKYQPETSEELTSKNTSNPHPNQDYSENPTQSDQTIENGLDSSLESEAESKETK------EDLAKTTNALQTKAARSTLITSSKSRSSFEPEKWDSQQEE

Query:  SMEDLSKAFQKLNIKYSDKENPKSFTTLIGDNKGSSMHLLSGEAKSDSPIHIHRQYKSNPDQSPKSSTDIEGNFNNGTPHDSRTEENPPPLELYINVNVQ
        +MEDL KAFQ LNIKY + ENPKSFTTL GDNKG+SMHL+SGEA  +S IHIHRQYKS+PD+ P+SSTDIEGN N  TP DS+TEE+ PPLELYIN+NVQ
Subjt:  SMEDLSKAFQKLNIKYSDKENPKSFTTLIGDNKGSSMHLLSGEAKSDSPIHIHRQYKSNPDQSPKSSTDIEGNFNNGTPHDSRTEENPPPLELYINVNVQ

Query:  GINNSIMCNTSFTENDPGIKLKFPGEPTKSEDELESHHARKANYSAKPAEKLTYEPRVRRRCLRGMLMESSDSEAENPGKSRRHGCRYSRSSKGK
        GINNS++ N+SFTEN+PGIKLKF  + TKSE++  S  A+KA Y+AK  E  TYEP VRRRCL G+LMESSDS+ +N  K RRHGCRY  S +GK
Subjt:  GINNSIMCNTSFTENDPGIKLKFPGEPTKSEDELESHHARKANYSAKPAEKLTYEPRVRRRCLRGMLMESSDSEAENPGKSRRHGCRYSRSSKGK

XP_038886773.1 flocculation protein FLO11 [Benincasa hispida]3.1e-27682.26Show/hide
Query:  MSYSQLRILLPWQSLKASPRPANDSSGRSFELTDEAETSASAADTMPNIWHPPVQSPEIKPEQPPLAPVQAPENSETMPPSKSHKASKVQSQPPSNSRAK
        MS SQLRILLPWQSLKASP P N+S GRSFE TDE ETSASAADT  NI H P QSPEIKPEQPPLA   APE SETMPPSKSHKA KV SQPP NSRAK
Subjt:  MSYSQLRILLPWQSLKASPRPANDSSGRSFELTDEAETSASAADTMPNIWHPPVQSPEIKPEQPPLAPVQAPENSETMPPSKSHKASKVQSQPPSNSRAK

Query:  NRSRTASKPPSPSKAIPQSSVASNKSPSTSGKGSLSQDTSKPSSPAGKASSSQDASSKPSSFAAVAATAPRSRIASKP------LSPSSQTSSKNHPNSK
        NRSRTASKP  PSKAIPQSSVASNKSPSTSGK SLSQDTSKPSSPAGK+S SQDASSKPSS AAVAATAPRSRI SKP       SPSSQTSSKNHP  K
Subjt:  NRSRTASKPPSPSKAIPQSSVASNKSPSTSGKGSLSQDTSKPSSPAGKASSSQDASSKPSSFAAVAATAPRSRIASKP------LSPSSQTSSKNHPNSK

Query:  PTSQSRIKADSQPSSPSKSAFPSQGSSMPPRSPSQENSRQQPSEKTSRVQSPSHLSSKPTAQSTSQQPVKSPAAIGIQNHPNSKPSSQSRFKADSQPSSS
        P+SQSR KADSQPSS S+SAFPSQ SS+PPR PS ENSR QPSE+TSRVQSPSH SSKPTAQSTSQQP +SPA IGIQ+HPNSKPSSQSRFKADSQPSSS
Subjt:  PTSQSRIKADSQPSSPSKSAFPSQGSSMPPRSPSQENSRQQPSEKTSRVQSPSHLSSKPTAQSTSQQPVKSPAAIGIQNHPNSKPSSQSRFKADSQPSSS

Query:  SKSTFPSQHSSMPPRSPSQENSQQQPSEKTSRVQSPSHLSAKPTAQSTTQQPIESPTAIGDQTTDGIIFHPANQSPKARPTSRESQLQTKSKQSSKPNAK
        S+S F SQ SSM P SPS+ENS+QQP EKTSRVQSPSHLS+KP AQST+QQPIESP AIG+QTT+  I HP NQSPKARPTSRESQ+QTKSKQS KPN K
Subjt:  SKSTFPSQHSSMPPRSPSQENSQQQPSEKTSRVQSPSHLSAKPTAQSTTQQPIESPTAIGDQTTDGIIFHPANQSPKARPTSRESQLQTKSKQSSKPNAK

Query:  PVESKASKYQPETSEELTSKNTSNPHPNQDYSENPTQSDQTIENGLDSSLESEAESKETKEDLAKTTNALQTKAARSTLITSSKSRSSFEPEKWDSQQEE
         VE KASK + ET EEL+SKNTSNPH NQD  ENPT+SDQTIEN LD SLES+AES+ET+E+LAKTTNALQTKA+RSTLITSSK   SFEPE    QQEE
Subjt:  PVESKASKYQPETSEELTSKNTSNPHPNQDYSENPTQSDQTIENGLDSSLESEAESKETKEDLAKTTNALQTKAARSTLITSSKSRSSFEPEKWDSQQEE

Query:  SMEDLSKAFQKLNIKYSDKENPKSFTTLIGDNKGSSMHLLSGEAKSDSPIHIHRQYKSNPDQSPKSSTDIEGNFNNGTPHDSRTEENPPPLELYINVNVQ
        SM+D SKAFQKLNIKYSD+ENPKSFTTLIG NKGSSMHL+SGEAKS+S IHIHRQYKSNPDQSPK ST+IEGNF N T  DSRTEENPP +E+YIN+NVQ
Subjt:  SMEDLSKAFQKLNIKYSDKENPKSFTTLIGDNKGSSMHLLSGEAKSDSPIHIHRQYKSNPDQSPKSSTDIEGNFNNGTPHDSRTEENPPPLELYINVNVQ

Query:  GINNSIMCNTSFTENDPGIKLKFPGEPTKSEDELESHHARKANYSAKPAEKLTYEPRVRRRCLRGMLMESSDSEAENPGKSRRHGCRYSRSSKGKEVET
        GINNSIMCNTSFTENDPGIKLK   E  KSEDELESHHARKA YSAKPAEK+TYEPRVRRRCLRGMLMESSDSE ENPGKSRRHGCRY  SSKGKEVET
Subjt:  GINNSIMCNTSFTENDPGIKLKFPGEPTKSEDELESHHARKANYSAKPAEKLTYEPRVRRRCLRGMLMESSDSEAENPGKSRRHGCRYSRSSKGKEVET

TrEMBL top hitse value%identityAlignment
A0A0A0LLH1 Uncharacterized protein4.7e-14575.31Show/hide
Query:  MPPRSPSQENSQQQPSEKTSRVQSPSHLSAKPTAQSTTQQPIESPTAIGDQTTDGIIFHPANQSPKARPTSRESQLQTKSKQSSKPNAKPVESKASKYQP
        MPPRSPSQENS Q PSEKT RVQSPSHLS KPTAQST+QQPIE   +IGDQTTD I+  PAN SPKA PTS ESQ+Q +SK+S KPN KPVE + SK Q 
Subjt:  MPPRSPSQENSQQQPSEKTSRVQSPSHLSAKPTAQSTTQQPIESPTAIGDQTTDGIIFHPANQSPKARPTSRESQLQTKSKQSSKPNAKPVESKASKYQP

Query:  ETSEELT----------SKNTSNPHPNQDYSENPTQSDQTIENGLDSSLESEAESKETKEDLAKTTNALQTKAARSTLITSSKSRSSFEPEKWDSQQEES
        ET EELT          SKNTSNPH  +D SENPTQSDQ IE GLDSSLES+ ESKETKED AKTTNA QTKA+RSTLITSSKSRSSFEPE  ++QQ+ES
Subjt:  ETSEELT----------SKNTSNPHPNQDYSENPTQSDQTIENGLDSSLESEAESKETKEDLAKTTNALQTKAARSTLITSSKSRSSFEPEKWDSQQEES

Query:  MEDLSKAFQKLNIKYSDKENPKSFTTLIGDNKGSSMHLLSGEAKSDSPIHIHRQYKSNPDQSPKSSTDIEGNFNNGTPHDSRTEEN--PPPLELYINVNV
        MEDLSKAF KLNIKYSD+ENPKS TT+IGDNKG+SMHLLS EAKS+S IH++  YKSNPDQSP+SSTDI+ N NN T  DS TEEN  PPPLELYIN+NV
Subjt:  MEDLSKAFQKLNIKYSDKENPKSFTTLIGDNKGSSMHLLSGEAKSDSPIHIHRQYKSNPDQSPKSSTDIEGNFNNGTPHDSRTEEN--PPPLELYINVNV

Query:  QGINNSIMCNTSFTENDPGIKLKFPGEPTKSEDELES-HHARKANYSAKPAEKLTYEPRVRRRCLRGMLMESSDSEAENPGKSRRHGCRYSRSSKGKEVE
        QGINNSI  NTSFTEN+PGIKLKFPGEPT  +DELES HH RK+ Y A PAEK+TY+PR+RRRCL G+LMESSDSE ENPGK + HGCRYS SSKGKEVE
Subjt:  QGINNSIMCNTSFTENDPGIKLKFPGEPTKSEDELES-HHARKANYSAKPAEKLTYEPRVRRRCLRGMLMESSDSEAENPGKSRRHGCRYSRSSKGKEVE

Query:  T
        T
Subjt:  T

A0A1S4DVD0 micronuclear linker histone polyprotein2.5e-14675.62Show/hide
Query:  MPPRSPSQENSQQQPSEKTSRVQSPSHLSAKPTAQSTTQQPIESPTAIGDQTTDGIIFHPANQSPKARPTSRESQLQTKSKQSSKPNAKPVESKASKYQP
        MPPRSPSQENS Q PSEKTSRVQSPS+LS KPTA ST+QQPIES  +IGDQTTDGI+  PA  SPKA PTS E Q+Q KSK+S +PN KPVE KASK Q 
Subjt:  MPPRSPSQENSQQQPSEKTSRVQSPSHLSAKPTAQSTTQQPIESPTAIGDQTTDGIIFHPANQSPKARPTSRESQLQTKSKQSSKPNAKPVESKASKYQP

Query:  ETSEELT----------SKNTSNPHPNQDYSENPTQSDQTIENGLDSSLESEAESKETKEDLAKTTNALQTKAARSTLITSSKSRSSFEPEKWDSQQEES
        +T EELT          SKNTSNPH ++D SENPTQSD+T+E GLDSSLES+ ESKETKED  KTTNALQ KA+RSTLITSSKSRSSFEPEK ++QQ+ES
Subjt:  ETSEELT----------SKNTSNPHPNQDYSENPTQSDQTIENGLDSSLESEAESKETKEDLAKTTNALQTKAARSTLITSSKSRSSFEPEKWDSQQEES

Query:  MEDLSKAFQKLNIKYSDKENPKSFTTLIGDNKGSSMHLLSGEAKSDSPIHIHRQYKSNPDQSPKSSTDIEGNFNNGTPHDSRTEEN--PPPLELYINVNV
        MEDLSKAF KLNIKYSD+ENPKSFTT+IGDNKGSS+HLLSGEAKS+S IH++ +YKSNPDQSPKSST+I+ N NN TP DS TEEN  PPPLELYIN NV
Subjt:  MEDLSKAFQKLNIKYSDKENPKSFTTLIGDNKGSSMHLLSGEAKSDSPIHIHRQYKSNPDQSPKSSTDIEGNFNNGTPHDSRTEEN--PPPLELYINVNV

Query:  QGINNSIMCNTSFTENDPGIKLKFP--GEPTKSEDELESHHARKANYSAKPAEKLTYEPRVRRRCLRGMLMESSDSEAENPGKSRRHGCRYSRSSKGKEV
        QGINNSIM NTSFTEN+PGIKLKFP  GEPT S+DELESHH RK+ Y   PAEK+TYEPR+RRR L G+LMES DSE ENP K R HGCRYSRSSKGK+V
Subjt:  QGINNSIMCNTSFTENDPGIKLKFP--GEPTKSEDELESHHARKANYSAKPAEKLTYEPRVRRRCLRGMLMESSDSEAENPGKSRRHGCRYSRSSKGKEV

Query:  ET
        ET
Subjt:  ET

A0A5A7VAN0 Flocculation protein FLO113.1e-24568.31Show/hide
Query:  MSYSQLRILLPWQSLKASPRPANDSSGRSFELTDEAETSASAADTMPNIWHPPVQSPEIKPEQPPLAPVQAPENSETMPPSKSHKASKVQSQPPSNSRAK
        MS SQLRILLPWQSLKASPRPAN+S   SF  TDE+E+SAS ADT PNI H P QSPEIKPE+PPLA  QA E SETMPPSKSHK  K+ SQ  +NSRAK
Subjt:  MSYSQLRILLPWQSLKASPRPANDSSGRSFELTDEAETSASAADTMPNIWHPPVQSPEIKPEQPPLAPVQAPENSETMPPSKSHKASKVQSQPPSNSRAK

Query:  NRSRTASKPPSPSKAIPQSSVASNKSPSTSGKGSLSQDTSKPSSPAGKA-SSSQDASSKPSSFAAVAATAPRSRIASKPLSPSSQTSSKNHPNSKPTSQS
        NRSRTASKP SP  AIPQS +ASNK PSTSGKGS SQD+SKPSSPAGK  S SQDASSKPSS A VAATAP  RIASK  S SSQ S+K HP+SKPTS+ 
Subjt:  NRSRTASKPPSPSKAIPQSSVASNKSPSTSGKGSLSQDTSKPSSPAGKA-SSSQDASSKPSSFAAVAATAPRSRIASKPLSPSSQTSSKNHPNSKPTSQS

Query:  RIKADSQPSSPSKSAFPSQGS-------------------------------------------------------------------------------
        R KADSQPSSPS+SAFPSQ                                                                                 
Subjt:  RIKADSQPSSPSKSAFPSQGS-------------------------------------------------------------------------------

Query:  -----SMPPRSPSQENSRQQPSEKTSRVQSPSHLSSKPTAQSTSQQPVKSPAAIGIQNHPNSKPSSQSRFKADSQPSSSSKSTFPSQHSSMPPRSPSQEN
             SMPPRSPS ENSRQQPS+KTS VQSPSH S KPTAQSTS+QP++SPA IGIQ+HPN KPSSQSRFKA+S+PSSSSKS FPSQ SSMPPRSPSQEN
Subjt:  -----SMPPRSPSQENSRQQPSEKTSRVQSPSHLSSKPTAQSTSQQPVKSPAAIGIQNHPNSKPSSQSRFKADSQPSSSSKSTFPSQHSSMPPRSPSQEN

Query:  SQQQPSEKTSRVQSPSHLSAKPTAQSTTQQPIESPTAIGDQTTDGIIFHPANQSPKARPTSRESQLQTKSKQSSKPNAKPVESKASKYQPETSEELT---
        S Q PSEKTSRVQSPS+LS KPTA ST+QQPIES  +IGDQTTDGI+  PA  SPKA PTS E Q+Q KSK+S +PN KPVE KASK Q +T EELT   
Subjt:  SQQQPSEKTSRVQSPSHLSAKPTAQSTTQQPIESPTAIGDQTTDGIIFHPANQSPKARPTSRESQLQTKSKQSSKPNAKPVESKASKYQPETSEELT---

Query:  -------SKNTSNPHPNQDYSENPTQSDQTIENGLDSSLESEAESKETKEDLAKTTNALQTKAARSTLITSSKSRSSFEPEKWDSQQEESMEDLSKAFQK
               SKNTSNPH ++D SENPTQSD+T+E GLDSSLES+ ESKETKED  KTTNALQ KA+RSTLITSSKSRSSFEPEK ++QQ+ESMEDLSKAF K
Subjt:  -------SKNTSNPHPNQDYSENPTQSDQTIENGLDSSLESEAESKETKEDLAKTTNALQTKAARSTLITSSKSRSSFEPEKWDSQQEESMEDLSKAFQK

Query:  LNIKYSDKENPKSFTTLIGDNKGSSMHLLSGEAKSDSPIHIHRQYKSNPDQSPKSSTDIEGNFNNGTPHDSRTEEN--PPPLELYINVNVQGINNSIMCN
        LNIKYSD+ENPKSFTT+IGDNKGSS+HLLSGEAKS+S IH++ +YKSNPDQSPKSST+I+ N NN TP DS TEEN  PPPLELYIN NVQGINNSIM N
Subjt:  LNIKYSDKENPKSFTTLIGDNKGSSMHLLSGEAKSDSPIHIHRQYKSNPDQSPKSSTDIEGNFNNGTPHDSRTEEN--PPPLELYINVNVQGINNSIMCN

Query:  TSFTENDPGIKLKFP--GEPTKSEDELESHHARKANYSAKPAEKLTYEPRVRRRCLRGMLMESSDSEAENPGKSRRHGCRYSRSSKGKEVET
        TSFTEN+PGIKLKFP  GEPT S+DELESHH RK+ Y   PAEK+TYEPR+RRR L G+LMES DSE ENP K R HGCRYSRSSKGK+VET
Subjt:  TSFTENDPGIKLKFP--GEPTKSEDELESHHARKANYSAKPAEKLTYEPRVRRRCLRGMLMESSDSEAENPGKSRRHGCRYSRSSKGKEVET

A0A6J1GK50 cell wall protein RBR3-like3.1e-17358.79Show/hide
Query:  MSYSQLRILLPWQSLKASPRPANDSSGRSFELTDEAETSASAADTMPNIWHPPVQSPEIKPEQPPLAPVQAPENSETMPPSKSHKASKVQSQPPSNSRAK
        M+Y Q R  LPWQS+KAS R  N+SS RS E TDEAETS SAADT+P + H         PE  PL   QAPE SETM PSKSHK +KV SQP S+SRAK
Subjt:  MSYSQLRILLPWQSLKASPRPANDSSGRSFELTDEAETSASAADTMPNIWHPPVQSPEIKPEQPPLAPVQAPENSETMPPSKSHKASKVQSQPPSNSRAK

Query:  NRSRTASKPPSPSKAIPQSSVASNKSPSTSGKGSLSQDTSKPSSPAGKASSSQDASSKPSSFAAVAATAPRSRIASKPLSPSSQTSSKNHPNSKPTSQSR
         ++RTA+KPPS SK  PQSSV+SNKSP+TS K S S D SKPSS AGK S S D S        +++ A + ++     SPS  T               
Subjt:  NRSRTASKPPSPSKAIPQSSVASNKSPSTSGKGSLSQDTSKPSSPAGKASSSQDASSKPSSFAAVAATAPRSRIASKPLSPSSQTSSKNHPNSKPTSQSR

Query:  IKADSQPSSPSKSAFPSQGSSMPPRSPSQENSRQQPSEKTSRVQSPSHLSSKPTAQSTSQQPVKSPAAIGIQNHPNSKPSSQSRFKADSQPSSSSKSTFP
            S+PSSP+  AFPS+ +S P                       S  ++ P +Q  S+ P  SP+    +NHP SKP+SQSR KADSQPSS S+  F 
Subjt:  IKADSQPSSPSKSAFPSQGSSMPPRSPSQENSRQQPSEKTSRVQSPSHLSSKPTAQSTSQQPVKSPAAIGIQNHPNSKPSSQSRFKADSQPSSSSKSTFP

Query:  SQHSSMPPRSPSQENSQQQPSEKTSRVQSPSHLSAKPTAQSTTQQPIESPTAIGDQTTDGIIFHPANQSPKARPTSRESQLQTKSKQSSKPNAKPVESKA
         Q SS+ PRSPS ENS+QQPS+K SRVQSPSHLS+KPTAQST+QQ  ESP  IGDQTT  ++ HPA+QSP+AR   RE+Q+QTKSKQS KP+ KPVE KA
Subjt:  SQHSSMPPRSPSQENSQQQPSEKTSRVQSPSHLSAKPTAQSTTQQPIESPTAIGDQTTDGIIFHPANQSPKARPTSRESQLQTKSKQSSKPNAKPVESKA

Query:  SKYQPETSEELTSKNTSNPHPNQDYSENPTQSDQTIENGLDSSLESEAESKETK------EDLAKTTNALQTKAARSTLITSSKSRSSFEPEKWDSQQEE
        SK+QPET EE  SKNTS PH +QD+SE P   D+TIENG ++SLES+ ES+E+K      EDL KTTNALQ  A++S LITS++  S FEPE  DSQQE 
Subjt:  SKYQPETSEELTSKNTSNPHPNQDYSENPTQSDQTIENGLDSSLESEAESKETK------EDLAKTTNALQTKAARSTLITSSKSRSSFEPEKWDSQQEE

Query:  SMEDLSKAFQKLNIKYSDKENPKSFTTLIGDNKGSSMHLLSGEAKSDSPIHIHRQYKSNPDQSPKSSTDIEGNFNNGTPHDSRTEENPPPLELYINVNVQ
        +MEDLSKAFQ LNIKY + ENPKSFTTL GDNKG+SMHLLSGEA  +S IHIHRQYKS+PD+ P+SSTDIEGN N  TP DS+TEE+ PPLELYIN+NVQ
Subjt:  SMEDLSKAFQKLNIKYSDKENPKSFTTLIGDNKGSSMHLLSGEAKSDSPIHIHRQYKSNPDQSPKSSTDIEGNFNNGTPHDSRTEENPPPLELYINVNVQ

Query:  GINNSIMCNTSFTENDPGIKLKFPGEPTKSEDELESHHARKANYSAKPAEKLTYEPRVRRRCLRGMLMESSDSEAENPGKSRRHGCRYSRSSKG
        GINNS++ N+SFTEN+PGIKLKF  + TKSED+  S  A+KA Y+AK  E  TYEP VRRRCL G+LMESSDS+ +N  K RRHGCRY  S +G
Subjt:  GINNSIMCNTSFTENDPGIKLKFPGEPTKSEDELESHHARKANYSAKPAEKLTYEPRVRRRCLRGMLMESSDSEAENPGKSRRHGCRYSRSSKG

A0A6J1KJ10 cell wall protein RBR3-like5.3e-17358.56Show/hide
Query:  MSYSQLRILLPWQSLKASPRPANDSSGRSFELTDEAETSASAADTMPNIWHPPVQSPEIKPEQPPLAPVQAPENSETMPPSKSHKASKVQSQPPSNSRAK
        M+Y Q R  LPWQS+KAS RP N+SS RS E TDEAETS SAADT+P + H P+QS E KPE  PL   QAPE SETM PSKSHK +KV SQP S+SRAK
Subjt:  MSYSQLRILLPWQSLKASPRPANDSSGRSFELTDEAETSASAADTMPNIWHPPVQSPEIKPEQPPLAPVQAPENSETMPPSKSHKASKVQSQPPSNSRAK

Query:  NRSRTASKPPSPSKAIPQSSVASNKSPSTSGKGSLSQDTSKPSSPAGKASSSQDASSKPSSFAAVAATAPRSRIASKPLSPSSQTSSKNHPNSKPTSQSR
         ++RTA+KPPS SK  PQSSV+SNKSP+TS K S S D SKPSS AGK S S D S        +++ A + ++     SPS  T               
Subjt:  NRSRTASKPPSPSKAIPQSSVASNKSPSTSGKGSLSQDTSKPSSPAGKASSSQDASSKPSSFAAVAATAPRSRIASKPLSPSSQTSSKNHPNSKPTSQSR

Query:  IKADSQPSSPSKSAFPSQGSSMPPRSPSQENSRQQPSEKTSRVQSPSHLSSKPTAQSTSQQPVKSPAAIGIQNHPNSKPSSQSRFKADSQPSSSSKSTFP
            S PS P+  AFPS+ +S P                 +     SH+ SKP           SP+    +NH +SK +SQSR KADSQPSS S+  F 
Subjt:  IKADSQPSSPSKSAFPSQGSSMPPRSPSQENSRQQPSEKTSRVQSPSHLSSKPTAQSTSQQPVKSPAAIGIQNHPNSKPSSQSRFKADSQPSSSSKSTFP

Query:  SQHSSMPPRSPSQENSQQQPSEKTSRVQSPSHLSAKPTAQSTTQQPIESPTAIGDQTTDGIIFHPANQSPKARPTSRESQLQTKSKQSSKPNAKPVESKA
         Q SS+ PRSPS ENS+QQPS+K SRVQSPSHLS+K TAQST+QQ  ESP  IGDQTT  ++ HPA+QSP+AR  S+E+Q+QTKSKQS KP+ KPVE KA
Subjt:  SQHSSMPPRSPSQENSQQQPSEKTSRVQSPSHLSAKPTAQSTTQQPIESPTAIGDQTTDGIIFHPANQSPKARPTSRESQLQTKSKQSSKPNAKPVESKA

Query:  SKYQPETSEELTSKNTSNPHPNQDYSENPTQSDQTIENGLDSSLESEAESKETK------EDLAKTTNALQTKAARSTLITSSKSRSSFEPEKWDSQQEE
        SK+QPET EE  SKNTS P  N+D+SE P   D+TIENG + SLES+ ES+E+K      EDL KTTNALQ  A++S LITS++  S FEPE  DSQQE 
Subjt:  SKYQPETSEELTSKNTSNPHPNQDYSENPTQSDQTIENGLDSSLESEAESKETK------EDLAKTTNALQTKAARSTLITSSKSRSSFEPEKWDSQQEE

Query:  SMEDLSKAFQKLNIKYSDKENPKSFTTLIGDNKGSSMHLLSGEAKSDSPIHIHRQYKSNPDQSPKSSTDIEGNFNNGTPHDSRTEENPPPLELYINVNVQ
        +MEDL KAFQ LNIKY + ENPKSFTTL GDNKG+SMHL+SGEA  +S IHIHRQYKS+PD+ P+SSTDIEGN N  TP DS+TEE+ PPLELYIN+NVQ
Subjt:  SMEDLSKAFQKLNIKYSDKENPKSFTTLIGDNKGSSMHLLSGEAKSDSPIHIHRQYKSNPDQSPKSSTDIEGNFNNGTPHDSRTEENPPPLELYINVNVQ

Query:  GINNSIMCNTSFTENDPGIKLKFPGEPTKSEDELESHHARKANYSAKPAEKLTYEPRVRRRCLRGMLMESSDSEAENPGKSRRHGCRYSRSSKGK
        GINNS++ N+SFTEN+PGIKLKF  + TKSE++  S  A+KA Y+AK  E  TYEP VRRRCL G+LMESSDS+ +N  K RRHGCRY  S +GK
Subjt:  GINNSIMCNTSFTENDPGIKLKFPGEPTKSEDELESHHARKANYSAKPAEKLTYEPRVRRRCLRGMLMESSDSEAENPGKSRRHGCRYSRSSKGK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G75260.1 oxidoreductases, acting on NADH or NADPH1.8e-1629.08Show/hide
Query:  SPSSQTSSKNHPNSKPTSQSRIKADSQPSSPSKSAFPSQGSSMPPRSPSQENSRQQP---SEKTSRVQSPSHLSSKPT--AQSTSQQ---PVKSPAAIGI
        SPS  +S  + P+  PT  SR      P  P+  A PS+  + P  SPS   SR      +  +S  Q PS  ++ PT  A+ T+QQ   P K   ++ +
Subjt:  SPSSQTSSKNHPNSKPTSQSRIKADSQPSSPSKSAFPSQGSSMPPRSPSQENSRQQP---SEKTSRVQSPSHLSSKPT--AQSTSQQ---PVKSPAAIGI

Query:  QNH---PNSKPSSQSRFKADSQPSSSSK----STFPSQHSSMPPRSPSQENSQQQPSEKTSRVQSPSHL---SAKPTAQSTTQQPIESPTAIGDQTTDGI
        +        KP  ++   A+   S   +       P +H         QE  +   + +    ++   L   S K +A +  QQ IE    I  Q    +
Subjt:  QNH---PNSKPSSQSRFKADSQPSSSSK----STFPSQHSSMPPRSPSQENSQQQPSEKTSRVQSPSHL---SAKPTAQSTTQQPIESPTAIGDQTTDGI

Query:  IFHPANQSPKARPTSRESQLQTKSKQSSKPNAKPVESKASKYQPETSEELTSKNTSNPHPNQDYSENPTQSDQTIENGLDSSLESEAESKETKEDLAKTT
        +     Q  +A       Q Q KSK++ K   +  E+K S       +   SK T +     + +  P              L  +    + + ++    
Subjt:  IFHPANQSPKARPTSRESQLQTKSKQSSKPNAKPVESKASKYQPETSEELTSKNTSNPHPNQDYSENPTQSDQTIENGLDSSLESEAESKETKEDLAKTT

Query:  NALQTKAARSTLITSSKSRSSFEPEKWDSQQEESMEDLSKAFQKLNI-KYSDKENPKSFTTLIGDNKGSSMHLLSGEAKSDSPIHIHRQYKSNPDQSPKS
        N  +TK A ++ + + +  +    E   S   +  ED+     KL   K +  +   S  TL G+NKG++M + S + K D  +HI R Y+SNPD+S  +
Subjt:  NALQTKAARSTLITSSKSRSSFEPEKWDSQQEESMEDLSKAFQKLNI-KYSDKENPKSFTTLIGDNKGSSMHLLSGEAKSDSPIHIHRQYKSNPDQSPKS

Query:  STDIEGNFNNGTPHDSRTEENPPPLELYINVNVQGINNSIMCNTSFTENDPGIKLKFPGEPTKSEDELESHHA-RKANYSAKPAEKLTYEPRVRRRCLRG
        +     N     P D   EE       YIN N QGINNSI+  +S +ENDPG+ + F  E  K E      +   K   +    +KL  EPRVRRRCLRG
Subjt:  STDIEGNFNNGTPHDSRTEENPPPLELYINVNVQGINNSIMCNTSFTENDPGIKLKFPGEPTKSEDELESHHA-RKANYSAKPAEKLTYEPRVRRRCLRG

Query:  MLMESSDSEAENPGKSRRHGCRYSRSSKGKEVE
        +L ESS+SE +NP K RRHGCR+  + K K++E
Subjt:  MLMESSDSEAENPGKSRRHGCRYSRSSKGKEVE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCATACTCACAATTGCGCATTCTACTTCCTTGGCAATCGTTAAAAGCTTCTCCTCGCCCTGCAAATGATTCGTCAGGACGGAGTTTTGAGCTTACAGATGAAGCGGA
AACTTCTGCCTCTGCAGCCGATACCATGCCAAATATTTGGCATCCACCAGTCCAATCTCCTGAAATAAAACCAGAACAGCCTCCTTTAGCACCAGTTCAGGCACCAGAAA
ATAGTGAAACTATGCCACCTTCAAAATCTCACAAGGCAAGCAAAGTTCAATCTCAGCCACCATCAAATTCCCGAGCCAAAAACCGGTCACGAACAGCTTCCAAGCCTCCA
TCACCATCGAAAGCAATCCCTCAATCTTCAGTTGCTTCCAACAAGTCTCCTTCAACATCAGGCAAAGGCTCCCTATCTCAGGATACTTCAAAGCCATCATCACCCGCAGG
CAAAGCCTCTTCGTCTCAGGATGCTTCTTCAAAGCCTTCATCCTTTGCAGCAGTAGCAGCTACAGCTCCTCGAAGCCGGATTGCTTCAAAGCCATTGTCTCCATCATCTC
AAACATCCAGTAAAAACCATCCAAATTCAAAACCAACATCACAATCAAGAATTAAAGCTGATTCTCAGCCTTCATCACCTTCAAAGTCAGCATTTCCATCTCAAGGTTCT
TCTATGCCACCACGGTCGCCATCTCAAGAAAATTCTCGACAACAACCATCAGAAAAAACCTCTCGGGTTCAGTCTCCATCTCATTTGTCCAGTAAACCTACCGCACAATC
AACATCACAACAGCCTGTTAAGTCTCCCGCAGCCATTGGAATTCAAAACCATCCGAATTCAAAACCATCATCACAATCAAGATTTAAAGCTGATTCTCAGCCTTCATCAT
CTTCAAAGTCAACATTTCCATCTCAACATTCGTCTATGCCACCACGGTCACCATCTCAAGAAAATTCTCAGCAACAACCATCGGAAAAAACCTCTCGGGTTCAGTCTCCA
TCTCATTTGTCCGCTAAACCTACTGCACAATCAACAACACAACAACCTATTGAATCTCCTACAGCCATTGGAGACCAAACAACAGATGGAATCATTTTTCATCCTGCAAA
TCAATCCCCAAAAGCAAGACCTACAAGCAGGGAAAGTCAATTGCAAACCAAATCAAAGCAGTCTTCGAAACCAAATGCGAAACCAGTGGAATCAAAAGCATCAAAATATC
AGCCGGAAACCTCGGAAGAGCTCACATCTAAGAACACTTCCAATCCCCATCCGAACCAGGACTATTCTGAAAACCCAACGCAATCTGATCAAACCATAGAAAATGGCTTA
GATTCCTCTCTAGAATCAGAGGCAGAGTCAAAAGAAACTAAGGAAGATCTGGCAAAGACAACAAATGCACTTCAAACCAAAGCAGCTAGAAGCACATTAATCACATCTTC
TAAAAGCCGTTCATCATTTGAACCAGAAAAGTGGGACTCACAACAGGAAGAATCCATGGAAGACTTATCCAAAGCTTTTCAGAAACTAAACATCAAATATTCAGACAAAG
AAAATCCAAAGAGTTTCACAACACTAATCGGCGATAACAAAGGGTCGTCAATGCACTTACTCTCCGGCGAAGCCAAAAGCGACAGCCCAATCCACATCCACCGTCAGTAT
AAGAGCAATCCAGATCAAAGCCCTAAAAGTTCCACGGACATCGAAGGAAATTTCAATAACGGAACACCGCACGATTCAAGAACAGAAGAGAATCCACCACCTCTGGAATT
ATATATCAACGTCAATGTACAAGGTATCAACAACTCAATCATGTGCAATACCTCATTTACAGAGAATGATCCTGGAATCAAGTTGAAGTTCCCTGGAGAACCAACAAAAT
CTGAAGATGAATTAGAGTCTCATCACGCTAGAAAAGCAAACTACAGTGCGAAACCTGCCGAGAAGCTTACGTATGAACCCAGAGTAAGACGAAGATGCCTTAGAGGGATG
TTAATGGAGTCGAGCGATTCTGAGGCCGAGAATCCAGGAAAGTCCCGACGCCATGGCTGCCGGTACAGTCGTAGTAGCAAAGGAAAAGAGGTCGAAACTCAATAA
mRNA sequenceShow/hide mRNA sequence
ATGTCATACTCACAATTGCGCATTCTACTTCCTTGGCAATCGTTAAAAGCTTCTCCTCGCCCTGCAAATGATTCGTCAGGACGGAGTTTTGAGCTTACAGATGAAGCGGA
AACTTCTGCCTCTGCAGCCGATACCATGCCAAATATTTGGCATCCACCAGTCCAATCTCCTGAAATAAAACCAGAACAGCCTCCTTTAGCACCAGTTCAGGCACCAGAAA
ATAGTGAAACTATGCCACCTTCAAAATCTCACAAGGCAAGCAAAGTTCAATCTCAGCCACCATCAAATTCCCGAGCCAAAAACCGGTCACGAACAGCTTCCAAGCCTCCA
TCACCATCGAAAGCAATCCCTCAATCTTCAGTTGCTTCCAACAAGTCTCCTTCAACATCAGGCAAAGGCTCCCTATCTCAGGATACTTCAAAGCCATCATCACCCGCAGG
CAAAGCCTCTTCGTCTCAGGATGCTTCTTCAAAGCCTTCATCCTTTGCAGCAGTAGCAGCTACAGCTCCTCGAAGCCGGATTGCTTCAAAGCCATTGTCTCCATCATCTC
AAACATCCAGTAAAAACCATCCAAATTCAAAACCAACATCACAATCAAGAATTAAAGCTGATTCTCAGCCTTCATCACCTTCAAAGTCAGCATTTCCATCTCAAGGTTCT
TCTATGCCACCACGGTCGCCATCTCAAGAAAATTCTCGACAACAACCATCAGAAAAAACCTCTCGGGTTCAGTCTCCATCTCATTTGTCCAGTAAACCTACCGCACAATC
AACATCACAACAGCCTGTTAAGTCTCCCGCAGCCATTGGAATTCAAAACCATCCGAATTCAAAACCATCATCACAATCAAGATTTAAAGCTGATTCTCAGCCTTCATCAT
CTTCAAAGTCAACATTTCCATCTCAACATTCGTCTATGCCACCACGGTCACCATCTCAAGAAAATTCTCAGCAACAACCATCGGAAAAAACCTCTCGGGTTCAGTCTCCA
TCTCATTTGTCCGCTAAACCTACTGCACAATCAACAACACAACAACCTATTGAATCTCCTACAGCCATTGGAGACCAAACAACAGATGGAATCATTTTTCATCCTGCAAA
TCAATCCCCAAAAGCAAGACCTACAAGCAGGGAAAGTCAATTGCAAACCAAATCAAAGCAGTCTTCGAAACCAAATGCGAAACCAGTGGAATCAAAAGCATCAAAATATC
AGCCGGAAACCTCGGAAGAGCTCACATCTAAGAACACTTCCAATCCCCATCCGAACCAGGACTATTCTGAAAACCCAACGCAATCTGATCAAACCATAGAAAATGGCTTA
GATTCCTCTCTAGAATCAGAGGCAGAGTCAAAAGAAACTAAGGAAGATCTGGCAAAGACAACAAATGCACTTCAAACCAAAGCAGCTAGAAGCACATTAATCACATCTTC
TAAAAGCCGTTCATCATTTGAACCAGAAAAGTGGGACTCACAACAGGAAGAATCCATGGAAGACTTATCCAAAGCTTTTCAGAAACTAAACATCAAATATTCAGACAAAG
AAAATCCAAAGAGTTTCACAACACTAATCGGCGATAACAAAGGGTCGTCAATGCACTTACTCTCCGGCGAAGCCAAAAGCGACAGCCCAATCCACATCCACCGTCAGTAT
AAGAGCAATCCAGATCAAAGCCCTAAAAGTTCCACGGACATCGAAGGAAATTTCAATAACGGAACACCGCACGATTCAAGAACAGAAGAGAATCCACCACCTCTGGAATT
ATATATCAACGTCAATGTACAAGGTATCAACAACTCAATCATGTGCAATACCTCATTTACAGAGAATGATCCTGGAATCAAGTTGAAGTTCCCTGGAGAACCAACAAAAT
CTGAAGATGAATTAGAGTCTCATCACGCTAGAAAAGCAAACTACAGTGCGAAACCTGCCGAGAAGCTTACGTATGAACCCAGAGTAAGACGAAGATGCCTTAGAGGGATG
TTAATGGAGTCGAGCGATTCTGAGGCCGAGAATCCAGGAAAGTCCCGACGCCATGGCTGCCGGTACAGTCGTAGTAGCAAAGGAAAAGAGGTCGAAACTCAATAA
Protein sequenceShow/hide protein sequence
MSYSQLRILLPWQSLKASPRPANDSSGRSFELTDEAETSASAADTMPNIWHPPVQSPEIKPEQPPLAPVQAPENSETMPPSKSHKASKVQSQPPSNSRAKNRSRTASKPP
SPSKAIPQSSVASNKSPSTSGKGSLSQDTSKPSSPAGKASSSQDASSKPSSFAAVAATAPRSRIASKPLSPSSQTSSKNHPNSKPTSQSRIKADSQPSSPSKSAFPSQGS
SMPPRSPSQENSRQQPSEKTSRVQSPSHLSSKPTAQSTSQQPVKSPAAIGIQNHPNSKPSSQSRFKADSQPSSSSKSTFPSQHSSMPPRSPSQENSQQQPSEKTSRVQSP
SHLSAKPTAQSTTQQPIESPTAIGDQTTDGIIFHPANQSPKARPTSRESQLQTKSKQSSKPNAKPVESKASKYQPETSEELTSKNTSNPHPNQDYSENPTQSDQTIENGL
DSSLESEAESKETKEDLAKTTNALQTKAARSTLITSSKSRSSFEPEKWDSQQEESMEDLSKAFQKLNIKYSDKENPKSFTTLIGDNKGSSMHLLSGEAKSDSPIHIHRQY
KSNPDQSPKSSTDIEGNFNNGTPHDSRTEENPPPLELYINVNVQGINNSIMCNTSFTENDPGIKLKFPGEPTKSEDELESHHARKANYSAKPAEKLTYEPRVRRRCLRGM
LMESSDSEAENPGKSRRHGCRYSRSSKGKEVETQ