; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC11g0859 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC11g0859
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Descriptionpre-mRNA-splicing factor CWC22 homolog isoform X1
Genome locationMC11:7293640..7297132
RNA-Seq ExpressionMC11g0859
SyntenyMC11g0859
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7024815.1 hypothetical protein SDJN02_13634, partial [Cucurbita argyrosperma subsp. argyrosperma]5.68e-28576.65Show/hide
Query:  RSRRKNRSSRKLKSKKLRYRHDSPSCSDTDFESSTSLSSSSSEDDEKVGRSRSNKLKNAKPGKKRAKKRSRDDQIRDLSPHPRKRKHSKRHDRCEVKKTN
        RSRRK++SSRKLKSKKLRYRHDSPSCSDTDFESSTS+SSSSSEDD+KV RSRS   KN+KP KKR+KK+S D Q R+ SPHPRKRKHSKR DR E KKTN
Subjt:  RSRRKNRSSRKLKSKKLRYRHDSPSCSDTDFESSTSLSSSSSEDDEKVGRSRSNKLKNAKPGKKRAKKRSRDDQIRDLSPHPRKRKHSKRHDRCEVKKTN

Query:  KRKKRRRDVSVSATSRDSLSCSTCGDGSTTSNESEIDRHRGRSGKRKRNRGKTERSRYRSKSHSPCSLCSEGSDYQNEVEDGSYVENNFRRLRSVIVVVG
        K KKRRRDVSV AT+ DSL  STCGDGS+TS++SEIDR RGRSGKRKRN  KTE  RYRSKSHSPCSLCS+GSD+QNEVED SYVEN+ RRL+S+IVVVG
Subjt:  KRKKRRRDVSVSATSRDSLSCSTCGDGSTTSNESEIDRHRGRSGKRKRNRGKTERSRYRSKSHSPCSLCSEGSDYQNEVEDGSYVENNFRRLRSVIVVVG

Query:  EENKLKTFDGNEQQEEVMHHPDDDHPSFGDMDSNDGMSKRELDRVTSNEASEVENKKEVVIPDIRNFLVVKDGGVQNEGSNNNHGGVTNDHPLNEGNNG-
        EE++LKTF GNEQQE V H  DD+HPSFGDM+S DG SKRELD V S EA EVE+K ++V PD RN L++ D GV+NEGSN NHGGVTNDH L+E  NG 
Subjt:  EENKLKTFDGNEQQEEVMHHPDDDHPSFGDMDSNDGMSKRELDRVTSNEASEVENKKEVVIPDIRNFLVVKDGGVQNEGSNNNHGGVTNDHPLNEGNNG-

Query:  SGNTDRINCIDLESILRQRALENLRKFKGVPPKNVETSANCQVDNSNDAKQLYSPVSNSVRIKSPRDDAEINGKGFSGQGGGNAVNPMIVEENGVESTNA
        SGNT+ INCIDLESILRQ+ALENLRKFKGV P+NVE  ANC+V+N+NDAKQL SPVS SV + SPRDDAEIN KGFS Q GG+AVN MIV+ NG +ST+A
Subjt:  SGNTDRINCIDLESILRQRALENLRKFKGVPPKNVETSANCQVDNSNDAKQLYSPVSNSVRIKSPRDDAEINGKGFSGQGGGNAVNPMIVEENGVESTNA

Query:  IDSAVASTHDPIYSSQNLGKISSTSNGMNELKQDISSLDQEAVNDNICQKVDADICSTTSRSNLVYAALRPDSKVDFLVKQASAPQEYIQTKPSISDMGV
        ID+AVAS HDP+ SSQNLGKIS+ SNGMNELKQDISSLDQE +NDNIC K DADI STT+ SNLV AA RP+SKVD  +K+ASA QE IQTK SISD+ V
Subjt:  IDSAVASTHDPIYSSQNLGKISSTSNGMNELKQDISSLDQEAVNDNICQKVDADICSTTSRSNLVYAALRPDSKVDFLVKQASAPQEYIQTKPSISDMGV

Query:  DEIAQMQIQTRNNDDQNIVNGFDSSAHKPSSSLNYFSGENSLNKPRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALARRQLKR
        DE AQ Q Q  NNDDQNI NGF SSA+KPSSSLN  SGENSL+K R ESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPAL RRQLKR
Subjt:  DEIAQMQIQTRNNDDQNIVNGFDSSAHKPSSSLNYFSGENSLNKPRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALARRQLKR

XP_022139776.1 uncharacterized protein LOC111010601 [Momordica charantia]0.0100Show/hide
Query:  RSRRKNRSSRKLKSKKLRYRHDSPSCSDTDFESSTSLSSSSSEDDEKVGRSRSNKLKNAKPGKKRAKKRSRDDQIRDLSPHPRKRKHSKRHDRCEVKKTN
        RSRRKNRSSRKLKSKKLRYRHDSPSCSDTDFESSTSLSSSSSEDDEKVGRSRSNKLKNAKPGKKRAKKRSRDDQIRDLSPHPRKRKHSKRHDRCEVKKTN
Subjt:  RSRRKNRSSRKLKSKKLRYRHDSPSCSDTDFESSTSLSSSSSEDDEKVGRSRSNKLKNAKPGKKRAKKRSRDDQIRDLSPHPRKRKHSKRHDRCEVKKTN

Query:  KRKKRRRDVSVSATSRDSLSCSTCGDGSTTSNESEIDRHRGRSGKRKRNRGKTERSRYRSKSHSPCSLCSEGSDYQNEVEDGSYVENNFRRLRSVIVVVG
        KRKKRRRDVSVSATSRDSLSCSTCGDGSTTSNESEIDRHRGRSGKRKRNRGKTERSRYRSKSHSPCSLCSEGSDYQNEVEDGSYVENNFRRLRSVIVVVG
Subjt:  KRKKRRRDVSVSATSRDSLSCSTCGDGSTTSNESEIDRHRGRSGKRKRNRGKTERSRYRSKSHSPCSLCSEGSDYQNEVEDGSYVENNFRRLRSVIVVVG

Query:  EENKLKTFDGNEQQEEVMHHPDDDHPSFGDMDSNDGMSKRELDRVTSNEASEVENKKEVVIPDIRNFLVVKDGGVQNEGSNNNHGGVTNDHPLNEGNNGS
        EENKLKTFDGNEQQEEVMHHPDDDHPSFGDMDSNDGMSKRELDRVTSNEASEVENKKEVVIPDIRNFLVVKDGGVQNEGSNNNHGGVTNDHPLNEGNNGS
Subjt:  EENKLKTFDGNEQQEEVMHHPDDDHPSFGDMDSNDGMSKRELDRVTSNEASEVENKKEVVIPDIRNFLVVKDGGVQNEGSNNNHGGVTNDHPLNEGNNGS

Query:  GNTDRINCIDLESILRQRALENLRKFKGVPPKNVETSANCQVDNSNDAKQLYSPVSNSVRIKSPRDDAEINGKGFSGQGGGNAVNPMIVEENGVESTNAI
        GNTDRINCIDLESILRQRALENLRKFKGVPPKNVETSANCQVDNSNDAKQLYSPVSNSVRIKSPRDDAEINGKGFSGQGGGNAVNPMIVEENGVESTNAI
Subjt:  GNTDRINCIDLESILRQRALENLRKFKGVPPKNVETSANCQVDNSNDAKQLYSPVSNSVRIKSPRDDAEINGKGFSGQGGGNAVNPMIVEENGVESTNAI

Query:  DSAVASTHDPIYSSQNLGKISSTSNGMNELKQDISSLDQEAVNDNICQKVDADICSTTSRSNLVYAALRPDSKVDFLVKQASAPQEYIQTKPSISDMGVD
        DSAVASTHDPIYSSQNLGKISSTSNGMNELKQDISSLDQEAVNDNICQKVDADICSTTSRSNLVYAALRPDSKVDFLVKQASAPQEYIQTKPSISDMGVD
Subjt:  DSAVASTHDPIYSSQNLGKISSTSNGMNELKQDISSLDQEAVNDNICQKVDADICSTTSRSNLVYAALRPDSKVDFLVKQASAPQEYIQTKPSISDMGVD

Query:  EIAQMQIQTRNNDDQNIVNGFDSSAHKPSSSLNYFSGENSLNKPRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALARRQLKR
        EIAQMQIQTRNNDDQNIVNGFDSSAHKPSSSLNYFSGENSLNKPRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALARRQLKR
Subjt:  EIAQMQIQTRNNDDQNIVNGFDSSAHKPSSSLNYFSGENSLNKPRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALARRQLKR

XP_022976451.1 uncharacterized protein LOC111476850 isoform X1 [Cucurbita maxima]3.09e-28275.63Show/hide
Query:  RSRRKNRSSRKLKSKKLRYRHDSPSCSDTDFESSTSLSSSSSEDDEKVGRSRSNKLKNAKPGKKRAKKRSRDDQIRDLSPHPRKRKHSKRHDRCEVKKTN
        RSRRK++SSRKLKSKKLRYRHDSPSCSDTDFESSTS+SSSSSEDD+KV RSRS   KN+KP KKR+KK+S D Q R+ SPHPRKRKHSKR+DR E KKTN
Subjt:  RSRRKNRSSRKLKSKKLRYRHDSPSCSDTDFESSTSLSSSSSEDDEKVGRSRSNKLKNAKPGKKRAKKRSRDDQIRDLSPHPRKRKHSKRHDRCEVKKTN

Query:  KRKKRRRDVSVSATSRDSLSCSTCGDGSTTSNESEIDRHRGRSGKRKRNRGKTERSRYRSKSHSPCSLCSEGSDYQNEVEDGSYVENNFRRLRSVIVVVG
        K KKRRRDVSV AT+ DSLS STCGDGS+TS++SEIDR RGRSGKRKRN  KTE  RYRSKS SPCSLCS+GSD+QNEVED SYV+N  RRL+S+IVVVG
Subjt:  KRKKRRRDVSVSATSRDSLSCSTCGDGSTTSNESEIDRHRGRSGKRKRNRGKTERSRYRSKSHSPCSLCSEGSDYQNEVEDGSYVENNFRRLRSVIVVVG

Query:  EENKLKTFDGNEQQEEVMHHPDDDHPSFGDMDSNDGMSKRELDRVTSNEASEVENKKEVVIPDIRNFLVVKDGGVQNEGSNNNHGGVTNDHPLNEGNNG-
        EE++LKTF GNEQQE V H  DD+HP F DM+S DG  KRELD V S EA EVE+K ++  PD RN L++ + GV+NEGSN NHGGVTNDH L+E  NG 
Subjt:  EENKLKTFDGNEQQEEVMHHPDDDHPSFGDMDSNDGMSKRELDRVTSNEASEVENKKEVVIPDIRNFLVVKDGGVQNEGSNNNHGGVTNDHPLNEGNNG-

Query:  SGNTDRINCIDLESILRQRALENLRKFKGVPPKNVETSANCQVDNSNDAKQLYSPVSNSVRIKSPRDDAEINGKGFSGQGGGNAVNPMIVEENGVESTNA
        SGNTD INCIDLESILRQ+ALENLRKFKG  P+NVE  ANC+V+N+NDAKQL+SPVS SV + SPRDDAE NGKGFS Q GG+AVN MI++ NG +ST+A
Subjt:  SGNTDRINCIDLESILRQRALENLRKFKGVPPKNVETSANCQVDNSNDAKQLYSPVSNSVRIKSPRDDAEINGKGFSGQGGGNAVNPMIVEENGVESTNA

Query:  IDSAVASTHDPIYSSQNLGKISSTSNGMNELKQDISSLDQEAVNDNICQKVDADICSTTSRSNLVYAALRPDSKVDFLVKQASAPQEYIQTKPSISDMGV
        ID+AVAS HDP+ SSQNLGKIS+ SNGMNELKQDISSLDQE +NDNIC K DA+I STT+RSNLV AA RP+SKVD L+++ASA QE IQTKPSISD+ V
Subjt:  IDSAVASTHDPIYSSQNLGKISSTSNGMNELKQDISSLDQEAVNDNICQKVDADICSTTSRSNLVYAALRPDSKVDFLVKQASAPQEYIQTKPSISDMGV

Query:  DEIAQMQIQTRNNDDQNIVNGFDSSAHKPSSSLNYFSGENSLNKPRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALARRQLKR
        DEI+Q Q Q  NNDDQNI NGF SSA+KPSSSLN  SGENSL+K R ESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPAL RRQLKR
Subjt:  DEIAQMQIQTRNNDDQNIVNGFDSSAHKPSSSLNYFSGENSLNKPRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALARRQLKR

XP_023535556.1 transcriptional regulator ATRX isoform X1 [Cucurbita pepo subsp. pepo]6.95e-28676.14Show/hide
Query:  RSRRKNRSSRKLKSKKLRYRHDSPSCSDTDFESSTSLSSSSSEDDEKVGRSRSNKLKNAKPGKKRAKKRSRDDQIRDLSPHPRKRKHSKRHDRCEVKKTN
        RSRRK++SSRKLKSKKLRYRHDSPSCSDTDFESSTS+SSSSSEDD+KV RSRS   KN+KP KKR+KK+S D Q R+  PHPRKRKHSKR DR E KKTN
Subjt:  RSRRKNRSSRKLKSKKLRYRHDSPSCSDTDFESSTSLSSSSSEDDEKVGRSRSNKLKNAKPGKKRAKKRSRDDQIRDLSPHPRKRKHSKRHDRCEVKKTN

Query:  KRKKRRRDVSVSATSRDSLSCSTCGDGSTTSNESEIDRHRGRSGKRKRNRGKTERSRYRSKSHSPCSLCSEGSDYQNEVEDGSYVENNFRRLRSVIVVVG
        K KKRRRDVSV AT+ DSLS STCGDGS+TS++SEIDR RGRSGKRKRN  KTE  RYRSKS SPCSLCS+G D+QNEVED SYVEN+ RRL+S+IVVVG
Subjt:  KRKKRRRDVSVSATSRDSLSCSTCGDGSTTSNESEIDRHRGRSGKRKRNRGKTERSRYRSKSHSPCSLCSEGSDYQNEVEDGSYVENNFRRLRSVIVVVG

Query:  EENKLKTFDGNEQQEEVMHHPDDDHPSFGDMDSNDGMSKRELDRVTSNEASEVENKKEVVIPDIRNFLVVKDGGVQNEGSNNNHGGVTNDHPLNEGNNG-
        EE++LKTF GNEQQE V H  D++HPSFGDM+S DG SKRELD V S EA EVE+K ++V PD RN L++ D GV+NEGSN NHGGVTNDH L+E  NG 
Subjt:  EENKLKTFDGNEQQEEVMHHPDDDHPSFGDMDSNDGMSKRELDRVTSNEASEVENKKEVVIPDIRNFLVVKDGGVQNEGSNNNHGGVTNDHPLNEGNNG-

Query:  SGNTDRINCIDLESILRQRALENLRKFKGVPPKNVETSANCQVDNSNDAKQLYSPVSNSVRIKSPRDDAEINGKGFSGQGGGNAVNPMIVEENGVESTNA
        SGNTD INCI+LESILRQ+ALENLRKFKGV P+NVE  +NC+V+N+NDAKQL SPVS SV +  PRDDAEINGKGFS Q GG+AVN MIV+ENG +ST+A
Subjt:  SGNTDRINCIDLESILRQRALENLRKFKGVPPKNVETSANCQVDNSNDAKQLYSPVSNSVRIKSPRDDAEINGKGFSGQGGGNAVNPMIVEENGVESTNA

Query:  IDSAVASTHDPIYSSQNLGKISSTSNGMNELKQDISSLDQEAVNDNICQKVDADICSTTSRSNLVYAALRPDSKVDFLVKQASAPQEYIQTKPSISDMGV
        ID+AVAS HDP+ SSQNLGKIS+ SNGMNELKQDISSLDQE +NDNIC K DADI STT+RSNLV AA RP+SKVD L+++ASA QE I+TKPSISD+ V
Subjt:  IDSAVASTHDPIYSSQNLGKISSTSNGMNELKQDISSLDQEAVNDNICQKVDADICSTTSRSNLVYAALRPDSKVDFLVKQASAPQEYIQTKPSISDMGV

Query:  DEIAQMQIQTRNNDDQNIVNGFDSSAHKPSSSLNYFSGENSLNKPRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALARRQLKR
        DE AQ + Q +NN+DQNI NGF SSA+KPSSSLN  SGENSL+K RHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPAL RRQLKR
Subjt:  DEIAQMQIQTRNNDDQNIVNGFDSSAHKPSSSLNYFSGENSLNKPRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALARRQLKR

XP_038897880.1 histone-lysine N-methyltransferase SETD2 isoform X1 [Benincasa hispida]3.86e-28577.07Show/hide
Query:  RSRRKNRSSRKLKSKKLRYRHDSPSCSDTDFESSTSLSSSSSEDDEKVGRSRSNKLKNAKPGKKRAKKRSRDDQIRDLSPHPRKRKHSKRHDRCEVKKTN
        RSRRK++SS+KLKSKKLRYRHDSPSCSDTDFESSTS+SSSS EDD++V RSRS   KNAKP KKR+K++S D Q R+ SPHPRKRKHSKR+D CE KK  
Subjt:  RSRRKNRSSRKLKSKKLRYRHDSPSCSDTDFESSTSLSSSSSEDDEKVGRSRSNKLKNAKPGKKRAKKRSRDDQIRDLSPHPRKRKHSKRHDRCEVKKTN

Query:  KRKKRRRDVSVSATSRDSLSCSTCGDGSTTSNESEIDRHRGRSGKRKRNRGKTERSRYRSKSHSPCSLCSEGSDYQNEVEDGSYVENNFRRLRSVIVVVG
        K KKRRRD SV A S DS SCSTCG+GSTTSNESE+ R RGRSGKRK N GKTER RYRSKS SPCSL S+ SDYQNEV+D SYV NNFRRLRS+IV+ G
Subjt:  KRKKRRRDVSVSATSRDSLSCSTCGDGSTTSNESEIDRHRGRSGKRKRNRGKTERSRYRSKSHSPCSLCSEGSDYQNEVEDGSYVENNFRRLRSVIVVVG

Query:  EENKLKTFDGNEQQEEVMHHPDD--DHPSFGDMDSNDGMSKRELDRVTSNEASEVENKKEVVIPDIRNFLVVKDGGVQNEGSNNNHGGVTNDHPLNEGNN
        EENKLKTF GNEQQE   H P+D  DHPS GDMDS D  SKRELD V S E   VE KKEV +P+ RN +VVKD GVQNEGSN N GGVTNDH L+E  N
Subjt:  EENKLKTFDGNEQQEEVMHHPDD--DHPSFGDMDSNDGMSKRELDRVTSNEASEVENKKEVVIPDIRNFLVVKDGGVQNEGSNNNHGGVTNDHPLNEGNN

Query:  G-SGNTDRINCIDLESILRQRALENLRKFKGVPPKNVETSANCQVDNSNDAKQLYSPVSNSVRIKSPRDDAEINGKGFSGQGGGNAVNPMIVEENGVEST
        G SG TD +N IDLESILRQRALENLRKFKG PP+NVET ANC+VD++NDAKQL SPVS SV + SPRDDAEIN KGFS QGGGNAVN MIV+ENGV+ST
Subjt:  G-SGNTDRINCIDLESILRQRALENLRKFKGVPPKNVETSANCQVDNSNDAKQLYSPVSNSVRIKSPRDDAEINGKGFSGQGGGNAVNPMIVEENGVEST

Query:  NAIDSAVASTHDPIYSSQNLGKISSTSNGMNELKQDISSLDQEAVNDNICQKVDADICSTTSRSNLVYAALRPDSKVDFLVKQASAPQEYIQTKPSISDM
        +AIDS+V S HDP+YSSQNLGKIS+ SNGMNELKQ+ISSLDQE +NDNICQK DADICSTT+RSNLV AALRP+SKVD L+KQA A QE IQTKPSISD+
Subjt:  NAIDSAVASTHDPIYSSQNLGKISSTSNGMNELKQDISSLDQEAVNDNICQKVDADICSTTSRSNLVYAALRPDSKVDFLVKQASAPQEYIQTKPSISDM

Query:  GVDEIAQMQIQTRNNDDQNIVNGFDSSAHKPSSSLNYFSGENSLNKPRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALARRQLKR
        GVDE AQ Q Q RNNDDQNI NG DSSAHKPSS LN  SGENSL+  RHESG+ SQFEQKTMSVMRGGEMVQVNYKVYIPKRAPAL RRQLKR
Subjt:  GVDEIAQMQIQTRNNDDQNIVNGFDSSAHKPSSSLNYFSGENSLNKPRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALARRQLKR

TrEMBL top hitse value%identityAlignment
A0A1S3CKB4 uncharacterized protein LOC103501777 isoform X13.30e-27673.82Show/hide
Query:  MPMQISFAST--FVLFCFLVRTMKRSRRKNRSSRKLKSKKLRYRHDSPSCSDTDFESSTSLSSSSSEDDEKVGRSRSNKLKNAKPGKKRAKKRSRDDQIR
        MPM+ISFAS   FV F FL  TMKRSRRK++SSRKLKSKK RYRHDSPSCSDTDFESSTS+ SSSSEDD+KV RSRS   KN KP KKR KK+S D Q R
Subjt:  MPMQISFAST--FVLFCFLVRTMKRSRRKNRSSRKLKSKKLRYRHDSPSCSDTDFESSTSLSSSSSEDDEKVGRSRSNKLKNAKPGKKRAKKRSRDDQIR

Query:  DLSPHPRKRKHSKRHDRCEVKKTNKRKKRRRDVSVSATSRDSLSCSTCGDGSTTSNESEIDRHRGRSGKRKRNRGKTERSRYRSKSHSPCSLCSEGSDYQ
        + SP+PRKRKHSKR+DR EV K NK KKRRRDVSV  +  DSLSCSTCG+G+TTSNESE+ R RGR  KRK N  KT   RY SKS SPCSL SEGSDYQ
Subjt:  DLSPHPRKRKHSKRHDRCEVKKTNKRKKRRRDVSVSATSRDSLSCSTCGDGSTTSNESEIDRHRGRSGKRKRNRGKTERSRYRSKSHSPCSLCSEGSDYQ

Query:  NEVEDGSYVENNFRRLRSVIVVVGEENKLKTFDGNEQQEEVMHHPDDDHPSFGDMDSNDGMSKRELDRVTSNEASEVENKKEVVIPDIRNFLVVKDGGVQ
        NEV+D SYVE NFRRLRS+IVVVGEENKLKTF GNEQQE V +   DDHPS G+MDS D   KR LD V + EA  VEN+KEV +P+ RN +VVKDGGVQ
Subjt:  NEVEDGSYVENNFRRLRSVIVVVGEENKLKTFDGNEQQEEVMHHPDDDHPSFGDMDSNDGMSKRELDRVTSNEASEVENKKEVVIPDIRNFLVVKDGGVQ

Query:  NEGSNNNHGGVTNDHPLNEGNNG-SGNTDRINCIDLESILRQRALENLRKFKGVPPKNVETSANCQVDNSNDAKQLYSPVSNSVRIKSPRDDAEINGKGF
        NEGSN NHGGVTNDH  +E  NG S NTD INCIDLES+LRQRALENLRKFKG  P+NVET ANC+VD++N AKQL SPVS+SV + SPR++AEIN K F
Subjt:  NEGSNNNHGGVTNDHPLNEGNNG-SGNTDRINCIDLESILRQRALENLRKFKGVPPKNVETSANCQVDNSNDAKQLYSPVSNSVRIKSPRDDAEINGKGF

Query:  SGQGGGNAVNPMIVEENGVESTNAIDSAVASTHDPIYSSQNLGKISSTSNGMNELKQDISSLDQEAVNDNICQKVDADICSTTSRSNLVYAALRPDSKVD
        S QGGGNA+N MI++ENGV+S +AIDSAVA+ HDP+YSSQNLGKIS+ SNGMNE KQDISSLDQE +NDNICQK DADICSTT+RSNLV AALRP+ KVD
Subjt:  SGQGGGNAVNPMIVEENGVESTNAIDSAVASTHDPIYSSQNLGKISSTSNGMNELKQDISSLDQEAVNDNICQKVDADICSTTSRSNLVYAALRPDSKVD

Query:  FLVKQASAPQEYIQTKPSISDMGVDEIAQMQIQTRNNDDQNIVNGFDSSAHKPSSSLNYFSGENSLNKPRHESGEGSQFEQKTMSVMRGGEMVQVNYKVY
         L+KQ SA QE +QTKPSISD+GV E AQ+Q Q RNNDD NI NG  SSA++PSS LN  SGE+SLN    ESGE SQFEQKTMSVMRGGEMVQVNYKVY
Subjt:  FLVKQASAPQEYIQTKPSISDMGVDEIAQMQIQTRNNDDQNIVNGFDSSAHKPSSSLNYFSGENSLNKPRHESGEGSQFEQKTMSVMRGGEMVQVNYKVY

Query:  IPKRAPALARRQLKR
        IPKRAPAL RRQLKR
Subjt:  IPKRAPALARRQLKR

A0A6J1CDR0 uncharacterized protein LOC1110106010.0100Show/hide
Query:  RSRRKNRSSRKLKSKKLRYRHDSPSCSDTDFESSTSLSSSSSEDDEKVGRSRSNKLKNAKPGKKRAKKRSRDDQIRDLSPHPRKRKHSKRHDRCEVKKTN
        RSRRKNRSSRKLKSKKLRYRHDSPSCSDTDFESSTSLSSSSSEDDEKVGRSRSNKLKNAKPGKKRAKKRSRDDQIRDLSPHPRKRKHSKRHDRCEVKKTN
Subjt:  RSRRKNRSSRKLKSKKLRYRHDSPSCSDTDFESSTSLSSSSSEDDEKVGRSRSNKLKNAKPGKKRAKKRSRDDQIRDLSPHPRKRKHSKRHDRCEVKKTN

Query:  KRKKRRRDVSVSATSRDSLSCSTCGDGSTTSNESEIDRHRGRSGKRKRNRGKTERSRYRSKSHSPCSLCSEGSDYQNEVEDGSYVENNFRRLRSVIVVVG
        KRKKRRRDVSVSATSRDSLSCSTCGDGSTTSNESEIDRHRGRSGKRKRNRGKTERSRYRSKSHSPCSLCSEGSDYQNEVEDGSYVENNFRRLRSVIVVVG
Subjt:  KRKKRRRDVSVSATSRDSLSCSTCGDGSTTSNESEIDRHRGRSGKRKRNRGKTERSRYRSKSHSPCSLCSEGSDYQNEVEDGSYVENNFRRLRSVIVVVG

Query:  EENKLKTFDGNEQQEEVMHHPDDDHPSFGDMDSNDGMSKRELDRVTSNEASEVENKKEVVIPDIRNFLVVKDGGVQNEGSNNNHGGVTNDHPLNEGNNGS
        EENKLKTFDGNEQQEEVMHHPDDDHPSFGDMDSNDGMSKRELDRVTSNEASEVENKKEVVIPDIRNFLVVKDGGVQNEGSNNNHGGVTNDHPLNEGNNGS
Subjt:  EENKLKTFDGNEQQEEVMHHPDDDHPSFGDMDSNDGMSKRELDRVTSNEASEVENKKEVVIPDIRNFLVVKDGGVQNEGSNNNHGGVTNDHPLNEGNNGS

Query:  GNTDRINCIDLESILRQRALENLRKFKGVPPKNVETSANCQVDNSNDAKQLYSPVSNSVRIKSPRDDAEINGKGFSGQGGGNAVNPMIVEENGVESTNAI
        GNTDRINCIDLESILRQRALENLRKFKGVPPKNVETSANCQVDNSNDAKQLYSPVSNSVRIKSPRDDAEINGKGFSGQGGGNAVNPMIVEENGVESTNAI
Subjt:  GNTDRINCIDLESILRQRALENLRKFKGVPPKNVETSANCQVDNSNDAKQLYSPVSNSVRIKSPRDDAEINGKGFSGQGGGNAVNPMIVEENGVESTNAI

Query:  DSAVASTHDPIYSSQNLGKISSTSNGMNELKQDISSLDQEAVNDNICQKVDADICSTTSRSNLVYAALRPDSKVDFLVKQASAPQEYIQTKPSISDMGVD
        DSAVASTHDPIYSSQNLGKISSTSNGMNELKQDISSLDQEAVNDNICQKVDADICSTTSRSNLVYAALRPDSKVDFLVKQASAPQEYIQTKPSISDMGVD
Subjt:  DSAVASTHDPIYSSQNLGKISSTSNGMNELKQDISSLDQEAVNDNICQKVDADICSTTSRSNLVYAALRPDSKVDFLVKQASAPQEYIQTKPSISDMGVD

Query:  EIAQMQIQTRNNDDQNIVNGFDSSAHKPSSSLNYFSGENSLNKPRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALARRQLKR
        EIAQMQIQTRNNDDQNIVNGFDSSAHKPSSSLNYFSGENSLNKPRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALARRQLKR
Subjt:  EIAQMQIQTRNNDDQNIVNGFDSSAHKPSSSLNYFSGENSLNKPRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALARRQLKR

A0A6J1F6D1 uncharacterized protein LOC111442542 isoform X12.32e-27975.47Show/hide
Query:  RSRRKNRSSRKLKSKKLRYRHDSPSCSDTDFESSTSLSSSSSEDDEKVGRSRSNKLKNAKPGKKRAKKRSRDDQIRDLSPHPRKRKHSKRHDRCEVKKTN
        RSRRK++SSR+LKSKKLRYRHDSPSCSDTDFESSTS+SSSSSEDD+KV RSRS   KN+KP KKR+KK+S D Q R+ SPHPRKRKHSKR DR E KKTN
Subjt:  RSRRKNRSSRKLKSKKLRYRHDSPSCSDTDFESSTSLSSSSSEDDEKVGRSRSNKLKNAKPGKKRAKKRSRDDQIRDLSPHPRKRKHSKRHDRCEVKKTN

Query:  KRKKRRRDVSVSATSRDSLSCSTCGDGSTTSNESEIDRHRGRSGKRKRNRGKTERSRYRSKSHSPCSLCSEGSDYQNEVEDGSYVENNFRRLRSVIVVVG
        K KKRRRD SV AT+ DSL  STCGDGS+TS++SEIDR RGRSGKRKRN  KTE  RYRSKS SPCSLCS+GSD+QNEVED SYVEN+ RRL+S+IVVVG
Subjt:  KRKKRRRDVSVSATSRDSLSCSTCGDGSTTSNESEIDRHRGRSGKRKRNRGKTERSRYRSKSHSPCSLCSEGSDYQNEVEDGSYVENNFRRLRSVIVVVG

Query:  EENKLKTFDGNEQQEEVMHHPDDDHPSFGDMDSNDGMSKRELDRVTSNEASEVENKKEVVIPDIRNFLVVKDGGVQNEGSNNNHGGVTNDHPLNEGNNG-
        EE++LKTF GNEQQE V H  DD+HPSFGDM+S DG SKRELD V S EA EVE+K ++V PD RN L++ D GV+NEGSN NHGGVTNDH L+E  NG 
Subjt:  EENKLKTFDGNEQQEEVMHHPDDDHPSFGDMDSNDGMSKRELDRVTSNEASEVENKKEVVIPDIRNFLVVKDGGVQNEGSNNNHGGVTNDHPLNEGNNG-

Query:  SGNTDRINCIDLESILRQRALENLRKFKGVPPKNVETSANCQVDNSNDAKQLYSPVSNSVRIKSPRDDAEINGKGFSGQGGGNAVNPMIVEENGVESTNA
        SGNT+ INCIDLESILRQ+ALENLRKFKGV P+NVE  ANC+V+N+NDAKQL SPVS SV +  PRDDAEIN KGFS Q GG+AVN MIV+ NG +ST+A
Subjt:  SGNTDRINCIDLESILRQRALENLRKFKGVPPKNVETSANCQVDNSNDAKQLYSPVSNSVRIKSPRDDAEINGKGFSGQGGGNAVNPMIVEENGVESTNA

Query:  IDSAVASTHDPIYSSQNLGKISSTSNGMNELKQDISSLDQEAVNDNICQKVDADICSTTSRSNLVYAALRPDSKVDFLVKQASAPQEYIQTKPSISDMGV
        ID+AVAS HDP+ SSQNLGKIS+ SNGMNE KQDISSLDQE +NDNIC K DADI STT+RSNLV AA RP+SKVD  +K+AS  QE IQTK SISD+ V
Subjt:  IDSAVASTHDPIYSSQNLGKISSTSNGMNELKQDISSLDQEAVNDNICQKVDADICSTTSRSNLVYAALRPDSKVDFLVKQASAPQEYIQTKPSISDMGV

Query:  DEIAQMQIQTRNNDDQNIVNGFDSSAHKPSSSLNYFSGENSLNKPRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALARRQLKR
        DE AQ Q Q  NNDDQNI NGF SSA+K SSSLN  SGEN L+K R ESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPAL RRQLKR
Subjt:  DEIAQMQIQTRNNDDQNIVNGFDSSAHKPSSSLNYFSGENSLNKPRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALARRQLKR

A0A6J1FN60 pre-mRNA-splicing factor CWC22 homolog isoform X11.86e-27074.67Show/hide
Query:  VLFCFLVRTMKRSRRKNRSSRKLKSKKLRYRHDSPSCSDTDFESSTSLSSSSSEDDEKVGRSRSNKLKNAKPGKKRAKKRSRDDQIRDLSPHPRKRKHSK
        VLF FL +TMKRSRRK+RSSRKLKS  +RYRHDSPS SD D ESSTSLSSSS+EDDEKVGRSRS   KNAK   KRAKKRS D Q RD SPHPRKRKHSK
Subjt:  VLFCFLVRTMKRSRRKNRSSRKLKSKKLRYRHDSPSCSDTDFESSTSLSSSSSEDDEKVGRSRSNKLKNAKPGKKRAKKRSRDDQIRDLSPHPRKRKHSK

Query:  RHDRCEVKKTNKRKKRRRDVSVSATSRDSLSCSTCGDGSTTSNESEIDRHRGRSGKRKRNRGKTERSRYRSKSHSPCSLCSEGSDYQNEVEDGSYVENNF
        R + CE+KK+NK KKR+RD SV ATS DSLSCSTCGDGSTTSNE EIDR +GRS KRK N GK ERSRY SKSHSPCSLCSEGSD+QNEVE+  YVENNF
Subjt:  RHDRCEVKKTNKRKKRRRDVSVSATSRDSLSCSTCGDGSTTSNESEIDRHRGRSGKRKRNRGKTERSRYRSKSHSPCSLCSEGSDYQNEVEDGSYVENNF

Query:  RRLRSVIVVVGEENKLKTFDGNEQQEEVMHHPDDDHPSFGDMDSNDGMSKRELDRVTSNEASEVENKKEVVIPDIRNFLVVKDGGVQNEGSNNNHGGVTN
        RRLRSVIVVVGEE+KL+TFDGNE QE V H PD DHPSFGD++ NDGMS RELD + S EA        V+I D RN +VVKD GVQNEGSNNNHGG T+
Subjt:  RRLRSVIVVVGEENKLKTFDGNEQQEEVMHHPDDDHPSFGDMDSNDGMSKRELDRVTSNEASEVENKKEVVIPDIRNFLVVKDGGVQNEGSNNNHGGVTN

Query:  DHPLNEGNNG-SGNTDRINCIDLESILRQRALENLRKFKGVPPKNVETSANCQVDNSNDAKQLYSPVSNSVRIKSPRDDAEINGKGFSGQGGGNAVNPMI
        DHPL E  NG S NTD  NCIDLESILRQRALENLRK+K V P+NVET AN +VDNSNDAKQL SPVS  V + SPRD+A ING  FS QGGGNAVN MI
Subjt:  DHPLNEGNNG-SGNTDRINCIDLESILRQRALENLRKFKGVPPKNVETSANCQVDNSNDAKQLYSPVSNSVRIKSPRDDAEINGKGFSGQGGGNAVNPMI

Query:  VEENGVESTNAIDSAVASTHDPIYSSQNLGKISSTSNGMNELKQDISSLDQEAVNDNICQKVDADICSTTSRSNLVYAALRPDSKVDFLVKQASAPQEYI
        +E NGV+ST+ +DSAVAST+DP+YSSQ+LGKIS+ SN MNELKQ ISS+DQE VND+IC   DADIC TT+RSNLV AAL+P+S VD L +QASA QE I
Subjt:  VEENGVESTNAIDSAVASTHDPIYSSQNLGKISSTSNGMNELKQDISSLDQEAVNDNICQKVDADICSTTSRSNLVYAALRPDSKVDFLVKQASAPQEYI

Query:  QTKPSISDMGVDEIAQMQIQTRNNDDQNIVNGFDSSAHKPSSS--LNYFSGENSLNKPRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALARR
        QTKPS SDMG  E  Q Q Q RNND QNI +GF SSAHKPSSS  LN  SGE+ LN  RHESGEGSQFEQKTMSV RGGEMVQVNYKVYIPKRAP L+RR
Subjt:  QTKPSISDMGVDEIAQMQIQTRNNDDQNIVNGFDSSAHKPSSS--LNYFSGENSLNKPRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALARR

Query:  QLKR
        QLKR
Subjt:  QLKR

A0A6J1IGY0 uncharacterized protein LOC111476850 isoform X11.49e-28275.63Show/hide
Query:  RSRRKNRSSRKLKSKKLRYRHDSPSCSDTDFESSTSLSSSSSEDDEKVGRSRSNKLKNAKPGKKRAKKRSRDDQIRDLSPHPRKRKHSKRHDRCEVKKTN
        RSRRK++SSRKLKSKKLRYRHDSPSCSDTDFESSTS+SSSSSEDD+KV RSRS   KN+KP KKR+KK+S D Q R+ SPHPRKRKHSKR+DR E KKTN
Subjt:  RSRRKNRSSRKLKSKKLRYRHDSPSCSDTDFESSTSLSSSSSEDDEKVGRSRSNKLKNAKPGKKRAKKRSRDDQIRDLSPHPRKRKHSKRHDRCEVKKTN

Query:  KRKKRRRDVSVSATSRDSLSCSTCGDGSTTSNESEIDRHRGRSGKRKRNRGKTERSRYRSKSHSPCSLCSEGSDYQNEVEDGSYVENNFRRLRSVIVVVG
        K KKRRRDVSV AT+ DSLS STCGDGS+TS++SEIDR RGRSGKRKRN  KTE  RYRSKS SPCSLCS+GSD+QNEVED SYV+N  RRL+S+IVVVG
Subjt:  KRKKRRRDVSVSATSRDSLSCSTCGDGSTTSNESEIDRHRGRSGKRKRNRGKTERSRYRSKSHSPCSLCSEGSDYQNEVEDGSYVENNFRRLRSVIVVVG

Query:  EENKLKTFDGNEQQEEVMHHPDDDHPSFGDMDSNDGMSKRELDRVTSNEASEVENKKEVVIPDIRNFLVVKDGGVQNEGSNNNHGGVTNDHPLNEGNNG-
        EE++LKTF GNEQQE V H  DD+HP F DM+S DG  KRELD V S EA EVE+K ++  PD RN L++ + GV+NEGSN NHGGVTNDH L+E  NG 
Subjt:  EENKLKTFDGNEQQEEVMHHPDDDHPSFGDMDSNDGMSKRELDRVTSNEASEVENKKEVVIPDIRNFLVVKDGGVQNEGSNNNHGGVTNDHPLNEGNNG-

Query:  SGNTDRINCIDLESILRQRALENLRKFKGVPPKNVETSANCQVDNSNDAKQLYSPVSNSVRIKSPRDDAEINGKGFSGQGGGNAVNPMIVEENGVESTNA
        SGNTD INCIDLESILRQ+ALENLRKFKG  P+NVE  ANC+V+N+NDAKQL+SPVS SV + SPRDDAE NGKGFS Q GG+AVN MI++ NG +ST+A
Subjt:  SGNTDRINCIDLESILRQRALENLRKFKGVPPKNVETSANCQVDNSNDAKQLYSPVSNSVRIKSPRDDAEINGKGFSGQGGGNAVNPMIVEENGVESTNA

Query:  IDSAVASTHDPIYSSQNLGKISSTSNGMNELKQDISSLDQEAVNDNICQKVDADICSTTSRSNLVYAALRPDSKVDFLVKQASAPQEYIQTKPSISDMGV
        ID+AVAS HDP+ SSQNLGKIS+ SNGMNELKQDISSLDQE +NDNIC K DA+I STT+RSNLV AA RP+SKVD L+++ASA QE IQTKPSISD+ V
Subjt:  IDSAVASTHDPIYSSQNLGKISSTSNGMNELKQDISSLDQEAVNDNICQKVDADICSTTSRSNLVYAALRPDSKVDFLVKQASAPQEYIQTKPSISDMGV

Query:  DEIAQMQIQTRNNDDQNIVNGFDSSAHKPSSSLNYFSGENSLNKPRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALARRQLKR
        DEI+Q Q Q  NNDDQNI NGF SSA+KPSSSLN  SGENSL+K R ESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPAL RRQLKR
Subjt:  DEIAQMQIQTRNNDDQNIVNGFDSSAHKPSSSLNYFSGENSLNKPRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALARRQLKR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G53930.1 unknown protein1.0e-1829.32Show/hide
Query:  RTMKRSRRKNRSSRKLKSKKLRYRHDSPSCSDTDFESSTSLSSSSSEDDEKVGRSRSNKLKNAKPGKKRAKKRSRDDQIRDLSPHPRKRKHSKRHDRCEV
        + +K S  K RS +K KSK+ + +       +++   S S   SSSEDD +  + R +KL   +  K+ +   S DD   D     +K+K SKR D    
Subjt:  RTMKRSRRKNRSSRKLKSKKLRYRHDSPSCSDTDFESSTSLSSSSSEDDEKVGRSRSNKLKNAKPGKKRAKKRSRDDQIRDLSPHPRKRKHSKRHDRCEV

Query:  KKTNK---RKKRRRDVSVSATSRDSLSCSTCGDGSTTSNESEIDRHRGRSGKRKRNRGKTERSRYRSKSHSPCSLCSEGSDYQNEVEDGSYVENNFRRLR
        KK  K   RK+R+RD+S S+TS +     +  DGS + +     R RG      R  GK + SR RS+        SE  D   + ED    E N RRL+
Subjt:  KKTNK---RKKRRRDVSVSATSRDSLSCSTCGDGSTTSNESEIDRHRGRSGKRKRNRGKTERSRYRSKSHSPCSLCSEGSDYQNEVEDGSYVENNFRRLR

Query:  SVIVVVGEENKLKTFDGNEQQEEVMHHPDDDHPSFGDMDSNDGMSKRELDRVTSNEASEVENKKEVVIPDIRNFLVVKDGGVQNEGSNNNHGGVTNDHPL
        S++VV         +   E++EE      DD     D+    G   REL    S ++ E++ +       IR               +N  G    ++  
Subjt:  SVIVVVGEENKLKTFDGNEQQEEVMHHPDDDHPSFGDMDSNDGMSKRELDRVTSNEASEVENKKEVVIPDIRNFLVVKDGGVQNEGSNNNHGGVTNDHPL

Query:  NEGNNGSGNTDRINCIDLESILRQRALENLRKFKGVPPKNVETSANCQVDNSNDAKQLYSPVSNSVRIKSPRDDAEINGKGFSGQGGGNAVNPMIVEENG
        +E +  S   + +   DLE+IL++RALENL++F+GV  K            S  AK+  S VS                             PM +E   
Subjt:  NEGNNGSGNTDRINCIDLESILRQRALENLRKFKGVPPKNVETSANCQVDNSNDAKQLYSPVSNSVRIKSPRDDAEINGKGFSGQGGGNAVNPMIVEENG

Query:  VESTN-------AIDSAVASTHDPIYSSQNLGKISSTSNGMNELKQDISSLDQEAVNDNICQKVDADICSTTSRSNLVYAALRPDSKVDFLVKQASAPQE
        VES +         DSAV+     + +S+ +  + +       L    S  DQ++  D    KV + + S T++  LV   L  DS      K+AS  Q+
Subjt:  VESTN-------AIDSAVASTHDPIYSSQNLGKISSTSNGMNELKQDISSLDQEAVNDNICQKVDADICSTTSRSNLVYAALRPDSKVDFLVKQASAPQE

Query:  YIQTKPSISDMGVDE---IAQMQIQTRNNDDQ------NIVNGFDSSAHKPSSSLNYF-SGENSLNKPRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYI
              SI    VD+    + + + T+N  +       +     +SS+H  +  ++    G  S  K   E+ + SQ+EQKTM+VMRGGEMVQV+YKVYI
Subjt:  YIQTKPSISDMGVDE---IAQMQIQTRNNDDQ------NIVNGFDSSAHKPSSSLNYF-SGENSLNKPRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYI

Query:  PKRAPALARRQLKR
        PK+A +L RR+L R
Subjt:  PKRAPALARRQLKR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCGATGCAAATTTCATTTGCATCCACTTTTGTTTTGTTTTGTTTTTTGGTTCGAACCATGAAGAGAAGTAGGAGAAAGAATAGAAGTAGTAGGAAATTGAAGTCCAA
GAAGCTGCGGTATCGCCACGATTCTCCCTCTTGCTCTGACACTGATTTTGAAAGTTCAACTTCACTGTCCTCTTCTAGCTCTGAGGATGATGAGAAAGTAGGAAGATCTC
GATCCAATAAGCTAAAGAATGCAAAGCCTGGTAAAAAGAGAGCTAAGAAGCGATCTCGTGATGATCAAATTAGGGATTTATCTCCTCACCCCAGAAAGAGGAAGCATTCG
AAGAGACACGACCGCTGCGAGGTGAAGAAGACCAACAAAAGGAAGAAGCGTAGAAGAGATGTGAGTGTTAGTGCCACGAGTAGGGACTCTTTGAGCTGCTCAACTTGTGG
AGATGGGAGTACAACCAGCAATGAGAGTGAAATTGATAGGCATAGGGGCAGGTCTGGAAAGAGAAAAAGAAATAGGGGGAAGACTGAAAGAAGTAGATACAGGTCAAAGA
GTCATTCACCATGCTCTTTATGTAGTGAAGGTAGTGATTATCAGAATGAGGTTGAGGATGGCAGTTATGTTGAGAACAACTTTAGACGATTAAGATCAGTAATTGTTGTA
GTAGGAGAGGAAAACAAATTAAAGACGTTTGATGGGAATGAACAACAAGAAGAGGTCATGCATCATCCTGATGACGACCACCCTTCTTTTGGAGATATGGACAGTAACGA
TGGGATGAGTAAAAGAGAACTAGATCGTGTTACATCTAATGAGGCATCAGAGGTAGAAAACAAAAAAGAAGTGGTTATACCTGATATTAGAAACTTTTTGGTTGTAAAGG
ATGGTGGAGTTCAAAATGAGGGAAGCAACAATAACCATGGAGGAGTAACTAATGACCATCCTTTAAATGAAGGAAATAATGGCTCTGGAAATACTGACAGGATAAATTGT
ATTGATTTGGAGTCAATTTTAAGACAGAGGGCTCTGGAAAACCTAAGAAAGTTCAAAGGGGTGCCCCCAAAGAACGTAGAAACTTCTGCTAACTGCCAAGTTGATAATAG
TAATGATGCGAAGCAATTGTACTCTCCTGTCTCTAATTCAGTTCGCATTAAGTCCCCTAGGGATGATGCTGAGATAAATGGTAAGGGATTCTCTGGACAAGGTGGGGGGA
ATGCGGTAAACCCAATGATAGTTGAGGAGAATGGTGTTGAATCTACTAATGCGATAGATTCAGCAGTTGCATCTACGCATGATCCTATCTATTCTTCACAGAACCTGGGT
AAGATTTCCAGTACAAGCAATGGTATGAATGAACTAAAGCAGGATATTTCTTCATTAGACCAGGAGGCCGTAAATGACAATATTTGCCAGAAGGTGGATGCAGATATTTG
TTCTACAACTAGCAGAAGCAATTTGGTTTATGCAGCTTTGAGACCCGACTCAAAAGTTGATTTCCTTGTAAAGCAGGCATCTGCTCCTCAAGAATATATCCAAACAAAGC
CATCTATATCTGACATGGGTGTAGATGAGATTGCTCAAATGCAGATCCAGACGAGGAATAATGATGATCAAAACATTGTTAATGGTTTCGATTCTTCAGCTCACAAGCCT
TCTTCTTCCCTTAATTATTTTTCAGGAGAAAATAGCTTGAATAAGCCCAGACATGAGAGTGGGGAAGGCTCACAGTTTGAACAGAAAACAATGTCCGTGATGAGAGGTGG
TGAAATGGTGCAGGTGAACTACAAGGTCTACATCCCGAAGAGAGCTCCTGCTTTGGCCAGGAGGCAACTCAAGCGGTGA
mRNA sequenceShow/hide mRNA sequence
GCCTCACGAAAATAGTAGCCGAGGATGAAAGAGGAAAAAGTATTTATTTGGTAGCGGAATGGTATTTGTGCAAAACTTAGCTATTGAAATGTAACACGAAATAAGTTTGG
AAAGATTTTGGCGTGAAGGACGCGGGTTGGCGAAGGAAGAACAAGAACAGGCCTCCTCTTAGCGATGAAATCCTCCGAAAACAGAGCAACCCTGCATTAAGCTCGAAGCC
ATCCAGTTCTCAAATTTCCAAACCCTAATTCCCACATTCAAACCGATTCTTGTTCTTTTGCATTCTATTTTCCCTTTTTTCTTCTTCTTTTTTTCATCTGTTTTTTCTTT
CTTGGGCGTCGTAGTTGCATTGCAGGTGTTCCAGATGGGAAAATCCCGCAAGAAGGAGCGTTCAAAGACTTCTTCATCCCAGGTACTCTTTGTCTCTCTCTCTCGCCGAA
TGTGTTCTCTACGTTGCTATGTCAATTGGATTTGTCTTTATTTCCCCCTCTTTTGTCGAAATTTCTGCCTAGTCACTGGGTAATGTTCAAAGATTACGCTATAATTTGGC
TTCTGACCTGACAATTACATGTGGGAAATTTTCTATTCGTTGATTCATTTGCATGAACTTTGAGTTTGCGTAAAAGGGTTCCCTCTTCGTCTTCACTTATTTTTTCTTGG
TAGCCAACTGGAAAATGTAATATTGGCATCGTGGTCGCATTAAAGATATGATTTTTTTTGTTTGTTTTCACTTTCTCTTTCTTTTCCCTCTTTTACCTCATATAAGTTTG
CTTTGATGCCGATGCAAATTTCATTTGCATCCACTTTTGTTTTGTTTTGTTTTTTGGTTCGAACCATGAAGAGAAGTAGGAGAAAGAATAGAAGTAGTAGGAAATTGAAG
TCCAAGAAGCTGCGGTATCGCCACGATTCTCCCTCTTGCTCTGACACTGATTTTGAAAGTTCAACTTCACTGTCCTCTTCTAGCTCTGAGGATGATGAGAAAGTAGGAAG
ATCTCGATCCAATAAGCTAAAGAATGCAAAGCCTGGTAAAAAGAGAGCTAAGAAGCGATCTCGTGATGATCAAATTAGGGATTTATCTCCTCACCCCAGAAAGAGGAAGC
ATTCGAAGAGACACGACCGCTGCGAGGTGAAGAAGACCAACAAAAGGAAGAAGCGTAGAAGAGATGTGAGTGTTAGTGCCACGAGTAGGGACTCTTTGAGCTGCTCAACT
TGTGGAGATGGGAGTACAACCAGCAATGAGAGTGAAATTGATAGGCATAGGGGCAGGTCTGGAAAGAGAAAAAGAAATAGGGGGAAGACTGAAAGAAGTAGATACAGGTC
AAAGAGTCATTCACCATGCTCTTTATGTAGTGAAGGTAGTGATTATCAGAATGAGGTTGAGGATGGCAGTTATGTTGAGAACAACTTTAGACGATTAAGATCAGTAATTG
TTGTAGTAGGAGAGGAAAACAAATTAAAGACGTTTGATGGGAATGAACAACAAGAAGAGGTCATGCATCATCCTGATGACGACCACCCTTCTTTTGGAGATATGGACAGT
AACGATGGGATGAGTAAAAGAGAACTAGATCGTGTTACATCTAATGAGGCATCAGAGGTAGAAAACAAAAAAGAAGTGGTTATACCTGATATTAGAAACTTTTTGGTTGT
AAAGGATGGTGGAGTTCAAAATGAGGGAAGCAACAATAACCATGGAGGAGTAACTAATGACCATCCTTTAAATGAAGGAAATAATGGCTCTGGAAATACTGACAGGATAA
ATTGTATTGATTTGGAGTCAATTTTAAGACAGAGGGCTCTGGAAAACCTAAGAAAGTTCAAAGGGGTGCCCCCAAAGAACGTAGAAACTTCTGCTAACTGCCAAGTTGAT
AATAGTAATGATGCGAAGCAATTGTACTCTCCTGTCTCTAATTCAGTTCGCATTAAGTCCCCTAGGGATGATGCTGAGATAAATGGTAAGGGATTCTCTGGACAAGGTGG
GGGGAATGCGGTAAACCCAATGATAGTTGAGGAGAATGGTGTTGAATCTACTAATGCGATAGATTCAGCAGTTGCATCTACGCATGATCCTATCTATTCTTCACAGAACC
TGGGTAAGATTTCCAGTACAAGCAATGGTATGAATGAACTAAAGCAGGATATTTCTTCATTAGACCAGGAGGCCGTAAATGACAATATTTGCCAGAAGGTGGATGCAGAT
ATTTGTTCTACAACTAGCAGAAGCAATTTGGTTTATGCAGCTTTGAGACCCGACTCAAAAGTTGATTTCCTTGTAAAGCAGGCATCTGCTCCTCAAGAATATATCCAAAC
AAAGCCATCTATATCTGACATGGGTGTAGATGAGATTGCTCAAATGCAGATCCAGACGAGGAATAATGATGATCAAAACATTGTTAATGGTTTCGATTCTTCAGCTCACA
AGCCTTCTTCTTCCCTTAATTATTTTTCAGGAGAAAATAGCTTGAATAAGCCCAGACATGAGAGTGGGGAAGGCTCACAGTTTGAACAGAAAACAATGTCCGTGATGAGA
GGTGGTGAAATGGTGCAGGTGAACTACAAGGTCTACATCCCGAAGAGAGCTCCTGCTTTGGCCAGGAGGCAACTCAAGCGGTGACACAAGATCCGTGATTATGCAGTCAA
TATAGGATGTTCCTGTTCATTGATCACTTTCGTTAGCAAGGAGATCCATAAGCCTTAAAACATGAACAGTAATTAGGGAAGTCAATTGTGGATATTTATGTCTGTTAAGT
TGTTAGCTGTTTTCTTTCCATAACAATGATCAAAATGATCTTTTGATCTAACGAGATCCTCGTGTTTACAAAGCCTTCTTATTTAGTTTTCCTTGTGATGCCAAAACCTG
GTATAGGTGAGTTGATGATCACAATTAGTTTCACCTTCTATTCTATCATGTCCTTGTGAATAAACTCAAACCTCTAAGTGGTGTCCTCTCTAAGTATTGGTGCTAAGCCT
TAAGATTGGTTTCGGTTCCGGCTCCTAAGTCTTTCTTCTTTCTCTATCTTTTAGAACTTGCATGGGTGCTTTTGGTTTGGTTTGATTTGTACTTAAATCACTAGCAAGTT
TCAAGATCATGGAGGCTCGGGCTGGCTATGTAGGGGAAAAGGAAGGGGGATGTTGAGGGGTGAGGAGAGAGTGAGAGAGGAAATTACTTTAACGAGATACGGATGAGATT
GTTTTATTTCATGGATTAAGTTGTTTAGATTTAAGTTGAATATATTGTGGATTTTACTTGTAACATATTATGGATTTTTTTGCAAGTTGAATATTTTATTAACCATGGAT
CTCAACGAAATCTTGTCTTTAACGATGGAGGCAGGCTCGATTCTTCAGTAACTGCATGTTTTAGTTAACCATCAGAGTTAACATTTATATTCCAA
Protein sequenceShow/hide protein sequence
MPMQISFASTFVLFCFLVRTMKRSRRKNRSSRKLKSKKLRYRHDSPSCSDTDFESSTSLSSSSSEDDEKVGRSRSNKLKNAKPGKKRAKKRSRDDQIRDLSPHPRKRKHS
KRHDRCEVKKTNKRKKRRRDVSVSATSRDSLSCSTCGDGSTTSNESEIDRHRGRSGKRKRNRGKTERSRYRSKSHSPCSLCSEGSDYQNEVEDGSYVENNFRRLRSVIVV
VGEENKLKTFDGNEQQEEVMHHPDDDHPSFGDMDSNDGMSKRELDRVTSNEASEVENKKEVVIPDIRNFLVVKDGGVQNEGSNNNHGGVTNDHPLNEGNNGSGNTDRINC
IDLESILRQRALENLRKFKGVPPKNVETSANCQVDNSNDAKQLYSPVSNSVRIKSPRDDAEINGKGFSGQGGGNAVNPMIVEENGVESTNAIDSAVASTHDPIYSSQNLG
KISSTSNGMNELKQDISSLDQEAVNDNICQKVDADICSTTSRSNLVYAALRPDSKVDFLVKQASAPQEYIQTKPSISDMGVDEIAQMQIQTRNNDDQNIVNGFDSSAHKP
SSSLNYFSGENSLNKPRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALARRQLKR