; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS021282 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS021282
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
Descriptionpre-mRNA-splicing factor CWC22 homolog isoform X1
Genome locationscaffold358:710064..711937
RNA-Seq ExpressionMS021282
SyntenyMS021282
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7024815.1 hypothetical protein SDJN02_13634, partial [Cucurbita argyrosperma subsp. argyrosperma]3.0e-22776.18Show/hide
Query:  KRSRRKNRSSRKLKSKKLRYRHDSPSCSDTDFESSTSLSSSSSEDDEKVGRSRSNKLKNAKPGRKRAKKRSRDDQIRDLSPHPRKRKHSKRHDRCEVEKT
        +RSRRK++SSRKLKSKKLRYRHDSPSCSDTDFESSTS+SSSSSEDD+KV RSRS   KN+KP +KR+KK+S D Q R+ SPHPRKRKHSKR DR E +KT
Subjt:  KRSRRKNRSSRKLKSKKLRYRHDSPSCSDTDFESSTSLSSSSSEDDEKVGRSRSNKLKNAKPGRKRAKKRSRDDQIRDLSPHPRKRKHSKRHDRCEVEKT

Query:  NKRKKRRRDVSVSATSRDSLSCSTCGDGSTTSNESEIDRHRGRSGKRKRNRGKTERSRYRSKSHSPCSLCSEGSDYQNEVEDGSYVENNFRRLRSVIVVV
        NK KKRRRDVSV AT+ DSL  STCGDGS+TS++SEIDR RGRSGKRKRN  KTE  RYRSKSHSPCSLCS+GSD+QNEVED SYVEN+ RRL+S+IVVV
Subjt:  NKRKKRRRDVSVSATSRDSLSCSTCGDGSTTSNESEIDRHRGRSGKRKRNRGKTERSRYRSKSHSPCSLCSEGSDYQNEVEDGSYVENNFRRLRSVIVVV

Query:  GEENKLKTFDGNEQQEEVMHHPDDDHPSFGDMDSNDGMSKRELDRVTSNEASEVENKKEVVIPDIRNFLVVKDGGVQNEGSNNNHGGVTNDHPLNEGNNG
        GEE++LKTF GNEQQE V H  DD+HPSFGDM+S DG SKRELD V S EA EVE+K ++V PD RN L++ D GV+NEGSN NHGGVTNDH L+E  NG
Subjt:  GEENKLKTFDGNEQQEEVMHHPDDDHPSFGDMDSNDGMSKRELDRVTSNEASEVENKKEVVIPDIRNFLVVKDGGVQNEGSNNNHGGVTNDHPLNEGNNG

Query:  -SGNTDRINCIDLESILRQRALENLRKFKGVPPKNVETSANCQVDNSNDAKQLYSPVSNSVRIKSPRDDAEINGKGFSEQGGGNAVNPMIVEENGVESTN
         SGNT+ INCIDLESILRQ+ALENLRKFKGV P+NVE  ANC+V+N+NDAKQL SPVS SV + SPRDDAEIN KGFS Q GG+AVN MIV+ NG +ST+
Subjt:  -SGNTDRINCIDLESILRQRALENLRKFKGVPPKNVETSANCQVDNSNDAKQLYSPVSNSVRIKSPRDDAEINGKGFSEQGGGNAVNPMIVEENGVESTN

Query:  AIDSAVASTHDPIYSSQNLGKISSTSNGMNELKQDISSLDQEAVNDNICQKVDADICSTTSRSNLVYAALRPDSKVDFLVKQASAPQEYIQTKPSISDMG
        AID+AVAS HDP+ SSQNLGKIS+ SNGMNELKQDISSLDQE +NDNIC K DADI STT+ SNLV AA RP+SKVD  +K+ASA QE IQTK SISD+ 
Subjt:  AIDSAVASTHDPIYSSQNLGKISSTSNGMNELKQDISSLDQEAVNDNICQKVDADICSTTSRSNLVYAALRPDSKVDFLVKQASAPQEYIQTKPSISDMG

Query:  VDEIAQMQIQTRNNDDQNIVNGFDSSAHKPSSSLNYFSGENSLNKPRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALARRQLKR
        VDE AQ Q Q  NNDDQNI NGF SSA+KPSSSLN  SGENSL+K R ESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPAL RRQLKR
Subjt:  VDEIAQMQIQTRNNDDQNIVNGFDSSAHKPSSSLNYFSGENSLNKPRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALARRQLKR

XP_022139776.1 uncharacterized protein LOC111010601 [Momordica charantia]0.0e+0099.32Show/hide
Query:  KRSRRKNRSSRKLKSKKLRYRHDSPSCSDTDFESSTSLSSSSSEDDEKVGRSRSNKLKNAKPGRKRAKKRSRDDQIRDLSPHPRKRKHSKRHDRCEVEKT
        +RSRRKNRSSRKLKSKKLRYRHDSPSCSDTDFESSTSLSSSSSEDDEKVGRSRSNKLKNAKPG+KRAKKRSRDDQIRDLSPHPRKRKHSKRHDRCEV+KT
Subjt:  KRSRRKNRSSRKLKSKKLRYRHDSPSCSDTDFESSTSLSSSSSEDDEKVGRSRSNKLKNAKPGRKRAKKRSRDDQIRDLSPHPRKRKHSKRHDRCEVEKT

Query:  NKRKKRRRDVSVSATSRDSLSCSTCGDGSTTSNESEIDRHRGRSGKRKRNRGKTERSRYRSKSHSPCSLCSEGSDYQNEVEDGSYVENNFRRLRSVIVVV
        NKRKKRRRDVSVSATSRDSLSCSTCGDGSTTSNESEIDRHRGRSGKRKRNRGKTERSRYRSKSHSPCSLCSEGSDYQNEVEDGSYVENNFRRLRSVIVVV
Subjt:  NKRKKRRRDVSVSATSRDSLSCSTCGDGSTTSNESEIDRHRGRSGKRKRNRGKTERSRYRSKSHSPCSLCSEGSDYQNEVEDGSYVENNFRRLRSVIVVV

Query:  GEENKLKTFDGNEQQEEVMHHPDDDHPSFGDMDSNDGMSKRELDRVTSNEASEVENKKEVVIPDIRNFLVVKDGGVQNEGSNNNHGGVTNDHPLNEGNNG
        GEENKLKTFDGNEQQEEVMHHPDDDHPSFGDMDSNDGMSKRELDRVTSNEASEVENKKEVVIPDIRNFLVVKDGGVQNEGSNNNHGGVTNDHPLNEGNNG
Subjt:  GEENKLKTFDGNEQQEEVMHHPDDDHPSFGDMDSNDGMSKRELDRVTSNEASEVENKKEVVIPDIRNFLVVKDGGVQNEGSNNNHGGVTNDHPLNEGNNG

Query:  SGNTDRINCIDLESILRQRALENLRKFKGVPPKNVETSANCQVDNSNDAKQLYSPVSNSVRIKSPRDDAEINGKGFSEQGGGNAVNPMIVEENGVESTNA
        SGNTDRINCIDLESILRQRALENLRKFKGVPPKNVETSANCQVDNSNDAKQLYSPVSNSVRIKSPRDDAEINGKGFS QGGGNAVNPMIVEENGVESTNA
Subjt:  SGNTDRINCIDLESILRQRALENLRKFKGVPPKNVETSANCQVDNSNDAKQLYSPVSNSVRIKSPRDDAEINGKGFSEQGGGNAVNPMIVEENGVESTNA

Query:  IDSAVASTHDPIYSSQNLGKISSTSNGMNELKQDISSLDQEAVNDNICQKVDADICSTTSRSNLVYAALRPDSKVDFLVKQASAPQEYIQTKPSISDMGV
        IDSAVASTHDPIYSSQNLGKISSTSNGMNELKQDISSLDQEAVNDNICQKVDADICSTTSRSNLVYAALRPDSKVDFLVKQASAPQEYIQTKPSISDMGV
Subjt:  IDSAVASTHDPIYSSQNLGKISSTSNGMNELKQDISSLDQEAVNDNICQKVDADICSTTSRSNLVYAALRPDSKVDFLVKQASAPQEYIQTKPSISDMGV

Query:  DEIAQMQIQTRNNDDQNIVNGFDSSAHKPSSSLNYFSGENSLNKPRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALARRQLKR
        DEIAQMQIQTRNNDDQNIVNGFDSSAHKPSSSLNYFSGENSLNKPRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALARRQLKR
Subjt:  DEIAQMQIQTRNNDDQNIVNGFDSSAHKPSSSLNYFSGENSLNKPRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALARRQLKR

XP_022976451.1 uncharacterized protein LOC111476850 isoform X1 [Cucurbita maxima]3.6e-22575.17Show/hide
Query:  KRSRRKNRSSRKLKSKKLRYRHDSPSCSDTDFESSTSLSSSSSEDDEKVGRSRSNKLKNAKPGRKRAKKRSRDDQIRDLSPHPRKRKHSKRHDRCEVEKT
        +RSRRK++SSRKLKSKKLRYRHDSPSCSDTDFESSTS+SSSSSEDD+KV RSRS   KN+KP +KR+KK+S D Q R+ SPHPRKRKHSKR+DR E +KT
Subjt:  KRSRRKNRSSRKLKSKKLRYRHDSPSCSDTDFESSTSLSSSSSEDDEKVGRSRSNKLKNAKPGRKRAKKRSRDDQIRDLSPHPRKRKHSKRHDRCEVEKT

Query:  NKRKKRRRDVSVSATSRDSLSCSTCGDGSTTSNESEIDRHRGRSGKRKRNRGKTERSRYRSKSHSPCSLCSEGSDYQNEVEDGSYVENNFRRLRSVIVVV
        NK KKRRRDVSV AT+ DSLS STCGDGS+TS++SEIDR RGRSGKRKRN  KTE  RYRSKS SPCSLCS+GSD+QNEVED SYV+N  RRL+S+IVVV
Subjt:  NKRKKRRRDVSVSATSRDSLSCSTCGDGSTTSNESEIDRHRGRSGKRKRNRGKTERSRYRSKSHSPCSLCSEGSDYQNEVEDGSYVENNFRRLRSVIVVV

Query:  GEENKLKTFDGNEQQEEVMHHPDDDHPSFGDMDSNDGMSKRELDRVTSNEASEVENKKEVVIPDIRNFLVVKDGGVQNEGSNNNHGGVTNDHPLNEGNNG
        GEE++LKTF GNEQQE V H  DD+HP F DM+S DG  KRELD V S EA EVE+K ++  PD RN L++ + GV+NEGSN NHGGVTNDH L+E  NG
Subjt:  GEENKLKTFDGNEQQEEVMHHPDDDHPSFGDMDSNDGMSKRELDRVTSNEASEVENKKEVVIPDIRNFLVVKDGGVQNEGSNNNHGGVTNDHPLNEGNNG

Query:  -SGNTDRINCIDLESILRQRALENLRKFKGVPPKNVETSANCQVDNSNDAKQLYSPVSNSVRIKSPRDDAEINGKGFSEQGGGNAVNPMIVEENGVESTN
         SGNTD INCIDLESILRQ+ALENLRKFKG  P+NVE  ANC+V+N+NDAKQL+SPVS SV + SPRDDAE NGKGFS Q GG+AVN MI++ NG +ST+
Subjt:  -SGNTDRINCIDLESILRQRALENLRKFKGVPPKNVETSANCQVDNSNDAKQLYSPVSNSVRIKSPRDDAEINGKGFSEQGGGNAVNPMIVEENGVESTN

Query:  AIDSAVASTHDPIYSSQNLGKISSTSNGMNELKQDISSLDQEAVNDNICQKVDADICSTTSRSNLVYAALRPDSKVDFLVKQASAPQEYIQTKPSISDMG
        AID+AVAS HDP+ SSQNLGKIS+ SNGMNELKQDISSLDQE +NDNIC K DA+I STT+RSNLV AA RP+SKVD L+++ASA QE IQTKPSISD+ 
Subjt:  AIDSAVASTHDPIYSSQNLGKISSTSNGMNELKQDISSLDQEAVNDNICQKVDADICSTTSRSNLVYAALRPDSKVDFLVKQASAPQEYIQTKPSISDMG

Query:  VDEIAQMQIQTRNNDDQNIVNGFDSSAHKPSSSLNYFSGENSLNKPRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALARRQLKR
        VDEI+Q Q Q  NNDDQNI NGF SSA+KPSSSLN  SGENSL+K R ESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPAL RRQLKR
Subjt:  VDEIAQMQIQTRNNDDQNIVNGFDSSAHKPSSSLNYFSGENSLNKPRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALARRQLKR

XP_023535556.1 transcriptional regulator ATRX isoform X1 [Cucurbita pepo subsp. pepo]6.0e-22875.68Show/hide
Query:  KRSRRKNRSSRKLKSKKLRYRHDSPSCSDTDFESSTSLSSSSSEDDEKVGRSRSNKLKNAKPGRKRAKKRSRDDQIRDLSPHPRKRKHSKRHDRCEVEKT
        +RSRRK++SSRKLKSKKLRYRHDSPSCSDTDFESSTS+SSSSSEDD+KV RSRS   KN+KP +KR+KK+S D Q R+  PHPRKRKHSKR DR E +KT
Subjt:  KRSRRKNRSSRKLKSKKLRYRHDSPSCSDTDFESSTSLSSSSSEDDEKVGRSRSNKLKNAKPGRKRAKKRSRDDQIRDLSPHPRKRKHSKRHDRCEVEKT

Query:  NKRKKRRRDVSVSATSRDSLSCSTCGDGSTTSNESEIDRHRGRSGKRKRNRGKTERSRYRSKSHSPCSLCSEGSDYQNEVEDGSYVENNFRRLRSVIVVV
        NK KKRRRDVSV AT+ DSLS STCGDGS+TS++SEIDR RGRSGKRKRN  KTE  RYRSKS SPCSLCS+G D+QNEVED SYVEN+ RRL+S+IVVV
Subjt:  NKRKKRRRDVSVSATSRDSLSCSTCGDGSTTSNESEIDRHRGRSGKRKRNRGKTERSRYRSKSHSPCSLCSEGSDYQNEVEDGSYVENNFRRLRSVIVVV

Query:  GEENKLKTFDGNEQQEEVMHHPDDDHPSFGDMDSNDGMSKRELDRVTSNEASEVENKKEVVIPDIRNFLVVKDGGVQNEGSNNNHGGVTNDHPLNEGNNG
        GEE++LKTF GNEQQE V H  D++HPSFGDM+S DG SKRELD V S EA EVE+K ++V PD RN L++ D GV+NEGSN NHGGVTNDH L+E  NG
Subjt:  GEENKLKTFDGNEQQEEVMHHPDDDHPSFGDMDSNDGMSKRELDRVTSNEASEVENKKEVVIPDIRNFLVVKDGGVQNEGSNNNHGGVTNDHPLNEGNNG

Query:  -SGNTDRINCIDLESILRQRALENLRKFKGVPPKNVETSANCQVDNSNDAKQLYSPVSNSVRIKSPRDDAEINGKGFSEQGGGNAVNPMIVEENGVESTN
         SGNTD INCI+LESILRQ+ALENLRKFKGV P+NVE  +NC+V+N+NDAKQL SPVS SV +  PRDDAEINGKGFS Q GG+AVN MIV+ENG +ST+
Subjt:  -SGNTDRINCIDLESILRQRALENLRKFKGVPPKNVETSANCQVDNSNDAKQLYSPVSNSVRIKSPRDDAEINGKGFSEQGGGNAVNPMIVEENGVESTN

Query:  AIDSAVASTHDPIYSSQNLGKISSTSNGMNELKQDISSLDQEAVNDNICQKVDADICSTTSRSNLVYAALRPDSKVDFLVKQASAPQEYIQTKPSISDMG
        AID+AVAS HDP+ SSQNLGKIS+ SNGMNELKQDISSLDQE +NDNIC K DADI STT+RSNLV AA RP+SKVD L+++ASA QE I+TKPSISD+ 
Subjt:  AIDSAVASTHDPIYSSQNLGKISSTSNGMNELKQDISSLDQEAVNDNICQKVDADICSTTSRSNLVYAALRPDSKVDFLVKQASAPQEYIQTKPSISDMG

Query:  VDEIAQMQIQTRNNDDQNIVNGFDSSAHKPSSSLNYFSGENSLNKPRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALARRQLKR
        VDE AQ + Q +NN+DQNI NGF SSA+KPSSSLN  SGENSL+K RHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPAL RRQLKR
Subjt:  VDEIAQMQIQTRNNDDQNIVNGFDSSAHKPSSSLNYFSGENSLNKPRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALARRQLKR

XP_038897880.1 histone-lysine N-methyltransferase SETD2 isoform X1 [Benincasa hispida]3.0e-22776.6Show/hide
Query:  KRSRRKNRSSRKLKSKKLRYRHDSPSCSDTDFESSTSLSSSSSEDDEKVGRSRSNKLKNAKPGRKRAKKRSRDDQIRDLSPHPRKRKHSKRHDRCEVEKT
        +RSRRK++SS+KLKSKKLRYRHDSPSCSDTDFESSTS+ SSSSEDD++V RSRS   KNAKP +KR+K++S D Q R+ SPHPRKRKHSKR+D CE +K 
Subjt:  KRSRRKNRSSRKLKSKKLRYRHDSPSCSDTDFESSTSLSSSSSEDDEKVGRSRSNKLKNAKPGRKRAKKRSRDDQIRDLSPHPRKRKHSKRHDRCEVEKT

Query:  NKRKKRRRDVSVSATSRDSLSCSTCGDGSTTSNESEIDRHRGRSGKRKRNRGKTERSRYRSKSHSPCSLCSEGSDYQNEVEDGSYVENNFRRLRSVIVVV
         K KKRRRD SV A S DS SCSTCG+GSTTSNESE+ R RGRSGKRK N GKTER RYRSKS SPCSL S+ SDYQNEV+D SYV NNFRRLRS+IV+ 
Subjt:  NKRKKRRRDVSVSATSRDSLSCSTCGDGSTTSNESEIDRHRGRSGKRKRNRGKTERSRYRSKSHSPCSLCSEGSDYQNEVEDGSYVENNFRRLRSVIVVV

Query:  GEENKLKTFDGNEQQEEVMHHPD--DDHPSFGDMDSNDGMSKRELDRVTSNEASEVENKKEVVIPDIRNFLVVKDGGVQNEGSNNNHGGVTNDHPLNEGN
        GEENKLKTF GNEQQE   H P+  DDHPS GDMDS D  SKRELD V S E   VE KKEV +P+ RN +VVKD GVQNEGSN N GGVTNDH L+E  
Subjt:  GEENKLKTFDGNEQQEEVMHHPD--DDHPSFGDMDSNDGMSKRELDRVTSNEASEVENKKEVVIPDIRNFLVVKDGGVQNEGSNNNHGGVTNDHPLNEGN

Query:  NG-SGNTDRINCIDLESILRQRALENLRKFKGVPPKNVETSANCQVDNSNDAKQLYSPVSNSVRIKSPRDDAEINGKGFSEQGGGNAVNPMIVEENGVES
        NG SG TD +N IDLESILRQRALENLRKFKG PP+NVET ANC+VD++NDAKQL SPVS SV + SPRDDAEIN KGFS QGGGNAVN MIV+ENGV+S
Subjt:  NG-SGNTDRINCIDLESILRQRALENLRKFKGVPPKNVETSANCQVDNSNDAKQLYSPVSNSVRIKSPRDDAEINGKGFSEQGGGNAVNPMIVEENGVES

Query:  TNAIDSAVASTHDPIYSSQNLGKISSTSNGMNELKQDISSLDQEAVNDNICQKVDADICSTTSRSNLVYAALRPDSKVDFLVKQASAPQEYIQTKPSISD
        T+AIDS+V S HDP+YSSQNLGKIS+ SNGMNELKQ+ISSLDQE +NDNICQK DADICSTT+RSNLV AALRP+SKVD L+KQA A QE IQTKPSISD
Subjt:  TNAIDSAVASTHDPIYSSQNLGKISSTSNGMNELKQDISSLDQEAVNDNICQKVDADICSTTSRSNLVYAALRPDSKVDFLVKQASAPQEYIQTKPSISD

Query:  MGVDEIAQMQIQTRNNDDQNIVNGFDSSAHKPSSSLNYFSGENSLNKPRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALARRQLKR
        +GVDE AQ Q Q RNNDDQNI NG DSSAHKP SSLN  SGENSL+  RHESG+ SQFEQKTMSVMRGGEMVQVNYKVYIPKRAPAL RRQLKR
Subjt:  MGVDEIAQMQIQTRNNDDQNIVNGFDSSAHKPSSSLNYFSGENSLNKPRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALARRQLKR

TrEMBL top hitse value%identityAlignment
A0A0A0L248 Uncharacterized protein9.7e-21674.32Show/hide
Query:  KRSRRKNRSSRKLKSKKLRYRHDSPSCSDTDFESSTSLSSSSSEDDEKVGRSRSNKLKNAKPGRKRAKKRSRDDQIRDLSPHPRKRKHSKRHDRCEVEKT
        +RSRRK++SSRKLKSKKLRYRHDSPSCSDTDFESSTS+ SSSSE  ++V RSRS   KNAKP +KR+KK+S D Q R+ SP+PRKRKHSKR+DR EV K 
Subjt:  KRSRRKNRSSRKLKSKKLRYRHDSPSCSDTDFESSTSLSSSSSEDDEKVGRSRSNKLKNAKPGRKRAKKRSRDDQIRDLSPHPRKRKHSKRHDRCEVEKT

Query:  NKRKKRRRDVSVSATSRDSLSCSTCGDGSTTSNESEIDRHRGRSGKRKRNRGKTERSRYRSKSHSPCSLCSEGSDYQNEVEDGSYVENNFRRLRSVIVVV
        NK KKRRRDVSV  +  +SLSCSTCG+GSTTSNESE+ R RGRSGKRK N  KTE  RY SKSHSPCSL SEGSDYQNEV+D SYVENNFRRLRS+IVVV
Subjt:  NKRKKRRRDVSVSATSRDSLSCSTCGDGSTTSNESEIDRHRGRSGKRKRNRGKTERSRYRSKSHSPCSLCSEGSDYQNEVEDGSYVENNFRRLRSVIVVV

Query:  GEENKLKTFDGNEQQEEVMHHPDDDHPSFGDMDSNDGMSKRELDRVTSNEASEVENKKEVVIPDIRNFLVVKDGGVQNEGSNNNHGGVTNDHPLNEGNNG
        GEENKL  + GNE QE V + P DDHPSFGDMDS D  SKRELD V + EA  VEN+KEV +P+ RN +VV+D GVQNEGSN NHGGVTND   +E  NG
Subjt:  GEENKLKTFDGNEQQEEVMHHPDDDHPSFGDMDSNDGMSKRELDRVTSNEASEVENKKEVVIPDIRNFLVVKDGGVQNEGSNNNHGGVTNDHPLNEGNNG

Query:  -SGNTDRINCIDLESILRQRALENLRKFKGVPPKNVETSANCQVDNSNDAKQLYSPVSNSVRIKSPRDDAEINGKGFSEQGGGNAVNPMIVEENGVESTN
         S NTD INCIDLES+LRQRALENLRKFKG PP+NVET ANC+V ++N AKQL SP+S SV + SPR+DAEIN + FS QGGGNAVN MIV+ENGV S +
Subjt:  -SGNTDRINCIDLESILRQRALENLRKFKGVPPKNVETSANCQVDNSNDAKQLYSPVSNSVRIKSPRDDAEINGKGFSEQGGGNAVNPMIVEENGVESTN

Query:  AIDSAVASTHDPIYSSQNLGKISSTSNGMNELKQDISSLDQEAVNDNICQKVDADICSTTSRSNLVYAALRPDSKVDFLVKQASAPQEYIQTKPSISDMG
        AIDSAVA+ HDP+YSSQNLGKIS+ SNGMNE KQDISSLDQE +NDNICQK +ADICSTT+RSNLV AALRP  KVD L+KQ SA QE +QTKPSISD+ 
Subjt:  AIDSAVASTHDPIYSSQNLGKISSTSNGMNELKQDISSLDQEAVNDNICQKVDADICSTTSRSNLVYAALRPDSKVDFLVKQASAPQEYIQTKPSISDMG

Query:  VDEIAQMQIQTRNNDDQNIVNGFDSSAHKPSSSLNYFSGENSLNKPRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALARRQLKR
        V E AQ Q Q RNN+D NI NG  SSAHKP SSLN  SGENSL+   HESGE SQFEQKTMSVMRGGEMVQVNYKVYIPKRAPAL RRQLKR
Subjt:  VDEIAQMQIQTRNNDDQNIVNGFDSSAHKPSSSLNYFSGENSLNKPRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALARRQLKR

A0A5D3DWF0 NK-tumor recognition protein9.7e-21674.03Show/hide
Query:  MKRSRRKNRSSRKLKSKKLRYRHDSPSCSDTDFESSTSLSSSSSEDDEKVGRSRSNKLKNAKPGRKRAKKRSRDDQIRDLSPHPRKRKHSKRHDRCEVEK
        MKRSRRK++SSRKLKSKK RYRHDSPSCSDTDFESSTS+ SSSSEDD+KV RSRS   KN KP +KR KK+S D Q R+ SP+PRKRKHSKR+DR EV K
Subjt:  MKRSRRKNRSSRKLKSKKLRYRHDSPSCSDTDFESSTSLSSSSSEDDEKVGRSRSNKLKNAKPGRKRAKKRSRDDQIRDLSPHPRKRKHSKRHDRCEVEK

Query:  TNKRKKRRRDVSVSATSRDSLSCSTCGDGSTTSNESEIDRHRGRSGKRKRNRGKTERSRYRSKSHSPCSLCSEGSDYQNEVEDGSYVENNFRRLRSVIVV
         NK KKRRRDVSV  +  DSLSCSTCG+G+TTSNESE+ R RGR  KRK N  KT   RY SKS SPCSL SEGSDYQNEV+D SYVE NFRRLRS+IVV
Subjt:  TNKRKKRRRDVSVSATSRDSLSCSTCGDGSTTSNESEIDRHRGRSGKRKRNRGKTERSRYRSKSHSPCSLCSEGSDYQNEVEDGSYVENNFRRLRSVIVV

Query:  VGEENKLKTFDGNEQQEEVMHHPDDDHPSFGDMDSNDGMSKRELDRVTSNEASEVENKKEVVIPDIRNFLVVKDGGVQNEGSNNNHGGVTNDHPLNEGNN
        VGEENKLKTF GNEQQ EV +   DDHPS G+MDS D   KR LD V + EA  VEN+KEV +P+ RN +VVKDGGVQNEGSN NHGGVTNDH  +E  N
Subjt:  VGEENKLKTFDGNEQQEEVMHHPDDDHPSFGDMDSNDGMSKRELDRVTSNEASEVENKKEVVIPDIRNFLVVKDGGVQNEGSNNNHGGVTNDHPLNEGNN

Query:  G-SGNTDRINCIDLESILRQRALENLRKFKGVPPKNVETSANCQVDNSNDAKQLYSPVSNSVRIKSPRDDAEINGKGFSEQGGGNAVNPMIVEENGVEST
        G S NTD INCIDLES+LRQRALENLRKFKG  P+NVET ANC+VD++N AKQL SPVS+SV + SPR++AEIN K FS QGGGNA+N MI++ENGV+S 
Subjt:  G-SGNTDRINCIDLESILRQRALENLRKFKGVPPKNVETSANCQVDNSNDAKQLYSPVSNSVRIKSPRDDAEINGKGFSEQGGGNAVNPMIVEENGVEST

Query:  NAIDSAVASTHDPIYSSQNLGKISSTSNGMNELKQDISSLDQEAVNDNICQKVDADICSTTSRSNLVYAALRPDSKVDFLVKQASAPQEYIQTKPSISDM
        +AIDSAVA+ HDP+YSSQNLGKIS+ SNGMNE KQDISSLDQE +NDNICQK DADICSTT+RSNLV AALRP+ KVD L+KQ SA QE +QTKPSISD+
Subjt:  NAIDSAVASTHDPIYSSQNLGKISSTSNGMNELKQDISSLDQEAVNDNICQKVDADICSTTSRSNLVYAALRPDSKVDFLVKQASAPQEYIQTKPSISDM

Query:  GVDEIAQMQIQTRNNDDQNIVNGFDSSAHKPSSSLNYFSGENSLNKPRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALARRQLKR
        GV E AQ+Q Q RNNDD NI NG  SSA++P SSLN  SGE+SLN    ESGE SQFEQKTMSVMRGGEMVQVNYKVYIPKRAPAL RRQLKR
Subjt:  GVDEIAQMQIQTRNNDDQNIVNGFDSSAHKPSSSLNYFSGENSLNKPRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALARRQLKR

A0A6J1CDR0 uncharacterized protein LOC1110106010.0e+0099.32Show/hide
Query:  KRSRRKNRSSRKLKSKKLRYRHDSPSCSDTDFESSTSLSSSSSEDDEKVGRSRSNKLKNAKPGRKRAKKRSRDDQIRDLSPHPRKRKHSKRHDRCEVEKT
        +RSRRKNRSSRKLKSKKLRYRHDSPSCSDTDFESSTSLSSSSSEDDEKVGRSRSNKLKNAKPG+KRAKKRSRDDQIRDLSPHPRKRKHSKRHDRCEV+KT
Subjt:  KRSRRKNRSSRKLKSKKLRYRHDSPSCSDTDFESSTSLSSSSSEDDEKVGRSRSNKLKNAKPGRKRAKKRSRDDQIRDLSPHPRKRKHSKRHDRCEVEKT

Query:  NKRKKRRRDVSVSATSRDSLSCSTCGDGSTTSNESEIDRHRGRSGKRKRNRGKTERSRYRSKSHSPCSLCSEGSDYQNEVEDGSYVENNFRRLRSVIVVV
        NKRKKRRRDVSVSATSRDSLSCSTCGDGSTTSNESEIDRHRGRSGKRKRNRGKTERSRYRSKSHSPCSLCSEGSDYQNEVEDGSYVENNFRRLRSVIVVV
Subjt:  NKRKKRRRDVSVSATSRDSLSCSTCGDGSTTSNESEIDRHRGRSGKRKRNRGKTERSRYRSKSHSPCSLCSEGSDYQNEVEDGSYVENNFRRLRSVIVVV

Query:  GEENKLKTFDGNEQQEEVMHHPDDDHPSFGDMDSNDGMSKRELDRVTSNEASEVENKKEVVIPDIRNFLVVKDGGVQNEGSNNNHGGVTNDHPLNEGNNG
        GEENKLKTFDGNEQQEEVMHHPDDDHPSFGDMDSNDGMSKRELDRVTSNEASEVENKKEVVIPDIRNFLVVKDGGVQNEGSNNNHGGVTNDHPLNEGNNG
Subjt:  GEENKLKTFDGNEQQEEVMHHPDDDHPSFGDMDSNDGMSKRELDRVTSNEASEVENKKEVVIPDIRNFLVVKDGGVQNEGSNNNHGGVTNDHPLNEGNNG

Query:  SGNTDRINCIDLESILRQRALENLRKFKGVPPKNVETSANCQVDNSNDAKQLYSPVSNSVRIKSPRDDAEINGKGFSEQGGGNAVNPMIVEENGVESTNA
        SGNTDRINCIDLESILRQRALENLRKFKGVPPKNVETSANCQVDNSNDAKQLYSPVSNSVRIKSPRDDAEINGKGFS QGGGNAVNPMIVEENGVESTNA
Subjt:  SGNTDRINCIDLESILRQRALENLRKFKGVPPKNVETSANCQVDNSNDAKQLYSPVSNSVRIKSPRDDAEINGKGFSEQGGGNAVNPMIVEENGVESTNA

Query:  IDSAVASTHDPIYSSQNLGKISSTSNGMNELKQDISSLDQEAVNDNICQKVDADICSTTSRSNLVYAALRPDSKVDFLVKQASAPQEYIQTKPSISDMGV
        IDSAVASTHDPIYSSQNLGKISSTSNGMNELKQDISSLDQEAVNDNICQKVDADICSTTSRSNLVYAALRPDSKVDFLVKQASAPQEYIQTKPSISDMGV
Subjt:  IDSAVASTHDPIYSSQNLGKISSTSNGMNELKQDISSLDQEAVNDNICQKVDADICSTTSRSNLVYAALRPDSKVDFLVKQASAPQEYIQTKPSISDMGV

Query:  DEIAQMQIQTRNNDDQNIVNGFDSSAHKPSSSLNYFSGENSLNKPRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALARRQLKR
        DEIAQMQIQTRNNDDQNIVNGFDSSAHKPSSSLNYFSGENSLNKPRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALARRQLKR
Subjt:  DEIAQMQIQTRNNDDQNIVNGFDSSAHKPSSSLNYFSGENSLNKPRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALARRQLKR

A0A6J1F6D1 uncharacterized protein LOC111442542 isoform X14.8e-22375Show/hide
Query:  KRSRRKNRSSRKLKSKKLRYRHDSPSCSDTDFESSTSLSSSSSEDDEKVGRSRSNKLKNAKPGRKRAKKRSRDDQIRDLSPHPRKRKHSKRHDRCEVEKT
        +RSRRK++SSR+LKSKKLRYRHDSPSCSDTDFESSTS+SSSSSEDD+KV RSRS   KN+KP +KR+KK+S D Q R+ SPHPRKRKHSKR DR E +KT
Subjt:  KRSRRKNRSSRKLKSKKLRYRHDSPSCSDTDFESSTSLSSSSSEDDEKVGRSRSNKLKNAKPGRKRAKKRSRDDQIRDLSPHPRKRKHSKRHDRCEVEKT

Query:  NKRKKRRRDVSVSATSRDSLSCSTCGDGSTTSNESEIDRHRGRSGKRKRNRGKTERSRYRSKSHSPCSLCSEGSDYQNEVEDGSYVENNFRRLRSVIVVV
        NK KKRRRD SV AT+ DSL  STCGDGS+TS++SEIDR RGRSGKRKRN  KTE  RYRSKS SPCSLCS+GSD+QNEVED SYVEN+ RRL+S+IVVV
Subjt:  NKRKKRRRDVSVSATSRDSLSCSTCGDGSTTSNESEIDRHRGRSGKRKRNRGKTERSRYRSKSHSPCSLCSEGSDYQNEVEDGSYVENNFRRLRSVIVVV

Query:  GEENKLKTFDGNEQQEEVMHHPDDDHPSFGDMDSNDGMSKRELDRVTSNEASEVENKKEVVIPDIRNFLVVKDGGVQNEGSNNNHGGVTNDHPLNEGNNG
        GEE++LKTF GNEQQE V H  DD+HPSFGDM+S DG SKRELD V S EA EVE+K ++V PD RN L++ D GV+NEGSN NHGGVTNDH L+E  NG
Subjt:  GEENKLKTFDGNEQQEEVMHHPDDDHPSFGDMDSNDGMSKRELDRVTSNEASEVENKKEVVIPDIRNFLVVKDGGVQNEGSNNNHGGVTNDHPLNEGNNG

Query:  -SGNTDRINCIDLESILRQRALENLRKFKGVPPKNVETSANCQVDNSNDAKQLYSPVSNSVRIKSPRDDAEINGKGFSEQGGGNAVNPMIVEENGVESTN
         SGNT+ INCIDLESILRQ+ALENLRKFKGV P+NVE  ANC+V+N+NDAKQL SPVS SV +  PRDDAEIN KGFS Q GG+AVN MIV+ NG +ST+
Subjt:  -SGNTDRINCIDLESILRQRALENLRKFKGVPPKNVETSANCQVDNSNDAKQLYSPVSNSVRIKSPRDDAEINGKGFSEQGGGNAVNPMIVEENGVESTN

Query:  AIDSAVASTHDPIYSSQNLGKISSTSNGMNELKQDISSLDQEAVNDNICQKVDADICSTTSRSNLVYAALRPDSKVDFLVKQASAPQEYIQTKPSISDMG
        AID+AVAS HDP+ SSQNLGKIS+ SNGMNE KQDISSLDQE +NDNIC K DADI STT+RSNLV AA RP+SKVD  +K+AS  QE IQTK SISD+ 
Subjt:  AIDSAVASTHDPIYSSQNLGKISSTSNGMNELKQDISSLDQEAVNDNICQKVDADICSTTSRSNLVYAALRPDSKVDFLVKQASAPQEYIQTKPSISDMG

Query:  VDEIAQMQIQTRNNDDQNIVNGFDSSAHKPSSSLNYFSGENSLNKPRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALARRQLKR
        VDE AQ Q Q  NNDDQNI NGF SSA+K SSSLN  SGEN L+K R ESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPAL RRQLKR
Subjt:  VDEIAQMQIQTRNNDDQNIVNGFDSSAHKPSSSLNYFSGENSLNKPRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALARRQLKR

A0A6J1IGY0 uncharacterized protein LOC111476850 isoform X11.8e-22575.17Show/hide
Query:  KRSRRKNRSSRKLKSKKLRYRHDSPSCSDTDFESSTSLSSSSSEDDEKVGRSRSNKLKNAKPGRKRAKKRSRDDQIRDLSPHPRKRKHSKRHDRCEVEKT
        +RSRRK++SSRKLKSKKLRYRHDSPSCSDTDFESSTS+SSSSSEDD+KV RSRS   KN+KP +KR+KK+S D Q R+ SPHPRKRKHSKR+DR E +KT
Subjt:  KRSRRKNRSSRKLKSKKLRYRHDSPSCSDTDFESSTSLSSSSSEDDEKVGRSRSNKLKNAKPGRKRAKKRSRDDQIRDLSPHPRKRKHSKRHDRCEVEKT

Query:  NKRKKRRRDVSVSATSRDSLSCSTCGDGSTTSNESEIDRHRGRSGKRKRNRGKTERSRYRSKSHSPCSLCSEGSDYQNEVEDGSYVENNFRRLRSVIVVV
        NK KKRRRDVSV AT+ DSLS STCGDGS+TS++SEIDR RGRSGKRKRN  KTE  RYRSKS SPCSLCS+GSD+QNEVED SYV+N  RRL+S+IVVV
Subjt:  NKRKKRRRDVSVSATSRDSLSCSTCGDGSTTSNESEIDRHRGRSGKRKRNRGKTERSRYRSKSHSPCSLCSEGSDYQNEVEDGSYVENNFRRLRSVIVVV

Query:  GEENKLKTFDGNEQQEEVMHHPDDDHPSFGDMDSNDGMSKRELDRVTSNEASEVENKKEVVIPDIRNFLVVKDGGVQNEGSNNNHGGVTNDHPLNEGNNG
        GEE++LKTF GNEQQE V H  DD+HP F DM+S DG  KRELD V S EA EVE+K ++  PD RN L++ + GV+NEGSN NHGGVTNDH L+E  NG
Subjt:  GEENKLKTFDGNEQQEEVMHHPDDDHPSFGDMDSNDGMSKRELDRVTSNEASEVENKKEVVIPDIRNFLVVKDGGVQNEGSNNNHGGVTNDHPLNEGNNG

Query:  -SGNTDRINCIDLESILRQRALENLRKFKGVPPKNVETSANCQVDNSNDAKQLYSPVSNSVRIKSPRDDAEINGKGFSEQGGGNAVNPMIVEENGVESTN
         SGNTD INCIDLESILRQ+ALENLRKFKG  P+NVE  ANC+V+N+NDAKQL+SPVS SV + SPRDDAE NGKGFS Q GG+AVN MI++ NG +ST+
Subjt:  -SGNTDRINCIDLESILRQRALENLRKFKGVPPKNVETSANCQVDNSNDAKQLYSPVSNSVRIKSPRDDAEINGKGFSEQGGGNAVNPMIVEENGVESTN

Query:  AIDSAVASTHDPIYSSQNLGKISSTSNGMNELKQDISSLDQEAVNDNICQKVDADICSTTSRSNLVYAALRPDSKVDFLVKQASAPQEYIQTKPSISDMG
        AID+AVAS HDP+ SSQNLGKIS+ SNGMNELKQDISSLDQE +NDNIC K DA+I STT+RSNLV AA RP+SKVD L+++ASA QE IQTKPSISD+ 
Subjt:  AIDSAVASTHDPIYSSQNLGKISSTSNGMNELKQDISSLDQEAVNDNICQKVDADICSTTSRSNLVYAALRPDSKVDFLVKQASAPQEYIQTKPSISDMG

Query:  VDEIAQMQIQTRNNDDQNIVNGFDSSAHKPSSSLNYFSGENSLNKPRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALARRQLKR
        VDEI+Q Q Q  NNDDQNI NGF SSA+KPSSSLN  SGENSL+K R ESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPAL RRQLKR
Subjt:  VDEIAQMQIQTRNNDDQNIVNGFDSSAHKPSSSLNYFSGENSLNKPRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALARRQLKR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G53930.1 unknown protein4.1e-1728.93Show/hide
Query:  MKRSRRKNRSSRKLKSKKLRYRHDSPSCSDTDFESSTSLSSSSSEDDEKVGRSRSNKLKNAKPGRKRAKKRSRDDQIRDLSPHPRKRKHSKRHDRCEVEK
        +K S  K RS +K KSK+ + +       +++   S S   SSSEDD +  + R +KL   +  ++ +   S DD   D     +K+K SKR D    +K
Subjt:  MKRSRRKNRSSRKLKSKKLRYRHDSPSCSDTDFESSTSLSSSSSEDDEKVGRSRSNKLKNAKPGRKRAKKRSRDDQIRDLSPHPRKRKHSKRHDRCEVEK

Query:  TNK---RKKRRRDVSVSATSRDSLSCSTCGDGSTTSNESEIDRHRGRSGKRKRNRGKTERSRYRSKSHSPCSLCSEGSDYQNEVEDGSYVENNFRRLRSV
          K   RK+R+RD+S S+TS +     +  DGS + +     R RG      R  GK + SR RS+        SE  D   + ED    E N RRL+S+
Subjt:  TNK---RKKRRRDVSVSATSRDSLSCSTCGDGSTTSNESEIDRHRGRSGKRKRNRGKTERSRYRSKSHSPCSLCSEGSDYQNEVEDGSYVENNFRRLRSV

Query:  IVVVGEENKLKTFDGNEQQEEVMHHPDDDHPSFGDMDSNDGMSKRELDRVTSNEASEVENKKEVVIPDIRNFLVVKDGGVQNEGSNNNHGGVTNDHPLNE
        +VV         +   E++EE      DD     D+    G   REL    S ++ E++ +       IR               +N  G    ++  +E
Subjt:  IVVVGEENKLKTFDGNEQQEEVMHHPDDDHPSFGDMDSNDGMSKRELDRVTSNEASEVENKKEVVIPDIRNFLVVKDGGVQNEGSNNNHGGVTNDHPLNE

Query:  GNNGSGNTDRINCIDLESILRQRALENLRKFKGVPPKNVETSANCQVDNSNDAKQLYSPVSNSVRIKSPRDDAEINGKGFSEQGGGNAVNPMIVEENGVE
         +  S   + +   DLE+IL++RALENL++F+GV  K            S  AK+  S VS    ++   +  E       EQ                 
Subjt:  GNNGSGNTDRINCIDLESILRQRALENLRKFKGVPPKNVETSANCQVDNSNDAKQLYSPVSNSVRIKSPRDDAEINGKGFSEQGGGNAVNPMIVEENGVE

Query:  STNAIDSAVASTHDPIYSSQNLGKISSTSNGMNELKQDISSLDQEAVNDNICQKVDADICSTTSRSNLVYAALRPDSKVDFLVKQASAPQEYIQTKPSIS
             DSAV+     + +S+ +  + +       L    S  DQ++  D    KV + + S T++  LV   L  DS      K+AS  Q+      SI 
Subjt:  STNAIDSAVASTHDPIYSSQNLGKISSTSNGMNELKQDISSLDQEAVNDNICQKVDADICSTTSRSNLVYAALRPDSKVDFLVKQASAPQEYIQTKPSIS

Query:  DMGVDE---IAQMQIQTRNNDDQ------NIVNGFDSSAHKPSSSLNYF-SGENSLNKPRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALAR
           VD+    + + + T+N  +       +     +SS+H  +  ++    G  S  K   E+ + SQ+EQKTM+VMRGGEMVQV+YKVYIPK+A +L R
Subjt:  DMGVDE---IAQMQIQTRNNDDQ------NIVNGFDSSAHKPSSSLNYF-SGENSLNKPRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALAR

Query:  RQLKR
        R+L R
Subjt:  RQLKR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGAGAAGTAGGAGAAAGAATAGAAGTAGTAGGAAATTGAAGTCCAAGAAGCTGCGGTATCGCCACGATTCTCCCTCTTGCTCTGACACTGATTTTGAAAGTTCAAC
TTCACTGTCCTCTTCTAGCTCTGAGGATGATGAGAAAGTAGGAAGATCTCGATCCAATAAGCTAAAGAATGCAAAGCCTGGTAGAAAGAGAGCTAAGAAGCGATCTCGTG
ATGATCAAATTAGGGATTTATCTCCTCACCCCAGAAAGAGGAAGCATTCGAAGAGACACGACCGTTGCGAGGTGGAGAAGACCAACAAAAGGAAGAAGCGTAGAAGAGAT
GTGAGTGTTAGTGCCACGAGTAGGGACTCTTTGAGCTGCTCAACTTGTGGAGATGGGAGTACAACCAGCAATGAGAGTGAAATTGATAGGCATAGGGGCAGGTCTGGAAA
GAGAAAAAGAAATAGGGGGAAGACTGAAAGAAGTAGATACAGGTCAAAGAGTCATTCACCATGCTCTTTATGTAGTGAAGGTAGTGATTATCAGAATGAGGTTGAGGATG
GCAGTTATGTTGAGAACAACTTTAGACGATTAAGATCAGTAATTGTTGTAGTAGGAGAGGAAAACAAATTAAAGACGTTTGATGGGAATGAACAACAAGAAGAGGTCATG
CATCATCCTGATGACGACCACCCTTCTTTTGGAGATATGGACAGTAACGATGGGATGAGTAAAAGAGAACTAGATCGTGTTACATCTAATGAGGCATCAGAGGTAGAAAA
CAAAAAAGAAGTGGTTATACCTGATATTAGAAACTTTTTGGTTGTAAAGGATGGTGGAGTTCAAAATGAGGGAAGCAACAATAACCATGGAGGAGTAACTAATGACCATC
CTTTAAATGAAGGAAATAATGGCTCTGGAAATACTGACAGGATAAATTGTATTGATTTGGAGTCAATTTTAAGACAGAGGGCTCTGGAAAACCTAAGAAAGTTCAAAGGG
GTGCCCCCAAAGAACGTAGAAACTTCTGCTAACTGCCAAGTTGATAATAGTAATGATGCGAAGCAATTGTACTCTCCTGTCTCTAATTCAGTTCGCATTAAGTCCCCTAG
GGATGATGCTGAGATAAATGGTAAGGGATTCTCTGAACAAGGTGGGGGGAATGCAGTAAACCCAATGATAGTTGAGGAGAATGGTGTTGAATCTACTAATGCGATAGATT
CAGCAGTTGCATCTACGCATGATCCTATCTATTCTTCACAGAACCTGGGTAAGATTTCCAGTACAAGCAATGGTATGAATGAACTAAAGCAGGATATTTCTTCATTAGAC
CAGGAGGCCGTAAATGACAATATTTGCCAGAAGGTGGATGCAGATATTTGTTCTACAACTAGCAGAAGCAATTTGGTTTATGCAGCTTTGAGACCCGACTCAAAAGTTGA
TTTCCTTGTAAAGCAGGCATCTGCTCCTCAAGAATATATCCAAACAAAGCCGTCTATATCTGACATGGGTGTAGATGAGATTGCTCAAATGCAGATCCAGACGAGGAATA
ATGATGATCAAAACATTGTTAATGGTTTCGATTCTTCAGCTCACAAGCCTTCTTCTTCCCTTAATTATTTTTCAGGAGAAAATAGCTTGAATAAGCCCAGACATGAGAGT
GGGGAAGGCTCACAGTTTGAACAGAAAACAATGTCCGTGATGAGAGGTGGTGAAATGGTGCAGGTGAACTACAAGGTCTACATCCCGAAGAGAGCTCCTGCTTTGGCCAG
GAGGCAACTCAAGCGG
mRNA sequenceShow/hide mRNA sequence
ATGAAGAGAAGTAGGAGAAAGAATAGAAGTAGTAGGAAATTGAAGTCCAAGAAGCTGCGGTATCGCCACGATTCTCCCTCTTGCTCTGACACTGATTTTGAAAGTTCAAC
TTCACTGTCCTCTTCTAGCTCTGAGGATGATGAGAAAGTAGGAAGATCTCGATCCAATAAGCTAAAGAATGCAAAGCCTGGTAGAAAGAGAGCTAAGAAGCGATCTCGTG
ATGATCAAATTAGGGATTTATCTCCTCACCCCAGAAAGAGGAAGCATTCGAAGAGACACGACCGTTGCGAGGTGGAGAAGACCAACAAAAGGAAGAAGCGTAGAAGAGAT
GTGAGTGTTAGTGCCACGAGTAGGGACTCTTTGAGCTGCTCAACTTGTGGAGATGGGAGTACAACCAGCAATGAGAGTGAAATTGATAGGCATAGGGGCAGGTCTGGAAA
GAGAAAAAGAAATAGGGGGAAGACTGAAAGAAGTAGATACAGGTCAAAGAGTCATTCACCATGCTCTTTATGTAGTGAAGGTAGTGATTATCAGAATGAGGTTGAGGATG
GCAGTTATGTTGAGAACAACTTTAGACGATTAAGATCAGTAATTGTTGTAGTAGGAGAGGAAAACAAATTAAAGACGTTTGATGGGAATGAACAACAAGAAGAGGTCATG
CATCATCCTGATGACGACCACCCTTCTTTTGGAGATATGGACAGTAACGATGGGATGAGTAAAAGAGAACTAGATCGTGTTACATCTAATGAGGCATCAGAGGTAGAAAA
CAAAAAAGAAGTGGTTATACCTGATATTAGAAACTTTTTGGTTGTAAAGGATGGTGGAGTTCAAAATGAGGGAAGCAACAATAACCATGGAGGAGTAACTAATGACCATC
CTTTAAATGAAGGAAATAATGGCTCTGGAAATACTGACAGGATAAATTGTATTGATTTGGAGTCAATTTTAAGACAGAGGGCTCTGGAAAACCTAAGAAAGTTCAAAGGG
GTGCCCCCAAAGAACGTAGAAACTTCTGCTAACTGCCAAGTTGATAATAGTAATGATGCGAAGCAATTGTACTCTCCTGTCTCTAATTCAGTTCGCATTAAGTCCCCTAG
GGATGATGCTGAGATAAATGGTAAGGGATTCTCTGAACAAGGTGGGGGGAATGCAGTAAACCCAATGATAGTTGAGGAGAATGGTGTTGAATCTACTAATGCGATAGATT
CAGCAGTTGCATCTACGCATGATCCTATCTATTCTTCACAGAACCTGGGTAAGATTTCCAGTACAAGCAATGGTATGAATGAACTAAAGCAGGATATTTCTTCATTAGAC
CAGGAGGCCGTAAATGACAATATTTGCCAGAAGGTGGATGCAGATATTTGTTCTACAACTAGCAGAAGCAATTTGGTTTATGCAGCTTTGAGACCCGACTCAAAAGTTGA
TTTCCTTGTAAAGCAGGCATCTGCTCCTCAAGAATATATCCAAACAAAGCCGTCTATATCTGACATGGGTGTAGATGAGATTGCTCAAATGCAGATCCAGACGAGGAATA
ATGATGATCAAAACATTGTTAATGGTTTCGATTCTTCAGCTCACAAGCCTTCTTCTTCCCTTAATTATTTTTCAGGAGAAAATAGCTTGAATAAGCCCAGACATGAGAGT
GGGGAAGGCTCACAGTTTGAACAGAAAACAATGTCCGTGATGAGAGGTGGTGAAATGGTGCAGGTGAACTACAAGGTCTACATCCCGAAGAGAGCTCCTGCTTTGGCCAG
GAGGCAACTCAAGCGG
Protein sequenceShow/hide protein sequence
MKRSRRKNRSSRKLKSKKLRYRHDSPSCSDTDFESSTSLSSSSSEDDEKVGRSRSNKLKNAKPGRKRAKKRSRDDQIRDLSPHPRKRKHSKRHDRCEVEKTNKRKKRRRD
VSVSATSRDSLSCSTCGDGSTTSNESEIDRHRGRSGKRKRNRGKTERSRYRSKSHSPCSLCSEGSDYQNEVEDGSYVENNFRRLRSVIVVVGEENKLKTFDGNEQQEEVM
HHPDDDHPSFGDMDSNDGMSKRELDRVTSNEASEVENKKEVVIPDIRNFLVVKDGGVQNEGSNNNHGGVTNDHPLNEGNNGSGNTDRINCIDLESILRQRALENLRKFKG
VPPKNVETSANCQVDNSNDAKQLYSPVSNSVRIKSPRDDAEINGKGFSEQGGGNAVNPMIVEENGVESTNAIDSAVASTHDPIYSSQNLGKISSTSNGMNELKQDISSLD
QEAVNDNICQKVDADICSTTSRSNLVYAALRPDSKVDFLVKQASAPQEYIQTKPSISDMGVDEIAQMQIQTRNNDDQNIVNGFDSSAHKPSSSLNYFSGENSLNKPRHES
GEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALARRQLKR