; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg016366 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg016366
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
Descriptionpre-mRNA-splicing factor CWC22 homolog isoform X1
Genome locationscaffold9:37935478..37938501
RNA-Seq ExpressionSpg016366
SyntenySpg016366
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7024815.1 hypothetical protein SDJN02_13634, partial [Cucurbita argyrosperma subsp. argyrosperma]1.4e-23278Show/hide
Query:  KRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSGSEDDEKLGRSRSKTRKNAKPSKKKVKKRSDDRRSRDYSPHPRKRKNLKRDDHSEVKKT
        +RSRRKS+SSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSS SEDD+K+ RSRSKTRKN+KPSKK+ KK+S D +SR+ SPHPRKRK+ KR D  E KKT
Subjt:  KRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSGSEDDEKLGRSRSKTRKNAKPSKKKVKKRSDDRRSRDYSPHPRKRKNLKRDDHSEVKKT

Query:  NKKKRRRDVSISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTERSRYRSKSHSACSLCSEGSDHQNEVEDGSYVENNFRRLRSVIVVVG
        NKKKRRRDVS+ AT+SDSL  STCG+GS+TS++ EIDR RGRS K+K NM KTE  RYRSKSHS CSLCS+GSD QNEVED SYVEN+ RRL+S+IVVVG
Subjt:  NKKKRRRDVSISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTERSRYRSKSHSACSLCSEGSDHQNEVEDGSYVENNFRRLRSVIVVVG

Query:  KENKLETFDENGQEEEVVHQPDDDHLSFGDMGSKDGASKRELDYVTSKEAPEVENKIEVVIPDNRNSMVVKDDGVQNEGSNNNHGVVIHDRSLNERKNGC
        +E++L+TF  N Q+E V HQ DD+H SFGDM SKDG SKRELDYV SKEAPEVE+K ++V PDNRNS+++ DDGV+NEGSN NHG V +D SL+ERKNGC
Subjt:  KENKLETFDENGQEEEVVHQPDDDHLSFGDMGSKDGASKRELDYVTSKEAPEVENKIEVVIPDNRNSMVVKDDGVQNEGSNNNHGVVIHDRSLNERKNGC

Query:  SGTNDSINCIDLESILRQRALENLRKFKRVPPRNVKTPDNCKVDNNNDGKQLHSPVSKSVHVTSPRDDAKINGNGLSRQGGGNEVNSMIVEENGVKSTDA
        SG  +SINCIDLESILRQ+ALENLRKFK V PRNV+   NCKV+NNND KQL SPVSKSVHVTSPRDDA+IN  G SRQ GG+ VNSMIV+ NG KSTDA
Subjt:  SGTNDSINCIDLESILRQRALENLRKFKRVPPRNVKTPDNCKVDNNNDGKQLHSPVSKSVHVTSPRDDAKINGNGLSRQGGGNEVNSMIVEENGVKSTDA

Query:  IDSAVASMHDPVYSSQNLGRASNGSNGMNELKQDVSSVDQEVINNNICQKADADICPTTNRSNLVIAASRAESKVDSLMKQASASQESIQTKPSISDICV
        ID+AVASMHDPV SSQNLG+ SNGSNGMNELKQD+SS+DQEVIN+NIC KADADI  TTN SNLVIAA R ESKVDS +K+ASA QE IQTK SISDI V
Subjt:  IDSAVASMHDPVYSSQNLGRASNGSNGMNELKQDVSSVDQEVINNNICQKADADICPTTNRSNLVIAASRAESKVDSLMKQASASQESIQTKPSISDICV

Query:  DETAQTQTQMRNNDDQNIGNGFGSSAHKPSSSLNSISGEHGSNKSGHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALTRRQLKR
        DETAQTQTQM NNDDQNI NGFGSSA+KPSSSLNSISGE+  +KS  ESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALTRRQLKR
Subjt:  DETAQTQTQMRNNDDQNIGNGFGSSAHKPSSSLNSISGEHGSNKSGHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALTRRQLKR

XP_022139776.1 uncharacterized protein LOC111010601 [Momordica charantia]3.2e-23779.56Show/hide
Query:  KRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSGSEDDEKLGRSRSKTRKNAKPSKKKVKKRSDDRRSRDYSPHPRKRKNLKRDDHSEVKKT
        +RSRRK+RSSRKLKSKKLRYRHDSPSCSDTDFESSTS+SSS SEDDEK+GRSRS   KNAKP KK+ KKRS D + RD SPHPRKRK+ KR D  EVKKT
Subjt:  KRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSGSEDDEKLGRSRSKTRKNAKPSKKKVKKRSDDRRSRDYSPHPRKRKNLKRDDHSEVKKT

Query:  NK-KKRRRDVSISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTERSRYRSKSHSACSLCSEGSDHQNEVEDGSYVENNFRRLRSVIVVV
        NK KKRRRDVS+SATS DSLSCSTCG+GSTTSNE EIDRHRGRS K+K N GKTERSRYRSKSHS CSLCSEGSD+QNEVEDGSYVENNFRRLRSVIVVV
Subjt:  NK-KKRRRDVSISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTERSRYRSKSHSACSLCSEGSDHQNEVEDGSYVENNFRRLRSVIVVV

Query:  GKENKLETFDENGQEEEVVHQPDDDHLSFGDMGSKDGASKRELDYVTSKEAPEVENKIEVVIPDNRNSMVVKDDGVQNEGSNNNHGVVIHDRSLNERKNG
        G+ENKL+TFD N Q+EEV+H PDDDH SFGDM S DG SKRELD VTS EA EVENK EVVIPD RN +VVKD GVQNEGSNNNHG V +D  LNE  NG
Subjt:  GKENKLETFDENGQEEEVVHQPDDDHLSFGDMGSKDGASKRELDYVTSKEAPEVENKIEVVIPDNRNSMVVKDDGVQNEGSNNNHGVVIHDRSLNERKNG

Query:  CSGTNDSINCIDLESILRQRALENLRKFKRVPPRNVKTPDNCKVDNNNDGKQLHSPVSKSVHVTSPRDDAKINGNGLSRQGGGNEVNSMIVEENGVKSTD
         SG  D INCIDLESILRQRALENLRKFK VPP+NV+T  NC+VDN+ND KQL+SPVS SV + SPRDDA+ING G S QGGGN VN MIVEENGV+ST+
Subjt:  CSGTNDSINCIDLESILRQRALENLRKFKRVPPRNVKTPDNCKVDNNNDGKQLHSPVSKSVHVTSPRDDAKINGNGLSRQGGGNEVNSMIVEENGVKSTD

Query:  AIDSAVASMHDPVYSSQNLGRASNGSNGMNELKQDVSSVDQEVINNNICQKADADICPTTNRSNLVIAASRAESKVDSLMKQASASQESIQTKPSISDIC
        AIDSAVAS HDP+YSSQNLG+ S+ SNGMNELKQD+SS+DQE +N+NICQK DADIC TT+RSNLV AA R +SKVD L+KQASA QE IQTKPSISD+ 
Subjt:  AIDSAVASMHDPVYSSQNLGRASNGSNGMNELKQDVSSVDQEVINNNICQKADADICPTTNRSNLVIAASRAESKVDSLMKQASASQESIQTKPSISDIC

Query:  VDETAQTQTQMRNNDDQNIGNGFGSSAHKPSSSLNSISGEHGSNKSGHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALTRRQLKR
        VDE AQ Q Q RNNDDQNI NGF SSAHKPSSSLN  SGE+  NK  HESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPAL RRQLKR
Subjt:  VDETAQTQTQMRNNDDQNIGNGFGSSAHKPSSSLNSISGEHGSNKSGHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALTRRQLKR

XP_023525531.1 uncharacterized protein LOC111789118 isoform X1 [Cucurbita pepo subsp. pepo]7.1e-22977.44Show/hide
Query:  MKRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSGSEDDEKLGRSRSKTRKNAKPSKKKVKKRSDDRRSRDYSPHPRKRKNLKRDDHSEVKK
        MKRSRRKSRSSRKLKSKKLRYRHDSPS SD DFESSTS+SSS +EDDEK+GRSRSK RKNA   KK+ KKRS DR+ RDYSPHPRKRK+ KRD+  E+KK
Subjt:  MKRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSGSEDDEKLGRSRSKTRKNAKPSKKKVKKRSDDRRSRDYSPHPRKRKNLKRDDHSEVKK

Query:  TNKKKRRRDVSISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTERSRYRSKSHSACSLCSEGSDHQNEVEDGSYVENNFRRLRSVIVVV
        +NKKKR+RD S+ ATSSDSLSCSTCG+GSTTSNE EIDR +GRSR++K NMGK ERSRYRSKSHS CSLCSEGSDHQNEVED  YVENNFRRLRSVIV+V
Subjt:  TNKKKRRRDVSISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTERSRYRSKSHSACSLCSEGSDHQNEVEDGSYVENNFRRLRSVIVVV

Query:  GKENKLETFDENGQEEEVVHQPDDDHLSFGDMGSKDGASKRELDYVTSKEAPEVENKIEVVIPDNRNSMVVKDDGVQNEGSNNNHGVVIHDRSLNERKNG
        G+E+KLETFD N  +E V HQPD DH SFGD+   DG S RELD + S+EAP       V+I DNRNS+VVKDDGVQNEGSNNNHG   HD  L ERKNG
Subjt:  GKENKLETFDENGQEEEVVHQPDDDHLSFGDMGSKDGASKRELDYVTSKEAPEVENKIEVVIPDNRNSMVVKDDGVQNEGSNNNHGVVIHDRSLNERKNG

Query:  CSGTNDSINCIDLESILRQRALENLRKFKRVPPRNVKTPDNCKVDNNNDGKQLHSPVSKSVHVTSPRDDAKINGNGLSRQGGGNEVNSMIVEENGVKSTD
        CS   D  NCIDLESILRQRALENLRK+KRV PRNV+TP NC+VDN+ND KQL SPVSKSVHVTSPRD+A ING+  SRQGGGN VNSMI+ ENGVKSTD
Subjt:  CSGTNDSINCIDLESILRQRALENLRKFKRVPPRNVKTPDNCKVDNNNDGKQLHSPVSKSVHVTSPRDDAKINGNGLSRQGGGNEVNSMIVEENGVKSTD

Query:  AIDSAVASMHDPVYSSQNLGRASNGSNGMNELKQDVSSVDQEVINNNICQKADADICPTTNRSNLVIAASRAESKVDSLMKQASASQESIQTKPSISDIC
         +DSAVAS +DPVYSSQ LG+ SNGSN +NELKQ +SSVDQEV+N++ICQKADADICPTTNRSNLVIAA + ES VDSL +QASASQESIQTKPS S++ 
Subjt:  AIDSAVASMHDPVYSSQNLGRASNGSNGMNELKQDVSSVDQEVINNNICQKADADICPTTNRSNLVIAASRAESKVDSLMKQASASQESIQTKPSISDIC

Query:  VDETAQTQTQMRNNDDQNIGNGFGSSAHKP--SSSLNSISGEHGSNKSGHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALTRRQLKR
          ETAQTQTQMRNND QNIG+GFGS AHKP  SSSLNSISGE+  N+S HESGEGSQFEQKTMSV RGGEMVQVNYKVYIPKRAP L+RRQLKR
Subjt:  VDETAQTQTQMRNNDDQNIGNGFGSSAHKP--SSSLNSISGEHGSNKSGHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALTRRQLKR

XP_023535556.1 transcriptional regulator ATRX isoform X1 [Cucurbita pepo subsp. pepo]5.6e-23477.66Show/hide
Query:  KRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSGSEDDEKLGRSRSKTRKNAKPSKKKVKKRSDDRRSRDYSPHPRKRKNLKRDDHSEVKKT
        +RSRRKS+SSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSS SEDD+K+ RSRSKTRKN+KPSKK+ KK+S D +SR+  PHPRKRK+ KR D  E KKT
Subjt:  KRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSGSEDDEKLGRSRSKTRKNAKPSKKKVKKRSDDRRSRDYSPHPRKRKNLKRDDHSEVKKT

Query:  NKKKRRRDVSISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTERSRYRSKSHSACSLCSEGSDHQNEVEDGSYVENNFRRLRSVIVVVG
        NKKKRRRDVS+ AT+SDSLS STCG+GS+TS++ EIDR RGRS K+K NM KTE  RYRSKS S CSLCS+G D QNEVED SYVEN+ RRL+S+IVVVG
Subjt:  NKKKRRRDVSISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTERSRYRSKSHSACSLCSEGSDHQNEVEDGSYVENNFRRLRSVIVVVG

Query:  KENKLETFDENGQEEEVVHQPDDDHLSFGDMGSKDGASKRELDYVTSKEAPEVENKIEVVIPDNRNSMVVKDDGVQNEGSNNNHGVVIHDRSLNERKNGC
        +E++L+TF  N Q+E V HQ D++H SFGDM SKDG SKRELDYV SKEAPEVE+K ++V PDNRNS+++ DDGV+NEGSN NHG V +D SL+ERKNGC
Subjt:  KENKLETFDENGQEEEVVHQPDDDHLSFGDMGSKDGASKRELDYVTSKEAPEVENKIEVVIPDNRNSMVVKDDGVQNEGSNNNHGVVIHDRSLNERKNGC

Query:  SGTNDSINCIDLESILRQRALENLRKFKRVPPRNVKTPDNCKVDNNNDGKQLHSPVSKSVHVTSPRDDAKINGNGLSRQGGGNEVNSMIVEENGVKSTDA
        SG  DSINCI+LESILRQ+ALENLRKFK V PRNV+   NCKV+NNND KQL SPVSKSVHVT PRDDA+ING G SRQ GG+ VNSMIV+ENG KSTDA
Subjt:  SGTNDSINCIDLESILRQRALENLRKFKRVPPRNVKTPDNCKVDNNNDGKQLHSPVSKSVHVTSPRDDAKINGNGLSRQGGGNEVNSMIVEENGVKSTDA

Query:  IDSAVASMHDPVYSSQNLGRASNGSNGMNELKQDVSSVDQEVINNNICQKADADICPTTNRSNLVIAASRAESKVDSLMKQASASQESIQTKPSISDICV
        ID+AVASMHDPV SSQNLG+ SNGSNGMNELKQD+SS+DQEVIN+NIC KADADI  TTNRSNLVIAA R ESKVDSL+++ASA+QE I+TKPSISDI V
Subjt:  IDSAVASMHDPVYSSQNLGRASNGSNGMNELKQDVSSVDQEVINNNICQKADADICPTTNRSNLVIAASRAESKVDSLMKQASASQESIQTKPSISDICV

Query:  DETAQTQTQMRNNDDQNIGNGFGSSAHKPSSSLNSISGEHGSNKSGHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALTRRQLKR
        DETAQT+TQM+NN+DQNI NGFGSSA+KPSSSLNSISGE+  +KS HESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALTRRQLKR
Subjt:  DETAQTQTQMRNNDDQNIGNGFGSSAHKPSSSLNSISGEHGSNKSGHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALTRRQLKR

XP_038897880.1 histone-lysine N-methyltransferase SETD2 isoform X1 [Benincasa hispida]4.7e-23378.75Show/hide
Query:  KRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSGSEDDEKLGRSRSKTRKNAKPSKKKVKKRSDDRRSRDYSPHPRKRKNLKRDDHSEVKKT
        +RSRRKS+SS+KLKSKKLRYRHDSPSCSDTDFESSTSVSSS SEDD+++ RSRSKTRKNAKPSKK+ K++S DR+SR+ SPHPRKRK+ KR+DH E KK 
Subjt:  KRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSGSEDDEKLGRSRSKTRKNAKPSKKKVKKRSDDRRSRDYSPHPRKRKNLKRDDHSEVKKT

Query:  NKKKRRRDVSISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTERSRYRSKSHSACSLCSEGSDHQNEVEDGSYVENNFRRLRSVIVVVG
         KKKRRRD S+ A  SDS SCSTCG GSTTSNE E+ R RGRS K+KGNMGKTER RYRSKS S CSL S+ SD+QNEV+D SYV NNFRRLRS+IV+ G
Subjt:  NKKKRRRDVSISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTERSRYRSKSHSACSLCSEGSDHQNEVEDGSYVENNFRRLRSVIVVVG

Query:  KENKLETFDENGQEEEVVHQPD--DDHLSFGDMGSKDGASKRELDYVTSKEAPEVENKIEVVIPDNRNSMVVKDDGVQNEGSNNNHGVVIHDRSLNERKN
        +ENKL+TF  N Q+E   HQP+  DDH S GDM SKD  SKRELDYV SKE P VE K EV +P+NRNSMVVKDDGVQNEGSN N G V +D SL+ERKN
Subjt:  KENKLETFDENGQEEEVVHQPD--DDHLSFGDMGSKDGASKRELDYVTSKEAPEVENKIEVVIPDNRNSMVVKDDGVQNEGSNNNHGVVIHDRSLNERKN

Query:  GCSGTNDSINCIDLESILRQRALENLRKFKRVPPRNVKTPDNCKVDNNNDGKQLHSPVSKSVHVTSPRDDAKINGNGLSRQGGGNEVNSMIVEENGVKST
        GCSG  DS+N IDLESILRQRALENLRKFK  PPRNV+T  NCKVD+NND KQL SPVSKSVHVTSPRDDA+IN  G SRQGGGN VNSMIV+ENGVKST
Subjt:  GCSGTNDSINCIDLESILRQRALENLRKFKRVPPRNVKTPDNCKVDNNNDGKQLHSPVSKSVHVTSPRDDAKINGNGLSRQGGGNEVNSMIVEENGVKST

Query:  DAIDSAVASMHDPVYSSQNLGRASNGSNGMNELKQDVSSVDQEVINNNICQKADADICPTTNRSNLVIAASRAESKVDSLMKQASASQESIQTKPSISDI
        DAIDS+V SMHDPVYSSQNLG+ SNGSNGMNELKQ++SS+DQEVIN+NICQKADADIC TTNRSNLVIAA R ESKVDSL+KQA A+QESIQTKPSISDI
Subjt:  DAIDSAVASMHDPVYSSQNLGRASNGSNGMNELKQDVSSVDQEVINNNICQKADADICPTTNRSNLVIAASRAESKVDSLMKQASASQESIQTKPSISDI

Query:  CVDETAQTQTQMRNNDDQNIGNGFGSSAHKPSSSLNSISGEHGSNKSGHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALTRRQLKR
         VDETAQTQTQMRNNDDQNI NG  SSAHKP SSLNSISGE+  + S HESG+ SQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALTRRQLKR
Subjt:  CVDETAQTQTQMRNNDDQNIGNGFGSSAHKPSSSLNSISGEHGSNKSGHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALTRRQLKR

TrEMBL top hitse value%identityAlignment
A0A0A0L248 Uncharacterized protein1.8e-22275.97Show/hide
Query:  KRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSGSEDDEKLGRSRSKTRKNAKPSKKKVKKRSDDRRSRDYSPHPRKRKNLKRDDHSEVKKT
        +RSRRKS+SSRKLKSKKLRYRHDSPSCSDTDFESSTSV SS SE  +++ RSRSKT+KNAKPSKK+ KK+S DR+SR+ SP+PRKRK+ KR+D  EV K 
Subjt:  KRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSGSEDDEKLGRSRSKTRKNAKPSKKKVKKRSDDRRSRDYSPHPRKRKNLKRDDHSEVKKT

Query:  NKKKRRRDVSISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTERSRYRSKSHSACSLCSEGSDHQNEVEDGSYVENNFRRLRSVIVVVG
        NKKKRRRDVS+    S+SLSCSTCG GSTTSNE E+ R RGRS K+K NM KTE  RY SKSHS CSL SEGSD+QNEV+D SYVENNFRRLRS+IVVVG
Subjt:  NKKKRRRDVSISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTERSRYRSKSHSACSLCSEGSDHQNEVEDGSYVENNFRRLRSVIVVVG

Query:  KENKLETFDENGQEEEVVHQPDDDHLSFGDMGSKDGASKRELDYVTSKEAPEVENKIEVVIPDNRNSMVVKDDGVQNEGSNNNHGVVIHDRSLNERKNGC
        +ENKL   +E   +E V +QP DDH SFGDM SKD  SKRELDYV +KEAP VEN+ EV +P+ RNSMVV+DDGVQNEGSN NHG V +DRS +E KNGC
Subjt:  KENKLETFDENGQEEEVVHQPDDDHLSFGDMGSKDGASKRELDYVTSKEAPEVENKIEVVIPDNRNSMVVKDDGVQNEGSNNNHGVVIHDRSLNERKNGC

Query:  SGTNDSINCIDLESILRQRALENLRKFKRVPPRNVKTPDNCKVDNNNDGKQLHSPVSKSVHVTSPRDDAKINGNGLSRQGGGNEVNSMIVEENGVKSTDA
        S   DSINCIDLES+LRQRALENLRKFK  PPRNV+T  NCKV +NN  KQL SP+SKSVHVTSPR+DA+IN    SRQGGGN VNSMIV+ENGV S DA
Subjt:  SGTNDSINCIDLESILRQRALENLRKFKRVPPRNVKTPDNCKVDNNNDGKQLHSPVSKSVHVTSPRDDAKINGNGLSRQGGGNEVNSMIVEENGVKSTDA

Query:  IDSAVASMHDPVYSSQNLGRASNGSNGMNELKQDVSSVDQEVINNNICQKADADICPTTNRSNLVIAASRAESKVDSLMKQASASQESIQTKPSISDICV
        IDSAVA+MHDPVYSSQNLG+ SNGSNGMNE KQD+SS+DQE+IN+NICQKA+ADIC TTNRSNLVIAA R + KVDSL+KQ SA+QES+QTKPSISD+ V
Subjt:  IDSAVASMHDPVYSSQNLGRASNGSNGMNELKQDVSSVDQEVINNNICQKADADICPTTNRSNLVIAASRAESKVDSLMKQASASQESIQTKPSISDICV

Query:  DETAQTQTQMRNNDDQNIGNGFGSSAHKPSSSLNSISGEHGSNKSGHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALTRRQLKR
         ETAQTQTQMRNN+D NI NG GSSAHKP SSLNSISGE+  + S HESGE SQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALTRRQLKR
Subjt:  DETAQTQTQMRNNDDQNIGNGFGSSAHKPSSSLNSISGEHGSNKSGHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALTRRQLKR

A0A6J1CDR0 uncharacterized protein LOC1110106011.5e-23779.56Show/hide
Query:  KRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSGSEDDEKLGRSRSKTRKNAKPSKKKVKKRSDDRRSRDYSPHPRKRKNLKRDDHSEVKKT
        +RSRRK+RSSRKLKSKKLRYRHDSPSCSDTDFESSTS+SSS SEDDEK+GRSRS   KNAKP KK+ KKRS D + RD SPHPRKRK+ KR D  EVKKT
Subjt:  KRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSGSEDDEKLGRSRSKTRKNAKPSKKKVKKRSDDRRSRDYSPHPRKRKNLKRDDHSEVKKT

Query:  NK-KKRRRDVSISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTERSRYRSKSHSACSLCSEGSDHQNEVEDGSYVENNFRRLRSVIVVV
        NK KKRRRDVS+SATS DSLSCSTCG+GSTTSNE EIDRHRGRS K+K N GKTERSRYRSKSHS CSLCSEGSD+QNEVEDGSYVENNFRRLRSVIVVV
Subjt:  NK-KKRRRDVSISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTERSRYRSKSHSACSLCSEGSDHQNEVEDGSYVENNFRRLRSVIVVV

Query:  GKENKLETFDENGQEEEVVHQPDDDHLSFGDMGSKDGASKRELDYVTSKEAPEVENKIEVVIPDNRNSMVVKDDGVQNEGSNNNHGVVIHDRSLNERKNG
        G+ENKL+TFD N Q+EEV+H PDDDH SFGDM S DG SKRELD VTS EA EVENK EVVIPD RN +VVKD GVQNEGSNNNHG V +D  LNE  NG
Subjt:  GKENKLETFDENGQEEEVVHQPDDDHLSFGDMGSKDGASKRELDYVTSKEAPEVENKIEVVIPDNRNSMVVKDDGVQNEGSNNNHGVVIHDRSLNERKNG

Query:  CSGTNDSINCIDLESILRQRALENLRKFKRVPPRNVKTPDNCKVDNNNDGKQLHSPVSKSVHVTSPRDDAKINGNGLSRQGGGNEVNSMIVEENGVKSTD
         SG  D INCIDLESILRQRALENLRKFK VPP+NV+T  NC+VDN+ND KQL+SPVS SV + SPRDDA+ING G S QGGGN VN MIVEENGV+ST+
Subjt:  CSGTNDSINCIDLESILRQRALENLRKFKRVPPRNVKTPDNCKVDNNNDGKQLHSPVSKSVHVTSPRDDAKINGNGLSRQGGGNEVNSMIVEENGVKSTD

Query:  AIDSAVASMHDPVYSSQNLGRASNGSNGMNELKQDVSSVDQEVINNNICQKADADICPTTNRSNLVIAASRAESKVDSLMKQASASQESIQTKPSISDIC
        AIDSAVAS HDP+YSSQNLG+ S+ SNGMNELKQD+SS+DQE +N+NICQK DADIC TT+RSNLV AA R +SKVD L+KQASA QE IQTKPSISD+ 
Subjt:  AIDSAVASMHDPVYSSQNLGRASNGSNGMNELKQDVSSVDQEVINNNICQKADADICPTTNRSNLVIAASRAESKVDSLMKQASASQESIQTKPSISDIC

Query:  VDETAQTQTQMRNNDDQNIGNGFGSSAHKPSSSLNSISGEHGSNKSGHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALTRRQLKR
        VDE AQ Q Q RNNDDQNI NGF SSAHKPSSSLN  SGE+  NK  HESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPAL RRQLKR
Subjt:  VDETAQTQTQMRNNDDQNIGNGFGSSAHKPSSSLNSISGEHGSNKSGHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALTRRQLKR

A0A6J1F6D1 uncharacterized protein LOC111442542 isoform X12.9e-22876.82Show/hide
Query:  KRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSGSEDDEKLGRSRSKTRKNAKPSKKKVKKRSDDRRSRDYSPHPRKRKNLKRDDHSEVKKT
        +RSRRKS+SSR+LKSKKLRYRHDSPSCSDTDFESSTSVSSS SEDD+K+ RSRSKTRKN+KPSKK+ KK+S D +SR+ SPHPRKRK+ KR D  E KKT
Subjt:  KRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSGSEDDEKLGRSRSKTRKNAKPSKKKVKKRSDDRRSRDYSPHPRKRKNLKRDDHSEVKKT

Query:  NKKKRRRDVSISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTERSRYRSKSHSACSLCSEGSDHQNEVEDGSYVENNFRRLRSVIVVVG
        NKKKRRRD S+ AT+SDSL  STCG+GS+TS++ EIDR RGRS K+K NM KTE  RYRSKS S CSLCS+GSD QNEVED SYVEN+ RRL+S+IVVVG
Subjt:  NKKKRRRDVSISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTERSRYRSKSHSACSLCSEGSDHQNEVEDGSYVENNFRRLRSVIVVVG

Query:  KENKLETFDENGQEEEVVHQPDDDHLSFGDMGSKDGASKRELDYVTSKEAPEVENKIEVVIPDNRNSMVVKDDGVQNEGSNNNHGVVIHDRSLNERKNGC
        +E++L+TF  N Q+E V HQ DD+H SFGDM SKDG SKRELDYV SKEAPEVE+K ++V PDNRNS+++ DDGV+NEGSN NHG V +D SL+ERKNGC
Subjt:  KENKLETFDENGQEEEVVHQPDDDHLSFGDMGSKDGASKRELDYVTSKEAPEVENKIEVVIPDNRNSMVVKDDGVQNEGSNNNHGVVIHDRSLNERKNGC

Query:  SGTNDSINCIDLESILRQRALENLRKFKRVPPRNVKTPDNCKVDNNNDGKQLHSPVSKSVHVTSPRDDAKINGNGLSRQGGGNEVNSMIVEENGVKSTDA
        SG  +SINCIDLESILRQ+ALENLRKFK V PRNV+   NCKV+NNND KQL SPVSKSVHVT PRDDA+IN  G SRQ GG+ VNSMIV+ NG KSTDA
Subjt:  SGTNDSINCIDLESILRQRALENLRKFKRVPPRNVKTPDNCKVDNNNDGKQLHSPVSKSVHVTSPRDDAKINGNGLSRQGGGNEVNSMIVEENGVKSTDA

Query:  IDSAVASMHDPVYSSQNLGRASNGSNGMNELKQDVSSVDQEVINNNICQKADADICPTTNRSNLVIAASRAESKVDSLMKQASASQESIQTKPSISDICV
        ID+AVASMHDPV SSQNLG+ SNGSNGMNE KQD+SS+DQEVIN+NIC KADADI  TTNRSNLVIAA R ESKVDS +K+AS  QE IQTK SISDI V
Subjt:  IDSAVASMHDPVYSSQNLGRASNGSNGMNELKQDVSSVDQEVINNNICQKADADICPTTNRSNLVIAASRAESKVDSLMKQASASQESIQTKPSISDICV

Query:  DETAQTQTQMRNNDDQNIGNGFGSSAHKPSSSLNSISGEHGSNKSGHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALTRRQLKR
        DETAQTQTQM NNDDQNI NGFGSSA+K SSSLN ISGE+  +KS  ESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALTRRQLKR
Subjt:  DETAQTQTQMRNNDDQNIGNGFGSSAHKPSSSLNSISGEHGSNKSGHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALTRRQLKR

A0A6J1FN60 pre-mRNA-splicing factor CWC22 homolog isoform X14.1e-22276.43Show/hide
Query:  MKRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSGSEDDEKLGRSRSKTRKNAKPSKKKVKKRSDDRRSRDYSPHPRKRKNLKRDDHSEVKK
        MKRSRRKSRSSRKLKS  +RYRHDSPS SD D ESSTS+SSS +EDDEK+GRSRSK RKNA   KK+ KKRS DR+ RDYSPHPRKRK+ KRD+  E+KK
Subjt:  MKRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSGSEDDEKLGRSRSKTRKNAKPSKKKVKKRSDDRRSRDYSPHPRKRKNLKRDDHSEVKK

Query:  TNKKKRRRDVSISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTERSRYRSKSHSACSLCSEGSDHQNEVEDGSYVENNFRRLRSVIVVV
        +NKKKR+RD S+ ATSSDSLSCSTCG+GSTTSNE EIDR +GRSRK+K NMGK ERSRY SKSHS CSLCSEGSDHQNEVE+  YVENNFRRLRSVIVVV
Subjt:  TNKKKRRRDVSISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTERSRYRSKSHSACSLCSEGSDHQNEVEDGSYVENNFRRLRSVIVVV

Query:  GKENKLETFDENGQEEEVVHQPDDDHLSFGDMGSKDGASKRELDYVTSKEAPEVENKIEVVIPDNRNSMVVKDDGVQNEGSNNNHGVVIHDRSLNERKNG
        G+E+KLETFD N  +E V HQPD DH SFGD+   DG S RELD + S+EAP       V+I DNRNS+VVKDDGVQNEGSNNNHG   HD  L ERKNG
Subjt:  GKENKLETFDENGQEEEVVHQPDDDHLSFGDMGSKDGASKRELDYVTSKEAPEVENKIEVVIPDNRNSMVVKDDGVQNEGSNNNHGVVIHDRSLNERKNG

Query:  CSGTNDSINCIDLESILRQRALENLRKFKRVPPRNVKTPDNCKVDNNNDGKQLHSPVSKSVHVTSPRDDAKINGNGLSRQGGGNEVNSMIVEENGVKSTD
        CS   D  NCIDLESILRQRALENLRK+KRV PRNV+TP N +VDN+ND KQL SPVSK VHVTSPRD+A ING+  SRQGGGN VNSMI+ ENGVKSTD
Subjt:  CSGTNDSINCIDLESILRQRALENLRKFKRVPPRNVKTPDNCKVDNNNDGKQLHSPVSKSVHVTSPRDDAKINGNGLSRQGGGNEVNSMIVEENGVKSTD

Query:  AIDSAVASMHDPVYSSQNLGRASNGSNGMNELKQDVSSVDQEVINNNICQKADADICPTTNRSNLVIAASRAESKVDSLMKQASASQESIQTKPSISDIC
         +DSAVAS +DPVYSSQ+LG+ SNGSN MNELKQ +SSVDQEV+N++IC  ADADICPTTNRSNLVIAA + ES VDSL +QASASQESIQTKPS SD+ 
Subjt:  AIDSAVASMHDPVYSSQNLGRASNGSNGMNELKQDVSSVDQEVINNNICQKADADICPTTNRSNLVIAASRAESKVDSLMKQASASQESIQTKPSISDIC

Query:  VDETAQTQTQMRNNDDQNIGNGFGSSAHKP--SSSLNSISGEHGSNKSGHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALTRRQLKR
          ET QTQTQMRNND QNIG+GFGSSAHKP  SSSLNSISGE   N S HESGEGSQFEQKTMSV RGGEMVQVNYKVYIPKRAP L+RRQLKR
Subjt:  VDETAQTQTQMRNNDDQNIGNGFGSSAHKP--SSSLNSISGEHGSNKSGHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALTRRQLKR

A0A6J1IGY0 uncharacterized protein LOC111476850 isoform X11.4e-22776.14Show/hide
Query:  KRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSGSEDDEKLGRSRSKTRKNAKPSKKKVKKRSDDRRSRDYSPHPRKRKNLKRDDHSEVKKT
        +RSRRKS+SSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSS SEDD+K+ RSRSKTRKN+KPSKK+ KK+S D +SR+ SPHPRKRK+ KR+D  E KKT
Subjt:  KRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSGSEDDEKLGRSRSKTRKNAKPSKKKVKKRSDDRRSRDYSPHPRKRKNLKRDDHSEVKKT

Query:  NKKKRRRDVSISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTERSRYRSKSHSACSLCSEGSDHQNEVEDGSYVENNFRRLRSVIVVVG
        NKKKRRRDVS+ AT+SDSLS STCG+GS+TS++ EIDR RGRS K+K NM KTE  RYRSKS S CSLCS+GSD QNEVED SYV+N  RRL+S+IVVVG
Subjt:  NKKKRRRDVSISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTERSRYRSKSHSACSLCSEGSDHQNEVEDGSYVENNFRRLRSVIVVVG

Query:  KENKLETFDENGQEEEVVHQPDDDHLSFGDMGSKDGASKRELDYVTSKEAPEVENKIEVVIPDNRNSMVVKDDGVQNEGSNNNHGVVIHDRSLNERKNGC
        +E++L+TF  N Q+E V HQ DD+H  F DM SKDG  KRELDYV SKEAPEVE+K ++  PDNRNS+++ +DGV+NEGSN NHG V +D SL+ERKNGC
Subjt:  KENKLETFDENGQEEEVVHQPDDDHLSFGDMGSKDGASKRELDYVTSKEAPEVENKIEVVIPDNRNSMVVKDDGVQNEGSNNNHGVVIHDRSLNERKNGC

Query:  SGTNDSINCIDLESILRQRALENLRKFKRVPPRNVKTPDNCKVDNNNDGKQLHSPVSKSVHVTSPRDDAKINGNGLSRQGGGNEVNSMIVEENGVKSTDA
        SG  D+INCIDLESILRQ+ALENLRKFK   PRNV+   NCKV+NNND KQL SPVSKSVHV SPRDDA+ NG G SRQ GG+ VNSMI++ NG KSTDA
Subjt:  SGTNDSINCIDLESILRQRALENLRKFKRVPPRNVKTPDNCKVDNNNDGKQLHSPVSKSVHVTSPRDDAKINGNGLSRQGGGNEVNSMIVEENGVKSTDA

Query:  IDSAVASMHDPVYSSQNLGRASNGSNGMNELKQDVSSVDQEVINNNICQKADADICPTTNRSNLVIAASRAESKVDSLMKQASASQESIQTKPSISDICV
        ID+AVASMHDPV SSQNLG+ SNGSNGMNELKQD+SS+DQEVIN+NIC KADA+I  TTNRSNLVIAA R ESKVDSL+++ASA+QE IQTKPSISDI V
Subjt:  IDSAVASMHDPVYSSQNLGRASNGSNGMNELKQDVSSVDQEVINNNICQKADADICPTTNRSNLVIAASRAESKVDSLMKQASASQESIQTKPSISDICV

Query:  DETAQTQTQMRNNDDQNIGNGFGSSAHKPSSSLNSISGEHGSNKSGHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALTRRQLKR
        DE +QTQTQ  NNDDQNI NGFGSSA+KPSSSLNSISGE+  +KS  ESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALTRRQLKR
Subjt:  DETAQTQTQMRNNDDQNIGNGFGSSAHKPSSSLNSISGEHGSNKSGHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALTRRQLKR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G53930.1 unknown protein2.5e-1428.41Show/hide
Query:  MKRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSGSEDDEKLGRSRSKTRKNAKPSKKKVKKRSDDRRSRDYSPHPR---KRKNLKRDDHSE
        +K S  K RS +K KSK+ + +       +++   S S   S SEDD      R K ++ +K SKK+ +KR     S D S   R   K+K  KR D + 
Subjt:  MKRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSGSEDDEKLGRSRSKTRKNAKPSKKKVKKRSDDRRSRDYSPHPR---KRKNLKRDDHSE

Query:  VKKTNK----KKRRRDVSISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTERSRYRSKSHSACSLCSEGSDHQNEVEDGSYVENNFRRL
         KK  K    K+R+RD+S S+TSS+     +  +GS + +     R RGR       +GK + SR RS+        SE  D   + ED    E N RRL
Subjt:  VKKTNK----KKRRRDVSISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTERSRYRSKSHSACSLCSEGSDHQNEVEDGSYVENNFRRL

Query:  RSVIVVVGKENKLETFDENGQEEEVVHQPDDDHLSFGDMGSKDGASKRELDYVTSKEAPEVENKIEVVIPDNRNSMVVKDDGVQNEGSNNNHGVVIHDRS
        +S++VV            NG+ +E     +DD     D+    G   REL Y  S+++ E++ +       +  S +  DD    E + +    V H   
Subjt:  RSVIVVVGKENKLETFDENGQEEEVVHQPDDDHLSFGDMGSKDGASKRELDYVTSKEAPEVENKIEVVIPDNRNSMVVKDDGVQNEGSNNNHGVVIHDRS

Query:  LNERKNGCSGTNDSINCIDLESILRQRALENLRKFKRVPPRNVKTPDNCKVDNNNDGKQLHSPVSKSVHVTSPRDDAKINGNGLSRQGGGNEVNSMIVEE
                  T++S+   DLE+IL++RALENL++F+ V                    Q      K V   S  +  +I    +  Q            +
Subjt:  LNERKNGCSGTNDSINCIDLESILRQRALENLRKFKRVPPRNVKTPDNCKVDNNNDGKQLHSPVSKSVHVTSPRDDAKINGNGLSRQGGGNEVNSMIVEE

Query:  NGVKSTDAIDSAVASMHDPVYSSQNLGRASNGSNGMNELKQDVSSVDQEVINNNICQKADADICPTTNRSNLVIAASRAESKVDSLMKQASASQESIQTK
        + +      DSAV+     + +S+ +    N       L    S  DQ+   +    K  + +   T +  LV      +S   +  K+AS SQ++    
Subjt:  NGVKSTDAIDSAVASMHDPVYSSQNLGRASNGSNGMNELKQDVSSVDQEVINNNICQKADADICPTTNRSNLVIAASRAESKVDSLMKQASASQESIQTK

Query:  PSISDICVDETA--QTQTQMRNNDDQNIGN-------GFGSSAHKPSSSLNSI-SGEHGSNKSGHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAP
         SI    VD+     T   +  N+ ++I            SS+H  +  ++ +  G     K+  E+ + SQ+EQKTM+VMRGGEMVQV+YKVYIPK+A 
Subjt:  PSISDICVDETA--QTQTQMRNNDDQNIGN-------GFGSSAHKPSSSLNSI-SGEHGSNKSGHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAP

Query:  ALTRRQLKR
        +L RR+L R
Subjt:  ALTRRQLKR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGAGAAGTAGGAGAAAGAGTAGGAGTAGTAGGAAATTGAAGTCCAAGAAGCTGCGGTATCGCCACGATTCTCCATCTTGCTCTGACACTGATTTTGAAAGTTCAAC
TTCAGTGTCTTCTTCTGGCTCGGAGGATGATGAAAAATTGGGAAGATCTCGATCCAAGACGCGGAAGAATGCGAAGCCTAGTAAAAAGAAAGTTAAGAAGCGATCTGATG
ACCGTCGAAGTAGGGATTATTCTCCTCATCCCAGAAAGAGGAAGAATTTGAAGAGAGACGACCATAGCGAGGTGAAGAAGACCAACAAAAAGAAGCGTAGAAGAGATGTG
AGTATTAGTGCCACAAGTAGTGACTCTTTGAGCTGCTCAACTTGTGGAGAGGGGAGTACAACCAGCAATGAGGGTGAAATCGATAGGCATAGGGGCAGGTCTAGAAAGAA
GAAAGGAAATATGGGGAAGACTGAAAGAAGTAGATACAGGTCAAAGAGTCATTCAGCATGCTCTTTATGTAGTGAAGGTAGTGATCATCAGAATGAGGTTGAGGATGGCA
GTTATGTTGAAAACAACTTTAGACGATTAAGGTCTGTAATTGTTGTAGTAGGAAAGGAAAATAAATTAGAGACGTTTGATGAGAATGGACAAGAAGAAGAGGTCGTGCAT
CAGCCTGATGATGACCACCTGTCTTTTGGAGATATGGGCAGTAAGGATGGGGCAAGTAAAAGAGAATTAGACTATGTTACATCGAAAGAGGCACCAGAGGTAGAAAACAA
AATAGAAGTGGTTATACCTGACAATAGAAACTCTATGGTTGTAAAGGATGATGGAGTTCAAAATGAGGGAAGCAACAATAACCATGGAGTAGTAATTCATGACCGTTCTT
TAAATGAAAGAAAGAATGGCTGTTCTGGAACTAATGACAGCATAAATTGTATCGATTTAGAGTCAATTTTGAGACAGAGGGCTTTGGAAAACCTAAGAAAGTTCAAACGG
GTGCCCCCAAGGAATGTGAAAACTCCTGATAATTGCAAAGTTGACAATAATAATGATGGAAAGCAATTGCACTCTCCCGTCTCTAAGTCAGTTCACGTGACTTCCCCTAG
GGATGATGCCAAGATAAATGGTAACGGGCTCTCTAGACAAGGTGGAGGGAATGAAGTAAATTCAATGATAGTTGAAGAGAATGGTGTCAAATCTACTGATGCAATAGATT
CAGCAGTTGCATCTATGCATGATCCTGTCTATTCTTCACAGAATCTGGGTAGGGCTTCCAATGGAAGCAATGGTATGAATGAACTGAAGCAGGATGTTTCTTCAGTAGAC
CAGGAGGTTATAAATAATAATATTTGCCAGAAGGCAGATGCAGATATTTGTCCTACAACTAACAGAAGCAATTTGGTTATTGCAGCTTCGAGGGCTGAATCAAAAGTTGA
TTCTCTTATGAAGCAGGCATCTGCTTCTCAGGAATCTATCCAAACAAAGCCATCTATATCTGACATTTGTGTTGATGAGACTGCTCAAACTCAGACCCAGATGAGGAATA
ATGATGATCAAAACATTGGTAATGGTTTTGGTTCTTCAGCTCACAAGCCTTCTTCTTCCCTTAATTCTATTTCAGGAGAACATGGCTCTAACAAGTCTGGACACGAGAGT
GGCGAAGGCTCGCAGTTTGAACAGAAAACCATGTCTGTGATGCGGGGTGGTGAAATGGTGCAGGTGAACTACAAGGTCTACATCCCGAAGAGAGCTCCTGCTTTGACTAG
GAGGCAACTCAAGCGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGAGAAGTAGGAGAAAGAGTAGGAGTAGTAGGAAATTGAAGTCCAAGAAGCTGCGGTATCGCCACGATTCTCCATCTTGCTCTGACACTGATTTTGAAAGTTCAAC
TTCAGTGTCTTCTTCTGGCTCGGAGGATGATGAAAAATTGGGAAGATCTCGATCCAAGACGCGGAAGAATGCGAAGCCTAGTAAAAAGAAAGTTAAGAAGCGATCTGATG
ACCGTCGAAGTAGGGATTATTCTCCTCATCCCAGAAAGAGGAAGAATTTGAAGAGAGACGACCATAGCGAGGTGAAGAAGACCAACAAAAAGAAGCGTAGAAGAGATGTG
AGTATTAGTGCCACAAGTAGTGACTCTTTGAGCTGCTCAACTTGTGGAGAGGGGAGTACAACCAGCAATGAGGGTGAAATCGATAGGCATAGGGGCAGGTCTAGAAAGAA
GAAAGGAAATATGGGGAAGACTGAAAGAAGTAGATACAGGTCAAAGAGTCATTCAGCATGCTCTTTATGTAGTGAAGGTAGTGATCATCAGAATGAGGTTGAGGATGGCA
GTTATGTTGAAAACAACTTTAGACGATTAAGGTCTGTAATTGTTGTAGTAGGAAAGGAAAATAAATTAGAGACGTTTGATGAGAATGGACAAGAAGAAGAGGTCGTGCAT
CAGCCTGATGATGACCACCTGTCTTTTGGAGATATGGGCAGTAAGGATGGGGCAAGTAAAAGAGAATTAGACTATGTTACATCGAAAGAGGCACCAGAGGTAGAAAACAA
AATAGAAGTGGTTATACCTGACAATAGAAACTCTATGGTTGTAAAGGATGATGGAGTTCAAAATGAGGGAAGCAACAATAACCATGGAGTAGTAATTCATGACCGTTCTT
TAAATGAAAGAAAGAATGGCTGTTCTGGAACTAATGACAGCATAAATTGTATCGATTTAGAGTCAATTTTGAGACAGAGGGCTTTGGAAAACCTAAGAAAGTTCAAACGG
GTGCCCCCAAGGAATGTGAAAACTCCTGATAATTGCAAAGTTGACAATAATAATGATGGAAAGCAATTGCACTCTCCCGTCTCTAAGTCAGTTCACGTGACTTCCCCTAG
GGATGATGCCAAGATAAATGGTAACGGGCTCTCTAGACAAGGTGGAGGGAATGAAGTAAATTCAATGATAGTTGAAGAGAATGGTGTCAAATCTACTGATGCAATAGATT
CAGCAGTTGCATCTATGCATGATCCTGTCTATTCTTCACAGAATCTGGGTAGGGCTTCCAATGGAAGCAATGGTATGAATGAACTGAAGCAGGATGTTTCTTCAGTAGAC
CAGGAGGTTATAAATAATAATATTTGCCAGAAGGCAGATGCAGATATTTGTCCTACAACTAACAGAAGCAATTTGGTTATTGCAGCTTCGAGGGCTGAATCAAAAGTTGA
TTCTCTTATGAAGCAGGCATCTGCTTCTCAGGAATCTATCCAAACAAAGCCATCTATATCTGACATTTGTGTTGATGAGACTGCTCAAACTCAGACCCAGATGAGGAATA
ATGATGATCAAAACATTGGTAATGGTTTTGGTTCTTCAGCTCACAAGCCTTCTTCTTCCCTTAATTCTATTTCAGGAGAACATGGCTCTAACAAGTCTGGACACGAGAGT
GGCGAAGGCTCGCAGTTTGAACAGAAAACCATGTCTGTGATGCGGGGTGGTGAAATGGTGCAGGTGAACTACAAGGTCTACATCCCGAAGAGAGCTCCTGCTTTGACTAG
GAGGCAACTCAAGCGGTGA
Protein sequenceShow/hide protein sequence
MKRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSGSEDDEKLGRSRSKTRKNAKPSKKKVKKRSDDRRSRDYSPHPRKRKNLKRDDHSEVKKTNKKKRRRDV
SISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTERSRYRSKSHSACSLCSEGSDHQNEVEDGSYVENNFRRLRSVIVVVGKENKLETFDENGQEEEVVH
QPDDDHLSFGDMGSKDGASKRELDYVTSKEAPEVENKIEVVIPDNRNSMVVKDDGVQNEGSNNNHGVVIHDRSLNERKNGCSGTNDSINCIDLESILRQRALENLRKFKR
VPPRNVKTPDNCKVDNNNDGKQLHSPVSKSVHVTSPRDDAKINGNGLSRQGGGNEVNSMIVEENGVKSTDAIDSAVASMHDPVYSSQNLGRASNGSNGMNELKQDVSSVD
QEVINNNICQKADADICPTTNRSNLVIAASRAESKVDSLMKQASASQESIQTKPSISDICVDETAQTQTQMRNNDDQNIGNGFGSSAHKPSSSLNSISGEHGSNKSGHES
GEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALTRRQLKR