; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0037879 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0037879
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptionpre-mRNA-splicing factor CWC22 homolog isoform X2
Genome locationchr2:10130877..10133215
RNA-Seq ExpressionLag0037879
SyntenyLag0037879
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7024815.1 hypothetical protein SDJN02_13634, partial [Cucurbita argyrosperma subsp. argyrosperma]2.2e-23377.5Show/hide
Query:  MGKSSSSRKKERSKTSSSQRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSGSEGDEKLGRSRSKTRKNAKPSKKKAKKRSDDRRSRDYSPH
        MGK+SSSRKKERSK SSSQRSRRKS+SSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSS SE D+K+ RSRSKTRKN+KPSKK++KK+S D +SR+ SPH
Subjt:  MGKSSSSRKKERSKTSSSQRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSGSEGDEKLGRSRSKTRKNAKPSKKKAKKRSDDRRSRDYSPH

Query:  PRKRKNLKRDDHCEVKKTNKKKRRRDVSISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTERSRYRSKSHSACSLCSEGSDHQNEVEDG
        PRKRK+ KR D  E KKTNKKKRRRDVS+ AT+SDSL  STCG+GS+TS++ EIDR RGRS K+K NM KTE  RYRSKSHS CSLCS+GSD QNEVED 
Subjt:  PRKRKNLKRDDHCEVKKTNKKKRRRDVSISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTERSRYRSKSHSACSLCSEGSDHQNEVEDG

Query:  SYVENNFRRLRSVIIVVGEENKLETFDENGQEEEVVHQPDDDHLSFGVMGSKDGASKRELDYVTSKEAPEVENKKEVVIPDNRNSMVVKDDGVQNEGSNN
        SYVEN+ RRL+S+I+VVGEE++L+TF  N Q+E V HQ DD+H SFG M SKDG SKRELDYV SKEAPEVE+K ++V PDNRNS+++ DDGV+NEGSN 
Subjt:  SYVENNFRRLRSVIIVVGEENKLETFDENGQEEEVVHQPDDDHLSFGVMGSKDGASKRELDYVTSKEAPEVENKKEVVIPDNRNSMVVKDDGVQNEGSNN

Query:  NHGVVIHDRSLNERKNGCSGNNDSINCIDLESILRQRALENLRKFKRVPLRNVKTPDNCKVDNNNDAKQLHSPVSKSVHVTSPRDDAKINGNGLSRQGGG
        NHG V +D SL+ERKNGCSGN +SINCIDLESILRQ+ALENLRKFK V  RNV+   NCKV+NNNDAKQL SPVSKSVHVTSPRDDA+IN  G SRQ GG
Subjt:  NHGVVIHDRSLNERKNGCSGNNDSINCIDLESILRQRALENLRKFKRVPLRNVKTPDNCKVDNNNDAKQLHSPVSKSVHVTSPRDDAKINGNGLSRQGGG

Query:  NEVNSMIVEENGVISTDAIDSAVASMHDPVYSSQNLGRTSNGSNGMNELKQDVSSVDQEAINNNICQKADADICPTTNRSNLIIAASRAESKVDSLMKQA
        + VNSMIV+ NG  STDAID+AVASMHDPV SSQNLG+ SNGSNGMNELKQD+SS+DQE IN+NIC KADADI  TTN SNL+IAA R ESKVDS +K+A
Subjt:  NEVNSMIVEENGVISTDAIDSAVASMHDPVYSSQNLGRTSNGSNGMNELKQDVSSVDQEAINNNICQKADADICPTTNRSNLIIAASRAESKVDSLMKQA

Query:  SASQESIQTKPSISDIGVDETAETQTQMRNNDDQNIGNGFGSSAHK-HSSLNSISGEPSSNKSGHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAP
        SA QE IQTK SISDI VDETA+TQTQM NNDDQNI NGFGSSA+K  SSLNSISGE S +KS  ESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAP
Subjt:  SASQESIQTKPSISDIGVDETAETQTQMRNNDDQNIGNGFGSSAHK-HSSLNSISGEPSSNKSGHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAP

Query:  ALTRRQLKR
        ALTRRQLKR
Subjt:  ALTRRQLKR

XP_022139776.1 uncharacterized protein LOC111010601 [Momordica charantia]4.9e-24180Show/hide
Query:  MGKSSSSRKKERSKTSSSQRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSGSEGDEKLGRSRSKTRKNAKPSKKKAKKRSDDRRSRDYSPH
        MGK   SRKKERSKTSSSQRSRRK+RSSRKLKSKKLRYRHDSPSCSDTDFESSTS+SSS SE DEK+GRSRS   KNAKP KK+AKKRS D + RD SPH
Subjt:  MGKSSSSRKKERSKTSSSQRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSGSEGDEKLGRSRSKTRKNAKPSKKKAKKRSDDRRSRDYSPH

Query:  PRKRKNLKRDDHCEVKKTNK-KKRRRDVSISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTERSRYRSKSHSACSLCSEGSDHQNEVED
        PRKRK+ KR D CEVKKTNK KKRRRDVS+SATS DSLSCSTCG+GSTTSNE EIDRHRGRS K+K N GKTERSRYRSKSHS CSLCSEGSD+QNEVED
Subjt:  PRKRKNLKRDDHCEVKKTNK-KKRRRDVSISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTERSRYRSKSHSACSLCSEGSDHQNEVED

Query:  GSYVENNFRRLRSVIIVVGEENKLETFDENGQEEEVVHQPDDDHLSFGVMGSKDGASKRELDYVTSKEAPEVENKKEVVIPDNRNSMVVKDDGVQNEGSN
        GSYVENNFRRLRSVI+VVGEENKL+TFD N Q+EEV+H PDDDH SFG M S DG SKRELD VTS EA EVENKKEVVIPD RN +VVKD GVQNEGSN
Subjt:  GSYVENNFRRLRSVIIVVGEENKLETFDENGQEEEVVHQPDDDHLSFGVMGSKDGASKRELDYVTSKEAPEVENKKEVVIPDNRNSMVVKDDGVQNEGSN

Query:  NNHGVVIHDRSLNERKNGCSGNNDSINCIDLESILRQRALENLRKFKRVPLRNVKTPDNCKVDNNNDAKQLHSPVSKSVHVTSPRDDAKINGNGLSRQGG
        NNHG V +D  LNE  NG SGN D INCIDLESILRQRALENLRKFK VP +NV+T  NC+VDN+NDAKQL+SPVS SV + SPRDDA+ING G S QGG
Subjt:  NNHGVVIHDRSLNERKNGCSGNNDSINCIDLESILRQRALENLRKFKRVPLRNVKTPDNCKVDNNNDAKQLHSPVSKSVHVTSPRDDAKINGNGLSRQGG

Query:  GNEVNSMIVEENGVISTDAIDSAVASMHDPVYSSQNLGRTSNGSNGMNELKQDVSSVDQEAINNNICQKADADICPTTNRSNLIIAASRAESKVDSLMKQ
        GN VN MIVEENGV ST+AIDSAVAS HDP+YSSQNLG+ S+ SNGMNELKQD+SS+DQEA+N+NICQK DADIC TT+RSNL+ AA R +SKVD L+KQ
Subjt:  GNEVNSMIVEENGVISTDAIDSAVASMHDPVYSSQNLGRTSNGSNGMNELKQDVSSVDQEAINNNICQKADADICPTTNRSNLIIAASRAESKVDSLMKQ

Query:  ASASQESIQTKPSISDIGVDETAETQTQMRNNDDQNIGNGFGSSAHK-HSSLNSISGEPSSNKSGHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRA
        ASA QE IQTKPSISD+GVDE A+ Q Q RNNDDQNI NGF SSAHK  SSLN  SGE S NK  HESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRA
Subjt:  ASASQESIQTKPSISDIGVDETAETQTQMRNNDDQNIGNGFGSSAHK-HSSLNSISGEPSSNKSGHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRA

Query:  PALTRRQLKR
        PAL RRQLKR
Subjt:  PALTRRQLKR

XP_022935712.1 uncharacterized protein LOC111442542 isoform X1 [Cucurbita moschata]6.6e-23076.52Show/hide
Query:  MGKSSSSRKKERSKTSSSQRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSGSEGDEKLGRSRSKTRKNAKPSKKKAKKRSDDRRSRDYSPH
        MGK+SSSRKKERSKTSSSQRSRRKS+SSR+LKSKKLRYRHDSPSCSDTDFESSTSVSSS SE D+K+ RSRSKTRKN+KPSKK++KK+S D +SR+ SPH
Subjt:  MGKSSSSRKKERSKTSSSQRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSGSEGDEKLGRSRSKTRKNAKPSKKKAKKRSDDRRSRDYSPH

Query:  PRKRKNLKRDDHCEVKKTNKKKRRRDVSISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTERSRYRSKSHSACSLCSEGSDHQNEVEDG
        PRKRK+ KR D  E KKTNKKKRRRD S+ AT+SDSL  STCG+GS+TS++ EIDR RGRS K+K NM KTE  RYRSKS S CSLCS+GSD QNEVED 
Subjt:  PRKRKNLKRDDHCEVKKTNKKKRRRDVSISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTERSRYRSKSHSACSLCSEGSDHQNEVEDG

Query:  SYVENNFRRLRSVIIVVGEENKLETFDENGQEEEVVHQPDDDHLSFGVMGSKDGASKRELDYVTSKEAPEVENKKEVVIPDNRNSMVVKDDGVQNEGSNN
        SYVEN+ RRL+S+I+VVGEE++L+TF  N Q+E V HQ DD+H SFG M SKDG SKRELDYV SKEAPEVE+K ++V PDNRNS+++ DDGV+NEGSN 
Subjt:  SYVENNFRRLRSVIIVVGEENKLETFDENGQEEEVVHQPDDDHLSFGVMGSKDGASKRELDYVTSKEAPEVENKKEVVIPDNRNSMVVKDDGVQNEGSNN

Query:  NHGVVIHDRSLNERKNGCSGNNDSINCIDLESILRQRALENLRKFKRVPLRNVKTPDNCKVDNNNDAKQLHSPVSKSVHVTSPRDDAKINGNGLSRQGGG
        NHG V +D SL+ERKNGCSGN +SINCIDLESILRQ+ALENLRKFK V  RNV+   NCKV+NNNDAKQL SPVSKSVHVT PRDDA+IN  G SRQ GG
Subjt:  NHGVVIHDRSLNERKNGCSGNNDSINCIDLESILRQRALENLRKFKRVPLRNVKTPDNCKVDNNNDAKQLHSPVSKSVHVTSPRDDAKINGNGLSRQGGG

Query:  NEVNSMIVEENGVISTDAIDSAVASMHDPVYSSQNLGRTSNGSNGMNELKQDVSSVDQEAINNNICQKADADICPTTNRSNLIIAASRAESKVDSLMKQA
        + VNSMIV+ NG  STDAID+AVASMHDPV SSQNLG+ SNGSNGMNE KQD+SS+DQE IN+NIC KADADI  TTNRSNL+IAA R ESKVDS +K+A
Subjt:  NEVNSMIVEENGVISTDAIDSAVASMHDPVYSSQNLGRTSNGSNGMNELKQDVSSVDQEAINNNICQKADADICPTTNRSNLIIAASRAESKVDSLMKQA

Query:  SASQESIQTKPSISDIGVDETAETQTQMRNNDDQNIGNGFGSSAHK-HSSLNSISGEPSSNKSGHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAP
        S  QE IQTK SISDI VDETA+TQTQM NNDDQNI NGFGSSA+K  SSLN ISGE   +KS  ESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAP
Subjt:  SASQESIQTKPSISDIGVDETAETQTQMRNNDDQNIGNGFGSSAHK-HSSLNSISGEPSSNKSGHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAP

Query:  ALTRRQLKR
        ALTRRQLKR
Subjt:  ALTRRQLKR

XP_023535556.1 transcriptional regulator ATRX isoform X1 [Cucurbita pepo subsp. pepo]2.3e-23577.34Show/hide
Query:  MGKSSSSRKKERSKTSSSQRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSGSEGDEKLGRSRSKTRKNAKPSKKKAKKRSDDRRSRDYSPH
        MGK+SSSRKKERSKTSSSQRSRRKS+SSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSS SE D+K+ RSRSKTRKN+KPSKK++KK+S D +SR+  PH
Subjt:  MGKSSSSRKKERSKTSSSQRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSGSEGDEKLGRSRSKTRKNAKPSKKKAKKRSDDRRSRDYSPH

Query:  PRKRKNLKRDDHCEVKKTNKKKRRRDVSISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTERSRYRSKSHSACSLCSEGSDHQNEVEDG
        PRKRK+ KR D  E KKTNKKKRRRDVS+ AT+SDSLS STCG+GS+TS++ EIDR RGRS K+K NM KTE  RYRSKS S CSLCS+G D QNEVED 
Subjt:  PRKRKNLKRDDHCEVKKTNKKKRRRDVSISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTERSRYRSKSHSACSLCSEGSDHQNEVEDG

Query:  SYVENNFRRLRSVIIVVGEENKLETFDENGQEEEVVHQPDDDHLSFGVMGSKDGASKRELDYVTSKEAPEVENKKEVVIPDNRNSMVVKDDGVQNEGSNN
        SYVEN+ RRL+S+I+VVGEE++L+TF  N Q+E V HQ D++H SFG M SKDG SKRELDYV SKEAPEVE+K ++V PDNRNS+++ DDGV+NEGSN 
Subjt:  SYVENNFRRLRSVIIVVGEENKLETFDENGQEEEVVHQPDDDHLSFGVMGSKDGASKRELDYVTSKEAPEVENKKEVVIPDNRNSMVVKDDGVQNEGSNN

Query:  NHGVVIHDRSLNERKNGCSGNNDSINCIDLESILRQRALENLRKFKRVPLRNVKTPDNCKVDNNNDAKQLHSPVSKSVHVTSPRDDAKINGNGLSRQGGG
        NHG V +D SL+ERKNGCSGN DSINCI+LESILRQ+ALENLRKFK V  RNV+   NCKV+NNNDAKQL SPVSKSVHVT PRDDA+ING G SRQ GG
Subjt:  NHGVVIHDRSLNERKNGCSGNNDSINCIDLESILRQRALENLRKFKRVPLRNVKTPDNCKVDNNNDAKQLHSPVSKSVHVTSPRDDAKINGNGLSRQGGG

Query:  NEVNSMIVEENGVISTDAIDSAVASMHDPVYSSQNLGRTSNGSNGMNELKQDVSSVDQEAINNNICQKADADICPTTNRSNLIIAASRAESKVDSLMKQA
        + VNSMIV+ENG  STDAID+AVASMHDPV SSQNLG+ SNGSNGMNELKQD+SS+DQE IN+NIC KADADI  TTNRSNL+IAA R ESKVDSL+++A
Subjt:  NEVNSMIVEENGVISTDAIDSAVASMHDPVYSSQNLGRTSNGSNGMNELKQDVSSVDQEAINNNICQKADADICPTTNRSNLIIAASRAESKVDSLMKQA

Query:  SASQESIQTKPSISDIGVDETAETQTQMRNNDDQNIGNGFGSSAHK-HSSLNSISGEPSSNKSGHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAP
        SA+QE I+TKPSISDI VDETA+T+TQM+NN+DQNI NGFGSSA+K  SSLNSISGE S +KS HESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAP
Subjt:  SASQESIQTKPSISDIGVDETAETQTQMRNNDDQNIGNGFGSSAHK-HSSLNSISGEPSSNKSGHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAP

Query:  ALTRRQLKR
        ALTRRQLKR
Subjt:  ALTRRQLKR

XP_038897880.1 histone-lysine N-methyltransferase SETD2 isoform X1 [Benincasa hispida]7.0e-24079.02Show/hide
Query:  MGKSSSSRKKERSKTSSSQRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSGSEGDEKLGRSRSKTRKNAKPSKKKAKKRSDDRRSRDYSPH
        MGK+SSSRKKERSKTSSSQRSRRKS+SS+KLKSKKLRYRHDSPSCSDTDFESSTSVSSS SE D+++ RSRSKTRKNAKPSKK++K++S DR+SR+ SPH
Subjt:  MGKSSSSRKKERSKTSSSQRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSGSEGDEKLGRSRSKTRKNAKPSKKKAKKRSDDRRSRDYSPH

Query:  PRKRKNLKRDDHCEVKKTNKKKRRRDVSISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTERSRYRSKSHSACSLCSEGSDHQNEVEDG
        PRKRK+ KR+DHCE KK  KKKRRRD S+ A  SDS SCSTCG GSTTSNE E+ R RGRS K+KGNMGKTER RYRSKS S CSL S+ SD+QNEV+D 
Subjt:  PRKRKNLKRDDHCEVKKTNKKKRRRDVSISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTERSRYRSKSHSACSLCSEGSDHQNEVEDG

Query:  SYVENNFRRLRSVIIVVGEENKLETFDENGQEEEVVHQPD--DDHLSFGVMGSKDGASKRELDYVTSKEAPEVENKKEVVIPDNRNSMVVKDDGVQNEGS
        SYV NNFRRLRS+I++ GEENKL+TF  N Q+E   HQP+  DDH S G M SKD  SKRELDYV SKE P VE KKEV +P+NRNSMVVKDDGVQNEGS
Subjt:  SYVENNFRRLRSVIIVVGEENKLETFDENGQEEEVVHQPD--DDHLSFGVMGSKDGASKRELDYVTSKEAPEVENKKEVVIPDNRNSMVVKDDGVQNEGS

Query:  NNNHGVVIHDRSLNERKNGCSGNNDSINCIDLESILRQRALENLRKFKRVPLRNVKTPDNCKVDNNNDAKQLHSPVSKSVHVTSPRDDAKINGNGLSRQG
        N N G V +D SL+ERKNGCSG  DS+N IDLESILRQRALENLRKFK  P RNV+T  NCKVD+NNDAKQL SPVSKSVHVTSPRDDA+IN  G SRQG
Subjt:  NNNHGVVIHDRSLNERKNGCSGNNDSINCIDLESILRQRALENLRKFKRVPLRNVKTPDNCKVDNNNDAKQLHSPVSKSVHVTSPRDDAKINGNGLSRQG

Query:  GGNEVNSMIVEENGVISTDAIDSAVASMHDPVYSSQNLGRTSNGSNGMNELKQDVSSVDQEAINNNICQKADADICPTTNRSNLIIAASRAESKVDSLMK
        GGN VNSMIV+ENGV STDAIDS+V SMHDPVYSSQNLG+ SNGSNGMNELKQ++SS+DQE IN+NICQKADADIC TTNRSNL+IAA R ESKVDSL+K
Subjt:  GGNEVNSMIVEENGVISTDAIDSAVASMHDPVYSSQNLGRTSNGSNGMNELKQDVSSVDQEAINNNICQKADADICPTTNRSNLIIAASRAESKVDSLMK

Query:  QASASQESIQTKPSISDIGVDETAETQTQMRNNDDQNIGNGFGSSAHKHSSLNSISGEPSSNKSGHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRA
        QA A+QESIQTKPSISDIGVDETA+TQTQMRNNDDQNI NG  SSAHK SSLNSISGE S + S HESG+ SQFEQKTMSVMRGGEMVQVNYKVYIPKRA
Subjt:  QASASQESIQTKPSISDIGVDETAETQTQMRNNDDQNIGNGFGSSAHKHSSLNSISGEPSSNKSGHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRA

Query:  PALTRRQLKR
        PALTRRQLKR
Subjt:  PALTRRQLKR

TrEMBL top hitse value%identityAlignment
A0A0A0L248 Uncharacterized protein6.0e-22976.64Show/hide
Query:  MGKSSSSRKKERSKTSSSQRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSGSEGDEKLGRSRSKTRKNAKPSKKKAKKRSDDRRSRDYSPH
        MGK+SSSRKKERSKTSSSQRSRRKS+SSRKLKSKKLRYRHDSPSCSDTDFESSTSV SS SE  +++ RSRSKT+KNAKPSKK++KK+S DR+SR+ SP+
Subjt:  MGKSSSSRKKERSKTSSSQRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSGSEGDEKLGRSRSKTRKNAKPSKKKAKKRSDDRRSRDYSPH

Query:  PRKRKNLKRDDHCEVKKTNKKKRRRDVSISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTERSRYRSKSHSACSLCSEGSDHQNEVEDG
        PRKRK+ KR+D  EV K NKKKRRRDVS+    S+SLSCSTCG GSTTSNE E+ R RGRS K+K NM KTE  RY SKSHS CSL SEGSD+QNEV+D 
Subjt:  PRKRKNLKRDDHCEVKKTNKKKRRRDVSISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTERSRYRSKSHSACSLCSEGSDHQNEVEDG

Query:  SYVENNFRRLRSVIIVVGEENKLETFDENGQEEEVVHQPDDDHLSFGVMGSKDGASKRELDYVTSKEAPEVENKKEVVIPDNRNSMVVKDDGVQNEGSNN
        SYVENNFRRLRS+I+VVGEENKL   +E   +E V +QP DDH SFG M SKD  SKRELDYV +KEAP VEN+KEV +P+ RNSMVV+DDGVQNEGSN 
Subjt:  SYVENNFRRLRSVIIVVGEENKLETFDENGQEEEVVHQPDDDHLSFGVMGSKDGASKRELDYVTSKEAPEVENKKEVVIPDNRNSMVVKDDGVQNEGSNN

Query:  NHGVVIHDRSLNERKNGCSGNNDSINCIDLESILRQRALENLRKFKRVPLRNVKTPDNCKVDNNNDAKQLHSPVSKSVHVTSPRDDAKINGNGLSRQGGG
        NHG V +DRS +E KNGCS N DSINCIDLES+LRQRALENLRKFK  P RNV+T  NCKV +NN AKQL SP+SKSVHVTSPR+DA+IN    SRQGGG
Subjt:  NHGVVIHDRSLNERKNGCSGNNDSINCIDLESILRQRALENLRKFKRVPLRNVKTPDNCKVDNNNDAKQLHSPVSKSVHVTSPRDDAKINGNGLSRQGGG

Query:  NEVNSMIVEENGVISTDAIDSAVASMHDPVYSSQNLGRTSNGSNGMNELKQDVSSVDQEAINNNICQKADADICPTTNRSNLIIAASRAESKVDSLMKQA
        N VNSMIV+ENGV S DAIDSAVA+MHDPVYSSQNLG+ SNGSNGMNE KQD+SS+DQE IN+NICQKA+ADIC TTNRSNL+IAA R + KVDSL+KQ 
Subjt:  NEVNSMIVEENGVISTDAIDSAVASMHDPVYSSQNLGRTSNGSNGMNELKQDVSSVDQEAINNNICQKADADICPTTNRSNLIIAASRAESKVDSLMKQA

Query:  SASQESIQTKPSISDIGVDETAETQTQMRNNDDQNIGNGFGSSAHKHSSLNSISGEPSSNKSGHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPA
        SA+QES+QTKPSISD+ V ETA+TQTQMRNN+D NI NG GSSAHK SSLNSISGE S + S HESGE SQFEQKTMSVMRGGEMVQVNYKVYIPKRAPA
Subjt:  SASQESIQTKPSISDIGVDETAETQTQMRNNDDQNIGNGFGSSAHKHSSLNSISGEPSSNKSGHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPA

Query:  LTRRQLKR
        LTRRQLKR
Subjt:  LTRRQLKR

A0A1S3CJV0 uncharacterized protein LOC103501777 isoform X21.7e-22375.33Show/hide
Query:  MGKSSSSRKKERSKTSSSQRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSGSEGDEKLGRSRSKTRKNAKPSKKKAKKRSDDRRSRDYSPH
        MGK+SSSRKKERSKTSSSQRSRRKS+SSRKLKSKK RYRHDSPSCSDTDFESSTSV SS SE D+K+ RSRSKTRKN KPSKK+ KK+S DR+SR+ SP+
Subjt:  MGKSSSSRKKERSKTSSSQRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSGSEGDEKLGRSRSKTRKNAKPSKKKAKKRSDDRRSRDYSPH

Query:  PRKRKNLKRDDHCEVKKTNKKKRRRDVSISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTERSRYRSKSHSACSLCSEGSDHQNEVEDG
        PRKRK+ KR+D  EV K NKKKRRRDVS+    SDSLSCSTCG G+TTSNE E+ R RGR  K+KGNM KT   RY SKS S CSL SEGSD+QNEV+D 
Subjt:  PRKRKNLKRDDHCEVKKTNKKKRRRDVSISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTERSRYRSKSHSACSLCSEGSDHQNEVEDG

Query:  SYVENNFRRLRSVIIVVGEENKLETFDENGQEEEVVHQPDDDHLSFGVMGSKDGASKRELDYVTSKEAPEVENKKEVVIPDNRNSMVVKDDGVQNEGSNN
        SYVE NFRRLRS+I+VVGEENKL+TF  N +++EV +Q  DDH S G M SKD   KR LDYV +KEA  VEN+KEV +P+ RNSMVVKD GVQNEGSN 
Subjt:  SYVENNFRRLRSVIIVVGEENKLETFDENGQEEEVVHQPDDDHLSFGVMGSKDGASKRELDYVTSKEAPEVENKKEVVIPDNRNSMVVKDDGVQNEGSNN

Query:  NHGVVIHDRSLNERKNGCSGNNDSINCIDLESILRQRALENLRKFKRVPLRNVKTPDNCKVDNNNDAKQLHSPVSKSVHVTSPRDDAKINGNGLSRQGGG
        NHG V +D S +E KNGCS N DSINCIDLES+LRQRALENLRKFK    RNV+T  NCKVD+NN AKQL SPVS SVHVTSPR++A+IN    SRQGGG
Subjt:  NHGVVIHDRSLNERKNGCSGNNDSINCIDLESILRQRALENLRKFKRVPLRNVKTPDNCKVDNNNDAKQLHSPVSKSVHVTSPRDDAKINGNGLSRQGGG

Query:  NEVNSMIVEENGVISTDAIDSAVASMHDPVYSSQNLGRTSNGSNGMNELKQDVSSVDQEAINNNICQKADADICPTTNRSNLIIAASRAESKVDSLMKQA
        N +NSMI++ENGV S DAIDSAVA+MHDPVYSSQNLG+ SNGSNGMNE KQD+SS+DQE IN+NICQKADADIC TTNRSNL+IAA R E KVDSL+KQ 
Subjt:  NEVNSMIVEENGVISTDAIDSAVASMHDPVYSSQNLGRTSNGSNGMNELKQDVSSVDQEAINNNICQKADADICPTTNRSNLIIAASRAESKVDSLMKQA

Query:  SASQESIQTKPSISDIGVDETAETQTQMRNNDDQNIGNGFGSSAHKHSSLNSISGEPSSNKSGHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPA
        SA+QES+QTKPSISD+GV ETA+ QTQMRNNDD NI NG GSSA++ SSLNSISGE S N S  ESGE SQFEQKTMSVMRGGEMVQVNYKVYIPKRAPA
Subjt:  SASQESIQTKPSISDIGVDETAETQTQMRNNDDQNIGNGFGSSAHKHSSLNSISGEPSSNKSGHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPA

Query:  LTRRQLKR
        LTRRQLKR
Subjt:  LTRRQLKR

A0A6J1CDR0 uncharacterized protein LOC1110106012.4e-24180Show/hide
Query:  MGKSSSSRKKERSKTSSSQRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSGSEGDEKLGRSRSKTRKNAKPSKKKAKKRSDDRRSRDYSPH
        MGK   SRKKERSKTSSSQRSRRK+RSSRKLKSKKLRYRHDSPSCSDTDFESSTS+SSS SE DEK+GRSRS   KNAKP KK+AKKRS D + RD SPH
Subjt:  MGKSSSSRKKERSKTSSSQRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSGSEGDEKLGRSRSKTRKNAKPSKKKAKKRSDDRRSRDYSPH

Query:  PRKRKNLKRDDHCEVKKTNK-KKRRRDVSISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTERSRYRSKSHSACSLCSEGSDHQNEVED
        PRKRK+ KR D CEVKKTNK KKRRRDVS+SATS DSLSCSTCG+GSTTSNE EIDRHRGRS K+K N GKTERSRYRSKSHS CSLCSEGSD+QNEVED
Subjt:  PRKRKNLKRDDHCEVKKTNK-KKRRRDVSISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTERSRYRSKSHSACSLCSEGSDHQNEVED

Query:  GSYVENNFRRLRSVIIVVGEENKLETFDENGQEEEVVHQPDDDHLSFGVMGSKDGASKRELDYVTSKEAPEVENKKEVVIPDNRNSMVVKDDGVQNEGSN
        GSYVENNFRRLRSVI+VVGEENKL+TFD N Q+EEV+H PDDDH SFG M S DG SKRELD VTS EA EVENKKEVVIPD RN +VVKD GVQNEGSN
Subjt:  GSYVENNFRRLRSVIIVVGEENKLETFDENGQEEEVVHQPDDDHLSFGVMGSKDGASKRELDYVTSKEAPEVENKKEVVIPDNRNSMVVKDDGVQNEGSN

Query:  NNHGVVIHDRSLNERKNGCSGNNDSINCIDLESILRQRALENLRKFKRVPLRNVKTPDNCKVDNNNDAKQLHSPVSKSVHVTSPRDDAKINGNGLSRQGG
        NNHG V +D  LNE  NG SGN D INCIDLESILRQRALENLRKFK VP +NV+T  NC+VDN+NDAKQL+SPVS SV + SPRDDA+ING G S QGG
Subjt:  NNHGVVIHDRSLNERKNGCSGNNDSINCIDLESILRQRALENLRKFKRVPLRNVKTPDNCKVDNNNDAKQLHSPVSKSVHVTSPRDDAKINGNGLSRQGG

Query:  GNEVNSMIVEENGVISTDAIDSAVASMHDPVYSSQNLGRTSNGSNGMNELKQDVSSVDQEAINNNICQKADADICPTTNRSNLIIAASRAESKVDSLMKQ
        GN VN MIVEENGV ST+AIDSAVAS HDP+YSSQNLG+ S+ SNGMNELKQD+SS+DQEA+N+NICQK DADIC TT+RSNL+ AA R +SKVD L+KQ
Subjt:  GNEVNSMIVEENGVISTDAIDSAVASMHDPVYSSQNLGRTSNGSNGMNELKQDVSSVDQEAINNNICQKADADICPTTNRSNLIIAASRAESKVDSLMKQ

Query:  ASASQESIQTKPSISDIGVDETAETQTQMRNNDDQNIGNGFGSSAHK-HSSLNSISGEPSSNKSGHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRA
        ASA QE IQTKPSISD+GVDE A+ Q Q RNNDDQNI NGF SSAHK  SSLN  SGE S NK  HESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRA
Subjt:  ASASQESIQTKPSISDIGVDETAETQTQMRNNDDQNIGNGFGSSAHK-HSSLNSISGEPSSNKSGHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRA

Query:  PALTRRQLKR
        PAL RRQLKR
Subjt:  PALTRRQLKR

A0A6J1F6D1 uncharacterized protein LOC111442542 isoform X13.2e-23076.52Show/hide
Query:  MGKSSSSRKKERSKTSSSQRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSGSEGDEKLGRSRSKTRKNAKPSKKKAKKRSDDRRSRDYSPH
        MGK+SSSRKKERSKTSSSQRSRRKS+SSR+LKSKKLRYRHDSPSCSDTDFESSTSVSSS SE D+K+ RSRSKTRKN+KPSKK++KK+S D +SR+ SPH
Subjt:  MGKSSSSRKKERSKTSSSQRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSGSEGDEKLGRSRSKTRKNAKPSKKKAKKRSDDRRSRDYSPH

Query:  PRKRKNLKRDDHCEVKKTNKKKRRRDVSISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTERSRYRSKSHSACSLCSEGSDHQNEVEDG
        PRKRK+ KR D  E KKTNKKKRRRD S+ AT+SDSL  STCG+GS+TS++ EIDR RGRS K+K NM KTE  RYRSKS S CSLCS+GSD QNEVED 
Subjt:  PRKRKNLKRDDHCEVKKTNKKKRRRDVSISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTERSRYRSKSHSACSLCSEGSDHQNEVEDG

Query:  SYVENNFRRLRSVIIVVGEENKLETFDENGQEEEVVHQPDDDHLSFGVMGSKDGASKRELDYVTSKEAPEVENKKEVVIPDNRNSMVVKDDGVQNEGSNN
        SYVEN+ RRL+S+I+VVGEE++L+TF  N Q+E V HQ DD+H SFG M SKDG SKRELDYV SKEAPEVE+K ++V PDNRNS+++ DDGV+NEGSN 
Subjt:  SYVENNFRRLRSVIIVVGEENKLETFDENGQEEEVVHQPDDDHLSFGVMGSKDGASKRELDYVTSKEAPEVENKKEVVIPDNRNSMVVKDDGVQNEGSNN

Query:  NHGVVIHDRSLNERKNGCSGNNDSINCIDLESILRQRALENLRKFKRVPLRNVKTPDNCKVDNNNDAKQLHSPVSKSVHVTSPRDDAKINGNGLSRQGGG
        NHG V +D SL+ERKNGCSGN +SINCIDLESILRQ+ALENLRKFK V  RNV+   NCKV+NNNDAKQL SPVSKSVHVT PRDDA+IN  G SRQ GG
Subjt:  NHGVVIHDRSLNERKNGCSGNNDSINCIDLESILRQRALENLRKFKRVPLRNVKTPDNCKVDNNNDAKQLHSPVSKSVHVTSPRDDAKINGNGLSRQGGG

Query:  NEVNSMIVEENGVISTDAIDSAVASMHDPVYSSQNLGRTSNGSNGMNELKQDVSSVDQEAINNNICQKADADICPTTNRSNLIIAASRAESKVDSLMKQA
        + VNSMIV+ NG  STDAID+AVASMHDPV SSQNLG+ SNGSNGMNE KQD+SS+DQE IN+NIC KADADI  TTNRSNL+IAA R ESKVDS +K+A
Subjt:  NEVNSMIVEENGVISTDAIDSAVASMHDPVYSSQNLGRTSNGSNGMNELKQDVSSVDQEAINNNICQKADADICPTTNRSNLIIAASRAESKVDSLMKQA

Query:  SASQESIQTKPSISDIGVDETAETQTQMRNNDDQNIGNGFGSSAHK-HSSLNSISGEPSSNKSGHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAP
        S  QE IQTK SISDI VDETA+TQTQM NNDDQNI NGFGSSA+K  SSLN ISGE   +KS  ESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAP
Subjt:  SASQESIQTKPSISDIGVDETAETQTQMRNNDDQNIGNGFGSSAHK-HSSLNSISGEPSSNKSGHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAP

Query:  ALTRRQLKR
        ALTRRQLKR
Subjt:  ALTRRQLKR

A0A6J1IGY0 uncharacterized protein LOC111476850 isoform X16.0e-22975.86Show/hide
Query:  MGKSSSSRKKERSKTSSSQRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSGSEGDEKLGRSRSKTRKNAKPSKKKAKKRSDDRRSRDYSPH
        MGK+SSSRKKERSKTSSSQRSRRKS+SSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSS SE D+K+ RSRSKTRKN+KPSKK++KK+S D +SR+ SPH
Subjt:  MGKSSSSRKKERSKTSSSQRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSGSEGDEKLGRSRSKTRKNAKPSKKKAKKRSDDRRSRDYSPH

Query:  PRKRKNLKRDDHCEVKKTNKKKRRRDVSISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTERSRYRSKSHSACSLCSEGSDHQNEVEDG
        PRKRK+ KR+D  E KKTNKKKRRRDVS+ AT+SDSLS STCG+GS+TS++ EIDR RGRS K+K NM KTE  RYRSKS S CSLCS+GSD QNEVED 
Subjt:  PRKRKNLKRDDHCEVKKTNKKKRRRDVSISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTERSRYRSKSHSACSLCSEGSDHQNEVEDG

Query:  SYVENNFRRLRSVIIVVGEENKLETFDENGQEEEVVHQPDDDHLSFGVMGSKDGASKRELDYVTSKEAPEVENKKEVVIPDNRNSMVVKDDGVQNEGSNN
        SYV+N  RRL+S+I+VVGEE++L+TF  N Q+E V HQ DD+H  F  M SKDG  KRELDYV SKEAPEVE+K ++  PDNRNS+++ +DGV+NEGSN 
Subjt:  SYVENNFRRLRSVIIVVGEENKLETFDENGQEEEVVHQPDDDHLSFGVMGSKDGASKRELDYVTSKEAPEVENKKEVVIPDNRNSMVVKDDGVQNEGSNN

Query:  NHGVVIHDRSLNERKNGCSGNNDSINCIDLESILRQRALENLRKFKRVPLRNVKTPDNCKVDNNNDAKQLHSPVSKSVHVTSPRDDAKINGNGLSRQGGG
        NHG V +D SL+ERKNGCSGN D+INCIDLESILRQ+ALENLRKFK    RNV+   NCKV+NNNDAKQL SPVSKSVHV SPRDDA+ NG G SRQ GG
Subjt:  NHGVVIHDRSLNERKNGCSGNNDSINCIDLESILRQRALENLRKFKRVPLRNVKTPDNCKVDNNNDAKQLHSPVSKSVHVTSPRDDAKINGNGLSRQGGG

Query:  NEVNSMIVEENGVISTDAIDSAVASMHDPVYSSQNLGRTSNGSNGMNELKQDVSSVDQEAINNNICQKADADICPTTNRSNLIIAASRAESKVDSLMKQA
        + VNSMI++ NG  STDAID+AVASMHDPV SSQNLG+ SNGSNGMNELKQD+SS+DQE IN+NIC KADA+I  TTNRSNL+IAA R ESKVDSL+++A
Subjt:  NEVNSMIVEENGVISTDAIDSAVASMHDPVYSSQNLGRTSNGSNGMNELKQDVSSVDQEAINNNICQKADADICPTTNRSNLIIAASRAESKVDSLMKQA

Query:  SASQESIQTKPSISDIGVDETAETQTQMRNNDDQNIGNGFGSSAHK-HSSLNSISGEPSSNKSGHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAP
        SA+QE IQTKPSISDI VDE ++TQTQ  NNDDQNI NGFGSSA+K  SSLNSISGE S +KS  ESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAP
Subjt:  SASQESIQTKPSISDIGVDETAETQTQMRNNDDQNIGNGFGSSAHK-HSSLNSISGEPSSNKSGHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAP

Query:  ALTRRQLKR
        ALTRRQLKR
Subjt:  ALTRRQLKR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G53930.1 unknown protein8.3e-1328.53Show/hide
Query:  MGKSSSSRKKERSKTSSSQRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSGSEGDEKLGRSRSKTRKNAKPSKKKAKKRSDDRRSRDYSPH
        MGKSSSS K    K SS++    K + S++ KSKK+R   D    S +D     S   S SE D      R K ++ +K SKK+++KR     S D S  
Subjt:  MGKSSSSRKKERSKTSSSQRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSGSEGDEKLGRSRSKTRKNAKPSKKKAKKRSDDRRSRDYSPH

Query:  PR---KRKNLKRDDHCEVKKTNK----KKRRRDVSISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTERSRYRSKSHSACSLCSEGSDH
         R   K+K  KR D    KK  K    K+R+RD+S S+TSS+     +  +GS + +     R RGR       +GK + SR RS+        SE  D 
Subjt:  PR---KRKNLKRDDHCEVKKTNK----KKRRRDVSISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTERSRYRSKSHSACSLCSEGSDH

Query:  QNEVEDGSYVENNFRRLRSVIIVVGEENKLETFDENGQEEEVVHQPDDDHLSFGVMGSKDGASKRELDYVTSKEAPEVENKKEVVIPDNRNSMVVKDDGV
          + ED    E N RRL+S+++V            NG+ +E     +DD   +       G   REL Y  S+++ E++ +      D+ + +   D+G 
Subjt:  QNEVEDGSYVENNFRRLRSVIIVVGEENKLETFDENGQEEEVVHQPDDDHLSFGVMGSKDGASKRELDYVTSKEAPEVENKKEVVIPDNRNSMVVKDDGV

Query:  QNEGSNNNHGVVIHDRSLNERKNGCSGNNDSINCIDLESILRQRALENLRKFKRVPLRNVKTPDNCKVDNNNDAKQLHSPVSKSVHVTSPRDDAKINGNG
             +    V   D SL +               DLE+IL++RALENL++F+ V               +  AK+  S VS+                G
Subjt:  QNEGSNNNHGVVIHDRSLNERKNGCSGNNDSINCIDLESILRQRALENLRKFKRVPLRNVKTPDNCKVDNNNDAKQLHSPVSKSVHVTSPRDDAKINGNG

Query:  LSRQGGGNEVNSMIVEENGVISTDAIDSAVASMHDPVYSSQNLGRTSNGSNGMNELKQDVSSVDQEAINNNICQKADADICPTTNRSNLIIAASRAESKV
           Q    +V S   +++ ++     DSAV+     + +S+ +    N       L    S  DQ++  +    K  + +   T +  L+      +S  
Subjt:  LSRQGGGNEVNSMIVEENGVISTDAIDSAVASMHDPVYSSQNLGRTSNGSNGMNELKQDVSSVDQEAINNNICQKADADICPTTNRSNLIIAASRAESKV

Query:  DSLMKQASASQESIQTKPSISDIGVDETAETQTQMRNNDDQNI-----GNGFGSSAHKHSSLNSI----SGEPSSNKSGHESGEGSQFEQKTMSVMRGGE
         +  K+AS SQ++       S +  +    T   +  N+ ++I      +   + +  H+    +     G  S  K+  E+ + SQ+EQKTM+VMRGGE
Subjt:  DSLMKQASASQESIQTKPSISDIGVDETAETQTQMRNNDDQNI-----GNGFGSSAHKHSSLNSI----SGEPSSNKSGHESGEGSQFEQKTMSVMRGGE

Query:  MVQVNYKVYIPKRAPALTRRQLKR
        MVQV+YKVYIPK+A +L RR+L R
Subjt:  MVQVNYKVYIPKRAPALTRRQLKR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAAAGTCCTCTTCTTCTCGCAAGAAGGAGCGTTCCAAGACTTCTTCATCCCAGAGAAGTAGGAGAAAGAGTAGGAGTAGTAGGAAATTGAAGTCCAAGAAGCTGCG
GTATCGCCACGATTCTCCATCTTGCTCTGACACTGATTTTGAAAGTTCAACTTCAGTGTCTTCTTCTGGCTCGGAGGGTGATGAAAAATTGGGAAGATCTCGATCCAAGA
CGCGGAAGAATGCGAAGCCTAGTAAAAAGAAAGCTAAGAAGCGATCTGATGACCGTCGAAGTAGGGATTATTCTCCTCATCCCAGAAAGAGGAAGAATTTGAAGAGAGAC
GACCATTGCGAGGTGAAGAAGACAAACAAAAAGAAGCGTAGAAGAGATGTGAGTATTAGTGCCACAAGTAGTGACTCTTTGAGCTGCTCAACTTGTGGAGAGGGGAGTAC
AACCAGCAATGAGGGTGAAATCGATAGGCATAGGGGCAGGTCTAGAAAGAAGAAAGGAAATATGGGGAAGACTGAAAGAAGTAGATACAGGTCAAAGAGTCATTCAGCAT
GCTCTTTATGTAGTGAAGGTAGTGATCATCAGAATGAGGTTGAGGATGGCAGTTATGTTGAAAACAACTTTAGACGATTAAGATCTGTAATTATTGTAGTAGGAGAGGAA
AATAAATTAGAGACGTTTGATGAGAATGGACAAGAAGAAGAGGTCGTGCATCAGCCTGATGATGACCACCTGTCTTTTGGAGTTATGGGCAGTAAGGATGGGGCAAGTAA
AAGAGAATTAGACTATGTTACATCGAAAGAGGCACCAGAGGTAGAAAACAAAAAAGAAGTGGTTATACCTGACAATAGAAACTCTATGGTTGTAAAGGATGATGGAGTTC
AAAATGAGGGAAGCAACAATAACCATGGAGTAGTAATTCATGACCGTTCTTTAAATGAAAGAAAGAATGGCTGTTCTGGAAATAATGACAGCATAAATTGTATCGATTTA
GAGTCAATTTTGAGACAGAGGGCTTTGGAAAACCTAAGAAAGTTCAAACGGGTGCCCCTAAGGAATGTGAAAACTCCTGATAATTGCAAAGTTGACAATAATAATGATGC
AAAGCAATTGCACTCTCCCGTCTCTAAGTCAGTTCACGTGACTTCCCCTAGGGATGATGCCAAGATAAATGGTAACGGGCTCTCTAGACAAGGTGGAGGGAATGAAGTAA
ATTCAATGATAGTTGAAGAGAATGGTGTCATATCTACTGATGCAATAGATTCAGCAGTTGCATCTATGCATGATCCTGTCTATTCTTCACAGAATCTGGGTAGGACTTCC
AATGGAAGCAATGGTATGAATGAACTGAAGCAGGATGTTTCTTCAGTAGACCAGGAGGCTATAAATAATAATATTTGCCAGAAGGCAGATGCAGATATTTGTCCTACAAC
TAACAGAAGCAATTTGATTATTGCAGCTTCGAGGGCTGAGTCCAAAGTTGATTCTCTTATGAAGCAGGCATCTGCTTCTCAGGAATCTATCCAAACAAAGCCATCTATAT
CTGACATTGGTGTTGATGAGACAGCTGAAACTCAGACCCAGATGAGGAATAATGATGATCAAAACATTGGTAATGGTTTTGGTTCTTCAGCTCACAAGCATTCTTCCCTT
AATTCTATTTCAGGAGAACCCAGCTCTAACAAGTCTGGACACGAGAGTGGCGAAGGCTCGCAGTTTGAACAGAAAACCATGTCCGTGATGCGGGGTGGTGAAATGGTGCA
GGTGAACTACAAGGTCTACATCCCAAAGAGAGCTCCTGCTTTGACTAGGAGGCAACTCAAGCGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGAAAGTCCTCTTCTTCTCGCAAGAAGGAGCGTTCCAAGACTTCTTCATCCCAGAGAAGTAGGAGAAAGAGTAGGAGTAGTAGGAAATTGAAGTCCAAGAAGCTGCG
GTATCGCCACGATTCTCCATCTTGCTCTGACACTGATTTTGAAAGTTCAACTTCAGTGTCTTCTTCTGGCTCGGAGGGTGATGAAAAATTGGGAAGATCTCGATCCAAGA
CGCGGAAGAATGCGAAGCCTAGTAAAAAGAAAGCTAAGAAGCGATCTGATGACCGTCGAAGTAGGGATTATTCTCCTCATCCCAGAAAGAGGAAGAATTTGAAGAGAGAC
GACCATTGCGAGGTGAAGAAGACAAACAAAAAGAAGCGTAGAAGAGATGTGAGTATTAGTGCCACAAGTAGTGACTCTTTGAGCTGCTCAACTTGTGGAGAGGGGAGTAC
AACCAGCAATGAGGGTGAAATCGATAGGCATAGGGGCAGGTCTAGAAAGAAGAAAGGAAATATGGGGAAGACTGAAAGAAGTAGATACAGGTCAAAGAGTCATTCAGCAT
GCTCTTTATGTAGTGAAGGTAGTGATCATCAGAATGAGGTTGAGGATGGCAGTTATGTTGAAAACAACTTTAGACGATTAAGATCTGTAATTATTGTAGTAGGAGAGGAA
AATAAATTAGAGACGTTTGATGAGAATGGACAAGAAGAAGAGGTCGTGCATCAGCCTGATGATGACCACCTGTCTTTTGGAGTTATGGGCAGTAAGGATGGGGCAAGTAA
AAGAGAATTAGACTATGTTACATCGAAAGAGGCACCAGAGGTAGAAAACAAAAAAGAAGTGGTTATACCTGACAATAGAAACTCTATGGTTGTAAAGGATGATGGAGTTC
AAAATGAGGGAAGCAACAATAACCATGGAGTAGTAATTCATGACCGTTCTTTAAATGAAAGAAAGAATGGCTGTTCTGGAAATAATGACAGCATAAATTGTATCGATTTA
GAGTCAATTTTGAGACAGAGGGCTTTGGAAAACCTAAGAAAGTTCAAACGGGTGCCCCTAAGGAATGTGAAAACTCCTGATAATTGCAAAGTTGACAATAATAATGATGC
AAAGCAATTGCACTCTCCCGTCTCTAAGTCAGTTCACGTGACTTCCCCTAGGGATGATGCCAAGATAAATGGTAACGGGCTCTCTAGACAAGGTGGAGGGAATGAAGTAA
ATTCAATGATAGTTGAAGAGAATGGTGTCATATCTACTGATGCAATAGATTCAGCAGTTGCATCTATGCATGATCCTGTCTATTCTTCACAGAATCTGGGTAGGACTTCC
AATGGAAGCAATGGTATGAATGAACTGAAGCAGGATGTTTCTTCAGTAGACCAGGAGGCTATAAATAATAATATTTGCCAGAAGGCAGATGCAGATATTTGTCCTACAAC
TAACAGAAGCAATTTGATTATTGCAGCTTCGAGGGCTGAGTCCAAAGTTGATTCTCTTATGAAGCAGGCATCTGCTTCTCAGGAATCTATCCAAACAAAGCCATCTATAT
CTGACATTGGTGTTGATGAGACAGCTGAAACTCAGACCCAGATGAGGAATAATGATGATCAAAACATTGGTAATGGTTTTGGTTCTTCAGCTCACAAGCATTCTTCCCTT
AATTCTATTTCAGGAGAACCCAGCTCTAACAAGTCTGGACACGAGAGTGGCGAAGGCTCGCAGTTTGAACAGAAAACCATGTCCGTGATGCGGGGTGGTGAAATGGTGCA
GGTGAACTACAAGGTCTACATCCCAAAGAGAGCTCCTGCTTTGACTAGGAGGCAACTCAAGCGGTGA
Protein sequenceShow/hide protein sequence
MGKSSSSRKKERSKTSSSQRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSGSEGDEKLGRSRSKTRKNAKPSKKKAKKRSDDRRSRDYSPHPRKRKNLKRD
DHCEVKKTNKKKRRRDVSISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTERSRYRSKSHSACSLCSEGSDHQNEVEDGSYVENNFRRLRSVIIVVGEE
NKLETFDENGQEEEVVHQPDDDHLSFGVMGSKDGASKRELDYVTSKEAPEVENKKEVVIPDNRNSMVVKDDGVQNEGSNNNHGVVIHDRSLNERKNGCSGNNDSINCIDL
ESILRQRALENLRKFKRVPLRNVKTPDNCKVDNNNDAKQLHSPVSKSVHVTSPRDDAKINGNGLSRQGGGNEVNSMIVEENGVISTDAIDSAVASMHDPVYSSQNLGRTS
NGSNGMNELKQDVSSVDQEAINNNICQKADADICPTTNRSNLIIAASRAESKVDSLMKQASASQESIQTKPSISDIGVDETAETQTQMRNNDDQNIGNGFGSSAHKHSSL
NSISGEPSSNKSGHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALTRRQLKR