; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10013665 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10013665
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Descriptionpre-mRNA-splicing factor CWC22 homolog isoform X1
Genome locationChr02:3594752..3596628
RNA-Seq ExpressionHG10013665
SyntenyHG10013665
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0066814.1 NK-tumor recognition protein [Cucumis melo var. makuwa]1.1e-25083.76Show/hide
Query:  MKRSRRKSKSSRKLKSKKLRYRHDSPSCSDTDFESSTSMSSSSSEDDKKVRRSRSKTRKNAKPSKKRSKKQSHDHQSRECSPHPRKRKHSKRNDRCEVKK
        MKRSRRKSKSSRKLKSKK RYRHDSPSCSDTDFESSTS+ SSSSEDDKKVRRSRSKTRKN KPSKKR KKQSHD QSRECSP+PRKRKHSKRNDR EV K
Subjt:  MKRSRRKSKSSRKLKSKKLRYRHDSPSCSDTDFESSTSMSSSSSEDDKKVRRSRSKTRKNAKPSKKRSKKQSHDHQSRECSPHPRKRKHSKRNDRCEVKK

Query:  ANKKKRRRDVSVGAYGDSLSSSTCGNGSTTSNESEVVRRRGRCGKRKGNMGKTERIRNRSKSRSPCSLYGEGSDYQNEVDGDSYVEYTFKRLRSIIVVVG
        ANKKKRRRDVSVG + DSLS STCGNG+TTSNESEVVRRRGR  KRKGNM KT   R  SKSRSPCSL+ EGSDYQNEVD DSYVE  F+RLRSIIVVVG
Subjt:  ANKKKRRRDVSVGAYGDSLSSSTCGNGSTTSNESEVVRRRGRCGKRKGNMGKTERIRNRSKSRSPCSLYGEGSDYQNEVDGDSYVEYTFKRLRSIIVVVG

Query:  EENKLKTFVGNEQQEGVTHQPDDDHPSFGDLDSKDVISKRELDCVISEEAPLVENE-EVDIPDSRNSIVVKDDGVQNEGSNKNHGGVTNDHSLDERTYGC
        EENKLKTFVGNEQQE VT+Q  DDHPS G++DSKD   KR LD V+++EA +VENE EVD+P+ RNS+VVKD GVQNEGSNKNHGGVTNDHS DE   GC
Subjt:  EENKLKTFVGNEQQEGVTHQPDDDHPSFGDLDSKDVISKRELDCVISEEAPLVENE-EVDIPDSRNSIVVKDDGVQNEGSNKNHGGVTNDHSLDERTYGC

Query:  SGNTDSVNCIDLESILRQRALENLRKFKGAPPRNVEAIANCKGDHNNDAKQLYSPDSKSVHVTSPRDDAEINSEGFSRQGGGNAVNSMTVKENGVKSTDA
        S NTDS+NCIDLES+LRQRALENLRKFKGA PRNVE IANCK DHNN AKQL SP S SVHVTSPR++AEINS+ FSRQGGGNA+NSM +KENGVKS DA
Subjt:  SGNTDSVNCIDLESILRQRALENLRKFKGAPPRNVEAIANCKGDHNNDAKQLYSPDSKSVHVTSPRDDAEINSEGFSRQGGGNAVNSMTVKENGVKSTDA

Query:  IDSAVASMHDPVYSSQNLGMISNGSNGMNELKQDISSLDQEVINDSICQKTDVDICSTTNRSNLVIAALRPEPKVDSIIKQASAAEESIQTKPSISDIGV
        IDSAVA+MHDPVYSSQNLG ISNGSNGMNE KQDISSLDQEVIND+ICQK D DICSTTNRSNLVIAALRPEPKVDS+IKQ SAA+ES+QTKPSISD+GV
Subjt:  IDSAVASMHDPVYSSQNLGMISNGSNGMNELKQDISSLDQEVINDSICQKTDVDICSTTNRSNLVIAALRPEPKVDSIIKQASAAEESIQTKPSISDIGV

Query:  DETAQTQTQMRNNDDENIHNGLGSSAHKPSSSLNSISGENSLNMSRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALTRKQLKR
         ETAQ QTQMRNNDD NI NGLGSSA++P SSLNSISGE+SLNMS  ESGE SQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALTR+QLKR
Subjt:  DETAQTQTQMRNNDDENIHNGLGSSAHKPSSSLNSISGENSLNMSRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALTRKQLKR

XP_004146100.1 uncharacterized protein LOC101205593 isoform X1 [Cucumis sativus]1.3e-25184.24Show/hide
Query:  KRSRRKSKSSRKLKSKKLRYRHDSPSCSDTDFESSTSMSSSSSEDDKKVRRSRSKTRKNAKPSKKRSKKQSHDHQSRECSPHPRKRKHSKRNDRCEVKKA
        +RSRRKSKSSRKLKSKKLRYRHDSPSCSDTDFESSTS+ SSSSE  K+VRRSRSKT+KNAKPSKKRSKKQSHD QSRECSP+PRKRKHSKRNDR EV KA
Subjt:  KRSRRKSKSSRKLKSKKLRYRHDSPSCSDTDFESSTSMSSSSSEDDKKVRRSRSKTRKNAKPSKKRSKKQSHDHQSRECSPHPRKRKHSKRNDRCEVKKA

Query:  NKKKRRRDVSVGAYGDSLSSSTCGNGSTTSNESEVVRRRGRCGKRKGNMGKTERIRNRSKSRSPCSLYGEGSDYQNEVDGDSYVEYTFKRLRSIIVVVGE
        NKKKRRRDVSVG + +SLS STCGNGSTTSNESEVVRRRGR GKRK NM KTE  R  SKS SPCSL  EGSDYQNEVD +SYVE  F+RLRSIIVVVGE
Subjt:  NKKKRRRDVSVGAYGDSLSSSTCGNGSTTSNESEVVRRRGRCGKRKGNMGKTERIRNRSKSRSPCSLYGEGSDYQNEVDGDSYVEYTFKRLRSIIVVVGE

Query:  ENKLKTFVGNEQQEGVTHQPDDDHPSFGDLDSKDVISKRELDCVISEEAPLVENE-EVDIPDSRNSIVVKDDGVQNEGSNKNHGGVTNDHSLDERTYGCS
        ENKL  +VGNE QEGVT+QP DDHPSFGD+DSKD  SKRELD VI++EAP+VENE EVD+P+ RNS+VV+DDGVQNEGSNKNHGGVTND S DE   GCS
Subjt:  ENKLKTFVGNEQQEGVTHQPDDDHPSFGDLDSKDVISKRELDCVISEEAPLVENE-EVDIPDSRNSIVVKDDGVQNEGSNKNHGGVTNDHSLDERTYGCS

Query:  GNTDSVNCIDLESILRQRALENLRKFKGAPPRNVEAIANCKGDHNNDAKQLYSPDSKSVHVTSPRDDAEINSEGFSRQGGGNAVNSMTVKENGVKSTDAI
         NTDS+NCIDLES+LRQRALENLRKFKGAPPRNVE IANCK  HNN AKQL SP SKSVHVTSPR+DAEINSE FSRQGGGNAVNSM VKENGV S DAI
Subjt:  GNTDSVNCIDLESILRQRALENLRKFKGAPPRNVEAIANCKGDHNNDAKQLYSPDSKSVHVTSPRDDAEINSEGFSRQGGGNAVNSMTVKENGVKSTDAI

Query:  DSAVASMHDPVYSSQNLGMISNGSNGMNELKQDISSLDQEVINDSICQKTDVDICSTTNRSNLVIAALRPEPKVDSIIKQASAAEESIQTKPSISDIGVD
        DSAVA+MHDPVYSSQNLG ISNGSNGMNE KQDISSLDQE+IND+ICQK + DICSTTNRSNLVIAALRP+PKVDS+IKQ SAA+ES+QTKPSISD+ V 
Subjt:  DSAVASMHDPVYSSQNLGMISNGSNGMNELKQDISSLDQEVINDSICQKTDVDICSTTNRSNLVIAALRPEPKVDSIIKQASAAEESIQTKPSISDIGVD

Query:  ETAQTQTQMRNNDDENIHNGLGSSAHKPSSSLNSISGENSLNMSRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALTRKQLKR
        ETAQTQTQMRNN+D NI NGLGSSAHKP SSLNSISGENSL+MS HESGE SQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALTR+QLKR
Subjt:  ETAQTQTQMRNNDDENIHNGLGSSAHKPSSSLNSISGENSLNMSRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALTRKQLKR

XP_008463689.1 PREDICTED: uncharacterized protein LOC103501777 isoform X1 [Cucumis melo]1.1e-25083.76Show/hide
Query:  MKRSRRKSKSSRKLKSKKLRYRHDSPSCSDTDFESSTSMSSSSSEDDKKVRRSRSKTRKNAKPSKKRSKKQSHDHQSRECSPHPRKRKHSKRNDRCEVKK
        MKRSRRKSKSSRKLKSKK RYRHDSPSCSDTDFESSTS+ SSSSEDDKKVRRSRSKTRKN KPSKKR KKQSHD QSRECSP+PRKRKHSKRNDR EV K
Subjt:  MKRSRRKSKSSRKLKSKKLRYRHDSPSCSDTDFESSTSMSSSSSEDDKKVRRSRSKTRKNAKPSKKRSKKQSHDHQSRECSPHPRKRKHSKRNDRCEVKK

Query:  ANKKKRRRDVSVGAYGDSLSSSTCGNGSTTSNESEVVRRRGRCGKRKGNMGKTERIRNRSKSRSPCSLYGEGSDYQNEVDGDSYVEYTFKRLRSIIVVVG
        ANKKKRRRDVSVG + DSLS STCGNG+TTSNESEVVRRRGR  KRKGNM KT   R  SKSRSPCSL+ EGSDYQNEVD DSYVE  F+RLRSIIVVVG
Subjt:  ANKKKRRRDVSVGAYGDSLSSSTCGNGSTTSNESEVVRRRGRCGKRKGNMGKTERIRNRSKSRSPCSLYGEGSDYQNEVDGDSYVEYTFKRLRSIIVVVG

Query:  EENKLKTFVGNEQQEGVTHQPDDDHPSFGDLDSKDVISKRELDCVISEEAPLVENE-EVDIPDSRNSIVVKDDGVQNEGSNKNHGGVTNDHSLDERTYGC
        EENKLKTFVGNEQQE VT+Q  DDHPS G++DSKD   KR LD V+++EA +VENE EVD+P+ RNS+VVKD GVQNEGSNKNHGGVTNDHS DE   GC
Subjt:  EENKLKTFVGNEQQEGVTHQPDDDHPSFGDLDSKDVISKRELDCVISEEAPLVENE-EVDIPDSRNSIVVKDDGVQNEGSNKNHGGVTNDHSLDERTYGC

Query:  SGNTDSVNCIDLESILRQRALENLRKFKGAPPRNVEAIANCKGDHNNDAKQLYSPDSKSVHVTSPRDDAEINSEGFSRQGGGNAVNSMTVKENGVKSTDA
        S NTDS+NCIDLES+LRQRALENLRKFKGA PRNVE IANCK DHNN AKQL SP S SVHVTSPR++AEINS+ FSRQGGGNA+NSM +KENGVKS DA
Subjt:  SGNTDSVNCIDLESILRQRALENLRKFKGAPPRNVEAIANCKGDHNNDAKQLYSPDSKSVHVTSPRDDAEINSEGFSRQGGGNAVNSMTVKENGVKSTDA

Query:  IDSAVASMHDPVYSSQNLGMISNGSNGMNELKQDISSLDQEVINDSICQKTDVDICSTTNRSNLVIAALRPEPKVDSIIKQASAAEESIQTKPSISDIGV
        IDSAVA+MHDPVYSSQNLG ISNGSNGMNE KQDISSLDQEVIND+ICQK D DICSTTNRSNLVIAALRPEPKVDS+IKQ SAA+ES+QTKPSISD+GV
Subjt:  IDSAVASMHDPVYSSQNLGMISNGSNGMNELKQDISSLDQEVINDSICQKTDVDICSTTNRSNLVIAALRPEPKVDSIIKQASAAEESIQTKPSISDIGV

Query:  DETAQTQTQMRNNDDENIHNGLGSSAHKPSSSLNSISGENSLNMSRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALTRKQLKR
         ETAQ QTQMRNNDD NI NGLGSSA++P SSLNSISGE+SLNMS  ESGE SQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALTR+QLKR
Subjt:  DETAQTQTQMRNNDDENIHNGLGSSAHKPSSSLNSISGENSLNMSRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALTRKQLKR

XP_008463691.1 PREDICTED: uncharacterized protein LOC103501777 isoform X2 [Cucumis melo]1.2e-24983.56Show/hide
Query:  KRSRRKSKSSRKLKSKKLRYRHDSPSCSDTDFESSTSMSSSSSEDDKKVRRSRSKTRKNAKPSKKRSKKQSHDHQSRECSPHPRKRKHSKRNDRCEVKKA
        +RSRRKSKSSRKLKSKK RYRHDSPSCSDTDFESSTS+ SSSSEDDKKVRRSRSKTRKN KPSKKR KKQSHD QSRECSP+PRKRKHSKRNDR EV KA
Subjt:  KRSRRKSKSSRKLKSKKLRYRHDSPSCSDTDFESSTSMSSSSSEDDKKVRRSRSKTRKNAKPSKKRSKKQSHDHQSRECSPHPRKRKHSKRNDRCEVKKA

Query:  NKKKRRRDVSVGAYGDSLSSSTCGNGSTTSNESEVVRRRGRCGKRKGNMGKTERIRNRSKSRSPCSLYGEGSDYQNEVDGDSYVEYTFKRLRSIIVVVGE
        NKKKRRRDVSVG + DSLS STCGNG+TTSNESEVVRRRGR  KRKGNM KT   R  SKSRSPCSL+ EGSDYQNEVD DSYVE  F+RLRSIIVVVGE
Subjt:  NKKKRRRDVSVGAYGDSLSSSTCGNGSTTSNESEVVRRRGRCGKRKGNMGKTERIRNRSKSRSPCSLYGEGSDYQNEVDGDSYVEYTFKRLRSIIVVVGE

Query:  ENKLKTFVGNEQQEGVTHQPDDDHPSFGDLDSKDVISKRELDCVISEEAPLVENE-EVDIPDSRNSIVVKDDGVQNEGSNKNHGGVTNDHSLDERTYGCS
        ENKLKTFVGNEQQE VT+Q  DDHPS G++DSKD   KR LD V+++EA +VENE EVD+P+ RNS+VVKD GVQNEGSNKNHGGVTNDHS DE   GCS
Subjt:  ENKLKTFVGNEQQEGVTHQPDDDHPSFGDLDSKDVISKRELDCVISEEAPLVENE-EVDIPDSRNSIVVKDDGVQNEGSNKNHGGVTNDHSLDERTYGCS

Query:  GNTDSVNCIDLESILRQRALENLRKFKGAPPRNVEAIANCKGDHNNDAKQLYSPDSKSVHVTSPRDDAEINSEGFSRQGGGNAVNSMTVKENGVKSTDAI
         NTDS+NCIDLES+LRQRALENLRKFKGA PRNVE IANCK DHNN AKQL SP S SVHVTSPR++AEINS+ FSRQGGGNA+NSM +KENGVKS DAI
Subjt:  GNTDSVNCIDLESILRQRALENLRKFKGAPPRNVEAIANCKGDHNNDAKQLYSPDSKSVHVTSPRDDAEINSEGFSRQGGGNAVNSMTVKENGVKSTDAI

Query:  DSAVASMHDPVYSSQNLGMISNGSNGMNELKQDISSLDQEVINDSICQKTDVDICSTTNRSNLVIAALRPEPKVDSIIKQASAAEESIQTKPSISDIGVD
        DSAVA+MHDPVYSSQNLG ISNGSNGMNE KQDISSLDQEVIND+ICQK D DICSTTNRSNLVIAALRPEPKVDS+IKQ SAA+ES+QTKPSISD+GV 
Subjt:  DSAVASMHDPVYSSQNLGMISNGSNGMNELKQDISSLDQEVINDSICQKTDVDICSTTNRSNLVIAALRPEPKVDSIIKQASAAEESIQTKPSISDIGVD

Query:  ETAQTQTQMRNNDDENIHNGLGSSAHKPSSSLNSISGENSLNMSRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALTRKQLKR
        ETAQ QTQMRNNDD NI NGLGSSA++P SSLNSISGE+SLNMS  ESGE SQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALTR+QLKR
Subjt:  ETAQTQTQMRNNDDENIHNGLGSSAHKPSSSLNSISGENSLNMSRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALTRKQLKR

XP_038897880.1 histone-lysine N-methyltransferase SETD2 isoform X1 [Benincasa hispida]2.0e-26386.49Show/hide
Query:  KRSRRKSKSSRKLKSKKLRYRHDSPSCSDTDFESSTSMSSSSSEDDKKVRRSRSKTRKNAKPSKKRSKKQSHDHQSRECSPHPRKRKHSKRNDRCEVKKA
        +RSRRKSKSS+KLKSKKLRYRHDSPSCSDTDFESSTS+ SSSSEDDK+VRRSRSKTRKNAKPSKKRSK+QSHD QSRE SPHPRKRKHSKRND CE KKA
Subjt:  KRSRRKSKSSRKLKSKKLRYRHDSPSCSDTDFESSTSMSSSSSEDDKKVRRSRSKTRKNAKPSKKRSKKQSHDHQSRECSPHPRKRKHSKRNDRCEVKKA

Query:  NKKKRRRDVSVGAYGDSLSSSTCGNGSTTSNESEVVRRRGRCGKRKGNMGKTERIRNRSKSRSPCSLYGEGSDYQNEVDGDSYVEYTFKRLRSIIVVVGE
         KKKRRRD SVGAY DS S STCGNGSTTSNESEVVRRRGR GKRKGNMGKTER R RSKSRSPCSL  + SDYQNEVD DSYV   F+RLRSIIV+ GE
Subjt:  NKKKRRRDVSVGAYGDSLSSSTCGNGSTTSNESEVVRRRGRCGKRKGNMGKTERIRNRSKSRSPCSLYGEGSDYQNEVDGDSYVEYTFKRLRSIIVVVGE

Query:  ENKLKTFVGNEQQEGVTHQPD--DDHPSFGDLDSKDVISKRELDCVISEEAPLVE-NEEVDIPDSRNSIVVKDDGVQNEGSNKNHGGVTNDHSLDERTYG
        ENKLKTF GNEQQEG THQP+  DDHPS GD+DSKD  SKRELD VIS+E P+VE  +EVD+P++RNS+VVKDDGVQNEGSNKN GGVTNDHSLDER  G
Subjt:  ENKLKTFVGNEQQEGVTHQPD--DDHPSFGDLDSKDVISKRELDCVISEEAPLVE-NEEVDIPDSRNSIVVKDDGVQNEGSNKNHGGVTNDHSLDERTYG

Query:  CSGNTDSVNCIDLESILRQRALENLRKFKGAPPRNVEAIANCKGDHNNDAKQLYSPDSKSVHVTSPRDDAEINSEGFSRQGGGNAVNSMTVKENGVKSTD
        CSG TDSVN IDLESILRQRALENLRKFKGAPPRNVE IANCK DHNNDAKQL SP SKSVHVTSPRDDAEINS+GFSRQGGGNAVNSM VKENGVKSTD
Subjt:  CSGNTDSVNCIDLESILRQRALENLRKFKGAPPRNVEAIANCKGDHNNDAKQLYSPDSKSVHVTSPRDDAEINSEGFSRQGGGNAVNSMTVKENGVKSTD

Query:  AIDSAVASMHDPVYSSQNLGMISNGSNGMNELKQDISSLDQEVINDSICQKTDVDICSTTNRSNLVIAALRPEPKVDSIIKQASAAEESIQTKPSISDIG
        AIDS+V SMHDPVYSSQNLG ISNGSNGMNELKQ+ISSLDQEVIND+ICQK D DICSTTNRSNLVIAALRPE KVDS+IKQA AA+ESIQTKPSISDIG
Subjt:  AIDSAVASMHDPVYSSQNLGMISNGSNGMNELKQDISSLDQEVINDSICQKTDVDICSTTNRSNLVIAALRPEPKVDSIIKQASAAEESIQTKPSISDIG

Query:  VDETAQTQTQMRNNDDENIHNGLGSSAHKPSSSLNSISGENSLNMSRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALTRKQLKR
        VDETAQTQTQMRNNDD+NI NGL SSAHKP SSLNSISGENSL+ SRHESG+ SQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALTR+QLKR
Subjt:  VDETAQTQTQMRNNDDENIHNGLGSSAHKPSSSLNSISGENSLNMSRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALTRKQLKR

TrEMBL top hitse value%identityAlignment
A0A0A0L248 Uncharacterized protein6.4e-25284.24Show/hide
Query:  KRSRRKSKSSRKLKSKKLRYRHDSPSCSDTDFESSTSMSSSSSEDDKKVRRSRSKTRKNAKPSKKRSKKQSHDHQSRECSPHPRKRKHSKRNDRCEVKKA
        +RSRRKSKSSRKLKSKKLRYRHDSPSCSDTDFESSTS+ SSSSE  K+VRRSRSKT+KNAKPSKKRSKKQSHD QSRECSP+PRKRKHSKRNDR EV KA
Subjt:  KRSRRKSKSSRKLKSKKLRYRHDSPSCSDTDFESSTSMSSSSSEDDKKVRRSRSKTRKNAKPSKKRSKKQSHDHQSRECSPHPRKRKHSKRNDRCEVKKA

Query:  NKKKRRRDVSVGAYGDSLSSSTCGNGSTTSNESEVVRRRGRCGKRKGNMGKTERIRNRSKSRSPCSLYGEGSDYQNEVDGDSYVEYTFKRLRSIIVVVGE
        NKKKRRRDVSVG + +SLS STCGNGSTTSNESEVVRRRGR GKRK NM KTE  R  SKS SPCSL  EGSDYQNEVD +SYVE  F+RLRSIIVVVGE
Subjt:  NKKKRRRDVSVGAYGDSLSSSTCGNGSTTSNESEVVRRRGRCGKRKGNMGKTERIRNRSKSRSPCSLYGEGSDYQNEVDGDSYVEYTFKRLRSIIVVVGE

Query:  ENKLKTFVGNEQQEGVTHQPDDDHPSFGDLDSKDVISKRELDCVISEEAPLVENE-EVDIPDSRNSIVVKDDGVQNEGSNKNHGGVTNDHSLDERTYGCS
        ENKL  +VGNE QEGVT+QP DDHPSFGD+DSKD  SKRELD VI++EAP+VENE EVD+P+ RNS+VV+DDGVQNEGSNKNHGGVTND S DE   GCS
Subjt:  ENKLKTFVGNEQQEGVTHQPDDDHPSFGDLDSKDVISKRELDCVISEEAPLVENE-EVDIPDSRNSIVVKDDGVQNEGSNKNHGGVTNDHSLDERTYGCS

Query:  GNTDSVNCIDLESILRQRALENLRKFKGAPPRNVEAIANCKGDHNNDAKQLYSPDSKSVHVTSPRDDAEINSEGFSRQGGGNAVNSMTVKENGVKSTDAI
         NTDS+NCIDLES+LRQRALENLRKFKGAPPRNVE IANCK  HNN AKQL SP SKSVHVTSPR+DAEINSE FSRQGGGNAVNSM VKENGV S DAI
Subjt:  GNTDSVNCIDLESILRQRALENLRKFKGAPPRNVEAIANCKGDHNNDAKQLYSPDSKSVHVTSPRDDAEINSEGFSRQGGGNAVNSMTVKENGVKSTDAI

Query:  DSAVASMHDPVYSSQNLGMISNGSNGMNELKQDISSLDQEVINDSICQKTDVDICSTTNRSNLVIAALRPEPKVDSIIKQASAAEESIQTKPSISDIGVD
        DSAVA+MHDPVYSSQNLG ISNGSNGMNE KQDISSLDQE+IND+ICQK + DICSTTNRSNLVIAALRP+PKVDS+IKQ SAA+ES+QTKPSISD+ V 
Subjt:  DSAVASMHDPVYSSQNLGMISNGSNGMNELKQDISSLDQEVINDSICQKTDVDICSTTNRSNLVIAALRPEPKVDSIIKQASAAEESIQTKPSISDIGVD

Query:  ETAQTQTQMRNNDDENIHNGLGSSAHKPSSSLNSISGENSLNMSRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALTRKQLKR
        ETAQTQTQMRNN+D NI NGLGSSAHKP SSLNSISGENSL+MS HESGE SQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALTR+QLKR
Subjt:  ETAQTQTQMRNNDDENIHNGLGSSAHKPSSSLNSISGENSLNMSRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALTRKQLKR

A0A1S3CJV0 uncharacterized protein LOC103501777 isoform X26.0e-25083.56Show/hide
Query:  KRSRRKSKSSRKLKSKKLRYRHDSPSCSDTDFESSTSMSSSSSEDDKKVRRSRSKTRKNAKPSKKRSKKQSHDHQSRECSPHPRKRKHSKRNDRCEVKKA
        +RSRRKSKSSRKLKSKK RYRHDSPSCSDTDFESSTS+ SSSSEDDKKVRRSRSKTRKN KPSKKR KKQSHD QSRECSP+PRKRKHSKRNDR EV KA
Subjt:  KRSRRKSKSSRKLKSKKLRYRHDSPSCSDTDFESSTSMSSSSSEDDKKVRRSRSKTRKNAKPSKKRSKKQSHDHQSRECSPHPRKRKHSKRNDRCEVKKA

Query:  NKKKRRRDVSVGAYGDSLSSSTCGNGSTTSNESEVVRRRGRCGKRKGNMGKTERIRNRSKSRSPCSLYGEGSDYQNEVDGDSYVEYTFKRLRSIIVVVGE
        NKKKRRRDVSVG + DSLS STCGNG+TTSNESEVVRRRGR  KRKGNM KT   R  SKSRSPCSL+ EGSDYQNEVD DSYVE  F+RLRSIIVVVGE
Subjt:  NKKKRRRDVSVGAYGDSLSSSTCGNGSTTSNESEVVRRRGRCGKRKGNMGKTERIRNRSKSRSPCSLYGEGSDYQNEVDGDSYVEYTFKRLRSIIVVVGE

Query:  ENKLKTFVGNEQQEGVTHQPDDDHPSFGDLDSKDVISKRELDCVISEEAPLVENE-EVDIPDSRNSIVVKDDGVQNEGSNKNHGGVTNDHSLDERTYGCS
        ENKLKTFVGNEQQE VT+Q  DDHPS G++DSKD   KR LD V+++EA +VENE EVD+P+ RNS+VVKD GVQNEGSNKNHGGVTNDHS DE   GCS
Subjt:  ENKLKTFVGNEQQEGVTHQPDDDHPSFGDLDSKDVISKRELDCVISEEAPLVENE-EVDIPDSRNSIVVKDDGVQNEGSNKNHGGVTNDHSLDERTYGCS

Query:  GNTDSVNCIDLESILRQRALENLRKFKGAPPRNVEAIANCKGDHNNDAKQLYSPDSKSVHVTSPRDDAEINSEGFSRQGGGNAVNSMTVKENGVKSTDAI
         NTDS+NCIDLES+LRQRALENLRKFKGA PRNVE IANCK DHNN AKQL SP S SVHVTSPR++AEINS+ FSRQGGGNA+NSM +KENGVKS DAI
Subjt:  GNTDSVNCIDLESILRQRALENLRKFKGAPPRNVEAIANCKGDHNNDAKQLYSPDSKSVHVTSPRDDAEINSEGFSRQGGGNAVNSMTVKENGVKSTDAI

Query:  DSAVASMHDPVYSSQNLGMISNGSNGMNELKQDISSLDQEVINDSICQKTDVDICSTTNRSNLVIAALRPEPKVDSIIKQASAAEESIQTKPSISDIGVD
        DSAVA+MHDPVYSSQNLG ISNGSNGMNE KQDISSLDQEVIND+ICQK D DICSTTNRSNLVIAALRPEPKVDS+IKQ SAA+ES+QTKPSISD+GV 
Subjt:  DSAVASMHDPVYSSQNLGMISNGSNGMNELKQDISSLDQEVINDSICQKTDVDICSTTNRSNLVIAALRPEPKVDSIIKQASAAEESIQTKPSISDIGVD

Query:  ETAQTQTQMRNNDDENIHNGLGSSAHKPSSSLNSISGENSLNMSRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALTRKQLKR
        ETAQ QTQMRNNDD NI NGLGSSA++P SSLNSISGE+SLNMS  ESGE SQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALTR+QLKR
Subjt:  ETAQTQTQMRNNDDENIHNGLGSSAHKPSSSLNSISGENSLNMSRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALTRKQLKR

A0A1S3CKB4 uncharacterized protein LOC103501777 isoform X15.4e-25183.76Show/hide
Query:  MKRSRRKSKSSRKLKSKKLRYRHDSPSCSDTDFESSTSMSSSSSEDDKKVRRSRSKTRKNAKPSKKRSKKQSHDHQSRECSPHPRKRKHSKRNDRCEVKK
        MKRSRRKSKSSRKLKSKK RYRHDSPSCSDTDFESSTS+ SSSSEDDKKVRRSRSKTRKN KPSKKR KKQSHD QSRECSP+PRKRKHSKRNDR EV K
Subjt:  MKRSRRKSKSSRKLKSKKLRYRHDSPSCSDTDFESSTSMSSSSSEDDKKVRRSRSKTRKNAKPSKKRSKKQSHDHQSRECSPHPRKRKHSKRNDRCEVKK

Query:  ANKKKRRRDVSVGAYGDSLSSSTCGNGSTTSNESEVVRRRGRCGKRKGNMGKTERIRNRSKSRSPCSLYGEGSDYQNEVDGDSYVEYTFKRLRSIIVVVG
        ANKKKRRRDVSVG + DSLS STCGNG+TTSNESEVVRRRGR  KRKGNM KT   R  SKSRSPCSL+ EGSDYQNEVD DSYVE  F+RLRSIIVVVG
Subjt:  ANKKKRRRDVSVGAYGDSLSSSTCGNGSTTSNESEVVRRRGRCGKRKGNMGKTERIRNRSKSRSPCSLYGEGSDYQNEVDGDSYVEYTFKRLRSIIVVVG

Query:  EENKLKTFVGNEQQEGVTHQPDDDHPSFGDLDSKDVISKRELDCVISEEAPLVENE-EVDIPDSRNSIVVKDDGVQNEGSNKNHGGVTNDHSLDERTYGC
        EENKLKTFVGNEQQE VT+Q  DDHPS G++DSKD   KR LD V+++EA +VENE EVD+P+ RNS+VVKD GVQNEGSNKNHGGVTNDHS DE   GC
Subjt:  EENKLKTFVGNEQQEGVTHQPDDDHPSFGDLDSKDVISKRELDCVISEEAPLVENE-EVDIPDSRNSIVVKDDGVQNEGSNKNHGGVTNDHSLDERTYGC

Query:  SGNTDSVNCIDLESILRQRALENLRKFKGAPPRNVEAIANCKGDHNNDAKQLYSPDSKSVHVTSPRDDAEINSEGFSRQGGGNAVNSMTVKENGVKSTDA
        S NTDS+NCIDLES+LRQRALENLRKFKGA PRNVE IANCK DHNN AKQL SP S SVHVTSPR++AEINS+ FSRQGGGNA+NSM +KENGVKS DA
Subjt:  SGNTDSVNCIDLESILRQRALENLRKFKGAPPRNVEAIANCKGDHNNDAKQLYSPDSKSVHVTSPRDDAEINSEGFSRQGGGNAVNSMTVKENGVKSTDA

Query:  IDSAVASMHDPVYSSQNLGMISNGSNGMNELKQDISSLDQEVINDSICQKTDVDICSTTNRSNLVIAALRPEPKVDSIIKQASAAEESIQTKPSISDIGV
        IDSAVA+MHDPVYSSQNLG ISNGSNGMNE KQDISSLDQEVIND+ICQK D DICSTTNRSNLVIAALRPEPKVDS+IKQ SAA+ES+QTKPSISD+GV
Subjt:  IDSAVASMHDPVYSSQNLGMISNGSNGMNELKQDISSLDQEVINDSICQKTDVDICSTTNRSNLVIAALRPEPKVDSIIKQASAAEESIQTKPSISDIGV

Query:  DETAQTQTQMRNNDDENIHNGLGSSAHKPSSSLNSISGENSLNMSRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALTRKQLKR
         ETAQ QTQMRNNDD NI NGLGSSA++P SSLNSISGE+SLNMS  ESGE SQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALTR+QLKR
Subjt:  DETAQTQTQMRNNDDENIHNGLGSSAHKPSSSLNSISGENSLNMSRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALTRKQLKR

A0A5D3DWF0 NK-tumor recognition protein5.4e-25183.76Show/hide
Query:  MKRSRRKSKSSRKLKSKKLRYRHDSPSCSDTDFESSTSMSSSSSEDDKKVRRSRSKTRKNAKPSKKRSKKQSHDHQSRECSPHPRKRKHSKRNDRCEVKK
        MKRSRRKSKSSRKLKSKK RYRHDSPSCSDTDFESSTS+ SSSSEDDKKVRRSRSKTRKN KPSKKR KKQSHD QSRECSP+PRKRKHSKRNDR EV K
Subjt:  MKRSRRKSKSSRKLKSKKLRYRHDSPSCSDTDFESSTSMSSSSSEDDKKVRRSRSKTRKNAKPSKKRSKKQSHDHQSRECSPHPRKRKHSKRNDRCEVKK

Query:  ANKKKRRRDVSVGAYGDSLSSSTCGNGSTTSNESEVVRRRGRCGKRKGNMGKTERIRNRSKSRSPCSLYGEGSDYQNEVDGDSYVEYTFKRLRSIIVVVG
        ANKKKRRRDVSVG + DSLS STCGNG+TTSNESEVVRRRGR  KRKGNM KT   R  SKSRSPCSL+ EGSDYQNEVD DSYVE  F+RLRSIIVVVG
Subjt:  ANKKKRRRDVSVGAYGDSLSSSTCGNGSTTSNESEVVRRRGRCGKRKGNMGKTERIRNRSKSRSPCSLYGEGSDYQNEVDGDSYVEYTFKRLRSIIVVVG

Query:  EENKLKTFVGNEQQEGVTHQPDDDHPSFGDLDSKDVISKRELDCVISEEAPLVENE-EVDIPDSRNSIVVKDDGVQNEGSNKNHGGVTNDHSLDERTYGC
        EENKLKTFVGNEQQE VT+Q  DDHPS G++DSKD   KR LD V+++EA +VENE EVD+P+ RNS+VVKD GVQNEGSNKNHGGVTNDHS DE   GC
Subjt:  EENKLKTFVGNEQQEGVTHQPDDDHPSFGDLDSKDVISKRELDCVISEEAPLVENE-EVDIPDSRNSIVVKDDGVQNEGSNKNHGGVTNDHSLDERTYGC

Query:  SGNTDSVNCIDLESILRQRALENLRKFKGAPPRNVEAIANCKGDHNNDAKQLYSPDSKSVHVTSPRDDAEINSEGFSRQGGGNAVNSMTVKENGVKSTDA
        S NTDS+NCIDLES+LRQRALENLRKFKGA PRNVE IANCK DHNN AKQL SP S SVHVTSPR++AEINS+ FSRQGGGNA+NSM +KENGVKS DA
Subjt:  SGNTDSVNCIDLESILRQRALENLRKFKGAPPRNVEAIANCKGDHNNDAKQLYSPDSKSVHVTSPRDDAEINSEGFSRQGGGNAVNSMTVKENGVKSTDA

Query:  IDSAVASMHDPVYSSQNLGMISNGSNGMNELKQDISSLDQEVINDSICQKTDVDICSTTNRSNLVIAALRPEPKVDSIIKQASAAEESIQTKPSISDIGV
        IDSAVA+MHDPVYSSQNLG ISNGSNGMNE KQDISSLDQEVIND+ICQK D DICSTTNRSNLVIAALRPEPKVDS+IKQ SAA+ES+QTKPSISD+GV
Subjt:  IDSAVASMHDPVYSSQNLGMISNGSNGMNELKQDISSLDQEVINDSICQKTDVDICSTTNRSNLVIAALRPEPKVDSIIKQASAAEESIQTKPSISDIGV

Query:  DETAQTQTQMRNNDDENIHNGLGSSAHKPSSSLNSISGENSLNMSRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALTRKQLKR
         ETAQ QTQMRNNDD NI NGLGSSA++P SSLNSISGE+SLNMS  ESGE SQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALTR+QLKR
Subjt:  DETAQTQTQMRNNDDENIHNGLGSSAHKPSSSLNSISGENSLNMSRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALTRKQLKR

A0A6J1IGY0 uncharacterized protein LOC111476850 isoform X19.9e-24580.71Show/hide
Query:  KRSRRKSKSSRKLKSKKLRYRHDSPSCSDTDFESSTSMSSSSSEDDKKVRRSRSKTRKNAKPSKKRSKKQSHDHQSRECSPHPRKRKHSKRNDRCEVKKA
        +RSRRKSKSSRKLKSKKLRYRHDSPSCSDTDFESSTS+SSSSSEDDKKVRRSRSKTRKN+KPSKKRSKKQSHDHQSRECSPHPRKRKHSKRNDR E KK 
Subjt:  KRSRRKSKSSRKLKSKKLRYRHDSPSCSDTDFESSTSMSSSSSEDDKKVRRSRSKTRKNAKPSKKRSKKQSHDHQSRECSPHPRKRKHSKRNDRCEVKKA

Query:  NKKKRRRDVSVGA-YGDSLSSSTCGNGSTTSNESEVVRRRGRCGKRKGNMGKTERIRNRSKSRSPCSLYGEGSDYQNEVDGDSYVEYTFKRLRSIIVVVG
        NKKKRRRDVSVGA   DSLS STCG+GS+TS++SE+ RRRGR GKRK NM KTE  R RSKSRSPCSL  +GSD+QNEV+ DSYV+   +RL+SIIVVVG
Subjt:  NKKKRRRDVSVGA-YGDSLSSSTCGNGSTTSNESEVVRRRGRCGKRKGNMGKTERIRNRSKSRSPCSLYGEGSDYQNEVDGDSYVEYTFKRLRSIIVVVG

Query:  EENKLKTFVGNEQQEGVTHQPDDDHPSFGDLDSKDVISKRELDCVISEEAPLVENE-EVDIPDSRNSIVVKDDGVQNEGSNKNHGGVTNDHSLDERTYGC
        EE++LKTFVGNEQQE VTHQ DD+HP F D++SKD   KRELD VIS+EAP VE++ ++  PD+RNS+++ +DGV+NEGSNKNHGGVTNDHSLDER  GC
Subjt:  EENKLKTFVGNEQQEGVTHQPDDDHPSFGDLDSKDVISKRELDCVISEEAPLVENE-EVDIPDSRNSIVVKDDGVQNEGSNKNHGGVTNDHSLDERTYGC

Query:  SGNTDSVNCIDLESILRQRALENLRKFKGAPPRNVEAIANCKGDHNNDAKQLYSPDSKSVHVTSPRDDAEINSEGFSRQGGGNAVNSMTVKENGVKSTDA
        SGNTD++NCIDLESILRQ+ALENLRKFKGA PRNVE IANCK ++NNDAKQL+SP SKSVHV SPRDDAE N +GFSRQ GG+AVNSM +K NG KSTDA
Subjt:  SGNTDSVNCIDLESILRQRALENLRKFKGAPPRNVEAIANCKGDHNNDAKQLYSPDSKSVHVTSPRDDAEINSEGFSRQGGGNAVNSMTVKENGVKSTDA

Query:  IDSAVASMHDPVYSSQNLGMISNGSNGMNELKQDISSLDQEVINDSICQKTDVDICSTTNRSNLVIAALRPEPKVDSIIKQASAAEESIQTKPSISDIGV
        ID+AVASMHDPV SSQNLG ISNGSNGMNELKQDISSLDQEVIND+IC K D +I STTNRSNLVIAA RPE KVDS+I++ASAA+E IQTKPSISDI V
Subjt:  IDSAVASMHDPVYSSQNLGMISNGSNGMNELKQDISSLDQEVINDSICQKTDVDICSTTNRSNLVIAALRPEPKVDSIIKQASAAEESIQTKPSISDIGV

Query:  DETAQTQTQMRNNDDENIHNGLGSSAHKPSSSLNSISGENSLNMSRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALTRKQLKR
        DE +QTQTQ  NNDD+NI NG GSSA+KPSSSLNSISGENSL+ SR ESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALTR+QLKR
Subjt:  DETAQTQTQMRNNDDENIHNGLGSSAHKPSSSLNSISGENSLNMSRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALTRKQLKR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G53930.1 unknown protein5.0e-1527.48Show/hide
Query:  KSKSSRKLKSKKLRYRHDSPSCSDTDFESSTSMSSSSSEDDKKVRRSRSKTRKNAKPSKKRSKKQSHDHQSRECSPHPR---KRKHSKRNDRCEVKKANK
        K K S++ KSKK+R   D    S +D     S   SSSEDD      R K ++ +K SKKRS+K+    +S + S   R   K+K SKR D    KK  K
Subjt:  KSKSSRKLKSKKLRYRHDSPSCSDTDFESSTSMSSSSSEDDKKVRRSRSKTRKNAKPSKKRSKKQSHDHQSRECSPHPR---KRKHSKRNDRCEVKKANK

Query:  ----KKRRRDVSVGAYGDSLSSSTCGNGSTTSNESEVVRRRGRCGKRKGNMGKTERIRNRSKSRSPCSLYGEGSDYQNEVDGDSYV--EYTFKRLRSIIV
            K+R+RD+S  +   + S  +  +GS + +     R RGR       +GK +  R+RS+      L GE  +      G+  V  E   +RL+SI+V
Subjt:  ----KKRRRDVSVGAYGDSLSSSTCGNGSTTSNESEVVRRRGRCGKRKGNMGKTERIRNRSKSRSPCSLYGEGSDYQNEVDGDSYV--EYTFKRLRSIIV

Query:  VVGEENKLKTFVGNEQQEGVTHQPDDDHPSFGDLDSKDVISKR---ELDCVISEEAPLVENEEVDIPDSRNSIVVKDDGVQNEGSNKNHGGVTNDHSLDE
        V           GN +++              + D +DV   R     +   SE++  ++ E +   DS + I   D+G      ++       D+SL +
Subjt:  VVGEENKLKTFVGNEQQEGVTHQPDDDHPSFGDLDSKDVISKR---ELDCVISEEAPLVENEEVDIPDSRNSIVVKDDGVQNEGSNKNHGGVTNDHSLDE

Query:  RTYGCSGNTDSVNCIDLESILRQRALENLRKFKGAPPRNVEAIANCKGDHNNDAKQLYSPDSKSVHVTSPRDDAEINSEGFSRQGGGNAVNSMTVKENGV
                       DLE+IL++RALENL++F+G   ++  A                    K V   S  +  +I SE    Q            ++ +
Subjt:  RTYGCSGNTDSVNCIDLESILRQRALENLRKFKGAPPRNVEAIANCKGDHNNDAKQLYSPDSKSVHVTSPRDDAEINSEGFSRQGGGNAVNSMTVKENGV

Query:  KSTDAIDSAVASMHDPVYSSQNLGMISNGSNGMNELKQDISSLDQEVINDSICQKTDVDICSTTNRSNLVIAALRPEPKVDSIIKQASAAEESIQTKPSI
              DSAV+     + +S+ +  + N       L    S  DQ+   D+   K    + S T +  LV   L  +    +  K+AS ++++       
Subjt:  KSTDAIDSAVASMHDPVYSSQNLGMISNGSNGMNELKQDISSLDQEVINDSICQKTDVDICSTTNRSNLVIAALRPEPKVDSIIKQASAAEESIQTKPSI

Query:  SDIGVDETAQTQTQMRNNDDENIHN-------GLGSSAHKPSSSLNSI-SGENSLNMSRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALTRK
        S +  +    T   +  N+ E+I            SS+H  +  ++ +  G  S   +  E+ + SQ+EQKTM+VMRGGEMVQV+YKVYIPK+A +L R+
Subjt:  SDIGVDETAQTQTQMRNNDDENIHN-------GLGSSAHKPSSSLNSI-SGENSLNMSRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALTRK

Query:  QLKR
        +L R
Subjt:  QLKR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGAGAAGTAGGAGAAAGAGTAAGAGTAGTAGGAAATTGAAGTCCAAGAAGCTGCGGTATCGCCACGATTCTCCTTCTTGCTCAGACACTGATTTTGAGAGTTCAAC
TTCAATGTCTTCATCTAGCTCGGAGGATGACAAAAAGGTGAGAAGATCTCGATCCAAGACGCGAAAGAATGCGAAGCCTAGTAAAAAGAGATCCAAAAAGCAATCTCATG
ATCATCAAAGTAGGGAATGCTCTCCTCATCCCAGAAAGAGGAAGCATTCAAAGAGAAATGACCGTTGTGAGGTGAAGAAGGCCAACAAAAAGAAGCGTAGAAGAGATGTG
AGTGTTGGTGCCTATGGTGACTCTTTGAGCTCCTCAACTTGTGGAAATGGCAGTACAACCAGCAATGAGAGTGAAGTTGTTAGGCGTAGGGGCAGGTGTGGAAAGAGGAA
AGGAAATATGGGAAAGACTGAAAGAATTAGAAACAGGTCAAAGAGTCGTTCACCATGCTCTTTATATGGTGAAGGTAGTGATTATCAGAATGAGGTTGATGGTGACAGTT
ATGTTGAATACACCTTTAAACGACTAAGGTCCATAATTGTTGTAGTAGGGGAGGAAAATAAATTAAAGACCTTTGTTGGGAATGAACAACAAGAAGGGGTCACACATCAG
CCTGATGATGACCACCCTTCTTTTGGAGACTTGGACAGTAAGGATGTGATAAGTAAAAGAGAATTAGACTGTGTTATATCAGAAGAGGCACCATTGGTAGAAAACGAAGA
AGTGGATATACCTGACAGTAGGAACTCTATCGTTGTAAAGGATGATGGAGTTCAAAATGAAGGAAGCAACAAAAACCATGGAGGAGTAACTAATGATCATTCTTTAGATG
AAAGAACTTATGGCTGTTCTGGAAATACTGACAGCGTAAATTGTATTGATTTAGAGTCAATTTTAAGGCAGAGGGCTTTGGAAAACCTTAGAAAGTTCAAAGGGGCTCCC
CCAAGGAATGTGGAAGCTATTGCTAATTGCAAAGGTGACCATAATAATGATGCAAAGCAATTGTATTCTCCTGACTCTAAGTCAGTTCATGTGACATCTCCTAGGGATGA
TGCTGAGATAAATAGTGAAGGGTTCTCTAGACAAGGTGGAGGGAATGCGGTAAATTCAATGACAGTTAAGGAGAATGGTGTTAAATCTACAGATGCAATAGATTCAGCAG
TTGCATCTATGCATGACCCAGTCTATTCTTCACAGAATCTGGGTATGATTTCCAATGGAAGCAATGGTATGAATGAACTGAAGCAGGATATTTCTTCATTAGACCAGGAG
GTTATAAATGATAGTATTTGCCAGAAGACAGATGTAGATATTTGTTCTACAACTAACAGAAGTAATTTGGTTATTGCAGCTTTGAGGCCTGAGCCAAAAGTTGATTCTAT
TATAAAGCAGGCATCTGCTGCTGAGGAATCTATCCAAACAAAGCCATCCATATCTGACATTGGTGTAGACGAGACTGCTCAAACTCAGACCCAAATGAGGAATAATGACG
ATGAAAATATTCATAATGGTTTAGGTTCTTCAGCTCACAAGCCTTCTTCTTCCCTTAATTCTATTTCAGGAGAGAATAGCTTGAATATGTCCAGACACGAGAGTGGAGAA
GGCTCACAGTTTGAACAGAAAACCATGTCTGTGATGCGGGGTGGTGAGATGGTGCAGGTGAACTACAAGGTCTACATCCCGAAGAGAGCTCCCGCTTTGACTAGGAAGCA
ACTCAAGCGGTAA
mRNA sequenceShow/hide mRNA sequence
ATGAAGAGAAGTAGGAGAAAGAGTAAGAGTAGTAGGAAATTGAAGTCCAAGAAGCTGCGGTATCGCCACGATTCTCCTTCTTGCTCAGACACTGATTTTGAGAGTTCAAC
TTCAATGTCTTCATCTAGCTCGGAGGATGACAAAAAGGTGAGAAGATCTCGATCCAAGACGCGAAAGAATGCGAAGCCTAGTAAAAAGAGATCCAAAAAGCAATCTCATG
ATCATCAAAGTAGGGAATGCTCTCCTCATCCCAGAAAGAGGAAGCATTCAAAGAGAAATGACCGTTGTGAGGTGAAGAAGGCCAACAAAAAGAAGCGTAGAAGAGATGTG
AGTGTTGGTGCCTATGGTGACTCTTTGAGCTCCTCAACTTGTGGAAATGGCAGTACAACCAGCAATGAGAGTGAAGTTGTTAGGCGTAGGGGCAGGTGTGGAAAGAGGAA
AGGAAATATGGGAAAGACTGAAAGAATTAGAAACAGGTCAAAGAGTCGTTCACCATGCTCTTTATATGGTGAAGGTAGTGATTATCAGAATGAGGTTGATGGTGACAGTT
ATGTTGAATACACCTTTAAACGACTAAGGTCCATAATTGTTGTAGTAGGGGAGGAAAATAAATTAAAGACCTTTGTTGGGAATGAACAACAAGAAGGGGTCACACATCAG
CCTGATGATGACCACCCTTCTTTTGGAGACTTGGACAGTAAGGATGTGATAAGTAAAAGAGAATTAGACTGTGTTATATCAGAAGAGGCACCATTGGTAGAAAACGAAGA
AGTGGATATACCTGACAGTAGGAACTCTATCGTTGTAAAGGATGATGGAGTTCAAAATGAAGGAAGCAACAAAAACCATGGAGGAGTAACTAATGATCATTCTTTAGATG
AAAGAACTTATGGCTGTTCTGGAAATACTGACAGCGTAAATTGTATTGATTTAGAGTCAATTTTAAGGCAGAGGGCTTTGGAAAACCTTAGAAAGTTCAAAGGGGCTCCC
CCAAGGAATGTGGAAGCTATTGCTAATTGCAAAGGTGACCATAATAATGATGCAAAGCAATTGTATTCTCCTGACTCTAAGTCAGTTCATGTGACATCTCCTAGGGATGA
TGCTGAGATAAATAGTGAAGGGTTCTCTAGACAAGGTGGAGGGAATGCGGTAAATTCAATGACAGTTAAGGAGAATGGTGTTAAATCTACAGATGCAATAGATTCAGCAG
TTGCATCTATGCATGACCCAGTCTATTCTTCACAGAATCTGGGTATGATTTCCAATGGAAGCAATGGTATGAATGAACTGAAGCAGGATATTTCTTCATTAGACCAGGAG
GTTATAAATGATAGTATTTGCCAGAAGACAGATGTAGATATTTGTTCTACAACTAACAGAAGTAATTTGGTTATTGCAGCTTTGAGGCCTGAGCCAAAAGTTGATTCTAT
TATAAAGCAGGCATCTGCTGCTGAGGAATCTATCCAAACAAAGCCATCCATATCTGACATTGGTGTAGACGAGACTGCTCAAACTCAGACCCAAATGAGGAATAATGACG
ATGAAAATATTCATAATGGTTTAGGTTCTTCAGCTCACAAGCCTTCTTCTTCCCTTAATTCTATTTCAGGAGAGAATAGCTTGAATATGTCCAGACACGAGAGTGGAGAA
GGCTCACAGTTTGAACAGAAAACCATGTCTGTGATGCGGGGTGGTGAGATGGTGCAGGTGAACTACAAGGTCTACATCCCGAAGAGAGCTCCCGCTTTGACTAGGAAGCA
ACTCAAGCGGTAA
Protein sequenceShow/hide protein sequence
MKRSRRKSKSSRKLKSKKLRYRHDSPSCSDTDFESSTSMSSSSSEDDKKVRRSRSKTRKNAKPSKKRSKKQSHDHQSRECSPHPRKRKHSKRNDRCEVKKANKKKRRRDV
SVGAYGDSLSSSTCGNGSTTSNESEVVRRRGRCGKRKGNMGKTERIRNRSKSRSPCSLYGEGSDYQNEVDGDSYVEYTFKRLRSIIVVVGEENKLKTFVGNEQQEGVTHQ
PDDDHPSFGDLDSKDVISKRELDCVISEEAPLVENEEVDIPDSRNSIVVKDDGVQNEGSNKNHGGVTNDHSLDERTYGCSGNTDSVNCIDLESILRQRALENLRKFKGAP
PRNVEAIANCKGDHNNDAKQLYSPDSKSVHVTSPRDDAEINSEGFSRQGGGNAVNSMTVKENGVKSTDAIDSAVASMHDPVYSSQNLGMISNGSNGMNELKQDISSLDQE
VINDSICQKTDVDICSTTNRSNLVIAALRPEPKVDSIIKQASAAEESIQTKPSISDIGVDETAQTQTQMRNNDDENIHNGLGSSAHKPSSSLNSISGENSLNMSRHESGE
GSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALTRKQLKR