CuGenDBv2

Sgr024951 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr024951
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: ANK_REP_REGION domain-containing protein
Genome location: tig00002486:4529360..4545432
RNA-Seq Expression: Sgr024951
Synteny: Sgr024951
Gene Ontology terms:
GO:0016021 - integral component of membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0005515 - protein binding (molecular function)
GO:0005524 - ATP binding (molecular function)
GO:0016887 - ATPase activity (molecular function)
InterPro domains:
IPR000504 - RNA recognition motif domain
IPR000641 - CbxX/CfxQ
IPR002110 - Ankyrin repeat
IPR003593 - AAA+ ATPase domain
IPR003959 - ATPase, AAA-type, core
IPR012677 - Nucleotide-binding alpha-beta plait domain superfamily
IPR020683 - Ankyrin repeat-containing domain
IPR027417 - P-loop containing nucleoside triphosphate hydrolase
IPR034146 - snRNP35, RNA recognition motif
IPR035979 - RNA-binding domain superfamily
IPR036770 - Ankyrin repeat-containing domain superfamily
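
The genome location field above uses a `contig:start..end` notation. As a minimal sketch, the string can be split into its parts and the span computed as follows. The helper names (`Location`, `parse_location`) are hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    contig: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'contig:start..end' string into contig name and coordinates."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    contig, start, end = match.groups()
    return Location(contig, int(start), int(end))


loc = parse_location("tig00002486:4529360..4545432")
print(loc.contig, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, the gene region spans 16,073 bp on contig tig00002486.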


Homology
GenBank top hits | e value | % identity | Alignment
KAG6582100.1 Protein CfxQ-like protein, partial [Cucurbita argyrosperma subsp. sororia] | 1.1e-215 | 88.58
Query:  QARSPRSRLKNPHSSAGDVLGLQKLLRENPSLLNERNPIVWSLFLKNFSMGQTPLHVSAGYNRAEIVKCLLAWQGPEKVELEAKNMYGETPLHMAAKNGC
        ++RS +    + ++ +GDVL LQKLLRENP LLN+RNP           MGQTPLHVSAGYN+AEIVK LLAWQGPEKVELEAKNMYGETPLHMAAKNGC
Subjt:  QARSPRSRLKNPHSSAGDVLGLQKLLRENPSLLNERNPIVWSLFLKNFSMGQTPLHVSAGYNRAEIVKCLLAWQGPEKVELEAKNMYGETPLHMAAKNGC

Query:  NDAARVLLAHGAFIEAKANNGMTPLHLAVWYSLQAEDCATVKTLLEYNADCSAKDDEGMTPLNHLSQGSCSGKLRELLNQHLEEQRKRKAIEACSETKAK
        NDAA VLLAHGAF+EAKANNGMTPLHLAVWYSLQ+EDCATVKTLLEYNADCSA DDEGMTPLNHLSQGSCS KLRELLN+HL+EQRKRKAIEACSETKAK
Subjt:  NDAARVLLAHGAFIEAKANNGMTPLHLAVWYSLQAEDCATVKTLLEYNADCSAKDDEGMTPLNHLSQGSCSGKLRELLNQHLEEQRKRKAIEACSETKAK

Query:  MKELENELSHIVGLHELKIQLHKWAKGMLLDERRRALGLKVGTRRPPHMAFLGNPGTGKTMVARILGKLLHMVGILPTDKVTEVQRTDLVGEFVGHTGPK
        MKELENELSHIVGLHELKIQL KWAKGMLLDERRRALGLKVGTR+PPHMAFLGNPGTGKTMVARILGKLLHMVGILPTDKVTEVQRTDLVGEFVGHTGPK
Subjt:  MKELENELSHIVGLHELKIQLHKWAKGMLLDERRRALGLKVGTRRPPHMAFLGNPGTGKTMVARILGKLLHMVGILPTDKVTEVQRTDLVGEFVGHTGPK

Query:  TRRKIKEAEGGILFVDEAYRLIPMQKADDKDYGLEALEEIMSVMDSGKIVVIFAGYCEPMKRVIASNDGFCRRVTKFFNFNDFSSEELAKILHIKMDNQT
        TRRKIKEAEGGILFVDEAYRLIPMQK+DDKDYGLEALEEIMSVMDSGK+VVIFAGYCEPM+RVIASN+GF RRVTKFF FNDFSSEELAKILHIKM+NQT
Subjt:  TRRKIKEAEGGILFVDEAYRLIPMQKADDKDYGLEALEEIMSVMDSGKIVVIFAGYCEPMKRVIASNDGFCRRVTKFFNFNDFSSEELAKILHIKMDNQT

Query:  EDSLLYGFKLYPTCTIEAISDLIDRETKE
        EDSLLYGFKL+PTCTIEAISDLI RET+E
Subjt:  EDSLLYGFKLYPTCTIEAISDLIDRETKE

XP_022955612.1 uncharacterized protein LOC111457564 [Cucurbita moschata] | 1.1e-215 | 88.58
Query:  QARSPRSRLKNPHSSAGDVLGLQKLLRENPSLLNERNPIVWSLFLKNFSMGQTPLHVSAGYNRAEIVKCLLAWQGPEKVELEAKNMYGETPLHMAAKNGC
        ++RS +    + ++ +GDVL LQKLLRENP LLN+RNP           MGQTPLHVSAGYN+AEIVK LLAWQGPEKVELEAKNMYGETPLHMAAKNGC
Subjt:  QARSPRSRLKNPHSSAGDVLGLQKLLRENPSLLNERNPIVWSLFLKNFSMGQTPLHVSAGYNRAEIVKCLLAWQGPEKVELEAKNMYGETPLHMAAKNGC

Query:  NDAARVLLAHGAFIEAKANNGMTPLHLAVWYSLQAEDCATVKTLLEYNADCSAKDDEGMTPLNHLSQGSCSGKLRELLNQHLEEQRKRKAIEACSETKAK
        NDAA VLLAHGAF+EAKANNGMTPLHLAVWYSLQ+EDCATVKTLL+YNADCSA DDEGMTPLNHLSQGSCS KLRELLN+HL+EQRKRKAIEACSETKAK
Subjt:  NDAARVLLAHGAFIEAKANNGMTPLHLAVWYSLQAEDCATVKTLLEYNADCSAKDDEGMTPLNHLSQGSCSGKLRELLNQHLEEQRKRKAIEACSETKAK

Query:  MKELENELSHIVGLHELKIQLHKWAKGMLLDERRRALGLKVGTRRPPHMAFLGNPGTGKTMVARILGKLLHMVGILPTDKVTEVQRTDLVGEFVGHTGPK
        MKELENELSHIVGLHELKIQL KWAKGMLLDERRRALGLKVGTRRPPHMAFLGNPGTGKTMVARILGKLLHMVGILPTDKVTEVQRTDLVGEFVGHTGPK
Subjt:  MKELENELSHIVGLHELKIQLHKWAKGMLLDERRRALGLKVGTRRPPHMAFLGNPGTGKTMVARILGKLLHMVGILPTDKVTEVQRTDLVGEFVGHTGPK

Query:  TRRKIKEAEGGILFVDEAYRLIPMQKADDKDYGLEALEEIMSVMDSGKIVVIFAGYCEPMKRVIASNDGFCRRVTKFFNFNDFSSEELAKILHIKMDNQT
        TRRKIKEAEGGILFVDEAYRLIPMQK+DDKDYGLEALEEIMSVMDSGK+VVIFAGYCEPM+RVIASN+GF RRVTKFF FNDFSSEELAKILHIKM+NQT
Subjt:  TRRKIKEAEGGILFVDEAYRLIPMQKADDKDYGLEALEEIMSVMDSGKIVVIFAGYCEPMKRVIASNDGFCRRVTKFFNFNDFSSEELAKILHIKMDNQT

Query:  EDSLLYGFKLYPTCTIEAISDLIDRETKE
        EDSLLYGFKL+PTCTIEAISDLI RET+E
Subjt:  EDSLLYGFKLYPTCTIEAISDLIDRETKE

XP_022979772.1 uncharacterized protein LOC111479377 [Cucurbita maxima] | 1.2e-214 | 88.11
Query:  QARSPRSRLKNPHSSAGDVLGLQKLLRENPSLLNERNPIVWSLFLKNFSMGQTPLHVSAGYNRAEIVKCLLAWQGPEKVELEAKNMYGETPLHMAAKNGC
        ++RS +    + ++ +GDV  LQKLLRENP LLN+RNP           MGQTPLHVSAGYN+AEIVK LLAWQGPEKVELEAKNMYGETPLHMAAKNGC
Subjt:  QARSPRSRLKNPHSSAGDVLGLQKLLRENPSLLNERNPIVWSLFLKNFSMGQTPLHVSAGYNRAEIVKCLLAWQGPEKVELEAKNMYGETPLHMAAKNGC

Query:  NDAARVLLAHGAFIEAKANNGMTPLHLAVWYSLQAEDCATVKTLLEYNADCSAKDDEGMTPLNHLSQGSCSGKLRELLNQHLEEQRKRKAIEACSETKAK
        NDAA +LLAHGAF+EAKANNGMTPLHLAVWYSLQ+EDCATVKTLLEYNADCSA DDEGMTPLNHLSQGSCS KLRELLN+HLEEQRKRKAIEACSETKAK
Subjt:  NDAARVLLAHGAFIEAKANNGMTPLHLAVWYSLQAEDCATVKTLLEYNADCSAKDDEGMTPLNHLSQGSCSGKLRELLNQHLEEQRKRKAIEACSETKAK

Query:  MKELENELSHIVGLHELKIQLHKWAKGMLLDERRRALGLKVGTRRPPHMAFLGNPGTGKTMVARILGKLLHMVGILPTDKVTEVQRTDLVGEFVGHTGPK
        MKELENELSHIVGLHELKIQL KWAKGMLLDERRRALGLKVGTRRPPHMAFLGNPGTGKTMVARILGKLLHMVGILPTDKVTEVQRTDLVGEFVGHTGPK
Subjt:  MKELENELSHIVGLHELKIQLHKWAKGMLLDERRRALGLKVGTRRPPHMAFLGNPGTGKTMVARILGKLLHMVGILPTDKVTEVQRTDLVGEFVGHTGPK

Query:  TRRKIKEAEGGILFVDEAYRLIPMQKADDKDYGLEALEEIMSVMDSGKIVVIFAGYCEPMKRVIASNDGFCRRVTKFFNFNDFSSEELAKILHIKMDNQT
        TRRKIKEAEGGILFVDEAYRLIPMQK+DDKDYGLEALEEIMSVMDSGK+VVIFAGYCEPM+RVI+SN+GF RRVTKFF FNDFSSEELAKILHIKM+NQT
Subjt:  TRRKIKEAEGGILFVDEAYRLIPMQKADDKDYGLEALEEIMSVMDSGKIVVIFAGYCEPMKRVIASNDGFCRRVTKFFNFNDFSSEELAKILHIKMDNQT

Query:  EDSLLYGFKLYPTCTIEAISDLIDRETKE
        EDSLLYGFKL+PTCTIEAISDLI+R+T+E
Subjt:  EDSLLYGFKLYPTCTIEAISDLIDRETKE

XP_023528533.1 uncharacterized protein LOC111791430 [Cucurbita pepo subsp. pepo] | 1.2e-216 | 89.04
Query:  QARSPRSRLKNPHSSAGDVLGLQKLLRENPSLLNERNPIVWSLFLKNFSMGQTPLHVSAGYNRAEIVKCLLAWQGPEKVELEAKNMYGETPLHMAAKNGC
        ++RS +    + ++ +GDVL LQKLLRENP LLN+RNP           MGQTPLHVSAGYN+AEIVK LLAWQGPEKVELEAKNMYGETPLHMAAKNGC
Subjt:  QARSPRSRLKNPHSSAGDVLGLQKLLRENPSLLNERNPIVWSLFLKNFSMGQTPLHVSAGYNRAEIVKCLLAWQGPEKVELEAKNMYGETPLHMAAKNGC

Query:  NDAARVLLAHGAFIEAKANNGMTPLHLAVWYSLQAEDCATVKTLLEYNADCSAKDDEGMTPLNHLSQGSCSGKLRELLNQHLEEQRKRKAIEACSETKAK
        NDAA VLLAHGAF+EAKANNGMTPLHLAVWYSLQ+EDCATVKTLLEYNADCSA DDEGMTPLNHLSQGSCS KLRELLN+HLEEQRKRKAIEACSETKAK
Subjt:  NDAARVLLAHGAFIEAKANNGMTPLHLAVWYSLQAEDCATVKTLLEYNADCSAKDDEGMTPLNHLSQGSCSGKLRELLNQHLEEQRKRKAIEACSETKAK

Query:  MKELENELSHIVGLHELKIQLHKWAKGMLLDERRRALGLKVGTRRPPHMAFLGNPGTGKTMVARILGKLLHMVGILPTDKVTEVQRTDLVGEFVGHTGPK
        MKELENELSHIVGLHELKIQL KWAKGMLLDERRRALGLKVGTRRPPHMAFLGNPGTGKTMVARILGKLLHMVGILPTDKVTEVQRTDLVGEFVGHTGPK
Subjt:  MKELENELSHIVGLHELKIQLHKWAKGMLLDERRRALGLKVGTRRPPHMAFLGNPGTGKTMVARILGKLLHMVGILPTDKVTEVQRTDLVGEFVGHTGPK

Query:  TRRKIKEAEGGILFVDEAYRLIPMQKADDKDYGLEALEEIMSVMDSGKIVVIFAGYCEPMKRVIASNDGFCRRVTKFFNFNDFSSEELAKILHIKMDNQT
        TRRKIKEAEGGILFVDEAYRLIPMQK+DDKDYGLEALEEIMSVMDSGK+VVIFAGYCEPM+RVIASN+GF RRVTKFF FNDFSSEELAKILHIKM+NQT
Subjt:  TRRKIKEAEGGILFVDEAYRLIPMQKADDKDYGLEALEEIMSVMDSGKIVVIFAGYCEPMKRVIASNDGFCRRVTKFFNFNDFSSEELAKILHIKMDNQT

Query:  EDSLLYGFKLYPTCTIEAISDLIDRETKE
        EDSLLYGFKL+PTCTIEAISDLI+RET+E
Subjt:  EDSLLYGFKLYPTCTIEAISDLIDRETKE

XP_038896543.1 ribulose bisphosphate carboxylase/oxygenase activase, chloroplastic [Benincasa hispida] | 4.0e-215 | 88.6
Query:  HQARSPRSRLKNPHSSAGDVLGLQKLLRENPSLLNERNPIVWSLFLKNFSMGQTPLHVSAGYNRAEIVKCLLAWQGPEKVELEAKNMYGETPLHMAAKNG
        H++RS +    + ++ +GDVL LQKLLRENP LLNERNP           MGQTPLHVSAGYNRAEIV  LLAW+GPE VELEAKNMYGETPLHMAAKNG
Subjt:  HQARSPRSRLKNPHSSAGDVLGLQKLLRENPSLLNERNPIVWSLFLKNFSMGQTPLHVSAGYNRAEIVKCLLAWQGPEKVELEAKNMYGETPLHMAAKNG

Query:  CNDAARVLLAHGAFIEAKANNGMTPLHLAVWYSLQAEDCATVKTLLEYNADCSAKDDEGMTPLNHLSQGSCSGKLRELLNQHLEEQRKRKAIEACSETKA
        CNDAAR LLAHGAFIEAKANNGMTPLHLAVWYSLQ+EDCATV+TLLEYNADCSA D+EGMTPLNHLSQG CS KLRELLN+HLE+QRKRKAIEACSETKA
Subjt:  CNDAARVLLAHGAFIEAKANNGMTPLHLAVWYSLQAEDCATVKTLLEYNADCSAKDDEGMTPLNHLSQGSCSGKLRELLNQHLEEQRKRKAIEACSETKA

Query:  KMKELENELSHIVGLHELKIQLHKWAKGMLLDERRRALGLKVGTRRPPHMAFLGNPGTGKTMVARILGKLLHMVGILPTDKVTEVQRTDLVGEFVGHTGP
        KMKELENELSHIVGLHELKIQL KWAKGMLLDERRRALGLKVGTRRPPHMAFLGNPGTGKTMVARILGKLLHMVGILPTDKVTEVQRTDLVGEFVGHTGP
Subjt:  KMKELENELSHIVGLHELKIQLHKWAKGMLLDERRRALGLKVGTRRPPHMAFLGNPGTGKTMVARILGKLLHMVGILPTDKVTEVQRTDLVGEFVGHTGP

Query:  KTRRKIKEAEGGILFVDEAYRLIPMQKADDKDYGLEALEEIMSVMDSGKIVVIFAGYCEPMKRVIASNDGFCRRVTKFFNFNDFSSEELAKILHIKMDNQ
        KTRRKIKEAEGGILFVDEAYRLIPMQKADDKDYGLEALEEIMSVMDSGKIVVIFAGYCEPMKRVIASN+GF RRVTKFF FNDFSSEELA ILHIKMDNQ
Subjt:  KTRRKIKEAEGGILFVDEAYRLIPMQKADDKDYGLEALEEIMSVMDSGKIVVIFAGYCEPMKRVIASNDGFCRRVTKFFNFNDFSSEELAKILHIKMDNQ

Query:  TEDSLLYGFKLYPTCTIEAISDLIDRETKE
        TEDSLLYGFKL+PTCTI+AISDLI+RET+E
Subjt:  TEDSLLYGFKLYPTCTIEAISDLIDRETKE

TrEMBL top hits | e value | % identity | Alignment
A0A1S3AXT1 protein CbxX, chromosomal | 2.0e-212 | 88.52
Query:  RSPRSRLKNPHSSAGDVLGLQKLLRENPSLLNERNPIVWSLFLKNFSMGQTPLHVSAGYNRAEIVKCLLAWQGPEKVELEAKNMYGETPLHMAAKNGCND
        RS +    + ++ +GDVL LQKLLRENP LLNERNP           MGQTPLHVSAGYNRAEIV  LLAW+GPE VELEAKNMYGETPLHMAAKNGCND
Subjt:  RSPRSRLKNPHSSAGDVLGLQKLLRENPSLLNERNPIVWSLFLKNFSMGQTPLHVSAGYNRAEIVKCLLAWQGPEKVELEAKNMYGETPLHMAAKNGCND

Query:  AARVLLAHGAFIEAKANNGMTPLHLAVWYSLQAEDCATVKTLLEYNADCSAKDDEGMTPLNHLSQGSCSGKLRELLNQHLEEQRKRKAIEACSETKAKMK
        AARVLLAHGAF+EAKANNGMTPLHLAVWYSLQ+EDC TVKTLLEYNADCSA D+EGMTPLNHLSQ SCS KLRELLNQHLEEQRKRKAIEACSETKAKMK
Subjt:  AARVLLAHGAFIEAKANNGMTPLHLAVWYSLQAEDCATVKTLLEYNADCSAKDDEGMTPLNHLSQGSCSGKLRELLNQHLEEQRKRKAIEACSETKAKMK

Query:  ELENELSHIVGLHELKIQLHKWAKGMLLDERRRALGLKVGTRRPPHMAFLGNPGTGKTMVARILGKLLHMVGILPTDKVTEVQRTDLVGEFVGHTGPKTR
        ELENELSHIVGLHELKIQL KWAKGMLLDERRRALGLKVGTRR PHMAFLGNPGTGKTMVARILGKLLHMVGILPTDKVTEVQRTDLVGEFVGHTGPKTR
Subjt:  ELENELSHIVGLHELKIQLHKWAKGMLLDERRRALGLKVGTRRPPHMAFLGNPGTGKTMVARILGKLLHMVGILPTDKVTEVQRTDLVGEFVGHTGPKTR

Query:  RKIKEAEGGILFVDEAYRLIPMQKADDKDYGLEALEEIMSVMDSGKIVVIFAGYCEPMKRVIASNDGFCRRVTKFFNFNDFSSEELAKILHIKMDNQTED
        RKIKEAEGGILFVDEAYRLIPMQKADDKDYGLEALEEIMSVMDSGK+VVIFAGYCEPMKRVI SN+GF RRVTKFF FNDFSS+ELA ILHIKMDNQTED
Subjt:  RKIKEAEGGILFVDEAYRLIPMQKADDKDYGLEALEEIMSVMDSGKIVVIFAGYCEPMKRVIASNDGFCRRVTKFFNFNDFSSEELAKILHIKMDNQTED

Query:  SLLYGFKLYPTCTIEAISDLIDRETKE
        SLLYGFKL+ TCTIEAISDLI+RET+E
Subjt:  SLLYGFKLYPTCTIEAISDLIDRETKE

A0A5D3DJT0 Protein CbxX, chromosomal | 1.2e-212 | 88.76
Query:  RSPRSRLKNPHSSAGDVLGLQKLLRENPSLLNERNPIVWSLFLKNFSMGQTPLHVSAGYNRAEIVKCLLAWQGPEKVELEAKNMYGETPLHMAAKNGCND
        RS +    + ++ +GDVL LQKLLRENP LLNERNP           MGQTPLHVSAGYNRAEIV  LLAW+GPE VELEAKNMYGETPLHMAAKNGCND
Subjt:  RSPRSRLKNPHSSAGDVLGLQKLLRENPSLLNERNPIVWSLFLKNFSMGQTPLHVSAGYNRAEIVKCLLAWQGPEKVELEAKNMYGETPLHMAAKNGCND

Query:  AARVLLAHGAFIEAKANNGMTPLHLAVWYSLQAEDCATVKTLLEYNADCSAKDDEGMTPLNHLSQGSCSGKLRELLNQHLEEQRKRKAIEACSETKAKMK
        AARVLLAHGAF+EAKANNGMTPLHLAVWYSLQ+EDC TVKTLLEYNADCSA D+EGMTPLNHLSQ SCS KLRELLNQHLEEQRKRKAIEACSETKAKMK
Subjt:  AARVLLAHGAFIEAKANNGMTPLHLAVWYSLQAEDCATVKTLLEYNADCSAKDDEGMTPLNHLSQGSCSGKLRELLNQHLEEQRKRKAIEACSETKAKMK

Query:  ELENELSHIVGLHELKIQLHKWAKGMLLDERRRALGLKVGTRRPPHMAFLGNPGTGKTMVARILGKLLHMVGILPTDKVTEVQRTDLVGEFVGHTGPKTR
        ELENELSHIVGLHELKIQL KWAKGMLLDERRRALGLKVGTRR PHMAFLGNPGTGKTMVARILGKLLHMVGILPTDKVTEVQRTDLVGEFVGHTGPKTR
Subjt:  ELENELSHIVGLHELKIQLHKWAKGMLLDERRRALGLKVGTRRPPHMAFLGNPGTGKTMVARILGKLLHMVGILPTDKVTEVQRTDLVGEFVGHTGPKTR

Query:  RKIKEAEGGILFVDEAYRLIPMQKADDKDYGLEALEEIMSVMDSGKIVVIFAGYCEPMKRVIASNDGFCRRVTKFFNFNDFSSEELAKILHIKMDNQTED
        RKIKEAEGGILFVDEAYRLIPMQKADDKDYGLEALEEIMSVMDSGKIVVIFAGYCEPMKRVI SN+GF RRVTKFF FNDFSS+ELA ILHIKMDNQTED
Subjt:  RKIKEAEGGILFVDEAYRLIPMQKADDKDYGLEALEEIMSVMDSGKIVVIFAGYCEPMKRVIASNDGFCRRVTKFFNFNDFSSEELAKILHIKMDNQTED

Query:  SLLYGFKLYPTCTIEAISDLIDRETKE
        SLLYGFKL+ TCTIEAISDLI+RET+E
Subjt:  SLLYGFKLYPTCTIEAISDLIDRETKE

A0A6J1C8L1 uncharacterized protein LOC111009383 | 1.3e-214 | 88.58
Query:  QARSPRSRLKNPHSSAGDVLGLQKLLRENPSLLNERNPIVWSLFLKNFSMGQTPLHVSAGYNRAEIVKCLLAWQGPEKVELEAKNMYGETPLHMAAKNGC
        ++RS +    + ++  GDVLGLQKLLRENPSLLN+RNP           MGQTPLHVSAGYNRAEIVK LLAWQGPEKVELEAKNMYGETPLHMAAKNGC
Subjt:  QARSPRSRLKNPHSSAGDVLGLQKLLRENPSLLNERNPIVWSLFLKNFSMGQTPLHVSAGYNRAEIVKCLLAWQGPEKVELEAKNMYGETPLHMAAKNGC

Query:  NDAARVLLAHGAFIEAKANNGMTPLHLAVWYSLQAEDCATVKTLLEYNADCSAKDDEGMTPLNHLSQGSCSGKLRELLNQHLEEQRKRKAIEACSETKAK
        NDAARVLLAHGAF+EAKANNGMTPLHLAVWYSLQAE+C TVKTLLEYNADCSA DDEGMTPLNHLSQG CS KLRELLNQHLEEQRKRKAIEACSETKAK
Subjt:  NDAARVLLAHGAFIEAKANNGMTPLHLAVWYSLQAEDCATVKTLLEYNADCSAKDDEGMTPLNHLSQGSCSGKLRELLNQHLEEQRKRKAIEACSETKAK

Query:  MKELENELSHIVGLHELKIQLHKWAKGMLLDERRRALGLKVGTRRPPHMAFLGNPGTGKTMVARILGKLLHMVGILPTDKVTEVQRTDLVGEFVGHTGPK
        MKELENELSHIVGLHELKIQL KWAKGMLLDERRRALGLKVG RRPPHMAFLGNPGTGKTM+ARILGKLLHMVGILPTDKVTEVQRTDLVGEFVGHTGPK
Subjt:  MKELENELSHIVGLHELKIQLHKWAKGMLLDERRRALGLKVGTRRPPHMAFLGNPGTGKTMVARILGKLLHMVGILPTDKVTEVQRTDLVGEFVGHTGPK

Query:  TRRKIKEAEGGILFVDEAYRLIPMQKADDKDYGLEALEEIMSVMDSGKIVVIFAGYCEPMKRVIASNDGFCRRVTKFFNFNDFSSEELAKILHIKMDNQT
        TRRKIKEAEGGILFVDEAYRLIPMQKADDKDYGLEALEEIMSVMDSGKI+VIFAGY EPMKRVIASN+GF RRVTKFF+FNDFS EELAKILHIKMDNQT
Subjt:  TRRKIKEAEGGILFVDEAYRLIPMQKADDKDYGLEALEEIMSVMDSGKIVVIFAGYCEPMKRVIASNDGFCRRVTKFFNFNDFSSEELAKILHIKMDNQT

Query:  EDSLLYGFKLYPTCTIEAISDLIDRETKE
        E+SLLYGFKL+P+CT EAIS+LI+RET+E
Subjt:  EDSLLYGFKLYPTCTIEAISDLIDRETKE

A0A6J1GU47 uncharacterized protein LOC111457564 | 5.1e-216 | 88.58
Query:  QARSPRSRLKNPHSSAGDVLGLQKLLRENPSLLNERNPIVWSLFLKNFSMGQTPLHVSAGYNRAEIVKCLLAWQGPEKVELEAKNMYGETPLHMAAKNGC
        ++RS +    + ++ +GDVL LQKLLRENP LLN+RNP           MGQTPLHVSAGYN+AEIVK LLAWQGPEKVELEAKNMYGETPLHMAAKNGC
Subjt:  QARSPRSRLKNPHSSAGDVLGLQKLLRENPSLLNERNPIVWSLFLKNFSMGQTPLHVSAGYNRAEIVKCLLAWQGPEKVELEAKNMYGETPLHMAAKNGC

Query:  NDAARVLLAHGAFIEAKANNGMTPLHLAVWYSLQAEDCATVKTLLEYNADCSAKDDEGMTPLNHLSQGSCSGKLRELLNQHLEEQRKRKAIEACSETKAK
        NDAA VLLAHGAF+EAKANNGMTPLHLAVWYSLQ+EDCATVKTLL+YNADCSA DDEGMTPLNHLSQGSCS KLRELLN+HL+EQRKRKAIEACSETKAK
Subjt:  NDAARVLLAHGAFIEAKANNGMTPLHLAVWYSLQAEDCATVKTLLEYNADCSAKDDEGMTPLNHLSQGSCSGKLRELLNQHLEEQRKRKAIEACSETKAK

Query:  MKELENELSHIVGLHELKIQLHKWAKGMLLDERRRALGLKVGTRRPPHMAFLGNPGTGKTMVARILGKLLHMVGILPTDKVTEVQRTDLVGEFVGHTGPK
        MKELENELSHIVGLHELKIQL KWAKGMLLDERRRALGLKVGTRRPPHMAFLGNPGTGKTMVARILGKLLHMVGILPTDKVTEVQRTDLVGEFVGHTGPK
Subjt:  MKELENELSHIVGLHELKIQLHKWAKGMLLDERRRALGLKVGTRRPPHMAFLGNPGTGKTMVARILGKLLHMVGILPTDKVTEVQRTDLVGEFVGHTGPK

Query:  TRRKIKEAEGGILFVDEAYRLIPMQKADDKDYGLEALEEIMSVMDSGKIVVIFAGYCEPMKRVIASNDGFCRRVTKFFNFNDFSSEELAKILHIKMDNQT
        TRRKIKEAEGGILFVDEAYRLIPMQK+DDKDYGLEALEEIMSVMDSGK+VVIFAGYCEPM+RVIASN+GF RRVTKFF FNDFSSEELAKILHIKM+NQT
Subjt:  TRRKIKEAEGGILFVDEAYRLIPMQKADDKDYGLEALEEIMSVMDSGKIVVIFAGYCEPMKRVIASNDGFCRRVTKFFNFNDFSSEELAKILHIKMDNQT

Query:  EDSLLYGFKLYPTCTIEAISDLIDRETKE
        EDSLLYGFKL+PTCTIEAISDLI RET+E
Subjt:  EDSLLYGFKLYPTCTIEAISDLIDRETKE

A0A6J1IPL6 uncharacterized protein LOC111479377 | 5.6e-215 | 88.11
Query:  QARSPRSRLKNPHSSAGDVLGLQKLLRENPSLLNERNPIVWSLFLKNFSMGQTPLHVSAGYNRAEIVKCLLAWQGPEKVELEAKNMYGETPLHMAAKNGC
        ++RS +    + ++ +GDV  LQKLLRENP LLN+RNP           MGQTPLHVSAGYN+AEIVK LLAWQGPEKVELEAKNMYGETPLHMAAKNGC
Subjt:  QARSPRSRLKNPHSSAGDVLGLQKLLRENPSLLNERNPIVWSLFLKNFSMGQTPLHVSAGYNRAEIVKCLLAWQGPEKVELEAKNMYGETPLHMAAKNGC

Query:  NDAARVLLAHGAFIEAKANNGMTPLHLAVWYSLQAEDCATVKTLLEYNADCSAKDDEGMTPLNHLSQGSCSGKLRELLNQHLEEQRKRKAIEACSETKAK
        NDAA +LLAHGAF+EAKANNGMTPLHLAVWYSLQ+EDCATVKTLLEYNADCSA DDEGMTPLNHLSQGSCS KLRELLN+HLEEQRKRKAIEACSETKAK
Subjt:  NDAARVLLAHGAFIEAKANNGMTPLHLAVWYSLQAEDCATVKTLLEYNADCSAKDDEGMTPLNHLSQGSCSGKLRELLNQHLEEQRKRKAIEACSETKAK

Query:  MKELENELSHIVGLHELKIQLHKWAKGMLLDERRRALGLKVGTRRPPHMAFLGNPGTGKTMVARILGKLLHMVGILPTDKVTEVQRTDLVGEFVGHTGPK
        MKELENELSHIVGLHELKIQL KWAKGMLLDERRRALGLKVGTRRPPHMAFLGNPGTGKTMVARILGKLLHMVGILPTDKVTEVQRTDLVGEFVGHTGPK
Subjt:  MKELENELSHIVGLHELKIQLHKWAKGMLLDERRRALGLKVGTRRPPHMAFLGNPGTGKTMVARILGKLLHMVGILPTDKVTEVQRTDLVGEFVGHTGPK

Query:  TRRKIKEAEGGILFVDEAYRLIPMQKADDKDYGLEALEEIMSVMDSGKIVVIFAGYCEPMKRVIASNDGFCRRVTKFFNFNDFSSEELAKILHIKMDNQT
        TRRKIKEAEGGILFVDEAYRLIPMQK+DDKDYGLEALEEIMSVMDSGK+VVIFAGYCEPM+RVI+SN+GF RRVTKFF FNDFSSEELAKILHIKM+NQT
Subjt:  TRRKIKEAEGGILFVDEAYRLIPMQKADDKDYGLEALEEIMSVMDSGKIVVIFAGYCEPMKRVIASNDGFCRRVTKFFNFNDFSSEELAKILHIKMDNQT

Query:  EDSLLYGFKLYPTCTIEAISDLIDRETKE
        EDSLLYGFKL+PTCTIEAISDLI+R+T+E
Subjt:  EDSLLYGFKLYPTCTIEAISDLIDRETKE

SwissProt top hits | e value | % identity | Alignment
Q16560 U11/U12 small nuclear ribonucleoprotein 35 kDa protein | 2.0e-47 | 55.62
Query:  AENYHPIQAGSIDGTDVLPHDNAVYRALLCSSAGLYDPLGDPKVFGDPYCTLFVGRLSHLTTEETLRRAMGKYGRIKNLRLVRHIVTGSSRGYAFVEYET
        A+ Y P++AGSIDGTD  PHD AV+RA+L      Y P  +  V GDP  TLFV RL+  T E+ L+    +YG I+ LRLVR +VTG S+GYAF+EY+ 
Subjt:  AENYHPIQAGSIDGTDVLPHDNAVYRALLCSSAGLYDPLGDPKVFGDPYCTLFVGRLSHLTTEETLRRAMGKYGRIKNLRLVRHIVTGSSRGYAFVEYET

Query:  EKEMRHAYKDAHHSMIDDCEIIVDYNRQQVMPGWIPRRLGGGLGGKKESGQLRFGGRERPFRAPLR-PIPYDDLKRLG
        E+ +  AY+DA   +ID  EI VDY  ++ + GWIPRRLGGGLGGKKESGQLRFGGR+RPFR P+  P+  +DL R G
Subjt:  EKEMRHAYKDAHHSMIDDCEIIVDYNRQQVMPGWIPRRLGGGLGGKKESGQLRFGGRERPFRAPLR-PIPYDDLKRLG

Q1LZH0 U11/U12 small nuclear ribonucleoprotein 35 kDa protein | 2.6e-47 | 55.62
Query:  AENYHPIQAGSIDGTDVLPHDNAVYRALLCSSAGLYDPLGDPKVFGDPYCTLFVGRLSHLTTEETLRRAMGKYGRIKNLRLVRHIVTGSSRGYAFVEYET
        A+ Y P++AGSIDGTD  PHD AV+RA+L      Y P  +  V GDP  TLFV RL+  T EE L+    +YG I+ LRLVR +VTG S+GYAF+EY+ 
Subjt:  AENYHPIQAGSIDGTDVLPHDNAVYRALLCSSAGLYDPLGDPKVFGDPYCTLFVGRLSHLTTEETLRRAMGKYGRIKNLRLVRHIVTGSSRGYAFVEYET

Query:  EKEMRHAYKDAHHSMIDDCEIIVDYNRQQVMPGWIPRRLGGGLGGKKESGQLRFGGRERPFRAPLR-PIPYDDLKRLG
        E+ +  AY+DA   +ID  EI VDY  ++ + GWIPRRLGGGLGGKKESGQLRFGGR+RPFR P+  P+  +D  R G
Subjt:  EKEMRHAYKDAHHSMIDDCEIIVDYNRQQVMPGWIPRRLGGGLGGKKESGQLRFGGRERPFRAPLR-PIPYDDLKRLG

Q5U1W5 U11/U12 small nuclear ribonucleoprotein 35 kDa protein | 1.6e-44 | 44.75
Query:  AENYHPIQAGSIDGTDVLPHDNAVYRALLCSSAGLYDPLGDPKVFGDPYCTLFVGRLSHLTTEETLRRAMGKYGRIKNLRLVRHIVTGSSRGYAFVEYET
        A+ Y P++AGSIDGTD  PHD AV+RA+L      Y P  +  V GDP  TLFV RL+  T EE L+    +YG I+ LRLVR +VTG S+GYAF+EY+ 
Subjt:  AENYHPIQAGSIDGTDVLPHDNAVYRALLCSSAGLYDPLGDPKVFGDPYCTLFVGRLSHLTTEETLRRAMGKYGRIKNLRLVRHIVTGSSRGYAFVEYET

Query:  EKEMRHAYKDAHHSMIDDCEIIVDYNRQQVMPGWIPRRLGGGLGGKKESGQLRFGGRERPFRAPLR-PIPYDDLKRLGITP-PPKGRYMSRFQVPSPPRR
        E+ +  AY+DA   +ID  EI VDY  ++ + GWIPRRLGGGLGGKKESGQLRFGGR+RPFR P+  P+  ++  R G      + R   R   P P  R
Subjt:  EKEMRHAYKDAHHSMIDDCEIIVDYNRQQVMPGWIPRRLGGGLGGKKESGQLRFGGRERPFRAPLR-PIPYDDLKRLGITP-PPKGRYMSRFQVPSPPRR

Query:  ETDTIGREEGSGNYEKMKYESRRALNDKDDYSPHMSSSEKEDRYQRSSSDKDRSRRR
        + D  GRE+         ++ R  +  ++D+       E+E R +R+ +   R R +
Subjt:  ETDTIGREEGSGNYEKMKYESRRALNDKDDYSPHMSSSEKEDRYQRSSSDKDRSRRR

Q8VY74 U11/U12 small nuclear ribonucleoprotein 35 kDa protein | 7.1e-98 | 67.38
Query:  GSN-VNKVFYAENYHPIQAGSIDGTDVLPHDNAVYRALLCSSAGLYDPLGDPKVFGDPYCTLFVGRLSHLTTEETLRRAMGKYGRIKNLRLVRHIVTGSS
        G+N VNKVFYA +YHPIQAGSIDGTDV PHDN V RALLC +AGLYDP GD K  GDPYCTLFVGRLSH TTE+TLR  M KYGRIKNLRLVRHIVTG+S
Subjt:  GSN-VNKVFYAENYHPIQAGSIDGTDVLPHDNAVYRALLCSSAGLYDPLGDPKVFGDPYCTLFVGRLSHLTTEETLRRAMGKYGRIKNLRLVRHIVTGSS

Query:  RGYAFVEYETEKEMRHAYKDAHHSMIDDCEIIVDYNRQQVMPGWIPRRLGGGLGGKKESGQLRFGGRERPFRAPLRPIPYDDLKRLGITPPPKGRYMSRF
        RGY FVEYETEKEM  AY+DAHHS+ID  EIIVDYNRQQ+MPGWIPRRLGGGLGG+KESGQLRFGGR+RPFRAPLRPIP++DLK+LGI  PP+GRYMSR 
Subjt:  RGYAFVEYETEKEMRHAYKDAHHSMIDDCEIIVDYNRQQVMPGWIPRRLGGGLGGKKESGQLRFGGRERPFRAPLRPIPYDDLKRLGITPPPKGRYMSRF

Query:  QVPSPPRRETDTIGREEGSGNYEKMKYESRRALNDKDDYSPHMSSSEKEDRYQRSSSDKDRSRRRGSNERDDHSRKRHRSHR
        Q+PSPPRR+     REE         Y  + ++  ++++     SS +     RSS+    S RR S +R++ SR+  RS R
Subjt:  QVPSPPRRETDTIGREEGSGNYEKMKYESRRALNDKDDYSPHMSSSEKEDRYQRSSSDKDRSRRRGSNERDDHSRKRHRSHR

Q9D384 U11/U12 small nuclear ribonucleoprotein 35 kDa protein | 1.6e-44 | 45.14
Query:  AENYHPIQAGSIDGTDVLPHDNAVYRALLCSSAGLYDPLGDPKVFGDPYCTLFVGRLSHLTTEETLRRAMGKYGRIKNLRLVRHIVTGSSRGYAFVEYET
        A+ Y P++AGSIDGTD  PHD AV+RA+L      Y P  +  V GDP  TLFV RL+  T EE L+    +YG I+ LRLVR +VTG S+GYAF+EY+ 
Subjt:  AENYHPIQAGSIDGTDVLPHDNAVYRALLCSSAGLYDPLGDPKVFGDPYCTLFVGRLSHLTTEETLRRAMGKYGRIKNLRLVRHIVTGSSRGYAFVEYET

Query:  EKEMRHAYKDAHHSMIDDCEIIVDYNRQQVMPGWIPRRLGGGLGGKKESGQLRFGGRERPFRAPLR-PIPYDDLKRLGITP-PPKGRYMSRFQVPSPPRR
        E+ +  AY+DA   +ID  EI VDY  ++ + GWIPRRLGGGLGGKKESGQLRFGGR+RPFR P+  P+  ++  R G      + R   R   P P  R
Subjt:  EKEMRHAYKDAHHSMIDDCEIIVDYNRQQVMPGWIPRRLGGGLGGKKESGQLRFGGRERPFRAPLR-PIPYDDLKRLGITP-PPKGRYMSRFQVPSPPRR

Query:  ETDTIGREEGSGNYEKMKYESRRALNDKDDYSPHMSSSEKEDRYQRSSSDKDRSRRR
        + D  GRE+         ++ R  +  ++D+       E+E R +R  S   R R +
Subjt:  ETDTIGREEGSGNYEKMKYESRRALNDKDDYSPHMSSSEKEDRYQRSSSDKDRSRRR

Arabidopsis top hits | e value | % identity | Alignment
AT2G03430.1 Ankyrin repeat family protein | 7.5e-10 | 38.6
Query:  GQTPLHVSAGYNRAEIVKCLLAWQGPEKVELEAKNMYGETPLHMAAKNGCNDAARVLLAHGAFIEAKANNGMTPLHLAVWYSLQAEDCATVKTLLEYNAD
        G  PLH +A    AE+V+ LL        ++ AKN  G T LH AA  G  + A++LL HGA I      G TPLH A     + E C   + L+E  A+
Subjt:  GQTPLHVSAGYNRAEIVKCLLAWQGPEKVELEAKNMYGETPLHMAAKNGCNDAARVLLAHGAFIEAKANNGMTPLHLAVWYSLQAEDCATVKTLLEYNAD

Query:  CSAKDDEGMTPLNH
          A D  G T L H
Subjt:  CSAKDDEGMTPLNH

AT2G43370.1 RNA-binding (RRM/RBD/RNP motifs) family protein | 5.0e-99 | 67.38
Query:  GSN-VNKVFYAENYHPIQAGSIDGTDVLPHDNAVYRALLCSSAGLYDPLGDPKVFGDPYCTLFVGRLSHLTTEETLRRAMGKYGRIKNLRLVRHIVTGSS
        G+N VNKVFYA +YHPIQAGSIDGTDV PHDN V RALLC +AGLYDP GD K  GDPYCTLFVGRLSH TTE+TLR  M KYGRIKNLRLVRHIVTG+S
Subjt:  GSN-VNKVFYAENYHPIQAGSIDGTDVLPHDNAVYRALLCSSAGLYDPLGDPKVFGDPYCTLFVGRLSHLTTEETLRRAMGKYGRIKNLRLVRHIVTGSS

Query:  RGYAFVEYETEKEMRHAYKDAHHSMIDDCEIIVDYNRQQVMPGWIPRRLGGGLGGKKESGQLRFGGRERPFRAPLRPIPYDDLKRLGITPPPKGRYMSRF
        RGY FVEYETEKEM  AY+DAHHS+ID  EIIVDYNRQQ+MPGWIPRRLGGGLGG+KESGQLRFGGR+RPFRAPLRPIP++DLK+LGI  PP+GRYMSR 
Subjt:  RGYAFVEYETEKEMRHAYKDAHHSMIDDCEIIVDYNRQQVMPGWIPRRLGGGLGGKKESGQLRFGGRERPFRAPLRPIPYDDLKRLGITPPPKGRYMSRF

Query:  QVPSPPRRETDTIGREEGSGNYEKMKYESRRALNDKDDYSPHMSSSEKEDRYQRSSSDKDRSRRRGSNERDDHSRKRHRSHR
        Q+PSPPRR+     REE         Y  + ++  ++++     SS +     RSS+    S RR S +R++ SR+  RS R
Subjt:  QVPSPPRRETDTIGREEGSGNYEKMKYESRRALNDKDDYSPHMSSSEKEDRYQRSSSDKDRSRRRGSNERDDHSRKRHRSHR

AT3G24530.1 AAA-type ATPase family protein / ankyrin repeat family protein | 8.7e-184 | 74.94
Query:  RSPRSRLKNPHSSAGDVLGLQKLLRENPSLLNERNPIVWSLFLKNFSMGQTPLHVSAGYNRAEIVKCLLAWQGPEKVELEAKNMYGETPLHMAAKNGCND
        RS R    +  + +GD++ LQ+LL++NPSLLNERNP+++           TPLHVSAG    +IVK LLAW G +KVELEA N YGETPLHMAAKNGCN+
Subjt:  RSPRSRLKNPHSSAGDVLGLQKLLRENPSLLNERNPIVWSLFLKNFSMGQTPLHVSAGYNRAEIVKCLLAWQGPEKVELEAKNMYGETPLHMAAKNGCND

Query:  AARVLLAHGAFIEAKANNGMTPLHLAVWYSLQAEDCATVKTLLEYNADCSAKDDEGMTPLNHLSQGSCSGKLRELLNQHLEEQRKRKAIEACSETKAKMK
        AA++LL  GAFIEAKA+NGMTPLHLAVWYS+ A++ +TVKTLL++NADCSAKD+EGMTPL+HL QG  S KLRELL   L+EQRKR A+E C +TKAKM+
Subjt:  AARVLLAHGAFIEAKANNGMTPLHLAVWYSLQAEDCATVKTLLEYNADCSAKDDEGMTPLNHLSQGSCSGKLRELLNQHLEEQRKRKAIEACSETKAKMK

Query:  ELENELSHIVGLHELKIQLHKWAKGMLLDERRRALGLKVGTRRPPHMAFLGNPGTGKTMVARILGKLLHMVGILPTDKVTEVQRTDLVGEFVGHTGPKTR
         LE+ELS+IVGL ELK QL KWAKGMLLDERRRALGL +GTRRPPHMAFLGNPGTGKTMVAR+LGKLL+ VGILPTDKVTEVQRTDLVGEFVGHTGPKTR
Subjt:  ELENELSHIVGLHELKIQLHKWAKGMLLDERRRALGLKVGTRRPPHMAFLGNPGTGKTMVARILGKLLHMVGILPTDKVTEVQRTDLVGEFVGHTGPKTR

Query:  RKIKEAEGGILFVDEAYRLIPMQKADDKDYGLEALEEIMSVMDSGKIVVIFAGYCEPMKRVIASNDGFCRRVTKFFNFNDFSSEELAKILHIKMDNQTED
        RKI+EAEGGILFVDEAYRLIPMQKADDKDYGLEALEEIMSVMD+GKIVVIFAGY EPMKRVIASN+GFCRRVTKFFNF+DFS++ELA+ILHIKM+NQ ED
Subjt:  RKIKEAEGGILFVDEAYRLIPMQKADDKDYGLEALEEIMSVMDSGKIVVIFAGYCEPMKRVIASNDGFCRRVTKFFNFNDFSSEELAKILHIKMDNQTED

Query:  SLLYGFKLYPTCTIEAISDLIDRETKE
        +L YGF+L+ +CT++ I+ LI+ ET E
Subjt:  SLLYGFKLYPTCTIEAISDLIDRETKE

AT3G50670.1 U1 small nuclear ribonucleoprotein-70K | 1.8e-24 | 34.71
Query:  YDPLGDPKVFGDPYCTLFVGRLSHLTTEETLRRAMGKYGRIKNLRLVRHIVTGSSRGYAFVEYETEKEMRHAYKDAHHSMIDDCEIIVDYNRQQVMPGWI
        YDP  DP   GDPY TLFV RL++ ++E  ++R    YG IK + LV   +T   +GYAF+EY   ++M+ AYK A    ID   ++VD  R + +P W 
Subjt:  YDPLGDPKVFGDPYCTLFVGRLSHLTTEETLRRAMGKYGRIKNLRLVRHIVTGSSRGYAFVEYETEKEMRHAYKDAHHSMIDDCEIIVDYNRQQVMPGWI

Query:  PRRLGGGLGGKKESGQLRFGGRERPFRAPLRPIPYDDLKRLGITPPPKGRYMSRFQVPSPPRRETDTIGREEG---SGNYEKMKYESRRALNDKDDYSPH
        PRRLGGGLG  +  G     G ++                      P+GR  S+ + PS PR E +   RE+G     + E    + R    D+     H
Subjt:  PRRLGGGLGGKKESGQLRFGGRERPFRAPLRPIPYDDLKRLGITPPPKGRYMSRFQVPSPPRRETDTIGREEG---SGNYEKMKYESRRALNDKDDYSPH

Query:  MSSSEK--EDRYQRSSSDKDRSRRRGSNERDDHSRKRHRSHR
            ++   DR + S  D+DR+R RG  +R D  R R R+ R
Subjt:  MSSSEK--EDRYQRSSSDKDRSRRRGSNERDDHSRKRHRSHR

AT4G29660.1 embryo defective 2752 | 1.8e-19 | 44.44
Query:  LWRKYADYVYTKWERTILWDMVDPYRQPKCLRLWLPSILRLFIPRSSAVPSPSNFTSYINLFNPYENVQERYWEDHPGEAVPLMKPKFYYGPWRVMRGE
        LWRKYADY Y K+ER  +W+M++PYR+PK     +   +  F            +T  I      +  +E++WE+HPG+ VPLMKP FY GPWRV RGE
Subjt:  LWRKYADYVYTKWERTILWDMVDPYRQPKCLRLWLPSILRLFIPRSSAVPSPSNFTSYINLFNPYENVQERYWEDHPGEAVPLMKPKFYYGPWRVMRGE
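
In the alignment blocks above, the unlabelled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch, identities and positives for a single block can be tallied from the query and middle lines alone. The function name `midline_counts` is hypothetical, and the per-block figure it gives will generally differ from the %identity reported for the whole alignment.

```python
def midline_counts(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, aligned_length) for one alignment block."""
    # Pad or trim the midline so it lines up with the query
    # (trailing spaces are often lost when text is copied).
    midline = midline.ljust(len(query))[: len(query)]
    # A midline letter equal to the query residue marks an identity.
    identities = sum(
        1 for q, m in zip(query, midline) if m == q and m not in " +"
    )
    # '+' marks a conservative substitution; positives include identities.
    positives = identities + midline.count("+")
    return identities, positives, len(query)


# Final block of the first GenBank hit (query line and middle line).
query = "EDSLLYGFKLYPTCTIEAISDLIDRETKE"
midline = "EDSLLYGFKL+PTCTIEAISDLI RET+E"

ident, pos, length = midline_counts(query, midline)
print(f"identities {ident}/{length}, positives {pos}/{length}")
```

Summing these counts over every block of a hit and dividing by the total aligned length would give an estimate of the overall identity for that hit.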


Sequences
CDS sequence
ATGTTTCTCGTCCTATATTTACCTTCATCGAAGAGTGTTTTTGTCAATGCAATCCATGGAAATGTAGCTGCAAAGCAAACCGGCGCAATGGAACCATTCATTCGGCCGCC
ATCACGCCTTTGCGCCATTTCTCATTTCATTTCCCTCCGTTTCTCTCTCTTCTCTCTCCCTGCCCTGATCTCGGTTTACTGTCTTCCCTCCTTCCCCAACTCGCCTCCGT
GCTCTCTTCCCCTCTCGAGAAGATCTTGCATTGGCTCTCTCTCTCTCTCTGGTCTCATCTCTGTCTCGCGGGTGGCTCCTTCATTTTTCTACTGGACCCATCAGGCCCGA
TCGCCAAGATCTCGGCTTAAGAATCCTCACTCGTCGGCTGGGGACGTTCTTGGCCTTCAAAAGCTGCTTCGAGAAAACCCTAGTCTTCTTAATGAAAGAAATCCTATTGT
ATGGTCTCTTTTCTTGAAGAACTTTTCGATGGGACAGACACCACTTCATGTTTCTGCAGGCTATAACAGGGCTGAGATAGTTAAATGTCTTCTTGCCTGGCAAGGTCCAG
AAAAGGTTGAGTTGGAAGCCAAAAACATGTATGGAGAGACTCCATTGCATATGGCAGCAAAGAATGGGTGCAACGATGCGGCACGGGTGCTACTGGCTCATGGTGCTTTT
ATTGAAGCCAAAGCAAATAATGGGATGACACCATTACACCTTGCTGTCTGGTATTCACTCCAGGCTGAAGACTGTGCTACTGTCAAGACATTGCTTGAATATAATGCCGA
TTGTAGTGCGAAGGACGATGAGGGTATGACTCCCCTAAACCATCTGTCACAAGGCTCATGTAGTGGAAAGTTGAGGGAATTGTTGAACCAGCATTTAGAAGAGCAGAGAA
AGCGAAAAGCAATTGAAGCATGCAGTGAAACTAAAGCAAAGATGAAAGAACTCGAAAACGAGCTCTCACATATCGTAGGTTTGCACGAGCTCAAGATACAACTTCATAAA
TGGGCTAAGGGAATGCTTTTAGACGAGAGACGCAGGGCCCTTGGCCTCAAAGTTGGCACCAGGAGGCCCCCTCATATGGCATTTCTTGGCAATCCTGGAACAGGTAAGAC
TATGGTCGCTCGAATACTTGGAAAATTGCTTCACATGGTGGGCATCCTGCCAACAGATAAGGTAACAGAAGTACAGAGAACAGATCTGGTCGGCGAATTCGTTGGTCATA
CTGGTCCAAAAACCAGGAGGAAGATCAAAGAAGCAGAGGGTGGAATCCTTTTCGTTGACGAAGCCTATCGACTGATACCAATGCAAAAAGCAGACGATAAGGATTATGGT
TTGGAAGCGCTGGAAGAGATCATGTCTGTCATGGACAGTGGGAAAATTGTAGTCATATTTGCTGGCTACTGTGAACCAATGAAGCGCGTAATAGCTTCAAATGACGGATT
TTGTCGACGGGTAACCAAGTTTTTCAACTTTAACGACTTCAGTTCAGAAGAATTGGCAAAGATTCTCCATATCAAGATGGATAATCAAACAGAGGATAGCTTGTTATATG
GTTTTAAGTTGTATCCTACTTGCACCATAGAAGCCATTTCGGACCTGATAGATAGAGAAACTAAAGAAAACAGCCTTGCAGATGCTTATGTTCAACACCCTCCCCATCAG
TGCCGTGTTGCTCTGGGGAGTCCAAAGGTGGTGCTATTGAATGAATATAAATCAGAAGCTTGTATAGAAATTCTGGATGCACATTTGGGAGTGACGATAAAATGGCGGGA
GCGCAAAGCAAATCTAAAATTTGCTTGGCGCGCACCGAAGACACACACGGAGACCGAGTCAGGGCTCAGCAACGCCATCGTGACGAACCCACGAGCGCCATTGCTATTGC
CATCTCCATCTTCAGCACCACCTTCACCTCCAAATGACTTGCTGCCCAGGCTCTTCAAGGTTTGCCTTTGCCCTACAATCTCTGGTATTTTTTTTTATCGTGTAATTGTG
AATCCGAATGTTGATTATTTAACTTCCGAGATCCTTGAAATTGATCGGATTCTGAAAAATGAGAAATGGAGCTGGCTAAGAAAACAATGGGTAAAGCTTGTGAAAGAGAG
GAACGTTCACTTATCTGGCAATATCTCAGCTCAGCTCACTTTCGCTGAGACACAAATGTTGATATTCAAGTGGATATGTCCTGATCTCTATATATCAATCAATTTCTACT
TCTCTTCTGGTAGCAATTGTATTGGACGAATTATCGCAGTTGCGCGAAATGACGAGCATCTATGGAGAAAGTATGCCGATTATGTCTACACCAAGTGGGAAAGAACAATC
CTTTGGGACATGGTCGATCCATATAGGCAACCGAAATGTTTACGCCTTTGGTTACCATCTATATTGCGGCTTTTTATACCGAGGTCATCGGCAGTGCCATCACCGAGCAA
CTTTACAAGCTATATCAACCTCTTCAATCCTTATGAAAATGTGCAGGAGAGGTATTGGGAAGATCACCCTGGGGAAGCTGTACCTTTAATGAAACCAAAGTTCTATTATG
GACCCTGGAGGGTAATGAGGGGAGAAGTCCCGGTGCACACAAAACATTACCTCACAAAAAAAAAAAAAAAAAAATATCCGACGTCGGTAGAGAATCCACTGGCTGCAATC
TCAAATAATTTCTCCAAAGCCTGCGGGAACGAGGTGGAAGACGAGCTTGCTTTGTGCGCTATGAGCGGGAGCAACGTCAACAAGGTCTTCTACGCCGAGAACTACCACCC
CATACAAGCCGGCAGCATCGACGGCACCGATGTTCTTCCCCATGACAATGCCGTCTACAGAGCATTACTCTGTTCCTCCGCTGGCCTCTATGATCCCCTTGGCGATCCCA
AGGTCTTTGGAGACCCATATTGCACCCTCTTCGTTGGCCGCCTTTCACATCTGACTACCGAAGAAACTCTTCGTAGAGCTATGGGCAAATATGGTCGGATTAAGAATCTA
CGCTTAGTCAGACACATTGTAACTGGTTCTTCACGTGGTTATGCTTTTGTTGAATACGAAACTGAAAAAGAGATGCGACATGCATATAAGGATGCTCATCATTCGATGAT
AGATGATTGTGAAATTATAGTTGATTACAATCGACAGCAGGTAATGCCAGGATGGATTCCACGGAGATTAGGAGGGGGTCTTGGTGGTAAGAAGGAATCTGGGCAACTTC
GATTTGGAGGAAGAGAAAGACCATTCCGAGCTCCCTTACGTCCGATCCCTTACGATGATTTGAAAAGGCTTGGCATTACTCCTCCACCTAAAGGAAGATACATGTCTCGG
TTTCAGGTACCTTCTCCTCCCAGAAGAGAAACAGATACTATAGGTAGGGAAGAAGGGTCAGGGAACTATGAGAAGATGAAATATGAATCTAGACGAGCCTTAAATGACAA
GGACGATTATTCCCCTCACATGAGCTCTTCTGAGAAGGAAGACCGATATCAGAGGAGCTCTTCTGACAAGGACCGATCTCGCAGAAGGGGTTCTAATGAGAGAGATGATC
ACTCTCGAAAGCGCCACAGATCTCATAGATAG
mRNA sequence
ATGTTTCTCGTCCTATATTTACCTTCATCGAAGAGTGTTTTTGTCAATGCAATCCATGGAAATGTAGCTGCAAAGCAAACCGGCGCAATGGAACCATTCATTCGGCCGCC
ATCACGCCTTTGCGCCATTTCTCATTTCATTTCCCTCCGTTTCTCTCTCTTCTCTCTCCCTGCCCTGATCTCGGTTTACTGTCTTCCCTCCTTCCCCAACTCGCCTCCGT
GCTCTCTTCCCCTCTCGAGAAGATCTTGCATTGGCTCTCTCTCTCTCTCTGGTCTCATCTCTGTCTCGCGGGTGGCTCCTTCATTTTTCTACTGGACCCATCAGGCCCGA
TCGCCAAGATCTCGGCTTAAGAATCCTCACTCGTCGGCTGGGGACGTTCTTGGCCTTCAAAAGCTGCTTCGAGAAAACCCTAGTCTTCTTAATGAAAGAAATCCTATTGT
ATGGTCTCTTTTCTTGAAGAACTTTTCGATGGGACAGACACCACTTCATGTTTCTGCAGGCTATAACAGGGCTGAGATAGTTAAATGTCTTCTTGCCTGGCAAGGTCCAG
AAAAGGTTGAGTTGGAAGCCAAAAACATGTATGGAGAGACTCCATTGCATATGGCAGCAAAGAATGGGTGCAACGATGCGGCACGGGTGCTACTGGCTCATGGTGCTTTT
ATTGAAGCCAAAGCAAATAATGGGATGACACCATTACACCTTGCTGTCTGGTATTCACTCCAGGCTGAAGACTGTGCTACTGTCAAGACATTGCTTGAATATAATGCCGA
TTGTAGTGCGAAGGACGATGAGGGTATGACTCCCCTAAACCATCTGTCACAAGGCTCATGTAGTGGAAAGTTGAGGGAATTGTTGAACCAGCATTTAGAAGAGCAGAGAA
AGCGAAAAGCAATTGAAGCATGCAGTGAAACTAAAGCAAAGATGAAAGAACTCGAAAACGAGCTCTCACATATCGTAGGTTTGCACGAGCTCAAGATACAACTTCATAAA
TGGGCTAAGGGAATGCTTTTAGACGAGAGACGCAGGGCCCTTGGCCTCAAAGTTGGCACCAGGAGGCCCCCTCATATGGCATTTCTTGGCAATCCTGGAACAGGTAAGAC
TATGGTCGCTCGAATACTTGGAAAATTGCTTCACATGGTGGGCATCCTGCCAACAGATAAGGTAACAGAAGTACAGAGAACAGATCTGGTCGGCGAATTCGTTGGTCATA
CTGGTCCAAAAACCAGGAGGAAGATCAAAGAAGCAGAGGGTGGAATCCTTTTCGTTGACGAAGCCTATCGACTGATACCAATGCAAAAAGCAGACGATAAGGATTATGGT
TTGGAAGCGCTGGAAGAGATCATGTCTGTCATGGACAGTGGGAAAATTGTAGTCATATTTGCTGGCTACTGTGAACCAATGAAGCGCGTAATAGCTTCAAATGACGGATT
TTGTCGACGGGTAACCAAGTTTTTCAACTTTAACGACTTCAGTTCAGAAGAATTGGCAAAGATTCTCCATATCAAGATGGATAATCAAACAGAGGATAGCTTGTTATATG
GTTTTAAGTTGTATCCTACTTGCACCATAGAAGCCATTTCGGACCTGATAGATAGAGAAACTAAAGAAAACAGCCTTGCAGATGCTTATGTTCAACACCCTCCCCATCAG
TGCCGTGTTGCTCTGGGGAGTCCAAAGGTGGTGCTATTGAATGAATATAAATCAGAAGCTTGTATAGAAATTCTGGATGCACATTTGGGAGTGACGATAAAATGGCGGGA
GCGCAAAGCAAATCTAAAATTTGCTTGGCGCGCACCGAAGACACACACGGAGACCGAGTCAGGGCTCAGCAACGCCATCGTGACGAACCCACGAGCGCCATTGCTATTGC
CATCTCCATCTTCAGCACCACCTTCACCTCCAAATGACTTGCTGCCCAGGCTCTTCAAGGTTTGCCTTTGCCCTACAATCTCTGGTATTTTTTTTTATCGTGTAATTGTG
AATCCGAATGTTGATTATTTAACTTCCGAGATCCTTGAAATTGATCGGATTCTGAAAAATGAGAAATGGAGCTGGCTAAGAAAACAATGGGTAAAGCTTGTGAAAGAGAG
GAACGTTCACTTATCTGGCAATATCTCAGCTCAGCTCACTTTCGCTGAGACACAAATGTTGATATTCAAGTGGATATGTCCTGATCTCTATATATCAATCAATTTCTACT
TCTCTTCTGGTAGCAATTGTATTGGACGAATTATCGCAGTTGCGCGAAATGACGAGCATCTATGGAGAAAGTATGCCGATTATGTCTACACCAAGTGGGAAAGAACAATC
CTTTGGGACATGGTCGATCCATATAGGCAACCGAAATGTTTACGCCTTTGGTTACCATCTATATTGCGGCTTTTTATACCGAGGTCATCGGCAGTGCCATCACCGAGCAA
CTTTACAAGCTATATCAACCTCTTCAATCCTTATGAAAATGTGCAGGAGAGGTATTGGGAAGATCACCCTGGGGAAGCTGTACCTTTAATGAAACCAAAGTTCTATTATG
GACCCTGGAGGGTAATGAGGGGAGAAGTCCCGGTGCACACAAAACATTACCTCACAAAAAAAAAAAAAAAAAAATATCCGACGTCGGTAGAGAATCCACTGGCTGCAATC
TCAAATAATTTCTCCAAAGCCTGCGGGAACGAGGTGGAAGACGAGCTTGCTTTGTGCGCTATGAGCGGGAGCAACGTCAACAAGGTCTTCTACGCCGAGAACTACCACCC
CATACAAGCCGGCAGCATCGACGGCACCGATGTTCTTCCCCATGACAATGCCGTCTACAGAGCATTACTCTGTTCCTCCGCTGGCCTCTATGATCCCCTTGGCGATCCCA
AGGTCTTTGGAGACCCATATTGCACCCTCTTCGTTGGCCGCCTTTCACATCTGACTACCGAAGAAACTCTTCGTAGAGCTATGGGCAAATATGGTCGGATTAAGAATCTA
CGCTTAGTCAGACACATTGTAACTGGTTCTTCACGTGGTTATGCTTTTGTTGAATACGAAACTGAAAAAGAGATGCGACATGCATATAAGGATGCTCATCATTCGATGAT
AGATGATTGTGAAATTATAGTTGATTACAATCGACAGCAGGTAATGCCAGGATGGATTCCACGGAGATTAGGAGGGGGTCTTGGTGGTAAGAAGGAATCTGGGCAACTTC
GATTTGGAGGAAGAGAAAGACCATTCCGAGCTCCCTTACGTCCGATCCCTTACGATGATTTGAAAAGGCTTGGCATTACTCCTCCACCTAAAGGAAGATACATGTCTCGG
TTTCAGGTACCTTCTCCTCCCAGAAGAGAAACAGATACTATAGGTAGGGAAGAAGGGTCAGGGAACTATGAGAAGATGAAATATGAATCTAGACGAGCCTTAAATGACAA
GGACGATTATTCCCCTCACATGAGCTCTTCTGAGAAGGAAGACCGATATCAGAGGAGCTCTTCTGACAAGGACCGATCTCGCAGAAGGGGTTCTAATGAGAGAGATGATC
ACTCTCGAAAGCGCCACAGATCTCATAGATAG
Protein sequence
MFLVLYLPSSKSVFVNAIHGNVAAKQTGAMEPFIRPPSRLCAISHFISLRFSLFSLPALISVYCLPSFPNSPPCSLPLSRRSCIGSLSLSGLISVSRVAPSFFYWTHQAR
SPRSRLKNPHSSAGDVLGLQKLLRENPSLLNERNPIVWSLFLKNFSMGQTPLHVSAGYNRAEIVKCLLAWQGPEKVELEAKNMYGETPLHMAAKNGCNDAARVLLAHGAF
IEAKANNGMTPLHLAVWYSLQAEDCATVKTLLEYNADCSAKDDEGMTPLNHLSQGSCSGKLRELLNQHLEEQRKRKAIEACSETKAKMKELENELSHIVGLHELKIQLHK
WAKGMLLDERRRALGLKVGTRRPPHMAFLGNPGTGKTMVARILGKLLHMVGILPTDKVTEVQRTDLVGEFVGHTGPKTRRKIKEAEGGILFVDEAYRLIPMQKADDKDYG
LEALEEIMSVMDSGKIVVIFAGYCEPMKRVIASNDGFCRRVTKFFNFNDFSSEELAKILHIKMDNQTEDSLLYGFKLYPTCTIEAISDLIDRETKENSLADAYVQHPPHQ
CRVALGSPKVVLLNEYKSEACIEILDAHLGVTIKWRERKANLKFAWRAPKTHTETESGLSNAIVTNPRAPLLLPSPSSAPPSPPNDLLPRLFKVCLCPTISGIFFYRVIV
NPNVDYLTSEILEIDRILKNEKWSWLRKQWVKLVKERNVHLSGNISAQLTFAETQMLIFKWICPDLYISINFYFSSGSNCIGRIIAVARNDEHLWRKYADYVYTKWERTI
LWDMVDPYRQPKCLRLWLPSILRLFIPRSSAVPSPSNFTSYINLFNPYENVQERYWEDHPGEAVPLMKPKFYYGPWRVMRGEVPVHTKHYLTKKKKKKYPTSVENPLAAI
SNNFSKACGNEVEDELALCAMSGSNVNKVFYAENYHPIQAGSIDGTDVLPHDNAVYRALLCSSAGLYDPLGDPKVFGDPYCTLFVGRLSHLTTEETLRRAMGKYGRIKNL
RLVRHIVTGSSRGYAFVEYETEKEMRHAYKDAHHSMIDDCEIIVDYNRQQVMPGWIPRRLGGGLGGKKESGQLRFGGRERPFRAPLRPIPYDDLKRLGITPPPKGRYMSR
FQVPSPPRRETDTIGREEGSGNYEKMKYESRRALNDKDDYSPHMSSSEKEDRYQRSSSDKDRSRRRGSNERDDHSRKRHRSHR
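
The protein sequence above should correspond to a conceptual translation of the CDS sequence. The following is a minimal sketch of that check, assuming the standard nuclear genetic code; the names `CODON_TABLE` and `translate` are illustrative only and are not part of the database. For brevity it translates just the first 36 and last 18 nucleotides of the CDS.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


cds_head = "ATGTTTCTCGTCCTATATTTACCTTCATCGAAGAGT"  # first 36 nt of the CDS
cds_tail = "CACAGATCTCATAGATAG"                    # last 18 nt of the CDS

print(translate(cds_head))  # MFLVLYLPSSKS
print(translate(cds_tail))  # HRSHR
```

Both fragments agree with the start (`MFLVLYLPSSKS...`) and end (`...HRSHR`) of the protein sequence listed in the record. Passing the full CDS, with its line breaks, to `translate` would allow the same comparison over the whole sequence.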