; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI05G08710 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI05G08710
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionDUF4283 domain-containing protein
Genome locationChr5:7363906..7366162
RNA-Seq ExpressionCSPI05G08710
SyntenyCSPI05G08710
Gene Ontology termsNA
InterPro domainsIPR025558 - Domain of unknown function DUF4283
IPR040256 - Uncharacterized protein At4g02000-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0034062.1 uncharacterized protein E6C27_scaffold65G00480 [Cucumis melo var. makuwa]0.0e+0090.44Show/hide
Query:  MALQSGHPPTGAGDDEAAARNYLSRKKPKVPPPIPPSSDFHSRRSTTIATVCNCNLTPSETTRITQQFVHSLIARVVGKDTRPGQLAARLRHHLRLTQDV
        MALQS HPPTGAGDDEAAARNYLSRKKPKVPPPI PSSDF SR STTIATVCNCNLTPSETTRITQQF+HSLIARVVGKDTRPGQLAARLRHHLRLTQDV
Subjt:  MALQSGHPPTGAGDDEAAARNYLSRKKPKVPPPIPPSSDFHSRRSTTIATVCNCNLTPSETTRITQQFVHSLIARVVGKDTRPGQLAARLRHHLRLTQDV

Query:  KVFQLGLGYFVLKFSETDYLALEDLPWSIPNLCIHAFPWTPDFKPSEAINSSVNVWIRLPELSIEYYDVGILKRIADAIGGPLVKIDPVTRDRWKCKFAR
        KVF+LGLGYFVLKFSETDYLALEDLPWSIPNLCIHAFPWTPDFKPSEAINSSVNVWIRLPELSIEYYDV ILKRIADAIGG LVKIDPVTRDRWKCKFAR
Subjt:  KVFQLGLGYFVLKFSETDYLALEDLPWSIPNLCIHAFPWTPDFKPSEAINSSVNVWIRLPELSIEYYDVGILKRIADAIGGPLVKIDPVTRDRWKCKFAR

Query:  FCISVNLCDPLPSMIELGRIRQRIEYEGFELCAKCNRVGDLRHDCSSLNNPSLNNPSLNNPSGSYGFNPHGDEPHHSVTRDFKEIGSTSNSKQPLIPESS
        FCISVNLCDPLPSMIELGRIRQRIEYEGFELCAKCNRVGDLRHDCS          SLNNPSGSYGFNPHGDEPHHSVTR FKE GSTS+SKQPLIPESS
Subjt:  FCISVNLCDPLPSMIELGRIRQRIEYEGFELCAKCNRVGDLRHDCSSLNNPSLNNPSLNNPSGSYGFNPHGDEPHHSVTRDFKEIGSTSNSKQPLIPESS

Query:  PVSAWESSRFIEKNPPLDLKLIDWPNLPKRESGKAGTGVRISSPRVHVKDKESPKKKEKCETSVQRLPSLPK-QCSTITIKAPELKRVVPSVVEDRLKDT
         VSAWESSRFIEKNP LDLK I+WPNLPK ESGKAGT VRISSP VHVKDK  PKKKEKCE SVQ LPSLPK Q STITIKAPELK VVPSVVED+LKD 
Subjt:  PVSAWESSRFIEKNPPLDLKLIDWPNLPKRESGKAGTGVRISSPRVHVKDKESPKKKEKCETSVQRLPSLPK-QCSTITIKAPELKRVVPSVVEDRLKDT

Query:  KTINSTMIADHNSQPPSPTASIPFLQPSPASEATLKFLSDAILCLTRKEEICNSPSKVINDSSFPTVYTIDPKKITSLNIALSEVQ----------TIEL
        KTINSTMIADHNSQPPSPTASIPFLQPSPASEATLKFLSDAILCLTRKEEICNSPSK  NDSSFPTVYTIDPKKITSLNI+LSEVQ          TIEL
Subjt:  KTINSTMIADHNSQPPSPTASIPFLQPSPASEATLKFLSDAILCLTRKEEICNSPSKVINDSSFPTVYTIDPKKITSLNIALSEVQ----------TIEL

Query:  VPTMKGGDEGGVGPEVESGSEPCAKKILVWKFHVMDNAKLMRALKDLIQLHEPSIVLIFGNKISGVDTDKVMRELAFCGSYSSKPDGYNGGVWLLLSKQD
        VPTMKGGD+GGVG EVESGSEPCAKK+LVWKFH MDNAKLMRALKDLIQLHEPSIVLIFGNKI+GVD  KVM+ELAFCGSYSS+PDGYNGGVWLLLSKQD
Subjt:  VPTMKGGDEGGVGPEVESGSEPCAKKILVWKFHVMDNAKLMRALKDLIQLHEPSIVLIFGNKISGVDTDKVMRELAFCGSYSSKPDGYNGGVWLLLSKQD

Query:  VQTKVNSFSSQQVSASVTFHSETNVQSFSPSNADTKTSSGPWGSTFFYTSTNWMTSLAY
        VQTKVNS+S QQVSASVTFHSETNVQ FSPSNADT+TSSGPWGSTFFYTSTNWMTSLAY
Subjt:  VQTKVNSFSSQQVSASVTFHSETNVQSFSPSNADTKTSSGPWGSTFFYTSTNWMTSLAY

KAG6600114.1 hypothetical protein SDJN03_05347, partial [Cucurbita argyrosperma subsp. sororia]9.9e-20061.25Show/hide
Query:  QSGHPPTGAGDDEAAARNYLSRKKPKVPPPIPPSSDFHSRRSTTIATVCNCNLTPSETTRITQQFVHSLIARVVGKDTRPGQLAARLRHHLRLTQDVKVF
        QS   PTGAGDDEAAAR YLSRKK K  P +  SSD  S RSTT ATV  CNL+PS+T RITQQF HSLIA V G+D RP QLA RLR HL LTQDV+VF
Subjt:  QSGHPPTGAGDDEAAARNYLSRKKPKVPPPIPPSSDFHSRRSTTIATVCNCNLTPSETTRITQQFVHSLIARVVGKDTRPGQLAARLRHHLRLTQDVKVF

Query:  QLGLGYFVLKFSETDYLALEDLPWSIPNLCIHAFPWTPDFKPSEAINSSVNVWIRLPELSIEYYDVGILKRIADAIGGPLVKIDPVTRDRWKCKFARFCI
        +LGLGYFVLKFSETDYLALEDLPWSIPNLCI+AF WTPDFKPSEAINSSV+VWIRL ELSIEYYD  IL++IA  IGG LVK DPVT++R KCKFAR CI
Subjt:  QLGLGYFVLKFSETDYLALEDLPWSIPNLCIHAFPWTPDFKPSEAINSSVNVWIRLPELSIEYYDVGILKRIADAIGGPLVKIDPVTRDRWKCKFARFCI

Query:  SVNLCDPLPSMIELGRIRQRIEYEGFE-LCAKCNRVGDLRHDCSSLNNPSLNNPSLNNPSGSYGFNPHGDEP-HHSVTRDFKEIGSTSNSKQPLIPE-SS
         +NLCDPLPSMI+LGRI+Q+IEYEG + LC  C  V DL+ +C       LN+    NPSGS G +  GD P HH  TR   E+GS+S+SKQPLIP  SS
Subjt:  SVNLCDPLPSMIELGRIRQRIEYEGFE-LCAKCNRVGDLRHDCSSLNNPSLNNPSLNNPSGSYGFNPHGDEP-HHSVTRDFKEIGSTSNSKQPLIPE-SS

Query:  PVSAWESSRFIEKNPPLDLKLIDWPNLPKRESGKAGTGVRISSPRVHVKDKESPKKKEKCETSVQRLPSLPKQCSTITIKAPELKRVVPSVVEDRLKDTK
        P SA  S   + +N   DL L+D       E  KA   +RISSP VHVKDK + K KE C   V+ LP LPK+ ST T KAPEL+ V P+VVE + K  K
Subjt:  PVSAWESSRFIEKNPPLDLKLIDWPNLPKRESGKAGTGVRISSPRVHVKDKESPKKKEKCETSVQRLPSLPKQCSTITIKAPELKRVVPSVVEDRLKDTK

Query:  TINSTMIADHNSQPPSPTASIPFLQPSPASEATLKFLSDAILCLTRKEEICNSPSKVINDSSFPTVYTIDPKKITSLNIALSEVQ----------TIELV
        T N T+IADHN+QP      +P        +ATL F+S  I    +++E+ + PSK I     P V+TI+ KKI S  + LS +Q          T++ +
Subjt:  TINSTMIADHNSQPPSPTASIPFLQPSPASEATLKFLSDAILCLTRKEEICNSPSKVINDSSFPTVYTIDPKKITSLNIALSEVQ----------TIELV

Query:  PTMKGGDEGGVGPEVESGSEPCAKKILVWKFHVMDNAKLMRALKDLIQLHEPSIVLIFGNKISGVDTDKVMRELAFCGSYSSKPDGYNGGVWLLLSKQDV
        PT +  DE G G +  SGSE C+KK+L WKFH  DNA LM+ALKDLIQLHEPSIVLIFG KISG + + V+REL+FCGSY  KPDGYNGGVWLLLS+QDV
Subjt:  PTMKGGDEGGVGPEVESGSEPCAKKILVWKFHVMDNAKLMRALKDLIQLHEPSIVLIFGNKISGVDTDKVMRELAFCGSYSSKPDGYNGGVWLLLSKQDV

Query:  QTKVNSFSSQQVSASVTFHSETNVQSFSPSNADTKTSSGPWGSTFFYTSTNWMTSLAY
        Q +V+S+S QQVSASV F S TN  +FSP N DT+TSSGPWGSTFFYTSTNWM+S+AY
Subjt:  QTKVNSFSSQQVSASVTFHSETNVQSFSPSNADTKTSSGPWGSTFFYTSTNWMTSLAY

KAG7030785.1 hypothetical protein SDJN02_04822, partial [Cucurbita argyrosperma subsp. argyrosperma]5.8e-20060.7Show/hide
Query:  QSGHPPTGAGDDEAAARNYLSRKKPKVPPPIPPSSDFHSRRSTTIATVCNCNLTPSETTRITQQFVHSLIARVVGKDTRPGQLAARLRHHLRLTQDVKVF
        QS   PTGAGDDEAAAR YLSRKK K  P +  SSD  S RSTT ATV  CNL+PS+T RITQQF HSLIA V G+D RP QLA RLR HL LTQDV+VF
Subjt:  QSGHPPTGAGDDEAAARNYLSRKKPKVPPPIPPSSDFHSRRSTTIATVCNCNLTPSETTRITQQFVHSLIARVVGKDTRPGQLAARLRHHLRLTQDVKVF

Query:  QLGLGYFVLKFSETDYLALEDLPWSIPNLCIHAFPWTPDFKPSEAINSSVNVWIRLPELSIEYYDVGILKRIADAIGGPLVKIDPVTRDRWKCKFARFCI
        +LGLGYFVLKFSETDYLALEDLPWSIPNLCI+AF WTPDFKPSEAINSSV+VWIRL ELSIEYYD  IL++IA  IGG LVK DPVT++R KCKFAR CI
Subjt:  QLGLGYFVLKFSETDYLALEDLPWSIPNLCIHAFPWTPDFKPSEAINSSVNVWIRLPELSIEYYDVGILKRIADAIGGPLVKIDPVTRDRWKCKFARFCI

Query:  SVNLCDPLPSMIELGRIRQRIEYEGFE-LCAKCNRVGDLRHDCSSLNNPSLNNPSLNNPSGSYGFNPHGDEP-HHSVTRDFKEIGSTSNSKQPLIPESSP
         +NLCDPLPSMI+LGRI+Q+IEYEG + LC  C  V DL+ +C       LN+    NPSGS G +  GD P HH  TR   E+GS+S+SKQPLIP  S 
Subjt:  SVNLCDPLPSMIELGRIRQRIEYEGFE-LCAKCNRVGDLRHDCSSLNNPSLNNPSLNNPSGSYGFNPHGDEP-HHSVTRDFKEIGSTSNSKQPLIPESSP

Query:  VSAWESSRF--IEKNPPLDLKLIDWPNLPKRESGKAGTGVRISSPRVHVKDKESPKKKEKCETSVQRLPSLPKQCSTITIKAPELKRVVPSVVEDRLKDT
         ++   SRF  +E +  LD            E  KA   +RISSP VHVKDK + K KE C   V+ LP LPK+ ST T KAPEL+ V P+VVE + K  
Subjt:  VSAWESSRF--IEKNPPLDLKLIDWPNLPKRESGKAGTGVRISSPRVHVKDKESPKKKEKCETSVQRLPSLPKQCSTITIKAPELKRVVPSVVEDRLKDT

Query:  KTINSTMIADHNSQPPSPTASIPFLQPSPASEATLKFLSDAILCLTRKEEICNSPSKVINDSSFPTVYTIDPKKITSLNIALSEVQ----------TIEL
        KT N T+IADHN+QP      +P        +ATL F+S  I   T+++E+ + PSK I     P V+TI+ KKI S  + LS +Q          T++ 
Subjt:  KTINSTMIADHNSQPPSPTASIPFLQPSPASEATLKFLSDAILCLTRKEEICNSPSKVINDSSFPTVYTIDPKKITSLNIALSEVQ----------TIEL

Query:  VPTMKGGDEGGVGPEVESGSEPCAKKILVWKFHVMDNAKLMRALKDLIQLHEPSIVLIFGNKISGVDTDKVMRELAFCGSYSSKPDGYNGGVWLLLSKQD
        +PT +  DE G G +  SGSE C+KK+L WKFH  DNA LM+ALKDLIQLHEPSIVLIFG KISG + + V+REL+FCGSY  KPDGYNGGVWLLLS+QD
Subjt:  VPTMKGGDEGGVGPEVESGSEPCAKKILVWKFHVMDNAKLMRALKDLIQLHEPSIVLIFGNKISGVDTDKVMRELAFCGSYSSKPDGYNGGVWLLLSKQD

Query:  VQTKVNSFSSQQVSASVTFHSETNVQSFSPSNADTKTSSGPWGSTFFYTSTNWMTSLAY
        VQ +V+S+S QQVSASV F S TN  +FSP N DT+TSSGPWGSTFFYTSTNWM+S+AY
Subjt:  VQTKVNSFSSQQVSASVTFHSETNVQSFSPSNADTKTSSGPWGSTFFYTSTNWMTSLAY

KAG7031864.1 hypothetical protein SDJN02_05905, partial [Cucurbita argyrosperma subsp. argyrosperma]2.3e-18858.27Show/hide
Query:  MALQSGHPPTGAGDDEAAARNYLSRKKPKVPPPIPPSSDFHSRRSTTIATVCNCNLTPSETTRITQQFVHSLIARVVGKDTRPGQLAARLRHHLRLTQDV
        MALQS +  TGAGDDEAAA                      + R+   ATV  C L+ S+T RITQQF HS IA + GKD RP ++A+ LR HL LT  V
Subjt:  MALQSGHPPTGAGDDEAAARNYLSRKKPKVPPPIPPSSDFHSRRSTTIATVCNCNLTPSETTRITQQFVHSLIARVVGKDTRPGQLAARLRHHLRLTQDV

Query:  KVFQLGLGYFVLKFSETDYLALEDLPWSIPNLCIHAFPWTPDFKPSEAINSSVNVWIRLPELSIEYYDVGILKRIADAIGGPLVKIDPVTRDRWKCKFAR
        KVF+LGLGYFVLKF ETD+LAL+DLPWS+PNLCIH  PWTPDFKPSE I SSV+VW+RL ELSIEYYD  +L++IA AIGG LVKIDPVT++R KCKFAR
Subjt:  KVFQLGLGYFVLKFSETDYLALEDLPWSIPNLCIHAFPWTPDFKPSEAINSSVNVWIRLPELSIEYYDVGILKRIADAIGGPLVKIDPVTRDRWKCKFAR

Query:  FCISVNLCDPLPSMIELGRIRQRIEYEGFE-LCAKCNRVGDLRHDCSSLNNPSLNNPSLNNPSGSYGFNPHGDEPHHSVTRDFKEIGSTSNSKQPLIPES
         C+ VNLCDPLPSMI LG+IRQ IEYEGFE LC  C+RV  LRH+C           +L  PSG  GFNPH  +PHH   R F         KQPLIP  
Subjt:  FCISVNLCDPLPSMIELGRIRQRIEYEGFE-LCAKCNRVGDLRHDCSSLNNPSLNNPSLNNPSGSYGFNPHGDEPHHSVTRDFKEIGSTSNSKQPLIPES

Query:  SPVSAWESSRFIEKNPPLDLKLIDWPNLPKRESGKAGTGVRISSPRVHVKDKESPKKKEKCETSVQRLPSLPKQCSTITIKAPELKRVVPSVVEDRLKDT
        S   +   SRF      LDL   +WP L   ESGKAGT +R SS  VHVKDK   KKKEKC  SVQ    LPK+ S +TIK             D+LK  
Subjt:  SPVSAWESSRFIEKNPPLDLKLIDWPNLPKRESGKAGTGVRISSPRVHVKDKESPKKKEKCETSVQRLPSLPKQCSTITIKAPELKRVVPSVVEDRLKDT

Query:  KTINSTMIADHNSQPPSPTASIPFLQPSPASEATLKFLSDAILCLTRKEEICNSPSKVINDSSFPTVYTIDPKKITSLNIALSEVQT----------IEL
        KT N+  +   N QPPSPT S+P L P PASEA L F S AI   TR +EI ++PSK IN  S PTVYTIDP KI +L+IALSE +T          IE 
Subjt:  KTINSTMIADHNSQPPSPTASIPFLQPSPASEATLKFLSDAILCLTRKEEICNSPSKVINDSSFPTVYTIDPKKITSLNIALSEVQT----------IEL

Query:  VPTMKGGDEGGVGPEVESGSEPCAKKILVWKFHVMDNAKLMRALKDLIQLHEPSIVLIFGNKISGVDTDKVMRELAFCGSYSSKPDGYNGGVWLLLSKQD
        VPT + GD+GG    V+SGSE C KKIL WKFH  DN KL+R+LKDLI+LHEPSIVLIFG KISG D DKV++EL FC SY  KPDGY+GGVWLLLS QD
Subjt:  VPTMKGGDEGGVGPEVESGSEPCAKKILVWKFHVMDNAKLMRALKDLIQLHEPSIVLIFGNKISGVDTDKVMRELAFCGSYSSKPDGYNGGVWLLLSKQD

Query:  VQTKVNSFSSQQVSASVTFHSETNVQSFSPSNADTKTSSGPWGSTFFYTSTNWMTSLAY
        V+TKVNS S QQ+ AS+ F S+TN  +F+PS   TK SSGPWGS FF+T TNWMTS+AY
Subjt:  VQTKVNSFSSQQVSASVTFHSETNVQSFSPSNADTKTSSGPWGSTFFYTSTNWMTSLAY

KGN50455.1 hypothetical protein Csa_000357 [Cucumis sativus]0.0e+0098.61Show/hide
Query:  MALQSGHPPTGAGDDEAAARNYLSRKKPKVPPPIPPSSDFHSRRSTTIATVCNCNLTPSETTRITQQFVHSLIARVVGKDTRPGQLAARLRHHLRLTQDV
        MA QSGHPPTGAGDDEAAARNYLSRKKPKVPPPIPPSSDFHSRRSTTIATVCNCNLTPSETTRITQQFVHSLIARVVGKDTRPGQLAARLRHHLRLTQDV
Subjt:  MALQSGHPPTGAGDDEAAARNYLSRKKPKVPPPIPPSSDFHSRRSTTIATVCNCNLTPSETTRITQQFVHSLIARVVGKDTRPGQLAARLRHHLRLTQDV

Query:  KVFQLGLGYFVLKFSETDYLALEDLPWSIPNLCIHAFPWTPDFKPSEAINSSVNVWIRLPELSIEYYDVGILKRIADAIGGPLVKIDPVTRDRWKCKFAR
        KVFQLGLGYFVLKFSETDYLALEDLPWSIPNLCIHAFPWTPDFKPSEAINSSVNVWIRLPELSIEYYDVGILKRIADAIG PLVKIDPVTRDRWKCKFAR
Subjt:  KVFQLGLGYFVLKFSETDYLALEDLPWSIPNLCIHAFPWTPDFKPSEAINSSVNVWIRLPELSIEYYDVGILKRIADAIGGPLVKIDPVTRDRWKCKFAR

Query:  FCISVNLCDPLPSMIELGRIRQRIEYEGFELCAKCNRVGDLRHDCSSLNNPSLNNPSLNNPSGSYGFNPHGDEPHHSVTRDFKEIGSTSNSKQPLIPESS
        FCISVNLCDPLPSMIELGR+RQRIEYEGFELCAKCNRVGDLRHDCSSLNNPSLNNPSLNNPSGSYGFNPHGDEPHHSVTRDFKEIGSTSNSKQPLIPESS
Subjt:  FCISVNLCDPLPSMIELGRIRQRIEYEGFELCAKCNRVGDLRHDCSSLNNPSLNNPSLNNPSGSYGFNPHGDEPHHSVTRDFKEIGSTSNSKQPLIPESS

Query:  PVSAWESSRFIEKNPPLDLKLIDWPNLPKRESGKAGTGVRISSPRVHVKDKESPKKKEKCETSVQRLPSLPKQCSTITIKAPELKRVVPSVVEDRLKDTK
        PVSAWESSRFIEKNPPLDLKLIDWPNLPKRESGKAG+GVRISSPRVHVKDKE PKKKEKCE SVQRLP+LPKQCSTITIKAPELKRVVPSVVEDRLKDTK
Subjt:  PVSAWESSRFIEKNPPLDLKLIDWPNLPKRESGKAGTGVRISSPRVHVKDKESPKKKEKCETSVQRLPSLPKQCSTITIKAPELKRVVPSVVEDRLKDTK

Query:  TINSTMIADHNSQPPSPTASIPFLQPSPASEATLKFLSDAILCLTRKEEICNSPSKVINDSSFPTVYTIDPKKITSLNIALSEVQTIELVPTMKGGDEGG
        TINSTMIADHNSQPPSPTASIPFLQPSPASEATLKFLSDAILCLTRKEEICNSPSKVINDSSFPTVYTIDPKKITSLNIALSEVQTIELVPTMKGGDEGG
Subjt:  TINSTMIADHNSQPPSPTASIPFLQPSPASEATLKFLSDAILCLTRKEEICNSPSKVINDSSFPTVYTIDPKKITSLNIALSEVQTIELVPTMKGGDEGG

Query:  VGPEVESGSEPCAKKILVWKFHVMDNAKLMRALKDLIQLHEPSIVLIFGNKISGVDTDKVMRELAFCGSYSSKPDGYNGGVWLLLSKQDVQTKVNSFSSQ
        VG EVESGSEPCAKKILVWKFHVMDNAKLMRALKDLIQLHEPSIVLIFGNKISGVDTDKVMRELAFCGSYSSKPDGYNGGVWLLLSKQDVQTKVNSFSSQ
Subjt:  VGPEVESGSEPCAKKILVWKFHVMDNAKLMRALKDLIQLHEPSIVLIFGNKISGVDTDKVMRELAFCGSYSSKPDGYNGGVWLLLSKQDVQTKVNSFSSQ

Query:  QVSASVTFHSETNVQSFSPSNADTKTSSGPWGSTFFYTSTNWMTSLAY
        QVSASVTFHSETNVQ FSPSNADTKTSSGPWGSTFFYTSTNWMTSLAY
Subjt:  QVSASVTFHSETNVQSFSPSNADTKTSSGPWGSTFFYTSTNWMTSLAY

TrEMBL top hitse value%identityAlignment
A0A0A0KLB0 DUF4283 domain-containing protein0.0e+0098.61Show/hide
Query:  MALQSGHPPTGAGDDEAAARNYLSRKKPKVPPPIPPSSDFHSRRSTTIATVCNCNLTPSETTRITQQFVHSLIARVVGKDTRPGQLAARLRHHLRLTQDV
        MA QSGHPPTGAGDDEAAARNYLSRKKPKVPPPIPPSSDFHSRRSTTIATVCNCNLTPSETTRITQQFVHSLIARVVGKDTRPGQLAARLRHHLRLTQDV
Subjt:  MALQSGHPPTGAGDDEAAARNYLSRKKPKVPPPIPPSSDFHSRRSTTIATVCNCNLTPSETTRITQQFVHSLIARVVGKDTRPGQLAARLRHHLRLTQDV

Query:  KVFQLGLGYFVLKFSETDYLALEDLPWSIPNLCIHAFPWTPDFKPSEAINSSVNVWIRLPELSIEYYDVGILKRIADAIGGPLVKIDPVTRDRWKCKFAR
        KVFQLGLGYFVLKFSETDYLALEDLPWSIPNLCIHAFPWTPDFKPSEAINSSVNVWIRLPELSIEYYDVGILKRIADAIG PLVKIDPVTRDRWKCKFAR
Subjt:  KVFQLGLGYFVLKFSETDYLALEDLPWSIPNLCIHAFPWTPDFKPSEAINSSVNVWIRLPELSIEYYDVGILKRIADAIGGPLVKIDPVTRDRWKCKFAR

Query:  FCISVNLCDPLPSMIELGRIRQRIEYEGFELCAKCNRVGDLRHDCSSLNNPSLNNPSLNNPSGSYGFNPHGDEPHHSVTRDFKEIGSTSNSKQPLIPESS
        FCISVNLCDPLPSMIELGR+RQRIEYEGFELCAKCNRVGDLRHDCSSLNNPSLNNPSLNNPSGSYGFNPHGDEPHHSVTRDFKEIGSTSNSKQPLIPESS
Subjt:  FCISVNLCDPLPSMIELGRIRQRIEYEGFELCAKCNRVGDLRHDCSSLNNPSLNNPSLNNPSGSYGFNPHGDEPHHSVTRDFKEIGSTSNSKQPLIPESS

Query:  PVSAWESSRFIEKNPPLDLKLIDWPNLPKRESGKAGTGVRISSPRVHVKDKESPKKKEKCETSVQRLPSLPKQCSTITIKAPELKRVVPSVVEDRLKDTK
        PVSAWESSRFIEKNPPLDLKLIDWPNLPKRESGKAG+GVRISSPRVHVKDKE PKKKEKCE SVQRLP+LPKQCSTITIKAPELKRVVPSVVEDRLKDTK
Subjt:  PVSAWESSRFIEKNPPLDLKLIDWPNLPKRESGKAGTGVRISSPRVHVKDKESPKKKEKCETSVQRLPSLPKQCSTITIKAPELKRVVPSVVEDRLKDTK

Query:  TINSTMIADHNSQPPSPTASIPFLQPSPASEATLKFLSDAILCLTRKEEICNSPSKVINDSSFPTVYTIDPKKITSLNIALSEVQTIELVPTMKGGDEGG
        TINSTMIADHNSQPPSPTASIPFLQPSPASEATLKFLSDAILCLTRKEEICNSPSKVINDSSFPTVYTIDPKKITSLNIALSEVQTIELVPTMKGGDEGG
Subjt:  TINSTMIADHNSQPPSPTASIPFLQPSPASEATLKFLSDAILCLTRKEEICNSPSKVINDSSFPTVYTIDPKKITSLNIALSEVQTIELVPTMKGGDEGG

Query:  VGPEVESGSEPCAKKILVWKFHVMDNAKLMRALKDLIQLHEPSIVLIFGNKISGVDTDKVMRELAFCGSYSSKPDGYNGGVWLLLSKQDVQTKVNSFSSQ
        VG EVESGSEPCAKKILVWKFHVMDNAKLMRALKDLIQLHEPSIVLIFGNKISGVDTDKVMRELAFCGSYSSKPDGYNGGVWLLLSKQDVQTKVNSFSSQ
Subjt:  VGPEVESGSEPCAKKILVWKFHVMDNAKLMRALKDLIQLHEPSIVLIFGNKISGVDTDKVMRELAFCGSYSSKPDGYNGGVWLLLSKQDVQTKVNSFSSQ

Query:  QVSASVTFHSETNVQSFSPSNADTKTSSGPWGSTFFYTSTNWMTSLAY
        QVSASVTFHSETNVQ FSPSNADTKTSSGPWGSTFFYTSTNWMTSLAY
Subjt:  QVSASVTFHSETNVQSFSPSNADTKTSSGPWGSTFFYTSTNWMTSLAY

A0A5A7SSJ3 DUF4283 domain-containing protein0.0e+0090.44Show/hide
Query:  MALQSGHPPTGAGDDEAAARNYLSRKKPKVPPPIPPSSDFHSRRSTTIATVCNCNLTPSETTRITQQFVHSLIARVVGKDTRPGQLAARLRHHLRLTQDV
        MALQS HPPTGAGDDEAAARNYLSRKKPKVPPPI PSSDF SR STTIATVCNCNLTPSETTRITQQF+HSLIARVVGKDTRPGQLAARLRHHLRLTQDV
Subjt:  MALQSGHPPTGAGDDEAAARNYLSRKKPKVPPPIPPSSDFHSRRSTTIATVCNCNLTPSETTRITQQFVHSLIARVVGKDTRPGQLAARLRHHLRLTQDV

Query:  KVFQLGLGYFVLKFSETDYLALEDLPWSIPNLCIHAFPWTPDFKPSEAINSSVNVWIRLPELSIEYYDVGILKRIADAIGGPLVKIDPVTRDRWKCKFAR
        KVF+LGLGYFVLKFSETDYLALEDLPWSIPNLCIHAFPWTPDFKPSEAINSSVNVWIRLPELSIEYYDV ILKRIADAIGG LVKIDPVTRDRWKCKFAR
Subjt:  KVFQLGLGYFVLKFSETDYLALEDLPWSIPNLCIHAFPWTPDFKPSEAINSSVNVWIRLPELSIEYYDVGILKRIADAIGGPLVKIDPVTRDRWKCKFAR

Query:  FCISVNLCDPLPSMIELGRIRQRIEYEGFELCAKCNRVGDLRHDCSSLNNPSLNNPSLNNPSGSYGFNPHGDEPHHSVTRDFKEIGSTSNSKQPLIPESS
        FCISVNLCDPLPSMIELGRIRQRIEYEGFELCAKCNRVGDLRHDCS          SLNNPSGSYGFNPHGDEPHHSVTR FKE GSTS+SKQPLIPESS
Subjt:  FCISVNLCDPLPSMIELGRIRQRIEYEGFELCAKCNRVGDLRHDCSSLNNPSLNNPSLNNPSGSYGFNPHGDEPHHSVTRDFKEIGSTSNSKQPLIPESS

Query:  PVSAWESSRFIEKNPPLDLKLIDWPNLPKRESGKAGTGVRISSPRVHVKDKESPKKKEKCETSVQRLPSLPK-QCSTITIKAPELKRVVPSVVEDRLKDT
         VSAWESSRFIEKNP LDLK I+WPNLPK ESGKAGT VRISSP VHVKDK  PKKKEKCE SVQ LPSLPK Q STITIKAPELK VVPSVVED+LKD 
Subjt:  PVSAWESSRFIEKNPPLDLKLIDWPNLPKRESGKAGTGVRISSPRVHVKDKESPKKKEKCETSVQRLPSLPK-QCSTITIKAPELKRVVPSVVEDRLKDT

Query:  KTINSTMIADHNSQPPSPTASIPFLQPSPASEATLKFLSDAILCLTRKEEICNSPSKVINDSSFPTVYTIDPKKITSLNIALSEVQ----------TIEL
        KTINSTMIADHNSQPPSPTASIPFLQPSPASEATLKFLSDAILCLTRKEEICNSPSK  NDSSFPTVYTIDPKKITSLNI+LSEVQ          TIEL
Subjt:  KTINSTMIADHNSQPPSPTASIPFLQPSPASEATLKFLSDAILCLTRKEEICNSPSKVINDSSFPTVYTIDPKKITSLNIALSEVQ----------TIEL

Query:  VPTMKGGDEGGVGPEVESGSEPCAKKILVWKFHVMDNAKLMRALKDLIQLHEPSIVLIFGNKISGVDTDKVMRELAFCGSYSSKPDGYNGGVWLLLSKQD
        VPTMKGGD+GGVG EVESGSEPCAKK+LVWKFH MDNAKLMRALKDLIQLHEPSIVLIFGNKI+GVD  KVM+ELAFCGSYSS+PDGYNGGVWLLLSKQD
Subjt:  VPTMKGGDEGGVGPEVESGSEPCAKKILVWKFHVMDNAKLMRALKDLIQLHEPSIVLIFGNKISGVDTDKVMRELAFCGSYSSKPDGYNGGVWLLLSKQD

Query:  VQTKVNSFSSQQVSASVTFHSETNVQSFSPSNADTKTSSGPWGSTFFYTSTNWMTSLAY
        VQTKVNS+S QQVSASVTFHSETNVQ FSPSNADT+TSSGPWGSTFFYTSTNWMTSLAY
Subjt:  VQTKVNSFSSQQVSASVTFHSETNVQSFSPSNADTKTSSGPWGSTFFYTSTNWMTSLAY

A0A5A7SUD3 DUF4283 domain-containing protein3.6e-11542.48Show/hide
Query:  SGHPPTGAGDDEAAARNYLSRKKPKVPPPIPPSSDFHSRRSTTIATVCNCNLTPSETTRITQQFVHSLIARVVGKDTRPGQLAARLRHHLRLTQDVKVFQ
        S H PTGAG DEAA+RN+ SRKK K   PI  S DF+S  STT +TV  C  + S+T  I ++F HSLIA VVGK+ RP +LA  L  HLRLT+   VF+
Subjt:  SGHPPTGAGDDEAAARNYLSRKKPKVPPPIPPSSDFHSRRSTTIATVCNCNLTPSETTRITQQFVHSLIARVVGKDTRPGQLAARLRHHLRLTQDVKVFQ

Query:  LGLGYFVLKFSETDYLALEDLPWSIPNLCIHAFPWTPDFKPSEAINSSVNVWIRLPELSIEYYDVGILKRIADAIGGPLVKIDPVTRDRWKCKFARFCIS
        LGLGYFVLKF ETD+LALED PW IPNLCI+AFPWTP+FKPSEA++S+++ WIRL EL IEYY   IL+ I   +G  LVKIDP+T+DR KCK+AR C+ 
Subjt:  LGLGYFVLKFSETDYLALEDLPWSIPNLCIHAFPWTPDFKPSEAINSSVNVWIRLPELSIEYYDVGILKRIADAIGGPLVKIDPVTRDRWKCKFARFCIS

Query:  VNLCDPLPSMIELGRIRQRIEYEGFE-LCAKCNRVGDLRHDCSSLNNPSLNNPSLNNPSGSYGFNPHGDEPHHSVTRDFKEIGSTSNSKQPLIPESSPVS
        +N+ +PLPS I +G+I Q IEYEGF+ LC +C  V  L+HDC            LN+   S  F P  D P +            SNSKQPL+P  S V 
Subjt:  VNLCDPLPSMIELGRIRQRIEYEGFE-LCAKCNRVGDLRHDCSSLNNPSLNNPSLNNPSGSYGFNPHGDEPHHSVTRDFKEIGSTSNSKQPLIPESSPVS

Query:  AWESSRF-----IEKNPPLDLKLIDWPNLPKRESGKAGTGVRISSPRV----HVKDKESPKKKEKCETSVQRLPSLPKQ----------------CSTIT
        AW  SRF       K+P  +LK +  P++   E     T    SSP +     + ++   K+KEKC  S +  P+LPK+                 STI+
Subjt:  AWESSRF-----IEKNPPLDLKLIDWPNLPKRESGKAGTGVRISSPRV----HVKDKESPKKKEKCETSVQRLPSLPKQ----------------CSTIT

Query:  IKAPELKR------VVPSVVEDRLKDTKTINSTMIADHNSQP---PSPTASIPFLQP-SPASEATLKFLSDAILCLTRKEEICNSPSKVINDSSFPTVYT
        +   E K       + P   E+     +T  S  +  HN+QP    SP A+    QP SP+S+    F S  I     K++I ++ S+    +  P +YT
Subjt:  IKAPELKR------VVPSVVEDRLKDTKTINSTMIADHNSQP---PSPTASIPFLQP-SPASEATLKFLSDAILCLTRKEEICNSPSKVINDSSFPTVYT

Query:  IDPKKITSLNIALSEVQ---------TIELVPTMKGGDEGGVGPEVESGSEPCAKKILVWKFHVMDNAKLMRALKDLIQLHEPSIVLIFGNKISGVDTDK
        I P+ I S  + LSE           +I  V T + G         ++ S   +KK+L W F  MDN  L+  L  ++Q +EPSIV+IFG +I+    ++
Subjt:  IDPKKITSLNIALSEVQ---------TIELVPTMKGGDEGGVGPEVESGSEPCAKKILVWKFHVMDNAKLMRALKDLIQLHEPSIVLIFGNKISGVDTDK

Query:  VMRELAFCGSYSSKPDGYNGGVWLLLSKQDVQT---KVNSFSSQQVSASVTFHSETNVQSFSPSNAD-TKTSSGPWGSTFFYTST
        V+ +LAF GSY  K D Y+GGVWL + ++DVQT   +VNS+S+QQVSAS  F  E N      S  D  +TS   WG   F  ST
Subjt:  VMRELAFCGSYSSKPDGYNGGVWLLLSKQDVQT---KVNSFSSQQVSASVTFHSETNVQSFSPSNAD-TKTSSGPWGSTFFYTST

A0A6J1FN13 uncharacterized protein LOC111446932 isoform X23.2e-11946.58Show/hide
Query:  MALQSGH----PPTGAGDDEAAARNYLSRKKPKVPPPIPPSSDFHSRRSTTIATVCNCNLTPSETTRITQQFVHSLIARVVGKDTRPGQLAARLRHHLRL
        MA+QS H      TGAG+D AAA                         ST  ATV  CNLTPS+T RI QQF  SLI  VVGK   P QLA RLR +L L
Subjt:  MALQSGH----PPTGAGDDEAAARNYLSRKKPKVPPPIPPSSDFHSRRSTTIATVCNCNLTPSETTRITQQFVHSLIARVVGKDTRPGQLAARLRHHLRL

Query:  TQDVKVFQLGLGYFVLKFSET--DYLALEDLPWSIPNLCIHAFPWTPDFKPSEAINSSVNVWIRLPELSIEYYDVGILKRIADAIGGPLVKIDPVTRDRW
          D+ VF+LGLG+FVLKFS     Y ALE+ PWSIP+LCI+ FPW P+FKPSEA    V+VWIRLPELSIEYYD  +L++IA+ IGG LVKIDPVT  R 
Subjt:  TQDVKVFQLGLGYFVLKFSET--DYLALEDLPWSIPNLCIHAFPWTPDFKPSEAINSSVNVWIRLPELSIEYYDVGILKRIADAIGGPLVKIDPVTRDRW

Query:  KCKFARFCISVNLCDPLPSMIELGRIRQRIEYEGFE-LCAKCNRVGDLRHDCSSLNNPSLNNPSLNNPSGSYGFNPHGDEPHHSVTRDFKEIGST-----
        KC +AR CI +NL  PL    + G+  Q+I YEG + LC  C  V DL+HDC            L+N S S GF+PH    HHS  R  +  GS+     
Subjt:  KCKFARFCISVNLCDPLPSMIELGRIRQRIEYEGFE-LCAKCNRVGDLRHDCSSLNNPSLNNPSLNNPSGSYGFNPHGDEPHHSVTRDFKEIGST-----

Query:  -----------------SNSKQPLIPESSPVSAWESSRFIEKNPPLDLKLIDWPNLPKRESGKAGTGVRISSPRVHVKDKESPKKKEKCETSVQRLPSLP
                         SN K  LIP     ++   SRF      L+L L + P+LP  ES K              + KESP        S+   P L 
Subjt:  -----------------SNSKQPLIPESSPVSAWESSRFIEKNPPLDLKLIDWPNLPKRESGKAGTGVRISSPRVHVKDKESPKKKEKCETSVQRLPSLP

Query:  KQCSTITIKAPELKRVVPSVVED-RLKDTKTINSTMIADHNSQP-PSPTA--SIPFLQPSPASEATLKFLSDAILCLTRKEEICNSPSKVINDSSFPTVY
        KQ + I     +   + P V+ED + +  KT + T +A  N++P PS  A  SI  LQPS A EA LKF S AI   T ++ I N+PS+ I+  S PT+Y
Subjt:  KQCSTITIKAPELKRVVPSVVED-RLKDTKTINSTMIADHNSQP-PSPTA--SIPFLQPSPASEATLKFLSDAILCLTRKEEICNSPSKVINDSSFPTVY

Query:  TIDPKKITSLNIALSEV-----------QTIELVPTMKGGDEGGVGPEVESGSEPCAKKILVWKFHVMDNAKLMRALKDLIQLHEPSIVLIFGNKISGVD
        TIDP  ITSL I L E+             I +VPT +          V   +  C+KK+L W F   DNAKLMRALKDLIQLH+PSIVLIFG KISG D
Subjt:  TIDPKKITSLNIALSEV-----------QTIELVPTMKGGDEGGVGPEVESGSEPCAKKILVWKFHVMDNAKLMRALKDLIQLHEPSIVLIFGNKISGVD

Query:  TDKVMRELAFCGSYSSKPDGYNGGVWLLLSKQDVQTKVNSFSSQQVSASVTFHSETN
         D V+RELAF GSY  KPDGY GG WLLLS+QDVQ +V+S+S QQVSASV  HS+ N
Subjt:  TDKVMRELAFCGSYSSKPDGYNGGVWLLLSKQDVQTKVNSFSSQQVSASVTFHSETN

A0A6J1FU80 uncharacterized protein LOC111446932 isoform X15.4e-11946.43Show/hide
Query:  MALQSGH----PPTGAGDDEAAARNYLSRKKPKVPPPIPPSSDFHSRRSTTIATVCNCNLTPSETTRITQQFVHSLIARVVGKDTRPGQLAARLRHHLRL
        MA+QS H      TGAG+D AAA                         ST  ATV  CNLTPS+T RI QQF  SLI  VVGK   P QLA RLR +L L
Subjt:  MALQSGH----PPTGAGDDEAAARNYLSRKKPKVPPPIPPSSDFHSRRSTTIATVCNCNLTPSETTRITQQFVHSLIARVVGKDTRPGQLAARLRHHLRL

Query:  TQDVKVFQLGLGYFVLKFSET--DYLALEDLPWSIPNLCIHAFPWTPDFKPSEAINSSVNVWIRLPELSIEYYDVGILKRIADAIGGPLVKIDPVTRDRW
          D+ VF+LGLG+FVLKFS     Y ALE+ PWSIP+LCI+ FPW P+FKPSEA    V+VWIRLPELSIEYYD  +L++IA+ IGG LVKIDPVT  R 
Subjt:  TQDVKVFQLGLGYFVLKFSET--DYLALEDLPWSIPNLCIHAFPWTPDFKPSEAINSSVNVWIRLPELSIEYYDVGILKRIADAIGGPLVKIDPVTRDRW

Query:  KCKFARFCISVNLCDPLPSMIELGRIRQRIEYEGFE-LCAKCNRVGDLRHDCSSLNNPSLNNPSLNNPSGSYGFNPHGDEPHHSVTRDFKEIGST-----
        KC +AR CI +NL  PL    + G+  Q+I YEG + LC  C  V DL+HDC            L+N S S GF+PH    HHS  R  +  GS+     
Subjt:  KCKFARFCISVNLCDPLPSMIELGRIRQRIEYEGFE-LCAKCNRVGDLRHDCSSLNNPSLNNPSLNNPSGSYGFNPHGDEPHHSVTRDFKEIGST-----

Query:  -------------------SNSKQPLIPESSPVSAWESSRFIEKNPPLDLKLIDWPNLPKRESGKAGTGVRISSPRVHVKDKESPKKKEKCETSVQRLPS
                           SN K  LIP     ++   SRF      L+L L + P+LP  ES K              + KESP        S+   P 
Subjt:  -------------------SNSKQPLIPESSPVSAWESSRFIEKNPPLDLKLIDWPNLPKRESGKAGTGVRISSPRVHVKDKESPKKKEKCETSVQRLPS

Query:  LPKQCSTITIKAPELKRVVPSVVED-RLKDTKTINSTMIADHNSQP-PSPTA--SIPFLQPSPASEATLKFLSDAILCLTRKEEICNSPSKVINDSSFPT
        L KQ + I     +   + P V+ED + +  KT + T +A  N++P PS  A  SI  LQPS A EA LKF S AI   T ++ I N+PS+ I+  S PT
Subjt:  LPKQCSTITIKAPELKRVVPSVVED-RLKDTKTINSTMIADHNSQP-PSPTA--SIPFLQPSPASEATLKFLSDAILCLTRKEEICNSPSKVINDSSFPT

Query:  VYTIDPKKITSLNIALSEV-----------QTIELVPTMKGGDEGGVGPEVESGSEPCAKKILVWKFHVMDNAKLMRALKDLIQLHEPSIVLIFGNKISG
        +YTIDP  ITSL I L E+             I +VPT +          V   +  C+KK+L W F   DNAKLMRALKDLIQLH+PSIVLIFG KISG
Subjt:  VYTIDPKKITSLNIALSEV-----------QTIELVPTMKGGDEGGVGPEVESGSEPCAKKILVWKFHVMDNAKLMRALKDLIQLHEPSIVLIFGNKISG

Query:  VDTDKVMRELAFCGSYSSKPDGYNGGVWLLLSKQDVQTKVNSFSSQQVSASVTFHSETN
         D D V+RELAF GSY  KPDGY GG WLLLS+QDVQ +V+S+S QQVSASV  HS+ N
Subjt:  VDTDKVMRELAFCGSYSSKPDGYNGGVWLLLSKQDVQTKVNSFSSQQVSASVTFHSETN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G01050.1 zinc ion binding;nucleic acid binding2.0e-1731.07Show/hide
Query:  LIARVVGKDTRPGQLAARLRHHLRLTQDVKVFQLGLGYFVLKFS-ETDYL-ALEDLPWSIPNLCIHAFPWTPDFKPSEAINSSVNVWIRLPELSIEYYDV
        +I +V+G       L  +LR   + +  + V  L   +F+++F  E +Y+ AL   PW +    +    W+  F P      +  VW+RL  +   YY  
Subjt:  LIARVVGKDTRPGQLAARLRHHLRLTQDVKVFQLGLGYFVLKFS-ETDYL-ALEDLPWSIPNLCIHAFPWTPDFKPSEAINSSVNVWIRLPELSIEYYDV

Query:  GILKRIADAIGGPLVKIDPVTRDRWKCKFARFCISVNLCDPLPSMIELGRIRQRIEYEGF-ELCAKCNRVGDLRHDC
         +L  IA  +G PL K+D  T +  K +FAR CI VNL  PL   + +   R  + YEG  ++C+ C   G L H C
Subjt:  GILKRIADAIGGPLVKIDPVTRDRWKCKFARFCISVNLCDPLPSMIELGRIRQRIEYEGF-ELCAKCNRVGDLRHDC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGCTTCAATCCGGTCACCCACCCACCGGCGCCGGCGACGATGAAGCGGCTGCCAGAAACTATCTCAGCAGGAAGAAGCCCAAAGTACCACCCCCGATTCCTCCCTC
ATCGGACTTCCATTCTCGCCGTTCAACCACCATCGCTACTGTCTGTAACTGTAACCTCACTCCCTCTGAAACCACTCGTATTACTCAACAGTTCGTTCACTCTCTCATAG
CTCGGGTTGTCGGAAAGGACACCCGTCCAGGTCAACTCGCCGCTCGCCTTCGCCATCATCTTCGTCTAACCCAAGATGTGAAGGTCTTCCAGCTAGGTCTCGGCTATTTC
GTTCTTAAATTCTCTGAAACTGATTATTTAGCTCTAGAAGACCTTCCTTGGTCAATTCCTAATCTCTGTATCCACGCCTTTCCGTGGACTCCCGATTTCAAACCCTCTGA
AGCCATCAATTCTTCTGTTAATGTTTGGATCCGTCTCCCTGAGCTTTCCATCGAGTATTACGACGTGGGAATTCTGAAACGGATTGCGGACGCCATTGGTGGTCCTCTTG
TTAAGATCGATCCTGTTACTAGAGATCGCTGGAAATGCAAGTTTGCTCGCTTTTGTATTTCGGTAAATTTGTGCGATCCTCTTCCATCGATGATCGAACTTGGTAGAATT
CGACAAAGGATTGAGTATGAGGGTTTTGAGTTGTGTGCTAAGTGTAATCGTGTTGGTGATTTGAGACATGATTGTTCGAGTTTGAATAATCCGAGTCTGAATAATCCGAG
TTTGAACAATCCTTCTGGTTCTTATGGATTCAATCCCCATGGAGATGAACCCCATCACTCTGTAACTCGAGATTTTAAGGAAATTGGGTCAACTTCAAATTCAAAGCAGC
CACTGATTCCTGAATCTTCACCGGTATCAGCTTGGGAATCTTCTAGATTCATTGAAAAGAATCCACCGCTAGATTTGAAGTTGATTGATTGGCCTAACCTTCCAAAGAGG
GAATCTGGAAAAGCAGGTACTGGTGTAAGAATAAGTTCTCCACGTGTCCATGTGAAGGATAAAGAAAGTCCAAAGAAAAAGGAGAAATGTGAAACCTCTGTTCAACGGTT
GCCCAGCTTGCCAAAACAATGTTCAACGATAACCATTAAAGCTCCTGAGTTAAAACGTGTAGTCCCTTCTGTTGTTGAGGATCGGTTAAAGGATACAAAAACCATCAACT
CCACCATGATTGCAGACCATAATAGCCAACCACCATCACCAACCGCAAGCATTCCGTTCCTACAACCATCTCCTGCTTCCGAGGCTACTCTCAAGTTCCTTTCGGATGCA
ATCCTATGCTTGACAAGAAAGGAAGAGATATGCAACTCGCCATCTAAAGTGATCAACGACAGCAGTTTCCCCACTGTTTACACGATCGACCCAAAGAAGATCACAAGCCT
TAACATTGCTTTGTCTGAAGTGCAAACAATCGAGTTAGTTCCAACTATGAAAGGTGGAGATGAAGGTGGAGTTGGTCCGGAGGTGGAGTCAGGATCAGAACCATGTGCTA
AGAAGATATTGGTCTGGAAGTTTCACGTGATGGACAATGCCAAGCTAATGCGAGCATTGAAGGATTTGATTCAGCTACATGAGCCATCCATTGTGCTGATCTTTGGCAAC
AAGATCAGTGGTGTTGATACGGATAAGGTCATGCGGGAACTTGCCTTCTGCGGCTCTTACAGCAGCAAGCCTGATGGCTACAATGGTGGTGTTTGGTTGTTATTGTCTAA
GCAAGATGTGCAAACTAAAGTTAACTCATTTAGCTCACAGCAGGTTTCTGCATCAGTAACTTTTCATTCTGAAACCAATGTACAATCTTTTAGTCCTTCAAATGCAGATA
CCAAAACATCTTCCGGACCGTGGGGATCGACTTTCTTCTATACTTCAACGAACTGGATGACTTCATTGGCATATTGA
mRNA sequenceShow/hide mRNA sequence
ACAAATTTATCTTTTGAAAAGGTCAATAAATGGAAATTTTCCATTTCTTCATCTTCAAGTTTCGGTGTTTCACTTTCAATCCCTTTCCTCCAACGTTCATCGTCACTTCC
TCATCCAACAATGGCGCTTCAATCCGGTCACCCACCCACCGGCGCCGGCGACGATGAAGCGGCTGCCAGAAACTATCTCAGCAGGAAGAAGCCCAAAGTACCACCCCCGA
TTCCTCCCTCATCGGACTTCCATTCTCGCCGTTCAACCACCATCGCTACTGTCTGTAACTGTAACCTCACTCCCTCTGAAACCACTCGTATTACTCAACAGTTCGTTCAC
TCTCTCATAGCTCGGGTTGTCGGAAAGGACACCCGTCCAGGTCAACTCGCCGCTCGCCTTCGCCATCATCTTCGTCTAACCCAAGATGTGAAGGTCTTCCAGCTAGGTCT
CGGCTATTTCGTTCTTAAATTCTCTGAAACTGATTATTTAGCTCTAGAAGACCTTCCTTGGTCAATTCCTAATCTCTGTATCCACGCCTTTCCGTGGACTCCCGATTTCA
AACCCTCTGAAGCCATCAATTCTTCTGTTAATGTTTGGATCCGTCTCCCTGAGCTTTCCATCGAGTATTACGACGTGGGAATTCTGAAACGGATTGCGGACGCCATTGGT
GGTCCTCTTGTTAAGATCGATCCTGTTACTAGAGATCGCTGGAAATGCAAGTTTGCTCGCTTTTGTATTTCGGTAAATTTGTGCGATCCTCTTCCATCGATGATCGAACT
TGGTAGAATTCGACAAAGGATTGAGTATGAGGGTTTTGAGTTGTGTGCTAAGTGTAATCGTGTTGGTGATTTGAGACATGATTGTTCGAGTTTGAATAATCCGAGTCTGA
ATAATCCGAGTTTGAACAATCCTTCTGGTTCTTATGGATTCAATCCCCATGGAGATGAACCCCATCACTCTGTAACTCGAGATTTTAAGGAAATTGGGTCAACTTCAAAT
TCAAAGCAGCCACTGATTCCTGAATCTTCACCGGTATCAGCTTGGGAATCTTCTAGATTCATTGAAAAGAATCCACCGCTAGATTTGAAGTTGATTGATTGGCCTAACCT
TCCAAAGAGGGAATCTGGAAAAGCAGGTACTGGTGTAAGAATAAGTTCTCCACGTGTCCATGTGAAGGATAAAGAAAGTCCAAAGAAAAAGGAGAAATGTGAAACCTCTG
TTCAACGGTTGCCCAGCTTGCCAAAACAATGTTCAACGATAACCATTAAAGCTCCTGAGTTAAAACGTGTAGTCCCTTCTGTTGTTGAGGATCGGTTAAAGGATACAAAA
ACCATCAACTCCACCATGATTGCAGACCATAATAGCCAACCACCATCACCAACCGCAAGCATTCCGTTCCTACAACCATCTCCTGCTTCCGAGGCTACTCTCAAGTTCCT
TTCGGATGCAATCCTATGCTTGACAAGAAAGGAAGAGATATGCAACTCGCCATCTAAAGTGATCAACGACAGCAGTTTCCCCACTGTTTACACGATCGACCCAAAGAAGA
TCACAAGCCTTAACATTGCTTTGTCTGAAGTGCAAACAATCGAGTTAGTTCCAACTATGAAAGGTGGAGATGAAGGTGGAGTTGGTCCGGAGGTGGAGTCAGGATCAGAA
CCATGTGCTAAGAAGATATTGGTCTGGAAGTTTCACGTGATGGACAATGCCAAGCTAATGCGAGCATTGAAGGATTTGATTCAGCTACATGAGCCATCCATTGTGCTGAT
CTTTGGCAACAAGATCAGTGGTGTTGATACGGATAAGGTCATGCGGGAACTTGCCTTCTGCGGCTCTTACAGCAGCAAGCCTGATGGCTACAATGGTGGTGTTTGGTTGT
TATTGTCTAAGCAAGATGTGCAAACTAAAGTTAACTCATTTAGCTCACAGCAGGTTTCTGCATCAGTAACTTTTCATTCTGAAACCAATGTACAATCTTTTAGTCCTTCA
AATGCAGATACCAAAACATCTTCCGGACCGTGGGGATCGACTTTCTTCTATACTTCAACGAACTGGATGACTTCATTGGCATATTGATGAAGTAAAGTAGGAAAGTAAAT
AAGTATTAGATATCATCTTCTTGGAGATCTATAATCTCAAGAACTGTATTGCTTGTTGATAATATGCTAAGAATGGCTGGAAAGACAGGCATCAATGTGTGAGGGGGTAT
ATTATGTTGTGATCTTGTAACTTTGTTGTTGATATAATGGATTATTTTGAGGTGGAC
Protein sequenceShow/hide protein sequence
MALQSGHPPTGAGDDEAAARNYLSRKKPKVPPPIPPSSDFHSRRSTTIATVCNCNLTPSETTRITQQFVHSLIARVVGKDTRPGQLAARLRHHLRLTQDVKVFQLGLGYF
VLKFSETDYLALEDLPWSIPNLCIHAFPWTPDFKPSEAINSSVNVWIRLPELSIEYYDVGILKRIADAIGGPLVKIDPVTRDRWKCKFARFCISVNLCDPLPSMIELGRI
RQRIEYEGFELCAKCNRVGDLRHDCSSLNNPSLNNPSLNNPSGSYGFNPHGDEPHHSVTRDFKEIGSTSNSKQPLIPESSPVSAWESSRFIEKNPPLDLKLIDWPNLPKR
ESGKAGTGVRISSPRVHVKDKESPKKKEKCETSVQRLPSLPKQCSTITIKAPELKRVVPSVVEDRLKDTKTINSTMIADHNSQPPSPTASIPFLQPSPASEATLKFLSDA
ILCLTRKEEICNSPSKVINDSSFPTVYTIDPKKITSLNIALSEVQTIELVPTMKGGDEGGVGPEVESGSEPCAKKILVWKFHVMDNAKLMRALKDLIQLHEPSIVLIFGN
KISGVDTDKVMRELAFCGSYSSKPDGYNGGVWLLLSKQDVQTKVNSFSSQQVSASVTFHSETNVQSFSPSNADTKTSSGPWGSTFFYTSTNWMTSLAY