; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC02G031090 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC02G031090
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
DescriptionDUF4283 domain-containing protein
Genome locationCicolChr02:26663617..26665560
RNA-Seq ExpressionCcUC02G031090
SyntenyCcUC02G031090
Gene Ontology termsNA
InterPro domainsIPR025558 - Domain of unknown function DUF4283
IPR040256 - Uncharacterized protein At4g02000-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0034062.1 uncharacterized protein E6C27_scaffold65G00480 [Cucumis melo var. makuwa]3.4e-29380.77Show/hide
Query:  MELQSDNQLTGAGDDEAAARNYLSRKKPKGPPPMPSSSDFESRRSTTIATVCNCKLSPSDTTRITQQFNHSLIARIVGKDIRPGQLAARLRHHLRLTEDV
        M LQSD+  TGAGDDEAAARNYLSRKKPK PPP+  SSDFESR STTIATVCNC L+PS+TTRITQQF HSLIAR+VGKD RPGQLAARLRHHLRLT+DV
Subjt:  MELQSDNQLTGAGDDEAAARNYLSRKKPKGPPPMPSSSDFESRRSTTIATVCNCKLSPSDTTRITQQFNHSLIARIVGKDIRPGQLAARLRHHLRLTEDV

Query:  KVFELGLRYFVLKFSETDYLALEDLPWSIPNLCIHAFPWTPDFKPSEAINSSVNVWIRLPELSIEYYDVEILNRIAEAIGGCLLKIDPVTRDRWKCKFAR
        KVFELGL YFVLKFSETDYLALEDLPWSIPNLCIHAFPWTPDFKPSEAINSSVNVWIRLPELSIEYYDVEIL RIA+AIGG L+KIDPVTRDRWKCKFAR
Subjt:  KVFELGLRYFVLKFSETDYLALEDLPWSIPNLCIHAFPWTPDFKPSEAINSSVNVWIRLPELSIEYYDVEILNRIAEAIGGCLLKIDPVTRDRWKCKFAR

Query:  FCIRVNLCDPLPSMIELGTIRQRIEYEGFELCPKCSHVGDLGHNCLGLSNPSGSSGVNPHGDKPHHPGTRSFKEFGWSSSSKQPLIPFESSPVSAWE-SK
        FCI VNLCDPLPSMIELG IRQRIEYEGFELC KC+ VGDL H+C  L+NPSGS G NPHGD+PHH  TR FKEFG +SSSKQPLIP ESS VSAWE S+
Subjt:  FCIRVNLCDPLPSMIELGTIRQRIEYEGFELCPKCSHVGDLGHNCLGLSNPSGSSGVNPHGDKPHHPGTRSFKEFGWSSSSKQPLIPFESSPVSAWE-SK

Query:  FIGKDPQLDLKLIDLPKLPMGESRKAGTSVRISSPHVHVKDKATGKQKQKCGTLVQPLPNLPK-QSSTITIEAPELKHTVPSVVEDQSKASKTSNPTMIP
        FI K+PQLDLK I+ P LP  ES KAGTSVRISSPHVHVKDKA  K+K+KC   VQPLP+LPK QSSTITI+APELK  VPSVVEDQ K +KT N TMI 
Subjt:  FIGKDPQLDLKLIDLPKLPMGESRKAGTSVRISSPHVHVKDKATGKQKQKCGTLVQPLPNLPK-QSSTITIEAPELKHTVPSVVEDQSKASKTSNPTMIP

Query:  SHKSQPPSPTASIPSLQPCPASEATLKFHSDAIQCLTRKKEITNTLSKEINVDSCPTVYTLD-SKVTNIDIALSEVQTTSMTNQNQYTIELVPTMKGGDE
         H SQPPSPTASIP LQP PASEATLKF SDAI CLTRK+EI N+ SKE N  S PTVYT+D  K+T+++I+LSEVQTTSM+NQNQYTIELVPTMKGGD+
Subjt:  SHKSQPPSPTASIPSLQPCPASEATLKFHSDAIQCLTRKKEITNTLSKEINVDSCPTVYTLD-SKVTNIDIALSEVQTTSMTNQNQYTIELVPTMKGGDE

Query:  GGVGLEVESGSESCAKKILCWKVHGTENAKLMRALKDLIQLHEPSIVLIFGPKINGVDADQVVRELAFCSSYRSKPDGYNSGVWLLLSKQDVQTKINSYN
        GGVGLEVESGSE CAKK+L WK H  +NAKLMRALKDLIQLHEPSIVLIFG KI GVDA +V++ELAFC SY S+PDGYN GVWLLLSKQDVQTK+NSY+
Subjt:  GGVGLEVESGSESCAKKILCWKVHGTENAKLMRALKDLIQLHEPSIVLIFGPKINGVDADQVVRELAFCSSYRSKPDGYNSGVWLLLSKQDVQTKINSYN

Query:  PQQVSASVSFDAETNVPVFSPSDVDTETSSGPWGLPFFYTSNLPMTSLAY
        PQQVSASV+F +ETNV  FSPS+ DTETSSGPWG  FFYTS   MTSLAY
Subjt:  PQQVSASVSFDAETNVPVFSPSDVDTETSSGPWGLPFFYTSNLPMTSLAY

KAG6600114.1 hypothetical protein SDJN03_05347, partial [Cucurbita argyrosperma subsp. sororia]6.4e-21563.52Show/hide
Query:  QSDNQLTGAGDDEAAARNYLSRKKPKGPPPMPSSSDFESRRSTTIATVCNCKLSPSDTTRITQQFNHSLIARIVGKDIRPGQLAARLRHHLRLTEDVKVF
        QS+ Q TGAGDDEAAAR YLSRKK K  P M SSSD ES RSTT ATVCN  LSPS T RITQQF+HSLIA + G+DIRP QLA RLR HL LT+DV+VF
Subjt:  QSDNQLTGAGDDEAAARNYLSRKKPKGPPPMPSSSDFESRRSTTIATVCNCKLSPSDTTRITQQFNHSLIARIVGKDIRPGQLAARLRHHLRLTEDVKVF

Query:  ELGLRYFVLKFSETDYLALEDLPWSIPNLCIHAFPWTPDFKPSEAINSSVNVWIRLPELSIEYYDVEILNRIAEAIGGCLLKIDPVTRDRWKCKFARFCI
        ELGL YFVLKFSETDYLALEDLPWSIPNLCI+AF WTPDFKPSEAINSSV+VWIRL ELSIEYYD EIL +IA  IGG L+K DPVT++R KCKFAR CI
Subjt:  ELGLRYFVLKFSETDYLALEDLPWSIPNLCIHAFPWTPDFKPSEAINSSVNVWIRLPELSIEYYDVEILNRIAEAIGGCLLKIDPVTRDRWKCKFARFCI

Query:  RVNLCDPLPSMIELGTIRQRIEYEGFE-LCPKCSHVGDLGHNCLGLSNPSGSSGVNPHGDKP-HHPGTRSFKEFGWSSSSKQPLIPFESSPVSAWESKFI
        R+NLCDPLPSMI+LG I+Q+IEYEG + LCP C  V DL  NCL   NPSGSSG++  GD+P HH  TR   E G SSSSKQPLIP  SSP SA  S+F 
Subjt:  RVNLCDPLPSMIELGTIRQRIEYEGFE-LCPKCSHVGDLGHNCLGLSNPSGSSGVNPHGDKP-HHPGTRSFKEFGWSSSSKQPLIPFESSPVSAWESKFI

Query:  GKDPQLDLKLIDLPKLPMGESRKAGTSVRISSPHVHVKDKATGKQKQKCGTLVQPLPNLPKQSSTITIEAPELKHTVPSVVEDQSKASKTSNPTMIPSHK
          +  L   L+D       E  KA  S+RISSPHVHVKDKA  K K+ CG  V+ LP LPK+ ST T +APEL+   P+VVE Q K +KTSNPT+I  H 
Subjt:  GKDPQLDLKLIDLPKLPMGESRKAGTSVRISSPHVHVKDKATGKQKQKCGTLVQPLPNLPKQSSTITIEAPELKHTVPSVVEDQSKASKTSNPTMIPSHK

Query:  SQPPSPTASIPSLQPCPASEATLKFHSDAIQCLTRKKEITNTLSKEINVDSCPTVYTLDS-KVTNIDIALSEVQTTSMTNQNQYTIELVPTMKGGDEGGV
        +            QPC   +ATL F S  I+   ++KE+ +  SKEI VD CP V+T+++ K+ +  + LS +QT S+ N+N YT++ +PT +  DE G 
Subjt:  SQPPSPTASIPSLQPCPASEATLKFHSDAIQCLTRKKEITNTLSKEINVDSCPTVYTLDS-KVTNIDIALSEVQTTSMTNQNQYTIELVPTMKGGDEGGV

Query:  GLEVESGSESCAKKILCWKVHGTENAKLMRALKDLIQLHEPSIVLIFGPKINGVDADQVVRELAFCSSYRSKPDGYNSGVWLLLSKQDVQTKINSYNPQQ
        G +  SGSESC+KK+LCWK HGT+NA LM+ALKDLIQLHEPSIVLIFG KI+G +A+ VVREL+FC SY  KPDGYN GVWLLLS+QDVQ +++SY+PQQ
Subjt:  GLEVESGSESCAKKILCWKVHGTENAKLMRALKDLIQLHEPSIVLIFGPKINGVDADQVVRELAFCSSYRSKPDGYNSGVWLLLSKQDVQTKINSYNPQQ

Query:  VSASVSFDAETNVPVFSPSDVDTETSSGPWGLPFFYTSNLPMTSLAY
        VSASV F + TN P FSP +VDTETSSGPWG  FFYTS   M+S+AY
Subjt:  VSASVSFDAETNVPVFSPSDVDTETSSGPWGLPFFYTSNLPMTSLAY

KAG7030785.1 hypothetical protein SDJN02_04822, partial [Cucurbita argyrosperma subsp. argyrosperma]3.7e-21563.33Show/hide
Query:  QSDNQLTGAGDDEAAARNYLSRKKPKGPPPMPSSSDFESRRSTTIATVCNCKLSPSDTTRITQQFNHSLIARIVGKDIRPGQLAARLRHHLRLTEDVKVF
        QS+ Q TGAGDDEAAAR YLSRKK K  P M SSSD ES RSTT ATVCN  LSPS T RITQQF+HSLIA + G+DIRP QLA RLR HL LT+DV+VF
Subjt:  QSDNQLTGAGDDEAAARNYLSRKKPKGPPPMPSSSDFESRRSTTIATVCNCKLSPSDTTRITQQFNHSLIARIVGKDIRPGQLAARLRHHLRLTEDVKVF

Query:  ELGLRYFVLKFSETDYLALEDLPWSIPNLCIHAFPWTPDFKPSEAINSSVNVWIRLPELSIEYYDVEILNRIAEAIGGCLLKIDPVTRDRWKCKFARFCI
        ELGL YFVLKFSETDYLALEDLPWSIPNLCI+AF WTPDFKPSEAINSSV+VWIRL ELSIEYYD EIL +IA  IGG L+K DPVT++R KCKFAR CI
Subjt:  ELGLRYFVLKFSETDYLALEDLPWSIPNLCIHAFPWTPDFKPSEAINSSVNVWIRLPELSIEYYDVEILNRIAEAIGGCLLKIDPVTRDRWKCKFARFCI

Query:  RVNLCDPLPSMIELGTIRQRIEYEGFE-LCPKCSHVGDLGHNCLGLSNPSGSSGVNPHGDKP-HHPGTRSFKEFGWSSSSKQPLIPFESSPVSAWESKF-
        R+NLCDPLPSMI+LG I+Q+IEYEG + LCP C  V DL  NCL   NPSGSSG++  GD+P HH  TR   E G SSSSKQPLIP  SSP S   S+F 
Subjt:  RVNLCDPLPSMIELGTIRQRIEYEGFE-LCPKCSHVGDLGHNCLGLSNPSGSSGVNPHGDKP-HHPGTRSFKEFGWSSSSKQPLIPFESSPVSAWESKF-

Query:  -IGKDPQLDLKLIDLPKLPMGESRKAGTSVRISSPHVHVKDKATGKQKQKCGTLVQPLPNLPKQSSTITIEAPELKHTVPSVVEDQSKASKTSNPTMIPS
         +  D  LD            E  KA  S+RISSPHVHVKDKA  K K+ CG  V+ LP LPK+ ST T +APEL+   P+VVE Q K +KTSNPT+I  
Subjt:  -IGKDPQLDLKLIDLPKLPMGESRKAGTSVRISSPHVHVKDKATGKQKQKCGTLVQPLPNLPKQSSTITIEAPELKHTVPSVVEDQSKASKTSNPTMIPS

Query:  HKSQPPSPTASIPSLQPCPASEATLKFHSDAIQCLTRKKEITNTLSKEINVDSCPTVYTLDS-KVTNIDIALSEVQTTSMTNQNQYTIELVPTMKGGDEG
        H +            QPC   +ATL F S  I+  T++KE+ +  SKEI VD CP V+T+++ K+ +  + LS +QT S+ N+N YT++ +PT +  DE 
Subjt:  HKSQPPSPTASIPSLQPCPASEATLKFHSDAIQCLTRKKEITNTLSKEINVDSCPTVYTLDS-KVTNIDIALSEVQTTSMTNQNQYTIELVPTMKGGDEG

Query:  GVGLEVESGSESCAKKILCWKVHGTENAKLMRALKDLIQLHEPSIVLIFGPKINGVDADQVVRELAFCSSYRSKPDGYNSGVWLLLSKQDVQTKINSYNP
        G G +  SGSESC+KK+LCWK HGT+NA LM+ALKDLIQLHEPSIVLIFG KI+G +A+ VVREL+FC SY  KPDGYN GVWLLLS+QDVQ +++SY+P
Subjt:  GVGLEVESGSESCAKKILCWKVHGTENAKLMRALKDLIQLHEPSIVLIFGPKINGVDADQVVRELAFCSSYRSKPDGYNSGVWLLLSKQDVQTKINSYNP

Query:  QQVSASVSFDAETNVPVFSPSDVDTETSSGPWGLPFFYTSNLPMTSLAY
        QQVSASV F + TN P FSP +VDTETSSGPWG  FFYTS   M+S+AY
Subjt:  QQVSASVSFDAETNVPVFSPSDVDTETSSGPWGLPFFYTSNLPMTSLAY

KAG7031864.1 hypothetical protein SDJN02_05905, partial [Cucurbita argyrosperma subsp. argyrosperma]6.8e-20962.5Show/hide
Query:  MELQSDNQLTGAGDDEAAARNYLSRKKPKGPPPMPSSSDFESRRSTTIATVCNCKLSPSDTTRITQQFNHSLIARIVGKDIRPGQLAARLRHHLRLTEDV
        M LQSD QLTGAGDDEAAA                      + R+   ATV  C+LS S T RITQQF+HS IA I GKD+RP ++A+ LR HL LT  V
Subjt:  MELQSDNQLTGAGDDEAAARNYLSRKKPKGPPPMPSSSDFESRRSTTIATVCNCKLSPSDTTRITQQFNHSLIARIVGKDIRPGQLAARLRHHLRLTEDV

Query:  KVFELGLRYFVLKFSETDYLALEDLPWSIPNLCIHAFPWTPDFKPSEAINSSVNVWIRLPELSIEYYDVEILNRIAEAIGGCLLKIDPVTRDRWKCKFAR
        KVFELGL YFVLKF ETD+LAL+DLPWS+PNLCIH  PWTPDFKPSE I SSV+VW+RL ELSIEYYD E+L +IA AIGG L+KIDPVT++R KCKFAR
Subjt:  KVFELGLRYFVLKFSETDYLALEDLPWSIPNLCIHAFPWTPDFKPSEAINSSVNVWIRLPELSIEYYDVEILNRIAEAIGGCLLKIDPVTRDRWKCKFAR

Query:  FCIRVNLCDPLPSMIELGTIRQRIEYEGFE-LCPKCSHVGDLGHNCLGLSNPSGSSGVNPHGDKPHHPGTRSFKEFGWSSSSKQPLIPFESSPVSAWESK
         C+RVNLCDPLPSMI LG IRQ IEYEGFE LCP CS V  L HNCL L  PSG SG NPH  KPHH G RSF         KQPLIP ESS  S   S+
Subjt:  FCIRVNLCDPLPSMIELGTIRQRIEYEGFE-LCPKCSHVGDLGHNCLGLSNPSGSSGVNPHGDKPHHPGTRSFKEFGWSSSSKQPLIPFESSPVSAWESK

Query:  FIGKDPQLDLKLIDLPKLPMGESRKAGTSVRISSPHVHVKDKATGKQKQKCGTLVQPLPNLPKQSSTITIEAPELKHTVPSVVEDQSKASKTSNPTMIPS
        F      LDL   + P L  GES KAGTS+R SS  VHVKDKA  K+K+KCG  VQP   LPK+SS +TI             +DQ KA+KTSN    P+
Subjt:  FIGKDPQLDLKLIDLPKLPMGESRKAGTSVRISSPHVHVKDKATGKQKQKCGTLVQPLPNLPKQSSTITIEAPELKHTVPSVVEDQSKASKTSNPTMIPS

Query:  HKSQPPSPTASIPSLQPCPASEATLKFHSDAIQCLTRKKEITNTLSKEINVDSCPTVYTLDSKVTNIDIALSEVQTTSMTNQNQYTIELVPTMKGGDEGG
           QPPSPT S+P L PCPASEA L FHS AIQ  TR KEIT+  SKEINVDSCPTVYT+D K+  +DIALSE +TTSM+NQ QY IE VPT + GD+GG
Subjt:  HKSQPPSPTASIPSLQPCPASEATLKFHSDAIQCLTRKKEITNTLSKEINVDSCPTVYTLDSKVTNIDIALSEVQTTSMTNQNQYTIELVPTMKGGDEGG

Query:  VGLEVESGSESCAKKILCWKVHGTENAKLMRALKDLIQLHEPSIVLIFGPKINGVDADQVVRELAFCSSYRSKPDGYNSGVWLLLSKQDVQTKINSYNPQ
            V+SGSESC KKILCWK H T+N KL+R+LKDLI+LHEPSIVLIFG KI+G D D+VV+EL FC SY  KPDGY+ GVWLLLS QDV+TK+NS +PQ
Subjt:  VGLEVESGSESCAKKILCWKVHGTENAKLMRALKDLIQLHEPSIVLIFGPKINGVDADQVVRELAFCSSYRSKPDGYNSGVWLLLSKQDVQTKINSYNPQ

Query:  QVSASVSFDAETNVPVFSPSDVDTETSSGPWGLPFFYTSNLPMTSLAY
        Q+ AS+ FD++TN   F+PS V T+ SSGPWG PFF+T    MTS+AY
Subjt:  QVSASVSFDAETNVPVFSPSDVDTETSSGPWGLPFFYTSNLPMTSLAY

KGN50455.1 hypothetical protein Csa_000357 [Cucumis sativus]1.4e-27876.78Show/hide
Query:  MELQSDNQLTGAGDDEAAARNYLSRKKPKGPPPMPSSSDFESRRSTTIATVCNCKLSPSDTTRITQQFNHSLIARIVGKDIRPGQLAARLRHHLRLTEDV
        M  QS +  TGAGDDEAAARNYLSRKKPK PPP+P SSDF SRRSTTIATVCNC L+PS+TTRITQQF HSLIAR+VGKD RPGQLAARLRHHLRLT+DV
Subjt:  MELQSDNQLTGAGDDEAAARNYLSRKKPKGPPPMPSSSDFESRRSTTIATVCNCKLSPSDTTRITQQFNHSLIARIVGKDIRPGQLAARLRHHLRLTEDV

Query:  KVFELGLRYFVLKFSETDYLALEDLPWSIPNLCIHAFPWTPDFKPSEAINSSVNVWIRLPELSIEYYDVEILNRIAEAIGGCLLKIDPVTRDRWKCKFAR
        KVF+LGL YFVLKFSETDYLALEDLPWSIPNLCIHAFPWTPDFKPSEAINSSVNVWIRLPELSIEYYDV IL RIA+AIG  L+KIDPVTRDRWKCKFAR
Subjt:  KVFELGLRYFVLKFSETDYLALEDLPWSIPNLCIHAFPWTPDFKPSEAINSSVNVWIRLPELSIEYYDVEILNRIAEAIGGCLLKIDPVTRDRWKCKFAR

Query:  FCIRVNLCDPLPSMIELGTIRQRIEYEGFELCPKCSHVGDLGHNCL----------GLSNPSGSSGVNPHGDKPHHPGTRSFKEFGWSSSSKQPLIPFES
        FCI VNLCDPLPSMIELG +RQRIEYEGFELC KC+ VGDL H+C            L+NPSGS G NPHGD+PHH  TR FKE G +S+SKQPLIP ES
Subjt:  FCIRVNLCDPLPSMIELGTIRQRIEYEGFELCPKCSHVGDLGHNCL----------GLSNPSGSSGVNPHGDKPHHPGTRSFKEFGWSSSSKQPLIPFES

Query:  SPVSAWE-SKFIGKDPQLDLKLIDLPKLPMGESRKAGTSVRISSPHVHVKDKATGKQKQKCGTLVQPLPNLPKQSSTITIEAPELKHTVPSVVEDQSKAS
        SPVSAWE S+FI K+P LDLKLID P LP  ES KAG+ VRISSP VHVKDK   K+K+KC   VQ LPNLPKQ STITI+APELK  VPSVVED+ K +
Subjt:  SPVSAWE-SKFIGKDPQLDLKLIDLPKLPMGESRKAGTSVRISSPHVHVKDKATGKQKQKCGTLVQPLPNLPKQSSTITIEAPELKHTVPSVVEDQSKAS

Query:  KTSNPTMIPSHKSQPPSPTASIPSLQPCPASEATLKFHSDAIQCLTRKKEITNTLSKEINVDSCPTVYTLD-SKVTNIDIALSEVQTTSMTNQNQYTIEL
        KT N TMI  H SQPPSPTASIP LQP PASEATLKF SDAI CLTRK+EI N+ SK IN  S PTVYT+D  K+T+++IALSEVQ          TIEL
Subjt:  KTSNPTMIPSHKSQPPSPTASIPSLQPCPASEATLKFHSDAIQCLTRKKEITNTLSKEINVDSCPTVYTLD-SKVTNIDIALSEVQTTSMTNQNQYTIEL

Query:  VPTMKGGDEGGVGLEVESGSESCAKKILCWKVHGTENAKLMRALKDLIQLHEPSIVLIFGPKINGVDADQVVRELAFCSSYRSKPDGYNSGVWLLLSKQD
        VPTMKGGDEGGVG EVESGSE CAKKIL WK H  +NAKLMRALKDLIQLHEPSIVLIFG KI+GVD D+V+RELAFC SY SKPDGYN GVWLLLSKQD
Subjt:  VPTMKGGDEGGVGLEVESGSESCAKKILCWKVHGTENAKLMRALKDLIQLHEPSIVLIFGPKINGVDADQVVRELAFCSSYRSKPDGYNSGVWLLLSKQD

Query:  VQTKINSYNPQQVSASVSFDAETNVPVFSPSDVDTETSSGPWGLPFFYTSNLPMTSLAY
        VQTK+NS++ QQVSASV+F +ETNV  FSPS+ DT+TSSGPWG  FFYTS   MTSLAY
Subjt:  VQTKINSYNPQQVSASVSFDAETNVPVFSPSDVDTETSSGPWGLPFFYTSNLPMTSLAY

TrEMBL top hitse value%identityAlignment
A0A0A0KLB0 DUF4283 domain-containing protein6.7e-27976.78Show/hide
Query:  MELQSDNQLTGAGDDEAAARNYLSRKKPKGPPPMPSSSDFESRRSTTIATVCNCKLSPSDTTRITQQFNHSLIARIVGKDIRPGQLAARLRHHLRLTEDV
        M  QS +  TGAGDDEAAARNYLSRKKPK PPP+P SSDF SRRSTTIATVCNC L+PS+TTRITQQF HSLIAR+VGKD RPGQLAARLRHHLRLT+DV
Subjt:  MELQSDNQLTGAGDDEAAARNYLSRKKPKGPPPMPSSSDFESRRSTTIATVCNCKLSPSDTTRITQQFNHSLIARIVGKDIRPGQLAARLRHHLRLTEDV

Query:  KVFELGLRYFVLKFSETDYLALEDLPWSIPNLCIHAFPWTPDFKPSEAINSSVNVWIRLPELSIEYYDVEILNRIAEAIGGCLLKIDPVTRDRWKCKFAR
        KVF+LGL YFVLKFSETDYLALEDLPWSIPNLCIHAFPWTPDFKPSEAINSSVNVWIRLPELSIEYYDV IL RIA+AIG  L+KIDPVTRDRWKCKFAR
Subjt:  KVFELGLRYFVLKFSETDYLALEDLPWSIPNLCIHAFPWTPDFKPSEAINSSVNVWIRLPELSIEYYDVEILNRIAEAIGGCLLKIDPVTRDRWKCKFAR

Query:  FCIRVNLCDPLPSMIELGTIRQRIEYEGFELCPKCSHVGDLGHNCL----------GLSNPSGSSGVNPHGDKPHHPGTRSFKEFGWSSSSKQPLIPFES
        FCI VNLCDPLPSMIELG +RQRIEYEGFELC KC+ VGDL H+C            L+NPSGS G NPHGD+PHH  TR FKE G +S+SKQPLIP ES
Subjt:  FCIRVNLCDPLPSMIELGTIRQRIEYEGFELCPKCSHVGDLGHNCL----------GLSNPSGSSGVNPHGDKPHHPGTRSFKEFGWSSSSKQPLIPFES

Query:  SPVSAWE-SKFIGKDPQLDLKLIDLPKLPMGESRKAGTSVRISSPHVHVKDKATGKQKQKCGTLVQPLPNLPKQSSTITIEAPELKHTVPSVVEDQSKAS
        SPVSAWE S+FI K+P LDLKLID P LP  ES KAG+ VRISSP VHVKDK   K+K+KC   VQ LPNLPKQ STITI+APELK  VPSVVED+ K +
Subjt:  SPVSAWE-SKFIGKDPQLDLKLIDLPKLPMGESRKAGTSVRISSPHVHVKDKATGKQKQKCGTLVQPLPNLPKQSSTITIEAPELKHTVPSVVEDQSKAS

Query:  KTSNPTMIPSHKSQPPSPTASIPSLQPCPASEATLKFHSDAIQCLTRKKEITNTLSKEINVDSCPTVYTLD-SKVTNIDIALSEVQTTSMTNQNQYTIEL
        KT N TMI  H SQPPSPTASIP LQP PASEATLKF SDAI CLTRK+EI N+ SK IN  S PTVYT+D  K+T+++IALSEVQ          TIEL
Subjt:  KTSNPTMIPSHKSQPPSPTASIPSLQPCPASEATLKFHSDAIQCLTRKKEITNTLSKEINVDSCPTVYTLD-SKVTNIDIALSEVQTTSMTNQNQYTIEL

Query:  VPTMKGGDEGGVGLEVESGSESCAKKILCWKVHGTENAKLMRALKDLIQLHEPSIVLIFGPKINGVDADQVVRELAFCSSYRSKPDGYNSGVWLLLSKQD
        VPTMKGGDEGGVG EVESGSE CAKKIL WK H  +NAKLMRALKDLIQLHEPSIVLIFG KI+GVD D+V+RELAFC SY SKPDGYN GVWLLLSKQD
Subjt:  VPTMKGGDEGGVGLEVESGSESCAKKILCWKVHGTENAKLMRALKDLIQLHEPSIVLIFGPKINGVDADQVVRELAFCSSYRSKPDGYNSGVWLLLSKQD

Query:  VQTKINSYNPQQVSASVSFDAETNVPVFSPSDVDTETSSGPWGLPFFYTSNLPMTSLAY
        VQTK+NS++ QQVSASV+F +ETNV  FSPS+ DT+TSSGPWG  FFYTS   MTSLAY
Subjt:  VQTKINSYNPQQVSASVSFDAETNVPVFSPSDVDTETSSGPWGLPFFYTSNLPMTSLAY

A0A5A7SSJ3 DUF4283 domain-containing protein1.7e-29380.77Show/hide
Query:  MELQSDNQLTGAGDDEAAARNYLSRKKPKGPPPMPSSSDFESRRSTTIATVCNCKLSPSDTTRITQQFNHSLIARIVGKDIRPGQLAARLRHHLRLTEDV
        M LQSD+  TGAGDDEAAARNYLSRKKPK PPP+  SSDFESR STTIATVCNC L+PS+TTRITQQF HSLIAR+VGKD RPGQLAARLRHHLRLT+DV
Subjt:  MELQSDNQLTGAGDDEAAARNYLSRKKPKGPPPMPSSSDFESRRSTTIATVCNCKLSPSDTTRITQQFNHSLIARIVGKDIRPGQLAARLRHHLRLTEDV

Query:  KVFELGLRYFVLKFSETDYLALEDLPWSIPNLCIHAFPWTPDFKPSEAINSSVNVWIRLPELSIEYYDVEILNRIAEAIGGCLLKIDPVTRDRWKCKFAR
        KVFELGL YFVLKFSETDYLALEDLPWSIPNLCIHAFPWTPDFKPSEAINSSVNVWIRLPELSIEYYDVEIL RIA+AIGG L+KIDPVTRDRWKCKFAR
Subjt:  KVFELGLRYFVLKFSETDYLALEDLPWSIPNLCIHAFPWTPDFKPSEAINSSVNVWIRLPELSIEYYDVEILNRIAEAIGGCLLKIDPVTRDRWKCKFAR

Query:  FCIRVNLCDPLPSMIELGTIRQRIEYEGFELCPKCSHVGDLGHNCLGLSNPSGSSGVNPHGDKPHHPGTRSFKEFGWSSSSKQPLIPFESSPVSAWE-SK
        FCI VNLCDPLPSMIELG IRQRIEYEGFELC KC+ VGDL H+C  L+NPSGS G NPHGD+PHH  TR FKEFG +SSSKQPLIP ESS VSAWE S+
Subjt:  FCIRVNLCDPLPSMIELGTIRQRIEYEGFELCPKCSHVGDLGHNCLGLSNPSGSSGVNPHGDKPHHPGTRSFKEFGWSSSSKQPLIPFESSPVSAWE-SK

Query:  FIGKDPQLDLKLIDLPKLPMGESRKAGTSVRISSPHVHVKDKATGKQKQKCGTLVQPLPNLPK-QSSTITIEAPELKHTVPSVVEDQSKASKTSNPTMIP
        FI K+PQLDLK I+ P LP  ES KAGTSVRISSPHVHVKDKA  K+K+KC   VQPLP+LPK QSSTITI+APELK  VPSVVEDQ K +KT N TMI 
Subjt:  FIGKDPQLDLKLIDLPKLPMGESRKAGTSVRISSPHVHVKDKATGKQKQKCGTLVQPLPNLPK-QSSTITIEAPELKHTVPSVVEDQSKASKTSNPTMIP

Query:  SHKSQPPSPTASIPSLQPCPASEATLKFHSDAIQCLTRKKEITNTLSKEINVDSCPTVYTLD-SKVTNIDIALSEVQTTSMTNQNQYTIELVPTMKGGDE
         H SQPPSPTASIP LQP PASEATLKF SDAI CLTRK+EI N+ SKE N  S PTVYT+D  K+T+++I+LSEVQTTSM+NQNQYTIELVPTMKGGD+
Subjt:  SHKSQPPSPTASIPSLQPCPASEATLKFHSDAIQCLTRKKEITNTLSKEINVDSCPTVYTLD-SKVTNIDIALSEVQTTSMTNQNQYTIELVPTMKGGDE

Query:  GGVGLEVESGSESCAKKILCWKVHGTENAKLMRALKDLIQLHEPSIVLIFGPKINGVDADQVVRELAFCSSYRSKPDGYNSGVWLLLSKQDVQTKINSYN
        GGVGLEVESGSE CAKK+L WK H  +NAKLMRALKDLIQLHEPSIVLIFG KI GVDA +V++ELAFC SY S+PDGYN GVWLLLSKQDVQTK+NSY+
Subjt:  GGVGLEVESGSESCAKKILCWKVHGTENAKLMRALKDLIQLHEPSIVLIFGPKINGVDADQVVRELAFCSSYRSKPDGYNSGVWLLLSKQDVQTKINSYN

Query:  PQQVSASVSFDAETNVPVFSPSDVDTETSSGPWGLPFFYTSNLPMTSLAY
        PQQVSASV+F +ETNV  FSPS+ DTETSSGPWG  FFYTS   MTSLAY
Subjt:  PQQVSASVSFDAETNVPVFSPSDVDTETSSGPWGLPFFYTSNLPMTSLAY

A0A5A7SY10 DUF4283 domain-containing protein3.3e-12446.63Show/hide
Query:  DDEAAARNYLSRKKPKGPPPMPSSSDFESRRSTTIATVCNCKLSPSDTTRITQQFNHSLIARIVGKDIRPGQLAARLRHHLRLTEDVKVFELGLRYFVLK
        DDE  +    SRKK K   PM SSS+      T         L+PS T RI Q F+HSLIAR+ G ++    LA RLR +LRLT D+ VFEL L +FVLK
Subjt:  DDEAAARNYLSRKKPKGPPPMPSSSDFESRRSTTIATVCNCKLSPSDTTRITQQFNHSLIARIVGKDIRPGQLAARLRHHLRLTEDVKVFELGLRYFVLK

Query:  FS-ETDYL-ALEDLPWSIPNLCIHAFPWTPDFKPSEAINSSVNVWIRLPELSIEYYDVEILNRIAEAIGGCLLKIDPVTRDRWKCKFARFCIRVNLCDPL
        FS  +DY  ALE+LPWSI +LCIH  PW P+FKPSEA+   V+VWIRLPEL IEYYD EIL +IAEAIG CL+KIDPVT  R KC FAR CIR+ LC+PL
Subjt:  FS-ETDYL-ALEDLPWSIPNLCIHAFPWTPDFKPSEAINSSVNVWIRLPELSIEYYDVEILNRIAEAIGGCLLKIDPVTRDRWKCKFARFCIRVNLCDPL

Query:  PSMIELGTIRQRIEYEGFE-LCPKCSHVGDLGHNCLGLSNPSGSSGVNPHGDKP-----HHPGTRSFKEFGWSSSSKQPLIPFESSPVSAWESKFIGKDP
           I+ G   Q+++YEG + LC  C  + +L H CL  +NPSGSSG +PH   P       P + S    G +  SK+PLI    S  SA  SK   K+P
Subjt:  PSMIELGTIRQRIEYEGFE-LCPKCSHVGDLGHNCLGLSNPSGSSGVNPHGDKP-----HHPGTRSFKEFGWSSSSKQPLIPFESSPVSAWESKFIGKDP

Query:  QLDLKLIDLPKLPMGESRKAGTSVRISSPHVHVKDKATGKQKQKCGTLVQPLPNLPKQSSTITIEAPELKHTVPSVVEDQSKASKTSNPTMIPSHKSQPP
         L+LKL D PKL MG                    K    +K       + LPN P++SST T E PE       +V DQ +A+K S+PT +P   +   
Subjt:  QLDLKLIDLPKLPMGESRKAGTSVRISSPHVHVKDKATGKQKQKCGTLVQPLPNLPKQSSTITIEAPELKHTVPSVVEDQSKASKTSNPTMIPSHKSQPP

Query:  SPTASIPSLQPCPASEATLKFHSDAIQCLTRKKEITNTLSKEIN-VDSCPTVYTLDSKVTNIDIALSEVQTTSMTNQNQYTIELVPTMKGGDEGGVGLEV
        S +          A EA +   S AIQ    +K++ NT    I  VDS PTVYT+D    ++ I  SEV T + +NQN+Y I  V   +  ++  V  + 
Subjt:  SPTASIPSLQPCPASEATLKFHSDAIQCLTRKKEITNTLSKEIN-VDSCPTVYTLDSKVTNIDIALSEVQTTSMTNQNQYTIELVPTMKGGDEGGVGLEV

Query:  ESGSESCAKKILCWKVHGTENAKLMRALKDLIQLHEPSIVLIFGPKINGVDADQVVRELAFCSSYRSKPDGYNSGVWLLLSKQDVQTKINSYNPQQVSAS
         S    C KK+LCW   G + AKL++A K LI+L EPSIVLIFG KI+  DA++VVRELAF  SY  KPDGYN GVW++LS QDV+ +++SY+PQ+VSAS
Subjt:  ESGSESCAKKILCWKVHGTENAKLMRALKDLIQLHEPSIVLIFGPKINGVDADQVVRELAFCSSYRSKPDGYNSGVWLLLSKQDVQTKINSYNPQQVSAS

Query:  VSFDAETNVPVFSPSDVDTETSSG
        V F ++ N P     D DTETS G
Subjt:  VSFDAETNVPVFSPSDVDTETSSG

A0A6J1FN13 uncharacterized protein LOC111446932 isoform X22.3e-12547.17Show/hide
Query:  TGAGDDEAAARNYLSRKKPKGPPPMPSSSDFESRRSTTIATVCNCKLSPSDTTRITQQFNHSLIARIVGKDIRPGQLAARLRHHLRLTEDVKVFELGLRY
        TGAG+D AAA                         ST  ATVCN  L+PS T RI QQF+ SLI  +VGK I P QLA RLR +L L  D+ VFELGL +
Subjt:  TGAGDDEAAARNYLSRKKPKGPPPMPSSSDFESRRSTTIATVCNCKLSPSDTTRITQQFNHSLIARIVGKDIRPGQLAARLRHHLRLTEDVKVFELGLRY

Query:  FVLKFSET--DYLALEDLPWSIPNLCIHAFPWTPDFKPSEAINSSVNVWIRLPELSIEYYDVEILNRIAEAIGGCLLKIDPVTRDRWKCKFARFCIRVNL
        FVLKFS     Y ALE+ PWSIP+LCI+ FPW P+FKPSEA    V+VWIRLPELSIEYYD E+L +IAE IGG L+KIDPVT  R KC +AR CIR+NL
Subjt:  FVLKFSET--DYLALEDLPWSIPNLCIHAFPWTPDFKPSEAINSSVNVWIRLPELSIEYYDVEILNRIAEAIGGCLLKIDPVTRDRWKCKFARFCIRVNL

Query:  CDPLPSMIELGTIRQRIEYEGFE-LCPKCSHVGDLGHNCLGLSNPSGSSGVNPHGDKPHHPGTRSFKEFGWSSSS----------------------KQP
          PL    + G   Q+I YEG + LC  C  V DL H+C  LSN S SSG +     PHH   R  +  G S SS                      K  
Subjt:  CDPLPSMIELGTIRQRIEYEGFE-LCPKCSHVGDLGHNCLGLSNPSGSSGVNPHGDKPHHPGTRSFKEFGWSSSS----------------------KQP

Query:  LIPFESSPVSAWESKFIGKDPQLDLKLIDLPKLPMGESRKAGTSVRISSPHVHVKDKATGKQKQKCGTLVQPLPNLPKQSSTITIEAPELKHTVPSVVED
        LIP + +P SA  S+F      L+L L + P LP+ ES K           V      T     K   L+Q +P                    P V+ED
Subjt:  LIPFESSPVSAWESKFIGKDPQLDLKLIDLPKLPMGESRKAGTSVRISSPHVHVKDKATGKQKQKCGTLVQPLPNLPKQSSTITIEAPELKHTVPSVVED

Query:  -QSKASKTSNPTMIPSHKSQP-PSPTA--SIPSLQPCPASEATLKFHSDAIQCLTRKKEITNTLSKEINVDSCPTVYTLDSKVTNIDIALSEVQ-TTSMT
         Q +  KTS+PT +    ++P PS  A  SI  LQP  A EA LKF+S AIQ  T +K I NT S+ I+VDS PT+YT+D  +T++ I L E+  TT+ +
Subjt:  -QSKASKTSNPTMIPSHKSQP-PSPTA--SIPSLQPCPASEATLKFHSDAIQCLTRKKEITNTLSKEINVDSCPTVYTLDSKVTNIDIALSEVQ-TTSMT

Query:  NQNQYTIELVPTMKGGDEGGVGLEVESGSESCAKKILCWKVHGTENAKLMRALKDLIQLHEPSIVLIFGPKINGVDADQVVRELAFCSSYRSKPDGYNSG
        NQN++ I +VPT +          V   +  C+KK+LCW    T+NAKLMRALKDLIQLH+PSIVLIFG KI+G DAD VVRELAF  SY  KPDGY  G
Subjt:  NQNQYTIELVPTMKGGDEGGVGLEVESGSESCAKKILCWKVHGTENAKLMRALKDLIQLHEPSIVLIFGPKINGVDADQVVRELAFCSSYRSKPDGYNSG

Query:  VWLLLSKQDVQTKINSYNPQQVSASVSFDAETNVPV
         WLLLS+QDVQ +++SY+PQQVSASV   ++ N  V
Subjt:  VWLLLSKQDVQTKINSYNPQQVSASVSFDAETNVPV

A0A6J1FU80 uncharacterized protein LOC111446932 isoform X13.9e-12547.02Show/hide
Query:  TGAGDDEAAARNYLSRKKPKGPPPMPSSSDFESRRSTTIATVCNCKLSPSDTTRITQQFNHSLIARIVGKDIRPGQLAARLRHHLRLTEDVKVFELGLRY
        TGAG+D AAA                         ST  ATVCN  L+PS T RI QQF+ SLI  +VGK I P QLA RLR +L L  D+ VFELGL +
Subjt:  TGAGDDEAAARNYLSRKKPKGPPPMPSSSDFESRRSTTIATVCNCKLSPSDTTRITQQFNHSLIARIVGKDIRPGQLAARLRHHLRLTEDVKVFELGLRY

Query:  FVLKFSET--DYLALEDLPWSIPNLCIHAFPWTPDFKPSEAINSSVNVWIRLPELSIEYYDVEILNRIAEAIGGCLLKIDPVTRDRWKCKFARFCIRVNL
        FVLKFS     Y ALE+ PWSIP+LCI+ FPW P+FKPSEA    V+VWIRLPELSIEYYD E+L +IAE IGG L+KIDPVT  R KC +AR CIR+NL
Subjt:  FVLKFSET--DYLALEDLPWSIPNLCIHAFPWTPDFKPSEAINSSVNVWIRLPELSIEYYDVEILNRIAEAIGGCLLKIDPVTRDRWKCKFARFCIRVNL

Query:  CDPLPSMIELGTIRQRIEYEGFE-LCPKCSHVGDLGHNCLGLSNPSGSSGVNPHGDKPHHPGTRSFKEFGWSSSS------------------------K
          PL    + G   Q+I YEG + LC  C  V DL H+C  LSN S SSG +     PHH   R  +  G S SS                        K
Subjt:  CDPLPSMIELGTIRQRIEYEGFE-LCPKCSHVGDLGHNCLGLSNPSGSSGVNPHGDKPHHPGTRSFKEFGWSSSS------------------------K

Query:  QPLIPFESSPVSAWESKFIGKDPQLDLKLIDLPKLPMGESRKAGTSVRISSPHVHVKDKATGKQKQKCGTLVQPLPNLPKQSSTITIEAPELKHTVPSVV
          LIP + +P SA  S+F      L+L L + P LP+ ES K           V      T     K   L+Q +P                    P V+
Subjt:  QPLIPFESSPVSAWESKFIGKDPQLDLKLIDLPKLPMGESRKAGTSVRISSPHVHVKDKATGKQKQKCGTLVQPLPNLPKQSSTITIEAPELKHTVPSVV

Query:  ED-QSKASKTSNPTMIPSHKSQP-PSPTA--SIPSLQPCPASEATLKFHSDAIQCLTRKKEITNTLSKEINVDSCPTVYTLDSKVTNIDIALSEVQ-TTS
        ED Q +  KTS+PT +    ++P PS  A  SI  LQP  A EA LKF+S AIQ  T +K I NT S+ I+VDS PT+YT+D  +T++ I L E+  TT+
Subjt:  ED-QSKASKTSNPTMIPSHKSQP-PSPTA--SIPSLQPCPASEATLKFHSDAIQCLTRKKEITNTLSKEINVDSCPTVYTLDSKVTNIDIALSEVQ-TTS

Query:  MTNQNQYTIELVPTMKGGDEGGVGLEVESGSESCAKKILCWKVHGTENAKLMRALKDLIQLHEPSIVLIFGPKINGVDADQVVRELAFCSSYRSKPDGYN
         +NQN++ I +VPT +          V   +  C+KK+LCW    T+NAKLMRALKDLIQLH+PSIVLIFG KI+G DAD VVRELAF  SY  KPDGY 
Subjt:  MTNQNQYTIELVPTMKGGDEGGVGLEVESGSESCAKKILCWKVHGTENAKLMRALKDLIQLHEPSIVLIFGPKINGVDADQVVRELAFCSSYRSKPDGYN

Query:  SGVWLLLSKQDVQTKINSYNPQQVSASVSFDAETNVPV
         G WLLLS+QDVQ +++SY+PQQVSASV   ++ N  V
Subjt:  SGVWLLLSKQDVQTKINSYNPQQVSASVSFDAETNVPV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G01050.1 zinc ion binding;nucleic acid binding5.9e-1730.51Show/hide
Query:  LIARIVGKDIRPGQLAARLRHHLRLTEDVKVFELGLRYFVLKFS-ETDYL-ALEDLPWSIPNLCIHAFPWTPDFKPSEAINSSVNVWIRLPELSIEYYDV
        +I +++G  I    L  +LR   + +  + V +L  ++F+++F  E +Y+ AL   PW +    +    W+  F P      +  VW+RL  +   YY  
Subjt:  LIARIVGKDIRPGQLAARLRHHLRLTEDVKVFELGLRYFVLKFS-ETDYL-ALEDLPWSIPNLCIHAFPWTPDFKPSEAINSSVNVWIRLPELSIEYYDV

Query:  EILNRIAEAIGGCLLKIDPVTRDRWKCKFARFCIRVNLCDPLPSMIELGTIRQRIEYEGF-ELCPKCSHVGDLGHNC
         +L  IA  +G   LK+D  T +  K +FAR CI VNL  PL   + +   R  + YEG  ++C  C   G L H+C
Subjt:  EILNRIAEAIGGCLLKIDPVTRDRWKCKFARFCIRVNLCDPLPSMIELGTIRQRIEYEGF-ELCPKCSHVGDLGHNC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGCTTCAATCCGACAACCAACTCACCGGCGCCGGCGACGACGAAGCGGCCGCTAGAAACTATCTCAGCAGGAAGAAGCCCAAGGGACCACCACCGATGCCT
TCCTCATCGGACTTCGAATCTCGCCGTTCAACCACCATCGCCACCGTTTGTAACTGTAAGCTCTCTCCCTCTGATACTACTCGTATCACTCAACAGTTCAATCAC
TCTCTCATAGCTCGGATCGTCGGAAAGGACATCCGTCCAGGCCAACTCGCCGCTCGCCTTCGCCATCATCTTCGTCTCACCGAAGATGTGAAGGTCTTCGAGCTT
GGTCTCCGCTATTTCGTACTCAAGTTCTCTGAGACTGACTATTTAGCCCTAGAAGACCTTCCTTGGTCAATCCCTAATCTCTGTATCCATGCCTTCCCATGGACT
CCCGATTTCAAACCCTCTGAGGCCATCAATTCCTCTGTTAATGTTTGGATCCGCCTCCCTGAGCTCTCAATCGAGTATTACGACGTTGAAATTCTGAACCGGATT
GCAGAGGCCATTGGTGGCTGTCTTCTTAAGATCGATCCTGTAACTAGAGATCGATGGAAATGTAAGTTCGCTCGCTTTTGCATTAGGGTAAATCTGTGCGATCCT
CTTCCATCGATGATCGAGCTTGGTACAATTCGACAAAGGATTGAGTATGAGGGTTTTGAATTGTGTCCTAAGTGTAGTCATGTTGGTGATTTGGGACATAATTGT
TTGGGTTTGAGTAATCCTTCTGGTTCTTCCGGAGTCAATCCCCATGGAGATAAACCCCATCACCCTGGAACTCGATCTTTTAAGGAATTTGGCTGGAGCTCAAGT
TCAAAGCAGCCATTGATTCCTTTTGAATCTTCGCCAGTATCAGCTTGGGAATCTAAATTCATTGGAAAAGACCCACAGCTTGATTTGAAGTTGATTGACTTGCCT
AAACTTCCAATGGGTGAATCTAGAAAAGCAGGTACAAGTGTAAGAATAAGTTCTCCACATGTCCATGTGAAGGATAAAGCAACTGGGAAGCAAAAGCAGAAATGT
GGAACCCTTGTTCAACCATTGCCCAACTTGCCAAAACAATCTTCAACAATAACCATCGAAGCTCCTGAGTTAAAACACACAGTCCCTTCTGTTGTTGAGGATCAG
TCCAAGGCTTCAAAAACCAGCAACCCCACCATGATTCCAAGCCATAAAAGCCAACCACCATCACCAACTGCAAGCATTCCATCCCTACAACCATGTCCTGCTTCT
GAGGCTACCCTCAAGTTCCATTCGGATGCGATCCAATGCTTGACAAGAAAGAAAGAGATAACCAACACACTGTCTAAAGAGATCAATGTTGATAGTTGCCCCACT
GTTTACACGCTGGATTCAAAGGTCACAAACATTGACATTGCTTTGTCAGAAGTGCAAACAACCTCAATGACAAACCAAAATCAATACACAATTGAGTTAGTTCCA
ACTATGAAAGGTGGAGATGAGGGTGGAGTTGGTTTGGAGGTCGAGTCAGGATCAGAATCATGTGCTAAGAAGATTTTGTGCTGGAAGGTTCATGGGACGGAGAAT
GCCAAGCTAATGCGAGCATTGAAGGATTTGATTCAGCTACACGAGCCGTCCATTGTGCTGATCTTTGGCCCCAAGATCAATGGTGTTGATGCAGATCAGGTCGTG
CGGGAACTCGCTTTCTGCAGCTCTTACCGCAGCAAGCCGGACGGCTACAATAGTGGTGTTTGGCTGTTATTGTCTAAGCAAGATGTGCAAACCAAAATTAACTCA
TATAACCCACAGCAGGTTTCTGCATCCGTATCATTTGATGCTGAAACCAATGTACCAGTGTTTAGTCCTTCAGATGTAGATACCGAAACATCTTCCGGACCATGG
GGATTGCCTTTCTTCTATACTTCGAACTTGCCGATGACTTCATTGGCATATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGCTTCAATCCGACAACCAACTCACCGGCGCCGGCGACGACGAAGCGGCCGCTAGAAACTATCTCAGCAGGAAGAAGCCCAAGGGACCACCACCGATGCCT
TCCTCATCGGACTTCGAATCTCGCCGTTCAACCACCATCGCCACCGTTTGTAACTGTAAGCTCTCTCCCTCTGATACTACTCGTATCACTCAACAGTTCAATCAC
TCTCTCATAGCTCGGATCGTCGGAAAGGACATCCGTCCAGGCCAACTCGCCGCTCGCCTTCGCCATCATCTTCGTCTCACCGAAGATGTGAAGGTCTTCGAGCTT
GGTCTCCGCTATTTCGTACTCAAGTTCTCTGAGACTGACTATTTAGCCCTAGAAGACCTTCCTTGGTCAATCCCTAATCTCTGTATCCATGCCTTCCCATGGACT
CCCGATTTCAAACCCTCTGAGGCCATCAATTCCTCTGTTAATGTTTGGATCCGCCTCCCTGAGCTCTCAATCGAGTATTACGACGTTGAAATTCTGAACCGGATT
GCAGAGGCCATTGGTGGCTGTCTTCTTAAGATCGATCCTGTAACTAGAGATCGATGGAAATGTAAGTTCGCTCGCTTTTGCATTAGGGTAAATCTGTGCGATCCT
CTTCCATCGATGATCGAGCTTGGTACAATTCGACAAAGGATTGAGTATGAGGGTTTTGAATTGTGTCCTAAGTGTAGTCATGTTGGTGATTTGGGACATAATTGT
TTGGGTTTGAGTAATCCTTCTGGTTCTTCCGGAGTCAATCCCCATGGAGATAAACCCCATCACCCTGGAACTCGATCTTTTAAGGAATTTGGCTGGAGCTCAAGT
TCAAAGCAGCCATTGATTCCTTTTGAATCTTCGCCAGTATCAGCTTGGGAATCTAAATTCATTGGAAAAGACCCACAGCTTGATTTGAAGTTGATTGACTTGCCT
AAACTTCCAATGGGTGAATCTAGAAAAGCAGGTACAAGTGTAAGAATAAGTTCTCCACATGTCCATGTGAAGGATAAAGCAACTGGGAAGCAAAAGCAGAAATGT
GGAACCCTTGTTCAACCATTGCCCAACTTGCCAAAACAATCTTCAACAATAACCATCGAAGCTCCTGAGTTAAAACACACAGTCCCTTCTGTTGTTGAGGATCAG
TCCAAGGCTTCAAAAACCAGCAACCCCACCATGATTCCAAGCCATAAAAGCCAACCACCATCACCAACTGCAAGCATTCCATCCCTACAACCATGTCCTGCTTCT
GAGGCTACCCTCAAGTTCCATTCGGATGCGATCCAATGCTTGACAAGAAAGAAAGAGATAACCAACACACTGTCTAAAGAGATCAATGTTGATAGTTGCCCCACT
GTTTACACGCTGGATTCAAAGGTCACAAACATTGACATTGCTTTGTCAGAAGTGCAAACAACCTCAATGACAAACCAAAATCAATACACAATTGAGTTAGTTCCA
ACTATGAAAGGTGGAGATGAGGGTGGAGTTGGTTTGGAGGTCGAGTCAGGATCAGAATCATGTGCTAAGAAGATTTTGTGCTGGAAGGTTCATGGGACGGAGAAT
GCCAAGCTAATGCGAGCATTGAAGGATTTGATTCAGCTACACGAGCCGTCCATTGTGCTGATCTTTGGCCCCAAGATCAATGGTGTTGATGCAGATCAGGTCGTG
CGGGAACTCGCTTTCTGCAGCTCTTACCGCAGCAAGCCGGACGGCTACAATAGTGGTGTTTGGCTGTTATTGTCTAAGCAAGATGTGCAAACCAAAATTAACTCA
TATAACCCACAGCAGGTTTCTGCATCCGTATCATTTGATGCTGAAACCAATGTACCAGTGTTTAGTCCTTCAGATGTAGATACCGAAACATCTTCCGGACCATGG
GGATTGCCTTTCTTCTATACTTCGAACTTGCCGATGACTTCATTGGCATATTGA
Protein sequenceShow/hide protein sequence
MELQSDNQLTGAGDDEAAARNYLSRKKPKGPPPMPSSSDFESRRSTTIATVCNCKLSPSDTTRITQQFNHSLIARIVGKDIRPGQLAARLRHHLRLTEDVKVFEL
GLRYFVLKFSETDYLALEDLPWSIPNLCIHAFPWTPDFKPSEAINSSVNVWIRLPELSIEYYDVEILNRIAEAIGGCLLKIDPVTRDRWKCKFARFCIRVNLCDP
LPSMIELGTIRQRIEYEGFELCPKCSHVGDLGHNCLGLSNPSGSSGVNPHGDKPHHPGTRSFKEFGWSSSSKQPLIPFESSPVSAWESKFIGKDPQLDLKLIDLP
KLPMGESRKAGTSVRISSPHVHVKDKATGKQKQKCGTLVQPLPNLPKQSSTITIEAPELKHTVPSVVEDQSKASKTSNPTMIPSHKSQPPSPTASIPSLQPCPAS
EATLKFHSDAIQCLTRKKEITNTLSKEINVDSCPTVYTLDSKVTNIDIALSEVQTTSMTNQNQYTIELVPTMKGGDEGGVGLEVESGSESCAKKILCWKVHGTEN
AKLMRALKDLIQLHEPSIVLIFGPKINGVDADQVVRELAFCSSYRSKPDGYNSGVWLLLSKQDVQTKINSYNPQQVSASVSFDAETNVPVFSPSDVDTETSSGPW
GLPFFYTSNLPMTSLAY