; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10014985 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10014985
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionDUF4283 domain-containing protein
Genome locationChr02:22652705..22654636
RNA-Seq ExpressionHG10014985
SyntenyHG10014985
Gene Ontology termsNA
InterPro domainsIPR025558 - Domain of unknown function DUF4283
IPR040256 - Uncharacterized protein At4g02000-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0034062.1 uncharacterized protein E6C27_scaffold65G00480 [Cucumis melo var. makuwa]7.8e-30683.54Show/hide
Query:  MALQSDHQPTGAGDDEAAARNYLSRKKPKGSPTMTSSSDFESRRPTTIATVCNCKLSPSETTRITQQFNHSLIARIVGKDIRPGQLASRLRHHLRLTEDV
        MALQSDH PTGAGDDEAAARNYLSRKKPK  P ++ SSDFESR  TTIATVCNC L+PSETTRITQQF HSLIAR+VGKD RPGQLA+RLRHHLRLT+DV
Subjt:  MALQSDHQPTGAGDDEAAARNYLSRKKPKGSPTMTSSSDFESRRPTTIATVCNCKLSPSETTRITQQFNHSLIARIVGKDIRPGQLASRLRHHLRLTEDV

Query:  KVFELGLGYFVLKFSETDYLALEDLPWSIANLCIYAFPWTPDFKPSEAINSSINVWIRLPELSIEYYDVEILKRIADAIGGCLVKIDPVTRDRWKCKFAR
        KVFELGLGYFVLKFSETDYLALEDLPWSI NLCI+AFPWTPDFKPSEAINSS+NVWIRLPELSIEYYDVEILKRIADAIGG LVKIDPVTRDRWKCKFAR
Subjt:  KVFELGLGYFVLKFSETDYLALEDLPWSIANLCIYAFPWTPDFKPSEAINSSINVWIRLPELSIEYYDVEILKRIADAIGGCLVKIDPVTRDRWKCKFAR

Query:  FCIRVNLCDPLPSMIELGRTRQKIEYEGFELCPKCSRVGDLRHDCSGLSNPSGSSGFNPHGDKPHHPRTRSFKEFGSSSSSKQPLIPSESSAASAWE-SR
        FCI VNLCDPLPSMIELGR RQ+IEYEGFELC KC+RVGDLRHDCS L+NPSGS GFNPHGD+PHH  TR FKEFGS+SSSKQPLIP ESS  SAWE SR
Subjt:  FCIRVNLCDPLPSMIELGRTRQKIEYEGFELCPKCSRVGDLRHDCSGLSNPSGSSGFNPHGDKPHHPRTRSFKEFGSSSSSKQPLIPSESSAASAWE-SR

Query:  FIEKDPQLDLKLIDWPKLPMGESGKVGTSVRISSPHVHVTDKAIAKQKEKCGISVQPLPNLPK-QSSTITIKAPELKHIVPSVVEDPSKAAKTSNPTMIA
        FIEK+PQLDLK I+WP LP  ESGK GTSVRISSPHVHV DKAI K+KEKC ISVQPLP+LPK QSSTITIKAPELK +VPSVVED  K AKT N TMIA
Subjt:  FIEKDPQLDLKLIDWPKLPMGESGKVGTSVRISSPHVHVTDKAIAKQKEKCGISVQPLPNLPK-QSSTITIKAPELKHIVPSVVEDPSKAAKTSNPTMIA

Query:  SHKSQPPSPTASIPSLRPCPASEATLKFHSDAIQCLTRKKEITNTLSKETNVDSCPTVYTLEPKKVTSLNIALSEVQTTSMSNQNQYTIEIVPTMRGGDE
         H SQPPSPTASIP L+P PASEATLKF SDAI CLTRK+EI N+ SKETN  S PTVYT++PKK+TSLNI+LSEVQTTSMSNQNQYTIE+VPTM+GGD+
Subjt:  SHKSQPPSPTASIPSLRPCPASEATLKFHSDAIQCLTRKKEITNTLSKETNVDSCPTVYTLEPKKVTSLNIALSEVQTTSMSNQNQYTIEIVPTMRGGDE

Query:  DGVGSEVASGSESCAKKMLCWKVHGTDNAKLMRALKDLIQLHEPSVVLIFGTKINGVDADQVVRELAFYGFYCSKPDGYNGGVWLLLSKQDVQTKVNSYN
         GVG EV SGSE CAKKML WK H  DNAKLMRALKDLIQLHEPS+VLIFG KI GVDA +V++ELAF G Y S+PDGYNGGVWLLLSKQDVQTKVNSY+
Subjt:  DGVGSEVASGSESCAKKMLCWKVHGTDNAKLMRALKDLIQLHEPSVVLIFGTKINGVDADQVVRELAFYGFYCSKPDGYNGGVWLLLSKQDVQTKVNSYN

Query:  PQQVSASVFFHSETNV----PALKD-ETSSGPWGSTFFYTSTNWMTSLAY
        PQQVSASV FHSETNV    P+  D ETSSGPWGSTFFYTSTNWMTSLAY
Subjt:  PQQVSASVFFHSETNV----PALKD-ETSSGPWGSTFFYTSTNWMTSLAY

KAG6600114.1 hypothetical protein SDJN03_05347, partial [Cucurbita argyrosperma subsp. sororia]4.7e-22665.79Show/hide
Query:  QSDHQPTGAGDDEAAARNYLSRKKPKGSPTMTSSSDFESRRPTTIATVCNCKLSPSETTRITQQFNHSLIARIVGKDIRPGQLASRLRHHLRLTEDVKVF
        QS+ QPTGAGDDEAAAR YLSRKK K  P M SSSD ES R TT ATVCN  LSPS+T RITQQF+HSLIA + G+DIRP QLA RLR HL LT+DV+VF
Subjt:  QSDHQPTGAGDDEAAARNYLSRKKPKGSPTMTSSSDFESRRPTTIATVCNCKLSPSETTRITQQFNHSLIARIVGKDIRPGQLASRLRHHLRLTEDVKVF

Query:  ELGLGYFVLKFSETDYLALEDLPWSIANLCIYAFPWTPDFKPSEAINSSINVWIRLPELSIEYYDVEILKRIADAIGGCLVKIDPVTRDRWKCKFARFCI
        ELGLGYFVLKFSETDYLALEDLPWSI NLCIYAF WTPDFKPSEAINSS++VWIRL ELSIEYYD EIL++IA  IGG LVK DPVT++R KCKFAR CI
Subjt:  ELGLGYFVLKFSETDYLALEDLPWSIANLCIYAFPWTPDFKPSEAINSSINVWIRLPELSIEYYDVEILKRIADAIGGCLVKIDPVTRDRWKCKFARFCI

Query:  RVNLCDPLPSMIELGRTRQKIEYEGFE-LCPKCSRVGDLRHDCSGLSNPSGSSGFNPHGDKP-HHPRTRSFKEFGSSSSSKQPLIPSESSAASAWESRF-
        R+NLCDPLPSMI+LGR +QKIEYEG + LCP C  V DL+ +C    NPSGSSG +  GD+P HH RTR   E GSSSSSKQPLIPS SS ASA  SRF 
Subjt:  RVNLCDPLPSMIELGRTRQKIEYEGFE-LCPKCSRVGDLRHDCSGLSNPSGSSGFNPHGDKP-HHPRTRSFKEFGSSSSSKQPLIPSESSAASAWESRF-

Query:  -IEKDPQLDLKLIDWPKLPMGESGKVGTSVRISSPHVHVTDKAIAKQKEKCGISVQPLPNLPKQSSTITIKAPELKHIVPSVVEDPSKAAKTSNPTMIAS
         +E D  LD            E  K   S+RISSPHVHV DKA AK KE CG  V+ LP LPK+ ST T KAPEL+ + P+VVE   K AKTSNPT+IA 
Subjt:  -IEKDPQLDLKLIDWPKLPMGESGKVGTSVRISSPHVHVTDKAIAKQKEKCGISVQPLPNLPKQSSTITIKAPELKHIVPSVVEDPSKAAKTSNPTMIAS

Query:  HKSQPPSPTASIPSLRPCPASEATLKFHSDAIQCLTRKKEITNTLSKETNVDSCPTVYTLEPKKVTSLNIALSEVQTTSMSNQNQYTIEIVPTMRGGDED
        H +Q            PC   +ATL F S  I+   ++KE+ +  SKE  VD CP V+T+  KK+ S  + LS +QT S+ N+N YT++ +PT R  DED
Subjt:  HKSQPPSPTASIPSLRPCPASEATLKFHSDAIQCLTRKKEITNTLSKETNVDSCPTVYTLEPKKVTSLNIALSEVQTTSMSNQNQYTIEIVPTMRGGDED

Query:  GVGSEVASGSESCAKKMLCWKVHGTDNAKLMRALKDLIQLHEPSVVLIFGTKINGVDADQVVRELAFYGFYCSKPDGYNGGVWLLLSKQDVQTKVNSYNP
        G GS+  SGSESC+KKMLCWK HGTDNA LM+ALKDLIQLHEPS+VLIFGTKI+G +A+ VVREL+F G YC KPDGYNGGVWLLLS+QDVQ +V+SY+P
Subjt:  GVGSEVASGSESCAKKMLCWKVHGTDNAKLMRALKDLIQLHEPSVVLIFGTKINGVDADQVVRELAFYGFYCSKPDGYNGGVWLLLSKQDVQTKVNSYNP

Query:  QQVSASVFFHSETNVPA-----LKDETSSGPWGSTFFYTSTNWMTSLAY
        QQVSASV+F S TN PA     +  ETSSGPWGSTFFYTSTNWM+S+AY
Subjt:  QQVSASVFFHSETNVPA-----LKDETSSGPWGSTFFYTSTNWMTSLAY

KAG7030785.1 hypothetical protein SDJN02_04822, partial [Cucurbita argyrosperma subsp. argyrosperma]2.1e-22665.79Show/hide
Query:  QSDHQPTGAGDDEAAARNYLSRKKPKGSPTMTSSSDFESRRPTTIATVCNCKLSPSETTRITQQFNHSLIARIVGKDIRPGQLASRLRHHLRLTEDVKVF
        QS+ QPTGAGDDEAAAR YLSRKK K  P M SSSD ES R TT ATVCN  LSPS+T RITQQF+HSLIA + G+DIRP QLA RLR HL LT+DV+VF
Subjt:  QSDHQPTGAGDDEAAARNYLSRKKPKGSPTMTSSSDFESRRPTTIATVCNCKLSPSETTRITQQFNHSLIARIVGKDIRPGQLASRLRHHLRLTEDVKVF

Query:  ELGLGYFVLKFSETDYLALEDLPWSIANLCIYAFPWTPDFKPSEAINSSINVWIRLPELSIEYYDVEILKRIADAIGGCLVKIDPVTRDRWKCKFARFCI
        ELGLGYFVLKFSETDYLALEDLPWSI NLCIYAF WTPDFKPSEAINSS++VWIRL ELSIEYYD EIL++IA  IGG LVK DPVT++R KCKFAR CI
Subjt:  ELGLGYFVLKFSETDYLALEDLPWSIANLCIYAFPWTPDFKPSEAINSSINVWIRLPELSIEYYDVEILKRIADAIGGCLVKIDPVTRDRWKCKFARFCI

Query:  RVNLCDPLPSMIELGRTRQKIEYEGFE-LCPKCSRVGDLRHDCSGLSNPSGSSGFNPHGDKP-HHPRTRSFKEFGSSSSSKQPLIPSESSAASAWESRF-
        R+NLCDPLPSMI+LGR +QKIEYEG + LCP C  V DL+ +C    NPSGSSG +  GD+P HH RTR   E GSSSSSKQPLIPS SS AS   SRF 
Subjt:  RVNLCDPLPSMIELGRTRQKIEYEGFE-LCPKCSRVGDLRHDCSGLSNPSGSSGFNPHGDKP-HHPRTRSFKEFGSSSSSKQPLIPSESSAASAWESRF-

Query:  -IEKDPQLDLKLIDWPKLPMGESGKVGTSVRISSPHVHVTDKAIAKQKEKCGISVQPLPNLPKQSSTITIKAPELKHIVPSVVEDPSKAAKTSNPTMIAS
         +E D  LD            E  K   S+RISSPHVHV DKA AK KE CG  V+ LP LPK+ ST T KAPEL+ + P+VVE   K AKTSNPT+IA 
Subjt:  -IEKDPQLDLKLIDWPKLPMGESGKVGTSVRISSPHVHVTDKAIAKQKEKCGISVQPLPNLPKQSSTITIKAPELKHIVPSVVEDPSKAAKTSNPTMIAS

Query:  HKSQPPSPTASIPSLRPCPASEATLKFHSDAIQCLTRKKEITNTLSKETNVDSCPTVYTLEPKKVTSLNIALSEVQTTSMSNQNQYTIEIVPTMRGGDED
        H +Q            PC   +ATL F S  I+  T++KE+ +  SKE  VD CP V+T+  KK+ S  + LS +QT S+ N+N YT++ +PT R  DED
Subjt:  HKSQPPSPTASIPSLRPCPASEATLKFHSDAIQCLTRKKEITNTLSKETNVDSCPTVYTLEPKKVTSLNIALSEVQTTSMSNQNQYTIEIVPTMRGGDED

Query:  GVGSEVASGSESCAKKMLCWKVHGTDNAKLMRALKDLIQLHEPSVVLIFGTKINGVDADQVVRELAFYGFYCSKPDGYNGGVWLLLSKQDVQTKVNSYNP
        G GS+  SGSESC+KKMLCWK HGTDNA LM+ALKDLIQLHEPS+VLIFGTKI+G +A+ VVREL+F G YC KPDGYNGGVWLLLS+QDVQ +V+SY+P
Subjt:  GVGSEVASGSESCAKKMLCWKVHGTDNAKLMRALKDLIQLHEPSVVLIFGTKINGVDADQVVRELAFYGFYCSKPDGYNGGVWLLLSKQDVQTKVNSYNP

Query:  QQVSASVFFHSETNVPA-----LKDETSSGPWGSTFFYTSTNWMTSLAY
        QQVSASV+F S TN PA     +  ETSSGPWGSTFFYTSTNWM+S+AY
Subjt:  QQVSASVFFHSETNVPA-----LKDETSSGPWGSTFFYTSTNWMTSLAY

KAG7031864.1 hypothetical protein SDJN02_05905, partial [Cucurbita argyrosperma subsp. argyrosperma]7.2e-21163.54Show/hide
Query:  MALQSDHQPTGAGDDEAAARNYLSRKKPKGSPTMTSSSDFESRRPTTIATVCNCKLSPSETTRITQQFNHSLIARIVGKDIRPGQLASRLRHHLRLTEDV
        MALQSD+Q TGAGDDEAAA    +R + K                   ATV  C+LS S+T RITQQF+HS IA I GKD+RP ++AS LR HL LT  V
Subjt:  MALQSDHQPTGAGDDEAAARNYLSRKKPKGSPTMTSSSDFESRRPTTIATVCNCKLSPSETTRITQQFNHSLIARIVGKDIRPGQLASRLRHHLRLTEDV

Query:  KVFELGLGYFVLKFSETDYLALEDLPWSIANLCIYAFPWTPDFKPSEAINSSINVWIRLPELSIEYYDVEILKRIADAIGGCLVKIDPVTRDRWKCKFAR
        KVFELGLGYFVLKF ETD+LAL+DLPWS+ NLCI+  PWTPDFKPSE I SS++VW+RL ELSIEYYD E+L++IA AIGG LVKIDPVT++R KCKFAR
Subjt:  KVFELGLGYFVLKFSETDYLALEDLPWSIANLCIYAFPWTPDFKPSEAINSSINVWIRLPELSIEYYDVEILKRIADAIGGCLVKIDPVTRDRWKCKFAR

Query:  FCIRVNLCDPLPSMIELGRTRQKIEYEGFE-LCPKCSRVGDLRHDCSGLSNPSGSSGFNPHGDKPHHPRTRSFKEFGSSSSSKQPLIPSESSAASAWESR
         C+RVNLCDPLPSMI LG+ RQ+IEYEGFE LCP CSRV  LRH+C  L  PSG SGFNPH  KPHH   RSF         KQPLIPSESSA S   SR
Subjt:  FCIRVNLCDPLPSMIELGRTRQKIEYEGFE-LCPKCSRVGDLRHDCSGLSNPSGSSGFNPHGDKPHHPRTRSFKEFGSSSSSKQPLIPSESSAASAWESR

Query:  FIEKDPQLDLKLIDWPKLPMGESGKVGTSVRISSPHVHVTDKAIAKQKEKCGISVQPLPNLPKQSSTITIKAPELKHIVPSVVEDPSKAAKTS-NPTMIA
        F      LDL   +WP L  GESGK GTS+R SS  VHV DKAIAK+KEKCG+SVQP   LPK+SS +TIK             D  KAAKTS NPT+  
Subjt:  FIEKDPQLDLKLIDWPKLPMGESGKVGTSVRISSPHVHVTDKAIAKQKEKCGISVQPLPNLPKQSSTITIKAPELKHIVPSVVEDPSKAAKTS-NPTMIA

Query:  SHKSQPPSPTASIPSLRPCPASEATLKFHSDAIQCLTRKKEITNTLSKETNVDSCPTVYTLEPKKVTSLNIALSEVQTTSMSNQNQYTIEIVPTMRGGDE
            QPPSPT S+P L PCPASEA L FHS AIQ  TR KEIT+  SKE NVDSCPTVYT++P K+ +L+IALSE +TTSMSNQ QY IE VPT R GD+
Subjt:  SHKSQPPSPTASIPSLRPCPASEATLKFHSDAIQCLTRKKEITNTLSKETNVDSCPTVYTLEPKKVTSLNIALSEVQTTSMSNQNQYTIEIVPTMRGGDE

Query:  DGVGSEVASGSESCAKKMLCWKVHGTDNAKLMRALKDLIQLHEPSVVLIFGTKINGVDADQVVRELAFYGFYCSKPDGYNGGVWLLLSKQDVQTKVNSYN
         GV     SGSESC KK+LCWK H TDN KL+R+LKDLI+LHEPS+VLIFGTKI+G D D+VV+EL F   Y  KPDGY+GGVWLLLS QDV+TKVNS +
Subjt:  DGVGSEVASGSESCAKKMLCWKVHGTDNAKLMRALKDLIQLHEPSVVLIFGTKINGVDADQVVRELAFYGFYCSKPDGYNGGVWLLLSKQDVQTKVNSYN

Query:  PQQVSASVFFHSETNVPA-----LKDETSSGPWGSTFFYTSTNWMTSLAY
        PQQ+ AS++F S+TN  A     ++ + SSGPWGS FF+T TNWMTS+AY
Subjt:  PQQVSASVFFHSETNVPA-----LKDETSSGPWGSTFFYTSTNWMTSLAY

KGN50455.1 hypothetical protein Csa_000357 [Cucumis sativus]4.3e-28878.91Show/hide
Query:  MALQSDHQPTGAGDDEAAARNYLSRKKPKGSPTMTSSSDFESRRPTTIATVCNCKLSPSETTRITQQFNHSLIARIVGKDIRPGQLASRLRHHLRLTEDV
        MA QS H PTGAGDDEAAARNYLSRKKPK  P +  SSDF SRR TTIATVCNC L+PSETTRITQQF HSLIAR+VGKD RPGQLA+RLRHHLRLT+DV
Subjt:  MALQSDHQPTGAGDDEAAARNYLSRKKPKGSPTMTSSSDFESRRPTTIATVCNCKLSPSETTRITQQFNHSLIARIVGKDIRPGQLASRLRHHLRLTEDV

Query:  KVFELGLGYFVLKFSETDYLALEDLPWSIANLCIYAFPWTPDFKPSEAINSSINVWIRLPELSIEYYDVEILKRIADAIGGCLVKIDPVTRDRWKCKFAR
        KVF+LGLGYFVLKFSETDYLALEDLPWSI NLCI+AFPWTPDFKPSEAINSS+NVWIRLPELSIEYYDV ILKRIADAIG  LVKIDPVTRDRWKCKFAR
Subjt:  KVFELGLGYFVLKFSETDYLALEDLPWSIANLCIYAFPWTPDFKPSEAINSSINVWIRLPELSIEYYDVEILKRIADAIGGCLVKIDPVTRDRWKCKFAR

Query:  FCIRVNLCDPLPSMIELGRTRQKIEYEGFELCPKCSRVGDLRHDCS----------GLSNPSGSSGFNPHGDKPHHPRTRSFKEFGSSSSSKQPLIPSES
        FCI VNLCDPLPSMIELGR RQ+IEYEGFELC KC+RVGDLRHDCS           L+NPSGS GFNPHGD+PHH  TR FKE GS+S+SKQPLIP ES
Subjt:  FCIRVNLCDPLPSMIELGRTRQKIEYEGFELCPKCSRVGDLRHDCS----------GLSNPSGSSGFNPHGDKPHHPRTRSFKEFGSSSSSKQPLIPSES

Query:  SAASAWE-SRFIEKDPQLDLKLIDWPKLPMGESGKVGTSVRISSPHVHVTDKAIAKQKEKCGISVQPLPNLPKQSSTITIKAPELKHIVPSVVEDPSKAA
        S  SAWE SRFIEK+P LDLKLIDWP LP  ESGK G+ VRISSP VHV DK I K+KEKC ISVQ LPNLPKQ STITIKAPELK +VPSVVED  K  
Subjt:  SAASAWE-SRFIEKDPQLDLKLIDWPKLPMGESGKVGTSVRISSPHVHVTDKAIAKQKEKCGISVQPLPNLPKQSSTITIKAPELKHIVPSVVEDPSKAA

Query:  KTSNPTMIASHKSQPPSPTASIPSLRPCPASEATLKFHSDAIQCLTRKKEITNTLSKETNVDSCPTVYTLEPKKVTSLNIALSEVQTTSMSNQNQYTIEI
        KT N TMIA H SQPPSPTASIP L+P PASEATLKF SDAI CLTRK+EI N+ SK  N  S PTVYT++PKK+TSLNIALSEVQ          TIE+
Subjt:  KTSNPTMIASHKSQPPSPTASIPSLRPCPASEATLKFHSDAIQCLTRKKEITNTLSKETNVDSCPTVYTLEPKKVTSLNIALSEVQTTSMSNQNQYTIEI

Query:  VPTMRGGDEDGVGSEVASGSESCAKKMLCWKVHGTDNAKLMRALKDLIQLHEPSVVLIFGTKINGVDADQVVRELAFYGFYCSKPDGYNGGVWLLLSKQD
        VPTM+GGDE GVGSEV SGSE CAKK+L WK H  DNAKLMRALKDLIQLHEPS+VLIFG KI+GVD D+V+RELAF G Y SKPDGYNGGVWLLLSKQD
Subjt:  VPTMRGGDEDGVGSEVASGSESCAKKMLCWKVHGTDNAKLMRALKDLIQLHEPSVVLIFGTKINGVDADQVVRELAFYGFYCSKPDGYNGGVWLLLSKQD

Query:  VQTKVNSYNPQQVSASVFFHSETNV----PALKD-ETSSGPWGSTFFYTSTNWMTSLAY
        VQTKVNS++ QQVSASV FHSETNV    P+  D +TSSGPWGSTFFYTSTNWMTSLAY
Subjt:  VQTKVNSYNPQQVSASVFFHSETNV----PALKD-ETSSGPWGSTFFYTSTNWMTSLAY

TrEMBL top hitse value%identityAlignment
A0A0A0KLB0 DUF4283 domain-containing protein2.1e-28878.91Show/hide
Query:  MALQSDHQPTGAGDDEAAARNYLSRKKPKGSPTMTSSSDFESRRPTTIATVCNCKLSPSETTRITQQFNHSLIARIVGKDIRPGQLASRLRHHLRLTEDV
        MA QS H PTGAGDDEAAARNYLSRKKPK  P +  SSDF SRR TTIATVCNC L+PSETTRITQQF HSLIAR+VGKD RPGQLA+RLRHHLRLT+DV
Subjt:  MALQSDHQPTGAGDDEAAARNYLSRKKPKGSPTMTSSSDFESRRPTTIATVCNCKLSPSETTRITQQFNHSLIARIVGKDIRPGQLASRLRHHLRLTEDV

Query:  KVFELGLGYFVLKFSETDYLALEDLPWSIANLCIYAFPWTPDFKPSEAINSSINVWIRLPELSIEYYDVEILKRIADAIGGCLVKIDPVTRDRWKCKFAR
        KVF+LGLGYFVLKFSETDYLALEDLPWSI NLCI+AFPWTPDFKPSEAINSS+NVWIRLPELSIEYYDV ILKRIADAIG  LVKIDPVTRDRWKCKFAR
Subjt:  KVFELGLGYFVLKFSETDYLALEDLPWSIANLCIYAFPWTPDFKPSEAINSSINVWIRLPELSIEYYDVEILKRIADAIGGCLVKIDPVTRDRWKCKFAR

Query:  FCIRVNLCDPLPSMIELGRTRQKIEYEGFELCPKCSRVGDLRHDCS----------GLSNPSGSSGFNPHGDKPHHPRTRSFKEFGSSSSSKQPLIPSES
        FCI VNLCDPLPSMIELGR RQ+IEYEGFELC KC+RVGDLRHDCS           L+NPSGS GFNPHGD+PHH  TR FKE GS+S+SKQPLIP ES
Subjt:  FCIRVNLCDPLPSMIELGRTRQKIEYEGFELCPKCSRVGDLRHDCS----------GLSNPSGSSGFNPHGDKPHHPRTRSFKEFGSSSSSKQPLIPSES

Query:  SAASAWE-SRFIEKDPQLDLKLIDWPKLPMGESGKVGTSVRISSPHVHVTDKAIAKQKEKCGISVQPLPNLPKQSSTITIKAPELKHIVPSVVEDPSKAA
        S  SAWE SRFIEK+P LDLKLIDWP LP  ESGK G+ VRISSP VHV DK I K+KEKC ISVQ LPNLPKQ STITIKAPELK +VPSVVED  K  
Subjt:  SAASAWE-SRFIEKDPQLDLKLIDWPKLPMGESGKVGTSVRISSPHVHVTDKAIAKQKEKCGISVQPLPNLPKQSSTITIKAPELKHIVPSVVEDPSKAA

Query:  KTSNPTMIASHKSQPPSPTASIPSLRPCPASEATLKFHSDAIQCLTRKKEITNTLSKETNVDSCPTVYTLEPKKVTSLNIALSEVQTTSMSNQNQYTIEI
        KT N TMIA H SQPPSPTASIP L+P PASEATLKF SDAI CLTRK+EI N+ SK  N  S PTVYT++PKK+TSLNIALSEVQ          TIE+
Subjt:  KTSNPTMIASHKSQPPSPTASIPSLRPCPASEATLKFHSDAIQCLTRKKEITNTLSKETNVDSCPTVYTLEPKKVTSLNIALSEVQTTSMSNQNQYTIEI

Query:  VPTMRGGDEDGVGSEVASGSESCAKKMLCWKVHGTDNAKLMRALKDLIQLHEPSVVLIFGTKINGVDADQVVRELAFYGFYCSKPDGYNGGVWLLLSKQD
        VPTM+GGDE GVGSEV SGSE CAKK+L WK H  DNAKLMRALKDLIQLHEPS+VLIFG KI+GVD D+V+RELAF G Y SKPDGYNGGVWLLLSKQD
Subjt:  VPTMRGGDEDGVGSEVASGSESCAKKMLCWKVHGTDNAKLMRALKDLIQLHEPSVVLIFGTKINGVDADQVVRELAFYGFYCSKPDGYNGGVWLLLSKQD

Query:  VQTKVNSYNPQQVSASVFFHSETNV----PALKD-ETSSGPWGSTFFYTSTNWMTSLAY
        VQTKVNS++ QQVSASV FHSETNV    P+  D +TSSGPWGSTFFYTSTNWMTSLAY
Subjt:  VQTKVNSYNPQQVSASVFFHSETNV----PALKD-ETSSGPWGSTFFYTSTNWMTSLAY

A0A5A7SSJ3 DUF4283 domain-containing protein3.8e-30683.54Show/hide
Query:  MALQSDHQPTGAGDDEAAARNYLSRKKPKGSPTMTSSSDFESRRPTTIATVCNCKLSPSETTRITQQFNHSLIARIVGKDIRPGQLASRLRHHLRLTEDV
        MALQSDH PTGAGDDEAAARNYLSRKKPK  P ++ SSDFESR  TTIATVCNC L+PSETTRITQQF HSLIAR+VGKD RPGQLA+RLRHHLRLT+DV
Subjt:  MALQSDHQPTGAGDDEAAARNYLSRKKPKGSPTMTSSSDFESRRPTTIATVCNCKLSPSETTRITQQFNHSLIARIVGKDIRPGQLASRLRHHLRLTEDV

Query:  KVFELGLGYFVLKFSETDYLALEDLPWSIANLCIYAFPWTPDFKPSEAINSSINVWIRLPELSIEYYDVEILKRIADAIGGCLVKIDPVTRDRWKCKFAR
        KVFELGLGYFVLKFSETDYLALEDLPWSI NLCI+AFPWTPDFKPSEAINSS+NVWIRLPELSIEYYDVEILKRIADAIGG LVKIDPVTRDRWKCKFAR
Subjt:  KVFELGLGYFVLKFSETDYLALEDLPWSIANLCIYAFPWTPDFKPSEAINSSINVWIRLPELSIEYYDVEILKRIADAIGGCLVKIDPVTRDRWKCKFAR

Query:  FCIRVNLCDPLPSMIELGRTRQKIEYEGFELCPKCSRVGDLRHDCSGLSNPSGSSGFNPHGDKPHHPRTRSFKEFGSSSSSKQPLIPSESSAASAWE-SR
        FCI VNLCDPLPSMIELGR RQ+IEYEGFELC KC+RVGDLRHDCS L+NPSGS GFNPHGD+PHH  TR FKEFGS+SSSKQPLIP ESS  SAWE SR
Subjt:  FCIRVNLCDPLPSMIELGRTRQKIEYEGFELCPKCSRVGDLRHDCSGLSNPSGSSGFNPHGDKPHHPRTRSFKEFGSSSSSKQPLIPSESSAASAWE-SR

Query:  FIEKDPQLDLKLIDWPKLPMGESGKVGTSVRISSPHVHVTDKAIAKQKEKCGISVQPLPNLPK-QSSTITIKAPELKHIVPSVVEDPSKAAKTSNPTMIA
        FIEK+PQLDLK I+WP LP  ESGK GTSVRISSPHVHV DKAI K+KEKC ISVQPLP+LPK QSSTITIKAPELK +VPSVVED  K AKT N TMIA
Subjt:  FIEKDPQLDLKLIDWPKLPMGESGKVGTSVRISSPHVHVTDKAIAKQKEKCGISVQPLPNLPK-QSSTITIKAPELKHIVPSVVEDPSKAAKTSNPTMIA

Query:  SHKSQPPSPTASIPSLRPCPASEATLKFHSDAIQCLTRKKEITNTLSKETNVDSCPTVYTLEPKKVTSLNIALSEVQTTSMSNQNQYTIEIVPTMRGGDE
         H SQPPSPTASIP L+P PASEATLKF SDAI CLTRK+EI N+ SKETN  S PTVYT++PKK+TSLNI+LSEVQTTSMSNQNQYTIE+VPTM+GGD+
Subjt:  SHKSQPPSPTASIPSLRPCPASEATLKFHSDAIQCLTRKKEITNTLSKETNVDSCPTVYTLEPKKVTSLNIALSEVQTTSMSNQNQYTIEIVPTMRGGDE

Query:  DGVGSEVASGSESCAKKMLCWKVHGTDNAKLMRALKDLIQLHEPSVVLIFGTKINGVDADQVVRELAFYGFYCSKPDGYNGGVWLLLSKQDVQTKVNSYN
         GVG EV SGSE CAKKML WK H  DNAKLMRALKDLIQLHEPS+VLIFG KI GVDA +V++ELAF G Y S+PDGYNGGVWLLLSKQDVQTKVNSY+
Subjt:  DGVGSEVASGSESCAKKMLCWKVHGTDNAKLMRALKDLIQLHEPSVVLIFGTKINGVDADQVVRELAFYGFYCSKPDGYNGGVWLLLSKQDVQTKVNSYN

Query:  PQQVSASVFFHSETNV----PALKD-ETSSGPWGSTFFYTSTNWMTSLAY
        PQQVSASV FHSETNV    P+  D ETSSGPWGSTFFYTSTNWMTSLAY
Subjt:  PQQVSASVFFHSETNV----PALKD-ETSSGPWGSTFFYTSTNWMTSLAY

A0A5A7SY10 DUF4283 domain-containing protein5.5e-13247.14Show/hide
Query:  DDEAAARNYLSRKKPKGSPTMTSSSDFESRRPTTIATVCNCKLSPSETTRITQQFNHSLIARIVGKDIRPGQLASRLRHHLRLTEDVKVFELGLGYFVLK
        DDE  + +    K+P     M+SSS+     PT         L+PS+T RI Q F+HSLIAR+ G ++    LA RLR +LRLT D+ VFEL LG+FVLK
Subjt:  DDEAAARNYLSRKKPKGSPTMTSSSDFESRRPTTIATVCNCKLSPSETTRITQQFNHSLIARIVGKDIRPGQLASRLRHHLRLTEDVKVFELGLGYFVLK

Query:  FS-ETDYL-ALEDLPWSIANLCIYAFPWTPDFKPSEAINSSINVWIRLPELSIEYYDVEILKRIADAIGGCLVKIDPVTRDRWKCKFARFCIRVNLCDPL
        FS  +DY  ALE+LPWSI++LCI+  PW P+FKPSEA+   ++VWIRLPEL IEYYD EIL++IA+AIG CLVKIDPVT  R KC FAR CIR+ LC+PL
Subjt:  FS-ETDYL-ALEDLPWSIANLCIYAFPWTPDFKPSEAINSSINVWIRLPELSIEYYDVEILKRIADAIGGCLVKIDPVTRDRWKCKFARFCIRVNLCDPL

Query:  PSMIELGRTRQKIEYEGFE-LCPKCSRVGDLRHDCSGLSNPSGSSGFNPHGDKP-----HHPRTRSFKEFGSSSSSKQPLIPSESSAASAWESRFIEKDP
           I+ G+T QK++YEG + LC  C  + +L+H C   +NPSGSSG +PH   P       P + S    G +  SK+PLI S  S+ SA  S+  EK+P
Subjt:  PSMIELGRTRQKIEYEGFE-LCPKCSRVGDLRHDCSGLSNPSGSSGFNPHGDKP-----HHPRTRSFKEFGSSSSSKQPLIPSESSAASAWESRFIEKDP

Query:  QLDLKLIDWPKLPMGESGKVGTSVRISSPHVHVTDKAIAKQKEKCGISVQPLPNLPKQSSTITIKAPELKHIVPSVVEDPSKAAKTSNPTMI-----ASH
         L+LKL D PKL MG                    K +  +K       + LPN P++SST T + PE   +   +V D  +AAK S+PT +      SH
Subjt:  QLDLKLIDWPKLPMGESGKVGTSVRISSPHVHVTDKAIAKQKEKCGISVQPLPNLPKQSSTITIKAPELKHIVPSVVEDPSKAAKTSNPTMI-----ASH

Query:  KSQPPSPTASIPSLRPCPASEATLKFHSDAI-QCLTRKKEITNTLSKETNVDSCPTVYTLEPKKVTSLNIALSEVQTTSMSNQNQYTIEIVPTMRGGDED
         S                A EA +   S AI Q +T KK I         VDS PTVYT++P  + SL I  SEV T + SNQN+Y I  V   R  +++
Subjt:  KSQPPSPTASIPSLRPCPASEATLKFHSDAI-QCLTRKKEITNTLSKETNVDSCPTVYTLEPKKVTSLNIALSEVQTTSMSNQNQYTIEIVPTMRGGDED

Query:  GVGSEVASGSESCAKKMLCWKVHGTDNAKLMRALKDLIQLHEPSVVLIFGTKINGVDADQVVRELAFYGFYCSKPDGYNGGVWLLLSKQDVQTKVNSYNP
         V S+ AS    C KKMLCW   G D AKL++A K LI+L EPS+VLIFG+KI+  DA++VVRELAF G YC KPDGYNGGVW++LS QDV+ +V+SY+P
Subjt:  GVGSEVASGSESCAKKMLCWKVHGTDNAKLMRALKDLIQLHEPSVVLIFGTKINGVDADQVVRELAFYGFYCSKPDGYNGGVWLLLSKQDVQTKVNSYNP

Query:  QQVSASVFFHSETNVPALK-----DETSSG
        Q+VSASV+F S+ N P ++      ETS G
Subjt:  QQVSASVFFHSETNVPALK-----DETSSG

A0A6J1FN13 uncharacterized protein LOC111446932 isoform X22.8e-13650Show/hide
Query:  MALQSDH----QPTGAGDDEAAARNYLSRKKPKGSPTMTSSSDFESRRPTTIATVCNCKLSPSETTRITQQFNHSLIARIVGKDIRPGQLASRLRHHLRL
        MA+QS H    + TGAG+D AAA                          T  ATVCN  L+PS+T RI QQF+ SLI  +VGK I P QLA RLR +L L
Subjt:  MALQSDH----QPTGAGDDEAAARNYLSRKKPKGSPTMTSSSDFESRRPTTIATVCNCKLSPSETTRITQQFNHSLIARIVGKDIRPGQLASRLRHHLRL

Query:  TEDVKVFELGLGYFVLKFSET--DYLALEDLPWSIANLCIYAFPWTPDFKPSEAINSSINVWIRLPELSIEYYDVEILKRIADAIGGCLVKIDPVTRDRW
          D+ VFELGLG+FVLKFS     Y ALE+ PWSI +LCIY FPW P+FKPSEA    ++VWIRLPELSIEYYD E+L++IA+ IGG LVKIDPVT  R 
Subjt:  TEDVKVFELGLGYFVLKFSET--DYLALEDLPWSIANLCIYAFPWTPDFKPSEAINSSINVWIRLPELSIEYYDVEILKRIADAIGGCLVKIDPVTRDRW

Query:  KCKFARFCIRVNLCDPLPSMIELGRTRQKIEYEGFE-LCPKCSRVGDLRHDCSGLSNPSGSSGFNPHGDKPHHPRTRSFKEFGSSSSS------------
        KC +AR CIR+NL  PL    + G+  QKI YEG + LC  C  V DL+HDC  LSN S SSGF+     PHH   R  +  GSS SS            
Subjt:  KCKFARFCIRVNLCDPLPSMIELGRTRQKIEYEGFE-LCPKCSRVGDLRHDCSGLSNPSGSSGFNPHGDKPHHPRTRSFKEFGSSSSS------------

Query:  ----------KQPLIPSESSAASAWESRFIEKDPQLDLKLIDWPKLPMGESGKVGTSVRISSPHVHVTDKAIAKQKEKCGISVQPLPNLPKQSSTITIKA
                  K  LIPS+ + ASA  SRF      L+L L + P LP+ ES K                    + KE   I++ P   L KQ++ I    
Subjt:  ----------KQPLIPSESSAASAWESRFIEKDPQLDLKLIDWPKLPMGESGKVGTSVRISSPHVHVTDKAIAKQKEKCGISVQPLPNLPKQSSTITIKA

Query:  PELKHIVPSVVEDPS-KAAKTSNPTMIASHKSQP-PSPTA--SIPSLRPCPASEATLKFHSDAIQCLTRKKEITNTLSKETNVDSCPTVYTLEPKKVTSL
         +   + P V+ED   +  KTS+PT +A   ++P PS  A  SI  L+P  A EA LKF+S AIQ  T +K I NT S+  +VDS PT+YT++P  +TSL
Subjt:  PELKHIVPSVVEDPS-KAAKTSNPTMIASHKSQP-PSPTA--SIPSLRPCPASEATLKFHSDAIQCLTRKKEITNTLSKETNVDSCPTVYTLEPKKVTSL

Query:  NIALSEVQ-TTSMSNQNQYTIEIVPTMRGGDEDGVGSEVASGSES-CAKKMLCWKVHGTDNAKLMRALKDLIQLHEPSVVLIFGTKINGVDADQVVRELA
         I L E+  TT+ SNQN++ I IVPT          SE  S S S C+KKMLCW    TDNAKLMRALKDLIQLH+PS+VLIFGTKI+G DAD VVRELA
Subjt:  NIALSEVQ-TTSMSNQNQYTIEIVPTMRGGDEDGVGSEVASGSES-CAKKMLCWKVHGTDNAKLMRALKDLIQLHEPSVVLIFGTKINGVDADQVVRELA

Query:  FYGFYCSKPDGYNGGVWLLLSKQDVQTKVNSYNPQQVSASVFFHSETN
        F G YC KPDGY GG WLLLS+QDVQ +V+SY+PQQVSASV  HS+ N
Subjt:  FYGFYCSKPDGYNGGVWLLLSKQDVQTKVNSYNPQQVSASVFFHSETN

A0A6J1FU80 uncharacterized protein LOC111446932 isoform X14.8e-13649.85Show/hide
Query:  MALQSDH----QPTGAGDDEAAARNYLSRKKPKGSPTMTSSSDFESRRPTTIATVCNCKLSPSETTRITQQFNHSLIARIVGKDIRPGQLASRLRHHLRL
        MA+QS H    + TGAG+D AAA                          T  ATVCN  L+PS+T RI QQF+ SLI  +VGK I P QLA RLR +L L
Subjt:  MALQSDH----QPTGAGDDEAAARNYLSRKKPKGSPTMTSSSDFESRRPTTIATVCNCKLSPSETTRITQQFNHSLIARIVGKDIRPGQLASRLRHHLRL

Query:  TEDVKVFELGLGYFVLKFSET--DYLALEDLPWSIANLCIYAFPWTPDFKPSEAINSSINVWIRLPELSIEYYDVEILKRIADAIGGCLVKIDPVTRDRW
          D+ VFELGLG+FVLKFS     Y ALE+ PWSI +LCIY FPW P+FKPSEA    ++VWIRLPELSIEYYD E+L++IA+ IGG LVKIDPVT  R 
Subjt:  TEDVKVFELGLGYFVLKFSET--DYLALEDLPWSIANLCIYAFPWTPDFKPSEAINSSINVWIRLPELSIEYYDVEILKRIADAIGGCLVKIDPVTRDRW

Query:  KCKFARFCIRVNLCDPLPSMIELGRTRQKIEYEGFE-LCPKCSRVGDLRHDCSGLSNPSGSSGFNPHGDKPHHPRTRSFKEFGSSSSS------------
        KC +AR CIR+NL  PL    + G+  QKI YEG + LC  C  V DL+HDC  LSN S SSGF+     PHH   R  +  GSS SS            
Subjt:  KCKFARFCIRVNLCDPLPSMIELGRTRQKIEYEGFE-LCPKCSRVGDLRHDCSGLSNPSGSSGFNPHGDKPHHPRTRSFKEFGSSSSS------------

Query:  ------------KQPLIPSESSAASAWESRFIEKDPQLDLKLIDWPKLPMGESGKVGTSVRISSPHVHVTDKAIAKQKEKCGISVQPLPNLPKQSSTITI
                    K  LIPS+ + ASA  SRF      L+L L + P LP+ ES K                    + KE   I++ P   L KQ++ I  
Subjt:  ------------KQPLIPSESSAASAWESRFIEKDPQLDLKLIDWPKLPMGESGKVGTSVRISSPHVHVTDKAIAKQKEKCGISVQPLPNLPKQSSTITI

Query:  KAPELKHIVPSVVEDPS-KAAKTSNPTMIASHKSQP-PSPTA--SIPSLRPCPASEATLKFHSDAIQCLTRKKEITNTLSKETNVDSCPTVYTLEPKKVT
           +   + P V+ED   +  KTS+PT +A   ++P PS  A  SI  L+P  A EA LKF+S AIQ  T +K I NT S+  +VDS PT+YT++P  +T
Subjt:  KAPELKHIVPSVVEDPS-KAAKTSNPTMIASHKSQP-PSPTA--SIPSLRPCPASEATLKFHSDAIQCLTRKKEITNTLSKETNVDSCPTVYTLEPKKVT

Query:  SLNIALSEVQ-TTSMSNQNQYTIEIVPTMRGGDEDGVGSEVASGSES-CAKKMLCWKVHGTDNAKLMRALKDLIQLHEPSVVLIFGTKINGVDADQVVRE
        SL I L E+  TT+ SNQN++ I IVPT          SE  S S S C+KKMLCW    TDNAKLMRALKDLIQLH+PS+VLIFGTKI+G DAD VVRE
Subjt:  SLNIALSEVQ-TTSMSNQNQYTIEIVPTMRGGDEDGVGSEVASGSES-CAKKMLCWKVHGTDNAKLMRALKDLIQLHEPSVVLIFGTKINGVDADQVVRE

Query:  LAFYGFYCSKPDGYNGGVWLLLSKQDVQTKVNSYNPQQVSASVFFHSETN
        LAF G YC KPDGY GG WLLLS+QDVQ +V+SY+PQQVSASV  HS+ N
Subjt:  LAFYGFYCSKPDGYNGGVWLLLSKQDVQTKVNSYNPQQVSASVFFHSETN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G01050.1 zinc ion binding;nucleic acid binding4.5e-1730.51Show/hide
Query:  LIARIVGKDIRPGQLASRLRHHLRLTEDVKVFELGLGYFVLKFS-ETDYL-ALEDLPWSIANLCIYAFPWTPDFKPSEAINSSINVWIRLPELSIEYYDV
        +I +++G  I    L  +LR   + +  + V +L   +F+++F  E +Y+ AL   PW +    +    W+  F P      +  VW+RL  +   YY  
Subjt:  LIARIVGKDIRPGQLASRLRHHLRLTEDVKVFELGLGYFVLKFS-ETDYL-ALEDLPWSIANLCIYAFPWTPDFKPSEAINSSINVWIRLPELSIEYYDV

Query:  EILKRIADAIGGCLVKIDPVTRDRWKCKFARFCIRVNLCDPLPSMIELGRTRQKIEYEGF-ELCPKCSRVGDLRHDC
         +L  IA  +G  L K+D  T +  K +FAR CI VNL  PL   + +   R  + YEG  ++C  C   G L H C
Subjt:  EILKRIADAIGGCLVKIDPVTRDRWKCKFARFCIRVNLCDPLPSMIELGRTRQKIEYEGF-ELCPKCSRVGDLRHDC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCCTTCAATCCGATCACCAACCCACCGGCGCCGGCGACGACGAAGCTGCCGCCAGAAACTATCTCAGCAGGAAGAAGCCCAAGGGATCACCAACGATGACTTCCTC
ATCGGACTTCGAATCTCGCCGTCCAACCACCATCGCCACCGTCTGTAACTGTAAGCTCTCTCCCTCTGAAACTACTCGTATTACTCAACAGTTCAATCACTCTCTCATAG
CTCGGATCGTCGGAAAGGATATCCGTCCAGGCCAACTCGCCTCTCGTCTTCGCCATCATCTTCGTCTCACCGAAGATGTGAAGGTCTTCGAGCTTGGTCTCGGCTATTTC
GTACTCAAATTCTCTGAGACTGACTATTTAGCTCTAGAAGACCTTCCTTGGTCAATCGCCAATCTCTGTATCTATGCCTTCCCATGGACTCCCGATTTCAAACCCTCTGA
GGCCATCAATTCCTCTATTAATGTTTGGATCCGCCTCCCTGAGCTCTCCATCGAGTATTACGACGTTGAAATTCTGAAACGGATTGCGGACGCCATTGGTGGCTGTCTTG
TTAAGATCGATCCTGTAACTAGAGATCGATGGAAATGTAAGTTCGCTCGCTTTTGTATTAGGGTAAATTTGTGCGATCCTCTTCCATCGATGATCGAACTTGGTAGAACT
CGACAAAAGATTGAGTATGAGGGTTTTGAGTTGTGCCCTAAGTGTAGTCGTGTTGGTGACCTGAGACATGATTGTTCGGGTCTGAGTAATCCTTCTGGTTCTTCTGGATT
CAATCCCCATGGAGATAAACCCCATCACCCTAGAACTCGATCTTTTAAGGAATTTGGCTCCAGCTCAAGTTCAAAGCAGCCATTGATTCCTTCTGAATCTTCGGCAGCAT
CAGCTTGGGAATCTAGATTCATTGAAAAAGACCCACAGCTTGATTTGAAGTTGATTGACTGGCCTAAACTTCCAATGGGTGAATCTGGAAAAGTAGGTACAAGTGTAAGA
ATAAGTTCTCCACATGTCCATGTGACGGATAAAGCAATTGCAAAGCAAAAGGAGAAATGTGGAATCTCTGTTCAACCATTGCCCAACTTGCCAAAACAATCTTCAACAAT
AACCATAAAAGCCCCTGAGTTAAAACACATAGTCCCTTCTGTTGTTGAGGATCCGTCTAAGGCTGCAAAAACCAGCAACCCCACCATGATTGCAAGCCATAAGAGCCAAC
CACCATCACCAACAGCAAGCATTCCATCCTTACGACCATGTCCTGCTTCAGAGGCTACCCTCAAGTTCCATTCGGATGCGATCCAATGCTTGACAAGAAAGAAAGAGATA
ACCAACACACTGTCTAAAGAGACCAATGTCGATAGTTGCCCCACCGTTTACACACTTGAACCAAAGAAGGTCACAAGCCTTAACATTGCTTTGTCAGAAGTGCAAACAAC
CTCAATGTCAAACCAAAATCAATACACAATCGAGATAGTTCCAACTATGAGAGGTGGAGACGAGGATGGAGTTGGTTCGGAAGTGGCGTCAGGATCAGAATCATGTGCTA
AGAAGATGTTGTGCTGGAAGGTTCATGGGACAGACAATGCCAAGCTAATGCGAGCATTGAAGGATTTGATTCAGCTACACGAGCCATCCGTTGTGCTGATCTTTGGCACC
AAGATCAATGGTGTTGATGCAGATCAGGTCGTGCGAGAACTCGCTTTCTATGGCTTTTACTGCAGCAAGCCTGATGGCTACAATGGTGGTGTTTGGTTGTTATTGTCTAA
GCAAGATGTGCAAACTAAAGTTAACTCATACAACCCACAACAGGTTTCTGCATCAGTATTTTTTCATTCTGAAACCAATGTACCAGCATTGAAGGACGAAACTTCTTCGG
GACCGTGGGGATCAACTTTCTTCTATACTTCAACGAACTGGATGACTTCATTGGCTTACTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCCCTTCAATCCGATCACCAACCCACCGGCGCCGGCGACGACGAAGCTGCCGCCAGAAACTATCTCAGCAGGAAGAAGCCCAAGGGATCACCAACGATGACTTCCTC
ATCGGACTTCGAATCTCGCCGTCCAACCACCATCGCCACCGTCTGTAACTGTAAGCTCTCTCCCTCTGAAACTACTCGTATTACTCAACAGTTCAATCACTCTCTCATAG
CTCGGATCGTCGGAAAGGATATCCGTCCAGGCCAACTCGCCTCTCGTCTTCGCCATCATCTTCGTCTCACCGAAGATGTGAAGGTCTTCGAGCTTGGTCTCGGCTATTTC
GTACTCAAATTCTCTGAGACTGACTATTTAGCTCTAGAAGACCTTCCTTGGTCAATCGCCAATCTCTGTATCTATGCCTTCCCATGGACTCCCGATTTCAAACCCTCTGA
GGCCATCAATTCCTCTATTAATGTTTGGATCCGCCTCCCTGAGCTCTCCATCGAGTATTACGACGTTGAAATTCTGAAACGGATTGCGGACGCCATTGGTGGCTGTCTTG
TTAAGATCGATCCTGTAACTAGAGATCGATGGAAATGTAAGTTCGCTCGCTTTTGTATTAGGGTAAATTTGTGCGATCCTCTTCCATCGATGATCGAACTTGGTAGAACT
CGACAAAAGATTGAGTATGAGGGTTTTGAGTTGTGCCCTAAGTGTAGTCGTGTTGGTGACCTGAGACATGATTGTTCGGGTCTGAGTAATCCTTCTGGTTCTTCTGGATT
CAATCCCCATGGAGATAAACCCCATCACCCTAGAACTCGATCTTTTAAGGAATTTGGCTCCAGCTCAAGTTCAAAGCAGCCATTGATTCCTTCTGAATCTTCGGCAGCAT
CAGCTTGGGAATCTAGATTCATTGAAAAAGACCCACAGCTTGATTTGAAGTTGATTGACTGGCCTAAACTTCCAATGGGTGAATCTGGAAAAGTAGGTACAAGTGTAAGA
ATAAGTTCTCCACATGTCCATGTGACGGATAAAGCAATTGCAAAGCAAAAGGAGAAATGTGGAATCTCTGTTCAACCATTGCCCAACTTGCCAAAACAATCTTCAACAAT
AACCATAAAAGCCCCTGAGTTAAAACACATAGTCCCTTCTGTTGTTGAGGATCCGTCTAAGGCTGCAAAAACCAGCAACCCCACCATGATTGCAAGCCATAAGAGCCAAC
CACCATCACCAACAGCAAGCATTCCATCCTTACGACCATGTCCTGCTTCAGAGGCTACCCTCAAGTTCCATTCGGATGCGATCCAATGCTTGACAAGAAAGAAAGAGATA
ACCAACACACTGTCTAAAGAGACCAATGTCGATAGTTGCCCCACCGTTTACACACTTGAACCAAAGAAGGTCACAAGCCTTAACATTGCTTTGTCAGAAGTGCAAACAAC
CTCAATGTCAAACCAAAATCAATACACAATCGAGATAGTTCCAACTATGAGAGGTGGAGACGAGGATGGAGTTGGTTCGGAAGTGGCGTCAGGATCAGAATCATGTGCTA
AGAAGATGTTGTGCTGGAAGGTTCATGGGACAGACAATGCCAAGCTAATGCGAGCATTGAAGGATTTGATTCAGCTACACGAGCCATCCGTTGTGCTGATCTTTGGCACC
AAGATCAATGGTGTTGATGCAGATCAGGTCGTGCGAGAACTCGCTTTCTATGGCTTTTACTGCAGCAAGCCTGATGGCTACAATGGTGGTGTTTGGTTGTTATTGTCTAA
GCAAGATGTGCAAACTAAAGTTAACTCATACAACCCACAACAGGTTTCTGCATCAGTATTTTTTCATTCTGAAACCAATGTACCAGCATTGAAGGACGAAACTTCTTCGG
GACCGTGGGGATCAACTTTCTTCTATACTTCAACGAACTGGATGACTTCATTGGCTTACTGA
Protein sequenceShow/hide protein sequence
MALQSDHQPTGAGDDEAAARNYLSRKKPKGSPTMTSSSDFESRRPTTIATVCNCKLSPSETTRITQQFNHSLIARIVGKDIRPGQLASRLRHHLRLTEDVKVFELGLGYF
VLKFSETDYLALEDLPWSIANLCIYAFPWTPDFKPSEAINSSINVWIRLPELSIEYYDVEILKRIADAIGGCLVKIDPVTRDRWKCKFARFCIRVNLCDPLPSMIELGRT
RQKIEYEGFELCPKCSRVGDLRHDCSGLSNPSGSSGFNPHGDKPHHPRTRSFKEFGSSSSSKQPLIPSESSAASAWESRFIEKDPQLDLKLIDWPKLPMGESGKVGTSVR
ISSPHVHVTDKAIAKQKEKCGISVQPLPNLPKQSSTITIKAPELKHIVPSVVEDPSKAAKTSNPTMIASHKSQPPSPTASIPSLRPCPASEATLKFHSDAIQCLTRKKEI
TNTLSKETNVDSCPTVYTLEPKKVTSLNIALSEVQTTSMSNQNQYTIEIVPTMRGGDEDGVGSEVASGSESCAKKMLCWKVHGTDNAKLMRALKDLIQLHEPSVVLIFGT
KINGVDADQVVRELAFYGFYCSKPDGYNGGVWLLLSKQDVQTKVNSYNPQQVSASVFFHSETNVPALKDETSSGPWGSTFFYTSTNWMTSLAY