CuGenDBv2

HG10014270 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10014270
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1-like
Genome location: Chr02:9031711..9033192
RNA-Seq Expression: HG10014270
Synteny: HG10014270
Gene Ontology terms: GO:0046872 - metal ion binding (molecular function)
InterPro domains: IPR027806 - Harbinger transposase-derived nuclease domain
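
The genome location above follows a chromosome:start..end pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption; the page does not state its convention), the string can be split and the span length computed as below. The function names are illustrative and not part of CuGenDBv2.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("Chr02:9031711..9033192")
print(chrom, start, end, span_length(start, end))
```

Under that coordinate assumption the gene spans 1482 bp on Chr02.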


Homology
GenBank top hits (e value, % identity, alignment)
XP_004141329.1 protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 [Cucumis sativus] (e value: 1.3e-205; % identity: 77.62)
Query:  MNDSTNGSVRKRTRADEADEDDDSMGKNGGGKSLKGLVTSLLLLDEQDKCQQEEHDRVSMEAKISMEVNHRKKTKAMVDFYSEVQDYYSEVEESDRMKRK
        MNDSTNG+VRKRTRADE DEDDD MGKNGGGK LKGLVTSLLLLDEQDKC+Q+E DR+S+EAKISMEVNHRKKTKAMVDFYSEVQDYYSEVEESDRMKRK
Subjt:  MNDSTNGSVRKRTRADEADEDDDSMGKNGGGKSLKGLVTSLLLLDEQDKCQQEEHDRVSMEAKISMEVNHRKKTKAMVDFYSEVQDYYSEVEESDRMKRK

Query:  KSRLAANSVAVAAASDGLQKIEIEKSNKRGGDGG---GGGSGHHRRLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRATFDMICEELNSAIAKEDTTLR
        KSRLAANSVAVAA SDGLQKIE EKSNKRGGDGG   GGG GHHRRLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRATFDMICEELNSAIAKEDTTLR
Subjt:  KSRLAANSVAVAAASDGLQKIEIEKSNKRGGDGG---GGGSGHHRRLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRATFDMICEELNSAIAKEDTTLR

Query:  TAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLVLEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAA
        TAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLVLEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAA
Subjt:  TAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLVLEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAA

Query:  YFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLLKG--------------------------------------
        YFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLLKG                                      
Subjt:  YFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLLKG--------------------------------------

Query:  ---------------------------------------------------------LQDDEMAPEIALRSVPSMKARDAIAHNLLHHGLAGTSFL
                                                                 LQDDEMAPE+ALRSVPSMKARDAIAHNLLHHGLAGTSFL
Subjt:  ---------------------------------------------------------LQDDEMAPEIALRSVPSMKARDAIAHNLLHHGLAGTSFL

XP_008452747.1 PREDICTED: putative nuclease HARBI1 [Cucumis melo] (e value: 8.2e-205; % identity: 77.37)
Query:  MNDSTNGSVRKRTRADEADEDDDSMGKNGGGKSLKGLVTSLLLLDEQDKCQQEEHDRVSMEAKISMEVNHRKKTKAMVDFYSEVQDYYSEVEESDRMKRK
        MNDSTNG+VRKRTRADE DEDDD MGKNGGGK LKGLVTSLLLLDEQDKC+Q+E DR+S+EAKISMEVNHRKKTKAMVDFYSEVQDYYSEVEESDRMKRK
Subjt:  MNDSTNGSVRKRTRADEADEDDDSMGKNGGGKSLKGLVTSLLLLDEQDKCQQEEHDRVSMEAKISMEVNHRKKTKAMVDFYSEVQDYYSEVEESDRMKRK

Query:  KSRLAANSVAVAAASDGLQKIEIEKSNKRGGDGGG--GGSGHHRRLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRATFDMICEELNSAIAKEDTTLRT
        KSRLAANSVAVAA SDGLQ+IE EKSNKRGGDGGG  GG GHHRRLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRATFDMICEELNSAIAKEDTTLRT
Subjt:  KSRLAANSVAVAAASDGLQKIEIEKSNKRGGDGGG--GGSGHHRRLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRATFDMICEELNSAIAKEDTTLRT

Query:  AIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLVLEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAAY
        AIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLVLEVCTAIRTVLMPKHLQWPEEETLRRIKEE+ESISGIPNVVGSMYTTHIPIIAPKISVAAY
Subjt:  AIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLVLEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAAY

Query:  FNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLLKG---------------------------------------
        FNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLLKG                                       
Subjt:  FNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLLKG---------------------------------------

Query:  --------------------------------------------------------LQDDEMAPEIALRSVPSMKARDAIAHNLLHHGLAGTSFL
                                                                LQDDEMAPE+ALRSVPSMKARDAIAHNLLHHGLAGTSFL
Subjt:  --------------------------------------------------------LQDDEMAPEIALRSVPSMKARDAIAHNLLHHGLAGTSFL

XP_022138879.1 protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 [Momordica charantia] (e value: 3.3e-198; % identity: 75.35)
Query:  MNDSTNGSVRKRTRADEADEDDDSMGKNGGGKSLKGLVTSLLLLDEQDKCQQEEHDRVSMEAKISMEVNHRKKTKAMVDFYSEVQDYYSEVEESDRMKRK
        MNDS NGS  KR R +EADEDDDSMG+NG GK LKGLVTSLLLLDEQ+KC+QEEHDR SME K+S+EVNHRKKTKAMVDFYSEVQDYYSEVEESDRMKRK
Subjt:  MNDSTNGSVRKRTRADEADEDDDSMGKNGGGKSLKGLVTSLLLLDEQDKCQQEEHDRVSMEAKISMEVNHRKKTKAMVDFYSEVQDYYSEVEESDRMKRK

Query:  KSRLAANSVAVAAASDGLQKIEIEKSNKRGGDGGG--GGSGHHRRLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRATFDMICEELNSAIAKEDTTLRT
        KSRLAA+SVAVAAASDGLQKIE +KS KRGGDGGG  G  GHHRRLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRATFDMICEELNSAIAKEDTTLRT
Subjt:  KSRLAANSVAVAAASDGLQKIEIEKSNKRGGDGGG--GGSGHHRRLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRATFDMICEELNSAIAKEDTTLRT

Query:  AIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLVLEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAAY
        AIPV+QRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLVLEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAAY
Subjt:  AIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLVLEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAAY

Query:  FNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLLKG---------------------------------------
        FNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLLKG                                       
Subjt:  FNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLLKG---------------------------------------

Query:  --------------------------------------------------------LQDDEMAPEIALRSVPSMKARDAIAHNLLHHGLAGTSFL
                                                                LQDDEMAPE+ALRSV SMKARDAIAHNLLHHGLAGT+FL
Subjt:  --------------------------------------------------------LQDDEMAPEIALRSVPSMKARDAIAHNLLHHGLAGTSFL

XP_022977009.1 protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1-like [Cucurbita maxima] (e value: 1.1e-198; % identity: 75.86)
Query:  MNDSTNGSVRKRTRADEADEDDDSMGKNGGGKSLKGLVTSLLLLDEQDKCQQEEHDRVSMEAKISMEVNHRKKTKAMVDFYSEVQDYYSEVEESDRMKRK
        MNDSTNG  RKR R DEADEDD S+GKNG GK LKGLVTSLLLLDEQ+K +QEEHDR SMEAK+SMEVNHRKKTKAM DFYSE QDYYSEVEESDR+KRK
Subjt:  MNDSTNGSVRKRTRADEADEDDDSMGKNGGGKSLKGLVTSLLLLDEQDKCQQEEHDRVSMEAKISMEVNHRKKTKAMVDFYSEVQDYYSEVEESDRMKRK

Query:  KSRLAANSVAVAAASDGLQKIEIEKSNKRGGDGGGGGSGHHRRLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRATFDMICEELNSAIAKEDTTLRTAI
        KSRLAANSVAVAAASDGLQKIEI KSNKRGGDGG    GHHRRLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRATFDMICEELNSAIAKEDTTLRTAI
Subjt:  KSRLAANSVAVAAASDGLQKIEIEKSNKRGGDGGGGGSGHHRRLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRATFDMICEELNSAIAKEDTTLRTAI

Query:  PVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLVLEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAAYFN
        PVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKL+LEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAAYFN
Subjt:  PVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLVLEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAAYFN

Query:  KRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLLKG-----------------------------------------
        KRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLLKG                                         
Subjt:  KRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLLKG-----------------------------------------

Query:  ------------------------------------------------------LQDDEMAPEIALRSVPSMKARDAIAHNLLHHGLAGTSFL
                                                              LQDDEMAPE+ALRSV SMKARDAIAHNLLHHGLAGTSFL
Subjt:  ------------------------------------------------------LQDDEMAPEIALRSVPSMKARDAIAHNLLHHGLAGTSFL

XP_038900256.1 protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 [Benincasa hispida] (e value: 1.8e-207; % identity: 78.31)
Query:  MNDSTNGSVRKRTRADEADEDDDSMGKNGGGKSLKGLVTSLLLLDEQDKCQQEEHDRVSMEAKISMEVNHRKKTKAMVDFYSEVQDYYSEVEESDRMKRK
        MNDSTNGSVRKRTRADEADEDDD MGKNGGGK LKGLVTSLLLLDEQDKC+QEEHDR S+EAKISMEVNHRKKTKAMVDFYSEVQDYYSEVEES RMKRK
Subjt:  MNDSTNGSVRKRTRADEADEDDDSMGKNGGGKSLKGLVTSLLLLDEQDKCQQEEHDRVSMEAKISMEVNHRKKTKAMVDFYSEVQDYYSEVEESDRMKRK

Query:  KSRLAANSVAVAAASDGLQKIEIEKSNKRGGD-----GGGGGSGHHRRLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRATFDMICEELNSAIAKEDTT
        KSRLAANSVAVAAASDGLQKIEIEKSNKRGGD     GGGGG+GHHRRLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRATFDMICEELNSAIAKEDTT
Subjt:  KSRLAANSVAVAAASDGLQKIEIEKSNKRGGD-----GGGGGSGHHRRLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRATFDMICEELNSAIAKEDTT

Query:  LRTAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLVLEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISV
        LRTAIPV+QRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLVLEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISV
Subjt:  LRTAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLVLEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISV

Query:  AAYFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLLKG------------------------------------
        AAYFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLLKG                                    
Subjt:  AAYFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLLKG------------------------------------

Query:  -----------------------------------------------------------LQDDEMAPEIALRSVPSMKARDAIAHNLLHHGLAGTSFL
                                                                   LQDDEMAPEIALRSVPSMKARDAIAHNLLHHGLAGTSFL
Subjt:  -----------------------------------------------------------LQDDEMAPEIALRSVPSMKARDAIAHNLLHHGLAGTSFL

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L420 DDE Tnp4 domain-containing protein (e value: 6.1e-206; % identity: 77.62)
Query:  MNDSTNGSVRKRTRADEADEDDDSMGKNGGGKSLKGLVTSLLLLDEQDKCQQEEHDRVSMEAKISMEVNHRKKTKAMVDFYSEVQDYYSEVEESDRMKRK
        MNDSTNG+VRKRTRADE DEDDD MGKNGGGK LKGLVTSLLLLDEQDKC+Q+E DR+S+EAKISMEVNHRKKTKAMVDFYSEVQDYYSEVEESDRMKRK
Subjt:  MNDSTNGSVRKRTRADEADEDDDSMGKNGGGKSLKGLVTSLLLLDEQDKCQQEEHDRVSMEAKISMEVNHRKKTKAMVDFYSEVQDYYSEVEESDRMKRK

Query:  KSRLAANSVAVAAASDGLQKIEIEKSNKRGGDGG---GGGSGHHRRLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRATFDMICEELNSAIAKEDTTLR
        KSRLAANSVAVAA SDGLQKIE EKSNKRGGDGG   GGG GHHRRLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRATFDMICEELNSAIAKEDTTLR
Subjt:  KSRLAANSVAVAAASDGLQKIEIEKSNKRGGDGG---GGGSGHHRRLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRATFDMICEELNSAIAKEDTTLR

Query:  TAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLVLEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAA
        TAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLVLEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAA
Subjt:  TAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLVLEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAA

Query:  YFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLLKG--------------------------------------
        YFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLLKG                                      
Subjt:  YFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLLKG--------------------------------------

Query:  ---------------------------------------------------------LQDDEMAPEIALRSVPSMKARDAIAHNLLHHGLAGTSFL
                                                                 LQDDEMAPE+ALRSVPSMKARDAIAHNLLHHGLAGTSFL
Subjt:  ---------------------------------------------------------LQDDEMAPEIALRSVPSMKARDAIAHNLLHHGLAGTSFL

A0A1S3BVR8 putative nuclease HARBI1 (e value: 4.0e-205; % identity: 77.37)
Query:  MNDSTNGSVRKRTRADEADEDDDSMGKNGGGKSLKGLVTSLLLLDEQDKCQQEEHDRVSMEAKISMEVNHRKKTKAMVDFYSEVQDYYSEVEESDRMKRK
        MNDSTNG+VRKRTRADE DEDDD MGKNGGGK LKGLVTSLLLLDEQDKC+Q+E DR+S+EAKISMEVNHRKKTKAMVDFYSEVQDYYSEVEESDRMKRK
Subjt:  MNDSTNGSVRKRTRADEADEDDDSMGKNGGGKSLKGLVTSLLLLDEQDKCQQEEHDRVSMEAKISMEVNHRKKTKAMVDFYSEVQDYYSEVEESDRMKRK

Query:  KSRLAANSVAVAAASDGLQKIEIEKSNKRGGDGGG--GGSGHHRRLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRATFDMICEELNSAIAKEDTTLRT
        KSRLAANSVAVAA SDGLQ+IE EKSNKRGGDGGG  GG GHHRRLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRATFDMICEELNSAIAKEDTTLRT
Subjt:  KSRLAANSVAVAAASDGLQKIEIEKSNKRGGDGGG--GGSGHHRRLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRATFDMICEELNSAIAKEDTTLRT

Query:  AIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLVLEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAAY
        AIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLVLEVCTAIRTVLMPKHLQWPEEETLRRIKEE+ESISGIPNVVGSMYTTHIPIIAPKISVAAY
Subjt:  AIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLVLEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAAY

Query:  FNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLLKG---------------------------------------
        FNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLLKG                                       
Subjt:  FNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLLKG---------------------------------------

Query:  --------------------------------------------------------LQDDEMAPEIALRSVPSMKARDAIAHNLLHHGLAGTSFL
                                                                LQDDEMAPE+ALRSVPSMKARDAIAHNLLHHGLAGTSFL
Subjt:  --------------------------------------------------------LQDDEMAPEIALRSVPSMKARDAIAHNLLHHGLAGTSFL

A0A5D3D937 Putative nuclease HARBI1 (e value: 4.0e-205; % identity: 77.37)
Query:  MNDSTNGSVRKRTRADEADEDDDSMGKNGGGKSLKGLVTSLLLLDEQDKCQQEEHDRVSMEAKISMEVNHRKKTKAMVDFYSEVQDYYSEVEESDRMKRK
        MNDSTNG+VRKRTRADE DEDDD MGKNGGGK LKGLVTSLLLLDEQDKC+Q+E DR+S+EAKISMEVNHRKKTKAMVDFYSEVQDYYSEVEESDRMKRK
Subjt:  MNDSTNGSVRKRTRADEADEDDDSMGKNGGGKSLKGLVTSLLLLDEQDKCQQEEHDRVSMEAKISMEVNHRKKTKAMVDFYSEVQDYYSEVEESDRMKRK

Query:  KSRLAANSVAVAAASDGLQKIEIEKSNKRGGDGGG--GGSGHHRRLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRATFDMICEELNSAIAKEDTTLRT
        KSRLAANSVAVAA SDGLQ+IE EKSNKRGGDGGG  GG GHHRRLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRATFDMICEELNSAIAKEDTTLRT
Subjt:  KSRLAANSVAVAAASDGLQKIEIEKSNKRGGDGGG--GGSGHHRRLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRATFDMICEELNSAIAKEDTTLRT

Query:  AIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLVLEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAAY
        AIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLVLEVCTAIRTVLMPKHLQWPEEETLRRIKEE+ESISGIPNVVGSMYTTHIPIIAPKISVAAY
Subjt:  AIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLVLEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAAY

Query:  FNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLLKG---------------------------------------
        FNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLLKG                                       
Subjt:  FNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLLKG---------------------------------------

Query:  --------------------------------------------------------LQDDEMAPEIALRSVPSMKARDAIAHNLLHHGLAGTSFL
                                                                LQDDEMAPE+ALRSVPSMKARDAIAHNLLHHGLAGTSFL
Subjt:  --------------------------------------------------------LQDDEMAPEIALRSVPSMKARDAIAHNLLHHGLAGTSFL

A0A6J1CBB8 protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 (e value: 1.6e-198; % identity: 75.35)
Query:  MNDSTNGSVRKRTRADEADEDDDSMGKNGGGKSLKGLVTSLLLLDEQDKCQQEEHDRVSMEAKISMEVNHRKKTKAMVDFYSEVQDYYSEVEESDRMKRK
        MNDS NGS  KR R +EADEDDDSMG+NG GK LKGLVTSLLLLDEQ+KC+QEEHDR SME K+S+EVNHRKKTKAMVDFYSEVQDYYSEVEESDRMKRK
Subjt:  MNDSTNGSVRKRTRADEADEDDDSMGKNGGGKSLKGLVTSLLLLDEQDKCQQEEHDRVSMEAKISMEVNHRKKTKAMVDFYSEVQDYYSEVEESDRMKRK

Query:  KSRLAANSVAVAAASDGLQKIEIEKSNKRGGDGGG--GGSGHHRRLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRATFDMICEELNSAIAKEDTTLRT
        KSRLAA+SVAVAAASDGLQKIE +KS KRGGDGGG  G  GHHRRLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRATFDMICEELNSAIAKEDTTLRT
Subjt:  KSRLAANSVAVAAASDGLQKIEIEKSNKRGGDGGG--GGSGHHRRLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRATFDMICEELNSAIAKEDTTLRT

Query:  AIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLVLEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAAY
        AIPV+QRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLVLEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAAY
Subjt:  AIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLVLEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAAY

Query:  FNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLLKG---------------------------------------
        FNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLLKG                                       
Subjt:  FNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLLKG---------------------------------------

Query:  --------------------------------------------------------LQDDEMAPEIALRSVPSMKARDAIAHNLLHHGLAGTSFL
                                                                LQDDEMAPE+ALRSV SMKARDAIAHNLLHHGLAGT+FL
Subjt:  --------------------------------------------------------LQDDEMAPEIALRSVPSMKARDAIAHNLLHHGLAGTSFL

A0A6J1IQ93 protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1-like (e value: 5.5e-199; % identity: 75.86)
Query:  MNDSTNGSVRKRTRADEADEDDDSMGKNGGGKSLKGLVTSLLLLDEQDKCQQEEHDRVSMEAKISMEVNHRKKTKAMVDFYSEVQDYYSEVEESDRMKRK
        MNDSTNG  RKR R DEADEDD S+GKNG GK LKGLVTSLLLLDEQ+K +QEEHDR SMEAK+SMEVNHRKKTKAM DFYSE QDYYSEVEESDR+KRK
Subjt:  MNDSTNGSVRKRTRADEADEDDDSMGKNGGGKSLKGLVTSLLLLDEQDKCQQEEHDRVSMEAKISMEVNHRKKTKAMVDFYSEVQDYYSEVEESDRMKRK

Query:  KSRLAANSVAVAAASDGLQKIEIEKSNKRGGDGGGGGSGHHRRLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRATFDMICEELNSAIAKEDTTLRTAI
        KSRLAANSVAVAAASDGLQKIEI KSNKRGGDGG    GHHRRLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRATFDMICEELNSAIAKEDTTLRTAI
Subjt:  KSRLAANSVAVAAASDGLQKIEIEKSNKRGGDGGGGGSGHHRRLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRATFDMICEELNSAIAKEDTTLRTAI

Query:  PVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLVLEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAAYFN
        PVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKL+LEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAAYFN
Subjt:  PVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLVLEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAAYFN

Query:  KRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLLKG-----------------------------------------
        KRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLLKG                                         
Subjt:  KRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLLKG-----------------------------------------

Query:  ------------------------------------------------------LQDDEMAPEIALRSVPSMKARDAIAHNLLHHGLAGTSFL
                                                              LQDDEMAPE+ALRSV SMKARDAIAHNLLHHGLAGTSFL
Subjt:  ------------------------------------------------------LQDDEMAPEIALRSVPSMKARDAIAHNLLHHGLAGTSFL

SwissProt top hits (e value, % identity, alignment)
B0BN95 Putative nuclease HARBI1 (e value: 4.6e-09; % identity: 23.41)
Query:  DECNSPDYPDEEFKKQFRMGRATFDMICEELNSAIAKEDTTLRTAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLVLEVCTAIRTVLMPKHLQ
        D     D  DE     +   R     + E L +++++  T    AI  + ++   L    +G     +    G+  ++  + V  V  A+      + + 
Subjt:  DECNSPDYPDEEFKKQFRMGRATFDMICEELNSAIAKEDTTLRTAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLVLEVCTAIRTVLMPKHLQ

Query:  WP-EEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAAYFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRAN
        +P +E  ++ +K+E+  ++G+P V+G++   H+ I AP     +Y N+       K  +S+    V D RG    V   WPGS+ D  VL++S+L  +  
Subjt:  WP-EEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAAYFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRAN

Query:  GGLLK
         G+ K
Subjt:  GGLLK

Q17QR8 Putative nuclease HARBI1 (e value: 3.5e-09; % identity: 23.9)
Query:  DECNSPDYPDEEFKKQFRMGRATFDMICEELNSAIAKEDTTLRTAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLVLEVCTAIRTVLMPKHLQ
        D     D  DE     +   R     + E L +++++  T    AI  + ++   L    +G     +    G+  ++  + V  V  A+      + + 
Subjt:  DECNSPDYPDEEFKKQFRMGRATFDMICEELNSAIAKEDTTLRTAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLVLEVCTAIRTVLMPKHLQ

Query:  WP-EEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAAYFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRAN
        +P +E +++ +K+E+  ++GIP V+G +   H+ I AP     +Y N+       K  +S+    V D RG    V   WPGS+ D  VL++S+L  +  
Subjt:  WP-EEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAAYFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRAN

Query:  GGLLK
         G+ K
Subjt:  GGLLK

Q94K49 Protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 (e value: 3.1e-21; % identity: 30.67)
Query:  WWD----ECNSPDYPDEE---FKKQFRMGRATFDMICEELNSAIAKEDTTLR----------TAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHK
        WWD      +SP  P +E   FK  FR  + TF  IC     ++ +ED   R            + V+++VA+ L RLA+GD    V   FG+G ST  +
Subjt:  WWD----ECNSPDYPDEE---FKKQFRMGRATFDMICEELNSAIAKEDTTLR----------TAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHK

Query:  LVLEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAAYFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPG
        +      A+       HL+WP+ + +  IK ++E + G+PN  G++ TTHI +  P +  +  +       +Q+ +YS+ +QGV D    F ++  GWPG
Subjt:  LVLEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAAYFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPG

Query:  SMPDDQVLEKSALFQRA-NGGLLKG
         M   ++L+ S  F+   N  +L G
Subjt:  SMPDDQVLEKSALFQRA-NGGLLKG

Q96MB7 Putative nuclease HARBI1 (e value: 1.0e-08; % identity: 23.9)
Query:  DECNSPDYPDEEFKKQFRMGRATFDMICEELNSAIAKEDTTLRTAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLVLEVCTAIRTVLMPKHLQ
        D     D  DE     +   R     + E L + +++  T    AI  + +V   L    +G     +    G+  ++  + V  V  A+      + ++
Subjt:  DECNSPDYPDEEFKKQFRMGRATFDMICEELNSAIAKEDTTLRTAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLVLEVCTAIRTVLMPKHLQ

Query:  WP-EEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAAYFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRAN
        +P +E +++ +K+E+  ++G+P V+G +   H+ I AP     +Y N+       K  +S+    V D RG    V   WPGS+ D  VL++S+L  +  
Subjt:  WP-EEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAAYFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRAN

Query:  GGLLK
         G+ K
Subjt:  GGLLK

Q9M2U3 Protein ALP1-like (e value: 7.3e-23; % identity: 32.73)
Query:  RSKAWWDECNSPDY----PDEEFKKQFRMGRATFDMICEELNSAIAKEDTTLRTA---------IPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHK
        +S  WWD  +   Y      + F+  F++ R TFD IC     ++ K D T + A         + +  RVAV L RL +G+ L V+ + FG+  ST  +
Subjt:  RSKAWWDECNSPDY----PDEEFKKQFRMGRATFDMICEELNSAIAKEDTTLRTA---------IPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHK

Query:  LVLEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAAYFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPG
        +      ++    +  HL WP +  L  IK ++E ISG+PN  G++  THI +  P +  +   NK   +  +  ++S+T+Q VVDP   F DV  GWPG
Subjt:  LVLEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAAYFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPG

Query:  SMPDDQVLEKSALFQRANGG
        S+ DD VL+ S  ++    G
Subjt:  SMPDDQVLEKSALFQRANGG

Arabidopsis top hits (e value, % identity, alignment)
AT3G55350.1 PIF / Ping-Pong family of plant transposases (e value: 5.2e-24; % identity: 32.73)
Query:  RSKAWWDECNSPDY----PDEEFKKQFRMGRATFDMICEELNSAIAKEDTTLRTA---------IPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHK
        +S  WWD  +   Y      + F+  F++ R TFD IC     ++ K D T + A         + +  RVAV L RL +G+ L V+ + FG+  ST  +
Subjt:  RSKAWWDECNSPDY----PDEEFKKQFRMGRATFDMICEELNSAIAKEDTTLRTA---------IPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHK

Query:  LVLEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAAYFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPG
        +      ++    +  HL WP +  L  IK ++E ISG+PN  G++  THI +  P +  +   NK   +  +  ++S+T+Q VVDP   F DV  GWPG
Subjt:  LVLEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAAYFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPG

Query:  SMPDDQVLEKSALFQRANGG
        S+ DD VL+ S  ++    G
Subjt:  SMPDDQVLEKSALFQRANGG

AT3G63270.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912) (e value: 2.2e-22; % identity: 30.67)
Query:  WWD----ECNSPDYPDEE---FKKQFRMGRATFDMICEELNSAIAKEDTTLR----------TAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHK
        WWD      +SP  P +E   FK  FR  + TF  IC     ++ +ED   R            + V+++VA+ L RLA+GD    V   FG+G ST  +
Subjt:  WWD----ECNSPDYPDEE---FKKQFRMGRATFDMICEELNSAIAKEDTTLR----------TAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHK

Query:  LVLEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAAYFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPG
        +      A+       HL+WP+ + +  IK ++E + G+PN  G++ TTHI +  P +  +  +       +Q+ +YS+ +QGV D    F ++  GWPG
Subjt:  LVLEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAAYFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPG

Query:  SMPDDQVLEKSALFQRA-NGGLLKG
         M   ++L+ S  F+   N  +L G
Subjt:  SMPDDQVLEKSALFQRA-NGGLLKG

AT4G29780.1 unknown protein (e value: 6.2e-102; % identity: 51.48)
Query:  NDSTNGSVRKRTRADEADEDDDSMGKNGGGKSLKG------------LVTSLLLLDEQDKCQQEEHDRVSMEAKISMEVNHRKKTKAMVDFYSEVQDYYS
        ++  N + +KR R     +DD+  G  GGG  + G            ++ +LLLLDE+ K QQE+ D   ++ K  +E NH+KK K M  +Y+++QD+YS
Subjt:  NDSTNGSVRKRTRADEADEDDDSMGKNGGGKSLKG------------LVTSLLLLDEQDKCQQEEHDRVSMEAKISMEVNHRKKTKAMVDFYSEVQDYYS

Query:  EVEESDRMKRKKSRLAANSVAVAAASDGLQKIEIEKSNKRGGDGGGGGSG-HHRRLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRATFDMICEELNSA
           E+D  + K++R  A +  V+A + G     +           G GSG  HRRLWVK+R+  WWD  + PD+P++EF+++FRM ++TF++ICEEL++ 
Subjt:  EVEESDRMKRKKSRLAANSVAVAAASDGLQKIEIEKSNKRGGDGGGGGSG-HHRRLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRATFDMICEELNSA

Query:  IAKEDTTLRTAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLVLEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPI
        + K++T LR AIP  +RV VC+WRLATG PLR VS++FGLGISTCHKLV+EVC AI  VLMPK+L WP +  +   K ++ES+  IPNVVGS+YTTHIPI
Subjt:  IAKEDTTLRTAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLVLEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPI

Query:  IAPKISVAAYFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALF-QRANGGLLK
        IAPK+ VAAYFNKRHTERNQKTSYSITVQGVV+  G+FTDVCIG PGS+ DDQ+LEKS+L  QRA  G+L+
Subjt:  IAPKISVAAYFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALF-QRANGGLLK

AT5G12010.1 unknown protein (e value: 1.2e-132; % identity: 54.35)
Query:  KNGGGKSLKGLVTSLLLLDEQDKCQQEEHDRVSMEAKISMEVNHRKKTKAMVDFYSEVQDYYSEVEESDRMKRKKSRL--AANSVAVAAASDGLQKIEIE
        +N   K+LKG  TSLLL++E +K  QE  +  S       + N+RK+ + M D+YS++ DYY++ EES  +  KKSR+  A  SVAVAAAS+    IE E
Subjt:  KNGGGKSLKGLVTSLLLLDEQDKCQQEEHDRVSMEAKISMEVNHRKKTKAMVDFYSEVQDYYSEVEESDRMKRKKSRL--AANSVAVAAASDGLQKIEIE

Query:  KSNKRG-GDGGGGGSGHHRRLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRATFDMICEELNSAIAKEDTTLRTAIPVQQRVAVCLWRLATGDPLRVVS
         S   G G   G GSG  RRLWVKDRS+AWW+EC+  DYP+E+FKK FRM ++TF++IC+ELNSA+AKEDT LR AIPV+QRVAVC+WRLATG+PLR+VS
Subjt:  KSNKRG-GDGGGGGSGHHRRLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRATFDMICEELNSAIAKEDTTLRTAIPVQQRVAVCLWRLATGDPLRVVS

Query:  KKFGLGISTCHKLVLEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAAYFNKRHTERNQKTSYSITVQGVVDPR
        KKFGLGISTCHKLVLEVC AI+ VLMPK+LQWP++E+LR I+E +ES+SGIPNVVGSMYTTHIPIIAPKISVA+YFNKRHTERNQKTSYSIT+Q VV+P+
Subjt:  KKFGLGISTCHKLVLEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAAYFNKRHTERNQKTSYSITVQGVVDPR

Query:  GVFTDVCIGWPGSMPDDQVLEKSALFQRA-NGGLLKGLQ-------------------------------------------------------------
        GVFTD+CIGWPGSMPDD+VLEKS L+QRA NGGLLKG+                                                              
Subjt:  GVFTDVCIGWPGSMPDDQVLEKSALFQRA-NGGLLKGLQ-------------------------------------------------------------

Query:  ----------------------------------DDEMAPEIALRSVPSMKARDAIAHNLLHHGLAGTSFL
                                          DDE+ PE  LRSV +MKARD I+HNLLHHGLAGTSFL
Subjt:  ----------------------------------DDEMAPEIALRSVPSMKARDAIAHNLLHHGLAGTSFL


Sequences
CDS sequence
ATGAATGATTCCACCAACGGTAGCGTCAGGAAGAGGACCAGAGCGGATGAAGCCGATGAAGACGACGATTCAATGGGAAAAAATGGTGGAGGTAAGAGTTTGAAAGGATT
GGTTACTTCTCTGTTGCTGTTGGATGAACAGGACAAGTGTCAGCAGGAAGAACATGACAGAGTTTCCATGGAGGCGAAGATTTCGATGGAGGTGAATCACAGGAAGAAGA
CGAAAGCTATGGTCGATTTCTACTCCGAAGTTCAAGATTACTATTCTGAAGTCGAGGAATCAGACCGAATGAAACGGAAAAAATCGCGATTAGCAGCTAACTCTGTTGCG
GTTGCGGCCGCTTCCGATGGATTACAGAAGATCGAAATCGAAAAATCAAACAAACGCGGCGGCGATGGCGGCGGTGGTGGTAGCGGTCATCACCGGAGACTCTGGGTGAA
AGATAGGTCAAAAGCCTGGTGGGATGAATGTAACAGTCCCGATTATCCCGATGAAGAATTCAAGAAGCAATTCAGAATGGGTAGGGCAACTTTCGATATGATTTGTGAAG
AACTTAATTCCGCGATAGCTAAAGAAGACACAACTCTTCGAACCGCCATTCCCGTCCAGCAAAGGGTCGCTGTTTGCTTATGGAGATTAGCCACCGGCGATCCTCTTCGA
GTCGTATCCAAGAAATTCGGATTAGGTATTTCAACTTGCCATAAACTAGTTCTCGAGGTCTGCACCGCCATTAGAACAGTACTAATGCCGAAGCATCTCCAATGGCCAGA
AGAAGAAACGCTCAGAAGAATCAAAGAGGAATACGAATCAATTTCCGGAATCCCTAATGTCGTCGGTTCAATGTACACCACACACATTCCGATCATCGCCCCTAAAATCA
GCGTAGCAGCTTATTTCAACAAGCGTCACACAGAGAGAAATCAAAAAACATCATACTCAATTACAGTTCAAGGAGTCGTGGATCCAAGAGGAGTCTTCACCGACGTTTGC
ATCGGTTGGCCGGGATCAATGCCGGACGATCAAGTTCTAGAGAAATCGGCTCTGTTTCAAAGGGCAAATGGAGGATTATTGAAAGGGCTTCAAGATGATGAAATGGCGCC
TGAAATTGCTTTGAGGTCAGTGCCTTCCATGAAAGCCAGAGATGCCATTGCTCATAATCTGCTGCACCATGGCCTTGCTGGGACTTCTTTTCTTTAA
mRNA sequence
ATGAATGATTCCACCAACGGTAGCGTCAGGAAGAGGACCAGAGCGGATGAAGCCGATGAAGACGACGATTCAATGGGAAAAAATGGTGGAGGTAAGAGTTTGAAAGGATT
GGTTACTTCTCTGTTGCTGTTGGATGAACAGGACAAGTGTCAGCAGGAAGAACATGACAGAGTTTCCATGGAGGCGAAGATTTCGATGGAGGTGAATCACAGGAAGAAGA
CGAAAGCTATGGTCGATTTCTACTCCGAAGTTCAAGATTACTATTCTGAAGTCGAGGAATCAGACCGAATGAAACGGAAAAAATCGCGATTAGCAGCTAACTCTGTTGCG
GTTGCGGCCGCTTCCGATGGATTACAGAAGATCGAAATCGAAAAATCAAACAAACGCGGCGGCGATGGCGGCGGTGGTGGTAGCGGTCATCACCGGAGACTCTGGGTGAA
AGATAGGTCAAAAGCCTGGTGGGATGAATGTAACAGTCCCGATTATCCCGATGAAGAATTCAAGAAGCAATTCAGAATGGGTAGGGCAACTTTCGATATGATTTGTGAAG
AACTTAATTCCGCGATAGCTAAAGAAGACACAACTCTTCGAACCGCCATTCCCGTCCAGCAAAGGGTCGCTGTTTGCTTATGGAGATTAGCCACCGGCGATCCTCTTCGA
GTCGTATCCAAGAAATTCGGATTAGGTATTTCAACTTGCCATAAACTAGTTCTCGAGGTCTGCACCGCCATTAGAACAGTACTAATGCCGAAGCATCTCCAATGGCCAGA
AGAAGAAACGCTCAGAAGAATCAAAGAGGAATACGAATCAATTTCCGGAATCCCTAATGTCGTCGGTTCAATGTACACCACACACATTCCGATCATCGCCCCTAAAATCA
GCGTAGCAGCTTATTTCAACAAGCGTCACACAGAGAGAAATCAAAAAACATCATACTCAATTACAGTTCAAGGAGTCGTGGATCCAAGAGGAGTCTTCACCGACGTTTGC
ATCGGTTGGCCGGGATCAATGCCGGACGATCAAGTTCTAGAGAAATCGGCTCTGTTTCAAAGGGCAAATGGAGGATTATTGAAAGGGCTTCAAGATGATGAAATGGCGCC
TGAAATTGCTTTGAGGTCAGTGCCTTCCATGAAAGCCAGAGATGCCATTGCTCATAATCTGCTGCACCATGGCCTTGCTGGGACTTCTTTTCTTTAA
Protein sequence
MNDSTNGSVRKRTRADEADEDDDSMGKNGGGKSLKGLVTSLLLLDEQDKCQQEEHDRVSMEAKISMEVNHRKKTKAMVDFYSEVQDYYSEVEESDRMKRKKSRLAANSVA
VAAASDGLQKIEIEKSNKRGGDGGGGGSGHHRRLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRATFDMICEELNSAIAKEDTTLRTAIPVQQRVAVCLWRLATGDPLR
VVSKKFGLGISTCHKLVLEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAAYFNKRHTERNQKTSYSITVQGVVDPRGVFTDVC
IGWPGSMPDDQVLEKSALFQRANGGLLKGLQDDEMAPEIALRSVPSMKARDAIAHNLLHHGLAGTSFL
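
As a quick consistency check, the start and end of the CDS above can be translated and compared with the corresponding ends of the protein sequence. The sketch below assumes the standard nuclear genetic code; the `translate` helper is hypothetical and written only for illustration.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, stopping at a stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the open reading frame
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 15 nt of the CDS listed above.
print(translate("ATGAATGATTCCACCAACGGTAGCGTCAGG"))  # MNDSTNGSVR
print(translate("ACTTCTTTTCTTTAA"))                 # TSFL
```

Both fragments agree with the first ten and last four residues of the protein sequence shown on this page.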