CuGenDBv2

Lag0031533 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0031533
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: WD repeat-containing protein 43
Genome location: chr11:9675190..9685980
RNA-Seq Expression: Lag0031533
Synteny: Lag0031533
Gene Ontology terms:
  GO:0005730 - nucleolus (cellular component)
  GO:0003677 - DNA binding (molecular function)
  GO:0046983 - protein dimerization activity (molecular function)
InterPro domains:
  IPR001680 - WD40 repeat
  IPR007148 - Small-subunit processome, Utp12
  IPR008906 - HAT, C-terminal dimerisation domain
  IPR012337 - Ribonuclease H-like superfamily
  IPR015943 - WD40/YVTN repeat-like-containing domain superfamily
  IPR025525 - hAT-like transposase, RNase-H fold
  IPR036322 - WD40-repeat-containing domain superfamily


Homology
GenBank top hits | e value | % identity | Alignment
KAG6593991.1 WD repeat-containing protein 43, partial [Cucurbita argyrosperma subsp. sororia] | 1.1e-307 | 88.08
Query:  MGLSNIRDLLTSFSPDLHFFAISSGDGRIKIWDALKGQIQTEFADFFASDSTSILTKPDKGHLSIDYKCMKWFSLEKKRKRKRQCSLLLLGTGSGDVLAL
        M  SNIRDLLTSFSPDLH FAISSGDGRIKIWD LKGQIQTEFADFF +DSTSILTK  KGHLS+DYKCMKW SLEKKRKRKRQCSLLLLGTGSGDV+AL
Subjt:  MGLSNIRDLLTSFSPDLHFFAISSGDGRIKIWDALKGQIQTEFADFFASDSTSILTKPDKGHLSIDYKCMKWFSLEKKRKRKRQCSLLLLGTGSGDVLAL

Query:  DVAAGELKWKISDCHPGGVTTISFPTHGSSIYTAGADGMLCEIDSLTGNLLRKFKASTKAISCISVSPDGKIIATAASQMKIFNCSNHKKIQKFSGHPGA
        DVAAGELKWKISDCHPGGV TISFPTHGS IYTAGADGMLCEIDS+TG+LLRKFKASTKAISCISVSPDGK IATAASQMKIFNCS+HKKIQKFSGHPGA
Subjt:  DVAAGELKWKISDCHPGGVTTISFPTHGSSIYTAGADGMLCEIDSLTGNLLRKFKASTKAISCISVSPDGKIIATAASQMKIFNCSNHKKIQKFSGHPGA

Query:  VRCMVFTEDGKYILSSAVGERYIVVWSLGGGKKQSASCVLALEHPAIFVDSRLSKVDGDDETALYILAISEIGVCYLWYGQNLEELRSAKPTKILISDND
        VRCMVFTEDGKYILSS VGERYIVVWSLGG KKQSASCVLA+EHPAIFVDSR S   G DETALY+LAISEIGVCYLWYGQNLEELRS KPTK+LIS+ND
Subjt:  VRCMVFTEDGKYILSSAVGERYIVVWSLGGGKKQSASCVLALEHPAIFVDSRLSKVDGDDETALYILAISEIGVCYLWYGQNLEELRSAKPTKILISDND

Query:  ISSKSKKRAIPAIYAAKLQGIPKSGSGQVFLAHGLLVKPSFQNVVVHSGTDINLNSSNDGILLPSQSTAKSKKGLDVQGGVVALDRANAEDALRPIPKVF
        ISSKSKKR  P IYAA+ QGIPKSGSGQVFLAHGLLVKPSFQNVVVHSGTDI LNSSNDGILLPS S  KSKKGLDVQGGVVALDRANAEDALRPIPKVF
Subjt:  ISSKSKKRAIPAIYAAKLQGIPKSGSGQVFLAHGLLVKPSFQNVVVHSGTDINLNSSNDGILLPSQSTAKSKKGLDVQGGVVALDRANAEDALRPIPKVF

Query:  DSQEKSTLYHGLQVDRNDVMIQLVDSGSQVEDDVGVEDSGAVCMEDQLRSLGILNSTDDHTSESILESSILKGIDLEANMSQKKLREAVLSLAPGAACKL
        DSQEK TLYH L VDR+DVM +LVDSGSQVED VGVEDS AV +ED+LRSLGIL+STDDHTSESIL+S+I KGIDLEANMSQKKLREAVLSLAPG ACKL
Subjt:  DSQEKSTLYHGLQVDRNDVMIQLVDSGSQVEDDVGVEDSGAVCMEDQLRSLGILNSTDDHTSESILESSILKGIDLEANMSQKKLREAVLSLAPGAACKL

Query:  LENLVSIWQSRLCSGKNVLPWIYSLLLNHSQHILSQEPSTQILDSLFKMTKSKETAVQSLLQLSGRLQLVLAQVERASANKTDQMIRYDHKIDGSESDDD
        L NLVSIWQSRL SGKNVLPWI SLLLNH+QHILSQE STQILDSLFK+TKSKETAVQ LLQLSGRLQLVLAQ+ERASANKTDQ +R+  +IDG ES +D
Subjt:  LENLVSIWQSRLCSGKNVLPWIYSLLLNHSQHILSQEPSTQILDSLFKMTKSKETAVQSLLQLSGRLQLVLAQVERASANKTDQMIRYDHKIDGSESDDD

Query:  EDEDEDEEVDDVLYGEEEYESELSSDDEN
        ED   DEEVDDVLYGEEEYESELSSD+EN
Subjt:  EDEDEDEEVDDVLYGEEEYESELSSDDEN

TYK02930.1 WD repeat-containing protein 43 [Cucumis melo var. makuwa] | 7.8e-309 | 88.71
Query:  MGLSNIRDLLTSFSPDLHFFAISSGDGRIKIWDALKGQIQTEFADFFASDSTSILTKPDKGHLSIDYKCMKWFSLEKKRKRKRQCSLLLLGTGSGDVLAL
        MG SNIRDLLTSFSPDLHFFAISSGDGRIKIWD LKGQIQTEFADFF SDSTSILTKP+KGHLSIDYKCMKW SLEKKRKRKRQC LLLLGTGSGDVLAL
Subjt:  MGLSNIRDLLTSFSPDLHFFAISSGDGRIKIWDALKGQIQTEFADFFASDSTSILTKPDKGHLSIDYKCMKWFSLEKKRKRKRQCSLLLLGTGSGDVLAL

Query:  DVAAGELKWKISDCHPGGVTTISFPTHGSSIYTAGADGMLCEIDSLTGNLLRKFKASTKAISCISVSPDGKIIATAASQMKIFNCSNHKKIQKFSGHPGA
        DVAAGELKWKISDCHPGGV +ISFPTHGS IYTAGADGMLCEI+SLTGN+LRKFKASTKAISCISVSPDGKIIATAASQMKIFNCSNHKKIQKFSGHPGA
Subjt:  DVAAGELKWKISDCHPGGVTTISFPTHGSSIYTAGADGMLCEIDSLTGNLLRKFKASTKAISCISVSPDGKIIATAASQMKIFNCSNHKKIQKFSGHPGA

Query:  VRCMVFTEDGKYILSSAVGERYIVVWSLGGGKKQSASCVLALEHPAIFVDSRLSKVDGDDETALYILAISEIGVCYLWYGQNLEELRSAKPTKILISDND
        VRCMVFTEDG+YILSSAVGERYIVVWS+ GGKKQSASCVLA+EHPAIFVDSR S  DG DE+ALY+LAISEIGVCYLWYGQNLEELRSAKPTK+LISDND
Subjt:  VRCMVFTEDGKYILSSAVGERYIVVWSLGGGKKQSASCVLALEHPAIFVDSRLSKVDGDDETALYILAISEIGVCYLWYGQNLEELRSAKPTKILISDND

Query:  ISSKSKKRAIPAIYAAKLQGIPKSGSGQVFLAHGLLVKPSFQNVVVHSGTDINLNSSNDGILLPSQSTAKSKKGLDVQGGVVALDRANAEDALRPIPKVF
        I SKSKKRA PAIYAAKLQG+PKSGSGQVFLAHGLLVKPSFQNV+V SGTDI LNSSN+GILLPSQS  KSKKGLDVQGGVVALDRANAEDALRPIPKVF
Subjt:  ISSKSKKRAIPAIYAAKLQGIPKSGSGQVFLAHGLLVKPSFQNVVVHSGTDINLNSSNDGILLPSQSTAKSKKGLDVQGGVVALDRANAEDALRPIPKVF

Query:  DSQEKSTLYHGLQVDRNDVMIQLVDSGSQVEDDVGVEDSGAVCMEDQLRSLGILNSTDDHTSESILESSILKGIDLEANMSQKKLREAVLSLAPGAACKL
        DSQEKSTLY  LQVDRNDVM QLVDSGS++EDDVGV+DS AVCMEDQLRSLGIL+STDDH  ESIL+ +  KGIDLEANMSQKKLREAVLSLAPG A KL
Subjt:  DSQEKSTLYHGLQVDRNDVMIQLVDSGSQVEDDVGVEDSGAVCMEDQLRSLGILNSTDDHTSESILESSILKGIDLEANMSQKKLREAVLSLAPGAACKL

Query:  LENLVSIWQSRLCSGKNVLPWIYSLLLNHSQHILSQEPSTQILDSLFKMTKSKETAVQSLLQLSGRLQLVLAQVERASANKTDQMIRYDHKIDGSESDDD
        L NLV+IWQSRL  GKNVL WIYSLLLNHSQHILSQE S Q+LDSLFK+TKSKETAVQ LLQLSGRLQLVLAQ+ERAS NKTDQ I+   +I GSES DD
Subjt:  LENLVSIWQSRLCSGKNVLPWIYSLLLNHSQHILSQEPSTQILDSLFKMTKSKETAVQSLLQLSGRLQLVLAQVERASANKTDQMIRYDHKIDGSESDDD

Query:  EDEDEDEEVDDVLYGEEEYESELSSDDEN
         DE+EDEEVDDVLYGEEE ESELSSDDEN
Subjt:  EDEDEDEEVDDVLYGEEEYESELSSDDEN

XP_008458393.2 PREDICTED: LOW QUALITY PROTEIN: WD repeat-containing protein 43 [Cucumis melo] | 1.5e-307 | 88.39
Query:  MGLSNIRDLLTSFSPDLHFFAISSGDGRIKIWDALKGQIQTEFADFFASDSTSILTKPDKGHLSIDYKCMKWFSLEKKRKRKRQCSLLLLGTGSGDVLAL
        MG SNIRDLLTSFSPDLHFFAISSGDGRIKIWD LKGQIQTEFADFF SDSTSILTKP+KGHLSIDYKCMKW SLEKKRKRKRQC LLLLGTGSGDVLAL
Subjt:  MGLSNIRDLLTSFSPDLHFFAISSGDGRIKIWDALKGQIQTEFADFFASDSTSILTKPDKGHLSIDYKCMKWFSLEKKRKRKRQCSLLLLGTGSGDVLAL

Query:  DVAAGELKWKISDCHPGGVTTISFPTHGSSIYTAGADGMLCEIDSLTGNLLRKFKASTKAISCISVSPDGKIIATAASQMKIFNCSNHKKIQKFSGHPGA
        DVAAGELKWKISDCHPGGV +ISFPTHGS IYTAGADGMLCEI+SLTGN+LRKFKASTKAISCISVSPDGKIIATAASQMKIFNCSNHKKIQKFSGHPGA
Subjt:  DVAAGELKWKISDCHPGGVTTISFPTHGSSIYTAGADGMLCEIDSLTGNLLRKFKASTKAISCISVSPDGKIIATAASQMKIFNCSNHKKIQKFSGHPGA

Query:  VRCMVFTEDGKYILSSAVGERYIVVWSLGGGKKQSASCVLALEHPAIFVDSRLSKVDGDDETALYILAISEIGVCYLWYGQNLEELRSAKPTKILISDND
        VRCMVFTEDG+YILSSAVGERYIVVWS+ G +KQSASCVLA+EHPAIFVDSR S  DG DE+ALY+LAISEIGVCYLWYGQNLEELRSAKPTK+LISDND
Subjt:  VRCMVFTEDGKYILSSAVGERYIVVWSLGGGKKQSASCVLALEHPAIFVDSRLSKVDGDDETALYILAISEIGVCYLWYGQNLEELRSAKPTKILISDND

Query:  ISSKSKKRAIPAIYAAKLQGIPKSGSGQVFLAHGLLVKPSFQNVVVHSGTDINLNSSNDGILLPSQSTAKSKKGLDVQGGVVALDRANAEDALRPIPKVF
        I SKSKKRA PAIYAAKLQG+PKSGSGQVFLAHGLLVKPSFQNV+V SGTDI LNSSN+GILLPSQS  KSKKGLDVQGGVVALDRANAEDALRPIPKVF
Subjt:  ISSKSKKRAIPAIYAAKLQGIPKSGSGQVFLAHGLLVKPSFQNVVVHSGTDINLNSSNDGILLPSQSTAKSKKGLDVQGGVVALDRANAEDALRPIPKVF

Query:  DSQEKSTLYHGLQVDRNDVMIQLVDSGSQVEDDVGVEDSGAVCMEDQLRSLGILNSTDDHTSESILESSILKGIDLEANMSQKKLREAVLSLAPGAACKL
        DSQEKSTLY  LQVDRNDVM QLVDSGS++EDDVGV+DS AVCMEDQLRSLGIL+STDDH  ESIL+ +  KGIDLEANMSQKKLREAVLSLAPG A KL
Subjt:  DSQEKSTLYHGLQVDRNDVMIQLVDSGSQVEDDVGVEDSGAVCMEDQLRSLGILNSTDDHTSESILESSILKGIDLEANMSQKKLREAVLSLAPGAACKL

Query:  LENLVSIWQSRLCSGKNVLPWIYSLLLNHSQHILSQEPSTQILDSLFKMTKSKETAVQSLLQLSGRLQLVLAQVERASANKTDQMIRYDHKIDGSESDDD
        L NLV+IWQSRL  GKNVL WIYSLLLNHSQHILSQE S Q+LDSLFK+TKSKETAVQ LLQLSGRLQLVLAQ+ERAS NKTDQ I+   +I GSES DD
Subjt:  LENLVSIWQSRLCSGKNVLPWIYSLLLNHSQHILSQEPSTQILDSLFKMTKSKETAVQSLLQLSGRLQLVLAQVERASANKTDQMIRYDHKIDGSESDDD

Query:  EDEDEDEEVDDVLYGEEEYESELSSDDEN
         DE+EDEEVDDVLYGEEE ESELSSDDEN
Subjt:  EDEDEDEEVDDVLYGEEEYESELSSDDEN

XP_023000148.1 WD repeat-containing protein 43 [Cucurbita maxima] | 1.5e-307 | 87.76
Query:  MGLSNIRDLLTSFSPDLHFFAISSGDGRIKIWDALKGQIQTEFADFFASDSTSILTKPDKGHLSIDYKCMKWFSLEKKRKRKRQCSLLLLGTGSGDVLAL
        M  SNIRDLLTSFSPDLHFFAISSGDGRIKIWD LKGQIQTEFADFF +DSTSILTK  KGHLS+DYKCMKW SLEKKRKRKRQCSLLLLGTGSGDV+AL
Subjt:  MGLSNIRDLLTSFSPDLHFFAISSGDGRIKIWDALKGQIQTEFADFFASDSTSILTKPDKGHLSIDYKCMKWFSLEKKRKRKRQCSLLLLGTGSGDVLAL

Query:  DVAAGELKWKISDCHPGGVTTISFPTHGSSIYTAGADGMLCEIDSLTGNLLRKFKASTKAISCISVSPDGKIIATAASQMKIFNCSNHKKIQKFSGHPGA
        DVAAGELKW+ISDCHPGGV TISFPTHGS IYTAGADGMLCEIDS+TG+LLRKFKASTKAISCISVSPDGK IATAASQMKIFNCS+HKKIQKFSGHPGA
Subjt:  DVAAGELKWKISDCHPGGVTTISFPTHGSSIYTAGADGMLCEIDSLTGNLLRKFKASTKAISCISVSPDGKIIATAASQMKIFNCSNHKKIQKFSGHPGA

Query:  VRCMVFTEDGKYILSSAVGERYIVVWSLGGGKKQSASCVLALEHPAIFVDSRLSKVDGDDETALYILAISEIGVCYLWYGQNLEELRSAKPTKILISDND
        VRCMVFTEDGKYILSS VGERYIVVWSLGG KKQSASCVLA+EHPAIF+DSR S   G DETALY+LAISEIGVCYLWYGQNLEELRS KPTK+LIS+ND
Subjt:  VRCMVFTEDGKYILSSAVGERYIVVWSLGGGKKQSASCVLALEHPAIFVDSRLSKVDGDDETALYILAISEIGVCYLWYGQNLEELRSAKPTKILISDND

Query:  ISSKSKKRAIPAIYAAKLQGIPKSGSGQVFLAHGLLVKPSFQNVVVHSGTDINLNSSNDGILLPSQSTAKSKKGLDVQGGVVALDRANAEDALRPIPKVF
        ISSKSKKR  P+IYAA+ QGIPKSGSGQVFLAHGLLVKPSFQNVVVHSGTDINLNSSNDGILLPS S  KSKKGLDVQGGVVALDRANAEDAL PIPKVF
Subjt:  ISSKSKKRAIPAIYAAKLQGIPKSGSGQVFLAHGLLVKPSFQNVVVHSGTDINLNSSNDGILLPSQSTAKSKKGLDVQGGVVALDRANAEDALRPIPKVF

Query:  DSQEKSTLYHGLQVDRNDVMIQLVDSGSQVEDDVGVEDSGAVCMEDQLRSLGILNSTDDHTSESILESSILKGIDLEANMSQKKLREAVLSLAPGAACKL
        DSQEK TLYH L VDR+DVM +LVDSGSQVED VGVEDS AVC+ED+LR LGIL+STDDHTSESIL+S+I KGIDLEANMSQKKLREAVLSLAPG ACKL
Subjt:  DSQEKSTLYHGLQVDRNDVMIQLVDSGSQVEDDVGVEDSGAVCMEDQLRSLGILNSTDDHTSESILESSILKGIDLEANMSQKKLREAVLSLAPGAACKL

Query:  LENLVSIWQSRLCSGKNVLPWIYSLLLNHSQHILSQEPSTQILDSLFKMTKSKETAVQSLLQLSGRLQLVLAQVERASANKTDQMIRYDHKIDGSESDDD
        L NLVSIWQSRL SGKNVLPWIYSLLLNH+QHILSQE STQILDSLFK TKSKETAVQ LLQLSGRLQLVLAQ+ERASANKTDQ +R+  +IDG ES + 
Subjt:  LENLVSIWQSRLCSGKNVLPWIYSLLLNHSQHILSQEPSTQILDSLFKMTKSKETAVQSLLQLSGRLQLVLAQVERASANKTDQMIRYDHKIDGSESDDD

Query:  EDEDEDEEVDDVLYGEEEYESELSSDDEN
           DEDEEV DVLYGEEEYESELSSD+EN
Subjt:  EDEDEDEEVDDVLYGEEEYESELSSDDEN

XP_038876234.1 WD repeat-containing protein 43 [Benincasa hispida] | 0.0e+00 | 89.35
Query:  MGLSNIRDLLTSFSPDLHFFAISSGDGRIKIWDALKGQIQTEFADFFASDSTSILTKPDKGHLSIDYKCMKWFSLEKKRKRKRQCSLLLLGTGSGDVLAL
        MG SNI DLLTSFSPDLHFFAISSGDGRIKIWD LKGQIQTEFADFF SDSTSILTKP+KGHLS+DYKCMKW SLEKKRKRKRQCSLLLLGTGSGDVLAL
Subjt:  MGLSNIRDLLTSFSPDLHFFAISSGDGRIKIWDALKGQIQTEFADFFASDSTSILTKPDKGHLSIDYKCMKWFSLEKKRKRKRQCSLLLLGTGSGDVLAL

Query:  DVAAGELKWKISDCHPGGVTTISFPTHGSSIYTAGADGMLCEIDSLTGNLLRKFKASTKAISCISVSPDGKIIATAASQMKIFNCSNHKKIQKFSGHPGA
        DVAAGELKWKISDCHPGGV +ISFP HGS IYTAGADGMLCEIDSLTGNL RKFKASTKAISCISVSPDGKIIATAASQMKIFNCSNHKKIQKFSGHPGA
Subjt:  DVAAGELKWKISDCHPGGVTTISFPTHGSSIYTAGADGMLCEIDSLTGNLLRKFKASTKAISCISVSPDGKIIATAASQMKIFNCSNHKKIQKFSGHPGA

Query:  VRCMVFTEDGKYILSSAVGERYIVVWSLGGGKKQSASCVLALEHPAIFVDSRLSKVDGDDETALYILAISEIGVCYLWYGQNLEELRSAKPTKILISDND
        VRCMVFTEDG+YILSSAVGERY+VVWSL GGKKQSASCVLA+EHPAIFVDSR S  DG DETALYILAISEIGVCYLWYGQNL ELRSAKPTKILISDND
Subjt:  VRCMVFTEDGKYILSSAVGERYIVVWSLGGGKKQSASCVLALEHPAIFVDSRLSKVDGDDETALYILAISEIGVCYLWYGQNLEELRSAKPTKILISDND

Query:  ISSKSKKRAIPAIYAAKLQGIPKSGSGQVFLAHGLLVKPSFQNVVVHSGTDINLNSSNDGILLPSQSTAKSKKGLDVQGGVVALDRANAEDALRPIPKVF
        ISSKSK R  PAIYAAKLQG+PKSGSGQVFLAHGLLVKPSFQNV+V SGTDINLNSSN+GILLPSQS  KSKKGLDVQGGVVALDRANAEDALRPIPKVF
Subjt:  ISSKSKKRAIPAIYAAKLQGIPKSGSGQVFLAHGLLVKPSFQNVVVHSGTDINLNSSNDGILLPSQSTAKSKKGLDVQGGVVALDRANAEDALRPIPKVF

Query:  DSQEKSTLYHGLQVDRNDVMIQLVDSGSQVEDDVGVEDSGAVCMEDQLRSLGILNSTDDHTSESILESSILKGIDLEANMSQKKLREAVLSLAPGAACKL
        DSQEKSTLY  L +DRNDVM QLVDSGSQVED+ GVEDS +VCMEDQLRSLGIL+STDDHTSESIL+ +I KGIDLE NMSQKKLREAVLSLAPG ACKL
Subjt:  DSQEKSTLYHGLQVDRNDVMIQLVDSGSQVEDDVGVEDSGAVCMEDQLRSLGILNSTDDHTSESILESSILKGIDLEANMSQKKLREAVLSLAPGAACKL

Query:  LENLVSIWQSRLCSGKNVLPWIYSLLLNHSQHILSQEPSTQILDSLFKMTKSKETAVQSLLQLSGRLQLVLAQVERASANKTDQMIRYDHKIDGSESDDD
        L NLVSIWQ RL SGKNVLPWIYSLLLNHSQHILSQE S QILDSLFK+TKSKETAVQ LLQLSGRLQLVLAQ+ERASANKTDQ I++  +IDGSES DD
Subjt:  LENLVSIWQSRLCSGKNVLPWIYSLLLNHSQHILSQEPSTQILDSLFKMTKSKETAVQSLLQLSGRLQLVLAQVERASANKTDQMIRYDHKIDGSESDDD

Query:  EDEDEDEEVDDVLYGEEEYESELSSDDEN
        EDED   EVDDV YGEEE ESELSSDDE+
Subjt:  EDEDEDEEVDDVLYGEEEYESELSSDDEN

TrEMBL top hits | e value | % identity | Alignment
A0A0A0KF10 WD_REPEATS_REGION domain-containing protein | 1.7e-306 | 87.92
Query:  MGLSNIRDLLTSFSPDLHFFAISSGDGRIKIWDALKGQIQTEFADFFASDSTSILTKPDKGHLSIDYKCMKWFSLEKKRKRKRQCSLLLLGTGSGDVLAL
        MG SNIRDLLTSFSPDLHFFAISSGDGRIKIWD LKGQIQTEFADFF SDSTSILTKP+KGHLS+DYKCMKW SLEKKRKRKRQC LLLLGTGSGDVLAL
Subjt:  MGLSNIRDLLTSFSPDLHFFAISSGDGRIKIWDALKGQIQTEFADFFASDSTSILTKPDKGHLSIDYKCMKWFSLEKKRKRKRQCSLLLLGTGSGDVLAL

Query:  DVAAGELKWKISDCHPGGVTTISFPTHGSSIYTAGADGMLCEIDSLTGNLLRKFKASTKAISCISVSPDGKIIATAASQMKIFNCSNHKKIQKFSGHPGA
        DVAAGELKWKISDCHPGGV +ISFPTHGS IYTAGADGMLCEI+SLTGNLLRKFKASTKAISCISVSPDGKIIATAASQ+KIFNCSNHKKIQKFSGHPGA
Subjt:  DVAAGELKWKISDCHPGGVTTISFPTHGSSIYTAGADGMLCEIDSLTGNLLRKFKASTKAISCISVSPDGKIIATAASQMKIFNCSNHKKIQKFSGHPGA

Query:  VRCMVFTEDGKYILSSAVGERYIVVWSLGGGKKQSASCVLALEHPAIFVDSRLSKVDGDDETALYILAISEIGVCYLWYGQNLEELRSAKPTKILISDND
        VRCMVFTEDG+YILSSAVGERYIVVWS+ GGK+QSASCVLA+EHPAIFVDSR S  DG DETALYILAISEIG CYLWYGQNLEELR+AKPTKIL+S ND
Subjt:  VRCMVFTEDGKYILSSAVGERYIVVWSLGGGKKQSASCVLALEHPAIFVDSRLSKVDGDDETALYILAISEIGVCYLWYGQNLEELRSAKPTKILISDND

Query:  ISSKSKKRAIPAIYAAKLQGIPKSGSGQVFLAHGLLVKPSFQNVVVHSGTDINLNSSNDGILLPSQSTAKSKKGLDVQGGVVALDRANAEDALRPIPKVF
        I SKSKKRAIPAIYAAKLQG+PKSGSGQVFLAHGLLVKPSFQ+V+V SGTDINLNSSN+GILLPSQS  KSKKGLDVQGGVVALDRANAEDALRPIPK+F
Subjt:  ISSKSKKRAIPAIYAAKLQGIPKSGSGQVFLAHGLLVKPSFQNVVVHSGTDINLNSSNDGILLPSQSTAKSKKGLDVQGGVVALDRANAEDALRPIPKVF

Query:  DSQEKSTLYHGLQVDRNDVMIQLVDSGSQVEDDVGVEDSGAVCMEDQLRSLGILNSTDDHTSESILESSILKGIDLEANMSQKKLREAVLSLAPGAACKL
        DSQEKSTLY  LQVDR+DVM QLVDSGS++EDDVGV+DS AVCMEDQLRSLGIL++TDDH  ESIL+ +I KGIDLEAN+SQKKLREAVLSLAPG A KL
Subjt:  DSQEKSTLYHGLQVDRNDVMIQLVDSGSQVEDDVGVEDSGAVCMEDQLRSLGILNSTDDHTSESILESSILKGIDLEANMSQKKLREAVLSLAPGAACKL

Query:  LENLVSIWQSRLCSGKNVLPWIYSLLLNHSQHILSQEPSTQILDSLFKMTKSKETAVQSLLQLSGRLQLVLAQVERASANKTDQMIRYDHKIDGSE-SDD
        L NLV+IWQSRL  GKNVLPWIYSLLLNHSQHILSQE S Q+LDSLFK+TKSKETAVQ LLQLSGRLQLVLAQ+ER S NKT Q I+   +I GSE SDD
Subjt:  LENLVSIWQSRLCSGKNVLPWIYSLLLNHSQHILSQEPSTQILDSLFKMTKSKETAVQSLLQLSGRLQLVLAQVERASANKTDQMIRYDHKIDGSE-SDD

Query:  DEDEDEDEEVDDVLYGEEEYESELSSDDE
        DEDEDED+EVDDVLYGEEE ESELSSDDE
Subjt:  DEDEDEDEEVDDVLYGEEEYESELSSDDE

A0A1S3C7V9 LOW QUALITY PROTEIN: WD repeat-containing protein 43 | 7.0e-308 | 88.39
Query:  MGLSNIRDLLTSFSPDLHFFAISSGDGRIKIWDALKGQIQTEFADFFASDSTSILTKPDKGHLSIDYKCMKWFSLEKKRKRKRQCSLLLLGTGSGDVLAL
        MG SNIRDLLTSFSPDLHFFAISSGDGRIKIWD LKGQIQTEFADFF SDSTSILTKP+KGHLSIDYKCMKW SLEKKRKRKRQC LLLLGTGSGDVLAL
Subjt:  MGLSNIRDLLTSFSPDLHFFAISSGDGRIKIWDALKGQIQTEFADFFASDSTSILTKPDKGHLSIDYKCMKWFSLEKKRKRKRQCSLLLLGTGSGDVLAL

Query:  DVAAGELKWKISDCHPGGVTTISFPTHGSSIYTAGADGMLCEIDSLTGNLLRKFKASTKAISCISVSPDGKIIATAASQMKIFNCSNHKKIQKFSGHPGA
        DVAAGELKWKISDCHPGGV +ISFPTHGS IYTAGADGMLCEI+SLTGN+LRKFKASTKAISCISVSPDGKIIATAASQMKIFNCSNHKKIQKFSGHPGA
Subjt:  DVAAGELKWKISDCHPGGVTTISFPTHGSSIYTAGADGMLCEIDSLTGNLLRKFKASTKAISCISVSPDGKIIATAASQMKIFNCSNHKKIQKFSGHPGA

Query:  VRCMVFTEDGKYILSSAVGERYIVVWSLGGGKKQSASCVLALEHPAIFVDSRLSKVDGDDETALYILAISEIGVCYLWYGQNLEELRSAKPTKILISDND
        VRCMVFTEDG+YILSSAVGERYIVVWS+ G +KQSASCVLA+EHPAIFVDSR S  DG DE+ALY+LAISEIGVCYLWYGQNLEELRSAKPTK+LISDND
Subjt:  VRCMVFTEDGKYILSSAVGERYIVVWSLGGGKKQSASCVLALEHPAIFVDSRLSKVDGDDETALYILAISEIGVCYLWYGQNLEELRSAKPTKILISDND

Query:  ISSKSKKRAIPAIYAAKLQGIPKSGSGQVFLAHGLLVKPSFQNVVVHSGTDINLNSSNDGILLPSQSTAKSKKGLDVQGGVVALDRANAEDALRPIPKVF
        I SKSKKRA PAIYAAKLQG+PKSGSGQVFLAHGLLVKPSFQNV+V SGTDI LNSSN+GILLPSQS  KSKKGLDVQGGVVALDRANAEDALRPIPKVF
Subjt:  ISSKSKKRAIPAIYAAKLQGIPKSGSGQVFLAHGLLVKPSFQNVVVHSGTDINLNSSNDGILLPSQSTAKSKKGLDVQGGVVALDRANAEDALRPIPKVF

Query:  DSQEKSTLYHGLQVDRNDVMIQLVDSGSQVEDDVGVEDSGAVCMEDQLRSLGILNSTDDHTSESILESSILKGIDLEANMSQKKLREAVLSLAPGAACKL
        DSQEKSTLY  LQVDRNDVM QLVDSGS++EDDVGV+DS AVCMEDQLRSLGIL+STDDH  ESIL+ +  KGIDLEANMSQKKLREAVLSLAPG A KL
Subjt:  DSQEKSTLYHGLQVDRNDVMIQLVDSGSQVEDDVGVEDSGAVCMEDQLRSLGILNSTDDHTSESILESSILKGIDLEANMSQKKLREAVLSLAPGAACKL

Query:  LENLVSIWQSRLCSGKNVLPWIYSLLLNHSQHILSQEPSTQILDSLFKMTKSKETAVQSLLQLSGRLQLVLAQVERASANKTDQMIRYDHKIDGSESDDD
        L NLV+IWQSRL  GKNVL WIYSLLLNHSQHILSQE S Q+LDSLFK+TKSKETAVQ LLQLSGRLQLVLAQ+ERAS NKTDQ I+   +I GSES DD
Subjt:  LENLVSIWQSRLCSGKNVLPWIYSLLLNHSQHILSQEPSTQILDSLFKMTKSKETAVQSLLQLSGRLQLVLAQVERASANKTDQMIRYDHKIDGSESDDD

Query:  EDEDEDEEVDDVLYGEEEYESELSSDDEN
         DE+EDEEVDDVLYGEEE ESELSSDDEN
Subjt:  EDEDEDEEVDDVLYGEEEYESELSSDDEN

A0A5D3BUY7 WD repeat-containing protein 43 | 3.8e-309 | 88.71
Query:  MGLSNIRDLLTSFSPDLHFFAISSGDGRIKIWDALKGQIQTEFADFFASDSTSILTKPDKGHLSIDYKCMKWFSLEKKRKRKRQCSLLLLGTGSGDVLAL
        MG SNIRDLLTSFSPDLHFFAISSGDGRIKIWD LKGQIQTEFADFF SDSTSILTKP+KGHLSIDYKCMKW SLEKKRKRKRQC LLLLGTGSGDVLAL
Subjt:  MGLSNIRDLLTSFSPDLHFFAISSGDGRIKIWDALKGQIQTEFADFFASDSTSILTKPDKGHLSIDYKCMKWFSLEKKRKRKRQCSLLLLGTGSGDVLAL

Query:  DVAAGELKWKISDCHPGGVTTISFPTHGSSIYTAGADGMLCEIDSLTGNLLRKFKASTKAISCISVSPDGKIIATAASQMKIFNCSNHKKIQKFSGHPGA
        DVAAGELKWKISDCHPGGV +ISFPTHGS IYTAGADGMLCEI+SLTGN+LRKFKASTKAISCISVSPDGKIIATAASQMKIFNCSNHKKIQKFSGHPGA
Subjt:  DVAAGELKWKISDCHPGGVTTISFPTHGSSIYTAGADGMLCEIDSLTGNLLRKFKASTKAISCISVSPDGKIIATAASQMKIFNCSNHKKIQKFSGHPGA

Query:  VRCMVFTEDGKYILSSAVGERYIVVWSLGGGKKQSASCVLALEHPAIFVDSRLSKVDGDDETALYILAISEIGVCYLWYGQNLEELRSAKPTKILISDND
        VRCMVFTEDG+YILSSAVGERYIVVWS+ GGKKQSASCVLA+EHPAIFVDSR S  DG DE+ALY+LAISEIGVCYLWYGQNLEELRSAKPTK+LISDND
Subjt:  VRCMVFTEDGKYILSSAVGERYIVVWSLGGGKKQSASCVLALEHPAIFVDSRLSKVDGDDETALYILAISEIGVCYLWYGQNLEELRSAKPTKILISDND

Query:  ISSKSKKRAIPAIYAAKLQGIPKSGSGQVFLAHGLLVKPSFQNVVVHSGTDINLNSSNDGILLPSQSTAKSKKGLDVQGGVVALDRANAEDALRPIPKVF
        I SKSKKRA PAIYAAKLQG+PKSGSGQVFLAHGLLVKPSFQNV+V SGTDI LNSSN+GILLPSQS  KSKKGLDVQGGVVALDRANAEDALRPIPKVF
Subjt:  ISSKSKKRAIPAIYAAKLQGIPKSGSGQVFLAHGLLVKPSFQNVVVHSGTDINLNSSNDGILLPSQSTAKSKKGLDVQGGVVALDRANAEDALRPIPKVF

Query:  DSQEKSTLYHGLQVDRNDVMIQLVDSGSQVEDDVGVEDSGAVCMEDQLRSLGILNSTDDHTSESILESSILKGIDLEANMSQKKLREAVLSLAPGAACKL
        DSQEKSTLY  LQVDRNDVM QLVDSGS++EDDVGV+DS AVCMEDQLRSLGIL+STDDH  ESIL+ +  KGIDLEANMSQKKLREAVLSLAPG A KL
Subjt:  DSQEKSTLYHGLQVDRNDVMIQLVDSGSQVEDDVGVEDSGAVCMEDQLRSLGILNSTDDHTSESILESSILKGIDLEANMSQKKLREAVLSLAPGAACKL

Query:  LENLVSIWQSRLCSGKNVLPWIYSLLLNHSQHILSQEPSTQILDSLFKMTKSKETAVQSLLQLSGRLQLVLAQVERASANKTDQMIRYDHKIDGSESDDD
        L NLV+IWQSRL  GKNVL WIYSLLLNHSQHILSQE S Q+LDSLFK+TKSKETAVQ LLQLSGRLQLVLAQ+ERAS NKTDQ I+   +I GSES DD
Subjt:  LENLVSIWQSRLCSGKNVLPWIYSLLLNHSQHILSQEPSTQILDSLFKMTKSKETAVQSLLQLSGRLQLVLAQVERASANKTDQMIRYDHKIDGSESDDD

Query:  EDEDEDEEVDDVLYGEEEYESELSSDDEN
         DE+EDEEVDDVLYGEEE ESELSSDDEN
Subjt:  EDEDEDEEVDDVLYGEEEYESELSSDDEN

A0A6J1EX27 WD repeat-containing protein 43 | 2.1e-307 | 87.6
Query:  MGLSNIRDLLTSFSPDLHFFAISSGDGRIKIWDALKGQIQTEFADFFASDSTSILTKPDKGHLSIDYKCMKWFSLEKKRKRKRQCSLLLLGTGSGDVLAL
        M  SNIRDLLTSFSPDLH FAISSGDGRIKIWD LKGQIQTEFADFF +DSTSILTK  KGHLS+DYKCMKW SLEKKRKRKRQCSLLLLGTGSGDV+AL
Subjt:  MGLSNIRDLLTSFSPDLHFFAISSGDGRIKIWDALKGQIQTEFADFFASDSTSILTKPDKGHLSIDYKCMKWFSLEKKRKRKRQCSLLLLGTGSGDVLAL

Query:  DVAAGELKWKISDCHPGGVTTISFPTHGSSIYTAGADGMLCEIDSLTGNLLRKFKASTKAISCISVSPDGKIIATAASQMKIFNCSNHKKIQKFSGHPGA
        DVAAGELKWKISDCHPGGV TISFPTHGS IYTAGADGMLCEIDS+TG+LLRKFKASTKAISCISVSPDGK IATAASQMKIFNCS+HKKIQKFSGHPGA
Subjt:  DVAAGELKWKISDCHPGGVTTISFPTHGSSIYTAGADGMLCEIDSLTGNLLRKFKASTKAISCISVSPDGKIIATAASQMKIFNCSNHKKIQKFSGHPGA

Query:  VRCMVFTEDGKYILSSAVGERYIVVWSLGGGKKQSASCVLALEHPAIFVDSRLSKVDGDDETALYILAISEIGVCYLWYGQNLEELRSAKPTKILISDND
        VRCMVFTEDGKYILSS VGERYIVVWSLGG KKQSASCVLA+EHPAIFVDSR S   G DE ALY+LAISEIGVCYLWYGQNLEELRS KPTK+LIS+ND
Subjt:  VRCMVFTEDGKYILSSAVGERYIVVWSLGGGKKQSASCVLALEHPAIFVDSRLSKVDGDDETALYILAISEIGVCYLWYGQNLEELRSAKPTKILISDND

Query:  ISSKSKKRAIPAIYAAKLQGIPKSGSGQVFLAHGLLVKPSFQNVVVHSGTDINLNSSNDGILLPSQSTAKSKKGLDVQGGVVALDRANAEDALRPIPKVF
        ISSKSKKR  P+IYAA+ QGIPKSGSGQVFLAHGLLVKPSFQNV+VHSGTDI LNSSNDGILLPS S  KSKK LDVQGGVVALDRANAEDALRPIPKVF
Subjt:  ISSKSKKRAIPAIYAAKLQGIPKSGSGQVFLAHGLLVKPSFQNVVVHSGTDINLNSSNDGILLPSQSTAKSKKGLDVQGGVVALDRANAEDALRPIPKVF

Query:  DSQEKSTLYHGLQVDRNDVMIQLVDSGSQVEDDVGVEDSGAVCMEDQLRSLGILNSTDDHTSESILESSILKGIDLEANMSQKKLREAVLSLAPGAACKL
        DSQEK TLYH L VDR+DVM +LVDSGSQV+D VGVEDS AVC+ED+LRSLGIL+STDDHTS+SIL+S+I KGIDLEANMSQKKLREAVLSLAPG ACKL
Subjt:  DSQEKSTLYHGLQVDRNDVMIQLVDSGSQVEDDVGVEDSGAVCMEDQLRSLGILNSTDDHTSESILESSILKGIDLEANMSQKKLREAVLSLAPGAACKL

Query:  LENLVSIWQSRLCSGKNVLPWIYSLLLNHSQHILSQEPSTQILDSLFKMTKSKETAVQSLLQLSGRLQLVLAQVERASANKTDQMIRYDHKIDGSESDDD
        L NLVSIWQSRL SGKNVLPWIYSLLLNH+QHILSQE STQILDSLFK+TKSKETAVQ LLQLSGRLQLVLAQ+ERASANKTDQ +R+  +IDG ES +D
Subjt:  LENLVSIWQSRLCSGKNVLPWIYSLLLNHSQHILSQEPSTQILDSLFKMTKSKETAVQSLLQLSGRLQLVLAQVERASANKTDQMIRYDHKIDGSESDDD

Query:  EDEDEDEEVDDVLYGEEEYESELSSDDEN
        ED   DEEVDDVLYGEEEYESELSSD+EN
Subjt:  EDEDEDEEVDDVLYGEEEYESELSSDDEN

A0A6J1KLT8 WD repeat-containing protein 43 | 7.0e-308 | 87.76
Query:  MGLSNIRDLLTSFSPDLHFFAISSGDGRIKIWDALKGQIQTEFADFFASDSTSILTKPDKGHLSIDYKCMKWFSLEKKRKRKRQCSLLLLGTGSGDVLAL
        M  SNIRDLLTSFSPDLHFFAISSGDGRIKIWD LKGQIQTEFADFF +DSTSILTK  KGHLS+DYKCMKW SLEKKRKRKRQCSLLLLGTGSGDV+AL
Subjt:  MGLSNIRDLLTSFSPDLHFFAISSGDGRIKIWDALKGQIQTEFADFFASDSTSILTKPDKGHLSIDYKCMKWFSLEKKRKRKRQCSLLLLGTGSGDVLAL

Query:  DVAAGELKWKISDCHPGGVTTISFPTHGSSIYTAGADGMLCEIDSLTGNLLRKFKASTKAISCISVSPDGKIIATAASQMKIFNCSNHKKIQKFSGHPGA
        DVAAGELKW+ISDCHPGGV TISFPTHGS IYTAGADGMLCEIDS+TG+LLRKFKASTKAISCISVSPDGK IATAASQMKIFNCS+HKKIQKFSGHPGA
Subjt:  DVAAGELKWKISDCHPGGVTTISFPTHGSSIYTAGADGMLCEIDSLTGNLLRKFKASTKAISCISVSPDGKIIATAASQMKIFNCSNHKKIQKFSGHPGA

Query:  VRCMVFTEDGKYILSSAVGERYIVVWSLGGGKKQSASCVLALEHPAIFVDSRLSKVDGDDETALYILAISEIGVCYLWYGQNLEELRSAKPTKILISDND
        VRCMVFTEDGKYILSS VGERYIVVWSLGG KKQSASCVLA+EHPAIF+DSR S   G DETALY+LAISEIGVCYLWYGQNLEELRS KPTK+LIS+ND
Subjt:  VRCMVFTEDGKYILSSAVGERYIVVWSLGGGKKQSASCVLALEHPAIFVDSRLSKVDGDDETALYILAISEIGVCYLWYGQNLEELRSAKPTKILISDND

Query:  ISSKSKKRAIPAIYAAKLQGIPKSGSGQVFLAHGLLVKPSFQNVVVHSGTDINLNSSNDGILLPSQSTAKSKKGLDVQGGVVALDRANAEDALRPIPKVF
        ISSKSKKR  P+IYAA+ QGIPKSGSGQVFLAHGLLVKPSFQNVVVHSGTDINLNSSNDGILLPS S  KSKKGLDVQGGVVALDRANAEDAL PIPKVF
Subjt:  ISSKSKKRAIPAIYAAKLQGIPKSGSGQVFLAHGLLVKPSFQNVVVHSGTDINLNSSNDGILLPSQSTAKSKKGLDVQGGVVALDRANAEDALRPIPKVF

Query:  DSQEKSTLYHGLQVDRNDVMIQLVDSGSQVEDDVGVEDSGAVCMEDQLRSLGILNSTDDHTSESILESSILKGIDLEANMSQKKLREAVLSLAPGAACKL
        DSQEK TLYH L VDR+DVM +LVDSGSQVED VGVEDS AVC+ED+LR LGIL+STDDHTSESIL+S+I KGIDLEANMSQKKLREAVLSLAPG ACKL
Subjt:  DSQEKSTLYHGLQVDRNDVMIQLVDSGSQVEDDVGVEDSGAVCMEDQLRSLGILNSTDDHTSESILESSILKGIDLEANMSQKKLREAVLSLAPGAACKL

Query:  LENLVSIWQSRLCSGKNVLPWIYSLLLNHSQHILSQEPSTQILDSLFKMTKSKETAVQSLLQLSGRLQLVLAQVERASANKTDQMIRYDHKIDGSESDDD
        L NLVSIWQSRL SGKNVLPWIYSLLLNH+QHILSQE STQILDSLFK TKSKETAVQ LLQLSGRLQLVLAQ+ERASANKTDQ +R+  +IDG ES + 
Subjt:  LENLVSIWQSRLCSGKNVLPWIYSLLLNHSQHILSQEPSTQILDSLFKMTKSKETAVQSLLQLSGRLQLVLAQVERASANKTDQMIRYDHKIDGSESDDD

Query:  EDEDEDEEVDDVLYGEEEYESELSSDDEN
           DEDEEV DVLYGEEEYESELSSD+EN
Subjt:  EDEDEDEEVDDVLYGEEEYESELSSDDEN

SwissProt top hits | e value | % identity | Alignment
P03010 Putative AC9 transposase | 5.9e-17 | 34.98
Query:  MLSSMANATKAKFEKYWGFDERKNNILLYVAVVLDPRF----------KLNTGAASV-----------CFSSGSGGGSSSASASTSANFKSVETTSFDKE
        ++  MA A   KFEKYW    + +NI L VA  LDPR+          K +  +  V            +   S    S+    T+ N  S++ T  + E
Subjt:  MLSSMANATKAKFEKYWGFDERKNNILLYVAVVLDPRF----------KLNTGAASV-----------CFSSGSGGGSSSASASTSANFKSVETTSFDKE

Query:  -------------------SEIDVYLLETLAKDSSSFDILYWWKQNGHRFEVLSRMSRDILAIPVSTVASESAFSTGGRVVNTSRCSLTPKTVKALICAK
                           +E+D Y+ E L K S  FDIL WW+     + +L++++RD+LAI VSTVASESAFS GGRVV+  R  L  + V+ALIC K
Subjt:  -------------------SEIDVYLLETLAKDSSSFDILYWWKQNGHRFEVLSRMSRDILAIPVSTVASESAFSTGGRVVNTSRCSLTPKTVKALICAK

Query:  NWL
        +W+
Subjt:  NWL

P03010 Putative AC9 transposase | 1.2e-06 | 31.25
Query:  FPKKRRVQNNQWFGIILRARDVLEVYARLRDRLKEMFASLKCKVYLTTDYWTSGANMSYMVLTAHFIDSEWKLHKRILSFCQIE-NHRGDTIASKF
        FP K RV   ++         ++++Y   +++L      ++ +   T D WTS  N SYM +T H+ID +W L KRI+ F  +E  H G  ++  F
Subjt:  FPKKRRVQNNQWFGIILRARDVLEVYARLRDRLKEMFASLKCKVYLTTDYWTSGANMSYMVLTAHFIDSEWKLHKRILSFCQIE-NHRGDTIASKF

P08770 Putative AC transposase | 5.9e-17 | 34.98
Query:  MLSSMANATKAKFEKYWGFDERKNNILLYVAVVLDPRF----------KLNTGAASV-----------CFSSGSGGGSSSASASTSANFKSVETTSFDKE
        ++  MA A   KFEKYW    + +NI L VA  LDPR+          K +  +  V            +   S    S+    T+ N  S++ T  + E
Subjt:  MLSSMANATKAKFEKYWGFDERKNNILLYVAVVLDPRF----------KLNTGAASV-----------CFSSGSGGGSSSASASTSANFKSVETTSFDKE

Query:  -------------------SEIDVYLLETLAKDSSSFDILYWWKQNGHRFEVLSRMSRDILAIPVSTVASESAFSTGGRVVNTSRCSLTPKTVKALICAK
                           +E+D Y+ E L K S  FDIL WW+     + +L++++RD+LAI VSTVASESAFS GGRVV+  R  L  + V+ALIC K
Subjt:  -------------------SEIDVYLLETLAKDSSSFDILYWWKQNGHRFEVLSRMSRDILAIPVSTVASESAFSTGGRVVNTSRCSLTPKTVKALICAK

Query:  NWL
        +W+
Subjt:  NWL

P08770 Putative AC transposase | 1.2e-06 | 31.25
Query:  FPKKRRVQNNQWFGIILRARDVLEVYARLRDRLKEMFASLKCKVYLTTDYWTSGANMSYMVLTAHFIDSEWKLHKRILSFCQIE-NHRGDTIASKF
        FP K RV   ++         ++++Y   +++L      ++ +   T D WTS  N SYM +T H+ID +W L KRI+ F  +E  H G  ++  F
Subjt:  FPKKRRVQNNQWFGIILRARDVLEVYARLRDRLKEMFASLKCKVYLTTDYWTSGANMSYMVLTAHFIDSEWKLHKRILSFCQIE-NHRGDTIASKF

Q6AVI0 Zinc finger BED domain-containing protein RICESLEEPER 2 | 1.0e-16 | 33.19
Query:  LLNSKG-SNRMLSSMANATKAKFEKYWGFDERKNNILLYVAVVLDPRFKLNTGAAS------------------------------------VCFSSGSG
        L N+ G  + + SS+A     +F+KYW    +  N++L +AVV+DPRFK+     S                                         G G
Subjt:  LLNSKG-SNRMLSSMANATKAKFEKYWGFDERKNNILLYVAVVLDPRFKLNTGAAS------------------------------------VCFSSGSG

Query:  GGSSSASASTSAN-------------FKSVETTSFDKESEIDVYLLETLAKDSSSFDILYWWKQNGHRFEVLSRMSRDILAIPVSTVAS-ESAFS--TGG
          + ++   T A              + S   TS   +SE++ YL E+L      FDIL WWK N  +F  LSRM+RDILAIP+S V+S  S FS  TG 
Subjt:  GGSSSASASTSAN-------------FKSVETTSFDKESEIDVYLLETLAKDSSSFDILYWWKQNGHRFEVLSRMSRDILAIPVSTVAS-ESAFS--TGG

Query:  RVVNTSRCSLTPKTVKALICAKNWLDSKP
        R+++  R SL P+ V+AL+CAK+WL   P
Subjt:  RVVNTSRCSLTPKTVKALICAKNWLDSKP

Q6AVI0 Zinc finger BED domain-containing protein RICESLEEPER 2 | 3.2e-07 | 31.08
Query:  DVLEVYARLRDRLKEMFASLKCKVYLTTDYWTSGANMSYMVLTAHFIDSEWKLHKRILSFCQIENHRGDTIASK
        +V  VY + ++ L + F+++  ++ LT   WT+   + Y+ L   FIDSEWK+H+R+L+F  + +   +   S+
Subjt:  DVLEVYARLRDRLKEMFASLKCKVYLTTDYWTSGANMSYMVLTAHFIDSEWKLHKRILSFCQIENHRGDTIASK

Q75HY5 Zinc finger BED domain-containing protein RICESLEEPER 3 | 2.2e-16 | 32.38
Query:  SMANATKAKFEKYWGFDERKNNILLYVAVVLDPRFKLN---------------------TGAASVCFSSGSGGGSSSASASTSANFKSVE----------
        S A     +F+KYW    +  N++L +AVV+DPRFK+                        A    +S  +  G ++  A  + N  +V           
Subjt:  SMANATKAKFEKYWGFDERKNNILLYVAVVLDPRFKLN---------------------TGAASVCFSSGSGGGSSSASASTSANFKSVE----------

Query:  -------TTSFDKESEIDVYLLETLAKDSSSFDILYWWKQNGHRFEVLSRMSRDILAIPVSTVASE----SAFSTGGRVVNTSRCSLTPKTVKALICAKN
                TS    SE++ YL E L      F+IL WWK N  +F  LS+M+RD+LAIP+S V+S     SA +TG ++++  R SL P+TV+AL CAK+
Subjt:  -------TTSFDKESEIDVYLLETLAKDSSSFDILYWWKQNGHRFEVLSRMSRDILAIPVSTVASE----SAFSTGGRVVNTSRCSLTPKTVKALICAKN

Query:  WLDSKPISLD
        WL   P + +
Subjt:  WLDSKPISLD

Q75HY5 Zinc finger BED domain-containing protein RICESLEEPER 3 | 4.7e-06 | 25
Query:  VLEVYARLRDRLKEMFASLKCKVYLTTDYWTSGANMSYMVLTAHFIDSEWKLHKRILSFCQIEN-HRGDTIASKFEKAFDRLEDEDPIY--KNDSPPTKD
        V  VY + R+ L  +F+++  ++ LT   W +   + Y+ L A FID+EW++H+R+++F  + + H  ++++     +      +D ++    D+ P+  
Subjt:  VLEVYARLRDRLKEMFASLKCKVYLTTDYWTSGANMSYMVLTAHFIDSEWKLHKRILSFCQIEN-HRGDTIASKFEKAFDRLEDEDPIY--KNDSPPTKD

Query:  DWTDAKML
        D   A M+
Subjt:  DWTDAKML

Q9M2N5 Zinc finger BED domain-containing protein DAYSLEEPER | 6.7e-21 | 30.51
Query:  ASKFEKAFDRLEDEDPIYKNDSPPTKDDWTDAKMLITTI----------------------------QNCILLNSKGSNRMLSSMANATKAKFEKYWGFD
        AS+ ++ F  L+  DP YK   PP+ +DW   + L T +                            Q+ +     G +  ++ +A   + K +KYW   
Subjt:  ASKFEKAFDRLEDEDPIYKNDSPPTKDDWTDAKMLITTI----------------------------QNCILLNSKGSNRMLSSMANATKAKFEKYWGFD

Query:  ERKNNILLYVAVVLDPRFKLNTGAASVCFSSGSGGGSS-------------------SASASTSANFKS----------VETTSFDKESEIDVYLLETLA
         R  +++L +AVV+DPRFK+     S     G   G +                   S   +TS   K+          +ETT  + +SE+D YL ETL 
Subjt:  ERKNNILLYVAVVLDPRFKLNTGAASVCFSSGSGGGSS-------------------SASASTSANFKS----------VETTSFDKESEIDVYLLETLA

Query:  KDSSSFDILYWWKQNGHRFEVLSRMSRDILAIPVSTVASESAFSTGGRVVNTSRCSLTPKTVKALICAKNWL
             FD+L WWKQN  ++  LS+M+RDIL+IPVS  A +  F    R ++  + SL P+TV+ALICA+ WL
Subjt:  KDSSSFDILYWWKQNGHRFEVLSRMSRDILAIPVSTVASESAFSTGGRVVNTSRCSLTPKTVKALICAKNWL

Q9M2N5 Zinc finger BED domain-containing protein DAYSLEEPER | 2.0e-04 | 32.2
Query:  DVLEVYARLRDRLKEMFASLKCKVYLTTDYWTSGANMSYMVLTAHFIDSEWKLHKRILS
        D +  Y   +  + +    +  +  LT D+WTS   + Y+ +TAH+IDS+WK+ K++L+
Subjt:  DVLEVYARLRDRLKEMFASLKCKVYLTTDYWTSGANMSYMVLTAHFIDSEWKLHKRILS

Arabidopsis top hits | e value | % identity | Alignment
AT1G15420.1 CONTAINS InterPro DOMAIN/s: Small-subunit processome, Utp12 (InterPro:IPR007148); Has 764 Blast hits to 656 proteins in 193 species: Archae - 0; Bacteria - 42; Metazoa - 237; Fungi - 154; Plants - 85; Viruses - 23; Other Eukaryotes - 223 (source: NCBI BLink). | 1.1e-05 | 26.82
Query:  DSG-SQVEDDVGVEDS-GAVCMEDQLRSLGILNSTDDHTSESILESS--------------ILKG----------IDLEANMSQKKLREAVLSLAPGAAC
        DSG  +  D V V+D+     + D+L SL +LN    ++ ES  +S+              +L+           +D   N  ++ +  +V  L      
Subjt:  DSG-SQVEDDVGVEDS-GAVCMEDQLRSLGILNSTDDHTSESILESS--------------ILKG----------IDLEANMSQKKLREAVLSLAPGAAC

Query:  KLLENLVSIWQSRLCSGKNVLPWIYSLLLNHSQHILSQEPSTQILDSLFKMTKSKETAVQSLLQLSGRLQLVLAQVERASANKTDQMIRYDHKIDGSESD
        KLL  L+ I QSR       +PWI SLLL HS  I+SQE S   L++++++ +S+ + + + +++S  L L+   V+     + +  + Y+        D
Subjt:  KLLENLVSIWQSRLCSGKNVLPWIYSLLLNHSQHILSQEPSTQILDSLFKMTKSKETAVQSLLQLSGRLQLVLAQVERASANKTDQMIRYDHKIDGSESD

Query:  DDEDEDEDEEVDDVLYGEEE
         D DEDE+E +++ +  +EE
Subjt:  DDEDEDEDEEVDDVLYGEEE

AT1G18560.1 BED zinc finger; hAT family dimerisation domain | 2.8e-06 | 27.93
Query:  GGSSSASASTSANFKSVETTSFDKESEIDVYLLETLAKDSSSFDILYWWKQNGHRFEVLSRMSRDILAIPVSTVASESAFSTGGRVVNTSRCSLTPKTVK
        GG + + A   A  K   + S +   E+  YL E++    +  D+L WWK N  R+  LS M+RD LA+  ++ A E  F   G  ++  +  +   + +
Subjt:  GGSSSASASTSANFKSVETTSFDKESEIDVYLLETLAKDSSSFDILYWWKQNGHRFEVLSRMSRDILAIPVSTVASESAFSTGGRVVNTSRCSLTPKTVK

Query:  ALICAKNWLDS
        ++IC ++W+++
Subjt:  ALICAKNWLDS

AT1G18560.1 BED zinc finger; hAT family dimerisation domain | 4.0e-05 | 33.33
Query:  EVYARLRDRLKEMFASLKCKVYLTTDYWTSGANMSYMVLTAHFIDSEWKLHKRILSFCQI
        EV+  +R  +K     ++ KV +T  +W S  N+ YM +T  +ID  W  H+ +L  C+I
Subjt:  EVYARLRDRLKEMFASLKCKVYLTTDYWTSGANMSYMVLTAHFIDSEWKLHKRILSFCQI

AT1G73720.1 transducin family protein / WD-40 repeat family protein | 1.6e-09 | 26.06
Query:  FSPDLHFFAISSGDGRIKIWDALKGQIQTEFADFFASDSTSILTKPDKGHLSIDYKCMKWFSLEKKRKRKRQCSLLLLGTGSGDVLALDVAAGELKWKIS
        FSPD  F A SS DG I++WD + G+++ +    + +D + ++   D   L ID+               R   +L  G+  G +    +  G    +  
Subjt:  FSPDLHFFAISSGDGRIKIWDALKGQIQTEFADFFASDSTSILTKPDKGHLSIDYKCMKWFSLEKKRKRKRQCSLLLLGTGSGDVLALDVAAGELKWKIS

Query:  DCHPGGVTTISFPTHGSSIYTAGADGMLCEIDSLTGNLLRKFKASTKAISCISVSPDGKIIATAASQ--MKIFNCSNHKKIQKFSGHP
        D H  GVT++SF   GS + +   D         +G LL++F+  T  ++    + DG  I TA+S   +K+++      +Q F   P
Subjt:  DCHPGGVTTISFPTHGSSIYTAGADGMLCEIDSLTGNLLRKFKASTKAISCISVSPDGKIIATAASQ--MKIFNCSNHKKIQKFSGHP

AT3G42170.1 BED zinc finger; hAT family dimerisation domain | 4.8e-22 | 30.51
Query:  ASKFEKAFDRLEDEDPIYKNDSPPTKDDWTDAKMLITTI----------------------------QNCILLNSKGSNRMLSSMANATKAKFEKYWGFD
        AS+ ++ F  L+  DP YK   PP+ +DW   + L T +                            Q+ +     G +  ++ +A   + K +KYW   
Subjt:  ASKFEKAFDRLEDEDPIYKNDSPPTKDDWTDAKMLITTI----------------------------QNCILLNSKGSNRMLSSMANATKAKFEKYWGFD

Query:  ERKNNILLYVAVVLDPRFKLNTGAASVCFSSGSGGGSS-------------------SASASTSANFKS----------VETTSFDKESEIDVYLLETLA
         R  +++L +AVV+DPRFK+     S     G   G +                   S   +TS   K+          +ETT  + +SE+D YL ETL 
Subjt:  ERKNNILLYVAVVLDPRFKLNTGAASVCFSSGSGGGSS-------------------SASASTSANFKS----------VETTSFDKESEIDVYLLETLA

Query:  KDSSSFDILYWWKQNGHRFEVLSRMSRDILAIPVSTVASESAFSTGGRVVNTSRCSLTPKTVKALICAKNWL
             FD+L WWKQN  ++  LS+M+RDIL+IPVS  A +  F    R ++  + SL P+TV+ALICA+ WL
Subjt:  KDSSSFDILYWWKQNGHRFEVLSRMSRDILAIPVSTVASESAFSTGGRVVNTSRCSLTPKTVKALICAKNWL

AT3G42170.1  BED zinc finger; hAT family dimerisation domain  e value: 1.4e-05  %identity: 32.2
Query:  DVLEVYARLRDRLKEMFASLKCKVYLTTDYWTSGANMSYMVLTAHFIDSEWKLHKRILS
        D +  Y   +  + +    +  +  LT D+WTS   + Y+ +TAH+IDS+WK+ K++L+
Subjt:  DVLEVYARLRDRLKEMFASLKCKVYLTTDYWTSGANMSYMVLTAHFIDSEWKLHKRILS

AT5G11240.1  transducin family protein / WD-40 repeat family protein  e value: 4.7e-195  %identity: 57.07
Query:  MGLSNIRDLLTSFSPDLHFFAISSGDGRIKIWDALKGQIQTEFADFFASDSTSILTKPDKGHLSIDYKCMKWFSLEKKRKRKRQCSLLLLGTGSGDVLAL
        M L NI+D+LTSFSP L + A+S+GDGRIKIWD +KGQ+QTEFAD  +++ T+I TK  KGHLS+DY CMKW SLEKK+KRK   S+L+LGTG GDVLAL
Subjt:  MGLSNIRDLLTSFSPDLHFFAISSGDGRIKIWDALKGQIQTEFADFFASDSTSILTKPDKGHLSIDYKCMKWFSLEKKRKRKRQCSLLLLGTGSGDVLAL

Query:  DVAAGELKWKISDCHPGGVTTISFPTHGSSIYTAGADGMLCEIDSLTGNLLRKFKASTKAISCISVSPDGKIIATAASQMKIFNCSNHKKIQKFSGHPGA
        DVA+G+LKW+ISDCHPGGV  +S     S IY+ GADGM+C+ID  +GNL+RKFKASTK +S + VSPDGKI+ TA++Q+K FNCS+ KKIQKF+GHPG 
Subjt:  DVAAGELKWKISDCHPGGVTTISFPTHGSSIYTAGADGMLCEIDSLTGNLLRKFKASTKAISCISVSPDGKIIATAASQMKIFNCSNHKKIQKFSGHPGA

Query:  VRCMVFTEDGKYILSSAVGERYIVVWSLGGGKKQSASCVLALEHPAIFVDSRLSKVDGDDETALYILAISEIGVCYLWYGQNLEELRSAKPTKILISDND
        VRC+ FTEDGKY+LSSAVGERYI VW   G KKQSASCVLALEHP +FVDS        +E  LY+LAISEIGVCY WYG N+EEL +A PTK+ ++  D
Subjt:  VRCMVFTEDGKYILSSAVGERYIVVWSLGGGKKQSASCVLALEHPAIFVDSRLSKVDGDDETALYILAISEIGVCYLWYGQNLEELRSAKPTKILISDND

Query:  ISSKSKKRAIPAIYAAKLQGIPKSGSGQVFLAHGLLVKPSFQNVVVHSGTDINLNSSNDGILLP-SQSTAKSKKGLDVQGGVVALDRANAEDALRPIPKV
         S K  K ++P I+AAKLQGI K GS   F+A GLLVKPSFQ +V+  G D+ LN+S DGILLP +QS +KS K   VQ  V  LDRA+AEDAL PI +V
Subjt:  ISSKSKKRAIPAIYAAKLQGIPKSGSGQVFLAHGLLVKPSFQNVVVHSGTDINLNSSNDGILLP-SQSTAKSKKGLDVQGGVVALDRANAEDALRPIPKV

Query:  FDSQEKSTLYHGLQVDRNDVMIQLVDSGSQVEDDVGVEDSGAVCMEDQLRSLGILNSTDDHTSESILESSILKGIDLEANMSQKKLREAVLSLAPGAACK
         D  EK +           V +   D  + + D    +      MED+LRSLGIL  TD+H  +++  +SI+ G DL+A +  KKL+ AVLS+ P  A K
Subjt:  FDSQEKSTLYHGLQVDRNDVMIQLVDSGSQVEDDVGVEDSGAVCMEDQLRSLGILNSTDDHTSESILESSILKGIDLEANMSQKKLREAVLSLAPGAACK

Query:  LLENLVSIWQSRLCSGKNVLPWIYSLLLNHSQHILSQEP-STQILDSLFKMTKSKETAVQSLLQLSGRLQLVLAQVERASANKTDQMIRYDHKIDGSESD
         LE L ++WQ+R C G+++LPWIYS+++NHS +I+SQEP + Q+L++L K+TKS+ TA+Q LLQLSGRLQLV AQ+ +A+ ++T Q+  +D +I      
Subjt:  LLENLVSIWQSRLCSGKNVLPWIYSLLLNHSQHILSQEP-STQILDSLFKMTKSKETAVQSLLQLSGRLQLVLAQVERASANKTDQMIRYDHKIDGSESD

Query:  DDEDEDEDEEVDDVLYGEEEYESELSSDD
         DE EDE+E+V+D  YGE + ES+LSSDD
Subjt:  DDEDEDEDEEVDDVLYGEEEYESELSSDD


Sequences
CDS sequence
ATGGATACCGTGGGAACAAACAATTCAAAGAATGAAAATGCTGCAGAAACGACTAAGAGCAAAACTAGCAATACCATTGATCTTGATCCAGATCCACATCCAGAAGCAAA
CTTGAAGGAGAGAATTCTACCATTCCCAAAAAAAAGGCGCGTACAAAACAATCAATGGTTTGGGATCATTTTGAGAGCTAGAGATGTTCTTGAGGTCTATGCTAGGTTGA
GAGATCGTTTGAAAGAAATGTTTGCAAGTTTGAAATGTAAAGTTTATCTCACTACTGATTATTGGACGTCGGGAGCAAACATGAGTTATATGGTACTGACTGCTCATTTC
ATTGATTCTGAATGGAAATTACATAAACGAATTCTTAGCTTTTGTCAAATTGAGAATCATAGAGGGGATACCATTGCTAGTAAGTTTGAAAAAGCTTTTGACAGGTTAGA
GGATGAAGATCCTATTTATAAGAATGATTCACCACCTACCAAGGATGATTGGACAGATGCTAAGATGTTGATTACTACCATTCAGAATTGTATACTCTTGAATTCGAAGG
GTTCAAATAGAATGCTTTCTAGTATGGCTAATGCGACGAAGGCCAAGTTTGAAAAGTATTGGGGGTTTGATGAGAGGAAGAACAATATTTTGTTGTATGTTGCTGTTGTT
TTGGATCCTCGATTTAAGTTGAATACTGGAGCTGCATCTGTTTGTTTTAGTAGTGGGAGTGGTGGTGGTAGTAGTAGTGCTAGTGCTAGTACTAGTGCTAATTTCAAAAG
TGTTGAGACCACAAGTTTTGATAAGGAGTCAGAGATTGATGTATATTTGTTGGAAACTCTTGCTAAAGATAGTAGTTCCTTTGACATACTTTATTGGTGGAAGCAGAATG
GTCATCGGTTTGAGGTTCTCAGTCGTATGAGTAGAGATATATTGGCAATTCCAGTATCTACTGTAGCATCTGAGTCAGCTTTCAGTACTGGAGGACGTGTAGTCAACACA
TCTCGTTGTTCTTTGACTCCAAAGACGGTGAAGGCTCTTATATGCGCTAAAAATTGGCTAGACTCTAAACCGATATCATTAGATCTTGATGATTTGATAGCCATAGTAGA
AGCTTCAATGTTTGAAGATGGATTCCAACCAATTGTGGATGAAGAAAAGAAACCGGATGAAGTGGAGAAAGAGAGGAAGAAAGCGCCGCCGCCGGTCACGAACTCGCGTG
GGTCTTGGTGTTCAGAGACAGTTAGCCGTCGGGGTGAGCTTCTTCCGCTTAGGCACTGCCGTGGGTTTCCGTTTGAGGCGTTGAAACGCCGTGGGTCTCGGATTTGTTTC
AGAGACGTCTTCTTGCCGGCGTATTTCATAGTTAATCGCCGGGGCTGTGGGTATCGGTTCAGCATCGTGAGACGTGGTTTGGTCTGGTTTGGGTCTCAGTTCAGGGTCGT
TTTAGTTCGCCGGTGTGAGCCTCTCAGTAGCAGCAGTGGGTCTCGCTTTGAAGCTCAAATCGAAATCATGGGGTTGTCGAATATCAGAGATCTGTTGACATCTTTTAGTC
CTGATCTGCATTTCTTTGCTATCAGTTCGGGAGATGGTCGAATTAAGATTTGGGATGCGTTGAAGGGTCAGATACAGACTGAGTTTGCAGATTTCTTTGCATCTGATTCG
ACGAGCATACTCACGAAACCAGACAAAGGGCATCTATCAATTGATTATAAATGCATGAAGTGGTTTTCATTGGAGAAAAAGAGAAAAAGAAAGCGTCAGTGTTCGTTGTT
GCTGTTAGGAACTGGTAGTGGGGATGTTTTGGCTCTTGATGTAGCAGCTGGTGAATTGAAGTGGAAAATAAGTGATTGTCATCCTGGGGGTGTTACAACAATTTCGTTTC
CTACACATGGTTCAAGTATTTATACTGCTGGTGCTGATGGAATGCTATGTGAAATTGATTCTTTGACGGGTAATCTGTTGAGGAAATTCAAGGCTTCTACAAAGGCAATA
TCTTGTATTTCTGTTTCACCAGATGGGAAGATAATAGCAACTGCAGCTTCACAAATGAAGATTTTCAATTGTTCCAATCACAAAAAGATACAGAAGTTTTCTGGGCATCC
TGGAGCCGTTCGGTGTATGGTTTTTACCGAAGATGGAAAGTACATCCTTTCATCTGCGGTTGGTGAAAGGTATATCGTTGTTTGGAGTTTAGGTGGAGGGAAAAAGCAAT
CCGCTAGTTGTGTTCTTGCATTGGAACACCCTGCCATCTTTGTGGACAGCAGGCTTTCGAAAGTTGATGGAGATGATGAAACTGCTCTATACATTTTGGCTATATCAGAA
ATTGGTGTTTGTTACTTATGGTATGGACAGAATCTTGAGGAGTTGCGAAGTGCCAAGCCTACTAAAATATTGATATCTGATAATGATATTTCCTCCAAGAGTAAAAAACG
GGCGATACCTGCAATTTATGCTGCAAAATTACAAGGAATTCCTAAATCTGGGTCAGGGCAGGTGTTCCTTGCTCATGGATTGCTAGTGAAACCTTCATTTCAGAACGTTG
TGGTGCATTCTGGAACTGACATAAATTTAAACAGCTCCAACGATGGAATTCTTTTACCAAGTCAGTCGACTGCGAAGTCTAAGAAGGGCTTAGATGTACAGGGTGGAGTC
GTTGCATTAGATCGAGCTAATGCTGAAGATGCCTTACGTCCAATTCCGAAGGTTTTCGATTCTCAAGAGAAAAGTACCTTATATCACGGTTTGCAAGTTGACCGTAATGA
TGTGATGATTCAATTGGTTGACAGTGGGAGCCAAGTAGAAGATGACGTCGGAGTGGAGGACTCTGGAGCAGTTTGCATGGAGGACCAACTCCGATCATTAGGCATACTCA
ACAGTACAGATGATCACACATCTGAATCCATCCTTGAGTCTTCGATACTCAAGGGTATTGATCTTGAAGCTAATATGTCACAGAAAAAGTTAAGAGAAGCAGTTTTATCA
TTGGCACCGGGCGCTGCATGCAAGTTGCTTGAAAACTTGGTTAGCATCTGGCAGTCTAGGTTGTGTAGTGGAAAGAACGTTCTACCGTGGATTTATAGTTTATTGTTGAA
TCACAGTCAACATATCTTGTCTCAAGAACCATCGACCCAGATACTTGATTCTTTATTTAAGATGACTAAATCCAAAGAAACTGCGGTTCAATCTCTTCTTCAATTATCAG
GTCGATTGCAACTGGTGTTGGCACAAGTTGAAAGGGCATCGGCCAACAAAACCGATCAAATGATACGCTATGATCACAAAATAGATGGAAGTGAGAGTGACGATGACGAA
GACGAAGACGAAGATGAAGAAGTCGACGATGTTCTTTACGGGGAAGAAGAATATGAATCTGAATTAAGTAGCGATGATGAGAATTAG
mRNA sequence
ATGGATACCGTGGGAACAAACAATTCAAAGAATGAAAATGCTGCAGAAACGACTAAGAGCAAAACTAGCAATACCATTGATCTTGATCCAGATCCACATCCAGAAGCAAA
CTTGAAGGAGAGAATTCTACCATTCCCAAAAAAAAGGCGCGTACAAAACAATCAATGGTTTGGGATCATTTTGAGAGCTAGAGATGTTCTTGAGGTCTATGCTAGGTTGA
GAGATCGTTTGAAAGAAATGTTTGCAAGTTTGAAATGTAAAGTTTATCTCACTACTGATTATTGGACGTCGGGAGCAAACATGAGTTATATGGTACTGACTGCTCATTTC
ATTGATTCTGAATGGAAATTACATAAACGAATTCTTAGCTTTTGTCAAATTGAGAATCATAGAGGGGATACCATTGCTAGTAAGTTTGAAAAAGCTTTTGACAGGTTAGA
GGATGAAGATCCTATTTATAAGAATGATTCACCACCTACCAAGGATGATTGGACAGATGCTAAGATGTTGATTACTACCATTCAGAATTGTATACTCTTGAATTCGAAGG
GTTCAAATAGAATGCTTTCTAGTATGGCTAATGCGACGAAGGCCAAGTTTGAAAAGTATTGGGGGTTTGATGAGAGGAAGAACAATATTTTGTTGTATGTTGCTGTTGTT
TTGGATCCTCGATTTAAGTTGAATACTGGAGCTGCATCTGTTTGTTTTAGTAGTGGGAGTGGTGGTGGTAGTAGTAGTGCTAGTGCTAGTACTAGTGCTAATTTCAAAAG
TGTTGAGACCACAAGTTTTGATAAGGAGTCAGAGATTGATGTATATTTGTTGGAAACTCTTGCTAAAGATAGTAGTTCCTTTGACATACTTTATTGGTGGAAGCAGAATG
GTCATCGGTTTGAGGTTCTCAGTCGTATGAGTAGAGATATATTGGCAATTCCAGTATCTACTGTAGCATCTGAGTCAGCTTTCAGTACTGGAGGACGTGTAGTCAACACA
TCTCGTTGTTCTTTGACTCCAAAGACGGTGAAGGCTCTTATATGCGCTAAAAATTGGCTAGACTCTAAACCGATATCATTAGATCTTGATGATTTGATAGCCATAGTAGA
AGCTTCAATGTTTGAAGATGGATTCCAACCAATTGTGGATGAAGAAAAGAAACCGGATGAAGTGGAGAAAGAGAGGAAGAAAGCGCCGCCGCCGGTCACGAACTCGCGTG
GGTCTTGGTGTTCAGAGACAGTTAGCCGTCGGGGTGAGCTTCTTCCGCTTAGGCACTGCCGTGGGTTTCCGTTTGAGGCGTTGAAACGCCGTGGGTCTCGGATTTGTTTC
AGAGACGTCTTCTTGCCGGCGTATTTCATAGTTAATCGCCGGGGCTGTGGGTATCGGTTCAGCATCGTGAGACGTGGTTTGGTCTGGTTTGGGTCTCAGTTCAGGGTCGT
TTTAGTTCGCCGGTGTGAGCCTCTCAGTAGCAGCAGTGGGTCTCGCTTTGAAGCTCAAATCGAAATCATGGGGTTGTCGAATATCAGAGATCTGTTGACATCTTTTAGTC
CTGATCTGCATTTCTTTGCTATCAGTTCGGGAGATGGTCGAATTAAGATTTGGGATGCGTTGAAGGGTCAGATACAGACTGAGTTTGCAGATTTCTTTGCATCTGATTCG
ACGAGCATACTCACGAAACCAGACAAAGGGCATCTATCAATTGATTATAAATGCATGAAGTGGTTTTCATTGGAGAAAAAGAGAAAAAGAAAGCGTCAGTGTTCGTTGTT
GCTGTTAGGAACTGGTAGTGGGGATGTTTTGGCTCTTGATGTAGCAGCTGGTGAATTGAAGTGGAAAATAAGTGATTGTCATCCTGGGGGTGTTACAACAATTTCGTTTC
CTACACATGGTTCAAGTATTTATACTGCTGGTGCTGATGGAATGCTATGTGAAATTGATTCTTTGACGGGTAATCTGTTGAGGAAATTCAAGGCTTCTACAAAGGCAATA
TCTTGTATTTCTGTTTCACCAGATGGGAAGATAATAGCAACTGCAGCTTCACAAATGAAGATTTTCAATTGTTCCAATCACAAAAAGATACAGAAGTTTTCTGGGCATCC
TGGAGCCGTTCGGTGTATGGTTTTTACCGAAGATGGAAAGTACATCCTTTCATCTGCGGTTGGTGAAAGGTATATCGTTGTTTGGAGTTTAGGTGGAGGGAAAAAGCAAT
CCGCTAGTTGTGTTCTTGCATTGGAACACCCTGCCATCTTTGTGGACAGCAGGCTTTCGAAAGTTGATGGAGATGATGAAACTGCTCTATACATTTTGGCTATATCAGAA
ATTGGTGTTTGTTACTTATGGTATGGACAGAATCTTGAGGAGTTGCGAAGTGCCAAGCCTACTAAAATATTGATATCTGATAATGATATTTCCTCCAAGAGTAAAAAACG
GGCGATACCTGCAATTTATGCTGCAAAATTACAAGGAATTCCTAAATCTGGGTCAGGGCAGGTGTTCCTTGCTCATGGATTGCTAGTGAAACCTTCATTTCAGAACGTTG
TGGTGCATTCTGGAACTGACATAAATTTAAACAGCTCCAACGATGGAATTCTTTTACCAAGTCAGTCGACTGCGAAGTCTAAGAAGGGCTTAGATGTACAGGGTGGAGTC
GTTGCATTAGATCGAGCTAATGCTGAAGATGCCTTACGTCCAATTCCGAAGGTTTTCGATTCTCAAGAGAAAAGTACCTTATATCACGGTTTGCAAGTTGACCGTAATGA
TGTGATGATTCAATTGGTTGACAGTGGGAGCCAAGTAGAAGATGACGTCGGAGTGGAGGACTCTGGAGCAGTTTGCATGGAGGACCAACTCCGATCATTAGGCATACTCA
ACAGTACAGATGATCACACATCTGAATCCATCCTTGAGTCTTCGATACTCAAGGGTATTGATCTTGAAGCTAATATGTCACAGAAAAAGTTAAGAGAAGCAGTTTTATCA
TTGGCACCGGGCGCTGCATGCAAGTTGCTTGAAAACTTGGTTAGCATCTGGCAGTCTAGGTTGTGTAGTGGAAAGAACGTTCTACCGTGGATTTATAGTTTATTGTTGAA
TCACAGTCAACATATCTTGTCTCAAGAACCATCGACCCAGATACTTGATTCTTTATTTAAGATGACTAAATCCAAAGAAACTGCGGTTCAATCTCTTCTTCAATTATCAG
GTCGATTGCAACTGGTGTTGGCACAAGTTGAAAGGGCATCGGCCAACAAAACCGATCAAATGATACGCTATGATCACAAAATAGATGGAAGTGAGAGTGACGATGACGAA
GACGAAGACGAAGATGAAGAAGTCGACGATGTTCTTTACGGGGAAGAAGAATATGAATCTGAATTAAGTAGCGATGATGAGAATTAG
Protein sequence
MDTVGTNNSKNENAAETTKSKTSNTIDLDPDPHPEANLKERILPFPKKRRVQNNQWFGIILRARDVLEVYARLRDRLKEMFASLKCKVYLTTDYWTSGANMSYMVLTAHF
IDSEWKLHKRILSFCQIENHRGDTIASKFEKAFDRLEDEDPIYKNDSPPTKDDWTDAKMLITTIQNCILLNSKGSNRMLSSMANATKAKFEKYWGFDERKNNILLYVAVV
LDPRFKLNTGAASVCFSSGSGGGSSSASASTSANFKSVETTSFDKESEIDVYLLETLAKDSSSFDILYWWKQNGHRFEVLSRMSRDILAIPVSTVASESAFSTGGRVVNT
SRCSLTPKTVKALICAKNWLDSKPISLDLDDLIAIVEASMFEDGFQPIVDEEKKPDEVEKERKKAPPPVTNSRGSWCSETVSRRGELLPLRHCRGFPFEALKRRGSRICF
RDVFLPAYFIVNRRGCGYRFSIVRRGLVWFGSQFRVVLVRRCEPLSSSSGSRFEAQIEIMGLSNIRDLLTSFSPDLHFFAISSGDGRIKIWDALKGQIQTEFADFFASDS
TSILTKPDKGHLSIDYKCMKWFSLEKKRKRKRQCSLLLLGTGSGDVLALDVAAGELKWKISDCHPGGVTTISFPTHGSSIYTAGADGMLCEIDSLTGNLLRKFKASTKAI
SCISVSPDGKIIATAASQMKIFNCSNHKKIQKFSGHPGAVRCMVFTEDGKYILSSAVGERYIVVWSLGGGKKQSASCVLALEHPAIFVDSRLSKVDGDDETALYILAISE
IGVCYLWYGQNLEELRSAKPTKILISDNDISSKSKKRAIPAIYAAKLQGIPKSGSGQVFLAHGLLVKPSFQNVVVHSGTDINLNSSNDGILLPSQSTAKSKKGLDVQGGV
VALDRANAEDALRPIPKVFDSQEKSTLYHGLQVDRNDVMIQLVDSGSQVEDDVGVEDSGAVCMEDQLRSLGILNSTDDHTSESILESSILKGIDLEANMSQKKLREAVLS
LAPGAACKLLENLVSIWQSRLCSGKNVLPWIYSLLLNHSQHILSQEPSTQILDSLFKMTKSKETAVQSLLQLSGRLQLVLAQVERASANKTDQMIRYDHKIDGSESDDDE
DEDEDEEVDDVLYGEEEYESELSSDDEN