CuGenDBv2

Sgr020732 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr020732
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: DDE Tnp4 domain-containing protein
Genome location: tig00153554:592868..600620
RNA-Seq Expression: Sgr020732
Synteny: Sgr020732
Gene Ontology terms:
  GO:0016021 - integral component of membrane (cellular component)
  GO:0046872 - metal ion binding (molecular function)
InterPro domains: IPR027806 - Harbinger transposase-derived nuclease domain
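
The genome location field can be split into contig, start, and end for downstream use. The following is a minimal sketch, assuming the `contig:start..end` layout shown above and assuming the coordinates are 1-based and inclusive (the page does not state this); `GenomeLocation` and `parse_location` are hypothetical names, not part of CuGenDB.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """A contig name plus start/end coordinates (assumed 1-based, inclusive)."""
    contig: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Span in base pairs under the inclusive-coordinate assumption.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Parse a 'contig:start..end' string into its three parts."""
    match = re.fullmatch(r"\s*([^:\s]+):(\d+)\.\.(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    contig, start, end = match.groups()
    return GenomeLocation(contig, int(start), int(end))


loc = parse_location("tig00153554:592868..600620")
print(loc.contig, loc.start, loc.end)  # tig00153554 592868 600620
print(loc.length)                      # 7753
```

Under the inclusive assumption the gene spans 7753 bp on the contig; with half-open coordinates the span would be one base shorter.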


Homology
GenBank top hits (e value, % identity, alignment)
KAA0037135.1 putative nuclease HARBI1 [Cucumis melo var. makuwa]; e value: 5.7e-199; % identity: 82.53
Query:  MDSRELAALLSSLISQLLLLLFLLFPSPNPHSLLSNSTSDSSYYGNLYPFFNHLLLSQEIAASLSFLSVSRKRKRTHSTEELELGPTGDDAGDGSLGGGV
        MDS  LAALLSSLISQLLLLLFLLFPS NPHSL SNST DSS+Y NL   F H L SQ+ AASL FLSVSRKRKRT+  + LEL                
Subjt:  MDSRELAALLSSLISQLLLLLFLLFPSPNPHSLLSNSTSDSSYYGNLYPFFNHLLLSQEIAASLSFLSVSRKRKRTHSTEELELGPTGDDAGDGSLGGGV

Query:  GGGRGRV-NLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLDCRDPVGSPLNLSAEIRLGVGLSRLATGCDFSTISEQFGVSESVARFCAKQLCRVLCTN
        G   GRV +LFRTR+PDSFRNHFRMTSSTFEWLSGLLEPLL+CRDPVGSPL+LS EIRLGVGL RLATGCDFSTIS+QFGVSESVARFC+KQLCRVLCTN
Subjt:  GGGRGRV-NLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLDCRDPVGSPLNLSAEIRLGVGLSRLATGCDFSTISEQFGVSESVARFCAKQLCRVLCTN

Query:  FRFWVEFPSPNELESTSSAFEDLGGLLNCCGVIACTRFKIIRNNHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEEGRLLDSPPV
        FRFWVEFP PNELE TSSAFEDL GL NCCGV++CTRFKIIRN+HFYEDS+ATQLVVDSSSRILSIVAGFRG+KDDSTVLMSSTLFKDIE+GRLL+SPPV
Subjt:  FRFWVEFPSPNELESTSSAFEDLGGLLNCCGVIACTRFKIIRNNHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEEGRLLDSPPV

Query:  YLHGVAVNQYLFGHGEYPLLPWLMVPFAGAGSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMSEE
        YLHGVAVN+YLFG GEYPLLPWL+VPFAGA SGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQP+HEEFKTAVAYIGACSILHNALLMREDFSAM++E
Subjt:  YLHGVAVNQYLFGHGEYPLLPWLMVPFAGAGSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMSEE

Query:  WESSASLDHSSQYIGVGLNEDSTDEKASIIQRALA
        WES +SLDH SQY+  GLN DST+EKAS+IQRALA
Subjt:  WESSASLDHSSQYIGVGLNEDSTDEKASIIQRALA

KAG6600319.1 Protein ALP1-like protein, partial [Cucurbita argyrosperma subsp. sororia]; e value: 1.0e-195; % identity: 80.96
Query:  MDSRELAALLSSLISQLLLLLFLLFPSPNPHSLLSNSTSDSSYYGNLYPFFNHLLLSQEIAASLSFLSVSRKRKRTHSTEELELGPTGDDAGDGSLGGGV
        MDSR+LAALLSSLISQLLLLL LLFPS NPHSLLSNS+SDS++Y NL+P FNH L SQ+IAASLSFLSVSRKRKRTHS+E LELGP        S  GG 
Subjt:  MDSRELAALLSSLISQLLLLLFLLFPSPNPHSLLSNSTSDSSYYGNLYPFFNHLLLSQEIAASLSFLSVSRKRKRTHSTEELELGPTGDDAGDGSLGGGV

Query:  GGGRGRVNLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLDCRDPVGSPLNLSAEIRLGVGLSRLATGCDFSTISEQFGVSESVARFCAKQLCRVLCTNF
         GGRGRV+L RTRSPDSFRNHFRMTSSTFEWLSGLLEPLL+CRDPVGSPL+LSAEIRLGVGLSRLATGCDFSTIS+QFGVSESVARFCAKQLCRVLCTNF
Subjt:  GGGRGRVNLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLDCRDPVGSPLNLSAEIRLGVGLSRLATGCDFSTISEQFGVSESVARFCAKQLCRVLCTNF

Query:  RFWVEFPSPNELESTSSAFEDLGGLLNCCGVIACTRFKIIRNNHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEEGRLLDSPPVY
        RFWVEFP P+ELE TSS+FED+ GL NCCGVI+CT                            SIVAGFRGDKDDSTVLMS+TLFKDIEEGRLL SPPVY
Subjt:  RFWVEFPSPNELESTSSAFEDLGGLLNCCGVIACTRFKIIRNNHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEEGRLLDSPPVY

Query:  LHGVAVNQYLFGHGEYPLLPWLMVPFAGAGSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMSEEW
        LHG+AVNQYLFGHGEYPLLPWLMVPFAGA SGSTEESFNEAHRLMCIPALKAI+SLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDF+AM++EW
Subjt:  LHGVAVNQYLFGHGEYPLLPWLMVPFAGAGSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMSEEW

Query:  ESSASLDHSSQYIGVGLNEDSTDEKASIIQRALALK
        ES ASLDHSSQY+G+GLNEDS DEKA +IQ+ALAL+
Subjt:  ESSASLDHSSQYIGVGLNEDSTDEKASIIQRALALK

KAG7030976.1 Protein ALP1-like protein [Cucurbita argyrosperma subsp. argyrosperma]; e value: 5.0e-195; % identity: 80.73
Query:  MDSRELAALLSSLISQLLLLLFLLFPSPNPHSLLSNSTSDSSYYGNLYPFFNHLLLSQEIAASLSFLSVSRKRKRTHSTEELELGPTGDDAGDGSLGGGV
        MDSR+LAALLSSLISQLLLLL LLFPS NPHSLLSNS+SDS++Y NL+P FNH L SQ+IAASLSFLSVSRKRKRTHS+E LELGP        S  GG 
Subjt:  MDSRELAALLSSLISQLLLLLFLLFPSPNPHSLLSNSTSDSSYYGNLYPFFNHLLLSQEIAASLSFLSVSRKRKRTHSTEELELGPTGDDAGDGSLGGGV

Query:  GGGRGRVNLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLDCRDPVGSPLNLSAEIRLGVGLSRLATGCDFSTISEQFGVSESVARFCAKQLCRVLCTNF
         GGRGRV+L RTRSPDSFRNHFRMTSSTFEWLSGLLEPLL+CRDPVGSPL+LSAEIRLGVGLSRLATGCDFSTIS+QFGVSESVARFCAKQLCRVLCTNF
Subjt:  GGGRGRVNLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLDCRDPVGSPLNLSAEIRLGVGLSRLATGCDFSTISEQFGVSESVARFCAKQLCRVLCTNF

Query:  RFWVEFPSPNELESTSSAFEDLGGLLNCCGVIACTRFKIIRNNHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEEGRLLDSPPVY
        RFWVEFP P+ELE TSS+FED+ GL NCCGVI+CT                            SIVAGFRGDKDDSTVLMS+ LFKDIEEGRLL SPPVY
Subjt:  RFWVEFPSPNELESTSSAFEDLGGLLNCCGVIACTRFKIIRNNHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEEGRLLDSPPVY

Query:  LHGVAVNQYLFGHGEYPLLPWLMVPFAGAGSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMSEEW
        LHG+AVNQYLFGHGEYPLLPWLMVPFAGA SGSTEESFNEAHRLMCIPALKAI+SLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDF+AM++EW
Subjt:  LHGVAVNQYLFGHGEYPLLPWLMVPFAGAGSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMSEEW

Query:  ESSASLDHSSQYIGVGLNEDSTDEKASIIQRALALK
        ES ASLDHSSQY+G+GLNEDS DEKA +IQ+ALAL+
Subjt:  ESSASLDHSSQYIGVGLNEDSTDEKASIIQRALALK

KGN57516.1 hypothetical protein Csa_011580 [Cucumis sativus]; e value: 4.0e-200; % identity: 82.38
Query:  MDSRELAALLSSLISQLLLLLFLLFPSPNPHSLLSNSTSDSSYYGNLYPFFNHLLLSQEIAASLSFLSVSRKRKRTHSTEELELGPTGDDAGDGSLGGGV
        MDS  LAALLSSLISQLLLLLFLLFPS NPHSL SNS  DSS+Y NL   F H L SQ+ AASL FLSVSRKRKRT+ ++ LEL                
Subjt:  MDSRELAALLSSLISQLLLLLFLLFPSPNPHSLLSNSTSDSSYYGNLYPFFNHLLLSQEIAASLSFLSVSRKRKRTHSTEELELGPTGDDAGDGSLGGGV

Query:  GGGRGRV-NLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLDCRDPVGSPLNLSAEIRLGVGLSRLATGCDFSTISEQFGVSESVARFCAKQLCRVLCTN
        G   GRV +LFRTR+PDSFRNHFRMTSSTFEWLSGLLEPLL+CRDPVGSPL+LS EIRLGVGL RLATGCDFSTIS+QFGVSESVARFC+KQLCRVLCTN
Subjt:  GGGRGRV-NLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLDCRDPVGSPLNLSAEIRLGVGLSRLATGCDFSTISEQFGVSESVARFCAKQLCRVLCTN

Query:  FRFWVEFPSPNELESTSSAFEDLGGLLNCCGVIACTRFKIIRNNHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEEGRLLDSPPV
        FRFWVEFP PNELE TSSAFEDL GL NCCGV++CTRFKIIRN+HFYEDS+ATQLVVDSSSRILSIVAGFRG+KDDSTVLMSSTLFKDIE+GRLL+SPPV
Subjt:  FRFWVEFPSPNELESTSSAFEDLGGLLNCCGVIACTRFKIIRNNHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEEGRLLDSPPV

Query:  YLHGVAVNQYLFGHGEYPLLPWLMVPFAGAGSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMSEE
        YLHGVAVN+YLFGHGEYPLLPWL+VPFAGA SGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQP+HEEFKTAVAYIGACSILHNALLMREDFSAM++E
Subjt:  YLHGVAVNQYLFGHGEYPLLPWLMVPFAGAGSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMSEE

Query:  WESSASLDHSSQYIGVGLNEDSTDEKASIIQRALALK
        WES +SLDH SQY+  GLN DST+EKAS+IQRALAL+
Subjt:  WESSASLDHSSQYIGVGLNEDSTDEKASIIQRALALK

XP_023536005.1 protein ALP1-like [Cucurbita pepo subsp. pepo]; e value: 1.2e-196; % identity: 81.42
Query:  MDSRELAALLSSLISQLLLLLFLLFPSPNPHSLLSNSTSDSSYYGNLYPFFNHLLLSQEIAASLSFLSVSRKRKRTHSTEELELGPTGDDAGDGSLGGGV
        MDSR+LAALLSSLISQLLLLL LLFPS NPHSLLSNS+SDS++Y NL+P FNH L SQ+IAASLSFLSVSRKRKRTHS+E LELGP        S  GG 
Subjt:  MDSRELAALLSSLISQLLLLLFLLFPSPNPHSLLSNSTSDSSYYGNLYPFFNHLLLSQEIAASLSFLSVSRKRKRTHSTEELELGPTGDDAGDGSLGGGV

Query:  GGGRGRVNLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLDCRDPVGSPLNLSAEIRLGVGLSRLATGCDFSTISEQFGVSESVARFCAKQLCRVLCTNF
         GGRGRV+L RTRSPDSFRNHFRMTSSTFEWLSGLLEPLL+CRDPVGSPL+LSAEIRLGVGLSRLATGCDFSTIS+QFGVSESVARFCAKQLCRVLCTNF
Subjt:  GGGRGRVNLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLDCRDPVGSPLNLSAEIRLGVGLSRLATGCDFSTISEQFGVSESVARFCAKQLCRVLCTNF

Query:  RFWVEFPSPNELESTSSAFEDLGGLLNCCGVIACTRFKIIRNNHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEEGRLLDSPPVY
        RFWVEFP P+ELE TSSAFED+ GL NCCGVI+CT                            SIVAGFRGDKDDSTVLMS+TLFKDIEEGRLL SPPVY
Subjt:  RFWVEFPSPNELESTSSAFEDLGGLLNCCGVIACTRFKIIRNNHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEEGRLLDSPPVY

Query:  LHGVAVNQYLFGHGEYPLLPWLMVPFAGAGSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMSEEW
        LHG+AVNQYLFGHGEYPLLPWLMVPFAGA SGSTEESFNEAHRLMCIPALKAI+SLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDF+AM++EW
Subjt:  LHGVAVNQYLFGHGEYPLLPWLMVPFAGAGSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMSEEW

Query:  ESSASLDHSSQYIGVGLNEDSTDEKASIIQRALALK
        ES ASLDHSSQY+G+GLNEDS DEKAS+IQ+ALAL+
Subjt:  ESSASLDHSSQYIGVGLNEDSTDEKASIIQRALALK

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LBX6 DDE Tnp4 domain-containing protein; e value: 1.9e-200; % identity: 82.38
Query:  MDSRELAALLSSLISQLLLLLFLLFPSPNPHSLLSNSTSDSSYYGNLYPFFNHLLLSQEIAASLSFLSVSRKRKRTHSTEELELGPTGDDAGDGSLGGGV
        MDS  LAALLSSLISQLLLLLFLLFPS NPHSL SNS  DSS+Y NL   F H L SQ+ AASL FLSVSRKRKRT+ ++ LEL                
Subjt:  MDSRELAALLSSLISQLLLLLFLLFPSPNPHSLLSNSTSDSSYYGNLYPFFNHLLLSQEIAASLSFLSVSRKRKRTHSTEELELGPTGDDAGDGSLGGGV

Query:  GGGRGRV-NLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLDCRDPVGSPLNLSAEIRLGVGLSRLATGCDFSTISEQFGVSESVARFCAKQLCRVLCTN
        G   GRV +LFRTR+PDSFRNHFRMTSSTFEWLSGLLEPLL+CRDPVGSPL+LS EIRLGVGL RLATGCDFSTIS+QFGVSESVARFC+KQLCRVLCTN
Subjt:  GGGRGRV-NLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLDCRDPVGSPLNLSAEIRLGVGLSRLATGCDFSTISEQFGVSESVARFCAKQLCRVLCTN

Query:  FRFWVEFPSPNELESTSSAFEDLGGLLNCCGVIACTRFKIIRNNHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEEGRLLDSPPV
        FRFWVEFP PNELE TSSAFEDL GL NCCGV++CTRFKIIRN+HFYEDS+ATQLVVDSSSRILSIVAGFRG+KDDSTVLMSSTLFKDIE+GRLL+SPPV
Subjt:  FRFWVEFPSPNELESTSSAFEDLGGLLNCCGVIACTRFKIIRNNHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEEGRLLDSPPV

Query:  YLHGVAVNQYLFGHGEYPLLPWLMVPFAGAGSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMSEE
        YLHGVAVN+YLFGHGEYPLLPWL+VPFAGA SGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQP+HEEFKTAVAYIGACSILHNALLMREDFSAM++E
Subjt:  YLHGVAVNQYLFGHGEYPLLPWLMVPFAGAGSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMSEE

Query:  WESSASLDHSSQYIGVGLNEDSTDEKASIIQRALALK
        WES +SLDH SQY+  GLN DST+EKAS+IQRALAL+
Subjt:  WESSASLDHSSQYIGVGLNEDSTDEKASIIQRALALK

A0A5D3CRB2 Putative nuclease HARBI1; e value: 2.8e-199; % identity: 82.53
Query:  MDSRELAALLSSLISQLLLLLFLLFPSPNPHSLLSNSTSDSSYYGNLYPFFNHLLLSQEIAASLSFLSVSRKRKRTHSTEELELGPTGDDAGDGSLGGGV
        MDS  LAALLSSLISQLLLLLFLLFPS NPHSL SNST DSS+Y NL   F H L SQ+ AASL FLSVSRKRKRT+  + LEL                
Subjt:  MDSRELAALLSSLISQLLLLLFLLFPSPNPHSLLSNSTSDSSYYGNLYPFFNHLLLSQEIAASLSFLSVSRKRKRTHSTEELELGPTGDDAGDGSLGGGV

Query:  GGGRGRV-NLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLDCRDPVGSPLNLSAEIRLGVGLSRLATGCDFSTISEQFGVSESVARFCAKQLCRVLCTN
        G   GRV +LFRTR+PDSFRNHFRMTSSTFEWLSGLLEPLL+CRDPVGSPL+LS EIRLGVGL RLATGCDFSTIS+QFGVSESVARFC+KQLCRVLCTN
Subjt:  GGGRGRV-NLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLDCRDPVGSPLNLSAEIRLGVGLSRLATGCDFSTISEQFGVSESVARFCAKQLCRVLCTN

Query:  FRFWVEFPSPNELESTSSAFEDLGGLLNCCGVIACTRFKIIRNNHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEEGRLLDSPPV
        FRFWVEFP PNELE TSSAFEDL GL NCCGV++CTRFKIIRN+HFYEDS+ATQLVVDSSSRILSIVAGFRG+KDDSTVLMSSTLFKDIE+GRLL+SPPV
Subjt:  FRFWVEFPSPNELESTSSAFEDLGGLLNCCGVIACTRFKIIRNNHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEEGRLLDSPPV

Query:  YLHGVAVNQYLFGHGEYPLLPWLMVPFAGAGSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMSEE
        YLHGVAVN+YLFG GEYPLLPWL+VPFAGA SGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQP+HEEFKTAVAYIGACSILHNALLMREDFSAM++E
Subjt:  YLHGVAVNQYLFGHGEYPLLPWLMVPFAGAGSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMSEE

Query:  WESSASLDHSSQYIGVGLNEDSTDEKASIIQRALA
        WES +SLDH SQY+  GLN DST+EKAS+IQRALA
Subjt:  WESSASLDHSSQYIGVGLNEDSTDEKASIIQRALA

A0A6J1D7F1 protein ALP1-like; e value: 1.7e-193; % identity: 80.96
Query:  MDSRELAALLSSLISQLLLLLFLLFPSPNPHSLLSNSTSDSSYYGNLYPFFNHLLLSQEIAASLSFLSVSRKRKRTHSTEELELGPTGDDAGDGSLGGGV
        MDSRELAAL+SSLISQLLL LFLLFPS NPHSLLSN  SDS +Y N +P   H L SQEIA+SLSFLSVSRKRKRTH  E+LEL P+         GGG 
Subjt:  MDSRELAALLSSLISQLLLLLFLLFPSPNPHSLLSNSTSDSSYYGNLYPFFNHLLLSQEIAASLSFLSVSRKRKRTHSTEELELGPTGDDAGDGSLGGGV

Query:  GGGRGRVNLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLDCRDPVGSPLNLSAEIRLGVGLSRLATGCDFSTISEQFGVSESVARFCAKQLCRVLCTNF
        GGGRGRV+L  TR PDSFRNHFRMTSSTFEWLSGLLEPLL+CRDPVGSPLNLSAEIRLGVGLSRLATGCDFSTISEQFGVSESVARFCAKQLCRVLCTNF
Subjt:  GGGRGRVNLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLDCRDPVGSPLNLSAEIRLGVGLSRLATGCDFSTISEQFGVSESVARFCAKQLCRVLCTNF

Query:  RFWVEFPSPNELESTSSAFEDLGGLLNCCGVIACTRFKIIRNNHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEEGRLLDSPPVY
        RFWVEFP PNELESTSSAFE L GL NCCGV+ACT                            SIVAGFRGDKDDSTVLMSSTLFKDIEEGRLLDSPPVY
Subjt:  RFWVEFPSPNELESTSSAFEDLGGLLNCCGVIACTRFKIIRNNHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEEGRLLDSPPVY

Query:  LHGVAVNQYLFGHGEYPLLPWLMVPFAGAGSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMSEEW
        LHG+AVNQY FGHGEYPLLPWLMVPF+GA SGSTEESFN+AHRLMCIPALKAIVSLRNWGVLSQPM EEFKTAVAYIGACSILHNALLMREDFSAM++EW
Subjt:  LHGVAVNQYLFGHGEYPLLPWLMVPFAGAGSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMSEEW

Query:  ESSASLDHSSQYIGVGLNEDSTDEKASIIQRALALK
        E  ASLDH SQYIG GLNEDSTDEKAS+IQRALAL+
Subjt:  ESSASLDHSSQYIGVGLNEDSTDEKASIIQRALALK

A0A6J1FNZ2 protein ALP1-like; e value: 4.1e-195; % identity: 80.5
Query:  MDSRELAALLSSLISQLLLLLFLLFPSPNPHSLLSNSTSDSSYYGNLYPFFNHLLLSQEIAASLSFLSVSRKRKRTHSTEELELGPTGDDAGDGSLGGGV
        MDSR+LAALLSSLISQLLLLL LLFPS NPHSLLSNS+SDS++Y NL+P FNH L SQ+IAASLSFLSVSRKRKRTHS+E LELGP        S  GG 
Subjt:  MDSRELAALLSSLISQLLLLLFLLFPSPNPHSLLSNSTSDSSYYGNLYPFFNHLLLSQEIAASLSFLSVSRKRKRTHSTEELELGPTGDDAGDGSLGGGV

Query:  GGGRGRVNLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLDCRDPVGSPLNLSAEIRLGVGLSRLATGCDFSTISEQFGVSESVARFCAKQLCRVLCTNF
         GGRGRV+L RTRSPDSFRNHFRMTSSTFEWLSGLLEPLL+CRDPVGSPL+LSAEIRLGVGLSRLATGCDFSTIS+QFGVSESVARFCAKQLCRVLCTNF
Subjt:  GGGRGRVNLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLDCRDPVGSPLNLSAEIRLGVGLSRLATGCDFSTISEQFGVSESVARFCAKQLCRVLCTNF

Query:  RFWVEFPSPNELESTSSAFEDLGGLLNCCGVIACTRFKIIRNNHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEEGRLLDSPPVY
        RFWVEFP P+ELE TSS+FED+ GL NCCGVI+CT                            SIVAGFRGDKDDSTVLMS+TLFKDIEE RLL SPPVY
Subjt:  RFWVEFPSPNELESTSSAFEDLGGLLNCCGVIACTRFKIIRNNHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEEGRLLDSPPVY

Query:  LHGVAVNQYLFGHGEYPLLPWLMVPFAGAGSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMSEEW
        LHG+AVNQYLFGHGEYPLLPWLMVPFAGA SGSTEESFNEAHRLMCIPALKAI+SLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDF+AM++EW
Subjt:  LHGVAVNQYLFGHGEYPLLPWLMVPFAGAGSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMSEEW

Query:  ESSASLDHSSQYIGVGLNEDSTDEKASIIQRALALK
        ES ASLDHSSQY+G+GLNEDS DEKA+++Q+ALAL+
Subjt:  ESSASLDHSSQYIGVGLNEDSTDEKASIIQRALALK

A0A6J1J0M5 protein ALP1-like; e value: 7.1e-195; % identity: 80.96
Query:  MDSRELAALLSSLISQLLLLLFLLFPSPNPHSLLSNSTSDSSYYGNLYPFFNHLLLSQEIAASLSFLSVSRKRKRTHSTEELELGPTGDDAGDGSLGGGV
        MDSR+LAALLSSLISQLLLLL LLFPS NPHSLLSNS+SDS++Y NL+P FNH L SQ+IAASLSFLSVSRKRKRTHS+E LELGP        S  GG 
Subjt:  MDSRELAALLSSLISQLLLLLFLLFPSPNPHSLLSNSTSDSSYYGNLYPFFNHLLLSQEIAASLSFLSVSRKRKRTHSTEELELGPTGDDAGDGSLGGGV

Query:  GGGRGRVNLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLDCRDPVGSPLNLSAEIRLGVGLSRLATGCDFSTISEQFGVSESVARFCAKQLCRVLCTNF
         GGRGRV+L RTRSPDSFRNHFRMTSSTFEWLSGLLEPLL+CRDPVGSPL+LSAEIRLGVGLSRLATGCDFSTIS+QFGVSESVARFCAKQLCRVLCTNF
Subjt:  GGGRGRVNLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLDCRDPVGSPLNLSAEIRLGVGLSRLATGCDFSTISEQFGVSESVARFCAKQLCRVLCTNF

Query:  RFWVEFPSPNELESTSSAFEDLGGLLNCCGVIACTRFKIIRNNHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEEGRLLDSPPVY
        RFWVEFP P+ELE TSSAFED+ GL NCCGVI+CT                            SIVAGFRGDKDDSTVLMS+TLFKDIEE RLL SPPVY
Subjt:  RFWVEFPSPNELESTSSAFEDLGGLLNCCGVIACTRFKIIRNNHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEEGRLLDSPPVY

Query:  LHGVAVNQYLFGHGEYPLLPWLMVPFAGAGSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMSEEW
        LHGVAVNQYLFGHG+YPLLPWLMVPFAGA SGSTEESFNEAHRLM IPALKAI+SLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDF+AM++EW
Subjt:  LHGVAVNQYLFGHGEYPLLPWLMVPFAGAGSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMSEEW

Query:  ESSASLDHSSQYIGVGLNEDSTDEKASIIQRALALK
        ES ASLDHSSQY+G+GLNEDS DEKAS+IQ+ALAL+
Subjt:  ESSASLDHSSQYIGVGLNEDSTDEKASIIQRALALK

SwissProt top hits (e value, % identity, alignment)
Q94K49 Protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1; e value: 1.6e-26; % identity: 28.67
Query:  SFRNHFRMTSSTFEWLSGLLEPLLDCRDPVG----SPLNLSAEIRLGVGLSRLATGCDFSTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEFPSPNEL
        +F++ FR + +TF ++  L+   L  R P G        LS E ++ + L RLA+G    ++   FGV +S       +    L    +  + +P  + +
Subjt:  SFRNHFRMTSSTFEWLSGLLEPLLDCRDPVG----SPLNLSAEIRLGVGLSRLATGCDFSTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEFPSPNEL

Query:  ESTSSAFEDLGGLLNCCGVIACTR----FKIIRNNHFYED-----SIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEEGRLLDSPPVYL-H
        E   S FE++ GL NCCG I  T        ++ +  + D     S+  Q V D   R L++V G+ G    S +L  S  FK  E  ++LD  P  L  
Subjt:  ESTSSAFEDLGGLLNCCGVIACTR----FKIIRNNHFYED-----SIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEEGRLLDSPPVYL-H

Query:  GVAVNQYLFGHGEYPLLPWLMVPFAGAGSGSTEESFNEAHRLMCIPALKAIVSLR-NWGVLSQPM-HEEFKTAVAYIGACSILHNALLMREDF
        G  + +Y+ G   YPLLPWL+ P        +  +FNE H  +   A  A   L+ +W +LS+ M   + +   + I  C +LHN ++   D+
Subjt:  GVAVNQYLFGHGEYPLLPWLMVPFAGAGSGSTEESFNEAHRLMCIPALKAIVSLR-NWGVLSQPM-HEEFKTAVAYIGACSILHNALLMREDF

Q9M2U3 Protein ALP1-like; e value: 4.9e-28; % identity: 28.57
Query:  PDSFRNHFRMTSSTFEWLSGLLEPLLDCR-----DPVGSPLNLSAEIRLGVGLSRLATGCDFSTISEQFGVSESVA-----RFCAKQLCRVLCTNFRFWV
        P +F + F+++  TF+++  L++     +     D  G+PL+L+   R+ V L RL +G   S I E FG+++S       RF      R +  +   W 
Subjt:  PDSFRNHFRMTSSTFEWLSGLLEPLLDCR-----DPVGSPLNLSAEIRLGVGLSRLATGCDFSTISEQFGVSESVA-----RFCAKQLCRVLCTNFRFWV

Query:  EFPSPNELESTSSAFEDLGGLLNCCGVIACTRF-----KIIRNNHFYED-----SIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEEGRLL
            P++L+   S FE + GL NCCG I  T        +  +N  + D     S+  Q VVD   R L ++AG+ G  +D  VL +S  +K +E+G+ L
Subjt:  EFPSPNELESTSSAFEDLGGLLNCCGVIACTRF-----KIIRNNHFYED-----SIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEEGRLL

Query:  DSPPVYL-HGVAVNQYLFGHGEYPLLPWLMVPFAGAGSGSTEESFNEAHRLMCIPALKAIVSLRN-WGVLSQPMHEEFKTAV-AYIGACSILHNALLMRE
        +   + L     + +Y+ G   +PLLPWL+ P+ G  +   +  FN+ H      A  A+  L++ W +++  M    +  +   I  C +LHN ++  E
Subjt:  DSPPVYL-HGVAVNQYLFGHGEYPLLPWLMVPFAGAGSGSTEESFNEAHRLMCIPALKAIVSLRN-WGVLSQPMHEEFKTAV-AYIGACSILHNALLMRE

Query:  D
        D
Subjt:  D

Arabidopsis top hits (e value, % identity, alignment)
AT1G72270.1 CONTAINS InterPro DOMAIN/s: Ribosome 60S biogenesis N-terminal (InterPro:IPR021714); e value: 5.4e-14; % identity: 27.68
Query:  HFRMTSSTFEWLSGLLEPLLDCRDPVGSPLNLSAEIRLGVGLSRLATGCDFSTISEQFGV-SESVARFCAKQLCRVLCTNFRFWVEFPSPNELESTSSAF
        +FRM+ STF  L  +              L+ S+       + RLA G  +  +  +FG  S S A      +C+++       ++ P P+   +     
Subjt:  HFRMTSSTFEWLSGLLEPLLDCRDPVGSPLNLSAEIRLGVGLSRLATGCDFSTISEQFGV-SESVARFCAKQLCRVLCTNFRFWVEFPSPNELESTSSAF

Query:  EDLGGLLNCCGVIACTRFKIIRNNHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEEGRLLDSPPVYL-HGVAVNQYLFGHGEYPL
             L NC GV+   RF++       + SI  Q +VDS+ R + I AG+        +   + LF   EE  +L   P  L +GV V +Y+ G    PL
Subjt:  EDLGGLLNCCGVIACTRFKIIRNNHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEEGRLLDSPPVYL-HGVAVNQYLFGHGEYPL

Query:  LPWLMVPFAGAGSGSTEESFNEAHRLMCIPALK----AIVSLR-NWGVLS---QPMHEEFKTAVAYIGACSILHNALLMREDFSAMSEE
        LPWL+ P+      S EESF E    +    L     A   +R  W +L    +P   EF   V   G   +LHN L+   D     EE
Subjt:  LPWLMVPFAGAGSGSTEESFNEAHRLMCIPALK----AIVSLR-NWGVLS---QPMHEEFKTAVAYIGACSILHNALLMREDFSAMSEE

AT3G19120.1 PIF / Ping-Pong family of plant transposases; e value: 1.3e-20; % identity: 24.74
Query:  SLISQLLLLLFLLFPSPNPHSLLSNSTSDSSYYGNLYPFFNHL-LLSQEIAASLSFLSVSRKRKRTHSTEELELGPTGDDAGDGSLGGGVGGGRGRVNLF
        +++S LL L   L P+    S  S S+  S+   +L    +   LL   +A+ LSFL+V+R    + S+ E           DG             +++
Subjt:  SLISQLLLLLFLLFPSPNPHSLLSNSTSDSSYYGNLYPFFNHL-LLSQEIAASLSFLSVSRKRKRTHSTEELELGPTGDDAGDGSLGGGVGGGRGRVNLF

Query:  RTRSP---DSFRNHFRMTSSTFEWLSGLLEPLLDCRDPVGSPLNLSAEIRLGVGLSRLATGCDFSTISEQFGVSESVARFCAKQLCRVLCTN-FRFWVEF
           +P     +R+ + ++   F  +   L+P +       S L+L A+  + + LSRLA GC   T++ ++ +   +       + R+L T  +  +++ 
Subjt:  RTRSP---DSFRNHFRMTSSTFEWLSGLLEPLLDCRDPVGSPLNLSAEIRLGVGLSRLATGCDFSTISEQFGVSESVARFCAKQLCRVLCTN-FRFWVEF

Query:  P-SPNELESTSSAFEDLGGLLNCCGVIACTRFKIIRNN----------HFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEEGRLLD
        P     L  T+  FE+L  L N CG I  T  K+ R             +  D++  Q+V D       +     G +DDS+    S L+K +  G ++ 
Subjt:  P-SPNELESTSSAFEDLGGLLNCCGVIACTRFKIIRNN----------HFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEEGRLLD

Query:  SPPVYLHGVAVNQYLFGHGEYPLLPWLMVPFAGAGSGSTEESFNEAHRLMCIPALKAIVSL--RNWGVLSQPMHEEFKTAVAYIGACSILHN
           + + G  V  Y+ G   YPLL +LM PF+  GSG+  E+  +   +     +   + L    W +L Q ++     A   I AC +LHN
Subjt:  SPPVYLHGVAVNQYLFGHGEYPLLPWLMVPFAGAGSGSTEESFNEAHRLMCIPALKAIVSL--RNWGVLSQPMHEEFKTAVAYIGACSILHN

AT3G55350.1 PIF / Ping-Pong family of plant transposases; e value: 3.5e-29; % identity: 28.57
Query:  PDSFRNHFRMTSSTFEWLSGLLEPLLDCR-----DPVGSPLNLSAEIRLGVGLSRLATGCDFSTISEQFGVSESVA-----RFCAKQLCRVLCTNFRFWV
        P +F + F+++  TF+++  L++     +     D  G+PL+L+   R+ V L RL +G   S I E FG+++S       RF      R +  +   W 
Subjt:  PDSFRNHFRMTSSTFEWLSGLLEPLLDCR-----DPVGSPLNLSAEIRLGVGLSRLATGCDFSTISEQFGVSESVA-----RFCAKQLCRVLCTNFRFWV

Query:  EFPSPNELESTSSAFEDLGGLLNCCGVIACTRF-----KIIRNNHFYED-----SIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEEGRLL
            P++L+   S FE + GL NCCG I  T        +  +N  + D     S+  Q VVD   R L ++AG+ G  +D  VL +S  +K +E+G+ L
Subjt:  EFPSPNELESTSSAFEDLGGLLNCCGVIACTRF-----KIIRNNHFYED-----SIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEEGRLL

Query:  DSPPVYL-HGVAVNQYLFGHGEYPLLPWLMVPFAGAGSGSTEESFNEAHRLMCIPALKAIVSLRN-WGVLSQPMHEEFKTAV-AYIGACSILHNALLMRE
        +   + L     + +Y+ G   +PLLPWL+ P+ G  +   +  FN+ H      A  A+  L++ W +++  M    +  +   I  C +LHN ++  E
Subjt:  DSPPVYL-HGVAVNQYLFGHGEYPLLPWLMVPFAGAGSGSTEESFNEAHRLMCIPALKAIVSLRN-WGVLSQPMHEEFKTAV-AYIGACSILHNALLMRE

Query:  D
        D
Subjt:  D

AT3G63270.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912); e value: 1.1e-27; % identity: 28.67
Query:  SFRNHFRMTSSTFEWLSGLLEPLLDCRDPVG----SPLNLSAEIRLGVGLSRLATGCDFSTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEFPSPNEL
        +F++ FR + +TF ++  L+   L  R P G        LS E ++ + L RLA+G    ++   FGV +S       +    L    +  + +P  + +
Subjt:  SFRNHFRMTSSTFEWLSGLLEPLLDCRDPVG----SPLNLSAEIRLGVGLSRLATGCDFSTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEFPSPNEL

Query:  ESTSSAFEDLGGLLNCCGVIACTR----FKIIRNNHFYED-----SIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEEGRLLDSPPVYL-H
        E   S FE++ GL NCCG I  T        ++ +  + D     S+  Q V D   R L++V G+ G    S +L  S  FK  E  ++LD  P  L  
Subjt:  ESTSSAFEDLGGLLNCCGVIACTR----FKIIRNNHFYED-----SIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEEGRLLDSPPVYL-H

Query:  GVAVNQYLFGHGEYPLLPWLMVPFAGAGSGSTEESFNEAHRLMCIPALKAIVSLR-NWGVLSQPM-HEEFKTAVAYIGACSILHNALLMREDF
        G  + +Y+ G   YPLLPWL+ P        +  +FNE H  +   A  A   L+ +W +LS+ M   + +   + I  C +LHN ++   D+
Subjt:  GVAVNQYLFGHGEYPLLPWLMVPFAGAGSGSTEESFNEAHRLMCIPALKAIVSLR-NWGVLSQPM-HEEFKTAVAYIGACSILHNALLMREDF

AT4G29780.1 unknown protein; e value: 7.9e-21; % identity: 27.39
Query:  DSFRNHFRMTSSTFEWLSGLLEPLLD-----CRDPVGSPLNLSAEIRLGVGLSRLATGCDFSTISEQFGVSESVARFCAKQLCR----VLCTNFRFWVEF
        D FR  FRM+ STF  +   L+  +       RD + +P       R+GV + RLATG     +SE+FG+  S       ++CR    VL   +  W   
Subjt:  DSFRNHFRMTSSTFEWLSGLLEPLLD-----CRDPVGSPLNLSAEIRLGVGLSRLATGCDFSTISEQFGVSESVARFCAKQLCR----VLCTNFRFWVEF

Query:  PSPNELESTSSAFEDLGGLLNCCGVIACTRFKII---------------RNNHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKD-IEE
        PS +E+ ST + FE +  + N  G I  T   II                 N     SI  Q VV++      +  G  G   D  +L  S+L +     
Subjt:  PSPNELESTSSAFEDLGGLLNCCGVIACTRFKII---------------RNNHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKD-IEE

Query:  GRLLDSPPVYLHGVAVNQYLFGHGEYPLLPWLMVPFAGAGSGSTEESFNEAHRLMCIPALKAIVSLR-NWGVLSQPMHEEFKTAVAYIGACSILHNALLM
        G L DS            ++ G+  +PL  +L+VP+       T+ +FNE+   +   A  A   L+  W  L +    + +     +GAC +LHN   M
Subjt:  GRLLDSPPVYLHGVAVNQYLFGHGEYPLLPWLMVPFAGAGSGSTEESFNEAHRLMCIPALKAIVSLR-NWGVLSQPMHEEFKTAVAYIGACSILHNALLM

Query:  RED
        R++
Subjt:  RED
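
The % identity values listed with each hit can be approximated from the alignment blocks themselves. The following is a minimal sketch, assuming the usual BLAST-style convention in which the middle line repeats the residue for an identical position, shows `+` for a conservative substitution, and is blank otherwise; `alignment_stats` and `percent_identity` are hypothetical helper names. The inputs are the residue strings only, with the `Query:` prefix and the matching leading indent of the middle line removed.

```python
def alignment_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, aligned_length) for one alignment block.

    The aligned length is taken from the query line (gaps included), because
    trailing blanks on the middle line are often lost when a page is copied.
    """
    length = len(query)
    # Restore any stripped trailing blanks so both lines have equal length.
    midline = midline.ljust(length)[:length]
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives, length


def percent_identity(query: str, midline: str) -> float:
    """Identities as a percentage of the aligned length."""
    identities, _, length = alignment_stats(query, midline)
    return 100.0 * identities / length if length else 0.0


# Toy block: two identical positions, one conservative, one mismatch.
print(alignment_stats("ACDE", "A+ E"))   # (2, 3, 4)
print(percent_identity("ACDE", "A+ E"))  # 50.0
```

To compare against the value reported for a hit, the blocks of that hit would need to be concatenated first, since the reported figure covers the whole alignment rather than a single block.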


Sequences
CDS sequence
ATGAGGGAAAACTGTACCTCCAGCACTGATGTGAACCTGCCAACAGCAGTTTTTAACATTGCGTCGACCTGTTCAGCTGATAATAGCCCTCCAGCAAATATTAGAAAACC
TAACACAAATTTGAAGGTTTATGTGCCACATTGCAATGGTGTGGTTTGTCAGCAGTTGCAATGCATGTGCCTGGTCAAACAACTAAAACTGATTGAAGGAATTTCTACTA
TTAAAAATGAATGGAAAAACGTAAAGAAGATGACCTGTTTGCCCTTGCTTCATGAGGCATCACCATTGAAGTTCCTGGCAACCATAGGGCAACCCTTCATTGAAGGTGCA
TGCCATAGTCGCAGATGGGAAATCAAGGCAAGAGGTGCCCACCCTCACCATAGGCACATACCTCACACTGACAGGTATGGGCAGGCTAGGCAGAACGACTTGAACAAAGG
AAGATGGCTGCAATCTAGACGCACGCGCGAGCAAGATTGTAAGGACGAGTGGCAGGAGAAAGGGCGACCTCATCGGTGTCCGGAAGCGAGGCGACAACTTGTCGGAGAGA
GGAAAGGCCACCGCGTCACCTTCGACAGGAAGAGAGAACTTCCGACTTGGACAGAGAATGAGAGAGCTCGAGAGAGTGATTTTTTAGAGATAGGAGCAAGCTCGACTAGA
GGCAGGGCGCCAATGCAGGTTGAAGACGAACATCGGGCGGCAGCCAAATTACGCTGGAGCAATTATATAGATTTGTTGGACTTTGCCAGGAGAGAATTAGTCGAGCTCGA
GCATCGTCTCCGCAAGCAGCCAAGCCAAGTGTTCGTATGGCGTAACTTGTACAAGATTCGACCTTCCACGTCCGGCGCTGACAAAAACATAACCCCATCTCCCGCTCGAT
TCCCGAGAGAATCCAAACCCCACTCACAATTCCCCACAAACGAGCCGCAGAGAGAAATGGATTCCCGGGAATTGGCAGCTTTACTCTCTTCTTTGATCTCCCAACTCCTT
CTCCTCCTCTTCCTCCTCTTTCCTTCCCCCAACCCACATTCCCTTTTGTCCAATTCCACTTCTGATTCCAGTTACTATGGAAATCTTTACCCCTTCTTCAACCACTTGCT
CTTGTCGCAGGAAATTGCTGCCTCCCTTTCGTTTCTCTCGGTTTCGCGGAAGAGGAAGAGAACGCATTCGACGGAGGAACTGGAATTGGGGCCGACTGGGGATGATGCCG
GAGACGGAAGCCTCGGTGGTGGTGTCGGCGGCGGCCGTGGACGAGTCAATTTGTTTCGGACTCGGAGCCCTGACTCGTTCAGGAATCACTTCAGGATGACCTCCTCTACG
TTTGAATGGCTCTCTGGTTTGCTTGAGCCCCTTCTGGACTGCCGAGACCCGGTTGGGTCGCCTCTCAATCTCTCCGCCGAGATTCGACTCGGTGTCGGCCTGTCTCGGCT
AGCCACCGGCTGCGATTTCTCGACGATCTCGGAACAATTCGGCGTCTCGGAGTCGGTAGCGCGGTTCTGTGCCAAGCAATTATGCCGGGTTCTCTGTACCAATTTTCGCT
TCTGGGTCGAATTCCCTTCCCCCAATGAGCTCGAATCGACGTCCTCCGCCTTCGAAGATCTCGGCGGGCTCCTGAATTGCTGTGGCGTGATTGCTTGCACAAGGTTCAAG
ATCATTAGAAATAACCATTTTTATGAAGATAGCATCGCTACTCAACTTGTTGTTGATTCCTCGTCACGAATCCTTAGTATTGTTGCAGGATTTCGTGGCGATAAGGACGA
CTCAACGGTGCTTATGTCCTCGACACTGTTCAAAGACATCGAAGAAGGAAGGCTTCTGGATTCTCCTCCTGTTTACCTTCATGGGGTGGCTGTAAATCAGTACTTGTTTG
GACATGGCGAATACCCCTTGCTTCCATGGTTAATGGTGCCTTTTGCAGGGGCTGGTTCAGGTTCAACTGAAGAGAGTTTCAACGAAGCCCACAGATTGATGTGCATTCCA
GCTCTAAAAGCAATCGTTAGTCTCAGAAACTGGGGAGTTCTGAGTCAACCAATGCACGAGGAGTTCAAAACTGCAGTTGCTTACATTGGTGCTTGTTCAATTCTTCATAA
TGCTTTGTTGATGAGGGAGGATTTTTCTGCCATGTCTGAAGAATGGGAGAGCTCAGCTTCACTTGATCATAGCTCTCAGTATATTGGGGTTGGATTGAATGAGGATTCAA
CAGATGAGAAGGCTTCTATTATACAGAGGGCATTGGCTTTGAAGCTAGAGAGCTTCATAGTTGAAATTTAA
mRNA sequence
ATGAGGGAAAACTGTACCTCCAGCACTGATGTGAACCTGCCAACAGCAGTTTTTAACATTGCGTCGACCTGTTCAGCTGATAATAGCCCTCCAGCAAATATTAGAAAACC
TAACACAAATTTGAAGGTTTATGTGCCACATTGCAATGGTGTGGTTTGTCAGCAGTTGCAATGCATGTGCCTGGTCAAACAACTAAAACTGATTGAAGGAATTTCTACTA
TTAAAAATGAATGGAAAAACGTAAAGAAGATGACCTGTTTGCCCTTGCTTCATGAGGCATCACCATTGAAGTTCCTGGCAACCATAGGGCAACCCTTCATTGAAGGTGCA
TGCCATAGTCGCAGATGGGAAATCAAGGCAAGAGGTGCCCACCCTCACCATAGGCACATACCTCACACTGACAGGTATGGGCAGGCTAGGCAGAACGACTTGAACAAAGG
AAGATGGCTGCAATCTAGACGCACGCGCGAGCAAGATTGTAAGGACGAGTGGCAGGAGAAAGGGCGACCTCATCGGTGTCCGGAAGCGAGGCGACAACTTGTCGGAGAGA
GGAAAGGCCACCGCGTCACCTTCGACAGGAAGAGAGAACTTCCGACTTGGACAGAGAATGAGAGAGCTCGAGAGAGTGATTTTTTAGAGATAGGAGCAAGCTCGACTAGA
GGCAGGGCGCCAATGCAGGTTGAAGACGAACATCGGGCGGCAGCCAAATTACGCTGGAGCAATTATATAGATTTGTTGGACTTTGCCAGGAGAGAATTAGTCGAGCTCGA
GCATCGTCTCCGCAAGCAGCCAAGCCAAGTGTTCGTATGGCGTAACTTGTACAAGATTCGACCTTCCACGTCCGGCGCTGACAAAAACATAACCCCATCTCCCGCTCGAT
TCCCGAGAGAATCCAAACCCCACTCACAATTCCCCACAAACGAGCCGCAGAGAGAAATGGATTCCCGGGAATTGGCAGCTTTACTCTCTTCTTTGATCTCCCAACTCCTT
CTCCTCCTCTTCCTCCTCTTTCCTTCCCCCAACCCACATTCCCTTTTGTCCAATTCCACTTCTGATTCCAGTTACTATGGAAATCTTTACCCCTTCTTCAACCACTTGCT
CTTGTCGCAGGAAATTGCTGCCTCCCTTTCGTTTCTCTCGGTTTCGCGGAAGAGGAAGAGAACGCATTCGACGGAGGAACTGGAATTGGGGCCGACTGGGGATGATGCCG
GAGACGGAAGCCTCGGTGGTGGTGTCGGCGGCGGCCGTGGACGAGTCAATTTGTTTCGGACTCGGAGCCCTGACTCGTTCAGGAATCACTTCAGGATGACCTCCTCTACG
TTTGAATGGCTCTCTGGTTTGCTTGAGCCCCTTCTGGACTGCCGAGACCCGGTTGGGTCGCCTCTCAATCTCTCCGCCGAGATTCGACTCGGTGTCGGCCTGTCTCGGCT
AGCCACCGGCTGCGATTTCTCGACGATCTCGGAACAATTCGGCGTCTCGGAGTCGGTAGCGCGGTTCTGTGCCAAGCAATTATGCCGGGTTCTCTGTACCAATTTTCGCT
TCTGGGTCGAATTCCCTTCCCCCAATGAGCTCGAATCGACGTCCTCCGCCTTCGAAGATCTCGGCGGGCTCCTGAATTGCTGTGGCGTGATTGCTTGCACAAGGTTCAAG
ATCATTAGAAATAACCATTTTTATGAAGATAGCATCGCTACTCAACTTGTTGTTGATTCCTCGTCACGAATCCTTAGTATTGTTGCAGGATTTCGTGGCGATAAGGACGA
CTCAACGGTGCTTATGTCCTCGACACTGTTCAAAGACATCGAAGAAGGAAGGCTTCTGGATTCTCCTCCTGTTTACCTTCATGGGGTGGCTGTAAATCAGTACTTGTTTG
GACATGGCGAATACCCCTTGCTTCCATGGTTAATGGTGCCTTTTGCAGGGGCTGGTTCAGGTTCAACTGAAGAGAGTTTCAACGAAGCCCACAGATTGATGTGCATTCCA
GCTCTAAAAGCAATCGTTAGTCTCAGAAACTGGGGAGTTCTGAGTCAACCAATGCACGAGGAGTTCAAAACTGCAGTTGCTTACATTGGTGCTTGTTCAATTCTTCATAA
TGCTTTGTTGATGAGGGAGGATTTTTCTGCCATGTCTGAAGAATGGGAGAGCTCAGCTTCACTTGATCATAGCTCTCAGTATATTGGGGTTGGATTGAATGAGGATTCAA
CAGATGAGAAGGCTTCTATTATACAGAGGGCATTGGCTTTGAAGCTAGAGAGCTTCATAGTTGAAATTTAA
Protein sequence
MRENCTSSTDVNLPTAVFNIASTCSADNSPPANIRKPNTNLKVYVPHCNGVVCQQLQCMCLVKQLKLIEGISTIKNEWKNVKKMTCLPLLHEASPLKFLATIGQPFIEGA
CHSRRWEIKARGAHPHHRHIPHTDRYGQARQNDLNKGRWLQSRRTREQDCKDEWQEKGRPHRCPEARRQLVGERKGHRVTFDRKRELPTWTENERARESDFLEIGASSTR
GRAPMQVEDEHRAAAKLRWSNYIDLLDFARRELVELEHRLRKQPSQVFVWRNLYKIRPSTSGADKNITPSPARFPRESKPHSQFPTNEPQREMDSRELAALLSSLISQLL
LLLFLLFPSPNPHSLLSNSTSDSSYYGNLYPFFNHLLLSQEIAASLSFLSVSRKRKRTHSTEELELGPTGDDAGDGSLGGGVGGGRGRVNLFRTRSPDSFRNHFRMTSST
FEWLSGLLEPLLDCRDPVGSPLNLSAEIRLGVGLSRLATGCDFSTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEFPSPNELESTSSAFEDLGGLLNCCGVIACTRFK
IIRNNHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEEGRLLDSPPVYLHGVAVNQYLFGHGEYPLLPWLMVPFAGAGSGSTEESFNEAHRLMCIP
ALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMSEEWESSASLDHSSQYIGVGLNEDSTDEKASIIQRALALKLESFIVEI
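
The relationship between the CDS and protein sequences above can be checked by translating the CDS. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) and a CDS that starts in frame at its first base; `translate` is a hypothetical helper, not a CuGenDB function.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon.

    Whitespace and line breaks are ignored; codons containing anything other
    than A, C, G, or T are reported as 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First six codons of the CDS sequence listed above.
print(translate("ATGAGGGAAAACTGTACC"))  # MRENCT
```

Passing the full CDS (with its line breaks left in) and comparing the result to the listed protein sequence is a quick consistency check on a downloaded record.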