; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10011146 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10011146
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionDDE Tnp4 domain-containing protein
Genome locationChr01:2836630..2842673
RNA-Seq ExpressionHG10011146
SyntenyHG10011146
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR027806 - Harbinger transposase-derived nuclease domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0037135.1 putative nuclease HARBI1 [Cucumis melo var. makuwa]9.4e-23094.15Show/hide
Query:  MDSPQLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNPSDHLELGPSHGRVHHLFRTRSP
        MDSP+LAALLSSLISQLLLLLFLLFPSSNPHSLFSNST DS+FY N   LFTHFLFSQ+FAASL FLSVSRKRKRTNP DHLELG SHGRVHHLFRTR+P
Subjt:  MDSPQLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNPSDHLELGPSHGRVHHLFRTRSP

Query:  DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELT
        DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFC+KQLCRVLCTNFRFWVEFPCPNELELT
Subjt:  DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELT

Query:  SSAFEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGRLLDAPPVYLHGVAVNQYLFGHGE
        SSAFEDLAGLPNCCGVVSCTRFKIIRNSHFYEDS+ATQLVVDSSSRILSIVAGFRG+KDDSTVLMSSTLFKDIEQGRLL++PPVYLHGVAVN+YLFG GE
Subjt:  SSAFEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGRLLDAPPVYLHGVAVNQYLFGHGE

Query:  YPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDHKSQYVKA
        YPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESL+S DH+SQYV+A
Subjt:  YPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDHKSQYVKA

Query:  GLNEDSTNEKASVIQRALALRARELHS
        GLN DSTNEKASVIQRALA RARELHS
Subjt:  GLNEDSTNEKASVIQRALALRARELHS

KGN57516.1 hypothetical protein Csa_011580 [Cucumis sativus]4.2e-23094.38Show/hide
Query:  MDSPQLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNPSDHLELGPSHGRVHHLFRTRSP
        MDSP+LAALLSSLISQLLLLLFLLFPSSNPHSLFSNS  DS+FY N   LF HFLFSQ+FAASL FLSVSRKRKRTN SDHLELG SHGRVHHLFRTR+P
Subjt:  MDSPQLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNPSDHLELGPSHGRVHHLFRTRSP

Query:  DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELT
        DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFC+KQLCRVLCTNFRFWVEFPCPNELELT
Subjt:  DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELT

Query:  SSAFEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGRLLDAPPVYLHGVAVNQYLFGHGE
        SSAFEDLAGLPNCCGVVSCTRFKIIRNSHFYEDS+ATQLVVDSSSRILSIVAGFRG+KDDSTVLMSSTLFKDIEQGRLL++PPVYLHGVAVN+YLFGHGE
Subjt:  SSAFEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGRLLDAPPVYLHGVAVNQYLFGHGE

Query:  YPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDHKSQYVKA
        YPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESL+S DHKSQYV+A
Subjt:  YPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDHKSQYVKA

Query:  GLNEDSTNEKASVIQRALALRARELHS
        GLN DSTNEKASVIQRALALRARELHS
Subjt:  GLNEDSTNEKASVIQRALALRARELHS

XP_008456140.1 PREDICTED: uncharacterized protein LOC103496169 [Cucumis melo]5.2e-20494.62Show/hide
Query:  FSQEFAASLSFLSVSRKRKRTNPSDHLELGPSHGRVHHLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCD
        F + FAASL FLSVSRKRKRTNP DHLELG SHGRVHHLFRTR+PDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCD
Subjt:  FSQEFAASLSFLSVSRKRKRTNPSDHLELGPSHGRVHHLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCD

Query:  FSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFR
        FSTISDQFGVSESVARFC+KQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSHFYEDS+ATQLVVDSSSRILSIVAGFR
Subjt:  FSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFR

Query:  GDKDDSTVLMSSTLFKDIEQGRLLDAPPVYLHGVAVNQYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEF
        G+KDDSTVLMSSTLFKDIEQGRLL++PPVYLHGVAVN+YLFG GEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEF
Subjt:  GDKDDSTVLMSSTLFKDIEQGRLLDAPPVYLHGVAVNQYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEF

Query:  KTAVAYIGACSILHNALLMREDFSAMADEWESLASFDHKSQYVKAGLNEDSTNEKASVIQRALALRARELHS
        KTAVAYIGACSILHNALLMREDFSAMADEWESL+S DH+SQYV+AGLN DSTNEKASVIQRALA RARELHS
Subjt:  KTAVAYIGACSILHNALLMREDFSAMADEWESLASFDHKSQYVKAGLNEDSTNEKASVIQRALALRARELHS

XP_023536005.1 protein ALP1-like [Cucurbita pepo subsp. pepo]2.9e-19982.99Show/hide
Query:  MDSPQLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNPSDHLELGPS--------HGRVH
        MDS QLAALLSSLISQLLLLL LLFPSSNPHSL SNS+SDSNFY NLFPLF HFLFSQ+ AASLSFLSVSRKRKRT+ S+ LELGPS         GRV 
Subjt:  MDSPQLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNPSDHLELGPS--------HGRVH

Query:  HLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFP
        HL RTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLS EIRLGVGL RLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFP
Subjt:  HLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFP

Query:  CPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGRLLDAPPVYLHGVAVN
        CP+ELELTSSAFED+AGLPNCCGV+SCT                            SIVAGFRGDKDDSTVLMS+TLFKDIE+GRLL +PPVYLHG+AVN
Subjt:  CPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGRLLDAPPVYLHGVAVN

Query:  QYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFD
        QYLFGHGEYPLLPWL+VPFAGAVSGSTEESFNEAHRLMCIPALKAI+SLRNWGVLSQP+HEEFKTAVAYIGACSILHNALLMREDF+AMADEWESLAS D
Subjt:  QYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFD

Query:  HKSQYVKAGLNEDSTNEKASVIQRALALRARELHS
        H SQYV  GLNEDS +EKAS+IQ+ALALRARELH+
Subjt:  HKSQYVKAGLNEDSTNEKASVIQRALALRARELHS

XP_038880641.1 protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1-like [Benincasa hispida]3.1e-20987.59Show/hide
Query:  MDSPQLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNPSDHLELGPSHGRVHHLFRTRSP
        MDSPQLAALLSSLISQLLLLLFLLFPSSNPHSLFSNST DSNFY N   LFTHFLFSQ+FAASL FLSVSRKRKRTNPSDHLELG SHGR  HLFRTR+P
Subjt:  MDSPQLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNPSDHLELGPSHGRVHHLFRTRSP

Query:  DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELT
        DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFS IS QFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELT
Subjt:  DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELT

Query:  SSAFEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGRLLDAPPVYLHGVAVNQYLFGHGE
        SSAFEDLAGLPNCCGVV+CT                            SIVAGFRGDKDDSTVLMSSTLFKDIEQGRLLDAPPVYLHGVAVNQYLFGHGE
Subjt:  SSAFEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGRLLDAPPVYLHGVAVNQYLFGHGE

Query:  YPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDHKSQYVKA
        YPLLPWL++PFAGAVSGSTEESFN+AHRLMCIPALKAIVSLRNWGVLS+P+HEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDH+SQYV+ 
Subjt:  YPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDHKSQYVKA

Query:  GLNEDSTNEKASVIQRALALRARELHS
         LNEDSTNEKAS+IQRALA+RARELHS
Subjt:  GLNEDSTNEKASVIQRALALRARELHS

TrEMBL top hitse value%identityAlignment
A0A0A0LBX6 DDE Tnp4 domain-containing protein2.0e-23094.38Show/hide
Query:  MDSPQLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNPSDHLELGPSHGRVHHLFRTRSP
        MDSP+LAALLSSLISQLLLLLFLLFPSSNPHSLFSNS  DS+FY N   LF HFLFSQ+FAASL FLSVSRKRKRTN SDHLELG SHGRVHHLFRTR+P
Subjt:  MDSPQLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNPSDHLELGPSHGRVHHLFRTRSP

Query:  DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELT
        DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFC+KQLCRVLCTNFRFWVEFPCPNELELT
Subjt:  DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELT

Query:  SSAFEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGRLLDAPPVYLHGVAVNQYLFGHGE
        SSAFEDLAGLPNCCGVVSCTRFKIIRNSHFYEDS+ATQLVVDSSSRILSIVAGFRG+KDDSTVLMSSTLFKDIEQGRLL++PPVYLHGVAVN+YLFGHGE
Subjt:  SSAFEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGRLLDAPPVYLHGVAVNQYLFGHGE

Query:  YPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDHKSQYVKA
        YPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESL+S DHKSQYV+A
Subjt:  YPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDHKSQYVKA

Query:  GLNEDSTNEKASVIQRALALRARELHS
        GLN DSTNEKASVIQRALALRARELHS
Subjt:  GLNEDSTNEKASVIQRALALRARELHS

A0A1S3C2M8 uncharacterized protein LOC1034961692.5e-20494.62Show/hide
Query:  FSQEFAASLSFLSVSRKRKRTNPSDHLELGPSHGRVHHLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCD
        F + FAASL FLSVSRKRKRTNP DHLELG SHGRVHHLFRTR+PDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCD
Subjt:  FSQEFAASLSFLSVSRKRKRTNPSDHLELGPSHGRVHHLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCD

Query:  FSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFR
        FSTISDQFGVSESVARFC+KQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSHFYEDS+ATQLVVDSSSRILSIVAGFR
Subjt:  FSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFR

Query:  GDKDDSTVLMSSTLFKDIEQGRLLDAPPVYLHGVAVNQYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEF
        G+KDDSTVLMSSTLFKDIEQGRLL++PPVYLHGVAVN+YLFG GEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEF
Subjt:  GDKDDSTVLMSSTLFKDIEQGRLLDAPPVYLHGVAVNQYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEF

Query:  KTAVAYIGACSILHNALLMREDFSAMADEWESLASFDHKSQYVKAGLNEDSTNEKASVIQRALALRARELHS
        KTAVAYIGACSILHNALLMREDFSAMADEWESL+S DH+SQYV+AGLN DSTNEKASVIQRALA RARELHS
Subjt:  KTAVAYIGACSILHNALLMREDFSAMADEWESLASFDHKSQYVKAGLNEDSTNEKASVIQRALALRARELHS

A0A5D3CRB2 Putative nuclease HARBI14.5e-23094.15Show/hide
Query:  MDSPQLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNPSDHLELGPSHGRVHHLFRTRSP
        MDSP+LAALLSSLISQLLLLLFLLFPSSNPHSLFSNST DS+FY N   LFTHFLFSQ+FAASL FLSVSRKRKRTNP DHLELG SHGRVHHLFRTR+P
Subjt:  MDSPQLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNPSDHLELGPSHGRVHHLFRTRSP

Query:  DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELT
        DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFC+KQLCRVLCTNFRFWVEFPCPNELELT
Subjt:  DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELT

Query:  SSAFEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGRLLDAPPVYLHGVAVNQYLFGHGE
        SSAFEDLAGLPNCCGVVSCTRFKIIRNSHFYEDS+ATQLVVDSSSRILSIVAGFRG+KDDSTVLMSSTLFKDIEQGRLL++PPVYLHGVAVN+YLFG GE
Subjt:  SSAFEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGRLLDAPPVYLHGVAVNQYLFGHGE

Query:  YPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDHKSQYVKA
        YPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESL+S DH+SQYV+A
Subjt:  YPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDHKSQYVKA

Query:  GLNEDSTNEKASVIQRALALRARELHS
        GLN DSTNEKASVIQRALA RARELHS
Subjt:  GLNEDSTNEKASVIQRALALRARELHS

A0A6J1FNZ2 protein ALP1-like1.0e-19782.07Show/hide
Query:  MDSPQLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNPSDHLELGPS--------HGRVH
        MDS QLAALLSSLISQLLLLL LLFPSSNPHSL SNS+SDSNFY NLFPLF HFLFSQ+ AASLSFLSVSRKRKRT+ S+ LELGPS         GRV 
Subjt:  MDSPQLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNPSDHLELGPS--------HGRVH

Query:  HLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFP
        HL RTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLS EIRLGVGL RLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFP
Subjt:  HLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFP

Query:  CPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGRLLDAPPVYLHGVAVN
        CP+ELELTSS+FED+AGLPNCCGV+SCT                            SIVAGFRGDKDDSTVLMS+TLFKDIE+ RLL +PPVYLHG+AVN
Subjt:  CPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGRLLDAPPVYLHGVAVN

Query:  QYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFD
        QYLFGHGEYPLLPWL+VPFAGAVSGSTEESFNEAHRLMCIPALKAI+SLRNWGVLSQP+HEEFKTAVAYIGACSILHNALLMREDF+AMADEWESLAS D
Subjt:  QYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFD

Query:  HKSQYVKAGLNEDSTNEKASVIQRALALRARELHS
        H SQYV  GLNEDS +EKA+++Q+ALALRARELH+
Subjt:  HKSQYVKAGLNEDSTNEKASVIQRALALRARELHS

A0A6J1J0M5 protein ALP1-like1.7e-19782.53Show/hide
Query:  MDSPQLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNPSDHLELGPS--------HGRVH
        MDS QLAALLSSLISQLLLLL LLFPSSNPHSL SNS+SDSNFY NLFPLF HFLFSQ+ AASLSFLSVSRKRKRT+ S+ LELGPS         GRV 
Subjt:  MDSPQLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNPSDHLELGPS--------HGRVH

Query:  HLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFP
        HL RTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLS EIRLGVGL RLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFP
Subjt:  HLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFP

Query:  CPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGRLLDAPPVYLHGVAVN
        CP+ELELTSSAFED+AGLPNCCGV+SCT                            SIVAGFRGDKDDSTVLMS+TLFKDIE+ RLL +PPVYLHGVAVN
Subjt:  CPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGRLLDAPPVYLHGVAVN

Query:  QYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFD
        QYLFGHG+YPLLPWL+VPFAGAVSGSTEESFNEAHRLM IPALKAI+SLRNWGVLSQP+HEEFKTAVAYIGACSILHNALLMREDF+AMADEWESLAS D
Subjt:  QYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFD

Query:  HKSQYVKAGLNEDSTNEKASVIQRALALRARELHS
        H SQYV  GLNEDS +EKAS+IQ+ALALRARELH+
Subjt:  HKSQYVKAGLNEDSTNEKASVIQRALALRARELHS

SwissProt top hitse value%identityAlignment
Q94K49 Protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 11.3e-2729.35Show/hide
Query:  SFRNHFRMTSSTFEWLSGLLEPLLECRDPVG----SPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNEL
        +F++ FR + +TF ++  L+   L  R P G        LSVE ++ + L RLA+G    ++   FGV +S       +    L    +  + +P  + +
Subjt:  SFRNHFRMTSSTFEWLSGLLEPLLECRDPVG----SPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNEL

Query:  ELTSSAFEDLAGLPNCCGVVSCTR----FKIIRNSHFYED-----SIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGRLLDAPPVYL-H
        E   S FE++ GLPNCCG +  T        ++ S  + D     S+  Q V D   R L++V G+ G    S +L  S  FK  E  ++LD  P  L  
Subjt:  ELTSSAFEDLAGLPNCCGVVSCTR----FKIIRNSHFYED-----SIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGRLLDAPPVYL-H

Query:  GVAVNQYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLR-NWGVLSQPI-HEEFKTAVAYIGACSILHNALLMREDF
        G  + +Y+ G   YPLLPWLI P        +  +FNE H  +   A  A   L+ +W +LS+ +   + +   + I  C +LHN ++   D+
Subjt:  GVAVNQYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLR-NWGVLSQPI-HEEFKTAVAYIGACSILHNALLMREDF

Q9M2U3 Protein ALP1-like5.7e-2828.05Show/hide
Query:  PDSFRNHFRMTSSTFEWLSGLLEPLLECR-----DPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVA-----RFCAKQLCRVLCTNFRFWV
        P +F + F+++  TF+++  L++     +     D  G+P  LS+  R+ V L RL +G   S I + FG+++S       RF      R +  +   W 
Subjt:  PDSFRNHFRMTSSTFEWLSGLLEPLLECR-----DPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVA-----RFCAKQLCRVLCTNFRFWV

Query:  EFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSHFYED------------SIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGR
            P++L+   S FE ++GLPNCCG +  T   I+ N    E             S+  Q VVD   R L ++AG+ G  +D  VL +S  +K +E+G+
Subjt:  EFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSHFYED------------SIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGR

Query:  LLDAPPVYL-HGVAVNQYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRN-WGVLSQPIHEEFKTAV-AYIGACSILHNALLM
         L+   + L     + +Y+ G   +PLLPWL+ P+ G  +   +  FN+ H      A  A+  L++ W +++  +    +  +   I  C +LHN ++ 
Subjt:  LLDAPPVYL-HGVAVNQYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRN-WGVLSQPIHEEFKTAV-AYIGACSILHNALLM

Query:  RED
         ED
Subjt:  RED

Arabidopsis top hitse value%identityAlignment
AT1G72270.1 CONTAINS InterPro DOMAIN/s: Ribosome 60S biogenesis N-terminal (InterPro:IPR021714)1.3e-1428.12Show/hide
Query:  HFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGV-SESVARFCAKQLCRVLCTNFRFWVEFPCPNELELTSSAF
        +FRM+ STF  L  +L                S        ++RLA G  +  +  +FG  S S A      +C+++       ++ P P+        F
Subjt:  HFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGV-SESVARFCAKQLCRVLCTNFRFWVEFPCPNELELTSSAF

Query:  EDLAGLPNCCGVVSCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGRLLDAPPVYLHGVAVNQYLFGHGEYPLL
             LPNC GVV   RF++       + SI  Q +VDS+ R + I AG+        +   + LF  I +  L  AP    +GV V +Y+ G    PLL
Subjt:  EDLAGLPNCCGVVSCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGRLLDAPPVYLHGVAVNQYLFGHGEYPLL

Query:  PWLIVPFAGAVSGSTEESFNEAHRLMCIPALK----AIVSLR-NWGVLS---QPIHEEFKTAVAYIGACSILHNALLMREDFSAMADE
        PWL+ P+      S EESF E    +    L     A   +R  W +L    +P   EF   V   G   +LHN L+   D     +E
Subjt:  PWLIVPFAGAVSGSTEESFNEAHRLMCIPALK----AIVSLR-NWGVLS---QPIHEEFKTAVAYIGACSILHNALLMREDFSAMADE

AT3G19120.1 PIF / Ping-Pong family of plant transposases9.3e-1823.98Show/hide
Query:  SLISQLLLLLFLLFPSSNPHSLFSNSTSDSNFYPNLFPLFTHF-LFSQEFAASLSFLSVSRKRKRT-------NPSDHLELGPSHGRV--------HHLF
        +++S LL L   L P+S   S  S S+  S    +L    +   L     A+ LSFL+V+R    +       +PS    L      V         H++
Subjt:  SLISQLLLLLFLLFPSSNPHSLFSNSTSDSNFYPNLFPLFTHF-LFSQEFAASLSFLSVSRKRKRT-------NPSDHLELGPSHGRV--------HHLF

Query:  RTRSP---DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTN-FRFWVEF
           +P     +R+ + ++   F  +   L+P +       S L L  +  + + L RLA GC   T++ ++ +   +       + R+L T  +  +++ 
Subjt:  RTRSP---DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTN-FRFWVEF

Query:  PC-PNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNS----------HFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGRLLD
        P     L  T+  FE+L  LPN CG +  T  K+ R +           +  D++  Q+V D       +     G +DDS+    S L+K +  G ++ 
Subjt:  PC-PNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNS----------HFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGRLLD

Query:  APPVYLHGVAVNQYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSL--RNWGVLSQPIHEEFKTAVAYIGACSILHN
           + + G  V  Y+ G   YPLL +L+ PF+   SG+  E+  +   +     +   + L    W +L Q ++     A   I AC +LHN
Subjt:  APPVYLHGVAVNQYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSL--RNWGVLSQPIHEEFKTAVAYIGACSILHN

AT3G55350.1 PIF / Ping-Pong family of plant transposases4.0e-2928.05Show/hide
Query:  PDSFRNHFRMTSSTFEWLSGLLEPLLECR-----DPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVA-----RFCAKQLCRVLCTNFRFWV
        P +F + F+++  TF+++  L++     +     D  G+P  LS+  R+ V L RL +G   S I + FG+++S       RF      R +  +   W 
Subjt:  PDSFRNHFRMTSSTFEWLSGLLEPLLECR-----DPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVA-----RFCAKQLCRVLCTNFRFWV

Query:  EFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSHFYED------------SIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGR
            P++L+   S FE ++GLPNCCG +  T   I+ N    E             S+  Q VVD   R L ++AG+ G  +D  VL +S  +K +E+G+
Subjt:  EFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSHFYED------------SIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGR

Query:  LLDAPPVYL-HGVAVNQYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRN-WGVLSQPIHEEFKTAV-AYIGACSILHNALLM
         L+   + L     + +Y+ G   +PLLPWL+ P+ G  +   +  FN+ H      A  A+  L++ W +++  +    +  +   I  C +LHN ++ 
Subjt:  LLDAPPVYL-HGVAVNQYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRN-WGVLSQPIHEEFKTAV-AYIGACSILHNALLM

Query:  RED
         ED
Subjt:  RED

AT3G63270.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912)9.0e-2929.35Show/hide
Query:  SFRNHFRMTSSTFEWLSGLLEPLLECRDPVG----SPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNEL
        +F++ FR + +TF ++  L+   L  R P G        LSVE ++ + L RLA+G    ++   FGV +S       +    L    +  + +P  + +
Subjt:  SFRNHFRMTSSTFEWLSGLLEPLLECRDPVG----SPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNEL

Query:  ELTSSAFEDLAGLPNCCGVVSCTR----FKIIRNSHFYED-----SIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGRLLDAPPVYL-H
        E   S FE++ GLPNCCG +  T        ++ S  + D     S+  Q V D   R L++V G+ G    S +L  S  FK  E  ++LD  P  L  
Subjt:  ELTSSAFEDLAGLPNCCGVVSCTR----FKIIRNSHFYED-----SIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGRLLDAPPVYL-H

Query:  GVAVNQYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLR-NWGVLSQPI-HEEFKTAVAYIGACSILHNALLMREDF
        G  + +Y+ G   YPLLPWLI P        +  +FNE H  +   A  A   L+ +W +LS+ +   + +   + I  C +LHN ++   D+
Subjt:  GVAVNQYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLR-NWGVLSQPI-HEEFKTAVAYIGACSILHNALLMREDF

AT4G29780.1 unknown protein2.0e-2026.36Show/hide
Query:  GPSHGRVHHLFRTRS-----------PDSFRNHFRMTSSTFEWLSGLLEPLLE-----CRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSES
        GPSH R+    RT              D FR  FRM+ STF  +   L+  +       RD + +P       R+GV ++RLATG     +S++FG+  S
Subjt:  GPSHGRVHHLFRTRS-----------PDSFRNHFRMTSSTFEWLSGLLEPLLE-----CRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSES

Query:  VARFCAKQLCR----VLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIR---------NSHFYED------SIATQLVVDSSSRILS
               ++CR    VL   +  W   P  +E+  T + FE +  +PN  G +  T   II          N    E       SI  Q VV++      
Subjt:  VARFCAKQLCR----VLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIR---------NSHFYED------SIATQLVVDSSSRILS

Query:  IVAGFRGDKDDSTVLMSSTLFKD-IEQGRLLDAPPVYLHGVAVNQYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLR-NWGVL
        +  G  G   D  +L  S+L +    +G L D+            ++ G+  +PL  +L+VP+       T+ +FNE+   +   A  A   L+  W  L
Subjt:  IVAGFRGDKDDSTVLMSSTLFKD-IEQGRLLDAPPVYLHGVAVNQYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLR-NWGVL

Query:  SQPIHEEFKTAVAYIGACSILHNALLMRED
         +    + +     +GAC +LHN   MR++
Subjt:  SQPIHEEFKTAVAYIGACSILHNALLMRED


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGTGGCAGCCGAAGTTAGCCACAGGAGGTAGTGGCCAAAGTTGGTCACCAGAGGAGGCATCGGTGGTGATGAGGGCAGGCAAGCTATGGCGTAATTTCTACAAGGA
TTCAACACCTTCCACGTCCGGCGCCGACAAAAATCCAATACCCATCTTCCTCGCAGCCGCTCGATTACCGAGAGAATCGAAAATCCCAATTCCCAACTACTGCAACTGCA
AAACTGACACAAACCCATTAATGGATTCCCCTCAATTGGCTGCTTTACTCTCTTCTTTGATCTCCCAACTCCTCCTCCTCCTCTTCCTCCTCTTCCCTTCCTCCAACCCA
CATTCCCTTTTCTCCAATTCCACTTCCGATTCCAATTTCTATCCCAATCTCTTCCCTCTCTTCACCCACTTCCTCTTTTCCCAAGAATTTGCCGCTTCCCTTTCCTTTCT
CTCCGTTTCCCGCAAGAGAAAGAGGACCAATCCCTCCGACCATCTCGAATTGGGGCCATCTCATGGACGAGTTCATCATCTGTTTCGGACTCGGTCCCCTGATTCTTTCA
GAAATCACTTCAGAATGACCTCCTCAACGTTTGAATGGCTCTCTGGTTTACTTGAGCCCCTTCTGGAGTGTCGCGACCCGGTGGGTTCGCCTCTCGATCTCTCCGTTGAG
ATTCGGCTCGGTGTTGGTCTGTATCGCCTCGCCACCGGCTGCGATTTCTCCACAATCTCGGACCAATTTGGCGTCTCGGAGTCGGTAGCGAGGTTCTGTGCTAAACAATT
GTGTCGAGTTCTTTGTACTAATTTTCGCTTCTGGGTCGAATTCCCTTGCCCCAATGAGCTCGAATTAACGTCCTCGGCTTTTGAAGATCTTGCTGGACTTCCGAATTGCT
GTGGCGTGGTTTCTTGTACAAGGTTCAAGATCATTAGAAATAGCCATTTTTATGAAGATAGCATCGCTACTCAACTTGTTGTTGATTCCTCGTCACGAATCCTTAGTATT
GTTGCAGGATTTCGTGGCGATAAGGACGATTCCACAGTGCTTATGTCCTCGACGCTGTTTAAAGACATTGAACAAGGAAGGCTTCTGGATGCTCCTCCGGTTTACCTTCA
TGGGGTGGCTGTGAATCAGTACTTGTTTGGACATGGTGAATACCCTTTGCTTCCATGGTTAATAGTGCCTTTTGCAGGAGCTGTTTCAGGGTCAACTGAAGAGAGTTTCA
ATGAAGCTCACCGATTGATGTGCATTCCAGCTCTGAAAGCAATTGTTAGTTTGAGAAATTGGGGAGTTTTGAGCCAACCAATTCATGAGGAGTTCAAAACTGCTGTTGCT
TACATTGGTGCTTGCTCAATTCTTCATAATGCTTTGTTGATGAGGGAGGATTTTTCTGCCATGGCTGATGAGTGGGAGAGCTTAGCTTCATTTGATCATAAATCTCAGTA
TGTTAAAGCTGGGTTGAATGAGGATTCAACTAATGAGAAGGCTTCTGTTATACAGAGGGCATTGGCTCTGAGAGCTAGAGAGCTTCACAGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGGTGGCAGCCGAAGTTAGCCACAGGAGGTAGTGGCCAAAGTTGGTCACCAGAGGAGGCATCGGTGGTGATGAGGGCAGGCAAGCTATGGCGTAATTTCTACAAGGA
TTCAACACCTTCCACGTCCGGCGCCGACAAAAATCCAATACCCATCTTCCTCGCAGCCGCTCGATTACCGAGAGAATCGAAAATCCCAATTCCCAACTACTGCAACTGCA
AAACTGACACAAACCCATTAATGGATTCCCCTCAATTGGCTGCTTTACTCTCTTCTTTGATCTCCCAACTCCTCCTCCTCCTCTTCCTCCTCTTCCCTTCCTCCAACCCA
CATTCCCTTTTCTCCAATTCCACTTCCGATTCCAATTTCTATCCCAATCTCTTCCCTCTCTTCACCCACTTCCTCTTTTCCCAAGAATTTGCCGCTTCCCTTTCCTTTCT
CTCCGTTTCCCGCAAGAGAAAGAGGACCAATCCCTCCGACCATCTCGAATTGGGGCCATCTCATGGACGAGTTCATCATCTGTTTCGGACTCGGTCCCCTGATTCTTTCA
GAAATCACTTCAGAATGACCTCCTCAACGTTTGAATGGCTCTCTGGTTTACTTGAGCCCCTTCTGGAGTGTCGCGACCCGGTGGGTTCGCCTCTCGATCTCTCCGTTGAG
ATTCGGCTCGGTGTTGGTCTGTATCGCCTCGCCACCGGCTGCGATTTCTCCACAATCTCGGACCAATTTGGCGTCTCGGAGTCGGTAGCGAGGTTCTGTGCTAAACAATT
GTGTCGAGTTCTTTGTACTAATTTTCGCTTCTGGGTCGAATTCCCTTGCCCCAATGAGCTCGAATTAACGTCCTCGGCTTTTGAAGATCTTGCTGGACTTCCGAATTGCT
GTGGCGTGGTTTCTTGTACAAGGTTCAAGATCATTAGAAATAGCCATTTTTATGAAGATAGCATCGCTACTCAACTTGTTGTTGATTCCTCGTCACGAATCCTTAGTATT
GTTGCAGGATTTCGTGGCGATAAGGACGATTCCACAGTGCTTATGTCCTCGACGCTGTTTAAAGACATTGAACAAGGAAGGCTTCTGGATGCTCCTCCGGTTTACCTTCA
TGGGGTGGCTGTGAATCAGTACTTGTTTGGACATGGTGAATACCCTTTGCTTCCATGGTTAATAGTGCCTTTTGCAGGAGCTGTTTCAGGGTCAACTGAAGAGAGTTTCA
ATGAAGCTCACCGATTGATGTGCATTCCAGCTCTGAAAGCAATTGTTAGTTTGAGAAATTGGGGAGTTTTGAGCCAACCAATTCATGAGGAGTTCAAAACTGCTGTTGCT
TACATTGGTGCTTGCTCAATTCTTCATAATGCTTTGTTGATGAGGGAGGATTTTTCTGCCATGGCTGATGAGTGGGAGAGCTTAGCTTCATTTGATCATAAATCTCAGTA
TGTTAAAGCTGGGTTGAATGAGGATTCAACTAATGAGAAGGCTTCTGTTATACAGAGGGCATTGGCTCTGAGAGCTAGAGAGCTTCACAGTTAA
Protein sequenceShow/hide protein sequence
MRWQPKLATGGSGQSWSPEEASVVMRAGKLWRNFYKDSTPSTSGADKNPIPIFLAAARLPRESKIPIPNYCNCKTDTNPLMDSPQLAALLSSLISQLLLLLFLLFPSSNP
HSLFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNPSDHLELGPSHGRVHHLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVE
IRLGVGLYRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSIATQLVVDSSSRILSI
VAGFRGDKDDSTVLMSSTLFKDIEQGRLLDAPPVYLHGVAVNQYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVA
YIGACSILHNALLMREDFSAMADEWESLASFDHKSQYVKAGLNEDSTNEKASVIQRALALRARELHS