; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0014047 (gene) of Snake gourd v1 genome

Gene IDTan0014047
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionDDE Tnp4 domain-containing protein
Genome locationLG03:69693871..69695994
RNA-Seq ExpressionTan0014047
SyntenyTan0014047
Gene Ontology termsNA
InterPro domainsIPR027806 - Harbinger transposase-derived nuclease domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
RZC55946.1 hypothetical protein C5167_014815 [Papaver somniferum]1.4e-9846.12Show/hide
Query:  MHYSSTEVPPNSDEERYLEREFVSSGAHVELGVDDEETEIHGDWGETPAQSSKRRLVGPSLDSRTMAQNGDHDSDSSTSSDEMVGRMLIVTTIVNEYECE
        M  +ST  PP+SD ER LE EF+  GA   L   +   EI    G +  QS K       ++SR  + + D  SDS  S  E++        +V E    
Subjt:  MHYSSTEVPPNSDEERYLEREFVSSGAHVELGVDDEETEIHGDWGETPAQSSKRRLVGPSLDSRTMAQNGDHDSDSSTSSDEMVGRMLIVTTIVNEYECE

Query:  IPKHPCHTSSLNGQEYMLELLNGHPDRIFYSFRMDKNTFRALCERLRQSNYLVNDKIISIEEAVGMFLLTVCHSTRNRIVAEQFQHSKVTVSRQFSRVLR
          K P  TS L+G+E++ ELLNGHP R++   RMD +TF  LC  LR +++L +D+ +S+EEAVG+FL TV  S RNR+VAE FQHS  TV R F +VL+
Subjt:  IPKHPCHTSSLNGQEYMLELLNGHPDRIFYSFRMDKNTFRALCERLRQSNYLVNDKIISIEEAVGMFLLTVCHSTRNRIVAEQFQHSKVTVSRQFSRVLR

Query:  VMCMLGC-------------------------DNCVGAIDGTHVSVWAPASKQTPYRGRKVIVTQNIMCACSFNMLFTFVYTGWEGTANDSRVLLDAIGR
         +C LGC                          +CVGAIDGTH+S   PASKQ P+RGRK  +TQNIMCACSF+MLFTFVYTGWEGTAND+RVL+DAI  
Subjt:  VMCMLGC-------------------------DNCVGAIDGTHVSVWAPASKQTPYRGRKVIVTQNIMCACSFNMLFTFVYTGWEGTANDSRVLLDAIGR

Query:  EGNNFSLPPEGNIIL-------------------YHLRDYRGRGRHPRGPQEFFNYKHSSLRNVIERCFGVLKARFSILKLMPNYPIRK-----------
        E N F +P EG   +                   YHLRD+RGR R  +GP E FN++HSSLRNVIERCFGV K+RF ILK MPNYP+R+           
Subjt:  EGNNFSLPPEGNIIL-------------------YHLRDYRGRGRHPRGPQEFFNYKHSSLRNVIERCFGVLKARFSILKLMPNYPIRK-----------

Query:  ---TTMTSTRDTLFEEYQVADLEVPDEE--SLGGTQKFLDMNLSQAYISRMNGVRDDIVGTMWL
             + S  D LF ++   DL V DEE  S G     +D+++S A I  MN VRDDI GTMW+
Subjt:  ---TTMTSTRDTLFEEYQVADLEVPDEE--SLGGTQKFLDMNLSQAYISRMNGVRDDIVGTMWL

XP_008237544.2 PREDICTED: uncharacterized protein LOC103336277 [Prunus mume]7.6e-9749.24Show/hide
Query:  AQNGDHDSDSSTSSDEMVGRMLIVTTIVNEYECEIPKHPCHTSSLNGQEYMLELLNGHPDRIFYSFRMDKNTFRALCERLRQSNYLVNDKIISIEEAVGM
        + +   +SDSS++ D+ +G +L+   + NEY     K PC  S L+G+++++ELL GHP+R+F   RMDKNTF  LC  LR  ++L +D+ I +EE++ M
Subjt:  AQNGDHDSDSSTSSDEMVGRMLIVTTIVNEYECEIPKHPCHTSSLNGQEYMLELLNGHPDRIFYSFRMDKNTFRALCERLRQSNYLVNDKIISIEEAVGM

Query:  FLLTVCHSTRNRIVAEQFQHSKVTVSRQFSRVLRVMCMLGC---------------------------DNCVGAIDGTHVSVWAPASKQTPYRGRKVIVT
        FL  + H+TRNR+ AE+FQHSK TVSRQF+RVL+ +C  G                            ++C+GAID TH++ WAPASKQ PYRGRK+ VT
Subjt:  FLLTVCHSTRNRIVAEQFQHSKVTVSRQFSRVLRVMCMLGC---------------------------DNCVGAIDGTHVSVWAPASKQTPYRGRKVIVT

Query:  QNIMCACSFNMLFTFVYTGWEGTANDSRVLLDAIGREGNNFSLPPEGNIIL-------------------YHLRDYRGRGRHPRGPQEFFNYKHSSLRNV
        QNIMCACSF+MLFT+VYTGWE TANDSRVL+DAI RE NNF LP EG   +                   YHL DYRGRGRHPRG  E FNY+HSSLRNV
Subjt:  QNIMCACSFNMLFTFVYTGWEGTANDSRVLLDAIGREGNNFSLPPEGNIIL-------------------YHLRDYRGRGRHPRGPQEFFNYKHSSLRNV

Query:  IERCFGVLKARFSILKLMPNYPIRK--------------TTMTSTRDTLFEEYQVADLEVPDEESLGGTQKFLDMNLSQAYISRMNGVRDDIVGTM
        I+RCFGVLKARF ILKLMPN PIRK                M S  D +F +YQ  DL+V DEES    Q+ + ++   A  ++M+ VRD I   M
Subjt:  IERCFGVLKARFSILKLMPNYPIRK--------------TTMTSTRDTLFEEYQVADLEVPDEESLGGTQKFLDMNLSQAYISRMNGVRDDIVGTM

XP_018505745.1 PREDICTED: uncharacterized protein LOC103958625 [Pyrus x bretschneideri]5.6e-10053.48Show/hide
Query:  TSSLNGQEYMLELLNGHPDRIFYSFRMDKNTFRALCERLRQSNYLVNDKIISIEEAVGMFLLTVCHSTRNRIVAEQFQHSKVTVSRQFSRVLRVMCMLGC
        TS  +G++Y++ELLNGHP R+F   RMDKNTFR LC  LR+ N+L +D+ I +EEAV MFL T+ H+ RNR++AE FQHSK TVSRQF+RVL+ +C LG 
Subjt:  TSSLNGQEYMLELLNGHPDRIFYSFRMDKNTFRALCERLRQSNYLVNDKIISIEEAVGMFLLTVCHSTRNRIVAEQFQHSKVTVSRQFSRVLRVMCMLGC

Query:  -------------------------DNCVGAIDGTHVSVWAPASKQTPYRGRKVIVTQNIMCACSFNMLFTFVYTGWEGTANDSRVLLDAIGREGNNFSL
                                  NC+GAIDGTH+S W P+SKQ  YRGRKV VTQNIM ACSFNM+FT+VYTGWEGTANDSRVL+DAI RE N F +
Subjt:  -------------------------DNCVGAIDGTHVSVWAPASKQTPYRGRKVIVTQNIMCACSFNMLFTFVYTGWEGTANDSRVLLDAIGREGNNFSL

Query:  PPEGNI-------------------ILYHLRDYRGRGRHPRGPQEFFNYKHSSLRNVIERCFGVLKARFSILKLMPNYPIRK--------------TTMT
        P EG                     + YHLRD+RGRG+ PRG  E FN++HSSLRNV+ERC GVLK RF ILKLMPNYPIRK                M 
Subjt:  PPEGNI-------------------ILYHLRDYRGRGRHPRGPQEFFNYKHSSLRNVIERCFGVLKARFSILKLMPNYPIRK--------------TTMT

Query:  STRDTLFEEYQVADLEVPDEESLGGTQKFLDMNLSQAYISRMNGVRDDIVGTMWLDYCN
        S  DTLF++++  D++V DEES G  Q+  +M+L+   +  MN VRD I G+MWLDY N
Subjt:  STRDTLFEEYQVADLEVPDEESLGGTQKFLDMNLSQAYISRMNGVRDDIVGTMWLDYCN

XP_024164615.1 uncharacterized protein LOC112171704 [Rosa chinensis]9.3e-9549.49Show/hide
Query:  DHDSDSSTSSDEMVGRMLIVTTIVNEYECEIPKHPCHTSSLNGQEYMLELLNGHPDRIFYSFRMDKNTFRALCERLRQSNYLVNDKIISIEEAVGMFLLT
        D DSDSS+  DEM   M+I   I + +   I K PCHTS L+G EY+ ELLNGHPDRI+ SFRMDK+ F+ LC  L   N L +D+ + I+EAV +FL  
Subjt:  DHDSDSSTSSDEMVGRMLIVTTIVNEYECEIPKHPCHTSSLNGQEYMLELLNGHPDRIFYSFRMDKNTFRALCERLRQSNYLVNDKIISIEEAVGMFLLT

Query:  VCHSTRNRIVAEQFQHSKVTVSRQFSRVLRVMCMLG-------------------------CDNCVGAIDGTHVSVWAPASKQTPYRGRKVIVTQNIMCA
        V HS R R+ AE+FQ SK T+ RQF RVL  +C L                           + C+GAIDGTHV+ WAPA KQT YRGRKV+VTQN+MCA
Subjt:  VCHSTRNRIVAEQFQHSKVTVSRQFSRVLRVMCMLG-------------------------CDNCVGAIDGTHVSVWAPASKQTPYRGRKVIVTQNIMCA

Query:  CSFNMLFTFVYTGWEGTANDSRVLLDAIGREGNNFSLPPEGNIIL-------------------YHLRDYRGRGRHPRGPQEFFNYKHSSLRNVIERCFG
        CSF+M+FTFVYTGWEGTANDSRV  DA+ R  N F  P EG   +                   YHLRDYRG  R PRGP+E FNY+HSSLRNVIERCFG
Subjt:  CSFNMLFTFVYTGWEGTANDSRVLLDAIGREGNNFSLPPEGNIIL-------------------YHLRDYRGRGRHPRGPQEFFNYKHSSLRNVIERCFG

Query:  VLKARFSILKLMPNYPIRK--------------TTMTSTRDTLFEEYQVADLEVPDEESLGGTQKFLDMNLSQAYISRMNGVRDDIVGTMWLDY
        VLKARF ILK MPNYP R+                  + RD LFE + V D+   +E S   T   LDM  SQ  +++M  VR++I   +W ++
Subjt:  VLKARFSILKLMPNYPIRK--------------TTMTSTRDTLFEEYQVADLEVPDEESLGGTQKFLDMNLSQAYISRMNGVRDDIVGTMWLDY

XP_030483779.1 protein ALP1-like [Cannabis sativa]1.6e-9448.62Show/hide
Query:  SSTSSDEMVGRMLIVTTIVNEYECEIPKHPCHTSSLNGQEYMLELLNGHPDRIFYSFRMDKNTFRALCERLRQSNYLVNDKIISIEEAVGMFLLTVCHST
        SS+SS+E+VG++L+V  +   ++ +  + P  TS+++G+EY++ELLNGHPDR+FY+ RMD NTFR+LC RL +   + +DK+IS+EEAV MFL  V H+ 
Subjt:  SSTSSDEMVGRMLIVTTIVNEYECEIPKHPCHTSSLNGQEYMLELLNGHPDRIFYSFRMDKNTFRALCERLRQSNYLVNDKIISIEEAVGMFLLTVCHST

Query:  RNRIVAEQFQHSKVTVSRQFSRVLRVMCMLGCD-------------------------NCVGAIDGTHVSVWAPASKQTPYRGRKVIVTQNIMCACSFNM
        R R VAE++QHS  TV RQFSRVL  +C LG +                         +CVGAIDGTHVS  APA KQ  YRGRKV VTQN+M ACSFNM
Subjt:  RNRIVAEQFQHSKVTVSRQFSRVLRVMCMLGCD-------------------------NCVGAIDGTHVSVWAPASKQTPYRGRKVIVTQNIMCACSFNM

Query:  LFTFVYTGWEGTANDSRVLLDAIGREGNNFSLPPEGNIIL-------------------YHLRDYRGRGRHPRGPQEFFNYKHSSLRNVIERCFGVLKAR
        +FT+VY GWEGTANDSRVL DAI R+ ++F +PP+G   +                   YHLR +RGRG HPRG  E FNY+HSSLRNVIERCFG+LKAR
Subjt:  LFTFVYTGWEGTANDSRVLLDAIGREGNNFSLPPEGNIIL-------------------YHLRDYRGRGRHPRGPQEFFNYKHSSLRNVIERCFGVLKAR

Query:  FSILKLMPNYPIRK--------------TTMTSTRDTLFEEYQVADLEVPDEESL----GGTQKFL----DMNLSQAYISRMNGVRDDIVGTMWLDYCN
        F ILK MP Y + K                M S  D +F++Y +    + D E      GG +  +    ++N SQAY+++M  VRDDI G MWL Y N
Subjt:  FSILKLMPNYPIRK--------------TTMTSTRDTLFEEYQVADLEVPDEESL----GGTQKFL----DMNLSQAYISRMNGVRDDIVGTMWLDYCN

TrEMBL top hitse value%identityAlignment
A0A2P6SH66 Putative harbinger transposase-derived nuclease domain-containing protein1.3e-8948.15Show/hide
Query:  MLIVTTIVNEYECEIPKHPCHTSSLNGQEYMLELLNGHPDRIFYSFRMDKNTFRALCERLRQSNYLVNDKIISIEEAVGMFLLTVCHSTRNRIVAEQFQH
        M+I   I + +   I K PCHTS L+G EY+ ELLNGHPDRI+ SFRMDK+ F+ LC  L+    L +D+ + ++EAV +FL  V HS R R+ AE+FQ 
Subjt:  MLIVTTIVNEYECEIPKHPCHTSSLNGQEYMLELLNGHPDRIFYSFRMDKNTFRALCERLRQSNYLVNDKIISIEEAVGMFLLTVCHSTRNRIVAEQFQH

Query:  SKVTVSRQFSRVLRVMCMLG-------------------------CDNCVGAIDGTHVSVWAPASKQTPYRGRKVIVTQNIMCACSFNMLFTFVYTGWEG
        SK T+ RQF RVL  +CML                           + C+GAIDGTHV+ WAPA KQT YRGRKV+VTQN+MCACSF+M+FTFVYTGWEG
Subjt:  SKVTVSRQFSRVLRVMCMLG-------------------------CDNCVGAIDGTHVSVWAPASKQTPYRGRKVIVTQNIMCACSFNMLFTFVYTGWEG

Query:  TANDSRVLLDAIGREGNNFSLPPEGNIIL-------------------YHLRDYRGRGRHPRGPQEFFNYKHSSLRNVIERCFGVLKARFSILKLMPNYP
        TANDSR+  DA+ R  N F  P EG   +                   YHLRDYRG  R PRGP+E FNY+HSSLRNVIERCFGVLK RF ILK MPNYP
Subjt:  TANDSRVLLDAIGREGNNFSLPPEGNIIL-------------------YHLRDYRGRGRHPRGPQEFFNYKHSSLRNVIERCFGVLKARFSILKLMPNYP

Query:  IRK--------------TTMTSTRDTLFEEYQVADLEVPDEESLGGTQKFLDMNLSQAYISRMNGVRDDIVGTMWLDY
         R+                  + RD LFE + V D+   +E +   T   LDM  SQ  ++++  VR+ I   +W D+
Subjt:  IRK--------------TTMTSTRDTLFEEYQVADLEVPDEESLGGTQKFLDMNLSQAYISRMNGVRDDIVGTMWLDY

A0A438FKK5 Protein ALP1-like9.1e-8041.67Show/hide
Query:  LDSRTMAQNGDHDSDSSTSSDEMVGRMLIVTTIVNEY-ECEIPKHPCHTSSLNGQEYMLELLNGHPDRIFYSFRMDKNTFRALCERLRQSNYLVNDKIIS
        +D    +    +DS SS+   + +   + +  I+NEY E  + K P  TS L+G +++ +++ GHP   +  FRMDK TF  LC+ L++   L + + ++
Subjt:  LDSRTMAQNGDHDSDSSTSSDEMVGRMLIVTTIVNEY-ECEIPKHPCHTSSLNGQEYMLELLNGHPDRIFYSFRMDKNTFRALCERLRQSNYLVNDKIIS

Query:  IEEAVGMFLLTVCHSTRNRIVAEQFQHSKVTVSRQFSRVLRVMCMLG----CDN----------------------CVGAIDGTHVSVWAPASKQTPYRG
        +EEAV MFLL V H+ R R+VA++FQHS  TV+R F  V R +C LG    C N                      C+GAIDGTH+S W PA +QT +RG
Subjt:  IEEAVGMFLLTVCHSTRNRIVAEQFQHSKVTVSRQFSRVLRVMCMLG----CDN----------------------CVGAIDGTHVSVWAPASKQTPYRG

Query:  RKVIVTQNIMCACSFNMLFTFVYTGWEGTANDSRVLLDAIGREGNNFSLPPEGNIIL-------------------YHLRDYRGRGRHPRGPQEFFNYKH
        RK ++TQN+MCAC+F+M+FTFVY GWEGTAND+RV LDA+ R   NF  P EG   +                   YHL++YRGR   P   +E FNY+H
Subjt:  RKVIVTQNIMCACSFNMLFTFVYTGWEGTANDSRVLLDAIGREGNNFSLPPEGNIIL-------------------YHLRDYRGRGRHPRGPQEFFNYKH

Query:  SSLRNVIERCFGVLKARFSILKLMPNY-PIRKTTMT------------STR-DTLFEEYQVADLEVPDEESLGGTQKFLDMNLSQAYISRMNGVRDDIVG
        SSLRN+IERCFGVLK RF ILK+MP Y P R+ ++             STR D LF EY+V DL +  EE    ++    ++LS    + M   RD I  
Subjt:  SSLRNVIERCFGVLKARFSILKLMPNY-PIRKTTMT------------STR-DTLFEEYQVADLEVPDEESLGGTQKFLDMNLSQAYISRMNGVRDDIVG

Query:  TMWLDYCN
         MW +Y N
Subjt:  TMWLDYCN

A0A438IZU1 Protein ALP1-like3.5e-7941.42Show/hide
Query:  LDSRTMAQNGDHDSDSSTSSDEMVGRMLIVTTIVNEY-ECEIPKHPCHTSSLNGQEYMLELLNGHPDRIFYSFRMDKNTFRALCERLRQSNYLVNDKIIS
        +D    +    +DS SS+   + +   + +  I+NEY E  + K P  TS L+G +++ +++ GHP   +  FRMDK TF  LC+ L++   L + + ++
Subjt:  LDSRTMAQNGDHDSDSSTSSDEMVGRMLIVTTIVNEY-ECEIPKHPCHTSSLNGQEYMLELLNGHPDRIFYSFRMDKNTFRALCERLRQSNYLVNDKIIS

Query:  IEEAVGMFLLTVCHSTRNRIVAEQFQHSKVTVSRQFSRVLRVMCMLG----CDN----------------------CVGAIDGTHVSVWAPASKQTPYRG
        +EEA+ MFLL V H+ R R+VA++FQHS  TV+R F  V R +C LG    C N                      C+GAIDGTH+S W PA +QT +RG
Subjt:  IEEAVGMFLLTVCHSTRNRIVAEQFQHSKVTVSRQFSRVLRVMCMLG----CDN----------------------CVGAIDGTHVSVWAPASKQTPYRG

Query:  RKVIVTQNIMCACSFNMLFTFVYTGWEGTANDSRVLLDAIGREGNNFSLPPEGNIIL-------------------YHLRDYRGRGRHPRGPQEFFNYKH
        RK ++TQN+MCAC+F+M+FTFVY GWEGTAND+RV LDA+ R   NF  P EG   +                   YHL++YRGR   P   +E FNY+H
Subjt:  RKVIVTQNIMCACSFNMLFTFVYTGWEGTANDSRVLLDAIGREGNNFSLPPEGNIIL-------------------YHLRDYRGRGRHPRGPQEFFNYKH

Query:  SSLRNVIERCFGVLKARFSILKLMPNY-PIRKTTMT------------STR-DTLFEEYQVADLEVPDEESLGGTQKFLDMNLSQAYISRMNGVRDDIVG
        SSLRN+IERCFGVLK RF ILK+MP Y P R+ ++             STR D LF EY+V DL +  EE    ++    ++LS    + M   RD I  
Subjt:  SSLRNVIERCFGVLKARFSILKLMPNY-PIRKTTMT------------STR-DTLFEEYQVADLEVPDEESLGGTQKFLDMNLSQAYISRMNGVRDDIVG

Query:  TMWLDYCN
         MW +Y N
Subjt:  TMWLDYCN

A0A4Y7J673 Uncharacterized protein6.7e-9946.12Show/hide
Query:  MHYSSTEVPPNSDEERYLEREFVSSGAHVELGVDDEETEIHGDWGETPAQSSKRRLVGPSLDSRTMAQNGDHDSDSSTSSDEMVGRMLIVTTIVNEYECE
        M  +ST  PP+SD ER LE EF+  GA   L   +   EI    G +  QS K       ++SR  + + D  SDS  S  E++        +V E    
Subjt:  MHYSSTEVPPNSDEERYLEREFVSSGAHVELGVDDEETEIHGDWGETPAQSSKRRLVGPSLDSRTMAQNGDHDSDSSTSSDEMVGRMLIVTTIVNEYECE

Query:  IPKHPCHTSSLNGQEYMLELLNGHPDRIFYSFRMDKNTFRALCERLRQSNYLVNDKIISIEEAVGMFLLTVCHSTRNRIVAEQFQHSKVTVSRQFSRVLR
          K P  TS L+G+E++ ELLNGHP R++   RMD +TF  LC  LR +++L +D+ +S+EEAVG+FL TV  S RNR+VAE FQHS  TV R F +VL+
Subjt:  IPKHPCHTSSLNGQEYMLELLNGHPDRIFYSFRMDKNTFRALCERLRQSNYLVNDKIISIEEAVGMFLLTVCHSTRNRIVAEQFQHSKVTVSRQFSRVLR

Query:  VMCMLGC-------------------------DNCVGAIDGTHVSVWAPASKQTPYRGRKVIVTQNIMCACSFNMLFTFVYTGWEGTANDSRVLLDAIGR
         +C LGC                          +CVGAIDGTH+S   PASKQ P+RGRK  +TQNIMCACSF+MLFTFVYTGWEGTAND+RVL+DAI  
Subjt:  VMCMLGC-------------------------DNCVGAIDGTHVSVWAPASKQTPYRGRKVIVTQNIMCACSFNMLFTFVYTGWEGTANDSRVLLDAIGR

Query:  EGNNFSLPPEGNIIL-------------------YHLRDYRGRGRHPRGPQEFFNYKHSSLRNVIERCFGVLKARFSILKLMPNYPIRK-----------
        E N F +P EG   +                   YHLRD+RGR R  +GP E FN++HSSLRNVIERCFGV K+RF ILK MPNYP+R+           
Subjt:  EGNNFSLPPEGNIIL-------------------YHLRDYRGRGRHPRGPQEFFNYKHSSLRNVIERCFGVLKARFSILKLMPNYPIRK-----------

Query:  ---TTMTSTRDTLFEEYQVADLEVPDEE--SLGGTQKFLDMNLSQAYISRMNGVRDDIVGTMWL
             + S  D LF ++   DL V DEE  S G     +D+++S A I  MN VRDDI GTMW+
Subjt:  ---TTMTSTRDTLFEEYQVADLEVPDEE--SLGGTQKFLDMNLSQAYISRMNGVRDDIVGTMWL

A0A6P4ALU8 uncharacterized protein LOC1074220483.7e-8143.15Show/hide
Query:  DHDSDSSTSSDEMVGRMLIVTTIVNEYECEIPKHPCHTSSLNGQEYMLELLNGHPDRIFYSFRMDKNTFRALCERLRQSNYLVNDKIISIEEAVGMFLLT
        D +S SS+SSDE     ++   +    +  + K P  TS L+GQ ++ ELLNG     +  FRMDKN F +LC  L+Q  YL + K + +EEA+ MFL+ 
Subjt:  DHDSDSSTSSDEMVGRMLIVTTIVNEYECEIPKHPCHTSSLNGQEYMLELLNGHPDRIFYSFRMDKNTFRALCERLRQSNYLVNDKIISIEEAVGMFLLT

Query:  VCHSTRNRIVAEQFQHSKVTVSRQFSRVLRVMCMLG------------------------CDNCVGAIDGTHVSVWAPASKQTPYRGRKVIVTQNIMCAC
        + H+   R++A++FQHS  TV R F+  LR +C LG                         + C+GAIDGTH+S   PA KQ  YRGRK IVTQN++CAC
Subjt:  VCHSTRNRIVAEQFQHSKVTVSRQFSRVLRVMCMLG------------------------CDNCVGAIDGTHVSVWAPASKQTPYRGRKVIVTQNIMCAC

Query:  SFNMLFTFVYTGWEGTANDSRVLLDAIGREGNNFSLPPEGNIIL-------------------YHLRDYRGRGRHPRGPQEFFNYKHSSLRNVIERCFGV
        +FNM+FTFVY GWEGTANDSRV LDAI R  N F LP EG   +                   YHL++Y GRGR PRGP+E FNY+HSSLRNVIERCFGV
Subjt:  SFNMLFTFVYTGWEGTANDSRVLLDAIGREGNNFSLPPEGNIIL-------------------YHLRDYRGRGRHPRGPQEFFNYKHSSLRNVIERCFGV

Query:  LKARFSILKLMPNYPIRKTTM--------------TSTRDTLFEEYQVADLEVPDEE-SLGGTQKFLDMNLSQAYISRMNGVRDDIVGTMWLDY
        LKARF ILK+MP Y   +  +              ++  D +F +++  + E+ DEE S  G+   +D  LS    + M   R+ +   MW DY
Subjt:  LKARFSILKLMPNYPIRKTTM--------------TSTRDTLFEEYQVADLEVPDEE-SLGGTQKFLDMNLSQAYISRMNGVRDDIVGTMWLDY

SwissProt top hitse value%identityAlignment
B0BN95 Putative nuclease HARBI16.4e-0626.28Show/hide
Query:  MLGCDNCVGAIDGTHVSVWAPASKQTPYRGRKVIVTQNIMCACSFNMLFTFVYTGWEGTANDSRVLLDAIGREGNNFSLPPEGNII---LYHLRDYRGRG
        + G    +GA+D  HV++ AP ++   Y  RK + + N +  C        V T W G+  D  VL  +         +P +  ++    + L  +    
Subjt:  MLGCDNCVGAIDGTHVSVWAPASKQTPYRGRKVIVTQNIMCACSFNMLFTFVYTGWEGTANDSRVLLDAIGREGNNFSLPPEGNII---LYHLRDYRGRG

Query:  RH-PRGPQEF-FNYKHSSLRNVIERCFGVLKARFSIL
         H P  P E+ +N  HS+  +VIE+    L  RF  L
Subjt:  RH-PRGPQEF-FNYKHSSLRNVIERCFGVLKARFSIL

Arabidopsis top hitse value%identityAlignment
AT1G43722.1 unknown protein5.0e-2228.81Show/hide
Query:  LIVTTIVNEYECEIPKHPCHTSSLNGQEYMLELLNGHPDRIFYSFRMDKNTFRALCERLRQSNYLVNDKI-ISIEEAVGMFLLTVCHSTRNRIVAEQFQH
        L++   +N Y+    + P       G   +   L           RM    F  LC  L Q+NY +   + ISIEE+V MFL    H+   R V  +F  
Subjt:  LIVTTIVNEYECEIPKHPCHTSSLNGQEYMLELLNGHPDRIFYSFRMDKNTFRALCERLRQSNYLVNDKI-ISIEEAVGMFLLTVCHSTRNRIVAEQFQH

Query:  SKVTVSRQFSRVLRVMCMLGCD----------------------------NCVGAIDGTHVSVWAPASKQTPYRGRKVIVTQNIMCACSFNMLFTFVYTG
        ++ TV R+F  VL    +L CD                              VGA+DGTHV V      Q  Y  R    + NIM  C   MLFT+++ G
Subjt:  SKVTVSRQFSRVLRVMCMLGCD----------------------------NCVGAIDGTHVSVWAPASKQTPYRGRKVIVTQNIMCACSFNMLFTFVYTG

Query:  WEGTANDSRVLLDAIGREGNNFSLPP------------------------EGNIILYHLRDYRGRGRHPRGPQEFFNYKHSSLRNVIERCFGVLK
          G+  D+ VL  A  +  + F LPP                           ++ YH+  +   G  PR   E FN  H+SLR+VIER F + K
Subjt:  WEGTANDSRVLLDAIGREGNNFSLPP------------------------EGNIILYHLRDYRGRGRHPRGPQEFFNYKHSSLRNVIERCFGVLK

AT5G28730.1 unknown protein5.2e-1126.9Show/hide
Query:  RMDKNTFRALCERLRQSNYLVNDKIISIEEAVGMFLLTVCHSTRNRIVAEQFQHSKVTVSRQFSRVLRVMCMLGCDNCVGAIDGTHVSVWAPASKQTPY-
        RM    F  LCE L     L +   IS++E+V +FL+    +   R +A +F H++ T+ R+F  VL+ M  L  +           ++       T Y 
Subjt:  RMDKNTFRALCERLRQSNYLVNDKIISIEEAVGMFLLTVCHSTRNRIVAEQFQHSKVTVSRQFSRVLRVMCMLGCDNCVGAIDGTHVSVWAPASKQTPY-

Query:  ---RGRKVIVTQNIMCACSFNMLFTFVYTGWEGTANDSRVLLDAIGREGNNFSLPPEGNIILYHLRDYRGRGRHPRGPQEFFNYKHSSLRNVIERCF
                I + N++  C  +MLFT+ + G  G+ +D+RVL  AI  +   F +PP+     Y+L D      + RG    +  +H   +++I   F
Subjt:  ---RGRKVIVTQNIMCACSFNMLFTFVYTGWEGTANDSRVLLDAIGREGNNFSLPPEGNIILYHLRDYRGRGRHPRGPQEFFNYKHSSLRNVIERCF

AT5G28950.1 unknown protein2.7e-1544.16Show/hide
Query:  NCVGAIDGTHVSVWAPASKQTPYRGRKVIVTQNIMCACSFNMLFTFVYTGWEGTANDSRVLLDAIGREGNNFSLPPE
        +CVGAID TH+       K   +R RK  ++QN++ AC+F++ F +V +GWEG+A+DS+VL DA+ R  N   +P E
Subjt:  NCVGAIDGTHVSVWAPASKQTPYRGRKVIVTQNIMCACSFNMLFTFVYTGWEGTANDSRVLLDAIGREGNNFSLPPE

AT5G35695.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912)6.8e-1939.1Show/hide
Query:  LFTFVYTGWEGTANDSRVLLDAIGR---------EGNNFSLPPEGNIILYHLRDYRGRGRHPRGPQEFFNYKHSSLRNVIERCFGVLKARFSILKLMPNY
        +F +V +GWEG+A+DSRVL DA+ +            NF  P  G  + YHL+++ G+ R P  P E FN +H SLRNVIER FG+ K+RF+I K  P +
Subjt:  LFTFVYTGWEGTANDSRVLLDAIGR---------EGNNFSLPPEGNIILYHLRDYRGRGRHPRGPQEFFNYKHSSLRNVIERCFGVLKARFSILKLMPNY

Query:  PIRK-----TTMTSTRDTLFEEYQVADLEVPDE
          +K      T  +  + L +E +  + + PDE
Subjt:  PIRK-----TTMTSTRDTLFEEYQVADLEVPDE

AT5G41980.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912)6.1e-3631.31Show/hide
Query:  IPKHPCHTSSLNGQEYMLELLNGHPDRIFYSFRMDKNTFRALCERLRQSNYLVNDKIISIEEAVGMFLLTVCHSTRNRIVAEQFQHSKVTVSRQFSRVLR
        +PK     S  +G +++ ++LNG  ++ F +FRMDK  F  LC+ L+    L +   I IE  + +FL  + H+ R R V E F +S  T+SR F+ VL 
Subjt:  IPKHPCHTSSLNGQEYMLELLNGHPDRIFYSFRMDKNTFRALCERLRQSNYLVNDKIISIEEAVGMFLLTVCHSTRNRIVAEQFQHSKVTVSRQFSRVLR

Query:  VMCMLGCD-------------------NCVGAIDGTHVSVWAPASKQTPYRGRKVIVTQNIMCACSFNMLFTFVYTGWEGTANDSRVLLDAIGREGNNFS
         +  +  D                   +CVG +D  H+ V     +Q P+R    ++TQN++ A SF++ F +V  GWEG+A+D +VL  A+ R  N   
Subjt:  VMCMLGCD-------------------NCVGAIDGTHVSVWAPASKQTPYRGRKVIVTQNIMCACSFNMLFTFVYTGWEGTANDSRVLLDAIGREGNNFS

Query:  LPPEGNIILYH--------LRDYRGRGRHPR-GPQEFFNYKHSSLRNVIERCFGVLKARFSILKLMPNYPIRKTTMTSTRDTLFEEYQVADLEVPDE
        +P     I+ +        +  Y G   + R   +E FN +H  L   I R FG LK RF IL   P YP++              Y    LE PD+
Subjt:  LPPEGNIILYH--------LRDYRGRGRHPR-GPQEFFNYKHSSLRNVIERCFGVLKARFSILKLMPNYPIRKTTMTSTRDTLFEEYQVADLEVPDE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCACTATTCATCGACTGAGGTACCTCCTAACTCTGATGAAGAGAGATACTTAGAGAGGGAATTTGTTTCATCTGGGGCTCATGTTGAGTTAGGTGTTGATGATGAAGA
AACTGAGATCCATGGAGATTGGGGTGAGACACCTGCTCAATCTTCTAAACGTCGTCTAGTTGGCCCATCATTGGATAGTCGTACTATGGCACAAAATGGCGATCATGATA
GCGACTCATCGACATCTTCTGATGAGATGGTTGGTCGAATGCTTATAGTCACTACAATTGTCAATGAATATGAATGTGAAATTCCTAAACACCCATGTCATACATCTTCA
TTAAATGGACAGGAATATATGTTGGAGTTGTTGAATGGACATCCTGATAGAATTTTTTATTCTTTTCGAATGGATAAAAATACATTTAGAGCTTTATGTGAGAGATTAAG
ACAATCAAATTATTTAGTGAATGATAAGATTATCAGCATTGAGGAAGCAGTTGGAATGTTCTTACTCACAGTATGTCATAGCACACGTAATAGAATTGTAGCTGAACAAT
TTCAACACTCTAAAGTGACTGTGTCTCGACAATTCTCTAGAGTTTTAAGAGTTATGTGTATGTTGGGATGTGATAATTGTGTTGGTGCAATTGATGGAACTCACGTAAGT
GTGTGGGCCCCTGCATCTAAACAAACACCATATCGTGGAAGAAAGGTTATTGTGACTCAAAATATTATGTGTGCATGCTCATTCAATATGTTGTTCACCTTTGTCTATAC
TGGTTGGGAAGGTACTGCTAATGATTCTAGAGTATTATTGGATGCTATTGGTAGGGAAGGGAATAATTTTTCGTTACCACCTGAAGGAAATATTATCTTATATCATTTAA
GAGATTATAGAGGAAGAGGAAGACATCCACGAGGACCACAAGAATTTTTTAATTATAAACACTCTTCATTACGCAATGTGATTGAACGTTGCTTTGGTGTACTTAAAGCT
CGATTCTCCATCTTAAAATTGATGCCAAACTACCCAATTAGAAAAACAACAATGACTTCAACCAGAGATACCTTATTTGAAGAATATCAAGTTGCTGATTTAGAAGTACC
TGATGAAGAAAGCTTGGGTGGAACACAAAAATTTCTTGATATGAACTTAAGTCAAGCTTATATAAGCCGAATGAATGGTGTGAGGGATGATATTGTCGGTACCATGTGGT
TAGATTATTGTAATGCATGA
mRNA sequenceShow/hide mRNA sequence
ATGCACTATTCATCGACTGAGGTACCTCCTAACTCTGATGAAGAGAGATACTTAGAGAGGGAATTTGTTTCATCTGGGGCTCATGTTGAGTTAGGTGTTGATGATGAAGA
AACTGAGATCCATGGAGATTGGGGTGAGACACCTGCTCAATCTTCTAAACGTCGTCTAGTTGGCCCATCATTGGATAGTCGTACTATGGCACAAAATGGCGATCATGATA
GCGACTCATCGACATCTTCTGATGAGATGGTTGGTCGAATGCTTATAGTCACTACAATTGTCAATGAATATGAATGTGAAATTCCTAAACACCCATGTCATACATCTTCA
TTAAATGGACAGGAATATATGTTGGAGTTGTTGAATGGACATCCTGATAGAATTTTTTATTCTTTTCGAATGGATAAAAATACATTTAGAGCTTTATGTGAGAGATTAAG
ACAATCAAATTATTTAGTGAATGATAAGATTATCAGCATTGAGGAAGCAGTTGGAATGTTCTTACTCACAGTATGTCATAGCACACGTAATAGAATTGTAGCTGAACAAT
TTCAACACTCTAAAGTGACTGTGTCTCGACAATTCTCTAGAGTTTTAAGAGTTATGTGTATGTTGGGATGTGATAATTGTGTTGGTGCAATTGATGGAACTCACGTAAGT
GTGTGGGCCCCTGCATCTAAACAAACACCATATCGTGGAAGAAAGGTTATTGTGACTCAAAATATTATGTGTGCATGCTCATTCAATATGTTGTTCACCTTTGTCTATAC
TGGTTGGGAAGGTACTGCTAATGATTCTAGAGTATTATTGGATGCTATTGGTAGGGAAGGGAATAATTTTTCGTTACCACCTGAAGGAAATATTATCTTATATCATTTAA
GAGATTATAGAGGAAGAGGAAGACATCCACGAGGACCACAAGAATTTTTTAATTATAAACACTCTTCATTACGCAATGTGATTGAACGTTGCTTTGGTGTACTTAAAGCT
CGATTCTCCATCTTAAAATTGATGCCAAACTACCCAATTAGAAAAACAACAATGACTTCAACCAGAGATACCTTATTTGAAGAATATCAAGTTGCTGATTTAGAAGTACC
TGATGAAGAAAGCTTGGGTGGAACACAAAAATTTCTTGATATGAACTTAAGTCAAGCTTATATAAGCCGAATGAATGGTGTGAGGGATGATATTGTCGGTACCATGTGGT
TAGATTATTGTAATGCATGA
Protein sequenceShow/hide protein sequence
MHYSSTEVPPNSDEERYLEREFVSSGAHVELGVDDEETEIHGDWGETPAQSSKRRLVGPSLDSRTMAQNGDHDSDSSTSSDEMVGRMLIVTTIVNEYECEIPKHPCHTSS
LNGQEYMLELLNGHPDRIFYSFRMDKNTFRALCERLRQSNYLVNDKIISIEEAVGMFLLTVCHSTRNRIVAEQFQHSKVTVSRQFSRVLRVMCMLGCDNCVGAIDGTHVS
VWAPASKQTPYRGRKVIVTQNIMCACSFNMLFTFVYTGWEGTANDSRVLLDAIGREGNNFSLPPEGNIILYHLRDYRGRGRHPRGPQEFFNYKHSSLRNVIERCFGVLKA
RFSILKLMPNYPIRKTTMTSTRDTLFEEYQVADLEVPDEESLGGTQKFLDMNLSQAYISRMNGVRDDIVGTMWLDYCNA