; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0016828 (gene) of Snake gourd v1 genome

Gene IDTan0016828
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionProtein ALP1-like
Genome locationLG02:83927014..83929120
RNA-Seq ExpressionTan0016828
SyntenyTan0016828
Gene Ontology termsNA
InterPro domainsIPR027806 - Harbinger transposase-derived nuclease domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
RZC55946.1 hypothetical protein C5167_014815 [Papaver somniferum]1.5e-12551.4Show/hide
Query:  MHYSSTEVPPNSDEERDLEREFVSSGAHVELGVDDEETEIHGDWVGSSLDSRAYKTARTARQAILDDALKVWTKYYGTNDDHDCDSSTSSDEMVGRMLIV
        M  +ST  PP+SD ERDLE EF+  GA  +    +E   + G     S+   +  T+ ++                        DSS+ S++    +L+V
Subjt:  MHYSSTEVPPNSDEERDLEREFVSSGAHVELGVDDEETEIHGDWVGSSLDSRAYKTARTARQAILDDALKVWTKYYGTNDDHDCDSSTSSDEMVGRMLIV

Query:  TTIVNEYECEISKHPCHTSSLNGHEYILELLNGHPDRIFYSLRMDKNTFRALCERLRQSNYLVNDKIISIEEAVGMFLLTVCHSTRNRIVAERFQHSKET
          +   Y     K P  TS L+G E+I ELLNGHP R++  +RMD +TF  LC  LR +++L +D+ +S+EEAVG+FL TV  S RNR+VAE FQHS ET
Subjt:  TTIVNEYECEISKHPCHTSSLNGHEYILELLNGHPDRIFYSLRMDKNTFRALCERLRQSNYLVNDKIISIEEAVGMFLLTVCHSTRNRIVAERFQHSKET

Query:  VSRQFSRVLRAMCMLGCDVIQGPNMMETPPEILNNPKFNPWFKNCVGAIDGTHVSVWAPASKQTPYRGRKVIVTQNIMCACSFNMLFTFVYTGWEGTAND
        V R F +VL+A+C LGC +I+ PNM E PPEI+ NPKF PWF +CVGAIDGTH+S   PASKQ P+RGRK  +TQNIMCACSF+MLFTFVYTGWEGTAND
Subjt:  VSRQFSRVLRAMCMLGCDVIQGPNMMETPPEILNNPKFNPWFKNCVGAIDGTHVSVWAPASKQTPYRGRKVIVTQNIMCACSFNMLFTFVYTGWEGTAND

Query:  SRVLLDAIGKEGNNFPLPPE-----------------APHCGERYHLRDYRGRGRHPQGPQEFFNYKHSSLRNVIERCFGVLKARFPILKLMPNYPIRKQ
        +RVL+DAI  E N FP+P E                  P+ GERYHLRD+RGR R  +GP E FN++HSSLRNVIERCFGV K+RFPILK MPNYP+R+Q
Subjt:  SRVLLDAIGKEGNNFPLPPE-----------------APHCGERYHLRDYRGRGRHPQGPQEFFNYKHSSLRNVIERCFGVLKARFPILKLMPNYPIRKQ

Query:  RRIPIACCAIHNFIRMTSTRDTLFEEYQVADLEVPDEESLGGTQE--FLDMNLSQAYISRMNGVR
        R IP+ACC +HNFIR+ S  D LF ++   DL V DEES    QE   +D+++S A I  MN VR
Subjt:  RRIPIACCAIHNFIRMTSTRDTLFEEYQVADLEVPDEESLGGTQE--FLDMNLSQAYISRMNGVR

RZC87187.1 hypothetical protein C5167_042118 [Papaver somniferum]8.1e-12450.97Show/hide
Query:  MHYSSTEVPPNSDEERDLEREFVSSGAHVELGVDDEETEIHGDWVGSSLDSRAYKTARTARQAILDDALKVWTKYYGTNDDHDCDSSTSSDEMVGRMLIV
        M  +ST  PP+SD ERDLE EF+  GA  +    +E   + G     S+   +  T+ ++                        DSS+ S++    +L+V
Subjt:  MHYSSTEVPPNSDEERDLEREFVSSGAHVELGVDDEETEIHGDWVGSSLDSRAYKTARTARQAILDDALKVWTKYYGTNDDHDCDSSTSSDEMVGRMLIV

Query:  TTIVNEYECEISKHPCHTSSLNGHEYILELLNGHPDRIFYSLRMDKNTFRALCERLRQSNYLVNDKIISIEEAVGMFLLTVCHSTRNRIVAERFQHSKET
          +   Y     K P  TS L+G E+I ELLNGHP R++  +RMD +TF  LC  LR +++L +D+ +S+EEAVG+FL  V  S RNR+VAE FQHS ET
Subjt:  TTIVNEYECEISKHPCHTSSLNGHEYILELLNGHPDRIFYSLRMDKNTFRALCERLRQSNYLVNDKIISIEEAVGMFLLTVCHSTRNRIVAERFQHSKET

Query:  VSRQFSRVLRAMCMLGCDVIQGPNMMETPPEILNNPKFNPWFKNCVGAIDGTHVSVWAPASKQTPYRGRKVIVTQNIMCACSFNMLFTFVYTGWEGTAND
        V R F +VL+A+C LGC +I+ PNM E PPEI+ NPKF PWF +CVGAIDGTH+S   PASKQ P+RGRK  +TQNIMCACSF+MLFTFVYTGWEGTAND
Subjt:  VSRQFSRVLRAMCMLGCDVIQGPNMMETPPEILNNPKFNPWFKNCVGAIDGTHVSVWAPASKQTPYRGRKVIVTQNIMCACSFNMLFTFVYTGWEGTAND

Query:  SRVLLDAIGKEGNNFPLPPE-----------------APHCGERYHLRDYRGRGRHPQGPQEFFNYKHSSLRNVIERCFGVLKARFPILKLMPNYPIRKQ
        +RVL+DAI  E N F +P E                  P+ GERYHLRD+RGR R  +GP E FN++HSSLRNVIERCFGV K+RFPILK MPNYP+R+Q
Subjt:  SRVLLDAIGKEGNNFPLPPE-----------------APHCGERYHLRDYRGRGRHPQGPQEFFNYKHSSLRNVIERCFGVLKARFPILKLMPNYPIRKQ

Query:  RRIPIACCAIHNFIRMTSTRDTLFEEYQVADLEVPDEESLGGTQE--FLDMNLSQAYISRMNGVR
        R IP+ACC +HNFIR+ S  D LF ++   DL V DEES    QE   +D+++S A I  MN VR
Subjt:  RRIPIACCAIHNFIRMTSTRDTLFEEYQVADLEVPDEESLGGTQE--FLDMNLSQAYISRMNGVR

XP_008237544.2 PREDICTED: uncharacterized protein LOC103336277 [Prunus mume]1.0e-12657.73Show/hide
Query:  TNDDHDCDSSTSSDEMVGRMLIVTTIVNEYECEISKHPCHTSSLNGHEYILELLNGHPDRIFYSLRMDKNTFRALCERLRQSNYLVNDKIISIEEAVGMF
        ++   + DSS++ D+ +G +L+   + NEY     K PC  S L+G ++++ELL GHP+R+F   RMDKNTF  LC  LR  ++L +D+ I +EE++ MF
Subjt:  TNDDHDCDSSTSSDEMVGRMLIVTTIVNEYECEISKHPCHTSSLNGHEYILELLNGHPDRIFYSLRMDKNTFRALCERLRQSNYLVNDKIISIEEAVGMF

Query:  LLTVCHSTRNRIVAERFQHSKETVSRQFSRVLRAMCMLGCDVIQGPNMMETPPEILNNPKFNPWFK--NCVGAIDGTHVSVWAPASKQTPYRGRKVIVTQ
        L  + H+TRNR+ AE+FQHSKETVSRQF+RVL+A+C  G  +IQ PNM  TPP+I+ NP ++ WFK  +C+GAID TH++ WAPASKQ PYRGRK+ VTQ
Subjt:  LLTVCHSTRNRIVAERFQHSKETVSRQFSRVLRAMCMLGCDVIQGPNMMETPPEILNNPKFNPWFK--NCVGAIDGTHVSVWAPASKQTPYRGRKVIVTQ

Query:  NIMCACSFNMLFTFVYTGWEGTANDSRVLLDAIGKEGNNFPLPPE-----------------APHCGERYHLRDYRGRGRHPQGPQEFFNYKHSSLRNVI
        NIMCACSF+MLFT+VYTGWE TANDSRVL+DAI +E NNFPLP E                 AP+ GERYHL DYRGRGRHP+G  E FNY+HSSLRNVI
Subjt:  NIMCACSFNMLFTFVYTGWEGTANDSRVLLDAIGKEGNNFPLPPE-----------------APHCGERYHLRDYRGRGRHPQGPQEFFNYKHSSLRNVI

Query:  ERCFGVLKARFPILKLMPNYPIRKQRRIPIACCAIHNFIRMTSTRDTLFEEYQVADLEVPDEESLGGTQEFLDMNLSQAYISRMNGVR
        +RCFGVLKARFPILKLMPN PIRKQ+RIP+ACC +HNFIRM S  D +F +YQ  DL+V DEES    QE + ++   A  ++M+ VR
Subjt:  ERCFGVLKARFPILKLMPNYPIRKQRRIPIACCAIHNFIRMTSTRDTLFEEYQVADLEVPDEESLGGTQEFLDMNLSQAYISRMNGVR

XP_018505745.1 PREDICTED: uncharacterized protein LOC103958625 [Pyrus x bretschneideri]5.1e-12663.29Show/hide
Query:  TSSLNGHEYILELLNGHPDRIFYSLRMDKNTFRALCERLRQSNYLVNDKIISIEEAVGMFLLTVCHSTRNRIVAERFQHSKETVSRQFSRVLRAMCMLGC
        TS  +G +Y++ELLNGHP R+F  +RMDKNTFR LC  LR+ N+L +D+ I +EEAV MFL T+ H+ RNR++AE FQHSKETVSRQF+RVL+A+C LG 
Subjt:  TSSLNGHEYILELLNGHPDRIFYSLRMDKNTFRALCERLRQSNYLVNDKIISIEEAVGMFLLTVCHSTRNRIVAERFQHSKETVSRQFSRVLRAMCMLGC

Query:  DVIQGPNMMETPPEILNNPKFNPWFKNCVGAIDGTHVSVWAPASKQTPYRGRKVIVTQNIMCACSFNMLFTFVYTGWEGTANDSRVLLDAIGKEGNNFPL
         +IQ PNM  TP EIL NPK++PWF+NC+GAIDGTH+S W P+SKQ  YRGRKV VTQNIM ACSFNM+FT+VYTGWEGTANDSRVL+DAI +E N FP+
Subjt:  DVIQGPNMMETPPEILNNPKFNPWFKNCVGAIDGTHVSVWAPASKQTPYRGRKVIVTQNIMCACSFNMLFTFVYTGWEGTANDSRVLLDAIGKEGNNFPL

Query:  PPE-----------------APHCGERYHLRDYRGRGRHPQGPQEFFNYKHSSLRNVIERCFGVLKARFPILKLMPNYPIRKQRRIPIACCAIHNFIRMT
        P E                 AP+   RYHLRD+RGRG+ P+G  E FN++HSSLRNV+ERC GVLK RFPILKLMPNYPIRKQRRIPIACCA+HNFIRM 
Subjt:  PPE-----------------APHCGERYHLRDYRGRGRHPQGPQEFFNYKHSSLRNVIERCFGVLKARFPILKLMPNYPIRKQRRIPIACCAIHNFIRMT

Query:  STRDTLFEEYQVADLEVPDEESLGGTQEFLDMNLSQAYISRMNGVR
        S  DTLF++++  D++V DEES G  QE  +M+L+   +  MN VR
Subjt:  STRDTLFEEYQVADLEVPDEESLGGTQEFLDMNLSQAYISRMNGVR

XP_024164615.1 uncharacterized protein LOC112171704 [Rosa chinensis]4.7e-12458.85Show/hide
Query:  DDHDCDSSTSSDEMVGRMLIVTTIVNEYECEISKHPCHTSSLNGHEYILELLNGHPDRIFYSLRMDKNTFRALCERLRQSNYLVNDKIISIEEAVGMFLL
        +D D DSS+  DEM   M+I   I + +   I K PCHTS L+G EY+ ELLNGHPDRI+ S RMDK+ F+ LC  L   N L +D+ + I+EAV +FL 
Subjt:  DDHDCDSSTSSDEMVGRMLIVTTIVNEYECEISKHPCHTSSLNGHEYILELLNGHPDRIFYSLRMDKNTFRALCERLRQSNYLVNDKIISIEEAVGMFLL

Query:  TVCHSTRNRIVAERFQHSKETVSRQFSRVLRAMCMLGCDVIQGPNMMETPPEILNNPKFNPWFKNCVGAIDGTHVSVWAPASKQTPYRGRKVIVTQNIMC
         V HS R R+ AERFQ SK+T+ RQF RVL A+C L   +I+  +  ETPPEILNNPKF P+F+ C+GAIDGTHV+ WAPA KQT YRGRKV+VTQN+MC
Subjt:  TVCHSTRNRIVAERFQHSKETVSRQFSRVLRAMCMLGCDVIQGPNMMETPPEILNNPKFNPWFKNCVGAIDGTHVSVWAPASKQTPYRGRKVIVTQNIMC

Query:  ACSFNMLFTFVYTGWEGTANDSRVLLDAIGKEGNNFPLPPE-----------------APHCGERYHLRDYRGRGRHPQGPQEFFNYKHSSLRNVIERCF
        ACSF+M+FTFVYTGWEGTANDSRV  DA+ +  N FP P E                 AP+ GERYHLRDYRG  R P+GP+E FNY+HSSLRNVIERCF
Subjt:  ACSFNMLFTFVYTGWEGTANDSRVLLDAIGKEGNNFPLPPE-----------------APHCGERYHLRDYRGRGRHPQGPQEFFNYKHSSLRNVIERCF

Query:  GVLKARFPILKLMPNYPIRKQRRIPIACCAIHNFIRMTSTRDTLFEEYQVADLEVPDEESLGGTQEFLDMNLSQAYISRMNGVR
        GVLKARFPILK MPNYP R+QRRIPIACC +HNFIR  + RD LFE + V D+   +E S   T   LDM  SQ  +++M  VR
Subjt:  GVLKARFPILKLMPNYPIRKQRRIPIACCAIHNFIRMTSTRDTLFEEYQVADLEVPDEESLGGTQEFLDMNLSQAYISRMNGVR

TrEMBL top hitse value%identityAlignment
A0A2P6SH66 Putative harbinger transposase-derived nuclease domain-containing protein1.6e-11757.77Show/hide
Query:  MLIVTTIVNEYECEISKHPCHTSSLNGHEYILELLNGHPDRIFYSLRMDKNTFRALCERLRQSNYLVNDKIISIEEAVGMFLLTVCHSTRNRIVAERFQH
        M+I   I + +   I K PCHTS L+G EY+ ELLNGHPDRI+ S RMDK+ F+ LC  L+    L +D+ + ++EAV +FL  V HS R R+ AERFQ 
Subjt:  MLIVTTIVNEYECEISKHPCHTSSLNGHEYILELLNGHPDRIFYSLRMDKNTFRALCERLRQSNYLVNDKIISIEEAVGMFLLTVCHSTRNRIVAERFQH

Query:  SKETVSRQFSRVLRAMCMLGCDVIQGPNMMETPPEILNNPKFNPWFKNCVGAIDGTHVSVWAPASKQTPYRGRKVIVTQNIMCACSFNMLFTFVYTGWEG
        SK+T+ RQF RVL A+CML   +I+  +  ETP EILNNPKF P+F+ C+GAIDGTHV+ WAPA KQT YRGRKV+VTQN+MCACSF+M+FTFVYTGWEG
Subjt:  SKETVSRQFSRVLRAMCMLGCDVIQGPNMMETPPEILNNPKFNPWFKNCVGAIDGTHVSVWAPASKQTPYRGRKVIVTQNIMCACSFNMLFTFVYTGWEG

Query:  TANDSRVLLDAIGKEGNNFPLPPE-----------------APHCGERYHLRDYRGRGRHPQGPQEFFNYKHSSLRNVIERCFGVLKARFPILKLMPNYP
        TANDSR+  DA+ +  N FP P E                 AP+ GERYHLRDYRG  R P+GP+E FNY+HSSLRNVIERCFGVLK RFPILK MPNYP
Subjt:  TANDSRVLLDAIGKEGNNFPLPPE-----------------APHCGERYHLRDYRGRGRHPQGPQEFFNYKHSSLRNVIERCFGVLKARFPILKLMPNYP

Query:  IRKQRRIPIACCAIHNFIRMTSTRDTLFEEYQVADLEVPDEESLGGTQEFLDMNLSQAYISRMNGVR
         R+QRRIPIACC +HNFIR  + RD LFE + V D+   +E +   T   LDM  SQ  ++++  VR
Subjt:  IRKQRRIPIACCAIHNFIRMTSTRDTLFEEYQVADLEVPDEESLGGTQEFLDMNLSQAYISRMNGVR

A0A438CFP0 Protein ALP1-like9.7e-9948.14Show/hide
Query:  DSSTSSDEMVG-RMLIVTTIVNEY-ECEISKHPCHTSSLNGHEYILELLNGHPDRIFYSLRMDKNTFRALCERLRQSNYLVNDKIISIEEAVGMFLLTVC
        DS++SS+E      + +  I+NEY E  + K P  TS L+G +++ +++ GHP   +   RMDK TF  LC+ L++   L + + +++EEAV MFLL V 
Subjt:  DSSTSSDEMVG-RMLIVTTIVNEY-ECEISKHPCHTSSLNGHEYILELLNGHPDRIFYSLRMDKNTFRALCERLRQSNYLVNDKIISIEEAVGMFLLTVC

Query:  HSTRNRIVAERFQHSKETVSRQFSRVLRAMCMLGCDVIQGPNMM-ETPPEILNNPKFNPWFKNCVGAIDGTHVSVWAPASKQTPYRGRKVIVTQNIMCAC
        H+ R R+VA+RFQHS ETV+R F  V RA+C LG  +I   NM  E    + +NPK+ PWFK+C+GAIDGTH+S W PA +QT +RGRK ++TQN+MCAC
Subjt:  HSTRNRIVAERFQHSKETVSRQFSRVLRAMCMLGCDVIQGPNMM-ETPPEILNNPKFNPWFKNCVGAIDGTHVSVWAPASKQTPYRGRKVIVTQNIMCAC

Query:  SFNMLFTFVYTGWEGTANDSRVLLDAIGKEGNNFPLPPEA-----------------PHCGERYHLRDYRGRGRHPQGPQEFFNYKHSSLRNVIERCFGV
        +F+M+FTFVY GWEGTAND+RV LDA+ +   NFP P E                  P+ GERYHL++YRGR   P   +E FNY+HSSLRN+IERCFGV
Subjt:  SFNMLFTFVYTGWEGTANDSRVLLDAIGKEGNNFPLPPEA-----------------PHCGERYHLRDYRGRGRHPQGPQEFFNYKHSSLRNVIERCFGV

Query:  LKARFPILKLMPNYPIRKQRRIPIACCAIHNFIRMTSTRDTLFEEYQVADLEVP-DEESLGGTQEFLDMNLSQAYI
        LK RFPIL++MP Y   +Q  I +ACC +HN+IR+++  D LF EY+V DL +  +EES       +D++   A +
Subjt:  LKARFPILKLMPNYPIRKQRRIPIACCAIHNFIRMTSTRDTLFEEYQVADLEVP-DEESLGGTQEFLDMNLSQAYI

A0A4Y7J673 Uncharacterized protein7.1e-12651.4Show/hide
Query:  MHYSSTEVPPNSDEERDLEREFVSSGAHVELGVDDEETEIHGDWVGSSLDSRAYKTARTARQAILDDALKVWTKYYGTNDDHDCDSSTSSDEMVGRMLIV
        M  +ST  PP+SD ERDLE EF+  GA  +    +E   + G     S+   +  T+ ++                        DSS+ S++    +L+V
Subjt:  MHYSSTEVPPNSDEERDLEREFVSSGAHVELGVDDEETEIHGDWVGSSLDSRAYKTARTARQAILDDALKVWTKYYGTNDDHDCDSSTSSDEMVGRMLIV

Query:  TTIVNEYECEISKHPCHTSSLNGHEYILELLNGHPDRIFYSLRMDKNTFRALCERLRQSNYLVNDKIISIEEAVGMFLLTVCHSTRNRIVAERFQHSKET
          +   Y     K P  TS L+G E+I ELLNGHP R++  +RMD +TF  LC  LR +++L +D+ +S+EEAVG+FL TV  S RNR+VAE FQHS ET
Subjt:  TTIVNEYECEISKHPCHTSSLNGHEYILELLNGHPDRIFYSLRMDKNTFRALCERLRQSNYLVNDKIISIEEAVGMFLLTVCHSTRNRIVAERFQHSKET

Query:  VSRQFSRVLRAMCMLGCDVIQGPNMMETPPEILNNPKFNPWFKNCVGAIDGTHVSVWAPASKQTPYRGRKVIVTQNIMCACSFNMLFTFVYTGWEGTAND
        V R F +VL+A+C LGC +I+ PNM E PPEI+ NPKF PWF +CVGAIDGTH+S   PASKQ P+RGRK  +TQNIMCACSF+MLFTFVYTGWEGTAND
Subjt:  VSRQFSRVLRAMCMLGCDVIQGPNMMETPPEILNNPKFNPWFKNCVGAIDGTHVSVWAPASKQTPYRGRKVIVTQNIMCACSFNMLFTFVYTGWEGTAND

Query:  SRVLLDAIGKEGNNFPLPPE-----------------APHCGERYHLRDYRGRGRHPQGPQEFFNYKHSSLRNVIERCFGVLKARFPILKLMPNYPIRKQ
        +RVL+DAI  E N FP+P E                  P+ GERYHLRD+RGR R  +GP E FN++HSSLRNVIERCFGV K+RFPILK MPNYP+R+Q
Subjt:  SRVLLDAIGKEGNNFPLPPE-----------------APHCGERYHLRDYRGRGRHPQGPQEFFNYKHSSLRNVIERCFGVLKARFPILKLMPNYPIRKQ

Query:  RRIPIACCAIHNFIRMTSTRDTLFEEYQVADLEVPDEESLGGTQE--FLDMNLSQAYISRMNGVR
        R IP+ACC +HNFIR+ S  D LF ++   DL V DEES    QE   +D+++S A I  MN VR
Subjt:  RRIPIACCAIHNFIRMTSTRDTLFEEYQVADLEVPDEESLGGTQE--FLDMNLSQAYISRMNGVR

A0A5B6ZZY6 DDE Tnp4 domain-containing protein (Fragment)3.9e-10054.28Show/hide
Query:  YILELLNGHPDRIFYSLRMDKNTFRALCERLRQSNYLVND-KIISIEEAVGMFLLTVCHSTRNRIVAERFQHSKETVSRQFSRVLRAMCMLGCDVIQGPN
        YI ELL+GHP RI+  LRMD  TF +LC  LR      N  + + IEE++ +FLLTV HSTR+R+VAERFQ+S ET++R    V+RA+  LG  +I+   
Subjt:  YILELLNGHPDRIFYSLRMDKNTFRALCERLRQSNYLVND-KIISIEEAVGMFLLTVCHSTRNRIVAERFQHSKETVSRQFSRVLRAMCMLGCDVIQGPN

Query:  MMETPPEILNNPKFNPWFKNCVGAIDGTHVSVWAPASKQTPYRGRKVIVTQNIMCACSFNMLFTFVYTGWEGTANDSRVLLDAIGKEGNNFPLPPE----
          ETPPEILNNPKFNP+F  C+GAIDGTH++ WAPA KQT +RGRKV VTQN++ ACSF++LFTFVY GWEG+ANDSRV L+AI +    FP+PP     
Subjt:  MMETPPEILNNPKFNPWFKNCVGAIDGTHVSVWAPASKQTPYRGRKVIVTQNIMCACSFNMLFTFVYTGWEGTANDSRVLLDAIGKEGNNFPLPPE----

Query:  -------------APHCGERYHLRDYRGRGRHPQGPQEFFNYKHSSLRNVIERCFGVLKARFPILKLMPNYPIRKQRRIPIACCAIHNFIRMTSTRDTLF
                     +P+ GERYH+RD++GRGR P+GP+E FN++HSSLRN IERCFG+LKARFP+LK M NY + +Q  + IACCAIHNF+RM S  D LF
Subjt:  -------------APHCGERYHLRDYRGRGRHPQGPQEFFNYKHSSLRNVIERCFGVLKARFPILKLMPNYPIRKQRRIPIACCAIHNFIRMTSTRDTLF

Query:  EEYQVADLEVPDEESLGGTQEFLDMNLSQAYISRMNGVR
        +E  V D+   +E+  G      D+N+SQ  +++M  +R
Subjt:  EEYQVADLEVPDEESLGGTQEFLDMNLSQAYISRMNGVR

A0A6P4ALU8 uncharacterized protein LOC1074220482.1e-10150.54Show/hide
Query:  DDHDCDSSTSSDEMVGRMLIVTTIVNEYECEISKHPCHTSSLNGHEYILELLNGHPDRIFYSLRMDKNTFRALCERLRQSNYLVNDKIISIEEAVGMFLL
        DD +  SS+SSDE     ++   +    +  + K P  TS L+G  +I ELLNG     +   RMDKN F +LC  L+Q  YL + K + +EEA+ MFL+
Subjt:  DDHDCDSSTSSDEMVGRMLIVTTIVNEYECEISKHPCHTSSLNGHEYILELLNGHPDRIFYSLRMDKNTFRALCERLRQSNYLVNDKIISIEEAVGMFLL

Query:  TVCHSTRNRIVAERFQHSKETVSRQFSRVLRAMCMLGCDVIQGPNMMETPPEILNNPKFNPWFKNCVGAIDGTHVSVWAPASKQTPYRGRKVIVTQNIMC
         + H+   R++A+RFQHS ETV R F+  LRA+C LG ++I   N    P  I+NNPK+ PWF+ C+GAIDGTH+S   PA KQ  YRGRK IVTQN++C
Subjt:  TVCHSTRNRIVAERFQHSKETVSRQFSRVLRAMCMLGCDVIQGPNMMETPPEILNNPKFNPWFKNCVGAIDGTHVSVWAPASKQTPYRGRKVIVTQNIMC

Query:  ACSFNMLFTFVYTGWEGTANDSRVLLDAIGKEGNNFPLPPEA-----------------PHCGERYHLRDYRGRGRHPQGPQEFFNYKHSSLRNVIERCF
        AC+FNM+FTFVY GWEGTANDSRV LDAI +  N FPLP E                  P  GERYHL++Y GRGR P+GP+E FNY+HSSLRNVIERCF
Subjt:  ACSFNMLFTFVYTGWEGTANDSRVLLDAIGKEGNNFPLPPEA-----------------PHCGERYHLRDYRGRGRHPQGPQEFFNYKHSSLRNVIERCF

Query:  GVLKARFPILKLMPNYPIRKQRRIPIACCAIHNFIRMTSTRDTLFEEYQVADLEVPDEE-SLGGTQEFLDMN
        GVLKARF ILK+MP Y   +Q  I IACC +HNFIR ++  D +F +++  + E+ DEE S  G+   +D++
Subjt:  GVLKARFPILKLMPNYPIRKQRRIPIACCAIHNFIRMTSTRDTLFEEYQVADLEVPDEE-SLGGTQEFLDMN

SwissProt top hitse value%identityAlignment
B0BN95 Putative nuclease HARBI11.4e-0928.85Show/hide
Query:  VGAIDGTHVSVWAPASKQTPYRGRKVIVTQNIMCACSFNMLFTFVYTGWEGTANDSRVLLDAIGKEGNNFPLPPEAPHCGE-RYHLRDYRGRGRH-PQGP
        +GA+D  HV++ AP ++   Y  RK + + N +  C        V T W G+  D  VL  +         +P ++   G+  + L  +     H P+ P
Subjt:  VGAIDGTHVSVWAPASKQTPYRGRKVIVTQNIMCACSFNMLFTFVYTGWEGTANDSRVLLDAIGKEGNNFPLPPEAPHCGE-RYHLRDYRGRGRH-PQGP

Query:  QEF-FNYKHSSLRNVIERCFGVLKARFPIL---KLMPNYPIRKQRRIPIACCAIHN
         E+ +N  HS+  +VIE+    L  RF  L   K    Y   K   I +ACC +HN
Subjt:  QEF-FNYKHSSLRNVIERCFGVLKARFPIL---KLMPNYPIRKQRRIPIACCAIHN

Arabidopsis top hitse value%identityAlignment
AT1G43722.1 unknown protein7.8e-3233.56Show/hide
Query:  LIVTTIVNEYECEISKHPCHTSSLNGHEYILELLNGHPDRIFYSLRMDKNTFRALCERLRQSNYLVNDKI-ISIEEAVGMFLLTVCHSTRNRIVAERFQH
        L++   +N Y+    + P       G   I   L          LRM    F  LC  L Q+NY +   + ISIEE+V MFL    H+   R V  RF  
Subjt:  LIVTTIVNEYECEISKHPCHTSSLNGHEYILELLNGHPDRIFYSLRMDKNTFRALCERLRQSNYLVNDKI-ISIEEAVGMFLLTVCHSTRNRIVAERFQH

Query:  SKETVSRQFSRVLRAMCMLGCDVIQGPNMME---TPPEILNNPKFNPWFKNCVGAIDGTHVSVWAPASKQTPYRGRKVIVTQNIMCACSFNMLFTFVYTG
        ++ETV R+F  VL A  +L CD I+ P   E    P  +  + ++ P+F   VGA+DGTHV V      Q  Y  R    + NIM  C   MLFT+++ G
Subjt:  SKETVSRQFSRVLRAMCMLGCDVIQGPNMME---TPPEILNNPKFNPWFKNCVGAIDGTHVSVWAPASKQTPYRGRKVIVTQNIMCACSFNMLFTFVYTG

Query:  WEGTANDSRVLLDAIGKEGNNFPLPPE-----------------APHCGE-----RYHLRDYRGRGRHPQGPQEFFNYKHSSLRNVIERCFGVLK
          G+  D+ VL  A  +  + FPLPP                  AP+        RYH+  +   G  P+   E FN  H+SLR+VIER F + K
Subjt:  WEGTANDSRVLLDAIGKEGNNFPLPPE-----------------APHCGE-----RYHLRDYRGRGRHPQGPQEFFNYKHSSLRNVIERCFGVLK

AT5G28730.1 unknown protein2.5e-1427.96Show/hide
Query:  LRMDKNTFRALCERLRQSNYLVNDKIISIEEAVGMFLLTVCHSTRNRIVAERFQHSKETVSRQFSRVLRAMCMLGCDVIQGPNMMETPPEILN----NPK
        +RM    F  LCE L     L +   IS++E+V +FL+    +   R +A RF H++ET+ R+F  VL+AM  L  + I+ P  +E    I N    + +
Subjt:  LRMDKNTFRALCERLRQSNYLVNDKIISIEEAVGMFLLTVCHSTRNRIVAERFQHSKETVSRQFSRVLRAMCMLGCDVIQGPNMMETPPEILN----NPK

Query:  FNPWFKNCVGAIDGTHVSVWAPASKQTPYRGRKVIVTQNIMCACSFNMLFTFVYTGWEGTANDSRVLLDAIGKEGNNFPLPPEAP-------HCGERYHL
        + P+  + +G                        I + N++  C  +MLFT+ + G  G+ +D+RVL  AI  +   F +PP++        +  +R +L
Subjt:  FNPWFKNCVGAIDGTHVSVWAPASKQTPYRGRKVIVTQNIMCACSFNMLFTFVYTGWEGTANDSRVLLDAIGKEGNNFPLPPEAP-------HCGERYHL

Query:  RDYRGRGRHPQ
          YR   R  Q
Subjt:  RDYRGRGRHPQ

AT5G28950.1 unknown protein2.0e-1942.39Show/hide
Query:  PPEILNNPKFNPWFKNCVGAIDGTHVSVWAPASKQTPYRGRKVIVTQNIMCACSFNMLFTFVYTGWEGTANDSRVLLDAIGKEGNNFPLPPE
        P +I  + +  P+FK+CVGAID TH+       K   +R RK  ++QN++ AC+F++ F +V +GWEG+A+DS+VL DA+ +  N  P+P E
Subjt:  PPEILNNPKFNPWFKNCVGAIDGTHVSVWAPASKQTPYRGRKVIVTQNIMCACSFNMLFTFVYTGWEGTANDSRVLLDAIGKEGNNFPLPPE

AT5G35695.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912)9.2e-2545.97Show/hide
Query:  LFTFVYTGWEGTANDSRVLLDAIGKE-----GNNFPLPPEAPHCGERYHLRDYRGRGRHPQGPQEFFNYKHSSLRNVIERCFGVLKARFPILKLMPNYPI
        +F +V +GWEG+A+DSRVL DA+ K      G    L   AP  G RYHL+++ G+ R P+ P E FN +H SLRNVIER FG+ K+RF I K  P +  
Subjt:  LFTFVYTGWEGTANDSRVLLDAIGKE-----GNNFPLPPEAPHCGERYHLRDYRGRGRHPQGPQEFFNYKHSSLRNVIERCFGVLKARFPILKLMPNYPI

Query:  RKQRRIPIACCAIHNFIRMTSTRD
        +KQ  + + C A+HNF+R     D
Subjt:  RKQRRIPIACCAIHNFIRMTSTRD

AT5G41980.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912)2.0e-4833.85Show/hide
Query:  ISKHPCHTSSLNGHEYILELLNGHPDRIFYSLRMDKNTFRALCERLRQSNYLVNDKIISIEEAVGMFLLTVCHSTRNRIVAERFQHSKETVSRQFSRVLR
        + K     S  +G++++ ++LNG  ++ F + RMDK  F  LC+ L+    L +   I IE  + +FL  + H+ R R V E F +S ET+SR F+ VL 
Subjt:  ISKHPCHTSSLNGHEYILELLNGHPDRIFYSLRMDKNTFRALCERLRQSNYLVNDKIISIEEAVGMFLLTVCHSTRNRIVAERFQHSKETVSRQFSRVLR

Query:  AMCMLGCDVIQGPNMMETPPEILNNPKFNPWFKNCVGAIDGTHVSVWAPASKQTPYRGRKVIVTQNIMCACSFNMLFTFVYTGWEGTANDSRVLLDAIGK
        A+  +  D  Q PN   +  + L N   +P+FK+CVG +D  H+ V     +Q P+R    ++TQN++ A SF++ F +V  GWEG+A+D +VL  A+ +
Subjt:  AMCMLGCDVIQGPNMMETPPEILNNPKFNPWFKNCVGAIDGTHVSVWAPASKQTPYRGRKVIVTQNIMCACSFNMLFTFVYTGWEGTANDSRVLLDAIGK

Query:  EGNNFPLP-----------PEAPHCGERYHLRDYRGRGRHPQGPQEFFNYKHSSLRNVIERCFGVLKARFPILKLMPNYPIRKQRRIPIACCAIHNFIRM
          N   +P           P  P     YH      R    +  +E FN +H  L   I R FG LK RFPIL   P YP++ Q ++ IA CA+HN++R+
Subjt:  EGNNFPLP-----------PEAPHCGERYHLRDYRGRGRHPQGPQEFFNYKHSSLRNVIERCFGVLKARFPILKLMPNYPIRKQRRIPIACCAIHNFIRM

Query:  TSTRDTLFEEYQVADLEVPDEE
            D +F  ++   L    E+
Subjt:  TSTRDTLFEEYQVADLEVPDEE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCACTATTCATCGACTGAGGTACCTCCTAACTCTGATGAAGAGAGAGACTTAGAGAGGGAATTTGTTTCATCTGGGGCTCATGTTGAGTTAGGTGTCGATGATGAAGA
AACTGAGATTCATGGAGATTGGGTTGGCTCATCATTGGATAGTCGTGCTTATAAGACTGCACGAACTGCTCGACAAGCCATTTTAGATGATGCTTTAAAAGTATGGACTA
AGTACTATGGCACAAATGATGATCATGATTGTGACTCATCGACATCTTCTGATGAGATGGTTGGTCGAATGCTTATAGTCACTACAATTGTCAATGAATATGAATGTGAA
ATTTCTAAACACCCATGCCATACATCTTCATTAAATGGACATGAATATATATTGGAGTTGTTGAATGGACATCCTGATAGAATTTTTTATTCTTTACGAATGGATAAAAA
TACATTTAGAGCTTTATGTGAGAGATTAAGACAATCAAATTATTTAGTGAATGATAAGATTATCAGTATTGAGGAAGCAGTTGGAATGTTCTTACTCACAGTATGTCATA
GCACACGTAATAGAATTGTAGCTGAACGATTTCAACACTCTAAAGAGACTGTGTCTCGACAATTCTCTAGAGTTTTAAGAGCTATGTGTATGTTGGGATGTGATGTTATC
CAAGGTCCAAATATGATGGAAACTCCACCTGAAATTTTGAACAATCCCAAGTTTAATCCATGGTTCAAGAATTGTGTTGGTGCAATTGATGGAACTCACGTAAGTGTGTG
GGCCCCTGCATCAAAACAAACACCATATCGTGGAAGAAAGGTTATTGTGACTCAAAATATTATGTGTGCATGCTCATTCAATATGTTGTTCACCTTTGTCTATACTGGTT
GGGAAGGTACTGCTAATGATTCTAGAGTATTATTGGATGCTATTGGTAAGGAAGGGAATAATTTTCCGTTACCACCTGAAGCTCCACATTGTGGTGAGAGATATCATTTA
AGAGATTATAGAGGAAGAGGAAGACATCCACAAGGACCACAAGAATTTTTTAATTATAAACACTCTTCATTACGCAATGTGATTGAACGTTGCTTTGGTGTACTTAAAGC
TCGATTCCCCATCTTAAAATTGATGCCAAACTACCCAATTAGAAAGCAACGTAGAATTCCTATTGCTTGTTGTGCAATACATAATTTTATTAGAATGACTTCAACCAGAG
ATACCTTATTTGAAGAATATCAAGTTGCTGATTTAGAAGTACCTGATGAAGAAAGCTTGGGTGGAACACAAGAATTTCTTGATATGAACTTAAGTCAAGCTTATATAAGC
CGAATGAATGGTGTGAGGATGATATTGTCGGTACCATGTGGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGCACTATTCATCGACTGAGGTACCTCCTAACTCTGATGAAGAGAGAGACTTAGAGAGGGAATTTGTTTCATCTGGGGCTCATGTTGAGTTAGGTGTCGATGATGAAGA
AACTGAGATTCATGGAGATTGGGTTGGCTCATCATTGGATAGTCGTGCTTATAAGACTGCACGAACTGCTCGACAAGCCATTTTAGATGATGCTTTAAAAGTATGGACTA
AGTACTATGGCACAAATGATGATCATGATTGTGACTCATCGACATCTTCTGATGAGATGGTTGGTCGAATGCTTATAGTCACTACAATTGTCAATGAATATGAATGTGAA
ATTTCTAAACACCCATGCCATACATCTTCATTAAATGGACATGAATATATATTGGAGTTGTTGAATGGACATCCTGATAGAATTTTTTATTCTTTACGAATGGATAAAAA
TACATTTAGAGCTTTATGTGAGAGATTAAGACAATCAAATTATTTAGTGAATGATAAGATTATCAGTATTGAGGAAGCAGTTGGAATGTTCTTACTCACAGTATGTCATA
GCACACGTAATAGAATTGTAGCTGAACGATTTCAACACTCTAAAGAGACTGTGTCTCGACAATTCTCTAGAGTTTTAAGAGCTATGTGTATGTTGGGATGTGATGTTATC
CAAGGTCCAAATATGATGGAAACTCCACCTGAAATTTTGAACAATCCCAAGTTTAATCCATGGTTCAAGAATTGTGTTGGTGCAATTGATGGAACTCACGTAAGTGTGTG
GGCCCCTGCATCAAAACAAACACCATATCGTGGAAGAAAGGTTATTGTGACTCAAAATATTATGTGTGCATGCTCATTCAATATGTTGTTCACCTTTGTCTATACTGGTT
GGGAAGGTACTGCTAATGATTCTAGAGTATTATTGGATGCTATTGGTAAGGAAGGGAATAATTTTCCGTTACCACCTGAAGCTCCACATTGTGGTGAGAGATATCATTTA
AGAGATTATAGAGGAAGAGGAAGACATCCACAAGGACCACAAGAATTTTTTAATTATAAACACTCTTCATTACGCAATGTGATTGAACGTTGCTTTGGTGTACTTAAAGC
TCGATTCCCCATCTTAAAATTGATGCCAAACTACCCAATTAGAAAGCAACGTAGAATTCCTATTGCTTGTTGTGCAATACATAATTTTATTAGAATGACTTCAACCAGAG
ATACCTTATTTGAAGAATATCAAGTTGCTGATTTAGAAGTACCTGATGAAGAAAGCTTGGGTGGAACACAAGAATTTCTTGATATGAACTTAAGTCAAGCTTATATAAGC
CGAATGAATGGTGTGAGGATGATATTGTCGGTACCATGTGGTTAG
Protein sequenceShow/hide protein sequence
MHYSSTEVPPNSDEERDLEREFVSSGAHVELGVDDEETEIHGDWVGSSLDSRAYKTARTARQAILDDALKVWTKYYGTNDDHDCDSSTSSDEMVGRMLIVTTIVNEYECE
ISKHPCHTSSLNGHEYILELLNGHPDRIFYSLRMDKNTFRALCERLRQSNYLVNDKIISIEEAVGMFLLTVCHSTRNRIVAERFQHSKETVSRQFSRVLRAMCMLGCDVI
QGPNMMETPPEILNNPKFNPWFKNCVGAIDGTHVSVWAPASKQTPYRGRKVIVTQNIMCACSFNMLFTFVYTGWEGTANDSRVLLDAIGKEGNNFPLPPEAPHCGERYHL
RDYRGRGRHPQGPQEFFNYKHSSLRNVIERCFGVLKARFPILKLMPNYPIRKQRRIPIACCAIHNFIRMTSTRDTLFEEYQVADLEVPDEESLGGTQEFLDMNLSQAYIS
RMNGVRMILSVPCG