CuGenDBv2

MS011709 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS011709
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: DDE Tnp4 domain-containing protein
Genome location: scaffold11:102666..104033
RNA-Seq Expression: MS011709
Synteny: MS011709
Gene Ontology terms:
GO:0035098 - ESC/E(Z) complex (cellular component)
GO:0035102 - PRC1 complex (cellular component)
GO:0003682 - chromatin binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domains: IPR027806 - Harbinger transposase-derived nuclease domain
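
As a quick consistency check, the genome location can be parsed and its span compared with the length of the CDS listed under Sequences. This is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state the convention); `parse_location` and `span_length` are hypothetical helpers, not part of any database API.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of a closed interval (assumption: 1-based, inclusive coordinates)."""
    return end - start + 1


seqid, start, end = parse_location("scaffold11:102666..104033")
print(seqid, span_length(start, end))  # scaffold11 1368
```

Under that assumption the span is 1368 bp, which appears to match the length of the CDS sequence in this record and would be consistent with a single-exon gene model.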


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAG6586365.1 Protein ALP1-like protein, partial [Cucurbita argyrosperma subsp. sororia] | e value: 2.3e-219 | % identity: 84.93
Query:  MAAGGVGGDRRVTRNAAVNAAA--TKSKAKKSDRQSHLNQQLVTLIETTISSAHSFLSHNDLHLLPSQTLALESLISSTSSSLQALAPRLPKLALHPPPR
        MAAGG  GD+R TR++A+NA A  T+SKAKKSDR++HL  QLVTLIETTISSAHSFLS NDLHLLPSQTLALES I STSSSLQAL+P LPKL+LH    
Subjt:  MAAGGVGGDRRVTRNAAVNAAA--TKSKAKKSDRQSHLNQQLVTLIETTISSAHSFLSHNDLHLLPSQTLALESLISSTSSSLQALAPRLPKLALHPPPR

Query:  PPPRLPPPPPPRQCWFQRFLSATSEVDCDPRWNFSFRMSKSSFSLLLRLLSPIESSSSPSVAPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYA
                PPPRQCWFQRFLSAT+EVDCDPRWN  FRMSKSSFSLLLRLLSPI+SSSS SV PDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYA
Subjt:  PPPRLPPPPPPRQCWFQRFLSATSEVDCDPRWNFSFRMSKSSFSLLLRLLSPIESSSSPSVAPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYA

Query:  VCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGIAGEMLGKNGSLLVQALVDAEGRFLDVSAGWPSTVIPETILRQSKLYAEIEKSGEL
        VCKAIN+KLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFG+ G++LGK+GSLLVQALVDAEGRFLDVSAGWPS++ PETILRQSKLYAEIEKSGEL
Subjt:  VCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGIAGEMLGKNGSLLVQALVDAEGRFLDVSAGWPSTVIPETILRQSKLYAEIEKSGEL

Query:  LKGPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYVKLKEEESSGFPERAFNSTHNRAMGLVNTAFCRLQARWKLLSKPWKEGCRDFFPFIVLTGCLLHNF
        LKGPVYNLDD KPI QYLIGDSCFPLLPWLLTPY+KL EE+SSGFPERAFNSTHNRAMGLVNTAFC+++ARWKLLSKPWKE CRDFFPFIVLTGCLLHNF
Subjt:  LKGPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYVKLKEEESSGFPERAFNSTHNRAMGLVNTAFCRLQARWKLLSKPWKEGCRDFFPFIVLTGCLLHNF

Query:  LIKCSEKLEGEDREGEEEANCLSEEQKFPLYDGEIGDDRGKDIRDALAVHLSRLSFRR
        LIKCSEKLE E+++ ++ A+C SEEQKFPLYDGE GDDRGKDIRDALA+HLSRLSFRR
Subjt:  LIKCSEKLEGEDREGEEEANCLSEEQKFPLYDGEIGDDRGKDIRDALAVHLSRLSFRR

XP_022143194.1 protein ALP1-like [Momordica charantia] | e value: 7.1e-261 | % identity: 100
Query:  MAAGGVGGDRRVTRNAAVNAAATKSKAKKSDRQSHLNQQLVTLIETTISSAHSFLSHNDLHLLPSQTLALESLISSTSSSLQALAPRLPKLALHPPPRPP
        MAAGGVGGDRRVTRNAAVNAAATKSKAKKSDRQSHLNQQLVTLIETTISSAHSFLSHNDLHLLPSQTLALESLISSTSSSLQALAPRLPKLALHPPPRPP
Subjt:  MAAGGVGGDRRVTRNAAVNAAATKSKAKKSDRQSHLNQQLVTLIETTISSAHSFLSHNDLHLLPSQTLALESLISSTSSSLQALAPRLPKLALHPPPRPP

Query:  PRLPPPPPPRQCWFQRFLSATSEVDCDPRWNFSFRMSKSSFSLLLRLLSPIESSSSPSVAPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVC
        PRLPPPPPPRQCWFQRFLSATSEVDCDPRWNFSFRMSKSSFSLLLRLLSPIESSSSPSVAPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVC
Subjt:  PRLPPPPPPRQCWFQRFLSATSEVDCDPRWNFSFRMSKSSFSLLLRLLSPIESSSSPSVAPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVC

Query:  KAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGIAGEMLGKNGSLLVQALVDAEGRFLDVSAGWPSTVIPETILRQSKLYAEIEKSGELLK
        KAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGIAGEMLGKNGSLLVQALVDAEGRFLDVSAGWPSTVIPETILRQSKLYAEIEKSGELLK
Subjt:  KAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGIAGEMLGKNGSLLVQALVDAEGRFLDVSAGWPSTVIPETILRQSKLYAEIEKSGELLK

Query:  GPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYVKLKEEESSGFPERAFNSTHNRAMGLVNTAFCRLQARWKLLSKPWKEGCRDFFPFIVLTGCLLHNFLI
        GPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYVKLKEEESSGFPERAFNSTHNRAMGLVNTAFCRLQARWKLLSKPWKEGCRDFFPFIVLTGCLLHNFLI
Subjt:  GPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYVKLKEEESSGFPERAFNSTHNRAMGLVNTAFCRLQARWKLLSKPWKEGCRDFFPFIVLTGCLLHNFLI

Query:  KCSEKLEGEDREGEEEANCLSEEQKFPLYDGEIGDDRGKDIRDALAVHLSRLSFRR
        KCSEKLEGEDREGEEEANCLSEEQKFPLYDGEIGDDRGKDIRDALAVHLSRLSFRR
Subjt:  KCSEKLEGEDREGEEEANCLSEEQKFPLYDGEIGDDRGKDIRDALAVHLSRLSFRR

XP_022938170.1 protein ALP1-like [Cucurbita moschata] | e value: 1.9e-218 | % identity: 84.72
Query:  MAAGGVGGDRRVTRNAAVNAAA--TKSKAKKSDRQSHLNQQLVTLIETTISSAHSFLSHNDLHLLPSQTLALESLISSTSSSLQALAPRLPKLALHPPPR
        MAAGG  GD+R TR++A+NA A  T+SKAKKSDR++HL  QLVTLIETTISSAHSFLS NDLHLLPSQTLALES I STSSSLQAL+P LPKL+LH    
Subjt:  MAAGGVGGDRRVTRNAAVNAAA--TKSKAKKSDRQSHLNQQLVTLIETTISSAHSFLSHNDLHLLPSQTLALESLISSTSSSLQALAPRLPKLALHPPPR

Query:  PPPRLPPPPPPRQCWFQRFLSATSEVDCDPRWNFSFRMSKSSFSLLLRLLSPIESSSSPSVAPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYA
                PPPRQCWFQRFLSAT+EVDCDPRWN  FRMSKSSFSLLLRLLSPI+SSSS SV PDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYA
Subjt:  PPPRLPPPPPPRQCWFQRFLSATSEVDCDPRWNFSFRMSKSSFSLLLRLLSPIESSSSPSVAPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYA

Query:  VCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGIAGEMLGKNGSLLVQALVDAEGRFLDVSAGWPSTVIPETILRQSKLYAEIEKSGEL
        VCKAIN+KLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFG+ G++LGK+GSLLVQALVDAEGRFLDVSAGWPS++ PETILRQSKLYAEIEKSGEL
Subjt:  VCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGIAGEMLGKNGSLLVQALVDAEGRFLDVSAGWPSTVIPETILRQSKLYAEIEKSGEL

Query:  LKGPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYVKLKEEESSGFPERAFNSTHNRAMGLVNTAFCRLQARWKLLSKPWKEGCRDFFPFIVLTGCLLHNF
        LKGPVYNLDD KPI QYLIGDSCFPLLPWLLTPY+KL EE+SSGFPERAFNSTHNRAMGLVNTAFC+++ARWKLLSKPWKE CRDFFPFIVLTGCLLHNF
Subjt:  LKGPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYVKLKEEESSGFPERAFNSTHNRAMGLVNTAFCRLQARWKLLSKPWKEGCRDFFPFIVLTGCLLHNF

Query:  LIKCSEKLEGEDREGEEEANCLSEEQKFPLYDGEIGDDRGKDIRDALAVHLSRLSFRR
        LIKCSEKLE E+++ ++ A+C SEEQKF LYDGE GDDRGKDIRDALA+HLSRLSFRR
Subjt:  LIKCSEKLEGEDREGEEEANCLSEEQKFPLYDGEIGDDRGKDIRDALAVHLSRLSFRR

XP_022965738.1 protein ALP1-like [Cucurbita maxima] | e value: 1.9e-218 | % identity: 84.72
Query:  MAAGGVGGDRRVTRNAAVNAAA--TKSKAKKSDRQSHLNQQLVTLIETTISSAHSFLSHNDLHLLPSQTLALESLISSTSSSLQALAPRLPKLALHPPPR
        MAAGG  GD+R TR++A+NA A  T+SKAKKSDR++HL  QLVTLIETTISSAHSFLS NDLHLLPSQTLALES I STSSSLQAL+P LPKL+LH    
Subjt:  MAAGGVGGDRRVTRNAAVNAAA--TKSKAKKSDRQSHLNQQLVTLIETTISSAHSFLSHNDLHLLPSQTLALESLISSTSSSLQALAPRLPKLALHPPPR

Query:  PPPRLPPPPPPRQCWFQRFLSATSEVDCDPRWNFSFRMSKSSFSLLLRLLSPIESSSSPSVAPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYA
                PPPRQCWFQRFLSAT+EVDCDPRWN  FRMSKSSFSLLLRLLSPI+SSSS SV PDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYA
Subjt:  PPPRLPPPPPPRQCWFQRFLSATSEVDCDPRWNFSFRMSKSSFSLLLRLLSPIESSSSPSVAPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYA

Query:  VCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGIAGEMLGKNGSLLVQALVDAEGRFLDVSAGWPSTVIPETILRQSKLYAEIEKSGEL
        VCKAIN+KLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFG+ G++LGK+GSLLVQALVDAEGRFLDVSAGWPS++ PETILRQSKLYAEIEKSGEL
Subjt:  VCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGIAGEMLGKNGSLLVQALVDAEGRFLDVSAGWPSTVIPETILRQSKLYAEIEKSGEL

Query:  LKGPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYVKLKEEESSGFPERAFNSTHNRAMGLVNTAFCRLQARWKLLSKPWKEGCRDFFPFIVLTGCLLHNF
        LKGPVYNLDD KPI QYLIGDSCFPLLPWLLTPY+KL EE+SSGFPERAFNSTHNRAMGLVNTAFC+++ARWKLLSKPWKE CRDFFPF+VLTGCLLHNF
Subjt:  LKGPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYVKLKEEESSGFPERAFNSTHNRAMGLVNTAFCRLQARWKLLSKPWKEGCRDFFPFIVLTGCLLHNF

Query:  LIKCSEKLEGEDREGEEEANCLSEEQKFPLYDGEIGDDRGKDIRDALAVHLSRLSFRR
        LIKCSEKLE E+++ E+ A+  SEEQKFPLYDGE GDDRGKDIRDALA+HLSRLSFRR
Subjt:  LIKCSEKLEGEDREGEEEANCLSEEQKFPLYDGEIGDDRGKDIRDALAVHLSRLSFRR

XP_038890100.1 protein ALP1-like [Benincasa hispida] | e value: 4.2e-221 | % identity: 85.06
Query:  MAAGGVGGDRRVTRNAAVNA------AATKSKAKKSDRQSHLNQQLVTLIETTISSAHSFLSHNDLHLLPSQTLALESLISSTSSSLQALAPRLPKLALH
        MA  G+GGD+R TR++++NA      A T+SKAKK DR+SHL  QLVTLI+TTISSAHSFLS NDLHLLPSQTLALESL+ STSSSL AL+PRLPKL L 
Subjt:  MAAGGVGGDRRVTRNAAVNA------AATKSKAKKSDRQSHLNQQLVTLIETTISSAHSFLSHNDLHLLPSQTLALESLISSTSSSLQALAPRLPKLALH

Query:  PPPRPPPRLPPPPPPRQCWFQRFLSATSEVDCDPRWNFSFRMSKSSFSLLLRLLSPIESSSSPSVAPDCALAAALFRLAHGASYKAVGRRFGIDSADACR
        PPP      PPPPPPRQCWFQRFLSATS+VDCDPRWN SFRMSKSSFSLLLRLLSPI+SSSS SV PDCALAAALFRLAHGASYKAVGRRFGIDSADACR
Subjt:  PPPRPPPRLPPPPPPRQCWFQRFLSATSEVDCDPRWNFSFRMSKSSFSLLLRLLSPIESSSSPSVAPDCALAAALFRLAHGASYKAVGRRFGIDSADACR

Query:  SFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGIAGEMLGKNGSLLVQALVDAEGRFLDVSAGWPSTVIPETILRQSKLYAEIEK
        SFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFG+  E+LGKNGSLLVQALVDAEGRFLDVSAGWPS++ PETILRQSKLY EIEK
Subjt:  SFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGIAGEMLGKNGSLLVQALVDAEGRFLDVSAGWPSTVIPETILRQSKLYAEIEK

Query:  SGELLKGPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYVKLKEEESSGFPERAFNSTHNRAMGLVNTAFCRLQARWKLLSKPWKEGCRDFFPFIVLTGCL
        S ELLKGPVYNLDD+KPIPQYLIGDSCFPLLPWLLTPY+KL EE+SSGFPERAFNSTHNRAM LVNTAF RL+ARWKLLSKPWKEGCRDFFPFIVLTGCL
Subjt:  SGELLKGPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYVKLKEEESSGFPERAFNSTHNRAMGLVNTAFCRLQARWKLLSKPWKEGCRDFFPFIVLTGCL

Query:  LHNFLIKCSEKLEGEDREGEEEANCLSEEQKFPLYDGEIGDDRGKDIRDALAVHLSRLSFRR
        LHNFLIKCSEKL+ E+++ EEEA C SE+QKFPLYDG+IGDDRGKDIRDALA+HLS LS+RR
Subjt:  LHNFLIKCSEKLEGEDREGEEEANCLSEEQKFPLYDGEIGDDRGKDIRDALAVHLSRLSFRR
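
The % identity column can be approximated from an alignment block by counting the midline positions that carry a letter (an exact match) over the total number of aligned columns. This is a minimal sketch, assuming the BLAST-style convention that the midline shows a letter for an identity, `+` for a conservative substitution, and a space otherwise; `midline_identity` is a hypothetical helper, and the database may compute its reported values differently.

```python
def midline_identity(query: str, midline: str) -> float:
    """Percent of aligned columns whose midline character is a letter."""
    if not query:
        raise ValueError("empty alignment")
    # Pad or trim the midline to the query width: trailing spaces are
    # often lost when alignment text is copied.
    midline = midline.ljust(len(query))[: len(query)]
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(query)


# Toy columns for illustration only, not taken from the record above.
print(midline_identity("MAAGGVGG", "MA GG+GG"))  # 75.0
```

For a full hit, the query and midline strings of every block would be concatenated before calling the function, so that gap columns in the query count toward the alignment length.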

TrEMBL top hits (columns: e value, % identity, alignment)
A0A0A0LFB5 DDE Tnp4 domain-containing protein | e value: 1.1e-211 | % identity: 82.79
Query:  MAAGGVGGDRRVTRNAAVNAAA---TKSKAKKSDRQSHLNQQLVTLIETTISSAHSFLSHNDLHLLPSQTLALESLISSTSSSLQALAPRLPKLALHPPP
        MA  G+ GD+R TR++A+NAAA   T+SKAKK D+++HLN QL+TLIETTISSAHSFLS NDLHLLPSQTLALESL+ STSSSL AL+PRLPKL+L    
Subjt:  MAAGGVGGDRRVTRNAAVNAAA---TKSKAKKSDRQSHLNQQLVTLIETTISSAHSFLSHNDLHLLPSQTLALESLISSTSSSLQALAPRLPKLALHPPP

Query:  RPPPRLPPPPPPRQCWFQRFLSATSEVDCDPRWNFSFRMSKSSFSLLLRLLSPIESSSSPSVAPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFY
              PP PPPRQCWFQRFLSATS+VDCDPRWN SFRMSKSSFSLLLRLLSPI+SS S SV PDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFY
Subjt:  RPPPRLPPPPPPRQCWFQRFLSATSEVDCDPRWNFSFRMSKSSFSLLLRLLSPIESSSSPSVAPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFY

Query:  AVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGIAGEMLGKNGSLLVQALVDAEGRFLDVSAGWPSTVIPETILRQSKLYAEIEKSGE
        AVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFG  GE+  KNGSLLVQALVDAEGRFLDVSAGWPS++ P TILRQSKLYAEIEKS E
Subjt:  AVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGIAGEMLGKNGSLLVQALVDAEGRFLDVSAGWPSTVIPETILRQSKLYAEIEKSGE

Query:  LLKGPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYVKLKEEESSGFPERAFNSTHNRAMGLVNTAFCRLQARWKLLSKPWKEGCRDFFPFIVLTGCLLHN
        LLKGPVYNLD+EKPIPQYLIGDSCFPLLPWLLTPY++L EE+SSGF  RAFNSTH RAM LVNTAFCRL+ARWKLLSKPWKEGCRDFFPFI+LTGCLL N
Subjt:  LLKGPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYVKLKEEESSGFPERAFNSTHNRAMGLVNTAFCRLQARWKLLSKPWKEGCRDFFPFIVLTGCLLHN

Query:  FLIKCSEKLEGEDREGEEEANCLSEEQKFPLYDGEIGDDRGKDIRDALAVHLSRLSFRR
        FLIKCSEKL+ E+++ EE A+C SEEQKFPL+DGEIGD RGKDIRDALA+HLS L++RR
Subjt:  FLIKCSEKLEGEDREGEEEANCLSEEQKFPLYDGEIGDDRGKDIRDALAVHLSRLSFRR

A0A5D3BH79 Putative nuclease HARBI1 | e value: 3.6e-210 | % identity: 82.53
Query:  MAAGGVGGDRRVTRNAAVNAAA--TKSKAKKSDRQSHLNQQLVTLIETTISSAHSFLSHNDLHLLPSQTLALESLISSTSSSLQALAPRLPKLALHPPPR
        MA  G+ GD+R TR++A+NAAA  T+SKAKK D+++HLN QL+TLIETTISSA SFLS NDLHLLPSQTLALESL+ STSSSL AL+PRLPKL+L     
Subjt:  MAAGGVGGDRRVTRNAAVNAAA--TKSKAKKSDRQSHLNQQLVTLIETTISSAHSFLSHNDLHLLPSQTLALESLISSTSSSLQALAPRLPKLALHPPPR

Query:  PPPRLPPPPPPRQCWFQRFLSATSEVDCDPRWNFSFRMSKSSFSLLLRLLSPIESSSSPSVAPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYA
              P PPPRQCWFQRFLSATS+VDCDPRWN SFRMSKSSFSLLLRLLSPI+S SS SV PDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYA
Subjt:  PPPRLPPPPPPRQCWFQRFLSATSEVDCDPRWNFSFRMSKSSFSLLLRLLSPIESSSSPSVAPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYA

Query:  VCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGIAGEMLGKNGSLLVQALVDAEGRFLDVSAGWPSTVIPETILRQSKLYAEIEKSGEL
        VCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFG  GE+  KNGSLLVQALVDAEGRFLDVSAGWPS++ P TILRQSKLY EIEKS EL
Subjt:  VCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGIAGEMLGKNGSLLVQALVDAEGRFLDVSAGWPSTVIPETILRQSKLYAEIEKSGEL

Query:  LKGPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYVKLKEEESSGFPERAFNSTHNRAMGLVNTAFCRLQARWKLLSKPWKEGCRDFFPFIVLTGCLLHNF
        LKGPVYNLDDEKPIPQYLIGDSCFPL PWLLTPY++L EE+SSGF ERAFNSTH RAM LVNTAFCRL+ARWKLLSKPWKEGCRDFFPFI+LTGCLL NF
Subjt:  LKGPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYVKLKEEESSGFPERAFNSTHNRAMGLVNTAFCRLQARWKLLSKPWKEGCRDFFPFIVLTGCLLHNF

Query:  LIKCSEKLEGEDREGEEEANCLSEEQKFPLYDGEIGDDRGKDIRDALAVHLSRLSFRR
        LIKCSEKL+ E+++ EE A+C SEEQKFP +DGEIGD RGKDIRDALA+HLS LS+RR
Subjt:  LIKCSEKLEGEDREGEEEANCLSEEQKFPLYDGEIGDDRGKDIRDALAVHLSRLSFRR

A0A6J1CNL4 protein ALP1-like | e value: 3.4e-261 | % identity: 100
Query:  MAAGGVGGDRRVTRNAAVNAAATKSKAKKSDRQSHLNQQLVTLIETTISSAHSFLSHNDLHLLPSQTLALESLISSTSSSLQALAPRLPKLALHPPPRPP
        MAAGGVGGDRRVTRNAAVNAAATKSKAKKSDRQSHLNQQLVTLIETTISSAHSFLSHNDLHLLPSQTLALESLISSTSSSLQALAPRLPKLALHPPPRPP
Subjt:  MAAGGVGGDRRVTRNAAVNAAATKSKAKKSDRQSHLNQQLVTLIETTISSAHSFLSHNDLHLLPSQTLALESLISSTSSSLQALAPRLPKLALHPPPRPP

Query:  PRLPPPPPPRQCWFQRFLSATSEVDCDPRWNFSFRMSKSSFSLLLRLLSPIESSSSPSVAPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVC
        PRLPPPPPPRQCWFQRFLSATSEVDCDPRWNFSFRMSKSSFSLLLRLLSPIESSSSPSVAPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVC
Subjt:  PRLPPPPPPRQCWFQRFLSATSEVDCDPRWNFSFRMSKSSFSLLLRLLSPIESSSSPSVAPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVC

Query:  KAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGIAGEMLGKNGSLLVQALVDAEGRFLDVSAGWPSTVIPETILRQSKLYAEIEKSGELLK
        KAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGIAGEMLGKNGSLLVQALVDAEGRFLDVSAGWPSTVIPETILRQSKLYAEIEKSGELLK
Subjt:  KAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGIAGEMLGKNGSLLVQALVDAEGRFLDVSAGWPSTVIPETILRQSKLYAEIEKSGELLK

Query:  GPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYVKLKEEESSGFPERAFNSTHNRAMGLVNTAFCRLQARWKLLSKPWKEGCRDFFPFIVLTGCLLHNFLI
        GPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYVKLKEEESSGFPERAFNSTHNRAMGLVNTAFCRLQARWKLLSKPWKEGCRDFFPFIVLTGCLLHNFLI
Subjt:  GPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYVKLKEEESSGFPERAFNSTHNRAMGLVNTAFCRLQARWKLLSKPWKEGCRDFFPFIVLTGCLLHNFLI

Query:  KCSEKLEGEDREGEEEANCLSEEQKFPLYDGEIGDDRGKDIRDALAVHLSRLSFRR
        KCSEKLEGEDREGEEEANCLSEEQKFPLYDGEIGDDRGKDIRDALAVHLSRLSFRR
Subjt:  KCSEKLEGEDREGEEEANCLSEEQKFPLYDGEIGDDRGKDIRDALAVHLSRLSFRR

A0A6J1FIY2 protein ALP1-like | e value: 9.4e-219 | % identity: 84.72
Query:  MAAGGVGGDRRVTRNAAVNAAA--TKSKAKKSDRQSHLNQQLVTLIETTISSAHSFLSHNDLHLLPSQTLALESLISSTSSSLQALAPRLPKLALHPPPR
        MAAGG  GD+R TR++A+NA A  T+SKAKKSDR++HL  QLVTLIETTISSAHSFLS NDLHLLPSQTLALES I STSSSLQAL+P LPKL+LH    
Subjt:  MAAGGVGGDRRVTRNAAVNAAA--TKSKAKKSDRQSHLNQQLVTLIETTISSAHSFLSHNDLHLLPSQTLALESLISSTSSSLQALAPRLPKLALHPPPR

Query:  PPPRLPPPPPPRQCWFQRFLSATSEVDCDPRWNFSFRMSKSSFSLLLRLLSPIESSSSPSVAPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYA
                PPPRQCWFQRFLSAT+EVDCDPRWN  FRMSKSSFSLLLRLLSPI+SSSS SV PDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYA
Subjt:  PPPRLPPPPPPRQCWFQRFLSATSEVDCDPRWNFSFRMSKSSFSLLLRLLSPIESSSSPSVAPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYA

Query:  VCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGIAGEMLGKNGSLLVQALVDAEGRFLDVSAGWPSTVIPETILRQSKLYAEIEKSGEL
        VCKAIN+KLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFG+ G++LGK+GSLLVQALVDAEGRFLDVSAGWPS++ PETILRQSKLYAEIEKSGEL
Subjt:  VCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGIAGEMLGKNGSLLVQALVDAEGRFLDVSAGWPSTVIPETILRQSKLYAEIEKSGEL

Query:  LKGPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYVKLKEEESSGFPERAFNSTHNRAMGLVNTAFCRLQARWKLLSKPWKEGCRDFFPFIVLTGCLLHNF
        LKGPVYNLDD KPI QYLIGDSCFPLLPWLLTPY+KL EE+SSGFPERAFNSTHNRAMGLVNTAFC+++ARWKLLSKPWKE CRDFFPFIVLTGCLLHNF
Subjt:  LKGPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYVKLKEEESSGFPERAFNSTHNRAMGLVNTAFCRLQARWKLLSKPWKEGCRDFFPFIVLTGCLLHNF

Query:  LIKCSEKLEGEDREGEEEANCLSEEQKFPLYDGEIGDDRGKDIRDALAVHLSRLSFRR
        LIKCSEKLE E+++ ++ A+C SEEQKF LYDGE GDDRGKDIRDALA+HLSRLSFRR
Subjt:  LIKCSEKLEGEDREGEEEANCLSEEQKFPLYDGEIGDDRGKDIRDALAVHLSRLSFRR

A0A6J1HRT9 protein ALP1-like | e value: 9.4e-219 | % identity: 84.72
Query:  MAAGGVGGDRRVTRNAAVNAAA--TKSKAKKSDRQSHLNQQLVTLIETTISSAHSFLSHNDLHLLPSQTLALESLISSTSSSLQALAPRLPKLALHPPPR
        MAAGG  GD+R TR++A+NA A  T+SKAKKSDR++HL  QLVTLIETTISSAHSFLS NDLHLLPSQTLALES I STSSSLQAL+P LPKL+LH    
Subjt:  MAAGGVGGDRRVTRNAAVNAAA--TKSKAKKSDRQSHLNQQLVTLIETTISSAHSFLSHNDLHLLPSQTLALESLISSTSSSLQALAPRLPKLALHPPPR

Query:  PPPRLPPPPPPRQCWFQRFLSATSEVDCDPRWNFSFRMSKSSFSLLLRLLSPIESSSSPSVAPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYA
                PPPRQCWFQRFLSAT+EVDCDPRWN  FRMSKSSFSLLLRLLSPI+SSSS SV PDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYA
Subjt:  PPPRLPPPPPPRQCWFQRFLSATSEVDCDPRWNFSFRMSKSSFSLLLRLLSPIESSSSPSVAPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYA

Query:  VCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGIAGEMLGKNGSLLVQALVDAEGRFLDVSAGWPSTVIPETILRQSKLYAEIEKSGEL
        VCKAIN+KLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFG+ G++LGK+GSLLVQALVDAEGRFLDVSAGWPS++ PETILRQSKLYAEIEKSGEL
Subjt:  VCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGIAGEMLGKNGSLLVQALVDAEGRFLDVSAGWPSTVIPETILRQSKLYAEIEKSGEL

Query:  LKGPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYVKLKEEESSGFPERAFNSTHNRAMGLVNTAFCRLQARWKLLSKPWKEGCRDFFPFIVLTGCLLHNF
        LKGPVYNLDD KPI QYLIGDSCFPLLPWLLTPY+KL EE+SSGFPERAFNSTHNRAMGLVNTAFC+++ARWKLLSKPWKE CRDFFPF+VLTGCLLHNF
Subjt:  LKGPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYVKLKEEESSGFPERAFNSTHNRAMGLVNTAFCRLQARWKLLSKPWKEGCRDFFPFIVLTGCLLHNF

Query:  LIKCSEKLEGEDREGEEEANCLSEEQKFPLYDGEIGDDRGKDIRDALAVHLSRLSFRR
        LIKCSEKLE E+++ E+ A+  SEEQKFPLYDGE GDDRGKDIRDALA+HLSRLSFRR
Subjt:  LIKCSEKLEGEDREGEEEANCLSEEQKFPLYDGEIGDDRGKDIRDALAVHLSRLSFRR

SwissProt top hits (columns: e value, % identity, alignment)
B0BN95 Putative nuclease HARBI1 | e value: 4.8e-10 | % identity: 23.66
Query:  SPSVAPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKLGHLLELRSDIDRIV----VGFGWISLPNCCGVLGLRRFGIAG------
        S +++P+  + AAL     G+    +G   GI  A   R    V +A+ E+    +   +D   I       +G   +P   G +      I        
Subjt:  SPSVAPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKLGHLLELRSDIDRIV----VGFGWISLPNCCGVLGLRRFGIAG------

Query:  EMLGKNG--SLLVQALVDAEGRFLDVSAGWPSTVIPETILRQSKLYAEIEKSGELLKGPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYVKLKEEESSGF
          + + G  SL    + D  G  + V   WP ++    +L+QS L ++ E                 P   +L+GDS F L  WLLTP + + E  +   
Subjt:  EMLGKNG--SLLVQALVDAEGRFLDVSAGWPSTVIPETILRQSKLYAEIEKSGELLKGPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYVKLKEEESSGF

Query:  PERAFNSTHNRAMGLVNTAFCRLQ----ARWKLLSKPWKEGCRDFFPFIVLTGCLLHNFLIK
          RA ++TH+     + T  CR +    ++  L   P K         I+L  C+LHN  ++
Subjt:  PERAFNSTHNRAMGLVNTAFCRLQ----ARWKLLSKPWKEGCRDFFPFIVLTGCLLHNFLIK

Q8BR93 Putative nuclease HARBI1 | e value: 2.4e-09 | % identity: 23.46
Query:  SPSVAPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVG------FGWISLPNCCGVLGLRRFGIAG----
        S +++P+  + AAL     G+    +G   GI  A   R    V +A+ E+    +     +D   V       +G   +P   GV       I      
Subjt:  SPSVAPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVG------FGWISLPNCCGVLGLRRFGIAG----

Query:  --EMLGKNG--SLLVQALVDAEGRFLDVSAGWPSTVIPETILRQSKLYAEIEKSGELLKGPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYVKLKEEESS
            + + G  SL    + D  G  + V   WP ++    +L++S L ++ E                 P   +L+GDS F L  WLLTP + + E  + 
Subjt:  --EMLGKNG--SLLVQALVDAEGRFLDVSAGWPSTVIPETILRQSKLYAEIEKSGELLKGPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYVKLKEEESS

Query:  GFPERAFNSTHNRAMGLVNTAFCRLQARWKLLSKPWKEGCRDFFP----FIVLTGCLLHN
            RA ++TH+     + T  CR +           +G   + P     I+L  C+LHN
Subjt:  GFPERAFNSTHNRAMGLVNTAFCRLQARWKLLSKPWKEGCRDFFP----FIVLTGCLLHN

Q94K49 Protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 | e value: 8.7e-28 | % identity: 27.92
Query:  KLALHPPPRPPPRLPPPPPPRQC-WFQRF--LSATSEVDCDPRWNFS--FRMSKSSFSLLLRLLSPIESSSSPS---------VAPDCALAAALFRLAHG
        KLA +   +    +P  P    C W+  F   +++  V  D  + F   FR SK++FS +  L+     S  PS         ++ +  +A AL RLA G
Subjt:  KLALHPPPRPPPRLPPPPPPRQC-WFQRF--LSATSEVDCDPRWNFS--FRMSKSSFSLLLRLLSPIESSSSPS---------VAPDCALAAALFRLAHG

Query:  ASYKAVGRRFGIDSADACRSFYAVCKAINEKLGHLLELRSD--IDRIVVGF-GWISLPNCCGVLGLRRFGIAGEML---------GKNGSLLVQALVDAE
         S  +VG  FG+  +   +  +   +A+ E+  H L       I+ I   F     LPNCCG +      +    +          KN S+ +Q + D E
Subjt:  ASYKAVGRRFGIDSADACRSFYAVCKAINEKLGHLLELRSD--IDRIVVGF-GWISLPNCCGVLGLRRFGIAGEML---------GKNGSLLVQALVDAE

Query:  GRFLDVSAGWPSTVIPETILRQSKLYAEIEKSGELLKGPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYVKLKEEESSGFPERAFNSTHNRAMGLVNTAF
         RFL++  GWP  +    +L+ S  + ++ ++ ++L G    L     I +Y++G   +PLLPWL+TP+    + +       AFN  H +   +  TAF
Subjt:  GRFLDVSAGWPSTVIPETILRQSKLYAEIEKSGELLKGPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYVKLKEEESSGFPERAFNSTHNRAMGLVNTAF

Query:  CRLQARWKLLSKPWKEGCRDFFPFIVLTGCLLHNFLIKCSEKLE------GEDREGEEEANCLSEEQKFPLYDGEIGDDRGKDIRDALAVHLSR
         +L+  W++LSK      R   P I+L  CLLHN +I C + L+      G    G  +  C   +Q  PL         G ++R  L  HL R
Subjt:  CRLQARWKLLSKPWKEGCRDFFPFIVLTGCLLHNFLIKCSEKLE------GEDREGEEEANCLSEEQKFPLYDGEIGDDRGKDIRDALAVHLSR

Q9M2U3 Protein ALP1-like | e value: 2.3e-36 | % identity: 28.06
Query:  WFQRFLSATSEVDCDPR-WNFSFRMSKSSFSLLLRLL--------SPIESSSSPSVAPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAI
        W+  F         DP+ +   F++S+ +F  +  L+        +    S+   ++ +  +A AL RL  G S   +G  FG++ +   +  +   +++
Subjt:  WFQRFLSATSEVDCDPR-WNFSFRMSKSSFSLLLRLL--------SPIESSSSPSVAPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAI

Query:  NEKLGHLLELRSDIDRIVVGFGWIS-LPNCCGVLGL-------------RRFGIAGEMLGKNGSLLVQALVDAEGRFLDVSAGWPSTVIPETILRQSKLY
         E+  H L   S +D I   F  IS LPNCCG + +              +  + GE   KN S+ +QA+VD + RFLDV AGWP ++  + +L+ S  Y
Subjt:  NEKLGHLLELRSDIDRIVVGFGWIS-LPNCCGVLGL-------------RRFGIAGEMLGKNGSLLVQALVDAEGRFLDVSAGWPSTVIPETILRQSKLY

Query:  AEIEKSGELLKGPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYVKLKEEESSGFPERAFNSTHNRAMGLVNTAFCRLQARWKLLSKPWKEGCRDFFPFIV
          +EK G+ L G    L +   + +Y++GDS FPLLPWLLTPY    + + +  P+  FN  H+ A      A  +L+ RW++++       R+  P I+
Subjt:  AEIEKSGELLKGPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYVKLKEEESSGFPERAFNSTHNRAMGLVNTAFCRLQARWKLLSKPWKEGCRDFFPFIV

Query:  LTGCLLHNFLIKCSEKLEGEDREGEEEANCLSEEQKFPLYDGEIGDDRGKDIRDALAVHL
           CLLHN +I      + ED+  +++      +  +     ++ D+    +RD L+  L
Subjt:  LTGCLLHNFLIKCSEKLEGEDREGEEEANCLSEEQKFPLYDGEIGDDRGKDIRDALAVHL

Arabidopsis top hits (columns: e value, % identity, alignment)
AT1G72270.1 CONTAINS InterPro DOMAIN/s: Ribosome 60S biogenesis N-terminal (InterPro:IPR021714) | e value: 2.3e-76 | % identity: 42.02
Query:  KKSDR-------QSHLNQQLVTLIETTISSAHSFLSHNDLHLLPSQTLALESLISSTSSSLQALAPRLPKLALHPPPRPPPRLPPPPPPRQCWFQRFLSA
        +KSDR       +  L   L+  + +  +  +SFL  NDL L PSQTL LESLISS                   P  P P           WF RFL++
Subjt:  KKSDR-------QSHLNQQLVTLIETTISSAHSFLSHNDLHLLPSQTLALESLISSTSSSLQALAPRLPKLALHPPPRPPPRLPPPPPPRQCWFQRFLSA

Query:  TSEVDCDPRWNFSFRMSKSSFSLLLRLLSPIESSSSPSVAPDCALAAALFRLAHGASYKAVGRRFGIDS-ADACRSFYAVCKAINEKLGHLLELRSDIDR
         +E + DPRW   FRMSKS+F  L  +LS    SS PS       AA +FRLAHGASY+ +  RFG DS + A RSF+ VCK INEKL         +D 
Subjt:  TSEVDCDPRWNFSFRMSKSSFSLLLRLLSPIESSSSPSVAPDCALAAALFRLAHGASYKAVGRRFGIDS-ADACRSFYAVCKAINEKLGHLLELRSDIDR

Query:  IVVGFGWISLPNCCGVLGLRRFGIAGEMLGKNGSLLVQALVDAEGRFLDVSAGWPSTVIPETILRQSKLYAEIEKSGELLKGPVYNLDDEKPIPQYLIGD
            F    LPNC GV+G  RF + G++LG  GS+LVQALVD+ GRF+D+SAGWPST+ PE I RQ+KL++  E   E+L G    L +   +P+Y++GD
Subjt:  IVVGFGWISLPNCCGVLGLRRFGIAGEMLGKNGSLLVQALVDAEGRFLDVSAGWPSTVIPETILRQSKLYAEIEKSGELLKGPVYNLDDEKPIPQYLIGD

Query:  SCFPLLPWLLTPYVKLKEEESSGFPERAFNSTHNRAMGLVNTAFCRLQARWKLLSKPWKEGCRDFFPFIVLTGCLLHNFLIKCSEKLEGEDREGEEEA--
        SC PLLPWL+TPY    +EES  F E  FN+  +  +  V  AF +++ARW++L K WK    +F PF++ TGCLLHNFL+       G+D +  EE   
Subjt:  SCFPLLPWLLTPYVKLKEEESSGFPERAFNSTHNRAMGLVNTAFCRLQARWKLLSKPWKEGCRDFFPFIVLTGCLLHNFLIKCSEKLEGEDREGEEEA--

Query:  NCLS-----------EEQKFPLYDGEIGDDRGKDIRDALAVHLSR
         C +           +E++   ++GE   +  K IRDA+A +LSR
Subjt:  NCLS-----------EEQKFPLYDGEIGDDRGKDIRDALAVHLSR

AT1G72270.2 LOCATED IN: mitochondrion | e value: 4.7e-77 | % identity: 42
Query:  KKSDR-------QSHLNQQLVTLIETTISSAHSFLSHNDLHLLPSQTLALESLISSTSSSLQALAPRLPKLALHPPPRPPPRLPPPPPPRQCWFQRFLSA
        +KSDR       +  L   L+  + +  +  +SFL  NDL L PSQTL LESLISS                   P  P P           WF RFL++
Subjt:  KKSDR-------QSHLNQQLVTLIETTISSAHSFLSHNDLHLLPSQTLALESLISSTSSSLQALAPRLPKLALHPPPRPPPRLPPPPPPRQCWFQRFLSA

Query:  TSEVDCDPRWNFSFRMSKSSFSLLLRLLSPIESSSSPSVAPDCALAAALFRLAHGASYKAVGRRFGIDS-ADACRSFYAVCKAINEKLGHLLELRSDIDR
         +E + DPRW   FRMSKS+F  L  +LS    SS PS       AA +FRLAHGASY+ +  RFG DS + A RSF+ VCK INEKL         +D 
Subjt:  TSEVDCDPRWNFSFRMSKSSFSLLLRLLSPIESSSSPSVAPDCALAAALFRLAHGASYKAVGRRFGIDS-ADACRSFYAVCKAINEKLGHLLELRSDIDR

Query:  IVVGFGWISLPNCCGVLGLRRFGIAGEMLGKNGSLLVQALVDAEGRFLDVSAGWPSTVIPETILRQSKLYAEIEKSGELLKGPVYNLDDEKPIPQYLIGD
            F    LPNC GV+G  RF + G++LG  GS+LVQALVD+ GRF+D+SAGWPST+ PE I RQ+KL++  E   E+L G    L +   +P+Y++GD
Subjt:  IVVGFGWISLPNCCGVLGLRRFGIAGEMLGKNGSLLVQALVDAEGRFLDVSAGWPSTVIPETILRQSKLYAEIEKSGELLKGPVYNLDDEKPIPQYLIGD

Query:  SCFPLLPWLLTPYVKLKEEESSGFPERAFNSTHNRAMGLVNTAFCRLQARWKLLSKPWKEGCRDFFPFIVLTGCLLHNFLIKCSEKLEGEDREGEEEA--
        SC PLLPWL+TPY    +EES  F E  FN+  +  +  V  AF +++ARW++L K WK    +F PF++ TGCLLHNFL+       G+D +  EE   
Subjt:  SCFPLLPWLLTPYVKLKEEESSGFPERAFNSTHNRAMGLVNTAFCRLQARWKLLSKPWKEGCRDFFPFIVLTGCLLHNFLIKCSEKLEGEDREGEEEA--

Query:  NCLS-----------EEQKFPLYDGEIGDDRGKDIRDALAVHLSRLSFRR
         C +           +E++   ++GE   +  K IRDA+A +LSR+S  R
Subjt:  NCLS-----------EEQKFPLYDGEIGDDRGKDIRDALAVHLSRLSFRR

AT3G19120.1 PIF / Ping-Pong family of plant transposases | e value: 2.9e-18 | % identity: 28.54
Query:  SQTLALESLISSTSSSLQ----ALAPRLPKLALH--------PPPRPPPRLPPPPPPRQCWFQRFLSATSE----VDC---DPRWNFSFRMSKSSFSLLL
        SQ+    S + STSS+       LA  L  LA++            P P  PPP          F + T++    +D    D RW   + +S   F  ++
Subjt:  SQTLALESLISSTSSSLQ----ALAPRLPKLALH--------PPPRPPPRLPPPPPPRQCWFQRFLSATSE----VDC---DPRWNFSFRMSKSSFSLLL

Query:  RLLSPIESSSSPSVAPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKL-GHLLELRSDIDRIV---VGFGWI-SLPNCCGVLG---
          L P  ++S+ S+  D A+A  L RLAHG S K +  R+ +D     +    V + +  KL    +++     R++    GF  + SLPN CG +    
Subjt:  RLLSPIESSSSPSVAPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKL-GHLLELRSDIDRIV---VGFGWI-SLPNCCGVLG---

Query:  --LRR------FGIAGEMLGKNGSLLVQALVDAEGRFLDVSAGWPSTVIPETILRQSKLYAEIEKSGELLKGPVYNLDDEKPIPQYLIGDSCFPLLPWLL
          LRR        I G   G + ++L+Q + D +  F DV    P      +  R S LY  +  SG+++   V N+      P Y++GD C+PLL +L+
Subjt:  --LRR------FGIAGEMLGKNGSLLVQALVDAEGRFLDVSAGWPSTVIPETILRQSKLYAEIEKSGELLKGPVYNLDDEKPIPQYLIGDSCFPLLPWLL

Query:  TPYVKLKEEESSGFPERAFNSTHNRAMGLVNTAFCRLQARWKLLSKPWKEGCRDFFPFIVLTGCLLHNFLIKCSEKLEGEDREGEEE----ANCLSEEQK
        TP+       S   PE  F+    +   +V  A   L+ARWK+L +    G  +  P  ++  C+LHN L + + + E E  +  +E    A  L  E++
Subjt:  TPYVKLKEEESSGFPERAFNSTHNRAMGLVNTAFCRLQARWKLLSKPWKEGCRDFFPFIVLTGCLLHNFLIKCSEKLEGEDREGEEE----ANCLSEEQK

Query:  FPLYDGEIGDDRGKDIRDALAVHL-SRLSFR
        F  Y        G+ +R ALA  L  RLS R
Subjt:  FPLYDGEIGDDRGKDIRDALAVHL-SRLSFR

AT3G55350.1 PIF / Ping-Pong family of plant transposases | e value: 1.6e-37 | % identity: 28.06
Query:  WFQRFLSATSEVDCDPR-WNFSFRMSKSSFSLLLRLL--------SPIESSSSPSVAPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAI
        W+  F         DP+ +   F++S+ +F  +  L+        +    S+   ++ +  +A AL RL  G S   +G  FG++ +   +  +   +++
Subjt:  WFQRFLSATSEVDCDPR-WNFSFRMSKSSFSLLLRLL--------SPIESSSSPSVAPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAI

Query:  NEKLGHLLELRSDIDRIVVGFGWIS-LPNCCGVLGL-------------RRFGIAGEMLGKNGSLLVQALVDAEGRFLDVSAGWPSTVIPETILRQSKLY
         E+  H L   S +D I   F  IS LPNCCG + +              +  + GE   KN S+ +QA+VD + RFLDV AGWP ++  + +L+ S  Y
Subjt:  NEKLGHLLELRSDIDRIVVGFGWIS-LPNCCGVLGL-------------RRFGIAGEMLGKNGSLLVQALVDAEGRFLDVSAGWPSTVIPETILRQSKLY

Query:  AEIEKSGELLKGPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYVKLKEEESSGFPERAFNSTHNRAMGLVNTAFCRLQARWKLLSKPWKEGCRDFFPFIV
          +EK G+ L G    L +   + +Y++GDS FPLLPWLLTPY    + + +  P+  FN  H+ A      A  +L+ RW++++       R+  P I+
Subjt:  AEIEKSGELLKGPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYVKLKEEESSGFPERAFNSTHNRAMGLVNTAFCRLQARWKLLSKPWKEGCRDFFPFIV

Query:  LTGCLLHNFLIKCSEKLEGEDREGEEEANCLSEEQKFPLYDGEIGDDRGKDIRDALAVHL
           CLLHN +I      + ED+  +++      +  +     ++ D+    +RD L+  L
Subjt:  LTGCLLHNFLIKCSEKLEGEDREGEEEANCLSEEQKFPLYDGEIGDDRGKDIRDALAVHL

AT3G63270.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912) | e value: 6.2e-29 | % identity: 27.92
Query:  KLALHPPPRPPPRLPPPPPPRQC-WFQRF--LSATSEVDCDPRWNFS--FRMSKSSFSLLLRLLSPIESSSSPS---------VAPDCALAAALFRLAHG
        KLA +   +    +P  P    C W+  F   +++  V  D  + F   FR SK++FS +  L+     S  PS         ++ +  +A AL RLA G
Subjt:  KLALHPPPRPPPRLPPPPPPRQC-WFQRF--LSATSEVDCDPRWNFS--FRMSKSSFSLLLRLLSPIESSSSPS---------VAPDCALAAALFRLAHG

Query:  ASYKAVGRRFGIDSADACRSFYAVCKAINEKLGHLLELRSD--IDRIVVGF-GWISLPNCCGVLGLRRFGIAGEML---------GKNGSLLVQALVDAE
         S  +VG  FG+  +   +  +   +A+ E+  H L       I+ I   F     LPNCCG +      +    +          KN S+ +Q + D E
Subjt:  ASYKAVGRRFGIDSADACRSFYAVCKAINEKLGHLLELRSD--IDRIVVGF-GWISLPNCCGVLGLRRFGIAGEML---------GKNGSLLVQALVDAE

Query:  GRFLDVSAGWPSTVIPETILRQSKLYAEIEKSGELLKGPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYVKLKEEESSGFPERAFNSTHNRAMGLVNTAF
         RFL++  GWP  +    +L+ S  + ++ ++ ++L G    L     I +Y++G   +PLLPWL+TP+    + +       AFN  H +   +  TAF
Subjt:  GRFLDVSAGWPSTVIPETILRQSKLYAEIEKSGELLKGPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYVKLKEEESSGFPERAFNSTHNRAMGLVNTAF

Query:  CRLQARWKLLSKPWKEGCRDFFPFIVLTGCLLHNFLIKCSEKLE------GEDREGEEEANCLSEEQKFPLYDGEIGDDRGKDIRDALAVHLSR
         +L+  W++LSK      R   P I+L  CLLHN +I C + L+      G    G  +  C   +Q  PL         G ++R  L  HL R
Subjt:  CRLQARWKLLSKPWKEGCRDFFPFIVLTGCLLHNFLIKCSEKLE------GEDREGEEEANCLSEEQKFPLYDGEIGDDRGKDIRDALAVHLSR


Sequences
CDS sequence
ATGGCCGCCGGAGGAGTCGGCGGCGATAGGAGAGTCACCAGAAACGCCGCCGTGAACGCCGCCGCTACCAAAAGCAAGGCCAAGAAGTCCGACCGGCAGAGCCACCTGAA
CCAGCAACTGGTAACTCTCATCGAAACCACCATCTCTTCCGCTCACTCATTTCTCTCTCACAACGATCTCCACCTTCTTCCCTCTCAAACCCTCGCCCTTGAATCGCTCA
TCTCTTCTACATCATCCTCTCTCCAAGCCCTCGCTCCTCGCCTCCCCAAACTTGCCCTACATCCGCCGCCGCGGCCGCCTCCGCGGCTACCTCCGCCTCCACCTCCGCGG
CAATGCTGGTTCCAGCGCTTCCTCTCTGCGACGTCGGAGGTGGACTGTGATCCTAGGTGGAATTTCTCCTTCCGCATGTCGAAATCGTCGTTCTCACTCCTCCTTCGTCT
CCTCTCCCCGATTGAAAGCTCCTCGTCTCCTTCCGTTGCCCCCGACTGTGCTTTAGCCGCTGCGCTTTTTCGATTGGCGCATGGCGCGAGCTACAAGGCGGTTGGAAGAC
GGTTTGGGATCGATTCTGCCGACGCTTGCCGCTCGTTTTATGCAGTTTGTAAGGCTATCAATGAGAAATTGGGGCATTTGCTTGAGCTCCGGTCAGACATTGATCGGATT
GTGGTGGGATTTGGGTGGATTTCGCTTCCCAATTGCTGTGGGGTTTTAGGGCTTAGAAGATTTGGGATAGCCGGCGAGATGCTAGGCAAAAACGGGTCGCTTCTGGTTCA
GGCACTGGTGGACGCCGAAGGGAGGTTTCTGGATGTCTCCGCCGGGTGGCCGAGCACGGTAATACCTGAAACAATCTTGCGGCAGAGCAAGCTATATGCAGAAATTGAGA
AATCTGGTGAATTACTCAAAGGCCCTGTCTATAATCTCGACGATGAAAAGCCCATTCCCCAATACCTGATTGGCGATTCTTGCTTCCCCCTTTTGCCATGGCTATTGACG
CCATACGTGAAACTGAAGGAAGAAGAAAGCTCTGGTTTCCCGGAGCGAGCATTCAATTCCACGCATAACCGTGCTATGGGGTTAGTTAACACTGCTTTTTGCAGACTCCA
AGCTCGGTGGAAGCTTCTGTCAAAACCATGGAAGGAAGGATGTAGAGACTTTTTCCCATTTATTGTTTTGACCGGGTGTTTGCTGCATAATTTCCTCATCAAGTGCAGTG
AGAAACTAGAAGGAGAAGATCGAGAAGGAGAAGAAGAAGCAAATTGTTTAAGTGAGGAGCAGAAGTTTCCTCTTTATGATGGCGAGATAGGAGACGATAGAGGAAAGGAT
ATCAGAGACGCGCTTGCCGTGCATTTGAGTAGGCTGAGCTTCAGAAGA
mRNA sequence
ATGGCCGCCGGAGGAGTCGGCGGCGATAGGAGAGTCACCAGAAACGCCGCCGTGAACGCCGCCGCTACCAAAAGCAAGGCCAAGAAGTCCGACCGGCAGAGCCACCTGAA
CCAGCAACTGGTAACTCTCATCGAAACCACCATCTCTTCCGCTCACTCATTTCTCTCTCACAACGATCTCCACCTTCTTCCCTCTCAAACCCTCGCCCTTGAATCGCTCA
TCTCTTCTACATCATCCTCTCTCCAAGCCCTCGCTCCTCGCCTCCCCAAACTTGCCCTACATCCGCCGCCGCGGCCGCCTCCGCGGCTACCTCCGCCTCCACCTCCGCGG
CAATGCTGGTTCCAGCGCTTCCTCTCTGCGACGTCGGAGGTGGACTGTGATCCTAGGTGGAATTTCTCCTTCCGCATGTCGAAATCGTCGTTCTCACTCCTCCTTCGTCT
CCTCTCCCCGATTGAAAGCTCCTCGTCTCCTTCCGTTGCCCCCGACTGTGCTTTAGCCGCTGCGCTTTTTCGATTGGCGCATGGCGCGAGCTACAAGGCGGTTGGAAGAC
GGTTTGGGATCGATTCTGCCGACGCTTGCCGCTCGTTTTATGCAGTTTGTAAGGCTATCAATGAGAAATTGGGGCATTTGCTTGAGCTCCGGTCAGACATTGATCGGATT
GTGGTGGGATTTGGGTGGATTTCGCTTCCCAATTGCTGTGGGGTTTTAGGGCTTAGAAGATTTGGGATAGCCGGCGAGATGCTAGGCAAAAACGGGTCGCTTCTGGTTCA
GGCACTGGTGGACGCCGAAGGGAGGTTTCTGGATGTCTCCGCCGGGTGGCCGAGCACGGTAATACCTGAAACAATCTTGCGGCAGAGCAAGCTATATGCAGAAATTGAGA
AATCTGGTGAATTACTCAAAGGCCCTGTCTATAATCTCGACGATGAAAAGCCCATTCCCCAATACCTGATTGGCGATTCTTGCTTCCCCCTTTTGCCATGGCTATTGACG
CCATACGTGAAACTGAAGGAAGAAGAAAGCTCTGGTTTCCCGGAGCGAGCATTCAATTCCACGCATAACCGTGCTATGGGGTTAGTTAACACTGCTTTTTGCAGACTCCA
AGCTCGGTGGAAGCTTCTGTCAAAACCATGGAAGGAAGGATGTAGAGACTTTTTCCCATTTATTGTTTTGACCGGGTGTTTGCTGCATAATTTCCTCATCAAGTGCAGTG
AGAAACTAGAAGGAGAAGATCGAGAAGGAGAAGAAGAAGCAAATTGTTTAAGTGAGGAGCAGAAGTTTCCTCTTTATGATGGCGAGATAGGAGACGATAGAGGAAAGGAT
ATCAGAGACGCGCTTGCCGTGCATTTGAGTAGGCTGAGCTTCAGAAGA
Protein sequence
MAAGGVGGDRRVTRNAAVNAAATKSKAKKSDRQSHLNQQLVTLIETTISSAHSFLSHNDLHLLPSQTLALESLISSTSSSLQALAPRLPKLALHPPPRPPPRLPPPPPPR
QCWFQRFLSATSEVDCDPRWNFSFRMSKSSFSLLLRLLSPIESSSSPSVAPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKLGHLLELRSDIDRI
VVGFGWISLPNCCGVLGLRRFGIAGEMLGKNGSLLVQALVDAEGRFLDVSAGWPSTVIPETILRQSKLYAEIEKSGELLKGPVYNLDDEKPIPQYLIGDSCFPLLPWLLT
PYVKLKEEESSGFPERAFNSTHNRAMGLVNTAFCRLQARWKLLSKPWKEGCRDFFPFIVLTGCLLHNFLIKCSEKLEGEDREGEEEANCLSEEQKFPLYDGEIGDDRGKD
IRDALAVHLSRLSFRR
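
The protein sequence can be cross-checked against the CDS by translating codons with the standard genetic code. This is a minimal sketch, assuming the standard code (NCBI translation table 1) applies to this nuclear gene; `translate` is a hypothetical helper, and only the final line of the CDS is used here for brevity.

```python
BASES = "TCAG"
# Amino acids for the 64 codons enumerated in TCAG order (standard genetic code).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# Last line of the CDS sequence listed above (in frame with the start codon).
cds_tail = "ATCAGAGACGCGCTTGCCGTGCATTTGAGTAGGCTGAGCTTCAGAAGA"
print(translate(cds_tail))  # IRDALAVHLSRLSFRR
```

The output matches the last line of the protein sequence, and the CDS as listed ends without a stop codon.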