; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0040835 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0040835
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotransposon protein
Genome locationchr13:8847272..8851295
RNA-Seq ExpressionLag0040835
SyntenyLag0040835
Gene Ontology termsGO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR027806 - Harbinger transposase-derived nuclease domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035620.1 retrotransposon protein [Cucumis melo var. makuwa]4.2e-12459.33Show/hide
Query:  IEQDELIAILAIICTTQYQWILIFNFIMSLGDLTIANHSPY-VRHQIRQLNFFRLIHESDLACRESTRMDRRCFTILCTLLRTTGGLSGTNYVDVEEMVG
        +++ EL +I+     +Q Q +L+   + +  D     H PY  RH+IRQL +FR+IH SDL CR+STRMDRRCF ILC LLRT  GL+ T  VDVEEMV 
Subjt:  IEQDELIAILAIICTTQYQWILIFNFIMSLGDLTIANHSPY-VRHQIRQLNFFRLIHESDLACRESTRMDRRCFTILCTLLRTTGGLSGTNYVDVEEMVG

Query:  IFLHILAHDVKNRVVRRHFARSGETVSRH-----------------------------------NCLGALDGTYIKVNVSATDRPRYRTRKGEVATNVLG
        +FLHILAHDVKNRV++R F RSGET+SRH                                   NCLGALDGTYIKVNV A+DR RYRTRKGEVATNVLG
Subjt:  IFLHILAHDVKNRVVRRHFARSGETVSRH-----------------------------------NCLGALDGTYIKVNVSATDRPRYRTRKGEVATNVLG

Query:  VCSQNGEFIFVLPGWEGSAADARVLRDAISRPTGLRIPRGYYYLCDAGYPNAEGFLAPYRGQRYHLTEWRG-EHPPTTPKELFNMRHSSARNVIERAFGA
        VC   G+F++VL GWEGSAAD+R+LRDA+SRP  L++P+GYYYL DAGYPNAEGFLAPYRGQRYHL EWRG E+ P+T KE FNM+HSSARNVIERAFG 
Subjt:  VCSQNGEFIFVLPGWEGSAADARVLRDAISRPTGLRIPRGYYYLCDAGYPNAEGFLAPYRGQRYHLTEWRG-EHPPTTPKELFNMRHSSARNVIERAFGA

Query:  LKGRWAILRGRSFYPVRVQCRMITACCLLHNLIIREMGPDPSYEGHPGEVDSN--GMSVENINFVEPTGAWTEWRDSLANQMFEDW
        LKGRWAILRG+S+YPV VQCR I ACCLLHNLI REM  +   E +  EVDS     + ++I+++E +  W++WRD+LA +MF +W
Subjt:  LKGRWAILRGRSFYPVRVQCRMITACCLLHNLIIREMGPDPSYEGHPGEVDSN--GMSVENINFVEPTGAWTEWRDSLANQMFEDW

KAA0046727.1 retrotransposon protein [Cucumis melo var. makuwa]4.2e-12459.33Show/hide
Query:  IEQDELIAILAIICTTQYQWILIFNFIMSLGDLTIANHSPY-VRHQIRQLNFFRLIHESDLACRESTRMDRRCFTILCTLLRTTGGLSGTNYVDVEEMVG
        +++ EL +I+     +Q Q +L+   + +  D     H PY  RH+IRQL +FR+IH SDL CR+STRMDRRCF ILC LLRT  GL+ T  VDVEEMV 
Subjt:  IEQDELIAILAIICTTQYQWILIFNFIMSLGDLTIANHSPY-VRHQIRQLNFFRLIHESDLACRESTRMDRRCFTILCTLLRTTGGLSGTNYVDVEEMVG

Query:  IFLHILAHDVKNRVVRRHFARSGETVSRH-----------------------------------NCLGALDGTYIKVNVSATDRPRYRTRKGEVATNVLG
        +FLHILAHDVKNRV++R F RSGET+SRH                                   NCLGALDGTYIKVNV A+DR RYRTRKGEVATNVLG
Subjt:  IFLHILAHDVKNRVVRRHFARSGETVSRH-----------------------------------NCLGALDGTYIKVNVSATDRPRYRTRKGEVATNVLG

Query:  VCSQNGEFIFVLPGWEGSAADARVLRDAISRPTGLRIPRGYYYLCDAGYPNAEGFLAPYRGQRYHLTEWRG-EHPPTTPKELFNMRHSSARNVIERAFGA
        VC   G+F++VL GWEGSAAD+R+LRDA+SRP  L++P+GYYYL DAGYPNAEGFLAPYRGQRYHL EWRG E+ P+T KE FNM+HSSARNVIERAFG 
Subjt:  VCSQNGEFIFVLPGWEGSAADARVLRDAISRPTGLRIPRGYYYLCDAGYPNAEGFLAPYRGQRYHLTEWRG-EHPPTTPKELFNMRHSSARNVIERAFGA

Query:  LKGRWAILRGRSFYPVRVQCRMITACCLLHNLIIREMGPDPSYEGHPGEVDSN--GMSVENINFVEPTGAWTEWRDSLANQMFEDW
        LKGRWAILRG+S+YPV VQCR I ACCLLHNLI REM  +   E +  EVDS     + ++I+++E +  W++WRD+LA +MF +W
Subjt:  LKGRWAILRGRSFYPVRVQCRMITACCLLHNLIIREMGPDPSYEGHPGEVDSN--GMSVENINFVEPTGAWTEWRDSLANQMFEDW

KAA0047510.1 retrotransposon protein [Cucumis melo var. makuwa]6.1e-12358.81Show/hide
Query:  IEQDELIAILAIICTTQYQWILIFNFIMSLGDLTIANHSPY-VRHQIRQLNFFRLIHESDLACRESTRMDRRCFTILCTLLRTTGGLSGTNYVDVEEMVG
        +++ EL +I+     +Q Q +L+   + +  D     H PY  RH+IRQL +FR+IH SDL CRESTRMDRRCF ILC LLRT  GL+ T  VDVEEMV 
Subjt:  IEQDELIAILAIICTTQYQWILIFNFIMSLGDLTIANHSPY-VRHQIRQLNFFRLIHESDLACRESTRMDRRCFTILCTLLRTTGGLSGTNYVDVEEMVG

Query:  IFLHILAHDVKNRVVRRHFARSGETVSRH-----------------------------------NCLGALDGTYIKVNVSATDRPRYRTRKGEVATNVLG
        +FLHILAHDVKNRV++R F RSGET+S H                                   NCLGALDGTYIKVNV A+DR RYRTRKGEVATNVLG
Subjt:  IFLHILAHDVKNRVVRRHFARSGETVSRH-----------------------------------NCLGALDGTYIKVNVSATDRPRYRTRKGEVATNVLG

Query:  VCSQNGEFIFVLPGWEGSAADARVLRDAISRPTGLRIPRGYYYLCDAGYPNAEGFLAPYRGQRYHLTEWRG-EHPPTTPKELFNMRHSSARNVIERAFGA
        VC + G+F++VL GWEGSAAD+R+LRDA+SRP GL++P+GYYYL DAGYPN EGFLAPYRGQRYHL EW G E+ P+T KE FNM+H SARNVIERAFG 
Subjt:  VCSQNGEFIFVLPGWEGSAADARVLRDAISRPTGLRIPRGYYYLCDAGYPNAEGFLAPYRGQRYHLTEWRG-EHPPTTPKELFNMRHSSARNVIERAFGA

Query:  LKGRWAILRGRSFYPVRVQCRMITACCLLHNLIIREMGPDPSYEGHPGEVDSN--GMSVENINFVEPTGAWTEWRDSLANQMFEDW
        LKGRWAILRG+S+YPV VQCR I ACCLLHNLI REM  +   E +  EVDS     + ++I+++E +  W++WRD LA +MF +W
Subjt:  LKGRWAILRGRSFYPVRVQCRMITACCLLHNLIIREMGPDPSYEGHPGEVDSN--GMSVENINFVEPTGAWTEWRDSLANQMFEDW

KAA0065306.1 retrotransposon protein [Cucumis melo var. makuwa]6.3e-12870.93Show/hide
Query:  MDRRCFTILCTLLRTTGGLSGTNYVDVEEMVGIFLHILAHDVKNRVVRRHFARSGETVSRH-------------------------------NCLGALDG
        MDRRCFTILCT+LRT GGL  T YVDVEEM+ IFLHI+AHDVKNRV RRHFARSGETVSRH                                CLGALDG
Subjt:  MDRRCFTILCTLLRTTGGLSGTNYVDVEEMVGIFLHILAHDVKNRVVRRHFARSGETVSRH-------------------------------NCLGALDG

Query:  TYIKVNVSATDRPRYRTRKGEVATNVLGVCSQNGEFIFVLPGWEGSAADARVLRDAISRPTGLRIPRGYYYLCDAGYPNAEGFLAPYRGQRYHLTEWRGE
        T+IKVNVS +DRPRYR+RKG++ TNVLGVC QNGEFIFV+PGWEGSA+D+RVLRDA+SR TGL++P+GYYYLCDAGYPNAEGFLAPYRGQRYHLTEWRG 
Subjt:  TYIKVNVSATDRPRYRTRKGEVATNVLGVCSQNGEFIFVLPGWEGSAADARVLRDAISRPTGLRIPRGYYYLCDAGYPNAEGFLAPYRGQRYHLTEWRGE

Query:  HPPTTPKELFNMRHSSARNVIERAFGALKGRWAILRGRSFYPVRVQCRMITACCLLHNLIIREMGPDPSY-EGHPGEVDSNGMSVENINFVEPTGAWTEW
        +PP  PKELFNMRHS ARNVIERAFG+L  RW IL+GRS+Y V +QC++ITACCLLHNLI REMG + ++ E H GE DS+ M++ENINFVE T  WTEW
Subjt:  HPPTTPKELFNMRHSSARNVIERAFGALKGRWAILRGRSFYPVRVQCRMITACCLLHNLIIREMGPDPSY-EGHPGEVDSNGMSVENINFVEPTGAWTEW

Query:  RDSLANQMFEDWN
        RD+LANQMFEDWN
Subjt:  RDSLANQMFEDWN

TYK08067.1 retrotransposon protein [Cucumis melo var. makuwa]6.1e-12369.93Show/hide
Query:  MDRRCFTILCTLLRTTGGLSGTNYVDVEEMVGIFLHILAHDVKNRVVRRHFARSGETVSR------------------------HNCLGALDGTYIKVNV
        MD+RCFTILCT+LRT GGL  T YVDVEEM  IFLHI+AHDVKNRV RRHFARS  TVSR                        H+C  ALDGT+IKVNV
Subjt:  MDRRCFTILCTLLRTTGGLSGTNYVDVEEMVGIFLHILAHDVKNRVVRRHFARSGETVSR------------------------HNCLGALDGTYIKVNV

Query:  SATDRPRYRTRKGEVATNVLGVCSQNGEFIFVLPGWEGSAADARVLRDAISRPTGLRIPRGYYYLCDAGYPNAEGFLAPYRGQRYHLTEWRGEHPPTTPK
        S +D PRYR+RK ++ TNVLG+CSQNGEFIFV+PGWEGSA+D+RVLRD +SRP GL++P+GYYYLCDA Y N EGFLAPYRGQRYHL EWRG +PP  PK
Subjt:  SATDRPRYRTRKGEVATNVLGVCSQNGEFIFVLPGWEGSAADARVLRDAISRPTGLRIPRGYYYLCDAGYPNAEGFLAPYRGQRYHLTEWRGEHPPTTPK

Query:  ELFNMRHSSARNVIERAFGALKGRWAILRGRSFYPVRVQCRMITACCLLHNLIIREMGPDPSY-EGHPGEVDSNGMSVENINFVEPTGAWTEWRDSLANQ
        ELFNMRHSSARNVIERAFG+LK RWAILRGRS+YPV +QC++ITACCLLHNLI REM  + ++ E H GE DS+ M++ENINFVE T  WTEWRD+LANQ
Subjt:  ELFNMRHSSARNVIERAFGALKGRWAILRGRSFYPVRVQCRMITACCLLHNLIIREMGPDPSY-EGHPGEVDSNGMSVENINFVEPTGAWTEWRDSLANQ

Query:  MFEDWN
        MFEDWN
Subjt:  MFEDWN

TrEMBL top hitse value%identityAlignment
A0A5A7TWH8 Retrotransposon protein3.0e-12358.81Show/hide
Query:  IEQDELIAILAIICTTQYQWILIFNFIMSLGDLTIANHSPY-VRHQIRQLNFFRLIHESDLACRESTRMDRRCFTILCTLLRTTGGLSGTNYVDVEEMVG
        +++ EL +I+     +Q Q +L+   + +  D     H PY  RH+IRQL +FR+IH SDL CRESTRMDRRCF ILC LLRT  GL+ T  VDVEEMV 
Subjt:  IEQDELIAILAIICTTQYQWILIFNFIMSLGDLTIANHSPY-VRHQIRQLNFFRLIHESDLACRESTRMDRRCFTILCTLLRTTGGLSGTNYVDVEEMVG

Query:  IFLHILAHDVKNRVVRRHFARSGETVSRH-----------------------------------NCLGALDGTYIKVNVSATDRPRYRTRKGEVATNVLG
        +FLHILAHDVKNRV++R F RSGET+S H                                   NCLGALDGTYIKVNV A+DR RYRTRKGEVATNVLG
Subjt:  IFLHILAHDVKNRVVRRHFARSGETVSRH-----------------------------------NCLGALDGTYIKVNVSATDRPRYRTRKGEVATNVLG

Query:  VCSQNGEFIFVLPGWEGSAADARVLRDAISRPTGLRIPRGYYYLCDAGYPNAEGFLAPYRGQRYHLTEWRG-EHPPTTPKELFNMRHSSARNVIERAFGA
        VC + G+F++VL GWEGSAAD+R+LRDA+SRP GL++P+GYYYL DAGYPN EGFLAPYRGQRYHL EW G E+ P+T KE FNM+H SARNVIERAFG 
Subjt:  VCSQNGEFIFVLPGWEGSAADARVLRDAISRPTGLRIPRGYYYLCDAGYPNAEGFLAPYRGQRYHLTEWRG-EHPPTTPKELFNMRHSSARNVIERAFGA

Query:  LKGRWAILRGRSFYPVRVQCRMITACCLLHNLIIREMGPDPSYEGHPGEVDSN--GMSVENINFVEPTGAWTEWRDSLANQMFEDW
        LKGRWAILRG+S+YPV VQCR I ACCLLHNLI REM  +   E +  EVDS     + ++I+++E +  W++WRD LA +MF +W
Subjt:  LKGRWAILRGRSFYPVRVQCRMITACCLLHNLIIREMGPDPSYEGHPGEVDSN--GMSVENINFVEPTGAWTEWRDSLANQMFEDW

A0A5A7TXW1 Retrotransposon protein2.0e-12459.33Show/hide
Query:  IEQDELIAILAIICTTQYQWILIFNFIMSLGDLTIANHSPY-VRHQIRQLNFFRLIHESDLACRESTRMDRRCFTILCTLLRTTGGLSGTNYVDVEEMVG
        +++ EL +I+     +Q Q +L+   + +  D     H PY  RH+IRQL +FR+IH SDL CR+STRMDRRCF ILC LLRT  GL+ T  VDVEEMV 
Subjt:  IEQDELIAILAIICTTQYQWILIFNFIMSLGDLTIANHSPY-VRHQIRQLNFFRLIHESDLACRESTRMDRRCFTILCTLLRTTGGLSGTNYVDVEEMVG

Query:  IFLHILAHDVKNRVVRRHFARSGETVSRH-----------------------------------NCLGALDGTYIKVNVSATDRPRYRTRKGEVATNVLG
        +FLHILAHDVKNRV++R F RSGET+SRH                                   NCLGALDGTYIKVNV A+DR RYRTRKGEVATNVLG
Subjt:  IFLHILAHDVKNRVVRRHFARSGETVSRH-----------------------------------NCLGALDGTYIKVNVSATDRPRYRTRKGEVATNVLG

Query:  VCSQNGEFIFVLPGWEGSAADARVLRDAISRPTGLRIPRGYYYLCDAGYPNAEGFLAPYRGQRYHLTEWRG-EHPPTTPKELFNMRHSSARNVIERAFGA
        VC   G+F++VL GWEGSAAD+R+LRDA+SRP  L++P+GYYYL DAGYPNAEGFLAPYRGQRYHL EWRG E+ P+T KE FNM+HSSARNVIERAFG 
Subjt:  VCSQNGEFIFVLPGWEGSAADARVLRDAISRPTGLRIPRGYYYLCDAGYPNAEGFLAPYRGQRYHLTEWRG-EHPPTTPKELFNMRHSSARNVIERAFGA

Query:  LKGRWAILRGRSFYPVRVQCRMITACCLLHNLIIREMGPDPSYEGHPGEVDSN--GMSVENINFVEPTGAWTEWRDSLANQMFEDW
        LKGRWAILRG+S+YPV VQCR I ACCLLHNLI REM  +   E +  EVDS     + ++I+++E +  W++WRD+LA +MF +W
Subjt:  LKGRWAILRGRSFYPVRVQCRMITACCLLHNLIIREMGPDPSYEGHPGEVDSN--GMSVENINFVEPTGAWTEWRDSLANQMFEDW

A0A5A7VG45 Retrotransposon protein3.1e-12870.93Show/hide
Query:  MDRRCFTILCTLLRTTGGLSGTNYVDVEEMVGIFLHILAHDVKNRVVRRHFARSGETVSRH-------------------------------NCLGALDG
        MDRRCFTILCT+LRT GGL  T YVDVEEM+ IFLHI+AHDVKNRV RRHFARSGETVSRH                                CLGALDG
Subjt:  MDRRCFTILCTLLRTTGGLSGTNYVDVEEMVGIFLHILAHDVKNRVVRRHFARSGETVSRH-------------------------------NCLGALDG

Query:  TYIKVNVSATDRPRYRTRKGEVATNVLGVCSQNGEFIFVLPGWEGSAADARVLRDAISRPTGLRIPRGYYYLCDAGYPNAEGFLAPYRGQRYHLTEWRGE
        T+IKVNVS +DRPRYR+RKG++ TNVLGVC QNGEFIFV+PGWEGSA+D+RVLRDA+SR TGL++P+GYYYLCDAGYPNAEGFLAPYRGQRYHLTEWRG 
Subjt:  TYIKVNVSATDRPRYRTRKGEVATNVLGVCSQNGEFIFVLPGWEGSAADARVLRDAISRPTGLRIPRGYYYLCDAGYPNAEGFLAPYRGQRYHLTEWRGE

Query:  HPPTTPKELFNMRHSSARNVIERAFGALKGRWAILRGRSFYPVRVQCRMITACCLLHNLIIREMGPDPSY-EGHPGEVDSNGMSVENINFVEPTGAWTEW
        +PP  PKELFNMRHS ARNVIERAFG+L  RW IL+GRS+Y V +QC++ITACCLLHNLI REMG + ++ E H GE DS+ M++ENINFVE T  WTEW
Subjt:  HPPTTPKELFNMRHSSARNVIERAFGALKGRWAILRGRSFYPVRVQCRMITACCLLHNLIIREMGPDPSY-EGHPGEVDSNGMSVENINFVEPTGAWTEW

Query:  RDSLANQMFEDWN
        RD+LANQMFEDWN
Subjt:  RDSLANQMFEDWN

A0A5D3BDX0 Retrotransposon protein2.0e-12459.33Show/hide
Query:  IEQDELIAILAIICTTQYQWILIFNFIMSLGDLTIANHSPY-VRHQIRQLNFFRLIHESDLACRESTRMDRRCFTILCTLLRTTGGLSGTNYVDVEEMVG
        +++ EL +I+     +Q Q +L+   + +  D     H PY  RH+IRQL +FR+IH SDL CR+STRMDRRCF ILC LLRT  GL+ T  VDVEEMV 
Subjt:  IEQDELIAILAIICTTQYQWILIFNFIMSLGDLTIANHSPY-VRHQIRQLNFFRLIHESDLACRESTRMDRRCFTILCTLLRTTGGLSGTNYVDVEEMVG

Query:  IFLHILAHDVKNRVVRRHFARSGETVSRH-----------------------------------NCLGALDGTYIKVNVSATDRPRYRTRKGEVATNVLG
        +FLHILAHDVKNRV++R F RSGET+SRH                                   NCLGALDGTYIKVNV A+DR RYRTRKGEVATNVLG
Subjt:  IFLHILAHDVKNRVVRRHFARSGETVSRH-----------------------------------NCLGALDGTYIKVNVSATDRPRYRTRKGEVATNVLG

Query:  VCSQNGEFIFVLPGWEGSAADARVLRDAISRPTGLRIPRGYYYLCDAGYPNAEGFLAPYRGQRYHLTEWRG-EHPPTTPKELFNMRHSSARNVIERAFGA
        VC   G+F++VL GWEGSAAD+R+LRDA+SRP  L++P+GYYYL DAGYPNAEGFLAPYRGQRYHL EWRG E+ P+T KE FNM+HSSARNVIERAFG 
Subjt:  VCSQNGEFIFVLPGWEGSAADARVLRDAISRPTGLRIPRGYYYLCDAGYPNAEGFLAPYRGQRYHLTEWRG-EHPPTTPKELFNMRHSSARNVIERAFGA

Query:  LKGRWAILRGRSFYPVRVQCRMITACCLLHNLIIREMGPDPSYEGHPGEVDSN--GMSVENINFVEPTGAWTEWRDSLANQMFEDW
        LKGRWAILRG+S+YPV VQCR I ACCLLHNLI REM  +   E +  EVDS     + ++I+++E +  W++WRD+LA +MF +W
Subjt:  LKGRWAILRGRSFYPVRVQCRMITACCLLHNLIIREMGPDPSYEGHPGEVDSN--GMSVENINFVEPTGAWTEWRDSLANQMFEDW

A0A5D3C7X6 Retrotransposon protein3.0e-12369.93Show/hide
Query:  MDRRCFTILCTLLRTTGGLSGTNYVDVEEMVGIFLHILAHDVKNRVVRRHFARSGETVSR------------------------HNCLGALDGTYIKVNV
        MD+RCFTILCT+LRT GGL  T YVDVEEM  IFLHI+AHDVKNRV RRHFARS  TVSR                        H+C  ALDGT+IKVNV
Subjt:  MDRRCFTILCTLLRTTGGLSGTNYVDVEEMVGIFLHILAHDVKNRVVRRHFARSGETVSR------------------------HNCLGALDGTYIKVNV

Query:  SATDRPRYRTRKGEVATNVLGVCSQNGEFIFVLPGWEGSAADARVLRDAISRPTGLRIPRGYYYLCDAGYPNAEGFLAPYRGQRYHLTEWRGEHPPTTPK
        S +D PRYR+RK ++ TNVLG+CSQNGEFIFV+PGWEGSA+D+RVLRD +SRP GL++P+GYYYLCDA Y N EGFLAPYRGQRYHL EWRG +PP  PK
Subjt:  SATDRPRYRTRKGEVATNVLGVCSQNGEFIFVLPGWEGSAADARVLRDAISRPTGLRIPRGYYYLCDAGYPNAEGFLAPYRGQRYHLTEWRGEHPPTTPK

Query:  ELFNMRHSSARNVIERAFGALKGRWAILRGRSFYPVRVQCRMITACCLLHNLIIREMGPDPSY-EGHPGEVDSNGMSVENINFVEPTGAWTEWRDSLANQ
        ELFNMRHSSARNVIERAFG+LK RWAILRGRS+YPV +QC++ITACCLLHNLI REM  + ++ E H GE DS+ M++ENINFVE T  WTEWRD+LANQ
Subjt:  ELFNMRHSSARNVIERAFGALKGRWAILRGRSFYPVRVQCRMITACCLLHNLIIREMGPDPSY-EGHPGEVDSNGMSVENINFVEPTGAWTEWRDSLANQ

Query:  MFEDWN
        MFEDWN
Subjt:  MFEDWN

SwissProt top hitse value%identityAlignment
Q94K49 Protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 12.1e-0928.11Show/hide
Query:  NCLGALDGTYIKVNVSATD-RPRYRTRKGEVATNVLGVCSQNGEFIFVLPGWEGSAADARVLRDAISRPTGLRIPRGYYYLCD-----AGYPNAEGFLAP
        NC GA+D T+I + + A      +  ++   +  + GV      F+ ++ GW G    +++L+ +           G++ LC+      G P      A 
Subjt:  NCLGALDGTYIKVNVSATD-RPRYRTRKGEVATNVLGVCSQNGEFIFVLPGWEGSAADARVLRDAISRPTGLRIPRGYYYLCD-----AGYPNAEGFLAP

Query:  YR-----GQRYHLTEW----RGEHPPTTPKELFNMRHSSARNVIERAFGALKGRWAILRGRSFYPVRVQC-RMITACCLLHNLII
         R     G  Y L  W         P+     FN RH   R+V   AF  LKG W IL    + P R +   +I  CCLLHN+II
Subjt:  YR-----GQRYHLTEW----RGEHPPTTPKELFNMRHSSARNVIERAFGALKGRWAILRGRSFYPVRVQC-RMITACCLLHNLII

Q9M2U3 Protein ALP1-like3.0e-1631.72Show/hide
Query:  NCLGALDGTYIKVNVSATDRPRYRTRKGE--VATNVLGVCSQNGEFIFVLPGWEGSAADARVLRDA--------ISRPTGLRIPRG------YYYLCDAG
        NC GA+D T+I +N+ A +        GE   +  +  V   +  F+ V+ GW GS  D  VL+++          R  G ++P         Y + D+G
Subjt:  NCLGALDGTYIKVNVSATDRPRYRTRKGE--VATNVLGVCSQNGEFIFVLPGWEGSAADARVLRDA--------ISRPTGLRIPRG------YYYLCDAG

Query:  YPNAEGFLAPYRGQRYHLTEWRGEHPPTTPKELFNMRHSSARNVIERAFGALKGRWAILRGRSFYPVRVQC-RMITACCLLHNLII
        +P     L PY+G+           P + P+  FN RHS A    + A   LK RW I+ G  + P R +  R+I  CCLLHN+II
Subjt:  YPNAEGFLAPYRGQRYHLTEWRGEHPPTTPKELFNMRHSSARNVIERAFGALKGRWAILRGRSFYPVRVQC-RMITACCLLHNLII

Arabidopsis top hitse value%identityAlignment
AT1G43722.1 unknown protein1.6e-2830.45Show/hide
Query:  NFFRLIHESDLACRESTRMDRRCFTILCTLLRTTGGLSGTNYVDVEEMVGIFLHILAHDVKNRVVRRHFARSGETVSR----------------------
        N +R + +   AC +  RM   CFT LC +L+T   L  T  + +EE V +FL I  H+   R V   F R+ ETV R                      
Subjt:  NFFRLIHESDLACRESTRMDRRCFTILCTLLRTTGGLSGTNYVDVEEMVGIFLHILAHDVKNRVVRRHFARSGETVSR----------------------

Query:  --------------------HNCLGALDGTYIKVNVSATDRPRYRTRKGEVATNVLGVCSQNGEFIFVLPGWEGSAADARVLRDAISRPTGLRIPRG-YY
                               +GA+DGT++ V V    +  Y  R    + N++ +C     F ++  G  GS  D  VL+ A    +   +P    Y
Subjt:  --------------------HNCLGALDGTYIKVNVSATDRPRYRTRKGEVATNVLGVCSQNGEFIFVLPGWEGSAADARVLRDAISRPTGLRIPRG-YY

Query:  YLCDAGYPNAEGFLAPYRGQ-----RYHLTEWRGEHPPTTPKELFNMRHSSARNVIERAFGALKGR
        YL D+GYPN +G LAPYR       RYH++++     P    ELFN  H+S R+VIER F   K +
Subjt:  YLCDAGYPNAEGFLAPYRGQ-----RYHLTEWRGEHPPTTPKELFNMRHSSARNVIERAFGALKGR

AT5G12010.1 unknown protein1.4e-1327.96Show/hide
Query:  NCLGALDGTYI-----KVNVSATDRPRY--RTRKGEVATNVLGVCSQNGEFIFVLPGWEGSAADARVLRDAI--SRPTGLRIPRGYYYLCDAGYPNAEGF
        N +G++  T+I     K++V++    R+  R +K   +  +  V +  G F  +  GW GS  D +VL  ++   R     + +G +     G+P  +  
Subjt:  NCLGALDGTYI-----KVNVSATDRPRY--RTRKGEVATNVLGVCSQNGEFIFVLPGWEGSAADARVLRDAI--SRPTGLRIPRGYYYLCDAGYPNAEGF

Query:  LAPYRGQRYHLTEWRGEHPPTTPKELFNMRHSSARNVIERAFGALKGRWAILRGRSFYPVRVQCRMITACCLLHNLI-IREMGPDP
        L PY  Q             T  +  FN + S  + V + AFG LKGRWA L+ R+   ++    ++ ACC+LHN+  +RE   +P
Subjt:  LAPYRGQRYHLTEWRGEHPPTTPKELFNMRHSSARNVIERAFGALKGRWAILRGRSFYPVRVQCRMITACCLLHNLI-IREMGPDP

AT5G28950.1 unknown protein1.3e-1436.36Show/hide
Query:  NCLGALDGTYIKVNVSATDRPRYRTRKGEVATNVLGVCSQNGEFIFVLPGWEGSAADARVLRDAISRPTG-LRIP---RGYYYLCDAGYPNAEGFLAPYR
        +C+GA+D T+I   VS    P +R RKG+++ N+L  C+ + EF++VL GWEGSA D++VL DA++R +  L +P        + +    N +  L    
Subjt:  NCLGALDGTYIKVNVSATDRPRYRTRKGEVATNVLGVCSQNGEFIFVLPGWEGSAADARVLRDAISRPTG-LRIP---RGYYYLCDAGYPNAEGFLAPYR

Query:  GQRYHLTEWR
         QR +  +WR
Subjt:  GQRYHLTEWR

AT5G35695.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912)1.1e-3241.05Show/hide
Query:  FIFVLPGWEGSAADARVLRDAISRPTGLRIPRGYYYLCDAGYPNAEGFLAPYRGQRYHLTEWRGE-HPPTTPKELFNMRHSSARNVIERAFGALKGRWAI
        FI+VL GWEGSA D+RVL DA+ +          +YL D G+ N   FLAP+RG RYHL E+ G+   P TP ELFN+RH S RNVIER FG  K R+AI
Subjt:  FIFVLPGWEGSAADARVLRDAISRPTGLRIPRGYYYLCDAGYPNAEGFLAPYRGQRYHLTEWRGE-HPPTTPKELFNMRHSSARNVIERAFGALKGRWAI

Query:  LRGRSFYPVRVQCRMITACCLLHNLIIREMGPD----PSYEGHPGEV---DSNGMSVENINFVEPTGAWTE-------WRDSLANQMFED
         +    +  + Q  ++  C  LHN + +E   D    P   G+ G+V   + N M+   I+  EP  A  +       WR S+A  M++D
Subjt:  LRGRSFYPVRVQCRMITACCLLHNLIIREMGPD----PSYEGHPGEV---DSNGMSVENINFVEPTGAWTE-------WRDSLANQMFED

AT5G41980.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912)1.6e-4434.74Show/hide
Query:  FRLIHESDLACRESTRMDRRCFTILCTLLRTTGGLSGTNYVDVEEMVGIFLHILAHDVKNRVVRRHFARSGETVSRH-----------------------
        +++++  +  C E+ RMD+  F  LC LL+T G L  TN + +E  + IFL I+ H+++ R V+  F  SGET+SRH                       
Subjt:  FRLIHESDLACRESTRMDRRCFTILCTLLRTTGGLSGTNYVDVEEMVGIFLHILAHDVKNRVVRRHFARSGETVSRH-----------------------

Query:  ----------NCLGALDGTYIKVNVSATDRPRYRTRKGEVATNVLGVCSQNGEFIFVLPGWEGSAADARVLRDAISRPTGLRIPRGYYYLCDAGYPNAEG
                  +C+G +D  +I V V   ++  +R   G +  NVL   S +  F +VL GWEGSA+D +VL  A++R   L++P+G YY+ D  YPN  G
Subjt:  ----------NCLGALDGTYIKVNVSATDRPRYRTRKGEVATNVLGVCSQNGEFIFVLPGWEGSAADARVLRDAISRPTGLRIPRGYYYLCDAGYPNAEG

Query:  FLAPYRGQRYHLTEWRGEHPPTTPKELFNMRHSSARNVIERAFGALKGRWAILRGRSFYPVRVQCRMITACCLLHNLIIREMGPD
        F+APY G   +  E          KE+FN RH      I R FGALK R+ IL     YP++ Q +++ A C LHN +  E   D
Subjt:  FLAPYRGQRYHLTEWRGEHPPTTPKELFNMRHSSARNVIERAFGALKGRWAILRGRSFYPVRVQCRMITACCLLHNLIIREMGPD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGGTATTTCCTAACGACGGAGGAAGGGTGGATTCTGGCGGGCATTGGAGTGGGGAGGGGGATTGTGAAGCATGTAGTGGACGTGATTCCATCTTGCCGCTCCATAAA
CTCGACCCTCTCTCTTGCTGCCTCCAAATTTCCATCAATTTTCTTGTATCGTGGCTTCCTCCCATTCAAAAAAAGTGTTCTCTGTTCTGATTGTGAGGTAGCGTCTGAGG
CCTGGTATGAGGGAGGTGGTCGGATGAGTGGAGCTTGGTGGGGGCTTTCATTGTCCACTAAACAGGTTGCTGAGTTTATTGTTTCTACTGTTCTTACCTTTCCTTTAAAT
GAGGTTTACCTTCTAACTAATGCCTTTGCTTTGTGTTCCTTTTCTTCTGTTTATCTCCAATATCTTTATGGTACGTGGTCTACTCTTGAATGTGCATTTCATATGTCCAT
GATAACGAATCCTAGAGGTCTTCTTTCCTCATACCCTAGGCCAGGCCGGTCAGTGCAACTTGAAGCTCGTCACCTCCCAATGGATGACCTTGACATAGAACAAGACGAGT
TAATAGCAATACTCGCTATCATATGCACCACACAATACCAATGGATTTTAATTTTCAATTTTATTATGAGTCTGGGAGATCTCACCATAGCAAATCATTCACCCTACGTT
AGACACCAAATTAGGCAGCTGAACTTCTTTCGCCTAATTCATGAGAGTGACTTAGCTTGTCGGGAGAGCACTCGAATGGATAGGAGATGTTTCACCATATTATGCACCTT
GCTAAGAACAACAGGAGGTTTGTCAGGCACGAATTATGTGGACGTCGAAGAGATGGTGGGGATTTTCCTTCATATTCTAGCGCATGACGTTAAGAACAGAGTAGTTCGAC
GACACTTTGCTCGATCCGGTGAGACGGTGTCACGACACAATTGTTTGGGTGCGCTCGATGGTACATACATTAAGGTTAACGTAAGCGCAACCGATCGCCCTAGGTACAGG
ACTAGGAAGGGTGAGGTTGCTACTAATGTACTAGGTGTGTGCTCTCAAAATGGTGAGTTCATCTTCGTACTACCCGGGTGGGAAGGATCTGCTGCAGACGCTCGAGTACT
AAGAGATGCAATATCACGACCTACCGGTCTAAGGATCCCTAGGGGATACTACTACCTGTGCGATGCAGGTTACCCAAATGCAGAAGGTTTTCTCGCCCCATATCGAGGTC
AACGATACCATCTCACCGAGTGGCGTGGAGAACATCCACCAACGACTCCAAAAGAGCTATTCAATATGAGACATTCTTCTGCTAGAAATGTTATTGAGAGAGCCTTTGGA
GCGTTGAAGGGTCGGTGGGCAATTCTACGAGGAAGGTCATTTTATCCTGTGCGAGTGCAATGTAGGATGATAACGGCATGTTGCCTCCTACATAATCTGATAATTCGAGA
AATGGGACCCGACCCATCCTACGAGGGACATCCTGGAGAAGTTGATTCAAACGGTATGAGTGTCGAGAACATAAACTTTGTGGAACCCACTGGGGCATGGACTGAATGGA
GAGACAGTCTTGCAAATCAAATGTTCGAAGATTGGAACGAGGAATGA
mRNA sequenceShow/hide mRNA sequence
ATGCGGTATTTCCTAACGACGGAGGAAGGGTGGATTCTGGCGGGCATTGGAGTGGGGAGGGGGATTGTGAAGCATGTAGTGGACGTGATTCCATCTTGCCGCTCCATAAA
CTCGACCCTCTCTCTTGCTGCCTCCAAATTTCCATCAATTTTCTTGTATCGTGGCTTCCTCCCATTCAAAAAAAGTGTTCTCTGTTCTGATTGTGAGGTAGCGTCTGAGG
CCTGGTATGAGGGAGGTGGTCGGATGAGTGGAGCTTGGTGGGGGCTTTCATTGTCCACTAAACAGGTTGCTGAGTTTATTGTTTCTACTGTTCTTACCTTTCCTTTAAAT
GAGGTTTACCTTCTAACTAATGCCTTTGCTTTGTGTTCCTTTTCTTCTGTTTATCTCCAATATCTTTATGGTACGTGGTCTACTCTTGAATGTGCATTTCATATGTCCAT
GATAACGAATCCTAGAGGTCTTCTTTCCTCATACCCTAGGCCAGGCCGGTCAGTGCAACTTGAAGCTCGTCACCTCCCAATGGATGACCTTGACATAGAACAAGACGAGT
TAATAGCAATACTCGCTATCATATGCACCACACAATACCAATGGATTTTAATTTTCAATTTTATTATGAGTCTGGGAGATCTCACCATAGCAAATCATTCACCCTACGTT
AGACACCAAATTAGGCAGCTGAACTTCTTTCGCCTAATTCATGAGAGTGACTTAGCTTGTCGGGAGAGCACTCGAATGGATAGGAGATGTTTCACCATATTATGCACCTT
GCTAAGAACAACAGGAGGTTTGTCAGGCACGAATTATGTGGACGTCGAAGAGATGGTGGGGATTTTCCTTCATATTCTAGCGCATGACGTTAAGAACAGAGTAGTTCGAC
GACACTTTGCTCGATCCGGTGAGACGGTGTCACGACACAATTGTTTGGGTGCGCTCGATGGTACATACATTAAGGTTAACGTAAGCGCAACCGATCGCCCTAGGTACAGG
ACTAGGAAGGGTGAGGTTGCTACTAATGTACTAGGTGTGTGCTCTCAAAATGGTGAGTTCATCTTCGTACTACCCGGGTGGGAAGGATCTGCTGCAGACGCTCGAGTACT
AAGAGATGCAATATCACGACCTACCGGTCTAAGGATCCCTAGGGGATACTACTACCTGTGCGATGCAGGTTACCCAAATGCAGAAGGTTTTCTCGCCCCATATCGAGGTC
AACGATACCATCTCACCGAGTGGCGTGGAGAACATCCACCAACGACTCCAAAAGAGCTATTCAATATGAGACATTCTTCTGCTAGAAATGTTATTGAGAGAGCCTTTGGA
GCGTTGAAGGGTCGGTGGGCAATTCTACGAGGAAGGTCATTTTATCCTGTGCGAGTGCAATGTAGGATGATAACGGCATGTTGCCTCCTACATAATCTGATAATTCGAGA
AATGGGACCCGACCCATCCTACGAGGGACATCCTGGAGAAGTTGATTCAAACGGTATGAGTGTCGAGAACATAAACTTTGTGGAACCCACTGGGGCATGGACTGAATGGA
GAGACAGTCTTGCAAATCAAATGTTCGAAGATTGGAACGAGGAATGA
Protein sequenceShow/hide protein sequence
MRYFLTTEEGWILAGIGVGRGIVKHVVDVIPSCRSINSTLSLAASKFPSIFLYRGFLPFKKSVLCSDCEVASEAWYEGGGRMSGAWWGLSLSTKQVAEFIVSTVLTFPLN
EVYLLTNAFALCSFSSVYLQYLYGTWSTLECAFHMSMITNPRGLLSSYPRPGRSVQLEARHLPMDDLDIEQDELIAILAIICTTQYQWILIFNFIMSLGDLTIANHSPYV
RHQIRQLNFFRLIHESDLACRESTRMDRRCFTILCTLLRTTGGLSGTNYVDVEEMVGIFLHILAHDVKNRVVRRHFARSGETVSRHNCLGALDGTYIKVNVSATDRPRYR
TRKGEVATNVLGVCSQNGEFIFVLPGWEGSAADARVLRDAISRPTGLRIPRGYYYLCDAGYPNAEGFLAPYRGQRYHLTEWRGEHPPTTPKELFNMRHSSARNVIERAFG
ALKGRWAILRGRSFYPVRVQCRMITACCLLHNLIIREMGPDPSYEGHPGEVDSNGMSVENINFVEPTGAWTEWRDSLANQMFEDWNEE