CuGenDBv2

Spg030789 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg030789
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Retrotransposon protein
Genome location: scaffold11:26165192..26166528
RNA-Seq Expression: Spg030789
Synteny: Spg030789
Gene Ontology terms: GO:0046872 - metal ion binding (molecular function)
InterPro domains: IPR027806 - Harbinger transposase-derived nuclease domain
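
The genome location in this record is written as scaffold:start..end. As a minimal sketch, it can be split into its parts as shown here. The helper name parse_location is hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which this page does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'scaffold:start..end' string into (scaffold, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    scaffold, start, end = match.groups()
    return scaffold, int(start), int(end)


scaffold, start, end = parse_location("scaffold11:26165192..26166528")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(scaffold, start, end, span)
```

Under that assumption the genomic span is longer than the 1128-nucleotide CDS listed in the Sequences section.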


Homology
GenBank top hits | e value | % identity | Alignment
KAA0035620.1 retrotransposon protein [Cucumis melo var. makuwa] | e value: 2.2e-151 | % identity: 68.65
Query:  MDPDELVAILTAFTAAQTTILLLLDLYMDDHRRIEHQSPYIRHHIRQLAFFRLIHESDLCCRESTRMDKRCFAILCTLLKTVVGLSSTEVVDVEEMVAMF
        MD  EL +I+ AF A+Q  +LL+L+L  +D +RI H     RH IRQLA+FR+IH SDL CR+STRMD+RCFAILC LL+T+ GL+STEVVDVEEMVAMF
Subjt:  MDPDELVAILTAFTAAQTTILLLLDLYMDDHRRIEHQSPYIRHHIRQLAFFRLIHESDLCCRESTRMDKRCFAILCTLLKTVVGLSSTEVVDVEEMVAMF

Query:  LHIVAHDVKNRVVRMQFARSGETVSRHFNTVLHAVLRLHDVLLKKPEPVTASCTNPRWKWFQNCLGALDGTYIKVNVAIVDRPRYRTRKGEIAMNVLVVC
        LHI+AHDVKNRV++ +F RSGET+SRHFN VL AV+RLH+ LLKKP+PV   CT+ RW+WF+NCLGALDGTYIKVNV   DR RYRTRKGE+A NVL VC
Subjt:  LHIVAHDVKNRVVRMQFARSGETVSRHFNTVLHAVLRLHDVLLKKPEPVTASCTNPRWKWFQNCLGALDGTYIKVNVAIVDRPRYRTRKGEIAMNVLVVC

Query:  DTKGDFTFVLPGWEGSAADSRVLRDAISRPNGLTVSK-------------EGFLAPYRGKLYHLSEWRDEGNAPTTPKEFFNMKHSSARNVIERAFGLLK
        DTKGDF +VL GWEGSAADSR+LRDA+SRPN L V K             EGFLAPYRG+ YHL EWR   NAP+T KEFFNMKHSSARNVIERAFG+LK
Subjt:  DTKGDFTFVLPGWEGSAADSRVLRDAISRPNGLTVSK-------------EGFLAPYRGKLYHLSEWRDEGNAPTTPKEFFNMKHSSARNVIERAFGLLK

Query:  GRWAILRGKSYYPVRIQGRTIAACCLLHNLINREMGDREIPDELDEVDSASVTTDGENINFIETSDEWSRWREELATQMFSNWELR
        GRWAILRGKSYYPV +Q RTI ACCLLHNLINREM + +I D +DEVDS   TT  ++I++IETS+EWS+WR+ LA +MF+ WELR
Subjt:  GRWAILRGKSYYPVRIQGRTIAACCLLHNLINREMGDREIPDELDEVDSASVTTDGENINFIETSDEWSRWREELATQMFSNWELR

KAA0046727.1 retrotransposon protein [Cucumis melo var. makuwa] | e value: 7.6e-152 | % identity: 68.91
Query:  MDPDELVAILTAFTAAQTTILLLLDLYMDDHRRIEHQSPYIRHHIRQLAFFRLIHESDLCCRESTRMDKRCFAILCTLLKTVVGLSSTEVVDVEEMVAMF
        MD  EL +I+ AF A+Q  +LL+L+L  +D +RI H     RH IRQLA+FR+IH SDL CR+STRMD+RCFAILC LL+T+ GL+STEVVDVEEMVAMF
Subjt:  MDPDELVAILTAFTAAQTTILLLLDLYMDDHRRIEHQSPYIRHHIRQLAFFRLIHESDLCCRESTRMDKRCFAILCTLLKTVVGLSSTEVVDVEEMVAMF

Query:  LHIVAHDVKNRVVRMQFARSGETVSRHFNTVLHAVLRLHDVLLKKPEPVTASCTNPRWKWFQNCLGALDGTYIKVNVAIVDRPRYRTRKGEIAMNVLVVC
        LHI+AHDVKNRV++ +F RSGET+SRHFN VL AV+RLHD LLKKP+PV   CT+ RW+WF+NCLGALDGTYIKVNV   DR RYRTRKGE+A NVL VC
Subjt:  LHIVAHDVKNRVVRMQFARSGETVSRHFNTVLHAVLRLHDVLLKKPEPVTASCTNPRWKWFQNCLGALDGTYIKVNVAIVDRPRYRTRKGEIAMNVLVVC

Query:  DTKGDFTFVLPGWEGSAADSRVLRDAISRPNGLTVSK-------------EGFLAPYRGKLYHLSEWRDEGNAPTTPKEFFNMKHSSARNVIERAFGLLK
        DTKGDF +VL GWEGSAADSR+LRDA+SRPN L V K             EGFLAPYRG+ YHL EWR   NAP+T KEFFNMKHSSARNVIERAFG+LK
Subjt:  DTKGDFTFVLPGWEGSAADSRVLRDAISRPNGLTVSK-------------EGFLAPYRGKLYHLSEWRDEGNAPTTPKEFFNMKHSSARNVIERAFGLLK

Query:  GRWAILRGKSYYPVRIQGRTIAACCLLHNLINREMGDREIPDELDEVDSASVTTDGENINFIETSDEWSRWREELATQMFSNWELR
        GRWAILRGKSYYPV +Q RTI ACCLLHNLINREM + +I D +DEVDS   TT  ++I++IETS+EWS+WR+ LA +MF+ WELR
Subjt:  GRWAILRGKSYYPVRIQGRTIAACCLLHNLINREMGDREIPDELDEVDSASVTTDGENINFIETSDEWSRWREELATQMFSNWELR

KAA0047510.1 retrotransposon protein [Cucumis melo var. makuwa] | e value: 3.2e-150 | % identity: 68.13
Query:  MDPDELVAILTAFTAAQTTILLLLDLYMDDHRRIEHQSPYIRHHIRQLAFFRLIHESDLCCRESTRMDKRCFAILCTLLKTVVGLSSTEVVDVEEMVAMF
        MD  EL +I+ AF A+Q  +LL+L+L  +D +RI H     RH IRQLA+FR+IH SDL CRESTRMD+RCFAILC LL+T+ GL+STEVVDVEEMVAMF
Subjt:  MDPDELVAILTAFTAAQTTILLLLDLYMDDHRRIEHQSPYIRHHIRQLAFFRLIHESDLCCRESTRMDKRCFAILCTLLKTVVGLSSTEVVDVEEMVAMF

Query:  LHIVAHDVKNRVVRMQFARSGETVSRHFNTVLHAVLRLHDVLLKKPEPVTASCTNPRWKWFQNCLGALDGTYIKVNVAIVDRPRYRTRKGEIAMNVLVVC
        LHI+AHDVKNRV++ +F RSGET+S HFN VL AV+RLH+ LLKKP+PV   CT+ RW+WF+NCLGALDGTYIKVNV   DR RYRTRKGE+A NVL VC
Subjt:  LHIVAHDVKNRVVRMQFARSGETVSRHFNTVLHAVLRLHDVLLKKPEPVTASCTNPRWKWFQNCLGALDGTYIKVNVAIVDRPRYRTRKGEIAMNVLVVC

Query:  DTKGDFTFVLPGWEGSAADSRVLRDAISRPNGLTVSK-------------EGFLAPYRGKLYHLSEWRDEGNAPTTPKEFFNMKHSSARNVIERAFGLLK
        D KGDF +VL GWEGSAADSR+LRDA+SRPNGL V K             EGFLAPYRG+ YHL EW    NAP+T KEFFNMKH SARNVIERAFG+LK
Subjt:  DTKGDFTFVLPGWEGSAADSRVLRDAISRPNGLTVSK-------------EGFLAPYRGKLYHLSEWRDEGNAPTTPKEFFNMKHSSARNVIERAFGLLK

Query:  GRWAILRGKSYYPVRIQGRTIAACCLLHNLINREMGDREIPDELDEVDSASVTTDGENINFIETSDEWSRWREELATQMFSNWELR
        GRWAILRGKSYYPV +Q RTI ACCLLHNLINREM + +I D +DEVDS   TT  ++I++IETS+EWS+WR++LA +MF+ WELR
Subjt:  GRWAILRGKSYYPVRIQGRTIAACCLLHNLINREMGDREIPDELDEVDSASVTTDGENINFIETSDEWSRWREELATQMFSNWELR

KAA0048361.1 retrotransposon protein [Cucumis melo var. makuwa] | e value: 4.9e-151 | % identity: 68.65
Query:  MDPDELVAILTAFTAAQTTILLLLDLYMDDHRRIEHQSPYIRHHIRQLAFFRLIHESDLCCRESTRMDKRCFAILCTLLKTVVGLSSTEVVDVEEMVAMF
        MD  EL +I+ AF A+Q  +LL+L+L  +D +RI H     RH IRQLA+FR+IH SDL CR+STRMD+RCFAILC LL+T+ GL+STEVVDVEEMVAMF
Subjt:  MDPDELVAILTAFTAAQTTILLLLDLYMDDHRRIEHQSPYIRHHIRQLAFFRLIHESDLCCRESTRMDKRCFAILCTLLKTVVGLSSTEVVDVEEMVAMF

Query:  LHIVAHDVKNRVVRMQFARSGETVSRHFNTVLHAVLRLHDVLLKKPEPVTASCTNPRWKWFQNCLGALDGTYIKVNVAIVDRPRYRTRKGEIAMNVLVVC
        LHI+AHDVKNRV++ +F RSGET+SRHFN VL AV+RLH+ LLKKP+PV   CT+ RWKWF+NCLGALDGTYIKVNV   DR RYRTRKGE+A NVL VC
Subjt:  LHIVAHDVKNRVVRMQFARSGETVSRHFNTVLHAVLRLHDVLLKKPEPVTASCTNPRWKWFQNCLGALDGTYIKVNVAIVDRPRYRTRKGEIAMNVLVVC

Query:  DTKGDFTFVLPGWEGSAADSRVLRDAISRPNGLTVSK-------------EGFLAPYRGKLYHLSEWRDEGNAPTTPKEFFNMKHSSARNVIERAFGLLK
        DTKGDF +VL GWEG AADSR+LRDA+SRPN L V K             EGFLAPYRG+ YHL EWR   NAP+T KEFFNMKHSSARNVIERAFG+LK
Subjt:  DTKGDFTFVLPGWEGSAADSRVLRDAISRPNGLTVSK-------------EGFLAPYRGKLYHLSEWRDEGNAPTTPKEFFNMKHSSARNVIERAFGLLK

Query:  GRWAILRGKSYYPVRIQGRTIAACCLLHNLINREMGDREIPDELDEVDSASVTTDGENINFIETSDEWSRWREELATQMFSNWELR
        GRWAILRGKSYYPV +Q RTI ACCLLHNLINREM + +I D +DEVDS   TT  ++I++IETS+EWS+WR+ LA +MF+ WELR
Subjt:  GRWAILRGKSYYPVRIQGRTIAACCLLHNLINREMGDREIPDELDEVDSASVTTDGENINFIETSDEWSRWREELATQMFSNWELR

KAA0065736.1 retrotransposon protein [Cucumis melo var. makuwa] | e value: 2.9e-151 | % identity: 68.13
Query:  MDPDELVAILTAFTAAQTTILLLLDLYMDDHRRIEHQSPYIRHHIRQLAFFRLIHESDLCCRESTRMDKRCFAILCTLLKTVVGLSSTEVVDVEEMVAMF
        MD  EL +I+ AF A+Q  +LL+L+L  +D +RI H     RH IRQLA+FR+IHESDL CR+STRMD+RCFAILC LL+T+ GL+STEVVDVEEMVAMF
Subjt:  MDPDELVAILTAFTAAQTTILLLLDLYMDDHRRIEHQSPYIRHHIRQLAFFRLIHESDLCCRESTRMDKRCFAILCTLLKTVVGLSSTEVVDVEEMVAMF

Query:  LHIVAHDVKNRVVRMQFARSGETVSRHFNTVLHAVLRLHDVLLKKPEPVTASCTNPRWKWFQNCLGALDGTYIKVNVAIVDRPRYRTRKGEIAMNVLVVC
        LHI+AHDVKNRV++ +F RSGET+ RHFN +  AV+RLHD LLKKP+PV   CT+ RW+WF+NCLGALDGTYIKVNV   D+ RYRTRKGE+A NVL VC
Subjt:  LHIVAHDVKNRVVRMQFARSGETVSRHFNTVLHAVLRLHDVLLKKPEPVTASCTNPRWKWFQNCLGALDGTYIKVNVAIVDRPRYRTRKGEIAMNVLVVC

Query:  DTKGDFTFVLPGWEGSAADSRVLRDAISRPNGLTVSK-------------EGFLAPYRGKLYHLSEWRDEGNAPTTPKEFFNMKHSSARNVIERAFGLLK
        DTKGDF +VL GWEGSAADSR+LRDA+SRPN L V K             EGFLAPYRG+ YHL EWRD  NAP+T KEFFNMKHSSARNVIERAFG+LK
Subjt:  DTKGDFTFVLPGWEGSAADSRVLRDAISRPNGLTVSK-------------EGFLAPYRGKLYHLSEWRDEGNAPTTPKEFFNMKHSSARNVIERAFGLLK

Query:  GRWAILRGKSYYPVRIQGRTIAACCLLHNLINREMGDREIPDELDEVDSASVTTDGENINFIETSDEWSRWREELATQMFSNWELR
        GRWAILRGKSYYPV +Q RTI ACCLL+NLINREM + +I D +DEVDS   TT  ++I++IETS+EWS+WR++LA +MF+ WELR
Subjt:  GRWAILRGKSYYPVRIQGRTIAACCLLHNLINREMGDREIPDELDEVDSASVTTDGENINFIETSDEWSRWREELATQMFSNWELR

TrEMBL top hits | e value | % identity | Alignment
A0A5A7TWH8 Retrotransposon protein | e value: 1.5e-150 | % identity: 68.13
Query:  MDPDELVAILTAFTAAQTTILLLLDLYMDDHRRIEHQSPYIRHHIRQLAFFRLIHESDLCCRESTRMDKRCFAILCTLLKTVVGLSSTEVVDVEEMVAMF
        MD  EL +I+ AF A+Q  +LL+L+L  +D +RI H     RH IRQLA+FR+IH SDL CRESTRMD+RCFAILC LL+T+ GL+STEVVDVEEMVAMF
Subjt:  MDPDELVAILTAFTAAQTTILLLLDLYMDDHRRIEHQSPYIRHHIRQLAFFRLIHESDLCCRESTRMDKRCFAILCTLLKTVVGLSSTEVVDVEEMVAMF

Query:  LHIVAHDVKNRVVRMQFARSGETVSRHFNTVLHAVLRLHDVLLKKPEPVTASCTNPRWKWFQNCLGALDGTYIKVNVAIVDRPRYRTRKGEIAMNVLVVC
        LHI+AHDVKNRV++ +F RSGET+S HFN VL AV+RLH+ LLKKP+PV   CT+ RW+WF+NCLGALDGTYIKVNV   DR RYRTRKGE+A NVL VC
Subjt:  LHIVAHDVKNRVVRMQFARSGETVSRHFNTVLHAVLRLHDVLLKKPEPVTASCTNPRWKWFQNCLGALDGTYIKVNVAIVDRPRYRTRKGEIAMNVLVVC

Query:  DTKGDFTFVLPGWEGSAADSRVLRDAISRPNGLTVSK-------------EGFLAPYRGKLYHLSEWRDEGNAPTTPKEFFNMKHSSARNVIERAFGLLK
        D KGDF +VL GWEGSAADSR+LRDA+SRPNGL V K             EGFLAPYRG+ YHL EW    NAP+T KEFFNMKH SARNVIERAFG+LK
Subjt:  DTKGDFTFVLPGWEGSAADSRVLRDAISRPNGLTVSK-------------EGFLAPYRGKLYHLSEWRDEGNAPTTPKEFFNMKHSSARNVIERAFGLLK

Query:  GRWAILRGKSYYPVRIQGRTIAACCLLHNLINREMGDREIPDELDEVDSASVTTDGENINFIETSDEWSRWREELATQMFSNWELR
        GRWAILRGKSYYPV +Q RTI ACCLLHNLINREM + +I D +DEVDS   TT  ++I++IETS+EWS+WR++LA +MF+ WELR
Subjt:  GRWAILRGKSYYPVRIQGRTIAACCLLHNLINREMGDREIPDELDEVDSASVTTDGENINFIETSDEWSRWREELATQMFSNWELR

A0A5A7TXW1 Retrotransposon protein | e value: 3.7e-152 | % identity: 68.91
Query:  MDPDELVAILTAFTAAQTTILLLLDLYMDDHRRIEHQSPYIRHHIRQLAFFRLIHESDLCCRESTRMDKRCFAILCTLLKTVVGLSSTEVVDVEEMVAMF
        MD  EL +I+ AF A+Q  +LL+L+L  +D +RI H     RH IRQLA+FR+IH SDL CR+STRMD+RCFAILC LL+T+ GL+STEVVDVEEMVAMF
Subjt:  MDPDELVAILTAFTAAQTTILLLLDLYMDDHRRIEHQSPYIRHHIRQLAFFRLIHESDLCCRESTRMDKRCFAILCTLLKTVVGLSSTEVVDVEEMVAMF

Query:  LHIVAHDVKNRVVRMQFARSGETVSRHFNTVLHAVLRLHDVLLKKPEPVTASCTNPRWKWFQNCLGALDGTYIKVNVAIVDRPRYRTRKGEIAMNVLVVC
        LHI+AHDVKNRV++ +F RSGET+SRHFN VL AV+RLHD LLKKP+PV   CT+ RW+WF+NCLGALDGTYIKVNV   DR RYRTRKGE+A NVL VC
Subjt:  LHIVAHDVKNRVVRMQFARSGETVSRHFNTVLHAVLRLHDVLLKKPEPVTASCTNPRWKWFQNCLGALDGTYIKVNVAIVDRPRYRTRKGEIAMNVLVVC

Query:  DTKGDFTFVLPGWEGSAADSRVLRDAISRPNGLTVSK-------------EGFLAPYRGKLYHLSEWRDEGNAPTTPKEFFNMKHSSARNVIERAFGLLK
        DTKGDF +VL GWEGSAADSR+LRDA+SRPN L V K             EGFLAPYRG+ YHL EWR   NAP+T KEFFNMKHSSARNVIERAFG+LK
Subjt:  DTKGDFTFVLPGWEGSAADSRVLRDAISRPNGLTVSK-------------EGFLAPYRGKLYHLSEWRDEGNAPTTPKEFFNMKHSSARNVIERAFGLLK

Query:  GRWAILRGKSYYPVRIQGRTIAACCLLHNLINREMGDREIPDELDEVDSASVTTDGENINFIETSDEWSRWREELATQMFSNWELR
        GRWAILRGKSYYPV +Q RTI ACCLLHNLINREM + +I D +DEVDS   TT  ++I++IETS+EWS+WR+ LA +MF+ WELR
Subjt:  GRWAILRGKSYYPVRIQGRTIAACCLLHNLINREMGDREIPDELDEVDSASVTTDGENINFIETSDEWSRWREELATQMFSNWELR

A0A5A7U476 Retrotransposon protein | e value: 2.4e-151 | % identity: 68.65
Query:  MDPDELVAILTAFTAAQTTILLLLDLYMDDHRRIEHQSPYIRHHIRQLAFFRLIHESDLCCRESTRMDKRCFAILCTLLKTVVGLSSTEVVDVEEMVAMF
        MD  EL +I+ AF A+Q  +LL+L+L  +D +RI H     RH IRQLA+FR+IH SDL CR+STRMD+RCFAILC LL+T+ GL+STEVVDVEEMVAMF
Subjt:  MDPDELVAILTAFTAAQTTILLLLDLYMDDHRRIEHQSPYIRHHIRQLAFFRLIHESDLCCRESTRMDKRCFAILCTLLKTVVGLSSTEVVDVEEMVAMF

Query:  LHIVAHDVKNRVVRMQFARSGETVSRHFNTVLHAVLRLHDVLLKKPEPVTASCTNPRWKWFQNCLGALDGTYIKVNVAIVDRPRYRTRKGEIAMNVLVVC
        LHI+AHDVKNRV++ +F RSGET+SRHFN VL AV+RLH+ LLKKP+PV   CT+ RWKWF+NCLGALDGTYIKVNV   DR RYRTRKGE+A NVL VC
Subjt:  LHIVAHDVKNRVVRMQFARSGETVSRHFNTVLHAVLRLHDVLLKKPEPVTASCTNPRWKWFQNCLGALDGTYIKVNVAIVDRPRYRTRKGEIAMNVLVVC

Query:  DTKGDFTFVLPGWEGSAADSRVLRDAISRPNGLTVSK-------------EGFLAPYRGKLYHLSEWRDEGNAPTTPKEFFNMKHSSARNVIERAFGLLK
        DTKGDF +VL GWEG AADSR+LRDA+SRPN L V K             EGFLAPYRG+ YHL EWR   NAP+T KEFFNMKHSSARNVIERAFG+LK
Subjt:  DTKGDFTFVLPGWEGSAADSRVLRDAISRPNGLTVSK-------------EGFLAPYRGKLYHLSEWRDEGNAPTTPKEFFNMKHSSARNVIERAFGLLK

Query:  GRWAILRGKSYYPVRIQGRTIAACCLLHNLINREMGDREIPDELDEVDSASVTTDGENINFIETSDEWSRWREELATQMFSNWELR
        GRWAILRGKSYYPV +Q RTI ACCLLHNLINREM + +I D +DEVDS   TT  ++I++IETS+EWS+WR+ LA +MF+ WELR
Subjt:  GRWAILRGKSYYPVRIQGRTIAACCLLHNLINREMGDREIPDELDEVDSASVTTDGENINFIETSDEWSRWREELATQMFSNWELR

A0A5A7VHB0 Retrotransposon protein | e value: 1.4e-151 | % identity: 68.13
Query:  MDPDELVAILTAFTAAQTTILLLLDLYMDDHRRIEHQSPYIRHHIRQLAFFRLIHESDLCCRESTRMDKRCFAILCTLLKTVVGLSSTEVVDVEEMVAMF
        MD  EL +I+ AF A+Q  +LL+L+L  +D +RI H     RH IRQLA+FR+IHESDL CR+STRMD+RCFAILC LL+T+ GL+STEVVDVEEMVAMF
Subjt:  MDPDELVAILTAFTAAQTTILLLLDLYMDDHRRIEHQSPYIRHHIRQLAFFRLIHESDLCCRESTRMDKRCFAILCTLLKTVVGLSSTEVVDVEEMVAMF

Query:  LHIVAHDVKNRVVRMQFARSGETVSRHFNTVLHAVLRLHDVLLKKPEPVTASCTNPRWKWFQNCLGALDGTYIKVNVAIVDRPRYRTRKGEIAMNVLVVC
        LHI+AHDVKNRV++ +F RSGET+ RHFN +  AV+RLHD LLKKP+PV   CT+ RW+WF+NCLGALDGTYIKVNV   D+ RYRTRKGE+A NVL VC
Subjt:  LHIVAHDVKNRVVRMQFARSGETVSRHFNTVLHAVLRLHDVLLKKPEPVTASCTNPRWKWFQNCLGALDGTYIKVNVAIVDRPRYRTRKGEIAMNVLVVC

Query:  DTKGDFTFVLPGWEGSAADSRVLRDAISRPNGLTVSK-------------EGFLAPYRGKLYHLSEWRDEGNAPTTPKEFFNMKHSSARNVIERAFGLLK
        DTKGDF +VL GWEGSAADSR+LRDA+SRPN L V K             EGFLAPYRG+ YHL EWRD  NAP+T KEFFNMKHSSARNVIERAFG+LK
Subjt:  DTKGDFTFVLPGWEGSAADSRVLRDAISRPNGLTVSK-------------EGFLAPYRGKLYHLSEWRDEGNAPTTPKEFFNMKHSSARNVIERAFGLLK

Query:  GRWAILRGKSYYPVRIQGRTIAACCLLHNLINREMGDREIPDELDEVDSASVTTDGENINFIETSDEWSRWREELATQMFSNWELR
        GRWAILRGKSYYPV +Q RTI ACCLL+NLINREM + +I D +DEVDS   TT  ++I++IETS+EWS+WR++LA +MF+ WELR
Subjt:  GRWAILRGKSYYPVRIQGRTIAACCLLHNLINREMGDREIPDELDEVDSASVTTDGENINFIETSDEWSRWREELATQMFSNWELR

A0A5D3BDX0 Retrotransposon protein | e value: 1.1e-151 | % identity: 68.65
Query:  MDPDELVAILTAFTAAQTTILLLLDLYMDDHRRIEHQSPYIRHHIRQLAFFRLIHESDLCCRESTRMDKRCFAILCTLLKTVVGLSSTEVVDVEEMVAMF
        MD  EL +I+ AF A+Q  +LL+L+L  +D +RI H     RH IRQLA+FR+IH SDL CR+STRMD+RCFAILC LL+T+ GL+STEVVDVEEMVAMF
Subjt:  MDPDELVAILTAFTAAQTTILLLLDLYMDDHRRIEHQSPYIRHHIRQLAFFRLIHESDLCCRESTRMDKRCFAILCTLLKTVVGLSSTEVVDVEEMVAMF

Query:  LHIVAHDVKNRVVRMQFARSGETVSRHFNTVLHAVLRLHDVLLKKPEPVTASCTNPRWKWFQNCLGALDGTYIKVNVAIVDRPRYRTRKGEIAMNVLVVC
        LHI+AHDVKNRV++ +F RSGET+SRHFN VL AV+RLH+ LLKKP+PV   CT+ RW+WF+NCLGALDGTYIKVNV   DR RYRTRKGE+A NVL VC
Subjt:  LHIVAHDVKNRVVRMQFARSGETVSRHFNTVLHAVLRLHDVLLKKPEPVTASCTNPRWKWFQNCLGALDGTYIKVNVAIVDRPRYRTRKGEIAMNVLVVC

Query:  DTKGDFTFVLPGWEGSAADSRVLRDAISRPNGLTVSK-------------EGFLAPYRGKLYHLSEWRDEGNAPTTPKEFFNMKHSSARNVIERAFGLLK
        DTKGDF +VL GWEGSAADSR+LRDA+SRPN L V K             EGFLAPYRG+ YHL EWR   NAP+T KEFFNMKHSSARNVIERAFG+LK
Subjt:  DTKGDFTFVLPGWEGSAADSRVLRDAISRPNGLTVSK-------------EGFLAPYRGKLYHLSEWRDEGNAPTTPKEFFNMKHSSARNVIERAFGLLK

Query:  GRWAILRGKSYYPVRIQGRTIAACCLLHNLINREMGDREIPDELDEVDSASVTTDGENINFIETSDEWSRWREELATQMFSNWELR
        GRWAILRGKSYYPV +Q RTI ACCLLHNLINREM + +I D +DEVDS   TT  ++I++IETS+EWS+WR+ LA +MF+ WELR
Subjt:  GRWAILRGKSYYPVRIQGRTIAACCLLHNLINREMGDREIPDELDEVDSASVTTDGENINFIETSDEWSRWREELATQMFSNWELR

SwissProt top hits | e value | % identity | Alignment
B0BN95 Putative nuclease HARBI1 | e value: 3.6e-11 | % identity: 29.63
Query:  LGALDGTYIKVNVAIVDRPRYRTRKGEIAMNVLVVCDTKGDFTFVLPGWEGSAADSRVL-RDAISRPNGLTVSKEGFLAPYRGKLYHLSEW-RDEGNAPT
        +GA+D  ++ +     +   Y  RKG  ++N LVVCD +G    V   W GS  D  VL + ++S      + K+ +L       + L  W     + P 
Subjt:  LGALDGTYIKVNVAIVDRPRYRTRKGEIAMNVLVVCDTKGDFTFVLPGWEGSAADSRVL-RDAISRPNGLTVSKEGFLAPYRGKLYHLSEW-RDEGNAPT

Query:  TPKEF-FNMKHSSARNVIERAFGLLKGRWAIL---RGKSYYPVRIQGRTIAACCLLHNLINREMGDREIPDELDEVDSASVT------TDGENINFIETS
        TP E+ +N  HS+  +VIE+    L  R+  L   +G   Y        I ACC+LHN I+ E G         +V S+ VT       +GE+       
Subjt:  TPKEF-FNMKHSSARNVIERAFGLLKGRWAIL---RGKSYYPVRIQGRTIAACCLLHNLINREMGDREIPDELDEVDSASVT------TDGENINFIETS

Query:  DEWSRWREELATQMFS
         E  R R+EL    FS
Subjt:  DEWSRWREELATQMFS

Q17QR8 Putative nuclease HARBI1 | e value: 8.5e-13 | % identity: 30.82
Query:  LGALDGTYIKVNVAIVDRPRYRTRKGEIAMNVLVVCDTKGDFTFVLPGWEGSAADSRVL-RDAISRPNGLTVSKEGFLAPYRGKLYHLSEW-RDEGNAPT
        +G +D  ++ +     +   Y  RKG  ++N L+VCD +G    V   W GS  D  VL + ++S      + KE +L       + L  W     + P 
Subjt:  LGALDGTYIKVNVAIVDRPRYRTRKGEIAMNVLVVCDTKGDFTFVLPGWEGSAADSRVL-RDAISRPNGLTVSKEGFLAPYRGKLYHLSEW-RDEGNAPT

Query:  TPKEF-FNMKHSSARNVIERAFGLLKGRWAIL---RGKSYYPVRIQGRTIAACCLLHNL
        TP E+ +NM HS+  +VIE+ F  L  R+  L   +G   Y        I ACC+LHN+
Subjt:  TPKEF-FNMKHSSARNVIERAFGLLKGRWAIL---RGKSYYPVRIQGRTIAACCLLHNL

Q5U538 Putative nuclease HARBI1 | e value: 8.8e-10 | % identity: 25.82
Query:  LGALDGTYIKVNVAIVDRPRYRTRKGEIAMNVLVVCDTKGDFTFVLPGWEGSAADSRVLRDAISRPNGL---TVSKEGFLAPYRGKLYHLSEW-RDEGNA
        LG +D T + +     +   Y   +G  ++N L+VCD +G   +      GS  D+ VL    S  +GL    + K+G+L       + L  W       
Subjt:  LGALDGTYIKVNVAIVDRPRYRTRKGEIAMNVLVVCDTKGDFTFVLPGWEGSAADSRVLRDAISRPNGL---TVSKEGFLAPYRGKLYHLSEW-RDEGNA

Query:  PTTPKEF-FNMKHSSARNVIERAFGLLKGRWAILRGKS---YYPVRIQGRTIAACCLLHNL-INREMGDREIPDELDEVDSASVTTDGENINFIETSDEW
        P +P ++ +NM H++  +V+ER    L+ R+  L G      Y      + + ACC+LHN+ +  ++      D + E  + S+  + E ++      E 
Subjt:  PTTPKEF-FNMKHSSARNVIERAFGLLKGRWAILRGKS---YYPVRIQGRTIAACCLLHNL-INREMGDREIPDELDEVDSASVTTDGENINFIETSDEW

Query:  SRWREELATQMFS
         R R+EL    FS
Subjt:  SRWREELATQMFS

Q8BR93 Putative nuclease HARBI1 | e value: 2.3e-10 | % identity: 27.23
Query:  LGALDGTYIKVNVAIVDRPRYRTRKGEIAMNVLVVCDTKGDFTFVLPGWEGSAADSRVL-RDAISRPNGLTVSKEGFLAPYRGKLYHLSEW-RDEGNAPT
        +G  D  ++ +     +   Y  RKG  ++N LVVCD +G    V   W GS  D  VL R +++      + K+ +L       + L  W       P 
Subjt:  LGALDGTYIKVNVAIVDRPRYRTRKGEIAMNVLVVCDTKGDFTFVLPGWEGSAADSRVL-RDAISRPNGLTVSKEGFLAPYRGKLYHLSEW-RDEGNAPT

Query:  TPKEF-FNMKHSSARNVIERAFGLLKGRWAIL---RGKSYYPVRIQGRTIAACCLLHNLI---NREMGDREIPDELDEVDSASVTTDGENINFIETSDEW
        T  E+ +N  HS+  +VIER    L  R+  L   +G   Y        I ACC+LHN+      ++    +P  +D+        +GE+ +      E 
Subjt:  TPKEF-FNMKHSSARNVIERAFGLLKGRWAIL---RGKSYYPVRIQGRTIAACCLLHNLI---NREMGDREIPDELDEVDSASVTTDGENINFIETSDEW

Query:  SRWREELATQMFS
         R R+EL    FS
Subjt:  SRWREELATQMFS

Q96MB7 Putative nuclease HARBI1 | e value: 8.5e-13 | % identity: 30.19
Query:  LGALDGTYIKVNVAIVDRPRYRTRKGEIAMNVLVVCDTKGDFTFVLPGWEGSAADSRVL-RDAISRPNGLTVSKEGFLAPYRGKLYHLSEW-RDEGNAPT
        +G +D  ++ +     +   Y  RKG  ++N L+VCD +G    V   W GS  D  VL + ++S      + K+ +L       + L  W     + P 
Subjt:  LGALDGTYIKVNVAIVDRPRYRTRKGEIAMNVLVVCDTKGDFTFVLPGWEGSAADSRVL-RDAISRPNGLTVSKEGFLAPYRGKLYHLSEW-RDEGNAPT

Query:  TPKEF-FNMKHSSARNVIERAFGLLKGRWAIL---RGKSYYPVRIQGRTIAACCLLHNL
        TP E+ +NM HS+  +VIE+ F  L  R+  L   +G   Y        I ACC+LHN+
Subjt:  TPKEF-FNMKHSSARNVIERAFGLLKGRWAIL---RGKSYYPVRIQGRTIAACCLLHNL

Arabidopsis top hits | e value | % identity | Alignment
AT1G43722.1 unknown protein | e value: 1.0e-29 | % identity: 31.7
Query:  FRLIHESDLCCRESTRMDKRCFAILCTLLKTVVGLSSTEVVDVEEMVAMFLHIVAHDVKNRVVRMQFARSGETVSRHFNTVLHAVLRLHDVLLKKPE---
        +R + +    C +  RM   CF  LC +L+T   L  T  + +EE VAMFL I  H+   R V ++F R+ ETV R F  VL A   L    ++ P    
Subjt:  FRLIHESDLCCRESTRMDKRCFAILCTLLKTVVGLSSTEVVDVEEMVAMFLHIVAHDVKNRVVRMQFARSGETVSRHFNTVLHAVLRLHDVLLKKPE---

Query:  ----PVTASCTNPRWKWFQNCLGALDGTYIKVNVAIVDRPRYRTRKGEIAMNVLVVCDTKGDFTFVLPGWEGSAADSRVLRDAISRPNGLTV--------
            P         W +F   +GA+DGT++ V V    +  Y  R    ++N++ +CD K  FT++  G  GS  D+ VL+ A    +   +        
Subjt:  ----PVTASCTNPRWKWFQNCLGALDGTYIKVNVAIVDRPRYRTRKGEIAMNVLVVCDTKGDFTFVLPGWEGSAADSRVLRDAISRPNGLTV--------

Query:  ------SKEGFLAPYRGK-----LYHLSEWRDEGNAPTTPKEFFNMKHSSARNVIERAFGLLKGR
              +K+G LAPYR        YH+S++   G  P    E FN  H+S R+VIER F + K +
Subjt:  ------SKEGFLAPYRGK-----LYHLSEWRDEGNAPTTPKEFFNMKHSSARNVIERAFGLLKGR

AT5G12010.1 unknown protein | e value: 3.1e-18 | % identity: 28.33
Query:  RESTRMDKRCFAILCTLLKTVVGLSST---EVVDVEEMVAMFLHIVAHDVKNRVVRMQFARSGETVSRHFNTVLHAVLRLHDVLLKK--PEPVTASCTNP
        +++ RM K  F ++C  L + V    T     + V + VA+ +  +A     R+V  +F   G  +S     VL     + DVL+ K    P   S  N 
Subjt:  RESTRMDKRCFAILCTLLKTVVGLSST---EVVDVEEMVAMFLHIVAHDVKNRVVRMQFARSGETVSRHFNTVLHAVLRLHDVLLKK--PEPVTASCTNP

Query:  RWKW-----FQNCLGALDGTYI-----KVNVAIVDRPRY--RTRKGEIAMNVLVVCDTKGDFTFVLPGWEGSAADSRVLRDAI--SRPNGLTVSKEGFLA
        R ++       N +G++  T+I     K++VA     R+  R +K   ++ +  V + KG FT +  GW GS  D +VL  ++   R N   + K  ++A
Subjt:  RWKW-----FQNCLGALDGTYI-----KVNVAIVDRPRY--RTRKGEIAMNVLVVCDTKGDFTFVLPGWEGSAADSRVLRDAI--SRPNGLTVSKEGFLA

Query:  PYRGKLYHLSEW----RDEGNAPTTPKEFFNMKHSSARNVIERAFGLLKGRWAILRGKSYYPVRIQGRTIAACCLLHNLINREMGDREIPDEL
           G  + L +W      + N  T  +  FN K S  + V + AFG LKGRWA L+ ++   ++     + ACC+LHN+   EM + ++  EL
Subjt:  PYRGKLYHLSEW----RDEGNAPTTPKEFFNMKHSSARNVIERAFGLLKGRWAILRGKSYYPVRIQGRTIAACCLLHNLINREMGDREIPDEL

AT5G28950.1 unknown protein | e value: 2.4e-15 | % identity: 45.12
Query:  WKWFQNCLGALDGTYIKVNVAIVDRPRYRTRKGEIAMNVLVVCDTKGDFTFVLPGWEGSAADSRVLRDAISR-PNGLTVSKE
        + +F++C+GA+D T+I   V+    P +R RKG+I+ N+L  C+   +F +VL GWEGSA DS+VL DA++R  N L V +E
Subjt:  WKWFQNCLGALDGTYIKVNVAIVDRPRYRTRKGEIAMNVLVVCDTKGDFTFVLPGWEGSAADSRVLRDAISR-PNGLTVSKE

AT5G35695.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912) | e value: 2.0e-25 | % identity: 33.66
Query:  RYRTRKGEIAMNVLVVCDTKGDFTFVLPGWEGSAADSRVLRDAISR---PNGLTVSKEGFLAPYRGKLYHLSEWRDEGNAPTTPKEFFNMKHSSARNVIE
        R  +RK  + M+          F +VL GWEGSA DSRVL DA+ +    +    ++  FLAP+RG  YHL E+  +   P TP E FN++H S RNVIE
Subjt:  RYRTRKGEIAMNVLVVCDTKGDFTFVLPGWEGSAADSRVLRDAISR---PNGLTVSKEGFLAPYRGKLYHLSEWRDEGNAPTTPKEFFNMKHSSARNVIE

Query:  RAFGLLKGRWAILRGKSYYPVRIQGRTIAACCLLHNLINRE--MGDREIPDEL-DEVDSASVTTDGENINFIETS----------DEWSRWREELATQMF
        R FG+ K R+AI +    +  + Q   +  C  LHN + +E    + + PDE+ +E D  +   +  N N I+            +  + WR+ +A  M+
Subjt:  RAFGLLKGRWAILRGKSYYPVRIQGRTIAACCLLHNLINRE--MGDREIPDEL-DEVDSASVTTDGENINFIETS----------DEWSRWREELATQMF

Query:  SN
         +
Subjt:  SN

AT5G41980.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912) | e value: 5.8e-41 | % identity: 31.03
Query:  FRLIHESDLCCRESTRMDKRCFAILCTLLKTVVGLSSTEVVDVEEMVAMFLHIVAHDVKNRVVRMQFARSGETVSRHFNTVLHAVLRL-HDVLLKKPEPV
        +++++  +  C E+ RMDK  F  LC LL+T   L  T  + +E  +A+FL I+ H+++ R V+  F  SGET+SRHFN VL+AV+ +  D         
Subjt:  FRLIHESDLCCRESTRMDKRCFAILCTLLKTVVGLSSTEVVDVEEMVAMFLHIVAHDVKNRVVRMQFARSGETVSRHFNTVLHAVLRL-HDVLLKKPEPV

Query:  TASCTNPRWKWFQNCLGALDGTYIKVNVAIVDRPRYRTRKGEIAMNVLVVCDTKGDFTFVLPGWEGSAADSRVLRDAISRPNGLTVSK------------
        T    +P   +F++C+G +D  +I V V + ++  +R   G +  NVL        F +VL GWEGSA+D +VL  A++R N L V +            
Subjt:  TASCTNPRWKWFQNCLGALDGTYIKVNVAIVDRPRYRTRKGEIAMNVLVVCDTKGDFTFVLPGWEGSAADSRVLRDAISRPNGLTVSK------------

Query:  -EGFLAPYRGKLYHLSEWRDEGNAPTTPKEFFNMKHSSARNVIERAFGLLKGRWAILRGKSYYPVRIQGRTIAACCLLHNLINREMGD----REIPDE--
          GF+APY G            N+    KE FN +H      I R FG LK R+ IL     YP++ Q + + A C LHN +  E  D    R   +E  
Subjt:  -EGFLAPYRGKLYHLSEWRDEGNAPTTPKEFFNMKHSSARNVIERAFGLLKGRWAILRGKSYYPVRIQGRTIAACCLLHNLINREMGD----REIPDE--

Query:  LDEVDSASVTTDGENINFI--------ETSDEWSRWREELATQMFSNW
         +  +   V  + E +  +        E  ++  R R+E+A+++++++
Subjt:  LDEVDSASVTTDGENINFI--------ETSDEWSRWREELATQMFSNW


Sequences
CDS sequence
ATGGATCCTGATGAACTAGTGGCCATATTGACAGCCTTCACCGCAGCTCAAACTACAATTCTCCTACTATTAGATCTATATATGGACGACCATAGGAGAATAGAACACCA
ATCACCCTACATCCGACACCACATTAGGCAACTAGCATTCTTCAGGCTTATACACGAGTCTGACTTATGTTGTCGAGAGAGTACACGGATGGACAAAAGATGTTTTGCCA
TACTTTGCACCTTATTAAAAACAGTTGTCGGTTTATCCAGTACGGAGGTCGTAGATGTAGAAGAGATGGTTGCCATGTTCTTGCACATTGTAGCCCACGACGTAAAGAAC
CGAGTAGTTCGTATGCAGTTCGCTAGGTCTGGTGAGACAGTTTCTAGGCATTTCAACACTGTCCTTCATGCGGTGTTACGATTGCATGATGTTCTCTTAAAAAAACCTGA
GCCAGTCACAGCCTCTTGTACGAATCCAAGGTGGAAATGGTTTCAGAATTGCCTTGGTGCGTTAGATGGAACATACATCAAGGTCAATGTTGCTATTGTCGATCGCCCGA
GGTATAGAACAAGGAAAGGTGAAATTGCAATGAACGTGCTTGTTGTCTGTGATACAAAAGGAGACTTCACATTCGTCTTACCAGGGTGGGAAGGGTCTGCCGCTGATTCT
CGAGTTCTTAGAGATGCAATATCAAGACCAAACGGACTCACAGTTTCGAAGGAGGGTTTCCTGGCACCGTATAGAGGGAAACTGTACCACCTCTCTGAATGGCGTGATGA
GGGGAATGCACCAACTACTCCAAAAGAATTCTTTAACATGAAGCATTCATCTGCGAGGAATGTGATCGAGAGAGCATTCGGTTTGTTGAAAGGAAGGTGGGCTATCCTCC
GAGGGAAATCGTACTATCCAGTTCGAATTCAAGGACGGACCATCGCAGCATGCTGCTTACTTCACAATCTTATTAATAGAGAGATGGGTGACCGTGAAATTCCTGATGAG
CTGGATGAGGTGGATTCTGCTTCGGTTACAACTGATGGTGAGAATATCAATTTCATTGAGACTTCTGACGAATGGAGCCGGTGGAGGGAAGAGTTGGCAACACAGATGTT
TTCGAATTGGGAGTTACGTAAAAGCTAA
mRNA sequence
ATGGATCCTGATGAACTAGTGGCCATATTGACAGCCTTCACCGCAGCTCAAACTACAATTCTCCTACTATTAGATCTATATATGGACGACCATAGGAGAATAGAACACCA
ATCACCCTACATCCGACACCACATTAGGCAACTAGCATTCTTCAGGCTTATACACGAGTCTGACTTATGTTGTCGAGAGAGTACACGGATGGACAAAAGATGTTTTGCCA
TACTTTGCACCTTATTAAAAACAGTTGTCGGTTTATCCAGTACGGAGGTCGTAGATGTAGAAGAGATGGTTGCCATGTTCTTGCACATTGTAGCCCACGACGTAAAGAAC
CGAGTAGTTCGTATGCAGTTCGCTAGGTCTGGTGAGACAGTTTCTAGGCATTTCAACACTGTCCTTCATGCGGTGTTACGATTGCATGATGTTCTCTTAAAAAAACCTGA
GCCAGTCACAGCCTCTTGTACGAATCCAAGGTGGAAATGGTTTCAGAATTGCCTTGGTGCGTTAGATGGAACATACATCAAGGTCAATGTTGCTATTGTCGATCGCCCGA
GGTATAGAACAAGGAAAGGTGAAATTGCAATGAACGTGCTTGTTGTCTGTGATACAAAAGGAGACTTCACATTCGTCTTACCAGGGTGGGAAGGGTCTGCCGCTGATTCT
CGAGTTCTTAGAGATGCAATATCAAGACCAAACGGACTCACAGTTTCGAAGGAGGGTTTCCTGGCACCGTATAGAGGGAAACTGTACCACCTCTCTGAATGGCGTGATGA
GGGGAATGCACCAACTACTCCAAAAGAATTCTTTAACATGAAGCATTCATCTGCGAGGAATGTGATCGAGAGAGCATTCGGTTTGTTGAAAGGAAGGTGGGCTATCCTCC
GAGGGAAATCGTACTATCCAGTTCGAATTCAAGGACGGACCATCGCAGCATGCTGCTTACTTCACAATCTTATTAATAGAGAGATGGGTGACCGTGAAATTCCTGATGAG
CTGGATGAGGTGGATTCTGCTTCGGTTACAACTGATGGTGAGAATATCAATTTCATTGAGACTTCTGACGAATGGAGCCGGTGGAGGGAAGAGTTGGCAACACAGATGTT
TTCGAATTGGGAGTTACGTAAAAGCTAA
Protein sequence
MDPDELVAILTAFTAAQTTILLLLDLYMDDHRRIEHQSPYIRHHIRQLAFFRLIHESDLCCRESTRMDKRCFAILCTLLKTVVGLSSTEVVDVEEMVAMFLHIVAHDVKN
RVVRMQFARSGETVSRHFNTVLHAVLRLHDVLLKKPEPVTASCTNPRWKWFQNCLGALDGTYIKVNVAIVDRPRYRTRKGEIAMNVLVVCDTKGDFTFVLPGWEGSAADS
RVLRDAISRPNGLTVSKEGFLAPYRGKLYHLSEWRDEGNAPTTPKEFFNMKHSSARNVIERAFGLLKGRWAILRGKSYYPVRIQGRTIAACCLLHNLINREMGDREIPDE
LDEVDSASVTTDGENINFIETSDEWSRWREELATQMFSNWELRKS
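
The protein sequence above appears to be the translation of the CDS under the standard genetic code. The sketch below is illustrative only: it assumes the standard code (the page does not say which table was used), the function name translate is hypothetical, and only the first and last few codons of the CDS are checked, not the full sequence.

```python
# Standard genetic code laid out in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 24 and last 18 nucleotides of the CDS listed in this record.
cds_start = "ATGGATCCTGATGAACTAGTGGCC"
cds_end = "GAGTTACGTAAAAGCTAA"
print(translate(cds_start))
print(translate(cds_end))
```

To check the whole record, the ten full CDS lines and the final partial line can be joined into one string and passed to translate in the same way.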