; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0015905 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0015905
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotransposon protein
Genome locationchr12:28609958..28611033
RNA-Seq ExpressionLag0015905
SyntenyLag0015905
Gene Ontology termsGO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR027806 - Harbinger transposase-derived nuclease domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADN33754.1 retrotransposon protein [Cucumis melo subsp. melo]3.2e-4448.57Show/hide
Query:  MKHSSARNVIERAFGLLKGRWAILISKSYYPVKTQRRIITACCLLNNLIRREMPI-----DPMEDELVHSTLNETELDPITYVETSNEWNAWRDTLAREI
        MKHSSARNVIERAFG+LKGRW IL  KSYYP++ Q R I AC LL+NLI REM       D  E +  ++T   +E   I Y+ET+NEW+ WRD LA  +
Subjt:  MKHSSARNVIERAFGLLKGRWAILISKSYYPVKTQRRIITACCLLNNLIRREMPI-----DPMEDELVHSTLNETELDPITYVETSNEWNAWRDTLAREI

Query:  ------------------------------SSYLQQVEKMMKLKLSGCEVKAQPHIESRVKLLKKQYNAIAEILGPNCSGFGWNEREKCIDCSKVVFDDW
                                        YL Q+ +MM  KLSGC+V+A   I+ R+K LK+ + AIAE+LGP CSGFGWN+ EKCI   K +FD+W
Subjt:  ------------------------------SSYLQQVEKMMKLKLSGCEVKAQPHIESRVKLLKKQYNAIAEILGPNCSGFGWNEREKCIDCSKVVFDDW

Query:  VKSHPTAKGL
        V+S P AKGL
Subjt:  VKSHPTAKGL

ADN34114.1 retrotransposon protein [Cucumis melo subsp. melo]9.9e-4649.08Show/hide
Query:  MKHSSARNVIERAFGLLKGRWAILISKSYYPVKTQRRIITACCLLNNLIRREMPIDPMEDEL--VHSTLNETELDPITYVETSNEWNAWRDTLAREISS-
        MKH SARNVIERAFG+LKGRWAIL  KSYYPV+ Q R I ACCLL+NLI REM    +ED +  V ST   T  D I Y+ETSNEW+ WRD LA EI + 
Subjt:  MKHSSARNVIERAFGLLKGRWAILISKSYYPVKTQRRIITACCLLNNLIRREMPIDPMEDEL--VHSTLNETELDPITYVETSNEWNAWRDTLAREISS-

Query:  ----------------------------------------YLQQVEKMMKLKLSGCEVKAQPHIESRVKLLKKQYNAIAEILGPNCSGFGWNEREKCIDC
                                                YL Q+ +MM  K+ G  + A   I+SR+KL+K+ ++A+AE+ GPNCSGFGWN+ +KCI  
Subjt:  ----------------------------------------YLQQVEKMMKLKLSGCEVKAQPHIESRVKLLKKQYNAIAEILGPNCSGFGWNEREKCIDC

Query:  SKVVFDDWVKSHPTAKGL
         K VFDDW  SHP AKGL
Subjt:  SKVVFDDWVKSHPTAKGL

KAE8667190.1 Histone H4 [Hibiscus syriacus]3.2e-4447.44Show/hide
Query:  MKHSSARNVIERAFGLLKGRWAILISKSYYPVKTQRRIITACCLLNNLIRREMPIDPMEDELVHST-----LNETELDPITYVETSNEWNAWRDTLAREI
        MKH  ARN IER FG+LK  WAIL  KS+YP+KTQ R+I+ACCLL+N IR EMPIDP+E +    +     +NE E+  I + E S+ W  WRD LA ++
Subjt:  MKHSSARNVIERAFGLLKGRWAILISKSYYPVKTQRRIITACCLLNNLIRREMPIDPMEDELVHST-----LNETELDPITYVETSNEWNAWRDTLAREI

Query:  -----------------------------------SSYLQQVEKMMKLKLSGCEVKAQPHIESRVKLLKKQYNAIAEILGPNCSGFGWNEREKCIDCSKV
                                           S YL  +EKM+++KLS  +++A PHIESRVKLLK+QYNA++E+L    SGFGWNE EKC+   K 
Subjt:  -----------------------------------SSYLQQVEKMMKLKLSGCEVKAQPHIESRVKLLKKQYNAIAEILGPNCSGFGWNEREKCIDCSKV

Query:  VFDDWVKSHPTAKGL
        VFDDW  SHPT  GL
Subjt:  VFDDWVKSHPTAKGL

XP_008777637.2 uncharacterized protein LOC103697538 [Phoenix dactylifera]3.4e-4657.14Show/hide
Query:  MKHSSARNVIERAFGLLKGRWAILISKSYYPVKTQRRIITACCLLNNLIRREMPIDPMEDELVH----STLNETELDPITYVETSNE---WNAWRDTLAR
        MKHS ARNVIER FGLLKGRWAIL SKS+Y VKTQ RII+ACCLL+N I  EMPIDPM  E+V     S   ET+ +    VE   E    + W+     
Subjt:  MKHSSARNVIERAFGLLKGRWAILISKSYYPVKTQRRIITACCLLNNLIRREMPIDPMEDELVH----STLNETELDPITYVETSNE---WNAWRDTLAR

Query:  EISSYLQQVEKMMKLKLSGCEVKAQPHIESRVKLLKKQYNAIAEILGPNCSGFGWNEREKCIDCSKVVFDDWVKSHPTAKGL
            + Q +E+MM+ KL GC ++  P IE+ VKLLKKQYNAIAE+LGPN   FGWN+REKC+   K V+D WVKSHP A GL
Subjt:  EISSYLQQVEKMMKLKLSGCEVKAQPHIESRVKLLKKQYNAIAEILGPNCSGFGWNEREKCIDCSKVVFDDWVKSHPTAKGL

XP_039026053.1 uncharacterized protein LOC120159549 [Hibiscus syriacus]4.8e-4852.26Show/hide
Query:  MKHSSARNVIERAFGLLKGRWAILISKSYYPVKTQRRIITACCLLNNLIRREMPIDPMEDELVHST-----LNETELDPITYVETSNEWNAWRDTLAREI
        MK S ARN IER FG+LK RWAIL  KS+YPVKTQ R+I+ACCLL+N IR EMPID +E +    +     +NE E+  I + E S+ W  WRD LA ++
Subjt:  MKHSSARNVIERAFGLLKGRWAILISKSYYPVKTQRRIITACCLLNNLIRREMPIDPMEDELVHST-----LNETELDPITYVETSNEWNAWRDTLAREI

Query:  -------------------SSYLQQVEKMMKLKLSGCEVKAQPHIESRVKLLKKQYNAIAEILGPNCSGFGWNEREKCIDCSKVVFDDWVKSHPTAKGL
                           S YL  +EKM+++KL   +++A PHIESRVKLLK+QYNA++E+L    SGFGWNE EKC+   K VFDDWV+SHPTA GL
Subjt:  -------------------SSYLQQVEKMMKLKLSGCEVKAQPHIESRVKLLKKQYNAIAEILGPNCSGFGWNEREKCIDCSKVVFDDWVKSHPTAKGL

TrEMBL top hitse value%identityAlignment
A0A2Z6NNS9 Uncharacterized protein1.7e-4347.47Show/hide
Query:  MKHSSARNVIERAFGLLKGRWAILISKSYYPVKTQRRIITACCLLNNLIRREMPIDPMEDELVHSTLNETEL---DPITYVETSNEWNAWRDTLAREI--
        MKHSSARNVIER FGLLKGRWAIL  KS+YPVKTQ RIITACCLL+N IR EM +DP+E  L        E+   D IT VE S  W+ WRD  A +I  
Subjt:  MKHSSARNVIERAFGLLKGRWAILISKSYYPVKTQRRIITACCLLNNLIRREMPIDPMEDELVHSTLNETEL---DPITYVETSNEWNAWRDTLAREI--

Query:  -------------------------------------SSYLQQVEKMMKLKLSGCEVKAQPHIESRVKLLKKQYNAIAEILGPNCSGFGWNEREKCIDCS
                                               Y + +EK +  K  GC ++A PHIESRVK LK QY+AI ++LGP+ SGFGW++  K I   
Subjt:  -------------------------------------SSYLQQVEKMMKLKLSGCEVKAQPHIESRVKLLKKQYNAIAEILGPNCSGFGWNEREKCIDCS

Query:  KVVFDDWVKSHPTAKGL
        K ++  W KSHPTA GL
Subjt:  KVVFDDWVKSHPTAKGL

A0A6A2XCI2 Histone H41.5e-4447.44Show/hide
Query:  MKHSSARNVIERAFGLLKGRWAILISKSYYPVKTQRRIITACCLLNNLIRREMPIDPMEDELVHST-----LNETELDPITYVETSNEWNAWRDTLAREI
        MKH  ARN IER FG+LK  WAIL  KS+YP+KTQ R+I+ACCLL+N IR EMPIDP+E +    +     +NE E+  I + E S+ W  WRD LA ++
Subjt:  MKHSSARNVIERAFGLLKGRWAILISKSYYPVKTQRRIITACCLLNNLIRREMPIDPMEDELVHST-----LNETELDPITYVETSNEWNAWRDTLAREI

Query:  -----------------------------------SSYLQQVEKMMKLKLSGCEVKAQPHIESRVKLLKKQYNAIAEILGPNCSGFGWNEREKCIDCSKV
                                           S YL  +EKM+++KLS  +++A PHIESRVKLLK+QYNA++E+L    SGFGWNE EKC+   K 
Subjt:  -----------------------------------SSYLQQVEKMMKLKLSGCEVKAQPHIESRVKLLKKQYNAIAEILGPNCSGFGWNEREKCIDCSKV

Query:  VFDDWVKSHPTAKGL
        VFDDW  SHPT  GL
Subjt:  VFDDWVKSHPTAKGL

A0A803PDI8 Uncharacterized protein1.7e-4649.53Show/hide
Query:  MKHSSARNVIERAFGLLKGRWAILISKSYYPVKTQRRIITACCLLNNLIRREMPIDPMEDELVHSTLNETELDP-------ITYVETSNEWNAWRDTLAR
        M+HSSARNV+ERAFGLLKGRWAI+  +SYYPVK Q RII ACC L+NLIR EM +DP+E    H   N+++ D         TY+E SN W AWRD LAR
Subjt:  MKHSSARNVIERAFGLLKGRWAILISKSYYPVKTQRRIITACCLLNNLIRREMPIDPMEDELVHSTLNETELDP-------ITYVETSNEWNAWRDTLAR

Query:  EISSYLQ---------------------QVEKMMKLKLSGCEV-----------KAQPHIESRVKLLKKQYNAIAEILGPNCSGFGWNEREKCIDCSKVV
        E+    Q                     Q   +  LKL  C +            AQPHI SR+K+LK+QY  I+ +LGP+ SGFGW+E  KC+   K+V
Subjt:  EISSYLQ---------------------QVEKMMKLKLSGCEV-----------KAQPHIESRVKLLKKQYNAIAEILGPNCSGFGWNEREKCIDCSKVV

Query:  FDDWVKSHPTAKGL
        FDDWVKSHPT KGL
Subjt:  FDDWVKSHPTAKGL

E5GBB2 Retrotransposon protein1.5e-4448.57Show/hide
Query:  MKHSSARNVIERAFGLLKGRWAILISKSYYPVKTQRRIITACCLLNNLIRREMPI-----DPMEDELVHSTLNETELDPITYVETSNEWNAWRDTLAREI
        MKHSSARNVIERAFG+LKGRW IL  KSYYP++ Q R I AC LL+NLI REM       D  E +  ++T   +E   I Y+ET+NEW+ WRD LA  +
Subjt:  MKHSSARNVIERAFGLLKGRWAILISKSYYPVKTQRRIITACCLLNNLIRREMPI-----DPMEDELVHSTLNETELDPITYVETSNEWNAWRDTLAREI

Query:  ------------------------------SSYLQQVEKMMKLKLSGCEVKAQPHIESRVKLLKKQYNAIAEILGPNCSGFGWNEREKCIDCSKVVFDDW
                                        YL Q+ +MM  KLSGC+V+A   I+ R+K LK+ + AIAE+LGP CSGFGWN+ EKCI   K +FD+W
Subjt:  ------------------------------SSYLQQVEKMMKLKLSGCEVKAQPHIESRVKLLKKQYNAIAEILGPNCSGFGWNEREKCIDCSKVVFDDW

Query:  VKSHPTAKGL
        V+S P AKGL
Subjt:  VKSHPTAKGL

E5GCB5 Retrotransposon protein4.8e-4649.08Show/hide
Query:  MKHSSARNVIERAFGLLKGRWAILISKSYYPVKTQRRIITACCLLNNLIRREMPIDPMEDEL--VHSTLNETELDPITYVETSNEWNAWRDTLAREISS-
        MKH SARNVIERAFG+LKGRWAIL  KSYYPV+ Q R I ACCLL+NLI REM    +ED +  V ST   T  D I Y+ETSNEW+ WRD LA EI + 
Subjt:  MKHSSARNVIERAFGLLKGRWAILISKSYYPVKTQRRIITACCLLNNLIRREMPIDPMEDEL--VHSTLNETELDPITYVETSNEWNAWRDTLAREISS-

Query:  ----------------------------------------YLQQVEKMMKLKLSGCEVKAQPHIESRVKLLKKQYNAIAEILGPNCSGFGWNEREKCIDC
                                                YL Q+ +MM  K+ G  + A   I+SR+KL+K+ ++A+AE+ GPNCSGFGWN+ +KCI  
Subjt:  ----------------------------------------YLQQVEKMMKLKLSGCEVKAQPHIESRVKLLKKQYNAIAEILGPNCSGFGWNEREKCIDC

Query:  SKVVFDDWVKSHPTAKGL
         K VFDDW  SHP AKGL
Subjt:  SKVVFDDWVKSHPTAKGL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G12010.1 unknown protein2.2e-0632.98Show/hide
Query:  KHSSARNVIERAFGLLKGRWAILISKSYYPVKTQRRIITACCLLNNLIRREMPIDPMEDELVHSTLNETELDPITYVETSNEWNAWRDTLAREI
        K S  + V + AFG LKGRWA L  ++   ++    ++ ACC+L+N+   EM  + ME EL+   +++ E+ P   + + N   A RDT++  +
Subjt:  KHSSARNVIERAFGLLKGRWAILISKSYYPVKTQRRIITACCLLNNLIRREMPIDPMEDELVHSTLNETELDPITYVETSNEWNAWRDTLAREI

AT5G35695.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912)3.3e-0730Show/hide
Query:  MKHSSARNVIERAFGLLKGRWAILISKSYYPVKTQRRIITACCLLNNLIRREMPIDPM--------EDELVH---STLNETELDPITYVETSNE----WN
        ++H S RNVIER FG+ K R+AI  S   +  K Q  ++  C  L+N +R+E   D          E ++V+   + +N  E+D    +E   +     N
Subjt:  MKHSSARNVIERAFGLLKGRWAILISKSYYPVKTQRRIITACCLLNNLIRREMPIDPM--------EDELVH---STLNETELDPITYVETSNE----WN

Query:  AWRDTLAREI
         WR ++A ++
Subjt:  AWRDTLAREI

AT5G41980.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912)9.7e-0737.5Show/hide
Query:  KHSSARNVIERAFGLLKGRWAILISKSYYPVKTQRRIITACCLLNNLIRREMPIDPMEDELVHSTLNETELD
        +H      I R FG LK R+ IL+S   YP++TQ +++ A C L+N +R E P D +       TL E   D
Subjt:  KHSSARNVIERAFGLLKGRWAILISKSYYPVKTQRRIITACCLLNNLIRREMPIDPMEDELVHSTLNETELD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGCACTCTTCTGCAAGGAATGTAATTGAGAGGGCCTTTGGCCTTCTTAAAGGAAGATGGGCTATTCTCATATCAAAGTCATATTATCCTGTTAAAACTCAACGTAG
AATAATAACTGCTTGTTGTTTGCTCAACAATTTAATTAGACGGGAAATGCCCATTGATCCAATGGAAGATGAGTTGGTTCATAGTACCTTGAATGAAACTGAGTTAGACC
CTATCACTTATGTGGAGACATCGAATGAGTGGAATGCTTGGCGAGATACCTTAGCAAGGGAGATATCGAGCTATCTACAACAGGTGGAGAAAATGATGAAGTTAAAGTTG
TCTGGATGTGAGGTGAAGGCTCAACCTCACATAGAATCAAGAGTGAAACTTTTGAAGAAACAATATAATGCAATTGCAGAGATATTAGGACCAAATTGTAGTGGTTTTGG
TTGGAATGAAAGAGAGAAGTGTATTGATTGTTCAAAAGTTGTCTTTGATGATTGGGTTAAGAGTCACCCCACAGCAAAAGGATTGTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGCACTCTTCTGCAAGGAATGTAATTGAGAGGGCCTTTGGCCTTCTTAAAGGAAGATGGGCTATTCTCATATCAAAGTCATATTATCCTGTTAAAACTCAACGTAG
AATAATAACTGCTTGTTGTTTGCTCAACAATTTAATTAGACGGGAAATGCCCATTGATCCAATGGAAGATGAGTTGGTTCATAGTACCTTGAATGAAACTGAGTTAGACC
CTATCACTTATGTGGAGACATCGAATGAGTGGAATGCTTGGCGAGATACCTTAGCAAGGGAGATATCGAGCTATCTACAACAGGTGGAGAAAATGATGAAGTTAAAGTTG
TCTGGATGTGAGGTGAAGGCTCAACCTCACATAGAATCAAGAGTGAAACTTTTGAAGAAACAATATAATGCAATTGCAGAGATATTAGGACCAAATTGTAGTGGTTTTGG
TTGGAATGAAAGAGAGAAGTGTATTGATTGTTCAAAAGTTGTCTTTGATGATTGGGTTAAGAGTCACCCCACAGCAAAAGGATTGTGA
Protein sequenceShow/hide protein sequence
MKHSSARNVIERAFGLLKGRWAILISKSYYPVKTQRRIITACCLLNNLIRREMPIDPMEDELVHSTLNETELDPITYVETSNEWNAWRDTLAREISSYLQQVEKMMKLKL
SGCEVKAQPHIESRVKLLKKQYNAIAEILGPNCSGFGWNEREKCIDCSKVVFDDWVKSHPTAKGL