; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0001078 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0001078
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionCACTA en-spm transposon protein
Genome locationchr4:23759458..23760542
RNA-Seq ExpressionLag0001078
SyntenyLag0001078
Gene Ontology termsNA
InterPro domainsIPR004252 - Probable transposase, Ptta/En/Spm, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_038887408.1 poly [ADP-ribose] polymerase 1-like isoform X1 [Benincasa hispida]2.4e-5041.99Show/hide
Query:  MTRNGQQSEARDALADVNRLMIGSSTDQ---AALGSRGRSRN-QRG--RRTRGHSQNIELDRYVSLYGRIRIEITEEIGKPVSGWATRFSGAIGTITRST
        +  +GQ+ E    L   +   IGSST++    A GSR  SR   RG  RRTRGHS+N+ELDR+V+++GRIRIEI EE+GKPV   AT+FS AIGTI R+T
Subjt:  MTRNGQQSEARDALADVNRLMIGSSTDQ---AALGSRGRSRN-QRG--RRTRGHSQNIELDRYVSLYGRIRIEITEEIGKPVSGWATRFSGAIGTITRST

Query:  VPLSCATWRVVPKQVH----------------------------------------DVGKACL------------------------LFDNPAEAPENPP
        +PL C  W  V K+V                                         DVGK  +                         F +P EA   PP
Subjt:  VPLSCATWRVVPKQVH----------------------------------------DVGKACL------------------------LFDNPAEAPENPP

Query:  ERITNLEDWNLLCDRWETPEWKEKADKNKSSRSNLPFNHRAGPKSFLQLQHELVQLSYCFTNYVIIIRITLHAIGLQKIKENRDVDQVDLFHESHFCERG
        +RIT+  DWNLLC+RWETPEWK+K + NK SRS +P+ HR G KSF+Q+Q E+                        KIKE RDVDQVDLF +SHFCE+ 
Subjt:  ERITNLEDWNLLCDRWETPEWKEKADKNKSSRSNLPFNHRAGPKSFLQLQHELVQLSYCFTNYVIIIRITLHAIGLQKIKENRDVDQVDLFHESHFCERG

Query:  GWVNDVAKDAYV
        GWVN+ AKDAY+
Subjt:  GWVNDVAKDAYV

XP_038887409.1 poly [ADP-ribose] polymerase 1-like isoform X2 [Benincasa hispida]1.8e-5345.96Show/hide
Query:  MTRNGQQSEARDALADVNRLMIGSSTDQ---AALGSRGRSRN-QRG--RRTRGHSQNIELDRYVSLYGRIRIEITEEIGKPVSGWATRFSGAIGTITRST
        +  +GQ+ E    L   +   IGSST++    A GSR  SR   RG  RRTRGHS+N+ELDR+V+++GRIRIEI EE+GKPV   AT+FS AIGTI R+T
Subjt:  MTRNGQQSEARDALADVNRLMIGSSTDQ---AALGSRGRSRN-QRG--RRTRGHSQNIELDRYVSLYGRIRIEITEEIGKPVSGWATRFSGAIGTITRST

Query:  VPLSCATWRVVPKQVH-------------DVGKACL------------------------LFDNPAEAPENPPERITNLEDWNLLCDRWETPEWKEKADK
        +PL C  W  V K+V              DVGK  +                         F +P EA   PP+RIT+  DWNLLC+RWETPEWK+K + 
Subjt:  VPLSCATWRVVPKQVH-------------DVGKACL------------------------LFDNPAEAPENPPERITNLEDWNLLCDRWETPEWKEKADK

Query:  NKSSRSNLPFNHRAGPKSFLQLQHELVQLSYCFTNYVIIIRITLHAIGLQKIKENRDVDQVDLFHESHFCERGGWVNDVAKDAYV
        NK SRS +P+ HR G KSF+Q+Q E+                        KIKE RDVDQVDLF +SHFCE+ GWVN+ AKDAY+
Subjt:  NKSSRSNLPFNHRAGPKSFLQLQHELVQLSYCFTNYVIIIRITLHAIGLQKIKENRDVDQVDLFHESHFCERGGWVNDVAKDAYV

XP_038887410.1 poly [ADP-ribose] polymerase 1-like isoform X3 [Benincasa hispida]2.4e-5041.99Show/hide
Query:  MTRNGQQSEARDALADVNRLMIGSSTDQ---AALGSRGRSRN-QRG--RRTRGHSQNIELDRYVSLYGRIRIEITEEIGKPVSGWATRFSGAIGTITRST
        +  +GQ+ E    L   +   IGSST++    A GSR  SR   RG  RRTRGHS+N+ELDR+V+++GRIRIEI EE+GKPV   AT+FS AIGTI R+T
Subjt:  MTRNGQQSEARDALADVNRLMIGSSTDQ---AALGSRGRSRN-QRG--RRTRGHSQNIELDRYVSLYGRIRIEITEEIGKPVSGWATRFSGAIGTITRST

Query:  VPLSCATWRVVPKQVH----------------------------------------DVGKACL------------------------LFDNPAEAPENPP
        +PL C  W  V K+V                                         DVGK  +                         F +P EA   PP
Subjt:  VPLSCATWRVVPKQVH----------------------------------------DVGKACL------------------------LFDNPAEAPENPP

Query:  ERITNLEDWNLLCDRWETPEWKEKADKNKSSRSNLPFNHRAGPKSFLQLQHELVQLSYCFTNYVIIIRITLHAIGLQKIKENRDVDQVDLFHESHFCERG
        +RIT+  DWNLLC+RWETPEWK+K + NK SRS +P+ HR G KSF+Q+Q E+                        KIKE RDVDQVDLF +SHFCE+ 
Subjt:  ERITNLEDWNLLCDRWETPEWKEKADKNKSSRSNLPFNHRAGPKSFLQLQHELVQLSYCFTNYVIIIRITLHAIGLQKIKENRDVDQVDLFHESHFCERG

Query:  GWVNDVAKDAYV
        GWVN+ AKDAY+
Subjt:  GWVNDVAKDAYV

XP_038887411.1 poly [ADP-ribose] polymerase 1-like isoform X4 [Benincasa hispida]3.2e-5042.12Show/hide
Query:  MTRNGQQSEARDALADVNRLMIGSSTDQ---AALGSRGRSRN-QRG--RRTRGHSQNIELDRYVSLYGRIRIEITEEIGKPVSGWATRFSGAIGTITRST
        +  +GQ+ E    L   +   IGSST++    A GSR  SR   RG  RRTRGHS+N+ELDR+V+++GRIRIEI EE+GKPV   AT+FS AIGTI R+T
Subjt:  MTRNGQQSEARDALADVNRLMIGSSTDQ---AALGSRGRSRN-QRG--RRTRGHSQNIELDRYVSLYGRIRIEITEEIGKPVSGWATRFSGAIGTITRST

Query:  VPLSCATWRVVPKQVH----------------------------------------DVGKACL------------------------LFDNPAEAPENPP
        +PL C  W  V K+V                                         DVGK  +                         F +P EA   PP
Subjt:  VPLSCATWRVVPKQVH----------------------------------------DVGKACL------------------------LFDNPAEAPENPP

Query:  ERITNLEDWNLLCDRWETPEWKEKADKNKSSRSNLPFNHRAGPKSFLQLQHELVQLSYCFTNYVIIIRITLHAIGLQKIKENRDVDQVDLFHESHFCERG
        +RIT+  DWNLLC+RWETPEWK+K + NK SRS +P+ HR G KSF+Q+Q E+                        KIKE RDVDQVDLF +SHFCE+ 
Subjt:  ERITNLEDWNLLCDRWETPEWKEKADKNKSSRSNLPFNHRAGPKSFLQLQHELVQLSYCFTNYVIIIRITLHAIGLQKIKENRDVDQVDLFHESHFCERG

Query:  GWVNDVAKDAY
        GWVN+ AKDAY
Subjt:  GWVNDVAKDAY

XP_038887413.1 uncharacterized protein LOC120077557 isoform X5 [Benincasa hispida]2.4e-5041.99Show/hide
Query:  MTRNGQQSEARDALADVNRLMIGSSTDQ---AALGSRGRSRN-QRG--RRTRGHSQNIELDRYVSLYGRIRIEITEEIGKPVSGWATRFSGAIGTITRST
        +  +GQ+ E    L   +   IGSST++    A GSR  SR   RG  RRTRGHS+N+ELDR+V+++GRIRIEI EE+GKPV   AT+FS AIGTI R+T
Subjt:  MTRNGQQSEARDALADVNRLMIGSSTDQ---AALGSRGRSRN-QRG--RRTRGHSQNIELDRYVSLYGRIRIEITEEIGKPVSGWATRFSGAIGTITRST

Query:  VPLSCATWRVVPKQVH----------------------------------------DVGKACL------------------------LFDNPAEAPENPP
        +PL C  W  V K+V                                         DVGK  +                         F +P EA   PP
Subjt:  VPLSCATWRVVPKQVH----------------------------------------DVGKACL------------------------LFDNPAEAPENPP

Query:  ERITNLEDWNLLCDRWETPEWKEKADKNKSSRSNLPFNHRAGPKSFLQLQHELVQLSYCFTNYVIIIRITLHAIGLQKIKENRDVDQVDLFHESHFCERG
        +RIT+  DWNLLC+RWETPEWK+K + NK SRS +P+ HR G KSF+Q+Q E+                        KIKE RDVDQVDLF +SHFCE+ 
Subjt:  ERITNLEDWNLLCDRWETPEWKEKADKNKSSRSNLPFNHRAGPKSFLQLQHELVQLSYCFTNYVIIIRITLHAIGLQKIKENRDVDQVDLFHESHFCERG

Query:  GWVNDVAKDAYV
        GWVN+ AKDAY+
Subjt:  GWVNDVAKDAYV

TrEMBL top hitse value%identityAlignment
A0A5A7SPZ3 Transposase2.0e-3738.21Show/hide
Query:  RGRSR-NQRGRRTRGHSQNIELDRYVSLYGRIRIEITEEIGKPVSGWATRFSGAIGTITRSTVPLSCATWRVVPKQVHD---------------------
        R +SR  +R R  RG+ +NIELD++V  +G+++IEI EE GKPV+ +A + +  IGT  R+T+ LSC  W+ +P  V +                     
Subjt:  RGRSR-NQRGRRTRGHSQNIELDRYVSLYGRIRIEITEEIGKPVSGWATRFSGAIGTITRSTVPLSCATWRVVPKQVHD---------------------

Query:  ----------------VGKACLLFDNPAEAPENPPERITNLEDWNLLCDRWETPEWKEKADKNKSSRSNLPFNHRAGPKSFLQLQHELVQLSYCFTNYVI
                        + K    FD+  EA  NPP++IT+ EDWN++CDRWET  WK+K + NK SRS + FNH  G KSFLQ++HEL +   C      
Subjt:  ----------------VGKACLLFDNPAEAPENPPERITNLEDWNLLCDRWETPEWKEKADKNKSSRSNLPFNHRAGPKSFLQLQHELVQLSYCFTNYVI

Query:  IIRITLHAIGLQKIKENRDVDQVDLFHESHFCERGGWVNDVAKDAY
                          DVD+V++F E+HF E+ GW+ND AKDAY
Subjt:  IIRITLHAIGLQKIKENRDVDQVDLFHESHFCERGGWVNDVAKDAY

A0A5A7T3V0 CACTA en-spm transposon protein5.3e-3543.08Show/hide
Query:  GHSQNIELDRYVSLYGRIRIEITEEIGKPVSGWATRFSGAIGTITRSTVPLSCATWRVVPKQVHDVGKACLLFDNPAEAPENPPERITNLEDWNLLCDRW
        G+ +NIELD++V  +G+++IEI+EE GKPV+ +A   +  IGT  R+T+PLSC   + VP  V +     L+ D           RIT+ EDWN++CDRW
Subjt:  GHSQNIELDRYVSLYGRIRIEITEEIGKPVSGWATRFSGAIGTITRSTVPLSCATWRVVPKQVHDVGKACLLFDNPAEAPENPPERITNLEDWNLLCDRW

Query:  ETPEWKEKADKNKSSRSNLPFNHRAGPKSFLQLQHELVQLSYCFTNYVIIIRITLHAIGLQKIKENRDVDQVDLFHESHFCERGGWVNDVAKDAY
        ET  WK+K + NK SRS + FNH    KSFLQ++HEL                        K K+  DVD+V++FHE+HF E+ GW+ND AK+AY
Subjt:  ETPEWKEKADKNKSSRSNLPFNHRAGPKSFLQLQHELVQLSYCFTNYVIIIRITLHAIGLQKIKENRDVDQVDLFHESHFCERGGWVNDVAKDAY

A0A5A7TRX4 DUF4216 domain-containing protein3.6e-3939.34Show/hide
Query:  RSRNQRGRRTRGHSQNIELDRYVSLYGRIRIEITEEIGKPVSGWATRFSGAIGTITRSTVPLSCATWRVVPKQVH-------------------------
        +S +  GR  RG+ +NIELD++V  +G+I+IEI EE GKPV+ +A + +  IGT  R+T+PLSC  W+ VP  V                          
Subjt:  RSRNQRGRRTRGHSQNIELDRYVSLYGRIRIEITEEIGKPVSGWATRFSGAIGTITRSTVPLSCATWRVVPKQVH-------------------------

Query:  ------------DVGKACLLFDNPAEAPENPPERITNLEDWNLLCDRWETPEWKEKADKNKSSRSNLPFNHRAGPKSFLQLQHELVQLSYCFTNYVIIIR
                    D+ K    FD+  EA  NP  RIT+ EDWN++CDRWET  WK+K + NK S S + FNH  G KSFLQ++HEL               
Subjt:  ------------DVGKACLLFDNPAEAPENPPERITNLEDWNLLCDRWETPEWKEKADKNKSSRSNLPFNHRAGPKSFLQLQHELVQLSYCFTNYVIIIR

Query:  ITLHAIGLQKIKENRDVDQVDLFHESHFCERGGWVNDVAKDAYV
                 K K+  DVD++++FHE+HF E+ GW ND AKDAY+
Subjt:  ITLHAIGLQKIKENRDVDQVDLFHESHFCERGGWVNDVAKDAYV

A0A5D3B974 DUF4216 domain-containing protein5.3e-3543.08Show/hide
Query:  GHSQNIELDRYVSLYGRIRIEITEEIGKPVSGWATRFSGAIGTITRSTVPLSCATWRVVPKQVHDVGKACLLFDNPAEAPENPPERITNLEDWNLLCDRW
        G+ +NIELD++V  +G+++IEI+EE GKPV+ +A   +  IGT  R+T+PLSC   + VP  V +     L+ D           RIT+ EDWN++CDRW
Subjt:  GHSQNIELDRYVSLYGRIRIEITEEIGKPVSGWATRFSGAIGTITRSTVPLSCATWRVVPKQVHDVGKACLLFDNPAEAPENPPERITNLEDWNLLCDRW

Query:  ETPEWKEKADKNKSSRSNLPFNHRAGPKSFLQLQHELVQLSYCFTNYVIIIRITLHAIGLQKIKENRDVDQVDLFHESHFCERGGWVNDVAKDAY
        ET  WK+K + NK SRS + FNH    KSFLQ++HEL                        K K+  DVD+V++FHE+HF E+ GW+ND AK+AY
Subjt:  ETPEWKEKADKNKSSRSNLPFNHRAGPKSFLQLQHELVQLSYCFTNYVIIIRITLHAIGLQKIKENRDVDQVDLFHESHFCERGGWVNDVAKDAY

A0A6J1DUH3 uncharacterized protein LOC1110232122.0e-2956.91Show/hide
Query:  FDNPAEAPENPPERITNLEDWNLLCDRWETPEWKEKADKNKSSRSNLPFNHRAGPKSFLQLQHELVQLSYCFTNYVIIIRITLHAIGLQKIKENRDVDQV
        F++P EA  NPPER+TN EDWN LCDRWETPEWKE   KNK +R+ LPFNHRAG KSFLQLQHEL                        KIKE  D+  V
Subjt:  FDNPAEAPENPPERITNLEDWNLLCDRWETPEWKEKADKNKSSRSNLPFNHRAGPKSFLQLQHELVQLSYCFTNYVIIIRITLHAIGLQKIKENRDVDQV

Query:  DLFHESHFCERGGWVNDVAKDAY
        DLF ESH+ E+ G VND A+DAY
Subjt:  DLFHESHFCERGGWVNDVAKDAY

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACACGTAATGGTCAACAGTCGGAGGCACGTGATGCACTTGCAGATGTTAACCGTCTAATGATTGGTTCATCCACTGATCAGGCTGCGTTAGGATCTAGAGGTCGTTC
AAGAAATCAACGAGGAAGGCGGACTAGAGGACATAGTCAGAATATTGAACTAGACCGATATGTCAGTCTTTATGGGAGGATTAGGATTGAGATCACCGAGGAGATTGGAA
AACCAGTAAGTGGTTGGGCTACGAGGTTTAGTGGCGCTATTGGTACCATAACAAGGAGCACAGTTCCTTTGAGTTGTGCGACATGGAGGGTTGTACCAAAACAAGTACAT
GATGTTGGGAAGGCTTGTTTGTTGTTTGACAACCCTGCAGAAGCCCCTGAAAATCCACCTGAGAGGATAACAAACCTTGAAGATTGGAATCTTCTATGTGATCGATGGGA
GACACCTGAATGGAAGGAAAAAGCGGATAAGAATAAATCGAGTCGATCAAACCTTCCATTCAACCATCGAGCTGGCCCGAAGTCATTTCTCCAACTACAACATGAATTGG
TGCAACTTAGCTATTGTTTTACAAATTATGTAATTATTATTCGAATAACACTTCATGCAATTGGGTTGCAAAAAATCAAAGAGAATCGGGATGTTGACCAGGTAGATTTG
TTCCATGAAAGTCATTTTTGTGAAAGAGGTGGATGGGTCAACGATGTTGCCAAAGATGCATATGTAAGCCCTGTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGACACGTAATGGTCAACAGTCGGAGGCACGTGATGCACTTGCAGATGTTAACCGTCTAATGATTGGTTCATCCACTGATCAGGCTGCGTTAGGATCTAGAGGTCGTTC
AAGAAATCAACGAGGAAGGCGGACTAGAGGACATAGTCAGAATATTGAACTAGACCGATATGTCAGTCTTTATGGGAGGATTAGGATTGAGATCACCGAGGAGATTGGAA
AACCAGTAAGTGGTTGGGCTACGAGGTTTAGTGGCGCTATTGGTACCATAACAAGGAGCACAGTTCCTTTGAGTTGTGCGACATGGAGGGTTGTACCAAAACAAGTACAT
GATGTTGGGAAGGCTTGTTTGTTGTTTGACAACCCTGCAGAAGCCCCTGAAAATCCACCTGAGAGGATAACAAACCTTGAAGATTGGAATCTTCTATGTGATCGATGGGA
GACACCTGAATGGAAGGAAAAAGCGGATAAGAATAAATCGAGTCGATCAAACCTTCCATTCAACCATCGAGCTGGCCCGAAGTCATTTCTCCAACTACAACATGAATTGG
TGCAACTTAGCTATTGTTTTACAAATTATGTAATTATTATTCGAATAACACTTCATGCAATTGGGTTGCAAAAAATCAAAGAGAATCGGGATGTTGACCAGGTAGATTTG
TTCCATGAAAGTCATTTTTGTGAAAGAGGTGGATGGGTCAACGATGTTGCCAAAGATGCATATGTAAGCCCTGTTTGA
Protein sequenceShow/hide protein sequence
MTRNGQQSEARDALADVNRLMIGSSTDQAALGSRGRSRNQRGRRTRGHSQNIELDRYVSLYGRIRIEITEEIGKPVSGWATRFSGAIGTITRSTVPLSCATWRVVPKQVH
DVGKACLLFDNPAEAPENPPERITNLEDWNLLCDRWETPEWKEKADKNKSSRSNLPFNHRAGPKSFLQLQHELVQLSYCFTNYVIIIRITLHAIGLQKIKENRDVDQVDL
FHESHFCERGGWVNDVAKDAYVSPV