; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0035448 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0035448
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationchr3:21617969..21618842
RNA-Seq ExpressionLag0035448
SyntenyLag0035448
Gene Ontology termsGO:0016020 - membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
QWX09785.1 hydroxymethylglutaryl-CoA synthase [Pistacia terebinthus subsp. palaestina]9.4e-3745.66Show/hide
Query:  TVKLDEKNYLLWRGMVLAILRGQKVDGYVLGTKAQPSEFIETMIGNEKKLDPNPLYDEWMTVDQTLSGWLFGSMSPAIAADVVNFKNSREVWKALEEVYG
        ++KLD  N+LLW  +VL ++RG K  GY+ GTK  P E++           PN  Y++W++ D+ L GWL+ +M+P IA+ ++    S+E+W A +E+ G
Subjt:  TVKLDEKNYLLWRGMVLAILRGQKVDGYVLGTKAQPSEFIETMIGNEKKLDPNPLYDEWMTVDQTLSGWLFGSMSPAIAADVVNFKNSREVWKALEEVYG

Query:  ATSKARVNQLRGILQNTKKGSMKMIDYLAVMKQASENLKLAGNPVSLDDLISYVLAGLDAEYIPIVCTIDDKD
        A +K+RV   +G LQ T+KG MKM +YL  MK  S+NL LAG+P+SL DLI+ +L GLDAEY PIV  + DK+
Subjt:  ATSKARVNQLRGILQNTKKGSMKMIDYLAVMKQASENLKLAGNPVSLDDLISYVLAGLDAEYIPIVCTIDDKD

TXG48382.1 hypothetical protein EZV62_027676 [Acer yangbiense]1.9e-3739.62Show/hide
Query:  GDDNSSSSTSAAIPVISTSTTSVISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYVLGTKAQPSEFIETMIGNE--KKLDPNPLYDEWMTV
        GD +  ++TS    V     TS      G+PL +  +VKL+ +NYLLW+ +VL ++RG +++GY+ G K  P EFI T +  E  + L+ NP Y++W+  
Subjt:  GDDNSSSSTSAAIPVISTSTTSVISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYVLGTKAQPSEFIETMIGNE--KKLDPNPLYDEWMTV

Query:  DQTLSGWLFGSMSPAIAADVVNFKNSREVWKALEEVYGATSKARVNQLRGILQNTKKGSMKMIDYLAVMKQASENLKLAGNPVSLDDLISYVLAGLDA-E
        DQ L GWL+ SM P +A++V+  + S+++W+++  ++G  +K+ +   +   Q  +KG MKM DYL   K+ ++NL LAG PV L+DL+S VL GLD+ E
Subjt:  DQTLSGWLFGSMSPAIAADVVNFKNSREVWKALEEVYGATSKARVNQLRGILQNTKKGSMKMIDYLAVMKQASENLKLAGNPVSLDDLISYVLAGLDA-E

Query:  YIPIVCTIDDKD
        Y P+VC I++++
Subjt:  YIPIVCTIDDKD

XP_022143579.1 ankyrin repeat-containing protein NPR4-like [Momordica charantia]4.6e-6058.33Show/hide
Query:  LQGESYGDDNSSSSTSAAIPVISTSTTSVISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYVLGTKAQPSEFIETMIGNEKKLDP--NPLY
        +  E +    +++S  A +        S I+ SFGHPLST LTVKLD+KNY LW+GMVLA+L GQKVDGYVL TK  PS++  T   +   L+P  NP Y
Subjt:  LQGESYGDDNSSSSTSAAIPVISTSTTSVISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYVLGTKAQPSEFIETMIGNEKKLDP--NPLY

Query:  DEWMTVDQTLSGWLFGSMSPAIAADVVNFKNSREVWKALEEVYGATSKARVNQLRGILQNTKKGSMKMIDYLAVMKQASENLKLAGNPVSLDDLISYVLA
        +EW  VDQ   GWLFGSM+P+IAADVVN + S EVW ALE ++G+TSKAR+NQLR  LQNTKKG+MKM  YLA MKQ SE+LKLAG PV+L  L S +L 
Subjt:  DEWMTVDQTLSGWLFGSMSPAIAADVVNFKNSREVWKALEEVYGATSKARVNQLRGILQNTKKGSMKMIDYLAVMKQASENLKLAGNPVSLDDLISYVLA

Query:  GLDAEYIPIVCTIDDK
        G +AEY+PI+CTI+DK
Subjt:  GLDAEYIPIVCTIDDK

XP_022157748.1 uncharacterized protein LOC111024384 isoform X1 [Momordica charantia]8.1e-6564.04Show/hide
Query:  TSAAIPVISTSTTSVISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYVLGTKAQPSEFIET--MIGNEKKLDPNPLYDEWMTVDQTLSGWL
        T+ A+P  + S     ++SFGHPL TVLTVKLD+KNY LWRGMVLA+LRGQK DGYVLGT A+P +F+ +    G    L  NP Y EW  VDQ L GWL
Subjt:  TSAAIPVISTSTTSVISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYVLGTKAQPSEFIET--MIGNEKKLDPNPLYDEWMTVDQTLSGWL

Query:  FGSMSPAIAADVVNFKNSREVWKALEEVYGATSKARVNQLRGILQNTKKGSMKMIDYLAVMKQASENLKLAGNPVSLDDLISYVLAGLDAEYIPIVCTID
        FGSM+P+IA DVV+F++SREVWKALE++YGATSKAR+NQLR +LQNTKK S+KM +YL +MKQASE+LKLAG PV+ + L+S VL+GL+AEY+PIVC I+
Subjt:  FGSMSPAIAADVVNFKNSREVWKALEEVYGATSKARVNQLRGILQNTKKGSMKMIDYLAVMKQASENLKLAGNPVSLDDLISYVLAGLDAEYIPIVCTID

Query:  DKD
         KD
Subjt:  DKD

XP_022157750.1 uncharacterized protein LOC111024384 isoform X2 [Momordica charantia]8.1e-6564.04Show/hide
Query:  TSAAIPVISTSTTSVISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYVLGTKAQPSEFIET--MIGNEKKLDPNPLYDEWMTVDQTLSGWL
        T+ A+P  + S     ++SFGHPL TVLTVKLD+KNY LWRGMVLA+LRGQK DGYVLGT A+P +F+ +    G    L  NP Y EW  VDQ L GWL
Subjt:  TSAAIPVISTSTTSVISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYVLGTKAQPSEFIET--MIGNEKKLDPNPLYDEWMTVDQTLSGWL

Query:  FGSMSPAIAADVVNFKNSREVWKALEEVYGATSKARVNQLRGILQNTKKGSMKMIDYLAVMKQASENLKLAGNPVSLDDLISYVLAGLDAEYIPIVCTID
        FGSM+P+IA DVV+F++SREVWKALE++YGATSKAR+NQLR +LQNTKK S+KM +YL +MKQASE+LKLAG PV+ + L+S VL+GL+AEY+PIVC I+
Subjt:  FGSMSPAIAADVVNFKNSREVWKALEEVYGATSKARVNQLRGILQNTKKGSMKMIDYLAVMKQASENLKLAGNPVSLDDLISYVLAGLDAEYIPIVCTID

Query:  DKD
         KD
Subjt:  DKD

TrEMBL top hitse value%identityAlignment
A0A2Z7BJW9 Uncharacterized protein (Fragment)1.0e-3649.39Show/hide
Query:  LLWRGMVLAILRGQKVDGYVLGTKAQPSEFI-ETMIGNEKKLDPNPLYDEWMTVDQTLSGWLFGSMSPAIAADVVNFKNSREVWKALEEVYGATSKARVN
        LLW  M+L I+RG K+DGYVLGTK  P EF+  T  G+  K+ PNP Y+EW++ DQ L GWL+ +MS  IA+ ++    S+E+W   +E+ GA +++R+ 
Subjt:  LLWRGMVLAILRGQKVDGYVLGTKAQPSEFI-ETMIGNEKKLDPNPLYDEWMTVDQTLSGWLFGSMSPAIAADVVNFKNSREVWKALEEVYGATSKARVN

Query:  QLRGILQNTKKGSMKMIDYLAVMKQASENLKLAGNPVSLDDLISYVLAGLDAEYIPIVCTIDDK
          +  LQ TKKG MKM +YL  MK  ++NL +AGNP+ L DLI  +L+GLDAEY PIV  + DK
Subjt:  QLRGILQNTKKGSMKMIDYLAVMKQASENLKLAGNPVSLDDLISYVLAGLDAEYIPIVCTIDDK

A0A5C7GVK1 Uncharacterized protein9.1e-3839.62Show/hide
Query:  GDDNSSSSTSAAIPVISTSTTSVISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYVLGTKAQPSEFIETMIGNE--KKLDPNPLYDEWMTV
        GD +  ++TS    V     TS      G+PL +  +VKL+ +NYLLW+ +VL ++RG +++GY+ G K  P EFI T +  E  + L+ NP Y++W+  
Subjt:  GDDNSSSSTSAAIPVISTSTTSVISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYVLGTKAQPSEFIETMIGNE--KKLDPNPLYDEWMTV

Query:  DQTLSGWLFGSMSPAIAADVVNFKNSREVWKALEEVYGATSKARVNQLRGILQNTKKGSMKMIDYLAVMKQASENLKLAGNPVSLDDLISYVLAGLDA-E
        DQ L GWL+ SM P +A++V+  + S+++W+++  ++G  +K+ +   +   Q  +KG MKM DYL   K+ ++NL LAG PV L+DL+S VL GLD+ E
Subjt:  DQTLSGWLFGSMSPAIAADVVNFKNSREVWKALEEVYGATSKARVNQLRGILQNTKKGSMKMIDYLAVMKQASENLKLAGNPVSLDDLISYVLAGLDA-E

Query:  YIPIVCTIDDKD
        Y P+VC I++++
Subjt:  YIPIVCTIDDKD

A0A6J1CPQ7 ankyrin repeat-containing protein NPR4-like2.2e-6058.33Show/hide
Query:  LQGESYGDDNSSSSTSAAIPVISTSTTSVISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYVLGTKAQPSEFIETMIGNEKKLDP--NPLY
        +  E +    +++S  A +        S I+ SFGHPLST LTVKLD+KNY LW+GMVLA+L GQKVDGYVL TK  PS++  T   +   L+P  NP Y
Subjt:  LQGESYGDDNSSSSTSAAIPVISTSTTSVISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYVLGTKAQPSEFIETMIGNEKKLDP--NPLY

Query:  DEWMTVDQTLSGWLFGSMSPAIAADVVNFKNSREVWKALEEVYGATSKARVNQLRGILQNTKKGSMKMIDYLAVMKQASENLKLAGNPVSLDDLISYVLA
        +EW  VDQ   GWLFGSM+P+IAADVVN + S EVW ALE ++G+TSKAR+NQLR  LQNTKKG+MKM  YLA MKQ SE+LKLAG PV+L  L S +L 
Subjt:  DEWMTVDQTLSGWLFGSMSPAIAADVVNFKNSREVWKALEEVYGATSKARVNQLRGILQNTKKGSMKMIDYLAVMKQASENLKLAGNPVSLDDLISYVLA

Query:  GLDAEYIPIVCTIDDK
        G +AEY+PI+CTI+DK
Subjt:  GLDAEYIPIVCTIDDK

A0A6J1DTZ7 uncharacterized protein LOC111024384 isoform X23.9e-6564.04Show/hide
Query:  TSAAIPVISTSTTSVISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYVLGTKAQPSEFIET--MIGNEKKLDPNPLYDEWMTVDQTLSGWL
        T+ A+P  + S     ++SFGHPL TVLTVKLD+KNY LWRGMVLA+LRGQK DGYVLGT A+P +F+ +    G    L  NP Y EW  VDQ L GWL
Subjt:  TSAAIPVISTSTTSVISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYVLGTKAQPSEFIET--MIGNEKKLDPNPLYDEWMTVDQTLSGWL

Query:  FGSMSPAIAADVVNFKNSREVWKALEEVYGATSKARVNQLRGILQNTKKGSMKMIDYLAVMKQASENLKLAGNPVSLDDLISYVLAGLDAEYIPIVCTID
        FGSM+P+IA DVV+F++SREVWKALE++YGATSKAR+NQLR +LQNTKK S+KM +YL +MKQASE+LKLAG PV+ + L+S VL+GL+AEY+PIVC I+
Subjt:  FGSMSPAIAADVVNFKNSREVWKALEEVYGATSKARVNQLRGILQNTKKGSMKMIDYLAVMKQASENLKLAGNPVSLDDLISYVLAGLDAEYIPIVCTID

Query:  DKD
         KD
Subjt:  DKD

A0A6J1DU77 uncharacterized protein LOC111024384 isoform X13.9e-6564.04Show/hide
Query:  TSAAIPVISTSTTSVISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYVLGTKAQPSEFIET--MIGNEKKLDPNPLYDEWMTVDQTLSGWL
        T+ A+P  + S     ++SFGHPL TVLTVKLD+KNY LWRGMVLA+LRGQK DGYVLGT A+P +F+ +    G    L  NP Y EW  VDQ L GWL
Subjt:  TSAAIPVISTSTTSVISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYVLGTKAQPSEFIET--MIGNEKKLDPNPLYDEWMTVDQTLSGWL

Query:  FGSMSPAIAADVVNFKNSREVWKALEEVYGATSKARVNQLRGILQNTKKGSMKMIDYLAVMKQASENLKLAGNPVSLDDLISYVLAGLDAEYIPIVCTID
        FGSM+P+IA DVV+F++SREVWKALE++YGATSKAR+NQLR +LQNTKK S+KM +YL +MKQASE+LKLAG PV+ + L+S VL+GL+AEY+PIVC I+
Subjt:  FGSMSPAIAADVVNFKNSREVWKALEEVYGATSKARVNQLRGILQNTKKGSMKMIDYLAVMKQASENLKLAGNPVSLDDLISYVLAGLDAEYIPIVCTID

Query:  DKD
         KD
Subjt:  DKD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)3.3e-0825.15Show/hide
Query:  LDEKNYLLWRGMVLAILRGQKVDGYVLGTKAQPSEFIETMIGNEKKLDPNPLYDEWMTVDQTLSGWLFGSMSP-AIAADVVNFKNSREVWKALEEVYGAT
        ++E NY  WR + L       V G++ GT                 L  N     W   D  +   L+G+++P       V    SR++W  ++  +   
Subjt:  LDEKNYLLWRGMVLAILRGQKVDGYVLGTKAQPSEFIETMIGNEKKLDPNPLYDEWMTVDQTLSGWLFGSMSP-AIAADVVNFKNSREVWKALEEVYGAT

Query:  SKARVNQLRGILQNTKKGSMKMIDYLAVMKQASENLKLAGNPVSLDDLISYVLAGLDAEYIPIVCTI
          AR  +L   L+    G M++ DY   MK+ +++L+    PV+  +L+ YVL GL+ ++  I+  I
Subjt:  SKARVNQLRGILQNTKKGSMKMIDYLAVMKQASENLKLAGNPVSLDDLISYVLAGLDAEYIPIVCTI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTCTTTTTCCCTGTATACGCTGTCGTTTTAATTCTCCCATTTTCTGTTATGTACGCTGAGATTTTCTCTGACCTTCTTCTTCTAATTTCGCACTTGGTATCAGAGCT
TCAAGGTGAAAGCTATGGGGATGATAATTCGTCTTCTTCTACCTCAGCTGCGATTCCGGTTATTTCAACTTCAACTACTTCTGTGATTAGCTCTTCTTTTGGACATCCAT
TAAGCACAGTACTCACTGTAAAGCTTGATGAGAAGAACTACCTCTTATGGAGAGGTATGGTGTTGGCAATCCTTCGAGGTCAGAAGGTCGATGGGTATGTCTTAGGGACC
AAGGCCCAACCGTCAGAGTTTATTGAGACCATGATTGGAAATGAAAAGAAGCTTGACCCTAATCCTTTGTATGATGAATGGATGACGGTGGATCAGACACTTTCTGGGTG
GCTGTTCGGCTCAATGTCACCTGCTATTGCCGCTGATGTGGTTAACTTCAAAAATTCCAGAGAAGTATGGAAGGCTCTAGAAGAGGTTTATGGAGCGACAAGCAAGGCTC
GAGTGAATCAACTTCGTGGTATTCTTCAGAACACGAAGAAGGGATCGATGAAAATGATTGATTATCTAGCGGTGATGAAGCAAGCATCGGAAAATTTAAAGCTCGCGGGA
AATCCGGTATCTCTTGACGACCTTATTTCTTATGTTCTCGCTGGATTAGATGCTGAGTATATTCCAATTGTGTGTACGATTGATGATAAAGATATTAAAACATGA
mRNA sequenceShow/hide mRNA sequence
ATGCTCTTTTTCCCTGTATACGCTGTCGTTTTAATTCTCCCATTTTCTGTTATGTACGCTGAGATTTTCTCTGACCTTCTTCTTCTAATTTCGCACTTGGTATCAGAGCT
TCAAGGTGAAAGCTATGGGGATGATAATTCGTCTTCTTCTACCTCAGCTGCGATTCCGGTTATTTCAACTTCAACTACTTCTGTGATTAGCTCTTCTTTTGGACATCCAT
TAAGCACAGTACTCACTGTAAAGCTTGATGAGAAGAACTACCTCTTATGGAGAGGTATGGTGTTGGCAATCCTTCGAGGTCAGAAGGTCGATGGGTATGTCTTAGGGACC
AAGGCCCAACCGTCAGAGTTTATTGAGACCATGATTGGAAATGAAAAGAAGCTTGACCCTAATCCTTTGTATGATGAATGGATGACGGTGGATCAGACACTTTCTGGGTG
GCTGTTCGGCTCAATGTCACCTGCTATTGCCGCTGATGTGGTTAACTTCAAAAATTCCAGAGAAGTATGGAAGGCTCTAGAAGAGGTTTATGGAGCGACAAGCAAGGCTC
GAGTGAATCAACTTCGTGGTATTCTTCAGAACACGAAGAAGGGATCGATGAAAATGATTGATTATCTAGCGGTGATGAAGCAAGCATCGGAAAATTTAAAGCTCGCGGGA
AATCCGGTATCTCTTGACGACCTTATTTCTTATGTTCTCGCTGGATTAGATGCTGAGTATATTCCAATTGTGTGTACGATTGATGATAAAGATATTAAAACATGA
Protein sequenceShow/hide protein sequence
MLFFPVYAVVLILPFSVMYAEIFSDLLLLISHLVSELQGESYGDDNSSSSTSAAIPVISTSTTSVISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYVLGT
KAQPSEFIETMIGNEKKLDPNPLYDEWMTVDQTLSGWLFGSMSPAIAADVVNFKNSREVWKALEEVYGATSKARVNQLRGILQNTKKGSMKMIDYLAVMKQASENLKLAG
NPVSLDDLISYVLAGLDAEYIPIVCTIDDKDIKT