; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0019876 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0019876
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationchr5:46296948..46297703
RNA-Seq ExpressionLag0019876
SyntenyLag0019876
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KZV34405.1 hypothetical protein F511_34028, partial [Dorcoceras hygrometricum]1.6e-4144.33Show/hide
Query:  LLWRGMVLAILRGQKVDGYVLGTKTQPAEFIETSTDSGKKLVENPLYEEWTTVDQALSGWLFGSMSPSIAADVVNFKTSREIWKALEEVYGATSKARVNQ
        LLW  M+L I+RG K+DGYVLGTK  P EF+  +T      + NP YEEW + DQ L GWL+ +MS  IA+ ++   TS+E+W   +E+ GA +++R+  
Subjt:  LLWRGMVLAILRGQKVDGYVLGTKTQPAEFIETSTDSGKKLVENPLYEEWTTVDQALSGWLFGSMSPSIAADVVNFKTSREIWKALEEVYGATSKARVNQ

Query:  LRGILQNTKKGSMKMIDYLAVMKQASENLKLAGNPVSFGDLISYVLAGLDPEYIPIVCIIEDKDIKTWQELSSILVTFEGTLARYTTPANTHSDLPDLAA
         +  LQ TKKG MKM +YL  MK  ++NL +AGNP+   DLI  +L+GLD EY PIV ++ DK   +W EL + L+T+E  L +  T           +A
Subjt:  LRGILQNTKKGSMKMIDYLAVMKQASENLKLAGNPVSFGDLISYVLAGLDPEYIPIVCIIEDKDIKTWQELSSILVTFEGTLARYTTPANTHSDLPDLAA

Query:  HFA
        HFA
Subjt:  HFA

QWX09785.1 hydroxymethylglutaryl-CoA synthase [Pistacia terebinthus subsp. palaestina]7.8e-4440.89Show/hide
Query:  NSSASSSAMVAASTTTASIISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYVLGTKTQPAEFIETSTDSGKKLVENPLYEEWTTVDQALSG
        N+  SS +   +  ++ S +  S    L+   ++KLD  N+LLW  +VL ++RG K  GY+ GTK  P E++  +T      + N  YE+W + D+ L G
Subjt:  NSSASSSAMVAASTTTASIISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYVLGTKTQPAEFIETSTDSGKKLVENPLYEEWTTVDQALSG

Query:  WLFGSMSPSIAADVVNFKTSREIWKALEEVYGATSKARVNQLRGILQNTKKGSMKMIDYLAVMKQASENLKLAGNPVSFGDLISYVLAGLDPEYIPIVCI
        WL+ +M+P IA+ ++   TS+E+W A +E+ GA +K+RV   +G LQ T+KG MKM +YL  MK  S+NL LAG+P+S  DLI+ +L GLD EY PIV  
Subjt:  WLFGSMSPSIAADVVNFKTSREIWKALEEVYGATSKARVNQLRGILQNTKKGSMKMIDYLAVMKQASENLKLAGNPVSFGDLISYVLAGLDPEYIPIVCI

Query:  IEDKDIKTWQELSSILVTFEGTLARYTTPANTHSDLPDLAAHFALNR
        + DK+  +W EL + L+TFE  L +  T  N   +L    AH A+NR
Subjt:  IEDKDIKTWQELSSILVTFEGTLARYTTPANTHSDLPDLAAHFALNR

XP_022143579.1 ankyrin repeat-containing protein NPR4-like [Momordica charantia]2.4e-6161.72Show/hide
Query:  MGEENSSASSSAMVAASTTTASIISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYVLGTKTQPAEFIETSTDSGK-KLVENPLYEEWTTVD
        M  E  S S++   +   T  S I+ SFGHPLST LTVKLD+KNY LW+GMVLA+L GQKVDGYVL TKT P+++  T++D+G  +   NP YEEW+ VD
Subjt:  MGEENSSASSSAMVAASTTTASIISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYVLGTKTQPAEFIETSTDSGK-KLVENPLYEEWTTVD

Query:  QALSGWLFGSMSPSIAADVVNFKTSREIWKALEEVYGATSKARVNQLRGILQNTKKGSMKMIDYLAVMKQASENLKLAGNPVSFGDLISYVLAGLDPEYI
        QA  GWLFGSM+PSIAADVVN +TS E+W ALE ++G+TSKAR+NQLR  LQNTKKG+MKM  YLA MKQ SE+LKLAG PV+   L S +L G + EY+
Subjt:  QALSGWLFGSMSPSIAADVVNFKTSREIWKALEEVYGATSKARVNQLRGILQNTKKGSMKMIDYLAVMKQASENLKLAGNPVSFGDLISYVLAGLDPEYI

Query:  PIVCIIEDK
        PI+C IEDK
Subjt:  PIVCIIEDK

XP_022157748.1 uncharacterized protein LOC111024384 isoform X1 [Momordica charantia]2.8e-7062.34Show/hide
Query:  ENSSASSSAMVAASTTT---ASIISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYVLGTKTQPAEFIETSTDSGKK--LVENPLYEEWTTV
        ENSS     +   +  T   +   ++SFGHPL TVLTVKLD+KNY LWRGMVLA+LRGQK DGYVLGT  +P +F+ +    G    L  NP Y EW  V
Subjt:  ENSSASSSAMVAASTTT---ASIISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYVLGTKTQPAEFIETSTDSGKK--LVENPLYEEWTTV

Query:  DQALSGWLFGSMSPSIAADVVNFKTSREIWKALEEVYGATSKARVNQLRGILQNTKKGSMKMIDYLAVMKQASENLKLAGNPVSFGDLISYVLAGLDPEY
        DQAL GWLFGSM+PSIA DVV+F++SRE+WKALE++YGATSKAR+NQLR +LQNTKK S+KM +YL +MKQASE+LKLAG PV+F  L+S VL+GL+ EY
Subjt:  DQALSGWLFGSMSPSIAADVVNFKTSREIWKALEEVYGATSKARVNQLRGILQNTKKGSMKMIDYLAVMKQASENLKLAGNPVSFGDLISYVLAGLDPEY

Query:  IPIVCIIEDKDIKTWQELSSILVTFEGTLAR
        +PIVC IE KD  +WQEL + LVTFE TL R
Subjt:  IPIVCIIEDKDIKTWQELSSILVTFEGTLAR

XP_022157750.1 uncharacterized protein LOC111024384 isoform X2 [Momordica charantia]2.8e-7062.34Show/hide
Query:  ENSSASSSAMVAASTTT---ASIISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYVLGTKTQPAEFIETSTDSGKK--LVENPLYEEWTTV
        ENSS     +   +  T   +   ++SFGHPL TVLTVKLD+KNY LWRGMVLA+LRGQK DGYVLGT  +P +F+ +    G    L  NP Y EW  V
Subjt:  ENSSASSSAMVAASTTT---ASIISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYVLGTKTQPAEFIETSTDSGKK--LVENPLYEEWTTV

Query:  DQALSGWLFGSMSPSIAADVVNFKTSREIWKALEEVYGATSKARVNQLRGILQNTKKGSMKMIDYLAVMKQASENLKLAGNPVSFGDLISYVLAGLDPEY
        DQAL GWLFGSM+PSIA DVV+F++SRE+WKALE++YGATSKAR+NQLR +LQNTKK S+KM +YL +MKQASE+LKLAG PV+F  L+S VL+GL+ EY
Subjt:  DQALSGWLFGSMSPSIAADVVNFKTSREIWKALEEVYGATSKARVNQLRGILQNTKKGSMKMIDYLAVMKQASENLKLAGNPVSFGDLISYVLAGLDPEY

Query:  IPIVCIIEDKDIKTWQELSSILVTFEGTLAR
        +PIVC IE KD  +WQEL + LVTFE TL R
Subjt:  IPIVCIIEDKDIKTWQELSSILVTFEGTLAR

TrEMBL top hitse value%identityAlignment
A0A6J1CPQ7 ankyrin repeat-containing protein NPR4-like1.2e-6161.72Show/hide
Query:  MGEENSSASSSAMVAASTTTASIISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYVLGTKTQPAEFIETSTDSGK-KLVENPLYEEWTTVD
        M  E  S S++   +   T  S I+ SFGHPLST LTVKLD+KNY LW+GMVLA+L GQKVDGYVL TKT P+++  T++D+G  +   NP YEEW+ VD
Subjt:  MGEENSSASSSAMVAASTTTASIISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYVLGTKTQPAEFIETSTDSGK-KLVENPLYEEWTTVD

Query:  QALSGWLFGSMSPSIAADVVNFKTSREIWKALEEVYGATSKARVNQLRGILQNTKKGSMKMIDYLAVMKQASENLKLAGNPVSFGDLISYVLAGLDPEYI
        QA  GWLFGSM+PSIAADVVN +TS E+W ALE ++G+TSKAR+NQLR  LQNTKKG+MKM  YLA MKQ SE+LKLAG PV+   L S +L G + EY+
Subjt:  QALSGWLFGSMSPSIAADVVNFKTSREIWKALEEVYGATSKARVNQLRGILQNTKKGSMKMIDYLAVMKQASENLKLAGNPVSFGDLISYVLAGLDPEYI

Query:  PIVCIIEDK
        PI+C IEDK
Subjt:  PIVCIIEDK

A0A6J1DTZ7 uncharacterized protein LOC111024384 isoform X21.4e-7062.34Show/hide
Query:  ENSSASSSAMVAASTTT---ASIISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYVLGTKTQPAEFIETSTDSGKK--LVENPLYEEWTTV
        ENSS     +   +  T   +   ++SFGHPL TVLTVKLD+KNY LWRGMVLA+LRGQK DGYVLGT  +P +F+ +    G    L  NP Y EW  V
Subjt:  ENSSASSSAMVAASTTT---ASIISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYVLGTKTQPAEFIETSTDSGKK--LVENPLYEEWTTV

Query:  DQALSGWLFGSMSPSIAADVVNFKTSREIWKALEEVYGATSKARVNQLRGILQNTKKGSMKMIDYLAVMKQASENLKLAGNPVSFGDLISYVLAGLDPEY
        DQAL GWLFGSM+PSIA DVV+F++SRE+WKALE++YGATSKAR+NQLR +LQNTKK S+KM +YL +MKQASE+LKLAG PV+F  L+S VL+GL+ EY
Subjt:  DQALSGWLFGSMSPSIAADVVNFKTSREIWKALEEVYGATSKARVNQLRGILQNTKKGSMKMIDYLAVMKQASENLKLAGNPVSFGDLISYVLAGLDPEY

Query:  IPIVCIIEDKDIKTWQELSSILVTFEGTLAR
        +PIVC IE KD  +WQEL + LVTFE TL R
Subjt:  IPIVCIIEDKDIKTWQELSSILVTFEGTLAR

A0A6J1DU77 uncharacterized protein LOC111024384 isoform X11.4e-7062.34Show/hide
Query:  ENSSASSSAMVAASTTT---ASIISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYVLGTKTQPAEFIETSTDSGKK--LVENPLYEEWTTV
        ENSS     +   +  T   +   ++SFGHPL TVLTVKLD+KNY LWRGMVLA+LRGQK DGYVLGT  +P +F+ +    G    L  NP Y EW  V
Subjt:  ENSSASSSAMVAASTTT---ASIISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYVLGTKTQPAEFIETSTDSGKK--LVENPLYEEWTTV

Query:  DQALSGWLFGSMSPSIAADVVNFKTSREIWKALEEVYGATSKARVNQLRGILQNTKKGSMKMIDYLAVMKQASENLKLAGNPVSFGDLISYVLAGLDPEY
        DQAL GWLFGSM+PSIA DVV+F++SRE+WKALE++YGATSKAR+NQLR +LQNTKK S+KM +YL +MKQASE+LKLAG PV+F  L+S VL+GL+ EY
Subjt:  DQALSGWLFGSMSPSIAADVVNFKTSREIWKALEEVYGATSKARVNQLRGILQNTKKGSMKMIDYLAVMKQASENLKLAGNPVSFGDLISYVLAGLDPEY

Query:  IPIVCIIEDKDIKTWQELSSILVTFEGTLAR
        +PIVC IE KD  +WQEL + LVTFE TL R
Subjt:  IPIVCIIEDKDIKTWQELSSILVTFEGTLAR

A0A803R2Q0 Uncharacterized protein1.4e-4339.36Show/hide
Query:  ENSSASSSAMVAASTTTASIISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYVLGTKTQPAEFIETSTDS----GKKLVENPLYEEWTTVD
        +N+ A+S+  +  S++  S IS+ F + LS   ++KLD  N+ LW+ MV  I+RG ++DG++ G +  P EFI T   +    G  +  NP YE W   D
Subjt:  ENSSASSSAMVAASTTTASIISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYVLGTKTQPAEFIETSTDS----GKKLVENPLYEEWTTVD

Query:  QALSGWLFGSMSPSIAADVVNFKTSREIWKALEEVYGATSKARVNQLRGILQNTKKGSMKMIDYLAVMKQASENLKLAGNPVSFGDLISYVLAGLDPEYI
        Q L GWL+GSM+ +IA++V+  +++  +W ALEE+YGA S+A +++LR  +Q T+KGS  M +YL + +  +++L LAG P     L+S VL+GLD EY+
Subjt:  QALSGWLFGSMSPSIAADVVNFKTSREIWKALEEVYGATSKARVNQLRGILQNTKKGSMKMIDYLAVMKQASENLKLAGNPVSFGDLISYVLAGLDPEYI

Query:  PIVCIIEDKDIKTWQELSSILVTFEGTLARYTTPANTHSDLPDLAAHFA
         IV +IE ++  +WQ+L S+L++F+G L R  T + +   + + +A+FA
Subjt:  PIVCIIEDKDIKTWQELSSILVTFEGTLARYTTPANTHSDLPDLAAHFA

A0A803R2Q2 Uncharacterized protein1.4e-4339.36Show/hide
Query:  ENSSASSSAMVAASTTTASIISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYVLGTKTQPAEFIETSTDS----GKKLVENPLYEEWTTVD
        +N+ A+S+  +  S++  S IS+ F + LS   ++KLD  N+ LW+ MV  I+RG ++DG++ G +  P EFI T   +    G  +  NP YE W   D
Subjt:  ENSSASSSAMVAASTTTASIISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYVLGTKTQPAEFIETSTDS----GKKLVENPLYEEWTTVD

Query:  QALSGWLFGSMSPSIAADVVNFKTSREIWKALEEVYGATSKARVNQLRGILQNTKKGSMKMIDYLAVMKQASENLKLAGNPVSFGDLISYVLAGLDPEYI
        Q L GWL+GSM+ +IA++V+  +++  +W ALEE+YGA S+A +++LR  +Q T+KGS  M +YL + +  +++L LAG P     L+S VL+GLD EY+
Subjt:  QALSGWLFGSMSPSIAADVVNFKTSREIWKALEEVYGATSKARVNQLRGILQNTKKGSMKMIDYLAVMKQASENLKLAGNPVSFGDLISYVLAGLDPEYI

Query:  PIVCIIEDKDIKTWQELSSILVTFEGTLARYTTPANTHSDLPDLAAHFA
         IV +IE ++  +WQ+L S+L++F+G L R  T + +   + + +A+FA
Subjt:  PIVCIIEDKDIKTWQELSSILVTFEGTLARYTTPANTHSDLPDLAAHFA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)9.9e-1326.24Show/hide
Query:  LDEKNYLLWRGMVLAILRGQKVDGYVLGTKTQPAEFIETSTDSGKKLVENPLYEEWTTVDQALSGWLFGSMSP-SIAADVVNFKTSREIWKALEEVYGAT
        ++E NY  WR + L       V G++ GT                 L  N     W   D  +   L+G+++P       V   TSR+IW  ++  +   
Subjt:  LDEKNYLLWRGMVLAILRGQKVDGYVLGTKTQPAEFIETSTDSGKKLVENPLYEEWTTVDQALSGWLFGSMSP-SIAADVVNFKTSREIWKALEEVYGAT

Query:  SKARVNQLRGILQNTKKGSMKMIDYLAVMKQASENLKLAGNPVSFGDLISYVLAGLDPEYIPIVCIIEDKD-IKTWQELSSILVTFEGTLARYTTPANTH
          AR  +L   L+    G M++ DY   MK+ +++L+    PV+  +L+ YVL GL+P++  I+ +I+ +    ++ + +++L   E  L R   P  TH
Subjt:  SKARVNQLRGILQNTKKGSMKMIDYLAVMKQASENLKLAGNPVSFGDLISYVLAGLDPEYIPIVCIIEDKD-IKTWQELSSILVTFEGTLARYTTPANTH

Query:  SD
         D
Subjt:  SD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGAAGAAAATTCTTCTGCTTCTTCGTCAGCAATGGTTGCAGCTTCGACTACGACAGCGAGTATCATTAGTTCTTCGTTTGGGCATCCTCTGAGCACAGTTCTTAC
TGTGAAGCTTGACGAAAAGAACTACCTCCTATGGAGAGGTATGGTTCTGGCCATTCTTCGAGGTCAGAAAGTCGATGGGTATGTCTTAGGGACAAAAACCCAGCCCGCAG
AGTTCATCGAAACGTCGACTGATTCAGGTAAAAAGCTTGTTGAAAATCCACTTTATGAGGAGTGGACGACAGTAGACCAGGCCCTTTCTGGCTGGTTGTTTGGTTCGATG
TCTCCATCAATTGCTGCAGATGTGGTCAATTTCAAGACCTCGCGTGAGATTTGGAAGGCATTAGAGGAGGTGTATGGAGCCACTAGCAAAGCTCGGGTGAATCAACTCAG
AGGCATTCTTCAGAATACAAAGAAGGGTTCGATGAAGATGATCGACTATTTGGCCGTCATGAAACAGGCGTCGGAGAATTTGAAGCTTGCGGGTAATCCTGTTTCCTTTG
GCGATTTAATTTCCTATGTACTCGCGGGTCTAGATCCTGAATATATTCCGATAGTCTGTATAATTGAAGACAAGGATATAAAAACTTGGCAAGAACTCAGCTCTATTTTG
GTCACTTTTGAAGGAACGTTGGCTCGTTACACTACACCAGCTAATACTCATTCTGATTTACCTGATTTAGCTGCTCATTTTGCCCTAAATAGATAG
mRNA sequenceShow/hide mRNA sequence
ATGGGAGAAGAAAATTCTTCTGCTTCTTCGTCAGCAATGGTTGCAGCTTCGACTACGACAGCGAGTATCATTAGTTCTTCGTTTGGGCATCCTCTGAGCACAGTTCTTAC
TGTGAAGCTTGACGAAAAGAACTACCTCCTATGGAGAGGTATGGTTCTGGCCATTCTTCGAGGTCAGAAAGTCGATGGGTATGTCTTAGGGACAAAAACCCAGCCCGCAG
AGTTCATCGAAACGTCGACTGATTCAGGTAAAAAGCTTGTTGAAAATCCACTTTATGAGGAGTGGACGACAGTAGACCAGGCCCTTTCTGGCTGGTTGTTTGGTTCGATG
TCTCCATCAATTGCTGCAGATGTGGTCAATTTCAAGACCTCGCGTGAGATTTGGAAGGCATTAGAGGAGGTGTATGGAGCCACTAGCAAAGCTCGGGTGAATCAACTCAG
AGGCATTCTTCAGAATACAAAGAAGGGTTCGATGAAGATGATCGACTATTTGGCCGTCATGAAACAGGCGTCGGAGAATTTGAAGCTTGCGGGTAATCCTGTTTCCTTTG
GCGATTTAATTTCCTATGTACTCGCGGGTCTAGATCCTGAATATATTCCGATAGTCTGTATAATTGAAGACAAGGATATAAAAACTTGGCAAGAACTCAGCTCTATTTTG
GTCACTTTTGAAGGAACGTTGGCTCGTTACACTACACCAGCTAATACTCATTCTGATTTACCTGATTTAGCTGCTCATTTTGCCCTAAATAGATAG
Protein sequenceShow/hide protein sequence
MGEENSSASSSAMVAASTTTASIISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYVLGTKTQPAEFIETSTDSGKKLVENPLYEEWTTVDQALSGWLFGSM
SPSIAADVVNFKTSREIWKALEEVYGATSKARVNQLRGILQNTKKGSMKMIDYLAVMKQASENLKLAGNPVSFGDLISYVLAGLDPEYIPIVCIIEDKDIKTWQELSSIL
VTFEGTLARYTTPANTHSDLPDLAAHFALNR