; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0018315 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0018315
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationchr5:22810380..22817473
RNA-Seq ExpressionLag0018315
SyntenyLag0018315
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0047821.1 uncharacterized protein E6C27_scaffold133G00730 [Cucumis melo var. makuwa]9.8e-0778.38Show/hide
Query:  ESGQHADSISLPFWGQYRVRSWEHNHTRWNSLLPDSR
        ESGQ ADS+ L FWGQ RV SWEHNHTRWNSLLP  R
Subjt:  ESGQHADSISLPFWGQYRVRSWEHNHTRWNSLLPDSR

TYK05792.1 gag/pol protein [Cucumis melo var. makuwa]3.7e-0673.68Show/hide
Query:  GESGQHADSISLPFWGQYRVRSWEHNHTRWNSLLPDSR
        GESGQ  DSI L FWGQ RV SWEHNHTRWNS +P  R
Subjt:  GESGQHADSISLPFWGQYRVRSWEHNHTRWNSLLPDSR

XP_022934554.1 uncharacterized protein LOC111441700 [Cucurbita moschata]9.1e-0534.43Show/hide
Query:  VDTEAKKKKRFIACLRDDVQRVVGALGLADYAAALRAATFMGMPTVNATPVAKESEPNAGQKRKHEQTTTNLLR-SQPSSESSRQKTQHDRQ----EGNG
        V+T+ K  ++F+  L   ++  V A+    YAAALRAA  M  P  +  P +        QKR+ +QT  N L   Q    S RQ+  + R+    E   
Subjt:  VDTEAKKKKRFIACLRDDVQRVVGALGLADYAAALRAATFMGMPTVNATPVAKESEPNAGQKRKHEQTTTNLLR-SQPSSESSRQKTQHDRQ----EGNG

Query:  NEKPKCNSCGRHHWGQCMARRG
          +PKC  C ++HWGQC+ R G
Subjt:  NEKPKCNSCGRHHWGQCMARRG

XP_023537968.1 uncharacterized protein LOC111798850 [Cucurbita pepo subsp. pepo]4.4e-0737.1Show/hide
Query:  VDTEAKKKKRFIACLRDDVQRVVGALGLADYAAALRAATFMGMPTVNATPVAKESEPNAGQKRKHEQTTTN---LLRSQPS----SESSRQKTQHDRQEG
        VDTE KK +RFI  L ++++ +VGA+    Y  ALR+AT +   +V          P  GQKR ++Q   +   L++ Q       +  ++  Q  RQ G
Subjt:  VDTEAKKKKRFIACLRDDVQRVVGALGLADYAAALRAATFMGMPTVNATPVAKESEPNAGQKRKHEQTTTN---LLRSQPS----SESSRQKTQHDRQEG

Query:  NGNEKPKCNSCGRHHWGQCMARRG
         G+ KP C  CG++HWGQC+AR G
Subjt:  NGNEKPKCNSCGRHHWGQCMARRG

XP_038880159.1 uncharacterized protein LOC120071839 [Benincasa hispida]2.2e-1136.99Show/hide
Query:  GQYRVRSWEHNHTRWNSLLPDSRVSRAVDTEAKKKKRFIACLRDDVQRVVGALGLADYAAALRAATFMGMPTVNATPVAKESEPNAGQKRKHEQTTTNLL
        G   V  +E   TR +   PD      V TEAK+ +RF+  LRD+V+ +V AL   +YA A RAA  +G P+      +   EP +GQKRK EQ      
Subjt:  GQYRVRSWEHNHTRWNSLLPDSRVSRAVDTEAKKKKRFIACLRDDVQRVVGALGLADYAAALRAATFMGMPTVNATPVAKESEPNAGQKRKHEQTTTNLL

Query:  RSQPSSESSRQKTQHDRQEGNG--NEKPKCNSCGRHHWGQCMARRG
        +   S++ S+   Q       G   E+P C SCG+HHWG C+   G
Subjt:  RSQPSSESSRQKTQHDRQEGNG--NEKPKCNSCGRHHWGQCMARRG

TrEMBL top hitse value%identityAlignment
A0A5A7U2P4 Integrase catalytic domain-containing protein4.7e-0778.38Show/hide
Query:  ESGQHADSISLPFWGQYRVRSWEHNHTRWNSLLPDSR
        ESGQ ADS+ L FWGQ RV SWEHNHTRWNSLLP  R
Subjt:  ESGQHADSISLPFWGQYRVRSWEHNHTRWNSLLPDSR

A0A5D3C3J6 Gag/pol protein1.8e-0673.68Show/hide
Query:  GESGQHADSISLPFWGQYRVRSWEHNHTRWNSLLPDSR
        GESGQ  DSI L FWGQ RV SWEHNHTRWNS +P  R
Subjt:  GESGQHADSISLPFWGQYRVRSWEHNHTRWNSLLPDSR

A0A6J1F0T4 uncharacterized protein LOC1114384606.4e-0429.58Show/hide
Query:  WGQYRVRSWEHNHTRWNSLLPDSRVSRAVDTEAKKKKRFIACLRDDVQRVVGALGLADYAAALRAATFM-GMPTVNATPVAKESEPNAGQKRKHEQTTTN
        W +++    E  +T       D      V+T+ +  +RF+  L   +++ V A+  A YAAALRAA  M G+ + + TP         GQK +HE     
Subjt:  WGQYRVRSWEHNHTRWNSLLPDSRVSRAVDTEAKKKKRFIACLRDDVQRVVGALGLADYAAALRAATFM-GMPTVNATPVAKESEPNAGQKRKHEQTTTN

Query:  LLRSQPSSESSRQKTQHDRQEGNGNEKPKCNSCGRHHWGQCM
                            E   ++KPKCN CG++HWGQC+
Subjt:  LLRSQPSSESSRQKTQHDRQEGNGNEKPKCNSCGRHHWGQCM

A0A6J1F2Y2 uncharacterized protein LOC1114417004.4e-0534.43Show/hide
Query:  VDTEAKKKKRFIACLRDDVQRVVGALGLADYAAALRAATFMGMPTVNATPVAKESEPNAGQKRKHEQTTTNLLR-SQPSSESSRQKTQHDRQ----EGNG
        V+T+ K  ++F+  L   ++  V A+    YAAALRAA  M  P  +  P +        QKR+ +QT  N L   Q    S RQ+  + R+    E   
Subjt:  VDTEAKKKKRFIACLRDDVQRVVGALGLADYAAALRAATFMGMPTVNATPVAKESEPNAGQKRKHEQTTTNLLR-SQPSSESSRQKTQHDRQ----EGNG

Query:  NEKPKCNSCGRHHWGQCMARRG
          +PKC  C ++HWGQC+ R G
Subjt:  NEKPKCNSCGRHHWGQCMARRG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTGAGAGTGGCCAGCACGCCGACTCAATAAGCCTACCATTTTGGGGACAATACCGAGTGCGGAGCTGGGAACATAATCACACAAGATGGAATTCACTCCTTCCCGA
CTCAAGGGTAAGTAGAGCGGTGGATACGGAGGCGAAGAAGAAAAAGAGGTTCATCGCGTGCCTCAGAGATGACGTCCAGAGGGTTGTTGGAGCCCTTGGCCTAGCGGACT
ACGCAGCGGCCCTTCGAGCGGCCACCTTTATGGGCATGCCAACTGTCAATGCAACCCCAGTAGCAAAGGAGTCGGAGCCCAACGCAGGACAGAAGAGGAAACATGAGCAG
ACAACTACCAACCTCCTGCGATCTCAACCTTCATCCGAAAGTTCTAGACAAAAAACTCAGCATGACAGGCAAGAAGGCAATGGAAACGAGAAACCAAAGTGCAACTCTTG
TGGAAGACATCATTGGGGTCAGTGCATGGCGAGGAGAGGTCCCTCGTTGCCACCGCACATCGACGACGCCGTAGCAGGACAGCCGCCGTCACCTTCGGCTCAGCCCTCGT
TGCCGCCGCACATTGACGACTCTGCAACACCCGAGTCCGCTGGCACCTCCCGCACGGTCTGTCGCCGAGCAGCCCCGTCTTCTGCCGCCGACCAGCCCCGTTTTGCCCAC
TGTACTGGTTGCCGTCAAGGATTTATAGACTCTGCCCGTTGTTTAAATCTCAGAAGCAGTGAGATTAACGCTAATTTACAGTATGGTCCGACAAACTTGCAAGTTGGTTC
AAGTGGATTTTGTAGGCGGATAAGGAGTCTGTTGGAAAGGGGAATTAAGCGAGTTCTAGGACTTTGCGAGGAAGATGCACTCGCAAGAGGAGATAAAAGAAAGAAATGCA
CCCAATATGCGGCCACAACTGGGCTTTTCGACAGAATACCGAGATTCGCAGGATCTTCACAGAATCGTCTGAAAATCTCTAATCTGGTATCAAATCTCCGAGGATCGTGG
ATTCTTACCTTTCAAGACCTTCTCACACTGATCTCGGCAGCCAACCTCACCTTCCTAGTTGTCCAGCAGCTTCAGCGTCTCGACGCTGTGACATATGAAACTTGGCAGCC
ACGAAGCTTTAGCATCGAGACGCTGCCTCATTTCCAGATTGTTCTCCATGGCTTGCTTAGCGTCGAGACGCCCAAAGCTCTCGGATCTCCTCTTCAAAATGATGGCCCTT
TGGATTTCATTTTGGGCCTCCAACTCAGCTCTTTTGGGCTTGGATCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGTGAGAGTGGCCAGCACGCCGACTCAATAAGCCTACCATTTTGGGGACAATACCGAGTGCGGAGCTGGGAACATAATCACACAAGATGGAATTCACTCCTTCCCGA
CTCAAGGGTAAGTAGAGCGGTGGATACGGAGGCGAAGAAGAAAAAGAGGTTCATCGCGTGCCTCAGAGATGACGTCCAGAGGGTTGTTGGAGCCCTTGGCCTAGCGGACT
ACGCAGCGGCCCTTCGAGCGGCCACCTTTATGGGCATGCCAACTGTCAATGCAACCCCAGTAGCAAAGGAGTCGGAGCCCAACGCAGGACAGAAGAGGAAACATGAGCAG
ACAACTACCAACCTCCTGCGATCTCAACCTTCATCCGAAAGTTCTAGACAAAAAACTCAGCATGACAGGCAAGAAGGCAATGGAAACGAGAAACCAAAGTGCAACTCTTG
TGGAAGACATCATTGGGGTCAGTGCATGGCGAGGAGAGGTCCCTCGTTGCCACCGCACATCGACGACGCCGTAGCAGGACAGCCGCCGTCACCTTCGGCTCAGCCCTCGT
TGCCGCCGCACATTGACGACTCTGCAACACCCGAGTCCGCTGGCACCTCCCGCACGGTCTGTCGCCGAGCAGCCCCGTCTTCTGCCGCCGACCAGCCCCGTTTTGCCCAC
TGTACTGGTTGCCGTCAAGGATTTATAGACTCTGCCCGTTGTTTAAATCTCAGAAGCAGTGAGATTAACGCTAATTTACAGTATGGTCCGACAAACTTGCAAGTTGGTTC
AAGTGGATTTTGTAGGCGGATAAGGAGTCTGTTGGAAAGGGGAATTAAGCGAGTTCTAGGACTTTGCGAGGAAGATGCACTCGCAAGAGGAGATAAAAGAAAGAAATGCA
CCCAATATGCGGCCACAACTGGGCTTTTCGACAGAATACCGAGATTCGCAGGATCTTCACAGAATCGTCTGAAAATCTCTAATCTGGTATCAAATCTCCGAGGATCGTGG
ATTCTTACCTTTCAAGACCTTCTCACACTGATCTCGGCAGCCAACCTCACCTTCCTAGTTGTCCAGCAGCTTCAGCGTCTCGACGCTGTGACATATGAAACTTGGCAGCC
ACGAAGCTTTAGCATCGAGACGCTGCCTCATTTCCAGATTGTTCTCCATGGCTTGCTTAGCGTCGAGACGCCCAAAGCTCTCGGATCTCCTCTTCAAAATGATGGCCCTT
TGGATTTCATTTTGGGCCTCCAACTCAGCTCTTTTGGGCTTGGATCTTGA
Protein sequenceShow/hide protein sequence
MGESGQHADSISLPFWGQYRVRSWEHNHTRWNSLLPDSRVSRAVDTEAKKKKRFIACLRDDVQRVVGALGLADYAAALRAATFMGMPTVNATPVAKESEPNAGQKRKHEQ
TTTNLLRSQPSSESSRQKTQHDRQEGNGNEKPKCNSCGRHHWGQCMARRGPSLPPHIDDAVAGQPPSPSAQPSLPPHIDDSATPESAGTSRTVCRRAAPSSAADQPRFAH
CTGCRQGFIDSARCLNLRSSEINANLQYGPTNLQVGSSGFCRRIRSLLERGIKRVLGLCEEDALARGDKRKKCTQYAATTGLFDRIPRFAGSSQNRLKISNLVSNLRGSW
ILTFQDLLTLISAANLTFLVVQQLQRLDAVTYETWQPRSFSIETLPHFQIVLHGLLSVETPKALGSPLQNDGPLDFILGLQLSSFGLGS