; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0036482 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0036482
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationchr3:47055843..47056244
RNA-Seq ExpressionLag0036482
SyntenyLag0036482
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022148562.1 uncharacterized protein LOC111017196 [Momordica charantia]4.7e-3262.39Show/hide
Query:  IDVKNKFGFADGTIPRLSDESLNSWIICNSVVTVWILNSLSAEISASINFADTTYEIWTDLRDCYQQKNLPRVFQILREISTLLQNQDSVTTYYARLKTL
        + VKNKFG  DG+IPR  DE LNSWIICN+VV  WILNSLS EISAS+ FAD+  EIW DL++ +Q++N PR+FQ+ R++STL+Q+Q SV+ Y+ +LKTL
Subjt:  IDVKNKFGFADGTIPRLSDESLNSWIICNSVVTVWILNSLSAEISASINFADTTYEIWTDLRDCYQQKNLPRVFQILREISTLLQNQDSVTTYYARLKTL

Query:  WNELSSYRP
        W EL++YRP
Subjt:  WNELSSYRP

XP_038875043.1 uncharacterized protein LOC120067569 [Benincasa hispida]4.0e-3165.14Show/hide
Query:  IDVKNKFGFADGTIPRLSDESLNSWIICNSVVTVWILNSLSAEISASINFADTTYEIWTDLRDCYQQKNLPRVFQILREISTLLQNQDSVTTYYARLKTL
        + VKNK GF +  IP  S E L+SWIICN VVT WILNSLS EIS SINF+++  EIW D ++ YQ KN PRVFQ+  EIS L+QNQDSVT YY +LK L
Subjt:  IDVKNKFGFADGTIPRLSDESLNSWIICNSVVTVWILNSLSAEISASINFADTTYEIWTDLRDCYQQKNLPRVFQILREISTLLQNQDSVTTYYARLKTL

Query:  WNELSSYRP
        WNEL SYRP
Subjt:  WNELSSYRP

XP_038887168.1 uncharacterized protein LOC120077355 [Benincasa hispida]6.4e-3772.48Show/hide
Query:  IDVKNKFGFADGTIPRLSDESLNSWIICNSVVTVWILNSLSAEISASINFADTTYEIWTDLRDCYQQKNLPRVFQILREISTLLQNQDSVTTYYARLKTL
        + VKNK GF +G IPR S E L+SWIICN VVT WILNSLS EISASINF+D+  EIW DL++ YQ++N PRVFQ+ REIS L+QNQDSVTTYYA+LKTL
Subjt:  IDVKNKFGFADGTIPRLSDESLNSWIICNSVVTVWILNSLSAEISASINFADTTYEIWTDLRDCYQQKNLPRVFQILREISTLLQNQDSVTTYYARLKTL

Query:  WNELSSYRP
        WNEL SYRP
Subjt:  WNELSSYRP

XP_038887186.1 uncharacterized protein LOC120077373 [Benincasa hispida]9.6e-3363.93Show/hide
Query:  ESGYDD-------RIDVKNKFGFADGTIPRLSDESLNSWIICNSVVTVWILNSLSAEISASINFADTTYEIWTDLRDCYQQKNLPRVFQILREISTLLQN
        ES YD         + VKNK GF +G IPR S E L+SWII N +VT WILNSLS EI ASINF+D+  EIW DL++ YQ+KN PRVFQ+ REIS L QN
Subjt:  ESGYDD-------RIDVKNKFGFADGTIPRLSDESLNSWIICNSVVTVWILNSLSAEISASINFADTTYEIWTDLRDCYQQKNLPRVFQILREISTLLQN

Query:  QDSVTTYYARLKTLWNELSSYR
        QDSVTTYYA+LK LWNE  SYR
Subjt:  QDSVTTYYARLKTLWNELSSYR

XP_038904477.1 uncharacterized protein LOC120090845 [Benincasa hispida]6.0e-3569.72Show/hide
Query:  IDVKNKFGFADGTIPRLSDESLNSWIICNSVVTVWILNSLSAEISASINFADTTYEIWTDLRDCYQQKNLPRVFQILREISTLLQNQDSVTTYYARLKTL
        + VKNK GF +G I R S E L+SWIICN +VT WILNSLS EISASINF+D+  EIW DL++ YQ+KN PRVFQ+ RE S L QNQDS+TTYYA+LKTL
Subjt:  IDVKNKFGFADGTIPRLSDESLNSWIICNSVVTVWILNSLSAEISASINFADTTYEIWTDLRDCYQQKNLPRVFQILREISTLLQNQDSVTTYYARLKTL

Query:  WNELSSYRP
        WNEL SYRP
Subjt:  WNELSSYRP

TrEMBL top hitse value%identityAlignment
A0A5J5B2C5 Uncharacterized protein1.4e-2454.46Show/hide
Query:  IDVKNKFGFADGTIPR---LSDESLNSWIICNSVVTVWILNSLSAEISASINFADTTYEIWTDLRDCYQQKNLPRVFQILREISTLLQNQDSVTTYYARL
        + VKNK GF DG IP      +  L+SWI  N++V  WILNS+S EISASI FA    EIW DLRD +QQ+N PR+FQ+ RE+  L Q Q SV+ Y+ ++
Subjt:  IDVKNKFGFADGTIPR---LSDESLNSWIICNSVVTVWILNSLSAEISASINFADTTYEIWTDLRDCYQQKNLPRVFQILREISTLLQNQDSVTTYYARL

Query:  KTLWNELSSYRP
        KT+W ELS+YRP
Subjt:  KTLWNELSSYRP

A0A5J5BIH5 Uncharacterized protein4.2e-2655.36Show/hide
Query:  IDVKNKFGFADGTIPR---LSDESLNSWIICNSVVTVWILNSLSAEISASINFADTTYEIWTDLRDCYQQKNLPRVFQILREISTLLQNQDSVTTYYARL
        + VKNK GF DG+IP      ++ +NSWI  N++V  WILNS+S EISASI FA +  EIW DLRD +QQ+N PR+FQ+ RE+  L Q Q SV+ Y+ +L
Subjt:  IDVKNKFGFADGTIPR---LSDESLNSWIICNSVVTVWILNSLSAEISASINFADTTYEIWTDLRDCYQQKNLPRVFQILREISTLLQNQDSVTTYYARL

Query:  KTLWNELSSYRP
        KT+W ELS+YRP
Subjt:  KTLWNELSSYRP

A0A6J1D5E3 uncharacterized protein LOC1110171962.3e-3262.39Show/hide
Query:  IDVKNKFGFADGTIPRLSDESLNSWIICNSVVTVWILNSLSAEISASINFADTTYEIWTDLRDCYQQKNLPRVFQILREISTLLQNQDSVTTYYARLKTL
        + VKNKFG  DG+IPR  DE LNSWIICN+VV  WILNSLS EISAS+ FAD+  EIW DL++ +Q++N PR+FQ+ R++STL+Q+Q SV+ Y+ +LKTL
Subjt:  IDVKNKFGFADGTIPRLSDESLNSWIICNSVVTVWILNSLSAEISASINFADTTYEIWTDLRDCYQQKNLPRVFQILREISTLLQNQDSVTTYYARLKTL

Query:  WNELSSYRP
        W EL++YRP
Subjt:  WNELSSYRP

A0A6J1DIP8 uncharacterized protein LOC1110203991.7e-2756.88Show/hide
Query:  IDVKNKFGFADGTIPRLSDESLNSWIICNSVVTVWILNSLSAEISASINFADTTYEIWTDLRDCYQQKNLPRVFQILREISTLLQNQDSVTTYYARLKTL
        + VKNK GF DG+I R + + L+SWIICN+VV  WILNSLS EISASI F+D+  EIW DL++ ++++N PR+FQ+ R++S L+Q+Q SV+ Y+  LKTL
Subjt:  IDVKNKFGFADGTIPRLSDESLNSWIICNSVVTVWILNSLSAEISASINFADTTYEIWTDLRDCYQQKNLPRVFQILREISTLLQNQDSVTTYYARLKTL

Query:  WNELSSYRP
        W EL+SY P
Subjt:  WNELSSYRP

A0A6J1DNP7 uncharacterized protein LOC1110220651.8e-2958.72Show/hide
Query:  IDVKNKFGFADGTIPRLSDESLNSWIICNSVVTVWILNSLSAEISASINFADTTYEIWTDLRDCYQQKNLPRVFQILREISTLLQNQDSVTTYYARLKTL
        + VKNK GF DG+I R +D  L+SWIICN+VV  WI NSLS +ISAS+ F+D+ +EIW DL++ +Q++N PR+FQ+ RE+S L Q+Q SVT Y+ RLKTL
Subjt:  IDVKNKFGFADGTIPRLSDESLNSWIICNSVVTVWILNSLSAEISASINFADTTYEIWTDLRDCYQQKNLPRVFQILREISTLLQNQDSVTTYYARLKTL

Query:  WNELSSYRP
        W+EL+ YRP
Subjt:  WNELSSYRP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).4.6e-1736.7Show/hide
Query:  VKNKFGFADGTIPRLSDES--LNSWIICNSVVTVWILNSLSAEISASINFADTTYEIWTDLRDCYQQKNLPRVFQILREISTLLQNQDSVTTYYARLKTL
        V  KFGF DGT+P+    S     W  CN++V  W++NS++ ++  S+ +A+T +++W DLR  +      +++Q+ R ++TL Q  DSV  Y+ +L  +
Subjt:  VKNKFGFADGTIPRLSDES--LNSWIICNSVVTVWILNSLSAEISASINFADTTYEIWTDLRDCYQQKNLPRVFQILREISTLLQNQDSVTTYYARLKTL

Query:  WNELSSYRP
        W ELS Y P
Subjt:  WNELSSYRP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGTCAGGCTATGACGATCGGATTGACGTCAAGAACAAATTTGGTTTTGCAGATGGAACGATTCCTCGCCTGTCTGATGAATCGCTCAATTCGTGGATCATTTGCAA
CAGCGTAGTCACTGTGTGGATTCTCAATTCCTTATCTGCAGAAATTTCTGCCAGTATCAATTTTGCTGATACAACCTATGAGATTTGGACTGATCTTCGAGATTGCTACC
AGCAAAAGAATCTTCCTCGAGTTTTTCAAATTCTCCGAGAAATTTCCACCTTACTACAGAATCAAGATTCTGTCACCACGTACTATGCAAGACTGAAGACTCTCTGGAAC
GAACTTTCTTCTTACCGACCCTTCTTGTTCTTGCGGTGGAGTAAAGGAGCTTACGAGTTATTTTCAAACTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGTCAGGCTATGACGATCGGATTGACGTCAAGAACAAATTTGGTTTTGCAGATGGAACGATTCCTCGCCTGTCTGATGAATCGCTCAATTCGTGGATCATTTGCAA
CAGCGTAGTCACTGTGTGGATTCTCAATTCCTTATCTGCAGAAATTTCTGCCAGTATCAATTTTGCTGATACAACCTATGAGATTTGGACTGATCTTCGAGATTGCTACC
AGCAAAAGAATCTTCCTCGAGTTTTTCAAATTCTCCGAGAAATTTCCACCTTACTACAGAATCAAGATTCTGTCACCACGTACTATGCAAGACTGAAGACTCTCTGGAAC
GAACTTTCTTCTTACCGACCCTTCTTGTTCTTGCGGTGGAGTAAAGGAGCTTACGAGTTATTTTCAAACTGA
Protein sequenceShow/hide protein sequence
MESGYDDRIDVKNKFGFADGTIPRLSDESLNSWIICNSVVTVWILNSLSAEISASINFADTTYEIWTDLRDCYQQKNLPRVFQILREISTLLQNQDSVTTYYARLKTLWN
ELSSYRPFLFLRWSKGAYELFSN