CuGenDBv2

Lag0041339 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0041339
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Integrase catalytic domain-containing protein
Genome location: chr13:15904363..15905046
RNA-Seq Expression: Lag0041339
Synteny: Lag0041339
Gene Ontology terms: NA
InterPro domains: IPR029472 - Retrotransposon Copia-like, N-terminal
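The genome location field above can be turned into a span length with a few lines of Python. This is a minimal sketch, not part of the database record: the helper name `parse_location` is hypothetical, and the coordinates are assumed to be 1-based and inclusive, which the page itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into its three parts.

    Hypothetical helper; the format is inferred from the record above.
    """
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("chr13:15904363..15905046")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, span)  # chr13 684
```

Under that assumption the locus spans 684 bp.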


Homology
GenBank top hits | e value | % identity | Alignment
KAA0065480.1 Cysteine-rich RLK (receptor-like protein kinase) 8 [Cucumis melo var. makuwa] | e value: 1.1e-57 | % identity: 56.28
Query:  DALLNPYNLHHSYSSSVVLVTQPLLGANIYSSWRKAMLIALSGKNNEGFIDGTIKKP-EGRKSAAWKCNNDIITSWIFNSISKEIAASINYTGS------
        DA LNPY +HHS   +  +VTQPL GA  Y+SW +AML+A+SG+N  GFI G I+KP +G    AW CNNDI+ SWI NS+SKEIAASI Y GS      
Subjt:  DALLNPYNLHHSYSSSVVLVTQPLLGANIYSSWRKAMLIALSGKNNEGFIDGTIKKP-EGRKSAAWKCNNDIITSWIFNSISKEIAASINYTGS------

Query:  ------------------AQVVTPNQGTMSIESYNTKLKTIWQELCDYRPTLDCTCGGNKTFIEHLDSEFVMIFLMGLSDSYTSVRAQILLMKPIPSIDV
                           + VT  QG ++IE+Y TKLKTIWQ L +YR T DCTCGG K FI+HL+SE++M FLMGL+DSY +VRAQILLM+P+PSI+ 
Subjt:  ------------------AQVVTPNQGTMSIESYNTKLKTIWQELCDYRPTLDCTCGGNKTFIEHLDSEFVMIFLMGLSDSYTSVRAQILLMKPIPSIDV

Query:  VFALIVQEEQQRSVG
        VF+L++QEEQQRS G
Subjt:  VFALIVQEEQQRSVG

KYP75905.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] | e value: 3.9e-42 | % identity: 42.04
Query:  ANKEDDSLTVDALLNPYNLHHSYSSSVVLVTQPLLGANIYSSWRKAMLIALSGKNNEGFIDGTIKK--PEGRKSAAWKCNNDIITSWIFNSISKEIAASI
        +     SL  D   NPY LH S +    +V+QPL G N Y+SW +A+L+AL  KN  GF+DGTI K  P  +   +W+ NN+I+ SW+ N ISK++ AS+
Subjt:  ANKEDDSLTVDALLNPYNLHHSYSSSVVLVTQPLLGANIYSSWRKAMLIALSGKNNEGFIDGTIKK--PEGRKSAAWKCNNDIITSWIFNSISKEIAASI

Query:  NYTGSA------------------------QVVTPNQGTMSIESYNTKLKTIWQELCDYRPTLDCTCGGNKTFIEHLDSEFVMIFLMGLSDSYTSVRAQI
         Y+ SA                         +VT  QG+++I  Y TK+K +W+EL +Y+P+  CTCGG K +I+H  SE+ M+FLMGL++ Y+ +R QI
Subjt:  NYTGSA------------------------QVVTPNQGTMSIESYNTKLKTIWQELCDYRPTLDCTCGGNKTFIEHLDSEFVMIFLMGLSDSYTSVRAQI

Query:  LLMKPIPSIDVVFALIVQEEQQRSVG
        LLM PIP I+  F+L++QEE+Q+ +G
Subjt:  LLMKPIPSIDVVFALIVQEEQQRSVG

XP_015936173.1 uncharacterized protein LOC107462121 [Arachis duranensis] | e value: 2.3e-42 | % identity: 41.2
Query:  NPYNLHHSYSSSVVLVTQPLLGANIYSSWRKAMLIALSGKNNEGFIDGTIKKPEGRKSAA----WKCNNDIITSWIFNSISKEIAASINYTGSA------
        +P+ LH S   ++ LV+Q L G N Y SW+++M +AL+GKN  GF+DGTI  PE   S +    W+  NDI+++WI NS+SKEIA+S+ + GSA      
Subjt:  NPYNLHHSYSSSVVLVTQPLLGANIYSSWRKAMLIALSGKNNEGFIDGTIKKPEGRKSAA----WKCNNDIITSWIFNSISKEIAASINYTGSA------

Query:  ------------------QVVTPNQGTMSIESYNTKLKTIWQELCDYRPTLDCTCGGNKTFIEHLDSEFVMIFLMGLSDSYTSVRAQILLMKPIPSIDVV
                          +++   Q ++S+  + TKLK +W+EL  +RP++ C CGG + F+ H D E+V++FLMGL+D Y  VR+QILLMKP+PSI  V
Subjt:  ------------------QVVTPNQGTMSIESYNTKLKTIWQELCDYRPTLDCTCGGNKTFIEHLDSEFVMIFLMGLSDSYTSVRAQILLMKPIPSIDVV

Query:  FALIVQEEQQRSVGQSPLLQKSLLMTGGVRDLY
        F+LIVQEE QR +  +P     + +   V++ +
Subjt:  FALIVQEEQQRSVGQSPLLQKSLLMTGGVRDLY

XP_016191419.1 uncharacterized protein LOC107632230 [Arachis ipaensis] | e value: 5.4e-44 | % identity: 45.16
Query:  NPYNLHHSYSSSVVLVTQPLLGANIYSSWRKAMLIALSGKNNEGFIDGTIKKPEGRKSAAWKCN----NDIITSWIFNSISKEIAASINYTGSA------
        NP+ LH     ++ LVTQ L G N Y SW+++M +AL+GKN  GF+DGTI  PE   S ++K +    NDI+++WI NS+SKEIA+S+ + GSA      
Subjt:  NPYNLHHSYSSSVVLVTQPLLGANIYSSWRKAMLIALSGKNNEGFIDGTIKKPEGRKSAAWKCN----NDIITSWIFNSISKEIAASINYTGSA------

Query:  ------------------QVVTPNQGTMSIESYNTKLKTIWQELCDYRPTLDCTCGGNKTFIEHLDSEFVMIFLMGLSDSYTSVRAQILLMKPIPSIDVV
                          +++   Q ++SI  + TKLK +W+ELC +RP++ C CGG + F+ H D E+V++FLMGL+D Y  VR+QILLMKP+PSI  V
Subjt:  ------------------QVVTPNQGTMSIESYNTKLKTIWQELCDYRPTLDCTCGGNKTFIEHLDSEFVMIFLMGLSDSYTSVRAQILLMKPIPSIDVV

Query:  FALIVQEEQQRSVGQSP
        F+LIVQEE+QR +  +P
Subjt:  FALIVQEEQQRSVGQSP

XP_022145891.1 uncharacterized protein LOC111015239 [Momordica charantia] | e value: 2.8e-64 | % identity: 59.26
Query:  TVDALLNPYNLHHSYSSSVVLVTQPLLGANIYSSWRKAMLIALSGKNNEGFIDGTIKKPEGRKSAAWKCNNDIITSWIFNSISKEIAASINYTGSA----
        T+++ LNPY +HHS + + +LVTQ LLGA+ Y+SW ++MLIALSGKN  GFIDGTIKKP G   AAWKCNNDIITSWI NS+SKEIAASI YTGSA    
Subjt:  TVDALLNPYNLHHSYSSSVVLVTQPLLGANIYSSWRKAMLIALSGKNNEGFIDGTIKKPEGRKSAAWKCNNDIITSWIFNSISKEIAASINYTGSA----

Query:  --------------------QVVTPNQGTMSIESYNTKLKTIWQELCDYRPTLDCTCGGNKTFIEHLDSEFVMIFLMGLSDSYTSVRAQILLMKPIPSID
                            ++VT  QGT+SIE+Y TKLKT+WQEL DYRPT+DCTC G K+  E   SE+VM FLMGL++SY  +RAQILLM PIP ++
Subjt:  --------------------QVVTPNQGTMSIESYNTKLKTIWQELCDYRPTLDCTCGGNKTFIEHLDSEFVMIFLMGLSDSYTSVRAQILLMKPIPSID

Query:  VVFALIVQEEQQRSVG
         VF+L++QEE+QR++G
Subjt:  VVFALIVQEEQQRSVG

TrEMBL top hits | e value | % identity | Alignment
A0A151QT53 Uncharacterized protein | e value: 1.0e-40 | % identity: 43.36
Query:  SLTVDALLNPYNLHHSYSSSVVLVTQPLLGANIYSSWRKAMLIALSGKNNEGFIDGTIKKP--EGRKSAAWKCNNDIITSWIFNSISKEIAASINYTGSA
        S T D   +P  LH S + S V+V+QPL  +N + SW  AM +AL GKN  GFIDG++ KP  +  K  +WK NN II SWI NS+SK+IAASI+YT +A
Subjt:  SLTVDALLNPYNLHHSYSSSVVLVTQPLLGANIYSSWRKAMLIALSGKNNEGFIDGTIKKP--EGRKSAAWKCNNDIITSWIFNSISKEIAASINYTGSA

Query:  ------------------------QVVTPNQGTMSIESYNTKLKTIWQELCDYRPTLDCTCGGNKTFIEHLDSEFVMIFLMGLSDSYTSVRAQILLMKPI
                                 ++   QG ++I  Y TK+K+ W+EL +++P   CTCGG K +++H + + V++FL GL+DSY+ VR QILLM P+
Subjt:  ------------------------QVVTPNQGTMSIESYNTKLKTIWQELCDYRPTLDCTCGGNKTFIEHLDSEFVMIFLMGLSDSYTSVRAQILLMKPI

Query:  PSIDVVFALIVQEEQQRSVGQSPLLQ
        P ++ VF+L+ QEE QR +G +  LQ
Subjt:  PSIDVVFALIVQEEQQRSVGQSPLLQ

A0A151U9A5 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 1.9e-42 | % identity: 42.04
Query:  ANKEDDSLTVDALLNPYNLHHSYSSSVVLVTQPLLGANIYSSWRKAMLIALSGKNNEGFIDGTIKK--PEGRKSAAWKCNNDIITSWIFNSISKEIAASI
        +     SL  D   NPY LH S +    +V+QPL G N Y+SW +A+L+AL  KN  GF+DGTI K  P  +   +W+ NN+I+ SW+ N ISK++ AS+
Subjt:  ANKEDDSLTVDALLNPYNLHHSYSSSVVLVTQPLLGANIYSSWRKAMLIALSGKNNEGFIDGTIKK--PEGRKSAAWKCNNDIITSWIFNSISKEIAASI

Query:  NYTGSA------------------------QVVTPNQGTMSIESYNTKLKTIWQELCDYRPTLDCTCGGNKTFIEHLDSEFVMIFLMGLSDSYTSVRAQI
         Y+ SA                         +VT  QG+++I  Y TK+K +W+EL +Y+P+  CTCGG K +I+H  SE+ M+FLMGL++ Y+ +R QI
Subjt:  NYTGSA------------------------QVVTPNQGTMSIESYNTKLKTIWQELCDYRPTLDCTCGGNKTFIEHLDSEFVMIFLMGLSDSYTSVRAQI

Query:  LLMKPIPSIDVVFALIVQEEQQRSVG
        LLM PIP I+  F+L++QEE+Q+ +G
Subjt:  LLMKPIPSIDVVFALIVQEEQQRSVG

A0A5A7VE66 Cysteine-rich RLK (Receptor-like protein kinase) 8 | e value: 5.4e-58 | % identity: 56.28
Query:  DALLNPYNLHHSYSSSVVLVTQPLLGANIYSSWRKAMLIALSGKNNEGFIDGTIKKP-EGRKSAAWKCNNDIITSWIFNSISKEIAASINYTGS------
        DA LNPY +HHS   +  +VTQPL GA  Y+SW +AML+A+SG+N  GFI G I+KP +G    AW CNNDI+ SWI NS+SKEIAASI Y GS      
Subjt:  DALLNPYNLHHSYSSSVVLVTQPLLGANIYSSWRKAMLIALSGKNNEGFIDGTIKKP-EGRKSAAWKCNNDIITSWIFNSISKEIAASINYTGS------

Query:  ------------------AQVVTPNQGTMSIESYNTKLKTIWQELCDYRPTLDCTCGGNKTFIEHLDSEFVMIFLMGLSDSYTSVRAQILLMKPIPSIDV
                           + VT  QG ++IE+Y TKLKTIWQ L +YR T DCTCGG K FI+HL+SE++M FLMGL+DSY +VRAQILLM+P+PSI+ 
Subjt:  ------------------AQVVTPNQGTMSIESYNTKLKTIWQELCDYRPTLDCTCGGNKTFIEHLDSEFVMIFLMGLSDSYTSVRAQILLMKPIPSIDV

Query:  VFALIVQEEQQRSVG
        VF+L++QEEQQRS G
Subjt:  VFALIVQEEQQRSVG

A0A6J1CXR2 uncharacterized protein LOC111015239 | e value: 1.3e-64 | % identity: 59.26
Query:  TVDALLNPYNLHHSYSSSVVLVTQPLLGANIYSSWRKAMLIALSGKNNEGFIDGTIKKPEGRKSAAWKCNNDIITSWIFNSISKEIAASINYTGSA----
        T+++ LNPY +HHS + + +LVTQ LLGA+ Y+SW ++MLIALSGKN  GFIDGTIKKP G   AAWKCNNDIITSWI NS+SKEIAASI YTGSA    
Subjt:  TVDALLNPYNLHHSYSSSVVLVTQPLLGANIYSSWRKAMLIALSGKNNEGFIDGTIKKPEGRKSAAWKCNNDIITSWIFNSISKEIAASINYTGSA----

Query:  --------------------QVVTPNQGTMSIESYNTKLKTIWQELCDYRPTLDCTCGGNKTFIEHLDSEFVMIFLMGLSDSYTSVRAQILLMKPIPSID
                            ++VT  QGT+SIE+Y TKLKT+WQEL DYRPT+DCTC G K+  E   SE+VM FLMGL++SY  +RAQILLM PIP ++
Subjt:  --------------------QVVTPNQGTMSIESYNTKLKTIWQELCDYRPTLDCTCGGNKTFIEHLDSEFVMIFLMGLSDSYTSVRAQILLMKPIPSID

Query:  VVFALIVQEEQQRSVG
         VF+L++QEE+QR++G
Subjt:  VVFALIVQEEQQRSVG

A0A6P4C265 uncharacterized protein LOC107462121 | e value: 1.1e-42 | % identity: 41.2
Query:  NPYNLHHSYSSSVVLVTQPLLGANIYSSWRKAMLIALSGKNNEGFIDGTIKKPEGRKSAA----WKCNNDIITSWIFNSISKEIAASINYTGSA------
        +P+ LH S   ++ LV+Q L G N Y SW+++M +AL+GKN  GF+DGTI  PE   S +    W+  NDI+++WI NS+SKEIA+S+ + GSA      
Subjt:  NPYNLHHSYSSSVVLVTQPLLGANIYSSWRKAMLIALSGKNNEGFIDGTIKKPEGRKSAA----WKCNNDIITSWIFNSISKEIAASINYTGSA------

Query:  ------------------QVVTPNQGTMSIESYNTKLKTIWQELCDYRPTLDCTCGGNKTFIEHLDSEFVMIFLMGLSDSYTSVRAQILLMKPIPSIDVV
                          +++   Q ++S+  + TKLK +W+EL  +RP++ C CGG + F+ H D E+V++FLMGL+D Y  VR+QILLMKP+PSI  V
Subjt:  ------------------QVVTPNQGTMSIESYNTKLKTIWQELCDYRPTLDCTCGGNKTFIEHLDSEFVMIFLMGLSDSYTSVRAQILLMKPIPSIDVV

Query:  FALIVQEEQQRSVGQSPLLQKSLLMTGGVRDLY
        F+LIVQEE QR +  +P     + +   V++ +
Subjt:  FALIVQEEQQRSVGQSPLLQKSLLMTGGVRDLY

SwissProt top hits | e value | % identity | Alignment
No hits found

Arabidopsis top hits | e value | % identity | Alignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). | e value: 9.6e-15 | % identity: 26.89
Query:  PYNLHHSYSSSVVLVTQPLLGANIYSSWRKAMLIALSGKNNEGFIDGTIKKPE--GRKSAAWKCNNDIITSWIFNSISKEIAASINYTGSA---------
        P ++HH    S+  +++     + Y +W+      L      GFIDGT+ KP+        W+  N ++  W+ NS++ ++  S+ Y  +A         
Subjt:  PYNLHHSYSSSVVLVTQPLLGANIYSSWRKAMLIALSGKNNEGFIDGTIKKPE--GRKSAAWKCNNDIITSWIFNSISKEIAASINYTGSA---------

Query:  ---------------QVVTPNQGTMSIESYNTKLKTIWQELCDYRPTLDCTCGG-----NKTFIEHLDSEFVMIFLMG--LSDSYTSVRAQILLMKPIPS
                       ++ T  QG  S+E Y  KL  +W EL +Y P  +C CGG      K   E  + E    FLMG  L+  + +V  +I+  KP PS
Subjt:  ---------------QVVTPNQGTMSIESYNTKLKTIWQELCDYRPTLDCTCGG-----NKTFIEHLDSEFVMIFLMG--LSDSYTSVRAQILLMKPIPS

Query:  IDVVFALIVQEE
        +   FA++   E
Subjt:  IDVVFALIVQEE
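The % identity values in the hit tables can be related to the alignment midlines (the unlabelled line between each Query and Subjt pair), where a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. The following is a minimal sketch of that counting, assuming the usual BLAST midline convention; the function name `midline_stats` is hypothetical, and the input is only the final 15-column segment of the KAA0065480.1 alignment, so its result is not expected to equal the whole-alignment figure in the table.

```python
def midline_stats(midline: str) -> tuple[int, int, int]:
    """Count identities, positives, and columns in one midline segment.

    Assumes the BLAST convention: letter = identity, '+' = positive,
    space = mismatch or gap. The leading indent that lines the midline
    up with the 'Query:' prefix must be removed before calling this.
    """
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives, len(midline)


# Final segment of the KAA0065480.1 alignment (query: VFALIVQEEQQRSVG).
midline = "VF+L++QEEQQRS G"
identities, positives, length = midline_stats(midline)
percent_identity = 100.0 * identities / length
print(f"{identities}/{length} identical ({percent_identity:.2f}%), "
      f"{positives}/{length} positive")
# 11/15 identical (73.33%), 14/15 positive
```

Summing identities and columns over every segment of a hit, rather than one segment, would give the per-hit percentage.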


Sequences
CDS sequence
ATGGCGAACAAAGAAGATGACTCCCTCACAGTCGACGCGCTTCTCAACCCTTACAACCTTCATCATTCATATAGTTCTTCCGTTGTGTTAGTGACCCAACCCTTACTTGG
AGCAAACATTTACAGTTCATGGAGGAAAGCAATGTTAATTGCATTATCGGGCAAGAACAATGAGGGCTTTATCGATGGCACAATCAAGAAACCTGAAGGACGAAAATCGG
CTGCTTGGAAATGCAATAACGATATAATCACATCGTGGATCTTCAATTCTATTTCTAAAGAGATAGCCGCAAGCATCAATTACACAGGTTCTGCACAAGTGGTCACCCCA
AACCAAGGTACCATGTCTATTGAATCATATAACACCAAATTGAAGACAATATGGCAAGAACTATGTGATTATCGTCCTACTTTAGACTGCACTTGTGGCGGCAACAAAAC
CTTCATCGAACATCTTGATTCTGAATTTGTAATGATCTTCCTGATGGGTTTAAGTGATTCCTACACAAGTGTTCGCGCCCAGATACTCCTGATGAAACCTATTCCATCGA
TTGATGTTGTTTTTGCTCTAATTGTACAAGAAGAGCAACAACGATCTGTTGGCCAATCGCCACTACTACAAAAATCACTTTTAATGACGGGCGGCGTCCGTGATCTATAC
GAGTGTCGTCGTAACTTGTTATGA
mRNA sequence
ATGGCGAACAAAGAAGATGACTCCCTCACAGTCGACGCGCTTCTCAACCCTTACAACCTTCATCATTCATATAGTTCTTCCGTTGTGTTAGTGACCCAACCCTTACTTGG
AGCAAACATTTACAGTTCATGGAGGAAAGCAATGTTAATTGCATTATCGGGCAAGAACAATGAGGGCTTTATCGATGGCACAATCAAGAAACCTGAAGGACGAAAATCGG
CTGCTTGGAAATGCAATAACGATATAATCACATCGTGGATCTTCAATTCTATTTCTAAAGAGATAGCCGCAAGCATCAATTACACAGGTTCTGCACAAGTGGTCACCCCA
AACCAAGGTACCATGTCTATTGAATCATATAACACCAAATTGAAGACAATATGGCAAGAACTATGTGATTATCGTCCTACTTTAGACTGCACTTGTGGCGGCAACAAAAC
CTTCATCGAACATCTTGATTCTGAATTTGTAATGATCTTCCTGATGGGTTTAAGTGATTCCTACACAAGTGTTCGCGCCCAGATACTCCTGATGAAACCTATTCCATCGA
TTGATGTTGTTTTTGCTCTAATTGTACAAGAAGAGCAACAACGATCTGTTGGCCAATCGCCACTACTACAAAAATCACTTTTAATGACGGGCGGCGTCCGTGATCTATAC
GAGTGTCGTCGTAACTTGTTATGA
Protein sequence
MANKEDDSLTVDALLNPYNLHHSYSSSVVLVTQPLLGANIYSSWRKAMLIALSGKNNEGFIDGTIKKPEGRKSAAWKCNNDIITSWIFNSISKEIAASINYTGSAQVVTP
NQGTMSIESYNTKLKTIWQELCDYRPTLDCTCGGNKTFIEHLDSEFVMIFLMGLSDSYTSVRAQILLMKPIPSIDVVFALIVQEEQQRSVGQSPLLQKSLLMTGGVRDLY
ECRRNLL
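The relationship between the CDS and the protein sequence listed above can be checked by translating the CDS codon by codon. This is a minimal sketch that assumes the standard genetic code (NCBI translation table 1), which the record does not state explicitly; `translate` and `CODON_TABLE` are names introduced here for illustration. Only the first 13 and the last 8 codons of the CDS are used as input.

```python
from itertools import product

# Standard genetic code (assumed), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from the listing
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 13 codons and last 8 codons of the CDS shown above.
print(translate("ATGGCGAACAAAGAAGATGACTCCCTCACAGTCGACGCG"))  # MANKEDDSLTVDA
print(translate("GAGTGTCGTCGTAACTTGTTATGA"))                 # ECRRNLL*
```

Both fragments agree with the start (MANKEDDSLTVDA) and end (ECRRNLL) of the listed protein, with the final TGA codon read as the stop.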