; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0022673 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0022673
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr7:35382476..35382961
RNA-Seq ExpressionLag0022673
SyntenyLag0022673
Gene Ontology termsGO:0044237 - cellular metabolic process (biological process)
GO:0005488 - binding (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR029472 - Retrotransposon Copia-like, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0065480.1 Cysteine-rich RLK (receptor-like protein kinase) 8 [Cucumis melo var. makuwa]5.7e-4857.67Show/hide
Query:  NGS---PSTLANTEIEGQLNPYLLHHSTTPTTSLVTQQLVGASNYISWSKAMLLALSGKNKFGFVNGSIPKP-DGALSSAWQCNNDIVTSWIVNSISKDI
        NGS   P T AN++ + QLNPY +HHS  PT ++VTQ L GA NY SWS+AML+A+SG+NK GF+ G I KP DG L  AW CNNDI+ SWI+NS+SK+I
Subjt:  NGS---PSTLANTEIEGQLNPYLLHHSTTPTTSLVTQQLVGASNYISWSKAMLLALSGKNKFGFVNGSIPKP-DGALSSAWQCNNDIVTSWIVNSISKDI

Query:  AASLIFTGCAKDIWDELKTRYRQINGPRIYQLRKDLATFSQGTLSVEAYYVKIMSLWLELIEY
        AAS+I+ G  K+IWDEL+ R++Q NGP IYQLRK+  T  QG L++E YY K+ ++W  L EY
Subjt:  AASLIFTGCAKDIWDELKTRYRQINGPRIYQLRKDLATFSQGTLSVEAYYVKIMSLWLELIEY

XP_022142771.1 uncharacterized protein LOC111012810 [Momordica charantia]7.0e-3869.72Show/hide
Query:  TEIEGQLNPYLLHHSTTPTTSLVTQQLVGASNYISWSKAMLLALSGKNKFGFVNGSIPKPDGALSSAWQCNNDIVTSWIVNSISKDIAASLIFTGCAKDI
        T IE QLNPYL+HHST PTT LVTQQL+GASNY SWS++M++ALSGKNK GFV+G+I KP G L +AW+C NDI+TSWI+NS+SK+IAAS ++TG AK+I
Subjt:  TEIEGQLNPYLLHHSTTPTTSLVTQQLVGASNYISWSKAMLLALSGKNKFGFVNGSIPKPDGALSSAWQCNNDIVTSWIVNSISKDIAASLIFTGCAKDI

Query:  WDELKTRYR
        WDELK R++
Subjt:  WDELKTRYR

XP_022145891.1 uncharacterized protein LOC111015239 [Momordica charantia]5.7e-5667.1Show/hide
Query:  STLANTEIEGQLNPYLLHHSTTPTTSLVTQQLVGASNYISWSKAMLLALSGKNKFGFVNGSIPKPDGALSSAWQCNNDIVTSWIVNSISKDIAASLIFTG
        ST   T IE QLNPYL+HHST PTT LVTQQL+GASNY SW ++ML+ALSGKNK GF++G+I KP+G L +AW+CNNDI+TSWI+NS+SK+IAAS+I+TG
Subjt:  STLANTEIEGQLNPYLLHHSTTPTTSLVTQQLVGASNYISWSKAMLLALSGKNKFGFVNGSIPKPDGALSSAWQCNNDIVTSWIVNSISKDIAASLIFTG

Query:  CAKDIWDELKTRYRQINGPRIYQLRKDLATFSQGTLSVEAYYVKIMSLWLELIEY
         AKDIWDELK R++Q + PRI+QLRK+L T  QGTLS+EAYY K+ ++W EL +Y
Subjt:  CAKDIWDELKTRYRQINGPRIYQLRKDLATFSQGTLSVEAYYVKIMSLWLELIEY

XP_022888913.1 uncharacterized protein LOC111404319 [Olea europaea var. sylvestris]8.6e-3650.63Show/hide
Query:  NGSPSTLANTEIEGQLNPYLLHHSTTPTTSLVTQQLVGASNYISWSKAMLLALSGKNKFGFVNGSIPKPDG--ALSSAWQCNNDIVTSWIVNSISKDIAA
        N S S+++N+ IE   +PY LHHS +P   LV+Q L+G  NY SWS+AM +ALS KNK  F+NGSI KP+    L +AW  NN++V SWI+NS+SK+I+A
Subjt:  NGSPSTLANTEIEGQLNPYLLHHSTTPTTSLVTQQLVGASNYISWSKAMLLALSGKNKFGFVNGSIPKPDG--ALSSAWQCNNDIVTSWIVNSISKDIAA

Query:  SLIFTGCAKDIWDELKTRYRQINGPRIYQLRKDLATFSQGTLSVEAYYVKIMSLWLEL
        S+I+    ++IWD+LK RY+Q N PRI+QLR++L    QG  SV  Y+ K+ ++W EL
Subjt:  SLIFTGCAKDIWDELKTRYRQINGPRIYQLRKDLATFSQGTLSVEAYYVKIMSLWLEL

XP_038888312.1 uncharacterized protein LOC120078158 [Benincasa hispida]7.7e-3764.91Show/hide
Query:  KAMLLALSGKNKFGFVNGSIPKP-DGALSSAWQCNNDIVTSWIVNSISKDIAASLIFTGCAKDIWDELKTRYRQINGPRIYQLRKDLATFSQGTLSVEAY
        + M L LSGKNK GF+ G+I KP +G+L SAW+CNND++TSWI+NS+SK+IA SL++ G  K+IWDELK RY Q NGP IYQLRKDLAT +QG LSVE Y
Subjt:  KAMLLALSGKNKFGFVNGSIPKP-DGALSSAWQCNNDIVTSWIVNSISKDIAASLIFTGCAKDIWDELKTRYRQINGPRIYQLRKDLATFSQGTLSVEAY

Query:  YVKIMSLWLELIEY
        Y KI ++W EL+EY
Subjt:  YVKIMSLWLELIEY

TrEMBL top hitse value%identityAlignment
A0A5A7VE66 Cysteine-rich RLK (Receptor-like protein kinase) 82.8e-4857.67Show/hide
Query:  NGS---PSTLANTEIEGQLNPYLLHHSTTPTTSLVTQQLVGASNYISWSKAMLLALSGKNKFGFVNGSIPKP-DGALSSAWQCNNDIVTSWIVNSISKDI
        NGS   P T AN++ + QLNPY +HHS  PT ++VTQ L GA NY SWS+AML+A+SG+NK GF+ G I KP DG L  AW CNNDI+ SWI+NS+SK+I
Subjt:  NGS---PSTLANTEIEGQLNPYLLHHSTTPTTSLVTQQLVGASNYISWSKAMLLALSGKNKFGFVNGSIPKP-DGALSSAWQCNNDIVTSWIVNSISKDI

Query:  AASLIFTGCAKDIWDELKTRYRQINGPRIYQLRKDLATFSQGTLSVEAYYVKIMSLWLELIEY
        AAS+I+ G  K+IWDEL+ R++Q NGP IYQLRK+  T  QG L++E YY K+ ++W  L EY
Subjt:  AASLIFTGCAKDIWDELKTRYRQINGPRIYQLRKDLATFSQGTLSVEAYYVKIMSLWLELIEY

A0A5J5B2C5 Uncharacterized protein3.5e-3549.38Show/hide
Query:  NGSPSTLANTEIEGQLNPYLLHHSTTPTTSLVTQQLVGASNYISWSKAMLLALSGKNKFGFVNGSIPKPDGA---LSSAWQCNNDIVTSWIVNSISKDIA
        +G  +    + IE   NPY LHHS +P   LV+QQL G  NY +WS+AML+ALS KNK GFV+G IP+P G    L  +W  NN+IV SWI+NSISK+I+
Subjt:  NGSPSTLANTEIEGQLNPYLLHHSTTPTTSLVTQQLVGASNYISWSKAMLLALSGKNKFGFVNGSIPKPDGA---LSSAWQCNNDIVTSWIVNSISKDIA

Query:  ASLIFTGCAKDIWDELKTRYRQINGPRIYQLRKDLATFSQGTLSVEAYYVKIMSLWLELIEY
        AS+IF   A++IW +L+ R++Q NGPRI+QL+++L    Q   SV  Y+ K+ ++W EL  Y
Subjt:  ASLIFTGCAKDIWDELKTRYRQINGPRIYQLRKDLATFSQGTLSVEAYYVKIMSLWLELIEY

A0A6J1CN69 uncharacterized protein LOC1110128103.4e-3869.72Show/hide
Query:  TEIEGQLNPYLLHHSTTPTTSLVTQQLVGASNYISWSKAMLLALSGKNKFGFVNGSIPKPDGALSSAWQCNNDIVTSWIVNSISKDIAASLIFTGCAKDI
        T IE QLNPYL+HHST PTT LVTQQL+GASNY SWS++M++ALSGKNK GFV+G+I KP G L +AW+C NDI+TSWI+NS+SK+IAAS ++TG AK+I
Subjt:  TEIEGQLNPYLLHHSTTPTTSLVTQQLVGASNYISWSKAMLLALSGKNKFGFVNGSIPKPDGALSSAWQCNNDIVTSWIVNSISKDIAASLIFTGCAKDI

Query:  WDELKTRYR
        WDELK R++
Subjt:  WDELKTRYR

A0A6J1CXR2 uncharacterized protein LOC1110152392.8e-5667.1Show/hide
Query:  STLANTEIEGQLNPYLLHHSTTPTTSLVTQQLVGASNYISWSKAMLLALSGKNKFGFVNGSIPKPDGALSSAWQCNNDIVTSWIVNSISKDIAASLIFTG
        ST   T IE QLNPYL+HHST PTT LVTQQL+GASNY SW ++ML+ALSGKNK GF++G+I KP+G L +AW+CNNDI+TSWI+NS+SK+IAAS+I+TG
Subjt:  STLANTEIEGQLNPYLLHHSTTPTTSLVTQQLVGASNYISWSKAMLLALSGKNKFGFVNGSIPKPDGALSSAWQCNNDIVTSWIVNSISKDIAASLIFTG

Query:  CAKDIWDELKTRYRQINGPRIYQLRKDLATFSQGTLSVEAYYVKIMSLWLELIEY
         AKDIWDELK R++Q + PRI+QLRK+L T  QGTLS+EAYY K+ ++W EL +Y
Subjt:  CAKDIWDELKTRYRQINGPRIYQLRKDLATFSQGTLSVEAYYVKIMSLWLELIEY

A0A6J1DKR8 uncharacterized protein LOC1110218311.3e-3447.5Show/hide
Query:  NGSPSTLANTE-IEGQLNPYLLHHSTTPTTSLVTQQLVGASNYISWSKAMLLALSGKNKFGFVNGSIPKPDGALSSAWQCNNDIVTSWIVNSISKDIAAS
        + SP++ ++   ++  LNPY LHH+      LVTQ L    NY SWS++ML+ALS KNK GF++GSI +P G L  AW  NN +V +WI+NS+SK+I++S
Subjt:  NGSPSTLANTE-IEGQLNPYLLHHSTTPTTSLVTQQLVGASNYISWSKAMLLALSGKNKFGFVNGSIPKPDGALSSAWQCNNDIVTSWIVNSISKDIAAS

Query:  LIFTGCAKDIWDELKTRYRQINGPRIYQLRKDLATFSQGTLSVEAYYVKIMSLWLELIEY
        ++F+  A+DIW +LK R+ + NGPRI+QL++DLA   Q   SV  Y+ K+ ++W ELI+Y
Subjt:  LIFTGCAKDIWDELKTRYRQINGPRIYQLRKDLATFSQGTLSVEAYYVKIMSLWLELIEY

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).1.3e-1833.56Show/hide
Query:  NPYLL----HHSTTPTTSLVTQQLVGASNYISWSKAMLLALSGKNKFGFVNGSIPKPD--GALSSAWQCNNDIVTSWIVNSISKDIAASLIFTGCAKDIW
        +PY L    HH   P+   + +      NY++W       L    KFGF++G++PKPD    L   W+  N +V  W++NS++  +  S+++   A  +W
Subjt:  NPYLL----HHSTTPTTSLVTQQLVGASNYISWSKAMLLALSGKNKFGFVNGSIPKPD--GALSSAWQCNNDIVTSWIVNSISKDIAASLIFTGCAKDIW

Query:  DELKTRYRQINGPRIYQLRKDLATFSQGTLSVEAYYVKIMSLWLELIEY
        ++L+  +      +IYQLR+ LAT  QG  SVE Y+ K+  +W+EL EY
Subjt:  DELKTRYRQINGPRIYQLRKDLATFSQGTLSVEAYYVKIMSLWLELIEY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGAACGGATCTCCTTCAACTCTTGCAAACACAGAAATTGAAGGTCAATTGAATCCGTATTTGCTGCATCATTCTACCACACCTACCACATCCCTCGTTACTCAGCA
ATTAGTCGGTGCAAGTAACTACATCTCTTGGAGCAAGGCTATGCTTCTTGCCCTCTCAGGTAAAAATAAATTTGGGTTTGTGAATGGATCCATTCCAAAACCAGATGGAG
CACTTTCTTCTGCCTGGCAATGCAACAATGATATAGTAACTTCTTGGATTGTGAATTCCATTTCAAAGGATATTGCTGCGAGCCTTATCTTCACTGGTTGCGCCAAAGAC
ATTTGGGACGAGTTGAAGACTCGATACCGGCAGATTAATGGGCCACGGATTTACCAATTGCGAAAGGATCTTGCAACCTTTTCGCAAGGAACACTTTCTGTTGAAGCATA
TTATGTCAAGATTATGTCTCTTTGGCTCGAATTGATTGAATACTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGAACGGATCTCCTTCAACTCTTGCAAACACAGAAATTGAAGGTCAATTGAATCCGTATTTGCTGCATCATTCTACCACACCTACCACATCCCTCGTTACTCAGCA
ATTAGTCGGTGCAAGTAACTACATCTCTTGGAGCAAGGCTATGCTTCTTGCCCTCTCAGGTAAAAATAAATTTGGGTTTGTGAATGGATCCATTCCAAAACCAGATGGAG
CACTTTCTTCTGCCTGGCAATGCAACAATGATATAGTAACTTCTTGGATTGTGAATTCCATTTCAAAGGATATTGCTGCGAGCCTTATCTTCACTGGTTGCGCCAAAGAC
ATTTGGGACGAGTTGAAGACTCGATACCGGCAGATTAATGGGCCACGGATTTACCAATTGCGAAAGGATCTTGCAACCTTTTCGCAAGGAACACTTTCTGTTGAAGCATA
TTATGTCAAGATTATGTCTCTTTGGCTCGAATTGATTGAATACTGA
Protein sequenceShow/hide protein sequence
MANGSPSTLANTEIEGQLNPYLLHHSTTPTTSLVTQQLVGASNYISWSKAMLLALSGKNKFGFVNGSIPKPDGALSSAWQCNNDIVTSWIVNSISKDIAASLIFTGCAKD
IWDELKTRYRQINGPRIYQLRKDLATFSQGTLSVEAYYVKIMSLWLELIEY