; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0016065 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0016065
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotran_gag_3 domain-containing protein
Genome locationchr12:32728558..32728947
RNA-Seq ExpressionLag0016065
SyntenyLag0016065
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0044260 - cellular macromolecule metabolic process (biological process)
GO:0016020 - membrane (cellular component)
GO:0005488 - binding (molecular function)
GO:0016740 - transferase activity (molecular function)
InterPro domainsIPR029472 - Retrotransposon Copia-like, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035612.1 No apical meristem (NAM) protein [Cucumis melo var. makuwa]7.9e-2462.11Show/hide
Query:  MNPFSLHRSFAPTVALVTQPLLGASNYGSWSQAMLMALDGKNKEGFIDGTIKKP-EGTQSVAWKCNNHIIASWILNSVSKEIAGSIVYTSSTRQS
        +NP+ +H S  PT A+V QPL GA NY SWS+AMLMA+ G+NK GFI G I+KP +G    AW CNN I+ASWILNSVSKEIA SI+YT S+  S
Subjt:  MNPFSLHRSFAPTVALVTQPLLGASNYGSWSQAMLMALDGKNKEGFIDGTIKKP-EGTQSVAWKCNNHIIASWILNSVSKEIAGSIVYTSSTRQS

KAA0065480.1 Cysteine-rich RLK (receptor-like protein kinase) 8 [Cucumis melo var. makuwa]6.4e-2662Show/hide
Query:  SSIDAQMNPFSLHRSFAPTVALVTQPLLGASNYGSWSQAMLMALDGKNKEGFIDGTIKKP-EGTQSVAWKCNNHIIASWILNSVSKEIAGSIVYTSSTRQ
        S  DAQ+NP+ +H S  PT A+VTQPL GA NY SWS+AMLMA+ G+NK GFI G I+KP +G    AW CNN I+ASWILNSVSKEIA SI+Y  S ++
Subjt:  SSIDAQMNPFSLHRSFAPTVALVTQPLLGASNYGSWSQAMLMALDGKNKEGFIDGTIKKP-EGTQSVAWKCNNHIIASWILNSVSKEIAGSIVYTSSTRQ

TYK05760.1 UBN2_3 domain-containing protein [Cucumis melo var. makuwa]2.4e-2562.63Show/hide
Query:  ASSIDAQMNPFSLHRSFAPTVALVTQPLLGASNYGSWSQAMLMALDGKNKEGFIDGTIKKP-EGTQSV-AWKCNNHIIASWILNSVSKEIAGSIVYTSS
        +S IDA +NPF LH S  PT  LV+ P +G++NY SWS+AM++AL GKNK GFI GTIKKP EG   + AWKCNN  IASWI+NS+SKEIA S+VY  S
Subjt:  ASSIDAQMNPFSLHRSFAPTVALVTQPLLGASNYGSWSQAMLMALDGKNKEGFIDGTIKKP-EGTQSV-AWKCNNHIIASWILNSVSKEIAGSIVYTSS

XP_022142771.1 uncharacterized protein LOC111012810 [Momordica charantia]6.2e-2961.76Show/hide
Query:  DEASSIDAQMNPFSLHRSFAPTVALVTQPLLGASNYGSWSQAMLMALDGKNKEGFIDGTIKKPEGTQSVAWKCNNHIIASWILNSVSKEIAGSIVYTSST
        +  ++I++Q+NP+ +H S APT  LVTQ LLGASNY SWS++M++AL GKNK GF+DGTI+KP+G    AWKC N II SWILNSVSKEIA S VYT S 
Subjt:  DEASSIDAQMNPFSLHRSFAPTVALVTQPLLGASNYGSWSQAMLMALDGKNKEGFIDGTIKKPEGTQSVAWKCNNHIIASWILNSVSKEIAGSIVYTSST

Query:  RQ
        ++
Subjt:  RQ

XP_022145891.1 uncharacterized protein LOC111015239 [Momordica charantia]9.6e-3066.33Show/hide
Query:  SSIDAQMNPFSLHRSFAPTVALVTQPLLGASNYGSWSQAMLMALDGKNKEGFIDGTIKKPEGTQSVAWKCNNHIIASWILNSVSKEIAGSIVYTSSTR
        ++I++Q+NP+ +H S APT  LVTQ LLGASNY SW ++ML+AL GKNK GFIDGTIKKP G    AWKCNN II SWI+NSVSKEIA SI+YT S +
Subjt:  SSIDAQMNPFSLHRSFAPTVALVTQPLLGASNYGSWSQAMLMALDGKNKEGFIDGTIKKPEGTQSVAWKCNNHIIASWILNSVSKEIAGSIVYTSSTR

TrEMBL top hitse value%identityAlignment
A0A5A7VE66 Cysteine-rich RLK (Receptor-like protein kinase) 83.1e-2662Show/hide
Query:  SSIDAQMNPFSLHRSFAPTVALVTQPLLGASNYGSWSQAMLMALDGKNKEGFIDGTIKKP-EGTQSVAWKCNNHIIASWILNSVSKEIAGSIVYTSSTRQ
        S  DAQ+NP+ +H S  PT A+VTQPL GA NY SWS+AMLMA+ G+NK GFI G I+KP +G    AW CNN I+ASWILNSVSKEIA SI+Y  S ++
Subjt:  SSIDAQMNPFSLHRSFAPTVALVTQPLLGASNYGSWSQAMLMALDGKNKEGFIDGTIKKP-EGTQSVAWKCNNHIIASWILNSVSKEIAGSIVYTSSTRQ

A0A5D3C350 UBN2_3 domain-containing protein1.2e-2562.63Show/hide
Query:  ASSIDAQMNPFSLHRSFAPTVALVTQPLLGASNYGSWSQAMLMALDGKNKEGFIDGTIKKP-EGTQSV-AWKCNNHIIASWILNSVSKEIAGSIVYTSS
        +S IDA +NPF LH S  PT  LV+ P +G++NY SWS+AM++AL GKNK GFI GTIKKP EG   + AWKCNN  IASWI+NS+SKEIA S+VY  S
Subjt:  ASSIDAQMNPFSLHRSFAPTVALVTQPLLGASNYGSWSQAMLMALDGKNKEGFIDGTIKKP-EGTQSV-AWKCNNHIIASWILNSVSKEIAGSIVYTSS

A0A5D3E5P0 No apical meristem (NAM) protein3.8e-2462.11Show/hide
Query:  MNPFSLHRSFAPTVALVTQPLLGASNYGSWSQAMLMALDGKNKEGFIDGTIKKP-EGTQSVAWKCNNHIIASWILNSVSKEIAGSIVYTSSTRQS
        +NP+ +H S  PT A+V QPL GA NY SWS+AMLMA+ G+NK GFI G I+KP +G    AW CNN I+ASWILNSVSKEIA SI+YT S+  S
Subjt:  MNPFSLHRSFAPTVALVTQPLLGASNYGSWSQAMLMALDGKNKEGFIDGTIKKP-EGTQSVAWKCNNHIIASWILNSVSKEIAGSIVYTSSTRQS

A0A6J1CN69 uncharacterized protein LOC1110128103.0e-2961.76Show/hide
Query:  DEASSIDAQMNPFSLHRSFAPTVALVTQPLLGASNYGSWSQAMLMALDGKNKEGFIDGTIKKPEGTQSVAWKCNNHIIASWILNSVSKEIAGSIVYTSST
        +  ++I++Q+NP+ +H S APT  LVTQ LLGASNY SWS++M++AL GKNK GF+DGTI+KP+G    AWKC N II SWILNSVSKEIA S VYT S 
Subjt:  DEASSIDAQMNPFSLHRSFAPTVALVTQPLLGASNYGSWSQAMLMALDGKNKEGFIDGTIKKPEGTQSVAWKCNNHIIASWILNSVSKEIAGSIVYTSST

Query:  RQ
        ++
Subjt:  RQ

A0A6J1CXR2 uncharacterized protein LOC1110152394.6e-3066.33Show/hide
Query:  SSIDAQMNPFSLHRSFAPTVALVTQPLLGASNYGSWSQAMLMALDGKNKEGFIDGTIKKPEGTQSVAWKCNNHIIASWILNSVSKEIAGSIVYTSSTR
        ++I++Q+NP+ +H S APT  LVTQ LLGASNY SW ++ML+AL GKNK GFIDGTIKKP G    AWKCNN II SWI+NSVSKEIA SI+YT S +
Subjt:  SSIDAQMNPFSLHRSFAPTVALVTQPLLGASNYGSWSQAMLMALDGKNKEGFIDGTIKKPEGTQSVAWKCNNHIIASWILNSVSKEIAGSIVYTSSTR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).3.3e-0428.57Show/hide
Query:  NYGSWSQAMLMALDGKNKEGFIDGTIKKPEGTQSV--AWKCNNHIIASWILNSVSKEIAGSIVYTSSTRQ
        NY +W       L    K GFIDGT+ KP+    +   W+  N ++  W++NS++ ++  S++Y  +  +
Subjt:  NYGSWSQAMLMALDGKNKEGFIDGTIKKPEGTQSV--AWKCNNHIIASWILNSVSKEIAGSIVYTSSTRQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGGCAAAGAAGTTGTAGATGATGAAGCTTCGTCCATTGATGCGCAAATGAACCCATTTTCATTGCATCGTTCTTTTGCGCCAACTGTTGCCTTAGTTACTCAGCC
TCTGTTAGGTGCTTCTAACTATGGGTCTTGGAGTCAGGCAATGTTAATGGCGTTGGATGGTAAAAATAAGGAGGGTTTCATTGATGGAACCATCAAGAAACCAGAAGGCA
CACAATCGGTGGCCTGGAAATGCAACAATCACATAATAGCCTCTTGGATTCTCAACTCAGTATCCAAAGAAATCGCAGGAAGTATTGTTTACACTAGTTCAACAAGGCAG
TCTGGGATGAATAGCGAAATCGATTCAAGCAAACCAATGGACCTCGAATCTATCAGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGGGCAAAGAAGTTGTAGATGATGAAGCTTCGTCCATTGATGCGCAAATGAACCCATTTTCATTGCATCGTTCTTTTGCGCCAACTGTTGCCTTAGTTACTCAGCC
TCTGTTAGGTGCTTCTAACTATGGGTCTTGGAGTCAGGCAATGTTAATGGCGTTGGATGGTAAAAATAAGGAGGGTTTCATTGATGGAACCATCAAGAAACCAGAAGGCA
CACAATCGGTGGCCTGGAAATGCAACAATCACATAATAGCCTCTTGGATTCTCAACTCAGTATCCAAAGAAATCGCAGGAAGTATTGTTTACACTAGTTCAACAAGGCAG
TCTGGGATGAATAGCGAAATCGATTCAAGCAAACCAATGGACCTCGAATCTATCAGCTGA
Protein sequenceShow/hide protein sequence
MAGKEVVDDEASSIDAQMNPFSLHRSFAPTVALVTQPLLGASNYGSWSQAMLMALDGKNKEGFIDGTIKKPEGTQSVAWKCNNHIIASWILNSVSKEIAGSIVYTSSTRQ
SGMNSEIDSSKPMDLESIS