; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0038843 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0038843
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotran_gag_3 domain-containing protein
Genome locationchr2:28968661..28972482
RNA-Seq ExpressionLag0038843
SyntenyLag0038843
Gene Ontology termsNA
InterPro domainsIPR029472 - Retrotransposon Copia-like, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6588985.1 Retrovirus-related Pol polyprotein from transposon RE1, partial [Cucurbita argyrosperma subsp. sororia]3.2e-2850Show/hide
Query:  DLNSPIFLLSNICNLVSIRLDSSNFILWKFQLTSILKAHKMSGYVDGTTPAP-------------SKVLLRSSPAISPEAALAYVVGCSTSK--------
        +L SPI LLSNICNL+SI+LDS+N++LWKFQLT++LKAHK+ G++DGT P P               ++   +  +SPE ALAYVVG +TSK        
Subjt:  DLNSPIFLLSNICNLVSIRLDSSNFILWKFQLTSILKAHKMSGYVDGTTPAP-------------SKVLLRSSPAISPEAALAYVVGCSTSK--------

Query:  -----------------ESISMKLTESIDDYVKRIKEINDELANVSSIVNDEDLFIYALN
                         ++IS K  ESID Y+KRIKEI D+LANVS++VNDEDL IYALN
Subjt:  -----------------ESISMKLTESIDDYVKRIKEINDELANVSSIVNDEDLFIYALN

KAG7015254.1 hypothetical protein SDJN02_22888, partial [Cucurbita argyrosperma subsp. argyrosperma]3.2e-2850Show/hide
Query:  DLNSPIFLLSNICNLVSIRLDSSNFILWKFQLTSILKAHKMSGYVDGTTPAP-------------SKVLLRSSPAISPEAALAYVVGCSTSK--------
        +L SPI LLSNICNL+SI+LDS+N++LWKFQLT++LKAHK+ G++DGT P P               ++   +  +SPE ALAYVVG +TSK        
Subjt:  DLNSPIFLLSNICNLVSIRLDSSNFILWKFQLTSILKAHKMSGYVDGTTPAP-------------SKVLLRSSPAISPEAALAYVVGCSTSK--------

Query:  -----------------ESISMKLTESIDDYVKRIKEINDELANVSSIVNDEDLFIYALN
                         ++IS K  ESID Y+KRIKEI D+LANVS++VNDEDL IYALN
Subjt:  -----------------ESISMKLTESIDDYVKRIKEINDELANVSSIVNDEDLFIYALN

XP_008448007.1 PREDICTED: uncharacterized protein LOC103490319 isoform X2 [Cucumis melo]2.1e-2748Show/hide
Query:  KDLNSPIFLLSNICNLVSIRLDSSNFILWKFQLTSILKAHKMSGYVDGTTPAP---------SKVLLRSSPA------------------ISPEAALAYV
        KD  SPIFLLSNICNL+S+RLDS+NF+LWKFQLT+ILKAHK+ G++DGT P P         S V  +S+P+                  +SPE ALAYV
Subjt:  KDLNSPIFLLSNICNLVSIRLDSSNFILWKFQLTSILKAHKMSGYVDGTTPAP---------SKVLLRSSPA------------------ISPEAALAYV

Query:  VGCSTSK-------------------------ESISMKLTESIDDYVKRIKEINDELANVSSIVNDEDLFIYALN
        VG ++SK                         ++I  K  ESID Y+KRIKEI D+LANVS+ +N+EDL IYALN
Subjt:  VGCSTSK-------------------------ESISMKLTESIDDYVKRIKEINDELANVSSIVNDEDLFIYALN

XP_008448008.1 PREDICTED: uncharacterized protein LOC103490319 isoform X3 [Cucumis melo]2.1e-2748Show/hide
Query:  KDLNSPIFLLSNICNLVSIRLDSSNFILWKFQLTSILKAHKMSGYVDGTTPAP---------SKVLLRSSPA------------------ISPEAALAYV
        KD  SPIFLLSNICNL+S+RLDS+NF+LWKFQLT+ILKAHK+ G++DGT P P         S V  +S+P+                  +SPE ALAYV
Subjt:  KDLNSPIFLLSNICNLVSIRLDSSNFILWKFQLTSILKAHKMSGYVDGTTPAP---------SKVLLRSSPA------------------ISPEAALAYV

Query:  VGCSTSK-------------------------ESISMKLTESIDDYVKRIKEINDELANVSSIVNDEDLFIYALN
        VG ++SK                         ++I  K  ESID Y+KRIKEI D+LANVS+ +N+EDL IYALN
Subjt:  VGCSTSK-------------------------ESISMKLTESIDDYVKRIKEINDELANVSSIVNDEDLFIYALN

XP_022158689.1 uncharacterized protein LOC111025150 [Momordica charantia]1.2e-2746.59Show/hide
Query:  KDLNSPIFLLSNICNLVSIRLDSSNFILWKFQLTSILKAHKMSGYVDGTTPAPSKVLL-----------RSSPAIS-----------------PEAALAY
        KDL+SPIFLLSNICNLVS+RLDSSNF+LWKFQLT+ILKAHK+ G++DG+TP P++ L+            ++PA S                   +ALAY
Subjt:  KDLNSPIFLLSNICNLVSIRLDSSNFILWKFQLTSILKAHKMSGYVDGTTPAPSKVLL-----------RSSPAIS-----------------PEAALAY

Query:  VVGCSTSK-------------------------ESISMKLTESIDDYVKRIKEINDELANVSSIVNDEDLFIYALN
        VVGC +S+                         +SIS K   SID YV+RIKE+ D+LANV  +V++EDL IY LN
Subjt:  VVGCSTSK-------------------------ESISMKLTESIDDYVKRIKEINDELANVSSIVNDEDLFIYALN

TrEMBL top hitse value%identityAlignment
A0A1S3BI58 uncharacterized protein LOC103490319 isoform X21.0e-2748Show/hide
Query:  KDLNSPIFLLSNICNLVSIRLDSSNFILWKFQLTSILKAHKMSGYVDGTTPAP---------SKVLLRSSPA------------------ISPEAALAYV
        KD  SPIFLLSNICNL+S+RLDS+NF+LWKFQLT+ILKAHK+ G++DGT P P         S V  +S+P+                  +SPE ALAYV
Subjt:  KDLNSPIFLLSNICNLVSIRLDSSNFILWKFQLTSILKAHKMSGYVDGTTPAP---------SKVLLRSSPA------------------ISPEAALAYV

Query:  VGCSTSK-------------------------ESISMKLTESIDDYVKRIKEINDELANVSSIVNDEDLFIYALN
        VG ++SK                         ++I  K  ESID Y+KRIKEI D+LANVS+ +N+EDL IYALN
Subjt:  VGCSTSK-------------------------ESISMKLTESIDDYVKRIKEINDELANVSSIVNDEDLFIYALN

A0A1S3BIR3 uncharacterized protein LOC103490319 isoform X31.0e-2748Show/hide
Query:  KDLNSPIFLLSNICNLVSIRLDSSNFILWKFQLTSILKAHKMSGYVDGTTPAP---------SKVLLRSSPA------------------ISPEAALAYV
        KD  SPIFLLSNICNL+S+RLDS+NF+LWKFQLT+ILKAHK+ G++DGT P P         S V  +S+P+                  +SPE ALAYV
Subjt:  KDLNSPIFLLSNICNLVSIRLDSSNFILWKFQLTSILKAHKMSGYVDGTTPAP---------SKVLLRSSPA------------------ISPEAALAYV

Query:  VGCSTSK-------------------------ESISMKLTESIDDYVKRIKEINDELANVSSIVNDEDLFIYALN
        VG ++SK                         ++I  K  ESID Y+KRIKEI D+LANVS+ +N+EDL IYALN
Subjt:  VGCSTSK-------------------------ESISMKLTESIDDYVKRIKEINDELANVSSIVNDEDLFIYALN

A0A1S4DWT9 uncharacterized protein LOC103490319 isoform X11.0e-2748Show/hide
Query:  KDLNSPIFLLSNICNLVSIRLDSSNFILWKFQLTSILKAHKMSGYVDGTTPAP---------SKVLLRSSPA------------------ISPEAALAYV
        KD  SPIFLLSNICNL+S+RLDS+NF+LWKFQLT+ILKAHK+ G++DGT P P         S V  +S+P+                  +SPE ALAYV
Subjt:  KDLNSPIFLLSNICNLVSIRLDSSNFILWKFQLTSILKAHKMSGYVDGTTPAP---------SKVLLRSSPA------------------ISPEAALAYV

Query:  VGCSTSK-------------------------ESISMKLTESIDDYVKRIKEINDELANVSSIVNDEDLFIYALN
        VG ++SK                         ++I  K  ESID Y+KRIKEI D+LANVS+ +N+EDL IYALN
Subjt:  VGCSTSK-------------------------ESISMKLTESIDDYVKRIKEINDELANVSSIVNDEDLFIYALN

A0A5D3CLI6 T4.51.0e-2748Show/hide
Query:  KDLNSPIFLLSNICNLVSIRLDSSNFILWKFQLTSILKAHKMSGYVDGTTPAP---------SKVLLRSSPA------------------ISPEAALAYV
        KD  SPIFLLSNICNL+S+RLDS+NF+LWKFQLT+ILKAHK+ G++DGT P P         S V  +S+P+                  +SPE ALAYV
Subjt:  KDLNSPIFLLSNICNLVSIRLDSSNFILWKFQLTSILKAHKMSGYVDGTTPAP---------SKVLLRSSPA------------------ISPEAALAYV

Query:  VGCSTSK-------------------------ESISMKLTESIDDYVKRIKEINDELANVSSIVNDEDLFIYALN
        VG ++SK                         ++I  K  ESID Y+KRIKEI D+LANVS+ +N+EDL IYALN
Subjt:  VGCSTSK-------------------------ESISMKLTESIDDYVKRIKEINDELANVSSIVNDEDLFIYALN

A0A6J1E049 uncharacterized protein LOC1110251505.9e-2846.59Show/hide
Query:  KDLNSPIFLLSNICNLVSIRLDSSNFILWKFQLTSILKAHKMSGYVDGTTPAPSKVLL-----------RSSPAIS-----------------PEAALAY
        KDL+SPIFLLSNICNLVS+RLDSSNF+LWKFQLT+ILKAHK+ G++DG+TP P++ L+            ++PA S                   +ALAY
Subjt:  KDLNSPIFLLSNICNLVSIRLDSSNFILWKFQLTSILKAHKMSGYVDGTTPAPSKVLL-----------RSSPAIS-----------------PEAALAY

Query:  VVGCSTSK-------------------------ESISMKLTESIDDYVKRIKEINDELANVSSIVNDEDLFIYALN
        VVGC +S+                         +SIS K   SID YV+RIKE+ D+LANV  +V++EDL IY LN
Subjt:  VVGCSTSK-------------------------ESISMKLTESIDDYVKRIKEINDELANVSSIVNDEDLFIYALN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCATAAGGATCTCAATTCCCCAATTTTTCTTCTATCAAACATATGCAACCTTGTCTCCATTCGTCTCGATTCCTCAAACTTCATTCTCTGGAAATTTCAGCTAACCTC
TATTCTCAAGGCGCACAAAATGTCCGGTTATGTTGATGGAACTACACCGGCTCCTTCTAAGGTTCTTCTGAGATCCTCGCCTGCAATCTCTCCAGAAGCGGCTCTTGCCT
ATGTTGTCGGTTGTTCCACATCGAAGGAGTCTATTTCTATGAAACTCACTGAATCGATTGATGATTATGTGAAGAGAATCAAAGAAATCAATGATGAACTAGCTAATGTT
TCTTCTATTGTTAATGATGAGGATCTATTTATTTATGCACTGAATGCCAAAACAGTCCCTTGTCATCGTAGAGGAATACGAAGAAGAGGGATTCTAATGGAGGATTCAAC
TGAAAAGAGCAAAGAGCTTCCTGGTGAAAACGACGTCGTTGCTCACTGTCCAAAGTCATCACCACAGGAAGATTCCAGCACCGTTGGATCAAGTCCGCCTGCTACGCCTT
GCCCTTCAACTGACCAATCGGGGCAGGAAGGAAGTGAATCCCCCAGCCCCGGTCGTGCTGCTCCTAGGCCTGTACCAAGTAACGGTGGCAATGAACAAAAGAACATAGAT
GGCAATGATCCAAAGAACATAATTTAG
mRNA sequenceShow/hide mRNA sequence
ATGCATAAGGATCTCAATTCCCCAATTTTTCTTCTATCAAACATATGCAACCTTGTCTCCATTCGTCTCGATTCCTCAAACTTCATTCTCTGGAAATTTCAGCTAACCTC
TATTCTCAAGGCGCACAAAATGTCCGGTTATGTTGATGGAACTACACCGGCTCCTTCTAAGGTTCTTCTGAGATCCTCGCCTGCAATCTCTCCAGAAGCGGCTCTTGCCT
ATGTTGTCGGTTGTTCCACATCGAAGGAGTCTATTTCTATGAAACTCACTGAATCGATTGATGATTATGTGAAGAGAATCAAAGAAATCAATGATGAACTAGCTAATGTT
TCTTCTATTGTTAATGATGAGGATCTATTTATTTATGCACTGAATGCCAAAACAGTCCCTTGTCATCGTAGAGGAATACGAAGAAGAGGGATTCTAATGGAGGATTCAAC
TGAAAAGAGCAAAGAGCTTCCTGGTGAAAACGACGTCGTTGCTCACTGTCCAAAGTCATCACCACAGGAAGATTCCAGCACCGTTGGATCAAGTCCGCCTGCTACGCCTT
GCCCTTCAACTGACCAATCGGGGCAGGAAGGAAGTGAATCCCCCAGCCCCGGTCGTGCTGCTCCTAGGCCTGTACCAAGTAACGGTGGCAATGAACAAAAGAACATAGAT
GGCAATGATCCAAAGAACATAATTTAG
Protein sequenceShow/hide protein sequence
MHKDLNSPIFLLSNICNLVSIRLDSSNFILWKFQLTSILKAHKMSGYVDGTTPAPSKVLLRSSPAISPEAALAYVVGCSTSKESISMKLTESIDDYVKRIKEINDELANV
SSIVNDEDLFIYALNAKTVPCHRRGIRRRGILMEDSTEKSKELPGENDVVAHCPKSSPQEDSSTVGSSPPATPCPSTDQSGQEGSESPSPGRAAPRPVPSNGGNEQKNID
GNDPKNII