; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0024296 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0024296
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotran_gag_3 domain-containing protein
Genome locationchr10:1949329..1949784
RNA-Seq ExpressionLag0024296
SyntenyLag0024296
Gene Ontology termsNA
InterPro domainsIPR029472 - Retrotransposon Copia-like, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0049700.1 T4.5 [Cucumis melo var. makuwa]4.3e-2947.31Show/hide
Query:  MDTSSSITKSAAIPSEQSLSSPIVLLTNISNLITVRLDSSNYVLWNFQLSPFLRAHKLFGYLDGSISAPAKEIKVEGIS----QPNPAYEAWFEKDQALI
        M +S+++  S+A   E+   SPI LL+NI NLI++RLDS+N+VLW FQL+  L+AHKL+G++DG+   P +       S    Q NP+YE W  KDQAL+
Subjt:  MDTSSSITKSAAIPSEQSLSSPIVLLTNISNLITVRLDSSNYVLWNFQLSPFLRAHKLFGYLDGSISAPAKEIKVEGIS----QPNPAYEAWFEKDQALI

Query:  TLINATLSPSALAYAVGCTTSQQIWS---------------KLKSDLQSITKLSTESISGYVQRITD
        T+INATLSP ALAY VG T+S+Q+W                 LKSDLQ+I K   ESI  Y++RI +
Subjt:  TLINATLSPSALAYAVGCTTSQQIWS---------------KLKSDLQSITKLSTESISGYVQRITD

KAG6588985.1 Retrovirus-related Pol polyprotein from transposon RE1, partial [Cucurbita argyrosperma subsp. sororia]4.1e-3250.92Show/hide
Query:  MDTSSSITKSAAIPSEQSLSSPIVLLTNISNLITVRLDSSNYVLWNFQLSPFLRAHKLFGYLDGSISAPAKEIKVEGISQPNPAYEAWFEKDQALITLIN
        MD+S+S + S+   +E +L SPIVLL+NI NLI+++LDS+NYVLW FQL+  L+AHKLFG++DG+   P            NP+Y+ WF KDQAL+T+IN
Subjt:  MDTSSSITKSAAIPSEQSLSSPIVLLTNISNLITVRLDSSNYVLWNFQLSPFLRAHKLFGYLDGSISAPAKEIKVEGISQPNPAYEAWFEKDQALITLIN

Query:  ATLSPSALAYAVGCTTSQQIWS---------------KLKSDLQSITKLSTESISGYVQRITD
        ATLSP ALAY VG TTS+Q+W+                LKSDLQ+I+K S ESI  Y++RI +
Subjt:  ATLSPSALAYAVGCTTSQQIWS---------------KLKSDLQSITKLSTESISGYVQRITD

KAG7015254.1 hypothetical protein SDJN02_22888, partial [Cucurbita argyrosperma subsp. argyrosperma]4.1e-3250.92Show/hide
Query:  MDTSSSITKSAAIPSEQSLSSPIVLLTNISNLITVRLDSSNYVLWNFQLSPFLRAHKLFGYLDGSISAPAKEIKVEGISQPNPAYEAWFEKDQALITLIN
        MD+S+S + S+   +E +L SPIVLL+NI NLI+++LDS+NYVLW FQL+  L+AHKLFG++DG+   P            NP+Y+ WF KDQAL+T+IN
Subjt:  MDTSSSITKSAAIPSEQSLSSPIVLLTNISNLITVRLDSSNYVLWNFQLSPFLRAHKLFGYLDGSISAPAKEIKVEGISQPNPAYEAWFEKDQALITLIN

Query:  ATLSPSALAYAVGCTTSQQIWS---------------KLKSDLQSITKLSTESISGYVQRITD
        ATLSP ALAY VG TTS+Q+W+                LKSDLQ+I+K S ESI  Y++RI +
Subjt:  ATLSPSALAYAVGCTTSQQIWS---------------KLKSDLQSITKLSTESISGYVQRITD

XP_008448007.1 PREDICTED: uncharacterized protein LOC103490319 isoform X2 [Cucumis melo]4.3e-2947.31Show/hide
Query:  MDTSSSITKSAAIPSEQSLSSPIVLLTNISNLITVRLDSSNYVLWNFQLSPFLRAHKLFGYLDGSISAPAKEIKVEGIS----QPNPAYEAWFEKDQALI
        M +S+++  S+A   E+   SPI LL+NI NLI++RLDS+N+VLW FQL+  L+AHKL+G++DG+   P +       S    Q NP+YE W  KDQAL+
Subjt:  MDTSSSITKSAAIPSEQSLSSPIVLLTNISNLITVRLDSSNYVLWNFQLSPFLRAHKLFGYLDGSISAPAKEIKVEGIS----QPNPAYEAWFEKDQALI

Query:  TLINATLSPSALAYAVGCTTSQQIWS---------------KLKSDLQSITKLSTESISGYVQRITD
        T+INATLSP ALAY VG T+S+Q+W                 LKSDLQ+I K   ESI  Y++RI +
Subjt:  TLINATLSPSALAYAVGCTTSQQIWS---------------KLKSDLQSITKLSTESISGYVQRITD

XP_008448008.1 PREDICTED: uncharacterized protein LOC103490319 isoform X3 [Cucumis melo]4.3e-2947.31Show/hide
Query:  MDTSSSITKSAAIPSEQSLSSPIVLLTNISNLITVRLDSSNYVLWNFQLSPFLRAHKLFGYLDGSISAPAKEIKVEGIS----QPNPAYEAWFEKDQALI
        M +S+++  S+A   E+   SPI LL+NI NLI++RLDS+N+VLW FQL+  L+AHKL+G++DG+   P +       S    Q NP+YE W  KDQAL+
Subjt:  MDTSSSITKSAAIPSEQSLSSPIVLLTNISNLITVRLDSSNYVLWNFQLSPFLRAHKLFGYLDGSISAPAKEIKVEGIS----QPNPAYEAWFEKDQALI

Query:  TLINATLSPSALAYAVGCTTSQQIWS---------------KLKSDLQSITKLSTESISGYVQRITD
        T+INATLSP ALAY VG T+S+Q+W                 LKSDLQ+I K   ESI  Y++RI +
Subjt:  TLINATLSPSALAYAVGCTTSQQIWS---------------KLKSDLQSITKLSTESISGYVQRITD

TrEMBL top hitse value%identityAlignment
A0A1S3BI58 uncharacterized protein LOC103490319 isoform X22.1e-2947.31Show/hide
Query:  MDTSSSITKSAAIPSEQSLSSPIVLLTNISNLITVRLDSSNYVLWNFQLSPFLRAHKLFGYLDGSISAPAKEIKVEGIS----QPNPAYEAWFEKDQALI
        M +S+++  S+A   E+   SPI LL+NI NLI++RLDS+N+VLW FQL+  L+AHKL+G++DG+   P +       S    Q NP+YE W  KDQAL+
Subjt:  MDTSSSITKSAAIPSEQSLSSPIVLLTNISNLITVRLDSSNYVLWNFQLSPFLRAHKLFGYLDGSISAPAKEIKVEGIS----QPNPAYEAWFEKDQALI

Query:  TLINATLSPSALAYAVGCTTSQQIWS---------------KLKSDLQSITKLSTESISGYVQRITD
        T+INATLSP ALAY VG T+S+Q+W                 LKSDLQ+I K   ESI  Y++RI +
Subjt:  TLINATLSPSALAYAVGCTTSQQIWS---------------KLKSDLQSITKLSTESISGYVQRITD

A0A1S3BIR3 uncharacterized protein LOC103490319 isoform X32.1e-2947.31Show/hide
Query:  MDTSSSITKSAAIPSEQSLSSPIVLLTNISNLITVRLDSSNYVLWNFQLSPFLRAHKLFGYLDGSISAPAKEIKVEGIS----QPNPAYEAWFEKDQALI
        M +S+++  S+A   E+   SPI LL+NI NLI++RLDS+N+VLW FQL+  L+AHKL+G++DG+   P +       S    Q NP+YE W  KDQAL+
Subjt:  MDTSSSITKSAAIPSEQSLSSPIVLLTNISNLITVRLDSSNYVLWNFQLSPFLRAHKLFGYLDGSISAPAKEIKVEGIS----QPNPAYEAWFEKDQALI

Query:  TLINATLSPSALAYAVGCTTSQQIWS---------------KLKSDLQSITKLSTESISGYVQRITD
        T+INATLSP ALAY VG T+S+Q+W                 LKSDLQ+I K   ESI  Y++RI +
Subjt:  TLINATLSPSALAYAVGCTTSQQIWS---------------KLKSDLQSITKLSTESISGYVQRITD

A0A1S4DWT9 uncharacterized protein LOC103490319 isoform X12.1e-2947.31Show/hide
Query:  MDTSSSITKSAAIPSEQSLSSPIVLLTNISNLITVRLDSSNYVLWNFQLSPFLRAHKLFGYLDGSISAPAKEIKVEGIS----QPNPAYEAWFEKDQALI
        M +S+++  S+A   E+   SPI LL+NI NLI++RLDS+N+VLW FQL+  L+AHKL+G++DG+   P +       S    Q NP+YE W  KDQAL+
Subjt:  MDTSSSITKSAAIPSEQSLSSPIVLLTNISNLITVRLDSSNYVLWNFQLSPFLRAHKLFGYLDGSISAPAKEIKVEGIS----QPNPAYEAWFEKDQALI

Query:  TLINATLSPSALAYAVGCTTSQQIWS---------------KLKSDLQSITKLSTESISGYVQRITD
        T+INATLSP ALAY VG T+S+Q+W                 LKSDLQ+I K   ESI  Y++RI +
Subjt:  TLINATLSPSALAYAVGCTTSQQIWS---------------KLKSDLQSITKLSTESISGYVQRITD

A0A5D3CLI6 T4.52.1e-2947.31Show/hide
Query:  MDTSSSITKSAAIPSEQSLSSPIVLLTNISNLITVRLDSSNYVLWNFQLSPFLRAHKLFGYLDGSISAPAKEIKVEGIS----QPNPAYEAWFEKDQALI
        M +S+++  S+A   E+   SPI LL+NI NLI++RLDS+N+VLW FQL+  L+AHKL+G++DG+   P +       S    Q NP+YE W  KDQAL+
Subjt:  MDTSSSITKSAAIPSEQSLSSPIVLLTNISNLITVRLDSSNYVLWNFQLSPFLRAHKLFGYLDGSISAPAKEIKVEGIS----QPNPAYEAWFEKDQALI

Query:  TLINATLSPSALAYAVGCTTSQQIWS---------------KLKSDLQSITKLSTESISGYVQRITD
        T+INATLSP ALAY VG T+S+Q+W                 LKSDLQ+I K   ESI  Y++RI +
Subjt:  TLINATLSPSALAYAVGCTTSQQIWS---------------KLKSDLQSITKLSTESISGYVQRITD

A0A6J1D9L6 uncharacterized protein LOC1110188923.5e-2947.88Show/hide
Query:  SAAIPSEQSLSSPIVLLTNISNLITVRLDSSNYVLWNFQLSPFLRAHKLFGYLDGSISAPAKEI--KVEGISQP---------NPAYEAWFEKDQALITL
        S++  +++ L SPI LL+NI NL+++RLDS++++LW FQL+  L+AHKLFG++DGS+SAP++ +    E  SQP         NP +E W  KDQAL+TL
Subjt:  SAAIPSEQSLSSPIVLLTNISNLITVRLDSSNYVLWNFQLSPFLRAHKLFGYLDGSISAPAKEI--KVEGISQP---------NPAYEAWFEKDQALITL

Query:  INATLSPSALAYAVGCTTSQQIWS---------------KLKSDLQSITKLSTESISGYVQRITD
        INATLS  ALAY V   TS+Q+W                 LKSDLQSI K + ESI  YV+RI +
Subjt:  INATLSPSALAYAVGCTTSQQIWS---------------KLKSDLQSITKLSTESISGYVQRITD

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE15.1e-0928.19Show/hide
Query:  AAIPSEQSLSSPIVLLTNISNLITVRLDSSNYVLWNFQLSPFLRAHKLFGYLDGSISAPAKEIKVEGISQPNPAYEAWFEKDQALITLINATLSPSALAY
        AA   E  L++  +L  N+SN+   +L S+NY++W+ Q+      ++L G+LDGS + P   I  +   + NP Y  W  +D+ + + +   +S S    
Subjt:  AAIPSEQSLSSPIVLLTNISNLITVRLDSSNYVLWNFQLSPFLRAHKLFGYLDGSISAPAKEIKVEGISQPNPAYEAWFEKDQALITLINATLSPSALAY

Query:  AVGCTTSQQIW---------------SKLKSDLQSITKLSTESISGYVQ
            TT+ QIW               ++L++ L+  TK  T++I  Y+Q
Subjt:  AVGCTTSQQIW---------------SKLKSDLQSITKLSTESISGYVQ

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE25.1e-0931.37Show/hide
Query:  VLLTNISNLITVRLDSSNYVLWNFQLSPFLRAHKLFGYLDGSISAPAKEIKVEGISQPNPAYEAWFEKDQALITLINATLSPSALAYAVGCTTSQQIWSK
        +L  N+SN+   +L S+NY++W+ Q+      ++L G+LDGS   P   I  + + + NP Y  W  +D+ + + I   +S S        TT+ QIW  
Subjt:  VLLTNISNLITVRLDSSNYVLWNFQLSPFLRAHKLFGYLDGSISAPAKEIKVEGISQPNPAYEAWFEKDQALITLINATLSPSALAYAVGCTTSQQIWSK

Query:  LK
        L+
Subjt:  LK

Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).1.7e-0423.86Show/hide
Query:  DSSNYVLWNFQLSPFLRAHKLFGYLDGSISAPAKEIKVEGISQPNPAYEAWFEKDQALITLINATLSPSALAYAVGCTTSQQIWSKLK
        D  NYV W  +   FLR  K FG++DG++  P            +P Y+ W + +  ++  +  +++   L   +   T+ ++W  L+
Subjt:  DSSNYVLWNFQLSPFLRAHKLFGYLDGSISAPAKEIKVEGISQPNPAYEAWFEKDQALITLINATLSPSALAYAVGCTTSQQIWSKLK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATACTTCGAGTTCAATCACGAAGTCTGCTGCGATTCCTTCAGAACAGTCTTTATCCTCTCCGATTGTTCTTCTCACAAACATTAGCAATTTGATCACCGTT
CGCTTGGATTCTTCAAACTATGTACTCTGGAATTTTCAGCTTTCTCCTTTTCTTCGTGCACACAAGCTTTTTGGTTATTTGGATGGTTCGATCTCGGCTCCGGCC
AAAGAAATCAAGGTTGAGGGAATCTCACAACCTAATCCTGCGTATGAAGCGTGGTTTGAGAAAGATCAAGCTTTGATTACGCTGATTAATGCTACTCTCTCGCCG
TCGGCTTTAGCGTATGCAGTTGGTTGCACTACCTCTCAACAGATCTGGTCGAAGCTCAAGTCTGATTTACAGAGCATCACCAAACTATCTACTGAGTCTATCAGT
GGCTATGTTCAACGAATTACAGATCAGCCAGTATGA
mRNA sequenceShow/hide mRNA sequence
ATGGATACTTCGAGTTCAATCACGAAGTCTGCTGCGATTCCTTCAGAACAGTCTTTATCCTCTCCGATTGTTCTTCTCACAAACATTAGCAATTTGATCACCGTT
CGCTTGGATTCTTCAAACTATGTACTCTGGAATTTTCAGCTTTCTCCTTTTCTTCGTGCACACAAGCTTTTTGGTTATTTGGATGGTTCGATCTCGGCTCCGGCC
AAAGAAATCAAGGTTGAGGGAATCTCACAACCTAATCCTGCGTATGAAGCGTGGTTTGAGAAAGATCAAGCTTTGATTACGCTGATTAATGCTACTCTCTCGCCG
TCGGCTTTAGCGTATGCAGTTGGTTGCACTACCTCTCAACAGATCTGGTCGAAGCTCAAGTCTGATTTACAGAGCATCACCAAACTATCTACTGAGTCTATCAGT
GGCTATGTTCAACGAATTACAGATCAGCCAGTATGA
Protein sequenceShow/hide protein sequence
MDTSSSITKSAAIPSEQSLSSPIVLLTNISNLITVRLDSSNYVLWNFQLSPFLRAHKLFGYLDGSISAPAKEIKVEGISQPNPAYEAWFEKDQALITLINATLSP
SALAYAVGCTTSQQIWSKLKSDLQSITKLSTESISGYVQRITDQPV