; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0007665 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0007665
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotran_gag_3 domain-containing protein
Genome locationchr9:2638927..2639274
RNA-Seq ExpressionLag0007665
SyntenyLag0007665
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR029472 - Retrotransposon Copia-like, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK17989.1 uncharacterized protein E5676_scaffold306G002980 [Cucumis melo var. makuwa]1.5e-2658.26Show/hide
Query:  ASADLSSAKDLNSPMFLLTNICNLISIRLDSSNYVLWKFQMMSILKAHKLFGFLDGTSAAPPQFLLSST---SDGSTASSSDASTRVNNPSYEDWMAKDH
        AS+  S++K+L+SP+FLLTNICNLISIRLDS+NY LWKFQ   +LKAHKL+GF+D +   PP+ + + T   S  +T ++S ++T ++NP YEDW AKD 
Subjt:  ASADLSSAKDLNSPMFLLTNICNLISIRLDSSNYVLWKFQMMSILKAHKLFGFLDGTSAAPPQFLLSST---SDGSTASSSDASTRVNNPSYEDWMAKDH

Query:  ALMTLINATLSSEAL
        A M LINATLS EAL
Subjt:  ALMTLINATLSSEAL

XP_008448007.1 PREDICTED: uncharacterized protein LOC103490319 isoform X2 [Cucumis melo]3.7e-2560.75Show/hide
Query:  SSAKDLNSPMFLLTNICNLISIRLDSSNYVLWKFQMMSILKAHKLFGFLDGTSAAPPQFLLSSTSDGSTASSSDASTRVNNPSYEDWMAKDHALMTLINA
        S+ KD  SP+FLL+NICNLIS+RLDS+N+VLWKFQ+ +ILKAHKL+GF+DGT+  PP+         + +SS+      +NPSYEDW+AKD ALMT+INA
Subjt:  SSAKDLNSPMFLLTNICNLISIRLDSSNYVLWKFQMMSILKAHKLFGFLDGTSAAPPQFLLSSTSDGSTASSSDASTRVNNPSYEDWMAKDHALMTLINA

Query:  TLSSEAL
        TLS EAL
Subjt:  TLSSEAL

XP_022150845.1 uncharacterized protein LOC111018892 [Momordica charantia]1.2e-2861.95Show/hide
Query:  MASADLSSAKDLNSPMFLLTNICNLISIRLDSSNYVLWKFQMMSILKAHKLFGFLDGTSAAPPQFLLSSTSDGSTASSSDASTRVNNPSYEDWMAKDHAL
        M S+  ++ KDL+SP+FLL+NICNL+SIRLDS++++LWKFQ+ +ILKAHKLFGF+DG+ +AP QFL SS S+  +  ++  S  V NP +EDW+AKD AL
Subjt:  MASADLSSAKDLNSPMFLLTNICNLISIRLDSSNYVLWKFQMMSILKAHKLFGFLDGTSAAPPQFLLSSTSDGSTASSSDASTRVNNPSYEDWMAKDHAL

Query:  MTLINATLSSEAL
        MTLINATLS+EAL
Subjt:  MTLINATLSSEAL

XP_022152753.1 uncharacterized protein LOC111020396 isoform X3 [Momordica charantia]3.7e-2560.95Show/hide
Query:  ADLSSAKDLNSPMFLLTNICNLISIRLDSSNYVLWKFQMMSILKAHKLFGFLDGTSAAPPQFLLSSTSDGSTASSSDASTRVNNPSYEDWMAKDHALMTL
        A L    DLNSP+FLL+NICNLISIRLDS+N++LWKFQ+ +ILKAHKLF F+DG+S  P +F+ +ST + S++S   AS  + N +Y DW+A+D ALMTL
Subjt:  ADLSSAKDLNSPMFLLTNICNLISIRLDSSNYVLWKFQMMSILKAHKLFGFLDGTSAAPPQFLLSSTSDGSTASSSDASTRVNNPSYEDWMAKDHALMTL

Query:  INATL
        INATL
Subjt:  INATL

XP_022152754.1 uncharacterized protein LOC111020396 isoform X4 [Momordica charantia]3.7e-2560.95Show/hide
Query:  ADLSSAKDLNSPMFLLTNICNLISIRLDSSNYVLWKFQMMSILKAHKLFGFLDGTSAAPPQFLLSSTSDGSTASSSDASTRVNNPSYEDWMAKDHALMTL
        A L    DLNSP+FLL+NICNLISIRLDS+N++LWKFQ+ +ILKAHKLF F+DG+S  P +F+ +ST + S++S   AS  + N +Y DW+A+D ALMTL
Subjt:  ADLSSAKDLNSPMFLLTNICNLISIRLDSSNYVLWKFQMMSILKAHKLFGFLDGTSAAPPQFLLSSTSDGSTASSSDASTRVNNPSYEDWMAKDHALMTL

Query:  INATL
        INATL
Subjt:  INATL

TrEMBL top hitse value%identityAlignment
A0A5D3CLI6 T4.51.8e-2560.75Show/hide
Query:  SSAKDLNSPMFLLTNICNLISIRLDSSNYVLWKFQMMSILKAHKLFGFLDGTSAAPPQFLLSSTSDGSTASSSDASTRVNNPSYEDWMAKDHALMTLINA
        S+ KD  SP+FLL+NICNLIS+RLDS+N+VLWKFQ+ +ILKAHKL+GF+DGT+  PP+         + +SS+      +NPSYEDW+AKD ALMT+INA
Subjt:  SSAKDLNSPMFLLTNICNLISIRLDSSNYVLWKFQMMSILKAHKLFGFLDGTSAAPPQFLLSSTSDGSTASSSDASTRVNNPSYEDWMAKDHALMTLINA

Query:  TLSSEAL
        TLS EAL
Subjt:  TLSSEAL

A0A5D3D3T6 Retrotran_gag_3 domain-containing protein7.3e-2758.26Show/hide
Query:  ASADLSSAKDLNSPMFLLTNICNLISIRLDSSNYVLWKFQMMSILKAHKLFGFLDGTSAAPPQFLLSST---SDGSTASSSDASTRVNNPSYEDWMAKDH
        AS+  S++K+L+SP+FLLTNICNLISIRLDS+NY LWKFQ   +LKAHKL+GF+D +   PP+ + + T   S  +T ++S ++T ++NP YEDW AKD 
Subjt:  ASADLSSAKDLNSPMFLLTNICNLISIRLDSSNYVLWKFQMMSILKAHKLFGFLDGTSAAPPQFLLSST---SDGSTASSSDASTRVNNPSYEDWMAKDH

Query:  ALMTLINATLSSEAL
        A M LINATLS EAL
Subjt:  ALMTLINATLSSEAL

A0A6J1D9L6 uncharacterized protein LOC1110188926.0e-2961.95Show/hide
Query:  MASADLSSAKDLNSPMFLLTNICNLISIRLDSSNYVLWKFQMMSILKAHKLFGFLDGTSAAPPQFLLSSTSDGSTASSSDASTRVNNPSYEDWMAKDHAL
        M S+  ++ KDL+SP+FLL+NICNL+SIRLDS++++LWKFQ+ +ILKAHKLFGF+DG+ +AP QFL SS S+  +  ++  S  V NP +EDW+AKD AL
Subjt:  MASADLSSAKDLNSPMFLLTNICNLISIRLDSSNYVLWKFQMMSILKAHKLFGFLDGTSAAPPQFLLSSTSDGSTASSSDASTRVNNPSYEDWMAKDHAL

Query:  MTLINATLSSEAL
        MTLINATLS+EAL
Subjt:  MTLINATLSSEAL

A0A6J1DFQ7 uncharacterized protein LOC111020396 isoform X31.8e-2560.95Show/hide
Query:  ADLSSAKDLNSPMFLLTNICNLISIRLDSSNYVLWKFQMMSILKAHKLFGFLDGTSAAPPQFLLSSTSDGSTASSSDASTRVNNPSYEDWMAKDHALMTL
        A L    DLNSP+FLL+NICNLISIRLDS+N++LWKFQ+ +ILKAHKLF F+DG+S  P +F+ +ST + S++S   AS  + N +Y DW+A+D ALMTL
Subjt:  ADLSSAKDLNSPMFLLTNICNLISIRLDSSNYVLWKFQMMSILKAHKLFGFLDGTSAAPPQFLLSSTSDGSTASSSDASTRVNNPSYEDWMAKDHALMTL

Query:  INATL
        INATL
Subjt:  INATL

A0A6J1DIP4 uncharacterized protein LOC111020396 isoform X11.8e-2560.95Show/hide
Query:  ADLSSAKDLNSPMFLLTNICNLISIRLDSSNYVLWKFQMMSILKAHKLFGFLDGTSAAPPQFLLSSTSDGSTASSSDASTRVNNPSYEDWMAKDHALMTL
        A L    DLNSP+FLL+NICNLISIRLDS+N++LWKFQ+ +ILKAHKLF F+DG+S  P +F+ +ST + S++S   AS  + N +Y DW+A+D ALMTL
Subjt:  ADLSSAKDLNSPMFLLTNICNLISIRLDSSNYVLWKFQMMSILKAHKLFGFLDGTSAAPPQFLLSSTSDGSTASSSDASTRVNNPSYEDWMAKDHALMTL

Query:  INATL
        INATL
Subjt:  INATL

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.7e-0429.59Show/hide
Query:  LNSPMFLLTNICNLISIRLDSSNYVLWKFQMMSILKAHKLFGFLDGTSAAPPQFLLSSTSDGSTASSSDASTRVNNPSYEDWMAKDHALMTLINATLS
        LN+   L  N+ N+   +L S+NY++W  Q+ ++   ++L GFLDG++  PP  +            +DA+ RV NP Y  W  +D  + + +   +S
Subjt:  LNSPMFLLTNICNLISIRLDSSNYVLWKFQMMSILKAHKLFGFLDGTSAAPPQFLLSSTSDGSTASSSDASTRVNNPSYEDWMAKDHALMTLINATLS

Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).8.6e-0435.38Show/hide
Query:  LSSAKDLNSPMFLLTNI-----CNLISIRLDSSNYVLWKFQMMSILKAHKLFGFLDGTSAAPPQF
        +S   D +SP +L  +I      ++  +  D  NYV WK +  S L+  K FGF+DGT   P  F
Subjt:  LSSAKDLNSPMFLLTNI-----CNLISIRLDSSNYVLWKFQMMSILKAHKLFGFLDGTSAAPPQF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCTGCTGATCTTTCTTCTGCCAAGGATCTGAATTCACCGATGTTCTTGCTAACAAATATCTGCAATTTGATATCAATTCGACTTGATTCGTCAAATTATGTCCT
ATGGAAGTTTCAGATGATGTCCATTCTCAAGGCGCACAAACTCTTCGGTTTTCTTGATGGAACCTCTGCCGCACCGCCACAATTTTTACTCTCAAGCACCTCTGATGGAT
CGACTGCTTCTTCGTCGGATGCCTCAACTCGGGTGAACAATCCTTCATATGAAGATTGGATGGCAAAGGATCATGCGTTGATGACACTTATCAATGCAACTCTCTCCTCT
GAAGCTCTGATACCATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCTGCTGATCTTTCTTCTGCCAAGGATCTGAATTCACCGATGTTCTTGCTAACAAATATCTGCAATTTGATATCAATTCGACTTGATTCGTCAAATTATGTCCT
ATGGAAGTTTCAGATGATGTCCATTCTCAAGGCGCACAAACTCTTCGGTTTTCTTGATGGAACCTCTGCCGCACCGCCACAATTTTTACTCTCAAGCACCTCTGATGGAT
CGACTGCTTCTTCGTCGGATGCCTCAACTCGGGTGAACAATCCTTCATATGAAGATTGGATGGCAAAGGATCATGCGTTGATGACACTTATCAATGCAACTCTCTCCTCT
GAAGCTCTGATACCATGA
Protein sequenceShow/hide protein sequence
MASADLSSAKDLNSPMFLLTNICNLISIRLDSSNYVLWKFQMMSILKAHKLFGFLDGTSAAPPQFLLSSTSDGSTASSSDASTRVNNPSYEDWMAKDHALMTLINATLSS
EALIP