; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0018044 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0018044
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotran_gag_3 domain-containing protein
Genome locationchr5:14548545..14549271
RNA-Seq ExpressionLag0018044
SyntenyLag0018044
Gene Ontology termsNA
InterPro domainsIPR029472 - Retrotransposon Copia-like, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8645659.1 hypothetical protein Csa_020439 [Cucumis sativus]1.6e-4347.56Show/hide
Query:  ASTTTPSL---KDLHSPIFLLANICNLISIRLDSSNYVLWKFQFSSMLREHKLFGFVDGSTPAPSETISSSSIAESSSGGSVSTVTVSPNPLYEDWLAKD
        +STT PS    KD  SPIFLL+NICNLIS+RLDS+N+VLWKFQ +++L+ HKLFGFVDG+ P P            +S  + STV    NPLYEDW+AKD
Subjt:  ASTTTPSL---KDLHSPIFLLANICNLISIRLDSSNYVLWKFQFSSMLREHKLFGFVDGSTPAPSETISSSSIAESSSGGSVSTVTVSPNPLYEDWLAKD

Query:  QALMTLIKATLSPEALTYIV-------------------------------------------------------------VVQDEDLLIYALNGL--QY
        QALMT+I ATLSPEAL Y+V                                                              + +EDLLIYALNGL  +Y
Subjt:  QALMTLIKATLSPEALTYIV-------------------------------------------------------------VVQDEDLLIYALNGL--QY

Query:  NAFRTSMRTRSQSVSFAELHVLLKSEESALEKQSKCEVLVVQPTTM
        N FRTSMRTRSQ V+F ELHVLL++EESAL KQSKC+    QPT +
Subjt:  NAFRTSMRTRSQSVSFAELHVLLKSEESALEKQSKCEVLVVQPTTM

TYK17989.1 uncharacterized protein E5676_scaffold306G002980 [Cucumis melo var. makuwa]6.0e-4649.39Show/hide
Query:  ASTTTPSLKDLHSPIFLLANICNLISIRLDSSNYVLWKFQFSSMLREHKLFGFVDGSTPAPSETISSSSIAESSSGGSV--STVTVSPNPLYEDWLAKDQ
        AS+++ + K+L SP+FLL NICNLISIRLDS+NY LWKFQF  ML+ HKL+GF+D S P P +TIS+ + A SS+      S+ T   NP YEDW AKDQ
Subjt:  ASTTTPSLKDLHSPIFLLANICNLISIRLDSSNYVLWKFQFSSMLREHKLFGFVDGSTPAPSETISSSSIAESSSGGSV--STVTVSPNPLYEDWLAKDQ

Query:  ALMTLIKATLSPEALTYIV-------------------------------------------------------------VVQDEDLLIYALNGL--QYN
        A M LI ATLS EALTY+V                                                             +V+DEDL+IYALNGL  +YN
Subjt:  ALMTLIKATLSPEALTYIV-------------------------------------------------------------VVQDEDLLIYALNGL--QYN

Query:  AFRTSMRTRSQSVSFAELHVLLKSEESALEKQSKCEVLVVQPTTM
        AFRTSM+TRSQ VSF+ELH+LLKSEESALEKQ+K E + VQPT M
Subjt:  AFRTSMRTRSQSVSFAELHVLLKSEESALEKQSKCEVLVVQPTTM

XP_008448007.1 PREDICTED: uncharacterized protein LOC103490319 isoform X2 [Cucumis melo]2.4e-4247.15Show/hide
Query:  ASTTTPSL---KDLHSPIFLLANICNLISIRLDSSNYVLWKFQFSSMLREHKLFGFVDGSTPAPSETISSSSIAESSSGGSVSTVTVSPNPLYEDWLAKD
        +STT PS    KD  SPIFLL+NICNLIS+RLDS+N+VLWKFQ +++L+ HKL+GF+DG+ P P  T +SS         S STV    NP YEDW+AKD
Subjt:  ASTTTPSL---KDLHSPIFLLANICNLISIRLDSSNYVLWKFQFSSMLREHKLFGFVDGSTPAPSETISSSSIAESSSGGSVSTVTVSPNPLYEDWLAKD

Query:  QALMTLIKATLSPEALTYIV-------------------------------------------------------------VVQDEDLLIYALNGL--QY
        QALMT+I ATLSPEAL Y+V                                                              + +EDLLIYALNGL  +Y
Subjt:  QALMTLIKATLSPEALTYIV-------------------------------------------------------------VVQDEDLLIYALNGL--QY

Query:  NAFRTSMRTRSQSVSFAELHVLLKSEESALEKQSKCEVLVVQPTTM
        N FRTSMRTRSQ V+F ELHVLL++EESAL KQSK +    QPT +
Subjt:  NAFRTSMRTRSQSVSFAELHVLLKSEESALEKQSKCEVLVVQPTTM

XP_011658579.1 uncharacterized protein LOC105436058 [Cucumis sativus]1.6e-4347.56Show/hide
Query:  ASTTTPSL---KDLHSPIFLLANICNLISIRLDSSNYVLWKFQFSSMLREHKLFGFVDGSTPAPSETISSSSIAESSSGGSVSTVTVSPNPLYEDWLAKD
        +STT PS    KD  SPIFLL+NICNLIS+RLDS+N+VLWKFQ +++L+ HKLFGFVDG+ P P            +S  + STV    NPLYEDW+AKD
Subjt:  ASTTTPSL---KDLHSPIFLLANICNLISIRLDSSNYVLWKFQFSSMLREHKLFGFVDGSTPAPSETISSSSIAESSSGGSVSTVTVSPNPLYEDWLAKD

Query:  QALMTLIKATLSPEALTYIV-------------------------------------------------------------VVQDEDLLIYALNGL--QY
        QALMT+I ATLSPEAL Y+V                                                              + +EDLLIYALNGL  +Y
Subjt:  QALMTLIKATLSPEALTYIV-------------------------------------------------------------VVQDEDLLIYALNGL--QY

Query:  NAFRTSMRTRSQSVSFAELHVLLKSEESALEKQSKCEVLVVQPTTM
        N FRTSMRTRSQ V+F ELHVLL++EESAL KQSKC+    QPT +
Subjt:  NAFRTSMRTRSQSVSFAELHVLLKSEESALEKQSKCEVLVVQPTTM

XP_022150845.1 uncharacterized protein LOC111018892 [Momordica charantia]7.4e-4447.13Show/hide
Query:  MASTTTPSLKDLHSPIFLLANICNLISIRLDSSNYVLWKFQFSSMLREHKLFGFVDGSTPAPSETISSSSIAESSSGGSVSTVTVSPNPLYEDWLAKDQA
        M S++T + KDLHSPIFLL+NICNL+SIRLDS++++LWKFQ +++L+ HKLFGF+DGS  APS+ ++SSS  ES    + S   +  NP +EDW+AKDQA
Subjt:  MASTTTPSLKDLHSPIFLLANICNLISIRLDSSNYVLWKFQFSSMLREHKLFGFVDGSTPAPSETISSSSIAESSSGGSVSTVTVSPNPLYEDWLAKDQA

Query:  LMTLIKATLSPEALTYIV-------------------------------------------------------------VVQDEDLLIYALNGL--QYNA
        LMTLI ATLS EAL Y+V                                                              + DE LLIYALNGL  +YN 
Subjt:  LMTLIKATLSPEALTYIV-------------------------------------------------------------VVQDEDLLIYALNGL--QYNA

Query:  FRTSMRTRSQSVSFAELHVLLKSEESALEKQSKCEVLVVQPTTM
          TSMRTR+QSVSF ELHV +KSEESA+EKQ K E LV QP  +
Subjt:  FRTSMRTRSQSVSFAELHVLLKSEESALEKQSKCEVLVVQPTTM

TrEMBL top hitse value%identityAlignment
A0A1S3BI58 uncharacterized protein LOC103490319 isoform X21.1e-4247.15Show/hide
Query:  ASTTTPSL---KDLHSPIFLLANICNLISIRLDSSNYVLWKFQFSSMLREHKLFGFVDGSTPAPSETISSSSIAESSSGGSVSTVTVSPNPLYEDWLAKD
        +STT PS    KD  SPIFLL+NICNLIS+RLDS+N+VLWKFQ +++L+ HKL+GF+DG+ P P  T +SS         S STV    NP YEDW+AKD
Subjt:  ASTTTPSL---KDLHSPIFLLANICNLISIRLDSSNYVLWKFQFSSMLREHKLFGFVDGSTPAPSETISSSSIAESSSGGSVSTVTVSPNPLYEDWLAKD

Query:  QALMTLIKATLSPEALTYIV-------------------------------------------------------------VVQDEDLLIYALNGL--QY
        QALMT+I ATLSPEAL Y+V                                                              + +EDLLIYALNGL  +Y
Subjt:  QALMTLIKATLSPEALTYIV-------------------------------------------------------------VVQDEDLLIYALNGL--QY

Query:  NAFRTSMRTRSQSVSFAELHVLLKSEESALEKQSKCEVLVVQPTTM
        N FRTSMRTRSQ V+F ELHVLL++EESAL KQSK +    QPT +
Subjt:  NAFRTSMRTRSQSVSFAELHVLLKSEESALEKQSKCEVLVVQPTTM

A0A1S4DWT9 uncharacterized protein LOC103490319 isoform X11.1e-4247.15Show/hide
Query:  ASTTTPSL---KDLHSPIFLLANICNLISIRLDSSNYVLWKFQFSSMLREHKLFGFVDGSTPAPSETISSSSIAESSSGGSVSTVTVSPNPLYEDWLAKD
        +STT PS    KD  SPIFLL+NICNLIS+RLDS+N+VLWKFQ +++L+ HKL+GF+DG+ P P  T +SS         S STV    NP YEDW+AKD
Subjt:  ASTTTPSL---KDLHSPIFLLANICNLISIRLDSSNYVLWKFQFSSMLREHKLFGFVDGSTPAPSETISSSSIAESSSGGSVSTVTVSPNPLYEDWLAKD

Query:  QALMTLIKATLSPEALTYIV-------------------------------------------------------------VVQDEDLLIYALNGL--QY
        QALMT+I ATLSPEAL Y+V                                                              + +EDLLIYALNGL  +Y
Subjt:  QALMTLIKATLSPEALTYIV-------------------------------------------------------------VVQDEDLLIYALNGL--QY

Query:  NAFRTSMRTRSQSVSFAELHVLLKSEESALEKQSKCEVLVVQPTTM
        N FRTSMRTRSQ V+F ELHVLL++EESAL KQSK +    QPT +
Subjt:  NAFRTSMRTRSQSVSFAELHVLLKSEESALEKQSKCEVLVVQPTTM

A0A5D3CLI6 T4.51.1e-4247.15Show/hide
Query:  ASTTTPSL---KDLHSPIFLLANICNLISIRLDSSNYVLWKFQFSSMLREHKLFGFVDGSTPAPSETISSSSIAESSSGGSVSTVTVSPNPLYEDWLAKD
        +STT PS    KD  SPIFLL+NICNLIS+RLDS+N+VLWKFQ +++L+ HKL+GF+DG+ P P  T +SS         S STV    NP YEDW+AKD
Subjt:  ASTTTPSL---KDLHSPIFLLANICNLISIRLDSSNYVLWKFQFSSMLREHKLFGFVDGSTPAPSETISSSSIAESSSGGSVSTVTVSPNPLYEDWLAKD

Query:  QALMTLIKATLSPEALTYIV-------------------------------------------------------------VVQDEDLLIYALNGL--QY
        QALMT+I ATLSPEAL Y+V                                                              + +EDLLIYALNGL  +Y
Subjt:  QALMTLIKATLSPEALTYIV-------------------------------------------------------------VVQDEDLLIYALNGL--QY

Query:  NAFRTSMRTRSQSVSFAELHVLLKSEESALEKQSKCEVLVVQPTTM
        N FRTSMRTRSQ V+F ELHVLL++EESAL KQSK +    QPT +
Subjt:  NAFRTSMRTRSQSVSFAELHVLLKSEESALEKQSKCEVLVVQPTTM

A0A5D3D3T6 Retrotran_gag_3 domain-containing protein2.9e-4649.39Show/hide
Query:  ASTTTPSLKDLHSPIFLLANICNLISIRLDSSNYVLWKFQFSSMLREHKLFGFVDGSTPAPSETISSSSIAESSSGGSV--STVTVSPNPLYEDWLAKDQ
        AS+++ + K+L SP+FLL NICNLISIRLDS+NY LWKFQF  ML+ HKL+GF+D S P P +TIS+ + A SS+      S+ T   NP YEDW AKDQ
Subjt:  ASTTTPSLKDLHSPIFLLANICNLISIRLDSSNYVLWKFQFSSMLREHKLFGFVDGSTPAPSETISSSSIAESSSGGSV--STVTVSPNPLYEDWLAKDQ

Query:  ALMTLIKATLSPEALTYIV-------------------------------------------------------------VVQDEDLLIYALNGL--QYN
        A M LI ATLS EALTY+V                                                             +V+DEDL+IYALNGL  +YN
Subjt:  ALMTLIKATLSPEALTYIV-------------------------------------------------------------VVQDEDLLIYALNGL--QYN

Query:  AFRTSMRTRSQSVSFAELHVLLKSEESALEKQSKCEVLVVQPTTM
        AFRTSM+TRSQ VSF+ELH+LLKSEESALEKQ+K E + VQPT M
Subjt:  AFRTSMRTRSQSVSFAELHVLLKSEESALEKQSKCEVLVVQPTTM

A0A6J1D9L6 uncharacterized protein LOC1110188923.6e-4447.13Show/hide
Query:  MASTTTPSLKDLHSPIFLLANICNLISIRLDSSNYVLWKFQFSSMLREHKLFGFVDGSTPAPSETISSSSIAESSSGGSVSTVTVSPNPLYEDWLAKDQA
        M S++T + KDLHSPIFLL+NICNL+SIRLDS++++LWKFQ +++L+ HKLFGF+DGS  APS+ ++SSS  ES    + S   +  NP +EDW+AKDQA
Subjt:  MASTTTPSLKDLHSPIFLLANICNLISIRLDSSNYVLWKFQFSSMLREHKLFGFVDGSTPAPSETISSSSIAESSSGGSVSTVTVSPNPLYEDWLAKDQA

Query:  LMTLIKATLSPEALTYIV-------------------------------------------------------------VVQDEDLLIYALNGL--QYNA
        LMTLI ATLS EAL Y+V                                                              + DE LLIYALNGL  +YN 
Subjt:  LMTLIKATLSPEALTYIV-------------------------------------------------------------VVQDEDLLIYALNGL--QYNA

Query:  FRTSMRTRSQSVSFAELHVLLKSEESALEKQSKCEVLVVQPTTM
          TSMRTR+QSVSF ELHV +KSEESA+EKQ K E LV QP  +
Subjt:  FRTSMRTRSQSVSFAELHVLLKSEESALEKQSKCEVLVVQPTTM

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).1.6e-0440.38Show/hide
Query:  DLHSPIFLLANICNLISIRLDSSNYVLWKFQFSSMLREHKLFGFVDGSTPAP
        D+H P     +  ++  +  D  NYV WK +F S LR  K FGF+DG+ P P
Subjt:  DLHSPIFLLANICNLISIRLDSSNYVLWKFQFSSMLREHKLFGFVDGSTPAP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCTACTACTACTCCTTCTCTAAAGGATCTTCATTCTCCAATTTTCCTATTAGCCAATATCTGCAATCTTATTTCGATTCGCTTGGATTCATCGAATTATGTTTT
GTGGAAGTTTCAGTTTTCTTCCATGTTAAGGGAGCATAAACTTTTTGGATTTGTTGATGGATCTACTCCTGCTCCATCCGAGACAATTTCGTCTTCTTCGATTGCTGAAT
CGTCTTCTGGAGGTTCTGTATCTACAGTCACTGTGAGTCCGAATCCTCTGTATGAAGATTGGCTTGCGAAGGATCAAGCCTTAATGACTTTAATTAAGGCCACGTTATCT
CCTGAAGCATTGACCTACATTGTTGTGGTTCAAGATGAAGATCTGTTGATCTATGCCTTAAACGGCTTGCAATACAATGCCTTTCGCACTTCTATGCGTACTCGGTCTCA
GTCGGTGAGTTTTGCTGAACTTCATGTTTTACTAAAATCAGAGGAGTCTGCCCTAGAGAAACAATCTAAGTGTGAGGTCTTAGTTGTTCAACCTACGACGATGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCTACTACTACTCCTTCTCTAAAGGATCTTCATTCTCCAATTTTCCTATTAGCCAATATCTGCAATCTTATTTCGATTCGCTTGGATTCATCGAATTATGTTTT
GTGGAAGTTTCAGTTTTCTTCCATGTTAAGGGAGCATAAACTTTTTGGATTTGTTGATGGATCTACTCCTGCTCCATCCGAGACAATTTCGTCTTCTTCGATTGCTGAAT
CGTCTTCTGGAGGTTCTGTATCTACAGTCACTGTGAGTCCGAATCCTCTGTATGAAGATTGGCTTGCGAAGGATCAAGCCTTAATGACTTTAATTAAGGCCACGTTATCT
CCTGAAGCATTGACCTACATTGTTGTGGTTCAAGATGAAGATCTGTTGATCTATGCCTTAAACGGCTTGCAATACAATGCCTTTCGCACTTCTATGCGTACTCGGTCTCA
GTCGGTGAGTTTTGCTGAACTTCATGTTTTACTAAAATCAGAGGAGTCTGCCCTAGAGAAACAATCTAAGTGTGAGGTCTTAGTTGTTCAACCTACGACGATGTAA
Protein sequenceShow/hide protein sequence
MASTTTPSLKDLHSPIFLLANICNLISIRLDSSNYVLWKFQFSSMLREHKLFGFVDGSTPAPSETISSSSIAESSSGGSVSTVTVSPNPLYEDWLAKDQALMTLIKATLS
PEALTYIVVVQDEDLLIYALNGLQYNAFRTSMRTRSQSVSFAELHVLLKSEESALEKQSKCEVLVVQPTTM